Amino acid dipeptide frequency for Streptococcus satellite phage Javan545

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
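As an illustration of the calculation described above, here is a minimal Python sketch. It is not the code used to build this page: the function names are hypothetical, sequences are assumed to be given in one-letter code, and dipeptides are assumed to be counted within each protein only (never across protein boundaries).

```python
from collections import Counter

# Uniform expectation from the text: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping dipeptides are counted inside each protein,
    and the counts are pooled over all proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # would be marked in red
    if freq_per_mille < expected:
        return "under"   # would be marked in blue
    return "as expected"


if __name__ == "__main__":
    toy = ["MKKLEE", "MAKE"]  # made-up sequences, not from this proteome
    for pair, freq in sorted(dipeptide_per_mille(toy).items()):
        print(pair, round(freq, 3), classify(freq))
```

With the values in the table, `classify(2.87)` (AlaAla) gives "over" and `classify(0.359)` (AlaTrp) gives "under".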

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, AlaAla at 2.87‰ corresponds to 0.00287, i.e. 0.287%).

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.87 ± 0.735 | 0.0 ± 0.0 | 5.741 ± 1.892 | 7.535 ± 1.77 | 2.512 ± 0.793 | 3.229 ± 0.836 | 1.076 ± 0.513 | 5.023 ± 1.195 | 6.459 ± 1.389 | 3.947 ± 0.705 | 2.87 ± 0.97 | 3.229 ± 0.957 | 2.87 ± 0.94 | 3.588 ± 0.844 | 2.153 ± 0.832 | 3.588 ± 0.776 | 2.153 ± 0.866 | 2.512 ± 1.02 | 0.359 ± 0.348 | 1.435 ± 0.777 | 0.0 ± 0.0
Cys | 0.359 ± 0.301 | 0.0 ± 0.0 | 0.718 ± 0.555 | 1.076 ± 0.58 | 0.0 ± 0.0 | 0.359 ± 0.348 | 0.359 ± 0.332 | 0.718 ± 0.448 | 1.076 ± 0.589 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.359 ± 0.348 | 0.359 ± 0.359 | 0.359 ± 0.332 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.359 ± 0.306 | 0.0 ± 0.0
Asp | 1.435 ± 0.582 | 1.076 ± 0.524 | 3.588 ± 1.311 | 4.306 ± 1.236 | 3.947 ± 1.251 | 1.794 ± 0.518 | 0.359 ± 0.478 | 4.665 ± 1.099 | 9.329 ± 1.673 | 3.229 ± 0.837 | 1.794 ± 0.937 | 2.512 ± 1.054 | 1.435 ± 0.553 | 0.718 ± 0.5 | 3.947 ± 1.376 | 2.87 ± 0.969 | 1.794 ± 0.719 | 2.512 ± 0.809 | 0.359 ± 0.45 | 4.306 ± 1.177 | 0.0 ± 0.0
Glu | 2.87 ± 1.154 | 1.435 ± 0.892 | 3.229 ± 0.999 | 9.688 ± 2.654 | 3.229 ± 1.242 | 2.153 ± 0.747 | 1.076 ± 0.526 | 11.482 ± 1.848 | 10.405 ± 2.236 | 11.841 ± 1.338 | 2.153 ± 0.807 | 5.023 ± 1.806 | 1.435 ± 0.555 | 4.306 ± 1.389 | 6.817 ± 1.904 | 6.459 ± 1.557 | 5.382 ± 1.045 | 3.229 ± 1.309 | 0.359 ± 0.332 | 2.153 ± 0.697 | 0.0 ± 0.0
Phe | 1.435 ± 0.734 | 0.0 ± 0.0 | 2.87 ± 1.104 | 5.741 ± 1.608 | 1.794 ± 0.946 | 2.153 ± 0.922 | 0.359 ± 0.341 | 2.512 ± 0.969 | 5.023 ± 1.285 | 2.87 ± 0.844 | 1.076 ± 0.662 | 2.512 ± 0.962 | 0.718 ± 0.562 | 0.718 ± 0.516 | 1.435 ± 0.542 | 3.229 ± 1.164 | 2.512 ± 1.033 | 1.794 ± 0.772 | 0.718 ± 0.487 | 1.435 ± 0.754 | 0.0 ± 0.0
Gly | 2.512 ± 0.944 | 0.359 ± 0.332 | 0.359 ± 0.281 | 2.153 ± 0.822 | 1.076 ± 0.678 | 1.435 ± 0.873 | 1.076 ± 0.501 | 4.665 ± 1.131 | 3.947 ± 1.148 | 4.665 ± 1.594 | 2.153 ± 0.736 | 2.153 ± 0.904 | 0.359 ± 0.306 | 0.359 ± 0.281 | 3.947 ± 0.884 | 1.076 ± 0.996 | 1.794 ± 0.913 | 4.306 ± 1.005 | 0.359 ± 0.306 | 4.665 ± 1.37 | 0.0 ± 0.0
His | 3.588 ± 0.897 | 0.0 ± 0.0 | 0.718 ± 0.434 | 0.0 ± 0.0 | 0.359 ± 0.281 | 0.718 ± 0.459 | 0.718 ± 0.352 | 0.718 ± 0.434 | 1.076 ± 0.702 | 1.794 ± 0.776 | 0.0 ± 0.0 | 0.718 ± 0.641 | 0.718 ± 0.434 | 0.359 ± 0.332 | 0.718 ± 0.471 | 2.87 ± 0.76 | 1.794 ± 0.492 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.359 ± 0.332 | 0.0 ± 0.0
Ile | 7.176 ± 1.955 | 0.359 ± 0.359 | 5.382 ± 1.337 | 6.817 ± 1.686 | 3.947 ± 0.952 | 3.947 ± 1.068 | 1.794 ± 0.647 | 4.665 ± 1.321 | 5.741 ± 1.525 | 6.817 ± 1.505 | 0.718 ± 0.477 | 6.459 ± 1.521 | 2.512 ± 1.264 | 2.87 ± 0.813 | 2.87 ± 0.926 | 4.665 ± 1.2 | 5.023 ± 1.064 | 1.794 ± 0.609 | 0.0 ± 0.0 | 1.794 ± 0.618 | 0.0 ± 0.0
Lys | 7.176 ± 1.281 | 0.0 ± 0.0 | 5.023 ± 1.245 | 13.635 ± 2.51 | 1.435 ± 0.485 | 3.229 ± 1.091 | 1.076 ± 0.425 | 6.459 ± 1.721 | 8.97 ± 1.928 | 12.199 ± 1.808 | 1.435 ± 0.652 | 3.588 ± 0.968 | 2.512 ± 0.792 | 4.665 ± 0.989 | 5.741 ± 1.964 | 5.741 ± 1.299 | 9.329 ± 2.17 | 4.306 ± 1.369 | 0.0 ± 0.0 | 3.229 ± 1.269 | 0.0 ± 0.0
Leu | 7.176 ± 1.799 | 0.359 ± 0.306 | 7.535 ± 1.549 | 10.047 ± 1.628 | 3.229 ± 1.142 | 5.023 ± 1.517 | 0.718 ± 0.492 | 6.1 ± 1.763 | 8.253 ± 1.606 | 9.688 ± 1.729 | 2.87 ± 1.217 | 5.741 ± 1.902 | 1.435 ± 0.861 | 3.947 ± 1.318 | 3.229 ± 1.03 | 5.382 ± 1.616 | 5.382 ± 1.083 | 3.947 ± 1.33 | 0.718 ± 0.411 | 2.153 ± 0.832 | 0.0 ± 0.0
Met | 3.588 ± 1.216 | 0.0 ± 0.0 | 1.076 ± 0.503 | 2.512 ± 1.01 | 0.718 ± 0.52 | 1.076 ± 0.529 | 0.0 ± 0.0 | 2.153 ± 0.671 | 2.153 ± 0.779 | 1.435 ± 0.744 | 0.0 ± 0.0 | 2.153 ± 0.842 | 1.076 ± 0.603 | 0.0 ± 0.0 | 1.076 ± 0.722 | 0.359 ± 0.301 | 3.947 ± 1.01 | 0.718 ± 0.454 | 0.0 ± 0.0 | 0.359 ± 0.341 | 0.0 ± 0.0
Asn | 5.382 ± 1.226 | 0.0 ± 0.0 | 3.229 ± 1.167 | 3.229 ± 0.886 | 2.512 ± 0.971 | 3.588 ± 0.954 | 1.435 ± 0.659 | 3.588 ± 0.864 | 3.229 ± 0.951 | 5.023 ± 1.131 | 0.359 ± 0.301 | 3.229 ± 0.903 | 1.076 ± 0.521 | 3.588 ± 1.28 | 2.153 ± 0.936 | 4.306 ± 1.07 | 3.229 ± 1.085 | 1.076 ± 0.607 | 1.435 ± 0.78 | 3.229 ± 1.056 | 0.0 ± 0.0
Pro | 1.435 ± 0.608 | 0.0 ± 0.0 | 1.076 ± 0.54 | 2.87 ± 0.781 | 1.076 ± 0.765 | 0.718 ± 0.429 | 0.359 ± 0.385 | 0.359 ± 0.385 | 3.947 ± 1.158 | 1.435 ± 0.78 | 0.0 ± 0.0 | 1.794 ± 0.955 | 1.076 ± 0.562 | 1.076 ± 0.414 | 1.435 ± 1.041 | 2.153 ± 0.95 | 1.435 ± 0.537 | 1.076 ± 0.567 | 0.0 ± 0.0 | 0.718 ± 0.432 | 0.0 ± 0.0
Gln | 3.947 ± 1.098 | 0.359 ± 0.348 | 2.87 ± 0.999 | 4.665 ± 1.256 | 2.153 ± 1.098 | 1.435 ± 0.665 | 1.435 ± 0.755 | 2.153 ± 0.822 | 4.306 ± 1.715 | 2.512 ± 0.959 | 0.359 ± 0.324 | 3.229 ± 0.915 | 1.076 ± 0.836 | 3.588 ± 1.091 | 2.512 ± 0.92 | 1.794 ± 0.803 | 2.153 ± 0.711 | 2.87 ± 1.12 | 0.718 ± 0.445 | 2.512 ± 0.711 | 0.0 ± 0.0
Arg | 2.512 ± 0.703 | 0.359 ± 0.332 | 2.87 ± 0.779 | 4.665 ± 1.217 | 1.076 ± 0.717 | 2.512 ± 0.981 | 1.076 ± 0.729 | 4.306 ± 0.825 | 4.306 ± 1.083 | 7.176 ± 1.476 | 2.87 ± 0.798 | 1.794 ± 0.894 | 1.076 ± 0.565 | 3.588 ± 1.542 | 1.435 ± 0.78 | 1.076 ± 0.555 | 2.153 ± 0.984 | 2.87 ± 0.843 | 0.718 ± 0.553 | 2.512 ± 0.907 | 0.0 ± 0.0
Ser | 2.153 ± 0.851 | 0.359 ± 0.348 | 4.306 ± 0.991 | 5.382 ± 1.433 | 3.229 ± 1.205 | 3.229 ± 1.121 | 1.794 ± 0.688 | 5.741 ± 1.31 | 7.894 ± 1.575 | 6.459 ± 1.459 | 1.794 ± 0.719 | 2.153 ± 0.852 | 1.435 ± 0.62 | 1.794 ± 0.763 | 0.718 ± 0.44 | 2.87 ± 1.013 | 2.87 ± 0.624 | 1.794 ± 0.706 | 0.359 ± 0.301 | 3.588 ± 1.143 | 0.0 ± 0.0
Thr | 3.588 ± 0.978 | 0.359 ± 0.332 | 2.153 ± 0.685 | 3.229 ± 1.277 | 3.229 ± 1.2 | 3.229 ± 1.022 | 0.718 ± 0.43 | 5.382 ± 1.39 | 2.87 ± 0.985 | 5.023 ± 1.271 | 1.076 ± 0.607 | 2.512 ± 0.709 | 2.153 ± 0.613 | 3.588 ± 1.288 | 2.87 ± 1.064 | 3.947 ± 1.233 | 2.153 ± 0.772 | 5.741 ± 1.311 | 0.0 ± 0.0 | 2.87 ± 1.28 | 0.0 ± 0.0
Val | 2.87 ± 1.044 | 0.359 ± 0.351 | 2.512 ± 0.815 | 2.87 ± 1.015 | 2.512 ± 0.92 | 2.153 ± 0.566 | 0.0 ± 0.0 | 1.794 ± 0.732 | 5.023 ± 1.894 | 2.512 ± 1.023 | 0.718 ± 0.485 | 2.87 ± 0.912 | 0.0 ± 0.0 | 4.306 ± 1.199 | 3.229 ± 0.925 | 4.665 ± 1.119 | 2.153 ± 1.066 | 0.718 ± 0.664 | 0.359 ± 0.281 | 1.794 ± 0.879 | 0.0 ± 0.0
Trp | 0.718 ± 0.487 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.794 ± 0.788 | 0.0 ± 0.0 | 0.359 ± 0.281 | 0.359 ± 0.37 | 0.0 ± 0.0 | 0.359 ± 0.419 | 0.718 ± 0.411 | 0.0 ± 0.0 | 0.359 ± 0.281 | 0.0 ± 0.0 | 0.359 ± 0.306 | 0.0 ± 0.0 | 0.359 ± 0.332 | 0.359 ± 0.348 | 0.359 ± 0.306 | 0.359 ± 0.281 | 0.359 ± 0.348 | 0.0 ± 0.0
Tyr | 0.359 ± 0.332 | 0.718 ± 0.426 | 0.718 ± 0.482 | 2.512 ± 0.728 | 2.87 ± 1.416 | 1.076 ± 0.538 | 1.435 ± 0.586 | 2.87 ± 1.149 | 5.382 ± 1.566 | 3.947 ± 1.151 | 1.794 ± 0.698 | 2.87 ± 1.008 | 0.718 ± 0.467 | 2.87 ± 0.875 | 4.306 ± 0.855 | 2.512 ± 0.822 | 1.076 ± 0.523 | 1.794 ± 0.712 | 0.0 ± 0.0 | 2.87 ± 1.185 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 17 proteins (2788 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
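The page does not spell out the bootstrap procedure, so the following Python sketch shows only one plausible reading: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the use of the population standard deviation are all assumptions made for illustration.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: each round draws len(proteins) proteins with replacement,
    and the error is the standard deviation across rounds.
    """
    rng = random.Random(seed)  # seeded only to make the sketch reproducible
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLEE", "MAKE", "MKLLKE"]  # made-up sequences
    print(pair_per_mille(toy, "KE"), "±", bootstrap_error(toy, "KE"))
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, which is why a small proteome such as this one (17 proteins) shows relatively large errors.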

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski