Amino acid dipepetide frequency for Streptococcus phage Javan579

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.26AlaAla: 3.26 ± 0.788
0.362AlaCys: 0.362 ± 0.252
4.528AlaAsp: 4.528 ± 0.784
7.245AlaGlu: 7.245 ± 0.876
3.623AlaPhe: 3.623 ± 0.866
4.709AlaGly: 4.709 ± 0.815
1.449AlaHis: 1.449 ± 0.504
3.804AlaIle: 3.804 ± 0.878
6.521AlaLys: 6.521 ± 0.842
5.072AlaLeu: 5.072 ± 0.882
1.811AlaMet: 1.811 ± 0.503
4.528AlaAsn: 4.528 ± 0.932
1.087AlaPro: 1.087 ± 0.449
2.898AlaGln: 2.898 ± 1.071
2.898AlaArg: 2.898 ± 0.727
5.615AlaSer: 5.615 ± 1.289
3.441AlaThr: 3.441 ± 0.756
5.072AlaVal: 5.072 ± 0.728
0.362AlaTrp: 0.362 ± 0.247
2.355AlaTyr: 2.355 ± 0.544
0.0AlaXaa: 0.0 ± 0.0
Cys
0.906CysAla: 0.906 ± 0.335
0.362CysCys: 0.362 ± 0.253
0.181CysAsp: 0.181 ± 0.162
0.543CysGlu: 0.543 ± 0.258
0.543CysPhe: 0.543 ± 0.306
1.449CysGly: 1.449 ± 0.689
0.181CysHis: 0.181 ± 0.151
0.543CysIle: 0.543 ± 0.309
0.543CysLys: 0.543 ± 0.235
0.725CysLeu: 0.725 ± 0.356
0.0CysMet: 0.0 ± 0.0
0.181CysAsn: 0.181 ± 0.181
0.362CysPro: 0.362 ± 0.257
0.181CysGln: 0.181 ± 0.178
0.543CysArg: 0.543 ± 0.397
0.543CysSer: 0.543 ± 0.321
0.181CysThr: 0.181 ± 0.177
0.362CysVal: 0.362 ± 0.294
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.166AspAla: 4.166 ± 0.909
0.543AspCys: 0.543 ± 0.281
4.528AspAsp: 4.528 ± 0.975
4.528AspGlu: 4.528 ± 1.009
4.166AspPhe: 4.166 ± 0.874
4.166AspGly: 4.166 ± 0.568
0.906AspHis: 0.906 ± 0.381
6.158AspIle: 6.158 ± 0.943
5.434AspLys: 5.434 ± 0.914
5.615AspLeu: 5.615 ± 0.847
0.906AspMet: 0.906 ± 0.407
4.166AspAsn: 4.166 ± 0.955
1.63AspPro: 1.63 ± 0.513
1.087AspGln: 1.087 ± 0.433
3.26AspArg: 3.26 ± 0.856
3.26AspSer: 3.26 ± 0.784
3.079AspThr: 3.079 ± 0.779
4.709AspVal: 4.709 ± 0.849
1.087AspTrp: 1.087 ± 0.396
3.985AspTyr: 3.985 ± 0.916
0.0AspXaa: 0.0 ± 0.0
Glu
6.521GluAla: 6.521 ± 1.086
0.362GluCys: 0.362 ± 0.309
3.26GluAsp: 3.26 ± 0.667
6.158GluGlu: 6.158 ± 1.265
2.717GluPhe: 2.717 ± 0.629
2.174GluGly: 2.174 ± 0.541
1.449GluHis: 1.449 ± 0.409
5.434GluIle: 5.434 ± 1.259
5.072GluLys: 5.072 ± 0.778
8.151GluLeu: 8.151 ± 1.249
3.26GluMet: 3.26 ± 0.862
3.623GluAsn: 3.623 ± 0.83
1.811GluPro: 1.811 ± 0.575
2.717GluGln: 2.717 ± 0.774
3.985GluArg: 3.985 ± 0.759
4.709GluSer: 4.709 ± 0.788
2.536GluThr: 2.536 ± 0.692
4.89GluVal: 4.89 ± 0.864
0.725GluTrp: 0.725 ± 0.352
2.898GluTyr: 2.898 ± 0.778
0.0GluXaa: 0.0 ± 0.0
Phe
4.166PheAla: 4.166 ± 0.83
0.362PheCys: 0.362 ± 0.287
3.441PheAsp: 3.441 ± 0.806
4.709PheGlu: 4.709 ± 0.981
1.449PhePhe: 1.449 ± 0.403
3.26PheGly: 3.26 ± 0.812
0.181PheHis: 0.181 ± 0.177
1.992PheIle: 1.992 ± 0.505
3.441PheLys: 3.441 ± 0.61
3.079PheLeu: 3.079 ± 0.562
1.268PheMet: 1.268 ± 0.409
1.811PheAsn: 1.811 ± 0.649
1.268PhePro: 1.268 ± 0.474
1.268PheGln: 1.268 ± 0.438
1.449PheArg: 1.449 ± 0.666
1.087PheSer: 1.087 ± 0.372
3.079PheThr: 3.079 ± 0.873
2.536PheVal: 2.536 ± 0.639
0.543PheTrp: 0.543 ± 0.335
1.449PheTyr: 1.449 ± 0.584
0.0PheXaa: 0.0 ± 0.0
Gly
3.985GlyAla: 3.985 ± 0.589
0.0GlyCys: 0.0 ± 0.0
3.441GlyAsp: 3.441 ± 0.741
3.26GlyGlu: 3.26 ± 0.841
2.898GlyPhe: 2.898 ± 0.7
2.174GlyGly: 2.174 ± 0.72
1.63GlyHis: 1.63 ± 0.543
4.528GlyIle: 4.528 ± 0.821
4.709GlyLys: 4.709 ± 1.236
4.89GlyLeu: 4.89 ± 0.967
2.174GlyMet: 2.174 ± 0.611
2.355GlyAsn: 2.355 ± 0.787
0.543GlyPro: 0.543 ± 0.277
3.441GlyGln: 3.441 ± 0.658
3.079GlyArg: 3.079 ± 0.656
2.717GlySer: 2.717 ± 0.721
1.63GlyThr: 1.63 ± 0.576
3.441GlyVal: 3.441 ± 0.607
0.543GlyTrp: 0.543 ± 0.307
2.536GlyTyr: 2.536 ± 0.437
0.0GlyXaa: 0.0 ± 0.0
His
0.725HisAla: 0.725 ± 0.397
0.181HisCys: 0.181 ± 0.181
1.63HisAsp: 1.63 ± 0.455
1.268HisGlu: 1.268 ± 0.473
1.087HisPhe: 1.087 ± 0.622
0.543HisGly: 0.543 ± 0.31
0.181HisHis: 0.181 ± 0.181
1.268HisIle: 1.268 ± 0.506
1.811HisLys: 1.811 ± 0.517
0.543HisLeu: 0.543 ± 0.284
0.543HisMet: 0.543 ± 0.326
1.63HisAsn: 1.63 ± 0.486
1.087HisPro: 1.087 ± 0.466
0.725HisGln: 0.725 ± 0.345
0.906HisArg: 0.906 ± 0.354
1.087HisSer: 1.087 ± 0.433
0.543HisThr: 0.543 ± 0.286
0.543HisVal: 0.543 ± 0.323
0.0HisTrp: 0.0 ± 0.0
0.906HisTyr: 0.906 ± 0.458
0.0HisXaa: 0.0 ± 0.0
Ile
5.253IleAla: 5.253 ± 0.846
0.543IleCys: 0.543 ± 0.289
5.434IleAsp: 5.434 ± 0.93
4.166IleGlu: 4.166 ± 0.684
3.623IlePhe: 3.623 ± 0.838
3.441IleGly: 3.441 ± 0.758
1.449IleHis: 1.449 ± 0.52
2.536IleIle: 2.536 ± 0.882
5.796IleLys: 5.796 ± 1.358
5.253IleLeu: 5.253 ± 0.766
1.449IleMet: 1.449 ± 0.573
1.992IleAsn: 1.992 ± 0.667
1.449IlePro: 1.449 ± 0.45
2.355IleGln: 2.355 ± 0.659
3.441IleArg: 3.441 ± 0.735
3.804IleSer: 3.804 ± 0.695
3.079IleThr: 3.079 ± 0.717
4.528IleVal: 4.528 ± 1.084
0.725IleTrp: 0.725 ± 0.326
3.26IleTyr: 3.26 ± 1.09
0.0IleXaa: 0.0 ± 0.0
Lys
5.434LysAla: 5.434 ± 0.876
0.906LysCys: 0.906 ± 0.424
4.166LysAsp: 4.166 ± 0.664
8.151LysGlu: 8.151 ± 1.368
1.992LysPhe: 1.992 ± 0.786
4.347LysGly: 4.347 ± 0.78
1.449LysHis: 1.449 ± 0.756
5.615LysIle: 5.615 ± 0.836
10.686LysLys: 10.686 ± 1.763
5.253LysLeu: 5.253 ± 0.677
1.992LysMet: 1.992 ± 0.495
5.434LysAsn: 5.434 ± 1.25
1.811LysPro: 1.811 ± 0.525
5.434LysGln: 5.434 ± 1.061
4.347LysArg: 4.347 ± 0.954
3.623LysSer: 3.623 ± 0.781
6.883LysThr: 6.883 ± 1.04
5.615LysVal: 5.615 ± 1.392
1.63LysTrp: 1.63 ± 0.499
3.985LysTyr: 3.985 ± 0.972
0.0LysXaa: 0.0 ± 0.0
Leu
7.064LeuAla: 7.064 ± 0.855
0.725LeuCys: 0.725 ± 0.359
4.347LeuAsp: 4.347 ± 0.85
5.796LeuGlu: 5.796 ± 0.781
3.623LeuPhe: 3.623 ± 0.749
5.434LeuGly: 5.434 ± 0.766
0.543LeuHis: 0.543 ± 0.284
4.347LeuIle: 4.347 ± 0.66
7.788LeuLys: 7.788 ± 1.339
7.607LeuLeu: 7.607 ± 0.949
1.811LeuMet: 1.811 ± 0.668
4.89LeuAsn: 4.89 ± 1.132
3.26LeuPro: 3.26 ± 0.648
3.441LeuGln: 3.441 ± 0.767
5.253LeuArg: 5.253 ± 1.009
5.072LeuSer: 5.072 ± 1.196
3.441LeuThr: 3.441 ± 0.771
6.339LeuVal: 6.339 ± 1.117
0.725LeuTrp: 0.725 ± 0.34
1.811LeuTyr: 1.811 ± 0.512
0.0LeuXaa: 0.0 ± 0.0
Met
2.355MetAla: 2.355 ± 0.643
0.181MetCys: 0.181 ± 0.181
1.811MetAsp: 1.811 ± 0.569
1.811MetGlu: 1.811 ± 0.426
0.181MetPhe: 0.181 ± 0.152
0.906MetGly: 0.906 ± 0.417
0.725MetHis: 0.725 ± 0.351
2.174MetIle: 2.174 ± 0.675
2.355MetLys: 2.355 ± 0.71
2.174MetLeu: 2.174 ± 0.571
0.362MetMet: 0.362 ± 0.242
0.725MetAsn: 0.725 ± 0.424
0.362MetPro: 0.362 ± 0.256
0.725MetGln: 0.725 ± 0.308
1.449MetArg: 1.449 ± 0.431
1.811MetSer: 1.811 ± 0.607
2.898MetThr: 2.898 ± 0.738
1.992MetVal: 1.992 ± 0.647
0.181MetTrp: 0.181 ± 0.182
1.268MetTyr: 1.268 ± 0.629
0.0MetXaa: 0.0 ± 0.0
Asn
2.536AsnAla: 2.536 ± 0.753
0.543AsnCys: 0.543 ± 0.301
4.89AsnAsp: 4.89 ± 0.842
1.992AsnGlu: 1.992 ± 0.604
1.811AsnPhe: 1.811 ± 0.5
3.804AsnGly: 3.804 ± 0.751
0.725AsnHis: 0.725 ± 0.305
2.898AsnIle: 2.898 ± 0.687
4.528AsnLys: 4.528 ± 0.756
4.166AsnLeu: 4.166 ± 0.843
1.449AsnMet: 1.449 ± 0.636
4.166AsnAsn: 4.166 ± 0.814
3.441AsnPro: 3.441 ± 1.07
3.623AsnGln: 3.623 ± 0.704
3.441AsnArg: 3.441 ± 0.886
2.536AsnSer: 2.536 ± 0.71
1.992AsnThr: 1.992 ± 0.653
2.355AsnVal: 2.355 ± 0.647
0.543AsnTrp: 0.543 ± 0.349
1.992AsnTyr: 1.992 ± 0.707
0.0AsnXaa: 0.0 ± 0.0
Pro
1.449ProAla: 1.449 ± 0.512
0.725ProCys: 0.725 ± 0.39
2.717ProAsp: 2.717 ± 0.93
2.355ProGlu: 2.355 ± 0.721
1.268ProPhe: 1.268 ± 0.513
1.268ProGly: 1.268 ± 0.395
0.725ProHis: 0.725 ± 0.3
1.268ProIle: 1.268 ± 0.332
3.26ProLys: 3.26 ± 0.908
2.898ProLeu: 2.898 ± 0.567
1.087ProMet: 1.087 ± 0.425
2.355ProAsn: 2.355 ± 0.535
0.0ProPro: 0.0 ± 0.0
0.725ProGln: 0.725 ± 0.305
0.362ProArg: 0.362 ± 0.244
2.536ProSer: 2.536 ± 0.556
1.268ProThr: 1.268 ± 0.486
1.087ProVal: 1.087 ± 0.363
0.362ProTrp: 0.362 ± 0.261
0.543ProTyr: 0.543 ± 0.293
0.0ProXaa: 0.0 ± 0.0
Gln
3.623GlnAla: 3.623 ± 0.799
0.0GlnCys: 0.0 ± 0.0
3.26GlnAsp: 3.26 ± 0.773
1.811GlnGlu: 1.811 ± 0.527
1.63GlnPhe: 1.63 ± 0.55
0.906GlnGly: 0.906 ± 0.422
1.268GlnHis: 1.268 ± 0.596
2.355GlnIle: 2.355 ± 0.59
3.26GlnLys: 3.26 ± 0.73
4.166GlnLeu: 4.166 ± 0.821
0.725GlnMet: 0.725 ± 0.31
2.174GlnAsn: 2.174 ± 0.69
1.63GlnPro: 1.63 ± 0.543
1.268GlnGln: 1.268 ± 0.676
2.355GlnArg: 2.355 ± 0.512
3.623GlnSer: 3.623 ± 0.766
2.717GlnThr: 2.717 ± 0.651
2.174GlnVal: 2.174 ± 0.39
0.181GlnTrp: 0.181 ± 0.186
1.811GlnTyr: 1.811 ± 0.518
0.0GlnXaa: 0.0 ± 0.0
Arg
3.623ArgAla: 3.623 ± 0.633
0.543ArgCys: 0.543 ± 0.389
4.528ArgAsp: 4.528 ± 0.906
3.079ArgGlu: 3.079 ± 0.961
2.174ArgPhe: 2.174 ± 0.599
3.26ArgGly: 3.26 ± 0.789
1.087ArgHis: 1.087 ± 0.331
3.804ArgIle: 3.804 ± 0.721
5.072ArgLys: 5.072 ± 0.913
4.528ArgLeu: 4.528 ± 0.944
1.811ArgMet: 1.811 ± 0.517
3.079ArgAsn: 3.079 ± 0.793
0.906ArgPro: 0.906 ± 0.441
1.63ArgGln: 1.63 ± 0.83
4.166ArgArg: 4.166 ± 1.017
2.174ArgSer: 2.174 ± 0.537
2.717ArgThr: 2.717 ± 0.738
2.174ArgVal: 2.174 ± 0.476
0.543ArgTrp: 0.543 ± 0.292
2.355ArgTyr: 2.355 ± 0.644
0.0ArgXaa: 0.0 ± 0.0
Ser
2.717SerAla: 2.717 ± 0.639
0.543SerCys: 0.543 ± 0.292
3.623SerAsp: 3.623 ± 0.711
2.898SerGlu: 2.898 ± 0.716
1.63SerPhe: 1.63 ± 0.515
3.441SerGly: 3.441 ± 0.541
0.181SerHis: 0.181 ± 0.181
5.072SerIle: 5.072 ± 1.056
5.072SerLys: 5.072 ± 0.823
3.985SerLeu: 3.985 ± 0.724
2.355SerMet: 2.355 ± 0.692
4.166SerAsn: 4.166 ± 0.818
1.811SerPro: 1.811 ± 0.51
2.717SerGln: 2.717 ± 0.665
3.079SerArg: 3.079 ± 0.663
4.347SerSer: 4.347 ± 1.084
4.166SerThr: 4.166 ± 1.525
4.166SerVal: 4.166 ± 0.754
0.725SerTrp: 0.725 ± 0.423
2.536SerTyr: 2.536 ± 0.505
0.0SerXaa: 0.0 ± 0.0
Thr
4.166ThrAla: 4.166 ± 0.95
0.181ThrCys: 0.181 ± 0.162
1.992ThrAsp: 1.992 ± 0.568
3.804ThrGlu: 3.804 ± 1.217
2.174ThrPhe: 2.174 ± 0.619
4.166ThrGly: 4.166 ± 0.922
1.087ThrHis: 1.087 ± 0.541
3.623ThrIle: 3.623 ± 0.941
4.89ThrLys: 4.89 ± 1.066
4.347ThrLeu: 4.347 ± 0.932
1.63ThrMet: 1.63 ± 0.504
1.268ThrAsn: 1.268 ± 0.382
2.355ThrPro: 2.355 ± 0.52
1.811ThrGln: 1.811 ± 0.511
3.079ThrArg: 3.079 ± 0.717
3.985ThrSer: 3.985 ± 1.281
3.623ThrThr: 3.623 ± 0.832
3.441ThrVal: 3.441 ± 0.929
0.725ThrTrp: 0.725 ± 0.324
2.355ThrTyr: 2.355 ± 0.521
0.0ThrXaa: 0.0 ± 0.0
Val
5.434ValAla: 5.434 ± 0.934
0.543ValCys: 0.543 ± 0.25
5.434ValAsp: 5.434 ± 1.28
4.347ValGlu: 4.347 ± 1.043
2.174ValPhe: 2.174 ± 0.653
2.717ValGly: 2.717 ± 0.564
0.906ValHis: 0.906 ± 0.396
3.804ValIle: 3.804 ± 0.812
6.158ValLys: 6.158 ± 1.115
6.339ValLeu: 6.339 ± 1.077
0.906ValMet: 0.906 ± 0.331
3.26ValAsn: 3.26 ± 0.66
1.63ValPro: 1.63 ± 0.532
2.174ValGln: 2.174 ± 0.863
3.623ValArg: 3.623 ± 0.778
3.441ValSer: 3.441 ± 0.695
3.441ValThr: 3.441 ± 1.169
4.709ValVal: 4.709 ± 1.188
0.543ValTrp: 0.543 ± 0.282
2.355ValTyr: 2.355 ± 0.614
0.0ValXaa: 0.0 ± 0.0
Trp
0.725TrpAla: 0.725 ± 0.34
0.362TrpCys: 0.362 ± 0.226
0.543TrpAsp: 0.543 ± 0.296
1.811TrpGlu: 1.811 ± 0.495
0.543TrpPhe: 0.543 ± 0.316
0.543TrpGly: 0.543 ± 0.291
0.181TrpHis: 0.181 ± 0.175
0.362TrpIle: 0.362 ± 0.246
0.725TrpLys: 0.725 ± 0.361
0.362TrpLeu: 0.362 ± 0.241
0.362TrpMet: 0.362 ± 0.238
0.181TrpAsn: 0.181 ± 0.186
0.0TrpPro: 0.0 ± 0.0
0.725TrpGln: 0.725 ± 0.57
0.543TrpArg: 0.543 ± 0.269
0.543TrpSer: 0.543 ± 0.307
1.087TrpThr: 1.087 ± 0.46
0.543TrpVal: 0.543 ± 0.31
0.0TrpTrp: 0.0 ± 0.0
0.543TrpTyr: 0.543 ± 0.249
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.717TyrAla: 2.717 ± 0.887
0.362TyrCys: 0.362 ± 0.253
3.623TyrAsp: 3.623 ± 0.766
2.717TyrGlu: 2.717 ± 0.58
2.536TyrPhe: 2.536 ± 0.563
1.268TyrGly: 1.268 ± 0.433
0.906TyrHis: 0.906 ± 0.359
1.992TyrIle: 1.992 ± 0.606
1.63TyrLys: 1.63 ± 0.575
4.166TyrLeu: 4.166 ± 0.852
0.362TyrMet: 0.362 ± 0.241
1.449TyrAsn: 1.449 ± 0.547
1.811TyrPro: 1.811 ± 0.37
1.992TyrGln: 1.992 ± 0.591
1.992TyrArg: 1.992 ± 0.593
2.717TyrSer: 2.717 ± 0.704
2.898TyrThr: 2.898 ± 0.742
3.26TyrVal: 3.26 ± 0.457
0.543TyrTrp: 0.543 ± 0.303
2.717TyrTyr: 2.717 ± 0.605
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 33 proteins (5522 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski