Amino acid dipepetide frequency for Streptococcus satellite phage Javan538

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
1.238AlaCys: 1.238 ± 0.715
2.889AlaAsp: 2.889 ± 1.105
4.127AlaGlu: 4.127 ± 1.752
2.476AlaPhe: 2.476 ± 0.981
2.889AlaGly: 2.889 ± 0.861
0.413AlaHis: 0.413 ± 0.438
5.365AlaIle: 5.365 ± 1.617
4.127AlaLys: 4.127 ± 1.046
4.54AlaLeu: 4.54 ± 1.267
1.651AlaMet: 1.651 ± 0.799
5.778AlaAsn: 5.778 ± 1.972
3.302AlaPro: 3.302 ± 1.124
3.302AlaGln: 3.302 ± 1.018
3.714AlaArg: 3.714 ± 1.364
2.064AlaSer: 2.064 ± 0.997
4.127AlaThr: 4.127 ± 1.305
4.127AlaVal: 4.127 ± 1.158
0.825AlaTrp: 0.825 ± 0.463
2.476AlaTyr: 2.476 ± 0.703
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.413CysAsp: 0.413 ± 0.506
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.413CysGly: 0.413 ± 0.274
0.0CysHis: 0.0 ± 0.0
0.413CysIle: 0.413 ± 0.438
0.413CysLys: 0.413 ± 0.274
0.825CysLeu: 0.825 ± 0.526
0.0CysMet: 0.0 ± 0.0
0.413CysAsn: 0.413 ± 0.274
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.413CysArg: 0.413 ± 0.362
0.0CysSer: 0.0 ± 0.0
0.413CysThr: 0.413 ± 0.383
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.825CysTyr: 0.825 ± 0.645
0.0CysXaa: 0.0 ± 0.0
Asp
1.238AspAla: 1.238 ± 0.589
0.825AspCys: 0.825 ± 0.534
2.476AspAsp: 2.476 ± 0.708
3.714AspGlu: 3.714 ± 1.449
4.54AspPhe: 4.54 ± 1.138
2.476AspGly: 2.476 ± 0.887
0.825AspHis: 0.825 ± 0.534
6.191AspIle: 6.191 ± 1.432
4.953AspLys: 4.953 ± 1.66
4.127AspLeu: 4.127 ± 0.794
2.476AspMet: 2.476 ± 1.074
5.778AspAsn: 5.778 ± 1.638
0.825AspPro: 0.825 ± 0.456
1.238AspGln: 1.238 ± 0.51
2.064AspArg: 2.064 ± 0.868
2.889AspSer: 2.889 ± 0.596
4.127AspThr: 4.127 ± 0.992
2.476AspVal: 2.476 ± 0.867
0.825AspTrp: 0.825 ± 0.548
3.302AspTyr: 3.302 ± 1.564
0.0AspXaa: 0.0 ± 0.0
Glu
6.191GluAla: 6.191 ± 1.161
0.0GluCys: 0.0 ± 0.0
4.54GluAsp: 4.54 ± 1.375
4.127GluGlu: 4.127 ± 1.317
2.476GluPhe: 2.476 ± 0.855
2.476GluGly: 2.476 ± 0.698
0.413GluHis: 0.413 ± 0.413
4.953GluIle: 4.953 ± 1.037
5.778GluLys: 5.778 ± 1.03
14.032GluLeu: 14.032 ± 2.122
1.238GluMet: 1.238 ± 0.613
4.54GluAsn: 4.54 ± 0.946
2.476GluPro: 2.476 ± 1.041
5.778GluGln: 5.778 ± 2.109
2.476GluArg: 2.476 ± 0.834
2.889GluSer: 2.889 ± 0.84
3.714GluThr: 3.714 ± 1.012
2.889GluVal: 2.889 ± 1.184
0.825GluTrp: 0.825 ± 0.526
3.302GluTyr: 3.302 ± 1.416
0.0GluXaa: 0.0 ± 0.0
Phe
2.064PheAla: 2.064 ± 0.89
0.825PheCys: 0.825 ± 0.536
4.54PheAsp: 4.54 ± 1.295
1.238PheGlu: 1.238 ± 0.589
1.238PhePhe: 1.238 ± 0.909
2.476PheGly: 2.476 ± 0.954
0.825PheHis: 0.825 ± 0.463
2.889PheIle: 2.889 ± 0.793
3.714PheLys: 3.714 ± 1.216
4.127PheLeu: 4.127 ± 0.815
0.825PheMet: 0.825 ± 0.569
3.302PheAsn: 3.302 ± 1.324
0.413PhePro: 0.413 ± 0.43
1.651PheGln: 1.651 ± 0.859
1.651PheArg: 1.651 ± 0.755
2.889PheSer: 2.889 ± 0.847
1.238PheThr: 1.238 ± 0.821
1.651PheVal: 1.651 ± 0.719
0.413PheTrp: 0.413 ± 0.274
1.238PheTyr: 1.238 ± 0.769
0.0PheXaa: 0.0 ± 0.0
Gly
1.651GlyAla: 1.651 ± 0.688
0.825GlyCys: 0.825 ± 0.463
3.714GlyAsp: 3.714 ± 1.34
4.127GlyGlu: 4.127 ± 1.353
3.302GlyPhe: 3.302 ± 1.288
2.476GlyGly: 2.476 ± 0.545
0.825GlyHis: 0.825 ± 0.445
3.302GlyIle: 3.302 ± 0.846
5.365GlyLys: 5.365 ± 1.269
3.714GlyLeu: 3.714 ± 1.63
0.825GlyMet: 0.825 ± 0.575
4.54GlyAsn: 4.54 ± 1.682
0.0GlyPro: 0.0 ± 0.0
2.476GlyGln: 2.476 ± 1.09
1.238GlyArg: 1.238 ± 0.951
2.064GlySer: 2.064 ± 1.18
1.238GlyThr: 1.238 ± 0.573
4.953GlyVal: 4.953 ± 1.652
0.825GlyTrp: 0.825 ± 0.548
4.953GlyTyr: 4.953 ± 1.603
0.0GlyXaa: 0.0 ± 0.0
His
0.413HisAla: 0.413 ± 0.362
0.0HisCys: 0.0 ± 0.0
0.825HisAsp: 0.825 ± 0.611
0.825HisGlu: 0.825 ± 0.468
2.476HisPhe: 2.476 ± 0.836
1.238HisGly: 1.238 ± 0.951
0.413HisHis: 0.413 ± 0.506
0.825HisIle: 0.825 ± 0.463
0.0HisLys: 0.0 ± 0.0
0.825HisLeu: 0.825 ± 0.617
0.0HisMet: 0.0 ± 0.0
0.825HisAsn: 0.825 ± 0.462
0.0HisPro: 0.0 ± 0.0
0.413HisGln: 0.413 ± 0.467
1.651HisArg: 1.651 ± 0.72
2.476HisSer: 2.476 ± 0.791
2.064HisThr: 2.064 ± 1.053
0.413HisVal: 0.413 ± 0.362
0.0HisTrp: 0.0 ± 0.0
1.238HisTyr: 1.238 ± 0.71
0.0HisXaa: 0.0 ± 0.0
Ile
4.953IleAla: 4.953 ± 1.339
0.0IleCys: 0.0 ± 0.0
3.714IleAsp: 3.714 ± 1.353
4.953IleGlu: 4.953 ± 1.217
2.889IlePhe: 2.889 ± 1.024
2.476IleGly: 2.476 ± 0.649
0.825IleHis: 0.825 ± 0.577
4.127IleIle: 4.127 ± 1.406
5.778IleLys: 5.778 ± 1.431
4.54IleLeu: 4.54 ± 1.137
1.238IleMet: 1.238 ± 0.473
3.302IleAsn: 3.302 ± 0.922
2.476IlePro: 2.476 ± 0.899
2.064IleGln: 2.064 ± 0.654
2.064IleArg: 2.064 ± 0.841
4.54IleSer: 4.54 ± 2.024
7.016IleThr: 7.016 ± 0.776
5.365IleVal: 5.365 ± 1.664
0.0IleTrp: 0.0 ± 0.0
4.953IleTyr: 4.953 ± 1.336
0.0IleXaa: 0.0 ± 0.0
Lys
7.429LysAla: 7.429 ± 2.787
0.0LysCys: 0.0 ± 0.0
4.953LysAsp: 4.953 ± 1.217
9.08LysGlu: 9.08 ± 2.214
1.238LysPhe: 1.238 ± 0.492
3.714LysGly: 3.714 ± 1.401
2.476LysHis: 2.476 ± 0.624
4.953LysIle: 4.953 ± 1.434
11.143LysLys: 11.143 ± 1.976
6.603LysLeu: 6.603 ± 1.776
0.825LysMet: 0.825 ± 0.645
5.365LysAsn: 5.365 ± 1.496
4.953LysPro: 4.953 ± 1.59
4.127LysGln: 4.127 ± 0.942
4.127LysArg: 4.127 ± 1.496
2.889LysSer: 2.889 ± 0.801
5.778LysThr: 5.778 ± 1.38
4.127LysVal: 4.127 ± 1.36
0.825LysTrp: 0.825 ± 0.556
2.064LysTyr: 2.064 ± 1.124
0.0LysXaa: 0.0 ± 0.0
Leu
7.429LeuAla: 7.429 ± 1.956
0.0LeuCys: 0.0 ± 0.0
7.429LeuAsp: 7.429 ± 1.196
10.73LeuGlu: 10.73 ± 2.862
3.714LeuPhe: 3.714 ± 0.999
4.953LeuGly: 4.953 ± 1.828
1.651LeuHis: 1.651 ± 0.879
4.54LeuIle: 4.54 ± 1.099
7.016LeuLys: 7.016 ± 1.422
8.667LeuLeu: 8.667 ± 1.147
1.651LeuMet: 1.651 ± 0.873
5.365LeuAsn: 5.365 ± 1.419
4.127LeuPro: 4.127 ± 1.083
2.889LeuGln: 2.889 ± 1.13
2.064LeuArg: 2.064 ± 0.716
5.778LeuSer: 5.778 ± 1.363
4.54LeuThr: 4.54 ± 1.136
4.54LeuVal: 4.54 ± 1.317
0.825LeuTrp: 0.825 ± 0.547
3.714LeuTyr: 3.714 ± 1.235
0.0LeuXaa: 0.0 ± 0.0
Met
2.889MetAla: 2.889 ± 1.943
0.0MetCys: 0.0 ± 0.0
2.476MetAsp: 2.476 ± 0.885
1.651MetGlu: 1.651 ± 0.68
0.0MetPhe: 0.0 ± 0.0
0.413MetGly: 0.413 ± 0.274
0.0MetHis: 0.0 ± 0.0
0.825MetIle: 0.825 ± 0.536
0.825MetLys: 0.825 ± 0.348
1.651MetLeu: 1.651 ± 0.71
0.0MetMet: 0.0 ± 0.0
1.651MetAsn: 1.651 ± 0.726
0.413MetPro: 0.413 ± 0.438
0.413MetGln: 0.413 ± 0.467
1.238MetArg: 1.238 ± 0.75
1.651MetSer: 1.651 ± 0.851
2.064MetThr: 2.064 ± 0.598
1.238MetVal: 1.238 ± 0.583
0.413MetTrp: 0.413 ± 0.413
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.127AsnAla: 4.127 ± 0.69
0.0AsnCys: 0.0 ± 0.0
1.651AsnAsp: 1.651 ± 0.45
6.603AsnGlu: 6.603 ± 1.604
0.825AsnPhe: 0.825 ± 0.534
7.429AsnGly: 7.429 ± 2.103
0.825AsnHis: 0.825 ± 0.617
4.953AsnIle: 4.953 ± 1.204
6.191AsnLys: 6.191 ± 1.297
4.127AsnLeu: 4.127 ± 1.373
1.651AsnMet: 1.651 ± 0.839
3.714AsnAsn: 3.714 ± 1.339
2.476AsnPro: 2.476 ± 0.75
2.476AsnGln: 2.476 ± 0.826
2.064AsnArg: 2.064 ± 1.077
2.889AsnSer: 2.889 ± 1.753
3.714AsnThr: 3.714 ± 1.722
2.889AsnVal: 2.889 ± 1.001
0.825AsnTrp: 0.825 ± 0.526
4.54AsnTyr: 4.54 ± 0.92
0.0AsnXaa: 0.0 ± 0.0
Pro
2.476ProAla: 2.476 ± 0.751
0.0ProCys: 0.0 ± 0.0
1.238ProAsp: 1.238 ± 0.678
1.651ProGlu: 1.651 ± 0.79
2.064ProPhe: 2.064 ± 1.043
0.413ProGly: 0.413 ± 0.39
1.238ProHis: 1.238 ± 0.577
2.064ProIle: 2.064 ± 0.635
4.953ProLys: 4.953 ± 1.488
2.889ProLeu: 2.889 ± 0.563
0.0ProMet: 0.0 ± 0.0
3.302ProAsn: 3.302 ± 1.529
2.476ProPro: 2.476 ± 1.007
1.238ProGln: 1.238 ± 0.642
2.889ProArg: 2.889 ± 1.431
3.302ProSer: 3.302 ± 1.36
2.889ProThr: 2.889 ± 0.896
2.064ProVal: 2.064 ± 1.021
0.413ProTrp: 0.413 ± 0.274
0.825ProTyr: 0.825 ± 0.571
0.0ProXaa: 0.0 ± 0.0
Gln
4.127GlnAla: 4.127 ± 2.148
0.0GlnCys: 0.0 ± 0.0
0.825GlnAsp: 0.825 ± 0.556
4.54GlnGlu: 4.54 ± 1.374
2.064GlnPhe: 2.064 ± 0.852
2.476GlnGly: 2.476 ± 1.224
1.238GlnHis: 1.238 ± 0.573
3.302GlnIle: 3.302 ± 1.208
3.302GlnLys: 3.302 ± 0.846
2.064GlnLeu: 2.064 ± 0.846
0.0GlnMet: 0.0 ± 0.0
0.413GlnAsn: 0.413 ± 0.274
1.651GlnPro: 1.651 ± 0.95
2.476GlnGln: 2.476 ± 1.024
2.064GlnArg: 2.064 ± 0.617
2.889GlnSer: 2.889 ± 0.765
4.54GlnThr: 4.54 ± 0.717
2.064GlnVal: 2.064 ± 0.941
0.825GlnTrp: 0.825 ± 0.584
1.651GlnTyr: 1.651 ± 0.608
0.0GlnXaa: 0.0 ± 0.0
Arg
2.064ArgAla: 2.064 ± 0.718
0.0ArgCys: 0.0 ± 0.0
2.889ArgAsp: 2.889 ± 0.783
3.302ArgGlu: 3.302 ± 1.166
2.476ArgPhe: 2.476 ± 1.232
2.064ArgGly: 2.064 ± 1.01
0.413ArgHis: 0.413 ± 0.362
2.889ArgIle: 2.889 ± 0.756
2.476ArgLys: 2.476 ± 1.015
4.953ArgLeu: 4.953 ± 1.321
0.413ArgMet: 0.413 ± 0.438
4.953ArgAsn: 4.953 ± 1.041
0.413ArgPro: 0.413 ± 0.438
1.238ArgGln: 1.238 ± 0.921
1.651ArgArg: 1.651 ± 0.933
1.651ArgSer: 1.651 ± 0.933
2.889ArgThr: 2.889 ± 1.13
2.889ArgVal: 2.889 ± 0.978
1.238ArgTrp: 1.238 ± 0.716
3.302ArgTyr: 3.302 ± 0.852
0.0ArgXaa: 0.0 ± 0.0
Ser
3.714SerAla: 3.714 ± 1.214
0.0SerCys: 0.0 ± 0.0
4.127SerAsp: 4.127 ± 1.258
2.889SerGlu: 2.889 ± 0.972
1.238SerPhe: 1.238 ± 0.629
2.064SerGly: 2.064 ± 0.657
1.238SerHis: 1.238 ± 0.755
2.889SerIle: 2.889 ± 1.008
6.603SerLys: 6.603 ± 1.142
4.54SerLeu: 4.54 ± 0.89
1.238SerMet: 1.238 ± 0.418
2.064SerAsn: 2.064 ± 0.817
2.889SerPro: 2.889 ± 0.853
2.889SerGln: 2.889 ± 1.523
2.064SerArg: 2.064 ± 0.673
4.54SerSer: 4.54 ± 1.241
4.54SerThr: 4.54 ± 1.513
3.302SerVal: 3.302 ± 1.39
0.825SerTrp: 0.825 ± 0.724
2.889SerTyr: 2.889 ± 0.715
0.0SerXaa: 0.0 ± 0.0
Thr
4.127ThrAla: 4.127 ± 1.268
0.0ThrCys: 0.0 ± 0.0
3.302ThrAsp: 3.302 ± 1.264
3.302ThrGlu: 3.302 ± 1.359
3.714ThrPhe: 3.714 ± 1.332
2.889ThrGly: 2.889 ± 1.481
1.238ThrHis: 1.238 ± 0.655
3.714ThrIle: 3.714 ± 1.248
4.127ThrLys: 4.127 ± 1.186
6.191ThrLeu: 6.191 ± 0.854
3.302ThrMet: 3.302 ± 0.888
2.889ThrAsn: 2.889 ± 0.811
4.127ThrPro: 4.127 ± 0.963
3.714ThrGln: 3.714 ± 1.657
3.714ThrArg: 3.714 ± 1.047
3.302ThrSer: 3.302 ± 0.962
4.127ThrThr: 4.127 ± 1.065
5.365ThrVal: 5.365 ± 0.992
0.413ThrTrp: 0.413 ± 0.438
2.476ThrTyr: 2.476 ± 1.596
0.0ThrXaa: 0.0 ± 0.0
Val
3.302ValAla: 3.302 ± 1.318
0.413ValCys: 0.413 ± 0.274
2.476ValAsp: 2.476 ± 0.774
2.064ValGlu: 2.064 ± 1.135
0.825ValPhe: 0.825 ± 0.348
3.714ValGly: 3.714 ± 0.906
0.825ValHis: 0.825 ± 0.463
4.127ValIle: 4.127 ± 1.379
5.365ValLys: 5.365 ± 1.048
7.016ValLeu: 7.016 ± 1.645
0.413ValMet: 0.413 ± 0.438
4.127ValAsn: 4.127 ± 0.851
2.889ValPro: 2.889 ± 1.167
1.651ValGln: 1.651 ± 0.583
0.825ValArg: 0.825 ± 0.468
4.127ValSer: 4.127 ± 0.944
5.365ValThr: 5.365 ± 1.786
3.302ValVal: 3.302 ± 1.496
0.0ValTrp: 0.0 ± 0.0
3.302ValTyr: 3.302 ± 1.506
0.0ValXaa: 0.0 ± 0.0
Trp
0.413TrpAla: 0.413 ± 0.438
0.0TrpCys: 0.0 ± 0.0
0.825TrpAsp: 0.825 ± 0.463
1.238TrpGlu: 1.238 ± 0.908
0.0TrpPhe: 0.0 ± 0.0
0.825TrpGly: 0.825 ± 0.462
0.825TrpHis: 0.825 ± 0.462
0.413TrpIle: 0.413 ± 0.274
0.413TrpLys: 0.413 ± 0.362
1.238TrpLeu: 1.238 ± 0.726
0.413TrpMet: 0.413 ± 0.413
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.413TrpGln: 0.413 ± 0.274
0.825TrpArg: 0.825 ± 0.497
0.825TrpSer: 0.825 ± 0.571
0.0TrpThr: 0.0 ± 0.0
0.413TrpVal: 0.413 ± 0.438
0.413TrpTrp: 0.413 ± 0.362
1.238TrpTyr: 1.238 ± 0.554
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.825TyrAla: 0.825 ± 0.497
0.413TyrCys: 0.413 ± 0.438
2.064TyrAsp: 2.064 ± 0.591
4.953TyrGlu: 4.953 ± 1.553
1.651TyrPhe: 1.651 ± 0.952
4.54TyrGly: 4.54 ± 1.304
0.413TyrHis: 0.413 ± 0.438
4.54TyrIle: 4.54 ± 1.222
4.127TyrLys: 4.127 ± 2.171
4.953TyrLeu: 4.953 ± 1.157
1.238TyrMet: 1.238 ± 0.974
2.064TyrAsn: 2.064 ± 1.295
2.476TyrPro: 2.476 ± 1.089
2.064TyrGln: 2.064 ± 1.001
5.365TyrArg: 5.365 ± 1.775
2.889TyrSer: 2.889 ± 1.502
1.651TyrThr: 1.651 ± 0.782
2.064TyrVal: 2.064 ± 0.933
0.0TyrTrp: 0.0 ± 0.0
2.889TyrTyr: 2.889 ± 1.219
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2424 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski