Amino acid dipepetide frequency for Streptococcus satellite phage Javan286

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.753AlaAla: 0.753 ± 0.752
1.505AlaCys: 1.505 ± 0.863
4.14AlaAsp: 4.14 ± 1.431
3.011AlaGlu: 3.011 ± 1.043
3.387AlaPhe: 3.387 ± 0.858
2.258AlaGly: 2.258 ± 1.029
0.753AlaHis: 0.753 ± 0.435
4.893AlaIle: 4.893 ± 1.46
3.764AlaLys: 3.764 ± 1.356
3.764AlaLeu: 3.764 ± 1.27
1.129AlaMet: 1.129 ± 0.633
2.258AlaAsn: 2.258 ± 1.012
1.882AlaPro: 1.882 ± 0.533
0.753AlaGln: 0.753 ± 0.507
2.258AlaArg: 2.258 ± 0.963
2.258AlaSer: 2.258 ± 0.919
3.011AlaThr: 3.011 ± 1.026
4.14AlaVal: 4.14 ± 1.692
0.376AlaTrp: 0.376 ± 0.377
3.387AlaTyr: 3.387 ± 0.722
0.0AlaXaa: 0.0 ± 0.0
Cys
0.753CysAla: 0.753 ± 0.531
0.0CysCys: 0.0 ± 0.0
0.376CysAsp: 0.376 ± 0.377
0.376CysGlu: 0.376 ± 0.376
0.376CysPhe: 0.376 ± 0.305
0.376CysGly: 0.376 ± 0.378
0.753CysHis: 0.753 ± 0.756
1.129CysIle: 1.129 ± 0.791
0.753CysLys: 0.753 ± 0.577
2.258CysLeu: 2.258 ± 1.011
0.376CysMet: 0.376 ± 0.305
0.376CysAsn: 0.376 ± 0.378
0.753CysPro: 0.753 ± 0.522
0.753CysGln: 0.753 ± 0.636
0.753CysArg: 0.753 ± 0.526
0.753CysSer: 0.753 ± 0.498
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.376CysTyr: 0.376 ± 0.391
0.0CysXaa: 0.0 ± 0.0
Asp
0.753AspAla: 0.753 ± 0.587
0.753AspCys: 0.753 ± 0.469
1.505AspAsp: 1.505 ± 0.786
3.011AspGlu: 3.011 ± 1.326
3.011AspPhe: 3.011 ± 1.263
0.376AspGly: 0.376 ± 0.456
0.753AspHis: 0.753 ± 0.752
3.011AspIle: 3.011 ± 0.88
4.893AspLys: 4.893 ± 1.835
3.764AspLeu: 3.764 ± 1.477
0.376AspMet: 0.376 ± 0.402
4.516AspAsn: 4.516 ± 1.21
0.753AspPro: 0.753 ± 0.46
0.376AspGln: 0.376 ± 0.391
2.635AspArg: 2.635 ± 0.826
1.129AspSer: 1.129 ± 0.735
4.516AspThr: 4.516 ± 1.159
1.129AspVal: 1.129 ± 0.624
0.0AspTrp: 0.0 ± 0.0
1.882AspTyr: 1.882 ± 0.623
0.0AspXaa: 0.0 ± 0.0
Glu
3.011GluAla: 3.011 ± 1.146
1.129GluCys: 1.129 ± 0.628
1.882GluAsp: 1.882 ± 1.086
7.151GluGlu: 7.151 ± 2.744
2.635GluPhe: 2.635 ± 0.857
2.635GluGly: 2.635 ± 1.091
1.129GluHis: 1.129 ± 0.694
5.645GluIle: 5.645 ± 2.095
8.28GluLys: 8.28 ± 2.34
12.796GluLeu: 12.796 ± 3.514
1.505GluMet: 1.505 ± 0.961
4.14GluAsn: 4.14 ± 0.988
1.129GluPro: 1.129 ± 0.553
3.387GluGln: 3.387 ± 1.066
4.893GluArg: 4.893 ± 1.254
1.882GluSer: 1.882 ± 0.82
3.011GluThr: 3.011 ± 1.333
3.011GluVal: 3.011 ± 1.072
0.376GluTrp: 0.376 ± 0.305
1.505GluTyr: 1.505 ± 0.634
0.0GluXaa: 0.0 ± 0.0
Phe
2.635PheAla: 2.635 ± 1.062
0.753PheCys: 0.753 ± 0.681
1.129PheAsp: 1.129 ± 0.665
3.387PheGlu: 3.387 ± 1.19
1.505PhePhe: 1.505 ± 0.757
4.14PheGly: 4.14 ± 1.695
0.376PheHis: 0.376 ± 0.377
7.527PheIle: 7.527 ± 3.046
4.516PheLys: 4.516 ± 1.083
3.011PheLeu: 3.011 ± 1.072
0.753PheMet: 0.753 ± 0.509
2.635PheAsn: 2.635 ± 0.866
1.505PhePro: 1.505 ± 0.887
1.129PheGln: 1.129 ± 0.586
1.882PheArg: 1.882 ± 0.948
4.14PheSer: 4.14 ± 1.615
0.376PheThr: 0.376 ± 0.377
2.258PheVal: 2.258 ± 0.873
0.376PheTrp: 0.376 ± 0.341
3.387PheTyr: 3.387 ± 1.286
0.0PheXaa: 0.0 ± 0.0
Gly
0.753GlyAla: 0.753 ± 0.527
0.0GlyCys: 0.0 ± 0.0
1.882GlyAsp: 1.882 ± 0.618
1.882GlyGlu: 1.882 ± 1.05
2.258GlyPhe: 2.258 ± 0.828
3.011GlyGly: 3.011 ± 0.886
1.882GlyHis: 1.882 ± 0.49
4.893GlyIle: 4.893 ± 1.207
5.645GlyLys: 5.645 ± 1.353
6.022GlyLeu: 6.022 ± 1.768
1.505GlyMet: 1.505 ± 0.619
0.753GlyAsn: 0.753 ± 0.571
0.0GlyPro: 0.0 ± 0.0
1.505GlyGln: 1.505 ± 0.896
1.129GlyArg: 1.129 ± 0.887
3.011GlySer: 3.011 ± 0.724
1.505GlyThr: 1.505 ± 0.709
2.258GlyVal: 2.258 ± 0.849
0.376GlyTrp: 0.376 ± 0.336
5.269GlyTyr: 5.269 ± 1.341
0.0GlyXaa: 0.0 ± 0.0
His
2.635HisAla: 2.635 ± 0.975
0.376HisCys: 0.376 ± 0.456
1.129HisAsp: 1.129 ± 0.554
1.129HisGlu: 1.129 ± 0.634
1.505HisPhe: 1.505 ± 0.887
0.753HisGly: 0.753 ± 0.418
1.129HisHis: 1.129 ± 0.681
2.635HisIle: 2.635 ± 0.795
2.635HisLys: 2.635 ± 1.021
4.14HisLeu: 4.14 ± 1.575
0.376HisMet: 0.376 ± 0.376
2.635HisAsn: 2.635 ± 1.184
1.129HisPro: 1.129 ± 0.527
1.505HisGln: 1.505 ± 0.622
1.129HisArg: 1.129 ± 1.133
1.882HisSer: 1.882 ± 0.984
0.753HisThr: 0.753 ± 0.469
0.753HisVal: 0.753 ± 0.469
0.376HisTrp: 0.376 ± 0.391
1.129HisTyr: 1.129 ± 0.515
0.0HisXaa: 0.0 ± 0.0
Ile
3.387IleAla: 3.387 ± 0.995
1.882IleCys: 1.882 ± 0.957
5.269IleAsp: 5.269 ± 1.291
5.645IleGlu: 5.645 ± 1.779
3.764IlePhe: 3.764 ± 2.314
2.635IleGly: 2.635 ± 0.893
1.505IleHis: 1.505 ± 0.839
3.764IleIle: 3.764 ± 0.955
6.022IleLys: 6.022 ± 1.262
9.409IleLeu: 9.409 ± 1.91
1.129IleMet: 1.129 ± 0.896
4.893IleAsn: 4.893 ± 0.813
3.387IlePro: 3.387 ± 1.214
1.882IleGln: 1.882 ± 0.788
4.516IleArg: 4.516 ± 0.884
6.022IleSer: 6.022 ± 2.015
5.269IleThr: 5.269 ± 1.442
3.011IleVal: 3.011 ± 0.749
0.753IleTrp: 0.753 ± 0.444
3.764IleTyr: 3.764 ± 1.184
0.0IleXaa: 0.0 ± 0.0
Lys
5.645LysAla: 5.645 ± 1.685
0.0LysCys: 0.0 ± 0.0
3.011LysAsp: 3.011 ± 0.659
10.162LysGlu: 10.162 ± 1.777
1.129LysPhe: 1.129 ± 0.448
4.893LysGly: 4.893 ± 1.456
4.14LysHis: 4.14 ± 1.022
5.269LysIle: 5.269 ± 1.685
7.904LysLys: 7.904 ± 1.35
7.904LysLeu: 7.904 ± 1.818
4.14LysMet: 4.14 ± 1.402
5.269LysAsn: 5.269 ± 1.387
3.011LysPro: 3.011 ± 0.884
5.645LysGln: 5.645 ± 1.13
4.893LysArg: 4.893 ± 1.322
5.269LysSer: 5.269 ± 2.115
7.151LysThr: 7.151 ± 1.647
3.387LysVal: 3.387 ± 1.398
1.129LysTrp: 1.129 ± 0.635
2.635LysTyr: 2.635 ± 0.739
0.0LysXaa: 0.0 ± 0.0
Leu
6.775LeuAla: 6.775 ± 1.208
0.753LeuCys: 0.753 ± 0.526
3.011LeuAsp: 3.011 ± 0.715
7.904LeuGlu: 7.904 ± 2.048
6.022LeuPhe: 6.022 ± 2.073
6.775LeuGly: 6.775 ± 2.123
1.505LeuHis: 1.505 ± 0.668
7.904LeuIle: 7.904 ± 2.267
10.162LeuLys: 10.162 ± 2.127
14.302LeuLeu: 14.302 ± 4.288
3.387LeuMet: 3.387 ± 0.783
6.398LeuAsn: 6.398 ± 1.634
4.516LeuPro: 4.516 ± 1.361
5.269LeuGln: 5.269 ± 1.197
4.14LeuArg: 4.14 ± 1.157
6.775LeuSer: 6.775 ± 1.525
6.398LeuThr: 6.398 ± 1.534
4.516LeuVal: 4.516 ± 1.546
1.129LeuTrp: 1.129 ± 0.443
4.893LeuTyr: 4.893 ± 1.214
0.0LeuXaa: 0.0 ± 0.0
Met
2.258MetAla: 2.258 ± 1.078
0.0MetCys: 0.0 ± 0.0
1.129MetAsp: 1.129 ± 0.601
1.882MetGlu: 1.882 ± 0.942
0.0MetPhe: 0.0 ± 0.0
0.376MetGly: 0.376 ± 0.341
0.376MetHis: 0.376 ± 0.341
1.882MetIle: 1.882 ± 0.777
1.129MetLys: 1.129 ± 0.59
2.635MetLeu: 2.635 ± 1.015
2.258MetMet: 2.258 ± 0.861
2.635MetAsn: 2.635 ± 1.067
0.0MetPro: 0.0 ± 0.0
0.753MetGln: 0.753 ± 0.567
1.505MetArg: 1.505 ± 0.82
1.505MetSer: 1.505 ± 0.924
1.505MetThr: 1.505 ± 0.835
2.258MetVal: 2.258 ± 0.777
0.376MetTrp: 0.376 ± 0.341
0.753MetTyr: 0.753 ± 0.47
0.0MetXaa: 0.0 ± 0.0
Asn
3.387AsnAla: 3.387 ± 1.081
1.129AsnCys: 1.129 ± 0.648
1.882AsnAsp: 1.882 ± 1.222
3.764AsnGlu: 3.764 ± 1.331
3.011AsnPhe: 3.011 ± 0.702
4.893AsnGly: 4.893 ± 1.655
2.258AsnHis: 2.258 ± 0.831
4.14AsnIle: 4.14 ± 1.062
6.775AsnLys: 6.775 ± 1.387
1.505AsnLeu: 1.505 ± 0.687
1.129AsnMet: 1.129 ± 0.667
3.011AsnAsn: 3.011 ± 0.606
3.387AsnPro: 3.387 ± 0.896
2.258AsnGln: 2.258 ± 0.703
3.011AsnArg: 3.011 ± 1.054
4.14AsnSer: 4.14 ± 1.169
2.635AsnThr: 2.635 ± 0.94
2.258AsnVal: 2.258 ± 1.014
1.129AsnTrp: 1.129 ± 0.628
3.764AsnTyr: 3.764 ± 1.037
0.0AsnXaa: 0.0 ± 0.0
Pro
1.505ProAla: 1.505 ± 0.573
0.376ProCys: 0.376 ± 0.378
0.753ProAsp: 0.753 ± 0.51
2.258ProGlu: 2.258 ± 0.887
1.129ProPhe: 1.129 ± 0.742
1.129ProGly: 1.129 ± 0.522
0.0ProHis: 0.0 ± 0.0
1.882ProIle: 1.882 ± 0.59
4.893ProLys: 4.893 ± 1.368
1.129ProLeu: 1.129 ± 0.75
1.129ProMet: 1.129 ± 0.795
2.258ProAsn: 2.258 ± 0.735
0.0ProPro: 0.0 ± 0.0
1.129ProGln: 1.129 ± 0.448
1.505ProArg: 1.505 ± 0.809
1.882ProSer: 1.882 ± 0.624
3.387ProThr: 3.387 ± 0.981
1.882ProVal: 1.882 ± 0.982
0.0ProTrp: 0.0 ± 0.0
1.882ProTyr: 1.882 ± 0.844
0.0ProXaa: 0.0 ± 0.0
Gln
4.14GlnAla: 4.14 ± 1.217
0.753GlnCys: 0.753 ± 0.473
0.753GlnAsp: 0.753 ± 0.606
2.258GlnGlu: 2.258 ± 1.019
1.129GlnPhe: 1.129 ± 0.539
0.753GlnGly: 0.753 ± 0.623
1.129GlnHis: 1.129 ± 0.601
3.387GlnIle: 3.387 ± 1.009
4.14GlnLys: 4.14 ± 1.041
6.398GlnLeu: 6.398 ± 1.606
1.505GlnMet: 1.505 ± 0.618
1.505GlnAsn: 1.505 ± 0.905
1.129GlnPro: 1.129 ± 1.039
2.258GlnGln: 2.258 ± 1.041
1.505GlnArg: 1.505 ± 0.59
1.129GlnSer: 1.129 ± 0.648
2.635GlnThr: 2.635 ± 1.054
3.387GlnVal: 3.387 ± 1.129
0.376GlnTrp: 0.376 ± 0.378
1.505GlnTyr: 1.505 ± 0.707
0.0GlnXaa: 0.0 ± 0.0
Arg
3.011ArgAla: 3.011 ± 1.5
0.0ArgCys: 0.0 ± 0.0
0.753ArgAsp: 0.753 ± 0.603
4.14ArgGlu: 4.14 ± 1.316
2.258ArgPhe: 2.258 ± 0.795
2.258ArgGly: 2.258 ± 1.054
3.764ArgHis: 3.764 ± 1.186
2.258ArgIle: 2.258 ± 0.886
4.516ArgLys: 4.516 ± 1.142
5.269ArgLeu: 5.269 ± 1.477
1.505ArgMet: 1.505 ± 0.646
2.635ArgAsn: 2.635 ± 0.941
0.753ArgPro: 0.753 ± 0.447
3.764ArgGln: 3.764 ± 1.126
3.011ArgArg: 3.011 ± 1.221
3.011ArgSer: 3.011 ± 1.177
1.129ArgThr: 1.129 ± 0.698
2.258ArgVal: 2.258 ± 0.872
0.753ArgTrp: 0.753 ± 0.672
1.129ArgTyr: 1.129 ± 0.589
0.0ArgXaa: 0.0 ± 0.0
Ser
1.882SerAla: 1.882 ± 0.771
0.0SerCys: 0.0 ± 0.0
2.635SerAsp: 2.635 ± 1.551
4.14SerGlu: 4.14 ± 1.24
3.764SerPhe: 3.764 ± 1.676
1.882SerGly: 1.882 ± 0.847
2.258SerHis: 2.258 ± 1.163
5.269SerIle: 5.269 ± 1.04
4.14SerLys: 4.14 ± 1.483
7.527SerLeu: 7.527 ± 2.462
1.505SerMet: 1.505 ± 0.718
3.011SerAsn: 3.011 ± 0.862
2.258SerPro: 2.258 ± 1.079
3.011SerGln: 3.011 ± 1.195
2.635SerArg: 2.635 ± 0.971
2.635SerSer: 2.635 ± 1.188
3.387SerThr: 3.387 ± 0.829
4.516SerVal: 4.516 ± 1.182
0.753SerTrp: 0.753 ± 0.459
2.635SerTyr: 2.635 ± 1.005
0.0SerXaa: 0.0 ± 0.0
Thr
2.258ThrAla: 2.258 ± 1.115
0.0ThrCys: 0.0 ± 0.0
3.011ThrAsp: 3.011 ± 1.114
3.011ThrGlu: 3.011 ± 0.882
3.011ThrPhe: 3.011 ± 1.385
3.387ThrGly: 3.387 ± 1.193
1.505ThrHis: 1.505 ± 0.783
3.764ThrIle: 3.764 ± 1.128
4.14ThrLys: 4.14 ± 1.209
8.28ThrLeu: 8.28 ± 1.374
0.753ThrMet: 0.753 ± 0.481
3.011ThrAsn: 3.011 ± 1.083
1.505ThrPro: 1.505 ± 0.811
1.505ThrGln: 1.505 ± 0.667
3.011ThrArg: 3.011 ± 1.059
2.635ThrSer: 2.635 ± 1.064
1.882ThrThr: 1.882 ± 0.974
4.14ThrVal: 4.14 ± 1.187
0.376ThrTrp: 0.376 ± 0.484
2.635ThrTyr: 2.635 ± 1.048
0.0ThrXaa: 0.0 ± 0.0
Val
1.882ValAla: 1.882 ± 0.617
0.753ValCys: 0.753 ± 0.526
1.505ValAsp: 1.505 ± 0.715
3.011ValGlu: 3.011 ± 1.043
4.516ValPhe: 4.516 ± 1.069
1.882ValGly: 1.882 ± 1.117
3.011ValHis: 3.011 ± 1.199
4.14ValIle: 4.14 ± 0.821
3.764ValLys: 3.764 ± 1.343
3.764ValLeu: 3.764 ± 1.276
0.376ValMet: 0.376 ± 0.489
3.387ValAsn: 3.387 ± 1.443
1.505ValPro: 1.505 ± 0.524
2.258ValGln: 2.258 ± 0.817
1.505ValArg: 1.505 ± 0.711
4.516ValSer: 4.516 ± 1.579
3.011ValThr: 3.011 ± 1.434
1.505ValVal: 1.505 ± 0.711
0.376ValTrp: 0.376 ± 0.305
1.505ValTyr: 1.505 ± 0.831
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.376TrpAsp: 0.376 ± 0.377
1.129TrpGlu: 1.129 ± 0.499
0.753TrpPhe: 0.753 ± 0.423
0.0TrpGly: 0.0 ± 0.0
0.376TrpHis: 0.376 ± 0.416
1.505TrpIle: 1.505 ± 0.742
0.0TrpLys: 0.0 ± 0.0
2.635TrpLeu: 2.635 ± 0.691
0.0TrpMet: 0.0 ± 0.0
0.753TrpAsn: 0.753 ± 0.647
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.753TrpSer: 0.753 ± 0.415
0.376TrpThr: 0.376 ± 0.336
0.376TrpVal: 0.376 ± 0.489
0.0TrpTrp: 0.0 ± 0.0
0.376TrpTyr: 0.376 ± 0.341
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.258TyrAla: 2.258 ± 0.872
1.129TyrCys: 1.129 ± 0.672
3.011TyrAsp: 3.011 ± 1.289
1.882TyrGlu: 1.882 ± 0.828
2.258TyrPhe: 2.258 ± 0.831
1.129TyrGly: 1.129 ± 0.443
1.129TyrHis: 1.129 ± 0.696
3.011TyrIle: 3.011 ± 0.632
4.516TyrLys: 4.516 ± 1.345
6.022TyrLeu: 6.022 ± 1.801
0.0TyrMet: 0.0 ± 0.0
3.387TyrAsn: 3.387 ± 1.058
1.505TyrPro: 1.505 ± 0.675
2.635TyrGln: 2.635 ± 1.223
2.635TyrArg: 2.635 ± 0.979
4.516TyrSer: 4.516 ± 1.273
1.882TyrThr: 1.882 ± 0.75
1.129TyrVal: 1.129 ± 0.783
0.376TyrTrp: 0.376 ± 0.336
2.635TyrTyr: 2.635 ± 1.008
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (2658 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski