Amino acid dipeptide frequency for Streptococcus satellite phage Javan30

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
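The counting described above can be illustrated with a short Python sketch. It is not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the choice to count overlapping pairs within each protein only, and the normalisation by the total number of counted dipeptides are all assumptions made for illustration, and the input is a toy example rather than the Javan30 proteome.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is folded into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Hypothetical helper: the normalisation used by the database may differ.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X before counting.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs, never spanning two different proteins.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not the Javan30 proteome.
toy = ["MKKLA", "MKAEL"]
freqs = dipeptide_per_mille(toy)
print(len(freqs))    # 441 dipeptides including Xaa
print(freqs["MK"])   # 2 of 8 dipeptides -> 250.0
print(1000.0 / 400)  # uniform expectation over 400 dipeptides -> 2.5
```

Keeping the pairs inside each protein avoids counting an artificial dipeptide at the junction between two unrelated sequences; whether the database does the same is not stated on this page.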

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
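As a small worked example of the unit conversion and of the comparison with the uniform expectation, the sketch below converts a few values from the table to fractions and labels them as over- or underrepresented. The helper names are hypothetical, and it assumes the colouring threshold is exactly the uniform expectation of 1/400 (2.5‰), which the page implies but does not spell out.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 0.25% expressed in per mille, i.e. 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"   # assumed to correspond to red in the table
    if value < expected:
        return "under"  # assumed to correspond to blue in the table
    return "expected"


# Values taken from the table: LysAla, AlaAla, AlaHis.
for name, value in [("LysAla", 8.978), ("AlaAla", 0.869), ("AlaHis", 0.0)]:
    print(name, per_mille_to_fraction(value), representation(value))
```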

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.869 ± 0.442
AlaCys: 0.579 ± 0.379
AlaAsp: 3.765 ± 0.861
AlaGlu: 4.634 ± 1.173
AlaPhe: 3.186 ± 0.923
AlaGly: 1.738 ± 0.831
AlaHis: 0.0 ± 0.0
AlaIle: 6.082 ± 0.98
AlaLys: 4.054 ± 0.979
AlaLeu: 4.923 ± 0.832
AlaMet: 1.158 ± 0.539
AlaAsn: 4.923 ± 0.762
AlaPro: 2.027 ± 0.855
AlaGln: 3.186 ± 0.863
AlaArg: 2.896 ± 0.93
AlaSer: 3.765 ± 1.236
AlaThr: 3.765 ± 0.952
AlaVal: 3.765 ± 1.39
AlaTrp: 0.579 ± 0.372
AlaTyr: 2.896 ± 0.753
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.29 ± 0.27
CysCys: 0.0 ± 0.0
CysAsp: 0.869 ± 0.571
CysGlu: 0.29 ± 0.271
CysPhe: 0.0 ± 0.0
CysGly: 1.448 ± 0.741
CysHis: 0.579 ± 0.4
CysIle: 0.579 ± 0.372
CysLys: 0.0 ± 0.0
CysLeu: 1.448 ± 0.535
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.29 ± 0.271
CysGln: 0.579 ± 0.542
CysArg: 1.158 ± 0.645
CysSer: 0.29 ± 0.3
CysThr: 0.0 ± 0.0
CysVal: 0.29 ± 0.27
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.579 ± 0.378
AspCys: 1.738 ± 0.867
AspAsp: 3.765 ± 1.203
AspGlu: 2.896 ± 1.063
AspPhe: 1.738 ± 0.77
AspGly: 1.158 ± 0.602
AspHis: 1.448 ± 0.592
AspIle: 5.213 ± 1.277
AspLys: 8.109 ± 0.954
AspLeu: 5.792 ± 1.057
AspMet: 2.027 ± 0.881
AspAsn: 4.054 ± 0.985
AspPro: 0.869 ± 0.437
AspGln: 1.158 ± 0.516
AspArg: 2.317 ± 0.885
AspSer: 4.923 ± 1.523
AspThr: 5.213 ± 1.406
AspVal: 2.027 ± 0.723
AspTrp: 0.29 ± 0.302
AspTyr: 5.792 ± 1.777
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.923 ± 1.252
GluCys: 0.579 ± 0.313
GluAsp: 4.344 ± 1.149
GluGlu: 3.765 ± 0.95
GluPhe: 2.896 ± 0.812
GluGly: 2.606 ± 0.591
GluHis: 2.027 ± 0.778
GluIle: 8.109 ± 1.346
GluLys: 7.24 ± 1.003
GluLeu: 9.847 ± 1.89
GluMet: 1.448 ± 1.054
GluAsn: 2.606 ± 0.701
GluPro: 1.158 ± 0.803
GluGln: 4.634 ± 1.231
GluArg: 4.054 ± 1.138
GluSer: 2.317 ± 0.692
GluThr: 5.792 ± 1.219
GluVal: 6.082 ± 1.314
GluTrp: 0.579 ± 0.479
GluTyr: 2.606 ± 1.063
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.448 ± 0.649
PheCys: 0.0 ± 0.0
PheAsp: 2.606 ± 1.01
PheGlu: 3.475 ± 0.855
PhePhe: 1.448 ± 0.624
PheGly: 2.896 ± 1.074
PheHis: 0.869 ± 0.432
PheIle: 2.896 ± 1.084
PheLys: 4.634 ± 0.892
PheLeu: 4.054 ± 1.001
PheMet: 0.29 ± 0.302
PheAsn: 5.213 ± 0.66
PhePro: 0.869 ± 0.4
PheGln: 0.579 ± 0.405
PheArg: 2.027 ± 0.635
PheSer: 2.606 ± 0.715
PheThr: 2.027 ± 0.687
PheVal: 1.448 ± 0.581
PheTrp: 0.29 ± 0.332
PheTyr: 1.158 ± 0.481
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.738 ± 0.798
GlyCys: 0.579 ± 0.346
GlyAsp: 2.606 ± 0.708
GlyGlu: 4.634 ± 1.301
GlyPhe: 2.896 ± 0.738
GlyGly: 3.765 ± 1.492
GlyHis: 0.869 ± 0.46
GlyIle: 3.765 ± 0.965
GlyLys: 4.054 ± 1.169
GlyLeu: 5.792 ± 1.548
GlyMet: 0.869 ± 0.403
GlyAsn: 0.869 ± 0.447
GlyPro: 0.0 ± 0.0
GlyGln: 1.448 ± 0.638
GlyArg: 2.317 ± 0.634
GlySer: 1.738 ± 0.824
GlyThr: 3.475 ± 1.187
GlyVal: 1.448 ± 0.645
GlyTrp: 0.579 ± 0.54
GlyTyr: 4.054 ± 1.387
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.027 ± 0.709
HisCys: 0.29 ± 0.27
HisAsp: 0.29 ± 0.314
HisGlu: 0.579 ± 0.458
HisPhe: 0.869 ± 0.576
HisGly: 1.448 ± 0.581
HisHis: 0.29 ± 0.3
HisIle: 1.738 ± 0.634
HisLys: 1.158 ± 0.598
HisLeu: 0.869 ± 0.452
HisMet: 0.0 ± 0.0
HisAsn: 0.869 ± 0.673
HisPro: 1.158 ± 0.787
HisGln: 1.158 ± 0.731
HisArg: 0.29 ± 0.304
HisSer: 0.869 ± 0.641
HisThr: 1.158 ± 0.543
HisVal: 0.869 ± 0.41
HisTrp: 0.29 ± 0.271
HisTyr: 3.186 ± 1.162
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.634 ± 1.292
IleCys: 0.579 ± 0.4
IleAsp: 4.923 ± 1.091
IleGlu: 6.371 ± 1.549
IlePhe: 2.606 ± 0.921
IleGly: 2.027 ± 0.498
IleHis: 1.158 ± 0.695
IleIle: 4.054 ± 0.804
IleLys: 7.53 ± 1.262
IleLeu: 4.344 ± 1.091
IleMet: 1.158 ± 0.569
IleAsn: 2.606 ± 1.329
IlePro: 2.606 ± 1.006
IleGln: 1.738 ± 0.735
IleArg: 2.606 ± 0.856
IleSer: 3.475 ± 1.192
IleThr: 3.475 ± 0.813
IleVal: 3.186 ± 0.715
IleTrp: 0.0 ± 0.0
IleTyr: 4.634 ± 1.083
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.978 ± 1.356
LysCys: 0.29 ± 0.314
LysAsp: 4.634 ± 1.139
LysGlu: 8.688 ± 1.28
LysPhe: 2.606 ± 1.081
LysGly: 4.054 ± 1.189
LysHis: 2.027 ± 0.726
LysIle: 3.475 ± 0.789
LysLys: 7.819 ± 1.389
LysLeu: 6.661 ± 1.671
LysMet: 2.027 ± 0.899
LysAsn: 4.344 ± 1.117
LysPro: 4.344 ± 1.613
LysGln: 5.792 ± 1.202
LysArg: 7.24 ± 0.964
LysSer: 7.24 ± 2.131
LysThr: 6.661 ± 1.205
LysVal: 4.634 ± 1.207
LysTrp: 0.869 ± 0.419
LysTyr: 2.896 ± 1.152
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.53 ± 1.205
LeuCys: 0.869 ± 0.444
LeuAsp: 6.95 ± 1.31
LeuGlu: 8.398 ± 2.338
LeuPhe: 3.186 ± 0.924
LeuGly: 5.502 ± 1.206
LeuHis: 1.158 ± 0.604
LeuIle: 6.082 ± 1.323
LeuLys: 6.95 ± 1.307
LeuLeu: 6.95 ± 1.491
LeuMet: 2.027 ± 0.859
LeuAsn: 4.923 ± 1.365
LeuPro: 3.765 ± 1.249
LeuGln: 2.896 ± 0.875
LeuArg: 2.606 ± 0.662
LeuSer: 5.213 ± 1.03
LeuThr: 5.502 ± 0.819
LeuVal: 3.186 ± 1.26
LeuTrp: 0.869 ± 0.373
LeuTyr: 2.606 ± 1.01
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.738 ± 0.801
MetCys: 0.0 ± 0.0
MetAsp: 1.738 ± 0.702
MetGlu: 0.869 ± 0.516
MetPhe: 0.29 ± 0.249
MetGly: 0.579 ± 0.413
MetHis: 0.579 ± 0.368
MetIle: 0.869 ± 0.525
MetLys: 2.027 ± 0.466
MetLeu: 3.475 ± 1.349
MetMet: 0.29 ± 0.253
MetAsn: 2.317 ± 0.695
MetPro: 0.0 ± 0.0
MetGln: 0.579 ± 0.441
MetArg: 1.448 ± 0.61
MetSer: 0.869 ± 0.668
MetThr: 2.896 ± 0.937
MetVal: 0.869 ± 0.498
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.765 ± 1.042
AsnCys: 0.29 ± 0.27
AsnAsp: 3.186 ± 0.757
AsnGlu: 2.606 ± 0.846
AsnPhe: 1.738 ± 0.644
AsnGly: 3.765 ± 1.192
AsnHis: 1.738 ± 0.72
AsnIle: 3.186 ± 0.892
AsnLys: 5.213 ± 1.095
AsnLeu: 3.186 ± 1.114
AsnMet: 2.027 ± 1.183
AsnAsn: 2.317 ± 0.695
AsnPro: 2.027 ± 0.61
AsnGln: 3.475 ± 1.135
AsnArg: 2.317 ± 0.662
AsnSer: 4.054 ± 1.03
AsnThr: 3.475 ± 1.193
AsnVal: 3.186 ± 1.295
AsnTrp: 0.579 ± 0.424
AsnTyr: 2.027 ± 0.813
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.448 ± 0.785
ProCys: 0.29 ± 0.3
ProAsp: 1.158 ± 0.579
ProGlu: 3.475 ± 1.481
ProPhe: 1.448 ± 0.696
ProGly: 0.869 ± 0.465
ProHis: 0.869 ± 0.406
ProIle: 1.738 ± 0.745
ProLys: 3.186 ± 0.895
ProLeu: 2.317 ± 0.656
ProMet: 1.738 ± 0.633
ProAsn: 2.896 ± 1.051
ProPro: 0.869 ± 0.491
ProGln: 0.0 ± 0.0
ProArg: 2.317 ± 0.744
ProSer: 2.027 ± 0.783
ProThr: 0.869 ± 0.448
ProVal: 1.448 ± 0.764
ProTrp: 0.29 ± 0.27
ProTyr: 1.738 ± 0.644
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.054 ± 0.945
GlnCys: 0.29 ± 0.298
GlnAsp: 2.027 ± 0.787
GlnGlu: 5.792 ± 1.286
GlnPhe: 0.579 ± 0.405
GlnGly: 2.896 ± 0.619
GlnHis: 0.0 ± 0.0
GlnIle: 1.738 ± 0.547
GlnLys: 3.765 ± 1.087
GlnLeu: 5.213 ± 0.881
GlnMet: 0.29 ± 0.291
GlnAsn: 2.027 ± 0.975
GlnPro: 2.896 ± 1.019
GlnGln: 1.738 ± 0.621
GlnArg: 1.738 ± 0.698
GlnSer: 2.317 ± 0.97
GlnThr: 2.896 ± 0.74
GlnVal: 2.896 ± 0.714
GlnTrp: 0.579 ± 0.604
GlnTyr: 2.027 ± 0.885
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.606 ± 1.037
ArgCys: 0.579 ± 0.396
ArgAsp: 1.738 ± 0.622
ArgGlu: 2.896 ± 0.826
ArgPhe: 2.606 ± 0.825
ArgGly: 2.896 ± 0.896
ArgHis: 0.579 ± 0.346
ArgIle: 2.896 ± 0.965
ArgLys: 5.792 ± 1.236
ArgLeu: 3.765 ± 0.679
ArgMet: 0.869 ± 0.482
ArgAsn: 3.475 ± 1.07
ArgPro: 1.158 ± 0.581
ArgGln: 2.896 ± 0.871
ArgArg: 2.317 ± 0.744
ArgSer: 2.317 ± 1.003
ArgThr: 2.896 ± 1.026
ArgVal: 4.054 ± 0.952
ArgTrp: 0.579 ± 0.434
ArgTyr: 2.317 ± 0.762
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.475 ± 1.246
SerCys: 0.29 ± 0.314
SerAsp: 6.082 ± 0.998
SerGlu: 4.923 ± 1.537
SerPhe: 3.475 ± 0.755
SerGly: 1.448 ± 0.606
SerHis: 1.158 ± 0.607
SerIle: 3.186 ± 0.896
SerLys: 7.53 ± 1.308
SerLeu: 3.475 ± 1.173
SerMet: 1.158 ± 0.527
SerAsn: 1.738 ± 0.794
SerPro: 0.869 ± 0.56
SerGln: 4.923 ± 1.26
SerArg: 2.317 ± 0.843
SerSer: 4.344 ± 2.124
SerThr: 3.475 ± 1.132
SerVal: 2.606 ± 0.752
SerTrp: 0.29 ± 0.304
SerTyr: 4.634 ± 1.141
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.344 ± 1.023
ThrCys: 0.29 ± 0.271
ThrAsp: 4.344 ± 1.031
ThrGlu: 3.765 ± 0.932
ThrPhe: 2.896 ± 1.445
ThrGly: 4.634 ± 0.839
ThrHis: 1.158 ± 0.65
ThrIle: 3.186 ± 1.123
ThrLys: 4.923 ± 1.427
ThrLeu: 5.213 ± 1.635
ThrMet: 1.158 ± 0.701
ThrAsn: 3.765 ± 0.864
ThrPro: 3.186 ± 1.142
ThrGln: 4.634 ± 0.849
ThrArg: 3.186 ± 0.735
ThrSer: 4.634 ± 1.265
ThrThr: 4.344 ± 1.876
ThrVal: 4.054 ± 1.246
ThrTrp: 0.579 ± 0.433
ThrTyr: 2.896 ± 0.877
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.896 ± 0.991
ValCys: 0.29 ± 0.32
ValAsp: 2.027 ± 0.682
ValGlu: 4.054 ± 1.08
ValPhe: 3.475 ± 0.911
ValGly: 2.027 ± 0.542
ValHis: 0.869 ± 0.602
ValIle: 2.896 ± 0.926
ValLys: 5.502 ± 1.004
ValLeu: 4.344 ± 1.1
ValMet: 1.158 ± 0.543
ValAsn: 1.738 ± 0.82
ValPro: 1.158 ± 0.452
ValGln: 1.738 ± 0.699
ValArg: 1.448 ± 0.591
ValSer: 4.344 ± 1.322
ValThr: 6.371 ± 1.316
ValVal: 3.186 ± 0.669
ValTrp: 0.579 ± 0.54
ValTyr: 1.158 ± 0.487
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.29 ± 0.332
TrpCys: 0.0 ± 0.0
TrpAsp: 1.158 ± 0.618
TrpGlu: 0.869 ± 0.443
TrpPhe: 0.29 ± 0.27
TrpGly: 0.29 ± 0.3
TrpHis: 0.0 ± 0.0
TrpIle: 0.29 ± 0.302
TrpLys: 1.158 ± 0.522
TrpLeu: 1.158 ± 0.535
TrpMet: 0.0 ± 0.0
TrpAsn: 0.29 ± 0.332
TrpPro: 0.0 ± 0.0
TrpGln: 0.29 ± 0.271
TrpArg: 0.0 ± 0.0
TrpSer: 1.158 ± 0.39
TrpThr: 0.29 ± 0.27
TrpVal: 0.869 ± 0.544
TrpTrp: 0.579 ± 0.386
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.158 ± 0.625
TyrCys: 0.29 ± 0.291
TyrAsp: 2.896 ± 0.951
TyrGlu: 4.634 ± 1.134
TyrPhe: 2.606 ± 0.765
TyrGly: 1.448 ± 0.674
TyrHis: 2.027 ± 0.635
TyrIle: 1.738 ± 0.878
TyrLys: 4.923 ± 1.453
TyrLeu: 4.344 ± 0.69
TyrMet: 1.448 ± 0.686
TyrAsn: 3.186 ± 0.791
TyrPro: 1.738 ± 0.822
TyrGln: 2.317 ± 0.891
TyrArg: 4.634 ± 1.249
TyrSer: 2.896 ± 0.769
TyrThr: 2.606 ± 0.962
TyrVal: 1.158 ± 0.54
TyrTrp: 0.579 ± 0.542
TyrTyr: 3.475 ± 1.155
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (3454 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
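A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration under stated assumptions, not the database's own procedure: it assumes that whole proteins are resampled with replacement, that the reported ± value is the standard deviation across replicates, and that 100 replicates are drawn. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter code) over a list of proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.pstdev(estimates)


toy = ["MKKLA", "MKAEL", "AKKAA", "MLLKA"]  # toy sequences, not the real proteome
print(bootstrap_error(toy, "KK"))
print(bootstrap_error(toy, "WW"))  # never observed, so every replicate gives 0.0
```

Resampling proteins rather than individual residues keeps each sequence intact, so a dipeptide that never occurs in any protein gets an error of exactly zero, in line with the "0.0 ± 0.0" entries in the table.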

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski