Amino acid dipeptide frequency for Streptococcus satellite phage Javan543

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent amino acids occurs.
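A minimal sketch of how such dipeptide frequencies could be computed is given below. It assumes one-letter sequences, overlapping pairs, and that pairs never span two proteins; the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the database's own counting procedure may differ in these details.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs are counted within each
        # protein only (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the Javan543 proteome.
toy = ["MKKLA", "AKKL"]
freqs = dipeptide_per_mille(toy)
```

The table on this page uses three-letter codes (e.g. LysLys), so a real implementation would map one-letter codes to three-letter codes before reporting.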

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all 20 amino acids were equally common, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
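The arithmetic behind the expected frequency, and the over/under marking, can be sketched as follows. The function names are hypothetical, and using the uniform expectation as the colouring threshold is an assumption based on the paragraph above, not a documented rule of the database.

```python
def expected_uniform_per_mille(n_residues=20):
    """Expected per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(observed_per_mille, expected=None):
    """Label a dipeptide as 'over', 'under' or 'expected' relative to the uniform model."""
    if expected is None:
        expected = expected_uniform_per_mille()
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


baseline_20 = expected_uniform_per_mille()    # 400 dipeptides
baseline_21 = expected_uniform_per_mille(21)  # 441 dipeptides, including Xaa
leu_lys = classify(12.799)  # LeuLys value from the table
cys_asp = classify(0.337)   # CysAsp value from the table
```

With 20 amino acids the baseline is 2.5‰ (0.25%); with Xaa included it drops to roughly 2.27‰.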

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
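As a small illustration of the unit conversion, the helpers below (hypothetical names) turn a per-mille value from the table into a fraction or a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value / 10.0


leu_lys_fraction = per_mille_to_fraction(12.799)  # LeuLys value from the table
leu_lys_percent = per_mille_to_percent(12.799)
```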

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.379AlaAla: 4.379 ± 1.28
1.01AlaCys: 1.01 ± 0.723
4.042AlaAsp: 4.042 ± 1.46
5.726AlaGlu: 5.726 ± 1.683
2.358AlaPhe: 2.358 ± 0.943
4.715AlaGly: 4.715 ± 1.072
0.674AlaHis: 0.674 ± 0.39
6.399AlaIle: 6.399 ± 1.166
5.726AlaLys: 5.726 ± 1.385
8.084AlaLeu: 8.084 ± 1.448
1.01AlaMet: 1.01 ± 0.601
3.368AlaAsn: 3.368 ± 1.26
1.347AlaPro: 1.347 ± 0.7
3.705AlaGln: 3.705 ± 1.142
2.021AlaArg: 2.021 ± 0.776
3.368AlaSer: 3.368 ± 1.032
2.358AlaThr: 2.358 ± 0.887
4.379AlaVal: 4.379 ± 1.054
0.0AlaTrp: 0.0 ± 0.0
1.684AlaTyr: 1.684 ± 0.694
0.0AlaXaa: 0.0 ± 0.0
Cys
0.674CysAla: 0.674 ± 0.457
0.0CysCys: 0.0 ± 0.0
0.337CysAsp: 0.337 ± 0.404
1.01CysGlu: 1.01 ± 0.562
0.0CysPhe: 0.0 ± 0.0
0.337CysGly: 0.337 ± 0.303
0.337CysHis: 0.337 ± 0.334
0.0CysIle: 0.0 ± 0.0
1.01CysLys: 1.01 ± 0.644
1.347CysLeu: 1.347 ± 0.587
0.337CysMet: 0.337 ± 0.314
0.337CysAsn: 0.337 ± 0.257
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.01CysArg: 1.01 ± 0.559
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.674CysTyr: 0.674 ± 0.365
0.0CysXaa: 0.0 ± 0.0
Asp
4.042AspAla: 4.042 ± 1.268
0.674AspCys: 0.674 ± 0.514
5.726AspAsp: 5.726 ± 1.568
3.031AspGlu: 3.031 ± 1.089
1.684AspPhe: 1.684 ± 0.609
3.031AspGly: 3.031 ± 0.835
1.347AspHis: 1.347 ± 0.845
5.389AspIle: 5.389 ± 2.321
5.389AspLys: 5.389 ± 1.788
6.063AspLeu: 6.063 ± 1.036
2.021AspMet: 2.021 ± 0.793
2.021AspAsn: 2.021 ± 0.702
1.01AspPro: 1.01 ± 0.379
0.674AspGln: 0.674 ± 0.534
2.021AspArg: 2.021 ± 0.855
2.695AspSer: 2.695 ± 0.835
2.358AspThr: 2.358 ± 0.909
3.705AspVal: 3.705 ± 0.777
0.337AspTrp: 0.337 ± 0.351
3.368AspTyr: 3.368 ± 1.151
0.0AspXaa: 0.0 ± 0.0
Glu
5.052GluAla: 5.052 ± 1.503
0.337GluCys: 0.337 ± 0.404
3.368GluAsp: 3.368 ± 1.01
8.084GluGlu: 8.084 ± 1.707
4.715GluPhe: 4.715 ± 0.757
2.358GluGly: 2.358 ± 0.844
0.674GluHis: 0.674 ± 0.449
6.063GluIle: 6.063 ± 1.547
10.104GluLys: 10.104 ± 2.702
10.104GluLeu: 10.104 ± 2.128
1.347GluMet: 1.347 ± 0.619
3.031GluAsn: 3.031 ± 1.134
2.358GluPro: 2.358 ± 0.843
4.379GluGln: 4.379 ± 1.122
3.031GluArg: 3.031 ± 0.874
2.695GluSer: 2.695 ± 0.88
5.726GluThr: 5.726 ± 1.321
3.705GluVal: 3.705 ± 1.211
2.021GluTrp: 2.021 ± 0.631
3.031GluTyr: 3.031 ± 0.915
0.0GluXaa: 0.0 ± 0.0
Phe
2.358PheAla: 2.358 ± 0.712
0.0PheCys: 0.0 ± 0.0
2.358PheAsp: 2.358 ± 0.886
4.042PheGlu: 4.042 ± 1.206
1.01PhePhe: 1.01 ± 0.535
2.358PheGly: 2.358 ± 0.591
0.337PheHis: 0.337 ± 0.364
2.695PheIle: 2.695 ± 1.215
3.705PheLys: 3.705 ± 1.657
4.379PheLeu: 4.379 ± 1.41
0.674PheMet: 0.674 ± 0.389
1.01PheAsn: 1.01 ± 0.662
1.01PhePro: 1.01 ± 0.527
1.684PheGln: 1.684 ± 0.768
1.684PheArg: 1.684 ± 0.637
3.705PheSer: 3.705 ± 1.336
4.715PheThr: 4.715 ± 1.432
2.695PheVal: 2.695 ± 0.975
0.337PheTrp: 0.337 ± 0.257
1.684PheTyr: 1.684 ± 0.615
0.0PheXaa: 0.0 ± 0.0
Gly
1.684GlyAla: 1.684 ± 0.476
0.674GlyCys: 0.674 ± 0.535
3.368GlyAsp: 3.368 ± 1.017
2.695GlyGlu: 2.695 ± 0.672
2.695GlyPhe: 2.695 ± 1.012
1.684GlyGly: 1.684 ± 0.854
0.674GlyHis: 0.674 ± 0.51
5.389GlyIle: 5.389 ± 1.356
5.389GlyLys: 5.389 ± 1.712
4.042GlyLeu: 4.042 ± 1.434
0.674GlyMet: 0.674 ± 0.342
1.347GlyAsn: 1.347 ± 0.56
0.337GlyPro: 0.337 ± 0.531
2.021GlyGln: 2.021 ± 0.926
3.705GlyArg: 3.705 ± 1.045
2.358GlySer: 2.358 ± 1.057
2.358GlyThr: 2.358 ± 0.706
4.042GlyVal: 4.042 ± 0.927
1.347GlyTrp: 1.347 ± 0.753
3.031GlyTyr: 3.031 ± 1.032
0.0GlyXaa: 0.0 ± 0.0
His
1.01HisAla: 1.01 ± 0.742
0.0HisCys: 0.0 ± 0.0
0.337HisAsp: 0.337 ± 0.303
0.674HisGlu: 0.674 ± 0.462
0.337HisPhe: 0.337 ± 0.257
0.674HisGly: 0.674 ± 0.417
1.01HisHis: 1.01 ± 0.515
1.347HisIle: 1.347 ± 0.66
0.337HisLys: 0.337 ± 0.334
2.358HisLeu: 2.358 ± 1.09
0.337HisMet: 0.337 ± 0.355
0.337HisAsn: 0.337 ± 0.432
0.0HisPro: 0.0 ± 0.0
0.337HisGln: 0.337 ± 0.303
1.347HisArg: 1.347 ± 0.693
1.684HisSer: 1.684 ± 0.738
0.674HisThr: 0.674 ± 0.479
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.01HisTyr: 1.01 ± 0.461
0.0HisXaa: 0.0 ± 0.0
Ile
3.705IleAla: 3.705 ± 1.13
1.347IleCys: 1.347 ± 0.772
5.726IleAsp: 5.726 ± 1.867
6.063IleGlu: 6.063 ± 1.865
3.368IlePhe: 3.368 ± 1.297
4.379IleGly: 4.379 ± 1.148
0.337IleHis: 0.337 ± 0.314
6.063IleIle: 6.063 ± 1.457
8.42IleLys: 8.42 ± 2.059
6.063IleLeu: 6.063 ± 1.184
1.01IleMet: 1.01 ± 0.544
3.705IleAsn: 3.705 ± 1.222
3.368IlePro: 3.368 ± 0.891
2.695IleGln: 2.695 ± 0.887
3.705IleArg: 3.705 ± 0.952
4.379IleSer: 4.379 ± 1.365
4.379IleThr: 4.379 ± 1.394
3.031IleVal: 3.031 ± 0.897
0.674IleTrp: 0.674 ± 0.464
1.684IleTyr: 1.684 ± 0.727
0.0IleXaa: 0.0 ± 0.0
Lys
9.431LysAla: 9.431 ± 1.737
0.0LysCys: 0.0 ± 0.0
7.073LysAsp: 7.073 ± 1.514
9.768LysGlu: 9.768 ± 2.328
3.031LysPhe: 3.031 ± 1.442
6.063LysGly: 6.063 ± 2.147
1.347LysHis: 1.347 ± 0.643
8.42LysIle: 8.42 ± 2.263
9.094LysLys: 9.094 ± 2.283
9.431LysLeu: 9.431 ± 1.947
4.042LysMet: 4.042 ± 1.244
7.747LysAsn: 7.747 ± 2.014
1.684LysPro: 1.684 ± 0.682
4.042LysGln: 4.042 ± 1.062
5.726LysArg: 5.726 ± 1.748
3.705LysSer: 3.705 ± 1.285
8.42LysThr: 8.42 ± 2.395
3.031LysVal: 3.031 ± 0.881
0.337LysTrp: 0.337 ± 0.314
2.358LysTyr: 2.358 ± 1.169
0.0LysXaa: 0.0 ± 0.0
Leu
8.084LeuAla: 8.084 ± 1.567
0.674LeuCys: 0.674 ± 0.365
4.715LeuAsp: 4.715 ± 0.712
9.094LeuGlu: 9.094 ± 1.818
4.042LeuPhe: 4.042 ± 1.53
6.399LeuGly: 6.399 ± 1.263
1.347LeuHis: 1.347 ± 0.73
3.705LeuIle: 3.705 ± 0.833
12.799LeuLys: 12.799 ± 1.952
8.757LeuLeu: 8.757 ± 1.412
2.021LeuMet: 2.021 ± 1.255
4.379LeuAsn: 4.379 ± 1.019
3.031LeuPro: 3.031 ± 1.539
5.726LeuGln: 5.726 ± 1.107
3.705LeuArg: 3.705 ± 1.581
6.736LeuSer: 6.736 ± 1.206
4.715LeuThr: 4.715 ± 1.3
4.379LeuVal: 4.379 ± 1.219
1.347LeuTrp: 1.347 ± 0.583
3.368LeuTyr: 3.368 ± 1.466
0.0LeuXaa: 0.0 ± 0.0
Met
2.021MetAla: 2.021 ± 1.006
0.0MetCys: 0.0 ± 0.0
1.684MetAsp: 1.684 ± 0.812
2.021MetGlu: 2.021 ± 0.897
0.674MetPhe: 0.674 ± 0.668
0.337MetGly: 0.337 ± 0.314
0.0MetHis: 0.0 ± 0.0
2.021MetIle: 2.021 ± 1.076
2.695MetLys: 2.695 ± 0.967
1.684MetLeu: 1.684 ± 0.447
0.0MetMet: 0.0 ± 0.0
0.337MetAsn: 0.337 ± 0.36
0.674MetPro: 0.674 ± 0.435
1.01MetGln: 1.01 ± 0.487
1.01MetArg: 1.01 ± 0.525
2.358MetSer: 2.358 ± 1.041
3.031MetThr: 3.031 ± 0.827
0.674MetVal: 0.674 ± 0.469
0.0MetTrp: 0.0 ± 0.0
0.674MetTyr: 0.674 ± 0.464
0.0MetXaa: 0.0 ± 0.0
Asn
3.705AsnAla: 3.705 ± 0.984
0.0AsnCys: 0.0 ± 0.0
2.021AsnAsp: 2.021 ± 0.926
3.705AsnGlu: 3.705 ± 0.981
1.684AsnPhe: 1.684 ± 0.717
3.031AsnGly: 3.031 ± 1.003
0.674AsnHis: 0.674 ± 0.45
3.705AsnIle: 3.705 ± 1.055
3.368AsnLys: 3.368 ± 1.143
5.052AsnLeu: 5.052 ± 1.661
0.0AsnMet: 0.0 ± 0.0
3.368AsnAsn: 3.368 ± 0.751
4.379AsnPro: 4.379 ± 1.175
2.358AsnGln: 2.358 ± 1.12
2.358AsnArg: 2.358 ± 0.891
2.021AsnSer: 2.021 ± 0.768
2.358AsnThr: 2.358 ± 0.785
2.021AsnVal: 2.021 ± 0.604
0.337AsnTrp: 0.337 ± 0.351
2.695AsnTyr: 2.695 ± 0.946
0.0AsnXaa: 0.0 ± 0.0
Pro
2.021ProAla: 2.021 ± 0.97
0.0ProCys: 0.0 ± 0.0
2.695ProAsp: 2.695 ± 0.858
1.347ProGlu: 1.347 ± 0.617
2.021ProPhe: 2.021 ± 0.701
1.01ProGly: 1.01 ± 0.379
0.674ProHis: 0.674 ± 0.421
2.021ProIle: 2.021 ± 0.789
2.358ProLys: 2.358 ± 0.861
2.021ProLeu: 2.021 ± 0.698
0.674ProMet: 0.674 ± 0.54
1.347ProAsn: 1.347 ± 0.76
0.337ProPro: 0.337 ± 0.4
2.695ProGln: 2.695 ± 0.79
3.031ProArg: 3.031 ± 0.579
1.684ProSer: 1.684 ± 0.718
1.347ProThr: 1.347 ± 0.761
0.337ProVal: 0.337 ± 0.314
0.0ProTrp: 0.0 ± 0.0
1.01ProTyr: 1.01 ± 0.477
0.0ProXaa: 0.0 ± 0.0
Gln
4.715GlnAla: 4.715 ± 1.222
0.0GlnCys: 0.0 ± 0.0
1.684GlnAsp: 1.684 ± 0.85
6.399GlnGlu: 6.399 ± 1.285
1.684GlnPhe: 1.684 ± 0.952
2.695GlnGly: 2.695 ± 0.903
0.337GlnHis: 0.337 ± 0.334
3.031GlnIle: 3.031 ± 0.956
5.052GlnLys: 5.052 ± 1.545
4.042GlnLeu: 4.042 ± 1.251
0.337GlnMet: 0.337 ± 0.257
2.021GlnAsn: 2.021 ± 0.751
1.01GlnPro: 1.01 ± 0.486
2.021GlnGln: 2.021 ± 0.973
1.684GlnArg: 1.684 ± 0.654
1.684GlnSer: 1.684 ± 0.774
2.358GlnThr: 2.358 ± 0.797
2.358GlnVal: 2.358 ± 0.809
0.0GlnTrp: 0.0 ± 0.0
2.021GlnTyr: 2.021 ± 0.898
0.0GlnXaa: 0.0 ± 0.0
Arg
2.358ArgAla: 2.358 ± 1.107
0.674ArgCys: 0.674 ± 0.417
2.358ArgAsp: 2.358 ± 0.892
5.389ArgGlu: 5.389 ± 0.83
1.347ArgPhe: 1.347 ± 0.736
2.021ArgGly: 2.021 ± 1.109
1.684ArgHis: 1.684 ± 0.819
3.368ArgIle: 3.368 ± 0.858
7.41ArgLys: 7.41 ± 1.523
5.389ArgLeu: 5.389 ± 1.533
2.021ArgMet: 2.021 ± 0.625
1.347ArgAsn: 1.347 ± 0.602
1.684ArgPro: 1.684 ± 0.714
2.358ArgGln: 2.358 ± 0.841
3.031ArgArg: 3.031 ± 0.791
2.695ArgSer: 2.695 ± 1.205
1.347ArgThr: 1.347 ± 0.612
0.337ArgVal: 0.337 ± 0.257
0.674ArgTrp: 0.674 ± 0.718
2.358ArgTyr: 2.358 ± 1.018
0.0ArgXaa: 0.0 ± 0.0
Ser
2.695SerAla: 2.695 ± 1.006
0.674SerCys: 0.674 ± 0.443
3.031SerAsp: 3.031 ± 1.025
1.347SerGlu: 1.347 ± 0.841
3.031SerPhe: 3.031 ± 1.116
1.684SerGly: 1.684 ± 0.819
1.01SerHis: 1.01 ± 0.663
3.031SerIle: 3.031 ± 1.265
5.052SerLys: 5.052 ± 1.45
5.389SerLeu: 5.389 ± 0.606
1.684SerMet: 1.684 ± 0.888
5.389SerAsn: 5.389 ± 1.356
1.347SerPro: 1.347 ± 0.553
4.042SerGln: 4.042 ± 0.923
1.347SerArg: 1.347 ± 0.76
3.031SerSer: 3.031 ± 0.924
2.695SerThr: 2.695 ± 1.013
3.368SerVal: 3.368 ± 0.944
0.337SerTrp: 0.337 ± 0.271
3.368SerTyr: 3.368 ± 0.999
0.0SerXaa: 0.0 ± 0.0
Thr
3.368ThrAla: 3.368 ± 1.088
0.337ThrCys: 0.337 ± 0.334
2.021ThrAsp: 2.021 ± 0.76
5.389ThrGlu: 5.389 ± 1.175
2.695ThrPhe: 2.695 ± 0.92
4.042ThrGly: 4.042 ± 1.274
1.01ThrHis: 1.01 ± 0.608
5.052ThrIle: 5.052 ± 1.158
4.379ThrLys: 4.379 ± 0.936
5.389ThrLeu: 5.389 ± 1.452
1.684ThrMet: 1.684 ± 0.86
1.684ThrAsn: 1.684 ± 0.739
3.031ThrPro: 3.031 ± 0.826
1.684ThrGln: 1.684 ± 0.759
2.695ThrArg: 2.695 ± 0.949
1.684ThrSer: 1.684 ± 0.53
2.021ThrThr: 2.021 ± 0.64
5.052ThrVal: 5.052 ± 1.618
0.674ThrTrp: 0.674 ± 0.628
2.021ThrTyr: 2.021 ± 0.622
0.0ThrXaa: 0.0 ± 0.0
Val
2.358ValAla: 2.358 ± 0.901
0.337ValCys: 0.337 ± 0.371
1.347ValAsp: 1.347 ± 0.516
1.684ValGlu: 1.684 ± 0.764
3.031ValPhe: 3.031 ± 0.941
1.684ValGly: 1.684 ± 0.534
0.0ValHis: 0.0 ± 0.0
3.368ValIle: 3.368 ± 1.178
6.063ValLys: 6.063 ± 2.198
3.705ValLeu: 3.705 ± 1.314
1.347ValMet: 1.347 ± 0.662
3.368ValAsn: 3.368 ± 1.019
1.347ValPro: 1.347 ± 0.821
1.01ValGln: 1.01 ± 0.559
4.042ValArg: 4.042 ± 1.258
4.042ValSer: 4.042 ± 0.812
2.695ValThr: 2.695 ± 0.827
2.021ValVal: 2.021 ± 0.924
0.674ValTrp: 0.674 ± 0.514
2.695ValTyr: 2.695 ± 0.969
0.0ValXaa: 0.0 ± 0.0
Trp
0.337TrpAla: 0.337 ± 0.314
0.0TrpCys: 0.0 ± 0.0
0.337TrpAsp: 0.337 ± 0.303
0.674TrpGlu: 0.674 ± 0.448
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.674TrpIle: 0.674 ± 0.472
1.684TrpLys: 1.684 ± 0.725
1.684TrpLeu: 1.684 ± 0.76
0.337TrpMet: 0.337 ± 0.271
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.337TrpGln: 0.337 ± 0.351
0.0TrpArg: 0.0 ± 0.0
0.674TrpSer: 0.674 ± 0.613
1.01TrpThr: 1.01 ± 0.688
0.674TrpVal: 0.674 ± 0.365
0.337TrpTrp: 0.337 ± 0.257
0.674TrpTyr: 0.674 ± 0.388
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.358TyrAla: 2.358 ± 0.736
0.674TyrCys: 0.674 ± 0.605
2.358TyrAsp: 2.358 ± 0.931
4.042TyrGlu: 4.042 ± 1.568
3.031TyrPhe: 3.031 ± 0.891
1.01TyrGly: 1.01 ± 0.689
0.337TyrHis: 0.337 ± 0.364
2.358TyrIle: 2.358 ± 0.903
4.715TyrLys: 4.715 ± 1.573
4.042TyrLeu: 4.042 ± 0.989
1.01TyrMet: 1.01 ± 0.85
2.695TyrAsn: 2.695 ± 1.156
0.674TyrPro: 0.674 ± 0.605
2.358TyrGln: 2.358 ± 0.509
2.695TyrArg: 2.695 ± 0.92
2.358TyrSer: 2.358 ± 0.713
1.347TyrThr: 1.347 ± 0.604
1.01TyrVal: 1.01 ± 0.682
0.0TyrTrp: 0.0 ± 0.0
3.705TyrTyr: 3.705 ± 0.926
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (2970 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
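A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. Taking the error as the standard deviation of the replicates, the fixed seed, the function names and the toy sequences are all assumptions of this sketch; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not taken from the Javan543 proteome.
toy = ["MKKLA", "AKKL", "MLLKE"]
err_kk = bootstrap_error(toy, "KK")
err_ww = bootstrap_error(toy, "WW")  # never observed in the toy set
```

A dipeptide that never occurs has a frequency of zero in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.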

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski