Amino acid dipeptide frequency for Cucumber leaf spot virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
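As a minimal sketch of how such a frequency could be computed (not necessarily the procedure used by Proteome-pI), the Python snippet below counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input sequences, and the choice not to count pairs that span two proteins are assumptions made for illustration.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    so a pair is never formed across two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        # Every position i contributes the overlapping pair seq[i:i+2].
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the viral proteome).
freqs = dipeptide_frequencies(["MAAGL", "AAG"])
```

By construction the returned frequencies sum to 1000‰, which is a quick sanity check on any real input.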

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
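The colour rule can be sketched as a comparison against the uniform expectation. With 400 equally likely dipeptides, the expected frequency is 1/400 = 0.25%, i.e. 2.5‰ in the units of the table. The helper `classify` and its labels are hypothetical, and the page does not state whether any tolerance around the expected value is applied, so this sketch uses a plain comparison.

```python
# 2.5 per mille for 400 equally likely dipeptides.
EXPECTED_PER_MILLE = 1000.0 / 400

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # shown in red in the table
    if freq_per_mille < expected:
        return "under"  # shown in blue in the table
    return "expected"

# Values taken from the table on this page.
labels = {
    "ValGly": classify(11.494),
    "AlaAsp": classify(0.575),
}
```

With this rule, ValGly is labelled as overrepresented and AlaAsp as underrepresented.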

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.874 ± 2.014
AlaCys: 2.299 ± 1.125
AlaAsp: 0.575 ± 0.787
AlaGlu: 2.299 ± 0.755
AlaPhe: 1.149 ± 0.474
AlaGly: 5.747 ± 1.362
AlaHis: 2.299 ± 0.855
AlaIle: 4.023 ± 0.897
AlaLys: 1.149 ± 0.563
AlaLeu: 3.448 ± 0.937
AlaMet: 1.149 ± 0.815
AlaAsn: 4.598 ± 0.793
AlaPro: 3.448 ± 1.005
AlaGln: 2.874 ± 1.215
AlaArg: 1.149 ± 0.474
AlaSer: 6.322 ± 1.174
AlaThr: 4.598 ± 2.297
AlaVal: 5.747 ± 1.3
AlaTrp: 1.724 ± 0.812
AlaTyr: 4.023 ± 0.876
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.575 ± 0.537
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.149 ± 0.741
CysPhe: 0.0 ± 0.0
CysGly: 0.575 ± 0.371
CysHis: 0.0 ± 0.0
CysIle: 2.299 ± 0.731
CysLys: 3.448 ± 1.114
CysLeu: 5.747 ± 2.21
CysMet: 1.149 ± 0.563
CysAsn: 0.0 ± 0.0
CysPro: 1.724 ± 0.76
CysGln: 0.575 ± 0.371
CysArg: 2.874 ± 0.908
CysSer: 1.724 ± 0.76
CysThr: 1.724 ± 0.792
CysVal: 1.724 ± 0.66
CysTrp: 0.0 ± 0.0
CysTyr: 0.575 ± 0.787
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.448 ± 0.679
AspCys: 2.299 ± 1.101
AspAsp: 4.598 ± 2.107
AspGlu: 2.299 ± 1.4
AspPhe: 0.0 ± 0.0
AspGly: 4.023 ± 1.338
AspHis: 0.575 ± 0.371
AspIle: 1.724 ± 0.544
AspLys: 1.149 ± 0.474
AspLeu: 3.448 ± 1.84
AspMet: 0.575 ± 0.371
AspAsn: 0.575 ± 0.371
AspPro: 2.299 ± 0.96
AspGln: 1.149 ± 0.741
AspArg: 4.023 ± 0.71
AspSer: 4.023 ± 2.475
AspThr: 0.0 ± 0.0
AspVal: 5.172 ± 1.6
AspTrp: 0.0 ± 0.0
AspTyr: 1.149 ± 0.661
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.747 ± 1.511
GluCys: 1.149 ± 0.563
GluAsp: 1.149 ± 0.661
GluGlu: 4.023 ± 1.306
GluPhe: 4.598 ± 0.393
GluGly: 1.724 ± 0.502
GluHis: 1.149 ± 0.741
GluIle: 4.023 ± 0.897
GluLys: 4.598 ± 2.511
GluLeu: 2.299 ± 0.474
GluMet: 0.0 ± 0.0
GluAsn: 1.149 ± 1.574
GluPro: 2.299 ± 0.84
GluGln: 0.575 ± 0.787
GluArg: 6.322 ± 2.475
GluSer: 4.023 ± 1.04
GluThr: 2.299 ± 1.429
GluVal: 6.322 ± 1.324
GluTrp: 1.149 ± 0.661
GluTyr: 2.299 ± 0.474
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.149 ± 1.073
PheCys: 4.023 ± 1.41
PheAsp: 2.299 ± 0.637
PheGlu: 2.299 ± 1.323
PhePhe: 0.575 ± 0.537
PheGly: 0.575 ± 0.371
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 2.299 ± 1.323
PheLeu: 2.299 ± 0.605
PheMet: 1.724 ± 0.939
PheAsn: 1.149 ± 1.106
PhePro: 2.299 ± 0.731
PheGln: 0.575 ± 0.787
PheArg: 1.724 ± 1.61
PheSer: 2.299 ± 1.458
PheThr: 4.598 ± 1.452
PheVal: 3.448 ± 0.679
PheTrp: 1.149 ± 0.474
PheTyr: 1.149 ± 0.661
PheXaa: 0.575 ± 0.371
Gly
GlyAla: 2.874 ± 1.764
GlyCys: 1.149 ± 0.741
GlyAsp: 6.322 ± 0.515
GlyGlu: 4.023 ± 2.149
GlyPhe: 3.448 ± 0.87
GlyGly: 8.046 ± 2.735
GlyHis: 1.724 ± 1.16
GlyIle: 1.724 ± 1.725
GlyLys: 3.448 ± 1.884
GlyLeu: 8.621 ± 1.255
GlyMet: 2.874 ± 0.889
GlyAsn: 2.874 ± 1.269
GlyPro: 1.724 ± 1.408
GlyGln: 2.299 ± 1.193
GlyArg: 6.897 ± 1.697
GlySer: 5.172 ± 0.921
GlyThr: 2.874 ± 1.269
GlyVal: 5.172 ± 1.642
GlyTrp: 0.575 ± 0.371
GlyTyr: 2.299 ± 1.029
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.575 ± 0.371
HisAsp: 1.149 ± 0.563
HisGlu: 0.575 ± 0.371
HisPhe: 0.0 ± 0.0
HisGly: 1.149 ± 1.106
HisHis: 0.0 ± 0.0
HisIle: 1.149 ± 0.745
HisLys: 2.299 ± 0.855
HisLeu: 0.575 ± 0.371
HisMet: 0.0 ± 0.0
HisAsn: 1.724 ± 1.347
HisPro: 1.149 ± 0.679
HisGln: 0.575 ± 0.734
HisArg: 1.149 ± 0.474
HisSer: 1.724 ± 0.66
HisThr: 0.575 ± 0.787
HisVal: 0.575 ± 0.734
HisTrp: 0.575 ± 0.371
HisTyr: 1.149 ± 0.563
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.747 ± 2.249
IleCys: 0.0 ± 0.0
IleAsp: 2.299 ± 1.323
IleGlu: 2.874 ± 1.039
IlePhe: 0.575 ± 0.371
IleGly: 4.023 ± 1.715
IleHis: 0.0 ± 0.0
IleIle: 3.448 ± 1.114
IleLys: 1.724 ± 0.792
IleLeu: 4.598 ± 1.794
IleMet: 1.724 ± 0.792
IleAsn: 2.299 ± 1.458
IlePro: 2.299 ± 0.855
IleGln: 2.874 ± 2.307
IleArg: 2.299 ± 1.524
IleSer: 2.299 ± 0.855
IleThr: 5.172 ± 1.736
IleVal: 8.046 ± 1.177
IleTrp: 0.575 ± 0.537
IleTyr: 0.575 ± 0.537
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.322 ± 1.747
LysCys: 0.0 ± 0.0
LysAsp: 2.299 ± 0.96
LysGlu: 4.023 ± 1.027
LysPhe: 1.724 ± 1.347
LysGly: 4.598 ± 0.793
LysHis: 0.575 ± 0.371
LysIle: 2.874 ± 1.039
LysLys: 3.448 ± 0.565
LysLeu: 6.897 ± 1.927
LysMet: 1.724 ± 0.812
LysAsn: 1.149 ± 0.474
LysPro: 2.299 ± 1.029
LysGln: 2.299 ± 0.474
LysArg: 1.724 ± 0.544
LysSer: 3.448 ± 1.522
LysThr: 1.724 ± 1.347
LysVal: 3.448 ± 0.713
LysTrp: 1.149 ± 0.661
LysTyr: 1.724 ± 0.544
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.172 ± 2.011
LeuCys: 2.299 ± 0.777
LeuAsp: 4.598 ± 1.941
LeuGlu: 4.598 ± 1.75
LeuPhe: 2.299 ± 0.922
LeuGly: 5.172 ± 0.99
LeuHis: 0.0 ± 0.0
LeuIle: 5.172 ± 1.729
LeuLys: 5.747 ± 2.88
LeuLeu: 8.046 ± 2.191
LeuMet: 2.299 ± 0.818
LeuAsn: 1.149 ± 1.006
LeuPro: 8.621 ± 0.812
LeuGln: 2.299 ± 1.323
LeuArg: 2.874 ± 1.44
LeuSer: 10.345 ± 2.021
LeuThr: 5.172 ± 0.803
LeuVal: 8.621 ± 1.919
LeuTrp: 0.575 ± 0.371
LeuTyr: 6.322 ± 2.279
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.724 ± 0.544
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 4.023 ± 0.877
MetPhe: 0.0 ± 0.0
MetGly: 2.299 ± 0.474
MetHis: 0.0 ± 0.0
MetIle: 1.149 ± 0.563
MetLys: 4.598 ± 1.711
MetLeu: 1.149 ± 0.563
MetMet: 1.724 ± 0.812
MetAsn: 0.575 ± 0.371
MetPro: 1.149 ± 0.563
MetGln: 0.0 ± 0.0
MetArg: 1.724 ± 0.544
MetSer: 2.874 ± 1.039
MetThr: 1.149 ± 0.741
MetVal: 3.448 ± 1.584
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.299 ± 0.922
AsnCys: 0.575 ± 0.371
AsnAsp: 0.575 ± 0.734
AsnGlu: 1.149 ± 0.745
AsnPhe: 2.299 ± 1.429
AsnGly: 1.149 ± 0.745
AsnHis: 0.575 ± 0.787
AsnIle: 2.299 ± 0.474
AsnLys: 1.149 ± 1.006
AsnLeu: 1.149 ± 0.745
AsnMet: 2.299 ± 1.125
AsnAsn: 2.299 ± 1.878
AsnPro: 0.575 ± 0.537
AsnGln: 1.724 ± 1.07
AsnArg: 0.575 ± 0.787
AsnSer: 4.023 ± 1.013
AsnThr: 2.299 ± 0.755
AsnVal: 5.172 ± 2.319
AsnTrp: 0.575 ± 0.371
AsnTyr: 0.575 ± 0.371
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.874 ± 0.644
ProCys: 0.575 ± 0.371
ProAsp: 2.299 ± 1.482
ProGlu: 1.724 ± 0.792
ProPhe: 1.724 ± 0.812
ProGly: 0.575 ± 0.537
ProHis: 1.724 ± 0.812
ProIle: 1.149 ± 1.106
ProLys: 0.575 ± 0.734
ProLeu: 5.747 ± 1.487
ProMet: 0.0 ± 0.0
ProAsn: 2.299 ± 1.507
ProPro: 1.724 ± 0.544
ProGln: 1.724 ± 0.72
ProArg: 6.897 ± 2.854
ProSer: 4.598 ± 1.906
ProThr: 2.299 ± 2.146
ProVal: 6.322 ± 0.663
ProTrp: 0.575 ± 0.537
ProTyr: 1.724 ± 0.502
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.448 ± 0.563
GlnCys: 0.0 ± 0.0
GlnAsp: 1.149 ± 1.006
GlnGlu: 0.0 ± 0.0
GlnPhe: 3.448 ± 1.005
GlnGly: 3.448 ± 4.043
GlnHis: 1.724 ± 0.782
GlnIle: 2.299 ± 1.458
GlnLys: 0.575 ± 0.734
GlnLeu: 2.874 ± 1.4
GlnMet: 0.575 ± 0.537
GlnAsn: 1.149 ± 0.745
GlnPro: 1.724 ± 0.502
GlnGln: 0.575 ± 0.371
GlnArg: 4.023 ± 0.897
GlnSer: 2.299 ± 2.212
GlnThr: 1.149 ± 1.073
GlnVal: 1.149 ± 0.745
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.299 ± 1.193
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.023 ± 0.71
ArgCys: 0.575 ± 0.787
ArgAsp: 2.874 ± 1.335
ArgGlu: 1.724 ± 1.16
ArgPhe: 4.598 ± 1.266
ArgGly: 7.471 ± 2.22
ArgHis: 0.575 ± 0.371
ArgIle: 4.598 ± 1.298
ArgLys: 4.023 ± 0.884
ArgLeu: 6.322 ± 1.962
ArgMet: 1.724 ± 0.77
ArgAsn: 0.0 ± 0.0
ArgPro: 4.598 ± 1.616
ArgGln: 1.724 ± 0.809
ArgArg: 2.299 ± 1.029
ArgSer: 2.874 ± 1.039
ArgThr: 2.299 ± 0.755
ArgVal: 6.897 ± 1.701
ArgTrp: 1.724 ± 0.792
ArgTyr: 3.448 ± 1.791
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 0.575 ± 0.537
SerCys: 3.448 ± 1.584
SerAsp: 2.299 ± 1.507
SerGlu: 6.322 ± 1.901
SerPhe: 4.598 ± 1.616
SerGly: 2.299 ± 1.358
SerHis: 1.724 ± 1.705
SerIle: 6.322 ± 1.63
SerLys: 3.448 ± 1.398
SerLeu: 8.621 ± 1.255
SerMet: 3.448 ± 1.204
SerAsn: 3.448 ± 3.219
SerPro: 1.724 ± 1.07
SerGln: 4.023 ± 3.449
SerArg: 6.322 ± 1.707
SerSer: 7.471 ± 2.185
SerThr: 5.747 ± 2.817
SerVal: 5.172 ± 1.923
SerTrp: 1.149 ± 0.563
SerTyr: 2.874 ± 1.391
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.747 ± 1.254
ThrCys: 1.149 ± 1.006
ThrAsp: 2.299 ± 1.458
ThrGlu: 1.724 ± 1.725
ThrPhe: 2.299 ± 0.802
ThrGly: 2.874 ± 2.683
ThrHis: 0.0 ± 0.0
ThrIle: 3.448 ± 1.92
ThrLys: 4.023 ± 1.952
ThrLeu: 4.598 ± 1.274
ThrMet: 1.149 ± 0.563
ThrAsn: 0.575 ± 0.537
ThrPro: 4.023 ± 1.551
ThrGln: 1.724 ± 1.408
ThrArg: 2.874 ± 1.067
ThrSer: 4.023 ± 2.065
ThrThr: 8.046 ± 2.385
ThrVal: 3.448 ± 1.397
ThrTrp: 1.724 ± 0.809
ThrTyr: 1.724 ± 0.66
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.448 ± 1.44
ValCys: 5.172 ± 1.607
ValAsp: 4.598 ± 1.487
ValGlu: 7.471 ± 1.516
ValPhe: 2.299 ± 0.755
ValGly: 11.494 ± 1.917
ValHis: 2.299 ± 0.947
ValIle: 2.299 ± 0.755
ValLys: 5.172 ± 2.372
ValLeu: 8.621 ± 2.573
ValMet: 2.874 ± 0.579
ValAsn: 3.448 ± 1.3
ValPro: 2.874 ± 2.683
ValGln: 2.874 ± 1.116
ValArg: 5.747 ± 1.17
ValSer: 5.172 ± 3.051
ValThr: 3.448 ± 1.062
ValVal: 7.471 ± 1.646
ValTrp: 0.0 ± 0.0
ValTyr: 4.023 ± 1.184
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.575 ± 0.537
TrpCys: 0.0 ± 0.0
TrpAsp: 0.575 ± 0.537
TrpGlu: 1.724 ± 0.792
TrpPhe: 0.575 ± 0.371
TrpGly: 1.149 ± 0.661
TrpHis: 1.149 ± 0.563
TrpIle: 1.724 ± 0.72
TrpLys: 0.575 ± 0.734
TrpLeu: 1.724 ± 0.66
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.575 ± 0.371
TrpArg: 0.575 ± 0.734
TrpSer: 0.0 ± 0.0
TrpThr: 1.149 ± 0.474
TrpVal: 0.575 ± 0.371
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.575 ± 0.371
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.448 ± 0.679
TyrCys: 0.575 ± 0.371
TyrAsp: 0.575 ± 0.787
TyrGlu: 2.874 ± 1.44
TyrPhe: 0.0 ± 0.0
TyrGly: 5.172 ± 1.302
TyrHis: 0.575 ± 0.734
TyrIle: 1.724 ± 0.544
TyrLys: 0.0 ± 0.0
TyrLeu: 4.598 ± 2.511
TyrMet: 0.0 ± 0.0
TyrAsn: 2.874 ± 1.853
TyrPro: 0.575 ± 0.734
TyrGln: 2.874 ± 0.872
TyrArg: 2.299 ± 0.865
TyrSer: 6.322 ± 2.379
TyrThr: 1.149 ± 0.474
TyrVal: 2.874 ± 1.985
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.575 ± 0.371
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1741 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
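A minimal sketch of protein-level bootstrapping, assuming the error is the standard deviation of the statistic over 100 resamples in which whole proteins are drawn with replacement. The function names, the fixed seed, and the use of the standard deviation are assumptions; the exact procedure used by the database is not described on this page.

```python
import random
import statistics

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide, counted within each protein."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)

# Toy input (hypothetical sequences, not the viral proteome).
toy = ["MAAGL", "AAGV", "GLLK"]
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the statistic depends on which proteins are in the set. With only 5 proteins, as here, such estimates are necessarily coarse.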

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski