Amino acid dipeptide frequency for White clover mosaic virus (strain M) (WCMV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
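As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is a minimal sketch and not the code used to produce this page: the function name `dipeptide_frequencies`, the use of one-letter codes, and the choice to count pairs within each protein only (never across protein boundaries) are all assumptions made for illustration.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: sequences use one-letter amino acid codes and pairs are
    counted inside each protein only, not across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Slide an overlapping window of length 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not the WCMV proteome).
toy = ["MALLA", "ALK"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In the toy input, "AL" occurs twice among six pairs, so it accounts for one third of all dipeptides.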

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random and with equal probability, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
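The uniform baseline and the over/under labelling can be sketched as follows. This is only an illustration under stated assumptions: the helper names `expected_per_mille` and `classify` are hypothetical, and whether the page compares against the 400-pair or the 441-pair baseline is not stated, so the alphabet size is left as a parameter.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation.

    'over' corresponds to the red marking described in the text,
    'under' to the blue marking.
    """
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


print(expected_per_mille(20))   # 400 possible pairs
print(expected_per_mille(21))   # 441 possible pairs, with Xaa
print(classify(10.521))         # AlaLeu value from the table
print(classify(0.526))          # CysGlu value from the table
```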

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
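The unit conversion is simple arithmetic; a small sketch, with hypothetical helper names, is given here for convenience.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value * 0.1


# AlaAla is listed as 4.734 per mille in the table.
print(per_mille_to_fraction(4.734))
print(per_mille_to_percent(4.734))
```

With roughly 1900 dipeptide positions in this proteome, the smallest non-zero value in the table (0.526‰) corresponds to about one occurrence.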

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.734 ± 1.767
AlaCys: 1.052 ± 1.252
AlaAsp: 3.156 ± 1.676
AlaGlu: 2.63 ± 2.886
AlaPhe: 5.786 ± 2.776
AlaGly: 4.208 ± 1.312
AlaHis: 1.578 ± 1.015
AlaIle: 5.786 ± 0.993
AlaLys: 5.26 ± 2.307
AlaLeu: 10.521 ± 3.092
AlaMet: 1.052 ± 0.556
AlaAsn: 3.682 ± 1.297
AlaPro: 3.156 ± 1.183
AlaGln: 1.578 ± 1.015
AlaArg: 2.63 ± 0.94
AlaSer: 3.682 ± 3.686
AlaThr: 4.734 ± 1.767
AlaVal: 3.682 ± 1.945
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.682 ± 1.416
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.052 ± 1.426
CysGlu: 0.526 ± 0.278
CysPhe: 1.052 ± 0.556
CysGly: 1.052 ± 0.556
CysHis: 0.0 ± 0.0
CysIle: 0.526 ± 0.278
CysLys: 1.052 ± 0.718
CysLeu: 1.052 ± 0.718
CysMet: 0.526 ± 1.894
CysAsn: 0.526 ± 0.825
CysPro: 0.526 ± 0.278
CysGln: 2.104 ± 1.104
CysArg: 0.526 ± 0.278
CysSer: 1.052 ± 0.556
CysThr: 2.104 ± 1.923
CysVal: 1.052 ± 0.928
CysTrp: 0.526 ± 1.068
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.208 ± 1.312
AspCys: 1.578 ± 0.834
AspAsp: 1.052 ± 0.556
AspGlu: 3.682 ± 1.416
AspPhe: 4.208 ± 1.642
AspGly: 2.104 ± 2.195
AspHis: 2.63 ± 2.151
AspIle: 5.26 ± 1.259
AspLys: 0.0 ± 0.0
AspLeu: 6.839 ± 1.581
AspMet: 0.526 ± 0.278
AspAsn: 2.63 ± 1.739
AspPro: 3.156 ± 2.154
AspGln: 0.526 ± 0.278
AspArg: 1.052 ± 0.556
AspSer: 4.208 ± 0.999
AspThr: 3.682 ± 0.649
AspVal: 2.63 ± 1.751
AspTrp: 1.052 ± 0.556
AspTyr: 2.104 ± 1.111
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.208 ± 1.521
GluCys: 0.0 ± 0.0
GluAsp: 2.63 ± 1.389
GluGlu: 4.734 ± 0.995
GluPhe: 2.104 ± 0.805
GluGly: 1.052 ± 0.556
GluHis: 1.052 ± 0.556
GluIle: 5.26 ± 3.496
GluLys: 3.682 ± 1.945
GluLeu: 3.156 ± 1.098
GluMet: 0.526 ± 0.278
GluAsn: 2.63 ± 1.389
GluPro: 3.682 ± 1.416
GluGln: 0.526 ± 0.278
GluArg: 3.156 ± 1.183
GluSer: 4.208 ± 0.809
GluThr: 3.156 ± 0.584
GluVal: 2.104 ± 1.111
GluTrp: 1.052 ± 0.556
GluTyr: 1.052 ± 1.515
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.734 ± 3.286
PheCys: 1.578 ± 0.838
PheAsp: 3.682 ± 1.42
PheGlu: 2.63 ± 1.739
PhePhe: 2.104 ± 0.804
PheGly: 1.578 ± 2.198
PheHis: 3.156 ± 1.667
PheIle: 3.682 ± 1.411
PheLys: 2.63 ± 1.397
PheLeu: 4.208 ± 1.592
PheMet: 1.578 ± 0.834
PheAsn: 5.786 ± 1.749
PhePro: 2.104 ± 1.111
PheGln: 2.63 ± 1.869
PheArg: 0.526 ± 0.278
PheSer: 1.578 ± 0.834
PheThr: 4.734 ± 1.254
PheVal: 1.578 ± 0.868
PheTrp: 0.526 ± 0.278
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.682 ± 1.648
GlyCys: 1.578 ± 1.33
GlyAsp: 3.682 ± 0.84
GlyGlu: 1.578 ± 0.868
GlyPhe: 2.104 ± 0.804
GlyGly: 2.104 ± 0.895
GlyHis: 1.052 ± 0.556
GlyIle: 1.578 ± 0.834
GlyLys: 2.63 ± 1.401
GlyLeu: 4.208 ± 3.921
GlyMet: 0.526 ± 0.278
GlyAsn: 0.526 ± 0.278
GlyPro: 3.156 ± 1.216
GlyGln: 2.104 ± 0.805
GlyArg: 0.526 ± 0.278
GlySer: 3.156 ± 1.421
GlyThr: 3.156 ± 1.748
GlyVal: 2.104 ± 2.986
GlyTrp: 0.526 ± 0.278
GlyTyr: 2.104 ± 0.804
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.63 ± 0.973
HisCys: 1.052 ± 1.528
HisAsp: 1.052 ± 0.556
HisGlu: 2.104 ± 1.111
HisPhe: 1.578 ± 0.834
HisGly: 4.208 ± 1.335
HisHis: 1.052 ± 0.556
HisIle: 2.104 ± 0.805
HisLys: 1.052 ± 0.919
HisLeu: 3.682 ± 1.665
HisMet: 0.526 ± 0.976
HisAsn: 2.63 ± 1.776
HisPro: 2.63 ± 0.94
HisGln: 2.63 ± 0.645
HisArg: 2.104 ± 0.846
HisSer: 2.104 ± 1.856
HisThr: 1.052 ± 0.718
HisVal: 0.526 ± 0.278
HisTrp: 0.0 ± 0.0
HisTyr: 1.052 ± 1.515
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.682 ± 1.149
IleCys: 0.0 ± 0.0
IleAsp: 0.526 ± 0.278
IleGlu: 7.891 ± 2.625
IlePhe: 4.208 ± 1.26
IleGly: 3.682 ± 1.411
IleHis: 2.104 ± 0.804
IleIle: 3.156 ± 2.66
IleLys: 5.26 ± 0.599
IleLeu: 11.573 ± 2.804
IleMet: 1.578 ± 0.838
IleAsn: 4.734 ± 1.852
IlePro: 3.682 ± 0.649
IleGln: 2.104 ± 1.111
IleArg: 3.156 ± 2.787
IleSer: 4.734 ± 4.623
IleThr: 4.208 ± 0.809
IleVal: 4.208 ± 1.78
IleTrp: 0.526 ± 0.278
IleTyr: 1.578 ± 0.834
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.734 ± 1.764
LysCys: 0.0 ± 0.0
LysAsp: 3.156 ± 1.098
LysGlu: 1.578 ± 0.838
LysPhe: 2.63 ± 0.94
LysGly: 1.578 ± 0.834
LysHis: 2.63 ± 1.001
LysIle: 6.839 ± 1.558
LysLys: 2.63 ± 1.389
LysLeu: 5.26 ± 1.767
LysMet: 1.578 ± 0.8
LysAsn: 2.104 ± 1.111
LysPro: 4.734 ± 0.995
LysGln: 4.208 ± 1.692
LysArg: 2.104 ± 0.804
LysSer: 6.312 ± 1.731
LysThr: 6.839 ± 1.667
LysVal: 4.208 ± 1.26
LysTrp: 0.0 ± 0.0
LysTyr: 1.578 ± 1.304
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.995 ± 4.746
LeuCys: 3.156 ± 1.043
LeuAsp: 8.417 ± 2.103
LeuGlu: 4.734 ± 1.919
LeuPhe: 5.786 ± 1.332
LeuGly: 4.734 ± 2.249
LeuHis: 5.26 ± 3.892
LeuIle: 5.786 ± 3.754
LeuLys: 7.891 ± 3.324
LeuLeu: 8.417 ± 3.869
LeuMet: 1.578 ± 0.838
LeuAsn: 3.156 ± 1.183
LeuPro: 8.943 ± 2.043
LeuGln: 3.682 ± 1.945
LeuArg: 3.682 ± 1.503
LeuSer: 6.312 ± 2.005
LeuThr: 6.312 ± 1.149
LeuVal: 3.682 ± 2.063
LeuTrp: 1.052 ± 0.556
LeuTyr: 2.63 ± 0.973
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.578 ± 0.838
MetCys: 0.0 ± 0.0
MetAsp: 1.052 ± 1.698
MetGlu: 0.526 ± 0.278
MetPhe: 0.0 ± 0.0
MetGly: 0.526 ± 0.278
MetHis: 0.0 ± 0.0
MetIle: 2.104 ± 1.111
MetLys: 1.052 ± 0.556
MetLeu: 1.052 ± 0.919
MetMet: 0.0 ± 0.0
MetAsn: 1.578 ± 0.834
MetPro: 1.578 ± 1.304
MetGln: 1.052 ± 0.556
MetArg: 1.578 ± 0.834
MetSer: 2.104 ± 1.111
MetThr: 0.526 ± 0.278
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.526 ± 1.068
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.104 ± 1.111
AsnCys: 0.526 ± 0.278
AsnAsp: 1.578 ± 0.838
AsnGlu: 2.104 ± 1.111
AsnPhe: 1.578 ± 0.834
AsnGly: 0.526 ± 0.278
AsnHis: 2.63 ± 1.794
AsnIle: 5.786 ± 1.199
AsnLys: 3.156 ± 1.166
AsnLeu: 5.786 ± 1.784
AsnMet: 1.578 ± 1.169
AsnAsn: 3.682 ± 1.741
AsnPro: 6.312 ± 1.342
AsnGln: 2.63 ± 1.117
AsnArg: 1.578 ± 0.868
AsnSer: 1.578 ± 0.834
AsnThr: 5.786 ± 1.251
AsnVal: 1.052 ± 0.556
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.578 ± 0.834
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.208 ± 3.675
ProCys: 1.578 ± 0.834
ProAsp: 5.786 ± 2.291
ProGlu: 3.682 ± 1.416
ProPhe: 3.156 ± 1.487
ProGly: 0.526 ± 0.278
ProHis: 1.052 ± 0.718
ProIle: 3.156 ± 1.183
ProLys: 4.208 ± 1.335
ProLeu: 2.63 ± 1.287
ProMet: 0.526 ± 0.743
ProAsn: 3.156 ± 2.216
ProPro: 3.156 ± 2.608
ProGln: 2.104 ± 0.895
ProArg: 2.104 ± 1.111
ProSer: 5.786 ± 2.257
ProThr: 7.365 ± 2.061
ProVal: 2.63 ± 0.973
ProTrp: 1.052 ± 0.556
ProTyr: 2.63 ± 2.232
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.26 ± 1.639
GlnCys: 0.526 ± 0.278
GlnAsp: 1.052 ± 0.556
GlnGlu: 1.052 ± 0.556
GlnPhe: 1.578 ± 1.015
GlnGly: 1.578 ± 0.834
GlnHis: 1.052 ± 0.928
GlnIle: 2.104 ± 0.895
GlnLys: 2.104 ± 0.895
GlnLeu: 5.786 ± 1.33
GlnMet: 0.526 ± 0.278
GlnAsn: 2.104 ± 0.895
GlnPro: 1.578 ± 0.71
GlnGln: 1.052 ± 0.556
GlnArg: 1.578 ± 0.868
GlnSer: 2.63 ± 0.973
GlnThr: 3.156 ± 1.098
GlnVal: 1.578 ± 0.838
GlnTrp: 1.578 ± 0.71
GlnTyr: 1.578 ± 0.834
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.682 ± 1.661
ArgCys: 0.526 ± 1.068
ArgAsp: 3.156 ± 1.098
ArgGlu: 2.104 ± 1.111
ArgPhe: 0.526 ± 0.825
ArgGly: 2.104 ± 0.804
ArgHis: 1.578 ± 1.522
ArgIle: 1.578 ± 0.834
ArgLys: 2.63 ± 0.645
ArgLeu: 2.104 ± 0.805
ArgMet: 0.526 ± 0.278
ArgAsn: 2.104 ± 1.111
ArgPro: 1.578 ± 1.885
ArgGln: 2.104 ± 0.846
ArgArg: 1.578 ± 0.868
ArgSer: 2.104 ± 1.111
ArgThr: 1.578 ± 0.71
ArgVal: 1.578 ± 0.868
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.63 ± 1.389
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.578 ± 1.015
SerCys: 0.526 ± 0.278
SerAsp: 4.208 ± 1.409
SerGlu: 3.156 ± 0.584
SerPhe: 4.208 ± 2.102
SerGly: 3.156 ± 2.642
SerHis: 3.156 ± 1.166
SerIle: 6.839 ± 2.613
SerLys: 6.312 ± 1.973
SerLeu: 7.365 ± 2.585
SerMet: 0.526 ± 0.278
SerAsn: 4.208 ± 1.306
SerPro: 4.208 ± 1.409
SerGln: 2.63 ± 1.389
SerArg: 1.052 ± 0.919
SerSer: 5.26 ± 2.574
SerThr: 2.63 ± 0.645
SerVal: 3.156 ± 1.7
SerTrp: 1.052 ± 0.919
SerTyr: 2.63 ± 1.389
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.208 ± 1.692
ThrCys: 1.052 ± 0.718
ThrAsp: 5.26 ± 1.639
ThrGlu: 2.63 ± 0.973
ThrPhe: 3.682 ± 1.297
ThrGly: 2.104 ± 0.846
ThrHis: 2.63 ± 1.389
ThrIle: 5.26 ± 1.798
ThrLys: 5.26 ± 1.145
ThrLeu: 10.521 ± 3.089
ThrMet: 1.052 ± 0.556
ThrAsn: 1.578 ± 0.834
ThrPro: 4.208 ± 1.556
ThrGln: 2.104 ± 1.111
ThrArg: 4.734 ± 2.131
ThrSer: 3.682 ± 1.449
ThrThr: 7.891 ± 2.461
ThrVal: 3.156 ± 1.216
ThrTrp: 1.578 ± 1.972
ThrTyr: 4.208 ± 1.625
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.104 ± 1.837
ValCys: 0.526 ± 1.565
ValAsp: 1.578 ± 0.71
ValGlu: 1.052 ± 0.556
ValPhe: 2.104 ± 1.289
ValGly: 2.63 ± 1.287
ValHis: 2.104 ± 0.805
ValIle: 3.682 ± 1.493
ValLys: 5.786 ± 3.056
ValLeu: 5.26 ± 2.43
ValMet: 0.526 ± 0.278
ValAsn: 2.63 ± 0.94
ValPro: 1.578 ± 0.834
ValGln: 1.052 ± 0.556
ValArg: 1.578 ± 0.834
ValSer: 2.63 ± 2.29
ValThr: 2.104 ± 2.021
ValVal: 1.578 ± 0.71
ValTrp: 0.526 ± 1.068
ValTyr: 2.104 ± 2.852
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.578 ± 0.838
TrpCys: 0.0 ± 0.0
TrpAsp: 0.526 ± 1.068
TrpGlu: 1.052 ± 0.556
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.526 ± 1.068
TrpIle: 0.0 ± 0.0
TrpLys: 0.526 ± 0.278
TrpLeu: 1.578 ± 0.834
TrpMet: 0.526 ± 0.278
TrpAsn: 1.052 ± 0.919
TrpPro: 0.526 ± 0.825
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.052 ± 0.919
TrpThr: 2.104 ± 1.111
TrpVal: 0.526 ± 0.278
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.682 ± 1.945
TyrCys: 0.526 ± 0.278
TyrAsp: 0.526 ± 0.278
TyrGlu: 0.526 ± 0.278
TyrPhe: 2.63 ± 3.125
TyrGly: 2.63 ± 0.973
TyrHis: 0.526 ± 0.278
TyrIle: 2.104 ± 1.111
TyrLys: 1.578 ± 1.304
TyrLeu: 4.208 ± 2.043
TyrMet: 0.526 ± 0.278
TyrAsn: 1.052 ± 0.556
TyrPro: 0.0 ± 0.0
TyrGln: 2.63 ± 1.869
TyrArg: 0.526 ± 0.825
TyrSer: 3.682 ± 1.416
TyrThr: 3.682 ± 1.078
TyrVal: 2.104 ± 1.111
TyrTrp: 0.526 ± 0.278
TyrTyr: 0.526 ± 0.278
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1902 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
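One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement and look at the spread of the recomputed frequency. The sketch below follows that reading and is not the database's own code: the function names, the fixed seed, the use of the population standard deviation as the error, and the one-letter toy sequences are all assumptions.

```python
import random
from statistics import pstdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, and the error is the standard deviation of the estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return pstdev(estimates)


# Toy input (hypothetical sequences, not the WCMV proteome).
toy = ["MALLA", "ALKAL", "MKTLL"]
print(bootstrap_error(toy, "AL"))
print(bootstrap_error(toy, "WW"))  # absent pair: every replicate gives 0.0
```

A dipeptide that occurs in no protein yields zero in every replicate, which is consistent with the "0.0 ± 0.0" entries in the table.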

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski