Amino acid dipepetide frequency for Maize chlorotic mottle virus (isolate United States/Kansas/1987) (MCMV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.367AlaAla: 4.367 ± 4.786
0.0AlaCys: 0.0 ± 0.0
3.821AlaAsp: 3.821 ± 2.498
2.183AlaGlu: 2.183 ± 1.241
2.729AlaPhe: 2.729 ± 1.153
4.913AlaGly: 4.913 ± 3.021
0.0AlaHis: 0.0 ± 0.0
4.913AlaIle: 4.913 ± 1.954
3.275AlaLys: 3.275 ± 1.262
6.55AlaLeu: 6.55 ± 2.242
1.638AlaMet: 1.638 ± 0.733
2.729AlaAsn: 2.729 ± 1.451
3.275AlaPro: 3.275 ± 1.386
2.729AlaGln: 2.729 ± 0.512
5.459AlaArg: 5.459 ± 1.288
4.913AlaSer: 4.913 ± 1.031
3.821AlaThr: 3.821 ± 2.31
7.096AlaVal: 7.096 ± 1.059
1.092AlaTrp: 1.092 ± 0.615
2.729AlaTyr: 2.729 ± 1.234
0.0AlaXaa: 0.0 ± 0.0
Cys
1.638CysAla: 1.638 ± 1.136
0.0CysCys: 0.0 ± 0.0
0.546CysAsp: 0.546 ± 0.678
0.546CysGlu: 0.546 ± 0.337
0.0CysPhe: 0.0 ± 0.0
1.638CysGly: 1.638 ± 0.693
0.0CysHis: 0.0 ± 0.0
1.092CysIle: 1.092 ± 0.732
1.092CysLys: 1.092 ± 1.355
3.821CysLeu: 3.821 ± 1.202
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
2.729CysPro: 2.729 ± 1.121
1.638CysGln: 1.638 ± 0.693
2.729CysArg: 2.729 ± 1.255
2.729CysSer: 2.729 ± 0.512
0.546CysThr: 0.546 ± 0.678
2.729CysVal: 2.729 ± 2.866
0.0CysTrp: 0.0 ± 0.0
0.546CysTyr: 0.546 ± 0.718
0.0CysXaa: 0.0 ± 0.0
Asp
3.821AspAla: 3.821 ± 2.473
1.092AspCys: 1.092 ± 0.62
4.367AspAsp: 4.367 ± 2.481
2.183AspGlu: 2.183 ± 0.884
2.183AspPhe: 2.183 ± 0.706
3.275AspGly: 3.275 ± 1.861
0.0AspHis: 0.0 ± 0.0
1.638AspIle: 1.638 ± 1.198
1.638AspLys: 1.638 ± 0.757
1.638AspLeu: 1.638 ± 0.693
1.092AspMet: 1.092 ± 0.595
1.092AspAsn: 1.092 ± 0.615
5.459AspPro: 5.459 ± 2.238
2.183AspGln: 2.183 ± 1.25
1.638AspArg: 1.638 ± 1.478
3.821AspSer: 3.821 ± 1.999
4.367AspThr: 4.367 ± 3.122
3.275AspVal: 3.275 ± 1.55
0.546AspTrp: 0.546 ± 0.678
1.092AspTyr: 1.092 ± 1.016
0.0AspXaa: 0.0 ± 0.0
Glu
1.638GluAla: 1.638 ± 0.81
2.183GluCys: 2.183 ± 1.106
2.183GluAsp: 2.183 ± 1.607
3.821GluGlu: 3.821 ± 1.848
2.729GluPhe: 2.729 ± 1.151
1.638GluGly: 1.638 ± 0.693
2.183GluHis: 2.183 ± 0.958
2.729GluIle: 2.729 ± 1.451
1.092GluLys: 1.092 ± 0.674
6.004GluLeu: 6.004 ± 2.063
0.546GluMet: 0.546 ± 0.337
2.183GluAsn: 2.183 ± 1.225
2.729GluPro: 2.729 ± 1.204
4.913GluGln: 4.913 ± 1.814
3.275GluArg: 3.275 ± 0.929
2.729GluSer: 2.729 ± 1.102
3.821GluThr: 3.821 ± 0.762
2.729GluVal: 2.729 ± 1.255
2.729GluTrp: 2.729 ± 1.86
1.638GluTyr: 1.638 ± 1.01
0.0GluXaa: 0.0 ± 0.0
Phe
0.546PheAla: 0.546 ± 0.851
1.638PheCys: 1.638 ± 0.693
3.275PheAsp: 3.275 ± 0.929
1.638PheGlu: 1.638 ± 0.683
0.0PhePhe: 0.0 ± 0.0
2.183PheGly: 2.183 ± 1.347
0.0PheHis: 0.0 ± 0.0
1.092PheIle: 1.092 ± 1.563
1.638PheLys: 1.638 ± 1.01
2.729PheLeu: 2.729 ± 1.153
1.092PheMet: 1.092 ± 0.62
4.367PheAsn: 4.367 ± 1.481
1.638PhePro: 1.638 ± 1.136
4.367PheGln: 4.367 ± 1.282
2.729PheArg: 2.729 ± 1.153
1.638PheSer: 1.638 ± 0.683
2.183PheThr: 2.183 ± 0.711
3.275PheVal: 3.275 ± 0.929
1.638PheTrp: 1.638 ± 1.01
1.092PheTyr: 1.092 ± 0.62
0.0PheXaa: 0.0 ± 0.0
Gly
4.367GlyAla: 4.367 ± 1.207
0.546GlyCys: 0.546 ± 0.337
2.729GlyAsp: 2.729 ± 1.102
1.092GlyGlu: 1.092 ± 0.674
1.638GlyPhe: 1.638 ± 1.01
5.459GlyGly: 5.459 ± 2.442
1.638GlyHis: 1.638 ± 0.693
5.459GlyIle: 5.459 ± 0.883
2.729GlyLys: 2.729 ± 1.227
8.734GlyLeu: 8.734 ± 2.051
0.0GlyMet: 0.0 ± 0.0
1.092GlyAsn: 1.092 ± 0.674
4.913GlyPro: 4.913 ± 1.807
1.638GlyGln: 1.638 ± 1.551
5.459GlyArg: 5.459 ± 1.775
1.638GlySer: 1.638 ± 0.844
5.459GlyThr: 5.459 ± 2.213
2.729GlyVal: 2.729 ± 1.207
1.638GlyTrp: 1.638 ± 1.294
3.275GlyTyr: 3.275 ± 1.116
0.0GlyXaa: 0.0 ± 0.0
His
1.638HisAla: 1.638 ± 1.01
0.0HisCys: 0.0 ± 0.0
0.546HisAsp: 0.546 ± 0.337
1.092HisGlu: 1.092 ± 1.369
1.638HisPhe: 1.638 ± 0.693
1.638HisGly: 1.638 ± 1.01
0.546HisHis: 0.546 ± 0.337
0.546HisIle: 0.546 ± 0.337
0.546HisLys: 0.546 ± 0.337
1.092HisLeu: 1.092 ± 0.615
0.0HisMet: 0.0 ± 0.702
2.729HisAsn: 2.729 ± 1.227
1.638HisPro: 1.638 ± 1.01
0.0HisGln: 0.0 ± 0.0
2.183HisArg: 2.183 ± 2.001
1.638HisSer: 1.638 ± 1.294
1.638HisThr: 1.638 ± 0.693
1.638HisVal: 1.638 ± 1.01
0.546HisTrp: 0.546 ± 0.337
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.275IleAla: 3.275 ± 1.104
1.092IleCys: 1.092 ± 0.674
2.183IleAsp: 2.183 ± 2.472
1.638IleGlu: 1.638 ± 0.733
1.092IlePhe: 1.092 ± 0.674
2.183IleGly: 2.183 ± 0.706
1.638IleHis: 1.638 ± 1.076
0.546IleIle: 0.546 ± 0.718
1.638IleLys: 1.638 ± 0.81
7.096IleLeu: 7.096 ± 4.783
0.546IleMet: 0.546 ± 0.337
1.638IleAsn: 1.638 ± 0.683
6.004IlePro: 6.004 ± 1.59
3.275IleGln: 3.275 ± 0.296
2.729IleArg: 2.729 ± 0.764
5.459IleSer: 5.459 ± 1.477
2.729IleThr: 2.729 ± 0.512
3.821IleVal: 3.821 ± 2.997
0.546IleTrp: 0.546 ± 0.337
1.638IleTyr: 1.638 ± 0.733
0.0IleXaa: 0.0 ± 0.0
Lys
3.821LysAla: 3.821 ± 1.731
1.092LysCys: 1.092 ± 0.62
2.729LysAsp: 2.729 ± 1.151
2.183LysGlu: 2.183 ± 1.229
1.638LysPhe: 1.638 ± 1.353
1.638LysGly: 1.638 ± 1.01
2.183LysHis: 2.183 ± 1.347
3.275LysIle: 3.275 ± 1.271
1.092LysLys: 1.092 ± 0.674
5.459LysLeu: 5.459 ± 2.728
1.092LysMet: 1.092 ± 0.886
1.638LysAsn: 1.638 ± 0.757
0.0LysPro: 0.0 ± 0.0
3.275LysGln: 3.275 ± 1.262
1.638LysArg: 1.638 ± 0.733
2.729LysSer: 2.729 ± 0.512
2.729LysThr: 2.729 ± 1.339
0.0LysVal: 0.0 ± 0.0
1.092LysTrp: 1.092 ± 0.732
0.0LysTyr: 0.0 ± 0.0
0.546LysXaa: 0.546 ± 0.337
Leu
7.642LeuAla: 7.642 ± 2.135
2.729LeuCys: 2.729 ± 1.255
3.821LeuAsp: 3.821 ± 2.471
5.459LeuGlu: 5.459 ± 0.634
2.729LeuPhe: 2.729 ± 1.204
8.734LeuGly: 8.734 ± 1.702
0.0LeuHis: 0.0 ± 0.0
3.275LeuIle: 3.275 ± 2.153
3.821LeuLys: 3.821 ± 1.731
7.096LeuLeu: 7.096 ± 1.12
1.092LeuMet: 1.092 ± 0.674
1.092LeuAsn: 1.092 ± 0.674
5.459LeuPro: 5.459 ± 1.639
3.275LeuGln: 3.275 ± 1.025
5.459LeuArg: 5.459 ± 2.16
9.825LeuSer: 9.825 ± 1.912
6.55LeuThr: 6.55 ± 2.353
4.913LeuVal: 4.913 ± 1.508
1.092LeuTrp: 1.092 ± 0.674
2.183LeuTyr: 2.183 ± 1.347
0.0LeuXaa: 0.0 ± 0.0
Met
2.729MetAla: 2.729 ± 1.387
0.546MetCys: 0.546 ± 0.678
0.0MetAsp: 0.0 ± 0.0
2.183MetGlu: 2.183 ± 1.106
0.546MetPhe: 0.546 ± 0.678
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.092MetLys: 1.092 ± 0.674
1.638MetLeu: 1.638 ± 0.844
1.092MetMet: 1.092 ± 0.674
1.638MetAsn: 1.638 ± 1.01
2.183MetPro: 2.183 ± 0.711
0.546MetGln: 0.546 ± 0.337
2.183MetArg: 2.183 ± 1.347
2.729MetSer: 2.729 ± 0.716
1.638MetThr: 1.638 ± 1.01
0.546MetVal: 0.546 ± 0.337
0.546MetTrp: 0.546 ± 0.851
1.092MetTyr: 1.092 ± 0.732
0.0MetXaa: 0.0 ± 0.0
Asn
0.546AsnAla: 0.546 ± 0.337
1.638AsnCys: 1.638 ± 0.733
0.546AsnAsp: 0.546 ± 0.678
1.092AsnGlu: 1.092 ± 0.62
2.183AsnPhe: 2.183 ± 1.225
3.275AsnGly: 3.275 ± 1.298
2.183AsnHis: 2.183 ± 2.26
1.638AsnIle: 1.638 ± 0.683
0.546AsnLys: 0.546 ± 0.337
4.367AsnLeu: 4.367 ± 1.192
1.092AsnMet: 1.092 ± 0.674
2.183AsnAsn: 2.183 ± 0.884
4.367AsnPro: 4.367 ± 1.339
0.546AsnGln: 0.546 ± 0.718
2.729AsnArg: 2.729 ± 2.272
3.275AsnSer: 3.275 ± 1.116
1.092AsnThr: 1.092 ± 0.732
2.729AsnVal: 2.729 ± 0.713
0.546AsnTrp: 0.546 ± 0.337
1.092AsnTyr: 1.092 ± 0.674
0.546AsnXaa: 0.546 ± 0.718
Pro
5.459ProAla: 5.459 ± 1.775
1.638ProCys: 1.638 ± 0.693
2.729ProAsp: 2.729 ± 1.855
5.459ProGlu: 5.459 ± 1.639
1.638ProPhe: 1.638 ± 1.01
2.729ProGly: 2.729 ± 2.571
0.546ProHis: 0.546 ± 0.337
3.275ProIle: 3.275 ± 1.074
3.821ProLys: 3.821 ± 1.098
4.913ProLeu: 4.913 ± 1.649
0.546ProMet: 0.546 ± 0.678
2.183ProAsn: 2.183 ± 1.533
4.367ProPro: 4.367 ± 2.232
2.729ProGln: 2.729 ± 2.125
3.821ProArg: 3.821 ± 1.304
3.821ProSer: 3.821 ± 2.578
8.734ProThr: 8.734 ± 1.617
5.459ProVal: 5.459 ± 2.307
0.546ProTrp: 0.546 ± 0.718
1.638ProTyr: 1.638 ± 0.733
0.0ProXaa: 0.0 ± 0.0
Gln
3.275GlnAla: 3.275 ± 1.619
1.092GlnCys: 1.092 ± 1.437
1.092GlnAsp: 1.092 ± 1.087
3.275GlnGlu: 3.275 ± 2.021
1.638GlnPhe: 1.638 ± 0.81
1.092GlnGly: 1.092 ± 0.615
1.092GlnHis: 1.092 ± 0.615
6.004GlnIle: 6.004 ± 1.994
1.638GlnLys: 1.638 ± 2.262
3.275GlnLeu: 3.275 ± 2.588
2.729GlnMet: 2.729 ± 1.431
2.729GlnAsn: 2.729 ± 1.207
2.729GlnPro: 2.729 ± 0.512
3.821GlnGln: 3.821 ± 3.292
1.638GlnArg: 1.638 ± 1.551
3.275GlnSer: 3.275 ± 0.867
4.367GlnThr: 4.367 ± 2.064
3.275GlnVal: 3.275 ± 1.074
1.638GlnTrp: 1.638 ± 1.614
1.638GlnTyr: 1.638 ± 0.683
0.0GlnXaa: 0.0 ± 0.0
Arg
5.459ArgAla: 5.459 ± 1.023
2.729ArgCys: 2.729 ± 1.305
1.638ArgAsp: 1.638 ± 1.01
3.275ArgGlu: 3.275 ± 0.929
3.821ArgPhe: 3.821 ± 1.719
4.367ArgGly: 4.367 ± 2.877
4.913ArgHis: 4.913 ± 2.025
2.729ArgIle: 2.729 ± 1.387
3.821ArgLys: 3.821 ± 1.731
3.275ArgLeu: 3.275 ± 1.844
1.638ArgMet: 1.638 ± 0.683
5.459ArgAsn: 5.459 ± 1.545
4.367ArgPro: 4.367 ± 1.629
2.183ArgGln: 2.183 ± 1.229
3.275ArgArg: 3.275 ± 1.366
5.459ArgSer: 5.459 ± 1.746
3.821ArgThr: 3.821 ± 1.731
3.275ArgVal: 3.275 ± 1.535
2.729ArgTrp: 2.729 ± 1.684
3.275ArgTyr: 3.275 ± 0.942
0.0ArgXaa: 0.0 ± 0.0
Ser
5.459SerAla: 5.459 ± 2.581
1.638SerCys: 1.638 ± 0.683
5.459SerAsp: 5.459 ± 3.585
3.821SerGlu: 3.821 ± 1.694
1.638SerPhe: 1.638 ± 0.733
5.459SerGly: 5.459 ± 2.244
0.546SerHis: 0.546 ± 0.337
4.913SerIle: 4.913 ± 2.37
2.183SerLys: 2.183 ± 0.958
7.096SerLeu: 7.096 ± 2.199
1.638SerMet: 1.638 ± 1.551
1.092SerAsn: 1.092 ± 0.674
3.275SerPro: 3.275 ± 1.802
3.821SerGln: 3.821 ± 1.098
8.188SerArg: 8.188 ± 0.784
6.004SerSer: 6.004 ± 3.706
3.821SerThr: 3.821 ± 0.374
6.004SerVal: 6.004 ± 2.428
1.638SerTrp: 1.638 ± 1.198
1.638SerTyr: 1.638 ± 1.551
0.0SerXaa: 0.0 ± 0.0
Thr
6.004ThrAla: 6.004 ± 1.215
1.092ThrCys: 1.092 ± 1.074
4.367ThrAsp: 4.367 ± 2.28
2.183ThrGlu: 2.183 ± 2.71
5.459ThrPhe: 5.459 ± 1.288
4.367ThrGly: 4.367 ± 0.652
0.546ThrHis: 0.546 ± 0.337
2.183ThrIle: 2.183 ± 1.541
3.275ThrLys: 3.275 ± 1.638
4.367ThrLeu: 4.367 ± 1.235
3.275ThrMet: 3.275 ± 1.965
1.638ThrAsn: 1.638 ± 2.033
4.367ThrPro: 4.367 ± 1.501
5.459ThrGln: 5.459 ± 2.304
7.096ThrArg: 7.096 ± 1.486
3.275ThrSer: 3.275 ± 1.3
3.821ThrThr: 3.821 ± 2.064
4.367ThrVal: 4.367 ± 1.246
1.092ThrTrp: 1.092 ± 1.016
0.546ThrTyr: 0.546 ± 0.678
0.0ThrXaa: 0.0 ± 0.0
Val
4.913ValAla: 4.913 ± 2.536
3.275ValCys: 3.275 ± 2.433
3.275ValAsp: 3.275 ± 1.426
6.55ValGlu: 6.55 ± 1.461
3.821ValPhe: 3.821 ± 1.647
3.821ValGly: 3.821 ± 0.374
2.183ValHis: 2.183 ± 0.617
2.729ValIle: 2.729 ± 1.255
3.275ValLys: 3.275 ± 1.447
3.275ValLeu: 3.275 ± 1.065
1.092ValMet: 1.092 ± 0.732
1.638ValAsn: 1.638 ± 0.733
3.275ValPro: 3.275 ± 2.216
2.183ValGln: 2.183 ± 0.884
4.913ValArg: 4.913 ± 0.962
3.275ValSer: 3.275 ± 1.515
3.275ValThr: 3.275 ± 1.466
2.183ValVal: 2.183 ± 1.268
0.546ValTrp: 0.546 ± 0.337
3.275ValTyr: 3.275 ± 2.104
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.638TrpGlu: 1.638 ± 0.757
0.546TrpPhe: 0.546 ± 0.678
1.092TrpGly: 1.092 ± 0.62
0.546TrpHis: 0.546 ± 0.718
0.0TrpIle: 0.0 ± 0.0
1.638TrpLys: 1.638 ± 0.683
2.729TrpLeu: 2.729 ± 1.21
1.638TrpMet: 1.638 ± 0.733
0.546TrpAsn: 0.546 ± 0.851
1.638TrpPro: 1.638 ± 1.076
1.638TrpGln: 1.638 ± 0.683
2.183TrpArg: 2.183 ± 1.229
2.729TrpSer: 2.729 ± 1.86
1.092TrpThr: 1.092 ± 0.62
0.546TrpVal: 0.546 ± 0.337
0.546TrpTrp: 0.546 ± 0.337
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.092TyrAla: 1.092 ± 0.674
0.0TyrCys: 0.0 ± 0.0
1.092TyrAsp: 1.092 ± 0.62
2.183TyrGlu: 2.183 ± 0.958
1.092TyrPhe: 1.092 ± 0.62
2.183TyrGly: 2.183 ± 1.241
1.092TyrHis: 1.092 ± 1.087
2.729TyrIle: 2.729 ± 1.451
0.0TyrLys: 0.0 ± 0.0
1.092TyrLeu: 1.092 ± 0.674
0.546TyrMet: 0.546 ± 0.851
0.546TyrAsn: 0.546 ± 0.337
1.638TyrPro: 1.638 ± 0.693
1.092TyrGln: 1.092 ± 0.615
2.183TyrArg: 2.183 ± 0.916
4.367TyrSer: 4.367 ± 1.755
3.275TyrThr: 3.275 ± 1.466
2.183TyrVal: 2.183 ± 1.347
0.0TyrTrp: 0.0 ± 0.0
1.638TyrTyr: 1.638 ± 1.01
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.546XaaAla: 0.546 ± 0.718
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.546XaaGly: 0.546 ± 0.337
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1833 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski