Amino acid dipeptide frequency for Cacao mild mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
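As a rough illustration of how such a frequency could be computed, the sketch below counts overlapping residue pairs within each protein and normalizes by the total number of pairs. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, the sequences use one-letter codes rather than the three-letter codes of the table, and the exact normalization used by the database is not stated on this page.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not taken from the proteome.
freqs = dipeptide_per_mille(["MEEKL", "EEL"])
```

Here "MEEKL" contributes four pairs and "EEL" two, so the pair EE (seen twice) accounts for one third of all pairs.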

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
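The comparison against the uniform expectation can be sketched as follows. The function `classify` and its labels are hypothetical; the page does not state whether the colouring uses a plain greater-than/less-than test or some tolerance, so the simple comparison here is an assumption.

```python
N_STANDARD = 20

# Expected per mille frequency if all 20 x 20 dipeptides were equally likely.
expected = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25%

def classify(observed, expected=expected):
    """Label a per mille frequency relative to the uniform expectation."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"

# Two values taken from the table: GluGlu and AlaCys.
print(classify(14.799))  # over
print(classify(0.846))   # under
```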

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
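For example, the conversion from a per mille value to a fraction or a percentage is a single multiplication. The helper name `per_mille_to_fraction` is hypothetical; the input is the GluGlu value from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

glu_glu = per_mille_to_fraction(14.799)  # GluGlu value from the table
percent = glu_glu * 100                  # the same quantity as a percentage
```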

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency (‰) ± error, grouped by first amino acid. Within each group the second amino acid follows this order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 1.691 ± 0.85
AlaCys: 0.846 ± 1.303
AlaAsp: 2.537 ± 1.076
AlaGlu: 6.342 ± 2.273
AlaPhe: 2.114 ± 1.062
AlaGly: 1.268 ± 1.195
AlaHis: 0.423 ± 0.212
AlaIle: 4.228 ± 3.004
AlaLys: 3.383 ± 2.047
AlaLeu: 4.228 ± 3.66
AlaMet: 1.268 ± 0.637
AlaAsn: 1.691 ± 3.428
AlaPro: 1.691 ± 0.85
AlaGln: 4.228 ± 2.434
AlaArg: 2.96 ± 1.132
AlaSer: 3.383 ± 3.631
AlaThr: 2.537 ± 3.366
AlaVal: 3.383 ± 1.351
AlaTrp: 1.268 ± 2.025
AlaTyr: 2.537 ± 1.749
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.846 ± 0.425
CysCys: 0.846 ± 0.425
CysAsp: 0.0 ± 0.0
CysGlu: 0.423 ± 1.434
CysPhe: 0.423 ± 0.212
CysGly: 0.846 ± 0.425
CysHis: 0.0 ± 0.0
CysIle: 0.423 ± 0.212
CysLys: 3.383 ± 1.202
CysLeu: 1.268 ± 1.352
CysMet: 0.423 ± 0.212
CysAsn: 1.691 ± 0.85
CysPro: 0.423 ± 0.212
CysGln: 0.846 ± 0.425
CysArg: 0.846 ± 0.425
CysSer: 0.423 ± 0.212
CysThr: 1.691 ± 1.117
CysVal: 0.846 ± 0.425
CysTrp: 0.0 ± 0.0
CysTyr: 0.423 ± 0.212
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.96 ± 2.303
AspCys: 1.691 ± 1.143
AspAsp: 4.228 ± 1.437
AspGlu: 3.805 ± 1.305
AspPhe: 2.537 ± 1.275
AspGly: 1.691 ± 1.143
AspHis: 0.846 ± 0.425
AspIle: 3.383 ± 1.7
AspLys: 2.537 ± 1.659
AspLeu: 5.497 ± 3.957
AspMet: 0.846 ± 0.425
AspAsn: 3.383 ± 2.233
AspPro: 3.805 ± 2.411
AspGln: 2.537 ± 1.275
AspArg: 2.96 ± 1.129
AspSer: 1.268 ± 1.352
AspThr: 1.691 ± 0.85
AspVal: 2.114 ± 1.076
AspTrp: 1.268 ± 0.637
AspTyr: 0.423 ± 0.212
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.651 ± 4.178
GluCys: 1.268 ± 0.637
GluAsp: 4.228 ± 2.125
GluGlu: 14.799 ± 1.493
GluPhe: 3.805 ± 2.183
GluGly: 3.383 ± 1.171
GluHis: 3.383 ± 1.7
GluIle: 6.342 ± 3.295
GluLys: 6.765 ± 4.989
GluLeu: 7.188 ± 4.066
GluMet: 0.423 ± 0.212
GluAsn: 5.074 ± 2.123
GluPro: 2.96 ± 1.487
GluGln: 4.228 ± 2.354
GluArg: 2.114 ± 2.552
GluSer: 4.228 ± 1.437
GluThr: 7.188 ± 1.599
GluVal: 4.651 ± 2.182
GluTrp: 1.691 ± 1.143
GluTyr: 3.383 ± 1.202
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.268 ± 0.637
PheCys: 0.423 ± 0.212
PheAsp: 2.96 ± 1.487
PheGlu: 2.537 ± 1.275
PhePhe: 0.423 ± 0.212
PheGly: 1.691 ± 0.85
PheHis: 0.846 ± 1.303
PheIle: 3.383 ± 3.041
PheLys: 1.691 ± 1.247
PheLeu: 1.691 ± 0.85
PheMet: 1.691 ± 0.85
PheAsn: 1.691 ± 0.85
PhePro: 0.423 ± 0.212
PheGln: 1.268 ± 0.637
PheArg: 1.268 ± 0.637
PheSer: 2.96 ± 1.487
PheThr: 2.114 ± 1.062
PheVal: 1.691 ± 0.85
PheTrp: 0.0 ± 0.0
PheTyr: 2.114 ± 1.062
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.691 ± 1.117
GlyCys: 0.846 ± 0.425
GlyAsp: 3.383 ± 3.683
GlyGlu: 5.92 ± 1.268
GlyPhe: 2.114 ± 1.172
GlyGly: 2.537 ± 1.275
GlyHis: 1.268 ± 0.637
GlyIle: 1.691 ± 0.85
GlyLys: 4.228 ± 2.125
GlyLeu: 3.383 ± 1.171
GlyMet: 0.423 ± 0.212
GlyAsn: 2.537 ± 1.076
GlyPro: 2.96 ± 1.132
GlyGln: 1.691 ± 0.85
GlyArg: 3.383 ± 1.196
GlySer: 2.537 ± 1.076
GlyThr: 5.074 ± 2.265
GlyVal: 4.228 ± 1.434
GlyTrp: 1.268 ± 0.637
GlyTyr: 2.537 ± 1.275
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.846 ± 0.425
HisCys: 0.423 ± 0.212
HisAsp: 0.846 ± 0.425
HisGlu: 0.0 ± 0.0
HisPhe: 1.268 ± 0.637
HisGly: 2.114 ± 1.076
HisHis: 0.423 ± 0.212
HisIle: 3.383 ± 1.202
HisLys: 2.114 ± 1.679
HisLeu: 3.383 ± 2.233
HisMet: 0.846 ± 0.425
HisAsn: 0.846 ± 0.425
HisPro: 0.0 ± 0.0
HisGln: 1.268 ± 0.637
HisArg: 2.114 ± 1.076
HisSer: 1.268 ± 1.195
HisThr: 0.423 ± 0.212
HisVal: 1.268 ± 0.637
HisTrp: 1.691 ± 0.85
HisTyr: 0.423 ± 0.212
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.651 ± 2.781
IleCys: 1.268 ± 1.352
IleAsp: 3.805 ± 2.768
IleGlu: 5.92 ± 1.229
IlePhe: 1.268 ± 0.637
IleGly: 4.228 ± 2.125
IleHis: 2.114 ± 1.076
IleIle: 7.188 ± 3.612
IleLys: 5.074 ± 2.615
IleLeu: 4.228 ± 2.31
IleMet: 1.691 ± 0.896
IleAsn: 4.228 ± 2.125
IlePro: 3.805 ± 1.912
IleGln: 3.805 ± 1.912
IleArg: 3.805 ± 1.305
IleSer: 5.92 ± 1.229
IleThr: 3.383 ± 2.287
IleVal: 1.691 ± 0.85
IleTrp: 0.0 ± 0.0
IleTyr: 3.805 ± 1.912
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.651 ± 4.106
LysCys: 1.268 ± 0.637
LysAsp: 3.805 ± 1.223
LysGlu: 9.725 ± 0.987
LysPhe: 2.537 ± 1.275
LysGly: 3.383 ± 1.468
LysHis: 2.96 ± 1.487
LysIle: 5.497 ± 2.184
LysLys: 4.651 ± 1.58
LysLeu: 8.034 ± 2.936
LysMet: 2.114 ± 1.076
LysAsn: 3.383 ± 1.196
LysPro: 2.96 ± 1.118
LysGln: 4.228 ± 2.614
LysArg: 2.114 ± 1.83
LysSer: 4.228 ± 1.434
LysThr: 2.96 ± 1.132
LysVal: 3.805 ± 2.768
LysTrp: 0.423 ± 0.212
LysTyr: 1.691 ± 0.85
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.497 ± 5.119
LeuCys: 1.691 ± 1.117
LeuAsp: 2.96 ± 1.377
LeuGlu: 9.302 ± 6.423
LeuPhe: 2.114 ± 1.062
LeuGly: 6.342 ± 3.063
LeuHis: 2.114 ± 1.076
LeuIle: 5.497 ± 1.214
LeuLys: 8.034 ± 2.065
LeuLeu: 6.342 ± 1.886
LeuMet: 0.423 ± 0.212
LeuAsn: 4.651 ± 1.055
LeuPro: 4.651 ± 1.586
LeuGln: 3.383 ± 1.171
LeuArg: 4.651 ± 1.055
LeuSer: 6.765 ± 1.488
LeuThr: 5.92 ± 4.519
LeuVal: 4.651 ± 2.47
LeuTrp: 0.0 ± 0.0
LeuTyr: 0.846 ± 1.303
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.268 ± 1.195
MetCys: 0.0 ± 0.0
MetAsp: 1.268 ± 1.195
MetGlu: 2.114 ± 1.062
MetPhe: 0.846 ± 0.425
MetGly: 1.691 ± 0.85
MetHis: 0.0 ± 0.0
MetIle: 1.268 ± 0.637
MetLys: 0.846 ± 0.425
MetLeu: 1.691 ± 0.85
MetMet: 1.268 ± 0.637
MetAsn: 1.268 ± 0.637
MetPro: 1.268 ± 0.637
MetGln: 2.114 ± 1.062
MetArg: 1.268 ± 1.195
MetSer: 2.114 ± 1.83
MetThr: 2.114 ± 1.062
MetVal: 0.846 ± 0.425
MetTrp: 0.0 ± 0.0
MetTyr: 1.268 ± 0.637
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.114 ± 1.076
AsnCys: 1.268 ± 0.637
AsnAsp: 2.537 ± 1.275
AsnGlu: 2.114 ± 1.172
AsnPhe: 2.114 ± 1.062
AsnGly: 2.96 ± 1.487
AsnHis: 1.691 ± 1.117
AsnIle: 2.537 ± 1.094
AsnLys: 2.96 ± 1.487
AsnLeu: 6.765 ± 2.887
AsnMet: 0.846 ± 0.425
AsnAsn: 1.268 ± 1.224
AsnPro: 1.691 ± 1.117
AsnGln: 2.537 ± 1.094
AsnArg: 3.383 ± 2.74
AsnSer: 3.383 ± 1.351
AsnThr: 3.383 ± 2.047
AsnVal: 1.691 ± 0.85
AsnTrp: 1.691 ± 0.85
AsnTyr: 1.268 ± 0.637
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.268 ± 1.352
ProCys: 0.0 ± 0.0
ProAsp: 2.537 ± 1.275
ProGlu: 3.805 ± 1.912
ProPhe: 0.846 ± 0.425
ProGly: 2.96 ± 1.487
ProHis: 0.423 ± 0.212
ProIle: 2.96 ± 1.129
ProLys: 4.228 ± 1.122
ProLeu: 2.96 ± 1.487
ProMet: 1.268 ± 1.195
ProAsn: 1.268 ± 0.637
ProPro: 2.537 ± 1.133
ProGln: 3.805 ± 1.355
ProArg: 1.691 ± 1.117
ProSer: 2.537 ± 1.275
ProThr: 5.074 ± 2.265
ProVal: 0.846 ± 0.425
ProTrp: 0.846 ± 0.425
ProTyr: 2.114 ± 1.076
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.651 ± 1.065
GlnCys: 1.268 ± 0.637
GlnAsp: 2.537 ± 1.275
GlnGlu: 5.497 ± 1.911
GlnPhe: 0.846 ± 0.425
GlnGly: 2.96 ± 1.132
GlnHis: 2.114 ± 1.076
GlnIle: 5.92 ± 0.777
GlnLys: 3.383 ± 1.351
GlnLeu: 2.114 ± 1.172
GlnMet: 2.537 ± 1.275
GlnAsn: 1.691 ± 0.85
GlnPro: 3.383 ± 1.202
GlnGln: 3.383 ± 1.7
GlnArg: 3.805 ± 1.912
GlnSer: 1.268 ± 0.637
GlnThr: 2.114 ± 1.076
GlnVal: 3.805 ± 3.182
GlnTrp: 0.846 ± 0.425
GlnTyr: 2.537 ± 3.139
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.805 ± 1.223
ArgCys: 0.846 ± 0.425
ArgAsp: 0.846 ± 1.303
ArgGlu: 2.114 ± 1.076
ArgPhe: 1.691 ± 0.85
ArgGly: 2.537 ± 1.749
ArgHis: 0.423 ± 0.212
ArgIle: 4.228 ± 1.434
ArgLys: 2.96 ± 1.129
ArgLeu: 5.497 ± 1.911
ArgMet: 0.846 ± 1.092
ArgAsn: 3.805 ± 1.245
ArgPro: 1.691 ± 2.896
ArgGln: 4.651 ± 1.58
ArgArg: 3.383 ± 1.171
ArgSer: 5.497 ± 2.19
ArgThr: 3.805 ± 1.155
ArgVal: 2.537 ± 1.659
ArgTrp: 1.268 ± 1.224
ArgTyr: 1.268 ± 0.637
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 1.691 ± 2.96
SerCys: 0.423 ± 0.212
SerAsp: 4.228 ± 2.197
SerGlu: 4.651 ± 1.58
SerPhe: 2.537 ± 1.275
SerGly: 2.96 ± 1.487
SerHis: 0.846 ± 1.334
SerIle: 4.651 ± 2.141
SerLys: 4.651 ± 1.58
SerLeu: 6.342 ± 3.063
SerMet: 1.691 ± 0.779
SerAsn: 4.228 ± 2.434
SerPro: 2.537 ± 1.133
SerGln: 4.228 ± 3.495
SerArg: 5.074 ± 3.696
SerSer: 2.96 ± 2.26
SerThr: 5.497 ± 2.762
SerVal: 2.114 ± 1.062
SerTrp: 1.268 ± 0.637
SerTyr: 1.691 ± 0.85
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.383 ± 3.041
ThrCys: 0.846 ± 0.425
ThrAsp: 4.228 ± 1.349
ThrGlu: 5.074 ± 5.776
ThrPhe: 0.846 ± 0.425
ThrGly: 5.92 ± 2.101
ThrHis: 1.268 ± 0.637
ThrIle: 3.383 ± 1.7
ThrLys: 5.497 ± 2.893
ThrLeu: 6.342 ± 3.763
ThrMet: 2.96 ± 1.125
ThrAsn: 1.691 ± 1.848
ThrPro: 2.114 ± 1.062
ThrGln: 4.228 ± 2.534
ThrArg: 2.114 ± 1.062
ThrSer: 6.342 ± 1.194
ThrThr: 1.268 ± 0.637
ThrVal: 3.805 ± 2.232
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.423 ± 0.212
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.537 ± 1.094
ValCys: 0.0 ± 0.0
ValAsp: 1.691 ± 1.247
ValGlu: 3.805 ± 3.431
ValPhe: 3.383 ± 2.495
ValGly: 1.691 ± 0.85
ValHis: 2.114 ± 1.098
ValIle: 2.537 ± 1.275
ValLys: 3.383 ± 1.202
ValLeu: 3.805 ± 3.672
ValMet: 0.846 ± 0.425
ValAsn: 2.114 ± 1.172
ValPro: 2.537 ± 1.275
ValGln: 0.846 ± 0.425
ValArg: 3.383 ± 1.171
ValSer: 5.074 ± 2.123
ValThr: 2.96 ± 1.129
ValVal: 2.114 ± 1.83
ValTrp: 0.0 ± 0.0
ValTyr: 2.537 ± 1.094
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.423 ± 0.212
TrpCys: 0.423 ± 0.212
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.691 ± 1.143
TrpPhe: 0.0 ± 0.0
TrpGly: 1.268 ± 1.195
TrpHis: 0.0 ± 0.0
TrpIle: 1.268 ± 0.637
TrpLys: 1.691 ± 0.85
TrpLeu: 0.846 ± 0.425
TrpMet: 0.846 ± 0.425
TrpAsn: 0.423 ± 0.212
TrpPro: 0.423 ± 0.212
TrpGln: 1.268 ± 0.637
TrpArg: 0.846 ± 0.425
TrpSer: 0.423 ± 0.212
TrpThr: 1.268 ± 1.224
TrpVal: 0.0 ± 0.0
TrpTrp: 0.423 ± 0.212
TrpTyr: 0.423 ± 1.467
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.114 ± 1.076
TyrCys: 0.423 ± 0.212
TyrAsp: 0.846 ± 2.934
TyrGlu: 2.114 ± 1.076
TyrPhe: 0.423 ± 0.212
TyrGly: 1.268 ± 0.637
TyrHis: 1.691 ± 1.117
TyrIle: 2.114 ± 1.076
TyrLys: 2.96 ± 1.118
TyrLeu: 4.228 ± 1.434
TyrMet: 0.846 ± 0.425
TyrAsn: 1.268 ± 0.637
TyrPro: 2.114 ± 1.062
TyrGln: 2.114 ± 1.062
TyrArg: 2.96 ± 1.132
TyrSer: 1.691 ± 0.85
TyrThr: 1.268 ± 1.352
TyrVal: 1.268 ± 0.637
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.691 ± 1.117
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4 proteins (2366 amino acids)
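
To work with the listing above programmatically, the entries can be read into a dictionary. This is only a sketch: the pattern `ENTRY` and the function `parse_entries` are hypothetical names, and the regular expression assumes each entry contains text of the form "GluGlu: 14.799 ± 1.493", ignoring anything that precedes the dipeptide name on the line.

```python
import re

# Two three-letter codes, a colon, the frequency, "±", and the error.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")

def parse_entries(lines):
    """Map (first, second) residue codes to (frequency, error) in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match:  # row labels such as "Glu" simply do not match
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table

sample = ["Glu", "GluGlu: 14.799 ± 1.493", "0.0GluXaa: 0.0 ± 0.0"]
table = parse_entries(sample)
```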

Note: The error has been estimated by bootstrapping (×100) at the protein level.
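
A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the toy sequences, the use of the sample standard deviation as the error, and the fixed seed are all assumptions; the page only states that 100 bootstrap rounds at the protein level were used.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over resampled protein sets."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy sequences in one-letter code, not the real proteome.
toy = ["MEEKLEE", "AKLGEEL", "MKTEEV", "GLSEEK"]
err = bootstrap_error(toy, "EE")
```

With only four proteins, as in this proteome, each replicate can differ substantially from the original set, which is consistent with the large errors seen for some dipeptides in the table.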

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski