Amino acid dipeptide frequency for Cucumber green mottle mosaic virus (strain watermelon SH) (CGMMV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
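The counting described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function names `dipeptide_permille` and `classify`, the one-letter input format, and the choice to count overlapping pairs within each protein only are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the column order used by the table; any other
# symbol is mapped to Xaa (non-standard or unknown residue).
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Overlapping pairs are counted within each protein only, never across
    protein boundaries (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        codes = [ONE_TO_THREE.get(res, "Xaa") for res in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(names, repeat=2)}


def classify(freq_permille, expected_permille=2.5):
    """Compare a frequency with the uniform expectation (1/400 = 2.5 per mille)."""
    if freq_permille > expected_permille:
        return "over"
    if freq_permille < expected_permille:
        return "under"
    return "expected"


# Toy input for illustration only, not the real CGMMV proteome.
toy = ["MAAL", "AALX"]
freqs = dipeptide_permille(toy)
```

With the table's own values, `classify(5.309)` would label AlaAla as overrepresented and `classify(0.483)` would label AlaHis as underrepresented, which corresponds to the red and blue marking described above.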

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
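As a small worked example of the unit conversion, the snippet below converts the AlaAla entry from the table into a fraction and a percentage. The helper names are hypothetical and exist only for this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 5.309  # AlaAla entry from the table, in per mille
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)

# The 0.25% uniform expectation for 400 dipeptides, expressed in per mille.
uniform_permille = 1000.0 / 400
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰, so table entries can be compared against that value directly.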

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.309 ± 2.134
AlaCys: 1.931 ± 0.588
AlaAsp: 4.344 ± 1.101
AlaGlu: 2.896 ± 2.357
AlaPhe: 3.861 ± 4.352
AlaGly: 3.861 ± 1.783
AlaHis: 0.483 ± 0.242
AlaIle: 2.413 ± 1.209
AlaLys: 1.931 ± 0.967
AlaLeu: 7.239 ± 0.205
AlaMet: 1.931 ± 0.687
AlaAsn: 1.931 ± 0.967
AlaPro: 0.483 ± 1.109
AlaGln: 1.448 ± 0.724
AlaArg: 3.378 ± 0.733
AlaSer: 7.239 ± 4.111
AlaThr: 5.792 ± 2.002
AlaVal: 7.722 ± 1.566
AlaTrp: 0.965 ± 0.484
AlaTyr: 1.448 ± 0.965
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.965 ± 0.484
CysCys: 0.483 ± 0.242
CysAsp: 0.0 ± 0.0
CysGlu: 1.448 ± 0.725
CysPhe: 0.483 ± 0.242
CysGly: 1.448 ± 0.725
CysHis: 0.0 ± 0.0
CysIle: 0.483 ± 0.242
CysLys: 2.413 ± 0.535
CysLeu: 1.931 ± 0.588
CysMet: 0.0 ± 0.0
CysAsn: 0.483 ± 0.242
CysPro: 0.483 ± 0.242
CysGln: 0.483 ± 0.242
CysArg: 1.448 ± 0.724
CysSer: 1.931 ± 0.967
CysThr: 0.0 ± 0.0
CysVal: 1.931 ± 0.967
CysTrp: 0.0 ± 0.0
CysTyr: 0.965 ± 0.484
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.757 ± 1.223
AspCys: 1.448 ± 0.725
AspAsp: 4.344 ± 1.367
AspGlu: 3.378 ± 1.693
AspPhe: 2.896 ± 0.585
AspGly: 1.931 ± 1.809
AspHis: 1.448 ± 0.724
AspIle: 3.861 ± 1.192
AspLys: 4.826 ± 1.069
AspLeu: 4.826 ± 1.318
AspMet: 1.931 ± 0.967
AspAsn: 3.378 ± 1.044
AspPro: 2.896 ± 2.112
AspGln: 1.448 ± 0.724
AspArg: 1.931 ± 0.892
AspSer: 4.344 ± 1.391
AspThr: 4.344 ± 0.38
AspVal: 6.757 ± 2.422
AspTrp: 0.965 ± 0.484
AspTyr: 1.931 ± 0.967
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.792 ± 1.87
GluCys: 0.483 ± 0.242
GluAsp: 2.896 ± 0.585
GluGlu: 2.896 ± 0.585
GluPhe: 4.344 ± 1.101
GluGly: 1.931 ± 0.588
GluHis: 0.965 ± 0.484
GluIle: 2.413 ± 0.881
GluLys: 3.861 ± 1.177
GluLeu: 4.826 ± 2.418
GluMet: 1.448 ± 0.725
GluAsn: 1.448 ± 0.724
GluPro: 0.965 ± 0.905
GluGln: 0.483 ± 0.242
GluArg: 2.413 ± 1.209
GluSer: 5.792 ± 5.356
GluThr: 1.448 ± 0.725
GluVal: 1.931 ± 1.416
GluTrp: 0.483 ± 0.242
GluTyr: 2.413 ± 1.209
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.413 ± 1.209
PheCys: 1.448 ± 0.725
PheAsp: 4.826 ± 0.351
PheGlu: 2.413 ± 1.182
PhePhe: 3.861 ± 0.898
PheGly: 1.931 ± 0.967
PheHis: 1.931 ± 0.588
PheIle: 2.413 ± 0.535
PheLys: 2.896 ± 1.451
PheLeu: 4.826 ± 1.762
PheMet: 0.965 ± 0.484
PheAsn: 1.448 ± 0.725
PhePro: 1.931 ± 0.892
PheGln: 2.896 ± 0.935
PheArg: 3.378 ± 1.044
PheSer: 9.653 ± 2.304
PheThr: 0.965 ± 0.484
PheVal: 5.309 ± 0.914
PheTrp: 0.0 ± 0.0
PheTyr: 1.931 ± 0.892
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.896 ± 0.585
GlyCys: 2.413 ± 1.209
GlyAsp: 2.896 ± 1.451
GlyGlu: 0.483 ± 0.242
GlyPhe: 1.931 ± 2.782
GlyGly: 2.896 ± 1.447
GlyHis: 1.448 ± 0.725
GlyIle: 1.931 ± 0.588
GlyLys: 2.413 ± 0.535
GlyLeu: 6.274 ± 1.287
GlyMet: 0.483 ± 0.242
GlyAsn: 3.861 ± 1.935
GlyPro: 1.448 ± 0.965
GlyGln: 0.483 ± 0.242
GlyArg: 2.896 ± 0.954
GlySer: 2.896 ± 1.451
GlyThr: 3.378 ± 0.733
GlyVal: 3.861 ± 2.341
GlyTrp: 0.483 ± 0.242
GlyTyr: 1.448 ± 0.725
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.448 ± 0.725
HisCys: 0.483 ± 0.242
HisAsp: 0.0 ± 0.0
HisGlu: 0.965 ± 0.484
HisPhe: 1.448 ± 0.725
HisGly: 0.0 ± 0.0
HisHis: 0.483 ± 0.242
HisIle: 0.965 ± 0.484
HisLys: 1.931 ± 0.967
HisLeu: 1.931 ± 0.588
HisMet: 0.965 ± 0.484
HisAsn: 0.0 ± 0.0
HisPro: 0.965 ± 0.905
HisGln: 0.483 ± 0.242
HisArg: 1.448 ± 0.725
HisSer: 4.344 ± 1.098
HisThr: 2.413 ± 0.535
HisVal: 1.931 ± 0.967
HisTrp: 0.0 ± 0.0
HisTyr: 0.965 ± 0.484
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.413 ± 0.881
IleCys: 0.965 ± 0.905
IleAsp: 3.378 ± 1.044
IleGlu: 3.378 ± 1.842
IlePhe: 3.378 ± 1.044
IleGly: 2.896 ± 1.451
IleHis: 0.965 ± 0.484
IleIle: 4.344 ± 1.101
IleLys: 5.309 ± 1.094
IleLeu: 2.896 ± 0.585
IleMet: 0.0 ± 0.0
IleAsn: 2.896 ± 0.935
IlePro: 2.896 ± 0.585
IleGln: 1.931 ± 0.967
IleArg: 2.413 ± 1.209
IleSer: 5.309 ± 0.914
IleThr: 3.378 ± 0.733
IleVal: 4.826 ± 1.318
IleTrp: 0.483 ± 0.242
IleTyr: 1.931 ± 0.967
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.309 ± 0.468
LysCys: 0.965 ± 0.484
LysAsp: 0.965 ± 0.905
LysGlu: 1.931 ± 0.967
LysPhe: 3.378 ± 1.297
LysGly: 3.861 ± 0.531
LysHis: 0.965 ± 0.905
LysIle: 3.861 ± 0.898
LysLys: 3.861 ± 0.898
LysLeu: 4.344 ± 1.391
LysMet: 0.483 ± 0.436
LysAsn: 2.896 ± 1.451
LysPro: 2.413 ± 1.62
LysGln: 0.965 ± 0.484
LysArg: 4.826 ± 0.351
LysSer: 5.792 ± 2.902
LysThr: 2.413 ± 1.209
LysVal: 6.757 ± 1.624
LysTrp: 0.483 ± 0.242
LysTyr: 3.378 ± 0.718
LysXaa: 0.483 ± 0.242
Leu
LeuAla: 3.378 ± 1.693
LeuCys: 0.483 ± 0.242
LeuAsp: 6.757 ± 1.436
LeuGlu: 4.826 ± 1.318
LeuPhe: 3.861 ± 0.898
LeuGly: 3.861 ± 0.898
LeuHis: 2.413 ± 1.209
LeuIle: 6.274 ± 1.967
LeuLys: 4.344 ± 1.098
LeuLeu: 9.17 ± 1.309
LeuMet: 1.931 ± 0.967
LeuAsn: 4.826 ± 2.803
LeuPro: 5.792 ± 0.657
LeuGln: 2.413 ± 0.535
LeuArg: 5.309 ± 2.229
LeuSer: 8.687 ± 2.782
LeuThr: 4.344 ± 1.098
LeuVal: 8.687 ± 0.836
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.931 ± 0.588
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.896 ± 0.954
MetCys: 0.0 ± 0.0
MetAsp: 0.483 ± 0.242
MetGlu: 0.483 ± 0.242
MetPhe: 0.483 ± 0.242
MetGly: 0.965 ± 0.484
MetHis: 0.965 ± 0.905
MetIle: 1.931 ± 0.967
MetLys: 0.483 ± 0.242
MetLeu: 1.931 ± 0.967
MetMet: 0.965 ± 0.484
MetAsn: 2.413 ± 1.209
MetPro: 0.0 ± 0.0
MetGln: 1.931 ± 0.967
MetArg: 0.965 ± 0.484
MetSer: 1.931 ± 0.588
MetThr: 0.965 ± 0.484
MetVal: 1.931 ± 0.967
MetTrp: 0.483 ± 0.242
MetTyr: 0.965 ± 0.484
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.448 ± 0.965
AsnCys: 0.483 ± 0.242
AsnAsp: 1.931 ± 0.967
AsnGlu: 1.448 ± 0.725
AsnPhe: 4.826 ± 1.561
AsnGly: 2.413 ± 0.881
AsnHis: 0.965 ± 0.484
AsnIle: 1.448 ± 0.725
AsnLys: 1.931 ± 0.967
AsnLeu: 3.861 ± 1.192
AsnMet: 1.448 ± 0.725
AsnAsn: 1.931 ± 0.967
AsnPro: 1.931 ± 2.176
AsnGln: 0.483 ± 0.242
AsnArg: 1.931 ± 1.416
AsnSer: 3.378 ± 1.871
AsnThr: 2.413 ± 1.209
AsnVal: 4.826 ± 1.069
AsnTrp: 0.965 ± 0.484
AsnTyr: 2.413 ± 1.62
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.861 ± 0.898
ProCys: 1.931 ± 0.967
ProAsp: 1.931 ± 0.892
ProGlu: 4.344 ± 1.098
ProPhe: 1.448 ± 0.724
ProGly: 2.413 ± 1.209
ProHis: 0.483 ± 0.242
ProIle: 2.896 ± 2.357
ProLys: 2.896 ± 1.451
ProLeu: 2.896 ± 0.585
ProMet: 0.483 ± 0.242
ProAsn: 1.931 ± 3.117
ProPro: 0.965 ± 0.905
ProGln: 0.483 ± 0.242
ProArg: 0.965 ± 0.484
ProSer: 1.931 ± 4.191
ProThr: 2.413 ± 0.881
ProVal: 6.274 ± 1.684
ProTrp: 0.483 ± 1.109
ProTyr: 0.965 ± 0.484
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.448 ± 0.965
GlnCys: 0.0 ± 0.0
GlnAsp: 1.448 ± 0.725
GlnGlu: 1.931 ± 0.967
GlnPhe: 0.965 ± 0.484
GlnGly: 0.965 ± 1.088
GlnHis: 0.965 ± 0.484
GlnIle: 1.931 ± 0.967
GlnLys: 0.483 ± 0.242
GlnLeu: 1.931 ± 0.967
GlnMet: 0.483 ± 0.242
GlnAsn: 0.965 ± 0.484
GlnPro: 1.448 ± 0.725
GlnGln: 0.0 ± 0.0
GlnArg: 1.448 ± 0.725
GlnSer: 5.309 ± 3.063
GlnThr: 1.931 ± 0.892
GlnVal: 0.483 ± 0.242
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.378 ± 2.154
ArgCys: 0.965 ± 0.484
ArgAsp: 4.344 ± 1.391
ArgGlu: 2.413 ± 1.182
ArgPhe: 2.896 ± 0.954
ArgGly: 1.931 ± 3.117
ArgHis: 1.448 ± 0.724
ArgIle: 1.931 ± 0.967
ArgLys: 3.861 ± 1.935
ArgLeu: 4.826 ± 1.069
ArgMet: 0.965 ± 0.905
ArgAsn: 2.896 ± 0.935
ArgPro: 2.413 ± 0.881
ArgGln: 0.965 ± 0.484
ArgArg: 3.378 ± 0.718
ArgSer: 3.861 ± 1.177
ArgThr: 3.378 ± 1.842
ArgVal: 4.344 ± 1.367
ArgTrp: 0.483 ± 0.242
ArgTyr: 0.965 ± 0.484
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.861 ± 3.963
SerCys: 0.965 ± 0.905
SerAsp: 7.239 ± 1.608
SerGlu: 4.826 ± 1.152
SerPhe: 6.274 ± 1.262
SerGly: 6.274 ± 2.003
SerHis: 0.965 ± 0.484
SerIle: 8.687 ± 2.059
SerLys: 6.757 ± 2.883
SerLeu: 10.618 ± 4.129
SerMet: 2.413 ± 1.209
SerAsn: 1.931 ± 1.416
SerPro: 4.344 ± 2.176
SerGln: 2.413 ± 1.182
SerArg: 3.378 ± 0.733
SerSer: 6.274 ± 2.755
SerThr: 3.378 ± 1.842
SerVal: 9.17 ± 7.787
SerTrp: 0.483 ± 1.109
SerTyr: 3.378 ± 1.871
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.861 ± 3.004
ThrCys: 0.965 ± 0.484
ThrAsp: 3.378 ± 2.154
ThrGlu: 1.448 ± 0.725
ThrPhe: 4.826 ± 2.418
ThrGly: 2.413 ± 1.209
ThrHis: 1.448 ± 0.725
ThrIle: 3.378 ± 0.718
ThrLys: 3.378 ± 0.718
ThrLeu: 5.309 ± 0.914
ThrMet: 1.448 ± 0.541
ThrAsn: 1.448 ± 0.724
ThrPro: 3.378 ± 1.044
ThrGln: 1.931 ± 0.892
ThrArg: 2.896 ± 0.954
ThrSer: 4.344 ± 1.367
ThrThr: 3.861 ± 1.96
ThrVal: 3.378 ± 1.297
ThrTrp: 0.483 ± 0.242
ThrTyr: 1.448 ± 0.725
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.757 ± 4.211
ValCys: 0.965 ± 0.484
ValAsp: 8.687 ± 1.549
ValGlu: 5.309 ± 1.879
ValPhe: 2.413 ± 1.209
ValGly: 3.378 ± 1.297
ValHis: 4.344 ± 2.176
ValIle: 2.896 ± 0.935
ValLys: 4.826 ± 2.459
ValLeu: 6.274 ± 0.873
ValMet: 2.413 ± 0.535
ValAsn: 2.413 ± 1.209
ValPro: 4.344 ± 2.691
ValGln: 1.931 ± 0.967
ValArg: 6.757 ± 1.561
ValSer: 7.239 ± 2.67
ValThr: 5.309 ± 1.542
ValVal: 8.205 ± 6.576
ValTrp: 2.413 ± 1.182
ValTyr: 4.826 ± 1.561
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.448 ± 0.725
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.483 ± 0.242
TrpPhe: 1.448 ± 0.725
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.448 ± 0.725
TrpLeu: 0.483 ± 0.242
TrpMet: 0.483 ± 0.242
TrpAsn: 1.931 ± 0.588
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.448 ± 1.651
TrpThr: 0.0 ± 0.0
TrpVal: 0.483 ± 1.109
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.965 ± 1.088
TyrCys: 0.0 ± 0.0
TyrAsp: 5.792 ± 1.979
TyrGlu: 1.931 ± 0.588
TyrPhe: 2.413 ± 0.535
TyrGly: 1.448 ± 0.725
TyrHis: 0.483 ± 0.242
TyrIle: 1.931 ± 0.967
TyrLys: 0.965 ± 0.484
TyrLeu: 2.896 ± 0.585
TyrMet: 1.448 ± 0.725
TyrAsn: 1.448 ± 0.965
TyrPro: 2.896 ± 0.585
TyrGln: 0.483 ± 0.242
TyrArg: 0.483 ± 1.109
TyrSer: 1.931 ± 0.967
TyrThr: 2.896 ± 1.451
TyrVal: 2.896 ± 0.954
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.965 ± 0.484
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.483 ± 0.242
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2073 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
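One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency for each resample, and report the spread of those estimates. The sketch below follows that reading. It is an assumption, not the documented Proteome-pI procedure: the function names, the use of the sample standard deviation, and the fixed random seed are all choices made for illustration.

```python
import random
import statistics


def pair_permille(proteins, first, second):
    """Per mille frequency of the pair first+second over overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            if a == first and b == second:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, first, second, rounds=100, seed=0):
    """Standard deviation of the pair frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, first, second))
    return statistics.stdev(estimates)


# Toy one-letter sequences for illustration only, not the real CGMMV proteome.
toy = ["MAALAA", "AAKLV", "MSVAA"]
point = pair_permille(toy, "A", "A")
error = bootstrap_error(toy, "A", "A")
```

With only 3 proteins in this proteome, each resample can differ substantially from the original set, which may explain why some errors in the table are as large as the values themselves.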

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski