Amino acid dipepetide frequency for East African cassava mosaic virus-Uganda2 Severe

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.195AlaAla: 3.195 ± 2.405
3.195AlaCys: 3.195 ± 0.497
0.0AlaAsp: 0.0 ± 0.0
3.195AlaGlu: 3.195 ± 3.317
1.065AlaPhe: 1.065 ± 0.802
2.13AlaGly: 2.13 ± 0.981
1.065AlaHis: 1.065 ± 1.106
2.13AlaIle: 2.13 ± 0.86
2.13AlaLys: 2.13 ± 1.603
3.195AlaLeu: 3.195 ± 1.675
0.0AlaMet: 0.0 ± 0.0
2.13AlaAsn: 2.13 ± 0.917
4.26AlaPro: 4.26 ± 1.834
1.065AlaGln: 1.065 ± 0.802
2.13AlaArg: 2.13 ± 0.86
3.195AlaSer: 3.195 ± 0.497
6.39AlaThr: 6.39 ± 2.282
2.13AlaVal: 2.13 ± 1.246
0.0AlaTrp: 0.0 ± 0.0
1.065AlaTyr: 1.065 ± 0.877
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
2.13CysCys: 2.13 ± 2.211
1.065CysAsp: 1.065 ± 0.877
1.065CysGlu: 1.065 ± 0.94
0.0CysPhe: 0.0 ± 0.0
1.065CysGly: 1.065 ± 0.877
0.0CysHis: 0.0 ± 0.0
2.13CysIle: 2.13 ± 0.917
3.195CysLys: 3.195 ± 1.911
3.195CysLeu: 3.195 ± 1.542
1.065CysMet: 1.065 ± 0.802
1.065CysAsn: 1.065 ± 0.802
3.195CysPro: 3.195 ± 3.317
0.0CysGln: 0.0 ± 0.0
2.13CysArg: 2.13 ± 1.754
0.0CysSer: 0.0 ± 0.0
1.065CysThr: 1.065 ± 0.94
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.195AspAla: 3.195 ± 1.443
1.065AspCys: 1.065 ± 0.802
2.13AspAsp: 2.13 ± 0.981
2.13AspGlu: 2.13 ± 0.981
4.26AspPhe: 4.26 ± 1.409
1.065AspGly: 1.065 ± 0.877
3.195AspHis: 3.195 ± 1.413
3.195AspIle: 3.195 ± 0.497
2.13AspLys: 2.13 ± 0.86
4.26AspLeu: 4.26 ± 1.409
0.0AspMet: 0.0 ± 0.0
6.39AspAsn: 6.39 ± 1.508
2.13AspPro: 2.13 ± 1.218
0.0AspGln: 0.0 ± 0.0
3.195AspArg: 3.195 ± 1.911
5.325AspSer: 5.325 ± 2.901
0.0AspThr: 0.0 ± 0.0
6.39AspVal: 6.39 ± 2.495
0.0AspTrp: 0.0 ± 0.0
1.065AspTyr: 1.065 ± 0.877
0.0AspXaa: 0.0 ± 0.0
Glu
4.26GluAla: 4.26 ± 1.964
0.0GluCys: 0.0 ± 0.0
3.195GluAsp: 3.195 ± 1.413
2.13GluGlu: 2.13 ± 1.372
0.0GluPhe: 0.0 ± 0.0
3.195GluGly: 3.195 ± 0.497
1.065GluHis: 1.065 ± 0.877
1.065GluIle: 1.065 ± 0.802
1.065GluLys: 1.065 ± 0.802
3.195GluLeu: 3.195 ± 1.213
0.0GluMet: 0.0 ± 0.0
4.26GluAsn: 4.26 ± 2.579
3.195GluPro: 3.195 ± 1.061
4.26GluGln: 4.26 ± 1.593
1.065GluArg: 1.065 ± 0.877
3.195GluSer: 3.195 ± 1.542
5.325GluThr: 5.325 ± 2.895
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
6.39GluTyr: 6.39 ± 2.63
0.0GluXaa: 0.0 ± 0.0
Phe
1.065PheAla: 1.065 ± 0.802
1.065PheCys: 1.065 ± 0.94
3.195PheAsp: 3.195 ± 1.675
2.13PheGlu: 2.13 ± 0.917
1.065PhePhe: 1.065 ± 0.877
2.13PheGly: 2.13 ± 0.981
1.065PheHis: 1.065 ± 0.94
0.0PheIle: 0.0 ± 0.0
2.13PheLys: 2.13 ± 0.86
0.0PheLeu: 0.0 ± 0.0
1.065PheMet: 1.065 ± 0.877
3.195PheAsn: 3.195 ± 0.497
3.195PhePro: 3.195 ± 1.74
1.065PheGln: 1.065 ± 0.94
2.13PheArg: 2.13 ± 1.218
3.195PheSer: 3.195 ± 1.413
2.13PheThr: 2.13 ± 1.603
2.13PheVal: 2.13 ± 1.754
1.065PheTrp: 1.065 ± 0.802
3.195PheTyr: 3.195 ± 2.821
0.0PheXaa: 0.0 ± 0.0
Gly
4.26GlyAla: 4.26 ± 2.743
1.065GlyCys: 1.065 ± 0.94
4.26GlyAsp: 4.26 ± 1.091
4.26GlyGlu: 4.26 ± 1.721
1.065GlyPhe: 1.065 ± 1.106
5.325GlyGly: 5.325 ± 3.228
0.0GlyHis: 0.0 ± 0.0
3.195GlyIle: 3.195 ± 1.605
5.325GlyLys: 5.325 ± 2.624
2.13GlyLeu: 2.13 ± 0.917
2.13GlyMet: 2.13 ± 1.51
3.195GlyAsn: 3.195 ± 1.605
5.325GlyPro: 5.325 ± 2.532
2.13GlyGln: 2.13 ± 1.246
2.13GlyArg: 2.13 ± 0.86
1.065GlySer: 1.065 ± 0.802
3.195GlyThr: 3.195 ± 2.019
3.195GlyVal: 3.195 ± 1.213
0.0GlyTrp: 0.0 ± 0.0
2.13GlyTyr: 2.13 ± 0.86
0.0GlyXaa: 0.0 ± 0.0
His
3.195HisAla: 3.195 ± 1.675
1.065HisCys: 1.065 ± 1.106
0.0HisAsp: 0.0 ± 0.0
1.065HisGlu: 1.065 ± 1.106
1.065HisPhe: 1.065 ± 0.802
1.065HisGly: 1.065 ± 1.106
1.065HisHis: 1.065 ± 0.94
3.195HisIle: 3.195 ± 1.542
0.0HisLys: 0.0 ± 0.0
2.13HisLeu: 2.13 ± 1.246
0.0HisMet: 0.0 ± 0.0
1.065HisAsn: 1.065 ± 0.877
0.0HisPro: 0.0 ± 0.0
2.13HisGln: 2.13 ± 1.246
4.26HisArg: 4.26 ± 0.904
1.065HisSer: 1.065 ± 0.802
3.195HisThr: 3.195 ± 1.675
2.13HisVal: 2.13 ± 1.246
0.0HisTrp: 0.0 ± 0.0
1.065HisTyr: 1.065 ± 0.802
0.0HisXaa: 0.0 ± 0.0
Ile
1.065IleAla: 1.065 ± 0.802
1.065IleCys: 1.065 ± 0.94
3.195IleAsp: 3.195 ± 1.413
2.13IleGlu: 2.13 ± 1.603
2.13IlePhe: 2.13 ± 0.981
4.26IleGly: 4.26 ± 1.531
2.13IleHis: 2.13 ± 1.372
4.26IleIle: 4.26 ± 1.834
7.455IleLys: 7.455 ± 2.06
5.325IleLeu: 5.325 ± 1.044
1.065IleMet: 1.065 ± 0.802
5.325IleAsn: 5.325 ± 2.892
0.0IlePro: 0.0 ± 0.0
2.13IleGln: 2.13 ± 1.372
4.26IleArg: 4.26 ± 1.409
3.195IleSer: 3.195 ± 1.71
3.195IleThr: 3.195 ± 1.413
1.065IleVal: 1.065 ± 0.802
2.13IleTrp: 2.13 ± 1.881
2.13IleTyr: 2.13 ± 1.881
0.0IleXaa: 0.0 ± 0.0
Lys
1.065LysAla: 1.065 ± 0.802
2.13LysCys: 2.13 ± 0.86
3.195LysAsp: 3.195 ± 1.413
2.13LysGlu: 2.13 ± 0.981
2.13LysPhe: 2.13 ± 0.917
4.26LysGly: 4.26 ± 2.146
2.13LysHis: 2.13 ± 0.917
5.325LysIle: 5.325 ± 2.532
2.13LysLys: 2.13 ± 0.981
6.39LysLeu: 6.39 ± 2.774
1.065LysMet: 1.065 ± 0.94
5.325LysAsn: 5.325 ± 1.121
5.325LysPro: 5.325 ± 1.121
3.195LysGln: 3.195 ± 1.233
4.26LysArg: 4.26 ± 2.558
4.26LysSer: 4.26 ± 0.904
2.13LysThr: 2.13 ± 1.754
3.195LysVal: 3.195 ± 2.821
0.0LysTrp: 0.0 ± 0.0
3.195LysTyr: 3.195 ± 1.233
0.0LysXaa: 0.0 ± 0.0
Leu
2.13LeuAla: 2.13 ± 1.372
1.065LeuCys: 1.065 ± 0.877
2.13LeuAsp: 2.13 ± 1.603
5.325LeuGlu: 5.325 ± 2.122
1.065LeuPhe: 1.065 ± 0.877
2.13LeuGly: 2.13 ± 1.881
4.26LeuHis: 4.26 ± 2.42
2.13LeuIle: 2.13 ± 1.218
4.26LeuLys: 4.26 ± 1.091
7.455LeuLeu: 7.455 ± 1.565
1.065LeuMet: 1.065 ± 0.769
5.325LeuAsn: 5.325 ± 1.069
5.325LeuPro: 5.325 ± 2.338
1.065LeuGln: 1.065 ± 0.802
8.52LeuArg: 8.52 ± 2.118
5.325LeuSer: 5.325 ± 2.198
2.13LeuThr: 2.13 ± 0.917
4.26LeuVal: 4.26 ± 1.739
0.0LeuTrp: 0.0 ± 0.0
3.195LeuTyr: 3.195 ± 1.605
0.0LeuXaa: 0.0 ± 0.0
Met
2.13MetAla: 2.13 ± 0.981
0.0MetCys: 0.0 ± 0.0
2.13MetAsp: 2.13 ± 0.917
0.0MetGlu: 0.0 ± 0.0
2.13MetPhe: 2.13 ± 1.881
3.195MetGly: 3.195 ± 1.213
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
2.13MetLys: 2.13 ± 0.917
1.065MetLeu: 1.065 ± 1.106
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.065MetPro: 1.065 ± 0.877
0.0MetGln: 0.0 ± 0.0
3.195MetArg: 3.195 ± 1.413
1.065MetSer: 1.065 ± 0.94
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
2.13MetTrp: 2.13 ± 1.372
5.325MetTyr: 5.325 ± 2.137
0.0MetXaa: 0.0 ± 0.0
Asn
4.26AsnAla: 4.26 ± 1.234
0.0AsnCys: 0.0 ± 0.0
4.26AsnAsp: 4.26 ± 1.409
2.13AsnGlu: 2.13 ± 0.981
1.065AsnPhe: 1.065 ± 0.94
2.13AsnGly: 2.13 ± 1.754
4.26AsnHis: 4.26 ± 3.761
4.26AsnIle: 4.26 ± 1.091
2.13AsnLys: 2.13 ± 0.86
2.13AsnLeu: 2.13 ± 1.218
4.26AsnMet: 4.26 ± 1.607
2.13AsnAsn: 2.13 ± 1.246
2.13AsnPro: 2.13 ± 0.917
2.13AsnGln: 2.13 ± 1.603
3.195AsnArg: 3.195 ± 1.675
3.195AsnSer: 3.195 ± 1.413
1.065AsnThr: 1.065 ± 1.106
5.325AsnVal: 5.325 ± 0.781
0.0AsnTrp: 0.0 ± 0.0
4.26AsnTyr: 4.26 ± 1.731
0.0AsnXaa: 0.0 ± 0.0
Pro
1.065ProAla: 1.065 ± 0.802
3.195ProCys: 3.195 ± 1.233
1.065ProAsp: 1.065 ± 0.94
3.195ProGlu: 3.195 ± 1.213
3.195ProPhe: 3.195 ± 1.542
4.26ProGly: 4.26 ± 1.834
2.13ProHis: 2.13 ± 1.218
5.325ProIle: 5.325 ± 3.158
3.195ProLys: 3.195 ± 1.71
1.065ProLeu: 1.065 ± 1.106
3.195ProMet: 3.195 ± 1.139
0.0ProAsn: 0.0 ± 0.0
1.065ProPro: 1.065 ± 0.802
2.13ProGln: 2.13 ± 0.981
5.325ProArg: 5.325 ± 2.462
7.455ProSer: 7.455 ± 3.569
4.26ProThr: 4.26 ± 0.904
2.13ProVal: 2.13 ± 1.881
1.065ProTrp: 1.065 ± 0.802
5.325ProTyr: 5.325 ± 1.332
0.0ProXaa: 0.0 ± 0.0
Gln
4.26GlnAla: 4.26 ± 1.409
1.065GlnCys: 1.065 ± 0.877
2.13GlnAsp: 2.13 ± 1.246
2.13GlnGlu: 2.13 ± 1.246
3.195GlnPhe: 3.195 ± 1.413
2.13GlnGly: 2.13 ± 1.372
0.0GlnHis: 0.0 ± 0.0
2.13GlnIle: 2.13 ± 0.917
2.13GlnLys: 2.13 ± 1.372
1.065GlnLeu: 1.065 ± 0.802
0.0GlnMet: 0.0 ± 0.0
3.195GlnAsn: 3.195 ± 2.16
1.065GlnPro: 1.065 ± 1.106
2.13GlnGln: 2.13 ± 1.372
2.13GlnArg: 2.13 ± 0.917
2.13GlnSer: 2.13 ± 0.917
3.195GlnThr: 3.195 ± 1.413
2.13GlnVal: 2.13 ± 0.917
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
0.0ArgAla: 0.0 ± 0.0
3.195ArgCys: 3.195 ± 2.184
5.325ArgAsp: 5.325 ± 2.392
0.0ArgGlu: 0.0 ± 0.0
4.26ArgPhe: 4.26 ± 2.558
5.325ArgGly: 5.325 ± 2.463
2.13ArgHis: 2.13 ± 1.372
3.195ArgIle: 3.195 ± 1.061
5.325ArgLys: 5.325 ± 1.332
8.52ArgLeu: 8.52 ± 2.915
2.13ArgMet: 2.13 ± 0.917
1.065ArgAsn: 1.065 ± 0.802
5.325ArgPro: 5.325 ± 2.532
2.13ArgGln: 2.13 ± 1.218
10.65ArgArg: 10.65 ± 1.844
8.52ArgSer: 8.52 ± 2.487
2.13ArgThr: 2.13 ± 0.86
9.585ArgVal: 9.585 ± 2.246
0.0ArgTrp: 0.0 ± 0.0
2.13ArgTyr: 2.13 ± 1.372
0.0ArgXaa: 0.0 ± 0.0
Ser
1.065SerAla: 1.065 ± 0.802
0.0SerCys: 0.0 ± 0.0
2.13SerAsp: 2.13 ± 0.917
3.195SerGlu: 3.195 ± 1.74
3.195SerPhe: 3.195 ± 0.497
4.26SerGly: 4.26 ± 1.091
0.0SerHis: 0.0 ± 0.0
3.195SerIle: 3.195 ± 0.497
8.52SerLys: 8.52 ± 1.108
7.455SerLeu: 7.455 ± 3.884
2.13SerMet: 2.13 ± 1.218
2.13SerAsn: 2.13 ± 0.917
4.26SerPro: 4.26 ± 1.234
6.39SerGln: 6.39 ± 2.826
2.13SerArg: 2.13 ± 0.86
8.52SerSer: 8.52 ± 3.594
9.585SerThr: 9.585 ± 2.517
7.455SerVal: 7.455 ± 3.024
0.0SerTrp: 0.0 ± 0.0
3.195SerTyr: 3.195 ± 2.631
0.0SerXaa: 0.0 ± 0.0
Thr
2.13ThrAla: 2.13 ± 0.981
1.065ThrCys: 1.065 ± 0.877
4.26ThrAsp: 4.26 ± 2.13
3.195ThrGlu: 3.195 ± 1.443
2.13ThrPhe: 2.13 ± 0.86
1.065ThrGly: 1.065 ± 0.877
3.195ThrHis: 3.195 ± 1.675
4.26ThrIle: 4.26 ± 2.146
2.13ThrLys: 2.13 ± 1.603
4.26ThrLeu: 4.26 ± 0.904
2.13ThrMet: 2.13 ± 0.86
4.26ThrAsn: 4.26 ± 2.558
6.39ThrPro: 6.39 ± 1.508
1.065ThrGln: 1.065 ± 1.106
6.39ThrArg: 6.39 ± 1.792
3.195ThrSer: 3.195 ± 2.631
5.325ThrThr: 5.325 ± 2.198
6.39ThrVal: 6.39 ± 1.792
1.065ThrTrp: 1.065 ± 0.802
2.13ThrTyr: 2.13 ± 0.917
0.0ThrXaa: 0.0 ± 0.0
Val
1.065ValAla: 1.065 ± 0.94
0.0ValCys: 0.0 ± 0.0
4.26ValAsp: 4.26 ± 2.356
4.26ValGlu: 4.26 ± 2.42
0.0ValPhe: 0.0 ± 0.0
4.26ValGly: 4.26 ± 1.711
1.065ValHis: 1.065 ± 1.106
4.26ValIle: 4.26 ± 0.904
4.26ValLys: 4.26 ± 1.593
3.195ValLeu: 3.195 ± 1.233
1.065ValMet: 1.065 ± 0.94
2.13ValAsn: 2.13 ± 1.372
4.26ValPro: 4.26 ± 1.091
2.13ValGln: 2.13 ± 1.246
3.195ValArg: 3.195 ± 1.71
10.65ValSer: 10.65 ± 2.588
7.455ValThr: 7.455 ± 2.872
2.13ValVal: 2.13 ± 1.881
1.065ValTrp: 1.065 ± 0.877
5.325ValTyr: 5.325 ± 3.228
0.0ValXaa: 0.0 ± 0.0
Trp
2.13TrpAla: 2.13 ± 0.917
0.0TrpCys: 0.0 ± 0.0
1.065TrpAsp: 1.065 ± 1.106
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
1.065TrpMet: 1.065 ± 0.94
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.065TrpArg: 1.065 ± 0.802
2.13TrpSer: 2.13 ± 0.86
0.0TrpThr: 0.0 ± 0.0
1.065TrpVal: 1.065 ± 0.877
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.13TyrAla: 2.13 ± 0.917
0.0TyrCys: 0.0 ± 0.0
2.13TyrAsp: 2.13 ± 1.246
3.195TyrGlu: 3.195 ± 1.605
3.195TyrPhe: 3.195 ± 1.443
3.195TyrGly: 3.195 ± 1.605
0.0TyrHis: 0.0 ± 0.0
4.26TyrIle: 4.26 ± 0.65
3.195TyrLys: 3.195 ± 1.605
2.13TyrLeu: 2.13 ± 1.246
1.065TyrMet: 1.065 ± 0.94
2.13TyrAsn: 2.13 ± 1.246
2.13TyrPro: 2.13 ± 0.86
1.065TyrGln: 1.065 ± 0.802
9.585TyrArg: 9.585 ± 2.246
2.13TyrSer: 2.13 ± 1.754
4.26TyrThr: 4.26 ± 1.091
5.325TyrVal: 5.325 ± 3.617
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (940 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski