Amino acid dipeptide frequency for Paspalum dilatatum striate mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
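As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs in a set of protein sequences. It is not the code used by the database: the function name `dipeptide_fractions`, the one-letter input alphabet, the mapping of every non-standard letter to X (Xaa), and the choice not to count pairs across protein boundaries are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_fractions(sequences):
    """Return {dipeptide: fraction} over all overlapping pairs (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins here.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome described on this page.
fractions = dipeptide_fractions(["MKKLA", "AKKU"])
print(fractions["KK"])  # share of Lys-Lys pairs among the 7 pairs counted
```

The result is a plain fraction between 0 and 1 for every dipeptide that occurs at least once; dipeptides that never occur are simply absent from the dictionary.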

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
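The arithmetic behind the expected value, and the comparison that drives the colouring, can be sketched as follows. The helper names are hypothetical, and the simple greater-than or less-than rule is an assumption: the page does not say whether any tolerance around the expected value is applied.

```python
def expected_uniform(n_residue_types=20):
    """Expected frequency of each dipeptide if all ordered pairs were equally likely."""
    return 1.0 / (n_residue_types ** 2)


def classify(freq, expected):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq > expected:
        return "over"    # would be marked red
    if freq < expected:
        return "under"   # would be marked blue
    return "expected"


expected = expected_uniform(20)      # 1/400 = 0.0025, i.e. 0.25%
expected_xaa = expected_uniform(21)  # 1/441, if Xaa is counted as a 21st residue

# Two values from the table, converted from per mille to fractions.
print(classify(0.009718, expected))  # AlaArg, 9.718 per mille
print(classify(0.000972, expected))  # AlaGly, 0.972 per mille
```

With 21 residue types the expectation drops slightly, to about 0.227% per dipeptide.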

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale, the 0.25% expected for random dipeptides corresponds to 2.5‰.
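For readers who want to move between the units, a minimal conversion sketch is given below; the function names are made up for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 2.915 per mille in the table.
print(per_mille_to_fraction(2.915))
print(per_mille_to_percent(2.915))
```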

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.915 ± 2.22
AlaCys: 2.915 ± 1.049
AlaAsp: 1.944 ± 0.767
AlaGlu: 5.831 ± 1.783
AlaPhe: 3.887 ± 0.825
AlaGly: 0.972 ± 0.969
AlaHis: 0.0 ± 0.0
AlaIle: 1.944 ± 1.478
AlaLys: 7.775 ± 2.083
AlaLeu: 6.803 ± 2.898
AlaMet: 0.972 ± 0.642
AlaAsn: 0.972 ± 0.969
AlaPro: 6.803 ± 1.549
AlaGln: 2.915 ± 1.311
AlaArg: 9.718 ± 2.698
AlaSer: 9.718 ± 2.079
AlaThr: 2.915 ± 0.5
AlaVal: 3.887 ± 2.065
AlaTrp: 2.915 ± 1.311
AlaTyr: 5.831 ± 1.435
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.972 ± 0.969
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 3.887 ± 1.533
CysGly: 0.972 ± 0.757
CysHis: 2.915 ± 1.311
CysIle: 0.972 ± 1.152
CysLys: 0.0 ± 0.0
CysLeu: 3.887 ± 1.533
CysMet: 0.0 ± 0.0
CysAsn: 1.944 ± 1.938
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.944 ± 0.767
CysSer: 0.972 ± 0.969
CysThr: 0.0 ± 0.0
CysVal: 1.944 ± 1.193
CysTrp: 1.944 ± 0.865
CysTyr: 0.972 ± 0.969
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.887 ± 1.533
AspCys: 0.0 ± 0.0
AspAsp: 0.972 ± 0.757
AspGlu: 3.887 ± 1.324
AspPhe: 0.972 ± 0.757
AspGly: 1.944 ± 0.767
AspHis: 0.0 ± 0.0
AspIle: 3.887 ± 1.73
AspLys: 0.972 ± 0.969
AspLeu: 1.944 ± 1.014
AspMet: 0.0 ± 0.0
AspAsn: 0.972 ± 0.757
AspPro: 4.859 ± 1.431
AspGln: 2.915 ± 1.523
AspArg: 0.972 ± 0.757
AspSer: 0.0 ± 0.0
AspThr: 1.944 ± 1.014
AspVal: 3.887 ± 0.978
AspTrp: 2.915 ± 1.311
AspTyr: 2.915 ± 0.5
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.803 ± 1.345
GluCys: 0.972 ± 0.969
GluAsp: 7.775 ± 1.548
GluGlu: 7.775 ± 3.066
GluPhe: 4.859 ± 1.431
GluGly: 2.915 ± 1.344
GluHis: 5.831 ± 2.3
GluIle: 0.0 ± 0.0
GluLys: 0.0 ± 0.0
GluLeu: 1.944 ± 0.767
GluMet: 0.0 ± 0.0
GluAsn: 0.972 ± 0.969
GluPro: 1.944 ± 0.767
GluGln: 0.972 ± 0.776
GluArg: 2.915 ± 1.311
GluSer: 2.915 ± 2.466
GluThr: 3.887 ± 0.786
GluVal: 6.803 ± 2.499
GluTrp: 0.972 ± 0.969
GluTyr: 2.915 ± 1.049
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.915 ± 2.466
PheCys: 0.0 ± 0.0
PheAsp: 4.859 ± 0.858
PheGlu: 5.831 ± 2.622
PhePhe: 2.915 ± 1.311
PheGly: 2.915 ± 1.049
PheHis: 1.944 ± 0.767
PheIle: 2.915 ± 3.456
PheLys: 4.859 ± 1.181
PheLeu: 2.915 ± 1.311
PheMet: 0.0 ± 0.0
PheAsn: 0.972 ± 0.969
PhePro: 4.859 ± 2.046
PheGln: 0.972 ± 0.969
PheArg: 0.972 ± 0.757
PheSer: 0.0 ± 0.0
PheThr: 1.944 ± 1.938
PheVal: 3.887 ± 0.786
PheTrp: 0.0 ± 0.0
PheTyr: 0.972 ± 0.757
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 14.577 ± 3.671
GlyCys: 0.972 ± 0.757
GlyAsp: 0.0 ± 0.0
GlyGlu: 3.887 ± 0.978
GlyPhe: 1.944 ± 1.193
GlyGly: 1.944 ± 1.938
GlyHis: 0.0 ± 0.0
GlyIle: 3.887 ± 2.222
GlyLys: 1.944 ± 1.513
GlyLeu: 4.859 ± 2.709
GlyMet: 3.887 ± 1.533
GlyAsn: 1.944 ± 1.478
GlyPro: 3.887 ± 1.324
GlyGln: 2.915 ± 1.344
GlyArg: 1.944 ± 1.478
GlySer: 6.803 ± 3.11
GlyThr: 0.972 ± 0.969
GlyVal: 3.887 ± 2.741
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.944 ± 0.767
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 1.944 ± 0.767
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 3.887 ± 1.533
HisLeu: 3.887 ± 1.533
HisMet: 0.0 ± 0.0
HisAsn: 1.944 ± 0.767
HisPro: 1.944 ± 0.767
HisGln: 1.944 ± 0.767
HisArg: 0.972 ± 0.969
HisSer: 0.972 ± 0.757
HisThr: 0.0 ± 0.0
HisVal: 2.915 ± 1.311
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.915 ± 2.466
IleCys: 1.944 ± 0.865
IleAsp: 0.972 ± 0.757
IleGlu: 0.0 ± 0.0
IlePhe: 0.972 ± 0.969
IleGly: 0.0 ± 0.0
IleHis: 0.972 ± 0.776
IleIle: 2.915 ± 1.311
IleLys: 3.887 ± 1.338
IleLeu: 1.944 ± 0.865
IleMet: 0.0 ± 0.0
IleAsn: 0.0 ± 0.0
IlePro: 7.775 ± 4.261
IleGln: 0.0 ± 0.0
IleArg: 1.944 ± 0.767
IleSer: 3.887 ± 1.407
IleThr: 4.859 ± 1.463
IleVal: 0.0 ± 0.0
IleTrp: 0.0 ± 0.0
IleTyr: 0.972 ± 1.152
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.944 ± 0.767
LysCys: 2.915 ± 0.5
LysAsp: 7.775 ± 1.178
LysGlu: 1.944 ± 0.767
LysPhe: 2.915 ± 1.311
LysGly: 4.859 ± 1.181
LysHis: 0.0 ± 0.0
LysIle: 0.0 ± 0.0
LysLys: 13.605 ± 4.195
LysLeu: 1.944 ± 1.014
LysMet: 1.944 ± 0.767
LysAsn: 0.0 ± 0.0
LysPro: 0.972 ± 0.969
LysGln: 0.0 ± 0.0
LysArg: 7.775 ± 3.872
LysSer: 3.887 ± 1.338
LysThr: 2.915 ± 1.232
LysVal: 4.859 ± 1.463
LysTrp: 2.915 ± 0.5
LysTyr: 3.887 ± 0.786
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.944 ± 0.767
LeuCys: 0.0 ± 0.0
LeuAsp: 1.944 ± 1.551
LeuGlu: 4.859 ± 0.858
LeuPhe: 2.915 ± 1.049
LeuGly: 7.775 ± 1.178
LeuHis: 2.915 ± 1.311
LeuIle: 0.0 ± 0.0
LeuLys: 3.887 ± 0.825
LeuLeu: 5.831 ± 1.435
LeuMet: 0.0 ± 0.0
LeuAsn: 1.944 ± 1.014
LeuPro: 4.859 ± 0.858
LeuGln: 4.859 ± 1.408
LeuArg: 3.887 ± 1.533
LeuSer: 7.775 ± 4.153
LeuThr: 2.915 ± 0.5
LeuVal: 8.746 ± 2.653
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.887 ± 0.786
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.859 ± 1.431
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 1.944 ± 0.767
MetMet: 0.0 ± 0.0
MetAsn: 0.972 ± 0.757
MetPro: 2.915 ± 0.5
MetGln: 0.972 ± 0.757
MetArg: 0.0 ± 0.0
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 1.944 ± 0.767
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.915 ± 0.5
AsnCys: 0.972 ± 0.757
AsnAsp: 0.0 ± 0.0
AsnGlu: 1.944 ± 0.767
AsnPhe: 0.0 ± 0.0
AsnGly: 3.887 ± 1.338
AsnHis: 0.0 ± 0.0
AsnIle: 0.0 ± 0.0
AsnLys: 0.0 ± 0.0
AsnLeu: 2.915 ± 2.907
AsnMet: 0.972 ± 0.992
AsnAsn: 0.0 ± 0.0
AsnPro: 0.972 ± 0.757
AsnGln: 2.915 ± 1.674
AsnArg: 0.0 ± 0.0
AsnSer: 0.0 ± 0.0
AsnThr: 0.972 ± 0.969
AsnVal: 4.859 ± 2.01
AsnTrp: 0.972 ± 0.969
AsnTyr: 0.972 ± 0.757
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.859 ± 0.756
ProCys: 2.915 ± 1.311
ProAsp: 0.972 ± 0.776
ProGlu: 0.972 ± 1.152
ProPhe: 5.831 ± 1.29
ProGly: 2.915 ± 1.049
ProHis: 1.944 ± 0.767
ProIle: 0.972 ± 0.969
ProLys: 0.972 ± 0.757
ProLeu: 2.915 ± 1.344
ProMet: 0.0 ± 0.0
ProAsn: 2.915 ± 1.311
ProPro: 7.775 ± 2.389
ProGln: 0.972 ± 0.757
ProArg: 2.915 ± 2.466
ProSer: 10.69 ± 2.0
ProThr: 4.859 ± 1.46
ProVal: 4.859 ± 1.94
ProTrp: 0.972 ± 0.776
ProTyr: 2.915 ± 1.344
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 7.775 ± 2.083
GlnCys: 1.944 ± 0.767
GlnAsp: 1.944 ± 1.193
GlnGlu: 2.915 ± 1.523
GlnPhe: 1.944 ± 0.767
GlnGly: 0.972 ± 0.757
GlnHis: 0.972 ± 0.757
GlnIle: 0.0 ± 0.0
GlnLys: 0.972 ± 0.969
GlnLeu: 0.972 ± 0.776
GlnMet: 0.0 ± 0.804
GlnAsn: 3.887 ± 1.338
GlnPro: 1.944 ± 0.865
GlnGln: 2.915 ± 1.344
GlnArg: 1.944 ± 0.865
GlnSer: 1.944 ± 0.767
GlnThr: 0.972 ± 0.969
GlnVal: 4.859 ± 0.858
GlnTrp: 0.972 ± 0.776
GlnTyr: 1.944 ± 1.193
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.972 ± 0.969
ArgCys: 0.0 ± 0.0
ArgAsp: 0.972 ± 0.757
ArgGlu: 7.775 ± 1.178
ArgPhe: 2.915 ± 1.674
ArgGly: 6.803 ± 1.773
ArgHis: 0.0 ± 0.0
ArgIle: 0.0 ± 0.0
ArgLys: 6.803 ± 1.773
ArgLeu: 8.746 ± 2.733
ArgMet: 0.0 ± 0.0
ArgAsn: 1.944 ± 0.767
ArgPro: 0.0 ± 0.0
ArgGln: 0.0 ± 0.0
ArgArg: 4.859 ± 1.79
ArgSer: 7.775 ± 2.333
ArgThr: 7.775 ± 1.65
ArgVal: 1.944 ± 0.767
ArgTrp: 0.972 ± 0.969
ArgTyr: 0.972 ± 0.757
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.746 ± 2.556
SerCys: 0.0 ± 0.0
SerAsp: 1.944 ± 1.938
SerGlu: 3.887 ± 0.825
SerPhe: 2.915 ± 2.287
SerGly: 8.746 ± 7.399
SerHis: 2.915 ± 0.5
SerIle: 5.831 ± 1.883
SerLys: 2.915 ± 0.5
SerLeu: 3.887 ± 2.386
SerMet: 0.972 ± 0.615
SerAsn: 0.0 ± 0.0
SerPro: 1.944 ± 0.767
SerGln: 9.718 ± 0.593
SerArg: 8.746 ± 1.677
SerSer: 7.775 ± 4.849
SerThr: 8.746 ± 2.222
SerVal: 5.831 ± 1.412
SerTrp: 0.972 ± 0.757
SerTyr: 1.944 ± 0.767
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.831 ± 2.463
ThrCys: 0.972 ± 1.152
ThrAsp: 0.972 ± 0.969
ThrGlu: 4.859 ± 1.692
ThrPhe: 1.944 ± 1.938
ThrGly: 3.887 ± 1.338
ThrHis: 0.0 ± 0.0
ThrIle: 0.972 ± 0.969
ThrLys: 5.831 ± 1.0
ThrLeu: 0.0 ± 0.0
ThrMet: 0.0 ± 0.0
ThrAsn: 0.972 ± 0.969
ThrPro: 1.944 ± 1.938
ThrGln: 1.944 ± 1.478
ThrArg: 4.859 ± 1.181
ThrSer: 8.746 ± 2.63
ThrThr: 5.831 ± 2.211
ThrVal: 2.915 ± 1.232
ThrTrp: 1.944 ± 1.938
ThrTyr: 6.803 ± 1.549
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.887 ± 0.825
ValCys: 2.915 ± 0.5
ValAsp: 3.887 ± 1.533
ValGlu: 6.803 ± 2.043
ValPhe: 3.887 ± 0.786
ValGly: 5.831 ± 1.29
ValHis: 0.0 ± 0.0
ValIle: 5.831 ± 1.126
ValLys: 3.887 ± 1.338
ValLeu: 7.775 ± 1.877
ValMet: 0.0 ± 0.0
ValAsn: 1.944 ± 1.513
ValPro: 4.859 ± 0.756
ValGln: 4.859 ± 0.858
ValArg: 3.887 ± 0.825
ValSer: 6.803 ± 2.158
ValThr: 3.887 ± 2.344
ValVal: 2.915 ± 1.232
ValTrp: 0.0 ± 0.0
ValTyr: 0.972 ± 0.969
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.915 ± 1.311
TrpCys: 0.0 ± 0.0
TrpAsp: 1.944 ± 0.767
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.972 ± 0.757
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.972 ± 0.776
TrpLys: 2.915 ± 1.674
TrpLeu: 2.915 ± 0.5
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 2.915 ± 1.674
TrpGln: 0.972 ± 0.757
TrpArg: 0.0 ± 0.0
TrpSer: 0.972 ± 0.969
TrpThr: 1.944 ± 1.014
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.972 ± 0.776
TyrCys: 0.972 ± 0.969
TyrAsp: 0.972 ± 0.969
TyrGlu: 0.0 ± 0.0
TyrPhe: 0.972 ± 0.969
TyrGly: 0.972 ± 0.757
TyrHis: 1.944 ± 0.767
TyrIle: 4.859 ± 2.01
TyrLys: 1.944 ± 1.478
TyrLeu: 2.915 ± 0.5
TyrMet: 3.887 ± 1.998
TyrAsn: 0.972 ± 0.757
TyrPro: 0.972 ± 1.152
TyrGln: 0.0 ± 0.0
TyrArg: 0.972 ± 1.152
TyrSer: 6.803 ± 2.782
TyrThr: 3.887 ± 1.338
TyrVal: 3.887 ± 1.533
TyrTrp: 0.972 ± 0.757
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1030 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
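One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. This is an assumption-laden illustration rather than the database's actual procedure; the function names, the fixed seed, the use of the population standard deviation, and the toy sequences are all choices made for this example.

```python
import random
import statistics


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    total = sum(max(len(s) - 1, 0) for s in sequences)
    hits = sum(
        1 for s in sequences for i in range(len(s) - 1) if s[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the frequency when proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_per_mille(sample, pair))
    # Assumption: population standard deviation of the bootstrap estimates.
    return statistics.pstdev(estimates)


# Toy sequences, not the four proteins of this proteome.
proteins = ["MKKLA", "AAAA", "KKKK", "MLLA"]
print(dipeptide_per_mille(proteins, "KK"))
print(bootstrap_error(proteins, "KK"))
```

With only a handful of proteins, as here, each resampled set can differ strongly from the original, which is consistent with errors in the table that are often of the same order as the values themselves.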

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski