Amino acid dipeptide frequency for Wenzhou tombus-like virus 17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
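As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of protein sequences, converts the counts to per mille, and labels each dipeptide relative to the uniform expectation of 2.5‰. The function names `dipeptide_permille` and `classify`, the one-letter alphabet, and the normalization by the total number of overlapping pairs are assumptions made for illustration; the database may compute its values differently.

```python
from collections import Counter
from itertools import product

# 20 standard residues (one-letter codes) plus X for unknown/non-standard ones.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_permille(sequences, alphabet=ALPHABET):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: frequencies are normalized by the total number of
    overlapping pairs found in all sequences.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the alphabet to X (Xaa).
        seq = "".join(c if c in alphabet else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


def classify(freqs_permille, n_standard=20):
    """Label each dipeptide as over- or underrepresented.

    The reference value is the uniform expectation 1000 / 20**2 = 2.5 per mille.
    """
    expected = 1000.0 / n_standard ** 2
    labels = {}
    for pair, value in freqs_permille.items():
        if value > expected:
            labels[pair] = "over"
        elif value < expected:
            labels[pair] = "under"
        else:
            labels[pair] = "expected"
    return labels
```

For example, `classify(dipeptide_permille(["MKTLLSA", "MSLKA"]))` returns a dictionary with one label for each of the 441 dipeptides.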

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
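The unit conversion can be sketched in a few lines of Python. The helper names `permille_to_fraction` and `permille_to_percent` are hypothetical and used only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value_permille / 10.0


# The uniform expectation of 0.25% corresponds to 2.5 per mille.
uniform_fraction = permille_to_fraction(2.5)
uniform_percent = permille_to_percent(2.5)
```

With these helpers, a table value such as SerLeu at 13.12‰ corresponds to a fraction of about 0.01312, or about 1.312%.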

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.0 ± 0.0
AlaAsp: 5.831 ± 0.326
AlaGlu: 0.0 ± 0.0
AlaPhe: 4.373 ± 0.39
AlaGly: 0.0 ± 0.0
AlaHis: 4.373 ± 2.148
AlaIle: 1.458 ± 1.822
AlaLys: 1.458 ± 0.716
AlaLeu: 0.0 ± 0.0
AlaMet: 1.458 ± 1.822
AlaAsn: 0.0 ± 0.0
AlaPro: 0.0 ± 0.0
AlaGln: 2.915 ± 3.645
AlaArg: 2.915 ± 1.106
AlaSer: 4.373 ± 0.39
AlaThr: 2.915 ± 1.432
AlaVal: 1.458 ± 0.716
AlaTrp: 2.915 ± 1.106
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.458 ± 0.716
CysPhe: 1.458 ± 0.716
CysGly: 1.458 ± 0.716
CysHis: 0.0 ± 0.0
CysIle: 2.915 ± 1.106
CysLys: 2.915 ± 1.432
CysLeu: 1.458 ± 0.716
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 2.915 ± 1.106
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 4.373 ± 2.929
CysThr: 2.915 ± 1.106
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.915 ± 1.106
AspCys: 0.0 ± 0.0
AspAsp: 2.915 ± 1.106
AspGlu: 2.915 ± 1.106
AspPhe: 2.915 ± 1.106
AspGly: 2.915 ± 1.432
AspHis: 1.458 ± 0.716
AspIle: 7.289 ± 1.042
AspLys: 5.831 ± 0.326
AspLeu: 1.458 ± 0.716
AspMet: 1.458 ± 1.822
AspAsn: 1.458 ± 1.822
AspPro: 2.915 ± 1.106
AspGln: 1.458 ± 0.716
AspArg: 5.831 ± 2.213
AspSer: 1.458 ± 0.716
AspThr: 2.915 ± 1.106
AspVal: 1.458 ± 0.716
AspTrp: 0.0 ± 0.0
AspTyr: 10.204 ± 0.065
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 0.0 ± 0.0
GluAsp: 2.915 ± 1.432
GluGlu: 2.915 ± 1.432
GluPhe: 4.373 ± 2.929
GluGly: 1.458 ± 0.716
GluHis: 4.373 ± 2.148
GluIle: 2.915 ± 1.106
GluLys: 2.915 ± 1.432
GluLeu: 7.289 ± 1.042
GluMet: 4.373 ± 2.148
GluAsn: 1.458 ± 0.716
GluPro: 0.0 ± 0.0
GluGln: 1.458 ± 0.716
GluArg: 7.289 ± 1.497
GluSer: 2.915 ± 1.432
GluThr: 1.458 ± 0.716
GluVal: 5.831 ± 2.864
GluTrp: 1.458 ± 0.716
GluTyr: 2.915 ± 1.106
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 2.915 ± 1.106
PheAsp: 2.915 ± 1.432
PheGlu: 2.915 ± 1.106
PhePhe: 0.0 ± 0.0
PheGly: 2.915 ± 1.106
PheHis: 0.0 ± 0.0
PheIle: 2.915 ± 1.106
PheLys: 7.289 ± 1.042
PheLeu: 5.831 ± 2.213
PheMet: 1.458 ± 1.822
PheAsn: 1.458 ± 0.716
PhePro: 2.915 ± 3.645
PheGln: 0.0 ± 0.0
PheArg: 1.458 ± 1.822
PheSer: 5.831 ± 0.326
PheThr: 5.831 ± 2.864
PheVal: 4.373 ± 2.148
PheTrp: 1.458 ± 1.822
PheTyr: 2.915 ± 1.106
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.458 ± 0.716
GlyCys: 1.458 ± 0.716
GlyAsp: 1.458 ± 0.716
GlyGlu: 2.915 ± 1.106
GlyPhe: 1.458 ± 0.716
GlyGly: 0.0 ± 0.0
GlyHis: 0.0 ± 0.0
GlyIle: 5.831 ± 0.326
GlyLys: 0.0 ± 0.0
GlyLeu: 7.289 ± 3.58
GlyMet: 1.458 ± 1.822
GlyAsn: 4.373 ± 2.148
GlyPro: 2.915 ± 1.106
GlyGln: 0.0 ± 0.0
GlyArg: 1.458 ± 1.822
GlySer: 1.458 ± 0.716
GlyThr: 1.458 ± 1.822
GlyVal: 2.915 ± 1.432
GlyTrp: 1.458 ± 0.716
GlyTyr: 1.458 ± 1.822
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.915 ± 1.106
HisCys: 0.0 ± 0.0
HisAsp: 1.458 ± 0.716
HisGlu: 1.458 ± 0.716
HisPhe: 1.458 ± 1.822
HisGly: 1.458 ± 0.716
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 4.373 ± 2.148
HisLeu: 0.0 ± 0.0
HisMet: 1.458 ± 0.716
HisAsn: 1.458 ± 1.822
HisPro: 0.0 ± 0.0
HisGln: 1.458 ± 0.716
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 1.458 ± 1.822
HisVal: 4.373 ± 2.929
HisTrp: 0.0 ± 0.0
HisTyr: 2.915 ± 1.432
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.915 ± 1.432
IleCys: 1.458 ± 1.822
IleAsp: 1.458 ± 1.822
IleGlu: 4.373 ± 2.148
IlePhe: 2.915 ± 1.432
IleGly: 1.458 ± 0.716
IleHis: 2.915 ± 1.106
IleIle: 4.373 ± 0.39
IleLys: 7.289 ± 1.042
IleLeu: 11.662 ± 4.425
IleMet: 2.915 ± 1.106
IleAsn: 8.746 ± 0.781
IlePro: 2.915 ± 1.432
IleGln: 7.289 ± 1.042
IleArg: 2.915 ± 1.432
IleSer: 2.915 ± 1.432
IleThr: 1.458 ± 0.716
IleVal: 0.0 ± 0.0
IleTrp: 0.0 ± 0.0
IleTyr: 2.915 ± 1.432
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.458 ± 0.716
LysCys: 2.915 ± 1.432
LysAsp: 2.915 ± 1.432
LysGlu: 5.831 ± 0.326
LysPhe: 1.458 ± 0.716
LysGly: 0.0 ± 0.0
LysHis: 4.373 ± 0.39
LysIle: 5.831 ± 2.864
LysLys: 4.373 ± 2.148
LysLeu: 7.289 ± 3.58
LysMet: 1.458 ± 0.716
LysAsn: 2.915 ± 1.432
LysPro: 1.458 ± 0.716
LysGln: 4.373 ± 0.39
LysArg: 1.458 ± 0.716
LysSer: 5.831 ± 2.864
LysThr: 10.204 ± 0.065
LysVal: 4.373 ± 2.148
LysTrp: 1.458 ± 0.716
LysTyr: 2.915 ± 1.106
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.662 ± 1.887
LeuCys: 0.0 ± 0.0
LeuAsp: 5.831 ± 0.326
LeuGlu: 4.373 ± 2.148
LeuPhe: 2.915 ± 1.432
LeuGly: 2.915 ± 1.432
LeuHis: 0.0 ± 0.0
LeuIle: 7.289 ± 1.042
LeuLys: 11.662 ± 0.651
LeuLeu: 8.746 ± 0.781
LeuMet: 2.915 ± 1.432
LeuAsn: 4.373 ± 2.148
LeuPro: 1.458 ± 0.716
LeuGln: 2.915 ± 1.106
LeuArg: 5.831 ± 0.326
LeuSer: 7.289 ± 1.497
LeuThr: 5.831 ± 2.864
LeuVal: 4.373 ± 2.929
LeuTrp: 0.0 ± 0.0
LeuTyr: 4.373 ± 2.929
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 2.915 ± 1.106
MetAsp: 0.0 ± 0.0
MetGlu: 2.915 ± 1.106
MetPhe: 2.915 ± 1.432
MetGly: 1.458 ± 0.716
MetHis: 0.0 ± 0.0
MetIle: 1.458 ± 0.716
MetLys: 1.458 ± 0.716
MetLeu: 1.458 ± 1.822
MetMet: 0.0 ± 0.0
MetAsn: 1.458 ± 0.716
MetPro: 2.915 ± 1.106
MetGln: 1.458 ± 1.822
MetArg: 0.0 ± 0.0
MetSer: 4.373 ± 2.148
MetThr: 0.0 ± 0.0
MetVal: 2.915 ± 1.106
MetTrp: 0.0 ± 0.0
MetTyr: 2.915 ± 1.106
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.0 ± 0.0
AsnCys: 1.458 ± 1.822
AsnAsp: 5.831 ± 0.326
AsnGlu: 0.0 ± 0.0
AsnPhe: 2.915 ± 1.106
AsnGly: 5.831 ± 2.213
AsnHis: 0.0 ± 0.0
AsnIle: 1.458 ± 0.716
AsnLys: 1.458 ± 0.716
AsnLeu: 7.289 ± 1.042
AsnMet: 1.458 ± 0.716
AsnAsn: 2.915 ± 1.432
AsnPro: 1.458 ± 0.716
AsnGln: 0.0 ± 0.0
AsnArg: 1.458 ± 0.716
AsnSer: 8.746 ± 1.758
AsnThr: 4.373 ± 2.929
AsnVal: 4.373 ± 2.148
AsnTrp: 1.458 ± 0.716
AsnTyr: 2.915 ± 1.432
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.458 ± 1.822
ProCys: 0.0 ± 0.0
ProAsp: 0.0 ± 0.0
ProGlu: 2.915 ± 1.432
ProPhe: 2.915 ± 3.645
ProGly: 2.915 ± 1.106
ProHis: 1.458 ± 1.822
ProIle: 10.204 ± 2.474
ProLys: 1.458 ± 0.716
ProLeu: 0.0 ± 0.0
ProMet: 0.0 ± 0.0
ProAsn: 1.458 ± 0.716
ProPro: 2.915 ± 1.432
ProGln: 1.458 ± 1.822
ProArg: 4.373 ± 0.39
ProSer: 2.915 ± 1.432
ProThr: 0.0 ± 0.0
ProVal: 0.0 ± 0.0
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.0 ± 0.0
GlnCys: 1.458 ± 1.822
GlnAsp: 1.458 ± 0.716
GlnGlu: 2.915 ± 1.432
GlnPhe: 0.0 ± 0.0
GlnGly: 0.0 ± 0.0
GlnHis: 5.831 ± 2.213
GlnIle: 1.458 ± 0.716
GlnLys: 0.0 ± 0.0
GlnLeu: 4.373 ± 0.39
GlnMet: 1.458 ± 1.076
GlnAsn: 2.915 ± 1.432
GlnPro: 1.458 ± 1.822
GlnGln: 1.458 ± 1.822
GlnArg: 1.458 ± 1.822
GlnSer: 4.373 ± 2.929
GlnThr: 0.0 ± 0.0
GlnVal: 1.458 ± 0.716
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.915 ± 1.432
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.0 ± 0.0
ArgCys: 1.458 ± 0.716
ArgAsp: 2.915 ± 1.106
ArgGlu: 4.373 ± 0.39
ArgPhe: 1.458 ± 1.822
ArgGly: 5.831 ± 2.864
ArgHis: 0.0 ± 0.0
ArgIle: 2.915 ± 1.106
ArgLys: 4.373 ± 2.148
ArgLeu: 4.373 ± 2.148
ArgMet: 2.915 ± 1.893
ArgAsn: 2.915 ± 1.106
ArgPro: 1.458 ± 0.716
ArgGln: 1.458 ± 1.822
ArgArg: 5.831 ± 2.213
ArgSer: 2.915 ± 1.106
ArgThr: 0.0 ± 0.0
ArgVal: 2.915 ± 3.645
ArgTrp: 0.0 ± 0.0
ArgTyr: 5.831 ± 2.213
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.746 ± 1.758
SerCys: 1.458 ± 0.716
SerAsp: 4.373 ± 0.39
SerGlu: 4.373 ± 2.148
SerPhe: 5.831 ± 0.326
SerGly: 1.458 ± 0.716
SerHis: 1.458 ± 1.822
SerIle: 4.373 ± 0.39
SerLys: 4.373 ± 2.148
SerLeu: 13.12 ± 1.367
SerMet: 1.458 ± 1.822
SerAsn: 5.831 ± 0.326
SerPro: 0.0 ± 0.0
SerGln: 1.458 ± 0.716
SerArg: 1.458 ± 0.716
SerSer: 4.373 ± 0.39
SerThr: 4.373 ± 0.39
SerVal: 1.458 ± 0.716
SerTrp: 1.458 ± 0.716
SerTyr: 4.373 ± 0.39
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.458 ± 1.822
ThrCys: 0.0 ± 0.0
ThrAsp: 7.289 ± 1.497
ThrGlu: 1.458 ± 0.716
ThrPhe: 2.915 ± 1.106
ThrGly: 5.831 ± 4.751
ThrHis: 0.0 ± 0.0
ThrIle: 1.458 ± 0.716
ThrLys: 2.915 ± 1.432
ThrLeu: 1.458 ± 0.716
ThrMet: 0.0 ± 0.0
ThrAsn: 2.915 ± 1.106
ThrPro: 1.458 ± 0.716
ThrGln: 1.458 ± 0.716
ThrArg: 8.746 ± 4.296
ThrSer: 2.915 ± 1.432
ThrThr: 0.0 ± 0.0
ThrVal: 8.746 ± 3.319
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.458 ± 0.716
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.458 ± 1.822
ValCys: 0.0 ± 0.0
ValAsp: 5.831 ± 2.213
ValGlu: 5.831 ± 2.864
ValPhe: 4.373 ± 2.929
ValGly: 0.0 ± 0.0
ValHis: 0.0 ± 0.0
ValIle: 2.915 ± 1.432
ValLys: 5.831 ± 2.864
ValLeu: 5.831 ± 2.213
ValMet: 1.458 ± 0.716
ValAsn: 5.831 ± 0.326
ValPro: 0.0 ± 0.0
ValGln: 1.458 ± 0.716
ValArg: 1.458 ± 1.822
ValSer: 4.373 ± 0.39
ValThr: 4.373 ± 2.148
ValVal: 1.458 ± 0.716
ValTrp: 0.0 ± 0.0
ValTyr: 2.915 ± 1.106
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.458 ± 0.716
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.458 ± 0.716
TrpPhe: 1.458 ± 1.822
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 2.915 ± 1.432
TrpMet: 0.0 ± 0.0
TrpAsn: 1.458 ± 0.716
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.458 ± 0.716
TrpSer: 0.0 ± 0.0
TrpThr: 1.458 ± 1.822
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 2.915 ± 1.432
TyrAsp: 4.373 ± 5.467
TyrGlu: 2.915 ± 1.106
TyrPhe: 7.289 ± 3.58
TyrGly: 2.915 ± 1.432
TyrHis: 0.0 ± 0.0
TyrIle: 5.831 ± 2.213
TyrLys: 2.915 ± 1.106
TyrLeu: 2.915 ± 1.106
TyrMet: 1.458 ± 0.716
TyrAsn: 1.458 ± 1.822
TyrPro: 7.289 ± 1.042
TyrGln: 2.915 ± 1.106
TyrArg: 0.0 ± 0.0
TyrSer: 4.373 ± 2.148
TyrThr: 1.458 ± 1.822
TyrVal: 2.915 ± 1.106
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (687 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
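The page does not spell out the exact bootstrap procedure. A generic protein-level bootstrap can be sketched in Python as follows: whole proteins are resampled with replacement, dipeptide frequencies are recomputed for each resample, and the standard deviation across resamples is taken as the error. The function names `pair_permille` and `bootstrap_error`, the fixed seed, and the use of the sample standard deviation are assumptions for illustration, not the database's documented method.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences):
    """Dipeptide frequencies (per mille) over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Estimate the per-dipeptide error by resampling whole proteins.

    Each round draws len(proteins) proteins with replacement and
    recomputes the frequencies; the error is the standard deviation
    of each dipeptide's frequency across the rounds.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(n_rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_permille(resampled))
    # Collect every dipeptide seen in any round; absent ones count as 0.
    pairs = set().union(*samples)
    return {
        pair: statistics.stdev([s.get(pair, 0.0) for s in samples])
        for pair in sorted(pairs)
    }
```

With only a few proteins in a proteome, as here, the number of distinct resamples is small, so error estimates obtained this way are coarse.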

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski