Amino acid dipepetide frequency for Wenzhou picorna-like virus 54

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.745AlaAla: 6.745 ± 0.0
1.265AlaCys: 1.265 ± 0.0
5.059AlaAsp: 5.059 ± 0.0
3.794AlaGlu: 3.794 ± 0.0
5.481AlaPhe: 5.481 ± 0.0
5.902AlaGly: 5.902 ± 0.0
0.843AlaHis: 0.843 ± 0.0
3.794AlaIle: 3.794 ± 0.0
7.589AlaLys: 7.589 ± 0.0
5.481AlaLeu: 5.481 ± 0.0
1.686AlaMet: 1.686 ± 0.0
1.686AlaAsn: 1.686 ± 0.0
2.53AlaPro: 2.53 ± 0.0
0.843AlaGln: 0.843 ± 0.0
3.373AlaArg: 3.373 ± 0.0
2.951AlaSer: 2.951 ± 0.0
1.265AlaThr: 1.265 ± 0.0
3.794AlaVal: 3.794 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
2.53AlaTyr: 2.53 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.265CysAla: 1.265 ± 0.0
0.422CysCys: 0.422 ± 0.0
0.843CysAsp: 0.843 ± 0.0
0.843CysGlu: 0.843 ± 0.0
1.265CysPhe: 1.265 ± 0.0
2.951CysGly: 2.951 ± 0.0
1.265CysHis: 1.265 ± 0.0
0.843CysIle: 0.843 ± 0.0
1.686CysLys: 1.686 ± 0.0
2.951CysLeu: 2.951 ± 0.0
0.843CysMet: 0.843 ± 0.0
0.422CysAsn: 0.422 ± 0.0
0.422CysPro: 0.422 ± 0.0
1.686CysGln: 1.686 ± 0.0
0.843CysArg: 0.843 ± 0.0
1.265CysSer: 1.265 ± 0.0
1.265CysThr: 1.265 ± 0.0
1.686CysVal: 1.686 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.843CysTyr: 0.843 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.794AspAla: 3.794 ± 0.0
2.108AspCys: 2.108 ± 0.0
8.01AspAsp: 8.01 ± 0.0
5.481AspGlu: 5.481 ± 0.0
2.53AspPhe: 2.53 ± 0.0
5.059AspGly: 5.059 ± 0.0
1.265AspHis: 1.265 ± 0.0
1.686AspIle: 1.686 ± 0.0
3.794AspLys: 3.794 ± 0.0
6.324AspLeu: 6.324 ± 0.0
1.686AspMet: 1.686 ± 0.0
0.843AspAsn: 0.843 ± 0.0
5.481AspPro: 5.481 ± 0.0
5.059AspGln: 5.059 ± 0.0
1.265AspArg: 1.265 ± 0.0
8.01AspSer: 8.01 ± 0.0
2.951AspThr: 2.951 ± 0.0
5.902AspVal: 5.902 ± 0.0
0.843AspTrp: 0.843 ± 0.0
3.373AspTyr: 3.373 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.637GluAla: 4.637 ± 0.0
0.422GluCys: 0.422 ± 0.0
2.951GluAsp: 2.951 ± 0.0
3.794GluGlu: 3.794 ± 0.0
4.216GluPhe: 4.216 ± 0.0
5.059GluGly: 5.059 ± 0.0
0.843GluHis: 0.843 ± 0.0
2.108GluIle: 2.108 ± 0.0
3.373GluLys: 3.373 ± 0.0
5.059GluLeu: 5.059 ± 0.0
2.108GluMet: 2.108 ± 0.0
1.686GluAsn: 1.686 ± 0.0
1.265GluPro: 1.265 ± 0.0
3.373GluGln: 3.373 ± 0.0
3.794GluArg: 3.794 ± 0.0
5.059GluSer: 5.059 ± 0.0
2.951GluThr: 2.951 ± 0.0
4.216GluVal: 4.216 ± 0.0
0.0GluTrp: 0.0 ± 0.0
0.843GluTyr: 0.843 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.794PheAla: 3.794 ± 0.0
2.951PheCys: 2.951 ± 0.0
3.373PheAsp: 3.373 ± 0.0
1.686PheGlu: 1.686 ± 0.0
2.53PhePhe: 2.53 ± 0.0
3.373PheGly: 3.373 ± 0.0
0.843PheHis: 0.843 ± 0.0
2.53PheIle: 2.53 ± 0.0
2.951PheLys: 2.951 ± 0.0
4.637PheLeu: 4.637 ± 0.0
1.686PheMet: 1.686 ± 0.0
3.373PheAsn: 3.373 ± 0.0
2.951PhePro: 2.951 ± 0.0
1.265PheGln: 1.265 ± 0.0
2.53PheArg: 2.53 ± 0.0
3.373PheSer: 3.373 ± 0.0
2.53PheThr: 2.53 ± 0.0
2.951PheVal: 2.951 ± 0.0
2.108PheTrp: 2.108 ± 0.0
0.422PheTyr: 0.422 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.373GlyAla: 3.373 ± 0.0
1.686GlyCys: 1.686 ± 0.0
4.216GlyAsp: 4.216 ± 0.0
3.794GlyGlu: 3.794 ± 0.0
3.373GlyPhe: 3.373 ± 0.0
2.53GlyGly: 2.53 ± 0.0
1.265GlyHis: 1.265 ± 0.0
3.373GlyIle: 3.373 ± 0.0
5.059GlyLys: 5.059 ± 0.0
5.481GlyLeu: 5.481 ± 0.0
0.422GlyMet: 0.422 ± 0.0
5.059GlyAsn: 5.059 ± 0.0
2.53GlyPro: 2.53 ± 0.0
1.686GlyGln: 1.686 ± 0.0
3.373GlyArg: 3.373 ± 0.0
5.481GlySer: 5.481 ± 0.0
2.108GlyThr: 2.108 ± 0.0
3.794GlyVal: 3.794 ± 0.0
0.843GlyTrp: 0.843 ± 0.0
2.108GlyTyr: 2.108 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.422HisAla: 0.422 ± 0.0
0.0HisCys: 0.0 ± 0.0
2.108HisAsp: 2.108 ± 0.0
0.843HisGlu: 0.843 ± 0.0
0.843HisPhe: 0.843 ± 0.0
0.422HisGly: 0.422 ± 0.0
0.422HisHis: 0.422 ± 0.0
1.686HisIle: 1.686 ± 0.0
0.422HisLys: 0.422 ± 0.0
3.373HisLeu: 3.373 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.843HisAsn: 0.843 ± 0.0
1.265HisPro: 1.265 ± 0.0
0.422HisGln: 0.422 ± 0.0
0.422HisArg: 0.422 ± 0.0
1.686HisSer: 1.686 ± 0.0
1.265HisThr: 1.265 ± 0.0
1.686HisVal: 1.686 ± 0.0
0.422HisTrp: 0.422 ± 0.0
0.422HisTyr: 0.422 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.108IleAla: 2.108 ± 0.0
0.422IleCys: 0.422 ± 0.0
5.059IleAsp: 5.059 ± 0.0
2.53IleGlu: 2.53 ± 0.0
2.108IlePhe: 2.108 ± 0.0
3.794IleGly: 3.794 ± 0.0
1.686IleHis: 1.686 ± 0.0
2.53IleIle: 2.53 ± 0.0
3.373IleLys: 3.373 ± 0.0
4.637IleLeu: 4.637 ± 0.0
3.373IleMet: 3.373 ± 0.0
1.686IleAsn: 1.686 ± 0.0
2.53IlePro: 2.53 ± 0.0
0.843IleGln: 0.843 ± 0.0
1.265IleArg: 1.265 ± 0.0
2.951IleSer: 2.951 ± 0.0
1.686IleThr: 1.686 ± 0.0
2.53IleVal: 2.53 ± 0.0
0.422IleTrp: 0.422 ± 0.0
0.843IleTyr: 0.843 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.794LysAla: 3.794 ± 0.0
1.686LysCys: 1.686 ± 0.0
5.902LysAsp: 5.902 ± 0.0
3.373LysGlu: 3.373 ± 0.0
1.686LysPhe: 1.686 ± 0.0
2.108LysGly: 2.108 ± 0.0
2.108LysHis: 2.108 ± 0.0
2.53LysIle: 2.53 ± 0.0
4.637LysLys: 4.637 ± 0.0
4.216LysLeu: 4.216 ± 0.0
0.422LysMet: 0.422 ± 0.0
1.265LysAsn: 1.265 ± 0.0
3.373LysPro: 3.373 ± 0.0
2.53LysGln: 2.53 ± 0.0
6.324LysArg: 6.324 ± 0.0
3.794LysSer: 3.794 ± 0.0
3.794LysThr: 3.794 ± 0.0
4.637LysVal: 4.637 ± 0.0
0.422LysTrp: 0.422 ± 0.0
2.108LysTyr: 2.108 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.637LeuAla: 4.637 ± 0.0
2.53LeuCys: 2.53 ± 0.0
3.373LeuAsp: 3.373 ± 0.0
6.324LeuGlu: 6.324 ± 0.0
2.53LeuPhe: 2.53 ± 0.0
3.373LeuGly: 3.373 ± 0.0
2.108LeuHis: 2.108 ± 0.0
2.951LeuIle: 2.951 ± 0.0
5.481LeuLys: 5.481 ± 0.0
7.589LeuLeu: 7.589 ± 0.0
1.265LeuMet: 1.265 ± 0.0
3.373LeuAsn: 3.373 ± 0.0
3.794LeuPro: 3.794 ± 0.0
3.373LeuGln: 3.373 ± 0.0
8.01LeuArg: 8.01 ± 0.0
3.794LeuSer: 3.794 ± 0.0
7.167LeuThr: 7.167 ± 0.0
10.118LeuVal: 10.118 ± 0.0
0.843LeuTrp: 0.843 ± 0.0
2.108LeuTyr: 2.108 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.686MetAla: 1.686 ± 0.0
1.265MetCys: 1.265 ± 0.0
1.265MetAsp: 1.265 ± 0.0
2.53MetGlu: 2.53 ± 0.0
1.265MetPhe: 1.265 ± 0.0
0.422MetGly: 0.422 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.265MetIle: 1.265 ± 0.0
2.951MetLys: 2.951 ± 0.0
2.53MetLeu: 2.53 ± 0.0
0.843MetMet: 0.843 ± 0.0
1.686MetAsn: 1.686 ± 0.0
0.843MetPro: 0.843 ± 0.0
1.686MetGln: 1.686 ± 0.0
0.843MetArg: 0.843 ± 0.0
1.686MetSer: 1.686 ± 0.0
1.686MetThr: 1.686 ± 0.0
2.951MetVal: 2.951 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.422MetTyr: 0.422 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.265AsnAla: 1.265 ± 0.0
0.843AsnCys: 0.843 ± 0.0
1.686AsnAsp: 1.686 ± 0.0
0.843AsnGlu: 0.843 ± 0.0
2.951AsnPhe: 2.951 ± 0.0
3.373AsnGly: 3.373 ± 0.0
1.265AsnHis: 1.265 ± 0.0
1.686AsnIle: 1.686 ± 0.0
1.686AsnLys: 1.686 ± 0.0
3.373AsnLeu: 3.373 ± 0.0
1.265AsnMet: 1.265 ± 0.0
1.686AsnAsn: 1.686 ± 0.0
2.53AsnPro: 2.53 ± 0.0
1.686AsnGln: 1.686 ± 0.0
1.686AsnArg: 1.686 ± 0.0
1.686AsnSer: 1.686 ± 0.0
2.951AsnThr: 2.951 ± 0.0
3.794AsnVal: 3.794 ± 0.0
1.265AsnTrp: 1.265 ± 0.0
2.108AsnTyr: 2.108 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.373ProAla: 3.373 ± 0.0
0.843ProCys: 0.843 ± 0.0
4.637ProAsp: 4.637 ± 0.0
2.108ProGlu: 2.108 ± 0.0
2.53ProPhe: 2.53 ± 0.0
2.951ProGly: 2.951 ± 0.0
0.422ProHis: 0.422 ± 0.0
2.951ProIle: 2.951 ± 0.0
0.843ProLys: 0.843 ± 0.0
3.794ProLeu: 3.794 ± 0.0
2.53ProMet: 2.53 ± 0.0
1.686ProAsn: 1.686 ± 0.0
1.686ProPro: 1.686 ± 0.0
1.265ProGln: 1.265 ± 0.0
0.422ProArg: 0.422 ± 0.0
2.108ProSer: 2.108 ± 0.0
2.951ProThr: 2.951 ± 0.0
5.481ProVal: 5.481 ± 0.0
0.843ProTrp: 0.843 ± 0.0
1.265ProTyr: 1.265 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.108GlnAla: 2.108 ± 0.0
1.686GlnCys: 1.686 ± 0.0
2.108GlnAsp: 2.108 ± 0.0
0.843GlnGlu: 0.843 ± 0.0
1.265GlnPhe: 1.265 ± 0.0
2.108GlnGly: 2.108 ± 0.0
0.843GlnHis: 0.843 ± 0.0
2.53GlnIle: 2.53 ± 0.0
2.108GlnLys: 2.108 ± 0.0
1.265GlnLeu: 1.265 ± 0.0
1.265GlnMet: 1.265 ± 0.0
0.422GlnAsn: 0.422 ± 0.0
0.422GlnPro: 0.422 ± 0.0
0.422GlnGln: 0.422 ± 0.0
2.951GlnArg: 2.951 ± 0.0
5.059GlnSer: 5.059 ± 0.0
3.373GlnThr: 3.373 ± 0.0
5.059GlnVal: 5.059 ± 0.0
0.422GlnTrp: 0.422 ± 0.0
1.686GlnTyr: 1.686 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
5.059ArgAla: 5.059 ± 0.0
0.843ArgCys: 0.843 ± 0.0
2.108ArgAsp: 2.108 ± 0.0
4.216ArgGlu: 4.216 ± 0.0
3.794ArgPhe: 3.794 ± 0.0
2.53ArgGly: 2.53 ± 0.0
1.265ArgHis: 1.265 ± 0.0
3.373ArgIle: 3.373 ± 0.0
4.216ArgLys: 4.216 ± 0.0
4.216ArgLeu: 4.216 ± 0.0
1.686ArgMet: 1.686 ± 0.0
4.216ArgAsn: 4.216 ± 0.0
0.843ArgPro: 0.843 ± 0.0
3.373ArgGln: 3.373 ± 0.0
4.216ArgArg: 4.216 ± 0.0
3.373ArgSer: 3.373 ± 0.0
0.843ArgThr: 0.843 ± 0.0
3.373ArgVal: 3.373 ± 0.0
0.422ArgTrp: 0.422 ± 0.0
0.843ArgTyr: 0.843 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.216SerAla: 4.216 ± 0.0
1.265SerCys: 1.265 ± 0.0
6.745SerAsp: 6.745 ± 0.0
4.637SerGlu: 4.637 ± 0.0
4.637SerPhe: 4.637 ± 0.0
5.059SerGly: 5.059 ± 0.0
2.108SerHis: 2.108 ± 0.0
2.951SerIle: 2.951 ± 0.0
3.794SerLys: 3.794 ± 0.0
5.481SerLeu: 5.481 ± 0.0
1.686SerMet: 1.686 ± 0.0
3.373SerAsn: 3.373 ± 0.0
3.794SerPro: 3.794 ± 0.0
2.53SerGln: 2.53 ± 0.0
4.637SerArg: 4.637 ± 0.0
5.059SerSer: 5.059 ± 0.0
3.794SerThr: 3.794 ± 0.0
3.373SerVal: 3.373 ± 0.0
0.0SerTrp: 0.0 ± 0.0
2.108SerTyr: 2.108 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.216ThrAla: 4.216 ± 0.0
0.843ThrCys: 0.843 ± 0.0
4.216ThrAsp: 4.216 ± 0.0
1.686ThrGlu: 1.686 ± 0.0
3.373ThrPhe: 3.373 ± 0.0
3.794ThrGly: 3.794 ± 0.0
0.0ThrHis: 0.0 ± 0.0
2.53ThrIle: 2.53 ± 0.0
1.686ThrLys: 1.686 ± 0.0
4.637ThrLeu: 4.637 ± 0.0
0.0ThrMet: 0.0 ± 0.0
1.265ThrAsn: 1.265 ± 0.0
2.53ThrPro: 2.53 ± 0.0
0.843ThrGln: 0.843 ± 0.0
4.637ThrArg: 4.637 ± 0.0
4.216ThrSer: 4.216 ± 0.0
4.637ThrThr: 4.637 ± 0.0
4.216ThrVal: 4.216 ± 0.0
0.843ThrTrp: 0.843 ± 0.0
2.951ThrTyr: 2.951 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
7.589ValAla: 7.589 ± 0.0
0.843ValCys: 0.843 ± 0.0
7.589ValAsp: 7.589 ± 0.0
6.745ValGlu: 6.745 ± 0.0
3.794ValPhe: 3.794 ± 0.0
3.373ValGly: 3.373 ± 0.0
0.843ValHis: 0.843 ± 0.0
3.373ValIle: 3.373 ± 0.0
3.794ValLys: 3.794 ± 0.0
5.902ValLeu: 5.902 ± 0.0
2.108ValMet: 2.108 ± 0.0
2.53ValAsn: 2.53 ± 0.0
4.216ValPro: 4.216 ± 0.0
3.373ValGln: 3.373 ± 0.0
3.373ValArg: 3.373 ± 0.0
8.01ValSer: 8.01 ± 0.0
3.794ValThr: 3.794 ± 0.0
5.481ValVal: 5.481 ± 0.0
0.843ValTrp: 0.843 ± 0.0
2.53ValTyr: 2.53 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.265TrpAla: 1.265 ± 0.0
1.265TrpCys: 1.265 ± 0.0
0.843TrpAsp: 0.843 ± 0.0
0.422TrpGlu: 0.422 ± 0.0
0.843TrpPhe: 0.843 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.422TrpIle: 0.422 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.843TrpLeu: 0.843 ± 0.0
1.265TrpMet: 1.265 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.422TrpGln: 0.422 ± 0.0
0.422TrpArg: 0.422 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.422TrpThr: 0.422 ± 0.0
0.422TrpVal: 0.422 ± 0.0
0.422TrpTrp: 0.422 ± 0.0
1.686TrpTyr: 1.686 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.265TyrAla: 1.265 ± 0.0
0.422TyrCys: 0.422 ± 0.0
3.373TyrAsp: 3.373 ± 0.0
1.265TyrGlu: 1.265 ± 0.0
0.843TyrPhe: 0.843 ± 0.0
3.373TyrGly: 3.373 ± 0.0
0.0TyrHis: 0.0 ± 0.0
1.265TyrIle: 1.265 ± 0.0
1.265TyrLys: 1.265 ± 0.0
2.951TyrLeu: 2.951 ± 0.0
1.265TyrMet: 1.265 ± 0.0
2.53TyrAsn: 2.53 ± 0.0
1.686TyrPro: 1.686 ± 0.0
1.265TyrGln: 1.265 ± 0.0
0.422TyrArg: 0.422 ± 0.0
1.686TyrSer: 1.686 ± 0.0
1.686TyrThr: 1.686 ± 0.0
4.637TyrVal: 4.637 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
0.843TyrTyr: 0.843 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2373 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski