Amino acid dipepetide frequency for Hubei picorna-like virus 33

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.783AlaAla: 3.783 ± 0.0
0.688AlaCys: 0.688 ± 0.0
3.439AlaAsp: 3.439 ± 0.0
6.19AlaGlu: 6.19 ± 0.0
3.095AlaPhe: 3.095 ± 0.0
4.814AlaGly: 4.814 ± 0.0
0.344AlaHis: 0.344 ± 0.0
3.095AlaIle: 3.095 ± 0.0
4.127AlaLys: 4.127 ± 0.0
4.814AlaLeu: 4.814 ± 0.0
2.407AlaMet: 2.407 ± 0.0
2.407AlaAsn: 2.407 ± 0.0
2.751AlaPro: 2.751 ± 0.0
4.127AlaGln: 4.127 ± 0.0
3.095AlaArg: 3.095 ± 0.0
4.127AlaSer: 4.127 ± 0.0
3.783AlaThr: 3.783 ± 0.0
3.783AlaVal: 3.783 ± 0.0
0.688AlaTrp: 0.688 ± 0.0
3.095AlaTyr: 3.095 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.032CysAla: 1.032 ± 0.0
0.344CysCys: 0.344 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.688CysGlu: 0.688 ± 0.0
1.032CysPhe: 1.032 ± 0.0
1.376CysGly: 1.376 ± 0.0
0.688CysHis: 0.688 ± 0.0
1.376CysIle: 1.376 ± 0.0
1.032CysLys: 1.032 ± 0.0
2.751CysLeu: 2.751 ± 0.0
0.344CysMet: 0.344 ± 0.0
1.032CysAsn: 1.032 ± 0.0
0.688CysPro: 0.688 ± 0.0
0.344CysGln: 0.344 ± 0.0
0.344CysArg: 0.344 ± 0.0
1.376CysSer: 1.376 ± 0.0
0.344CysThr: 0.344 ± 0.0
0.344CysVal: 0.344 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.344CysTyr: 0.344 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.814AspAla: 4.814 ± 0.0
0.688AspCys: 0.688 ± 0.0
3.439AspAsp: 3.439 ± 0.0
2.407AspGlu: 2.407 ± 0.0
2.407AspPhe: 2.407 ± 0.0
3.095AspGly: 3.095 ± 0.0
1.032AspHis: 1.032 ± 0.0
5.158AspIle: 5.158 ± 0.0
3.095AspLys: 3.095 ± 0.0
3.439AspLeu: 3.439 ± 0.0
1.376AspMet: 1.376 ± 0.0
2.407AspAsn: 2.407 ± 0.0
4.47AspPro: 4.47 ± 0.0
1.719AspGln: 1.719 ± 0.0
3.095AspArg: 3.095 ± 0.0
4.814AspSer: 4.814 ± 0.0
3.783AspThr: 3.783 ± 0.0
4.814AspVal: 4.814 ± 0.0
1.032AspTrp: 1.032 ± 0.0
2.407AspTyr: 2.407 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.783GluAla: 3.783 ± 0.0
0.344GluCys: 0.344 ± 0.0
2.751GluAsp: 2.751 ± 0.0
2.063GluGlu: 2.063 ± 0.0
1.719GluPhe: 1.719 ± 0.0
2.407GluGly: 2.407 ± 0.0
2.063GluHis: 2.063 ± 0.0
3.783GluIle: 3.783 ± 0.0
2.751GluLys: 2.751 ± 0.0
6.878GluLeu: 6.878 ± 0.0
1.032GluMet: 1.032 ± 0.0
1.032GluAsn: 1.032 ± 0.0
2.407GluPro: 2.407 ± 0.0
1.032GluGln: 1.032 ± 0.0
1.376GluArg: 1.376 ± 0.0
3.439GluSer: 3.439 ± 0.0
2.407GluThr: 2.407 ± 0.0
4.814GluVal: 4.814 ± 0.0
1.032GluTrp: 1.032 ± 0.0
1.376GluTyr: 1.376 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.095PheAla: 3.095 ± 0.0
1.032PheCys: 1.032 ± 0.0
2.407PheAsp: 2.407 ± 0.0
2.407PheGlu: 2.407 ± 0.0
1.032PhePhe: 1.032 ± 0.0
2.407PheGly: 2.407 ± 0.0
0.344PheHis: 0.344 ± 0.0
2.407PheIle: 2.407 ± 0.0
2.751PheLys: 2.751 ± 0.0
4.127PheLeu: 4.127 ± 0.0
1.376PheMet: 1.376 ± 0.0
1.719PheAsn: 1.719 ± 0.0
2.407PhePro: 2.407 ± 0.0
0.688PheGln: 0.688 ± 0.0
2.063PheArg: 2.063 ± 0.0
4.127PheSer: 4.127 ± 0.0
2.063PheThr: 2.063 ± 0.0
2.063PheVal: 2.063 ± 0.0
0.688PheTrp: 0.688 ± 0.0
2.407PheTyr: 2.407 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.376GlyAla: 1.376 ± 0.0
1.032GlyCys: 1.032 ± 0.0
3.783GlyAsp: 3.783 ± 0.0
2.751GlyGlu: 2.751 ± 0.0
1.376GlyPhe: 1.376 ± 0.0
1.719GlyGly: 1.719 ± 0.0
1.376GlyHis: 1.376 ± 0.0
2.063GlyIle: 2.063 ± 0.0
2.063GlyLys: 2.063 ± 0.0
4.47GlyLeu: 4.47 ± 0.0
1.376GlyMet: 1.376 ± 0.0
3.439GlyAsn: 3.439 ± 0.0
2.063GlyPro: 2.063 ± 0.0
3.095GlyGln: 3.095 ± 0.0
3.439GlyArg: 3.439 ± 0.0
4.47GlySer: 4.47 ± 0.0
2.407GlyThr: 2.407 ± 0.0
5.158GlyVal: 5.158 ± 0.0
1.032GlyTrp: 1.032 ± 0.0
3.783GlyTyr: 3.783 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
2.063HisAla: 2.063 ± 0.0
0.344HisCys: 0.344 ± 0.0
1.719HisAsp: 1.719 ± 0.0
0.344HisGlu: 0.344 ± 0.0
1.032HisPhe: 1.032 ± 0.0
1.032HisGly: 1.032 ± 0.0
0.688HisHis: 0.688 ± 0.0
1.376HisIle: 1.376 ± 0.0
1.032HisLys: 1.032 ± 0.0
2.063HisLeu: 2.063 ± 0.0
0.688HisMet: 0.688 ± 0.0
0.688HisAsn: 0.688 ± 0.0
1.719HisPro: 1.719 ± 0.0
0.688HisGln: 0.688 ± 0.0
0.344HisArg: 0.344 ± 0.0
2.407HisSer: 2.407 ± 0.0
1.719HisThr: 1.719 ± 0.0
1.032HisVal: 1.032 ± 0.0
0.344HisTrp: 0.344 ± 0.0
0.688HisTyr: 0.688 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.814IleAla: 4.814 ± 0.0
0.344IleCys: 0.344 ± 0.0
5.846IleAsp: 5.846 ± 0.0
3.439IleGlu: 3.439 ± 0.0
1.719IlePhe: 1.719 ± 0.0
4.47IleGly: 4.47 ± 0.0
3.439IleHis: 3.439 ± 0.0
5.158IleIle: 5.158 ± 0.0
6.19IleLys: 6.19 ± 0.0
4.814IleLeu: 4.814 ± 0.0
2.063IleMet: 2.063 ± 0.0
4.127IleAsn: 4.127 ± 0.0
5.846IlePro: 5.846 ± 0.0
1.032IleGln: 1.032 ± 0.0
4.127IleArg: 4.127 ± 0.0
3.783IleSer: 3.783 ± 0.0
4.127IleThr: 4.127 ± 0.0
3.783IleVal: 3.783 ± 0.0
0.344IleTrp: 0.344 ± 0.0
1.719IleTyr: 1.719 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.127LysAla: 4.127 ± 0.0
0.344LysCys: 0.344 ± 0.0
3.439LysAsp: 3.439 ± 0.0
1.719LysGlu: 1.719 ± 0.0
2.751LysPhe: 2.751 ± 0.0
2.063LysGly: 2.063 ± 0.0
3.095LysHis: 3.095 ± 0.0
5.158LysIle: 5.158 ± 0.0
2.751LysLys: 2.751 ± 0.0
3.439LysLeu: 3.439 ± 0.0
1.032LysMet: 1.032 ± 0.0
4.127LysAsn: 4.127 ± 0.0
4.814LysPro: 4.814 ± 0.0
1.376LysGln: 1.376 ± 0.0
2.063LysArg: 2.063 ± 0.0
5.502LysSer: 5.502 ± 0.0
3.783LysThr: 3.783 ± 0.0
5.158LysVal: 5.158 ± 0.0
1.719LysTrp: 1.719 ± 0.0
1.719LysTyr: 1.719 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
5.502LeuAla: 5.502 ± 0.0
1.719LeuCys: 1.719 ± 0.0
5.502LeuAsp: 5.502 ± 0.0
4.814LeuGlu: 4.814 ± 0.0
3.439LeuPhe: 3.439 ± 0.0
3.783LeuGly: 3.783 ± 0.0
1.376LeuHis: 1.376 ± 0.0
5.502LeuIle: 5.502 ± 0.0
6.19LeuLys: 6.19 ± 0.0
9.629LeuLeu: 9.629 ± 0.0
2.063LeuMet: 2.063 ± 0.0
4.47LeuAsn: 4.47 ± 0.0
4.47LeuPro: 4.47 ± 0.0
4.814LeuGln: 4.814 ± 0.0
5.502LeuArg: 5.502 ± 0.0
4.47LeuSer: 4.47 ± 0.0
6.878LeuThr: 6.878 ± 0.0
4.47LeuVal: 4.47 ± 0.0
1.376LeuTrp: 1.376 ± 0.0
2.407LeuTyr: 2.407 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
3.095MetAla: 3.095 ± 0.0
1.376MetCys: 1.376 ± 0.0
1.376MetAsp: 1.376 ± 0.0
1.376MetGlu: 1.376 ± 0.0
0.688MetPhe: 0.688 ± 0.0
1.376MetGly: 1.376 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.032MetIle: 1.032 ± 0.0
1.376MetLys: 1.376 ± 0.0
1.376MetLeu: 1.376 ± 0.0
0.688MetMet: 0.688 ± 0.0
1.376MetAsn: 1.376 ± 0.0
1.032MetPro: 1.032 ± 0.0
1.032MetGln: 1.032 ± 0.0
2.063MetArg: 2.063 ± 0.0
1.719MetSer: 1.719 ± 0.0
0.344MetThr: 0.344 ± 0.0
2.407MetVal: 2.407 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.032MetTyr: 1.032 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.376AsnAla: 1.376 ± 0.0
1.032AsnCys: 1.032 ± 0.0
2.751AsnAsp: 2.751 ± 0.0
1.719AsnGlu: 1.719 ± 0.0
3.095AsnPhe: 3.095 ± 0.0
1.719AsnGly: 1.719 ± 0.0
0.344AsnHis: 0.344 ± 0.0
4.814AsnIle: 4.814 ± 0.0
3.783AsnLys: 3.783 ± 0.0
3.095AsnLeu: 3.095 ± 0.0
1.032AsnMet: 1.032 ± 0.0
3.095AsnAsn: 3.095 ± 0.0
4.127AsnPro: 4.127 ± 0.0
0.688AsnGln: 0.688 ± 0.0
2.063AsnArg: 2.063 ± 0.0
3.095AsnSer: 3.095 ± 0.0
3.439AsnThr: 3.439 ± 0.0
4.127AsnVal: 4.127 ± 0.0
1.032AsnTrp: 1.032 ± 0.0
2.751AsnTyr: 2.751 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.032ProAla: 1.032 ± 0.0
1.376ProCys: 1.376 ± 0.0
2.751ProAsp: 2.751 ± 0.0
2.407ProGlu: 2.407 ± 0.0
2.751ProPhe: 2.751 ± 0.0
3.783ProGly: 3.783 ± 0.0
1.032ProHis: 1.032 ± 0.0
4.127ProIle: 4.127 ± 0.0
2.751ProLys: 2.751 ± 0.0
5.502ProLeu: 5.502 ± 0.0
1.376ProMet: 1.376 ± 0.0
2.063ProAsn: 2.063 ± 0.0
3.783ProPro: 3.783 ± 0.0
2.407ProGln: 2.407 ± 0.0
3.439ProArg: 3.439 ± 0.0
4.127ProSer: 4.127 ± 0.0
5.158ProThr: 5.158 ± 0.0
2.063ProVal: 2.063 ± 0.0
1.719ProTrp: 1.719 ± 0.0
4.814ProTyr: 4.814 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.063GlnAla: 2.063 ± 0.0
0.688GlnCys: 0.688 ± 0.0
0.344GlnAsp: 0.344 ± 0.0
1.032GlnGlu: 1.032 ± 0.0
1.376GlnPhe: 1.376 ± 0.0
1.719GlnGly: 1.719 ± 0.0
0.688GlnHis: 0.688 ± 0.0
4.47GlnIle: 4.47 ± 0.0
2.751GlnLys: 2.751 ± 0.0
2.751GlnLeu: 2.751 ± 0.0
1.032GlnMet: 1.032 ± 0.0
1.376GlnAsn: 1.376 ± 0.0
2.063GlnPro: 2.063 ± 0.0
2.063GlnGln: 2.063 ± 0.0
1.719GlnArg: 1.719 ± 0.0
1.719GlnSer: 1.719 ± 0.0
2.063GlnThr: 2.063 ± 0.0
1.376GlnVal: 1.376 ± 0.0
0.344GlnTrp: 0.344 ± 0.0
1.032GlnTyr: 1.032 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.751ArgAla: 2.751 ± 0.0
0.688ArgCys: 0.688 ± 0.0
4.814ArgAsp: 4.814 ± 0.0
1.719ArgGlu: 1.719 ± 0.0
2.751ArgPhe: 2.751 ± 0.0
3.095ArgGly: 3.095 ± 0.0
0.0ArgHis: 0.0 ± 0.0
3.439ArgIle: 3.439 ± 0.0
3.783ArgLys: 3.783 ± 0.0
4.127ArgLeu: 4.127 ± 0.0
0.688ArgMet: 0.688 ± 0.0
2.407ArgAsn: 2.407 ± 0.0
3.783ArgPro: 3.783 ± 0.0
1.032ArgGln: 1.032 ± 0.0
2.751ArgArg: 2.751 ± 0.0
4.127ArgSer: 4.127 ± 0.0
2.407ArgThr: 2.407 ± 0.0
2.407ArgVal: 2.407 ± 0.0
1.032ArgTrp: 1.032 ± 0.0
3.095ArgTyr: 3.095 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
2.407SerAla: 2.407 ± 0.0
0.344SerCys: 0.344 ± 0.0
3.439SerAsp: 3.439 ± 0.0
4.127SerGlu: 4.127 ± 0.0
3.095SerPhe: 3.095 ± 0.0
3.439SerGly: 3.439 ± 0.0
0.688SerHis: 0.688 ± 0.0
3.439SerIle: 3.439 ± 0.0
3.095SerLys: 3.095 ± 0.0
7.565SerLeu: 7.565 ± 0.0
1.376SerMet: 1.376 ± 0.0
6.19SerAsn: 6.19 ± 0.0
2.751SerPro: 2.751 ± 0.0
2.407SerGln: 2.407 ± 0.0
3.783SerArg: 3.783 ± 0.0
5.502SerSer: 5.502 ± 0.0
7.565SerThr: 7.565 ± 0.0
4.127SerVal: 4.127 ± 0.0
0.688SerTrp: 0.688 ± 0.0
4.127SerTyr: 4.127 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.502ThrAla: 5.502 ± 0.0
0.688ThrCys: 0.688 ± 0.0
4.47ThrAsp: 4.47 ± 0.0
3.095ThrGlu: 3.095 ± 0.0
2.751ThrPhe: 2.751 ± 0.0
4.127ThrGly: 4.127 ± 0.0
2.063ThrHis: 2.063 ± 0.0
4.127ThrIle: 4.127 ± 0.0
4.47ThrLys: 4.47 ± 0.0
5.158ThrLeu: 5.158 ± 0.0
2.751ThrMet: 2.751 ± 0.0
2.407ThrAsn: 2.407 ± 0.0
4.814ThrPro: 4.814 ± 0.0
0.688ThrGln: 0.688 ± 0.0
1.376ThrArg: 1.376 ± 0.0
3.439ThrSer: 3.439 ± 0.0
5.502ThrThr: 5.502 ± 0.0
5.158ThrVal: 5.158 ± 0.0
0.688ThrTrp: 0.688 ± 0.0
2.063ThrTyr: 2.063 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
7.565ValAla: 7.565 ± 0.0
0.344ValCys: 0.344 ± 0.0
3.095ValAsp: 3.095 ± 0.0
3.095ValGlu: 3.095 ± 0.0
3.095ValPhe: 3.095 ± 0.0
2.407ValGly: 2.407 ± 0.0
2.063ValHis: 2.063 ± 0.0
4.47ValIle: 4.47 ± 0.0
2.751ValLys: 2.751 ± 0.0
6.19ValLeu: 6.19 ± 0.0
1.719ValMet: 1.719 ± 0.0
2.751ValAsn: 2.751 ± 0.0
3.783ValPro: 3.783 ± 0.0
2.407ValGln: 2.407 ± 0.0
3.783ValArg: 3.783 ± 0.0
4.47ValSer: 4.47 ± 0.0
2.751ValThr: 2.751 ± 0.0
3.439ValVal: 3.439 ± 0.0
0.344ValTrp: 0.344 ± 0.0
4.127ValTyr: 4.127 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.032TrpAla: 1.032 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.344TrpGlu: 0.344 ± 0.0
0.688TrpPhe: 0.688 ± 0.0
0.344TrpGly: 0.344 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.719TrpIle: 1.719 ± 0.0
1.032TrpLys: 1.032 ± 0.0
2.407TrpLeu: 2.407 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.032TrpAsn: 1.032 ± 0.0
0.344TrpPro: 0.344 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.376TrpArg: 1.376 ± 0.0
1.376TrpSer: 1.376 ± 0.0
0.688TrpThr: 0.688 ± 0.0
0.344TrpVal: 0.344 ± 0.0
0.344TrpTrp: 0.344 ± 0.0
1.376TrpTyr: 1.376 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.439TyrAla: 3.439 ± 0.0
2.063TyrCys: 2.063 ± 0.0
4.127TyrAsp: 4.127 ± 0.0
2.751TyrGlu: 2.751 ± 0.0
1.719TyrPhe: 1.719 ± 0.0
3.095TyrGly: 3.095 ± 0.0
0.688TyrHis: 0.688 ± 0.0
4.47TyrIle: 4.47 ± 0.0
2.407TyrLys: 2.407 ± 0.0
3.783TyrLeu: 3.783 ± 0.0
0.0TyrMet: 0.0 ± 0.0
1.376TyrAsn: 1.376 ± 0.0
0.344TyrPro: 0.344 ± 0.0
1.032TyrGln: 1.032 ± 0.0
3.095TyrArg: 3.095 ± 0.0
2.063TyrSer: 2.063 ± 0.0
4.127TyrThr: 4.127 ± 0.0
3.783TyrVal: 3.783 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
1.376TyrTyr: 1.376 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2909 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski