Amino acid dipepetide frequency for Hubei picorna-like virus 41

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.79AlaAla: 4.79 ± 0.0
2.053AlaCys: 2.053 ± 0.0
2.395AlaAsp: 2.395 ± 0.0
3.421AlaGlu: 3.421 ± 0.0
1.711AlaPhe: 1.711 ± 0.0
6.5AlaGly: 6.5 ± 0.0
1.368AlaHis: 1.368 ± 0.0
3.763AlaIle: 3.763 ± 0.0
3.421AlaLys: 3.421 ± 0.0
4.79AlaLeu: 4.79 ± 0.0
0.684AlaMet: 0.684 ± 0.0
5.132AlaAsn: 5.132 ± 0.0
1.711AlaPro: 1.711 ± 0.0
3.079AlaGln: 3.079 ± 0.0
2.737AlaArg: 2.737 ± 0.0
3.421AlaSer: 3.421 ± 0.0
4.447AlaThr: 4.447 ± 0.0
3.079AlaVal: 3.079 ± 0.0
1.026AlaTrp: 1.026 ± 0.0
4.447AlaTyr: 4.447 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.368CysAla: 1.368 ± 0.0
0.342CysCys: 0.342 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.684CysGlu: 0.684 ± 0.0
0.342CysPhe: 0.342 ± 0.0
1.026CysGly: 1.026 ± 0.0
0.342CysHis: 0.342 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.368CysLys: 1.368 ± 0.0
2.053CysLeu: 2.053 ± 0.0
0.342CysMet: 0.342 ± 0.0
1.026CysAsn: 1.026 ± 0.0
0.684CysPro: 0.684 ± 0.0
0.342CysGln: 0.342 ± 0.0
1.026CysArg: 1.026 ± 0.0
2.053CysSer: 2.053 ± 0.0
1.368CysThr: 1.368 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.342CysTrp: 0.342 ± 0.0
1.368CysTyr: 1.368 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.026AspAla: 1.026 ± 0.0
0.342AspCys: 0.342 ± 0.0
3.421AspAsp: 3.421 ± 0.0
4.105AspGlu: 4.105 ± 0.0
2.737AspPhe: 2.737 ± 0.0
1.368AspGly: 1.368 ± 0.0
0.684AspHis: 0.684 ± 0.0
3.079AspIle: 3.079 ± 0.0
3.421AspLys: 3.421 ± 0.0
4.105AspLeu: 4.105 ± 0.0
1.368AspMet: 1.368 ± 0.0
1.368AspAsn: 1.368 ± 0.0
2.737AspPro: 2.737 ± 0.0
4.447AspGln: 4.447 ± 0.0
2.395AspArg: 2.395 ± 0.0
2.395AspSer: 2.395 ± 0.0
3.421AspThr: 3.421 ± 0.0
5.132AspVal: 5.132 ± 0.0
2.737AspTrp: 2.737 ± 0.0
2.395AspTyr: 2.395 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.421GluAla: 3.421 ± 0.0
1.368GluCys: 1.368 ± 0.0
3.421GluAsp: 3.421 ± 0.0
1.711GluGlu: 1.711 ± 0.0
2.737GluPhe: 2.737 ± 0.0
4.105GluGly: 4.105 ± 0.0
0.684GluHis: 0.684 ± 0.0
3.763GluIle: 3.763 ± 0.0
6.5GluLys: 6.5 ± 0.0
4.79GluLeu: 4.79 ± 0.0
1.368GluMet: 1.368 ± 0.0
2.737GluAsn: 2.737 ± 0.0
1.368GluPro: 1.368 ± 0.0
3.421GluGln: 3.421 ± 0.0
2.053GluArg: 2.053 ± 0.0
4.447GluSer: 4.447 ± 0.0
3.763GluThr: 3.763 ± 0.0
4.447GluVal: 4.447 ± 0.0
1.368GluTrp: 1.368 ± 0.0
3.421GluTyr: 3.421 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.026PheAla: 1.026 ± 0.0
0.342PheCys: 0.342 ± 0.0
3.421PheAsp: 3.421 ± 0.0
3.763PheGlu: 3.763 ± 0.0
2.053PhePhe: 2.053 ± 0.0
2.395PheGly: 2.395 ± 0.0
0.342PheHis: 0.342 ± 0.0
2.737PheIle: 2.737 ± 0.0
2.395PheLys: 2.395 ± 0.0
2.737PheLeu: 2.737 ± 0.0
1.026PheMet: 1.026 ± 0.0
1.368PheAsn: 1.368 ± 0.0
1.026PhePro: 1.026 ± 0.0
0.684PheGln: 0.684 ± 0.0
1.711PheArg: 1.711 ± 0.0
2.053PheSer: 2.053 ± 0.0
2.053PheThr: 2.053 ± 0.0
3.079PheVal: 3.079 ± 0.0
1.368PheTrp: 1.368 ± 0.0
1.711PheTyr: 1.711 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.79GlyAla: 4.79 ± 0.0
0.342GlyCys: 0.342 ± 0.0
3.079GlyAsp: 3.079 ± 0.0
4.447GlyGlu: 4.447 ± 0.0
2.395GlyPhe: 2.395 ± 0.0
3.079GlyGly: 3.079 ± 0.0
1.026GlyHis: 1.026 ± 0.0
4.447GlyIle: 4.447 ± 0.0
2.395GlyLys: 2.395 ± 0.0
4.79GlyLeu: 4.79 ± 0.0
1.711GlyMet: 1.711 ± 0.0
2.737GlyAsn: 2.737 ± 0.0
2.395GlyPro: 2.395 ± 0.0
1.368GlyGln: 1.368 ± 0.0
2.053GlyArg: 2.053 ± 0.0
2.395GlySer: 2.395 ± 0.0
5.474GlyThr: 5.474 ± 0.0
7.184GlyVal: 7.184 ± 0.0
0.342GlyTrp: 0.342 ± 0.0
2.737GlyTyr: 2.737 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.342HisAla: 0.342 ± 0.0
0.342HisCys: 0.342 ± 0.0
0.684HisAsp: 0.684 ± 0.0
0.684HisGlu: 0.684 ± 0.0
1.368HisPhe: 1.368 ± 0.0
2.053HisGly: 2.053 ± 0.0
0.684HisHis: 0.684 ± 0.0
2.053HisIle: 2.053 ± 0.0
0.684HisLys: 0.684 ± 0.0
1.368HisLeu: 1.368 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.342HisAsn: 0.342 ± 0.0
1.368HisPro: 1.368 ± 0.0
0.684HisGln: 0.684 ± 0.0
1.026HisArg: 1.026 ± 0.0
1.711HisSer: 1.711 ± 0.0
1.368HisThr: 1.368 ± 0.0
0.684HisVal: 0.684 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.368HisTyr: 1.368 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.79IleAla: 4.79 ± 0.0
1.711IleCys: 1.711 ± 0.0
3.079IleAsp: 3.079 ± 0.0
4.79IleGlu: 4.79 ± 0.0
2.053IlePhe: 2.053 ± 0.0
5.132IleGly: 5.132 ± 0.0
0.684IleHis: 0.684 ± 0.0
4.447IleIle: 4.447 ± 0.0
2.737IleLys: 2.737 ± 0.0
3.763IleLeu: 3.763 ± 0.0
4.105IleMet: 4.105 ± 0.0
4.447IleAsn: 4.447 ± 0.0
4.447IlePro: 4.447 ± 0.0
2.395IleGln: 2.395 ± 0.0
4.447IleArg: 4.447 ± 0.0
2.737IleSer: 2.737 ± 0.0
4.447IleThr: 4.447 ± 0.0
4.447IleVal: 4.447 ± 0.0
1.026IleTrp: 1.026 ± 0.0
2.737IleTyr: 2.737 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
5.474LysAla: 5.474 ± 0.0
0.684LysCys: 0.684 ± 0.0
3.763LysAsp: 3.763 ± 0.0
3.421LysGlu: 3.421 ± 0.0
3.079LysPhe: 3.079 ± 0.0
2.395LysGly: 2.395 ± 0.0
1.368LysHis: 1.368 ± 0.0
5.132LysIle: 5.132 ± 0.0
4.105LysLys: 4.105 ± 0.0
4.105LysLeu: 4.105 ± 0.0
1.711LysMet: 1.711 ± 0.0
2.737LysAsn: 2.737 ± 0.0
1.368LysPro: 1.368 ± 0.0
3.079LysGln: 3.079 ± 0.0
1.026LysArg: 1.026 ± 0.0
4.447LysSer: 4.447 ± 0.0
2.395LysThr: 2.395 ± 0.0
5.474LysVal: 5.474 ± 0.0
1.711LysTrp: 1.711 ± 0.0
2.053LysTyr: 2.053 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.158LeuAla: 6.158 ± 0.0
1.368LeuCys: 1.368 ± 0.0
5.132LeuAsp: 5.132 ± 0.0
4.79LeuGlu: 4.79 ± 0.0
1.711LeuPhe: 1.711 ± 0.0
3.763LeuGly: 3.763 ± 0.0
1.368LeuHis: 1.368 ± 0.0
5.816LeuIle: 5.816 ± 0.0
5.132LeuLys: 5.132 ± 0.0
7.869LeuLeu: 7.869 ± 0.0
1.711LeuMet: 1.711 ± 0.0
2.737LeuAsn: 2.737 ± 0.0
2.737LeuPro: 2.737 ± 0.0
2.053LeuGln: 2.053 ± 0.0
6.158LeuArg: 6.158 ± 0.0
5.816LeuSer: 5.816 ± 0.0
6.5LeuThr: 6.5 ± 0.0
5.132LeuVal: 5.132 ± 0.0
0.342LeuTrp: 0.342 ± 0.0
2.395LeuTyr: 2.395 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.053MetAla: 2.053 ± 0.0
1.026MetCys: 1.026 ± 0.0
3.421MetAsp: 3.421 ± 0.0
1.368MetGlu: 1.368 ± 0.0
0.684MetPhe: 0.684 ± 0.0
1.026MetGly: 1.026 ± 0.0
1.711MetHis: 1.711 ± 0.0
2.395MetIle: 2.395 ± 0.0
1.711MetLys: 1.711 ± 0.0
0.342MetLeu: 0.342 ± 0.0
0.684MetMet: 0.684 ± 0.0
1.026MetAsn: 1.026 ± 0.0
1.368MetPro: 1.368 ± 0.0
0.0MetGln: 0.0 ± 0.0
1.026MetArg: 1.026 ± 0.0
1.711MetSer: 1.711 ± 0.0
1.711MetThr: 1.711 ± 0.0
0.684MetVal: 0.684 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.711MetTyr: 1.711 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.763AsnAla: 3.763 ± 0.0
0.342AsnCys: 0.342 ± 0.0
1.711AsnAsp: 1.711 ± 0.0
3.421AsnGlu: 3.421 ± 0.0
3.079AsnPhe: 3.079 ± 0.0
2.395AsnGly: 2.395 ± 0.0
1.368AsnHis: 1.368 ± 0.0
2.737AsnIle: 2.737 ± 0.0
2.737AsnLys: 2.737 ± 0.0
2.053AsnLeu: 2.053 ± 0.0
1.026AsnMet: 1.026 ± 0.0
4.79AsnAsn: 4.79 ± 0.0
3.763AsnPro: 3.763 ± 0.0
1.711AsnGln: 1.711 ± 0.0
2.395AsnArg: 2.395 ± 0.0
1.368AsnSer: 1.368 ± 0.0
4.447AsnThr: 4.447 ± 0.0
2.053AsnVal: 2.053 ± 0.0
0.684AsnTrp: 0.684 ± 0.0
3.763AsnTyr: 3.763 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.395ProAla: 2.395 ± 0.0
1.026ProCys: 1.026 ± 0.0
0.0ProAsp: 0.0 ± 0.0
3.763ProGlu: 3.763 ± 0.0
1.368ProPhe: 1.368 ± 0.0
0.684ProGly: 0.684 ± 0.0
2.053ProHis: 2.053 ± 0.0
3.421ProIle: 3.421 ± 0.0
1.711ProLys: 1.711 ± 0.0
4.79ProLeu: 4.79 ± 0.0
0.684ProMet: 0.684 ± 0.0
0.342ProAsn: 0.342 ± 0.0
1.368ProPro: 1.368 ± 0.0
1.711ProGln: 1.711 ± 0.0
3.421ProArg: 3.421 ± 0.0
2.395ProSer: 2.395 ± 0.0
6.158ProThr: 6.158 ± 0.0
3.421ProVal: 3.421 ± 0.0
0.684ProTrp: 0.684 ± 0.0
2.053ProTyr: 2.053 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.079GlnAla: 3.079 ± 0.0
0.342GlnCys: 0.342 ± 0.0
3.421GlnAsp: 3.421 ± 0.0
1.368GlnGlu: 1.368 ± 0.0
1.711GlnPhe: 1.711 ± 0.0
2.053GlnGly: 2.053 ± 0.0
1.026GlnHis: 1.026 ± 0.0
3.763GlnIle: 3.763 ± 0.0
1.026GlnLys: 1.026 ± 0.0
4.79GlnLeu: 4.79 ± 0.0
1.711GlnMet: 1.711 ± 0.0
2.395GlnAsn: 2.395 ± 0.0
2.053GlnPro: 2.053 ± 0.0
1.368GlnGln: 1.368 ± 0.0
3.763GlnArg: 3.763 ± 0.0
3.763GlnSer: 3.763 ± 0.0
2.053GlnThr: 2.053 ± 0.0
1.368GlnVal: 1.368 ± 0.0
1.711GlnTrp: 1.711 ± 0.0
1.026GlnTyr: 1.026 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.368ArgAla: 1.368 ± 0.0
1.026ArgCys: 1.026 ± 0.0
2.395ArgAsp: 2.395 ± 0.0
3.079ArgGlu: 3.079 ± 0.0
2.395ArgPhe: 2.395 ± 0.0
2.737ArgGly: 2.737 ± 0.0
1.368ArgHis: 1.368 ± 0.0
2.053ArgIle: 2.053 ± 0.0
3.763ArgLys: 3.763 ± 0.0
2.737ArgLeu: 2.737 ± 0.0
1.026ArgMet: 1.026 ± 0.0
3.763ArgAsn: 3.763 ± 0.0
3.763ArgPro: 3.763 ± 0.0
3.079ArgGln: 3.079 ± 0.0
3.763ArgArg: 3.763 ± 0.0
4.105ArgSer: 4.105 ± 0.0
2.737ArgThr: 2.737 ± 0.0
4.105ArgVal: 4.105 ± 0.0
1.026ArgTrp: 1.026 ± 0.0
2.053ArgTyr: 2.053 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.447SerAla: 4.447 ± 0.0
1.711SerCys: 1.711 ± 0.0
3.421SerAsp: 3.421 ± 0.0
2.395SerGlu: 2.395 ± 0.0
2.053SerPhe: 2.053 ± 0.0
4.447SerGly: 4.447 ± 0.0
0.342SerHis: 0.342 ± 0.0
4.105SerIle: 4.105 ± 0.0
4.105SerLys: 4.105 ± 0.0
7.184SerLeu: 7.184 ± 0.0
1.368SerMet: 1.368 ± 0.0
2.395SerAsn: 2.395 ± 0.0
2.737SerPro: 2.737 ± 0.0
2.395SerGln: 2.395 ± 0.0
2.053SerArg: 2.053 ± 0.0
4.79SerSer: 4.79 ± 0.0
5.474SerThr: 5.474 ± 0.0
4.105SerVal: 4.105 ± 0.0
2.053SerTrp: 2.053 ± 0.0
1.711SerTyr: 1.711 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.447ThrAla: 4.447 ± 0.0
1.026ThrCys: 1.026 ± 0.0
3.421ThrAsp: 3.421 ± 0.0
3.421ThrGlu: 3.421 ± 0.0
1.711ThrPhe: 1.711 ± 0.0
4.105ThrGly: 4.105 ± 0.0
0.684ThrHis: 0.684 ± 0.0
5.132ThrIle: 5.132 ± 0.0
5.132ThrLys: 5.132 ± 0.0
7.184ThrLeu: 7.184 ± 0.0
1.711ThrMet: 1.711 ± 0.0
4.105ThrAsn: 4.105 ± 0.0
2.737ThrPro: 2.737 ± 0.0
4.447ThrGln: 4.447 ± 0.0
2.737ThrArg: 2.737 ± 0.0
5.132ThrSer: 5.132 ± 0.0
6.842ThrThr: 6.842 ± 0.0
5.132ThrVal: 5.132 ± 0.0
0.342ThrTrp: 0.342 ± 0.0
2.053ThrTyr: 2.053 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
5.816ValAla: 5.816 ± 0.0
0.342ValCys: 0.342 ± 0.0
2.737ValAsp: 2.737 ± 0.0
4.447ValGlu: 4.447 ± 0.0
2.737ValPhe: 2.737 ± 0.0
5.132ValGly: 5.132 ± 0.0
0.342ValHis: 0.342 ± 0.0
5.474ValIle: 5.474 ± 0.0
1.368ValLys: 1.368 ± 0.0
4.447ValLeu: 4.447 ± 0.0
1.711ValMet: 1.711 ± 0.0
3.421ValAsn: 3.421 ± 0.0
4.105ValPro: 4.105 ± 0.0
4.105ValGln: 4.105 ± 0.0
4.105ValArg: 4.105 ± 0.0
4.79ValSer: 4.79 ± 0.0
3.421ValThr: 3.421 ± 0.0
5.474ValVal: 5.474 ± 0.0
2.737ValTrp: 2.737 ± 0.0
3.079ValTyr: 3.079 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.684TrpAla: 0.684 ± 0.0
0.342TrpCys: 0.342 ± 0.0
1.368TrpAsp: 1.368 ± 0.0
1.368TrpGlu: 1.368 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.684TrpGly: 0.684 ± 0.0
0.342TrpHis: 0.342 ± 0.0
1.026TrpIle: 1.026 ± 0.0
3.079TrpLys: 3.079 ± 0.0
1.368TrpLeu: 1.368 ± 0.0
1.026TrpMet: 1.026 ± 0.0
1.026TrpAsn: 1.026 ± 0.0
0.342TrpPro: 0.342 ± 0.0
0.342TrpGln: 0.342 ± 0.0
1.368TrpArg: 1.368 ± 0.0
1.711TrpSer: 1.711 ± 0.0
1.711TrpThr: 1.711 ± 0.0
1.711TrpVal: 1.711 ± 0.0
0.684TrpTrp: 0.684 ± 0.0
0.342TrpTyr: 0.342 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.737TyrAla: 2.737 ± 0.0
0.0TyrCys: 0.0 ± 0.0
2.395TyrAsp: 2.395 ± 0.0
4.447TyrGlu: 4.447 ± 0.0
1.368TyrPhe: 1.368 ± 0.0
4.105TyrGly: 4.105 ± 0.0
1.026TyrHis: 1.026 ± 0.0
3.079TyrIle: 3.079 ± 0.0
2.737TyrLys: 2.737 ± 0.0
3.421TyrLeu: 3.421 ± 0.0
0.342TyrMet: 0.342 ± 0.0
1.711TyrAsn: 1.711 ± 0.0
1.368TyrPro: 1.368 ± 0.0
3.079TyrGln: 3.079 ± 0.0
3.079TyrArg: 3.079 ± 0.0
2.053TyrSer: 2.053 ± 0.0
1.711TyrThr: 1.711 ± 0.0
3.079TyrVal: 3.079 ± 0.0
0.342TyrTrp: 0.342 ± 0.0
2.395TyrTyr: 2.395 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2924 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski