Amino acid dipepetide frequency for Hubei picorna-like virus 12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.419AlaAla: 4.419 ± 0.0
0.947AlaCys: 0.947 ± 0.0
4.419AlaAsp: 4.419 ± 0.0
4.735AlaGlu: 4.735 ± 0.0
2.21AlaPhe: 2.21 ± 0.0
4.104AlaGly: 4.104 ± 0.0
1.263AlaHis: 1.263 ± 0.0
3.472AlaIle: 3.472 ± 0.0
1.894AlaLys: 1.894 ± 0.0
5.366AlaLeu: 5.366 ± 0.0
2.21AlaMet: 2.21 ± 0.0
2.841AlaAsn: 2.841 ± 0.0
2.525AlaPro: 2.525 ± 0.0
4.735AlaGln: 4.735 ± 0.0
5.051AlaArg: 5.051 ± 0.0
4.419AlaSer: 4.419 ± 0.0
5.366AlaThr: 5.366 ± 0.0
6.313AlaVal: 6.313 ± 0.0
0.631AlaTrp: 0.631 ± 0.0
3.472AlaTyr: 3.472 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.263CysAla: 1.263 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.947CysAsp: 0.947 ± 0.0
1.263CysGlu: 1.263 ± 0.0
0.631CysPhe: 0.631 ± 0.0
0.631CysGly: 0.631 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.316CysIle: 0.316 ± 0.0
0.947CysLys: 0.947 ± 0.0
2.21CysLeu: 2.21 ± 0.0
0.316CysMet: 0.316 ± 0.0
0.316CysAsn: 0.316 ± 0.0
2.525CysPro: 2.525 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.631CysArg: 0.631 ± 0.0
0.631CysSer: 0.631 ± 0.0
1.263CysThr: 1.263 ± 0.0
0.631CysVal: 0.631 ± 0.0
0.631CysTrp: 0.631 ± 0.0
0.631CysTyr: 0.631 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.735AspAla: 4.735 ± 0.0
0.947AspCys: 0.947 ± 0.0
4.735AspAsp: 4.735 ± 0.0
4.419AspGlu: 4.419 ± 0.0
4.735AspPhe: 4.735 ± 0.0
0.947AspGly: 0.947 ± 0.0
1.263AspHis: 1.263 ± 0.0
2.525AspIle: 2.525 ± 0.0
2.21AspLys: 2.21 ± 0.0
6.313AspLeu: 6.313 ± 0.0
2.525AspMet: 2.525 ± 0.0
1.578AspAsn: 1.578 ± 0.0
2.841AspPro: 2.841 ± 0.0
1.894AspGln: 1.894 ± 0.0
3.157AspArg: 3.157 ± 0.0
4.419AspSer: 4.419 ± 0.0
2.841AspThr: 2.841 ± 0.0
5.682AspVal: 5.682 ± 0.0
1.894AspTrp: 1.894 ± 0.0
2.841AspTyr: 2.841 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.735GluAla: 4.735 ± 0.0
1.263GluCys: 1.263 ± 0.0
2.525GluAsp: 2.525 ± 0.0
4.104GluGlu: 4.104 ± 0.0
2.841GluPhe: 2.841 ± 0.0
3.472GluGly: 3.472 ± 0.0
0.316GluHis: 0.316 ± 0.0
4.419GluIle: 4.419 ± 0.0
4.419GluLys: 4.419 ± 0.0
4.419GluLeu: 4.419 ± 0.0
1.578GluMet: 1.578 ± 0.0
2.841GluAsn: 2.841 ± 0.0
2.525GluPro: 2.525 ± 0.0
2.525GluGln: 2.525 ± 0.0
3.788GluArg: 3.788 ± 0.0
2.525GluSer: 2.525 ± 0.0
1.578GluThr: 1.578 ± 0.0
4.735GluVal: 4.735 ± 0.0
2.525GluTrp: 2.525 ± 0.0
2.525GluTyr: 2.525 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.631PheAla: 0.631 ± 0.0
0.316PheCys: 0.316 ± 0.0
4.104PheAsp: 4.104 ± 0.0
2.525PheGlu: 2.525 ± 0.0
1.263PhePhe: 1.263 ± 0.0
3.472PheGly: 3.472 ± 0.0
1.263PheHis: 1.263 ± 0.0
2.525PheIle: 2.525 ± 0.0
0.631PheLys: 0.631 ± 0.0
4.735PheLeu: 4.735 ± 0.0
0.947PheMet: 0.947 ± 0.0
0.631PheAsn: 0.631 ± 0.0
1.894PhePro: 1.894 ± 0.0
1.578PheGln: 1.578 ± 0.0
3.157PheArg: 3.157 ± 0.0
3.472PheSer: 3.472 ± 0.0
1.578PheThr: 1.578 ± 0.0
5.366PheVal: 5.366 ± 0.0
0.631PheTrp: 0.631 ± 0.0
1.263PheTyr: 1.263 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.104GlyAla: 4.104 ± 0.0
0.0GlyCys: 0.0 ± 0.0
4.419GlyAsp: 4.419 ± 0.0
3.788GlyGlu: 3.788 ± 0.0
2.841GlyPhe: 2.841 ± 0.0
2.841GlyGly: 2.841 ± 0.0
0.947GlyHis: 0.947 ± 0.0
3.472GlyIle: 3.472 ± 0.0
3.157GlyLys: 3.157 ± 0.0
7.576GlyLeu: 7.576 ± 0.0
1.894GlyMet: 1.894 ± 0.0
2.525GlyAsn: 2.525 ± 0.0
4.419GlyPro: 4.419 ± 0.0
3.788GlyGln: 3.788 ± 0.0
3.472GlyArg: 3.472 ± 0.0
5.051GlySer: 5.051 ± 0.0
3.788GlyThr: 3.788 ± 0.0
5.051GlyVal: 5.051 ± 0.0
0.631GlyTrp: 0.631 ± 0.0
1.894GlyTyr: 1.894 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
2.21HisAla: 2.21 ± 0.0
0.316HisCys: 0.316 ± 0.0
0.947HisAsp: 0.947 ± 0.0
1.578HisGlu: 1.578 ± 0.0
0.947HisPhe: 0.947 ± 0.0
2.525HisGly: 2.525 ± 0.0
0.316HisHis: 0.316 ± 0.0
1.263HisIle: 1.263 ± 0.0
0.631HisLys: 0.631 ± 0.0
1.263HisLeu: 1.263 ± 0.0
0.316HisMet: 0.316 ± 0.0
0.631HisAsn: 0.631 ± 0.0
0.631HisPro: 0.631 ± 0.0
0.631HisGln: 0.631 ± 0.0
1.263HisArg: 1.263 ± 0.0
0.947HisSer: 0.947 ± 0.0
0.631HisThr: 0.631 ± 0.0
1.578HisVal: 1.578 ± 0.0
0.947HisTrp: 0.947 ± 0.0
1.263HisTyr: 1.263 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.472IleAla: 3.472 ± 0.0
1.578IleCys: 1.578 ± 0.0
2.841IleAsp: 2.841 ± 0.0
2.21IleGlu: 2.21 ± 0.0
2.21IlePhe: 2.21 ± 0.0
4.104IleGly: 4.104 ± 0.0
1.263IleHis: 1.263 ± 0.0
2.21IleIle: 2.21 ± 0.0
2.525IleLys: 2.525 ± 0.0
4.735IleLeu: 4.735 ± 0.0
1.578IleMet: 1.578 ± 0.0
0.947IleAsn: 0.947 ± 0.0
3.157IlePro: 3.157 ± 0.0
1.578IleGln: 1.578 ± 0.0
2.841IleArg: 2.841 ± 0.0
3.788IleSer: 3.788 ± 0.0
5.051IleThr: 5.051 ± 0.0
3.472IleVal: 3.472 ± 0.0
0.316IleTrp: 0.316 ± 0.0
2.525IleTyr: 2.525 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.21LysAla: 2.21 ± 0.0
1.578LysCys: 1.578 ± 0.0
4.104LysAsp: 4.104 ± 0.0
2.841LysGlu: 2.841 ± 0.0
1.263LysPhe: 1.263 ± 0.0
4.735LysGly: 4.735 ± 0.0
0.947LysHis: 0.947 ± 0.0
1.263LysIle: 1.263 ± 0.0
3.472LysLys: 3.472 ± 0.0
4.104LysLeu: 4.104 ± 0.0
0.631LysMet: 0.631 ± 0.0
1.263LysAsn: 1.263 ± 0.0
2.525LysPro: 2.525 ± 0.0
1.578LysGln: 1.578 ± 0.0
2.841LysArg: 2.841 ± 0.0
3.788LysSer: 3.788 ± 0.0
1.578LysThr: 1.578 ± 0.0
3.472LysVal: 3.472 ± 0.0
0.0LysTrp: 0.0 ± 0.0
0.947LysTyr: 0.947 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
7.891LeuAla: 7.891 ± 0.0
0.947LeuCys: 0.947 ± 0.0
6.313LeuAsp: 6.313 ± 0.0
3.157LeuGlu: 3.157 ± 0.0
3.472LeuPhe: 3.472 ± 0.0
5.051LeuGly: 5.051 ± 0.0
3.157LeuHis: 3.157 ± 0.0
4.104LeuIle: 4.104 ± 0.0
6.313LeuLys: 6.313 ± 0.0
5.366LeuLeu: 5.366 ± 0.0
1.578LeuMet: 1.578 ± 0.0
2.841LeuAsn: 2.841 ± 0.0
5.997LeuPro: 5.997 ± 0.0
2.21LeuGln: 2.21 ± 0.0
3.788LeuArg: 3.788 ± 0.0
7.26LeuSer: 7.26 ± 0.0
3.472LeuThr: 3.472 ± 0.0
7.891LeuVal: 7.891 ± 0.0
1.578LeuTrp: 1.578 ± 0.0
1.263LeuTyr: 1.263 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.894MetAla: 1.894 ± 0.0
0.947MetCys: 0.947 ± 0.0
2.21MetAsp: 2.21 ± 0.0
2.21MetGlu: 2.21 ± 0.0
0.316MetPhe: 0.316 ± 0.0
2.21MetGly: 2.21 ± 0.0
0.316MetHis: 0.316 ± 0.0
2.21MetIle: 2.21 ± 0.0
2.525MetLys: 2.525 ± 0.0
1.263MetLeu: 1.263 ± 0.0
0.631MetMet: 0.631 ± 0.0
0.316MetAsn: 0.316 ± 0.0
1.578MetPro: 1.578 ± 0.0
0.631MetGln: 0.631 ± 0.0
0.947MetArg: 0.947 ± 0.0
2.21MetSer: 2.21 ± 0.0
2.21MetThr: 2.21 ± 0.0
4.419MetVal: 4.419 ± 0.0
0.316MetTrp: 0.316 ± 0.0
0.316MetTyr: 0.316 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.947AsnAla: 0.947 ± 0.0
0.631AsnCys: 0.631 ± 0.0
1.894AsnAsp: 1.894 ± 0.0
1.894AsnGlu: 1.894 ± 0.0
1.578AsnPhe: 1.578 ± 0.0
1.894AsnGly: 1.894 ± 0.0
0.947AsnHis: 0.947 ± 0.0
1.263AsnIle: 1.263 ± 0.0
2.525AsnLys: 2.525 ± 0.0
1.894AsnLeu: 1.894 ± 0.0
0.947AsnMet: 0.947 ± 0.0
0.631AsnAsn: 0.631 ± 0.0
3.472AsnPro: 3.472 ± 0.0
1.263AsnGln: 1.263 ± 0.0
1.578AsnArg: 1.578 ± 0.0
1.894AsnSer: 1.894 ± 0.0
1.894AsnThr: 1.894 ± 0.0
1.263AsnVal: 1.263 ± 0.0
0.316AsnTrp: 0.316 ± 0.0
2.21AsnTyr: 2.21 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.788ProAla: 3.788 ± 0.0
0.316ProCys: 0.316 ± 0.0
5.051ProAsp: 5.051 ± 0.0
3.157ProGlu: 3.157 ± 0.0
3.157ProPhe: 3.157 ± 0.0
5.366ProGly: 5.366 ± 0.0
0.631ProHis: 0.631 ± 0.0
2.21ProIle: 2.21 ± 0.0
1.263ProLys: 1.263 ± 0.0
6.313ProLeu: 6.313 ± 0.0
1.578ProMet: 1.578 ± 0.0
1.578ProAsn: 1.578 ± 0.0
3.788ProPro: 3.788 ± 0.0
2.525ProGln: 2.525 ± 0.0
3.157ProArg: 3.157 ± 0.0
6.313ProSer: 6.313 ± 0.0
3.472ProThr: 3.472 ± 0.0
4.419ProVal: 4.419 ± 0.0
0.631ProTrp: 0.631 ± 0.0
2.21ProTyr: 2.21 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.525GlnAla: 2.525 ± 0.0
0.316GlnCys: 0.316 ± 0.0
2.21GlnAsp: 2.21 ± 0.0
2.841GlnGlu: 2.841 ± 0.0
0.947GlnPhe: 0.947 ± 0.0
3.472GlnGly: 3.472 ± 0.0
0.631GlnHis: 0.631 ± 0.0
2.841GlnIle: 2.841 ± 0.0
1.578GlnLys: 1.578 ± 0.0
2.525GlnLeu: 2.525 ± 0.0
1.263GlnMet: 1.263 ± 0.0
2.841GlnAsn: 2.841 ± 0.0
3.157GlnPro: 3.157 ± 0.0
1.894GlnGln: 1.894 ± 0.0
2.841GlnArg: 2.841 ± 0.0
1.263GlnSer: 1.263 ± 0.0
1.263GlnThr: 1.263 ± 0.0
2.525GlnVal: 2.525 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
1.578GlnTyr: 1.578 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.525ArgAla: 2.525 ± 0.0
0.631ArgCys: 0.631 ± 0.0
2.21ArgAsp: 2.21 ± 0.0
3.157ArgGlu: 3.157 ± 0.0
2.841ArgPhe: 2.841 ± 0.0
1.578ArgGly: 1.578 ± 0.0
0.947ArgHis: 0.947 ± 0.0
3.788ArgIle: 3.788 ± 0.0
2.525ArgLys: 2.525 ± 0.0
6.313ArgLeu: 6.313 ± 0.0
1.578ArgMet: 1.578 ± 0.0
1.578ArgAsn: 1.578 ± 0.0
2.841ArgPro: 2.841 ± 0.0
1.894ArgGln: 1.894 ± 0.0
4.735ArgArg: 4.735 ± 0.0
4.104ArgSer: 4.104 ± 0.0
2.525ArgThr: 2.525 ± 0.0
4.735ArgVal: 4.735 ± 0.0
0.947ArgTrp: 0.947 ± 0.0
3.157ArgTyr: 3.157 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
5.366SerAla: 5.366 ± 0.0
1.578SerCys: 1.578 ± 0.0
4.419SerAsp: 4.419 ± 0.0
4.104SerGlu: 4.104 ± 0.0
3.788SerPhe: 3.788 ± 0.0
7.576SerGly: 7.576 ± 0.0
1.578SerHis: 1.578 ± 0.0
2.525SerIle: 2.525 ± 0.0
1.578SerLys: 1.578 ± 0.0
3.472SerLeu: 3.472 ± 0.0
2.841SerMet: 2.841 ± 0.0
1.894SerAsn: 1.894 ± 0.0
3.472SerPro: 3.472 ± 0.0
1.894SerGln: 1.894 ± 0.0
1.894SerArg: 1.894 ± 0.0
5.051SerSer: 5.051 ± 0.0
5.997SerThr: 5.997 ± 0.0
5.997SerVal: 5.997 ± 0.0
1.263SerTrp: 1.263 ± 0.0
3.157SerTyr: 3.157 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.997ThrAla: 5.997 ± 0.0
1.263ThrCys: 1.263 ± 0.0
2.21ThrAsp: 2.21 ± 0.0
3.157ThrGlu: 3.157 ± 0.0
2.21ThrPhe: 2.21 ± 0.0
3.157ThrGly: 3.157 ± 0.0
0.316ThrHis: 0.316 ± 0.0
3.157ThrIle: 3.157 ± 0.0
1.894ThrLys: 1.894 ± 0.0
5.051ThrLeu: 5.051 ± 0.0
2.525ThrMet: 2.525 ± 0.0
0.947ThrAsn: 0.947 ± 0.0
2.21ThrPro: 2.21 ± 0.0
2.841ThrGln: 2.841 ± 0.0
2.21ThrArg: 2.21 ± 0.0
3.157ThrSer: 3.157 ± 0.0
5.682ThrThr: 5.682 ± 0.0
5.366ThrVal: 5.366 ± 0.0
0.0ThrTrp: 0.0 ± 0.0
3.157ThrTyr: 3.157 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
7.576ValAla: 7.576 ± 0.0
1.263ValCys: 1.263 ± 0.0
2.841ValAsp: 2.841 ± 0.0
5.997ValGlu: 5.997 ± 0.0
1.894ValPhe: 1.894 ± 0.0
5.682ValGly: 5.682 ± 0.0
1.263ValHis: 1.263 ± 0.0
6.629ValIle: 6.629 ± 0.0
2.841ValLys: 2.841 ± 0.0
5.997ValLeu: 5.997 ± 0.0
3.157ValMet: 3.157 ± 0.0
4.104ValAsn: 4.104 ± 0.0
8.207ValPro: 8.207 ± 0.0
3.157ValGln: 3.157 ± 0.0
4.104ValArg: 4.104 ± 0.0
5.682ValSer: 5.682 ± 0.0
2.525ValThr: 2.525 ± 0.0
9.154ValVal: 9.154 ± 0.0
1.578ValTrp: 1.578 ± 0.0
3.157ValTyr: 3.157 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.578TrpAla: 1.578 ± 0.0
0.316TrpCys: 0.316 ± 0.0
0.947TrpAsp: 0.947 ± 0.0
0.631TrpGlu: 0.631 ± 0.0
0.316TrpPhe: 0.316 ± 0.0
0.631TrpGly: 0.631 ± 0.0
0.631TrpHis: 0.631 ± 0.0
0.631TrpIle: 0.631 ± 0.0
0.316TrpLys: 0.316 ± 0.0
2.21TrpLeu: 2.21 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.631TrpPro: 0.631 ± 0.0
0.316TrpGln: 0.316 ± 0.0
1.578TrpArg: 1.578 ± 0.0
2.525TrpSer: 2.525 ± 0.0
0.631TrpThr: 0.631 ± 0.0
1.263TrpVal: 1.263 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.525TyrAla: 2.525 ± 0.0
0.631TyrCys: 0.631 ± 0.0
3.157TyrAsp: 3.157 ± 0.0
1.894TyrGlu: 1.894 ± 0.0
2.525TyrPhe: 2.525 ± 0.0
2.21TyrGly: 2.21 ± 0.0
2.841TyrHis: 2.841 ± 0.0
1.578TyrIle: 1.578 ± 0.0
1.894TyrLys: 1.894 ± 0.0
2.841TyrLeu: 2.841 ± 0.0
1.263TyrMet: 1.263 ± 0.0
0.631TyrAsn: 0.631 ± 0.0
2.525TyrPro: 2.525 ± 0.0
1.578TyrGln: 1.578 ± 0.0
1.578TyrArg: 1.578 ± 0.0
0.947TyrSer: 0.947 ± 0.0
3.157TyrThr: 3.157 ± 0.0
3.472TyrVal: 3.472 ± 0.0
0.316TyrTrp: 0.316 ± 0.0
1.894TyrTyr: 1.894 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3169 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski