Amino acid dipepetide frequency for Hubei picorna-like virus 26

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.873AlaAla: 2.873 ± 0.0
0.359AlaCys: 0.359 ± 0.0
1.795AlaAsp: 1.795 ± 0.0
2.873AlaGlu: 2.873 ± 0.0
2.873AlaPhe: 2.873 ± 0.0
1.795AlaGly: 1.795 ± 0.0
0.718AlaHis: 0.718 ± 0.0
4.668AlaIle: 4.668 ± 0.0
2.513AlaLys: 2.513 ± 0.0
2.513AlaLeu: 2.513 ± 0.0
1.077AlaMet: 1.077 ± 0.0
3.95AlaAsn: 3.95 ± 0.0
1.795AlaPro: 1.795 ± 0.0
1.436AlaGln: 1.436 ± 0.0
1.795AlaArg: 1.795 ± 0.0
6.463AlaSer: 6.463 ± 0.0
4.309AlaThr: 4.309 ± 0.0
3.591AlaVal: 3.591 ± 0.0
2.154AlaTrp: 2.154 ± 0.0
1.436AlaTyr: 1.436 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
2.154CysAla: 2.154 ± 0.0
0.0CysCys: 0.0 ± 0.0
2.154CysAsp: 2.154 ± 0.0
0.718CysGlu: 0.718 ± 0.0
1.436CysPhe: 1.436 ± 0.0
2.513CysGly: 2.513 ± 0.0
0.359CysHis: 0.359 ± 0.0
1.436CysIle: 1.436 ± 0.0
0.718CysLys: 0.718 ± 0.0
3.232CysLeu: 3.232 ± 0.0
1.436CysMet: 1.436 ± 0.0
1.436CysAsn: 1.436 ± 0.0
0.718CysPro: 0.718 ± 0.0
0.718CysGln: 0.718 ± 0.0
1.077CysArg: 1.077 ± 0.0
1.436CysSer: 1.436 ± 0.0
1.077CysThr: 1.077 ± 0.0
1.077CysVal: 1.077 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.077CysTyr: 1.077 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.591AspAla: 3.591 ± 0.0
2.513AspCys: 2.513 ± 0.0
2.154AspAsp: 2.154 ± 0.0
3.591AspGlu: 3.591 ± 0.0
2.513AspPhe: 2.513 ± 0.0
2.154AspGly: 2.154 ± 0.0
1.436AspHis: 1.436 ± 0.0
4.668AspIle: 4.668 ± 0.0
5.027AspLys: 5.027 ± 0.0
4.668AspLeu: 4.668 ± 0.0
1.436AspMet: 1.436 ± 0.0
3.232AspAsn: 3.232 ± 0.0
2.513AspPro: 2.513 ± 0.0
0.0AspGln: 0.0 ± 0.0
3.232AspArg: 3.232 ± 0.0
1.795AspSer: 1.795 ± 0.0
2.154AspThr: 2.154 ± 0.0
5.027AspVal: 5.027 ± 0.0
0.359AspTrp: 0.359 ± 0.0
1.436AspTyr: 1.436 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
5.027GluAla: 5.027 ± 0.0
2.154GluCys: 2.154 ± 0.0
2.154GluAsp: 2.154 ± 0.0
3.591GluGlu: 3.591 ± 0.0
4.309GluPhe: 4.309 ± 0.0
1.436GluGly: 1.436 ± 0.0
1.077GluHis: 1.077 ± 0.0
2.154GluIle: 2.154 ± 0.0
2.513GluLys: 2.513 ± 0.0
6.104GluLeu: 6.104 ± 0.0
2.154GluMet: 2.154 ± 0.0
2.873GluAsn: 2.873 ± 0.0
1.077GluPro: 1.077 ± 0.0
2.154GluGln: 2.154 ± 0.0
3.591GluArg: 3.591 ± 0.0
5.386GluSer: 5.386 ± 0.0
2.154GluThr: 2.154 ± 0.0
4.668GluVal: 4.668 ± 0.0
2.154GluTrp: 2.154 ± 0.0
2.873GluTyr: 2.873 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.718PheAla: 0.718 ± 0.0
0.359PheCys: 0.359 ± 0.0
4.309PheAsp: 4.309 ± 0.0
5.027PheGlu: 5.027 ± 0.0
1.436PhePhe: 1.436 ± 0.0
2.873PheGly: 2.873 ± 0.0
0.718PheHis: 0.718 ± 0.0
3.591PheIle: 3.591 ± 0.0
3.232PheLys: 3.232 ± 0.0
5.027PheLeu: 5.027 ± 0.0
1.436PheMet: 1.436 ± 0.0
3.591PheAsn: 3.591 ± 0.0
2.513PhePro: 2.513 ± 0.0
2.513PheGln: 2.513 ± 0.0
3.232PheArg: 3.232 ± 0.0
2.154PheSer: 2.154 ± 0.0
2.513PheThr: 2.513 ± 0.0
1.795PheVal: 1.795 ± 0.0
1.436PheTrp: 1.436 ± 0.0
3.95PheTyr: 3.95 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.513GlyAla: 2.513 ± 0.0
1.436GlyCys: 1.436 ± 0.0
2.154GlyAsp: 2.154 ± 0.0
3.232GlyGlu: 3.232 ± 0.0
2.873GlyPhe: 2.873 ± 0.0
0.718GlyGly: 0.718 ± 0.0
0.718GlyHis: 0.718 ± 0.0
2.873GlyIle: 2.873 ± 0.0
4.668GlyLys: 4.668 ± 0.0
3.591GlyLeu: 3.591 ± 0.0
1.795GlyMet: 1.795 ± 0.0
3.591GlyAsn: 3.591 ± 0.0
2.873GlyPro: 2.873 ± 0.0
0.718GlyGln: 0.718 ± 0.0
1.795GlyArg: 1.795 ± 0.0
4.668GlySer: 4.668 ± 0.0
2.513GlyThr: 2.513 ± 0.0
5.027GlyVal: 5.027 ± 0.0
0.718GlyTrp: 0.718 ± 0.0
1.795GlyTyr: 1.795 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.436HisAla: 1.436 ± 0.0
1.077HisCys: 1.077 ± 0.0
1.077HisAsp: 1.077 ± 0.0
0.359HisGlu: 0.359 ± 0.0
0.359HisPhe: 0.359 ± 0.0
2.513HisGly: 2.513 ± 0.0
0.718HisHis: 0.718 ± 0.0
1.795HisIle: 1.795 ± 0.0
1.795HisLys: 1.795 ± 0.0
1.436HisLeu: 1.436 ± 0.0
0.359HisMet: 0.359 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.077HisPro: 1.077 ± 0.0
0.718HisGln: 0.718 ± 0.0
2.154HisArg: 2.154 ± 0.0
2.513HisSer: 2.513 ± 0.0
0.359HisThr: 0.359 ± 0.0
2.154HisVal: 2.154 ± 0.0
0.0HisTrp: 0.0 ± 0.0
2.513HisTyr: 2.513 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.309IleAla: 4.309 ± 0.0
0.718IleCys: 0.718 ± 0.0
2.154IleAsp: 2.154 ± 0.0
2.154IleGlu: 2.154 ± 0.0
6.463IlePhe: 6.463 ± 0.0
3.232IleGly: 3.232 ± 0.0
2.513IleHis: 2.513 ± 0.0
2.873IleIle: 2.873 ± 0.0
3.591IleLys: 3.591 ± 0.0
6.104IleLeu: 6.104 ± 0.0
1.795IleMet: 1.795 ± 0.0
2.873IleAsn: 2.873 ± 0.0
6.822IlePro: 6.822 ± 0.0
2.513IleGln: 2.513 ± 0.0
3.591IleArg: 3.591 ± 0.0
5.027IleSer: 5.027 ± 0.0
6.104IleThr: 6.104 ± 0.0
4.668IleVal: 4.668 ± 0.0
0.359IleTrp: 0.359 ± 0.0
1.436IleTyr: 1.436 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.668LysAla: 4.668 ± 0.0
2.154LysCys: 2.154 ± 0.0
4.668LysAsp: 4.668 ± 0.0
2.873LysGlu: 2.873 ± 0.0
4.309LysPhe: 4.309 ± 0.0
3.591LysGly: 3.591 ± 0.0
2.513LysHis: 2.513 ± 0.0
5.745LysIle: 5.745 ± 0.0
3.232LysLys: 3.232 ± 0.0
6.822LysLeu: 6.822 ± 0.0
1.077LysMet: 1.077 ± 0.0
3.591LysAsn: 3.591 ± 0.0
2.154LysPro: 2.154 ± 0.0
2.873LysGln: 2.873 ± 0.0
2.513LysArg: 2.513 ± 0.0
2.873LysSer: 2.873 ± 0.0
2.873LysThr: 2.873 ± 0.0
2.154LysVal: 2.154 ± 0.0
0.359LysTrp: 0.359 ± 0.0
3.232LysTyr: 3.232 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
1.795LeuAla: 1.795 ± 0.0
2.154LeuCys: 2.154 ± 0.0
4.668LeuAsp: 4.668 ± 0.0
5.745LeuGlu: 5.745 ± 0.0
5.386LeuPhe: 5.386 ± 0.0
2.513LeuGly: 2.513 ± 0.0
1.077LeuHis: 1.077 ± 0.0
5.386LeuIle: 5.386 ± 0.0
9.336LeuLys: 9.336 ± 0.0
8.259LeuLeu: 8.259 ± 0.0
0.718LeuMet: 0.718 ± 0.0
2.513LeuAsn: 2.513 ± 0.0
3.95LeuPro: 3.95 ± 0.0
3.95LeuGln: 3.95 ± 0.0
3.95LeuArg: 3.95 ± 0.0
7.54LeuSer: 7.54 ± 0.0
5.386LeuThr: 5.386 ± 0.0
5.745LeuVal: 5.745 ± 0.0
1.077LeuTrp: 1.077 ± 0.0
5.027LeuTyr: 5.027 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.077MetAla: 1.077 ± 0.0
1.077MetCys: 1.077 ± 0.0
2.154MetAsp: 2.154 ± 0.0
3.232MetGlu: 3.232 ± 0.0
0.718MetPhe: 0.718 ± 0.0
2.513MetGly: 2.513 ± 0.0
1.795MetHis: 1.795 ± 0.0
1.077MetIle: 1.077 ± 0.0
1.795MetLys: 1.795 ± 0.0
1.436MetLeu: 1.436 ± 0.0
0.359MetMet: 0.359 ± 0.0
2.873MetAsn: 2.873 ± 0.0
0.359MetPro: 0.359 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.718MetArg: 0.718 ± 0.0
1.795MetSer: 1.795 ± 0.0
1.077MetThr: 1.077 ± 0.0
1.795MetVal: 1.795 ± 0.0
0.718MetTrp: 0.718 ± 0.0
0.359MetTyr: 0.359 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.232AsnAla: 3.232 ± 0.0
1.077AsnCys: 1.077 ± 0.0
2.513AsnAsp: 2.513 ± 0.0
3.591AsnGlu: 3.591 ± 0.0
1.077AsnPhe: 1.077 ± 0.0
1.436AsnGly: 1.436 ± 0.0
1.795AsnHis: 1.795 ± 0.0
7.54AsnIle: 7.54 ± 0.0
3.232AsnLys: 3.232 ± 0.0
6.104AsnLeu: 6.104 ± 0.0
1.436AsnMet: 1.436 ± 0.0
2.873AsnAsn: 2.873 ± 0.0
3.591AsnPro: 3.591 ± 0.0
0.718AsnGln: 0.718 ± 0.0
0.718AsnArg: 0.718 ± 0.0
5.745AsnSer: 5.745 ± 0.0
1.436AsnThr: 1.436 ± 0.0
5.027AsnVal: 5.027 ± 0.0
1.077AsnTrp: 1.077 ± 0.0
3.232AsnTyr: 3.232 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.077ProAla: 1.077 ± 0.0
1.436ProCys: 1.436 ± 0.0
3.232ProAsp: 3.232 ± 0.0
4.309ProGlu: 4.309 ± 0.0
2.513ProPhe: 2.513 ± 0.0
1.436ProGly: 1.436 ± 0.0
2.154ProHis: 2.154 ± 0.0
2.513ProIle: 2.513 ± 0.0
2.154ProLys: 2.154 ± 0.0
6.463ProLeu: 6.463 ± 0.0
1.436ProMet: 1.436 ± 0.0
2.154ProAsn: 2.154 ± 0.0
2.513ProPro: 2.513 ± 0.0
1.077ProGln: 1.077 ± 0.0
1.436ProArg: 1.436 ± 0.0
3.591ProSer: 3.591 ± 0.0
1.436ProThr: 1.436 ± 0.0
2.873ProVal: 2.873 ± 0.0
0.359ProTrp: 0.359 ± 0.0
1.436ProTyr: 1.436 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.795GlnAla: 1.795 ± 0.0
0.718GlnCys: 0.718 ± 0.0
0.718GlnAsp: 0.718 ± 0.0
2.873GlnGlu: 2.873 ± 0.0
3.591GlnPhe: 3.591 ± 0.0
1.795GlnGly: 1.795 ± 0.0
0.718GlnHis: 0.718 ± 0.0
3.232GlnIle: 3.232 ± 0.0
1.077GlnLys: 1.077 ± 0.0
3.591GlnLeu: 3.591 ± 0.0
1.436GlnMet: 1.436 ± 0.0
2.513GlnAsn: 2.513 ± 0.0
1.077GlnPro: 1.077 ± 0.0
0.0GlnGln: 0.0 ± 0.0
1.795GlnArg: 1.795 ± 0.0
2.154GlnSer: 2.154 ± 0.0
1.795GlnThr: 1.795 ± 0.0
0.359GlnVal: 0.359 ± 0.0
0.359GlnTrp: 0.359 ± 0.0
2.154GlnTyr: 2.154 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.795ArgAla: 1.795 ± 0.0
1.436ArgCys: 1.436 ± 0.0
2.873ArgAsp: 2.873 ± 0.0
1.436ArgGlu: 1.436 ± 0.0
2.513ArgPhe: 2.513 ± 0.0
2.513ArgGly: 2.513 ± 0.0
1.077ArgHis: 1.077 ± 0.0
3.232ArgIle: 3.232 ± 0.0
3.591ArgLys: 3.591 ± 0.0
2.154ArgLeu: 2.154 ± 0.0
1.436ArgMet: 1.436 ± 0.0
2.873ArgAsn: 2.873 ± 0.0
1.077ArgPro: 1.077 ± 0.0
1.436ArgGln: 1.436 ± 0.0
2.154ArgArg: 2.154 ± 0.0
1.795ArgSer: 1.795 ± 0.0
2.513ArgThr: 2.513 ± 0.0
2.873ArgVal: 2.873 ± 0.0
0.718ArgTrp: 0.718 ± 0.0
1.436ArgTyr: 1.436 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.309SerAla: 4.309 ± 0.0
1.436SerCys: 1.436 ± 0.0
3.232SerAsp: 3.232 ± 0.0
4.309SerGlu: 4.309 ± 0.0
2.513SerPhe: 2.513 ± 0.0
5.745SerGly: 5.745 ± 0.0
0.718SerHis: 0.718 ± 0.0
5.745SerIle: 5.745 ± 0.0
6.463SerLys: 6.463 ± 0.0
5.745SerLeu: 5.745 ± 0.0
2.154SerMet: 2.154 ± 0.0
3.232SerAsn: 3.232 ± 0.0
3.591SerPro: 3.591 ± 0.0
2.513SerGln: 2.513 ± 0.0
2.873SerArg: 2.873 ± 0.0
5.386SerSer: 5.386 ± 0.0
4.309SerThr: 4.309 ± 0.0
5.386SerVal: 5.386 ± 0.0
0.359SerTrp: 0.359 ± 0.0
4.309SerTyr: 4.309 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
2.513ThrAla: 2.513 ± 0.0
2.513ThrCys: 2.513 ± 0.0
4.309ThrAsp: 4.309 ± 0.0
2.873ThrGlu: 2.873 ± 0.0
2.154ThrPhe: 2.154 ± 0.0
3.232ThrGly: 3.232 ± 0.0
1.436ThrHis: 1.436 ± 0.0
3.95ThrIle: 3.95 ± 0.0
2.513ThrLys: 2.513 ± 0.0
2.873ThrLeu: 2.873 ± 0.0
0.718ThrMet: 0.718 ± 0.0
5.745ThrAsn: 5.745 ± 0.0
2.513ThrPro: 2.513 ± 0.0
2.513ThrGln: 2.513 ± 0.0
1.077ThrArg: 1.077 ± 0.0
4.309ThrSer: 4.309 ± 0.0
5.386ThrThr: 5.386 ± 0.0
2.873ThrVal: 2.873 ± 0.0
0.359ThrTrp: 0.359 ± 0.0
1.795ThrTyr: 1.795 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.309ValAla: 4.309 ± 0.0
1.795ValCys: 1.795 ± 0.0
4.668ValAsp: 4.668 ± 0.0
3.232ValGlu: 3.232 ± 0.0
3.95ValPhe: 3.95 ± 0.0
4.309ValGly: 4.309 ± 0.0
1.077ValHis: 1.077 ± 0.0
2.873ValIle: 2.873 ± 0.0
2.873ValLys: 2.873 ± 0.0
6.822ValLeu: 6.822 ± 0.0
1.436ValMet: 1.436 ± 0.0
2.873ValAsn: 2.873 ± 0.0
3.95ValPro: 3.95 ± 0.0
3.95ValGln: 3.95 ± 0.0
1.436ValArg: 1.436 ± 0.0
4.668ValSer: 4.668 ± 0.0
2.873ValThr: 2.873 ± 0.0
6.463ValVal: 6.463 ± 0.0
0.718ValTrp: 0.718 ± 0.0
3.95ValTyr: 3.95 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.359TrpCys: 0.359 ± 0.0
1.077TrpAsp: 1.077 ± 0.0
0.718TrpGlu: 0.718 ± 0.0
0.718TrpPhe: 0.718 ± 0.0
0.718TrpGly: 0.718 ± 0.0
0.359TrpHis: 0.359 ± 0.0
1.436TrpIle: 1.436 ± 0.0
1.077TrpLys: 1.077 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.718TrpMet: 0.718 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.718TrpPro: 0.718 ± 0.0
0.359TrpGln: 0.359 ± 0.0
0.359TrpArg: 0.359 ± 0.0
2.154TrpSer: 2.154 ± 0.0
1.436TrpThr: 1.436 ± 0.0
1.077TrpVal: 1.077 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.718TrpTyr: 0.718 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.795TyrAla: 1.795 ± 0.0
0.0TyrCys: 0.0 ± 0.0
1.436TyrAsp: 1.436 ± 0.0
2.873TyrGlu: 2.873 ± 0.0
0.718TyrPhe: 0.718 ± 0.0
3.232TyrGly: 3.232 ± 0.0
1.436TyrHis: 1.436 ± 0.0
2.513TyrIle: 2.513 ± 0.0
2.873TyrLys: 2.873 ± 0.0
3.232TyrLeu: 3.232 ± 0.0
2.154TyrMet: 2.154 ± 0.0
4.668TyrAsn: 4.668 ± 0.0
0.718TyrPro: 0.718 ± 0.0
3.591TyrGln: 3.591 ± 0.0
1.436TyrArg: 1.436 ± 0.0
3.232TyrSer: 3.232 ± 0.0
3.95TyrThr: 3.95 ± 0.0
3.591TyrVal: 3.591 ± 0.0
0.718TyrTrp: 0.718 ± 0.0
3.232TyrTyr: 3.232 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2786 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski