Amino acid dipepetide frequency for Hubei picorna-like virus 9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.491AlaAla: 2.491 ± 0.0
0.356AlaCys: 0.356 ± 0.0
2.491AlaAsp: 2.491 ± 0.0
3.559AlaGlu: 3.559 ± 0.0
3.559AlaPhe: 3.559 ± 0.0
5.338AlaGly: 5.338 ± 0.0
2.135AlaHis: 2.135 ± 0.0
4.27AlaIle: 4.27 ± 0.0
1.779AlaLys: 1.779 ± 0.0
5.694AlaLeu: 5.694 ± 0.0
1.779AlaMet: 1.779 ± 0.0
2.135AlaAsn: 2.135 ± 0.0
5.338AlaPro: 5.338 ± 0.0
3.559AlaGln: 3.559 ± 0.0
2.847AlaArg: 2.847 ± 0.0
3.203AlaSer: 3.203 ± 0.0
3.559AlaThr: 3.559 ± 0.0
3.915AlaVal: 3.915 ± 0.0
0.712AlaTrp: 0.712 ± 0.0
2.135AlaTyr: 2.135 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.356CysAsp: 0.356 ± 0.0
1.423CysGlu: 1.423 ± 0.0
0.712CysPhe: 0.712 ± 0.0
1.423CysGly: 1.423 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.068CysIle: 1.068 ± 0.0
0.712CysLys: 0.712 ± 0.0
1.779CysLeu: 1.779 ± 0.0
0.356CysMet: 0.356 ± 0.0
0.356CysAsn: 0.356 ± 0.0
1.068CysPro: 1.068 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.068CysSer: 1.068 ± 0.0
1.779CysThr: 1.779 ± 0.0
1.068CysVal: 1.068 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.356CysTyr: 0.356 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.203AspAla: 3.203 ± 0.0
1.423AspCys: 1.423 ± 0.0
4.982AspAsp: 4.982 ± 0.0
5.338AspGlu: 5.338 ± 0.0
4.27AspPhe: 4.27 ± 0.0
2.491AspGly: 2.491 ± 0.0
1.779AspHis: 1.779 ± 0.0
3.203AspIle: 3.203 ± 0.0
2.847AspLys: 2.847 ± 0.0
4.27AspLeu: 4.27 ± 0.0
2.491AspMet: 2.491 ± 0.0
2.847AspAsn: 2.847 ± 0.0
2.491AspPro: 2.491 ± 0.0
0.712AspGln: 0.712 ± 0.0
1.068AspArg: 1.068 ± 0.0
2.491AspSer: 2.491 ± 0.0
3.559AspThr: 3.559 ± 0.0
6.05AspVal: 6.05 ± 0.0
0.356AspTrp: 0.356 ± 0.0
2.135AspTyr: 2.135 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.559GluAla: 3.559 ± 0.0
0.356GluCys: 0.356 ± 0.0
2.847GluAsp: 2.847 ± 0.0
4.982GluGlu: 4.982 ± 0.0
4.982GluPhe: 4.982 ± 0.0
2.135GluGly: 2.135 ± 0.0
1.423GluHis: 1.423 ± 0.0
3.915GluIle: 3.915 ± 0.0
3.203GluLys: 3.203 ± 0.0
6.05GluLeu: 6.05 ± 0.0
1.779GluMet: 1.779 ± 0.0
2.847GluAsn: 2.847 ± 0.0
2.847GluPro: 2.847 ± 0.0
2.847GluGln: 2.847 ± 0.0
4.27GluArg: 4.27 ± 0.0
3.915GluSer: 3.915 ± 0.0
3.915GluThr: 3.915 ± 0.0
4.626GluVal: 4.626 ± 0.0
1.068GluTrp: 1.068 ± 0.0
1.068GluTyr: 1.068 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.559PheAla: 3.559 ± 0.0
1.423PheCys: 1.423 ± 0.0
6.05PheAsp: 6.05 ± 0.0
4.626PheGlu: 4.626 ± 0.0
3.559PhePhe: 3.559 ± 0.0
3.559PheGly: 3.559 ± 0.0
1.068PheHis: 1.068 ± 0.0
2.491PheIle: 2.491 ± 0.0
1.423PheLys: 1.423 ± 0.0
4.982PheLeu: 4.982 ± 0.0
1.068PheMet: 1.068 ± 0.0
4.982PheAsn: 4.982 ± 0.0
2.491PhePro: 2.491 ± 0.0
1.779PheGln: 1.779 ± 0.0
3.559PheArg: 3.559 ± 0.0
4.626PheSer: 4.626 ± 0.0
3.915PheThr: 3.915 ± 0.0
2.847PheVal: 2.847 ± 0.0
1.779PheTrp: 1.779 ± 0.0
1.068PheTyr: 1.068 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.915GlyAla: 3.915 ± 0.0
0.356GlyCys: 0.356 ± 0.0
3.203GlyAsp: 3.203 ± 0.0
4.27GlyGlu: 4.27 ± 0.0
4.626GlyPhe: 4.626 ± 0.0
2.491GlyGly: 2.491 ± 0.0
2.135GlyHis: 2.135 ± 0.0
3.559GlyIle: 3.559 ± 0.0
3.559GlyLys: 3.559 ± 0.0
5.338GlyLeu: 5.338 ± 0.0
1.068GlyMet: 1.068 ± 0.0
2.847GlyAsn: 2.847 ± 0.0
2.847GlyPro: 2.847 ± 0.0
2.847GlyGln: 2.847 ± 0.0
3.203GlyArg: 3.203 ± 0.0
1.423GlySer: 1.423 ± 0.0
4.626GlyThr: 4.626 ± 0.0
4.626GlyVal: 4.626 ± 0.0
0.0GlyTrp: 0.0 ± 0.0
1.423GlyTyr: 1.423 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.712HisAla: 0.712 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.068HisAsp: 1.068 ± 0.0
2.847HisGlu: 2.847 ± 0.0
1.068HisPhe: 1.068 ± 0.0
1.779HisGly: 1.779 ± 0.0
1.779HisHis: 1.779 ± 0.0
1.068HisIle: 1.068 ± 0.0
1.779HisLys: 1.779 ± 0.0
2.135HisLeu: 2.135 ± 0.0
0.356HisMet: 0.356 ± 0.0
0.356HisAsn: 0.356 ± 0.0
2.491HisPro: 2.491 ± 0.0
1.068HisGln: 1.068 ± 0.0
1.068HisArg: 1.068 ± 0.0
2.135HisSer: 2.135 ± 0.0
0.356HisThr: 0.356 ± 0.0
2.491HisVal: 2.491 ± 0.0
0.356HisTrp: 0.356 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.762IleAla: 6.762 ± 0.0
0.0IleCys: 0.0 ± 0.0
4.626IleAsp: 4.626 ± 0.0
1.423IleGlu: 1.423 ± 0.0
2.847IlePhe: 2.847 ± 0.0
4.27IleGly: 4.27 ± 0.0
1.068IleHis: 1.068 ± 0.0
2.491IleIle: 2.491 ± 0.0
2.491IleLys: 2.491 ± 0.0
4.626IleLeu: 4.626 ± 0.0
0.356IleMet: 0.356 ± 0.0
4.626IleAsn: 4.626 ± 0.0
4.27IlePro: 4.27 ± 0.0
1.423IleGln: 1.423 ± 0.0
2.847IleArg: 2.847 ± 0.0
4.27IleSer: 4.27 ± 0.0
2.847IleThr: 2.847 ± 0.0
3.203IleVal: 3.203 ± 0.0
0.712IleTrp: 0.712 ± 0.0
2.847IleTyr: 2.847 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.203LysAla: 3.203 ± 0.0
0.712LysCys: 0.712 ± 0.0
3.915LysAsp: 3.915 ± 0.0
4.27LysGlu: 4.27 ± 0.0
2.847LysPhe: 2.847 ± 0.0
2.135LysGly: 2.135 ± 0.0
1.779LysHis: 1.779 ± 0.0
4.626LysIle: 4.626 ± 0.0
3.203LysLys: 3.203 ± 0.0
3.559LysLeu: 3.559 ± 0.0
1.068LysMet: 1.068 ± 0.0
2.847LysAsn: 2.847 ± 0.0
2.847LysPro: 2.847 ± 0.0
1.068LysGln: 1.068 ± 0.0
1.779LysArg: 1.779 ± 0.0
1.779LysSer: 1.779 ± 0.0
3.203LysThr: 3.203 ± 0.0
4.27LysVal: 4.27 ± 0.0
0.356LysTrp: 0.356 ± 0.0
1.068LysTyr: 1.068 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
5.694LeuAla: 5.694 ± 0.0
2.491LeuCys: 2.491 ± 0.0
3.915LeuAsp: 3.915 ± 0.0
2.847LeuGlu: 2.847 ± 0.0
2.847LeuPhe: 2.847 ± 0.0
6.762LeuGly: 6.762 ± 0.0
1.068LeuHis: 1.068 ± 0.0
4.626LeuIle: 4.626 ± 0.0
3.559LeuLys: 3.559 ± 0.0
6.05LeuLeu: 6.05 ± 0.0
2.135LeuMet: 2.135 ± 0.0
5.338LeuAsn: 5.338 ± 0.0
1.423LeuPro: 1.423 ± 0.0
3.915LeuGln: 3.915 ± 0.0
4.626LeuArg: 4.626 ± 0.0
5.694LeuSer: 5.694 ± 0.0
7.117LeuThr: 7.117 ± 0.0
4.982LeuVal: 4.982 ± 0.0
0.712LeuTrp: 0.712 ± 0.0
2.135LeuTyr: 2.135 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.779MetAla: 1.779 ± 0.0
0.356MetCys: 0.356 ± 0.0
1.068MetAsp: 1.068 ± 0.0
2.491MetGlu: 2.491 ± 0.0
1.423MetPhe: 1.423 ± 0.0
0.712MetGly: 0.712 ± 0.0
0.712MetHis: 0.712 ± 0.0
1.779MetIle: 1.779 ± 0.0
1.068MetLys: 1.068 ± 0.0
1.779MetLeu: 1.779 ± 0.0
1.423MetMet: 1.423 ± 0.0
0.0MetAsn: 0.0 ± 0.0
2.491MetPro: 2.491 ± 0.0
0.356MetGln: 0.356 ± 0.0
0.712MetArg: 0.712 ± 0.0
2.847MetSer: 2.847 ± 0.0
1.779MetThr: 1.779 ± 0.0
3.203MetVal: 3.203 ± 0.0
0.356MetTrp: 0.356 ± 0.0
0.356MetTyr: 0.356 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.423AsnAla: 1.423 ± 0.0
0.712AsnCys: 0.712 ± 0.0
2.491AsnAsp: 2.491 ± 0.0
2.847AsnGlu: 2.847 ± 0.0
4.626AsnPhe: 4.626 ± 0.0
2.847AsnGly: 2.847 ± 0.0
2.491AsnHis: 2.491 ± 0.0
1.423AsnIle: 1.423 ± 0.0
1.779AsnLys: 1.779 ± 0.0
2.491AsnLeu: 2.491 ± 0.0
2.135AsnMet: 2.135 ± 0.0
1.423AsnAsn: 1.423 ± 0.0
2.847AsnPro: 2.847 ± 0.0
2.847AsnGln: 2.847 ± 0.0
3.559AsnArg: 3.559 ± 0.0
2.135AsnSer: 2.135 ± 0.0
3.915AsnThr: 3.915 ± 0.0
4.626AsnVal: 4.626 ± 0.0
0.356AsnTrp: 0.356 ± 0.0
0.356AsnTyr: 0.356 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.779ProAla: 1.779 ± 0.0
0.356ProCys: 0.356 ± 0.0
1.779ProAsp: 1.779 ± 0.0
4.626ProGlu: 4.626 ± 0.0
2.847ProPhe: 2.847 ± 0.0
2.135ProGly: 2.135 ± 0.0
0.712ProHis: 0.712 ± 0.0
4.27ProIle: 4.27 ± 0.0
3.203ProLys: 3.203 ± 0.0
4.27ProLeu: 4.27 ± 0.0
2.135ProMet: 2.135 ± 0.0
2.847ProAsn: 2.847 ± 0.0
3.915ProPro: 3.915 ± 0.0
1.779ProGln: 1.779 ± 0.0
3.203ProArg: 3.203 ± 0.0
3.203ProSer: 3.203 ± 0.0
3.915ProThr: 3.915 ± 0.0
4.982ProVal: 4.982 ± 0.0
1.068ProTrp: 1.068 ± 0.0
2.491ProTyr: 2.491 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.779GlnAla: 1.779 ± 0.0
0.0GlnCys: 0.0 ± 0.0
3.559GlnAsp: 3.559 ± 0.0
1.068GlnGlu: 1.068 ± 0.0
1.423GlnPhe: 1.423 ± 0.0
4.982GlnGly: 4.982 ± 0.0
0.712GlnHis: 0.712 ± 0.0
1.423GlnIle: 1.423 ± 0.0
1.423GlnLys: 1.423 ± 0.0
2.847GlnLeu: 2.847 ± 0.0
0.356GlnMet: 0.356 ± 0.0
1.068GlnAsn: 1.068 ± 0.0
1.779GlnPro: 1.779 ± 0.0
1.779GlnGln: 1.779 ± 0.0
2.135GlnArg: 2.135 ± 0.0
4.626GlnSer: 4.626 ± 0.0
2.135GlnThr: 2.135 ± 0.0
1.779GlnVal: 1.779 ± 0.0
0.356GlnTrp: 0.356 ± 0.0
1.068GlnTyr: 1.068 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.626ArgAla: 4.626 ± 0.0
1.423ArgCys: 1.423 ± 0.0
2.491ArgAsp: 2.491 ± 0.0
3.559ArgGlu: 3.559 ± 0.0
4.982ArgPhe: 4.982 ± 0.0
1.423ArgGly: 1.423 ± 0.0
0.356ArgHis: 0.356 ± 0.0
4.626ArgIle: 4.626 ± 0.0
2.847ArgLys: 2.847 ± 0.0
2.847ArgLeu: 2.847 ± 0.0
1.068ArgMet: 1.068 ± 0.0
1.779ArgAsn: 1.779 ± 0.0
2.135ArgPro: 2.135 ± 0.0
1.068ArgGln: 1.068 ± 0.0
4.626ArgArg: 4.626 ± 0.0
3.203ArgSer: 3.203 ± 0.0
6.05ArgThr: 6.05 ± 0.0
2.847ArgVal: 2.847 ± 0.0
0.0ArgTrp: 0.0 ± 0.0
2.135ArgTyr: 2.135 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.982SerAla: 4.982 ± 0.0
0.712SerCys: 0.712 ± 0.0
5.338SerAsp: 5.338 ± 0.0
1.779SerGlu: 1.779 ± 0.0
4.982SerPhe: 4.982 ± 0.0
5.338SerGly: 5.338 ± 0.0
1.779SerHis: 1.779 ± 0.0
3.915SerIle: 3.915 ± 0.0
3.915SerLys: 3.915 ± 0.0
1.779SerLeu: 1.779 ± 0.0
1.068SerMet: 1.068 ± 0.0
1.779SerAsn: 1.779 ± 0.0
3.915SerPro: 3.915 ± 0.0
2.491SerGln: 2.491 ± 0.0
2.847SerArg: 2.847 ± 0.0
4.27SerSer: 4.27 ± 0.0
7.473SerThr: 7.473 ± 0.0
7.117SerVal: 7.117 ± 0.0
1.423SerTrp: 1.423 ± 0.0
1.423SerTyr: 1.423 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.27ThrAla: 4.27 ± 0.0
0.712ThrCys: 0.712 ± 0.0
1.779ThrAsp: 1.779 ± 0.0
4.982ThrGlu: 4.982 ± 0.0
4.27ThrPhe: 4.27 ± 0.0
3.559ThrGly: 3.559 ± 0.0
2.135ThrHis: 2.135 ± 0.0
3.915ThrIle: 3.915 ± 0.0
3.203ThrLys: 3.203 ± 0.0
8.541ThrLeu: 8.541 ± 0.0
1.068ThrMet: 1.068 ± 0.0
2.491ThrAsn: 2.491 ± 0.0
6.406ThrPro: 6.406 ± 0.0
3.559ThrGln: 3.559 ± 0.0
4.626ThrArg: 4.626 ± 0.0
4.982ThrSer: 4.982 ± 0.0
6.05ThrThr: 6.05 ± 0.0
3.915ThrVal: 3.915 ± 0.0
0.712ThrTrp: 0.712 ± 0.0
3.559ThrTyr: 3.559 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.626ValAla: 4.626 ± 0.0
1.779ValCys: 1.779 ± 0.0
4.626ValAsp: 4.626 ± 0.0
3.203ValGlu: 3.203 ± 0.0
3.559ValPhe: 3.559 ± 0.0
3.203ValGly: 3.203 ± 0.0
1.423ValHis: 1.423 ± 0.0
2.135ValIle: 2.135 ± 0.0
6.762ValLys: 6.762 ± 0.0
4.982ValLeu: 4.982 ± 0.0
3.203ValMet: 3.203 ± 0.0
3.915ValAsn: 3.915 ± 0.0
2.847ValPro: 2.847 ± 0.0
2.491ValGln: 2.491 ± 0.0
3.915ValArg: 3.915 ± 0.0
7.117ValSer: 7.117 ± 0.0
4.626ValThr: 4.626 ± 0.0
3.203ValVal: 3.203 ± 0.0
1.068ValTrp: 1.068 ± 0.0
2.491ValTyr: 2.491 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.356TrpAla: 0.356 ± 0.0
0.356TrpCys: 0.356 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.712TrpGlu: 0.712 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.356TrpGly: 0.356 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.068TrpIle: 1.068 ± 0.0
0.356TrpLys: 0.356 ± 0.0
1.068TrpLeu: 1.068 ± 0.0
0.712TrpMet: 0.712 ± 0.0
1.779TrpAsn: 1.779 ± 0.0
0.356TrpPro: 0.356 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.068TrpArg: 1.068 ± 0.0
1.779TrpSer: 1.779 ± 0.0
1.779TrpThr: 1.779 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.491TyrAla: 2.491 ± 0.0
0.0TyrCys: 0.0 ± 0.0
1.423TyrAsp: 1.423 ± 0.0
1.779TyrGlu: 1.779 ± 0.0
1.423TyrPhe: 1.423 ± 0.0
1.423TyrGly: 1.423 ± 0.0
0.356TyrHis: 0.356 ± 0.0
2.135TyrIle: 2.135 ± 0.0
1.423TyrLys: 1.423 ± 0.0
2.847TyrLeu: 2.847 ± 0.0
0.712TyrMet: 0.712 ± 0.0
1.068TyrAsn: 1.068 ± 0.0
0.712TyrPro: 0.712 ± 0.0
0.712TyrGln: 0.712 ± 0.0
2.135TyrArg: 2.135 ± 0.0
3.559TyrSer: 3.559 ± 0.0
2.135TyrThr: 2.135 ± 0.0
1.423TyrVal: 1.423 ± 0.0
0.356TyrTrp: 0.356 ± 0.0
0.356TyrTyr: 0.356 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2811 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski