Amino acid dipepetide frequency for Hubei picorna-like virus 40

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.126AlaAla: 3.126 ± 0.0
1.042AlaCys: 1.042 ± 0.0
3.126AlaAsp: 3.126 ± 0.0
4.168AlaGlu: 4.168 ± 0.0
3.126AlaPhe: 3.126 ± 0.0
5.905AlaGly: 5.905 ± 0.0
1.042AlaHis: 1.042 ± 0.0
3.126AlaIle: 3.126 ± 0.0
2.084AlaLys: 2.084 ± 0.0
5.557AlaLeu: 5.557 ± 0.0
1.042AlaMet: 1.042 ± 0.0
2.779AlaAsn: 2.779 ± 0.0
2.084AlaPro: 2.084 ± 0.0
3.126AlaGln: 3.126 ± 0.0
4.168AlaArg: 4.168 ± 0.0
4.515AlaSer: 4.515 ± 0.0
4.863AlaThr: 4.863 ± 0.0
3.473AlaVal: 3.473 ± 0.0
1.389AlaTrp: 1.389 ± 0.0
1.389AlaTyr: 1.389 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.695CysAla: 0.695 ± 0.0
0.347CysCys: 0.347 ± 0.0
0.347CysAsp: 0.347 ± 0.0
1.042CysGlu: 1.042 ± 0.0
0.347CysPhe: 0.347 ± 0.0
1.389CysGly: 1.389 ± 0.0
0.347CysHis: 0.347 ± 0.0
0.695CysIle: 0.695 ± 0.0
1.389CysLys: 1.389 ± 0.0
2.779CysLeu: 2.779 ± 0.0
0.347CysMet: 0.347 ± 0.0
0.695CysAsn: 0.695 ± 0.0
1.737CysPro: 1.737 ± 0.0
0.695CysGln: 0.695 ± 0.0
1.042CysArg: 1.042 ± 0.0
1.389CysSer: 1.389 ± 0.0
1.389CysThr: 1.389 ± 0.0
2.084CysVal: 2.084 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.695CysTyr: 0.695 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.779AspAla: 2.779 ± 0.0
0.695AspCys: 0.695 ± 0.0
2.431AspAsp: 2.431 ± 0.0
2.431AspGlu: 2.431 ± 0.0
3.473AspPhe: 3.473 ± 0.0
1.042AspGly: 1.042 ± 0.0
0.347AspHis: 0.347 ± 0.0
1.389AspIle: 1.389 ± 0.0
3.126AspLys: 3.126 ± 0.0
4.863AspLeu: 4.863 ± 0.0
0.695AspMet: 0.695 ± 0.0
1.042AspAsn: 1.042 ± 0.0
1.737AspPro: 1.737 ± 0.0
4.168AspGln: 4.168 ± 0.0
3.126AspArg: 3.126 ± 0.0
3.126AspSer: 3.126 ± 0.0
4.168AspThr: 4.168 ± 0.0
3.126AspVal: 3.126 ± 0.0
1.042AspTrp: 1.042 ± 0.0
2.779AspTyr: 2.779 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.821GluAla: 3.821 ± 0.0
1.389GluCys: 1.389 ± 0.0
1.737GluAsp: 1.737 ± 0.0
5.21GluGlu: 5.21 ± 0.0
1.389GluPhe: 1.389 ± 0.0
3.126GluGly: 3.126 ± 0.0
0.695GluHis: 0.695 ± 0.0
5.21GluIle: 5.21 ± 0.0
5.557GluLys: 5.557 ± 0.0
2.431GluLeu: 2.431 ± 0.0
2.779GluMet: 2.779 ± 0.0
3.126GluAsn: 3.126 ± 0.0
4.515GluPro: 4.515 ± 0.0
3.821GluGln: 3.821 ± 0.0
1.737GluArg: 1.737 ± 0.0
3.126GluSer: 3.126 ± 0.0
3.821GluThr: 3.821 ± 0.0
3.473GluVal: 3.473 ± 0.0
2.431GluTrp: 2.431 ± 0.0
2.084GluTyr: 2.084 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.695PheAla: 0.695 ± 0.0
0.347PheCys: 0.347 ± 0.0
1.389PheAsp: 1.389 ± 0.0
1.737PheGlu: 1.737 ± 0.0
1.389PhePhe: 1.389 ± 0.0
4.168PheGly: 4.168 ± 0.0
1.042PheHis: 1.042 ± 0.0
2.431PheIle: 2.431 ± 0.0
3.821PheLys: 3.821 ± 0.0
2.779PheLeu: 2.779 ± 0.0
2.431PheMet: 2.431 ± 0.0
3.821PheAsn: 3.821 ± 0.0
2.431PhePro: 2.431 ± 0.0
1.042PheGln: 1.042 ± 0.0
2.431PheArg: 2.431 ± 0.0
1.042PheSer: 1.042 ± 0.0
1.737PheThr: 1.737 ± 0.0
3.821PheVal: 3.821 ± 0.0
1.389PheTrp: 1.389 ± 0.0
1.737PheTyr: 1.737 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.431GlyAla: 2.431 ± 0.0
0.695GlyCys: 0.695 ± 0.0
3.126GlyAsp: 3.126 ± 0.0
3.821GlyGlu: 3.821 ± 0.0
1.737GlyPhe: 1.737 ± 0.0
2.084GlyGly: 2.084 ± 0.0
1.737GlyHis: 1.737 ± 0.0
4.515GlyIle: 4.515 ± 0.0
3.473GlyLys: 3.473 ± 0.0
5.557GlyLeu: 5.557 ± 0.0
1.389GlyMet: 1.389 ± 0.0
1.737GlyAsn: 1.737 ± 0.0
3.126GlyPro: 3.126 ± 0.0
0.0GlyGln: 0.0 ± 0.0
1.389GlyArg: 1.389 ± 0.0
4.168GlySer: 4.168 ± 0.0
3.821GlyThr: 3.821 ± 0.0
6.6GlyVal: 6.6 ± 0.0
0.695GlyTrp: 0.695 ± 0.0
3.126GlyTyr: 3.126 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.347HisAla: 0.347 ± 0.0
1.042HisCys: 1.042 ± 0.0
0.695HisAsp: 0.695 ± 0.0
0.347HisGlu: 0.347 ± 0.0
1.389HisPhe: 1.389 ± 0.0
3.126HisGly: 3.126 ± 0.0
0.347HisHis: 0.347 ± 0.0
0.695HisIle: 0.695 ± 0.0
0.0HisLys: 0.0 ± 0.0
2.084HisLeu: 2.084 ± 0.0
0.347HisMet: 0.347 ± 0.0
0.347HisAsn: 0.347 ± 0.0
0.695HisPro: 0.695 ± 0.0
1.042HisGln: 1.042 ± 0.0
1.389HisArg: 1.389 ± 0.0
1.389HisSer: 1.389 ± 0.0
1.737HisThr: 1.737 ± 0.0
1.042HisVal: 1.042 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.042HisTyr: 1.042 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.779IleAla: 2.779 ± 0.0
1.737IleCys: 1.737 ± 0.0
2.779IleAsp: 2.779 ± 0.0
5.557IleGlu: 5.557 ± 0.0
4.168IlePhe: 4.168 ± 0.0
2.431IleGly: 2.431 ± 0.0
1.042IleHis: 1.042 ± 0.0
4.168IleIle: 4.168 ± 0.0
2.084IleLys: 2.084 ± 0.0
4.863IleLeu: 4.863 ± 0.0
2.431IleMet: 2.431 ± 0.0
3.126IleAsn: 3.126 ± 0.0
3.126IlePro: 3.126 ± 0.0
3.126IleGln: 3.126 ± 0.0
2.431IleArg: 2.431 ± 0.0
2.084IleSer: 2.084 ± 0.0
5.557IleThr: 5.557 ± 0.0
4.863IleVal: 4.863 ± 0.0
1.042IleTrp: 1.042 ± 0.0
1.389IleTyr: 1.389 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.779LysAla: 2.779 ± 0.0
0.695LysCys: 0.695 ± 0.0
3.821LysAsp: 3.821 ± 0.0
2.431LysGlu: 2.431 ± 0.0
3.126LysPhe: 3.126 ± 0.0
1.042LysGly: 1.042 ± 0.0
1.042LysHis: 1.042 ± 0.0
3.821LysIle: 3.821 ± 0.0
4.863LysLys: 4.863 ± 0.0
4.168LysLeu: 4.168 ± 0.0
2.084LysMet: 2.084 ± 0.0
3.126LysAsn: 3.126 ± 0.0
2.431LysPro: 2.431 ± 0.0
2.779LysGln: 2.779 ± 0.0
3.473LysArg: 3.473 ± 0.0
4.515LysSer: 4.515 ± 0.0
3.473LysThr: 3.473 ± 0.0
2.084LysVal: 2.084 ± 0.0
0.0LysTrp: 0.0 ± 0.0
2.431LysTyr: 2.431 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.515LeuAla: 4.515 ± 0.0
2.084LeuCys: 2.084 ± 0.0
3.821LeuAsp: 3.821 ± 0.0
6.947LeuGlu: 6.947 ± 0.0
2.431LeuPhe: 2.431 ± 0.0
4.168LeuGly: 4.168 ± 0.0
1.737LeuHis: 1.737 ± 0.0
3.473LeuIle: 3.473 ± 0.0
6.252LeuLys: 6.252 ± 0.0
8.684LeuLeu: 8.684 ± 0.0
1.737LeuMet: 1.737 ± 0.0
3.821LeuAsn: 3.821 ± 0.0
3.126LeuPro: 3.126 ± 0.0
3.821LeuGln: 3.821 ± 0.0
5.905LeuArg: 5.905 ± 0.0
5.21LeuSer: 5.21 ± 0.0
10.073LeuThr: 10.073 ± 0.0
7.642LeuVal: 7.642 ± 0.0
1.042LeuTrp: 1.042 ± 0.0
1.737LeuTyr: 1.737 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.779MetAla: 2.779 ± 0.0
1.389MetCys: 1.389 ± 0.0
3.821MetAsp: 3.821 ± 0.0
1.737MetGlu: 1.737 ± 0.0
1.042MetPhe: 1.042 ± 0.0
0.347MetGly: 0.347 ± 0.0
1.737MetHis: 1.737 ± 0.0
1.042MetIle: 1.042 ± 0.0
0.695MetLys: 0.695 ± 0.0
2.779MetLeu: 2.779 ± 0.0
1.042MetMet: 1.042 ± 0.0
1.389MetAsn: 1.389 ± 0.0
1.389MetPro: 1.389 ± 0.0
0.347MetGln: 0.347 ± 0.0
2.084MetArg: 2.084 ± 0.0
2.084MetSer: 2.084 ± 0.0
1.737MetThr: 1.737 ± 0.0
2.431MetVal: 2.431 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.695MetTyr: 0.695 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.084AsnAla: 2.084 ± 0.0
1.042AsnCys: 1.042 ± 0.0
2.779AsnAsp: 2.779 ± 0.0
2.431AsnGlu: 2.431 ± 0.0
2.431AsnPhe: 2.431 ± 0.0
1.737AsnGly: 1.737 ± 0.0
1.737AsnHis: 1.737 ± 0.0
1.042AsnIle: 1.042 ± 0.0
1.389AsnLys: 1.389 ± 0.0
4.515AsnLeu: 4.515 ± 0.0
2.084AsnMet: 2.084 ± 0.0
3.821AsnAsn: 3.821 ± 0.0
3.821AsnPro: 3.821 ± 0.0
2.084AsnGln: 2.084 ± 0.0
1.042AsnArg: 1.042 ± 0.0
2.779AsnSer: 2.779 ± 0.0
3.126AsnThr: 3.126 ± 0.0
2.779AsnVal: 2.779 ± 0.0
0.0AsnTrp: 0.0 ± 0.0
2.779AsnTyr: 2.779 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
0.695ProAla: 0.695 ± 0.0
1.042ProCys: 1.042 ± 0.0
2.084ProAsp: 2.084 ± 0.0
3.821ProGlu: 3.821 ± 0.0
2.084ProPhe: 2.084 ± 0.0
1.737ProGly: 1.737 ± 0.0
1.389ProHis: 1.389 ± 0.0
3.126ProIle: 3.126 ± 0.0
4.515ProLys: 4.515 ± 0.0
6.947ProLeu: 6.947 ± 0.0
1.737ProMet: 1.737 ± 0.0
1.389ProAsn: 1.389 ± 0.0
0.695ProPro: 0.695 ± 0.0
1.389ProGln: 1.389 ± 0.0
2.431ProArg: 2.431 ± 0.0
3.126ProSer: 3.126 ± 0.0
4.863ProThr: 4.863 ± 0.0
2.779ProVal: 2.779 ± 0.0
0.695ProTrp: 0.695 ± 0.0
3.126ProTyr: 3.126 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
5.905GlnAla: 5.905 ± 0.0
0.695GlnCys: 0.695 ± 0.0
1.737GlnAsp: 1.737 ± 0.0
1.737GlnGlu: 1.737 ± 0.0
1.042GlnPhe: 1.042 ± 0.0
2.431GlnGly: 2.431 ± 0.0
0.695GlnHis: 0.695 ± 0.0
3.473GlnIle: 3.473 ± 0.0
1.737GlnLys: 1.737 ± 0.0
3.821GlnLeu: 3.821 ± 0.0
1.737GlnMet: 1.737 ± 0.0
1.389GlnAsn: 1.389 ± 0.0
2.084GlnPro: 2.084 ± 0.0
2.084GlnGln: 2.084 ± 0.0
2.084GlnArg: 2.084 ± 0.0
4.168GlnSer: 4.168 ± 0.0
1.737GlnThr: 1.737 ± 0.0
2.084GlnVal: 2.084 ± 0.0
0.695GlnTrp: 0.695 ± 0.0
3.126GlnTyr: 3.126 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.473ArgAla: 3.473 ± 0.0
1.389ArgCys: 1.389 ± 0.0
2.431ArgAsp: 2.431 ± 0.0
3.473ArgGlu: 3.473 ± 0.0
2.779ArgPhe: 2.779 ± 0.0
2.431ArgGly: 2.431 ± 0.0
1.389ArgHis: 1.389 ± 0.0
2.431ArgIle: 2.431 ± 0.0
2.084ArgLys: 2.084 ± 0.0
5.905ArgLeu: 5.905 ± 0.0
1.737ArgMet: 1.737 ± 0.0
2.084ArgAsn: 2.084 ± 0.0
2.779ArgPro: 2.779 ± 0.0
0.347ArgGln: 0.347 ± 0.0
5.557ArgArg: 5.557 ± 0.0
2.431ArgSer: 2.431 ± 0.0
2.779ArgThr: 2.779 ± 0.0
4.168ArgVal: 4.168 ± 0.0
1.389ArgTrp: 1.389 ± 0.0
3.473ArgTyr: 3.473 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
5.905SerAla: 5.905 ± 0.0
1.389SerCys: 1.389 ± 0.0
2.084SerAsp: 2.084 ± 0.0
3.473SerGlu: 3.473 ± 0.0
2.779SerPhe: 2.779 ± 0.0
3.473SerGly: 3.473 ± 0.0
0.695SerHis: 0.695 ± 0.0
4.863SerIle: 4.863 ± 0.0
2.779SerLys: 2.779 ± 0.0
3.473SerLeu: 3.473 ± 0.0
1.042SerMet: 1.042 ± 0.0
3.473SerAsn: 3.473 ± 0.0
4.168SerPro: 4.168 ± 0.0
3.821SerGln: 3.821 ± 0.0
3.821SerArg: 3.821 ± 0.0
7.642SerSer: 7.642 ± 0.0
3.821SerThr: 3.821 ± 0.0
5.557SerVal: 5.557 ± 0.0
1.737SerTrp: 1.737 ± 0.0
3.126SerTyr: 3.126 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.863ThrAla: 4.863 ± 0.0
1.389ThrCys: 1.389 ± 0.0
3.821ThrAsp: 3.821 ± 0.0
4.168ThrGlu: 4.168 ± 0.0
1.737ThrPhe: 1.737 ± 0.0
5.21ThrGly: 5.21 ± 0.0
0.347ThrHis: 0.347 ± 0.0
7.294ThrIle: 7.294 ± 0.0
2.084ThrLys: 2.084 ± 0.0
5.557ThrLeu: 5.557 ± 0.0
1.737ThrMet: 1.737 ± 0.0
2.779ThrAsn: 2.779 ± 0.0
3.126ThrPro: 3.126 ± 0.0
3.821ThrGln: 3.821 ± 0.0
2.084ThrArg: 2.084 ± 0.0
5.557ThrSer: 5.557 ± 0.0
6.252ThrThr: 6.252 ± 0.0
4.863ThrVal: 4.863 ± 0.0
0.695ThrTrp: 0.695 ± 0.0
2.779ThrTyr: 2.779 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
6.6ValAla: 6.6 ± 0.0
1.042ValCys: 1.042 ± 0.0
3.126ValAsp: 3.126 ± 0.0
3.126ValGlu: 3.126 ± 0.0
2.779ValPhe: 2.779 ± 0.0
5.21ValGly: 5.21 ± 0.0
0.695ValHis: 0.695 ± 0.0
3.473ValIle: 3.473 ± 0.0
3.473ValLys: 3.473 ± 0.0
6.6ValLeu: 6.6 ± 0.0
2.084ValMet: 2.084 ± 0.0
3.821ValAsn: 3.821 ± 0.0
5.557ValPro: 5.557 ± 0.0
4.515ValGln: 4.515 ± 0.0
3.473ValArg: 3.473 ± 0.0
5.557ValSer: 5.557 ± 0.0
2.431ValThr: 2.431 ± 0.0
3.821ValVal: 3.821 ± 0.0
1.042ValTrp: 1.042 ± 0.0
1.737ValTyr: 1.737 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.737TrpAla: 1.737 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.695TrpAsp: 0.695 ± 0.0
1.389TrpGlu: 1.389 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.042TrpGly: 1.042 ± 0.0
0.347TrpHis: 0.347 ± 0.0
0.695TrpIle: 0.695 ± 0.0
1.042TrpLys: 1.042 ± 0.0
1.042TrpLeu: 1.042 ± 0.0
1.389TrpMet: 1.389 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.389TrpArg: 1.389 ± 0.0
2.084TrpSer: 2.084 ± 0.0
1.042TrpThr: 1.042 ± 0.0
1.389TrpVal: 1.389 ± 0.0
0.347TrpTrp: 0.347 ± 0.0
0.347TrpTyr: 0.347 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.126TyrAla: 3.126 ± 0.0
0.0TyrCys: 0.0 ± 0.0
1.737TyrAsp: 1.737 ± 0.0
2.431TyrGlu: 2.431 ± 0.0
2.431TyrPhe: 2.431 ± 0.0
3.126TyrGly: 3.126 ± 0.0
0.695TyrHis: 0.695 ± 0.0
4.168TyrIle: 4.168 ± 0.0
1.737TyrLys: 1.737 ± 0.0
3.821TyrLeu: 3.821 ± 0.0
0.347TyrMet: 0.347 ± 0.0
1.737TyrAsn: 1.737 ± 0.0
1.042TyrPro: 1.042 ± 0.0
2.431TyrGln: 2.431 ± 0.0
3.473TyrArg: 3.473 ± 0.0
3.126TyrSer: 3.126 ± 0.0
1.737TyrThr: 1.737 ± 0.0
2.084TyrVal: 2.084 ± 0.0
0.347TyrTrp: 0.347 ± 0.0
1.389TyrTyr: 1.389 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2880 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski