Amino acid dipepetide frequency for Hubei picorna-like virus 65

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.62AlaAla: 10.62 ± 0.0
1.713AlaCys: 1.713 ± 0.0
5.139AlaAsp: 5.139 ± 0.0
2.741AlaGlu: 2.741 ± 0.0
4.111AlaPhe: 4.111 ± 0.0
7.194AlaGly: 7.194 ± 0.0
0.685AlaHis: 0.685 ± 0.0
3.768AlaIle: 3.768 ± 0.0
4.454AlaLys: 4.454 ± 0.0
6.166AlaLeu: 6.166 ± 0.0
1.028AlaMet: 1.028 ± 0.0
3.426AlaAsn: 3.426 ± 0.0
4.454AlaPro: 4.454 ± 0.0
4.111AlaGln: 4.111 ± 0.0
2.741AlaArg: 2.741 ± 0.0
9.25AlaSer: 9.25 ± 0.0
4.796AlaThr: 4.796 ± 0.0
5.481AlaVal: 5.481 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
4.796AlaTyr: 4.796 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.37CysAla: 1.37 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.343CysAsp: 0.343 ± 0.0
0.343CysGlu: 0.343 ± 0.0
1.37CysPhe: 1.37 ± 0.0
1.37CysGly: 1.37 ± 0.0
0.685CysHis: 0.685 ± 0.0
0.343CysIle: 0.343 ± 0.0
0.685CysLys: 0.685 ± 0.0
2.055CysLeu: 2.055 ± 0.0
1.37CysMet: 1.37 ± 0.0
0.685CysAsn: 0.685 ± 0.0
0.343CysPro: 0.343 ± 0.0
1.028CysGln: 1.028 ± 0.0
0.343CysArg: 0.343 ± 0.0
1.028CysSer: 1.028 ± 0.0
1.028CysThr: 1.028 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.685CysTrp: 0.685 ± 0.0
0.685CysTyr: 0.685 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.454AspAla: 4.454 ± 0.0
1.713AspCys: 1.713 ± 0.0
3.083AspAsp: 3.083 ± 0.0
3.426AspGlu: 3.426 ± 0.0
1.713AspPhe: 1.713 ± 0.0
2.398AspGly: 2.398 ± 0.0
1.37AspHis: 1.37 ± 0.0
3.426AspIle: 3.426 ± 0.0
2.055AspLys: 2.055 ± 0.0
3.768AspLeu: 3.768 ± 0.0
2.741AspMet: 2.741 ± 0.0
1.37AspAsn: 1.37 ± 0.0
3.083AspPro: 3.083 ± 0.0
0.685AspGln: 0.685 ± 0.0
1.37AspArg: 1.37 ± 0.0
3.426AspSer: 3.426 ± 0.0
2.741AspThr: 2.741 ± 0.0
4.796AspVal: 4.796 ± 0.0
1.028AspTrp: 1.028 ± 0.0
1.37AspTyr: 1.37 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
1.713GluAla: 1.713 ± 0.0
0.0GluCys: 0.0 ± 0.0
1.028GluAsp: 1.028 ± 0.0
2.741GluGlu: 2.741 ± 0.0
3.426GluPhe: 3.426 ± 0.0
2.055GluGly: 2.055 ± 0.0
0.685GluHis: 0.685 ± 0.0
2.741GluIle: 2.741 ± 0.0
3.083GluLys: 3.083 ± 0.0
5.481GluLeu: 5.481 ± 0.0
2.055GluMet: 2.055 ± 0.0
1.37GluAsn: 1.37 ± 0.0
1.028GluPro: 1.028 ± 0.0
1.713GluGln: 1.713 ± 0.0
4.454GluArg: 4.454 ± 0.0
2.741GluSer: 2.741 ± 0.0
4.111GluThr: 4.111 ± 0.0
3.426GluVal: 3.426 ± 0.0
0.343GluTrp: 0.343 ± 0.0
2.055GluTyr: 2.055 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.426PheAla: 3.426 ± 0.0
1.37PheCys: 1.37 ± 0.0
3.768PheAsp: 3.768 ± 0.0
2.398PheGlu: 2.398 ± 0.0
2.398PhePhe: 2.398 ± 0.0
3.768PheGly: 3.768 ± 0.0
1.028PheHis: 1.028 ± 0.0
1.028PheIle: 1.028 ± 0.0
0.685PheLys: 0.685 ± 0.0
3.426PheLeu: 3.426 ± 0.0
1.713PheMet: 1.713 ± 0.0
4.796PheAsn: 4.796 ± 0.0
2.398PhePro: 2.398 ± 0.0
1.028PheGln: 1.028 ± 0.0
1.37PheArg: 1.37 ± 0.0
2.055PheSer: 2.055 ± 0.0
2.398PheThr: 2.398 ± 0.0
2.741PheVal: 2.741 ± 0.0
1.37PheTrp: 1.37 ± 0.0
1.028PheTyr: 1.028 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.111GlyAla: 4.111 ± 0.0
0.343GlyCys: 0.343 ± 0.0
3.083GlyAsp: 3.083 ± 0.0
3.426GlyGlu: 3.426 ± 0.0
2.055GlyPhe: 2.055 ± 0.0
6.166GlyGly: 6.166 ± 0.0
0.685GlyHis: 0.685 ± 0.0
4.111GlyIle: 4.111 ± 0.0
5.481GlyLys: 5.481 ± 0.0
5.824GlyLeu: 5.824 ± 0.0
3.426GlyMet: 3.426 ± 0.0
2.398GlyAsn: 2.398 ± 0.0
2.055GlyPro: 2.055 ± 0.0
2.741GlyGln: 2.741 ± 0.0
2.741GlyArg: 2.741 ± 0.0
6.166GlySer: 6.166 ± 0.0
5.824GlyThr: 5.824 ± 0.0
4.454GlyVal: 4.454 ± 0.0
0.343GlyTrp: 0.343 ± 0.0
2.055GlyTyr: 2.055 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
2.741HisAla: 2.741 ± 0.0
0.685HisCys: 0.685 ± 0.0
1.028HisAsp: 1.028 ± 0.0
1.028HisGlu: 1.028 ± 0.0
2.398HisPhe: 2.398 ± 0.0
1.37HisGly: 1.37 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.685HisIle: 0.685 ± 0.0
0.343HisLys: 0.343 ± 0.0
1.37HisLeu: 1.37 ± 0.0
1.713HisMet: 1.713 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.37HisPro: 1.37 ± 0.0
1.028HisGln: 1.028 ± 0.0
1.37HisArg: 1.37 ± 0.0
0.343HisSer: 0.343 ± 0.0
0.343HisThr: 0.343 ± 0.0
1.028HisVal: 1.028 ± 0.0
0.343HisTrp: 0.343 ± 0.0
1.028HisTyr: 1.028 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.481IleAla: 5.481 ± 0.0
1.028IleCys: 1.028 ± 0.0
3.083IleAsp: 3.083 ± 0.0
2.398IleGlu: 2.398 ± 0.0
2.055IlePhe: 2.055 ± 0.0
2.055IleGly: 2.055 ± 0.0
1.713IleHis: 1.713 ± 0.0
4.111IleIle: 4.111 ± 0.0
2.055IleLys: 2.055 ± 0.0
3.426IleLeu: 3.426 ± 0.0
1.713IleMet: 1.713 ± 0.0
1.37IleAsn: 1.37 ± 0.0
4.454IlePro: 4.454 ± 0.0
0.343IleGln: 0.343 ± 0.0
2.741IleArg: 2.741 ± 0.0
3.426IleSer: 3.426 ± 0.0
2.741IleThr: 2.741 ± 0.0
5.481IleVal: 5.481 ± 0.0
2.398IleTrp: 2.398 ± 0.0
3.426IleTyr: 3.426 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.796LysAla: 4.796 ± 0.0
0.343LysCys: 0.343 ± 0.0
2.741LysAsp: 2.741 ± 0.0
2.398LysGlu: 2.398 ± 0.0
3.083LysPhe: 3.083 ± 0.0
2.398LysGly: 2.398 ± 0.0
2.398LysHis: 2.398 ± 0.0
2.398LysIle: 2.398 ± 0.0
2.398LysLys: 2.398 ± 0.0
4.111LysLeu: 4.111 ± 0.0
0.0LysMet: 0.0 ± 0.0
3.083LysAsn: 3.083 ± 0.0
3.083LysPro: 3.083 ± 0.0
2.741LysGln: 2.741 ± 0.0
4.454LysArg: 4.454 ± 0.0
2.741LysSer: 2.741 ± 0.0
3.083LysThr: 3.083 ± 0.0
2.741LysVal: 2.741 ± 0.0
0.0LysTrp: 0.0 ± 0.0
1.713LysTyr: 1.713 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
3.768LeuAla: 3.768 ± 0.0
1.713LeuCys: 1.713 ± 0.0
3.083LeuAsp: 3.083 ± 0.0
4.111LeuGlu: 4.111 ± 0.0
2.055LeuPhe: 2.055 ± 0.0
6.509LeuGly: 6.509 ± 0.0
1.37LeuHis: 1.37 ± 0.0
4.111LeuIle: 4.111 ± 0.0
7.194LeuLys: 7.194 ± 0.0
6.852LeuLeu: 6.852 ± 0.0
1.37LeuMet: 1.37 ± 0.0
5.481LeuAsn: 5.481 ± 0.0
4.454LeuPro: 4.454 ± 0.0
4.454LeuGln: 4.454 ± 0.0
3.768LeuArg: 3.768 ± 0.0
4.111LeuSer: 4.111 ± 0.0
6.166LeuThr: 6.166 ± 0.0
5.481LeuVal: 5.481 ± 0.0
0.343LeuTrp: 0.343 ± 0.0
1.713LeuTyr: 1.713 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.398MetAla: 2.398 ± 0.0
0.685MetCys: 0.685 ± 0.0
1.713MetAsp: 1.713 ± 0.0
1.028MetGlu: 1.028 ± 0.0
0.685MetPhe: 0.685 ± 0.0
1.37MetGly: 1.37 ± 0.0
1.028MetHis: 1.028 ± 0.0
1.713MetIle: 1.713 ± 0.0
1.713MetLys: 1.713 ± 0.0
1.37MetLeu: 1.37 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.37MetAsn: 1.37 ± 0.0
2.741MetPro: 2.741 ± 0.0
2.055MetGln: 2.055 ± 0.0
2.055MetArg: 2.055 ± 0.0
2.398MetSer: 2.398 ± 0.0
2.741MetThr: 2.741 ± 0.0
2.055MetVal: 2.055 ± 0.0
0.343MetTrp: 0.343 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.796AsnAla: 4.796 ± 0.0
0.343AsnCys: 0.343 ± 0.0
1.713AsnAsp: 1.713 ± 0.0
0.685AsnGlu: 0.685 ± 0.0
3.083AsnPhe: 3.083 ± 0.0
2.741AsnGly: 2.741 ± 0.0
0.0AsnHis: 0.0 ± 0.0
1.37AsnIle: 1.37 ± 0.0
1.028AsnLys: 1.028 ± 0.0
6.166AsnLeu: 6.166 ± 0.0
1.713AsnMet: 1.713 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
3.768AsnPro: 3.768 ± 0.0
2.398AsnGln: 2.398 ± 0.0
3.083AsnArg: 3.083 ± 0.0
2.055AsnSer: 2.055 ± 0.0
4.796AsnThr: 4.796 ± 0.0
2.055AsnVal: 2.055 ± 0.0
0.343AsnTrp: 0.343 ± 0.0
1.713AsnTyr: 1.713 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
5.481ProAla: 5.481 ± 0.0
0.343ProCys: 0.343 ± 0.0
1.028ProAsp: 1.028 ± 0.0
3.083ProGlu: 3.083 ± 0.0
1.37ProPhe: 1.37 ± 0.0
3.083ProGly: 3.083 ± 0.0
0.685ProHis: 0.685 ± 0.0
4.111ProIle: 4.111 ± 0.0
2.741ProLys: 2.741 ± 0.0
4.454ProLeu: 4.454 ± 0.0
1.37ProMet: 1.37 ± 0.0
1.713ProAsn: 1.713 ± 0.0
4.111ProPro: 4.111 ± 0.0
0.685ProGln: 0.685 ± 0.0
2.398ProArg: 2.398 ± 0.0
5.481ProSer: 5.481 ± 0.0
5.139ProThr: 5.139 ± 0.0
3.083ProVal: 3.083 ± 0.0
1.37ProTrp: 1.37 ± 0.0
2.741ProTyr: 2.741 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
4.111GlnAla: 4.111 ± 0.0
1.028GlnCys: 1.028 ± 0.0
3.083GlnAsp: 3.083 ± 0.0
2.055GlnGlu: 2.055 ± 0.0
2.398GlnPhe: 2.398 ± 0.0
2.398GlnGly: 2.398 ± 0.0
1.028GlnHis: 1.028 ± 0.0
3.768GlnIle: 3.768 ± 0.0
2.741GlnLys: 2.741 ± 0.0
1.713GlnLeu: 1.713 ± 0.0
2.398GlnMet: 2.398 ± 0.0
1.37GlnAsn: 1.37 ± 0.0
1.028GlnPro: 1.028 ± 0.0
1.713GlnGln: 1.713 ± 0.0
0.685GlnArg: 0.685 ± 0.0
1.713GlnSer: 1.713 ± 0.0
4.454GlnThr: 4.454 ± 0.0
1.028GlnVal: 1.028 ± 0.0
0.343GlnTrp: 0.343 ± 0.0
3.426GlnTyr: 3.426 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.398ArgAla: 2.398 ± 0.0
0.685ArgCys: 0.685 ± 0.0
1.713ArgAsp: 1.713 ± 0.0
2.398ArgGlu: 2.398 ± 0.0
2.055ArgPhe: 2.055 ± 0.0
2.741ArgGly: 2.741 ± 0.0
1.713ArgHis: 1.713 ± 0.0
1.713ArgIle: 1.713 ± 0.0
3.426ArgLys: 3.426 ± 0.0
2.398ArgLeu: 2.398 ± 0.0
1.37ArgMet: 1.37 ± 0.0
1.37ArgAsn: 1.37 ± 0.0
2.398ArgPro: 2.398 ± 0.0
2.741ArgGln: 2.741 ± 0.0
2.398ArgArg: 2.398 ± 0.0
4.796ArgSer: 4.796 ± 0.0
2.741ArgThr: 2.741 ± 0.0
3.768ArgVal: 3.768 ± 0.0
1.028ArgTrp: 1.028 ± 0.0
1.713ArgTyr: 1.713 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
7.879SerAla: 7.879 ± 0.0
1.37SerCys: 1.37 ± 0.0
3.768SerAsp: 3.768 ± 0.0
2.055SerGlu: 2.055 ± 0.0
2.741SerPhe: 2.741 ± 0.0
3.083SerGly: 3.083 ± 0.0
1.028SerHis: 1.028 ± 0.0
4.111SerIle: 4.111 ± 0.0
2.055SerLys: 2.055 ± 0.0
5.139SerLeu: 5.139 ± 0.0
2.741SerMet: 2.741 ± 0.0
3.083SerAsn: 3.083 ± 0.0
1.37SerPro: 1.37 ± 0.0
3.426SerGln: 3.426 ± 0.0
2.055SerArg: 2.055 ± 0.0
3.768SerSer: 3.768 ± 0.0
8.907SerThr: 8.907 ± 0.0
4.454SerVal: 4.454 ± 0.0
1.37SerTrp: 1.37 ± 0.0
5.481SerTyr: 5.481 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
6.852ThrAla: 6.852 ± 0.0
1.028ThrCys: 1.028 ± 0.0
2.741ThrAsp: 2.741 ± 0.0
4.454ThrGlu: 4.454 ± 0.0
2.741ThrPhe: 2.741 ± 0.0
6.852ThrGly: 6.852 ± 0.0
0.685ThrHis: 0.685 ± 0.0
5.139ThrIle: 5.139 ± 0.0
2.398ThrLys: 2.398 ± 0.0
3.768ThrLeu: 3.768 ± 0.0
1.37ThrMet: 1.37 ± 0.0
5.824ThrAsn: 5.824 ± 0.0
5.139ThrPro: 5.139 ± 0.0
3.426ThrGln: 3.426 ± 0.0
3.426ThrArg: 3.426 ± 0.0
6.852ThrSer: 6.852 ± 0.0
8.907ThrThr: 8.907 ± 0.0
5.481ThrVal: 5.481 ± 0.0
0.343ThrTrp: 0.343 ± 0.0
2.398ThrTyr: 2.398 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.796ValAla: 4.796 ± 0.0
1.37ValCys: 1.37 ± 0.0
3.426ValAsp: 3.426 ± 0.0
1.713ValGlu: 1.713 ± 0.0
2.398ValPhe: 2.398 ± 0.0
4.111ValGly: 4.111 ± 0.0
2.398ValHis: 2.398 ± 0.0
4.454ValIle: 4.454 ± 0.0
2.398ValLys: 2.398 ± 0.0
7.194ValLeu: 7.194 ± 0.0
0.685ValMet: 0.685 ± 0.0
2.398ValAsn: 2.398 ± 0.0
4.111ValPro: 4.111 ± 0.0
2.741ValGln: 2.741 ± 0.0
1.713ValArg: 1.713 ± 0.0
6.166ValSer: 6.166 ± 0.0
6.852ValThr: 6.852 ± 0.0
6.166ValVal: 6.166 ± 0.0
0.343ValTrp: 0.343 ± 0.0
1.37ValTyr: 1.37 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.028TrpAla: 1.028 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.713TrpAsp: 1.713 ± 0.0
1.028TrpGlu: 1.028 ± 0.0
0.343TrpPhe: 0.343 ± 0.0
1.37TrpGly: 1.37 ± 0.0
0.343TrpHis: 0.343 ± 0.0
1.028TrpIle: 1.028 ± 0.0
2.055TrpLys: 2.055 ± 0.0
1.028TrpLeu: 1.028 ± 0.0
0.343TrpMet: 0.343 ± 0.0
0.343TrpAsn: 0.343 ± 0.0
0.343TrpPro: 0.343 ± 0.0
0.685TrpGln: 0.685 ± 0.0
0.343TrpArg: 0.343 ± 0.0
0.343TrpSer: 0.343 ± 0.0
0.343TrpThr: 0.343 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.343TrpTrp: 0.343 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.796TyrAla: 4.796 ± 0.0
0.0TyrCys: 0.0 ± 0.0
2.741TyrAsp: 2.741 ± 0.0
2.741TyrGlu: 2.741 ± 0.0
1.713TyrPhe: 1.713 ± 0.0
4.454TyrGly: 4.454 ± 0.0
0.685TyrHis: 0.685 ± 0.0
1.37TyrIle: 1.37 ± 0.0
1.713TyrLys: 1.713 ± 0.0
2.398TyrLeu: 2.398 ± 0.0
0.0TyrMet: 0.0 ± 0.0
2.398TyrAsn: 2.398 ± 0.0
2.741TyrPro: 2.741 ± 0.0
2.741TyrGln: 2.741 ± 0.0
2.055TyrArg: 2.055 ± 0.0
1.028TyrSer: 1.028 ± 0.0
1.713TyrThr: 1.713 ± 0.0
3.083TyrVal: 3.083 ± 0.0
0.343TyrTrp: 0.343 ± 0.0
1.028TyrTyr: 1.028 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2920 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski