Amino acid dipeptide frequency for Hubei picorna-like virus 11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
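
The calculation described above can be sketched as follows. This is a minimal illustration, not the code used to produce this page: the function name `dipeptide_per_mille`, the use of one-letter codes, and the choice to count overlapping pairs are all assumptions.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(sequence):
    """Return {dipeptide: frequency in per mille} for one sequence.

    Assumption: dipeptides are counted as overlapping pairs, so a
    sequence of n residues contributes n - 1 dipeptides.
    """
    # Anything outside the 20 standard letters is mapped to X (Xaa).
    seq = "".join(c if c in AMINO_ACIDS else "X" for c in sequence.upper())
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    counts = Counter(pairs)
    total = len(pairs)
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy sequence, not taken from the proteome.
freqs = dipeptide_per_mille("MAVVLAVV")

# Expected value if all 400 standard dipeptides were equally likely.
uniform = 1000.0 / 400
```

The result has 441 entries (21 × 21), matching the layout of the table, and the values sum to 1000‰.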

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
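
As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and an approximate count, and compares it with the 2.5‰ expectation. The helper names are hypothetical, and the pair count of 2952 assumes overlapping pairs within a single 2953-residue protein, which the page does not state explicitly.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=2.5):
    """Label a per mille value relative to the uniform expectation.

    Assumption: 2.5 per mille (1/400) is the threshold between
    over- and underrepresented dipeptides.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Assumed number of overlapping dipeptides in one 2953-residue protein.
n_pairs = 2953 - 1

# ValVal is listed at 10.501 per mille in the table.
approx_count = per_mille_to_fraction(10.501) * n_pairs
```

Under these assumptions `approx_count` comes out close to 31, and the smallest non-zero table value, 0.339‰, corresponds to roughly one occurrence.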

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second. Every entry has an estimated error of ± 0.0.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  4.743  0.678  3.049  2.371  5.081  6.775  1.355  4.065  3.726  4.743  2.033  3.049  3.049  4.404   2.71  6.436  4.404  5.759    0.0  2.371    0.0
Cys  0.678    0.0    0.0  0.678  0.678  1.355    0.0    0.0  0.339  1.016  0.678    0.0  1.016  0.339  1.016  1.355  0.339   2.71    0.0  0.678    0.0
Asp  2.371  1.016   5.42   5.42  4.065  4.404  1.694  2.371  1.016  5.759  1.016  4.065  3.726  1.694  3.726  5.759  2.371  6.436  1.355   2.71    0.0
Glu  6.098  0.339  2.371  3.726  3.388  1.355  1.694  2.033  2.033  4.743  1.355  1.355  2.033  1.694  3.388  4.743  1.355  4.404  1.694  2.371    0.0
Phe  3.049  1.355  5.081  3.726  3.388  4.065    0.0  1.694  1.694  3.388   2.71  1.355  0.339  1.355  4.065  4.743  3.726  3.726  0.339  1.694    0.0
Gly  4.404  1.694  5.759  3.049  3.049  3.388  2.033   2.71  4.065   5.42   2.71  2.033   2.71   2.71  3.726  4.065  3.388  4.743  1.016  3.049    0.0
His  1.016    0.0  0.339  0.339   2.71  2.033  0.339  1.355  0.339  1.355  0.678  0.339  2.371  1.355  1.016  0.678  1.016  1.694  0.339  1.016    0.0
Ile  2.371  0.339  2.371  3.049  1.694   2.71  1.016   2.71  1.355  3.388  2.033  2.371  4.404   2.71  2.033  3.388   2.71  2.371  0.339  0.339    0.0
Lys  1.016  0.339  3.726  1.694  2.033  2.371  0.678  2.033  1.694  3.726  1.694  1.016  0.678  1.016  2.033  2.371  4.743   2.71  0.678  1.694    0.0
Leu   5.42  0.678  4.065  4.404   2.71  3.388  1.694  2.371  4.065  6.775  2.033  4.743  6.436  4.065  5.081  5.759  6.098  8.808  0.678  1.694    0.0
Met  2.033    0.0  2.371  2.033  1.016  0.678  1.016  1.694  1.016  4.743  1.355  1.016  1.355    0.0  4.743  1.694  1.694   2.71  0.678  0.339    0.0
Asn  1.694  0.339  3.388  3.049  1.355  3.049  0.339  2.033  0.678  2.371  1.016  2.033  1.694  0.678  3.049  2.033  2.371  3.726  1.016  1.694    0.0
Pro  3.388  1.694  4.743  2.371  3.049  4.065  0.339  1.355  1.694  5.081  2.033  1.694  6.098   2.71  2.371  3.049  3.049  7.114    0.0  1.355    0.0
Gln   2.71  1.355  0.678  1.016  1.016  4.743  1.694  1.355  1.016  3.388    0.0  1.016  2.033  2.371  1.694  3.388   2.71  3.388  1.016  1.016    0.0
Arg   5.42  0.339  5.081  3.049  3.726  3.388  1.694  2.033  2.033  6.098   2.71  2.371  3.726  2.371  3.049  4.743  2.371  5.081  0.339  1.355    0.0
Ser  5.081  1.016  3.726  3.726  4.065  6.436  1.355  4.404  3.049   5.42   2.71  3.049  3.388  2.033  3.726  6.775  4.404   5.42  1.355  3.388    0.0
Thr  4.743  0.678  4.065  2.371  2.033  3.049  1.016  3.049  3.388   5.42  1.355  0.678  3.049  1.016  3.388  4.743  2.371   8.13    0.0  3.049    0.0
Val 10.501    0.0  6.098  4.065  5.759  6.436  2.033   2.71  3.388  4.065  3.049  4.065  7.114   2.71  6.098  6.098  4.404 10.501  1.016   2.71    0.0
Trp  1.016    0.0  0.339  0.678    0.0  0.678  0.339  2.033  0.678  0.678    0.0    0.0  0.339  0.339  1.694  0.678  1.694  1.016  0.339  0.339    0.0
Tyr  3.049  1.016  3.726  1.355  1.016  1.355    0.0  1.694  0.678  3.388  0.678  1.355  1.694  2.033  2.371  2.033  2.371  2.371  0.678  1.355    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (2953 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
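
A protein-level bootstrap of this kind can be sketched as below. This is an illustrative assumption about the procedure, not the database's actual implementation: the function names, the use of the standard deviation as the error measure, and the toy sequences are all hypothetical.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes.

    Each round draws len(proteins) whole proteins with replacement,
    i.e. resampling happens at the protein level, not the residue level.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)


# With a single protein every resample is identical, so the error is zero.
single = bootstrap_error(["MAVVLAVV"], "VV")

# With several (toy) proteins the resamples differ and the error is non-zero.
several = bootstrap_error(["MAVVLAVV", "MKKLLP", "VVVVAA"], "VV")
```

Under this reading, a proteome with only one protein always yields an error of 0.0, which is consistent with the ± 0.0 reported for every dipeptide here.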

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski