Amino acid dipepetide frequency for Hubei picorna-like virus 68

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.383AlaAla: 6.383 ± 0.0
0.608AlaCys: 0.608 ± 0.0
3.04AlaAsp: 3.04 ± 0.0
4.255AlaGlu: 4.255 ± 0.0
3.343AlaPhe: 3.343 ± 0.0
4.863AlaGly: 4.863 ± 0.0
0.608AlaHis: 0.608 ± 0.0
4.255AlaIle: 4.255 ± 0.0
4.559AlaLys: 4.559 ± 0.0
7.903AlaLeu: 7.903 ± 0.0
2.128AlaMet: 2.128 ± 0.0
2.736AlaAsn: 2.736 ± 0.0
3.04AlaPro: 3.04 ± 0.0
3.647AlaGln: 3.647 ± 0.0
3.951AlaArg: 3.951 ± 0.0
5.167AlaSer: 5.167 ± 0.0
7.599AlaThr: 7.599 ± 0.0
4.559AlaVal: 4.559 ± 0.0
0.304AlaTrp: 0.304 ± 0.0
2.128AlaTyr: 2.128 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.824CysAla: 1.824 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.912CysAsp: 0.912 ± 0.0
1.824CysGlu: 1.824 ± 0.0
0.912CysPhe: 0.912 ± 0.0
1.824CysGly: 1.824 ± 0.0
0.304CysHis: 0.304 ± 0.0
1.216CysIle: 1.216 ± 0.0
0.304CysLys: 0.304 ± 0.0
1.216CysLeu: 1.216 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.912CysAsn: 0.912 ± 0.0
0.912CysPro: 0.912 ± 0.0
0.304CysGln: 0.304 ± 0.0
0.304CysArg: 0.304 ± 0.0
0.608CysSer: 0.608 ± 0.0
1.216CysThr: 1.216 ± 0.0
0.608CysVal: 0.608 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.304CysTyr: 0.304 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.951AspAla: 3.951 ± 0.0
1.824AspCys: 1.824 ± 0.0
2.736AspAsp: 2.736 ± 0.0
4.863AspGlu: 4.863 ± 0.0
2.128AspPhe: 2.128 ± 0.0
3.647AspGly: 3.647 ± 0.0
0.912AspHis: 0.912 ± 0.0
3.951AspIle: 3.951 ± 0.0
3.647AspLys: 3.647 ± 0.0
6.383AspLeu: 6.383 ± 0.0
1.216AspMet: 1.216 ± 0.0
4.863AspAsn: 4.863 ± 0.0
3.04AspPro: 3.04 ± 0.0
1.824AspGln: 1.824 ± 0.0
1.52AspArg: 1.52 ± 0.0
3.647AspSer: 3.647 ± 0.0
3.04AspThr: 3.04 ± 0.0
3.951AspVal: 3.951 ± 0.0
0.912AspTrp: 0.912 ± 0.0
2.128AspTyr: 2.128 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.04GluAla: 3.04 ± 0.0
1.216GluCys: 1.216 ± 0.0
1.216GluAsp: 1.216 ± 0.0
3.04GluGlu: 3.04 ± 0.0
1.824GluPhe: 1.824 ± 0.0
1.52GluGly: 1.52 ± 0.0
0.0GluHis: 0.0 ± 0.0
3.951GluIle: 3.951 ± 0.0
5.167GluLys: 5.167 ± 0.0
6.687GluLeu: 6.687 ± 0.0
1.824GluMet: 1.824 ± 0.0
2.128GluAsn: 2.128 ± 0.0
2.736GluPro: 2.736 ± 0.0
2.432GluGln: 2.432 ± 0.0
3.647GluArg: 3.647 ± 0.0
2.432GluSer: 2.432 ± 0.0
3.951GluThr: 3.951 ± 0.0
2.128GluVal: 2.128 ± 0.0
1.52GluTrp: 1.52 ± 0.0
1.52GluTyr: 1.52 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.647PheAla: 3.647 ± 0.0
1.216PheCys: 1.216 ± 0.0
2.128PheAsp: 2.128 ± 0.0
1.52PheGlu: 1.52 ± 0.0
2.128PhePhe: 2.128 ± 0.0
2.736PheGly: 2.736 ± 0.0
0.304PheHis: 0.304 ± 0.0
2.432PheIle: 2.432 ± 0.0
1.216PheLys: 1.216 ± 0.0
3.951PheLeu: 3.951 ± 0.0
0.608PheMet: 0.608 ± 0.0
1.52PheAsn: 1.52 ± 0.0
0.912PhePro: 0.912 ± 0.0
1.216PheGln: 1.216 ± 0.0
1.824PheArg: 1.824 ± 0.0
3.951PheSer: 3.951 ± 0.0
4.863PheThr: 4.863 ± 0.0
3.647PheVal: 3.647 ± 0.0
0.912PheTrp: 0.912 ± 0.0
2.128PheTyr: 2.128 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.863GlyAla: 4.863 ± 0.0
0.304GlyCys: 0.304 ± 0.0
4.559GlyAsp: 4.559 ± 0.0
2.128GlyGlu: 2.128 ± 0.0
1.824GlyPhe: 1.824 ± 0.0
3.647GlyGly: 3.647 ± 0.0
1.52GlyHis: 1.52 ± 0.0
3.647GlyIle: 3.647 ± 0.0
3.343GlyLys: 3.343 ± 0.0
6.383GlyLeu: 6.383 ± 0.0
1.216GlyMet: 1.216 ± 0.0
2.432GlyAsn: 2.432 ± 0.0
3.04GlyPro: 3.04 ± 0.0
1.216GlyGln: 1.216 ± 0.0
2.736GlyArg: 2.736 ± 0.0
3.04GlySer: 3.04 ± 0.0
3.647GlyThr: 3.647 ± 0.0
3.04GlyVal: 3.04 ± 0.0
0.304GlyTrp: 0.304 ± 0.0
2.432GlyTyr: 2.432 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.608HisAla: 0.608 ± 0.0
0.912HisCys: 0.912 ± 0.0
1.824HisAsp: 1.824 ± 0.0
1.216HisGlu: 1.216 ± 0.0
0.304HisPhe: 0.304 ± 0.0
1.52HisGly: 1.52 ± 0.0
0.304HisHis: 0.304 ± 0.0
0.912HisIle: 0.912 ± 0.0
1.824HisLys: 1.824 ± 0.0
2.128HisLeu: 2.128 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.304HisAsn: 0.304 ± 0.0
1.824HisPro: 1.824 ± 0.0
1.824HisGln: 1.824 ± 0.0
1.216HisArg: 1.216 ± 0.0
1.216HisSer: 1.216 ± 0.0
2.128HisThr: 2.128 ± 0.0
0.912HisVal: 0.912 ± 0.0
0.608HisTrp: 0.608 ± 0.0
0.608HisTyr: 0.608 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.255IleAla: 4.255 ± 0.0
0.0IleCys: 0.0 ± 0.0
3.647IleAsp: 3.647 ± 0.0
1.824IleGlu: 1.824 ± 0.0
2.432IlePhe: 2.432 ± 0.0
3.04IleGly: 3.04 ± 0.0
0.912IleHis: 0.912 ± 0.0
4.255IleIle: 4.255 ± 0.0
3.343IleLys: 3.343 ± 0.0
4.255IleLeu: 4.255 ± 0.0
0.304IleMet: 0.304 ± 0.0
3.951IleAsn: 3.951 ± 0.0
3.951IlePro: 3.951 ± 0.0
2.736IleGln: 2.736 ± 0.0
4.559IleArg: 4.559 ± 0.0
4.559IleSer: 4.559 ± 0.0
6.079IleThr: 6.079 ± 0.0
4.255IleVal: 4.255 ± 0.0
1.52IleTrp: 1.52 ± 0.0
3.343IleTyr: 3.343 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
1.52LysAla: 1.52 ± 0.0
0.304LysCys: 0.304 ± 0.0
2.736LysAsp: 2.736 ± 0.0
1.824LysGlu: 1.824 ± 0.0
2.432LysPhe: 2.432 ± 0.0
2.128LysGly: 2.128 ± 0.0
1.52LysHis: 1.52 ± 0.0
3.04LysIle: 3.04 ± 0.0
3.647LysLys: 3.647 ± 0.0
5.775LysLeu: 5.775 ± 0.0
1.52LysMet: 1.52 ± 0.0
3.647LysAsn: 3.647 ± 0.0
4.255LysPro: 4.255 ± 0.0
1.216LysGln: 1.216 ± 0.0
2.128LysArg: 2.128 ± 0.0
3.04LysSer: 3.04 ± 0.0
4.559LysThr: 4.559 ± 0.0
2.432LysVal: 2.432 ± 0.0
0.304LysTrp: 0.304 ± 0.0
3.343LysTyr: 3.343 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.255LeuAla: 4.255 ± 0.0
1.216LeuCys: 1.216 ± 0.0
7.599LeuAsp: 7.599 ± 0.0
6.079LeuGlu: 6.079 ± 0.0
3.647LeuPhe: 3.647 ± 0.0
2.432LeuGly: 2.432 ± 0.0
3.647LeuHis: 3.647 ± 0.0
5.775LeuIle: 5.775 ± 0.0
5.775LeuLys: 5.775 ± 0.0
7.903LeuLeu: 7.903 ± 0.0
1.216LeuMet: 1.216 ± 0.0
4.559LeuAsn: 4.559 ± 0.0
6.079LeuPro: 6.079 ± 0.0
4.559LeuGln: 4.559 ± 0.0
4.863LeuArg: 4.863 ± 0.0
5.775LeuSer: 5.775 ± 0.0
7.295LeuThr: 7.295 ± 0.0
5.775LeuVal: 5.775 ± 0.0
0.608LeuTrp: 0.608 ± 0.0
4.255LeuTyr: 4.255 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.432MetAla: 2.432 ± 0.0
0.304MetCys: 0.304 ± 0.0
0.608MetAsp: 0.608 ± 0.0
1.52MetGlu: 1.52 ± 0.0
0.912MetPhe: 0.912 ± 0.0
1.216MetGly: 1.216 ± 0.0
1.824MetHis: 1.824 ± 0.0
1.216MetIle: 1.216 ± 0.0
1.216MetLys: 1.216 ± 0.0
1.824MetLeu: 1.824 ± 0.0
0.912MetMet: 0.912 ± 0.0
1.824MetAsn: 1.824 ± 0.0
0.912MetPro: 0.912 ± 0.0
0.304MetGln: 0.304 ± 0.0
1.216MetArg: 1.216 ± 0.0
1.216MetSer: 1.216 ± 0.0
0.608MetThr: 0.608 ± 0.0
1.216MetVal: 1.216 ± 0.0
0.304MetTrp: 0.304 ± 0.0
0.304MetTyr: 0.304 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.559AsnAla: 4.559 ± 0.0
1.216AsnCys: 1.216 ± 0.0
3.04AsnAsp: 3.04 ± 0.0
2.432AsnGlu: 2.432 ± 0.0
2.432AsnPhe: 2.432 ± 0.0
2.432AsnGly: 2.432 ± 0.0
1.52AsnHis: 1.52 ± 0.0
4.255AsnIle: 4.255 ± 0.0
1.824AsnLys: 1.824 ± 0.0
3.951AsnLeu: 3.951 ± 0.0
1.52AsnMet: 1.52 ± 0.0
3.951AsnAsn: 3.951 ± 0.0
2.128AsnPro: 2.128 ± 0.0
1.52AsnGln: 1.52 ± 0.0
2.432AsnArg: 2.432 ± 0.0
2.128AsnSer: 2.128 ± 0.0
2.432AsnThr: 2.432 ± 0.0
4.559AsnVal: 4.559 ± 0.0
0.304AsnTrp: 0.304 ± 0.0
3.343AsnTyr: 3.343 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
4.255ProAla: 4.255 ± 0.0
0.912ProCys: 0.912 ± 0.0
3.04ProAsp: 3.04 ± 0.0
2.432ProGlu: 2.432 ± 0.0
1.52ProPhe: 1.52 ± 0.0
3.343ProGly: 3.343 ± 0.0
0.608ProHis: 0.608 ± 0.0
6.991ProIle: 6.991 ± 0.0
1.824ProLys: 1.824 ± 0.0
3.647ProLeu: 3.647 ± 0.0
0.912ProMet: 0.912 ± 0.0
3.04ProAsn: 3.04 ± 0.0
3.04ProPro: 3.04 ± 0.0
1.52ProGln: 1.52 ± 0.0
0.304ProArg: 0.304 ± 0.0
3.04ProSer: 3.04 ± 0.0
4.863ProThr: 4.863 ± 0.0
4.255ProVal: 4.255 ± 0.0
0.912ProTrp: 0.912 ± 0.0
2.736ProTyr: 2.736 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.951GlnAla: 3.951 ± 0.0
0.304GlnCys: 0.304 ± 0.0
1.52GlnAsp: 1.52 ± 0.0
2.128GlnGlu: 2.128 ± 0.0
1.216GlnPhe: 1.216 ± 0.0
0.304GlnGly: 0.304 ± 0.0
0.608GlnHis: 0.608 ± 0.0
1.824GlnIle: 1.824 ± 0.0
2.128GlnLys: 2.128 ± 0.0
3.04GlnLeu: 3.04 ± 0.0
0.912GlnMet: 0.912 ± 0.0
0.912GlnAsn: 0.912 ± 0.0
3.343GlnPro: 3.343 ± 0.0
2.736GlnGln: 2.736 ± 0.0
2.432GlnArg: 2.432 ± 0.0
2.736GlnSer: 2.736 ± 0.0
1.216GlnThr: 1.216 ± 0.0
1.824GlnVal: 1.824 ± 0.0
0.912GlnTrp: 0.912 ± 0.0
1.216GlnTyr: 1.216 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.951ArgAla: 3.951 ± 0.0
0.304ArgCys: 0.304 ± 0.0
3.343ArgAsp: 3.343 ± 0.0
1.824ArgGlu: 1.824 ± 0.0
1.824ArgPhe: 1.824 ± 0.0
2.432ArgGly: 2.432 ± 0.0
1.52ArgHis: 1.52 ± 0.0
1.824ArgIle: 1.824 ± 0.0
2.736ArgLys: 2.736 ± 0.0
5.471ArgLeu: 5.471 ± 0.0
0.912ArgMet: 0.912 ± 0.0
1.216ArgAsn: 1.216 ± 0.0
1.52ArgPro: 1.52 ± 0.0
1.216ArgGln: 1.216 ± 0.0
2.432ArgArg: 2.432 ± 0.0
3.647ArgSer: 3.647 ± 0.0
3.951ArgThr: 3.951 ± 0.0
4.255ArgVal: 4.255 ± 0.0
0.608ArgTrp: 0.608 ± 0.0
1.216ArgTyr: 1.216 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
7.599SerAla: 7.599 ± 0.0
1.52SerCys: 1.52 ± 0.0
3.647SerAsp: 3.647 ± 0.0
3.04SerGlu: 3.04 ± 0.0
3.343SerPhe: 3.343 ± 0.0
4.559SerGly: 4.559 ± 0.0
1.52SerHis: 1.52 ± 0.0
4.255SerIle: 4.255 ± 0.0
2.432SerLys: 2.432 ± 0.0
5.471SerLeu: 5.471 ± 0.0
1.824SerMet: 1.824 ± 0.0
2.736SerAsn: 2.736 ± 0.0
3.04SerPro: 3.04 ± 0.0
0.304SerGln: 0.304 ± 0.0
2.736SerArg: 2.736 ± 0.0
5.775SerSer: 5.775 ± 0.0
5.471SerThr: 5.471 ± 0.0
5.167SerVal: 5.167 ± 0.0
0.304SerTrp: 0.304 ± 0.0
0.608SerTyr: 0.608 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.951ThrAla: 3.951 ± 0.0
1.216ThrCys: 1.216 ± 0.0
3.04ThrAsp: 3.04 ± 0.0
3.04ThrGlu: 3.04 ± 0.0
3.951ThrPhe: 3.951 ± 0.0
4.863ThrGly: 4.863 ± 0.0
1.216ThrHis: 1.216 ± 0.0
3.647ThrIle: 3.647 ± 0.0
2.432ThrLys: 2.432 ± 0.0
5.471ThrLeu: 5.471 ± 0.0
3.343ThrMet: 3.343 ± 0.0
4.863ThrAsn: 4.863 ± 0.0
6.687ThrPro: 6.687 ± 0.0
3.04ThrGln: 3.04 ± 0.0
3.647ThrArg: 3.647 ± 0.0
5.775ThrSer: 5.775 ± 0.0
5.775ThrThr: 5.775 ± 0.0
4.863ThrVal: 4.863 ± 0.0
0.608ThrTrp: 0.608 ± 0.0
4.559ThrTyr: 4.559 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
6.079ValAla: 6.079 ± 0.0
1.52ValCys: 1.52 ± 0.0
6.079ValAsp: 6.079 ± 0.0
4.863ValGlu: 4.863 ± 0.0
2.736ValPhe: 2.736 ± 0.0
6.079ValGly: 6.079 ± 0.0
1.52ValHis: 1.52 ± 0.0
3.343ValIle: 3.343 ± 0.0
1.824ValLys: 1.824 ± 0.0
6.079ValLeu: 6.079 ± 0.0
0.0ValMet: 0.0 ± 0.0
2.128ValAsn: 2.128 ± 0.0
2.128ValPro: 2.128 ± 0.0
1.824ValGln: 1.824 ± 0.0
2.128ValArg: 2.128 ± 0.0
4.559ValSer: 4.559 ± 0.0
4.255ValThr: 4.255 ± 0.0
5.167ValVal: 5.167 ± 0.0
1.52ValTrp: 1.52 ± 0.0
2.128ValTyr: 2.128 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.912TrpAla: 0.912 ± 0.0
0.304TrpCys: 0.304 ± 0.0
2.128TrpAsp: 2.128 ± 0.0
0.304TrpGlu: 0.304 ± 0.0
0.304TrpPhe: 0.304 ± 0.0
0.912TrpGly: 0.912 ± 0.0
0.304TrpHis: 0.304 ± 0.0
0.304TrpIle: 0.304 ± 0.0
1.52TrpLys: 1.52 ± 0.0
0.304TrpLeu: 0.304 ± 0.0
0.608TrpMet: 0.608 ± 0.0
1.824TrpAsn: 1.824 ± 0.0
0.304TrpPro: 0.304 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.912TrpSer: 0.912 ± 0.0
0.912TrpThr: 0.912 ± 0.0
0.912TrpVal: 0.912 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.608TrpTyr: 0.608 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.04TyrAla: 3.04 ± 0.0
0.304TyrCys: 0.304 ± 0.0
3.343TyrAsp: 3.343 ± 0.0
1.824TyrGlu: 1.824 ± 0.0
3.647TyrPhe: 3.647 ± 0.0
1.824TyrGly: 1.824 ± 0.0
1.52TyrHis: 1.52 ± 0.0
2.128TyrIle: 2.128 ± 0.0
0.608TyrLys: 0.608 ± 0.0
5.471TyrLeu: 5.471 ± 0.0
0.608TyrMet: 0.608 ± 0.0
2.432TyrAsn: 2.432 ± 0.0
0.304TyrPro: 0.304 ± 0.0
1.824TyrGln: 1.824 ± 0.0
2.432TyrArg: 2.432 ± 0.0
2.128TyrSer: 2.128 ± 0.0
2.128TyrThr: 2.128 ± 0.0
2.432TyrVal: 2.432 ± 0.0
0.912TyrTrp: 0.912 ± 0.0
0.912TyrTyr: 0.912 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3291 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski