Amino acid dipeptide frequency for Hubei picorna-like virus 31

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
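A minimal sketch of how such dipeptide frequencies could be computed is given here. It assumes that dipeptides are counted as overlapping pairs of adjacent residues and that any non-standard letter is mapped to Xaa (X); neither detail is stated on this page. The overlapping-pair assumption is at least consistent with the table: a single protein of 2760 residues contains 2759 such pairs, and 1000/2759 ≈ 0.362‰ matches the smallest non-zero value reported. The function name `dipeptide_permille` and the toy sequence are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
ALPHABET = STANDARD + "X"          # X stands for Xaa (assumption: any other letter)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille = 0.25 %
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: dipeptides are overlapping pairs of adjacent residues,
    pooled over all sequences.
    """
    counts = Counter()
    for seq in sequences:
        # Map every non-standard letter to X before counting
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy example (hypothetical sequence, not the viral polyprotein)
freqs = dipeptide_permille(["MKVLAAGLLK"])
over = sorted(d for d, f in freqs.items() if f > EXPECTED_PERMILLE)
```

In this toy case each of the nine observed dipeptides occurs once, so all nine exceed the uniform expectation and would be shown in red under the colouring rule described above.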

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
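As a short worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the value 3.262‰ is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 3.262                                   # AlaAla, in per mille
ala_ala_fraction = permille_to_fraction(ala_ala)  # about 0.003262
ala_ala_percent = permille_to_percent(ala_ala)    # about 0.3262 %
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰, so the AlaAla value lies above it.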

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; every value has an estimated error of ± 0.0.

      Ala   Cys   Asp   Glu   Phe   Gly   His   Ile   Lys   Leu   Met   Asn   Pro   Gln   Arg   Ser   Thr   Val   Trp   Tyr   Xaa
Ala 3.262 1.087 3.625 2.537 2.175 3.262 0.362 4.712 5.074 3.262 2.175 3.625 2.537 3.625 0.725 3.987 3.625 5.074 1.087 3.262   0.0
Cys  1.45 0.362 1.087 0.725 0.725 2.537 0.362 0.725 1.087 1.812 0.725  1.45   0.0 0.725 1.087 1.087  1.45 2.537   0.0 0.725   0.0
Asp 4.712 1.812 6.162 4.712 3.262 2.537  1.45 4.712 4.712 3.625 2.537 4.349 3.987 1.812 3.625 4.712  1.45 3.625 1.087 1.812   0.0
Glu 4.712 0.725 2.537   2.9 3.625 2.175 0.725   2.9 3.987 4.712 1.812 3.625 1.812 1.812 1.087 4.712 3.262 5.799 0.362 2.537   0.0
Phe 1.812 1.087 3.625 2.175  1.45 2.175 1.087 2.537 3.625 3.987 2.175 1.812 1.812 1.087 2.537 4.712 1.812 4.712 1.087 3.987   0.0
Gly   2.9 1.087 4.349 2.175 3.262 1.812 0.725 3.625   2.9 3.625 0.725 1.812  1.45 1.812 3.625 3.262  1.45 3.987 0.362 4.349   0.0
His 0.725   0.0 0.362 0.725 1.087 0.725  1.45 1.087 0.362 1.087 0.362 0.725 1.087 0.725 0.362 1.812 1.087 3.625 0.362  1.45   0.0
Ile 3.625 1.087 4.349 3.987   2.9 3.262 0.725 4.712 4.349 3.625   0.0 2.175 6.524 1.812 3.987 8.699 2.537 4.712 0.725 1.812   0.0
Lys 3.262 1.812 4.349 5.437   2.9 2.537  1.45 4.349 4.712 5.437 2.175 2.175 5.799 2.537 1.812 2.537 4.712 3.987 1.812 3.262   0.0
Leu 7.611 2.175 4.712 6.524 4.349 2.175 0.725 4.349 5.437 7.974   2.9   2.9 3.625   2.9 5.437 3.262 3.625 4.712   0.0 2.175   0.0
Met 1.087 0.725  1.45 1.812 0.725 2.537 0.362  1.45 1.812   2.9 0.362 0.725  1.45 2.175 0.362 1.812 0.362 0.725 1.087  1.45   0.0
Asn 3.262 1.087 2.175  1.45 2.537   2.9 0.725 2.175 2.537   2.9 1.087 1.812 4.712 2.175 2.175 2.175 4.712 3.625 0.725 2.537   0.0
Pro  1.45 0.362   2.9  1.45 2.537 2.175 1.087 3.262 3.987 4.349 1.087 3.625 3.625  1.45   2.9 3.987 4.349 3.987 1.087   2.9   0.0
Gln 3.262   0.0 1.812 2.537  1.45  1.45 2.175   2.9 3.262 4.349 1.087 1.087 2.175 0.725   2.9 2.175  1.45 3.625 0.362 1.812   0.0
Arg  1.45 0.725 6.524 2.175 2.537 1.812 0.725 3.262 3.262 3.987 1.087 3.262 1.812 1.087 3.625  1.45   2.9  1.45 1.087   2.9   0.0
Ser 3.987 0.725 4.712   2.9 3.987   2.9 0.362 4.349 6.162 5.437  1.45 1.812   2.9 4.349 1.812 3.262 4.349 5.799 1.087 3.262   0.0
Thr 1.812 1.087 2.537 3.262 3.625   2.9 1.812 6.524 3.262 3.625   0.0 3.625 2.537 2.537 1.812 3.987 2.175 3.625 0.362 1.087   0.0
Val 5.074 1.087 6.524 5.074 3.987 5.799  1.45   2.9 3.262 5.437  1.45 3.987 2.537 2.537   2.9 4.712 4.712 6.524 0.725 3.625   0.0
Trp 0.362 0.725 0.362   0.0 0.725 0.362 0.725 2.175 0.362  1.45 0.725 0.362 0.362 1.087 1.087  1.45 0.725 0.362 0.362  1.45   0.0
Tyr 3.262   2.9 2.537 3.262  1.45 3.262 0.725   2.9   2.9 3.987 1.087   2.9 1.812 3.625 3.262 1.812  1.45 1.812  1.45   2.9   0.0
Xaa   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0

Statistics based on 1 protein (2760 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
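The sketch below illustrates one way a protein-level bootstrap could work; it is an illustration under stated assumptions, not the database's actual procedure. It assumes that whole proteins are resampled with replacement, that the statistic is recomputed for each of the 100 replicates, and that the error is the standard deviation across replicates. The names `bootstrap_error` and `pair_permille` are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair="AA"):
    """Frequency (per mille) of one dipeptide over overlapping residue pairs."""
    total = sum(max(len(p) - 1, 0) for p in proteins)
    hits = sum(
        1 for p in proteins for i in range(len(p) - 1) if p[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, stat, n_rounds=100, seed=0):
    """Standard deviation of `stat` over protein-level bootstrap replicates.

    Assumption: each replicate draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(stat(sample))
    return statistics.pstdev(estimates)


# With a single protein every replicate is identical, so the spread is zero
single = bootstrap_error(["MKAALLAAK"], pair_permille)

# With several different proteins the replicates differ and the spread is positive
several = bootstrap_error(["AAAA", "MKVL"], pair_permille)
```

Under these assumptions, a proteome consisting of one protein always yields an error of exactly zero, which would be consistent with the ± 0.0 reported for every dipeptide here.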

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski