Amino acid dipepetide frequency for Hubei picorna-like virus 43

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.191AlaAla: 3.191 ± 0.0
0.638AlaCys: 0.638 ± 0.0
4.467AlaAsp: 4.467 ± 0.0
3.191AlaGlu: 3.191 ± 0.0
3.191AlaPhe: 3.191 ± 0.0
3.829AlaGly: 3.829 ± 0.0
1.276AlaHis: 1.276 ± 0.0
5.424AlaIle: 5.424 ± 0.0
1.595AlaLys: 1.595 ± 0.0
5.105AlaLeu: 5.105 ± 0.0
1.595AlaMet: 1.595 ± 0.0
1.914AlaAsn: 1.914 ± 0.0
2.234AlaPro: 2.234 ± 0.0
2.234AlaGln: 2.234 ± 0.0
3.51AlaArg: 3.51 ± 0.0
2.872AlaSer: 2.872 ± 0.0
4.467AlaThr: 4.467 ± 0.0
4.786AlaVal: 4.786 ± 0.0
0.638AlaTrp: 0.638 ± 0.0
1.914AlaTyr: 1.914 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.595CysAla: 1.595 ± 0.0
0.319CysCys: 0.319 ± 0.0
0.957CysAsp: 0.957 ± 0.0
1.595CysGlu: 1.595 ± 0.0
0.638CysPhe: 0.638 ± 0.0
0.957CysGly: 0.957 ± 0.0
0.638CysHis: 0.638 ± 0.0
2.553CysIle: 2.553 ± 0.0
0.638CysLys: 0.638 ± 0.0
1.914CysLeu: 1.914 ± 0.0
0.957CysMet: 0.957 ± 0.0
0.319CysAsn: 0.319 ± 0.0
0.638CysPro: 0.638 ± 0.0
0.638CysGln: 0.638 ± 0.0
1.595CysArg: 1.595 ± 0.0
2.234CysSer: 2.234 ± 0.0
0.638CysThr: 0.638 ± 0.0
3.191CysVal: 3.191 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.319CysTyr: 0.319 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.191AspAla: 3.191 ± 0.0
0.319AspCys: 0.319 ± 0.0
2.553AspAsp: 2.553 ± 0.0
5.424AspGlu: 5.424 ± 0.0
1.595AspPhe: 1.595 ± 0.0
1.595AspGly: 1.595 ± 0.0
1.595AspHis: 1.595 ± 0.0
3.829AspIle: 3.829 ± 0.0
2.553AspLys: 2.553 ± 0.0
7.658AspLeu: 7.658 ± 0.0
2.553AspMet: 2.553 ± 0.0
1.276AspAsn: 1.276 ± 0.0
3.829AspPro: 3.829 ± 0.0
0.957AspGln: 0.957 ± 0.0
3.191AspArg: 3.191 ± 0.0
2.872AspSer: 2.872 ± 0.0
2.872AspThr: 2.872 ± 0.0
5.105AspVal: 5.105 ± 0.0
0.319AspTrp: 0.319 ± 0.0
1.276AspTyr: 1.276 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.467GluAla: 4.467 ± 0.0
1.595GluCys: 1.595 ± 0.0
3.191GluAsp: 3.191 ± 0.0
2.234GluGlu: 2.234 ± 0.0
3.191GluPhe: 3.191 ± 0.0
3.829GluGly: 3.829 ± 0.0
0.638GluHis: 0.638 ± 0.0
4.467GluIle: 4.467 ± 0.0
5.424GluLys: 5.424 ± 0.0
4.148GluLeu: 4.148 ± 0.0
1.595GluMet: 1.595 ± 0.0
1.914GluAsn: 1.914 ± 0.0
3.191GluPro: 3.191 ± 0.0
2.234GluGln: 2.234 ± 0.0
4.148GluArg: 4.148 ± 0.0
4.786GluSer: 4.786 ± 0.0
4.148GluThr: 4.148 ± 0.0
5.424GluVal: 5.424 ± 0.0
0.638GluTrp: 0.638 ± 0.0
3.51GluTyr: 3.51 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.467PheAla: 4.467 ± 0.0
1.276PheCys: 1.276 ± 0.0
2.234PheAsp: 2.234 ± 0.0
3.829PheGlu: 3.829 ± 0.0
0.957PhePhe: 0.957 ± 0.0
4.148PheGly: 4.148 ± 0.0
1.276PheHis: 1.276 ± 0.0
1.276PheIle: 1.276 ± 0.0
2.872PheLys: 2.872 ± 0.0
2.234PheLeu: 2.234 ± 0.0
0.957PheMet: 0.957 ± 0.0
1.914PheAsn: 1.914 ± 0.0
1.595PhePro: 1.595 ± 0.0
2.553PheGln: 2.553 ± 0.0
2.234PheArg: 2.234 ± 0.0
3.191PheSer: 3.191 ± 0.0
2.234PheThr: 2.234 ± 0.0
3.191PheVal: 3.191 ± 0.0
1.276PheTrp: 1.276 ± 0.0
1.914PheTyr: 1.914 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.872GlyAla: 2.872 ± 0.0
2.234GlyCys: 2.234 ± 0.0
2.872GlyAsp: 2.872 ± 0.0
4.786GlyGlu: 4.786 ± 0.0
1.914GlyPhe: 1.914 ± 0.0
1.276GlyGly: 1.276 ± 0.0
0.319GlyHis: 0.319 ± 0.0
3.51GlyIle: 3.51 ± 0.0
1.914GlyLys: 1.914 ± 0.0
4.467GlyLeu: 4.467 ± 0.0
1.276GlyMet: 1.276 ± 0.0
3.829GlyAsn: 3.829 ± 0.0
4.467GlyPro: 4.467 ± 0.0
1.276GlyGln: 1.276 ± 0.0
2.553GlyArg: 2.553 ± 0.0
3.191GlySer: 3.191 ± 0.0
4.148GlyThr: 4.148 ± 0.0
8.934GlyVal: 8.934 ± 0.0
1.595GlyTrp: 1.595 ± 0.0
2.553GlyTyr: 2.553 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.957HisAla: 0.957 ± 0.0
0.319HisCys: 0.319 ± 0.0
0.957HisAsp: 0.957 ± 0.0
1.914HisGlu: 1.914 ± 0.0
1.914HisPhe: 1.914 ± 0.0
0.957HisGly: 0.957 ± 0.0
0.319HisHis: 0.319 ± 0.0
1.276HisIle: 1.276 ± 0.0
0.638HisLys: 0.638 ± 0.0
1.914HisLeu: 1.914 ± 0.0
0.319HisMet: 0.319 ± 0.0
0.638HisAsn: 0.638 ± 0.0
1.276HisPro: 1.276 ± 0.0
0.957HisGln: 0.957 ± 0.0
0.957HisArg: 0.957 ± 0.0
1.276HisSer: 1.276 ± 0.0
0.319HisThr: 0.319 ± 0.0
3.51HisVal: 3.51 ± 0.0
0.319HisTrp: 0.319 ± 0.0
0.319HisTyr: 0.319 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.829IleAla: 3.829 ± 0.0
1.914IleCys: 1.914 ± 0.0
4.467IleAsp: 4.467 ± 0.0
4.467IleGlu: 4.467 ± 0.0
2.872IlePhe: 2.872 ± 0.0
4.148IleGly: 4.148 ± 0.0
1.595IleHis: 1.595 ± 0.0
3.191IleIle: 3.191 ± 0.0
4.467IleLys: 4.467 ± 0.0
4.148IleLeu: 4.148 ± 0.0
1.595IleMet: 1.595 ± 0.0
2.553IleAsn: 2.553 ± 0.0
3.191IlePro: 3.191 ± 0.0
1.276IleGln: 1.276 ± 0.0
2.553IleArg: 2.553 ± 0.0
3.51IleSer: 3.51 ± 0.0
4.467IleThr: 4.467 ± 0.0
2.872IleVal: 2.872 ± 0.0
1.595IleTrp: 1.595 ± 0.0
1.914IleTyr: 1.914 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.234LysAla: 2.234 ± 0.0
0.957LysCys: 0.957 ± 0.0
3.191LysAsp: 3.191 ± 0.0
4.467LysGlu: 4.467 ± 0.0
5.105LysPhe: 5.105 ± 0.0
3.829LysGly: 3.829 ± 0.0
1.595LysHis: 1.595 ± 0.0
5.105LysIle: 5.105 ± 0.0
3.51LysLys: 3.51 ± 0.0
4.148LysLeu: 4.148 ± 0.0
1.276LysMet: 1.276 ± 0.0
3.51LysAsn: 3.51 ± 0.0
3.829LysPro: 3.829 ± 0.0
2.234LysGln: 2.234 ± 0.0
2.872LysArg: 2.872 ± 0.0
4.148LysSer: 4.148 ± 0.0
1.914LysThr: 1.914 ± 0.0
2.872LysVal: 2.872 ± 0.0
0.638LysTrp: 0.638 ± 0.0
2.553LysTyr: 2.553 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
3.829LeuAla: 3.829 ± 0.0
3.51LeuCys: 3.51 ± 0.0
3.51LeuAsp: 3.51 ± 0.0
2.553LeuGlu: 2.553 ± 0.0
2.553LeuPhe: 2.553 ± 0.0
6.382LeuGly: 6.382 ± 0.0
1.276LeuHis: 1.276 ± 0.0
4.786LeuIle: 4.786 ± 0.0
7.658LeuLys: 7.658 ± 0.0
7.977LeuLeu: 7.977 ± 0.0
1.595LeuMet: 1.595 ± 0.0
3.829LeuAsn: 3.829 ± 0.0
4.467LeuPro: 4.467 ± 0.0
1.914LeuGln: 1.914 ± 0.0
5.743LeuArg: 5.743 ± 0.0
4.148LeuSer: 4.148 ± 0.0
7.02LeuThr: 7.02 ± 0.0
6.382LeuVal: 6.382 ± 0.0
0.638LeuTrp: 0.638 ± 0.0
3.51LeuTyr: 3.51 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.595MetAla: 1.595 ± 0.0
0.319MetCys: 0.319 ± 0.0
2.234MetAsp: 2.234 ± 0.0
1.595MetGlu: 1.595 ± 0.0
1.914MetPhe: 1.914 ± 0.0
1.276MetGly: 1.276 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.276MetIle: 1.276 ± 0.0
0.638MetLys: 0.638 ± 0.0
2.553MetLeu: 2.553 ± 0.0
0.957MetMet: 0.957 ± 0.0
2.234MetAsn: 2.234 ± 0.0
1.914MetPro: 1.914 ± 0.0
0.638MetGln: 0.638 ± 0.0
2.553MetArg: 2.553 ± 0.0
2.872MetSer: 2.872 ± 0.0
1.595MetThr: 1.595 ± 0.0
1.914MetVal: 1.914 ± 0.0
0.319MetTrp: 0.319 ± 0.0
1.595MetTyr: 1.595 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.595AsnAla: 1.595 ± 0.0
0.957AsnCys: 0.957 ± 0.0
0.638AsnAsp: 0.638 ± 0.0
3.51AsnGlu: 3.51 ± 0.0
2.553AsnPhe: 2.553 ± 0.0
3.51AsnGly: 3.51 ± 0.0
0.638AsnHis: 0.638 ± 0.0
3.51AsnIle: 3.51 ± 0.0
2.234AsnLys: 2.234 ± 0.0
4.467AsnLeu: 4.467 ± 0.0
1.595AsnMet: 1.595 ± 0.0
1.595AsnAsn: 1.595 ± 0.0
2.234AsnPro: 2.234 ± 0.0
0.638AsnGln: 0.638 ± 0.0
3.51AsnArg: 3.51 ± 0.0
1.595AsnSer: 1.595 ± 0.0
2.234AsnThr: 2.234 ± 0.0
4.786AsnVal: 4.786 ± 0.0
0.638AsnTrp: 0.638 ± 0.0
0.957AsnTyr: 0.957 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.553ProAla: 2.553 ± 0.0
1.276ProCys: 1.276 ± 0.0
3.51ProAsp: 3.51 ± 0.0
4.786ProGlu: 4.786 ± 0.0
2.553ProPhe: 2.553 ± 0.0
3.829ProGly: 3.829 ± 0.0
1.595ProHis: 1.595 ± 0.0
3.51ProIle: 3.51 ± 0.0
2.872ProLys: 2.872 ± 0.0
4.148ProLeu: 4.148 ± 0.0
1.276ProMet: 1.276 ± 0.0
1.595ProAsn: 1.595 ± 0.0
1.276ProPro: 1.276 ± 0.0
0.638ProGln: 0.638 ± 0.0
3.191ProArg: 3.191 ± 0.0
3.191ProSer: 3.191 ± 0.0
3.829ProThr: 3.829 ± 0.0
4.786ProVal: 4.786 ± 0.0
0.319ProTrp: 0.319 ± 0.0
1.914ProTyr: 1.914 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.276GlnAla: 1.276 ± 0.0
0.957GlnCys: 0.957 ± 0.0
0.957GlnAsp: 0.957 ± 0.0
1.914GlnGlu: 1.914 ± 0.0
0.319GlnPhe: 0.319 ± 0.0
0.319GlnGly: 0.319 ± 0.0
0.957GlnHis: 0.957 ± 0.0
0.638GlnIle: 0.638 ± 0.0
1.914GlnLys: 1.914 ± 0.0
3.191GlnLeu: 3.191 ± 0.0
1.595GlnMet: 1.595 ± 0.0
1.276GlnAsn: 1.276 ± 0.0
1.595GlnPro: 1.595 ± 0.0
0.957GlnGln: 0.957 ± 0.0
2.872GlnArg: 2.872 ± 0.0
0.319GlnSer: 0.319 ± 0.0
2.553GlnThr: 2.553 ± 0.0
3.191GlnVal: 3.191 ± 0.0
0.638GlnTrp: 0.638 ± 0.0
0.957GlnTyr: 0.957 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.553ArgAla: 2.553 ± 0.0
0.638ArgCys: 0.638 ± 0.0
4.148ArgAsp: 4.148 ± 0.0
2.553ArgGlu: 2.553 ± 0.0
3.191ArgPhe: 3.191 ± 0.0
3.191ArgGly: 3.191 ± 0.0
0.957ArgHis: 0.957 ± 0.0
2.872ArgIle: 2.872 ± 0.0
2.872ArgLys: 2.872 ± 0.0
5.105ArgLeu: 5.105 ± 0.0
1.914ArgMet: 1.914 ± 0.0
1.914ArgAsn: 1.914 ± 0.0
1.276ArgPro: 1.276 ± 0.0
1.914ArgGln: 1.914 ± 0.0
4.148ArgArg: 4.148 ± 0.0
3.51ArgSer: 3.51 ± 0.0
4.467ArgThr: 4.467 ± 0.0
4.467ArgVal: 4.467 ± 0.0
2.234ArgTrp: 2.234 ± 0.0
2.234ArgTyr: 2.234 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.829SerAla: 3.829 ± 0.0
0.957SerCys: 0.957 ± 0.0
3.829SerAsp: 3.829 ± 0.0
2.872SerGlu: 2.872 ± 0.0
3.829SerPhe: 3.829 ± 0.0
3.191SerGly: 3.191 ± 0.0
1.276SerHis: 1.276 ± 0.0
3.51SerIle: 3.51 ± 0.0
5.105SerLys: 5.105 ± 0.0
4.467SerLeu: 4.467 ± 0.0
2.234SerMet: 2.234 ± 0.0
2.872SerAsn: 2.872 ± 0.0
3.191SerPro: 3.191 ± 0.0
0.957SerGln: 0.957 ± 0.0
2.553SerArg: 2.553 ± 0.0
5.105SerSer: 5.105 ± 0.0
4.786SerThr: 4.786 ± 0.0
5.424SerVal: 5.424 ± 0.0
0.319SerTrp: 0.319 ± 0.0
2.234SerTyr: 2.234 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.105ThrAla: 5.105 ± 0.0
1.595ThrCys: 1.595 ± 0.0
3.191ThrAsp: 3.191 ± 0.0
3.191ThrGlu: 3.191 ± 0.0
3.191ThrPhe: 3.191 ± 0.0
4.786ThrGly: 4.786 ± 0.0
1.914ThrHis: 1.914 ± 0.0
3.829ThrIle: 3.829 ± 0.0
3.51ThrLys: 3.51 ± 0.0
4.786ThrLeu: 4.786 ± 0.0
2.872ThrMet: 2.872 ± 0.0
3.191ThrAsn: 3.191 ± 0.0
3.191ThrPro: 3.191 ± 0.0
0.957ThrGln: 0.957 ± 0.0
2.553ThrArg: 2.553 ± 0.0
3.829ThrSer: 3.829 ± 0.0
6.063ThrThr: 6.063 ± 0.0
5.105ThrVal: 5.105 ± 0.0
1.276ThrTrp: 1.276 ± 0.0
0.957ThrTyr: 0.957 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
6.382ValAla: 6.382 ± 0.0
0.957ValCys: 0.957 ± 0.0
4.467ValAsp: 4.467 ± 0.0
5.424ValGlu: 5.424 ± 0.0
2.872ValPhe: 2.872 ± 0.0
5.743ValGly: 5.743 ± 0.0
1.914ValHis: 1.914 ± 0.0
3.191ValIle: 3.191 ± 0.0
7.977ValLys: 7.977 ± 0.0
8.934ValLeu: 8.934 ± 0.0
1.276ValMet: 1.276 ± 0.0
3.191ValAsn: 3.191 ± 0.0
7.339ValPro: 7.339 ± 0.0
2.872ValGln: 2.872 ± 0.0
2.872ValArg: 2.872 ± 0.0
7.339ValSer: 7.339 ± 0.0
2.872ValThr: 2.872 ± 0.0
4.786ValVal: 4.786 ± 0.0
0.319ValTrp: 0.319 ± 0.0
1.595ValTyr: 1.595 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.638TrpAla: 0.638 ± 0.0
0.638TrpCys: 0.638 ± 0.0
0.957TrpAsp: 0.957 ± 0.0
0.319TrpGlu: 0.319 ± 0.0
0.319TrpPhe: 0.319 ± 0.0
0.319TrpGly: 0.319 ± 0.0
0.638TrpHis: 0.638 ± 0.0
1.276TrpIle: 1.276 ± 0.0
0.638TrpLys: 0.638 ± 0.0
0.957TrpLeu: 0.957 ± 0.0
0.957TrpMet: 0.957 ± 0.0
1.276TrpAsn: 1.276 ± 0.0
0.638TrpPro: 0.638 ± 0.0
0.319TrpGln: 0.319 ± 0.0
0.957TrpArg: 0.957 ± 0.0
1.595TrpSer: 1.595 ± 0.0
1.276TrpThr: 1.276 ± 0.0
0.319TrpVal: 0.319 ± 0.0
0.638TrpTrp: 0.638 ± 0.0
0.638TrpTyr: 0.638 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.914TyrAla: 1.914 ± 0.0
0.957TyrCys: 0.957 ± 0.0
2.234TyrAsp: 2.234 ± 0.0
4.148TyrGlu: 4.148 ± 0.0
0.957TyrPhe: 0.957 ± 0.0
2.872TyrGly: 2.872 ± 0.0
0.638TyrHis: 0.638 ± 0.0
1.595TyrIle: 1.595 ± 0.0
0.957TyrLys: 0.957 ± 0.0
0.957TyrLeu: 0.957 ± 0.0
1.276TyrMet: 1.276 ± 0.0
2.553TyrAsn: 2.553 ± 0.0
1.276TyrPro: 1.276 ± 0.0
2.234TyrGln: 2.234 ± 0.0
1.595TyrArg: 1.595 ± 0.0
0.957TyrSer: 0.957 ± 0.0
3.191TyrThr: 3.191 ± 0.0
1.276TyrVal: 1.276 ± 0.0
0.957TyrTrp: 0.957 ± 0.0
1.276TyrTyr: 1.276 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3135 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski