Amino acid dipeptide frequency for Beihai zhaovirus-like virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
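The counting described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: it assumes overlapping dipeptides, one-letter amino acid codes, and a made-up toy sequence; the function name `dipeptide_per_mille` is hypothetical.

```python
from collections import Counter

def dipeptide_per_mille(sequence):
    """Return overlapping dipeptide frequencies in per mille (assumed scheme)."""
    # Every pair of adjacent residues counts as one dipeptide.
    pairs = [sequence[i:i + 2] for i in range(len(sequence) - 1)]
    counts = Counter(pairs)
    total = len(pairs)
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Expected frequency if all 400 standard dipeptides were equally likely:
# 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

toy = "MKKLLKKA"  # hypothetical toy sequence, not from the proteome
freqs = dipeptide_per_mille(toy)

# Dipeptides above the uniform expectation would be the "red" cells,
# those below it the "blue" cells.
over = sorted(p for p, f in freqs.items() if f > UNIFORM_PER_MILLE)
under = sorted(p for p, f in freqs.items() if f < UNIFORM_PER_MILLE)
```

In such a short toy sequence every observed dipeptide is far above 2.5‰; the comparison only becomes informative for sequences with thousands of residues.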

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue and columns the second residue of each dipeptide. Every value has an estimated error of ± 0.0.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  2.858    0.0  2.858  2.144  1.072  4.287  1.429  3.573  2.501  3.215  1.072  2.501  2.144  3.573  2.858  2.144  2.501  1.786    0.0  1.072    0.0
Cys  0.357  0.715  0.715    0.0  2.144  1.072  1.072  0.715  0.715  1.072  0.357  0.357  1.072  1.786  1.072  1.072    0.0  2.144  0.715    0.0    0.0
Asp  3.215  1.072   3.93  4.287  6.074  1.786  1.429  5.716   3.93  7.145  1.072  3.573  2.501  5.002  0.715   3.93  0.715  5.002  0.357  3.573  0.357
Glu  2.144  1.429  4.287  7.503  4.287  1.786  1.429   3.93  3.573   3.93  1.072  5.002  2.144  4.287  1.786  5.002  2.858  2.501    0.0  2.501    0.0
Phe  0.715  1.429  2.858  2.501  1.072  2.144  0.715  4.287  4.287  2.501  1.786  2.501  1.072  4.287  3.573  3.573  2.144  2.858  1.072  4.645    0.0
Gly  1.429  0.357  3.215  2.858  1.786  0.715  1.072  2.858  3.215  3.215  0.715  1.786  1.072  3.573  2.858  2.501  1.786  1.072    0.0  4.645    0.0
His  1.429  0.357  1.072  1.072  0.715  1.786  0.357  2.144  2.144  1.429  0.715  1.429  1.429  1.786  1.786  1.429  0.715  1.072    0.0  1.786    0.0
Ile  2.858  1.786  7.145  5.716  0.715  3.215  1.072  3.573   3.93  6.431  0.715  7.145   3.93   3.93  1.072  5.002  3.215  2.858  1.429  1.072    0.0
Lys  2.144  1.072  2.501  4.287  5.002  1.429  2.858  5.359 10.718  6.431  2.144  5.359  2.144  4.645  4.645  3.573  2.501  7.503  0.357  2.144    0.0
Leu  3.215  1.072  4.287   3.93  3.573  4.287  1.786  4.645  6.788  4.287  2.144  5.359  4.645  4.287  7.145  5.359  2.144  2.858  1.429  3.573  0.357
Met  0.715  0.715  0.715  1.072  0.715  0.715    0.0  2.144  1.429  1.072  0.357  1.786    0.0  2.144  1.072  0.715  1.786  2.501    0.0  0.715    0.0
Asn  5.359  1.072  3.215  3.573  2.858  4.645  2.144  2.501  5.359  7.503  1.072  4.645  1.429  5.359  3.215  6.074  3.215  3.573    0.0  1.429  0.357
Pro  0.715    0.0  3.573  1.429  1.786  1.429  2.144  3.215  2.144  4.645  1.429  2.501  1.786  3.573  1.429  2.144  1.429  1.786    0.0  1.072    0.0
Gln  2.858  1.072  4.287  4.645  2.501  2.501  2.144  6.431  5.359  6.788  0.715  6.431  2.144  9.646  5.359  5.716  4.287  1.786  0.357  4.645    0.0
Arg  3.215  1.072  2.501  4.287  2.501  2.144  0.715  2.144   3.93  3.573  1.072  4.287  1.072  5.716  2.501  5.359  2.144  1.429  0.357  1.786    0.0
Ser  2.501  0.357  3.573  3.215  3.573  2.501  0.715  5.002  5.716  3.215  0.715   3.93  3.573  6.431   3.93  4.645  3.215  5.359  0.357  2.858    0.0
Thr  0.715  0.715  4.287  2.501  2.501  1.429  0.357   3.93  4.287  1.429  0.715  1.786  2.858  2.858  2.144  1.786  1.429  3.215  0.357  2.858    0.0
Val   3.93  1.072  6.074  4.287  3.215  1.072  1.786  2.144  2.501  5.002  0.715   3.93  2.144  3.215  2.144  2.144  2.501  2.144  0.715  2.144    0.0
Trp  0.357    0.0  0.357    0.0  1.072  0.357    0.0    0.0  1.429  0.357    0.0  1.072    0.0  0.715    0.0  0.715  0.715    0.0  0.715  1.429    0.0
Tyr  2.858  1.786   3.93  2.144  2.858  1.786  1.429  2.144  2.858   3.93  1.072  3.215  1.072  2.858  2.501  2.501  2.501  1.429  1.072  2.858    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0  0.357    0.0    0.0  0.357    0.0    0.0    0.0    0.0    0.0  0.357    0.0    0.0    0.0    0.0

Statistics based on 1 protein (2800 amino acids)
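As a quick sanity check on the units, the per mille values can be converted back to approximate raw counts. The sketch below assumes that dipeptides are counted as overlapping pairs, so a single protein of 2800 residues contributes 2799 dipeptides; this assumption is not stated on the page, but it appears consistent with the smallest non-zero table value (0.357‰). The helper name `per_mille_to_count` is hypothetical.

```python
n_residues = 2800
n_dipeptides = n_residues - 1  # assumed: overlapping pairs within one protein

def per_mille_to_count(value):
    """Convert a per mille frequency back to an approximate raw count."""
    return round(value * n_dipeptides / 1000.0)

# Frequency of a dipeptide that occurs exactly once, in per mille.
single = round(1000.0 / n_dipeptides, 3)

lyslys_count = per_mille_to_count(10.718)  # LysLys, the highest value in the table
cyscys_count = per_mille_to_count(0.715)   # CysCys
```

Under this assumption, the LysLys value corresponds to about 30 occurrences and the CysCys value to 2.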

Note: The error was estimated by bootstrapping (×100) at the protein level.
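The exact bootstrap procedure is not described here, so the following is only a minimal sketch of one plausible protein-level bootstrap: proteins are resampled with replacement 100 times, and the spread of the resulting frequencies is taken as the error. The function names, the use of the population standard deviation, and the toy sequences are all assumptions.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        total += max(len(seq) - 1, 0)
        hits += sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over resampled proteomes (assumed procedure)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)

# With a single protein every resample is identical, so the error is zero.
one_protein_error = bootstrap_error(["MKKLLKKA"], "KK")
two_protein_error = bootstrap_error(["MKKLLKKA", "AAAA"], "KK")
```

With only one protein in the proteome, resampling at the protein level cannot produce any variation, which would explain why every value in the table carries an error of ± 0.0.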

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski