Amino acid dipepetide frequency for Beihai picorna-like virus 66

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.463AlaAla: 5.463 ± 0.0
1.707AlaCys: 1.707 ± 0.0
3.756AlaAsp: 3.756 ± 0.0
5.121AlaGlu: 5.121 ± 0.0
4.097AlaPhe: 4.097 ± 0.0
3.073AlaGly: 3.073 ± 0.0
1.024AlaHis: 1.024 ± 0.0
2.048AlaIle: 2.048 ± 0.0
4.438AlaLys: 4.438 ± 0.0
6.487AlaLeu: 6.487 ± 0.0
2.39AlaMet: 2.39 ± 0.0
1.707AlaAsn: 1.707 ± 0.0
3.073AlaPro: 3.073 ± 0.0
0.683AlaGln: 0.683 ± 0.0
3.073AlaArg: 3.073 ± 0.0
5.463AlaSer: 5.463 ± 0.0
4.78AlaThr: 4.78 ± 0.0
4.097AlaVal: 4.097 ± 0.0
1.366AlaTrp: 1.366 ± 0.0
1.707AlaTyr: 1.707 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.707CysAla: 1.707 ± 0.0
0.341CysCys: 0.341 ± 0.0
1.024CysAsp: 1.024 ± 0.0
1.024CysGlu: 1.024 ± 0.0
1.024CysPhe: 1.024 ± 0.0
1.024CysGly: 1.024 ± 0.0
0.341CysHis: 0.341 ± 0.0
1.024CysIle: 1.024 ± 0.0
1.024CysLys: 1.024 ± 0.0
1.366CysLeu: 1.366 ± 0.0
1.024CysMet: 1.024 ± 0.0
0.341CysAsn: 0.341 ± 0.0
1.024CysPro: 1.024 ± 0.0
1.024CysGln: 1.024 ± 0.0
0.341CysArg: 0.341 ± 0.0
1.707CysSer: 1.707 ± 0.0
0.341CysThr: 0.341 ± 0.0
2.048CysVal: 2.048 ± 0.0
1.024CysTrp: 1.024 ± 0.0
1.024CysTyr: 1.024 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.756AspAla: 3.756 ± 0.0
0.683AspCys: 0.683 ± 0.0
5.804AspAsp: 5.804 ± 0.0
7.17AspGlu: 7.17 ± 0.0
4.438AspPhe: 4.438 ± 0.0
3.073AspGly: 3.073 ± 0.0
0.341AspHis: 0.341 ± 0.0
3.073AspIle: 3.073 ± 0.0
2.048AspLys: 2.048 ± 0.0
3.073AspLeu: 3.073 ± 0.0
1.024AspMet: 1.024 ± 0.0
1.707AspAsn: 1.707 ± 0.0
2.731AspPro: 2.731 ± 0.0
2.39AspGln: 2.39 ± 0.0
2.048AspArg: 2.048 ± 0.0
2.731AspSer: 2.731 ± 0.0
3.756AspThr: 3.756 ± 0.0
5.121AspVal: 5.121 ± 0.0
0.683AspTrp: 0.683 ± 0.0
2.048AspTyr: 2.048 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
5.121GluAla: 5.121 ± 0.0
0.341GluCys: 0.341 ± 0.0
3.073GluAsp: 3.073 ± 0.0
6.145GluGlu: 6.145 ± 0.0
3.073GluPhe: 3.073 ± 0.0
2.39GluGly: 2.39 ± 0.0
1.366GluHis: 1.366 ± 0.0
3.073GluIle: 3.073 ± 0.0
4.438GluLys: 4.438 ± 0.0
4.097GluLeu: 4.097 ± 0.0
2.731GluMet: 2.731 ± 0.0
2.731GluAsn: 2.731 ± 0.0
3.073GluPro: 3.073 ± 0.0
3.073GluGln: 3.073 ± 0.0
3.073GluArg: 3.073 ± 0.0
4.438GluSer: 4.438 ± 0.0
5.804GluThr: 5.804 ± 0.0
4.78GluVal: 4.78 ± 0.0
1.707GluTrp: 1.707 ± 0.0
2.39GluTyr: 2.39 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.39PheAla: 2.39 ± 0.0
1.707PheCys: 1.707 ± 0.0
2.731PheAsp: 2.731 ± 0.0
2.731PheGlu: 2.731 ± 0.0
2.048PhePhe: 2.048 ± 0.0
4.438PheGly: 4.438 ± 0.0
2.048PheHis: 2.048 ± 0.0
1.707PheIle: 1.707 ± 0.0
3.073PheLys: 3.073 ± 0.0
4.097PheLeu: 4.097 ± 0.0
2.39PheMet: 2.39 ± 0.0
1.024PheAsn: 1.024 ± 0.0
2.39PhePro: 2.39 ± 0.0
3.073PheGln: 3.073 ± 0.0
4.097PheArg: 4.097 ± 0.0
4.78PheSer: 4.78 ± 0.0
3.414PheThr: 3.414 ± 0.0
2.048PheVal: 2.048 ± 0.0
2.048PheTrp: 2.048 ± 0.0
3.073PheTyr: 3.073 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.121GlyAla: 5.121 ± 0.0
1.707GlyCys: 1.707 ± 0.0
5.463GlyAsp: 5.463 ± 0.0
4.438GlyGlu: 4.438 ± 0.0
4.438GlyPhe: 4.438 ± 0.0
3.414GlyGly: 3.414 ± 0.0
0.341GlyHis: 0.341 ± 0.0
2.731GlyIle: 2.731 ± 0.0
6.487GlyLys: 6.487 ± 0.0
5.804GlyLeu: 5.804 ± 0.0
2.048GlyMet: 2.048 ± 0.0
2.731GlyAsn: 2.731 ± 0.0
3.756GlyPro: 3.756 ± 0.0
1.707GlyGln: 1.707 ± 0.0
3.073GlyArg: 3.073 ± 0.0
5.463GlySer: 5.463 ± 0.0
4.097GlyThr: 4.097 ± 0.0
5.121GlyVal: 5.121 ± 0.0
1.366GlyTrp: 1.366 ± 0.0
1.707GlyTyr: 1.707 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.024HisAla: 1.024 ± 0.0
1.024HisCys: 1.024 ± 0.0
0.683HisAsp: 0.683 ± 0.0
2.39HisGlu: 2.39 ± 0.0
0.683HisPhe: 0.683 ± 0.0
1.366HisGly: 1.366 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.683HisIle: 0.683 ± 0.0
0.341HisLys: 0.341 ± 0.0
2.39HisLeu: 2.39 ± 0.0
0.683HisMet: 0.683 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.683HisPro: 0.683 ± 0.0
0.341HisGln: 0.341 ± 0.0
1.024HisArg: 1.024 ± 0.0
2.39HisSer: 2.39 ± 0.0
0.683HisThr: 0.683 ± 0.0
2.048HisVal: 2.048 ± 0.0
0.683HisTrp: 0.683 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.39IleAla: 2.39 ± 0.0
1.024IleCys: 1.024 ± 0.0
1.707IleAsp: 1.707 ± 0.0
4.78IleGlu: 4.78 ± 0.0
2.39IlePhe: 2.39 ± 0.0
4.097IleGly: 4.097 ± 0.0
1.366IleHis: 1.366 ± 0.0
2.048IleIle: 2.048 ± 0.0
2.39IleLys: 2.39 ± 0.0
3.414IleLeu: 3.414 ± 0.0
1.366IleMet: 1.366 ± 0.0
0.683IleAsn: 0.683 ± 0.0
3.756IlePro: 3.756 ± 0.0
2.048IleGln: 2.048 ± 0.0
2.39IleArg: 2.39 ± 0.0
2.048IleSer: 2.048 ± 0.0
2.39IleThr: 2.39 ± 0.0
2.39IleVal: 2.39 ± 0.0
0.341IleTrp: 0.341 ± 0.0
1.024IleTyr: 1.024 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
6.145LysAla: 6.145 ± 0.0
0.683LysCys: 0.683 ± 0.0
3.756LysAsp: 3.756 ± 0.0
5.804LysGlu: 5.804 ± 0.0
3.414LysPhe: 3.414 ± 0.0
3.073LysGly: 3.073 ± 0.0
1.366LysHis: 1.366 ± 0.0
4.438LysIle: 4.438 ± 0.0
6.487LysLys: 6.487 ± 0.0
5.463LysLeu: 5.463 ± 0.0
1.707LysMet: 1.707 ± 0.0
2.048LysAsn: 2.048 ± 0.0
3.073LysPro: 3.073 ± 0.0
2.39LysGln: 2.39 ± 0.0
2.048LysArg: 2.048 ± 0.0
2.048LysSer: 2.048 ± 0.0
2.731LysThr: 2.731 ± 0.0
4.097LysVal: 4.097 ± 0.0
1.366LysTrp: 1.366 ± 0.0
2.048LysTyr: 2.048 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.78LeuAla: 4.78 ± 0.0
2.048LeuCys: 2.048 ± 0.0
5.804LeuAsp: 5.804 ± 0.0
4.097LeuGlu: 4.097 ± 0.0
2.731LeuPhe: 2.731 ± 0.0
5.463LeuGly: 5.463 ± 0.0
2.048LeuHis: 2.048 ± 0.0
3.414LeuIle: 3.414 ± 0.0
4.438LeuLys: 4.438 ± 0.0
4.438LeuLeu: 4.438 ± 0.0
1.024LeuMet: 1.024 ± 0.0
3.414LeuAsn: 3.414 ± 0.0
6.487LeuPro: 6.487 ± 0.0
1.707LeuGln: 1.707 ± 0.0
4.438LeuArg: 4.438 ± 0.0
6.487LeuSer: 6.487 ± 0.0
3.414LeuThr: 3.414 ± 0.0
7.511LeuVal: 7.511 ± 0.0
1.024LeuTrp: 1.024 ± 0.0
1.707LeuTyr: 1.707 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.731MetAla: 2.731 ± 0.0
0.341MetCys: 0.341 ± 0.0
2.048MetAsp: 2.048 ± 0.0
1.707MetGlu: 1.707 ± 0.0
2.39MetPhe: 2.39 ± 0.0
2.731MetGly: 2.731 ± 0.0
1.024MetHis: 1.024 ± 0.0
0.683MetIle: 0.683 ± 0.0
2.731MetLys: 2.731 ± 0.0
2.048MetLeu: 2.048 ± 0.0
1.024MetMet: 1.024 ± 0.0
1.024MetAsn: 1.024 ± 0.0
1.024MetPro: 1.024 ± 0.0
0.341MetGln: 0.341 ± 0.0
1.366MetArg: 1.366 ± 0.0
2.048MetSer: 2.048 ± 0.0
2.048MetThr: 2.048 ± 0.0
2.39MetVal: 2.39 ± 0.0
0.683MetTrp: 0.683 ± 0.0
1.024MetTyr: 1.024 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.731AsnAla: 2.731 ± 0.0
0.683AsnCys: 0.683 ± 0.0
1.024AsnAsp: 1.024 ± 0.0
2.048AsnGlu: 2.048 ± 0.0
3.756AsnPhe: 3.756 ± 0.0
2.39AsnGly: 2.39 ± 0.0
0.341AsnHis: 0.341 ± 0.0
2.731AsnIle: 2.731 ± 0.0
1.366AsnLys: 1.366 ± 0.0
2.731AsnLeu: 2.731 ± 0.0
1.707AsnMet: 1.707 ± 0.0
1.366AsnAsn: 1.366 ± 0.0
2.048AsnPro: 2.048 ± 0.0
2.731AsnGln: 2.731 ± 0.0
1.366AsnArg: 1.366 ± 0.0
1.707AsnSer: 1.707 ± 0.0
2.048AsnThr: 2.048 ± 0.0
1.366AsnVal: 1.366 ± 0.0
0.683AsnTrp: 0.683 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.731ProAla: 2.731 ± 0.0
0.683ProCys: 0.683 ± 0.0
3.073ProAsp: 3.073 ± 0.0
3.414ProGlu: 3.414 ± 0.0
3.414ProPhe: 3.414 ± 0.0
2.731ProGly: 2.731 ± 0.0
1.366ProHis: 1.366 ± 0.0
3.073ProIle: 3.073 ± 0.0
3.073ProLys: 3.073 ± 0.0
3.414ProLeu: 3.414 ± 0.0
2.048ProMet: 2.048 ± 0.0
1.366ProAsn: 1.366 ± 0.0
0.683ProPro: 0.683 ± 0.0
1.024ProGln: 1.024 ± 0.0
3.756ProArg: 3.756 ± 0.0
2.731ProSer: 2.731 ± 0.0
3.414ProThr: 3.414 ± 0.0
5.121ProVal: 5.121 ± 0.0
0.683ProTrp: 0.683 ± 0.0
1.024ProTyr: 1.024 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.707GlnAla: 1.707 ± 0.0
0.683GlnCys: 0.683 ± 0.0
1.707GlnAsp: 1.707 ± 0.0
1.366GlnGlu: 1.366 ± 0.0
1.024GlnPhe: 1.024 ± 0.0
3.073GlnGly: 3.073 ± 0.0
0.341GlnHis: 0.341 ± 0.0
1.366GlnIle: 1.366 ± 0.0
2.048GlnLys: 2.048 ± 0.0
2.048GlnLeu: 2.048 ± 0.0
1.366GlnMet: 1.366 ± 0.0
2.048GlnAsn: 2.048 ± 0.0
2.048GlnPro: 2.048 ± 0.0
1.366GlnGln: 1.366 ± 0.0
1.366GlnArg: 1.366 ± 0.0
3.073GlnSer: 3.073 ± 0.0
0.683GlnThr: 0.683 ± 0.0
3.414GlnVal: 3.414 ± 0.0
0.341GlnTrp: 0.341 ± 0.0
0.683GlnTyr: 0.683 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.073ArgAla: 3.073 ± 0.0
1.024ArgCys: 1.024 ± 0.0
2.39ArgAsp: 2.39 ± 0.0
2.39ArgGlu: 2.39 ± 0.0
2.048ArgPhe: 2.048 ± 0.0
4.097ArgGly: 4.097 ± 0.0
0.341ArgHis: 0.341 ± 0.0
1.366ArgIle: 1.366 ± 0.0
4.097ArgLys: 4.097 ± 0.0
4.097ArgLeu: 4.097 ± 0.0
1.707ArgMet: 1.707 ± 0.0
2.048ArgAsn: 2.048 ± 0.0
1.366ArgPro: 1.366 ± 0.0
0.683ArgGln: 0.683 ± 0.0
3.073ArgArg: 3.073 ± 0.0
3.414ArgSer: 3.414 ± 0.0
3.414ArgThr: 3.414 ± 0.0
4.438ArgVal: 4.438 ± 0.0
1.707ArgTrp: 1.707 ± 0.0
1.707ArgTyr: 1.707 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.756SerAla: 3.756 ± 0.0
2.048SerCys: 2.048 ± 0.0
4.438SerAsp: 4.438 ± 0.0
5.121SerGlu: 5.121 ± 0.0
3.414SerPhe: 3.414 ± 0.0
9.56SerGly: 9.56 ± 0.0
1.366SerHis: 1.366 ± 0.0
3.073SerIle: 3.073 ± 0.0
4.097SerLys: 4.097 ± 0.0
6.487SerLeu: 6.487 ± 0.0
2.048SerMet: 2.048 ± 0.0
3.073SerAsn: 3.073 ± 0.0
2.731SerPro: 2.731 ± 0.0
1.707SerGln: 1.707 ± 0.0
3.756SerArg: 3.756 ± 0.0
9.218SerSer: 9.218 ± 0.0
4.097SerThr: 4.097 ± 0.0
5.804SerVal: 5.804 ± 0.0
1.366SerTrp: 1.366 ± 0.0
1.707SerTyr: 1.707 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.414ThrAla: 3.414 ± 0.0
1.024ThrCys: 1.024 ± 0.0
1.707ThrAsp: 1.707 ± 0.0
2.048ThrGlu: 2.048 ± 0.0
3.756ThrPhe: 3.756 ± 0.0
5.121ThrGly: 5.121 ± 0.0
1.366ThrHis: 1.366 ± 0.0
2.048ThrIle: 2.048 ± 0.0
3.756ThrLys: 3.756 ± 0.0
4.097ThrLeu: 4.097 ± 0.0
2.39ThrMet: 2.39 ± 0.0
2.39ThrAsn: 2.39 ± 0.0
2.048ThrPro: 2.048 ± 0.0
2.048ThrGln: 2.048 ± 0.0
4.097ThrArg: 4.097 ± 0.0
6.487ThrSer: 6.487 ± 0.0
2.731ThrThr: 2.731 ± 0.0
3.756ThrVal: 3.756 ± 0.0
0.683ThrTrp: 0.683 ± 0.0
3.756ThrTyr: 3.756 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.78ValAla: 4.78 ± 0.0
2.048ValCys: 2.048 ± 0.0
4.438ValAsp: 4.438 ± 0.0
2.048ValGlu: 2.048 ± 0.0
3.756ValPhe: 3.756 ± 0.0
6.828ValGly: 6.828 ± 0.0
2.048ValHis: 2.048 ± 0.0
3.073ValIle: 3.073 ± 0.0
5.121ValLys: 5.121 ± 0.0
7.17ValLeu: 7.17 ± 0.0
1.366ValMet: 1.366 ± 0.0
2.048ValAsn: 2.048 ± 0.0
5.121ValPro: 5.121 ± 0.0
2.39ValGln: 2.39 ± 0.0
2.731ValArg: 2.731 ± 0.0
6.828ValSer: 6.828 ± 0.0
4.438ValThr: 4.438 ± 0.0
6.487ValVal: 6.487 ± 0.0
1.024ValTrp: 1.024 ± 0.0
3.073ValTyr: 3.073 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.683TrpAla: 0.683 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.366TrpAsp: 1.366 ± 0.0
0.341TrpGlu: 0.341 ± 0.0
1.707TrpPhe: 1.707 ± 0.0
2.048TrpGly: 2.048 ± 0.0
0.341TrpHis: 0.341 ± 0.0
1.707TrpIle: 1.707 ± 0.0
2.048TrpLys: 2.048 ± 0.0
1.707TrpLeu: 1.707 ± 0.0
0.341TrpMet: 0.341 ± 0.0
0.341TrpAsn: 0.341 ± 0.0
0.683TrpPro: 0.683 ± 0.0
0.683TrpGln: 0.683 ± 0.0
0.341TrpArg: 0.341 ± 0.0
1.024TrpSer: 1.024 ± 0.0
2.731TrpThr: 2.731 ± 0.0
1.366TrpVal: 1.366 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.048TyrAla: 2.048 ± 0.0
0.0TyrCys: 0.0 ± 0.0
1.707TyrAsp: 1.707 ± 0.0
2.731TyrGlu: 2.731 ± 0.0
1.707TyrPhe: 1.707 ± 0.0
1.366TyrGly: 1.366 ± 0.0
0.341TyrHis: 0.341 ± 0.0
0.683TyrIle: 0.683 ± 0.0
0.683TyrLys: 0.683 ± 0.0
2.048TyrLeu: 2.048 ± 0.0
0.683TyrMet: 0.683 ± 0.0
2.731TyrAsn: 2.731 ± 0.0
1.024TyrPro: 1.024 ± 0.0
0.341TyrGln: 0.341 ± 0.0
1.024TyrArg: 1.024 ± 0.0
4.78TyrSer: 4.78 ± 0.0
1.707TyrThr: 1.707 ± 0.0
3.414TyrVal: 3.414 ± 0.0
0.683TyrTrp: 0.683 ± 0.0
1.024TyrTyr: 1.024 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2930 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski