Amino acid dipeptide frequency for Beihai picorna-like virus 50

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
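As a minimal sketch of how such a frequency can be computed, the snippet below counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_permille`, the use of one-letter residue codes, the choice not to count pairs across protein boundaries, and the toy sequence are all assumptions made for illustration; this is not the code used to produce the values on this page.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: sequences use one-letter residue codes, and pairs are
    counted within each protein only (never across protein boundaries).
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequence (hypothetical, not the viral polyprotein): 9 residues, 8 pairs.
freqs = dipeptide_permille(["MKVLAAGLV"])
print(freqs["AA"])  # 125.0, i.e. 1 of 8 pairs
```

Under this counting scheme a protein of 2471 residues contains 2470 overlapping pairs, so the smallest non-zero value in the table, 0.405‰, is consistent with a single occurrence (1000/2470 ≈ 0.405).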

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
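The comparison against the uniform expectation can be sketched as follows. The helper names `expected_permille` and `classify` are hypothetical, and whether the page's colouring uses 20 or 21 residue types for the threshold is not stated, so the alphabet size is left as a parameter.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation per dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if freq_permille > expected:
        return "over"    # would be shown in red
    if freq_permille < expected:
        return "under"   # would be shown in blue
    return "expected"


print(expected_permille(20))  # 2.5
print(classify(9.717))        # over  (the ValLeu value from the table)
print(classify(0.405))        # under (e.g. the CysAla value)
```

With 21 residue types (Xaa included) the expectation drops to 1000/441, roughly 2.27‰.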

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second. Values are in ‰; the bootstrap error is ± 0.0 for every dipeptide.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  6.478  1.619  4.049  2.834  2.024  2.834  1.619  3.239  3.644  5.263  2.429  4.049  3.644  2.024   0.81  3.239  4.453  5.263  1.619  3.239    0.0
Cys  0.405    0.0  1.215    0.0    0.0   0.81  0.405   0.81   0.81  1.215  0.405    0.0  1.215   0.81   0.81  2.024   0.81  1.215    0.0    0.0    0.0
Asp  4.049   0.81  3.239  3.644  5.263  4.453  1.619  4.858  3.239  4.453  1.619  4.049  3.644  1.215  1.215  3.239  2.429  3.644  0.405  3.239    0.0
Glu  3.644  1.215  2.429  4.453  2.024  2.429  1.215  2.429  2.834  5.668    0.0  1.619  1.215   0.81   0.81  3.644  4.858  4.453  0.405  2.024    0.0
Phe  2.024   0.81  2.834  3.644  0.405  5.263  2.834  2.024  2.834  4.049   0.81  4.858  2.429  1.619  3.644  2.834  2.834  2.834  0.405  3.239    0.0
Gly  5.668  0.405  5.668  2.834  4.858  6.073  1.215  4.049  6.883  5.263  0.405  3.644  2.429  2.834  2.429  5.263  4.858  4.049   0.81  2.429    0.0
His  3.239    0.0  2.024  0.405   0.81    0.0   0.81  2.024  0.405  2.834   0.81  0.405  1.215    0.0   0.81  2.024  1.215  0.405  0.405  1.619    0.0
Ile  2.429   0.81  4.049  3.239  1.215  4.858   0.81  5.263  4.453  4.858  2.429  2.429  3.239  1.619  2.024  3.239  2.834  1.619  0.405  2.024    0.0
Lys  2.024  0.405  2.834  2.834  2.834  3.644  1.215  3.239  2.834  5.263  0.405  1.215  1.619  2.429   0.81  3.644  4.049  4.453   0.81  2.834    0.0
Leu  8.502  2.024  7.692  4.858  6.073  4.049  1.215  3.239  3.239  8.097  2.024  4.858  4.049  3.239  3.239  6.883  4.453  8.097  1.619  2.429    0.0
Met  0.405    0.0  1.619  0.405  1.215  2.024  0.405  1.215    0.0   0.81   0.81  2.429  2.024  0.405  1.619  4.049  2.024  1.619    0.0  1.215    0.0
Asn  1.215    0.0  2.834  3.239  3.239  4.858  1.619  4.858  1.619  3.644  1.619  2.429  2.024  2.024  2.429  2.429  3.644  6.073  1.215  2.024    0.0
Pro  1.215   0.81  3.644  2.024  4.858  2.024   0.81  2.024  3.644  5.668  0.405  2.024  2.024  3.644  1.215  5.263  4.049  5.668  0.405  2.024    0.0
Gln  1.619  0.405   0.81  1.215  3.239  2.834  0.405   0.81  2.429  2.834   0.81  2.024  1.619  2.834  3.239  2.429  2.834  2.429  1.619  1.215    0.0
Arg  1.619    0.0   0.81  1.215  1.619  3.239  0.405  1.619  1.215  4.049  1.619  2.834  4.049  0.405  2.429  3.644  1.215  3.644  1.619  2.024    0.0
Ser  4.453  1.215  3.239  1.619  2.834  6.478  0.405  5.668  2.834  7.287  2.024  4.858  5.263  3.239  3.644  4.049  4.858  6.073  1.215  2.429    0.0
Thr  5.668  0.405  5.263  3.644  4.049  4.858  1.215  2.024  2.024  5.263   0.81  2.429  2.834  2.024  1.215  7.692  4.453  4.049   0.81  2.834    0.0
Val  6.478  1.215  2.429  4.858  2.834  4.049  2.024  2.429  2.834  9.717  2.429  3.239  4.858  3.239  5.263  4.049  3.239  3.644    0.0  4.453    0.0
Trp  0.405    0.0  1.215  0.405   0.81  1.215    0.0    0.0  0.405   0.81  1.215    0.0  1.619   0.81  1.215  1.215   0.81   0.81    0.0   0.81    0.0
Tyr  2.834   0.81  2.429   0.81  2.024  6.073  1.215  2.024  1.215  2.834   0.81  3.644  2.429  2.429  0.405  2.834  4.049  3.239    0.0  1.215    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (2471 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
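A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed seed, and the use of the population standard deviation as the error measure are assumptions for illustration, not the database's documented procedure.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over resamples of whole proteins."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Resample proteins (not residues) with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy input (hypothetical): a proteome consisting of a single protein.
print(bootstrap_error(["MKVLAAGLV"], "AA"))  # 0.0
```

When the proteome contains only one protein, every resample is identical to the original, so a scheme of this kind yields an error of exactly zero, which would account for the ± 0.0 reported for every dipeptide here.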

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski