Amino acid dipeptide frequency for Human feces smacovirus 3

Besides single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
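The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it uses one-letter residue codes instead of the three-letter codes in the table, counts overlapping pairs within each protein, pools the counts over all proteins, and maps every non-standard symbol to X. The function names `dipeptide_permille` and `classify` are hypothetical, and the exact counting convention used by the database may differ.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X (Xaa) stands for anything else
UNIFORM_PERMILLE = 1000 / 400      # 2.5 per mille if all 400 pairs were equally likely


def dipeptide_permille(sequences):
    """Pooled dipeptide frequencies in per mille for an iterable of protein strings."""
    counts = Counter()
    for seq in sequences:
        # Map every non-standard or unknown symbol to X before counting.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows: residues i and i+1 form one dipeptide.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    # Report all 441 pairs, including those that never occur.
    return {
        a + b: (1000 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if permille > expected:
        return "over"
    if permille < expected:
        return "under"
    return "expected"


# Toy input, not the real smacovirus proteins.
freqs = dipeptide_permille(["ACAC", "AC"])
print(freqs["AC"], classify(freqs["AC"]))
print(freqs["AA"], classify(freqs["AA"]))
```

In this sketch a pair labelled "over" would correspond to a red cell and "under" to a blue cell, assuming the page compares each value against the uniform 2.5‰ expectation.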

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (for example, 1.608‰ = 0.001608, i.e. about 0.16%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.608 ± 1.03
AlaCys: 0.0 ± 0.0
AlaAsp: 4.823 ± 0.686
AlaGlu: 1.608 ± 1.374
AlaPhe: 0.0 ± 0.0
AlaGly: 3.215 ± 2.06
AlaHis: 1.608 ± 1.374
AlaIle: 6.431 ± 0.688
AlaLys: 3.215 ± 0.344
AlaLeu: 8.039 ± 4.466
AlaMet: 4.823 ± 0.686
AlaAsn: 3.215 ± 2.06
AlaPro: 4.823 ± 0.686
AlaGln: 3.215 ± 0.344
AlaArg: 0.0 ± 0.0
AlaSer: 9.646 ± 3.776
AlaThr: 4.823 ± 3.09
AlaVal: 1.608 ± 1.03
AlaTrp: 3.215 ± 0.344
AlaTyr: 3.215 ± 0.344
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.608 ± 1.374
CysLys: 3.215 ± 2.748
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 1.608 ± 1.03
CysThr: 0.0 ± 0.0
CysVal: 1.608 ± 1.374
CysTrp: 0.0 ± 0.0
CysTyr: 1.608 ± 1.374
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.608 ± 1.03
AspCys: 1.608 ± 1.374
AspAsp: 1.608 ± 1.03
AspGlu: 4.823 ± 1.718
AspPhe: 1.608 ± 1.374
AspGly: 3.215 ± 0.344
AspHis: 0.0 ± 0.0
AspIle: 1.608 ± 1.374
AspLys: 1.608 ± 1.374
AspLeu: 0.0 ± 0.0
AspMet: 3.215 ± 2.06
AspAsn: 1.608 ± 1.03
AspPro: 1.608 ± 1.03
AspGln: 1.608 ± 1.03
AspArg: 8.039 ± 2.062
AspSer: 4.823 ± 0.686
AspThr: 4.823 ± 1.718
AspVal: 3.215 ± 2.06
AspTrp: 0.0 ± 0.0
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.823 ± 0.686
GluCys: 0.0 ± 0.0
GluAsp: 1.608 ± 1.03
GluGlu: 0.0 ± 0.0
GluPhe: 1.608 ± 1.03
GluGly: 4.823 ± 1.718
GluHis: 1.608 ± 1.374
GluIle: 4.823 ± 0.686
GluLys: 1.608 ± 1.374
GluLeu: 1.608 ± 1.03
GluMet: 0.0 ± 0.0
GluAsn: 1.608 ± 1.03
GluPro: 3.215 ± 2.06
GluGln: 1.608 ± 1.374
GluArg: 4.823 ± 4.122
GluSer: 11.254 ± 2.406
GluThr: 3.215 ± 0.344
GluVal: 1.608 ± 1.374
GluTrp: 1.608 ± 1.374
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.608 ± 1.03
PheCys: 0.0 ± 0.0
PheAsp: 1.608 ± 1.03
PheGlu: 4.823 ± 0.686
PhePhe: 4.823 ± 0.686
PheGly: 3.215 ± 0.344
PheHis: 1.608 ± 1.03
PheIle: 0.0 ± 0.0
PheLys: 3.215 ± 2.06
PheLeu: 0.0 ± 0.0
PheMet: 6.431 ± 3.655
PheAsn: 1.608 ± 1.03
PhePro: 0.0 ± 0.0
PheGln: 0.0 ± 0.0
PheArg: 6.431 ± 1.716
PheSer: 1.608 ± 1.03
PheThr: 1.608 ± 1.03
PheVal: 0.0 ± 0.0
PheTrp: 0.0 ± 0.0
PheTyr: 3.215 ± 0.344
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.039 ± 2.746
GlyCys: 1.608 ± 1.03
GlyAsp: 0.0 ± 0.0
GlyGlu: 4.823 ± 3.09
GlyPhe: 8.039 ± 2.746
GlyGly: 6.431 ± 3.092
GlyHis: 0.0 ± 0.0
GlyIle: 4.823 ± 3.09
GlyLys: 4.823 ± 4.122
GlyLeu: 3.215 ± 2.748
GlyMet: 0.0 ± 0.884
GlyAsn: 4.823 ± 0.686
GlyPro: 0.0 ± 0.0
GlyGln: 1.608 ± 1.374
GlyArg: 1.608 ± 1.374
GlySer: 4.823 ± 0.686
GlyThr: 6.431 ± 4.12
GlyVal: 4.823 ± 1.718
GlyTrp: 3.215 ± 0.344
GlyTyr: 3.215 ± 0.344
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.608 ± 1.374
HisCys: 0.0 ± 0.0
HisAsp: 1.608 ± 1.03
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 3.215 ± 2.06
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 1.608 ± 1.03
HisVal: 3.215 ± 2.748
HisTrp: 3.215 ± 0.344
HisTyr: 1.608 ± 1.03
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.608 ± 1.03
IleCys: 1.608 ± 1.374
IleAsp: 6.431 ± 0.688
IleGlu: 4.823 ± 1.718
IlePhe: 3.215 ± 2.06
IleGly: 3.215 ± 2.06
IleHis: 1.608 ± 1.03
IleIle: 3.215 ± 0.344
IleLys: 1.608 ± 1.374
IleLeu: 12.862 ± 5.836
IleMet: 1.608 ± 1.374
IleAsn: 1.608 ± 1.374
IlePro: 3.215 ± 2.748
IleGln: 1.608 ± 1.03
IleArg: 6.431 ± 5.496
IleSer: 0.0 ± 0.0
IleThr: 3.215 ± 0.344
IleVal: 4.823 ± 1.718
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.823 ± 1.718
LysCys: 0.0 ± 0.0
LysAsp: 1.608 ± 1.374
LysGlu: 4.823 ± 1.718
LysPhe: 3.215 ± 2.06
LysGly: 4.823 ± 3.09
LysHis: 0.0 ± 0.0
LysIle: 1.608 ± 1.03
LysLys: 1.608 ± 1.374
LysLeu: 6.431 ± 3.092
LysMet: 1.608 ± 1.374
LysAsn: 1.608 ± 1.374
LysPro: 0.0 ± 0.0
LysGln: 3.215 ± 0.344
LysArg: 1.608 ± 1.374
LysSer: 1.608 ± 1.374
LysThr: 0.0 ± 0.0
LysVal: 1.608 ± 1.374
LysTrp: 3.215 ± 2.748
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.215 ± 0.344
LeuCys: 0.0 ± 0.0
LeuAsp: 3.215 ± 0.344
LeuGlu: 4.823 ± 1.718
LeuPhe: 1.608 ± 1.03
LeuGly: 3.215 ± 0.344
LeuHis: 0.0 ± 0.0
LeuIle: 1.608 ± 1.374
LeuLys: 1.608 ± 1.374
LeuLeu: 0.0 ± 0.0
LeuMet: 0.0 ± 0.0
LeuAsn: 1.608 ± 1.03
LeuPro: 11.254 ± 7.21
LeuGln: 4.823 ± 3.09
LeuArg: 1.608 ± 1.03
LeuSer: 6.431 ± 1.716
LeuThr: 3.215 ± 2.748
LeuVal: 6.431 ± 3.092
LeuTrp: 3.215 ± 2.748
LeuTyr: 6.431 ± 3.092
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.215 ± 2.06
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 3.215 ± 2.06
MetHis: 0.0 ± 0.0
MetIle: 3.215 ± 2.748
MetLys: 0.0 ± 0.0
MetLeu: 1.608 ± 1.03
MetMet: 0.0 ± 0.0
MetAsn: 1.608 ± 1.03
MetPro: 3.215 ± 2.06
MetGln: 0.0 ± 0.0
MetArg: 3.215 ± 0.344
MetSer: 3.215 ± 2.06
MetThr: 0.0 ± 0.0
MetVal: 3.215 ± 2.748
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.215 ± 2.06
AsnCys: 0.0 ± 0.0
AsnAsp: 3.215 ± 2.748
AsnGlu: 0.0 ± 0.0
AsnPhe: 1.608 ± 1.03
AsnGly: 1.608 ± 1.374
AsnHis: 1.608 ± 1.03
AsnIle: 6.431 ± 1.716
AsnLys: 0.0 ± 0.0
AsnLeu: 0.0 ± 0.0
AsnMet: 0.0 ± 0.0
AsnAsn: 1.608 ± 1.03
AsnPro: 1.608 ± 1.03
AsnGln: 1.608 ± 1.374
AsnArg: 0.0 ± 0.0
AsnSer: 6.431 ± 4.12
AsnThr: 9.646 ± 1.372
AsnVal: 3.215 ± 0.344
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.608 ± 1.03
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.431 ± 4.12
ProCys: 0.0 ± 0.0
ProAsp: 0.0 ± 0.0
ProGlu: 1.608 ± 1.03
ProPhe: 1.608 ± 1.03
ProGly: 0.0 ± 0.0
ProHis: 0.0 ± 0.0
ProIle: 3.215 ± 2.06
ProLys: 3.215 ± 0.344
ProLeu: 4.823 ± 0.686
ProMet: 0.0 ± 0.0
ProAsn: 3.215 ± 0.344
ProPro: 6.431 ± 0.688
ProGln: 1.608 ± 1.03
ProArg: 4.823 ± 1.718
ProSer: 4.823 ± 3.09
ProThr: 11.254 ± 2.402
ProVal: 4.823 ± 3.09
ProTrp: 0.0 ± 0.0
ProTyr: 1.608 ± 1.374
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.215 ± 0.344
GlnCys: 1.608 ± 1.374
GlnAsp: 3.215 ± 0.344
GlnGlu: 3.215 ± 2.06
GlnPhe: 1.608 ± 1.03
GlnGly: 4.823 ± 3.09
GlnHis: 1.608 ± 1.03
GlnIle: 4.823 ± 1.718
GlnLys: 0.0 ± 0.0
GlnLeu: 1.608 ± 1.374
GlnMet: 1.608 ± 1.03
GlnAsn: 0.0 ± 0.0
GlnPro: 3.215 ± 0.344
GlnGln: 0.0 ± 0.0
GlnArg: 1.608 ± 1.374
GlnSer: 3.215 ± 0.344
GlnThr: 1.608 ± 1.03
GlnVal: 3.215 ± 2.06
GlnTrp: 1.608 ± 1.374
GlnTyr: 3.215 ± 0.344
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.215 ± 2.748
ArgCys: 0.0 ± 0.0
ArgAsp: 1.608 ± 1.03
ArgGlu: 1.608 ± 1.374
ArgPhe: 3.215 ± 0.344
ArgGly: 9.646 ± 8.244
ArgHis: 0.0 ± 0.0
ArgIle: 4.823 ± 4.122
ArgLys: 3.215 ± 2.06
ArgLeu: 1.608 ± 1.03
ArgMet: 0.0 ± 0.0
ArgAsn: 0.0 ± 0.0
ArgPro: 3.215 ± 2.748
ArgGln: 0.0 ± 0.0
ArgArg: 4.823 ± 0.686
ArgSer: 1.608 ± 1.374
ArgThr: 3.215 ± 0.344
ArgVal: 8.039 ± 2.746
ArgTrp: 1.608 ± 1.374
ArgTyr: 4.823 ± 1.718
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.823 ± 0.686
SerCys: 1.608 ± 1.374
SerAsp: 1.608 ± 1.03
SerGlu: 1.608 ± 1.374
SerPhe: 1.608 ± 1.03
SerGly: 6.431 ± 1.716
SerHis: 0.0 ± 0.0
SerIle: 1.608 ± 1.374
SerLys: 1.608 ± 1.374
SerLeu: 4.823 ± 3.09
SerMet: 0.0 ± 0.0
SerAsn: 3.215 ± 0.344
SerPro: 4.823 ± 3.09
SerGln: 6.431 ± 1.716
SerArg: 3.215 ± 0.344
SerSer: 3.215 ± 0.344
SerThr: 11.254 ± 4.806
SerVal: 4.823 ± 0.686
SerTrp: 3.215 ± 2.748
SerTyr: 3.215 ± 2.06
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.039 ± 0.342
ThrCys: 0.0 ± 0.0
ThrAsp: 6.431 ± 1.716
ThrGlu: 1.608 ± 1.03
ThrPhe: 3.215 ± 2.06
ThrGly: 8.039 ± 0.342
ThrHis: 4.823 ± 0.686
ThrIle: 4.823 ± 3.09
ThrLys: 1.608 ± 1.374
ThrLeu: 4.823 ± 0.686
ThrMet: 1.608 ± 1.03
ThrAsn: 8.039 ± 2.062
ThrPro: 4.823 ± 3.09
ThrGln: 4.823 ± 0.686
ThrArg: 1.608 ± 1.374
ThrSer: 0.0 ± 0.0
ThrThr: 1.608 ± 1.03
ThrVal: 6.431 ± 1.716
ThrTrp: 1.608 ± 1.03
ThrTyr: 1.608 ± 1.03
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.823 ± 4.122
ValCys: 0.0 ± 0.0
ValAsp: 3.215 ± 2.748
ValGlu: 4.823 ± 1.718
ValPhe: 3.215 ± 0.344
ValGly: 4.823 ± 0.686
ValHis: 0.0 ± 0.0
ValIle: 3.215 ± 0.344
ValLys: 6.431 ± 0.688
ValLeu: 8.039 ± 4.466
ValMet: 0.0 ± 0.0
ValAsn: 4.823 ± 3.09
ValPro: 3.215 ± 0.344
ValGln: 4.823 ± 1.718
ValArg: 1.608 ± 1.03
ValSer: 3.215 ± 0.344
ValThr: 6.431 ± 1.716
ValVal: 1.608 ± 1.03
ValTrp: 3.215 ± 0.344
ValTyr: 4.823 ± 0.686
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.608 ± 1.374
TrpCys: 1.608 ± 1.374
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.608 ± 1.374
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.608 ± 1.374
TrpLys: 3.215 ± 0.344
TrpLeu: 3.215 ± 0.344
TrpMet: 3.215 ± 0.344
TrpAsn: 1.608 ± 1.03
TrpPro: 0.0 ± 0.0
TrpGln: 1.608 ± 1.374
TrpArg: 4.823 ± 1.718
TrpSer: 1.608 ± 1.374
TrpThr: 0.0 ± 0.0
TrpVal: 1.608 ± 1.374
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.608 ± 1.374
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.608 ± 1.03
TyrCys: 0.0 ± 0.0
TyrAsp: 4.823 ± 1.718
TyrGlu: 4.823 ± 1.718
TyrPhe: 1.608 ± 1.03
TyrGly: 0.0 ± 0.0
TyrHis: 1.608 ± 1.374
TyrIle: 3.215 ± 0.344
TyrLys: 1.608 ± 1.03
TyrLeu: 1.608 ± 1.03
TyrMet: 0.0 ± 0.0
TyrAsn: 1.608 ± 1.03
TyrPro: 4.823 ± 0.686
TyrGln: 6.431 ± 1.716
TyrArg: 0.0 ± 0.0
TyrSer: 0.0 ± 0.0
TyrThr: 1.608 ± 1.374
TyrVal: 6.431 ± 5.496
TyrTrp: 0.0 ± 0.0
TyrTyr: 4.823 ± 0.686
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (623 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
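One way to read the note above is that whole proteins are resampled with replacement and the dipeptide frequency is recomputed for each resample, with the spread of those values reported as the error. The sketch below illustrates that reading only. The helper names `pair_permille` and `bootstrap_sd`, the use of the sample standard deviation, and the pooled counting with one-letter codes are all assumptions, not the database's documented procedure.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Per-mille frequency of one dipeptide (one-letter codes), pooled over sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows: residues i and i+1 form one dipeptide.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_sd(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)


# Toy input, not the real smacovirus proteins.
toy_proteins = ["ACAC", "GGGG"]
print(bootstrap_sd(toy_proteins, "AC"))
```

With only two proteins, each resample is one of very few combinations, so an error estimated this way would take a limited set of values and should be read with caution.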

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski