Amino acid dipeptide frequency for Canine feces-associated gemycircularvirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
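The calculation described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the made-up sequences in `toy_proteins`, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions, since the page does not state the exact counting convention.

```python
from collections import Counter

# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping residue pairs are counted inside each protein,
    and no pair spans two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical one-letter sequences, for illustration only.
toy_proteins = ["MARRAG", "GRRA"]
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides above or below the uniform expectation (red / blue on the page).
over = sorted(p for p, f in freqs.items() if f > UNIFORM_EXPECTATION)
under = sorted(p for p, f in freqs.items() if f < UNIFORM_EXPECTATION)
```

With sequences this short every observed dipeptide lands far above 2.5‰; the comparison only becomes informative for whole proteomes.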

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
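As a worked example of the unit conversion, the short sketch below turns the ArgArg value from the table (20.9‰) into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical and chosen only for this illustration.

```python
# Uniform expectation for one of the 400 standard dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


arg_arg = 20.9  # ArgArg entry from the table, in per mille
arg_arg_fraction = per_mille_to_fraction(arg_arg)
arg_arg_fold = fold_vs_uniform(arg_arg)
```

Here `arg_arg_fraction` is about 0.0209 and `arg_arg_fold` is about 8.4, meaning ArgArg is the most overrepresented dipeptide in the table relative to the uniform expectation.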

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.0AlaCys: 0.0 ± 0.0
3.215AlaAsp: 3.215 ± 2.385
1.608AlaGlu: 1.608 ± 1.193
1.608AlaPhe: 1.608 ± 1.193
6.431AlaGly: 6.431 ± 2.31
0.0AlaHis: 0.0 ± 0.0
6.431AlaIle: 6.431 ± 2.611
1.608AlaLys: 1.608 ± 1.193
0.0AlaLeu: 0.0 ± 0.0
1.608AlaMet: 1.608 ± 0.843
1.608AlaAsn: 1.608 ± 1.193
4.823AlaPro: 4.823 ± 1.343
0.0AlaGln: 0.0 ± 0.0
6.431AlaArg: 6.431 ± 2.611
1.608AlaSer: 1.608 ± 1.193
6.431AlaThr: 6.431 ± 2.31
4.823AlaVal: 4.823 ± 3.803
1.608AlaTrp: 1.608 ± 1.268
4.823AlaTyr: 4.823 ± 1.343
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.608CysGlu: 1.608 ± 1.193
3.215CysPhe: 3.215 ± 2.535
1.608CysGly: 1.608 ± 1.193
0.0CysHis: 0.0 ± 0.0
4.823CysIle: 4.823 ± 3.578
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.608CysPro: 1.608 ± 1.268
1.608CysGln: 1.608 ± 1.193
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
1.608CysThr: 1.608 ± 1.193
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.608CysTyr: 1.608 ± 1.193
0.0CysXaa: 0.0 ± 0.0
Asp
3.215AspAla: 3.215 ± 0.075
0.0AspCys: 0.0 ± 0.0
4.823AspAsp: 4.823 ± 1.117
1.608AspGlu: 1.608 ± 1.193
0.0AspPhe: 0.0 ± 0.0
4.823AspGly: 4.823 ± 1.117
0.0AspHis: 0.0 ± 0.0
1.608AspIle: 1.608 ± 1.268
3.215AspLys: 3.215 ± 0.075
4.823AspLeu: 4.823 ± 1.117
3.215AspMet: 3.215 ± 2.385
1.608AspAsn: 1.608 ± 1.268
3.215AspPro: 3.215 ± 0.075
6.431AspGln: 6.431 ± 0.15
4.823AspArg: 4.823 ± 1.343
3.215AspSer: 3.215 ± 2.535
8.039AspThr: 8.039 ± 6.339
8.039AspVal: 8.039 ± 5.963
6.431AspTrp: 6.431 ± 0.15
4.823AspTyr: 4.823 ± 1.117
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
1.608GluCys: 1.608 ± 1.193
1.608GluAsp: 1.608 ± 1.193
0.0GluGlu: 0.0 ± 0.0
3.215GluPhe: 3.215 ± 0.075
3.215GluGly: 3.215 ± 2.385
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
3.215GluLys: 3.215 ± 0.075
1.608GluLeu: 1.608 ± 1.193
0.0GluMet: 0.0 ± 0.0
4.823GluAsn: 4.823 ± 3.803
6.431GluPro: 6.431 ± 0.15
0.0GluGln: 0.0 ± 0.0
4.823GluArg: 4.823 ± 3.578
4.823GluSer: 4.823 ± 3.578
4.823GluThr: 4.823 ± 1.117
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.608PheAla: 1.608 ± 1.268
0.0PheCys: 0.0 ± 0.0
4.823PheAsp: 4.823 ± 1.117
6.431PheGlu: 6.431 ± 2.31
4.823PhePhe: 4.823 ± 3.578
3.215PheGly: 3.215 ± 0.075
1.608PheHis: 1.608 ± 1.193
1.608PheIle: 1.608 ± 1.268
1.608PheLys: 1.608 ± 1.268
1.608PheLeu: 1.608 ± 1.268
3.215PheMet: 3.215 ± 0.075
0.0PheAsn: 0.0 ± 0.0
3.215PhePro: 3.215 ± 0.075
1.608PheGln: 1.608 ± 1.193
6.431PheArg: 6.431 ± 0.15
6.431PheSer: 6.431 ± 2.31
3.215PheThr: 3.215 ± 0.075
1.608PheVal: 1.608 ± 1.193
3.215PheTrp: 3.215 ± 2.385
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.823GlyAla: 4.823 ± 1.343
1.608GlyCys: 1.608 ± 1.193
12.862GlyAsp: 12.862 ± 2.761
3.215GlyGlu: 3.215 ± 0.075
3.215GlyPhe: 3.215 ± 0.075
9.646GlyGly: 9.646 ± 2.235
3.215GlyHis: 3.215 ± 2.385
4.823GlyIle: 4.823 ± 1.343
1.608GlyLys: 1.608 ± 1.193
6.431GlyLeu: 6.431 ± 4.77
3.215GlyMet: 3.215 ± 2.535
3.215GlyAsn: 3.215 ± 2.385
1.608GlyPro: 1.608 ± 1.193
3.215GlyGln: 3.215 ± 0.075
4.823GlyArg: 4.823 ± 3.578
1.608GlySer: 1.608 ± 1.268
6.431GlyThr: 6.431 ± 2.611
3.215GlyVal: 3.215 ± 0.075
3.215GlyTrp: 3.215 ± 2.385
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
3.215HisAla: 3.215 ± 2.385
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.608HisGlu: 1.608 ± 1.268
1.608HisPhe: 1.608 ± 1.268
0.0HisGly: 0.0 ± 0.0
1.608HisHis: 1.608 ± 1.193
0.0HisIle: 0.0 ± 0.0
1.608HisLys: 1.608 ± 1.193
1.608HisLeu: 1.608 ± 1.193
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
3.215HisPro: 3.215 ± 2.385
1.608HisGln: 1.608 ± 1.268
0.0HisArg: 0.0 ± 0.0
1.608HisSer: 1.608 ± 1.193
1.608HisThr: 1.608 ± 1.193
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.215IleAla: 3.215 ± 2.535
1.608IleCys: 1.608 ± 1.268
8.039IleAsp: 8.039 ± 3.878
3.215IleGlu: 3.215 ± 2.535
3.215IlePhe: 3.215 ± 2.385
1.608IleGly: 1.608 ± 1.193
3.215IleHis: 3.215 ± 2.385
3.215IleIle: 3.215 ± 0.075
1.608IleLys: 1.608 ± 1.193
3.215IleLeu: 3.215 ± 0.075
0.0IleMet: 0.0 ± 0.0
1.608IleAsn: 1.608 ± 1.193
1.608IlePro: 1.608 ± 1.268
1.608IleGln: 1.608 ± 1.268
0.0IleArg: 0.0 ± 0.0
4.823IleSer: 4.823 ± 1.117
0.0IleThr: 0.0 ± 0.0
1.608IleVal: 1.608 ± 1.193
0.0IleTrp: 0.0 ± 0.0
4.823IleTyr: 4.823 ± 1.343
0.0IleXaa: 0.0 ± 0.0
Lys
1.608LysAla: 1.608 ± 1.268
0.0LysCys: 0.0 ± 0.0
1.608LysAsp: 1.608 ± 1.268
1.608LysGlu: 1.608 ± 1.268
3.215LysPhe: 3.215 ± 2.385
3.215LysGly: 3.215 ± 0.075
0.0LysHis: 0.0 ± 0.0
3.215LysIle: 3.215 ± 0.075
3.215LysLys: 3.215 ± 2.535
0.0LysLeu: 0.0 ± 0.0
1.608LysMet: 1.608 ± 0.819
1.608LysAsn: 1.608 ± 1.268
0.0LysPro: 0.0 ± 0.0
3.215LysGln: 3.215 ± 0.075
3.215LysArg: 3.215 ± 2.535
0.0LysSer: 0.0 ± 0.0
4.823LysThr: 4.823 ± 1.343
0.0LysVal: 0.0 ± 0.0
1.608LysTrp: 1.608 ± 1.193
4.823LysTyr: 4.823 ± 3.578
0.0LysXaa: 0.0 ± 0.0
Leu
6.431LeuAla: 6.431 ± 0.15
0.0LeuCys: 0.0 ± 0.0
4.823LeuAsp: 4.823 ± 3.578
3.215LeuGlu: 3.215 ± 2.385
1.608LeuPhe: 1.608 ± 1.193
6.431LeuGly: 6.431 ± 4.77
1.608LeuHis: 1.608 ± 1.193
0.0LeuIle: 0.0 ± 0.0
0.0LeuLys: 0.0 ± 0.0
4.823LeuLeu: 4.823 ± 3.578
1.608LeuMet: 1.608 ± 1.193
3.215LeuAsn: 3.215 ± 2.535
0.0LeuPro: 0.0 ± 0.0
0.0LeuGln: 0.0 ± 0.0
6.431LeuArg: 6.431 ± 0.15
3.215LeuSer: 3.215 ± 2.385
1.608LeuThr: 1.608 ± 1.193
4.823LeuVal: 4.823 ± 1.117
3.215LeuTrp: 3.215 ± 0.075
3.215LeuTyr: 3.215 ± 0.075
0.0LeuXaa: 0.0 ± 0.0
Met
1.608MetAla: 1.608 ± 1.268
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
3.215MetPhe: 3.215 ± 0.075
1.608MetGly: 1.608 ± 1.268
0.0MetHis: 0.0 ± 0.0
1.608MetIle: 1.608 ± 1.193
1.608MetLys: 1.608 ± 1.268
1.608MetLeu: 1.608 ± 1.193
1.608MetMet: 1.608 ± 1.268
1.608MetAsn: 1.608 ± 1.268
1.608MetPro: 1.608 ± 1.193
3.215MetGln: 3.215 ± 0.075
3.215MetArg: 3.215 ± 2.535
0.0MetSer: 0.0 ± 0.0
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
3.215AsnCys: 3.215 ± 0.075
3.215AsnAsp: 3.215 ± 2.535
0.0AsnGlu: 0.0 ± 0.0
3.215AsnPhe: 3.215 ± 0.075
3.215AsnGly: 3.215 ± 2.535
1.608AsnHis: 1.608 ± 1.268
3.215AsnIle: 3.215 ± 0.075
4.823AsnLys: 4.823 ± 3.803
1.608AsnLeu: 1.608 ± 1.193
0.0AsnMet: 0.0 ± 0.0
4.823AsnAsn: 4.823 ± 1.117
1.608AsnPro: 1.608 ± 1.193
1.608AsnGln: 1.608 ± 1.268
1.608AsnArg: 1.608 ± 1.193
1.608AsnSer: 1.608 ± 1.268
3.215AsnThr: 3.215 ± 0.075
4.823AsnVal: 4.823 ± 1.343
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.608ProAla: 1.608 ± 1.268
1.608ProCys: 1.608 ± 1.193
1.608ProAsp: 1.608 ± 1.268
6.431ProGlu: 6.431 ± 4.77
3.215ProPhe: 3.215 ± 2.385
8.039ProGly: 8.039 ± 1.418
0.0ProHis: 0.0 ± 0.0
4.823ProIle: 4.823 ± 1.117
0.0ProLys: 0.0 ± 0.0
1.608ProLeu: 1.608 ± 1.193
0.0ProMet: 0.0 ± 0.0
1.608ProAsn: 1.608 ± 1.193
1.608ProPro: 1.608 ± 1.193
3.215ProGln: 3.215 ± 2.535
3.215ProArg: 3.215 ± 0.075
1.608ProSer: 1.608 ± 1.268
1.608ProThr: 1.608 ± 1.268
0.0ProVal: 0.0 ± 0.0
0.0ProTrp: 0.0 ± 0.0
4.823ProTyr: 4.823 ± 1.343
0.0ProXaa: 0.0 ± 0.0
Gln
3.215GlnAla: 3.215 ± 0.075
1.608GlnCys: 1.608 ± 1.193
1.608GlnAsp: 1.608 ± 1.268
1.608GlnGlu: 1.608 ± 1.268
1.608GlnPhe: 1.608 ± 1.268
4.823GlnGly: 4.823 ± 1.343
1.608GlnHis: 1.608 ± 1.193
0.0GlnIle: 0.0 ± 0.0
1.608GlnLys: 1.608 ± 1.193
1.608GlnLeu: 1.608 ± 1.193
0.0GlnMet: 0.0 ± 0.0
3.215GlnAsn: 3.215 ± 0.075
1.608GlnPro: 1.608 ± 1.193
1.608GlnGln: 1.608 ± 1.193
0.0GlnArg: 0.0 ± 0.0
1.608GlnSer: 1.608 ± 1.268
6.431GlnThr: 6.431 ± 5.071
3.215GlnVal: 3.215 ± 2.535
1.608GlnTrp: 1.608 ± 1.193
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.215ArgAla: 3.215 ± 0.075
1.608ArgCys: 1.608 ± 1.193
1.608ArgAsp: 1.608 ± 1.193
3.215ArgGlu: 3.215 ± 2.385
4.823ArgPhe: 4.823 ± 1.343
4.823ArgGly: 4.823 ± 3.803
1.608ArgHis: 1.608 ± 1.193
1.608ArgIle: 1.608 ± 1.268
4.823ArgLys: 4.823 ± 1.343
6.431ArgLeu: 6.431 ± 2.611
0.0ArgMet: 0.0 ± 0.0
1.608ArgAsn: 1.608 ± 1.193
9.646ArgPro: 9.646 ± 0.225
1.608ArgGln: 1.608 ± 1.268
20.9ArgArg: 20.9 ± 11.56
6.431ArgSer: 6.431 ± 2.31
8.039ArgThr: 8.039 ± 3.878
8.039ArgVal: 8.039 ± 1.042
1.608ArgTrp: 1.608 ± 1.268
4.823ArgTyr: 4.823 ± 1.343
0.0ArgXaa: 0.0 ± 0.0
Ser
4.823SerAla: 4.823 ± 1.343
0.0SerCys: 0.0 ± 0.0
1.608SerAsp: 1.608 ± 1.268
1.608SerGlu: 1.608 ± 1.193
4.823SerPhe: 4.823 ± 3.578
4.823SerGly: 4.823 ± 1.343
0.0SerHis: 0.0 ± 0.0
0.0SerIle: 0.0 ± 0.0
1.608SerLys: 1.608 ± 1.268
9.646SerLeu: 9.646 ± 2.235
1.608SerMet: 1.608 ± 1.268
6.431SerAsn: 6.431 ± 2.611
0.0SerPro: 0.0 ± 0.0
1.608SerGln: 1.608 ± 1.193
6.431SerArg: 6.431 ± 0.15
6.431SerSer: 6.431 ± 2.31
4.823SerThr: 4.823 ± 1.343
6.431SerVal: 6.431 ± 2.31
0.0SerTrp: 0.0 ± 0.0
1.608SerTyr: 1.608 ± 1.193
0.0SerXaa: 0.0 ± 0.0
Thr
4.823ThrAla: 4.823 ± 1.117
1.608ThrCys: 1.608 ± 1.193
8.039ThrAsp: 8.039 ± 1.418
0.0ThrGlu: 0.0 ± 0.0
3.215ThrPhe: 3.215 ± 0.075
4.823ThrGly: 4.823 ± 1.343
0.0ThrHis: 0.0 ± 0.0
3.215ThrIle: 3.215 ± 0.075
1.608ThrLys: 1.608 ± 1.193
4.823ThrLeu: 4.823 ± 1.117
3.215ThrMet: 3.215 ± 2.535
4.823ThrAsn: 4.823 ± 3.803
3.215ThrPro: 3.215 ± 0.075
3.215ThrGln: 3.215 ± 2.535
4.823ThrArg: 4.823 ± 3.803
8.039ThrSer: 8.039 ± 3.878
1.608ThrThr: 1.608 ± 1.268
1.608ThrVal: 1.608 ± 1.268
1.608ThrTrp: 1.608 ± 1.268
3.215ThrTyr: 3.215 ± 0.075
0.0ThrXaa: 0.0 ± 0.0
Val
3.215ValAla: 3.215 ± 0.075
1.608ValCys: 1.608 ± 1.193
9.646ValAsp: 9.646 ± 7.156
3.215ValGlu: 3.215 ± 0.075
3.215ValPhe: 3.215 ± 0.075
3.215ValGly: 3.215 ± 2.385
1.608ValHis: 1.608 ± 1.268
3.215ValIle: 3.215 ± 2.535
1.608ValLys: 1.608 ± 1.268
1.608ValLeu: 1.608 ± 1.193
0.0ValMet: 0.0 ± 0.0
1.608ValAsn: 1.608 ± 1.268
1.608ValPro: 1.608 ± 1.268
1.608ValGln: 1.608 ± 1.193
8.039ValArg: 8.039 ± 1.418
3.215ValSer: 3.215 ± 0.075
1.608ValThr: 1.608 ± 1.268
4.823ValVal: 4.823 ± 1.117
3.215ValTrp: 3.215 ± 0.075
1.608ValTyr: 1.608 ± 1.268
0.0ValXaa: 0.0 ± 0.0
Trp
4.823TrpAla: 4.823 ± 3.578
1.608TrpCys: 1.608 ± 1.268
1.608TrpAsp: 1.608 ± 1.193
0.0TrpGlu: 0.0 ± 0.0
1.608TrpPhe: 1.608 ± 1.268
1.608TrpGly: 1.608 ± 1.193
1.608TrpHis: 1.608 ± 1.268
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
3.215TrpLeu: 3.215 ± 2.385
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
4.823TrpArg: 4.823 ± 1.343
3.215TrpSer: 3.215 ± 0.075
1.608TrpThr: 1.608 ± 1.268
3.215TrpVal: 3.215 ± 0.075
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.823TyrAla: 4.823 ± 3.578
0.0TyrCys: 0.0 ± 0.0
3.215TyrAsp: 3.215 ± 2.535
0.0TyrGlu: 0.0 ± 0.0
1.608TyrPhe: 1.608 ± 1.193
4.823TyrGly: 4.823 ± 3.578
0.0TyrHis: 0.0 ± 0.0
4.823TyrIle: 4.823 ± 1.117
3.215TyrLys: 3.215 ± 0.075
0.0TyrLeu: 0.0 ± 0.0
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
0.0TyrPro: 0.0 ± 0.0
1.608TyrGln: 1.608 ± 1.268
6.431TyrArg: 6.431 ± 0.15
4.823TyrSer: 4.823 ± 1.343
0.0TyrThr: 0.0 ± 0.0
3.215TyrVal: 3.215 ± 2.535
1.608TyrTrp: 1.608 ± 1.268
4.823TyrTyr: 4.823 ± 3.803
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (623 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
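One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each resample, and the spread across resamples is reported as the error. The page does not describe the exact procedure, so the function names, the use of the sample standard deviation, the fixed seed, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins):
    """Dipeptide frequencies in per mille; pairs are counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}


def bootstrap_error(proteins, rounds=100, seed=0):
    """Estimate a per-dipeptide error by resampling whole proteins.

    Assumption: the error is the sample standard deviation of the
    per mille frequency over `rounds` resamples (needs rounds >= 2).
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(per_mille(resampled))
    pairs = set().union(*samples)
    return {
        p: statistics.stdev([s.get(p, 0.0) for s in samples])
        for p in pairs
    }


# Hypothetical one-letter sequences, for illustration only.
toy_proteins = ["MARRAG", "GRRA"]
errors = bootstrap_error(toy_proteins)
```

With a proteome of only two proteins, each resample is one of just a few possible combinations, so the estimated errors can take only a small number of distinct values.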

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski