Amino acid dipeptide frequency for Bovine faeces associated circular DNA virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
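The counting described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_permille`, the use of overlapping windows, the mapping of any non-standard letter to X (Xaa), and the toy sequences are all assumptions.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns; X stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters counts as Xaa.
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(ALPHABET, repeat=2)}

# Hypothetical toy sequences, not the proteins of this proteome.
freqs = dipeptide_permille(["MAAK", "AAL"])

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
uniform = 1000.0 / 400
over = sorted(dp for dp, f in freqs.items() if f > uniform)
under = sorted(dp for dp, f in freqs.items() if f < uniform)
```

Here `over` and `under` play the role of the red and blue markings; the exact threshold the page uses for colouring is not stated, so the simple comparison against 2.5‰ is only a stand-in.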

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
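As a small worked example of the unit conversion, the sketch below converts the AlaAla value from the table into a fraction and a percentage and compares it with the 0.25% uniform expectation. The helper names are hypothetical and chosen only for this illustration.

```python
def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def permille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1

ala_ala = 12.802  # AlaAla frequency from the table, in per mille

fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)

# The uniform expectation of 0.25% corresponds to 2.5 per mille.
uniform_permille = 0.25 * 10
ratio = ala_ala / uniform_permille  # how many times the uniform expectation
```

With these numbers AlaAla comes out at roughly five times the uniform expectation.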

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.802 ± 0.028
AlaCys: 0.0 ± 0.0
AlaAsp: 8.535 ± 1.423
AlaGlu: 5.69 ± 1.65
AlaPhe: 2.845 ± 2.333
AlaGly: 7.112 ± 3.728
AlaHis: 2.845 ± 0.228
AlaIle: 1.422 ± 0.939
AlaLys: 7.112 ± 3.728
AlaLeu: 5.69 ± 1.65
AlaMet: 4.267 ± 0.711
AlaAsn: 4.267 ± 3.5
AlaPro: 2.845 ± 0.228
AlaGln: 4.267 ± 1.394
AlaArg: 5.69 ± 0.455
AlaSer: 8.535 ± 2.789
AlaThr: 2.845 ± 2.333
AlaVal: 7.112 ± 3.728
AlaTrp: 1.422 ± 1.167
AlaTyr: 7.112 ± 1.622
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.422 ± 0.939
CysGly: 5.69 ± 3.756
CysHis: 0.0 ± 0.0
CysIle: 4.267 ± 1.394
CysLys: 1.422 ± 0.939
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.422 ± 1.167
CysGln: 1.422 ± 1.167
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 1.422 ± 1.167
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.422 ± 1.167
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.422 ± 0.939
AspCys: 0.0 ± 0.0
AspAsp: 4.267 ± 0.711
AspGlu: 9.957 ± 2.362
AspPhe: 1.422 ± 0.939
AspGly: 4.267 ± 2.817
AspHis: 1.422 ± 0.939
AspIle: 5.69 ± 0.455
AspLys: 1.422 ± 0.939
AspLeu: 7.112 ± 3.728
AspMet: 1.422 ± 0.939
AspAsn: 0.0 ± 0.0
AspPro: 5.69 ± 3.756
AspGln: 4.267 ± 0.711
AspArg: 4.267 ± 0.711
AspSer: 4.267 ± 0.711
AspThr: 1.422 ± 0.939
AspVal: 1.422 ± 1.167
AspTrp: 0.0 ± 0.0
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.267 ± 1.394
GluCys: 0.0 ± 0.0
GluAsp: 7.112 ± 1.622
GluGlu: 5.69 ± 1.65
GluPhe: 2.845 ± 0.228
GluGly: 0.0 ± 0.0
GluHis: 1.422 ± 0.939
GluIle: 2.845 ± 1.878
GluLys: 2.845 ± 0.228
GluLeu: 4.267 ± 0.711
GluMet: 0.0 ± 0.0
GluAsn: 2.845 ± 0.228
GluPro: 2.845 ± 0.228
GluGln: 0.0 ± 0.0
GluArg: 2.845 ± 1.878
GluSer: 2.845 ± 0.228
GluThr: 2.845 ± 1.878
GluVal: 9.957 ± 1.85
GluTrp: 1.422 ± 0.939
GluTyr: 2.845 ± 1.878
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.422 ± 1.167
PheCys: 2.845 ± 0.228
PheAsp: 0.0 ± 0.0
PheGlu: 7.112 ± 0.484
PhePhe: 2.845 ± 0.228
PheGly: 1.422 ± 1.167
PheHis: 0.0 ± 0.0
PheIle: 2.845 ± 1.878
PheLys: 4.267 ± 0.711
PheLeu: 1.422 ± 0.939
PheMet: 0.0 ± 0.0
PheAsn: 1.422 ± 0.939
PhePro: 1.422 ± 0.939
PheGln: 1.422 ± 0.939
PheArg: 2.845 ± 2.333
PheSer: 0.0 ± 0.0
PheThr: 5.69 ± 0.455
PheVal: 1.422 ± 1.167
PheTrp: 1.422 ± 0.939
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.535 ± 0.683
GlyCys: 0.0 ± 0.0
GlyAsp: 2.845 ± 0.228
GlyGlu: 1.422 ± 0.939
GlyPhe: 7.112 ± 0.484
GlyGly: 1.422 ± 0.939
GlyHis: 2.845 ± 1.878
GlyIle: 0.0 ± 0.0
GlyLys: 2.845 ± 1.878
GlyLeu: 2.845 ± 2.333
GlyMet: 1.422 ± 0.939
GlyAsn: 1.422 ± 0.939
GlyPro: 2.845 ± 1.878
GlyGln: 4.267 ± 2.817
GlyArg: 5.69 ± 0.455
GlySer: 7.112 ± 3.728
GlyThr: 8.535 ± 0.683
GlyVal: 1.422 ± 0.939
GlyTrp: 2.845 ± 1.878
GlyTyr: 4.267 ± 1.394
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.422 ± 0.939
HisCys: 0.0 ± 0.0
HisAsp: 1.422 ± 1.167
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 1.422 ± 0.939
HisHis: 2.845 ± 1.878
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.422 ± 0.939
HisMet: 1.422 ± 0.671
HisAsn: 0.0 ± 0.0
HisPro: 2.845 ± 0.228
HisGln: 0.0 ± 0.0
HisArg: 1.422 ± 0.939
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 2.845 ± 1.878
HisTyr: 1.422 ± 0.939
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.69 ± 0.455
IleCys: 1.422 ± 0.939
IleAsp: 4.267 ± 2.817
IleGlu: 2.845 ± 1.878
IlePhe: 1.422 ± 0.939
IleGly: 2.845 ± 0.228
IleHis: 0.0 ± 0.0
IleIle: 8.535 ± 3.528
IleLys: 2.845 ± 0.228
IleLeu: 1.422 ± 0.939
IleMet: 1.422 ± 1.167
IleAsn: 2.845 ± 0.228
IlePro: 4.267 ± 0.711
IleGln: 2.845 ± 1.878
IleArg: 1.422 ± 0.939
IleSer: 0.0 ± 0.0
IleThr: 2.845 ± 1.878
IleVal: 1.422 ± 1.167
IleTrp: 0.0 ± 0.0
IleTyr: 1.422 ± 0.939
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.845 ± 0.228
LysCys: 1.422 ± 1.167
LysAsp: 7.112 ± 4.695
LysGlu: 1.422 ± 0.939
LysPhe: 1.422 ± 1.167
LysGly: 2.845 ± 1.878
LysHis: 0.0 ± 0.0
LysIle: 2.845 ± 0.228
LysLys: 5.69 ± 0.455
LysLeu: 4.267 ± 0.711
LysMet: 0.0 ± 0.754
LysAsn: 0.0 ± 0.0
LysPro: 0.0 ± 0.0
LysGln: 0.0 ± 0.0
LysArg: 8.535 ± 3.528
LysSer: 2.845 ± 0.228
LysThr: 2.845 ± 2.333
LysVal: 5.69 ± 0.455
LysTrp: 1.422 ± 1.167
LysTyr: 1.422 ± 1.167
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.802 ± 2.077
LeuCys: 0.0 ± 0.0
LeuAsp: 7.112 ± 2.589
LeuGlu: 2.845 ± 0.228
LeuPhe: 0.0 ± 0.0
LeuGly: 5.69 ± 0.455
LeuHis: 0.0 ± 0.0
LeuIle: 0.0 ± 0.0
LeuLys: 4.267 ± 2.817
LeuLeu: 1.422 ± 0.939
LeuMet: 2.845 ± 0.228
LeuAsn: 1.422 ± 1.167
LeuPro: 0.0 ± 0.0
LeuGln: 5.69 ± 3.756
LeuArg: 1.422 ± 0.939
LeuSer: 0.0 ± 0.0
LeuThr: 2.845 ± 1.878
LeuVal: 0.0 ± 0.0
LeuTrp: 1.422 ± 0.939
LeuTyr: 5.69 ± 4.667
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 1.422 ± 1.167
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 2.845 ± 0.228
MetLys: 1.422 ± 0.939
MetLeu: 2.845 ± 1.878
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 2.845 ± 0.228
MetGln: 1.422 ± 0.939
MetArg: 2.845 ± 0.228
MetSer: 1.422 ± 0.939
MetThr: 0.0 ± 0.0
MetVal: 2.845 ± 1.878
MetTrp: 0.0 ± 0.0
MetTyr: 2.845 ± 0.228
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.422 ± 1.167
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 0.0 ± 0.0
AsnPhe: 1.422 ± 1.167
AsnGly: 5.69 ± 0.455
AsnHis: 1.422 ± 0.939
AsnIle: 0.0 ± 0.0
AsnLys: 0.0 ± 0.0
AsnLeu: 1.422 ± 1.167
AsnMet: 0.0 ± 0.0
AsnAsn: 2.845 ± 1.878
AsnPro: 5.69 ± 0.455
AsnGln: 1.422 ± 0.939
AsnArg: 0.0 ± 0.0
AsnSer: 2.845 ± 2.333
AsnThr: 1.422 ± 0.939
AsnVal: 2.845 ± 1.878
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.112 ± 2.589
ProCys: 0.0 ± 0.0
ProAsp: 2.845 ± 1.878
ProGlu: 2.845 ± 0.228
ProPhe: 4.267 ± 0.711
ProGly: 11.38 ± 1.195
ProHis: 1.422 ± 0.939
ProIle: 1.422 ± 0.939
ProLys: 4.267 ± 0.711
ProLeu: 4.267 ± 1.394
ProMet: 0.0 ± 0.0
ProAsn: 2.845 ± 0.228
ProPro: 1.422 ± 0.939
ProGln: 0.0 ± 0.0
ProArg: 0.0 ± 0.0
ProSer: 2.845 ± 0.228
ProThr: 1.422 ± 0.939
ProVal: 8.535 ± 0.683
ProTrp: 0.0 ± 0.0
ProTyr: 2.845 ± 1.878
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 7.112 ± 1.622
GlnCys: 1.422 ± 0.939
GlnAsp: 0.0 ± 0.0
GlnGlu: 2.845 ± 0.228
GlnPhe: 1.422 ± 1.167
GlnGly: 2.845 ± 1.878
GlnHis: 0.0 ± 0.0
GlnIle: 4.267 ± 2.817
GlnLys: 1.422 ± 0.939
GlnLeu: 1.422 ± 0.939
GlnMet: 0.0 ± 0.0
GlnAsn: 1.422 ± 0.939
GlnPro: 1.422 ± 0.939
GlnGln: 4.267 ± 0.711
GlnArg: 1.422 ± 0.939
GlnSer: 0.0 ± 0.0
GlnThr: 0.0 ± 0.0
GlnVal: 4.267 ± 3.5
GlnTrp: 2.845 ± 0.228
GlnTyr: 2.845 ± 1.878
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.535 ± 4.894
ArgCys: 1.422 ± 0.939
ArgAsp: 2.845 ± 0.228
ArgGlu: 1.422 ± 1.167
ArgPhe: 4.267 ± 0.711
ArgGly: 2.845 ± 1.878
ArgHis: 0.0 ± 0.0
ArgIle: 1.422 ± 1.167
ArgLys: 1.422 ± 1.167
ArgLeu: 4.267 ± 2.817
ArgMet: 1.422 ± 0.939
ArgAsn: 2.845 ± 0.228
ArgPro: 5.69 ± 3.756
ArgGln: 0.0 ± 0.0
ArgArg: 11.38 ± 9.333
ArgSer: 5.69 ± 2.561
ArgThr: 5.69 ± 1.65
ArgVal: 0.0 ± 0.0
ArgTrp: 1.422 ± 0.939
ArgTyr: 8.535 ± 4.894
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.69 ± 4.667
SerCys: 2.845 ± 2.333
SerAsp: 2.845 ± 0.228
SerGlu: 4.267 ± 1.394
SerPhe: 0.0 ± 0.0
SerGly: 2.845 ± 0.228
SerHis: 0.0 ± 0.0
SerIle: 4.267 ± 0.711
SerLys: 1.422 ± 1.167
SerLeu: 5.69 ± 2.561
SerMet: 1.422 ± 0.939
SerAsn: 2.845 ± 0.228
SerPro: 2.845 ± 0.228
SerGln: 4.267 ± 1.394
SerArg: 1.422 ± 1.167
SerSer: 5.69 ± 4.667
SerThr: 1.422 ± 1.167
SerVal: 7.112 ± 3.728
SerTrp: 0.0 ± 0.0
SerTyr: 4.267 ± 1.394
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.535 ± 2.789
ThrCys: 1.422 ± 0.939
ThrAsp: 1.422 ± 0.939
ThrGlu: 0.0 ± 0.0
ThrPhe: 0.0 ± 0.0
ThrGly: 4.267 ± 0.711
ThrHis: 0.0 ± 0.0
ThrIle: 2.845 ± 1.878
ThrLys: 4.267 ± 1.394
ThrLeu: 2.845 ± 1.878
ThrMet: 1.422 ± 0.939
ThrAsn: 1.422 ± 0.939
ThrPro: 7.112 ± 1.622
ThrGln: 0.0 ± 0.0
ThrArg: 2.845 ± 1.878
ThrSer: 7.112 ± 3.728
ThrThr: 0.0 ± 0.0
ThrVal: 2.845 ± 0.228
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.422 ± 0.939
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.69 ± 0.455
ValCys: 2.845 ± 2.333
ValAsp: 2.845 ± 0.228
ValGlu: 4.267 ± 1.394
ValPhe: 4.267 ± 0.711
ValGly: 1.422 ± 1.167
ValHis: 0.0 ± 0.0
ValIle: 4.267 ± 2.817
ValLys: 4.267 ± 1.394
ValLeu: 0.0 ± 0.0
ValMet: 2.845 ± 0.228
ValAsn: 0.0 ± 0.0
ValPro: 4.267 ± 0.711
ValGln: 2.845 ± 2.333
ValArg: 7.112 ± 3.728
ValSer: 5.69 ± 4.667
ValThr: 5.69 ± 0.455
ValVal: 1.422 ± 0.939
ValTrp: 1.422 ± 0.939
ValTyr: 1.422 ± 1.167
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.422 ± 0.939
TrpAsp: 2.845 ± 0.228
TrpGlu: 2.845 ± 1.878
TrpPhe: 1.422 ± 0.939
TrpGly: 1.422 ± 0.939
TrpHis: 1.422 ± 1.167
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.422 ± 0.939
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.422 ± 0.939
TrpArg: 0.0 ± 0.0
TrpSer: 2.845 ± 0.228
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 2.845 ± 1.878
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 7.112 ± 3.728
TyrCys: 1.422 ± 0.939
TyrAsp: 1.422 ± 0.939
TyrGlu: 2.845 ± 0.228
TyrPhe: 1.422 ± 0.939
TyrGly: 4.267 ± 3.5
TyrHis: 2.845 ± 1.878
TyrIle: 0.0 ± 0.0
TyrLys: 2.845 ± 1.878
TyrLeu: 1.422 ± 0.939
TyrMet: 1.422 ± 0.939
TyrAsn: 0.0 ± 0.0
TyrPro: 4.267 ± 0.711
TyrGln: 1.422 ± 1.167
TyrArg: 9.957 ± 6.061
TyrSer: 1.422 ± 1.167
TyrThr: 2.845 ± 0.228
TyrVal: 4.267 ± 1.394
TyrTrp: 1.422 ± 0.939
TyrTyr: 4.267 ± 3.5
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (704 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
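One way to picture a protein-level bootstrap is sketched below. It is an illustration under stated assumptions, not the database's actual procedure: it reads "×100" as 100 replicates, resamples whole proteins with replacement, and reports the standard deviation across replicates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    values = []
    for _ in range(replicates):
        # Draw whole proteins with replacement; residues are never shuffled.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(values)

# Hypothetical toy sequences, not the proteins of this proteome.
proteins = ["MAAKAA", "MKLLAA"]
point = dipeptide_permille(proteins, "AA")
error = bootstrap_error(proteins, "AA")
```

Under this scheme a proteome of only 2 proteins yields just three distinct resample compositions (first protein twice, second protein twice, or one of each), so the estimated errors can only take a small set of values.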

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski