Amino acid dipepetide frequency for Bovine faeces associated smacovirus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.311AlaAla: 3.311 ± 2.03
1.656AlaCys: 1.656 ± 1.404
4.967AlaAsp: 4.967 ± 0.627
0.0AlaGlu: 0.0 ± 0.0
1.656AlaPhe: 1.656 ± 1.015
3.311AlaGly: 3.311 ± 2.03
1.656AlaHis: 1.656 ± 1.015
1.656AlaIle: 1.656 ± 1.015
1.656AlaLys: 1.656 ± 1.015
1.656AlaLeu: 1.656 ± 1.015
3.311AlaMet: 3.311 ± 0.388
3.311AlaAsn: 3.311 ± 2.03
1.656AlaPro: 1.656 ± 1.015
0.0AlaGln: 0.0 ± 0.0
4.967AlaArg: 4.967 ± 1.792
1.656AlaSer: 1.656 ± 1.015
1.656AlaThr: 1.656 ± 1.015
8.278AlaVal: 8.278 ± 2.181
0.0AlaTrp: 0.0 ± 0.0
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.656CysAsp: 1.656 ± 1.404
3.311CysGlu: 3.311 ± 0.388
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
1.656CysLeu: 1.656 ± 1.404
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.656CysSer: 1.656 ± 1.404
1.656CysThr: 1.656 ± 1.404
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.967AspAla: 4.967 ± 1.792
1.656AspCys: 1.656 ± 1.404
3.311AspAsp: 3.311 ± 2.807
3.311AspGlu: 3.311 ± 2.03
3.311AspPhe: 3.311 ± 2.807
4.967AspGly: 4.967 ± 0.627
1.656AspHis: 1.656 ± 1.404
4.967AspIle: 4.967 ± 1.792
3.311AspLys: 3.311 ± 2.03
8.278AspLeu: 8.278 ± 0.238
1.656AspMet: 1.656 ± 1.015
4.967AspAsn: 4.967 ± 0.627
1.656AspPro: 1.656 ± 1.404
1.656AspGln: 1.656 ± 1.015
4.967AspArg: 4.967 ± 4.211
1.656AspSer: 1.656 ± 1.015
1.656AspThr: 1.656 ± 1.015
6.623AspVal: 6.623 ± 4.061
1.656AspTrp: 1.656 ± 1.404
1.656AspTyr: 1.656 ± 1.015
0.0AspXaa: 0.0 ± 0.0
Glu
1.656GluAla: 1.656 ± 1.015
0.0GluCys: 0.0 ± 0.0
3.311GluAsp: 3.311 ± 2.807
0.0GluGlu: 0.0 ± 0.0
1.656GluPhe: 1.656 ± 1.015
1.656GluGly: 1.656 ± 1.015
0.0GluHis: 0.0 ± 0.0
1.656GluIle: 1.656 ± 1.015
4.967GluLys: 4.967 ± 1.792
4.967GluLeu: 4.967 ± 4.211
1.656GluMet: 1.656 ± 1.404
1.656GluAsn: 1.656 ± 1.404
1.656GluPro: 1.656 ± 1.404
1.656GluGln: 1.656 ± 1.404
3.311GluArg: 3.311 ± 2.807
8.278GluSer: 8.278 ± 4.599
4.967GluThr: 4.967 ± 0.627
6.623GluVal: 6.623 ± 4.061
0.0GluTrp: 0.0 ± 0.0
3.311GluTyr: 3.311 ± 2.807
0.0GluXaa: 0.0 ± 0.0
Phe
3.311PheAla: 3.311 ± 0.388
0.0PheCys: 0.0 ± 0.0
1.656PheAsp: 1.656 ± 1.015
0.0PheGlu: 0.0 ± 0.0
1.656PhePhe: 1.656 ± 1.404
4.967PheGly: 4.967 ± 1.792
0.0PheHis: 0.0 ± 0.0
1.656PheIle: 1.656 ± 1.404
0.0PheLys: 0.0 ± 0.0
0.0PheLeu: 0.0 ± 0.0
1.656PheMet: 1.656 ± 1.015
1.656PheAsn: 1.656 ± 1.015
1.656PhePro: 1.656 ± 1.015
1.656PheGln: 1.656 ± 1.404
1.656PheArg: 1.656 ± 1.015
1.656PheSer: 1.656 ± 1.015
0.0PheThr: 0.0 ± 0.0
0.0PheVal: 0.0 ± 0.0
1.656PheTrp: 1.656 ± 1.404
3.311PheTyr: 3.311 ± 0.388
0.0PheXaa: 0.0 ± 0.0
Gly
3.311GlyAla: 3.311 ± 2.03
0.0GlyCys: 0.0 ± 0.0
4.967GlyAsp: 4.967 ± 0.627
0.0GlyGlu: 0.0 ± 0.0
3.311GlyPhe: 3.311 ± 2.03
1.656GlyGly: 1.656 ± 1.404
1.656GlyHis: 1.656 ± 1.404
9.934GlyIle: 9.934 ± 1.165
8.278GlyLys: 8.278 ± 4.599
9.934GlyLeu: 9.934 ± 6.091
1.656GlyMet: 1.656 ± 1.015
0.0GlyAsn: 0.0 ± 0.0
0.0GlyPro: 0.0 ± 0.0
1.656GlyGln: 1.656 ± 1.015
8.278GlyArg: 8.278 ± 2.181
9.934GlySer: 9.934 ± 3.672
3.311GlyThr: 3.311 ± 0.388
0.0GlyVal: 0.0 ± 0.0
1.656GlyTrp: 1.656 ± 1.404
4.967GlyTyr: 4.967 ± 1.792
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.656HisAsp: 1.656 ± 1.404
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
3.311HisGly: 3.311 ± 0.388
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
3.311HisLys: 3.311 ± 2.807
1.656HisLeu: 1.656 ± 1.404
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.656HisPro: 1.656 ± 1.015
0.0HisGln: 0.0 ± 0.0
3.311HisArg: 3.311 ± 0.388
0.0HisSer: 0.0 ± 0.0
3.311HisThr: 3.311 ± 2.03
1.656HisVal: 1.656 ± 1.404
0.0HisTrp: 0.0 ± 0.0
3.311HisTyr: 3.311 ± 2.807
0.0HisXaa: 0.0 ± 0.0
Ile
6.623IleAla: 6.623 ± 1.642
0.0IleCys: 0.0 ± 0.0
3.311IleAsp: 3.311 ± 0.388
4.967IleGlu: 4.967 ± 0.627
0.0IlePhe: 0.0 ± 0.0
3.311IleGly: 3.311 ± 0.388
1.656IleHis: 1.656 ± 1.015
3.311IleIle: 3.311 ± 0.388
3.311IleLys: 3.311 ± 2.807
4.967IleLeu: 4.967 ± 0.627
3.311IleMet: 3.311 ± 2.03
3.311IleAsn: 3.311 ± 0.388
1.656IlePro: 1.656 ± 1.404
1.656IleGln: 1.656 ± 1.015
1.656IleArg: 1.656 ± 1.404
6.623IleSer: 6.623 ± 1.642
1.656IleThr: 1.656 ± 1.015
1.656IleVal: 1.656 ± 1.404
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.311LysAla: 3.311 ± 0.388
1.656LysCys: 1.656 ± 1.404
1.656LysAsp: 1.656 ± 1.404
1.656LysGlu: 1.656 ± 1.404
3.311LysPhe: 3.311 ± 2.807
1.656LysGly: 1.656 ± 1.404
0.0LysHis: 0.0 ± 0.0
1.656LysIle: 1.656 ± 1.015
1.656LysLys: 1.656 ± 1.404
1.656LysLeu: 1.656 ± 1.404
1.656LysMet: 1.656 ± 1.015
1.656LysAsn: 1.656 ± 1.404
0.0LysPro: 0.0 ± 0.0
1.656LysGln: 1.656 ± 1.015
6.623LysArg: 6.623 ± 3.196
3.311LysSer: 3.311 ± 2.03
1.656LysThr: 1.656 ± 1.404
4.967LysVal: 4.967 ± 0.627
1.656LysTrp: 1.656 ± 1.404
4.967LysTyr: 4.967 ± 0.627
0.0LysXaa: 0.0 ± 0.0
Leu
1.656LeuAla: 1.656 ± 1.015
1.656LeuCys: 1.656 ± 1.404
1.656LeuAsp: 1.656 ± 1.015
4.967LeuGlu: 4.967 ± 4.211
0.0LeuPhe: 0.0 ± 0.0
4.967LeuGly: 4.967 ± 0.627
4.967LeuHis: 4.967 ± 1.792
4.967LeuIle: 4.967 ± 0.627
1.656LeuLys: 1.656 ± 1.404
6.623LeuLeu: 6.623 ± 0.777
3.311LeuMet: 3.311 ± 1.452
4.967LeuAsn: 4.967 ± 0.627
3.311LeuPro: 3.311 ± 2.03
3.311LeuGln: 3.311 ± 2.03
3.311LeuArg: 3.311 ± 2.807
11.589LeuSer: 11.589 ± 0.15
1.656LeuThr: 1.656 ± 1.015
4.967LeuVal: 4.967 ± 3.046
0.0LeuTrp: 0.0 ± 0.0
8.278LeuTyr: 8.278 ± 0.238
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
3.311MetGlu: 3.311 ± 2.03
1.656MetPhe: 1.656 ± 1.015
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.656MetIle: 1.656 ± 1.015
0.0MetLys: 0.0 ± 0.0
0.0MetLeu: 0.0 ± 0.0
1.656MetMet: 1.656 ± 1.015
3.311MetAsn: 3.311 ± 2.03
4.967MetPro: 4.967 ± 3.046
0.0MetGln: 0.0 ± 0.0
3.311MetArg: 3.311 ± 2.03
6.623MetSer: 6.623 ± 1.642
3.311MetThr: 3.311 ± 0.388
4.967MetVal: 4.967 ± 3.046
0.0MetTrp: 0.0 ± 0.0
1.656MetTyr: 1.656 ± 1.404
0.0MetXaa: 0.0 ± 0.0
Asn
3.311AsnAla: 3.311 ± 0.388
1.656AsnCys: 1.656 ± 1.015
6.623AsnAsp: 6.623 ± 0.777
0.0AsnGlu: 0.0 ± 0.0
0.0AsnPhe: 0.0 ± 0.0
6.623AsnGly: 6.623 ± 3.196
3.311AsnHis: 3.311 ± 2.807
4.967AsnIle: 4.967 ± 3.046
1.656AsnLys: 1.656 ± 1.015
1.656AsnLeu: 1.656 ± 1.404
0.0AsnMet: 0.0 ± 0.0
1.656AsnAsn: 1.656 ± 1.404
1.656AsnPro: 1.656 ± 1.015
3.311AsnGln: 3.311 ± 0.388
3.311AsnArg: 3.311 ± 2.03
4.967AsnSer: 4.967 ± 3.046
1.656AsnThr: 1.656 ± 1.404
4.967AsnVal: 4.967 ± 3.046
0.0AsnTrp: 0.0 ± 0.0
1.656AsnTyr: 1.656 ± 1.404
0.0AsnXaa: 0.0 ± 0.0
Pro
3.311ProAla: 3.311 ± 0.388
0.0ProCys: 0.0 ± 0.0
4.967ProAsp: 4.967 ± 0.627
0.0ProGlu: 0.0 ± 0.0
0.0ProPhe: 0.0 ± 0.0
3.311ProGly: 3.311 ± 2.03
1.656ProHis: 1.656 ± 1.015
3.311ProIle: 3.311 ± 2.03
0.0ProLys: 0.0 ± 0.0
3.311ProLeu: 3.311 ± 2.03
1.656ProMet: 1.656 ± 1.015
0.0ProAsn: 0.0 ± 0.0
3.311ProPro: 3.311 ± 0.388
4.967ProGln: 4.967 ± 0.627
1.656ProArg: 1.656 ± 1.015
3.311ProSer: 3.311 ± 0.388
3.311ProThr: 3.311 ± 2.03
3.311ProVal: 3.311 ± 2.03
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.656GlnAla: 1.656 ± 1.015
0.0GlnCys: 0.0 ± 0.0
1.656GlnAsp: 1.656 ± 1.015
1.656GlnGlu: 1.656 ± 1.015
1.656GlnPhe: 1.656 ± 1.404
1.656GlnGly: 1.656 ± 1.015
0.0GlnHis: 0.0 ± 0.0
1.656GlnIle: 1.656 ± 1.404
1.656GlnLys: 1.656 ± 1.015
0.0GlnLeu: 0.0 ± 0.0
1.656GlnMet: 1.656 ± 1.015
1.656GlnAsn: 1.656 ± 1.015
1.656GlnPro: 1.656 ± 1.015
1.656GlnGln: 1.656 ± 1.015
1.656GlnArg: 1.656 ± 1.015
4.967GlnSer: 4.967 ± 0.627
1.656GlnThr: 1.656 ± 1.404
6.623GlnVal: 6.623 ± 0.777
0.0GlnTrp: 0.0 ± 0.0
1.656GlnTyr: 1.656 ± 1.015
0.0GlnXaa: 0.0 ± 0.0
Arg
1.656ArgAla: 1.656 ± 1.015
0.0ArgCys: 0.0 ± 0.0
3.311ArgAsp: 3.311 ± 0.388
8.278ArgGlu: 8.278 ± 7.018
3.311ArgPhe: 3.311 ± 0.388
3.311ArgGly: 3.311 ± 2.03
3.311ArgHis: 3.311 ± 2.807
3.311ArgIle: 3.311 ± 0.388
1.656ArgLys: 1.656 ± 1.404
4.967ArgLeu: 4.967 ± 1.792
4.967ArgMet: 4.967 ± 2.605
3.311ArgAsn: 3.311 ± 2.807
6.623ArgPro: 6.623 ± 1.642
0.0ArgGln: 0.0 ± 0.0
1.656ArgArg: 1.656 ± 1.404
3.311ArgSer: 3.311 ± 2.807
3.311ArgThr: 3.311 ± 0.388
1.656ArgVal: 1.656 ± 1.015
3.311ArgTrp: 3.311 ± 2.807
4.967ArgTyr: 4.967 ± 4.211
0.0ArgXaa: 0.0 ± 0.0
Ser
1.656SerAla: 1.656 ± 1.015
0.0SerCys: 0.0 ± 0.0
4.967SerAsp: 4.967 ± 1.792
16.556SerGlu: 16.556 ± 4.361
1.656SerPhe: 1.656 ± 1.015
9.934SerGly: 9.934 ± 3.672
0.0SerHis: 0.0 ± 0.0
1.656SerIle: 1.656 ± 1.015
6.623SerLys: 6.623 ± 0.777
9.934SerLeu: 9.934 ± 1.253
4.967SerMet: 4.967 ± 3.046
11.589SerAsn: 11.589 ± 2.569
1.656SerPro: 1.656 ± 1.015
3.311SerGln: 3.311 ± 2.03
1.656SerArg: 1.656 ± 1.404
4.967SerSer: 4.967 ± 0.627
6.623SerThr: 6.623 ± 4.061
1.656SerVal: 1.656 ± 1.015
0.0SerTrp: 0.0 ± 0.0
1.656SerTyr: 1.656 ± 1.015
0.0SerXaa: 0.0 ± 0.0
Thr
1.656ThrAla: 1.656 ± 1.015
0.0ThrCys: 0.0 ± 0.0
6.623ThrAsp: 6.623 ± 1.642
0.0ThrGlu: 0.0 ± 0.0
1.656ThrPhe: 1.656 ± 1.015
6.623ThrGly: 6.623 ± 0.777
0.0ThrHis: 0.0 ± 0.0
1.656ThrIle: 1.656 ± 1.404
1.656ThrLys: 1.656 ± 1.015
8.278ThrLeu: 8.278 ± 5.076
0.0ThrMet: 0.0 ± 0.0
3.311ThrAsn: 3.311 ± 2.03
3.311ThrPro: 3.311 ± 2.03
1.656ThrGln: 1.656 ± 1.015
3.311ThrArg: 3.311 ± 2.807
4.967ThrSer: 4.967 ± 1.792
1.656ThrThr: 1.656 ± 1.015
6.623ThrVal: 6.623 ± 1.642
0.0ThrTrp: 0.0 ± 0.0
3.311ThrTyr: 3.311 ± 0.388
0.0ThrXaa: 0.0 ± 0.0
Val
3.311ValAla: 3.311 ± 0.388
0.0ValCys: 0.0 ± 0.0
8.278ValAsp: 8.278 ± 2.657
0.0ValGlu: 0.0 ± 0.0
1.656ValPhe: 1.656 ± 1.015
9.934ValGly: 9.934 ± 1.165
0.0ValHis: 0.0 ± 0.0
1.656ValIle: 1.656 ± 1.404
0.0ValLys: 0.0 ± 0.0
3.311ValLeu: 3.311 ± 0.388
1.656ValMet: 1.656 ± 1.015
3.311ValAsn: 3.311 ± 2.03
1.656ValPro: 1.656 ± 1.015
3.311ValGln: 3.311 ± 2.03
4.967ValArg: 4.967 ± 0.627
6.623ValSer: 6.623 ± 4.061
11.589ValThr: 11.589 ± 4.688
4.967ValVal: 4.967 ± 0.627
0.0ValTrp: 0.0 ± 0.0
6.623ValTyr: 6.623 ± 1.642
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
1.656TrpHis: 1.656 ± 1.404
1.656TrpIle: 1.656 ± 1.404
1.656TrpLys: 1.656 ± 1.404
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.656TrpAsn: 1.656 ± 1.404
0.0TrpPro: 0.0 ± 0.0
1.656TrpGln: 1.656 ± 1.404
0.0TrpArg: 0.0 ± 0.0
1.656TrpSer: 1.656 ± 1.404
0.0TrpThr: 0.0 ± 0.0
1.656TrpVal: 1.656 ± 1.404
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.656TyrAla: 1.656 ± 1.015
1.656TyrCys: 1.656 ± 1.404
3.311TyrAsp: 3.311 ± 0.388
4.967TyrGlu: 4.967 ± 4.211
1.656TyrPhe: 1.656 ± 1.404
3.311TyrGly: 3.311 ± 0.388
1.656TyrHis: 1.656 ± 1.015
1.656TyrIle: 1.656 ± 1.015
3.311TyrLys: 3.311 ± 0.388
6.623TyrLeu: 6.623 ± 3.196
0.0TyrMet: 0.0 ± 0.0
1.656TyrAsn: 1.656 ± 1.015
3.311TyrPro: 3.311 ± 2.03
1.656TyrGln: 1.656 ± 1.404
6.623TyrArg: 6.623 ± 3.196
3.311TyrSer: 3.311 ± 2.03
1.656TyrThr: 1.656 ± 1.404
1.656TyrVal: 1.656 ± 1.015
1.656TyrTrp: 1.656 ± 1.404
4.967TyrTyr: 4.967 ± 0.627
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (605 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski