Amino acid dipepetide frequency for Bovine faeces associated smacovirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
5.357AlaCys: 5.357 ± 1.726
7.143AlaAsp: 7.143 ± 1.834
1.786AlaGlu: 1.786 ± 1.402
3.571AlaPhe: 3.571 ± 2.805
5.357AlaGly: 5.357 ± 3.237
1.786AlaHis: 1.786 ± 1.402
1.786AlaIle: 1.786 ± 1.402
3.571AlaLys: 3.571 ± 0.323
7.143AlaLeu: 7.143 ± 0.647
5.357AlaMet: 5.357 ± 1.726
1.786AlaAsn: 1.786 ± 1.079
1.786AlaPro: 1.786 ± 1.079
1.786AlaGln: 1.786 ± 1.079
1.786AlaArg: 1.786 ± 1.079
3.571AlaSer: 3.571 ± 2.158
5.357AlaThr: 5.357 ± 3.237
5.357AlaVal: 5.357 ± 0.755
0.0AlaTrp: 0.0 ± 0.0
1.786AlaTyr: 1.786 ± 1.402
0.0AlaXaa: 0.0 ± 0.0
Cys
3.571CysAla: 3.571 ± 0.323
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.786CysGlu: 1.786 ± 1.402
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
3.571CysHis: 3.571 ± 0.323
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.786CysAsn: 1.786 ± 1.402
1.786CysPro: 1.786 ± 1.079
0.0CysGln: 0.0 ± 0.0
1.786CysArg: 1.786 ± 1.402
3.571CysSer: 3.571 ± 2.805
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.571AspAla: 3.571 ± 0.323
0.0AspCys: 0.0 ± 0.0
3.571AspAsp: 3.571 ± 0.323
3.571AspGlu: 3.571 ± 0.323
7.143AspPhe: 7.143 ± 0.647
7.143AspGly: 7.143 ± 3.128
0.0AspHis: 0.0 ± 0.0
7.143AspIle: 7.143 ± 0.647
5.357AspLys: 5.357 ± 4.207
1.786AspLeu: 1.786 ± 1.079
1.786AspMet: 1.786 ± 1.402
0.0AspAsn: 0.0 ± 0.0
3.571AspPro: 3.571 ± 0.323
0.0AspGln: 0.0 ± 0.0
5.357AspArg: 5.357 ± 1.726
1.786AspSer: 1.786 ± 1.079
3.571AspThr: 3.571 ± 2.158
3.571AspVal: 3.571 ± 2.158
0.0AspTrp: 0.0 ± 0.0
1.786AspTyr: 1.786 ± 1.079
0.0AspXaa: 0.0 ± 0.0
Glu
3.571GluAla: 3.571 ± 2.805
0.0GluCys: 0.0 ± 0.0
3.571GluAsp: 3.571 ± 2.805
0.0GluGlu: 0.0 ± 0.0
1.786GluPhe: 1.786 ± 1.402
3.571GluGly: 3.571 ± 0.323
5.357GluHis: 5.357 ± 0.755
3.571GluIle: 3.571 ± 0.323
7.143GluLys: 7.143 ± 3.128
1.786GluLeu: 1.786 ± 1.079
1.786GluMet: 1.786 ± 0.945
1.786GluAsn: 1.786 ± 1.079
1.786GluPro: 1.786 ± 1.402
1.786GluGln: 1.786 ± 1.402
3.571GluArg: 3.571 ± 2.805
1.786GluSer: 1.786 ± 1.079
3.571GluThr: 3.571 ± 2.158
1.786GluVal: 1.786 ± 1.402
0.0GluTrp: 0.0 ± 0.0
1.786GluTyr: 1.786 ± 1.402
0.0GluXaa: 0.0 ± 0.0
Phe
1.786PheAla: 1.786 ± 1.079
1.786PheCys: 1.786 ± 1.402
7.143PheAsp: 7.143 ± 0.647
8.929PheGlu: 8.929 ± 2.913
1.786PhePhe: 1.786 ± 1.402
1.786PheGly: 1.786 ± 1.079
0.0PheHis: 0.0 ± 0.0
3.571PheIle: 3.571 ± 2.805
0.0PheLys: 0.0 ± 0.0
5.357PheLeu: 5.357 ± 1.726
0.0PheMet: 0.0 ± 0.0
1.786PheAsn: 1.786 ± 1.079
7.143PhePro: 7.143 ± 3.128
3.571PheGln: 3.571 ± 0.323
0.0PheArg: 0.0 ± 0.0
1.786PheSer: 1.786 ± 1.402
0.0PheThr: 0.0 ± 0.0
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
8.929GlyAla: 8.929 ± 5.394
0.0GlyCys: 0.0 ± 0.0
1.786GlyAsp: 1.786 ± 1.079
0.0GlyGlu: 0.0 ± 0.0
3.571GlyPhe: 3.571 ± 0.323
1.786GlyGly: 1.786 ± 1.079
0.0GlyHis: 0.0 ± 0.0
10.714GlyIle: 10.714 ± 3.992
8.929GlyLys: 8.929 ± 2.049
5.357GlyLeu: 5.357 ± 3.237
0.0GlyMet: 0.0 ± 0.0
7.143GlyAsn: 7.143 ± 0.647
1.786GlyPro: 1.786 ± 1.079
0.0GlyGln: 0.0 ± 0.0
3.571GlyArg: 3.571 ± 0.323
5.357GlySer: 5.357 ± 3.237
3.571GlyThr: 3.571 ± 2.805
0.0GlyVal: 0.0 ± 0.0
1.786GlyTrp: 1.786 ± 1.079
5.357GlyTyr: 5.357 ± 4.207
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.786HisAsp: 1.786 ± 1.079
0.0HisGlu: 0.0 ± 0.0
1.786HisPhe: 1.786 ± 1.402
1.786HisGly: 1.786 ± 1.402
0.0HisHis: 0.0 ± 0.0
3.571HisIle: 3.571 ± 2.158
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
1.786HisMet: 1.786 ± 1.402
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.786HisSer: 1.786 ± 1.079
5.357HisThr: 5.357 ± 0.755
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.786HisTyr: 1.786 ± 1.079
0.0HisXaa: 0.0 ± 0.0
Ile
3.571IleAla: 3.571 ± 0.323
1.786IleCys: 1.786 ± 1.402
5.357IleAsp: 5.357 ± 0.755
5.357IleGlu: 5.357 ± 1.726
0.0IlePhe: 0.0 ± 0.0
5.357IleGly: 5.357 ± 3.237
1.786IleHis: 1.786 ± 1.079
1.786IleIle: 1.786 ± 1.402
1.786IleLys: 1.786 ± 1.079
7.143IleLeu: 7.143 ± 1.834
5.357IleMet: 5.357 ± 1.726
0.0IleAsn: 0.0 ± 0.0
5.357IlePro: 5.357 ± 1.726
1.786IleGln: 1.786 ± 1.079
3.571IleArg: 3.571 ± 2.158
3.571IleSer: 3.571 ± 2.158
7.143IleThr: 7.143 ± 3.128
5.357IleVal: 5.357 ± 0.755
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
8.929LysAla: 8.929 ± 0.432
0.0LysCys: 0.0 ± 0.0
5.357LysAsp: 5.357 ± 1.726
1.786LysGlu: 1.786 ± 1.402
0.0LysPhe: 0.0 ± 0.0
5.357LysGly: 5.357 ± 1.726
0.0LysHis: 0.0 ± 0.0
7.143LysIle: 7.143 ± 0.647
3.571LysLys: 3.571 ± 0.323
1.786LysLeu: 1.786 ± 1.079
0.0LysMet: 0.0 ± 0.0
0.0LysAsn: 0.0 ± 0.0
3.571LysPro: 3.571 ± 2.805
3.571LysGln: 3.571 ± 2.805
1.786LysArg: 1.786 ± 1.402
3.571LysSer: 3.571 ± 0.323
1.786LysThr: 1.786 ± 1.402
5.357LysVal: 5.357 ± 0.755
1.786LysTrp: 1.786 ± 1.402
1.786LysTyr: 1.786 ± 1.402
0.0LysXaa: 0.0 ± 0.0
Leu
1.786LeuAla: 1.786 ± 1.079
1.786LeuCys: 1.786 ± 1.079
3.571LeuAsp: 3.571 ± 0.323
3.571LeuGlu: 3.571 ± 2.805
1.786LeuPhe: 1.786 ± 1.079
3.571LeuGly: 3.571 ± 2.158
1.786LeuHis: 1.786 ± 1.079
3.571LeuIle: 3.571 ± 0.323
3.571LeuLys: 3.571 ± 0.323
3.571LeuLeu: 3.571 ± 0.323
5.357LeuMet: 5.357 ± 0.755
0.0LeuAsn: 0.0 ± 0.0
7.143LeuPro: 7.143 ± 4.316
1.786LeuGln: 1.786 ± 1.402
7.143LeuArg: 7.143 ± 0.647
12.5LeuSer: 12.5 ± 2.59
0.0LeuThr: 0.0 ± 0.0
7.143LeuVal: 7.143 ± 1.834
1.786LeuTrp: 1.786 ± 1.079
5.357LeuTyr: 5.357 ± 1.726
0.0LeuXaa: 0.0 ± 0.0
Met
5.357MetAla: 5.357 ± 1.726
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
3.571MetGly: 3.571 ± 2.805
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.786MetLys: 1.786 ± 1.402
1.786MetLeu: 1.786 ± 1.402
0.0MetMet: 0.0 ± 0.831
1.786MetAsn: 1.786 ± 1.079
5.357MetPro: 5.357 ± 0.755
0.0MetGln: 0.0 ± 0.0
1.786MetArg: 1.786 ± 1.079
3.571MetSer: 3.571 ± 2.158
0.0MetThr: 0.0 ± 0.0
3.571MetVal: 3.571 ± 2.805
5.357MetTrp: 5.357 ± 4.207
1.786MetTyr: 1.786 ± 1.079
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
0.0AsnCys: 0.0 ± 0.0
5.357AsnAsp: 5.357 ± 1.726
1.786AsnGlu: 1.786 ± 1.402
3.571AsnPhe: 3.571 ± 0.323
5.357AsnGly: 5.357 ± 0.755
0.0AsnHis: 0.0 ± 0.0
3.571AsnIle: 3.571 ± 2.158
0.0AsnLys: 0.0 ± 0.0
3.571AsnLeu: 3.571 ± 0.323
0.0AsnMet: 0.0 ± 0.0
3.571AsnAsn: 3.571 ± 2.158
3.571AsnPro: 3.571 ± 2.158
0.0AsnGln: 0.0 ± 0.0
1.786AsnArg: 1.786 ± 1.079
3.571AsnSer: 3.571 ± 2.158
0.0AsnThr: 0.0 ± 0.0
0.0AsnVal: 0.0 ± 0.0
0.0AsnTrp: 0.0 ± 0.0
3.571AsnTyr: 3.571 ± 2.158
0.0AsnXaa: 0.0 ± 0.0
Pro
5.357ProAla: 5.357 ± 0.755
1.786ProCys: 1.786 ± 1.079
0.0ProAsp: 0.0 ± 0.0
1.786ProGlu: 1.786 ± 1.402
1.786ProPhe: 1.786 ± 1.079
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
3.571ProIle: 3.571 ± 2.158
1.786ProLys: 1.786 ± 1.402
14.286ProLeu: 14.286 ± 3.669
0.0ProMet: 0.0 ± 0.0
0.0ProAsn: 0.0 ± 0.0
7.143ProPro: 7.143 ± 0.647
3.571ProGln: 3.571 ± 0.323
5.357ProArg: 5.357 ± 1.726
5.357ProSer: 5.357 ± 0.755
7.143ProThr: 7.143 ± 0.647
1.786ProVal: 1.786 ± 1.079
0.0ProTrp: 0.0 ± 0.0
3.571ProTyr: 3.571 ± 0.323
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
1.786GlnCys: 1.786 ± 1.402
1.786GlnAsp: 1.786 ± 1.079
5.357GlnGlu: 5.357 ± 4.207
1.786GlnPhe: 1.786 ± 1.402
1.786GlnGly: 1.786 ± 1.079
0.0GlnHis: 0.0 ± 0.0
0.0GlnIle: 0.0 ± 0.0
1.786GlnLys: 1.786 ± 1.402
1.786GlnLeu: 1.786 ± 1.402
3.571GlnMet: 3.571 ± 0.323
1.786GlnAsn: 1.786 ± 1.402
0.0GlnPro: 0.0 ± 0.0
0.0GlnGln: 0.0 ± 0.0
1.786GlnArg: 1.786 ± 1.079
3.571GlnSer: 3.571 ± 0.323
1.786GlnThr: 1.786 ± 1.079
1.786GlnVal: 1.786 ± 1.402
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
5.357ArgAla: 5.357 ± 1.726
0.0ArgCys: 0.0 ± 0.0
1.786ArgAsp: 1.786 ± 1.079
1.786ArgGlu: 1.786 ± 1.402
3.571ArgPhe: 3.571 ± 2.158
3.571ArgGly: 3.571 ± 0.323
1.786ArgHis: 1.786 ± 1.402
5.357ArgIle: 5.357 ± 0.755
5.357ArgLys: 5.357 ± 1.726
1.786ArgLeu: 1.786 ± 1.079
0.0ArgMet: 0.0 ± 0.0
3.571ArgAsn: 3.571 ± 0.323
1.786ArgPro: 1.786 ± 1.079
0.0ArgGln: 0.0 ± 0.0
0.0ArgArg: 0.0 ± 0.0
1.786ArgSer: 1.786 ± 1.079
1.786ArgThr: 1.786 ± 1.079
1.786ArgVal: 1.786 ± 1.402
1.786ArgTrp: 1.786 ± 1.402
3.571ArgTyr: 3.571 ± 2.805
0.0ArgXaa: 0.0 ± 0.0
Ser
7.143SerAla: 7.143 ± 1.834
0.0SerCys: 0.0 ± 0.0
1.786SerAsp: 1.786 ± 1.079
1.786SerGlu: 1.786 ± 1.079
3.571SerPhe: 3.571 ± 0.323
5.357SerGly: 5.357 ± 3.237
0.0SerHis: 0.0 ± 0.0
1.786SerIle: 1.786 ± 1.079
5.357SerLys: 5.357 ± 1.726
1.786SerLeu: 1.786 ± 1.079
3.571SerMet: 3.571 ± 0.323
8.929SerAsn: 8.929 ± 2.913
5.357SerPro: 5.357 ± 0.755
0.0SerGln: 0.0 ± 0.0
0.0SerArg: 0.0 ± 0.0
5.357SerSer: 5.357 ± 3.237
14.286SerThr: 14.286 ± 6.15
1.786SerVal: 1.786 ± 1.079
5.357SerTrp: 5.357 ± 4.207
1.786SerTyr: 1.786 ± 1.079
0.0SerXaa: 0.0 ± 0.0
Thr
0.0ThrAla: 0.0 ± 0.0
0.0ThrCys: 0.0 ± 0.0
7.143ThrAsp: 7.143 ± 5.609
5.357ThrGlu: 5.357 ± 0.755
7.143ThrPhe: 7.143 ± 1.834
3.571ThrGly: 3.571 ± 2.158
0.0ThrHis: 0.0 ± 0.0
1.786ThrIle: 1.786 ± 1.079
1.786ThrLys: 1.786 ± 1.079
8.929ThrLeu: 8.929 ± 2.913
0.0ThrMet: 0.0 ± 0.0
5.357ThrAsn: 5.357 ± 3.237
3.571ThrPro: 3.571 ± 0.323
1.786ThrGln: 1.786 ± 1.402
0.0ThrArg: 0.0 ± 0.0
5.357ThrSer: 5.357 ± 0.755
5.357ThrThr: 5.357 ± 3.237
1.786ThrVal: 1.786 ± 1.079
1.786ThrTrp: 1.786 ± 1.079
5.357ThrTyr: 5.357 ± 3.237
0.0ThrXaa: 0.0 ± 0.0
Val
3.571ValAla: 3.571 ± 0.323
1.786ValCys: 1.786 ± 1.402
0.0ValAsp: 0.0 ± 0.0
1.786ValGlu: 1.786 ± 1.079
3.571ValPhe: 3.571 ± 2.805
5.357ValGly: 5.357 ± 0.755
1.786ValHis: 1.786 ± 1.079
3.571ValIle: 3.571 ± 0.323
0.0ValLys: 0.0 ± 0.0
5.357ValLeu: 5.357 ± 0.755
3.571ValMet: 3.571 ± 2.158
0.0ValAsn: 0.0 ± 0.0
1.786ValPro: 1.786 ± 1.079
3.571ValGln: 3.571 ± 0.323
3.571ValArg: 3.571 ± 2.805
1.786ValSer: 1.786 ± 1.079
1.786ValThr: 1.786 ± 1.079
1.786ValVal: 1.786 ± 1.079
0.0ValTrp: 0.0 ± 0.0
1.786ValTyr: 1.786 ± 1.079
0.0ValXaa: 0.0 ± 0.0
Trp
1.786TrpAla: 1.786 ± 1.079
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.786TrpGlu: 1.786 ± 1.402
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
1.786TrpHis: 1.786 ± 1.079
1.786TrpIle: 1.786 ± 1.402
3.571TrpLys: 3.571 ± 0.323
1.786TrpLeu: 1.786 ± 1.402
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
7.143TrpGln: 7.143 ± 3.128
0.0TrpArg: 0.0 ± 0.0
1.786TrpSer: 1.786 ± 1.402
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.786TyrAla: 1.786 ± 1.402
1.786TyrCys: 1.786 ± 1.402
3.571TyrAsp: 3.571 ± 0.323
1.786TyrGlu: 1.786 ± 1.402
0.0TyrPhe: 0.0 ± 0.0
5.357TyrGly: 5.357 ± 0.755
0.0TyrHis: 0.0 ± 0.0
1.786TyrIle: 1.786 ± 1.402
1.786TyrLys: 1.786 ± 1.079
1.786TyrLeu: 1.786 ± 1.402
3.571TyrMet: 3.571 ± 2.805
0.0TyrAsn: 0.0 ± 0.0
1.786TyrPro: 1.786 ± 1.079
0.0TyrGln: 0.0 ± 0.0
3.571TyrArg: 3.571 ± 2.158
3.571TyrSer: 3.571 ± 0.323
3.571TyrThr: 3.571 ± 2.158
3.571TyrVal: 3.571 ± 0.323
1.786TyrTrp: 1.786 ± 1.079
3.571TyrTyr: 3.571 ± 2.158
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (561 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski