Amino acid dipeptide frequency for Bovine faeces associated smacovirus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
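The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by Proteome-pI: the function name `dipeptide_frequencies`, the use of one-letter codes (with `X` standing in for Xaa), and the choice not to count pairs that span two proteins are all assumptions made here.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X plays the role of Xaa (assumption)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_frequencies(proteins):
    """Return the per-mille frequency of each of the 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs; pairs never span two proteins (assumption).
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }
```

Under these assumptions, a dipeptide whose value exceeds `EXPECTED_PER_MILLE` would count as overrepresented and one below it as underrepresented; the exact threshold used for the colouring on the page is not stated.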

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 2.5‰ = 0.0025 = 0.25%).

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency in ‰ ± its estimated error.

First/Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.959 ± 1.629 | 1.653 ± 1.315 | 3.306 ± 2.002 | 6.612 ± 1.688 | 1.653 ± 1.001 | 1.653 ± 1.315 | 1.653 ± 1.315 | 0.0 ± 0.0 | 4.959 ± 0.687 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.001 | 0.0 ± 0.0 | 4.959 ± 0.687 | 3.306 ± 0.314 | 3.306 ± 0.314 | 0.0 ± 0.0 | 1.653 ± 1.315 | 0.0 ± 0.0
Cys | 3.306 ± 2.63 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.001 | 0.0 ± 0.0 | 1.653 ± 1.315 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.306 ± 0.314 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.315 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 0.0 ± 0.0 | 3.306 ± 2.63 | 1.653 ± 1.001 | 6.612 ± 4.005 | 1.653 ± 1.001 | 8.264 ± 0.373 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.306 ± 2.63 | 9.917 ± 3.258 | 3.306 ± 2.63 | 1.653 ± 1.315 | 1.653 ± 1.001 | 1.653 ± 1.001 | 4.959 ± 3.945 | 4.959 ± 1.629 | 6.612 ± 0.628 | 4.959 ± 1.629 | 0.0 ± 0.0 | 4.959 ± 1.629 | 0.0 ± 0.0
Glu | 3.306 ± 2.002 | 0.0 ± 0.0 | 6.612 ± 0.628 | 3.306 ± 0.314 | 1.653 ± 1.001 | 8.264 ± 2.689 | 0.0 ± 0.0 | 3.306 ± 0.314 | 3.306 ± 0.314 | 1.653 ± 1.001 | 0.0 ± 0.783 | 3.306 ± 2.002 | 1.653 ± 1.315 | 3.306 ± 0.314 | 3.306 ± 2.63 | 3.306 ± 2.002 | 6.612 ± 1.688 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.959 ± 1.629 | 0.0 ± 0.0
Phe | 0.0 ± 0.0 | 1.653 ± 1.001 | 0.0 ± 0.0 | 1.653 ± 1.001 | 1.653 ± 1.001 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.001 | 0.0 ± 0.0 | 8.264 ± 2.689 | 1.653 ± 1.001 | 1.653 ± 1.001 | 1.653 ± 1.001 | 1.653 ± 1.001 | 3.306 ± 0.314 | 1.653 ± 1.315 | 1.653 ± 1.001 | 0.0 ± 0.0 | 0.0 ± 0.0
Gly | 3.306 ± 2.002 | 0.0 ± 0.0 | 8.264 ± 4.259 | 4.959 ± 0.687 | 1.653 ± 1.001 | 8.264 ± 6.575 | 1.653 ± 1.315 | 1.653 ± 1.001 | 8.264 ± 1.943 | 13.223 ± 1.06 | 3.306 ± 0.688 | 3.306 ± 0.314 | 0.0 ± 0.0 | 4.959 ± 1.629 | 3.306 ± 0.314 | 3.306 ± 0.314 | 1.653 ± 1.315 | 13.223 ± 5.693 | 3.306 ± 0.314 | 3.306 ± 2.63 | 0.0 ± 0.0
His | 1.653 ± 1.001 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.315 | 1.653 ± 1.315 | 4.959 ± 1.629 | 0.0 ± 0.0 | 3.306 ± 0.314 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.001 | 1.653 ± 1.315 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.315 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.306 ± 2.63 | 0.0 ± 0.0
Ile | 0.0 ± 0.0 | 1.653 ± 1.315 | 6.612 ± 5.26 | 6.612 ± 2.944 | 0.0 ± 0.0 | 8.264 ± 1.943 | 1.653 ± 1.001 | 1.653 ± 1.001 | 1.653 ± 1.315 | 8.264 ± 2.689 | 3.306 ± 0.314 | 1.653 ± 1.001 | 3.306 ± 2.002 | 1.653 ± 1.001 | 1.653 ± 1.001 | 3.306 ± 2.63 | 1.653 ± 1.001 | 0.0 ± 0.0 | 1.653 ± 1.315 | 3.306 ± 0.314 | 0.0 ± 0.0
Lys | 3.306 ± 0.314 | 0.0 ± 0.0 | 3.306 ± 0.314 | 1.653 ± 1.315 | 1.653 ± 1.001 | 8.264 ± 6.575 | 0.0 ± 0.0 | 3.306 ± 0.314 | 3.306 ± 0.314 | 6.612 ± 0.628 | 3.306 ± 2.002 | 0.0 ± 0.0 | 1.653 ± 1.315 | 3.306 ± 0.314 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.315 | 3.306 ± 0.314 | 1.653 ± 1.001 | 4.959 ± 1.629 | 0.0 ± 0.0
Leu | 1.653 ± 1.315 | 0.0 ± 0.0 | 4.959 ± 0.687 | 6.612 ± 0.628 | 0.0 ± 0.0 | 3.306 ± 2.63 | 0.0 ± 0.0 | 4.959 ± 0.687 | 3.306 ± 0.314 | 3.306 ± 2.63 | 0.0 ± 0.0 | 0.0 ± 0.0 | 6.612 ± 1.688 | 0.0 ± 0.0 | 6.612 ± 1.688 | 13.223 ± 3.377 | 3.306 ± 2.002 | 8.264 ± 0.373 | 1.653 ± 1.315 | 4.959 ± 3.003 | 0.0 ± 0.0
Met | 0.0 ± 0.0 | 1.653 ± 1.001 | 3.306 ± 0.314 | 3.306 ± 2.002 | 0.0 ± 0.0 | 3.306 ± 0.314 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.315 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.306 ± 0.314 | 4.959 ± 3.003 | 0.0 ± 0.0 | 3.306 ± 2.002 | 0.0 ± 0.0 | 1.653 ± 1.001 | 4.959 ± 0.687 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 3.306 ± 0.314 | 0.0 ± 0.0 | 4.959 ± 0.687 | 4.959 ± 0.687 | 0.0 ± 0.0 | 4.959 ± 0.687 | 1.653 ± 1.315 | 1.653 ± 1.001 | 1.653 ± 1.001 | 1.653 ± 1.001 | 1.653 ± 1.001 | 1.653 ± 1.001 | 1.653 ± 1.001 | 1.653 ± 1.001 | 1.653 ± 1.001 | 0.0 ± 0.0 | 6.612 ± 1.688 | 4.959 ± 3.003 | 0.0 ± 0.0 | 4.959 ± 0.687 | 0.0 ± 0.0
Pro | 4.959 ± 0.687 | 0.0 ± 0.0 | 1.653 ± 1.001 | 1.653 ± 1.001 | 0.0 ± 0.0 | 3.306 ± 2.002 | 1.653 ± 1.001 | 3.306 ± 2.002 | 0.0 ± 0.0 | 4.959 ± 0.687 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.306 ± 0.314 | 1.653 ± 1.001 | 3.306 ± 2.63 | 1.653 ± 1.001 | 6.612 ± 0.628 | 6.612 ± 1.688 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Gln | 1.653 ± 1.001 | 1.653 ± 1.315 | 1.653 ± 1.001 | 1.653 ± 1.001 | 0.0 ± 0.0 | 1.653 ± 1.001 | 1.653 ± 1.315 | 1.653 ± 1.315 | 0.0 ± 0.0 | 1.653 ± 1.315 | 1.653 ± 1.001 | 4.959 ± 0.687 | 0.0 ± 0.0 | 1.653 ± 1.001 | 1.653 ± 1.001 | 3.306 ± 0.314 | 1.653 ± 1.001 | 3.306 ± 0.314 | 0.0 ± 0.0 | 1.653 ± 1.001 | 0.0 ± 0.0
Arg | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.959 ± 1.629 | 4.959 ± 1.629 | 3.306 ± 0.314 | 6.612 ± 1.688 | 1.653 ± 1.315 | 6.612 ± 1.688 | 4.959 ± 1.629 | 4.959 ± 0.687 | 3.306 ± 2.002 | 3.306 ± 0.314 | 3.306 ± 2.002 | 1.653 ± 1.315 | 0.0 ± 0.0 | 4.959 ± 1.629 | 4.959 ± 1.629 | 0.0 ± 0.0 | 1.653 ± 1.315 | 3.306 ± 2.63 | 0.0 ± 0.0
Ser | 8.264 ± 1.943 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.959 ± 3.003 | 3.306 ± 2.002 | 3.306 ± 0.314 | 1.653 ± 1.001 | 4.959 ± 1.629 | 0.0 ± 0.0 | 3.306 ± 2.002 | 3.306 ± 2.002 | 8.264 ± 5.006 | 3.306 ± 0.314 | 0.0 ± 0.0 | 8.264 ± 1.943 | 1.653 ± 1.001 | 1.653 ± 1.001 | 4.959 ± 0.687 | 1.653 ± 1.315 | 1.653 ± 1.315 | 0.0 ± 0.0
Thr | 3.306 ± 0.314 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.001 | 6.612 ± 0.628 | 3.306 ± 2.63 | 8.264 ± 1.943 | 8.264 ± 0.373 | 4.959 ± 3.003 | 1.653 ± 1.001 | 1.653 ± 1.315 | 4.959 ± 1.629 | 0.0 ± 0.0 | 6.612 ± 0.628 | 3.306 ± 2.002 | 6.612 ± 2.944 | 4.959 ± 3.003 | 0.0 ± 0.0 | 1.653 ± 1.001 | 0.0 ± 0.0
Val | 3.306 ± 0.314 | 0.0 ± 0.0 | 8.264 ± 0.373 | 1.653 ± 1.001 | 4.959 ± 0.687 | 4.959 ± 0.687 | 0.0 ± 0.0 | 1.653 ± 1.315 | 1.653 ± 1.315 | 1.653 ± 1.315 | 0.0 ± 0.0 | 6.612 ± 4.005 | 3.306 ± 0.314 | 3.306 ± 2.002 | 6.612 ± 2.944 | 9.917 ± 6.007 | 6.612 ± 0.628 | 6.612 ± 0.628 | 0.0 ± 0.0 | 3.306 ± 2.002 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.306 ± 2.63 | 0.0 ± 0.0 | 1.653 ± 1.001 | 1.653 ± 1.001 | 1.653 ± 1.315 | 1.653 ± 1.315 | 3.306 ± 0.314 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.001 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 0.0 ± 0.0 | 1.653 ± 1.315 | 4.959 ± 1.629 | 1.653 ± 1.315 | 1.653 ± 1.315 | 0.0 ± 0.0 | 1.653 ± 1.315 | 6.612 ± 2.944 | 4.959 ± 0.687 | 1.653 ± 1.001 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.001 | 3.306 ± 0.314 | 8.264 ± 0.373 | 4.959 ± 1.629 | 1.653 ± 1.001 | 1.653 ± 1.315 | 1.653 ± 1.001 | 4.959 ± 0.687 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 2 proteins (606 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
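A protein-level bootstrap of this kind might look like the sketch below. The page does not say which estimator is used, so the choice of the standard deviation over 100 resamples, the fixed seed, and the helper names `per_mille` and `bootstrap_error` are assumptions made for illustration only.

```python
import random
import statistics


def per_mille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter codes) in a protein set."""
    hits = total = 0
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Spread of the frequency over resampled protein sets (assumed estimator)."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the set size.
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(resampled, dipeptide))
    return statistics.pstdev(estimates)
```

With only two proteins, as here, each resample can take just a handful of distinct compositions, so errors obtained this way are necessarily coarse.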

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski