Amino acid dipeptide frequency for Bovine faeces associated smacovirus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
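The calculation described above can be sketched in Python. This is a minimal illustration, not the database's actual pipeline: it assumes that overlapping residue pairs are counted within each protein (never across protein boundaries), that frequencies are normalized by the total number of pairs, and that any non-standard letter is mapped to X (Xaa). The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard letter is treated as Xaa.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, counted per protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=2.5):
    """Compare a frequency with the uniform expectation (1/400 = 2.5 per mille)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKKLAA", "AAKX"]
freqs = dipeptide_permille(toy)
print(freqs["AA"], classify(freqs["AA"]))
print(freqs["AA"] * 1e-3)  # per mille converted to a fraction
```

With a proteome as small as this one (2 proteins), most of the 441 dipeptides cannot occur at all, so many zero entries are expected.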

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 3.306 ± 1.993
AlaAsp: 4.959 ± 0.761
AlaGlu: 0.0 ± 0.0
AlaPhe: 3.306 ± 1.993
AlaGly: 8.264 ± 4.983
AlaHis: 0.0 ± 0.0
AlaIle: 3.306 ± 1.993
AlaLys: 6.612 ± 0.472
AlaLeu: 6.612 ± 2.7
AlaMet: 3.306 ± 0.236
AlaAsn: 1.653 ± 1.232
AlaPro: 3.306 ± 1.993
AlaGln: 0.0 ± 0.0
AlaArg: 0.0 ± 0.0
AlaSer: 0.0 ± 0.0
AlaThr: 4.959 ± 1.468
AlaVal: 4.959 ± 0.761
AlaTrp: 0.0 ± 0.0
AlaTyr: 4.959 ± 3.697
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.653 ± 0.997
CysCys: 0.0 ± 0.0
CysAsp: 1.653 ± 0.997
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.653 ± 0.997
CysLys: 1.653 ± 1.232
CysLeu: 0.0 ± 0.0
CysMet: 1.653 ± 1.232
CysAsn: 0.0 ± 0.0
CysPro: 1.653 ± 1.232
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 3.306 ± 1.993
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.959 ± 0.761
AspCys: 0.0 ± 0.0
AspAsp: 8.264 ± 3.933
AspGlu: 9.917 ± 0.707
AspPhe: 0.0 ± 0.0
AspGly: 4.959 ± 1.468
AspHis: 0.0 ± 0.0
AspIle: 9.917 ± 0.707
AspLys: 3.306 ± 0.236
AspLeu: 4.959 ± 0.761
AspMet: 1.653 ± 0.997
AspAsn: 0.0 ± 0.0
AspPro: 1.653 ± 0.997
AspGln: 0.0 ± 0.0
AspArg: 3.306 ± 2.465
AspSer: 4.959 ± 1.468
AspThr: 6.612 ± 1.757
AspVal: 3.306 ± 0.236
AspTrp: 0.0 ± 0.0
AspTyr: 4.959 ± 1.468
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.653 ± 1.232
GluCys: 0.0 ± 0.0
GluAsp: 3.306 ± 2.465
GluGlu: 4.959 ± 1.468
GluPhe: 3.306 ± 0.236
GluGly: 6.612 ± 1.757
GluHis: 1.653 ± 1.232
GluIle: 4.959 ± 0.761
GluLys: 6.612 ± 2.7
GluLeu: 3.306 ± 0.236
GluMet: 0.0 ± 0.0
GluAsn: 1.653 ± 0.997
GluPro: 4.959 ± 0.761
GluGln: 1.653 ± 1.232
GluArg: 1.653 ± 1.232
GluSer: 3.306 ± 2.465
GluThr: 1.653 ± 0.997
GluVal: 1.653 ± 1.232
GluTrp: 1.653 ± 0.997
GluTyr: 1.653 ± 1.232
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 3.306 ± 0.236
PheGlu: 0.0 ± 0.0
PhePhe: 1.653 ± 0.997
PheGly: 0.0 ± 0.0
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 0.0 ± 0.0
PheLeu: 0.0 ± 0.0
PheMet: 1.653 ± 0.997
PheAsn: 0.0 ± 0.0
PhePro: 3.306 ± 0.236
PheGln: 0.0 ± 0.0
PheArg: 1.653 ± 0.997
PheSer: 3.306 ± 0.236
PheThr: 8.264 ± 0.525
PheVal: 3.306 ± 0.236
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.653 ± 0.997
GlyCys: 0.0 ± 0.0
GlyAsp: 1.653 ± 0.997
GlyGlu: 3.306 ± 2.465
GlyPhe: 0.0 ± 0.0
GlyGly: 1.653 ± 1.232
GlyHis: 3.306 ± 0.236
GlyIle: 1.653 ± 1.232
GlyLys: 8.264 ± 1.704
GlyLeu: 11.57 ± 4.747
GlyMet: 1.653 ± 0.997
GlyAsn: 3.306 ± 0.236
GlyPro: 1.653 ± 1.232
GlyGln: 6.612 ± 3.986
GlyArg: 0.0 ± 0.0
GlySer: 4.959 ± 0.761
GlyThr: 4.959 ± 0.761
GlyVal: 4.959 ± 0.761
GlyTrp: 3.306 ± 0.236
GlyTyr: 1.653 ± 1.232
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.306 ± 0.236
HisCys: 0.0 ± 0.0
HisAsp: 1.653 ± 1.232
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 3.306 ± 0.236
HisHis: 0.0 ± 0.0
HisIle: 1.653 ± 1.232
HisLys: 3.306 ± 0.236
HisLeu: 3.306 ± 0.236
HisMet: 0.0 ± 0.0
HisAsn: 1.653 ± 1.232
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.653 ± 0.997
HisThr: 1.653 ± 0.997
HisVal: 1.653 ± 1.232
HisTrp: 0.0 ± 0.0
HisTyr: 1.653 ± 1.232
HisXaa: 0.0 ± 0.0
Ile
IleAla: 8.264 ± 1.704
IleCys: 0.0 ± 0.0
IleAsp: 4.959 ± 0.761
IleGlu: 4.959 ± 1.468
IlePhe: 0.0 ± 0.0
IleGly: 6.612 ± 0.472
IleHis: 0.0 ± 0.0
IleIle: 0.0 ± 0.0
IleLys: 6.612 ± 2.7
IleLeu: 3.306 ± 0.236
IleMet: 4.959 ± 1.506
IleAsn: 4.959 ± 0.761
IlePro: 3.306 ± 0.236
IleGln: 0.0 ± 0.0
IleArg: 1.653 ± 0.997
IleSer: 6.612 ± 1.757
IleThr: 3.306 ± 0.236
IleVal: 1.653 ± 0.997
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.306 ± 0.236
LysCys: 0.0 ± 0.0
LysAsp: 3.306 ± 0.236
LysGlu: 3.306 ± 0.236
LysPhe: 3.306 ± 0.236
LysGly: 0.0 ± 0.0
LysHis: 1.653 ± 1.232
LysIle: 1.653 ± 0.997
LysLys: 1.653 ± 1.232
LysLeu: 1.653 ± 1.232
LysMet: 3.306 ± 0.664
LysAsn: 4.959 ± 3.697
LysPro: 3.306 ± 2.465
LysGln: 1.653 ± 1.232
LysArg: 6.612 ± 2.7
LysSer: 8.264 ± 2.754
LysThr: 4.959 ± 1.468
LysVal: 0.0 ± 0.0
LysTrp: 3.306 ± 2.465
LysTyr: 1.653 ± 1.232
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.959 ± 1.468
LeuCys: 1.653 ± 0.997
LeuAsp: 3.306 ± 0.236
LeuGlu: 1.653 ± 1.232
LeuPhe: 1.653 ± 0.997
LeuGly: 3.306 ± 1.993
LeuHis: 1.653 ± 1.232
LeuIle: 4.959 ± 1.468
LeuLys: 3.306 ± 0.236
LeuLeu: 11.57 ± 1.94
LeuMet: 1.653 ± 0.997
LeuAsn: 1.653 ± 0.997
LeuPro: 6.612 ± 0.472
LeuGln: 1.653 ± 0.997
LeuArg: 1.653 ± 0.997
LeuSer: 8.264 ± 1.704
LeuThr: 0.0 ± 0.0
LeuVal: 6.612 ± 1.757
LeuTrp: 0.0 ± 0.0
LeuTyr: 8.264 ± 0.525
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 1.653 ± 1.232
MetGlu: 4.959 ± 1.468
MetPhe: 0.0 ± 0.0
MetGly: 1.653 ± 1.232
MetHis: 1.653 ± 0.997
MetIle: 1.653 ± 0.997
MetLys: 1.653 ± 0.997
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 3.306 ± 1.993
MetPro: 4.959 ± 2.99
MetGln: 0.0 ± 0.0
MetArg: 3.306 ± 1.993
MetSer: 1.653 ± 0.997
MetThr: 0.0 ± 0.0
MetVal: 3.306 ± 0.236
MetTrp: 3.306 ± 2.465
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.306 ± 0.236
AsnCys: 0.0 ± 0.0
AsnAsp: 6.612 ± 2.7
AsnGlu: 1.653 ± 0.997
AsnPhe: 0.0 ± 0.0
AsnGly: 4.959 ± 3.697
AsnHis: 1.653 ± 0.997
AsnIle: 0.0 ± 0.0
AsnLys: 0.0 ± 0.0
AsnLeu: 1.653 ± 1.232
AsnMet: 1.653 ± 1.232
AsnAsn: 0.0 ± 0.0
AsnPro: 3.306 ± 1.993
AsnGln: 3.306 ± 0.236
AsnArg: 1.653 ± 0.997
AsnSer: 3.306 ± 1.993
AsnThr: 3.306 ± 1.993
AsnVal: 0.0 ± 0.0
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.653 ± 0.997
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.306 ± 1.993
ProCys: 1.653 ± 0.997
ProAsp: 1.653 ± 1.232
ProGlu: 3.306 ± 0.236
ProPhe: 0.0 ± 0.0
ProGly: 4.959 ± 2.99
ProHis: 1.653 ± 1.232
ProIle: 3.306 ± 0.236
ProLys: 3.306 ± 0.236
ProLeu: 4.959 ± 2.99
ProMet: 3.306 ± 0.236
ProAsn: 0.0 ± 0.0
ProPro: 4.959 ± 0.761
ProGln: 4.959 ± 2.99
ProArg: 4.959 ± 1.468
ProSer: 6.612 ± 0.472
ProThr: 3.306 ± 0.236
ProVal: 6.612 ± 1.757
ProTrp: 1.653 ± 1.232
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.653 ± 0.997
GlnCys: 1.653 ± 1.232
GlnAsp: 6.612 ± 1.757
GlnGlu: 0.0 ± 0.0
GlnPhe: 0.0 ± 0.0
GlnGly: 1.653 ± 1.232
GlnHis: 0.0 ± 0.0
GlnIle: 3.306 ± 0.236
GlnLys: 0.0 ± 0.0
GlnLeu: 0.0 ± 0.0
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 1.653 ± 1.232
GlnGln: 0.0 ± 0.0
GlnArg: 3.306 ± 0.236
GlnSer: 0.0 ± 0.0
GlnThr: 0.0 ± 0.0
GlnVal: 3.306 ± 1.993
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.653 ± 0.997
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.959 ± 1.468
ArgCys: 0.0 ± 0.0
ArgAsp: 0.0 ± 0.0
ArgGlu: 0.0 ± 0.0
ArgPhe: 1.653 ± 1.232
ArgGly: 3.306 ± 0.236
ArgHis: 1.653 ± 1.232
ArgIle: 3.306 ± 1.993
ArgLys: 3.306 ± 0.236
ArgLeu: 8.264 ± 1.704
ArgMet: 0.0 ± 0.0
ArgAsn: 0.0 ± 0.0
ArgPro: 1.653 ± 0.997
ArgGln: 0.0 ± 0.0
ArgArg: 3.306 ± 0.236
ArgSer: 1.653 ± 1.232
ArgThr: 3.306 ± 0.236
ArgVal: 4.959 ± 2.99
ArgTrp: 3.306 ± 0.236
ArgTyr: 1.653 ± 1.232
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.306 ± 1.993
SerCys: 1.653 ± 0.997
SerAsp: 3.306 ± 2.465
SerGlu: 4.959 ± 0.761
SerPhe: 3.306 ± 1.993
SerGly: 6.612 ± 0.472
SerHis: 1.653 ± 1.232
SerIle: 9.917 ± 2.936
SerLys: 4.959 ± 1.468
SerLeu: 4.959 ± 2.99
SerMet: 1.653 ± 0.997
SerAsn: 3.306 ± 0.236
SerPro: 3.306 ± 2.465
SerGln: 0.0 ± 0.0
SerArg: 3.306 ± 2.465
SerSer: 8.264 ± 4.983
SerThr: 3.306 ± 0.236
SerVal: 1.653 ± 0.997
SerTrp: 0.0 ± 0.0
SerTyr: 6.612 ± 1.757
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.264 ± 2.754
ThrCys: 0.0 ± 0.0
ThrAsp: 6.612 ± 3.986
ThrGlu: 3.306 ± 1.993
ThrPhe: 1.653 ± 1.232
ThrGly: 3.306 ± 0.236
ThrHis: 1.653 ± 1.232
ThrIle: 0.0 ± 0.0
ThrLys: 0.0 ± 0.0
ThrLeu: 1.653 ± 1.232
ThrMet: 1.653 ± 1.232
ThrAsn: 4.959 ± 0.761
ThrPro: 4.959 ± 2.99
ThrGln: 1.653 ± 1.232
ThrArg: 0.0 ± 0.0
ThrSer: 6.612 ± 2.7
ThrThr: 8.264 ± 2.754
ThrVal: 8.264 ± 2.754
ThrTrp: 3.306 ± 0.236
ThrTyr: 3.306 ± 1.993
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.959 ± 0.761
ValCys: 1.653 ± 1.232
ValAsp: 4.959 ± 1.468
ValGlu: 3.306 ± 0.236
ValPhe: 6.612 ± 0.472
ValGly: 1.653 ± 0.997
ValHis: 3.306 ± 1.993
ValIle: 3.306 ± 0.236
ValLys: 1.653 ± 1.232
ValLeu: 4.959 ± 0.761
ValMet: 3.306 ± 1.993
ValAsn: 3.306 ± 1.993
ValPro: 4.959 ± 2.99
ValGln: 0.0 ± 0.0
ValArg: 3.306 ± 0.236
ValSer: 1.653 ± 1.232
ValThr: 3.306 ± 1.993
ValVal: 6.612 ± 2.7
ValTrp: 0.0 ± 0.0
ValTyr: 4.959 ± 2.99
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.653 ± 1.232
TrpGlu: 3.306 ± 2.465
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 3.306 ± 0.236
TrpIle: 1.653 ± 1.232
TrpLys: 1.653 ± 1.232
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 1.653 ± 0.997
TrpPro: 1.653 ± 1.232
TrpGln: 0.0 ± 0.0
TrpArg: 1.653 ± 0.997
TrpSer: 1.653 ± 1.232
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.653 ± 0.997
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 1.653 ± 1.232
TyrAsp: 3.306 ± 1.993
TyrGlu: 3.306 ± 2.465
TyrPhe: 0.0 ± 0.0
TyrGly: 3.306 ± 1.993
TyrHis: 0.0 ± 0.0
TyrIle: 4.959 ± 0.761
TyrLys: 1.653 ± 1.232
TyrLeu: 1.653 ± 1.232
TyrMet: 0.0 ± 0.0
TyrAsn: 1.653 ± 1.232
TyrPro: 3.306 ± 1.993
TyrGln: 3.306 ± 2.465
TyrArg: 4.959 ± 0.761
TyrSer: 3.306 ± 1.993
TyrThr: 4.959 ± 0.761
TyrVal: 4.959 ± 1.468
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.653 ± 0.997
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (606 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
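A protein-level bootstrap of this kind could look like the sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported as a population standard deviation. The source does not say which spread statistic the database actually uses, and the function names and toy sequences are hypothetical.

```python
import random
import statistics


def permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    # Overlapping pairs, counted within each protein separately.
    pairs = [s[i:i + 2] for s in proteins for i in range(len(s) - 1)]
    return 1000.0 * pairs.count(dipeptide) / len(pairs) if pairs else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(permille(sample, dipeptide))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.pstdev(estimates)


# Hypothetical toy proteome with two proteins.
toy = ["MKKLAA", "AAKW"]
print(permille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

With only two proteins, each replicate can take just a handful of distinct compositions, so error estimates from such a small proteome should be read with caution.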

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski