Amino acid dipepetide frequency for Encephalomyocarditis virus (strain Rueckert) (EMCV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.441AlaAla: 7.441 ± 3.641
0.0AlaCys: 0.0 ± 0.0
3.721AlaAsp: 3.721 ± 1.821
3.307AlaGlu: 3.307 ± 2.003
4.547AlaPhe: 4.547 ± 2.008
8.268AlaGly: 8.268 ± 3.651
1.24AlaHis: 1.24 ± 0.548
2.067AlaIle: 2.067 ± 0.913
4.547AlaLys: 4.547 ± 2.008
5.788AlaLeu: 5.788 ± 4.372
4.134AlaMet: 4.134 ± 1.826
2.48AlaAsn: 2.48 ± 1.095
5.374AlaPro: 5.374 ± 2.373
2.894AlaGln: 2.894 ± 1.278
2.894AlaArg: 2.894 ± 2.186
5.374AlaSer: 5.374 ± 1.09
4.547AlaThr: 4.547 ± 2.008
7.854AlaVal: 7.854 ± 3.469
0.413AlaTrp: 0.413 ± 0.183
1.654AlaTyr: 1.654 ± 0.73
0.0AlaXaa: 0.0 ± 0.0
Cys
2.067CysAla: 2.067 ± 2.551
0.0CysCys: 0.0 ± 0.0
0.827CysAsp: 0.827 ± 0.365
1.24CysGlu: 1.24 ± 0.548
0.413CysPhe: 0.413 ± 0.183
0.413CysGly: 0.413 ± 0.183
0.0CysHis: 0.0 ± 0.0
0.827CysIle: 0.827 ± 0.365
0.827CysLys: 0.827 ± 3.099
2.067CysLeu: 2.067 ± 0.913
0.413CysMet: 0.413 ± 3.281
0.413CysAsn: 0.413 ± 0.183
2.067CysPro: 2.067 ± 0.913
1.24CysGln: 1.24 ± 6.38
0.827CysArg: 0.827 ± 3.099
1.24CysSer: 1.24 ± 0.548
0.0CysThr: 0.0 ± 0.0
0.413CysVal: 0.413 ± 0.183
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.48AspAla: 2.48 ± 2.368
0.413AspCys: 0.413 ± 3.281
2.067AspAsp: 2.067 ± 2.551
2.48AspGlu: 2.48 ± 1.095
4.134AspPhe: 4.134 ± 1.826
3.307AspGly: 3.307 ± 2.003
0.827AspHis: 0.827 ± 0.365
3.721AspIle: 3.721 ± 1.821
1.654AspLys: 1.654 ± 2.733
4.547AspLeu: 4.547 ± 2.008
0.827AspMet: 0.827 ± 0.365
2.894AspAsn: 2.894 ± 2.186
3.721AspPro: 3.721 ± 1.821
2.067AspGln: 2.067 ± 0.913
1.24AspArg: 1.24 ± 0.548
2.067AspSer: 2.067 ± 2.551
0.827AspThr: 0.827 ± 0.365
5.374AspVal: 5.374 ± 2.373
2.48AspTrp: 2.48 ± 1.095
2.067AspTyr: 2.067 ± 6.015
0.0AspXaa: 0.0 ± 0.0
Glu
3.307GluAla: 3.307 ± 2.003
0.827GluCys: 0.827 ± 0.365
2.067GluAsp: 2.067 ± 2.551
4.547GluGlu: 4.547 ± 2.008
1.654GluPhe: 1.654 ± 0.73
2.894GluGly: 2.894 ± 1.278
1.654GluHis: 1.654 ± 0.73
2.067GluIle: 2.067 ± 0.913
4.961GluLys: 4.961 ± 2.191
2.48GluLeu: 2.48 ± 1.095
0.827GluMet: 0.827 ± 0.365
3.307GluAsn: 3.307 ± 1.461
1.24GluPro: 1.24 ± 0.548
2.48GluGln: 2.48 ± 2.368
4.547GluArg: 4.547 ± 1.456
3.307GluSer: 3.307 ± 1.461
3.721GluThr: 3.721 ± 1.643
3.721GluVal: 3.721 ± 1.643
0.827GluTrp: 0.827 ± 0.365
2.894GluTyr: 2.894 ± 1.278
0.0GluXaa: 0.0 ± 0.0
Phe
4.961PheAla: 4.961 ± 1.273
1.24PheCys: 1.24 ± 0.548
4.961PheAsp: 4.961 ± 1.273
4.547PheGlu: 4.547 ± 2.008
2.48PhePhe: 2.48 ± 1.095
2.48PheGly: 2.48 ± 1.095
0.827PheHis: 0.827 ± 0.365
1.654PheIle: 1.654 ± 0.73
1.24PheLys: 1.24 ± 0.548
4.547PheLeu: 4.547 ± 2.008
1.24PheMet: 1.24 ± 2.916
1.654PheAsn: 1.654 ± 0.73
2.48PhePro: 2.48 ± 2.368
2.067PheGln: 2.067 ± 0.913
5.374PheArg: 5.374 ± 1.09
4.547PheSer: 4.547 ± 1.456
3.721PheThr: 3.721 ± 1.643
2.48PheVal: 2.48 ± 1.095
1.654PheTrp: 1.654 ± 0.73
1.654PheTyr: 1.654 ± 2.733
0.0PheXaa: 0.0 ± 0.0
Gly
3.307GlyAla: 3.307 ± 1.461
1.654GlyCys: 1.654 ± 2.733
2.067GlyAsp: 2.067 ± 0.913
2.067GlyGlu: 2.067 ± 0.913
2.48GlyPhe: 2.48 ± 2.368
1.654GlyGly: 1.654 ± 0.73
0.827GlyHis: 0.827 ± 0.365
2.894GlyIle: 2.894 ± 2.186
4.134GlyLys: 4.134 ± 1.638
4.961GlyLeu: 4.961 ± 2.191
1.654GlyMet: 1.654 ± 0.73
3.307GlyAsn: 3.307 ± 1.461
4.961GlyPro: 4.961 ± 1.273
1.654GlyGln: 1.654 ± 0.73
3.307GlyArg: 3.307 ± 5.467
4.961GlySer: 4.961 ± 4.737
5.788GlyThr: 5.788 ± 2.556
4.134GlyVal: 4.134 ± 1.826
1.24GlyTrp: 1.24 ± 0.548
2.067GlyTyr: 2.067 ± 0.913
0.0GlyXaa: 0.0 ± 0.0
His
2.894HisAla: 2.894 ± 1.278
0.827HisCys: 0.827 ± 0.365
1.24HisAsp: 1.24 ± 0.548
0.827HisGlu: 0.827 ± 0.365
0.827HisPhe: 0.827 ± 0.365
0.827HisGly: 0.827 ± 0.365
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.413HisLys: 0.413 ± 0.183
0.0HisLeu: 0.0 ± 0.0
0.413HisMet: 0.413 ± 0.183
0.0HisAsn: 0.0 ± 0.0
0.827HisPro: 0.827 ± 0.365
0.827HisGln: 0.827 ± 0.365
0.413HisArg: 0.413 ± 0.183
2.067HisSer: 2.067 ± 0.913
0.827HisThr: 0.827 ± 0.365
1.654HisVal: 1.654 ± 0.73
0.413HisTrp: 0.413 ± 0.183
1.654HisTyr: 1.654 ± 0.73
0.0HisXaa: 0.0 ± 0.0
Ile
5.374IleAla: 5.374 ± 2.373
0.0IleCys: 0.0 ± 0.0
1.24IleAsp: 1.24 ± 2.916
2.067IleGlu: 2.067 ± 0.913
1.654IlePhe: 1.654 ± 0.73
2.067IleGly: 2.067 ± 2.551
1.654IleHis: 1.654 ± 0.73
2.067IleIle: 2.067 ± 0.913
1.24IleLys: 1.24 ± 2.916
2.894IleLeu: 2.894 ± 1.278
1.24IleMet: 1.24 ± 0.548
0.827IleAsn: 0.827 ± 3.099
2.48IlePro: 2.48 ± 1.095
1.24IleGln: 1.24 ± 0.548
2.067IleArg: 2.067 ± 0.913
4.547IleSer: 4.547 ± 1.456
3.307IleThr: 3.307 ± 1.461
2.067IleVal: 2.067 ± 0.913
0.827IleTrp: 0.827 ± 0.365
0.827IleTyr: 0.827 ± 0.365
0.0IleXaa: 0.0 ± 0.0
Lys
3.721LysAla: 3.721 ± 1.643
1.24LysCys: 1.24 ± 2.916
2.48LysAsp: 2.48 ± 2.368
3.721LysGlu: 3.721 ± 1.643
4.134LysPhe: 4.134 ± 1.638
2.067LysGly: 2.067 ± 0.913
1.24LysHis: 1.24 ± 0.548
3.721LysIle: 3.721 ± 1.643
2.067LysLys: 2.067 ± 0.913
0.827LysLeu: 0.827 ± 0.365
1.24LysMet: 1.24 ± 0.548
2.067LysAsn: 2.067 ± 0.913
2.894LysPro: 2.894 ± 2.186
4.134LysGln: 4.134 ± 5.102
3.307LysArg: 3.307 ± 2.003
2.067LysSer: 2.067 ± 6.015
5.374LysThr: 5.374 ± 2.373
4.134LysVal: 4.134 ± 1.826
0.0LysTrp: 0.0 ± 0.0
0.827LysTyr: 0.827 ± 0.365
0.0LysXaa: 0.0 ± 0.0
Leu
5.788LeuAla: 5.788 ± 2.556
1.24LeuCys: 1.24 ± 0.548
4.547LeuAsp: 4.547 ± 2.008
7.441LeuGlu: 7.441 ± 3.641
3.721LeuPhe: 3.721 ± 1.643
4.547LeuGly: 4.547 ± 2.008
0.827LeuHis: 0.827 ± 0.365
2.48LeuIle: 2.48 ± 1.095
3.721LeuLys: 3.721 ± 1.643
7.854LeuLeu: 7.854 ± 0.005
0.413LeuMet: 0.413 ± 0.183
2.067LeuAsn: 2.067 ± 0.913
6.614LeuPro: 6.614 ± 2.921
2.48LeuGln: 2.48 ± 1.095
5.374LeuArg: 5.374 ± 1.09
8.681LeuSer: 8.681 ± 3.834
6.201LeuThr: 6.201 ± 2.738
5.788LeuVal: 5.788 ± 2.556
0.0LeuTrp: 0.0 ± 0.0
2.067LeuTyr: 2.067 ± 0.913
0.0LeuXaa: 0.0 ± 0.0
Met
2.894MetAla: 2.894 ± 1.278
0.413MetCys: 0.413 ± 3.281
3.307MetAsp: 3.307 ± 1.461
1.24MetGlu: 1.24 ± 0.548
1.24MetPhe: 1.24 ± 2.916
1.24MetGly: 1.24 ± 2.916
0.413MetHis: 0.413 ± 0.183
0.413MetIle: 0.413 ± 0.183
0.827MetLys: 0.827 ± 0.365
2.48MetLeu: 2.48 ± 1.095
0.827MetMet: 0.827 ± 0.365
1.654MetAsn: 1.654 ± 0.73
0.413MetPro: 0.413 ± 0.183
0.827MetGln: 0.827 ± 0.365
0.413MetArg: 0.413 ± 0.183
0.0MetSer: 0.0 ± 0.0
0.827MetThr: 0.827 ± 0.365
2.48MetVal: 2.48 ± 1.095
0.0MetTrp: 0.0 ± 0.0
0.413MetTyr: 0.413 ± 0.183
0.0MetXaa: 0.0 ± 0.0
Asn
4.134AsnAla: 4.134 ± 1.826
0.413AsnCys: 0.413 ± 0.183
1.24AsnAsp: 1.24 ± 0.548
2.48AsnGlu: 2.48 ± 2.368
2.48AsnPhe: 2.48 ± 1.095
4.547AsnGly: 4.547 ± 1.456
1.24AsnHis: 1.24 ± 0.548
1.654AsnIle: 1.654 ± 2.733
1.24AsnLys: 1.24 ± 0.548
2.48AsnLeu: 2.48 ± 1.095
0.827AsnMet: 0.827 ± 0.365
2.067AsnAsn: 2.067 ± 0.913
2.48AsnPro: 2.48 ± 2.368
2.48AsnGln: 2.48 ± 1.095
2.48AsnArg: 2.48 ± 1.095
4.547AsnSer: 4.547 ± 4.919
6.614AsnThr: 6.614 ± 0.543
0.827AsnVal: 0.827 ± 0.365
0.0AsnTrp: 0.0 ± 0.0
1.24AsnTyr: 1.24 ± 0.548
0.0AsnXaa: 0.0 ± 0.0
Pro
2.894ProAla: 2.894 ± 1.278
1.24ProCys: 1.24 ± 2.916
4.134ProAsp: 4.134 ± 5.102
3.307ProGlu: 3.307 ± 1.461
5.374ProPhe: 5.374 ± 4.554
3.307ProGly: 3.307 ± 1.461
1.24ProHis: 1.24 ± 0.548
2.48ProIle: 2.48 ± 1.095
1.24ProLys: 1.24 ± 2.916
6.201ProLeu: 6.201 ± 2.738
0.827ProMet: 0.827 ± 0.365
3.307ProAsn: 3.307 ± 1.461
3.307ProPro: 3.307 ± 1.461
2.067ProGln: 2.067 ± 0.913
4.547ProArg: 4.547 ± 8.383
1.654ProSer: 1.654 ± 0.73
7.441ProThr: 7.441 ± 0.178
6.201ProVal: 6.201 ± 2.738
1.654ProTrp: 1.654 ± 0.73
2.067ProTyr: 2.067 ± 0.913
0.0ProXaa: 0.0 ± 0.0
Gln
3.307GlnAla: 3.307 ± 1.461
0.413GlnCys: 0.413 ± 0.183
0.827GlnAsp: 0.827 ± 0.365
3.307GlnGlu: 3.307 ± 1.461
2.894GlnPhe: 2.894 ± 1.278
3.721GlnGly: 3.721 ± 1.643
0.413GlnHis: 0.413 ± 0.183
0.827GlnIle: 0.827 ± 0.365
1.654GlnLys: 1.654 ± 0.73
4.961GlnLeu: 4.961 ± 2.191
1.24GlnMet: 1.24 ± 2.916
2.067GlnAsn: 2.067 ± 0.913
2.48GlnPro: 2.48 ± 2.368
2.067GlnGln: 2.067 ± 2.551
1.24GlnArg: 1.24 ± 2.916
2.067GlnSer: 2.067 ± 2.551
4.547GlnThr: 4.547 ± 2.008
4.961GlnVal: 4.961 ± 1.273
0.413GlnTrp: 0.413 ± 0.183
1.24GlnTyr: 1.24 ± 0.548
0.0GlnXaa: 0.0 ± 0.0
Arg
3.721ArgAla: 3.721 ± 1.643
1.24ArgCys: 1.24 ± 2.916
2.067ArgAsp: 2.067 ± 6.015
2.067ArgGlu: 2.067 ± 0.913
1.654ArgPhe: 1.654 ± 0.73
2.48ArgGly: 2.48 ± 2.368
1.654ArgHis: 1.654 ± 0.73
2.894ArgIle: 2.894 ± 1.278
3.307ArgLys: 3.307 ± 2.003
4.134ArgLeu: 4.134 ± 1.826
0.827ArgMet: 0.827 ± 0.365
3.307ArgAsn: 3.307 ± 8.931
5.788ArgPro: 5.788 ± 7.835
2.48ArgGln: 2.48 ± 1.095
2.067ArgArg: 2.067 ± 6.015
2.067ArgSer: 2.067 ± 2.551
4.134ArgThr: 4.134 ± 1.826
4.134ArgVal: 4.134 ± 1.638
1.24ArgTrp: 1.24 ± 0.548
0.827ArgTyr: 0.827 ± 0.365
0.0ArgXaa: 0.0 ± 0.0
Ser
5.788SerAla: 5.788 ± 0.908
1.24SerCys: 1.24 ± 0.548
4.547SerAsp: 4.547 ± 1.456
2.067SerGlu: 2.067 ± 0.913
3.721SerPhe: 3.721 ± 1.821
4.547SerGly: 4.547 ± 4.919
0.0SerHis: 0.0 ± 0.0
2.48SerIle: 2.48 ± 2.368
4.961SerLys: 4.961 ± 2.191
7.854SerLeu: 7.854 ± 3.469
2.067SerMet: 2.067 ± 0.806
4.961SerAsn: 4.961 ± 4.737
4.547SerPro: 4.547 ± 0.324
2.894SerGln: 2.894 ± 1.278
2.48SerArg: 2.48 ± 1.095
5.374SerSer: 5.374 ± 4.554
5.788SerThr: 5.788 ± 2.556
4.547SerVal: 4.547 ± 4.919
0.827SerTrp: 0.827 ± 0.365
1.654SerTyr: 1.654 ± 2.733
0.0SerXaa: 0.0 ± 0.0
Thr
6.201ThrAla: 6.201 ± 0.725
1.24ThrCys: 1.24 ± 0.548
2.48ThrAsp: 2.48 ± 1.095
2.067ThrGlu: 2.067 ± 0.913
4.547ThrPhe: 4.547 ± 2.008
5.788ThrGly: 5.788 ± 2.556
1.654ThrHis: 1.654 ± 0.73
2.894ThrIle: 2.894 ± 1.278
2.48ThrLys: 2.48 ± 2.368
9.508ThrLeu: 9.508 ± 4.199
1.654ThrMet: 1.654 ± 0.73
3.721ThrAsn: 3.721 ± 1.643
4.547ThrPro: 4.547 ± 2.008
3.721ThrGln: 3.721 ± 1.643
2.067ThrArg: 2.067 ± 0.913
7.854ThrSer: 7.854 ± 3.469
3.307ThrThr: 3.307 ± 1.461
6.201ThrVal: 6.201 ± 2.738
0.827ThrTrp: 0.827 ± 0.365
3.307ThrTyr: 3.307 ± 1.461
0.0ThrXaa: 0.0 ± 0.0
Val
5.788ValAla: 5.788 ± 2.556
0.827ValCys: 0.827 ± 0.365
3.307ValAsp: 3.307 ± 1.461
3.307ValGlu: 3.307 ± 1.461
7.028ValPhe: 7.028 ± 0.36
2.067ValGly: 2.067 ± 0.913
1.24ValHis: 1.24 ± 0.548
3.307ValIle: 3.307 ± 2.003
5.788ValLys: 5.788 ± 2.556
5.374ValLeu: 5.374 ± 1.09
1.24ValMet: 1.24 ± 0.548
2.894ValAsn: 2.894 ± 1.278
3.721ValPro: 3.721 ± 1.643
4.134ValGln: 4.134 ± 1.826
4.134ValArg: 4.134 ± 1.638
5.374ValSer: 5.374 ± 2.373
4.547ValThr: 4.547 ± 2.008
6.614ValVal: 6.614 ± 2.921
0.827ValTrp: 0.827 ± 0.365
3.721ValTyr: 3.721 ± 1.643
0.0ValXaa: 0.0 ± 0.0
Trp
0.827TrpAla: 0.827 ± 0.365
0.827TrpCys: 0.827 ± 0.365
0.413TrpAsp: 0.413 ± 0.183
0.827TrpGlu: 0.827 ± 0.365
0.413TrpPhe: 0.413 ± 0.183
0.413TrpGly: 0.413 ± 0.183
0.0TrpHis: 0.0 ± 0.0
0.413TrpIle: 0.413 ± 0.183
0.827TrpLys: 0.827 ± 0.365
0.413TrpLeu: 0.413 ± 0.183
0.0TrpMet: 0.0 ± 0.094
0.413TrpAsn: 0.413 ± 0.183
0.827TrpPro: 0.827 ± 0.451
0.827TrpGln: 0.827 ± 0.365
0.827TrpArg: 0.827 ± 0.365
0.827TrpSer: 0.827 ± 0.365
2.48TrpThr: 2.48 ± 1.095
0.413TrpVal: 0.413 ± 0.183
0.0TrpTrp: 0.0 ± 0.0
1.24TrpTyr: 1.24 ± 0.548
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.654TyrAla: 1.654 ± 0.73
0.0TyrCys: 0.0 ± 0.0
1.654TyrAsp: 1.654 ± 0.73
0.413TyrGlu: 0.413 ± 0.183
0.413TyrPhe: 0.413 ± 0.183
2.48TyrGly: 2.48 ± 5.832
0.0TyrHis: 0.0 ± 0.0
0.827TyrIle: 0.827 ± 0.365
4.134TyrLys: 4.134 ± 5.102
2.48TyrLeu: 2.48 ± 1.095
0.413TyrMet: 0.413 ± 0.183
2.067TyrAsn: 2.067 ± 0.913
3.307TyrPro: 3.307 ± 1.461
2.067TyrGln: 2.067 ± 0.913
2.48TyrArg: 2.48 ± 1.095
3.721TyrSer: 3.721 ± 1.643
2.067TyrThr: 2.067 ± 0.913
1.24TyrVal: 1.24 ± 0.548
0.0TyrTrp: 0.0 ± 0.0
1.654TyrTyr: 1.654 ± 0.73
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2420 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski