Amino acid dipepetide frequency for Beihai rhabdo-like virus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.182AlaAla: 8.182 ± 2.295
2.49AlaCys: 2.49 ± 0.88
2.134AlaAsp: 2.134 ± 0.632
5.336AlaGlu: 5.336 ± 2.828
2.846AlaPhe: 2.846 ± 1.335
3.202AlaGly: 3.202 ± 1.749
1.779AlaHis: 1.779 ± 1.654
3.913AlaIle: 3.913 ± 1.046
3.913AlaLys: 3.913 ± 2.081
10.317AlaLeu: 10.317 ± 2.079
1.423AlaMet: 1.423 ± 0.667
1.067AlaAsn: 1.067 ± 1.578
2.49AlaPro: 2.49 ± 0.925
2.846AlaGln: 2.846 ± 1.071
4.98AlaArg: 4.98 ± 1.006
5.692AlaSer: 5.692 ± 1.04
6.048AlaThr: 6.048 ± 2.604
3.202AlaVal: 3.202 ± 0.549
0.711AlaTrp: 0.711 ± 0.38
2.49AlaTyr: 2.49 ± 1.21
0.0AlaXaa: 0.0 ± 0.0
Cys
1.423CysAla: 1.423 ± 0.651
0.356CysCys: 0.356 ± 0.19
0.711CysAsp: 0.711 ± 0.38
1.067CysGlu: 1.067 ± 1.281
0.356CysPhe: 0.356 ± 0.442
0.0CysGly: 0.0 ± 0.0
1.423CysHis: 1.423 ± 1.412
1.779CysIle: 1.779 ± 0.949
0.711CysLys: 0.711 ± 0.38
3.202CysLeu: 3.202 ± 0.763
0.711CysMet: 0.711 ± 0.38
0.711CysAsn: 0.711 ± 0.38
0.711CysPro: 0.711 ± 0.38
1.067CysGln: 1.067 ± 0.569
0.711CysArg: 0.711 ± 0.38
1.067CysSer: 1.067 ± 0.316
1.423CysThr: 1.423 ± 0.4
1.423CysVal: 1.423 ± 0.4
0.0CysTrp: 0.0 ± 0.0
1.423CysTyr: 1.423 ± 0.4
0.0CysXaa: 0.0 ± 0.0
Asp
3.557AspAla: 3.557 ± 0.78
1.423AspCys: 1.423 ± 1.628
2.134AspAsp: 2.134 ± 0.51
4.625AspGlu: 4.625 ± 1.027
2.846AspPhe: 2.846 ± 0.801
2.846AspGly: 2.846 ± 1.518
1.067AspHis: 1.067 ± 0.569
3.202AspIle: 3.202 ± 1.151
2.134AspLys: 2.134 ± 0.799
6.048AspLeu: 6.048 ± 0.713
0.711AspMet: 0.711 ± 0.322
1.779AspAsn: 1.779 ± 0.949
2.846AspPro: 2.846 ± 1.06
1.423AspGln: 1.423 ± 0.759
1.423AspArg: 1.423 ± 0.667
2.49AspSer: 2.49 ± 2.313
1.779AspThr: 1.779 ± 0.702
2.134AspVal: 2.134 ± 0.705
0.0AspTrp: 0.0 ± 0.0
3.202AspTyr: 3.202 ± 0.549
0.0AspXaa: 0.0 ± 0.0
Glu
5.336GluAla: 5.336 ± 2.095
0.356GluCys: 0.356 ± 0.19
2.49GluAsp: 2.49 ± 0.88
4.625GluGlu: 4.625 ± 1.939
4.269GluPhe: 4.269 ± 1.08
4.625GluGly: 4.625 ± 1.0
1.779GluHis: 1.779 ± 0.949
4.625GluIle: 4.625 ± 1.862
2.846GluLys: 2.846 ± 0.568
4.269GluLeu: 4.269 ± 0.844
1.779GluMet: 1.779 ± 0.541
1.779GluAsn: 1.779 ± 0.553
1.423GluPro: 1.423 ± 1.063
6.048GluGln: 6.048 ± 3.45
2.134GluArg: 2.134 ± 0.51
3.557GluSer: 3.557 ± 1.004
3.557GluThr: 3.557 ± 1.004
4.625GluVal: 4.625 ± 1.193
1.067GluTrp: 1.067 ± 0.759
3.913GluTyr: 3.913 ± 0.735
0.0GluXaa: 0.0 ± 0.0
Phe
3.913PheAla: 3.913 ± 1.15
1.423PheCys: 1.423 ± 0.4
4.269PheAsp: 4.269 ± 0.653
2.846PheGlu: 2.846 ± 2.685
1.423PhePhe: 1.423 ± 0.4
3.202PheGly: 3.202 ± 1.242
1.067PheHis: 1.067 ± 1.221
1.779PheIle: 1.779 ± 1.049
3.202PheLys: 3.202 ± 0.939
3.557PheLeu: 3.557 ± 2.359
1.067PheMet: 1.067 ± 1.628
2.134PheAsn: 2.134 ± 1.168
2.134PhePro: 2.134 ± 1.479
2.134PheGln: 2.134 ± 0.705
3.202PheArg: 3.202 ± 1.242
3.913PheSer: 3.913 ± 1.239
1.779PheThr: 1.779 ± 0.949
1.423PheVal: 1.423 ± 0.4
0.711PheTrp: 0.711 ± 0.38
1.423PheTyr: 1.423 ± 1.155
0.0PheXaa: 0.0 ± 0.0
Gly
1.779GlyAla: 1.779 ± 1.636
0.711GlyCys: 0.711 ± 0.883
2.49GlyAsp: 2.49 ± 0.696
2.134GlyGlu: 2.134 ± 1.101
1.779GlyPhe: 1.779 ± 0.622
2.846GlyGly: 2.846 ± 1.06
1.423GlyHis: 1.423 ± 1.155
2.134GlyIle: 2.134 ± 0.968
3.913GlyLys: 3.913 ± 0.94
4.625GlyLeu: 4.625 ± 1.917
2.134GlyMet: 2.134 ± 0.705
2.134GlyAsn: 2.134 ± 1.479
2.134GlyPro: 2.134 ± 0.632
1.067GlyGln: 1.067 ± 0.783
2.846GlyArg: 2.846 ± 2.31
4.98GlySer: 4.98 ± 1.893
2.49GlyThr: 2.49 ± 0.92
4.269GlyVal: 4.269 ± 1.41
0.356GlyTrp: 0.356 ± 0.19
3.202GlyTyr: 3.202 ± 0.939
0.0GlyXaa: 0.0 ± 0.0
His
2.134HisAla: 2.134 ± 0.705
0.356HisCys: 0.356 ± 0.19
2.846HisAsp: 2.846 ± 1.233
2.846HisGlu: 2.846 ± 1.06
0.711HisPhe: 0.711 ± 1.295
1.423HisGly: 1.423 ± 1.155
0.711HisHis: 0.711 ± 0.334
1.779HisIle: 1.779 ± 0.541
1.067HisLys: 1.067 ± 0.316
2.49HisLeu: 2.49 ± 0.696
0.356HisMet: 0.356 ± 0.19
1.067HisAsn: 1.067 ± 0.569
2.49HisPro: 2.49 ± 0.696
0.711HisGln: 0.711 ± 1.295
2.49HisArg: 2.49 ± 1.41
1.779HisSer: 1.779 ± 1.155
2.134HisThr: 2.134 ± 1.168
1.423HisVal: 1.423 ± 0.759
0.356HisTrp: 0.356 ± 0.19
1.423HisTyr: 1.423 ± 0.4
0.0HisXaa: 0.0 ± 0.0
Ile
4.625IleAla: 4.625 ± 1.987
1.423IleCys: 1.423 ± 0.4
3.202IleAsp: 3.202 ± 1.242
2.49IleGlu: 2.49 ± 0.925
1.779IlePhe: 1.779 ± 0.541
3.202IleGly: 3.202 ± 2.181
2.134IleHis: 2.134 ± 0.705
2.134IleIle: 2.134 ± 0.51
3.913IleLys: 3.913 ± 1.054
4.625IleLeu: 4.625 ± 2.245
1.423IleMet: 1.423 ± 0.759
1.067IleAsn: 1.067 ± 0.783
3.202IlePro: 3.202 ± 2.575
2.846IleGln: 2.846 ± 1.615
3.202IleArg: 3.202 ± 1.228
4.625IleSer: 4.625 ± 2.382
1.067IleThr: 1.067 ± 0.569
2.846IleVal: 2.846 ± 1.518
0.0IleTrp: 0.0 ± 0.0
2.134IleTyr: 2.134 ± 0.632
0.0IleXaa: 0.0 ± 0.0
Lys
3.202LysAla: 3.202 ± 1.276
0.711LysCys: 0.711 ± 0.38
2.49LysAsp: 2.49 ± 0.535
3.202LysGlu: 3.202 ± 1.276
2.134LysPhe: 2.134 ± 0.763
1.423LysGly: 1.423 ± 0.4
1.423LysHis: 1.423 ± 1.173
3.557LysIle: 3.557 ± 0.718
4.625LysLys: 4.625 ± 0.849
4.98LysLeu: 4.98 ± 3.99
1.067LysMet: 1.067 ± 0.759
2.134LysAsn: 2.134 ± 0.705
2.49LysPro: 2.49 ± 0.92
1.779LysGln: 1.779 ± 0.622
6.403LysArg: 6.403 ± 1.336
4.269LysSer: 4.269 ± 0.818
3.557LysThr: 3.557 ± 1.945
4.269LysVal: 4.269 ± 1.08
1.423LysTrp: 1.423 ± 0.759
3.202LysTyr: 3.202 ± 1.242
0.0LysXaa: 0.0 ± 0.0
Leu
10.317LeuAla: 10.317 ± 3.092
1.423LeuCys: 1.423 ± 0.4
4.269LeuAsp: 4.269 ± 1.524
7.826LeuGlu: 7.826 ± 1.837
4.625LeuPhe: 4.625 ± 2.055
5.692LeuGly: 5.692 ± 3.558
2.49LeuHis: 2.49 ± 1.329
4.269LeuIle: 4.269 ± 0.818
2.846LeuLys: 2.846 ± 1.146
11.384LeuLeu: 11.384 ± 2.074
1.067LeuMet: 1.067 ± 0.759
3.913LeuAsn: 3.913 ± 1.922
2.846LeuPro: 2.846 ± 1.06
3.913LeuGln: 3.913 ± 1.09
8.894LeuArg: 8.894 ± 1.563
9.249LeuSer: 9.249 ± 1.603
8.538LeuThr: 8.538 ± 1.704
4.269LeuVal: 4.269 ± 0.736
0.356LeuTrp: 0.356 ± 0.442
4.269LeuTyr: 4.269 ± 1.215
0.0LeuXaa: 0.0 ± 0.0
Met
0.356MetAla: 0.356 ± 0.19
0.356MetCys: 0.356 ± 1.392
1.423MetAsp: 1.423 ± 0.4
2.134MetGlu: 2.134 ± 1.101
0.711MetPhe: 0.711 ± 0.38
1.423MetGly: 1.423 ± 0.667
0.356MetHis: 0.356 ± 0.19
1.423MetIle: 1.423 ± 0.4
0.711MetLys: 0.711 ± 0.38
2.134MetLeu: 2.134 ± 1.168
0.356MetMet: 0.356 ± 0.442
2.134MetAsn: 2.134 ± 0.968
1.779MetPro: 1.779 ± 0.622
0.356MetGln: 0.356 ± 0.19
0.711MetArg: 0.711 ± 0.38
1.067MetSer: 1.067 ± 0.783
2.134MetThr: 2.134 ± 1.519
2.134MetVal: 2.134 ± 1.168
0.0MetTrp: 0.0 ± 0.0
0.711MetTyr: 0.711 ± 0.334
0.0MetXaa: 0.0 ± 0.0
Asn
4.269AsnAla: 4.269 ± 1.783
0.356AsnCys: 0.356 ± 1.392
1.779AsnAsp: 1.779 ± 0.541
1.423AsnGlu: 1.423 ± 0.647
0.356AsnPhe: 0.356 ± 0.442
1.067AsnGly: 1.067 ± 0.569
1.067AsnHis: 1.067 ± 0.316
0.0AsnIle: 0.0 ± 0.0
3.202AsnLys: 3.202 ± 1.708
6.403AsnLeu: 6.403 ± 2.116
0.711AsnMet: 0.711 ± 0.334
1.779AsnAsn: 1.779 ± 1.332
1.423AsnPro: 1.423 ± 1.173
1.067AsnGln: 1.067 ± 0.569
1.779AsnArg: 1.779 ± 2.383
1.779AsnSer: 1.779 ± 0.702
3.913AsnThr: 3.913 ± 1.054
2.846AsnVal: 2.846 ± 0.801
0.356AsnTrp: 0.356 ± 0.442
1.779AsnTyr: 1.779 ± 0.702
0.0AsnXaa: 0.0 ± 0.0
Pro
2.846ProAla: 2.846 ± 1.146
0.356ProCys: 0.356 ± 0.19
2.49ProAsp: 2.49 ± 0.88
3.202ProGlu: 3.202 ± 0.992
1.779ProPhe: 1.779 ± 0.541
1.423ProGly: 1.423 ± 0.667
1.423ProHis: 1.423 ± 1.74
4.269ProIle: 4.269 ± 1.799
1.779ProLys: 1.779 ± 2.511
3.557ProLeu: 3.557 ± 1.427
0.711ProMet: 0.711 ± 0.883
1.779ProAsn: 1.779 ± 0.541
2.134ProPro: 2.134 ± 1.139
2.49ProGln: 2.49 ± 0.535
1.779ProArg: 1.779 ± 1.636
3.913ProSer: 3.913 ± 0.689
2.49ProThr: 2.49 ± 1.329
1.779ProVal: 1.779 ± 1.595
0.356ProTrp: 0.356 ± 0.19
1.779ProTyr: 1.779 ± 0.541
0.0ProXaa: 0.0 ± 0.0
Gln
2.846GlnAla: 2.846 ± 2.136
0.711GlnCys: 0.711 ± 0.38
1.779GlnAsp: 1.779 ± 0.541
3.202GlnGlu: 3.202 ± 0.866
1.779GlnPhe: 1.779 ± 1.155
1.423GlnGly: 1.423 ± 0.4
1.423GlnHis: 1.423 ± 0.759
1.067GlnIle: 1.067 ± 0.316
2.846GlnLys: 2.846 ± 1.845
4.98GlnLeu: 4.98 ± 1.471
0.711GlnMet: 0.711 ± 0.334
0.711GlnAsn: 0.711 ± 0.38
1.067GlnPro: 1.067 ± 0.569
1.067GlnGln: 1.067 ± 0.316
1.779GlnArg: 1.779 ± 1.155
4.269GlnSer: 4.269 ± 1.987
2.846GlnThr: 2.846 ± 1.06
3.913GlnVal: 3.913 ± 1.664
0.356GlnTrp: 0.356 ± 0.19
2.49GlnTyr: 2.49 ± 0.964
0.0GlnXaa: 0.0 ± 0.0
Arg
3.913ArgAla: 3.913 ± 1.243
1.067ArgCys: 1.067 ± 0.569
2.846ArgAsp: 2.846 ± 1.071
3.557ArgGlu: 3.557 ± 0.892
4.625ArgPhe: 4.625 ± 3.509
4.269ArgGly: 4.269 ± 1.987
2.49ArgHis: 2.49 ± 1.21
2.134ArgIle: 2.134 ± 0.705
5.692ArgLys: 5.692 ± 2.314
6.403ArgLeu: 6.403 ± 1.613
1.423ArgMet: 1.423 ± 2.267
2.49ArgAsn: 2.49 ± 1.329
2.134ArgPro: 2.134 ± 1.479
3.557ArgGln: 3.557 ± 1.004
6.403ArgArg: 6.403 ± 3.694
3.202ArgSer: 3.202 ± 0.744
3.202ArgThr: 3.202 ± 1.12
2.49ArgVal: 2.49 ± 1.329
0.711ArgTrp: 0.711 ± 0.334
2.134ArgTyr: 2.134 ± 0.705
0.0ArgXaa: 0.0 ± 0.0
Ser
4.269SerAla: 4.269 ± 2.091
1.423SerCys: 1.423 ± 0.4
4.625SerAsp: 4.625 ± 4.926
3.202SerGlu: 3.202 ± 0.866
4.98SerPhe: 4.98 ± 1.718
3.557SerGly: 3.557 ± 0.852
1.779SerHis: 1.779 ± 0.622
4.625SerIle: 4.625 ± 0.782
2.846SerLys: 2.846 ± 0.568
7.471SerLeu: 7.471 ± 2.38
2.49SerMet: 2.49 ± 1.329
3.557SerAsn: 3.557 ± 1.082
2.49SerPro: 2.49 ± 1.417
1.779SerGln: 1.779 ± 0.541
4.269SerArg: 4.269 ± 1.41
7.826SerSer: 7.826 ± 2.795
4.98SerThr: 4.98 ± 1.741
4.269SerVal: 4.269 ± 2.062
1.067SerTrp: 1.067 ± 0.316
2.134SerTyr: 2.134 ± 1.46
0.0SerXaa: 0.0 ± 0.0
Thr
3.202ThrAla: 3.202 ± 0.933
1.067ThrCys: 1.067 ± 0.316
2.846ThrAsp: 2.846 ± 0.62
4.625ThrGlu: 4.625 ± 0.866
2.846ThrPhe: 2.846 ± 1.518
2.134ThrGly: 2.134 ± 0.632
2.846ThrHis: 2.846 ± 1.518
3.913ThrIle: 3.913 ± 0.734
5.336ThrLys: 5.336 ± 2.484
7.826ThrLeu: 7.826 ± 1.787
0.711ThrMet: 0.711 ± 0.883
2.49ThrAsn: 2.49 ± 1.428
3.202ThrPro: 3.202 ± 1.228
2.134ThrGln: 2.134 ± 0.763
4.98ThrArg: 4.98 ± 0.645
3.913ThrSer: 3.913 ± 1.666
3.557ThrThr: 3.557 ± 1.082
2.846ThrVal: 2.846 ± 0.62
0.711ThrTrp: 0.711 ± 0.38
1.779ThrTyr: 1.779 ± 0.702
0.0ThrXaa: 0.0 ± 0.0
Val
3.913ValAla: 3.913 ± 1.143
1.423ValCys: 1.423 ± 0.4
1.067ValAsp: 1.067 ± 0.569
4.625ValGlu: 4.625 ± 1.193
3.202ValPhe: 3.202 ± 1.38
3.202ValGly: 3.202 ± 0.549
2.49ValHis: 2.49 ± 0.925
2.134ValIle: 2.134 ± 0.632
3.913ValLys: 3.913 ± 1.243
3.913ValLeu: 3.913 ± 1.15
1.067ValMet: 1.067 ± 0.569
1.423ValAsn: 1.423 ± 0.647
3.557ValPro: 3.557 ± 1.082
3.202ValGln: 3.202 ± 0.763
3.913ValArg: 3.913 ± 2.088
3.913ValSer: 3.913 ± 2.085
3.913ValThr: 3.913 ± 0.689
2.846ValVal: 2.846 ± 0.801
0.0ValTrp: 0.0 ± 0.0
2.134ValTyr: 2.134 ± 1.139
0.0ValXaa: 0.0 ± 0.0
Trp
1.423TrpAla: 1.423 ± 0.667
0.356TrpCys: 0.356 ± 0.19
0.356TrpAsp: 0.356 ± 0.19
0.356TrpGlu: 0.356 ± 0.19
0.711TrpPhe: 0.711 ± 1.295
0.711TrpGly: 0.711 ± 0.38
0.0TrpHis: 0.0 ± 0.0
0.356TrpIle: 0.356 ± 0.19
0.356TrpLys: 0.356 ± 0.19
0.711TrpLeu: 0.711 ± 0.38
0.356TrpMet: 0.356 ± 0.19
1.067TrpAsn: 1.067 ± 0.316
0.356TrpPro: 0.356 ± 0.442
0.0TrpGln: 0.0 ± 0.0
0.711TrpArg: 0.711 ± 0.38
0.356TrpSer: 0.356 ± 0.442
0.711TrpThr: 0.711 ± 0.38
0.356TrpVal: 0.356 ± 0.19
0.0TrpTrp: 0.0 ± 0.0
0.356TrpTyr: 0.356 ± 0.442
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.557TyrAla: 3.557 ± 1.004
2.49TyrCys: 2.49 ± 0.696
1.423TyrAsp: 1.423 ± 0.759
1.779TyrGlu: 1.779 ± 0.905
2.846TyrPhe: 2.846 ± 0.91
1.779TyrGly: 1.779 ± 1.087
1.423TyrHis: 1.423 ± 0.759
2.846TyrIle: 2.846 ± 0.91
2.49TyrLys: 2.49 ± 1.28
3.557TyrLeu: 3.557 ± 0.593
1.779TyrMet: 1.779 ± 2.463
2.134TyrAsn: 2.134 ± 1.139
1.423TyrPro: 1.423 ± 0.759
1.779TyrGln: 1.779 ± 1.087
2.134TyrArg: 2.134 ± 2.443
1.779TyrSer: 1.779 ± 0.622
2.846TyrThr: 2.846 ± 1.518
2.49TyrVal: 2.49 ± 0.88
1.067TyrTrp: 1.067 ± 1.221
0.711TyrTyr: 0.711 ± 0.38
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2812 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski