Amino acid dipepetide frequency for Epizootic hemorrhagic disease virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.642AlaAla: 5.642 ± 1.396
0.645AlaCys: 0.645 ± 0.199
1.773AlaAsp: 1.773 ± 0.509
4.997AlaGlu: 4.997 ± 0.767
2.74AlaPhe: 2.74 ± 0.849
3.546AlaGly: 3.546 ± 0.929
1.289AlaHis: 1.289 ± 0.395
5.158AlaIle: 5.158 ± 1.436
2.579AlaLys: 2.579 ± 0.429
5.803AlaLeu: 5.803 ± 1.047
2.095AlaMet: 2.095 ± 0.859
3.224AlaAsn: 3.224 ± 0.913
3.063AlaPro: 3.063 ± 0.748
3.063AlaGln: 3.063 ± 0.876
4.03AlaArg: 4.03 ± 0.543
2.901AlaSer: 2.901 ± 0.718
4.191AlaThr: 4.191 ± 0.462
5.642AlaVal: 5.642 ± 0.627
0.645AlaTrp: 0.645 ± 0.363
2.901AlaTyr: 2.901 ± 0.904
0.0AlaXaa: 0.0 ± 0.0
Cys
0.484CysAla: 0.484 ± 0.266
0.161CysCys: 0.161 ± 0.124
0.967CysAsp: 0.967 ± 0.625
0.484CysGlu: 0.484 ± 0.359
0.967CysPhe: 0.967 ± 0.644
0.967CysGly: 0.967 ± 0.25
0.161CysHis: 0.161 ± 0.124
0.161CysIle: 0.161 ± 0.124
0.967CysLys: 0.967 ± 0.33
0.967CysLeu: 0.967 ± 0.217
0.161CysMet: 0.161 ± 0.124
0.806CysAsn: 0.806 ± 0.346
0.322CysPro: 0.322 ± 0.204
0.322CysGln: 0.322 ± 0.262
0.322CysArg: 0.322 ± 0.204
1.451CysSer: 1.451 ± 0.461
0.0CysThr: 0.0 ± 0.0
0.645CysVal: 0.645 ± 0.262
0.0CysTrp: 0.0 ± 0.0
0.484CysTyr: 0.484 ± 0.212
0.0CysXaa: 0.0 ± 0.0
Asp
3.707AspAla: 3.707 ± 0.575
0.484AspCys: 0.484 ± 0.184
4.03AspAsp: 4.03 ± 0.825
5.964AspGlu: 5.964 ± 1.466
1.773AspPhe: 1.773 ± 0.471
4.191AspGly: 4.191 ± 0.904
1.612AspHis: 1.612 ± 0.391
4.03AspIle: 4.03 ± 0.484
2.418AspLys: 2.418 ± 0.653
5.803AspLeu: 5.803 ± 1.111
1.773AspMet: 1.773 ± 0.653
1.451AspAsn: 1.451 ± 0.607
2.257AspPro: 2.257 ± 0.646
1.612AspGln: 1.612 ± 0.402
3.385AspArg: 3.385 ± 0.513
3.063AspSer: 3.063 ± 0.784
1.773AspThr: 1.773 ± 0.513
6.931AspVal: 6.931 ± 1.262
0.645AspTrp: 0.645 ± 0.432
1.934AspTyr: 1.934 ± 0.491
0.0AspXaa: 0.0 ± 0.0
Glu
5.642GluAla: 5.642 ± 0.744
0.645GluCys: 0.645 ± 0.322
4.352GluAsp: 4.352 ± 1.184
8.543GluGlu: 8.543 ± 1.324
2.095GluPhe: 2.095 ± 0.537
4.513GluGly: 4.513 ± 0.768
1.612GluHis: 1.612 ± 0.57
5.158GluIle: 5.158 ± 1.004
5.48GluLys: 5.48 ± 1.513
7.092GluLeu: 7.092 ± 0.703
2.257GluMet: 2.257 ± 0.488
2.74GluAsn: 2.74 ± 0.446
3.224GluPro: 3.224 ± 0.824
2.901GluGln: 2.901 ± 0.852
7.092GluArg: 7.092 ± 1.269
3.868GluSer: 3.868 ± 0.933
3.868GluThr: 3.868 ± 0.622
4.352GluVal: 4.352 ± 0.751
1.451GluTrp: 1.451 ± 0.631
2.74GluTyr: 2.74 ± 0.615
0.0GluXaa: 0.0 ± 0.0
Phe
2.095PheAla: 2.095 ± 0.333
0.645PheCys: 0.645 ± 0.241
2.74PheAsp: 2.74 ± 0.376
2.74PheGlu: 2.74 ± 0.477
1.451PhePhe: 1.451 ± 0.485
2.579PheGly: 2.579 ± 0.38
0.645PheHis: 0.645 ± 0.202
3.546PheIle: 3.546 ± 0.759
2.257PheLys: 2.257 ± 0.551
3.385PheLeu: 3.385 ± 0.501
0.967PheMet: 0.967 ± 0.354
1.934PheAsn: 1.934 ± 0.487
1.451PhePro: 1.451 ± 0.458
1.289PheGln: 1.289 ± 0.37
2.418PheArg: 2.418 ± 0.629
2.579PheSer: 2.579 ± 0.448
2.095PheThr: 2.095 ± 0.4
1.773PheVal: 1.773 ± 0.501
0.322PheTrp: 0.322 ± 0.192
1.934PheTyr: 1.934 ± 0.682
0.0PheXaa: 0.0 ± 0.0
Gly
4.191GlyAla: 4.191 ± 0.907
0.322GlyCys: 0.322 ± 0.327
4.513GlyAsp: 4.513 ± 0.897
5.48GlyGlu: 5.48 ± 1.066
2.418GlyPhe: 2.418 ± 0.6
4.191GlyGly: 4.191 ± 1.686
1.128GlyHis: 1.128 ± 0.34
3.385GlyIle: 3.385 ± 0.702
4.513GlyLys: 4.513 ± 1.218
3.385GlyLeu: 3.385 ± 0.998
1.612GlyMet: 1.612 ± 0.47
1.773GlyAsn: 1.773 ± 0.484
2.418GlyPro: 2.418 ± 0.664
1.612GlyGln: 1.612 ± 0.763
3.224GlyArg: 3.224 ± 0.845
3.063GlySer: 3.063 ± 0.594
3.385GlyThr: 3.385 ± 0.821
5.158GlyVal: 5.158 ± 1.043
0.484GlyTrp: 0.484 ± 0.273
2.901GlyTyr: 2.901 ± 0.646
0.0GlyXaa: 0.0 ± 0.0
His
1.289HisAla: 1.289 ± 0.432
0.322HisCys: 0.322 ± 0.214
0.806HisAsp: 0.806 ± 0.279
0.967HisGlu: 0.967 ± 0.282
0.645HisPhe: 0.645 ± 0.343
1.451HisGly: 1.451 ± 0.652
0.484HisHis: 0.484 ± 0.2
2.095HisIle: 2.095 ± 0.627
0.645HisLys: 0.645 ± 0.283
2.257HisLeu: 2.257 ± 0.632
0.645HisMet: 0.645 ± 0.252
0.806HisAsn: 0.806 ± 0.286
0.967HisPro: 0.967 ± 0.317
1.451HisGln: 1.451 ± 0.51
1.289HisArg: 1.289 ± 0.649
0.806HisSer: 0.806 ± 0.39
0.967HisThr: 0.967 ± 0.267
1.612HisVal: 1.612 ± 0.433
0.161HisTrp: 0.161 ± 0.165
0.806HisTyr: 0.806 ± 0.274
0.0HisXaa: 0.0 ± 0.0
Ile
5.964IleAla: 5.964 ± 0.776
0.484IleCys: 0.484 ± 0.412
4.674IleAsp: 4.674 ± 0.9
4.836IleGlu: 4.836 ± 0.874
1.451IlePhe: 1.451 ± 0.418
4.513IleGly: 4.513 ± 0.709
0.967IleHis: 0.967 ± 0.353
4.513IleIle: 4.513 ± 0.614
5.803IleLys: 5.803 ± 0.604
6.77IleLeu: 6.77 ± 1.224
3.707IleMet: 3.707 ± 0.523
3.707IleAsn: 3.707 ± 0.639
3.385IlePro: 3.385 ± 0.569
3.868IleGln: 3.868 ± 0.543
4.352IleArg: 4.352 ± 0.587
3.063IleSer: 3.063 ± 0.691
4.191IleThr: 4.191 ± 1.082
3.224IleVal: 3.224 ± 0.637
1.289IleTrp: 1.289 ± 0.419
3.868IleTyr: 3.868 ± 0.584
0.0IleXaa: 0.0 ± 0.0
Lys
3.063LysAla: 3.063 ± 0.686
0.645LysCys: 0.645 ± 0.266
3.868LysAsp: 3.868 ± 0.505
5.158LysGlu: 5.158 ± 1.219
2.901LysPhe: 2.901 ± 0.355
3.707LysGly: 3.707 ± 0.778
1.289LysHis: 1.289 ± 0.389
5.964LysIle: 5.964 ± 0.856
4.836LysLys: 4.836 ± 1.318
4.513LysLeu: 4.513 ± 1.119
2.257LysMet: 2.257 ± 0.601
2.257LysAsn: 2.257 ± 0.573
1.773LysPro: 1.773 ± 0.475
2.418LysGln: 2.418 ± 0.825
5.48LysArg: 5.48 ± 1.178
1.934LysSer: 1.934 ± 0.528
3.063LysThr: 3.063 ± 0.41
4.352LysVal: 4.352 ± 0.659
0.806LysTrp: 0.806 ± 0.444
2.418LysTyr: 2.418 ± 0.643
0.0LysXaa: 0.0 ± 0.0
Leu
4.997LeuAla: 4.997 ± 1.422
0.645LeuCys: 0.645 ± 0.316
6.286LeuAsp: 6.286 ± 0.748
5.803LeuGlu: 5.803 ± 0.967
3.546LeuPhe: 3.546 ± 0.98
3.546LeuGly: 3.546 ± 0.487
1.773LeuHis: 1.773 ± 0.542
5.158LeuIle: 5.158 ± 0.535
5.158LeuLys: 5.158 ± 0.917
5.803LeuLeu: 5.803 ± 0.703
2.579LeuMet: 2.579 ± 0.574
2.418LeuAsn: 2.418 ± 0.691
5.158LeuPro: 5.158 ± 0.632
2.579LeuGln: 2.579 ± 0.586
8.543LeuArg: 8.543 ± 0.762
4.836LeuSer: 4.836 ± 0.84
4.352LeuThr: 4.352 ± 0.919
4.191LeuVal: 4.191 ± 0.538
0.806LeuTrp: 0.806 ± 0.322
2.257LeuTyr: 2.257 ± 0.587
0.0LeuXaa: 0.0 ± 0.0
Met
2.901MetAla: 2.901 ± 0.775
0.806MetCys: 0.806 ± 0.317
1.128MetAsp: 1.128 ± 0.499
2.095MetGlu: 2.095 ± 0.641
1.773MetPhe: 1.773 ± 0.675
1.289MetGly: 1.289 ± 0.447
0.806MetHis: 0.806 ± 0.512
3.063MetIle: 3.063 ± 0.762
2.257MetLys: 2.257 ± 0.746
4.352MetLeu: 4.352 ± 0.735
1.128MetMet: 1.128 ± 0.373
1.773MetAsn: 1.773 ± 0.487
0.967MetPro: 0.967 ± 0.322
1.612MetGln: 1.612 ± 0.509
3.224MetArg: 3.224 ± 0.478
2.257MetSer: 2.257 ± 0.622
1.128MetThr: 1.128 ± 0.317
2.257MetVal: 2.257 ± 0.535
0.645MetTrp: 0.645 ± 0.333
0.806MetTyr: 0.806 ± 0.533
0.0MetXaa: 0.0 ± 0.0
Asn
2.095AsnAla: 2.095 ± 0.743
0.322AsnCys: 0.322 ± 0.199
2.901AsnAsp: 2.901 ± 0.838
4.836AsnGlu: 4.836 ± 0.87
1.773AsnPhe: 1.773 ± 0.482
2.257AsnGly: 2.257 ± 0.525
0.484AsnHis: 0.484 ± 0.211
3.868AsnIle: 3.868 ± 0.77
1.773AsnLys: 1.773 ± 0.345
2.901AsnLeu: 2.901 ± 0.668
1.934AsnMet: 1.934 ± 0.797
0.967AsnAsn: 0.967 ± 0.436
1.451AsnPro: 1.451 ± 0.569
1.128AsnGln: 1.128 ± 0.416
2.418AsnArg: 2.418 ± 0.624
2.095AsnSer: 2.095 ± 0.805
1.612AsnThr: 1.612 ± 0.503
3.063AsnVal: 3.063 ± 0.838
0.0AsnTrp: 0.0 ± 0.0
1.289AsnTyr: 1.289 ± 0.436
0.0AsnXaa: 0.0 ± 0.0
Pro
1.612ProAla: 1.612 ± 0.814
0.0ProCys: 0.0 ± 0.0
3.385ProAsp: 3.385 ± 0.677
2.74ProGlu: 2.74 ± 0.366
1.934ProPhe: 1.934 ± 0.349
2.257ProGly: 2.257 ± 0.436
0.967ProHis: 0.967 ± 0.319
2.901ProIle: 2.901 ± 0.678
2.257ProLys: 2.257 ± 0.586
2.579ProLeu: 2.579 ± 0.515
1.289ProMet: 1.289 ± 0.68
0.645ProAsn: 0.645 ± 0.384
1.773ProPro: 1.773 ± 0.661
1.612ProGln: 1.612 ± 0.487
2.74ProArg: 2.74 ± 0.813
1.612ProSer: 1.612 ± 0.456
2.901ProThr: 2.901 ± 0.715
3.546ProVal: 3.546 ± 0.456
0.484ProTrp: 0.484 ± 0.245
2.579ProTyr: 2.579 ± 0.601
0.0ProXaa: 0.0 ± 0.0
Gln
2.901GlnAla: 2.901 ± 0.723
0.484GlnCys: 0.484 ± 0.236
1.128GlnAsp: 1.128 ± 0.308
3.063GlnGlu: 3.063 ± 0.84
0.806GlnPhe: 0.806 ± 0.355
1.934GlnGly: 1.934 ± 0.616
0.645GlnHis: 0.645 ± 0.307
4.191GlnIle: 4.191 ± 0.887
4.352GlnLys: 4.352 ± 1.06
2.418GlnLeu: 2.418 ± 0.664
1.289GlnMet: 1.289 ± 0.63
2.901GlnAsn: 2.901 ± 0.989
1.451GlnPro: 1.451 ± 0.567
2.095GlnGln: 2.095 ± 0.585
2.901GlnArg: 2.901 ± 0.585
1.773GlnSer: 1.773 ± 0.426
2.418GlnThr: 2.418 ± 0.573
2.74GlnVal: 2.74 ± 0.416
0.161GlnTrp: 0.161 ± 0.164
1.451GlnTyr: 1.451 ± 0.343
0.0GlnXaa: 0.0 ± 0.0
Arg
5.319ArgAla: 5.319 ± 0.786
0.645ArgCys: 0.645 ± 0.226
2.418ArgAsp: 2.418 ± 0.583
5.803ArgGlu: 5.803 ± 1.014
4.191ArgPhe: 4.191 ± 0.753
3.868ArgGly: 3.868 ± 0.813
1.289ArgHis: 1.289 ± 0.494
6.931ArgIle: 6.931 ± 0.985
4.191ArgLys: 4.191 ± 0.636
4.352ArgLeu: 4.352 ± 0.811
2.74ArgMet: 2.74 ± 0.561
3.385ArgAsn: 3.385 ± 0.514
1.773ArgPro: 1.773 ± 0.355
2.901ArgGln: 2.901 ± 0.702
5.158ArgArg: 5.158 ± 0.639
3.385ArgSer: 3.385 ± 0.748
3.707ArgThr: 3.707 ± 0.577
5.319ArgVal: 5.319 ± 1.049
1.451ArgTrp: 1.451 ± 0.592
2.257ArgTyr: 2.257 ± 0.326
0.0ArgXaa: 0.0 ± 0.0
Ser
3.063SerAla: 3.063 ± 1.018
0.806SerCys: 0.806 ± 0.383
3.868SerAsp: 3.868 ± 0.621
4.997SerGlu: 4.997 ± 1.215
1.934SerPhe: 1.934 ± 0.488
3.385SerGly: 3.385 ± 0.51
0.806SerHis: 0.806 ± 0.348
3.385SerIle: 3.385 ± 1.329
2.418SerLys: 2.418 ± 0.697
4.352SerLeu: 4.352 ± 0.659
2.579SerMet: 2.579 ± 0.396
1.773SerAsn: 1.773 ± 0.526
1.934SerPro: 1.934 ± 0.378
1.934SerGln: 1.934 ± 0.388
2.74SerArg: 2.74 ± 0.672
2.579SerSer: 2.579 ± 0.762
2.901SerThr: 2.901 ± 0.528
3.546SerVal: 3.546 ± 0.635
0.645SerTrp: 0.645 ± 0.334
2.579SerTyr: 2.579 ± 0.598
0.0SerXaa: 0.0 ± 0.0
Thr
3.063ThrAla: 3.063 ± 0.832
0.645ThrCys: 0.645 ± 0.254
1.773ThrAsp: 1.773 ± 0.557
3.707ThrGlu: 3.707 ± 0.898
2.418ThrPhe: 2.418 ± 0.379
3.063ThrGly: 3.063 ± 0.884
1.289ThrHis: 1.289 ± 0.562
3.868ThrIle: 3.868 ± 0.689
2.901ThrLys: 2.901 ± 0.843
3.546ThrLeu: 3.546 ± 1.217
1.934ThrMet: 1.934 ± 0.748
1.451ThrAsn: 1.451 ± 0.405
1.934ThrPro: 1.934 ± 0.495
3.063ThrGln: 3.063 ± 0.705
3.868ThrArg: 3.868 ± 1.301
3.546ThrSer: 3.546 ± 0.641
2.74ThrThr: 2.74 ± 0.592
2.418ThrVal: 2.418 ± 0.636
1.128ThrTrp: 1.128 ± 0.215
1.612ThrTyr: 1.612 ± 0.592
0.0ThrXaa: 0.0 ± 0.0
Val
4.513ValAla: 4.513 ± 0.747
0.967ValCys: 0.967 ± 0.367
3.868ValAsp: 3.868 ± 0.645
4.352ValGlu: 4.352 ± 0.709
2.257ValPhe: 2.257 ± 0.672
4.03ValGly: 4.03 ± 0.529
1.451ValHis: 1.451 ± 0.779
4.513ValIle: 4.513 ± 0.643
3.868ValLys: 3.868 ± 1.179
6.447ValLeu: 6.447 ± 0.751
4.03ValMet: 4.03 ± 0.907
2.257ValAsn: 2.257 ± 0.727
3.224ValPro: 3.224 ± 0.71
4.191ValGln: 4.191 ± 0.76
4.513ValArg: 4.513 ± 1.087
4.352ValSer: 4.352 ± 0.816
3.063ValThr: 3.063 ± 0.714
3.385ValVal: 3.385 ± 0.585
0.645ValTrp: 0.645 ± 0.373
2.257ValTyr: 2.257 ± 0.444
0.0ValXaa: 0.0 ± 0.0
Trp
0.484TrpAla: 0.484 ± 0.209
0.0TrpCys: 0.0 ± 0.0
0.967TrpAsp: 0.967 ± 0.365
1.289TrpGlu: 1.289 ± 0.58
0.484TrpPhe: 0.484 ± 0.204
0.645TrpGly: 0.645 ± 0.231
0.967TrpHis: 0.967 ± 0.402
0.645TrpIle: 0.645 ± 0.395
0.967TrpLys: 0.967 ± 0.291
0.967TrpLeu: 0.967 ± 0.412
0.161TrpMet: 0.161 ± 0.163
0.645TrpAsn: 0.645 ± 0.365
0.0TrpPro: 0.0 ± 0.0
0.484TrpGln: 0.484 ± 0.332
0.806TrpArg: 0.806 ± 0.407
0.645TrpSer: 0.645 ± 0.319
0.322TrpThr: 0.322 ± 0.262
0.645TrpVal: 0.645 ± 0.299
0.161TrpTrp: 0.161 ± 0.159
0.645TrpTyr: 0.645 ± 0.329
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.901TyrAla: 2.901 ± 0.924
1.128TyrCys: 1.128 ± 0.303
3.385TyrAsp: 3.385 ± 1.198
1.773TyrGlu: 1.773 ± 0.41
1.289TyrPhe: 1.289 ± 0.422
3.224TyrGly: 3.224 ± 0.872
0.806TyrHis: 0.806 ± 0.475
2.095TyrIle: 2.095 ± 0.509
3.063TyrLys: 3.063 ± 0.577
2.74TyrLeu: 2.74 ± 0.633
0.967TyrMet: 0.967 ± 0.336
2.257TyrAsn: 2.257 ± 0.53
1.289TyrPro: 1.289 ± 0.429
0.967TyrGln: 0.967 ± 0.376
2.579TyrArg: 2.579 ± 0.428
2.418TyrSer: 2.418 ± 0.843
1.289TyrThr: 1.289 ± 0.458
3.546TyrVal: 3.546 ± 0.551
0.0TyrTrp: 0.0 ± 0.0
1.128TyrTyr: 1.128 ± 0.448
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6205 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski