Amino acid dipepetide frequency for Epizootic hemorrhagic disease virus (serotype 1 / strain New Jersey)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.481AlaAla: 5.481 ± 1.402
0.645AlaCys: 0.645 ± 0.218
1.935AlaAsp: 1.935 ± 0.375
4.675AlaGlu: 4.675 ± 0.927
2.741AlaPhe: 2.741 ± 0.842
2.902AlaGly: 2.902 ± 0.615
1.612AlaHis: 1.612 ± 0.371
4.836AlaIle: 4.836 ± 1.027
2.096AlaLys: 2.096 ± 0.449
6.61AlaLeu: 6.61 ± 1.149
2.579AlaMet: 2.579 ± 0.889
2.902AlaAsn: 2.902 ± 1.258
3.063AlaPro: 3.063 ± 0.848
3.385AlaGln: 3.385 ± 0.472
3.547AlaArg: 3.547 ± 0.657
2.257AlaSer: 2.257 ± 0.691
4.192AlaThr: 4.192 ± 0.5
5.159AlaVal: 5.159 ± 0.911
0.645AlaTrp: 0.645 ± 0.413
2.741AlaTyr: 2.741 ± 0.79
0.0AlaXaa: 0.0 ± 0.0
Cys
0.806CysAla: 0.806 ± 0.375
0.322CysCys: 0.322 ± 0.172
0.806CysAsp: 0.806 ± 0.413
0.484CysGlu: 0.484 ± 0.332
1.29CysPhe: 1.29 ± 0.556
0.806CysGly: 0.806 ± 0.269
0.161CysHis: 0.161 ± 0.141
0.484CysIle: 0.484 ± 0.215
0.645CysLys: 0.645 ± 0.294
0.967CysLeu: 0.967 ± 0.245
0.161CysMet: 0.161 ± 0.141
0.645CysAsn: 0.645 ± 0.351
0.161CysPro: 0.161 ± 0.148
0.484CysGln: 0.484 ± 0.252
0.484CysArg: 0.484 ± 0.235
1.451CysSer: 1.451 ± 0.422
0.161CysThr: 0.161 ± 0.151
0.806CysVal: 0.806 ± 0.455
0.0CysTrp: 0.0 ± 0.0
0.645CysTyr: 0.645 ± 0.301
0.0CysXaa: 0.0 ± 0.0
Asp
3.063AspAla: 3.063 ± 0.517
0.484AspCys: 0.484 ± 0.17
4.03AspAsp: 4.03 ± 0.861
6.771AspGlu: 6.771 ± 1.231
2.257AspPhe: 2.257 ± 0.605
3.224AspGly: 3.224 ± 0.574
1.29AspHis: 1.29 ± 0.456
4.675AspIle: 4.675 ± 0.421
3.547AspLys: 3.547 ± 0.992
5.804AspLeu: 5.804 ± 1.185
1.612AspMet: 1.612 ± 0.676
1.128AspAsn: 1.128 ± 0.436
2.418AspPro: 2.418 ± 0.685
2.096AspGln: 2.096 ± 0.414
4.675AspArg: 4.675 ± 1.098
2.418AspSer: 2.418 ± 0.592
2.096AspThr: 2.096 ± 0.636
6.126AspVal: 6.126 ± 1.103
0.645AspTrp: 0.645 ± 0.429
2.096AspTyr: 2.096 ± 0.567
0.0AspXaa: 0.0 ± 0.0
Glu
5.642GluAla: 5.642 ± 0.801
0.645GluCys: 0.645 ± 0.334
4.675GluAsp: 4.675 ± 1.231
8.383GluGlu: 8.383 ± 1.328
2.257GluPhe: 2.257 ± 0.552
4.353GluGly: 4.353 ± 0.783
1.29GluHis: 1.29 ± 0.446
5.32GluIle: 5.32 ± 1.028
5.32GluLys: 5.32 ± 1.497
7.738GluLeu: 7.738 ± 0.746
2.741GluMet: 2.741 ± 0.3
2.902GluAsn: 2.902 ± 0.701
2.902GluPro: 2.902 ± 0.952
2.902GluGln: 2.902 ± 0.809
5.965GluArg: 5.965 ± 1.481
4.192GluSer: 4.192 ± 1.16
3.547GluThr: 3.547 ± 0.64
4.514GluVal: 4.514 ± 1.058
1.451GluTrp: 1.451 ± 0.714
2.902GluTyr: 2.902 ± 0.513
0.0GluXaa: 0.0 ± 0.0
Phe
1.935PheAla: 1.935 ± 0.383
0.645PheCys: 0.645 ± 0.239
2.741PheAsp: 2.741 ± 0.433
2.741PheGlu: 2.741 ± 0.44
0.967PhePhe: 0.967 ± 0.274
2.741PheGly: 2.741 ± 0.572
0.806PheHis: 0.806 ± 0.199
3.063PheIle: 3.063 ± 0.706
2.418PheLys: 2.418 ± 0.56
3.224PheLeu: 3.224 ± 0.323
0.967PheMet: 0.967 ± 0.405
1.612PheAsn: 1.612 ± 0.426
1.451PhePro: 1.451 ± 0.447
1.128PheGln: 1.128 ± 0.399
2.741PheArg: 2.741 ± 0.584
2.902PheSer: 2.902 ± 0.53
1.773PheThr: 1.773 ± 0.41
1.773PheVal: 1.773 ± 0.499
0.161PheTrp: 0.161 ± 0.161
2.418PheTyr: 2.418 ± 0.541
0.0PheXaa: 0.0 ± 0.0
Gly
4.514GlyAla: 4.514 ± 0.86
0.645GlyCys: 0.645 ± 0.373
4.514GlyAsp: 4.514 ± 0.925
4.353GlyGlu: 4.353 ± 0.947
2.257GlyPhe: 2.257 ± 0.393
4.353GlyGly: 4.353 ± 1.653
1.29GlyHis: 1.29 ± 0.333
3.385GlyIle: 3.385 ± 0.705
3.224GlyLys: 3.224 ± 0.653
3.547GlyLeu: 3.547 ± 0.926
1.773GlyMet: 1.773 ± 0.339
1.935GlyAsn: 1.935 ± 0.507
2.257GlyPro: 2.257 ± 0.661
1.935GlyGln: 1.935 ± 0.957
3.224GlyArg: 3.224 ± 0.75
2.418GlySer: 2.418 ± 0.408
3.385GlyThr: 3.385 ± 0.749
4.998GlyVal: 4.998 ± 1.093
0.645GlyTrp: 0.645 ± 0.282
3.063GlyTyr: 3.063 ± 0.553
0.0GlyXaa: 0.0 ± 0.0
His
1.773HisAla: 1.773 ± 0.364
0.322HisCys: 0.322 ± 0.206
0.806HisAsp: 0.806 ± 0.323
0.967HisGlu: 0.967 ± 0.317
0.484HisPhe: 0.484 ± 0.25
1.29HisGly: 1.29 ± 0.622
0.806HisHis: 0.806 ± 0.32
1.612HisIle: 1.612 ± 0.392
0.806HisLys: 0.806 ± 0.302
2.579HisLeu: 2.579 ± 0.689
0.645HisMet: 0.645 ± 0.288
0.806HisAsn: 0.806 ± 0.311
1.29HisPro: 1.29 ± 0.539
1.29HisGln: 1.29 ± 0.483
1.773HisArg: 1.773 ± 0.648
0.806HisSer: 0.806 ± 0.427
1.128HisThr: 1.128 ± 0.339
1.29HisVal: 1.29 ± 0.506
0.161HisTrp: 0.161 ± 0.159
0.806HisTyr: 0.806 ± 0.329
0.0HisXaa: 0.0 ± 0.0
Ile
5.804IleAla: 5.804 ± 0.711
0.322IleCys: 0.322 ± 0.197
4.675IleAsp: 4.675 ± 0.797
5.159IleGlu: 5.159 ± 0.598
2.579IlePhe: 2.579 ± 1.211
3.708IleGly: 3.708 ± 0.354
0.967IleHis: 0.967 ± 0.354
5.804IleIle: 5.804 ± 1.201
5.481IleLys: 5.481 ± 0.804
6.61IleLeu: 6.61 ± 1.119
2.579IleMet: 2.579 ± 0.82
3.547IleAsn: 3.547 ± 0.614
3.063IlePro: 3.063 ± 0.601
4.998IleGln: 4.998 ± 1.161
4.514IleArg: 4.514 ± 0.623
3.224IleSer: 3.224 ± 0.599
3.547IleThr: 3.547 ± 0.912
3.063IleVal: 3.063 ± 0.664
1.128IleTrp: 1.128 ± 0.498
3.708IleTyr: 3.708 ± 0.628
0.0IleXaa: 0.0 ± 0.0
Lys
3.063LysAla: 3.063 ± 0.688
0.484LysCys: 0.484 ± 0.265
4.353LysAsp: 4.353 ± 0.642
6.448LysGlu: 6.448 ± 1.634
2.902LysPhe: 2.902 ± 0.476
3.708LysGly: 3.708 ± 0.858
1.451LysHis: 1.451 ± 0.577
5.965LysIle: 5.965 ± 0.817
5.32LysLys: 5.32 ± 1.418
4.514LysLeu: 4.514 ± 1.072
2.257LysMet: 2.257 ± 0.818
1.935LysAsn: 1.935 ± 0.493
1.612LysPro: 1.612 ± 0.502
1.128LysGln: 1.128 ± 0.457
6.126LysArg: 6.126 ± 1.111
1.612LysSer: 1.612 ± 0.672
2.902LysThr: 2.902 ± 0.481
4.03LysVal: 4.03 ± 0.657
0.806LysTrp: 0.806 ± 0.451
1.935LysTyr: 1.935 ± 0.43
0.0LysXaa: 0.0 ± 0.0
Leu
5.481LeuAla: 5.481 ± 1.571
0.806LeuCys: 0.806 ± 0.365
5.965LeuAsp: 5.965 ± 0.968
5.642LeuGlu: 5.642 ± 1.169
3.224LeuPhe: 3.224 ± 1.003
3.708LeuGly: 3.708 ± 0.637
1.612LeuHis: 1.612 ± 0.422
5.481LeuIle: 5.481 ± 0.664
5.804LeuLys: 5.804 ± 1.184
5.804LeuLeu: 5.804 ± 0.709
2.579LeuMet: 2.579 ± 0.554
3.385LeuAsn: 3.385 ± 0.857
5.32LeuPro: 5.32 ± 0.821
2.902LeuGln: 2.902 ± 0.691
8.061LeuArg: 8.061 ± 0.651
5.481LeuSer: 5.481 ± 0.971
4.998LeuThr: 4.998 ± 1.036
3.708LeuVal: 3.708 ± 0.504
0.806LeuTrp: 0.806 ± 0.318
2.579LeuTyr: 2.579 ± 0.731
0.0LeuXaa: 0.0 ± 0.0
Met
2.579MetAla: 2.579 ± 0.975
0.806MetCys: 0.806 ± 0.279
1.128MetAsp: 1.128 ± 0.583
2.257MetGlu: 2.257 ± 0.478
2.096MetPhe: 2.096 ± 0.615
1.29MetGly: 1.29 ± 0.388
0.967MetHis: 0.967 ± 0.5
2.579MetIle: 2.579 ± 0.664
2.579MetLys: 2.579 ± 0.711
3.708MetLeu: 3.708 ± 1.17
1.128MetMet: 1.128 ± 0.322
1.935MetAsn: 1.935 ± 0.603
0.645MetPro: 0.645 ± 0.306
1.612MetGln: 1.612 ± 0.607
3.385MetArg: 3.385 ± 0.398
2.096MetSer: 2.096 ± 0.603
0.967MetThr: 0.967 ± 0.297
2.579MetVal: 2.579 ± 0.564
0.645MetTrp: 0.645 ± 0.32
0.967MetTyr: 0.967 ± 0.47
0.0MetXaa: 0.0 ± 0.0
Asn
1.612AsnAla: 1.612 ± 0.845
0.484AsnCys: 0.484 ± 0.306
3.063AsnAsp: 3.063 ± 0.851
4.192AsnGlu: 4.192 ± 0.631
1.451AsnPhe: 1.451 ± 0.382
3.063AsnGly: 3.063 ± 0.624
0.484AsnHis: 0.484 ± 0.161
3.869AsnIle: 3.869 ± 0.764
1.451AsnLys: 1.451 ± 0.353
3.708AsnLeu: 3.708 ± 0.82
1.935AsnMet: 1.935 ± 0.892
0.967AsnAsn: 0.967 ± 0.426
1.451AsnPro: 1.451 ± 0.494
1.128AsnGln: 1.128 ± 0.458
2.418AsnArg: 2.418 ± 0.545
2.096AsnSer: 2.096 ± 0.779
1.451AsnThr: 1.451 ± 0.632
3.869AsnVal: 3.869 ± 1.272
0.0AsnTrp: 0.0 ± 0.0
0.967AsnTyr: 0.967 ± 0.283
0.0AsnXaa: 0.0 ± 0.0
Pro
1.773ProAla: 1.773 ± 0.93
0.484ProCys: 0.484 ± 0.454
3.385ProAsp: 3.385 ± 0.84
2.902ProGlu: 2.902 ± 0.506
1.451ProPhe: 1.451 ± 0.534
2.096ProGly: 2.096 ± 0.353
1.29ProHis: 1.29 ± 0.353
2.579ProIle: 2.579 ± 0.515
2.096ProLys: 2.096 ± 0.7
2.741ProLeu: 2.741 ± 0.535
1.128ProMet: 1.128 ± 0.701
0.967ProAsn: 0.967 ± 0.41
1.773ProPro: 1.773 ± 0.683
1.451ProGln: 1.451 ± 0.374
2.741ProArg: 2.741 ± 0.86
1.612ProSer: 1.612 ± 0.631
2.902ProThr: 2.902 ± 0.894
3.547ProVal: 3.547 ± 0.436
0.484ProTrp: 0.484 ± 0.264
2.096ProTyr: 2.096 ± 0.689
0.0ProXaa: 0.0 ± 0.0
Gln
2.579GlnAla: 2.579 ± 0.826
0.484GlnCys: 0.484 ± 0.253
0.967GlnAsp: 0.967 ± 0.295
3.547GlnGlu: 3.547 ± 0.878
1.29GlnPhe: 1.29 ± 0.337
2.902GlnGly: 2.902 ± 0.754
0.484GlnHis: 0.484 ± 0.283
4.03GlnIle: 4.03 ± 0.739
4.03GlnLys: 4.03 ± 0.951
2.741GlnLeu: 2.741 ± 0.606
1.773GlnMet: 1.773 ± 0.514
2.579GlnAsn: 2.579 ± 0.95
1.451GlnPro: 1.451 ± 0.568
1.451GlnGln: 1.451 ± 0.444
3.224GlnArg: 3.224 ± 0.72
1.773GlnSer: 1.773 ± 0.52
2.096GlnThr: 2.096 ± 0.76
2.257GlnVal: 2.257 ± 0.481
0.161GlnTrp: 0.161 ± 0.148
1.128GlnTyr: 1.128 ± 0.459
0.0GlnXaa: 0.0 ± 0.0
Arg
4.675ArgAla: 4.675 ± 0.639
0.806ArgCys: 0.806 ± 0.18
3.224ArgAsp: 3.224 ± 0.542
5.481ArgGlu: 5.481 ± 1.013
4.03ArgPhe: 4.03 ± 0.754
3.385ArgGly: 3.385 ± 0.81
1.451ArgHis: 1.451 ± 0.483
5.965ArgIle: 5.965 ± 1.195
4.192ArgLys: 4.192 ± 0.627
4.836ArgLeu: 4.836 ± 1.055
2.257ArgMet: 2.257 ± 0.734
3.547ArgAsn: 3.547 ± 0.639
2.257ArgPro: 2.257 ± 0.276
3.224ArgGln: 3.224 ± 0.69
5.159ArgArg: 5.159 ± 0.461
3.385ArgSer: 3.385 ± 0.638
4.03ArgThr: 4.03 ± 0.607
5.804ArgVal: 5.804 ± 0.979
1.451ArgTrp: 1.451 ± 0.621
2.741ArgTyr: 2.741 ± 0.344
0.0ArgXaa: 0.0 ± 0.0
Ser
3.063SerAla: 3.063 ± 0.829
0.645SerCys: 0.645 ± 0.398
3.547SerAsp: 3.547 ± 0.688
5.481SerGlu: 5.481 ± 1.096
1.612SerPhe: 1.612 ± 0.418
3.224SerGly: 3.224 ± 0.535
1.29SerHis: 1.29 ± 0.483
3.708SerIle: 3.708 ± 1.212
2.741SerLys: 2.741 ± 0.757
3.547SerLeu: 3.547 ± 0.608
2.418SerMet: 2.418 ± 0.41
2.418SerAsn: 2.418 ± 0.631
1.773SerPro: 1.773 ± 0.503
2.096SerGln: 2.096 ± 0.431
2.741SerArg: 2.741 ± 0.723
3.385SerSer: 3.385 ± 0.984
2.902SerThr: 2.902 ± 0.54
2.741SerVal: 2.741 ± 0.556
0.645SerTrp: 0.645 ± 0.298
2.579SerTyr: 2.579 ± 0.547
0.0SerXaa: 0.0 ± 0.0
Thr
3.063ThrAla: 3.063 ± 0.674
0.967ThrCys: 0.967 ± 0.406
2.418ThrAsp: 2.418 ± 0.455
2.579ThrGlu: 2.579 ± 0.655
1.612ThrPhe: 1.612 ± 0.43
4.192ThrGly: 4.192 ± 1.022
0.967ThrHis: 0.967 ± 0.539
3.708ThrIle: 3.708 ± 0.401
3.385ThrLys: 3.385 ± 0.781
4.192ThrLeu: 4.192 ± 1.086
1.935ThrMet: 1.935 ± 0.761
1.612ThrAsn: 1.612 ± 0.397
1.935ThrPro: 1.935 ± 0.585
2.741ThrGln: 2.741 ± 0.651
2.902ThrArg: 2.902 ± 0.536
4.03ThrSer: 4.03 ± 0.761
2.418ThrThr: 2.418 ± 0.736
2.418ThrVal: 2.418 ± 0.699
0.967ThrTrp: 0.967 ± 0.26
1.29ThrTyr: 1.29 ± 0.582
0.0ThrXaa: 0.0 ± 0.0
Val
3.547ValAla: 3.547 ± 1.073
0.967ValCys: 0.967 ± 0.342
4.03ValAsp: 4.03 ± 0.838
4.192ValGlu: 4.192 ± 0.652
2.096ValPhe: 2.096 ± 0.606
3.869ValGly: 3.869 ± 0.806
1.29ValHis: 1.29 ± 0.636
4.675ValIle: 4.675 ± 0.798
3.224ValLys: 3.224 ± 0.6
6.61ValLeu: 6.61 ± 0.663
4.192ValMet: 4.192 ± 1.082
2.257ValAsn: 2.257 ± 0.99
3.385ValPro: 3.385 ± 0.735
3.385ValGln: 3.385 ± 0.637
5.481ValArg: 5.481 ± 1.19
3.547ValSer: 3.547 ± 0.79
2.741ValThr: 2.741 ± 0.852
3.224ValVal: 3.224 ± 0.519
0.967ValTrp: 0.967 ± 0.306
2.257ValTyr: 2.257 ± 0.422
0.0ValXaa: 0.0 ± 0.0
Trp
0.484TrpAla: 0.484 ± 0.201
0.0TrpCys: 0.0 ± 0.0
1.29TrpAsp: 1.29 ± 0.399
1.29TrpGlu: 1.29 ± 0.568
0.322TrpPhe: 0.322 ± 0.19
0.645TrpGly: 0.645 ± 0.279
0.967TrpHis: 0.967 ± 0.384
0.967TrpIle: 0.967 ± 0.461
1.29TrpLys: 1.29 ± 0.348
0.967TrpLeu: 0.967 ± 0.446
0.161TrpMet: 0.161 ± 0.151
0.484TrpAsn: 0.484 ± 0.423
0.0TrpPro: 0.0 ± 0.0
0.322TrpGln: 0.322 ± 0.296
0.484TrpArg: 0.484 ± 0.317
0.806TrpSer: 0.806 ± 0.394
0.322TrpThr: 0.322 ± 0.245
0.806TrpVal: 0.806 ± 0.327
0.161TrpTrp: 0.161 ± 0.145
0.322TrpTyr: 0.322 ± 0.192
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.063TyrAla: 3.063 ± 1.06
0.806TyrCys: 0.806 ± 0.31
2.902TyrAsp: 2.902 ± 0.613
2.096TyrGlu: 2.096 ± 0.597
1.29TyrPhe: 1.29 ± 0.443
2.257TyrGly: 2.257 ± 0.775
0.967TyrHis: 0.967 ± 0.466
1.773TyrIle: 1.773 ± 0.589
2.902TyrLys: 2.902 ± 0.599
2.902TyrLeu: 2.902 ± 0.7
0.967TyrMet: 0.967 ± 0.36
2.096TyrAsn: 2.096 ± 0.685
1.29TyrPro: 1.29 ± 0.416
1.29TyrGln: 1.29 ± 0.347
2.418TyrArg: 2.418 ± 0.311
3.063TyrSer: 3.063 ± 0.9
1.773TyrThr: 1.773 ± 0.422
3.224TyrVal: 3.224 ± 0.477
0.161TyrTrp: 0.161 ± 0.151
1.29TyrTyr: 1.29 ± 0.556
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6204 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski