Amino acid dipepetide frequency for Enterobacteria phage HK97 (Bacteriophage HK97)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.118AlaAla: 11.118 ± 1.508
0.674AlaCys: 0.674 ± 0.282
7.328AlaAsp: 7.328 ± 0.693
6.57AlaGlu: 6.57 ± 0.846
3.537AlaPhe: 3.537 ± 0.506
6.991AlaGly: 6.991 ± 0.825
1.769AlaHis: 1.769 ± 0.498
6.991AlaIle: 6.991 ± 0.632
5.222AlaLys: 5.222 ± 0.611
7.58AlaLeu: 7.58 ± 0.895
4.043AlaMet: 4.043 ± 0.545
4.548AlaAsn: 4.548 ± 0.712
2.19AlaPro: 2.19 ± 0.525
4.969AlaGln: 4.969 ± 0.711
5.559AlaArg: 5.559 ± 0.713
6.317AlaSer: 6.317 ± 1.139
5.727AlaThr: 5.727 ± 0.748
6.401AlaVal: 6.401 ± 0.82
1.348AlaTrp: 1.348 ± 0.3
2.021AlaTyr: 2.021 ± 0.388
0.0AlaXaa: 0.0 ± 0.0
Cys
1.179CysAla: 1.179 ± 0.322
0.168CysCys: 0.168 ± 0.125
0.758CysAsp: 0.758 ± 0.228
0.59CysGlu: 0.59 ± 0.209
0.084CysPhe: 0.084 ± 0.086
0.842CysGly: 0.842 ± 0.362
0.421CysHis: 0.421 ± 0.223
0.59CysIle: 0.59 ± 0.231
0.674CysLys: 0.674 ± 0.252
0.842CysLeu: 0.842 ± 0.267
0.168CysMet: 0.168 ± 0.121
0.842CysAsn: 0.842 ± 0.255
0.421CysPro: 0.421 ± 0.207
0.168CysGln: 0.168 ± 0.139
0.505CysArg: 0.505 ± 0.244
0.758CysSer: 0.758 ± 0.281
0.505CysThr: 0.505 ± 0.246
0.421CysVal: 0.421 ± 0.175
0.337CysTrp: 0.337 ± 0.203
0.084CysTyr: 0.084 ± 0.09
0.0CysXaa: 0.0 ± 0.0
Asp
6.738AspAla: 6.738 ± 0.765
0.505AspCys: 0.505 ± 0.247
2.948AspAsp: 2.948 ± 0.478
4.211AspGlu: 4.211 ± 0.775
2.021AspPhe: 2.021 ± 0.559
4.548AspGly: 4.548 ± 0.8
0.337AspHis: 0.337 ± 0.194
3.116AspIle: 3.116 ± 0.408
2.443AspLys: 2.443 ± 0.496
4.969AspLeu: 4.969 ± 0.584
1.263AspMet: 1.263 ± 0.338
3.369AspAsn: 3.369 ± 0.542
1.769AspPro: 1.769 ± 0.466
1.684AspGln: 1.684 ± 0.393
3.537AspArg: 3.537 ± 0.483
3.453AspSer: 3.453 ± 0.554
2.779AspThr: 2.779 ± 0.412
4.295AspVal: 4.295 ± 0.76
0.421AspTrp: 0.421 ± 0.209
2.358AspTyr: 2.358 ± 0.523
0.0AspXaa: 0.0 ± 0.0
Glu
5.39GluAla: 5.39 ± 0.767
0.674GluCys: 0.674 ± 0.286
2.443GluAsp: 2.443 ± 0.486
4.127GluGlu: 4.127 ± 0.778
2.358GluPhe: 2.358 ± 0.382
4.295GluGly: 4.295 ± 0.668
0.842GluHis: 0.842 ± 0.311
4.548GluIle: 4.548 ± 0.592
3.959GluLys: 3.959 ± 0.435
5.727GluLeu: 5.727 ± 0.512
1.516GluMet: 1.516 ± 0.371
2.864GluAsn: 2.864 ± 0.374
1.853GluPro: 1.853 ± 0.384
3.79GluGln: 3.79 ± 0.516
3.622GluArg: 3.622 ± 0.617
4.211GluSer: 4.211 ± 0.561
3.622GluThr: 3.622 ± 0.65
3.874GluVal: 3.874 ± 0.465
0.926GluTrp: 0.926 ± 0.355
2.021GluTyr: 2.021 ± 0.408
0.0GluXaa: 0.0 ± 0.0
Phe
3.369PheAla: 3.369 ± 0.498
0.337PheCys: 0.337 ± 0.166
1.937PheAsp: 1.937 ± 0.356
1.6PheGlu: 1.6 ± 0.352
0.337PhePhe: 0.337 ± 0.198
2.864PheGly: 2.864 ± 0.446
0.505PheHis: 0.505 ± 0.207
2.021PheIle: 2.021 ± 0.57
1.769PheLys: 1.769 ± 0.413
1.769PheLeu: 1.769 ± 0.506
0.842PheMet: 0.842 ± 0.269
1.769PheAsn: 1.769 ± 0.298
1.348PhePro: 1.348 ± 0.325
1.011PheGln: 1.011 ± 0.271
1.769PheArg: 1.769 ± 0.52
2.695PheSer: 2.695 ± 0.385
2.527PheThr: 2.527 ± 0.441
1.263PheVal: 1.263 ± 0.306
0.59PheTrp: 0.59 ± 0.195
1.348PheTyr: 1.348 ± 0.322
0.0PheXaa: 0.0 ± 0.0
Gly
5.727GlyAla: 5.727 ± 0.925
0.505GlyCys: 0.505 ± 0.212
5.138GlyAsp: 5.138 ± 0.73
4.295GlyGlu: 4.295 ± 0.604
2.864GlyPhe: 2.864 ± 0.654
5.138GlyGly: 5.138 ± 0.748
1.263GlyHis: 1.263 ± 0.339
4.043GlyIle: 4.043 ± 0.516
3.537GlyLys: 3.537 ± 0.423
5.812GlyLeu: 5.812 ± 0.726
3.032GlyMet: 3.032 ± 0.558
4.38GlyAsn: 4.38 ± 0.727
1.6GlyPro: 1.6 ± 0.404
3.453GlyGln: 3.453 ± 0.532
4.127GlyArg: 4.127 ± 0.525
3.79GlySer: 3.79 ± 0.643
4.717GlyThr: 4.717 ± 0.779
4.969GlyVal: 4.969 ± 0.642
1.095GlyTrp: 1.095 ± 0.266
2.358GlyTyr: 2.358 ± 0.412
0.0GlyXaa: 0.0 ± 0.0
His
1.179HisAla: 1.179 ± 0.362
0.084HisCys: 0.084 ± 0.083
0.59HisAsp: 0.59 ± 0.205
1.011HisGlu: 1.011 ± 0.336
0.505HisPhe: 0.505 ± 0.241
1.095HisGly: 1.095 ± 0.285
0.59HisHis: 0.59 ± 0.267
0.674HisIle: 0.674 ± 0.279
1.348HisLys: 1.348 ± 0.443
1.011HisLeu: 1.011 ± 0.307
0.253HisMet: 0.253 ± 0.188
0.59HisAsn: 0.59 ± 0.27
0.674HisPro: 0.674 ± 0.278
0.758HisGln: 0.758 ± 0.263
1.348HisArg: 1.348 ± 0.383
1.179HisSer: 1.179 ± 0.296
0.59HisThr: 0.59 ± 0.216
1.011HisVal: 1.011 ± 0.387
0.253HisTrp: 0.253 ± 0.147
0.421HisTyr: 0.421 ± 0.18
0.0HisXaa: 0.0 ± 0.0
Ile
6.064IleAla: 6.064 ± 0.612
0.421IleCys: 0.421 ± 0.206
4.632IleAsp: 4.632 ± 0.525
3.201IleGlu: 3.201 ± 0.5
1.684IlePhe: 1.684 ± 0.399
3.537IleGly: 3.537 ± 0.651
0.758IleHis: 0.758 ± 0.26
3.959IleIle: 3.959 ± 0.612
3.032IleLys: 3.032 ± 0.532
3.706IleLeu: 3.706 ± 0.616
1.179IleMet: 1.179 ± 0.427
3.537IleAsn: 3.537 ± 0.522
2.106IlePro: 2.106 ± 0.38
3.032IleGln: 3.032 ± 0.515
3.959IleArg: 3.959 ± 0.595
5.138IleSer: 5.138 ± 0.656
4.211IleThr: 4.211 ± 0.535
2.779IleVal: 2.779 ± 0.594
0.842IleTrp: 0.842 ± 0.269
1.6IleTyr: 1.6 ± 0.43
0.0IleXaa: 0.0 ± 0.0
Lys
6.064LysAla: 6.064 ± 0.582
0.59LysCys: 0.59 ± 0.281
2.779LysAsp: 2.779 ± 0.635
3.79LysGlu: 3.79 ± 0.654
1.432LysPhe: 1.432 ± 0.385
3.032LysGly: 3.032 ± 0.465
0.421LysHis: 0.421 ± 0.194
2.527LysIle: 2.527 ± 0.545
3.201LysLys: 3.201 ± 0.837
3.622LysLeu: 3.622 ± 0.59
1.853LysMet: 1.853 ± 0.451
1.937LysAsn: 1.937 ± 0.373
3.032LysPro: 3.032 ± 0.686
3.706LysGln: 3.706 ± 0.689
2.358LysArg: 2.358 ± 0.477
3.959LysSer: 3.959 ± 0.648
4.127LysThr: 4.127 ± 0.57
3.537LysVal: 3.537 ± 0.588
0.926LysTrp: 0.926 ± 0.284
1.853LysTyr: 1.853 ± 0.42
0.0LysXaa: 0.0 ± 0.0
Leu
7.833LeuAla: 7.833 ± 0.848
1.516LeuCys: 1.516 ± 0.438
4.295LeuAsp: 4.295 ± 0.519
5.222LeuGlu: 5.222 ± 0.657
1.853LeuPhe: 1.853 ± 0.365
3.79LeuGly: 3.79 ± 0.565
0.842LeuHis: 0.842 ± 0.324
5.559LeuIle: 5.559 ± 0.573
5.138LeuLys: 5.138 ± 0.606
5.053LeuLeu: 5.053 ± 0.555
1.432LeuMet: 1.432 ± 0.373
4.38LeuAsn: 4.38 ± 0.643
3.622LeuPro: 3.622 ± 0.467
3.032LeuGln: 3.032 ± 0.516
5.643LeuArg: 5.643 ± 0.769
6.064LeuSer: 6.064 ± 0.767
4.717LeuThr: 4.717 ± 0.591
4.464LeuVal: 4.464 ± 0.564
0.674LeuTrp: 0.674 ± 0.263
2.106LeuTyr: 2.106 ± 0.395
0.0LeuXaa: 0.0 ± 0.0
Met
3.285MetAla: 3.285 ± 0.556
0.421MetCys: 0.421 ± 0.211
1.263MetAsp: 1.263 ± 0.336
1.179MetGlu: 1.179 ± 0.279
0.59MetPhe: 0.59 ± 0.24
1.432MetGly: 1.432 ± 0.319
0.505MetHis: 0.505 ± 0.198
0.926MetIle: 0.926 ± 0.275
1.769MetLys: 1.769 ± 0.354
2.19MetLeu: 2.19 ± 0.349
0.421MetMet: 0.421 ± 0.18
0.59MetAsn: 0.59 ± 0.243
1.179MetPro: 1.179 ± 0.265
1.348MetGln: 1.348 ± 0.277
2.19MetArg: 2.19 ± 0.373
2.864MetSer: 2.864 ± 0.505
2.358MetThr: 2.358 ± 0.413
0.674MetVal: 0.674 ± 0.265
0.337MetTrp: 0.337 ± 0.196
0.674MetTyr: 0.674 ± 0.188
0.0MetXaa: 0.0 ± 0.0
Asn
5.643AsnAla: 5.643 ± 0.644
0.253AsnCys: 0.253 ± 0.151
2.021AsnAsp: 2.021 ± 0.313
2.611AsnGlu: 2.611 ± 0.421
1.432AsnPhe: 1.432 ± 0.399
5.39AsnGly: 5.39 ± 0.715
0.758AsnHis: 0.758 ± 0.297
3.285AsnIle: 3.285 ± 0.486
2.779AsnLys: 2.779 ± 0.485
3.453AsnLeu: 3.453 ± 0.556
0.758AsnMet: 0.758 ± 0.256
2.19AsnAsn: 2.19 ± 0.494
2.358AsnPro: 2.358 ± 0.495
2.106AsnGln: 2.106 ± 0.402
1.6AsnArg: 1.6 ± 0.366
2.611AsnSer: 2.611 ± 0.475
3.032AsnThr: 3.032 ± 0.632
1.853AsnVal: 1.853 ± 0.544
0.758AsnTrp: 0.758 ± 0.205
1.432AsnTyr: 1.432 ± 0.307
0.0AsnXaa: 0.0 ± 0.0
Pro
3.369ProAla: 3.369 ± 0.475
0.505ProCys: 0.505 ± 0.223
1.937ProAsp: 1.937 ± 0.441
3.032ProGlu: 3.032 ± 0.548
1.516ProPhe: 1.516 ± 0.363
2.948ProGly: 2.948 ± 0.391
0.674ProHis: 0.674 ± 0.293
1.263ProIle: 1.263 ± 0.346
1.432ProLys: 1.432 ± 0.319
2.948ProLeu: 2.948 ± 0.713
1.263ProMet: 1.263 ± 0.318
1.516ProAsn: 1.516 ± 0.422
1.937ProPro: 1.937 ± 0.39
1.684ProGln: 1.684 ± 0.42
2.106ProArg: 2.106 ± 0.363
3.285ProSer: 3.285 ± 0.626
2.358ProThr: 2.358 ± 0.568
3.201ProVal: 3.201 ± 0.577
0.59ProTrp: 0.59 ± 0.267
1.095ProTyr: 1.095 ± 0.266
0.0ProXaa: 0.0 ± 0.0
Gln
5.643GlnAla: 5.643 ± 0.773
0.337GlnCys: 0.337 ± 0.178
1.853GlnAsp: 1.853 ± 0.404
2.527GlnGlu: 2.527 ± 0.35
1.179GlnPhe: 1.179 ± 0.307
2.611GlnGly: 2.611 ± 0.475
0.842GlnHis: 0.842 ± 0.27
2.864GlnIle: 2.864 ± 0.374
2.695GlnLys: 2.695 ± 0.51
4.295GlnLeu: 4.295 ± 0.737
1.179GlnMet: 1.179 ± 0.292
2.106GlnAsn: 2.106 ± 0.503
1.853GlnPro: 1.853 ± 0.423
3.116GlnGln: 3.116 ± 0.723
3.285GlnArg: 3.285 ± 0.639
3.285GlnSer: 3.285 ± 0.624
2.695GlnThr: 2.695 ± 0.589
3.369GlnVal: 3.369 ± 0.496
0.758GlnTrp: 0.758 ± 0.295
1.263GlnTyr: 1.263 ± 0.381
0.0GlnXaa: 0.0 ± 0.0
Arg
4.632ArgAla: 4.632 ± 0.737
0.59ArgCys: 0.59 ± 0.246
3.79ArgAsp: 3.79 ± 0.685
4.464ArgGlu: 4.464 ± 0.764
2.021ArgPhe: 2.021 ± 0.416
4.211ArgGly: 4.211 ± 0.518
1.432ArgHis: 1.432 ± 0.266
2.527ArgIle: 2.527 ± 0.475
4.38ArgLys: 4.38 ± 0.575
5.98ArgLeu: 5.98 ± 0.626
1.6ArgMet: 1.6 ± 0.363
3.369ArgAsn: 3.369 ± 0.577
1.853ArgPro: 1.853 ± 0.405
3.032ArgGln: 3.032 ± 0.52
4.043ArgArg: 4.043 ± 0.942
3.453ArgSer: 3.453 ± 0.652
2.695ArgThr: 2.695 ± 0.493
3.453ArgVal: 3.453 ± 0.528
1.432ArgTrp: 1.432 ± 0.402
2.19ArgTyr: 2.19 ± 0.382
0.0ArgXaa: 0.0 ± 0.0
Ser
7.58SerAla: 7.58 ± 0.95
0.505SerCys: 0.505 ± 0.186
4.38SerAsp: 4.38 ± 0.619
3.959SerGlu: 3.959 ± 0.632
2.527SerPhe: 2.527 ± 0.513
6.064SerGly: 6.064 ± 0.692
0.842SerHis: 0.842 ± 0.248
3.874SerIle: 3.874 ± 0.635
2.864SerLys: 2.864 ± 0.63
5.475SerLeu: 5.475 ± 0.67
1.937SerMet: 1.937 ± 0.376
2.358SerAsn: 2.358 ± 0.391
3.369SerPro: 3.369 ± 0.486
4.043SerGln: 4.043 ± 0.752
5.053SerArg: 5.053 ± 0.651
5.559SerSer: 5.559 ± 1.166
3.453SerThr: 3.453 ± 0.508
5.896SerVal: 5.896 ± 0.853
1.348SerTrp: 1.348 ± 0.3
0.674SerTyr: 0.674 ± 0.237
0.0SerXaa: 0.0 ± 0.0
Thr
6.233ThrAla: 6.233 ± 0.749
0.505ThrCys: 0.505 ± 0.221
3.285ThrAsp: 3.285 ± 0.45
3.453ThrGlu: 3.453 ± 0.546
2.611ThrPhe: 2.611 ± 0.608
6.485ThrGly: 6.485 ± 0.723
0.758ThrHis: 0.758 ± 0.249
3.959ThrIle: 3.959 ± 0.736
2.864ThrLys: 2.864 ± 0.497
3.79ThrLeu: 3.79 ± 0.438
1.179ThrMet: 1.179 ± 0.282
2.106ThrAsn: 2.106 ± 0.45
3.116ThrPro: 3.116 ± 0.542
2.779ThrGln: 2.779 ± 0.53
2.611ThrArg: 2.611 ± 0.352
4.127ThrSer: 4.127 ± 0.532
3.285ThrThr: 3.285 ± 0.653
4.801ThrVal: 4.801 ± 0.514
1.179ThrTrp: 1.179 ± 0.371
1.348ThrTyr: 1.348 ± 0.433
0.0ThrXaa: 0.0 ± 0.0
Val
5.475ValAla: 5.475 ± 0.692
0.926ValCys: 0.926 ± 0.273
2.779ValAsp: 2.779 ± 0.322
4.38ValGlu: 4.38 ± 0.42
2.106ValPhe: 2.106 ± 0.328
3.537ValGly: 3.537 ± 0.622
0.758ValHis: 0.758 ± 0.24
3.874ValIle: 3.874 ± 0.722
4.211ValLys: 4.211 ± 0.669
5.306ValLeu: 5.306 ± 0.753
1.516ValMet: 1.516 ± 0.294
2.358ValAsn: 2.358 ± 0.539
2.695ValPro: 2.695 ± 0.529
1.516ValGln: 1.516 ± 0.324
3.706ValArg: 3.706 ± 0.529
6.233ValSer: 6.233 ± 1.019
4.548ValThr: 4.548 ± 0.691
4.801ValVal: 4.801 ± 0.961
0.758ValTrp: 0.758 ± 0.315
1.937ValTyr: 1.937 ± 0.405
0.0ValXaa: 0.0 ± 0.0
Trp
0.674TrpAla: 0.674 ± 0.231
0.337TrpCys: 0.337 ± 0.167
0.926TrpAsp: 0.926 ± 0.249
0.926TrpGlu: 0.926 ± 0.22
0.337TrpPhe: 0.337 ± 0.151
1.011TrpGly: 1.011 ± 0.327
0.505TrpHis: 0.505 ± 0.191
0.842TrpIle: 0.842 ± 0.236
0.842TrpLys: 0.842 ± 0.255
1.937TrpLeu: 1.937 ± 0.484
0.337TrpMet: 0.337 ± 0.154
0.421TrpAsn: 0.421 ± 0.191
0.505TrpPro: 0.505 ± 0.185
0.674TrpGln: 0.674 ± 0.229
1.011TrpArg: 1.011 ± 0.241
0.842TrpSer: 0.842 ± 0.25
1.011TrpThr: 1.011 ± 0.336
1.179TrpVal: 1.179 ± 0.257
0.337TrpTrp: 0.337 ± 0.183
0.674TrpTyr: 0.674 ± 0.205
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.706TyrAla: 3.706 ± 0.553
0.505TyrCys: 0.505 ± 0.199
1.684TyrAsp: 1.684 ± 0.334
1.516TyrGlu: 1.516 ± 0.504
0.674TyrPhe: 0.674 ± 0.257
2.021TyrGly: 2.021 ± 0.448
0.337TyrHis: 0.337 ± 0.203
1.937TyrIle: 1.937 ± 0.391
0.59TyrLys: 0.59 ± 0.201
1.769TyrLeu: 1.769 ± 0.341
0.421TyrMet: 0.421 ± 0.195
1.095TyrAsn: 1.095 ± 0.281
1.179TyrPro: 1.179 ± 0.408
1.853TyrGln: 1.853 ± 0.367
3.116TyrArg: 3.116 ± 0.45
1.937TyrSer: 1.937 ± 0.478
1.516TyrThr: 1.516 ± 0.423
1.179TyrVal: 1.179 ± 0.232
0.421TyrTrp: 0.421 ± 0.169
0.758TyrTyr: 0.758 ± 0.244
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (11874 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski