Amino acid dipeptide frequency for Enterobacteria phage HK106

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
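The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function names (`dipeptide_permille`, `classify`) are hypothetical, frequencies are normalised by the total number of overlapping pairs counted, and the uniform 2.5‰ value is taken as the threshold between over- and underrepresented dipeptides.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else is treated as Xaa ("X").
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1000 per mille spread over 20 * 20 = 400 dipeptides.
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2  # 2.5


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs.

    Pairs are counted within each protein only, never across protein boundaries.
    Assumption: normalisation by the total number of pairs counted.
    """
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


# Toy input, not real phage proteins.
toy = ["MKAAL", "AAKL"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With the values in the table, `classify(10.304)` (AlaAla) gives "over" and `classify(0.559)` (AlaCys) gives "under".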

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
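As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical and exist only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (1% = 10 per mille)."""
    return value_permille / 10.0


ala_ala = 10.304  # AlaAla frequency from the table, in per mille
print(f"fraction: {permille_to_fraction(ala_ala):.6f}")
print(f"percent:  {permille_to_percent(ala_ala):.4f}")
```

The same conversion applied to the uniform expectation of 2.5‰ gives back the 0.25% quoted above.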

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is frequency ± error (‰).

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 10.304 ± 1.49 | 0.559 ± 0.22 | 6.151 ± 0.647 | 5.831 ± 0.775 | 3.834 ± 0.525 | 7.349 ± 0.784 | 1.438 ± 0.38 | 6.79 ± 0.692 | 5.512 ± 0.581 | 7.509 ± 0.937 | 3.994 ± 0.608 | 4.952 ± 0.689 | 1.917 ± 0.402 | 4.553 ± 0.652 | 4.793 ± 0.656 | 5.272 ± 1.199 | 5.911 ± 0.929 | 6.231 ± 0.711 | 1.518 ± 0.387 | 2.636 ± 0.455 | 0.0 ± 0.0
Cys | 1.118 ± 0.374 | 0.32 ± 0.14 | 1.118 ± 0.303 | 1.118 ± 0.268 | 0.16 ± 0.116 | 1.038 ± 0.327 | 0.399 ± 0.231 | 0.719 ± 0.254 | 0.799 ± 0.272 | 1.038 ± 0.295 | 0.16 ± 0.107 | 0.639 ± 0.211 | 0.399 ± 0.21 | 0.16 ± 0.114 | 0.959 ± 0.333 | 0.959 ± 0.279 | 0.559 ± 0.191 | 0.719 ± 0.264 | 0.24 ± 0.144 | 0.24 ± 0.141 | 0.0 ± 0.0
Asp | 6.071 ± 0.712 | 0.719 ± 0.285 | 3.674 ± 0.618 | 4.393 ± 0.52 | 1.997 ± 0.477 | 4.713 ± 0.608 | 0.559 ± 0.258 | 2.876 ± 0.424 | 2.956 ± 0.437 | 4.952 ± 0.599 | 1.358 ± 0.337 | 3.115 ± 0.444 | 1.598 ± 0.399 | 1.997 ± 0.405 | 3.115 ± 0.562 | 3.515 ± 0.565 | 2.556 ± 0.454 | 4.313 ± 0.67 | 0.879 ± 0.291 | 2.556 ± 0.54 | 0.0 ± 0.0
Glu | 5.192 ± 0.702 | 1.118 ± 0.345 | 2.476 ± 0.431 | 4.873 ± 0.749 | 2.476 ± 0.381 | 4.074 ± 0.459 | 0.799 ± 0.248 | 4.154 ± 0.602 | 4.234 ± 0.537 | 5.671 ± 0.582 | 1.438 ± 0.346 | 3.834 ± 0.485 | 1.997 ± 0.427 | 3.674 ± 0.629 | 3.674 ± 0.732 | 4.154 ± 0.536 | 3.435 ± 0.493 | 3.435 ± 0.458 | 1.358 ± 0.349 | 1.917 ± 0.403 | 0.0 ± 0.0
Phe | 3.355 ± 0.474 | 0.479 ± 0.236 | 2.077 ± 0.427 | 1.997 ± 0.416 | 0.799 ± 0.241 | 2.716 ± 0.38 | 0.719 ± 0.222 | 2.556 ± 0.659 | 1.757 ± 0.432 | 1.757 ± 0.423 | 0.879 ± 0.233 | 2.157 ± 0.367 | 1.438 ± 0.353 | 0.879 ± 0.219 | 1.757 ± 0.468 | 2.956 ± 0.431 | 2.237 ± 0.41 | 1.438 ± 0.344 | 0.479 ± 0.191 | 1.358 ± 0.377 | 0.0 ± 0.0
Gly | 6.31 ± 0.958 | 0.719 ± 0.213 | 4.633 ± 0.563 | 4.313 ± 0.508 | 3.355 ± 0.54 | 5.112 ± 0.69 | 1.198 ± 0.304 | 3.515 ± 0.484 | 4.074 ± 0.479 | 5.911 ± 0.794 | 2.796 ± 0.524 | 4.154 ± 0.789 | 1.598 ± 0.344 | 3.435 ± 0.41 | 3.994 ± 0.608 | 3.834 ± 0.577 | 4.952 ± 0.682 | 4.633 ± 0.489 | 1.198 ± 0.32 | 2.556 ± 0.461 | 0.0 ± 0.0
His | 1.358 ± 0.338 | 0.24 ± 0.133 | 0.479 ± 0.192 | 1.358 ± 0.421 | 0.479 ± 0.215 | 1.598 ± 0.381 | 0.32 ± 0.161 | 0.479 ± 0.22 | 1.598 ± 0.415 | 0.879 ± 0.307 | 0.08 ± 0.083 | 0.559 ± 0.194 | 0.719 ± 0.27 | 0.799 ± 0.287 | 1.598 ± 0.347 | 1.038 ± 0.274 | 0.559 ± 0.192 | 0.639 ± 0.303 | 0.399 ± 0.178 | 0.32 ± 0.163 | 0.0 ± 0.0
Ile | 5.032 ± 0.709 | 0.719 ± 0.283 | 4.074 ± 0.582 | 3.035 ± 0.498 | 1.837 ± 0.404 | 3.435 ± 0.575 | 1.118 ± 0.31 | 2.237 ± 0.403 | 2.556 ± 0.384 | 3.674 ± 0.663 | 0.719 ± 0.322 | 3.674 ± 0.483 | 2.157 ± 0.401 | 2.636 ± 0.514 | 3.914 ± 0.535 | 5.192 ± 0.73 | 3.914 ± 0.469 | 3.115 ± 0.648 | 1.038 ± 0.304 | 1.438 ± 0.438 | 0.0 ± 0.0
Lys | 6.39 ± 0.664 | 0.799 ± 0.314 | 3.195 ± 0.675 | 3.914 ± 0.688 | 1.438 ± 0.325 | 3.115 ± 0.53 | 0.479 ± 0.208 | 2.476 ± 0.451 | 4.154 ± 0.996 | 4.473 ± 0.566 | 2.077 ± 0.426 | 2.237 ± 0.41 | 3.035 ± 0.573 | 4.074 ± 0.774 | 2.636 ± 0.556 | 3.515 ± 0.604 | 3.275 ± 0.411 | 2.956 ± 0.482 | 1.118 ± 0.309 | 2.316 ± 0.431 | 0.0 ± 0.0
Leu | 7.269 ± 0.934 | 1.598 ± 0.417 | 4.553 ± 0.519 | 5.432 ± 0.635 | 2.237 ± 0.423 | 3.595 ± 0.747 | 0.959 ± 0.264 | 4.873 ± 0.666 | 4.952 ± 0.659 | 5.432 ± 0.67 | 1.917 ± 0.436 | 4.633 ± 0.623 | 3.515 ± 0.517 | 3.035 ± 0.544 | 5.512 ± 0.635 | 6.31 ± 0.708 | 4.873 ± 0.581 | 4.154 ± 0.56 | 0.559 ± 0.217 | 2.316 ± 0.384 | 0.0 ± 0.0
Met | 3.515 ± 0.555 | 0.32 ± 0.18 | 1.198 ± 0.292 | 1.358 ± 0.391 | 0.719 ± 0.223 | 1.757 ± 0.386 | 0.559 ± 0.252 | 1.278 ± 0.383 | 1.917 ± 0.311 | 2.396 ± 0.362 | 0.719 ± 0.228 | 0.479 ± 0.228 | 1.278 ± 0.299 | 1.198 ± 0.22 | 1.997 ± 0.375 | 1.917 ± 0.382 | 2.237 ± 0.386 | 0.559 ± 0.229 | 0.399 ± 0.213 | 0.879 ± 0.237 | 0.0 ± 0.0
Asn | 5.192 ± 0.775 | 0.559 ± 0.228 | 2.716 ± 0.396 | 2.396 ± 0.403 | 1.118 ± 0.317 | 5.112 ± 0.694 | 0.959 ± 0.34 | 3.595 ± 0.471 | 2.396 ± 0.399 | 3.435 ± 0.475 | 1.038 ± 0.254 | 2.396 ± 0.565 | 2.237 ± 0.546 | 2.316 ± 0.413 | 1.997 ± 0.532 | 2.796 ± 0.489 | 3.115 ± 0.476 | 1.757 ± 0.479 | 0.799 ± 0.223 | 1.757 ± 0.396 | 0.0 ± 0.0
Pro | 3.275 ± 0.493 | 0.799 ± 0.227 | 1.757 ± 0.387 | 2.716 ± 0.553 | 1.358 ± 0.452 | 3.195 ± 0.432 | 0.799 ± 0.285 | 1.358 ± 0.391 | 1.438 ± 0.343 | 2.876 ± 0.678 | 0.799 ± 0.296 | 1.038 ± 0.266 | 1.837 ± 0.372 | 2.077 ± 0.389 | 2.157 ± 0.449 | 3.275 ± 0.594 | 1.917 ± 0.404 | 3.515 ± 0.546 | 0.559 ± 0.255 | 0.879 ± 0.23 | 0.0 ± 0.0
Gln | 5.272 ± 0.838 | 0.399 ± 0.164 | 1.598 ± 0.384 | 2.157 ± 0.395 | 0.879 ± 0.253 | 2.157 ± 0.471 | 0.719 ± 0.216 | 2.556 ± 0.384 | 3.355 ± 0.56 | 4.074 ± 0.677 | 1.438 ± 0.331 | 2.157 ± 0.457 | 1.677 ± 0.415 | 2.796 ± 0.55 | 3.834 ± 0.707 | 3.515 ± 0.599 | 3.195 ± 0.649 | 3.275 ± 0.491 | 0.639 ± 0.284 | 1.278 ± 0.423 | 0.0 ± 0.0
Arg | 4.873 ± 0.752 | 0.639 ± 0.263 | 3.674 ± 0.654 | 4.952 ± 0.713 | 1.917 ± 0.411 | 3.355 ± 0.595 | 1.198 ± 0.269 | 2.956 ± 0.518 | 4.553 ± 0.608 | 4.793 ± 0.522 | 1.917 ± 0.369 | 3.195 ± 0.413 | 1.518 ± 0.349 | 2.876 ± 0.538 | 3.914 ± 0.803 | 3.195 ± 0.528 | 3.115 ± 0.462 | 3.595 ± 0.59 | 1.518 ± 0.371 | 2.237 ± 0.39 | 0.0 ± 0.0
Ser | 7.189 ± 1.23 | 0.719 ± 0.19 | 3.595 ± 0.547 | 4.234 ± 0.61 | 2.396 ± 0.465 | 6.39 ± 0.769 | 0.639 ± 0.226 | 3.035 ± 0.524 | 3.435 ± 0.721 | 5.831 ± 0.569 | 1.757 ± 0.376 | 2.316 ± 0.449 | 3.195 ± 0.562 | 3.754 ± 0.852 | 4.713 ± 0.751 | 5.032 ± 0.974 | 3.595 ± 0.523 | 5.272 ± 0.851 | 1.198 ± 0.265 | 0.719 ± 0.193 | 0.0 ± 0.0
Thr | 6.55 ± 0.807 | 0.639 ± 0.239 | 3.754 ± 0.531 | 3.355 ± 0.46 | 3.195 ± 0.63 | 5.831 ± 0.676 | 0.559 ± 0.243 | 3.595 ± 0.542 | 2.556 ± 0.453 | 4.154 ± 0.539 | 1.358 ± 0.268 | 2.077 ± 0.396 | 2.796 ± 0.62 | 2.636 ± 0.47 | 2.476 ± 0.428 | 3.674 ± 0.53 | 3.035 ± 0.466 | 4.473 ± 0.532 | 0.879 ± 0.27 | 1.278 ± 0.451 | 0.0 ± 0.0
Val | 5.032 ± 0.693 | 0.559 ± 0.23 | 3.754 ± 0.443 | 3.754 ± 0.452 | 2.237 ± 0.336 | 4.473 ± 0.7 | 1.198 ± 0.321 | 3.595 ± 0.674 | 3.914 ± 0.564 | 5.432 ± 0.717 | 1.198 ± 0.262 | 2.316 ± 0.589 | 2.556 ± 0.429 | 1.757 ± 0.333 | 3.275 ± 0.494 | 5.592 ± 0.768 | 3.914 ± 0.541 | 5.032 ± 0.898 | 0.879 ± 0.294 | 1.837 ± 0.339 | 0.0 ± 0.0
Trp | 0.559 ± 0.185 | 0.399 ± 0.184 | 1.198 ± 0.32 | 0.959 ± 0.217 | 0.879 ± 0.324 | 1.118 ± 0.339 | 0.639 ± 0.195 | 0.799 ± 0.271 | 0.639 ± 0.245 | 1.438 ± 0.329 | 0.559 ± 0.187 | 0.559 ± 0.197 | 0.639 ± 0.193 | 0.559 ± 0.216 | 1.038 ± 0.259 | 1.198 ± 0.301 | 0.959 ± 0.36 | 1.438 ± 0.266 | 0.559 ± 0.209 | 0.559 ± 0.16 | 0.0 ± 0.0
Tyr | 3.195 ± 0.522 | 0.559 ± 0.198 | 2.237 ± 0.32 | 1.837 ± 0.403 | 0.479 ± 0.187 | 2.157 ± 0.455 | 0.24 ± 0.147 | 2.077 ± 0.358 | 0.879 ± 0.285 | 1.997 ± 0.406 | 0.399 ± 0.189 | 1.198 ± 0.274 | 1.518 ± 0.385 | 1.757 ± 0.373 | 2.636 ± 0.438 | 2.077 ± 0.445 | 1.598 ± 0.384 | 1.757 ± 0.303 | 0.479 ± 0.183 | 0.799 ± 0.272 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 65 proteins (12520 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
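To illustrate what a protein-level bootstrap means, here is a minimal sketch: whole proteins are resampled with replacement, the dipeptide frequency is recomputed on each resample, and the spread of those estimates is reported as the error. The function names, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions made for this example; the source only states that bootstrapping was done 100 times at the protein level.

```python
import random
import statistics


def permille_of(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    count = 0
    total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        total += len(pairs)
        count += pairs.count(dipeptide)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins.

    Assumption: the error is the sample standard deviation over the resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille_of(resample, dipeptide))
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy_proteins = ["MKAAL", "AAKL", "MAAAK", "MKLV"]
print(permille_of(toy_proteins, "AA"), "±", bootstrap_error(toy_proteins, "AA"))
```

Resampling whole proteins rather than individual residue pairs keeps the pairs from one protein together, so the error reflects variation between proteins.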

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski