Amino acid dipepetide frequency for Enterobacteria phage ES18

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.44AlaAla: 12.44 ± 2.496
1.375AlaCys: 1.375 ± 0.378
5.361AlaAsp: 5.361 ± 0.586
8.247AlaGlu: 8.247 ± 0.747
3.436AlaPhe: 3.436 ± 0.5
6.873AlaGly: 6.873 ± 0.742
1.1AlaHis: 1.1 ± 0.2
5.636AlaIle: 5.636 ± 0.536
5.979AlaLys: 5.979 ± 0.57
7.079AlaLeu: 7.079 ± 0.998
3.574AlaMet: 3.574 ± 0.633
4.605AlaAsn: 4.605 ± 0.851
2.131AlaPro: 2.131 ± 0.386
3.299AlaGln: 3.299 ± 0.859
4.811AlaArg: 4.811 ± 0.632
5.704AlaSer: 5.704 ± 0.638
5.979AlaThr: 5.979 ± 0.982
5.979AlaVal: 5.979 ± 0.652
2.199AlaTrp: 2.199 ± 0.423
3.024AlaTyr: 3.024 ± 0.413
0.0AlaXaa: 0.0 ± 0.0
Cys
0.962CysAla: 0.962 ± 0.289
0.275CysCys: 0.275 ± 0.12
0.825CysAsp: 0.825 ± 0.251
0.825CysGlu: 0.825 ± 0.262
0.275CysPhe: 0.275 ± 0.133
1.443CysGly: 1.443 ± 0.41
0.481CysHis: 0.481 ± 0.184
0.687CysIle: 0.687 ± 0.232
0.825CysLys: 0.825 ± 0.218
0.55CysLeu: 0.55 ± 0.201
0.137CysMet: 0.137 ± 0.099
1.031CysAsn: 1.031 ± 0.279
0.55CysPro: 0.55 ± 0.189
0.481CysGln: 0.481 ± 0.174
1.375CysArg: 1.375 ± 0.316
0.825CysSer: 0.825 ± 0.262
0.55CysThr: 0.55 ± 0.212
0.756CysVal: 0.756 ± 0.208
0.275CysTrp: 0.275 ± 0.131
0.412CysTyr: 0.412 ± 0.168
0.0CysXaa: 0.0 ± 0.0
Asp
6.598AspAla: 6.598 ± 0.652
0.687AspCys: 0.687 ± 0.24
3.711AspAsp: 3.711 ± 0.434
3.986AspGlu: 3.986 ± 0.556
1.993AspPhe: 1.993 ± 0.363
4.467AspGly: 4.467 ± 0.794
0.756AspHis: 0.756 ± 0.244
3.436AspIle: 3.436 ± 0.448
3.986AspLys: 3.986 ± 0.48
3.986AspLeu: 3.986 ± 0.362
1.512AspMet: 1.512 ± 0.274
2.268AspAsn: 2.268 ± 0.533
2.199AspPro: 2.199 ± 0.364
1.856AspGln: 1.856 ± 0.475
1.993AspArg: 1.993 ± 0.336
3.093AspSer: 3.093 ± 0.396
2.268AspThr: 2.268 ± 0.344
3.505AspVal: 3.505 ± 0.549
0.756AspTrp: 0.756 ± 0.314
1.993AspTyr: 1.993 ± 0.383
0.0AspXaa: 0.0 ± 0.0
Glu
6.46GluAla: 6.46 ± 0.641
0.962GluCys: 0.962 ± 0.205
2.749GluAsp: 2.749 ± 0.592
4.948GluGlu: 4.948 ± 0.884
2.199GluPhe: 2.199 ± 0.467
3.918GluGly: 3.918 ± 0.496
0.825GluHis: 0.825 ± 0.238
4.33GluIle: 4.33 ± 0.523
5.43GluLys: 5.43 ± 0.901
6.529GluLeu: 6.529 ± 0.798
1.718GluMet: 1.718 ± 0.303
2.887GluAsn: 2.887 ± 0.47
1.993GluPro: 1.993 ± 0.354
4.261GluGln: 4.261 ± 0.602
5.498GluArg: 5.498 ± 0.794
3.574GluSer: 3.574 ± 0.447
3.162GluThr: 3.162 ± 0.538
3.711GluVal: 3.711 ± 0.449
1.856GluTrp: 1.856 ± 0.348
2.818GluTyr: 2.818 ± 0.381
0.0GluXaa: 0.0 ± 0.0
Phe
3.024PheAla: 3.024 ± 0.482
0.481PheCys: 0.481 ± 0.172
2.131PheAsp: 2.131 ± 0.458
2.062PheGlu: 2.062 ± 0.365
0.893PhePhe: 0.893 ± 0.217
2.749PheGly: 2.749 ± 0.45
0.481PheHis: 0.481 ± 0.193
2.474PheIle: 2.474 ± 0.485
1.856PheLys: 1.856 ± 0.343
2.131PheLeu: 2.131 ± 0.436
0.619PheMet: 0.619 ± 0.179
1.718PheAsn: 1.718 ± 0.386
0.825PhePro: 0.825 ± 0.285
0.619PheGln: 0.619 ± 0.207
2.268PheArg: 2.268 ± 0.468
1.718PheSer: 1.718 ± 0.293
1.993PheThr: 1.993 ± 0.394
1.718PheVal: 1.718 ± 0.3
0.412PheTrp: 0.412 ± 0.151
1.031PheTyr: 1.031 ± 0.281
0.0PheXaa: 0.0 ± 0.0
Gly
5.223GlyAla: 5.223 ± 0.631
0.962GlyCys: 0.962 ± 0.317
4.33GlyAsp: 4.33 ± 0.743
4.261GlyGlu: 4.261 ± 0.581
2.131GlyPhe: 2.131 ± 0.326
4.674GlyGly: 4.674 ± 0.625
0.962GlyHis: 0.962 ± 0.29
4.192GlyIle: 4.192 ± 0.464
6.048GlyLys: 6.048 ± 0.616
4.261GlyLeu: 4.261 ± 0.584
1.993GlyMet: 1.993 ± 0.336
3.78GlyAsn: 3.78 ± 0.458
1.1GlyPro: 1.1 ± 0.282
2.818GlyGln: 2.818 ± 0.475
3.299GlyArg: 3.299 ± 0.384
3.505GlySer: 3.505 ± 0.538
3.986GlyThr: 3.986 ± 0.5
5.567GlyVal: 5.567 ± 0.57
1.237GlyTrp: 1.237 ± 0.305
3.093GlyTyr: 3.093 ± 0.501
0.0GlyXaa: 0.0 ± 0.0
His
1.306HisAla: 1.306 ± 0.33
0.481HisCys: 0.481 ± 0.167
1.512HisAsp: 1.512 ± 0.289
1.237HisGlu: 1.237 ± 0.303
0.825HisPhe: 0.825 ± 0.184
1.237HisGly: 1.237 ± 0.32
0.412HisHis: 0.412 ± 0.16
0.893HisIle: 0.893 ± 0.207
0.962HisLys: 0.962 ± 0.312
1.856HisLeu: 1.856 ± 0.377
0.206HisMet: 0.206 ± 0.101
0.825HisAsn: 0.825 ± 0.22
1.031HisPro: 1.031 ± 0.294
0.825HisGln: 0.825 ± 0.195
0.893HisArg: 0.893 ± 0.229
0.619HisSer: 0.619 ± 0.207
0.687HisThr: 0.687 ± 0.226
0.275HisVal: 0.275 ± 0.132
0.137HisTrp: 0.137 ± 0.09
0.756HisTyr: 0.756 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
5.223IleAla: 5.223 ± 0.681
0.55IleCys: 0.55 ± 0.184
3.574IleAsp: 3.574 ± 0.492
3.849IleGlu: 3.849 ± 0.43
1.581IlePhe: 1.581 ± 0.284
4.261IleGly: 4.261 ± 0.574
1.237IleHis: 1.237 ± 0.31
3.849IleIle: 3.849 ± 0.493
3.162IleLys: 3.162 ± 0.449
3.643IleLeu: 3.643 ± 0.41
1.443IleMet: 1.443 ± 0.404
3.368IleAsn: 3.368 ± 0.37
2.474IlePro: 2.474 ± 0.363
2.405IleGln: 2.405 ± 0.398
3.918IleArg: 3.918 ± 0.475
4.467IleSer: 4.467 ± 0.441
4.811IleThr: 4.811 ± 0.581
2.474IleVal: 2.474 ± 0.416
1.375IleTrp: 1.375 ± 0.257
0.756IleTyr: 0.756 ± 0.231
0.0IleXaa: 0.0 ± 0.0
Lys
7.01LysAla: 7.01 ± 0.734
1.237LysCys: 1.237 ± 0.262
2.818LysAsp: 2.818 ± 0.473
4.192LysGlu: 4.192 ± 0.655
2.405LysPhe: 2.405 ± 0.336
3.368LysGly: 3.368 ± 0.492
1.031LysHis: 1.031 ± 0.3
3.78LysIle: 3.78 ± 0.516
4.399LysLys: 4.399 ± 0.688
6.048LysLeu: 6.048 ± 0.808
1.649LysMet: 1.649 ± 0.387
2.612LysAsn: 2.612 ± 0.504
2.68LysPro: 2.68 ± 0.499
3.711LysGln: 3.711 ± 0.544
3.986LysArg: 3.986 ± 0.486
4.536LysSer: 4.536 ± 0.58
3.368LysThr: 3.368 ± 0.48
3.711LysVal: 3.711 ± 0.496
0.756LysTrp: 0.756 ± 0.192
1.993LysTyr: 1.993 ± 0.381
0.0LysXaa: 0.0 ± 0.0
Leu
8.247LeuAla: 8.247 ± 1.289
1.306LeuCys: 1.306 ± 0.292
3.849LeuAsp: 3.849 ± 0.419
4.88LeuGlu: 4.88 ± 0.635
2.612LeuPhe: 2.612 ± 0.438
3.78LeuGly: 3.78 ± 0.414
1.237LeuHis: 1.237 ± 0.269
4.055LeuIle: 4.055 ± 0.515
4.948LeuLys: 4.948 ± 0.734
6.323LeuLeu: 6.323 ± 0.886
1.512LeuMet: 1.512 ± 0.32
3.78LeuAsn: 3.78 ± 0.519
2.818LeuPro: 2.818 ± 0.482
2.955LeuGln: 2.955 ± 0.412
4.261LeuArg: 4.261 ± 0.541
6.117LeuSer: 6.117 ± 0.686
5.155LeuThr: 5.155 ± 0.763
4.811LeuVal: 4.811 ± 0.506
0.962LeuTrp: 0.962 ± 0.251
2.199LeuTyr: 2.199 ± 0.344
0.0LeuXaa: 0.0 ± 0.0
Met
2.818MetAla: 2.818 ± 0.5
0.275MetCys: 0.275 ± 0.155
0.962MetAsp: 0.962 ± 0.286
1.649MetGlu: 1.649 ± 0.338
0.344MetPhe: 0.344 ± 0.161
1.581MetGly: 1.581 ± 0.426
0.206MetHis: 0.206 ± 0.144
1.787MetIle: 1.787 ± 0.256
1.924MetLys: 1.924 ± 0.336
1.375MetLeu: 1.375 ± 0.239
0.893MetMet: 0.893 ± 0.262
1.443MetAsn: 1.443 ± 0.266
1.512MetPro: 1.512 ± 0.353
1.237MetGln: 1.237 ± 0.253
1.443MetArg: 1.443 ± 0.244
2.062MetSer: 2.062 ± 0.442
2.749MetThr: 2.749 ± 0.451
1.581MetVal: 1.581 ± 0.252
0.206MetTrp: 0.206 ± 0.146
0.481MetTyr: 0.481 ± 0.197
0.0MetXaa: 0.0 ± 0.0
Asn
4.399AsnAla: 4.399 ± 0.604
0.344AsnCys: 0.344 ± 0.156
2.543AsnAsp: 2.543 ± 0.349
3.643AsnGlu: 3.643 ± 0.46
0.825AsnPhe: 0.825 ± 0.216
5.223AsnGly: 5.223 ± 0.681
1.031AsnHis: 1.031 ± 0.274
2.612AsnIle: 2.612 ± 0.425
3.024AsnLys: 3.024 ± 0.456
3.505AsnLeu: 3.505 ± 0.396
0.962AsnMet: 0.962 ± 0.28
2.131AsnAsn: 2.131 ± 0.436
2.405AsnPro: 2.405 ± 0.464
2.337AsnGln: 2.337 ± 0.522
2.68AsnArg: 2.68 ± 0.429
3.162AsnSer: 3.162 ± 0.351
2.955AsnThr: 2.955 ± 0.82
2.543AsnVal: 2.543 ± 0.498
1.031AsnTrp: 1.031 ± 0.389
1.924AsnTyr: 1.924 ± 0.438
0.0AsnXaa: 0.0 ± 0.0
Pro
3.436ProAla: 3.436 ± 0.448
0.344ProCys: 0.344 ± 0.151
2.749ProAsp: 2.749 ± 0.454
3.643ProGlu: 3.643 ± 0.469
1.512ProPhe: 1.512 ± 0.28
1.237ProGly: 1.237 ± 0.388
0.344ProHis: 0.344 ± 0.153
2.199ProIle: 2.199 ± 0.424
1.924ProLys: 1.924 ± 0.364
2.749ProLeu: 2.749 ± 0.512
0.825ProMet: 0.825 ± 0.22
2.062ProAsn: 2.062 ± 0.306
2.062ProPro: 2.062 ± 0.515
1.443ProGln: 1.443 ± 0.313
1.649ProArg: 1.649 ± 0.307
1.993ProSer: 1.993 ± 0.388
1.993ProThr: 1.993 ± 0.497
3.093ProVal: 3.093 ± 0.428
0.275ProTrp: 0.275 ± 0.112
0.962ProTyr: 0.962 ± 0.259
0.0ProXaa: 0.0 ± 0.0
Gln
4.399GlnAla: 4.399 ± 0.719
0.55GlnCys: 0.55 ± 0.198
1.718GlnAsp: 1.718 ± 0.366
3.162GlnGlu: 3.162 ± 0.444
1.1GlnPhe: 1.1 ± 0.31
2.337GlnGly: 2.337 ± 0.398
0.825GlnHis: 0.825 ± 0.238
2.543GlnIle: 2.543 ± 0.403
2.543GlnLys: 2.543 ± 0.377
2.474GlnLeu: 2.474 ± 0.399
1.306GlnMet: 1.306 ± 0.334
1.443GlnAsn: 1.443 ± 0.286
1.649GlnPro: 1.649 ± 0.353
3.643GlnGln: 3.643 ± 0.894
2.68GlnArg: 2.68 ± 0.449
3.986GlnSer: 3.986 ± 0.656
2.199GlnThr: 2.199 ± 0.487
3.093GlnVal: 3.093 ± 0.384
1.306GlnTrp: 1.306 ± 0.283
1.649GlnTyr: 1.649 ± 0.343
0.0GlnXaa: 0.0 ± 0.0
Arg
4.261ArgAla: 4.261 ± 0.583
1.031ArgCys: 1.031 ± 0.252
3.643ArgAsp: 3.643 ± 0.478
4.605ArgGlu: 4.605 ± 0.792
1.375ArgPhe: 1.375 ± 0.33
3.574ArgGly: 3.574 ± 0.417
1.856ArgHis: 1.856 ± 0.411
3.505ArgIle: 3.505 ± 0.433
4.467ArgLys: 4.467 ± 0.647
4.467ArgLeu: 4.467 ± 0.459
1.924ArgMet: 1.924 ± 0.39
2.68ArgAsn: 2.68 ± 0.464
1.856ArgPro: 1.856 ± 0.327
3.23ArgGln: 3.23 ± 0.46
4.055ArgArg: 4.055 ± 0.78
2.887ArgSer: 2.887 ± 0.456
2.474ArgThr: 2.474 ± 0.371
3.436ArgVal: 3.436 ± 0.431
0.756ArgTrp: 0.756 ± 0.249
2.405ArgTyr: 2.405 ± 0.406
0.0ArgXaa: 0.0 ± 0.0
Ser
6.392SerAla: 6.392 ± 0.923
0.825SerCys: 0.825 ± 0.261
3.162SerAsp: 3.162 ± 0.532
4.536SerGlu: 4.536 ± 0.613
2.68SerPhe: 2.68 ± 0.397
5.086SerGly: 5.086 ± 0.631
1.306SerHis: 1.306 ± 0.297
3.23SerIle: 3.23 ± 0.458
3.162SerLys: 3.162 ± 0.606
5.773SerLeu: 5.773 ± 0.698
1.924SerMet: 1.924 ± 0.305
2.337SerAsn: 2.337 ± 0.411
2.543SerPro: 2.543 ± 0.496
2.612SerGln: 2.612 ± 0.624
3.711SerArg: 3.711 ± 0.406
4.674SerSer: 4.674 ± 0.661
2.68SerThr: 2.68 ± 0.399
4.399SerVal: 4.399 ± 0.523
1.031SerTrp: 1.031 ± 0.243
1.924SerTyr: 1.924 ± 0.292
0.0SerXaa: 0.0 ± 0.0
Thr
6.804ThrAla: 6.804 ± 0.687
0.687ThrCys: 0.687 ± 0.258
3.162ThrAsp: 3.162 ± 0.631
4.192ThrGlu: 4.192 ± 0.555
1.581ThrPhe: 1.581 ± 0.317
4.88ThrGly: 4.88 ± 0.88
0.619ThrHis: 0.619 ± 0.182
3.299ThrIle: 3.299 ± 0.487
3.643ThrLys: 3.643 ± 0.571
4.261ThrLeu: 4.261 ± 0.493
2.062ThrMet: 2.062 ± 0.427
3.368ThrAsn: 3.368 ± 0.818
2.543ThrPro: 2.543 ± 0.54
1.993ThrGln: 1.993 ± 0.333
2.749ThrArg: 2.749 ± 0.326
2.68ThrSer: 2.68 ± 0.434
3.849ThrThr: 3.849 ± 0.763
2.749ThrVal: 2.749 ± 0.579
0.893ThrTrp: 0.893 ± 0.297
1.581ThrTyr: 1.581 ± 0.421
0.0ThrXaa: 0.0 ± 0.0
Val
5.842ValAla: 5.842 ± 0.752
0.619ValCys: 0.619 ± 0.247
3.299ValAsp: 3.299 ± 0.405
3.093ValGlu: 3.093 ± 0.45
1.856ValPhe: 1.856 ± 0.374
3.849ValGly: 3.849 ± 0.566
0.687ValHis: 0.687 ± 0.222
3.23ValIle: 3.23 ± 0.431
4.261ValLys: 4.261 ± 0.466
4.192ValLeu: 4.192 ± 0.542
1.649ValMet: 1.649 ± 0.344
4.124ValAsn: 4.124 ± 0.605
2.749ValPro: 2.749 ± 0.421
2.131ValGln: 2.131 ± 0.386
3.299ValArg: 3.299 ± 0.503
5.292ValSer: 5.292 ± 0.62
4.124ValThr: 4.124 ± 0.644
4.192ValVal: 4.192 ± 0.502
0.55ValTrp: 0.55 ± 0.181
2.131ValTyr: 2.131 ± 0.399
0.0ValXaa: 0.0 ± 0.0
Trp
1.168TrpAla: 1.168 ± 0.287
0.412TrpCys: 0.412 ± 0.189
1.1TrpAsp: 1.1 ± 0.255
0.962TrpGlu: 0.962 ± 0.225
0.687TrpPhe: 0.687 ± 0.233
0.687TrpGly: 0.687 ± 0.206
0.55TrpHis: 0.55 ± 0.246
0.893TrpIle: 0.893 ± 0.191
1.1TrpLys: 1.1 ± 0.361
1.581TrpLeu: 1.581 ± 0.44
0.275TrpMet: 0.275 ± 0.126
1.031TrpAsn: 1.031 ± 0.245
0.137TrpPro: 0.137 ± 0.083
0.962TrpGln: 0.962 ± 0.219
1.375TrpArg: 1.375 ± 0.274
0.893TrpSer: 0.893 ± 0.495
0.962TrpThr: 0.962 ± 0.363
1.306TrpVal: 1.306 ± 0.289
0.206TrpTrp: 0.206 ± 0.165
0.687TrpTyr: 0.687 ± 0.219
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.818TyrAla: 2.818 ± 0.421
0.137TyrCys: 0.137 ± 0.085
1.856TyrAsp: 1.856 ± 0.264
1.856TyrGlu: 1.856 ± 0.393
0.893TyrPhe: 0.893 ± 0.206
2.199TyrGly: 2.199 ± 0.421
1.031TyrHis: 1.031 ± 0.248
1.787TyrIle: 1.787 ± 0.432
1.649TyrLys: 1.649 ± 0.344
3.024TyrLeu: 3.024 ± 0.443
0.275TyrMet: 0.275 ± 0.106
1.787TyrAsn: 1.787 ± 0.332
1.237TyrPro: 1.237 ± 0.297
1.787TyrGln: 1.787 ± 0.474
2.543TyrArg: 2.543 ± 0.385
2.268TyrSer: 2.268 ± 0.46
1.649TyrThr: 1.649 ± 0.34
2.268TyrVal: 2.268 ± 0.429
0.756TyrTrp: 0.756 ± 0.248
0.687TyrTyr: 0.687 ± 0.171
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (14551 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski