Amino acid dipepetide frequency for Escherichia phage mEp234

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.078AlaAla: 11.078 ± 1.643
0.507AlaCys: 0.507 ± 0.248
7.019AlaAsp: 7.019 ± 0.757
6.765AlaGlu: 6.765 ± 1.634
3.383AlaPhe: 3.383 ± 0.545
7.188AlaGly: 7.188 ± 0.979
1.438AlaHis: 1.438 ± 0.416
7.273AlaIle: 7.273 ± 0.721
5.92AlaLys: 5.92 ± 1.153
7.273AlaLeu: 7.273 ± 0.829
3.89AlaMet: 3.89 ± 0.618
4.989AlaAsn: 4.989 ± 0.631
1.691AlaPro: 1.691 ± 0.329
4.482AlaGln: 4.482 ± 0.677
5.581AlaArg: 5.581 ± 0.777
5.751AlaSer: 5.751 ± 1.168
5.666AlaThr: 5.666 ± 0.975
5.581AlaVal: 5.581 ± 0.804
1.438AlaTrp: 1.438 ± 0.379
2.791AlaTyr: 2.791 ± 0.461
0.0AlaXaa: 0.0 ± 0.0
Cys
1.099CysAla: 1.099 ± 0.368
0.254CysCys: 0.254 ± 0.167
1.015CysAsp: 1.015 ± 0.292
0.846CysGlu: 0.846 ± 0.226
0.169CysPhe: 0.169 ± 0.118
0.93CysGly: 0.93 ± 0.309
0.423CysHis: 0.423 ± 0.202
0.761CysIle: 0.761 ± 0.268
0.761CysLys: 0.761 ± 0.257
0.846CysLeu: 0.846 ± 0.339
0.254CysMet: 0.254 ± 0.16
0.507CysAsn: 0.507 ± 0.207
0.423CysPro: 0.423 ± 0.234
0.085CysGln: 0.085 ± 0.086
0.677CysArg: 0.677 ± 0.289
0.761CysSer: 0.761 ± 0.232
0.677CysThr: 0.677 ± 0.271
0.761CysVal: 0.761 ± 0.291
0.169CysTrp: 0.169 ± 0.18
0.085CysTyr: 0.085 ± 0.084
0.0CysXaa: 0.0 ± 0.0
Asp
6.258AspAla: 6.258 ± 0.702
0.592AspCys: 0.592 ± 0.273
3.214AspAsp: 3.214 ± 0.582
4.144AspGlu: 4.144 ± 0.649
2.537AspPhe: 2.537 ± 0.521
5.328AspGly: 5.328 ± 0.807
0.592AspHis: 0.592 ± 0.253
2.452AspIle: 2.452 ± 0.396
3.129AspLys: 3.129 ± 0.489
5.581AspLeu: 5.581 ± 0.596
1.522AspMet: 1.522 ± 0.341
3.214AspAsn: 3.214 ± 0.473
1.691AspPro: 1.691 ± 0.353
2.03AspGln: 2.03 ± 0.438
3.214AspArg: 3.214 ± 0.528
3.214AspSer: 3.214 ± 0.511
2.622AspThr: 2.622 ± 0.634
4.059AspVal: 4.059 ± 0.789
0.677AspTrp: 0.677 ± 0.245
2.706AspTyr: 2.706 ± 0.496
0.0AspXaa: 0.0 ± 0.0
Glu
5.92GluAla: 5.92 ± 1.458
0.93GluCys: 0.93 ± 0.355
2.368GluAsp: 2.368 ± 0.495
5.159GluGlu: 5.159 ± 1.284
2.452GluPhe: 2.452 ± 0.389
3.467GluGly: 3.467 ± 0.565
0.846GluHis: 0.846 ± 0.279
4.144GluIle: 4.144 ± 0.551
4.397GluLys: 4.397 ± 0.816
4.989GluLeu: 4.989 ± 0.72
1.438GluMet: 1.438 ± 0.341
3.383GluAsn: 3.383 ± 0.594
1.607GluPro: 1.607 ± 0.388
4.059GluGln: 4.059 ± 0.61
4.82GluArg: 4.82 ± 1.154
4.144GluSer: 4.144 ± 0.62
2.706GluThr: 2.706 ± 0.425
4.228GluVal: 4.228 ± 0.614
0.761GluTrp: 0.761 ± 0.295
1.438GluTyr: 1.438 ± 0.379
0.0GluXaa: 0.0 ± 0.0
Phe
3.044PheAla: 3.044 ± 0.441
0.338PheCys: 0.338 ± 0.153
2.537PheAsp: 2.537 ± 0.406
2.452PheGlu: 2.452 ± 0.565
0.761PhePhe: 0.761 ± 0.271
3.129PheGly: 3.129 ± 0.422
0.507PheHis: 0.507 ± 0.21
2.283PheIle: 2.283 ± 0.559
2.03PheLys: 2.03 ± 0.496
1.522PheLeu: 1.522 ± 0.379
0.93PheMet: 0.93 ± 0.279
1.691PheAsn: 1.691 ± 0.313
1.438PhePro: 1.438 ± 0.36
1.099PheGln: 1.099 ± 0.279
1.522PheArg: 1.522 ± 0.369
2.706PheSer: 2.706 ± 0.44
2.368PheThr: 2.368 ± 0.441
1.691PheVal: 1.691 ± 0.325
0.592PheTrp: 0.592 ± 0.217
1.438PheTyr: 1.438 ± 0.344
0.0PheXaa: 0.0 ± 0.0
Gly
6.681GlyAla: 6.681 ± 1.064
0.93GlyCys: 0.93 ± 0.295
4.567GlyAsp: 4.567 ± 0.607
3.636GlyGlu: 3.636 ± 0.463
3.044GlyPhe: 3.044 ± 0.573
5.835GlyGly: 5.835 ± 0.859
1.268GlyHis: 1.268 ± 0.329
3.298GlyIle: 3.298 ± 0.496
3.89GlyLys: 3.89 ± 0.49
6.427GlyLeu: 6.427 ± 0.746
2.96GlyMet: 2.96 ± 0.584
4.228GlyAsn: 4.228 ± 0.893
1.522GlyPro: 1.522 ± 0.374
3.129GlyGln: 3.129 ± 0.474
3.975GlyArg: 3.975 ± 0.605
3.636GlySer: 3.636 ± 0.574
5.074GlyThr: 5.074 ± 0.821
4.651GlyVal: 4.651 ± 0.637
1.099GlyTrp: 1.099 ± 0.37
2.452GlyTyr: 2.452 ± 0.42
0.0GlyXaa: 0.0 ± 0.0
His
1.268HisAla: 1.268 ± 0.356
0.085HisCys: 0.085 ± 0.09
1.015HisAsp: 1.015 ± 0.316
1.607HisGlu: 1.607 ± 0.468
0.338HisPhe: 0.338 ± 0.17
1.353HisGly: 1.353 ± 0.363
0.592HisHis: 0.592 ± 0.216
0.592HisIle: 0.592 ± 0.238
1.268HisLys: 1.268 ± 0.357
0.93HisLeu: 0.93 ± 0.411
0.085HisMet: 0.085 ± 0.084
0.423HisAsn: 0.423 ± 0.188
1.015HisPro: 1.015 ± 0.317
1.015HisGln: 1.015 ± 0.324
1.607HisArg: 1.607 ± 0.418
0.93HisSer: 0.93 ± 0.3
0.423HisThr: 0.423 ± 0.218
0.592HisVal: 0.592 ± 0.266
0.254HisTrp: 0.254 ± 0.139
0.338HisTyr: 0.338 ± 0.157
0.0HisXaa: 0.0 ± 0.0
Ile
4.82IleAla: 4.82 ± 0.614
0.677IleCys: 0.677 ± 0.279
4.397IleAsp: 4.397 ± 0.706
3.298IleGlu: 3.298 ± 0.645
2.03IlePhe: 2.03 ± 0.475
3.298IleGly: 3.298 ± 0.647
0.93IleHis: 0.93 ± 0.31
2.791IleIle: 2.791 ± 0.54
2.622IleLys: 2.622 ± 0.415
4.144IleLeu: 4.144 ± 0.653
0.846IleMet: 0.846 ± 0.362
3.552IleAsn: 3.552 ± 0.507
2.114IlePro: 2.114 ± 0.42
2.875IleGln: 2.875 ± 0.595
3.298IleArg: 3.298 ± 0.614
5.159IleSer: 5.159 ± 0.888
3.89IleThr: 3.89 ± 0.466
2.875IleVal: 2.875 ± 0.601
1.184IleTrp: 1.184 ± 0.332
1.522IleTyr: 1.522 ± 0.37
0.0IleXaa: 0.0 ± 0.0
Lys
7.188LysAla: 7.188 ± 1.297
0.338LysCys: 0.338 ± 0.176
3.805LysAsp: 3.805 ± 0.586
3.975LysGlu: 3.975 ± 0.741
1.184LysPhe: 1.184 ± 0.336
3.636LysGly: 3.636 ± 0.564
0.592LysHis: 0.592 ± 0.235
2.283LysIle: 2.283 ± 0.531
3.721LysLys: 3.721 ± 0.939
4.567LysLeu: 4.567 ± 0.567
1.776LysMet: 1.776 ± 0.463
2.537LysAsn: 2.537 ± 0.391
3.383LysPro: 3.383 ± 0.655
3.467LysGln: 3.467 ± 0.679
3.044LysArg: 3.044 ± 0.67
3.721LysSer: 3.721 ± 0.536
3.636LysThr: 3.636 ± 0.487
3.044LysVal: 3.044 ± 0.449
0.846LysTrp: 0.846 ± 0.289
2.283LysTyr: 2.283 ± 0.441
0.0LysXaa: 0.0 ± 0.0
Leu
7.696LeuAla: 7.696 ± 0.745
1.438LeuCys: 1.438 ± 0.424
4.397LeuAsp: 4.397 ± 0.537
4.651LeuGlu: 4.651 ± 0.674
2.452LeuPhe: 2.452 ± 0.503
4.059LeuGly: 4.059 ± 0.818
1.015LeuHis: 1.015 ± 0.357
4.82LeuIle: 4.82 ± 0.621
4.82LeuLys: 4.82 ± 0.562
5.328LeuLeu: 5.328 ± 0.713
1.86LeuMet: 1.86 ± 0.412
4.567LeuAsn: 4.567 ± 0.747
3.383LeuPro: 3.383 ± 0.505
2.791LeuGln: 2.791 ± 0.541
5.497LeuArg: 5.497 ± 0.667
6.173LeuSer: 6.173 ± 0.826
4.82LeuThr: 4.82 ± 0.599
4.482LeuVal: 4.482 ± 0.603
0.592LeuTrp: 0.592 ± 0.233
2.283LeuTyr: 2.283 ± 0.364
0.0LeuXaa: 0.0 ± 0.0
Met
3.044MetAla: 3.044 ± 0.513
0.254MetCys: 0.254 ± 0.146
1.268MetAsp: 1.268 ± 0.306
1.438MetGlu: 1.438 ± 0.421
0.761MetPhe: 0.761 ± 0.272
1.86MetGly: 1.86 ± 0.373
0.423MetHis: 0.423 ± 0.169
1.353MetIle: 1.353 ± 0.307
2.114MetLys: 2.114 ± 0.486
2.283MetLeu: 2.283 ± 0.366
0.761MetMet: 0.761 ± 0.262
0.592MetAsn: 0.592 ± 0.219
1.099MetPro: 1.099 ± 0.238
1.268MetGln: 1.268 ± 0.258
1.86MetArg: 1.86 ± 0.354
2.452MetSer: 2.452 ± 0.516
2.452MetThr: 2.452 ± 0.494
0.677MetVal: 0.677 ± 0.269
0.254MetTrp: 0.254 ± 0.122
0.846MetTyr: 0.846 ± 0.291
0.0MetXaa: 0.0 ± 0.0
Asn
5.074AsnAla: 5.074 ± 0.857
0.423AsnCys: 0.423 ± 0.224
2.368AsnAsp: 2.368 ± 0.359
2.199AsnGlu: 2.199 ± 0.421
1.268AsnPhe: 1.268 ± 0.363
4.567AsnGly: 4.567 ± 0.677
1.099AsnHis: 1.099 ± 0.328
3.805AsnIle: 3.805 ± 0.512
2.706AsnLys: 2.706 ± 0.518
3.129AsnLeu: 3.129 ± 0.483
0.761AsnMet: 0.761 ± 0.228
2.452AsnAsn: 2.452 ± 0.498
2.283AsnPro: 2.283 ± 0.465
2.452AsnGln: 2.452 ± 0.46
2.283AsnArg: 2.283 ± 0.685
2.706AsnSer: 2.706 ± 0.617
3.298AsnThr: 3.298 ± 0.674
1.86AsnVal: 1.86 ± 0.588
0.761AsnTrp: 0.761 ± 0.222
1.522AsnTyr: 1.522 ± 0.391
0.0AsnXaa: 0.0 ± 0.0
Pro
3.383ProAla: 3.383 ± 0.533
0.677ProCys: 0.677 ± 0.224
1.522ProAsp: 1.522 ± 0.375
2.368ProGlu: 2.368 ± 0.598
1.268ProPhe: 1.268 ± 0.331
2.706ProGly: 2.706 ± 0.452
0.846ProHis: 0.846 ± 0.28
1.691ProIle: 1.691 ± 0.467
2.03ProLys: 2.03 ± 0.42
2.791ProLeu: 2.791 ± 0.698
0.846ProMet: 0.846 ± 0.314
1.184ProAsn: 1.184 ± 0.373
1.438ProPro: 1.438 ± 0.318
1.86ProGln: 1.86 ± 0.409
2.03ProArg: 2.03 ± 0.394
3.383ProSer: 3.383 ± 0.634
1.691ProThr: 1.691 ± 0.405
2.791ProVal: 2.791 ± 0.47
0.507ProTrp: 0.507 ± 0.233
0.93ProTyr: 0.93 ± 0.207
0.0ProXaa: 0.0 ± 0.0
Gln
5.159GlnAla: 5.159 ± 0.88
0.423GlnCys: 0.423 ± 0.164
1.522GlnAsp: 1.522 ± 0.382
2.114GlnGlu: 2.114 ± 0.324
1.268GlnPhe: 1.268 ± 0.323
2.114GlnGly: 2.114 ± 0.504
0.677GlnHis: 0.677 ± 0.229
2.791GlnIle: 2.791 ± 0.401
3.636GlnLys: 3.636 ± 0.624
3.89GlnLeu: 3.89 ± 0.644
1.353GlnMet: 1.353 ± 0.298
2.368GlnAsn: 2.368 ± 0.504
1.86GlnPro: 1.86 ± 0.397
2.875GlnGln: 2.875 ± 0.544
3.298GlnArg: 3.298 ± 0.703
3.298GlnSer: 3.298 ± 0.636
2.706GlnThr: 2.706 ± 0.549
3.467GlnVal: 3.467 ± 0.48
0.761GlnTrp: 0.761 ± 0.291
1.184GlnTyr: 1.184 ± 0.408
0.0GlnXaa: 0.0 ± 0.0
Arg
5.497ArgAla: 5.497 ± 0.812
0.677ArgCys: 0.677 ± 0.243
3.89ArgAsp: 3.89 ± 0.567
4.736ArgGlu: 4.736 ± 1.175
2.199ArgPhe: 2.199 ± 0.474
3.129ArgGly: 3.129 ± 0.464
1.184ArgHis: 1.184 ± 0.304
3.044ArgIle: 3.044 ± 0.736
4.651ArgLys: 4.651 ± 0.644
5.159ArgLeu: 5.159 ± 0.582
1.945ArgMet: 1.945 ± 0.415
3.129ArgAsn: 3.129 ± 0.446
1.522ArgPro: 1.522 ± 0.356
3.129ArgGln: 3.129 ± 0.577
4.059ArgArg: 4.059 ± 0.985
2.622ArgSer: 2.622 ± 0.571
3.636ArgThr: 3.636 ± 0.565
3.298ArgVal: 3.298 ± 0.653
1.607ArgTrp: 1.607 ± 0.401
2.283ArgTyr: 2.283 ± 0.426
0.0ArgXaa: 0.0 ± 0.0
Ser
6.934SerAla: 6.934 ± 1.021
0.761SerCys: 0.761 ± 0.217
3.805SerAsp: 3.805 ± 0.529
4.144SerGlu: 4.144 ± 0.571
2.199SerPhe: 2.199 ± 0.384
5.581SerGly: 5.581 ± 0.828
0.677SerHis: 0.677 ± 0.294
3.298SerIle: 3.298 ± 0.457
2.791SerLys: 2.791 ± 0.644
5.243SerLeu: 5.243 ± 0.706
2.03SerMet: 2.03 ± 0.475
2.452SerAsn: 2.452 ± 0.512
3.129SerPro: 3.129 ± 0.494
3.805SerGln: 3.805 ± 0.795
4.905SerArg: 4.905 ± 0.773
4.905SerSer: 4.905 ± 1.045
3.805SerThr: 3.805 ± 0.574
5.328SerVal: 5.328 ± 0.99
1.268SerTrp: 1.268 ± 0.282
1.099SerTyr: 1.099 ± 0.263
0.0SerXaa: 0.0 ± 0.0
Thr
6.004ThrAla: 6.004 ± 0.723
0.592ThrCys: 0.592 ± 0.207
3.636ThrAsp: 3.636 ± 0.527
3.129ThrGlu: 3.129 ± 0.64
3.214ThrPhe: 3.214 ± 0.67
5.497ThrGly: 5.497 ± 0.81
0.507ThrHis: 0.507 ± 0.205
3.636ThrIle: 3.636 ± 0.525
2.96ThrLys: 2.96 ± 0.469
4.144ThrLeu: 4.144 ± 0.496
1.438ThrMet: 1.438 ± 0.281
1.86ThrAsn: 1.86 ± 0.415
2.368ThrPro: 2.368 ± 0.491
2.537ThrGln: 2.537 ± 0.526
2.368ThrArg: 2.368 ± 0.381
3.805ThrSer: 3.805 ± 0.518
2.875ThrThr: 2.875 ± 0.556
4.567ThrVal: 4.567 ± 0.582
1.184ThrTrp: 1.184 ± 0.39
1.945ThrTyr: 1.945 ± 0.469
0.0ThrXaa: 0.0 ± 0.0
Val
5.497ValAla: 5.497 ± 0.68
0.592ValCys: 0.592 ± 0.239
3.383ValAsp: 3.383 ± 0.468
4.059ValGlu: 4.059 ± 0.55
2.199ValPhe: 2.199 ± 0.386
4.482ValGly: 4.482 ± 0.827
0.93ValHis: 0.93 ± 0.283
3.552ValIle: 3.552 ± 0.6
3.975ValLys: 3.975 ± 0.467
5.497ValLeu: 5.497 ± 0.759
1.607ValMet: 1.607 ± 0.322
2.283ValAsn: 2.283 ± 0.47
2.283ValPro: 2.283 ± 0.415
1.776ValGln: 1.776 ± 0.394
3.298ValArg: 3.298 ± 0.501
5.412ValSer: 5.412 ± 0.847
3.383ValThr: 3.383 ± 0.446
4.651ValVal: 4.651 ± 0.873
0.93ValTrp: 0.93 ± 0.303
1.691ValTyr: 1.691 ± 0.397
0.0ValXaa: 0.0 ± 0.0
Trp
0.677TrpAla: 0.677 ± 0.197
0.507TrpCys: 0.507 ± 0.216
1.184TrpAsp: 1.184 ± 0.282
0.761TrpGlu: 0.761 ± 0.217
0.846TrpPhe: 0.846 ± 0.289
1.353TrpGly: 1.353 ± 0.327
0.846TrpHis: 0.846 ± 0.261
0.761TrpIle: 0.761 ± 0.207
0.846TrpLys: 0.846 ± 0.265
1.522TrpLeu: 1.522 ± 0.433
0.338TrpMet: 0.338 ± 0.166
0.338TrpAsn: 0.338 ± 0.172
0.423TrpPro: 0.423 ± 0.186
0.507TrpGln: 0.507 ± 0.216
1.015TrpArg: 1.015 ± 0.299
1.015TrpSer: 1.015 ± 0.295
1.015TrpThr: 1.015 ± 0.384
1.184TrpVal: 1.184 ± 0.27
0.423TrpTrp: 0.423 ± 0.181
0.423TrpTyr: 0.423 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.383TyrAla: 3.383 ± 0.508
0.507TyrCys: 0.507 ± 0.175
1.86TyrAsp: 1.86 ± 0.315
1.776TyrGlu: 1.776 ± 0.428
0.507TyrPhe: 0.507 ± 0.243
2.791TyrGly: 2.791 ± 0.469
0.338TyrHis: 0.338 ± 0.165
1.691TyrIle: 1.691 ± 0.358
0.846TyrLys: 0.846 ± 0.266
1.86TyrLeu: 1.86 ± 0.35
0.338TyrMet: 0.338 ± 0.201
1.268TyrAsn: 1.268 ± 0.33
1.268TyrPro: 1.268 ± 0.362
1.522TyrGln: 1.522 ± 0.364
3.044TyrArg: 3.044 ± 0.56
2.368TyrSer: 2.368 ± 0.574
1.522TyrThr: 1.522 ± 0.461
1.691TyrVal: 1.691 ± 0.349
0.677TyrTrp: 0.677 ± 0.221
0.846TyrTyr: 0.846 ± 0.253
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (11826 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski