Amino acid dipeptide frequency for Escherichia phage Evi

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each amino acid dipeptide would be present at a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
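The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function names `dipeptide_permille` and `classify` are hypothetical, dipeptides are counted as overlapping pairs within each protein, and frequencies are normalized by the total number of dipeptides (the normalization used by the database is not stated on this page).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"            # X (Xaa) stands for non-standard or unknown residues
EXPECTED_PERMILLE = 1000.0 / 400     # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and the result is normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (red/blue in the table)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

For example, `classify(9.949)` (the AlaAla value) gives `"over"`, while `classify(0.952)` (AlaCys) gives `"under"`.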

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.
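As a small worked example of the unit conversion, the helpers below (hypothetical names, not part of any Proteome-pI tool) turn a table value in per mille into a plain fraction or a percentage.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


# AlaAla is listed as 9.949 per mille, i.e. a fraction of about 0.009949,
# or roughly 0.99% of all dipeptides.
ala_ala_fraction = permille_to_fraction(9.949)
ala_ala_percent = permille_to_percent(9.949)
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰.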

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.949 ± 1.276
AlaCys: 0.952 ± 0.297
AlaAsp: 4.758 ± 0.659
AlaGlu: 8.478 ± 1.122
AlaPhe: 3.201 ± 0.626
AlaGly: 9.084 ± 1.239
AlaHis: 1.211 ± 0.279
AlaIle: 5.537 ± 0.701
AlaLys: 4.585 ± 0.794
AlaLeu: 8.911 ± 0.975
AlaMet: 2.768 ± 0.524
AlaAsn: 3.028 ± 0.462
AlaPro: 2.249 ± 0.51
AlaGln: 4.153 ± 0.626
AlaArg: 7.008 ± 1.079
AlaSer: 6.921 ± 0.955
AlaThr: 6.056 ± 0.863
AlaVal: 6.488 ± 0.65
AlaTrp: 1.903 ± 0.34
AlaTyr: 2.768 ± 0.471
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.606 ± 0.247
CysCys: 0.519 ± 0.216
CysAsp: 0.519 ± 0.214
CysGlu: 0.865 ± 0.282
CysPhe: 0.173 ± 0.119
CysGly: 1.038 ± 0.296
CysHis: 0.26 ± 0.164
CysIle: 0.606 ± 0.258
CysLys: 0.692 ± 0.253
CysLeu: 0.865 ± 0.257
CysMet: 0.173 ± 0.116
CysAsn: 0.346 ± 0.135
CysPro: 0.519 ± 0.216
CysGln: 0.433 ± 0.186
CysArg: 1.038 ± 0.362
CysSer: 0.952 ± 0.255
CysThr: 1.298 ± 0.354
CysVal: 0.779 ± 0.246
CysTrp: 0.173 ± 0.124
CysTyr: 0.433 ± 0.168
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.45 ± 0.762
AspCys: 1.125 ± 0.278
AspAsp: 3.461 ± 0.661
AspGlu: 4.066 ± 0.576
AspPhe: 2.249 ± 0.415
AspGly: 5.018 ± 0.792
AspHis: 0.433 ± 0.157
AspIle: 2.941 ± 0.518
AspLys: 2.595 ± 0.426
AspLeu: 3.634 ± 0.58
AspMet: 1.211 ± 0.359
AspAsn: 2.941 ± 0.362
AspPro: 3.114 ± 0.672
AspGln: 1.73 ± 0.444
AspArg: 2.249 ± 0.418
AspSer: 2.682 ± 0.427
AspThr: 3.114 ± 0.526
AspVal: 4.931 ± 0.69
AspTrp: 0.865 ± 0.27
AspTyr: 1.644 ± 0.345
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.623 ± 0.98
GluCys: 0.865 ± 0.304
GluAsp: 3.028 ± 0.51
GluGlu: 4.066 ± 0.643
GluPhe: 1.644 ± 0.277
GluGly: 4.326 ± 0.768
GluHis: 0.952 ± 0.323
GluIle: 3.893 ± 0.639
GluLys: 4.153 ± 0.548
GluLeu: 6.835 ± 0.734
GluMet: 2.076 ± 0.44
GluAsn: 2.941 ± 0.482
GluPro: 2.336 ± 0.489
GluGln: 5.45 ± 0.606
GluArg: 5.018 ± 0.668
GluSer: 3.461 ± 0.725
GluThr: 5.104 ± 0.893
GluVal: 4.153 ± 0.525
GluTrp: 1.125 ± 0.256
GluTyr: 2.163 ± 0.426
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.076 ± 0.486
PheCys: 0.346 ± 0.164
PheAsp: 1.903 ± 0.613
PheGlu: 2.076 ± 0.299
PhePhe: 0.865 ± 0.215
PheGly: 2.163 ± 0.363
PheHis: 0.519 ± 0.159
PheIle: 1.557 ± 0.387
PheLys: 1.384 ± 0.363
PheLeu: 2.682 ± 0.551
PheMet: 0.865 ± 0.269
PheAsn: 1.903 ± 0.397
PhePro: 0.952 ± 0.279
PheGln: 0.779 ± 0.22
PheArg: 2.595 ± 0.486
PheSer: 2.941 ± 0.518
PheThr: 1.99 ± 0.422
PheVal: 2.249 ± 0.414
PheTrp: 0.346 ± 0.162
PheTyr: 1.211 ± 0.289
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.661 ± 0.824
GlyCys: 0.779 ± 0.224
GlyAsp: 4.239 ± 0.77
GlyGlu: 5.537 ± 0.918
GlyPhe: 2.595 ± 0.501
GlyGly: 4.499 ± 0.675
GlyHis: 1.384 ± 0.322
GlyIle: 4.066 ± 0.782
GlyLys: 3.634 ± 0.585
GlyLeu: 5.623 ± 0.836
GlyMet: 1.817 ± 0.428
GlyAsn: 2.941 ± 0.662
GlyPro: 3.461 ± 1.768
GlyGln: 2.768 ± 0.399
GlyArg: 4.239 ± 0.608
GlySer: 3.893 ± 0.53
GlyThr: 3.98 ± 0.853
GlyVal: 5.45 ± 0.678
GlyTrp: 1.038 ± 0.27
GlyTyr: 2.163 ± 0.364
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.076 ± 0.462
HisCys: 0.346 ± 0.155
HisAsp: 1.298 ± 0.344
HisGlu: 0.606 ± 0.247
HisPhe: 0.346 ± 0.169
HisGly: 0.865 ± 0.323
HisHis: 0.952 ± 0.288
HisIle: 1.211 ± 0.293
HisLys: 0.952 ± 0.307
HisLeu: 1.817 ± 0.417
HisMet: 0.692 ± 0.218
HisAsn: 0.865 ± 0.27
HisPro: 1.125 ± 0.31
HisGln: 1.038 ± 0.33
HisArg: 0.865 ± 0.209
HisSer: 0.606 ± 0.233
HisThr: 0.519 ± 0.182
HisVal: 1.211 ± 0.396
HisTrp: 0.26 ± 0.138
HisTyr: 0.346 ± 0.142
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.104 ± 0.687
IleCys: 1.038 ± 0.304
IleAsp: 3.72 ± 0.522
IleGlu: 3.028 ± 0.41
IlePhe: 1.99 ± 0.446
IleGly: 2.249 ± 0.414
IleHis: 1.125 ± 0.307
IleIle: 2.768 ± 0.453
IleLys: 3.287 ± 0.525
IleLeu: 2.768 ± 0.553
IleMet: 1.298 ± 0.332
IleAsn: 1.644 ± 0.281
IlePro: 2.682 ± 0.555
IleGln: 1.471 ± 0.289
IleArg: 3.72 ± 0.539
IleSer: 5.191 ± 0.553
IleThr: 3.72 ± 0.687
IleVal: 2.855 ± 0.488
IleTrp: 0.606 ± 0.224
IleTyr: 1.038 ± 0.391
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.845 ± 0.663
LysCys: 0.952 ± 0.303
LysAsp: 2.855 ± 0.43
LysGlu: 3.114 ± 0.523
LysPhe: 1.471 ± 0.436
LysGly: 4.239 ± 0.733
LysHis: 1.125 ± 0.26
LysIle: 3.374 ± 0.615
LysLys: 3.547 ± 0.668
LysLeu: 3.72 ± 0.601
LysMet: 1.125 ± 0.268
LysAsn: 2.076 ± 0.436
LysPro: 1.644 ± 0.354
LysGln: 2.855 ± 0.46
LysArg: 3.634 ± 0.531
LysSer: 2.509 ± 0.578
LysThr: 3.807 ± 0.508
LysVal: 3.98 ± 0.602
LysTrp: 1.038 ± 0.303
LysTyr: 1.644 ± 0.366
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.959 ± 0.921
LeuCys: 1.125 ± 0.306
LeuAsp: 4.672 ± 0.467
LeuGlu: 3.893 ± 0.438
LeuPhe: 1.73 ± 0.398
LeuGly: 5.018 ± 0.625
LeuHis: 1.471 ± 0.42
LeuIle: 4.066 ± 0.665
LeuLys: 4.758 ± 0.473
LeuLeu: 5.969 ± 0.689
LeuMet: 2.076 ± 0.44
LeuAsn: 3.114 ± 0.484
LeuPro: 3.547 ± 0.636
LeuGln: 2.595 ± 0.504
LeuArg: 6.315 ± 0.682
LeuSer: 6.835 ± 0.745
LeuThr: 5.537 ± 0.737
LeuVal: 5.364 ± 0.551
LeuTrp: 1.384 ± 0.264
LeuTyr: 2.163 ± 0.552
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.509 ± 0.466
MetCys: 0.0 ± 0.0
MetAsp: 1.038 ± 0.347
MetGlu: 0.779 ± 0.27
MetPhe: 0.433 ± 0.183
MetGly: 1.471 ± 0.356
MetHis: 0.087 ± 0.098
MetIle: 1.384 ± 0.376
MetLys: 1.471 ± 0.349
MetLeu: 1.817 ± 0.443
MetMet: 0.433 ± 0.247
MetAsn: 1.903 ± 0.282
MetPro: 1.73 ± 0.43
MetGln: 1.557 ± 0.367
MetArg: 2.422 ± 0.397
MetSer: 2.249 ± 0.509
MetThr: 1.298 ± 0.282
MetVal: 1.471 ± 0.325
MetTrp: 0.519 ± 0.184
MetTyr: 0.606 ± 0.205
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.796 ± 0.671
AsnCys: 0.433 ± 0.155
AsnAsp: 2.249 ± 0.463
AsnGlu: 3.374 ± 0.575
AsnPhe: 1.211 ± 0.363
AsnGly: 4.239 ± 0.656
AsnHis: 0.952 ± 0.261
AsnIle: 2.768 ± 0.475
AsnLys: 1.903 ± 0.355
AsnLeu: 2.336 ± 0.538
AsnMet: 0.865 ± 0.202
AsnAsn: 1.73 ± 0.388
AsnPro: 1.99 ± 0.387
AsnGln: 2.768 ± 0.557
AsnArg: 1.99 ± 0.453
AsnSer: 2.076 ± 0.432
AsnThr: 2.249 ± 0.385
AsnVal: 2.076 ± 0.362
AsnTrp: 0.779 ± 0.242
AsnTyr: 0.606 ± 0.234
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.412 ± 0.825
ProCys: 0.346 ± 0.152
ProAsp: 3.547 ± 0.523
ProGlu: 4.412 ± 0.74
ProPhe: 1.211 ± 0.353
ProGly: 2.682 ± 0.49
ProHis: 1.125 ± 0.351
ProIle: 1.211 ± 0.299
ProLys: 2.768 ± 0.673
ProLeu: 3.114 ± 0.609
ProMet: 0.519 ± 0.2
ProAsn: 1.125 ± 0.367
ProPro: 2.249 ± 0.492
ProGln: 1.644 ± 0.453
ProArg: 1.644 ± 0.394
ProSer: 2.336 ± 0.461
ProThr: 1.557 ± 0.4
ProVal: 4.153 ± 0.682
ProTrp: 0.779 ± 0.286
ProTyr: 0.865 ± 0.306
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.499 ± 0.842
GlnCys: 0.692 ± 0.244
GlnAsp: 1.903 ± 0.429
GlnGlu: 3.201 ± 0.58
GlnPhe: 1.298 ± 0.273
GlnGly: 2.509 ± 0.618
GlnHis: 1.471 ± 0.34
GlnIle: 1.99 ± 0.418
GlnLys: 2.855 ± 0.582
GlnLeu: 3.114 ± 0.473
GlnMet: 1.211 ± 0.365
GlnAsn: 3.287 ± 0.704
GlnPro: 1.73 ± 0.442
GlnGln: 2.595 ± 0.534
GlnArg: 2.855 ± 0.483
GlnSer: 3.461 ± 0.536
GlnThr: 2.509 ± 0.441
GlnVal: 3.547 ± 0.478
GlnTrp: 0.26 ± 0.157
GlnTyr: 1.211 ± 0.24
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.354 ± 0.875
ArgCys: 0.692 ± 0.2
ArgAsp: 3.287 ± 0.713
ArgGlu: 5.537 ± 0.9
ArgPhe: 2.336 ± 0.564
ArgGly: 4.239 ± 0.61
ArgHis: 1.211 ± 0.252
ArgIle: 3.028 ± 0.526
ArgLys: 2.941 ± 0.493
ArgLeu: 6.488 ± 0.855
ArgMet: 1.817 ± 0.442
ArgAsn: 3.461 ± 0.609
ArgPro: 1.903 ± 0.399
ArgGln: 3.201 ± 0.733
ArgArg: 4.326 ± 0.74
ArgSer: 2.595 ± 0.503
ArgThr: 2.768 ± 0.348
ArgVal: 3.547 ± 0.583
ArgTrp: 1.384 ± 0.309
ArgTyr: 2.422 ± 0.387
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.392 ± 1.409
SerCys: 0.346 ± 0.218
SerAsp: 4.153 ± 0.551
SerGlu: 4.499 ± 0.644
SerPhe: 1.557 ± 0.371
SerGly: 6.748 ± 0.682
SerHis: 0.865 ± 0.267
SerIle: 1.73 ± 0.454
SerLys: 2.682 ± 0.395
SerLeu: 4.066 ± 0.752
SerMet: 2.163 ± 0.528
SerAsn: 1.817 ± 0.348
SerPro: 3.028 ± 0.592
SerGln: 2.941 ± 0.484
SerArg: 4.845 ± 0.612
SerSer: 3.893 ± 0.643
SerThr: 3.547 ± 0.586
SerVal: 4.845 ± 0.803
SerTrp: 0.606 ± 0.239
SerTyr: 1.73 ± 0.393
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.402 ± 0.845
ThrCys: 0.606 ± 0.203
ThrAsp: 3.72 ± 0.531
ThrGlu: 5.191 ± 0.612
ThrPhe: 1.903 ± 0.455
ThrGly: 4.845 ± 1.056
ThrHis: 0.865 ± 0.211
ThrIle: 2.855 ± 0.501
ThrLys: 2.682 ± 0.412
ThrLeu: 5.537 ± 0.621
ThrMet: 1.298 ± 0.359
ThrAsn: 1.73 ± 0.353
ThrPro: 2.163 ± 0.757
ThrGln: 2.595 ± 0.495
ThrArg: 3.028 ± 0.448
ThrSer: 3.634 ± 0.518
ThrThr: 3.547 ± 0.629
ThrVal: 5.018 ± 0.771
ThrTrp: 0.952 ± 0.297
ThrTyr: 1.903 ± 0.33
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.883 ± 0.769
ValCys: 0.692 ± 0.27
ValAsp: 3.028 ± 0.516
ValGlu: 3.72 ± 0.69
ValPhe: 2.509 ± 0.436
ValGly: 2.595 ± 0.61
ValHis: 0.952 ± 0.293
ValIle: 4.499 ± 0.575
ValLys: 4.499 ± 0.616
ValLeu: 6.661 ± 1.003
ValMet: 1.644 ± 0.33
ValAsn: 3.72 ± 0.475
ValPro: 3.461 ± 0.442
ValGln: 2.682 ± 0.588
ValArg: 4.153 ± 0.548
ValSer: 5.364 ± 0.574
ValThr: 5.45 ± 0.801
ValVal: 4.412 ± 0.633
ValTrp: 1.384 ± 0.385
ValTyr: 1.817 ± 0.362
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.384 ± 0.299
TrpCys: 0.087 ± 0.072
TrpAsp: 0.952 ± 0.368
TrpGlu: 0.952 ± 0.274
TrpPhe: 0.606 ± 0.259
TrpGly: 0.952 ± 0.297
TrpHis: 0.519 ± 0.171
TrpIle: 0.606 ± 0.199
TrpLys: 1.038 ± 0.301
TrpLeu: 1.557 ± 0.323
TrpMet: 0.606 ± 0.238
TrpAsn: 0.692 ± 0.223
TrpPro: 0.519 ± 0.243
TrpGln: 1.125 ± 0.289
TrpArg: 1.298 ± 0.303
TrpSer: 0.865 ± 0.228
TrpThr: 0.606 ± 0.191
TrpVal: 1.211 ± 0.261
TrpTrp: 0.606 ± 0.203
TrpTyr: 0.433 ± 0.157
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.028 ± 0.517
TyrCys: 0.346 ± 0.151
TyrAsp: 1.125 ± 0.314
TyrGlu: 1.817 ± 0.48
TyrPhe: 2.076 ± 0.429
TyrGly: 1.99 ± 0.322
TyrHis: 0.519 ± 0.155
TyrIle: 1.211 ± 0.326
TyrLys: 1.125 ± 0.333
TyrLeu: 2.076 ± 0.44
TyrMet: 0.433 ± 0.215
TyrAsn: 1.125 ± 0.252
TyrPro: 1.298 ± 0.295
TyrGln: 1.817 ± 0.444
TyrArg: 1.384 ± 0.343
TyrSer: 1.99 ± 0.41
TyrThr: 1.817 ± 0.375
TyrVal: 1.384 ± 0.392
TyrTrp: 0.606 ± 0.214
TyrTyr: 0.606 ± 0.198
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (11560 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
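A protein-level bootstrap of this kind might look like the sketch below. The page states only that 100 bootstrap rounds were run at the protein level; resampling whole proteins with replacement and reporting the standard deviation across rounds as the error are assumptions made here for illustration, and the function names are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of the frequency over bootstrap rounds.

    Assumption: each round draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return mean(estimates), stdev(estimates)
```

Resampling whole proteins rather than single dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins in the proteome.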
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski