Amino acid dipepetide frequency for Enterococcus phage IME-EF4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.66AlaAla: 0.66 ± 0.332
0.413AlaCys: 0.413 ± 0.212
3.302AlaAsp: 3.302 ± 0.604
3.963AlaGlu: 3.963 ± 0.483
2.724AlaPhe: 2.724 ± 0.48
3.137AlaGly: 3.137 ± 0.631
0.991AlaHis: 0.991 ± 0.262
5.201AlaIle: 5.201 ± 0.937
5.449AlaLys: 5.449 ± 0.802
5.779AlaLeu: 5.779 ± 0.645
2.724AlaMet: 2.724 ± 0.679
3.963AlaAsn: 3.963 ± 0.651
2.146AlaPro: 2.146 ± 0.342
1.486AlaGln: 1.486 ± 0.29
1.651AlaArg: 1.651 ± 0.335
3.55AlaSer: 3.55 ± 0.473
4.045AlaThr: 4.045 ± 0.68
4.375AlaVal: 4.375 ± 0.757
0.495AlaTrp: 0.495 ± 0.202
2.642AlaTyr: 2.642 ± 0.433
0.0AlaXaa: 0.0 ± 0.0
Cys
0.248CysAla: 0.248 ± 0.123
0.0CysCys: 0.0 ± 0.0
0.413CysAsp: 0.413 ± 0.186
0.66CysGlu: 0.66 ± 0.224
0.165CysPhe: 0.165 ± 0.124
0.413CysGly: 0.413 ± 0.216
0.248CysHis: 0.248 ± 0.152
0.33CysIle: 0.33 ± 0.209
0.413CysLys: 0.413 ± 0.189
0.413CysLeu: 0.413 ± 0.191
0.33CysMet: 0.33 ± 0.163
0.495CysAsn: 0.495 ± 0.21
0.0CysPro: 0.0 ± 0.0
0.083CysGln: 0.083 ± 0.073
0.248CysArg: 0.248 ± 0.161
0.495CysSer: 0.495 ± 0.185
0.495CysThr: 0.495 ± 0.214
0.248CysVal: 0.248 ± 0.126
0.165CysTrp: 0.165 ± 0.124
0.165CysTyr: 0.165 ± 0.109
0.0CysXaa: 0.0 ± 0.0
Asp
3.137AspAla: 3.137 ± 0.602
0.33AspCys: 0.33 ± 0.195
2.312AspAsp: 2.312 ± 0.525
4.375AspGlu: 4.375 ± 0.489
2.724AspPhe: 2.724 ± 0.572
4.623AspGly: 4.623 ± 0.627
0.578AspHis: 0.578 ± 0.234
4.541AspIle: 4.541 ± 0.807
6.604AspLys: 6.604 ± 0.751
5.201AspLeu: 5.201 ± 0.664
1.899AspMet: 1.899 ± 0.436
4.706AspAsn: 4.706 ± 0.612
2.146AspPro: 2.146 ± 0.468
1.238AspGln: 1.238 ± 0.206
2.312AspArg: 2.312 ± 0.48
3.055AspSer: 3.055 ± 0.499
3.88AspThr: 3.88 ± 0.729
4.293AspVal: 4.293 ± 0.623
0.66AspTrp: 0.66 ± 0.276
2.972AspTyr: 2.972 ± 0.547
0.0AspXaa: 0.0 ± 0.0
Glu
4.871GluAla: 4.871 ± 0.675
0.578GluCys: 0.578 ± 0.243
4.706GluAsp: 4.706 ± 0.908
8.008GluGlu: 8.008 ± 1.741
2.972GluPhe: 2.972 ± 0.608
4.458GluGly: 4.458 ± 0.525
1.403GluHis: 1.403 ± 0.262
3.385GluIle: 3.385 ± 0.488
6.274GluLys: 6.274 ± 0.755
9.246GluLeu: 9.246 ± 1.102
2.312GluMet: 2.312 ± 0.466
3.963GluAsn: 3.963 ± 0.622
2.477GluPro: 2.477 ± 0.675
3.467GluGln: 3.467 ± 0.631
3.22GluArg: 3.22 ± 0.552
4.375GluSer: 4.375 ± 0.613
4.375GluThr: 4.375 ± 0.513
7.513GluVal: 7.513 ± 0.806
1.486GluTrp: 1.486 ± 0.354
3.137GluTyr: 3.137 ± 0.678
0.0GluXaa: 0.0 ± 0.0
Phe
1.816PheAla: 1.816 ± 0.322
0.248PheCys: 0.248 ± 0.168
3.137PheAsp: 3.137 ± 0.58
2.889PheGlu: 2.889 ± 0.638
0.991PhePhe: 0.991 ± 0.332
2.477PheGly: 2.477 ± 0.411
0.165PheHis: 0.165 ± 0.139
4.045PheIle: 4.045 ± 0.713
4.045PheLys: 4.045 ± 0.653
2.229PheLeu: 2.229 ± 0.457
1.156PheMet: 1.156 ± 0.299
2.972PheAsn: 2.972 ± 0.464
0.743PhePro: 0.743 ± 0.237
1.734PheGln: 1.734 ± 0.487
1.651PheArg: 1.651 ± 0.387
2.229PheSer: 2.229 ± 0.44
4.128PheThr: 4.128 ± 0.76
2.642PheVal: 2.642 ± 0.486
0.495PheTrp: 0.495 ± 0.187
0.908PheTyr: 0.908 ± 0.207
0.0PheXaa: 0.0 ± 0.0
Gly
4.375GlyAla: 4.375 ± 1.386
0.248GlyCys: 0.248 ± 0.128
3.302GlyAsp: 3.302 ± 0.699
3.385GlyGlu: 3.385 ± 0.603
3.385GlyPhe: 3.385 ± 0.476
4.045GlyGly: 4.045 ± 1.115
0.826GlyHis: 0.826 ± 0.255
5.531GlyIle: 5.531 ± 0.97
5.861GlyLys: 5.861 ± 0.733
5.614GlyLeu: 5.614 ± 0.706
1.651GlyMet: 1.651 ± 0.41
3.22GlyAsn: 3.22 ± 0.429
0.826GlyPro: 0.826 ± 0.278
2.146GlyGln: 2.146 ± 0.365
2.146GlyArg: 2.146 ± 0.364
3.55GlySer: 3.55 ± 0.498
4.541GlyThr: 4.541 ± 0.976
3.88GlyVal: 3.88 ± 0.598
1.321GlyTrp: 1.321 ± 0.269
3.137GlyTyr: 3.137 ± 0.535
0.0GlyXaa: 0.0 ± 0.0
His
0.413HisAla: 0.413 ± 0.2
0.165HisCys: 0.165 ± 0.11
0.908HisAsp: 0.908 ± 0.284
1.073HisGlu: 1.073 ± 0.314
0.66HisPhe: 0.66 ± 0.263
0.991HisGly: 0.991 ± 0.323
0.33HisHis: 0.33 ± 0.145
0.743HisIle: 0.743 ± 0.214
1.816HisLys: 1.816 ± 0.418
1.073HisLeu: 1.073 ± 0.355
0.33HisMet: 0.33 ± 0.164
1.403HisAsn: 1.403 ± 0.401
0.33HisPro: 0.33 ± 0.156
0.578HisGln: 0.578 ± 0.17
0.991HisArg: 0.991 ± 0.302
0.33HisSer: 0.33 ± 0.155
0.826HisThr: 0.826 ± 0.39
0.743HisVal: 0.743 ± 0.22
0.165HisTrp: 0.165 ± 0.153
0.826HisTyr: 0.826 ± 0.291
0.0HisXaa: 0.0 ± 0.0
Ile
4.21IleAla: 4.21 ± 0.602
0.743IleCys: 0.743 ± 0.264
5.201IleAsp: 5.201 ± 0.567
7.43IleGlu: 7.43 ± 0.99
2.064IlePhe: 2.064 ± 0.433
4.375IleGly: 4.375 ± 0.672
0.991IleHis: 0.991 ± 0.282
4.375IleIle: 4.375 ± 0.634
6.439IleLys: 6.439 ± 0.732
4.953IleLeu: 4.953 ± 0.618
1.569IleMet: 1.569 ± 0.381
4.706IleAsn: 4.706 ± 0.798
2.312IlePro: 2.312 ± 0.426
3.385IleGln: 3.385 ± 0.434
2.146IleArg: 2.146 ± 0.444
3.632IleSer: 3.632 ± 0.446
3.88IleThr: 3.88 ± 0.532
4.293IleVal: 4.293 ± 0.502
0.495IleTrp: 0.495 ± 0.192
1.651IleTyr: 1.651 ± 0.437
0.0IleXaa: 0.0 ± 0.0
Lys
7.347LysAla: 7.347 ± 1.016
0.413LysCys: 0.413 ± 0.19
4.871LysAsp: 4.871 ± 0.643
7.843LysGlu: 7.843 ± 1.004
3.963LysPhe: 3.963 ± 0.558
4.623LysGly: 4.623 ± 0.859
1.321LysHis: 1.321 ± 0.392
4.541LysIle: 4.541 ± 0.696
5.861LysLys: 5.861 ± 1.106
6.852LysLeu: 6.852 ± 0.905
2.889LysMet: 2.889 ± 0.443
6.109LysAsn: 6.109 ± 0.761
3.055LysPro: 3.055 ± 0.519
4.706LysGln: 4.706 ± 0.555
4.293LysArg: 4.293 ± 0.536
3.715LysSer: 3.715 ± 0.714
5.201LysThr: 5.201 ± 0.538
6.027LysVal: 6.027 ± 0.649
1.321LysTrp: 1.321 ± 0.303
3.302LysTyr: 3.302 ± 0.468
0.0LysXaa: 0.0 ± 0.0
Leu
4.21LeuAla: 4.21 ± 0.71
0.413LeuCys: 0.413 ± 0.229
6.522LeuAsp: 6.522 ± 0.706
8.833LeuGlu: 8.833 ± 0.994
3.302LeuPhe: 3.302 ± 0.535
4.788LeuGly: 4.788 ± 0.722
0.991LeuHis: 0.991 ± 0.239
5.036LeuIle: 5.036 ± 0.727
6.274LeuLys: 6.274 ± 1.085
6.522LeuLeu: 6.522 ± 0.899
1.899LeuMet: 1.899 ± 0.336
5.284LeuAsn: 5.284 ± 0.665
2.889LeuPro: 2.889 ± 0.454
4.045LeuGln: 4.045 ± 0.583
2.807LeuArg: 2.807 ± 0.523
4.706LeuSer: 4.706 ± 0.66
4.045LeuThr: 4.045 ± 0.545
5.531LeuVal: 5.531 ± 0.785
0.991LeuTrp: 0.991 ± 0.241
2.642LeuTyr: 2.642 ± 0.537
0.0LeuXaa: 0.0 ± 0.0
Met
1.321MetAla: 1.321 ± 0.38
0.248MetCys: 0.248 ± 0.152
1.651MetAsp: 1.651 ± 0.394
2.394MetGlu: 2.394 ± 0.421
1.321MetPhe: 1.321 ± 0.47
1.486MetGly: 1.486 ± 0.341
0.248MetHis: 0.248 ± 0.139
1.486MetIle: 1.486 ± 0.315
2.724MetLys: 2.724 ± 0.45
2.312MetLeu: 2.312 ± 0.379
0.495MetMet: 0.495 ± 0.204
1.816MetAsn: 1.816 ± 0.367
0.991MetPro: 0.991 ± 0.294
1.073MetGln: 1.073 ± 0.262
1.569MetArg: 1.569 ± 0.373
1.651MetSer: 1.651 ± 0.366
1.816MetThr: 1.816 ± 0.416
2.146MetVal: 2.146 ± 0.449
0.495MetTrp: 0.495 ± 0.252
1.486MetTyr: 1.486 ± 0.435
0.0MetXaa: 0.0 ± 0.0
Asn
5.036AsnAla: 5.036 ± 0.796
0.248AsnCys: 0.248 ± 0.141
3.055AsnAsp: 3.055 ± 0.479
5.696AsnGlu: 5.696 ± 0.705
1.651AsnPhe: 1.651 ± 0.326
6.77AsnGly: 6.77 ± 0.739
0.991AsnHis: 0.991 ± 0.286
4.623AsnIle: 4.623 ± 0.794
6.192AsnLys: 6.192 ± 0.836
4.293AsnLeu: 4.293 ± 0.56
1.651AsnMet: 1.651 ± 0.334
3.632AsnAsn: 3.632 ± 0.461
1.651AsnPro: 1.651 ± 0.415
1.569AsnGln: 1.569 ± 0.308
1.899AsnArg: 1.899 ± 0.367
3.467AsnSer: 3.467 ± 0.607
5.036AsnThr: 5.036 ± 0.765
3.302AsnVal: 3.302 ± 0.601
0.743AsnTrp: 0.743 ± 0.255
3.137AsnTyr: 3.137 ± 0.613
0.0AsnXaa: 0.0 ± 0.0
Pro
1.486ProAla: 1.486 ± 0.387
0.165ProCys: 0.165 ± 0.112
2.394ProAsp: 2.394 ± 0.494
2.724ProGlu: 2.724 ± 0.477
1.073ProPhe: 1.073 ± 0.252
0.083ProGly: 0.083 ± 0.073
0.33ProHis: 0.33 ± 0.165
1.899ProIle: 1.899 ± 0.403
2.477ProLys: 2.477 ± 0.598
2.972ProLeu: 2.972 ± 0.618
1.156ProMet: 1.156 ± 0.213
1.816ProAsn: 1.816 ± 0.368
0.413ProPro: 0.413 ± 0.193
1.486ProGln: 1.486 ± 0.378
0.578ProArg: 0.578 ± 0.197
1.734ProSer: 1.734 ± 0.443
1.899ProThr: 1.899 ± 0.429
2.146ProVal: 2.146 ± 0.413
0.248ProTrp: 0.248 ± 0.135
1.651ProTyr: 1.651 ± 0.453
0.0ProXaa: 0.0 ± 0.0
Gln
1.981GlnAla: 1.981 ± 0.413
0.495GlnCys: 0.495 ± 0.229
2.146GlnAsp: 2.146 ± 0.292
2.394GlnGlu: 2.394 ± 0.405
1.816GlnPhe: 1.816 ± 0.449
1.899GlnGly: 1.899 ± 0.412
0.66GlnHis: 0.66 ± 0.228
3.22GlnIle: 3.22 ± 0.737
2.064GlnLys: 2.064 ± 0.4
3.137GlnLeu: 3.137 ± 0.406
1.403GlnMet: 1.403 ± 0.352
1.569GlnAsn: 1.569 ± 0.333
1.156GlnPro: 1.156 ± 0.292
1.981GlnGln: 1.981 ± 0.337
1.981GlnArg: 1.981 ± 0.493
2.146GlnSer: 2.146 ± 0.451
1.816GlnThr: 1.816 ± 0.331
3.055GlnVal: 3.055 ± 0.383
0.66GlnTrp: 0.66 ± 0.352
2.394GlnTyr: 2.394 ± 0.405
0.0GlnXaa: 0.0 ± 0.0
Arg
1.734ArgAla: 1.734 ± 0.352
0.248ArgCys: 0.248 ± 0.123
2.394ArgAsp: 2.394 ± 0.476
1.816ArgGlu: 1.816 ± 0.424
1.569ArgPhe: 1.569 ± 0.403
1.816ArgGly: 1.816 ± 0.478
0.743ArgHis: 0.743 ± 0.274
2.229ArgIle: 2.229 ± 0.393
3.467ArgLys: 3.467 ± 0.64
3.302ArgLeu: 3.302 ± 0.685
1.073ArgMet: 1.073 ± 0.286
2.807ArgAsn: 2.807 ± 0.387
1.073ArgPro: 1.073 ± 0.351
1.156ArgGln: 1.156 ± 0.386
1.156ArgArg: 1.156 ± 0.296
1.981ArgSer: 1.981 ± 0.414
1.816ArgThr: 1.816 ± 0.336
2.642ArgVal: 2.642 ± 0.475
0.33ArgTrp: 0.33 ± 0.219
1.651ArgTyr: 1.651 ± 0.359
0.0ArgXaa: 0.0 ± 0.0
Ser
2.889SerAla: 2.889 ± 0.674
0.0SerCys: 0.0 ± 0.0
3.22SerAsp: 3.22 ± 0.47
3.798SerGlu: 3.798 ± 0.419
2.559SerPhe: 2.559 ± 0.358
4.953SerGly: 4.953 ± 0.762
1.238SerHis: 1.238 ± 0.314
4.128SerIle: 4.128 ± 0.552
5.201SerLys: 5.201 ± 0.866
3.22SerLeu: 3.22 ± 0.63
1.734SerMet: 1.734 ± 0.365
3.22SerAsn: 3.22 ± 0.514
1.073SerPro: 1.073 ± 0.298
2.559SerGln: 2.559 ± 0.483
1.156SerArg: 1.156 ± 0.317
2.724SerSer: 2.724 ± 0.402
3.88SerThr: 3.88 ± 0.872
2.889SerVal: 2.889 ± 0.583
0.826SerTrp: 0.826 ± 0.25
2.477SerTyr: 2.477 ± 0.59
0.0SerXaa: 0.0 ± 0.0
Thr
3.963ThrAla: 3.963 ± 0.505
0.083ThrCys: 0.083 ± 0.077
3.302ThrAsp: 3.302 ± 0.594
4.375ThrGlu: 4.375 ± 0.61
2.312ThrPhe: 2.312 ± 0.5
4.706ThrGly: 4.706 ± 0.59
1.321ThrHis: 1.321 ± 0.365
4.788ThrIle: 4.788 ± 0.72
7.017ThrLys: 7.017 ± 0.777
5.614ThrLeu: 5.614 ± 0.942
1.403ThrMet: 1.403 ± 0.322
3.798ThrAsn: 3.798 ± 0.463
2.394ThrPro: 2.394 ± 0.36
2.312ThrGln: 2.312 ± 0.443
1.651ThrArg: 1.651 ± 0.384
2.312ThrSer: 2.312 ± 0.412
4.623ThrThr: 4.623 ± 1.024
4.045ThrVal: 4.045 ± 0.452
0.743ThrTrp: 0.743 ± 0.273
2.559ThrTyr: 2.559 ± 0.58
0.0ThrXaa: 0.0 ± 0.0
Val
5.779ValAla: 5.779 ± 0.636
0.248ValCys: 0.248 ± 0.141
4.706ValAsp: 4.706 ± 0.617
5.118ValGlu: 5.118 ± 0.765
2.972ValPhe: 2.972 ± 0.437
4.623ValGly: 4.623 ± 0.697
0.743ValHis: 0.743 ± 0.235
4.293ValIle: 4.293 ± 0.612
5.449ValLys: 5.449 ± 0.947
4.458ValLeu: 4.458 ± 0.615
1.816ValMet: 1.816 ± 0.339
4.788ValAsn: 4.788 ± 0.884
2.229ValPro: 2.229 ± 0.361
1.734ValGln: 1.734 ± 0.476
1.981ValArg: 1.981 ± 0.412
5.696ValSer: 5.696 ± 0.872
3.798ValThr: 3.798 ± 0.683
4.128ValVal: 4.128 ± 0.567
0.66ValTrp: 0.66 ± 0.27
2.559ValTyr: 2.559 ± 0.486
0.0ValXaa: 0.0 ± 0.0
Trp
0.495TrpAla: 0.495 ± 0.223
0.165TrpCys: 0.165 ± 0.12
0.826TrpAsp: 0.826 ± 0.241
1.569TrpGlu: 1.569 ± 0.296
0.908TrpPhe: 0.908 ± 0.27
0.826TrpGly: 0.826 ± 0.29
0.083TrpHis: 0.083 ± 0.093
0.826TrpIle: 0.826 ± 0.358
0.66TrpLys: 0.66 ± 0.234
1.156TrpLeu: 1.156 ± 0.25
0.083TrpMet: 0.083 ± 0.093
0.908TrpAsn: 0.908 ± 0.331
0.0TrpPro: 0.0 ± 0.0
0.413TrpGln: 0.413 ± 0.159
0.495TrpArg: 0.495 ± 0.184
0.66TrpSer: 0.66 ± 0.187
0.578TrpThr: 0.578 ± 0.232
1.321TrpVal: 1.321 ± 0.24
0.248TrpTrp: 0.248 ± 0.107
0.495TrpTyr: 0.495 ± 0.22
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.724TyrAla: 2.724 ± 0.521
0.578TyrCys: 0.578 ± 0.229
3.22TyrAsp: 3.22 ± 0.56
3.88TyrGlu: 3.88 ± 0.664
1.569TyrPhe: 1.569 ± 0.348
1.899TyrGly: 1.899 ± 0.451
0.495TyrHis: 0.495 ± 0.215
3.632TyrIle: 3.632 ± 0.658
4.045TyrLys: 4.045 ± 0.616
3.385TyrLeu: 3.385 ± 0.501
1.073TyrMet: 1.073 ± 0.271
3.467TyrAsn: 3.467 ± 0.547
0.826TyrPro: 0.826 ± 0.326
0.826TyrGln: 0.826 ± 0.321
1.073TyrArg: 1.073 ± 0.381
1.734TyrSer: 1.734 ± 0.445
2.642TyrThr: 2.642 ± 0.608
2.559TyrVal: 2.559 ± 0.483
0.165TyrTrp: 0.165 ± 0.12
1.816TyrTyr: 1.816 ± 0.445
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (12114 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski