Amino acid dipepetide frequency for Enterococcus phage vB_EfaS_IME198

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.653AlaAla: 5.653 ± 1.244
0.661AlaCys: 0.661 ± 0.224
4.27AlaAsp: 4.27 ± 0.66
6.314AlaGlu: 6.314 ± 0.599
2.706AlaPhe: 2.706 ± 0.39
4.51AlaGly: 4.51 ± 0.739
0.842AlaHis: 0.842 ± 0.211
5.713AlaIle: 5.713 ± 0.58
4.39AlaLys: 4.39 ± 0.72
5.953AlaLeu: 5.953 ± 0.819
3.067AlaMet: 3.067 ± 0.487
3.849AlaAsn: 3.849 ± 0.522
2.405AlaPro: 2.405 ± 0.392
2.466AlaGln: 2.466 ± 0.339
2.887AlaArg: 2.887 ± 0.347
4.21AlaSer: 4.21 ± 0.525
3.789AlaThr: 3.789 ± 0.482
4.57AlaVal: 4.57 ± 0.58
0.661AlaTrp: 0.661 ± 0.191
3.548AlaTyr: 3.548 ± 0.562
0.0AlaXaa: 0.0 ± 0.0
Cys
0.241CysAla: 0.241 ± 0.105
0.241CysCys: 0.241 ± 0.159
0.481CysAsp: 0.481 ± 0.161
0.481CysGlu: 0.481 ± 0.177
0.541CysPhe: 0.541 ± 0.168
0.782CysGly: 0.782 ± 0.242
0.06CysHis: 0.06 ± 0.06
0.241CysIle: 0.241 ± 0.134
0.541CysLys: 0.541 ± 0.223
0.601CysLeu: 0.601 ± 0.187
0.541CysMet: 0.541 ± 0.174
0.241CysAsn: 0.241 ± 0.122
0.12CysPro: 0.12 ± 0.079
0.301CysGln: 0.301 ± 0.129
0.12CysArg: 0.12 ± 0.088
0.361CysSer: 0.361 ± 0.152
0.421CysThr: 0.421 ± 0.157
0.421CysVal: 0.421 ± 0.189
0.0CysTrp: 0.0 ± 0.0
0.06CysTyr: 0.06 ± 0.056
0.0CysXaa: 0.0 ± 0.0
Asp
4.029AspAla: 4.029 ± 0.615
0.601AspCys: 0.601 ± 0.2
3.127AspAsp: 3.127 ± 0.577
5.051AspGlu: 5.051 ± 0.656
3.608AspPhe: 3.608 ± 0.529
3.909AspGly: 3.909 ± 0.457
0.481AspHis: 0.481 ± 0.182
4.21AspIle: 4.21 ± 0.521
4.39AspLys: 4.39 ± 0.488
6.314AspLeu: 6.314 ± 0.72
1.864AspMet: 1.864 ± 0.401
3.067AspAsn: 3.067 ± 0.415
1.383AspPro: 1.383 ± 0.272
1.143AspGln: 1.143 ± 0.26
2.466AspArg: 2.466 ± 0.378
3.668AspSer: 3.668 ± 0.4
4.029AspThr: 4.029 ± 0.626
4.029AspVal: 4.029 ± 0.55
0.962AspTrp: 0.962 ± 0.399
3.849AspTyr: 3.849 ± 0.44
0.0AspXaa: 0.0 ± 0.0
Glu
9.081GluAla: 9.081 ± 0.762
0.481GluCys: 0.481 ± 0.168
6.254GluAsp: 6.254 ± 0.698
11.005GluGlu: 11.005 ± 1.291
3.307GluPhe: 3.307 ± 0.535
5.533GluGly: 5.533 ± 0.578
0.601GluHis: 0.601 ± 0.214
4.51GluIle: 4.51 ± 0.545
4.691GluLys: 4.691 ± 0.688
8.058GluLeu: 8.058 ± 0.918
2.887GluMet: 2.887 ± 0.414
4.51GluAsn: 4.51 ± 0.582
2.165GluPro: 2.165 ± 0.427
3.127GluGln: 3.127 ± 0.458
5.051GluArg: 5.051 ± 0.615
4.751GluSer: 4.751 ± 0.538
4.149GluThr: 4.149 ± 0.537
6.435GluVal: 6.435 ± 0.632
1.924GluTrp: 1.924 ± 0.353
3.548GluTyr: 3.548 ± 0.513
0.0GluXaa: 0.0 ± 0.0
Phe
2.526PheAla: 2.526 ± 0.492
0.241PheCys: 0.241 ± 0.13
2.405PheAsp: 2.405 ± 0.437
3.608PheGlu: 3.608 ± 0.461
1.323PhePhe: 1.323 ± 0.296
2.947PheGly: 2.947 ± 0.443
0.601PheHis: 0.601 ± 0.178
2.887PheIle: 2.887 ± 0.483
3.368PheLys: 3.368 ± 0.397
3.668PheLeu: 3.668 ± 0.423
0.962PheMet: 0.962 ± 0.26
2.405PheAsn: 2.405 ± 0.328
1.323PhePro: 1.323 ± 0.346
1.203PheGln: 1.203 ± 0.27
1.203PheArg: 1.203 ± 0.258
2.646PheSer: 2.646 ± 0.505
2.646PheThr: 2.646 ± 0.349
2.405PheVal: 2.405 ± 0.438
0.661PheTrp: 0.661 ± 0.209
1.924PheTyr: 1.924 ± 0.465
0.0PheXaa: 0.0 ± 0.0
Gly
3.548GlyAla: 3.548 ± 0.441
0.361GlyCys: 0.361 ± 0.202
3.668GlyAsp: 3.668 ± 0.437
3.849GlyGlu: 3.849 ± 0.494
3.428GlyPhe: 3.428 ± 0.484
3.368GlyGly: 3.368 ± 0.828
1.443GlyHis: 1.443 ± 0.23
4.811GlyIle: 4.811 ± 0.64
5.653GlyLys: 5.653 ± 0.699
4.751GlyLeu: 4.751 ± 0.566
1.864GlyMet: 1.864 ± 0.308
2.887GlyAsn: 2.887 ± 0.432
0.06GlyPro: 0.06 ± 0.054
2.045GlyGln: 2.045 ± 0.44
2.646GlyArg: 2.646 ± 0.448
3.548GlySer: 3.548 ± 0.374
4.39GlyThr: 4.39 ± 0.784
4.149GlyVal: 4.149 ± 0.618
1.022GlyTrp: 1.022 ± 0.278
2.826GlyTyr: 2.826 ± 0.388
0.0GlyXaa: 0.0 ± 0.0
His
0.962HisAla: 0.962 ± 0.234
0.18HisCys: 0.18 ± 0.102
0.421HisAsp: 0.421 ± 0.168
1.203HisGlu: 1.203 ± 0.216
0.842HisPhe: 0.842 ± 0.219
0.842HisGly: 0.842 ± 0.201
0.241HisHis: 0.241 ± 0.124
1.323HisIle: 1.323 ± 0.244
1.203HisLys: 1.203 ± 0.269
1.263HisLeu: 1.263 ± 0.258
0.18HisMet: 0.18 ± 0.114
0.842HisAsn: 0.842 ± 0.185
0.782HisPro: 0.782 ± 0.217
0.481HisGln: 0.481 ± 0.204
0.962HisArg: 0.962 ± 0.193
0.541HisSer: 0.541 ± 0.165
0.421HisThr: 0.421 ± 0.151
1.022HisVal: 1.022 ± 0.253
0.12HisTrp: 0.12 ± 0.085
0.782HisTyr: 0.782 ± 0.239
0.0HisXaa: 0.0 ± 0.0
Ile
4.691IleAla: 4.691 ± 0.491
0.421IleCys: 0.421 ± 0.179
4.33IleAsp: 4.33 ± 0.486
5.893IleGlu: 5.893 ± 0.711
2.105IlePhe: 2.105 ± 0.349
3.127IleGly: 3.127 ± 0.371
1.443IleHis: 1.443 ± 0.287
3.307IleIle: 3.307 ± 0.468
5.172IleLys: 5.172 ± 0.586
4.149IleLeu: 4.149 ± 0.588
1.744IleMet: 1.744 ± 0.262
4.029IleAsn: 4.029 ± 0.494
1.503IlePro: 1.503 ± 0.261
2.766IleGln: 2.766 ± 0.5
2.887IleArg: 2.887 ± 0.434
3.668IleSer: 3.668 ± 0.469
3.608IleThr: 3.608 ± 0.618
4.029IleVal: 4.029 ± 0.389
0.361IleTrp: 0.361 ± 0.163
2.887IleTyr: 2.887 ± 0.44
0.0IleXaa: 0.0 ± 0.0
Lys
7.577LysAla: 7.577 ± 0.634
0.241LysCys: 0.241 ± 0.121
5.953LysAsp: 5.953 ± 0.575
7.216LysGlu: 7.216 ± 0.705
2.766LysPhe: 2.766 ± 0.398
4.149LysGly: 4.149 ± 0.528
0.962LysHis: 0.962 ± 0.326
3.307LysIle: 3.307 ± 0.423
5.713LysLys: 5.713 ± 0.807
6.675LysLeu: 6.675 ± 0.619
2.105LysMet: 2.105 ± 0.448
2.466LysAsn: 2.466 ± 0.39
3.007LysPro: 3.007 ± 0.467
2.826LysGln: 2.826 ± 0.489
3.247LysArg: 3.247 ± 0.648
4.089LysSer: 4.089 ± 0.635
4.089LysThr: 4.089 ± 0.482
6.014LysVal: 6.014 ± 0.589
1.082LysTrp: 1.082 ± 0.243
2.766LysTyr: 2.766 ± 0.411
0.0LysXaa: 0.0 ± 0.0
Leu
5.352LeuAla: 5.352 ± 0.528
0.481LeuCys: 0.481 ± 0.189
4.57LeuAsp: 4.57 ± 0.363
9.862LeuGlu: 9.862 ± 1.078
3.127LeuPhe: 3.127 ± 0.452
5.472LeuGly: 5.472 ± 0.692
1.203LeuHis: 1.203 ± 0.249
5.472LeuIle: 5.472 ± 0.722
6.555LeuLys: 6.555 ± 0.708
6.555LeuLeu: 6.555 ± 0.887
2.586LeuMet: 2.586 ± 0.38
4.21LeuAsn: 4.21 ± 0.595
3.067LeuPro: 3.067 ± 0.374
3.548LeuGln: 3.548 ± 0.459
3.307LeuArg: 3.307 ± 0.424
5.533LeuSer: 5.533 ± 0.573
6.615LeuThr: 6.615 ± 0.6
5.472LeuVal: 5.472 ± 0.648
0.782LeuTrp: 0.782 ± 0.201
2.285LeuTyr: 2.285 ± 0.32
0.0LeuXaa: 0.0 ± 0.0
Met
2.826MetAla: 2.826 ± 0.556
0.06MetCys: 0.06 ± 0.065
1.684MetAsp: 1.684 ± 0.256
2.947MetGlu: 2.947 ± 0.425
0.962MetPhe: 0.962 ± 0.281
0.962MetGly: 0.962 ± 0.299
0.301MetHis: 0.301 ± 0.129
1.323MetIle: 1.323 ± 0.319
2.345MetLys: 2.345 ± 0.357
2.766MetLeu: 2.766 ± 0.399
0.782MetMet: 0.782 ± 0.239
1.864MetAsn: 1.864 ± 0.275
0.962MetPro: 0.962 ± 0.257
0.722MetGln: 0.722 ± 0.19
1.383MetArg: 1.383 ± 0.263
2.165MetSer: 2.165 ± 0.333
1.263MetThr: 1.263 ± 0.215
1.624MetVal: 1.624 ± 0.28
0.18MetTrp: 0.18 ± 0.114
1.022MetTyr: 1.022 ± 0.221
0.0MetXaa: 0.0 ± 0.0
Asn
2.947AsnAla: 2.947 ± 0.404
0.361AsnCys: 0.361 ± 0.15
2.947AsnAsp: 2.947 ± 0.401
4.33AsnGlu: 4.33 ± 0.499
1.924AsnPhe: 1.924 ± 0.265
3.849AsnGly: 3.849 ± 0.538
1.082AsnHis: 1.082 ± 0.292
2.947AsnIle: 2.947 ± 0.397
4.45AsnLys: 4.45 ± 0.703
4.089AsnLeu: 4.089 ± 0.391
0.902AsnMet: 0.902 ± 0.186
2.285AsnAsn: 2.285 ± 0.495
2.766AsnPro: 2.766 ± 0.507
1.924AsnGln: 1.924 ± 0.33
2.045AsnArg: 2.045 ± 0.336
3.127AsnSer: 3.127 ± 0.419
2.826AsnThr: 2.826 ± 0.395
3.307AsnVal: 3.307 ± 0.368
0.421AsnTrp: 0.421 ± 0.173
2.105AsnTyr: 2.105 ± 0.314
0.0AsnXaa: 0.0 ± 0.0
Pro
2.045ProAla: 2.045 ± 0.405
0.18ProCys: 0.18 ± 0.1
1.864ProAsp: 1.864 ± 0.325
3.488ProGlu: 3.488 ± 0.581
1.203ProPhe: 1.203 ± 0.26
0.241ProGly: 0.241 ± 0.107
0.481ProHis: 0.481 ± 0.157
2.285ProIle: 2.285 ± 0.434
2.706ProLys: 2.706 ± 0.416
2.345ProLeu: 2.345 ± 0.385
0.842ProMet: 0.842 ± 0.226
1.503ProAsn: 1.503 ± 0.397
0.661ProPro: 0.661 ± 0.207
0.722ProGln: 0.722 ± 0.21
1.022ProArg: 1.022 ± 0.288
2.826ProSer: 2.826 ± 0.452
1.443ProThr: 1.443 ± 0.273
1.924ProVal: 1.924 ± 0.362
0.481ProTrp: 0.481 ± 0.18
1.744ProTyr: 1.744 ± 0.384
0.0ProXaa: 0.0 ± 0.0
Gln
2.105GlnAla: 2.105 ± 0.42
0.12GlnCys: 0.12 ± 0.067
2.105GlnAsp: 2.105 ± 0.276
3.368GlnGlu: 3.368 ± 0.49
1.143GlnPhe: 1.143 ± 0.311
2.466GlnGly: 2.466 ± 0.386
0.601GlnHis: 0.601 ± 0.205
1.564GlnIle: 1.564 ± 0.316
2.285GlnLys: 2.285 ± 0.357
3.548GlnLeu: 3.548 ± 0.457
0.722GlnMet: 0.722 ± 0.266
1.143GlnAsn: 1.143 ± 0.276
0.962GlnPro: 0.962 ± 0.22
1.804GlnGln: 1.804 ± 0.469
1.804GlnArg: 1.804 ± 0.29
1.984GlnSer: 1.984 ± 0.386
2.826GlnThr: 2.826 ± 0.378
2.526GlnVal: 2.526 ± 0.391
0.481GlnTrp: 0.481 ± 0.186
1.323GlnTyr: 1.323 ± 0.322
0.0GlnXaa: 0.0 ± 0.0
Arg
3.127ArgAla: 3.127 ± 0.466
0.481ArgCys: 0.481 ± 0.173
2.285ArgAsp: 2.285 ± 0.332
3.368ArgGlu: 3.368 ± 0.444
2.105ArgPhe: 2.105 ± 0.332
2.706ArgGly: 2.706 ± 0.455
0.782ArgHis: 0.782 ± 0.23
3.127ArgIle: 3.127 ± 0.497
3.247ArgLys: 3.247 ± 0.49
4.27ArgLeu: 4.27 ± 0.56
0.782ArgMet: 0.782 ± 0.209
1.924ArgAsn: 1.924 ± 0.346
1.503ArgPro: 1.503 ± 0.327
1.564ArgGln: 1.564 ± 0.295
1.744ArgArg: 1.744 ± 0.314
1.864ArgSer: 1.864 ± 0.406
1.804ArgThr: 1.804 ± 0.277
2.345ArgVal: 2.345 ± 0.38
0.361ArgTrp: 0.361 ± 0.134
2.165ArgTyr: 2.165 ± 0.379
0.0ArgXaa: 0.0 ± 0.0
Ser
2.947SerAla: 2.947 ± 0.492
0.361SerCys: 0.361 ± 0.15
3.849SerAsp: 3.849 ± 0.478
4.45SerGlu: 4.45 ± 0.427
2.766SerPhe: 2.766 ± 0.348
4.149SerGly: 4.149 ± 0.514
0.722SerHis: 0.722 ± 0.212
4.27SerIle: 4.27 ± 0.424
5.593SerLys: 5.593 ± 0.46
5.051SerLeu: 5.051 ± 0.565
1.564SerMet: 1.564 ± 0.262
3.127SerAsn: 3.127 ± 0.434
1.624SerPro: 1.624 ± 0.26
1.744SerGln: 1.744 ± 0.399
2.045SerArg: 2.045 ± 0.311
3.187SerSer: 3.187 ± 0.442
3.488SerThr: 3.488 ± 0.444
4.39SerVal: 4.39 ± 0.553
0.962SerTrp: 0.962 ± 0.233
2.345SerTyr: 2.345 ± 0.368
0.0SerXaa: 0.0 ± 0.0
Thr
4.51ThrAla: 4.51 ± 0.87
0.361ThrCys: 0.361 ± 0.127
3.368ThrAsp: 3.368 ± 0.46
3.789ThrGlu: 3.789 ± 0.391
2.766ThrPhe: 2.766 ± 0.407
3.608ThrGly: 3.608 ± 0.475
0.962ThrHis: 0.962 ± 0.232
4.33ThrIle: 4.33 ± 0.524
3.909ThrLys: 3.909 ± 0.425
6.435ThrLeu: 6.435 ± 0.604
1.383ThrMet: 1.383 ± 0.333
3.187ThrAsn: 3.187 ± 0.527
2.766ThrPro: 2.766 ± 0.467
1.624ThrGln: 1.624 ± 0.316
1.564ThrArg: 1.564 ± 0.317
3.668ThrSer: 3.668 ± 0.546
3.067ThrThr: 3.067 ± 0.453
4.63ThrVal: 4.63 ± 0.641
0.481ThrTrp: 0.481 ± 0.178
2.706ThrTyr: 2.706 ± 0.356
0.0ThrXaa: 0.0 ± 0.0
Val
5.352ValAla: 5.352 ± 0.595
0.601ValCys: 0.601 ± 0.202
4.751ValAsp: 4.751 ± 0.441
6.014ValGlu: 6.014 ± 0.679
2.586ValPhe: 2.586 ± 0.431
4.149ValGly: 4.149 ± 0.56
0.902ValHis: 0.902 ± 0.228
4.029ValIle: 4.029 ± 0.506
5.713ValLys: 5.713 ± 0.541
5.051ValLeu: 5.051 ± 0.506
1.443ValMet: 1.443 ± 0.272
3.728ValAsn: 3.728 ± 0.512
1.804ValPro: 1.804 ± 0.33
2.285ValGln: 2.285 ± 0.349
2.766ValArg: 2.766 ± 0.415
3.488ValSer: 3.488 ± 0.441
4.33ValThr: 4.33 ± 0.499
4.33ValVal: 4.33 ± 0.612
1.263ValTrp: 1.263 ± 0.258
2.466ValTyr: 2.466 ± 0.406
0.0ValXaa: 0.0 ± 0.0
Trp
0.601TrpAla: 0.601 ± 0.232
0.06TrpCys: 0.06 ± 0.052
0.782TrpAsp: 0.782 ± 0.25
1.564TrpGlu: 1.564 ± 0.285
0.421TrpPhe: 0.421 ± 0.185
0.962TrpGly: 0.962 ± 0.26
0.18TrpHis: 0.18 ± 0.091
0.962TrpIle: 0.962 ± 0.213
0.902TrpLys: 0.902 ± 0.235
0.782TrpLeu: 0.782 ± 0.268
0.06TrpMet: 0.06 ± 0.048
1.383TrpAsn: 1.383 ± 0.257
0.0TrpPro: 0.0 ± 0.0
0.722TrpGln: 0.722 ± 0.211
0.361TrpArg: 0.361 ± 0.143
0.962TrpSer: 0.962 ± 0.256
1.022TrpThr: 1.022 ± 0.257
0.541TrpVal: 0.541 ± 0.174
0.241TrpTrp: 0.241 ± 0.143
0.541TrpTyr: 0.541 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.345TyrAla: 2.345 ± 0.33
0.301TyrCys: 0.301 ± 0.121
3.067TyrAsp: 3.067 ± 0.423
4.27TyrGlu: 4.27 ± 0.594
1.323TyrPhe: 1.323 ± 0.309
2.586TyrGly: 2.586 ± 0.379
0.722TyrHis: 0.722 ± 0.211
1.503TyrIle: 1.503 ± 0.341
3.428TyrLys: 3.428 ± 0.492
3.789TyrLeu: 3.789 ± 0.428
1.804TyrMet: 1.804 ± 0.312
1.984TyrAsn: 1.984 ± 0.38
1.143TyrPro: 1.143 ± 0.233
1.864TyrGln: 1.864 ± 0.303
2.045TyrArg: 2.045 ± 0.383
2.345TyrSer: 2.345 ± 0.312
2.887TyrThr: 2.887 ± 0.414
2.887TyrVal: 2.887 ± 0.314
0.601TyrTrp: 0.601 ± 0.198
2.225TyrTyr: 2.225 ± 0.363
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 95 proteins (16630 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski