Amino acid dipeptide frequency for Escherichia phage ECBP2

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins (with all amino acids equally frequent), each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
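The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual code: it assumes dipeptides are counted as overlapping windows within each protein, that any non-standard letter is mapped to X (Xaa), and that "expected" means the uniform value of 2.5‰. The function names `dipeptide_permille` and `classify` are hypothetical. The overlapping-window assumption is at least consistent with the table: the smallest non-zero value, 0.044‰, is what a single occurrence among 22737 - 120 = 22617 dipeptides would give.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
EXPECTED_PERMILLE = 1000.0 / 400    # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping windows inside each protein,
    so a protein of length n contributes n - 1 dipeptides.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if permille > EXPECTED_PERMILLE:
        return "over"
    if permille < EXPECTED_PERMILLE:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy = ["MAAC", "MAC"]  # made-up sequences, for illustration only
    for dp, freq in sorted(dipeptide_permille(toy).items()):
        print(dp, round(freq, 1), classify(freq))
```

Under these assumptions, the AlaAla value of 7.609‰ would be classified as overrepresented and the AlaCys value of 0.88‰ as underrepresented. A composition-aware expectation (the product of the two single amino acid frequencies) would be a different and stricter baseline.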

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.609 ± 1.085
AlaCys: 0.88 ± 0.203
AlaAsp: 5.454 ± 0.686
AlaGlu: 5.498 ± 0.855
AlaPhe: 2.991 ± 0.367
AlaGly: 5.586 ± 0.604
AlaHis: 1.276 ± 0.239
AlaIle: 4.662 ± 0.495
AlaLys: 4.926 ± 0.763
AlaLeu: 6.993 ± 0.602
AlaMet: 2.111 ± 0.319
AlaAsn: 4.046 ± 0.489
AlaPro: 2.947 ± 0.375
AlaGln: 4.222 ± 0.639
AlaArg: 4.354 ± 0.637
AlaSer: 5.234 ± 0.526
AlaThr: 4.178 ± 0.502
AlaVal: 4.662 ± 0.443
AlaTrp: 1.056 ± 0.194
AlaTyr: 2.199 ± 0.328
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.144 ± 0.269
CysCys: 0.0 ± 0.0
CysAsp: 0.528 ± 0.159
CysGlu: 0.66 ± 0.153
CysPhe: 0.264 ± 0.119
CysGly: 0.836 ± 0.207
CysHis: 0.308 ± 0.109
CysIle: 0.748 ± 0.226
CysLys: 0.924 ± 0.23
CysLeu: 0.88 ± 0.243
CysMet: 0.352 ± 0.15
CysAsn: 0.352 ± 0.126
CysPro: 0.308 ± 0.128
CysGln: 0.308 ± 0.104
CysArg: 0.528 ± 0.168
CysSer: 0.616 ± 0.188
CysThr: 0.836 ± 0.212
CysVal: 1.012 ± 0.238
CysTrp: 0.044 ± 0.046
CysTyr: 0.616 ± 0.161
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.07 ± 0.621
AspCys: 0.528 ± 0.167
AspAsp: 3.123 ± 0.403
AspGlu: 4.662 ± 0.544
AspPhe: 2.331 ± 0.305
AspGly: 5.102 ± 0.574
AspHis: 0.924 ± 0.2
AspIle: 4.794 ± 0.506
AspLys: 3.783 ± 0.39
AspLeu: 4.53 ± 0.405
AspMet: 1.671 ± 0.224
AspAsn: 3.299 ± 0.369
AspPro: 2.199 ± 0.334
AspGln: 1.715 ± 0.282
AspArg: 2.639 ± 0.42
AspSer: 4.134 ± 0.489
AspThr: 2.727 ± 0.32
AspVal: 4.53 ± 0.441
AspTrp: 0.88 ± 0.168
AspTyr: 2.551 ± 0.345
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.982 ± 0.785
GluCys: 0.748 ± 0.187
GluAsp: 4.838 ± 0.705
GluGlu: 5.982 ± 0.727
GluPhe: 2.551 ± 0.397
GluGly: 4.398 ± 0.391
GluHis: 1.539 ± 0.323
GluIle: 4.354 ± 0.349
GluLys: 4.442 ± 0.611
GluLeu: 5.454 ± 0.636
GluMet: 2.683 ± 0.336
GluAsn: 3.079 ± 0.407
GluPro: 1.715 ± 0.425
GluGln: 2.375 ± 0.466
GluArg: 4.002 ± 0.483
GluSer: 4.09 ± 0.417
GluThr: 2.947 ± 0.319
GluVal: 4.882 ± 0.46
GluTrp: 1.012 ± 0.218
GluTyr: 2.331 ± 0.305
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.551 ± 0.324
PheCys: 0.748 ± 0.234
PheAsp: 2.859 ± 0.413
PheGlu: 2.771 ± 0.36
PhePhe: 1.495 ± 0.329
PheGly: 2.419 ± 0.351
PheHis: 0.748 ± 0.207
PheIle: 2.243 ± 0.299
PheLys: 2.243 ± 0.308
PheLeu: 2.903 ± 0.441
PheMet: 0.616 ± 0.172
PheAsn: 2.551 ± 0.277
PhePro: 1.232 ± 0.28
PheGln: 1.627 ± 0.274
PheArg: 2.155 ± 0.275
PheSer: 2.287 ± 0.364
PheThr: 1.979 ± 0.298
PheVal: 2.551 ± 0.305
PheTrp: 0.44 ± 0.12
PheTyr: 2.067 ± 0.291
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.674 ± 0.733
GlyCys: 1.1 ± 0.236
GlyAsp: 5.322 ± 0.527
GlyGlu: 4.09 ± 0.416
GlyPhe: 3.079 ± 0.318
GlyGly: 6.026 ± 0.665
GlyHis: 1.012 ± 0.234
GlyIle: 3.783 ± 0.363
GlyLys: 4.046 ± 0.507
GlyLeu: 5.278 ± 0.615
GlyMet: 1.979 ± 0.338
GlyAsn: 3.343 ± 0.4
GlyPro: 1.232 ± 0.269
GlyGln: 2.639 ± 0.426
GlyArg: 3.475 ± 0.387
GlySer: 5.586 ± 0.521
GlyThr: 4.134 ± 0.539
GlyVal: 5.366 ± 0.555
GlyTrp: 1.232 ± 0.165
GlyTyr: 2.683 ± 0.378
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.188 ± 0.229
HisCys: 0.132 ± 0.07
HisAsp: 0.968 ± 0.246
HisGlu: 1.056 ± 0.224
HisPhe: 0.616 ± 0.169
HisGly: 1.232 ± 0.23
HisHis: 0.22 ± 0.121
HisIle: 1.056 ± 0.248
HisLys: 1.407 ± 0.239
HisLeu: 1.715 ± 0.324
HisMet: 0.484 ± 0.162
HisAsn: 1.627 ± 0.236
HisPro: 0.704 ± 0.17
HisGln: 0.352 ± 0.109
HisArg: 1.319 ± 0.305
HisSer: 1.319 ± 0.356
HisThr: 0.572 ± 0.155
HisVal: 1.319 ± 0.224
HisTrp: 0.22 ± 0.098
HisTyr: 0.792 ± 0.218
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.958 ± 0.478
IleCys: 0.704 ± 0.235
IleAsp: 3.739 ± 0.433
IleGlu: 4.046 ± 0.379
IlePhe: 1.979 ± 0.356
IleGly: 3.827 ± 0.377
IleHis: 0.836 ± 0.194
IleIle: 2.375 ± 0.366
IleLys: 4.398 ± 0.383
IleLeu: 3.343 ± 0.352
IleMet: 1.1 ± 0.234
IleAsn: 3.783 ± 0.381
IlePro: 4.002 ± 0.437
IleGln: 2.155 ± 0.306
IleArg: 3.211 ± 0.397
IleSer: 3.563 ± 0.303
IleThr: 3.871 ± 0.509
IleVal: 2.551 ± 0.47
IleTrp: 0.836 ± 0.182
IleTyr: 2.287 ± 0.304
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.938 ± 0.779
LysCys: 0.616 ± 0.191
LysAsp: 3.431 ± 0.394
LysGlu: 5.718 ± 0.597
LysPhe: 2.155 ± 0.35
LysGly: 3.871 ± 0.376
LysHis: 1.319 ± 0.257
LysIle: 3.035 ± 0.328
LysLys: 3.958 ± 0.472
LysLeu: 5.146 ± 0.425
LysMet: 2.155 ± 0.286
LysAsn: 2.639 ± 0.353
LysPro: 1.935 ± 0.353
LysGln: 2.419 ± 0.365
LysArg: 3.079 ± 0.495
LysSer: 3.343 ± 0.413
LysThr: 2.903 ± 0.315
LysVal: 4.31 ± 0.425
LysTrp: 1.232 ± 0.22
LysTyr: 2.375 ± 0.365
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.762 ± 0.446
LeuCys: 0.836 ± 0.211
LeuAsp: 4.354 ± 0.378
LeuGlu: 5.718 ± 0.549
LeuPhe: 2.903 ± 0.365
LeuGly: 4.794 ± 0.683
LeuHis: 1.144 ± 0.233
LeuIle: 4.134 ± 0.451
LeuLys: 4.618 ± 0.438
LeuLeu: 4.398 ± 0.495
LeuMet: 2.243 ± 0.285
LeuAsn: 4.046 ± 0.533
LeuPro: 3.211 ± 0.296
LeuGln: 3.255 ± 0.462
LeuArg: 4.09 ± 0.471
LeuSer: 5.63 ± 0.553
LeuThr: 4.662 ± 0.499
LeuVal: 4.706 ± 0.421
LeuTrp: 0.572 ± 0.166
LeuTyr: 2.551 ± 0.242
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.067 ± 0.287
MetCys: 0.308 ± 0.13
MetAsp: 1.671 ± 0.243
MetGlu: 1.803 ± 0.293
MetPhe: 1.495 ± 0.266
MetGly: 1.759 ± 0.281
MetHis: 0.264 ± 0.097
MetIle: 1.759 ± 0.335
MetLys: 1.847 ± 0.333
MetLeu: 2.023 ± 0.337
MetMet: 0.836 ± 0.17
MetAsn: 1.232 ± 0.247
MetPro: 0.836 ± 0.18
MetGln: 1.144 ± 0.226
MetArg: 1.583 ± 0.263
MetSer: 2.375 ± 0.3
MetThr: 1.803 ± 0.251
MetVal: 1.319 ± 0.237
MetTrp: 0.22 ± 0.098
MetTyr: 0.748 ± 0.174
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.09 ± 0.451
AsnCys: 0.396 ± 0.133
AsnAsp: 2.639 ± 0.328
AsnGlu: 1.891 ± 0.339
AsnPhe: 2.155 ± 0.292
AsnGly: 3.914 ± 0.46
AsnHis: 0.968 ± 0.217
AsnIle: 3.211 ± 0.362
AsnLys: 3.651 ± 0.389
AsnLeu: 5.278 ± 0.651
AsnMet: 0.968 ± 0.184
AsnAsn: 3.035 ± 0.46
AsnPro: 2.947 ± 0.425
AsnGln: 2.683 ± 0.46
AsnArg: 3.475 ± 0.356
AsnSer: 4.046 ± 0.47
AsnThr: 3.871 ± 0.397
AsnVal: 3.299 ± 0.381
AsnTrp: 0.88 ± 0.19
AsnTyr: 1.276 ± 0.247
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.991 ± 0.361
ProCys: 0.352 ± 0.133
ProAsp: 2.375 ± 0.309
ProGlu: 3.343 ± 0.563
ProPhe: 1.627 ± 0.266
ProGly: 1.847 ± 0.289
ProHis: 0.704 ± 0.185
ProIle: 2.023 ± 0.253
ProLys: 1.803 ± 0.26
ProLeu: 2.639 ± 0.307
ProMet: 1.012 ± 0.257
ProAsn: 2.111 ± 0.295
ProPro: 1.363 ± 0.229
ProGln: 1.495 ± 0.211
ProArg: 1.319 ± 0.253
ProSer: 3.211 ± 0.344
ProThr: 2.155 ± 0.364
ProVal: 3.431 ± 0.435
ProTrp: 0.484 ± 0.134
ProTyr: 1.012 ± 0.189
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.827 ± 0.608
GlnCys: 0.484 ± 0.19
GlnAsp: 2.243 ± 0.424
GlnGlu: 3.123 ± 0.481
GlnPhe: 1.627 ± 0.245
GlnGly: 2.463 ± 0.373
GlnHis: 0.704 ± 0.204
GlnIle: 2.067 ± 0.31
GlnLys: 2.023 ± 0.312
GlnLeu: 2.595 ± 0.395
GlnMet: 1.276 ± 0.223
GlnAsn: 2.155 ± 0.346
GlnPro: 1.056 ± 0.208
GlnGln: 3.079 ± 0.757
GlnArg: 2.463 ± 0.431
GlnSer: 2.815 ± 0.435
GlnThr: 2.419 ± 0.304
GlnVal: 2.419 ± 0.361
GlnTrp: 0.44 ± 0.128
GlnTyr: 1.495 ± 0.26
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.266 ± 0.679
ArgCys: 0.572 ± 0.14
ArgAsp: 3.958 ± 0.468
ArgGlu: 3.519 ± 0.469
ArgPhe: 2.023 ± 0.294
ArgGly: 3.123 ± 0.376
ArgHis: 1.232 ± 0.231
ArgIle: 3.123 ± 0.386
ArgLys: 3.299 ± 0.422
ArgLeu: 3.607 ± 0.349
ArgMet: 1.144 ± 0.22
ArgAsn: 2.947 ± 0.432
ArgPro: 1.891 ± 0.29
ArgGln: 2.639 ± 0.452
ArgArg: 3.123 ± 0.398
ArgSer: 3.431 ± 0.473
ArgThr: 2.771 ± 0.327
ArgVal: 3.519 ± 0.402
ArgTrp: 1.012 ± 0.194
ArgTyr: 1.847 ± 0.297
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.266 ± 0.439
SerCys: 0.836 ± 0.2
SerAsp: 4.134 ± 0.481
SerGlu: 3.871 ± 0.513
SerPhe: 2.507 ± 0.349
SerGly: 5.806 ± 0.706
SerHis: 1.319 ± 0.313
SerIle: 3.563 ± 0.358
SerLys: 3.827 ± 0.395
SerLeu: 5.058 ± 0.362
SerMet: 2.243 ± 0.319
SerAsn: 4.31 ± 0.584
SerPro: 2.507 ± 0.373
SerGln: 2.023 ± 0.387
SerArg: 3.871 ± 0.352
SerSer: 5.454 ± 0.629
SerThr: 4.222 ± 0.486
SerVal: 4.706 ± 0.435
SerTrp: 1.056 ± 0.218
SerTyr: 2.903 ± 0.364
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.19 ± 0.59
ThrCys: 0.44 ± 0.137
ThrAsp: 3.255 ± 0.397
ThrGlu: 3.431 ± 0.343
ThrPhe: 2.551 ± 0.418
ThrGly: 6.026 ± 0.684
ThrHis: 1.276 ± 0.215
ThrIle: 3.079 ± 0.395
ThrLys: 3.035 ± 0.396
ThrLeu: 4.618 ± 0.437
ThrMet: 0.616 ± 0.141
ThrAsn: 3.343 ± 0.482
ThrPro: 2.815 ± 0.324
ThrGln: 2.463 ± 0.321
ThrArg: 2.419 ± 0.258
ThrSer: 3.035 ± 0.433
ThrThr: 3.695 ± 0.438
ThrVal: 4.266 ± 0.411
ThrTrp: 0.968 ± 0.242
ThrTyr: 1.979 ± 0.291
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.014 ± 0.485
ValCys: 0.968 ± 0.299
ValAsp: 4.222 ± 0.544
ValGlu: 5.014 ± 0.454
ValPhe: 2.507 ± 0.4
ValGly: 4.926 ± 0.569
ValHis: 1.407 ± 0.217
ValIle: 3.783 ± 0.421
ValLys: 3.958 ± 0.419
ValLeu: 4.002 ± 0.385
ValMet: 1.627 ± 0.311
ValAsn: 3.783 ± 0.438
ValPro: 2.419 ± 0.326
ValGln: 2.287 ± 0.361
ValArg: 2.947 ± 0.336
ValSer: 4.31 ± 0.531
ValThr: 5.498 ± 0.577
ValVal: 4.31 ± 0.527
ValTrp: 1.363 ± 0.311
ValTyr: 2.155 ± 0.312
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.012 ± 0.21
TrpCys: 0.264 ± 0.119
TrpAsp: 0.968 ± 0.231
TrpGlu: 0.704 ± 0.156
TrpPhe: 0.528 ± 0.14
TrpGly: 0.88 ± 0.244
TrpHis: 0.264 ± 0.122
TrpIle: 0.968 ± 0.178
TrpLys: 1.012 ± 0.194
TrpLeu: 0.836 ± 0.197
TrpMet: 0.836 ± 0.211
TrpAsn: 0.924 ± 0.192
TrpPro: 0.352 ± 0.126
TrpGln: 0.352 ± 0.141
TrpArg: 0.704 ± 0.181
TrpSer: 1.276 ± 0.206
TrpThr: 1.056 ± 0.207
TrpVal: 1.056 ± 0.219
TrpTrp: 0.132 ± 0.085
TrpTyr: 0.44 ± 0.132
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.287 ± 0.237
TyrCys: 0.352 ± 0.115
TyrAsp: 2.111 ± 0.351
TyrGlu: 2.507 ± 0.3
TyrPhe: 1.144 ± 0.207
TyrGly: 2.243 ± 0.286
TyrHis: 0.836 ± 0.185
TyrIle: 1.803 ± 0.273
TyrLys: 2.375 ± 0.298
TyrLeu: 2.331 ± 0.299
TyrMet: 1.1 ± 0.191
TyrAsn: 2.067 ± 0.266
TyrPro: 1.627 ± 0.283
TyrGln: 1.539 ± 0.252
TyrArg: 2.287 ± 0.358
TyrSer: 2.727 ± 0.399
TyrThr: 2.199 ± 0.36
TyrVal: 2.287 ± 0.37
TyrTrp: 0.484 ± 0.131
TyrTyr: 1.407 ± 0.256
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 120 proteins (22737 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
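For readers unfamiliar with the technique, a protein-level bootstrap can be sketched as follows. The page does not describe the exact procedure, so the details here are assumptions: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function name `bootstrap_error` and its parameters are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency.

    Assumption: each round resamples whole proteins with replacement,
    keeping the number of proteins equal to the original set.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # missing keys count as 0
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MAAC", "MAC", "MKAAL"]  # made-up sequences, for illustration only
    mean, err = bootstrap_error(toy, "AA")
    print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps each protein's internal composition intact, so the error reflects variation between proteins in the proteome.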

You can download the above dipeptide statistics (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski