Amino acid dipeptide frequency for Escherichia phage KBNP1711

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
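As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy one-letter sequences, and the choice to normalise by the total number of counted dipeptides are assumptions made for this example; they are not taken from the Proteome-pI pipeline.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the number of counted dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not from the phage proteome).
freqs = dipeptide_frequencies(["MAAK", "AAG"])
```

Here the two toy sequences contain five dipeptides in total, two of which are "AA", so `freqs["AA"]` is 400 per mille.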

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and those that are under-represented are marked in blue in the table.
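The comparison against the uniform expectation can be sketched as follows. The four observed values are copied from the table; the labels "over" and "under" stand in for the red and blue marking. The exact rule the site uses for colouring (for example, whether the error estimate is taken into account) is not stated, so the plain comparison below is an assumption.

```python
# 20 x 20 standard dipeptides; the uniform expectation is 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Values in per mille, taken from the table for this proteome.
observed = {"AlaAla": 7.31, "GluPhe": 2.88, "CysCys": 0.177, "TrpTrp": 0.089}
labels = {pair: classify(value) for pair, value in observed.items()}
```

With these inputs AlaAla and GluPhe come out as over-represented, while CysCys and TrpTrp come out as under-represented.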

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.31 ± 1.171
AlaCys: 0.62 ± 0.172
AlaAsp: 5.183 ± 0.625
AlaGlu: 5.051 ± 0.907
AlaPhe: 3.456 ± 0.352
AlaGly: 5.183 ± 0.781
AlaHis: 1.418 ± 0.254
AlaIle: 4.918 ± 0.481
AlaLys: 4.829 ± 0.847
AlaLeu: 7.177 ± 0.702
AlaMet: 2.127 ± 0.286
AlaAsn: 4.12 ± 0.438
AlaPro: 3.057 ± 0.436
AlaGln: 4.342 ± 0.623
AlaArg: 4.475 ± 0.827
AlaSer: 4.785 ± 0.475
AlaThr: 3.544 ± 0.459
AlaVal: 4.342 ± 0.397
AlaTrp: 1.063 ± 0.195
AlaTyr: 2.215 ± 0.308
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.152 ± 0.272
CysCys: 0.177 ± 0.109
CysAsp: 0.532 ± 0.15
CysGlu: 0.62 ± 0.155
CysPhe: 0.399 ± 0.131
CysGly: 1.373 ± 0.276
CysHis: 0.266 ± 0.098
CysIle: 0.93 ± 0.246
CysLys: 1.196 ± 0.267
CysLeu: 0.62 ± 0.199
CysMet: 0.354 ± 0.121
CysAsn: 0.487 ± 0.147
CysPro: 0.31 ± 0.109
CysGln: 0.31 ± 0.124
CysArg: 0.532 ± 0.141
CysSer: 0.709 ± 0.183
CysThr: 0.709 ± 0.225
CysVal: 1.063 ± 0.216
CysTrp: 0.133 ± 0.063
CysTyr: 0.576 ± 0.171
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.494 ± 0.562
AspCys: 0.665 ± 0.193
AspAsp: 3.367 ± 0.438
AspGlu: 4.475 ± 0.635
AspPhe: 2.348 ± 0.324
AspGly: 4.652 ± 0.548
AspHis: 0.709 ± 0.208
AspIle: 4.253 ± 0.5
AspLys: 3.633 ± 0.396
AspLeu: 4.873 ± 0.483
AspMet: 2.127 ± 0.312
AspAsn: 3.81 ± 0.377
AspPro: 2.082 ± 0.407
AspGln: 1.905 ± 0.375
AspArg: 2.658 ± 0.332
AspSer: 4.164 ± 0.392
AspThr: 3.278 ± 0.384
AspVal: 5.051 ± 0.517
AspTrp: 0.753 ± 0.171
AspTyr: 2.525 ± 0.36
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.247 ± 0.859
GluCys: 0.886 ± 0.169
GluAsp: 5.139 ± 0.536
GluGlu: 6.69 ± 0.799
GluPhe: 2.88 ± 0.369
GluGly: 4.652 ± 0.496
GluHis: 1.285 ± 0.268
GluIle: 3.766 ± 0.41
GluLys: 3.81 ± 0.535
GluLeu: 4.607 ± 0.496
GluMet: 2.481 ± 0.365
GluAsn: 2.747 ± 0.281
GluPro: 1.905 ± 0.306
GluGln: 2.525 ± 0.392
GluArg: 4.164 ± 0.537
GluSer: 3.943 ± 0.499
GluThr: 2.968 ± 0.347
GluVal: 5.006 ± 0.488
GluTrp: 1.285 ± 0.249
GluTyr: 2.348 ± 0.355
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.304 ± 0.311
PheCys: 0.886 ± 0.217
PheAsp: 3.323 ± 0.43
PheGlu: 2.392 ± 0.344
PhePhe: 1.418 ± 0.248
PheGly: 2.481 ± 0.361
PheHis: 0.354 ± 0.133
PheIle: 1.861 ± 0.256
PheLys: 2.437 ± 0.388
PheLeu: 3.323 ± 0.435
PheMet: 0.797 ± 0.177
PheAsn: 3.057 ± 0.367
PhePro: 1.108 ± 0.215
PheGln: 1.462 ± 0.234
PheArg: 2.259 ± 0.302
PheSer: 2.702 ± 0.381
PheThr: 2.304 ± 0.282
PheVal: 2.57 ± 0.32
PheTrp: 0.487 ± 0.135
PheTyr: 2.038 ± 0.296
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.449 ± 0.756
GlyCys: 0.886 ± 0.209
GlyAsp: 4.475 ± 0.429
GlyGlu: 4.076 ± 0.522
GlyPhe: 3.456 ± 0.333
GlyGly: 6.424 ± 0.843
GlyHis: 1.196 ± 0.276
GlyIle: 3.323 ± 0.45
GlyLys: 4.209 ± 0.459
GlyLeu: 5.361 ± 0.626
GlyMet: 1.949 ± 0.357
GlyAsn: 3.456 ± 0.393
GlyPro: 1.063 ± 0.199
GlyGln: 2.614 ± 0.396
GlyArg: 3.899 ± 0.492
GlySer: 6.38 ± 0.695
GlyThr: 3.633 ± 0.422
GlyVal: 5.494 ± 0.58
GlyTrp: 1.019 ± 0.194
GlyTyr: 2.392 ± 0.332
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.063 ± 0.248
HisCys: 0.133 ± 0.083
HisAsp: 1.24 ± 0.3
HisGlu: 1.196 ± 0.27
HisPhe: 0.709 ± 0.207
HisGly: 1.063 ± 0.231
HisHis: 0.354 ± 0.113
HisIle: 1.24 ± 0.274
HisLys: 1.063 ± 0.237
HisLeu: 1.24 ± 0.249
HisMet: 0.399 ± 0.118
HisAsn: 1.506 ± 0.247
HisPro: 0.62 ± 0.151
HisGln: 0.487 ± 0.153
HisArg: 1.108 ± 0.235
HisSer: 1.152 ± 0.297
HisThr: 0.709 ± 0.153
HisVal: 1.019 ± 0.215
HisTrp: 0.089 ± 0.075
HisTyr: 0.842 ± 0.243
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.5 ± 0.34
IleCys: 0.93 ± 0.218
IleAsp: 3.5 ± 0.403
IleGlu: 3.589 ± 0.374
IlePhe: 1.905 ± 0.355
IleGly: 3.544 ± 0.389
IleHis: 0.709 ± 0.216
IleIle: 3.145 ± 0.414
IleLys: 3.899 ± 0.479
IleLeu: 3.456 ± 0.38
IleMet: 1.063 ± 0.239
IleAsn: 4.164 ± 0.431
IlePro: 3.899 ± 0.493
IleGln: 1.905 ± 0.295
IleArg: 3.145 ± 0.32
IleSer: 4.342 ± 0.366
IleThr: 4.032 ± 0.46
IleVal: 3.145 ± 0.481
IleTrp: 0.797 ± 0.213
IleTyr: 1.551 ± 0.311
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.139 ± 0.84
LysCys: 0.62 ± 0.188
LysAsp: 3.81 ± 0.384
LysGlu: 5.759 ± 0.586
LysPhe: 2.038 ± 0.276
LysGly: 4.032 ± 0.488
LysHis: 1.595 ± 0.328
LysIle: 3.19 ± 0.415
LysLys: 4.563 ± 0.502
LysLeu: 4.519 ± 0.369
LysMet: 2.082 ± 0.347
LysAsn: 3.057 ± 0.39
LysPro: 1.728 ± 0.332
LysGln: 2.835 ± 0.35
LysArg: 2.304 ± 0.374
LysSer: 3.5 ± 0.385
LysThr: 3.677 ± 0.485
LysVal: 4.785 ± 0.454
LysTrp: 1.019 ± 0.208
LysTyr: 1.816 ± 0.303
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.582 ± 0.582
LeuCys: 1.196 ± 0.273
LeuAsp: 4.12 ± 0.376
LeuGlu: 5.804 ± 0.588
LeuPhe: 2.791 ± 0.44
LeuGly: 5.051 ± 0.6
LeuHis: 0.975 ± 0.194
LeuIle: 3.589 ± 0.549
LeuLys: 4.696 ± 0.45
LeuLeu: 4.785 ± 0.535
LeuMet: 2.259 ± 0.275
LeuAsn: 3.899 ± 0.469
LeuPro: 3.323 ± 0.357
LeuGln: 3.145 ± 0.486
LeuArg: 3.943 ± 0.413
LeuSer: 5.449 ± 0.562
LeuThr: 4.519 ± 0.509
LeuVal: 5.006 ± 0.559
LeuTrp: 0.753 ± 0.147
LeuTyr: 2.392 ± 0.296
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.525 ± 0.32
MetCys: 0.222 ± 0.132
MetAsp: 1.861 ± 0.264
MetGlu: 2.038 ± 0.252
MetPhe: 1.373 ± 0.232
MetGly: 1.551 ± 0.292
MetHis: 0.354 ± 0.12
MetIle: 1.285 ± 0.244
MetLys: 2.259 ± 0.379
MetLeu: 2.215 ± 0.368
MetMet: 0.797 ± 0.179
MetAsn: 1.506 ± 0.226
MetPro: 0.797 ± 0.161
MetGln: 0.975 ± 0.214
MetArg: 1.418 ± 0.221
MetSer: 2.348 ± 0.301
MetThr: 1.816 ± 0.246
MetVal: 1.108 ± 0.289
MetTrp: 0.31 ± 0.102
MetTyr: 0.709 ± 0.195
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.051 ± 0.508
AsnCys: 0.266 ± 0.105
AsnAsp: 2.924 ± 0.269
AsnGlu: 2.437 ± 0.361
AsnPhe: 2.437 ± 0.343
AsnGly: 4.43 ± 0.597
AsnHis: 0.709 ± 0.184
AsnIle: 3.057 ± 0.327
AsnLys: 3.367 ± 0.405
AsnLeu: 3.766 ± 0.489
AsnMet: 1.506 ± 0.29
AsnAsn: 3.145 ± 0.481
AsnPro: 3.145 ± 0.459
AsnGln: 2.747 ± 0.46
AsnArg: 3.145 ± 0.42
AsnSer: 3.854 ± 0.43
AsnThr: 3.5 ± 0.464
AsnVal: 3.766 ± 0.418
AsnTrp: 0.975 ± 0.197
AsnTyr: 1.772 ± 0.256
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.747 ± 0.329
ProCys: 0.399 ± 0.13
ProAsp: 2.88 ± 0.399
ProGlu: 2.88 ± 0.403
ProPhe: 1.639 ± 0.303
ProGly: 1.506 ± 0.251
ProHis: 0.709 ± 0.198
ProIle: 2.127 ± 0.266
ProLys: 2.304 ± 0.346
ProLeu: 2.658 ± 0.398
ProMet: 0.886 ± 0.232
ProAsn: 2.038 ± 0.313
ProPro: 1.418 ± 0.248
ProGln: 1.772 ± 0.268
ProArg: 1.24 ± 0.268
ProSer: 3.013 ± 0.441
ProThr: 2.127 ± 0.331
ProVal: 3.278 ± 0.352
ProTrp: 0.354 ± 0.103
ProTyr: 0.842 ± 0.176
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.12 ± 0.674
GlnCys: 0.709 ± 0.194
GlnAsp: 1.994 ± 0.301
GlnGlu: 2.835 ± 0.389
GlnPhe: 1.418 ± 0.201
GlnGly: 2.259 ± 0.413
GlnHis: 0.532 ± 0.152
GlnIle: 2.392 ± 0.355
GlnLys: 2.348 ± 0.348
GlnLeu: 2.835 ± 0.453
GlnMet: 1.108 ± 0.226
GlnAsn: 2.348 ± 0.437
GlnPro: 1.152 ± 0.201
GlnGln: 2.392 ± 0.669
GlnArg: 2.392 ± 0.284
GlnSer: 2.304 ± 0.335
GlnThr: 2.304 ± 0.342
GlnVal: 2.88 ± 0.407
GlnTrp: 0.487 ± 0.155
GlnTyr: 1.684 ± 0.285
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.81 ± 0.675
ArgCys: 0.709 ± 0.18
ArgAsp: 3.323 ± 0.337
ArgGlu: 3.854 ± 0.557
ArgPhe: 2.082 ± 0.261
ArgGly: 2.835 ± 0.341
ArgHis: 0.93 ± 0.18
ArgIle: 3.278 ± 0.377
ArgLys: 2.747 ± 0.439
ArgLeu: 3.633 ± 0.371
ArgMet: 1.373 ± 0.229
ArgAsn: 2.968 ± 0.39
ArgPro: 1.684 ± 0.184
ArgGln: 2.525 ± 0.42
ArgArg: 3.234 ± 0.467
ArgSer: 3.633 ± 0.393
ArgThr: 2.658 ± 0.38
ArgVal: 4.032 ± 0.435
ArgTrp: 0.886 ± 0.187
ArgTyr: 1.816 ± 0.3
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.829 ± 0.591
SerCys: 0.532 ± 0.151
SerAsp: 4.164 ± 0.426
SerGlu: 3.456 ± 0.395
SerPhe: 2.437 ± 0.343
SerGly: 6.247 ± 0.659
SerHis: 1.551 ± 0.29
SerIle: 3.81 ± 0.395
SerLys: 4.032 ± 0.47
SerLeu: 5.981 ± 0.485
SerMet: 1.905 ± 0.373
SerAsn: 4.209 ± 0.535
SerPro: 2.791 ± 0.504
SerGln: 1.816 ± 0.29
SerArg: 3.5 ± 0.327
SerSer: 4.297 ± 0.472
SerThr: 4.342 ± 0.496
SerVal: 4.12 ± 0.442
SerTrp: 1.063 ± 0.198
SerTyr: 2.614 ± 0.362
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.209 ± 0.586
ThrCys: 0.62 ± 0.185
ThrAsp: 3.411 ± 0.37
ThrGlu: 3.943 ± 0.415
ThrPhe: 2.614 ± 0.37
ThrGly: 5.361 ± 0.58
ThrHis: 0.975 ± 0.222
ThrIle: 3.367 ± 0.512
ThrLys: 3.145 ± 0.325
ThrLeu: 4.563 ± 0.427
ThrMet: 0.665 ± 0.147
ThrAsn: 2.525 ± 0.34
ThrPro: 2.835 ± 0.302
ThrGln: 1.949 ± 0.266
ThrArg: 2.791 ± 0.315
ThrSer: 3.19 ± 0.398
ThrThr: 3.633 ± 0.565
ThrVal: 4.873 ± 0.528
ThrTrp: 1.019 ± 0.294
ThrTyr: 1.994 ± 0.32
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.183 ± 0.431
ValCys: 1.373 ± 0.332
ValAsp: 4.297 ± 0.588
ValGlu: 4.873 ± 0.471
ValPhe: 2.348 ± 0.337
ValGly: 4.785 ± 0.439
ValHis: 1.462 ± 0.283
ValIle: 4.032 ± 0.417
ValLys: 4.475 ± 0.414
ValLeu: 4.342 ± 0.397
ValMet: 1.816 ± 0.265
ValAsn: 4.209 ± 0.469
ValPro: 2.171 ± 0.323
ValGln: 2.57 ± 0.383
ValArg: 2.835 ± 0.385
ValSer: 4.696 ± 0.462
ValThr: 4.918 ± 0.596
ValVal: 4.873 ± 0.475
ValTrp: 1.418 ± 0.326
ValTyr: 2.968 ± 0.387
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.93 ± 0.187
TrpCys: 0.31 ± 0.112
TrpAsp: 1.019 ± 0.197
TrpGlu: 0.797 ± 0.162
TrpPhe: 0.665 ± 0.179
TrpGly: 1.063 ± 0.253
TrpHis: 0.31 ± 0.127
TrpIle: 1.019 ± 0.211
TrpLys: 0.886 ± 0.178
TrpLeu: 0.842 ± 0.23
TrpMet: 0.487 ± 0.168
TrpAsn: 0.93 ± 0.184
TrpPro: 0.31 ± 0.127
TrpGln: 0.354 ± 0.126
TrpArg: 0.797 ± 0.164
TrpSer: 0.93 ± 0.19
TrpThr: 1.108 ± 0.195
TrpVal: 1.108 ± 0.211
TrpTrp: 0.089 ± 0.095
TrpTyr: 0.443 ± 0.138
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.57 ± 0.28
TyrCys: 0.31 ± 0.117
TyrAsp: 2.259 ± 0.377
TyrGlu: 2.57 ± 0.328
TyrPhe: 1.152 ± 0.25
TyrGly: 2.259 ± 0.277
TyrHis: 0.842 ± 0.229
TyrIle: 1.639 ± 0.287
TyrLys: 2.215 ± 0.272
TyrLeu: 2.481 ± 0.29
TyrMet: 1.24 ± 0.234
TyrAsn: 1.816 ± 0.234
TyrPro: 1.418 ± 0.271
TyrGln: 1.861 ± 0.216
TyrArg: 2.171 ± 0.294
TyrSer: 2.304 ± 0.322
TyrThr: 1.905 ± 0.29
TyrVal: 2.127 ± 0.305
TyrTrp: 0.399 ± 0.141
TyrTyr: 1.329 ± 0.243
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 126 proteins (22573 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
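One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The source only states that 100 bootstrap rounds were run at the protein level; using the sample standard deviation as the error, normalising by the number of counted dipeptides, and the names `pair_counts` and `bootstrap_error` are all assumptions of this example.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides in a single one-letter protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread (per mille) of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the sample standard deviation.
    return stdev(estimates)


# Toy input (hypothetical sequences, not from the phage proteome).
err = bootstrap_error(["MAAK", "AAG", "MKKL"], "AA")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.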

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski