Amino acid dipepetide frequency for Escherichia phage vB_Eco_SLUR26

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.519AlaAla: 10.519 ± 1.589
1.254AlaCys: 1.254 ± 0.255
4.306AlaAsp: 4.306 ± 0.545
5.941AlaGlu: 5.941 ± 0.829
3.488AlaPhe: 3.488 ± 0.612
5.178AlaGly: 5.178 ± 0.552
1.526AlaHis: 1.526 ± 0.272
4.687AlaIle: 4.687 ± 0.571
6.323AlaLys: 6.323 ± 0.625
7.249AlaLeu: 7.249 ± 0.665
3.815AlaMet: 3.815 ± 0.691
3.597AlaAsn: 3.597 ± 0.497
3.924AlaPro: 3.924 ± 0.702
4.633AlaGln: 4.633 ± 0.856
4.033AlaArg: 4.033 ± 0.603
5.287AlaSer: 5.287 ± 0.678
5.996AlaThr: 5.996 ± 0.817
5.669AlaVal: 5.669 ± 0.492
1.417AlaTrp: 1.417 ± 0.239
2.725AlaTyr: 2.725 ± 0.437
0.0AlaXaa: 0.0 ± 0.0
Cys
0.872CysAla: 0.872 ± 0.224
0.436CysCys: 0.436 ± 0.214
1.145CysAsp: 1.145 ± 0.234
0.981CysGlu: 0.981 ± 0.268
0.436CysPhe: 0.436 ± 0.124
1.363CysGly: 1.363 ± 0.418
0.491CysHis: 0.491 ± 0.183
0.872CysIle: 0.872 ± 0.234
0.927CysLys: 0.927 ± 0.21
0.818CysLeu: 0.818 ± 0.234
0.218CysMet: 0.218 ± 0.103
0.872CysAsn: 0.872 ± 0.215
0.709CysPro: 0.709 ± 0.16
0.6CysGln: 0.6 ± 0.166
0.709CysArg: 0.709 ± 0.188
0.654CysSer: 0.654 ± 0.197
0.491CysThr: 0.491 ± 0.146
1.145CysVal: 1.145 ± 0.227
0.273CysTrp: 0.273 ± 0.159
0.6CysTyr: 0.6 ± 0.191
0.0CysXaa: 0.0 ± 0.0
Asp
5.669AspAla: 5.669 ± 0.509
0.763AspCys: 0.763 ± 0.2
2.453AspAsp: 2.453 ± 0.483
3.924AspGlu: 3.924 ± 0.418
2.398AspPhe: 2.398 ± 0.345
4.905AspGly: 4.905 ± 0.561
1.254AspHis: 1.254 ± 0.256
3.761AspIle: 3.761 ± 0.478
3.434AspLys: 3.434 ± 0.513
4.36AspLeu: 4.36 ± 0.407
1.417AspMet: 1.417 ± 0.252
2.289AspAsn: 2.289 ± 0.384
2.453AspPro: 2.453 ± 0.392
1.853AspGln: 1.853 ± 0.241
2.616AspArg: 2.616 ± 0.326
2.943AspSer: 2.943 ± 0.421
2.562AspThr: 2.562 ± 0.383
3.87AspVal: 3.87 ± 0.347
1.199AspTrp: 1.199 ± 0.224
2.398AspTyr: 2.398 ± 0.357
0.0AspXaa: 0.0 ± 0.0
Glu
6.922GluAla: 6.922 ± 0.781
0.981GluCys: 0.981 ± 0.264
4.142GluAsp: 4.142 ± 0.486
4.033GluGlu: 4.033 ± 0.511
2.017GluPhe: 2.017 ± 0.401
4.36GluGly: 4.36 ± 0.503
1.09GluHis: 1.09 ± 0.303
3.706GluIle: 3.706 ± 0.506
3.597GluLys: 3.597 ± 0.43
5.396GluLeu: 5.396 ± 0.537
2.235GluMet: 2.235 ± 0.399
3.052GluAsn: 3.052 ± 0.332
2.126GluPro: 2.126 ± 0.389
3.597GluGln: 3.597 ± 0.421
3.597GluArg: 3.597 ± 0.447
2.78GluSer: 2.78 ± 0.435
3.161GluThr: 3.161 ± 0.44
3.87GluVal: 3.87 ± 0.421
1.036GluTrp: 1.036 ± 0.198
2.289GluTyr: 2.289 ± 0.346
0.0GluXaa: 0.0 ± 0.0
Phe
3.325PheAla: 3.325 ± 0.44
0.6PheCys: 0.6 ± 0.238
2.453PheAsp: 2.453 ± 0.416
2.071PheGlu: 2.071 ± 0.339
1.199PhePhe: 1.199 ± 0.201
2.507PheGly: 2.507 ± 0.382
0.927PheHis: 0.927 ± 0.228
2.507PheIle: 2.507 ± 0.483
2.453PheLys: 2.453 ± 0.286
2.834PheLeu: 2.834 ± 0.435
1.036PheMet: 1.036 ± 0.235
2.071PheAsn: 2.071 ± 0.287
1.472PhePro: 1.472 ± 0.297
0.763PheGln: 0.763 ± 0.216
1.908PheArg: 1.908 ± 0.305
2.071PheSer: 2.071 ± 0.367
2.017PheThr: 2.017 ± 0.352
2.562PheVal: 2.562 ± 0.475
0.382PheTrp: 0.382 ± 0.144
0.981PheTyr: 0.981 ± 0.264
0.0PheXaa: 0.0 ± 0.0
Gly
6.105GlyAla: 6.105 ± 0.74
1.145GlyCys: 1.145 ± 0.253
3.379GlyAsp: 3.379 ± 0.319
4.306GlyGlu: 4.306 ± 0.426
2.889GlyPhe: 2.889 ± 0.484
5.069GlyGly: 5.069 ± 0.614
1.363GlyHis: 1.363 ± 0.344
3.543GlyIle: 3.543 ± 0.383
4.796GlyLys: 4.796 ± 0.473
5.832GlyLeu: 5.832 ± 0.488
1.962GlyMet: 1.962 ± 0.347
3.161GlyAsn: 3.161 ± 0.433
2.18GlyPro: 2.18 ± 0.295
2.889GlyGln: 2.889 ± 0.356
3.761GlyArg: 3.761 ± 0.335
4.633GlySer: 4.633 ± 0.504
4.524GlyThr: 4.524 ± 0.536
5.832GlyVal: 5.832 ± 0.49
1.417GlyTrp: 1.417 ± 0.283
3.052GlyTyr: 3.052 ± 0.497
0.0GlyXaa: 0.0 ± 0.0
His
1.472HisAla: 1.472 ± 0.301
0.273HisCys: 0.273 ± 0.108
0.818HisAsp: 0.818 ± 0.221
1.199HisGlu: 1.199 ± 0.339
0.818HisPhe: 0.818 ± 0.234
1.526HisGly: 1.526 ± 0.285
0.6HisHis: 0.6 ± 0.18
0.981HisIle: 0.981 ± 0.233
0.872HisLys: 0.872 ± 0.208
1.363HisLeu: 1.363 ± 0.298
0.6HisMet: 0.6 ± 0.203
0.872HisAsn: 0.872 ± 0.245
0.818HisPro: 0.818 ± 0.194
0.927HisGln: 0.927 ± 0.229
0.654HisArg: 0.654 ± 0.219
1.145HisSer: 1.145 ± 0.236
1.145HisThr: 1.145 ± 0.225
1.417HisVal: 1.417 ± 0.306
0.382HisTrp: 0.382 ± 0.145
0.763HisTyr: 0.763 ± 0.267
0.0HisXaa: 0.0 ± 0.0
Ile
6.159IleAla: 6.159 ± 0.596
0.927IleCys: 0.927 ± 0.238
3.87IleAsp: 3.87 ± 0.552
4.633IleGlu: 4.633 ± 0.397
1.526IlePhe: 1.526 ± 0.32
3.87IleGly: 3.87 ± 0.505
0.818IleHis: 0.818 ± 0.206
3.87IleIle: 3.87 ± 0.399
3.379IleLys: 3.379 ± 0.515
3.434IleLeu: 3.434 ± 0.351
1.581IleMet: 1.581 ± 0.328
3.761IleAsn: 3.761 ± 0.386
2.671IlePro: 2.671 ± 0.296
1.744IleGln: 1.744 ± 0.28
3.052IleArg: 3.052 ± 0.464
3.543IleSer: 3.543 ± 0.417
3.597IleThr: 3.597 ± 0.4
4.142IleVal: 4.142 ± 0.596
0.6IleTrp: 0.6 ± 0.156
1.744IleTyr: 1.744 ± 0.267
0.0IleXaa: 0.0 ± 0.0
Lys
6.105LysAla: 6.105 ± 0.857
0.491LysCys: 0.491 ± 0.182
3.815LysAsp: 3.815 ± 0.552
4.796LysGlu: 4.796 ± 0.609
2.889LysPhe: 2.889 ± 0.404
4.251LysGly: 4.251 ± 0.444
1.853LysHis: 1.853 ± 0.259
3.652LysIle: 3.652 ± 0.408
3.761LysLys: 3.761 ± 0.602
4.742LysLeu: 4.742 ± 0.578
1.472LysMet: 1.472 ± 0.343
2.834LysAsn: 2.834 ± 0.331
2.507LysPro: 2.507 ± 0.329
2.126LysGln: 2.126 ± 0.346
2.889LysArg: 2.889 ± 0.484
2.78LysSer: 2.78 ± 0.337
3.216LysThr: 3.216 ± 0.456
4.96LysVal: 4.96 ± 0.542
0.763LysTrp: 0.763 ± 0.25
2.344LysTyr: 2.344 ± 0.385
0.0LysXaa: 0.0 ± 0.0
Leu
6.268LeuAla: 6.268 ± 0.829
1.199LeuCys: 1.199 ± 0.255
4.742LeuAsp: 4.742 ± 0.457
4.415LeuGlu: 4.415 ± 0.506
2.398LeuPhe: 2.398 ± 0.428
5.123LeuGly: 5.123 ± 0.494
1.526LeuHis: 1.526 ± 0.272
4.796LeuIle: 4.796 ± 0.606
4.633LeuLys: 4.633 ± 0.471
4.905LeuLeu: 4.905 ± 0.694
1.69LeuMet: 1.69 ± 0.267
4.96LeuAsn: 4.96 ± 0.582
4.142LeuPro: 4.142 ± 0.593
3.216LeuGln: 3.216 ± 0.371
5.014LeuArg: 5.014 ± 0.447
4.469LeuSer: 4.469 ± 0.465
5.123LeuThr: 5.123 ± 0.474
4.033LeuVal: 4.033 ± 0.431
0.654LeuTrp: 0.654 ± 0.164
2.344LeuTyr: 2.344 ± 0.416
0.0LeuXaa: 0.0 ± 0.0
Met
2.344MetAla: 2.344 ± 0.391
0.327MetCys: 0.327 ± 0.146
1.308MetAsp: 1.308 ± 0.204
2.235MetGlu: 2.235 ± 0.329
0.654MetPhe: 0.654 ± 0.211
2.562MetGly: 2.562 ± 0.434
0.436MetHis: 0.436 ± 0.143
1.581MetIle: 1.581 ± 0.292
2.235MetLys: 2.235 ± 0.301
2.398MetLeu: 2.398 ± 0.405
0.654MetMet: 0.654 ± 0.201
1.036MetAsn: 1.036 ± 0.227
1.145MetPro: 1.145 ± 0.231
1.417MetGln: 1.417 ± 0.314
1.581MetArg: 1.581 ± 0.321
2.071MetSer: 2.071 ± 0.377
1.908MetThr: 1.908 ± 0.336
1.254MetVal: 1.254 ± 0.31
0.218MetTrp: 0.218 ± 0.104
1.254MetTyr: 1.254 ± 0.242
0.0MetXaa: 0.0 ± 0.0
Asn
3.706AsnAla: 3.706 ± 0.508
0.763AsnCys: 0.763 ± 0.222
3.216AsnAsp: 3.216 ± 0.397
2.725AsnGlu: 2.725 ± 0.394
1.69AsnPhe: 1.69 ± 0.256
5.341AsnGly: 5.341 ± 0.661
0.872AsnHis: 0.872 ± 0.221
2.834AsnIle: 2.834 ± 0.483
2.562AsnLys: 2.562 ± 0.422
3.488AsnLeu: 3.488 ± 0.39
1.199AsnMet: 1.199 ± 0.223
3.216AsnAsn: 3.216 ± 0.486
2.344AsnPro: 2.344 ± 0.473
1.526AsnGln: 1.526 ± 0.218
2.78AsnArg: 2.78 ± 0.352
1.962AsnSer: 1.962 ± 0.46
2.78AsnThr: 2.78 ± 0.417
4.033AsnVal: 4.033 ± 0.517
0.818AsnTrp: 0.818 ± 0.203
1.69AsnTyr: 1.69 ± 0.292
0.0AsnXaa: 0.0 ± 0.0
Pro
3.052ProAla: 3.052 ± 0.542
0.436ProCys: 0.436 ± 0.14
2.671ProAsp: 2.671 ± 0.423
3.488ProGlu: 3.488 ± 0.426
1.799ProPhe: 1.799 ± 0.349
2.943ProGly: 2.943 ± 0.387
0.709ProHis: 0.709 ± 0.203
2.126ProIle: 2.126 ± 0.318
2.562ProLys: 2.562 ± 0.433
2.616ProLeu: 2.616 ± 0.466
1.581ProMet: 1.581 ± 0.293
2.017ProAsn: 2.017 ± 0.341
1.69ProPro: 1.69 ± 0.337
1.908ProGln: 1.908 ± 0.414
1.363ProArg: 1.363 ± 0.298
3.107ProSer: 3.107 ± 0.388
2.235ProThr: 2.235 ± 0.393
3.27ProVal: 3.27 ± 0.651
0.818ProTrp: 0.818 ± 0.3
1.363ProTyr: 1.363 ± 0.293
0.0ProXaa: 0.0 ± 0.0
Gln
4.306GlnAla: 4.306 ± 0.797
0.654GlnCys: 0.654 ± 0.177
2.507GlnAsp: 2.507 ± 0.382
2.289GlnGlu: 2.289 ± 0.325
1.254GlnPhe: 1.254 ± 0.24
3.161GlnGly: 3.161 ± 0.415
0.818GlnHis: 0.818 ± 0.205
2.344GlnIle: 2.344 ± 0.331
1.853GlnLys: 1.853 ± 0.369
3.924GlnLeu: 3.924 ± 0.471
1.472GlnMet: 1.472 ± 0.264
1.799GlnAsn: 1.799 ± 0.283
2.18GlnPro: 2.18 ± 0.718
3.488GlnGln: 3.488 ± 1.345
2.344GlnArg: 2.344 ± 0.358
1.744GlnSer: 1.744 ± 0.359
2.398GlnThr: 2.398 ± 0.351
2.943GlnVal: 2.943 ± 0.382
0.6GlnTrp: 0.6 ± 0.185
1.254GlnTyr: 1.254 ± 0.311
0.0GlnXaa: 0.0 ± 0.0
Arg
3.488ArgAla: 3.488 ± 0.45
0.763ArgCys: 0.763 ± 0.282
2.998ArgAsp: 2.998 ± 0.384
2.998ArgGlu: 2.998 ± 0.44
1.853ArgPhe: 1.853 ± 0.259
3.379ArgGly: 3.379 ± 0.379
0.872ArgHis: 0.872 ± 0.214
3.216ArgIle: 3.216 ± 0.36
3.434ArgLys: 3.434 ± 0.528
4.524ArgLeu: 4.524 ± 0.504
1.472ArgMet: 1.472 ± 0.267
2.071ArgAsn: 2.071 ± 0.327
2.071ArgPro: 2.071 ± 0.343
1.853ArgGln: 1.853 ± 0.245
2.78ArgArg: 2.78 ± 0.374
2.671ArgSer: 2.671 ± 0.343
2.889ArgThr: 2.889 ± 0.396
3.597ArgVal: 3.597 ± 0.416
1.199ArgTrp: 1.199 ± 0.275
2.235ArgTyr: 2.235 ± 0.334
0.0ArgXaa: 0.0 ± 0.0
Ser
5.723SerAla: 5.723 ± 0.664
0.818SerCys: 0.818 ± 0.206
2.725SerAsp: 2.725 ± 0.498
2.943SerGlu: 2.943 ± 0.375
2.616SerPhe: 2.616 ± 0.436
4.415SerGly: 4.415 ± 0.462
0.709SerHis: 0.709 ± 0.182
3.052SerIle: 3.052 ± 0.444
2.998SerLys: 2.998 ± 0.341
4.251SerLeu: 4.251 ± 0.564
1.581SerMet: 1.581 ± 0.35
2.562SerAsn: 2.562 ± 0.346
2.071SerPro: 2.071 ± 0.421
2.671SerGln: 2.671 ± 0.427
2.834SerArg: 2.834 ± 0.405
3.488SerSer: 3.488 ± 0.716
3.488SerThr: 3.488 ± 0.491
3.434SerVal: 3.434 ± 0.379
1.09SerTrp: 1.09 ± 0.227
1.908SerTyr: 1.908 ± 0.274
0.0SerXaa: 0.0 ± 0.0
Thr
5.669ThrAla: 5.669 ± 0.734
0.763ThrCys: 0.763 ± 0.253
3.107ThrAsp: 3.107 ± 0.377
2.998ThrGlu: 2.998 ± 0.39
2.235ThrPhe: 2.235 ± 0.318
4.633ThrGly: 4.633 ± 0.537
0.709ThrHis: 0.709 ± 0.225
4.197ThrIle: 4.197 ± 0.583
3.107ThrLys: 3.107 ± 0.346
4.306ThrLeu: 4.306 ± 0.405
1.69ThrMet: 1.69 ± 0.26
3.27ThrAsn: 3.27 ± 0.506
2.616ThrPro: 2.616 ± 0.392
2.834ThrGln: 2.834 ± 0.391
2.289ThrArg: 2.289 ± 0.315
3.597ThrSer: 3.597 ± 0.642
3.543ThrThr: 3.543 ± 0.575
4.742ThrVal: 4.742 ± 0.802
1.036ThrTrp: 1.036 ± 0.261
1.908ThrTyr: 1.908 ± 0.371
0.0ThrXaa: 0.0 ± 0.0
Val
5.669ValAla: 5.669 ± 0.574
1.254ValCys: 1.254 ± 0.226
3.652ValAsp: 3.652 ± 0.299
4.851ValGlu: 4.851 ± 0.585
2.289ValPhe: 2.289 ± 0.405
3.488ValGly: 3.488 ± 0.393
1.145ValHis: 1.145 ± 0.216
4.251ValIle: 4.251 ± 0.463
5.887ValLys: 5.887 ± 0.507
4.905ValLeu: 4.905 ± 0.418
1.69ValMet: 1.69 ± 0.281
3.597ValAsn: 3.597 ± 0.504
2.453ValPro: 2.453 ± 0.303
3.488ValGln: 3.488 ± 0.606
2.998ValArg: 2.998 ± 0.399
3.27ValSer: 3.27 ± 0.385
5.341ValThr: 5.341 ± 0.508
6.05ValVal: 6.05 ± 0.682
0.872ValTrp: 0.872 ± 0.201
2.998ValTyr: 2.998 ± 0.499
0.0ValXaa: 0.0 ± 0.0
Trp
1.145TrpAla: 1.145 ± 0.226
0.218TrpCys: 0.218 ± 0.107
1.09TrpAsp: 1.09 ± 0.232
0.763TrpGlu: 0.763 ± 0.245
0.491TrpPhe: 0.491 ± 0.167
0.981TrpGly: 0.981 ± 0.212
0.218TrpHis: 0.218 ± 0.104
0.6TrpIle: 0.6 ± 0.178
1.254TrpLys: 1.254 ± 0.295
1.363TrpLeu: 1.363 ± 0.218
0.436TrpMet: 0.436 ± 0.136
0.654TrpAsn: 0.654 ± 0.155
0.654TrpPro: 0.654 ± 0.191
0.872TrpGln: 0.872 ± 0.204
1.09TrpArg: 1.09 ± 0.259
0.709TrpSer: 0.709 ± 0.202
1.036TrpThr: 1.036 ± 0.261
0.981TrpVal: 0.981 ± 0.195
0.382TrpTrp: 0.382 ± 0.127
0.654TrpTyr: 0.654 ± 0.174
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.052TyrAla: 3.052 ± 0.328
0.545TyrCys: 0.545 ± 0.163
2.126TyrAsp: 2.126 ± 0.365
2.398TyrGlu: 2.398 ± 0.415
1.199TyrPhe: 1.199 ± 0.266
2.78TyrGly: 2.78 ± 0.396
0.709TyrHis: 0.709 ± 0.229
2.344TyrIle: 2.344 ± 0.295
2.18TyrLys: 2.18 ± 0.3
2.943TyrLeu: 2.943 ± 0.399
0.763TyrMet: 0.763 ± 0.221
1.853TyrAsn: 1.853 ± 0.312
1.526TyrPro: 1.526 ± 0.395
0.927TyrGln: 0.927 ± 0.233
2.071TyrArg: 2.071 ± 0.436
2.453TyrSer: 2.453 ± 0.391
1.799TyrThr: 1.799 ± 0.349
2.398TyrVal: 2.398 ± 0.331
0.436TyrTrp: 0.436 ± 0.14
1.417TyrTyr: 1.417 ± 0.277
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 92 proteins (18348 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski