Amino acid dipepetide frequency for Escherichia phage RDN8.1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.28AlaAla: 10.28 ± 1.074
0.975AlaCys: 0.975 ± 0.287
5.229AlaAsp: 5.229 ± 0.676
5.583AlaGlu: 5.583 ± 0.835
2.924AlaPhe: 2.924 ± 0.416
8.153AlaGly: 8.153 ± 1.045
1.063AlaHis: 1.063 ± 0.271
5.317AlaIle: 5.317 ± 0.749
6.292AlaLys: 6.292 ± 0.716
7.178AlaLeu: 7.178 ± 1.082
2.481AlaMet: 2.481 ± 0.395
3.545AlaAsn: 3.545 ± 0.548
2.836AlaPro: 2.836 ± 0.611
3.19AlaGln: 3.19 ± 0.508
3.19AlaArg: 3.19 ± 0.474
4.786AlaSer: 4.786 ± 0.667
4.342AlaThr: 4.342 ± 0.703
6.203AlaVal: 6.203 ± 1.043
1.684AlaTrp: 1.684 ± 0.486
2.836AlaTyr: 2.836 ± 0.478
0.0AlaXaa: 0.0 ± 0.0
Cys
0.62CysAla: 0.62 ± 0.196
0.0CysCys: 0.0 ± 0.0
0.709CysAsp: 0.709 ± 0.35
0.709CysGlu: 0.709 ± 0.22
0.532CysPhe: 0.532 ± 0.221
0.532CysGly: 0.532 ± 0.208
0.266CysHis: 0.266 ± 0.149
0.443CysIle: 0.443 ± 0.195
0.532CysLys: 0.532 ± 0.246
0.798CysLeu: 0.798 ± 0.309
0.266CysMet: 0.266 ± 0.187
0.177CysAsn: 0.177 ± 0.126
0.354CysPro: 0.354 ± 0.196
0.177CysGln: 0.177 ± 0.123
0.532CysArg: 0.532 ± 0.224
0.532CysSer: 0.532 ± 0.27
0.354CysThr: 0.354 ± 0.174
0.62CysVal: 0.62 ± 0.265
0.089CysTrp: 0.089 ± 0.093
0.177CysTyr: 0.177 ± 0.13
0.0CysXaa: 0.0 ± 0.0
Asp
6.026AspAla: 6.026 ± 0.727
0.532AspCys: 0.532 ± 0.282
3.633AspAsp: 3.633 ± 0.516
3.456AspGlu: 3.456 ± 0.572
1.772AspPhe: 1.772 ± 0.474
7.178AspGly: 7.178 ± 0.738
1.595AspHis: 1.595 ± 0.324
3.545AspIle: 3.545 ± 0.439
3.279AspLys: 3.279 ± 0.549
5.051AspLeu: 5.051 ± 0.691
1.95AspMet: 1.95 ± 0.412
2.393AspAsn: 2.393 ± 0.496
2.747AspPro: 2.747 ± 0.518
2.127AspGln: 2.127 ± 0.435
2.038AspArg: 2.038 ± 0.394
3.899AspSer: 3.899 ± 0.442
3.545AspThr: 3.545 ± 0.567
4.52AspVal: 4.52 ± 0.627
0.798AspTrp: 0.798 ± 0.294
2.393AspTyr: 2.393 ± 0.46
0.0AspXaa: 0.0 ± 0.0
Glu
7.09GluAla: 7.09 ± 0.979
0.266GluCys: 0.266 ± 0.15
4.52GluAsp: 4.52 ± 0.638
5.229GluGlu: 5.229 ± 0.798
2.216GluPhe: 2.216 ± 0.356
4.963GluGly: 4.963 ± 0.728
0.975GluHis: 0.975 ± 0.271
3.102GluIle: 3.102 ± 0.333
2.481GluLys: 2.481 ± 0.399
5.76GluLeu: 5.76 ± 0.616
1.684GluMet: 1.684 ± 0.364
2.57GluAsn: 2.57 ± 0.497
2.127GluPro: 2.127 ± 0.424
3.279GluGln: 3.279 ± 0.588
3.811GluArg: 3.811 ± 0.533
3.988GluSer: 3.988 ± 0.583
4.342GluThr: 4.342 ± 0.494
4.342GluVal: 4.342 ± 0.685
1.418GluTrp: 1.418 ± 0.265
2.747GluTyr: 2.747 ± 0.464
0.0GluXaa: 0.0 ± 0.0
Phe
2.304PheAla: 2.304 ± 0.486
0.354PheCys: 0.354 ± 0.224
2.836PheAsp: 2.836 ± 0.454
1.595PheGlu: 1.595 ± 0.328
0.886PhePhe: 0.886 ± 0.312
2.216PheGly: 2.216 ± 0.507
0.798PheHis: 0.798 ± 0.225
1.418PheIle: 1.418 ± 0.417
3.102PheLys: 3.102 ± 0.588
3.013PheLeu: 3.013 ± 0.41
0.798PheMet: 0.798 ± 0.299
2.393PheAsn: 2.393 ± 0.421
1.772PhePro: 1.772 ± 0.443
0.975PheGln: 0.975 ± 0.319
1.595PheArg: 1.595 ± 0.327
2.659PheSer: 2.659 ± 0.325
2.393PheThr: 2.393 ± 0.353
2.57PheVal: 2.57 ± 0.571
0.266PheTrp: 0.266 ± 0.128
1.152PheTyr: 1.152 ± 0.245
0.0PheXaa: 0.0 ± 0.0
Gly
6.824GlyAla: 6.824 ± 0.849
0.709GlyCys: 0.709 ± 0.251
4.786GlyAsp: 4.786 ± 1.045
5.317GlyGlu: 5.317 ± 0.425
2.304GlyPhe: 2.304 ± 0.466
5.849GlyGly: 5.849 ± 0.741
1.241GlyHis: 1.241 ± 0.388
3.988GlyIle: 3.988 ± 0.652
5.938GlyLys: 5.938 ± 0.793
5.672GlyLeu: 5.672 ± 0.771
2.747GlyMet: 2.747 ± 0.396
2.57GlyAsn: 2.57 ± 0.361
1.152GlyPro: 1.152 ± 0.404
3.279GlyGln: 3.279 ± 0.465
5.583GlyArg: 5.583 ± 0.728
5.583GlySer: 5.583 ± 0.549
4.431GlyThr: 4.431 ± 0.525
5.672GlyVal: 5.672 ± 0.846
1.329GlyTrp: 1.329 ± 0.344
3.811GlyTyr: 3.811 ± 0.653
0.0GlyXaa: 0.0 ± 0.0
His
0.798HisAla: 0.798 ± 0.264
0.266HisCys: 0.266 ± 0.148
1.329HisAsp: 1.329 ± 0.533
1.418HisGlu: 1.418 ± 0.43
0.443HisPhe: 0.443 ± 0.227
1.595HisGly: 1.595 ± 0.366
0.62HisHis: 0.62 ± 0.237
1.418HisIle: 1.418 ± 0.311
1.329HisLys: 1.329 ± 0.291
1.684HisLeu: 1.684 ± 0.307
0.532HisMet: 0.532 ± 0.235
0.443HisAsn: 0.443 ± 0.17
0.532HisPro: 0.532 ± 0.19
0.532HisGln: 0.532 ± 0.193
1.063HisArg: 1.063 ± 0.267
0.975HisSer: 0.975 ± 0.281
0.975HisThr: 0.975 ± 0.265
1.507HisVal: 1.507 ± 0.275
0.443HisTrp: 0.443 ± 0.203
0.532HisTyr: 0.532 ± 0.237
0.0HisXaa: 0.0 ± 0.0
Ile
3.811IleAla: 3.811 ± 0.594
0.532IleCys: 0.532 ± 0.308
3.19IleAsp: 3.19 ± 0.448
3.279IleGlu: 3.279 ± 0.475
0.62IlePhe: 0.62 ± 0.198
3.722IleGly: 3.722 ± 0.438
0.886IleHis: 0.886 ± 0.306
2.038IleIle: 2.038 ± 0.41
3.811IleLys: 3.811 ± 0.582
2.836IleLeu: 2.836 ± 0.509
0.975IleMet: 0.975 ± 0.37
3.279IleAsn: 3.279 ± 0.546
2.393IlePro: 2.393 ± 0.55
1.772IleGln: 1.772 ± 0.509
3.811IleArg: 3.811 ± 0.599
2.836IleSer: 2.836 ± 0.52
2.924IleThr: 2.924 ± 0.39
4.254IleVal: 4.254 ± 0.46
0.532IleTrp: 0.532 ± 0.221
1.063IleTyr: 1.063 ± 0.287
0.0IleXaa: 0.0 ± 0.0
Lys
7.533LysAla: 7.533 ± 0.822
0.532LysCys: 0.532 ± 0.254
3.811LysAsp: 3.811 ± 0.525
3.102LysGlu: 3.102 ± 0.438
2.57LysPhe: 2.57 ± 0.526
3.899LysGly: 3.899 ± 0.588
1.152LysHis: 1.152 ± 0.318
1.772LysIle: 1.772 ± 0.406
3.988LysLys: 3.988 ± 0.98
6.026LysLeu: 6.026 ± 0.597
2.127LysMet: 2.127 ± 0.372
2.127LysAsn: 2.127 ± 0.393
2.57LysPro: 2.57 ± 0.506
2.038LysGln: 2.038 ± 0.48
4.342LysArg: 4.342 ± 0.594
5.051LysSer: 5.051 ± 0.655
3.722LysThr: 3.722 ± 0.362
4.874LysVal: 4.874 ± 0.63
1.418LysTrp: 1.418 ± 0.361
2.127LysTyr: 2.127 ± 0.302
0.0LysXaa: 0.0 ± 0.0
Leu
5.849LeuAla: 5.849 ± 0.621
0.177LeuCys: 0.177 ± 0.113
4.52LeuAsp: 4.52 ± 0.446
6.381LeuGlu: 6.381 ± 0.715
2.216LeuPhe: 2.216 ± 0.371
4.431LeuGly: 4.431 ± 0.782
0.798LeuHis: 0.798 ± 0.231
3.722LeuIle: 3.722 ± 0.603
7.001LeuLys: 7.001 ± 0.645
4.874LeuLeu: 4.874 ± 0.462
2.924LeuMet: 2.924 ± 0.586
3.988LeuAsn: 3.988 ± 0.562
3.102LeuPro: 3.102 ± 0.374
4.786LeuGln: 4.786 ± 0.791
4.608LeuArg: 4.608 ± 0.464
5.672LeuSer: 5.672 ± 0.729
4.697LeuThr: 4.697 ± 0.672
4.874LeuVal: 4.874 ± 0.631
0.798LeuTrp: 0.798 ± 0.252
2.216LeuTyr: 2.216 ± 0.441
0.0LeuXaa: 0.0 ± 0.0
Met
3.102MetAla: 3.102 ± 0.425
0.532MetCys: 0.532 ± 0.238
0.709MetAsp: 0.709 ± 0.25
2.304MetGlu: 2.304 ± 0.402
1.329MetPhe: 1.329 ± 0.366
2.659MetGly: 2.659 ± 0.523
0.354MetHis: 0.354 ± 0.18
1.418MetIle: 1.418 ± 0.255
0.975MetLys: 0.975 ± 0.234
2.481MetLeu: 2.481 ± 0.431
0.798MetMet: 0.798 ± 0.246
0.975MetAsn: 0.975 ± 0.259
0.886MetPro: 0.886 ± 0.335
0.975MetGln: 0.975 ± 0.331
1.684MetArg: 1.684 ± 0.316
1.861MetSer: 1.861 ± 0.378
2.038MetThr: 2.038 ± 0.373
2.393MetVal: 2.393 ± 0.392
0.266MetTrp: 0.266 ± 0.196
1.063MetTyr: 1.063 ± 0.272
0.0MetXaa: 0.0 ± 0.0
Asn
4.254AsnAla: 4.254 ± 0.694
0.62AsnCys: 0.62 ± 0.235
2.038AsnAsp: 2.038 ± 0.49
2.57AsnGlu: 2.57 ± 0.486
1.418AsnPhe: 1.418 ± 0.266
4.254AsnGly: 4.254 ± 0.644
0.62AsnHis: 0.62 ± 0.209
2.304AsnIle: 2.304 ± 0.31
2.57AsnLys: 2.57 ± 0.414
3.456AsnLeu: 3.456 ± 0.62
1.507AsnMet: 1.507 ± 0.307
2.038AsnAsn: 2.038 ± 0.442
2.924AsnPro: 2.924 ± 0.58
1.329AsnGln: 1.329 ± 0.362
2.304AsnArg: 2.304 ± 0.596
2.127AsnSer: 2.127 ± 0.452
2.393AsnThr: 2.393 ± 0.49
2.924AsnVal: 2.924 ± 0.463
0.266AsnTrp: 0.266 ± 0.152
1.241AsnTyr: 1.241 ± 0.327
0.0AsnXaa: 0.0 ± 0.0
Pro
2.924ProAla: 2.924 ± 0.541
0.443ProCys: 0.443 ± 0.244
2.127ProAsp: 2.127 ± 0.418
3.368ProGlu: 3.368 ± 0.679
1.329ProPhe: 1.329 ± 0.235
1.507ProGly: 1.507 ± 0.337
0.709ProHis: 0.709 ± 0.224
1.684ProIle: 1.684 ± 0.36
2.836ProLys: 2.836 ± 0.547
2.127ProLeu: 2.127 ± 0.389
0.886ProMet: 0.886 ± 0.296
2.393ProAsn: 2.393 ± 0.314
0.886ProPro: 0.886 ± 0.349
1.595ProGln: 1.595 ± 0.367
1.595ProArg: 1.595 ± 0.323
2.481ProSer: 2.481 ± 0.412
2.924ProThr: 2.924 ± 0.369
3.545ProVal: 3.545 ± 0.431
0.709ProTrp: 0.709 ± 0.247
0.975ProTyr: 0.975 ± 0.216
0.0ProXaa: 0.0 ± 0.0
Gln
4.077GlnAla: 4.077 ± 0.452
0.089GlnCys: 0.089 ± 0.093
3.279GlnAsp: 3.279 ± 0.714
2.304GlnGlu: 2.304 ± 0.47
1.595GlnPhe: 1.595 ± 0.278
2.924GlnGly: 2.924 ± 0.419
0.532GlnHis: 0.532 ± 0.224
1.684GlnIle: 1.684 ± 0.371
1.684GlnLys: 1.684 ± 0.341
3.722GlnLeu: 3.722 ± 0.664
0.886GlnMet: 0.886 ± 0.283
1.595GlnAsn: 1.595 ± 0.412
1.152GlnPro: 1.152 ± 0.304
1.95GlnGln: 1.95 ± 0.588
2.836GlnArg: 2.836 ± 0.682
2.747GlnSer: 2.747 ± 0.592
2.393GlnThr: 2.393 ± 0.515
3.013GlnVal: 3.013 ± 0.514
0.532GlnTrp: 0.532 ± 0.218
1.063GlnTyr: 1.063 ± 0.285
0.0GlnXaa: 0.0 ± 0.0
Arg
4.963ArgAla: 4.963 ± 0.83
0.443ArgCys: 0.443 ± 0.198
4.874ArgAsp: 4.874 ± 0.537
3.368ArgGlu: 3.368 ± 0.513
2.924ArgPhe: 2.924 ± 0.494
3.279ArgGly: 3.279 ± 0.412
1.063ArgHis: 1.063 ± 0.338
3.368ArgIle: 3.368 ± 0.675
3.456ArgLys: 3.456 ± 0.604
5.495ArgLeu: 5.495 ± 0.742
1.418ArgMet: 1.418 ± 0.341
2.127ArgAsn: 2.127 ± 0.533
1.861ArgPro: 1.861 ± 0.375
2.216ArgGln: 2.216 ± 0.419
2.481ArgArg: 2.481 ± 0.423
3.456ArgSer: 3.456 ± 0.485
2.659ArgThr: 2.659 ± 0.385
3.545ArgVal: 3.545 ± 0.628
0.975ArgTrp: 0.975 ± 0.289
1.772ArgTyr: 1.772 ± 0.394
0.0ArgXaa: 0.0 ± 0.0
Ser
4.342SerAla: 4.342 ± 0.665
0.798SerCys: 0.798 ± 0.359
5.14SerAsp: 5.14 ± 0.492
3.102SerGlu: 3.102 ± 0.606
2.924SerPhe: 2.924 ± 0.482
6.115SerGly: 6.115 ± 0.722
2.481SerHis: 2.481 ± 0.439
3.279SerIle: 3.279 ± 0.588
3.722SerLys: 3.722 ± 0.471
4.077SerLeu: 4.077 ± 0.68
1.507SerMet: 1.507 ± 0.347
2.57SerAsn: 2.57 ± 0.459
2.304SerPro: 2.304 ± 0.47
2.393SerGln: 2.393 ± 0.348
3.633SerArg: 3.633 ± 0.572
4.254SerSer: 4.254 ± 0.654
3.811SerThr: 3.811 ± 0.502
4.342SerVal: 4.342 ± 0.597
0.709SerTrp: 0.709 ± 0.212
2.304SerTyr: 2.304 ± 0.392
0.0SerXaa: 0.0 ± 0.0
Thr
3.013ThrAla: 3.013 ± 0.608
0.354ThrCys: 0.354 ± 0.212
2.924ThrAsp: 2.924 ± 0.55
5.406ThrGlu: 5.406 ± 0.525
2.304ThrPhe: 2.304 ± 0.428
5.76ThrGly: 5.76 ± 0.65
0.709ThrHis: 0.709 ± 0.273
3.102ThrIle: 3.102 ± 0.585
3.102ThrLys: 3.102 ± 0.412
5.051ThrLeu: 5.051 ± 0.718
2.127ThrMet: 2.127 ± 0.344
2.038ThrAsn: 2.038 ± 0.44
3.013ThrPro: 3.013 ± 0.374
2.127ThrGln: 2.127 ± 0.493
3.102ThrArg: 3.102 ± 0.381
2.304ThrSer: 2.304 ± 0.541
3.456ThrThr: 3.456 ± 0.634
5.76ThrVal: 5.76 ± 0.797
0.709ThrTrp: 0.709 ± 0.188
2.127ThrTyr: 2.127 ± 0.371
0.0ThrXaa: 0.0 ± 0.0
Val
5.849ValAla: 5.849 ± 0.636
0.354ValCys: 0.354 ± 0.162
3.545ValAsp: 3.545 ± 0.466
5.583ValGlu: 5.583 ± 0.77
2.747ValPhe: 2.747 ± 0.592
5.938ValGly: 5.938 ± 0.641
1.507ValHis: 1.507 ± 0.627
2.924ValIle: 2.924 ± 0.551
5.317ValLys: 5.317 ± 0.785
4.608ValLeu: 4.608 ± 0.678
1.772ValMet: 1.772 ± 0.452
3.279ValAsn: 3.279 ± 0.83
2.924ValPro: 2.924 ± 0.501
3.19ValGln: 3.19 ± 0.634
4.431ValArg: 4.431 ± 0.648
5.583ValSer: 5.583 ± 0.628
4.52ValThr: 4.52 ± 0.537
6.115ValVal: 6.115 ± 0.975
1.063ValTrp: 1.063 ± 0.319
3.19ValTyr: 3.19 ± 0.474
0.0ValXaa: 0.0 ± 0.0
Trp
0.62TrpAla: 0.62 ± 0.176
0.089TrpCys: 0.089 ± 0.107
0.709TrpAsp: 0.709 ± 0.247
1.063TrpGlu: 1.063 ± 0.306
0.709TrpPhe: 0.709 ± 0.294
1.063TrpGly: 1.063 ± 0.269
0.443TrpHis: 0.443 ± 0.195
0.354TrpIle: 0.354 ± 0.216
1.241TrpLys: 1.241 ± 0.312
1.772TrpLeu: 1.772 ± 0.377
0.266TrpMet: 0.266 ± 0.148
0.886TrpAsn: 0.886 ± 0.28
0.177TrpPro: 0.177 ± 0.11
0.62TrpGln: 0.62 ± 0.304
0.886TrpArg: 0.886 ± 0.259
1.152TrpSer: 1.152 ± 0.411
0.62TrpThr: 0.62 ± 0.252
1.418TrpVal: 1.418 ± 0.349
0.266TrpTrp: 0.266 ± 0.154
0.443TrpTyr: 0.443 ± 0.166
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.811TyrAla: 3.811 ± 0.639
0.354TyrCys: 0.354 ± 0.176
2.659TyrAsp: 2.659 ± 0.485
1.772TyrGlu: 1.772 ± 0.422
1.329TyrPhe: 1.329 ± 0.284
2.747TyrGly: 2.747 ± 0.539
0.62TyrHis: 0.62 ± 0.285
1.684TyrIle: 1.684 ± 0.431
2.127TyrLys: 2.127 ± 0.361
1.95TyrLeu: 1.95 ± 0.377
1.063TyrMet: 1.063 ± 0.277
1.772TyrAsn: 1.772 ± 0.352
1.329TyrPro: 1.329 ± 0.394
1.507TyrGln: 1.507 ± 0.458
2.216TyrArg: 2.216 ± 0.538
1.861TyrSer: 1.861 ± 0.46
1.95TyrThr: 1.95 ± 0.423
1.95TyrVal: 1.95 ± 0.484
0.532TyrTrp: 0.532 ± 0.2
1.241TyrTyr: 1.241 ± 0.289
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (11285 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski