Amino acid dipeptide frequency for Mycobacterium phage Mdavu

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
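A minimal sketch of how such frequencies could be computed is shown here. It is not the database's own code: the function name `dipeptide_permille`, the toy sequences, and the choice to normalise by the total number of dipeptides are assumptions made for illustration. Any letter outside the 20 standard amino acids is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides observed.
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MAAV", "AAB"]
freqs = dipeptide_permille(toy)
for dp in sorted(freqs):
    print(dp, round(freqs[dp], 1))
```

In the toy input, the non-standard letter B ends up in the dipeptide AX, which corresponds to the Xaa row and column of the table.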

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
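The comparison against the uniform expectation can be sketched as follows. This is an illustration only: the page does not state the exact cutoff used for the red and blue marking, so the simple "above or below 2.5‰" rule and the function name `classify` are assumptions. The three example values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation: 1/400 of all dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / N_STANDARD ** 2  # 2.5


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Values in per mille copied from the table (AlaAla, CysCys, LeuAsn).
examples = {"AlaAla": 21.84, "CysCys": 0.115, "LeuAsn": 2.408}
labels = {name: classify(v) for name, v in examples.items()}
for name, v in examples.items():
    # Multiplying by 1e-3 converts per mille to a plain fraction.
    print(f"{name}: {labels[name]} (fraction {v * 1e-3:.6f})")
```

Note that this baseline ignores the unequal abundance of single amino acids; a dipeptide such as AlaAla can exceed 2.5‰ simply because alanine is common in this proteome.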

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Each group below is headed by the first residue of the dipeptide; within a group, entries follow the second-residue order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa. Each entry gives the frequency in ‰ ± its estimated error.
Ala
AlaAla: 21.84 ± 1.884
AlaCys: 1.089 ± 0.266
AlaAsp: 8.598 ± 0.783
AlaGlu: 9.172 ± 1.166
AlaPhe: 2.981 ± 0.69
AlaGly: 8.14 ± 0.981
AlaHis: 2.752 ± 0.375
AlaIle: 4.7 ± 0.498
AlaLys: 4.414 ± 0.585
AlaLeu: 10.949 ± 0.904
AlaMet: 3.382 ± 0.413
AlaAsn: 3.439 ± 0.49
AlaPro: 6.478 ± 0.731
AlaGln: 4.643 ± 0.672
AlaArg: 8.885 ± 0.784
AlaSer: 4.758 ± 0.692
AlaThr: 6.42 ± 0.805
AlaVal: 9.286 ± 1.062
AlaTrp: 2.522 ± 0.368
AlaTyr: 2.522 ± 0.415
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.146 ± 0.385
CysCys: 0.115 ± 0.08
CysAsp: 0.688 ± 0.188
CysGlu: 0.974 ± 0.267
CysPhe: 0.115 ± 0.084
CysGly: 1.261 ± 0.294
CysHis: 0.172 ± 0.103
CysIle: 0.401 ± 0.147
CysLys: 0.229 ± 0.095
CysLeu: 0.459 ± 0.146
CysMet: 0.229 ± 0.114
CysAsn: 0.229 ± 0.122
CysPro: 0.917 ± 0.25
CysGln: 0.287 ± 0.14
CysArg: 0.917 ± 0.193
CysSer: 0.974 ± 0.308
CysThr: 0.229 ± 0.129
CysVal: 0.803 ± 0.227
CysTrp: 0.516 ± 0.184
CysTyr: 0.229 ± 0.119
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.509 ± 0.613
AspCys: 0.803 ± 0.223
AspAsp: 5.503 ± 0.729
AspGlu: 6.306 ± 0.726
AspPhe: 1.433 ± 0.241
AspGly: 5.904 ± 0.559
AspHis: 0.917 ± 0.233
AspIle: 1.089 ± 0.246
AspLys: 2.121 ± 0.305
AspLeu: 6.191 ± 0.645
AspMet: 1.548 ± 0.336
AspAsn: 1.662 ± 0.245
AspPro: 4.013 ± 0.489
AspGln: 2.178 ± 0.375
AspArg: 4.357 ± 0.407
AspSer: 2.522 ± 0.384
AspThr: 2.637 ± 0.408
AspVal: 5.274 ± 0.475
AspTrp: 1.204 ± 0.289
AspTyr: 1.376 ± 0.323
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.853 ± 0.992
GluCys: 1.032 ± 0.295
GluAsp: 3.153 ± 0.513
GluGlu: 1.662 ± 0.293
GluPhe: 1.949 ± 0.276
GluGly: 4.872 ± 0.498
GluHis: 1.662 ± 0.376
GluIle: 2.121 ± 0.38
GluLys: 1.49 ± 0.354
GluLeu: 6.649 ± 0.583
GluMet: 1.376 ± 0.274
GluAsn: 1.146 ± 0.24
GluPro: 3.267 ± 0.523
GluGln: 2.637 ± 0.361
GluArg: 5.274 ± 0.708
GluSer: 2.809 ± 0.406
GluThr: 2.866 ± 0.319
GluVal: 5.904 ± 0.922
GluTrp: 1.089 ± 0.265
GluTyr: 2.064 ± 0.403
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.465 ± 0.393
PheCys: 0.229 ± 0.105
PheAsp: 3.038 ± 0.421
PheGlu: 1.892 ± 0.335
PhePhe: 0.631 ± 0.159
PheGly: 3.21 ± 0.49
PheHis: 0.631 ± 0.211
PheIle: 0.745 ± 0.194
PheLys: 0.974 ± 0.251
PheLeu: 2.006 ± 0.323
PheMet: 0.401 ± 0.143
PheAsn: 1.261 ± 0.424
PhePro: 0.917 ± 0.218
PheGln: 0.745 ± 0.204
PheArg: 1.548 ± 0.315
PheSer: 0.86 ± 0.279
PheThr: 1.433 ± 0.314
PheVal: 2.35 ± 0.407
PheTrp: 0.459 ± 0.146
PheTyr: 0.688 ± 0.228
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.0 ± 1.129
GlyCys: 0.803 ± 0.246
GlyAsp: 5.388 ± 0.531
GlyGlu: 5.044 ± 0.464
GlyPhe: 1.949 ± 0.315
GlyGly: 9.057 ± 1.577
GlyHis: 1.548 ± 0.363
GlyIle: 2.866 ± 0.746
GlyLys: 3.267 ± 0.579
GlyLeu: 6.649 ± 0.759
GlyMet: 1.49 ± 0.281
GlyAsn: 3.038 ± 0.508
GlyPro: 3.898 ± 0.733
GlyGln: 2.694 ± 0.388
GlyArg: 6.134 ± 0.671
GlySer: 5.216 ± 0.656
GlyThr: 6.076 ± 0.594
GlyVal: 7.509 ± 0.517
GlyTrp: 2.637 ± 0.396
GlyTyr: 2.35 ± 0.4
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.465 ± 0.429
HisCys: 0.287 ± 0.13
HisAsp: 1.204 ± 0.328
HisGlu: 1.318 ± 0.235
HisPhe: 0.688 ± 0.186
HisGly: 2.35 ± 0.377
HisHis: 0.631 ± 0.195
HisIle: 0.917 ± 0.244
HisLys: 0.401 ± 0.163
HisLeu: 1.949 ± 0.318
HisMet: 0.573 ± 0.199
HisAsn: 0.631 ± 0.179
HisPro: 1.376 ± 0.292
HisGln: 0.516 ± 0.188
HisArg: 1.49 ± 0.392
HisSer: 0.688 ± 0.181
HisThr: 1.548 ± 0.285
HisVal: 2.064 ± 0.345
HisTrp: 0.401 ± 0.185
HisTyr: 0.688 ± 0.24
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.388 ± 0.469
IleCys: 0.229 ± 0.128
IleAsp: 2.923 ± 0.381
IleGlu: 2.981 ± 0.532
IlePhe: 0.917 ± 0.262
IleGly: 3.21 ± 0.737
IleHis: 0.631 ± 0.207
IleIle: 0.803 ± 0.213
IleLys: 1.605 ± 0.325
IleLeu: 2.923 ± 0.377
IleMet: 0.401 ± 0.137
IleAsn: 1.261 ± 0.269
IlePro: 2.408 ± 0.457
IleGln: 0.631 ± 0.226
IleArg: 2.58 ± 0.319
IleSer: 1.548 ± 0.3
IleThr: 2.522 ± 0.318
IleVal: 3.267 ± 0.497
IleTrp: 0.917 ± 0.217
IleTyr: 0.573 ± 0.189
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.414 ± 0.641
LysCys: 0.459 ± 0.185
LysAsp: 1.089 ± 0.228
LysGlu: 0.803 ± 0.224
LysPhe: 1.49 ± 0.381
LysGly: 3.038 ± 0.562
LysHis: 0.631 ± 0.189
LysIle: 1.662 ± 0.409
LysLys: 0.803 ± 0.242
LysLeu: 2.923 ± 0.389
LysMet: 0.688 ± 0.182
LysAsn: 0.516 ± 0.175
LysPro: 2.293 ± 0.391
LysGln: 0.631 ± 0.198
LysArg: 3.153 ± 0.552
LysSer: 1.72 ± 0.385
LysThr: 1.662 ± 0.324
LysVal: 2.809 ± 0.475
LysTrp: 0.344 ± 0.13
LysTyr: 0.803 ± 0.281
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.152 ± 0.735
LeuCys: 0.631 ± 0.211
LeuAsp: 7.337 ± 0.729
LeuGlu: 3.153 ± 0.379
LeuPhe: 2.178 ± 0.389
LeuGly: 6.478 ± 0.723
LeuHis: 1.949 ± 0.316
LeuIle: 3.439 ± 0.447
LeuLys: 2.637 ± 0.432
LeuLeu: 5.044 ± 0.608
LeuMet: 1.548 ± 0.31
LeuAsn: 2.408 ± 0.27
LeuPro: 4.127 ± 0.591
LeuGln: 3.153 ± 0.38
LeuArg: 7.165 ± 0.753
LeuSer: 5.904 ± 0.585
LeuThr: 5.044 ± 0.576
LeuVal: 5.503 ± 0.546
LeuTrp: 1.318 ± 0.381
LeuTyr: 1.777 ± 0.399
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.58 ± 0.32
MetCys: 0.172 ± 0.104
MetAsp: 0.86 ± 0.238
MetGlu: 0.516 ± 0.179
MetPhe: 0.917 ± 0.249
MetGly: 1.204 ± 0.279
MetHis: 0.401 ± 0.156
MetIle: 1.032 ± 0.271
MetLys: 0.688 ± 0.192
MetLeu: 1.376 ± 0.401
MetMet: 0.287 ± 0.116
MetAsn: 0.573 ± 0.163
MetPro: 1.261 ± 0.313
MetGln: 0.631 ± 0.168
MetArg: 1.49 ± 0.319
MetSer: 2.293 ± 0.333
MetThr: 1.892 ± 0.369
MetVal: 1.605 ± 0.331
MetTrp: 0.401 ± 0.151
MetTyr: 0.688 ± 0.216
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.07 ± 0.665
AsnCys: 0.516 ± 0.175
AsnAsp: 1.146 ± 0.214
AsnGlu: 1.261 ± 0.245
AsnPhe: 0.631 ± 0.167
AsnGly: 3.497 ± 0.611
AsnHis: 0.459 ± 0.159
AsnIle: 0.803 ± 0.237
AsnLys: 1.146 ± 0.304
AsnLeu: 2.121 ± 0.31
AsnMet: 0.745 ± 0.173
AsnAsn: 0.803 ± 0.348
AsnPro: 2.694 ± 0.384
AsnGln: 0.573 ± 0.183
AsnArg: 1.548 ± 0.277
AsnSer: 1.433 ± 0.302
AsnThr: 1.433 ± 0.309
AsnVal: 2.064 ± 0.333
AsnTrp: 0.401 ± 0.147
AsnTyr: 0.516 ± 0.17
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.28 ± 0.651
ProCys: 0.573 ± 0.23
ProAsp: 3.095 ± 0.519
ProGlu: 4.758 ± 0.644
ProPhe: 1.376 ± 0.25
ProGly: 6.019 ± 0.55
ProHis: 1.318 ± 0.273
ProIle: 2.35 ± 0.319
ProLys: 1.662 ± 0.286
ProLeu: 4.185 ± 0.547
ProMet: 0.974 ± 0.193
ProAsn: 1.032 ± 0.323
ProPro: 2.981 ± 0.429
ProGln: 1.204 ± 0.477
ProArg: 3.325 ± 0.5
ProSer: 2.866 ± 0.44
ProThr: 2.809 ± 0.415
ProVal: 5.56 ± 0.801
ProTrp: 1.204 ± 0.223
ProTyr: 1.261 ± 0.278
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.07 ± 0.57
GlnCys: 0.172 ± 0.115
GlnAsp: 1.204 ± 0.276
GlnGlu: 1.49 ± 0.249
GlnPhe: 0.803 ± 0.185
GlnGly: 2.35 ± 0.319
GlnHis: 1.261 ± 0.212
GlnIle: 1.777 ± 0.294
GlnLys: 0.803 ± 0.228
GlnLeu: 3.038 ± 0.473
GlnMet: 0.573 ± 0.149
GlnAsn: 0.688 ± 0.278
GlnPro: 2.178 ± 0.344
GlnGln: 1.49 ± 0.274
GlnArg: 2.866 ± 0.376
GlnSer: 1.49 ± 0.357
GlnThr: 1.72 ± 0.25
GlnVal: 3.038 ± 0.441
GlnTrp: 0.459 ± 0.153
GlnTyr: 1.089 ± 0.244
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.337 ± 1.064
ArgCys: 1.089 ± 0.302
ArgAsp: 4.013 ± 0.371
ArgGlu: 4.414 ± 0.545
ArgPhe: 2.465 ± 0.397
ArgGly: 5.159 ± 0.68
ArgHis: 2.236 ± 0.433
ArgIle: 3.554 ± 0.482
ArgLys: 2.923 ± 0.505
ArgLeu: 6.535 ± 0.767
ArgMet: 2.465 ± 0.299
ArgAsn: 2.408 ± 0.354
ArgPro: 3.439 ± 0.487
ArgGln: 2.866 ± 0.395
ArgArg: 6.649 ± 0.774
ArgSer: 4.529 ± 0.695
ArgThr: 3.955 ± 0.48
ArgVal: 5.847 ± 0.698
ArgTrp: 2.178 ± 0.412
ArgTyr: 1.433 ± 0.231
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.42 ± 0.647
SerCys: 0.516 ± 0.218
SerAsp: 3.267 ± 0.451
SerGlu: 2.58 ± 0.386
SerPhe: 1.433 ± 0.218
SerGly: 5.216 ± 0.699
SerHis: 1.089 ± 0.293
SerIle: 2.178 ± 0.414
SerLys: 1.204 ± 0.275
SerLeu: 4.357 ± 0.392
SerMet: 0.917 ± 0.249
SerAsn: 1.376 ± 0.254
SerPro: 2.981 ± 0.366
SerGln: 1.662 ± 0.254
SerArg: 4.357 ± 0.506
SerSer: 3.325 ± 0.529
SerThr: 2.694 ± 0.362
SerVal: 4.299 ± 0.508
SerTrp: 1.261 ± 0.217
SerTyr: 1.089 ± 0.266
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.879 ± 0.752
ThrCys: 0.459 ± 0.177
ThrAsp: 3.038 ± 0.449
ThrGlu: 4.013 ± 0.435
ThrPhe: 1.949 ± 0.319
ThrGly: 5.331 ± 0.494
ThrHis: 1.605 ± 0.295
ThrIle: 2.923 ± 0.42
ThrLys: 1.892 ± 0.349
ThrLeu: 4.414 ± 0.573
ThrMet: 0.631 ± 0.214
ThrAsn: 1.49 ± 0.291
ThrPro: 3.382 ± 0.515
ThrGln: 1.433 ± 0.257
ThrArg: 3.669 ± 0.607
ThrSer: 2.923 ± 0.656
ThrThr: 2.694 ± 0.417
ThrVal: 4.758 ± 0.478
ThrTrp: 1.204 ± 0.236
ThrTyr: 1.204 ± 0.263
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.802 ± 0.857
ValCys: 0.974 ± 0.227
ValAsp: 6.363 ± 0.586
ValGlu: 6.879 ± 0.781
ValPhe: 1.49 ± 0.352
ValGly: 6.306 ± 0.717
ValHis: 1.662 ± 0.373
ValIle: 3.038 ± 0.43
ValLys: 2.465 ± 0.388
ValLeu: 6.592 ± 0.658
ValMet: 1.72 ± 0.264
ValAsn: 2.236 ± 0.374
ValPro: 5.56 ± 0.61
ValGln: 2.236 ± 0.372
ValArg: 5.388 ± 0.656
ValSer: 3.611 ± 0.571
ValThr: 5.675 ± 0.555
ValVal: 7.567 ± 0.831
ValTrp: 1.949 ± 0.323
ValTyr: 1.892 ± 0.318
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.064 ± 0.334
TrpCys: 0.344 ± 0.126
TrpAsp: 0.974 ± 0.282
TrpGlu: 0.631 ± 0.199
TrpPhe: 0.573 ± 0.186
TrpGly: 1.49 ± 0.303
TrpHis: 0.401 ± 0.147
TrpIle: 0.573 ± 0.17
TrpLys: 0.401 ± 0.123
TrpLeu: 3.153 ± 0.461
TrpMet: 0.344 ± 0.139
TrpAsn: 0.86 ± 0.246
TrpPro: 0.86 ± 0.198
TrpGln: 1.433 ± 0.279
TrpArg: 2.408 ± 0.352
TrpSer: 1.433 ± 0.294
TrpThr: 0.86 ± 0.249
TrpVal: 1.777 ± 0.319
TrpTrp: 0.516 ± 0.177
TrpTyr: 0.459 ± 0.173
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.236 ± 0.335
TyrCys: 0.401 ± 0.136
TyrAsp: 1.662 ± 0.323
TyrGlu: 1.032 ± 0.247
TyrPhe: 0.516 ± 0.168
TyrGly: 2.522 ± 0.389
TyrHis: 0.229 ± 0.108
TyrIle: 0.745 ± 0.229
TyrLys: 0.803 ± 0.233
TyrLeu: 1.433 ± 0.315
TyrMet: 0.516 ± 0.164
TyrAsn: 0.803 ± 0.201
TyrPro: 0.803 ± 0.229
TyrGln: 0.917 ± 0.301
TyrArg: 2.293 ± 0.403
TyrSer: 1.376 ± 0.247
TyrThr: 1.72 ± 0.383
TyrVal: 2.121 ± 0.361
TyrTrp: 0.573 ± 0.209
TyrTyr: 0.459 ± 0.16
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (17446 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
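One way such a protein-level bootstrap could work is sketched here. The details are assumptions, since the page only states that bootstrapping was done 100 times at the protein level: this sketch resamples whole proteins with replacement, recomputes the frequency for each replicate, and reports the standard deviation of the replicates as the error. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rep=100, seed=0):
    """Assumed procedure: resample proteins, take the std. dev. of replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rep):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(replicates)


# Hypothetical toy proteome, not the 78 phage proteins.
toy = ["MAAVLA", "MKAAGA", "MCLLAA", "MAVAAR"]
err = bootstrap_error(toy, "AA")
print(f"AA: {dipeptide_permille(toy, 'AA'):.1f} ± {err:.1f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps each protein's residues together, so the spread reflects variation between proteins.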

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski