Amino acid dipepetide frequency for Mycobacterium phage Omega (Mycobacteriophage Omega)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.949AlaAla: 10.949 ± 0.637
1.441AlaCys: 1.441 ± 0.249
6.829AlaAsp: 6.829 ± 0.476
7.923AlaGlu: 7.923 ± 0.552
3.112AlaPhe: 3.112 ± 0.297
8.298AlaGly: 8.298 ± 0.712
1.93AlaHis: 1.93 ± 0.253
5.59AlaIle: 5.59 ± 0.408
4.207AlaLys: 4.207 ± 0.356
8.327AlaLeu: 8.327 ± 0.508
2.622AlaMet: 2.622 ± 0.312
2.997AlaAsn: 2.997 ± 0.292
4.61AlaPro: 4.61 ± 0.458
2.824AlaGln: 2.824 ± 0.25
6.195AlaArg: 6.195 ± 0.429
4.495AlaSer: 4.495 ± 0.424
5.244AlaThr: 5.244 ± 0.393
5.82AlaVal: 5.82 ± 0.427
1.758AlaTrp: 1.758 ± 0.217
2.507AlaTyr: 2.507 ± 0.278
0.0AlaXaa: 0.0 ± 0.0
Cys
0.893CysAla: 0.893 ± 0.169
0.202CysCys: 0.202 ± 0.086
1.181CysAsp: 1.181 ± 0.225
0.864CysGlu: 0.864 ± 0.166
0.49CysPhe: 0.49 ± 0.122
1.93CysGly: 1.93 ± 0.313
0.432CysHis: 0.432 ± 0.115
0.576CysIle: 0.576 ± 0.13
0.663CysLys: 0.663 ± 0.156
1.21CysLeu: 1.21 ± 0.208
0.144CysMet: 0.144 ± 0.068
0.634CysAsn: 0.634 ± 0.132
0.807CysPro: 0.807 ± 0.152
0.403CysGln: 0.403 ± 0.113
1.21CysArg: 1.21 ± 0.218
0.807CysSer: 0.807 ± 0.171
0.807CysThr: 0.807 ± 0.194
1.037CysVal: 1.037 ± 0.214
0.375CysTrp: 0.375 ± 0.114
0.461CysTyr: 0.461 ± 0.115
0.0CysXaa: 0.0 ± 0.0
Asp
5.474AspAla: 5.474 ± 0.387
1.181AspCys: 1.181 ± 0.219
4.091AspAsp: 4.091 ± 0.313
4.869AspGlu: 4.869 ± 0.433
2.334AspPhe: 2.334 ± 0.284
6.281AspGly: 6.281 ± 0.489
1.873AspHis: 1.873 ± 0.232
3.169AspIle: 3.169 ± 0.312
2.997AspLys: 2.997 ± 0.298
5.244AspLeu: 5.244 ± 0.349
1.354AspMet: 1.354 ± 0.208
2.161AspAsn: 2.161 ± 0.249
3.717AspPro: 3.717 ± 0.34
2.132AspGln: 2.132 ± 0.225
3.89AspArg: 3.89 ± 0.317
2.708AspSer: 2.708 ± 0.339
3.169AspThr: 3.169 ± 0.27
4.322AspVal: 4.322 ± 0.45
2.017AspTrp: 2.017 ± 0.267
2.334AspTyr: 2.334 ± 0.258
0.0AspXaa: 0.0 ± 0.0
Glu
6.685GluAla: 6.685 ± 0.559
1.239GluCys: 1.239 ± 0.186
4.408GluAsp: 4.408 ± 0.323
5.071GluGlu: 5.071 ± 0.495
2.42GluPhe: 2.42 ± 0.298
4.61GluGly: 4.61 ± 0.353
1.239GluHis: 1.239 ± 0.176
4.063GluIle: 4.063 ± 0.338
2.997GluLys: 2.997 ± 0.382
6.886GluLeu: 6.886 ± 0.491
2.564GluMet: 2.564 ± 0.248
1.959GluAsn: 1.959 ± 0.229
2.968GluPro: 2.968 ± 0.282
2.564GluGln: 2.564 ± 0.234
5.042GluArg: 5.042 ± 0.432
3.458GluSer: 3.458 ± 0.339
2.91GluThr: 2.91 ± 0.251
4.783GluVal: 4.783 ± 0.41
1.441GluTrp: 1.441 ± 0.236
3.025GluTyr: 3.025 ± 0.357
0.0GluXaa: 0.0 ± 0.0
Phe
2.737PheAla: 2.737 ± 0.273
0.49PheCys: 0.49 ± 0.133
2.708PheAsp: 2.708 ± 0.31
2.219PheGlu: 2.219 ± 0.247
1.124PhePhe: 1.124 ± 0.189
3.198PheGly: 3.198 ± 0.341
0.778PheHis: 0.778 ± 0.155
1.268PheIle: 1.268 ± 0.185
1.095PheLys: 1.095 ± 0.186
1.959PheLeu: 1.959 ± 0.237
0.807PheMet: 0.807 ± 0.156
1.268PheAsn: 1.268 ± 0.248
2.046PhePro: 2.046 ± 0.288
0.749PheGln: 0.749 ± 0.181
1.786PheArg: 1.786 ± 0.246
1.7PheSer: 1.7 ± 0.204
1.758PheThr: 1.758 ± 0.22
2.075PheVal: 2.075 ± 0.194
0.605PheTrp: 0.605 ± 0.144
0.864PheTyr: 0.864 ± 0.152
0.0PheXaa: 0.0 ± 0.0
Gly
7.981GlyAla: 7.981 ± 0.728
1.325GlyCys: 1.325 ± 0.216
5.705GlyAsp: 5.705 ± 0.404
5.417GlyGlu: 5.417 ± 0.366
3.141GlyPhe: 3.141 ± 0.355
9.595GlyGly: 9.595 ± 1.509
2.305GlyHis: 2.305 ± 0.314
4.264GlyIle: 4.264 ± 0.436
3.486GlyLys: 3.486 ± 0.336
7.693GlyLeu: 7.693 ± 0.626
2.305GlyMet: 2.305 ± 0.248
2.91GlyAsn: 2.91 ± 0.28
4.063GlyPro: 4.063 ± 0.385
2.737GlyGln: 2.737 ± 0.314
5.763GlyArg: 5.763 ± 0.451
5.071GlySer: 5.071 ± 0.387
4.783GlyThr: 4.783 ± 0.526
5.071GlyVal: 5.071 ± 0.387
2.161GlyTrp: 2.161 ± 0.312
3.256GlyTyr: 3.256 ± 0.241
0.0GlyXaa: 0.0 ± 0.0
His
1.7HisAla: 1.7 ± 0.188
0.375HisCys: 0.375 ± 0.097
1.383HisAsp: 1.383 ± 0.18
1.844HisGlu: 1.844 ± 0.207
0.375HisPhe: 0.375 ± 0.116
2.103HisGly: 2.103 ± 0.251
0.864HisHis: 0.864 ± 0.166
1.037HisIle: 1.037 ± 0.165
0.72HisLys: 0.72 ± 0.125
2.247HisLeu: 2.247 ± 0.266
0.432HisMet: 0.432 ± 0.109
0.692HisAsn: 0.692 ± 0.133
1.412HisPro: 1.412 ± 0.191
0.605HisGln: 0.605 ± 0.125
1.959HisArg: 1.959 ± 0.228
0.864HisSer: 0.864 ± 0.149
0.692HisThr: 0.692 ± 0.137
1.498HisVal: 1.498 ± 0.206
0.778HisTrp: 0.778 ± 0.195
0.519HisTyr: 0.519 ± 0.106
0.0HisXaa: 0.0 ± 0.0
Ile
5.618IleAla: 5.618 ± 0.417
0.576IleCys: 0.576 ± 0.162
3.458IleAsp: 3.458 ± 0.284
4.235IleGlu: 4.235 ± 0.308
1.383IlePhe: 1.383 ± 0.188
3.659IleGly: 3.659 ± 0.354
1.239IleHis: 1.239 ± 0.189
1.873IleIle: 1.873 ± 0.226
1.786IleLys: 1.786 ± 0.222
3.774IleLeu: 3.774 ± 0.34
0.807IleMet: 0.807 ± 0.166
1.614IleAsn: 1.614 ± 0.205
2.852IlePro: 2.852 ± 0.305
1.527IleGln: 1.527 ± 0.182
3.198IleArg: 3.198 ± 0.285
2.42IleSer: 2.42 ± 0.258
2.737IleThr: 2.737 ± 0.29
3.486IleVal: 3.486 ± 0.354
0.778IleTrp: 0.778 ± 0.131
1.297IleTyr: 1.297 ± 0.194
0.0IleXaa: 0.0 ± 0.0
Lys
4.351LysAla: 4.351 ± 0.399
0.663LysCys: 0.663 ± 0.134
1.844LysAsp: 1.844 ± 0.287
2.075LysGlu: 2.075 ± 0.258
1.556LysPhe: 1.556 ± 0.203
2.68LysGly: 2.68 ± 0.311
1.008LysHis: 1.008 ± 0.182
1.585LysIle: 1.585 ± 0.202
2.247LysLys: 2.247 ± 0.31
3.63LysLeu: 3.63 ± 0.373
1.469LysMet: 1.469 ± 0.187
1.066LysAsn: 1.066 ± 0.189
2.795LysPro: 2.795 ± 0.297
1.21LysGln: 1.21 ± 0.156
2.737LysArg: 2.737 ± 0.296
2.276LysSer: 2.276 ± 0.334
1.758LysThr: 1.758 ± 0.228
3.169LysVal: 3.169 ± 0.306
1.066LysTrp: 1.066 ± 0.173
1.268LysTyr: 1.268 ± 0.199
0.0LysXaa: 0.0 ± 0.0
Leu
9.105LeuAla: 9.105 ± 0.517
0.663LeuCys: 0.663 ± 0.131
5.417LeuAsp: 5.417 ± 0.38
5.042LeuGlu: 5.042 ± 0.427
2.132LeuPhe: 2.132 ± 0.239
7.347LeuGly: 7.347 ± 0.469
1.7LeuHis: 1.7 ± 0.186
3.169LeuIle: 3.169 ± 0.304
2.939LeuLys: 2.939 ± 0.289
5.878LeuLeu: 5.878 ± 0.395
1.873LeuMet: 1.873 ± 0.201
3.141LeuAsn: 3.141 ± 0.247
4.322LeuPro: 4.322 ± 0.336
2.593LeuGln: 2.593 ± 0.298
6.022LeuArg: 6.022 ± 0.42
5.532LeuSer: 5.532 ± 0.357
5.446LeuThr: 5.446 ± 0.382
5.013LeuVal: 5.013 ± 0.355
1.556LeuTrp: 1.556 ± 0.207
2.103LeuTyr: 2.103 ± 0.207
0.0LeuXaa: 0.0 ± 0.0
Met
2.19MetAla: 2.19 ± 0.275
0.086MetCys: 0.086 ± 0.052
1.124MetAsp: 1.124 ± 0.183
1.268MetGlu: 1.268 ± 0.176
0.692MetPhe: 0.692 ± 0.133
1.786MetGly: 1.786 ± 0.18
0.403MetHis: 0.403 ± 0.115
1.325MetIle: 1.325 ± 0.193
1.412MetLys: 1.412 ± 0.206
2.046MetLeu: 2.046 ± 0.251
0.461MetMet: 0.461 ± 0.099
1.268MetAsn: 1.268 ± 0.2
1.124MetPro: 1.124 ± 0.212
0.576MetGln: 0.576 ± 0.142
1.758MetArg: 1.758 ± 0.183
2.42MetSer: 2.42 ± 0.244
2.247MetThr: 2.247 ± 0.291
1.297MetVal: 1.297 ± 0.185
0.576MetTrp: 0.576 ± 0.125
0.49MetTyr: 0.49 ± 0.115
0.0MetXaa: 0.0 ± 0.0
Asn
3.198AsnAla: 3.198 ± 0.27
0.547AsnCys: 0.547 ± 0.123
2.247AsnAsp: 2.247 ± 0.263
2.017AsnGlu: 2.017 ± 0.255
1.008AsnPhe: 1.008 ± 0.203
3.573AsnGly: 3.573 ± 0.327
0.864AsnHis: 0.864 ± 0.153
1.498AsnIle: 1.498 ± 0.269
1.181AsnLys: 1.181 ± 0.157
2.219AsnLeu: 2.219 ± 0.268
0.663AsnMet: 0.663 ± 0.14
0.951AsnAsn: 0.951 ± 0.162
2.564AsnPro: 2.564 ± 0.248
0.893AsnGln: 0.893 ± 0.188
2.564AsnArg: 2.564 ± 0.24
1.642AsnSer: 1.642 ± 0.224
1.7AsnThr: 1.7 ± 0.256
2.046AsnVal: 2.046 ± 0.222
0.605AsnTrp: 0.605 ± 0.115
0.72AsnTyr: 0.72 ± 0.123
0.0AsnXaa: 0.0 ± 0.0
Pro
5.1ProAla: 5.1 ± 0.371
0.922ProCys: 0.922 ± 0.204
3.774ProAsp: 3.774 ± 0.351
4.639ProGlu: 4.639 ± 0.372
1.902ProPhe: 1.902 ± 0.202
5.907ProGly: 5.907 ± 0.549
1.008ProHis: 1.008 ± 0.175
2.622ProIle: 2.622 ± 0.298
1.844ProLys: 1.844 ± 0.241
3.832ProLeu: 3.832 ± 0.288
1.066ProMet: 1.066 ± 0.172
2.075ProAsn: 2.075 ± 0.204
3.141ProPro: 3.141 ± 0.372
1.153ProGln: 1.153 ± 0.202
2.997ProArg: 2.997 ± 0.311
2.968ProSer: 2.968 ± 0.359
2.881ProThr: 2.881 ± 0.296
4.207ProVal: 4.207 ± 0.354
1.325ProTrp: 1.325 ± 0.168
1.383ProTyr: 1.383 ± 0.186
0.0ProXaa: 0.0 ± 0.0
Gln
2.824GlnAla: 2.824 ± 0.302
0.634GlnCys: 0.634 ± 0.134
1.412GlnAsp: 1.412 ± 0.209
1.7GlnGlu: 1.7 ± 0.259
0.951GlnPhe: 0.951 ± 0.162
2.046GlnGly: 2.046 ± 0.299
0.547GlnHis: 0.547 ± 0.115
1.786GlnIle: 1.786 ± 0.205
1.729GlnLys: 1.729 ± 0.264
2.478GlnLeu: 2.478 ± 0.294
1.037GlnMet: 1.037 ± 0.158
0.692GlnAsn: 0.692 ± 0.123
1.556GlnPro: 1.556 ± 0.203
1.268GlnGln: 1.268 ± 0.246
2.42GlnArg: 2.42 ± 0.258
1.902GlnSer: 1.902 ± 0.243
1.7GlnThr: 1.7 ± 0.23
1.815GlnVal: 1.815 ± 0.245
0.432GlnTrp: 0.432 ± 0.133
0.864GlnTyr: 0.864 ± 0.15
0.0GlnXaa: 0.0 ± 0.0
Arg
6.339ArgAla: 6.339 ± 0.439
1.037ArgCys: 1.037 ± 0.193
3.919ArgAsp: 3.919 ± 0.292
4.783ArgGlu: 4.783 ± 0.413
2.075ArgPhe: 2.075 ± 0.24
5.302ArgGly: 5.302 ± 0.469
1.239ArgHis: 1.239 ± 0.23
3.659ArgIle: 3.659 ± 0.396
2.997ArgLys: 2.997 ± 0.269
5.446ArgLeu: 5.446 ± 0.444
2.276ArgMet: 2.276 ± 0.259
2.276ArgAsn: 2.276 ± 0.24
3.515ArgPro: 3.515 ± 0.371
2.564ArgGln: 2.564 ± 0.297
5.474ArgArg: 5.474 ± 0.54
3.313ArgSer: 3.313 ± 0.284
3.227ArgThr: 3.227 ± 0.352
4.61ArgVal: 4.61 ± 0.383
1.758ArgTrp: 1.758 ± 0.233
2.305ArgTyr: 2.305 ± 0.295
0.0ArgXaa: 0.0 ± 0.0
Ser
5.561SerAla: 5.561 ± 0.446
0.778SerCys: 0.778 ± 0.17
3.717SerAsp: 3.717 ± 0.31
3.746SerGlu: 3.746 ± 0.355
1.498SerPhe: 1.498 ± 0.245
6.108SerGly: 6.108 ± 0.463
1.124SerHis: 1.124 ± 0.189
2.507SerIle: 2.507 ± 0.286
1.988SerLys: 1.988 ± 0.272
4.322SerLeu: 4.322 ± 0.319
1.21SerMet: 1.21 ± 0.171
1.642SerAsn: 1.642 ± 0.229
3.544SerPro: 3.544 ± 0.298
1.383SerGln: 1.383 ± 0.225
3.198SerArg: 3.198 ± 0.31
3.313SerSer: 3.313 ± 0.292
2.622SerThr: 2.622 ± 0.282
3.774SerVal: 3.774 ± 0.394
1.239SerTrp: 1.239 ± 0.212
1.614SerTyr: 1.614 ± 0.223
0.0SerXaa: 0.0 ± 0.0
Thr
5.013ThrAla: 5.013 ± 0.452
0.98ThrCys: 0.98 ± 0.19
3.4ThrAsp: 3.4 ± 0.313
3.429ThrGlu: 3.429 ± 0.319
1.93ThrPhe: 1.93 ± 0.219
4.812ThrGly: 4.812 ± 0.372
0.98ThrHis: 0.98 ± 0.164
2.997ThrIle: 2.997 ± 0.311
1.729ThrLys: 1.729 ± 0.248
4.927ThrLeu: 4.927 ± 0.386
0.98ThrMet: 0.98 ± 0.169
1.441ThrAsn: 1.441 ± 0.214
3.688ThrPro: 3.688 ± 0.338
1.153ThrGln: 1.153 ± 0.17
2.651ThrArg: 2.651 ± 0.264
2.824ThrSer: 2.824 ± 0.297
3.112ThrThr: 3.112 ± 0.379
4.38ThrVal: 4.38 ± 0.382
1.498ThrTrp: 1.498 ± 0.229
1.902ThrTyr: 1.902 ± 0.24
0.0ThrXaa: 0.0 ± 0.0
Val
6.685ValAla: 6.685 ± 0.464
0.98ValCys: 0.98 ± 0.195
5.388ValAsp: 5.388 ± 0.41
5.359ValGlu: 5.359 ± 0.372
1.844ValPhe: 1.844 ± 0.235
5.129ValGly: 5.129 ± 0.483
1.498ValHis: 1.498 ± 0.236
3.054ValIle: 3.054 ± 0.324
2.824ValLys: 2.824 ± 0.296
4.322ValLeu: 4.322 ± 0.338
1.671ValMet: 1.671 ± 0.2
2.507ValAsn: 2.507 ± 0.22
3.573ValPro: 3.573 ± 0.345
2.017ValGln: 2.017 ± 0.301
4.61ValArg: 4.61 ± 0.399
4.005ValSer: 4.005 ± 0.267
4.178ValThr: 4.178 ± 0.428
5.59ValVal: 5.59 ± 0.53
1.037ValTrp: 1.037 ± 0.194
1.959ValTyr: 1.959 ± 0.256
0.0ValXaa: 0.0 ± 0.0
Trp
1.902TrpAla: 1.902 ± 0.24
0.519TrpCys: 0.519 ± 0.113
1.469TrpAsp: 1.469 ± 0.204
1.383TrpGlu: 1.383 ± 0.195
0.634TrpPhe: 0.634 ± 0.153
1.556TrpGly: 1.556 ± 0.244
0.576TrpHis: 0.576 ± 0.13
1.095TrpIle: 1.095 ± 0.164
0.519TrpLys: 0.519 ± 0.115
1.786TrpLeu: 1.786 ± 0.248
0.634TrpMet: 0.634 ± 0.167
0.634TrpAsn: 0.634 ± 0.133
1.037TrpPro: 1.037 ± 0.162
0.49TrpGln: 0.49 ± 0.102
1.988TrpArg: 1.988 ± 0.264
1.729TrpSer: 1.729 ± 0.262
1.297TrpThr: 1.297 ± 0.204
2.017TrpVal: 2.017 ± 0.267
0.576TrpTrp: 0.576 ± 0.132
0.49TrpTyr: 0.49 ± 0.128
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.371TyrAla: 3.371 ± 0.286
0.49TyrCys: 0.49 ± 0.117
1.844TyrAsp: 1.844 ± 0.267
2.449TyrGlu: 2.449 ± 0.278
0.634TyrPhe: 0.634 ± 0.139
3.112TyrGly: 3.112 ± 0.268
0.49TyrHis: 0.49 ± 0.116
1.181TyrIle: 1.181 ± 0.176
1.095TyrLys: 1.095 ± 0.208
2.766TyrLeu: 2.766 ± 0.268
0.144TyrMet: 0.144 ± 0.072
0.778TyrAsn: 0.778 ± 0.138
1.412TyrPro: 1.412 ± 0.225
0.951TyrGln: 0.951 ± 0.153
2.651TyrArg: 2.651 ± 0.305
1.469TyrSer: 1.469 ± 0.208
1.556TyrThr: 1.556 ± 0.233
2.219TyrVal: 2.219 ± 0.264
0.749TyrTrp: 0.749 ± 0.148
0.893TyrTyr: 0.893 ± 0.189
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 237 proteins (34708 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski