Amino acid dipeptide frequency for Mycobacterium phage Redi

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would occur with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
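The counting behind such a table can be sketched as follows. This is a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the one-letter input alphabet, and the normalization by the total number of counted dipeptides are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard ones is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not real phage proteins.
    for pair, freq in sorted(dipeptide_per_mille(["MAAC", "MAB"]).items()):
        print(pair, round(freq, 3))
```

Dipeptides that never occur are simply absent from the returned dictionary, which corresponds to the 0.0 entries in the table.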

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
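As a small worked example of the unit conversion, the sketch below converts two values from the table (AlaAla and CysCys) and compares them with the uniform expectation of 1000/400 = 2.5‰. The helper names are hypothetical, and using the uniform baseline as the threshold for over- and underrepresentation is an assumption based on the description above.

```python
# Uniform expectation: 400 dipeptides, so 1000 / 400 = 2.5 per mille (0.25 %).
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def representation(value):
    """Classify a per mille value against the uniform baseline (assumed threshold)."""
    if value > UNIFORM_PER_MILLE:
        return "over"
    if value < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Values copied from the table in this document.
    for name, value in [("AlaAla", 20.944), ("CysCys", 0.073)]:
        print(name, per_mille_to_fraction(value), representation(value))
```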

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Rows (first residue) follow below; each entry is given as value ± error, in ‰.

Ala
AlaAla: 20.944 ± 2.95
AlaCys: 1.091 ± 0.363
AlaAsp: 8.581 ± 0.816
AlaGlu: 8.581 ± 1.038
AlaPhe: 3.636 ± 0.602
AlaGly: 9.745 ± 1.524
AlaHis: 2.618 ± 0.518
AlaIle: 5.963 ± 0.822
AlaLys: 3.782 ± 0.616
AlaLeu: 10.399 ± 1.053
AlaMet: 2.618 ± 0.482
AlaAsn: 3.563 ± 0.589
AlaPro: 6.981 ± 0.684
AlaGln: 4.945 ± 0.661
AlaArg: 8.508 ± 0.959
AlaSer: 5.818 ± 0.571
AlaThr: 8.072 ± 0.859
AlaVal: 6.909 ± 0.688
AlaTrp: 2.254 ± 0.471
AlaTyr: 2.473 ± 0.472
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.891 ± 0.466
CysCys: 0.073 ± 0.073
CysAsp: 0.873 ± 0.264
CysGlu: 0.582 ± 0.238
CysPhe: 0.0 ± 0.0
CysGly: 1.527 ± 0.463
CysHis: 0.073 ± 0.071
CysIle: 0.218 ± 0.14
CysLys: 0.291 ± 0.139
CysLeu: 0.509 ± 0.171
CysMet: 0.073 ± 0.073
CysAsn: 0.291 ± 0.128
CysPro: 0.945 ± 0.364
CysGln: 0.364 ± 0.164
CysArg: 0.945 ± 0.298
CysSer: 0.654 ± 0.207
CysThr: 0.8 ± 0.243
CysVal: 0.654 ± 0.247
CysTrp: 0.073 ± 0.066
CysTyr: 0.218 ± 0.128
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.563 ± 0.781
AspCys: 0.873 ± 0.284
AspAsp: 5.236 ± 0.701
AspGlu: 4.727 ± 0.656
AspPhe: 1.454 ± 0.333
AspGly: 6.618 ± 0.695
AspHis: 1.527 ± 0.333
AspIle: 2.254 ± 0.323
AspLys: 1.527 ± 0.338
AspLeu: 6.472 ± 0.677
AspMet: 1.164 ± 0.268
AspAsn: 1.454 ± 0.277
AspPro: 4.727 ± 0.648
AspGln: 1.818 ± 0.307
AspArg: 4.872 ± 0.8
AspSer: 3.127 ± 0.479
AspThr: 3.2 ± 0.489
AspVal: 4.436 ± 0.573
AspTrp: 1.309 ± 0.486
AspTyr: 1.891 ± 0.331
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.836 ± 0.786
GluCys: 0.727 ± 0.224
GluAsp: 3.2 ± 0.474
GluGlu: 2.327 ± 0.422
GluPhe: 2.545 ± 0.391
GluGly: 2.545 ± 0.469
GluHis: 0.873 ± 0.309
GluIle: 3.054 ± 0.406
GluLys: 2.327 ± 0.348
GluLeu: 6.109 ± 0.735
GluMet: 1.527 ± 0.392
GluAsn: 1.673 ± 0.291
GluPro: 2.763 ± 0.468
GluGln: 3.491 ± 0.5
GluArg: 3.927 ± 0.635
GluSer: 2.763 ± 0.583
GluThr: 4.072 ± 0.625
GluVal: 4.291 ± 0.656
GluTrp: 1.236 ± 0.298
GluTyr: 1.091 ± 0.28
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.927 ± 0.547
PheCys: 0.291 ± 0.157
PheAsp: 1.818 ± 0.57
PheGlu: 1.6 ± 0.368
PhePhe: 0.873 ± 0.287
PheGly: 2.545 ± 0.403
PheHis: 0.727 ± 0.245
PheIle: 1.018 ± 0.317
PheLys: 0.873 ± 0.214
PheLeu: 2.036 ± 0.432
PheMet: 0.364 ± 0.167
PheAsn: 0.8 ± 0.276
PhePro: 0.873 ± 0.347
PheGln: 0.364 ± 0.158
PheArg: 1.745 ± 0.32
PheSer: 1.745 ± 0.327
PheThr: 2.473 ± 0.471
PheVal: 2.545 ± 0.514
PheTrp: 0.364 ± 0.155
PheTyr: 0.8 ± 0.229
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.49 ± 1.198
GlyCys: 0.8 ± 0.228
GlyAsp: 5.963 ± 0.739
GlyGlu: 3.854 ± 0.678
GlyPhe: 2.182 ± 0.501
GlyGly: 9.09 ± 1.307
GlyHis: 1.454 ± 0.285
GlyIle: 5.018 ± 0.757
GlyLys: 2.473 ± 0.375
GlyLeu: 6.036 ± 0.584
GlyMet: 1.454 ± 0.32
GlyAsn: 2.836 ± 0.45
GlyPro: 4.654 ± 0.596
GlyGln: 3.491 ± 0.469
GlyArg: 5.745 ± 0.635
GlySer: 5.745 ± 0.871
GlyThr: 6.036 ± 0.688
GlyVal: 6.327 ± 0.624
GlyTrp: 1.745 ± 0.32
GlyTyr: 2.618 ± 0.557
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.309 ± 0.352
HisCys: 0.436 ± 0.178
HisAsp: 1.018 ± 0.193
HisGlu: 1.6 ± 0.4
HisPhe: 0.509 ± 0.193
HisGly: 2.036 ± 0.41
HisHis: 0.654 ± 0.189
HisIle: 0.727 ± 0.227
HisLys: 1.018 ± 0.271
HisLeu: 1.818 ± 0.417
HisMet: 0.218 ± 0.142
HisAsn: 0.073 ± 0.064
HisPro: 1.164 ± 0.303
HisGln: 1.018 ± 0.28
HisArg: 2.182 ± 0.384
HisSer: 0.873 ± 0.266
HisThr: 1.236 ± 0.303
HisVal: 1.527 ± 0.435
HisTrp: 0.364 ± 0.156
HisTyr: 0.727 ± 0.221
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.181 ± 0.697
IleCys: 0.291 ± 0.14
IleAsp: 3.491 ± 0.418
IleGlu: 3.782 ± 0.414
IlePhe: 0.582 ± 0.225
IleGly: 4.436 ± 0.722
IleHis: 1.018 ± 0.288
IleIle: 1.382 ± 0.33
IleLys: 1.018 ± 0.246
IleLeu: 2.909 ± 0.429
IleMet: 0.291 ± 0.146
IleAsn: 1.963 ± 0.316
IlePro: 2.473 ± 0.502
IleGln: 1.6 ± 0.296
IleArg: 3.2 ± 0.414
IleSer: 2.182 ± 0.45
IleThr: 3.636 ± 0.549
IleVal: 3.418 ± 0.532
IleTrp: 0.654 ± 0.196
IleTyr: 0.8 ± 0.181
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.163 ± 1.078
LysCys: 0.364 ± 0.155
LysAsp: 1.236 ± 0.388
LysGlu: 1.018 ± 0.238
LysPhe: 0.873 ± 0.198
LysGly: 2.909 ± 0.407
LysHis: 0.436 ± 0.165
LysIle: 1.673 ± 0.317
LysLys: 0.509 ± 0.181
LysLeu: 1.963 ± 0.448
LysMet: 0.654 ± 0.19
LysAsn: 0.873 ± 0.218
LysPro: 2.109 ± 0.405
LysGln: 1.6 ± 0.424
LysArg: 1.891 ± 0.415
LysSer: 1.527 ± 0.298
LysThr: 1.164 ± 0.251
LysVal: 2.327 ± 0.32
LysTrp: 0.364 ± 0.147
LysTyr: 0.8 ± 0.185
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 11.417 ± 1.094
LeuCys: 0.509 ± 0.19
LeuAsp: 5.963 ± 0.641
LeuGlu: 3.272 ± 0.357
LeuPhe: 2.618 ± 0.413
LeuGly: 8.145 ± 0.967
LeuHis: 2.036 ± 0.384
LeuIle: 3.272 ± 0.379
LeuLys: 2.836 ± 0.413
LeuLeu: 6.763 ± 0.844
LeuMet: 1.527 ± 0.335
LeuAsn: 2.982 ± 0.575
LeuPro: 3.709 ± 0.579
LeuGln: 2.982 ± 0.551
LeuArg: 5.381 ± 0.634
LeuSer: 4.218 ± 0.54
LeuThr: 5.018 ± 0.672
LeuVal: 5.381 ± 0.59
LeuTrp: 1.382 ± 0.363
LeuTyr: 1.236 ± 0.256
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.836 ± 0.452
MetCys: 0.0 ± 0.0
MetAsp: 0.509 ± 0.204
MetGlu: 0.8 ± 0.248
MetPhe: 1.018 ± 0.318
MetGly: 0.8 ± 0.206
MetHis: 0.364 ± 0.154
MetIle: 1.018 ± 0.273
MetLys: 0.364 ± 0.161
MetLeu: 1.382 ± 0.315
MetMet: 0.291 ± 0.15
MetAsn: 0.654 ± 0.22
MetPro: 0.945 ± 0.292
MetGln: 0.727 ± 0.226
MetArg: 1.382 ± 0.338
MetSer: 2.4 ± 0.293
MetThr: 2.036 ± 0.488
MetVal: 1.091 ± 0.277
MetTrp: 0.654 ± 0.258
MetTyr: 0.145 ± 0.094
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.909 ± 0.649
AsnCys: 0.436 ± 0.178
AsnAsp: 1.382 ± 0.311
AsnGlu: 0.8 ± 0.236
AsnPhe: 0.945 ± 0.227
AsnGly: 4.0 ± 0.653
AsnHis: 0.145 ± 0.097
AsnIle: 1.6 ± 0.375
AsnLys: 0.727 ± 0.263
AsnLeu: 2.4 ± 0.492
AsnMet: 0.364 ± 0.165
AsnAsn: 1.018 ± 0.235
AsnPro: 2.691 ± 0.34
AsnGln: 1.236 ± 0.302
AsnArg: 2.327 ± 0.385
AsnSer: 1.673 ± 0.522
AsnThr: 1.963 ± 0.376
AsnVal: 1.745 ± 0.327
AsnTrp: 0.509 ± 0.172
AsnTyr: 0.509 ± 0.202
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 7.636 ± 0.835
ProCys: 0.654 ± 0.247
ProAsp: 4.218 ± 0.485
ProGlu: 3.854 ± 0.613
ProPhe: 2.109 ± 0.371
ProGly: 5.163 ± 0.69
ProHis: 1.091 ± 0.243
ProIle: 2.109 ± 0.397
ProLys: 1.382 ± 0.324
ProLeu: 4.945 ± 0.72
ProMet: 1.527 ± 0.35
ProAsn: 1.891 ± 0.39
ProPro: 2.982 ± 0.457
ProGln: 2.036 ± 0.407
ProArg: 3.054 ± 0.463
ProSer: 2.4 ± 0.412
ProThr: 3.854 ± 0.525
ProVal: 4.145 ± 0.5
ProTrp: 1.6 ± 0.378
ProTyr: 1.309 ± 0.389
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.672 ± 0.937
GlnCys: 0.654 ± 0.251
GlnAsp: 1.309 ± 0.398
GlnGlu: 1.6 ± 0.315
GlnPhe: 0.945 ± 0.232
GlnGly: 1.454 ± 0.332
GlnHis: 1.164 ± 0.261
GlnIle: 2.545 ± 0.369
GlnLys: 1.382 ± 0.396
GlnLeu: 4.0 ± 0.545
GlnMet: 1.091 ± 0.296
GlnAsn: 0.654 ± 0.257
GlnPro: 2.473 ± 0.479
GlnGln: 2.182 ± 0.447
GlnArg: 3.345 ± 0.522
GlnSer: 1.6 ± 0.347
GlnThr: 2.618 ± 0.449
GlnVal: 2.691 ± 0.402
GlnTrp: 0.8 ± 0.222
GlnTyr: 0.8 ± 0.271
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 7.49 ± 0.753
ArgCys: 0.8 ± 0.288
ArgAsp: 5.018 ± 0.659
ArgGlu: 4.727 ± 0.792
ArgPhe: 1.745 ± 0.416
ArgGly: 4.8 ± 0.631
ArgHis: 1.963 ± 0.422
ArgIle: 3.2 ± 0.467
ArgLys: 2.109 ± 0.399
ArgLeu: 5.818 ± 0.69
ArgMet: 1.963 ± 0.531
ArgAsn: 1.963 ± 0.344
ArgPro: 4.145 ± 0.477
ArgGln: 3.054 ± 0.651
ArgArg: 6.763 ± 0.906
ArgSer: 2.618 ± 0.505
ArgThr: 4.291 ± 0.624
ArgVal: 5.527 ± 0.754
ArgTrp: 1.527 ± 0.262
ArgTyr: 2.036 ± 0.362
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.69 ± 0.778
SerCys: 0.218 ± 0.123
SerAsp: 3.272 ± 0.442
SerGlu: 1.818 ± 0.34
SerPhe: 1.236 ± 0.296
SerGly: 5.018 ± 0.727
SerHis: 0.945 ± 0.303
SerIle: 2.036 ± 0.36
SerLys: 1.454 ± 0.328
SerLeu: 3.345 ± 0.507
SerMet: 1.673 ± 0.314
SerAsn: 1.382 ± 0.483
SerPro: 3.345 ± 0.429
SerGln: 1.963 ± 0.349
SerArg: 3.491 ± 0.586
SerSer: 2.545 ± 0.452
SerThr: 3.563 ± 0.437
SerVal: 3.782 ± 0.521
SerTrp: 1.091 ± 0.28
SerTyr: 1.527 ± 0.322
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 9.09 ± 0.912
ThrCys: 0.8 ± 0.303
ThrAsp: 5.018 ± 0.633
ThrGlu: 3.418 ± 0.437
ThrPhe: 1.6 ± 0.384
ThrGly: 6.4 ± 0.861
ThrHis: 0.945 ± 0.295
ThrIle: 2.836 ± 0.437
ThrLys: 1.963 ± 0.306
ThrLeu: 6.545 ± 0.731
ThrMet: 0.8 ± 0.194
ThrAsn: 1.382 ± 0.289
ThrPro: 5.018 ± 0.773
ThrGln: 2.109 ± 0.434
ThrArg: 4.654 ± 0.569
ThrSer: 2.473 ± 0.502
ThrThr: 3.491 ± 0.612
ThrVal: 6.254 ± 0.725
ThrTrp: 0.582 ± 0.215
ThrTyr: 1.527 ± 0.269
ThrXaa: 0.0 ± 0.0

Val
ValAla: 8.945 ± 0.806
ValCys: 0.945 ± 0.291
ValAsp: 5.309 ± 0.767
ValGlu: 6.545 ± 0.666
ValPhe: 1.673 ± 0.363
ValGly: 4.581 ± 0.462
ValHis: 1.382 ± 0.342
ValIle: 3.491 ± 0.569
ValLys: 2.4 ± 0.466
ValLeu: 3.782 ± 0.482
ValMet: 0.945 ± 0.251
ValAsn: 2.836 ± 0.47
ValPro: 4.363 ± 0.592
ValGln: 2.618 ± 0.435
ValArg: 5.309 ± 0.601
ValSer: 3.709 ± 0.499
ValThr: 5.745 ± 0.825
ValVal: 6.181 ± 0.854
ValTrp: 1.236 ± 0.252
ValTyr: 1.091 ± 0.259
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.382 ± 0.372
TrpCys: 0.654 ± 0.26
TrpAsp: 1.236 ± 0.259
TrpGlu: 0.582 ± 0.205
TrpPhe: 0.8 ± 0.32
TrpGly: 1.164 ± 0.319
TrpHis: 0.509 ± 0.18
TrpIle: 1.018 ± 0.285
TrpLys: 0.436 ± 0.158
TrpLeu: 1.818 ± 0.424
TrpMet: 0.291 ± 0.155
TrpAsn: 0.364 ± 0.143
TrpPro: 0.8 ± 0.29
TrpGln: 0.727 ± 0.216
TrpArg: 1.382 ± 0.313
TrpSer: 1.236 ± 0.273
TrpThr: 1.527 ± 0.379
TrpVal: 1.6 ± 0.347
TrpTrp: 0.654 ± 0.208
TrpTyr: 0.509 ± 0.177
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.109 ± 0.411
TyrCys: 0.436 ± 0.188
TyrAsp: 1.6 ± 0.405
TyrGlu: 1.891 ± 0.348
TyrPhe: 0.436 ± 0.189
TyrGly: 2.036 ± 0.329
TyrHis: 0.582 ± 0.222
TyrIle: 0.873 ± 0.295
TyrLys: 0.582 ± 0.197
TyrLeu: 1.6 ± 0.285
TyrMet: 0.509 ± 0.178
TyrAsn: 0.727 ± 0.231
TyrPro: 0.8 ± 0.215
TyrGln: 0.654 ± 0.183
TyrArg: 1.454 ± 0.321
TyrSer: 1.091 ± 0.345
TyrThr: 2.109 ± 0.453
TyrVal: 2.254 ± 0.504
TyrTrp: 0.364 ± 0.231
TyrTyr: 0.654 ± 0.244
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 69 proteins (13752 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
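A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: the function names are hypothetical, sequences are taken in one-letter code, and reporting the standard deviation of the resampled estimates as the error is an assumption, since the source only says that bootstrapping (×100) was done at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille) by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (not individual residues).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MAAAK", "MKLV", "MAAC", "MGGAA"]  # toy sequences, not real data
    print(round(bootstrap_error(toy, "AA"), 3))
```

Resampling whole proteins rather than single dipeptides keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.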

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski