Amino acid dipepetide frequency for Klebsiella phage vB_KpnM_IME346

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.67AlaAla: 8.67 ± 1.072
1.04AlaCys: 1.04 ± 0.364
4.439AlaAsp: 4.439 ± 0.61
5.965AlaGlu: 5.965 ± 0.574
2.566AlaPhe: 2.566 ± 0.442
7.075AlaGly: 7.075 ± 0.978
1.11AlaHis: 1.11 ± 0.241
5.133AlaIle: 5.133 ± 0.614
4.439AlaLys: 4.439 ± 0.641
6.312AlaLeu: 6.312 ± 0.649
2.428AlaMet: 2.428 ± 0.39
4.3AlaAsn: 4.3 ± 0.646
3.191AlaPro: 3.191 ± 0.504
4.647AlaGln: 4.647 ± 0.594
4.023AlaArg: 4.023 ± 0.619
5.618AlaSer: 5.618 ± 0.759
5.41AlaThr: 5.41 ± 0.762
5.896AlaVal: 5.896 ± 0.693
1.11AlaTrp: 1.11 ± 0.255
2.566AlaTyr: 2.566 ± 0.368
0.0AlaXaa: 0.0 ± 0.0
Cys
1.526CysAla: 1.526 ± 0.35
0.069CysCys: 0.069 ± 0.07
1.179CysAsp: 1.179 ± 0.294
1.179CysGlu: 1.179 ± 0.245
0.0CysPhe: 0.0 ± 0.0
0.971CysGly: 0.971 ± 0.317
0.694CysHis: 0.694 ± 0.24
0.763CysIle: 0.763 ± 0.272
0.971CysLys: 0.971 ± 0.356
0.902CysLeu: 0.902 ± 0.306
0.208CysMet: 0.208 ± 0.107
1.04CysAsn: 1.04 ± 0.312
0.763CysPro: 0.763 ± 0.257
0.347CysGln: 0.347 ± 0.13
0.902CysArg: 0.902 ± 0.27
1.318CysSer: 1.318 ± 0.303
0.832CysThr: 0.832 ± 0.23
0.902CysVal: 0.902 ± 0.311
0.208CysTrp: 0.208 ± 0.123
0.486CysTyr: 0.486 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
5.688AspAla: 5.688 ± 0.596
0.486AspCys: 0.486 ± 0.189
3.676AspAsp: 3.676 ± 0.563
4.925AspGlu: 4.925 ± 0.628
3.26AspPhe: 3.26 ± 0.532
6.589AspGly: 6.589 ± 0.603
0.416AspHis: 0.416 ± 0.146
4.855AspIle: 4.855 ± 0.597
2.289AspLys: 2.289 ± 0.341
3.746AspLeu: 3.746 ± 0.608
1.873AspMet: 1.873 ± 0.408
3.052AspAsn: 3.052 ± 0.41
2.081AspPro: 2.081 ± 0.361
1.457AspGln: 1.457 ± 0.288
3.121AspArg: 3.121 ± 0.549
3.191AspSer: 3.191 ± 0.483
4.023AspThr: 4.023 ± 0.564
4.578AspVal: 4.578 ± 0.458
1.457AspTrp: 1.457 ± 0.302
1.457AspTyr: 1.457 ± 0.333
0.0AspXaa: 0.0 ± 0.0
Glu
5.063GluAla: 5.063 ± 0.725
1.11GluCys: 1.11 ± 0.306
3.121GluAsp: 3.121 ± 0.469
3.26GluGlu: 3.26 ± 0.407
2.081GluPhe: 2.081 ± 0.39
3.468GluGly: 3.468 ± 0.387
1.179GluHis: 1.179 ± 0.29
3.884GluIle: 3.884 ± 0.552
2.983GluLys: 2.983 ± 0.476
5.826GluLeu: 5.826 ± 0.673
2.012GluMet: 2.012 ± 0.432
2.081GluAsn: 2.081 ± 0.286
1.803GluPro: 1.803 ± 0.382
3.884GluGln: 3.884 ± 0.616
3.468GluArg: 3.468 ± 0.628
2.983GluSer: 2.983 ± 0.398
2.775GluThr: 2.775 ± 0.348
3.537GluVal: 3.537 ± 0.52
1.11GluTrp: 1.11 ± 0.269
2.081GluTyr: 2.081 ± 0.304
0.0GluXaa: 0.0 ± 0.0
Phe
2.913PheAla: 2.913 ± 0.453
0.555PheCys: 0.555 ± 0.193
2.358PheAsp: 2.358 ± 0.362
2.012PheGlu: 2.012 ± 0.374
1.457PhePhe: 1.457 ± 0.372
2.705PheGly: 2.705 ± 0.363
0.624PheHis: 0.624 ± 0.186
1.595PheIle: 1.595 ± 0.325
2.15PheLys: 2.15 ± 0.368
2.012PheLeu: 2.012 ± 0.37
0.555PheMet: 0.555 ± 0.181
1.665PheAsn: 1.665 ± 0.329
1.04PhePro: 1.04 ± 0.268
0.902PheGln: 0.902 ± 0.24
1.734PheArg: 1.734 ± 0.348
1.942PheSer: 1.942 ± 0.478
2.705PheThr: 2.705 ± 0.373
2.289PheVal: 2.289 ± 0.356
0.277PheTrp: 0.277 ± 0.126
1.179PheTyr: 1.179 ± 0.266
0.0PheXaa: 0.0 ± 0.0
Gly
5.41GlyAla: 5.41 ± 0.836
1.11GlyCys: 1.11 ± 0.367
5.41GlyAsp: 5.41 ± 0.577
4.231GlyGlu: 4.231 ± 0.609
2.081GlyPhe: 2.081 ± 0.396
6.52GlyGly: 6.52 ± 1.12
1.734GlyHis: 1.734 ± 0.452
3.954GlyIle: 3.954 ± 0.62
4.509GlyLys: 4.509 ± 0.651
5.272GlyLeu: 5.272 ± 0.52
2.15GlyMet: 2.15 ± 0.347
2.983GlyAsn: 2.983 ± 0.447
1.734GlyPro: 1.734 ± 0.378
3.468GlyGln: 3.468 ± 0.407
3.954GlyArg: 3.954 ± 0.482
5.063GlySer: 5.063 ± 0.622
5.618GlyThr: 5.618 ± 0.889
6.451GlyVal: 6.451 ± 0.672
1.526GlyTrp: 1.526 ± 0.34
4.162GlyTyr: 4.162 ± 0.436
0.0GlyXaa: 0.0 ± 0.0
His
1.595HisAla: 1.595 ± 0.319
0.416HisCys: 0.416 ± 0.178
0.763HisAsp: 0.763 ± 0.222
1.457HisGlu: 1.457 ± 0.417
0.555HisPhe: 0.555 ± 0.176
1.665HisGly: 1.665 ± 0.394
0.763HisHis: 0.763 ± 0.246
1.457HisIle: 1.457 ± 0.319
0.694HisLys: 0.694 ± 0.192
1.318HisLeu: 1.318 ± 0.283
0.139HisMet: 0.139 ± 0.085
1.11HisAsn: 1.11 ± 0.256
0.763HisPro: 0.763 ± 0.228
0.208HisGln: 0.208 ± 0.117
1.179HisArg: 1.179 ± 0.319
0.971HisSer: 0.971 ± 0.23
0.971HisThr: 0.971 ± 0.211
1.11HisVal: 1.11 ± 0.272
0.555HisTrp: 0.555 ± 0.191
0.832HisTyr: 0.832 ± 0.215
0.0HisXaa: 0.0 ± 0.0
Ile
4.786IleAla: 4.786 ± 0.602
1.249IleCys: 1.249 ± 0.263
4.023IleAsp: 4.023 ± 0.4
3.815IleGlu: 3.815 ± 0.496
1.387IlePhe: 1.387 ± 0.323
3.815IleGly: 3.815 ± 0.429
0.971IleHis: 0.971 ± 0.257
3.26IleIle: 3.26 ± 0.478
3.26IleLys: 3.26 ± 0.509
3.329IleLeu: 3.329 ± 0.53
1.457IleMet: 1.457 ± 0.309
3.052IleAsn: 3.052 ± 0.5
3.954IlePro: 3.954 ± 0.552
1.873IleGln: 1.873 ± 0.335
3.329IleArg: 3.329 ± 0.477
3.746IleSer: 3.746 ± 0.486
4.092IleThr: 4.092 ± 0.459
4.647IleVal: 4.647 ± 0.545
0.624IleTrp: 0.624 ± 0.193
1.803IleTyr: 1.803 ± 0.365
0.0IleXaa: 0.0 ± 0.0
Lys
4.994LysAla: 4.994 ± 0.654
1.457LysCys: 1.457 ± 0.404
2.913LysAsp: 2.913 ± 0.409
2.566LysGlu: 2.566 ± 0.453
1.734LysPhe: 1.734 ± 0.277
2.983LysGly: 2.983 ± 0.449
0.971LysHis: 0.971 ± 0.275
2.775LysIle: 2.775 ± 0.445
2.081LysLys: 2.081 ± 0.35
4.509LysLeu: 4.509 ± 0.559
2.497LysMet: 2.497 ± 0.428
2.497LysAsn: 2.497 ± 0.464
2.636LysPro: 2.636 ± 0.456
2.566LysGln: 2.566 ± 0.466
3.191LysArg: 3.191 ± 0.372
3.26LysSer: 3.26 ± 0.523
2.705LysThr: 2.705 ± 0.42
3.121LysVal: 3.121 ± 0.522
1.249LysTrp: 1.249 ± 0.284
1.387LysTyr: 1.387 ± 0.302
0.0LysXaa: 0.0 ± 0.0
Leu
5.965LeuAla: 5.965 ± 0.763
1.11LeuCys: 1.11 ± 0.292
6.381LeuAsp: 6.381 ± 0.75
4.439LeuGlu: 4.439 ± 0.564
2.012LeuPhe: 2.012 ± 0.378
4.786LeuGly: 4.786 ± 0.495
1.457LeuHis: 1.457 ± 0.321
4.162LeuIle: 4.162 ± 0.476
4.162LeuLys: 4.162 ± 0.743
5.48LeuLeu: 5.48 ± 0.82
2.705LeuMet: 2.705 ± 0.463
4.509LeuAsn: 4.509 ± 0.536
3.121LeuPro: 3.121 ± 0.464
2.428LeuGln: 2.428 ± 0.343
4.786LeuArg: 4.786 ± 0.616
4.509LeuSer: 4.509 ± 0.566
5.063LeuThr: 5.063 ± 0.638
3.884LeuVal: 3.884 ± 0.509
1.387LeuTrp: 1.387 ± 0.331
2.289LeuTyr: 2.289 ± 0.394
0.0LeuXaa: 0.0 ± 0.0
Met
2.428MetAla: 2.428 ± 0.404
0.555MetCys: 0.555 ± 0.243
1.665MetAsp: 1.665 ± 0.425
1.249MetGlu: 1.249 ± 0.272
1.179MetPhe: 1.179 ± 0.299
1.387MetGly: 1.387 ± 0.254
0.347MetHis: 0.347 ± 0.203
1.387MetIle: 1.387 ± 0.341
1.04MetLys: 1.04 ± 0.279
2.081MetLeu: 2.081 ± 0.333
1.249MetMet: 1.249 ± 0.281
1.595MetAsn: 1.595 ± 0.365
0.763MetPro: 0.763 ± 0.169
1.249MetGln: 1.249 ± 0.291
1.803MetArg: 1.803 ± 0.353
2.566MetSer: 2.566 ± 0.415
2.983MetThr: 2.983 ± 0.442
1.873MetVal: 1.873 ± 0.342
0.208MetTrp: 0.208 ± 0.117
0.624MetTyr: 0.624 ± 0.192
0.0MetXaa: 0.0 ± 0.0
Asn
4.162AsnAla: 4.162 ± 0.556
1.04AsnCys: 1.04 ± 0.38
2.636AsnAsp: 2.636 ± 0.414
2.012AsnGlu: 2.012 ± 0.416
2.15AsnPhe: 2.15 ± 0.378
5.48AsnGly: 5.48 ± 0.761
1.457AsnHis: 1.457 ± 0.302
2.566AsnIle: 2.566 ± 0.396
2.497AsnLys: 2.497 ± 0.506
2.913AsnLeu: 2.913 ± 0.399
1.04AsnMet: 1.04 ± 0.268
2.705AsnAsn: 2.705 ± 0.479
2.913AsnPro: 2.913 ± 0.475
1.595AsnGln: 1.595 ± 0.348
1.665AsnArg: 1.665 ± 0.325
4.023AsnSer: 4.023 ± 0.71
2.983AsnThr: 2.983 ± 0.604
3.815AsnVal: 3.815 ± 0.566
0.555AsnTrp: 0.555 ± 0.17
1.249AsnTyr: 1.249 ± 0.318
0.0AsnXaa: 0.0 ± 0.0
Pro
3.537ProAla: 3.537 ± 0.6
0.486ProCys: 0.486 ± 0.175
2.497ProAsp: 2.497 ± 0.384
2.15ProGlu: 2.15 ± 0.378
1.457ProPhe: 1.457 ± 0.327
3.676ProGly: 3.676 ± 0.585
0.555ProHis: 0.555 ± 0.193
2.012ProIle: 2.012 ± 0.41
1.179ProLys: 1.179 ± 0.319
3.191ProLeu: 3.191 ± 0.373
0.902ProMet: 0.902 ± 0.275
2.566ProAsn: 2.566 ± 0.404
2.22ProPro: 2.22 ± 0.577
1.595ProGln: 1.595 ± 0.323
1.942ProArg: 1.942 ± 0.358
2.497ProSer: 2.497 ± 0.354
2.636ProThr: 2.636 ± 0.499
4.231ProVal: 4.231 ± 0.513
0.832ProTrp: 0.832 ± 0.201
1.318ProTyr: 1.318 ± 0.274
0.0ProXaa: 0.0 ± 0.0
Gln
2.636GlnAla: 2.636 ± 0.416
0.347GlnCys: 0.347 ± 0.159
1.595GlnAsp: 1.595 ± 0.269
2.636GlnGlu: 2.636 ± 0.34
1.942GlnPhe: 1.942 ± 0.337
2.705GlnGly: 2.705 ± 0.44
0.555GlnHis: 0.555 ± 0.179
2.22GlnIle: 2.22 ± 0.426
1.734GlnLys: 1.734 ± 0.429
4.162GlnLeu: 4.162 ± 0.625
1.11GlnMet: 1.11 ± 0.292
1.526GlnAsn: 1.526 ± 0.316
1.526GlnPro: 1.526 ± 0.325
2.22GlnGln: 2.22 ± 0.54
2.289GlnArg: 2.289 ± 0.469
2.705GlnSer: 2.705 ± 0.545
3.191GlnThr: 3.191 ± 0.466
2.775GlnVal: 2.775 ± 0.408
0.624GlnTrp: 0.624 ± 0.196
1.942GlnTyr: 1.942 ± 0.296
0.0GlnXaa: 0.0 ± 0.0
Arg
3.815ArgAla: 3.815 ± 0.578
0.832ArgCys: 0.832 ± 0.242
3.121ArgAsp: 3.121 ± 0.557
2.913ArgGlu: 2.913 ± 0.592
1.942ArgPhe: 1.942 ± 0.321
2.289ArgGly: 2.289 ± 0.415
1.179ArgHis: 1.179 ± 0.25
3.954ArgIle: 3.954 ± 0.452
3.468ArgLys: 3.468 ± 0.543
4.647ArgLeu: 4.647 ± 0.513
1.11ArgMet: 1.11 ± 0.278
2.428ArgAsn: 2.428 ± 0.301
2.289ArgPro: 2.289 ± 0.384
2.636ArgGln: 2.636 ± 0.469
2.566ArgArg: 2.566 ± 0.456
2.289ArgSer: 2.289 ± 0.384
2.012ArgThr: 2.012 ± 0.289
4.231ArgVal: 4.231 ± 0.561
1.179ArgTrp: 1.179 ± 0.278
1.942ArgTyr: 1.942 ± 0.36
0.0ArgXaa: 0.0 ± 0.0
Ser
5.896SerAla: 5.896 ± 1.027
0.555SerCys: 0.555 ± 0.171
3.746SerAsp: 3.746 ± 0.479
2.289SerGlu: 2.289 ± 0.364
1.665SerPhe: 1.665 ± 0.364
5.896SerGly: 5.896 ± 0.601
1.249SerHis: 1.249 ± 0.337
3.26SerIle: 3.26 ± 0.486
4.162SerLys: 4.162 ± 0.577
5.063SerLeu: 5.063 ± 0.506
1.873SerMet: 1.873 ± 0.422
3.329SerAsn: 3.329 ± 0.487
2.289SerPro: 2.289 ± 0.44
3.121SerGln: 3.121 ± 0.496
2.636SerArg: 2.636 ± 0.406
3.399SerSer: 3.399 ± 0.493
4.023SerThr: 4.023 ± 0.708
4.578SerVal: 4.578 ± 0.573
0.763SerTrp: 0.763 ± 0.203
2.636SerTyr: 2.636 ± 0.342
0.0SerXaa: 0.0 ± 0.0
Thr
5.826ThrAla: 5.826 ± 0.65
0.416ThrCys: 0.416 ± 0.182
4.023ThrAsp: 4.023 ± 0.596
4.023ThrGlu: 4.023 ± 0.558
1.526ThrPhe: 1.526 ± 0.308
6.451ThrGly: 6.451 ± 0.605
1.595ThrHis: 1.595 ± 0.327
4.786ThrIle: 4.786 ± 0.652
3.329ThrLys: 3.329 ± 0.578
5.826ThrLeu: 5.826 ± 0.642
1.457ThrMet: 1.457 ± 0.323
2.497ThrAsn: 2.497 ± 0.501
3.884ThrPro: 3.884 ± 0.589
1.873ThrGln: 1.873 ± 0.412
2.15ThrArg: 2.15 ± 0.325
5.063ThrSer: 5.063 ± 0.655
5.133ThrThr: 5.133 ± 0.843
5.965ThrVal: 5.965 ± 0.636
1.04ThrTrp: 1.04 ± 0.277
2.15ThrTyr: 2.15 ± 0.331
0.0ThrXaa: 0.0 ± 0.0
Val
7.144ValAla: 7.144 ± 0.784
1.249ValCys: 1.249 ± 0.301
5.549ValAsp: 5.549 ± 0.605
4.162ValGlu: 4.162 ± 0.503
2.22ValPhe: 2.22 ± 0.484
4.925ValGly: 4.925 ± 0.665
0.971ValHis: 0.971 ± 0.24
4.37ValIle: 4.37 ± 0.457
4.578ValLys: 4.578 ± 0.475
4.023ValLeu: 4.023 ± 0.465
2.15ValMet: 2.15 ± 0.319
4.162ValAsn: 4.162 ± 0.581
2.705ValPro: 2.705 ± 0.504
2.428ValGln: 2.428 ± 0.437
3.399ValArg: 3.399 ± 0.488
3.815ValSer: 3.815 ± 0.501
7.63ValThr: 7.63 ± 0.789
3.884ValVal: 3.884 ± 0.502
0.694ValTrp: 0.694 ± 0.192
2.566ValTyr: 2.566 ± 0.393
0.0ValXaa: 0.0 ± 0.0
Trp
1.179TrpAla: 1.179 ± 0.216
0.277TrpCys: 0.277 ± 0.128
0.971TrpAsp: 0.971 ± 0.241
0.624TrpGlu: 0.624 ± 0.255
0.624TrpPhe: 0.624 ± 0.222
0.902TrpGly: 0.902 ± 0.238
0.069TrpHis: 0.069 ± 0.052
0.486TrpIle: 0.486 ± 0.212
0.971TrpLys: 0.971 ± 0.238
1.387TrpLeu: 1.387 ± 0.258
0.347TrpMet: 0.347 ± 0.16
0.832TrpAsn: 0.832 ± 0.241
0.624TrpPro: 0.624 ± 0.23
0.763TrpGln: 0.763 ± 0.269
0.832TrpArg: 0.832 ± 0.234
0.971TrpSer: 0.971 ± 0.232
1.249TrpThr: 1.249 ± 0.256
1.803TrpVal: 1.803 ± 0.369
0.416TrpTrp: 0.416 ± 0.182
0.832TrpTyr: 0.832 ± 0.266
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.705TyrAla: 2.705 ± 0.391
0.902TyrCys: 0.902 ± 0.266
2.15TyrAsp: 2.15 ± 0.454
1.803TyrGlu: 1.803 ± 0.349
0.832TyrPhe: 0.832 ± 0.229
3.052TyrGly: 3.052 ± 0.408
0.624TyrHis: 0.624 ± 0.209
1.595TyrIle: 1.595 ± 0.331
1.595TyrLys: 1.595 ± 0.325
2.636TyrLeu: 2.636 ± 0.463
0.902TyrMet: 0.902 ± 0.246
1.526TyrAsn: 1.526 ± 0.348
1.179TyrPro: 1.179 ± 0.25
1.249TyrGln: 1.249 ± 0.305
1.873TyrArg: 1.873 ± 0.321
2.497TyrSer: 2.497 ± 0.393
2.983TyrThr: 2.983 ± 0.368
3.052TyrVal: 3.052 ± 0.359
0.347TyrTrp: 0.347 ± 0.172
0.971TyrTyr: 0.971 ± 0.241
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (14418 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski