Amino acid dipeptide frequency for Klebsiella phage ST512-KPC3phi13.2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
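The calculation described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function names and the toy sequences are hypothetical, dipeptides are counted only within a protein (never across protein boundaries), and normalizing by the total number of dipeptides is an assumption that may differ from the normalization used for the table.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is mapped to X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(sequences):
    """Count overlapping dipeptides within each protein sequence."""
    counts = Counter()
    for seq in sequences:
        # Replace non-standard or unknown residues with X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille (assumed normalization: all dipeptides)."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome of two short proteins.
toy_proteome = ["MAAL", "MKAA"]
freqs = dipeptide_permille(toy_proteome)

# Uniform expectations in per mille: 400 dipeptides, or 441 when Xaa is included.
uniform_400 = 1000.0 / 400
uniform_441 = 1000.0 / 441

for pair, value in sorted(freqs.items()):
    label = "over" if value > uniform_400 else "under"
    print(f"{pair}: {value:.3f} per mille ({label}represented vs. uniform)")
```

Multiplying a per mille value by 10⁻³ gives the fraction, so the uniform expectation of 2.5‰ corresponds to 0.0025, that is, 0.25%.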

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.388AlaAla: 10.388 ± 1.446
1.176AlaCys: 1.176 ± 0.38
7.154AlaAsp: 7.154 ± 0.86
6.272AlaGlu: 6.272 ± 1.015
3.038AlaPhe: 3.038 ± 0.494
7.546AlaGly: 7.546 ± 1.709
1.176AlaHis: 1.176 ± 0.325
5.292AlaIle: 5.292 ± 0.47
5.096AlaLys: 5.096 ± 0.652
10.192AlaLeu: 10.192 ± 1.083
2.45AlaMet: 2.45 ± 0.438
2.45AlaAsn: 2.45 ± 0.402
2.842AlaPro: 2.842 ± 0.648
3.822AlaGln: 3.822 ± 0.806
5.684AlaArg: 5.684 ± 0.717
6.468AlaSer: 6.468 ± 0.699
4.998AlaThr: 4.998 ± 0.617
7.448AlaVal: 7.448 ± 0.805
1.372AlaTrp: 1.372 ± 0.269
3.528AlaTyr: 3.528 ± 0.568
0.0AlaXaa: 0.0 ± 0.0
Cys
0.98CysAla: 0.98 ± 0.326
0.0CysCys: 0.0 ± 0.0
0.392CysAsp: 0.392 ± 0.177
0.588CysGlu: 0.588 ± 0.19
0.196CysPhe: 0.196 ± 0.187
0.784CysGly: 0.784 ± 0.226
0.196CysHis: 0.196 ± 0.114
0.686CysIle: 0.686 ± 0.294
0.196CysLys: 0.196 ± 0.126
1.176CysLeu: 1.176 ± 0.424
0.196CysMet: 0.196 ± 0.13
0.392CysAsn: 0.392 ± 0.169
0.686CysPro: 0.686 ± 0.272
0.392CysGln: 0.392 ± 0.194
0.882CysArg: 0.882 ± 0.28
0.784CysSer: 0.784 ± 0.228
0.49CysThr: 0.49 ± 0.208
0.686CysVal: 0.686 ± 0.236
0.49CysTrp: 0.49 ± 0.205
0.196CysTyr: 0.196 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
5.978AspAla: 5.978 ± 0.659
0.784AspCys: 0.784 ± 0.235
3.43AspAsp: 3.43 ± 0.665
2.744AspGlu: 2.744 ± 0.474
2.352AspPhe: 2.352 ± 0.429
6.174AspGly: 6.174 ± 0.644
0.49AspHis: 0.49 ± 0.211
2.45AspIle: 2.45 ± 0.481
3.43AspLys: 3.43 ± 0.507
5.39AspLeu: 5.39 ± 0.802
1.764AspMet: 1.764 ± 0.427
2.744AspAsn: 2.744 ± 0.613
2.156AspPro: 2.156 ± 0.381
1.96AspGln: 1.96 ± 0.445
2.352AspArg: 2.352 ± 0.36
3.038AspSer: 3.038 ± 0.622
3.136AspThr: 3.136 ± 0.63
4.214AspVal: 4.214 ± 0.474
0.294AspTrp: 0.294 ± 0.218
2.842AspTyr: 2.842 ± 0.464
0.0AspXaa: 0.0 ± 0.0
Glu
5.586GluAla: 5.586 ± 0.885
0.392GluCys: 0.392 ± 0.181
2.45GluAsp: 2.45 ± 0.539
2.744GluGlu: 2.744 ± 0.568
2.548GluPhe: 2.548 ± 0.532
2.646GluGly: 2.646 ± 0.448
0.882GluHis: 0.882 ± 0.277
4.018GluIle: 4.018 ± 0.61
3.234GluLys: 3.234 ± 0.482
7.448GluLeu: 7.448 ± 0.779
2.156GluMet: 2.156 ± 0.568
2.646GluAsn: 2.646 ± 0.554
2.744GluPro: 2.744 ± 0.498
3.332GluGln: 3.332 ± 0.776
3.724GluArg: 3.724 ± 0.569
3.724GluSer: 3.724 ± 0.496
3.332GluThr: 3.332 ± 0.803
4.312GluVal: 4.312 ± 0.719
0.882GluTrp: 0.882 ± 0.275
1.764GluTyr: 1.764 ± 0.438
0.0GluXaa: 0.0 ± 0.0
Phe
2.842PheAla: 2.842 ± 0.568
0.294PheCys: 0.294 ± 0.178
2.058PheAsp: 2.058 ± 0.354
1.666PheGlu: 1.666 ± 0.329
1.47PhePhe: 1.47 ± 0.337
1.666PheGly: 1.666 ± 0.486
0.882PheHis: 0.882 ± 0.274
1.372PheIle: 1.372 ± 0.41
1.568PheLys: 1.568 ± 0.368
2.058PheLeu: 2.058 ± 0.493
1.078PheMet: 1.078 ± 0.277
2.94PheAsn: 2.94 ± 0.554
0.98PhePro: 0.98 ± 0.356
1.372PheGln: 1.372 ± 0.424
2.058PheArg: 2.058 ± 0.468
3.234PheSer: 3.234 ± 0.508
2.254PheThr: 2.254 ± 0.548
1.862PheVal: 1.862 ± 0.484
0.784PheTrp: 0.784 ± 0.321
0.882PheTyr: 0.882 ± 0.288
0.0PheXaa: 0.0 ± 0.0
Gly
5.782GlyAla: 5.782 ± 0.601
0.686GlyCys: 0.686 ± 0.238
4.802GlyAsp: 4.802 ± 0.555
3.626GlyGlu: 3.626 ± 0.667
2.45GlyPhe: 2.45 ± 0.541
6.664GlyGly: 6.664 ± 1.202
0.882GlyHis: 0.882 ± 0.323
3.43GlyIle: 3.43 ± 1.005
4.214GlyLys: 4.214 ± 0.523
6.272GlyLeu: 6.272 ± 0.728
2.352GlyMet: 2.352 ± 0.611
4.018GlyAsn: 4.018 ± 0.672
1.96GlyPro: 1.96 ± 0.61
1.96GlyGln: 1.96 ± 0.482
3.626GlyArg: 3.626 ± 0.547
4.704GlySer: 4.704 ± 0.65
3.822GlyThr: 3.822 ± 0.461
5.292GlyVal: 5.292 ± 0.664
1.372GlyTrp: 1.372 ± 0.351
2.156GlyTyr: 2.156 ± 0.475
0.0GlyXaa: 0.0 ± 0.0
His
1.372HisAla: 1.372 ± 0.423
0.196HisCys: 0.196 ± 0.14
0.882HisAsp: 0.882 ± 0.353
0.882HisGlu: 0.882 ± 0.345
0.392HisPhe: 0.392 ± 0.186
1.666HisGly: 1.666 ± 0.385
0.686HisHis: 0.686 ± 0.215
0.98HisIle: 0.98 ± 0.241
1.666HisLys: 1.666 ± 0.446
0.98HisLeu: 0.98 ± 0.316
0.588HisMet: 0.588 ± 0.235
0.784HisAsn: 0.784 ± 0.299
0.98HisPro: 0.98 ± 0.28
0.294HisGln: 0.294 ± 0.153
1.372HisArg: 1.372 ± 0.287
1.078HisSer: 1.078 ± 0.412
1.078HisThr: 1.078 ± 0.436
0.98HisVal: 0.98 ± 0.275
0.294HisTrp: 0.294 ± 0.139
0.784HisTyr: 0.784 ± 0.298
0.0HisXaa: 0.0 ± 0.0
Ile
5.194IleAla: 5.194 ± 0.662
0.686IleCys: 0.686 ± 0.213
3.136IleAsp: 3.136 ± 0.594
3.724IleGlu: 3.724 ± 0.747
1.862IlePhe: 1.862 ± 0.491
3.332IleGly: 3.332 ± 0.569
0.784IleHis: 0.784 ± 0.253
2.842IleIle: 2.842 ± 0.55
2.842IleLys: 2.842 ± 0.743
3.724IleLeu: 3.724 ± 0.513
1.568IleMet: 1.568 ± 0.337
3.43IleAsn: 3.43 ± 0.439
2.842IlePro: 2.842 ± 0.81
1.764IleGln: 1.764 ± 0.559
4.018IleArg: 4.018 ± 0.463
3.724IleSer: 3.724 ± 0.702
4.41IleThr: 4.41 ± 0.771
2.254IleVal: 2.254 ± 0.547
0.686IleTrp: 0.686 ± 0.281
1.47IleTyr: 1.47 ± 0.294
0.0IleXaa: 0.0 ± 0.0
Lys
4.9LysAla: 4.9 ± 0.769
0.392LysCys: 0.392 ± 0.165
2.352LysAsp: 2.352 ± 0.434
2.744LysGlu: 2.744 ± 0.53
2.156LysPhe: 2.156 ± 0.452
3.92LysGly: 3.92 ± 0.655
0.98LysHis: 0.98 ± 0.385
2.45LysIle: 2.45 ± 0.778
3.234LysLys: 3.234 ± 0.626
4.802LysLeu: 4.802 ± 0.85
0.98LysMet: 0.98 ± 0.336
2.744LysAsn: 2.744 ± 0.452
1.96LysPro: 1.96 ± 0.523
2.058LysGln: 2.058 ± 0.499
3.43LysArg: 3.43 ± 0.563
3.136LysSer: 3.136 ± 0.573
3.626LysThr: 3.626 ± 0.553
2.744LysVal: 2.744 ± 0.612
0.882LysTrp: 0.882 ± 0.377
0.98LysTyr: 0.98 ± 0.288
0.0LysXaa: 0.0 ± 0.0
Leu
9.996LeuAla: 9.996 ± 0.962
1.176LeuCys: 1.176 ± 0.307
4.9LeuAsp: 4.9 ± 0.634
5.39LeuGlu: 5.39 ± 0.56
3.136LeuPhe: 3.136 ± 0.669
4.508LeuGly: 4.508 ± 0.776
1.274LeuHis: 1.274 ± 0.333
4.606LeuIle: 4.606 ± 0.642
4.116LeuLys: 4.116 ± 0.675
7.644LeuLeu: 7.644 ± 1.123
3.038LeuMet: 3.038 ± 0.562
5.292LeuAsn: 5.292 ± 0.787
4.41LeuPro: 4.41 ± 0.757
4.214LeuGln: 4.214 ± 0.739
6.37LeuArg: 6.37 ± 0.97
7.154LeuSer: 7.154 ± 1.081
6.174LeuThr: 6.174 ± 1.084
5.194LeuVal: 5.194 ± 0.849
0.784LeuTrp: 0.784 ± 0.208
3.038LeuTyr: 3.038 ± 0.56
0.0LeuXaa: 0.0 ± 0.0
Met
3.92MetAla: 3.92 ± 0.765
0.49MetCys: 0.49 ± 0.207
0.98MetAsp: 0.98 ± 0.263
1.176MetGlu: 1.176 ± 0.308
0.784MetPhe: 0.784 ± 0.271
1.47MetGly: 1.47 ± 0.427
0.49MetHis: 0.49 ± 0.255
1.176MetIle: 1.176 ± 0.368
1.862MetLys: 1.862 ± 0.351
2.352MetLeu: 2.352 ± 0.514
1.764MetMet: 1.764 ± 0.854
0.882MetAsn: 0.882 ± 0.304
1.372MetPro: 1.372 ± 0.435
0.882MetGln: 0.882 ± 0.232
1.666MetArg: 1.666 ± 0.363
2.058MetSer: 2.058 ± 0.578
1.862MetThr: 1.862 ± 0.486
1.274MetVal: 1.274 ± 0.369
0.294MetTrp: 0.294 ± 0.137
0.784MetTyr: 0.784 ± 0.303
0.0MetXaa: 0.0 ± 0.0
Asn
3.43AsnAla: 3.43 ± 0.647
0.49AsnCys: 0.49 ± 0.185
2.352AsnAsp: 2.352 ± 0.455
3.724AsnGlu: 3.724 ± 0.501
1.078AsnPhe: 1.078 ± 0.334
3.822AsnGly: 3.822 ± 0.669
0.784AsnHis: 0.784 ± 0.406
3.234AsnIle: 3.234 ± 0.496
2.254AsnLys: 2.254 ± 0.369
4.41AsnLeu: 4.41 ± 0.581
0.882AsnMet: 0.882 ± 0.286
2.744AsnAsn: 2.744 ± 0.445
4.018AsnPro: 4.018 ± 0.589
2.156AsnGln: 2.156 ± 0.437
3.136AsnArg: 3.136 ± 0.616
2.45AsnSer: 2.45 ± 0.607
1.862AsnThr: 1.862 ± 0.404
3.038AsnVal: 3.038 ± 0.625
0.882AsnTrp: 0.882 ± 0.321
1.274AsnTyr: 1.274 ± 0.41
0.0AsnXaa: 0.0 ± 0.0
Pro
5.39ProAla: 5.39 ± 0.622
0.294ProCys: 0.294 ± 0.177
3.136ProAsp: 3.136 ± 0.523
2.254ProGlu: 2.254 ± 0.43
0.882ProPhe: 0.882 ± 0.261
3.724ProGly: 3.724 ± 0.677
1.372ProHis: 1.372 ± 0.36
1.568ProIle: 1.568 ± 0.33
1.372ProLys: 1.372 ± 0.284
3.43ProLeu: 3.43 ± 0.498
0.686ProMet: 0.686 ± 0.285
1.47ProAsn: 1.47 ± 0.33
2.156ProPro: 2.156 ± 0.398
1.372ProGln: 1.372 ± 0.39
2.646ProArg: 2.646 ± 0.481
2.352ProSer: 2.352 ± 0.413
1.47ProThr: 1.47 ± 0.356
3.332ProVal: 3.332 ± 0.544
0.294ProTrp: 0.294 ± 0.15
1.274ProTyr: 1.274 ± 0.416
0.0ProXaa: 0.0 ± 0.0
Gln
4.704GlnAla: 4.704 ± 0.507
0.196GlnCys: 0.196 ± 0.137
1.764GlnAsp: 1.764 ± 0.414
2.646GlnGlu: 2.646 ± 0.449
0.588GlnPhe: 0.588 ± 0.203
3.528GlnGly: 3.528 ± 0.439
0.294GlnHis: 0.294 ± 0.16
1.666GlnIle: 1.666 ± 0.399
1.568GlnLys: 1.568 ± 0.383
5.096GlnLeu: 5.096 ± 0.838
1.078GlnMet: 1.078 ± 0.355
2.058GlnAsn: 2.058 ± 0.379
1.568GlnPro: 1.568 ± 0.328
2.842GlnGln: 2.842 ± 0.586
3.234GlnArg: 3.234 ± 0.733
2.156GlnSer: 2.156 ± 0.488
2.45GlnThr: 2.45 ± 0.448
2.254GlnVal: 2.254 ± 0.398
0.588GlnTrp: 0.588 ± 0.221
1.372GlnTyr: 1.372 ± 0.4
0.0GlnXaa: 0.0 ± 0.0
Arg
4.9ArgAla: 4.9 ± 0.733
0.98ArgCys: 0.98 ± 0.277
3.332ArgAsp: 3.332 ± 0.404
4.9ArgGlu: 4.9 ± 0.773
2.254ArgPhe: 2.254 ± 0.512
3.528ArgGly: 3.528 ± 0.6
2.058ArgHis: 2.058 ± 0.497
4.312ArgIle: 4.312 ± 0.681
3.234ArgLys: 3.234 ± 0.986
5.978ArgLeu: 5.978 ± 0.825
1.47ArgMet: 1.47 ± 0.303
4.018ArgAsn: 4.018 ± 0.683
1.372ArgPro: 1.372 ± 0.335
3.724ArgGln: 3.724 ± 0.547
5.488ArgArg: 5.488 ± 0.806
2.744ArgSer: 2.744 ± 0.602
2.646ArgThr: 2.646 ± 0.38
4.508ArgVal: 4.508 ± 0.633
1.666ArgTrp: 1.666 ± 0.392
1.274ArgTyr: 1.274 ± 0.286
0.0ArgXaa: 0.0 ± 0.0
Ser
7.154SerAla: 7.154 ± 0.943
0.49SerCys: 0.49 ± 0.168
3.92SerAsp: 3.92 ± 0.502
3.724SerGlu: 3.724 ± 0.556
2.548SerPhe: 2.548 ± 0.429
4.41SerGly: 4.41 ± 0.619
1.47SerHis: 1.47 ± 0.464
3.626SerIle: 3.626 ± 0.769
2.352SerLys: 2.352 ± 0.542
5.39SerLeu: 5.39 ± 0.792
1.372SerMet: 1.372 ± 0.376
2.744SerAsn: 2.744 ± 0.446
2.45SerPro: 2.45 ± 0.505
2.94SerGln: 2.94 ± 0.547
3.822SerArg: 3.822 ± 0.556
3.136SerSer: 3.136 ± 0.37
3.528SerThr: 3.528 ± 0.663
4.214SerVal: 4.214 ± 0.623
1.372SerTrp: 1.372 ± 0.373
1.666SerTyr: 1.666 ± 0.423
0.0SerXaa: 0.0 ± 0.0
Thr
6.174ThrAla: 6.174 ± 0.862
0.392ThrCys: 0.392 ± 0.206
3.822ThrAsp: 3.822 ± 0.551
4.018ThrGlu: 4.018 ± 0.74
1.666ThrPhe: 1.666 ± 0.394
4.606ThrGly: 4.606 ± 0.829
1.078ThrHis: 1.078 ± 0.243
3.136ThrIle: 3.136 ± 0.665
2.156ThrLys: 2.156 ± 0.546
6.174ThrLeu: 6.174 ± 0.553
0.784ThrMet: 0.784 ± 0.298
2.156ThrAsn: 2.156 ± 0.55
1.96ThrPro: 1.96 ± 0.566
1.274ThrGln: 1.274 ± 0.323
3.626ThrArg: 3.626 ± 0.734
3.234ThrSer: 3.234 ± 0.42
4.606ThrThr: 4.606 ± 0.891
5.39ThrVal: 5.39 ± 0.73
0.588ThrTrp: 0.588 ± 0.242
1.764ThrTyr: 1.764 ± 0.495
0.0ThrXaa: 0.0 ± 0.0
Val
6.076ValAla: 6.076 ± 0.857
0.49ValCys: 0.49 ± 0.286
4.802ValAsp: 4.802 ± 0.738
3.822ValGlu: 3.822 ± 0.686
2.156ValPhe: 2.156 ± 0.452
3.43ValGly: 3.43 ± 0.733
0.98ValHis: 0.98 ± 0.37
4.802ValIle: 4.802 ± 0.647
4.214ValLys: 4.214 ± 0.742
5.096ValLeu: 5.096 ± 0.838
2.352ValMet: 2.352 ± 0.503
2.842ValAsn: 2.842 ± 0.723
2.646ValPro: 2.646 ± 0.506
3.234ValGln: 3.234 ± 0.645
2.842ValArg: 2.842 ± 0.537
4.9ValSer: 4.9 ± 0.641
4.312ValThr: 4.312 ± 0.664
4.214ValVal: 4.214 ± 0.799
0.588ValTrp: 0.588 ± 0.229
1.96ValTyr: 1.96 ± 0.481
0.0ValXaa: 0.0 ± 0.0
Trp
1.372TrpAla: 1.372 ± 0.373
0.196TrpCys: 0.196 ± 0.132
0.588TrpAsp: 0.588 ± 0.223
1.764TrpGlu: 1.764 ± 0.392
0.196TrpPhe: 0.196 ± 0.118
0.49TrpGly: 0.49 ± 0.197
0.392TrpHis: 0.392 ± 0.193
1.078TrpIle: 1.078 ± 0.351
0.686TrpLys: 0.686 ± 0.321
1.666TrpLeu: 1.666 ± 0.361
0.294TrpMet: 0.294 ± 0.153
0.294TrpAsn: 0.294 ± 0.162
0.588TrpPro: 0.588 ± 0.226
0.49TrpGln: 0.49 ± 0.219
1.372TrpArg: 1.372 ± 0.321
0.49TrpSer: 0.49 ± 0.231
0.784TrpThr: 0.784 ± 0.204
0.588TrpVal: 0.588 ± 0.271
0.49TrpTrp: 0.49 ± 0.189
0.98TrpTyr: 0.98 ± 0.338
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.352TyrAla: 2.352 ± 0.462
0.588TyrCys: 0.588 ± 0.251
1.47TyrAsp: 1.47 ± 0.318
2.156TyrGlu: 2.156 ± 0.517
1.568TyrPhe: 1.568 ± 0.482
2.156TyrGly: 2.156 ± 0.415
0.784TyrHis: 0.784 ± 0.281
1.568TyrIle: 1.568 ± 0.394
1.176TyrLys: 1.176 ± 0.308
3.136TyrLeu: 3.136 ± 0.478
0.588TyrMet: 0.588 ± 0.238
1.372TyrAsn: 1.372 ± 0.384
0.784TyrPro: 0.784 ± 0.242
1.568TyrGln: 1.568 ± 0.404
3.038TyrArg: 3.038 ± 0.527
1.764TyrSer: 1.764 ± 0.473
1.764TyrThr: 1.764 ± 0.559
1.96TyrVal: 1.96 ± 0.374
0.196TyrTrp: 0.196 ± 0.13
0.882TyrTyr: 0.882 ± 0.41
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (10205 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
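A protein-level bootstrap of this kind could look like the sketch below. The exact procedure behind the table is not described here, so this is an assumption: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation over 100 rounds is reported as the error. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def permille(sequences):
    """Dipeptide frequencies in per mille, counted within each protein."""
    counts = Counter(
        seq[i:i + 2] for seq in sequences for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Standard deviation of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Hypothetical toy proteome; a real run would use all proteins of the proteome.
toy_proteome = ["MAALAA", "MKVLAA", "MGGSTA", "MAAKLL"]
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {permille(toy_proteome)['AA']:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each dipeptide inside its original sequence context, which is why the unit of resampling matters for the size of the error.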

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski