Amino acid dipepetide frequency for Klebsiella phage ST846-OXA48phi9.1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.185AlaAla: 12.185 ± 1.762
1.1AlaCys: 1.1 ± 0.367
8.292AlaAsp: 8.292 ± 1.389
5.585AlaGlu: 5.585 ± 0.572
1.862AlaPhe: 1.862 ± 0.576
8.716AlaGly: 8.716 ± 1.106
1.1AlaHis: 1.1 ± 0.411
5.077AlaIle: 5.077 ± 0.673
5.585AlaLys: 5.585 ± 0.787
9.646AlaLeu: 9.646 ± 0.775
2.539AlaMet: 2.539 ± 0.541
3.554AlaAsn: 3.554 ± 1.383
2.031AlaPro: 2.031 ± 0.298
4.739AlaGln: 4.739 ± 1.014
4.992AlaArg: 4.992 ± 0.801
6.769AlaSer: 6.769 ± 1.287
7.785AlaThr: 7.785 ± 1.325
7.531AlaVal: 7.531 ± 1.27
1.777AlaTrp: 1.777 ± 0.4
2.708AlaTyr: 2.708 ± 0.428
0.0AlaXaa: 0.0 ± 0.0
Cys
0.338CysAla: 0.338 ± 0.268
0.169CysCys: 0.169 ± 0.164
0.592CysAsp: 0.592 ± 0.28
0.677CysGlu: 0.677 ± 0.352
0.338CysPhe: 0.338 ± 0.21
0.508CysGly: 0.508 ± 0.221
0.085CysHis: 0.085 ± 0.11
0.508CysIle: 0.508 ± 0.324
0.508CysLys: 0.508 ± 0.304
0.423CysLeu: 0.423 ± 0.281
0.338CysMet: 0.338 ± 0.224
0.338CysAsn: 0.338 ± 0.212
0.338CysPro: 0.338 ± 0.136
0.423CysGln: 0.423 ± 0.208
0.762CysArg: 0.762 ± 0.427
0.508CysSer: 0.508 ± 0.323
0.423CysThr: 0.423 ± 0.192
0.423CysVal: 0.423 ± 0.173
0.677CysTrp: 0.677 ± 0.348
0.423CysTyr: 0.423 ± 0.186
0.0CysXaa: 0.0 ± 0.0
Asp
6.939AspAla: 6.939 ± 1.248
0.423AspCys: 0.423 ± 0.263
3.385AspAsp: 3.385 ± 0.795
3.215AspGlu: 3.215 ± 0.702
2.369AspPhe: 2.369 ± 0.447
6.854AspGly: 6.854 ± 0.971
0.338AspHis: 0.338 ± 0.166
3.3AspIle: 3.3 ± 0.529
2.792AspLys: 2.792 ± 0.678
4.823AspLeu: 4.823 ± 0.886
1.438AspMet: 1.438 ± 0.575
2.454AspAsn: 2.454 ± 0.552
2.454AspPro: 2.454 ± 0.51
2.115AspGln: 2.115 ± 0.506
3.046AspArg: 3.046 ± 0.608
3.131AspSer: 3.131 ± 0.638
3.046AspThr: 3.046 ± 0.478
3.131AspVal: 3.131 ± 0.87
0.931AspTrp: 0.931 ± 0.302
1.354AspTyr: 1.354 ± 0.453
0.0AspXaa: 0.0 ± 0.0
Glu
4.569GluAla: 4.569 ± 0.734
0.592GluCys: 0.592 ± 0.24
2.031GluAsp: 2.031 ± 0.592
2.792GluGlu: 2.792 ± 0.846
2.115GluPhe: 2.115 ± 0.286
3.3GluGly: 3.3 ± 0.507
0.931GluHis: 0.931 ± 0.473
3.639GluIle: 3.639 ± 0.696
3.723GluLys: 3.723 ± 0.853
5.5GluLeu: 5.5 ± 1.192
1.354GluMet: 1.354 ± 0.363
2.623GluAsn: 2.623 ± 0.54
1.438GluPro: 1.438 ± 0.406
2.623GluGln: 2.623 ± 0.486
3.385GluArg: 3.385 ± 0.956
3.131GluSer: 3.131 ± 0.643
3.808GluThr: 3.808 ± 0.545
2.792GluVal: 2.792 ± 0.514
0.254GluTrp: 0.254 ± 0.153
1.523GluTyr: 1.523 ± 0.312
0.0GluXaa: 0.0 ± 0.0
Phe
2.031PheAla: 2.031 ± 0.509
0.254PheCys: 0.254 ± 0.179
2.708PheAsp: 2.708 ± 0.56
2.454PheGlu: 2.454 ± 0.387
0.677PhePhe: 0.677 ± 0.356
2.285PheGly: 2.285 ± 0.711
0.677PheHis: 0.677 ± 0.282
2.031PheIle: 2.031 ± 0.699
1.354PheLys: 1.354 ± 0.459
2.031PheLeu: 2.031 ± 0.42
0.592PheMet: 0.592 ± 0.38
2.031PheAsn: 2.031 ± 0.456
1.185PhePro: 1.185 ± 0.467
0.762PheGln: 0.762 ± 0.228
1.862PheArg: 1.862 ± 0.432
2.2PheSer: 2.2 ± 0.395
1.692PheThr: 1.692 ± 0.289
1.354PheVal: 1.354 ± 0.272
0.508PheTrp: 0.508 ± 0.199
0.846PheTyr: 0.846 ± 0.278
0.0PheXaa: 0.0 ± 0.0
Gly
7.7GlyAla: 7.7 ± 0.98
0.677GlyCys: 0.677 ± 0.234
5.246GlyAsp: 5.246 ± 0.56
2.708GlyGlu: 2.708 ± 0.498
2.285GlyPhe: 2.285 ± 0.687
6.346GlyGly: 6.346 ± 0.695
0.508GlyHis: 0.508 ± 0.247
4.569GlyIle: 4.569 ± 0.83
4.823GlyLys: 4.823 ± 0.535
6.6GlyLeu: 6.6 ± 0.961
2.369GlyMet: 2.369 ± 0.57
4.823GlyAsn: 4.823 ± 0.929
1.523GlyPro: 1.523 ± 0.706
3.554GlyGln: 3.554 ± 0.581
3.808GlyArg: 3.808 ± 0.732
5.331GlySer: 5.331 ± 0.881
6.008GlyThr: 6.008 ± 0.763
4.992GlyVal: 4.992 ± 0.823
1.692GlyTrp: 1.692 ± 0.483
2.031GlyTyr: 2.031 ± 0.421
0.0GlyXaa: 0.0 ± 0.0
His
1.015HisAla: 1.015 ± 0.321
0.085HisCys: 0.085 ± 0.109
0.846HisAsp: 0.846 ± 0.417
0.846HisGlu: 0.846 ± 0.391
0.508HisPhe: 0.508 ± 0.251
0.846HisGly: 0.846 ± 0.301
1.015HisHis: 1.015 ± 0.622
0.508HisIle: 0.508 ± 0.192
0.338HisLys: 0.338 ± 0.204
1.015HisLeu: 1.015 ± 0.511
0.592HisMet: 0.592 ± 0.271
0.169HisAsn: 0.169 ± 0.154
0.677HisPro: 0.677 ± 0.247
0.254HisGln: 0.254 ± 0.189
0.592HisArg: 0.592 ± 0.199
1.015HisSer: 1.015 ± 0.348
0.677HisThr: 0.677 ± 0.361
0.592HisVal: 0.592 ± 0.224
0.085HisTrp: 0.085 ± 0.093
0.592HisTyr: 0.592 ± 0.307
0.0HisXaa: 0.0 ± 0.0
Ile
5.839IleAla: 5.839 ± 1.09
0.338IleCys: 0.338 ± 0.228
4.231IleAsp: 4.231 ± 0.712
3.469IleGlu: 3.469 ± 0.921
1.1IlePhe: 1.1 ± 0.362
3.3IleGly: 3.3 ± 0.494
0.846IleHis: 0.846 ± 0.466
3.3IleIle: 3.3 ± 0.997
3.469IleLys: 3.469 ± 0.877
3.3IleLeu: 3.3 ± 0.945
0.508IleMet: 0.508 ± 0.258
2.2IleAsn: 2.2 ± 0.316
2.792IlePro: 2.792 ± 0.468
2.454IleGln: 2.454 ± 0.643
3.131IleArg: 3.131 ± 0.571
4.569IleSer: 4.569 ± 0.767
6.262IleThr: 6.262 ± 1.28
2.877IleVal: 2.877 ± 0.651
0.677IleTrp: 0.677 ± 0.242
1.1IleTyr: 1.1 ± 0.331
0.0IleXaa: 0.0 ± 0.0
Lys
6.177LysAla: 6.177 ± 0.643
0.254LysCys: 0.254 ± 0.262
2.369LysAsp: 2.369 ± 0.435
2.708LysGlu: 2.708 ± 0.964
1.015LysPhe: 1.015 ± 0.286
2.623LysGly: 2.623 ± 0.762
0.423LysHis: 0.423 ± 0.278
3.723LysIle: 3.723 ± 1.022
3.723LysLys: 3.723 ± 0.695
4.485LysLeu: 4.485 ± 0.651
1.1LysMet: 1.1 ± 0.393
3.385LysAsn: 3.385 ± 0.936
2.115LysPro: 2.115 ± 0.719
3.385LysGln: 3.385 ± 0.692
2.2LysArg: 2.2 ± 0.725
3.385LysSer: 3.385 ± 0.679
3.723LysThr: 3.723 ± 0.504
3.554LysVal: 3.554 ± 0.465
0.846LysTrp: 0.846 ± 0.252
0.846LysTyr: 0.846 ± 0.321
0.0LysXaa: 0.0 ± 0.0
Leu
8.546LeuAla: 8.546 ± 1.405
0.762LeuCys: 0.762 ± 0.376
4.823LeuAsp: 4.823 ± 0.637
4.146LeuGlu: 4.146 ± 0.417
1.946LeuPhe: 1.946 ± 0.692
5.754LeuGly: 5.754 ± 1.047
0.931LeuHis: 0.931 ± 0.487
4.062LeuIle: 4.062 ± 0.837
3.723LeuLys: 3.723 ± 0.657
6.515LeuLeu: 6.515 ± 1.799
1.946LeuMet: 1.946 ± 0.683
4.146LeuAsn: 4.146 ± 0.719
3.723LeuPro: 3.723 ± 1.104
4.569LeuGln: 4.569 ± 0.737
4.908LeuArg: 4.908 ± 0.885
7.192LeuSer: 7.192 ± 0.943
10.323LeuThr: 10.323 ± 2.233
4.908LeuVal: 4.908 ± 0.94
0.338LeuTrp: 0.338 ± 0.206
2.454LeuTyr: 2.454 ± 0.441
0.0LeuXaa: 0.0 ± 0.0
Met
2.539MetAla: 2.539 ± 0.745
0.169MetCys: 0.169 ± 0.155
0.677MetAsp: 0.677 ± 0.36
0.931MetGlu: 0.931 ± 0.282
0.508MetPhe: 0.508 ± 0.261
1.269MetGly: 1.269 ± 0.26
0.169MetHis: 0.169 ± 0.135
1.185MetIle: 1.185 ± 0.561
2.285MetLys: 2.285 ± 0.756
2.539MetLeu: 2.539 ± 1.005
0.931MetMet: 0.931 ± 0.362
1.354MetAsn: 1.354 ± 0.251
1.185MetPro: 1.185 ± 0.254
0.762MetGln: 0.762 ± 0.262
1.015MetArg: 1.015 ± 0.258
1.862MetSer: 1.862 ± 0.5
1.269MetThr: 1.269 ± 0.422
1.608MetVal: 1.608 ± 0.377
0.169MetTrp: 0.169 ± 0.181
0.931MetTyr: 0.931 ± 0.311
0.0MetXaa: 0.0 ± 0.0
Asn
5.839AsnAla: 5.839 ± 1.045
0.677AsnCys: 0.677 ± 0.453
1.777AsnAsp: 1.777 ± 0.339
1.608AsnGlu: 1.608 ± 0.376
2.454AsnPhe: 2.454 ± 0.536
3.892AsnGly: 3.892 ± 0.779
0.338AsnHis: 0.338 ± 0.177
2.539AsnIle: 2.539 ± 0.475
2.877AsnLys: 2.877 ± 0.62
4.146AsnLeu: 4.146 ± 0.718
1.015AsnMet: 1.015 ± 0.405
2.962AsnAsn: 2.962 ± 0.919
1.438AsnPro: 1.438 ± 0.351
2.623AsnGln: 2.623 ± 0.517
1.946AsnArg: 1.946 ± 0.295
3.723AsnSer: 3.723 ± 1.119
3.639AsnThr: 3.639 ± 1.413
3.046AsnVal: 3.046 ± 0.414
0.508AsnTrp: 0.508 ± 0.212
1.185AsnTyr: 1.185 ± 0.368
0.0AsnXaa: 0.0 ± 0.0
Pro
3.723ProAla: 3.723 ± 0.652
0.423ProCys: 0.423 ± 0.222
2.2ProAsp: 2.2 ± 0.393
2.369ProGlu: 2.369 ± 0.79
1.608ProPhe: 1.608 ± 0.325
4.146ProGly: 4.146 ± 0.71
0.931ProHis: 0.931 ± 0.345
1.269ProIle: 1.269 ± 0.603
1.015ProLys: 1.015 ± 0.352
3.3ProLeu: 3.3 ± 1.052
0.592ProMet: 0.592 ± 0.293
1.354ProAsn: 1.354 ± 0.379
1.015ProPro: 1.015 ± 0.401
1.269ProGln: 1.269 ± 0.283
1.269ProArg: 1.269 ± 0.408
2.539ProSer: 2.539 ± 0.392
1.523ProThr: 1.523 ± 0.416
2.962ProVal: 2.962 ± 0.76
0.762ProTrp: 0.762 ± 0.257
0.931ProTyr: 0.931 ± 0.235
0.0ProXaa: 0.0 ± 0.0
Gln
5.246GlnAla: 5.246 ± 1.57
0.169GlnCys: 0.169 ± 0.122
2.285GlnAsp: 2.285 ± 0.311
3.385GlnGlu: 3.385 ± 0.625
1.269GlnPhe: 1.269 ± 0.505
3.977GlnGly: 3.977 ± 0.956
0.508GlnHis: 0.508 ± 0.256
1.523GlnIle: 1.523 ± 0.431
2.031GlnLys: 2.031 ± 0.323
5.077GlnLeu: 5.077 ± 0.504
1.692GlnMet: 1.692 ± 0.403
2.285GlnAsn: 2.285 ± 0.586
1.946GlnPro: 1.946 ± 0.688
3.215GlnGln: 3.215 ± 0.563
1.946GlnArg: 1.946 ± 0.516
4.485GlnSer: 4.485 ± 1.088
3.469GlnThr: 3.469 ± 0.63
3.639GlnVal: 3.639 ± 0.472
0.846GlnTrp: 0.846 ± 0.276
1.777GlnTyr: 1.777 ± 0.658
0.0GlnXaa: 0.0 ± 0.0
Arg
3.892ArgAla: 3.892 ± 0.7
0.338ArgCys: 0.338 ± 0.241
2.962ArgAsp: 2.962 ± 0.713
2.2ArgGlu: 2.2 ± 0.56
1.862ArgPhe: 1.862 ± 0.544
3.131ArgGly: 3.131 ± 0.606
1.185ArgHis: 1.185 ± 0.539
3.469ArgIle: 3.469 ± 0.584
3.046ArgLys: 3.046 ± 0.818
3.977ArgLeu: 3.977 ± 0.486
1.1ArgMet: 1.1 ± 0.427
2.539ArgAsn: 2.539 ± 0.341
1.777ArgPro: 1.777 ± 0.317
2.877ArgGln: 2.877 ± 0.671
3.385ArgArg: 3.385 ± 1.057
3.3ArgSer: 3.3 ± 0.575
3.131ArgThr: 3.131 ± 0.425
4.315ArgVal: 4.315 ± 1.108
0.592ArgTrp: 0.592 ± 0.207
2.031ArgTyr: 2.031 ± 0.428
0.0ArgXaa: 0.0 ± 0.0
Ser
8.123SerAla: 8.123 ± 1.118
0.677SerCys: 0.677 ± 0.24
3.977SerAsp: 3.977 ± 0.561
3.046SerGlu: 3.046 ± 0.805
2.539SerPhe: 2.539 ± 0.519
6.431SerGly: 6.431 ± 0.936
0.592SerHis: 0.592 ± 0.255
3.808SerIle: 3.808 ± 0.491
3.131SerLys: 3.131 ± 0.613
8.631SerLeu: 8.631 ± 1.664
1.777SerMet: 1.777 ± 0.531
3.131SerAsn: 3.131 ± 0.685
2.708SerPro: 2.708 ± 0.352
5.585SerGln: 5.585 ± 1.401
3.046SerArg: 3.046 ± 0.428
4.823SerSer: 4.823 ± 1.004
4.146SerThr: 4.146 ± 0.714
4.908SerVal: 4.908 ± 0.59
1.015SerTrp: 1.015 ± 0.4
2.2SerTyr: 2.2 ± 0.412
0.0SerXaa: 0.0 ± 0.0
Thr
10.662ThrAla: 10.662 ± 2.254
0.592ThrCys: 0.592 ± 0.247
4.315ThrAsp: 4.315 ± 1.066
3.723ThrGlu: 3.723 ± 0.535
1.438ThrPhe: 1.438 ± 0.32
6.6ThrGly: 6.6 ± 1.002
0.592ThrHis: 0.592 ± 0.214
3.892ThrIle: 3.892 ± 0.503
2.962ThrLys: 2.962 ± 0.566
6.262ThrLeu: 6.262 ± 0.639
1.015ThrMet: 1.015 ± 0.287
4.062ThrAsn: 4.062 ± 1.54
1.692ThrPro: 1.692 ± 0.312
4.231ThrGln: 4.231 ± 1.313
3.723ThrArg: 3.723 ± 0.482
7.277ThrSer: 7.277 ± 1.582
6.346ThrThr: 6.346 ± 1.83
5.162ThrVal: 5.162 ± 0.998
1.1ThrTrp: 1.1 ± 0.308
1.015ThrTyr: 1.015 ± 0.298
0.0ThrXaa: 0.0 ± 0.0
Val
4.485ValAla: 4.485 ± 0.687
0.338ValCys: 0.338 ± 0.249
3.639ValAsp: 3.639 ± 0.392
3.469ValGlu: 3.469 ± 0.471
1.608ValPhe: 1.608 ± 0.421
4.823ValGly: 4.823 ± 0.777
0.508ValHis: 0.508 ± 0.19
4.146ValIle: 4.146 ± 0.781
3.215ValLys: 3.215 ± 0.439
4.4ValLeu: 4.4 ± 1.07
1.608ValMet: 1.608 ± 0.389
3.385ValAsn: 3.385 ± 0.699
3.046ValPro: 3.046 ± 0.549
2.962ValGln: 2.962 ± 0.406
3.469ValArg: 3.469 ± 0.586
5.923ValSer: 5.923 ± 1.255
5.923ValThr: 5.923 ± 1.79
3.977ValVal: 3.977 ± 0.558
1.015ValTrp: 1.015 ± 0.441
2.115ValTyr: 2.115 ± 0.457
0.0ValXaa: 0.0 ± 0.0
Trp
1.269TrpAla: 1.269 ± 0.393
0.169TrpCys: 0.169 ± 0.157
0.085TrpAsp: 0.085 ± 0.056
0.931TrpGlu: 0.931 ± 0.261
0.423TrpPhe: 0.423 ± 0.18
0.931TrpGly: 0.931 ± 0.381
0.423TrpHis: 0.423 ± 0.182
0.677TrpIle: 0.677 ± 0.254
0.592TrpLys: 0.592 ± 0.227
1.1TrpLeu: 1.1 ± 0.324
0.338TrpMet: 0.338 ± 0.214
0.592TrpAsn: 0.592 ± 0.17
0.592TrpPro: 0.592 ± 0.207
1.1TrpGln: 1.1 ± 0.308
0.677TrpArg: 0.677 ± 0.202
1.1TrpSer: 1.1 ± 0.378
1.608TrpThr: 1.608 ± 0.512
1.185TrpVal: 1.185 ± 0.365
0.423TrpTrp: 0.423 ± 0.246
0.254TrpTyr: 0.254 ± 0.133
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.454TyrAla: 2.454 ± 0.527
0.677TyrCys: 0.677 ± 0.311
1.1TyrAsp: 1.1 ± 0.3
1.862TyrGlu: 1.862 ± 0.312
1.608TyrPhe: 1.608 ± 0.496
1.862TyrGly: 1.862 ± 0.507
0.169TyrHis: 0.169 ± 0.133
2.539TyrIle: 2.539 ± 0.39
0.846TyrLys: 0.846 ± 0.407
1.946TyrLeu: 1.946 ± 0.388
0.423TyrMet: 0.423 ± 0.188
0.846TyrAsn: 0.846 ± 0.252
1.523TyrPro: 1.523 ± 0.474
1.269TyrGln: 1.269 ± 0.479
1.946TyrArg: 1.946 ± 0.371
2.031TyrSer: 2.031 ± 0.413
1.862TyrThr: 1.862 ± 0.313
1.1TyrVal: 1.1 ± 0.343
0.254TyrTrp: 0.254 ± 0.15
0.592TyrTyr: 0.592 ± 0.171
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 38 proteins (11819 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski