Amino acid dipepetide frequency for Klebsiella phage ST16-OXA48phi5.4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.009AlaAla: 9.009 ± 1.226
0.944AlaCys: 0.944 ± 0.327
5.663AlaAsp: 5.663 ± 0.962
6.778AlaGlu: 6.778 ± 0.906
4.376AlaPhe: 4.376 ± 0.79
7.808AlaGly: 7.808 ± 1.234
1.373AlaHis: 1.373 ± 0.328
4.633AlaIle: 4.633 ± 0.533
3.604AlaLys: 3.604 ± 0.561
11.068AlaLeu: 11.068 ± 0.964
2.145AlaMet: 2.145 ± 0.51
2.488AlaAsn: 2.488 ± 0.484
2.746AlaPro: 2.746 ± 0.52
3.089AlaGln: 3.089 ± 0.535
6.178AlaArg: 6.178 ± 0.966
6.349AlaSer: 6.349 ± 0.708
5.577AlaThr: 5.577 ± 0.68
6.607AlaVal: 6.607 ± 0.612
1.287AlaTrp: 1.287 ± 0.315
2.402AlaTyr: 2.402 ± 0.524
0.0AlaXaa: 0.0 ± 0.0
Cys
1.115CysAla: 1.115 ± 0.347
0.172CysCys: 0.172 ± 0.123
0.686CysAsp: 0.686 ± 0.3
0.429CysGlu: 0.429 ± 0.18
0.515CysPhe: 0.515 ± 0.234
1.201CysGly: 1.201 ± 0.412
0.086CysHis: 0.086 ± 0.104
0.772CysIle: 0.772 ± 0.224
0.515CysLys: 0.515 ± 0.256
0.858CysLeu: 0.858 ± 0.257
0.172CysMet: 0.172 ± 0.124
0.686CysAsn: 0.686 ± 0.257
0.515CysPro: 0.515 ± 0.212
0.343CysGln: 0.343 ± 0.186
0.858CysArg: 0.858 ± 0.244
1.716CysSer: 1.716 ± 0.868
1.287CysThr: 1.287 ± 0.442
0.601CysVal: 0.601 ± 0.264
0.257CysTrp: 0.257 ± 0.123
0.172CysTyr: 0.172 ± 0.12
0.0CysXaa: 0.0 ± 0.0
Asp
5.32AspAla: 5.32 ± 0.665
0.858AspCys: 0.858 ± 0.224
3.175AspAsp: 3.175 ± 0.542
3.689AspGlu: 3.689 ± 0.532
3.175AspPhe: 3.175 ± 0.42
5.148AspGly: 5.148 ± 0.463
0.772AspHis: 0.772 ± 0.282
4.118AspIle: 4.118 ± 0.622
3.432AspLys: 3.432 ± 0.561
4.547AspLeu: 4.547 ± 0.663
1.287AspMet: 1.287 ± 0.313
2.402AspAsn: 2.402 ± 0.514
1.973AspPro: 1.973 ± 0.526
2.059AspGln: 2.059 ± 0.391
3.346AspArg: 3.346 ± 0.502
3.518AspSer: 3.518 ± 0.63
2.746AspThr: 2.746 ± 0.435
3.089AspVal: 3.089 ± 0.479
0.772AspTrp: 0.772 ± 0.298
2.574AspTyr: 2.574 ± 0.493
0.0AspXaa: 0.0 ± 0.0
Glu
6.178GluAla: 6.178 ± 0.668
0.686GluCys: 0.686 ± 0.223
2.66GluAsp: 2.66 ± 0.451
3.175GluGlu: 3.175 ± 0.53
1.888GluPhe: 1.888 ± 0.565
3.432GluGly: 3.432 ± 0.543
1.287GluHis: 1.287 ± 0.395
3.604GluIle: 3.604 ± 0.574
3.861GluLys: 3.861 ± 0.606
6.778GluLeu: 6.778 ± 0.632
1.373GluMet: 1.373 ± 0.396
3.003GluAsn: 3.003 ± 0.568
1.888GluPro: 1.888 ± 0.37
2.746GluGln: 2.746 ± 0.667
3.089GluArg: 3.089 ± 0.509
4.033GluSer: 4.033 ± 0.711
4.204GluThr: 4.204 ± 0.538
3.775GluVal: 3.775 ± 0.417
0.772GluTrp: 0.772 ± 0.258
2.317GluTyr: 2.317 ± 0.513
0.0GluXaa: 0.0 ± 0.0
Phe
3.775PheAla: 3.775 ± 0.975
0.343PheCys: 0.343 ± 0.155
2.574PheAsp: 2.574 ± 0.423
1.373PheGlu: 1.373 ± 0.371
1.201PhePhe: 1.201 ± 0.305
2.831PheGly: 2.831 ± 0.461
0.686PheHis: 0.686 ± 0.214
3.175PheIle: 3.175 ± 0.676
2.231PheLys: 2.231 ± 0.441
2.231PheLeu: 2.231 ± 0.477
1.287PheMet: 1.287 ± 0.283
2.574PheAsn: 2.574 ± 0.675
0.944PhePro: 0.944 ± 0.261
1.287PheGln: 1.287 ± 0.29
2.574PheArg: 2.574 ± 0.503
4.376PheSer: 4.376 ± 0.94
4.033PheThr: 4.033 ± 0.602
2.145PheVal: 2.145 ± 0.359
0.515PheTrp: 0.515 ± 0.239
1.115PheTyr: 1.115 ± 0.337
0.0PheXaa: 0.0 ± 0.0
Gly
7.207GlyAla: 7.207 ± 0.798
2.402GlyCys: 2.402 ± 1.244
4.805GlyAsp: 4.805 ± 0.544
4.118GlyGlu: 4.118 ± 0.781
2.66GlyPhe: 2.66 ± 0.464
6.263GlyGly: 6.263 ± 1.278
0.772GlyHis: 0.772 ± 0.245
4.033GlyIle: 4.033 ± 0.675
3.947GlyLys: 3.947 ± 0.586
6.178GlyLeu: 6.178 ± 0.777
1.802GlyMet: 1.802 ± 0.387
2.831GlyAsn: 2.831 ± 0.402
1.63GlyPro: 1.63 ± 0.426
2.574GlyGln: 2.574 ± 0.447
3.432GlyArg: 3.432 ± 0.563
4.204GlySer: 4.204 ± 0.669
4.118GlyThr: 4.118 ± 0.841
4.805GlyVal: 4.805 ± 0.784
1.201GlyTrp: 1.201 ± 0.308
2.059GlyTyr: 2.059 ± 0.375
0.0GlyXaa: 0.0 ± 0.0
His
1.459HisAla: 1.459 ± 0.363
0.343HisCys: 0.343 ± 0.153
1.03HisAsp: 1.03 ± 0.31
1.115HisGlu: 1.115 ± 0.33
1.03HisPhe: 1.03 ± 0.351
1.459HisGly: 1.459 ± 0.304
0.257HisHis: 0.257 ± 0.145
1.03HisIle: 1.03 ± 0.315
0.858HisLys: 0.858 ± 0.297
2.231HisLeu: 2.231 ± 0.502
0.343HisMet: 0.343 ± 0.158
0.343HisAsn: 0.343 ± 0.145
0.429HisPro: 0.429 ± 0.188
0.515HisGln: 0.515 ± 0.217
0.772HisArg: 0.772 ± 0.263
2.145HisSer: 2.145 ± 0.35
0.601HisThr: 0.601 ± 0.242
0.772HisVal: 0.772 ± 0.236
0.257HisTrp: 0.257 ± 0.148
1.03HisTyr: 1.03 ± 0.273
0.0HisXaa: 0.0 ± 0.0
Ile
5.749IleAla: 5.749 ± 0.797
0.257IleCys: 0.257 ± 0.141
4.462IleAsp: 4.462 ± 0.774
3.861IleGlu: 3.861 ± 0.873
1.973IlePhe: 1.973 ± 0.485
3.947IleGly: 3.947 ± 0.834
0.858IleHis: 0.858 ± 0.215
2.059IleIle: 2.059 ± 0.622
2.574IleLys: 2.574 ± 0.491
3.689IleLeu: 3.689 ± 0.511
0.858IleMet: 0.858 ± 0.269
3.947IleAsn: 3.947 ± 0.727
2.317IlePro: 2.317 ± 0.42
1.287IleGln: 1.287 ± 0.358
3.947IleArg: 3.947 ± 0.603
4.805IleSer: 4.805 ± 0.655
4.376IleThr: 4.376 ± 0.635
3.26IleVal: 3.26 ± 0.472
0.601IleTrp: 0.601 ± 0.207
1.716IleTyr: 1.716 ± 0.435
0.0IleXaa: 0.0 ± 0.0
Lys
4.719LysAla: 4.719 ± 0.703
0.343LysCys: 0.343 ± 0.116
2.66LysAsp: 2.66 ± 0.537
3.175LysGlu: 3.175 ± 0.622
1.373LysPhe: 1.373 ± 0.39
2.488LysGly: 2.488 ± 0.403
0.858LysHis: 0.858 ± 0.33
3.003LysIle: 3.003 ± 0.553
4.633LysLys: 4.633 ± 0.624
4.976LysLeu: 4.976 ± 0.609
0.686LysMet: 0.686 ± 0.223
3.518LysAsn: 3.518 ± 0.548
2.059LysPro: 2.059 ± 0.454
2.145LysGln: 2.145 ± 0.452
3.861LysArg: 3.861 ± 0.573
2.917LysSer: 2.917 ± 0.6
4.204LysThr: 4.204 ± 0.562
2.831LysVal: 2.831 ± 0.518
0.515LysTrp: 0.515 ± 0.225
1.287LysTyr: 1.287 ± 0.38
0.0LysXaa: 0.0 ± 0.0
Leu
8.151LeuAla: 8.151 ± 0.934
1.201LeuCys: 1.201 ± 0.372
5.491LeuAsp: 5.491 ± 0.676
5.32LeuGlu: 5.32 ± 0.791
4.204LeuPhe: 4.204 ± 0.908
5.32LeuGly: 5.32 ± 1.119
2.059LeuHis: 2.059 ± 0.39
5.148LeuIle: 5.148 ± 0.759
5.405LeuLys: 5.405 ± 0.737
8.58LeuLeu: 8.58 ± 0.963
2.402LeuMet: 2.402 ± 0.346
4.204LeuAsn: 4.204 ± 0.643
4.462LeuPro: 4.462 ± 0.88
3.775LeuGln: 3.775 ± 0.612
6.692LeuArg: 6.692 ± 0.9
7.55LeuSer: 7.55 ± 1.091
7.036LeuThr: 7.036 ± 0.869
4.204LeuVal: 4.204 ± 0.808
0.686LeuTrp: 0.686 ± 0.242
2.059LeuTyr: 2.059 ± 0.537
0.0LeuXaa: 0.0 ± 0.0
Met
2.488MetAla: 2.488 ± 0.426
0.172MetCys: 0.172 ± 0.118
1.03MetAsp: 1.03 ± 0.224
0.772MetGlu: 0.772 ± 0.262
1.201MetPhe: 1.201 ± 0.311
1.63MetGly: 1.63 ± 0.414
0.601MetHis: 0.601 ± 0.229
1.03MetIle: 1.03 ± 0.371
1.802MetLys: 1.802 ± 0.395
1.802MetLeu: 1.802 ± 0.359
0.343MetMet: 0.343 ± 0.257
0.601MetAsn: 0.601 ± 0.223
0.858MetPro: 0.858 ± 0.327
0.772MetGln: 0.772 ± 0.271
0.686MetArg: 0.686 ± 0.299
1.63MetSer: 1.63 ± 0.355
1.459MetThr: 1.459 ± 0.396
0.944MetVal: 0.944 ± 0.245
0.343MetTrp: 0.343 ± 0.146
0.601MetTyr: 0.601 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
3.26AsnAla: 3.26 ± 0.564
0.944AsnCys: 0.944 ± 0.387
2.145AsnAsp: 2.145 ± 0.439
2.402AsnGlu: 2.402 ± 0.491
1.63AsnPhe: 1.63 ± 0.36
3.689AsnGly: 3.689 ± 0.518
1.115AsnHis: 1.115 ± 0.325
1.802AsnIle: 1.802 ± 0.39
1.888AsnLys: 1.888 ± 0.484
5.062AsnLeu: 5.062 ± 0.89
0.601AsnMet: 0.601 ± 0.264
2.746AsnAsn: 2.746 ± 0.471
2.66AsnPro: 2.66 ± 0.379
1.888AsnGln: 1.888 ± 0.323
2.574AsnArg: 2.574 ± 0.567
2.574AsnSer: 2.574 ± 0.645
1.973AsnThr: 1.973 ± 0.361
3.003AsnVal: 3.003 ± 0.483
0.686AsnTrp: 0.686 ± 0.199
1.03AsnTyr: 1.03 ± 0.288
0.0AsnXaa: 0.0 ± 0.0
Pro
4.547ProAla: 4.547 ± 0.735
0.429ProCys: 0.429 ± 0.215
3.089ProAsp: 3.089 ± 0.377
3.689ProGlu: 3.689 ± 0.682
1.115ProPhe: 1.115 ± 0.388
2.66ProGly: 2.66 ± 0.473
1.03ProHis: 1.03 ± 0.312
1.373ProIle: 1.373 ± 0.262
1.544ProLys: 1.544 ± 0.356
3.089ProLeu: 3.089 ± 0.43
0.858ProMet: 0.858 ± 0.284
1.03ProAsn: 1.03 ± 0.317
1.802ProPro: 1.802 ± 0.356
1.63ProGln: 1.63 ± 0.484
1.63ProArg: 1.63 ± 0.359
1.544ProSer: 1.544 ± 0.356
1.03ProThr: 1.03 ± 0.324
3.003ProVal: 3.003 ± 0.669
0.601ProTrp: 0.601 ± 0.226
1.544ProTyr: 1.544 ± 0.503
0.0ProXaa: 0.0 ± 0.0
Gln
3.26GlnAla: 3.26 ± 0.487
0.343GlnCys: 0.343 ± 0.185
1.544GlnAsp: 1.544 ± 0.303
2.66GlnGlu: 2.66 ± 0.528
0.944GlnPhe: 0.944 ± 0.236
1.716GlnGly: 1.716 ± 0.373
0.686GlnHis: 0.686 ± 0.271
3.432GlnIle: 3.432 ± 0.563
2.059GlnLys: 2.059 ± 0.475
4.462GlnLeu: 4.462 ± 0.836
0.686GlnMet: 0.686 ± 0.22
1.373GlnAsn: 1.373 ± 0.354
1.201GlnPro: 1.201 ± 0.306
2.402GlnGln: 2.402 ± 0.814
3.26GlnArg: 3.26 ± 0.614
2.574GlnSer: 2.574 ± 0.736
2.059GlnThr: 2.059 ± 0.411
1.802GlnVal: 1.802 ± 0.395
0.515GlnTrp: 0.515 ± 0.216
1.115GlnTyr: 1.115 ± 0.343
0.0GlnXaa: 0.0 ± 0.0
Arg
6.092ArgAla: 6.092 ± 0.806
0.429ArgCys: 0.429 ± 0.166
3.003ArgAsp: 3.003 ± 0.558
3.689ArgGlu: 3.689 ± 0.584
2.746ArgPhe: 2.746 ± 0.385
3.175ArgGly: 3.175 ± 0.514
1.373ArgHis: 1.373 ± 0.347
3.346ArgIle: 3.346 ± 0.595
3.775ArgLys: 3.775 ± 0.592
6.349ArgLeu: 6.349 ± 1.165
1.802ArgMet: 1.802 ± 0.383
2.66ArgAsn: 2.66 ± 0.518
2.059ArgPro: 2.059 ± 0.364
3.518ArgGln: 3.518 ± 0.601
4.719ArgArg: 4.719 ± 0.917
3.518ArgSer: 3.518 ± 0.43
3.346ArgThr: 3.346 ± 0.821
4.033ArgVal: 4.033 ± 0.781
1.287ArgTrp: 1.287 ± 0.344
1.888ArgTyr: 1.888 ± 0.39
0.0ArgXaa: 0.0 ± 0.0
Ser
7.036SerAla: 7.036 ± 0.808
0.858SerCys: 0.858 ± 0.403
4.891SerAsp: 4.891 ± 0.682
4.891SerGlu: 4.891 ± 0.773
2.574SerPhe: 2.574 ± 0.473
5.663SerGly: 5.663 ± 0.807
0.686SerHis: 0.686 ± 0.273
4.204SerIle: 4.204 ± 0.502
2.917SerLys: 2.917 ± 0.459
8.323SerLeu: 8.323 ± 1.163
1.03SerMet: 1.03 ± 0.243
3.003SerAsn: 3.003 ± 0.487
2.574SerPro: 2.574 ± 0.429
2.66SerGln: 2.66 ± 0.445
3.861SerArg: 3.861 ± 0.635
4.118SerSer: 4.118 ± 0.622
3.432SerThr: 3.432 ± 0.497
4.633SerVal: 4.633 ± 0.548
0.944SerTrp: 0.944 ± 0.235
1.888SerTyr: 1.888 ± 0.445
0.0SerXaa: 0.0 ± 0.0
Thr
6.95ThrAla: 6.95 ± 1.156
0.429ThrCys: 0.429 ± 0.212
3.861ThrAsp: 3.861 ± 0.644
3.947ThrGlu: 3.947 ± 0.465
3.518ThrPhe: 3.518 ± 1.032
5.92ThrGly: 5.92 ± 0.787
1.63ThrHis: 1.63 ± 0.34
3.003ThrIle: 3.003 ± 0.539
2.317ThrLys: 2.317 ± 0.55
5.577ThrLeu: 5.577 ± 0.78
0.772ThrMet: 0.772 ± 0.239
1.888ThrAsn: 1.888 ± 0.534
2.66ThrPro: 2.66 ± 0.383
1.544ThrGln: 1.544 ± 0.437
4.29ThrArg: 4.29 ± 0.597
4.633ThrSer: 4.633 ± 0.682
4.29ThrThr: 4.29 ± 0.766
4.547ThrVal: 4.547 ± 0.823
0.772ThrTrp: 0.772 ± 0.287
1.888ThrTyr: 1.888 ± 0.463
0.0ThrXaa: 0.0 ± 0.0
Val
3.861ValAla: 3.861 ± 0.53
0.686ValCys: 0.686 ± 0.253
3.003ValAsp: 3.003 ± 0.443
3.432ValGlu: 3.432 ± 0.591
3.26ValPhe: 3.26 ± 0.636
3.26ValGly: 3.26 ± 0.522
0.686ValHis: 0.686 ± 0.2
4.204ValIle: 4.204 ± 0.53
3.175ValLys: 3.175 ± 0.55
4.719ValLeu: 4.719 ± 0.737
1.716ValMet: 1.716 ± 0.382
2.317ValAsn: 2.317 ± 0.503
2.746ValPro: 2.746 ± 0.49
1.287ValGln: 1.287 ± 0.297
4.033ValArg: 4.033 ± 0.647
4.976ValSer: 4.976 ± 0.568
6.178ValThr: 6.178 ± 1.181
3.947ValVal: 3.947 ± 0.555
0.858ValTrp: 0.858 ± 0.241
1.973ValTyr: 1.973 ± 0.38
0.0ValXaa: 0.0 ± 0.0
Trp
0.772TrpAla: 0.772 ± 0.252
0.257TrpCys: 0.257 ± 0.138
0.686TrpAsp: 0.686 ± 0.248
0.772TrpGlu: 0.772 ± 0.259
1.03TrpPhe: 1.03 ± 0.29
0.686TrpGly: 0.686 ± 0.223
0.686TrpHis: 0.686 ± 0.216
0.858TrpIle: 0.858 ± 0.366
1.03TrpLys: 1.03 ± 0.325
1.115TrpLeu: 1.115 ± 0.315
0.172TrpMet: 0.172 ± 0.12
0.772TrpAsn: 0.772 ± 0.204
0.601TrpPro: 0.601 ± 0.184
0.858TrpGln: 0.858 ± 0.309
1.287TrpArg: 1.287 ± 0.358
0.686TrpSer: 0.686 ± 0.316
0.515TrpThr: 0.515 ± 0.242
0.515TrpVal: 0.515 ± 0.179
0.257TrpTrp: 0.257 ± 0.155
0.257TrpTyr: 0.257 ± 0.184
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.175TyrAla: 3.175 ± 0.664
0.858TyrCys: 0.858 ± 0.42
1.544TyrAsp: 1.544 ± 0.324
1.544TyrGlu: 1.544 ± 0.316
0.858TyrPhe: 0.858 ± 0.283
2.746TyrGly: 2.746 ± 0.568
0.515TyrHis: 0.515 ± 0.206
1.716TyrIle: 1.716 ± 0.606
0.686TyrLys: 0.686 ± 0.267
2.317TyrLeu: 2.317 ± 0.455
0.343TyrMet: 0.343 ± 0.162
1.287TyrAsn: 1.287 ± 0.308
1.115TyrPro: 1.115 ± 0.353
1.716TyrGln: 1.716 ± 0.417
1.716TyrArg: 1.716 ± 0.382
2.317TyrSer: 2.317 ± 0.384
1.973TyrThr: 1.973 ± 0.382
1.802TyrVal: 1.802 ± 0.444
0.686TyrTrp: 0.686 ± 0.206
0.858TyrTyr: 0.858 ± 0.343
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (11656 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski