Amino acid dipepetide frequency for Klebsiella phage ST13-OXA48phi12.3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.8AlaAla: 8.8 ± 1.176
1.009AlaCys: 1.009 ± 0.205
6.216AlaAsp: 6.216 ± 0.491
6.418AlaGlu: 6.418 ± 0.757
3.229AlaPhe: 3.229 ± 0.474
7.185AlaGly: 7.185 ± 0.667
1.292AlaHis: 1.292 ± 0.236
4.4AlaIle: 4.4 ± 0.381
6.216AlaLys: 6.216 ± 0.476
8.033AlaLeu: 8.033 ± 0.554
2.826AlaMet: 2.826 ± 0.368
3.512AlaAsn: 3.512 ± 0.533
2.341AlaPro: 2.341 ± 0.25
3.633AlaGln: 3.633 ± 0.475
5.126AlaArg: 5.126 ± 0.489
6.135AlaSer: 6.135 ± 0.594
5.328AlaThr: 5.328 ± 0.704
5.49AlaVal: 5.49 ± 0.486
0.888AlaTrp: 0.888 ± 0.183
2.745AlaTyr: 2.745 ± 0.361
0.0AlaXaa: 0.0 ± 0.0
Cys
0.727CysAla: 0.727 ± 0.181
0.161CysCys: 0.161 ± 0.09
0.605CysAsp: 0.605 ± 0.187
0.848CysGlu: 0.848 ± 0.231
0.323CysPhe: 0.323 ± 0.15
0.727CysGly: 0.727 ± 0.21
0.242CysHis: 0.242 ± 0.113
0.686CysIle: 0.686 ± 0.192
0.727CysLys: 0.727 ± 0.191
0.646CysLeu: 0.646 ± 0.209
0.323CysMet: 0.323 ± 0.113
0.444CysAsn: 0.444 ± 0.128
0.323CysPro: 0.323 ± 0.093
0.565CysGln: 0.565 ± 0.171
0.605CysArg: 0.605 ± 0.167
0.484CysSer: 0.484 ± 0.147
0.444CysThr: 0.444 ± 0.152
0.646CysVal: 0.646 ± 0.167
0.081CysTrp: 0.081 ± 0.058
0.081CysTyr: 0.081 ± 0.057
0.0CysXaa: 0.0 ± 0.0
Asp
5.934AspAla: 5.934 ± 0.599
0.484AspCys: 0.484 ± 0.142
4.198AspAsp: 4.198 ± 0.549
5.247AspGlu: 5.247 ± 0.665
2.785AspPhe: 2.785 ± 0.276
4.682AspGly: 4.682 ± 0.416
0.767AspHis: 0.767 ± 0.174
3.592AspIle: 3.592 ± 0.424
3.875AspLys: 3.875 ± 0.483
5.974AspLeu: 5.974 ± 0.4
1.816AspMet: 1.816 ± 0.287
2.059AspAsn: 2.059 ± 0.423
1.938AspPro: 1.938 ± 0.321
2.503AspGln: 2.503 ± 0.317
2.543AspArg: 2.543 ± 0.338
3.835AspSer: 3.835 ± 0.474
3.31AspThr: 3.31 ± 0.396
3.714AspVal: 3.714 ± 0.441
0.686AspTrp: 0.686 ± 0.149
1.615AspTyr: 1.615 ± 0.283
0.0AspXaa: 0.0 ± 0.0
Glu
6.176GluAla: 6.176 ± 0.561
0.686GluCys: 0.686 ± 0.201
3.754GluAsp: 3.754 ± 0.661
5.328GluGlu: 5.328 ± 0.762
2.866GluPhe: 2.866 ± 0.417
3.512GluGly: 3.512 ± 0.423
1.494GluHis: 1.494 ± 0.286
4.521GluIle: 4.521 ± 0.458
4.723GluLys: 4.723 ± 0.585
7.225GluLeu: 7.225 ± 0.91
2.018GluMet: 2.018 ± 0.368
2.624GluAsn: 2.624 ± 0.298
2.18GluPro: 2.18 ± 0.275
2.704GluGln: 2.704 ± 0.314
4.602GluArg: 4.602 ± 0.535
5.49GluSer: 5.49 ± 0.433
3.31GluThr: 3.31 ± 0.339
3.754GluVal: 3.754 ± 0.47
1.009GluTrp: 1.009 ± 0.228
1.978GluTyr: 1.978 ± 0.284
0.0GluXaa: 0.0 ± 0.0
Phe
2.947PheAla: 2.947 ± 0.428
0.444PheCys: 0.444 ± 0.109
3.35PheAsp: 3.35 ± 0.377
2.26PheGlu: 2.26 ± 0.317
0.969PhePhe: 0.969 ± 0.206
2.947PheGly: 2.947 ± 0.318
0.525PheHis: 0.525 ± 0.169
1.897PheIle: 1.897 ± 0.33
2.422PheLys: 2.422 ± 0.321
2.543PheLeu: 2.543 ± 0.354
1.13PheMet: 1.13 ± 0.215
1.857PheAsn: 1.857 ± 0.215
1.615PhePro: 1.615 ± 0.341
1.251PheGln: 1.251 ± 0.213
1.655PheArg: 1.655 ± 0.202
3.027PheSer: 3.027 ± 0.362
2.22PheThr: 2.22 ± 0.37
2.22PheVal: 2.22 ± 0.294
0.404PheTrp: 0.404 ± 0.121
1.372PheTyr: 1.372 ± 0.305
0.0PheXaa: 0.0 ± 0.0
Gly
5.49GlyAla: 5.49 ± 0.504
0.767GlyCys: 0.767 ± 0.184
4.481GlyAsp: 4.481 ± 0.496
5.005GlyGlu: 5.005 ± 0.424
3.229GlyPhe: 3.229 ± 0.403
3.996GlyGly: 3.996 ± 0.462
1.009GlyHis: 1.009 ± 0.267
3.633GlyIle: 3.633 ± 0.61
4.521GlyLys: 4.521 ± 0.442
5.611GlyLeu: 5.611 ± 0.54
2.059GlyMet: 2.059 ± 0.278
3.31GlyAsn: 3.31 ± 0.708
1.009GlyPro: 1.009 ± 0.227
2.26GlyGln: 2.26 ± 0.338
3.633GlyArg: 3.633 ± 0.358
4.117GlySer: 4.117 ± 0.413
3.592GlyThr: 3.592 ± 0.531
5.974GlyVal: 5.974 ± 0.542
0.767GlyTrp: 0.767 ± 0.251
2.462GlyTyr: 2.462 ± 0.336
0.0GlyXaa: 0.0 ± 0.0
His
1.13HisAla: 1.13 ± 0.204
0.161HisCys: 0.161 ± 0.095
0.888HisAsp: 0.888 ± 0.245
1.332HisGlu: 1.332 ± 0.248
1.009HisPhe: 1.009 ± 0.234
1.009HisGly: 1.009 ± 0.235
0.323HisHis: 0.323 ± 0.173
1.009HisIle: 1.009 ± 0.215
1.049HisLys: 1.049 ± 0.248
1.534HisLeu: 1.534 ± 0.28
0.605HisMet: 0.605 ± 0.17
0.727HisAsn: 0.727 ± 0.165
0.807HisPro: 0.807 ± 0.185
0.807HisGln: 0.807 ± 0.227
0.969HisArg: 0.969 ± 0.223
1.049HisSer: 1.049 ± 0.223
0.807HisThr: 0.807 ± 0.178
0.767HisVal: 0.767 ± 0.211
0.242HisTrp: 0.242 ± 0.11
0.565HisTyr: 0.565 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
4.803IleAla: 4.803 ± 0.418
0.646IleCys: 0.646 ± 0.185
3.915IleAsp: 3.915 ± 0.431
4.279IleGlu: 4.279 ± 0.476
1.413IlePhe: 1.413 ± 0.216
3.754IleGly: 3.754 ± 0.383
0.969IleHis: 0.969 ± 0.259
2.422IleIle: 2.422 ± 0.316
3.794IleLys: 3.794 ± 0.37
3.714IleLeu: 3.714 ± 0.379
1.211IleMet: 1.211 ± 0.228
2.866IleAsn: 2.866 ± 0.348
2.462IlePro: 2.462 ± 0.328
2.018IleGln: 2.018 ± 0.283
3.068IleArg: 3.068 ± 0.311
4.521IleSer: 4.521 ± 0.433
3.996IleThr: 3.996 ± 0.891
3.915IleVal: 3.915 ± 0.413
0.525IleTrp: 0.525 ± 0.129
1.776IleTyr: 1.776 ± 0.392
0.0IleXaa: 0.0 ± 0.0
Lys
6.943LysAla: 6.943 ± 0.689
0.686LysCys: 0.686 ± 0.138
3.431LysAsp: 3.431 ± 0.331
5.005LysGlu: 5.005 ± 0.583
1.615LysPhe: 1.615 ± 0.238
3.229LysGly: 3.229 ± 0.35
1.372LysHis: 1.372 ± 0.26
4.158LysIle: 4.158 ± 0.379
3.794LysLys: 3.794 ± 0.439
5.369LysLeu: 5.369 ± 0.602
2.059LysMet: 2.059 ± 0.321
3.189LysAsn: 3.189 ± 0.409
2.341LysPro: 2.341 ± 0.366
2.382LysGln: 2.382 ± 0.319
3.754LysArg: 3.754 ± 0.39
3.875LysSer: 3.875 ± 0.406
3.835LysThr: 3.835 ± 0.395
4.723LysVal: 4.723 ± 0.439
0.525LysTrp: 0.525 ± 0.137
1.615LysTyr: 1.615 ± 0.232
0.0LysXaa: 0.0 ± 0.0
Leu
7.952LeuAla: 7.952 ± 0.745
1.049LeuCys: 1.049 ± 0.251
5.167LeuAsp: 5.167 ± 0.473
5.772LeuGlu: 5.772 ± 0.637
2.503LeuPhe: 2.503 ± 0.318
5.409LeuGly: 5.409 ± 0.539
1.13LeuHis: 1.13 ± 0.274
5.046LeuIle: 5.046 ± 0.374
5.611LeuLys: 5.611 ± 0.36
5.934LeuLeu: 5.934 ± 0.564
3.148LeuMet: 3.148 ± 0.401
3.754LeuAsn: 3.754 ± 0.431
3.31LeuPro: 3.31 ± 0.38
2.906LeuGln: 2.906 ± 0.355
4.723LeuArg: 4.723 ± 0.606
7.548LeuSer: 7.548 ± 0.628
5.409LeuThr: 5.409 ± 0.568
5.57LeuVal: 5.57 ± 0.561
0.969LeuTrp: 0.969 ± 0.242
1.736LeuTyr: 1.736 ± 0.298
0.0LeuXaa: 0.0 ± 0.0
Met
2.826MetAla: 2.826 ± 0.383
0.242MetCys: 0.242 ± 0.116
1.211MetAsp: 1.211 ± 0.241
2.059MetGlu: 2.059 ± 0.338
0.848MetPhe: 0.848 ± 0.2
1.534MetGly: 1.534 ± 0.37
0.363MetHis: 0.363 ± 0.137
1.736MetIle: 1.736 ± 0.227
1.857MetLys: 1.857 ± 0.293
2.583MetLeu: 2.583 ± 0.321
1.09MetMet: 1.09 ± 0.274
1.695MetAsn: 1.695 ± 0.269
1.453MetPro: 1.453 ± 0.293
1.009MetGln: 1.009 ± 0.23
1.655MetArg: 1.655 ± 0.252
2.826MetSer: 2.826 ± 0.323
2.18MetThr: 2.18 ± 0.328
1.897MetVal: 1.897 ± 0.329
0.283MetTrp: 0.283 ± 0.097
0.646MetTyr: 0.646 ± 0.168
0.0MetXaa: 0.0 ± 0.0
Asn
4.602AsnAla: 4.602 ± 0.946
0.283AsnCys: 0.283 ± 0.094
2.301AsnAsp: 2.301 ± 0.369
2.664AsnGlu: 2.664 ± 0.332
1.413AsnPhe: 1.413 ± 0.248
3.552AsnGly: 3.552 ± 0.362
0.605AsnHis: 0.605 ± 0.176
2.341AsnIle: 2.341 ± 0.371
2.987AsnLys: 2.987 ± 0.315
3.431AsnLeu: 3.431 ± 0.481
1.251AsnMet: 1.251 ± 0.194
1.938AsnAsn: 1.938 ± 0.514
1.938AsnPro: 1.938 ± 0.254
1.534AsnGln: 1.534 ± 0.335
2.382AsnArg: 2.382 ± 0.351
3.835AsnSer: 3.835 ± 1.022
2.382AsnThr: 2.382 ± 0.333
3.592AsnVal: 3.592 ± 0.428
0.565AsnTrp: 0.565 ± 0.137
1.413AsnTyr: 1.413 ± 0.241
0.0AsnXaa: 0.0 ± 0.0
Pro
2.866ProAla: 2.866 ± 0.336
0.283ProCys: 0.283 ± 0.113
2.382ProAsp: 2.382 ± 0.293
3.27ProGlu: 3.27 ± 0.414
1.655ProPhe: 1.655 ± 0.256
2.382ProGly: 2.382 ± 0.316
0.848ProHis: 0.848 ± 0.186
1.695ProIle: 1.695 ± 0.291
2.18ProLys: 2.18 ± 0.307
2.26ProLeu: 2.26 ± 0.342
1.009ProMet: 1.009 ± 0.196
1.332ProAsn: 1.332 ± 0.245
0.928ProPro: 0.928 ± 0.214
1.049ProGln: 1.049 ± 0.288
1.615ProArg: 1.615 ± 0.327
1.938ProSer: 1.938 ± 0.246
2.341ProThr: 2.341 ± 0.298
2.947ProVal: 2.947 ± 0.428
0.444ProTrp: 0.444 ± 0.155
1.292ProTyr: 1.292 ± 0.254
0.0ProXaa: 0.0 ± 0.0
Gln
3.27GlnAla: 3.27 ± 0.375
0.121GlnCys: 0.121 ± 0.084
1.816GlnAsp: 1.816 ± 0.328
1.978GlnGlu: 1.978 ± 0.298
1.655GlnPhe: 1.655 ± 0.252
2.301GlnGly: 2.301 ± 0.477
0.565GlnHis: 0.565 ± 0.207
2.624GlnIle: 2.624 ± 0.391
2.503GlnLys: 2.503 ± 0.39
3.714GlnLeu: 3.714 ± 0.397
1.695GlnMet: 1.695 ± 0.365
1.938GlnAsn: 1.938 ± 0.321
1.009GlnPro: 1.009 ± 0.189
1.655GlnGln: 1.655 ± 0.335
2.18GlnArg: 2.18 ± 0.302
2.22GlnSer: 2.22 ± 0.438
2.543GlnThr: 2.543 ± 0.434
2.099GlnVal: 2.099 ± 0.292
0.444GlnTrp: 0.444 ± 0.132
0.928GlnTyr: 0.928 ± 0.224
0.0GlnXaa: 0.0 ± 0.0
Arg
5.49ArgAla: 5.49 ± 0.496
0.323ArgCys: 0.323 ± 0.103
3.592ArgAsp: 3.592 ± 0.361
4.077ArgGlu: 4.077 ± 0.512
1.938ArgPhe: 1.938 ± 0.283
3.835ArgGly: 3.835 ± 0.317
1.372ArgHis: 1.372 ± 0.284
3.068ArgIle: 3.068 ± 0.483
3.673ArgLys: 3.673 ± 0.406
5.207ArgLeu: 5.207 ± 0.619
1.494ArgMet: 1.494 ± 0.309
2.382ArgAsn: 2.382 ± 0.38
2.059ArgPro: 2.059 ± 0.299
1.938ArgGln: 1.938 ± 0.281
2.866ArgArg: 2.866 ± 0.48
2.987ArgSer: 2.987 ± 0.35
2.543ArgThr: 2.543 ± 0.362
3.714ArgVal: 3.714 ± 0.479
0.565ArgTrp: 0.565 ± 0.168
1.938ArgTyr: 1.938 ± 0.345
0.0ArgXaa: 0.0 ± 0.0
Ser
5.57SerAla: 5.57 ± 0.669
0.727SerCys: 0.727 ± 0.253
3.794SerAsp: 3.794 ± 0.511
4.279SerGlu: 4.279 ± 0.408
2.987SerPhe: 2.987 ± 0.4
5.813SerGly: 5.813 ± 0.705
1.211SerHis: 1.211 ± 0.236
4.481SerIle: 4.481 ± 0.566
3.714SerLys: 3.714 ± 0.335
7.145SerLeu: 7.145 ± 0.603
1.736SerMet: 1.736 ± 0.29
3.673SerAsn: 3.673 ± 0.631
2.422SerPro: 2.422 ± 0.331
2.745SerGln: 2.745 ± 0.602
3.35SerArg: 3.35 ± 0.343
6.014SerSer: 6.014 ± 1.034
4.521SerThr: 4.521 ± 0.676
5.288SerVal: 5.288 ± 0.57
0.686SerTrp: 0.686 ± 0.183
1.534SerTyr: 1.534 ± 0.237
0.0SerXaa: 0.0 ± 0.0
Thr
5.53ThrAla: 5.53 ± 0.668
0.404ThrCys: 0.404 ± 0.11
2.906ThrAsp: 2.906 ± 0.419
3.148ThrGlu: 3.148 ± 0.356
2.462ThrPhe: 2.462 ± 0.306
3.714ThrGly: 3.714 ± 0.506
0.848ThrHis: 0.848 ± 0.192
3.148ThrIle: 3.148 ± 0.376
3.673ThrLys: 3.673 ± 0.458
5.611ThrLeu: 5.611 ± 0.476
1.574ThrMet: 1.574 ± 0.222
2.826ThrAsn: 2.826 ± 0.597
2.826ThrPro: 2.826 ± 0.314
1.857ThrGln: 1.857 ± 0.317
2.866ThrArg: 2.866 ± 0.376
4.965ThrSer: 4.965 ± 1.121
4.036ThrThr: 4.036 ± 0.59
4.723ThrVal: 4.723 ± 0.62
0.848ThrTrp: 0.848 ± 0.312
1.736ThrTyr: 1.736 ± 0.329
0.0ThrXaa: 0.0 ± 0.0
Val
6.62ValAla: 6.62 ± 0.563
0.646ValCys: 0.646 ± 0.163
4.965ValAsp: 4.965 ± 0.386
4.844ValGlu: 4.844 ± 0.417
2.583ValPhe: 2.583 ± 0.343
4.481ValGly: 4.481 ± 0.402
0.888ValHis: 0.888 ± 0.252
3.592ValIle: 3.592 ± 0.341
4.561ValLys: 4.561 ± 0.42
4.561ValLeu: 4.561 ± 0.55
1.938ValMet: 1.938 ± 0.307
3.391ValAsn: 3.391 ± 0.518
2.26ValPro: 2.26 ± 0.237
2.26ValGln: 2.26 ± 0.291
3.714ValArg: 3.714 ± 0.493
4.803ValSer: 4.803 ± 0.504
4.803ValThr: 4.803 ± 0.795
4.077ValVal: 4.077 ± 0.384
1.009ValTrp: 1.009 ± 0.21
2.462ValTyr: 2.462 ± 0.281
0.0ValXaa: 0.0 ± 0.0
Trp
0.928TrpAla: 0.928 ± 0.197
0.121TrpCys: 0.121 ± 0.06
0.686TrpAsp: 0.686 ± 0.168
0.444TrpGlu: 0.444 ± 0.161
0.242TrpPhe: 0.242 ± 0.094
0.727TrpGly: 0.727 ± 0.18
0.363TrpHis: 0.363 ± 0.111
0.727TrpIle: 0.727 ± 0.161
0.767TrpLys: 0.767 ± 0.227
0.848TrpLeu: 0.848 ± 0.197
0.444TrpMet: 0.444 ± 0.148
0.404TrpAsn: 0.404 ± 0.15
0.202TrpPro: 0.202 ± 0.082
0.565TrpGln: 0.565 ± 0.147
1.049TrpArg: 1.049 ± 0.187
0.686TrpSer: 0.686 ± 0.186
0.686TrpThr: 0.686 ± 0.191
0.969TrpVal: 0.969 ± 0.201
0.04TrpTrp: 0.04 ± 0.038
0.484TrpTyr: 0.484 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.22TyrAla: 2.22 ± 0.414
0.444TyrCys: 0.444 ± 0.148
1.897TyrAsp: 1.897 ± 0.309
1.736TyrGlu: 1.736 ± 0.279
1.332TyrPhe: 1.332 ± 0.295
2.139TyrGly: 2.139 ± 0.322
0.605TyrHis: 0.605 ± 0.189
1.292TyrIle: 1.292 ± 0.184
1.251TyrLys: 1.251 ± 0.196
2.704TyrLeu: 2.704 ± 0.383
0.525TyrMet: 0.525 ± 0.127
1.09TyrAsn: 1.09 ± 0.211
1.292TyrPro: 1.292 ± 0.284
1.615TyrGln: 1.615 ± 0.247
2.704TyrArg: 2.704 ± 0.325
1.332TyrSer: 1.332 ± 0.229
1.413TyrThr: 1.413 ± 0.257
2.382TyrVal: 2.382 ± 0.308
0.404TyrTrp: 0.404 ± 0.131
0.969TyrTyr: 0.969 ± 0.27
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (24775 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski