Amino acid dipepetide frequency for Klebsiella phage KP32_isolate 194

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.135AlaAla: 8.135 ± 0.88
0.814AlaCys: 0.814 ± 0.243
6.183AlaAsp: 6.183 ± 0.973
5.613AlaGlu: 5.613 ± 0.728
3.173AlaPhe: 3.173 ± 0.447
7.078AlaGly: 7.078 ± 1.019
1.302AlaHis: 1.302 ± 0.314
4.23AlaIle: 4.23 ± 0.598
6.264AlaLys: 6.264 ± 0.554
7.647AlaLeu: 7.647 ± 1.056
2.685AlaMet: 2.685 ± 0.631
4.637AlaAsn: 4.637 ± 0.404
2.522AlaPro: 2.522 ± 0.442
3.742AlaGln: 3.742 ± 0.737
5.125AlaArg: 5.125 ± 0.635
5.613AlaSer: 5.613 ± 0.805
3.986AlaThr: 3.986 ± 0.448
5.695AlaVal: 5.695 ± 0.57
1.302AlaTrp: 1.302 ± 0.358
2.847AlaTyr: 2.847 ± 0.42
0.0AlaXaa: 0.0 ± 0.0
Cys
0.569CysAla: 0.569 ± 0.201
0.081CysCys: 0.081 ± 0.1
0.569CysAsp: 0.569 ± 0.323
0.814CysGlu: 0.814 ± 0.305
0.325CysPhe: 0.325 ± 0.192
0.814CysGly: 0.814 ± 0.238
0.244CysHis: 0.244 ± 0.149
0.732CysIle: 0.732 ± 0.238
0.325CysLys: 0.325 ± 0.157
0.814CysLeu: 0.814 ± 0.256
0.163CysMet: 0.163 ± 0.12
0.244CysAsn: 0.244 ± 0.147
0.488CysPro: 0.488 ± 0.184
0.488CysGln: 0.488 ± 0.223
0.651CysArg: 0.651 ± 0.284
0.732CysSer: 0.732 ± 0.244
0.569CysThr: 0.569 ± 0.282
0.895CysVal: 0.895 ± 0.246
0.163CysTrp: 0.163 ± 0.133
0.325CysTyr: 0.325 ± 0.185
0.0CysXaa: 0.0 ± 0.0
Asp
5.044AspAla: 5.044 ± 0.578
0.488AspCys: 0.488 ± 0.271
4.393AspAsp: 4.393 ± 0.484
3.905AspGlu: 3.905 ± 0.529
2.522AspPhe: 2.522 ± 0.533
6.264AspGly: 6.264 ± 0.588
0.976AspHis: 0.976 ± 0.295
2.522AspIle: 2.522 ± 0.433
3.986AspLys: 3.986 ± 0.732
4.23AspLeu: 4.23 ± 0.637
2.278AspMet: 2.278 ± 0.42
2.441AspAsn: 2.441 ± 0.553
2.766AspPro: 2.766 ± 0.412
2.522AspGln: 2.522 ± 0.534
3.173AspArg: 3.173 ± 0.562
3.661AspSer: 3.661 ± 0.45
4.312AspThr: 4.312 ± 0.514
3.905AspVal: 3.905 ± 0.469
0.895AspTrp: 0.895 ± 0.388
2.522AspTyr: 2.522 ± 0.456
0.0AspXaa: 0.0 ± 0.0
Glu
7.566GluAla: 7.566 ± 0.907
0.651GluCys: 0.651 ± 0.255
4.393GluAsp: 4.393 ± 0.738
5.044GluGlu: 5.044 ± 0.86
2.685GluPhe: 2.685 ± 0.451
5.776GluGly: 5.776 ± 0.821
1.546GluHis: 1.546 ± 0.479
2.766GluIle: 2.766 ± 0.399
3.498GluLys: 3.498 ± 0.72
5.857GluLeu: 5.857 ± 0.761
1.383GluMet: 1.383 ± 0.561
2.278GluAsn: 2.278 ± 0.423
2.278GluPro: 2.278 ± 0.672
3.091GluGln: 3.091 ± 0.802
4.068GluArg: 4.068 ± 0.673
4.23GluSer: 4.23 ± 0.557
3.254GluThr: 3.254 ± 0.65
4.8GluVal: 4.8 ± 0.542
0.569GluTrp: 0.569 ± 0.185
2.766GluTyr: 2.766 ± 0.356
0.0GluXaa: 0.0 ± 0.0
Phe
2.522PheAla: 2.522 ± 0.418
0.488PheCys: 0.488 ± 0.19
2.685PheAsp: 2.685 ± 0.493
1.79PheGlu: 1.79 ± 0.407
0.814PhePhe: 0.814 ± 0.223
3.01PheGly: 3.01 ± 0.612
0.651PheHis: 0.651 ± 0.269
2.034PheIle: 2.034 ± 0.526
2.603PheLys: 2.603 ± 0.457
3.091PheLeu: 3.091 ± 0.512
0.976PheMet: 0.976 ± 0.267
2.197PheAsn: 2.197 ± 0.367
1.627PhePro: 1.627 ± 0.405
1.302PheGln: 1.302 ± 0.321
2.278PheArg: 2.278 ± 0.452
2.441PheSer: 2.441 ± 0.556
2.441PheThr: 2.441 ± 0.379
2.278PheVal: 2.278 ± 0.424
0.244PheTrp: 0.244 ± 0.115
1.139PheTyr: 1.139 ± 0.22
0.0PheXaa: 0.0 ± 0.0
Gly
7.729GlyAla: 7.729 ± 1.284
0.895GlyCys: 0.895 ± 0.268
5.288GlyAsp: 5.288 ± 0.524
5.613GlyGlu: 5.613 ± 0.67
3.01GlyPhe: 3.01 ± 0.368
6.346GlyGly: 6.346 ± 0.774
1.546GlyHis: 1.546 ± 0.421
4.8GlyIle: 4.8 ± 0.872
5.695GlyLys: 5.695 ± 0.895
6.427GlyLeu: 6.427 ± 0.703
2.034GlyMet: 2.034 ± 0.409
3.254GlyAsn: 3.254 ± 0.474
1.627GlyPro: 1.627 ± 0.463
2.603GlyGln: 2.603 ± 0.428
4.393GlyArg: 4.393 ± 0.425
6.102GlySer: 6.102 ± 0.69
4.556GlyThr: 4.556 ± 0.793
5.044GlyVal: 5.044 ± 0.746
1.79GlyTrp: 1.79 ± 0.387
2.929GlyTyr: 2.929 ± 0.491
0.0GlyXaa: 0.0 ± 0.0
His
1.22HisAla: 1.22 ± 0.271
0.488HisCys: 0.488 ± 0.146
1.058HisAsp: 1.058 ± 0.25
1.464HisGlu: 1.464 ± 0.467
0.732HisPhe: 0.732 ± 0.231
1.79HisGly: 1.79 ± 0.406
0.569HisHis: 0.569 ± 0.227
1.302HisIle: 1.302 ± 0.354
1.383HisLys: 1.383 ± 0.28
1.058HisLeu: 1.058 ± 0.335
0.651HisMet: 0.651 ± 0.191
0.407HisAsn: 0.407 ± 0.154
0.814HisPro: 0.814 ± 0.224
0.569HisGln: 0.569 ± 0.229
0.569HisArg: 0.569 ± 0.195
0.732HisSer: 0.732 ± 0.261
0.895HisThr: 0.895 ± 0.205
1.383HisVal: 1.383 ± 0.315
0.163HisTrp: 0.163 ± 0.102
1.22HisTyr: 1.22 ± 0.274
0.0HisXaa: 0.0 ± 0.0
Ile
4.474IleAla: 4.474 ± 0.521
0.569IleCys: 0.569 ± 0.183
3.498IleAsp: 3.498 ± 0.592
2.929IleGlu: 2.929 ± 0.503
0.895IlePhe: 0.895 ± 0.302
3.824IleGly: 3.824 ± 0.627
0.895IleHis: 0.895 ± 0.263
2.603IleIle: 2.603 ± 0.472
3.173IleLys: 3.173 ± 0.472
3.58IleLeu: 3.58 ± 0.466
1.058IleMet: 1.058 ± 0.343
2.359IleAsn: 2.359 ± 0.523
2.685IlePro: 2.685 ± 0.512
1.871IleGln: 1.871 ± 0.468
3.742IleArg: 3.742 ± 0.619
2.847IleSer: 2.847 ± 0.529
2.522IleThr: 2.522 ± 0.395
2.929IleVal: 2.929 ± 0.48
0.407IleTrp: 0.407 ± 0.184
1.871IleTyr: 1.871 ± 0.352
0.0IleXaa: 0.0 ± 0.0
Lys
7.566LysAla: 7.566 ± 1.001
0.569LysCys: 0.569 ± 0.231
3.58LysAsp: 3.58 ± 0.573
5.125LysGlu: 5.125 ± 0.688
2.522LysPhe: 2.522 ± 0.428
6.264LysGly: 6.264 ± 1.159
1.627LysHis: 1.627 ± 0.358
2.197LysIle: 2.197 ± 0.49
3.336LysLys: 3.336 ± 0.933
5.451LysLeu: 5.451 ± 0.756
1.627LysMet: 1.627 ± 0.357
2.522LysAsn: 2.522 ± 0.438
2.522LysPro: 2.522 ± 0.589
2.197LysGln: 2.197 ± 0.422
3.254LysArg: 3.254 ± 0.735
3.58LysSer: 3.58 ± 0.498
3.091LysThr: 3.091 ± 0.476
5.451LysVal: 5.451 ± 0.732
0.814LysTrp: 0.814 ± 0.304
1.546LysTyr: 1.546 ± 0.381
0.0LysXaa: 0.0 ± 0.0
Leu
7.403LeuAla: 7.403 ± 0.936
0.407LeuCys: 0.407 ± 0.178
4.881LeuAsp: 4.881 ± 0.483
6.59LeuGlu: 6.59 ± 0.978
2.603LeuPhe: 2.603 ± 0.514
5.125LeuGly: 5.125 ± 0.664
1.058LeuHis: 1.058 ± 0.307
3.417LeuIle: 3.417 ± 0.505
6.102LeuLys: 6.102 ± 0.715
6.346LeuLeu: 6.346 ± 0.977
2.197LeuMet: 2.197 ± 0.338
4.312LeuAsn: 4.312 ± 0.604
3.091LeuPro: 3.091 ± 0.481
3.417LeuGln: 3.417 ± 0.511
5.125LeuArg: 5.125 ± 0.586
4.312LeuSer: 4.312 ± 0.576
4.719LeuThr: 4.719 ± 0.732
4.8LeuVal: 4.8 ± 0.618
1.546LeuTrp: 1.546 ± 0.475
2.603LeuTyr: 2.603 ± 0.471
0.0LeuXaa: 0.0 ± 0.0
Met
3.173MetAla: 3.173 ± 0.429
0.163MetCys: 0.163 ± 0.123
2.197MetAsp: 2.197 ± 0.408
1.22MetGlu: 1.22 ± 0.304
0.976MetPhe: 0.976 ± 0.252
1.464MetGly: 1.464 ± 0.288
0.325MetHis: 0.325 ± 0.188
1.22MetIle: 1.22 ± 0.296
1.22MetLys: 1.22 ± 0.264
2.929MetLeu: 2.929 ± 0.447
0.488MetMet: 0.488 ± 0.257
0.814MetAsn: 0.814 ± 0.234
0.895MetPro: 0.895 ± 0.248
1.952MetGln: 1.952 ± 0.477
1.464MetArg: 1.464 ± 0.321
1.22MetSer: 1.22 ± 0.33
2.441MetThr: 2.441 ± 0.457
1.464MetVal: 1.464 ± 0.386
0.081MetTrp: 0.081 ± 0.107
0.732MetTyr: 0.732 ± 0.25
0.0MetXaa: 0.0 ± 0.0
Asn
3.742AsnAla: 3.742 ± 0.664
0.569AsnCys: 0.569 ± 0.177
2.115AsnAsp: 2.115 ± 0.388
2.929AsnGlu: 2.929 ± 0.555
1.952AsnPhe: 1.952 ± 0.426
4.556AsnGly: 4.556 ± 0.693
0.407AsnHis: 0.407 ± 0.153
2.685AsnIle: 2.685 ± 0.463
2.197AsnLys: 2.197 ± 0.404
3.173AsnLeu: 3.173 ± 0.401
0.895AsnMet: 0.895 ± 0.304
1.464AsnAsn: 1.464 ± 0.356
2.359AsnPro: 2.359 ± 0.386
1.464AsnGln: 1.464 ± 0.225
2.115AsnArg: 2.115 ± 0.553
3.091AsnSer: 3.091 ± 0.53
2.278AsnThr: 2.278 ± 0.437
3.254AsnVal: 3.254 ± 0.518
0.651AsnTrp: 0.651 ± 0.251
1.79AsnTyr: 1.79 ± 0.431
0.0AsnXaa: 0.0 ± 0.0
Pro
3.091ProAla: 3.091 ± 0.4
0.569ProCys: 0.569 ± 0.234
2.197ProAsp: 2.197 ± 0.391
4.068ProGlu: 4.068 ± 0.613
1.302ProPhe: 1.302 ± 0.327
2.603ProGly: 2.603 ± 0.513
0.325ProHis: 0.325 ± 0.135
1.058ProIle: 1.058 ± 0.383
2.441ProLys: 2.441 ± 0.385
2.685ProLeu: 2.685 ± 0.374
1.058ProMet: 1.058 ± 0.27
1.952ProAsn: 1.952 ± 0.452
0.976ProPro: 0.976 ± 0.332
1.383ProGln: 1.383 ± 0.329
1.708ProArg: 1.708 ± 0.427
2.034ProSer: 2.034 ± 0.34
1.952ProThr: 1.952 ± 0.523
2.766ProVal: 2.766 ± 0.422
0.814ProTrp: 0.814 ± 0.236
1.871ProTyr: 1.871 ± 0.453
0.0ProXaa: 0.0 ± 0.0
Gln
3.336GlnAla: 3.336 ± 0.711
0.163GlnCys: 0.163 ± 0.114
2.197GlnAsp: 2.197 ± 0.309
3.01GlnGlu: 3.01 ± 0.411
1.708GlnPhe: 1.708 ± 0.319
2.929GlnGly: 2.929 ± 0.497
0.325GlnHis: 0.325 ± 0.172
1.79GlnIle: 1.79 ± 0.399
2.929GlnLys: 2.929 ± 0.495
3.824GlnLeu: 3.824 ± 0.539
1.302GlnMet: 1.302 ± 0.493
1.546GlnAsn: 1.546 ± 0.258
1.464GlnPro: 1.464 ± 0.249
3.336GlnGln: 3.336 ± 0.646
2.441GlnArg: 2.441 ± 0.648
2.685GlnSer: 2.685 ± 0.49
1.708GlnThr: 1.708 ± 0.437
2.603GlnVal: 2.603 ± 0.5
0.732GlnTrp: 0.732 ± 0.264
1.79GlnTyr: 1.79 ± 0.446
0.0GlnXaa: 0.0 ± 0.0
Arg
5.369ArgAla: 5.369 ± 0.666
0.814ArgCys: 0.814 ± 0.289
3.498ArgAsp: 3.498 ± 0.426
3.58ArgGlu: 3.58 ± 0.526
2.034ArgPhe: 2.034 ± 0.438
4.068ArgGly: 4.068 ± 0.513
0.895ArgHis: 0.895 ± 0.255
3.01ArgIle: 3.01 ± 0.566
3.824ArgLys: 3.824 ± 0.545
4.637ArgLeu: 4.637 ± 0.57
1.546ArgMet: 1.546 ± 0.321
2.278ArgAsn: 2.278 ± 0.367
2.278ArgPro: 2.278 ± 0.426
2.522ArgGln: 2.522 ± 0.519
3.173ArgArg: 3.173 ± 0.53
3.986ArgSer: 3.986 ± 0.429
3.01ArgThr: 3.01 ± 0.488
3.824ArgVal: 3.824 ± 0.555
0.895ArgTrp: 0.895 ± 0.274
1.22ArgTyr: 1.22 ± 0.265
0.0ArgXaa: 0.0 ± 0.0
Ser
4.149SerAla: 4.149 ± 0.679
0.569SerCys: 0.569 ± 0.196
4.556SerAsp: 4.556 ± 0.586
3.986SerGlu: 3.986 ± 0.517
3.254SerPhe: 3.254 ± 0.54
5.207SerGly: 5.207 ± 0.858
1.627SerHis: 1.627 ± 0.333
2.929SerIle: 2.929 ± 0.482
3.905SerLys: 3.905 ± 0.566
4.393SerLeu: 4.393 ± 0.721
1.22SerMet: 1.22 ± 0.369
1.79SerAsn: 1.79 ± 0.519
2.115SerPro: 2.115 ± 0.403
2.766SerGln: 2.766 ± 0.477
3.336SerArg: 3.336 ± 0.721
2.929SerSer: 2.929 ± 0.59
4.474SerThr: 4.474 ± 0.82
4.149SerVal: 4.149 ± 0.562
1.139SerTrp: 1.139 ± 0.353
2.441SerTyr: 2.441 ± 0.501
0.0SerXaa: 0.0 ± 0.0
Thr
4.8ThrAla: 4.8 ± 0.756
0.569ThrCys: 0.569 ± 0.253
3.173ThrAsp: 3.173 ± 0.479
3.254ThrGlu: 3.254 ± 0.477
2.359ThrPhe: 2.359 ± 0.488
5.369ThrGly: 5.369 ± 0.942
1.302ThrHis: 1.302 ± 0.259
3.905ThrIle: 3.905 ± 0.648
4.068ThrLys: 4.068 ± 0.621
5.125ThrLeu: 5.125 ± 0.661
1.546ThrMet: 1.546 ± 0.323
2.115ThrAsn: 2.115 ± 0.476
2.603ThrPro: 2.603 ± 0.453
2.197ThrGln: 2.197 ± 0.446
2.359ThrArg: 2.359 ± 0.365
4.23ThrSer: 4.23 ± 0.675
2.685ThrThr: 2.685 ± 0.701
3.417ThrVal: 3.417 ± 0.564
0.488ThrTrp: 0.488 ± 0.199
1.627ThrTyr: 1.627 ± 0.345
0.0ThrXaa: 0.0 ± 0.0
Val
5.695ValAla: 5.695 ± 0.748
0.569ValCys: 0.569 ± 0.173
3.091ValAsp: 3.091 ± 0.462
4.149ValGlu: 4.149 ± 0.606
2.278ValPhe: 2.278 ± 0.603
5.369ValGly: 5.369 ± 0.639
1.464ValHis: 1.464 ± 0.479
3.742ValIle: 3.742 ± 0.672
4.556ValLys: 4.556 ± 0.499
5.288ValLeu: 5.288 ± 0.772
1.464ValMet: 1.464 ± 0.398
4.149ValAsn: 4.149 ± 0.754
2.278ValPro: 2.278 ± 0.478
2.197ValGln: 2.197 ± 0.323
3.986ValArg: 3.986 ± 0.552
4.068ValSer: 4.068 ± 0.739
5.532ValThr: 5.532 ± 0.678
4.719ValVal: 4.719 ± 0.749
0.814ValTrp: 0.814 ± 0.286
2.197ValTyr: 2.197 ± 0.436
0.0ValXaa: 0.0 ± 0.0
Trp
0.651TrpAla: 0.651 ± 0.216
0.325TrpCys: 0.325 ± 0.179
0.569TrpAsp: 0.569 ± 0.214
1.302TrpGlu: 1.302 ± 0.302
0.325TrpPhe: 0.325 ± 0.143
0.732TrpGly: 0.732 ± 0.249
0.488TrpHis: 0.488 ± 0.238
0.651TrpIle: 0.651 ± 0.27
1.302TrpLys: 1.302 ± 0.382
1.302TrpLeu: 1.302 ± 0.374
0.325TrpMet: 0.325 ± 0.13
0.814TrpAsn: 0.814 ± 0.255
0.244TrpPro: 0.244 ± 0.133
0.651TrpGln: 0.651 ± 0.185
0.895TrpArg: 0.895 ± 0.239
1.22TrpSer: 1.22 ± 0.367
0.732TrpThr: 0.732 ± 0.265
1.464TrpVal: 1.464 ± 0.391
0.244TrpTrp: 0.244 ± 0.145
0.081TrpTyr: 0.081 ± 0.076
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.359TyrAla: 2.359 ± 0.496
0.244TyrCys: 0.244 ± 0.159
2.685TyrAsp: 2.685 ± 0.473
2.115TyrGlu: 2.115 ± 0.41
1.139TyrPhe: 1.139 ± 0.295
3.091TyrGly: 3.091 ± 0.425
0.895TyrHis: 0.895 ± 0.245
1.139TyrIle: 1.139 ± 0.316
2.034TyrLys: 2.034 ± 0.337
2.278TyrLeu: 2.278 ± 0.373
1.464TyrMet: 1.464 ± 0.403
2.115TyrAsn: 2.115 ± 0.45
1.22TyrPro: 1.22 ± 0.308
1.546TyrGln: 1.546 ± 0.458
2.603TyrArg: 2.603 ± 0.503
1.22TyrSer: 1.22 ± 0.305
2.034TyrThr: 2.034 ± 0.501
2.766TyrVal: 2.766 ± 0.532
0.569TyrTrp: 0.569 ± 0.216
0.895TyrTyr: 0.895 ± 0.331
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (12293 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski