Amino acid dipeptide frequency for Klebsiella phage ST101-KPC2phi6.1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
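
As an illustration, a dipeptide frequency can be obtained by counting every pair of adjacent residues and dividing by the total number of pairs. The sketch below is a minimal Python example under stated assumptions: the function name `dipeptide_frequencies`, the toy sequences, the one-letter residue codes, and the choice to count overlapping pairs within each protein (never across protein boundaries) are illustrative and are not taken from Proteome-pI itself.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for one-letter protein sequences.

    Assumption: overlapping adjacent pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Adjacent pairs at positions (0,1), (1,2), ...
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

toy_proteome = ["MAAK", "MAK"]  # hypothetical sequences, not from this phage
freqs = dipeptide_frequencies(toy_proteome)
```

For the toy input there are five pairs in total (MA, AA, AK from the first sequence; MA, AK from the second), so the frequencies sum to 1000‰.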

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
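
The arithmetic behind the expected value can be sketched as follows. This is a hedged example: the helper `classify` and the simple greater-than/less-than rule are assumptions made for illustration (the page does not state the exact rule used for colouring); the three observed values are copied from the table.

```python
N_STANDARD = 20  # standard amino acids
N_WITH_XAA = 21  # plus Xaa for non-standard or unknown residues

possible_pairs = N_STANDARD ** 2            # 400
possible_pairs_with_xaa = N_WITH_XAA ** 2   # 441

expected_fraction = 1 / possible_pairs          # 0.0025
expected_percent = 100 * expected_fraction      # 0.25 %
expected_per_mille = 1000 * expected_fraction   # 2.5 per mille

def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

# Values in per mille, taken from the table
observed = {"AlaAla": 11.773, "CysCys": 0.275, "ValPro": 2.547}
labels = {pair: classify(value) for pair, value in observed.items()}
```

Under this simple rule AlaAla and ValPro fall above the 2.5‰ expectation and CysCys below it.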

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Values are listed by first residue (group heading); within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 11.773 ± 1.515
AlaCys: 1.102 ± 0.285
AlaAsp: 6.265 ± 0.623
AlaGlu: 8.124 ± 1.108
AlaPhe: 2.754 ± 0.372
AlaGly: 7.367 ± 0.796
AlaHis: 1.515 ± 0.335
AlaIle: 5.164 ± 0.575
AlaLys: 6.54 ± 0.787
AlaLeu: 7.917 ± 0.968
AlaMet: 2.823 ± 0.369
AlaAsn: 3.511 ± 0.457
AlaPro: 1.997 ± 0.382
AlaGln: 4.75 ± 0.661
AlaArg: 5.37 ± 0.58
AlaSer: 6.059 ± 0.678
AlaThr: 5.921 ± 0.716
AlaVal: 5.37 ± 0.646
AlaTrp: 1.583 ± 0.369
AlaTyr: 3.442 ± 0.48
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.17 ± 0.342
CysCys: 0.275 ± 0.129
CysAsp: 1.239 ± 0.317
CysGlu: 0.482 ± 0.155
CysPhe: 0.138 ± 0.094
CysGly: 0.964 ± 0.314
CysHis: 0.344 ± 0.158
CysIle: 0.757 ± 0.214
CysLys: 0.482 ± 0.187
CysLeu: 0.757 ± 0.241
CysMet: 0.344 ± 0.14
CysAsn: 0.413 ± 0.151
CysPro: 0.413 ± 0.159
CysGln: 0.62 ± 0.302
CysArg: 1.17 ± 0.314
CysSer: 0.757 ± 0.247
CysThr: 0.482 ± 0.224
CysVal: 0.482 ± 0.198
CysTrp: 0.275 ± 0.126
CysTyr: 0.275 ± 0.139
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.714 ± 0.697
AspCys: 0.895 ± 0.285
AspAsp: 5.37 ± 0.692
AspGlu: 3.58 ± 0.485
AspPhe: 2.41 ± 0.369
AspGly: 5.37 ± 0.707
AspHis: 0.826 ± 0.172
AspIle: 2.892 ± 0.667
AspLys: 3.098 ± 0.618
AspLeu: 4.269 ± 0.425
AspMet: 1.79 ± 0.301
AspAsn: 2.96 ± 0.543
AspPro: 2.478 ± 0.463
AspGln: 1.652 ± 0.31
AspArg: 2.41 ± 0.385
AspSer: 3.167 ± 0.512
AspThr: 3.029 ± 0.361
AspVal: 4.475 ± 0.608
AspTrp: 1.652 ± 0.355
AspTyr: 2.754 ± 0.516
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.747 ± 0.766
GluCys: 0.688 ± 0.259
GluAsp: 2.341 ± 0.336
GluGlu: 4.406 ± 0.808
GluPhe: 2.892 ± 0.435
GluGly: 4.131 ± 0.669
GluHis: 1.102 ± 0.279
GluIle: 4.544 ± 0.615
GluLys: 4.062 ± 0.57
GluLeu: 5.852 ± 0.776
GluMet: 2.41 ± 0.444
GluAsn: 2.065 ± 0.391
GluPro: 2.685 ± 0.397
GluGln: 3.855 ± 0.633
GluArg: 5.026 ± 0.721
GluSer: 4.062 ± 0.487
GluThr: 1.997 ± 0.358
GluVal: 3.373 ± 0.475
GluTrp: 1.17 ± 0.229
GluTyr: 2.547 ± 0.37
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.478 ± 0.434
PheCys: 0.413 ± 0.163
PheAsp: 2.823 ± 0.508
PheGlu: 2.203 ± 0.386
PhePhe: 1.515 ± 0.444
PheGly: 2.754 ± 0.442
PheHis: 0.895 ± 0.242
PheIle: 1.79 ± 0.39
PheLys: 1.446 ± 0.249
PheLeu: 1.928 ± 0.363
PheMet: 0.757 ± 0.206
PheAsn: 1.515 ± 0.403
PhePro: 0.688 ± 0.214
PheGln: 0.62 ± 0.193
PheArg: 2.685 ± 0.484
PheSer: 3.029 ± 0.381
PheThr: 2.547 ± 0.425
PheVal: 2.065 ± 0.36
PheTrp: 0.62 ± 0.218
PheTyr: 1.583 ± 0.313
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.196 ± 0.767
GlyCys: 1.308 ± 0.293
GlyAsp: 5.37 ± 0.489
GlyGlu: 4.819 ± 0.572
GlyPhe: 2.823 ± 0.46
GlyGly: 5.026 ± 0.703
GlyHis: 0.826 ± 0.333
GlyIle: 5.37 ± 0.659
GlyLys: 5.714 ± 0.68
GlyLeu: 6.403 ± 0.689
GlyMet: 1.859 ± 0.323
GlyAsn: 3.305 ± 0.535
GlyPro: 1.515 ± 0.298
GlyGln: 3.167 ± 0.454
GlyArg: 4.131 ± 0.586
GlySer: 3.236 ± 0.531
GlyThr: 3.58 ± 0.552
GlyVal: 4.131 ± 0.517
GlyTrp: 1.102 ± 0.287
GlyTyr: 2.823 ± 0.393
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.102 ± 0.255
HisCys: 0.344 ± 0.164
HisAsp: 1.033 ± 0.3
HisGlu: 1.102 ± 0.279
HisPhe: 1.239 ± 0.305
HisGly: 1.377 ± 0.319
HisHis: 0.275 ± 0.137
HisIle: 1.033 ± 0.286
HisLys: 1.102 ± 0.282
HisLeu: 1.652 ± 0.485
HisMet: 0.275 ± 0.133
HisAsn: 0.482 ± 0.2
HisPro: 0.757 ± 0.196
HisGln: 0.757 ± 0.233
HisArg: 1.239 ± 0.266
HisSer: 1.033 ± 0.247
HisThr: 1.239 ± 0.29
HisVal: 1.17 ± 0.27
HisTrp: 0.413 ± 0.163
HisTyr: 0.757 ± 0.231
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.508 ± 0.712
IleCys: 0.413 ± 0.156
IleAsp: 3.787 ± 0.515
IleGlu: 4.957 ± 0.511
IlePhe: 1.583 ± 0.373
IleGly: 4.544 ± 0.574
IleHis: 0.895 ± 0.187
IleIle: 2.41 ± 0.481
IleLys: 2.96 ± 0.474
IleLeu: 4.269 ± 0.469
IleMet: 0.551 ± 0.209
IleAsn: 2.41 ± 0.366
IlePro: 2.341 ± 0.349
IleGln: 1.583 ± 0.298
IleArg: 3.511 ± 0.488
IleSer: 5.232 ± 0.554
IleThr: 4.75 ± 0.481
IleVal: 3.718 ± 0.621
IleTrp: 0.688 ± 0.176
IleTyr: 1.515 ± 0.364
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.54 ± 0.623
LysCys: 0.757 ± 0.229
LysAsp: 3.236 ± 0.555
LysGlu: 3.236 ± 0.494
LysPhe: 1.583 ± 0.3
LysGly: 3.373 ± 0.365
LysHis: 1.17 ± 0.323
LysIle: 1.859 ± 0.431
LysLys: 3.305 ± 0.565
LysLeu: 4.613 ± 0.544
LysMet: 1.652 ± 0.431
LysAsn: 1.583 ± 0.294
LysPro: 3.305 ± 0.437
LysGln: 2.685 ± 0.36
LysArg: 4.337 ± 0.566
LysSer: 4.131 ± 0.531
LysThr: 3.855 ± 0.581
LysVal: 3.855 ± 0.519
LysTrp: 1.17 ± 0.298
LysTyr: 1.515 ± 0.281
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.881 ± 1.041
LeuCys: 1.033 ± 0.289
LeuAsp: 3.993 ± 0.45
LeuGlu: 4.819 ± 0.545
LeuPhe: 2.478 ± 0.34
LeuGly: 5.164 ± 0.592
LeuHis: 1.583 ± 0.314
LeuIle: 6.334 ± 0.682
LeuLys: 5.095 ± 0.55
LeuLeu: 6.678 ± 0.823
LeuMet: 1.515 ± 0.428
LeuAsn: 4.406 ± 0.429
LeuPro: 2.892 ± 0.48
LeuGln: 3.167 ± 0.41
LeuArg: 4.957 ± 0.681
LeuSer: 4.682 ± 0.58
LeuThr: 4.062 ± 0.582
LeuVal: 4.613 ± 0.559
LeuTrp: 0.757 ± 0.283
LeuTyr: 2.478 ± 0.281
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.928 ± 0.421
MetCys: 0.138 ± 0.104
MetAsp: 1.308 ± 0.347
MetGlu: 1.446 ± 0.274
MetPhe: 0.688 ± 0.198
MetGly: 1.377 ± 0.387
MetHis: 0.62 ± 0.223
MetIle: 1.033 ± 0.256
MetLys: 2.134 ± 0.489
MetLeu: 0.826 ± 0.253
MetMet: 0.482 ± 0.207
MetAsn: 1.446 ± 0.36
MetPro: 1.79 ± 0.464
MetGln: 0.757 ± 0.181
MetArg: 1.17 ± 0.238
MetSer: 2.41 ± 0.331
MetThr: 2.203 ± 0.467
MetVal: 1.17 ± 0.287
MetTrp: 0.62 ± 0.204
MetTyr: 0.62 ± 0.165
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.783 ± 0.852
AsnCys: 0.344 ± 0.133
AsnAsp: 2.685 ± 0.386
AsnGlu: 2.065 ± 0.354
AsnPhe: 0.826 ± 0.189
AsnGly: 4.269 ± 0.59
AsnHis: 0.757 ± 0.258
AsnIle: 2.547 ± 0.363
AsnLys: 2.41 ± 0.394
AsnLeu: 3.167 ± 0.493
AsnMet: 0.826 ± 0.202
AsnAsn: 2.272 ± 0.36
AsnPro: 2.203 ± 0.322
AsnGln: 1.859 ± 0.339
AsnArg: 2.341 ± 0.373
AsnSer: 2.341 ± 0.413
AsnThr: 1.721 ± 0.261
AsnVal: 2.754 ± 0.452
AsnTrp: 0.413 ± 0.183
AsnTyr: 1.652 ± 0.364
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.855 ± 0.561
ProCys: 0.275 ± 0.131
ProAsp: 2.754 ± 0.531
ProGlu: 2.96 ± 0.506
ProPhe: 1.17 ± 0.332
ProGly: 3.58 ± 0.588
ProHis: 0.895 ± 0.212
ProIle: 1.997 ± 0.448
ProLys: 1.928 ± 0.386
ProLeu: 3.029 ± 0.474
ProMet: 0.688 ± 0.247
ProAsn: 1.102 ± 0.269
ProPro: 0.964 ± 0.269
ProGln: 1.446 ± 0.261
ProArg: 1.17 ± 0.305
ProSer: 2.134 ± 0.344
ProThr: 2.341 ± 0.481
ProVal: 3.787 ± 0.535
ProTrp: 0.344 ± 0.176
ProTyr: 0.688 ± 0.234
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.544 ± 0.784
GlnCys: 0.826 ± 0.248
GlnAsp: 1.239 ± 0.248
GlnGlu: 2.065 ± 0.303
GlnPhe: 1.377 ± 0.284
GlnGly: 2.41 ± 0.424
GlnHis: 1.033 ± 0.232
GlnIle: 2.478 ± 0.465
GlnLys: 2.685 ± 0.444
GlnLeu: 3.855 ± 0.4
GlnMet: 0.826 ± 0.258
GlnAsn: 0.895 ± 0.251
GlnPro: 1.239 ± 0.368
GlnGln: 3.511 ± 0.86
GlnArg: 2.96 ± 0.531
GlnSer: 2.341 ± 0.365
GlnThr: 2.547 ± 0.525
GlnVal: 3.373 ± 0.556
GlnTrp: 0.964 ± 0.212
GlnTyr: 1.239 ± 0.339
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.439 ± 0.658
ArgCys: 0.551 ± 0.241
ArgAsp: 3.993 ± 0.539
ArgGlu: 5.508 ± 0.845
ArgPhe: 1.583 ± 0.338
ArgGly: 3.924 ± 0.477
ArgHis: 1.308 ± 0.286
ArgIle: 3.718 ± 0.428
ArgLys: 4.475 ± 0.684
ArgLeu: 4.957 ± 0.551
ArgMet: 1.515 ± 0.285
ArgAsn: 3.511 ± 0.502
ArgPro: 2.547 ± 0.436
ArgGln: 2.616 ± 0.435
ArgArg: 4.475 ± 0.811
ArgSer: 3.511 ± 0.525
ArgThr: 2.616 ± 0.458
ArgVal: 3.787 ± 0.521
ArgTrp: 0.964 ± 0.284
ArgTyr: 1.859 ± 0.437
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.439 ± 0.608
SerCys: 0.826 ± 0.268
SerAsp: 4.2 ± 0.53
SerGlu: 4.475 ± 0.611
SerPhe: 3.029 ± 0.491
SerGly: 5.37 ± 0.58
SerHis: 1.308 ± 0.297
SerIle: 3.305 ± 0.53
SerLys: 2.478 ± 0.466
SerLeu: 4.957 ± 0.783
SerMet: 1.721 ± 0.375
SerAsn: 3.098 ± 0.548
SerPro: 1.859 ± 0.403
SerGln: 2.754 ± 0.573
SerArg: 4.2 ± 0.547
SerSer: 4.062 ± 0.555
SerThr: 2.823 ± 0.534
SerVal: 4.406 ± 0.56
SerTrp: 0.688 ± 0.217
SerTyr: 1.239 ± 0.317
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.99 ± 0.681
ThrCys: 0.344 ± 0.22
ThrAsp: 2.754 ± 0.466
ThrGlu: 3.305 ± 0.445
ThrPhe: 1.859 ± 0.359
ThrGly: 5.439 ± 0.54
ThrHis: 0.826 ± 0.253
ThrIle: 2.892 ± 0.59
ThrLys: 2.341 ± 0.438
ThrLeu: 5.783 ± 0.705
ThrMet: 0.895 ± 0.239
ThrAsn: 2.754 ± 0.354
ThrPro: 2.96 ± 0.413
ThrGln: 2.203 ± 0.537
ThrArg: 2.96 ± 0.529
ThrSer: 3.305 ± 0.548
ThrThr: 4.062 ± 0.672
ThrVal: 4.475 ± 0.722
ThrTrp: 1.377 ± 0.292
ThrTyr: 0.895 ± 0.261
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.852 ± 0.684
ValCys: 0.551 ± 0.227
ValAsp: 3.511 ± 0.617
ValGlu: 3.855 ± 0.446
ValPhe: 2.547 ± 0.462
ValGly: 3.649 ± 0.486
ValHis: 1.033 ± 0.269
ValIle: 4.819 ± 0.586
ValLys: 3.511 ± 0.642
ValLeu: 4.613 ± 0.533
ValMet: 1.721 ± 0.316
ValAsn: 3.373 ± 0.617
ValPro: 2.547 ± 0.486
ValGln: 2.134 ± 0.449
ValArg: 4.544 ± 0.618
ValSer: 3.511 ± 0.516
ValThr: 4.75 ± 0.822
ValVal: 4.406 ± 0.639
ValTrp: 1.446 ± 0.302
ValTyr: 1.583 ± 0.34
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.895 ± 0.245
TrpCys: 0.207 ± 0.111
TrpAsp: 0.688 ± 0.195
TrpGlu: 1.17 ± 0.31
TrpPhe: 0.482 ± 0.167
TrpGly: 0.62 ± 0.204
TrpHis: 0.482 ± 0.223
TrpIle: 0.895 ± 0.263
TrpLys: 0.895 ± 0.28
TrpLeu: 2.065 ± 0.383
TrpMet: 0.482 ± 0.201
TrpAsn: 0.826 ± 0.222
TrpPro: 0.757 ± 0.276
TrpGln: 0.688 ± 0.216
TrpArg: 1.928 ± 0.419
TrpSer: 1.377 ± 0.287
TrpThr: 0.895 ± 0.241
TrpVal: 1.033 ± 0.273
TrpTrp: 0.275 ± 0.134
TrpTyr: 0.62 ± 0.193
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.511 ± 0.492
TyrCys: 0.551 ± 0.175
TyrAsp: 2.203 ± 0.367
TyrGlu: 1.377 ± 0.325
TyrPhe: 1.377 ± 0.419
TyrGly: 1.997 ± 0.345
TyrHis: 0.688 ± 0.22
TyrIle: 1.79 ± 0.334
TyrLys: 1.102 ± 0.304
TyrLeu: 1.997 ± 0.327
TyrMet: 0.964 ± 0.305
TyrAsn: 1.308 ± 0.358
TyrPro: 1.239 ± 0.288
TyrGln: 1.583 ± 0.357
TyrArg: 2.272 ± 0.335
TyrSer: 1.859 ± 0.332
TyrThr: 1.997 ± 0.383
TyrVal: 1.515 ± 0.382
TyrTrp: 0.757 ± 0.201
TyrTyr: 0.895 ± 0.272
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 75 proteins (14526 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
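
A protein-level bootstrap of this kind can be sketched as below. This is only a hedged illustration: the page states just that 100 bootstrap rounds were run at the protein level, so the resampling details here (drawing whole proteins with replacement, reporting the sample standard deviation across rounds), the function names, the fixed seed, and the toy sequences in one-letter code are all assumptions rather than the Proteome-pI procedure.

```python
import random
import statistics

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

toy_proteome = ["MAAK", "MAKAA", "MKKA", "MAAAK"]  # hypothetical sequences
error = bootstrap_error(toy_proteome, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.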

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski