Amino acid dipeptide frequency for Klebsiella phage ST512-KPC3phi13.6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and with equal probability in proteins, each of the 400 would therefore be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
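
As a rough illustration of how per mille values of this kind could be computed, here is a minimal Python sketch. It counts overlapping residue pairs within each protein and divides by the total number of pairs. The function name `dipeptide_permille`, the toy sequences, and the choice of denominator are assumptions made for this example; the exact normalization used by the database is not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs; a pair never spans two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for 400 dipeptides: 2.5 per mille (0.25 %).
UNIFORM_PERMILLE = 1000.0 / 400

toy = ["MKALAAL", "AALK"]  # made-up sequences, not taken from this proteome
freqs = dipeptide_permille(toy)
over = sorted(p for p, f in freqs.items() if f > UNIFORM_PERMILLE)
```

Comparing each value against `UNIFORM_PERMILLE` mirrors the over/under-representation marking described above, under the assumption that the expectation is the uniform 2.5‰.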

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.392 ± 1.425
AlaCys: 1.307 ± 0.327
AlaAsp: 6.615 ± 0.828
AlaGlu: 6.207 ± 0.762
AlaPhe: 3.594 ± 0.455
AlaGly: 8.086 ± 1.471
AlaHis: 1.47 ± 0.245
AlaIle: 4.329 ± 0.558
AlaLys: 4.084 ± 0.631
AlaLeu: 9.882 ± 1.322
AlaMet: 2.123 ± 0.386
AlaAsn: 2.205 ± 0.41
AlaPro: 2.859 ± 0.533
AlaGln: 2.859 ± 0.592
AlaArg: 5.88 ± 0.62
AlaSer: 6.125 ± 0.698
AlaThr: 4.9 ± 0.589
AlaVal: 6.779 ± 0.678
AlaTrp: 1.307 ± 0.289
AlaTyr: 3.022 ± 0.517
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.143 ± 0.338
CysCys: 0.327 ± 0.175
CysAsp: 0.49 ± 0.218
CysGlu: 0.408 ± 0.164
CysPhe: 0.245 ± 0.159
CysGly: 0.49 ± 0.245
CysHis: 0.082 ± 0.09
CysIle: 0.572 ± 0.295
CysLys: 0.572 ± 0.237
CysLeu: 1.225 ± 0.309
CysMet: 0.327 ± 0.142
CysAsn: 0.49 ± 0.214
CysPro: 0.653 ± 0.289
CysGln: 0.408 ± 0.228
CysArg: 0.653 ± 0.235
CysSer: 0.572 ± 0.246
CysThr: 0.49 ± 0.188
CysVal: 1.225 ± 0.291
CysTrp: 0.327 ± 0.157
CysTyr: 0.408 ± 0.189
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.962 ± 0.743
AspCys: 0.735 ± 0.232
AspAsp: 2.859 ± 0.458
AspGlu: 3.675 ± 0.45
AspPhe: 2.614 ± 0.345
AspGly: 5.064 ± 0.56
AspHis: 0.572 ± 0.216
AspIle: 3.022 ± 0.5
AspLys: 3.675 ± 0.677
AspLeu: 5.309 ± 0.766
AspMet: 0.98 ± 0.222
AspAsn: 2.287 ± 0.487
AspPro: 2.859 ± 0.472
AspGln: 2.45 ± 0.42
AspArg: 2.777 ± 0.429
AspSer: 4.084 ± 0.594
AspThr: 2.614 ± 0.518
AspVal: 3.757 ± 0.522
AspTrp: 0.817 ± 0.252
AspTyr: 2.205 ± 0.463
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.635 ± 0.613
GluCys: 0.653 ± 0.298
GluAsp: 3.104 ± 0.585
GluGlu: 2.695 ± 0.52
GluPhe: 2.287 ± 0.419
GluGly: 3.267 ± 0.621
GluHis: 1.062 ± 0.249
GluIle: 3.675 ± 0.566
GluLys: 3.022 ± 0.414
GluLeu: 7.024 ± 1.001
GluMet: 2.859 ± 0.476
GluAsn: 3.675 ± 0.441
GluPro: 2.123 ± 0.575
GluGln: 3.92 ± 0.68
GluArg: 3.594 ± 0.446
GluSer: 3.349 ± 0.473
GluThr: 3.594 ± 0.617
GluVal: 4.247 ± 0.529
GluTrp: 1.143 ± 0.268
GluTyr: 1.96 ± 0.46
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.43 ± 0.521
PheCys: 0.245 ± 0.164
PheAsp: 2.123 ± 0.486
PheGlu: 2.205 ± 0.411
PhePhe: 1.878 ± 0.429
PheGly: 1.878 ± 0.356
PheHis: 0.653 ± 0.227
PheIle: 2.777 ± 0.579
PheLys: 1.388 ± 0.274
PheLeu: 2.695 ± 0.549
PheMet: 0.898 ± 0.286
PheAsn: 1.715 ± 0.44
PhePro: 0.898 ± 0.307
PheGln: 1.552 ± 0.379
PheArg: 2.287 ± 0.478
PheSer: 3.022 ± 0.361
PheThr: 2.532 ± 0.505
PheVal: 1.552 ± 0.292
PheTrp: 0.98 ± 0.302
PheTyr: 0.898 ± 0.284
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.329 ± 0.728
GlyCys: 0.572 ± 0.198
GlyAsp: 3.839 ± 0.481
GlyGlu: 4.737 ± 0.623
GlyPhe: 2.695 ± 0.457
GlyGly: 5.717 ± 1.204
GlyHis: 1.47 ± 0.43
GlyIle: 3.675 ± 0.543
GlyLys: 4.492 ± 0.671
GlyLeu: 5.554 ± 0.638
GlyMet: 2.123 ± 0.392
GlyAsn: 2.532 ± 0.478
GlyPro: 2.042 ± 0.427
GlyGln: 2.123 ± 0.482
GlyArg: 3.92 ± 0.511
GlySer: 3.512 ± 0.528
GlyThr: 4.002 ± 0.58
GlyVal: 5.554 ± 0.656
GlyTrp: 1.062 ± 0.306
GlyTyr: 1.715 ± 0.366
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.307 ± 0.383
HisCys: 0.408 ± 0.208
HisAsp: 1.143 ± 0.321
HisGlu: 1.388 ± 0.404
HisPhe: 0.49 ± 0.227
HisGly: 1.388 ± 0.341
HisHis: 1.143 ± 0.393
HisIle: 0.98 ± 0.273
HisLys: 1.307 ± 0.255
HisLeu: 1.96 ± 0.398
HisMet: 0.735 ± 0.276
HisAsn: 0.245 ± 0.149
HisPro: 1.143 ± 0.387
HisGln: 0.572 ± 0.212
HisArg: 1.307 ± 0.343
HisSer: 1.47 ± 0.356
HisThr: 1.225 ± 0.342
HisVal: 0.817 ± 0.253
HisTrp: 0.572 ± 0.206
HisTyr: 1.143 ± 0.338
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.329 ± 0.625
IleCys: 0.898 ± 0.236
IleAsp: 4.41 ± 0.573
IleGlu: 3.594 ± 0.595
IlePhe: 1.715 ± 0.452
IleGly: 3.512 ± 0.556
IleHis: 1.307 ± 0.337
IleIle: 2.695 ± 0.526
IleLys: 3.022 ± 0.547
IleLeu: 3.512 ± 0.542
IleMet: 2.123 ± 0.423
IleAsn: 3.512 ± 0.599
IlePro: 2.287 ± 0.682
IleGln: 1.307 ± 0.286
IleArg: 3.757 ± 0.523
IleSer: 4.329 ± 0.724
IleThr: 4.165 ± 0.522
IleVal: 2.777 ± 0.553
IleTrp: 0.653 ± 0.174
IleTyr: 1.715 ± 0.292
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.757 ± 0.586
LysCys: 0.408 ± 0.204
LysAsp: 2.45 ± 0.451
LysGlu: 3.185 ± 0.528
LysPhe: 1.878 ± 0.43
LysGly: 3.185 ± 0.431
LysHis: 1.797 ± 0.42
LysIle: 3.43 ± 0.829
LysLys: 4.819 ± 0.801
LysLeu: 4.819 ± 0.754
LysMet: 1.552 ± 0.315
LysAsn: 2.695 ± 0.447
LysPro: 1.878 ± 0.373
LysGln: 2.369 ± 0.649
LysArg: 4.165 ± 0.564
LysSer: 2.94 ± 0.596
LysThr: 3.675 ± 0.555
LysVal: 2.287 ± 0.448
LysTrp: 1.388 ± 0.413
LysTyr: 1.633 ± 0.351
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.637 ± 1.067
LeuCys: 1.062 ± 0.288
LeuAsp: 5.88 ± 0.749
LeuGlu: 5.227 ± 0.649
LeuPhe: 3.757 ± 0.515
LeuGly: 5.064 ± 0.766
LeuHis: 1.797 ± 0.467
LeuIle: 4.41 ± 0.5
LeuLys: 5.309 ± 0.759
LeuLeu: 8.086 ± 0.838
LeuMet: 3.022 ± 0.501
LeuAsn: 5.227 ± 0.655
LeuPro: 3.757 ± 0.506
LeuGln: 5.39 ± 0.693
LeuArg: 6.044 ± 0.877
LeuSer: 7.596 ± 0.812
LeuThr: 5.39 ± 0.716
LeuVal: 4.492 ± 0.702
LeuTrp: 0.817 ± 0.211
LeuTyr: 3.349 ± 0.555
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.002 ± 0.622
MetCys: 0.327 ± 0.15
MetAsp: 1.143 ± 0.275
MetGlu: 1.062 ± 0.277
MetPhe: 1.062 ± 0.221
MetGly: 0.98 ± 0.358
MetHis: 0.49 ± 0.193
MetIle: 1.225 ± 0.278
MetLys: 1.715 ± 0.345
MetLeu: 1.96 ± 0.515
MetMet: 1.552 ± 0.576
MetAsn: 1.47 ± 0.363
MetPro: 1.388 ± 0.377
MetGln: 1.225 ± 0.372
MetArg: 1.552 ± 0.418
MetSer: 2.042 ± 0.412
MetThr: 2.45 ± 0.404
MetVal: 1.307 ± 0.292
MetTrp: 0.245 ± 0.116
MetTyr: 0.572 ± 0.242
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.349 ± 0.5
AsnCys: 0.408 ± 0.254
AsnAsp: 1.797 ± 0.354
AsnGlu: 3.594 ± 0.56
AsnPhe: 1.143 ± 0.329
AsnGly: 3.022 ± 0.532
AsnHis: 0.572 ± 0.266
AsnIle: 3.267 ± 0.493
AsnLys: 2.205 ± 0.525
AsnLeu: 4.41 ± 0.575
AsnMet: 0.653 ± 0.251
AsnAsn: 1.878 ± 0.304
AsnPro: 3.185 ± 0.542
AsnGln: 1.797 ± 0.459
AsnArg: 2.94 ± 0.601
AsnSer: 2.287 ± 0.381
AsnThr: 1.307 ± 0.285
AsnVal: 3.104 ± 0.605
AsnTrp: 0.898 ± 0.268
AsnTyr: 0.898 ± 0.354
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.165 ± 0.621
ProCys: 0.408 ± 0.173
ProAsp: 3.757 ± 0.594
ProGlu: 3.104 ± 0.49
ProPhe: 0.898 ± 0.346
ProGly: 2.94 ± 0.455
ProHis: 1.225 ± 0.348
ProIle: 1.633 ± 0.357
ProLys: 2.042 ± 0.364
ProLeu: 3.349 ± 0.471
ProMet: 0.49 ± 0.213
ProAsn: 1.715 ± 0.435
ProPro: 2.369 ± 0.522
ProGln: 1.797 ± 0.367
ProArg: 2.123 ± 0.327
ProSer: 2.369 ± 0.404
ProThr: 1.633 ± 0.317
ProVal: 3.185 ± 0.571
ProTrp: 0.49 ± 0.22
ProTyr: 1.552 ± 0.533
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.145 ± 0.693
GlnCys: 0.653 ± 0.232
GlnAsp: 2.123 ± 0.439
GlnGlu: 2.205 ± 0.494
GlnPhe: 0.898 ± 0.232
GlnGly: 3.267 ± 0.548
GlnHis: 0.408 ± 0.19
GlnIle: 2.042 ± 0.468
GlnLys: 1.715 ± 0.404
GlnLeu: 5.145 ± 0.648
GlnMet: 1.388 ± 0.408
GlnAsn: 1.47 ± 0.358
GlnPro: 1.878 ± 0.367
GlnGln: 2.205 ± 0.459
GlnArg: 3.43 ± 0.595
GlnSer: 2.532 ± 0.441
GlnThr: 2.287 ± 0.437
GlnVal: 2.205 ± 0.414
GlnTrp: 0.408 ± 0.186
GlnTyr: 0.898 ± 0.27
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.207 ± 0.829
ArgCys: 0.49 ± 0.184
ArgAsp: 3.104 ± 0.439
ArgGlu: 4.9 ± 0.712
ArgPhe: 2.369 ± 0.55
ArgGly: 3.349 ± 0.484
ArgHis: 1.878 ± 0.347
ArgIle: 4.165 ± 0.535
ArgLys: 3.594 ± 0.918
ArgLeu: 6.289 ± 0.89
ArgMet: 1.307 ± 0.308
ArgAsn: 2.859 ± 0.51
ArgPro: 1.47 ± 0.335
ArgGln: 3.839 ± 0.479
ArgArg: 5.227 ± 0.824
ArgSer: 2.695 ± 0.589
ArgThr: 3.349 ± 0.406
ArgVal: 4.41 ± 0.751
ArgTrp: 1.062 ± 0.244
ArgTyr: 1.878 ± 0.358
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.452 ± 0.65
SerCys: 0.572 ± 0.205
SerAsp: 3.267 ± 0.351
SerGlu: 3.512 ± 0.553
SerPhe: 2.614 ± 0.503
SerGly: 4.737 ± 0.521
SerHis: 1.307 ± 0.357
SerIle: 2.859 ± 0.603
SerLys: 3.185 ± 0.622
SerLeu: 7.351 ± 0.851
SerMet: 1.878 ± 0.36
SerAsn: 2.123 ± 0.376
SerPro: 2.859 ± 0.372
SerGln: 2.614 ± 0.398
SerArg: 3.43 ± 0.488
SerSer: 3.185 ± 0.462
SerThr: 3.104 ± 0.534
SerVal: 4.002 ± 0.698
SerTrp: 1.062 ± 0.271
SerTyr: 1.715 ± 0.328
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.799 ± 0.807
ThrCys: 0.327 ± 0.169
ThrAsp: 3.757 ± 0.704
ThrGlu: 2.94 ± 0.542
ThrPhe: 2.287 ± 0.628
ThrGly: 4.819 ± 0.735
ThrHis: 1.47 ± 0.311
ThrIle: 2.695 ± 0.457
ThrLys: 2.369 ± 0.455
ThrLeu: 5.962 ± 0.63
ThrMet: 0.817 ± 0.295
ThrAsn: 1.797 ± 0.342
ThrPro: 3.022 ± 0.458
ThrGln: 1.388 ± 0.292
ThrArg: 3.267 ± 0.524
ThrSer: 3.757 ± 0.59
ThrThr: 4.737 ± 0.823
ThrVal: 4.329 ± 0.695
ThrTrp: 0.572 ± 0.212
ThrTyr: 2.123 ± 0.506
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.39 ± 0.752
ValCys: 0.327 ± 0.156
ValAsp: 3.675 ± 0.648
ValGlu: 4.655 ± 0.587
ValPhe: 1.552 ± 0.303
ValGly: 2.45 ± 0.479
ValHis: 1.062 ± 0.317
ValIle: 4.492 ± 0.741
ValLys: 4.165 ± 0.659
ValLeu: 5.717 ± 0.555
ValMet: 2.042 ± 0.461
ValAsn: 2.94 ± 0.619
ValPro: 2.532 ± 0.465
ValGln: 2.45 ± 0.568
ValArg: 4.329 ± 0.628
ValSer: 3.92 ± 0.624
ValThr: 4.247 ± 0.585
ValVal: 3.594 ± 0.515
ValTrp: 0.898 ± 0.259
ValTyr: 1.797 ± 0.47
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.388 ± 0.343
TrpCys: 0.245 ± 0.145
TrpAsp: 0.817 ± 0.265
TrpGlu: 1.633 ± 0.371
TrpPhe: 0.327 ± 0.146
TrpGly: 0.327 ± 0.186
TrpHis: 0.49 ± 0.17
TrpIle: 1.307 ± 0.325
TrpLys: 0.735 ± 0.185
TrpLeu: 2.123 ± 0.475
TrpMet: 0.408 ± 0.207
TrpAsn: 0.163 ± 0.104
TrpPro: 0.653 ± 0.241
TrpGln: 0.572 ± 0.177
TrpArg: 1.307 ± 0.314
TrpSer: 0.735 ± 0.262
TrpThr: 0.572 ± 0.173
TrpVal: 0.653 ± 0.265
TrpTrp: 0.408 ± 0.17
TrpTyr: 0.572 ± 0.27
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.205 ± 0.362
TyrCys: 0.735 ± 0.272
TyrAsp: 1.96 ± 0.448
TyrGlu: 1.96 ± 0.475
TyrPhe: 1.307 ± 0.471
TyrGly: 2.123 ± 0.455
TyrHis: 0.49 ± 0.193
TyrIle: 2.369 ± 0.483
TyrLys: 0.898 ± 0.225
TyrLeu: 2.94 ± 0.493
TyrMet: 0.408 ± 0.192
TyrAsn: 1.715 ± 0.467
TyrPro: 1.388 ± 0.387
TyrGln: 1.552 ± 0.413
TyrArg: 2.45 ± 0.448
TyrSer: 1.307 ± 0.231
TyrThr: 1.96 ± 0.465
TyrVal: 1.96 ± 0.436
TyrTrp: 0.327 ± 0.165
TyrTyr: 1.143 ± 0.29
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (12245 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
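
One way a protein-level bootstrap of this kind might look is sketched below in Python. Whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The helper names `pair_permille` and `bootstrap_error`, the fixed seed, and the use of the standard deviation as the error measure are assumptions for illustration; the page does not say which statistic the database reports.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MKALAAL", "AALK", "GGGG"]  # made-up sequences, not from this proteome
err_al = bootstrap_error(toy, "AL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.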

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski