Amino acid dipepetide frequency for Klebsiella phage ST899-OXA48phi17.2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.624AlaAla: 11.624 ± 3.548
0.347AlaCys: 0.347 ± 0.236
6.246AlaAsp: 6.246 ± 1.138
4.511AlaGlu: 4.511 ± 0.855
3.817AlaPhe: 3.817 ± 0.628
8.501AlaGly: 8.501 ± 1.25
0.694AlaHis: 0.694 ± 0.316
3.296AlaIle: 3.296 ± 0.918
5.552AlaLys: 5.552 ± 1.135
8.848AlaLeu: 8.848 ± 1.956
3.123AlaMet: 3.123 ± 0.624
2.429AlaAsn: 2.429 ± 0.455
3.99AlaPro: 3.99 ± 1.433
5.205AlaGln: 5.205 ± 1.351
6.246AlaArg: 6.246 ± 0.823
7.113AlaSer: 7.113 ± 0.845
7.287AlaThr: 7.287 ± 1.225
6.593AlaVal: 6.593 ± 0.842
1.735AlaTrp: 1.735 ± 0.482
1.388AlaTyr: 1.388 ± 0.494
0.0AlaXaa: 0.0 ± 0.0
Cys
0.347CysAla: 0.347 ± 0.276
0.0CysCys: 0.0 ± 0.0
0.347CysAsp: 0.347 ± 0.306
0.347CysGlu: 0.347 ± 0.255
0.347CysPhe: 0.347 ± 0.255
0.52CysGly: 0.52 ± 0.282
0.347CysHis: 0.347 ± 0.207
0.0CysIle: 0.0 ± 0.0
0.347CysLys: 0.347 ± 0.313
0.867CysLeu: 0.867 ± 0.444
0.0CysMet: 0.0 ± 0.0
0.173CysAsn: 0.173 ± 0.196
0.173CysPro: 0.173 ± 0.142
0.694CysGln: 0.694 ± 0.385
0.52CysArg: 0.52 ± 0.299
0.347CysSer: 0.347 ± 0.252
0.347CysThr: 0.347 ± 0.269
0.52CysVal: 0.52 ± 0.355
0.347CysTrp: 0.347 ± 0.207
0.694CysTyr: 0.694 ± 0.393
0.0CysXaa: 0.0 ± 0.0
Asp
4.684AspAla: 4.684 ± 0.747
0.173AspCys: 0.173 ± 0.196
6.419AspAsp: 6.419 ± 2.08
4.511AspGlu: 4.511 ± 0.743
2.082AspPhe: 2.082 ± 0.491
5.378AspGly: 5.378 ± 0.781
0.694AspHis: 0.694 ± 0.352
5.378AspIle: 5.378 ± 1.146
2.255AspLys: 2.255 ± 0.506
5.378AspLeu: 5.378 ± 1.032
0.694AspMet: 0.694 ± 0.328
2.602AspAsn: 2.602 ± 0.679
2.255AspPro: 2.255 ± 0.79
3.296AspGln: 3.296 ± 0.774
3.643AspArg: 3.643 ± 0.658
5.205AspSer: 5.205 ± 1.157
2.776AspThr: 2.776 ± 1.012
3.99AspVal: 3.99 ± 0.68
0.867AspTrp: 0.867 ± 0.24
1.908AspTyr: 1.908 ± 0.79
0.0AspXaa: 0.0 ± 0.0
Glu
6.246GluAla: 6.246 ± 0.957
0.52GluCys: 0.52 ± 0.282
3.47GluAsp: 3.47 ± 0.615
3.99GluGlu: 3.99 ± 0.731
2.602GluPhe: 2.602 ± 0.646
3.47GluGly: 3.47 ± 0.706
0.867GluHis: 0.867 ± 0.459
3.47GluIle: 3.47 ± 0.73
3.123GluLys: 3.123 ± 0.916
5.031GluLeu: 5.031 ± 1.142
1.561GluMet: 1.561 ± 0.378
1.908GluAsn: 1.908 ± 0.512
1.908GluPro: 1.908 ± 0.48
3.817GluGln: 3.817 ± 1.228
3.817GluArg: 3.817 ± 0.984
4.511GluSer: 4.511 ± 0.807
2.776GluThr: 2.776 ± 0.77
4.511GluVal: 4.511 ± 0.844
1.214GluTrp: 1.214 ± 0.553
1.388GluTyr: 1.388 ± 0.398
0.0GluXaa: 0.0 ± 0.0
Phe
2.949PheAla: 2.949 ± 0.478
0.347PheCys: 0.347 ± 0.252
2.255PheAsp: 2.255 ± 0.938
1.214PheGlu: 1.214 ± 0.585
1.388PhePhe: 1.388 ± 0.719
2.602PheGly: 2.602 ± 0.631
0.173PheHis: 0.173 ± 0.231
0.867PheIle: 0.867 ± 0.404
2.082PheLys: 2.082 ± 0.683
2.776PheLeu: 2.776 ± 0.578
0.694PheMet: 0.694 ± 0.355
1.214PheAsn: 1.214 ± 0.314
0.867PhePro: 0.867 ± 0.395
0.694PheGln: 0.694 ± 0.24
2.776PheArg: 2.776 ± 0.804
4.164PheSer: 4.164 ± 0.979
2.082PheThr: 2.082 ± 0.629
2.255PheVal: 2.255 ± 0.986
0.867PheTrp: 0.867 ± 0.42
1.214PheTyr: 1.214 ± 0.536
0.0PheXaa: 0.0 ± 0.0
Gly
5.725GlyAla: 5.725 ± 1.207
0.52GlyCys: 0.52 ± 0.352
4.511GlyAsp: 4.511 ± 0.928
5.205GlyGlu: 5.205 ± 0.846
3.296GlyPhe: 3.296 ± 0.687
6.072GlyGly: 6.072 ± 1.362
0.694GlyHis: 0.694 ± 0.526
4.337GlyIle: 4.337 ± 0.938
3.643GlyLys: 3.643 ± 1.067
8.501GlyLeu: 8.501 ± 1.2
2.602GlyMet: 2.602 ± 0.539
2.429GlyAsn: 2.429 ± 0.692
2.082GlyPro: 2.082 ± 0.635
3.123GlyGln: 3.123 ± 0.618
5.031GlyArg: 5.031 ± 0.647
5.552GlySer: 5.552 ± 0.894
4.858GlyThr: 4.858 ± 1.1
5.725GlyVal: 5.725 ± 1.213
0.867GlyTrp: 0.867 ± 0.384
2.949GlyTyr: 2.949 ± 0.644
0.0GlyXaa: 0.0 ± 0.0
His
0.173HisAla: 0.173 ± 0.173
0.173HisCys: 0.173 ± 0.173
0.52HisAsp: 0.52 ± 0.352
0.867HisGlu: 0.867 ± 0.399
0.0HisPhe: 0.0 ± 0.0
0.694HisGly: 0.694 ± 0.495
0.173HisHis: 0.173 ± 0.207
1.041HisIle: 1.041 ± 0.436
0.347HisLys: 0.347 ± 0.259
0.867HisLeu: 0.867 ± 0.456
0.0HisMet: 0.0 ± 0.0
0.173HisAsn: 0.173 ± 0.175
0.867HisPro: 0.867 ± 0.451
0.347HisGln: 0.347 ± 0.284
0.867HisArg: 0.867 ± 0.441
0.52HisSer: 0.52 ± 0.266
0.867HisThr: 0.867 ± 0.488
0.694HisVal: 0.694 ± 0.389
0.173HisTrp: 0.173 ± 0.207
0.173HisTyr: 0.173 ± 0.19
0.0HisXaa: 0.0 ± 0.0
Ile
5.378IleAla: 5.378 ± 0.819
0.52IleCys: 0.52 ± 0.322
5.899IleAsp: 5.899 ± 1.068
2.082IleGlu: 2.082 ± 0.687
2.082IlePhe: 2.082 ± 0.598
2.949IleGly: 2.949 ± 0.588
0.347IleHis: 0.347 ± 0.276
3.643IleIle: 3.643 ± 0.788
2.429IleLys: 2.429 ± 0.587
3.47IleLeu: 3.47 ± 0.756
0.867IleMet: 0.867 ± 0.36
1.388IleAsn: 1.388 ± 0.516
1.735IlePro: 1.735 ± 0.593
1.561IleGln: 1.561 ± 0.485
3.817IleArg: 3.817 ± 0.849
5.899IleSer: 5.899 ± 1.086
4.164IleThr: 4.164 ± 1.008
2.602IleVal: 2.602 ± 0.943
0.347IleTrp: 0.347 ± 0.276
2.082IleTyr: 2.082 ± 0.579
0.0IleXaa: 0.0 ± 0.0
Lys
4.858LysAla: 4.858 ± 0.891
0.0LysCys: 0.0 ± 0.0
2.602LysAsp: 2.602 ± 0.752
3.817LysGlu: 3.817 ± 0.954
1.041LysPhe: 1.041 ± 0.361
3.123LysGly: 3.123 ± 0.787
0.173LysHis: 0.173 ± 0.242
1.561LysIle: 1.561 ± 0.642
3.123LysLys: 3.123 ± 0.803
3.47LysLeu: 3.47 ± 0.813
1.041LysMet: 1.041 ± 0.528
3.123LysAsn: 3.123 ± 0.771
2.255LysPro: 2.255 ± 0.654
2.429LysGln: 2.429 ± 0.829
1.908LysArg: 1.908 ± 0.428
4.858LysSer: 4.858 ± 1.197
4.164LysThr: 4.164 ± 0.877
3.47LysVal: 3.47 ± 0.907
0.694LysTrp: 0.694 ± 0.374
1.908LysTyr: 1.908 ± 0.459
0.0LysXaa: 0.0 ± 0.0
Leu
6.766LeuAla: 6.766 ± 1.183
1.214LeuCys: 1.214 ± 0.605
5.725LeuAsp: 5.725 ± 1.344
4.511LeuGlu: 4.511 ± 0.883
2.082LeuPhe: 2.082 ± 0.718
5.725LeuGly: 5.725 ± 1.182
0.867LeuHis: 0.867 ± 0.447
4.684LeuIle: 4.684 ± 1.037
5.378LeuLys: 5.378 ± 1.317
6.419LeuLeu: 6.419 ± 1.15
1.908LeuMet: 1.908 ± 0.466
3.47LeuAsn: 3.47 ± 0.648
3.817LeuPro: 3.817 ± 0.997
3.296LeuGln: 3.296 ± 0.794
5.205LeuArg: 5.205 ± 1.136
7.981LeuSer: 7.981 ± 1.118
6.94LeuThr: 6.94 ± 1.12
4.511LeuVal: 4.511 ± 0.728
1.214LeuTrp: 1.214 ± 0.718
1.388LeuTyr: 1.388 ± 0.434
0.0LeuXaa: 0.0 ± 0.0
Met
3.123MetAla: 3.123 ± 0.573
0.173MetCys: 0.173 ± 0.207
1.214MetAsp: 1.214 ± 0.509
0.867MetGlu: 0.867 ± 0.314
0.694MetPhe: 0.694 ± 0.327
1.908MetGly: 1.908 ± 0.381
0.173MetHis: 0.173 ± 0.175
1.041MetIle: 1.041 ± 0.459
1.908MetLys: 1.908 ± 0.767
1.214MetLeu: 1.214 ± 0.381
0.867MetMet: 0.867 ± 0.228
1.735MetAsn: 1.735 ± 0.714
1.214MetPro: 1.214 ± 0.392
1.214MetGln: 1.214 ± 0.426
1.735MetArg: 1.735 ± 0.386
1.908MetSer: 1.908 ± 0.536
2.602MetThr: 2.602 ± 0.773
1.388MetVal: 1.388 ± 0.583
0.347MetTrp: 0.347 ± 0.266
0.867MetTyr: 0.867 ± 0.346
0.0MetXaa: 0.0 ± 0.0
Asn
4.684AsnAla: 4.684 ± 0.902
0.173AsnCys: 0.173 ± 0.142
2.949AsnAsp: 2.949 ± 0.781
2.429AsnGlu: 2.429 ± 0.829
1.561AsnPhe: 1.561 ± 0.448
3.47AsnGly: 3.47 ± 0.846
0.0AsnHis: 0.0 ± 0.0
2.082AsnIle: 2.082 ± 0.672
2.082AsnLys: 2.082 ± 0.759
3.123AsnLeu: 3.123 ± 0.533
1.214AsnMet: 1.214 ± 0.432
1.561AsnAsn: 1.561 ± 0.455
2.602AsnPro: 2.602 ± 0.8
1.388AsnGln: 1.388 ± 0.458
1.561AsnArg: 1.561 ± 0.539
3.123AsnSer: 3.123 ± 0.705
1.388AsnThr: 1.388 ± 0.506
1.735AsnVal: 1.735 ± 0.507
0.694AsnTrp: 0.694 ± 0.424
0.867AsnTyr: 0.867 ± 0.352
0.0AsnXaa: 0.0 ± 0.0
Pro
4.164ProAla: 4.164 ± 1.232
0.347ProCys: 0.347 ± 0.255
2.776ProAsp: 2.776 ± 1.108
4.164ProGlu: 4.164 ± 1.252
1.214ProPhe: 1.214 ± 0.567
3.47ProGly: 3.47 ± 0.715
0.694ProHis: 0.694 ± 0.544
3.123ProIle: 3.123 ± 0.916
2.082ProLys: 2.082 ± 0.622
2.949ProLeu: 2.949 ± 0.959
0.867ProMet: 0.867 ± 0.426
0.694ProAsn: 0.694 ± 0.355
1.388ProPro: 1.388 ± 0.481
1.908ProGln: 1.908 ± 0.657
1.561ProArg: 1.561 ± 0.481
3.123ProSer: 3.123 ± 0.694
2.082ProThr: 2.082 ± 0.649
3.817ProVal: 3.817 ± 1.148
0.52ProTrp: 0.52 ± 0.367
2.082ProTyr: 2.082 ± 0.676
0.0ProXaa: 0.0 ± 0.0
Gln
5.031GlnAla: 5.031 ± 1.376
0.347GlnCys: 0.347 ± 0.207
1.735GlnAsp: 1.735 ± 0.493
3.123GlnGlu: 3.123 ± 1.165
1.388GlnPhe: 1.388 ± 0.36
3.123GlnGly: 3.123 ± 0.766
0.52GlnHis: 0.52 ± 0.39
2.429GlnIle: 2.429 ± 0.725
2.949GlnLys: 2.949 ± 0.801
2.776GlnLeu: 2.776 ± 0.624
0.867GlnMet: 0.867 ± 0.431
2.602GlnAsn: 2.602 ± 0.959
2.429GlnPro: 2.429 ± 0.888
2.429GlnGln: 2.429 ± 1.128
2.776GlnArg: 2.776 ± 0.611
4.164GlnSer: 4.164 ± 0.993
3.643GlnThr: 3.643 ± 1.059
2.429GlnVal: 2.429 ± 0.547
0.52GlnTrp: 0.52 ± 0.267
1.214GlnTyr: 1.214 ± 0.406
0.0GlnXaa: 0.0 ± 0.0
Arg
6.419ArgAla: 6.419 ± 1.316
0.347ArgCys: 0.347 ± 0.236
3.296ArgAsp: 3.296 ± 0.577
3.296ArgGlu: 3.296 ± 0.706
1.561ArgPhe: 1.561 ± 0.589
2.949ArgGly: 2.949 ± 0.685
0.694ArgHis: 0.694 ± 0.339
4.164ArgIle: 4.164 ± 0.909
2.949ArgLys: 2.949 ± 0.557
6.072ArgLeu: 6.072 ± 0.991
2.429ArgMet: 2.429 ± 0.609
3.47ArgAsn: 3.47 ± 0.935
2.255ArgPro: 2.255 ± 0.669
3.643ArgGln: 3.643 ± 0.777
3.296ArgArg: 3.296 ± 0.759
1.735ArgSer: 1.735 ± 0.459
3.643ArgThr: 3.643 ± 0.892
2.429ArgVal: 2.429 ± 0.452
0.867ArgTrp: 0.867 ± 0.423
1.388ArgTyr: 1.388 ± 0.871
0.0ArgXaa: 0.0 ± 0.0
Ser
10.062SerAla: 10.062 ± 1.465
1.214SerCys: 1.214 ± 0.616
4.337SerAsp: 4.337 ± 0.934
3.643SerGlu: 3.643 ± 0.749
2.602SerPhe: 2.602 ± 0.735
10.93SerGly: 10.93 ± 1.412
0.52SerHis: 0.52 ± 0.402
3.47SerIle: 3.47 ± 0.845
2.255SerLys: 2.255 ± 0.777
6.246SerLeu: 6.246 ± 0.961
2.776SerMet: 2.776 ± 0.708
2.776SerAsn: 2.776 ± 0.396
2.602SerPro: 2.602 ± 0.597
3.123SerGln: 3.123 ± 1.239
3.296SerArg: 3.296 ± 0.698
4.858SerSer: 4.858 ± 1.293
5.552SerThr: 5.552 ± 0.781
5.205SerVal: 5.205 ± 0.845
0.694SerTrp: 0.694 ± 0.359
2.082SerTyr: 2.082 ± 0.912
0.0SerXaa: 0.0 ± 0.0
Thr
7.113ThrAla: 7.113 ± 1.393
0.0ThrCys: 0.0 ± 0.0
3.296ThrAsp: 3.296 ± 0.518
4.684ThrGlu: 4.684 ± 1.018
1.908ThrPhe: 1.908 ± 0.533
6.246ThrGly: 6.246 ± 1.364
0.867ThrHis: 0.867 ± 0.407
3.643ThrIle: 3.643 ± 0.864
2.429ThrLys: 2.429 ± 0.652
6.072ThrLeu: 6.072 ± 1.23
1.908ThrMet: 1.908 ± 0.467
2.602ThrAsn: 2.602 ± 0.779
3.817ThrPro: 3.817 ± 1.049
2.949ThrGln: 2.949 ± 0.656
2.949ThrArg: 2.949 ± 0.865
4.684ThrSer: 4.684 ± 1.091
1.908ThrThr: 1.908 ± 0.763
5.725ThrVal: 5.725 ± 1.325
1.214ThrTrp: 1.214 ± 0.645
1.214ThrTyr: 1.214 ± 0.568
0.173ThrXaa: 0.173 ± 0.196
Val
5.378ValAla: 5.378 ± 0.883
0.52ValCys: 0.52 ± 0.278
3.47ValAsp: 3.47 ± 0.574
4.858ValGlu: 4.858 ± 0.956
2.082ValPhe: 2.082 ± 0.743
2.776ValGly: 2.776 ± 0.572
0.52ValHis: 0.52 ± 0.327
3.123ValIle: 3.123 ± 0.838
2.602ValLys: 2.602 ± 0.82
5.378ValLeu: 5.378 ± 1.259
1.561ValMet: 1.561 ± 0.373
2.949ValAsn: 2.949 ± 0.766
3.817ValPro: 3.817 ± 0.784
2.776ValGln: 2.776 ± 0.561
3.123ValArg: 3.123 ± 0.909
5.378ValSer: 5.378 ± 0.89
5.899ValThr: 5.899 ± 0.849
4.337ValVal: 4.337 ± 1.208
1.388ValTrp: 1.388 ± 0.538
2.255ValTyr: 2.255 ± 0.557
0.0ValXaa: 0.0 ± 0.0
Trp
1.041TrpAla: 1.041 ± 0.451
0.173TrpCys: 0.173 ± 0.142
0.867TrpAsp: 0.867 ± 0.465
0.867TrpGlu: 0.867 ± 0.419
0.694TrpPhe: 0.694 ± 0.35
1.388TrpGly: 1.388 ± 0.707
0.173TrpHis: 0.173 ± 0.221
0.694TrpIle: 0.694 ± 0.44
0.52TrpLys: 0.52 ± 0.387
1.735TrpLeu: 1.735 ± 0.8
0.694TrpMet: 0.694 ± 0.518
0.52TrpAsn: 0.52 ± 0.354
1.214TrpPro: 1.214 ± 0.463
1.561TrpGln: 1.561 ± 0.518
0.867TrpArg: 0.867 ± 0.416
0.867TrpSer: 0.867 ± 0.376
0.694TrpThr: 0.694 ± 0.328
0.867TrpVal: 0.867 ± 0.312
0.0TrpTrp: 0.0 ± 0.0
0.173TrpTyr: 0.173 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.776TyrAla: 2.776 ± 0.599
0.173TyrCys: 0.173 ± 0.173
2.255TyrAsp: 2.255 ± 0.816
1.214TyrGlu: 1.214 ± 0.439
1.041TyrPhe: 1.041 ± 0.289
2.602TyrGly: 2.602 ± 0.608
0.347TyrHis: 0.347 ± 0.207
1.214TyrIle: 1.214 ± 0.565
1.561TyrLys: 1.561 ± 0.469
1.908TyrLeu: 1.908 ± 0.526
0.347TyrMet: 0.347 ± 0.429
0.867TyrAsn: 0.867 ± 0.459
1.735TyrPro: 1.735 ± 0.779
1.041TyrGln: 1.041 ± 0.427
1.908TyrArg: 1.908 ± 0.396
2.255TyrSer: 2.255 ± 0.467
1.735TyrThr: 1.735 ± 0.521
1.388TyrVal: 1.388 ± 0.383
0.867TyrTrp: 0.867 ± 0.453
0.867TyrTyr: 0.867 ± 0.316
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.173XaaTrp: 0.173 ± 0.196
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 20 proteins (5765 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski