Amino acid dipeptide frequency for Klebsiella phage ST16-OXA48phi5.3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
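The counting and the uniform expectation described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function names are hypothetical, sequences are assumed to be given in one-letter codes (the table uses three-letter codes), dipeptides are counted as overlapping windows within each protein, and the counts are normalized by the total number of windows. The page does not state which normalization Proteome-pI uses, so treat the denominator as an assumption.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping windows are counted inside each protein
    (no pair spans two proteins) and normalized by the window total.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"   # would be shown in red
    if freq_permille < UNIFORM_PERMILLE:
        return "under"  # would be shown in blue
    return "expected"


if __name__ == "__main__":
    toy = ["AAAC", "ACB"]  # toy input; "B" is non-standard and becomes X
    freqs = dipeptide_permille(toy)
    for pair in ("AA", "AC", "CX", "WW"):
        print(pair, freqs[pair], classify(freqs[pair]))
```

A real proteome would be passed in as a list of protein sequences, for example parsed from a FASTA file. The uniform baseline ignores the fact that single amino acids are not equally frequent, which is the simplification the text itself makes.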

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
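Because the expected frequency is quoted in percent while the table is in per mille, a small conversion helper may be useful when comparing the two. The sketch below uses hypothetical function names and takes the AlaAla value from the table as its example.

```python
def permille_to_fraction(value_permille):
    """Per mille to a plain fraction: multiply by 10^-3."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille to percent: divide by 10."""
    return value_permille / 10.0


def percent_to_permille(value_percent):
    """Percent to per mille: multiply by 10."""
    return value_percent * 10.0


if __name__ == "__main__":
    ala_ala = 12.258  # AlaAla frequency from the table, in per mille
    print(permille_to_fraction(ala_ala))  # as a fraction
    print(permille_to_percent(ala_ala))   # as a percentage
    print(percent_to_permille(0.25))      # uniform expectation in per mille
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, so table values above 2.5 are overrepresented relative to that baseline and values below it are underrepresented.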

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.258 ± 1.933
AlaCys: 0.817 ± 0.37
AlaAsp: 4.203 ± 0.556
AlaGlu: 8.639 ± 1.024
AlaPhe: 3.385 ± 0.465
AlaGly: 6.537 ± 1.018
AlaHis: 2.101 ± 0.497
AlaIle: 7.121 ± 0.931
AlaLys: 5.253 ± 1.15
AlaLeu: 9.223 ± 0.992
AlaMet: 3.502 ± 0.854
AlaAsn: 3.852 ± 0.635
AlaPro: 3.736 ± 0.804
AlaGln: 4.436 ± 0.724
AlaArg: 5.02 ± 0.871
AlaSer: 4.67 ± 0.674
AlaThr: 5.604 ± 0.858
AlaVal: 4.319 ± 1.05
AlaTrp: 0.934 ± 0.418
AlaTyr: 1.985 ± 0.528
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.284 ± 0.478
CysCys: 0.117 ± 0.116
CysAsp: 0.35 ± 0.188
CysGlu: 1.051 ± 0.376
CysPhe: 0.233 ± 0.169
CysGly: 1.284 ± 0.459
CysHis: 0.35 ± 0.192
CysIle: 0.584 ± 0.235
CysLys: 0.467 ± 0.237
CysLeu: 0.817 ± 0.27
CysMet: 0.233 ± 0.173
CysAsn: 0.117 ± 0.114
CysPro: 0.817 ± 0.347
CysGln: 0.467 ± 0.247
CysArg: 0.7 ± 0.296
CysSer: 0.584 ± 0.338
CysThr: 0.467 ± 0.227
CysVal: 0.35 ± 0.192
CysTrp: 0.0 ± 0.0
CysTyr: 0.35 ± 0.262
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.137 ± 0.825
AspCys: 0.7 ± 0.315
AspAsp: 3.152 ± 0.544
AspGlu: 1.985 ± 0.492
AspPhe: 2.335 ± 0.518
AspGly: 5.72 ± 0.747
AspHis: 1.051 ± 0.359
AspIle: 3.736 ± 0.705
AspLys: 3.152 ± 0.573
AspLeu: 5.604 ± 0.936
AspMet: 1.518 ± 0.367
AspAsn: 3.269 ± 0.617
AspPro: 2.802 ± 0.616
AspGln: 1.868 ± 0.469
AspArg: 2.919 ± 0.509
AspSer: 3.502 ± 0.574
AspThr: 2.218 ± 0.574
AspVal: 4.319 ± 0.795
AspTrp: 0.7 ± 0.265
AspTyr: 1.051 ± 0.254
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.137 ± 0.874
GluCys: 1.051 ± 0.528
GluAsp: 1.868 ± 0.397
GluGlu: 4.903 ± 0.763
GluPhe: 3.852 ± 0.579
GluGly: 3.736 ± 0.658
GluHis: 0.817 ± 0.273
GluIle: 4.319 ± 0.682
GluLys: 4.319 ± 0.658
GluLeu: 6.771 ± 0.856
GluMet: 1.868 ± 0.563
GluAsn: 2.218 ± 0.497
GluPro: 1.868 ± 0.504
GluGln: 2.919 ± 0.671
GluArg: 5.137 ± 0.681
GluSer: 2.802 ± 0.515
GluThr: 2.685 ± 0.623
GluVal: 4.67 ± 0.634
GluTrp: 0.934 ± 0.361
GluTyr: 2.101 ± 0.385
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.619 ± 0.604
PheCys: 0.35 ± 0.192
PheAsp: 3.035 ± 0.553
PheGlu: 1.751 ± 0.337
PhePhe: 1.051 ± 0.311
PheGly: 2.802 ± 0.549
PheHis: 0.35 ± 0.195
PheIle: 2.568 ± 0.497
PheLys: 1.518 ± 0.414
PheLeu: 2.568 ± 0.482
PheMet: 1.401 ± 0.427
PheAsn: 2.335 ± 0.49
PhePro: 1.868 ± 0.503
PheGln: 1.051 ± 0.338
PheArg: 2.101 ± 0.594
PheSer: 2.919 ± 0.509
PheThr: 2.685 ± 0.545
PheVal: 2.919 ± 0.747
PheTrp: 0.35 ± 0.178
PheTyr: 1.167 ± 0.276
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.954 ± 0.972
GlyCys: 0.467 ± 0.295
GlyAsp: 5.02 ± 0.884
GlyGlu: 3.736 ± 0.636
GlyPhe: 3.969 ± 0.702
GlyGly: 3.852 ± 0.797
GlyHis: 1.051 ± 0.313
GlyIle: 3.969 ± 0.788
GlyLys: 4.903 ± 0.657
GlyLeu: 6.888 ± 0.912
GlyMet: 2.685 ± 0.455
GlyAsn: 3.152 ± 0.557
GlyPro: 1.634 ± 0.451
GlyGln: 1.868 ± 0.468
GlyArg: 4.67 ± 0.694
GlySer: 3.385 ± 0.504
GlyThr: 4.319 ± 0.765
GlyVal: 5.37 ± 0.896
GlyTrp: 1.634 ± 0.34
GlyTyr: 2.218 ± 0.626
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.868 ± 0.566
HisCys: 0.35 ± 0.261
HisAsp: 1.518 ± 0.393
HisGlu: 1.051 ± 0.3
HisPhe: 0.35 ± 0.184
HisGly: 1.518 ± 0.374
HisHis: 0.584 ± 0.296
HisIle: 1.751 ± 0.491
HisLys: 0.817 ± 0.311
HisLeu: 1.751 ± 0.408
HisMet: 0.584 ± 0.259
HisAsn: 0.584 ± 0.244
HisPro: 1.051 ± 0.374
HisGln: 1.284 ± 0.472
HisArg: 1.284 ± 0.454
HisSer: 0.467 ± 0.237
HisThr: 0.467 ± 0.221
HisVal: 0.467 ± 0.236
HisTrp: 0.467 ± 0.258
HisTyr: 0.817 ± 0.306
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.02 ± 0.643
IleCys: 1.167 ± 0.459
IleAsp: 4.67 ± 0.771
IleGlu: 5.837 ± 0.821
IlePhe: 2.452 ± 0.399
IleGly: 3.619 ± 0.655
IleHis: 0.7 ± 0.277
IleIle: 3.269 ± 0.618
IleLys: 2.685 ± 0.667
IleLeu: 4.203 ± 0.75
IleMet: 1.051 ± 0.393
IleAsn: 2.802 ± 0.53
IlePro: 1.518 ± 0.436
IleGln: 2.335 ± 0.381
IleArg: 3.736 ± 0.633
IleSer: 4.67 ± 0.669
IleThr: 3.969 ± 0.588
IleVal: 2.802 ± 0.579
IleTrp: 0.7 ± 0.266
IleTyr: 1.985 ± 0.361
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.771 ± 0.95
LysCys: 0.35 ± 0.17
LysAsp: 2.218 ± 0.658
LysGlu: 4.67 ± 0.714
LysPhe: 2.452 ± 0.586
LysGly: 3.969 ± 0.699
LysHis: 2.335 ± 0.475
LysIle: 2.919 ± 0.956
LysLys: 3.269 ± 0.663
LysLeu: 4.67 ± 0.724
LysMet: 1.167 ± 0.326
LysAsn: 2.101 ± 0.436
LysPro: 2.802 ± 0.712
LysGln: 2.101 ± 0.537
LysArg: 3.269 ± 0.586
LysSer: 5.604 ± 0.756
LysThr: 3.619 ± 0.585
LysVal: 2.452 ± 0.476
LysTrp: 0.817 ± 0.378
LysTyr: 1.051 ± 0.369
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.207 ± 1.104
LeuCys: 0.7 ± 0.27
LeuAsp: 4.203 ± 0.873
LeuGlu: 4.903 ± 0.835
LeuPhe: 2.568 ± 0.605
LeuGly: 4.436 ± 0.675
LeuHis: 1.868 ± 0.469
LeuIle: 5.02 ± 1.048
LeuLys: 5.72 ± 0.786
LeuLeu: 8.872 ± 1.159
LeuMet: 1.985 ± 0.502
LeuAsn: 3.969 ± 0.778
LeuPro: 4.436 ± 0.641
LeuGln: 4.436 ± 1.155
LeuArg: 5.954 ± 0.972
LeuSer: 6.421 ± 0.666
LeuThr: 4.786 ± 0.665
LeuVal: 5.72 ± 0.827
LeuTrp: 1.051 ± 0.348
LeuTyr: 2.218 ± 0.604
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.919 ± 0.639
MetCys: 0.35 ± 0.246
MetAsp: 1.051 ± 0.313
MetGlu: 1.284 ± 0.426
MetPhe: 0.934 ± 0.295
MetGly: 1.401 ± 0.332
MetHis: 0.467 ± 0.255
MetIle: 1.518 ± 0.41
MetLys: 2.685 ± 0.588
MetLeu: 2.802 ± 0.696
MetMet: 1.051 ± 0.374
MetAsn: 1.401 ± 0.436
MetPro: 1.284 ± 0.43
MetGln: 2.335 ± 0.587
MetArg: 1.401 ± 0.354
MetSer: 2.101 ± 0.494
MetThr: 1.401 ± 0.478
MetVal: 1.284 ± 0.392
MetTrp: 0.0 ± 0.0
MetTyr: 0.35 ± 0.2
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.269 ± 0.622
AsnCys: 0.233 ± 0.155
AsnAsp: 2.568 ± 0.429
AsnGlu: 1.401 ± 0.39
AsnPhe: 1.167 ± 0.32
AsnGly: 3.035 ± 0.627
AsnHis: 0.467 ± 0.224
AsnIle: 3.269 ± 0.501
AsnLys: 2.685 ± 0.46
AsnLeu: 3.385 ± 0.737
AsnMet: 1.634 ± 0.412
AsnAsn: 1.518 ± 0.406
AsnPro: 2.218 ± 0.45
AsnGln: 2.335 ± 0.555
AsnArg: 2.218 ± 0.484
AsnSer: 2.685 ± 0.543
AsnThr: 1.634 ± 0.446
AsnVal: 2.452 ± 0.511
AsnTrp: 0.467 ± 0.205
AsnTyr: 1.401 ± 0.441
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.969 ± 0.803
ProCys: 0.467 ± 0.247
ProAsp: 2.919 ± 0.514
ProGlu: 4.553 ± 0.882
ProPhe: 1.985 ± 0.392
ProGly: 2.919 ± 0.488
ProHis: 1.051 ± 0.272
ProIle: 1.868 ± 0.451
ProLys: 1.985 ± 0.539
ProLeu: 3.852 ± 0.847
ProMet: 0.467 ± 0.218
ProAsn: 1.167 ± 0.468
ProPro: 1.751 ± 0.433
ProGln: 1.284 ± 0.415
ProArg: 2.452 ± 0.572
ProSer: 2.101 ± 0.468
ProThr: 2.685 ± 0.581
ProVal: 3.852 ± 0.699
ProTrp: 0.233 ± 0.165
ProTyr: 0.7 ± 0.258
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.502 ± 0.623
GlnCys: 0.117 ± 0.105
GlnAsp: 2.335 ± 0.49
GlnGlu: 1.985 ± 0.463
GlnPhe: 1.167 ± 0.407
GlnGly: 2.802 ± 0.602
GlnHis: 0.467 ± 0.206
GlnIle: 1.868 ± 0.527
GlnLys: 2.919 ± 0.631
GlnLeu: 4.203 ± 0.658
GlnMet: 1.284 ± 0.326
GlnAsn: 1.518 ± 0.405
GlnPro: 1.751 ± 0.414
GlnGln: 2.452 ± 0.615
GlnArg: 3.502 ± 0.867
GlnSer: 3.035 ± 0.63
GlnThr: 2.452 ± 0.553
GlnVal: 3.269 ± 0.764
GlnTrp: 0.35 ± 0.224
GlnTyr: 0.584 ± 0.241
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.67 ± 0.916
ArgCys: 0.817 ± 0.486
ArgAsp: 3.736 ± 0.553
ArgGlu: 4.086 ± 0.846
ArgPhe: 2.452 ± 0.465
ArgGly: 3.969 ± 0.603
ArgHis: 1.751 ± 0.467
ArgIle: 3.035 ± 0.658
ArgLys: 3.502 ± 0.591
ArgLeu: 6.654 ± 0.85
ArgMet: 2.568 ± 0.538
ArgAsn: 2.452 ± 0.564
ArgPro: 2.218 ± 0.476
ArgGln: 2.452 ± 0.468
ArgArg: 5.253 ± 0.961
ArgSer: 3.152 ± 0.542
ArgThr: 2.919 ± 0.458
ArgVal: 3.502 ± 0.583
ArgTrp: 1.634 ± 0.478
ArgTyr: 2.101 ± 0.628
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.837 ± 0.728
SerCys: 0.467 ± 0.215
SerAsp: 4.319 ± 0.699
SerGlu: 3.502 ± 0.718
SerPhe: 2.802 ± 0.656
SerGly: 6.654 ± 1.05
SerHis: 1.167 ± 0.347
SerIle: 2.452 ± 0.527
SerLys: 3.969 ± 0.739
SerLeu: 5.253 ± 1.051
SerMet: 1.634 ± 0.361
SerAsn: 2.335 ± 0.687
SerPro: 2.919 ± 0.689
SerGln: 2.919 ± 0.617
SerArg: 3.852 ± 0.737
SerSer: 3.269 ± 0.722
SerThr: 3.385 ± 0.704
SerVal: 5.137 ± 0.825
SerTrp: 1.051 ± 0.381
SerTyr: 1.051 ± 0.25
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.071 ± 0.782
ThrCys: 0.35 ± 0.29
ThrAsp: 3.619 ± 0.707
ThrGlu: 2.452 ± 0.641
ThrPhe: 1.518 ± 0.414
ThrGly: 4.67 ± 0.881
ThrHis: 0.7 ± 0.288
ThrIle: 2.802 ± 0.511
ThrLys: 2.568 ± 0.731
ThrLeu: 3.736 ± 0.657
ThrMet: 0.934 ± 0.33
ThrAsn: 1.167 ± 0.435
ThrPro: 3.269 ± 0.662
ThrGln: 2.218 ± 0.503
ThrArg: 3.502 ± 0.587
ThrSer: 4.436 ± 0.713
ThrThr: 3.736 ± 0.51
ThrVal: 4.553 ± 0.687
ThrTrp: 0.7 ± 0.28
ThrTyr: 1.051 ± 0.32
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.837 ± 0.844
ValCys: 0.817 ± 0.34
ValAsp: 3.736 ± 0.638
ValGlu: 4.553 ± 0.911
ValPhe: 1.868 ± 0.429
ValGly: 4.786 ± 0.807
ValHis: 0.817 ± 0.323
ValIle: 4.786 ± 0.834
ValLys: 3.969 ± 0.661
ValLeu: 5.02 ± 0.767
ValMet: 1.751 ± 0.44
ValAsn: 2.919 ± 0.548
ValPro: 2.568 ± 0.626
ValGln: 0.934 ± 0.29
ValArg: 3.502 ± 0.542
ValSer: 5.137 ± 0.717
ValThr: 3.619 ± 0.68
ValVal: 3.152 ± 0.723
ValTrp: 0.934 ± 0.311
ValTyr: 2.101 ± 0.496
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.167 ± 0.406
TrpCys: 0.584 ± 0.221
TrpAsp: 0.817 ± 0.332
TrpGlu: 0.7 ± 0.264
TrpPhe: 0.934 ± 0.392
TrpGly: 0.817 ± 0.265
TrpHis: 0.35 ± 0.171
TrpIle: 1.401 ± 0.401
TrpLys: 0.817 ± 0.263
TrpLeu: 1.634 ± 0.429
TrpMet: 0.467 ± 0.204
TrpAsn: 0.117 ± 0.132
TrpPro: 0.35 ± 0.177
TrpGln: 0.35 ± 0.196
TrpArg: 0.7 ± 0.27
TrpSer: 0.934 ± 0.342
TrpThr: 0.35 ± 0.214
TrpVal: 0.467 ± 0.292
TrpTrp: 0.117 ± 0.101
TrpTyr: 0.467 ± 0.24
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.218 ± 0.469
TyrCys: 0.35 ± 0.208
TyrAsp: 1.401 ± 0.397
TyrGlu: 0.934 ± 0.326
TyrPhe: 0.934 ± 0.388
TyrGly: 2.218 ± 0.477
TyrHis: 0.584 ± 0.258
TyrIle: 0.934 ± 0.322
TyrLys: 1.401 ± 0.367
TyrLeu: 2.568 ± 0.476
TyrMet: 0.117 ± 0.133
TyrAsn: 1.051 ± 0.362
TyrPro: 1.401 ± 0.51
TyrGln: 1.518 ± 0.465
TyrArg: 1.751 ± 0.484
TyrSer: 2.101 ± 0.525
TyrThr: 1.051 ± 0.31
TyrVal: 1.751 ± 0.41
TyrTrp: 0.467 ± 0.242
TyrTyr: 0.7 ± 0.216
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (8567 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
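A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions rather than the database's actual procedure: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the reported error is taken to be the sample standard deviation across resamples. The page does not say which spread measure is used, and the function names, the one-letter sequence encoding, and the fixed seed are all hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille, counted over
    overlapping windows within each protein."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling
    whole proteins with replacement (assumed: standard deviation)."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAA", "CCCC", "ACAC"]  # toy proteome, not real data
    print(pair_permille(toy, "AA"), "+/-", bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the proteome. A dipeptide that never occurs yields 0.0 in every resample, which is consistent with the 0.0 ± 0.0 entries in the Xaa rows and columns.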

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski