Amino acid dipepetide frequency for Klebsiella phage ST437-OXA245phi4.1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.344AlaAla: 9.344 ± 1.319
1.3AlaCys: 1.3 ± 0.335
6.581AlaAsp: 6.581 ± 0.81
6.175AlaGlu: 6.175 ± 0.781
3.575AlaPhe: 3.575 ± 0.471
8.044AlaGly: 8.044 ± 1.455
1.462AlaHis: 1.462 ± 0.241
4.306AlaIle: 4.306 ± 0.539
4.062AlaLys: 4.062 ± 0.522
9.912AlaLeu: 9.912 ± 1.14
2.112AlaMet: 2.112 ± 0.382
2.194AlaAsn: 2.194 ± 0.455
2.844AlaPro: 2.844 ± 0.528
2.925AlaGln: 2.925 ± 0.589
5.85AlaArg: 5.85 ± 0.747
6.094AlaSer: 6.094 ± 0.629
4.875AlaThr: 4.875 ± 0.619
6.744AlaVal: 6.744 ± 0.682
1.3AlaTrp: 1.3 ± 0.307
3.006AlaTyr: 3.006 ± 0.419
0.0AlaXaa: 0.0 ± 0.0
Cys
1.137CysAla: 1.137 ± 0.356
0.325CysCys: 0.325 ± 0.17
0.487CysAsp: 0.487 ± 0.21
0.406CysGlu: 0.406 ± 0.167
0.244CysPhe: 0.244 ± 0.166
0.487CysGly: 0.487 ± 0.231
0.081CysHis: 0.081 ± 0.077
0.569CysIle: 0.569 ± 0.285
0.569CysLys: 0.569 ± 0.233
1.219CysLeu: 1.219 ± 0.375
0.325CysMet: 0.325 ± 0.154
0.487CysAsn: 0.487 ± 0.203
0.65CysPro: 0.65 ± 0.253
0.406CysGln: 0.406 ± 0.189
0.65CysArg: 0.65 ± 0.214
0.569CysSer: 0.569 ± 0.248
0.487CysThr: 0.487 ± 0.187
1.219CysVal: 1.219 ± 0.273
0.325CysTrp: 0.325 ± 0.148
0.406CysTyr: 0.406 ± 0.166
0.0CysXaa: 0.0 ± 0.0
Asp
5.931AspAla: 5.931 ± 0.755
0.731AspCys: 0.731 ± 0.232
2.844AspAsp: 2.844 ± 0.465
3.656AspGlu: 3.656 ± 0.49
2.6AspPhe: 2.6 ± 0.401
5.037AspGly: 5.037 ± 0.565
0.569AspHis: 0.569 ± 0.208
3.006AspIle: 3.006 ± 0.574
3.737AspLys: 3.737 ± 0.696
5.281AspLeu: 5.281 ± 0.822
0.975AspMet: 0.975 ± 0.239
2.275AspAsn: 2.275 ± 0.481
2.844AspPro: 2.844 ± 0.48
2.437AspGln: 2.437 ± 0.43
2.762AspArg: 2.762 ± 0.431
4.062AspSer: 4.062 ± 0.594
2.6AspThr: 2.6 ± 0.555
3.737AspVal: 3.737 ± 0.447
0.812AspTrp: 0.812 ± 0.208
2.194AspTyr: 2.194 ± 0.485
0.0AspXaa: 0.0 ± 0.0
Glu
5.606GluAla: 5.606 ± 0.7
0.65GluCys: 0.65 ± 0.28
3.087GluAsp: 3.087 ± 0.656
2.681GluGlu: 2.681 ± 0.58
2.275GluPhe: 2.275 ± 0.406
3.331GluGly: 3.331 ± 0.598
1.056GluHis: 1.056 ± 0.24
3.656GluIle: 3.656 ± 0.502
3.006GluLys: 3.006 ± 0.493
6.987GluLeu: 6.987 ± 1.013
2.844GluMet: 2.844 ± 0.445
3.737GluAsn: 3.737 ± 0.481
2.112GluPro: 2.112 ± 0.47
3.9GluGln: 3.9 ± 0.663
3.575GluArg: 3.575 ± 0.509
3.331GluSer: 3.331 ± 0.553
3.575GluThr: 3.575 ± 0.603
4.225GluVal: 4.225 ± 0.501
1.137GluTrp: 1.137 ± 0.31
1.95GluTyr: 1.95 ± 0.415
0.0GluXaa: 0.0 ± 0.0
Phe
3.494PheAla: 3.494 ± 0.484
0.244PheCys: 0.244 ± 0.168
2.112PheAsp: 2.112 ± 0.437
2.194PheGlu: 2.194 ± 0.37
1.869PhePhe: 1.869 ± 0.487
1.869PheGly: 1.869 ± 0.361
0.65PheHis: 0.65 ± 0.246
2.762PheIle: 2.762 ± 0.543
1.381PheLys: 1.381 ± 0.261
2.681PheLeu: 2.681 ± 0.558
0.894PheMet: 0.894 ± 0.251
1.706PheAsn: 1.706 ± 0.516
0.894PhePro: 0.894 ± 0.291
1.544PheGln: 1.544 ± 0.38
2.275PheArg: 2.275 ± 0.473
3.006PheSer: 3.006 ± 0.358
2.6PheThr: 2.6 ± 0.507
1.544PheVal: 1.544 ± 0.354
0.975PheTrp: 0.975 ± 0.319
0.894PheTyr: 0.894 ± 0.282
0.0PheXaa: 0.0 ± 0.0
Gly
4.306GlyAla: 4.306 ± 0.747
0.569GlyCys: 0.569 ± 0.199
3.819GlyAsp: 3.819 ± 0.436
4.712GlyGlu: 4.712 ± 0.661
2.681GlyPhe: 2.681 ± 0.432
5.687GlyGly: 5.687 ± 1.178
1.462GlyHis: 1.462 ± 0.435
3.656GlyIle: 3.656 ± 0.668
4.469GlyLys: 4.469 ± 0.646
5.606GlyLeu: 5.606 ± 0.601
2.194GlyMet: 2.194 ± 0.412
2.519GlyAsn: 2.519 ± 0.481
2.112GlyPro: 2.112 ± 0.457
2.112GlyGln: 2.112 ± 0.482
3.9GlyArg: 3.9 ± 0.48
3.494GlySer: 3.494 ± 0.477
3.981GlyThr: 3.981 ± 0.684
5.525GlyVal: 5.525 ± 0.748
1.056GlyTrp: 1.056 ± 0.289
1.706GlyTyr: 1.706 ± 0.327
0.0GlyXaa: 0.0 ± 0.0
His
1.3HisAla: 1.3 ± 0.413
0.406HisCys: 0.406 ± 0.2
1.137HisAsp: 1.137 ± 0.316
1.381HisGlu: 1.381 ± 0.403
0.487HisPhe: 0.487 ± 0.234
1.381HisGly: 1.381 ± 0.354
1.137HisHis: 1.137 ± 0.32
0.975HisIle: 0.975 ± 0.254
1.3HisLys: 1.3 ± 0.251
1.95HisLeu: 1.95 ± 0.417
0.731HisMet: 0.731 ± 0.235
0.406HisAsn: 0.406 ± 0.202
1.137HisPro: 1.137 ± 0.383
0.569HisGln: 0.569 ± 0.178
1.3HisArg: 1.3 ± 0.326
1.462HisSer: 1.462 ± 0.409
1.219HisThr: 1.219 ± 0.285
0.812HisVal: 0.812 ± 0.255
0.569HisTrp: 0.569 ± 0.178
1.137HisTyr: 1.137 ± 0.298
0.0HisXaa: 0.0 ± 0.0
Ile
4.306IleAla: 4.306 ± 0.567
0.894IleCys: 0.894 ± 0.251
4.387IleAsp: 4.387 ± 0.549
3.575IleGlu: 3.575 ± 0.591
1.706IlePhe: 1.706 ± 0.495
3.575IleGly: 3.575 ± 0.555
1.3IleHis: 1.3 ± 0.322
2.681IleIle: 2.681 ± 0.465
3.006IleLys: 3.006 ± 0.624
3.494IleLeu: 3.494 ± 0.643
2.031IleMet: 2.031 ± 0.431
3.494IleAsn: 3.494 ± 0.535
2.275IlePro: 2.275 ± 0.624
1.3IleGln: 1.3 ± 0.331
3.737IleArg: 3.737 ± 0.547
4.387IleSer: 4.387 ± 0.644
4.225IleThr: 4.225 ± 0.553
2.762IleVal: 2.762 ± 0.493
0.65IleTrp: 0.65 ± 0.17
1.787IleTyr: 1.787 ± 0.291
0.0IleXaa: 0.0 ± 0.0
Lys
3.737LysAla: 3.737 ± 0.581
0.406LysCys: 0.406 ± 0.171
2.437LysAsp: 2.437 ± 0.47
3.25LysGlu: 3.25 ± 0.52
1.869LysPhe: 1.869 ± 0.418
3.169LysGly: 3.169 ± 0.437
1.787LysHis: 1.787 ± 0.422
3.412LysIle: 3.412 ± 0.709
4.794LysLys: 4.794 ± 0.835
4.794LysLeu: 4.794 ± 0.807
1.544LysMet: 1.544 ± 0.342
2.681LysAsn: 2.681 ± 0.419
1.869LysPro: 1.869 ± 0.391
2.356LysGln: 2.356 ± 0.528
4.144LysArg: 4.144 ± 0.592
3.006LysSer: 3.006 ± 0.542
3.737LysThr: 3.737 ± 0.586
2.275LysVal: 2.275 ± 0.464
1.381LysTrp: 1.381 ± 0.375
1.625LysTyr: 1.625 ± 0.339
0.0LysXaa: 0.0 ± 0.0
Leu
9.587LeuAla: 9.587 ± 0.826
1.056LeuCys: 1.056 ± 0.274
5.85LeuAsp: 5.85 ± 0.711
5.2LeuGlu: 5.2 ± 0.641
3.737LeuPhe: 3.737 ± 0.553
5.119LeuGly: 5.119 ± 0.762
1.787LeuHis: 1.787 ± 0.396
4.469LeuIle: 4.469 ± 0.526
5.281LeuLys: 5.281 ± 0.694
8.206LeuLeu: 8.206 ± 0.888
3.006LeuMet: 3.006 ± 0.548
5.281LeuAsn: 5.281 ± 0.666
3.819LeuPro: 3.819 ± 0.546
5.362LeuGln: 5.362 ± 0.607
6.094LeuArg: 6.094 ± 0.92
7.556LeuSer: 7.556 ± 0.893
5.362LeuThr: 5.362 ± 0.667
4.469LeuVal: 4.469 ± 0.753
0.812LeuTrp: 0.812 ± 0.212
3.331LeuTyr: 3.331 ± 0.516
0.0LeuXaa: 0.0 ± 0.0
Met
3.981MetAla: 3.981 ± 0.55
0.325MetCys: 0.325 ± 0.158
1.137MetAsp: 1.137 ± 0.27
1.056MetGlu: 1.056 ± 0.262
1.056MetPhe: 1.056 ± 0.202
0.975MetGly: 0.975 ± 0.345
0.487MetHis: 0.487 ± 0.181
1.219MetIle: 1.219 ± 0.302
1.706MetLys: 1.706 ± 0.329
1.95MetLeu: 1.95 ± 0.501
1.544MetMet: 1.544 ± 0.504
1.462MetAsn: 1.462 ± 0.304
1.462MetPro: 1.462 ± 0.389
1.219MetGln: 1.219 ± 0.375
1.544MetArg: 1.544 ± 0.354
2.031MetSer: 2.031 ± 0.368
2.437MetThr: 2.437 ± 0.427
1.3MetVal: 1.3 ± 0.286
0.244MetTrp: 0.244 ± 0.138
0.569MetTyr: 0.569 ± 0.208
0.0MetXaa: 0.0 ± 0.0
Asn
3.331AsnAla: 3.331 ± 0.48
0.406AsnCys: 0.406 ± 0.188
1.787AsnAsp: 1.787 ± 0.374
3.575AsnGlu: 3.575 ± 0.621
1.137AsnPhe: 1.137 ± 0.369
3.006AsnGly: 3.006 ± 0.505
0.569AsnHis: 0.569 ± 0.281
3.331AsnIle: 3.331 ± 0.477
2.194AsnLys: 2.194 ± 0.496
4.469AsnLeu: 4.469 ± 0.643
0.65AsnMet: 0.65 ± 0.201
1.869AsnAsn: 1.869 ± 0.371
3.331AsnPro: 3.331 ± 0.552
1.787AsnGln: 1.787 ± 0.456
2.925AsnArg: 2.925 ± 0.536
2.356AsnSer: 2.356 ± 0.373
1.3AsnThr: 1.3 ± 0.253
3.087AsnVal: 3.087 ± 0.509
0.894AsnTrp: 0.894 ± 0.283
0.894AsnTyr: 0.894 ± 0.358
0.0AsnXaa: 0.0 ± 0.0
Pro
4.225ProAla: 4.225 ± 0.742
0.406ProCys: 0.406 ± 0.17
3.737ProAsp: 3.737 ± 0.562
3.169ProGlu: 3.169 ± 0.43
0.975ProPhe: 0.975 ± 0.341
2.925ProGly: 2.925 ± 0.474
1.219ProHis: 1.219 ± 0.319
1.706ProIle: 1.706 ± 0.366
2.031ProLys: 2.031 ± 0.37
3.331ProLeu: 3.331 ± 0.411
0.487ProMet: 0.487 ± 0.213
1.787ProAsn: 1.787 ± 0.448
2.437ProPro: 2.437 ± 0.473
1.787ProGln: 1.787 ± 0.379
2.275ProArg: 2.275 ± 0.387
2.519ProSer: 2.519 ± 0.416
1.706ProThr: 1.706 ± 0.348
3.169ProVal: 3.169 ± 0.54
0.487ProTrp: 0.487 ± 0.217
1.544ProTyr: 1.544 ± 0.46
0.0ProXaa: 0.0 ± 0.0
Gln
5.119GlnAla: 5.119 ± 0.665
0.65GlnCys: 0.65 ± 0.2
2.194GlnAsp: 2.194 ± 0.422
2.194GlnGlu: 2.194 ± 0.438
0.894GlnPhe: 0.894 ± 0.202
3.25GlnGly: 3.25 ± 0.494
0.406GlnHis: 0.406 ± 0.176
2.031GlnIle: 2.031 ± 0.405
1.706GlnLys: 1.706 ± 0.382
5.119GlnLeu: 5.119 ± 0.562
1.462GlnMet: 1.462 ± 0.353
1.462GlnAsn: 1.462 ± 0.349
1.869GlnPro: 1.869 ± 0.383
2.194GlnGln: 2.194 ± 0.48
3.412GlnArg: 3.412 ± 0.594
2.6GlnSer: 2.6 ± 0.417
2.275GlnThr: 2.275 ± 0.463
2.194GlnVal: 2.194 ± 0.385
0.406GlnTrp: 0.406 ± 0.168
0.894GlnTyr: 0.894 ± 0.261
0.0GlnXaa: 0.0 ± 0.0
Arg
6.175ArgAla: 6.175 ± 0.656
0.487ArgCys: 0.487 ± 0.245
3.087ArgAsp: 3.087 ± 0.477
4.875ArgGlu: 4.875 ± 0.696
2.356ArgPhe: 2.356 ± 0.546
3.331ArgGly: 3.331 ± 0.532
1.869ArgHis: 1.869 ± 0.405
4.144ArgIle: 4.144 ± 0.599
3.575ArgLys: 3.575 ± 0.804
6.256ArgLeu: 6.256 ± 0.878
1.3ArgMet: 1.3 ± 0.274
2.844ArgAsn: 2.844 ± 0.504
1.787ArgPro: 1.787 ± 0.447
3.819ArgGln: 3.819 ± 0.535
5.2ArgArg: 5.2 ± 0.732
2.681ArgSer: 2.681 ± 0.54
3.331ArgThr: 3.331 ± 0.494
4.387ArgVal: 4.387 ± 0.679
1.056ArgTrp: 1.056 ± 0.282
1.869ArgTyr: 1.869 ± 0.342
0.0ArgXaa: 0.0 ± 0.0
Ser
6.419SerAla: 6.419 ± 0.65
0.569SerCys: 0.569 ± 0.182
3.25SerAsp: 3.25 ± 0.427
3.494SerGlu: 3.494 ± 0.552
2.6SerPhe: 2.6 ± 0.405
4.712SerGly: 4.712 ± 0.621
1.381SerHis: 1.381 ± 0.397
2.925SerIle: 2.925 ± 0.564
3.25SerLys: 3.25 ± 0.619
7.394SerLeu: 7.394 ± 0.892
1.869SerMet: 1.869 ± 0.361
2.112SerAsn: 2.112 ± 0.399
2.844SerPro: 2.844 ± 0.393
2.681SerGln: 2.681 ± 0.452
3.412SerArg: 3.412 ± 0.595
3.25SerSer: 3.25 ± 0.486
3.087SerThr: 3.087 ± 0.566
4.062SerVal: 4.062 ± 0.609
1.056SerTrp: 1.056 ± 0.284
1.706SerTyr: 1.706 ± 0.351
0.0SerXaa: 0.0 ± 0.0
Thr
5.769ThrAla: 5.769 ± 0.647
0.325ThrCys: 0.325 ± 0.164
3.737ThrAsp: 3.737 ± 0.681
2.925ThrGlu: 2.925 ± 0.59
2.275ThrPhe: 2.275 ± 0.605
4.794ThrGly: 4.794 ± 0.717
1.544ThrHis: 1.544 ± 0.309
2.681ThrIle: 2.681 ± 0.522
2.437ThrLys: 2.437 ± 0.525
6.012ThrLeu: 6.012 ± 0.554
0.812ThrMet: 0.812 ± 0.264
1.787ThrAsn: 1.787 ± 0.443
3.006ThrPro: 3.006 ± 0.476
1.381ThrGln: 1.381 ± 0.271
3.331ThrArg: 3.331 ± 0.508
3.737ThrSer: 3.737 ± 0.545
4.794ThrThr: 4.794 ± 0.788
4.306ThrVal: 4.306 ± 0.597
0.569ThrTrp: 0.569 ± 0.229
2.112ThrTyr: 2.112 ± 0.5
0.0ThrXaa: 0.0 ± 0.0
Val
5.362ValAla: 5.362 ± 0.783
0.325ValCys: 0.325 ± 0.163
3.656ValAsp: 3.656 ± 0.565
4.631ValGlu: 4.631 ± 0.624
1.625ValPhe: 1.625 ± 0.259
2.437ValGly: 2.437 ± 0.548
1.056ValHis: 1.056 ± 0.284
4.469ValIle: 4.469 ± 0.539
4.144ValLys: 4.144 ± 0.611
5.687ValLeu: 5.687 ± 0.607
2.031ValMet: 2.031 ± 0.411
2.925ValAsn: 2.925 ± 0.605
2.519ValPro: 2.519 ± 0.454
2.437ValGln: 2.437 ± 0.473
4.306ValArg: 4.306 ± 0.585
3.9ValSer: 3.9 ± 0.529
4.225ValThr: 4.225 ± 0.589
3.656ValVal: 3.656 ± 0.503
0.894ValTrp: 0.894 ± 0.314
1.787ValTyr: 1.787 ± 0.516
0.0ValXaa: 0.0 ± 0.0
Trp
1.381TrpAla: 1.381 ± 0.38
0.244TrpCys: 0.244 ± 0.148
0.812TrpAsp: 0.812 ± 0.238
1.625TrpGlu: 1.625 ± 0.435
0.325TrpPhe: 0.325 ± 0.148
0.325TrpGly: 0.325 ± 0.184
0.487TrpHis: 0.487 ± 0.163
1.3TrpIle: 1.3 ± 0.359
0.731TrpLys: 0.731 ± 0.223
2.112TrpLeu: 2.112 ± 0.47
0.406TrpMet: 0.406 ± 0.186
0.162TrpAsn: 0.162 ± 0.124
0.65TrpPro: 0.65 ± 0.269
0.569TrpGln: 0.569 ± 0.174
1.3TrpArg: 1.3 ± 0.305
0.731TrpSer: 0.731 ± 0.249
0.569TrpThr: 0.569 ± 0.203
0.65TrpVal: 0.65 ± 0.249
0.406TrpTrp: 0.406 ± 0.202
0.569TrpTyr: 0.569 ± 0.273
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.194TyrAla: 2.194 ± 0.417
0.731TyrCys: 0.731 ± 0.292
1.95TyrAsp: 1.95 ± 0.413
1.95TyrGlu: 1.95 ± 0.415
1.3TyrPhe: 1.3 ± 0.475
2.112TyrGly: 2.112 ± 0.401
0.487TyrHis: 0.487 ± 0.167
2.356TyrIle: 2.356 ± 0.577
0.894TyrLys: 0.894 ± 0.218
2.925TyrLeu: 2.925 ± 0.544
0.406TyrMet: 0.406 ± 0.183
1.706TyrAsn: 1.706 ± 0.47
1.462TyrPro: 1.462 ± 0.38
1.544TyrGln: 1.544 ± 0.369
2.437TyrArg: 2.437 ± 0.396
1.3TyrSer: 1.3 ± 0.284
1.95TyrThr: 1.95 ± 0.478
1.95TyrVal: 1.95 ± 0.346
0.325TyrTrp: 0.325 ± 0.159
1.137TyrTyr: 1.137 ± 0.286
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12309 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski