Amino acid dipepetide frequency for Klebsiella phage 2 LV-2017

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.289AlaAla: 15.289 ± 2.797
1.365AlaCys: 1.365 ± 0.406
7.281AlaAsp: 7.281 ± 0.731
7.827AlaGlu: 7.827 ± 0.96
2.73AlaPhe: 2.73 ± 0.433
8.555AlaGly: 8.555 ± 0.975
1.729AlaHis: 1.729 ± 0.781
5.552AlaIle: 5.552 ± 0.741
5.278AlaLys: 5.278 ± 0.646
8.464AlaLeu: 8.464 ± 0.966
2.73AlaMet: 2.73 ± 0.477
4.55AlaAsn: 4.55 ± 0.773
2.366AlaPro: 2.366 ± 0.412
6.462AlaGln: 6.462 ± 1.414
5.825AlaArg: 5.825 ± 0.729
6.917AlaSer: 6.917 ± 0.788
6.007AlaThr: 6.007 ± 0.794
6.462AlaVal: 6.462 ± 0.869
2.002AlaTrp: 2.002 ± 0.424
2.457AlaTyr: 2.457 ± 0.435
0.0AlaXaa: 0.0 ± 0.0
Cys
1.456CysAla: 1.456 ± 0.437
0.182CysCys: 0.182 ± 0.145
1.001CysAsp: 1.001 ± 0.368
0.637CysGlu: 0.637 ± 0.287
0.364CysPhe: 0.364 ± 0.186
1.001CysGly: 1.001 ± 0.285
0.364CysHis: 0.364 ± 0.217
0.455CysIle: 0.455 ± 0.228
0.819CysLys: 0.819 ± 0.31
0.637CysLeu: 0.637 ± 0.243
0.364CysMet: 0.364 ± 0.232
0.364CysAsn: 0.364 ± 0.168
0.546CysPro: 0.546 ± 0.182
0.637CysGln: 0.637 ± 0.25
1.001CysArg: 1.001 ± 0.239
0.819CysSer: 0.819 ± 0.286
0.637CysThr: 0.637 ± 0.268
0.455CysVal: 0.455 ± 0.241
0.728CysTrp: 0.728 ± 0.253
0.364CysTyr: 0.364 ± 0.179
0.0CysXaa: 0.0 ± 0.0
Asp
5.734AspAla: 5.734 ± 0.67
1.092AspCys: 1.092 ± 0.383
4.55AspAsp: 4.55 ± 1.039
4.095AspGlu: 4.095 ± 0.719
2.457AspPhe: 2.457 ± 0.548
5.552AspGly: 5.552 ± 0.86
0.91AspHis: 0.91 ± 0.298
2.548AspIle: 2.548 ± 0.41
2.366AspLys: 2.366 ± 0.479
4.55AspLeu: 4.55 ± 0.453
1.456AspMet: 1.456 ± 0.315
2.821AspAsn: 2.821 ± 0.598
2.366AspPro: 2.366 ± 0.476
1.365AspGln: 1.365 ± 0.355
3.185AspArg: 3.185 ± 0.459
3.094AspSer: 3.094 ± 0.627
2.639AspThr: 2.639 ± 0.575
4.368AspVal: 4.368 ± 0.665
1.456AspTrp: 1.456 ± 0.407
2.821AspTyr: 2.821 ± 0.525
0.0AspXaa: 0.0 ± 0.0
Glu
7.008GluAla: 7.008 ± 0.854
0.728GluCys: 0.728 ± 0.289
3.549GluAsp: 3.549 ± 0.727
3.731GluGlu: 3.731 ± 0.783
3.276GluPhe: 3.276 ± 0.508
3.458GluGly: 3.458 ± 0.709
1.274GluHis: 1.274 ± 0.387
3.731GluIle: 3.731 ± 0.485
3.822GluLys: 3.822 ± 0.637
5.916GluLeu: 5.916 ± 0.789
2.275GluMet: 2.275 ± 0.396
2.821GluAsn: 2.821 ± 0.516
2.639GluPro: 2.639 ± 0.556
4.459GluGln: 4.459 ± 0.753
3.913GluArg: 3.913 ± 0.666
3.822GluSer: 3.822 ± 0.581
2.639GluThr: 2.639 ± 0.618
3.913GluVal: 3.913 ± 0.621
1.729GluTrp: 1.729 ± 0.373
2.184GluTyr: 2.184 ± 0.412
0.0GluXaa: 0.0 ± 0.0
Phe
2.821PheAla: 2.821 ± 0.472
0.455PheCys: 0.455 ± 0.176
2.73PheAsp: 2.73 ± 0.439
1.547PheGlu: 1.547 ± 0.366
1.092PhePhe: 1.092 ± 0.337
2.912PheGly: 2.912 ± 0.507
0.364PheHis: 0.364 ± 0.167
1.82PheIle: 1.82 ± 0.466
1.638PheLys: 1.638 ± 0.37
1.82PheLeu: 1.82 ± 0.405
1.183PheMet: 1.183 ± 0.281
2.366PheAsn: 2.366 ± 0.479
1.001PhePro: 1.001 ± 0.363
0.91PheGln: 0.91 ± 0.246
2.093PheArg: 2.093 ± 0.463
2.366PheSer: 2.366 ± 0.413
2.366PheThr: 2.366 ± 0.559
2.457PheVal: 2.457 ± 0.358
0.637PheTrp: 0.637 ± 0.257
1.001PheTyr: 1.001 ± 0.295
0.0PheXaa: 0.0 ± 0.0
Gly
7.099GlyAla: 7.099 ± 0.924
1.274GlyCys: 1.274 ± 0.388
4.459GlyAsp: 4.459 ± 0.707
6.098GlyGlu: 6.098 ± 0.61
2.73GlyPhe: 2.73 ± 0.584
4.55GlyGly: 4.55 ± 0.797
1.365GlyHis: 1.365 ± 0.385
4.368GlyIle: 4.368 ± 0.47
5.096GlyLys: 5.096 ± 0.784
6.007GlyLeu: 6.007 ± 0.725
2.457GlyMet: 2.457 ± 0.435
3.64GlyAsn: 3.64 ± 0.725
1.001GlyPro: 1.001 ± 0.305
3.094GlyGln: 3.094 ± 0.525
4.641GlyArg: 4.641 ± 0.747
3.458GlySer: 3.458 ± 0.442
3.458GlyThr: 3.458 ± 0.457
4.004GlyVal: 4.004 ± 0.59
1.274GlyTrp: 1.274 ± 0.431
2.457GlyTyr: 2.457 ± 0.473
0.0GlyXaa: 0.0 ± 0.0
His
1.729HisAla: 1.729 ± 0.474
0.273HisCys: 0.273 ± 0.15
1.092HisAsp: 1.092 ± 0.305
1.092HisGlu: 1.092 ± 0.583
0.546HisPhe: 0.546 ± 0.219
1.274HisGly: 1.274 ± 0.376
0.637HisHis: 0.637 ± 0.236
1.001HisIle: 1.001 ± 0.369
0.455HisLys: 0.455 ± 0.174
1.092HisLeu: 1.092 ± 0.404
0.182HisMet: 0.182 ± 0.128
1.092HisAsn: 1.092 ± 0.313
0.819HisPro: 0.819 ± 0.26
0.728HisGln: 0.728 ± 0.321
0.819HisArg: 0.819 ± 0.224
0.819HisSer: 0.819 ± 0.3
0.91HisThr: 0.91 ± 0.313
1.547HisVal: 1.547 ± 0.342
0.364HisTrp: 0.364 ± 0.228
0.273HisTyr: 0.273 ± 0.141
0.0HisXaa: 0.0 ± 0.0
Ile
5.734IleAla: 5.734 ± 0.714
0.546IleCys: 0.546 ± 0.28
3.003IleAsp: 3.003 ± 0.486
3.276IleGlu: 3.276 ± 0.546
1.547IlePhe: 1.547 ± 0.331
4.004IleGly: 4.004 ± 0.566
1.183IleHis: 1.183 ± 0.293
2.366IleIle: 2.366 ± 0.345
2.366IleLys: 2.366 ± 0.421
2.821IleLeu: 2.821 ± 0.442
0.637IleMet: 0.637 ± 0.246
1.82IleAsn: 1.82 ± 0.459
2.366IlePro: 2.366 ± 0.452
1.365IleGln: 1.365 ± 0.358
3.094IleArg: 3.094 ± 0.658
4.55IleSer: 4.55 ± 0.607
4.186IleThr: 4.186 ± 0.525
3.003IleVal: 3.003 ± 0.512
1.001IleTrp: 1.001 ± 0.252
1.911IleTyr: 1.911 ± 0.367
0.0IleXaa: 0.0 ± 0.0
Lys
6.189LysAla: 6.189 ± 0.854
0.637LysCys: 0.637 ± 0.252
3.276LysAsp: 3.276 ± 0.552
2.912LysGlu: 2.912 ± 0.535
2.093LysPhe: 2.093 ± 0.401
3.64LysGly: 3.64 ± 0.482
1.001LysHis: 1.001 ± 0.33
1.547LysIle: 1.547 ± 0.359
3.549LysLys: 3.549 ± 0.568
4.459LysLeu: 4.459 ± 0.569
1.092LysMet: 1.092 ± 0.419
1.82LysAsn: 1.82 ± 0.509
2.639LysPro: 2.639 ± 0.51
3.094LysGln: 3.094 ± 0.548
3.458LysArg: 3.458 ± 0.592
3.094LysSer: 3.094 ± 0.577
2.639LysThr: 2.639 ± 0.388
4.368LysVal: 4.368 ± 0.815
1.274LysTrp: 1.274 ± 0.312
1.365LysTyr: 1.365 ± 0.359
0.0LysXaa: 0.0 ± 0.0
Leu
8.646LeuAla: 8.646 ± 1.276
1.274LeuCys: 1.274 ± 0.329
4.095LeuAsp: 4.095 ± 0.673
5.005LeuGlu: 5.005 ± 0.718
1.911LeuPhe: 1.911 ± 0.385
4.732LeuGly: 4.732 ± 0.624
1.183LeuHis: 1.183 ± 0.353
4.368LeuIle: 4.368 ± 0.614
4.004LeuLys: 4.004 ± 0.694
6.189LeuLeu: 6.189 ± 0.815
1.456LeuMet: 1.456 ± 0.32
3.367LeuAsn: 3.367 ± 0.43
2.912LeuPro: 2.912 ± 0.502
2.457LeuGln: 2.457 ± 0.471
5.552LeuArg: 5.552 ± 0.795
5.643LeuSer: 5.643 ± 1.008
3.913LeuThr: 3.913 ± 0.691
3.64LeuVal: 3.64 ± 0.682
0.819LeuTrp: 0.819 ± 0.3
3.003LeuTyr: 3.003 ± 0.439
0.0LeuXaa: 0.0 ± 0.0
Met
2.639MetAla: 2.639 ± 0.592
0.091MetCys: 0.091 ± 0.096
0.819MetAsp: 0.819 ± 0.247
1.456MetGlu: 1.456 ± 0.371
0.819MetPhe: 0.819 ± 0.244
1.638MetGly: 1.638 ± 0.55
0.455MetHis: 0.455 ± 0.191
0.819MetIle: 0.819 ± 0.277
1.82MetLys: 1.82 ± 0.43
1.274MetLeu: 1.274 ± 0.404
0.91MetMet: 0.91 ± 0.326
1.456MetAsn: 1.456 ± 0.451
1.638MetPro: 1.638 ± 0.586
1.001MetGln: 1.001 ± 0.218
1.274MetArg: 1.274 ± 0.376
3.094MetSer: 3.094 ± 0.458
1.729MetThr: 1.729 ± 0.45
1.911MetVal: 1.911 ± 0.374
0.637MetTrp: 0.637 ± 0.198
0.819MetTyr: 0.819 ± 0.3
0.0MetXaa: 0.0 ± 0.0
Asn
5.278AsnAla: 5.278 ± 0.876
0.182AsnCys: 0.182 ± 0.125
2.366AsnAsp: 2.366 ± 0.54
2.548AsnGlu: 2.548 ± 0.49
1.274AsnPhe: 1.274 ± 0.32
4.095AsnGly: 4.095 ± 0.495
0.637AsnHis: 0.637 ± 0.21
2.548AsnIle: 2.548 ± 0.458
2.912AsnLys: 2.912 ± 0.401
3.185AsnLeu: 3.185 ± 0.524
1.638AsnMet: 1.638 ± 0.409
1.911AsnAsn: 1.911 ± 0.461
2.457AsnPro: 2.457 ± 0.523
2.457AsnGln: 2.457 ± 0.533
3.185AsnArg: 3.185 ± 0.466
2.73AsnSer: 2.73 ± 0.545
1.638AsnThr: 1.638 ± 0.444
2.639AsnVal: 2.639 ± 0.496
0.455AsnTrp: 0.455 ± 0.19
1.82AsnTyr: 1.82 ± 0.288
0.0AsnXaa: 0.0 ± 0.0
Pro
3.731ProAla: 3.731 ± 0.679
0.364ProCys: 0.364 ± 0.184
3.185ProAsp: 3.185 ± 0.54
3.276ProGlu: 3.276 ± 0.564
1.365ProPhe: 1.365 ± 0.415
3.458ProGly: 3.458 ± 0.622
0.728ProHis: 0.728 ± 0.265
1.82ProIle: 1.82 ± 0.497
1.82ProLys: 1.82 ± 0.35
3.003ProLeu: 3.003 ± 0.68
0.637ProMet: 0.637 ± 0.201
1.547ProAsn: 1.547 ± 0.407
1.456ProPro: 1.456 ± 0.39
1.183ProGln: 1.183 ± 0.388
1.092ProArg: 1.092 ± 0.31
1.911ProSer: 1.911 ± 0.478
2.912ProThr: 2.912 ± 0.629
3.276ProVal: 3.276 ± 0.54
0.728ProTrp: 0.728 ± 0.32
0.546ProTyr: 0.546 ± 0.18
0.0ProXaa: 0.0 ± 0.0
Gln
4.914GlnAla: 4.914 ± 0.831
1.001GlnCys: 1.001 ± 0.299
1.911GlnAsp: 1.911 ± 0.435
2.548GlnGlu: 2.548 ± 0.564
1.638GlnPhe: 1.638 ± 0.36
2.184GlnGly: 2.184 ± 0.578
0.819GlnHis: 0.819 ± 0.308
3.094GlnIle: 3.094 ± 0.53
2.73GlnLys: 2.73 ± 0.542
3.822GlnLeu: 3.822 ± 0.497
1.274GlnMet: 1.274 ± 0.326
1.547GlnAsn: 1.547 ± 0.478
1.729GlnPro: 1.729 ± 0.427
3.731GlnGln: 3.731 ± 1.215
2.366GlnArg: 2.366 ± 0.546
2.093GlnSer: 2.093 ± 0.533
2.73GlnThr: 2.73 ± 0.539
3.367GlnVal: 3.367 ± 0.677
1.547GlnTrp: 1.547 ± 0.314
1.365GlnTyr: 1.365 ± 0.424
0.0GlnXaa: 0.0 ± 0.0
Arg
6.735ArgAla: 6.735 ± 0.727
0.546ArgCys: 0.546 ± 0.227
3.367ArgAsp: 3.367 ± 0.561
5.005ArgGlu: 5.005 ± 0.803
1.365ArgPhe: 1.365 ± 0.438
4.186ArgGly: 4.186 ± 0.512
0.91ArgHis: 0.91 ± 0.328
3.276ArgIle: 3.276 ± 0.576
4.368ArgLys: 4.368 ± 0.726
5.825ArgLeu: 5.825 ± 0.596
1.638ArgMet: 1.638 ± 0.404
2.912ArgAsn: 2.912 ± 0.473
2.821ArgPro: 2.821 ± 0.549
2.912ArgGln: 2.912 ± 0.475
4.459ArgArg: 4.459 ± 0.865
2.639ArgSer: 2.639 ± 0.461
2.548ArgThr: 2.548 ± 0.522
3.003ArgVal: 3.003 ± 0.454
0.91ArgTrp: 0.91 ± 0.324
2.002ArgTyr: 2.002 ± 0.381
0.0ArgXaa: 0.0 ± 0.0
Ser
6.371SerAla: 6.371 ± 0.787
0.819SerCys: 0.819 ± 0.301
3.276SerAsp: 3.276 ± 0.414
4.914SerGlu: 4.914 ± 0.639
1.82SerPhe: 1.82 ± 0.374
5.552SerGly: 5.552 ± 0.702
0.819SerHis: 0.819 ± 0.252
2.002SerIle: 2.002 ± 0.389
2.821SerLys: 2.821 ± 0.484
4.641SerLeu: 4.641 ± 0.878
2.184SerMet: 2.184 ± 0.437
3.003SerAsn: 3.003 ± 0.643
1.82SerPro: 1.82 ± 0.36
2.548SerGln: 2.548 ± 0.589
3.64SerArg: 3.64 ± 0.575
3.367SerSer: 3.367 ± 0.522
2.821SerThr: 2.821 ± 0.575
4.914SerVal: 4.914 ± 0.762
1.456SerTrp: 1.456 ± 0.389
1.183SerTyr: 1.183 ± 0.303
0.0SerXaa: 0.0 ± 0.0
Thr
6.644ThrAla: 6.644 ± 0.747
0.637ThrCys: 0.637 ± 0.278
3.185ThrAsp: 3.185 ± 0.601
3.458ThrGlu: 3.458 ± 0.567
2.002ThrPhe: 2.002 ± 0.513
4.641ThrGly: 4.641 ± 0.573
0.455ThrHis: 0.455 ± 0.202
3.367ThrIle: 3.367 ± 0.715
2.093ThrLys: 2.093 ± 0.376
3.822ThrLeu: 3.822 ± 0.676
1.092ThrMet: 1.092 ± 0.303
2.457ThrAsn: 2.457 ± 0.448
2.639ThrPro: 2.639 ± 0.49
1.82ThrGln: 1.82 ± 0.802
3.549ThrArg: 3.549 ± 0.473
3.276ThrSer: 3.276 ± 0.551
3.64ThrThr: 3.64 ± 0.844
4.277ThrVal: 4.277 ± 0.589
1.092ThrTrp: 1.092 ± 0.325
1.001ThrTyr: 1.001 ± 0.377
0.0ThrXaa: 0.0 ± 0.0
Val
7.463ValAla: 7.463 ± 0.817
0.728ValCys: 0.728 ± 0.313
3.458ValAsp: 3.458 ± 0.577
4.641ValGlu: 4.641 ± 0.63
2.457ValPhe: 2.457 ± 0.44
3.64ValGly: 3.64 ± 0.57
0.91ValHis: 0.91 ± 0.276
3.822ValIle: 3.822 ± 0.441
3.913ValLys: 3.913 ± 0.605
3.367ValLeu: 3.367 ± 0.502
1.638ValMet: 1.638 ± 0.417
4.095ValAsn: 4.095 ± 0.761
2.457ValPro: 2.457 ± 0.54
3.367ValGln: 3.367 ± 0.542
4.55ValArg: 4.55 ± 0.819
3.549ValSer: 3.549 ± 0.582
4.732ValThr: 4.732 ± 0.761
5.005ValVal: 5.005 ± 0.867
1.183ValTrp: 1.183 ± 0.37
1.638ValTyr: 1.638 ± 0.43
0.0ValXaa: 0.0 ± 0.0
Trp
1.274TrpAla: 1.274 ± 0.392
0.273TrpCys: 0.273 ± 0.155
0.819TrpAsp: 0.819 ± 0.226
1.729TrpGlu: 1.729 ± 0.417
0.819TrpPhe: 0.819 ± 0.194
0.91TrpGly: 0.91 ± 0.324
0.546TrpHis: 0.546 ± 0.265
0.637TrpIle: 0.637 ± 0.303
1.456TrpLys: 1.456 ± 0.463
2.184TrpLeu: 2.184 ± 0.446
0.364TrpMet: 0.364 ± 0.186
0.91TrpAsn: 0.91 ± 0.308
1.001TrpPro: 1.001 ± 0.322
0.637TrpGln: 0.637 ± 0.227
2.002TrpArg: 2.002 ± 0.471
1.092TrpSer: 1.092 ± 0.342
0.91TrpThr: 0.91 ± 0.35
1.82TrpVal: 1.82 ± 0.412
0.091TrpTrp: 0.091 ± 0.084
0.364TrpTyr: 0.364 ± 0.181
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.094TyrAla: 3.094 ± 0.496
0.364TyrCys: 0.364 ± 0.186
2.002TyrAsp: 2.002 ± 0.483
1.092TyrGlu: 1.092 ± 0.317
1.274TyrPhe: 1.274 ± 0.38
2.548TyrGly: 2.548 ± 0.471
0.455TyrHis: 0.455 ± 0.218
1.638TyrIle: 1.638 ± 0.397
0.819TyrLys: 0.819 ± 0.285
1.092TyrLeu: 1.092 ± 0.318
1.001TyrMet: 1.001 ± 0.296
1.456TyrAsn: 1.456 ± 0.346
1.365TyrPro: 1.365 ± 0.405
2.184TyrGln: 2.184 ± 0.327
1.82TyrArg: 1.82 ± 0.35
1.638TyrSer: 1.638 ± 0.509
2.093TyrThr: 2.093 ± 0.483
2.184TyrVal: 2.184 ± 0.375
0.455TyrTrp: 0.455 ± 0.225
0.728TyrTyr: 0.728 ± 0.224
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (10989 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski