Amino acid dipepetide frequency for Klebsiella phage KMI8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.639AlaAla: 8.639 ± 1.008
0.87AlaCys: 0.87 ± 0.349
4.475AlaAsp: 4.475 ± 0.523
6.215AlaGlu: 6.215 ± 0.641
2.797AlaPhe: 2.797 ± 0.492
6.34AlaGly: 6.34 ± 0.698
0.932AlaHis: 0.932 ± 0.306
5.905AlaIle: 5.905 ± 1.27
8.826AlaLys: 8.826 ± 1.253
7.148AlaLeu: 7.148 ± 0.788
3.17AlaMet: 3.17 ± 0.641
3.543AlaAsn: 3.543 ± 0.867
1.678AlaPro: 1.678 ± 0.276
3.667AlaGln: 3.667 ± 0.619
5.283AlaArg: 5.283 ± 0.834
5.78AlaSer: 5.78 ± 0.798
6.464AlaThr: 6.464 ± 0.971
5.905AlaVal: 5.905 ± 0.583
1.243AlaTrp: 1.243 ± 0.206
2.673AlaTyr: 2.673 ± 0.428
0.0AlaXaa: 0.0 ± 0.0
Cys
0.808CysAla: 0.808 ± 0.219
0.062CysCys: 0.062 ± 0.063
0.87CysAsp: 0.87 ± 0.206
0.87CysGlu: 0.87 ± 0.226
0.373CysPhe: 0.373 ± 0.167
1.305CysGly: 1.305 ± 0.383
0.124CysHis: 0.124 ± 0.088
0.808CysIle: 0.808 ± 0.238
0.559CysLys: 0.559 ± 0.207
0.497CysLeu: 0.497 ± 0.202
0.622CysMet: 0.622 ± 0.237
0.808CysAsn: 0.808 ± 0.273
0.497CysPro: 0.497 ± 0.155
0.249CysGln: 0.249 ± 0.129
0.808CysArg: 0.808 ± 0.257
0.622CysSer: 0.622 ± 0.2
0.684CysThr: 0.684 ± 0.233
0.994CysVal: 0.994 ± 0.266
0.249CysTrp: 0.249 ± 0.117
0.497CysTyr: 0.497 ± 0.181
0.0CysXaa: 0.0 ± 0.0
Asp
7.645AspAla: 7.645 ± 1.275
1.057AspCys: 1.057 ± 0.261
3.916AspAsp: 3.916 ± 0.615
4.289AspGlu: 4.289 ± 0.418
2.424AspPhe: 2.424 ± 0.38
5.097AspGly: 5.097 ± 0.624
0.87AspHis: 0.87 ± 0.205
3.791AspIle: 3.791 ± 0.452
4.04AspLys: 4.04 ± 0.465
5.034AspLeu: 5.034 ± 0.591
1.554AspMet: 1.554 ± 0.358
2.921AspAsn: 2.921 ± 0.391
1.243AspPro: 1.243 ± 0.284
1.43AspGln: 1.43 ± 0.349
2.921AspArg: 2.921 ± 0.426
3.854AspSer: 3.854 ± 0.421
3.294AspThr: 3.294 ± 0.528
3.605AspVal: 3.605 ± 0.383
1.243AspTrp: 1.243 ± 0.334
2.175AspTyr: 2.175 ± 0.399
0.0AspXaa: 0.0 ± 0.0
Glu
7.023GluAla: 7.023 ± 1.298
0.932GluCys: 0.932 ± 0.28
3.356GluAsp: 3.356 ± 0.623
6.091GluGlu: 6.091 ± 0.657
3.667GluPhe: 3.667 ± 0.475
3.854GluGly: 3.854 ± 0.429
1.678GluHis: 1.678 ± 0.415
4.351GluIle: 4.351 ± 0.449
4.04GluLys: 4.04 ± 0.695
5.034GluLeu: 5.034 ± 0.744
2.859GluMet: 2.859 ± 0.398
2.61GluAsn: 2.61 ± 0.453
2.051GluPro: 2.051 ± 0.322
3.729GluGln: 3.729 ± 0.444
2.797GluArg: 2.797 ± 0.472
4.04GluSer: 4.04 ± 0.506
3.046GluThr: 3.046 ± 0.66
6.464GluVal: 6.464 ± 0.58
0.559GluTrp: 0.559 ± 0.174
2.548GluTyr: 2.548 ± 0.427
0.0GluXaa: 0.0 ± 0.0
Phe
2.735PheAla: 2.735 ± 0.481
0.497PheCys: 0.497 ± 0.218
3.916PheAsp: 3.916 ± 0.658
2.859PheGlu: 2.859 ± 0.489
1.43PhePhe: 1.43 ± 0.395
3.543PheGly: 3.543 ± 0.514
0.808PheHis: 0.808 ± 0.214
2.175PheIle: 2.175 ± 0.527
2.983PheLys: 2.983 ± 0.476
2.113PheLeu: 2.113 ± 0.46
0.994PheMet: 0.994 ± 0.236
2.3PheAsn: 2.3 ± 0.397
1.43PhePro: 1.43 ± 0.367
1.243PheGln: 1.243 ± 0.326
2.238PheArg: 2.238 ± 0.372
2.051PheSer: 2.051 ± 0.469
2.983PheThr: 2.983 ± 0.406
2.424PheVal: 2.424 ± 0.419
0.622PheTrp: 0.622 ± 0.248
1.119PheTyr: 1.119 ± 0.327
0.0PheXaa: 0.0 ± 0.0
Gly
4.972GlyAla: 4.972 ± 0.767
1.119GlyCys: 1.119 ± 0.254
4.786GlyAsp: 4.786 ± 0.624
5.47GlyGlu: 5.47 ± 0.575
3.418GlyPhe: 3.418 ± 0.544
7.21GlyGly: 7.21 ± 1.063
1.119GlyHis: 1.119 ± 0.41
4.599GlyIle: 4.599 ± 0.395
6.091GlyLys: 6.091 ± 0.587
5.905GlyLeu: 5.905 ± 0.499
2.238GlyMet: 2.238 ± 0.367
3.854GlyAsn: 3.854 ± 0.527
1.181GlyPro: 1.181 ± 0.269
2.175GlyGln: 2.175 ± 0.284
3.605GlyArg: 3.605 ± 0.463
4.662GlySer: 4.662 ± 0.552
3.232GlyThr: 3.232 ± 0.454
5.532GlyVal: 5.532 ± 0.533
1.43GlyTrp: 1.43 ± 0.268
3.294GlyTyr: 3.294 ± 0.493
0.0GlyXaa: 0.0 ± 0.0
His
0.994HisAla: 0.994 ± 0.289
0.249HisCys: 0.249 ± 0.124
1.305HisAsp: 1.305 ± 0.368
1.057HisGlu: 1.057 ± 0.281
0.808HisPhe: 0.808 ± 0.24
1.927HisGly: 1.927 ± 0.425
0.497HisHis: 0.497 ± 0.157
0.932HisIle: 0.932 ± 0.3
0.994HisLys: 0.994 ± 0.306
1.119HisLeu: 1.119 ± 0.326
0.249HisMet: 0.249 ± 0.129
0.746HisAsn: 0.746 ± 0.192
0.559HisPro: 0.559 ± 0.23
0.559HisGln: 0.559 ± 0.185
1.119HisArg: 1.119 ± 0.316
1.243HisSer: 1.243 ± 0.315
0.994HisThr: 0.994 ± 0.273
0.994HisVal: 0.994 ± 0.349
0.373HisTrp: 0.373 ± 0.176
0.808HisTyr: 0.808 ± 0.179
0.0HisXaa: 0.0 ± 0.0
Ile
6.775IleAla: 6.775 ± 0.735
0.746IleCys: 0.746 ± 0.272
3.978IleAsp: 3.978 ± 0.458
3.978IleGlu: 3.978 ± 0.489
2.238IlePhe: 2.238 ± 0.397
4.289IleGly: 4.289 ± 0.419
1.367IleHis: 1.367 ± 0.337
3.232IleIle: 3.232 ± 0.43
3.916IleLys: 3.916 ± 0.486
3.543IleLeu: 3.543 ± 0.499
1.243IleMet: 1.243 ± 0.24
3.978IleAsn: 3.978 ± 0.611
2.238IlePro: 2.238 ± 0.367
2.175IleGln: 2.175 ± 0.422
2.797IleArg: 2.797 ± 0.454
3.108IleSer: 3.108 ± 0.505
4.351IleThr: 4.351 ± 0.652
4.351IleVal: 4.351 ± 0.648
0.808IleTrp: 0.808 ± 0.238
0.994IleTyr: 0.994 ± 0.263
0.0IleXaa: 0.0 ± 0.0
Lys
7.148LysAla: 7.148 ± 1.01
0.994LysCys: 0.994 ± 0.3
4.289LysAsp: 4.289 ± 0.552
5.221LysGlu: 5.221 ± 0.575
2.3LysPhe: 2.3 ± 0.343
3.854LysGly: 3.854 ± 0.472
1.367LysHis: 1.367 ± 0.378
3.543LysIle: 3.543 ± 0.362
4.226LysLys: 4.226 ± 0.471
4.91LysLeu: 4.91 ± 0.563
2.548LysMet: 2.548 ± 0.562
3.046LysAsn: 3.046 ± 0.482
2.797LysPro: 2.797 ± 0.509
2.983LysGln: 2.983 ± 0.506
4.289LysArg: 4.289 ± 0.643
3.605LysSer: 3.605 ± 0.491
4.786LysThr: 4.786 ± 0.623
4.972LysVal: 4.972 ± 0.524
0.994LysTrp: 0.994 ± 0.255
1.243LysTyr: 1.243 ± 0.282
0.0LysXaa: 0.0 ± 0.0
Leu
6.713LeuAla: 6.713 ± 0.833
0.87LeuCys: 0.87 ± 0.254
4.413LeuAsp: 4.413 ± 0.515
4.04LeuGlu: 4.04 ± 0.422
2.548LeuPhe: 2.548 ± 0.476
4.662LeuGly: 4.662 ± 0.693
1.367LeuHis: 1.367 ± 0.414
3.481LeuIle: 3.481 ± 0.458
5.159LeuLys: 5.159 ± 0.471
4.04LeuLeu: 4.04 ± 0.467
1.865LeuMet: 1.865 ± 0.39
3.543LeuAsn: 3.543 ± 0.474
2.921LeuPro: 2.921 ± 0.61
2.3LeuGln: 2.3 ± 0.323
3.978LeuArg: 3.978 ± 0.539
4.04LeuSer: 4.04 ± 0.422
4.413LeuThr: 4.413 ± 0.567
3.978LeuVal: 3.978 ± 0.549
0.746LeuTrp: 0.746 ± 0.203
2.3LeuTyr: 2.3 ± 0.481
0.0LeuXaa: 0.0 ± 0.0
Met
3.356MetAla: 3.356 ± 0.501
0.186MetCys: 0.186 ± 0.103
1.305MetAsp: 1.305 ± 0.309
1.367MetGlu: 1.367 ± 0.325
0.87MetPhe: 0.87 ± 0.222
1.678MetGly: 1.678 ± 0.339
0.311MetHis: 0.311 ± 0.148
1.678MetIle: 1.678 ± 0.418
2.921MetLys: 2.921 ± 0.428
1.865MetLeu: 1.865 ± 0.389
1.057MetMet: 1.057 ± 0.249
1.678MetAsn: 1.678 ± 0.314
0.994MetPro: 0.994 ± 0.204
1.305MetGln: 1.305 ± 0.349
1.74MetArg: 1.74 ± 0.43
1.865MetSer: 1.865 ± 0.364
1.865MetThr: 1.865 ± 0.351
1.865MetVal: 1.865 ± 0.319
0.311MetTrp: 0.311 ± 0.135
0.994MetTyr: 0.994 ± 0.231
0.0MetXaa: 0.0 ± 0.0
Asn
4.537AsnAla: 4.537 ± 0.936
0.373AsnCys: 0.373 ± 0.139
3.108AsnAsp: 3.108 ± 0.415
4.662AsnGlu: 4.662 ± 0.788
1.554AsnPhe: 1.554 ± 0.327
5.221AsnGly: 5.221 ± 0.608
0.994AsnHis: 0.994 ± 0.3
2.61AsnIle: 2.61 ± 0.399
2.983AsnLys: 2.983 ± 0.514
2.735AsnLeu: 2.735 ± 0.479
1.057AsnMet: 1.057 ± 0.297
2.673AsnAsn: 2.673 ± 0.498
2.424AsnPro: 2.424 ± 0.374
1.865AsnGln: 1.865 ± 0.446
2.113AsnArg: 2.113 ± 0.418
2.424AsnSer: 2.424 ± 0.408
2.113AsnThr: 2.113 ± 0.539
3.17AsnVal: 3.17 ± 0.47
0.808AsnTrp: 0.808 ± 0.208
1.181AsnTyr: 1.181 ± 0.28
0.0AsnXaa: 0.0 ± 0.0
Pro
1.989ProAla: 1.989 ± 0.372
0.435ProCys: 0.435 ± 0.176
2.859ProAsp: 2.859 ± 0.416
2.61ProGlu: 2.61 ± 0.498
1.243ProPhe: 1.243 ± 0.274
2.238ProGly: 2.238 ± 0.455
0.684ProHis: 0.684 ± 0.212
1.802ProIle: 1.802 ± 0.413
1.616ProLys: 1.616 ± 0.318
1.305ProLeu: 1.305 ± 0.254
0.684ProMet: 0.684 ± 0.214
1.927ProAsn: 1.927 ± 0.396
0.435ProPro: 0.435 ± 0.156
0.932ProGln: 0.932 ± 0.257
1.678ProArg: 1.678 ± 0.322
1.554ProSer: 1.554 ± 0.393
2.051ProThr: 2.051 ± 0.342
3.356ProVal: 3.356 ± 0.466
0.497ProTrp: 0.497 ± 0.179
1.181ProTyr: 1.181 ± 0.331
0.0ProXaa: 0.0 ± 0.0
Gln
4.662GlnAla: 4.662 ± 0.846
0.497GlnCys: 0.497 ± 0.196
1.678GlnAsp: 1.678 ± 0.347
1.74GlnGlu: 1.74 ± 0.288
1.492GlnPhe: 1.492 ± 0.284
2.797GlnGly: 2.797 ± 0.518
0.186GlnHis: 0.186 ± 0.12
2.735GlnIle: 2.735 ± 0.463
2.051GlnLys: 2.051 ± 0.407
2.548GlnLeu: 2.548 ± 0.449
0.932GlnMet: 0.932 ± 0.24
1.43GlnAsn: 1.43 ± 0.324
1.678GlnPro: 1.678 ± 0.373
2.051GlnGln: 2.051 ± 0.373
1.74GlnArg: 1.74 ± 0.306
2.735GlnSer: 2.735 ± 0.387
1.43GlnThr: 1.43 ± 0.337
3.667GlnVal: 3.667 ± 0.587
0.622GlnTrp: 0.622 ± 0.207
1.927GlnTyr: 1.927 ± 0.732
0.0GlnXaa: 0.0 ± 0.0
Arg
4.91ArgAla: 4.91 ± 0.69
0.559ArgCys: 0.559 ± 0.256
2.859ArgAsp: 2.859 ± 0.479
3.854ArgGlu: 3.854 ± 0.495
2.983ArgPhe: 2.983 ± 0.476
3.854ArgGly: 3.854 ± 0.581
0.808ArgHis: 0.808 ± 0.268
3.418ArgIle: 3.418 ± 0.456
3.418ArgLys: 3.418 ± 0.49
4.226ArgLeu: 4.226 ± 0.534
1.492ArgMet: 1.492 ± 0.314
1.802ArgAsn: 1.802 ± 0.406
1.367ArgPro: 1.367 ± 0.315
2.362ArgGln: 2.362 ± 0.526
2.983ArgArg: 2.983 ± 0.453
2.113ArgSer: 2.113 ± 0.419
2.486ArgThr: 2.486 ± 0.363
3.916ArgVal: 3.916 ± 0.758
0.87ArgTrp: 0.87 ± 0.256
2.175ArgTyr: 2.175 ± 0.383
0.0ArgXaa: 0.0 ± 0.0
Ser
4.537SerAla: 4.537 ± 0.674
0.622SerCys: 0.622 ± 0.228
3.791SerAsp: 3.791 ± 0.428
3.978SerGlu: 3.978 ± 0.442
3.232SerPhe: 3.232 ± 0.489
5.034SerGly: 5.034 ± 0.644
0.932SerHis: 0.932 ± 0.277
4.226SerIle: 4.226 ± 0.498
4.226SerLys: 4.226 ± 0.569
3.729SerLeu: 3.729 ± 0.401
1.492SerMet: 1.492 ± 0.373
2.673SerAsn: 2.673 ± 0.63
1.865SerPro: 1.865 ± 0.313
2.362SerGln: 2.362 ± 0.366
3.418SerArg: 3.418 ± 0.411
3.356SerSer: 3.356 ± 0.521
2.735SerThr: 2.735 ± 0.585
3.791SerVal: 3.791 ± 0.394
0.808SerTrp: 0.808 ± 0.231
1.678SerTyr: 1.678 ± 0.341
0.0SerXaa: 0.0 ± 0.0
Thr
5.221ThrAla: 5.221 ± 0.882
0.622ThrCys: 0.622 ± 0.181
3.854ThrAsp: 3.854 ± 0.648
5.097ThrGlu: 5.097 ± 1.545
2.3ThrPhe: 2.3 ± 0.498
4.972ThrGly: 4.972 ± 0.784
0.87ThrHis: 0.87 ± 0.237
3.854ThrIle: 3.854 ± 0.562
2.983ThrLys: 2.983 ± 0.406
3.729ThrLeu: 3.729 ± 0.423
1.554ThrMet: 1.554 ± 0.261
2.859ThrAsn: 2.859 ± 0.512
1.74ThrPro: 1.74 ± 0.285
2.548ThrGln: 2.548 ± 0.626
1.74ThrArg: 1.74 ± 0.288
2.735ThrSer: 2.735 ± 0.436
3.232ThrThr: 3.232 ± 0.419
3.418ThrVal: 3.418 ± 0.516
0.684ThrTrp: 0.684 ± 0.191
1.989ThrTyr: 1.989 ± 0.375
0.0ThrXaa: 0.0 ± 0.0
Val
4.848ValAla: 4.848 ± 0.559
0.808ValCys: 0.808 ± 0.189
4.475ValAsp: 4.475 ± 0.751
4.724ValGlu: 4.724 ± 0.541
2.61ValPhe: 2.61 ± 0.429
4.786ValGly: 4.786 ± 0.662
1.492ValHis: 1.492 ± 0.374
4.786ValIle: 4.786 ± 0.533
5.843ValLys: 5.843 ± 0.626
4.226ValLeu: 4.226 ± 0.441
2.113ValMet: 2.113 ± 0.42
3.605ValAsn: 3.605 ± 0.49
2.3ValPro: 2.3 ± 0.426
2.61ValGln: 2.61 ± 0.475
3.543ValArg: 3.543 ± 0.478
5.159ValSer: 5.159 ± 0.544
3.046ValThr: 3.046 ± 0.405
4.848ValVal: 4.848 ± 0.51
0.932ValTrp: 0.932 ± 0.21
3.232ValTyr: 3.232 ± 0.413
0.0ValXaa: 0.0 ± 0.0
Trp
1.305TrpAla: 1.305 ± 0.356
0.249TrpCys: 0.249 ± 0.154
0.87TrpAsp: 0.87 ± 0.178
0.559TrpGlu: 0.559 ± 0.212
0.87TrpPhe: 0.87 ± 0.275
0.994TrpGly: 0.994 ± 0.217
0.497TrpHis: 0.497 ± 0.182
0.622TrpIle: 0.622 ± 0.169
0.622TrpLys: 0.622 ± 0.171
1.243TrpLeu: 1.243 ± 0.281
0.311TrpMet: 0.311 ± 0.114
0.497TrpAsn: 0.497 ± 0.191
0.435TrpPro: 0.435 ± 0.195
0.373TrpGln: 0.373 ± 0.158
1.305TrpArg: 1.305 ± 0.226
1.43TrpSer: 1.43 ± 0.357
0.746TrpThr: 0.746 ± 0.215
0.808TrpVal: 0.808 ± 0.223
0.249TrpTrp: 0.249 ± 0.135
0.497TrpTyr: 0.497 ± 0.156
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.735TyrAla: 2.735 ± 0.386
0.559TyrCys: 0.559 ± 0.173
2.3TyrAsp: 2.3 ± 0.331
2.051TyrGlu: 2.051 ± 0.352
1.305TyrPhe: 1.305 ± 0.319
2.238TyrGly: 2.238 ± 0.356
0.622TyrHis: 0.622 ± 0.196
1.554TyrIle: 1.554 ± 0.246
1.678TyrLys: 1.678 ± 0.35
2.424TyrLeu: 2.424 ± 0.457
1.119TyrMet: 1.119 ± 0.228
2.113TyrAsn: 2.113 ± 0.535
1.243TyrPro: 1.243 ± 0.245
1.678TyrGln: 1.678 ± 0.261
2.362TyrArg: 2.362 ± 0.418
2.113TyrSer: 2.113 ± 0.347
2.051TyrThr: 2.051 ± 0.327
1.927TyrVal: 1.927 ± 0.333
0.435TyrTrp: 0.435 ± 0.175
1.119TyrTyr: 1.119 ± 0.213
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (16090 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski