Amino acid dipepetide frequency for Klebsiella phage vB_KpnP_IME308

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.603AlaAla: 16.603 ± 1.455
0.751AlaCys: 0.751 ± 0.288
5.86AlaAsp: 5.86 ± 0.731
5.033AlaGlu: 5.033 ± 0.629
3.456AlaPhe: 3.456 ± 0.42
8.489AlaGly: 8.489 ± 1.163
1.503AlaHis: 1.503 ± 0.481
4.282AlaIle: 4.282 ± 0.602
5.559AlaLys: 5.559 ± 1.037
9.316AlaLeu: 9.316 ± 0.796
2.705AlaMet: 2.705 ± 0.394
3.381AlaAsn: 3.381 ± 0.51
3.982AlaPro: 3.982 ± 0.848
5.184AlaGln: 5.184 ± 0.856
5.71AlaArg: 5.71 ± 0.731
6.085AlaSer: 6.085 ± 0.807
5.184AlaThr: 5.184 ± 0.704
6.836AlaVal: 6.836 ± 0.92
1.277AlaTrp: 1.277 ± 0.295
4.282AlaTyr: 4.282 ± 0.617
0.0AlaXaa: 0.0 ± 0.0
Cys
0.826CysAla: 0.826 ± 0.329
0.376CysCys: 0.376 ± 0.226
0.301CysAsp: 0.301 ± 0.15
0.451CysGlu: 0.451 ± 0.178
0.451CysPhe: 0.451 ± 0.193
0.902CysGly: 0.902 ± 0.28
0.451CysHis: 0.451 ± 0.189
0.301CysIle: 0.301 ± 0.164
0.601CysLys: 0.601 ± 0.238
0.826CysLeu: 0.826 ± 0.3
0.751CysMet: 0.751 ± 0.218
0.601CysAsn: 0.601 ± 0.285
0.601CysPro: 0.601 ± 0.226
0.225CysGln: 0.225 ± 0.114
0.751CysArg: 0.751 ± 0.22
1.052CysSer: 1.052 ± 0.339
0.826CysThr: 0.826 ± 0.221
0.676CysVal: 0.676 ± 0.241
0.225CysTrp: 0.225 ± 0.139
0.376CysTyr: 0.376 ± 0.137
0.0CysXaa: 0.0 ± 0.0
Asp
6.912AspAla: 6.912 ± 0.87
1.352AspCys: 1.352 ± 0.348
3.005AspAsp: 3.005 ± 0.488
3.756AspGlu: 3.756 ± 0.543
2.179AspPhe: 2.179 ± 0.406
4.808AspGly: 4.808 ± 0.625
0.751AspHis: 0.751 ± 0.28
3.306AspIle: 3.306 ± 0.557
3.08AspLys: 3.08 ± 0.429
5.184AspLeu: 5.184 ± 0.596
2.254AspMet: 2.254 ± 0.398
2.705AspAsn: 2.705 ± 0.442
2.554AspPro: 2.554 ± 0.486
1.578AspGln: 1.578 ± 0.318
2.479AspArg: 2.479 ± 0.592
4.508AspSer: 4.508 ± 0.522
3.907AspThr: 3.907 ± 0.643
3.456AspVal: 3.456 ± 0.428
0.902AspTrp: 0.902 ± 0.182
2.179AspTyr: 2.179 ± 0.551
0.0AspXaa: 0.0 ± 0.0
Glu
5.409GluAla: 5.409 ± 0.911
0.526GluCys: 0.526 ± 0.199
3.005GluAsp: 3.005 ± 0.489
3.456GluGlu: 3.456 ± 0.855
2.254GluPhe: 2.254 ± 0.329
4.132GluGly: 4.132 ± 0.628
2.104GluHis: 2.104 ± 0.371
2.028GluIle: 2.028 ± 0.366
2.028GluLys: 2.028 ± 0.428
5.259GluLeu: 5.259 ± 0.506
1.803GluMet: 1.803 ± 0.354
1.953GluAsn: 1.953 ± 0.429
2.028GluPro: 2.028 ± 0.453
3.681GluGln: 3.681 ± 0.586
3.907GluArg: 3.907 ± 0.628
2.404GluSer: 2.404 ± 0.517
2.554GluThr: 2.554 ± 0.429
4.883GluVal: 4.883 ± 0.636
0.977GluTrp: 0.977 ± 0.263
2.705GluTyr: 2.705 ± 0.442
0.0GluXaa: 0.0 ± 0.0
Phe
2.855PheAla: 2.855 ± 0.432
0.376PheCys: 0.376 ± 0.231
2.104PheAsp: 2.104 ± 0.411
2.179PheGlu: 2.179 ± 0.443
1.352PhePhe: 1.352 ± 0.339
2.254PheGly: 2.254 ± 0.316
0.526PheHis: 0.526 ± 0.202
1.202PheIle: 1.202 ± 0.272
1.653PheLys: 1.653 ± 0.423
2.104PheLeu: 2.104 ± 0.461
0.601PheMet: 0.601 ± 0.231
1.503PheAsn: 1.503 ± 0.39
1.427PhePro: 1.427 ± 0.301
1.277PheGln: 1.277 ± 0.244
1.878PheArg: 1.878 ± 0.462
1.653PheSer: 1.653 ± 0.391
1.953PheThr: 1.953 ± 0.461
1.953PheVal: 1.953 ± 0.467
0.526PheTrp: 0.526 ± 0.191
1.503PheTyr: 1.503 ± 0.234
0.0PheXaa: 0.0 ± 0.0
Gly
6.836GlyAla: 6.836 ± 0.78
1.277GlyCys: 1.277 ± 0.328
4.883GlyAsp: 4.883 ± 0.513
3.982GlyGlu: 3.982 ± 0.527
2.179GlyPhe: 2.179 ± 0.478
5.259GlyGly: 5.259 ± 0.634
1.427GlyHis: 1.427 ± 0.417
4.282GlyIle: 4.282 ± 0.648
4.432GlyLys: 4.432 ± 0.805
6.987GlyLeu: 6.987 ± 0.804
1.653GlyMet: 1.653 ± 0.445
3.907GlyAsn: 3.907 ± 0.623
1.728GlyPro: 1.728 ± 0.368
3.08GlyGln: 3.08 ± 0.465
4.508GlyArg: 4.508 ± 0.576
5.409GlySer: 5.409 ± 0.671
5.334GlyThr: 5.334 ± 0.789
6.01GlyVal: 6.01 ± 0.69
0.826GlyTrp: 0.826 ± 0.205
3.456GlyTyr: 3.456 ± 0.537
0.0GlyXaa: 0.0 ± 0.0
His
1.503HisAla: 1.503 ± 0.4
0.451HisCys: 0.451 ± 0.144
1.503HisAsp: 1.503 ± 0.358
1.202HisGlu: 1.202 ± 0.324
0.376HisPhe: 0.376 ± 0.159
1.728HisGly: 1.728 ± 0.447
0.301HisHis: 0.301 ± 0.148
1.578HisIle: 1.578 ± 0.394
0.826HisLys: 0.826 ± 0.209
2.254HisLeu: 2.254 ± 0.432
0.451HisMet: 0.451 ± 0.178
0.751HisAsn: 0.751 ± 0.217
0.751HisPro: 0.751 ± 0.357
0.601HisGln: 0.601 ± 0.24
1.052HisArg: 1.052 ± 0.299
0.977HisSer: 0.977 ± 0.293
0.751HisThr: 0.751 ± 0.255
0.902HisVal: 0.902 ± 0.256
0.15HisTrp: 0.15 ± 0.124
0.751HisTyr: 0.751 ± 0.263
0.0HisXaa: 0.0 ± 0.0
Ile
3.23IleAla: 3.23 ± 0.593
0.601IleCys: 0.601 ± 0.237
3.005IleAsp: 3.005 ± 0.433
2.479IleGlu: 2.479 ± 0.495
0.526IlePhe: 0.526 ± 0.155
2.855IleGly: 2.855 ± 0.513
0.751IleHis: 0.751 ± 0.238
1.653IleIle: 1.653 ± 0.314
2.705IleLys: 2.705 ± 0.51
4.432IleLeu: 4.432 ± 0.577
1.202IleMet: 1.202 ± 0.212
2.254IleAsn: 2.254 ± 0.457
2.329IlePro: 2.329 ± 0.5
2.78IleGln: 2.78 ± 0.462
2.855IleArg: 2.855 ± 0.421
2.93IleSer: 2.93 ± 0.485
2.93IleThr: 2.93 ± 0.555
2.404IleVal: 2.404 ± 0.374
0.225IleTrp: 0.225 ± 0.147
1.127IleTyr: 1.127 ± 0.283
0.0IleXaa: 0.0 ± 0.0
Lys
6.235LysAla: 6.235 ± 0.871
0.451LysCys: 0.451 ± 0.186
2.78LysAsp: 2.78 ± 0.439
3.155LysGlu: 3.155 ± 0.525
1.202LysPhe: 1.202 ± 0.384
2.93LysGly: 2.93 ± 0.548
1.202LysHis: 1.202 ± 0.314
1.127LysIle: 1.127 ± 0.284
1.803LysLys: 1.803 ± 0.374
5.109LysLeu: 5.109 ± 0.803
1.578LysMet: 1.578 ± 0.315
1.503LysAsn: 1.503 ± 0.273
1.578LysPro: 1.578 ± 0.452
3.155LysGln: 3.155 ± 0.631
3.681LysArg: 3.681 ± 0.585
2.855LysSer: 2.855 ± 0.415
2.93LysThr: 2.93 ± 0.46
3.381LysVal: 3.381 ± 0.595
1.127LysTrp: 1.127 ± 0.261
1.503LysTyr: 1.503 ± 0.304
0.0LysXaa: 0.0 ± 0.0
Leu
8.79LeuAla: 8.79 ± 0.806
0.902LeuCys: 0.902 ± 0.293
6.912LeuAsp: 6.912 ± 0.638
5.033LeuGlu: 5.033 ± 0.57
2.554LeuPhe: 2.554 ± 0.391
7.137LeuGly: 7.137 ± 0.827
1.653LeuHis: 1.653 ± 0.329
3.907LeuIle: 3.907 ± 0.649
3.155LeuLys: 3.155 ± 0.493
7.287LeuLeu: 7.287 ± 0.665
2.179LeuMet: 2.179 ± 0.379
3.456LeuAsn: 3.456 ± 0.56
3.456LeuPro: 3.456 ± 0.506
3.831LeuGln: 3.831 ± 0.599
6.085LeuArg: 6.085 ± 0.61
5.86LeuSer: 5.86 ± 0.72
4.658LeuThr: 4.658 ± 0.609
6.311LeuVal: 6.311 ± 0.724
1.052LeuTrp: 1.052 ± 0.303
3.23LeuTyr: 3.23 ± 0.471
0.0LeuXaa: 0.0 ± 0.0
Met
3.23MetAla: 3.23 ± 0.55
0.301MetCys: 0.301 ± 0.173
1.953MetAsp: 1.953 ± 0.498
1.127MetGlu: 1.127 ± 0.248
1.052MetPhe: 1.052 ± 0.291
1.803MetGly: 1.803 ± 0.268
0.902MetHis: 0.902 ± 0.29
0.676MetIle: 0.676 ± 0.23
1.202MetLys: 1.202 ± 0.315
3.23MetLeu: 3.23 ± 0.551
0.601MetMet: 0.601 ± 0.27
0.826MetAsn: 0.826 ± 0.31
0.977MetPro: 0.977 ± 0.276
2.179MetGln: 2.179 ± 0.404
1.728MetArg: 1.728 ± 0.381
1.803MetSer: 1.803 ± 0.36
0.977MetThr: 0.977 ± 0.316
2.104MetVal: 2.104 ± 0.385
0.451MetTrp: 0.451 ± 0.175
1.127MetTyr: 1.127 ± 0.324
0.0MetXaa: 0.0 ± 0.0
Asn
2.93AsnAla: 2.93 ± 0.45
0.301AsnCys: 0.301 ± 0.144
1.878AsnAsp: 1.878 ± 0.458
1.052AsnGlu: 1.052 ± 0.301
0.751AsnPhe: 0.751 ± 0.239
3.831AsnGly: 3.831 ± 0.59
0.225AsnHis: 0.225 ± 0.1
2.705AsnIle: 2.705 ± 0.573
2.179AsnLys: 2.179 ± 0.362
3.08AsnLeu: 3.08 ± 0.541
1.427AsnMet: 1.427 ± 0.359
1.503AsnAsn: 1.503 ± 0.451
2.404AsnPro: 2.404 ± 0.446
1.427AsnGln: 1.427 ± 0.364
2.179AsnArg: 2.179 ± 0.425
3.756AsnSer: 3.756 ± 0.57
2.93AsnThr: 2.93 ± 0.481
3.306AsnVal: 3.306 ± 0.402
0.751AsnTrp: 0.751 ± 0.236
1.728AsnTyr: 1.728 ± 0.352
0.0AsnXaa: 0.0 ± 0.0
Pro
4.432ProAla: 4.432 ± 0.765
0.15ProCys: 0.15 ± 0.098
2.705ProAsp: 2.705 ± 0.506
3.756ProGlu: 3.756 ± 0.497
0.977ProPhe: 0.977 ± 0.211
2.705ProGly: 2.705 ± 0.535
0.526ProHis: 0.526 ± 0.223
1.803ProIle: 1.803 ± 0.364
1.728ProLys: 1.728 ± 0.381
3.005ProLeu: 3.005 ± 0.506
1.052ProMet: 1.052 ± 0.299
1.503ProAsn: 1.503 ± 0.369
0.601ProPro: 0.601 ± 0.224
1.352ProGln: 1.352 ± 0.262
1.953ProArg: 1.953 ± 0.366
2.329ProSer: 2.329 ± 0.437
2.629ProThr: 2.629 ± 0.378
2.78ProVal: 2.78 ± 0.426
0.601ProTrp: 0.601 ± 0.23
1.127ProTyr: 1.127 ± 0.274
0.0ProXaa: 0.0 ± 0.0
Gln
4.808GlnAla: 4.808 ± 0.686
0.301GlnCys: 0.301 ± 0.153
2.855GlnAsp: 2.855 ± 0.44
3.756GlnGlu: 3.756 ± 0.602
1.503GlnPhe: 1.503 ± 0.316
2.78GlnGly: 2.78 ± 0.464
1.127GlnHis: 1.127 ± 0.322
1.202GlnIle: 1.202 ± 0.353
2.554GlnLys: 2.554 ± 0.532
4.733GlnLeu: 4.733 ± 0.602
1.127GlnMet: 1.127 ± 0.265
2.028GlnAsn: 2.028 ± 0.427
1.427GlnPro: 1.427 ± 0.396
2.855GlnGln: 2.855 ± 0.626
2.629GlnArg: 2.629 ± 0.378
3.155GlnSer: 3.155 ± 0.585
1.953GlnThr: 1.953 ± 0.521
3.005GlnVal: 3.005 ± 0.506
0.601GlnTrp: 0.601 ± 0.258
1.953GlnTyr: 1.953 ± 0.436
0.0GlnXaa: 0.0 ± 0.0
Arg
6.761ArgAla: 6.761 ± 1.087
0.676ArgCys: 0.676 ± 0.29
3.306ArgAsp: 3.306 ± 0.574
3.907ArgGlu: 3.907 ± 0.611
2.179ArgPhe: 2.179 ± 0.367
4.432ArgGly: 4.432 ± 0.785
1.052ArgHis: 1.052 ± 0.247
3.155ArgIle: 3.155 ± 0.579
3.23ArgLys: 3.23 ± 0.649
4.958ArgLeu: 4.958 ± 0.617
2.028ArgMet: 2.028 ± 0.45
2.329ArgAsn: 2.329 ± 0.403
1.803ArgPro: 1.803 ± 0.359
2.554ArgGln: 2.554 ± 0.391
4.508ArgArg: 4.508 ± 0.646
2.705ArgSer: 2.705 ± 0.595
2.78ArgThr: 2.78 ± 0.387
4.207ArgVal: 4.207 ± 0.559
0.826ArgTrp: 0.826 ± 0.259
1.878ArgTyr: 1.878 ± 0.335
0.0ArgXaa: 0.0 ± 0.0
Ser
8.489SerAla: 8.489 ± 0.905
0.751SerCys: 0.751 ± 0.265
3.831SerAsp: 3.831 ± 0.481
3.606SerGlu: 3.606 ± 0.626
1.878SerPhe: 1.878 ± 0.367
5.935SerGly: 5.935 ± 0.922
0.526SerHis: 0.526 ± 0.193
2.554SerIle: 2.554 ± 0.528
4.357SerLys: 4.357 ± 0.546
4.658SerLeu: 4.658 ± 0.576
2.705SerMet: 2.705 ± 0.385
2.554SerAsn: 2.554 ± 0.443
2.404SerPro: 2.404 ± 0.403
1.503SerGln: 1.503 ± 0.284
3.08SerArg: 3.08 ± 0.389
3.982SerSer: 3.982 ± 0.797
4.583SerThr: 4.583 ± 0.607
4.658SerVal: 4.658 ± 0.591
1.127SerTrp: 1.127 ± 0.302
1.653SerTyr: 1.653 ± 0.407
0.0SerXaa: 0.0 ± 0.0
Thr
5.484ThrAla: 5.484 ± 0.893
0.751ThrCys: 0.751 ± 0.27
2.93ThrAsp: 2.93 ± 0.413
2.404ThrGlu: 2.404 ± 0.505
2.104ThrPhe: 2.104 ± 0.454
5.484ThrGly: 5.484 ± 0.953
1.052ThrHis: 1.052 ± 0.298
2.028ThrIle: 2.028 ± 0.345
2.705ThrLys: 2.705 ± 0.466
5.033ThrLeu: 5.033 ± 0.741
1.352ThrMet: 1.352 ± 0.444
2.479ThrAsn: 2.479 ± 0.494
3.08ThrPro: 3.08 ± 0.397
2.855ThrGln: 2.855 ± 0.4
2.254ThrArg: 2.254 ± 0.49
4.883ThrSer: 4.883 ± 0.727
4.057ThrThr: 4.057 ± 0.744
4.808ThrVal: 4.808 ± 0.71
0.826ThrTrp: 0.826 ± 0.244
2.179ThrTyr: 2.179 ± 0.466
0.0ThrXaa: 0.0 ± 0.0
Val
6.461ValAla: 6.461 ± 0.73
0.376ValCys: 0.376 ± 0.176
4.883ValAsp: 4.883 ± 0.558
3.831ValGlu: 3.831 ± 0.624
1.728ValPhe: 1.728 ± 0.471
6.311ValGly: 6.311 ± 0.79
1.803ValHis: 1.803 ± 0.408
2.78ValIle: 2.78 ± 0.465
2.855ValLys: 2.855 ± 0.524
5.334ValLeu: 5.334 ± 0.885
1.878ValMet: 1.878 ± 0.425
3.005ValAsn: 3.005 ± 0.599
3.005ValPro: 3.005 ± 0.639
3.306ValGln: 3.306 ± 0.885
4.432ValArg: 4.432 ± 0.504
5.109ValSer: 5.109 ± 0.726
4.207ValThr: 4.207 ± 0.871
5.71ValVal: 5.71 ± 0.629
0.751ValTrp: 0.751 ± 0.244
2.629ValTyr: 2.629 ± 0.39
0.0ValXaa: 0.0 ± 0.0
Trp
1.202TrpAla: 1.202 ± 0.229
0.225TrpCys: 0.225 ± 0.112
0.902TrpAsp: 0.902 ± 0.235
1.127TrpGlu: 1.127 ± 0.312
0.751TrpPhe: 0.751 ± 0.301
0.601TrpGly: 0.601 ± 0.232
0.301TrpHis: 0.301 ± 0.125
0.451TrpIle: 0.451 ± 0.182
0.676TrpLys: 0.676 ± 0.213
1.277TrpLeu: 1.277 ± 0.279
0.15TrpMet: 0.15 ± 0.17
0.751TrpAsn: 0.751 ± 0.251
0.376TrpPro: 0.376 ± 0.218
0.451TrpGln: 0.451 ± 0.189
1.052TrpArg: 1.052 ± 0.258
0.601TrpSer: 0.601 ± 0.223
0.902TrpThr: 0.902 ± 0.206
1.052TrpVal: 1.052 ± 0.332
0.451TrpTrp: 0.451 ± 0.182
0.977TrpTyr: 0.977 ± 0.303
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.479TyrAla: 2.479 ± 0.449
0.676TyrCys: 0.676 ± 0.258
2.254TyrAsp: 2.254 ± 0.363
1.803TyrGlu: 1.803 ± 0.464
1.427TyrPhe: 1.427 ± 0.33
3.08TyrGly: 3.08 ± 0.518
0.676TyrHis: 0.676 ± 0.241
2.254TyrIle: 2.254 ± 0.466
1.953TyrLys: 1.953 ± 0.385
3.381TyrLeu: 3.381 ± 0.429
0.676TyrMet: 0.676 ± 0.287
1.352TyrAsn: 1.352 ± 0.299
1.202TyrPro: 1.202 ± 0.263
2.479TyrGln: 2.479 ± 0.358
2.629TyrArg: 2.629 ± 0.553
2.78TyrSer: 2.78 ± 0.426
2.705TyrThr: 2.705 ± 0.555
1.953TyrVal: 1.953 ± 0.478
0.601TyrTrp: 0.601 ± 0.23
1.277TyrTyr: 1.277 ± 0.375
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (13312 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski