Amino acid dipepetide frequency for Klebsiella phage IME304

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.644AlaAla: 9.644 ± 0.992
0.608AlaCys: 0.608 ± 0.279
5.56AlaAsp: 5.56 ± 0.729
5.387AlaGlu: 5.387 ± 0.925
3.128AlaPhe: 3.128 ± 0.56
7.819AlaGly: 7.819 ± 1.214
1.129AlaHis: 1.129 ± 0.26
3.823AlaIle: 3.823 ± 0.564
6.516AlaLys: 6.516 ± 0.808
8.514AlaLeu: 8.514 ± 0.789
3.128AlaMet: 3.128 ± 0.716
4.344AlaAsn: 4.344 ± 0.641
2.606AlaPro: 2.606 ± 0.598
3.736AlaGln: 3.736 ± 0.561
4.778AlaArg: 4.778 ± 0.664
5.821AlaSer: 5.821 ± 0.787
3.562AlaThr: 3.562 ± 0.585
5.821AlaVal: 5.821 ± 0.649
1.043AlaTrp: 1.043 ± 0.341
2.606AlaTyr: 2.606 ± 0.515
0.0AlaXaa: 0.0 ± 0.0
Cys
0.782CysAla: 0.782 ± 0.233
0.087CysCys: 0.087 ± 0.088
0.608CysAsp: 0.608 ± 0.313
0.956CysGlu: 0.956 ± 0.327
0.434CysPhe: 0.434 ± 0.197
0.434CysGly: 0.434 ± 0.167
0.174CysHis: 0.174 ± 0.108
0.608CysIle: 0.608 ± 0.205
0.434CysLys: 0.434 ± 0.218
0.869CysLeu: 0.869 ± 0.321
0.0CysMet: 0.0 ± 0.0
0.174CysAsn: 0.174 ± 0.131
0.782CysPro: 0.782 ± 0.307
0.434CysGln: 0.434 ± 0.231
0.608CysArg: 0.608 ± 0.297
0.869CysSer: 0.869 ± 0.304
0.348CysThr: 0.348 ± 0.199
0.956CysVal: 0.956 ± 0.254
0.434CysTrp: 0.434 ± 0.238
0.434CysTyr: 0.434 ± 0.181
0.0CysXaa: 0.0 ± 0.0
Asp
5.474AspAla: 5.474 ± 0.712
0.434AspCys: 0.434 ± 0.231
4.083AspAsp: 4.083 ± 0.583
3.215AspGlu: 3.215 ± 0.622
2.867AspPhe: 2.867 ± 0.405
6.429AspGly: 6.429 ± 0.607
0.869AspHis: 0.869 ± 0.239
2.346AspIle: 2.346 ± 0.375
4.692AspLys: 4.692 ± 0.749
3.736AspLeu: 3.736 ± 0.662
2.085AspMet: 2.085 ± 0.416
2.867AspAsn: 2.867 ± 0.488
2.606AspPro: 2.606 ± 0.572
2.172AspGln: 2.172 ± 0.587
2.78AspArg: 2.78 ± 0.506
3.649AspSer: 3.649 ± 0.533
3.823AspThr: 3.823 ± 0.474
4.778AspVal: 4.778 ± 0.529
0.869AspTrp: 0.869 ± 0.344
2.606AspTyr: 2.606 ± 0.61
0.0AspXaa: 0.0 ± 0.0
Glu
7.993GluAla: 7.993 ± 0.941
0.608GluCys: 0.608 ± 0.221
4.344GluAsp: 4.344 ± 0.763
5.3GluGlu: 5.3 ± 1.03
2.52GluPhe: 2.52 ± 0.441
5.387GluGly: 5.387 ± 0.709
1.651GluHis: 1.651 ± 0.548
2.606GluIle: 2.606 ± 0.354
2.867GluLys: 2.867 ± 0.592
6.342GluLeu: 6.342 ± 0.823
1.651GluMet: 1.651 ± 0.547
2.259GluAsn: 2.259 ± 0.369
2.52GluPro: 2.52 ± 0.689
3.475GluGln: 3.475 ± 0.78
4.17GluArg: 4.17 ± 0.621
4.17GluSer: 4.17 ± 0.602
3.649GluThr: 3.649 ± 0.566
5.039GluVal: 5.039 ± 0.703
0.521GluTrp: 0.521 ± 0.24
2.693GluTyr: 2.693 ± 0.367
0.0GluXaa: 0.0 ± 0.0
Phe
2.259PheAla: 2.259 ± 0.557
0.521PheCys: 0.521 ± 0.21
3.041PheAsp: 3.041 ± 0.556
1.738PheGlu: 1.738 ± 0.3
0.695PhePhe: 0.695 ± 0.262
3.128PheGly: 3.128 ± 0.604
0.521PheHis: 0.521 ± 0.298
1.39PheIle: 1.39 ± 0.389
2.172PheLys: 2.172 ± 0.403
2.954PheLeu: 2.954 ± 0.56
0.869PheMet: 0.869 ± 0.272
1.998PheAsn: 1.998 ± 0.46
1.738PhePro: 1.738 ± 0.451
1.129PheGln: 1.129 ± 0.331
1.564PheArg: 1.564 ± 0.372
2.693PheSer: 2.693 ± 0.521
1.825PheThr: 1.825 ± 0.388
2.346PheVal: 2.346 ± 0.488
0.521PheTrp: 0.521 ± 0.189
1.043PheTyr: 1.043 ± 0.227
0.0PheXaa: 0.0 ± 0.0
Gly
7.124GlyAla: 7.124 ± 0.957
1.303GlyCys: 1.303 ± 0.579
5.734GlyAsp: 5.734 ± 0.528
5.734GlyGlu: 5.734 ± 0.743
2.606GlyPhe: 2.606 ± 0.372
6.169GlyGly: 6.169 ± 0.818
1.477GlyHis: 1.477 ± 0.351
5.821GlyIle: 5.821 ± 1.137
6.082GlyLys: 6.082 ± 0.93
6.429GlyLeu: 6.429 ± 0.734
2.172GlyMet: 2.172 ± 0.468
3.041GlyAsn: 3.041 ± 0.497
1.564GlyPro: 1.564 ± 0.422
2.693GlyGln: 2.693 ± 0.517
4.257GlyArg: 4.257 ± 0.456
6.255GlySer: 6.255 ± 0.875
4.431GlyThr: 4.431 ± 0.702
5.3GlyVal: 5.3 ± 0.939
1.564GlyTrp: 1.564 ± 0.466
2.867GlyTyr: 2.867 ± 0.502
0.0GlyXaa: 0.0 ± 0.0
His
1.043HisAla: 1.043 ± 0.29
0.434HisCys: 0.434 ± 0.196
1.303HisAsp: 1.303 ± 0.294
1.477HisGlu: 1.477 ± 0.483
0.782HisPhe: 0.782 ± 0.226
1.39HisGly: 1.39 ± 0.367
0.608HisHis: 0.608 ± 0.258
1.216HisIle: 1.216 ± 0.402
1.303HisLys: 1.303 ± 0.29
1.129HisLeu: 1.129 ± 0.413
0.521HisMet: 0.521 ± 0.186
0.261HisAsn: 0.261 ± 0.135
0.956HisPro: 0.956 ± 0.249
0.348HisGln: 0.348 ± 0.177
0.869HisArg: 0.869 ± 0.338
0.608HisSer: 0.608 ± 0.218
0.869HisThr: 0.869 ± 0.245
1.651HisVal: 1.651 ± 0.351
0.174HisTrp: 0.174 ± 0.114
1.129HisTyr: 1.129 ± 0.212
0.0HisXaa: 0.0 ± 0.0
Ile
4.431IleAla: 4.431 ± 0.5
0.608IleCys: 0.608 ± 0.244
3.736IleAsp: 3.736 ± 0.543
3.041IleGlu: 3.041 ± 0.468
0.782IlePhe: 0.782 ± 0.251
3.91IleGly: 3.91 ± 0.487
0.782IleHis: 0.782 ± 0.268
2.52IleIle: 2.52 ± 0.519
2.954IleLys: 2.954 ± 0.472
3.562IleLeu: 3.562 ± 0.594
0.782IleMet: 0.782 ± 0.292
1.998IleAsn: 1.998 ± 0.489
2.693IlePro: 2.693 ± 0.479
2.346IleGln: 2.346 ± 0.531
3.823IleArg: 3.823 ± 0.668
2.433IleSer: 2.433 ± 0.363
2.867IleThr: 2.867 ± 0.888
3.041IleVal: 3.041 ± 0.562
0.521IleTrp: 0.521 ± 0.204
1.564IleTyr: 1.564 ± 0.376
0.0IleXaa: 0.0 ± 0.0
Lys
7.819LysAla: 7.819 ± 1.125
0.608LysCys: 0.608 ± 0.22
3.649LysAsp: 3.649 ± 0.574
5.387LysGlu: 5.387 ± 0.641
2.52LysPhe: 2.52 ± 0.415
6.864LysGly: 6.864 ± 0.88
1.825LysHis: 1.825 ± 0.419
2.172LysIle: 2.172 ± 0.517
3.475LysLys: 3.475 ± 1.043
5.821LysLeu: 5.821 ± 0.751
2.085LysMet: 2.085 ± 0.443
2.867LysAsn: 2.867 ± 0.396
2.954LysPro: 2.954 ± 0.702
2.085LysGln: 2.085 ± 0.504
3.736LysArg: 3.736 ± 0.777
3.91LysSer: 3.91 ± 0.67
3.041LysThr: 3.041 ± 0.401
5.039LysVal: 5.039 ± 0.764
0.956LysTrp: 0.956 ± 0.355
1.303LysTyr: 1.303 ± 0.414
0.0LysXaa: 0.0 ± 0.0
Leu
8.08LeuAla: 8.08 ± 0.945
0.608LeuCys: 0.608 ± 0.258
4.692LeuAsp: 4.692 ± 0.576
7.211LeuGlu: 7.211 ± 1.113
2.693LeuPhe: 2.693 ± 0.387
5.126LeuGly: 5.126 ± 0.653
1.303LeuHis: 1.303 ± 0.342
2.954LeuIle: 2.954 ± 0.448
6.95LeuLys: 6.95 ± 0.646
6.169LeuLeu: 6.169 ± 0.866
2.346LeuMet: 2.346 ± 0.361
4.431LeuAsn: 4.431 ± 0.6
3.128LeuPro: 3.128 ± 0.547
3.215LeuGln: 3.215 ± 0.509
4.778LeuArg: 4.778 ± 0.717
4.692LeuSer: 4.692 ± 0.513
4.952LeuThr: 4.952 ± 0.902
4.778LeuVal: 4.778 ± 0.544
1.216LeuTrp: 1.216 ± 0.415
2.78LeuTyr: 2.78 ± 0.509
0.0LeuXaa: 0.0 ± 0.0
Met
3.215MetAla: 3.215 ± 0.461
0.174MetCys: 0.174 ± 0.138
1.911MetAsp: 1.911 ± 0.39
1.303MetGlu: 1.303 ± 0.297
0.782MetPhe: 0.782 ± 0.277
1.825MetGly: 1.825 ± 0.397
0.434MetHis: 0.434 ± 0.235
0.956MetIle: 0.956 ± 0.317
1.564MetLys: 1.564 ± 0.295
2.954MetLeu: 2.954 ± 0.51
0.521MetMet: 0.521 ± 0.218
1.043MetAsn: 1.043 ± 0.282
0.695MetPro: 0.695 ± 0.199
2.172MetGln: 2.172 ± 0.449
0.869MetArg: 0.869 ± 0.257
1.564MetSer: 1.564 ± 0.377
2.085MetThr: 2.085 ± 0.514
1.477MetVal: 1.477 ± 0.389
0.0MetTrp: 0.0 ± 0.0
0.521MetTyr: 0.521 ± 0.173
0.0MetXaa: 0.0 ± 0.0
Asn
3.215AsnAla: 3.215 ± 0.444
0.521AsnCys: 0.521 ± 0.215
2.085AsnAsp: 2.085 ± 0.361
2.693AsnGlu: 2.693 ± 0.542
1.39AsnPhe: 1.39 ± 0.375
3.736AsnGly: 3.736 ± 0.505
0.695AsnHis: 0.695 ± 0.341
3.041AsnIle: 3.041 ± 0.589
2.172AsnLys: 2.172 ± 0.303
3.301AsnLeu: 3.301 ± 0.619
1.043AsnMet: 1.043 ± 0.268
1.911AsnAsn: 1.911 ± 0.498
2.259AsnPro: 2.259 ± 0.342
1.303AsnGln: 1.303 ± 0.295
1.738AsnArg: 1.738 ± 0.52
3.475AsnSer: 3.475 ± 0.689
2.606AsnThr: 2.606 ± 0.467
2.693AsnVal: 2.693 ± 0.538
0.869AsnTrp: 0.869 ± 0.276
1.651AsnTyr: 1.651 ± 0.359
0.0AsnXaa: 0.0 ± 0.0
Pro
2.693ProAla: 2.693 ± 0.507
0.434ProCys: 0.434 ± 0.194
1.998ProAsp: 1.998 ± 0.472
4.431ProGlu: 4.431 ± 0.766
1.477ProPhe: 1.477 ± 0.302
2.693ProGly: 2.693 ± 0.464
0.608ProHis: 0.608 ± 0.195
1.477ProIle: 1.477 ± 0.412
3.128ProLys: 3.128 ± 0.576
2.867ProLeu: 2.867 ± 0.452
0.521ProMet: 0.521 ± 0.223
1.998ProAsn: 1.998 ± 0.49
0.521ProPro: 0.521 ± 0.254
1.39ProGln: 1.39 ± 0.302
1.825ProArg: 1.825 ± 0.373
2.346ProSer: 2.346 ± 0.389
1.564ProThr: 1.564 ± 0.479
3.041ProVal: 3.041 ± 0.392
0.782ProTrp: 0.782 ± 0.203
1.651ProTyr: 1.651 ± 0.434
0.0ProXaa: 0.0 ± 0.0
Gln
3.736GlnAla: 3.736 ± 0.709
0.261GlnCys: 0.261 ± 0.15
2.172GlnAsp: 2.172 ± 0.344
2.867GlnGlu: 2.867 ± 0.442
1.651GlnPhe: 1.651 ± 0.296
2.78GlnGly: 2.78 ± 0.497
0.348GlnHis: 0.348 ± 0.198
1.825GlnIle: 1.825 ± 0.381
2.52GlnLys: 2.52 ± 0.51
3.91GlnLeu: 3.91 ± 0.56
1.738GlnMet: 1.738 ± 0.584
1.129GlnAsn: 1.129 ± 0.334
1.564GlnPro: 1.564 ± 0.203
3.649GlnGln: 3.649 ± 0.668
2.172GlnArg: 2.172 ± 0.457
2.606GlnSer: 2.606 ± 0.542
1.477GlnThr: 1.477 ± 0.496
2.52GlnVal: 2.52 ± 0.476
0.956GlnTrp: 0.956 ± 0.266
1.477GlnTyr: 1.477 ± 0.386
0.0GlnXaa: 0.0 ± 0.0
Arg
4.778ArgAla: 4.778 ± 0.808
0.782ArgCys: 0.782 ± 0.268
3.128ArgAsp: 3.128 ± 0.569
4.083ArgGlu: 4.083 ± 0.615
1.564ArgPhe: 1.564 ± 0.396
4.257ArgGly: 4.257 ± 0.694
0.782ArgHis: 0.782 ± 0.292
2.867ArgIle: 2.867 ± 0.487
3.823ArgLys: 3.823 ± 0.603
4.952ArgLeu: 4.952 ± 0.742
1.303ArgMet: 1.303 ± 0.342
2.085ArgAsn: 2.085 ± 0.371
2.085ArgPro: 2.085 ± 0.341
2.606ArgGln: 2.606 ± 0.46
2.954ArgArg: 2.954 ± 0.409
3.562ArgSer: 3.562 ± 0.463
2.606ArgThr: 2.606 ± 0.527
3.475ArgVal: 3.475 ± 0.704
1.216ArgTrp: 1.216 ± 0.34
1.043ArgTyr: 1.043 ± 0.239
0.0ArgXaa: 0.0 ± 0.0
Ser
4.952SerAla: 4.952 ± 0.678
0.695SerCys: 0.695 ± 0.227
4.431SerAsp: 4.431 ± 0.674
3.215SerGlu: 3.215 ± 0.622
2.867SerPhe: 2.867 ± 0.447
5.56SerGly: 5.56 ± 0.848
1.477SerHis: 1.477 ± 0.356
3.041SerIle: 3.041 ± 0.708
4.605SerLys: 4.605 ± 0.665
4.344SerLeu: 4.344 ± 0.741
1.043SerMet: 1.043 ± 0.303
2.606SerAsn: 2.606 ± 0.709
1.998SerPro: 1.998 ± 0.384
2.693SerGln: 2.693 ± 0.437
3.475SerArg: 3.475 ± 0.67
3.388SerSer: 3.388 ± 0.595
3.91SerThr: 3.91 ± 0.703
4.344SerVal: 4.344 ± 0.683
0.956SerTrp: 0.956 ± 0.481
2.259SerTyr: 2.259 ± 0.559
0.0SerXaa: 0.0 ± 0.0
Thr
3.823ThrAla: 3.823 ± 0.797
0.695ThrCys: 0.695 ± 0.283
3.041ThrAsp: 3.041 ± 0.37
3.388ThrGlu: 3.388 ± 0.557
1.825ThrPhe: 1.825 ± 0.464
5.387ThrGly: 5.387 ± 0.965
1.39ThrHis: 1.39 ± 0.302
3.649ThrIle: 3.649 ± 0.477
4.865ThrLys: 4.865 ± 0.727
5.126ThrLeu: 5.126 ± 0.716
1.39ThrMet: 1.39 ± 0.289
1.651ThrAsn: 1.651 ± 0.549
2.954ThrPro: 2.954 ± 0.516
1.825ThrGln: 1.825 ± 0.349
2.52ThrArg: 2.52 ± 0.422
3.215ThrSer: 3.215 ± 0.553
2.606ThrThr: 2.606 ± 0.627
4.083ThrVal: 4.083 ± 0.962
0.348ThrTrp: 0.348 ± 0.168
1.477ThrTyr: 1.477 ± 0.319
0.0ThrXaa: 0.0 ± 0.0
Val
5.734ValAla: 5.734 ± 0.702
0.348ValCys: 0.348 ± 0.161
3.823ValAsp: 3.823 ± 0.577
4.778ValGlu: 4.778 ± 0.611
1.911ValPhe: 1.911 ± 0.372
5.821ValGly: 5.821 ± 0.747
1.303ValHis: 1.303 ± 0.363
3.823ValIle: 3.823 ± 0.541
4.518ValLys: 4.518 ± 0.596
5.474ValLeu: 5.474 ± 0.721
1.129ValMet: 1.129 ± 0.293
2.954ValAsn: 2.954 ± 0.501
2.606ValPro: 2.606 ± 0.434
1.911ValGln: 1.911 ± 0.384
4.344ValArg: 4.344 ± 0.699
4.431ValSer: 4.431 ± 0.92
6.169ValThr: 6.169 ± 1.045
5.387ValVal: 5.387 ± 0.978
0.782ValTrp: 0.782 ± 0.326
2.259ValTyr: 2.259 ± 0.538
0.0ValXaa: 0.0 ± 0.0
Trp
0.782TrpAla: 0.782 ± 0.271
0.434TrpCys: 0.434 ± 0.199
0.521TrpAsp: 0.521 ± 0.17
1.129TrpGlu: 1.129 ± 0.305
0.521TrpPhe: 0.521 ± 0.22
0.695TrpGly: 0.695 ± 0.272
0.261TrpHis: 0.261 ± 0.207
0.695TrpIle: 0.695 ± 0.337
1.39TrpLys: 1.39 ± 0.439
1.477TrpLeu: 1.477 ± 0.415
0.434TrpMet: 0.434 ± 0.183
0.782TrpAsn: 0.782 ± 0.319
0.261TrpPro: 0.261 ± 0.151
0.869TrpGln: 0.869 ± 0.31
0.956TrpArg: 0.956 ± 0.296
0.956TrpSer: 0.956 ± 0.301
0.434TrpThr: 0.434 ± 0.184
1.477TrpVal: 1.477 ± 0.403
0.261TrpTrp: 0.261 ± 0.145
0.174TrpTyr: 0.174 ± 0.133
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.998TyrAla: 1.998 ± 0.326
0.261TyrCys: 0.261 ± 0.129
2.52TyrAsp: 2.52 ± 0.583
2.433TyrGlu: 2.433 ± 0.49
1.129TyrPhe: 1.129 ± 0.259
3.215TyrGly: 3.215 ± 0.457
0.434TyrHis: 0.434 ± 0.184
1.564TyrIle: 1.564 ± 0.554
2.085TyrLys: 2.085 ± 0.393
2.172TyrLeu: 2.172 ± 0.354
1.216TyrMet: 1.216 ± 0.308
1.998TyrAsn: 1.998 ± 0.48
0.956TyrPro: 0.956 ± 0.257
1.303TyrGln: 1.303 ± 0.423
1.825TyrArg: 1.825 ± 0.39
1.216TyrSer: 1.216 ± 0.388
2.433TyrThr: 2.433 ± 0.467
2.259TyrVal: 2.259 ± 0.511
0.521TyrTrp: 0.521 ± 0.205
0.521TyrTyr: 0.521 ± 0.224
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (11511 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski