Amino acid dipepetide frequency for Lactobacillus phage KC5a

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.8AlaAla: 6.8 ± 2.883
0.258AlaCys: 0.258 ± 0.144
4.734AlaAsp: 4.734 ± 0.69
4.476AlaGlu: 4.476 ± 0.749
2.582AlaPhe: 2.582 ± 0.637
5.25AlaGly: 5.25 ± 0.934
0.861AlaHis: 0.861 ± 0.248
5.595AlaIle: 5.595 ± 1.197
6.628AlaLys: 6.628 ± 0.758
7.058AlaLeu: 7.058 ± 1.848
2.066AlaMet: 2.066 ± 0.958
4.39AlaAsn: 4.39 ± 0.654
1.463AlaPro: 1.463 ± 0.513
1.98AlaGln: 1.98 ± 0.328
2.496AlaArg: 2.496 ± 0.663
4.39AlaSer: 4.39 ± 0.947
4.906AlaThr: 4.906 ± 0.646
5.595AlaVal: 5.595 ± 2.136
0.861AlaTrp: 0.861 ± 0.219
2.066AlaTyr: 2.066 ± 0.381
0.0AlaXaa: 0.0 ± 0.0
Cys
0.344CysAla: 0.344 ± 0.171
0.086CysCys: 0.086 ± 0.073
0.775CysAsp: 0.775 ± 0.232
0.603CysGlu: 0.603 ± 0.241
0.086CysPhe: 0.086 ± 0.091
0.516CysGly: 0.516 ± 0.264
0.086CysHis: 0.086 ± 0.076
0.516CysIle: 0.516 ± 0.239
0.172CysLys: 0.172 ± 0.125
0.775CysLeu: 0.775 ± 0.262
0.172CysMet: 0.172 ± 0.134
0.0CysAsn: 0.0 ± 0.0
0.344CysPro: 0.344 ± 0.156
0.086CysGln: 0.086 ± 0.101
0.344CysArg: 0.344 ± 0.171
0.516CysSer: 0.516 ± 0.215
0.516CysThr: 0.516 ± 0.208
0.516CysVal: 0.516 ± 0.197
0.0CysTrp: 0.0 ± 0.0
0.516CysTyr: 0.516 ± 0.255
0.0CysXaa: 0.0 ± 0.0
Asp
4.906AspAla: 4.906 ± 0.698
0.516AspCys: 0.516 ± 0.198
5.767AspAsp: 5.767 ± 0.661
5.078AspGlu: 5.078 ± 0.738
2.84AspPhe: 2.84 ± 0.5
4.304AspGly: 4.304 ± 0.529
0.344AspHis: 0.344 ± 0.195
4.132AspIle: 4.132 ± 0.668
6.972AspLys: 6.972 ± 0.817
6.111AspLeu: 6.111 ± 0.842
2.152AspMet: 2.152 ± 0.386
3.357AspAsn: 3.357 ± 0.716
2.582AspPro: 2.582 ± 0.421
2.926AspGln: 2.926 ± 0.517
2.41AspArg: 2.41 ± 0.478
3.443AspSer: 3.443 ± 0.536
3.357AspThr: 3.357 ± 0.454
4.218AspVal: 4.218 ± 0.692
1.205AspTrp: 1.205 ± 0.32
3.443AspTyr: 3.443 ± 0.598
0.0AspXaa: 0.0 ± 0.0
Glu
4.132GluAla: 4.132 ± 0.504
0.775GluCys: 0.775 ± 0.252
3.959GluAsp: 3.959 ± 0.968
3.959GluGlu: 3.959 ± 0.813
2.238GluPhe: 2.238 ± 0.487
3.185GluGly: 3.185 ± 0.695
0.775GluHis: 0.775 ± 0.273
4.476GluIle: 4.476 ± 0.54
5.681GluLys: 5.681 ± 1.206
6.025GluLeu: 6.025 ± 0.787
1.549GluMet: 1.549 ± 0.466
4.132GluAsn: 4.132 ± 0.748
1.377GluPro: 1.377 ± 0.295
3.099GluGln: 3.099 ± 0.531
2.41GluArg: 2.41 ± 0.717
2.926GluSer: 2.926 ± 0.482
2.84GluThr: 2.84 ± 0.484
3.529GluVal: 3.529 ± 0.653
0.861GluTrp: 0.861 ± 0.298
2.668GluTyr: 2.668 ± 0.475
0.0GluXaa: 0.0 ± 0.0
Phe
2.238PheAla: 2.238 ± 0.409
0.344PheCys: 0.344 ± 0.167
4.045PheAsp: 4.045 ± 0.799
3.099PheGlu: 3.099 ± 0.579
1.291PhePhe: 1.291 ± 0.345
2.582PheGly: 2.582 ± 0.498
0.258PheHis: 0.258 ± 0.151
1.377PheIle: 1.377 ± 0.458
3.099PheLys: 3.099 ± 0.765
1.894PheLeu: 1.894 ± 0.347
0.775PheMet: 0.775 ± 0.306
2.324PheAsn: 2.324 ± 0.44
0.689PhePro: 0.689 ± 0.278
1.205PheGln: 1.205 ± 0.338
1.549PheArg: 1.549 ± 0.409
2.41PheSer: 2.41 ± 0.45
2.668PheThr: 2.668 ± 0.576
1.721PheVal: 1.721 ± 0.384
0.258PheTrp: 0.258 ± 0.132
1.377PheTyr: 1.377 ± 0.373
0.0PheXaa: 0.0 ± 0.0
Gly
5.25GlyAla: 5.25 ± 1.831
0.258GlyCys: 0.258 ± 0.149
3.099GlyAsp: 3.099 ± 0.439
3.013GlyGlu: 3.013 ± 0.477
3.185GlyPhe: 3.185 ± 0.736
4.906GlyGly: 4.906 ± 1.083
1.119GlyHis: 1.119 ± 0.331
5.25GlyIle: 5.25 ± 0.926
4.734GlyLys: 4.734 ± 0.607
5.853GlyLeu: 5.853 ± 1.909
1.549GlyMet: 1.549 ± 0.503
3.701GlyAsn: 3.701 ± 0.716
1.205GlyPro: 1.205 ± 0.375
2.496GlyGln: 2.496 ± 0.479
2.754GlyArg: 2.754 ± 0.358
3.615GlySer: 3.615 ± 0.52
3.787GlyThr: 3.787 ± 0.841
4.562GlyVal: 4.562 ± 0.967
0.775GlyTrp: 0.775 ± 0.279
3.013GlyTyr: 3.013 ± 0.562
0.0GlyXaa: 0.0 ± 0.0
His
0.947HisAla: 0.947 ± 0.265
0.172HisCys: 0.172 ± 0.118
0.947HisAsp: 0.947 ± 0.341
0.947HisGlu: 0.947 ± 0.342
0.344HisPhe: 0.344 ± 0.171
0.516HisGly: 0.516 ± 0.222
0.43HisHis: 0.43 ± 0.203
0.775HisIle: 0.775 ± 0.245
1.033HisLys: 1.033 ± 0.267
1.033HisLeu: 1.033 ± 0.283
0.172HisMet: 0.172 ± 0.128
0.689HisAsn: 0.689 ± 0.293
0.603HisPro: 0.603 ± 0.23
0.516HisGln: 0.516 ± 0.203
0.775HisArg: 0.775 ± 0.325
0.603HisSer: 0.603 ± 0.26
0.603HisThr: 0.603 ± 0.23
0.516HisVal: 0.516 ± 0.236
0.344HisTrp: 0.344 ± 0.165
0.775HisTyr: 0.775 ± 0.233
0.0HisXaa: 0.0 ± 0.0
Ile
5.25IleAla: 5.25 ± 1.245
0.258IleCys: 0.258 ± 0.142
6.197IleAsp: 6.197 ± 0.808
3.959IleGlu: 3.959 ± 0.639
2.496IlePhe: 2.496 ± 0.434
3.185IleGly: 3.185 ± 0.68
0.43IleHis: 0.43 ± 0.185
3.357IleIle: 3.357 ± 0.542
6.025IleLys: 6.025 ± 0.847
4.992IleLeu: 4.992 ± 0.807
1.894IleMet: 1.894 ± 0.382
5.681IleAsn: 5.681 ± 0.581
2.152IlePro: 2.152 ± 0.497
2.668IleGln: 2.668 ± 0.47
2.238IleArg: 2.238 ± 0.446
4.132IleSer: 4.132 ± 0.619
4.39IleThr: 4.39 ± 0.665
3.271IleVal: 3.271 ± 0.638
0.775IleTrp: 0.775 ± 0.261
2.238IleTyr: 2.238 ± 0.457
0.0IleXaa: 0.0 ± 0.0
Lys
6.456LysAla: 6.456 ± 0.595
0.172LysCys: 0.172 ± 0.14
6.972LysAsp: 6.972 ± 0.945
6.456LysGlu: 6.456 ± 0.999
2.926LysPhe: 2.926 ± 0.611
3.701LysGly: 3.701 ± 0.501
1.205LysHis: 1.205 ± 0.294
5.423LysIle: 5.423 ± 0.603
7.833LysLys: 7.833 ± 1.199
6.972LysLeu: 6.972 ± 0.798
2.84LysMet: 2.84 ± 0.375
4.648LysAsn: 4.648 ± 0.607
2.84LysPro: 2.84 ± 0.608
3.443LysGln: 3.443 ± 0.565
3.787LysArg: 3.787 ± 0.708
5.337LysSer: 5.337 ± 0.709
5.595LysThr: 5.595 ± 0.709
5.078LysVal: 5.078 ± 0.581
0.861LysTrp: 0.861 ± 0.34
3.013LysTyr: 3.013 ± 0.622
0.0LysXaa: 0.0 ± 0.0
Leu
6.025LeuAla: 6.025 ± 1.178
0.516LeuCys: 0.516 ± 0.198
5.595LeuAsp: 5.595 ± 0.788
3.959LeuGlu: 3.959 ± 0.825
2.238LeuPhe: 2.238 ± 0.426
6.025LeuGly: 6.025 ± 0.745
1.549LeuHis: 1.549 ± 0.357
4.82LeuIle: 4.82 ± 0.766
8.177LeuLys: 8.177 ± 0.824
8.005LeuLeu: 8.005 ± 1.664
2.152LeuMet: 2.152 ± 0.833
5.767LeuAsn: 5.767 ± 0.754
3.013LeuPro: 3.013 ± 0.552
2.582LeuGln: 2.582 ± 0.455
4.218LeuArg: 4.218 ± 0.734
4.562LeuSer: 4.562 ± 0.571
5.853LeuThr: 5.853 ± 0.83
6.972LeuVal: 6.972 ± 1.825
0.603LeuTrp: 0.603 ± 0.226
2.238LeuTyr: 2.238 ± 0.521
0.0LeuXaa: 0.0 ± 0.0
Met
2.582MetAla: 2.582 ± 0.879
0.086MetCys: 0.086 ± 0.088
1.377MetAsp: 1.377 ± 0.321
1.463MetGlu: 1.463 ± 0.333
0.775MetPhe: 0.775 ± 0.293
1.549MetGly: 1.549 ± 0.398
0.172MetHis: 0.172 ± 0.11
2.152MetIle: 2.152 ± 0.538
1.635MetLys: 1.635 ± 0.382
2.582MetLeu: 2.582 ± 0.7
0.689MetMet: 0.689 ± 0.235
1.463MetAsn: 1.463 ± 0.405
0.861MetPro: 0.861 ± 0.271
0.775MetGln: 0.775 ± 0.251
0.947MetArg: 0.947 ± 0.317
1.377MetSer: 1.377 ± 0.37
2.238MetThr: 2.238 ± 0.427
2.496MetVal: 2.496 ± 0.759
0.0MetTrp: 0.0 ± 0.0
1.205MetTyr: 1.205 ± 0.325
0.0MetXaa: 0.0 ± 0.0
Asn
4.045AsnAla: 4.045 ± 0.518
0.861AsnCys: 0.861 ± 0.253
4.045AsnAsp: 4.045 ± 0.711
2.926AsnGlu: 2.926 ± 0.61
2.324AsnPhe: 2.324 ± 0.513
5.078AsnGly: 5.078 ± 0.845
0.775AsnHis: 0.775 ± 0.301
3.443AsnIle: 3.443 ± 0.679
5.681AsnLys: 5.681 ± 0.737
4.304AsnLeu: 4.304 ± 0.85
1.894AsnMet: 1.894 ± 0.385
4.132AsnAsn: 4.132 ± 0.639
2.152AsnPro: 2.152 ± 0.351
1.721AsnGln: 1.721 ± 0.407
2.152AsnArg: 2.152 ± 0.453
4.218AsnSer: 4.218 ± 0.575
3.357AsnThr: 3.357 ± 0.479
4.304AsnVal: 4.304 ± 0.661
0.861AsnTrp: 0.861 ± 0.251
3.185AsnTyr: 3.185 ± 0.632
0.0AsnXaa: 0.0 ± 0.0
Pro
2.582ProAla: 2.582 ± 0.532
0.0ProCys: 0.0 ± 0.0
2.84ProAsp: 2.84 ± 0.554
2.066ProGlu: 2.066 ± 0.4
0.689ProPhe: 0.689 ± 0.275
1.808ProGly: 1.808 ± 0.422
0.43ProHis: 0.43 ± 0.174
2.668ProIle: 2.668 ± 0.523
2.84ProLys: 2.84 ± 0.55
1.463ProLeu: 1.463 ± 0.294
0.775ProMet: 0.775 ± 0.311
1.635ProAsn: 1.635 ± 0.325
0.603ProPro: 0.603 ± 0.217
1.033ProGln: 1.033 ± 0.249
1.291ProArg: 1.291 ± 0.315
2.238ProSer: 2.238 ± 0.524
1.808ProThr: 1.808 ± 0.353
2.152ProVal: 2.152 ± 0.489
0.603ProTrp: 0.603 ± 0.218
1.119ProTyr: 1.119 ± 0.355
0.0ProXaa: 0.0 ± 0.0
Gln
3.185GlnAla: 3.185 ± 0.545
0.258GlnCys: 0.258 ± 0.141
2.238GlnAsp: 2.238 ± 0.493
1.894GlnGlu: 1.894 ± 0.455
1.549GlnPhe: 1.549 ± 0.427
1.635GlnGly: 1.635 ± 0.403
0.172GlnHis: 0.172 ± 0.131
2.84GlnIle: 2.84 ± 0.537
2.668GlnLys: 2.668 ± 0.507
4.304GlnLeu: 4.304 ± 0.844
1.377GlnMet: 1.377 ± 0.235
2.152GlnAsn: 2.152 ± 0.482
0.947GlnPro: 0.947 ± 0.226
2.582GlnGln: 2.582 ± 0.507
1.463GlnArg: 1.463 ± 0.374
2.324GlnSer: 2.324 ± 0.375
3.013GlnThr: 3.013 ± 0.47
1.894GlnVal: 1.894 ± 0.368
0.516GlnTrp: 0.516 ± 0.197
1.635GlnTyr: 1.635 ± 0.43
0.0GlnXaa: 0.0 ± 0.0
Arg
2.754ArgAla: 2.754 ± 0.516
0.516ArgCys: 0.516 ± 0.246
1.894ArgAsp: 1.894 ± 0.431
2.41ArgGlu: 2.41 ± 0.629
0.861ArgPhe: 0.861 ± 0.241
2.152ArgGly: 2.152 ± 0.455
0.603ArgHis: 0.603 ± 0.267
3.099ArgIle: 3.099 ± 0.611
3.701ArgLys: 3.701 ± 0.627
3.873ArgLeu: 3.873 ± 0.69
0.603ArgMet: 0.603 ± 0.24
2.668ArgAsn: 2.668 ± 0.474
1.463ArgPro: 1.463 ± 0.471
1.119ArgGln: 1.119 ± 0.378
2.41ArgArg: 2.41 ± 0.424
2.238ArgSer: 2.238 ± 0.457
1.98ArgThr: 1.98 ± 0.426
2.496ArgVal: 2.496 ± 0.49
0.775ArgTrp: 0.775 ± 0.239
1.894ArgTyr: 1.894 ± 0.45
0.0ArgXaa: 0.0 ± 0.0
Ser
4.734SerAla: 4.734 ± 1.031
0.344SerCys: 0.344 ± 0.172
3.787SerAsp: 3.787 ± 0.615
3.443SerGlu: 3.443 ± 0.666
2.582SerPhe: 2.582 ± 0.432
3.873SerGly: 3.873 ± 0.995
1.205SerHis: 1.205 ± 0.36
3.873SerIle: 3.873 ± 0.636
4.734SerLys: 4.734 ± 0.6
4.734SerLeu: 4.734 ± 0.683
1.463SerMet: 1.463 ± 0.53
3.873SerAsn: 3.873 ± 0.526
2.582SerPro: 2.582 ± 0.48
2.324SerGln: 2.324 ± 0.425
2.066SerArg: 2.066 ± 0.349
4.734SerSer: 4.734 ± 0.622
3.529SerThr: 3.529 ± 0.564
3.873SerVal: 3.873 ± 0.67
0.947SerTrp: 0.947 ± 0.304
2.668SerTyr: 2.668 ± 0.522
0.0SerXaa: 0.0 ± 0.0
Thr
4.218ThrAla: 4.218 ± 0.693
0.603ThrCys: 0.603 ± 0.233
4.218ThrAsp: 4.218 ± 0.61
3.099ThrGlu: 3.099 ± 0.669
1.808ThrPhe: 1.808 ± 0.322
4.045ThrGly: 4.045 ± 0.601
0.689ThrHis: 0.689 ± 0.258
5.078ThrIle: 5.078 ± 0.669
5.595ThrLys: 5.595 ± 0.783
5.25ThrLeu: 5.25 ± 0.904
1.721ThrMet: 1.721 ± 0.452
3.013ThrAsn: 3.013 ± 0.61
1.635ThrPro: 1.635 ± 0.34
2.668ThrGln: 2.668 ± 0.527
2.238ThrArg: 2.238 ± 0.489
3.443ThrSer: 3.443 ± 0.483
3.787ThrThr: 3.787 ± 0.824
4.734ThrVal: 4.734 ± 0.861
0.861ThrTrp: 0.861 ± 0.275
3.271ThrTyr: 3.271 ± 0.665
0.0ThrXaa: 0.0 ± 0.0
Val
5.509ValAla: 5.509 ± 1.657
0.344ValCys: 0.344 ± 0.205
4.39ValAsp: 4.39 ± 0.649
4.39ValGlu: 4.39 ± 0.861
2.066ValPhe: 2.066 ± 0.342
5.595ValGly: 5.595 ± 2.386
0.603ValHis: 0.603 ± 0.228
3.271ValIle: 3.271 ± 0.436
5.078ValLys: 5.078 ± 0.702
4.39ValLeu: 4.39 ± 0.723
1.205ValMet: 1.205 ± 0.318
5.337ValAsn: 5.337 ± 0.882
2.668ValPro: 2.668 ± 0.531
2.582ValGln: 2.582 ± 0.618
2.324ValArg: 2.324 ± 0.434
4.734ValSer: 4.734 ± 0.843
4.648ValThr: 4.648 ± 0.832
5.939ValVal: 5.939 ± 0.784
0.43ValTrp: 0.43 ± 0.22
1.98ValTyr: 1.98 ± 0.448
0.0ValXaa: 0.0 ± 0.0
Trp
0.43TrpAla: 0.43 ± 0.21
0.086TrpCys: 0.086 ± 0.081
0.861TrpAsp: 0.861 ± 0.228
0.516TrpGlu: 0.516 ± 0.246
0.258TrpPhe: 0.258 ± 0.136
1.119TrpGly: 1.119 ± 0.276
0.344TrpHis: 0.344 ± 0.167
0.775TrpIle: 0.775 ± 0.257
0.947TrpLys: 0.947 ± 0.328
1.291TrpLeu: 1.291 ± 0.312
0.0TrpMet: 0.0 ± 0.0
1.291TrpAsn: 1.291 ± 0.377
0.344TrpPro: 0.344 ± 0.192
1.119TrpGln: 1.119 ± 0.277
0.172TrpArg: 0.172 ± 0.127
0.516TrpSer: 0.516 ± 0.255
0.775TrpThr: 0.775 ± 0.306
1.033TrpVal: 1.033 ± 0.293
0.258TrpTrp: 0.258 ± 0.154
0.43TrpTyr: 0.43 ± 0.238
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.066TyrAla: 2.066 ± 0.406
0.516TyrCys: 0.516 ± 0.215
2.324TyrAsp: 2.324 ± 0.529
2.84TyrGlu: 2.84 ± 0.583
2.066TyrPhe: 2.066 ± 0.521
3.099TyrGly: 3.099 ± 0.431
0.689TyrHis: 0.689 ± 0.241
2.668TyrIle: 2.668 ± 0.489
2.324TyrLys: 2.324 ± 0.589
3.959TyrLeu: 3.959 ± 0.674
1.291TyrMet: 1.291 ± 0.315
1.291TyrAsn: 1.291 ± 0.408
1.119TyrPro: 1.119 ± 0.368
1.721TyrGln: 1.721 ± 0.556
1.463TyrArg: 1.463 ± 0.305
3.615TyrSer: 3.615 ± 0.753
2.324TyrThr: 2.324 ± 0.487
2.582TyrVal: 2.582 ± 0.601
0.775TyrTrp: 0.775 ± 0.258
1.033TyrTyr: 1.033 ± 0.287
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (11619 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski