Amino acid dipepetide frequency for Lactobacillus phage Lb

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.786AlaAla: 6.786 ± 0.932
0.316AlaCys: 0.316 ± 0.152
5.286AlaAsp: 5.286 ± 0.703
3.866AlaGlu: 3.866 ± 0.509
2.525AlaPhe: 2.525 ± 0.434
4.971AlaGly: 4.971 ± 0.818
0.71AlaHis: 0.71 ± 0.188
4.418AlaIle: 4.418 ± 0.716
6.233AlaLys: 6.233 ± 1.23
5.208AlaLeu: 5.208 ± 0.54
2.525AlaMet: 2.525 ± 0.418
5.208AlaAsn: 5.208 ± 0.551
1.815AlaPro: 1.815 ± 0.33
3.077AlaGln: 3.077 ± 0.54
2.367AlaArg: 2.367 ± 0.441
5.444AlaSer: 5.444 ± 0.675
5.286AlaThr: 5.286 ± 0.838
4.261AlaVal: 4.261 ± 0.518
0.868AlaTrp: 0.868 ± 0.219
3.551AlaTyr: 3.551 ± 0.57
0.0AlaXaa: 0.0 ± 0.0
Cys
0.237CysAla: 0.237 ± 0.132
0.079CysCys: 0.079 ± 0.079
0.552CysAsp: 0.552 ± 0.228
0.395CysGlu: 0.395 ± 0.159
0.237CysPhe: 0.237 ± 0.13
0.552CysGly: 0.552 ± 0.233
0.079CysHis: 0.079 ± 0.078
0.395CysIle: 0.395 ± 0.174
0.316CysLys: 0.316 ± 0.171
0.316CysLeu: 0.316 ± 0.159
0.237CysMet: 0.237 ± 0.141
0.158CysAsn: 0.158 ± 0.111
0.079CysPro: 0.079 ± 0.078
0.473CysGln: 0.473 ± 0.226
0.473CysArg: 0.473 ± 0.23
0.395CysSer: 0.395 ± 0.185
0.079CysThr: 0.079 ± 0.067
0.158CysVal: 0.158 ± 0.107
0.079CysTrp: 0.079 ± 0.077
0.158CysTyr: 0.158 ± 0.125
0.0CysXaa: 0.0 ± 0.0
Asp
3.156AspAla: 3.156 ± 0.497
0.631AspCys: 0.631 ± 0.2
6.312AspAsp: 6.312 ± 1.094
4.182AspGlu: 4.182 ± 0.637
3.077AspPhe: 3.077 ± 0.504
5.444AspGly: 5.444 ± 0.793
1.578AspHis: 1.578 ± 0.373
3.314AspIle: 3.314 ± 0.495
5.129AspLys: 5.129 ± 0.567
4.971AspLeu: 4.971 ± 0.871
1.894AspMet: 1.894 ± 0.418
3.393AspAsn: 3.393 ± 0.544
2.051AspPro: 2.051 ± 0.349
2.525AspGln: 2.525 ± 0.537
2.288AspArg: 2.288 ± 0.452
5.129AspSer: 5.129 ± 0.552
5.286AspThr: 5.286 ± 0.975
4.497AspVal: 4.497 ± 0.558
1.42AspTrp: 1.42 ± 0.378
3.551AspTyr: 3.551 ± 0.634
0.0AspXaa: 0.0 ± 0.0
Glu
4.103GluAla: 4.103 ± 0.587
0.316GluCys: 0.316 ± 0.146
2.525GluAsp: 2.525 ± 0.502
2.446GluGlu: 2.446 ± 0.476
2.604GluPhe: 2.604 ± 0.387
3.156GluGly: 3.156 ± 0.477
1.499GluHis: 1.499 ± 0.457
3.314GluIle: 3.314 ± 0.56
3.314GluLys: 3.314 ± 0.581
5.129GluLeu: 5.129 ± 0.574
1.973GluMet: 1.973 ± 0.328
2.051GluAsn: 2.051 ± 0.486
1.815GluPro: 1.815 ± 0.445
2.919GluGln: 2.919 ± 0.498
2.367GluArg: 2.367 ± 0.473
2.683GluSer: 2.683 ± 0.45
3.945GluThr: 3.945 ± 0.564
3.314GluVal: 3.314 ± 0.493
0.789GluTrp: 0.789 ± 0.293
1.499GluTyr: 1.499 ± 0.387
0.0GluXaa: 0.0 ± 0.0
Phe
2.446PheAla: 2.446 ± 0.503
0.158PheCys: 0.158 ± 0.115
3.314PheAsp: 3.314 ± 0.591
1.42PheGlu: 1.42 ± 0.399
1.105PhePhe: 1.105 ± 0.234
2.998PheGly: 2.998 ± 0.493
0.237PheHis: 0.237 ± 0.113
1.815PheIle: 1.815 ± 0.343
2.525PheLys: 2.525 ± 0.45
2.998PheLeu: 2.998 ± 0.512
1.341PheMet: 1.341 ± 0.313
2.13PheAsn: 2.13 ± 0.344
0.631PhePro: 0.631 ± 0.206
2.051PheGln: 2.051 ± 0.365
0.789PheArg: 0.789 ± 0.266
2.84PheSer: 2.84 ± 0.421
2.367PheThr: 2.367 ± 0.343
1.894PheVal: 1.894 ± 0.364
0.552PheTrp: 0.552 ± 0.235
1.105PheTyr: 1.105 ± 0.293
0.0PheXaa: 0.0 ± 0.0
Gly
4.813GlyAla: 4.813 ± 0.689
0.316GlyCys: 0.316 ± 0.158
4.182GlyAsp: 4.182 ± 0.634
3.156GlyGlu: 3.156 ± 0.478
2.762GlyPhe: 2.762 ± 0.512
4.576GlyGly: 4.576 ± 1.133
0.868GlyHis: 0.868 ± 0.337
4.497GlyIle: 4.497 ± 0.848
6.864GlyLys: 6.864 ± 0.743
4.892GlyLeu: 4.892 ± 0.776
2.762GlyMet: 2.762 ± 0.53
4.576GlyAsn: 4.576 ± 0.772
1.184GlyPro: 1.184 ± 0.222
2.919GlyGln: 2.919 ± 0.488
2.446GlyArg: 2.446 ± 0.532
5.05GlySer: 5.05 ± 1.021
5.05GlyThr: 5.05 ± 0.773
5.129GlyVal: 5.129 ± 0.676
1.026GlyTrp: 1.026 ± 0.347
4.261GlyTyr: 4.261 ± 0.595
0.0GlyXaa: 0.0 ± 0.0
His
0.631HisAla: 0.631 ± 0.242
0.316HisCys: 0.316 ± 0.168
1.973HisAsp: 1.973 ± 0.397
1.026HisGlu: 1.026 ± 0.263
1.184HisPhe: 1.184 ± 0.295
1.42HisGly: 1.42 ± 0.53
0.395HisHis: 0.395 ± 0.178
1.736HisIle: 1.736 ± 0.337
1.499HisLys: 1.499 ± 0.313
1.815HisLeu: 1.815 ± 0.474
0.316HisMet: 0.316 ± 0.171
0.868HisAsn: 0.868 ± 0.188
0.158HisPro: 0.158 ± 0.096
0.237HisGln: 0.237 ± 0.134
0.71HisArg: 0.71 ± 0.199
1.026HisSer: 1.026 ± 0.23
1.184HisThr: 1.184 ± 0.281
1.42HisVal: 1.42 ± 0.296
0.395HisTrp: 0.395 ± 0.216
0.552HisTyr: 0.552 ± 0.187
0.0HisXaa: 0.0 ± 0.0
Ile
5.129IleAla: 5.129 ± 0.56
0.316IleCys: 0.316 ± 0.221
5.286IleAsp: 5.286 ± 0.713
3.314IleGlu: 3.314 ± 0.53
1.262IlePhe: 1.262 ± 0.266
3.077IleGly: 3.077 ± 0.765
1.262IleHis: 1.262 ± 0.278
2.604IleIle: 2.604 ± 0.513
5.208IleLys: 5.208 ± 0.606
2.762IleLeu: 2.762 ± 0.471
1.262IleMet: 1.262 ± 0.294
4.261IleAsn: 4.261 ± 0.545
2.209IlePro: 2.209 ± 0.401
2.209IleGln: 2.209 ± 0.38
2.13IleArg: 2.13 ± 0.351
4.182IleSer: 4.182 ± 0.943
4.497IleThr: 4.497 ± 0.621
3.156IleVal: 3.156 ± 0.513
0.71IleTrp: 0.71 ± 0.246
2.209IleTyr: 2.209 ± 0.426
0.0IleXaa: 0.0 ± 0.0
Lys
5.681LysAla: 5.681 ± 0.835
0.237LysCys: 0.237 ± 0.136
4.497LysAsp: 4.497 ± 0.553
3.551LysGlu: 3.551 ± 0.577
2.604LysPhe: 2.604 ± 0.421
4.813LysGly: 4.813 ± 0.897
1.815LysHis: 1.815 ± 0.507
4.497LysIle: 4.497 ± 0.568
7.732LysLys: 7.732 ± 1.371
7.022LysLeu: 7.022 ± 0.692
2.525LysMet: 2.525 ± 0.477
4.655LysAsn: 4.655 ± 0.65
2.525LysPro: 2.525 ± 0.342
4.103LysGln: 4.103 ± 0.626
2.525LysArg: 2.525 ± 0.516
6.075LysSer: 6.075 ± 0.804
5.76LysThr: 5.76 ± 1.116
4.497LysVal: 4.497 ± 0.635
0.71LysTrp: 0.71 ± 0.324
3.156LysTyr: 3.156 ± 0.467
0.0LysXaa: 0.0 ± 0.0
Leu
7.022LeuAla: 7.022 ± 0.762
0.316LeuCys: 0.316 ± 0.178
5.286LeuAsp: 5.286 ± 0.789
4.182LeuGlu: 4.182 ± 0.594
2.288LeuPhe: 2.288 ± 0.438
4.418LeuGly: 4.418 ± 0.568
0.789LeuHis: 0.789 ± 0.224
3.629LeuIle: 3.629 ± 0.56
5.05LeuLys: 5.05 ± 0.543
5.286LeuLeu: 5.286 ± 0.835
1.815LeuMet: 1.815 ± 0.393
5.286LeuAsn: 5.286 ± 0.629
2.367LeuPro: 2.367 ± 0.381
3.551LeuGln: 3.551 ± 0.586
3.472LeuArg: 3.472 ± 0.544
6.075LeuSer: 6.075 ± 0.682
5.286LeuThr: 5.286 ± 0.656
4.813LeuVal: 4.813 ± 0.654
1.578LeuTrp: 1.578 ± 0.424
3.314LeuTyr: 3.314 ± 0.59
0.0LeuXaa: 0.0 ± 0.0
Met
3.551MetAla: 3.551 ± 0.511
0.079MetCys: 0.079 ± 0.084
1.499MetAsp: 1.499 ± 0.353
1.341MetGlu: 1.341 ± 0.374
1.262MetPhe: 1.262 ± 0.343
1.894MetGly: 1.894 ± 0.527
0.237MetHis: 0.237 ± 0.145
1.736MetIle: 1.736 ± 0.429
1.894MetLys: 1.894 ± 0.396
2.288MetLeu: 2.288 ± 0.411
0.868MetMet: 0.868 ± 0.259
1.578MetAsn: 1.578 ± 0.346
0.947MetPro: 0.947 ± 0.31
1.341MetGln: 1.341 ± 0.297
1.184MetArg: 1.184 ± 0.23
2.051MetSer: 2.051 ± 0.327
2.367MetThr: 2.367 ± 0.36
1.578MetVal: 1.578 ± 0.364
0.079MetTrp: 0.079 ± 0.082
0.395MetTyr: 0.395 ± 0.239
0.0MetXaa: 0.0 ± 0.0
Asn
3.945AsnAla: 3.945 ± 0.668
0.395AsnCys: 0.395 ± 0.2
4.418AsnAsp: 4.418 ± 0.487
3.472AsnGlu: 3.472 ± 0.551
2.13AsnPhe: 2.13 ± 0.531
5.76AsnGly: 5.76 ± 0.768
1.341AsnHis: 1.341 ± 0.397
2.919AsnIle: 2.919 ± 0.447
3.945AsnLys: 3.945 ± 0.564
4.103AsnLeu: 4.103 ± 0.427
1.736AsnMet: 1.736 ± 0.406
3.314AsnAsn: 3.314 ± 0.595
2.683AsnPro: 2.683 ± 0.411
2.288AsnGln: 2.288 ± 0.499
1.894AsnArg: 1.894 ± 0.361
4.497AsnSer: 4.497 ± 0.571
4.182AsnThr: 4.182 ± 0.649
4.813AsnVal: 4.813 ± 0.68
0.71AsnTrp: 0.71 ± 0.22
2.13AsnTyr: 2.13 ± 0.29
0.0AsnXaa: 0.0 ± 0.0
Pro
2.446ProAla: 2.446 ± 0.477
0.079ProCys: 0.079 ± 0.068
2.762ProAsp: 2.762 ± 0.572
2.209ProGlu: 2.209 ± 0.405
1.184ProPhe: 1.184 ± 0.331
1.894ProGly: 1.894 ± 0.312
0.395ProHis: 0.395 ± 0.161
1.657ProIle: 1.657 ± 0.303
2.525ProLys: 2.525 ± 0.423
2.683ProLeu: 2.683 ± 0.501
0.552ProMet: 0.552 ± 0.225
1.973ProAsn: 1.973 ± 0.412
0.237ProPro: 0.237 ± 0.136
2.051ProGln: 2.051 ± 0.572
1.262ProArg: 1.262 ± 0.293
2.209ProSer: 2.209 ± 0.52
2.288ProThr: 2.288 ± 0.389
2.446ProVal: 2.446 ± 0.393
0.237ProTrp: 0.237 ± 0.109
0.631ProTyr: 0.631 ± 0.211
0.0ProXaa: 0.0 ± 0.0
Gln
4.892GlnAla: 4.892 ± 0.556
0.158GlnCys: 0.158 ± 0.113
2.288GlnAsp: 2.288 ± 0.363
2.604GlnGlu: 2.604 ± 0.476
1.578GlnPhe: 1.578 ± 0.416
2.367GlnGly: 2.367 ± 0.348
1.184GlnHis: 1.184 ± 0.334
3.156GlnIle: 3.156 ± 0.49
3.314GlnLys: 3.314 ± 0.583
3.787GlnLeu: 3.787 ± 0.607
0.631GlnMet: 0.631 ± 0.226
2.762GlnAsn: 2.762 ± 0.396
1.657GlnPro: 1.657 ± 0.364
3.551GlnGln: 3.551 ± 0.551
2.525GlnArg: 2.525 ± 0.347
2.367GlnSer: 2.367 ± 0.416
3.629GlnThr: 3.629 ± 0.73
2.367GlnVal: 2.367 ± 0.372
0.552GlnTrp: 0.552 ± 0.167
1.815GlnTyr: 1.815 ± 0.376
0.0GlnXaa: 0.0 ± 0.0
Arg
2.051ArgAla: 2.051 ± 0.389
0.552ArgCys: 0.552 ± 0.243
2.051ArgAsp: 2.051 ± 0.391
2.13ArgGlu: 2.13 ± 0.601
0.947ArgPhe: 0.947 ± 0.294
1.894ArgGly: 1.894 ± 0.396
0.868ArgHis: 0.868 ± 0.249
2.209ArgIle: 2.209 ± 0.442
3.708ArgLys: 3.708 ± 0.59
2.998ArgLeu: 2.998 ± 0.442
1.105ArgMet: 1.105 ± 0.286
2.051ArgAsn: 2.051 ± 0.486
1.42ArgPro: 1.42 ± 0.307
1.894ArgGln: 1.894 ± 0.466
1.499ArgArg: 1.499 ± 0.303
2.446ArgSer: 2.446 ± 0.479
2.209ArgThr: 2.209 ± 0.424
2.209ArgVal: 2.209 ± 0.394
0.552ArgTrp: 0.552 ± 0.212
1.42ArgTyr: 1.42 ± 0.477
0.0ArgXaa: 0.0 ± 0.0
Ser
4.655SerAla: 4.655 ± 0.538
0.079SerCys: 0.079 ± 0.081
4.655SerAsp: 4.655 ± 0.662
2.84SerGlu: 2.84 ± 0.409
2.209SerPhe: 2.209 ± 0.548
7.496SerGly: 7.496 ± 1.084
1.026SerHis: 1.026 ± 0.306
3.551SerIle: 3.551 ± 0.475
6.391SerLys: 6.391 ± 1.009
5.365SerLeu: 5.365 ± 0.602
2.209SerMet: 2.209 ± 0.569
4.182SerAsn: 4.182 ± 0.533
1.578SerPro: 1.578 ± 0.392
2.998SerGln: 2.998 ± 0.486
1.42SerArg: 1.42 ± 0.35
4.892SerSer: 4.892 ± 0.916
4.261SerThr: 4.261 ± 0.545
5.602SerVal: 5.602 ± 0.602
0.71SerTrp: 0.71 ± 0.228
2.84SerTyr: 2.84 ± 0.68
0.0SerXaa: 0.0 ± 0.0
Thr
5.602ThrAla: 5.602 ± 0.967
0.395ThrCys: 0.395 ± 0.16
5.05ThrAsp: 5.05 ± 0.824
3.472ThrGlu: 3.472 ± 0.491
2.288ThrPhe: 2.288 ± 0.472
6.154ThrGly: 6.154 ± 0.699
1.736ThrHis: 1.736 ± 0.343
4.813ThrIle: 4.813 ± 0.54
4.813ThrLys: 4.813 ± 0.778
5.05ThrLeu: 5.05 ± 0.816
1.341ThrMet: 1.341 ± 0.303
5.681ThrAsn: 5.681 ± 0.977
3.945ThrPro: 3.945 ± 0.709
2.84ThrGln: 2.84 ± 0.496
1.973ThrArg: 1.973 ± 0.421
4.024ThrSer: 4.024 ± 0.41
5.129ThrThr: 5.129 ± 1.188
3.866ThrVal: 3.866 ± 0.637
0.868ThrTrp: 0.868 ± 0.24
2.367ThrTyr: 2.367 ± 0.581
0.0ThrXaa: 0.0 ± 0.0
Val
4.34ValAla: 4.34 ± 0.564
0.316ValCys: 0.316 ± 0.163
5.05ValAsp: 5.05 ± 0.572
3.551ValGlu: 3.551 ± 0.629
1.815ValPhe: 1.815 ± 0.286
4.261ValGly: 4.261 ± 0.475
1.736ValHis: 1.736 ± 0.347
3.708ValIle: 3.708 ± 0.613
4.892ValLys: 4.892 ± 0.555
4.576ValLeu: 4.576 ± 0.622
1.973ValMet: 1.973 ± 0.347
4.103ValAsn: 4.103 ± 0.51
2.446ValPro: 2.446 ± 0.393
2.919ValGln: 2.919 ± 0.571
2.604ValArg: 2.604 ± 0.403
3.945ValSer: 3.945 ± 0.529
4.813ValThr: 4.813 ± 0.833
4.261ValVal: 4.261 ± 0.746
0.631ValTrp: 0.631 ± 0.233
2.683ValTyr: 2.683 ± 0.502
0.0ValXaa: 0.0 ± 0.0
Trp
0.631TrpAla: 0.631 ± 0.213
0.158TrpCys: 0.158 ± 0.105
0.473TrpAsp: 0.473 ± 0.162
0.316TrpGlu: 0.316 ± 0.105
0.395TrpPhe: 0.395 ± 0.153
0.868TrpGly: 0.868 ± 0.218
0.316TrpHis: 0.316 ± 0.149
1.105TrpIle: 1.105 ± 0.325
1.341TrpLys: 1.341 ± 0.457
1.184TrpLeu: 1.184 ± 0.338
0.316TrpMet: 0.316 ± 0.141
0.947TrpAsn: 0.947 ± 0.301
0.552TrpPro: 0.552 ± 0.166
0.71TrpGln: 0.71 ± 0.183
0.395TrpArg: 0.395 ± 0.178
1.026TrpSer: 1.026 ± 0.246
0.947TrpThr: 0.947 ± 0.268
1.184TrpVal: 1.184 ± 0.328
0.237TrpTrp: 0.237 ± 0.115
0.316TrpTyr: 0.316 ± 0.121
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.604TyrAla: 2.604 ± 0.366
0.237TyrCys: 0.237 ± 0.155
2.367TyrAsp: 2.367 ± 0.525
2.288TyrGlu: 2.288 ± 0.435
1.262TyrPhe: 1.262 ± 0.317
3.314TyrGly: 3.314 ± 0.583
0.789TyrHis: 0.789 ± 0.295
1.973TyrIle: 1.973 ± 0.354
2.367TyrLys: 2.367 ± 0.466
3.393TyrLeu: 3.393 ± 0.621
0.71TyrMet: 0.71 ± 0.229
1.499TyrAsn: 1.499 ± 0.344
1.42TyrPro: 1.42 ± 0.343
2.525TyrGln: 2.525 ± 0.449
1.894TyrArg: 1.894 ± 0.396
2.604TyrSer: 2.604 ± 0.411
2.84TyrThr: 2.84 ± 0.539
3.156TyrVal: 3.156 ± 0.545
0.71TyrTrp: 0.71 ± 0.236
1.184TyrTyr: 1.184 ± 0.431
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (12675 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski