Amino acid dipeptide frequency for Lactobacillus phage JNU_P1

In addition to single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
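As a minimal sketch of such a calculation, the Python snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input sequences, and the choice of denominator (the total number of counted pairs) are assumptions made for illustration; the exact normalization used for the table on this page is not stated here.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MKAAL", "AAK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

With this normalization the frequencies of all observed dipeptides sum to 1000 per mille.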

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
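The uniform expectation and the over/under labelling can be sketched as follows. This is an illustration only: the helper names `expected_per_mille` and `classify` are hypothetical, and whether the table's colouring uses the 400-dipeptide or the 441-dipeptide expectation as its threshold is an assumption (the 400-dipeptide value is used here).

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)

def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # shown in red in the table
    if freq_per_mille < expected:
        return "under"   # shown in blue in the table
    return "expected"

expected = expected_per_mille(20)   # 1000 / 400
print(expected)
print(classify(4.756, expected))    # AlaAla value from the table
print(classify(0.507, expected))    # AlaCys value from the table
```

Passing `n_residue_types=21` gives the expectation when Xaa is included as a 21st residue type.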

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Each block below is headed by the first residue; entries give the dipeptide (first residue followed by second residue) as frequency ± estimated error, in ‰.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.756 ± 0.608
AlaCys: 0.507 ± 0.203
AlaAsp: 6.025 ± 0.74
AlaGlu: 4.186 ± 0.785
AlaPhe: 2.6 ± 0.47
AlaGly: 5.518 ± 0.787
AlaHis: 1.332 ± 0.307
AlaIle: 6.279 ± 0.64
AlaLys: 6.976 ± 0.865
AlaLeu: 6.152 ± 0.554
AlaMet: 2.537 ± 0.59
AlaAsn: 6.279 ± 0.784
AlaPro: 1.459 ± 0.242
AlaGln: 3.678 ± 0.525
AlaArg: 2.854 ± 0.511
AlaSer: 7.357 ± 1.254
AlaThr: 6.786 ± 0.999
AlaVal: 5.391 ± 0.608
AlaTrp: 1.522 ± 0.35
AlaTyr: 2.473 ± 0.314
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.381 ± 0.155
CysCys: 0.0 ± 0.0
CysAsp: 0.127 ± 0.083
CysGlu: 0.19 ± 0.106
CysPhe: 0.127 ± 0.094
CysGly: 0.444 ± 0.219
CysHis: 0.254 ± 0.113
CysIle: 0.317 ± 0.169
CysLys: 0.19 ± 0.117
CysLeu: 0.381 ± 0.158
CysMet: 0.127 ± 0.09
CysAsn: 0.254 ± 0.129
CysPro: 0.254 ± 0.2
CysGln: 0.381 ± 0.144
CysArg: 0.317 ± 0.155
CysSer: 0.19 ± 0.115
CysThr: 0.19 ± 0.107
CysVal: 0.127 ± 0.089
CysTrp: 0.063 ± 0.058
CysTyr: 0.254 ± 0.144
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.771 ± 0.893
AspCys: 0.381 ± 0.131
AspAsp: 5.327 ± 0.91
AspGlu: 3.615 ± 0.682
AspPhe: 3.108 ± 0.429
AspGly: 5.898 ± 0.8
AspHis: 0.824 ± 0.279
AspIle: 3.615 ± 0.524
AspLys: 4.756 ± 0.585
AspLeu: 5.454 ± 0.605
AspMet: 2.156 ± 0.414
AspAsn: 3.932 ± 0.619
AspPro: 2.6 ± 0.417
AspGln: 2.79 ± 0.391
AspArg: 2.473 ± 0.419
AspSer: 2.981 ± 0.42
AspThr: 4.693 ± 0.687
AspVal: 3.742 ± 0.771
AspTrp: 1.395 ± 0.341
AspTyr: 3.615 ± 0.61
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.883 ± 0.542
GluCys: 0.127 ± 0.088
GluAsp: 2.727 ± 0.404
GluGlu: 3.678 ± 0.504
GluPhe: 1.776 ± 0.353
GluGly: 2.6 ± 0.484
GluHis: 0.888 ± 0.257
GluIle: 3.869 ± 0.652
GluLys: 4.186 ± 0.758
GluLeu: 6.152 ± 0.946
GluMet: 1.712 ± 0.352
GluAsn: 2.854 ± 0.443
GluPro: 2.156 ± 0.481
GluGln: 3.425 ± 0.544
GluArg: 3.488 ± 0.597
GluSer: 3.361 ± 0.66
GluThr: 2.664 ± 0.375
GluVal: 3.488 ± 0.634
GluTrp: 1.015 ± 0.307
GluTyr: 2.41 ± 0.521
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.473 ± 0.41
PheCys: 0.19 ± 0.115
PheAsp: 2.79 ± 0.442
PheGlu: 1.712 ± 0.35
PhePhe: 1.268 ± 0.259
PheGly: 2.093 ± 0.432
PheHis: 0.888 ± 0.24
PheIle: 2.156 ± 0.404
PheLys: 3.044 ± 0.606
PheLeu: 2.283 ± 0.388
PheMet: 0.507 ± 0.198
PheAsn: 2.6 ± 0.524
PhePro: 0.951 ± 0.24
PheGln: 0.951 ± 0.246
PheArg: 1.205 ± 0.301
PheSer: 2.41 ± 0.563
PheThr: 3.615 ± 0.494
PheVal: 1.332 ± 0.27
PheTrp: 0.507 ± 0.199
PheTyr: 1.268 ± 0.318
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.249 ± 0.624
GlyCys: 0.317 ± 0.145
GlyAsp: 4.376 ± 0.474
GlyGlu: 4.122 ± 0.449
GlyPhe: 2.917 ± 0.554
GlyGly: 3.995 ± 0.954
GlyHis: 1.205 ± 0.355
GlyIle: 4.122 ± 0.537
GlyLys: 5.01 ± 1.087
GlyLeu: 6.279 ± 0.818
GlyMet: 1.522 ± 0.295
GlyAsn: 3.805 ± 0.65
GlyPro: 1.205 ± 0.294
GlyGln: 2.283 ± 0.344
GlyArg: 2.283 ± 0.285
GlySer: 4.883 ± 0.849
GlyThr: 5.074 ± 0.748
GlyVal: 3.742 ± 0.415
GlyTrp: 1.142 ± 0.296
GlyTyr: 2.41 ± 0.468
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.015 ± 0.239
HisCys: 0.19 ± 0.113
HisAsp: 1.078 ± 0.259
HisGlu: 1.015 ± 0.24
HisPhe: 0.824 ± 0.251
HisGly: 1.015 ± 0.214
HisHis: 0.381 ± 0.124
HisIle: 0.888 ± 0.236
HisLys: 0.507 ± 0.152
HisLeu: 1.078 ± 0.279
HisMet: 0.444 ± 0.186
HisAsn: 0.571 ± 0.18
HisPro: 0.761 ± 0.225
HisGln: 0.888 ± 0.218
HisArg: 1.015 ± 0.349
HisSer: 1.205 ± 0.308
HisThr: 0.824 ± 0.228
HisVal: 0.761 ± 0.251
HisTrp: 0.507 ± 0.194
HisTyr: 0.698 ± 0.197
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.264 ± 0.562
IleCys: 0.381 ± 0.179
IleAsp: 4.059 ± 0.393
IleGlu: 4.313 ± 0.566
IlePhe: 1.839 ± 0.385
IleGly: 4.376 ± 0.783
IleHis: 0.824 ± 0.203
IleIle: 2.917 ± 0.591
IleLys: 5.644 ± 0.519
IleLeu: 3.551 ± 0.619
IleMet: 0.761 ± 0.248
IleAsn: 5.454 ± 0.653
IlePro: 2.029 ± 0.412
IleGln: 2.347 ± 0.417
IleArg: 2.537 ± 0.342
IleSer: 4.947 ± 0.837
IleThr: 4.566 ± 0.48
IleVal: 3.361 ± 0.614
IleTrp: 0.444 ± 0.187
IleTyr: 1.903 ± 0.438
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.088 ± 1.152
LysCys: 0.19 ± 0.125
LysAsp: 4.756 ± 0.643
LysGlu: 4.059 ± 0.693
LysPhe: 2.537 ± 0.479
LysGly: 3.869 ± 0.727
LysHis: 1.395 ± 0.302
LysIle: 3.995 ± 0.465
LysLys: 4.566 ± 0.726
LysLeu: 6.786 ± 0.776
LysMet: 2.283 ± 0.496
LysAsn: 4.186 ± 0.562
LysPro: 1.268 ± 0.248
LysGln: 4.566 ± 0.508
LysArg: 3.298 ± 0.528
LysSer: 5.454 ± 0.895
LysThr: 4.82 ± 0.704
LysVal: 4.439 ± 0.477
LysTrp: 1.142 ± 0.343
LysTyr: 2.537 ± 0.432
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.849 ± 0.75
LeuCys: 0.444 ± 0.208
LeuAsp: 7.293 ± 0.718
LeuGlu: 4.947 ± 0.577
LeuPhe: 2.473 ± 0.349
LeuGly: 5.581 ± 0.977
LeuHis: 1.078 ± 0.308
LeuIle: 5.644 ± 0.708
LeuLys: 7.23 ± 0.672
LeuLeu: 5.264 ± 0.811
LeuMet: 1.903 ± 0.364
LeuAsn: 4.883 ± 0.723
LeuPro: 2.347 ± 0.484
LeuGln: 2.917 ± 0.603
LeuArg: 3.551 ± 0.578
LeuSer: 6.596 ± 0.58
LeuThr: 5.391 ± 0.51
LeuVal: 4.122 ± 0.51
LeuTrp: 0.634 ± 0.248
LeuTyr: 2.473 ± 0.546
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.727 ± 0.643
MetCys: 0.127 ± 0.097
MetAsp: 1.268 ± 0.286
MetGlu: 1.459 ± 0.276
MetPhe: 0.951 ± 0.243
MetGly: 1.015 ± 0.267
MetHis: 0.254 ± 0.129
MetIle: 1.776 ± 0.39
MetLys: 2.093 ± 0.454
MetLeu: 1.776 ± 0.324
MetMet: 0.254 ± 0.126
MetAsn: 2.156 ± 0.428
MetPro: 0.571 ± 0.195
MetGln: 1.522 ± 0.321
MetArg: 1.268 ± 0.317
MetSer: 1.078 ± 0.217
MetThr: 2.283 ± 0.32
MetVal: 0.951 ± 0.266
MetTrp: 0.254 ± 0.122
MetTyr: 0.507 ± 0.197
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.708 ± 0.888
AsnCys: 0.19 ± 0.111
AsnAsp: 3.995 ± 0.51
AsnGlu: 3.361 ± 0.484
AsnPhe: 1.903 ± 0.313
AsnGly: 4.947 ± 0.733
AsnHis: 0.634 ± 0.211
AsnIle: 2.917 ± 0.481
AsnLys: 3.615 ± 0.641
AsnLeu: 5.01 ± 0.829
AsnMet: 1.459 ± 0.323
AsnAsn: 4.059 ± 0.717
AsnPro: 2.41 ± 0.393
AsnGln: 2.917 ± 0.431
AsnArg: 3.234 ± 0.565
AsnSer: 5.01 ± 0.6
AsnThr: 3.742 ± 0.723
AsnVal: 3.805 ± 0.482
AsnTrp: 0.951 ± 0.201
AsnTyr: 2.41 ± 0.388
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.029 ± 0.358
ProCys: 0.19 ± 0.11
ProAsp: 1.839 ± 0.366
ProGlu: 2.79 ± 0.535
ProPhe: 0.698 ± 0.236
ProGly: 1.903 ± 0.382
ProHis: 0.19 ± 0.127
ProIle: 1.712 ± 0.357
ProLys: 1.903 ± 0.384
ProLeu: 1.903 ± 0.451
ProMet: 0.888 ± 0.273
ProAsn: 1.649 ± 0.375
ProPro: 0.571 ± 0.204
ProGln: 1.078 ± 0.273
ProArg: 0.444 ± 0.17
ProSer: 2.22 ± 0.401
ProThr: 2.537 ± 0.651
ProVal: 1.395 ± 0.295
ProTrp: 0.507 ± 0.224
ProTyr: 0.888 ± 0.238
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.835 ± 0.653
GlnCys: 0.127 ± 0.094
GlnAsp: 2.854 ± 0.436
GlnGlu: 2.79 ± 0.532
GlnPhe: 1.649 ± 0.388
GlnGly: 2.727 ± 0.598
GlnHis: 1.078 ± 0.21
GlnIle: 2.917 ± 0.352
GlnLys: 2.347 ± 0.489
GlnLeu: 5.01 ± 0.549
GlnMet: 1.078 ± 0.269
GlnAsn: 2.41 ± 0.469
GlnPro: 0.951 ± 0.268
GlnGln: 2.283 ± 0.467
GlnArg: 1.078 ± 0.278
GlnSer: 2.981 ± 0.54
GlnThr: 3.742 ± 1.253
GlnVal: 2.6 ± 0.454
GlnTrp: 0.444 ± 0.115
GlnTyr: 1.205 ± 0.328
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.79 ± 0.489
ArgCys: 0.317 ± 0.137
ArgAsp: 2.537 ± 0.423
ArgGlu: 2.283 ± 0.424
ArgPhe: 1.776 ± 0.283
ArgGly: 2.156 ± 0.376
ArgHis: 0.507 ± 0.244
ArgIle: 2.22 ± 0.345
ArgLys: 2.854 ± 0.537
ArgLeu: 4.249 ± 0.638
ArgMet: 0.824 ± 0.217
ArgAsn: 2.22 ± 0.356
ArgPro: 1.332 ± 0.359
ArgGln: 2.473 ± 0.473
ArgArg: 2.917 ± 0.487
ArgSer: 2.854 ± 0.331
ArgThr: 2.79 ± 0.4
ArgVal: 2.41 ± 0.402
ArgTrp: 0.254 ± 0.116
ArgTyr: 2.347 ± 0.421
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.532 ± 1.431
SerCys: 0.19 ± 0.107
SerAsp: 6.025 ± 1.473
SerGlu: 3.678 ± 0.517
SerPhe: 2.854 ± 0.404
SerGly: 4.947 ± 0.834
SerHis: 1.078 ± 0.276
SerIle: 4.439 ± 0.655
SerLys: 5.327 ± 1.136
SerLeu: 5.454 ± 0.758
SerMet: 1.966 ± 0.555
SerAsn: 4.503 ± 0.572
SerPro: 1.459 ± 0.373
SerGln: 2.981 ± 0.903
SerArg: 3.044 ± 0.431
SerSer: 5.01 ± 0.879
SerThr: 5.01 ± 0.671
SerVal: 4.566 ± 0.572
SerTrp: 1.205 ± 0.258
SerTyr: 2.156 ± 0.315
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.23 ± 1.125
ThrCys: 0.127 ± 0.084
ThrAsp: 3.742 ± 0.386
ThrGlu: 3.234 ± 0.503
ThrPhe: 1.459 ± 0.286
ThrGly: 4.756 ± 0.875
ThrHis: 1.078 ± 0.31
ThrIle: 4.82 ± 0.402
ThrLys: 5.01 ± 0.728
ThrLeu: 6.088 ± 1.021
ThrMet: 1.078 ± 0.287
ThrAsn: 4.503 ± 0.839
ThrPro: 2.283 ± 0.516
ThrGln: 3.615 ± 0.584
ThrArg: 2.347 ± 0.342
ThrSer: 5.454 ± 0.621
ThrThr: 6.913 ± 1.274
ThrVal: 5.264 ± 0.938
ThrTrp: 0.824 ± 0.256
ThrTyr: 2.093 ± 0.478
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.898 ± 0.94
ValCys: 0.317 ± 0.142
ValAsp: 4.883 ± 0.685
ValGlu: 2.981 ± 0.383
ValPhe: 1.459 ± 0.343
ValGly: 3.234 ± 0.529
ValHis: 0.888 ± 0.182
ValIle: 4.186 ± 0.525
ValLys: 4.439 ± 0.537
ValLeu: 3.678 ± 0.458
ValMet: 1.712 ± 0.312
ValAsn: 3.488 ± 0.358
ValPro: 1.205 ± 0.314
ValGln: 2.283 ± 0.409
ValArg: 2.093 ± 0.347
ValSer: 5.137 ± 0.606
ValThr: 3.108 ± 0.583
ValVal: 3.044 ± 0.405
ValTrp: 0.571 ± 0.218
ValTyr: 2.6 ± 0.5
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.205 ± 0.256
TrpCys: 0.063 ± 0.069
TrpAsp: 1.015 ± 0.227
TrpGlu: 0.571 ± 0.154
TrpPhe: 0.507 ± 0.158
TrpGly: 0.698 ± 0.17
TrpHis: 0.19 ± 0.119
TrpIle: 0.761 ± 0.182
TrpLys: 0.698 ± 0.305
TrpLeu: 1.649 ± 0.368
TrpMet: 0.444 ± 0.165
TrpAsn: 0.698 ± 0.298
TrpPro: 0.444 ± 0.185
TrpGln: 0.888 ± 0.18
TrpArg: 0.698 ± 0.192
TrpSer: 0.824 ± 0.212
TrpThr: 0.761 ± 0.227
TrpVal: 1.015 ± 0.26
TrpTrp: 0.19 ± 0.101
TrpTyr: 0.507 ± 0.127
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.981 ± 0.262
TyrCys: 0.19 ± 0.111
TyrAsp: 2.41 ± 0.411
TyrGlu: 2.473 ± 0.54
TyrPhe: 1.522 ± 0.339
TyrGly: 3.171 ± 0.516
TyrHis: 0.698 ± 0.167
TyrIle: 1.966 ± 0.307
TyrLys: 1.839 ± 0.473
TyrLeu: 3.425 ± 0.549
TyrMet: 0.571 ± 0.197
TyrAsn: 1.776 ± 0.3
TyrPro: 1.078 ± 0.298
TyrGln: 1.839 ± 0.321
TyrArg: 1.966 ± 0.368
TyrSer: 2.473 ± 0.368
TyrThr: 2.41 ± 0.388
TyrVal: 1.649 ± 0.285
TyrTrp: 0.317 ± 0.19
TyrTyr: 1.712 ± 0.458
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 73 proteins (15769 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
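One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The helper names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error are all assumptions for illustration; the page only states that bootstrapping with 100 rounds at the protein level was used.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter(seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKAAL", "AAK", "GGAAG", "MLKT"]
print(pair_frequency(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins yield a larger error estimate.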

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski