Amino acid dipeptide frequency for Lactobacillus phage Ld3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
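As an illustration of how such per mille values and the 2.5‰ uniform expectation relate, here is a minimal Python sketch. It is not the code used to build this page: the function names, the toy sequences and the choice to normalise by the total number of overlapping dipeptides are all assumptions made for the example, and the database may normalise differently.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

# Uniform expectation: 1 of 400 dipeptides = 0.25 % = 2.5 per mille.
EXPECTED = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and frequencies are normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a per mille value relative to the uniform expectation."""
    if freq_per_mille > EXPECTED:
        return "over"
    if freq_per_mille < EXPECTED:
        return "under"
    return "expected"


# Toy, made-up sequences (hypothetical, not taken from the proteome).
toy = ["MKKLLA", "MKKA"]
freqs = dipeptide_per_mille(toy)
print(freqs["KK"])        # per mille value for Lys-Lys in the toy data
print(classify(7.557))    # a value such as LysLys in the table
print(7.557 * 1e-3)       # per mille converted to a fraction
```

With this convention, a table value above 2.5 would fall in the "over" class and a value below it in the "under" class, mirroring the red and blue marking described above.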

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 1.424 ± 0.687 | 0.548 ± 0.227 | 4.271 ± 0.585 | 4.381 ± 0.663 | 3.286 ± 0.508 | 5.476 ± 1.348 | 0.548 ± 0.269 | 4.819 ± 0.761 | 5.804 ± 0.893 | 4.162 ± 0.734 | 1.533 ± 0.553 | 4.928 ± 0.784 | 1.314 ± 0.464 | 2.738 ± 0.773 | 3.066 ± 0.679 | 4.49 ± 1.235 | 3.614 ± 0.672 | 4.6 ± 0.834 | 1.095 ± 0.302 | 3.505 ± 0.759 | 0.0 ± 0.0 |
| Cys | 0.548 ± 0.269 | 0.219 ± 0.169 | 0.11 ± 0.088 | 0.657 ± 0.287 | 0.0 ± 0.0 | 0.986 ± 0.276 | 0.219 ± 0.166 | 0.438 ± 0.194 | 1.314 ± 0.382 | 0.657 ± 0.302 | 0.438 ± 0.18 | 0.548 ± 0.222 | 0.11 ± 0.109 | 0.548 ± 0.207 | 0.548 ± 0.236 | 0.219 ± 0.131 | 0.548 ± 0.248 | 0.548 ± 0.267 | 0.329 ± 0.181 | 0.219 ± 0.139 | 0.0 ± 0.0 |
| Asp | 2.628 ± 0.446 | 0.438 ± 0.219 | 5.038 ± 0.77 | 5.038 ± 0.872 | 2.738 ± 0.547 | 5.476 ± 0.839 | 0.548 ± 0.206 | 4.819 ± 0.887 | 5.257 ± 0.676 | 5.366 ± 0.656 | 2.409 ± 0.541 | 4.49 ± 0.859 | 1.205 ± 0.529 | 2.19 ± 0.321 | 1.862 ± 0.422 | 4.6 ± 0.697 | 3.614 ± 0.721 | 3.833 ± 0.639 | 1.095 ± 0.386 | 3.176 ± 0.665 | 0.0 ± 0.0 |
| Glu | 4.49 ± 0.652 | 0.438 ± 0.23 | 5.257 ± 0.665 | 4.819 ± 0.712 | 2.3 ± 0.447 | 3.066 ± 0.422 | 1.095 ± 0.385 | 6.133 ± 1.171 | 5.585 ± 0.72 | 6.242 ± 1.122 | 2.19 ± 0.487 | 3.833 ± 0.742 | 1.643 ± 0.384 | 1.971 ± 0.471 | 2.847 ± 0.561 | 2.738 ± 0.658 | 4.052 ± 0.626 | 5.695 ± 0.727 | 0.329 ± 0.18 | 2.738 ± 0.482 | 0.0 ± 0.0 |
| Phe | 2.409 ± 0.559 | 0.329 ± 0.199 | 2.738 ± 0.512 | 2.738 ± 0.482 | 1.424 ± 0.419 | 2.738 ± 0.538 | 0.219 ± 0.151 | 3.614 ± 0.625 | 3.724 ± 0.47 | 2.847 ± 0.68 | 0.657 ± 0.29 | 2.409 ± 0.527 | 0.438 ± 0.225 | 1.205 ± 0.367 | 1.971 ± 0.524 | 2.19 ± 0.516 | 3.176 ± 0.556 | 2.19 ± 0.417 | 0.0 ± 0.0 | 1.095 ± 0.401 | 0.0 ± 0.0 |
| Gly | 4.819 ± 0.977 | 0.329 ± 0.174 | 4.49 ± 0.76 | 3.176 ± 0.639 | 3.395 ± 0.634 | 3.505 ± 0.716 | 0.986 ± 0.315 | 5.366 ± 0.941 | 6.023 ± 0.862 | 5.914 ± 0.804 | 1.424 ± 0.48 | 3.833 ± 0.754 | 0.0 ± 0.0 | 1.752 ± 0.442 | 3.286 ± 0.507 | 4.162 ± 0.893 | 3.395 ± 0.669 | 5.695 ± 0.805 | 0.876 ± 0.262 | 3.286 ± 0.758 | 0.0 ± 0.0 |
| His | 0.876 ± 0.294 | 0.11 ± 0.102 | 0.767 ± 0.274 | 0.657 ± 0.315 | 0.219 ± 0.167 | 0.438 ± 0.232 | 0.548 ± 0.291 | 0.986 ± 0.282 | 1.424 ± 0.479 | 0.876 ± 0.256 | 0.329 ± 0.173 | 0.657 ± 0.275 | 0.219 ± 0.14 | 0.438 ± 0.182 | 0.548 ± 0.233 | 0.876 ± 0.248 | 0.657 ± 0.208 | 1.533 ± 0.458 | 0.0 ± 0.0 | 0.986 ± 0.311 | 0.0 ± 0.0 |
| Ile | 3.833 ± 0.562 | 0.876 ± 0.316 | 5.038 ± 0.754 | 5.147 ± 0.718 | 1.533 ± 0.355 | 3.614 ± 0.671 | 0.657 ± 0.232 | 3.614 ± 0.73 | 6.681 ± 0.951 | 4.928 ± 0.963 | 1.862 ± 0.5 | 5.585 ± 0.717 | 2.738 ± 0.457 | 1.533 ± 0.38 | 2.3 ± 0.497 | 5.476 ± 0.71 | 5.038 ± 0.762 | 3.724 ± 0.64 | 0.986 ± 0.396 | 3.066 ± 0.414 | 0.0 ± 0.0 |
| Lys | 6.681 ± 0.893 | 0.219 ± 0.233 | 4.162 ± 0.64 | 7.009 ± 1.068 | 3.176 ± 0.757 | 6.133 ± 0.66 | 0.876 ± 0.375 | 5.476 ± 0.78 | 7.557 ± 1.305 | 7.338 ± 1.03 | 1.862 ± 0.577 | 7.119 ± 0.969 | 1.424 ± 0.361 | 3.286 ± 0.638 | 4.162 ± 0.899 | 5.147 ± 0.869 | 5.147 ± 0.679 | 6.352 ± 0.837 | 0.548 ± 0.197 | 4.162 ± 0.804 | 0.0 ± 0.0 |
| Leu | 5.585 ± 1.093 | 1.095 ± 0.357 | 5.257 ± 0.838 | 4.709 ± 0.785 | 2.409 ± 0.557 | 4.928 ± 0.963 | 1.095 ± 0.314 | 5.038 ± 0.819 | 7.557 ± 1.004 | 6.352 ± 0.931 | 1.862 ± 0.337 | 5.366 ± 0.654 | 3.286 ± 0.686 | 1.424 ± 0.336 | 3.614 ± 0.693 | 4.819 ± 0.785 | 4.709 ± 0.759 | 4.162 ± 0.622 | 0.329 ± 0.191 | 3.066 ± 0.653 | 0.0 ± 0.0 |
| Met | 2.957 ± 0.63 | 0.329 ± 0.193 | 1.643 ± 0.396 | 1.752 ± 0.425 | 0.657 ± 0.255 | 1.095 ± 0.346 | 0.11 ± 0.119 | 1.314 ± 0.385 | 3.176 ± 0.711 | 1.205 ± 0.348 | 0.548 ± 0.252 | 1.205 ± 0.376 | 0.548 ± 0.248 | 0.548 ± 0.192 | 1.643 ± 0.36 | 1.533 ± 0.461 | 1.643 ± 0.446 | 2.081 ± 0.544 | 0.329 ± 0.135 | 0.11 ± 0.119 | 0.0 ± 0.0 |
| Asn | 5.585 ± 1.239 | 0.767 ± 0.321 | 2.957 ± 0.591 | 6.023 ± 0.943 | 2.957 ± 0.584 | 7.009 ± 0.791 | 0.986 ± 0.417 | 3.286 ± 0.62 | 4.381 ± 0.674 | 5.476 ± 0.938 | 1.533 ± 0.337 | 3.943 ± 0.981 | 1.424 ± 0.372 | 2.628 ± 0.526 | 2.628 ± 0.469 | 5.147 ± 0.758 | 3.286 ± 0.726 | 3.724 ± 0.495 | 0.767 ± 0.262 | 2.738 ± 0.502 | 0.0 ± 0.0 |
| Pro | 2.409 ± 0.553 | 0.11 ± 0.109 | 1.205 ± 0.33 | 0.876 ± 0.295 | 1.095 ± 0.294 | 1.095 ± 0.467 | 0.438 ± 0.223 | 1.533 ± 0.402 | 2.738 ± 0.512 | 1.424 ± 0.391 | 0.329 ± 0.168 | 1.643 ± 0.444 | 0.548 ± 0.183 | 0.657 ± 0.27 | 0.876 ± 0.246 | 1.643 ± 0.485 | 1.205 ± 0.396 | 2.628 ± 0.602 | 0.548 ± 0.273 | 1.314 ± 0.336 | 0.0 ± 0.0 |
| Gln | 2.519 ± 0.348 | 0.438 ± 0.218 | 2.081 ± 0.374 | 1.314 ± 0.448 | 1.095 ± 0.229 | 2.081 ± 0.459 | 0.548 ± 0.234 | 2.738 ± 0.49 | 2.738 ± 0.63 | 2.738 ± 0.928 | 0.986 ± 0.351 | 2.081 ± 0.529 | 1.205 ± 0.344 | 1.314 ± 0.33 | 1.095 ± 0.248 | 2.3 ± 0.541 | 1.862 ± 0.395 | 2.3 ± 0.446 | 0.11 ± 0.102 | 1.643 ± 0.425 | 0.0 ± 0.0 |
| Arg | 2.628 ± 0.647 | 0.767 ± 0.299 | 3.286 ± 0.661 | 2.3 ± 0.559 | 2.081 ± 0.403 | 1.971 ± 0.497 | 0.329 ± 0.21 | 2.738 ± 0.441 | 3.505 ± 0.65 | 3.395 ± 0.595 | 1.205 ± 0.352 | 2.409 ± 0.678 | 0.876 ± 0.357 | 1.533 ± 0.288 | 1.752 ± 0.356 | 1.971 ± 0.471 | 2.847 ± 0.605 | 2.628 ± 0.603 | 0.876 ± 0.403 | 2.738 ± 0.501 | 0.0 ± 0.0 |
| Ser | 3.724 ± 0.574 | 0.548 ± 0.211 | 3.395 ± 0.549 | 4.162 ± 0.666 | 3.395 ± 0.616 | 3.943 ± 0.744 | 0.986 ± 0.263 | 3.943 ± 0.496 | 5.914 ± 0.674 | 3.614 ± 0.496 | 1.752 ± 0.5 | 5.257 ± 0.573 | 1.971 ± 0.456 | 2.19 ± 0.532 | 2.519 ± 0.662 | 5.695 ± 0.955 | 3.943 ± 0.729 | 5.038 ± 0.665 | 0.657 ± 0.265 | 2.847 ± 0.642 | 0.11 ± 0.088 |
| Thr | 4.381 ± 0.692 | 0.329 ± 0.167 | 4.271 ± 0.908 | 4.49 ± 0.601 | 2.3 ± 0.591 | 4.271 ± 0.573 | 0.657 ± 0.225 | 4.381 ± 0.545 | 5.038 ± 0.784 | 4.928 ± 0.791 | 0.219 ± 0.134 | 2.957 ± 0.815 | 1.862 ± 0.436 | 1.971 ± 0.439 | 1.752 ± 0.442 | 3.614 ± 0.687 | 2.957 ± 0.6 | 5.147 ± 0.708 | 1.095 ± 0.351 | 2.3 ± 0.519 | 0.11 ± 0.088 |
| Val | 5.695 ± 0.99 | 0.548 ± 0.224 | 4.709 ± 0.594 | 4.819 ± 0.799 | 1.752 ± 0.419 | 4.381 ± 0.659 | 1.533 ± 0.342 | 4.381 ± 0.532 | 5.804 ± 1.038 | 5.038 ± 0.641 | 1.752 ± 0.464 | 5.476 ± 0.815 | 1.533 ± 0.438 | 2.3 ± 0.446 | 2.519 ± 0.403 | 5.257 ± 0.628 | 4.381 ± 0.607 | 4.162 ± 0.716 | 0.548 ± 0.257 | 3.724 ± 0.631 | 0.0 ± 0.0 |
| Trp | 0.657 ± 0.281 | 0.11 ± 0.122 | 0.438 ± 0.265 | 0.986 ± 0.297 | 0.876 ± 0.36 | 0.876 ± 0.273 | 0.438 ± 0.197 | 0.657 ± 0.29 | 0.767 ± 0.289 | 1.205 ± 0.338 | 0.219 ± 0.17 | 0.767 ± 0.395 | 0.11 ± 0.113 | 0.548 ± 0.237 | 0.219 ± 0.153 | 0.219 ± 0.152 | 0.438 ± 0.199 | 0.767 ± 0.295 | 0.0 ± 0.0 | 0.548 ± 0.269 | 0.0 ± 0.0 |
| Tyr | 1.752 ± 0.452 | 0.657 ± 0.328 | 4.6 ± 0.46 | 2.409 ± 0.579 | 1.424 ± 0.432 | 2.519 ± 0.596 | 0.329 ± 0.182 | 2.957 ± 0.465 | 2.847 ± 0.579 | 2.957 ± 0.692 | 1.205 ± 0.36 | 2.738 ± 0.595 | 1.862 ± 0.389 | 2.409 ± 0.451 | 2.628 ± 0.575 | 3.614 ± 0.73 | 2.519 ± 0.556 | 3.505 ± 0.546 | 0.329 ± 0.217 | 1.971 ± 0.475 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.11 ± 0.088 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.11 ± 0.088 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 49 proteins (9132 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
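A protein-level bootstrap of this kind can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the database's actual procedure: the function names are hypothetical, whole proteins are resampled with replacement, and the error is taken to be the standard deviation of the 100 resampled per mille frequencies.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the spread of a dipeptide frequency (per mille).

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, and the reported error is the standard deviation of
    the replicate frequencies.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy, made-up sequences (hypothetical, not taken from the proteome).
toy = ["MKKLLA", "MKKA", "MALLK", "MKKKA"]
print(bootstrap_error(toy, "KK"))
```

Under this scheme a dipeptide that never occurs gets an error of exactly zero, which is consistent with the 0.0 ± 0.0 entries in the table.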

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski