Amino acid dipepetide frequency for Leuconostoc phage LN34

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.719AlaAla: 0.719 ± 0.331
0.36AlaCys: 0.36 ± 0.189
3.956AlaAsp: 3.956 ± 0.823
2.997AlaGlu: 2.997 ± 0.551
2.997AlaPhe: 2.997 ± 0.841
4.316AlaGly: 4.316 ± 0.846
0.719AlaHis: 0.719 ± 0.32
5.155AlaIle: 5.155 ± 1.07
4.676AlaLys: 4.676 ± 0.676
6.354AlaLeu: 6.354 ± 0.974
1.439AlaMet: 1.439 ± 0.356
3.836AlaAsn: 3.836 ± 0.712
1.678AlaPro: 1.678 ± 0.377
1.678AlaGln: 1.678 ± 0.484
1.439AlaArg: 1.439 ± 0.374
3.237AlaSer: 3.237 ± 1.06
4.436AlaThr: 4.436 ± 0.61
3.956AlaVal: 3.956 ± 0.664
1.079AlaTrp: 1.079 ± 0.434
1.798AlaTyr: 1.798 ± 0.396
0.0AlaXaa: 0.0 ± 0.0
Cys
0.24CysAla: 0.24 ± 0.168
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.36CysGlu: 0.36 ± 0.197
0.24CysPhe: 0.24 ± 0.148
0.36CysGly: 0.36 ± 0.293
0.36CysHis: 0.36 ± 0.195
0.12CysIle: 0.12 ± 0.119
0.12CysLys: 0.12 ± 0.126
0.24CysLeu: 0.24 ± 0.156
0.12CysMet: 0.12 ± 0.126
0.24CysAsn: 0.24 ± 0.157
0.0CysPro: 0.0 ± 0.0
0.12CysGln: 0.12 ± 0.126
0.12CysArg: 0.12 ± 0.106
0.36CysSer: 0.36 ± 0.208
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.24CysTrp: 0.24 ± 0.17
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.597AspAla: 3.597 ± 0.792
0.24AspCys: 0.24 ± 0.169
5.035AspAsp: 5.035 ± 1.095
5.515AspGlu: 5.515 ± 0.961
2.638AspPhe: 2.638 ± 0.676
4.556AspGly: 4.556 ± 0.708
1.319AspHis: 1.319 ± 0.38
5.035AspIle: 5.035 ± 0.793
4.676AspLys: 4.676 ± 0.883
4.676AspLeu: 4.676 ± 0.612
1.319AspMet: 1.319 ± 0.333
5.275AspAsn: 5.275 ± 0.943
2.038AspPro: 2.038 ± 0.484
1.079AspGln: 1.079 ± 0.299
2.038AspArg: 2.038 ± 0.394
3.717AspSer: 3.717 ± 0.588
3.597AspThr: 3.597 ± 0.71
5.155AspVal: 5.155 ± 0.741
1.319AspTrp: 1.319 ± 0.33
3.956AspTyr: 3.956 ± 0.932
0.0AspXaa: 0.0 ± 0.0
Glu
2.278GluAla: 2.278 ± 0.711
0.12GluCys: 0.12 ± 0.126
3.717GluAsp: 3.717 ± 0.625
4.676GluGlu: 4.676 ± 1.191
2.518GluPhe: 2.518 ± 0.594
2.398GluGly: 2.398 ± 0.439
0.959GluHis: 0.959 ± 0.329
3.597GluIle: 3.597 ± 0.774
4.796GluLys: 4.796 ± 0.899
5.875GluLeu: 5.875 ± 0.949
3.117GluMet: 3.117 ± 0.615
4.196GluAsn: 4.196 ± 0.674
0.959GluPro: 0.959 ± 0.29
2.638GluGln: 2.638 ± 0.616
1.798GluArg: 1.798 ± 0.466
2.638GluSer: 2.638 ± 0.571
3.357GluThr: 3.357 ± 0.599
5.755GluVal: 5.755 ± 0.887
0.719GluTrp: 0.719 ± 0.264
2.877GluTyr: 2.877 ± 0.613
0.0GluXaa: 0.0 ± 0.0
Phe
2.038PheAla: 2.038 ± 0.583
0.0PheCys: 0.0 ± 0.0
4.796PheAsp: 4.796 ± 0.592
3.117PheGlu: 3.117 ± 0.665
0.839PhePhe: 0.839 ± 0.259
2.877PheGly: 2.877 ± 0.359
0.24PheHis: 0.24 ± 0.149
3.956PheIle: 3.956 ± 0.513
2.398PheLys: 2.398 ± 0.601
2.997PheLeu: 2.997 ± 0.625
1.199PheMet: 1.199 ± 0.317
2.877PheAsn: 2.877 ± 0.565
1.079PhePro: 1.079 ± 0.343
0.959PheGln: 0.959 ± 0.38
1.439PheArg: 1.439 ± 0.365
5.755PheSer: 5.755 ± 1.825
3.597PheThr: 3.597 ± 0.642
2.278PheVal: 2.278 ± 0.341
0.48PheTrp: 0.48 ± 0.301
2.038PheTyr: 2.038 ± 0.559
0.0PheXaa: 0.0 ± 0.0
Gly
3.477GlyAla: 3.477 ± 0.683
0.24GlyCys: 0.24 ± 0.147
4.796GlyAsp: 4.796 ± 0.858
2.278GlyGlu: 2.278 ± 0.502
4.076GlyPhe: 4.076 ± 0.622
5.275GlyGly: 5.275 ± 1.224
0.839GlyHis: 0.839 ± 0.337
5.755GlyIle: 5.755 ± 2.113
6.354GlyLys: 6.354 ± 0.859
5.635GlyLeu: 5.635 ± 0.689
1.918GlyMet: 1.918 ± 0.375
3.477GlyAsn: 3.477 ± 0.456
0.24GlyPro: 0.24 ± 0.158
2.278GlyGln: 2.278 ± 0.535
3.237GlyArg: 3.237 ± 0.675
5.515GlySer: 5.515 ± 1.085
3.956GlyThr: 3.956 ± 0.722
5.035GlyVal: 5.035 ± 1.082
0.48GlyTrp: 0.48 ± 0.224
3.717GlyTyr: 3.717 ± 0.826
0.0GlyXaa: 0.0 ± 0.0
His
0.599HisAla: 0.599 ± 0.344
0.0HisCys: 0.0 ± 0.0
0.959HisAsp: 0.959 ± 0.34
1.319HisGlu: 1.319 ± 0.396
0.719HisPhe: 0.719 ± 0.255
0.959HisGly: 0.959 ± 0.299
0.24HisHis: 0.24 ± 0.157
0.719HisIle: 0.719 ± 0.258
0.48HisLys: 0.48 ± 0.251
1.439HisLeu: 1.439 ± 0.561
1.079HisMet: 1.079 ± 0.311
0.36HisAsn: 0.36 ± 0.221
0.599HisPro: 0.599 ± 0.256
0.599HisGln: 0.599 ± 0.309
0.36HisArg: 0.36 ± 0.181
0.839HisSer: 0.839 ± 0.306
1.319HisThr: 1.319 ± 0.559
0.48HisVal: 0.48 ± 0.233
0.0HisTrp: 0.0 ± 0.0
0.839HisTyr: 0.839 ± 0.375
0.0HisXaa: 0.0 ± 0.0
Ile
4.676IleAla: 4.676 ± 1.163
0.12IleCys: 0.12 ± 0.125
5.515IleAsp: 5.515 ± 0.708
5.035IleGlu: 5.035 ± 0.856
3.117IlePhe: 3.117 ± 0.608
5.275IleGly: 5.275 ± 1.486
0.839IleHis: 0.839 ± 0.292
4.676IleIle: 4.676 ± 0.663
5.635IleLys: 5.635 ± 0.993
4.915IleLeu: 4.915 ± 0.985
1.798IleMet: 1.798 ± 0.401
4.436IleAsn: 4.436 ± 0.49
1.798IlePro: 1.798 ± 0.418
2.877IleGln: 2.877 ± 0.513
1.678IleArg: 1.678 ± 0.382
6.474IleSer: 6.474 ± 1.033
5.515IleThr: 5.515 ± 0.878
4.196IleVal: 4.196 ± 0.604
0.48IleTrp: 0.48 ± 0.236
2.757IleTyr: 2.757 ± 0.596
0.0IleXaa: 0.0 ± 0.0
Lys
5.035LysAla: 5.035 ± 0.696
0.24LysCys: 0.24 ± 0.184
4.196LysAsp: 4.196 ± 0.803
3.956LysGlu: 3.956 ± 0.835
2.997LysPhe: 2.997 ± 0.585
4.076LysGly: 4.076 ± 0.605
1.798LysHis: 1.798 ± 0.488
5.635LysIle: 5.635 ± 0.622
5.515LysLys: 5.515 ± 1.046
6.954LysLeu: 6.954 ± 1.134
2.997LysMet: 2.997 ± 0.724
4.076LysAsn: 4.076 ± 0.653
2.278LysPro: 2.278 ± 0.626
2.877LysGln: 2.877 ± 0.528
3.597LysArg: 3.597 ± 0.763
4.676LysSer: 4.676 ± 0.793
5.635LysThr: 5.635 ± 0.781
3.717LysVal: 3.717 ± 0.795
0.959LysTrp: 0.959 ± 0.335
3.717LysTyr: 3.717 ± 0.835
0.0LysXaa: 0.0 ± 0.0
Leu
5.035LeuAla: 5.035 ± 0.796
0.36LeuCys: 0.36 ± 0.194
6.954LeuAsp: 6.954 ± 0.642
5.994LeuGlu: 5.994 ± 0.99
3.357LeuPhe: 3.357 ± 0.579
6.594LeuGly: 6.594 ± 1.077
0.959LeuHis: 0.959 ± 0.292
4.076LeuIle: 4.076 ± 0.737
7.913LeuLys: 7.913 ± 1.122
5.635LeuLeu: 5.635 ± 0.691
2.278LeuMet: 2.278 ± 0.489
6.114LeuAsn: 6.114 ± 0.801
2.518LeuPro: 2.518 ± 0.556
2.757LeuGln: 2.757 ± 0.569
2.638LeuArg: 2.638 ± 0.479
5.755LeuSer: 5.755 ± 1.041
5.275LeuThr: 5.275 ± 0.921
5.395LeuVal: 5.395 ± 1.003
0.36LeuTrp: 0.36 ± 0.179
2.518LeuTyr: 2.518 ± 0.575
0.0LeuXaa: 0.0 ± 0.0
Met
2.638MetAla: 2.638 ± 0.468
0.12MetCys: 0.12 ± 0.123
1.199MetAsp: 1.199 ± 0.365
1.079MetGlu: 1.079 ± 0.357
1.199MetPhe: 1.199 ± 0.418
2.158MetGly: 2.158 ± 0.531
0.24MetHis: 0.24 ± 0.16
1.319MetIle: 1.319 ± 0.362
2.038MetLys: 2.038 ± 0.478
1.918MetLeu: 1.918 ± 0.463
0.599MetMet: 0.599 ± 0.268
2.038MetAsn: 2.038 ± 0.634
1.199MetPro: 1.199 ± 0.366
0.599MetGln: 0.599 ± 0.242
0.839MetArg: 0.839 ± 0.3
2.518MetSer: 2.518 ± 0.486
2.757MetThr: 2.757 ± 0.811
1.918MetVal: 1.918 ± 0.378
0.36MetTrp: 0.36 ± 0.186
1.798MetTyr: 1.798 ± 0.404
0.0MetXaa: 0.0 ± 0.0
Asn
4.076AsnAla: 4.076 ± 0.715
0.36AsnCys: 0.36 ± 0.2
2.757AsnAsp: 2.757 ± 0.564
4.556AsnGlu: 4.556 ± 0.876
2.997AsnPhe: 2.997 ± 0.64
5.395AsnGly: 5.395 ± 0.969
0.959AsnHis: 0.959 ± 0.314
5.395AsnIle: 5.395 ± 0.811
4.436AsnLys: 4.436 ± 0.711
4.796AsnLeu: 4.796 ± 1.07
1.798AsnMet: 1.798 ± 0.388
5.275AsnAsn: 5.275 ± 0.922
3.237AsnPro: 3.237 ± 0.704
1.798AsnGln: 1.798 ± 0.402
2.757AsnArg: 2.757 ± 0.665
3.956AsnSer: 3.956 ± 0.687
4.436AsnThr: 4.436 ± 0.843
4.796AsnVal: 4.796 ± 0.563
0.48AsnTrp: 0.48 ± 0.235
3.357AsnTyr: 3.357 ± 0.773
0.0AsnXaa: 0.0 ± 0.0
Pro
2.038ProAla: 2.038 ± 0.509
0.0ProCys: 0.0 ± 0.0
2.518ProAsp: 2.518 ± 0.537
1.798ProGlu: 1.798 ± 0.464
1.439ProPhe: 1.439 ± 0.361
0.719ProGly: 0.719 ± 0.242
0.24ProHis: 0.24 ± 0.19
1.678ProIle: 1.678 ± 0.419
2.518ProLys: 2.518 ± 0.528
1.798ProLeu: 1.798 ± 0.474
1.079ProMet: 1.079 ± 0.377
2.638ProAsn: 2.638 ± 0.54
0.36ProPro: 0.36 ± 0.193
0.959ProGln: 0.959 ± 0.362
0.719ProArg: 0.719 ± 0.395
2.038ProSer: 2.038 ± 0.4
2.877ProThr: 2.877 ± 0.501
1.319ProVal: 1.319 ± 0.432
0.0ProTrp: 0.0 ± 0.0
1.439ProTyr: 1.439 ± 0.41
0.0ProXaa: 0.0 ± 0.0
Gln
3.117GlnAla: 3.117 ± 0.599
0.0GlnCys: 0.0 ± 0.0
2.518GlnAsp: 2.518 ± 0.484
1.918GlnGlu: 1.918 ± 0.442
1.918GlnPhe: 1.918 ± 0.453
1.559GlnGly: 1.559 ± 0.392
0.48GlnHis: 0.48 ± 0.257
2.877GlnIle: 2.877 ± 0.569
2.518GlnLys: 2.518 ± 0.524
2.997GlnLeu: 2.997 ± 0.594
0.719GlnMet: 0.719 ± 0.264
2.278GlnAsn: 2.278 ± 0.498
0.839GlnPro: 0.839 ± 0.277
0.959GlnGln: 0.959 ± 0.375
1.678GlnArg: 1.678 ± 0.411
2.638GlnSer: 2.638 ± 0.484
1.559GlnThr: 1.559 ± 0.558
2.877GlnVal: 2.877 ± 0.452
0.24GlnTrp: 0.24 ± 0.161
0.959GlnTyr: 0.959 ± 0.362
0.0GlnXaa: 0.0 ± 0.0
Arg
2.038ArgAla: 2.038 ± 0.409
0.36ArgCys: 0.36 ± 0.249
2.518ArgAsp: 2.518 ± 0.526
1.678ArgGlu: 1.678 ± 0.524
1.319ArgPhe: 1.319 ± 0.465
2.158ArgGly: 2.158 ± 0.545
0.36ArgHis: 0.36 ± 0.202
2.638ArgIle: 2.638 ± 0.643
1.798ArgLys: 1.798 ± 0.47
4.915ArgLeu: 4.915 ± 0.713
1.079ArgMet: 1.079 ± 0.287
2.877ArgAsn: 2.877 ± 0.501
0.959ArgPro: 0.959 ± 0.399
1.439ArgGln: 1.439 ± 0.384
1.439ArgArg: 1.439 ± 0.566
2.398ArgSer: 2.398 ± 0.53
2.518ArgThr: 2.518 ± 0.455
2.518ArgVal: 2.518 ± 0.496
0.599ArgTrp: 0.599 ± 0.249
0.959ArgTyr: 0.959 ± 0.354
0.0ArgXaa: 0.0 ± 0.0
Ser
4.076SerAla: 4.076 ± 1.088
0.0SerCys: 0.0 ± 0.0
4.796SerAsp: 4.796 ± 0.855
2.518SerGlu: 2.518 ± 0.533
3.597SerPhe: 3.597 ± 0.559
5.755SerGly: 5.755 ± 1.692
0.719SerHis: 0.719 ± 0.33
5.755SerIle: 5.755 ± 1.134
4.196SerLys: 4.196 ± 0.658
5.035SerLeu: 5.035 ± 0.893
1.918SerMet: 1.918 ± 0.456
4.556SerAsn: 4.556 ± 0.863
1.798SerPro: 1.798 ± 0.46
3.237SerGln: 3.237 ± 0.566
2.638SerArg: 2.638 ± 0.6
6.354SerSer: 6.354 ± 1.98
6.474SerThr: 6.474 ± 0.813
5.635SerVal: 5.635 ± 1.062
0.599SerTrp: 0.599 ± 0.297
3.237SerTyr: 3.237 ± 0.648
0.0SerXaa: 0.0 ± 0.0
Thr
4.076ThrAla: 4.076 ± 0.567
0.12ThrCys: 0.12 ± 0.126
2.757ThrAsp: 2.757 ± 0.584
2.638ThrGlu: 2.638 ± 0.619
3.477ThrPhe: 3.477 ± 0.689
5.155ThrGly: 5.155 ± 0.593
1.079ThrHis: 1.079 ± 0.39
4.556ThrIle: 4.556 ± 0.838
6.834ThrLys: 6.834 ± 0.588
6.114ThrLeu: 6.114 ± 1.026
1.079ThrMet: 1.079 ± 0.316
5.275ThrAsn: 5.275 ± 0.825
3.237ThrPro: 3.237 ± 0.513
4.076ThrGln: 4.076 ± 0.737
2.638ThrArg: 2.638 ± 0.397
5.035ThrSer: 5.035 ± 0.727
4.316ThrThr: 4.316 ± 0.801
3.717ThrVal: 3.717 ± 0.623
0.48ThrTrp: 0.48 ± 0.198
1.918ThrTyr: 1.918 ± 0.517
0.0ThrXaa: 0.0 ± 0.0
Val
4.076ValAla: 4.076 ± 0.606
0.24ValCys: 0.24 ± 0.158
3.836ValAsp: 3.836 ± 0.662
3.717ValGlu: 3.717 ± 0.446
3.477ValPhe: 3.477 ± 0.723
5.035ValGly: 5.035 ± 0.993
0.36ValHis: 0.36 ± 0.175
5.755ValIle: 5.755 ± 0.986
4.076ValLys: 4.076 ± 0.6
4.676ValLeu: 4.676 ± 0.656
1.559ValMet: 1.559 ± 0.361
3.956ValAsn: 3.956 ± 0.694
1.918ValPro: 1.918 ± 0.46
1.798ValGln: 1.798 ± 0.481
3.836ValArg: 3.836 ± 0.694
4.796ValSer: 4.796 ± 0.853
4.076ValThr: 4.076 ± 0.669
3.717ValVal: 3.717 ± 0.818
1.559ValTrp: 1.559 ± 0.451
3.477ValTyr: 3.477 ± 0.647
0.0ValXaa: 0.0 ± 0.0
Trp
0.36TrpAla: 0.36 ± 0.178
0.0TrpCys: 0.0 ± 0.0
1.079TrpAsp: 1.079 ± 0.378
0.599TrpGlu: 0.599 ± 0.276
0.24TrpPhe: 0.24 ± 0.158
1.319TrpGly: 1.319 ± 0.348
0.48TrpHis: 0.48 ± 0.233
0.12TrpIle: 0.12 ± 0.106
0.599TrpLys: 0.599 ± 0.219
0.959TrpLeu: 0.959 ± 0.493
0.24TrpMet: 0.24 ± 0.224
0.48TrpAsn: 0.48 ± 0.217
0.0TrpPro: 0.0 ± 0.0
0.719TrpGln: 0.719 ± 0.246
0.48TrpArg: 0.48 ± 0.3
1.079TrpSer: 1.079 ± 0.342
0.839TrpThr: 0.839 ± 0.39
0.48TrpVal: 0.48 ± 0.191
0.24TrpTrp: 0.24 ± 0.154
0.48TrpTyr: 0.48 ± 0.218
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.638TyrAla: 2.638 ± 0.453
0.36TyrCys: 0.36 ± 0.195
2.877TyrAsp: 2.877 ± 0.717
3.237TyrGlu: 3.237 ± 0.713
1.798TyrPhe: 1.798 ± 0.468
2.638TyrGly: 2.638 ± 0.616
0.719TyrHis: 0.719 ± 0.261
2.997TyrIle: 2.997 ± 0.666
3.357TyrLys: 3.357 ± 0.722
4.556TyrLeu: 4.556 ± 0.689
1.079TyrMet: 1.079 ± 0.307
3.357TyrAsn: 3.357 ± 0.66
1.439TyrPro: 1.439 ± 0.446
1.199TyrGln: 1.199 ± 0.399
1.199TyrArg: 1.199 ± 0.484
3.117TyrSer: 3.117 ± 0.789
2.038TyrThr: 2.038 ± 0.469
2.997TyrVal: 2.997 ± 0.585
0.24TyrTrp: 0.24 ± 0.172
1.918TyrTyr: 1.918 ± 0.457
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41 proteins (8342 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski