Amino acid dipepetide frequency for Escherichia phage ECP1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.051AlaAla: 10.051 ± 1.625
0.852AlaCys: 0.852 ± 0.323
6.303AlaAsp: 6.303 ± 1.018
6.474AlaGlu: 6.474 ± 0.695
3.237AlaPhe: 3.237 ± 0.514
7.922AlaGly: 7.922 ± 0.895
1.789AlaHis: 1.789 ± 0.426
6.814AlaIle: 6.814 ± 0.597
4.514AlaLys: 4.514 ± 0.806
7.24AlaLeu: 7.24 ± 0.713
3.492AlaMet: 3.492 ± 0.571
4.94AlaAsn: 4.94 ± 0.662
2.129AlaPro: 2.129 ± 0.368
4.94AlaGln: 4.94 ± 0.741
4.003AlaArg: 4.003 ± 0.598
6.559AlaSer: 6.559 ± 1.34
5.111AlaThr: 5.111 ± 0.79
5.963AlaVal: 5.963 ± 0.814
1.789AlaTrp: 1.789 ± 0.33
3.237AlaTyr: 3.237 ± 0.444
0.0AlaXaa: 0.0 ± 0.0
Cys
1.278CysAla: 1.278 ± 0.375
0.17CysCys: 0.17 ± 0.111
0.511CysAsp: 0.511 ± 0.176
0.937CysGlu: 0.937 ± 0.286
0.17CysPhe: 0.17 ± 0.121
1.448CysGly: 1.448 ± 0.372
0.596CysHis: 0.596 ± 0.27
0.681CysIle: 0.681 ± 0.234
0.852CysLys: 0.852 ± 0.239
1.022CysLeu: 1.022 ± 0.319
0.085CysMet: 0.085 ± 0.081
0.256CysAsn: 0.256 ± 0.134
0.426CysPro: 0.426 ± 0.258
0.341CysGln: 0.341 ± 0.188
0.852CysArg: 0.852 ± 0.302
0.767CysSer: 0.767 ± 0.272
0.256CysThr: 0.256 ± 0.124
0.767CysVal: 0.767 ± 0.287
0.341CysTrp: 0.341 ± 0.196
0.256CysTyr: 0.256 ± 0.217
0.0CysXaa: 0.0 ± 0.0
Asp
5.792AspAla: 5.792 ± 0.649
0.852AspCys: 0.852 ± 0.28
3.833AspAsp: 3.833 ± 0.625
3.492AspGlu: 3.492 ± 0.497
1.704AspPhe: 1.704 ± 0.478
5.196AspGly: 5.196 ± 0.694
0.767AspHis: 0.767 ± 0.321
2.726AspIle: 2.726 ± 0.415
2.726AspLys: 2.726 ± 0.493
4.514AspLeu: 4.514 ± 0.621
1.874AspMet: 1.874 ± 0.459
2.555AspAsn: 2.555 ± 0.389
1.533AspPro: 1.533 ± 0.331
2.044AspGln: 2.044 ± 0.367
3.578AspArg: 3.578 ± 0.545
3.492AspSer: 3.492 ± 0.506
1.618AspThr: 1.618 ± 0.481
5.026AspVal: 5.026 ± 0.805
0.937AspTrp: 0.937 ± 0.224
2.044AspTyr: 2.044 ± 0.503
0.0AspXaa: 0.0 ± 0.0
Glu
5.963GluAla: 5.963 ± 0.948
0.852GluCys: 0.852 ± 0.306
2.385GluAsp: 2.385 ± 0.337
4.429GluGlu: 4.429 ± 0.808
2.641GluPhe: 2.641 ± 0.444
3.663GluGly: 3.663 ± 0.61
1.022GluHis: 1.022 ± 0.284
3.152GluIle: 3.152 ± 0.458
3.492GluLys: 3.492 ± 0.494
5.792GluLeu: 5.792 ± 0.638
1.789GluMet: 1.789 ± 0.46
3.152GluAsn: 3.152 ± 0.474
1.959GluPro: 1.959 ± 0.376
4.003GluGln: 4.003 ± 0.632
3.578GluArg: 3.578 ± 0.62
4.855GluSer: 4.855 ± 0.78
3.066GluThr: 3.066 ± 0.632
3.066GluVal: 3.066 ± 0.55
1.107GluTrp: 1.107 ± 0.374
1.533GluTyr: 1.533 ± 0.352
0.0GluXaa: 0.0 ± 0.0
Phe
2.981PheAla: 2.981 ± 0.505
0.341PheCys: 0.341 ± 0.169
2.385PheAsp: 2.385 ± 0.388
1.789PheGlu: 1.789 ± 0.486
0.681PhePhe: 0.681 ± 0.243
2.555PheGly: 2.555 ± 0.454
0.256PheHis: 0.256 ± 0.137
2.385PheIle: 2.385 ± 0.537
1.533PheLys: 1.533 ± 0.353
1.363PheLeu: 1.363 ± 0.329
0.767PheMet: 0.767 ± 0.263
1.533PheAsn: 1.533 ± 0.307
1.363PhePro: 1.363 ± 0.342
0.767PheGln: 0.767 ± 0.233
1.874PheArg: 1.874 ± 0.447
2.129PheSer: 2.129 ± 0.433
2.641PheThr: 2.641 ± 0.538
2.044PheVal: 2.044 ± 0.386
0.767PheTrp: 0.767 ± 0.194
1.022PheTyr: 1.022 ± 0.31
0.0PheXaa: 0.0 ± 0.0
Gly
6.474GlyAla: 6.474 ± 1.009
0.767GlyCys: 0.767 ± 0.215
3.833GlyAsp: 3.833 ± 0.619
4.344GlyGlu: 4.344 ± 0.613
3.237GlyPhe: 3.237 ± 0.514
6.303GlyGly: 6.303 ± 0.92
1.107GlyHis: 1.107 ± 0.286
4.77GlyIle: 4.77 ± 0.586
4.259GlyLys: 4.259 ± 0.615
6.303GlyLeu: 6.303 ± 0.725
2.896GlyMet: 2.896 ± 0.454
4.174GlyAsn: 4.174 ± 0.685
1.533GlyPro: 1.533 ± 0.402
3.066GlyGln: 3.066 ± 0.504
4.6GlyArg: 4.6 ± 0.655
4.77GlySer: 4.77 ± 0.713
5.537GlyThr: 5.537 ± 0.824
4.94GlyVal: 4.94 ± 0.509
1.278GlyTrp: 1.278 ± 0.415
2.385GlyTyr: 2.385 ± 0.434
0.0GlyXaa: 0.0 ± 0.0
His
1.363HisAla: 1.363 ± 0.412
0.341HisCys: 0.341 ± 0.167
1.022HisAsp: 1.022 ± 0.263
1.107HisGlu: 1.107 ± 0.356
0.511HisPhe: 0.511 ± 0.209
1.278HisGly: 1.278 ± 0.277
0.511HisHis: 0.511 ± 0.25
0.681HisIle: 0.681 ± 0.266
1.022HisLys: 1.022 ± 0.362
1.704HisLeu: 1.704 ± 0.544
0.17HisMet: 0.17 ± 0.112
0.596HisAsn: 0.596 ± 0.214
1.022HisPro: 1.022 ± 0.311
0.767HisGln: 0.767 ± 0.274
2.044HisArg: 2.044 ± 0.388
1.022HisSer: 1.022 ± 0.313
0.426HisThr: 0.426 ± 0.188
0.681HisVal: 0.681 ± 0.302
0.256HisTrp: 0.256 ± 0.142
0.426HisTyr: 0.426 ± 0.207
0.0HisXaa: 0.0 ± 0.0
Ile
4.514IleAla: 4.514 ± 0.56
0.767IleCys: 0.767 ± 0.337
4.259IleAsp: 4.259 ± 0.548
3.237IleGlu: 3.237 ± 0.386
1.704IlePhe: 1.704 ± 0.352
3.663IleGly: 3.663 ± 0.567
0.937IleHis: 0.937 ± 0.265
2.896IleIle: 2.896 ± 0.481
2.641IleLys: 2.641 ± 0.51
4.174IleLeu: 4.174 ± 0.546
1.193IleMet: 1.193 ± 0.371
2.896IleAsn: 2.896 ± 0.511
2.129IlePro: 2.129 ± 0.385
2.641IleGln: 2.641 ± 0.581
3.663IleArg: 3.663 ± 0.631
4.344IleSer: 4.344 ± 0.682
4.259IleThr: 4.259 ± 0.565
3.322IleVal: 3.322 ± 0.603
0.767IleTrp: 0.767 ± 0.229
1.533IleTyr: 1.533 ± 0.293
0.0IleXaa: 0.0 ± 0.0
Lys
6.644LysAla: 6.644 ± 0.932
0.767LysCys: 0.767 ± 0.313
2.981LysAsp: 2.981 ± 0.594
3.918LysGlu: 3.918 ± 0.596
1.448LysPhe: 1.448 ± 0.359
3.748LysGly: 3.748 ± 0.532
0.767LysHis: 0.767 ± 0.33
2.044LysIle: 2.044 ± 0.472
3.833LysLys: 3.833 ± 0.738
4.429LysLeu: 4.429 ± 0.546
2.129LysMet: 2.129 ± 0.463
2.215LysAsn: 2.215 ± 0.403
2.555LysPro: 2.555 ± 0.585
2.981LysGln: 2.981 ± 0.654
2.47LysArg: 2.47 ± 0.575
3.833LysSer: 3.833 ± 0.748
3.152LysThr: 3.152 ± 0.424
2.981LysVal: 2.981 ± 0.486
1.193LysTrp: 1.193 ± 0.278
1.704LysTyr: 1.704 ± 0.343
0.0LysXaa: 0.0 ± 0.0
Leu
9.114LeuAla: 9.114 ± 1.068
1.022LeuCys: 1.022 ± 0.314
4.6LeuAsp: 4.6 ± 0.403
5.963LeuGlu: 5.963 ± 0.65
1.874LeuPhe: 1.874 ± 0.4
4.089LeuGly: 4.089 ± 0.51
0.852LeuHis: 0.852 ± 0.315
4.344LeuIle: 4.344 ± 0.5
4.259LeuLys: 4.259 ± 0.676
4.94LeuLeu: 4.94 ± 0.632
1.363LeuMet: 1.363 ± 0.326
4.344LeuAsn: 4.344 ± 0.612
3.152LeuPro: 3.152 ± 0.515
2.896LeuGln: 2.896 ± 0.54
6.388LeuArg: 6.388 ± 0.826
5.622LeuSer: 5.622 ± 0.774
5.622LeuThr: 5.622 ± 0.623
4.089LeuVal: 4.089 ± 0.611
0.852LeuTrp: 0.852 ± 0.261
2.641LeuTyr: 2.641 ± 0.459
0.0LeuXaa: 0.0 ± 0.0
Met
3.066MetAla: 3.066 ± 0.418
0.341MetCys: 0.341 ± 0.165
1.107MetAsp: 1.107 ± 0.314
1.959MetGlu: 1.959 ± 0.398
0.937MetPhe: 0.937 ± 0.267
1.704MetGly: 1.704 ± 0.266
0.426MetHis: 0.426 ± 0.174
0.937MetIle: 0.937 ± 0.295
2.215MetLys: 2.215 ± 0.492
3.152MetLeu: 3.152 ± 0.644
1.022MetMet: 1.022 ± 0.393
0.767MetAsn: 0.767 ± 0.294
1.363MetPro: 1.363 ± 0.293
1.278MetGln: 1.278 ± 0.289
1.618MetArg: 1.618 ± 0.32
2.3MetSer: 2.3 ± 0.425
2.215MetThr: 2.215 ± 0.373
1.022MetVal: 1.022 ± 0.253
0.426MetTrp: 0.426 ± 0.185
0.596MetTyr: 0.596 ± 0.178
0.0MetXaa: 0.0 ± 0.0
Asn
4.429AsnAla: 4.429 ± 0.569
0.596AsnCys: 0.596 ± 0.26
2.129AsnAsp: 2.129 ± 0.41
2.215AsnGlu: 2.215 ± 0.433
1.107AsnPhe: 1.107 ± 0.326
4.514AsnGly: 4.514 ± 0.511
1.107AsnHis: 1.107 ± 0.322
2.726AsnIle: 2.726 ± 0.501
2.215AsnLys: 2.215 ± 0.393
2.811AsnLeu: 2.811 ± 0.47
1.022AsnMet: 1.022 ± 0.341
2.3AsnAsn: 2.3 ± 0.457
2.47AsnPro: 2.47 ± 0.489
2.555AsnGln: 2.555 ± 0.379
2.3AsnArg: 2.3 ± 0.508
2.896AsnSer: 2.896 ± 0.624
3.663AsnThr: 3.663 ± 0.539
1.874AsnVal: 1.874 ± 0.398
1.107AsnTrp: 1.107 ± 0.237
1.448AsnTyr: 1.448 ± 0.43
0.0AsnXaa: 0.0 ± 0.0
Pro
3.322ProAla: 3.322 ± 0.538
0.426ProCys: 0.426 ± 0.229
2.3ProAsp: 2.3 ± 0.515
2.3ProGlu: 2.3 ± 0.448
1.363ProPhe: 1.363 ± 0.359
3.237ProGly: 3.237 ± 0.499
0.681ProHis: 0.681 ± 0.239
1.107ProIle: 1.107 ± 0.314
1.959ProLys: 1.959 ± 0.457
2.47ProLeu: 2.47 ± 0.65
0.937ProMet: 0.937 ± 0.301
1.022ProAsn: 1.022 ± 0.33
1.874ProPro: 1.874 ± 0.333
2.47ProGln: 2.47 ± 0.454
2.129ProArg: 2.129 ± 0.39
2.641ProSer: 2.641 ± 0.464
1.533ProThr: 1.533 ± 0.325
2.641ProVal: 2.641 ± 0.454
0.511ProTrp: 0.511 ± 0.218
1.278ProTyr: 1.278 ± 0.319
0.0ProXaa: 0.0 ± 0.0
Gln
5.622GlnAla: 5.622 ± 0.986
0.341GlnCys: 0.341 ± 0.169
1.618GlnAsp: 1.618 ± 0.364
1.959GlnGlu: 1.959 ± 0.367
1.107GlnPhe: 1.107 ± 0.315
2.129GlnGly: 2.129 ± 0.41
0.767GlnHis: 0.767 ± 0.272
2.47GlnIle: 2.47 ± 0.356
3.748GlnLys: 3.748 ± 0.612
3.748GlnLeu: 3.748 ± 0.631
1.278GlnMet: 1.278 ± 0.352
1.959GlnAsn: 1.959 ± 0.467
1.533GlnPro: 1.533 ± 0.337
3.066GlnGln: 3.066 ± 0.545
4.003GlnArg: 4.003 ± 0.662
3.492GlnSer: 3.492 ± 0.526
3.237GlnThr: 3.237 ± 0.602
3.492GlnVal: 3.492 ± 0.547
0.426GlnTrp: 0.426 ± 0.226
1.278GlnTyr: 1.278 ± 0.406
0.0GlnXaa: 0.0 ± 0.0
Arg
5.196ArgAla: 5.196 ± 0.678
0.767ArgCys: 0.767 ± 0.289
3.748ArgAsp: 3.748 ± 0.603
3.492ArgGlu: 3.492 ± 0.704
1.618ArgPhe: 1.618 ± 0.311
3.918ArgGly: 3.918 ± 0.541
1.448ArgHis: 1.448 ± 0.357
3.578ArgIle: 3.578 ± 0.506
4.855ArgLys: 4.855 ± 0.612
5.196ArgLeu: 5.196 ± 0.558
2.044ArgMet: 2.044 ± 0.467
3.663ArgAsn: 3.663 ± 0.436
1.533ArgPro: 1.533 ± 0.319
2.3ArgGln: 2.3 ± 0.416
3.748ArgArg: 3.748 ± 0.762
3.407ArgSer: 3.407 ± 0.515
3.918ArgThr: 3.918 ± 0.456
4.174ArgVal: 4.174 ± 0.639
1.448ArgTrp: 1.448 ± 0.394
2.215ArgTyr: 2.215 ± 0.436
0.0ArgXaa: 0.0 ± 0.0
Ser
5.537SerAla: 5.537 ± 0.821
0.767SerCys: 0.767 ± 0.256
3.833SerAsp: 3.833 ± 0.515
4.174SerGlu: 4.174 ± 0.553
2.3SerPhe: 2.3 ± 0.446
6.729SerGly: 6.729 ± 0.744
0.852SerHis: 0.852 ± 0.25
3.492SerIle: 3.492 ± 0.421
3.407SerLys: 3.407 ± 0.569
6.218SerLeu: 6.218 ± 0.832
1.704SerMet: 1.704 ± 0.378
2.385SerAsn: 2.385 ± 0.506
2.811SerPro: 2.811 ± 0.465
3.322SerGln: 3.322 ± 0.523
5.026SerArg: 5.026 ± 0.623
4.003SerSer: 4.003 ± 0.858
3.663SerThr: 3.663 ± 0.561
5.451SerVal: 5.451 ± 0.868
1.874SerTrp: 1.874 ± 0.329
0.937SerTyr: 0.937 ± 0.209
0.0SerXaa: 0.0 ± 0.0
Thr
6.559ThrAla: 6.559 ± 0.878
0.596ThrCys: 0.596 ± 0.211
3.833ThrAsp: 3.833 ± 0.478
3.748ThrGlu: 3.748 ± 0.539
2.641ThrPhe: 2.641 ± 0.499
5.792ThrGly: 5.792 ± 0.753
0.681ThrHis: 0.681 ± 0.189
3.492ThrIle: 3.492 ± 0.659
2.981ThrLys: 2.981 ± 0.48
3.748ThrLeu: 3.748 ± 0.521
1.193ThrMet: 1.193 ± 0.248
2.129ThrAsn: 2.129 ± 0.354
2.385ThrPro: 2.385 ± 0.529
2.726ThrGln: 2.726 ± 0.693
2.385ThrArg: 2.385 ± 0.361
4.174ThrSer: 4.174 ± 0.538
3.066ThrThr: 3.066 ± 0.5
4.259ThrVal: 4.259 ± 0.545
1.193ThrTrp: 1.193 ± 0.331
1.789ThrTyr: 1.789 ± 0.467
0.0ThrXaa: 0.0 ± 0.0
Val
5.451ValAla: 5.451 ± 0.655
0.681ValCys: 0.681 ± 0.256
2.981ValAsp: 2.981 ± 0.38
3.663ValGlu: 3.663 ± 0.485
1.789ValPhe: 1.789 ± 0.293
5.026ValGly: 5.026 ± 0.75
1.022ValHis: 1.022 ± 0.264
4.514ValIle: 4.514 ± 0.677
3.237ValLys: 3.237 ± 0.362
5.026ValLeu: 5.026 ± 0.639
2.3ValMet: 2.3 ± 0.37
2.726ValAsn: 2.726 ± 0.498
2.641ValPro: 2.641 ± 0.412
2.385ValGln: 2.385 ± 0.325
3.748ValArg: 3.748 ± 0.5
5.026ValSer: 5.026 ± 0.786
3.578ValThr: 3.578 ± 0.668
5.026ValVal: 5.026 ± 0.637
1.022ValTrp: 1.022 ± 0.308
1.704ValTyr: 1.704 ± 0.345
0.0ValXaa: 0.0 ± 0.0
Trp
1.193TrpAla: 1.193 ± 0.304
0.426TrpCys: 0.426 ± 0.225
1.107TrpAsp: 1.107 ± 0.274
0.767TrpGlu: 0.767 ± 0.232
0.511TrpPhe: 0.511 ± 0.204
1.448TrpGly: 1.448 ± 0.384
0.852TrpHis: 0.852 ± 0.23
0.596TrpIle: 0.596 ± 0.206
1.022TrpLys: 1.022 ± 0.323
2.044TrpLeu: 2.044 ± 0.504
0.596TrpMet: 0.596 ± 0.225
0.852TrpAsn: 0.852 ± 0.318
0.256TrpPro: 0.256 ± 0.103
0.767TrpGln: 0.767 ± 0.23
1.448TrpArg: 1.448 ± 0.368
1.022TrpSer: 1.022 ± 0.301
1.107TrpThr: 1.107 ± 0.36
1.363TrpVal: 1.363 ± 0.29
0.341TrpTrp: 0.341 ± 0.153
0.681TrpTyr: 0.681 ± 0.213
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.896TyrAla: 2.896 ± 0.509
0.341TyrCys: 0.341 ± 0.157
1.533TyrAsp: 1.533 ± 0.297
1.533TyrGlu: 1.533 ± 0.313
0.596TyrPhe: 0.596 ± 0.207
2.215TyrGly: 2.215 ± 0.37
0.511TyrHis: 0.511 ± 0.199
2.129TyrIle: 2.129 ± 0.315
1.022TyrLys: 1.022 ± 0.294
2.129TyrLeu: 2.129 ± 0.428
0.511TyrMet: 0.511 ± 0.238
1.022TyrAsn: 1.022 ± 0.273
1.533TyrPro: 1.533 ± 0.421
1.959TyrGln: 1.959 ± 0.369
2.811TyrArg: 2.811 ± 0.526
2.129TyrSer: 2.129 ± 0.54
1.704TyrThr: 1.704 ± 0.412
1.363TyrVal: 1.363 ± 0.303
0.767TyrTrp: 0.767 ± 0.286
0.596TyrTyr: 0.596 ± 0.245
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (11741 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski