Amino acid dipeptide frequency for Escherichia phage vB_EcoS-26047II

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
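As an illustration of how such frequencies can be computed, here is a minimal Python sketch. The function name `dipeptide_permille` and the toy sequences are hypothetical, and normalizing by the total number of overlapping pairs is an assumption; the exact normalization used for the table on this page is not stated.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown residue to X (Xaa)
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of pairs counted
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not taken from this proteome)
freqs = dipeptide_permille(["MKKLA", "MKA"])
```

With the toy input, the pair MK occurs twice among six pairs in total, so its frequency is one third of 1000‰.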

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
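The uniform expectation and the over/under marking can be sketched as follows. The function names are hypothetical, and the exact threshold used for coloring on this page (20 versus 21 residue types, or any tolerance band) is an assumption.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation per dipeptide, in per mille (1000 / alphabet_size^2)."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide as over- or underrepresented relative to the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # would be marked red
    if observed_permille < expected:
        return "under"   # would be marked blue
    return "expected"

ala_ala = classify(8.669)  # AlaAla value from the table
cys_cys = classify(0.155)  # CysCys value from the table
```

With 20 residue types the expectation is 2.5‰, so AlaAla (8.669‰) is classified as overrepresented and CysCys (0.155‰) as underrepresented. With Xaa included, the expectation drops to 1000/441, roughly 2.27‰.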

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
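For readers who want the unit conversion spelled out, a small sketch follows. The helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala_fraction = permille_to_fraction(8.669)  # AlaAla value from the table
uniform_percent = permille_to_percent(2.5)      # uniform expectation for 400 dipeptides
```

Here 8.669‰ corresponds to a fraction of about 0.008669, and 2.5‰ corresponds to 0.25%.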

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.669 ± 1.173
AlaCys: 0.697 ± 0.239
AlaAsp: 4.257 ± 0.574
AlaGlu: 5.65 ± 0.705
AlaPhe: 3.638 ± 0.486
AlaGly: 5.882 ± 0.742
AlaHis: 1.006 ± 0.342
AlaIle: 6.656 ± 0.656
AlaLys: 6.192 ± 1.207
AlaLeu: 7.817 ± 1.11
AlaMet: 2.012 ± 0.444
AlaAsn: 4.412 ± 0.605
AlaPro: 2.245 ± 0.424
AlaGln: 3.483 ± 0.654
AlaArg: 4.954 ± 0.629
AlaSer: 5.882 ± 0.869
AlaThr: 4.334 ± 0.6
AlaVal: 4.954 ± 0.59
AlaTrp: 0.929 ± 0.307
AlaTyr: 2.09 ± 0.382
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.542 ± 0.277
CysCys: 0.155 ± 0.105
CysAsp: 0.851 ± 0.273
CysGlu: 0.929 ± 0.284
CysPhe: 0.31 ± 0.161
CysGly: 1.393 ± 0.371
CysHis: 0.232 ± 0.13
CysIle: 0.31 ± 0.179
CysLys: 1.238 ± 0.322
CysLeu: 1.006 ± 0.263
CysMet: 0.31 ± 0.154
CysAsn: 0.387 ± 0.203
CysPro: 0.387 ± 0.176
CysGln: 0.077 ± 0.075
CysArg: 0.774 ± 0.255
CysSer: 1.084 ± 0.299
CysThr: 1.006 ± 0.294
CysVal: 0.851 ± 0.305
CysTrp: 0.31 ± 0.155
CysTyr: 0.155 ± 0.095
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.876 ± 0.765
AspCys: 0.542 ± 0.196
AspAsp: 3.019 ± 0.614
AspGlu: 4.644 ± 0.669
AspPhe: 2.012 ± 0.331
AspGly: 6.734 ± 0.823
AspHis: 0.851 ± 0.363
AspIle: 4.18 ± 0.395
AspLys: 4.644 ± 0.472
AspLeu: 4.334 ± 0.517
AspMet: 1.548 ± 0.351
AspAsn: 2.786 ± 0.483
AspPro: 1.703 ± 0.313
AspGln: 1.316 ± 0.309
AspArg: 2.09 ± 0.469
AspSer: 2.709 ± 0.484
AspThr: 3.483 ± 0.591
AspVal: 3.793 ± 0.518
AspTrp: 1.238 ± 0.255
AspTyr: 3.251 ± 0.466
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.805 ± 0.628
GluCys: 0.851 ± 0.316
GluAsp: 2.864 ± 0.427
GluGlu: 4.025 ± 0.646
GluPhe: 3.173 ± 0.587
GluGly: 3.947 ± 0.509
GluHis: 0.387 ± 0.162
GluIle: 5.728 ± 0.53
GluLys: 3.483 ± 0.747
GluLeu: 6.192 ± 0.863
GluMet: 2.941 ± 0.618
GluAsn: 3.173 ± 0.516
GluPro: 1.935 ± 0.42
GluGln: 2.941 ± 0.658
GluArg: 2.941 ± 0.507
GluSer: 4.799 ± 0.667
GluThr: 3.406 ± 0.432
GluVal: 4.954 ± 0.522
GluTrp: 0.464 ± 0.174
GluTyr: 2.477 ± 0.353
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.245 ± 0.395
PheCys: 0.697 ± 0.192
PheAsp: 3.328 ± 0.455
PheGlu: 2.554 ± 0.513
PhePhe: 0.851 ± 0.275
PheGly: 3.483 ± 0.507
PheHis: 0.774 ± 0.261
PheIle: 2.399 ± 0.374
PheLys: 2.477 ± 0.518
PheLeu: 1.858 ± 0.478
PheMet: 0.929 ± 0.247
PheAsn: 2.322 ± 0.391
PhePro: 1.084 ± 0.246
PheGln: 2.012 ± 0.369
PheArg: 1.703 ± 0.452
PheSer: 2.554 ± 0.439
PheThr: 2.477 ± 0.394
PheVal: 2.477 ± 0.44
PheTrp: 0.387 ± 0.168
PheTyr: 0.697 ± 0.21
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.721 ± 0.685
GlyCys: 1.471 ± 0.327
GlyAsp: 4.025 ± 0.463
GlyGlu: 4.799 ± 0.689
GlyPhe: 2.399 ± 0.52
GlyGly: 5.65 ± 1.005
GlyHis: 0.851 ± 0.381
GlyIle: 6.115 ± 0.536
GlyLys: 5.65 ± 0.784
GlyLeu: 7.121 ± 0.736
GlyMet: 2.245 ± 0.561
GlyAsn: 3.173 ± 0.463
GlyPro: 0.774 ± 0.226
GlyGln: 2.322 ± 0.372
GlyArg: 2.864 ± 0.381
GlySer: 5.573 ± 0.796
GlyThr: 4.412 ± 0.81
GlyVal: 6.269 ± 0.667
GlyTrp: 1.006 ± 0.26
GlyTyr: 4.025 ± 0.575
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.851 ± 0.338
HisCys: 0.31 ± 0.186
HisAsp: 0.774 ± 0.254
HisGlu: 0.774 ± 0.244
HisPhe: 0.697 ± 0.209
HisGly: 0.774 ± 0.358
HisHis: 0.387 ± 0.222
HisIle: 1.316 ± 0.385
HisLys: 1.006 ± 0.287
HisLeu: 1.084 ± 0.284
HisMet: 0.31 ± 0.174
HisAsn: 0.387 ± 0.172
HisPro: 0.232 ± 0.175
HisGln: 0.387 ± 0.158
HisArg: 0.774 ± 0.23
HisSer: 0.542 ± 0.266
HisThr: 0.774 ± 0.268
HisVal: 0.697 ± 0.204
HisTrp: 0.077 ± 0.073
HisTyr: 0.464 ± 0.206
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.347 ± 1.016
IleCys: 0.464 ± 0.178
IleAsp: 5.341 ± 0.512
IleGlu: 3.793 ± 0.421
IlePhe: 2.012 ± 0.366
IleGly: 4.489 ± 0.595
IleHis: 1.084 ± 0.298
IleIle: 3.715 ± 0.586
IleLys: 5.108 ± 0.632
IleLeu: 3.483 ± 0.481
IleMet: 1.858 ± 0.483
IleAsn: 4.721 ± 0.76
IlePro: 3.019 ± 0.488
IleGln: 2.554 ± 0.554
IleArg: 3.87 ± 0.474
IleSer: 5.96 ± 0.766
IleThr: 5.263 ± 0.524
IleVal: 3.483 ± 0.421
IleTrp: 0.851 ± 0.208
IleTyr: 3.173 ± 0.619
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.656 ± 0.876
LysCys: 0.542 ± 0.206
LysAsp: 4.257 ± 0.603
LysGlu: 5.418 ± 0.885
LysPhe: 2.477 ± 0.435
LysGly: 2.941 ± 0.556
LysHis: 0.697 ± 0.206
LysIle: 3.715 ± 0.445
LysLys: 3.406 ± 0.491
LysLeu: 4.799 ± 0.729
LysMet: 2.399 ± 0.5
LysAsn: 2.941 ± 0.542
LysPro: 2.09 ± 0.392
LysGln: 2.864 ± 0.517
LysArg: 2.786 ± 0.524
LysSer: 4.567 ± 0.695
LysThr: 3.173 ± 0.439
LysVal: 4.876 ± 0.674
LysTrp: 0.774 ± 0.273
LysTyr: 3.715 ± 0.477
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.734 ± 0.842
LeuCys: 0.851 ± 0.197
LeuAsp: 4.799 ± 0.531
LeuGlu: 4.257 ± 0.708
LeuPhe: 1.625 ± 0.28
LeuGly: 4.954 ± 0.793
LeuHis: 1.084 ± 0.395
LeuIle: 4.954 ± 0.64
LeuLys: 3.947 ± 0.574
LeuLeu: 4.257 ± 0.505
LeuMet: 1.471 ± 0.304
LeuAsn: 3.483 ± 0.54
LeuPro: 2.709 ± 0.412
LeuGln: 2.786 ± 0.895
LeuArg: 3.87 ± 0.619
LeuSer: 6.192 ± 0.722
LeuThr: 4.799 ± 0.662
LeuVal: 5.805 ± 0.435
LeuTrp: 0.31 ± 0.174
LeuTyr: 2.167 ± 0.498
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.483 ± 0.459
MetCys: 0.232 ± 0.126
MetAsp: 1.316 ± 0.353
MetGlu: 0.851 ± 0.258
MetPhe: 1.161 ± 0.329
MetGly: 0.774 ± 0.251
MetHis: 0.232 ± 0.127
MetIle: 2.012 ± 0.321
MetLys: 1.625 ± 0.371
MetLeu: 2.012 ± 0.503
MetMet: 0.851 ± 0.339
MetAsn: 1.703 ± 0.423
MetPro: 0.619 ± 0.197
MetGln: 1.084 ± 0.288
MetArg: 1.471 ± 0.318
MetSer: 1.625 ± 0.344
MetThr: 2.09 ± 0.389
MetVal: 1.393 ± 0.312
MetTrp: 0.387 ± 0.178
MetTyr: 0.464 ± 0.169
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.257 ± 0.571
AsnCys: 0.542 ± 0.213
AsnAsp: 2.786 ± 0.332
AsnGlu: 3.56 ± 0.668
AsnPhe: 2.322 ± 0.428
AsnGly: 6.269 ± 1.166
AsnHis: 0.619 ± 0.226
AsnIle: 2.786 ± 0.434
AsnLys: 2.941 ± 0.456
AsnLeu: 3.947 ± 0.57
AsnMet: 1.161 ± 0.283
AsnAsn: 2.864 ± 0.48
AsnPro: 1.858 ± 0.347
AsnGln: 2.554 ± 0.594
AsnArg: 2.245 ± 0.283
AsnSer: 3.947 ± 0.46
AsnThr: 2.012 ± 0.438
AsnVal: 3.483 ± 0.443
AsnTrp: 0.542 ± 0.15
AsnTyr: 1.471 ± 0.368
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.786 ± 0.328
ProCys: 0.31 ± 0.194
ProAsp: 2.167 ± 0.509
ProGlu: 3.019 ± 0.437
ProPhe: 1.471 ± 0.269
ProGly: 1.858 ± 0.372
ProHis: 0.464 ± 0.169
ProIle: 2.012 ± 0.375
ProLys: 1.238 ± 0.335
ProLeu: 2.012 ± 0.327
ProMet: 0.464 ± 0.181
ProAsn: 1.238 ± 0.258
ProPro: 0.464 ± 0.188
ProGln: 1.548 ± 0.402
ProArg: 1.471 ± 0.338
ProSer: 1.238 ± 0.235
ProThr: 1.625 ± 0.357
ProVal: 3.173 ± 0.519
ProTrp: 0.464 ± 0.2
ProTyr: 1.471 ± 0.372
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.489 ± 1.0
GlnCys: 0.697 ± 0.308
GlnAsp: 1.78 ± 0.379
GlnGlu: 2.864 ± 0.554
GlnPhe: 1.625 ± 0.348
GlnGly: 2.554 ± 0.421
GlnHis: 0.31 ± 0.145
GlnIle: 3.483 ± 0.637
GlnLys: 2.09 ± 0.345
GlnLeu: 2.786 ± 0.652
GlnMet: 0.929 ± 0.293
GlnAsn: 1.548 ± 0.443
GlnPro: 1.161 ± 0.373
GlnGln: 2.399 ± 0.825
GlnArg: 1.935 ± 0.436
GlnSer: 2.786 ± 0.53
GlnThr: 1.625 ± 0.392
GlnVal: 2.399 ± 0.444
GlnTrp: 0.542 ± 0.225
GlnTyr: 1.238 ± 0.289
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.412 ± 0.638
ArgCys: 0.774 ± 0.356
ArgAsp: 2.167 ± 0.35
ArgGlu: 3.715 ± 0.531
ArgPhe: 2.554 ± 0.28
ArgGly: 2.941 ± 0.387
ArgHis: 0.464 ± 0.199
ArgIle: 4.025 ± 0.497
ArgLys: 4.876 ± 0.541
ArgLeu: 3.406 ± 0.421
ArgMet: 1.316 ± 0.358
ArgAsn: 2.012 ± 0.443
ArgPro: 1.393 ± 0.39
ArgGln: 1.703 ± 0.432
ArgArg: 2.864 ± 0.498
ArgSer: 2.709 ± 0.409
ArgThr: 1.78 ± 0.431
ArgVal: 3.87 ± 0.603
ArgTrp: 0.464 ± 0.178
ArgTyr: 2.09 ± 0.412
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.341 ± 0.859
SerCys: 0.697 ± 0.29
SerAsp: 5.186 ± 0.661
SerGlu: 5.341 ± 0.551
SerPhe: 2.477 ± 0.465
SerGly: 7.198 ± 1.001
SerHis: 1.161 ± 0.309
SerIle: 4.18 ± 0.541
SerLys: 3.947 ± 0.695
SerLeu: 4.644 ± 0.64
SerMet: 1.548 ± 0.371
SerAsn: 4.18 ± 0.827
SerPro: 2.399 ± 0.385
SerGln: 2.941 ± 0.535
SerArg: 3.328 ± 0.509
SerSer: 4.721 ± 0.748
SerThr: 3.638 ± 0.548
SerVal: 5.186 ± 0.611
SerTrp: 0.697 ± 0.237
SerTyr: 2.245 ± 0.457
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.186 ± 0.738
ThrCys: 0.542 ± 0.182
ThrAsp: 2.864 ± 0.395
ThrGlu: 2.632 ± 0.332
ThrPhe: 2.399 ± 0.419
ThrGly: 6.424 ± 0.834
ThrHis: 0.464 ± 0.172
ThrIle: 4.334 ± 0.615
ThrLys: 2.864 ± 0.51
ThrLeu: 3.251 ± 0.362
ThrMet: 0.851 ± 0.201
ThrAsn: 3.638 ± 0.551
ThrPro: 2.245 ± 0.369
ThrGln: 2.399 ± 0.431
ThrArg: 2.786 ± 0.368
ThrSer: 3.56 ± 0.477
ThrThr: 3.173 ± 0.623
ThrVal: 3.793 ± 0.561
ThrTrp: 0.387 ± 0.176
ThrTyr: 2.632 ± 0.484
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.341 ± 0.604
ValCys: 1.161 ± 0.384
ValAsp: 4.799 ± 0.557
ValGlu: 4.412 ± 0.791
ValPhe: 2.477 ± 0.34
ValGly: 3.638 ± 0.683
ValHis: 0.851 ± 0.25
ValIle: 4.954 ± 0.684
ValLys: 5.495 ± 0.725
ValLeu: 3.715 ± 0.514
ValMet: 1.316 ± 0.401
ValAsn: 4.18 ± 0.535
ValPro: 2.09 ± 0.49
ValGln: 2.167 ± 0.691
ValArg: 3.947 ± 0.515
ValSer: 6.192 ± 0.59
ValThr: 4.489 ± 0.564
ValVal: 5.65 ± 0.89
ValTrp: 0.851 ± 0.231
ValTyr: 2.399 ± 0.411
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.542 ± 0.208
TrpCys: 0.31 ± 0.178
TrpAsp: 0.697 ± 0.167
TrpGlu: 0.774 ± 0.24
TrpPhe: 0.619 ± 0.269
TrpGly: 0.851 ± 0.226
TrpHis: 0.232 ± 0.106
TrpIle: 1.006 ± 0.319
TrpLys: 1.006 ± 0.231
TrpLeu: 0.619 ± 0.174
TrpMet: 0.232 ± 0.124
TrpAsn: 0.542 ± 0.176
TrpPro: 0.464 ± 0.211
TrpGln: 0.464 ± 0.173
TrpArg: 0.697 ± 0.222
TrpSer: 0.774 ± 0.357
TrpThr: 0.464 ± 0.192
TrpVal: 0.542 ± 0.188
TrpTrp: 0.077 ± 0.066
TrpTyr: 0.31 ± 0.137
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.477 ± 0.462
TyrCys: 0.464 ± 0.173
TyrAsp: 2.477 ± 0.511
TyrGlu: 2.477 ± 0.467
TyrPhe: 1.006 ± 0.284
TyrGly: 2.709 ± 0.407
TyrHis: 0.464 ± 0.246
TyrIle: 2.709 ± 0.466
TyrLys: 1.935 ± 0.447
TyrLeu: 2.709 ± 0.365
TyrMet: 0.619 ± 0.202
TyrAsn: 2.477 ± 0.366
TyrPro: 1.625 ± 0.293
TyrGln: 1.238 ± 0.407
TyrArg: 2.167 ± 0.396
TyrSer: 3.638 ± 0.799
TyrThr: 2.399 ± 0.344
TyrVal: 2.477 ± 0.426
TyrTrp: 0.464 ± 0.19
TyrTyr: 1.393 ± 0.335
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (12921 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
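A protein-level bootstrap of this kind could look roughly like the sketch below. The function names and toy sequences are hypothetical. Resampling whole proteins with replacement and reporting the standard deviation of the resampled estimates are assumptions, since the page states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille of all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed for reproducibility
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates
    return statistics.stdev(estimates)

# Toy input (not taken from this proteome)
toy = ["MKKLA", "MKA", "AAAA"]
mk_error = bootstrap_error(toy, "MK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.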

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski