Amino acid dipeptide frequency for Escherichia phage DN1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
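As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs in a set of protein sequences and reports each dipeptide frequency in per mille. It is only a sketch under stated assumptions: the function names, the one-letter input format, the mapping of every non-standard letter to X (Xaa), and the use of the uniform 2.5‰ value as the over/under threshold are illustrative choices, not a description of how Proteome-pI computes or colors its table.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: dipeptides are overlapping pairs of adjacent residues,
    counted within each protein (never across protein boundaries).
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq, expected=UNIFORM_EXPECTATION):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy_proteins = ["MAAAK", "MAXK"]
freqs = dipeptide_per_mille(toy_proteins)
print(round(freqs["AA"], 3), classify(freqs["AA"]))
```

Applied to values from the table, `classify(15.274)` labels AlaAla as overrepresented and `classify(0.72)` labels AlaCys as underrepresented relative to the uniform 2.5‰ baseline.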

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.274 ± 2.675
AlaCys: 0.72 ± 0.258
AlaAsp: 6.238 ± 0.529
AlaGlu: 6.477 ± 0.947
AlaPhe: 3.519 ± 0.502
AlaGly: 8.717 ± 1.179
AlaHis: 1.679 ± 0.303
AlaIle: 5.678 ± 0.635
AlaLys: 4.558 ± 0.732
AlaLeu: 9.036 ± 0.857
AlaMet: 3.279 ± 0.447
AlaAsn: 2.639 ± 0.427
AlaPro: 2.959 ± 0.53
AlaGln: 4.798 ± 0.982
AlaArg: 7.117 ± 1.044
AlaSer: 8.317 ± 1.397
AlaThr: 5.918 ± 1.112
AlaVal: 7.357 ± 0.813
AlaTrp: 1.999 ± 0.374
AlaTyr: 2.639 ± 0.472
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.88 ± 0.274
CysCys: 0.24 ± 0.172
CysAsp: 0.48 ± 0.209
CysGlu: 0.56 ± 0.212
CysPhe: 0.4 ± 0.168
CysGly: 0.8 ± 0.273
CysHis: 0.4 ± 0.2
CysIle: 0.56 ± 0.193
CysLys: 0.48 ± 0.251
CysLeu: 0.8 ± 0.237
CysMet: 0.24 ± 0.185
CysAsn: 0.4 ± 0.2
CysPro: 0.32 ± 0.191
CysGln: 0.32 ± 0.133
CysArg: 0.8 ± 0.306
CysSer: 1.12 ± 0.33
CysThr: 0.56 ± 0.248
CysVal: 0.4 ± 0.14
CysTrp: 0.4 ± 0.192
CysTyr: 0.4 ± 0.173
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.637 ± 0.976
AspCys: 0.72 ± 0.196
AspAsp: 3.599 ± 0.456
AspGlu: 3.838 ± 0.57
AspPhe: 1.519 ± 0.286
AspGly: 5.038 ± 0.809
AspHis: 0.56 ± 0.178
AspIle: 2.959 ± 0.409
AspLys: 2.079 ± 0.371
AspLeu: 4.318 ± 0.67
AspMet: 1.599 ± 0.336
AspAsn: 2.399 ± 0.404
AspPro: 2.639 ± 0.702
AspGln: 1.359 ± 0.403
AspArg: 2.799 ± 0.499
AspSer: 2.799 ± 0.498
AspThr: 3.998 ± 0.542
AspVal: 4.158 ± 0.51
AspTrp: 1.2 ± 0.321
AspTyr: 2.079 ± 0.361
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.477 ± 1.019
GluCys: 0.48 ± 0.193
GluAsp: 2.799 ± 0.451
GluGlu: 4.078 ± 0.735
GluPhe: 1.759 ± 0.429
GluGly: 3.599 ± 0.538
GluHis: 1.2 ± 0.391
GluIle: 3.359 ± 0.369
GluLys: 3.199 ± 0.433
GluLeu: 6.078 ± 0.707
GluMet: 1.519 ± 0.355
GluAsn: 3.439 ± 0.466
GluPro: 1.999 ± 0.404
GluGln: 5.518 ± 0.668
GluArg: 3.918 ± 0.683
GluSer: 3.359 ± 0.394
GluThr: 3.758 ± 0.599
GluVal: 3.679 ± 0.521
GluTrp: 0.8 ± 0.225
GluTyr: 1.679 ± 0.279
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.719 ± 0.495
PheCys: 0.72 ± 0.246
PheAsp: 2.799 ± 0.524
PheGlu: 1.439 ± 0.365
PhePhe: 1.279 ± 0.323
PheGly: 2.719 ± 0.396
PheHis: 0.8 ± 0.242
PheIle: 1.279 ± 0.324
PheLys: 1.2 ± 0.28
PheLeu: 2.319 ± 0.451
PheMet: 0.72 ± 0.216
PheAsn: 0.88 ± 0.245
PhePro: 1.04 ± 0.225
PheGln: 0.64 ± 0.178
PheArg: 2.719 ± 0.371
PheSer: 2.719 ± 0.773
PheThr: 2.959 ± 0.462
PheVal: 2.239 ± 0.351
PheTrp: 0.4 ± 0.148
PheTyr: 1.04 ± 0.262
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.078 ± 0.947
GlyCys: 1.04 ± 0.265
GlyAsp: 4.478 ± 0.469
GlyGlu: 4.878 ± 0.608
GlyPhe: 2.239 ± 0.423
GlyGly: 5.758 ± 0.895
GlyHis: 0.72 ± 0.267
GlyIle: 3.519 ± 0.486
GlyLys: 4.078 ± 0.702
GlyLeu: 5.838 ± 0.764
GlyMet: 2.879 ± 0.472
GlyAsn: 3.519 ± 0.54
GlyPro: 1.359 ± 0.246
GlyGln: 2.879 ± 0.473
GlyArg: 5.118 ± 0.426
GlySer: 4.318 ± 0.497
GlyThr: 4.558 ± 0.741
GlyVal: 4.958 ± 0.528
GlyTrp: 1.519 ± 0.347
GlyTyr: 3.199 ± 0.493
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.599 ± 0.45
HisCys: 0.08 ± 0.075
HisAsp: 0.8 ± 0.239
HisGlu: 1.04 ± 0.271
HisPhe: 0.96 ± 0.296
HisGly: 1.519 ± 0.358
HisHis: 0.32 ± 0.15
HisIle: 0.96 ± 0.275
HisLys: 0.8 ± 0.232
HisLeu: 1.599 ± 0.422
HisMet: 0.16 ± 0.119
HisAsn: 1.04 ± 0.289
HisPro: 0.96 ± 0.283
HisGln: 0.56 ± 0.206
HisArg: 1.2 ± 0.274
HisSer: 0.88 ± 0.279
HisThr: 0.56 ± 0.186
HisVal: 1.04 ± 0.203
HisTrp: 0.32 ± 0.151
HisTyr: 1.04 ± 0.315
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.558 ± 0.58
IleCys: 0.56 ± 0.221
IleAsp: 2.959 ± 0.449
IleGlu: 3.439 ± 0.645
IlePhe: 0.88 ± 0.28
IleGly: 3.039 ± 0.638
IleHis: 0.8 ± 0.275
IleIle: 2.159 ± 0.44
IleLys: 1.999 ± 0.421
IleLeu: 3.119 ± 0.437
IleMet: 0.8 ± 0.237
IleAsn: 2.239 ± 0.366
IlePro: 2.239 ± 0.444
IleGln: 1.919 ± 0.348
IleArg: 3.359 ± 0.482
IleSer: 3.279 ± 0.515
IleThr: 3.918 ± 0.636
IleVal: 2.879 ± 0.416
IleTrp: 0.64 ± 0.256
IleTyr: 1.12 ± 0.308
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.798 ± 0.88
LysCys: 0.32 ± 0.2
LysAsp: 2.399 ± 0.554
LysGlu: 3.918 ± 0.436
LysPhe: 1.599 ± 0.31
LysGly: 3.359 ± 0.535
LysHis: 0.88 ± 0.277
LysIle: 1.679 ± 0.304
LysLys: 2.559 ± 0.604
LysLeu: 3.439 ± 0.606
LysMet: 1.04 ± 0.317
LysAsn: 1.759 ± 0.495
LysPro: 2.159 ± 0.402
LysGln: 1.839 ± 0.335
LysArg: 2.879 ± 0.525
LysSer: 3.119 ± 0.476
LysThr: 3.838 ± 0.67
LysVal: 3.279 ± 0.557
LysTrp: 0.8 ± 0.235
LysTyr: 1.359 ± 0.32
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.156 ± 1.123
LeuCys: 0.64 ± 0.21
LeuAsp: 4.638 ± 0.595
LeuGlu: 5.198 ± 0.859
LeuPhe: 2.399 ± 0.504
LeuGly: 4.798 ± 0.695
LeuHis: 1.2 ± 0.38
LeuIle: 3.199 ± 0.568
LeuLys: 4.478 ± 0.69
LeuLeu: 6.477 ± 0.831
LeuMet: 2.399 ± 0.408
LeuAsn: 2.799 ± 0.379
LeuPro: 4.718 ± 0.653
LeuGln: 3.599 ± 0.581
LeuArg: 4.878 ± 0.606
LeuSer: 7.517 ± 0.7
LeuThr: 6.877 ± 0.818
LeuVal: 5.118 ± 0.563
LeuTrp: 1.359 ± 0.33
LeuTyr: 1.359 ± 0.33
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.599 ± 0.61
MetCys: 0.4 ± 0.156
MetAsp: 1.04 ± 0.271
MetGlu: 0.96 ± 0.295
MetPhe: 1.12 ± 0.258
MetGly: 1.519 ± 0.295
MetHis: 0.4 ± 0.219
MetIle: 0.96 ± 0.309
MetLys: 1.359 ± 0.357
MetLeu: 2.399 ± 0.363
MetMet: 0.88 ± 0.281
MetAsn: 1.12 ± 0.286
MetPro: 1.839 ± 0.374
MetGln: 1.2 ± 0.356
MetArg: 1.919 ± 0.349
MetSer: 1.999 ± 0.401
MetThr: 3.039 ± 0.545
MetVal: 2.079 ± 0.33
MetTrp: 0.16 ± 0.102
MetTyr: 0.64 ± 0.229
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.758 ± 0.553
AsnCys: 0.72 ± 0.24
AsnAsp: 1.919 ± 0.31
AsnGlu: 2.559 ± 0.392
AsnPhe: 1.12 ± 0.365
AsnGly: 4.078 ± 0.601
AsnHis: 0.96 ± 0.294
AsnIle: 1.999 ± 0.422
AsnLys: 1.839 ± 0.433
AsnLeu: 3.439 ± 0.641
AsnMet: 1.12 ± 0.327
AsnAsn: 1.359 ± 0.313
AsnPro: 2.079 ± 0.37
AsnGln: 0.8 ± 0.277
AsnArg: 2.239 ± 0.531
AsnSer: 2.639 ± 0.391
AsnThr: 1.919 ± 0.398
AsnVal: 2.399 ± 0.388
AsnTrp: 0.48 ± 0.214
AsnTyr: 0.8 ± 0.282
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.638 ± 0.615
ProCys: 0.08 ± 0.077
ProAsp: 3.439 ± 0.524
ProGlu: 3.039 ± 0.509
ProPhe: 1.12 ± 0.344
ProGly: 3.199 ± 0.386
ProHis: 0.88 ± 0.262
ProIle: 1.279 ± 0.267
ProLys: 2.079 ± 0.425
ProLeu: 3.039 ± 0.469
ProMet: 1.12 ± 0.264
ProAsn: 1.439 ± 0.293
ProPro: 1.839 ± 0.39
ProGln: 2.159 ± 0.401
ProArg: 1.839 ± 0.418
ProSer: 2.479 ± 0.523
ProThr: 2.239 ± 0.359
ProVal: 3.199 ± 0.398
ProTrp: 0.64 ± 0.223
ProTyr: 1.04 ± 0.28
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.118 ± 0.874
GlnCys: 0.4 ± 0.178
GlnAsp: 1.12 ± 0.307
GlnGlu: 2.559 ± 0.481
GlnPhe: 1.2 ± 0.335
GlnGly: 2.479 ± 0.405
GlnHis: 0.72 ± 0.234
GlnIle: 2.559 ± 0.452
GlnLys: 1.919 ± 0.366
GlnLeu: 4.558 ± 0.464
GlnMet: 1.999 ± 0.441
GlnAsn: 1.919 ± 0.394
GlnPro: 1.599 ± 0.341
GlnGln: 3.359 ± 0.66
GlnArg: 3.359 ± 0.566
GlnSer: 3.039 ± 0.566
GlnThr: 2.319 ± 0.532
GlnVal: 3.679 ± 0.609
GlnTrp: 0.32 ± 0.163
GlnTyr: 1.359 ± 0.267
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.838 ± 0.698
ArgCys: 0.8 ± 0.281
ArgAsp: 3.599 ± 0.664
ArgGlu: 4.638 ± 0.674
ArgPhe: 2.239 ± 0.295
ArgGly: 3.998 ± 0.559
ArgHis: 1.759 ± 0.344
ArgIle: 3.599 ± 0.586
ArgLys: 3.279 ± 0.516
ArgLeu: 5.918 ± 0.705
ArgMet: 1.999 ± 0.344
ArgAsn: 2.719 ± 0.451
ArgPro: 2.159 ± 0.409
ArgGln: 3.599 ± 0.482
ArgArg: 5.598 ± 1.016
ArgSer: 2.639 ± 0.372
ArgThr: 2.799 ± 0.478
ArgVal: 3.519 ± 0.682
ArgTrp: 0.96 ± 0.267
ArgTyr: 2.239 ± 0.33
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.036 ± 1.794
SerCys: 0.48 ± 0.263
SerAsp: 3.679 ± 0.564
SerGlu: 4.078 ± 0.643
SerPhe: 1.919 ± 0.4
SerGly: 7.357 ± 0.837
SerHis: 1.04 ± 0.294
SerIle: 1.999 ± 0.347
SerLys: 2.479 ± 0.457
SerLeu: 5.518 ± 0.779
SerMet: 2.159 ± 0.391
SerAsn: 1.759 ± 0.319
SerPro: 2.639 ± 0.473
SerGln: 3.119 ± 0.43
SerArg: 4.878 ± 0.573
SerSer: 3.838 ± 0.681
SerThr: 4.318 ± 0.618
SerVal: 5.358 ± 0.759
SerTrp: 0.64 ± 0.272
SerTyr: 1.359 ± 0.318
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.197 ± 0.907
ThrCys: 0.48 ± 0.173
ThrAsp: 3.918 ± 0.504
ThrGlu: 4.158 ± 0.638
ThrPhe: 3.439 ± 0.556
ThrGly: 4.718 ± 0.611
ThrHis: 1.279 ± 0.293
ThrIle: 2.639 ± 0.544
ThrLys: 2.239 ± 0.42
ThrLeu: 5.998 ± 0.633
ThrMet: 1.279 ± 0.37
ThrAsn: 1.599 ± 0.476
ThrPro: 4.238 ± 0.738
ThrGln: 2.959 ± 0.38
ThrArg: 3.359 ± 0.463
ThrSer: 4.078 ± 0.651
ThrThr: 3.599 ± 0.558
ThrVal: 5.198 ± 0.915
ThrTrp: 0.96 ± 0.241
ThrTyr: 1.679 ± 0.37
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.197 ± 0.722
ValCys: 0.64 ± 0.27
ValAsp: 3.838 ± 0.48
ValGlu: 3.679 ± 0.522
ValPhe: 1.999 ± 0.315
ValGly: 3.679 ± 0.662
ValHis: 0.8 ± 0.221
ValIle: 3.599 ± 0.538
ValLys: 4.478 ± 0.661
ValLeu: 5.678 ± 0.611
ValMet: 2.399 ± 0.448
ValAsn: 3.758 ± 0.504
ValPro: 2.079 ± 0.452
ValGln: 2.959 ± 0.688
ValArg: 3.039 ± 0.624
ValSer: 5.598 ± 0.695
ValThr: 5.278 ± 0.831
ValVal: 5.358 ± 0.642
ValTrp: 0.64 ± 0.227
ValTyr: 1.919 ± 0.438
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.12 ± 0.266
TrpCys: 0.4 ± 0.167
TrpAsp: 1.04 ± 0.343
TrpGlu: 0.56 ± 0.211
TrpPhe: 0.64 ± 0.242
TrpGly: 0.72 ± 0.194
TrpHis: 0.4 ± 0.18
TrpIle: 0.56 ± 0.261
TrpLys: 0.8 ± 0.231
TrpLeu: 1.439 ± 0.399
TrpMet: 0.64 ± 0.191
TrpAsn: 0.56 ± 0.2
TrpPro: 0.48 ± 0.198
TrpGln: 0.72 ± 0.214
TrpArg: 0.96 ± 0.314
TrpSer: 0.96 ± 0.261
TrpThr: 0.88 ± 0.245
TrpVal: 1.279 ± 0.417
TrpTrp: 0.4 ± 0.21
TrpTyr: 0.4 ± 0.185
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.239 ± 0.359
TyrCys: 0.64 ± 0.22
TyrAsp: 1.599 ± 0.311
TyrGlu: 1.599 ± 0.363
TyrPhe: 1.279 ± 0.296
TyrGly: 1.839 ± 0.335
TyrHis: 0.64 ± 0.291
TyrIle: 1.359 ± 0.338
TyrLys: 1.12 ± 0.31
TyrLeu: 2.799 ± 0.543
TyrMet: 0.32 ± 0.161
TyrAsn: 0.88 ± 0.233
TyrPro: 1.439 ± 0.309
TyrGln: 1.279 ± 0.244
TyrArg: 1.919 ± 0.373
TyrSer: 2.959 ± 0.462
TyrThr: 1.679 ± 0.318
TyrVal: 1.439 ± 0.329
TyrTrp: 0.32 ± 0.126
TyrTyr: 1.12 ± 0.354
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (12506 amino acids)

Note: The error has been estimated by bootstrapping (100 resamplings) at the protein level.
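A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is an assumption-laden illustration, not the database's actual code. In particular, the function names are hypothetical, and reporting the sample standard deviation as the ± value is an assumption, since the page does not state which spread measure is used.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the whole protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical one-letter sequences, not the phage proteome).
toy_proteins = ["MAAAK", "MAGK", "AAAA", "MKLV"]
mean, err = bootstrap_error(toy_proteins, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

With only 48 proteins in this proteome, resampling at the protein level can give fairly wide intervals for dipeptides concentrated in a few proteins, which is consistent with the larger errors seen for some of the most frequent pairs.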

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski