Amino acid dipeptide frequency for Escherichia phage phiEB49

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
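
As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the one-letter alphabet, the mapping of non-standard residues to `X`, and the normalisation by the total number of pairs are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumption: the input
# sequences use one-letter codes, unlike the three-letter labels in the table).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard residues is treated as Xaa ('X').
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Hypothetical normalisation: divide by the total number of pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


toy_proteins = ["MKAAL", "MAAK"]  # made-up sequences, not from this proteome
freqs = dipeptide_permille(toy_proteins)
print(round(freqs["AA"], 3))
```

The tabulated values on this page appear consistent with division by the total residue count instead (for example, 0.143‰ is approximately 2/14022), so the normalisation used here should be treated as an assumption.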

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
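
The arithmetic behind the expected frequency and the per mille units can be sketched as follows. The helper names are hypothetical, and the exact threshold the page uses for its red/blue marking is not stated, so comparing against the uniform expectation is an assumption.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per mille frequency if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    expected = uniform_expectation_permille(alphabet_size)
    if value_permille > expected:
        return "over"    # would correspond to red on the page
    if value_permille < expected:
        return "under"   # would correspond to blue on the page
    return "as expected"


print(uniform_expectation_permille(20))   # 400 dipeptides
print(uniform_expectation_permille(21))   # 441 dipeptides, Xaa included
print(classify(8.345), classify(0.713))   # AlaAla and AlaCys from the table
```

With 20 amino acids the expectation is 2.5‰, which matches the 0.25% quoted above; with Xaa included it drops to about 2.27‰.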

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.345 ± 1.02
AlaCys: 0.713 ± 0.172
AlaAsp: 4.065 ± 0.606
AlaGlu: 5.848 ± 0.927
AlaPhe: 2.568 ± 0.421
AlaGly: 6.704 ± 0.701
AlaHis: 1.426 ± 0.393
AlaIle: 6.419 ± 0.725
AlaLys: 6.562 ± 1.042
AlaLeu: 6.704 ± 0.896
AlaMet: 3.209 ± 0.618
AlaAsn: 4.065 ± 0.695
AlaPro: 2.282 ± 0.397
AlaGln: 3.495 ± 0.433
AlaArg: 4.422 ± 0.664
AlaSer: 5.706 ± 0.74
AlaThr: 4.565 ± 0.503
AlaVal: 4.493 ± 0.745
AlaTrp: 1.212 ± 0.296
AlaTyr: 2.853 ± 0.45
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.713 ± 0.256
CysCys: 0.143 ± 0.091
CysAsp: 1.07 ± 0.263
CysGlu: 0.927 ± 0.269
CysPhe: 0.285 ± 0.139
CysGly: 0.785 ± 0.277
CysHis: 0.285 ± 0.141
CysIle: 0.785 ± 0.268
CysLys: 1.212 ± 0.316
CysLeu: 1.141 ± 0.324
CysMet: 0.285 ± 0.149
CysAsn: 0.499 ± 0.196
CysPro: 0.357 ± 0.162
CysGln: 0.357 ± 0.167
CysArg: 0.713 ± 0.234
CysSer: 1.07 ± 0.255
CysThr: 0.571 ± 0.179
CysVal: 0.927 ± 0.241
CysTrp: 0.285 ± 0.138
CysTyr: 0.357 ± 0.141
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.707 ± 0.573
AspCys: 0.428 ± 0.155
AspAsp: 3.994 ± 0.641
AspGlu: 4.565 ± 0.615
AspPhe: 2.853 ± 0.509
AspGly: 7.275 ± 0.707
AspHis: 1.212 ± 0.263
AspIle: 4.279 ± 0.537
AspLys: 4.351 ± 0.509
AspLeu: 3.281 ± 0.645
AspMet: 1.355 ± 0.369
AspAsn: 3.209 ± 0.445
AspPro: 1.355 ± 0.251
AspGln: 1.426 ± 0.373
AspArg: 1.783 ± 0.432
AspSer: 4.85 ± 0.522
AspThr: 3.495 ± 0.402
AspVal: 4.493 ± 0.493
AspTrp: 0.785 ± 0.317
AspTyr: 2.924 ± 0.538
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.991 ± 0.807
GluCys: 0.999 ± 0.295
GluAsp: 2.639 ± 0.324
GluGlu: 3.851 ± 0.709
GluPhe: 3.566 ± 0.467
GluGly: 4.422 ± 0.457
GluHis: 0.927 ± 0.291
GluIle: 5.563 ± 0.519
GluLys: 3.566 ± 0.677
GluLeu: 6.348 ± 0.663
GluMet: 2.425 ± 0.478
GluAsn: 3.138 ± 0.555
GluPro: 1.212 ± 0.307
GluGln: 2.282 ± 0.462
GluArg: 2.425 ± 0.458
GluSer: 3.209 ± 0.562
GluThr: 3.352 ± 0.488
GluVal: 4.636 ± 0.695
GluTrp: 0.856 ± 0.246
GluTyr: 2.425 ± 0.536
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.281 ± 0.504
PheCys: 0.856 ± 0.217
PheAsp: 4.065 ± 0.548
PheGlu: 2.639 ± 0.504
PhePhe: 0.927 ± 0.277
PheGly: 3.423 ± 0.514
PheHis: 0.927 ± 0.277
PheIle: 2.282 ± 0.418
PheLys: 2.568 ± 0.413
PheLeu: 2.568 ± 0.432
PheMet: 0.713 ± 0.255
PheAsn: 2.211 ± 0.375
PhePro: 1.07 ± 0.271
PheGln: 1.212 ± 0.278
PheArg: 1.64 ± 0.376
PheSer: 2.71 ± 0.478
PheThr: 2.211 ± 0.29
PheVal: 2.282 ± 0.374
PheTrp: 0.357 ± 0.144
PheTyr: 0.999 ± 0.215
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.493 ± 0.702
GlyCys: 1.355 ± 0.267
GlyAsp: 4.636 ± 0.523
GlyGlu: 4.636 ± 0.549
GlyPhe: 3.067 ± 0.558
GlyGly: 6.348 ± 1.069
GlyHis: 0.999 ± 0.292
GlyIle: 5.206 ± 0.533
GlyLys: 5.349 ± 0.651
GlyLeu: 6.562 ± 0.634
GlyMet: 2.496 ± 0.498
GlyAsn: 3.78 ± 0.465
GlyPro: 0.642 ± 0.198
GlyGln: 1.498 ± 0.4
GlyArg: 3.352 ± 0.414
GlySer: 5.777 ± 0.639
GlyThr: 4.208 ± 0.554
GlyVal: 6.847 ± 0.76
GlyTrp: 0.999 ± 0.267
GlyTyr: 3.994 ± 0.475
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.284 ± 0.389
HisCys: 0.214 ± 0.109
HisAsp: 1.284 ± 0.339
HisGlu: 0.713 ± 0.189
HisPhe: 0.428 ± 0.177
HisGly: 1.07 ± 0.281
HisHis: 0.285 ± 0.185
HisIle: 0.856 ± 0.252
HisLys: 1.712 ± 0.399
HisLeu: 0.927 ± 0.281
HisMet: 0.357 ± 0.136
HisAsn: 0.856 ± 0.297
HisPro: 0.357 ± 0.15
HisGln: 0.357 ± 0.172
HisArg: 1.07 ± 0.266
HisSer: 0.713 ± 0.241
HisThr: 1.07 ± 0.295
HisVal: 0.999 ± 0.285
HisTrp: 0.0 ± 0.0
HisTyr: 0.499 ± 0.154
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.706 ± 0.777
IleCys: 0.927 ± 0.267
IleAsp: 5.349 ± 0.741
IleGlu: 3.923 ± 0.52
IlePhe: 2.14 ± 0.492
IleGly: 3.851 ± 0.58
IleHis: 0.927 ± 0.27
IleIle: 3.851 ± 0.498
IleLys: 4.422 ± 0.642
IleLeu: 2.782 ± 0.454
IleMet: 1.569 ± 0.434
IleAsn: 4.493 ± 0.574
IlePro: 2.782 ± 0.489
IleGln: 2.71 ± 0.503
IleArg: 3.281 ± 0.54
IleSer: 4.208 ± 0.607
IleThr: 4.85 ± 0.705
IleVal: 4.351 ± 0.413
IleTrp: 0.642 ± 0.181
IleTyr: 2.639 ± 0.344
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.275 ± 0.751
LysCys: 0.428 ± 0.216
LysAsp: 3.566 ± 0.501
LysGlu: 4.422 ± 0.806
LysPhe: 2.496 ± 0.441
LysGly: 3.209 ± 0.382
LysHis: 1.07 ± 0.289
LysIle: 3.923 ± 0.543
LysLys: 4.137 ± 0.629
LysLeu: 5.706 ± 0.767
LysMet: 3.138 ± 0.597
LysAsn: 2.71 ± 0.456
LysPro: 1.997 ± 0.396
LysGln: 3.067 ± 0.43
LysArg: 2.782 ± 0.594
LysSer: 4.208 ± 0.545
LysThr: 3.209 ± 0.438
LysVal: 5.135 ± 0.488
LysTrp: 0.856 ± 0.26
LysTyr: 3.138 ± 0.414
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.918 ± 0.759
LeuCys: 1.212 ± 0.282
LeuAsp: 3.923 ± 0.528
LeuGlu: 3.709 ± 0.532
LeuPhe: 2.068 ± 0.378
LeuGly: 5.349 ± 0.692
LeuHis: 0.927 ± 0.221
LeuIle: 4.208 ± 0.509
LeuLys: 4.422 ± 0.508
LeuLeu: 4.493 ± 0.576
LeuMet: 1.569 ± 0.334
LeuAsn: 4.208 ± 0.425
LeuPro: 3.209 ± 0.537
LeuGln: 2.568 ± 0.564
LeuArg: 3.637 ± 0.49
LeuSer: 4.565 ± 0.405
LeuThr: 4.85 ± 0.68
LeuVal: 4.208 ± 0.542
LeuTrp: 0.642 ± 0.279
LeuTyr: 2.14 ± 0.407
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.423 ± 0.571
MetCys: 0.214 ± 0.113
MetAsp: 1.141 ± 0.341
MetGlu: 1.355 ± 0.333
MetPhe: 1.07 ± 0.272
MetGly: 0.856 ± 0.235
MetHis: 0.642 ± 0.19
MetIle: 2.068 ± 0.417
MetLys: 2.282 ± 0.385
MetLeu: 1.355 ± 0.429
MetMet: 1.141 ± 0.373
MetAsn: 1.926 ± 0.438
MetPro: 0.642 ± 0.23
MetGln: 1.212 ± 0.288
MetArg: 1.498 ± 0.368
MetSer: 1.712 ± 0.389
MetThr: 2.425 ± 0.439
MetVal: 1.712 ± 0.322
MetTrp: 0.285 ± 0.143
MetTyr: 0.785 ± 0.283
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.994 ± 0.55
AsnCys: 0.499 ± 0.194
AsnAsp: 3.281 ± 0.506
AsnGlu: 3.851 ± 0.579
AsnPhe: 1.712 ± 0.489
AsnGly: 6.134 ± 0.799
AsnHis: 0.856 ± 0.246
AsnIle: 2.496 ± 0.448
AsnLys: 3.281 ± 0.459
AsnLeu: 3.566 ± 0.504
AsnMet: 1.212 ± 0.316
AsnAsn: 3.495 ± 0.587
AsnPro: 1.854 ± 0.321
AsnGln: 2.211 ± 0.418
AsnArg: 2.425 ± 0.424
AsnSer: 3.851 ± 0.546
AsnThr: 3.138 ± 0.623
AsnVal: 3.566 ± 0.551
AsnTrp: 0.856 ± 0.245
AsnTyr: 1.64 ± 0.336
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.425 ± 0.313
ProCys: 0.428 ± 0.177
ProAsp: 1.569 ± 0.458
ProGlu: 2.853 ± 0.474
ProPhe: 1.64 ± 0.319
ProGly: 2.068 ± 0.336
ProHis: 0.428 ± 0.179
ProIle: 1.783 ± 0.308
ProLys: 1.284 ± 0.268
ProLeu: 1.426 ± 0.305
ProMet: 0.642 ± 0.222
ProAsn: 1.141 ± 0.269
ProPro: 0.856 ± 0.255
ProGln: 1.498 ± 0.352
ProArg: 1.569 ± 0.304
ProSer: 1.783 ± 0.302
ProThr: 1.355 ± 0.311
ProVal: 3.281 ± 0.539
ProTrp: 0.357 ± 0.139
ProTyr: 1.141 ± 0.255
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.994 ± 0.758
GlnCys: 0.499 ± 0.195
GlnAsp: 1.64 ± 0.328
GlnGlu: 3.423 ± 0.545
GlnPhe: 1.355 ± 0.281
GlnGly: 1.926 ± 0.355
GlnHis: 0.214 ± 0.095
GlnIle: 3.138 ± 0.74
GlnLys: 1.783 ± 0.402
GlnLeu: 2.996 ± 0.524
GlnMet: 0.571 ± 0.17
GlnAsn: 1.926 ± 0.461
GlnPro: 1.141 ± 0.258
GlnGln: 2.354 ± 0.682
GlnArg: 1.854 ± 0.463
GlnSer: 2.996 ± 0.412
GlnThr: 1.426 ± 0.356
GlnVal: 1.997 ± 0.252
GlnTrp: 0.428 ± 0.189
GlnTyr: 1.569 ± 0.434
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.923 ± 0.513
ArgCys: 1.07 ± 0.396
ArgAsp: 2.71 ± 0.535
ArgGlu: 3.352 ± 0.424
ArgPhe: 2.068 ± 0.313
ArgGly: 2.568 ± 0.468
ArgHis: 0.571 ± 0.178
ArgIle: 3.067 ± 0.526
ArgLys: 3.423 ± 0.572
ArgLeu: 3.709 ± 0.53
ArgMet: 0.785 ± 0.264
ArgAsn: 2.354 ± 0.48
ArgPro: 1.783 ± 0.335
ArgGln: 2.068 ± 0.515
ArgArg: 2.068 ± 0.418
ArgSer: 2.14 ± 0.459
ArgThr: 2.211 ± 0.362
ArgVal: 3.566 ± 0.442
ArgTrp: 0.785 ± 0.202
ArgTyr: 2.211 ± 0.355
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.42 ± 0.634
SerCys: 0.357 ± 0.138
SerAsp: 4.707 ± 0.446
SerGlu: 4.779 ± 0.458
SerPhe: 2.924 ± 0.49
SerGly: 6.704 ± 0.975
SerHis: 0.642 ± 0.189
SerIle: 3.994 ± 0.601
SerLys: 4.422 ± 0.687
SerLeu: 4.208 ± 0.5
SerMet: 1.712 ± 0.385
SerAsn: 3.566 ± 0.602
SerPro: 1.783 ± 0.406
SerGln: 2.853 ± 0.408
SerArg: 2.425 ± 0.521
SerSer: 5.278 ± 0.901
SerThr: 3.851 ± 0.646
SerVal: 5.278 ± 0.607
SerTrp: 0.499 ± 0.188
SerTyr: 3.352 ± 0.562
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.492 ± 0.919
ThrCys: 0.856 ± 0.309
ThrAsp: 3.495 ± 0.472
ThrGlu: 2.568 ± 0.406
ThrPhe: 2.211 ± 0.354
ThrGly: 5.349 ± 0.682
ThrHis: 0.713 ± 0.238
ThrIle: 3.923 ± 0.494
ThrLys: 2.782 ± 0.505
ThrLeu: 4.422 ± 0.493
ThrMet: 1.212 ± 0.299
ThrAsn: 3.709 ± 0.614
ThrPro: 2.568 ± 0.395
ThrGln: 1.783 ± 0.442
ThrArg: 1.926 ± 0.291
ThrSer: 4.279 ± 0.555
ThrThr: 3.209 ± 0.605
ThrVal: 4.422 ± 0.593
ThrTrp: 0.856 ± 0.237
ThrTyr: 2.068 ± 0.398
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.492 ± 1.025
ValCys: 0.499 ± 0.197
ValAsp: 5.706 ± 0.581
ValGlu: 3.566 ± 0.759
ValPhe: 2.782 ± 0.45
ValGly: 4.422 ± 0.508
ValHis: 0.927 ± 0.263
ValIle: 3.994 ± 0.423
ValLys: 5.848 ± 0.645
ValLeu: 3.637 ± 0.509
ValMet: 1.783 ± 0.294
ValAsn: 3.923 ± 0.599
ValPro: 2.068 ± 0.494
ValGln: 2.496 ± 0.567
ValArg: 3.994 ± 0.661
ValSer: 5.848 ± 0.536
ValThr: 4.636 ± 0.518
ValVal: 6.633 ± 0.693
ValTrp: 0.999 ± 0.225
ValTyr: 2.496 ± 0.5
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.571 ± 0.199
TrpCys: 0.428 ± 0.191
TrpAsp: 0.785 ± 0.188
TrpGlu: 0.428 ± 0.161
TrpPhe: 0.571 ± 0.205
TrpGly: 1.141 ± 0.286
TrpHis: 0.499 ± 0.173
TrpIle: 0.999 ± 0.253
TrpLys: 0.856 ± 0.244
TrpLeu: 0.927 ± 0.248
TrpMet: 0.214 ± 0.118
TrpAsn: 0.642 ± 0.181
TrpPro: 0.357 ± 0.129
TrpGln: 0.357 ± 0.144
TrpArg: 0.785 ± 0.215
TrpSer: 0.927 ± 0.331
TrpThr: 0.642 ± 0.189
TrpVal: 0.927 ± 0.269
TrpTrp: 0.143 ± 0.097
TrpTyr: 0.143 ± 0.088
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.211 ± 0.396
TyrCys: 0.642 ± 0.228
TyrAsp: 2.996 ± 0.51
TyrGlu: 2.425 ± 0.382
TyrPhe: 2.211 ± 0.421
TyrGly: 2.639 ± 0.426
TyrHis: 0.571 ± 0.221
TyrIle: 2.568 ± 0.389
TyrLys: 2.282 ± 0.454
TyrLeu: 2.068 ± 0.368
TyrMet: 1.284 ± 0.325
TyrAsn: 2.068 ± 0.415
TyrPro: 1.212 ± 0.287
TyrGln: 1.426 ± 0.264
TyrArg: 2.782 ± 0.419
TyrSer: 2.924 ± 0.498
TyrThr: 2.568 ± 0.425
TyrVal: 1.997 ± 0.358
TyrTrp: 0.499 ± 0.171
TyrTyr: 1.284 ± 0.38
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (14022 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
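
Bootstrapping at the protein level can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function name, the use of the sample standard deviation as the error, the normalisation by pair count, and the fixed seed are assumptions for this example; the page does not state which estimator Proteome-pI uses.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error_permille(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency (in per mille).

    Proteins, not residues, are the resampling unit, so each replicate
    draws len(proteins) whole sequences with replacement.
    """
    rng = random.Random(seed)
    # Pre-count once per protein: (occurrences of `pair`, number of pairs).
    per_protein = []
    for seq in proteins:
        pairs = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
        per_protein.append((pairs[pair], max(len(seq) - 1, 0)))
    if not per_protein:
        return 0.0
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    # Assumed error measure: standard deviation across replicates.
    return stdev(estimates) if len(estimates) > 1 else 0.0


toy = ["MKAAL", "MAAK", "GGGG"]  # made-up sequences for illustration
print(bootstrap_error_permille(toy, "AA", n_boot=100))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is why the errors in the table reflect variation between proteins.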

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski