Amino acid dipepetide frequency for Escherichia phage EG1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.893AlaAla: 9.893 ± 1.524
1.164AlaCys: 1.164 ± 0.385
6.401AlaAsp: 6.401 ± 1.232
5.071AlaGlu: 5.071 ± 0.504
2.993AlaPhe: 2.993 ± 0.417
6.484AlaGly: 6.484 ± 0.564
1.912AlaHis: 1.912 ± 0.348
5.653AlaIle: 5.653 ± 0.783
6.817AlaLys: 6.817 ± 0.674
8.064AlaLeu: 8.064 ± 0.871
2.743AlaMet: 2.743 ± 0.474
4.157AlaAsn: 4.157 ± 0.597
2.245AlaPro: 2.245 ± 0.384
4.157AlaGln: 4.157 ± 0.811
5.237AlaArg: 5.237 ± 0.551
5.237AlaSer: 5.237 ± 0.759
3.99AlaThr: 3.99 ± 0.754
5.819AlaVal: 5.819 ± 0.55
1.663AlaTrp: 1.663 ± 0.426
2.743AlaTyr: 2.743 ± 0.453
0.0AlaXaa: 0.0 ± 0.0
Cys
0.914CysAla: 0.914 ± 0.357
0.166CysCys: 0.166 ± 0.142
0.998CysAsp: 0.998 ± 0.429
0.499CysGlu: 0.499 ± 0.279
0.665CysPhe: 0.665 ± 0.271
0.831CysGly: 0.831 ± 0.317
0.582CysHis: 0.582 ± 0.262
0.748CysIle: 0.748 ± 0.303
0.582CysLys: 0.582 ± 0.252
0.831CysLeu: 0.831 ± 0.255
0.249CysMet: 0.249 ± 0.143
0.499CysAsn: 0.499 ± 0.214
0.416CysPro: 0.416 ± 0.159
0.416CysGln: 0.416 ± 0.175
0.499CysArg: 0.499 ± 0.224
0.665CysSer: 0.665 ± 0.234
0.249CysThr: 0.249 ± 0.174
0.831CysVal: 0.831 ± 0.343
0.333CysTrp: 0.333 ± 0.186
0.333CysTyr: 0.333 ± 0.171
0.0CysXaa: 0.0 ± 0.0
Asp
5.487AspAla: 5.487 ± 0.899
0.499AspCys: 0.499 ± 0.24
4.323AspAsp: 4.323 ± 0.677
4.655AspGlu: 4.655 ± 0.725
3.076AspPhe: 3.076 ± 0.357
5.653AspGly: 5.653 ± 0.668
0.914AspHis: 0.914 ± 0.281
3.575AspIle: 3.575 ± 0.464
4.24AspLys: 4.24 ± 0.726
4.24AspLeu: 4.24 ± 0.622
2.245AspMet: 2.245 ± 0.451
2.577AspAsn: 2.577 ± 0.448
2.91AspPro: 2.91 ± 0.544
1.496AspGln: 1.496 ± 0.382
3.575AspArg: 3.575 ± 0.855
4.24AspSer: 4.24 ± 0.764
3.575AspThr: 3.575 ± 0.471
5.237AspVal: 5.237 ± 0.573
1.247AspTrp: 1.247 ± 0.415
1.58AspTyr: 1.58 ± 0.337
0.0AspXaa: 0.0 ± 0.0
Glu
7.731GluAla: 7.731 ± 0.988
0.665GluCys: 0.665 ± 0.241
4.822GluAsp: 4.822 ± 0.783
6.983GluGlu: 6.983 ± 1.224
2.245GluPhe: 2.245 ± 0.443
4.572GluGly: 4.572 ± 0.528
1.995GluHis: 1.995 ± 0.369
3.658GluIle: 3.658 ± 0.488
3.492GluLys: 3.492 ± 0.604
5.986GluLeu: 5.986 ± 0.532
2.827GluMet: 2.827 ± 0.55
2.411GluAsn: 2.411 ± 0.485
1.663GluPro: 1.663 ± 0.435
3.159GluGln: 3.159 ± 0.729
3.824GluArg: 3.824 ± 0.645
4.572GluSer: 4.572 ± 0.689
4.073GluThr: 4.073 ± 0.499
4.739GluVal: 4.739 ± 0.698
1.081GluTrp: 1.081 ± 0.291
3.325GluTyr: 3.325 ± 0.623
0.0GluXaa: 0.0 ± 0.0
Phe
2.328PheAla: 2.328 ± 0.373
0.582PheCys: 0.582 ± 0.205
2.827PheAsp: 2.827 ± 0.512
2.577PheGlu: 2.577 ± 0.466
0.831PhePhe: 0.831 ± 0.264
3.242PheGly: 3.242 ± 0.59
0.582PheHis: 0.582 ± 0.264
1.829PheIle: 1.829 ± 0.402
2.411PheLys: 2.411 ± 0.4
2.577PheLeu: 2.577 ± 0.429
1.413PheMet: 1.413 ± 0.32
1.829PheAsn: 1.829 ± 0.381
1.413PhePro: 1.413 ± 0.443
1.164PheGln: 1.164 ± 0.276
1.912PheArg: 1.912 ± 0.351
1.746PheSer: 1.746 ± 0.416
2.411PheThr: 2.411 ± 0.337
2.078PheVal: 2.078 ± 0.44
0.333PheTrp: 0.333 ± 0.155
1.413PheTyr: 1.413 ± 0.395
0.0PheXaa: 0.0 ± 0.0
Gly
6.567GlyAla: 6.567 ± 0.833
0.998GlyCys: 0.998 ± 0.328
4.406GlyAsp: 4.406 ± 0.667
4.905GlyGlu: 4.905 ± 0.627
3.159GlyPhe: 3.159 ± 0.451
5.32GlyGly: 5.32 ± 0.712
1.58GlyHis: 1.58 ± 0.413
4.655GlyIle: 4.655 ± 0.684
5.653GlyLys: 5.653 ± 0.778
5.819GlyLeu: 5.819 ± 0.785
1.912GlyMet: 1.912 ± 0.416
2.66GlyAsn: 2.66 ± 0.552
0.665GlyPro: 0.665 ± 0.233
2.827GlyGln: 2.827 ± 0.496
4.739GlyArg: 4.739 ± 0.565
4.655GlySer: 4.655 ± 0.626
3.492GlyThr: 3.492 ± 0.481
3.907GlyVal: 3.907 ± 0.535
1.912GlyTrp: 1.912 ± 0.494
2.827GlyTyr: 2.827 ± 0.51
0.0GlyXaa: 0.0 ± 0.0
His
1.33HisAla: 1.33 ± 0.376
0.416HisCys: 0.416 ± 0.179
1.413HisAsp: 1.413 ± 0.349
1.663HisGlu: 1.663 ± 0.42
0.665HisPhe: 0.665 ± 0.226
1.081HisGly: 1.081 ± 0.234
0.748HisHis: 0.748 ± 0.212
1.746HisIle: 1.746 ± 0.339
1.247HisLys: 1.247 ± 0.322
2.161HisLeu: 2.161 ± 0.471
0.416HisMet: 0.416 ± 0.213
0.914HisAsn: 0.914 ± 0.32
0.416HisPro: 0.416 ± 0.151
0.166HisGln: 0.166 ± 0.127
0.665HisArg: 0.665 ± 0.162
1.164HisSer: 1.164 ± 0.268
0.831HisThr: 0.831 ± 0.239
1.33HisVal: 1.33 ± 0.293
0.499HisTrp: 0.499 ± 0.216
0.914HisTyr: 0.914 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
4.655IleAla: 4.655 ± 0.705
0.499IleCys: 0.499 ± 0.184
4.073IleAsp: 4.073 ± 0.559
3.741IleGlu: 3.741 ± 0.684
0.831IlePhe: 0.831 ± 0.27
3.907IleGly: 3.907 ± 0.429
1.663IleHis: 1.663 ± 0.436
3.242IleIle: 3.242 ± 0.68
4.323IleLys: 4.323 ± 0.511
3.824IleLeu: 3.824 ± 0.623
1.496IleMet: 1.496 ± 0.34
2.577IleAsn: 2.577 ± 0.578
2.66IlePro: 2.66 ± 0.358
1.58IleGln: 1.58 ± 0.428
3.408IleArg: 3.408 ± 0.537
3.159IleSer: 3.159 ± 0.547
2.743IleThr: 2.743 ± 0.444
3.159IleVal: 3.159 ± 0.412
0.499IleTrp: 0.499 ± 0.197
1.829IleTyr: 1.829 ± 0.284
0.0IleXaa: 0.0 ± 0.0
Lys
7.149LysAla: 7.149 ± 0.993
0.748LysCys: 0.748 ± 0.353
3.824LysAsp: 3.824 ± 0.47
4.572LysGlu: 4.572 ± 0.627
2.494LysPhe: 2.494 ± 0.365
5.32LysGly: 5.32 ± 0.688
1.496LysHis: 1.496 ± 0.371
2.411LysIle: 2.411 ± 0.419
4.489LysLys: 4.489 ± 0.921
4.739LysLeu: 4.739 ± 0.749
2.411LysMet: 2.411 ± 0.514
2.494LysAsn: 2.494 ± 0.361
2.577LysPro: 2.577 ± 0.543
3.408LysGln: 3.408 ± 0.671
4.073LysArg: 4.073 ± 0.519
3.907LysSer: 3.907 ± 0.554
2.494LysThr: 2.494 ± 0.392
4.739LysVal: 4.739 ± 0.598
0.582LysTrp: 0.582 ± 0.191
2.411LysTyr: 2.411 ± 0.384
0.0LysXaa: 0.0 ± 0.0
Leu
8.48LeuAla: 8.48 ± 1.097
0.333LeuCys: 0.333 ± 0.173
4.323LeuAsp: 4.323 ± 0.565
6.235LeuGlu: 6.235 ± 0.954
2.078LeuPhe: 2.078 ± 0.458
4.406LeuGly: 4.406 ± 0.705
0.914LeuHis: 0.914 ± 0.25
3.741LeuIle: 3.741 ± 0.546
5.237LeuLys: 5.237 ± 0.692
4.406LeuLeu: 4.406 ± 0.841
2.66LeuMet: 2.66 ± 0.493
4.157LeuAsn: 4.157 ± 0.46
3.159LeuPro: 3.159 ± 0.591
4.073LeuGln: 4.073 ± 0.562
5.819LeuArg: 5.819 ± 0.62
4.739LeuSer: 4.739 ± 0.57
4.655LeuThr: 4.655 ± 0.645
3.907LeuVal: 3.907 ± 0.531
1.247LeuTrp: 1.247 ± 0.446
2.827LeuTyr: 2.827 ± 0.502
0.0LeuXaa: 0.0 ± 0.0
Met
3.076MetAla: 3.076 ± 0.474
0.166MetCys: 0.166 ± 0.123
1.663MetAsp: 1.663 ± 0.354
2.411MetGlu: 2.411 ± 0.487
1.164MetPhe: 1.164 ± 0.389
1.995MetGly: 1.995 ± 0.33
0.665MetHis: 0.665 ± 0.326
1.58MetIle: 1.58 ± 0.35
0.998MetLys: 0.998 ± 0.258
3.492MetLeu: 3.492 ± 0.563
0.665MetMet: 0.665 ± 0.245
0.914MetAsn: 0.914 ± 0.232
1.247MetPro: 1.247 ± 0.302
1.081MetGln: 1.081 ± 0.277
1.995MetArg: 1.995 ± 0.478
1.58MetSer: 1.58 ± 0.391
1.829MetThr: 1.829 ± 0.426
2.411MetVal: 2.411 ± 0.425
0.166MetTrp: 0.166 ± 0.151
0.748MetTyr: 0.748 ± 0.272
0.0MetXaa: 0.0 ± 0.0
Asn
3.907AsnAla: 3.907 ± 0.492
0.748AsnCys: 0.748 ± 0.284
2.743AsnAsp: 2.743 ± 0.482
2.827AsnGlu: 2.827 ± 0.51
1.995AsnPhe: 1.995 ± 0.405
4.489AsnGly: 4.489 ± 0.634
0.748AsnHis: 0.748 ± 0.287
2.66AsnIle: 2.66 ± 0.377
2.66AsnLys: 2.66 ± 0.428
3.159AsnLeu: 3.159 ± 0.552
1.081AsnMet: 1.081 ± 0.347
2.411AsnAsn: 2.411 ± 0.709
2.494AsnPro: 2.494 ± 0.423
1.247AsnGln: 1.247 ± 0.292
2.577AsnArg: 2.577 ± 0.582
1.829AsnSer: 1.829 ± 0.388
2.245AsnThr: 2.245 ± 0.631
2.66AsnVal: 2.66 ± 0.432
0.333AsnTrp: 0.333 ± 0.143
1.746AsnTyr: 1.746 ± 0.371
0.0AsnXaa: 0.0 ± 0.0
Pro
2.743ProAla: 2.743 ± 0.363
0.499ProCys: 0.499 ± 0.21
2.411ProAsp: 2.411 ± 0.364
3.907ProGlu: 3.907 ± 0.696
1.247ProPhe: 1.247 ± 0.296
0.831ProGly: 0.831 ± 0.269
0.582ProHis: 0.582 ± 0.18
1.496ProIle: 1.496 ± 0.352
2.328ProLys: 2.328 ± 0.522
2.577ProLeu: 2.577 ± 0.539
1.33ProMet: 1.33 ± 0.336
2.411ProAsn: 2.411 ± 0.569
0.748ProPro: 0.748 ± 0.198
1.081ProGln: 1.081 ± 0.246
1.995ProArg: 1.995 ± 0.49
2.245ProSer: 2.245 ± 0.458
1.995ProThr: 1.995 ± 0.345
1.829ProVal: 1.829 ± 0.365
0.665ProTrp: 0.665 ± 0.199
1.496ProTyr: 1.496 ± 0.395
0.0ProXaa: 0.0 ± 0.0
Gln
4.24GlnAla: 4.24 ± 0.717
0.083GlnCys: 0.083 ± 0.093
1.58GlnAsp: 1.58 ± 0.356
3.076GlnGlu: 3.076 ± 0.577
1.829GlnPhe: 1.829 ± 0.316
1.912GlnGly: 1.912 ± 0.358
0.416GlnHis: 0.416 ± 0.228
1.663GlnIle: 1.663 ± 0.296
2.078GlnLys: 2.078 ± 0.589
3.907GlnLeu: 3.907 ± 0.451
0.914GlnMet: 0.914 ± 0.251
1.164GlnAsn: 1.164 ± 0.322
1.496GlnPro: 1.496 ± 0.409
1.663GlnGln: 1.663 ± 0.339
2.078GlnArg: 2.078 ± 0.506
2.078GlnSer: 2.078 ± 0.445
1.663GlnThr: 1.663 ± 0.373
2.328GlnVal: 2.328 ± 0.455
1.413GlnTrp: 1.413 ± 0.285
1.33GlnTyr: 1.33 ± 0.456
0.0GlnXaa: 0.0 ± 0.0
Arg
5.57ArgAla: 5.57 ± 0.782
1.164ArgCys: 1.164 ± 0.393
4.406ArgAsp: 4.406 ± 0.58
5.071ArgGlu: 5.071 ± 0.608
2.245ArgPhe: 2.245 ± 0.43
3.492ArgGly: 3.492 ± 0.499
0.748ArgHis: 0.748 ± 0.249
2.827ArgIle: 2.827 ± 0.394
4.406ArgLys: 4.406 ± 0.71
4.655ArgLeu: 4.655 ± 0.544
1.58ArgMet: 1.58 ± 0.412
2.411ArgAsn: 2.411 ± 0.529
1.58ArgPro: 1.58 ± 0.251
1.746ArgGln: 1.746 ± 0.407
2.993ArgArg: 2.993 ± 0.611
4.655ArgSer: 4.655 ± 0.81
2.577ArgThr: 2.577 ± 0.43
3.325ArgVal: 3.325 ± 0.686
0.748ArgTrp: 0.748 ± 0.254
1.496ArgTyr: 1.496 ± 0.384
0.0ArgXaa: 0.0 ± 0.0
Ser
4.822SerAla: 4.822 ± 0.609
0.748SerCys: 0.748 ± 0.275
5.154SerAsp: 5.154 ± 0.646
3.408SerGlu: 3.408 ± 0.462
2.577SerPhe: 2.577 ± 0.504
5.986SerGly: 5.986 ± 0.834
1.247SerHis: 1.247 ± 0.259
2.993SerIle: 2.993 ± 0.556
3.824SerLys: 3.824 ± 0.652
3.824SerLeu: 3.824 ± 0.508
1.58SerMet: 1.58 ± 0.368
2.993SerAsn: 2.993 ± 0.584
2.078SerPro: 2.078 ± 0.555
1.746SerGln: 1.746 ± 0.291
2.993SerArg: 2.993 ± 0.471
3.824SerSer: 3.824 ± 0.781
2.827SerThr: 2.827 ± 0.44
4.572SerVal: 4.572 ± 0.605
0.582SerTrp: 0.582 ± 0.153
2.494SerTyr: 2.494 ± 0.416
0.0SerXaa: 0.0 ± 0.0
Thr
4.24ThrAla: 4.24 ± 0.512
0.665ThrCys: 0.665 ± 0.327
3.824ThrAsp: 3.824 ± 0.505
3.325ThrGlu: 3.325 ± 0.598
1.663ThrPhe: 1.663 ± 0.334
4.822ThrGly: 4.822 ± 0.608
1.164ThrHis: 1.164 ± 0.312
3.076ThrIle: 3.076 ± 0.696
4.572ThrLys: 4.572 ± 0.446
4.157ThrLeu: 4.157 ± 0.577
1.247ThrMet: 1.247 ± 0.299
1.912ThrAsn: 1.912 ± 0.36
2.245ThrPro: 2.245 ± 0.406
1.995ThrGln: 1.995 ± 0.418
2.66ThrArg: 2.66 ± 0.446
2.993ThrSer: 2.993 ± 0.626
3.076ThrThr: 3.076 ± 0.516
3.325ThrVal: 3.325 ± 0.495
0.416ThrTrp: 0.416 ± 0.14
1.413ThrTyr: 1.413 ± 0.25
0.0ThrXaa: 0.0 ± 0.0
Val
5.57ValAla: 5.57 ± 0.641
0.416ValCys: 0.416 ± 0.196
2.91ValAsp: 2.91 ± 0.407
5.071ValGlu: 5.071 ± 0.611
2.328ValPhe: 2.328 ± 0.546
4.822ValGly: 4.822 ± 0.651
0.998ValHis: 0.998 ± 0.291
3.492ValIle: 3.492 ± 0.577
3.658ValLys: 3.658 ± 0.708
4.406ValLeu: 4.406 ± 0.476
1.746ValMet: 1.746 ± 0.428
2.827ValAsn: 2.827 ± 0.552
2.993ValPro: 2.993 ± 0.46
1.995ValGln: 1.995 ± 0.446
4.24ValArg: 4.24 ± 0.488
3.99ValSer: 3.99 ± 0.518
5.237ValThr: 5.237 ± 0.657
4.24ValVal: 4.24 ± 0.671
1.247ValTrp: 1.247 ± 0.353
2.411ValTyr: 2.411 ± 0.617
0.0ValXaa: 0.0 ± 0.0
Trp
0.998TrpAla: 0.998 ± 0.296
0.499TrpCys: 0.499 ± 0.242
0.665TrpAsp: 0.665 ± 0.212
0.998TrpGlu: 0.998 ± 0.26
0.582TrpPhe: 0.582 ± 0.261
0.748TrpGly: 0.748 ± 0.242
0.416TrpHis: 0.416 ± 0.261
0.665TrpIle: 0.665 ± 0.271
1.58TrpLys: 1.58 ± 0.341
1.663TrpLeu: 1.663 ± 0.402
0.166TrpMet: 0.166 ± 0.099
1.496TrpAsn: 1.496 ± 0.327
0.249TrpPro: 0.249 ± 0.127
0.333TrpGln: 0.333 ± 0.144
0.831TrpArg: 0.831 ± 0.283
0.914TrpSer: 0.914 ± 0.317
0.831TrpThr: 0.831 ± 0.265
1.663TrpVal: 1.663 ± 0.41
0.499TrpTrp: 0.499 ± 0.18
0.083TrpTyr: 0.083 ± 0.083
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.743TyrAla: 2.743 ± 0.517
0.249TyrCys: 0.249 ± 0.151
2.577TyrAsp: 2.577 ± 0.669
2.827TyrGlu: 2.827 ± 0.475
0.831TyrPhe: 0.831 ± 0.263
2.743TyrGly: 2.743 ± 0.407
0.416TyrHis: 0.416 ± 0.202
2.245TyrIle: 2.245 ± 0.471
1.746TyrLys: 1.746 ± 0.32
2.91TyrLeu: 2.91 ± 0.47
1.081TyrMet: 1.081 ± 0.278
1.746TyrAsn: 1.746 ± 0.372
1.164TyrPro: 1.164 ± 0.339
1.58TyrGln: 1.58 ± 0.471
1.58TyrArg: 1.58 ± 0.342
2.078TyrSer: 2.078 ± 0.552
1.995TyrThr: 1.995 ± 0.402
2.494TyrVal: 2.494 ± 0.514
0.416TyrTrp: 0.416 ± 0.283
1.164TyrTyr: 1.164 ± 0.378
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (12030 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski