Amino acid dipepetide frequency for Escherichia phage TL-2011c

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.855AlaAla: 8.855 ± 1.019
0.99AlaCys: 0.99 ± 0.306
5.534AlaAsp: 5.534 ± 0.604
8.272AlaGlu: 8.272 ± 0.789
3.728AlaPhe: 3.728 ± 0.598
7.981AlaGly: 7.981 ± 1.383
1.748AlaHis: 1.748 ± 0.375
4.253AlaIle: 4.253 ± 0.649
5.301AlaLys: 5.301 ± 0.607
6.816AlaLeu: 6.816 ± 0.695
2.796AlaMet: 2.796 ± 0.364
2.388AlaAsn: 2.388 ± 0.378
3.321AlaPro: 3.321 ± 0.462
5.301AlaGln: 5.301 ± 0.793
5.767AlaArg: 5.767 ± 0.603
5.534AlaSer: 5.534 ± 0.55
5.942AlaThr: 5.942 ± 0.986
6.816AlaVal: 6.816 ± 0.655
1.456AlaTrp: 1.456 ± 0.311
2.505AlaTyr: 2.505 ± 0.351
0.0AlaXaa: 0.0 ± 0.0
Cys
1.049CysAla: 1.049 ± 0.27
0.233CysCys: 0.233 ± 0.125
0.641CysAsp: 0.641 ± 0.269
0.641CysGlu: 0.641 ± 0.233
0.524CysPhe: 0.524 ± 0.159
1.049CysGly: 1.049 ± 0.373
0.175CysHis: 0.175 ± 0.096
0.583CysIle: 0.583 ± 0.173
0.524CysLys: 0.524 ± 0.216
0.99CysLeu: 0.99 ± 0.247
0.117CysMet: 0.117 ± 0.072
0.466CysAsn: 0.466 ± 0.219
0.641CysPro: 0.641 ± 0.2
0.466CysGln: 0.466 ± 0.185
0.874CysArg: 0.874 ± 0.257
0.932CysSer: 0.932 ± 0.278
0.291CysThr: 0.291 ± 0.145
1.049CysVal: 1.049 ± 0.295
0.175CysTrp: 0.175 ± 0.122
0.35CysTyr: 0.35 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
6.525AspAla: 6.525 ± 0.614
0.641AspCys: 0.641 ± 0.2
3.728AspAsp: 3.728 ± 0.46
4.253AspGlu: 4.253 ± 0.508
1.864AspPhe: 1.864 ± 0.309
4.253AspGly: 4.253 ± 0.528
0.816AspHis: 0.816 ± 0.283
3.262AspIle: 3.262 ± 0.409
3.845AspLys: 3.845 ± 0.423
4.02AspLeu: 4.02 ± 0.446
1.689AspMet: 1.689 ± 0.277
2.505AspAsn: 2.505 ± 0.447
2.272AspPro: 2.272 ± 0.351
1.456AspGln: 1.456 ± 0.348
3.612AspArg: 3.612 ± 0.467
2.68AspSer: 2.68 ± 0.338
3.146AspThr: 3.146 ± 0.525
4.369AspVal: 4.369 ± 0.389
0.699AspTrp: 0.699 ± 0.25
1.689AspTyr: 1.689 ± 0.308
0.0AspXaa: 0.0 ± 0.0
Glu
6.525GluAla: 6.525 ± 0.602
0.932GluCys: 0.932 ± 0.243
2.447GluAsp: 2.447 ± 0.403
4.369GluGlu: 4.369 ± 0.533
2.738GluPhe: 2.738 ± 0.465
4.194GluGly: 4.194 ± 0.537
1.515GluHis: 1.515 ± 0.296
3.554GluIle: 3.554 ± 0.463
4.544GluLys: 4.544 ± 0.536
5.825GluLeu: 5.825 ± 0.56
2.388GluMet: 2.388 ± 0.369
3.029GluAsn: 3.029 ± 0.518
1.864GluPro: 1.864 ± 0.409
3.728GluGln: 3.728 ± 0.533
5.767GluArg: 5.767 ± 0.643
3.67GluSer: 3.67 ± 0.465
4.078GluThr: 4.078 ± 0.743
4.893GluVal: 4.893 ± 0.648
0.641GluTrp: 0.641 ± 0.188
2.214GluTyr: 2.214 ± 0.305
0.0GluXaa: 0.0 ± 0.0
Phe
2.796PheAla: 2.796 ± 0.416
0.699PheCys: 0.699 ± 0.2
2.214PheAsp: 2.214 ± 0.38
1.631PheGlu: 1.631 ± 0.357
1.049PhePhe: 1.049 ± 0.262
2.388PheGly: 2.388 ± 0.263
0.583PheHis: 0.583 ± 0.166
2.039PheIle: 2.039 ± 0.279
1.689PheLys: 1.689 ± 0.294
2.272PheLeu: 2.272 ± 0.294
1.34PheMet: 1.34 ± 0.251
1.864PheAsn: 1.864 ± 0.38
1.34PhePro: 1.34 ± 0.275
0.99PheGln: 0.99 ± 0.261
2.971PheArg: 2.971 ± 0.459
3.612PheSer: 3.612 ± 0.501
2.039PheThr: 2.039 ± 0.282
3.087PheVal: 3.087 ± 0.533
0.408PheTrp: 0.408 ± 0.16
0.874PheTyr: 0.874 ± 0.206
0.0PheXaa: 0.0 ± 0.0
Gly
6.408GlyAla: 6.408 ± 0.931
0.466GlyCys: 0.466 ± 0.17
4.835GlyAsp: 4.835 ± 0.941
6.058GlyGlu: 6.058 ± 1.568
3.087GlyPhe: 3.087 ± 0.458
5.476GlyGly: 5.476 ± 0.743
1.107GlyHis: 1.107 ± 0.331
3.845GlyIle: 3.845 ± 0.562
5.068GlyLys: 5.068 ± 0.84
5.185GlyLeu: 5.185 ± 0.476
2.155GlyMet: 2.155 ± 0.404
2.796GlyAsn: 2.796 ± 0.466
4.194GlyPro: 4.194 ± 2.706
3.087GlyGln: 3.087 ± 0.497
4.66GlyArg: 4.66 ± 0.542
3.961GlySer: 3.961 ± 0.504
4.136GlyThr: 4.136 ± 0.654
5.126GlyVal: 5.126 ± 0.476
0.816GlyTrp: 0.816 ± 0.208
2.621GlyTyr: 2.621 ± 0.373
0.0GlyXaa: 0.0 ± 0.0
His
2.039HisAla: 2.039 ± 0.249
0.175HisCys: 0.175 ± 0.102
0.816HisAsp: 0.816 ± 0.211
0.757HisGlu: 0.757 ± 0.219
0.816HisPhe: 0.816 ± 0.278
1.398HisGly: 1.398 ± 0.376
0.291HisHis: 0.291 ± 0.157
0.932HisIle: 0.932 ± 0.277
0.699HisLys: 0.699 ± 0.203
2.155HisLeu: 2.155 ± 0.518
0.524HisMet: 0.524 ± 0.151
0.874HisAsn: 0.874 ± 0.197
0.932HisPro: 0.932 ± 0.227
0.524HisGln: 0.524 ± 0.186
0.932HisArg: 0.932 ± 0.203
1.34HisSer: 1.34 ± 0.256
0.874HisThr: 0.874 ± 0.202
0.757HisVal: 0.757 ± 0.211
0.291HisTrp: 0.291 ± 0.156
0.699HisTyr: 0.699 ± 0.174
0.0HisXaa: 0.0 ± 0.0
Ile
4.777IleAla: 4.777 ± 0.428
0.874IleCys: 0.874 ± 0.229
3.379IleAsp: 3.379 ± 0.415
3.146IleGlu: 3.146 ± 0.483
0.99IlePhe: 0.99 ± 0.254
2.68IleGly: 2.68 ± 0.48
0.932IleHis: 0.932 ± 0.168
2.621IleIle: 2.621 ± 0.409
2.854IleLys: 2.854 ± 0.456
3.321IleLeu: 3.321 ± 0.593
1.107IleMet: 1.107 ± 0.256
2.913IleAsn: 2.913 ± 0.466
2.447IlePro: 2.447 ± 0.333
1.922IleGln: 1.922 ± 0.323
4.486IleArg: 4.486 ± 0.534
4.311IleSer: 4.311 ± 0.734
3.204IleThr: 3.204 ± 0.53
1.981IleVal: 1.981 ± 0.378
0.175IleTrp: 0.175 ± 0.108
1.223IleTyr: 1.223 ± 0.291
0.0IleXaa: 0.0 ± 0.0
Lys
6.058LysAla: 6.058 ± 0.631
0.757LysCys: 0.757 ± 0.231
2.68LysAsp: 2.68 ± 0.401
3.67LysGlu: 3.67 ± 0.452
1.573LysPhe: 1.573 ± 0.346
5.825LysGly: 5.825 ± 1.256
1.223LysHis: 1.223 ± 0.28
3.146LysIle: 3.146 ± 0.42
3.728LysLys: 3.728 ± 0.578
4.427LysLeu: 4.427 ± 0.561
1.515LysMet: 1.515 ± 0.322
3.321LysAsn: 3.321 ± 0.454
2.505LysPro: 2.505 ± 0.323
2.738LysGln: 2.738 ± 0.426
2.447LysArg: 2.447 ± 0.316
2.621LysSer: 2.621 ± 0.402
3.612LysThr: 3.612 ± 0.454
2.913LysVal: 2.913 ± 0.444
0.583LysTrp: 0.583 ± 0.186
1.398LysTyr: 1.398 ± 0.291
0.0LysXaa: 0.0 ± 0.0
Leu
8.272LeuAla: 8.272 ± 0.792
1.049LeuCys: 1.049 ± 0.295
3.903LeuAsp: 3.903 ± 0.502
4.02LeuGlu: 4.02 ± 0.528
2.68LeuPhe: 2.68 ± 0.444
5.01LeuGly: 5.01 ± 0.534
1.456LeuHis: 1.456 ± 0.28
3.437LeuIle: 3.437 ± 0.544
4.078LeuLys: 4.078 ± 0.527
6.466LeuLeu: 6.466 ± 0.649
2.214LeuMet: 2.214 ± 0.399
4.486LeuAsn: 4.486 ± 0.589
4.311LeuPro: 4.311 ± 0.454
3.321LeuGln: 3.321 ± 0.681
4.719LeuArg: 4.719 ± 0.499
5.825LeuSer: 5.825 ± 0.566
5.359LeuThr: 5.359 ± 0.547
4.835LeuVal: 4.835 ± 0.502
0.291LeuTrp: 0.291 ± 0.143
1.864LeuTyr: 1.864 ± 0.306
0.0LeuXaa: 0.0 ± 0.0
Met
3.029MetAla: 3.029 ± 0.391
0.058MetCys: 0.058 ± 0.047
1.223MetAsp: 1.223 ± 0.238
1.282MetGlu: 1.282 ± 0.287
0.874MetPhe: 0.874 ± 0.198
1.573MetGly: 1.573 ± 0.293
0.408MetHis: 0.408 ± 0.126
1.107MetIle: 1.107 ± 0.337
2.155MetLys: 2.155 ± 0.29
1.631MetLeu: 1.631 ± 0.245
0.757MetMet: 0.757 ± 0.221
2.039MetAsn: 2.039 ± 0.432
1.806MetPro: 1.806 ± 0.31
1.049MetGln: 1.049 ± 0.217
1.689MetArg: 1.689 ± 0.35
1.922MetSer: 1.922 ± 0.287
2.447MetThr: 2.447 ± 0.324
1.223MetVal: 1.223 ± 0.292
0.291MetTrp: 0.291 ± 0.119
0.641MetTyr: 0.641 ± 0.237
0.0MetXaa: 0.0 ± 0.0
Asn
5.359AsnAla: 5.359 ± 0.607
0.524AsnCys: 0.524 ± 0.181
1.981AsnAsp: 1.981 ± 0.349
3.087AsnGlu: 3.087 ± 0.501
1.282AsnPhe: 1.282 ± 0.317
3.903AsnGly: 3.903 ± 0.595
1.223AsnHis: 1.223 ± 0.247
2.447AsnIle: 2.447 ± 0.335
2.33AsnLys: 2.33 ± 0.413
3.146AsnLeu: 3.146 ± 0.418
1.107AsnMet: 1.107 ± 0.255
1.922AsnAsn: 1.922 ± 0.409
1.806AsnPro: 1.806 ± 0.318
1.981AsnGln: 1.981 ± 0.409
2.447AsnArg: 2.447 ± 0.405
2.621AsnSer: 2.621 ± 0.4
1.922AsnThr: 1.922 ± 0.359
2.563AsnVal: 2.563 ± 0.4
0.641AsnTrp: 0.641 ± 0.163
1.223AsnTyr: 1.223 ± 0.317
0.0AsnXaa: 0.0 ± 0.0
Pro
3.495ProAla: 3.495 ± 0.703
0.466ProCys: 0.466 ± 0.201
4.136ProAsp: 4.136 ± 0.608
5.301ProGlu: 5.301 ± 0.819
1.573ProPhe: 1.573 ± 0.283
3.495ProGly: 3.495 ± 0.588
0.35ProHis: 0.35 ± 0.138
0.641ProIle: 0.641 ± 0.224
2.68ProLys: 2.68 ± 0.883
3.146ProLeu: 3.146 ± 0.445
0.641ProMet: 0.641 ± 0.18
0.932ProAsn: 0.932 ± 0.232
1.398ProPro: 1.398 ± 0.27
2.097ProGln: 2.097 ± 0.614
2.505ProArg: 2.505 ± 0.438
2.68ProSer: 2.68 ± 0.417
2.155ProThr: 2.155 ± 0.335
4.777ProVal: 4.777 ± 0.519
0.583ProTrp: 0.583 ± 0.199
1.689ProTyr: 1.689 ± 0.311
0.0ProXaa: 0.0 ± 0.0
Gln
3.961GlnAla: 3.961 ± 0.623
0.932GlnCys: 0.932 ± 0.257
2.505GlnAsp: 2.505 ± 0.364
2.505GlnGlu: 2.505 ± 0.389
1.456GlnPhe: 1.456 ± 0.297
3.379GlnGly: 3.379 ± 0.857
0.99GlnHis: 0.99 ± 0.257
2.33GlnIle: 2.33 ± 0.462
3.146GlnLys: 3.146 ± 0.544
3.728GlnLeu: 3.728 ± 0.455
1.049GlnMet: 1.049 ± 0.314
1.864GlnAsn: 1.864 ± 0.313
2.097GlnPro: 2.097 ± 0.398
3.845GlnGln: 3.845 ± 0.865
2.68GlnArg: 2.68 ± 0.587
2.563GlnSer: 2.563 ± 0.417
1.689GlnThr: 1.689 ± 0.4
2.388GlnVal: 2.388 ± 0.474
0.641GlnTrp: 0.641 ± 0.171
1.107GlnTyr: 1.107 ± 0.277
0.0GlnXaa: 0.0 ± 0.0
Arg
4.194ArgAla: 4.194 ± 0.501
0.408ArgCys: 0.408 ± 0.176
3.903ArgAsp: 3.903 ± 0.686
5.359ArgGlu: 5.359 ± 0.716
3.262ArgPhe: 3.262 ± 0.452
5.185ArgGly: 5.185 ± 1.064
1.631ArgHis: 1.631 ± 0.289
3.495ArgIle: 3.495 ± 0.452
4.427ArgLys: 4.427 ± 0.569
4.777ArgLeu: 4.777 ± 0.464
1.689ArgMet: 1.689 ± 0.287
2.738ArgAsn: 2.738 ± 0.448
2.214ArgPro: 2.214 ± 0.359
2.738ArgGln: 2.738 ± 0.405
5.359ArgArg: 5.359 ± 0.673
3.845ArgSer: 3.845 ± 0.553
3.728ArgThr: 3.728 ± 0.633
3.787ArgVal: 3.787 ± 0.565
1.107ArgTrp: 1.107 ± 0.217
2.214ArgTyr: 2.214 ± 0.488
0.0ArgXaa: 0.0 ± 0.0
Ser
6.058SerAla: 6.058 ± 0.552
0.757SerCys: 0.757 ± 0.272
3.787SerAsp: 3.787 ± 0.462
4.311SerGlu: 4.311 ± 0.442
2.039SerPhe: 2.039 ± 0.338
5.301SerGly: 5.301 ± 0.642
1.049SerHis: 1.049 ± 0.263
2.447SerIle: 2.447 ± 0.374
2.33SerLys: 2.33 ± 0.384
6.058SerLeu: 6.058 ± 0.614
1.806SerMet: 1.806 ± 0.327
2.272SerAsn: 2.272 ± 0.368
3.321SerPro: 3.321 ± 0.469
3.146SerGln: 3.146 ± 0.48
4.369SerArg: 4.369 ± 0.592
2.854SerSer: 2.854 ± 0.586
3.495SerThr: 3.495 ± 0.541
4.078SerVal: 4.078 ± 0.49
0.874SerTrp: 0.874 ± 0.24
1.631SerTyr: 1.631 ± 0.425
0.0SerXaa: 0.0 ± 0.0
Thr
6.058ThrAla: 6.058 ± 0.607
0.35ThrCys: 0.35 ± 0.148
3.146ThrAsp: 3.146 ± 0.42
4.369ThrGlu: 4.369 ± 0.483
2.155ThrPhe: 2.155 ± 0.452
5.942ThrGly: 5.942 ± 0.942
1.049ThrHis: 1.049 ± 0.238
3.495ThrIle: 3.495 ± 0.549
2.272ThrLys: 2.272 ± 0.348
5.418ThrLeu: 5.418 ± 0.471
1.223ThrMet: 1.223 ± 0.264
1.34ThrAsn: 1.34 ± 0.249
3.612ThrPro: 3.612 ± 0.413
1.806ThrGln: 1.806 ± 0.337
2.68ThrArg: 2.68 ± 0.374
3.67ThrSer: 3.67 ± 0.553
3.961ThrThr: 3.961 ± 0.64
4.253ThrVal: 4.253 ± 0.617
0.99ThrTrp: 0.99 ± 0.202
1.165ThrTyr: 1.165 ± 0.342
0.0ThrXaa: 0.0 ± 0.0
Val
6.583ValAla: 6.583 ± 0.584
1.049ValCys: 1.049 ± 0.327
4.369ValAsp: 4.369 ± 0.482
3.845ValGlu: 3.845 ± 0.431
2.039ValPhe: 2.039 ± 0.33
2.854ValGly: 2.854 ± 0.512
0.641ValHis: 0.641 ± 0.185
3.554ValIle: 3.554 ± 0.557
3.554ValLys: 3.554 ± 0.482
5.476ValLeu: 5.476 ± 0.628
1.748ValMet: 1.748 ± 0.32
3.612ValAsn: 3.612 ± 0.51
2.621ValPro: 2.621 ± 0.364
2.33ValGln: 2.33 ± 0.339
4.835ValArg: 4.835 ± 1.007
5.068ValSer: 5.068 ± 0.613
4.893ValThr: 4.893 ± 0.615
4.194ValVal: 4.194 ± 0.58
0.874ValTrp: 0.874 ± 0.187
1.864ValTyr: 1.864 ± 0.326
0.0ValXaa: 0.0 ± 0.0
Trp
0.641TrpAla: 0.641 ± 0.157
0.175TrpCys: 0.175 ± 0.098
0.466TrpAsp: 0.466 ± 0.201
0.816TrpGlu: 0.816 ± 0.21
0.641TrpPhe: 0.641 ± 0.178
0.874TrpGly: 0.874 ± 0.245
0.233TrpHis: 0.233 ± 0.111
0.757TrpIle: 0.757 ± 0.26
0.699TrpLys: 0.699 ± 0.166
0.99TrpLeu: 0.99 ± 0.271
0.583TrpMet: 0.583 ± 0.162
0.466TrpAsn: 0.466 ± 0.19
0.699TrpPro: 0.699 ± 0.217
0.932TrpGln: 0.932 ± 0.211
0.99TrpArg: 0.99 ± 0.28
0.757TrpSer: 0.757 ± 0.214
0.408TrpThr: 0.408 ± 0.17
0.932TrpVal: 0.932 ± 0.211
0.291TrpTrp: 0.291 ± 0.141
0.233TrpTyr: 0.233 ± 0.095
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.621TyrAla: 2.621 ± 0.432
0.233TyrCys: 0.233 ± 0.124
1.631TyrAsp: 1.631 ± 0.319
1.223TyrGlu: 1.223 ± 0.352
1.515TyrPhe: 1.515 ± 0.259
2.33TyrGly: 2.33 ± 0.434
0.408TyrHis: 0.408 ± 0.169
1.398TyrIle: 1.398 ± 0.263
0.816TyrLys: 0.816 ± 0.226
1.806TyrLeu: 1.806 ± 0.358
0.699TyrMet: 0.699 ± 0.208
1.689TyrAsn: 1.689 ± 0.302
1.34TyrPro: 1.34 ± 0.33
1.398TyrGln: 1.398 ± 0.303
2.272TyrArg: 2.272 ± 0.42
1.515TyrSer: 1.515 ± 0.263
1.573TyrThr: 1.573 ± 0.464
1.981TyrVal: 1.981 ± 0.32
0.757TyrTrp: 0.757 ± 0.193
1.107TyrTyr: 1.107 ± 0.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (17167 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski