Amino acid dipepetide frequency for Escherichia phage HK578

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.336AlaAla: 12.336 ± 1.369
0.808AlaCys: 0.808 ± 0.32
6.094AlaAsp: 6.094 ± 0.84
6.535AlaGlu: 6.535 ± 0.749
3.818AlaPhe: 3.818 ± 0.57
8.224AlaGly: 8.224 ± 0.733
1.983AlaHis: 1.983 ± 0.361
6.094AlaIle: 6.094 ± 0.622
7.122AlaLys: 7.122 ± 0.803
7.343AlaLeu: 7.343 ± 0.72
2.717AlaMet: 2.717 ± 0.449
3.818AlaAsn: 3.818 ± 0.848
3.011AlaPro: 3.011 ± 0.501
4.626AlaGln: 4.626 ± 0.914
6.168AlaArg: 6.168 ± 0.668
5.654AlaSer: 5.654 ± 0.603
5.287AlaThr: 5.287 ± 0.756
7.857AlaVal: 7.857 ± 0.853
1.615AlaTrp: 1.615 ± 0.342
2.717AlaTyr: 2.717 ± 0.388
0.0AlaXaa: 0.0 ± 0.0
Cys
1.028CysAla: 1.028 ± 0.238
0.294CysCys: 0.294 ± 0.146
0.808CysAsp: 0.808 ± 0.275
0.514CysGlu: 0.514 ± 0.188
0.514CysPhe: 0.514 ± 0.235
0.808CysGly: 0.808 ± 0.232
0.22CysHis: 0.22 ± 0.145
0.441CysIle: 0.441 ± 0.194
0.881CysLys: 0.881 ± 0.224
0.881CysLeu: 0.881 ± 0.263
0.22CysMet: 0.22 ± 0.132
0.587CysAsn: 0.587 ± 0.193
0.367CysPro: 0.367 ± 0.165
0.441CysGln: 0.441 ± 0.186
0.587CysArg: 0.587 ± 0.186
0.661CysSer: 0.661 ± 0.247
0.734CysThr: 0.734 ± 0.209
0.514CysVal: 0.514 ± 0.162
0.22CysTrp: 0.22 ± 0.124
0.587CysTyr: 0.587 ± 0.235
0.0CysXaa: 0.0 ± 0.0
Asp
6.535AspAla: 6.535 ± 0.664
0.881AspCys: 0.881 ± 0.247
4.406AspAsp: 4.406 ± 0.91
5.434AspGlu: 5.434 ± 1.208
3.011AspPhe: 3.011 ± 0.427
5.948AspGly: 5.948 ± 0.689
1.028AspHis: 1.028 ± 0.282
2.864AspIle: 2.864 ± 0.425
2.717AspLys: 2.717 ± 0.474
5.066AspLeu: 5.066 ± 0.837
1.836AspMet: 1.836 ± 0.38
2.79AspAsn: 2.79 ± 0.465
3.157AspPro: 3.157 ± 0.537
1.983AspGln: 1.983 ± 0.448
2.79AspArg: 2.79 ± 0.374
2.57AspSer: 2.57 ± 0.437
4.038AspThr: 4.038 ± 0.592
4.406AspVal: 4.406 ± 0.468
1.028AspTrp: 1.028 ± 0.299
1.248AspTyr: 1.248 ± 0.247
0.0AspXaa: 0.0 ± 0.0
Glu
5.801GluAla: 5.801 ± 0.786
0.587GluCys: 0.587 ± 0.22
4.626GluAsp: 4.626 ± 0.839
3.157GluGlu: 3.157 ± 0.496
2.57GluPhe: 2.57 ± 0.442
2.864GluGly: 2.864 ± 0.573
1.175GluHis: 1.175 ± 0.335
5.213GluIle: 5.213 ± 0.576
3.157GluLys: 3.157 ± 0.581
4.993GluLeu: 4.993 ± 0.562
1.836GluMet: 1.836 ± 0.423
2.497GluAsn: 2.497 ± 0.388
1.322GluPro: 1.322 ± 0.397
2.497GluGln: 2.497 ± 0.435
4.038GluArg: 4.038 ± 0.614
2.864GluSer: 2.864 ± 0.491
3.011GluThr: 3.011 ± 0.375
4.626GluVal: 4.626 ± 0.426
1.322GluTrp: 1.322 ± 0.377
3.231GluTyr: 3.231 ± 0.538
0.0GluXaa: 0.0 ± 0.0
Phe
3.818PheAla: 3.818 ± 0.512
0.367PheCys: 0.367 ± 0.135
2.717PheAsp: 2.717 ± 0.421
1.983PheGlu: 1.983 ± 0.371
0.955PhePhe: 0.955 ± 0.234
3.598PheGly: 3.598 ± 0.446
0.661PheHis: 0.661 ± 0.223
1.836PheIle: 1.836 ± 0.392
1.836PheLys: 1.836 ± 0.325
1.983PheLeu: 1.983 ± 0.354
0.514PheMet: 0.514 ± 0.165
1.762PheAsn: 1.762 ± 0.442
1.542PhePro: 1.542 ± 0.39
1.248PheGln: 1.248 ± 0.242
2.423PheArg: 2.423 ± 0.439
2.276PheSer: 2.276 ± 0.414
2.423PheThr: 2.423 ± 0.394
2.643PheVal: 2.643 ± 0.573
0.514PheTrp: 0.514 ± 0.21
1.469PheTyr: 1.469 ± 0.373
0.0PheXaa: 0.0 ± 0.0
Gly
6.535GlyAla: 6.535 ± 0.658
1.028GlyCys: 1.028 ± 0.295
5.066GlyAsp: 5.066 ± 0.777
4.846GlyGlu: 4.846 ± 0.684
3.598GlyPhe: 3.598 ± 0.516
7.122GlyGly: 7.122 ± 0.818
1.615GlyHis: 1.615 ± 0.445
3.965GlyIle: 3.965 ± 0.469
6.241GlyLys: 6.241 ± 0.877
6.902GlyLeu: 6.902 ± 0.863
2.57GlyMet: 2.57 ± 0.443
3.965GlyAsn: 3.965 ± 0.655
2.056GlyPro: 2.056 ± 0.388
2.35GlyGln: 2.35 ± 0.393
4.699GlyArg: 4.699 ± 0.655
3.892GlySer: 3.892 ± 0.619
3.892GlyThr: 3.892 ± 0.461
6.829GlyVal: 6.829 ± 0.759
1.101GlyTrp: 1.101 ± 0.282
1.983GlyTyr: 1.983 ± 0.477
0.0GlyXaa: 0.0 ± 0.0
His
1.248HisAla: 1.248 ± 0.274
0.294HisCys: 0.294 ± 0.158
1.322HisAsp: 1.322 ± 0.39
0.955HisGlu: 0.955 ± 0.252
0.514HisPhe: 0.514 ± 0.204
1.615HisGly: 1.615 ± 0.341
0.734HisHis: 0.734 ± 0.288
1.101HisIle: 1.101 ± 0.283
1.469HisLys: 1.469 ± 0.321
1.175HisLeu: 1.175 ± 0.355
0.441HisMet: 0.441 ± 0.197
0.587HisAsn: 0.587 ± 0.198
1.101HisPro: 1.101 ± 0.287
0.955HisGln: 0.955 ± 0.27
0.587HisArg: 0.587 ± 0.185
0.881HisSer: 0.881 ± 0.258
1.395HisThr: 1.395 ± 0.313
0.808HisVal: 0.808 ± 0.228
0.514HisTrp: 0.514 ± 0.185
0.955HisTyr: 0.955 ± 0.242
0.0HisXaa: 0.0 ± 0.0
Ile
6.315IleAla: 6.315 ± 0.625
0.734IleCys: 0.734 ± 0.195
3.598IleAsp: 3.598 ± 0.403
3.378IleGlu: 3.378 ± 0.413
1.542IlePhe: 1.542 ± 0.4
3.892IleGly: 3.892 ± 0.499
0.955IleHis: 0.955 ± 0.287
2.717IleIle: 2.717 ± 0.523
3.671IleLys: 3.671 ± 0.497
3.084IleLeu: 3.084 ± 0.475
1.322IleMet: 1.322 ± 0.291
3.011IleAsn: 3.011 ± 0.569
2.056IlePro: 2.056 ± 0.505
1.689IleGln: 1.689 ± 0.449
3.011IleArg: 3.011 ± 0.455
2.35IleSer: 2.35 ± 0.423
4.479IleThr: 4.479 ± 0.695
3.818IleVal: 3.818 ± 0.423
0.955IleTrp: 0.955 ± 0.291
1.469IleTyr: 1.469 ± 0.355
0.0IleXaa: 0.0 ± 0.0
Lys
7.049LysAla: 7.049 ± 0.871
0.734LysCys: 0.734 ± 0.212
2.57LysAsp: 2.57 ± 0.443
3.524LysGlu: 3.524 ± 0.582
1.836LysPhe: 1.836 ± 0.343
4.773LysGly: 4.773 ± 0.476
1.028LysHis: 1.028 ± 0.327
2.497LysIle: 2.497 ± 0.515
2.864LysLys: 2.864 ± 0.603
4.552LysLeu: 4.552 ± 0.652
2.497LysMet: 2.497 ± 0.295
2.423LysAsn: 2.423 ± 0.426
2.937LysPro: 2.937 ± 0.531
2.864LysGln: 2.864 ± 0.518
4.038LysArg: 4.038 ± 0.591
3.378LysSer: 3.378 ± 0.702
4.038LysThr: 4.038 ± 0.569
3.965LysVal: 3.965 ± 0.461
1.248LysTrp: 1.248 ± 0.254
1.469LysTyr: 1.469 ± 0.304
0.0LysXaa: 0.0 ± 0.0
Leu
5.727LeuAla: 5.727 ± 0.673
0.734LeuCys: 0.734 ± 0.264
4.846LeuAsp: 4.846 ± 0.5
3.524LeuGlu: 3.524 ± 0.494
2.79LeuPhe: 2.79 ± 0.376
5.213LeuGly: 5.213 ± 0.583
1.469LeuHis: 1.469 ± 0.401
4.552LeuIle: 4.552 ± 0.522
4.846LeuLys: 4.846 ± 0.573
4.92LeuLeu: 4.92 ± 0.561
1.615LeuMet: 1.615 ± 0.331
3.965LeuAsn: 3.965 ± 0.461
3.378LeuPro: 3.378 ± 0.607
2.937LeuGln: 2.937 ± 0.394
4.699LeuArg: 4.699 ± 0.604
4.552LeuSer: 4.552 ± 0.537
4.773LeuThr: 4.773 ± 0.511
5.434LeuVal: 5.434 ± 0.809
0.881LeuTrp: 0.881 ± 0.217
2.276LeuTyr: 2.276 ± 0.348
0.0LeuXaa: 0.0 ± 0.0
Met
3.231MetAla: 3.231 ± 0.479
0.367MetCys: 0.367 ± 0.151
1.395MetAsp: 1.395 ± 0.328
1.615MetGlu: 1.615 ± 0.324
0.881MetPhe: 0.881 ± 0.285
1.762MetGly: 1.762 ± 0.395
0.587MetHis: 0.587 ± 0.174
1.615MetIle: 1.615 ± 0.46
1.909MetLys: 1.909 ± 0.371
1.909MetLeu: 1.909 ± 0.379
0.441MetMet: 0.441 ± 0.191
1.248MetAsn: 1.248 ± 0.353
0.881MetPro: 0.881 ± 0.267
1.175MetGln: 1.175 ± 0.243
1.395MetArg: 1.395 ± 0.338
1.909MetSer: 1.909 ± 0.363
2.056MetThr: 2.056 ± 0.396
1.836MetVal: 1.836 ± 0.4
0.367MetTrp: 0.367 ± 0.143
0.808MetTyr: 0.808 ± 0.28
0.0MetXaa: 0.0 ± 0.0
Asn
4.699AsnAla: 4.699 ± 0.774
0.147AsnCys: 0.147 ± 0.107
2.423AsnAsp: 2.423 ± 0.514
2.423AsnGlu: 2.423 ± 0.446
1.028AsnPhe: 1.028 ± 0.253
4.479AsnGly: 4.479 ± 0.524
1.322AsnHis: 1.322 ± 0.301
1.909AsnIle: 1.909 ± 0.378
1.762AsnLys: 1.762 ± 0.366
2.864AsnLeu: 2.864 ± 0.425
1.322AsnMet: 1.322 ± 0.284
1.836AsnAsn: 1.836 ± 0.391
2.056AsnPro: 2.056 ± 0.454
1.762AsnGln: 1.762 ± 0.324
2.57AsnArg: 2.57 ± 0.407
1.395AsnSer: 1.395 ± 0.315
2.643AsnThr: 2.643 ± 0.413
3.304AsnVal: 3.304 ± 0.45
0.955AsnTrp: 0.955 ± 0.286
1.469AsnTyr: 1.469 ± 0.338
0.0AsnXaa: 0.0 ± 0.0
Pro
4.112ProAla: 4.112 ± 0.571
0.367ProCys: 0.367 ± 0.179
3.598ProAsp: 3.598 ± 0.474
2.276ProGlu: 2.276 ± 0.598
1.175ProPhe: 1.175 ± 0.307
3.598ProGly: 3.598 ± 0.686
0.734ProHis: 0.734 ± 0.268
2.203ProIle: 2.203 ± 0.378
1.836ProLys: 1.836 ± 0.461
2.937ProLeu: 2.937 ± 0.631
0.367ProMet: 0.367 ± 0.147
1.322ProAsn: 1.322 ± 0.288
1.615ProPro: 1.615 ± 0.395
1.028ProGln: 1.028 ± 0.249
2.79ProArg: 2.79 ± 0.5
1.395ProSer: 1.395 ± 0.309
2.643ProThr: 2.643 ± 0.521
3.378ProVal: 3.378 ± 0.418
0.514ProTrp: 0.514 ± 0.197
1.101ProTyr: 1.101 ± 0.302
0.0ProXaa: 0.0 ± 0.0
Gln
4.332GlnAla: 4.332 ± 0.726
0.22GlnCys: 0.22 ± 0.152
1.542GlnAsp: 1.542 ± 0.289
1.542GlnGlu: 1.542 ± 0.364
1.469GlnPhe: 1.469 ± 0.307
2.203GlnGly: 2.203 ± 0.424
1.028GlnHis: 1.028 ± 0.276
3.084GlnIle: 3.084 ± 0.634
1.762GlnLys: 1.762 ± 0.398
3.745GlnLeu: 3.745 ± 0.562
1.322GlnMet: 1.322 ± 0.268
0.881GlnAsn: 0.881 ± 0.254
1.836GlnPro: 1.836 ± 0.284
1.395GlnGln: 1.395 ± 0.443
2.423GlnArg: 2.423 ± 0.471
1.689GlnSer: 1.689 ± 0.375
2.79GlnThr: 2.79 ± 0.443
2.717GlnVal: 2.717 ± 0.472
0.881GlnTrp: 0.881 ± 0.227
0.881GlnTyr: 0.881 ± 0.256
0.0GlnXaa: 0.0 ± 0.0
Arg
6.094ArgAla: 6.094 ± 0.807
0.661ArgCys: 0.661 ± 0.211
3.671ArgAsp: 3.671 ± 0.671
3.524ArgGlu: 3.524 ± 0.61
2.57ArgPhe: 2.57 ± 0.365
3.892ArgGly: 3.892 ± 0.531
1.248ArgHis: 1.248 ± 0.253
3.598ArgIle: 3.598 ± 0.768
4.332ArgLys: 4.332 ± 0.613
4.773ArgLeu: 4.773 ± 0.444
1.469ArgMet: 1.469 ± 0.364
2.276ArgAsn: 2.276 ± 0.425
2.423ArgPro: 2.423 ± 0.511
2.423ArgGln: 2.423 ± 0.439
4.846ArgArg: 4.846 ± 0.789
2.35ArgSer: 2.35 ± 0.4
2.937ArgThr: 2.937 ± 0.458
4.846ArgVal: 4.846 ± 0.555
0.881ArgTrp: 0.881 ± 0.219
1.836ArgTyr: 1.836 ± 0.373
0.0ArgXaa: 0.0 ± 0.0
Ser
3.892SerAla: 3.892 ± 0.57
0.294SerCys: 0.294 ± 0.162
3.011SerAsp: 3.011 ± 0.379
3.745SerGlu: 3.745 ± 0.627
2.056SerPhe: 2.056 ± 0.397
6.241SerGly: 6.241 ± 0.761
0.734SerHis: 0.734 ± 0.248
1.983SerIle: 1.983 ± 0.359
2.864SerLys: 2.864 ± 0.575
3.745SerLeu: 3.745 ± 0.484
1.322SerMet: 1.322 ± 0.282
1.909SerAsn: 1.909 ± 0.355
1.395SerPro: 1.395 ± 0.243
1.469SerGln: 1.469 ± 0.417
1.836SerArg: 1.836 ± 0.365
2.276SerSer: 2.276 ± 0.385
3.524SerThr: 3.524 ± 0.567
3.304SerVal: 3.304 ± 0.488
0.808SerTrp: 0.808 ± 0.218
1.542SerTyr: 1.542 ± 0.33
0.0SerXaa: 0.0 ± 0.0
Thr
7.49ThrAla: 7.49 ± 0.955
0.514ThrCys: 0.514 ± 0.181
3.745ThrAsp: 3.745 ± 0.506
3.818ThrGlu: 3.818 ± 0.576
2.717ThrPhe: 2.717 ± 0.501
5.434ThrGly: 5.434 ± 0.707
0.661ThrHis: 0.661 ± 0.191
3.084ThrIle: 3.084 ± 0.428
3.157ThrLys: 3.157 ± 0.54
4.993ThrLeu: 4.993 ± 0.701
1.983ThrMet: 1.983 ± 0.361
2.57ThrAsn: 2.57 ± 0.408
3.231ThrPro: 3.231 ± 0.478
2.423ThrGln: 2.423 ± 0.434
3.378ThrArg: 3.378 ± 0.467
2.643ThrSer: 2.643 ± 0.429
3.084ThrThr: 3.084 ± 0.636
4.552ThrVal: 4.552 ± 0.567
0.881ThrTrp: 0.881 ± 0.265
1.395ThrTyr: 1.395 ± 0.39
0.0ThrXaa: 0.0 ± 0.0
Val
8.297ValAla: 8.297 ± 0.749
1.028ValCys: 1.028 ± 0.271
4.846ValAsp: 4.846 ± 0.662
5.948ValGlu: 5.948 ± 0.646
1.983ValPhe: 1.983 ± 0.395
4.92ValGly: 4.92 ± 0.78
0.661ValHis: 0.661 ± 0.216
3.671ValIle: 3.671 ± 0.646
4.993ValLys: 4.993 ± 0.697
4.552ValLeu: 4.552 ± 0.501
1.909ValMet: 1.909 ± 0.416
3.084ValAsn: 3.084 ± 0.5
2.423ValPro: 2.423 ± 0.444
2.864ValGln: 2.864 ± 0.431
5.213ValArg: 5.213 ± 0.522
3.084ValSer: 3.084 ± 0.518
4.699ValThr: 4.699 ± 0.691
7.122ValVal: 7.122 ± 0.774
0.881ValTrp: 0.881 ± 0.269
1.909ValTyr: 1.909 ± 0.473
0.0ValXaa: 0.0 ± 0.0
Trp
2.129TrpAla: 2.129 ± 0.537
0.514TrpCys: 0.514 ± 0.181
1.028TrpAsp: 1.028 ± 0.239
0.881TrpGlu: 0.881 ± 0.282
0.661TrpPhe: 0.661 ± 0.237
0.955TrpGly: 0.955 ± 0.258
0.441TrpHis: 0.441 ± 0.165
0.514TrpIle: 0.514 ± 0.198
0.955TrpLys: 0.955 ± 0.236
0.881TrpLeu: 0.881 ± 0.27
0.514TrpMet: 0.514 ± 0.196
0.881TrpAsn: 0.881 ± 0.294
0.808TrpPro: 0.808 ± 0.255
0.587TrpGln: 0.587 ± 0.221
1.175TrpArg: 1.175 ± 0.277
0.661TrpSer: 0.661 ± 0.18
0.808TrpThr: 0.808 ± 0.197
1.175TrpVal: 1.175 ± 0.297
0.514TrpTrp: 0.514 ± 0.177
0.367TrpTyr: 0.367 ± 0.139
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.157TyrAla: 3.157 ± 0.493
0.734TyrCys: 0.734 ± 0.212
2.423TyrAsp: 2.423 ± 0.462
1.762TyrGlu: 1.762 ± 0.377
0.881TyrPhe: 0.881 ± 0.246
2.717TyrGly: 2.717 ± 0.539
0.367TyrHis: 0.367 ± 0.147
1.101TyrIle: 1.101 ± 0.331
1.909TyrLys: 1.909 ± 0.336
1.836TyrLeu: 1.836 ± 0.402
1.101TyrMet: 1.101 ± 0.259
0.955TyrAsn: 0.955 ± 0.297
1.395TyrPro: 1.395 ± 0.447
1.028TyrGln: 1.028 ± 0.342
1.983TyrArg: 1.983 ± 0.456
1.469TyrSer: 1.469 ± 0.362
2.497TyrThr: 2.497 ± 0.418
1.028TyrVal: 1.028 ± 0.282
0.367TyrTrp: 0.367 ± 0.18
0.734TyrTyr: 0.734 ± 0.205
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (13620 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski