Amino acid dipepetide frequency for Escherichia phage vB_EcoS_HSE2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.933AlaAla: 11.933 ± 1.456
1.288AlaCys: 1.288 ± 0.358
6.439AlaAsp: 6.439 ± 1.015
8.156AlaGlu: 8.156 ± 0.756
4.035AlaPhe: 4.035 ± 0.548
7.984AlaGly: 7.984 ± 1.144
1.288AlaHis: 1.288 ± 0.375
5.838AlaIle: 5.838 ± 0.77
6.353AlaLys: 6.353 ± 0.901
7.984AlaLeu: 7.984 ± 0.914
1.717AlaMet: 1.717 ± 0.4
3.863AlaAsn: 3.863 ± 0.57
3.606AlaPro: 3.606 ± 0.479
3.606AlaGln: 3.606 ± 0.646
4.55AlaArg: 4.55 ± 0.644
6.181AlaSer: 6.181 ± 0.953
6.868AlaThr: 6.868 ± 0.841
7.04AlaVal: 7.04 ± 0.724
1.116AlaTrp: 1.116 ± 0.273
3.863AlaTyr: 3.863 ± 0.581
0.0AlaXaa: 0.0 ± 0.0
Cys
0.601CysAla: 0.601 ± 0.244
0.258CysCys: 0.258 ± 0.168
1.03CysAsp: 1.03 ± 0.342
1.03CysGlu: 1.03 ± 0.404
0.343CysPhe: 0.343 ± 0.201
1.202CysGly: 1.202 ± 0.338
0.258CysHis: 0.258 ± 0.127
0.429CysIle: 0.429 ± 0.19
0.515CysLys: 0.515 ± 0.26
0.859CysLeu: 0.859 ± 0.31
0.172CysMet: 0.172 ± 0.122
0.343CysAsn: 0.343 ± 0.177
0.515CysPro: 0.515 ± 0.197
0.515CysGln: 0.515 ± 0.225
0.944CysArg: 0.944 ± 0.275
0.601CysSer: 0.601 ± 0.235
0.515CysThr: 0.515 ± 0.246
0.859CysVal: 0.859 ± 0.274
0.258CysTrp: 0.258 ± 0.138
0.429CysTyr: 0.429 ± 0.152
0.0CysXaa: 0.0 ± 0.0
Asp
6.525AspAla: 6.525 ± 0.801
0.343AspCys: 0.343 ± 0.165
4.636AspAsp: 4.636 ± 0.636
4.378AspGlu: 4.378 ± 0.679
2.919AspPhe: 2.919 ± 0.5
4.378AspGly: 4.378 ± 0.644
1.03AspHis: 1.03 ± 0.298
4.035AspIle: 4.035 ± 0.502
2.661AspLys: 2.661 ± 0.386
4.55AspLeu: 4.55 ± 0.588
1.803AspMet: 1.803 ± 0.29
1.717AspAsn: 1.717 ± 0.386
1.803AspPro: 1.803 ± 0.411
0.859AspGln: 0.859 ± 0.329
2.49AspArg: 2.49 ± 0.485
2.232AspSer: 2.232 ± 0.399
3.863AspThr: 3.863 ± 0.456
4.293AspVal: 4.293 ± 0.661
0.944AspTrp: 0.944 ± 0.267
1.975AspTyr: 1.975 ± 0.392
0.0AspXaa: 0.0 ± 0.0
Glu
6.267GluAla: 6.267 ± 0.663
0.258GluCys: 0.258 ± 0.157
2.919GluAsp: 2.919 ± 0.434
4.979GluGlu: 4.979 ± 0.936
2.919GluPhe: 2.919 ± 0.581
4.293GluGly: 4.293 ± 0.665
1.288GluHis: 1.288 ± 0.331
2.833GluIle: 2.833 ± 0.563
3.692GluLys: 3.692 ± 0.685
6.267GluLeu: 6.267 ± 0.721
2.576GluMet: 2.576 ± 0.424
2.576GluAsn: 2.576 ± 0.529
1.889GluPro: 1.889 ± 0.44
3.52GluGln: 3.52 ± 0.966
3.434GluArg: 3.434 ± 0.637
3.177GluSer: 3.177 ± 0.533
3.434GluThr: 3.434 ± 0.639
5.237GluVal: 5.237 ± 0.607
0.859GluTrp: 0.859 ± 0.346
2.146GluTyr: 2.146 ± 0.488
0.0GluXaa: 0.0 ± 0.0
Phe
2.404PheAla: 2.404 ± 0.519
0.258PheCys: 0.258 ± 0.206
2.661PheAsp: 2.661 ± 0.481
2.576PheGlu: 2.576 ± 0.41
0.944PhePhe: 0.944 ± 0.24
3.262PheGly: 3.262 ± 0.503
0.601PheHis: 0.601 ± 0.255
2.06PheIle: 2.06 ± 0.478
2.49PheLys: 2.49 ± 0.419
1.545PheLeu: 1.545 ± 0.368
0.429PheMet: 0.429 ± 0.199
1.545PheAsn: 1.545 ± 0.451
1.288PhePro: 1.288 ± 0.313
1.717PheGln: 1.717 ± 0.423
2.318PheArg: 2.318 ± 0.431
3.434PheSer: 3.434 ± 0.619
3.177PheThr: 3.177 ± 0.418
2.06PheVal: 2.06 ± 0.416
0.601PheTrp: 0.601 ± 0.249
1.374PheTyr: 1.374 ± 0.322
0.0PheXaa: 0.0 ± 0.0
Gly
8.499GlyAla: 8.499 ± 1.144
1.202GlyCys: 1.202 ± 0.329
4.464GlyAsp: 4.464 ± 0.635
5.323GlyGlu: 5.323 ± 0.708
3.091GlyPhe: 3.091 ± 0.518
6.01GlyGly: 6.01 ± 0.685
1.374GlyHis: 1.374 ± 0.429
2.576GlyIle: 2.576 ± 0.526
5.838GlyLys: 5.838 ± 0.683
4.722GlyLeu: 4.722 ± 0.596
2.404GlyMet: 2.404 ± 0.499
3.005GlyAsn: 3.005 ± 0.473
1.717GlyPro: 1.717 ± 0.315
2.49GlyGln: 2.49 ± 0.643
3.777GlyArg: 3.777 ± 0.598
5.495GlySer: 5.495 ± 0.752
4.207GlyThr: 4.207 ± 0.756
5.323GlyVal: 5.323 ± 0.808
1.202GlyTrp: 1.202 ± 0.252
2.576GlyTyr: 2.576 ± 0.417
0.0GlyXaa: 0.0 ± 0.0
His
1.116HisAla: 1.116 ± 0.274
0.258HisCys: 0.258 ± 0.15
0.773HisAsp: 0.773 ± 0.261
1.116HisGlu: 1.116 ± 0.429
0.601HisPhe: 0.601 ± 0.224
1.202HisGly: 1.202 ± 0.347
0.687HisHis: 0.687 ± 0.268
0.773HisIle: 0.773 ± 0.213
1.116HisLys: 1.116 ± 0.325
1.717HisLeu: 1.717 ± 0.42
0.258HisMet: 0.258 ± 0.139
0.859HisAsn: 0.859 ± 0.262
1.03HisPro: 1.03 ± 0.352
0.773HisGln: 0.773 ± 0.252
1.288HisArg: 1.288 ± 0.324
0.687HisSer: 0.687 ± 0.247
0.859HisThr: 0.859 ± 0.309
1.631HisVal: 1.631 ± 0.431
0.086HisTrp: 0.086 ± 0.089
0.343HisTyr: 0.343 ± 0.147
0.0HisXaa: 0.0 ± 0.0
Ile
4.979IleAla: 4.979 ± 0.742
0.687IleCys: 0.687 ± 0.219
3.434IleAsp: 3.434 ± 0.59
3.005IleGlu: 3.005 ± 0.52
1.116IlePhe: 1.116 ± 0.316
2.49IleGly: 2.49 ± 0.423
0.601IleHis: 0.601 ± 0.208
2.576IleIle: 2.576 ± 0.472
2.919IleLys: 2.919 ± 0.585
3.52IleLeu: 3.52 ± 0.535
1.03IleMet: 1.03 ± 0.307
3.606IleAsn: 3.606 ± 0.472
2.747IlePro: 2.747 ± 0.434
2.232IleGln: 2.232 ± 0.424
2.318IleArg: 2.318 ± 0.33
4.035IleSer: 4.035 ± 0.632
3.777IleThr: 3.777 ± 0.602
3.606IleVal: 3.606 ± 0.482
1.374IleTrp: 1.374 ± 0.335
0.859IleTyr: 0.859 ± 0.282
0.0IleXaa: 0.0 ± 0.0
Lys
6.353LysAla: 6.353 ± 0.758
0.773LysCys: 0.773 ± 0.209
3.005LysAsp: 3.005 ± 0.649
3.606LysGlu: 3.606 ± 0.565
1.975LysPhe: 1.975 ± 0.352
3.692LysGly: 3.692 ± 0.488
1.116LysHis: 1.116 ± 0.324
1.717LysIle: 1.717 ± 0.531
3.262LysLys: 3.262 ± 0.561
5.409LysLeu: 5.409 ± 0.819
2.576LysMet: 2.576 ± 0.489
2.232LysAsn: 2.232 ± 0.381
2.232LysPro: 2.232 ± 0.535
2.232LysGln: 2.232 ± 0.439
3.863LysArg: 3.863 ± 0.556
3.005LysSer: 3.005 ± 0.456
4.207LysThr: 4.207 ± 0.577
3.52LysVal: 3.52 ± 0.634
0.944LysTrp: 0.944 ± 0.302
3.005LysTyr: 3.005 ± 0.516
0.0LysXaa: 0.0 ± 0.0
Leu
7.727LeuAla: 7.727 ± 0.885
0.944LeuCys: 0.944 ± 0.329
4.378LeuAsp: 4.378 ± 0.597
4.636LeuGlu: 4.636 ± 0.889
2.919LeuPhe: 2.919 ± 0.607
5.237LeuGly: 5.237 ± 0.579
1.03LeuHis: 1.03 ± 0.373
4.378LeuIle: 4.378 ± 0.528
4.808LeuLys: 4.808 ± 0.678
6.954LeuLeu: 6.954 ± 0.786
1.459LeuMet: 1.459 ± 0.29
4.035LeuAsn: 4.035 ± 0.688
3.434LeuPro: 3.434 ± 0.625
2.833LeuGln: 2.833 ± 0.573
4.979LeuArg: 4.979 ± 0.701
5.752LeuSer: 5.752 ± 0.634
5.323LeuThr: 5.323 ± 0.859
4.55LeuVal: 4.55 ± 0.592
0.944LeuTrp: 0.944 ± 0.261
2.232LeuTyr: 2.232 ± 0.362
0.0LeuXaa: 0.0 ± 0.0
Met
3.005MetAla: 3.005 ± 0.537
0.343MetCys: 0.343 ± 0.158
0.944MetAsp: 0.944 ± 0.399
1.202MetGlu: 1.202 ± 0.38
0.515MetPhe: 0.515 ± 0.187
1.717MetGly: 1.717 ± 0.343
0.086MetHis: 0.086 ± 0.089
1.545MetIle: 1.545 ± 0.349
1.889MetLys: 1.889 ± 0.391
1.889MetLeu: 1.889 ± 0.348
0.601MetMet: 0.601 ± 0.255
0.429MetAsn: 0.429 ± 0.2
1.631MetPro: 1.631 ± 0.425
0.601MetGln: 0.601 ± 0.214
1.288MetArg: 1.288 ± 0.232
2.318MetSer: 2.318 ± 0.402
1.803MetThr: 1.803 ± 0.465
1.803MetVal: 1.803 ± 0.401
0.515MetTrp: 0.515 ± 0.174
0.343MetTyr: 0.343 ± 0.189
0.0MetXaa: 0.0 ± 0.0
Asn
4.55AsnAla: 4.55 ± 0.705
0.601AsnCys: 0.601 ± 0.213
2.146AsnAsp: 2.146 ± 0.376
2.576AsnGlu: 2.576 ± 0.429
0.859AsnPhe: 0.859 ± 0.222
3.091AsnGly: 3.091 ± 0.615
0.944AsnHis: 0.944 ± 0.296
2.318AsnIle: 2.318 ± 0.512
1.975AsnLys: 1.975 ± 0.417
3.262AsnLeu: 3.262 ± 0.44
0.773AsnMet: 0.773 ± 0.291
1.717AsnAsn: 1.717 ± 0.419
1.889AsnPro: 1.889 ± 0.367
1.631AsnGln: 1.631 ± 0.417
1.631AsnArg: 1.631 ± 0.381
2.576AsnSer: 2.576 ± 0.408
3.348AsnThr: 3.348 ± 0.494
3.177AsnVal: 3.177 ± 0.421
0.687AsnTrp: 0.687 ± 0.219
1.288AsnTyr: 1.288 ± 0.28
0.0AsnXaa: 0.0 ± 0.0
Pro
3.692ProAla: 3.692 ± 0.597
0.515ProCys: 0.515 ± 0.22
2.747ProAsp: 2.747 ± 0.514
3.692ProGlu: 3.692 ± 0.507
1.459ProPhe: 1.459 ± 0.4
3.177ProGly: 3.177 ± 0.629
0.773ProHis: 0.773 ± 0.236
1.717ProIle: 1.717 ± 0.326
1.545ProLys: 1.545 ± 0.416
3.005ProLeu: 3.005 ± 0.565
0.773ProMet: 0.773 ± 0.254
1.545ProAsn: 1.545 ± 0.324
1.545ProPro: 1.545 ± 0.4
1.116ProGln: 1.116 ± 0.281
1.717ProArg: 1.717 ± 0.431
2.747ProSer: 2.747 ± 0.398
2.318ProThr: 2.318 ± 0.503
4.121ProVal: 4.121 ± 0.587
0.258ProTrp: 0.258 ± 0.15
0.944ProTyr: 0.944 ± 0.277
0.0ProXaa: 0.0 ± 0.0
Gln
3.949GlnAla: 3.949 ± 0.643
0.258GlnCys: 0.258 ± 0.14
1.545GlnAsp: 1.545 ± 0.396
2.318GlnGlu: 2.318 ± 0.578
1.202GlnPhe: 1.202 ± 0.286
2.49GlnGly: 2.49 ± 0.495
0.859GlnHis: 0.859 ± 0.314
2.318GlnIle: 2.318 ± 0.388
2.318GlnLys: 2.318 ± 0.516
3.348GlnLeu: 3.348 ± 0.632
1.202GlnMet: 1.202 ± 0.293
1.459GlnAsn: 1.459 ± 0.455
1.717GlnPro: 1.717 ± 0.3
2.576GlnGln: 2.576 ± 0.639
2.404GlnArg: 2.404 ± 0.589
1.889GlnSer: 1.889 ± 0.398
1.803GlnThr: 1.803 ± 0.461
3.005GlnVal: 3.005 ± 0.511
0.687GlnTrp: 0.687 ± 0.231
1.459GlnTyr: 1.459 ± 0.405
0.0GlnXaa: 0.0 ± 0.0
Arg
4.722ArgAla: 4.722 ± 0.693
0.601ArgCys: 0.601 ± 0.226
2.661ArgAsp: 2.661 ± 0.462
3.177ArgGlu: 3.177 ± 0.523
2.06ArgPhe: 2.06 ± 0.429
4.293ArgGly: 4.293 ± 0.544
1.116ArgHis: 1.116 ± 0.318
2.833ArgIle: 2.833 ± 0.522
3.005ArgLys: 3.005 ± 0.571
4.293ArgLeu: 4.293 ± 0.451
1.545ArgMet: 1.545 ± 0.355
2.318ArgAsn: 2.318 ± 0.427
1.803ArgPro: 1.803 ± 0.418
3.177ArgGln: 3.177 ± 0.587
4.808ArgArg: 4.808 ± 0.865
3.262ArgSer: 3.262 ± 0.371
2.919ArgThr: 2.919 ± 0.488
4.035ArgVal: 4.035 ± 0.53
0.687ArgTrp: 0.687 ± 0.273
1.889ArgTyr: 1.889 ± 0.393
0.0ArgXaa: 0.0 ± 0.0
Ser
6.353SerAla: 6.353 ± 0.776
0.859SerCys: 0.859 ± 0.261
3.777SerAsp: 3.777 ± 0.549
2.404SerGlu: 2.404 ± 0.418
2.576SerPhe: 2.576 ± 0.414
6.525SerGly: 6.525 ± 0.714
0.944SerHis: 0.944 ± 0.292
2.833SerIle: 2.833 ± 0.711
3.434SerLys: 3.434 ± 0.55
5.409SerLeu: 5.409 ± 0.721
1.545SerMet: 1.545 ± 0.319
2.404SerAsn: 2.404 ± 0.52
2.576SerPro: 2.576 ± 0.532
2.318SerGln: 2.318 ± 0.441
3.262SerArg: 3.262 ± 0.538
2.747SerSer: 2.747 ± 0.429
4.808SerThr: 4.808 ± 0.671
4.55SerVal: 4.55 ± 0.494
0.773SerTrp: 0.773 ± 0.268
2.232SerTyr: 2.232 ± 0.397
0.0SerXaa: 0.0 ± 0.0
Thr
8.156ThrAla: 8.156 ± 0.959
0.429ThrCys: 0.429 ± 0.195
3.606ThrAsp: 3.606 ± 0.572
3.177ThrGlu: 3.177 ± 0.643
3.091ThrPhe: 3.091 ± 0.534
6.01ThrGly: 6.01 ± 1.076
1.116ThrHis: 1.116 ± 0.351
3.606ThrIle: 3.606 ± 0.468
3.434ThrLys: 3.434 ± 0.502
5.409ThrLeu: 5.409 ± 0.735
1.03ThrMet: 1.03 ± 0.323
2.146ThrAsn: 2.146 ± 0.498
4.55ThrPro: 4.55 ± 0.496
1.631ThrGln: 1.631 ± 0.348
2.661ThrArg: 2.661 ± 0.324
3.863ThrSer: 3.863 ± 0.568
3.863ThrThr: 3.863 ± 0.619
3.863ThrVal: 3.863 ± 0.524
0.859ThrTrp: 0.859 ± 0.285
2.404ThrTyr: 2.404 ± 0.468
0.0ThrXaa: 0.0 ± 0.0
Val
7.984ValAla: 7.984 ± 1.056
0.859ValCys: 0.859 ± 0.27
3.692ValAsp: 3.692 ± 0.544
4.293ValGlu: 4.293 ± 0.656
1.631ValPhe: 1.631 ± 0.307
4.636ValGly: 4.636 ± 0.676
0.859ValHis: 0.859 ± 0.32
4.722ValIle: 4.722 ± 0.698
4.979ValLys: 4.979 ± 0.616
4.979ValLeu: 4.979 ± 0.684
1.202ValMet: 1.202 ± 0.373
3.005ValAsn: 3.005 ± 0.548
1.975ValPro: 1.975 ± 0.466
2.833ValGln: 2.833 ± 0.527
4.207ValArg: 4.207 ± 0.492
5.065ValSer: 5.065 ± 0.598
5.495ValThr: 5.495 ± 0.829
5.838ValVal: 5.838 ± 0.816
0.859ValTrp: 0.859 ± 0.279
2.49ValTyr: 2.49 ± 0.502
0.0ValXaa: 0.0 ± 0.0
Trp
1.459TrpAla: 1.459 ± 0.566
0.343TrpCys: 0.343 ± 0.151
1.03TrpAsp: 1.03 ± 0.239
0.343TrpGlu: 0.343 ± 0.175
0.687TrpPhe: 0.687 ± 0.311
1.03TrpGly: 1.03 ± 0.263
0.429TrpHis: 0.429 ± 0.222
0.258TrpIle: 0.258 ± 0.145
0.515TrpLys: 0.515 ± 0.178
1.459TrpLeu: 1.459 ± 0.301
0.429TrpMet: 0.429 ± 0.195
0.687TrpAsn: 0.687 ± 0.34
0.258TrpPro: 0.258 ± 0.195
0.515TrpGln: 0.515 ± 0.231
1.459TrpArg: 1.459 ± 0.43
0.944TrpSer: 0.944 ± 0.29
0.687TrpThr: 0.687 ± 0.268
1.03TrpVal: 1.03 ± 0.292
0.172TrpTrp: 0.172 ± 0.109
0.429TrpTyr: 0.429 ± 0.252
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.434TyrAla: 3.434 ± 0.641
0.601TyrCys: 0.601 ± 0.275
1.545TyrAsp: 1.545 ± 0.461
2.576TyrGlu: 2.576 ± 0.397
1.803TyrPhe: 1.803 ± 0.371
2.576TyrGly: 2.576 ± 0.425
0.773TyrHis: 0.773 ± 0.262
1.545TyrIle: 1.545 ± 0.375
1.975TyrLys: 1.975 ± 0.487
1.975TyrLeu: 1.975 ± 0.323
0.859TyrMet: 0.859 ± 0.254
1.545TyrAsn: 1.545 ± 0.37
1.202TyrPro: 1.202 ± 0.424
1.631TyrGln: 1.631 ± 0.321
1.889TyrArg: 1.889 ± 0.501
2.318TyrSer: 2.318 ± 0.499
1.631TyrThr: 1.631 ± 0.368
2.06TyrVal: 2.06 ± 0.377
0.343TyrTrp: 0.343 ± 0.162
1.03TyrTyr: 1.03 ± 0.262
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (11649 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski