Amino acid dipeptide frequency for Escherichia phage HX01

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
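As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 2.5‰. The function names, the one-letter input alphabet, and the choice to normalize by the total number of dipeptides are assumptions made for this example; the database may normalize differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: frequencies are normalized by the total number of
    overlapping dipeptides found in the input sequences.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"    # would be marked red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"   # would be marked blue
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_frequencies(["MKKLLE", "MAEEK"])  # toy sequences, not real data
    for pair in ("KK", "EE", "WW"):
        print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With real data, the input would be the protein sequences of the proteome rather than the toy sequences used here.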

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
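The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical and exist only for this example. For instance, the AlaAla value of 2.768‰ corresponds to a fraction of about 0.002768, or about 0.277%.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    ala_ala = 2.768  # AlaAla frequency from the table, in per mille
    print(per_mille_to_fraction(ala_ala))
    print(per_mille_to_percent(ala_ala))
```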

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.768AlaAla: 2.768 ± 0.85
0.426AlaCys: 0.426 ± 0.276
2.342AlaAsp: 2.342 ± 0.681
5.323AlaGlu: 5.323 ± 0.961
2.342AlaPhe: 2.342 ± 0.885
3.619AlaGly: 3.619 ± 0.853
1.065AlaHis: 1.065 ± 0.449
5.11AlaIle: 5.11 ± 1.312
4.897AlaLys: 4.897 ± 1.151
5.748AlaLeu: 5.748 ± 1.043
1.916AlaMet: 1.916 ± 0.693
1.277AlaAsn: 1.277 ± 0.51
2.342AlaPro: 2.342 ± 0.688
2.129AlaGln: 2.129 ± 0.753
1.277AlaArg: 1.277 ± 0.534
3.832AlaSer: 3.832 ± 1.098
1.065AlaThr: 1.065 ± 0.57
4.684AlaVal: 4.684 ± 0.9
0.639AlaTrp: 0.639 ± 0.344
2.981AlaTyr: 2.981 ± 0.687
0.0AlaXaa: 0.0 ± 0.0
Cys
0.639CysAla: 0.639 ± 0.373
0.213CysCys: 0.213 ± 0.205
1.703CysAsp: 1.703 ± 0.526
0.639CysGlu: 0.639 ± 0.331
0.0CysPhe: 0.0 ± 0.0
1.065CysGly: 1.065 ± 0.44
0.213CysHis: 0.213 ± 0.183
0.639CysIle: 0.639 ± 0.325
1.065CysLys: 1.065 ± 0.5
0.639CysLeu: 0.639 ± 0.307
0.852CysMet: 0.852 ± 0.438
0.213CysAsn: 0.213 ± 0.21
0.639CysPro: 0.639 ± 0.359
0.0CysGln: 0.0 ± 0.0
0.426CysArg: 0.426 ± 0.321
1.49CysSer: 1.49 ± 0.654
0.426CysThr: 0.426 ± 0.313
1.277CysVal: 1.277 ± 0.514
0.426CysTrp: 0.426 ± 0.259
0.852CysTyr: 0.852 ± 0.433
0.0CysXaa: 0.0 ± 0.0
Asp
4.045AspAla: 4.045 ± 0.786
0.639AspCys: 0.639 ± 0.324
4.258AspAsp: 4.258 ± 1.189
6.174AspGlu: 6.174 ± 1.028
2.981AspPhe: 2.981 ± 0.719
5.535AspGly: 5.535 ± 1.001
0.426AspHis: 0.426 ± 0.257
6.174AspIle: 6.174 ± 1.202
3.619AspLys: 3.619 ± 0.897
3.406AspLeu: 3.406 ± 0.762
1.703AspMet: 1.703 ± 0.653
1.703AspAsn: 1.703 ± 0.576
1.065AspPro: 1.065 ± 0.461
2.342AspGln: 2.342 ± 0.629
2.129AspArg: 2.129 ± 0.557
3.194AspSer: 3.194 ± 0.733
2.981AspThr: 2.981 ± 0.742
3.619AspVal: 3.619 ± 0.804
1.277AspTrp: 1.277 ± 0.467
3.832AspTyr: 3.832 ± 0.906
0.0AspXaa: 0.0 ± 0.0
Glu
6.387GluAla: 6.387 ± 1.306
1.703GluCys: 1.703 ± 0.574
5.748GluAsp: 5.748 ± 1.101
6.6GluGlu: 6.6 ± 1.278
5.535GluPhe: 5.535 ± 1.012
3.194GluGly: 3.194 ± 0.678
1.703GluHis: 1.703 ± 0.609
8.09GluIle: 8.09 ± 1.417
6.813GluLys: 6.813 ± 1.18
8.516GluLeu: 8.516 ± 1.53
3.619GluMet: 3.619 ± 0.752
5.11GluAsn: 5.11 ± 0.928
2.129GluPro: 2.129 ± 0.552
3.194GluGln: 3.194 ± 0.737
2.981GluArg: 2.981 ± 0.962
2.981GluSer: 2.981 ± 1.089
4.045GluThr: 4.045 ± 0.874
7.239GluVal: 7.239 ± 1.185
1.277GluTrp: 1.277 ± 0.564
3.832GluTyr: 3.832 ± 0.839
0.0GluXaa: 0.0 ± 0.0
Phe
2.129PheAla: 2.129 ± 0.706
0.852PheCys: 0.852 ± 0.48
4.045PheAsp: 4.045 ± 1.04
3.619PheGlu: 3.619 ± 0.961
1.49PhePhe: 1.49 ± 0.655
1.49PheGly: 1.49 ± 0.496
0.213PheHis: 0.213 ± 0.214
3.194PheIle: 3.194 ± 1.164
5.961PheLys: 5.961 ± 1.169
3.619PheLeu: 3.619 ± 1.132
1.916PheMet: 1.916 ± 0.71
4.045PheAsn: 4.045 ± 0.934
1.703PhePro: 1.703 ± 0.538
1.277PheGln: 1.277 ± 0.64
2.981PheArg: 2.981 ± 0.989
2.342PheSer: 2.342 ± 0.613
2.555PheThr: 2.555 ± 0.904
3.619PheVal: 3.619 ± 0.853
0.426PheTrp: 0.426 ± 0.311
1.065PheTyr: 1.065 ± 0.518
0.0PheXaa: 0.0 ± 0.0
Gly
1.277GlyAla: 1.277 ± 0.443
0.426GlyCys: 0.426 ± 0.306
2.555GlyAsp: 2.555 ± 0.714
3.832GlyGlu: 3.832 ± 0.794
3.406GlyPhe: 3.406 ± 1.242
2.555GlyGly: 2.555 ± 0.669
1.277GlyHis: 1.277 ± 0.51
3.832GlyIle: 3.832 ± 0.871
4.471GlyLys: 4.471 ± 0.961
4.471GlyLeu: 4.471 ± 0.951
2.129GlyMet: 2.129 ± 0.685
1.065GlyAsn: 1.065 ± 0.491
1.277GlyPro: 1.277 ± 0.531
0.852GlyGln: 0.852 ± 0.416
1.277GlyArg: 1.277 ± 0.538
3.194GlySer: 3.194 ± 0.807
4.471GlyThr: 4.471 ± 0.905
3.406GlyVal: 3.406 ± 0.762
1.277GlyTrp: 1.277 ± 0.49
2.768GlyTyr: 2.768 ± 0.75
0.0GlyXaa: 0.0 ± 0.0
His
0.639HisAla: 0.639 ± 0.386
0.639HisCys: 0.639 ± 0.393
1.065HisAsp: 1.065 ± 0.371
1.277HisGlu: 1.277 ± 0.613
0.213HisPhe: 0.213 ± 0.195
1.703HisGly: 1.703 ± 0.62
0.426HisHis: 0.426 ± 0.342
1.065HisIle: 1.065 ± 0.433
1.916HisLys: 1.916 ± 0.533
1.065HisLeu: 1.065 ± 0.471
0.426HisMet: 0.426 ± 0.296
0.852HisAsn: 0.852 ± 0.424
1.065HisPro: 1.065 ± 0.444
0.213HisGln: 0.213 ± 0.238
1.277HisArg: 1.277 ± 0.495
1.065HisSer: 1.065 ± 0.434
0.639HisThr: 0.639 ± 0.352
1.277HisVal: 1.277 ± 0.649
0.426HisTrp: 0.426 ± 0.308
0.852HisTyr: 0.852 ± 0.54
0.0HisXaa: 0.0 ± 0.0
Ile
4.897IleAla: 4.897 ± 0.995
0.852IleCys: 0.852 ± 0.43
3.832IleAsp: 3.832 ± 0.947
6.387IleGlu: 6.387 ± 1.211
3.194IlePhe: 3.194 ± 0.758
3.619IleGly: 3.619 ± 1.052
1.277IleHis: 1.277 ± 0.475
6.174IleIle: 6.174 ± 1.391
8.09IleLys: 8.09 ± 1.267
3.406IleLeu: 3.406 ± 0.871
2.555IleMet: 2.555 ± 0.691
4.045IleAsn: 4.045 ± 0.839
1.065IlePro: 1.065 ± 0.496
2.555IleGln: 2.555 ± 0.974
4.045IleArg: 4.045 ± 0.994
4.045IleSer: 4.045 ± 0.796
5.748IleThr: 5.748 ± 0.986
4.897IleVal: 4.897 ± 1.365
0.639IleTrp: 0.639 ± 0.419
2.555IleTyr: 2.555 ± 0.612
0.0IleXaa: 0.0 ± 0.0
Lys
7.452LysAla: 7.452 ± 1.081
1.065LysCys: 1.065 ± 0.434
5.748LysAsp: 5.748 ± 1.066
8.516LysGlu: 8.516 ± 1.708
3.406LysPhe: 3.406 ± 0.978
3.832LysGly: 3.832 ± 0.81
1.49LysHis: 1.49 ± 0.66
5.323LysIle: 5.323 ± 0.998
7.877LysLys: 7.877 ± 1.348
7.452LysLeu: 7.452 ± 1.187
4.258LysMet: 4.258 ± 0.854
4.897LysAsn: 4.897 ± 0.936
2.129LysPro: 2.129 ± 0.629
2.129LysGln: 2.129 ± 0.596
5.323LysArg: 5.323 ± 1.156
4.471LysSer: 4.471 ± 0.999
2.768LysThr: 2.768 ± 0.774
5.961LysVal: 5.961 ± 1.042
1.277LysTrp: 1.277 ± 0.42
2.555LysTyr: 2.555 ± 0.679
0.0LysXaa: 0.0 ± 0.0
Leu
4.258LeuAla: 4.258 ± 0.94
1.49LeuCys: 1.49 ± 0.563
6.174LeuAsp: 6.174 ± 1.406
6.813LeuGlu: 6.813 ± 1.448
2.555LeuPhe: 2.555 ± 0.715
4.471LeuGly: 4.471 ± 0.822
0.639LeuHis: 0.639 ± 0.369
7.452LeuIle: 7.452 ± 1.427
5.11LeuLys: 5.11 ± 0.836
6.813LeuLeu: 6.813 ± 0.934
2.342LeuMet: 2.342 ± 0.658
5.748LeuAsn: 5.748 ± 1.009
2.129LeuPro: 2.129 ± 0.702
2.342LeuGln: 2.342 ± 0.723
5.748LeuArg: 5.748 ± 0.986
2.555LeuSer: 2.555 ± 0.74
4.045LeuThr: 4.045 ± 1.042
5.535LeuVal: 5.535 ± 1.06
0.639LeuTrp: 0.639 ± 0.384
3.194LeuTyr: 3.194 ± 0.905
0.0LeuXaa: 0.0 ± 0.0
Met
1.916MetAla: 1.916 ± 0.704
0.213MetCys: 0.213 ± 0.194
1.277MetAsp: 1.277 ± 0.505
3.194MetGlu: 3.194 ± 0.808
2.555MetPhe: 2.555 ± 0.877
1.916MetGly: 1.916 ± 0.58
0.0MetHis: 0.0 ± 0.0
0.852MetIle: 0.852 ± 0.434
2.768MetLys: 2.768 ± 0.811
2.981MetLeu: 2.981 ± 0.785
1.277MetMet: 1.277 ± 0.447
2.342MetAsn: 2.342 ± 0.725
0.852MetPro: 0.852 ± 0.489
1.277MetGln: 1.277 ± 0.577
2.342MetArg: 2.342 ± 0.741
1.49MetSer: 1.49 ± 0.542
1.916MetThr: 1.916 ± 0.616
3.194MetVal: 3.194 ± 0.929
0.0MetTrp: 0.0 ± 0.0
1.065MetTyr: 1.065 ± 0.52
0.0MetXaa: 0.0 ± 0.0
Asn
3.406AsnAla: 3.406 ± 0.952
0.213AsnCys: 0.213 ± 0.235
2.129AsnAsp: 2.129 ± 0.576
7.452AsnGlu: 7.452 ± 1.242
2.981AsnPhe: 2.981 ± 0.692
4.045AsnGly: 4.045 ± 1.028
0.639AsnHis: 0.639 ± 0.342
2.768AsnIle: 2.768 ± 0.635
4.897AsnLys: 4.897 ± 0.965
3.194AsnLeu: 3.194 ± 0.901
2.129AsnMet: 2.129 ± 0.688
2.981AsnAsn: 2.981 ± 0.787
1.916AsnPro: 1.916 ± 0.707
1.277AsnGln: 1.277 ± 0.521
1.916AsnArg: 1.916 ± 0.639
3.194AsnSer: 3.194 ± 0.786
2.342AsnThr: 2.342 ± 0.642
2.555AsnVal: 2.555 ± 0.794
0.426AsnTrp: 0.426 ± 0.336
2.129AsnTyr: 2.129 ± 0.721
0.0AsnXaa: 0.0 ± 0.0
Pro
1.916ProAla: 1.916 ± 0.767
0.639ProCys: 0.639 ± 0.344
1.49ProAsp: 1.49 ± 0.57
1.277ProGlu: 1.277 ± 0.512
2.129ProPhe: 2.129 ± 0.621
1.49ProGly: 1.49 ± 0.56
0.852ProHis: 0.852 ± 0.384
1.916ProIle: 1.916 ± 0.634
1.703ProLys: 1.703 ± 0.579
2.555ProLeu: 2.555 ± 0.675
0.639ProMet: 0.639 ± 0.381
1.916ProAsn: 1.916 ± 0.742
0.852ProPro: 0.852 ± 0.528
1.277ProGln: 1.277 ± 0.493
0.852ProArg: 0.852 ± 0.468
2.129ProSer: 2.129 ± 0.566
1.277ProThr: 1.277 ± 0.505
1.49ProVal: 1.49 ± 0.502
0.639ProTrp: 0.639 ± 0.512
0.639ProTyr: 0.639 ± 0.365
0.0ProXaa: 0.0 ± 0.0
Gln
2.342GlnAla: 2.342 ± 0.718
0.639GlnCys: 0.639 ± 0.41
1.703GlnAsp: 1.703 ± 0.604
3.194GlnGlu: 3.194 ± 0.737
0.852GlnPhe: 0.852 ± 0.368
0.852GlnGly: 0.852 ± 0.426
0.426GlnHis: 0.426 ± 0.277
2.768GlnIle: 2.768 ± 0.695
3.194GlnLys: 3.194 ± 0.831
4.045GlnLeu: 4.045 ± 1.102
0.426GlnMet: 0.426 ± 0.328
1.065GlnAsn: 1.065 ± 0.49
1.49GlnPro: 1.49 ± 0.527
1.703GlnGln: 1.703 ± 0.644
1.277GlnArg: 1.277 ± 0.433
1.065GlnSer: 1.065 ± 0.442
1.49GlnThr: 1.49 ± 0.474
1.49GlnVal: 1.49 ± 0.487
1.065GlnTrp: 1.065 ± 0.414
1.49GlnTyr: 1.49 ± 0.632
0.0GlnXaa: 0.0 ± 0.0
Arg
2.342ArgAla: 2.342 ± 0.776
0.213ArgCys: 0.213 ± 0.238
3.194ArgAsp: 3.194 ± 0.861
4.471ArgGlu: 4.471 ± 0.998
3.406ArgPhe: 3.406 ± 0.878
1.916ArgGly: 1.916 ± 0.624
1.277ArgHis: 1.277 ± 0.436
2.342ArgIle: 2.342 ± 0.565
5.535ArgLys: 5.535 ± 1.434
4.684ArgLeu: 4.684 ± 0.843
1.703ArgMet: 1.703 ± 0.535
0.852ArgAsn: 0.852 ± 0.411
0.639ArgPro: 0.639 ± 0.351
2.768ArgGln: 2.768 ± 0.87
1.703ArgArg: 1.703 ± 0.59
2.342ArgSer: 2.342 ± 0.735
1.703ArgThr: 1.703 ± 0.577
1.916ArgVal: 1.916 ± 0.8
0.426ArgTrp: 0.426 ± 0.41
1.49ArgTyr: 1.49 ± 0.502
0.0ArgXaa: 0.0 ± 0.0
Ser
2.129SerAla: 2.129 ± 0.565
1.065SerCys: 1.065 ± 0.419
2.129SerAsp: 2.129 ± 0.642
5.323SerGlu: 5.323 ± 0.948
4.045SerPhe: 4.045 ± 0.832
1.49SerGly: 1.49 ± 0.632
1.916SerHis: 1.916 ± 0.582
5.535SerIle: 5.535 ± 1.089
3.406SerLys: 3.406 ± 0.842
4.684SerLeu: 4.684 ± 0.902
1.277SerMet: 1.277 ± 0.53
3.406SerAsn: 3.406 ± 0.946
1.49SerPro: 1.49 ± 0.587
1.277SerGln: 1.277 ± 0.514
1.277SerArg: 1.277 ± 0.588
3.194SerSer: 3.194 ± 0.998
1.49SerThr: 1.49 ± 0.754
4.684SerVal: 4.684 ± 0.968
0.639SerTrp: 0.639 ± 0.395
1.916SerTyr: 1.916 ± 0.711
0.0SerXaa: 0.0 ± 0.0
Thr
1.916ThrAla: 1.916 ± 0.669
0.0ThrCys: 0.0 ± 0.0
2.981ThrAsp: 2.981 ± 0.705
5.11ThrGlu: 5.11 ± 1.053
1.065ThrPhe: 1.065 ± 0.437
2.129ThrGly: 2.129 ± 0.684
0.852ThrHis: 0.852 ± 0.471
2.981ThrIle: 2.981 ± 0.704
4.471ThrLys: 4.471 ± 1.013
4.684ThrLeu: 4.684 ± 1.006
1.703ThrMet: 1.703 ± 0.634
2.129ThrAsn: 2.129 ± 0.634
1.916ThrPro: 1.916 ± 0.671
1.916ThrGln: 1.916 ± 0.569
2.129ThrArg: 2.129 ± 0.698
1.49ThrSer: 1.49 ± 0.551
3.406ThrThr: 3.406 ± 1.036
4.471ThrVal: 4.471 ± 0.974
0.426ThrTrp: 0.426 ± 0.295
2.768ThrTyr: 2.768 ± 0.723
0.0ThrXaa: 0.0 ± 0.0
Val
3.194ValAla: 3.194 ± 0.928
1.49ValCys: 1.49 ± 0.617
5.11ValAsp: 5.11 ± 1.029
6.387ValGlu: 6.387 ± 1.301
2.981ValPhe: 2.981 ± 0.783
2.129ValGly: 2.129 ± 0.614
2.129ValHis: 2.129 ± 0.677
3.194ValIle: 3.194 ± 0.861
7.452ValLys: 7.452 ± 1.144
5.11ValLeu: 5.11 ± 0.975
1.065ValMet: 1.065 ± 0.413
4.045ValAsn: 4.045 ± 1.013
1.49ValPro: 1.49 ± 0.605
2.342ValGln: 2.342 ± 0.664
3.619ValArg: 3.619 ± 0.994
4.258ValSer: 4.258 ± 1.047
3.619ValThr: 3.619 ± 0.834
7.239ValVal: 7.239 ± 1.3
1.065ValTrp: 1.065 ± 0.526
4.045ValTyr: 4.045 ± 1.048
0.0ValXaa: 0.0 ± 0.0
Trp
0.639TrpAla: 0.639 ± 0.377
0.0TrpCys: 0.0 ± 0.0
0.852TrpAsp: 0.852 ± 0.378
0.639TrpGlu: 0.639 ± 0.379
0.639TrpPhe: 0.639 ± 0.342
0.0TrpGly: 0.0 ± 0.0
0.213TrpHis: 0.213 ± 0.238
2.129TrpIle: 2.129 ± 0.616
2.129TrpLys: 2.129 ± 0.593
0.852TrpLeu: 0.852 ± 0.467
0.0TrpMet: 0.0 ± 0.0
0.852TrpAsn: 0.852 ± 0.379
0.639TrpPro: 0.639 ± 0.357
0.639TrpGln: 0.639 ± 0.38
0.213TrpArg: 0.213 ± 0.223
1.065TrpSer: 1.065 ± 0.48
0.639TrpThr: 0.639 ± 0.397
0.852TrpVal: 0.852 ± 0.429
0.426TrpTrp: 0.426 ± 0.251
0.639TrpTyr: 0.639 ± 0.327
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.49TyrAla: 1.49 ± 0.524
0.639TyrCys: 0.639 ± 0.33
2.768TyrAsp: 2.768 ± 0.751
4.471TyrGlu: 4.471 ± 1.059
2.768TyrPhe: 2.768 ± 0.948
2.768TyrGly: 2.768 ± 0.844
1.065TyrHis: 1.065 ± 0.436
2.342TyrIle: 2.342 ± 0.603
2.768TyrLys: 2.768 ± 0.769
2.129TyrLeu: 2.129 ± 0.788
1.277TyrMet: 1.277 ± 0.511
4.045TyrAsn: 4.045 ± 0.924
0.852TyrPro: 0.852 ± 0.409
1.065TyrGln: 1.065 ± 0.421
2.129TyrArg: 2.129 ± 0.77
2.981TyrSer: 2.981 ± 0.761
1.916TyrThr: 1.916 ± 0.593
2.555TyrVal: 2.555 ± 0.67
0.639TyrTrp: 0.639 ± 0.349
1.916TyrTyr: 1.916 ± 0.722
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (4698 amino acids)
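The table entries above follow a regular pattern: the displayed value, the dipeptide name, then the value again with its error (for example `2.768AlaAla: 2.768 ± 0.85`). A minimal Python sketch for reading such lines back into structured records is shown below. The regular expression and the function name are assumptions based only on the layout seen on this page, not on the format of the downloadable CSV file.

```python
import re

# Pattern for one table entry, e.g. "2.768AlaAla: 2.768 ± 0.85".
# The leading number repeats the value and is captured but not returned.
CELL = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, value, error) or None for non-entry lines."""
    match = CELL.match(line.strip())
    if match is None:
        return None  # e.g. a row label such as "Ala"
    return (
        match["first"],
        match["second"],
        float(match["value"]),
        float(match["error"]),
    )


if __name__ == "__main__":
    for text in ("Ala", "2.768AlaAla: 2.768 ± 0.85", "0.426AlaCys: 0.426 ± 0.276"):
        print(parse_cell(text))
```

Row labels such as `Ala` do not match the pattern and are returned as `None`, so they can be skipped or used to track the current row.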

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
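Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the resulting estimates is taken as the error. This is an illustration only. The function names, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions, since the page does not state which statistic is reported.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by protein-level bootstrap.

    Each round draws len(proteins) whole proteins with replacement.
    Assumption: the error is the sample standard deviation of the
    per-round estimates.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility of the sketch
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLLEAA", "MAEEKK", "MKVLAAGK"]  # toy sequences, not real data
    print(dipeptide_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each dipeptide within its original sequence context.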

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski