Amino acid dipepetide frequency for Escherichia phage ID52

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.785AlaAla: 3.785 ± 0.991
1.262AlaCys: 1.262 ± 0.969
6.308AlaAsp: 6.308 ± 1.533
3.364AlaGlu: 3.364 ± 0.966
3.364AlaPhe: 3.364 ± 0.962
9.251AlaGly: 9.251 ± 2.3
2.944AlaHis: 2.944 ± 1.108
4.205AlaIle: 4.205 ± 1.657
7.569AlaLys: 7.569 ± 1.717
5.467AlaLeu: 5.467 ± 1.416
0.841AlaMet: 0.841 ± 0.401
3.364AlaAsn: 3.364 ± 1.252
2.944AlaPro: 2.944 ± 0.871
3.364AlaGln: 3.364 ± 1.306
0.841AlaArg: 0.841 ± 0.7
5.046AlaSer: 5.046 ± 2.418
4.626AlaThr: 4.626 ± 1.454
7.149AlaVal: 7.149 ± 1.175
0.841AlaTrp: 0.841 ± 0.543
2.103AlaTyr: 2.103 ± 0.758
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.262CysAsp: 1.262 ± 0.757
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.421CysHis: 0.421 ± 0.418
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.421CysLeu: 0.421 ± 0.463
0.0CysMet: 0.0 ± 0.0
0.841CysAsn: 0.841 ± 0.549
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.421CysArg: 0.421 ± 0.355
1.262CysSer: 1.262 ± 0.56
0.0CysThr: 0.0 ± 0.0
1.262CysVal: 1.262 ± 0.499
0.0CysTrp: 0.0 ± 0.0
1.262CysTyr: 1.262 ± 0.604
0.0CysXaa: 0.0 ± 0.0
Asp
5.046AspAla: 5.046 ± 1.535
0.841AspCys: 0.841 ± 0.401
2.944AspAsp: 2.944 ± 0.796
3.364AspGlu: 3.364 ± 1.164
2.523AspPhe: 2.523 ± 1.261
2.944AspGly: 2.944 ± 1.337
0.841AspHis: 0.841 ± 0.488
5.467AspIle: 5.467 ± 1.441
0.421AspLys: 0.421 ± 0.38
2.103AspLeu: 2.103 ± 1.055
2.103AspMet: 2.103 ± 0.564
3.364AspAsn: 3.364 ± 0.711
2.523AspPro: 2.523 ± 1.039
1.262AspGln: 1.262 ± 0.506
3.364AspArg: 3.364 ± 1.169
2.944AspSer: 2.944 ± 1.783
2.103AspThr: 2.103 ± 0.643
5.046AspVal: 5.046 ± 1.634
0.421AspTrp: 0.421 ± 0.435
3.785AspTyr: 3.785 ± 0.941
0.0AspXaa: 0.0 ± 0.0
Glu
4.205GluAla: 4.205 ± 1.177
1.262GluCys: 1.262 ± 0.679
0.841GluAsp: 0.841 ± 0.455
2.523GluGlu: 2.523 ± 1.361
2.523GluPhe: 2.523 ± 1.407
2.103GluGly: 2.103 ± 0.562
0.841GluHis: 0.841 ± 0.543
4.205GluIle: 4.205 ± 1.122
1.262GluLys: 1.262 ± 0.76
5.887GluLeu: 5.887 ± 1.154
1.262GluMet: 1.262 ± 0.801
2.523GluAsn: 2.523 ± 1.106
0.841GluPro: 0.841 ± 0.401
2.103GluGln: 2.103 ± 0.874
3.364GluArg: 3.364 ± 1.238
2.944GluSer: 2.944 ± 0.748
4.205GluThr: 4.205 ± 0.983
1.682GluVal: 1.682 ± 0.6
0.841GluTrp: 0.841 ± 0.401
0.841GluTyr: 0.841 ± 0.401
0.0GluXaa: 0.0 ± 0.0
Phe
1.682PheAla: 1.682 ± 0.876
0.841PheCys: 0.841 ± 0.401
1.682PheAsp: 1.682 ± 0.65
1.262PheGlu: 1.262 ± 0.725
0.421PhePhe: 0.421 ± 0.39
2.944PheGly: 2.944 ± 0.635
1.262PheHis: 1.262 ± 0.412
2.103PheIle: 2.103 ± 0.829
2.944PheLys: 2.944 ± 0.802
1.262PheLeu: 1.262 ± 0.748
2.103PheMet: 2.103 ± 0.685
2.944PheAsn: 2.944 ± 0.757
2.103PhePro: 2.103 ± 1.208
1.682PheGln: 1.682 ± 0.67
3.364PheArg: 3.364 ± 1.2
2.103PheSer: 2.103 ± 0.708
2.523PheThr: 2.523 ± 0.825
1.262PheVal: 1.262 ± 0.927
0.841PheTrp: 0.841 ± 0.453
3.364PheTyr: 3.364 ± 1.025
0.0PheXaa: 0.0 ± 0.0
Gly
4.626GlyAla: 4.626 ± 2.346
0.0GlyCys: 0.0 ± 0.0
1.262GlyAsp: 1.262 ± 0.783
1.262GlyGlu: 1.262 ± 0.604
2.523GlyPhe: 2.523 ± 0.613
4.205GlyGly: 4.205 ± 1.62
1.262GlyHis: 1.262 ± 0.525
5.467GlyIle: 5.467 ± 1.176
7.149GlyLys: 7.149 ± 1.377
5.046GlyLeu: 5.046 ± 0.997
1.682GlyMet: 1.682 ± 0.536
3.364GlyAsn: 3.364 ± 0.845
0.0GlyPro: 0.0 ± 0.0
2.523GlyGln: 2.523 ± 1.162
3.364GlyArg: 3.364 ± 1.094
2.523GlySer: 2.523 ± 0.854
3.785GlyThr: 3.785 ± 1.03
4.626GlyVal: 4.626 ± 1.103
2.103GlyTrp: 2.103 ± 0.713
3.364GlyTyr: 3.364 ± 0.768
0.0GlyXaa: 0.0 ± 0.0
His
2.523HisAla: 2.523 ± 0.773
0.0HisCys: 0.0 ± 0.0
0.421HisAsp: 0.421 ± 0.39
1.682HisGlu: 1.682 ± 0.47
1.682HisPhe: 1.682 ± 0.567
1.682HisGly: 1.682 ± 0.511
0.841HisHis: 0.841 ± 0.594
0.421HisIle: 0.421 ± 0.39
0.841HisLys: 0.841 ± 0.453
2.944HisLeu: 2.944 ± 0.766
0.421HisMet: 0.421 ± 0.39
0.841HisAsn: 0.841 ± 0.544
0.841HisPro: 0.841 ± 0.627
0.841HisGln: 0.841 ± 0.453
0.841HisArg: 0.841 ± 0.491
1.262HisSer: 1.262 ± 0.521
2.103HisThr: 2.103 ± 0.674
1.262HisVal: 1.262 ± 0.893
1.262HisTrp: 1.262 ± 0.679
0.841HisTyr: 0.841 ± 0.453
0.0HisXaa: 0.0 ± 0.0
Ile
7.149IleAla: 7.149 ± 1.494
0.421IleCys: 0.421 ± 0.355
3.364IleAsp: 3.364 ± 0.947
2.103IleGlu: 2.103 ± 0.767
0.841IlePhe: 0.841 ± 0.78
3.364IleGly: 3.364 ± 0.587
0.421IleHis: 0.421 ± 0.355
1.262IleIle: 1.262 ± 0.743
3.785IleLys: 3.785 ± 0.993
3.785IleLeu: 3.785 ± 1.129
3.364IleMet: 3.364 ± 1.42
4.205IleAsn: 4.205 ± 0.957
2.523IlePro: 2.523 ± 0.739
2.103IleGln: 2.103 ± 1.127
3.364IleArg: 3.364 ± 0.871
2.944IleSer: 2.944 ± 1.113
1.262IleThr: 1.262 ± 0.676
1.262IleVal: 1.262 ± 0.718
0.841IleTrp: 0.841 ± 0.453
0.841IleTyr: 0.841 ± 0.78
0.0IleXaa: 0.0 ± 0.0
Lys
2.103LysAla: 2.103 ± 0.731
0.0LysCys: 0.0 ± 0.0
6.308LysAsp: 6.308 ± 1.363
6.728LysGlu: 6.728 ± 1.867
3.364LysPhe: 3.364 ± 0.962
4.205LysGly: 4.205 ± 1.127
1.262LysHis: 1.262 ± 0.553
2.944LysIle: 2.944 ± 0.73
4.205LysLys: 4.205 ± 1.665
5.467LysLeu: 5.467 ± 1.145
2.103LysMet: 2.103 ± 1.1
0.421LysAsn: 0.421 ± 0.435
2.944LysPro: 2.944 ± 1.65
3.364LysGln: 3.364 ± 0.955
0.421LysArg: 0.421 ± 0.318
7.149LysSer: 7.149 ± 1.783
4.626LysThr: 4.626 ± 1.115
2.944LysVal: 2.944 ± 0.777
1.262LysTrp: 1.262 ± 0.412
1.682LysTyr: 1.682 ± 0.641
0.0LysXaa: 0.0 ± 0.0
Leu
7.149LeuAla: 7.149 ± 1.459
0.841LeuCys: 0.841 ± 0.667
4.626LeuAsp: 4.626 ± 1.07
2.944LeuGlu: 2.944 ± 0.941
1.262LeuPhe: 1.262 ± 0.816
4.205LeuGly: 4.205 ± 0.912
1.682LeuHis: 1.682 ± 0.695
2.523LeuIle: 2.523 ± 1.444
10.093LeuLys: 10.093 ± 2.253
10.934LeuLeu: 10.934 ± 5.709
4.626LeuMet: 4.626 ± 0.955
2.944LeuAsn: 2.944 ± 1.412
3.364LeuPro: 3.364 ± 1.147
3.785LeuGln: 3.785 ± 0.958
5.046LeuArg: 5.046 ± 1.42
8.831LeuSer: 8.831 ± 1.997
9.251LeuThr: 9.251 ± 1.742
4.205LeuVal: 4.205 ± 1.142
1.682LeuTrp: 1.682 ± 0.736
0.841LeuTyr: 0.841 ± 0.544
0.0LeuXaa: 0.0 ± 0.0
Met
2.944MetAla: 2.944 ± 1.081
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
2.944MetGlu: 2.944 ± 0.652
1.262MetPhe: 1.262 ± 0.709
0.421MetGly: 0.421 ± 0.39
0.421MetHis: 0.421 ± 0.39
0.421MetIle: 0.421 ± 0.418
3.364MetLys: 3.364 ± 0.975
2.523MetLeu: 2.523 ± 1.44
0.421MetMet: 0.421 ± 0.445
1.262MetAsn: 1.262 ± 0.604
2.103MetPro: 2.103 ± 0.708
2.523MetGln: 2.523 ± 1.282
3.785MetArg: 3.785 ± 0.95
2.523MetSer: 2.523 ± 0.374
2.523MetThr: 2.523 ± 1.423
1.682MetVal: 1.682 ± 0.529
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
5.887AsnAla: 5.887 ± 1.008
0.421AsnCys: 0.421 ± 0.513
1.262AsnAsp: 1.262 ± 0.643
2.944AsnGlu: 2.944 ± 1.429
2.523AsnPhe: 2.523 ± 0.613
1.682AsnGly: 1.682 ± 0.768
1.262AsnHis: 1.262 ± 0.513
3.785AsnIle: 3.785 ± 1.193
3.364AsnLys: 3.364 ± 0.836
6.308AsnLeu: 6.308 ± 1.662
2.103AsnMet: 2.103 ± 1.331
4.626AsnAsn: 4.626 ± 0.79
2.944AsnPro: 2.944 ± 0.924
2.523AsnGln: 2.523 ± 1.33
2.523AsnArg: 2.523 ± 0.634
4.205AsnSer: 4.205 ± 1.067
4.205AsnThr: 4.205 ± 1.036
2.944AsnVal: 2.944 ± 0.989
0.0AsnTrp: 0.0 ± 0.0
2.103AsnTyr: 2.103 ± 0.733
0.0AsnXaa: 0.0 ± 0.0
Pro
1.682ProAla: 1.682 ± 0.884
0.0ProCys: 0.0 ± 0.0
1.682ProAsp: 1.682 ± 1.152
2.944ProGlu: 2.944 ± 0.585
1.262ProPhe: 1.262 ± 0.412
0.421ProGly: 0.421 ± 0.418
1.262ProHis: 1.262 ± 0.757
2.103ProIle: 2.103 ± 1.042
2.944ProLys: 2.944 ± 1.128
5.887ProLeu: 5.887 ± 1.513
0.0ProMet: 0.0 ± 0.0
4.626ProAsn: 4.626 ± 1.498
2.103ProPro: 2.103 ± 1.526
0.841ProGln: 0.841 ± 0.488
1.682ProArg: 1.682 ± 0.821
3.364ProSer: 3.364 ± 1.406
2.523ProThr: 2.523 ± 0.924
5.887ProVal: 5.887 ± 1.436
0.421ProTrp: 0.421 ± 0.39
1.262ProTyr: 1.262 ± 0.412
0.0ProXaa: 0.0 ± 0.0
Gln
1.262GlnAla: 1.262 ± 0.65
0.421GlnCys: 0.421 ± 0.318
1.682GlnAsp: 1.682 ± 0.384
2.523GlnGlu: 2.523 ± 0.827
1.262GlnPhe: 1.262 ± 0.702
2.523GlnGly: 2.523 ± 0.687
1.682GlnHis: 1.682 ± 0.545
2.103GlnIle: 2.103 ± 0.679
2.944GlnLys: 2.944 ± 1.797
3.785GlnLeu: 3.785 ± 1.142
0.421GlnMet: 0.421 ± 0.38
4.626GlnAsn: 4.626 ± 1.696
2.523GlnPro: 2.523 ± 0.778
2.103GlnGln: 2.103 ± 0.932
1.262GlnArg: 1.262 ± 0.582
4.626GlnSer: 4.626 ± 0.809
3.785GlnThr: 3.785 ± 1.34
2.103GlnVal: 2.103 ± 0.91
0.841GlnTrp: 0.841 ± 0.78
1.682GlnTyr: 1.682 ± 0.722
0.0GlnXaa: 0.0 ± 0.0
Arg
5.046ArgAla: 5.046 ± 1.776
0.0ArgCys: 0.0 ± 0.0
3.364ArgAsp: 3.364 ± 0.988
1.682ArgGlu: 1.682 ± 0.67
2.523ArgPhe: 2.523 ± 0.989
2.944ArgGly: 2.944 ± 0.636
1.262ArgHis: 1.262 ± 0.839
2.944ArgIle: 2.944 ± 1.286
0.841ArgLys: 0.841 ± 0.569
5.046ArgLeu: 5.046 ± 1.575
2.103ArgMet: 2.103 ± 0.864
2.523ArgAsn: 2.523 ± 0.971
2.103ArgPro: 2.103 ± 1.064
1.682ArgGln: 1.682 ± 0.545
4.626ArgArg: 4.626 ± 1.54
4.626ArgSer: 4.626 ± 0.963
5.467ArgThr: 5.467 ± 1.163
3.364ArgVal: 3.364 ± 2.301
0.421ArgTrp: 0.421 ± 0.513
2.103ArgTyr: 2.103 ± 0.478
0.0ArgXaa: 0.0 ± 0.0
Ser
7.149SerAla: 7.149 ± 1.519
0.0SerCys: 0.0 ± 0.0
4.205SerAsp: 4.205 ± 0.91
1.682SerGlu: 1.682 ± 0.888
2.944SerPhe: 2.944 ± 0.693
5.467SerGly: 5.467 ± 1.463
1.262SerHis: 1.262 ± 0.771
2.944SerIle: 2.944 ± 0.997
4.626SerLys: 4.626 ± 1.394
7.569SerLeu: 7.569 ± 1.957
4.205SerMet: 4.205 ± 0.921
2.944SerAsn: 2.944 ± 1.048
3.364SerPro: 3.364 ± 1.349
2.523SerGln: 2.523 ± 1.003
6.308SerArg: 6.308 ± 1.224
7.569SerSer: 7.569 ± 1.104
4.626SerThr: 4.626 ± 0.855
4.626SerVal: 4.626 ± 1.276
0.841SerTrp: 0.841 ± 0.568
1.682SerTyr: 1.682 ± 0.511
0.0SerXaa: 0.0 ± 0.0
Thr
6.728ThrAla: 6.728 ± 1.473
0.841ThrCys: 0.841 ± 0.444
4.626ThrAsp: 4.626 ± 1.153
2.523ThrGlu: 2.523 ± 1.174
1.262ThrPhe: 1.262 ± 0.758
2.523ThrGly: 2.523 ± 1.0
2.103ThrHis: 2.103 ± 0.625
3.785ThrIle: 3.785 ± 1.188
5.467ThrLys: 5.467 ± 1.334
7.149ThrLeu: 7.149 ± 1.608
0.421ThrMet: 0.421 ± 0.318
5.046ThrAsn: 5.046 ± 1.311
2.103ThrPro: 2.103 ± 0.626
5.887ThrGln: 5.887 ± 1.258
3.785ThrArg: 3.785 ± 1.256
5.887ThrSer: 5.887 ± 1.686
5.467ThrThr: 5.467 ± 1.889
2.523ThrVal: 2.523 ± 1.405
0.841ThrTrp: 0.841 ± 0.401
2.103ThrTyr: 2.103 ± 0.806
0.0ThrXaa: 0.0 ± 0.0
Val
6.308ValAla: 6.308 ± 1.089
0.0ValCys: 0.0 ± 0.0
2.944ValAsp: 2.944 ± 1.06
2.103ValGlu: 2.103 ± 0.98
2.103ValPhe: 2.103 ± 0.713
5.887ValGly: 5.887 ± 1.589
2.523ValHis: 2.523 ± 0.848
1.262ValIle: 1.262 ± 0.65
0.421ValLys: 0.421 ± 0.463
6.308ValLeu: 6.308 ± 1.714
0.841ValMet: 0.841 ± 0.731
3.785ValAsn: 3.785 ± 1.463
4.626ValPro: 4.626 ± 1.326
4.205ValGln: 4.205 ± 1.054
3.785ValArg: 3.785 ± 0.6
3.364ValSer: 3.364 ± 0.57
4.626ValThr: 4.626 ± 1.919
3.364ValVal: 3.364 ± 1.088
0.421ValTrp: 0.421 ± 0.463
2.944ValTyr: 2.944 ± 0.959
0.0ValXaa: 0.0 ± 0.0
Trp
0.421TrpAla: 0.421 ± 0.39
0.0TrpCys: 0.0 ± 0.0
0.841TrpAsp: 0.841 ± 0.453
0.421TrpGlu: 0.421 ± 0.38
0.421TrpPhe: 0.421 ± 0.435
0.421TrpGly: 0.421 ± 0.39
0.0TrpHis: 0.0 ± 0.0
0.841TrpIle: 0.841 ± 0.575
0.841TrpLys: 0.841 ± 0.549
0.841TrpLeu: 0.841 ± 0.401
0.421TrpMet: 0.421 ± 0.318
1.262TrpAsn: 1.262 ± 0.412
1.682TrpPro: 1.682 ± 0.802
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.262TrpSer: 1.262 ± 0.771
2.523TrpThr: 2.523 ± 0.739
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.682TrpTyr: 1.682 ± 0.592
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.944TyrAla: 2.944 ± 0.793
0.0TyrCys: 0.0 ± 0.0
3.785TyrAsp: 3.785 ± 1.052
1.262TyrGlu: 1.262 ± 0.724
4.626TyrPhe: 4.626 ± 0.662
2.944TyrGly: 2.944 ± 0.732
0.0TyrHis: 0.0 ± 0.0
0.421TyrIle: 0.421 ± 0.39
0.841TyrLys: 0.841 ± 0.453
2.103TyrLeu: 2.103 ± 1.178
1.262TyrMet: 1.262 ± 0.582
2.103TyrAsn: 2.103 ± 0.865
1.262TyrPro: 1.262 ± 0.637
0.841TyrGln: 0.841 ± 0.401
2.523TyrArg: 2.523 ± 1.179
2.103TyrSer: 2.103 ± 0.713
0.841TyrThr: 0.841 ± 0.453
4.626TyrVal: 4.626 ± 1.402
0.0TyrTrp: 0.0 ± 0.0
0.421TyrTyr: 0.421 ± 0.463
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (2379 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski