Amino acid dipeptide frequency for Escherichia phage Lilleven

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
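As a rough illustration of how such frequencies can be obtained, the minimal Python sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name, the one-letter input alphabet and the toy sequences are assumptions made for this example; the exact counting and normalisation used to produce the table on this page may differ.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not the Lilleven proteome.
toy_proteins = ["MKKLA", "MAKK"]
toy_freqs = dipeptide_per_mille(toy_proteins)

for pair, value in sorted(toy_freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The order of the two residues matters here, so "AK" and "KA" are counted as different dipeptides, matching the separate AlaLys and LysAla entries in the table.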

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
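The uniform expectation and the per mille scaling can be checked with a few lines of Python. This is only a sketch: the helper names are hypothetical, and it assumes that "expected" means the uniform baseline of 1/400 (or 1/441 with Xaa) described above. The AlaGly and CysAla values are taken from the table.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one ordered dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(observed, expected):
    """Label a dipeptide relative to the expected per mille value."""
    if observed > expected:
        return "over-represented"
    if observed < expected:
        return "under-represented"
    return "as expected"


baseline = expected_per_mille(20)           # 400 possible dipeptides
baseline_with_xaa = expected_per_mille(21)  # 441 when Xaa is included

print(baseline)
print(classify(9.626, baseline))    # AlaGly from the table
print(classify(0.535, baseline))    # CysAla from the table
print(per_mille_to_fraction(9.626))
```

With 20 amino acids the baseline is 2.5‰, so a table value such as 9.626 for AlaGly lies well above it and 0.535 for CysAla well below it.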

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.882 ± 2.259
AlaCys: 1.604 ± 1.246
AlaAsp: 3.743 ± 1.291
AlaGlu: 6.417 ± 2.114
AlaPhe: 2.139 ± 0.793
AlaGly: 9.626 ± 4.473
AlaHis: 2.139 ± 1.195
AlaIle: 3.209 ± 0.754
AlaLys: 4.813 ± 2.156
AlaLeu: 7.487 ± 2.066
AlaMet: 1.07 ± 0.714
AlaAsn: 2.674 ± 1.066
AlaPro: 2.674 ± 1.386
AlaGln: 3.209 ± 1.39
AlaArg: 3.209 ± 1.093
AlaSer: 6.417 ± 3.336
AlaThr: 5.882 ± 1.377
AlaVal: 5.348 ± 1.972
AlaTrp: 0.535 ± 0.42
AlaTyr: 2.674 ± 1.288
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.535 ± 0.42
CysCys: 0.535 ± 0.458
CysAsp: 0.535 ± 0.845
CysGlu: 0.0 ± 0.0
CysPhe: 1.604 ± 1.192
CysGly: 0.535 ± 0.458
CysHis: 0.535 ± 0.458
CysIle: 0.0 ± 0.0
CysLys: 0.535 ± 0.42
CysLeu: 1.07 ± 0.715
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.604 ± 1.113
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 1.07 ± 1.206
CysThr: 0.0 ± 0.0
CysVal: 3.209 ± 1.892
CysTrp: 0.0 ± 0.0
CysTyr: 0.535 ± 0.42
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.952 ± 1.985
AspCys: 1.604 ± 0.64
AspAsp: 4.278 ± 2.117
AspGlu: 2.139 ± 1.464
AspPhe: 3.743 ± 2.283
AspGly: 4.813 ± 1.575
AspHis: 1.604 ± 0.789
AspIle: 3.743 ± 1.15
AspLys: 2.139 ± 0.939
AspLeu: 5.348 ± 1.91
AspMet: 1.07 ± 0.774
AspAsn: 2.674 ± 0.971
AspPro: 2.139 ± 0.756
AspGln: 1.604 ± 0.85
AspArg: 2.139 ± 0.528
AspSer: 4.813 ± 1.246
AspThr: 3.743 ± 1.885
AspVal: 4.813 ± 2.184
AspTrp: 1.604 ± 1.238
AspTyr: 2.674 ± 0.951
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.139 ± 0.528
GluCys: 1.07 ± 0.774
GluAsp: 2.139 ± 1.278
GluGlu: 2.674 ± 1.687
GluPhe: 2.139 ± 1.43
GluGly: 1.604 ± 0.698
GluHis: 2.674 ± 1.463
GluIle: 3.743 ± 1.579
GluLys: 2.674 ± 1.595
GluLeu: 3.743 ± 0.887
GluMet: 3.209 ± 1.064
GluAsn: 2.139 ± 1.43
GluPro: 0.0 ± 0.0
GluGln: 1.604 ± 0.934
GluArg: 4.813 ± 1.151
GluSer: 3.743 ± 0.883
GluThr: 2.674 ± 0.819
GluVal: 1.604 ± 0.915
GluTrp: 0.535 ± 0.845
GluTyr: 1.07 ± 1.02
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.139 ± 0.82
PheCys: 0.0 ± 0.0
PheAsp: 2.674 ± 0.999
PheGlu: 1.604 ± 0.64
PhePhe: 2.139 ± 0.791
PheGly: 4.278 ± 0.861
PheHis: 0.0 ± 0.0
PheIle: 2.139 ± 1.376
PheLys: 2.674 ± 0.784
PheLeu: 3.209 ± 1.104
PheMet: 1.07 ± 0.915
PheAsn: 1.07 ± 0.839
PhePro: 2.674 ± 1.088
PheGln: 1.604 ± 1.242
PheArg: 3.209 ± 1.468
PheSer: 3.209 ± 0.712
PheThr: 1.604 ± 0.667
PheVal: 3.743 ± 1.538
PheTrp: 0.535 ± 0.458
PheTyr: 0.535 ± 0.458
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.674 ± 0.789
GlyCys: 0.0 ± 0.0
GlyAsp: 4.278 ± 1.589
GlyGlu: 0.535 ± 0.845
GlyPhe: 2.674 ± 0.745
GlyGly: 4.278 ± 1.757
GlyHis: 0.0 ± 0.0
GlyIle: 5.348 ± 2.278
GlyLys: 4.278 ± 1.839
GlyLeu: 3.743 ± 1.35
GlyMet: 2.139 ± 0.712
GlyAsn: 3.743 ± 1.916
GlyPro: 0.0 ± 0.0
GlyGln: 2.674 ± 1.4
GlyArg: 5.348 ± 1.471
GlySer: 5.882 ± 2.483
GlyThr: 4.813 ± 1.493
GlyVal: 4.278 ± 1.583
GlyTrp: 1.07 ± 0.839
GlyTyr: 1.604 ± 1.021
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.604 ± 0.84
HisCys: 0.0 ± 0.0
HisAsp: 1.604 ± 1.004
HisGlu: 1.07 ± 0.713
HisPhe: 0.0 ± 0.0
HisGly: 1.604 ± 0.759
HisHis: 1.07 ± 0.914
HisIle: 1.07 ± 0.719
HisLys: 0.535 ± 0.706
HisLeu: 4.813 ± 1.137
HisMet: 1.07 ± 0.839
HisAsn: 0.535 ± 0.458
HisPro: 2.139 ± 1.212
HisGln: 1.604 ± 0.511
HisArg: 1.604 ± 0.759
HisSer: 0.535 ± 0.458
HisThr: 1.07 ± 0.915
HisVal: 1.604 ± 0.789
HisTrp: 0.535 ± 0.42
HisTyr: 1.07 ± 0.719
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.487 ± 2.278
IleCys: 1.604 ± 1.004
IleAsp: 4.278 ± 1.015
IleGlu: 2.139 ± 1.227
IlePhe: 1.07 ± 0.839
IleGly: 4.278 ± 1.924
IleHis: 1.604 ± 1.211
IleIle: 2.139 ± 0.82
IleLys: 2.139 ± 1.307
IleLeu: 2.139 ± 0.721
IleMet: 2.139 ± 1.213
IleAsn: 3.743 ± 1.02
IlePro: 2.674 ± 1.244
IleGln: 3.209 ± 1.94
IleArg: 3.209 ± 1.628
IleSer: 4.813 ± 1.172
IleThr: 3.743 ± 1.432
IleVal: 4.278 ± 1.869
IleTrp: 1.604 ± 1.021
IleTyr: 0.535 ± 0.458
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.743 ± 0.986
LysCys: 1.604 ± 0.915
LysAsp: 3.743 ± 2.497
LysGlu: 2.139 ± 0.939
LysPhe: 1.07 ± 0.811
LysGly: 1.604 ± 0.891
LysHis: 1.604 ± 0.64
LysIle: 1.604 ± 0.656
LysLys: 3.743 ± 1.797
LysLeu: 3.209 ± 0.79
LysMet: 4.813 ± 2.198
LysAsn: 2.674 ± 1.391
LysPro: 1.604 ± 0.915
LysGln: 2.139 ± 0.979
LysArg: 1.07 ± 0.686
LysSer: 1.604 ± 0.511
LysThr: 3.743 ± 1.944
LysVal: 4.813 ± 0.645
LysTrp: 1.07 ± 0.862
LysTyr: 2.674 ± 1.212
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.882 ± 1.727
LeuCys: 0.535 ± 0.42
LeuAsp: 7.487 ± 1.453
LeuGlu: 3.743 ± 1.166
LeuPhe: 2.674 ± 1.686
LeuGly: 3.209 ± 0.628
LeuHis: 3.209 ± 1.12
LeuIle: 3.743 ± 1.633
LeuLys: 5.348 ± 0.813
LeuLeu: 7.487 ± 2.315
LeuMet: 3.209 ± 1.366
LeuAsn: 3.743 ± 1.556
LeuPro: 3.743 ± 1.292
LeuGln: 4.278 ± 0.994
LeuArg: 5.348 ± 2.071
LeuSer: 6.417 ± 2.274
LeuThr: 5.882 ± 2.353
LeuVal: 4.278 ± 0.943
LeuTrp: 1.07 ± 0.839
LeuTyr: 2.674 ± 1.846
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.278 ± 2.198
MetCys: 0.0 ± 0.0
MetAsp: 1.07 ± 1.051
MetGlu: 3.209 ± 1.887
MetPhe: 1.604 ± 1.389
MetGly: 1.07 ± 0.915
MetHis: 0.535 ± 0.458
MetIle: 2.139 ± 0.914
MetLys: 2.139 ± 1.206
MetLeu: 3.743 ± 1.168
MetMet: 0.535 ± 0.458
MetAsn: 0.535 ± 0.616
MetPro: 1.07 ± 0.606
MetGln: 2.674 ± 0.971
MetArg: 1.604 ± 0.511
MetSer: 1.07 ± 0.639
MetThr: 2.139 ± 0.979
MetVal: 1.07 ± 0.839
MetTrp: 0.535 ± 0.42
MetTyr: 0.535 ± 0.603
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.743 ± 1.157
AsnCys: 1.07 ± 1.02
AsnAsp: 2.139 ± 0.793
AsnGlu: 1.604 ± 0.84
AsnPhe: 1.604 ± 0.64
AsnGly: 2.139 ± 1.134
AsnHis: 0.535 ± 0.706
AsnIle: 3.209 ± 1.892
AsnLys: 0.535 ± 0.525
AsnLeu: 3.743 ± 1.473
AsnMet: 1.07 ± 1.051
AsnAsn: 2.674 ± 1.244
AsnPro: 2.674 ± 0.651
AsnGln: 2.139 ± 1.53
AsnArg: 3.209 ± 1.272
AsnSer: 4.813 ± 1.438
AsnThr: 3.209 ± 1.963
AsnVal: 2.674 ± 1.244
AsnTrp: 0.535 ± 0.42
AsnTyr: 2.674 ± 0.984
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.209 ± 1.165
ProCys: 0.0 ± 0.0
ProAsp: 1.604 ± 1.013
ProGlu: 4.813 ± 1.501
ProPhe: 0.535 ± 0.458
ProGly: 1.604 ± 0.85
ProHis: 0.535 ± 0.458
ProIle: 2.674 ± 1.711
ProLys: 1.604 ± 1.173
ProLeu: 6.417 ± 2.063
ProMet: 0.535 ± 0.676
ProAsn: 1.604 ± 1.004
ProPro: 2.139 ± 1.344
ProGln: 0.0 ± 0.0
ProArg: 3.209 ± 1.451
ProSer: 1.604 ± 0.85
ProThr: 2.674 ± 1.178
ProVal: 4.278 ± 2.524
ProTrp: 1.07 ± 0.606
ProTyr: 1.07 ± 0.839
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.813 ± 1.021
GlnCys: 0.0 ± 0.0
GlnAsp: 1.07 ± 0.606
GlnGlu: 2.139 ± 0.712
GlnPhe: 0.535 ± 0.458
GlnGly: 1.07 ± 1.051
GlnHis: 1.604 ± 0.84
GlnIle: 3.209 ± 1.264
GlnLys: 4.278 ± 1.368
GlnLeu: 4.813 ± 1.048
GlnMet: 1.07 ± 0.606
GlnAsn: 2.674 ± 2.062
GlnPro: 1.604 ± 1.253
GlnGln: 2.674 ± 1.324
GlnArg: 1.604 ± 0.511
GlnSer: 4.278 ± 1.316
GlnThr: 3.209 ± 1.048
GlnVal: 2.674 ± 1.291
GlnTrp: 2.674 ± 0.951
GlnTyr: 1.604 ± 0.511
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.417 ± 1.334
ArgCys: 1.604 ± 0.877
ArgAsp: 5.348 ± 1.904
ArgGlu: 1.604 ± 0.937
ArgPhe: 3.743 ± 1.765
ArgGly: 1.604 ± 0.511
ArgHis: 1.07 ± 0.915
ArgIle: 3.209 ± 1.396
ArgLys: 3.209 ± 1.998
ArgLeu: 5.348 ± 1.948
ArgMet: 2.139 ± 0.876
ArgAsn: 2.139 ± 0.528
ArgPro: 2.139 ± 0.939
ArgGln: 4.813 ± 1.76
ArgArg: 2.139 ± 0.979
ArgSer: 3.209 ± 1.677
ArgThr: 3.743 ± 1.392
ArgVal: 2.139 ± 0.712
ArgTrp: 0.0 ± 0.0
ArgTyr: 5.348 ± 0.952
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.348 ± 2.59
SerCys: 0.0 ± 0.0
SerAsp: 4.813 ± 1.637
SerGlu: 2.139 ± 1.187
SerPhe: 4.813 ± 1.177
SerGly: 5.348 ± 2.098
SerHis: 1.604 ± 0.789
SerIle: 4.278 ± 1.277
SerLys: 1.604 ± 0.511
SerLeu: 3.743 ± 0.756
SerMet: 2.139 ± 1.242
SerAsn: 4.278 ± 1.582
SerPro: 2.674 ± 0.969
SerGln: 4.278 ± 1.751
SerArg: 5.348 ± 1.59
SerSer: 8.556 ± 2.261
SerThr: 1.604 ± 0.97
SerVal: 5.348 ± 1.371
SerTrp: 1.07 ± 0.914
SerTyr: 3.209 ± 1.171
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.743 ± 1.414
ThrCys: 0.0 ± 0.0
ThrAsp: 5.348 ± 1.207
ThrGlu: 2.674 ± 1.036
ThrPhe: 2.674 ± 0.879
ThrGly: 2.674 ± 1.285
ThrHis: 2.139 ± 0.685
ThrIle: 3.743 ± 0.87
ThrLys: 4.278 ± 1.494
ThrLeu: 5.348 ± 2.372
ThrMet: 0.535 ± 0.845
ThrAsn: 2.139 ± 1.863
ThrPro: 2.674 ± 0.966
ThrGln: 3.209 ± 1.04
ThrArg: 3.743 ± 2.239
ThrSer: 7.487 ± 3.169
ThrThr: 2.674 ± 1.244
ThrVal: 3.743 ± 1.262
ThrTrp: 0.535 ± 0.845
ThrTyr: 1.07 ± 0.915
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.348 ± 1.797
ValCys: 0.535 ± 0.603
ValAsp: 5.882 ± 1.995
ValGlu: 2.674 ± 2.654
ValPhe: 2.139 ± 1.363
ValGly: 4.813 ± 1.646
ValHis: 1.07 ± 0.715
ValIle: 6.417 ± 2.608
ValLys: 2.674 ± 1.595
ValLeu: 3.209 ± 1.617
ValMet: 2.139 ± 0.923
ValAsn: 3.209 ± 2.165
ValPro: 2.674 ± 1.187
ValGln: 1.604 ± 0.85
ValArg: 6.952 ± 2.712
ValSer: 1.604 ± 0.866
ValThr: 5.348 ± 1.881
ValVal: 2.674 ± 1.322
ValTrp: 0.535 ± 0.42
ValTyr: 4.278 ± 0.727
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.07 ± 0.606
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.535 ± 0.525
TrpPhe: 1.07 ± 0.686
TrpGly: 0.0 ± 0.0
TrpHis: 0.535 ± 0.42
TrpIle: 1.07 ± 0.708
TrpLys: 0.535 ± 0.706
TrpLeu: 2.674 ± 2.1
TrpMet: 0.535 ± 0.458
TrpAsn: 1.07 ± 0.606
TrpPro: 1.604 ± 1.259
TrpGln: 0.535 ± 0.42
TrpArg: 0.0 ± 0.0
TrpSer: 1.07 ± 0.489
TrpThr: 1.604 ± 0.64
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.604 ± 0.759
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.743 ± 1.368
TyrCys: 0.535 ± 0.651
TyrAsp: 1.604 ± 1.173
TyrGlu: 2.139 ± 1.835
TyrPhe: 2.139 ± 0.92
TyrGly: 3.209 ± 0.928
TyrHis: 1.07 ± 0.914
TyrIle: 2.139 ± 0.985
TyrLys: 0.535 ± 0.458
TyrLeu: 2.674 ± 0.951
TyrMet: 0.535 ± 0.42
TyrAsn: 2.674 ± 1.187
TyrPro: 2.674 ± 1.355
TyrGln: 3.743 ± 1.825
TyrArg: 3.209 ± 1.239
TyrSer: 0.535 ± 0.42
TyrThr: 1.07 ± 0.708
TyrVal: 3.209 ± 1.182
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.07 ± 0.927
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (1871 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
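A protein-level bootstrap of this kind might look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each resample, and the spread across resamples is reported. The function names, the fixed seed, the toy sequences and the use of the sample standard deviation as the error are all assumptions for illustration; the page does not state which estimator or normalisation was actually used.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_per_mille(proteins):
    """Pooled dipeptide frequencies in per mille (pairs stay within a protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_sd(proteins, n_rounds=100, seed=0):
    """Standard deviation of each dipeptide frequency over protein resamples.

    n_rounds must be at least 2, because stdev needs two or more values.
    """
    rng = random.Random(seed)
    pairs = sorted(dipeptide_per_mille(proteins))
    samples = {pair: [] for pair in pairs}
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resampled = rng.choices(proteins, k=len(proteins))
        freqs = dipeptide_per_mille(resampled)
        for pair in pairs:
            # A dipeptide absent from this resample contributes 0.
            samples[pair].append(freqs.get(pair, 0.0))
    return {pair: stdev(values) for pair, values in samples.items()}


# Hypothetical toy sequences (one-letter codes), not the Lilleven proteome.
toy_proteins = ["MKKLA", "MAKK", "MLAKK"]
toy_sd = bootstrap_sd(toy_proteins)

for pair, sd in sorted(toy_sd.items()):
    print(f"{pair}: ± {sd:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so with only 9 proteins a single long or unusual protein can dominate the estimate, which may explain why many errors in the table are large relative to the values.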

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski