Amino acid dipeptide frequency for Escherichia phage vB_EcoM_G5211

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
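As a rough illustration of what such a calculation can look like, here is a minimal Python sketch. It is not the code used to build this page. It assumes one-letter sequences, overlapping pairs counted within each protein, any non-standard letter mapped to X (Xaa), and normalization by the total number of pairs counted; the page does not state its own normalization. The function name `dipeptide_per_mille` is hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the number of pairs counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "MKKLA" gives MK, KK, KL, LA and "MKA" gives MK, KA (6 pairs in total).
freqs = dipeptide_per_mille(["MKKLA", "MKA"])
```

With this toy input, MK accounts for 2 of the 6 pairs, and all frequencies sum to 1000‰.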

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each one would occur with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
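The uniform expectation and the over/under labelling can be sketched in a few lines of Python. This is only an illustration: the helper names are hypothetical, and the page does not say whether its colouring uses the 400-pair or the 441-pair expectation, so the alphabet size is left as a parameter.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"   # would be shown in red
    if value_per_mille < expected:
        return "under"  # would be shown in blue
    return "expected"


print(expected_per_mille(20))  # 2.5 (i.e. 0.25%)
print(classify(4.742))         # AlaAla from the table: over
print(classify(0.273))         # CysCys from the table: under
```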

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
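A small Python sketch of the unit conversion follows. The second helper goes one step further and turns a table value back into an approximate raw count. That step assumes the denominator is the total of 51249 amino acids given in the statistics line of this page, which the page itself does not confirm. Both function names are hypothetical.

```python
TOTAL_AMINO_ACIDS = 51249  # taken from the statistics line of this page


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def approx_count(value, denominator=TOTAL_AMINO_ACIDS):
    """Approximate raw count behind a per mille value.

    Assumption: the frequency was normalized by `denominator`.
    """
    return round(per_mille_to_fraction(value) * denominator)


fraction = per_mille_to_fraction(4.742)  # AlaAla: about 0.004742
count = approx_count(4.742)              # about 243 occurrences under the assumption above
```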

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.742 ± 0.326
AlaCys: 0.839 ± 0.131
AlaAsp: 3.942 ± 0.297
AlaGlu: 4.41 ± 0.33
AlaPhe: 2.751 ± 0.233
AlaGly: 4.566 ± 0.401
AlaHis: 1.268 ± 0.18
AlaIle: 4.859 ± 0.335
AlaLys: 5.171 ± 0.407
AlaLeu: 5.483 ± 0.333
AlaMet: 2.381 ± 0.183
AlaAsn: 3.415 ± 0.276
AlaPro: 1.756 ± 0.152
AlaGln: 2.088 ± 0.254
AlaArg: 3.22 ± 0.292
AlaSer: 4.039 ± 0.319
AlaThr: 4.098 ± 0.455
AlaVal: 4.371 ± 0.26
AlaTrp: 0.878 ± 0.119
AlaTyr: 2.556 ± 0.206
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.839 ± 0.153
CysCys: 0.273 ± 0.075
CysAsp: 0.917 ± 0.139
CysGlu: 0.917 ± 0.171
CysPhe: 0.917 ± 0.128
CysGly: 1.054 ± 0.15
CysHis: 0.351 ± 0.073
CysIle: 0.624 ± 0.119
CysLys: 0.859 ± 0.119
CysLeu: 1.054 ± 0.132
CysMet: 0.332 ± 0.073
CysAsn: 0.741 ± 0.126
CysPro: 0.468 ± 0.102
CysGln: 0.293 ± 0.076
CysArg: 0.39 ± 0.091
CysSer: 0.839 ± 0.124
CysThr: 0.683 ± 0.124
CysVal: 1.034 ± 0.134
CysTrp: 0.195 ± 0.059
CysTyr: 0.332 ± 0.078
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.41 ± 0.331
AspCys: 0.8 ± 0.135
AspAsp: 4.195 ± 0.286
AspGlu: 4.82 ± 0.276
AspPhe: 3.22 ± 0.245
AspGly: 5.015 ± 0.311
AspHis: 1.327 ± 0.16
AspIle: 4.195 ± 0.288
AspLys: 4.527 ± 0.316
AspLeu: 5.444 ± 0.301
AspMet: 1.834 ± 0.177
AspAsn: 3.122 ± 0.247
AspPro: 2.849 ± 0.264
AspGln: 1.795 ± 0.173
AspArg: 3.688 ± 0.289
AspSer: 3.473 ± 0.232
AspThr: 3.083 ± 0.245
AspVal: 4.254 ± 0.278
AspTrp: 1.112 ± 0.148
AspTyr: 3.161 ± 0.223
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.093 ± 0.322
GluCys: 1.093 ± 0.166
GluAsp: 4.215 ± 0.289
GluGlu: 5.444 ± 0.37
GluPhe: 2.849 ± 0.228
GluGly: 4.429 ± 0.275
GluHis: 1.385 ± 0.173
GluIle: 5.99 ± 0.313
GluLys: 5.912 ± 0.372
GluLeu: 6.673 ± 0.359
GluMet: 1.912 ± 0.198
GluAsn: 4.02 ± 0.241
GluPro: 1.444 ± 0.181
GluGln: 2.42 ± 0.239
GluArg: 3.727 ± 0.31
GluSer: 4.117 ± 0.315
GluThr: 4.156 ± 0.308
GluVal: 4.586 ± 0.285
GluTrp: 1.151 ± 0.163
GluTyr: 3.688 ± 0.317
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.712 ± 0.263
PheCys: 0.683 ± 0.119
PheAsp: 3.707 ± 0.253
PheGlu: 3.786 ± 0.253
PhePhe: 1.424 ± 0.178
PheGly: 2.888 ± 0.27
PheHis: 0.8 ± 0.128
PheIle: 3.025 ± 0.253
PheLys: 3.512 ± 0.281
PheLeu: 2.712 ± 0.237
PheMet: 1.522 ± 0.178
PheAsn: 2.654 ± 0.17
PhePro: 1.054 ± 0.135
PheGln: 1.171 ± 0.132
PheArg: 2.107 ± 0.171
PheSer: 3.083 ± 0.237
PheThr: 2.927 ± 0.256
PheVal: 2.829 ± 0.28
PheTrp: 0.449 ± 0.089
PheTyr: 1.698 ± 0.175
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.098 ± 0.325
GlyCys: 0.8 ± 0.132
GlyAsp: 4.547 ± 0.391
GlyGlu: 4.332 ± 0.277
GlyPhe: 2.751 ± 0.228
GlyGly: 4.0 ± 0.563
GlyHis: 1.151 ± 0.171
GlyIle: 4.41 ± 0.284
GlyLys: 4.761 ± 0.323
GlyLeu: 4.878 ± 0.295
GlyMet: 1.893 ± 0.188
GlyAsn: 3.22 ± 0.357
GlyPro: 0.781 ± 0.112
GlyGln: 2.068 ± 0.226
GlyArg: 3.044 ± 0.205
GlySer: 3.493 ± 0.339
GlyThr: 3.356 ± 0.33
GlyVal: 4.995 ± 0.34
GlyTrp: 0.859 ± 0.123
GlyTyr: 2.927 ± 0.241
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.19 ± 0.153
HisCys: 0.234 ± 0.066
HisAsp: 1.112 ± 0.152
HisGlu: 1.346 ± 0.182
HisPhe: 0.937 ± 0.115
HisGly: 1.737 ± 0.174
HisHis: 0.585 ± 0.1
HisIle: 1.288 ± 0.167
HisLys: 1.21 ± 0.172
HisLeu: 1.444 ± 0.164
HisMet: 0.488 ± 0.101
HisAsn: 1.268 ± 0.16
HisPro: 0.878 ± 0.129
HisGln: 0.527 ± 0.101
HisArg: 0.917 ± 0.108
HisSer: 1.288 ± 0.156
HisThr: 1.19 ± 0.142
HisVal: 1.366 ± 0.155
HisTrp: 0.254 ± 0.065
HisTyr: 1.034 ± 0.17
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.405 ± 0.368
IleCys: 0.8 ± 0.127
IleAsp: 5.522 ± 0.315
IleGlu: 5.405 ± 0.357
IlePhe: 2.205 ± 0.221
IleGly: 3.825 ± 0.278
IleHis: 1.307 ± 0.157
IleIle: 4.468 ± 0.295
IleLys: 5.522 ± 0.325
IleLeu: 4.215 ± 0.269
IleMet: 2.361 ± 0.224
IleAsn: 4.078 ± 0.263
IlePro: 2.478 ± 0.228
IleGln: 2.4 ± 0.22
IleArg: 3.376 ± 0.209
IleSer: 3.649 ± 0.285
IleThr: 4.703 ± 0.291
IleVal: 4.937 ± 0.288
IleTrp: 0.585 ± 0.116
IleTyr: 2.361 ± 0.185
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.347 ± 0.468
LysCys: 0.917 ± 0.159
LysAsp: 4.8 ± 0.374
LysGlu: 6.478 ± 0.404
LysPhe: 3.454 ± 0.294
LysGly: 3.981 ± 0.254
LysHis: 1.737 ± 0.191
LysIle: 5.464 ± 0.299
LysLys: 5.151 ± 0.355
LysLeu: 5.834 ± 0.345
LysMet: 2.868 ± 0.245
LysAsn: 3.844 ± 0.287
LysPro: 2.79 ± 0.252
LysGln: 2.927 ± 0.249
LysArg: 3.766 ± 0.307
LysSer: 3.571 ± 0.262
LysThr: 4.586 ± 0.323
LysVal: 5.151 ± 0.263
LysTrp: 1.054 ± 0.148
LysTyr: 3.025 ± 0.29
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.464 ± 0.3
LeuCys: 1.073 ± 0.163
LeuAsp: 5.093 ± 0.362
LeuGlu: 5.678 ± 0.336
LeuPhe: 3.064 ± 0.228
LeuGly: 3.883 ± 0.249
LeuHis: 1.19 ± 0.145
LeuIle: 4.683 ± 0.339
LeuLys: 6.205 ± 0.366
LeuLeu: 4.41 ± 0.347
LeuMet: 2.556 ± 0.241
LeuAsn: 3.805 ± 0.281
LeuPro: 2.868 ± 0.236
LeuGln: 2.264 ± 0.203
LeuArg: 3.864 ± 0.259
LeuSer: 4.273 ± 0.311
LeuThr: 4.273 ± 0.303
LeuVal: 4.39 ± 0.324
LeuTrp: 0.722 ± 0.11
LeuTyr: 3.298 ± 0.256
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.854 ± 0.206
MetCys: 0.449 ± 0.097
MetAsp: 1.522 ± 0.182
MetGlu: 1.698 ± 0.178
MetPhe: 1.639 ± 0.164
MetGly: 1.581 ± 0.177
MetHis: 0.624 ± 0.097
MetIle: 2.537 ± 0.26
MetLys: 2.907 ± 0.234
MetLeu: 2.361 ± 0.207
MetMet: 1.073 ± 0.151
MetAsn: 1.424 ± 0.152
MetPro: 0.839 ± 0.12
MetGln: 1.19 ± 0.13
MetArg: 1.288 ± 0.158
MetSer: 1.932 ± 0.2
MetThr: 1.756 ± 0.19
MetVal: 1.951 ± 0.165
MetTrp: 0.488 ± 0.108
MetTyr: 0.878 ± 0.113
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.707 ± 0.254
AsnCys: 0.663 ± 0.118
AsnAsp: 3.181 ± 0.26
AsnGlu: 3.395 ± 0.259
AsnPhe: 2.4 ± 0.183
AsnGly: 4.0 ± 0.309
AsnHis: 1.229 ± 0.146
AsnIle: 3.825 ± 0.277
AsnLys: 3.707 ± 0.286
AsnLeu: 3.629 ± 0.285
AsnMet: 1.424 ± 0.178
AsnAsn: 2.732 ± 0.274
AsnPro: 2.478 ± 0.171
AsnGln: 1.463 ± 0.188
AsnArg: 2.478 ± 0.22
AsnSer: 2.634 ± 0.223
AsnThr: 2.81 ± 0.239
AsnVal: 3.961 ± 0.315
AsnTrp: 0.566 ± 0.102
AsnTyr: 2.107 ± 0.193
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.107 ± 0.208
ProCys: 0.41 ± 0.086
ProAsp: 2.576 ± 0.219
ProGlu: 3.103 ± 0.222
ProPhe: 1.678 ± 0.196
ProGly: 1.015 ± 0.143
ProHis: 0.82 ± 0.124
ProIle: 2.088 ± 0.218
ProLys: 2.615 ± 0.258
ProLeu: 1.971 ± 0.194
ProMet: 0.683 ± 0.112
ProAsn: 1.561 ± 0.137
ProPro: 0.82 ± 0.135
ProGln: 0.995 ± 0.128
ProArg: 1.424 ± 0.172
ProSer: 2.107 ± 0.17
ProThr: 2.224 ± 0.232
ProVal: 2.478 ± 0.211
ProTrp: 0.507 ± 0.106
ProTyr: 1.424 ± 0.154
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.834 ± 0.177
GlnCys: 0.351 ± 0.08
GlnAsp: 1.483 ± 0.161
GlnGlu: 2.322 ± 0.195
GlnPhe: 1.581 ± 0.196
GlnGly: 1.522 ± 0.191
GlnHis: 0.624 ± 0.103
GlnIle: 2.439 ± 0.228
GlnLys: 2.342 ± 0.237
GlnLeu: 2.595 ± 0.247
GlnMet: 0.956 ± 0.128
GlnAsn: 1.678 ± 0.221
GlnPro: 1.093 ± 0.154
GlnGln: 0.859 ± 0.138
GlnArg: 1.932 ± 0.188
GlnSer: 1.444 ± 0.171
GlnThr: 1.717 ± 0.181
GlnVal: 2.01 ± 0.193
GlnTrp: 0.722 ± 0.112
GlnTyr: 1.522 ± 0.173
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.478 ± 0.197
ArgCys: 0.741 ± 0.142
ArgAsp: 3.454 ± 0.209
ArgGlu: 3.981 ± 0.284
ArgPhe: 2.439 ± 0.21
ArgGly: 3.161 ± 0.239
ArgHis: 0.976 ± 0.124
ArgIle: 3.2 ± 0.23
ArgLys: 4.195 ± 0.318
ArgLeu: 3.649 ± 0.285
ArgMet: 1.171 ± 0.168
ArgAsn: 2.4 ± 0.219
ArgPro: 1.678 ± 0.163
ArgGln: 1.463 ± 0.145
ArgArg: 2.459 ± 0.205
ArgSer: 2.42 ± 0.204
ArgThr: 2.088 ± 0.217
ArgVal: 3.571 ± 0.254
ArgTrp: 0.82 ± 0.117
ArgTyr: 2.342 ± 0.214
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.395 ± 0.259
SerCys: 0.741 ± 0.138
SerAsp: 3.59 ± 0.203
SerGlu: 3.883 ± 0.296
SerPhe: 2.985 ± 0.245
SerGly: 4.02 ± 0.318
SerHis: 1.073 ± 0.114
SerIle: 4.0 ± 0.322
SerLys: 4.234 ± 0.287
SerLeu: 3.883 ± 0.26
SerMet: 1.698 ± 0.146
SerAsn: 2.771 ± 0.262
SerPro: 1.854 ± 0.205
SerGln: 1.581 ± 0.16
SerArg: 3.083 ± 0.208
SerSer: 3.064 ± 0.295
SerThr: 2.966 ± 0.295
SerVal: 4.0 ± 0.296
SerTrp: 0.859 ± 0.112
SerTyr: 1.756 ± 0.168
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.117 ± 0.322
ThrCys: 0.702 ± 0.139
ThrAsp: 3.454 ± 0.244
ThrGlu: 3.922 ± 0.292
ThrPhe: 2.498 ± 0.205
ThrGly: 3.942 ± 0.355
ThrHis: 1.307 ± 0.187
ThrIle: 3.766 ± 0.31
ThrLys: 4.468 ± 0.269
ThrLeu: 4.39 ± 0.31
ThrMet: 1.405 ± 0.163
ThrAsn: 2.634 ± 0.252
ThrPro: 3.044 ± 0.302
ThrGln: 2.01 ± 0.278
ThrArg: 2.439 ± 0.222
ThrSer: 2.654 ± 0.216
ThrThr: 3.064 ± 0.299
ThrVal: 4.273 ± 0.378
ThrTrp: 0.741 ± 0.123
ThrTyr: 2.478 ± 0.197
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.215 ± 0.311
ValCys: 0.605 ± 0.108
ValAsp: 5.132 ± 0.297
ValGlu: 5.756 ± 0.404
ValPhe: 3.298 ± 0.264
ValGly: 4.059 ± 0.306
ValHis: 1.21 ± 0.164
ValIle: 4.644 ± 0.315
ValLys: 5.62 ± 0.326
ValLeu: 4.527 ± 0.283
ValMet: 1.815 ± 0.167
ValAsn: 3.649 ± 0.268
ValPro: 1.912 ± 0.173
ValGln: 1.932 ± 0.204
ValArg: 2.927 ± 0.21
ValSer: 4.098 ± 0.283
ValThr: 3.942 ± 0.276
ValVal: 4.839 ± 0.385
ValTrp: 1.015 ± 0.153
ValTyr: 3.473 ± 0.28
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.644 ± 0.111
TrpCys: 0.293 ± 0.087
TrpAsp: 0.859 ± 0.127
TrpGlu: 1.151 ± 0.155
TrpPhe: 0.585 ± 0.108
TrpGly: 0.878 ± 0.152
TrpHis: 0.332 ± 0.077
TrpIle: 0.859 ± 0.126
TrpLys: 0.937 ± 0.132
TrpLeu: 0.976 ± 0.14
TrpMet: 0.566 ± 0.095
TrpAsn: 0.702 ± 0.104
TrpPro: 0.195 ± 0.057
TrpGln: 0.351 ± 0.084
TrpArg: 0.722 ± 0.12
TrpSer: 0.82 ± 0.114
TrpThr: 0.898 ± 0.143
TrpVal: 1.073 ± 0.158
TrpTrp: 0.195 ± 0.061
TrpTyr: 0.702 ± 0.107
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.181 ± 0.208
TyrCys: 0.644 ± 0.109
TyrAsp: 3.025 ± 0.228
TyrGlu: 2.478 ± 0.222
TyrPhe: 2.068 ± 0.206
TyrGly: 2.81 ± 0.224
TyrHis: 0.995 ± 0.149
TyrIle: 3.239 ± 0.248
TyrLys: 2.907 ± 0.258
TyrLeu: 2.927 ± 0.243
TyrMet: 1.034 ± 0.12
TyrAsn: 2.673 ± 0.236
TyrPro: 1.327 ± 0.158
TyrGln: 1.229 ± 0.157
TyrArg: 1.834 ± 0.177
TyrSer: 2.439 ± 0.216
TyrThr: 2.79 ± 0.219
TyrVal: 2.595 ± 0.179
TyrTrp: 0.566 ± 0.121
TyrTyr: 2.01 ± 0.199
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 270 proteins (51249 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
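A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the code behind this page. It resamples whole proteins with replacement, recomputes the frequency of one dipeptide for each replicate, and reports the standard deviation across replicates as the error. The function names, the fixed seed, the one-letter sequences, and the normalization by pair count are all assumptions.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy_proteome = ["MKMK", "AAAA", "MKAA", "GGGG"]
error = bootstrap_error(toy_proteome, "MK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins.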

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski