Amino acid dipepetide frequency for Erwinia phage vB_EamM_Huxley

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.465AlaAla: 7.465 ± 0.477
0.749AlaCys: 0.749 ± 0.101
4.863AlaAsp: 4.863 ± 0.313
5.139AlaGlu: 5.139 ± 0.287
3.444AlaPhe: 3.444 ± 0.213
5.481AlaGly: 5.481 ± 0.369
1.406AlaHis: 1.406 ± 0.139
5.349AlaIle: 5.349 ± 0.259
4.324AlaLys: 4.324 ± 0.356
7.36AlaLeu: 7.36 ± 0.375
2.51AlaMet: 2.51 ± 0.183
3.667AlaAsn: 3.667 ± 0.258
2.931AlaPro: 2.931 ± 0.229
2.918AlaGln: 2.918 ± 0.216
3.72AlaArg: 3.72 ± 0.235
4.285AlaSer: 4.285 ± 0.308
4.797AlaThr: 4.797 ± 0.315
5.454AlaVal: 5.454 ± 0.29
0.841AlaTrp: 0.841 ± 0.115
3.168AlaTyr: 3.168 ± 0.225
0.0AlaXaa: 0.0 ± 0.0
Cys
0.644CysAla: 0.644 ± 0.105
0.131CysCys: 0.131 ± 0.048
0.499CysAsp: 0.499 ± 0.086
0.591CysGlu: 0.591 ± 0.085
0.276CysPhe: 0.276 ± 0.062
0.683CysGly: 0.683 ± 0.099
0.276CysHis: 0.276 ± 0.064
0.407CysIle: 0.407 ± 0.089
0.486CysLys: 0.486 ± 0.077
0.762CysLeu: 0.762 ± 0.104
0.25CysMet: 0.25 ± 0.066
0.421CysAsn: 0.421 ± 0.081
0.46CysPro: 0.46 ± 0.074
0.394CysGln: 0.394 ± 0.071
0.618CysArg: 0.618 ± 0.092
0.486CysSer: 0.486 ± 0.081
0.644CysThr: 0.644 ± 0.111
0.71CysVal: 0.71 ± 0.107
0.105CysTrp: 0.105 ± 0.032
0.342CysTyr: 0.342 ± 0.078
0.0CysXaa: 0.0 ± 0.0
Asp
5.231AspAla: 5.231 ± 0.282
0.539AspCys: 0.539 ± 0.092
3.943AspAsp: 3.943 ± 0.264
4.258AspGlu: 4.258 ± 0.236
2.76AspPhe: 2.76 ± 0.173
4.718AspGly: 4.718 ± 0.257
1.183AspHis: 1.183 ± 0.14
3.917AspIle: 3.917 ± 0.216
3.483AspLys: 3.483 ± 0.223
5.428AspLeu: 5.428 ± 0.278
1.84AspMet: 1.84 ± 0.149
2.905AspAsn: 2.905 ± 0.231
3.023AspPro: 3.023 ± 0.209
1.603AspGln: 1.603 ± 0.143
3.352AspArg: 3.352 ± 0.215
2.997AspSer: 2.997 ± 0.193
3.969AspThr: 3.969 ± 0.261
5.047AspVal: 5.047 ± 0.242
1.143AspTrp: 1.143 ± 0.124
2.431AspTyr: 2.431 ± 0.185
0.0AspXaa: 0.0 ± 0.0
Glu
4.508GluAla: 4.508 ± 0.275
0.539GluCys: 0.539 ± 0.093
3.667GluAsp: 3.667 ± 0.299
4.088GluGlu: 4.088 ± 0.294
2.681GluPhe: 2.681 ± 0.199
3.444GluGly: 3.444 ± 0.193
1.59GluHis: 1.59 ± 0.188
3.536GluIle: 3.536 ± 0.234
3.273GluLys: 3.273 ± 0.209
6.992GluLeu: 6.992 ± 0.302
2.077GluMet: 2.077 ± 0.169
2.839GluAsn: 2.839 ± 0.209
2.155GluPro: 2.155 ± 0.19
2.287GluGln: 2.287 ± 0.217
3.509GluArg: 3.509 ± 0.219
3.43GluSer: 3.43 ± 0.194
3.693GluThr: 3.693 ± 0.214
4.101GluVal: 4.101 ± 0.242
0.959GluTrp: 0.959 ± 0.105
2.418GluTyr: 2.418 ± 0.193
0.0GluXaa: 0.0 ± 0.0
Phe
2.734PheAla: 2.734 ± 0.197
0.302PheCys: 0.302 ± 0.076
3.089PheAsp: 3.089 ± 0.197
2.431PheGlu: 2.431 ± 0.208
1.656PhePhe: 1.656 ± 0.147
2.892PheGly: 2.892 ± 0.207
0.841PheHis: 0.841 ± 0.112
2.142PheIle: 2.142 ± 0.175
1.958PheLys: 1.958 ± 0.172
2.642PheLeu: 2.642 ± 0.192
1.235PheMet: 1.235 ± 0.117
2.655PheAsn: 2.655 ± 0.164
1.787PhePro: 1.787 ± 0.163
1.196PheGln: 1.196 ± 0.136
2.063PheArg: 2.063 ± 0.171
2.602PheSer: 2.602 ± 0.178
3.43PheThr: 3.43 ± 0.244
2.405PheVal: 2.405 ± 0.177
0.486PheTrp: 0.486 ± 0.079
1.787PheTyr: 1.787 ± 0.158
0.0PheXaa: 0.0 ± 0.0
Gly
4.127GlyAla: 4.127 ± 0.344
0.697GlyCys: 0.697 ± 0.1
3.759GlyAsp: 3.759 ± 0.239
4.232GlyGlu: 4.232 ± 0.207
2.694GlyPhe: 2.694 ± 0.184
4.285GlyGly: 4.285 ± 0.506
1.078GlyHis: 1.078 ± 0.12
3.851GlyIle: 3.851 ± 0.222
4.469GlyLys: 4.469 ± 0.282
5.52GlyLeu: 5.52 ± 0.262
1.971GlyMet: 1.971 ± 0.149
3.404GlyAsn: 3.404 ± 0.227
1.958GlyPro: 1.958 ± 0.213
2.458GlyGln: 2.458 ± 0.219
3.536GlyArg: 3.536 ± 0.225
4.193GlySer: 4.193 ± 0.251
4.206GlyThr: 4.206 ± 0.283
5.244GlyVal: 5.244 ± 0.328
1.235GlyTrp: 1.235 ± 0.125
2.721GlyTyr: 2.721 ± 0.19
0.0GlyXaa: 0.0 ± 0.0
His
1.59HisAla: 1.59 ± 0.152
0.276HisCys: 0.276 ± 0.051
1.327HisAsp: 1.327 ± 0.111
1.38HisGlu: 1.38 ± 0.141
0.894HisPhe: 0.894 ± 0.112
0.959HisGly: 0.959 ± 0.111
0.513HisHis: 0.513 ± 0.099
1.354HisIle: 1.354 ± 0.151
1.078HisLys: 1.078 ± 0.133
1.879HisLeu: 1.879 ± 0.162
0.499HisMet: 0.499 ± 0.083
0.854HisAsn: 0.854 ± 0.102
1.314HisPro: 1.314 ± 0.149
0.697HisGln: 0.697 ± 0.098
1.183HisArg: 1.183 ± 0.138
0.907HisSer: 0.907 ± 0.102
1.183HisThr: 1.183 ± 0.123
1.262HisVal: 1.262 ± 0.128
0.263HisTrp: 0.263 ± 0.055
0.959HisTyr: 0.959 ± 0.121
0.0HisXaa: 0.0 ± 0.0
Ile
4.377IleAla: 4.377 ± 0.243
0.67IleCys: 0.67 ± 0.097
4.022IleAsp: 4.022 ± 0.26
3.838IleGlu: 3.838 ± 0.254
1.656IlePhe: 1.656 ± 0.157
3.444IleGly: 3.444 ± 0.231
1.367IleHis: 1.367 ± 0.127
2.642IleIle: 2.642 ± 0.219
2.839IleLys: 2.839 ± 0.211
3.969IleLeu: 3.969 ± 0.227
1.275IleMet: 1.275 ± 0.142
3.154IleAsn: 3.154 ± 0.194
2.839IlePro: 2.839 ± 0.202
1.761IleGln: 1.761 ± 0.158
3.522IleArg: 3.522 ± 0.202
3.246IleSer: 3.246 ± 0.223
3.706IleThr: 3.706 ± 0.299
3.877IleVal: 3.877 ± 0.247
0.499IleTrp: 0.499 ± 0.091
2.116IleTyr: 2.116 ± 0.157
0.0IleXaa: 0.0 ± 0.0
Lys
4.548LysAla: 4.548 ± 0.261
0.302LysCys: 0.302 ± 0.07
3.141LysAsp: 3.141 ± 0.24
3.404LysGlu: 3.404 ± 0.24
2.287LysPhe: 2.287 ± 0.229
3.168LysGly: 3.168 ± 0.286
1.091LysHis: 1.091 ± 0.093
2.813LysIle: 2.813 ± 0.186
3.01LysLys: 3.01 ± 0.232
5.481LysLeu: 5.481 ± 0.275
1.669LysMet: 1.669 ± 0.158
2.563LysAsn: 2.563 ± 0.204
2.418LysPro: 2.418 ± 0.201
2.287LysGln: 2.287 ± 0.194
3.076LysArg: 3.076 ± 0.226
2.602LysSer: 2.602 ± 0.221
3.457LysThr: 3.457 ± 0.204
3.943LysVal: 3.943 ± 0.26
0.697LysTrp: 0.697 ± 0.099
1.827LysTyr: 1.827 ± 0.154
0.0LysXaa: 0.0 ± 0.0
Leu
7.61LeuAla: 7.61 ± 0.354
0.775LeuCys: 0.775 ± 0.095
6.125LeuAsp: 6.125 ± 0.296
5.349LeuGlu: 5.349 ± 0.276
3.483LeuPhe: 3.483 ± 0.199
5.139LeuGly: 5.139 ± 0.297
1.735LeuHis: 1.735 ± 0.165
4.18LeuIle: 4.18 ± 0.24
4.771LeuLys: 4.771 ± 0.235
7.86LeuLeu: 7.86 ± 0.376
2.182LeuMet: 2.182 ± 0.167
5.086LeuAsn: 5.086 ± 0.225
4.902LeuPro: 4.902 ± 0.234
3.181LeuGln: 3.181 ± 0.207
5.297LeuArg: 5.297 ± 0.228
6.02LeuSer: 6.02 ± 0.287
6.322LeuThr: 6.322 ± 0.309
5.415LeuVal: 5.415 ± 0.258
0.999LeuTrp: 0.999 ± 0.12
3.338LeuTyr: 3.338 ± 0.226
0.0LeuXaa: 0.0 ± 0.0
Met
2.471MetAla: 2.471 ± 0.174
0.237MetCys: 0.237 ± 0.059
1.735MetAsp: 1.735 ± 0.167
1.59MetGlu: 1.59 ± 0.15
1.301MetPhe: 1.301 ± 0.14
1.827MetGly: 1.827 ± 0.193
0.526MetHis: 0.526 ± 0.084
1.17MetIle: 1.17 ± 0.132
1.511MetLys: 1.511 ± 0.173
2.681MetLeu: 2.681 ± 0.185
0.894MetMet: 0.894 ± 0.117
1.367MetAsn: 1.367 ± 0.158
1.104MetPro: 1.104 ± 0.109
0.854MetGln: 0.854 ± 0.118
1.893MetArg: 1.893 ± 0.168
2.077MetSer: 2.077 ± 0.175
1.735MetThr: 1.735 ± 0.156
2.063MetVal: 2.063 ± 0.158
0.302MetTrp: 0.302 ± 0.067
0.907MetTyr: 0.907 ± 0.111
0.0MetXaa: 0.0 ± 0.0
Asn
4.548AsnAla: 4.548 ± 0.259
0.421AsnCys: 0.421 ± 0.075
2.905AsnAsp: 2.905 ± 0.159
2.865AsnGlu: 2.865 ± 0.178
1.906AsnPhe: 1.906 ± 0.136
4.442AsnGly: 4.442 ± 0.317
1.012AsnHis: 1.012 ± 0.115
2.589AsnIle: 2.589 ± 0.208
2.247AsnLys: 2.247 ± 0.19
3.825AsnLeu: 3.825 ± 0.225
1.143AsnMet: 1.143 ± 0.118
2.326AsnAsn: 2.326 ± 0.164
2.747AsnPro: 2.747 ± 0.234
1.643AsnGln: 1.643 ± 0.167
2.97AsnArg: 2.97 ± 0.163
2.839AsnSer: 2.839 ± 0.19
3.246AsnThr: 3.246 ± 0.198
3.706AsnVal: 3.706 ± 0.233
0.789AsnTrp: 0.789 ± 0.099
1.906AsnTyr: 1.906 ± 0.177
0.0AsnXaa: 0.0 ± 0.0
Pro
3.509ProAla: 3.509 ± 0.27
0.355ProCys: 0.355 ± 0.072
3.444ProAsp: 3.444 ± 0.271
3.181ProGlu: 3.181 ± 0.263
1.617ProPhe: 1.617 ± 0.164
2.668ProGly: 2.668 ± 0.218
0.867ProHis: 0.867 ± 0.11
2.523ProIle: 2.523 ± 0.199
2.366ProLys: 2.366 ± 0.19
3.746ProLeu: 3.746 ± 0.258
1.354ProMet: 1.354 ± 0.154
2.208ProAsn: 2.208 ± 0.184
1.682ProPro: 1.682 ± 0.173
1.709ProGln: 1.709 ± 0.133
1.748ProArg: 1.748 ± 0.18
2.353ProSer: 2.353 ± 0.171
3.365ProThr: 3.365 ± 0.194
3.89ProVal: 3.89 ± 0.217
0.407ProTrp: 0.407 ± 0.071
1.63ProTyr: 1.63 ± 0.151
0.0ProXaa: 0.0 ± 0.0
Gln
3.076GlnAla: 3.076 ± 0.212
0.315GlnCys: 0.315 ± 0.075
1.682GlnAsp: 1.682 ± 0.153
1.945GlnGlu: 1.945 ± 0.193
1.59GlnPhe: 1.59 ± 0.129
2.116GlnGly: 2.116 ± 0.16
0.697GlnHis: 0.697 ± 0.08
1.735GlnIle: 1.735 ± 0.169
1.682GlnLys: 1.682 ± 0.143
3.917GlnLeu: 3.917 ± 0.244
0.894GlnMet: 0.894 ± 0.101
1.617GlnAsn: 1.617 ± 0.141
1.525GlnPro: 1.525 ± 0.148
1.577GlnGln: 1.577 ± 0.154
2.234GlnArg: 2.234 ± 0.192
2.405GlnSer: 2.405 ± 0.185
2.129GlnThr: 2.129 ± 0.136
2.116GlnVal: 2.116 ± 0.177
0.526GlnTrp: 0.526 ± 0.088
1.551GlnTyr: 1.551 ± 0.132
0.0GlnXaa: 0.0 ± 0.0
Arg
3.628ArgAla: 3.628 ± 0.237
0.683ArgCys: 0.683 ± 0.097
3.26ArgAsp: 3.26 ± 0.219
3.404ArgGlu: 3.404 ± 0.244
2.471ArgPhe: 2.471 ± 0.176
3.614ArgGly: 3.614 ± 0.23
1.196ArgHis: 1.196 ± 0.153
3.391ArgIle: 3.391 ± 0.236
3.246ArgLys: 3.246 ± 0.218
4.981ArgLeu: 4.981 ± 0.246
1.998ArgMet: 1.998 ± 0.179
3.01ArgAsn: 3.01 ± 0.193
1.866ArgPro: 1.866 ± 0.159
2.234ArgGln: 2.234 ± 0.205
3.404ArgArg: 3.404 ± 0.299
3.036ArgSer: 3.036 ± 0.202
2.957ArgThr: 2.957 ± 0.21
3.693ArgVal: 3.693 ± 0.216
0.986ArgTrp: 0.986 ± 0.116
2.458ArgTyr: 2.458 ± 0.17
0.0ArgXaa: 0.0 ± 0.0
Ser
4.074SerAla: 4.074 ± 0.278
0.46SerCys: 0.46 ± 0.098
3.588SerAsp: 3.588 ± 0.212
3.26SerGlu: 3.26 ± 0.233
2.576SerPhe: 2.576 ± 0.182
4.245SerGly: 4.245 ± 0.302
0.959SerHis: 0.959 ± 0.125
3.43SerIle: 3.43 ± 0.219
3.273SerLys: 3.273 ± 0.19
5.56SerLeu: 5.56 ± 0.319
1.748SerMet: 1.748 ± 0.158
2.615SerAsn: 2.615 ± 0.204
2.668SerPro: 2.668 ± 0.177
2.024SerGln: 2.024 ± 0.141
2.826SerArg: 2.826 ± 0.196
3.444SerSer: 3.444 ± 0.277
3.759SerThr: 3.759 ± 0.238
4.206SerVal: 4.206 ± 0.276
0.92SerTrp: 0.92 ± 0.115
2.247SerTyr: 2.247 ± 0.161
0.0SerXaa: 0.0 ± 0.0
Thr
5.796ThrAla: 5.796 ± 0.258
0.381ThrCys: 0.381 ± 0.073
4.061ThrAsp: 4.061 ± 0.221
3.522ThrGlu: 3.522 ± 0.22
2.629ThrPhe: 2.629 ± 0.207
4.416ThrGly: 4.416 ± 0.238
1.262ThrHis: 1.262 ± 0.149
3.457ThrIle: 3.457 ± 0.213
2.984ThrLys: 2.984 ± 0.182
6.598ThrLeu: 6.598 ± 0.342
1.603ThrMet: 1.603 ± 0.171
3.089ThrAsn: 3.089 ± 0.238
3.575ThrPro: 3.575 ± 0.275
1.958ThrGln: 1.958 ± 0.184
3.207ThrArg: 3.207 ± 0.182
3.444ThrSer: 3.444 ± 0.208
4.508ThrThr: 4.508 ± 0.374
4.889ThrVal: 4.889 ± 0.33
1.012ThrTrp: 1.012 ± 0.103
2.247ThrTyr: 2.247 ± 0.162
0.0ThrXaa: 0.0 ± 0.0
Val
6.164ValAla: 6.164 ± 0.258
0.802ValCys: 0.802 ± 0.096
4.784ValAsp: 4.784 ± 0.298
4.784ValGlu: 4.784 ± 0.251
2.405ValPhe: 2.405 ± 0.153
4.469ValGly: 4.469 ± 0.214
1.209ValHis: 1.209 ± 0.135
3.772ValIle: 3.772 ± 0.22
4.232ValLys: 4.232 ± 0.267
5.704ValLeu: 5.704 ± 0.254
1.748ValMet: 1.748 ± 0.16
3.509ValAsn: 3.509 ± 0.24
3.207ValPro: 3.207 ± 0.251
2.326ValGln: 2.326 ± 0.188
3.93ValArg: 3.93 ± 0.205
4.364ValSer: 4.364 ± 0.221
4.456ValThr: 4.456 ± 0.242
5.441ValVal: 5.441 ± 0.302
0.946ValTrp: 0.946 ± 0.114
2.984ValTyr: 2.984 ± 0.237
0.0ValXaa: 0.0 ± 0.0
Trp
1.065TrpAla: 1.065 ± 0.126
0.092TrpCys: 0.092 ± 0.037
1.091TrpAsp: 1.091 ± 0.119
0.644TrpGlu: 0.644 ± 0.101
0.552TrpPhe: 0.552 ± 0.103
0.683TrpGly: 0.683 ± 0.1
0.381TrpHis: 0.381 ± 0.071
0.644TrpIle: 0.644 ± 0.093
0.828TrpLys: 0.828 ± 0.099
1.196TrpLeu: 1.196 ± 0.128
0.46TrpMet: 0.46 ± 0.079
0.644TrpAsn: 0.644 ± 0.083
0.578TrpPro: 0.578 ± 0.094
0.526TrpGln: 0.526 ± 0.079
0.946TrpArg: 0.946 ± 0.121
0.907TrpSer: 0.907 ± 0.094
0.631TrpThr: 0.631 ± 0.083
1.235TrpVal: 1.235 ± 0.139
0.315TrpTrp: 0.315 ± 0.075
0.473TrpTyr: 0.473 ± 0.099
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.707TyrAla: 2.707 ± 0.189
0.46TyrCys: 0.46 ± 0.078
2.786TyrAsp: 2.786 ± 0.199
1.879TyrGlu: 1.879 ± 0.137
1.459TyrPhe: 1.459 ± 0.145
2.944TyrGly: 2.944 ± 0.224
1.183TyrHis: 1.183 ± 0.154
1.919TyrIle: 1.919 ± 0.166
1.722TyrLys: 1.722 ± 0.149
3.68TyrLeu: 3.68 ± 0.21
0.881TyrMet: 0.881 ± 0.099
2.142TyrAsn: 2.142 ± 0.184
1.879TyrPro: 1.879 ± 0.147
1.695TyrGln: 1.695 ± 0.173
2.484TyrArg: 2.484 ± 0.202
2.247TyrSer: 2.247 ± 0.2
2.445TyrThr: 2.445 ± 0.186
2.537TyrVal: 2.537 ± 0.209
0.46TyrTrp: 0.46 ± 0.065
1.722TyrTyr: 1.722 ± 0.15
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 271 proteins (76086 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski