Amino acid dipeptide frequency for Enterobacter phage vB_EhoM-IME523

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red in the table, and under-represented ones are marked in blue.
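The comparison against the uniform expectation can be sketched in Python as follows. This is only an illustration, not the code used to build the table: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to be given as one-letter-code strings, dipeptides are counted as overlapping pairs within each protein, and the counts are normalized by the total number of pairs (the database may normalize differently).

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: any residue outside the 20 standard letters is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            pair = "".join(c if c in AMINO_ACIDS else "X" for c in seq[i:i + 2])
            counts[pair] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"   # would be shown in red
    if freq_per_mille < expected:
        return "under"  # would be shown in blue
    return "expected"
```

For example, `classify(6.014)` (the AlaAla value in the table) gives "over", while `classify(0.148)` (CysCys) gives "under".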

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
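As a small worked example of the unit conversion, the following Python sketch turns a per mille value into a fraction or a percentage and compares it with the uniform expectation for 400 dipeptides. The helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (1 per mille = 0.1%)."""
    return value * 0.1


def fold_vs_uniform(value, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / (1000.0 / n_dipeptides)


# AlaAla is listed as 6.014 per mille, i.e. a fraction of about 0.006014,
# which is roughly 2.4 times the uniform expectation of 2.5 per mille.
print(per_mille_to_fraction(6.014), fold_vs_uniform(6.014))
```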

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.014 ± 0.405
AlaCys: 0.65 ± 0.11
AlaAsp: 4.25 ± 0.294
AlaGlu: 4.937 ± 0.356
AlaPhe: 2.524 ± 0.23
AlaGly: 4.919 ± 0.382
AlaHis: 1.225 ± 0.148
AlaIle: 4.881 ± 0.237
AlaLys: 5.902 ± 0.341
AlaLeu: 6.366 ± 0.438
AlaMet: 1.875 ± 0.201
AlaAsn: 3.378 ± 0.25
AlaPro: 2.654 ± 0.244
AlaGln: 2.691 ± 0.268
AlaArg: 3.359 ± 0.208
AlaSer: 4.269 ± 0.341
AlaThr: 4.622 ± 0.412
AlaVal: 4.863 ± 0.331
AlaTrp: 0.854 ± 0.121
AlaTyr: 2.654 ± 0.213
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.817 ± 0.121
CysCys: 0.148 ± 0.054
CysAsp: 0.872 ± 0.127
CysGlu: 0.78 ± 0.134
CysPhe: 0.334 ± 0.095
CysGly: 0.631 ± 0.116
CysHis: 0.204 ± 0.059
CysIle: 0.52 ± 0.126
CysLys: 0.687 ± 0.121
CysLeu: 0.742 ± 0.102
CysMet: 0.427 ± 0.089
CysAsn: 0.538 ± 0.1
CysPro: 0.501 ± 0.105
CysGln: 0.445 ± 0.106
CysArg: 0.65 ± 0.119
CysSer: 0.761 ± 0.129
CysThr: 0.668 ± 0.113
CysVal: 0.631 ± 0.101
CysTrp: 0.111 ± 0.046
CysTyr: 0.464 ± 0.107
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.603 ± 0.313
AspCys: 0.501 ± 0.104
AspAsp: 4.232 ± 0.298
AspGlu: 5.104 ± 0.335
AspPhe: 3.471 ± 0.246
AspGly: 5.048 ± 0.317
AspHis: 0.891 ± 0.146
AspIle: 4.362 ± 0.253
AspLys: 3.916 ± 0.255
AspLeu: 5.011 ± 0.277
AspMet: 1.967 ± 0.176
AspAsn: 3.025 ± 0.265
AspPro: 2.394 ± 0.23
AspGln: 1.763 ± 0.164
AspArg: 2.339 ± 0.208
AspSer: 3.712 ± 0.274
AspThr: 3.174 ± 0.227
AspVal: 4.25 ± 0.283
AspTrp: 1.225 ± 0.151
AspTyr: 3.062 ± 0.283
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.884 ± 0.404
GluCys: 0.984 ± 0.152
GluAsp: 4.12 ± 0.336
GluGlu: 5.271 ± 0.34
GluPhe: 3.471 ± 0.226
GluGly: 4.158 ± 0.29
GluHis: 1.355 ± 0.156
GluIle: 5.865 ± 0.325
GluLys: 4.362 ± 0.318
GluLeu: 6.626 ± 0.364
GluMet: 2.45 ± 0.208
GluAsn: 3.23 ± 0.26
GluPro: 2.134 ± 0.235
GluGln: 2.951 ± 0.26
GluArg: 3.211 ± 0.23
GluSer: 3.675 ± 0.339
GluThr: 3.842 ± 0.256
GluVal: 4.993 ± 0.292
GluTrp: 1.039 ± 0.132
GluTyr: 3.062 ± 0.273
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.267 ± 0.212
PheCys: 0.501 ± 0.097
PheAsp: 3.192 ± 0.261
PheGlu: 3.619 ± 0.324
PhePhe: 1.355 ± 0.154
PheGly: 3.081 ± 0.215
PheHis: 0.687 ± 0.102
PheIle: 2.988 ± 0.248
PheLys: 4.139 ± 0.281
PheLeu: 2.45 ± 0.205
PheMet: 1.466 ± 0.169
PheAsn: 2.914 ± 0.229
PhePro: 1.262 ± 0.139
PheGln: 1.485 ± 0.169
PheArg: 1.8 ± 0.187
PheSer: 2.413 ± 0.181
PheThr: 2.524 ± 0.215
PheVal: 3.155 ± 0.222
PheTrp: 0.909 ± 0.151
PheTyr: 1.541 ± 0.151
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.675 ± 0.312
GlyCys: 0.687 ± 0.13
GlyAsp: 4.213 ± 0.281
GlyGlu: 4.455 ± 0.285
GlyPhe: 3.081 ± 0.25
GlyGly: 3.712 ± 0.311
GlyHis: 1.206 ± 0.163
GlyIle: 4.046 ± 0.264
GlyLys: 4.547 ± 0.303
GlyLeu: 5.123 ± 0.383
GlyMet: 1.986 ± 0.198
GlyAsn: 3.118 ± 0.37
GlyPro: 2.116 ± 0.185
GlyGln: 2.394 ± 0.266
GlyArg: 2.933 ± 0.214
GlySer: 4.028 ± 0.282
GlyThr: 4.046 ± 0.335
GlyVal: 3.972 ± 0.245
GlyTrp: 1.188 ± 0.121
GlyTyr: 2.858 ± 0.233
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.021 ± 0.125
HisCys: 0.353 ± 0.088
HisAsp: 1.039 ± 0.142
HisGlu: 0.891 ± 0.143
HisPhe: 0.761 ± 0.135
HisGly: 1.281 ± 0.158
HisHis: 0.353 ± 0.089
HisIle: 1.355 ± 0.191
HisLys: 1.188 ± 0.16
HisLeu: 1.615 ± 0.178
HisMet: 0.483 ± 0.089
HisAsn: 0.909 ± 0.128
HisPro: 1.021 ± 0.161
HisGln: 0.668 ± 0.111
HisArg: 0.668 ± 0.122
HisSer: 1.077 ± 0.147
HisThr: 0.705 ± 0.124
HisVal: 1.466 ± 0.175
HisTrp: 0.167 ± 0.059
HisTyr: 0.854 ± 0.126
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.566 ± 0.287
IleCys: 0.835 ± 0.13
IleAsp: 5.123 ± 0.345
IleGlu: 5.104 ± 0.293
IlePhe: 2.58 ± 0.208
IleGly: 3.935 ± 0.276
IleHis: 1.114 ± 0.173
IleIle: 4.436 ± 0.276
IleLys: 5.364 ± 0.319
IleLeu: 3.972 ± 0.255
IleMet: 2.06 ± 0.192
IleAsn: 3.712 ± 0.231
IlePro: 2.747 ± 0.211
IleGln: 2.803 ± 0.208
IleArg: 3.267 ± 0.252
IleSer: 3.842 ± 0.279
IleThr: 4.232 ± 0.281
IleVal: 4.325 ± 0.285
IleTrp: 0.464 ± 0.098
IleTyr: 2.45 ± 0.203
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.515 ± 0.432
LysCys: 0.557 ± 0.111
LysAsp: 4.38 ± 0.287
LysGlu: 5.884 ± 0.391
LysPhe: 3.786 ± 0.3
LysGly: 3.916 ± 0.275
LysHis: 1.466 ± 0.166
LysIle: 4.733 ± 0.29
LysLys: 4.584 ± 0.382
LysLeu: 6.106 ± 0.338
LysMet: 2.413 ± 0.226
LysAsn: 3.582 ± 0.249
LysPro: 2.357 ± 0.241
LysGln: 2.524 ± 0.248
LysArg: 3.044 ± 0.227
LysSer: 3.898 ± 0.272
LysThr: 3.601 ± 0.233
LysVal: 5.215 ± 0.338
LysTrp: 0.947 ± 0.127
LysTyr: 2.766 ± 0.228
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.032 ± 0.363
LeuCys: 0.724 ± 0.131
LeuAsp: 4.863 ± 0.288
LeuGlu: 5.308 ± 0.337
LeuPhe: 3.434 ± 0.297
LeuGly: 4.269 ± 0.313
LeuHis: 1.411 ± 0.161
LeuIle: 4.417 ± 0.304
LeuLys: 6.329 ± 0.371
LeuLeu: 4.622 ± 0.307
LeuMet: 2.246 ± 0.225
LeuAsn: 4.362 ± 0.275
LeuPro: 3.526 ± 0.244
LeuGln: 2.636 ± 0.2
LeuArg: 3.545 ± 0.198
LeuSer: 4.362 ± 0.322
LeuThr: 4.9 ± 0.291
LeuVal: 4.473 ± 0.302
LeuTrp: 0.909 ± 0.117
LeuTyr: 2.933 ± 0.258
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.45 ± 0.207
MetCys: 0.316 ± 0.078
MetAsp: 1.782 ± 0.164
MetGlu: 1.93 ± 0.191
MetPhe: 1.689 ± 0.159
MetGly: 1.652 ± 0.164
MetHis: 0.445 ± 0.081
MetIle: 2.023 ± 0.245
MetLys: 2.747 ± 0.267
MetLeu: 2.172 ± 0.205
MetMet: 0.668 ± 0.127
MetAsn: 1.726 ± 0.176
MetPro: 0.891 ± 0.15
MetGln: 1.188 ± 0.186
MetArg: 1.503 ± 0.153
MetSer: 1.93 ± 0.193
MetThr: 1.893 ± 0.172
MetVal: 1.652 ± 0.176
MetTrp: 0.241 ± 0.064
MetTyr: 0.984 ± 0.115
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.471 ± 0.277
AsnCys: 0.594 ± 0.102
AsnAsp: 2.858 ± 0.199
AsnGlu: 3.935 ± 0.237
AsnPhe: 2.172 ± 0.196
AsnGly: 3.935 ± 0.289
AsnHis: 0.928 ± 0.119
AsnIle: 3.619 ± 0.256
AsnLys: 3.267 ± 0.246
AsnLeu: 4.046 ± 0.268
AsnMet: 1.244 ± 0.155
AsnAsn: 2.84 ± 0.254
AsnPro: 2.339 ± 0.23
AsnGln: 1.819 ± 0.176
AsnArg: 2.413 ± 0.211
AsnSer: 3.248 ± 0.251
AsnThr: 2.691 ± 0.253
AsnVal: 3.267 ± 0.29
AsnTrp: 0.65 ± 0.103
AsnTyr: 1.986 ± 0.216
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.636 ± 0.26
ProCys: 0.501 ± 0.093
ProAsp: 2.728 ± 0.214
ProGlu: 2.914 ± 0.236
ProPhe: 1.503 ± 0.156
ProGly: 2.617 ± 0.244
ProHis: 0.65 ± 0.103
ProIle: 2.005 ± 0.209
ProLys: 2.617 ± 0.222
ProLeu: 2.431 ± 0.206
ProMet: 0.835 ± 0.112
ProAsn: 1.8 ± 0.198
ProPro: 1.132 ± 0.203
ProGln: 1.206 ± 0.142
ProArg: 1.392 ± 0.155
ProSer: 2.58 ± 0.188
ProThr: 1.967 ± 0.198
ProVal: 3.341 ± 0.227
ProTrp: 0.872 ± 0.156
ProTyr: 1.541 ± 0.178
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.97 ± 0.277
GlnCys: 0.316 ± 0.08
GlnAsp: 1.893 ± 0.224
GlnGlu: 1.967 ± 0.172
GlnPhe: 1.837 ± 0.185
GlnGly: 2.32 ± 0.199
GlnHis: 0.65 ± 0.118
GlnIle: 2.487 ± 0.227
GlnLys: 2.506 ± 0.234
GlnLeu: 3.081 ± 0.212
GlnMet: 1.244 ± 0.128
GlnAsn: 1.633 ± 0.147
GlnPro: 1.448 ± 0.136
GlnGln: 1.095 ± 0.173
GlnArg: 1.93 ± 0.18
GlnSer: 2.005 ± 0.211
GlnThr: 2.19 ± 0.216
GlnVal: 2.543 ± 0.197
GlnTrp: 0.909 ± 0.124
GlnTyr: 1.596 ± 0.196
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.951 ± 0.222
ArgCys: 0.557 ± 0.097
ArgAsp: 2.914 ± 0.263
ArgGlu: 3.694 ± 0.243
ArgPhe: 1.986 ± 0.18
ArgGly: 2.58 ± 0.232
ArgHis: 0.854 ± 0.127
ArgIle: 3.415 ± 0.259
ArgLys: 3.118 ± 0.245
ArgLeu: 3.564 ± 0.289
ArgMet: 1.411 ± 0.172
ArgAsn: 2.413 ± 0.201
ArgPro: 1.485 ± 0.15
ArgGln: 1.967 ± 0.179
ArgArg: 1.837 ± 0.185
ArgSer: 2.654 ± 0.204
ArgThr: 2.079 ± 0.205
ArgVal: 3.192 ± 0.259
ArgTrp: 0.798 ± 0.149
ArgTyr: 1.615 ± 0.172
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.823 ± 0.266
SerCys: 0.575 ± 0.107
SerAsp: 3.471 ± 0.26
SerGlu: 3.749 ± 0.217
SerPhe: 2.803 ± 0.229
SerGly: 4.566 ± 0.335
SerHis: 0.947 ± 0.122
SerIle: 4.343 ± 0.288
SerLys: 4.102 ± 0.269
SerLeu: 4.64 ± 0.309
SerMet: 1.559 ± 0.19
SerAsn: 2.877 ± 0.242
SerPro: 2.023 ± 0.216
SerGln: 2.246 ± 0.199
SerArg: 3.044 ± 0.241
SerSer: 3.898 ± 0.273
SerThr: 3.23 ± 0.255
SerVal: 3.712 ± 0.301
SerTrp: 0.854 ± 0.12
SerTyr: 2.413 ± 0.178
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.749 ± 0.295
ThrCys: 0.464 ± 0.095
ThrAsp: 3.545 ± 0.274
ThrGlu: 3.842 ± 0.296
ThrPhe: 2.32 ± 0.236
ThrGly: 3.916 ± 0.298
ThrHis: 1.114 ± 0.146
ThrIle: 3.935 ± 0.267
ThrLys: 4.102 ± 0.247
ThrLeu: 4.38 ± 0.281
ThrMet: 1.392 ± 0.15
ThrAsn: 2.747 ± 0.253
ThrPro: 2.654 ± 0.24
ThrGln: 1.949 ± 0.215
ThrArg: 2.45 ± 0.19
ThrSer: 3.378 ± 0.262
ThrThr: 3.1 ± 0.258
ThrVal: 4.213 ± 0.334
ThrTrp: 0.798 ± 0.097
ThrTyr: 2.042 ± 0.17
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.547 ± 0.298
ValCys: 0.984 ± 0.143
ValAsp: 4.714 ± 0.265
ValGlu: 5.884 ± 0.33
ValPhe: 2.914 ± 0.213
ValGly: 3.861 ± 0.266
ValHis: 1.244 ± 0.133
ValIle: 4.362 ± 0.291
ValLys: 4.696 ± 0.275
ValLeu: 4.362 ± 0.277
ValMet: 2.097 ± 0.216
ValAsn: 3.434 ± 0.274
ValPro: 2.45 ± 0.205
ValGln: 2.561 ± 0.233
ValArg: 3.211 ± 0.247
ValSer: 4.269 ± 0.305
ValThr: 3.675 ± 0.243
ValVal: 5.345 ± 0.378
ValTrp: 0.947 ± 0.129
ValTyr: 3.025 ± 0.237
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.095 ± 0.147
TrpCys: 0.223 ± 0.066
TrpAsp: 0.872 ± 0.126
TrpGlu: 0.928 ± 0.139
TrpPhe: 0.835 ± 0.108
TrpGly: 0.668 ± 0.098
TrpHis: 0.427 ± 0.101
TrpIle: 0.65 ± 0.105
TrpLys: 1.188 ± 0.152
TrpLeu: 1.039 ± 0.136
TrpMet: 0.687 ± 0.118
TrpAsn: 0.78 ± 0.111
TrpPro: 0.445 ± 0.085
TrpGln: 0.575 ± 0.096
TrpArg: 0.612 ± 0.104
TrpSer: 0.687 ± 0.115
TrpThr: 0.798 ± 0.126
TrpVal: 1.095 ± 0.15
TrpTrp: 0.353 ± 0.081
TrpTyr: 0.872 ± 0.094
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.803 ± 0.215
TyrCys: 0.464 ± 0.108
TyrAsp: 2.988 ± 0.27
TyrGlu: 2.506 ± 0.222
TyrPhe: 1.893 ± 0.192
TyrGly: 2.32 ± 0.225
TyrHis: 0.705 ± 0.099
TyrIle: 2.487 ± 0.193
TyrLys: 2.914 ± 0.224
TyrLeu: 2.951 ± 0.24
TyrMet: 1.355 ± 0.183
TyrAsn: 2.301 ± 0.22
TyrPro: 1.689 ± 0.17
TyrGln: 1.652 ± 0.177
TyrArg: 1.875 ± 0.172
TyrSer: 2.19 ± 0.18
TyrThr: 2.153 ± 0.211
TyrVal: 2.877 ± 0.228
TyrTrp: 0.594 ± 0.107
TyrTyr: 1.67 ± 0.183
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 289 proteins (53879 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
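A protein-level bootstrap of this kind can be sketched in Python as below. The page does not describe the exact procedure, so this is an assumption-laden illustration rather than the database's implementation: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the standard deviation of those estimates is reported as the error. The function names, the `seed` parameter, and the normalization by the number of overlapping pairs are all hypothetical choices.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)
```

Under this scheme, a dipeptide that never occurs has a frequency of 0.0 in every resample and therefore an error of 0.0, which is consistent with the "0.0 ± 0.0" entries for the Xaa dipeptides above.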

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski