Amino acid dipeptide frequency for Escherichia phage vB_EcoM_011D4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
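As an illustration of the counting described above, here is a minimal Python sketch. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes with `X` standing in for Xaa, the counting of overlapping pairs within each protein only, and the normalization by the total number of pairs are all assumptions made for this example, so the numbers it produces may differ slightly from those in the table.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids
ALPHABET = STANDARD + "X"           # X plays the role of Xaa (assumption)
UNIFORM_PER_MILLE = 1000.0 / 400    # 0.25% = 2.5 per mille for 400 dipeptides


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Overlapping pairs are counted inside each protein; pairs never span
    two proteins. Any letter outside the 20 standard ones is mapped to X.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(ALPHABET, repeat=2):
        pair = a + b
        freqs[pair] = 1000.0 * counts[pair] / total if total else 0.0
    return freqs


if __name__ == "__main__":
    # Toy input, not real phage proteins.
    freqs = dipeptide_per_mille(["AAAC", "CA"])
    for pair in ("AA", "AC", "CA", "CC"):
        status = "over" if freqs[pair] > UNIFORM_PER_MILLE else "under"
        print(pair, round(freqs[pair], 3), status)
```

The dictionary always has 441 entries, so dipeptides that never occur (such as all Xaa pairs in this proteome) appear with a frequency of 0.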

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
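The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical, and the comparison against a uniform 2.5‰ baseline follows the 1-in-400 reasoning given in the text; how the database itself decides which cells to colour is not stated here.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # uniform expectation for 400 dipeptides


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def fold_vs_uniform(value_per_mille):
    """Ratio of an observed per mille value to the uniform 2.5 per mille."""
    return value_per_mille / UNIFORM_PER_MILLE


if __name__ == "__main__":
    # Two values taken from the table: AlaAla and CysCys.
    for name, value in (("AlaAla", 4.527), ("CysCys", 0.216)):
        print(name, per_mille_to_fraction(value), round(fold_vs_uniform(value), 3))
```

With these inputs, AlaAla (4.527‰) is roughly 1.8 times the uniform expectation, while CysCys (0.216‰) is well below it.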

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Rows (first residue); each entry is given as dipeptide: frequency ± error (‰)

Ala
AlaAla: 4.527 ± 0.349
AlaCys: 0.941 ± 0.112
AlaAsp: 4.135 ± 0.28
AlaGlu: 4.644 ± 0.321
AlaPhe: 2.724 ± 0.221
AlaGly: 4.546 ± 0.449
AlaHis: 1.313 ± 0.191
AlaIle: 4.997 ± 0.354
AlaLys: 4.958 ± 0.329
AlaLeu: 5.232 ± 0.289
AlaMet: 2.626 ± 0.227
AlaAsn: 3.508 ± 0.268
AlaPro: 1.842 ± 0.203
AlaGln: 2.058 ± 0.245
AlaArg: 3.096 ± 0.291
AlaSer: 3.88 ± 0.318
AlaThr: 4.056 ± 0.534
AlaVal: 4.487 ± 0.299
AlaTrp: 0.862 ± 0.109
AlaTyr: 2.685 ± 0.231
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.803 ± 0.147
CysCys: 0.216 ± 0.062
CysAsp: 0.98 ± 0.137
CysGlu: 0.96 ± 0.165
CysPhe: 0.823 ± 0.107
CysGly: 0.98 ± 0.142
CysHis: 0.353 ± 0.082
CysIle: 0.725 ± 0.131
CysLys: 0.823 ± 0.13
CysLeu: 0.999 ± 0.138
CysMet: 0.333 ± 0.073
CysAsn: 0.705 ± 0.113
CysPro: 0.47 ± 0.109
CysGln: 0.294 ± 0.07
CysArg: 0.372 ± 0.086
CysSer: 0.823 ± 0.119
CysThr: 0.666 ± 0.122
CysVal: 0.999 ± 0.145
CysTrp: 0.176 ± 0.063
CysTyr: 0.353 ± 0.08
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.644 ± 0.365
AspCys: 0.803 ± 0.139
AspAsp: 4.135 ± 0.27
AspGlu: 4.664 ± 0.273
AspPhe: 3.331 ± 0.242
AspGly: 4.762 ± 0.331
AspHis: 1.372 ± 0.169
AspIle: 4.252 ± 0.225
AspLys: 4.546 ± 0.374
AspLeu: 5.526 ± 0.329
AspMet: 1.783 ± 0.167
AspAsn: 3.116 ± 0.245
AspPro: 2.783 ± 0.226
AspGln: 1.92 ± 0.19
AspArg: 3.586 ± 0.257
AspSer: 3.331 ± 0.265
AspThr: 3.194 ± 0.329
AspVal: 4.311 ± 0.301
AspTrp: 1.195 ± 0.155
AspTyr: 3.194 ± 0.215
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.919 ± 0.301
GluCys: 1.039 ± 0.176
GluAsp: 4.389 ± 0.32
GluGlu: 5.604 ± 0.39
GluPhe: 3.037 ± 0.215
GluGly: 4.233 ± 0.304
GluHis: 1.411 ± 0.165
GluIle: 5.742 ± 0.311
GluLys: 5.84 ± 0.384
GluLeu: 6.741 ± 0.358
GluMet: 1.999 ± 0.234
GluAsn: 4.292 ± 0.31
GluPro: 1.45 ± 0.164
GluGln: 2.43 ± 0.231
GluArg: 3.606 ± 0.298
GluSer: 4.233 ± 0.268
GluThr: 4.115 ± 0.298
GluVal: 4.821 ± 0.288
GluTrp: 1.176 ± 0.163
GluTyr: 3.625 ± 0.294
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.489 ± 0.264
PheCys: 0.666 ± 0.122
PheAsp: 3.645 ± 0.283
PheGlu: 3.664 ± 0.293
PhePhe: 1.509 ± 0.16
PheGly: 2.998 ± 0.23
PheHis: 0.823 ± 0.127
PheIle: 3.155 ± 0.25
PheLys: 3.508 ± 0.288
PheLeu: 2.645 ± 0.228
PheMet: 1.587 ± 0.188
PheAsn: 2.606 ± 0.229
PhePro: 1.097 ± 0.144
PheGln: 1.137 ± 0.142
PheArg: 2.195 ± 0.213
PheSer: 3.037 ± 0.222
PheThr: 2.939 ± 0.226
PheVal: 2.822 ± 0.267
PheTrp: 0.392 ± 0.074
PheTyr: 1.646 ± 0.174
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.9 ± 0.342
GlyCys: 0.823 ± 0.122
GlyAsp: 4.409 ± 0.378
GlyGlu: 4.135 ± 0.304
GlyPhe: 2.763 ± 0.22
GlyGly: 3.88 ± 0.622
GlyHis: 1.058 ± 0.159
GlyIle: 4.311 ± 0.319
GlyLys: 4.605 ± 0.279
GlyLeu: 4.781 ± 0.296
GlyMet: 1.764 ± 0.208
GlyAsn: 3.351 ± 0.423
GlyPro: 0.705 ± 0.1
GlyGln: 1.94 ± 0.23
GlyArg: 3.214 ± 0.297
GlySer: 3.508 ± 0.348
GlyThr: 3.468 ± 0.395
GlyVal: 5.173 ± 0.335
GlyTrp: 0.882 ± 0.14
GlyTyr: 2.959 ± 0.237
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.274 ± 0.155
HisCys: 0.196 ± 0.057
HisAsp: 1.195 ± 0.147
HisGlu: 1.411 ± 0.185
HisPhe: 0.921 ± 0.12
HisGly: 1.666 ± 0.204
HisHis: 0.49 ± 0.109
HisIle: 1.058 ± 0.147
HisLys: 1.352 ± 0.188
HisLeu: 1.313 ± 0.153
HisMet: 0.529 ± 0.093
HisAsn: 1.156 ± 0.167
HisPro: 0.901 ± 0.147
HisGln: 0.549 ± 0.103
HisArg: 0.823 ± 0.134
HisSer: 1.156 ± 0.151
HisThr: 1.058 ± 0.142
HisVal: 1.391 ± 0.163
HisTrp: 0.353 ± 0.092
HisTyr: 1.058 ± 0.171
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.252 ± 0.377
IleCys: 0.725 ± 0.122
IleAsp: 5.33 ± 0.333
IleGlu: 5.31 ± 0.319
IlePhe: 2.175 ± 0.229
IleGly: 3.743 ± 0.28
IleHis: 1.313 ± 0.182
IleIle: 4.585 ± 0.307
IleLys: 5.565 ± 0.364
IleLeu: 4.096 ± 0.269
IleMet: 2.371 ± 0.224
IleAsn: 3.998 ± 0.23
IlePro: 2.587 ± 0.257
IleGln: 2.293 ± 0.247
IleArg: 3.488 ± 0.256
IleSer: 3.704 ± 0.291
IleThr: 4.899 ± 0.289
IleVal: 4.958 ± 0.292
IleTrp: 0.647 ± 0.112
IleTyr: 2.449 ± 0.202
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.604 ± 0.424
LysCys: 0.862 ± 0.172
LysAsp: 4.585 ± 0.38
LysGlu: 6.427 ± 0.413
LysPhe: 3.371 ± 0.283
LysGly: 4.056 ± 0.276
LysHis: 1.548 ± 0.172
LysIle: 5.369 ± 0.277
LysLys: 4.919 ± 0.336
LysLeu: 5.996 ± 0.356
LysMet: 2.763 ± 0.272
LysAsn: 4.017 ± 0.303
LysPro: 2.822 ± 0.23
LysGln: 3.194 ± 0.253
LysArg: 3.488 ± 0.281
LysSer: 3.508 ± 0.265
LysThr: 4.723 ± 0.296
LysVal: 5.115 ± 0.31
LysTrp: 1.039 ± 0.136
LysTyr: 3.077 ± 0.249
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.506 ± 0.319
LeuCys: 0.999 ± 0.154
LeuAsp: 5.075 ± 0.295
LeuGlu: 5.8 ± 0.401
LeuPhe: 2.998 ± 0.253
LeuGly: 3.88 ± 0.23
LeuHis: 1.235 ± 0.153
LeuIle: 4.84 ± 0.272
LeuLys: 6.055 ± 0.393
LeuLeu: 4.468 ± 0.316
LeuMet: 2.665 ± 0.235
LeuAsn: 3.547 ± 0.233
LeuPro: 2.861 ± 0.253
LeuGln: 2.352 ± 0.195
LeuArg: 3.527 ± 0.266
LeuSer: 4.625 ± 0.34
LeuThr: 4.331 ± 0.275
LeuVal: 4.115 ± 0.292
LeuTrp: 0.686 ± 0.122
LeuTyr: 3.39 ± 0.268
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.96 ± 0.19
MetCys: 0.451 ± 0.094
MetAsp: 1.489 ± 0.177
MetGlu: 1.842 ± 0.196
MetPhe: 1.568 ± 0.158
MetGly: 1.607 ± 0.184
MetHis: 0.647 ± 0.122
MetIle: 2.567 ± 0.205
MetLys: 3.077 ± 0.306
MetLeu: 2.391 ± 0.189
MetMet: 1.078 ± 0.167
MetAsn: 1.607 ± 0.166
MetPro: 0.901 ± 0.104
MetGln: 1.313 ± 0.172
MetArg: 1.333 ± 0.167
MetSer: 1.842 ± 0.208
MetThr: 1.724 ± 0.162
MetVal: 1.92 ± 0.195
MetTrp: 0.509 ± 0.108
MetTyr: 0.803 ± 0.115
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.704 ± 0.222
AsnCys: 0.705 ± 0.136
AsnAsp: 3.371 ± 0.283
AsnGlu: 3.547 ± 0.215
AsnPhe: 2.391 ± 0.211
AsnGly: 4.194 ± 0.339
AsnHis: 1.215 ± 0.178
AsnIle: 3.684 ± 0.255
AsnLys: 3.821 ± 0.27
AsnLeu: 3.586 ± 0.259
AsnMet: 1.431 ± 0.163
AsnAsn: 2.606 ± 0.241
AsnPro: 2.254 ± 0.173
AsnGln: 1.45 ± 0.188
AsnArg: 2.547 ± 0.205
AsnSer: 2.587 ± 0.233
AsnThr: 2.704 ± 0.238
AsnVal: 4.272 ± 0.297
AsnTrp: 0.49 ± 0.089
AsnTyr: 2.254 ± 0.217
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.097 ± 0.236
ProCys: 0.412 ± 0.082
ProAsp: 2.704 ± 0.246
ProGlu: 3.077 ± 0.242
ProPhe: 1.607 ± 0.198
ProGly: 0.96 ± 0.136
ProHis: 0.862 ± 0.151
ProIle: 2.097 ± 0.214
ProLys: 2.606 ± 0.272
ProLeu: 1.979 ± 0.179
ProMet: 0.725 ± 0.105
ProAsn: 1.372 ± 0.142
ProPro: 0.843 ± 0.141
ProGln: 1.039 ± 0.115
ProArg: 1.411 ± 0.171
ProSer: 2.116 ± 0.163
ProThr: 2.254 ± 0.227
ProVal: 2.665 ± 0.258
ProTrp: 0.451 ± 0.087
ProTyr: 1.352 ± 0.146
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.901 ± 0.207
GlnCys: 0.392 ± 0.087
GlnAsp: 1.528 ± 0.204
GlnGlu: 2.293 ± 0.21
GlnPhe: 1.724 ± 0.188
GlnGly: 1.626 ± 0.227
GlnHis: 0.666 ± 0.117
GlnIle: 2.273 ± 0.195
GlnLys: 2.371 ± 0.227
GlnLeu: 2.685 ± 0.224
GlnMet: 0.882 ± 0.146
GlnAsn: 1.646 ± 0.222
GlnPro: 1.176 ± 0.167
GlnGln: 0.823 ± 0.115
GlnArg: 1.881 ± 0.166
GlnSer: 1.587 ± 0.187
GlnThr: 1.803 ± 0.224
GlnVal: 2.018 ± 0.209
GlnTrp: 0.607 ± 0.102
GlnTyr: 1.489 ± 0.164
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.508 ± 0.235
ArgCys: 0.627 ± 0.118
ArgAsp: 3.371 ± 0.269
ArgGlu: 3.762 ± 0.268
ArgPhe: 2.449 ± 0.223
ArgGly: 3.077 ± 0.243
ArgHis: 0.941 ± 0.125
ArgIle: 3.39 ± 0.204
ArgLys: 4.194 ± 0.316
ArgLeu: 3.468 ± 0.295
ArgMet: 1.215 ± 0.149
ArgAsn: 2.567 ± 0.2
ArgPro: 1.685 ± 0.188
ArgGln: 1.528 ± 0.175
ArgArg: 2.547 ± 0.239
ArgSer: 2.41 ± 0.201
ArgThr: 2.077 ± 0.207
ArgVal: 3.351 ± 0.226
ArgTrp: 0.823 ± 0.133
ArgTyr: 2.156 ± 0.211
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.566 ± 0.302
SerCys: 0.784 ± 0.111
SerAsp: 3.547 ± 0.241
SerGlu: 3.958 ± 0.287
SerPhe: 2.92 ± 0.191
SerGly: 3.978 ± 0.332
SerHis: 0.96 ± 0.126
SerIle: 3.782 ± 0.269
SerLys: 4.135 ± 0.314
SerLeu: 3.998 ± 0.282
SerMet: 1.587 ± 0.169
SerAsn: 2.979 ± 0.218
SerPro: 1.822 ± 0.19
SerGln: 1.411 ± 0.183
SerArg: 3.175 ± 0.241
SerSer: 3.077 ± 0.237
SerThr: 2.861 ± 0.231
SerVal: 4.017 ± 0.311
SerTrp: 0.705 ± 0.13
SerTyr: 1.783 ± 0.171
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.468 ± 0.35
ThrCys: 0.705 ± 0.113
ThrAsp: 3.527 ± 0.237
ThrGlu: 3.939 ± 0.302
ThrPhe: 2.43 ± 0.186
ThrGly: 3.723 ± 0.378
ThrHis: 1.333 ± 0.179
ThrIle: 3.802 ± 0.336
ThrLys: 4.507 ± 0.258
ThrLeu: 4.546 ± 0.316
ThrMet: 1.626 ± 0.188
ThrAsn: 2.685 ± 0.211
ThrPro: 2.9 ± 0.306
ThrGln: 1.999 ± 0.262
ThrArg: 2.547 ± 0.245
ThrSer: 2.587 ± 0.23
ThrThr: 3.194 ± 0.301
ThrVal: 4.194 ± 0.345
ThrTrp: 0.764 ± 0.11
ThrTyr: 2.371 ± 0.198
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.194 ± 0.327
ValCys: 0.588 ± 0.12
ValAsp: 5.134 ± 0.34
ValGlu: 5.938 ± 0.393
ValPhe: 3.194 ± 0.255
ValGly: 4.076 ± 0.312
ValHis: 1.078 ± 0.124
ValIle: 4.742 ± 0.332
ValLys: 5.624 ± 0.358
ValLeu: 4.625 ± 0.319
ValMet: 2.018 ± 0.17
ValAsn: 3.841 ± 0.319
ValPro: 1.881 ± 0.171
ValGln: 2.038 ± 0.2
ValArg: 2.861 ± 0.225
ValSer: 3.958 ± 0.255
ValThr: 4.213 ± 0.334
ValVal: 4.821 ± 0.311
ValTrp: 0.96 ± 0.142
ValTyr: 3.41 ± 0.28
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.666 ± 0.104
TrpCys: 0.294 ± 0.076
TrpAsp: 0.843 ± 0.135
TrpGlu: 1.137 ± 0.139
TrpPhe: 0.588 ± 0.099
TrpGly: 0.823 ± 0.134
TrpHis: 0.353 ± 0.072
TrpIle: 0.803 ± 0.119
TrpLys: 0.98 ± 0.136
TrpLeu: 0.941 ± 0.105
TrpMet: 0.529 ± 0.094
TrpAsn: 0.607 ± 0.116
TrpPro: 0.235 ± 0.077
TrpGln: 0.353 ± 0.077
TrpArg: 0.607 ± 0.1
TrpSer: 0.745 ± 0.098
TrpThr: 0.901 ± 0.108
TrpVal: 1.097 ± 0.133
TrpTrp: 0.196 ± 0.064
TrpTyr: 0.705 ± 0.099
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.135 ± 0.245
TyrCys: 0.627 ± 0.107
TyrAsp: 3.057 ± 0.23
TyrGlu: 2.587 ± 0.229
TyrPhe: 2.038 ± 0.175
TyrGly: 2.822 ± 0.237
TyrHis: 0.882 ± 0.114
TyrIle: 3.312 ± 0.231
TyrLys: 2.998 ± 0.226
TyrLeu: 2.939 ± 0.269
TyrMet: 1.097 ± 0.151
TyrAsn: 2.606 ± 0.239
TyrPro: 1.411 ± 0.161
TyrGln: 1.254 ± 0.148
TyrArg: 1.842 ± 0.213
TyrSer: 2.332 ± 0.226
TyrThr: 2.665 ± 0.203
TyrVal: 2.645 ± 0.221
TyrTrp: 0.509 ± 0.098
TyrTyr: 2.077 ± 0.2
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 273 proteins (51032 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
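A protein-level bootstrap of this kind might look roughly like the Python sketch below. It is only an illustration under stated assumptions: the function name `bootstrap_dipeptide_error` is hypothetical, whole proteins are resampled with replacement, the frequency is normalized by the number of pairs in each resample, and the reported error is taken to be the standard deviation over 100 resamples. The note above does not say which estimator the database actually uses.

```python
import random
import statistics
from collections import Counter


def bootstrap_dipeptide_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(proteins) whole proteins with replacement and
    recomputes the frequency of `dipeptide` (one-letter codes, e.g. "AA").
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    # Count overlapping pairs once per protein, then resample the counters.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not real phage proteins.
    toy = ["MAAKL", "MKAAL", "MLLKA", "MAKAK"]
    mean, err = bootstrap_dipeptide_error(toy, "AA")
    print(f"AA: {mean:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein together, which is presumably what "at the protein level" refers to.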

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski