Amino acid dipepetide frequency for Escherichia phage RB49 (Bacteriophage RB49)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.625AlaAla: 4.625 ± 0.334
0.913AlaCys: 0.913 ± 0.132
4.14AlaAsp: 4.14 ± 0.289
4.45AlaGlu: 4.45 ± 0.343
2.682AlaPhe: 2.682 ± 0.22
4.567AlaGly: 4.567 ± 0.403
1.322AlaHis: 1.322 ± 0.162
5.111AlaIle: 5.111 ± 0.319
5.092AlaLys: 5.092 ± 0.358
5.364AlaLeu: 5.364 ± 0.325
2.546AlaMet: 2.546 ± 0.216
3.44AlaAsn: 3.44 ± 0.249
1.807AlaPro: 1.807 ± 0.179
2.002AlaGln: 2.002 ± 0.254
3.168AlaArg: 3.168 ± 0.27
3.906AlaSer: 3.906 ± 0.287
3.984AlaThr: 3.984 ± 0.478
4.431AlaVal: 4.431 ± 0.303
0.797AlaTrp: 0.797 ± 0.127
2.721AlaTyr: 2.721 ± 0.232
0.0AlaXaa: 0.0 ± 0.0
Cys
0.913CysAla: 0.913 ± 0.155
0.253CysCys: 0.253 ± 0.067
0.913CysAsp: 0.913 ± 0.146
0.933CysGlu: 0.933 ± 0.147
0.972CysPhe: 0.972 ± 0.153
0.991CysGly: 0.991 ± 0.125
0.389CysHis: 0.389 ± 0.083
0.68CysIle: 0.68 ± 0.113
0.933CysLys: 0.933 ± 0.18
0.933CysLeu: 0.933 ± 0.105
0.35CysMet: 0.35 ± 0.078
0.739CysAsn: 0.739 ± 0.116
0.544CysPro: 0.544 ± 0.115
0.253CysGln: 0.253 ± 0.072
0.369CysArg: 0.369 ± 0.091
0.913CysSer: 0.913 ± 0.149
0.602CysThr: 0.602 ± 0.108
1.011CysVal: 1.011 ± 0.118
0.175CysTrp: 0.175 ± 0.061
0.33CysTyr: 0.33 ± 0.08
0.0CysXaa: 0.0 ± 0.0
Asp
4.431AspAla: 4.431 ± 0.334
0.777AspCys: 0.777 ± 0.122
4.256AspAsp: 4.256 ± 0.281
4.606AspGlu: 4.606 ± 0.305
3.11AspPhe: 3.11 ± 0.226
4.839AspGly: 4.839 ± 0.282
1.341AspHis: 1.341 ± 0.16
4.295AspIle: 4.295 ± 0.257
4.723AspLys: 4.723 ± 0.292
5.383AspLeu: 5.383 ± 0.342
1.71AspMet: 1.71 ± 0.187
3.148AspAsn: 3.148 ± 0.216
2.643AspPro: 2.643 ± 0.243
1.943AspGln: 1.943 ± 0.174
3.731AspArg: 3.731 ± 0.278
3.459AspSer: 3.459 ± 0.272
2.973AspThr: 2.973 ± 0.287
4.256AspVal: 4.256 ± 0.299
1.147AspTrp: 1.147 ± 0.154
3.226AspTyr: 3.226 ± 0.237
0.0AspXaa: 0.0 ± 0.0
Glu
4.936GluAla: 4.936 ± 0.347
1.088GluCys: 1.088 ± 0.147
4.412GluAsp: 4.412 ± 0.297
5.558GluGlu: 5.558 ± 0.39
3.051GluPhe: 3.051 ± 0.2
4.276GluGly: 4.276 ± 0.301
1.36GluHis: 1.36 ± 0.18
5.85GluIle: 5.85 ± 0.347
5.597GluLys: 5.597 ± 0.342
6.763GluLeu: 6.763 ± 0.396
1.905GluMet: 1.905 ± 0.19
4.14GluAsn: 4.14 ± 0.267
1.555GluPro: 1.555 ± 0.164
2.352GluGln: 2.352 ± 0.234
3.595GluArg: 3.595 ± 0.26
4.412GluSer: 4.412 ± 0.33
4.198GluThr: 4.198 ± 0.258
4.606GluVal: 4.606 ± 0.298
1.108GluTrp: 1.108 ± 0.169
3.693GluTyr: 3.693 ± 0.302
0.0GluXaa: 0.0 ± 0.0
Phe
2.721PheAla: 2.721 ± 0.275
0.583PheCys: 0.583 ± 0.102
3.79PheAsp: 3.79 ± 0.28
3.615PheGlu: 3.615 ± 0.284
1.496PhePhe: 1.496 ± 0.188
2.935PheGly: 2.935 ± 0.235
0.777PheHis: 0.777 ± 0.13
3.187PheIle: 3.187 ± 0.244
3.518PheLys: 3.518 ± 0.291
2.74PheLeu: 2.74 ± 0.213
1.574PheMet: 1.574 ± 0.209
2.585PheAsn: 2.585 ± 0.203
1.03PhePro: 1.03 ± 0.145
1.127PheGln: 1.127 ± 0.156
2.196PheArg: 2.196 ± 0.195
3.032PheSer: 3.032 ± 0.229
2.915PheThr: 2.915 ± 0.263
2.818PheVal: 2.818 ± 0.248
0.408PheTrp: 0.408 ± 0.076
1.691PheTyr: 1.691 ± 0.171
0.0PheXaa: 0.0 ± 0.0
Gly
3.867GlyAla: 3.867 ± 0.319
0.913GlyCys: 0.913 ± 0.183
4.295GlyAsp: 4.295 ± 0.333
4.159GlyGlu: 4.159 ± 0.316
2.896GlyPhe: 2.896 ± 0.217
3.751GlyGly: 3.751 ± 0.498
1.108GlyHis: 1.108 ± 0.18
4.256GlyIle: 4.256 ± 0.28
4.587GlyLys: 4.587 ± 0.289
4.781GlyLeu: 4.781 ± 0.297
1.671GlyMet: 1.671 ± 0.192
3.304GlyAsn: 3.304 ± 0.379
0.68GlyPro: 0.68 ± 0.106
1.905GlyGln: 1.905 ± 0.201
3.187GlyArg: 3.187 ± 0.271
3.537GlySer: 3.537 ± 0.323
3.479GlyThr: 3.479 ± 0.381
4.975GlyVal: 4.975 ± 0.398
1.011GlyTrp: 1.011 ± 0.135
3.051GlyTyr: 3.051 ± 0.252
0.0GlyXaa: 0.0 ± 0.0
His
1.224HisAla: 1.224 ± 0.173
0.233HisCys: 0.233 ± 0.077
1.244HisAsp: 1.244 ± 0.173
1.322HisGlu: 1.322 ± 0.158
0.875HisPhe: 0.875 ± 0.125
1.71HisGly: 1.71 ± 0.156
0.525HisHis: 0.525 ± 0.098
1.108HisIle: 1.108 ± 0.136
1.283HisLys: 1.283 ± 0.163
1.477HisLeu: 1.477 ± 0.175
0.408HisMet: 0.408 ± 0.088
1.186HisAsn: 1.186 ± 0.132
0.875HisPro: 0.875 ± 0.142
0.525HisGln: 0.525 ± 0.098
0.855HisArg: 0.855 ± 0.125
1.166HisSer: 1.166 ± 0.146
1.069HisThr: 1.069 ± 0.151
1.341HisVal: 1.341 ± 0.174
0.311HisTrp: 0.311 ± 0.091
1.108HisTyr: 1.108 ± 0.189
0.0HisXaa: 0.0 ± 0.0
Ile
5.306IleAla: 5.306 ± 0.434
0.816IleCys: 0.816 ± 0.141
5.442IleAsp: 5.442 ± 0.287
5.131IleGlu: 5.131 ± 0.338
2.216IlePhe: 2.216 ± 0.2
3.79IleGly: 3.79 ± 0.251
1.224IleHis: 1.224 ± 0.137
4.839IleIle: 4.839 ± 0.301
5.558IleLys: 5.558 ± 0.346
4.334IleLeu: 4.334 ± 0.285
2.352IleMet: 2.352 ± 0.198
4.042IleAsn: 4.042 ± 0.266
2.449IlePro: 2.449 ± 0.253
2.352IleGln: 2.352 ± 0.23
3.382IleArg: 3.382 ± 0.258
3.731IleSer: 3.731 ± 0.258
4.917IleThr: 4.917 ± 0.287
4.917IleVal: 4.917 ± 0.278
0.564IleTrp: 0.564 ± 0.118
2.449IleTyr: 2.449 ± 0.217
0.0IleXaa: 0.0 ± 0.0
Lys
5.636LysAla: 5.636 ± 0.346
0.855LysCys: 0.855 ± 0.177
4.412LysAsp: 4.412 ± 0.369
6.608LysGlu: 6.608 ± 0.348
3.343LysPhe: 3.343 ± 0.273
4.023LysGly: 4.023 ± 0.27
1.749LysHis: 1.749 ± 0.183
5.422LysIle: 5.422 ± 0.232
5.014LysLys: 5.014 ± 0.348
5.908LysLeu: 5.908 ± 0.323
3.012LysMet: 3.012 ± 0.244
4.178LysAsn: 4.178 ± 0.352
2.876LysPro: 2.876 ± 0.254
3.012LysGln: 3.012 ± 0.222
3.498LysArg: 3.498 ± 0.23
3.382LysSer: 3.382 ± 0.255
4.684LysThr: 4.684 ± 0.347
5.15LysVal: 5.15 ± 0.278
1.049LysTrp: 1.049 ± 0.158
3.051LysTyr: 3.051 ± 0.279
0.0LysXaa: 0.0 ± 0.0
Leu
5.306LeuAla: 5.306 ± 0.298
1.069LeuCys: 1.069 ± 0.191
4.995LeuAsp: 4.995 ± 0.348
5.85LeuGlu: 5.85 ± 0.352
2.973LeuPhe: 2.973 ± 0.283
3.79LeuGly: 3.79 ± 0.24
1.244LeuHis: 1.244 ± 0.146
4.8LeuIle: 4.8 ± 0.298
6.102LeuLys: 6.102 ± 0.365
4.314LeuLeu: 4.314 ± 0.318
2.76LeuMet: 2.76 ± 0.253
3.615LeuAsn: 3.615 ± 0.245
2.818LeuPro: 2.818 ± 0.195
2.274LeuGln: 2.274 ± 0.199
3.673LeuArg: 3.673 ± 0.259
4.781LeuSer: 4.781 ± 0.319
4.489LeuThr: 4.489 ± 0.311
4.295LeuVal: 4.295 ± 0.245
0.739LeuTrp: 0.739 ± 0.121
3.362LeuTyr: 3.362 ± 0.264
0.0LeuXaa: 0.0 ± 0.0
Met
1.905MetAla: 1.905 ± 0.203
0.428MetCys: 0.428 ± 0.078
1.535MetAsp: 1.535 ± 0.182
1.827MetGlu: 1.827 ± 0.189
1.574MetPhe: 1.574 ± 0.173
1.73MetGly: 1.73 ± 0.189
0.564MetHis: 0.564 ± 0.095
2.604MetIle: 2.604 ± 0.258
3.11MetLys: 3.11 ± 0.255
2.41MetLeu: 2.41 ± 0.223
1.108MetMet: 1.108 ± 0.142
1.496MetAsn: 1.496 ± 0.175
0.836MetPro: 0.836 ± 0.109
1.244MetGln: 1.244 ± 0.14
1.302MetArg: 1.302 ± 0.184
1.885MetSer: 1.885 ± 0.201
1.827MetThr: 1.827 ± 0.189
2.021MetVal: 2.021 ± 0.181
0.505MetTrp: 0.505 ± 0.104
0.836MetTyr: 0.836 ± 0.108
0.0MetXaa: 0.0 ± 0.0
Asn
3.809AsnAla: 3.809 ± 0.285
0.739AsnCys: 0.739 ± 0.134
3.129AsnAsp: 3.129 ± 0.255
3.459AsnGlu: 3.459 ± 0.28
2.39AsnPhe: 2.39 ± 0.206
4.14AsnGly: 4.14 ± 0.29
1.186AsnHis: 1.186 ± 0.179
3.809AsnIle: 3.809 ± 0.314
3.77AsnLys: 3.77 ± 0.248
3.615AsnLeu: 3.615 ± 0.249
1.38AsnMet: 1.38 ± 0.153
2.682AsnAsn: 2.682 ± 0.289
2.216AsnPro: 2.216 ± 0.19
1.458AsnGln: 1.458 ± 0.183
2.39AsnArg: 2.39 ± 0.212
2.624AsnSer: 2.624 ± 0.217
2.799AsnThr: 2.799 ± 0.213
4.198AsnVal: 4.198 ± 0.301
0.505AsnTrp: 0.505 ± 0.106
2.332AsnTyr: 2.332 ± 0.238
0.0AsnXaa: 0.0 ± 0.0
Pro
1.982ProAla: 1.982 ± 0.201
0.408ProCys: 0.408 ± 0.094
2.565ProAsp: 2.565 ± 0.213
3.032ProGlu: 3.032 ± 0.245
1.613ProPhe: 1.613 ± 0.194
0.894ProGly: 0.894 ± 0.144
0.758ProHis: 0.758 ± 0.123
2.06ProIle: 2.06 ± 0.215
2.546ProLys: 2.546 ± 0.288
1.982ProLeu: 1.982 ± 0.182
0.719ProMet: 0.719 ± 0.114
1.438ProAsn: 1.438 ± 0.137
0.758ProPro: 0.758 ± 0.138
1.03ProGln: 1.03 ± 0.131
1.399ProArg: 1.399 ± 0.152
2.235ProSer: 2.235 ± 0.189
2.274ProThr: 2.274 ± 0.261
2.546ProVal: 2.546 ± 0.252
0.428ProTrp: 0.428 ± 0.085
1.38ProTyr: 1.38 ± 0.153
0.0ProXaa: 0.0 ± 0.0
Gln
1.807GlnAla: 1.807 ± 0.163
0.408GlnCys: 0.408 ± 0.095
1.438GlnAsp: 1.438 ± 0.178
2.39GlnGlu: 2.39 ± 0.217
1.632GlnPhe: 1.632 ± 0.186
1.477GlnGly: 1.477 ± 0.161
0.68GlnHis: 0.68 ± 0.109
2.429GlnIle: 2.429 ± 0.261
2.313GlnLys: 2.313 ± 0.248
2.701GlnLeu: 2.701 ± 0.242
0.875GlnMet: 0.875 ± 0.132
1.496GlnAsn: 1.496 ± 0.216
1.127GlnPro: 1.127 ± 0.15
0.777GlnGln: 0.777 ± 0.125
1.866GlnArg: 1.866 ± 0.157
1.594GlnSer: 1.594 ± 0.176
1.866GlnThr: 1.866 ± 0.204
2.002GlnVal: 2.002 ± 0.207
0.602GlnTrp: 0.602 ± 0.093
1.458GlnTyr: 1.458 ± 0.189
0.0GlnXaa: 0.0 ± 0.0
Arg
2.468ArgAla: 2.468 ± 0.211
0.641ArgCys: 0.641 ± 0.127
3.362ArgAsp: 3.362 ± 0.265
3.79ArgGlu: 3.79 ± 0.299
2.585ArgPhe: 2.585 ± 0.24
3.09ArgGly: 3.09 ± 0.25
1.011ArgHis: 1.011 ± 0.138
3.265ArgIle: 3.265 ± 0.253
4.003ArgLys: 4.003 ± 0.305
3.459ArgLeu: 3.459 ± 0.308
1.166ArgMet: 1.166 ± 0.175
2.565ArgAsn: 2.565 ± 0.216
1.613ArgPro: 1.613 ± 0.152
1.496ArgGln: 1.496 ± 0.156
2.526ArgArg: 2.526 ± 0.26
2.429ArgSer: 2.429 ± 0.2
2.041ArgThr: 2.041 ± 0.22
3.44ArgVal: 3.44 ± 0.251
0.758ArgTrp: 0.758 ± 0.143
2.118ArgTyr: 2.118 ± 0.211
0.0ArgXaa: 0.0 ± 0.0
Ser
3.751SerAla: 3.751 ± 0.269
0.777SerCys: 0.777 ± 0.129
3.595SerAsp: 3.595 ± 0.23
4.237SerGlu: 4.237 ± 0.286
2.779SerPhe: 2.779 ± 0.208
4.023SerGly: 4.023 ± 0.313
0.952SerHis: 0.952 ± 0.121
3.731SerIle: 3.731 ± 0.289
4.334SerLys: 4.334 ± 0.317
4.062SerLeu: 4.062 ± 0.299
1.788SerMet: 1.788 ± 0.178
3.012SerAsn: 3.012 ± 0.226
1.749SerPro: 1.749 ± 0.177
1.555SerGln: 1.555 ± 0.178
2.993SerArg: 2.993 ± 0.218
3.032SerSer: 3.032 ± 0.247
2.857SerThr: 2.857 ± 0.22
4.042SerVal: 4.042 ± 0.286
0.836SerTrp: 0.836 ± 0.117
1.691SerTyr: 1.691 ± 0.15
0.0SerXaa: 0.0 ± 0.0
Thr
4.256ThrAla: 4.256 ± 0.336
0.739ThrCys: 0.739 ± 0.122
3.557ThrAsp: 3.557 ± 0.281
3.984ThrGlu: 3.984 ± 0.317
2.565ThrPhe: 2.565 ± 0.191
3.731ThrGly: 3.731 ± 0.322
1.322ThrHis: 1.322 ± 0.16
3.809ThrIle: 3.809 ± 0.272
4.509ThrLys: 4.509 ± 0.234
4.548ThrLeu: 4.548 ± 0.336
1.613ThrMet: 1.613 ± 0.176
2.76ThrAsn: 2.76 ± 0.227
2.935ThrPro: 2.935 ± 0.288
2.021ThrGln: 2.021 ± 0.309
2.41ThrArg: 2.41 ± 0.252
2.721ThrSer: 2.721 ± 0.305
3.11ThrThr: 3.11 ± 0.322
4.237ThrVal: 4.237 ± 0.374
0.719ThrTrp: 0.719 ± 0.129
2.352ThrTyr: 2.352 ± 0.192
0.0ThrXaa: 0.0 ± 0.0
Val
4.314ValAla: 4.314 ± 0.263
0.564ValCys: 0.564 ± 0.11
5.053ValAsp: 5.053 ± 0.291
5.85ValGlu: 5.85 ± 0.356
3.323ValPhe: 3.323 ± 0.249
4.12ValGly: 4.12 ± 0.318
1.069ValHis: 1.069 ± 0.13
4.47ValIle: 4.47 ± 0.336
5.733ValLys: 5.733 ± 0.337
4.528ValLeu: 4.528 ± 0.283
2.118ValMet: 2.118 ± 0.204
3.673ValAsn: 3.673 ± 0.284
1.885ValPro: 1.885 ± 0.222
2.06ValGln: 2.06 ± 0.214
2.799ValArg: 2.799 ± 0.229
3.926ValSer: 3.926 ± 0.278
4.12ValThr: 4.12 ± 0.328
4.956ValVal: 4.956 ± 0.314
1.069ValTrp: 1.069 ± 0.136
3.42ValTyr: 3.42 ± 0.247
0.0ValXaa: 0.0 ± 0.0
Trp
0.68TrpAla: 0.68 ± 0.102
0.311TrpCys: 0.311 ± 0.075
0.836TrpAsp: 0.836 ± 0.128
1.108TrpGlu: 1.108 ± 0.128
0.602TrpPhe: 0.602 ± 0.114
0.758TrpGly: 0.758 ± 0.14
0.389TrpHis: 0.389 ± 0.083
0.855TrpIle: 0.855 ± 0.129
0.952TrpLys: 0.952 ± 0.129
0.952TrpLeu: 0.952 ± 0.145
0.544TrpMet: 0.544 ± 0.11
0.641TrpAsn: 0.641 ± 0.111
0.233TrpPro: 0.233 ± 0.06
0.311TrpGln: 0.311 ± 0.083
0.622TrpArg: 0.622 ± 0.103
0.719TrpSer: 0.719 ± 0.106
0.913TrpThr: 0.913 ± 0.115
1.108TrpVal: 1.108 ± 0.154
0.233TrpTrp: 0.233 ± 0.067
0.719TrpTyr: 0.719 ± 0.098
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.129TyrAla: 3.129 ± 0.236
0.68TyrCys: 0.68 ± 0.099
3.032TyrAsp: 3.032 ± 0.226
2.546TyrGlu: 2.546 ± 0.208
2.06TyrPhe: 2.06 ± 0.202
2.799TyrGly: 2.799 ± 0.235
0.875TyrHis: 0.875 ± 0.121
3.226TyrIle: 3.226 ± 0.248
3.187TyrLys: 3.187 ± 0.264
2.993TyrLeu: 2.993 ± 0.259
1.127TyrMet: 1.127 ± 0.159
2.488TyrAsn: 2.488 ± 0.192
1.419TyrPro: 1.419 ± 0.15
1.244TyrGln: 1.244 ± 0.153
1.866TyrArg: 1.866 ± 0.2
2.313TyrSer: 2.313 ± 0.231
2.779TyrThr: 2.779 ± 0.202
2.701TyrVal: 2.701 ± 0.202
0.525TyrTrp: 0.525 ± 0.113
1.963TyrTyr: 1.963 ± 0.244
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 277 proteins (51456 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski