Amino acid dipeptide frequency for Escherichia phage RB16

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
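As an illustration only, the following Python sketch shows one way such dipeptide frequencies could be computed. It assumes one-letter amino acid codes, overlapping windows of two residues, and normalisation by the total number of dipeptides; the function name dipeptide_frequencies and the toy sequences are hypothetical, and the exact procedure used by the database may differ.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins.

    Assumption: overlapping windows of length 2, counted within each
    protein only (a pair never spans two different proteins).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale to per mille so the numbers are comparable with the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the RB16 proteome).
freqs = dipeptide_frequencies(["MKKL", "AKK"])
```

The two toy sequences contain five dipeptides in total (MK, KK, KL, AK, KK), so all frequencies returned by the sketch sum to 1000 per mille.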

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, with all 20 standard amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
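The unit conversion and the comparison with the random expectation can be sketched in Python as follows. This is a minimal sketch that assumes the reference value is the uniform expectation over the 400 standard dipeptides (0.25%, i.e. 2.5‰); the names permille_to_fraction and representation are hypothetical, and the sketch ignores the reported error.

```python
N_STANDARD = 20
# Uniform expectation over 20 x 20 dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / N_STANDARD ** 2  # 2.5


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PERMILLE):
    """Classify a per mille frequency relative to the expected value."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values taken from the table: AlaAla = 4.339, CysCys = 0.16.
ala_ala = representation(4.339)
cys_cys = representation(0.16)
```

With these assumptions AlaAla (4.339‰) falls above the 2.5‰ expectation and CysCys (0.16‰) falls below it.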

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Within each group the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.339 ± 0.296
AlaCys: 0.534 ± 0.097
AlaAsp: 3.824 ± 0.303
AlaGlu: 4.908 ± 0.313
AlaPhe: 2.508 ± 0.231
AlaGly: 4.713 ± 0.42
AlaHis: 1.334 ± 0.159
AlaIle: 4.837 ± 0.295
AlaLys: 4.908 ± 0.347
AlaLeu: 5.78 ± 0.342
AlaMet: 2.721 ± 0.208
AlaAsn: 3.646 ± 0.303
AlaPro: 2.205 ± 0.223
AlaGln: 2.205 ± 0.235
AlaArg: 3.041 ± 0.261
AlaSer: 3.824 ± 0.266
AlaThr: 4.286 ± 0.504
AlaVal: 5.193 ± 0.3
AlaTrp: 0.925 ± 0.14
AlaTyr: 3.006 ± 0.245
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.978 ± 0.156
CysCys: 0.16 ± 0.049
CysAsp: 0.729 ± 0.124
CysGlu: 0.818 ± 0.148
CysPhe: 0.569 ± 0.106
CysGly: 0.996 ± 0.155
CysHis: 0.285 ± 0.072
CysIle: 0.622 ± 0.106
CysLys: 0.943 ± 0.129
CysLeu: 0.694 ± 0.115
CysMet: 0.373 ± 0.068
CysAsn: 0.534 ± 0.102
CysPro: 0.569 ± 0.1
CysGln: 0.231 ± 0.068
CysArg: 0.516 ± 0.098
CysSer: 0.8 ± 0.13
CysThr: 0.48 ± 0.092
CysVal: 0.765 ± 0.137
CysTrp: 0.142 ± 0.049
CysTyr: 0.498 ± 0.115
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.428 ± 0.328
AspCys: 0.694 ± 0.12
AspAsp: 4.428 ± 0.293
AspGlu: 4.446 ± 0.306
AspPhe: 2.774 ± 0.21
AspGly: 4.553 ± 0.312
AspHis: 1.138 ± 0.152
AspIle: 4.535 ± 0.29
AspLys: 4.588 ± 0.311
AspLeu: 5.282 ± 0.362
AspMet: 1.903 ± 0.194
AspAsn: 3.272 ± 0.26
AspPro: 2.703 ± 0.215
AspGln: 1.69 ± 0.174
AspArg: 2.721 ± 0.166
AspSer: 4.179 ± 0.268
AspThr: 3.646 ± 0.261
AspVal: 4.233 ± 0.301
AspTrp: 0.996 ± 0.136
AspTyr: 3.486 ± 0.286
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.371 ± 0.312
GluCys: 1.049 ± 0.139
GluAsp: 3.539 ± 0.264
GluGlu: 4.926 ± 0.335
GluPhe: 2.846 ± 0.23
GluGly: 3.237 ± 0.27
GluHis: 1.352 ± 0.171
GluIle: 5.691 ± 0.31
GluLys: 5.478 ± 0.389
GluLeu: 6.153 ± 0.306
GluMet: 2.152 ± 0.228
GluAsn: 3.557 ± 0.229
GluPro: 1.743 ± 0.207
GluGln: 2.685 ± 0.26
GluArg: 3.272 ± 0.263
GluSer: 3.717 ± 0.248
GluThr: 3.877 ± 0.239
GluVal: 4.055 ± 0.284
GluTrp: 1.031 ± 0.133
GluTyr: 2.508 ± 0.209
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.703 ± 0.219
PheCys: 0.462 ± 0.08
PheAsp: 3.841 ± 0.276
PheGlu: 2.525 ± 0.224
PhePhe: 1.387 ± 0.161
PheGly: 2.774 ± 0.247
PheHis: 0.658 ± 0.11
PheIle: 2.454 ± 0.217
PheLys: 2.952 ± 0.257
PheLeu: 2.614 ± 0.197
PheMet: 1.138 ± 0.142
PheAsn: 2.383 ± 0.238
PhePro: 1.387 ± 0.155
PheGln: 1.423 ± 0.17
PheArg: 1.974 ± 0.201
PheSer: 2.632 ± 0.243
PheThr: 2.579 ± 0.219
PheVal: 2.988 ± 0.2
PheTrp: 0.587 ± 0.089
PheTyr: 1.583 ± 0.191
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.073 ± 0.385
GlyCys: 0.854 ± 0.128
GlyAsp: 4.482 ± 0.325
GlyGlu: 3.806 ± 0.305
GlyPhe: 3.183 ± 0.294
GlyGly: 4.126 ± 0.406
GlyHis: 1.103 ± 0.152
GlyIle: 4.073 ± 0.262
GlyLys: 5.14 ± 0.311
GlyLeu: 4.357 ± 0.261
GlyMet: 1.743 ± 0.168
GlyAsn: 3.29 ± 0.317
GlyPro: 0.836 ± 0.117
GlyGln: 1.512 ± 0.177
GlyArg: 2.436 ± 0.227
GlySer: 4.411 ± 0.309
GlyThr: 3.735 ± 0.399
GlyVal: 5.3 ± 0.347
GlyTrp: 0.96 ± 0.155
GlyTyr: 3.664 ± 0.24
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.12 ± 0.133
HisCys: 0.213 ± 0.072
HisAsp: 1.014 ± 0.152
HisGlu: 1.174 ± 0.143
HisPhe: 0.96 ± 0.137
HisGly: 0.925 ± 0.166
HisHis: 0.409 ± 0.083
HisIle: 1.494 ± 0.166
HisLys: 1.174 ± 0.173
HisLeu: 1.28 ± 0.14
HisMet: 0.445 ± 0.094
HisAsn: 1.031 ± 0.137
HisPro: 0.907 ± 0.13
HisGln: 0.729 ± 0.122
HisArg: 1.174 ± 0.147
HisSer: 0.871 ± 0.142
HisThr: 1.067 ± 0.145
HisVal: 1.316 ± 0.144
HisTrp: 0.249 ± 0.075
HisTyr: 0.836 ± 0.138
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.66 ± 0.278
IleCys: 0.551 ± 0.089
IleAsp: 4.962 ± 0.277
IleGlu: 4.926 ± 0.325
IlePhe: 2.187 ± 0.176
IleGly: 3.557 ± 0.282
IleHis: 0.978 ± 0.114
IleIle: 3.841 ± 0.3
IleLys: 4.553 ± 0.326
IleLeu: 4.571 ± 0.3
IleMet: 1.743 ± 0.185
IleAsn: 3.841 ± 0.263
IlePro: 2.525 ± 0.243
IleGln: 2.01 ± 0.163
IleArg: 4.144 ± 0.251
IleSer: 3.824 ± 0.279
IleThr: 4.499 ± 0.268
IleVal: 4.66 ± 0.265
IleTrp: 0.765 ± 0.14
IleTyr: 2.543 ± 0.205
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.816 ± 0.366
LysCys: 0.871 ± 0.129
LysAsp: 4.606 ± 0.34
LysGlu: 5.069 ± 0.345
LysPhe: 2.934 ± 0.233
LysGly: 4.25 ± 0.316
LysHis: 1.725 ± 0.226
LysIle: 4.748 ± 0.288
LysLys: 4.695 ± 0.38
LysLeu: 6.029 ± 0.39
LysMet: 2.17 ± 0.213
LysAsn: 3.788 ± 0.306
LysPro: 2.846 ± 0.247
LysGln: 2.721 ± 0.231
LysArg: 3.806 ± 0.291
LysSer: 3.735 ± 0.297
LysThr: 4.642 ± 0.345
LysVal: 4.624 ± 0.329
LysTrp: 1.103 ± 0.153
LysTyr: 3.521 ± 0.289
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.389 ± 0.33
LeuCys: 0.871 ± 0.128
LeuAsp: 4.962 ± 0.29
LeuGlu: 4.624 ± 0.285
LeuPhe: 3.094 ± 0.245
LeuGly: 4.197 ± 0.255
LeuHis: 1.601 ± 0.151
LeuIle: 4.411 ± 0.352
LeuLys: 6.082 ± 0.403
LeuLeu: 4.944 ± 0.325
LeuMet: 2.454 ± 0.227
LeuAsn: 4.553 ± 0.282
LeuPro: 3.539 ± 0.246
LeuGln: 2.454 ± 0.24
LeuArg: 4.037 ± 0.285
LeuSer: 4.642 ± 0.248
LeuThr: 4.428 ± 0.284
LeuVal: 5.051 ± 0.272
LeuTrp: 0.836 ± 0.12
LeuTyr: 3.219 ± 0.237
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.778 ± 0.193
MetCys: 0.285 ± 0.082
MetAsp: 1.654 ± 0.173
MetGlu: 1.885 ± 0.191
MetPhe: 1.209 ± 0.14
MetGly: 1.529 ± 0.154
MetHis: 0.445 ± 0.084
MetIle: 2.081 ± 0.179
MetLys: 3.13 ± 0.241
MetLeu: 2.01 ± 0.199
MetMet: 0.818 ± 0.117
MetAsn: 1.761 ± 0.194
MetPro: 0.818 ± 0.111
MetGln: 1.28 ± 0.146
MetArg: 1.618 ± 0.184
MetSer: 1.921 ± 0.212
MetThr: 1.725 ± 0.172
MetVal: 1.512 ± 0.158
MetTrp: 0.373 ± 0.076
MetTyr: 1.209 ± 0.131
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.322 ± 0.287
AsnCys: 0.711 ± 0.116
AsnAsp: 3.148 ± 0.216
AsnGlu: 3.753 ± 0.255
AsnPhe: 2.187 ± 0.198
AsnGly: 4.357 ± 0.307
AsnHis: 0.996 ± 0.135
AsnIle: 3.521 ± 0.244
AsnLys: 3.468 ± 0.242
AsnLeu: 4.428 ± 0.278
AsnMet: 1.298 ± 0.148
AsnAsn: 3.112 ± 0.284
AsnPro: 2.703 ± 0.229
AsnGln: 1.939 ± 0.177
AsnArg: 2.454 ± 0.204
AsnSer: 2.917 ± 0.203
AsnThr: 3.486 ± 0.327
AsnVal: 3.77 ± 0.262
AsnTrp: 0.498 ± 0.092
AsnTyr: 1.796 ± 0.188
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.668 ± 0.217
ProCys: 0.48 ± 0.085
ProAsp: 2.81 ± 0.222
ProGlu: 3.148 ± 0.312
ProPhe: 1.565 ± 0.151
ProGly: 1.707 ± 0.191
ProHis: 0.622 ± 0.118
ProIle: 1.85 ± 0.198
ProLys: 2.508 ± 0.257
ProLeu: 2.703 ± 0.242
ProMet: 0.783 ± 0.125
ProAsn: 1.707 ± 0.156
ProPro: 1.067 ± 0.151
ProGln: 0.996 ± 0.126
ProArg: 1.743 ± 0.168
ProSer: 2.027 ± 0.206
ProThr: 2.454 ± 0.228
ProVal: 2.97 ± 0.223
ProTrp: 0.48 ± 0.083
ProTyr: 1.654 ± 0.149
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.561 ± 0.251
GlnCys: 0.409 ± 0.081
GlnAsp: 1.583 ± 0.171
GlnGlu: 1.796 ± 0.165
GlnPhe: 1.227 ± 0.123
GlnGly: 1.707 ± 0.183
GlnHis: 0.729 ± 0.106
GlnIle: 2.543 ± 0.225
GlnLys: 2.276 ± 0.191
GlnLeu: 2.774 ± 0.233
GlnMet: 1.156 ± 0.151
GlnAsn: 1.725 ± 0.192
GlnPro: 1.103 ± 0.167
GlnGln: 1.725 ± 0.201
GlnArg: 1.672 ± 0.191
GlnSer: 2.027 ± 0.19
GlnThr: 2.152 ± 0.193
GlnVal: 2.276 ± 0.215
GlnTrp: 0.48 ± 0.102
GlnTyr: 1.547 ± 0.16
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.774 ± 0.213
ArgCys: 0.427 ± 0.094
ArgAsp: 3.006 ± 0.186
ArgGlu: 3.735 ± 0.302
ArgPhe: 1.956 ± 0.202
ArgGly: 3.343 ± 0.265
ArgHis: 0.711 ± 0.118
ArgIle: 3.379 ± 0.241
ArgLys: 3.913 ± 0.303
ArgLeu: 3.628 ± 0.243
ArgMet: 1.547 ± 0.151
ArgAsn: 2.65 ± 0.205
ArgPro: 1.441 ± 0.157
ArgGln: 1.494 ± 0.163
ArgArg: 2.276 ± 0.226
ArgSer: 2.952 ± 0.235
ArgThr: 2.739 ± 0.204
ArgVal: 3.468 ± 0.271
ArgTrp: 0.871 ± 0.124
ArgTyr: 1.956 ± 0.188
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.521 ± 0.259
SerCys: 0.658 ± 0.111
SerAsp: 3.824 ± 0.289
SerGlu: 3.45 ± 0.249
SerPhe: 2.934 ± 0.255
SerGly: 5.015 ± 0.349
SerHis: 1.049 ± 0.139
SerIle: 3.859 ± 0.29
SerLys: 4.482 ± 0.291
SerLeu: 4.286 ± 0.259
SerMet: 1.601 ± 0.166
SerAsn: 3.183 ± 0.267
SerPro: 2.205 ± 0.224
SerGln: 1.796 ± 0.169
SerArg: 2.917 ± 0.218
SerSer: 3.379 ± 0.258
SerThr: 3.237 ± 0.251
SerVal: 4.446 ± 0.284
SerTrp: 0.658 ± 0.116
SerTyr: 2.17 ± 0.218
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.055 ± 0.399
ThrCys: 0.551 ± 0.101
ThrAsp: 3.681 ± 0.295
ThrGlu: 4.019 ± 0.273
ThrPhe: 2.401 ± 0.216
ThrGly: 4.677 ± 0.414
ThrHis: 1.067 ± 0.139
ThrIle: 3.895 ± 0.253
ThrLys: 4.073 ± 0.226
ThrLeu: 5.246 ± 0.299
ThrMet: 1.28 ± 0.15
ThrAsn: 3.023 ± 0.303
ThrPro: 2.881 ± 0.241
ThrGln: 2.116 ± 0.194
ThrArg: 2.65 ± 0.213
ThrSer: 3.255 ± 0.239
ThrThr: 3.379 ± 0.337
ThrVal: 4.748 ± 0.372
ThrTrp: 0.747 ± 0.12
ThrTyr: 2.241 ± 0.228
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.713 ± 0.312
ValCys: 1.12 ± 0.174
ValAsp: 5.478 ± 0.351
ValGlu: 5.157 ± 0.31
ValPhe: 3.041 ± 0.238
ValGly: 3.913 ± 0.214
ValHis: 1.085 ± 0.125
ValIle: 4.162 ± 0.245
ValLys: 4.766 ± 0.309
ValLeu: 4.695 ± 0.272
ValMet: 1.885 ± 0.169
ValAsn: 4.25 ± 0.284
ValPro: 2.774 ± 0.216
ValGln: 2.383 ± 0.223
ValArg: 3.201 ± 0.219
ValSer: 4.428 ± 0.298
ValThr: 4.144 ± 0.411
ValVal: 5.318 ± 0.3
ValTrp: 0.978 ± 0.132
ValTyr: 3.735 ± 0.252
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.711 ± 0.113
TrpCys: 0.142 ± 0.049
TrpAsp: 1.12 ± 0.145
TrpGlu: 0.871 ± 0.125
TrpPhe: 0.569 ± 0.095
TrpGly: 0.836 ± 0.13
TrpHis: 0.285 ± 0.066
TrpIle: 0.729 ± 0.142
TrpLys: 1.423 ± 0.169
TrpLeu: 1.014 ± 0.15
TrpMet: 0.569 ± 0.131
TrpAsn: 0.747 ± 0.12
TrpPro: 0.267 ± 0.079
TrpGln: 0.516 ± 0.094
TrpArg: 0.569 ± 0.1
TrpSer: 0.534 ± 0.103
TrpThr: 0.729 ± 0.112
TrpVal: 0.925 ± 0.144
TrpTrp: 0.231 ± 0.077
TrpTyr: 0.729 ± 0.107
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.774 ± 0.235
TyrCys: 0.658 ± 0.1
TyrAsp: 3.006 ± 0.223
TyrGlu: 3.201 ± 0.216
TyrPhe: 1.512 ± 0.169
TyrGly: 2.668 ± 0.217
TyrHis: 0.818 ± 0.126
TyrIle: 2.614 ± 0.173
TyrLys: 3.148 ± 0.277
TyrLeu: 2.899 ± 0.206
TyrMet: 1.245 ± 0.147
TyrAsn: 2.81 ± 0.23
TyrPro: 1.423 ± 0.163
TyrGln: 1.565 ± 0.178
TyrArg: 2.045 ± 0.195
TyrSer: 2.597 ± 0.214
TyrThr: 2.703 ± 0.235
TyrVal: 3.61 ± 0.281
TyrTrp: 0.605 ± 0.1
TyrTyr: 1.796 ± 0.191
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 270 proteins (56230 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
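A protein-level bootstrap of this kind can be sketched in Python as shown below. The sketch assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation of the per-replicate frequencies; both are assumptions, since the page does not spell out the procedure. The names pair_permille and bootstrap_error, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the RB16 proteome).
error = bootstrap_error(["MKKL", "AKK", "GGGA", "KKAA"], "KK")
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact in every replicate.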

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski