Amino acid dipepetide frequency for Klebsiella phage vB_KleM_RaK2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.549AlaAla: 2.549 ± 0.26
0.46AlaCys: 0.46 ± 0.065
2.175AlaAsp: 2.175 ± 0.144
2.683AlaGlu: 2.683 ± 0.181
2.089AlaPhe: 2.089 ± 0.153
2.223AlaGly: 2.223 ± 0.212
0.69AlaHis: 0.69 ± 0.07
3.19AlaIle: 3.19 ± 0.193
2.874AlaLys: 2.874 ± 0.178
3.669AlaLeu: 3.669 ± 0.224
1.341AlaMet: 1.341 ± 0.114
2.328AlaAsn: 2.328 ± 0.143
1.274AlaPro: 1.274 ± 0.116
1.514AlaGln: 1.514 ± 0.151
1.772AlaArg: 1.772 ± 0.137
2.731AlaSer: 2.731 ± 0.204
2.06AlaThr: 2.06 ± 0.177
2.74AlaVal: 2.74 ± 0.182
0.489AlaTrp: 0.489 ± 0.066
2.089AlaTyr: 2.089 ± 0.127
0.0AlaXaa: 0.0 ± 0.0
Cys
0.661CysAla: 0.661 ± 0.087
0.278CysCys: 0.278 ± 0.047
0.92CysAsp: 0.92 ± 0.104
0.862CysGlu: 0.862 ± 0.106
0.45CysPhe: 0.45 ± 0.068
0.901CysGly: 0.901 ± 0.1
0.326CysHis: 0.326 ± 0.06
1.351CysIle: 1.351 ± 0.128
1.083CysLys: 1.083 ± 0.103
1.016CysLeu: 1.016 ± 0.1
0.172CysMet: 0.172 ± 0.043
1.178CysAsn: 1.178 ± 0.11
0.489CysPro: 0.489 ± 0.083
0.393CysGln: 0.393 ± 0.064
0.412CysArg: 0.412 ± 0.062
1.313CysSer: 1.313 ± 0.13
1.37CysThr: 1.37 ± 0.12
0.92CysVal: 0.92 ± 0.095
0.105CysTrp: 0.105 ± 0.03
0.776CysTyr: 0.776 ± 0.092
0.0CysXaa: 0.0 ± 0.0
Asp
2.961AspAla: 2.961 ± 0.169
1.016AspCys: 1.016 ± 0.115
4.273AspAsp: 4.273 ± 0.259
4.656AspGlu: 4.656 ± 0.256
3.612AspPhe: 3.612 ± 0.202
3.986AspGly: 3.986 ± 0.216
0.901AspHis: 0.901 ± 0.096
6.046AspIle: 6.046 ± 0.263
4.062AspLys: 4.062 ± 0.207
5.164AspLeu: 5.164 ± 0.26
1.878AspMet: 1.878 ± 0.123
4.139AspAsn: 4.139 ± 0.24
1.935AspPro: 1.935 ± 0.142
1.14AspGln: 1.14 ± 0.096
1.686AspArg: 1.686 ± 0.13
4.915AspSer: 4.915 ± 0.264
3.44AspThr: 3.44 ± 0.213
4.149AspVal: 4.149 ± 0.212
0.881AspTrp: 0.881 ± 0.086
3.947AspTyr: 3.947 ± 0.21
0.0AspXaa: 0.0 ± 0.0
Glu
2.788GluAla: 2.788 ± 0.177
1.054GluCys: 1.054 ± 0.113
3.947GluAsp: 3.947 ± 0.23
5.174GluGlu: 5.174 ± 0.319
3.88GluPhe: 3.88 ± 0.203
2.328GluGly: 2.328 ± 0.172
1.571GluHis: 1.571 ± 0.136
5.72GluIle: 5.72 ± 0.254
4.877GluLys: 4.877 ± 0.263
7.195GluLeu: 7.195 ± 0.299
2.089GluMet: 2.089 ± 0.163
4.072GluAsn: 4.072 ± 0.199
1.648GluPro: 1.648 ± 0.12
2.29GluGln: 2.29 ± 0.15
2.414GluArg: 2.414 ± 0.169
4.359GluSer: 4.359 ± 0.226
3.085GluThr: 3.085 ± 0.175
4.388GluVal: 4.388 ± 0.229
0.891GluTrp: 0.891 ± 0.09
4.273GluTyr: 4.273 ± 0.221
0.0GluXaa: 0.0 ± 0.0
Phe
1.705PheAla: 1.705 ± 0.13
0.719PheCys: 0.719 ± 0.089
3.784PheAsp: 3.784 ± 0.181
2.98PheGlu: 2.98 ± 0.171
1.993PhePhe: 1.993 ± 0.137
2.654PheGly: 2.654 ± 0.167
0.795PheHis: 0.795 ± 0.087
3.804PheIle: 3.804 ± 0.201
3.487PheLys: 3.487 ± 0.214
2.941PheLeu: 2.941 ± 0.201
1.255PheMet: 1.255 ± 0.102
3.622PheAsn: 3.622 ± 0.195
1.083PhePro: 1.083 ± 0.103
1.753PheGln: 1.753 ± 0.135
1.217PheArg: 1.217 ± 0.1
4.053PheSer: 4.053 ± 0.175
2.462PheThr: 2.462 ± 0.173
2.913PheVal: 2.913 ± 0.171
0.469PheTrp: 0.469 ± 0.068
2.51PheTyr: 2.51 ± 0.167
0.0PheXaa: 0.0 ± 0.0
Gly
2.175GlyAla: 2.175 ± 0.185
0.747GlyCys: 0.747 ± 0.101
3.075GlyAsp: 3.075 ± 0.195
2.807GlyGlu: 2.807 ± 0.189
2.663GlyPhe: 2.663 ± 0.153
2.481GlyGly: 2.481 ± 0.217
0.69GlyHis: 0.69 ± 0.08
4.158GlyIle: 4.158 ± 0.22
3.727GlyLys: 3.727 ± 0.207
3.717GlyLeu: 3.717 ± 0.225
1.169GlyMet: 1.169 ± 0.13
3.411GlyAsn: 3.411 ± 0.25
0.422GlyPro: 0.422 ± 0.066
1.332GlyGln: 1.332 ± 0.149
1.811GlyArg: 1.811 ± 0.133
4.302GlySer: 4.302 ± 0.239
4.58GlyThr: 4.58 ± 0.329
3.545GlyVal: 3.545 ± 0.252
0.575GlyTrp: 0.575 ± 0.082
3.382GlyTyr: 3.382 ± 0.192
0.0GlyXaa: 0.0 ± 0.0
His
0.642HisAla: 0.642 ± 0.085
0.45HisCys: 0.45 ± 0.085
1.246HisAsp: 1.246 ± 0.118
1.322HisGlu: 1.322 ± 0.113
0.987HisPhe: 0.987 ± 0.102
0.881HisGly: 0.881 ± 0.104
0.364HisHis: 0.364 ± 0.064
1.437HisIle: 1.437 ± 0.143
1.61HisLys: 1.61 ± 0.124
1.428HisLeu: 1.428 ± 0.112
0.45HisMet: 0.45 ± 0.073
1.475HisAsn: 1.475 ± 0.124
0.632HisPro: 0.632 ± 0.09
0.489HisGln: 0.489 ± 0.082
0.709HisArg: 0.709 ± 0.079
1.303HisSer: 1.303 ± 0.116
0.939HisThr: 0.939 ± 0.083
1.035HisVal: 1.035 ± 0.103
0.211HisTrp: 0.211 ± 0.04
1.207HisTyr: 1.207 ± 0.121
0.0HisXaa: 0.0 ± 0.0
Ile
3.277IleAla: 3.277 ± 0.217
1.178IleCys: 1.178 ± 0.115
5.509IleAsp: 5.509 ± 0.235
5.979IleGlu: 5.979 ± 0.262
2.999IlePhe: 2.999 ± 0.184
3.344IleGly: 3.344 ± 0.169
1.83IleHis: 1.83 ± 0.146
5.854IleIle: 5.854 ± 0.282
6.707IleLys: 6.707 ± 0.309
5.959IleLeu: 5.959 ± 0.286
2.06IleMet: 2.06 ± 0.146
6.017IleAsn: 6.017 ± 0.254
2.635IlePro: 2.635 ± 0.17
3.353IleGln: 3.353 ± 0.166
2.903IleArg: 2.903 ± 0.189
6.544IleSer: 6.544 ± 0.306
4.608IleThr: 4.608 ± 0.225
4.455IleVal: 4.455 ± 0.211
0.671IleTrp: 0.671 ± 0.083
3.392IleTyr: 3.392 ± 0.173
0.0IleXaa: 0.0 ± 0.0
Lys
2.788LysAla: 2.788 ± 0.208
1.389LysCys: 1.389 ± 0.136
4.723LysAsp: 4.723 ± 0.226
6.228LysGlu: 6.228 ± 0.326
3.622LysPhe: 3.622 ± 0.194
2.989LysGly: 2.989 ± 0.152
1.782LysHis: 1.782 ± 0.15
6.362LysIle: 6.362 ± 0.279
5.998LysLys: 5.998 ± 0.351
5.816LysLeu: 5.816 ± 0.268
2.491LysMet: 2.491 ± 0.187
5.95LysAsn: 5.95 ± 0.287
2.108LysPro: 2.108 ± 0.154
2.616LysGln: 2.616 ± 0.2
3.162LysArg: 3.162 ± 0.201
4.963LysSer: 4.963 ± 0.212
4.091LysThr: 4.091 ± 0.205
4.264LysVal: 4.264 ± 0.208
0.69LysTrp: 0.69 ± 0.079
4.608LysTyr: 4.608 ± 0.231
0.0LysXaa: 0.0 ± 0.0
Leu
2.932LeuAla: 2.932 ± 0.175
1.303LeuCys: 1.303 ± 0.117
5.03LeuAsp: 5.03 ± 0.222
5.04LeuGlu: 5.04 ± 0.276
3.583LeuPhe: 3.583 ± 0.188
3.468LeuGly: 3.468 ± 0.197
1.495LeuHis: 1.495 ± 0.118
6.046LeuIle: 6.046 ± 0.244
6.764LeuLys: 6.764 ± 0.27
6.141LeuLeu: 6.141 ± 0.338
2.117LeuMet: 2.117 ± 0.16
6.065LeuAsn: 6.065 ± 0.257
2.798LeuPro: 2.798 ± 0.191
3.219LeuGln: 3.219 ± 0.198
3.19LeuArg: 3.19 ± 0.21
6.295LeuSer: 6.295 ± 0.237
4.005LeuThr: 4.005 ± 0.205
4.57LeuVal: 4.57 ± 0.213
0.824LeuTrp: 0.824 ± 0.081
4.685LeuTyr: 4.685 ± 0.229
0.0LeuXaa: 0.0 ± 0.0
Met
1.284MetAla: 1.284 ± 0.104
0.249MetCys: 0.249 ± 0.047
1.389MetAsp: 1.389 ± 0.132
1.466MetGlu: 1.466 ± 0.122
1.389MetPhe: 1.389 ± 0.124
1.006MetGly: 1.006 ± 0.104
0.565MetHis: 0.565 ± 0.084
2.117MetIle: 2.117 ± 0.154
2.865MetLys: 2.865 ± 0.195
2.022MetLeu: 2.022 ± 0.15
0.623MetMet: 0.623 ± 0.08
2.146MetAsn: 2.146 ± 0.165
0.594MetPro: 0.594 ± 0.065
0.987MetGln: 0.987 ± 0.092
1.131MetArg: 1.131 ± 0.109
1.983MetSer: 1.983 ± 0.137
1.552MetThr: 1.552 ± 0.119
1.313MetVal: 1.313 ± 0.122
0.278MetTrp: 0.278 ± 0.049
1.36MetTyr: 1.36 ± 0.129
0.0MetXaa: 0.0 ± 0.0
Asn
3.181AsnAla: 3.181 ± 0.177
0.949AsnCys: 0.949 ± 0.109
4.283AsnAsp: 4.283 ± 0.193
4.743AsnGlu: 4.743 ± 0.228
2.74AsnPhe: 2.74 ± 0.16
4.963AsnGly: 4.963 ± 0.308
1.102AsnHis: 1.102 ± 0.103
5.988AsnIle: 5.988 ± 0.271
5.413AsnLys: 5.413 ± 0.303
4.771AsnLeu: 4.771 ± 0.227
1.753AsnMet: 1.753 ± 0.141
5.174AsnAsn: 5.174 ± 0.268
2.472AsnPro: 2.472 ± 0.159
1.61AsnGln: 1.61 ± 0.142
2.117AsnArg: 2.117 ± 0.114
5.787AsnSer: 5.787 ± 0.256
5.279AsnThr: 5.279 ± 0.314
4.158AsnVal: 4.158 ± 0.204
0.546AsnTrp: 0.546 ± 0.078
3.497AsnTyr: 3.497 ± 0.23
0.0AsnXaa: 0.0 ± 0.0
Pro
1.178ProAla: 1.178 ± 0.102
0.402ProCys: 0.402 ± 0.06
2.366ProAsp: 2.366 ± 0.154
2.366ProGlu: 2.366 ± 0.161
1.265ProPhe: 1.265 ± 0.101
1.581ProGly: 1.581 ± 0.111
0.594ProHis: 0.594 ± 0.073
1.619ProIle: 1.619 ± 0.114
1.983ProLys: 1.983 ± 0.179
1.897ProLeu: 1.897 ± 0.125
0.68ProMet: 0.68 ± 0.09
1.897ProAsn: 1.897 ± 0.122
0.661ProPro: 0.661 ± 0.105
1.063ProGln: 1.063 ± 0.117
0.853ProArg: 0.853 ± 0.091
2.041ProSer: 2.041 ± 0.143
1.849ProThr: 1.849 ± 0.165
2.434ProVal: 2.434 ± 0.168
0.287ProTrp: 0.287 ± 0.062
1.725ProTyr: 1.725 ± 0.139
0.0ProXaa: 0.0 ± 0.0
Gln
1.456GlnAla: 1.456 ± 0.116
0.537GlnCys: 0.537 ± 0.081
2.127GlnAsp: 2.127 ± 0.165
2.51GlnGlu: 2.51 ± 0.153
1.36GlnPhe: 1.36 ± 0.121
1.638GlnGly: 1.638 ± 0.14
0.594GlnHis: 0.594 ± 0.076
2.29GlnIle: 2.29 ± 0.14
2.606GlnLys: 2.606 ± 0.149
2.702GlnLeu: 2.702 ± 0.183
1.044GlnMet: 1.044 ± 0.096
2.386GlnAsn: 2.386 ± 0.166
0.91GlnPro: 0.91 ± 0.102
1.475GlnGln: 1.475 ± 0.149
1.246GlnArg: 1.246 ± 0.097
2.223GlnSer: 2.223 ± 0.17
1.514GlnThr: 1.514 ± 0.13
1.495GlnVal: 1.495 ± 0.111
0.374GlnTrp: 0.374 ± 0.069
2.759GlnTyr: 2.759 ± 0.162
0.0GlnXaa: 0.0 ± 0.0
Arg
1.246ArgAla: 1.246 ± 0.128
0.671ArgCys: 0.671 ± 0.076
2.204ArgAsp: 2.204 ± 0.159
2.098ArgGlu: 2.098 ± 0.159
1.475ArgPhe: 1.475 ± 0.144
1.705ArgGly: 1.705 ± 0.126
0.699ArgHis: 0.699 ± 0.083
2.932ArgIle: 2.932 ± 0.152
2.769ArgLys: 2.769 ± 0.145
2.759ArgLeu: 2.759 ± 0.164
1.025ArgMet: 1.025 ± 0.101
2.299ArgAsn: 2.299 ± 0.142
0.834ArgPro: 0.834 ± 0.088
1.092ArgGln: 1.092 ± 0.108
1.303ArgArg: 1.303 ± 0.107
2.558ArgSer: 2.558 ± 0.162
2.242ArgThr: 2.242 ± 0.156
2.242ArgVal: 2.242 ± 0.161
0.469ArgTrp: 0.469 ± 0.076
1.859ArgTyr: 1.859 ± 0.141
0.0ArgXaa: 0.0 ± 0.0
Ser
2.932SerAla: 2.932 ± 0.214
0.872SerCys: 0.872 ± 0.098
4.743SerAsp: 4.743 ± 0.245
4.446SerGlu: 4.446 ± 0.208
3.305SerPhe: 3.305 ± 0.147
4.225SerGly: 4.225 ± 0.314
0.996SerHis: 0.996 ± 0.11
5.777SerIle: 5.777 ± 0.239
5.011SerLys: 5.011 ± 0.196
5.586SerLeu: 5.586 ± 0.25
1.59SerMet: 1.59 ± 0.138
5.212SerAsn: 5.212 ± 0.326
2.022SerPro: 2.022 ± 0.141
1.82SerGln: 1.82 ± 0.132
2.405SerArg: 2.405 ± 0.127
5.883SerSer: 5.883 ± 0.323
8.173SerThr: 8.173 ± 0.318
5.107SerVal: 5.107 ± 0.208
0.834SerTrp: 0.834 ± 0.086
4.206SerTyr: 4.206 ± 0.24
0.0SerXaa: 0.0 ± 0.0
Thr
2.616ThrAla: 2.616 ± 0.219
0.546ThrCys: 0.546 ± 0.072
4.024ThrAsp: 4.024 ± 0.216
5.059ThrGlu: 5.059 ± 0.254
2.673ThrPhe: 2.673 ± 0.181
3.995ThrGly: 3.995 ± 0.284
0.881ThrHis: 0.881 ± 0.095
4.934ThrIle: 4.934 ± 0.214
4.915ThrLys: 4.915 ± 0.215
4.551ThrLeu: 4.551 ± 0.222
1.437ThrMet: 1.437 ± 0.105
4.168ThrAsn: 4.168 ± 0.281
2.127ThrPro: 2.127 ± 0.154
1.907ThrGln: 1.907 ± 0.137
1.916ThrArg: 1.916 ± 0.144
4.618ThrSer: 4.618 ± 0.357
3.44ThrThr: 3.44 ± 0.276
4.695ThrVal: 4.695 ± 0.249
0.661ThrTrp: 0.661 ± 0.075
3.047ThrTyr: 3.047 ± 0.207
0.0ThrXaa: 0.0 ± 0.0
Val
2.137ValAla: 2.137 ± 0.174
0.805ValCys: 0.805 ± 0.087
3.986ValAsp: 3.986 ± 0.23
3.756ValGlu: 3.756 ± 0.196
2.826ValPhe: 2.826 ± 0.177
2.778ValGly: 2.778 ± 0.187
1.514ValHis: 1.514 ± 0.123
4.292ValIle: 4.292 ± 0.193
4.062ValLys: 4.062 ± 0.191
7.78ValLeu: 7.78 ± 0.27
1.523ValMet: 1.523 ± 0.121
3.507ValAsn: 3.507 ± 0.172
2.635ValPro: 2.635 ± 0.184
3.219ValGln: 3.219 ± 0.162
1.859ValArg: 1.859 ± 0.135
4.206ValSer: 4.206 ± 0.226
3.2ValThr: 3.2 ± 0.209
3.88ValVal: 3.88 ± 0.23
0.594ValTrp: 0.594 ± 0.08
3.574ValTyr: 3.574 ± 0.181
0.0ValXaa: 0.0 ± 0.0
Trp
0.402TrpAla: 0.402 ± 0.069
0.192TrpCys: 0.192 ± 0.039
0.69TrpAsp: 0.69 ± 0.092
0.642TrpGlu: 0.642 ± 0.079
0.527TrpPhe: 0.527 ± 0.073
0.508TrpGly: 0.508 ± 0.069
0.268TrpHis: 0.268 ± 0.047
0.776TrpIle: 0.776 ± 0.091
0.872TrpLys: 0.872 ± 0.087
0.776TrpLeu: 0.776 ± 0.089
0.307TrpMet: 0.307 ± 0.058
0.862TrpAsn: 0.862 ± 0.092
0.105TrpPro: 0.105 ± 0.03
0.259TrpGln: 0.259 ± 0.057
0.345TrpArg: 0.345 ± 0.051
0.872TrpSer: 0.872 ± 0.102
0.632TrpThr: 0.632 ± 0.086
0.776TrpVal: 0.776 ± 0.104
0.105TrpTrp: 0.105 ± 0.031
0.604TrpTyr: 0.604 ± 0.081
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.041TyrAla: 2.041 ± 0.139
1.159TyrCys: 1.159 ± 0.109
4.292TyrAsp: 4.292 ± 0.227
3.229TyrGlu: 3.229 ± 0.21
2.759TyrPhe: 2.759 ± 0.176
3.114TyrGly: 3.114 ± 0.194
1.159TyrHis: 1.159 ± 0.111
4.58TyrIle: 4.58 ± 0.225
4.953TyrLys: 4.953 ± 0.246
3.679TyrLeu: 3.679 ± 0.198
1.303TyrMet: 1.303 ± 0.108
4.618TyrAsn: 4.618 ± 0.219
1.36TyrPro: 1.36 ± 0.111
1.715TyrGln: 1.715 ± 0.139
2.031TyrArg: 2.031 ± 0.146
4.196TyrSer: 4.196 ± 0.232
3.689TyrThr: 3.689 ± 0.204
3.056TyrVal: 3.056 ± 0.159
0.565TyrTrp: 0.565 ± 0.072
3.171TyrTyr: 3.171 ± 0.221
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 534 proteins (104375 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski