Amino acid dipepetide frequency for Citrobacter phage Miller

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.529AlaAla: 4.529 ± 0.301
0.566AlaCys: 0.566 ± 0.097
4.051AlaAsp: 4.051 ± 0.271
4.847AlaGlu: 4.847 ± 0.362
2.494AlaPhe: 2.494 ± 0.203
4.547AlaGly: 4.547 ± 0.35
1.256AlaHis: 1.256 ± 0.146
4.67AlaIle: 4.67 ± 0.332
4.883AlaLys: 4.883 ± 0.297
6.05AlaLeu: 6.05 ± 0.341
2.636AlaMet: 2.636 ± 0.229
3.821AlaAsn: 3.821 ± 0.256
2.194AlaPro: 2.194 ± 0.19
2.052AlaGln: 2.052 ± 0.21
3.202AlaArg: 3.202 ± 0.273
3.556AlaSer: 3.556 ± 0.238
4.246AlaThr: 4.246 ± 0.483
5.042AlaVal: 5.042 ± 0.352
1.026AlaTrp: 1.026 ± 0.128
3.131AlaTyr: 3.131 ± 0.222
0.0AlaXaa: 0.0 ± 0.0
Cys
1.008CysAla: 1.008 ± 0.141
0.159CysCys: 0.159 ± 0.055
0.708CysAsp: 0.708 ± 0.113
0.778CysGlu: 0.778 ± 0.129
0.637CysPhe: 0.637 ± 0.113
0.991CysGly: 0.991 ± 0.149
0.248CysHis: 0.248 ± 0.065
0.725CysIle: 0.725 ± 0.118
0.938CysLys: 0.938 ± 0.135
0.619CysLeu: 0.619 ± 0.109
0.425CysMet: 0.425 ± 0.073
0.584CysAsn: 0.584 ± 0.108
0.531CysPro: 0.531 ± 0.115
0.23CysGln: 0.23 ± 0.063
0.442CysArg: 0.442 ± 0.089
0.672CysSer: 0.672 ± 0.112
0.478CysThr: 0.478 ± 0.088
0.867CysVal: 0.867 ± 0.144
0.159CysTrp: 0.159 ± 0.049
0.425CysTyr: 0.425 ± 0.084
0.0CysXaa: 0.0 ± 0.0
Asp
4.582AspAla: 4.582 ± 0.296
0.69AspCys: 0.69 ± 0.109
4.494AspAsp: 4.494 ± 0.289
4.6AspGlu: 4.6 ± 0.259
2.99AspPhe: 2.99 ± 0.225
4.564AspGly: 4.564 ± 0.313
1.044AspHis: 1.044 ± 0.153
4.865AspIle: 4.865 ± 0.262
4.334AspLys: 4.334 ± 0.302
5.413AspLeu: 5.413 ± 0.346
1.769AspMet: 1.769 ± 0.201
3.308AspAsn: 3.308 ± 0.246
2.795AspPro: 2.795 ± 0.231
1.734AspGln: 1.734 ± 0.192
2.884AspArg: 2.884 ± 0.257
3.91AspSer: 3.91 ± 0.266
3.644AspThr: 3.644 ± 0.269
4.458AspVal: 4.458 ± 0.287
1.026AspTrp: 1.026 ± 0.153
3.432AspTyr: 3.432 ± 0.268
0.0AspXaa: 0.0 ± 0.0
Glu
5.219GluAla: 5.219 ± 0.34
1.008GluCys: 1.008 ± 0.148
3.609GluAsp: 3.609 ± 0.238
5.095GluGlu: 5.095 ± 0.352
2.937GluPhe: 2.937 ± 0.201
3.202GluGly: 3.202 ± 0.285
1.203GluHis: 1.203 ± 0.183
5.679GluIle: 5.679 ± 0.356
5.75GluLys: 5.75 ± 0.388
6.44GluLeu: 6.44 ± 0.328
2.176GluMet: 2.176 ± 0.196
3.627GluAsn: 3.627 ± 0.261
1.751GluPro: 1.751 ± 0.209
2.724GluGln: 2.724 ± 0.277
3.167GluArg: 3.167 ± 0.219
3.715GluSer: 3.715 ± 0.285
3.857GluThr: 3.857 ± 0.255
3.945GluVal: 3.945 ± 0.308
1.115GluTrp: 1.115 ± 0.141
2.848GluTyr: 2.848 ± 0.243
0.0GluXaa: 0.0 ± 0.0
Phe
2.618PheAla: 2.618 ± 0.215
0.495PheCys: 0.495 ± 0.093
3.556PheAsp: 3.556 ± 0.304
2.884PheGlu: 2.884 ± 0.242
1.398PhePhe: 1.398 ± 0.164
2.654PheGly: 2.654 ± 0.225
0.584PheHis: 0.584 ± 0.097
2.53PheIle: 2.53 ± 0.196
2.884PheLys: 2.884 ± 0.254
2.654PheLeu: 2.654 ± 0.233
1.238PheMet: 1.238 ± 0.127
2.494PheAsn: 2.494 ± 0.199
1.345PhePro: 1.345 ± 0.187
1.468PheGln: 1.468 ± 0.175
1.999PheArg: 1.999 ± 0.24
2.724PheSer: 2.724 ± 0.235
2.548PheThr: 2.548 ± 0.226
3.131PheVal: 3.131 ± 0.227
0.601PheTrp: 0.601 ± 0.084
1.769PheTyr: 1.769 ± 0.182
0.0PheXaa: 0.0 ± 0.0
Gly
4.193GlyAla: 4.193 ± 0.411
0.92GlyCys: 0.92 ± 0.126
4.67GlyAsp: 4.67 ± 0.311
3.945GlyGlu: 3.945 ± 0.29
3.237GlyPhe: 3.237 ± 0.275
4.034GlyGly: 4.034 ± 0.432
1.256GlyHis: 1.256 ± 0.155
4.016GlyIle: 4.016 ± 0.253
4.971GlyLys: 4.971 ± 0.298
4.423GlyLeu: 4.423 ± 0.282
1.751GlyMet: 1.751 ± 0.188
3.273GlyAsn: 3.273 ± 0.362
0.885GlyPro: 0.885 ± 0.138
1.451GlyGln: 1.451 ± 0.185
2.53GlyArg: 2.53 ± 0.211
4.264GlySer: 4.264 ± 0.284
3.715GlyThr: 3.715 ± 0.381
4.918GlyVal: 4.918 ± 0.346
0.955GlyTrp: 0.955 ± 0.135
3.184GlyTyr: 3.184 ± 0.183
0.0GlyXaa: 0.0 ± 0.0
His
1.291HisAla: 1.291 ± 0.162
0.248HisCys: 0.248 ± 0.073
1.079HisAsp: 1.079 ± 0.161
1.221HisGlu: 1.221 ± 0.131
0.849HisPhe: 0.849 ± 0.121
1.008HisGly: 1.008 ± 0.152
0.46HisHis: 0.46 ± 0.093
1.486HisIle: 1.486 ± 0.157
1.132HisLys: 1.132 ± 0.134
1.362HisLeu: 1.362 ± 0.164
0.548HisMet: 0.548 ± 0.112
0.814HisAsn: 0.814 ± 0.123
1.061HisPro: 1.061 ± 0.133
0.601HisGln: 0.601 ± 0.118
1.008HisArg: 1.008 ± 0.151
0.761HisSer: 0.761 ± 0.117
0.955HisThr: 0.955 ± 0.13
1.38HisVal: 1.38 ± 0.148
0.212HisTrp: 0.212 ± 0.061
0.725HisTyr: 0.725 ± 0.112
0.0HisXaa: 0.0 ± 0.0
Ile
4.918IleAla: 4.918 ± 0.271
0.619IleCys: 0.619 ± 0.111
4.723IleAsp: 4.723 ± 0.271
4.865IleGlu: 4.865 ± 0.3
2.105IlePhe: 2.105 ± 0.182
3.733IleGly: 3.733 ± 0.288
1.15IleHis: 1.15 ± 0.138
3.538IleIle: 3.538 ± 0.283
4.777IleLys: 4.777 ± 0.409
4.777IleLeu: 4.777 ± 0.269
1.928IleMet: 1.928 ± 0.177
3.733IleAsn: 3.733 ± 0.257
2.583IlePro: 2.583 ± 0.215
1.84IleGln: 1.84 ± 0.17
3.963IleArg: 3.963 ± 0.269
3.521IleSer: 3.521 ± 0.295
4.6IleThr: 4.6 ± 0.292
4.883IleVal: 4.883 ± 0.28
0.796IleTrp: 0.796 ± 0.126
2.53IleTyr: 2.53 ± 0.205
0.0IleXaa: 0.0 ± 0.0
Lys
5.52LysAla: 5.52 ± 0.36
0.778LysCys: 0.778 ± 0.12
4.759LysAsp: 4.759 ± 0.311
5.254LysGlu: 5.254 ± 0.371
2.972LysPhe: 2.972 ± 0.258
3.874LysGly: 3.874 ± 0.262
1.698LysHis: 1.698 ± 0.201
4.706LysIle: 4.706 ± 0.266
4.812LysLys: 4.812 ± 0.349
6.21LysLeu: 6.21 ± 0.374
2.264LysMet: 2.264 ± 0.196
3.857LysAsn: 3.857 ± 0.25
2.689LysPro: 2.689 ± 0.222
2.848LysGln: 2.848 ± 0.237
4.069LysArg: 4.069 ± 0.329
3.503LysSer: 3.503 ± 0.244
4.317LysThr: 4.317 ± 0.305
4.971LysVal: 4.971 ± 0.284
1.221LysTrp: 1.221 ± 0.145
3.308LysTyr: 3.308 ± 0.299
0.0LysXaa: 0.0 ± 0.0
Leu
5.661LeuAla: 5.661 ± 0.363
1.008LeuCys: 1.008 ± 0.148
5.413LeuAsp: 5.413 ± 0.346
4.883LeuGlu: 4.883 ± 0.321
3.273LeuPhe: 3.273 ± 0.234
4.016LeuGly: 4.016 ± 0.237
1.38LeuHis: 1.38 ± 0.149
4.706LeuIle: 4.706 ± 0.329
5.891LeuLys: 5.891 ± 0.404
5.042LeuLeu: 5.042 ± 0.313
2.53LeuMet: 2.53 ± 0.217
4.44LeuAsn: 4.44 ± 0.23
3.609LeuPro: 3.609 ± 0.25
2.459LeuGln: 2.459 ± 0.246
4.016LeuArg: 4.016 ± 0.258
5.201LeuSer: 5.201 ± 0.3
4.157LeuThr: 4.157 ± 0.321
5.307LeuVal: 5.307 ± 0.329
0.831LeuTrp: 0.831 ± 0.115
3.184LeuTyr: 3.184 ± 0.236
0.0LeuXaa: 0.0 ± 0.0
Met
1.769MetAla: 1.769 ± 0.177
0.354MetCys: 0.354 ± 0.079
1.716MetAsp: 1.716 ± 0.173
1.858MetGlu: 1.858 ± 0.198
1.38MetPhe: 1.38 ± 0.166
1.698MetGly: 1.698 ± 0.167
0.548MetHis: 0.548 ± 0.104
1.981MetIle: 1.981 ± 0.196
3.078MetLys: 3.078 ± 0.237
2.158MetLeu: 2.158 ± 0.194
0.973MetMet: 0.973 ± 0.129
1.928MetAsn: 1.928 ± 0.176
0.796MetPro: 0.796 ± 0.115
1.291MetGln: 1.291 ± 0.147
1.486MetArg: 1.486 ± 0.18
1.964MetSer: 1.964 ± 0.198
1.716MetThr: 1.716 ± 0.192
1.504MetVal: 1.504 ± 0.153
0.283MetTrp: 0.283 ± 0.061
1.115MetTyr: 1.115 ± 0.132
0.0MetXaa: 0.0 ± 0.0
Asn
4.334AsnAla: 4.334 ± 0.298
0.531AsnCys: 0.531 ± 0.09
3.114AsnAsp: 3.114 ± 0.257
3.91AsnGlu: 3.91 ± 0.22
1.928AsnPhe: 1.928 ± 0.19
4.37AsnGly: 4.37 ± 0.298
0.92AsnHis: 0.92 ± 0.129
3.255AsnIle: 3.255 ± 0.195
3.574AsnLys: 3.574 ± 0.241
4.67AsnLeu: 4.67 ± 0.301
1.574AsnMet: 1.574 ± 0.155
3.043AsnAsn: 3.043 ± 0.276
2.866AsnPro: 2.866 ± 0.236
1.716AsnGln: 1.716 ± 0.174
2.601AsnArg: 2.601 ± 0.213
2.866AsnSer: 2.866 ± 0.194
3.397AsnThr: 3.397 ± 0.253
3.715AsnVal: 3.715 ± 0.271
0.478AsnTrp: 0.478 ± 0.086
1.628AsnTyr: 1.628 ± 0.199
0.0AsnXaa: 0.0 ± 0.0
Pro
2.548ProAla: 2.548 ± 0.222
0.478ProCys: 0.478 ± 0.087
3.114ProAsp: 3.114 ± 0.223
2.884ProGlu: 2.884 ± 0.259
1.663ProPhe: 1.663 ± 0.163
1.893ProGly: 1.893 ± 0.229
0.619ProHis: 0.619 ± 0.091
1.964ProIle: 1.964 ± 0.192
2.353ProLys: 2.353 ± 0.219
2.636ProLeu: 2.636 ± 0.202
0.637ProMet: 0.637 ± 0.11
1.716ProAsn: 1.716 ± 0.212
0.973ProPro: 0.973 ± 0.14
0.902ProGln: 0.902 ± 0.104
1.521ProArg: 1.521 ± 0.167
2.141ProSer: 2.141 ± 0.193
2.601ProThr: 2.601 ± 0.236
3.007ProVal: 3.007 ± 0.224
0.531ProTrp: 0.531 ± 0.1
1.946ProTyr: 1.946 ± 0.175
0.0ProXaa: 0.0 ± 0.0
Gln
2.282GlnAla: 2.282 ± 0.254
0.372GlnCys: 0.372 ± 0.083
1.628GlnAsp: 1.628 ± 0.193
1.928GlnGlu: 1.928 ± 0.177
1.345GlnPhe: 1.345 ± 0.15
1.822GlnGly: 1.822 ± 0.183
0.655GlnHis: 0.655 ± 0.105
2.3GlnIle: 2.3 ± 0.187
2.388GlnLys: 2.388 ± 0.257
2.618GlnLeu: 2.618 ± 0.228
1.168GlnMet: 1.168 ± 0.148
1.521GlnAsn: 1.521 ± 0.181
1.026GlnPro: 1.026 ± 0.141
1.468GlnGln: 1.468 ± 0.212
1.716GlnArg: 1.716 ± 0.192
1.804GlnSer: 1.804 ± 0.194
2.229GlnThr: 2.229 ± 0.195
2.123GlnVal: 2.123 ± 0.196
0.372GlnTrp: 0.372 ± 0.092
1.557GlnTyr: 1.557 ± 0.173
0.0GlnXaa: 0.0 ± 0.0
Arg
2.777ArgAla: 2.777 ± 0.199
0.425ArgCys: 0.425 ± 0.092
3.149ArgAsp: 3.149 ± 0.196
3.644ArgGlu: 3.644 ± 0.326
1.946ArgPhe: 1.946 ± 0.208
3.414ArgGly: 3.414 ± 0.232
0.725ArgHis: 0.725 ± 0.127
3.414ArgIle: 3.414 ± 0.282
3.68ArgLys: 3.68 ± 0.268
3.627ArgLeu: 3.627 ± 0.251
1.486ArgMet: 1.486 ± 0.16
2.795ArgAsn: 2.795 ± 0.23
1.433ArgPro: 1.433 ± 0.125
1.433ArgGln: 1.433 ± 0.152
2.247ArgArg: 2.247 ± 0.225
2.919ArgSer: 2.919 ± 0.225
2.618ArgThr: 2.618 ± 0.194
3.414ArgVal: 3.414 ± 0.266
0.831ArgTrp: 0.831 ± 0.139
2.141ArgTyr: 2.141 ± 0.211
0.0ArgXaa: 0.0 ± 0.0
Ser
3.591SerAla: 3.591 ± 0.262
0.619SerCys: 0.619 ± 0.11
3.609SerAsp: 3.609 ± 0.25
3.538SerGlu: 3.538 ± 0.259
2.884SerPhe: 2.884 ± 0.212
4.759SerGly: 4.759 ± 0.339
1.061SerHis: 1.061 ± 0.145
3.75SerIle: 3.75 ± 0.282
4.476SerLys: 4.476 ± 0.287
4.423SerLeu: 4.423 ± 0.295
1.645SerMet: 1.645 ± 0.165
3.061SerAsn: 3.061 ± 0.225
2.158SerPro: 2.158 ± 0.173
1.875SerGln: 1.875 ± 0.205
2.795SerArg: 2.795 ± 0.215
3.184SerSer: 3.184 ± 0.291
2.972SerThr: 2.972 ± 0.265
4.582SerVal: 4.582 ± 0.274
0.601SerTrp: 0.601 ± 0.12
1.999SerTyr: 1.999 ± 0.179
0.0SerXaa: 0.0 ± 0.0
Thr
3.874ThrAla: 3.874 ± 0.325
0.531ThrCys: 0.531 ± 0.093
3.521ThrAsp: 3.521 ± 0.278
3.786ThrGlu: 3.786 ± 0.236
2.477ThrPhe: 2.477 ± 0.225
4.529ThrGly: 4.529 ± 0.394
1.061ThrHis: 1.061 ± 0.138
3.786ThrIle: 3.786 ± 0.218
4.175ThrLys: 4.175 ± 0.236
5.183ThrLeu: 5.183 ± 0.287
1.309ThrMet: 1.309 ± 0.136
2.742ThrAsn: 2.742 ± 0.322
3.061ThrPro: 3.061 ± 0.255
1.911ThrGln: 1.911 ± 0.19
2.724ThrArg: 2.724 ± 0.18
3.291ThrSer: 3.291 ± 0.297
3.644ThrThr: 3.644 ± 0.343
4.741ThrVal: 4.741 ± 0.312
0.796ThrTrp: 0.796 ± 0.113
2.3ThrTyr: 2.3 ± 0.18
0.0ThrXaa: 0.0 ± 0.0
Val
4.476ValAla: 4.476 ± 0.325
1.061ValCys: 1.061 ± 0.141
5.555ValAsp: 5.555 ± 0.304
5.201ValGlu: 5.201 ± 0.292
2.99ValPhe: 2.99 ± 0.206
3.892ValGly: 3.892 ± 0.233
1.115ValHis: 1.115 ± 0.143
4.67ValIle: 4.67 ± 0.289
5.148ValLys: 5.148 ± 0.282
4.723ValLeu: 4.723 ± 0.305
1.84ValMet: 1.84 ± 0.163
4.317ValAsn: 4.317 ± 0.3
2.654ValPro: 2.654 ± 0.229
2.264ValGln: 2.264 ± 0.259
3.078ValArg: 3.078 ± 0.216
4.299ValSer: 4.299 ± 0.278
4.281ValThr: 4.281 ± 0.411
4.989ValVal: 4.989 ± 0.33
0.849ValTrp: 0.849 ± 0.129
3.574ValTyr: 3.574 ± 0.283
0.0ValXaa: 0.0 ± 0.0
Trp
0.761TrpAla: 0.761 ± 0.117
0.159TrpCys: 0.159 ± 0.055
0.991TrpAsp: 0.991 ± 0.12
1.008TrpGlu: 1.008 ± 0.151
0.566TrpPhe: 0.566 ± 0.099
0.831TrpGly: 0.831 ± 0.123
0.354TrpHis: 0.354 ± 0.069
0.69TrpIle: 0.69 ± 0.109
1.362TrpLys: 1.362 ± 0.159
1.008TrpLeu: 1.008 ± 0.148
0.619TrpMet: 0.619 ± 0.114
0.761TrpAsn: 0.761 ± 0.124
0.283TrpPro: 0.283 ± 0.067
0.513TrpGln: 0.513 ± 0.087
0.601TrpArg: 0.601 ± 0.119
0.513TrpSer: 0.513 ± 0.098
0.743TrpThr: 0.743 ± 0.104
0.849TrpVal: 0.849 ± 0.119
0.195TrpTrp: 0.195 ± 0.061
0.69TrpTyr: 0.69 ± 0.098
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.866TyrAla: 2.866 ± 0.249
0.601TyrCys: 0.601 ± 0.1
3.078TyrAsp: 3.078 ± 0.205
2.937TyrGlu: 2.937 ± 0.234
1.504TyrPhe: 1.504 ± 0.15
2.76TyrGly: 2.76 ± 0.231
0.867TyrHis: 0.867 ± 0.111
2.848TyrIle: 2.848 ± 0.213
3.025TyrLys: 3.025 ± 0.234
2.972TyrLeu: 2.972 ± 0.225
1.061TyrMet: 1.061 ± 0.15
2.654TyrAsn: 2.654 ± 0.231
1.433TyrPro: 1.433 ± 0.173
1.539TyrGln: 1.539 ± 0.154
2.07TyrArg: 2.07 ± 0.181
2.777TyrSer: 2.777 ± 0.23
2.724TyrThr: 2.724 ± 0.23
3.149TyrVal: 3.149 ± 0.225
0.619TyrTrp: 0.619 ± 0.102
1.751TyrTyr: 1.751 ± 0.184
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 277 proteins (56527 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski