Amino acid dipeptide frequency for Cronobacter phage S13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
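
The counting described above can be sketched in Python. This is only an illustration, not the code used to build this page: the function names, the one-letter alphabet, the mapping of every non-standard letter to X (Xaa), and the normalization by the total number of overlapping pairs are all assumptions, since the page does not state how the frequencies are normalized.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"           # X stands for Xaa (assumption: any other letter)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Pairs are counted with overlap and only within a protein,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"    # would be marked red
    if freq_permille < EXPECTED_PERMILLE:
        return "under"   # would be marked blue
    return "expected"
```

With the values from the table, `classify(4.07)` (AlaAla) gives "over" and `classify(0.531)` (AlaCys) gives "under".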

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
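
As a small worked example of this unit conversion, the following sketch turns a per mille value from the table into a plain fraction and into a percentage. The helper names are hypothetical and chosen only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value_permille / 10.0


# AlaAla is listed as 4.07 per mille in the table
alaala_fraction = permille_to_fraction(4.07)   # about 0.00407
alaala_percent = permille_to_percent(4.07)     # about 0.407 %
```

For comparison, the uniform expectation of 0.25% corresponds to 2.5‰ in the units of the table.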

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.07AlaAla: 4.07 ± 0.31
0.531AlaCys: 0.531 ± 0.097
3.91AlaAsp: 3.91 ± 0.245
4.07AlaGlu: 4.07 ± 0.307
2.743AlaPhe: 2.743 ± 0.192
4.565AlaGly: 4.565 ± 0.495
0.955AlaHis: 0.955 ± 0.123
4.424AlaIle: 4.424 ± 0.284
4.618AlaLys: 4.618 ± 0.307
5.273AlaLeu: 5.273 ± 0.31
1.911AlaMet: 1.911 ± 0.17
3.433AlaAsn: 3.433 ± 0.247
1.964AlaPro: 1.964 ± 0.214
2.07AlaGln: 2.07 ± 0.18
2.884AlaArg: 2.884 ± 0.257
3.132AlaSer: 3.132 ± 0.224
3.751AlaThr: 3.751 ± 0.451
4.034AlaVal: 4.034 ± 0.302
0.849AlaTrp: 0.849 ± 0.125
2.159AlaTyr: 2.159 ± 0.213
0.0AlaXaa: 0.0 ± 0.0
Cys
0.602CysAla: 0.602 ± 0.109
0.159CysCys: 0.159 ± 0.053
0.619CysAsp: 0.619 ± 0.103
0.69CysGlu: 0.69 ± 0.116
0.584CysPhe: 0.584 ± 0.119
0.725CysGly: 0.725 ± 0.133
0.283CysHis: 0.283 ± 0.066
0.566CysIle: 0.566 ± 0.112
0.885CysLys: 0.885 ± 0.13
0.955CysLeu: 0.955 ± 0.136
0.301CysMet: 0.301 ± 0.069
0.779CysAsn: 0.779 ± 0.093
0.372CysPro: 0.372 ± 0.096
0.372CysGln: 0.372 ± 0.085
0.549CysArg: 0.549 ± 0.1
0.672CysSer: 0.672 ± 0.108
0.513CysThr: 0.513 ± 0.093
0.796CysVal: 0.796 ± 0.131
0.106CysTrp: 0.106 ± 0.041
0.425CysTyr: 0.425 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
4.14AspAla: 4.14 ± 0.304
0.725AspCys: 0.725 ± 0.11
3.539AspAsp: 3.539 ± 0.295
4.654AspGlu: 4.654 ± 0.258
3.557AspPhe: 3.557 ± 0.274
4.477AspGly: 4.477 ± 0.288
0.867AspHis: 0.867 ± 0.126
4.884AspIle: 4.884 ± 0.318
4.105AspLys: 4.105 ± 0.252
4.99AspLeu: 4.99 ± 0.315
2.088AspMet: 2.088 ± 0.194
2.707AspAsn: 2.707 ± 0.229
2.3AspPro: 2.3 ± 0.231
1.911AspGln: 1.911 ± 0.203
2.778AspArg: 2.778 ± 0.249
4.229AspSer: 4.229 ± 0.258
3.273AspThr: 3.273 ± 0.271
4.954AspVal: 4.954 ± 0.327
0.92AspTrp: 0.92 ± 0.141
3.15AspTyr: 3.15 ± 0.226
0.0AspXaa: 0.0 ± 0.0
Glu
3.963GluAla: 3.963 ± 0.269
0.602GluCys: 0.602 ± 0.107
3.857GluAsp: 3.857 ± 0.284
4.459GluGlu: 4.459 ± 0.344
3.486GluPhe: 3.486 ± 0.283
3.433GluGly: 3.433 ± 0.244
1.362GluHis: 1.362 ± 0.137
6.016GluIle: 6.016 ± 0.349
5.344GluLys: 5.344 ± 0.316
6.565GluLeu: 6.565 ± 0.347
2.3GluMet: 2.3 ± 0.204
3.592GluAsn: 3.592 ± 0.268
1.628GluPro: 1.628 ± 0.162
2.548GluGln: 2.548 ± 0.194
3.397GluArg: 3.397 ± 0.272
4.512GluSer: 4.512 ± 0.349
4.017GluThr: 4.017 ± 0.267
4.83GluVal: 4.83 ± 0.254
1.097GluTrp: 1.097 ± 0.115
2.566GluTyr: 2.566 ± 0.191
0.0GluXaa: 0.0 ± 0.0
Phe
2.601PheAla: 2.601 ± 0.209
0.743PheCys: 0.743 ± 0.122
3.963PheAsp: 3.963 ± 0.276
3.521PheGlu: 3.521 ± 0.267
1.557PhePhe: 1.557 ± 0.16
2.796PheGly: 2.796 ± 0.205
0.779PheHis: 0.779 ± 0.11
2.902PheIle: 2.902 ± 0.239
3.698PheLys: 3.698 ± 0.27
2.672PheLeu: 2.672 ± 0.25
1.362PheMet: 1.362 ± 0.147
3.079PheAsn: 3.079 ± 0.265
1.309PhePro: 1.309 ± 0.155
1.221PheGln: 1.221 ± 0.147
2.053PheArg: 2.053 ± 0.222
2.406PheSer: 2.406 ± 0.19
2.459PheThr: 2.459 ± 0.183
3.486PheVal: 3.486 ± 0.265
0.566PheTrp: 0.566 ± 0.096
1.876PheTyr: 1.876 ± 0.211
0.0PheXaa: 0.0 ± 0.0
Gly
3.733GlyAla: 3.733 ± 0.3
0.743GlyCys: 0.743 ± 0.121
4.034GlyAsp: 4.034 ± 0.357
3.733GlyGlu: 3.733 ± 0.255
2.548GlyPhe: 2.548 ± 0.219
3.698GlyGly: 3.698 ± 0.485
1.239GlyHis: 1.239 ± 0.132
4.247GlyIle: 4.247 ± 0.253
5.237GlyLys: 5.237 ± 0.306
4.353GlyLeu: 4.353 ± 0.291
1.734GlyMet: 1.734 ± 0.173
3.521GlyAsn: 3.521 ± 0.363
1.362GlyPro: 1.362 ± 0.166
2.389GlyGln: 2.389 ± 0.244
3.15GlyArg: 3.15 ± 0.24
4.14GlySer: 4.14 ± 0.415
3.627GlyThr: 3.627 ± 0.304
4.353GlyVal: 4.353 ± 0.306
1.009GlyTrp: 1.009 ± 0.126
2.866GlyTyr: 2.866 ± 0.223
0.0GlyXaa: 0.0 ± 0.0
His
0.973HisAla: 0.973 ± 0.118
0.248HisCys: 0.248 ± 0.06
1.433HisAsp: 1.433 ± 0.18
1.38HisGlu: 1.38 ± 0.162
0.991HisPhe: 0.991 ± 0.138
1.168HisGly: 1.168 ± 0.147
0.389HisHis: 0.389 ± 0.092
1.522HisIle: 1.522 ± 0.181
1.345HisLys: 1.345 ± 0.133
1.362HisLeu: 1.362 ± 0.169
0.425HisMet: 0.425 ± 0.099
0.92HisAsn: 0.92 ± 0.135
0.885HisPro: 0.885 ± 0.108
0.495HisGln: 0.495 ± 0.107
0.725HisArg: 0.725 ± 0.111
0.973HisSer: 0.973 ± 0.146
0.796HisThr: 0.796 ± 0.108
1.433HisVal: 1.433 ± 0.131
0.318HisTrp: 0.318 ± 0.074
0.832HisTyr: 0.832 ± 0.131
0.0HisXaa: 0.0 ± 0.0
Ile
4.353IleAla: 4.353 ± 0.293
0.779IleCys: 0.779 ± 0.125
4.583IleAsp: 4.583 ± 0.26
5.361IleGlu: 5.361 ± 0.323
2.424IlePhe: 2.424 ± 0.164
3.875IleGly: 3.875 ± 0.293
1.734IleHis: 1.734 ± 0.175
4.565IleIle: 4.565 ± 0.356
6.299IleLys: 6.299 ± 0.385
4.565IleLeu: 4.565 ± 0.308
1.876IleMet: 1.876 ± 0.178
4.424IleAsn: 4.424 ± 0.248
2.92IlePro: 2.92 ± 0.271
2.459IleGln: 2.459 ± 0.211
3.415IleArg: 3.415 ± 0.239
4.282IleSer: 4.282 ± 0.278
4.512IleThr: 4.512 ± 0.256
4.76IleVal: 4.76 ± 0.278
0.69IleTrp: 0.69 ± 0.11
2.902IleTyr: 2.902 ± 0.239
0.0IleXaa: 0.0 ± 0.0
Lys
5.291LysAla: 5.291 ± 0.34
0.743LysCys: 0.743 ± 0.123
5.379LysAsp: 5.379 ± 0.281
5.574LysGlu: 5.574 ± 0.363
3.663LysPhe: 3.663 ± 0.276
4.247LysGly: 4.247 ± 0.309
1.557LysHis: 1.557 ± 0.185
5.591LysIle: 5.591 ± 0.28
5.662LysLys: 5.662 ± 0.357
6.069LysLeu: 6.069 ± 0.331
2.636LysMet: 2.636 ± 0.239
4.53LysAsn: 4.53 ± 0.274
2.265LysPro: 2.265 ± 0.188
2.902LysGln: 2.902 ± 0.241
3.627LysArg: 3.627 ± 0.267
4.583LysSer: 4.583 ± 0.311
4.724LysThr: 4.724 ± 0.288
4.83LysVal: 4.83 ± 0.272
0.955LysTrp: 0.955 ± 0.129
3.061LysTyr: 3.061 ± 0.233
0.0LysXaa: 0.0 ± 0.0
Leu
5.007LeuAla: 5.007 ± 0.328
0.867LeuCys: 0.867 ± 0.122
5.131LeuAsp: 5.131 ± 0.265
4.618LeuGlu: 4.618 ± 0.351
3.256LeuPhe: 3.256 ± 0.277
4.264LeuGly: 4.264 ± 0.254
1.309LeuHis: 1.309 ± 0.149
4.954LeuIle: 4.954 ± 0.314
6.706LeuLys: 6.706 ± 0.351
5.025LeuLeu: 5.025 ± 0.292
2.849LeuMet: 2.849 ± 0.246
4.459LeuAsn: 4.459 ± 0.28
2.53LeuPro: 2.53 ± 0.223
2.813LeuGln: 2.813 ± 0.196
3.716LeuArg: 3.716 ± 0.262
5.344LeuSer: 5.344 ± 0.295
4.972LeuThr: 4.972 ± 0.285
4.052LeuVal: 4.052 ± 0.249
0.619LeuTrp: 0.619 ± 0.085
2.884LeuTyr: 2.884 ± 0.215
0.0LeuXaa: 0.0 ± 0.0
Met
1.486MetAla: 1.486 ± 0.15
0.372MetCys: 0.372 ± 0.079
1.309MetAsp: 1.309 ± 0.163
1.929MetGlu: 1.929 ± 0.188
1.557MetPhe: 1.557 ± 0.179
1.61MetGly: 1.61 ± 0.17
0.672MetHis: 0.672 ± 0.103
2.725MetIle: 2.725 ± 0.237
3.397MetLys: 3.397 ± 0.302
2.353MetLeu: 2.353 ± 0.215
1.097MetMet: 1.097 ± 0.143
1.893MetAsn: 1.893 ± 0.175
0.602MetPro: 0.602 ± 0.109
1.274MetGln: 1.274 ± 0.145
1.292MetArg: 1.292 ± 0.153
2.371MetSer: 2.371 ± 0.193
2.159MetThr: 2.159 ± 0.174
1.168MetVal: 1.168 ± 0.145
0.478MetTrp: 0.478 ± 0.091
1.132MetTyr: 1.132 ± 0.142
0.0MetXaa: 0.0 ± 0.0
Asn
3.68AsnAla: 3.68 ± 0.274
0.46AsnCys: 0.46 ± 0.091
3.114AsnAsp: 3.114 ± 0.255
4.105AsnGlu: 4.105 ± 0.217
2.318AsnPhe: 2.318 ± 0.199
3.928AsnGly: 3.928 ± 0.298
1.079AsnHis: 1.079 ± 0.143
3.716AsnIle: 3.716 ± 0.249
3.787AsnLys: 3.787 ± 0.258
4.724AsnLeu: 4.724 ± 0.273
1.752AsnMet: 1.752 ± 0.186
2.583AsnAsn: 2.583 ± 0.268
2.336AsnPro: 2.336 ± 0.21
1.752AsnGln: 1.752 ± 0.16
2.619AsnArg: 2.619 ± 0.2
3.946AsnSer: 3.946 ± 0.273
2.619AsnThr: 2.619 ± 0.184
4.229AsnVal: 4.229 ± 0.265
0.69AsnTrp: 0.69 ± 0.122
1.628AsnTyr: 1.628 ± 0.189
0.0AsnXaa: 0.0 ± 0.0
Pro
1.999ProAla: 1.999 ± 0.199
0.318ProCys: 0.318 ± 0.075
2.017ProAsp: 2.017 ± 0.188
2.566ProGlu: 2.566 ± 0.22
1.433ProPhe: 1.433 ± 0.159
2.123ProGly: 2.123 ± 0.265
0.69ProHis: 0.69 ± 0.107
1.911ProIle: 1.911 ± 0.219
1.929ProLys: 1.929 ± 0.186
1.876ProLeu: 1.876 ± 0.199
1.009ProMet: 1.009 ± 0.129
2.07ProAsn: 2.07 ± 0.208
0.655ProPro: 0.655 ± 0.12
0.991ProGln: 0.991 ± 0.137
1.787ProArg: 1.787 ± 0.193
2.088ProSer: 2.088 ± 0.211
2.229ProThr: 2.229 ± 0.206
2.601ProVal: 2.601 ± 0.216
0.442ProTrp: 0.442 ± 0.085
1.362ProTyr: 1.362 ± 0.155
0.0ProXaa: 0.0 ± 0.0
Gln
2.212GlnAla: 2.212 ± 0.19
0.407GlnCys: 0.407 ± 0.091
1.734GlnAsp: 1.734 ± 0.165
2.424GlnGlu: 2.424 ± 0.235
1.734GlnPhe: 1.734 ± 0.164
1.929GlnGly: 1.929 ± 0.144
0.584GlnHis: 0.584 ± 0.104
2.831GlnIle: 2.831 ± 0.213
2.477GlnLys: 2.477 ± 0.213
3.008GlnLeu: 3.008 ± 0.264
1.15GlnMet: 1.15 ± 0.115
1.787GlnAsn: 1.787 ± 0.181
0.743GlnPro: 0.743 ± 0.124
1.168GlnGln: 1.168 ± 0.151
1.699GlnArg: 1.699 ± 0.15
2.229GlnSer: 2.229 ± 0.231
2.212GlnThr: 2.212 ± 0.225
2.601GlnVal: 2.601 ± 0.205
0.566GlnTrp: 0.566 ± 0.11
1.398GlnTyr: 1.398 ± 0.164
0.0GlnXaa: 0.0 ± 0.0
Arg
2.884ArgAla: 2.884 ± 0.243
0.602ArgCys: 0.602 ± 0.104
3.008ArgAsp: 3.008 ± 0.243
3.539ArgGlu: 3.539 ± 0.251
2.3ArgPhe: 2.3 ± 0.203
2.831ArgGly: 2.831 ± 0.219
0.796ArgHis: 0.796 ± 0.121
3.627ArgIle: 3.627 ± 0.224
3.769ArgLys: 3.769 ± 0.264
3.468ArgLeu: 3.468 ± 0.263
1.327ArgMet: 1.327 ± 0.152
2.831ArgAsn: 2.831 ± 0.227
1.451ArgPro: 1.451 ± 0.175
1.84ArgGln: 1.84 ± 0.166
2.495ArgArg: 2.495 ± 0.235
3.026ArgSer: 3.026 ± 0.237
2.707ArgThr: 2.707 ± 0.222
3.079ArgVal: 3.079 ± 0.252
0.832ArgTrp: 0.832 ± 0.139
2.353ArgTyr: 2.353 ± 0.197
0.0ArgXaa: 0.0 ± 0.0
Ser
3.698SerAla: 3.698 ± 0.319
0.637SerCys: 0.637 ± 0.111
4.229SerAsp: 4.229 ± 0.267
4.3SerGlu: 4.3 ± 0.312
2.619SerPhe: 2.619 ± 0.228
4.335SerGly: 4.335 ± 0.368
1.026SerHis: 1.026 ± 0.143
4.459SerIle: 4.459 ± 0.232
4.671SerLys: 4.671 ± 0.303
4.99SerLeu: 4.99 ± 0.323
1.858SerMet: 1.858 ± 0.18
2.99SerAsn: 2.99 ± 0.234
1.946SerPro: 1.946 ± 0.198
2.123SerGln: 2.123 ± 0.223
3.362SerArg: 3.362 ± 0.219
3.893SerSer: 3.893 ± 0.309
3.15SerThr: 3.15 ± 0.254
4.583SerVal: 4.583 ± 0.34
0.92SerTrp: 0.92 ± 0.131
2.813SerTyr: 2.813 ± 0.23
0.0SerXaa: 0.0 ± 0.0
Thr
3.291ThrAla: 3.291 ± 0.287
0.69ThrCys: 0.69 ± 0.094
3.627ThrAsp: 3.627 ± 0.262
4.441ThrGlu: 4.441 ± 0.287
2.76ThrPhe: 2.76 ± 0.235
3.999ThrGly: 3.999 ± 0.373
0.991ThrHis: 0.991 ± 0.129
3.751ThrIle: 3.751 ± 0.238
4.194ThrLys: 4.194 ± 0.267
4.565ThrLeu: 4.565 ± 0.271
1.628ThrMet: 1.628 ± 0.2
2.53ThrAsn: 2.53 ± 0.238
2.69ThrPro: 2.69 ± 0.271
1.752ThrGln: 1.752 ± 0.147
2.831ThrArg: 2.831 ± 0.193
3.326ThrSer: 3.326 ± 0.265
2.902ThrThr: 2.902 ± 0.263
4.6ThrVal: 4.6 ± 0.332
0.92ThrTrp: 0.92 ± 0.126
1.946ThrTyr: 1.946 ± 0.206
0.0ThrXaa: 0.0 ± 0.0
Val
4.105ValAla: 4.105 ± 0.257
0.619ValCys: 0.619 ± 0.086
4.813ValAsp: 4.813 ± 0.298
4.76ValGlu: 4.76 ± 0.323
3.167ValPhe: 3.167 ± 0.241
4.459ValGly: 4.459 ± 0.323
1.203ValHis: 1.203 ± 0.144
4.583ValIle: 4.583 ± 0.28
5.432ValLys: 5.432 ± 0.33
4.512ValLeu: 4.512 ± 0.234
1.699ValMet: 1.699 ± 0.181
4.052ValAsn: 4.052 ± 0.233
2.229ValPro: 2.229 ± 0.173
2.495ValGln: 2.495 ± 0.227
3.61ValArg: 3.61 ± 0.235
4.388ValSer: 4.388 ± 0.28
3.804ValThr: 3.804 ± 0.322
4.777ValVal: 4.777 ± 0.279
0.885ValTrp: 0.885 ± 0.12
3.114ValTyr: 3.114 ± 0.226
0.0ValXaa: 0.0 ± 0.0
Trp
0.637TrpAla: 0.637 ± 0.097
0.088TrpCys: 0.088 ± 0.037
0.849TrpAsp: 0.849 ± 0.115
0.938TrpGlu: 0.938 ± 0.111
0.495TrpPhe: 0.495 ± 0.083
0.725TrpGly: 0.725 ± 0.124
0.248TrpHis: 0.248 ± 0.064
0.955TrpIle: 0.955 ± 0.114
1.362TrpLys: 1.362 ± 0.146
1.221TrpLeu: 1.221 ± 0.141
0.69TrpMet: 0.69 ± 0.117
0.725TrpAsn: 0.725 ± 0.129
0.23TrpPro: 0.23 ± 0.055
0.531TrpGln: 0.531 ± 0.089
0.637TrpArg: 0.637 ± 0.111
0.761TrpSer: 0.761 ± 0.125
0.867TrpThr: 0.867 ± 0.124
0.743TrpVal: 0.743 ± 0.11
0.124TrpTrp: 0.124 ± 0.047
0.531TrpTyr: 0.531 ± 0.087
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.619TyrAla: 2.619 ± 0.216
0.619TyrCys: 0.619 ± 0.105
3.096TyrAsp: 3.096 ± 0.234
2.513TyrGlu: 2.513 ± 0.237
1.858TyrPhe: 1.858 ± 0.147
2.53TyrGly: 2.53 ± 0.181
0.849TyrHis: 0.849 ± 0.135
2.548TyrIle: 2.548 ± 0.22
2.973TyrLys: 2.973 ± 0.231
2.831TyrLeu: 2.831 ± 0.255
1.168TyrMet: 1.168 ± 0.135
2.159TyrAsn: 2.159 ± 0.237
1.681TyrPro: 1.681 ± 0.193
1.787TyrGln: 1.787 ± 0.176
2.141TyrArg: 2.141 ± 0.19
2.318TyrSer: 2.318 ± 0.217
2.141TyrThr: 2.141 ± 0.201
2.796TyrVal: 2.796 ± 0.199
0.372TyrTrp: 0.372 ± 0.078
1.663TyrTyr: 1.663 ± 0.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 268 proteins (56517 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
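
A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the sample standard deviation of those 100 values is taken as the error. The function names, the seed parameter, and the choice of standard deviation as the error measure are all assumptions.

```python
import random
import statistics


def single_dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, counted within proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each resample draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(single_dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)
```

Calling `bootstrap_error(proteins, "AA")` on a list of one-letter protein sequences returns an error in the same per mille units as the table.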

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski