Amino acid dipeptide frequency for Enterobacteria phage IME08

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and underrepresented ones in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
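The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual code: the function names are hypothetical, dipeptides are counted as overlapping pairs within each protein, and normalizing by the total number of dipeptides counted is an assumption (the database may normalize differently).

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes

# Uniform expectation: 1 of 400 dipeptides = 0.25 % = 2.5 per mille
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_counts(proteins):
    """Count overlapping dipeptides within each protein (never across proteins)."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is mapped to X (Xaa)
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille (assumed normalization)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"
```

For example, the AlaAla value of 5.651‰ from the table corresponds to a fraction of 0.005651 and would be classified as overrepresented relative to 2.5‰, while AlaCys at 0.389‰ would be underrepresented.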

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.651AlaAla: 5.651 ± 0.363
0.389AlaCys: 0.389 ± 0.085
3.705AlaAsp: 3.705 ± 0.318
5.317AlaGlu: 5.317 ± 0.399
2.279AlaPhe: 2.279 ± 0.191
4.817AlaGly: 4.817 ± 0.396
1.204AlaHis: 1.204 ± 0.154
4.372AlaIle: 4.372 ± 0.29
4.724AlaLys: 4.724 ± 0.324
6.169AlaLeu: 6.169 ± 0.363
1.871AlaMet: 1.871 ± 0.188
3.168AlaAsn: 3.168 ± 0.218
2.705AlaPro: 2.705 ± 0.233
2.52AlaGln: 2.52 ± 0.197
3.001AlaArg: 3.001 ± 0.241
4.632AlaSer: 4.632 ± 0.344
3.891AlaThr: 3.891 ± 0.382
4.798AlaVal: 4.798 ± 0.289
0.945AlaTrp: 0.945 ± 0.134
2.835AlaTyr: 2.835 ± 0.256
0.0AlaXaa: 0.0 ± 0.0
Cys
0.797CysAla: 0.797 ± 0.117
0.148CysCys: 0.148 ± 0.065
0.63CysAsp: 0.63 ± 0.113
0.852CysGlu: 0.852 ± 0.138
0.408CysPhe: 0.408 ± 0.09
0.704CysGly: 0.704 ± 0.119
0.259CysHis: 0.259 ± 0.069
0.611CysIle: 0.611 ± 0.105
0.63CysLys: 0.63 ± 0.11
0.815CysLeu: 0.815 ± 0.114
0.408CysMet: 0.408 ± 0.085
0.371CysAsn: 0.371 ± 0.084
0.593CysPro: 0.593 ± 0.109
0.222CysGln: 0.222 ± 0.06
0.667CysArg: 0.667 ± 0.128
0.667CysSer: 0.667 ± 0.112
0.445CysThr: 0.445 ± 0.089
0.648CysVal: 0.648 ± 0.114
0.111CysTrp: 0.111 ± 0.044
0.408CysTyr: 0.408 ± 0.085
0.0CysXaa: 0.0 ± 0.0
Asp
4.28AspAla: 4.28 ± 0.263
0.704AspCys: 0.704 ± 0.112
3.946AspAsp: 3.946 ± 0.32
4.465AspGlu: 4.465 ± 0.297
3.298AspPhe: 3.298 ± 0.282
4.798AspGly: 4.798 ± 0.364
0.797AspHis: 0.797 ± 0.121
5.002AspIle: 5.002 ± 0.303
4.613AspLys: 4.613 ± 0.303
5.187AspLeu: 5.187 ± 0.31
1.519AspMet: 1.519 ± 0.162
2.872AspAsn: 2.872 ± 0.209
2.205AspPro: 2.205 ± 0.227
1.816AspGln: 1.816 ± 0.199
2.075AspArg: 2.075 ± 0.178
3.835AspSer: 3.835 ± 0.218
3.131AspThr: 3.131 ± 0.275
3.983AspVal: 3.983 ± 0.281
1.223AspTrp: 1.223 ± 0.143
2.946AspTyr: 2.946 ± 0.26
0.0AspXaa: 0.0 ± 0.0
Glu
5.224GluAla: 5.224 ± 0.409
0.871GluCys: 0.871 ± 0.142
4.168GluAsp: 4.168 ± 0.32
5.002GluGlu: 5.002 ± 0.426
3.464GluPhe: 3.464 ± 0.233
3.909GluGly: 3.909 ± 0.272
1.334GluHis: 1.334 ± 0.157
5.669GluIle: 5.669 ± 0.364
4.465GluLys: 4.465 ± 0.347
6.688GluLeu: 6.688 ± 0.403
2.075GluMet: 2.075 ± 0.21
4.317GluAsn: 4.317 ± 0.276
1.871GluPro: 1.871 ± 0.164
2.557GluGln: 2.557 ± 0.234
2.612GluArg: 2.612 ± 0.247
3.594GluSer: 3.594 ± 0.314
4.28GluThr: 4.28 ± 0.253
5.391GluVal: 5.391 ± 0.318
0.945GluTrp: 0.945 ± 0.127
3.353GluTyr: 3.353 ± 0.269
0.0GluXaa: 0.0 ± 0.0
Phe
2.705PheAla: 2.705 ± 0.221
0.445PheCys: 0.445 ± 0.079
3.131PheAsp: 3.131 ± 0.241
3.464PheGlu: 3.464 ± 0.244
1.334PhePhe: 1.334 ± 0.194
2.853PheGly: 2.853 ± 0.217
0.574PheHis: 0.574 ± 0.098
2.909PheIle: 2.909 ± 0.208
4.002PheLys: 4.002 ± 0.293
2.334PheLeu: 2.334 ± 0.196
1.352PheMet: 1.352 ± 0.162
2.612PheAsn: 2.612 ± 0.207
1.093PhePro: 1.093 ± 0.159
1.482PheGln: 1.482 ± 0.16
1.982PheArg: 1.982 ± 0.162
2.557PheSer: 2.557 ± 0.224
2.353PheThr: 2.353 ± 0.211
2.575PheVal: 2.575 ± 0.207
0.611PheTrp: 0.611 ± 0.096
1.667PheTyr: 1.667 ± 0.171
0.0PheXaa: 0.0 ± 0.0
Gly
3.631GlyAla: 3.631 ± 0.3
0.685GlyCys: 0.685 ± 0.121
4.187GlyAsp: 4.187 ± 0.383
4.131GlyGlu: 4.131 ± 0.265
2.501GlyPhe: 2.501 ± 0.242
4.205GlyGly: 4.205 ± 0.585
0.926GlyHis: 0.926 ± 0.147
4.205GlyIle: 4.205 ± 0.28
4.317GlyLys: 4.317 ± 0.309
5.762GlyLeu: 5.762 ± 0.363
1.982GlyMet: 1.982 ± 0.212
3.539GlyAsn: 3.539 ± 0.395
2.408GlyPro: 2.408 ± 0.486
2.186GlyGln: 2.186 ± 0.225
2.872GlyArg: 2.872 ± 0.207
4.502GlySer: 4.502 ± 0.318
4.354GlyThr: 4.354 ± 0.408
3.631GlyVal: 3.631 ± 0.245
1.149GlyTrp: 1.149 ± 0.188
2.779GlyTyr: 2.779 ± 0.262
0.0GlyXaa: 0.0 ± 0.0
His
0.797HisAla: 0.797 ± 0.123
0.259HisCys: 0.259 ± 0.072
0.963HisAsp: 0.963 ± 0.125
1.167HisGlu: 1.167 ± 0.15
0.926HisPhe: 0.926 ± 0.138
1.037HisGly: 1.037 ± 0.125
0.333HisHis: 0.333 ± 0.096
1.186HisIle: 1.186 ± 0.163
1.204HisLys: 1.204 ± 0.142
1.315HisLeu: 1.315 ± 0.13
0.352HisMet: 0.352 ± 0.08
0.834HisAsn: 0.834 ± 0.122
1.019HisPro: 1.019 ± 0.131
0.5HisGln: 0.5 ± 0.102
0.685HisArg: 0.685 ± 0.116
1.093HisSer: 1.093 ± 0.112
0.815HisThr: 0.815 ± 0.111
1.056HisVal: 1.056 ± 0.142
0.241HisTrp: 0.241 ± 0.065
0.63HisTyr: 0.63 ± 0.119
0.0HisXaa: 0.0 ± 0.0
Ile
4.52IleAla: 4.52 ± 0.279
0.685IleCys: 0.685 ± 0.117
5.206IleAsp: 5.206 ± 0.306
4.947IleGlu: 4.947 ± 0.317
2.408IlePhe: 2.408 ± 0.2
3.483IleGly: 3.483 ± 0.252
1.315IleHis: 1.315 ± 0.18
3.928IleIle: 3.928 ± 0.27
6.558IleLys: 6.558 ± 0.389
4.224IleLeu: 4.224 ± 0.257
2.445IleMet: 2.445 ± 0.198
4.317IleAsn: 4.317 ± 0.252
2.723IlePro: 2.723 ± 0.215
2.483IleGln: 2.483 ± 0.257
3.761IleArg: 3.761 ± 0.276
4.131IleSer: 4.131 ± 0.285
4.539IleThr: 4.539 ± 0.273
4.131IleVal: 4.131 ± 0.276
0.537IleTrp: 0.537 ± 0.095
2.223IleTyr: 2.223 ± 0.225
0.0IleXaa: 0.0 ± 0.0
Lys
5.891LysAla: 5.891 ± 0.361
0.797LysCys: 0.797 ± 0.137
4.335LysAsp: 4.335 ± 0.323
5.521LysGlu: 5.521 ± 0.373
3.353LysPhe: 3.353 ± 0.238
4.557LysGly: 4.557 ± 0.35
1.464LysHis: 1.464 ± 0.159
5.299LysIle: 5.299 ± 0.301
4.372LysLys: 4.372 ± 0.345
6.021LysLeu: 6.021 ± 0.323
2.964LysMet: 2.964 ± 0.282
3.464LysAsn: 3.464 ± 0.197
2.445LysPro: 2.445 ± 0.244
2.686LysGln: 2.686 ± 0.233
3.02LysArg: 3.02 ± 0.264
4.205LysSer: 4.205 ± 0.302
4.039LysThr: 4.039 ± 0.261
5.484LysVal: 5.484 ± 0.395
1.019LysTrp: 1.019 ± 0.114
2.946LysTyr: 2.946 ± 0.234
0.0LysXaa: 0.0 ± 0.0
Leu
5.595LeuAla: 5.595 ± 0.379
0.63LeuCys: 0.63 ± 0.108
4.872LeuAsp: 4.872 ± 0.291
5.447LeuGlu: 5.447 ± 0.431
3.112LeuPhe: 3.112 ± 0.279
4.131LeuGly: 4.131 ± 0.355
1.167LeuHis: 1.167 ± 0.149
4.947LeuIle: 4.947 ± 0.272
6.318LeuLys: 6.318 ± 0.345
5.484LeuLeu: 5.484 ± 0.351
2.52LeuMet: 2.52 ± 0.237
4.576LeuAsn: 4.576 ± 0.277
3.001LeuPro: 3.001 ± 0.225
2.594LeuGln: 2.594 ± 0.236
3.631LeuArg: 3.631 ± 0.234
5.28LeuSer: 5.28 ± 0.321
4.761LeuThr: 4.761 ± 0.295
4.984LeuVal: 4.984 ± 0.3
0.741LeuTrp: 0.741 ± 0.13
3.576LeuTyr: 3.576 ± 0.277
0.0LeuXaa: 0.0 ± 0.0
Met
2.408MetAla: 2.408 ± 0.197
0.389MetCys: 0.389 ± 0.092
2.075MetAsp: 2.075 ± 0.211
1.704MetGlu: 1.704 ± 0.202
1.408MetPhe: 1.408 ± 0.163
1.556MetGly: 1.556 ± 0.152
0.315MetHis: 0.315 ± 0.086
1.89MetIle: 1.89 ± 0.161
2.52MetLys: 2.52 ± 0.204
2.26MetLeu: 2.26 ± 0.222
1.538MetMet: 1.538 ± 0.171
1.927MetAsn: 1.927 ± 0.176
0.908MetPro: 0.908 ± 0.117
0.926MetGln: 0.926 ± 0.135
1.241MetArg: 1.241 ± 0.146
2.149MetSer: 2.149 ± 0.193
1.853MetThr: 1.853 ± 0.16
2.001MetVal: 2.001 ± 0.191
0.371MetTrp: 0.371 ± 0.087
1.0MetTyr: 1.0 ± 0.141
0.0MetXaa: 0.0 ± 0.0
Asn
3.446AsnAla: 3.446 ± 0.294
0.426AsnCys: 0.426 ± 0.106
2.964AsnAsp: 2.964 ± 0.233
3.835AsnGlu: 3.835 ± 0.273
2.575AsnPhe: 2.575 ± 0.222
4.28AsnGly: 4.28 ± 0.371
1.149AsnHis: 1.149 ± 0.154
3.946AsnIle: 3.946 ± 0.272
3.853AsnLys: 3.853 ± 0.288
3.891AsnLeu: 3.891 ± 0.229
1.612AsnMet: 1.612 ± 0.196
3.075AsnAsn: 3.075 ± 0.28
2.39AsnPro: 2.39 ± 0.2
1.908AsnGln: 1.908 ± 0.232
2.131AsnArg: 2.131 ± 0.184
3.928AsnSer: 3.928 ± 0.278
3.187AsnThr: 3.187 ± 0.322
3.131AsnVal: 3.131 ± 0.234
0.704AsnTrp: 0.704 ± 0.124
2.038AsnTyr: 2.038 ± 0.231
0.0AsnXaa: 0.0 ± 0.0
Pro
2.371ProAla: 2.371 ± 0.22
0.389ProCys: 0.389 ± 0.085
2.316ProAsp: 2.316 ± 0.23
3.149ProGlu: 3.149 ± 0.286
1.315ProPhe: 1.315 ± 0.134
2.594ProGly: 2.594 ± 0.209
0.537ProHis: 0.537 ± 0.09
2.39ProIle: 2.39 ± 0.224
2.371ProLys: 2.371 ± 0.229
2.575ProLeu: 2.575 ± 0.199
0.889ProMet: 0.889 ± 0.129
1.908ProAsn: 1.908 ± 0.165
0.871ProPro: 0.871 ± 0.134
1.26ProGln: 1.26 ± 0.251
1.445ProArg: 1.445 ± 0.162
2.501ProSer: 2.501 ± 0.224
2.353ProThr: 2.353 ± 0.179
2.631ProVal: 2.631 ± 0.202
0.519ProTrp: 0.519 ± 0.098
1.371ProTyr: 1.371 ± 0.166
0.0ProXaa: 0.0 ± 0.0
Gln
2.946GlnAla: 2.946 ± 0.255
0.371GlnCys: 0.371 ± 0.092
1.797GlnAsp: 1.797 ± 0.182
2.297GlnGlu: 2.297 ± 0.214
1.63GlnPhe: 1.63 ± 0.163
2.538GlnGly: 2.538 ± 0.679
0.445GlnHis: 0.445 ± 0.086
2.483GlnIle: 2.483 ± 0.231
1.853GlnLys: 1.853 ± 0.192
2.946GlnLeu: 2.946 ± 0.228
1.056GlnMet: 1.056 ± 0.144
1.63GlnAsn: 1.63 ± 0.196
1.056GlnPro: 1.056 ± 0.146
1.223GlnGln: 1.223 ± 0.181
1.945GlnArg: 1.945 ± 0.182
1.871GlnSer: 1.871 ± 0.181
2.353GlnThr: 2.353 ± 0.269
2.297GlnVal: 2.297 ± 0.21
0.797GlnTrp: 0.797 ± 0.13
1.593GlnTyr: 1.593 ± 0.174
0.0GlnXaa: 0.0 ± 0.0
Arg
2.742ArgAla: 2.742 ± 0.271
0.463ArgCys: 0.463 ± 0.089
2.371ArgAsp: 2.371 ± 0.231
3.168ArgGlu: 3.168 ± 0.257
2.038ArgPhe: 2.038 ± 0.199
2.594ArgGly: 2.594 ± 0.235
0.741ArgHis: 0.741 ± 0.114
3.501ArgIle: 3.501 ± 0.245
3.149ArgLys: 3.149 ± 0.24
3.631ArgLeu: 3.631 ± 0.309
1.297ArgMet: 1.297 ± 0.142
2.464ArgAsn: 2.464 ± 0.215
1.371ArgPro: 1.371 ± 0.144
1.797ArgGln: 1.797 ± 0.176
2.279ArgArg: 2.279 ± 0.216
2.501ArgSer: 2.501 ± 0.205
2.538ArgThr: 2.538 ± 0.221
2.89ArgVal: 2.89 ± 0.255
0.648ArgTrp: 0.648 ± 0.106
1.741ArgTyr: 1.741 ± 0.186
0.0ArgXaa: 0.0 ± 0.0
Ser
3.779SerAla: 3.779 ± 0.276
0.741SerCys: 0.741 ± 0.137
4.317SerAsp: 4.317 ± 0.308
4.243SerGlu: 4.243 ± 0.28
2.501SerPhe: 2.501 ± 0.217
4.502SerGly: 4.502 ± 0.328
1.075SerHis: 1.075 ± 0.138
4.187SerIle: 4.187 ± 0.294
4.947SerLys: 4.947 ± 0.248
4.965SerLeu: 4.965 ± 0.284
1.723SerMet: 1.723 ± 0.154
3.187SerAsn: 3.187 ± 0.306
2.149SerPro: 2.149 ± 0.232
2.279SerGln: 2.279 ± 0.183
2.705SerArg: 2.705 ± 0.204
4.168SerSer: 4.168 ± 0.363
3.798SerThr: 3.798 ± 0.318
3.853SerVal: 3.853 ± 0.273
0.871SerTrp: 0.871 ± 0.118
2.835SerTyr: 2.835 ± 0.229
0.0SerXaa: 0.0 ± 0.0
Thr
4.409ThrAla: 4.409 ± 0.39
0.519ThrCys: 0.519 ± 0.089
3.39ThrAsp: 3.39 ± 0.246
4.391ThrGlu: 4.391 ± 0.281
2.52ThrPhe: 2.52 ± 0.203
4.52ThrGly: 4.52 ± 0.349
0.871ThrHis: 0.871 ± 0.117
3.853ThrIle: 3.853 ± 0.282
4.168ThrLys: 4.168 ± 0.328
4.706ThrLeu: 4.706 ± 0.334
1.408ThrMet: 1.408 ± 0.203
3.094ThrAsn: 3.094 ± 0.215
2.538ThrPro: 2.538 ± 0.254
2.001ThrGln: 2.001 ± 0.26
2.946ThrArg: 2.946 ± 0.296
3.483ThrSer: 3.483 ± 0.293
3.409ThrThr: 3.409 ± 0.312
4.483ThrVal: 4.483 ± 0.318
0.593ThrTrp: 0.593 ± 0.091
2.112ThrTyr: 2.112 ± 0.184
0.0ThrXaa: 0.0 ± 0.0
Val
4.354ValAla: 4.354 ± 0.266
0.815ValCys: 0.815 ± 0.115
4.446ValAsp: 4.446 ± 0.231
5.299ValGlu: 5.299 ± 0.399
2.575ValPhe: 2.575 ± 0.188
3.557ValGly: 3.557 ± 0.265
0.889ValHis: 0.889 ± 0.119
4.465ValIle: 4.465 ± 0.28
5.187ValLys: 5.187 ± 0.317
4.576ValLeu: 4.576 ± 0.288
1.908ValMet: 1.908 ± 0.159
3.779ValAsn: 3.779 ± 0.293
2.557ValPro: 2.557 ± 0.204
2.612ValGln: 2.612 ± 0.212
2.797ValArg: 2.797 ± 0.212
4.131ValSer: 4.131 ± 0.286
4.02ValThr: 4.02 ± 0.37
4.409ValVal: 4.409 ± 0.296
0.778ValTrp: 0.778 ± 0.116
3.261ValTyr: 3.261 ± 0.272
0.0ValXaa: 0.0 ± 0.0
Trp
0.76TrpAla: 0.76 ± 0.088
0.167TrpCys: 0.167 ± 0.054
0.889TrpAsp: 0.889 ± 0.106
0.723TrpGlu: 0.723 ± 0.112
0.648TrpPhe: 0.648 ± 0.093
0.408TrpGly: 0.408 ± 0.097
0.296TrpHis: 0.296 ± 0.073
0.778TrpIle: 0.778 ± 0.117
1.371TrpLys: 1.371 ± 0.184
1.13TrpLeu: 1.13 ± 0.155
0.445TrpMet: 0.445 ± 0.093
0.723TrpAsn: 0.723 ± 0.123
0.389TrpPro: 0.389 ± 0.09
0.5TrpGln: 0.5 ± 0.088
0.463TrpArg: 0.463 ± 0.095
0.815TrpSer: 0.815 ± 0.118
0.797TrpThr: 0.797 ± 0.11
1.167TrpVal: 1.167 ± 0.162
0.241TrpTrp: 0.241 ± 0.068
0.889TrpTyr: 0.889 ± 0.122
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.686TyrAla: 2.686 ± 0.263
0.5TyrCys: 0.5 ± 0.086
2.983TyrAsp: 2.983 ± 0.224
2.668TyrGlu: 2.668 ± 0.24
1.76TyrPhe: 1.76 ± 0.186
2.909TyrGly: 2.909 ± 0.231
0.723TyrHis: 0.723 ± 0.113
2.946TyrIle: 2.946 ± 0.198
3.261TyrLys: 3.261 ± 0.271
2.686TyrLeu: 2.686 ± 0.23
1.056TyrMet: 1.056 ± 0.167
2.612TyrAsn: 2.612 ± 0.219
1.464TyrPro: 1.464 ± 0.17
1.575TyrGln: 1.575 ± 0.168
1.667TyrArg: 1.667 ± 0.187
2.779TyrSer: 2.779 ± 0.197
2.501TyrThr: 2.501 ± 0.197
2.872TyrVal: 2.872 ± 0.248
0.556TyrTrp: 0.556 ± 0.106
1.741TyrTyr: 1.741 ± 0.177
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 253 proteins (53978 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
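A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the procedure actually used by the database: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken as the standard deviation of the resampled frequencies. The function names, the normalization by dipeptide count, and the fixed seed are all hypothetical choices.

```python
import random
import statistics


def count_dipeptide(seq, dipeptide):
    """Count overlapping occurrences of one dipeptide in a sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of a dipeptide frequency.

    Assumption: the error is the standard deviation over resamples
    in which whole proteins are drawn with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample proteins, not individual residues or dipeptides
        sample = rng.choices(proteins, k=len(proteins))
        total = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_dipeptide(s, dipeptide) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)
```

Resampling at the protein level keeps each protein's internal composition intact, so the resulting spread reflects variation between proteins rather than between individual positions.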

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski