Amino acid dipeptide frequency for Escherichia phage nom

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
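
As a minimal sketch of what such a calculation could look like, the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The page does not state how pairs are counted, so the overlapping-window approach, the function name `dipeptide_per_mille`, and the toy sequences are assumptions made for illustration only.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping adjacent pairs, and a pair
    never spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome in one-letter code (not the phage data).
toy = ["MKAAL", "AAK"]
freqs = dipeptide_per_mille(toy)
```

Here the two toy sequences contain six pairs in total, two of which are "AA", so `freqs["AA"]` is one third of 1000.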

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
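
The arithmetic behind the expected value and the unit conversion can be sketched as follows. The page does not state the exact rule used for the colouring, so the simple comparison against the uniform expectation, as well as the helper names, are assumptions; the two example values are AlaAla and CysCys from the table.

```python
N_STANDARD = 20  # standard amino acids; pass 21 to include Xaa


def expected_per_mille(n_residues=N_STANDARD):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residues ** 2)


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def classify(observed_per_mille, n_residues=N_STANDARD):
    """Assumed colouring rule: compare against the uniform expectation."""
    expected = expected_per_mille(n_residues)
    if observed_per_mille > expected:
        return "red"   # more frequent than expected
    if observed_per_mille < expected:
        return "blue"  # less frequent than expected
    return "neutral"


ala_ala = classify(5.852)  # AlaAla value from the table
cys_cys = classify(0.523)  # CysCys value from the table
```

With 20 residues the expectation is 2.5‰, so under this assumed rule AlaAla (5.852‰) counts as overrepresented and CysCys (0.523‰) as underrepresented.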

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.852 ± 0.501
AlaCys: 1.32 ± 0.204
AlaAsp: 4.432 ± 0.289
AlaGlu: 4.856 ± 0.36
AlaPhe: 2.839 ± 0.196
AlaGly: 5.03 ± 0.402
AlaHis: 1.145 ± 0.16
AlaIle: 3.735 ± 0.317
AlaLys: 5.677 ± 0.513
AlaLeu: 5.279 ± 0.386
AlaMet: 2.366 ± 0.26
AlaAsn: 3.735 ± 0.351
AlaPro: 2.017 ± 0.256
AlaGln: 2.216 ± 0.284
AlaArg: 2.963 ± 0.252
AlaSer: 3.262 ± 0.356
AlaThr: 4.009 ± 0.418
AlaVal: 4.557 ± 0.335
AlaTrp: 1.245 ± 0.17
AlaTyr: 3.187 ± 0.287
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.946 ± 0.183
CysCys: 0.523 ± 0.135
CysAsp: 0.896 ± 0.178
CysGlu: 0.996 ± 0.159
CysPhe: 0.573 ± 0.126
CysGly: 1.37 ± 0.188
CysHis: 0.473 ± 0.13
CysIle: 0.921 ± 0.183
CysLys: 1.071 ± 0.199
CysLeu: 1.121 ± 0.206
CysMet: 0.523 ± 0.106
CysAsn: 0.872 ± 0.137
CysPro: 0.822 ± 0.165
CysGln: 0.473 ± 0.103
CysArg: 0.921 ± 0.154
CysSer: 0.847 ± 0.142
CysThr: 0.847 ± 0.148
CysVal: 1.22 ± 0.168
CysTrp: 0.324 ± 0.093
CysTyr: 0.722 ± 0.143
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.358 ± 0.296
AspCys: 1.17 ± 0.169
AspAsp: 4.233 ± 0.386
AspGlu: 4.432 ± 0.343
AspPhe: 3.262 ± 0.309
AspGly: 5.129 ± 0.367
AspHis: 1.569 ± 0.195
AspIle: 4.183 ± 0.371
AspLys: 5.08 ± 0.307
AspLeu: 5.976 ± 0.408
AspMet: 1.868 ± 0.23
AspAsn: 3.635 ± 0.293
AspPro: 3.536 ± 0.322
AspGln: 1.843 ± 0.207
AspArg: 2.864 ± 0.267
AspSer: 3.561 ± 0.274
AspThr: 3.038 ± 0.25
AspVal: 4.656 ± 0.307
AspTrp: 1.27 ± 0.157
AspTyr: 2.789 ± 0.262
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.179 ± 0.356
GluCys: 0.747 ± 0.143
GluAsp: 4.781 ± 0.432
GluGlu: 6.997 ± 0.712
GluPhe: 3.063 ± 0.228
GluGly: 5.304 ± 0.402
GluHis: 1.295 ± 0.181
GluIle: 4.432 ± 0.386
GluLys: 4.283 ± 0.429
GluLeu: 6.325 ± 0.419
GluMet: 2.366 ± 0.29
GluAsn: 3.337 ± 0.318
GluPro: 1.942 ± 0.232
GluGln: 1.992 ± 0.282
GluArg: 3.162 ± 0.277
GluSer: 3.337 ± 0.295
GluThr: 2.764 ± 0.29
GluVal: 5.503 ± 0.39
GluTrp: 1.27 ± 0.19
GluTyr: 2.938 ± 0.272
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.44 ± 0.237
PheCys: 0.623 ± 0.12
PheAsp: 3.362 ± 0.324
PheGlu: 3.237 ± 0.296
PhePhe: 1.469 ± 0.191
PheGly: 3.038 ± 0.286
PheHis: 0.772 ± 0.128
PheIle: 2.639 ± 0.264
PheLys: 2.988 ± 0.229
PheLeu: 3.088 ± 0.305
PheMet: 1.394 ± 0.202
PheAsn: 2.341 ± 0.217
PhePro: 1.419 ± 0.173
PheGln: 1.071 ± 0.191
PheArg: 1.843 ± 0.22
PheSer: 2.689 ± 0.26
PheThr: 2.191 ± 0.207
PheVal: 2.839 ± 0.265
PheTrp: 0.697 ± 0.123
PheTyr: 2.216 ± 0.265
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.582 ± 0.412
GlyCys: 1.295 ± 0.172
GlyAsp: 5.005 ± 0.326
GlyGlu: 4.856 ± 0.356
GlyPhe: 3.362 ± 0.394
GlyGly: 5.105 ± 0.58
GlyHis: 1.27 ± 0.196
GlyIle: 3.984 ± 0.365
GlyLys: 5.453 ± 0.414
GlyLeu: 4.93 ± 0.374
GlyMet: 1.892 ± 0.234
GlyAsn: 3.86 ± 0.311
GlyPro: 1.444 ± 0.269
GlyGln: 1.793 ± 0.212
GlyArg: 3.411 ± 0.262
GlySer: 3.511 ± 0.268
GlyThr: 4.233 ± 0.635
GlyVal: 4.98 ± 0.324
GlyTrp: 1.37 ± 0.176
GlyTyr: 3.76 ± 0.394
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.046 ± 0.161
HisCys: 0.498 ± 0.132
HisAsp: 1.096 ± 0.167
HisGlu: 0.996 ± 0.153
HisPhe: 0.822 ± 0.14
HisGly: 1.195 ± 0.166
HisHis: 0.448 ± 0.11
HisIle: 1.096 ± 0.157
HisLys: 1.569 ± 0.205
HisLeu: 1.818 ± 0.221
HisMet: 0.523 ± 0.11
HisAsn: 0.946 ± 0.149
HisPro: 1.17 ± 0.168
HisGln: 0.623 ± 0.125
HisArg: 0.921 ± 0.147
HisSer: 1.071 ± 0.166
HisThr: 1.469 ± 0.189
HisVal: 1.046 ± 0.166
HisTrp: 0.249 ± 0.079
HisTyr: 1.17 ± 0.197
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.109 ± 0.344
IleCys: 0.896 ± 0.202
IleAsp: 4.208 ± 0.379
IleGlu: 3.337 ± 0.248
IlePhe: 2.042 ± 0.247
IleGly: 3.81 ± 0.339
IleHis: 1.195 ± 0.164
IleIle: 3.735 ± 0.328
IleLys: 3.611 ± 0.325
IleLeu: 4.482 ± 0.366
IleMet: 1.544 ± 0.189
IleAsn: 3.237 ± 0.226
IlePro: 2.689 ± 0.212
IleGln: 2.042 ± 0.265
IleArg: 2.963 ± 0.255
IleSer: 3.411 ± 0.299
IleThr: 3.635 ± 0.283
IleVal: 4.582 ± 0.377
IleTrp: 0.797 ± 0.137
IleTyr: 2.366 ± 0.225
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.627 ± 0.415
LysCys: 1.145 ± 0.165
LysAsp: 5.005 ± 0.382
LysGlu: 5.478 ± 0.445
LysPhe: 2.59 ± 0.3
LysGly: 4.432 ± 0.355
LysHis: 1.245 ± 0.175
LysIle: 4.557 ± 0.345
LysLys: 4.507 ± 0.364
LysLeu: 5.055 ± 0.417
LysMet: 2.341 ± 0.235
LysAsn: 2.913 ± 0.276
LysPro: 2.49 ± 0.224
LysGln: 2.39 ± 0.22
LysArg: 2.764 ± 0.252
LysSer: 3.536 ± 0.327
LysThr: 4.009 ± 0.353
LysVal: 5.254 ± 0.433
LysTrp: 1.046 ± 0.162
LysTyr: 2.814 ± 0.243
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.603 ± 0.376
LeuCys: 0.996 ± 0.164
LeuAsp: 5.852 ± 0.424
LeuGlu: 6.35 ± 0.441
LeuPhe: 2.615 ± 0.251
LeuGly: 4.482 ± 0.354
LeuHis: 1.793 ± 0.223
LeuIle: 4.183 ± 0.281
LeuLys: 5.304 ± 0.452
LeuLeu: 4.93 ± 0.442
LeuMet: 2.316 ± 0.262
LeuAsn: 4.133 ± 0.316
LeuPro: 3.909 ± 0.268
LeuGln: 2.814 ± 0.262
LeuArg: 3.411 ± 0.323
LeuSer: 4.432 ± 0.408
LeuThr: 4.457 ± 0.347
LeuVal: 5.055 ± 0.398
LeuTrp: 1.419 ± 0.217
LeuTyr: 2.714 ± 0.287
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.415 ± 0.239
MetCys: 0.448 ± 0.12
MetAsp: 1.345 ± 0.188
MetGlu: 1.619 ± 0.206
MetPhe: 1.046 ± 0.197
MetGly: 1.892 ± 0.216
MetHis: 0.573 ± 0.128
MetIle: 1.818 ± 0.204
MetLys: 2.963 ± 0.281
MetLeu: 2.515 ± 0.25
MetMet: 0.872 ± 0.149
MetAsn: 1.096 ± 0.179
MetPro: 1.071 ± 0.168
MetGln: 1.121 ± 0.19
MetArg: 1.22 ± 0.175
MetSer: 2.117 ± 0.25
MetThr: 1.494 ± 0.188
MetVal: 1.519 ± 0.212
MetTrp: 0.573 ± 0.131
MetTyr: 0.847 ± 0.144
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.685 ± 0.334
AsnCys: 0.797 ± 0.136
AsnAsp: 2.913 ± 0.214
AsnGlu: 2.59 ± 0.262
AsnPhe: 2.59 ± 0.234
AsnGly: 4.681 ± 0.45
AsnHis: 0.822 ± 0.125
AsnIle: 3.088 ± 0.26
AsnLys: 3.187 ± 0.246
AsnLeu: 4.183 ± 0.332
AsnMet: 1.345 ± 0.171
AsnAsn: 3.137 ± 0.392
AsnPro: 2.44 ± 0.27
AsnGln: 1.544 ± 0.19
AsnArg: 2.117 ± 0.205
AsnSer: 2.515 ± 0.286
AsnThr: 3.063 ± 0.285
AsnVal: 2.789 ± 0.242
AsnTrp: 0.896 ± 0.16
AsnTyr: 1.569 ± 0.204
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.241 ± 0.251
ProCys: 0.548 ± 0.132
ProAsp: 3.486 ± 0.282
ProGlu: 3.959 ± 0.304
ProPhe: 1.967 ± 0.211
ProGly: 2.316 ± 0.277
ProHis: 0.697 ± 0.119
ProIle: 1.569 ± 0.213
ProLys: 2.515 ± 0.288
ProLeu: 2.888 ± 0.263
ProMet: 0.847 ± 0.158
ProAsn: 1.519 ± 0.212
ProPro: 1.121 ± 0.201
ProGln: 1.469 ± 0.176
ProArg: 1.594 ± 0.259
ProSer: 2.689 ± 0.269
ProThr: 2.166 ± 0.266
ProVal: 3.137 ± 0.252
ProTrp: 0.448 ± 0.146
ProTyr: 1.32 ± 0.202
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.59 ± 0.269
GlnCys: 0.374 ± 0.108
GlnAsp: 1.818 ± 0.248
GlnGlu: 2.639 ± 0.271
GlnPhe: 1.37 ± 0.166
GlnGly: 1.743 ± 0.194
GlnHis: 0.448 ± 0.093
GlnIle: 1.818 ± 0.199
GlnLys: 1.992 ± 0.238
GlnLeu: 2.739 ± 0.233
GlnMet: 1.046 ± 0.169
GlnAsn: 1.195 ± 0.167
GlnPro: 1.419 ± 0.175
GlnGln: 1.768 ± 0.295
GlnArg: 1.594 ± 0.152
GlnSer: 1.569 ± 0.205
GlnThr: 1.743 ± 0.229
GlnVal: 2.316 ± 0.324
GlnTrp: 0.772 ± 0.119
GlnTyr: 1.345 ± 0.208
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.615 ± 0.215
ArgCys: 0.971 ± 0.167
ArgAsp: 3.038 ± 0.266
ArgGlu: 2.739 ± 0.274
ArgPhe: 1.818 ± 0.197
ArgGly: 2.739 ± 0.26
ArgHis: 0.747 ± 0.143
ArgIle: 2.639 ± 0.281
ArgLys: 3.436 ± 0.315
ArgLeu: 3.735 ± 0.297
ArgMet: 1.544 ± 0.18
ArgAsn: 2.117 ± 0.218
ArgPro: 1.27 ± 0.174
ArgGln: 1.619 ± 0.157
ArgArg: 2.49 ± 0.264
ArgSer: 2.689 ± 0.293
ArgThr: 2.59 ± 0.264
ArgVal: 3.411 ± 0.286
ArgTrp: 0.822 ± 0.134
ArgTyr: 1.967 ± 0.187
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.81 ± 0.357
SerCys: 0.946 ± 0.164
SerAsp: 3.511 ± 0.262
SerGlu: 3.137 ± 0.287
SerPhe: 2.615 ± 0.31
SerGly: 4.955 ± 0.409
SerHis: 1.245 ± 0.161
SerIle: 3.162 ± 0.282
SerLys: 3.884 ± 0.345
SerLeu: 3.436 ± 0.322
SerMet: 1.469 ± 0.191
SerAsn: 2.739 ± 0.237
SerPro: 2.39 ± 0.219
SerGln: 1.718 ± 0.191
SerArg: 2.639 ± 0.274
SerSer: 3.312 ± 0.419
SerThr: 3.436 ± 0.321
SerVal: 3.86 ± 0.278
SerTrp: 0.847 ± 0.171
SerTyr: 2.017 ± 0.261
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.909 ± 0.366
ThrCys: 0.971 ± 0.172
ThrAsp: 3.063 ± 0.277
ThrGlu: 3.735 ± 0.303
ThrPhe: 2.764 ± 0.222
ThrGly: 4.482 ± 0.422
ThrHis: 1.32 ± 0.214
ThrIle: 3.835 ± 0.343
ThrLys: 3.237 ± 0.283
ThrLeu: 4.731 ± 0.419
ThrMet: 1.145 ± 0.153
ThrAsn: 2.316 ± 0.24
ThrPro: 2.639 ± 0.235
ThrGln: 1.818 ± 0.217
ThrArg: 1.992 ± 0.233
ThrSer: 3.312 ± 0.421
ThrThr: 3.461 ± 0.355
ThrVal: 4.507 ± 0.387
ThrTrp: 1.021 ± 0.161
ThrTyr: 2.316 ± 0.302
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.781 ± 0.347
ValCys: 1.145 ± 0.165
ValAsp: 5.926 ± 0.386
ValGlu: 5.304 ± 0.463
ValPhe: 3.287 ± 0.275
ValGly: 4.407 ± 0.324
ValHis: 1.245 ± 0.196
ValIle: 4.382 ± 0.335
ValLys: 4.781 ± 0.37
ValLeu: 4.382 ± 0.372
ValMet: 1.793 ± 0.228
ValAsn: 3.635 ± 0.35
ValPro: 2.117 ± 0.234
ValGln: 1.892 ± 0.186
ValArg: 3.137 ± 0.243
ValSer: 3.835 ± 0.266
ValThr: 4.158 ± 0.371
ValVal: 6.698 ± 0.477
ValTrp: 1.469 ± 0.192
ValTyr: 2.963 ± 0.252
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.996 ± 0.202
TrpCys: 0.398 ± 0.099
TrpAsp: 1.17 ± 0.16
TrpGlu: 1.743 ± 0.225
TrpPhe: 0.747 ± 0.158
TrpGly: 1.021 ± 0.199
TrpHis: 0.374 ± 0.103
TrpIle: 1.22 ± 0.159
TrpLys: 1.145 ± 0.201
TrpLeu: 1.394 ± 0.213
TrpMet: 0.423 ± 0.105
TrpAsn: 0.847 ± 0.187
TrpPro: 0.548 ± 0.116
TrpGln: 0.523 ± 0.123
TrpArg: 0.822 ± 0.147
TrpSer: 0.946 ± 0.163
TrpThr: 1.096 ± 0.158
TrpVal: 1.046 ± 0.152
TrpTrp: 0.523 ± 0.136
TrpTyr: 0.946 ± 0.134
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.938 ± 0.241
TyrCys: 0.647 ± 0.126
TyrAsp: 3.312 ± 0.318
TyrGlu: 2.017 ± 0.257
TyrPhe: 1.519 ± 0.204
TyrGly: 2.963 ± 0.261
TyrHis: 1.096 ± 0.161
TyrIle: 1.718 ± 0.21
TyrLys: 2.515 ± 0.214
TyrLeu: 3.76 ± 0.305
TyrMet: 0.921 ± 0.13
TyrAsn: 2.341 ± 0.239
TyrPro: 2.042 ± 0.219
TyrGln: 1.569 ± 0.196
TyrArg: 2.166 ± 0.244
TyrSer: 2.49 ± 0.245
TyrThr: 2.714 ± 0.244
TyrVal: 2.366 ± 0.269
TyrTrp: 0.872 ± 0.131
TyrTyr: 2.241 ± 0.275
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 208 proteins (40161 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
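
One way such a protein-level bootstrap could be set up is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. Only "bootstrapping (x100) at the protein level" comes from the note; the use of the standard deviation as the error, the overlapping pair counting, the fixed seed, and all names are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation (per mille) of one dipeptide's frequency
    over n_rounds resamplings of whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy sequences in one-letter code (not the phage data).
identical = bootstrap_error(["AAK", "AAK", "AAK"], "AA")
mixed = bootstrap_error(["AAAA", "KKKK", "AAKK"], "AA")
```

When every protein is identical, each replicate yields the same frequency and the estimated error is zero; the error grows as proteins differ more in composition.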

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski