Amino acid dipepetide frequency for Enterococcus phage EfsSzw-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.101AlaAla: 0.101 ± 0.064
0.531AlaCys: 0.531 ± 0.107
4.448AlaAsp: 4.448 ± 0.424
4.877AlaGlu: 4.877 ± 0.327
2.502AlaPhe: 2.502 ± 0.248
3.487AlaGly: 3.487 ± 0.498
1.061AlaHis: 1.061 ± 0.147
4.524AlaIle: 4.524 ± 0.367
5.13AlaLys: 5.13 ± 0.386
5.484AlaLeu: 5.484 ± 0.424
1.567AlaMet: 1.567 ± 0.221
3.715AlaAsn: 3.715 ± 0.418
1.87AlaPro: 1.87 ± 0.247
2.755AlaGln: 2.755 ± 0.33
3.108AlaArg: 3.108 ± 0.283
3.841AlaSer: 3.841 ± 0.446
4.524AlaThr: 4.524 ± 0.495
3.816AlaVal: 3.816 ± 0.289
0.581AlaTrp: 0.581 ± 0.107
2.704AlaTyr: 2.704 ± 0.28
0.0AlaXaa: 0.0 ± 0.0
Cys
0.329CysAla: 0.329 ± 0.098
0.101CysCys: 0.101 ± 0.057
0.556CysAsp: 0.556 ± 0.139
0.505CysGlu: 0.505 ± 0.122
0.43CysPhe: 0.43 ± 0.093
0.607CysGly: 0.607 ± 0.136
0.101CysHis: 0.101 ± 0.048
0.303CysIle: 0.303 ± 0.094
0.708CysLys: 0.708 ± 0.161
0.581CysLeu: 0.581 ± 0.11
0.101CysMet: 0.101 ± 0.048
0.303CysAsn: 0.303 ± 0.087
0.657CysPro: 0.657 ± 0.147
0.227CysGln: 0.227 ± 0.069
0.379CysArg: 0.379 ± 0.1
0.505CysSer: 0.505 ± 0.116
0.48CysThr: 0.48 ± 0.104
0.531CysVal: 0.531 ± 0.113
0.101CysTrp: 0.101 ± 0.053
0.531CysTyr: 0.531 ± 0.114
0.0CysXaa: 0.0 ± 0.0
Asp
3.311AspAla: 3.311 ± 0.265
0.43AspCys: 0.43 ± 0.118
2.982AspAsp: 2.982 ± 0.299
4.549AspGlu: 4.549 ± 0.414
3.033AspPhe: 3.033 ± 0.266
3.942AspGly: 3.942 ± 0.42
0.607AspHis: 0.607 ± 0.14
4.397AspIle: 4.397 ± 0.319
5.357AspLys: 5.357 ± 0.366
5.459AspLeu: 5.459 ± 0.397
2.249AspMet: 2.249 ± 0.249
3.235AspAsn: 3.235 ± 0.328
1.668AspPro: 1.668 ± 0.275
1.289AspGln: 1.289 ± 0.207
2.401AspArg: 2.401 ± 0.284
4.069AspSer: 4.069 ± 0.363
4.246AspThr: 4.246 ± 0.377
4.22AspVal: 4.22 ± 0.334
0.859AspTrp: 0.859 ± 0.128
3.942AspTyr: 3.942 ± 0.398
0.0AspXaa: 0.0 ± 0.0
Glu
5.408GluAla: 5.408 ± 0.408
0.682GluCys: 0.682 ± 0.167
5.079GluAsp: 5.079 ± 0.476
8.163GluGlu: 8.163 ± 0.853
2.552GluPhe: 2.552 ± 0.262
3.791GluGly: 3.791 ± 0.301
1.769GluHis: 1.769 ± 0.213
5.054GluIle: 5.054 ± 0.38
6.015GluLys: 6.015 ± 0.426
7.43GluLeu: 7.43 ± 0.543
2.805GluMet: 2.805 ± 0.293
3.942GluAsn: 3.942 ± 0.34
2.097GluPro: 2.097 ± 0.232
4.144GluGln: 4.144 ± 0.334
3.412GluArg: 3.412 ± 0.32
3.892GluSer: 3.892 ± 0.331
4.195GluThr: 4.195 ± 0.292
5.383GluVal: 5.383 ± 0.378
0.96GluTrp: 0.96 ± 0.134
3.462GluTyr: 3.462 ± 0.301
0.0GluXaa: 0.0 ± 0.0
Phe
2.274PheAla: 2.274 ± 0.241
0.303PheCys: 0.303 ± 0.082
2.628PheAsp: 2.628 ± 0.284
2.552PheGlu: 2.552 ± 0.284
1.137PhePhe: 1.137 ± 0.203
2.552PheGly: 2.552 ± 0.314
0.607PheHis: 0.607 ± 0.138
2.704PheIle: 2.704 ± 0.285
2.679PheLys: 2.679 ± 0.212
3.007PheLeu: 3.007 ± 0.281
1.238PheMet: 1.238 ± 0.189
2.375PheAsn: 2.375 ± 0.279
1.314PhePro: 1.314 ± 0.201
1.036PheGln: 1.036 ± 0.166
1.466PheArg: 1.466 ± 0.223
2.982PheSer: 2.982 ± 0.3
3.033PheThr: 3.033 ± 0.281
2.704PheVal: 2.704 ± 0.265
0.354PheTrp: 0.354 ± 0.1
1.921PheTyr: 1.921 ± 0.198
0.0PheXaa: 0.0 ± 0.0
Gly
3.639GlyAla: 3.639 ± 0.443
0.556GlyCys: 0.556 ± 0.146
3.487GlyAsp: 3.487 ± 0.339
4.599GlyGlu: 4.599 ± 0.301
2.451GlyPhe: 2.451 ± 0.307
4.498GlyGly: 4.498 ± 0.494
1.137GlyHis: 1.137 ± 0.139
4.726GlyIle: 4.726 ± 0.332
5.332GlyLys: 5.332 ± 0.318
4.776GlyLeu: 4.776 ± 0.326
1.339GlyMet: 1.339 ± 0.26
3.412GlyAsn: 3.412 ± 0.295
0.025GlyPro: 0.025 ± 0.025
2.249GlyGln: 2.249 ± 0.289
2.3GlyArg: 2.3 ± 0.238
3.892GlySer: 3.892 ± 0.346
4.877GlyThr: 4.877 ± 0.472
4.801GlyVal: 4.801 ± 0.33
0.758GlyTrp: 0.758 ± 0.137
2.931GlyTyr: 2.931 ± 0.3
0.0GlyXaa: 0.0 ± 0.0
His
0.809HisAla: 0.809 ± 0.136
0.126HisCys: 0.126 ± 0.055
0.733HisAsp: 0.733 ± 0.116
1.087HisGlu: 1.087 ± 0.171
0.758HisPhe: 0.758 ± 0.128
0.96HisGly: 0.96 ± 0.176
0.43HisHis: 0.43 ± 0.113
1.011HisIle: 1.011 ± 0.186
1.264HisLys: 1.264 ± 0.187
1.264HisLeu: 1.264 ± 0.16
0.455HisMet: 0.455 ± 0.107
0.834HisAsn: 0.834 ± 0.135
0.505HisPro: 0.505 ± 0.108
0.404HisGln: 0.404 ± 0.1
0.657HisArg: 0.657 ± 0.118
0.91HisSer: 0.91 ± 0.126
1.188HisThr: 1.188 ± 0.215
1.415HisVal: 1.415 ± 0.191
0.329HisTrp: 0.329 ± 0.084
0.96HisTyr: 0.96 ± 0.158
0.0HisXaa: 0.0 ± 0.0
Ile
5.004IleAla: 5.004 ± 0.346
0.581IleCys: 0.581 ± 0.122
4.877IleAsp: 4.877 ± 0.413
5.004IleGlu: 5.004 ± 0.385
1.82IlePhe: 1.82 ± 0.212
3.866IleGly: 3.866 ± 0.338
0.884IleHis: 0.884 ± 0.158
4.22IleIle: 4.22 ± 0.359
4.978IleLys: 4.978 ± 0.36
4.473IleLeu: 4.473 ± 0.37
1.693IleMet: 1.693 ± 0.199
3.437IleAsn: 3.437 ± 0.261
2.603IlePro: 2.603 ± 0.249
2.502IleGln: 2.502 ± 0.24
2.552IleArg: 2.552 ± 0.249
4.069IleSer: 4.069 ± 0.303
4.65IleThr: 4.65 ± 0.424
3.563IleVal: 3.563 ± 0.293
0.632IleTrp: 0.632 ± 0.153
2.249IleTyr: 2.249 ± 0.23
0.0IleXaa: 0.0 ± 0.0
Lys
5.13LysAla: 5.13 ± 0.374
0.531LysCys: 0.531 ± 0.134
4.549LysAsp: 4.549 ± 0.287
8.491LysGlu: 8.491 ± 0.491
2.628LysPhe: 2.628 ± 0.237
4.473LysGly: 4.473 ± 0.415
1.087LysHis: 1.087 ± 0.161
3.866LysIle: 3.866 ± 0.317
5.711LysLys: 5.711 ± 0.433
6.191LysLeu: 6.191 ± 0.314
2.3LysMet: 2.3 ± 0.26
3.765LysAsn: 3.765 ± 0.353
3.184LysPro: 3.184 ± 0.312
3.588LysGln: 3.588 ± 0.353
3.816LysArg: 3.816 ± 0.358
4.195LysSer: 4.195 ± 0.344
4.574LysThr: 4.574 ± 0.26
5.762LysVal: 5.762 ± 0.387
0.834LysTrp: 0.834 ± 0.144
3.285LysTyr: 3.285 ± 0.26
0.0LysXaa: 0.0 ± 0.0
Leu
5.484LeuAla: 5.484 ± 0.393
0.682LeuCys: 0.682 ± 0.14
5.585LeuAsp: 5.585 ± 0.429
6.621LeuGlu: 6.621 ± 0.46
3.083LeuPhe: 3.083 ± 0.233
5.635LeuGly: 5.635 ± 0.335
1.365LeuHis: 1.365 ± 0.192
4.827LeuIle: 4.827 ± 0.408
6.318LeuLys: 6.318 ± 0.334
6.368LeuLeu: 6.368 ± 0.432
1.617LeuMet: 1.617 ± 0.183
5.256LeuAsn: 5.256 ± 0.351
2.603LeuPro: 2.603 ± 0.287
3.538LeuGln: 3.538 ± 0.252
3.841LeuArg: 3.841 ± 0.319
5.61LeuSer: 5.61 ± 0.369
5.989LeuThr: 5.989 ± 0.383
5.307LeuVal: 5.307 ± 0.38
0.682LeuTrp: 0.682 ± 0.136
3.209LeuTyr: 3.209 ± 0.293
0.0LeuXaa: 0.0 ± 0.0
Met
1.996MetAla: 1.996 ± 0.225
0.278MetCys: 0.278 ± 0.084
1.087MetAsp: 1.087 ± 0.154
2.148MetGlu: 2.148 ± 0.255
1.112MetPhe: 1.112 ± 0.16
1.289MetGly: 1.289 ± 0.21
0.253MetHis: 0.253 ± 0.076
1.617MetIle: 1.617 ± 0.191
2.35MetLys: 2.35 ± 0.23
2.173MetLeu: 2.173 ± 0.207
0.682MetMet: 0.682 ± 0.137
1.491MetAsn: 1.491 ± 0.2
0.607MetPro: 0.607 ± 0.124
1.087MetGln: 1.087 ± 0.154
1.466MetArg: 1.466 ± 0.183
1.921MetSer: 1.921 ± 0.173
1.643MetThr: 1.643 ± 0.171
1.289MetVal: 1.289 ± 0.163
0.202MetTrp: 0.202 ± 0.067
1.592MetTyr: 1.592 ± 0.195
0.0MetXaa: 0.0 ± 0.0
Asn
3.033AsnAla: 3.033 ± 0.308
0.379AsnCys: 0.379 ± 0.12
3.058AsnAsp: 3.058 ± 0.274
3.816AsnGlu: 3.816 ± 0.283
1.971AsnPhe: 1.971 ± 0.235
4.119AsnGly: 4.119 ± 0.33
0.884AsnHis: 0.884 ± 0.156
3.108AsnIle: 3.108 ± 0.234
4.675AsnLys: 4.675 ± 0.326
4.675AsnLeu: 4.675 ± 0.372
1.365AsnMet: 1.365 ± 0.196
2.881AsnAsn: 2.881 ± 0.291
2.173AsnPro: 2.173 ± 0.259
1.693AsnGln: 1.693 ± 0.207
2.856AsnArg: 2.856 ± 0.286
3.563AsnSer: 3.563 ± 0.366
3.361AsnThr: 3.361 ± 0.329
3.588AsnVal: 3.588 ± 0.357
0.708AsnTrp: 0.708 ± 0.133
2.375AsnTyr: 2.375 ± 0.247
0.0AsnXaa: 0.0 ± 0.0
Pro
1.744ProAla: 1.744 ± 0.239
0.227ProCys: 0.227 ± 0.072
2.199ProAsp: 2.199 ± 0.267
2.628ProGlu: 2.628 ± 0.226
1.339ProPhe: 1.339 ± 0.207
0.531ProGly: 0.531 ± 0.127
0.556ProHis: 0.556 ± 0.121
1.921ProIle: 1.921 ± 0.216
2.653ProLys: 2.653 ± 0.293
2.325ProLeu: 2.325 ± 0.247
0.733ProMet: 0.733 ± 0.123
1.769ProAsn: 1.769 ± 0.246
0.48ProPro: 0.48 ± 0.116
0.91ProGln: 0.91 ± 0.148
1.137ProArg: 1.137 ± 0.133
2.375ProSer: 2.375 ± 0.239
2.249ProThr: 2.249 ± 0.257
2.805ProVal: 2.805 ± 0.296
0.43ProTrp: 0.43 ± 0.101
1.617ProTyr: 1.617 ± 0.18
0.0ProXaa: 0.0 ± 0.0
Gln
3.942GlnAla: 3.942 ± 0.469
0.177GlnCys: 0.177 ± 0.062
2.072GlnAsp: 2.072 ± 0.239
3.74GlnGlu: 3.74 ± 0.356
1.289GlnPhe: 1.289 ± 0.205
2.249GlnGly: 2.249 ± 0.233
0.531GlnHis: 0.531 ± 0.116
2.047GlnIle: 2.047 ± 0.202
2.451GlnLys: 2.451 ± 0.274
3.715GlnLeu: 3.715 ± 0.271
1.011GlnMet: 1.011 ± 0.174
1.668GlnAsn: 1.668 ± 0.207
1.036GlnPro: 1.036 ± 0.184
1.643GlnGln: 1.643 ± 0.239
1.567GlnArg: 1.567 ± 0.223
2.274GlnSer: 2.274 ± 0.256
1.82GlnThr: 1.82 ± 0.214
2.679GlnVal: 2.679 ± 0.257
0.329GlnTrp: 0.329 ± 0.086
1.567GlnTyr: 1.567 ± 0.184
0.0GlnXaa: 0.0 ± 0.0
Arg
2.502ArgAla: 2.502 ± 0.322
0.303ArgCys: 0.303 ± 0.115
2.729ArgAsp: 2.729 ± 0.244
3.715ArgGlu: 3.715 ± 0.369
1.794ArgPhe: 1.794 ± 0.224
2.35ArgGly: 2.35 ± 0.307
0.682ArgHis: 0.682 ± 0.123
2.982ArgIle: 2.982 ± 0.293
3.664ArgLys: 3.664 ± 0.37
4.372ArgLeu: 4.372 ± 0.288
1.314ArgMet: 1.314 ± 0.206
2.552ArgAsn: 2.552 ± 0.254
1.061ArgPro: 1.061 ± 0.154
1.592ArgGln: 1.592 ± 0.249
1.794ArgArg: 1.794 ± 0.237
2.148ArgSer: 2.148 ± 0.25
2.325ArgThr: 2.325 ± 0.262
3.033ArgVal: 3.033 ± 0.323
0.379ArgTrp: 0.379 ± 0.112
2.072ArgTyr: 2.072 ± 0.253
0.0ArgXaa: 0.0 ± 0.0
Ser
3.614SerAla: 3.614 ± 0.344
0.278SerCys: 0.278 ± 0.098
4.347SerAsp: 4.347 ± 0.315
3.816SerGlu: 3.816 ± 0.337
3.033SerPhe: 3.033 ± 0.254
5.029SerGly: 5.029 ± 0.462
0.859SerHis: 0.859 ± 0.168
4.119SerIle: 4.119 ± 0.345
4.549SerLys: 4.549 ± 0.347
5.206SerLeu: 5.206 ± 0.452
1.516SerMet: 1.516 ± 0.208
2.881SerAsn: 2.881 ± 0.296
1.996SerPro: 1.996 ± 0.231
2.022SerGln: 2.022 ± 0.241
2.603SerArg: 2.603 ± 0.291
3.917SerSer: 3.917 ± 0.32
3.588SerThr: 3.588 ± 0.3
3.917SerVal: 3.917 ± 0.304
0.91SerTrp: 0.91 ± 0.145
2.906SerTyr: 2.906 ± 0.315
0.0SerXaa: 0.0 ± 0.0
Thr
4.397ThrAla: 4.397 ± 0.419
0.531ThrCys: 0.531 ± 0.129
3.74ThrAsp: 3.74 ± 0.306
4.751ThrGlu: 4.751 ± 0.36
3.26ThrPhe: 3.26 ± 0.286
4.422ThrGly: 4.422 ± 0.41
1.087ThrHis: 1.087 ± 0.134
4.296ThrIle: 4.296 ± 0.309
4.448ThrLys: 4.448 ± 0.311
6.065ThrLeu: 6.065 ± 0.437
1.592ThrMet: 1.592 ± 0.224
3.285ThrAsn: 3.285 ± 0.305
2.679ThrPro: 2.679 ± 0.253
2.249ThrGln: 2.249 ± 0.247
2.603ThrArg: 2.603 ± 0.228
3.462ThrSer: 3.462 ± 0.314
4.473ThrThr: 4.473 ± 0.496
5.585ThrVal: 5.585 ± 0.472
0.758ThrTrp: 0.758 ± 0.124
3.033ThrTyr: 3.033 ± 0.414
0.0ThrXaa: 0.0 ± 0.0
Val
4.625ValAla: 4.625 ± 0.399
0.581ValCys: 0.581 ± 0.106
4.726ValAsp: 4.726 ± 0.4
5.079ValGlu: 5.079 ± 0.482
2.729ValPhe: 2.729 ± 0.262
4.296ValGly: 4.296 ± 0.396
1.238ValHis: 1.238 ± 0.228
4.271ValIle: 4.271 ± 0.329
5.408ValLys: 5.408 ± 0.36
5.408ValLeu: 5.408 ± 0.44
1.466ValMet: 1.466 ± 0.193
3.715ValAsn: 3.715 ± 0.32
2.35ValPro: 2.35 ± 0.26
2.502ValGln: 2.502 ± 0.263
3.007ValArg: 3.007 ± 0.327
4.271ValSer: 4.271 ± 0.304
5.181ValThr: 5.181 ± 0.482
4.65ValVal: 4.65 ± 0.389
0.607ValTrp: 0.607 ± 0.118
3.184ValTyr: 3.184 ± 0.283
0.0ValXaa: 0.0 ± 0.0
Trp
0.632TrpAla: 0.632 ± 0.125
0.202TrpCys: 0.202 ± 0.08
0.581TrpAsp: 0.581 ± 0.116
1.162TrpGlu: 1.162 ± 0.174
0.505TrpPhe: 0.505 ± 0.131
0.859TrpGly: 0.859 ± 0.151
0.126TrpHis: 0.126 ± 0.06
0.607TrpIle: 0.607 ± 0.124
0.834TrpLys: 0.834 ± 0.148
0.834TrpLeu: 0.834 ± 0.146
0.025TrpMet: 0.025 ± 0.026
0.581TrpAsn: 0.581 ± 0.129
0.0TrpPro: 0.0 ± 0.0
0.43TrpGln: 0.43 ± 0.101
0.455TrpArg: 0.455 ± 0.109
0.556TrpSer: 0.556 ± 0.113
0.96TrpThr: 0.96 ± 0.14
0.935TrpVal: 0.935 ± 0.189
0.253TrpTrp: 0.253 ± 0.094
0.657TrpTyr: 0.657 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.653TyrAla: 2.653 ± 0.274
0.556TyrCys: 0.556 ± 0.114
2.78TyrAsp: 2.78 ± 0.276
2.931TyrGlu: 2.931 ± 0.265
1.44TyrPhe: 1.44 ± 0.17
2.856TyrGly: 2.856 ± 0.32
0.834TyrHis: 0.834 ± 0.145
3.058TyrIle: 3.058 ± 0.323
3.437TyrLys: 3.437 ± 0.382
3.816TyrLeu: 3.816 ± 0.304
1.188TyrMet: 1.188 ± 0.141
3.184TyrAsn: 3.184 ± 0.332
1.718TyrPro: 1.718 ± 0.209
1.946TyrGln: 1.946 ± 0.241
1.946TyrArg: 1.946 ± 0.222
2.653TyrSer: 2.653 ± 0.298
3.361TyrThr: 3.361 ± 0.339
3.311TyrVal: 3.311 ± 0.334
0.531TyrTrp: 0.531 ± 0.136
1.996TyrTyr: 1.996 ± 0.276
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 172 proteins (39572 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski