Amino acid dipepetide frequency for Enterococcus phage vB_EfaM_Ef2.1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.145AlaAla: 0.145 ± 0.073
0.339AlaCys: 0.339 ± 0.09
3.394AlaAsp: 3.394 ± 0.332
4.412AlaGlu: 4.412 ± 0.321
2.182AlaPhe: 2.182 ± 0.213
3.176AlaGly: 3.176 ± 0.382
0.727AlaHis: 0.727 ± 0.115
4.218AlaIle: 4.218 ± 0.38
5.649AlaLys: 5.649 ± 0.396
5.067AlaLeu: 5.067 ± 0.328
1.576AlaMet: 1.576 ± 0.212
3.273AlaAsn: 3.273 ± 0.379
1.891AlaPro: 1.891 ± 0.232
2.376AlaGln: 2.376 ± 0.288
2.158AlaArg: 2.158 ± 0.225
3.443AlaSer: 3.443 ± 0.343
4.024AlaThr: 4.024 ± 0.357
4.121AlaVal: 4.121 ± 0.311
0.63AlaTrp: 0.63 ± 0.106
2.23AlaTyr: 2.23 ± 0.236
0.0AlaXaa: 0.0 ± 0.0
Cys
0.436CysAla: 0.436 ± 0.109
0.073CysCys: 0.073 ± 0.039
0.533CysAsp: 0.533 ± 0.127
0.582CysGlu: 0.582 ± 0.136
0.339CysPhe: 0.339 ± 0.088
0.873CysGly: 0.873 ± 0.195
0.121CysHis: 0.121 ± 0.072
0.412CysIle: 0.412 ± 0.118
0.509CysLys: 0.509 ± 0.106
0.849CysLeu: 0.849 ± 0.146
0.073CysMet: 0.073 ± 0.039
0.339CysAsn: 0.339 ± 0.079
0.388CysPro: 0.388 ± 0.095
0.121CysGln: 0.121 ± 0.048
0.315CysArg: 0.315 ± 0.077
0.412CysSer: 0.412 ± 0.094
0.509CysThr: 0.509 ± 0.129
0.63CysVal: 0.63 ± 0.129
0.097CysTrp: 0.097 ± 0.059
0.485CysTyr: 0.485 ± 0.113
0.0CysXaa: 0.0 ± 0.0
Asp
3.176AspAla: 3.176 ± 0.364
0.606AspCys: 0.606 ± 0.142
2.861AspAsp: 2.861 ± 0.264
5.503AspGlu: 5.503 ± 0.418
2.497AspPhe: 2.497 ± 0.208
4.024AspGly: 4.024 ± 0.38
0.63AspHis: 0.63 ± 0.122
5.115AspIle: 5.115 ± 0.326
5.455AspLys: 5.455 ± 0.361
5.043AspLeu: 5.043 ± 0.285
2.327AspMet: 2.327 ± 0.268
3.588AspAsn: 3.588 ± 0.287
1.358AspPro: 1.358 ± 0.21
0.97AspGln: 0.97 ± 0.16
2.182AspArg: 2.182 ± 0.231
3.903AspSer: 3.903 ± 0.342
3.927AspThr: 3.927 ± 0.323
4.703AspVal: 4.703 ± 0.327
0.873AspTrp: 0.873 ± 0.126
3.37AspTyr: 3.37 ± 0.319
0.0AspXaa: 0.0 ± 0.0
Glu
5.091GluAla: 5.091 ± 0.438
0.776GluCys: 0.776 ± 0.162
5.018GluAsp: 5.018 ± 0.381
9.261GluGlu: 9.261 ± 0.815
2.764GluPhe: 2.764 ± 0.261
4.776GluGly: 4.776 ± 0.332
1.527GluHis: 1.527 ± 0.249
5.261GluIle: 5.261 ± 0.45
6.4GluLys: 6.4 ± 0.482
8.97GluLeu: 8.97 ± 0.621
1.818GluMet: 1.818 ± 0.197
4.509GluAsn: 4.509 ± 0.341
2.57GluPro: 2.57 ± 0.246
4.024GluGln: 4.024 ± 0.351
3.273GluArg: 3.273 ± 0.314
4.388GluSer: 4.388 ± 0.341
4.073GluThr: 4.073 ± 0.319
6.328GluVal: 6.328 ± 0.393
0.97GluTrp: 0.97 ± 0.157
4.0GluTyr: 4.0 ± 0.292
0.0GluXaa: 0.0 ± 0.0
Phe
1.794PheAla: 1.794 ± 0.198
0.291PheCys: 0.291 ± 0.084
2.473PheAsp: 2.473 ± 0.252
2.473PheGlu: 2.473 ± 0.289
1.236PhePhe: 1.236 ± 0.17
2.158PheGly: 2.158 ± 0.243
0.533PheHis: 0.533 ± 0.133
3.103PheIle: 3.103 ± 0.284
2.594PheLys: 2.594 ± 0.278
3.079PheLeu: 3.079 ± 0.301
0.97PheMet: 0.97 ± 0.157
2.594PheAsn: 2.594 ± 0.305
1.018PhePro: 1.018 ± 0.155
1.042PheGln: 1.042 ± 0.15
1.333PheArg: 1.333 ± 0.174
3.273PheSer: 3.273 ± 0.294
3.006PheThr: 3.006 ± 0.27
2.303PheVal: 2.303 ± 0.204
0.388PheTrp: 0.388 ± 0.099
1.818PheTyr: 1.818 ± 0.219
0.0PheXaa: 0.0 ± 0.0
Gly
3.612GlyAla: 3.612 ± 0.45
0.436GlyCys: 0.436 ± 0.089
3.879GlyAsp: 3.879 ± 0.297
4.921GlyGlu: 4.921 ± 0.291
2.4GlyPhe: 2.4 ± 0.224
4.824GlyGly: 4.824 ± 0.606
0.946GlyHis: 0.946 ± 0.149
4.0GlyIle: 4.0 ± 0.315
5.237GlyLys: 5.237 ± 0.368
4.824GlyLeu: 4.824 ± 0.339
1.746GlyMet: 1.746 ± 0.225
3.418GlyAsn: 3.418 ± 0.291
0.024GlyPro: 0.024 ± 0.023
1.915GlyGln: 1.915 ± 0.28
2.376GlyArg: 2.376 ± 0.237
4.073GlySer: 4.073 ± 0.49
4.849GlyThr: 4.849 ± 0.435
4.606GlyVal: 4.606 ± 0.357
1.042GlyTrp: 1.042 ± 0.182
3.491GlyTyr: 3.491 ± 0.339
0.0GlyXaa: 0.0 ± 0.0
His
0.8HisAla: 0.8 ± 0.12
0.242HisCys: 0.242 ± 0.093
0.97HisAsp: 0.97 ± 0.16
0.946HisGlu: 0.946 ± 0.16
0.655HisPhe: 0.655 ± 0.125
0.994HisGly: 0.994 ± 0.18
0.291HisHis: 0.291 ± 0.095
1.164HisIle: 1.164 ± 0.196
1.236HisLys: 1.236 ± 0.176
1.164HisLeu: 1.164 ± 0.212
0.242HisMet: 0.242 ± 0.072
0.921HisAsn: 0.921 ± 0.166
0.606HisPro: 0.606 ± 0.099
0.339HisGln: 0.339 ± 0.089
0.558HisArg: 0.558 ± 0.103
0.8HisSer: 0.8 ± 0.149
1.018HisThr: 1.018 ± 0.144
1.188HisVal: 1.188 ± 0.165
0.242HisTrp: 0.242 ± 0.093
0.849HisTyr: 0.849 ± 0.137
0.0HisXaa: 0.0 ± 0.0
Ile
3.927IleAla: 3.927 ± 0.331
0.436IleCys: 0.436 ± 0.102
4.97IleAsp: 4.97 ± 0.355
5.843IleGlu: 5.843 ± 0.413
2.061IlePhe: 2.061 ± 0.25
3.637IleGly: 3.637 ± 0.382
1.042IleHis: 1.042 ± 0.182
4.679IleIle: 4.679 ± 0.373
5.867IleLys: 5.867 ± 0.355
4.437IleLeu: 4.437 ± 0.307
1.649IleMet: 1.649 ± 0.202
4.461IleAsn: 4.461 ± 0.348
2.206IlePro: 2.206 ± 0.244
2.449IleGln: 2.449 ± 0.282
2.74IleArg: 2.74 ± 0.216
4.703IleSer: 4.703 ± 0.343
3.83IleThr: 3.83 ± 0.378
4.097IleVal: 4.097 ± 0.411
0.533IleTrp: 0.533 ± 0.113
2.618IleTyr: 2.618 ± 0.255
0.0IleXaa: 0.0 ± 0.0
Lys
4.437LysAla: 4.437 ± 0.31
0.727LysCys: 0.727 ± 0.174
5.261LysAsp: 5.261 ± 0.418
9.188LysGlu: 9.188 ± 0.636
2.473LysPhe: 2.473 ± 0.227
4.437LysGly: 4.437 ± 0.348
1.333LysHis: 1.333 ± 0.202
4.121LysIle: 4.121 ± 0.317
5.94LysLys: 5.94 ± 0.401
6.812LysLeu: 6.812 ± 0.398
2.012LysMet: 2.012 ± 0.2
4.194LysAsn: 4.194 ± 0.308
2.861LysPro: 2.861 ± 0.297
3.079LysGln: 3.079 ± 0.278
3.734LysArg: 3.734 ± 0.315
3.976LysSer: 3.976 ± 0.345
4.534LysThr: 4.534 ± 0.287
6.303LysVal: 6.303 ± 0.449
0.606LysTrp: 0.606 ± 0.113
3.103LysTyr: 3.103 ± 0.263
0.0LysXaa: 0.0 ± 0.0
Leu
5.285LeuAla: 5.285 ± 0.349
0.606LeuCys: 0.606 ± 0.132
5.915LeuAsp: 5.915 ± 0.392
7.879LeuGlu: 7.879 ± 0.587
3.006LeuPhe: 3.006 ± 0.297
6.085LeuGly: 6.085 ± 0.417
1.018LeuHis: 1.018 ± 0.153
4.97LeuIle: 4.97 ± 0.363
6.206LeuLys: 6.206 ± 0.39
7.54LeuLeu: 7.54 ± 0.535
2.206LeuMet: 2.206 ± 0.252
5.794LeuAsn: 5.794 ± 0.509
2.643LeuPro: 2.643 ± 0.267
3.321LeuGln: 3.321 ± 0.277
3.685LeuArg: 3.685 ± 0.28
5.625LeuSer: 5.625 ± 0.394
4.946LeuThr: 4.946 ± 0.355
4.776LeuVal: 4.776 ± 0.386
0.63LeuTrp: 0.63 ± 0.133
3.297LeuTyr: 3.297 ± 0.32
0.0LeuXaa: 0.0 ± 0.0
Met
1.77MetAla: 1.77 ± 0.206
0.097MetCys: 0.097 ± 0.045
1.382MetAsp: 1.382 ± 0.178
2.012MetGlu: 2.012 ± 0.251
0.849MetPhe: 0.849 ± 0.134
1.503MetGly: 1.503 ± 0.229
0.412MetHis: 0.412 ± 0.119
1.624MetIle: 1.624 ± 0.21
2.158MetLys: 2.158 ± 0.226
2.376MetLeu: 2.376 ± 0.24
0.485MetMet: 0.485 ± 0.111
1.939MetAsn: 1.939 ± 0.228
0.655MetPro: 0.655 ± 0.125
0.897MetGln: 0.897 ± 0.186
1.091MetArg: 1.091 ± 0.177
1.939MetSer: 1.939 ± 0.177
1.527MetThr: 1.527 ± 0.192
1.212MetVal: 1.212 ± 0.161
0.17MetTrp: 0.17 ± 0.069
1.139MetTyr: 1.139 ± 0.192
0.0MetXaa: 0.0 ± 0.0
Asn
2.885AsnAla: 2.885 ± 0.273
0.388AsnCys: 0.388 ± 0.105
3.079AsnAsp: 3.079 ± 0.247
4.315AsnGlu: 4.315 ± 0.301
1.988AsnPhe: 1.988 ± 0.254
4.776AsnGly: 4.776 ± 0.4
1.212AsnHis: 1.212 ± 0.203
3.661AsnIle: 3.661 ± 0.229
5.334AsnLys: 5.334 ± 0.361
4.34AsnLeu: 4.34 ± 0.314
1.479AsnMet: 1.479 ± 0.199
3.83AsnAsn: 3.83 ± 0.387
1.891AsnPro: 1.891 ± 0.193
2.036AsnGln: 2.036 ± 0.251
2.643AsnArg: 2.643 ± 0.238
3.806AsnSer: 3.806 ± 0.316
3.491AsnThr: 3.491 ± 0.311
3.515AsnVal: 3.515 ± 0.262
0.655AsnTrp: 0.655 ± 0.102
2.691AsnTyr: 2.691 ± 0.27
0.0AsnXaa: 0.0 ± 0.0
Pro
1.406ProAla: 1.406 ± 0.238
0.267ProCys: 0.267 ± 0.082
2.255ProAsp: 2.255 ± 0.228
2.764ProGlu: 2.764 ± 0.276
1.261ProPhe: 1.261 ± 0.201
0.533ProGly: 0.533 ± 0.134
0.485ProHis: 0.485 ± 0.099
2.158ProIle: 2.158 ± 0.271
2.327ProLys: 2.327 ± 0.318
2.473ProLeu: 2.473 ± 0.259
0.679ProMet: 0.679 ± 0.12
1.843ProAsn: 1.843 ± 0.261
0.533ProPro: 0.533 ± 0.131
1.115ProGln: 1.115 ± 0.23
1.115ProArg: 1.115 ± 0.167
2.109ProSer: 2.109 ± 0.257
2.279ProThr: 2.279 ± 0.252
2.497ProVal: 2.497 ± 0.291
0.364ProTrp: 0.364 ± 0.092
1.624ProTyr: 1.624 ± 0.188
0.0ProXaa: 0.0 ± 0.0
Gln
2.861GlnAla: 2.861 ± 0.308
0.17GlnCys: 0.17 ± 0.066
1.503GlnAsp: 1.503 ± 0.158
2.958GlnGlu: 2.958 ± 0.272
1.527GlnPhe: 1.527 ± 0.216
2.085GlnGly: 2.085 ± 0.234
0.388GlnHis: 0.388 ± 0.092
2.012GlnIle: 2.012 ± 0.239
2.279GlnLys: 2.279 ± 0.256
3.249GlnLeu: 3.249 ± 0.271
0.824GlnMet: 0.824 ± 0.157
1.455GlnAsn: 1.455 ± 0.191
1.552GlnPro: 1.552 ± 0.406
1.915GlnGln: 1.915 ± 0.414
1.43GlnArg: 1.43 ± 0.206
1.891GlnSer: 1.891 ± 0.237
1.673GlnThr: 1.673 ± 0.158
2.812GlnVal: 2.812 ± 0.286
0.339GlnTrp: 0.339 ± 0.093
1.673GlnTyr: 1.673 ± 0.198
0.0GlnXaa: 0.0 ± 0.0
Arg
2.424ArgAla: 2.424 ± 0.337
0.412ArgCys: 0.412 ± 0.122
2.23ArgAsp: 2.23 ± 0.257
3.612ArgGlu: 3.612 ± 0.28
1.746ArgPhe: 1.746 ± 0.235
2.546ArgGly: 2.546 ± 0.276
0.655ArgHis: 0.655 ± 0.123
2.594ArgIle: 2.594 ± 0.306
3.2ArgLys: 3.2 ± 0.333
3.806ArgLeu: 3.806 ± 0.28
1.067ArgMet: 1.067 ± 0.19
1.988ArgAsn: 1.988 ± 0.25
0.994ArgPro: 0.994 ± 0.139
1.455ArgGln: 1.455 ± 0.219
1.624ArgArg: 1.624 ± 0.166
2.036ArgSer: 2.036 ± 0.232
2.303ArgThr: 2.303 ± 0.2
3.394ArgVal: 3.394 ± 0.324
0.291ArgTrp: 0.291 ± 0.079
2.012ArgTyr: 2.012 ± 0.252
0.0ArgXaa: 0.0 ± 0.0
Ser
3.152SerAla: 3.152 ± 0.284
0.436SerCys: 0.436 ± 0.128
4.291SerAsp: 4.291 ± 0.342
4.0SerGlu: 4.0 ± 0.314
2.74SerPhe: 2.74 ± 0.267
4.097SerGly: 4.097 ± 0.408
0.946SerHis: 0.946 ± 0.151
4.558SerIle: 4.558 ± 0.363
5.382SerLys: 5.382 ± 0.468
5.431SerLeu: 5.431 ± 0.323
1.843SerMet: 1.843 ± 0.184
3.152SerAsn: 3.152 ± 0.283
1.818SerPro: 1.818 ± 0.201
2.303SerGln: 2.303 ± 0.268
2.4SerArg: 2.4 ± 0.266
4.388SerSer: 4.388 ± 0.362
3.637SerThr: 3.637 ± 0.476
4.243SerVal: 4.243 ± 0.332
0.8SerTrp: 0.8 ± 0.128
2.885SerTyr: 2.885 ± 0.312
0.0SerXaa: 0.0 ± 0.0
Thr
3.806ThrAla: 3.806 ± 0.401
0.461ThrCys: 0.461 ± 0.117
3.637ThrAsp: 3.637 ± 0.316
4.606ThrGlu: 4.606 ± 0.401
2.885ThrPhe: 2.885 ± 0.234
3.855ThrGly: 3.855 ± 0.366
0.946ThrHis: 0.946 ± 0.148
4.631ThrIle: 4.631 ± 0.365
4.437ThrLys: 4.437 ± 0.355
5.673ThrLeu: 5.673 ± 0.357
1.164ThrMet: 1.164 ± 0.161
3.127ThrAsn: 3.127 ± 0.253
2.57ThrPro: 2.57 ± 0.32
1.867ThrGln: 1.867 ± 0.266
2.618ThrArg: 2.618 ± 0.257
3.879ThrSer: 3.879 ± 0.403
4.364ThrThr: 4.364 ± 0.456
4.752ThrVal: 4.752 ± 0.492
0.703ThrTrp: 0.703 ± 0.124
3.321ThrTyr: 3.321 ± 0.323
0.0ThrXaa: 0.0 ± 0.0
Val
4.606ValAla: 4.606 ± 0.329
0.727ValCys: 0.727 ± 0.132
5.067ValAsp: 5.067 ± 0.391
5.988ValGlu: 5.988 ± 0.436
3.006ValPhe: 3.006 ± 0.241
4.291ValGly: 4.291 ± 0.351
1.139ValHis: 1.139 ± 0.169
4.558ValIle: 4.558 ± 0.317
4.412ValLys: 4.412 ± 0.328
5.406ValLeu: 5.406 ± 0.436
1.843ValMet: 1.843 ± 0.207
4.097ValAsn: 4.097 ± 0.366
2.594ValPro: 2.594 ± 0.242
1.576ValGln: 1.576 ± 0.232
2.958ValArg: 2.958 ± 0.31
4.703ValSer: 4.703 ± 0.358
5.212ValThr: 5.212 ± 0.492
5.188ValVal: 5.188 ± 0.418
0.655ValTrp: 0.655 ± 0.133
3.127ValTyr: 3.127 ± 0.269
0.0ValXaa: 0.0 ± 0.0
Trp
0.703TrpAla: 0.703 ± 0.139
0.097TrpCys: 0.097 ± 0.051
0.558TrpAsp: 0.558 ± 0.123
1.042TrpGlu: 1.042 ± 0.14
0.533TrpPhe: 0.533 ± 0.114
0.824TrpGly: 0.824 ± 0.128
0.17TrpHis: 0.17 ± 0.072
0.461TrpIle: 0.461 ± 0.1
0.655TrpLys: 0.655 ± 0.109
1.067TrpLeu: 1.067 ± 0.144
0.145TrpMet: 0.145 ± 0.053
0.655TrpAsn: 0.655 ± 0.131
0.0TrpPro: 0.0 ± 0.0
0.436TrpGln: 0.436 ± 0.107
0.339TrpArg: 0.339 ± 0.12
0.461TrpSer: 0.461 ± 0.113
0.824TrpThr: 0.824 ± 0.175
0.873TrpVal: 0.873 ± 0.178
0.291TrpTrp: 0.291 ± 0.09
0.727TrpTyr: 0.727 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.715TyrAla: 2.715 ± 0.266
0.509TyrCys: 0.509 ± 0.102
2.837TyrAsp: 2.837 ± 0.293
3.491TyrGlu: 3.491 ± 0.332
1.333TyrPhe: 1.333 ± 0.186
3.127TyrGly: 3.127 ± 0.339
0.703TyrHis: 0.703 ± 0.118
2.885TyrIle: 2.885 ± 0.261
3.855TyrLys: 3.855 ± 0.338
3.83TyrLeu: 3.83 ± 0.306
1.067TyrMet: 1.067 ± 0.184
2.861TyrAsn: 2.861 ± 0.278
1.915TyrPro: 1.915 ± 0.21
1.358TyrGln: 1.358 ± 0.22
1.818TyrArg: 1.818 ± 0.178
2.691TyrSer: 2.691 ± 0.238
3.273TyrThr: 3.273 ± 0.353
3.612TyrVal: 3.612 ± 0.309
0.582TyrTrp: 0.582 ± 0.141
2.279TyrTyr: 2.279 ± 0.247
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 204 proteins (41249 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski