Amino acid dipeptide frequency for Escherichia phage vB_EcoM_IME392

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that are overrepresented are marked in red in the table and those that are underrepresented are marked in blue.
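The calculation described above can be sketched in a few lines of Python. This is only a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_permille`, the toy sequences, the use of overlapping pairs within each protein, and the normalization by the total number of counted pairs are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted within each protein
    (never across protein boundaries) and normalized by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MKKLA", "MKA"]
freqs = dipeptide_permille(toy)

# Expected frequency if all 400 standard dipeptides were equally likely.
uniform = 1000.0 / 400

for dp, value in sorted(freqs.items()):
    label = "over" if value > uniform else "under"
    print(f"{dp}: {value:.1f} per mille ({label}represented vs. {uniform} per mille)")
```

In the toy input, MK occurs in 2 of the 6 counted pairs, so its frequency is about 333.3‰; a real proteome gives values much closer to the 2.5‰ baseline.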

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (for example, 5.26‰ = 0.00526, i.e. 0.526%).

For more information see sequence space article on Wikipedia.

Dipeptide frequencies (‰), value ± error; rows give the first residue, columns the second.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.26 ± 0.56 | 0.546 ± 0.137 | 4.44 ± 0.379 | 4.611 ± 0.534 | 2.186 ± 0.316 | 4.987 ± 0.455 | 1.093 ± 0.191 | 4.474 ± 0.329 | 5.397 ± 0.445 | 5.499 ± 0.418 | 1.879 ± 0.239 | 3.894 ± 0.38 | 2.425 ± 0.29 | 1.947 ± 0.272 | 2.664 ± 0.321 | 4.953 ± 0.498 | 4.611 ± 0.38 | 4.748 ± 0.567 | 0.82 ± 0.181 | 2.527 ± 0.272 | 0.0 ± 0.0
Cys | 0.478 ± 0.122 | 0.205 ± 0.084 | 0.478 ± 0.134 | 0.786 ± 0.174 | 0.137 ± 0.068 | 0.888 ± 0.157 | 0.205 ± 0.103 | 0.444 ± 0.127 | 0.683 ± 0.182 | 0.546 ± 0.147 | 0.273 ± 0.092 | 0.581 ± 0.129 | 0.751 ± 0.135 | 0.478 ± 0.118 | 0.649 ± 0.163 | 0.581 ± 0.147 | 0.786 ± 0.17 | 0.512 ± 0.11 | 0.137 ± 0.071 | 0.581 ± 0.188 | 0.0 ± 0.0
Asp | 4.713 ± 0.356 | 0.717 ± 0.129 | 3.757 ± 0.342 | 4.099 ± 0.434 | 2.562 ± 0.332 | 6.524 ± 0.548 | 1.161 ± 0.2 | 3.689 ± 0.282 | 4.167 ± 0.444 | 6.216 ± 0.55 | 2.015 ± 0.228 | 2.767 ± 0.351 | 3.45 ± 0.376 | 2.493 ± 0.303 | 3.381 ± 0.357 | 4.167 ± 0.407 | 3.928 ± 0.459 | 4.201 ± 0.362 | 1.059 ± 0.189 | 2.63 ± 0.304 | 0.0 ± 0.0
Glu | 4.304 ± 0.422 | 0.683 ± 0.137 | 4.713 ± 0.423 | 4.611 ± 0.486 | 2.903 ± 0.268 | 3.586 ± 0.324 | 1.537 ± 0.21 | 4.406 ± 0.392 | 2.835 ± 0.316 | 6.319 ± 0.525 | 1.571 ± 0.271 | 2.459 ± 0.279 | 1.708 ± 0.235 | 1.913 ± 0.239 | 2.22 ± 0.295 | 4.235 ± 0.531 | 3.279 ± 0.281 | 5.362 ± 0.464 | 0.854 ± 0.165 | 3.04 ± 0.3 | 0.0 ± 0.0
Phe | 2.118 ± 0.221 | 0.444 ± 0.109 | 2.767 ± 0.313 | 1.776 ± 0.189 | 1.674 ± 0.233 | 2.732 ± 0.344 | 0.615 ± 0.127 | 2.391 ± 0.306 | 2.63 ± 0.316 | 2.869 ± 0.317 | 0.956 ± 0.175 | 2.049 ± 0.238 | 1.503 ± 0.261 | 1.025 ± 0.172 | 2.083 ± 0.225 | 3.176 ± 0.387 | 2.903 ± 0.273 | 2.288 ± 0.333 | 0.478 ± 0.121 | 1.674 ± 0.255 | 0.0 ± 0.0
Gly | 5.089 ± 0.543 | 0.683 ± 0.132 | 4.918 ± 0.454 | 4.03 ± 0.387 | 2.732 ± 0.316 | 5.089 ± 0.537 | 0.854 ± 0.205 | 4.85 ± 0.412 | 4.406 ± 0.399 | 5.806 ± 0.545 | 2.425 ± 0.217 | 3.45 ± 0.321 | 1.4 ± 0.216 | 2.732 ± 0.296 | 3.825 ± 0.483 | 4.713 ± 0.392 | 4.953 ± 0.539 | 5.397 ± 0.486 | 1.025 ± 0.186 | 3.142 ± 0.297 | 0.0 ± 0.0
His | 0.956 ± 0.162 | 0.546 ± 0.125 | 1.23 ± 0.195 | 1.195 ± 0.208 | 0.751 ± 0.183 | 1.571 ± 0.219 | 0.546 ± 0.121 | 1.435 ± 0.204 | 0.956 ± 0.162 | 1.537 ± 0.223 | 0.649 ± 0.134 | 0.581 ± 0.155 | 1.093 ± 0.175 | 0.546 ± 0.148 | 0.956 ± 0.19 | 1.059 ± 0.192 | 0.786 ± 0.187 | 1.093 ± 0.196 | 0.41 ± 0.097 | 0.717 ± 0.168 | 0.0 ± 0.0
Ile | 4.235 ± 0.337 | 0.888 ± 0.2 | 4.235 ± 0.374 | 3.723 ± 0.367 | 1.879 ± 0.237 | 4.474 ± 0.377 | 1.127 ± 0.23 | 3.62 ± 0.372 | 4.099 ± 0.482 | 4.816 ± 0.458 | 1.605 ± 0.245 | 4.713 ± 0.405 | 3.279 ± 0.321 | 2.254 ± 0.269 | 3.723 ± 0.425 | 3.894 ± 0.409 | 4.543 ± 0.355 | 3.484 ± 0.318 | 0.717 ± 0.147 | 2.288 ± 0.29 | 0.0 ± 0.0
Lys | 4.064 ± 0.387 | 0.649 ± 0.151 | 3.757 ± 0.366 | 3.996 ± 0.41 | 2.357 ± 0.393 | 3.791 ± 0.357 | 1.264 ± 0.241 | 4.133 ± 0.445 | 2.323 ± 0.287 | 5.192 ± 0.516 | 2.698 ± 0.324 | 2.391 ± 0.305 | 1.674 ± 0.25 | 1.742 ± 0.231 | 1.469 ± 0.243 | 4.782 ± 0.507 | 3.04 ± 0.359 | 4.679 ± 0.44 | 1.093 ± 0.171 | 2.869 ± 0.292 | 0.0 ± 0.0
Leu | 6.797 ± 0.463 | 0.854 ± 0.156 | 6.694 ± 0.567 | 4.85 ± 0.485 | 2.835 ± 0.28 | 5.567 ± 0.448 | 1.708 ± 0.219 | 4.645 ± 0.497 | 4.338 ± 0.433 | 5.875 ± 0.633 | 2.254 ± 0.309 | 4.85 ± 0.419 | 3.689 ± 0.358 | 2.835 ± 0.272 | 4.133 ± 0.347 | 6.319 ± 0.54 | 5.841 ± 0.429 | 5.636 ± 0.327 | 1.264 ± 0.193 | 2.562 ± 0.343 | 0.0 ± 0.0
Met | 1.776 ± 0.203 | 0.205 ± 0.097 | 2.015 ± 0.272 | 1.81 ± 0.245 | 1.23 ± 0.206 | 2.186 ± 0.298 | 0.546 ± 0.131 | 1.605 ± 0.205 | 1.639 ± 0.243 | 2.391 ± 0.282 | 1.059 ± 0.206 | 1.161 ± 0.226 | 0.717 ± 0.14 | 0.683 ± 0.143 | 0.854 ± 0.137 | 2.698 ± 0.284 | 1.776 ± 0.229 | 2.288 ± 0.32 | 0.273 ± 0.113 | 1.093 ± 0.22 | 0.0 ± 0.0
Asn | 2.869 ± 0.268 | 0.512 ± 0.12 | 2.083 ± 0.289 | 2.835 ± 0.326 | 2.049 ± 0.255 | 3.996 ± 0.344 | 0.991 ± 0.155 | 3.074 ± 0.354 | 2.596 ± 0.309 | 4.406 ± 0.356 | 1.4 ± 0.222 | 2.493 ± 0.361 | 2.972 ± 0.355 | 1.708 ± 0.251 | 2.767 ± 0.294 | 4.509 ± 0.429 | 2.698 ± 0.247 | 3.552 ± 0.321 | 1.059 ± 0.185 | 2.63 ± 0.28 | 0.0 ± 0.0
Pro | 2.527 ± 0.435 | 0.239 ± 0.074 | 3.518 ± 0.42 | 4.064 ± 0.421 | 1.605 ± 0.241 | 2.596 ± 0.268 | 0.888 ± 0.201 | 2.732 ± 0.359 | 2.63 ± 0.285 | 2.323 ± 0.306 | 0.82 ± 0.141 | 2.254 ± 0.312 | 1.127 ± 0.257 | 0.786 ± 0.152 | 1.503 ± 0.226 | 2.937 ± 0.293 | 2.493 ± 0.338 | 3.211 ± 0.423 | 0.478 ± 0.139 | 1.332 ± 0.227 | 0.0 ± 0.0
Gln | 1.639 ± 0.22 | 0.205 ± 0.074 | 2.049 ± 0.253 | 2.186 ± 0.256 | 1.708 ± 0.203 | 1.879 ± 0.228 | 0.581 ± 0.137 | 2.357 ± 0.295 | 1.879 ± 0.234 | 3.176 ± 0.352 | 1.093 ± 0.186 | 1.742 ± 0.232 | 0.751 ± 0.139 | 1.195 ± 0.197 | 1.23 ± 0.185 | 2.459 ± 0.294 | 1.913 ± 0.243 | 2.493 ± 0.262 | 0.581 ± 0.147 | 1.537 ± 0.283 | 0.0 ± 0.0
Arg | 3.176 ± 0.381 | 0.376 ± 0.116 | 3.108 ± 0.319 | 3.484 ± 0.385 | 2.049 ± 0.231 | 3.074 ± 0.4 | 0.615 ± 0.156 | 2.972 ± 0.37 | 2.664 ± 0.376 | 4.235 ± 0.432 | 1.264 ± 0.234 | 2.186 ± 0.275 | 1.503 ± 0.207 | 1.947 ± 0.207 | 2.459 ± 0.342 | 3.586 ± 0.304 | 2.254 ± 0.231 | 3.074 ± 0.312 | 0.649 ± 0.187 | 1.981 ± 0.229 | 0.0 ± 0.0
Ser | 4.679 ± 0.379 | 0.342 ± 0.105 | 5.157 ± 0.411 | 3.381 ± 0.351 | 2.493 ± 0.242 | 5.499 ± 0.514 | 1.23 ± 0.178 | 5.055 ± 0.521 | 4.099 ± 0.413 | 6.182 ± 0.487 | 1.742 ± 0.236 | 3.45 ± 0.362 | 2.767 ± 0.271 | 2.049 ± 0.238 | 3.45 ± 0.306 | 4.304 ± 0.404 | 4.611 ± 0.544 | 5.841 ± 0.429 | 1.025 ± 0.164 | 2.903 ± 0.34 | 0.0 ± 0.0
Thr | 4.748 ± 0.519 | 0.615 ± 0.163 | 4.235 ± 0.37 | 3.347 ± 0.332 | 2.288 ± 0.282 | 5.431 ± 0.543 | 1.025 ± 0.193 | 4.44 ± 0.46 | 3.86 ± 0.347 | 5.601 ± 0.505 | 1.093 ± 0.191 | 2.527 ± 0.24 | 3.074 ± 0.294 | 2.22 ± 0.257 | 2.903 ± 0.325 | 3.484 ± 0.341 | 3.45 ± 0.388 | 4.884 ± 0.473 | 1.025 ± 0.207 | 2.391 ± 0.27 | 0.0 ± 0.0
Val | 5.533 ± 0.428 | 0.854 ± 0.167 | 4.44 ± 0.318 | 4.679 ± 0.522 | 2.425 ± 0.331 | 3.928 ± 0.385 | 1.4 ± 0.221 | 4.269 ± 0.446 | 3.928 ± 0.33 | 5.601 ± 0.423 | 1.879 ± 0.288 | 4.679 ± 0.466 | 3.996 ± 0.324 | 1.913 ± 0.306 | 3.791 ± 0.312 | 4.85 ± 0.518 | 5.533 ± 0.56 | 4.782 ± 0.519 | 0.683 ± 0.152 | 1.913 ± 0.253 | 0.0 ± 0.0
Trp | 0.956 ± 0.196 | 0.068 ± 0.052 | 1.195 ± 0.234 | 0.888 ± 0.159 | 0.615 ± 0.15 | 0.991 ± 0.191 | 0.376 ± 0.129 | 0.854 ± 0.196 | 0.615 ± 0.137 | 1.503 ± 0.253 | 0.41 ± 0.116 | 0.615 ± 0.146 | 0.444 ± 0.123 | 0.239 ± 0.078 | 0.649 ± 0.161 | 0.786 ± 0.158 | 0.888 ± 0.151 | 1.23 ± 0.212 | 0.205 ± 0.081 | 0.82 ± 0.199 | 0.0 ± 0.0
Tyr | 3.04 ± 0.385 | 0.546 ± 0.124 | 2.903 ± 0.332 | 2.22 ± 0.289 | 1.674 ± 0.216 | 2.732 ± 0.329 | 0.854 ± 0.166 | 2.152 ± 0.257 | 2.22 ± 0.224 | 3.45 ± 0.333 | 0.888 ± 0.203 | 2.357 ± 0.218 | 1.742 ± 0.26 | 1.947 ± 0.299 | 1.947 ± 0.245 | 2.767 ± 0.301 | 2.357 ± 0.26 | 2.323 ± 0.315 | 0.512 ± 0.111 | 1.708 ± 0.219 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 85 proteins (29279 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
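One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement and the frequency is recomputed for each resample. The sketch below illustrates that idea under several assumptions: the function name `bootstrap_error`, the toy sequences, the use of the sample standard deviation as the reported error, and the normalization by the number of counted pairs are not taken from the Proteome-pI source.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Assumption: each bootstrap replicate draws len(proteins) whole proteins
    with replacement, and the error is the standard deviation of the
    replicate frequencies.
    """
    rng = random.Random(seed)
    # Count overlapping adjacent pairs once per protein, then reuse the counts.
    per_protein = [Counter(a + b for a, b in zip(s, s[1:])) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy proteins, not taken from the phage proteome.
toy = ["MKKLA", "AAAA", "MKMK"]
print(f"MK error: {bootstrap_error(toy, 'MK'):.1f} per mille")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins in the proteome.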

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski