Amino acid dipeptide frequency for Escherichia virus EPS7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
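As a rough illustration, dipeptide frequencies can be counted as sketched below. The function name `dipeptide_permille`, the use of one-letter codes, the overlapping-window counting, and the normalization by the total number of pairs are assumptions made for this example; the page does not state exactly how its values were computed.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with an overlapping window of width 2,
    and a pair never spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale fractions to per mille, the unit used in the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real EPS7 data): pairs are MA, AA, AK and AA, AL.
freqs = dipeptide_permille(["MAAK", "AAL"])
```

Here `freqs["AA"]` is two out of five pairs, which is 400‰.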

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
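The uniform expectation and the over/under comparison can be sketched as follows. The helper names are hypothetical, and the simple "greater than / less than the uniform expectation" rule is an assumption; the page does not say which threshold it uses for the red and blue marking.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


# 20 amino acids give 400 dipeptides, so 2.5 per mille (0.25%) each.
print(uniform_expectation_permille(20))
# Values taken from the table: AlaAla 8.46 and AlaCys 0.48.
print(classify(8.46), classify(0.48))
```

With 21 symbols (including Xaa) the expectation drops to 1000/441, roughly 2.27‰.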

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.46 ± 1.55
AlaCys: 0.48 ± 0.125
AlaAsp: 3.836 ± 0.343
AlaGlu: 5.652 ± 0.551
AlaPhe: 3.185 ± 0.292
AlaGly: 5.172 ± 0.532
AlaHis: 1.439 ± 0.277
AlaIle: 5.241 ± 0.4
AlaLys: 6.679 ± 0.589
AlaLeu: 6.508 ± 0.515
AlaMet: 2.089 ± 0.304
AlaAsn: 3.699 ± 0.378
AlaPro: 2.363 ± 0.28
AlaGln: 3.151 ± 0.368
AlaArg: 3.117 ± 0.342
AlaSer: 4.761 ± 0.8
AlaThr: 4.487 ± 0.61
AlaVal: 4.419 ± 0.444
AlaTrp: 0.891 ± 0.193
AlaTyr: 2.261 ± 0.28
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.514 ± 0.141
CysCys: 0.171 ± 0.084
CysAsp: 0.617 ± 0.16
CysGlu: 0.685 ± 0.147
CysPhe: 0.617 ± 0.149
CysGly: 0.754 ± 0.178
CysHis: 0.24 ± 0.086
CysIle: 0.651 ± 0.146
CysLys: 0.822 ± 0.208
CysLeu: 0.856 ± 0.148
CysMet: 0.411 ± 0.124
CysAsn: 0.514 ± 0.145
CysPro: 0.308 ± 0.085
CysGln: 0.411 ± 0.151
CysArg: 0.48 ± 0.134
CysSer: 0.822 ± 0.168
CysThr: 0.48 ± 0.111
CysVal: 0.719 ± 0.144
CysTrp: 0.069 ± 0.056
CysTyr: 0.343 ± 0.104
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.693 ± 0.423
AspCys: 0.582 ± 0.152
AspAsp: 3.459 ± 0.424
AspGlu: 4.864 ± 0.388
AspPhe: 2.946 ± 0.325
AspGly: 3.391 ± 0.351
AspHis: 1.165 ± 0.195
AspIle: 4.556 ± 0.46
AspLys: 4.521 ± 0.396
AspLeu: 5.412 ± 0.399
AspMet: 2.466 ± 0.292
AspAsn: 2.569 ± 0.32
AspPro: 2.74 ± 0.282
AspGln: 1.747 ± 0.241
AspArg: 2.637 ± 0.304
AspSer: 3.973 ± 0.472
AspThr: 3.631 ± 0.334
AspVal: 4.282 ± 0.433
AspTrp: 0.822 ± 0.184
AspTyr: 3.151 ± 0.324
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.891 ± 0.532
GluCys: 0.822 ± 0.189
GluAsp: 3.905 ± 0.364
GluGlu: 5.994 ± 0.615
GluPhe: 2.809 ± 0.362
GluGly: 4.11 ± 0.403
GluHis: 1.644 ± 0.234
GluIle: 5.309 ± 0.354
GluLys: 4.761 ± 0.552
GluLeu: 7.159 ± 0.457
GluMet: 2.363 ± 0.269
GluAsn: 2.706 ± 0.336
GluPro: 1.747 ± 0.271
GluGln: 3.151 ± 0.377
GluArg: 3.083 ± 0.343
GluSer: 3.494 ± 0.333
GluThr: 4.419 ± 0.388
GluVal: 4.419 ± 0.322
GluTrp: 1.199 ± 0.215
GluTyr: 3.083 ± 0.34
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.5 ± 0.294
PheCys: 0.377 ± 0.118
PheAsp: 3.117 ± 0.358
PheGlu: 3.048 ± 0.295
PhePhe: 1.37 ± 0.193
PheGly: 2.946 ± 0.381
PheHis: 1.028 ± 0.206
PheIle: 3.014 ± 0.39
PheLys: 3.048 ± 0.306
PheLeu: 2.946 ± 0.285
PheMet: 0.959 ± 0.19
PheAsn: 2.843 ± 0.316
PhePro: 1.918 ± 0.312
PheGln: 1.096 ± 0.19
PheArg: 2.089 ± 0.346
PheSer: 2.5 ± 0.295
PheThr: 2.535 ± 0.325
PheVal: 2.466 ± 0.268
PheTrp: 0.445 ± 0.135
PheTyr: 1.644 ± 0.242
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.836 ± 0.389
GlyCys: 0.993 ± 0.209
GlyAsp: 3.562 ± 0.44
GlyGlu: 5.001 ± 0.398
GlyPhe: 3.014 ± 0.285
GlyGly: 3.083 ± 0.327
GlyHis: 1.233 ± 0.213
GlyIle: 4.521 ± 0.379
GlyLys: 5.104 ± 0.397
GlyLeu: 4.35 ± 0.354
GlyMet: 2.192 ± 0.281
GlyAsn: 3.151 ± 0.355
GlyPro: 0.788 ± 0.154
GlyGln: 2.363 ± 0.287
GlyArg: 2.74 ± 0.341
GlySer: 4.008 ± 0.437
GlyThr: 3.22 ± 0.384
GlyVal: 4.864 ± 0.405
GlyTrp: 1.062 ± 0.187
GlyTyr: 2.774 ± 0.309
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.13 ± 0.232
HisCys: 0.308 ± 0.098
HisAsp: 1.473 ± 0.243
HisGlu: 1.062 ± 0.183
HisPhe: 0.754 ± 0.183
HisGly: 1.302 ± 0.226
HisHis: 0.685 ± 0.162
HisIle: 1.541 ± 0.233
HisLys: 1.267 ± 0.214
HisLeu: 1.61 ± 0.239
HisMet: 0.445 ± 0.146
HisAsn: 0.891 ± 0.202
HisPro: 0.925 ± 0.203
HisGln: 0.582 ± 0.162
HisArg: 0.754 ± 0.181
HisSer: 1.13 ± 0.219
HisThr: 0.925 ± 0.207
HisVal: 1.028 ± 0.202
HisTrp: 0.206 ± 0.092
HisTyr: 0.959 ± 0.186
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.617 ± 0.445
IleCys: 0.822 ± 0.193
IleAsp: 5.275 ± 0.427
IleGlu: 4.145 ± 0.383
IlePhe: 2.774 ± 0.293
IleGly: 4.042 ± 0.356
IleHis: 1.096 ± 0.175
IleIle: 4.35 ± 0.391
IleLys: 4.35 ± 0.385
IleLeu: 5.241 ± 0.516
IleMet: 2.089 ± 0.323
IleAsn: 3.802 ± 0.364
IlePro: 2.809 ± 0.314
IleGln: 2.226 ± 0.275
IleArg: 2.843 ± 0.234
IleSer: 4.316 ± 0.361
IleThr: 4.624 ± 0.42
IleVal: 4.145 ± 0.373
IleTrp: 0.822 ± 0.195
IleTyr: 2.226 ± 0.315
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.2 ± 0.596
LysCys: 0.514 ± 0.137
LysAsp: 5.172 ± 0.44
LysGlu: 5.241 ± 0.438
LysPhe: 3.391 ± 0.311
LysGly: 3.391 ± 0.342
LysHis: 1.233 ± 0.204
LysIle: 3.631 ± 0.415
LysLys: 4.624 ± 0.475
LysLeu: 6.885 ± 0.472
LysMet: 2.535 ± 0.328
LysAsn: 3.22 ± 0.34
LysPro: 2.398 ± 0.375
LysGln: 3.322 ± 0.345
LysArg: 3.528 ± 0.337
LysSer: 3.768 ± 0.38
LysThr: 4.145 ± 0.439
LysVal: 4.864 ± 0.422
LysTrp: 0.822 ± 0.197
LysTyr: 3.185 ± 0.296
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.57 ± 0.578
LeuCys: 0.822 ± 0.141
LeuAsp: 6.405 ± 0.514
LeuGlu: 6.885 ± 0.529
LeuPhe: 2.946 ± 0.342
LeuGly: 5.104 ± 0.433
LeuHis: 1.85 ± 0.283
LeuIle: 5.241 ± 0.481
LeuLys: 5.583 ± 0.454
LeuLeu: 6.782 ± 0.548
LeuMet: 2.089 ± 0.302
LeuAsn: 5.069 ± 0.45
LeuPro: 3.254 ± 0.396
LeuGln: 3.151 ± 0.374
LeuArg: 3.562 ± 0.322
LeuSer: 4.898 ± 0.456
LeuThr: 4.179 ± 0.373
LeuVal: 5.001 ± 0.453
LeuTrp: 0.754 ± 0.18
LeuTyr: 2.672 ± 0.305
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.781 ± 0.255
MetCys: 0.24 ± 0.086
MetAsp: 1.439 ± 0.211
MetGlu: 2.226 ± 0.336
MetPhe: 1.028 ± 0.232
MetGly: 1.644 ± 0.256
MetHis: 0.514 ± 0.139
MetIle: 2.021 ± 0.296
MetLys: 2.809 ± 0.335
MetLeu: 2.809 ± 0.37
MetMet: 0.617 ± 0.149
MetAsn: 0.959 ± 0.177
MetPro: 0.754 ± 0.15
MetGln: 1.404 ± 0.217
MetArg: 0.891 ± 0.168
MetSer: 2.363 ± 0.3
MetThr: 1.987 ± 0.242
MetVal: 1.404 ± 0.257
MetTrp: 0.377 ± 0.116
MetTyr: 1.267 ± 0.203
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.836 ± 0.495
AsnCys: 0.617 ± 0.172
AsnAsp: 2.843 ± 0.352
AsnGlu: 2.603 ± 0.317
AsnPhe: 1.678 ± 0.219
AsnGly: 3.905 ± 0.439
AsnHis: 0.891 ± 0.22
AsnIle: 3.939 ± 0.389
AsnLys: 3.597 ± 0.367
AsnLeu: 4.693 ± 0.418
AsnMet: 1.199 ± 0.187
AsnAsn: 1.918 ± 0.306
AsnPro: 2.398 ± 0.292
AsnGln: 1.713 ± 0.238
AsnArg: 2.5 ± 0.316
AsnSer: 3.631 ± 0.429
AsnThr: 2.569 ± 0.298
AsnVal: 3.391 ± 0.311
AsnTrp: 0.617 ± 0.159
AsnTyr: 1.747 ± 0.262
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.192 ± 0.231
ProCys: 0.308 ± 0.093
ProAsp: 2.432 ± 0.323
ProGlu: 3.597 ± 0.364
ProPhe: 1.541 ± 0.248
ProGly: 1.713 ± 0.223
ProHis: 0.411 ± 0.132
ProIle: 2.329 ± 0.34
ProLys: 1.815 ± 0.26
ProLeu: 2.226 ± 0.277
ProMet: 0.548 ± 0.134
ProAsn: 2.329 ± 0.283
ProPro: 1.37 ± 0.26
ProGln: 0.925 ± 0.17
ProArg: 1.781 ± 0.262
ProSer: 1.644 ± 0.249
ProThr: 1.815 ± 0.245
ProVal: 2.672 ± 0.301
ProTrp: 0.445 ± 0.114
ProTyr: 1.713 ± 0.255
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.288 ± 0.495
GlnCys: 0.48 ± 0.118
GlnAsp: 2.124 ± 0.278
GlnGlu: 2.672 ± 0.361
GlnPhe: 1.781 ± 0.218
GlnGly: 1.678 ± 0.203
GlnHis: 0.548 ± 0.157
GlnIle: 1.918 ± 0.287
GlnLys: 2.74 ± 0.326
GlnLeu: 3.905 ± 0.371
GlnMet: 0.856 ± 0.237
GlnAsn: 1.61 ± 0.254
GlnPro: 0.445 ± 0.108
GlnGln: 2.055 ± 0.275
GlnArg: 1.678 ± 0.266
GlnSer: 2.226 ± 0.287
GlnThr: 1.85 ± 0.246
GlnVal: 3.083 ± 0.361
GlnTrp: 0.548 ± 0.15
GlnTyr: 1.507 ± 0.214
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.774 ± 0.33
ArgCys: 0.377 ± 0.125
ArgAsp: 3.048 ± 0.284
ArgGlu: 2.98 ± 0.32
ArgPhe: 1.85 ± 0.238
ArgGly: 3.562 ± 0.333
ArgHis: 0.651 ± 0.163
ArgIle: 3.22 ± 0.325
ArgLys: 3.151 ± 0.408
ArgLeu: 3.528 ± 0.365
ArgMet: 1.473 ± 0.245
ArgAsn: 2.261 ± 0.273
ArgPro: 1.302 ± 0.209
ArgGln: 1.678 ± 0.228
ArgArg: 1.987 ± 0.321
ArgSer: 2.466 ± 0.298
ArgThr: 2.774 ± 0.293
ArgVal: 3.185 ± 0.317
ArgTrp: 0.445 ± 0.119
ArgTyr: 1.747 ± 0.303
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.967 ± 1.197
SerCys: 0.651 ± 0.143
SerAsp: 3.288 ± 0.343
SerGlu: 4.076 ± 0.642
SerPhe: 2.946 ± 0.293
SerGly: 4.658 ± 0.497
SerHis: 0.959 ± 0.146
SerIle: 4.556 ± 0.416
SerLys: 4.453 ± 0.373
SerLeu: 5.241 ± 0.379
SerMet: 1.576 ± 0.241
SerAsn: 3.083 ± 0.307
SerPro: 1.815 ± 0.248
SerGln: 1.678 ± 0.229
SerArg: 2.74 ± 0.311
SerSer: 4.179 ± 0.461
SerThr: 3.322 ± 0.333
SerVal: 3.939 ± 0.328
SerTrp: 1.028 ± 0.207
SerTyr: 2.398 ± 0.297
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.453 ± 0.469
ThrCys: 0.445 ± 0.122
ThrAsp: 3.322 ± 0.313
ThrGlu: 3.699 ± 0.347
ThrPhe: 2.261 ± 0.279
ThrGly: 4.521 ± 0.502
ThrHis: 0.891 ± 0.185
ThrIle: 3.905 ± 0.377
ThrLys: 4.042 ± 0.372
ThrLeu: 4.145 ± 0.451
ThrMet: 1.473 ± 0.227
ThrAsn: 3.562 ± 0.378
ThrPro: 2.158 ± 0.223
ThrGln: 2.055 ± 0.254
ThrArg: 2.672 ± 0.292
ThrSer: 3.768 ± 0.498
ThrThr: 2.911 ± 0.338
ThrVal: 4.042 ± 0.48
ThrTrp: 0.685 ± 0.149
ThrTyr: 1.987 ± 0.246
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.727 ± 0.351
ValCys: 0.548 ± 0.135
ValAsp: 4.453 ± 0.389
ValGlu: 4.384 ± 0.378
ValPhe: 2.569 ± 0.313
ValGly: 3.734 ± 0.407
ValHis: 1.267 ± 0.222
ValIle: 4.35 ± 0.372
ValLys: 4.932 ± 0.386
ValLeu: 5.104 ± 0.413
ValMet: 1.781 ± 0.238
ValAsn: 3.185 ± 0.379
ValPro: 2.774 ± 0.266
ValGln: 2.295 ± 0.317
ValArg: 2.877 ± 0.303
ValSer: 4.316 ± 0.458
ValThr: 4.11 ± 0.436
ValVal: 4.556 ± 0.504
ValTrp: 0.48 ± 0.151
ValTyr: 2.843 ± 0.32
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.377 ± 0.119
TrpCys: 0.274 ± 0.099
TrpAsp: 1.165 ± 0.193
TrpGlu: 1.028 ± 0.22
TrpPhe: 0.445 ± 0.136
TrpGly: 0.856 ± 0.183
TrpHis: 0.171 ± 0.083
TrpIle: 0.719 ± 0.117
TrpLys: 1.062 ± 0.207
TrpLeu: 1.439 ± 0.252
TrpMet: 0.343 ± 0.108
TrpAsn: 0.582 ± 0.136
TrpPro: 0.343 ± 0.112
TrpGln: 0.514 ± 0.15
TrpArg: 0.582 ± 0.139
TrpSer: 0.685 ± 0.199
TrpThr: 0.788 ± 0.146
TrpVal: 0.548 ± 0.142
TrpTrp: 0.24 ± 0.102
TrpTyr: 0.308 ± 0.098
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.083 ± 0.31
TyrCys: 0.651 ± 0.139
TyrAsp: 2.672 ± 0.326
TyrGlu: 2.226 ± 0.258
TyrPhe: 1.987 ± 0.272
TyrGly: 2.295 ± 0.326
TyrHis: 1.062 ± 0.158
TyrIle: 2.603 ± 0.322
TyrLys: 2.706 ± 0.329
TyrLeu: 2.98 ± 0.342
TyrMet: 0.993 ± 0.175
TyrAsn: 2.329 ± 0.27
TyrPro: 1.199 ± 0.199
TyrGln: 1.473 ± 0.213
TyrArg: 1.918 ± 0.269
TyrSer: 2.603 ± 0.263
TyrThr: 2.226 ± 0.272
TyrVal: 2.261 ± 0.265
TyrTrp: 0.514 ± 0.134
TyrTyr: 1.576 ± 0.234
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 170 proteins (29196 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
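A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, and the choice of the standard deviation as the error measure are assumptions for illustration; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates.

    Assumption: the error is the standard deviation of the replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences (not real EPS7 data).
toy = ["MAAK", "AAL", "MKLV", "AAAA"]
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so pairs that cluster in a few proteins show up as a larger error.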

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski