Amino acid dipepetide frequency for Escherichia phage ECD7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.46AlaAla: 4.46 ± 0.37
0.757AlaCys: 0.757 ± 0.136
4.201AlaAsp: 4.201 ± 0.291
4.599AlaGlu: 4.599 ± 0.323
2.728AlaPhe: 2.728 ± 0.219
4.718AlaGly: 4.718 ± 0.434
1.433AlaHis: 1.433 ± 0.196
4.778AlaIle: 4.778 ± 0.355
5.276AlaLys: 5.276 ± 0.374
5.276AlaLeu: 5.276 ± 0.325
2.389AlaMet: 2.389 ± 0.194
3.524AlaAsn: 3.524 ± 0.278
1.891AlaPro: 1.891 ± 0.188
2.09AlaGln: 2.09 ± 0.237
3.185AlaArg: 3.185 ± 0.269
3.862AlaSer: 3.862 ± 0.315
4.141AlaThr: 4.141 ± 0.454
4.4AlaVal: 4.4 ± 0.309
0.816AlaTrp: 0.816 ± 0.111
2.668AlaTyr: 2.668 ± 0.226
0.0AlaXaa: 0.0 ± 0.0
Cys
0.757CysAla: 0.757 ± 0.126
0.299CysCys: 0.299 ± 0.079
0.976CysAsp: 0.976 ± 0.13
0.876CysGlu: 0.876 ± 0.158
0.836CysPhe: 0.836 ± 0.125
1.035CysGly: 1.035 ± 0.154
0.338CysHis: 0.338 ± 0.087
0.717CysIle: 0.717 ± 0.125
0.976CysLys: 0.976 ± 0.196
0.976CysLeu: 0.976 ± 0.142
0.358CysMet: 0.358 ± 0.085
0.717CysAsn: 0.717 ± 0.117
0.577CysPro: 0.577 ± 0.118
0.239CysGln: 0.239 ± 0.064
0.438CysArg: 0.438 ± 0.088
0.776CysSer: 0.776 ± 0.124
0.697CysThr: 0.697 ± 0.135
0.916CysVal: 0.916 ± 0.126
0.139CysTrp: 0.139 ± 0.057
0.358CysTyr: 0.358 ± 0.085
0.0CysXaa: 0.0 ± 0.0
Asp
4.46AspAla: 4.46 ± 0.315
0.796AspCys: 0.796 ± 0.125
4.241AspAsp: 4.241 ± 0.28
4.619AspGlu: 4.619 ± 0.303
3.285AspPhe: 3.285 ± 0.237
4.758AspGly: 4.758 ± 0.301
1.374AspHis: 1.374 ± 0.182
4.38AspIle: 4.38 ± 0.289
4.539AspLys: 4.539 ± 0.368
5.455AspLeu: 5.455 ± 0.308
1.871AspMet: 1.871 ± 0.169
3.146AspAsn: 3.146 ± 0.242
2.787AspPro: 2.787 ± 0.213
1.891AspGln: 1.891 ± 0.161
3.723AspArg: 3.723 ± 0.322
3.325AspSer: 3.325 ± 0.256
3.106AspThr: 3.106 ± 0.301
4.36AspVal: 4.36 ± 0.284
1.115AspTrp: 1.115 ± 0.161
3.205AspTyr: 3.205 ± 0.28
0.0AspXaa: 0.0 ± 0.0
Glu
4.599GluAla: 4.599 ± 0.345
0.995GluCys: 0.995 ± 0.154
4.34GluAsp: 4.34 ± 0.281
5.893GluGlu: 5.893 ± 0.353
3.086GluPhe: 3.086 ± 0.238
4.46GluGly: 4.46 ± 0.342
1.513GluHis: 1.513 ± 0.193
5.853GluIle: 5.853 ± 0.377
5.694GluLys: 5.694 ± 0.413
6.65GluLeu: 6.65 ± 0.382
2.09GluMet: 2.09 ± 0.21
4.181GluAsn: 4.181 ± 0.284
1.433GluPro: 1.433 ± 0.17
2.329GluGln: 2.329 ± 0.251
3.544GluArg: 3.544 ± 0.295
4.44GluSer: 4.44 ± 0.305
4.201GluThr: 4.201 ± 0.301
4.758GluVal: 4.758 ± 0.31
1.254GluTrp: 1.254 ± 0.207
3.643GluTyr: 3.643 ± 0.333
0.0GluXaa: 0.0 ± 0.0
Phe
2.747PheAla: 2.747 ± 0.291
0.637PheCys: 0.637 ± 0.127
3.703PheAsp: 3.703 ± 0.265
3.544PheGlu: 3.544 ± 0.282
1.473PhePhe: 1.473 ± 0.185
2.867PheGly: 2.867 ± 0.241
0.697PheHis: 0.697 ± 0.111
3.166PheIle: 3.166 ± 0.256
3.544PheLys: 3.544 ± 0.28
2.628PheLeu: 2.628 ± 0.26
1.453PheMet: 1.453 ± 0.17
2.608PheAsn: 2.608 ± 0.204
1.095PhePro: 1.095 ± 0.154
1.175PheGln: 1.175 ± 0.139
2.09PheArg: 2.09 ± 0.199
3.006PheSer: 3.006 ± 0.258
2.847PheThr: 2.847 ± 0.239
2.747PheVal: 2.747 ± 0.267
0.398PheTrp: 0.398 ± 0.085
1.712PheTyr: 1.712 ± 0.163
0.0PheXaa: 0.0 ± 0.0
Gly
3.862GlyAla: 3.862 ± 0.36
0.976GlyCys: 0.976 ± 0.153
4.539GlyAsp: 4.539 ± 0.334
4.241GlyGlu: 4.241 ± 0.313
2.847GlyPhe: 2.847 ± 0.248
3.803GlyGly: 3.803 ± 0.59
1.155GlyHis: 1.155 ± 0.157
4.201GlyIle: 4.201 ± 0.371
4.539GlyLys: 4.539 ± 0.32
4.798GlyLeu: 4.798 ± 0.328
1.752GlyMet: 1.752 ± 0.204
3.444GlyAsn: 3.444 ± 0.358
0.717GlyPro: 0.717 ± 0.12
1.931GlyGln: 1.931 ± 0.186
3.185GlyArg: 3.185 ± 0.31
3.763GlySer: 3.763 ± 0.337
3.424GlyThr: 3.424 ± 0.354
5.017GlyVal: 5.017 ± 0.397
0.956GlyTrp: 0.956 ± 0.137
2.986GlyTyr: 2.986 ± 0.268
0.0GlyXaa: 0.0 ± 0.0
His
1.254HisAla: 1.254 ± 0.177
0.279HisCys: 0.279 ± 0.089
1.175HisAsp: 1.175 ± 0.15
1.334HisGlu: 1.334 ± 0.166
0.936HisPhe: 0.936 ± 0.125
1.613HisGly: 1.613 ± 0.17
0.577HisHis: 0.577 ± 0.11
1.175HisIle: 1.175 ± 0.166
1.195HisLys: 1.195 ± 0.17
1.394HisLeu: 1.394 ± 0.148
0.378HisMet: 0.378 ± 0.083
1.175HisAsn: 1.175 ± 0.143
0.916HisPro: 0.916 ± 0.172
0.577HisGln: 0.577 ± 0.107
0.896HisArg: 0.896 ± 0.133
1.234HisSer: 1.234 ± 0.155
1.155HisThr: 1.155 ± 0.148
1.394HisVal: 1.394 ± 0.162
0.338HisTrp: 0.338 ± 0.095
0.995HisTyr: 0.995 ± 0.171
0.0HisXaa: 0.0 ± 0.0
Ile
5.355IleAla: 5.355 ± 0.407
0.776IleCys: 0.776 ± 0.131
5.336IleAsp: 5.336 ± 0.344
5.296IleGlu: 5.296 ± 0.315
2.11IlePhe: 2.11 ± 0.2
3.763IleGly: 3.763 ± 0.264
1.334IleHis: 1.334 ± 0.182
4.38IleIle: 4.38 ± 0.31
5.415IleLys: 5.415 ± 0.412
4.34IleLeu: 4.34 ± 0.316
2.329IleMet: 2.329 ± 0.236
4.022IleAsn: 4.022 ± 0.292
2.588IlePro: 2.588 ± 0.219
2.349IleGln: 2.349 ± 0.212
3.504IleArg: 3.504 ± 0.269
3.643IleSer: 3.643 ± 0.255
4.957IleThr: 4.957 ± 0.313
4.858IleVal: 4.858 ± 0.281
0.677IleTrp: 0.677 ± 0.128
2.389IleTyr: 2.389 ± 0.203
0.0IleXaa: 0.0 ± 0.0
Lys
5.515LysAla: 5.515 ± 0.411
0.956LysCys: 0.956 ± 0.184
4.698LysAsp: 4.698 ± 0.412
6.311LysGlu: 6.311 ± 0.427
3.325LysPhe: 3.325 ± 0.313
4.141LysGly: 4.141 ± 0.271
1.593LysHis: 1.593 ± 0.185
5.296LysIle: 5.296 ± 0.311
5.097LysLys: 5.097 ± 0.331
5.793LysLeu: 5.793 ± 0.312
2.807LysMet: 2.807 ± 0.196
3.942LysAsn: 3.942 ± 0.292
2.847LysPro: 2.847 ± 0.237
3.126LysGln: 3.126 ± 0.236
3.902LysArg: 3.902 ± 0.344
3.683LysSer: 3.683 ± 0.242
4.619LysThr: 4.619 ± 0.29
5.057LysVal: 5.057 ± 0.341
0.995LysTrp: 0.995 ± 0.124
3.066LysTyr: 3.066 ± 0.296
0.0LysXaa: 0.0 ± 0.0
Leu
5.634LeuAla: 5.634 ± 0.335
0.976LeuCys: 0.976 ± 0.155
4.917LeuAsp: 4.917 ± 0.299
5.734LeuGlu: 5.734 ± 0.327
2.907LeuPhe: 2.907 ± 0.231
3.902LeuGly: 3.902 ± 0.267
1.095LeuHis: 1.095 ± 0.144
4.539LeuIle: 4.539 ± 0.288
6.132LeuLys: 6.132 ± 0.41
4.34LeuLeu: 4.34 ± 0.326
2.648LeuMet: 2.648 ± 0.269
3.743LeuAsn: 3.743 ± 0.247
2.947LeuPro: 2.947 ± 0.224
2.23LeuGln: 2.23 ± 0.19
3.703LeuArg: 3.703 ± 0.267
4.499LeuSer: 4.499 ± 0.265
4.4LeuThr: 4.4 ± 0.279
4.161LeuVal: 4.161 ± 0.334
0.757LeuTrp: 0.757 ± 0.14
3.225LeuTyr: 3.225 ± 0.273
0.0LeuXaa: 0.0 ± 0.0
Met
2.011MetAla: 2.011 ± 0.225
0.438MetCys: 0.438 ± 0.097
1.473MetAsp: 1.473 ± 0.179
1.852MetGlu: 1.852 ± 0.165
1.573MetPhe: 1.573 ± 0.181
1.473MetGly: 1.473 ± 0.203
0.538MetHis: 0.538 ± 0.116
2.489MetIle: 2.489 ± 0.213
2.986MetLys: 2.986 ± 0.266
2.509MetLeu: 2.509 ± 0.229
1.055MetMet: 1.055 ± 0.147
1.473MetAsn: 1.473 ± 0.178
0.757MetPro: 0.757 ± 0.111
1.294MetGln: 1.294 ± 0.153
1.354MetArg: 1.354 ± 0.158
1.991MetSer: 1.991 ± 0.251
1.752MetThr: 1.752 ± 0.185
2.031MetVal: 2.031 ± 0.202
0.498MetTrp: 0.498 ± 0.108
0.757MetTyr: 0.757 ± 0.12
0.0MetXaa: 0.0 ± 0.0
Asn
3.922AsnAla: 3.922 ± 0.288
0.597AsnCys: 0.597 ± 0.123
3.245AsnAsp: 3.245 ± 0.287
3.365AsnGlu: 3.365 ± 0.247
2.309AsnPhe: 2.309 ± 0.219
4.002AsnGly: 4.002 ± 0.292
1.195AsnHis: 1.195 ± 0.18
4.061AsnIle: 4.061 ± 0.279
3.822AsnLys: 3.822 ± 0.245
3.603AsnLeu: 3.603 ± 0.287
1.394AsnMet: 1.394 ± 0.176
2.568AsnAsn: 2.568 ± 0.223
2.389AsnPro: 2.389 ± 0.21
1.433AsnGln: 1.433 ± 0.174
2.469AsnArg: 2.469 ± 0.252
2.648AsnSer: 2.648 ± 0.245
2.787AsnThr: 2.787 ± 0.219
4.26AsnVal: 4.26 ± 0.318
0.478AsnTrp: 0.478 ± 0.111
2.13AsnTyr: 2.13 ± 0.196
0.0AsnXaa: 0.0 ± 0.0
Pro
2.21ProAla: 2.21 ± 0.197
0.418ProCys: 0.418 ± 0.096
2.608ProAsp: 2.608 ± 0.244
3.126ProGlu: 3.126 ± 0.238
1.672ProPhe: 1.672 ± 0.204
0.956ProGly: 0.956 ± 0.151
0.796ProHis: 0.796 ± 0.126
2.17ProIle: 2.17 ± 0.23
2.548ProLys: 2.548 ± 0.256
1.911ProLeu: 1.911 ± 0.197
0.737ProMet: 0.737 ± 0.129
1.374ProAsn: 1.374 ± 0.138
0.796ProPro: 0.796 ± 0.15
1.015ProGln: 1.015 ± 0.119
1.453ProArg: 1.453 ± 0.171
1.991ProSer: 1.991 ± 0.155
2.15ProThr: 2.15 ± 0.219
2.767ProVal: 2.767 ± 0.259
0.458ProTrp: 0.458 ± 0.099
1.374ProTyr: 1.374 ± 0.161
0.0ProXaa: 0.0 ± 0.0
Gln
1.732GlnAla: 1.732 ± 0.189
0.398GlnCys: 0.398 ± 0.087
1.453GlnAsp: 1.453 ± 0.176
2.449GlnGlu: 2.449 ± 0.219
1.573GlnPhe: 1.573 ± 0.178
1.593GlnGly: 1.593 ± 0.197
0.657GlnHis: 0.657 ± 0.107
2.548GlnIle: 2.548 ± 0.229
2.349GlnLys: 2.349 ± 0.243
2.548GlnLeu: 2.548 ± 0.226
0.876GlnMet: 0.876 ± 0.13
1.354GlnAsn: 1.354 ± 0.17
1.135GlnPro: 1.135 ± 0.147
0.757GlnGln: 0.757 ± 0.134
1.712GlnArg: 1.712 ± 0.184
1.672GlnSer: 1.672 ± 0.164
1.772GlnThr: 1.772 ± 0.183
2.011GlnVal: 2.011 ± 0.164
0.597GlnTrp: 0.597 ± 0.101
1.433GlnTyr: 1.433 ± 0.203
0.0GlnXaa: 0.0 ± 0.0
Arg
2.548ArgAla: 2.548 ± 0.198
0.657ArgCys: 0.657 ± 0.129
3.424ArgAsp: 3.424 ± 0.252
3.882ArgGlu: 3.882 ± 0.304
2.588ArgPhe: 2.588 ± 0.251
3.185ArgGly: 3.185 ± 0.263
0.936ArgHis: 0.936 ± 0.142
3.285ArgIle: 3.285 ± 0.229
4.161ArgLys: 4.161 ± 0.315
3.464ArgLeu: 3.464 ± 0.258
1.175ArgMet: 1.175 ± 0.171
2.469ArgAsn: 2.469 ± 0.185
1.712ArgPro: 1.712 ± 0.199
1.433ArgGln: 1.433 ± 0.167
2.668ArgArg: 2.668 ± 0.218
2.489ArgSer: 2.489 ± 0.205
2.15ArgThr: 2.15 ± 0.213
3.265ArgVal: 3.265 ± 0.267
0.796ArgTrp: 0.796 ± 0.15
2.15ArgTyr: 2.15 ± 0.198
0.0ArgXaa: 0.0 ± 0.0
Ser
3.663SerAla: 3.663 ± 0.288
0.776SerCys: 0.776 ± 0.129
3.584SerAsp: 3.584 ± 0.238
4.141SerGlu: 4.141 ± 0.282
2.887SerPhe: 2.887 ± 0.199
4.101SerGly: 4.101 ± 0.335
0.896SerHis: 0.896 ± 0.128
3.803SerIle: 3.803 ± 0.324
4.101SerLys: 4.101 ± 0.296
4.081SerLeu: 4.081 ± 0.328
1.891SerMet: 1.891 ± 0.18
3.185SerAsn: 3.185 ± 0.237
1.633SerPro: 1.633 ± 0.197
1.374SerGln: 1.374 ± 0.189
2.947SerArg: 2.947 ± 0.215
2.966SerSer: 2.966 ± 0.264
2.927SerThr: 2.927 ± 0.251
4.061SerVal: 4.061 ± 0.283
0.816SerTrp: 0.816 ± 0.121
1.752SerTyr: 1.752 ± 0.179
0.0SerXaa: 0.0 ± 0.0
Thr
4.479ThrAla: 4.479 ± 0.346
0.757ThrCys: 0.757 ± 0.132
3.424ThrAsp: 3.424 ± 0.275
3.942ThrGlu: 3.942 ± 0.309
2.449ThrPhe: 2.449 ± 0.233
3.922ThrGly: 3.922 ± 0.359
1.513ThrHis: 1.513 ± 0.182
3.783ThrIle: 3.783 ± 0.262
4.698ThrLys: 4.698 ± 0.273
4.499ThrLeu: 4.499 ± 0.34
1.473ThrMet: 1.473 ± 0.187
2.827ThrAsn: 2.827 ± 0.217
2.927ThrPro: 2.927 ± 0.252
1.911ThrGln: 1.911 ± 0.264
2.429ThrArg: 2.429 ± 0.22
2.509ThrSer: 2.509 ± 0.233
3.185ThrThr: 3.185 ± 0.327
4.3ThrVal: 4.3 ± 0.324
0.737ThrTrp: 0.737 ± 0.121
2.389ThrTyr: 2.389 ± 0.213
0.0ThrXaa: 0.0 ± 0.0
Val
4.121ValAla: 4.121 ± 0.315
0.577ValCys: 0.577 ± 0.123
5.316ValAsp: 5.316 ± 0.319
5.893ValGlu: 5.893 ± 0.39
3.026ValPhe: 3.026 ± 0.29
4.061ValGly: 4.061 ± 0.313
1.135ValHis: 1.135 ± 0.158
4.659ValIle: 4.659 ± 0.309
5.774ValLys: 5.774 ± 0.305
4.36ValLeu: 4.36 ± 0.294
1.991ValMet: 1.991 ± 0.181
3.783ValAsn: 3.783 ± 0.294
1.852ValPro: 1.852 ± 0.196
1.891ValGln: 1.891 ± 0.187
2.747ValArg: 2.747 ± 0.243
4.121ValSer: 4.121 ± 0.296
4.26ValThr: 4.26 ± 0.331
4.917ValVal: 4.917 ± 0.336
1.015ValTrp: 1.015 ± 0.128
3.564ValTyr: 3.564 ± 0.308
0.0ValXaa: 0.0 ± 0.0
Trp
0.717TrpAla: 0.717 ± 0.112
0.279TrpCys: 0.279 ± 0.066
0.856TrpAsp: 0.856 ± 0.122
1.175TrpGlu: 1.175 ± 0.127
0.597TrpPhe: 0.597 ± 0.11
0.836TrpGly: 0.836 ± 0.144
0.319TrpHis: 0.319 ± 0.082
0.956TrpIle: 0.956 ± 0.15
1.015TrpLys: 1.015 ± 0.146
0.956TrpLeu: 0.956 ± 0.15
0.538TrpMet: 0.538 ± 0.101
0.617TrpAsn: 0.617 ± 0.112
0.179TrpPro: 0.179 ± 0.052
0.319TrpGln: 0.319 ± 0.08
0.657TrpArg: 0.657 ± 0.121
0.737TrpSer: 0.737 ± 0.114
0.916TrpThr: 0.916 ± 0.116
1.135TrpVal: 1.135 ± 0.18
0.219TrpTrp: 0.219 ± 0.063
0.637TrpTyr: 0.637 ± 0.095
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.146TyrAla: 3.146 ± 0.24
0.657TyrCys: 0.657 ± 0.105
2.927TyrAsp: 2.927 ± 0.246
2.688TyrGlu: 2.688 ± 0.254
1.931TyrPhe: 1.931 ± 0.181
2.947TyrGly: 2.947 ± 0.232
0.896TyrHis: 0.896 ± 0.133
3.205TyrIle: 3.205 ± 0.264
2.907TyrLys: 2.907 ± 0.207
2.907TyrLeu: 2.907 ± 0.272
1.075TyrMet: 1.075 ± 0.157
2.528TyrAsn: 2.528 ± 0.22
1.354TyrPro: 1.354 ± 0.178
1.254TyrGln: 1.254 ± 0.146
1.832TyrArg: 1.832 ± 0.256
2.23TyrSer: 2.23 ± 0.219
2.708TyrThr: 2.708 ± 0.202
2.648TyrVal: 2.648 ± 0.226
0.597TyrTrp: 0.597 ± 0.118
2.011TyrTyr: 2.011 ± 0.238
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 262 proteins (50230 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski