Amino acid dipepetide frequency for Escherichia phage p000v

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.887AlaAla: 4.887 ± 0.353
0.383AlaCys: 0.383 ± 0.081
4.082AlaAsp: 4.082 ± 0.28
5.04AlaGlu: 5.04 ± 0.362
2.415AlaPhe: 2.415 ± 0.206
4.101AlaGly: 4.101 ± 0.318
1.188AlaHis: 1.188 ± 0.181
4.963AlaIle: 4.963 ± 0.275
5.366AlaLys: 5.366 ± 0.331
5.73AlaLeu: 5.73 ± 0.344
1.437AlaMet: 1.437 ± 0.15
3.354AlaAsn: 3.354 ± 0.269
2.491AlaPro: 2.491 ± 0.222
2.357AlaGln: 2.357 ± 0.232
2.894AlaArg: 2.894 ± 0.24
4.772AlaSer: 4.772 ± 0.304
3.775AlaThr: 3.775 ± 0.409
4.331AlaVal: 4.331 ± 0.306
1.035AlaTrp: 1.035 ± 0.142
2.568AlaTyr: 2.568 ± 0.193
0.0AlaXaa: 0.0 ± 0.0
Cys
0.671CysAla: 0.671 ± 0.124
0.192CysCys: 0.192 ± 0.058
0.613CysAsp: 0.613 ± 0.104
0.747CysGlu: 0.747 ± 0.125
0.383CysPhe: 0.383 ± 0.087
0.747CysGly: 0.747 ± 0.13
0.268CysHis: 0.268 ± 0.079
0.632CysIle: 0.632 ± 0.105
0.767CysLys: 0.767 ± 0.132
0.613CysLeu: 0.613 ± 0.114
0.268CysMet: 0.268 ± 0.074
0.594CysAsn: 0.594 ± 0.099
0.594CysPro: 0.594 ± 0.109
0.345CysGln: 0.345 ± 0.081
0.671CysArg: 0.671 ± 0.115
0.747CysSer: 0.747 ± 0.138
0.441CysThr: 0.441 ± 0.082
0.537CysVal: 0.537 ± 0.112
0.172CysTrp: 0.172 ± 0.057
0.441CysTyr: 0.441 ± 0.095
0.0CysXaa: 0.0 ± 0.0
Asp
4.618AspAla: 4.618 ± 0.3
0.575AspCys: 0.575 ± 0.103
4.005AspAsp: 4.005 ± 0.283
4.714AspGlu: 4.714 ± 0.393
3.296AspPhe: 3.296 ± 0.236
5.193AspGly: 5.193 ± 0.321
0.862AspHis: 0.862 ± 0.138
4.887AspIle: 4.887 ± 0.25
4.082AspLys: 4.082 ± 0.307
4.733AspLeu: 4.733 ± 0.297
1.629AspMet: 1.629 ± 0.163
2.683AspAsn: 2.683 ± 0.211
2.146AspPro: 2.146 ± 0.241
1.552AspGln: 1.552 ± 0.18
2.127AspArg: 2.127 ± 0.21
3.871AspSer: 3.871 ± 0.269
3.392AspThr: 3.392 ± 0.295
4.714AspVal: 4.714 ± 0.308
1.246AspTrp: 1.246 ± 0.192
3.2AspTyr: 3.2 ± 0.281
0.0AspXaa: 0.0 ± 0.0
Glu
5.213GluAla: 5.213 ± 0.357
1.016GluCys: 1.016 ± 0.156
4.465GluAsp: 4.465 ± 0.292
5.04GluGlu: 5.04 ± 0.353
3.718GluPhe: 3.718 ± 0.304
3.622GluGly: 3.622 ± 0.214
1.246GluHis: 1.246 ± 0.161
6.267GluIle: 6.267 ± 0.393
5.059GluLys: 5.059 ± 0.369
6.382GluLeu: 6.382 ± 0.45
2.261GluMet: 2.261 ± 0.2
3.718GluAsn: 3.718 ± 0.251
1.859GluPro: 1.859 ± 0.219
2.472GluGln: 2.472 ± 0.224
2.97GluArg: 2.97 ± 0.262
4.044GluSer: 4.044 ± 0.307
4.293GluThr: 4.293 ± 0.301
5.078GluVal: 5.078 ± 0.334
0.862GluTrp: 0.862 ± 0.133
3.564GluTyr: 3.564 ± 0.267
0.0GluXaa: 0.0 ± 0.0
Phe
2.76PheAla: 2.76 ± 0.254
0.46PheCys: 0.46 ± 0.091
3.334PheAsp: 3.334 ± 0.267
3.699PheGlu: 3.699 ± 0.308
1.399PhePhe: 1.399 ± 0.166
2.491PheGly: 2.491 ± 0.217
0.594PheHis: 0.594 ± 0.121
2.932PheIle: 2.932 ± 0.244
4.139PheLys: 4.139 ± 0.271
2.453PheLeu: 2.453 ± 0.182
1.284PheMet: 1.284 ± 0.15
2.932PheAsn: 2.932 ± 0.209
1.131PhePro: 1.131 ± 0.15
1.322PheGln: 1.322 ± 0.175
1.897PheArg: 1.897 ± 0.215
2.951PheSer: 2.951 ± 0.216
2.453PheThr: 2.453 ± 0.236
2.932PheVal: 2.932 ± 0.242
0.575PheTrp: 0.575 ± 0.104
1.916PheTyr: 1.916 ± 0.192
0.0PheXaa: 0.0 ± 0.0
Gly
3.162GlyAla: 3.162 ± 0.309
0.402GlyCys: 0.402 ± 0.093
3.794GlyAsp: 3.794 ± 0.295
3.833GlyGlu: 3.833 ± 0.261
2.683GlyPhe: 2.683 ± 0.224
3.948GlyGly: 3.948 ± 0.366
0.92GlyHis: 0.92 ± 0.151
4.484GlyIle: 4.484 ± 0.295
4.369GlyLys: 4.369 ± 0.28
5.404GlyLeu: 5.404 ± 0.359
1.61GlyMet: 1.61 ± 0.215
3.315GlyAsn: 3.315 ± 0.464
1.916GlyPro: 1.916 ± 0.193
2.319GlyGln: 2.319 ± 0.262
2.855GlyArg: 2.855 ± 0.232
4.254GlySer: 4.254 ± 0.345
4.446GlyThr: 4.446 ± 0.439
3.89GlyVal: 3.89 ± 0.263
0.958GlyTrp: 0.958 ± 0.122
3.105GlyTyr: 3.105 ± 0.213
0.0GlyXaa: 0.0 ± 0.0
His
0.958HisAla: 0.958 ± 0.118
0.326HisCys: 0.326 ± 0.086
1.111HisAsp: 1.111 ± 0.14
1.131HisGlu: 1.131 ± 0.147
0.939HisPhe: 0.939 ± 0.135
1.035HisGly: 1.035 ± 0.183
0.364HisHis: 0.364 ± 0.089
1.341HisIle: 1.341 ± 0.14
1.188HisLys: 1.188 ± 0.15
1.284HisLeu: 1.284 ± 0.16
0.441HisMet: 0.441 ± 0.105
0.709HisAsn: 0.709 ± 0.113
0.901HisPro: 0.901 ± 0.11
0.517HisGln: 0.517 ± 0.081
0.843HisArg: 0.843 ± 0.124
1.15HisSer: 1.15 ± 0.174
0.977HisThr: 0.977 ± 0.199
1.035HisVal: 1.035 ± 0.14
0.192HisTrp: 0.192 ± 0.068
0.652HisTyr: 0.652 ± 0.096
0.0HisXaa: 0.0 ± 0.0
Ile
4.906IleAla: 4.906 ± 0.356
0.709IleCys: 0.709 ± 0.115
5.078IleAsp: 5.078 ± 0.378
5.308IleGlu: 5.308 ± 0.34
2.51IlePhe: 2.51 ± 0.218
4.178IleGly: 4.178 ± 0.306
1.092IleHis: 1.092 ± 0.146
4.963IleIle: 4.963 ± 0.341
6.976IleLys: 6.976 ± 0.362
4.254IleLeu: 4.254 ± 0.3
1.686IleMet: 1.686 ± 0.182
4.983IleAsn: 4.983 ± 0.296
2.779IlePro: 2.779 ± 0.196
2.472IleGln: 2.472 ± 0.232
3.105IleArg: 3.105 ± 0.264
4.58IleSer: 4.58 ± 0.282
4.963IleThr: 4.963 ± 0.32
4.427IleVal: 4.427 ± 0.321
0.556IleTrp: 0.556 ± 0.107
2.664IleTyr: 2.664 ± 0.239
0.0IleXaa: 0.0 ± 0.0
Lys
6.19LysAla: 6.19 ± 0.42
0.747LysCys: 0.747 ± 0.125
4.446LysAsp: 4.446 ± 0.251
6.094LysGlu: 6.094 ± 0.532
3.507LysPhe: 3.507 ± 0.223
4.638LysGly: 4.638 ± 0.276
1.514LysHis: 1.514 ± 0.173
5.289LysIle: 5.289 ± 0.372
4.733LysLys: 4.733 ± 0.41
6.458LysLeu: 6.458 ± 0.37
2.28LysMet: 2.28 ± 0.19
4.159LysAsn: 4.159 ± 0.269
2.3LysPro: 2.3 ± 0.208
2.338LysGln: 2.338 ± 0.235
3.124LysArg: 3.124 ± 0.271
4.408LysSer: 4.408 ± 0.3
4.465LysThr: 4.465 ± 0.284
4.81LysVal: 4.81 ± 0.346
1.265LysTrp: 1.265 ± 0.156
3.449LysTyr: 3.449 ± 0.235
0.0LysXaa: 0.0 ± 0.0
Leu
5.385LeuAla: 5.385 ± 0.322
0.92LeuCys: 0.92 ± 0.142
5.155LeuAsp: 5.155 ± 0.37
5.155LeuGlu: 5.155 ± 0.383
3.296LeuPhe: 3.296 ± 0.289
4.12LeuGly: 4.12 ± 0.274
1.169LeuHis: 1.169 ± 0.153
4.599LeuIle: 4.599 ± 0.325
5.845LeuLys: 5.845 ± 0.315
5.002LeuLeu: 5.002 ± 0.392
2.146LeuMet: 2.146 ± 0.222
4.887LeuAsn: 4.887 ± 0.286
3.009LeuPro: 3.009 ± 0.248
2.721LeuGln: 2.721 ± 0.224
3.641LeuArg: 3.641 ± 0.262
4.254LeuSer: 4.254 ± 0.281
4.542LeuThr: 4.542 ± 0.36
4.484LeuVal: 4.484 ± 0.278
0.652LeuTrp: 0.652 ± 0.11
3.296LeuTyr: 3.296 ± 0.249
0.0LeuXaa: 0.0 ± 0.0
Met
2.28MetAla: 2.28 ± 0.179
0.364MetCys: 0.364 ± 0.079
1.322MetAsp: 1.322 ± 0.134
1.801MetGlu: 1.801 ± 0.197
1.303MetPhe: 1.303 ± 0.159
1.399MetGly: 1.399 ± 0.145
0.287MetHis: 0.287 ± 0.074
1.686MetIle: 1.686 ± 0.182
2.357MetLys: 2.357 ± 0.223
2.146MetLeu: 2.146 ± 0.185
0.882MetMet: 0.882 ± 0.135
1.629MetAsn: 1.629 ± 0.177
0.786MetPro: 0.786 ± 0.128
0.786MetGln: 0.786 ± 0.109
1.131MetArg: 1.131 ± 0.13
1.955MetSer: 1.955 ± 0.208
1.725MetThr: 1.725 ± 0.203
1.437MetVal: 1.437 ± 0.175
0.192MetTrp: 0.192 ± 0.058
0.958MetTyr: 0.958 ± 0.124
0.0MetXaa: 0.0 ± 0.0
Asn
3.603AsnAla: 3.603 ± 0.301
0.441AsnCys: 0.441 ± 0.104
3.066AsnAsp: 3.066 ± 0.258
4.005AsnGlu: 4.005 ± 0.243
2.683AsnPhe: 2.683 ± 0.208
3.89AsnGly: 3.89 ± 0.307
1.054AsnHis: 1.054 ± 0.142
3.871AsnIle: 3.871 ± 0.318
4.139AsnLys: 4.139 ± 0.323
4.024AsnLeu: 4.024 ± 0.266
1.667AsnMet: 1.667 ± 0.184
3.239AsnAsn: 3.239 ± 0.272
2.491AsnPro: 2.491 ± 0.23
1.533AsnGln: 1.533 ± 0.173
2.204AsnArg: 2.204 ± 0.18
3.507AsnSer: 3.507 ± 0.297
3.066AsnThr: 3.066 ± 0.286
3.488AsnVal: 3.488 ± 0.285
0.613AsnTrp: 0.613 ± 0.102
2.089AsnTyr: 2.089 ± 0.199
0.0AsnXaa: 0.0 ± 0.0
Pro
2.127ProAla: 2.127 ± 0.199
0.402ProCys: 0.402 ± 0.082
2.319ProAsp: 2.319 ± 0.177
3.22ProGlu: 3.22 ± 0.284
1.763ProPhe: 1.763 ± 0.198
2.28ProGly: 2.28 ± 0.219
0.537ProHis: 0.537 ± 0.098
2.031ProIle: 2.031 ± 0.205
2.549ProLys: 2.549 ± 0.254
2.357ProLeu: 2.357 ± 0.239
0.69ProMet: 0.69 ± 0.103
1.974ProAsn: 1.974 ± 0.196
1.016ProPro: 1.016 ± 0.138
1.035ProGln: 1.035 ± 0.142
1.38ProArg: 1.38 ± 0.192
2.28ProSer: 2.28 ± 0.217
2.357ProThr: 2.357 ± 0.256
2.779ProVal: 2.779 ± 0.229
0.786ProTrp: 0.786 ± 0.131
1.361ProTyr: 1.361 ± 0.155
0.0ProXaa: 0.0 ± 0.0
Gln
2.242GlnAla: 2.242 ± 0.216
0.307GlnCys: 0.307 ± 0.08
1.725GlnAsp: 1.725 ± 0.205
2.434GlnGlu: 2.434 ± 0.176
1.533GlnPhe: 1.533 ± 0.156
2.127GlnGly: 2.127 ± 0.232
0.652GlnHis: 0.652 ± 0.123
2.798GlnIle: 2.798 ± 0.205
2.223GlnLys: 2.223 ± 0.203
2.645GlnLeu: 2.645 ± 0.222
0.997GlnMet: 0.997 ± 0.151
1.456GlnAsn: 1.456 ± 0.168
1.073GlnPro: 1.073 ± 0.131
1.15GlnGln: 1.15 ± 0.166
1.725GlnArg: 1.725 ± 0.183
1.341GlnSer: 1.341 ± 0.158
1.936GlnThr: 1.936 ± 0.182
2.53GlnVal: 2.53 ± 0.255
0.671GlnTrp: 0.671 ± 0.111
1.591GlnTyr: 1.591 ± 0.17
0.0GlnXaa: 0.0 ± 0.0
Arg
2.702ArgAla: 2.702 ± 0.258
0.498ArgCys: 0.498 ± 0.104
2.702ArgAsp: 2.702 ± 0.288
3.315ArgGlu: 3.315 ± 0.263
2.051ArgPhe: 2.051 ± 0.187
2.645ArgGly: 2.645 ± 0.234
0.671ArgHis: 0.671 ± 0.12
3.411ArgIle: 3.411 ± 0.254
3.488ArgLys: 3.488 ± 0.288
3.334ArgLeu: 3.334 ± 0.262
1.226ArgMet: 1.226 ± 0.166
1.974ArgAsn: 1.974 ± 0.182
1.303ArgPro: 1.303 ± 0.151
1.744ArgGln: 1.744 ± 0.201
2.165ArgArg: 2.165 ± 0.204
2.836ArgSer: 2.836 ± 0.259
2.319ArgThr: 2.319 ± 0.212
2.836ArgVal: 2.836 ± 0.23
0.575ArgTrp: 0.575 ± 0.12
1.725ArgTyr: 1.725 ± 0.18
0.0ArgXaa: 0.0 ± 0.0
Ser
3.411SerAla: 3.411 ± 0.299
0.632SerCys: 0.632 ± 0.135
4.044SerAsp: 4.044 ± 0.288
4.139SerGlu: 4.139 ± 0.294
2.721SerPhe: 2.721 ± 0.19
4.312SerGly: 4.312 ± 0.408
1.246SerHis: 1.246 ± 0.169
4.81SerIle: 4.81 ± 0.267
4.638SerLys: 4.638 ± 0.275
4.676SerLeu: 4.676 ± 0.26
1.418SerMet: 1.418 ± 0.177
2.97SerAsn: 2.97 ± 0.273
2.3SerPro: 2.3 ± 0.205
2.319SerGln: 2.319 ± 0.194
2.894SerArg: 2.894 ± 0.261
4.81SerSer: 4.81 ± 0.346
4.063SerThr: 4.063 ± 0.334
4.274SerVal: 4.274 ± 0.316
0.939SerTrp: 0.939 ± 0.148
2.817SerTyr: 2.817 ± 0.211
0.0SerXaa: 0.0 ± 0.0
Thr
3.871ThrAla: 3.871 ± 0.376
0.537ThrCys: 0.537 ± 0.112
3.833ThrAsp: 3.833 ± 0.309
4.254ThrGlu: 4.254 ± 0.261
2.415ThrPhe: 2.415 ± 0.187
4.235ThrGly: 4.235 ± 0.433
1.265ThrHis: 1.265 ± 0.165
4.35ThrIle: 4.35 ± 0.327
4.274ThrLys: 4.274 ± 0.256
4.312ThrLeu: 4.312 ± 0.323
1.188ThrMet: 1.188 ± 0.138
2.76ThrAsn: 2.76 ± 0.224
3.047ThrPro: 3.047 ± 0.267
1.84ThrGln: 1.84 ± 0.267
2.625ThrArg: 2.625 ± 0.28
3.699ThrSer: 3.699 ± 0.399
3.66ThrThr: 3.66 ± 0.374
4.446ThrVal: 4.446 ± 0.331
0.882ThrTrp: 0.882 ± 0.118
2.721ThrTyr: 2.721 ± 0.227
0.0ThrXaa: 0.0 ± 0.0
Val
4.254ValAla: 4.254 ± 0.301
0.843ValCys: 0.843 ± 0.128
4.733ValAsp: 4.733 ± 0.266
5.653ValGlu: 5.653 ± 0.316
2.51ValPhe: 2.51 ± 0.205
3.526ValGly: 3.526 ± 0.324
1.131ValHis: 1.131 ± 0.142
4.561ValIle: 4.561 ± 0.33
5.596ValLys: 5.596 ± 0.348
4.676ValLeu: 4.676 ± 0.298
1.591ValMet: 1.591 ± 0.207
3.794ValAsn: 3.794 ± 0.266
2.07ValPro: 2.07 ± 0.184
2.338ValGln: 2.338 ± 0.23
3.181ValArg: 3.181 ± 0.211
4.542ValSer: 4.542 ± 0.243
3.852ValThr: 3.852 ± 0.322
4.657ValVal: 4.657 ± 0.343
0.786ValTrp: 0.786 ± 0.103
2.625ValTyr: 2.625 ± 0.213
0.0ValXaa: 0.0 ± 0.0
Trp
0.824TrpAla: 0.824 ± 0.137
0.153TrpCys: 0.153 ± 0.051
0.862TrpAsp: 0.862 ± 0.122
0.747TrpGlu: 0.747 ± 0.126
0.709TrpPhe: 0.709 ± 0.103
0.537TrpGly: 0.537 ± 0.106
0.134TrpHis: 0.134 ± 0.057
1.169TrpIle: 1.169 ± 0.133
1.361TrpLys: 1.361 ± 0.159
0.901TrpLeu: 0.901 ± 0.134
0.498TrpMet: 0.498 ± 0.096
0.997TrpAsn: 0.997 ± 0.126
0.383TrpPro: 0.383 ± 0.088
0.498TrpGln: 0.498 ± 0.095
0.402TrpArg: 0.402 ± 0.08
0.767TrpSer: 0.767 ± 0.104
0.958TrpThr: 0.958 ± 0.14
0.997TrpVal: 0.997 ± 0.123
0.172TrpTrp: 0.172 ± 0.063
0.747TrpTyr: 0.747 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.105TyrAla: 3.105 ± 0.209
0.537TyrCys: 0.537 ± 0.105
2.836TyrAsp: 2.836 ± 0.233
2.817TyrGlu: 2.817 ± 0.229
1.821TyrPhe: 1.821 ± 0.166
2.645TyrGly: 2.645 ± 0.198
0.901TyrHis: 0.901 ± 0.131
3.143TyrIle: 3.143 ± 0.252
3.085TyrLys: 3.085 ± 0.222
2.894TyrLeu: 2.894 ± 0.243
1.207TyrMet: 1.207 ± 0.143
2.549TyrAsn: 2.549 ± 0.237
1.648TyrPro: 1.648 ± 0.159
1.495TyrGln: 1.495 ± 0.181
1.686TyrArg: 1.686 ± 0.189
2.664TyrSer: 2.664 ± 0.213
2.53TyrThr: 2.53 ± 0.219
3.239TyrVal: 3.239 ± 0.197
0.69TyrTrp: 0.69 ± 0.116
1.725TyrTyr: 1.725 ± 0.205
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 263 proteins (52183 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski