Amino acid dipepetide frequency for Bacillus phage vB_BveM-Goe7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.167AlaAla: 5.167 ± 0.494
0.349AlaCys: 0.349 ± 0.089
4.033AlaAsp: 4.033 ± 0.275
4.797AlaGlu: 4.797 ± 0.328
2.071AlaPhe: 2.071 ± 0.235
3.924AlaGly: 3.924 ± 0.46
0.981AlaHis: 0.981 ± 0.174
3.772AlaIle: 3.772 ± 0.28
4.404AlaLys: 4.404 ± 0.379
5.56AlaLeu: 5.56 ± 0.379
1.526AlaMet: 1.526 ± 0.178
2.943AlaAsn: 2.943 ± 0.268
2.071AlaPro: 2.071 ± 0.187
2.66AlaGln: 2.66 ± 0.241
2.878AlaArg: 2.878 ± 0.24
3.815AlaSer: 3.815 ± 0.321
3.685AlaThr: 3.685 ± 0.285
4.055AlaVal: 4.055 ± 0.307
0.458AlaTrp: 0.458 ± 0.097
2.769AlaTyr: 2.769 ± 0.244
0.0AlaXaa: 0.0 ± 0.0
Cys
0.283CysAla: 0.283 ± 0.074
0.109CysCys: 0.109 ± 0.051
0.523CysAsp: 0.523 ± 0.117
0.567CysGlu: 0.567 ± 0.101
0.414CysPhe: 0.414 ± 0.094
0.741CysGly: 0.741 ± 0.196
0.196CysHis: 0.196 ± 0.068
0.48CysIle: 0.48 ± 0.124
0.698CysLys: 0.698 ± 0.134
0.48CysLeu: 0.48 ± 0.104
0.262CysMet: 0.262 ± 0.071
0.392CysAsn: 0.392 ± 0.098
0.349CysPro: 0.349 ± 0.088
0.196CysGln: 0.196 ± 0.056
0.305CysArg: 0.305 ± 0.088
0.414CysSer: 0.414 ± 0.089
0.676CysThr: 0.676 ± 0.126
0.501CysVal: 0.501 ± 0.103
0.087CysTrp: 0.087 ± 0.047
0.305CysTyr: 0.305 ± 0.09
0.0CysXaa: 0.0 ± 0.0
Asp
3.772AspAla: 3.772 ± 0.325
0.61AspCys: 0.61 ± 0.154
3.445AspAsp: 3.445 ± 0.309
4.818AspGlu: 4.818 ± 0.356
3.336AspPhe: 3.336 ± 0.26
4.121AspGly: 4.121 ± 0.428
1.112AspHis: 1.112 ± 0.165
4.731AspIle: 4.731 ± 0.33
4.84AspLys: 4.84 ± 0.322
6.105AspLeu: 6.105 ± 0.351
2.093AspMet: 2.093 ± 0.235
3.488AspAsn: 3.488 ± 0.263
2.115AspPro: 2.115 ± 0.241
2.028AspGln: 2.028 ± 0.195
3.379AspArg: 3.379 ± 0.263
3.728AspSer: 3.728 ± 0.345
4.317AspThr: 4.317 ± 0.33
4.644AspVal: 4.644 ± 0.384
0.763AspTrp: 0.763 ± 0.12
2.943AspTyr: 2.943 ± 0.256
0.0AspXaa: 0.0 ± 0.0
Glu
4.927GluAla: 4.927 ± 0.302
0.501GluCys: 0.501 ± 0.117
5.909GluAsp: 5.909 ± 0.365
9.593GluGlu: 9.593 ± 0.622
3.031GluPhe: 3.031 ± 0.316
5.909GluGly: 5.909 ± 0.346
1.417GluHis: 1.417 ± 0.202
4.993GluIle: 4.993 ± 0.372
5.647GluLys: 5.647 ± 0.444
7.544GluLeu: 7.544 ± 0.449
2.464GluMet: 2.464 ± 0.24
3.837GluAsn: 3.837 ± 0.324
1.788GluPro: 1.788 ± 0.258
3.096GluGln: 3.096 ± 0.251
3.161GluArg: 3.161 ± 0.297
4.099GluSer: 4.099 ± 0.331
3.379GluThr: 3.379 ± 0.261
6.061GluVal: 6.061 ± 0.39
1.047GluTrp: 1.047 ± 0.18
3.576GluTyr: 3.576 ± 0.303
0.0GluXaa: 0.0 ± 0.0
Phe
2.093PheAla: 2.093 ± 0.256
0.414PheCys: 0.414 ± 0.105
2.028PheAsp: 2.028 ± 0.207
2.573PheGlu: 2.573 ± 0.259
1.352PhePhe: 1.352 ± 0.207
2.18PheGly: 2.18 ± 0.217
0.719PheHis: 0.719 ± 0.127
2.943PheIle: 2.943 ± 0.304
2.856PheLys: 2.856 ± 0.226
2.9PheLeu: 2.9 ± 0.258
0.894PheMet: 0.894 ± 0.12
2.289PheAsn: 2.289 ± 0.207
1.243PhePro: 1.243 ± 0.168
1.308PheGln: 1.308 ± 0.196
1.722PheArg: 1.722 ± 0.213
2.943PheSer: 2.943 ± 0.255
2.9PheThr: 2.9 ± 0.225
2.595PheVal: 2.595 ± 0.239
0.392PheTrp: 0.392 ± 0.095
1.701PheTyr: 1.701 ± 0.227
0.0PheXaa: 0.0 ± 0.0
Gly
3.815GlyAla: 3.815 ± 0.467
0.785GlyCys: 0.785 ± 0.14
4.143GlyAsp: 4.143 ± 0.402
4.949GlyGlu: 4.949 ± 0.357
2.311GlyPhe: 2.311 ± 0.204
5.342GlyGly: 5.342 ± 0.623
1.025GlyHis: 1.025 ± 0.159
4.033GlyIle: 4.033 ± 0.322
5.276GlyLys: 5.276 ± 0.387
4.709GlyLeu: 4.709 ± 0.365
1.657GlyMet: 1.657 ± 0.198
3.227GlyAsn: 3.227 ± 0.304
0.807GlyPro: 0.807 ± 0.12
1.962GlyGln: 1.962 ± 0.259
3.009GlyArg: 3.009 ± 0.222
4.23GlySer: 4.23 ± 0.383
3.924GlyThr: 3.924 ± 0.344
6.018GlyVal: 6.018 ± 0.328
0.785GlyTrp: 0.785 ± 0.138
3.423GlyTyr: 3.423 ± 0.378
0.0GlyXaa: 0.0 ± 0.0
His
0.894HisAla: 0.894 ± 0.134
0.131HisCys: 0.131 ± 0.051
0.916HisAsp: 0.916 ± 0.144
1.047HisGlu: 1.047 ± 0.179
0.567HisPhe: 0.567 ± 0.119
1.156HisGly: 1.156 ± 0.138
0.414HisHis: 0.414 ± 0.108
1.199HisIle: 1.199 ± 0.18
1.417HisLys: 1.417 ± 0.164
1.722HisLeu: 1.722 ± 0.22
0.589HisMet: 0.589 ± 0.118
0.763HisAsn: 0.763 ± 0.135
0.61HisPro: 0.61 ± 0.136
0.567HisGln: 0.567 ± 0.102
0.938HisArg: 0.938 ± 0.133
1.33HisSer: 1.33 ± 0.154
1.134HisThr: 1.134 ± 0.145
1.374HisVal: 1.374 ± 0.156
0.218HisTrp: 0.218 ± 0.084
1.112HisTyr: 1.112 ± 0.148
0.0HisXaa: 0.0 ± 0.0
Ile
4.317IleAla: 4.317 ± 0.313
0.414IleCys: 0.414 ± 0.09
4.797IleAsp: 4.797 ± 0.33
4.993IleGlu: 4.993 ± 0.357
1.766IlePhe: 1.766 ± 0.181
3.641IleGly: 3.641 ± 0.321
1.156IleHis: 1.156 ± 0.151
3.532IleIle: 3.532 ± 0.327
5.015IleLys: 5.015 ± 0.316
4.339IleLeu: 4.339 ± 0.307
1.526IleMet: 1.526 ± 0.172
3.183IleAsn: 3.183 ± 0.272
2.638IlePro: 2.638 ± 0.271
2.464IleGln: 2.464 ± 0.236
3.074IleArg: 3.074 ± 0.269
4.186IleSer: 4.186 ± 0.284
4.295IleThr: 4.295 ± 0.402
3.881IleVal: 3.881 ± 0.297
0.349IleTrp: 0.349 ± 0.095
1.853IleTyr: 1.853 ± 0.182
0.0IleXaa: 0.0 ± 0.0
Lys
4.797LysAla: 4.797 ± 0.346
0.392LysCys: 0.392 ± 0.116
4.818LysAsp: 4.818 ± 0.399
7.064LysGlu: 7.064 ± 0.501
2.638LysPhe: 2.638 ± 0.264
4.993LysGly: 4.993 ± 0.349
1.81LysHis: 1.81 ± 0.204
3.619LysIle: 3.619 ± 0.278
5.974LysLys: 5.974 ± 0.512
6.41LysLeu: 6.41 ± 0.414
2.18LysMet: 2.18 ± 0.216
3.815LysAsn: 3.815 ± 0.315
2.267LysPro: 2.267 ± 0.236
2.42LysGln: 2.42 ± 0.219
3.379LysArg: 3.379 ± 0.29
5.124LysSer: 5.124 ± 0.449
3.946LysThr: 3.946 ± 0.288
5.342LysVal: 5.342 ± 0.407
0.829LysTrp: 0.829 ± 0.137
2.856LysTyr: 2.856 ± 0.255
0.0LysXaa: 0.0 ± 0.0
Leu
4.971LeuAla: 4.971 ± 0.353
1.025LeuCys: 1.025 ± 0.165
6.105LeuAsp: 6.105 ± 0.397
7.042LeuGlu: 7.042 ± 0.435
3.27LeuPhe: 3.27 ± 0.282
5.08LeuGly: 5.08 ± 0.312
1.766LeuHis: 1.766 ± 0.22
4.404LeuIle: 4.404 ± 0.347
6.672LeuLys: 6.672 ± 0.32
6.497LeuLeu: 6.497 ± 0.433
1.919LeuMet: 1.919 ± 0.22
4.688LeuAsn: 4.688 ± 0.31
2.769LeuPro: 2.769 ± 0.295
3.27LeuGln: 3.27 ± 0.283
4.295LeuArg: 4.295 ± 0.36
5.756LeuSer: 5.756 ± 0.389
5.429LeuThr: 5.429 ± 0.335
5.036LeuVal: 5.036 ± 0.423
0.763LeuTrp: 0.763 ± 0.159
3.51LeuTyr: 3.51 ± 0.233
0.0LeuXaa: 0.0 ± 0.0
Met
1.679MetAla: 1.679 ± 0.197
0.174MetCys: 0.174 ± 0.062
2.115MetAsp: 2.115 ± 0.197
2.071MetGlu: 2.071 ± 0.215
1.003MetPhe: 1.003 ± 0.142
1.352MetGly: 1.352 ± 0.156
0.305MetHis: 0.305 ± 0.077
1.221MetIle: 1.221 ± 0.165
2.529MetLys: 2.529 ± 0.261
2.071MetLeu: 2.071 ± 0.21
0.458MetMet: 0.458 ± 0.085
1.417MetAsn: 1.417 ± 0.164
0.676MetPro: 0.676 ± 0.138
0.719MetGln: 0.719 ± 0.146
1.156MetArg: 1.156 ± 0.157
2.289MetSer: 2.289 ± 0.225
1.875MetThr: 1.875 ± 0.156
1.199MetVal: 1.199 ± 0.162
0.196MetTrp: 0.196 ± 0.062
0.981MetTyr: 0.981 ± 0.144
0.0MetXaa: 0.0 ± 0.0
Asn
2.769AsnAla: 2.769 ± 0.246
0.349AsnCys: 0.349 ± 0.092
2.9AsnAsp: 2.9 ± 0.243
3.096AsnGlu: 3.096 ± 0.263
1.701AsnPhe: 1.701 ± 0.189
3.706AsnGly: 3.706 ± 0.414
0.981AsnHis: 0.981 ± 0.191
3.663AsnIle: 3.663 ± 0.248
3.772AsnLys: 3.772 ± 0.274
3.99AsnLeu: 3.99 ± 0.239
1.265AsnMet: 1.265 ± 0.169
2.965AsnAsn: 2.965 ± 0.254
2.551AsnPro: 2.551 ± 0.248
1.504AsnGln: 1.504 ± 0.184
2.289AsnArg: 2.289 ± 0.205
3.401AsnSer: 3.401 ± 0.316
3.227AsnThr: 3.227 ± 0.271
3.445AsnVal: 3.445 ± 0.276
0.523AsnTrp: 0.523 ± 0.096
1.984AsnTyr: 1.984 ± 0.207
0.0AsnXaa: 0.0 ± 0.0
Pro
2.224ProAla: 2.224 ± 0.265
0.218ProCys: 0.218 ± 0.068
2.682ProAsp: 2.682 ± 0.239
3.597ProGlu: 3.597 ± 0.433
1.243ProPhe: 1.243 ± 0.156
0.85ProGly: 0.85 ± 0.132
0.632ProHis: 0.632 ± 0.144
1.984ProIle: 1.984 ± 0.238
2.049ProLys: 2.049 ± 0.244
2.9ProLeu: 2.9 ± 0.267
0.654ProMet: 0.654 ± 0.131
1.722ProAsn: 1.722 ± 0.231
0.807ProPro: 0.807 ± 0.147
1.134ProGln: 1.134 ± 0.253
1.221ProArg: 1.221 ± 0.137
2.028ProSer: 2.028 ± 0.216
2.224ProThr: 2.224 ± 0.246
1.875ProVal: 1.875 ± 0.199
0.24ProTrp: 0.24 ± 0.071
1.417ProTyr: 1.417 ± 0.163
0.0ProXaa: 0.0 ± 0.0
Gln
2.376GlnAla: 2.376 ± 0.186
0.196GlnCys: 0.196 ± 0.062
2.049GlnAsp: 2.049 ± 0.216
3.074GlnGlu: 3.074 ± 0.267
1.395GlnPhe: 1.395 ± 0.163
2.071GlnGly: 2.071 ± 0.227
0.589GlnHis: 0.589 ± 0.114
1.94GlnIle: 1.94 ± 0.21
2.638GlnLys: 2.638 ± 0.22
3.663GlnLeu: 3.663 ± 0.292
1.112GlnMet: 1.112 ± 0.19
1.548GlnAsn: 1.548 ± 0.19
1.177GlnPro: 1.177 ± 0.225
2.049GlnGln: 2.049 ± 0.467
1.831GlnArg: 1.831 ± 0.191
2.333GlnSer: 2.333 ± 0.291
1.483GlnThr: 1.483 ± 0.199
2.638GlnVal: 2.638 ± 0.235
0.196GlnTrp: 0.196 ± 0.065
1.548GlnTyr: 1.548 ± 0.177
0.0GlnXaa: 0.0 ± 0.0
Arg
2.965ArgAla: 2.965 ± 0.281
0.392ArgCys: 0.392 ± 0.097
2.943ArgAsp: 2.943 ± 0.252
4.252ArgGlu: 4.252 ± 0.292
1.853ArgPhe: 1.853 ± 0.217
3.314ArgGly: 3.314 ± 0.277
0.676ArgHis: 0.676 ± 0.11
3.161ArgIle: 3.161 ± 0.249
3.249ArgLys: 3.249 ± 0.282
4.077ArgLeu: 4.077 ± 0.31
1.635ArgMet: 1.635 ± 0.186
1.897ArgAsn: 1.897 ± 0.226
1.09ArgPro: 1.09 ± 0.166
1.744ArgGln: 1.744 ± 0.186
2.028ArgArg: 2.028 ± 0.189
2.66ArgSer: 2.66 ± 0.307
2.355ArgThr: 2.355 ± 0.279
3.815ArgVal: 3.815 ± 0.312
0.436ArgTrp: 0.436 ± 0.091
1.919ArgTyr: 1.919 ± 0.214
0.0ArgXaa: 0.0 ± 0.0
Ser
3.597SerAla: 3.597 ± 0.266
0.48SerCys: 0.48 ± 0.106
3.881SerAsp: 3.881 ± 0.329
4.535SerGlu: 4.535 ± 0.398
3.249SerPhe: 3.249 ± 0.254
4.775SerGly: 4.775 ± 0.431
0.938SerHis: 0.938 ± 0.158
4.273SerIle: 4.273 ± 0.319
4.731SerLys: 4.731 ± 0.384
5.778SerLeu: 5.778 ± 0.339
1.57SerMet: 1.57 ± 0.19
2.813SerAsn: 2.813 ± 0.308
2.137SerPro: 2.137 ± 0.249
2.006SerGln: 2.006 ± 0.196
2.813SerArg: 2.813 ± 0.217
5.254SerSer: 5.254 ± 0.329
4.033SerThr: 4.033 ± 0.384
4.775SerVal: 4.775 ± 0.315
0.719SerTrp: 0.719 ± 0.127
2.791SerTyr: 2.791 ± 0.236
0.0SerXaa: 0.0 ± 0.0
Thr
3.75ThrAla: 3.75 ± 0.341
0.327ThrCys: 0.327 ± 0.114
3.837ThrAsp: 3.837 ± 0.323
4.317ThrGlu: 4.317 ± 0.311
2.246ThrPhe: 2.246 ± 0.261
4.404ThrGly: 4.404 ± 0.398
1.243ThrHis: 1.243 ± 0.165
3.641ThrIle: 3.641 ± 0.346
4.143ThrLys: 4.143 ± 0.267
5.516ThrLeu: 5.516 ± 0.328
1.068ThrMet: 1.068 ± 0.156
2.704ThrAsn: 2.704 ± 0.292
2.878ThrPro: 2.878 ± 0.291
2.289ThrGln: 2.289 ± 0.209
2.9ThrArg: 2.9 ± 0.282
3.576ThrSer: 3.576 ± 0.354
3.663ThrThr: 3.663 ± 0.32
4.949ThrVal: 4.949 ± 0.338
0.632ThrTrp: 0.632 ± 0.103
2.813ThrTyr: 2.813 ± 0.253
0.0ThrXaa: 0.0 ± 0.0
Val
4.186ValAla: 4.186 ± 0.317
0.676ValCys: 0.676 ± 0.142
5.472ValAsp: 5.472 ± 0.34
5.124ValGlu: 5.124 ± 0.341
2.965ValPhe: 2.965 ± 0.255
4.252ValGly: 4.252 ± 0.278
1.33ValHis: 1.33 ± 0.16
4.099ValIle: 4.099 ± 0.319
5.254ValLys: 5.254 ± 0.333
5.385ValLeu: 5.385 ± 0.373
1.57ValMet: 1.57 ± 0.198
3.75ValAsn: 3.75 ± 0.31
2.442ValPro: 2.442 ± 0.255
2.507ValGln: 2.507 ± 0.215
3.358ValArg: 3.358 ± 0.282
4.579ValSer: 4.579 ± 0.299
4.709ValThr: 4.709 ± 0.305
5.167ValVal: 5.167 ± 0.406
0.741ValTrp: 0.741 ± 0.15
3.445ValTyr: 3.445 ± 0.26
0.0ValXaa: 0.0 ± 0.0
Trp
0.523TrpAla: 0.523 ± 0.112
0.131TrpCys: 0.131 ± 0.057
0.829TrpAsp: 0.829 ± 0.111
0.894TrpGlu: 0.894 ± 0.119
0.414TrpPhe: 0.414 ± 0.087
0.763TrpGly: 0.763 ± 0.124
0.153TrpHis: 0.153 ± 0.07
0.523TrpIle: 0.523 ± 0.11
0.589TrpLys: 0.589 ± 0.108
0.872TrpLeu: 0.872 ± 0.141
0.131TrpMet: 0.131 ± 0.051
0.436TrpAsn: 0.436 ± 0.104
0.0TrpPro: 0.0 ± 0.0
0.327TrpGln: 0.327 ± 0.088
0.523TrpArg: 0.523 ± 0.106
0.61TrpSer: 0.61 ± 0.117
0.589TrpThr: 0.589 ± 0.108
1.047TrpVal: 1.047 ± 0.147
0.218TrpTrp: 0.218 ± 0.076
0.414TrpTyr: 0.414 ± 0.135
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.813TyrAla: 2.813 ± 0.25
0.305TyrCys: 0.305 ± 0.071
2.965TyrAsp: 2.965 ± 0.269
3.597TyrGlu: 3.597 ± 0.255
1.221TyrPhe: 1.221 ± 0.139
2.856TyrGly: 2.856 ± 0.274
0.654TyrHis: 0.654 ± 0.111
3.205TyrIle: 3.205 ± 0.213
2.922TyrLys: 2.922 ± 0.241
3.75TyrLeu: 3.75 ± 0.324
0.894TyrMet: 0.894 ± 0.152
2.202TyrAsn: 2.202 ± 0.261
1.286TyrPro: 1.286 ± 0.192
1.657TyrGln: 1.657 ± 0.18
2.202TyrArg: 2.202 ± 0.231
2.791TyrSer: 2.791 ± 0.249
3.052TyrThr: 3.052 ± 0.258
2.595TyrVal: 2.595 ± 0.253
0.392TyrTrp: 0.392 ± 0.079
1.766TyrTyr: 1.766 ± 0.218
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 240 proteins (45867 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski