Amino acid dipepetide frequency for Bacillus phage 0305phi8-36

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.205AlaAla: 3.205 ± 0.261
0.415AlaCys: 0.415 ± 0.088
4.481AlaAsp: 4.481 ± 0.321
4.525AlaGlu: 4.525 ± 0.32
2.359AlaPhe: 2.359 ± 0.221
4.896AlaGly: 4.896 ± 0.285
1.335AlaHis: 1.335 ± 0.131
4.229AlaIle: 4.229 ± 0.272
5.252AlaLys: 5.252 ± 0.319
4.807AlaLeu: 4.807 ± 0.277
2.329AlaMet: 2.329 ± 0.197
3.22AlaAsn: 3.22 ± 0.276
2.166AlaPro: 2.166 ± 0.185
2.507AlaGln: 2.507 ± 0.201
3.101AlaArg: 3.101 ± 0.233
3.709AlaSer: 3.709 ± 0.217
4.021AlaThr: 4.021 ± 0.226
4.332AlaVal: 4.332 ± 0.253
1.039AlaTrp: 1.039 ± 0.121
2.715AlaTyr: 2.715 ± 0.193
0.0AlaXaa: 0.0 ± 0.0
Cys
0.415CysAla: 0.415 ± 0.075
0.059CysCys: 0.059 ± 0.038
0.415CysAsp: 0.415 ± 0.084
0.504CysGlu: 0.504 ± 0.088
0.267CysPhe: 0.267 ± 0.063
0.668CysGly: 0.668 ± 0.14
0.104CysHis: 0.104 ± 0.047
0.341CysIle: 0.341 ± 0.069
0.593CysLys: 0.593 ± 0.1
0.43CysLeu: 0.43 ± 0.091
0.223CysMet: 0.223 ± 0.062
0.341CysAsn: 0.341 ± 0.075
0.386CysPro: 0.386 ± 0.09
0.223CysGln: 0.223 ± 0.061
0.326CysArg: 0.326 ± 0.074
0.371CysSer: 0.371 ± 0.068
0.445CysThr: 0.445 ± 0.079
0.386CysVal: 0.386 ± 0.07
0.089CysTrp: 0.089 ± 0.038
0.297CysTyr: 0.297 ± 0.072
0.0CysXaa: 0.0 ± 0.0
Asp
4.629AspAla: 4.629 ± 0.239
0.445AspCys: 0.445 ± 0.086
5.104AspAsp: 5.104 ± 0.331
6.261AspGlu: 6.261 ± 0.474
2.804AspPhe: 2.804 ± 0.199
5.223AspGly: 5.223 ± 0.317
1.039AspHis: 1.039 ± 0.127
4.54AspIle: 4.54 ± 0.239
4.763AspLys: 4.763 ± 0.265
5.282AspLeu: 5.282 ± 0.319
2.315AspMet: 2.315 ± 0.187
3.101AspAsn: 3.101 ± 0.254
2.849AspPro: 2.849 ± 0.212
1.706AspGln: 1.706 ± 0.14
2.908AspArg: 2.908 ± 0.243
2.834AspSer: 2.834 ± 0.211
4.288AspThr: 4.288 ± 0.307
4.837AspVal: 4.837 ± 0.347
0.994AspTrp: 0.994 ± 0.118
3.19AspTyr: 3.19 ± 0.206
0.0AspXaa: 0.0 ± 0.0
Glu
5.49GluAla: 5.49 ± 0.378
0.534GluCys: 0.534 ± 0.099
4.748GluAsp: 4.748 ± 0.419
7.789GluGlu: 7.789 ± 0.66
2.685GluPhe: 2.685 ± 0.2
4.54GluGly: 4.54 ± 0.353
1.424GluHis: 1.424 ± 0.142
5.104GluIle: 5.104 ± 0.308
5.757GluLys: 5.757 ± 0.34
6.766GluLeu: 6.766 ± 0.309
2.374GluMet: 2.374 ± 0.165
3.576GluAsn: 3.576 ± 0.233
2.374GluPro: 2.374 ± 0.175
3.457GluGln: 3.457 ± 0.247
3.769GluArg: 3.769 ± 0.239
4.169GluSer: 4.169 ± 0.243
3.576GluThr: 3.576 ± 0.25
5.668GluVal: 5.668 ± 0.329
0.95GluTrp: 0.95 ± 0.114
3.591GluTyr: 3.591 ± 0.213
0.0GluXaa: 0.0 ± 0.0
Phe
2.077PheAla: 2.077 ± 0.194
0.282PheCys: 0.282 ± 0.066
2.967PheAsp: 2.967 ± 0.22
2.864PheGlu: 2.864 ± 0.19
1.558PhePhe: 1.558 ± 0.201
2.018PheGly: 2.018 ± 0.226
0.994PheHis: 0.994 ± 0.13
2.344PheIle: 2.344 ± 0.153
2.641PheLys: 2.641 ± 0.228
2.685PheLeu: 2.685 ± 0.217
1.291PheMet: 1.291 ± 0.113
1.944PheAsn: 1.944 ± 0.24
1.439PhePro: 1.439 ± 0.147
1.41PheGln: 1.41 ± 0.161
2.107PheArg: 2.107 ± 0.191
2.033PheSer: 2.033 ± 0.168
2.493PheThr: 2.493 ± 0.185
2.656PheVal: 2.656 ± 0.241
0.312PheTrp: 0.312 ± 0.068
1.81PheTyr: 1.81 ± 0.211
0.0PheXaa: 0.0 ± 0.0
Gly
4.169GlyAla: 4.169 ± 0.345
0.386GlyCys: 0.386 ± 0.086
3.947GlyAsp: 3.947 ± 0.258
4.718GlyGlu: 4.718 ± 0.258
2.775GlyPhe: 2.775 ± 0.216
4.748GlyGly: 4.748 ± 0.502
1.32GlyHis: 1.32 ± 0.141
4.258GlyIle: 4.258 ± 0.308
5.623GlyLys: 5.623 ± 0.361
4.407GlyLeu: 4.407 ± 0.263
2.181GlyMet: 2.181 ± 0.198
3.027GlyAsn: 3.027 ± 0.231
1.009GlyPro: 1.009 ± 0.125
2.493GlyGln: 2.493 ± 0.213
3.591GlyArg: 3.591 ± 0.31
3.828GlySer: 3.828 ± 0.266
4.229GlyThr: 4.229 ± 0.346
4.243GlyVal: 4.243 ± 0.255
0.89GlyTrp: 0.89 ± 0.128
3.086GlyTyr: 3.086 ± 0.219
0.0GlyXaa: 0.0 ± 0.0
His
1.202HisAla: 1.202 ± 0.15
0.089HisCys: 0.089 ± 0.041
1.172HisAsp: 1.172 ± 0.143
1.38HisGlu: 1.38 ± 0.175
0.757HisPhe: 0.757 ± 0.106
1.202HisGly: 1.202 ± 0.138
0.668HisHis: 0.668 ± 0.131
1.662HisIle: 1.662 ± 0.164
1.454HisLys: 1.454 ± 0.149
1.602HisLeu: 1.602 ± 0.172
0.549HisMet: 0.549 ± 0.111
1.113HisAsn: 1.113 ± 0.133
0.964HisPro: 0.964 ± 0.111
0.697HisGln: 0.697 ± 0.109
0.786HisArg: 0.786 ± 0.11
1.395HisSer: 1.395 ± 0.156
1.261HisThr: 1.261 ± 0.151
1.469HisVal: 1.469 ± 0.126
0.297HisTrp: 0.297 ± 0.073
1.039HisTyr: 1.039 ± 0.123
0.0HisXaa: 0.0 ± 0.0
Ile
4.154IleAla: 4.154 ± 0.216
0.653IleCys: 0.653 ± 0.109
4.956IleAsp: 4.956 ± 0.281
5.267IleGlu: 5.267 ± 0.278
1.944IlePhe: 1.944 ± 0.182
4.273IleGly: 4.273 ± 0.246
1.217IleHis: 1.217 ± 0.128
3.413IleIle: 3.413 ± 0.289
4.688IleLys: 4.688 ± 0.262
4.229IleLeu: 4.229 ± 0.306
2.151IleMet: 2.151 ± 0.178
3.16IleAsn: 3.16 ± 0.236
2.567IlePro: 2.567 ± 0.246
2.344IleGln: 2.344 ± 0.185
3.145IleArg: 3.145 ± 0.21
3.383IleSer: 3.383 ± 0.234
3.724IleThr: 3.724 ± 0.276
4.407IleVal: 4.407 ± 0.306
0.46IleTrp: 0.46 ± 0.092
2.418IleTyr: 2.418 ± 0.235
0.0IleXaa: 0.0 ± 0.0
Lys
5.297LysAla: 5.297 ± 0.291
0.43LysCys: 0.43 ± 0.097
5.089LysAsp: 5.089 ± 0.282
7.033LysGlu: 7.033 ± 0.309
2.433LysPhe: 2.433 ± 0.198
4.896LysGly: 4.896 ± 0.29
1.513LysHis: 1.513 ± 0.154
3.976LysIle: 3.976 ± 0.237
6.469LysLys: 6.469 ± 0.561
5.623LysLeu: 5.623 ± 0.35
2.389LysMet: 2.389 ± 0.172
3.19LysAsn: 3.19 ± 0.282
3.368LysPro: 3.368 ± 0.236
3.546LysGln: 3.546 ± 0.253
4.303LysArg: 4.303 ± 0.296
3.516LysSer: 3.516 ± 0.311
4.51LysThr: 4.51 ± 0.25
4.926LysVal: 4.926 ± 0.286
0.816LysTrp: 0.816 ± 0.113
3.116LysTyr: 3.116 ± 0.205
0.0LysXaa: 0.0 ± 0.0
Leu
5.267LeuAla: 5.267 ± 0.301
0.549LeuCys: 0.549 ± 0.107
5.519LeuAsp: 5.519 ± 0.358
5.46LeuGlu: 5.46 ± 0.324
2.582LeuPhe: 2.582 ± 0.222
4.318LeuGly: 4.318 ± 0.264
1.84LeuHis: 1.84 ± 0.176
4.125LeuIle: 4.125 ± 0.236
5.371LeuLys: 5.371 ± 0.242
5.223LeuLeu: 5.223 ± 0.332
2.24LeuMet: 2.24 ± 0.199
3.709LeuAsn: 3.709 ± 0.253
3.323LeuPro: 3.323 ± 0.208
3.947LeuGln: 3.947 ± 0.267
3.798LeuArg: 3.798 ± 0.242
4.362LeuSer: 4.362 ± 0.254
4.688LeuThr: 4.688 ± 0.293
4.125LeuVal: 4.125 ± 0.284
0.816LeuTrp: 0.816 ± 0.097
2.715LeuTyr: 2.715 ± 0.196
0.0LeuXaa: 0.0 ± 0.0
Met
2.285MetAla: 2.285 ± 0.162
0.252MetCys: 0.252 ± 0.063
2.048MetAsp: 2.048 ± 0.196
2.418MetGlu: 2.418 ± 0.245
1.157MetPhe: 1.157 ± 0.123
1.869MetGly: 1.869 ± 0.185
0.712MetHis: 0.712 ± 0.099
1.84MetIle: 1.84 ± 0.18
2.849MetLys: 2.849 ± 0.179
2.359MetLeu: 2.359 ± 0.238
1.261MetMet: 1.261 ± 0.152
1.988MetAsn: 1.988 ± 0.196
1.246MetPro: 1.246 ± 0.129
1.291MetGln: 1.291 ± 0.131
1.617MetArg: 1.617 ± 0.155
1.647MetSer: 1.647 ± 0.185
1.825MetThr: 1.825 ± 0.187
1.944MetVal: 1.944 ± 0.191
0.371MetTrp: 0.371 ± 0.056
0.994MetTyr: 0.994 ± 0.114
0.0MetXaa: 0.0 ± 0.0
Asn
2.745AsnAla: 2.745 ± 0.216
0.297AsnCys: 0.297 ± 0.065
2.878AsnAsp: 2.878 ± 0.185
3.576AsnGlu: 3.576 ± 0.237
1.825AsnPhe: 1.825 ± 0.171
3.961AsnGly: 3.961 ± 0.268
1.024AsnHis: 1.024 ± 0.128
3.101AsnIle: 3.101 ± 0.186
3.976AsnLys: 3.976 ± 0.245
3.101AsnLeu: 3.101 ± 0.209
1.602AsnMet: 1.602 ± 0.195
2.315AsnAsn: 2.315 ± 0.187
2.967AsnPro: 2.967 ± 0.223
2.122AsnGln: 2.122 ± 0.205
2.389AsnArg: 2.389 ± 0.187
2.344AsnSer: 2.344 ± 0.21
2.685AsnThr: 2.685 ± 0.249
3.457AsnVal: 3.457 ± 0.266
0.653AsnTrp: 0.653 ± 0.114
1.662AsnTyr: 1.662 ± 0.131
0.0AsnXaa: 0.0 ± 0.0
Pro
2.611ProAla: 2.611 ± 0.213
0.178ProCys: 0.178 ± 0.052
2.819ProAsp: 2.819 ± 0.178
3.071ProGlu: 3.071 ± 0.251
1.602ProPhe: 1.602 ± 0.189
2.151ProGly: 2.151 ± 0.2
0.697ProHis: 0.697 ± 0.131
2.3ProIle: 2.3 ± 0.234
2.775ProLys: 2.775 ± 0.226
2.775ProLeu: 2.775 ± 0.23
1.098ProMet: 1.098 ± 0.125
1.855ProAsn: 1.855 ± 0.176
1.231ProPro: 1.231 ± 0.154
1.172ProGln: 1.172 ± 0.132
1.588ProArg: 1.588 ± 0.141
2.166ProSer: 2.166 ± 0.178
2.582ProThr: 2.582 ± 0.216
2.849ProVal: 2.849 ± 0.214
0.43ProTrp: 0.43 ± 0.086
1.602ProTyr: 1.602 ± 0.157
0.0ProXaa: 0.0 ± 0.0
Gln
2.507GlnAla: 2.507 ± 0.219
0.163GlnCys: 0.163 ± 0.047
2.374GlnAsp: 2.374 ± 0.235
3.22GlnGlu: 3.22 ± 0.234
1.632GlnPhe: 1.632 ± 0.16
1.929GlnGly: 1.929 ± 0.163
0.92GlnHis: 0.92 ± 0.117
2.285GlnIle: 2.285 ± 0.215
2.7GlnLys: 2.7 ± 0.189
3.561GlnLeu: 3.561 ± 0.218
1.231GlnMet: 1.231 ± 0.169
1.869GlnAsn: 1.869 ± 0.174
1.246GlnPro: 1.246 ± 0.145
1.78GlnGln: 1.78 ± 0.232
2.3GlnArg: 2.3 ± 0.22
2.196GlnSer: 2.196 ± 0.181
2.478GlnThr: 2.478 ± 0.215
3.131GlnVal: 3.131 ± 0.21
0.504GlnTrp: 0.504 ± 0.083
1.944GlnTyr: 1.944 ± 0.163
0.0GlnXaa: 0.0 ± 0.0
Arg
2.953ArgAla: 2.953 ± 0.214
0.415ArgCys: 0.415 ± 0.079
3.22ArgAsp: 3.22 ± 0.259
3.398ArgGlu: 3.398 ± 0.253
2.048ArgPhe: 2.048 ± 0.209
2.685ArgGly: 2.685 ± 0.193
0.95ArgHis: 0.95 ± 0.109
3.947ArgIle: 3.947 ± 0.255
4.525ArgLys: 4.525 ± 0.371
4.362ArgLeu: 4.362 ± 0.259
2.033ArgMet: 2.033 ± 0.143
2.448ArgAsn: 2.448 ± 0.231
1.528ArgPro: 1.528 ± 0.14
2.018ArgGln: 2.018 ± 0.19
2.849ArgArg: 2.849 ± 0.322
2.151ArgSer: 2.151 ± 0.201
2.567ArgThr: 2.567 ± 0.173
3.338ArgVal: 3.338 ± 0.244
0.43ArgTrp: 0.43 ± 0.068
2.3ArgTyr: 2.3 ± 0.225
0.0ArgXaa: 0.0 ± 0.0
Ser
3.264SerAla: 3.264 ± 0.201
0.356SerCys: 0.356 ± 0.084
3.754SerAsp: 3.754 ± 0.255
3.591SerGlu: 3.591 ± 0.232
2.359SerPhe: 2.359 ± 0.174
3.828SerGly: 3.828 ± 0.251
1.32SerHis: 1.32 ± 0.165
3.338SerIle: 3.338 ± 0.215
3.798SerLys: 3.798 ± 0.244
4.021SerLeu: 4.021 ± 0.223
1.602SerMet: 1.602 ± 0.15
2.789SerAsn: 2.789 ± 0.213
2.137SerPro: 2.137 ± 0.209
1.736SerGln: 1.736 ± 0.167
2.745SerArg: 2.745 ± 0.167
2.953SerSer: 2.953 ± 0.213
3.427SerThr: 3.427 ± 0.272
3.739SerVal: 3.739 ± 0.268
0.727SerTrp: 0.727 ± 0.097
2.582SerTyr: 2.582 ± 0.187
0.0SerXaa: 0.0 ± 0.0
Thr
4.08ThrAla: 4.08 ± 0.28
0.49ThrCys: 0.49 ± 0.094
4.184ThrAsp: 4.184 ± 0.255
3.961ThrGlu: 3.961 ± 0.285
2.671ThrPhe: 2.671 ± 0.188
4.481ThrGly: 4.481 ± 0.346
1.113ThrHis: 1.113 ± 0.124
4.051ThrIle: 4.051 ± 0.254
4.451ThrLys: 4.451 ± 0.217
5.104ThrLeu: 5.104 ± 0.36
1.588ThrMet: 1.588 ± 0.165
2.864ThrAsn: 2.864 ± 0.254
2.656ThrPro: 2.656 ± 0.183
2.062ThrGln: 2.062 ± 0.221
2.374ThrArg: 2.374 ± 0.172
3.546ThrSer: 3.546 ± 0.289
3.101ThrThr: 3.101 ± 0.229
4.08ThrVal: 4.08 ± 0.296
0.608ThrTrp: 0.608 ± 0.126
2.552ThrTyr: 2.552 ± 0.255
0.0ThrXaa: 0.0 ± 0.0
Val
4.896ValAla: 4.896 ± 0.281
0.415ValCys: 0.415 ± 0.093
5.089ValAsp: 5.089 ± 0.291
5.297ValGlu: 5.297 ± 0.278
2.611ValPhe: 2.611 ± 0.213
3.65ValGly: 3.65 ± 0.25
1.469ValHis: 1.469 ± 0.178
4.11ValIle: 4.11 ± 0.271
4.629ValLys: 4.629 ± 0.285
4.229ValLeu: 4.229 ± 0.255
1.81ValMet: 1.81 ± 0.165
3.383ValAsn: 3.383 ± 0.275
2.552ValPro: 2.552 ± 0.236
2.982ValGln: 2.982 ± 0.231
3.947ValArg: 3.947 ± 0.233
4.466ValSer: 4.466 ± 0.236
4.54ValThr: 4.54 ± 0.325
4.525ValVal: 4.525 ± 0.334
0.564ValTrp: 0.564 ± 0.083
2.685ValTyr: 2.685 ± 0.205
0.0ValXaa: 0.0 ± 0.0
Trp
0.638TrpAla: 0.638 ± 0.109
0.104TrpCys: 0.104 ± 0.039
0.92TrpAsp: 0.92 ± 0.112
0.772TrpGlu: 0.772 ± 0.113
0.46TrpPhe: 0.46 ± 0.089
0.757TrpGly: 0.757 ± 0.112
0.193TrpHis: 0.193 ± 0.066
0.593TrpIle: 0.593 ± 0.113
1.098TrpLys: 1.098 ± 0.16
0.786TrpLeu: 0.786 ± 0.099
0.549TrpMet: 0.549 ± 0.086
0.653TrpAsn: 0.653 ± 0.101
0.163TrpPro: 0.163 ± 0.049
0.445TrpGln: 0.445 ± 0.08
0.445TrpArg: 0.445 ± 0.085
0.742TrpSer: 0.742 ± 0.102
0.757TrpThr: 0.757 ± 0.125
0.831TrpVal: 0.831 ± 0.105
0.119TrpTrp: 0.119 ± 0.039
0.475TrpTyr: 0.475 ± 0.081
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.76TyrAla: 2.76 ± 0.192
0.371TyrCys: 0.371 ± 0.07
3.427TyrAsp: 3.427 ± 0.194
3.234TyrGlu: 3.234 ± 0.215
1.395TyrPhe: 1.395 ± 0.147
2.834TyrGly: 2.834 ± 0.201
0.905TyrHis: 0.905 ± 0.113
2.997TyrIle: 2.997 ± 0.218
3.027TyrLys: 3.027 ± 0.209
2.626TyrLeu: 2.626 ± 0.231
1.142TyrMet: 1.142 ± 0.158
2.137TyrAsn: 2.137 ± 0.165
1.38TyrPro: 1.38 ± 0.19
2.033TyrGln: 2.033 ± 0.179
2.077TyrArg: 2.077 ± 0.168
2.255TyrSer: 2.255 ± 0.232
2.819TyrThr: 2.819 ± 0.228
2.908TyrVal: 2.908 ± 0.234
0.43TyrTrp: 0.43 ± 0.072
1.914TyrTyr: 1.914 ± 0.206
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 239 proteins (67400 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski