Amino acid dipeptide frequency for Bacillus phage SP01 (Bacteriophage SP01)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
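As a rough illustration of what such a calculation might look like, the sketch below counts overlapping residue pairs in a set of protein sequences and reports each pair's share in per mille. The function name, the one-letter sequence alphabet, the toy sequences, and the choice to normalize by the total number of counted pairs are all assumptions made for this example; the page does not state how the database itself normalizes its values.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: frequencies are normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs (i, i+1); pairs never span two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the SP01 proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

With one-letter codes, the key "KK" corresponds to the LysLys entry in the three-letter notation used in the table.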

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰); to convert them to fractions, multiply by 10^-3.
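The uniform expectation and the unit conversion can be made concrete with a few lines of arithmetic. The sketch below expresses the 1/400 expectation in per mille (2.5‰) and labels a value as over- or underrepresented relative to it. The helper names are hypothetical, and treating exactly 2.5‰ as the cut-off for the red and blue marking is an assumption of this example, not something the page specifies.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) = 0.0025, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)

def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"

# Two values taken from the table: GluGlu (8.467) and CysCys (0.076).
print(classify(8.467))  # over
print(classify(0.076))  # under
```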

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.997 ± 0.26
AlaCys: 0.581 ± 0.107
AlaAsp: 3.058 ± 0.312
AlaGlu: 3.892 ± 0.334
AlaPhe: 2.401 ± 0.298
AlaGly: 3.867 ± 0.378
AlaHis: 0.885 ± 0.136
AlaIle: 3.842 ± 0.321
AlaLys: 4.751 ± 0.336
AlaLeu: 5.307 ± 0.36
AlaMet: 1.567 ± 0.247
AlaAsn: 2.654 ± 0.247
AlaPro: 1.592 ± 0.236
AlaGln: 1.971 ± 0.231
AlaArg: 2.502 ± 0.238
AlaSer: 2.982 ± 0.295
AlaThr: 2.957 ± 0.319
AlaVal: 4.297 ± 0.395
AlaTrp: 0.581 ± 0.11
AlaTyr: 2.401 ± 0.246
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.505 ± 0.115
CysCys: 0.076 ± 0.042
CysAsp: 0.404 ± 0.1
CysGlu: 0.91 ± 0.168
CysPhe: 0.202 ± 0.064
CysGly: 0.581 ± 0.122
CysHis: 0.253 ± 0.073
CysIle: 0.556 ± 0.132
CysLys: 0.885 ± 0.156
CysLeu: 0.758 ± 0.163
CysMet: 0.253 ± 0.09
CysAsn: 0.43 ± 0.111
CysPro: 0.303 ± 0.104
CysGln: 0.455 ± 0.085
CysArg: 0.43 ± 0.099
CysSer: 0.708 ± 0.138
CysThr: 0.783 ± 0.168
CysVal: 0.708 ± 0.137
CysTrp: 0.101 ± 0.049
CysTyr: 0.43 ± 0.108
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.311 ± 0.284
AspCys: 0.885 ± 0.172
AspAsp: 3.74 ± 0.389
AspGlu: 4.827 ± 0.461
AspPhe: 2.578 ± 0.258
AspGly: 4.372 ± 0.366
AspHis: 0.96 ± 0.156
AspIle: 4.777 ± 0.338
AspLys: 4.701 ± 0.329
AspLeu: 6.698 ± 0.486
AspMet: 1.971 ± 0.249
AspAsn: 3.715 ± 0.351
AspPro: 2.654 ± 0.282
AspGln: 2.072 ± 0.278
AspArg: 2.805 ± 0.272
AspSer: 4.246 ± 0.34
AspThr: 4.271 ± 0.31
AspVal: 4.448 ± 0.351
AspTrp: 0.859 ± 0.166
AspTyr: 2.755 ± 0.338
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.322 ± 0.303
GluCys: 0.682 ± 0.137
GluAsp: 7.38 ± 0.562
GluGlu: 8.467 ± 0.74
GluPhe: 2.982 ± 0.228
GluGly: 6.47 ± 0.456
GluHis: 1.441 ± 0.208
GluIle: 5.156 ± 0.405
GluLys: 6.773 ± 0.525
GluLeu: 6.217 ± 0.453
GluMet: 2.224 ± 0.246
GluAsn: 3.639 ± 0.325
GluPro: 2.3 ± 0.286
GluGln: 2.805 ± 0.265
GluArg: 3.058 ± 0.321
GluSer: 4.195 ± 0.339
GluThr: 3.109 ± 0.289
GluVal: 6.293 ± 0.493
GluTrp: 1.036 ± 0.148
GluTyr: 3.387 ± 0.299
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.34 ± 0.177
PheCys: 0.278 ± 0.082
PheAsp: 2.426 ± 0.286
PheGlu: 2.553 ± 0.271
PhePhe: 1.238 ± 0.18
PheGly: 2.452 ± 0.288
PheHis: 0.859 ± 0.162
PheIle: 2.224 ± 0.252
PheLys: 2.932 ± 0.32
PheLeu: 3.083 ± 0.305
PheMet: 1.213 ± 0.183
PheAsn: 1.87 ± 0.211
PhePro: 1.112 ± 0.207
PheGln: 1.061 ± 0.161
PheArg: 2.123 ± 0.249
PheSer: 2.477 ± 0.246
PheThr: 2.123 ± 0.231
PheVal: 2.199 ± 0.218
PheTrp: 0.404 ± 0.106
PheTyr: 1.441 ± 0.208
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.74 ± 0.4
GlyCys: 0.834 ± 0.141
GlyAsp: 4.473 ± 0.352
GlyGlu: 5.585 ± 0.313
GlyPhe: 2.477 ± 0.253
GlyGly: 4.979 ± 0.488
GlyHis: 0.783 ± 0.139
GlyIle: 4.726 ± 0.362
GlyLys: 4.676 ± 0.374
GlyLeu: 5.712 ± 0.397
GlyMet: 1.719 ± 0.24
GlyAsn: 3.134 ± 0.34
GlyPro: 1.137 ± 0.189
GlyGln: 1.643 ± 0.221
GlyArg: 2.755 ± 0.271
GlySer: 4.6 ± 0.401
GlyThr: 4.524 ± 0.405
GlyVal: 5.535 ± 0.396
GlyTrp: 0.783 ± 0.149
GlyTyr: 3.589 ± 0.348
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.91 ± 0.15
HisCys: 0.177 ± 0.064
HisAsp: 1.112 ± 0.167
HisGlu: 1.011 ± 0.167
HisPhe: 0.657 ± 0.114
HisGly: 1.34 ± 0.219
HisHis: 0.531 ± 0.121
HisIle: 1.365 ± 0.224
HisLys: 1.592 ± 0.226
HisLeu: 1.668 ± 0.219
HisMet: 0.632 ± 0.12
HisAsn: 0.809 ± 0.152
HisPro: 0.885 ± 0.129
HisGln: 0.354 ± 0.092
HisArg: 0.91 ± 0.165
HisSer: 1.163 ± 0.174
HisThr: 1.188 ± 0.157
HisVal: 1.264 ± 0.177
HisTrp: 0.303 ± 0.078
HisTyr: 1.011 ± 0.19
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.311 ± 0.296
IleCys: 0.758 ± 0.143
IleAsp: 4.676 ± 0.322
IleGlu: 5.661 ± 0.453
IlePhe: 1.896 ± 0.235
IleGly: 3.892 ± 0.373
IleHis: 1.542 ± 0.221
IleIle: 4.246 ± 0.332
IleLys: 5.863 ± 0.376
IleLeu: 4.928 ± 0.346
IleMet: 1.567 ± 0.172
IleAsn: 3.488 ± 0.277
IlePro: 2.654 ± 0.271
IleGln: 2.275 ± 0.231
IleArg: 2.881 ± 0.29
IleSer: 3.513 ± 0.387
IleThr: 4.372 ± 0.383
IleVal: 3.842 ± 0.315
IleTrp: 0.43 ± 0.096
IleTyr: 2.325 ± 0.246
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.409 ± 0.459
LysCys: 0.581 ± 0.147
LysAsp: 5.661 ± 0.453
LysGlu: 7.203 ± 0.59
LysPhe: 2.426 ± 0.285
LysGly: 5.156 ± 0.408
LysHis: 1.971 ± 0.275
LysIle: 4.322 ± 0.328
LysLys: 7.582 ± 0.67
LysLeu: 5.535 ± 0.349
LysMet: 2.376 ± 0.214
LysAsn: 3.412 ± 0.363
LysPro: 2.856 ± 0.286
LysGln: 2.502 ± 0.263
LysArg: 3.816 ± 0.34
LysSer: 4.853 ± 0.396
LysThr: 4.019 ± 0.267
LysVal: 5.99 ± 0.337
LysTrp: 1.011 ± 0.132
LysTyr: 3.993 ± 0.278
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.827 ± 0.346
LeuCys: 0.809 ± 0.152
LeuAsp: 6.091 ± 0.362
LeuGlu: 8.037 ± 0.603
LeuPhe: 2.982 ± 0.261
LeuGly: 5.156 ± 0.337
LeuHis: 1.466 ± 0.201
LeuIle: 4.17 ± 0.356
LeuLys: 6.217 ± 0.401
LeuLeu: 5.99 ± 0.453
LeuMet: 2.224 ± 0.218
LeuAsn: 3.589 ± 0.309
LeuPro: 2.831 ± 0.249
LeuGln: 3.134 ± 0.344
LeuArg: 3.842 ± 0.31
LeuSer: 5.889 ± 0.378
LeuThr: 4.625 ± 0.352
LeuVal: 4.979 ± 0.372
LeuTrp: 0.935 ± 0.142
LeuTyr: 3.235 ± 0.279
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.224 ± 0.261
MetCys: 0.202 ± 0.061
MetAsp: 2.123 ± 0.236
MetGlu: 2.376 ± 0.283
MetPhe: 0.91 ± 0.168
MetGly: 1.744 ± 0.186
MetHis: 0.43 ± 0.099
MetIle: 1.542 ± 0.206
MetLys: 2.35 ± 0.289
MetLeu: 1.82 ± 0.221
MetMet: 0.758 ± 0.139
MetAsn: 1.491 ± 0.178
MetPro: 0.809 ± 0.137
MetGln: 0.885 ± 0.152
MetArg: 1.112 ± 0.183
MetSer: 2.072 ± 0.204
MetThr: 1.592 ± 0.203
MetVal: 1.693 ± 0.193
MetTrp: 0.303 ± 0.084
MetTyr: 1.238 ± 0.201
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.35 ± 0.231
AsnCys: 0.632 ± 0.156
AsnAsp: 2.401 ± 0.242
AsnGlu: 2.856 ± 0.265
AsnPhe: 1.87 ± 0.238
AsnGly: 3.361 ± 0.295
AsnHis: 0.935 ± 0.174
AsnIle: 3.589 ± 0.276
AsnLys: 3.993 ± 0.296
AsnLeu: 3.842 ± 0.319
AsnMet: 1.264 ± 0.168
AsnAsn: 2.881 ± 0.332
AsnPro: 2.376 ± 0.283
AsnGln: 1.618 ± 0.212
AsnArg: 2.249 ± 0.265
AsnSer: 3.311 ± 0.289
AsnThr: 3.69 ± 0.474
AsnVal: 3.235 ± 0.328
AsnTrp: 0.505 ± 0.111
AsnTyr: 1.845 ± 0.214
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.567 ± 0.192
ProCys: 0.253 ± 0.073
ProAsp: 2.502 ± 0.254
ProGlu: 3.184 ± 0.281
ProPhe: 1.289 ± 0.153
ProGly: 1.744 ± 0.216
ProHis: 0.43 ± 0.105
ProIle: 2.123 ± 0.229
ProLys: 2.78 ± 0.294
ProLeu: 2.249 ± 0.204
ProMet: 0.607 ± 0.138
ProAsn: 1.592 ± 0.228
ProPro: 0.859 ± 0.182
ProGln: 1.036 ± 0.207
ProArg: 1.415 ± 0.162
ProSer: 2.578 ± 0.271
ProThr: 2.477 ± 0.273
ProVal: 2.831 ± 0.304
ProTrp: 0.329 ± 0.087
ProTyr: 1.668 ± 0.199
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.971 ± 0.226
GlnCys: 0.354 ± 0.081
GlnAsp: 1.997 ± 0.182
GlnGlu: 2.603 ± 0.313
GlnPhe: 0.986 ± 0.168
GlnGly: 1.87 ± 0.223
GlnHis: 0.733 ± 0.129
GlnIle: 2.325 ± 0.213
GlnLys: 2.578 ± 0.268
GlnLeu: 3.134 ± 0.244
GlnMet: 1.011 ± 0.139
GlnAsn: 1.516 ± 0.193
GlnPro: 0.809 ± 0.134
GlnGln: 1.163 ± 0.195
GlnArg: 1.466 ± 0.169
GlnSer: 2.047 ± 0.219
GlnThr: 1.87 ± 0.231
GlnVal: 2.502 ± 0.202
GlnTrp: 0.455 ± 0.112
GlnTyr: 1.794 ± 0.206
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.174 ± 0.222
ArgCys: 0.404 ± 0.097
ArgAsp: 2.452 ± 0.21
ArgGlu: 3.058 ± 0.234
ArgPhe: 1.769 ± 0.188
ArgGly: 2.78 ± 0.287
ArgHis: 0.505 ± 0.115
ArgIle: 3.134 ± 0.209
ArgLys: 3.993 ± 0.331
ArgLeu: 3.917 ± 0.312
ArgMet: 1.643 ± 0.21
ArgAsn: 2.452 ± 0.209
ArgPro: 1.39 ± 0.213
ArgGln: 1.769 ± 0.19
ArgArg: 2.679 ± 0.265
ArgSer: 2.603 ± 0.258
ArgThr: 2.35 ± 0.202
ArgVal: 3.437 ± 0.287
ArgTrp: 0.657 ± 0.108
ArgTyr: 1.744 ± 0.198
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.26 ± 0.323
SerCys: 0.253 ± 0.098
SerAsp: 4.17 ± 0.326
SerGlu: 4.676 ± 0.334
SerPhe: 2.376 ± 0.197
SerGly: 4.777 ± 0.391
SerHis: 1.087 ± 0.17
SerIle: 4.195 ± 0.293
SerLys: 5.282 ± 0.423
SerLeu: 5.307 ± 0.404
SerMet: 1.921 ± 0.277
SerAsn: 2.932 ± 0.311
SerPro: 2.148 ± 0.247
SerGln: 2.174 ± 0.241
SerArg: 3.184 ± 0.298
SerSer: 3.665 ± 0.415
SerThr: 4.17 ± 0.372
SerVal: 4.625 ± 0.415
SerTrp: 0.632 ± 0.123
SerTyr: 3.033 ± 0.276
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.17 ± 0.398
ThrCys: 0.581 ± 0.105
ThrAsp: 3.917 ± 0.295
ThrGlu: 4.701 ± 0.442
ThrPhe: 2.527 ± 0.268
ThrGly: 4.751 ± 0.477
ThrHis: 1.264 ± 0.224
ThrIle: 4.12 ± 0.349
ThrLys: 4.347 ± 0.382
ThrLeu: 4.6 ± 0.335
ThrMet: 1.188 ± 0.169
ThrAsn: 2.881 ± 0.289
ThrPro: 2.553 ± 0.233
ThrGln: 2.249 ± 0.28
ThrArg: 2.047 ± 0.201
ThrSer: 3.665 ± 0.436
ThrThr: 3.564 ± 0.363
ThrVal: 4.448 ± 0.429
ThrTrp: 0.708 ± 0.144
ThrTyr: 2.426 ± 0.246
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.322 ± 0.334
ValCys: 0.657 ± 0.108
ValAsp: 4.448 ± 0.293
ValGlu: 6.243 ± 0.456
ValPhe: 1.769 ± 0.205
ValGly: 4.347 ± 0.335
ValHis: 1.415 ± 0.222
ValIle: 4.044 ± 0.273
ValLys: 5.51 ± 0.438
ValLeu: 5.636 ± 0.436
ValMet: 1.87 ± 0.231
ValAsn: 3.058 ± 0.275
ValPro: 2.906 ± 0.303
ValGln: 2.401 ± 0.192
ValArg: 2.805 ± 0.298
ValSer: 5.788 ± 0.435
ValThr: 5.636 ± 0.392
ValVal: 5.965 ± 0.495
ValTrp: 0.581 ± 0.135
ValTyr: 3.159 ± 0.274
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.581 ± 0.099
TrpCys: 0.278 ± 0.09
TrpAsp: 0.859 ± 0.152
TrpGlu: 0.986 ± 0.177
TrpPhe: 0.455 ± 0.115
TrpGly: 0.986 ± 0.168
TrpHis: 0.278 ± 0.09
TrpIle: 0.758 ± 0.139
TrpLys: 0.758 ± 0.121
TrpLeu: 0.96 ± 0.167
TrpMet: 0.354 ± 0.1
TrpAsn: 0.632 ± 0.13
TrpPro: 0.0 ± 0.0
TrpGln: 0.278 ± 0.086
TrpArg: 0.455 ± 0.094
TrpSer: 0.581 ± 0.096
TrpThr: 0.354 ± 0.094
TrpVal: 1.087 ± 0.156
TrpTrp: 0.278 ± 0.093
TrpTyr: 0.404 ± 0.089
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.794 ± 0.223
TyrCys: 0.455 ± 0.103
TyrAsp: 2.755 ± 0.247
TyrGlu: 3.109 ± 0.267
TyrPhe: 1.618 ± 0.224
TyrGly: 2.78 ± 0.277
TyrHis: 1.112 ± 0.174
TyrIle: 2.906 ± 0.294
TyrLys: 3.109 ± 0.322
TyrLeu: 3.665 ± 0.278
TyrMet: 1.289 ± 0.177
TyrAsn: 2.654 ± 0.246
TyrPro: 1.34 ± 0.207
TyrGln: 1.34 ± 0.174
TyrArg: 2.376 ± 0.248
TyrSer: 2.932 ± 0.261
TyrThr: 3.033 ± 0.231
TyrVal: 3.184 ± 0.284
TyrTrp: 0.43 ± 0.101
TyrTyr: 1.668 ± 0.196
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 204 proteins (39568 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
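One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the one-letter toy sequences, the fixed seed, and the use of the standard deviation across replicates as the reported "±" value are all assumptions of this example; the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter

def pair_frequency(sequences, pair):
    """Frequency of one dipeptide in per mille (assumed normalization: all pairs)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy proteins (hypothetical, not from the SP01 proteome).
proteins = ["MKKLA", "AKK", "MLLKK", "GAVKK"]
print(bootstrap_error(proteins, "KK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster within a single protein contribute to the estimated spread.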

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski