Amino acid dipeptide frequency for Bacillus phage vB_BsuS-Goe13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
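As a minimal sketch of how such frequencies could be computed (this is not the database's own code), the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input alphabet, and the choice to normalize by the total number of counted pairs are assumptions made for illustration; this page does not state the exact normalization used.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with a sliding window inside each
    protein (no pair spans two proteins) and normalized by the total
    number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every overlapping pair of neighbours
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter codes (hypothetical sequences, not this proteome)
freqs = dipeptide_frequencies(["MKKLE", "KKA"])
```

The table below uses three-letter codes; mapping one-letter pairs such as `KK` to `LysLys` would be a simple lookup step.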

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
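The expected value and the over/under comparison can be sketched as follows. The `classify` helper and its plain greater-than/less-than rule are assumptions; the page does not say whether the colouring uses a tolerance or takes the error estimates into account. The four observed values are copied from the table on this page.

```python
N_STANDARD = 20
# 1 / 400 of all dipeptides, expressed in per mille (2.5 per mille = 0.25%)
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# A few values taken from the table (per mille)
observed = {"LysLys": 9.624, "GluLeu": 8.183, "TrpTrp": 0.054, "GlyPro": 0.272}
labels = {dipeptide: classify(freq) for dipeptide, freq in observed.items()}
```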

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
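As a short worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the LysLys value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


lyslys = 9.624  # LysLys frequency from the table, in per mille
lyslys_fraction = per_mille_to_fraction(lyslys)  # about 0.009624
lyslys_percent = per_mille_to_percent(lyslys)    # about 0.9624
```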

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.507 ± 0.439
AlaCys: 0.788 ± 0.168
AlaAsp: 3.154 ± 0.298
AlaGlu: 3.29 ± 0.331
AlaPhe: 1.93 ± 0.219
AlaGly: 2.882 ± 0.355
AlaHis: 0.924 ± 0.15
AlaIle: 4.785 ± 0.344
AlaLys: 5.437 ± 0.552
AlaLeu: 4.078 ± 0.552
AlaMet: 1.169 ± 0.189
AlaAsn: 2.284 ± 0.233
AlaPro: 1.087 ± 0.185
AlaGln: 1.713 ± 0.36
AlaArg: 1.631 ± 0.185
AlaSer: 3.262 ± 0.397
AlaThr: 2.991 ± 0.252
AlaVal: 2.855 ± 0.293
AlaTrp: 0.489 ± 0.099
AlaTyr: 2.692 ± 0.298
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.353 ± 0.1
CysCys: 0.217 ± 0.071
CysAsp: 0.462 ± 0.118
CysGlu: 0.897 ± 0.172
CysPhe: 0.734 ± 0.152
CysGly: 0.897 ± 0.153
CysHis: 0.272 ± 0.085
CysIle: 0.761 ± 0.158
CysLys: 0.843 ± 0.23
CysLeu: 1.169 ± 0.187
CysMet: 0.19 ± 0.073
CysAsn: 0.68 ± 0.137
CysPro: 0.326 ± 0.107
CysGln: 0.299 ± 0.084
CysArg: 0.435 ± 0.125
CysSer: 0.68 ± 0.136
CysThr: 0.408 ± 0.124
CysVal: 0.625 ± 0.162
CysTrp: 0.136 ± 0.071
CysTyr: 0.272 ± 0.096
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.528 ± 0.32
AspCys: 0.788 ± 0.174
AspAsp: 3.507 ± 0.334
AspGlu: 5.41 ± 0.388
AspPhe: 3.072 ± 0.245
AspGly: 4.187 ± 0.363
AspHis: 0.897 ± 0.167
AspIle: 5.492 ± 0.436
AspLys: 5.41 ± 0.424
AspLeu: 5.138 ± 0.342
AspMet: 1.55 ± 0.197
AspAsn: 3.534 ± 0.335
AspPro: 1.767 ± 0.204
AspGln: 2.229 ± 0.259
AspArg: 1.985 ± 0.284
AspSer: 4.296 ± 0.281
AspThr: 2.392 ± 0.288
AspVal: 3.752 ± 0.317
AspTrp: 0.517 ± 0.119
AspTyr: 3.833 ± 0.307
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.54 ± 0.443
GluCys: 0.87 ± 0.179
GluAsp: 4.622 ± 0.417
GluGlu: 7.205 ± 0.436
GluPhe: 2.827 ± 0.232
GluGly: 3.779 ± 0.371
GluHis: 1.414 ± 0.25
GluIle: 6.824 ± 0.435
GluLys: 7.748 ± 0.504
GluLeu: 8.183 ± 0.549
GluMet: 2.392 ± 0.271
GluAsn: 4.132 ± 0.379
GluPro: 1.495 ± 0.202
GluGln: 3.48 ± 0.36
GluArg: 3.806 ± 0.394
GluSer: 4.323 ± 0.357
GluThr: 4.024 ± 0.34
GluVal: 4.921 ± 0.422
GluTrp: 1.223 ± 0.22
GluTyr: 3.752 ± 0.321
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.686 ± 0.192
PheCys: 0.408 ± 0.103
PheAsp: 2.909 ± 0.279
PheGlu: 3.942 ± 0.37
PhePhe: 1.985 ± 0.326
PheGly: 2.121 ± 0.245
PheHis: 1.006 ± 0.177
PheIle: 2.963 ± 0.277
PheLys: 4.459 ± 0.386
PheLeu: 3.072 ± 0.383
PheMet: 0.952 ± 0.183
PheAsn: 3.616 ± 0.339
PhePro: 1.223 ± 0.19
PheGln: 0.788 ± 0.138
PheArg: 1.414 ± 0.187
PheSer: 2.909 ± 0.292
PheThr: 2.175 ± 0.207
PheVal: 2.284 ± 0.319
PheTrp: 0.272 ± 0.093
PheTyr: 2.121 ± 0.275
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.42 ± 0.267
GlyCys: 0.734 ± 0.142
GlyAsp: 2.936 ± 0.245
GlyGlu: 3.67 ± 0.32
GlyPhe: 2.719 ± 0.235
GlyGly: 3.072 ± 0.381
GlyHis: 1.033 ± 0.175
GlyIle: 4.214 ± 0.418
GlyLys: 4.867 ± 0.366
GlyLeu: 5.22 ± 0.355
GlyMet: 1.387 ± 0.192
GlyAsn: 3.534 ± 0.368
GlyPro: 0.272 ± 0.085
GlyGln: 2.039 ± 0.244
GlyArg: 1.631 ± 0.22
GlySer: 3.67 ± 0.313
GlyThr: 3.235 ± 0.34
GlyVal: 3.398 ± 0.32
GlyTrp: 0.408 ± 0.1
GlyTyr: 2.909 ± 0.282
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.652 ± 0.125
HisCys: 0.163 ± 0.067
HisAsp: 0.761 ± 0.126
HisGlu: 1.441 ± 0.22
HisPhe: 0.924 ± 0.174
HisGly: 0.979 ± 0.145
HisHis: 0.272 ± 0.08
HisIle: 1.278 ± 0.217
HisLys: 1.985 ± 0.267
HisLeu: 1.387 ± 0.178
HisMet: 0.517 ± 0.127
HisAsn: 1.441 ± 0.229
HisPro: 0.517 ± 0.105
HisGln: 0.598 ± 0.14
HisArg: 0.707 ± 0.134
HisSer: 1.196 ± 0.185
HisThr: 1.06 ± 0.181
HisVal: 0.979 ± 0.152
HisTrp: 0.272 ± 0.094
HisTyr: 0.843 ± 0.167
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.459 ± 0.39
IleCys: 0.952 ± 0.193
IleAsp: 5.465 ± 0.355
IleGlu: 6.96 ± 0.483
IlePhe: 2.637 ± 0.365
IleGly: 4.132 ± 0.388
IleHis: 1.822 ± 0.286
IleIle: 4.758 ± 0.5
IleLys: 8.075 ± 0.511
IleLeu: 5.057 ± 0.448
IleMet: 1.387 ± 0.207
IleAsn: 5.03 ± 0.34
IlePro: 2.528 ± 0.266
IleGln: 2.8 ± 0.3
IleArg: 2.991 ± 0.313
IleSer: 5.193 ± 0.473
IleThr: 4.54 ± 0.401
IleVal: 3.752 ± 0.35
IleTrp: 0.897 ± 0.172
IleTyr: 2.664 ± 0.311
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.057 ± 0.457
LysCys: 0.761 ± 0.175
LysAsp: 6.389 ± 0.44
LysGlu: 9.244 ± 0.54
LysPhe: 3.208 ± 0.318
LysGly: 5.274 ± 0.544
LysHis: 1.93 ± 0.275
LysIle: 6.144 ± 0.416
LysLys: 9.624 ± 0.649
LysLeu: 8.537 ± 0.539
LysMet: 1.93 ± 0.228
LysAsn: 5.927 ± 0.449
LysPro: 2.066 ± 0.285
LysGln: 3.861 ± 0.425
LysArg: 4.241 ± 0.403
LysSer: 6.307 ± 0.583
LysThr: 5.764 ± 0.463
LysVal: 6.199 ± 0.436
LysTrp: 0.897 ± 0.199
LysTyr: 4.54 ± 0.399
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.105 ± 0.321
LeuCys: 0.897 ± 0.149
LeuAsp: 5.519 ± 0.329
LeuGlu: 5.601 ± 0.549
LeuPhe: 4.051 ± 0.424
LeuGly: 4.024 ± 0.283
LeuHis: 1.251 ± 0.189
LeuIle: 6.906 ± 0.477
LeuLys: 8.7 ± 0.602
LeuLeu: 7.857 ± 0.514
LeuMet: 1.794 ± 0.208
LeuAsn: 6.715 ± 0.464
LeuPro: 2.093 ± 0.253
LeuGln: 3.262 ± 0.522
LeuArg: 3.616 ± 0.322
LeuSer: 6.797 ± 0.384
LeuThr: 5.166 ± 0.382
LeuVal: 4.377 ± 0.396
LeuTrp: 0.68 ± 0.158
LeuTyr: 3.752 ± 0.426
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.522 ± 0.162
MetCys: 0.272 ± 0.095
MetAsp: 1.359 ± 0.184
MetGlu: 1.794 ± 0.221
MetPhe: 0.952 ± 0.185
MetGly: 1.115 ± 0.189
MetHis: 0.299 ± 0.088
MetIle: 1.849 ± 0.221
MetLys: 2.556 ± 0.27
MetLeu: 1.985 ± 0.254
MetMet: 0.652 ± 0.145
MetAsn: 1.522 ± 0.233
MetPro: 0.734 ± 0.121
MetGln: 0.924 ± 0.182
MetArg: 0.816 ± 0.131
MetSer: 1.794 ± 0.213
MetThr: 1.55 ± 0.206
MetVal: 0.816 ± 0.147
MetTrp: 0.272 ± 0.093
MetTyr: 0.68 ± 0.13
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.262 ± 0.361
AsnCys: 0.489 ± 0.132
AsnAsp: 4.241 ± 0.341
AsnGlu: 5.356 ± 0.313
AsnPhe: 2.392 ± 0.256
AsnGly: 3.915 ± 0.333
AsnHis: 0.87 ± 0.164
AsnIle: 4.323 ± 0.339
AsnLys: 7.123 ± 0.506
AsnLeu: 4.676 ± 0.299
AsnMet: 1.495 ± 0.207
AsnAsn: 4.16 ± 0.391
AsnPro: 2.093 ± 0.269
AsnGln: 1.876 ± 0.194
AsnArg: 2.311 ± 0.228
AsnSer: 3.969 ± 0.327
AsnThr: 3.099 ± 0.384
AsnVal: 3.67 ± 0.294
AsnTrp: 0.571 ± 0.124
AsnTyr: 2.474 ± 0.261
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.604 ± 0.192
ProCys: 0.136 ± 0.063
ProAsp: 2.012 ± 0.201
ProGlu: 2.257 ± 0.255
ProPhe: 0.897 ± 0.18
ProGly: 0.897 ± 0.129
ProHis: 0.734 ± 0.126
ProIle: 1.55 ± 0.211
ProLys: 2.175 ± 0.258
ProLeu: 2.257 ± 0.276
ProMet: 0.462 ± 0.117
ProAsn: 1.387 ± 0.225
ProPro: 0.489 ± 0.118
ProGln: 0.816 ± 0.137
ProArg: 0.788 ± 0.179
ProSer: 2.121 ± 0.273
ProThr: 1.359 ± 0.214
ProVal: 1.169 ± 0.196
ProTrp: 0.217 ± 0.089
ProTyr: 1.495 ± 0.24
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.284 ± 0.338
GlnCys: 0.245 ± 0.081
GlnAsp: 1.495 ± 0.175
GlnGlu: 3.127 ± 0.462
GlnPhe: 1.794 ± 0.204
GlnGly: 1.658 ± 0.233
GlnHis: 0.408 ± 0.112
GlnIle: 2.692 ± 0.349
GlnLys: 3.426 ± 0.448
GlnLeu: 3.697 ± 0.356
GlnMet: 1.115 ± 0.168
GlnAsn: 2.392 ± 0.299
GlnPro: 0.652 ± 0.131
GlnGln: 1.903 ± 0.481
GlnArg: 1.441 ± 0.199
GlnSer: 2.148 ± 0.276
GlnThr: 1.468 ± 0.213
GlnVal: 2.284 ± 0.247
GlnTrp: 0.489 ± 0.122
GlnTyr: 1.414 ± 0.19
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.74 ± 0.234
ArgCys: 0.326 ± 0.102
ArgAsp: 2.039 ± 0.246
ArgGlu: 2.719 ± 0.31
ArgPhe: 1.93 ± 0.24
ArgGly: 1.903 ± 0.305
ArgHis: 0.761 ± 0.176
ArgIle: 2.882 ± 0.28
ArgLys: 3.29 ± 0.265
ArgLeu: 3.344 ± 0.348
ArgMet: 1.196 ± 0.177
ArgAsn: 2.556 ± 0.298
ArgPro: 0.843 ± 0.154
ArgGln: 1.577 ± 0.197
ArgArg: 1.577 ± 0.228
ArgSer: 2.284 ± 0.263
ArgThr: 1.957 ± 0.212
ArgVal: 2.338 ± 0.26
ArgTrp: 0.326 ± 0.083
ArgTyr: 1.767 ± 0.221
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.963 ± 0.322
SerCys: 0.952 ± 0.165
SerAsp: 4.54 ± 0.422
SerGlu: 5.22 ± 0.425
SerPhe: 3.344 ± 0.295
SerGly: 3.507 ± 0.328
SerHis: 1.006 ± 0.159
SerIle: 5.601 ± 0.387
SerLys: 5.9 ± 0.427
SerLeu: 6.199 ± 0.405
SerMet: 1.631 ± 0.217
SerAsn: 4.241 ± 0.289
SerPro: 1.985 ± 0.207
SerGln: 2.012 ± 0.227
SerArg: 2.012 ± 0.163
SerSer: 5.41 ± 0.555
SerThr: 3.398 ± 0.399
SerVal: 4.296 ± 0.332
SerTrp: 0.734 ± 0.142
SerTyr: 2.583 ± 0.24
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.154 ± 0.412
ThrCys: 0.408 ± 0.13
ThrAsp: 2.991 ± 0.327
ThrGlu: 4.296 ± 0.32
ThrPhe: 1.93 ± 0.22
ThrGly: 3.344 ± 0.335
ThrHis: 0.68 ± 0.106
ThrIle: 4.268 ± 0.319
ThrLys: 4.567 ± 0.377
ThrLeu: 4.948 ± 0.405
ThrMet: 1.087 ± 0.141
ThrAsn: 2.664 ± 0.273
ThrPro: 1.713 ± 0.196
ThrGln: 1.849 ± 0.278
ThrArg: 1.658 ± 0.21
ThrSer: 3.426 ± 0.41
ThrThr: 2.909 ± 0.371
ThrVal: 4.268 ± 0.338
ThrTrp: 0.544 ± 0.13
ThrTyr: 2.963 ± 0.241
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.936 ± 0.26
ValCys: 0.544 ± 0.124
ValAsp: 3.888 ± 0.277
ValGlu: 4.513 ± 0.379
ValPhe: 2.365 ± 0.261
ValGly: 2.827 ± 0.369
ValHis: 1.278 ± 0.208
ValIle: 4.051 ± 0.343
ValLys: 5.818 ± 0.411
ValLeu: 5.138 ± 0.404
ValMet: 1.305 ± 0.202
ValAsn: 3.589 ± 0.384
ValPro: 1.522 ± 0.209
ValGln: 1.985 ± 0.299
ValArg: 2.365 ± 0.26
ValSer: 4.078 ± 0.352
ValThr: 3.208 ± 0.342
ValVal: 3.045 ± 0.269
ValTrp: 0.571 ± 0.133
ValTyr: 2.963 ± 0.347
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.571 ± 0.133
TrpCys: 0.19 ± 0.069
TrpAsp: 0.816 ± 0.146
TrpGlu: 0.788 ± 0.173
TrpPhe: 0.489 ± 0.111
TrpGly: 0.625 ± 0.15
TrpHis: 0.136 ± 0.07
TrpIle: 0.924 ± 0.176
TrpLys: 0.952 ± 0.184
TrpLeu: 0.924 ± 0.166
TrpMet: 0.217 ± 0.074
TrpAsn: 0.517 ± 0.13
TrpPro: 0.082 ± 0.042
TrpGln: 0.245 ± 0.094
TrpArg: 0.435 ± 0.103
TrpSer: 0.598 ± 0.123
TrpThr: 0.353 ± 0.083
TrpVal: 0.517 ± 0.117
TrpTrp: 0.054 ± 0.041
TrpTyr: 0.625 ± 0.133
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.604 ± 0.188
TyrCys: 0.517 ± 0.123
TyrAsp: 3.181 ± 0.254
TyrGlu: 3.833 ± 0.353
TyrPhe: 2.121 ± 0.268
TyrGly: 2.229 ± 0.278
TyrHis: 0.979 ± 0.162
TyrIle: 3.833 ± 0.334
TyrLys: 4.948 ± 0.381
TyrLeu: 4.296 ± 0.338
TyrMet: 1.006 ± 0.164
TyrAsn: 2.447 ± 0.272
TyrPro: 1.332 ± 0.184
TyrGln: 1.876 ± 0.191
TyrArg: 1.522 ± 0.183
TyrSer: 3.072 ± 0.294
TyrThr: 2.61 ± 0.227
TyrVal: 2.447 ± 0.254
TyrTrp: 0.489 ± 0.12
TyrTyr: 2.012 ± 0.247
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 179 proteins (36783 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
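A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the use of the standard deviation as the error measure, the fixed seed, and the normalization by pair count are all assumptions; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide (one-letter codes) in a protein set."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the phage proteome described here
proteins = ["MKKLEK", "MAKKA", "MLEELK", "MKKKS"]
error = bootstrap_error(proteins, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so neighbouring residues stay paired as they are in the original data.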

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski