Amino acid dipepetide frequency for Bacillus phage BCP78

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.105AlaAla: 5.105 ± 0.427
0.279AlaCys: 0.279 ± 0.084
3.903AlaAsp: 3.903 ± 0.314
4.89AlaGlu: 4.89 ± 0.375
2.359AlaPhe: 2.359 ± 0.18
3.882AlaGly: 3.882 ± 0.42
1.137AlaHis: 1.137 ± 0.162
4.504AlaIle: 4.504 ± 0.379
5.469AlaLys: 5.469 ± 0.31
5.598AlaLeu: 5.598 ± 0.37
2.016AlaMet: 2.016 ± 0.189
3.11AlaAsn: 3.11 ± 0.417
2.531AlaPro: 2.531 ± 0.462
2.788AlaGln: 2.788 ± 0.258
3.131AlaArg: 3.131 ± 0.273
3.41AlaSer: 3.41 ± 0.315
4.375AlaThr: 4.375 ± 0.408
3.989AlaVal: 3.989 ± 0.283
1.115AlaTrp: 1.115 ± 0.168
2.531AlaTyr: 2.531 ± 0.213
0.0AlaXaa: 0.0 ± 0.0
Cys
0.322CysAla: 0.322 ± 0.068
0.086CysCys: 0.086 ± 0.047
0.343CysAsp: 0.343 ± 0.089
0.257CysGlu: 0.257 ± 0.066
0.257CysPhe: 0.257 ± 0.067
0.536CysGly: 0.536 ± 0.113
0.214CysHis: 0.214 ± 0.058
0.536CysIle: 0.536 ± 0.118
0.686CysLys: 0.686 ± 0.125
0.472CysLeu: 0.472 ± 0.092
0.322CysMet: 0.322 ± 0.075
0.601CysAsn: 0.601 ± 0.123
0.408CysPro: 0.408 ± 0.103
0.107CysGln: 0.107 ± 0.045
0.257CysArg: 0.257 ± 0.069
0.493CysSer: 0.493 ± 0.115
0.472CysThr: 0.472 ± 0.106
0.45CysVal: 0.45 ± 0.097
0.043CysTrp: 0.043 ± 0.03
0.343CysTyr: 0.343 ± 0.081
0.0CysXaa: 0.0 ± 0.0
Asp
3.882AspAla: 3.882 ± 0.291
0.45AspCys: 0.45 ± 0.095
2.788AspAsp: 2.788 ± 0.26
4.718AspGlu: 4.718 ± 0.277
2.745AspPhe: 2.745 ± 0.246
3.903AspGly: 3.903 ± 0.333
1.029AspHis: 1.029 ± 0.152
4.568AspIle: 4.568 ± 0.346
5.105AspLys: 5.105 ± 0.351
4.761AspLeu: 4.761 ± 0.332
1.737AspMet: 1.737 ± 0.213
3.239AspAsn: 3.239 ± 0.352
1.694AspPro: 1.694 ± 0.2
1.265AspGln: 1.265 ± 0.201
2.767AspArg: 2.767 ± 0.255
3.131AspSer: 3.131 ± 0.246
3.046AspThr: 3.046 ± 0.285
4.354AspVal: 4.354 ± 0.274
0.794AspTrp: 0.794 ± 0.122
2.96AspTyr: 2.96 ± 0.244
0.0AspXaa: 0.0 ± 0.0
Glu
4.44GluAla: 4.44 ± 0.351
0.665GluCys: 0.665 ± 0.108
4.654GluAsp: 4.654 ± 0.339
8.15GluGlu: 8.15 ± 0.686
2.981GluPhe: 2.981 ± 0.264
4.997GluGly: 4.997 ± 0.378
1.523GluHis: 1.523 ± 0.204
4.847GluIle: 4.847 ± 0.372
6.885GluLys: 6.885 ± 0.493
7.378GluLeu: 7.378 ± 0.437
2.273GluMet: 2.273 ± 0.226
3.346GluAsn: 3.346 ± 0.319
2.209GluPro: 2.209 ± 0.371
3.174GluGln: 3.174 ± 0.248
3.582GluArg: 3.582 ± 0.296
3.11GluSer: 3.11 ± 0.223
3.625GluThr: 3.625 ± 0.279
5.491GluVal: 5.491 ± 0.45
0.987GluTrp: 0.987 ± 0.129
3.324GluTyr: 3.324 ± 0.306
0.0GluXaa: 0.0 ± 0.0
Phe
2.402PheAla: 2.402 ± 0.188
0.472PheCys: 0.472 ± 0.099
2.874PheAsp: 2.874 ± 0.239
2.466PheGlu: 2.466 ± 0.221
1.351PhePhe: 1.351 ± 0.177
2.424PheGly: 2.424 ± 0.284
0.879PheHis: 0.879 ± 0.148
2.209PheIle: 2.209 ± 0.213
2.96PheLys: 2.96 ± 0.264
3.367PheLeu: 3.367 ± 0.255
1.201PheMet: 1.201 ± 0.161
2.316PheAsn: 2.316 ± 0.254
1.115PhePro: 1.115 ± 0.156
1.566PheGln: 1.566 ± 0.204
1.737PheArg: 1.737 ± 0.185
3.046PheSer: 3.046 ± 0.299
2.66PheThr: 2.66 ± 0.27
2.81PheVal: 2.81 ± 0.244
0.322PheTrp: 0.322 ± 0.088
1.737PheTyr: 1.737 ± 0.207
0.0PheXaa: 0.0 ± 0.0
Gly
4.139GlyAla: 4.139 ± 0.405
0.515GlyCys: 0.515 ± 0.114
3.453GlyAsp: 3.453 ± 0.278
4.268GlyGlu: 4.268 ± 0.306
2.767GlyPhe: 2.767 ± 0.262
4.761GlyGly: 4.761 ± 0.61
1.33GlyHis: 1.33 ± 0.171
3.818GlyIle: 3.818 ± 0.318
5.426GlyLys: 5.426 ± 0.349
4.826GlyLeu: 4.826 ± 0.322
1.995GlyMet: 1.995 ± 0.235
3.646GlyAsn: 3.646 ± 0.333
0.0GlyPro: 0.0 ± 0.0
2.273GlyGln: 2.273 ± 0.215
2.66GlyArg: 2.66 ± 0.222
4.761GlySer: 4.761 ± 0.488
5.19GlyThr: 5.19 ± 0.431
4.933GlyVal: 4.933 ± 0.34
0.836GlyTrp: 0.836 ± 0.14
2.917GlyTyr: 2.917 ± 0.271
0.0GlyXaa: 0.0 ± 0.0
His
1.029HisAla: 1.029 ± 0.162
0.172HisCys: 0.172 ± 0.068
1.094HisAsp: 1.094 ± 0.163
1.115HisGlu: 1.115 ± 0.149
0.708HisPhe: 0.708 ± 0.13
0.987HisGly: 0.987 ± 0.167
0.322HisHis: 0.322 ± 0.088
1.008HisIle: 1.008 ± 0.159
1.373HisLys: 1.373 ± 0.192
1.759HisLeu: 1.759 ± 0.209
0.408HisMet: 0.408 ± 0.087
0.965HisAsn: 0.965 ± 0.142
0.901HisPro: 0.901 ± 0.128
0.45HisGln: 0.45 ± 0.082
1.051HisArg: 1.051 ± 0.175
1.029HisSer: 1.029 ± 0.192
1.115HisThr: 1.115 ± 0.136
1.523HisVal: 1.523 ± 0.164
0.236HisTrp: 0.236 ± 0.075
0.879HisTyr: 0.879 ± 0.139
0.0HisXaa: 0.0 ± 0.0
Ile
4.225IleAla: 4.225 ± 0.307
0.558IleCys: 0.558 ± 0.098
4.483IleAsp: 4.483 ± 0.395
5.169IleGlu: 5.169 ± 0.406
2.231IlePhe: 2.231 ± 0.221
4.311IleGly: 4.311 ± 0.37
1.029IleHis: 1.029 ± 0.131
3.989IleIle: 3.989 ± 0.313
4.912IleLys: 4.912 ± 0.3
4.697IleLeu: 4.697 ± 0.389
1.63IleMet: 1.63 ± 0.191
3.668IleAsn: 3.668 ± 0.252
2.788IlePro: 2.788 ± 0.252
2.381IleGln: 2.381 ± 0.182
2.681IleArg: 2.681 ± 0.218
4.483IleSer: 4.483 ± 0.284
4.375IleThr: 4.375 ± 0.329
4.225IleVal: 4.225 ± 0.321
0.515IleTrp: 0.515 ± 0.115
2.531IleTyr: 2.531 ± 0.209
0.0IleXaa: 0.0 ± 0.0
Lys
5.383LysAla: 5.383 ± 0.314
0.579LysCys: 0.579 ± 0.108
4.354LysAsp: 4.354 ± 0.293
8.193LysGlu: 8.193 ± 0.445
2.831LysPhe: 2.831 ± 0.268
5.276LysGly: 5.276 ± 0.333
1.544LysHis: 1.544 ± 0.184
4.311LysIle: 4.311 ± 0.255
6.863LysLys: 6.863 ± 0.44
6.499LysLeu: 6.499 ± 0.409
2.745LysMet: 2.745 ± 0.25
3.775LysAsn: 3.775 ± 0.263
2.874LysPro: 2.874 ± 0.258
3.71LysGln: 3.71 ± 0.339
3.753LysArg: 3.753 ± 0.296
3.882LysSer: 3.882 ± 0.358
3.882LysThr: 3.882 ± 0.265
5.855LysVal: 5.855 ± 0.357
0.879LysTrp: 0.879 ± 0.113
3.26LysTyr: 3.26 ± 0.277
0.0LysXaa: 0.0 ± 0.0
Leu
6.027LeuAla: 6.027 ± 0.364
0.45LeuCys: 0.45 ± 0.096
5.34LeuAsp: 5.34 ± 0.321
6.799LeuGlu: 6.799 ± 0.488
3.153LeuPhe: 3.153 ± 0.242
4.847LeuGly: 4.847 ± 0.338
1.351LeuHis: 1.351 ± 0.18
5.019LeuIle: 5.019 ± 0.361
6.499LeuLys: 6.499 ± 0.375
5.598LeuLeu: 5.598 ± 0.389
2.038LeuMet: 2.038 ± 0.22
3.946LeuAsn: 3.946 ± 0.302
3.26LeuPro: 3.26 ± 0.289
3.046LeuGln: 3.046 ± 0.257
3.818LeuArg: 3.818 ± 0.311
4.697LeuSer: 4.697 ± 0.337
5.04LeuThr: 5.04 ± 0.308
5.362LeuVal: 5.362 ± 0.369
0.901LeuTrp: 0.901 ± 0.15
3.024LeuTyr: 3.024 ± 0.226
0.0LeuXaa: 0.0 ± 0.0
Met
2.231MetAla: 2.231 ± 0.21
0.193MetCys: 0.193 ± 0.06
1.48MetAsp: 1.48 ± 0.191
2.273MetGlu: 2.273 ± 0.231
1.115MetPhe: 1.115 ± 0.143
1.244MetGly: 1.244 ± 0.162
0.515MetHis: 0.515 ± 0.112
1.823MetIle: 1.823 ± 0.176
2.681MetLys: 2.681 ± 0.23
2.188MetLeu: 2.188 ± 0.238
0.536MetMet: 0.536 ± 0.102
1.651MetAsn: 1.651 ± 0.201
0.708MetPro: 0.708 ± 0.123
1.029MetGln: 1.029 ± 0.138
1.501MetArg: 1.501 ± 0.202
1.673MetSer: 1.673 ± 0.228
1.737MetThr: 1.737 ± 0.176
1.587MetVal: 1.587 ± 0.184
0.279MetTrp: 0.279 ± 0.087
1.244MetTyr: 1.244 ± 0.171
0.0MetXaa: 0.0 ± 0.0
Asn
3.367AsnAla: 3.367 ± 0.303
0.408AsnCys: 0.408 ± 0.102
2.123AsnAsp: 2.123 ± 0.179
3.753AsnGlu: 3.753 ± 0.261
1.737AsnPhe: 1.737 ± 0.19
4.654AsnGly: 4.654 ± 0.391
1.029AsnHis: 1.029 ± 0.14
3.71AsnIle: 3.71 ± 0.307
4.011AsnLys: 4.011 ± 0.273
3.882AsnLeu: 3.882 ± 0.281
1.952AsnMet: 1.952 ± 0.222
3.153AsnAsn: 3.153 ± 0.337
2.788AsnPro: 2.788 ± 0.324
2.08AsnGln: 2.08 ± 0.201
2.488AsnArg: 2.488 ± 0.238
2.638AsnSer: 2.638 ± 0.255
3.367AsnThr: 3.367 ± 0.293
3.389AsnVal: 3.389 ± 0.245
0.665AsnTrp: 0.665 ± 0.118
2.273AsnTyr: 2.273 ± 0.208
0.0AsnXaa: 0.0 ± 0.0
Pro
2.252ProAla: 2.252 ± 0.301
0.236ProCys: 0.236 ± 0.089
1.887ProAsp: 1.887 ± 0.242
2.445ProGlu: 2.445 ± 0.291
1.33ProPhe: 1.33 ± 0.168
1.909ProGly: 1.909 ± 0.263
0.815ProHis: 0.815 ± 0.127
2.252ProIle: 2.252 ± 0.232
2.767ProLys: 2.767 ± 0.251
2.445ProLeu: 2.445 ± 0.228
0.815ProMet: 0.815 ± 0.16
2.252ProAsn: 2.252 ± 0.27
1.072ProPro: 1.072 ± 0.166
1.523ProGln: 1.523 ± 0.243
1.287ProArg: 1.287 ± 0.156
1.887ProSer: 1.887 ± 0.228
2.724ProThr: 2.724 ± 0.278
2.831ProVal: 2.831 ± 0.318
0.236ProTrp: 0.236 ± 0.085
1.609ProTyr: 1.609 ± 0.264
0.0ProXaa: 0.0 ± 0.0
Gln
3.003GlnAla: 3.003 ± 0.313
0.279GlnCys: 0.279 ± 0.069
2.059GlnAsp: 2.059 ± 0.259
3.603GlnGlu: 3.603 ± 0.269
1.33GlnPhe: 1.33 ± 0.145
2.316GlnGly: 2.316 ± 0.245
0.558GlnHis: 0.558 ± 0.119
2.402GlnIle: 2.402 ± 0.196
2.81GlnLys: 2.81 ± 0.243
3.046GlnLeu: 3.046 ± 0.275
1.308GlnMet: 1.308 ± 0.2
1.651GlnAsn: 1.651 ± 0.157
1.437GlnPro: 1.437 ± 0.261
2.402GlnGln: 2.402 ± 0.248
1.501GlnArg: 1.501 ± 0.217
2.188GlnSer: 2.188 ± 0.26
2.038GlnThr: 2.038 ± 0.241
2.488GlnVal: 2.488 ± 0.223
0.472GlnTrp: 0.472 ± 0.084
1.33GlnTyr: 1.33 ± 0.18
0.0GlnXaa: 0.0 ± 0.0
Arg
2.531ArgAla: 2.531 ± 0.288
0.236ArgCys: 0.236 ± 0.066
2.466ArgAsp: 2.466 ± 0.235
3.26ArgGlu: 3.26 ± 0.233
2.145ArgPhe: 2.145 ± 0.201
2.724ArgGly: 2.724 ± 0.251
0.601ArgHis: 0.601 ± 0.107
2.895ArgIle: 2.895 ± 0.249
3.775ArgLys: 3.775 ± 0.267
3.946ArgLeu: 3.946 ± 0.304
1.587ArgMet: 1.587 ± 0.185
2.574ArgAsn: 2.574 ± 0.255
1.351ArgPro: 1.351 ± 0.191
1.587ArgGln: 1.587 ± 0.172
2.059ArgArg: 2.059 ± 0.21
2.381ArgSer: 2.381 ± 0.245
2.381ArgThr: 2.381 ± 0.234
3.131ArgVal: 3.131 ± 0.256
0.45ArgTrp: 0.45 ± 0.111
2.209ArgTyr: 2.209 ± 0.199
0.0ArgXaa: 0.0 ± 0.0
Ser
3.475SerAla: 3.475 ± 0.299
0.257SerCys: 0.257 ± 0.075
3.625SerAsp: 3.625 ± 0.298
3.11SerGlu: 3.11 ± 0.3
2.853SerPhe: 2.853 ± 0.313
4.418SerGly: 4.418 ± 0.454
0.901SerHis: 0.901 ± 0.121
4.247SerIle: 4.247 ± 0.318
4.697SerLys: 4.697 ± 0.322
4.74SerLeu: 4.74 ± 0.285
1.373SerMet: 1.373 ± 0.192
2.853SerAsn: 2.853 ± 0.327
1.887SerPro: 1.887 ± 0.183
1.823SerGln: 1.823 ± 0.182
2.102SerArg: 2.102 ± 0.205
4.011SerSer: 4.011 ± 0.762
3.668SerThr: 3.668 ± 0.39
3.903SerVal: 3.903 ± 0.26
0.879SerTrp: 0.879 ± 0.154
2.381SerTyr: 2.381 ± 0.2
0.0SerXaa: 0.0 ± 0.0
Thr
3.796ThrAla: 3.796 ± 0.471
0.45ThrCys: 0.45 ± 0.088
3.71ThrAsp: 3.71 ± 0.301
4.547ThrGlu: 4.547 ± 0.268
2.96ThrPhe: 2.96 ± 0.286
4.504ThrGly: 4.504 ± 0.35
0.965ThrHis: 0.965 ± 0.146
4.354ThrIle: 4.354 ± 0.336
4.225ThrLys: 4.225 ± 0.297
5.083ThrLeu: 5.083 ± 0.335
0.944ThrMet: 0.944 ± 0.14
3.067ThrAsn: 3.067 ± 0.282
3.046ThrPro: 3.046 ± 0.322
1.909ThrGln: 1.909 ± 0.213
2.316ThrArg: 2.316 ± 0.258
3.282ThrSer: 3.282 ± 0.312
3.646ThrThr: 3.646 ± 0.375
5.298ThrVal: 5.298 ± 0.364
0.622ThrTrp: 0.622 ± 0.123
3.067ThrTyr: 3.067 ± 0.285
0.0ThrXaa: 0.0 ± 0.0
Val
4.89ValAla: 4.89 ± 0.378
0.386ValCys: 0.386 ± 0.081
4.311ValAsp: 4.311 ± 0.276
5.169ValGlu: 5.169 ± 0.372
3.088ValPhe: 3.088 ± 0.298
3.796ValGly: 3.796 ± 0.265
1.265ValHis: 1.265 ± 0.183
4.697ValIle: 4.697 ± 0.313
5.405ValLys: 5.405 ± 0.357
5.04ValLeu: 5.04 ± 0.359
1.694ValMet: 1.694 ± 0.161
3.903ValAsn: 3.903 ± 0.346
3.067ValPro: 3.067 ± 0.342
2.96ValGln: 2.96 ± 0.219
3.174ValArg: 3.174 ± 0.305
3.903ValSer: 3.903 ± 0.303
4.761ValThr: 4.761 ± 0.337
4.783ValVal: 4.783 ± 0.36
0.665ValTrp: 0.665 ± 0.112
3.11ValTyr: 3.11 ± 0.254
0.0ValXaa: 0.0 ± 0.0
Trp
0.815TrpAla: 0.815 ± 0.156
0.214TrpCys: 0.214 ± 0.067
0.901TrpAsp: 0.901 ± 0.162
0.965TrpGlu: 0.965 ± 0.151
0.493TrpPhe: 0.493 ± 0.111
0.515TrpGly: 0.515 ± 0.107
0.214TrpHis: 0.214 ± 0.059
0.686TrpIle: 0.686 ± 0.104
0.751TrpLys: 0.751 ± 0.131
1.008TrpLeu: 1.008 ± 0.147
0.107TrpMet: 0.107 ± 0.048
0.815TrpAsn: 0.815 ± 0.156
0.0TrpPro: 0.0 ± 0.0
0.472TrpGln: 0.472 ± 0.109
0.386TrpArg: 0.386 ± 0.082
0.815TrpSer: 0.815 ± 0.131
0.729TrpThr: 0.729 ± 0.115
0.815TrpVal: 0.815 ± 0.147
0.214TrpTrp: 0.214 ± 0.059
0.686TrpTyr: 0.686 ± 0.116
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.81TyrAla: 2.81 ± 0.257
0.322TyrCys: 0.322 ± 0.078
3.088TyrAsp: 3.088 ± 0.261
2.488TyrGlu: 2.488 ± 0.231
1.48TyrPhe: 1.48 ± 0.184
2.531TyrGly: 2.531 ± 0.274
0.901TyrHis: 0.901 ± 0.159
3.003TyrIle: 3.003 ± 0.256
3.153TyrLys: 3.153 ± 0.256
3.882TyrLeu: 3.882 ± 0.266
0.987TyrMet: 0.987 ± 0.144
2.938TyrAsn: 2.938 ± 0.238
1.437TyrPro: 1.437 ± 0.182
1.673TyrGln: 1.673 ± 0.196
2.038TyrArg: 2.038 ± 0.219
2.381TyrSer: 2.381 ± 0.248
2.96TyrThr: 2.96 ± 0.228
2.853TyrVal: 2.853 ± 0.266
0.493TyrTrp: 0.493 ± 0.1
1.673TyrTyr: 1.673 ± 0.18
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 227 proteins (46626 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski