Amino acid dipepetide frequency for Bacillus phage Bp8p-C

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.16AlaAla: 6.16 ± 0.623
0.422AlaCys: 0.422 ± 0.111
4.52AlaAsp: 4.52 ± 0.288
5.27AlaGlu: 5.27 ± 0.442
2.553AlaPhe: 2.553 ± 0.204
4.075AlaGly: 4.075 ± 0.417
1.007AlaHis: 1.007 ± 0.136
4.403AlaIle: 4.403 ± 0.311
4.028AlaLys: 4.028 ± 0.377
5.48AlaLeu: 5.48 ± 0.337
1.616AlaMet: 1.616 ± 0.184
3.021AlaAsn: 3.021 ± 0.258
2.131AlaPro: 2.131 ± 0.534
2.646AlaGln: 2.646 ± 0.304
2.67AlaArg: 2.67 ± 0.241
3.7AlaSer: 3.7 ± 0.332
3.935AlaThr: 3.935 ± 0.478
5.293AlaVal: 5.293 ± 0.393
0.632AlaTrp: 0.632 ± 0.118
2.646AlaTyr: 2.646 ± 0.241
0.0AlaXaa: 0.0 ± 0.0
Cys
0.304CysAla: 0.304 ± 0.087
0.117CysCys: 0.117 ± 0.046
0.422CysAsp: 0.422 ± 0.107
0.515CysGlu: 0.515 ± 0.119
0.445CysPhe: 0.445 ± 0.099
0.351CysGly: 0.351 ± 0.093
0.258CysHis: 0.258 ± 0.075
0.258CysIle: 0.258 ± 0.075
0.703CysLys: 0.703 ± 0.131
0.632CysLeu: 0.632 ± 0.114
0.281CysMet: 0.281 ± 0.085
0.234CysAsn: 0.234 ± 0.066
0.375CysPro: 0.375 ± 0.093
0.094CysGln: 0.094 ± 0.047
0.304CysArg: 0.304 ± 0.086
0.562CysSer: 0.562 ± 0.113
0.375CysThr: 0.375 ± 0.1
0.422CysVal: 0.422 ± 0.094
0.094CysTrp: 0.094 ± 0.054
0.375CysTyr: 0.375 ± 0.092
0.0CysXaa: 0.0 ± 0.0
Asp
4.028AspAla: 4.028 ± 0.356
0.773AspCys: 0.773 ± 0.13
4.052AspAsp: 4.052 ± 0.326
4.895AspGlu: 4.895 ± 0.376
2.951AspPhe: 2.951 ± 0.278
3.771AspGly: 3.771 ± 0.326
1.265AspHis: 1.265 ± 0.189
4.731AspIle: 4.731 ± 0.264
4.544AspLys: 4.544 ± 0.386
5.223AspLeu: 5.223 ± 0.326
2.131AspMet: 2.131 ± 0.234
3.349AspAsn: 3.349 ± 0.278
1.897AspPro: 1.897 ± 0.222
2.202AspGln: 2.202 ± 0.219
2.928AspArg: 2.928 ± 0.276
3.888AspSer: 3.888 ± 0.305
3.607AspThr: 3.607 ± 0.306
5.035AspVal: 5.035 ± 0.292
0.726AspTrp: 0.726 ± 0.124
3.373AspTyr: 3.373 ± 0.282
0.0AspXaa: 0.0 ± 0.0
Glu
5.293GluAla: 5.293 ± 0.39
0.562GluCys: 0.562 ± 0.101
5.551GluAsp: 5.551 ± 0.47
9.509GluGlu: 9.509 ± 0.871
3.373GluPhe: 3.373 ± 0.342
4.871GluGly: 4.871 ± 0.364
1.593GluHis: 1.593 ± 0.197
5.082GluIle: 5.082 ± 0.391
5.761GluLys: 5.761 ± 0.483
8.174GluLeu: 8.174 ± 0.551
2.248GluMet: 2.248 ± 0.281
3.935GluAsn: 3.935 ± 0.248
1.663GluPro: 1.663 ± 0.205
2.998GluGln: 2.998 ± 0.315
3.56GluArg: 3.56 ± 0.314
4.497GluSer: 4.497 ± 0.363
3.607GluThr: 3.607 ± 0.323
5.972GluVal: 5.972 ± 0.448
1.077GluTrp: 1.077 ± 0.173
3.255GluTyr: 3.255 ± 0.246
0.0GluXaa: 0.0 ± 0.0
Phe
2.084PheAla: 2.084 ± 0.249
0.281PheCys: 0.281 ± 0.077
2.787PheAsp: 2.787 ± 0.231
3.021PheGlu: 3.021 ± 0.25
1.733PhePhe: 1.733 ± 0.242
2.155PheGly: 2.155 ± 0.232
1.03PheHis: 1.03 ± 0.164
2.764PheIle: 2.764 ± 0.282
2.928PheLys: 2.928 ± 0.282
3.021PheLeu: 3.021 ± 0.326
1.054PheMet: 1.054 ± 0.138
2.81PheAsn: 2.81 ± 0.282
1.288PhePro: 1.288 ± 0.175
1.335PheGln: 1.335 ± 0.206
1.686PheArg: 1.686 ± 0.246
3.162PheSer: 3.162 ± 0.278
2.717PheThr: 2.717 ± 0.213
2.74PheVal: 2.74 ± 0.301
0.258PheTrp: 0.258 ± 0.087
1.733PheTyr: 1.733 ± 0.226
0.0PheXaa: 0.0 ± 0.0
Gly
4.028GlyAla: 4.028 ± 0.489
0.539GlyCys: 0.539 ± 0.106
3.466GlyAsp: 3.466 ± 0.301
4.239GlyGlu: 4.239 ± 0.269
2.646GlyPhe: 2.646 ± 0.244
4.871GlyGly: 4.871 ± 0.621
1.218GlyHis: 1.218 ± 0.154
4.052GlyIle: 4.052 ± 0.407
4.661GlyLys: 4.661 ± 0.405
4.145GlyLeu: 4.145 ± 0.291
1.71GlyMet: 1.71 ± 0.206
3.255GlyAsn: 3.255 ± 0.197
0.281GlyPro: 0.281 ± 0.082
2.061GlyGln: 2.061 ± 0.247
2.74GlyArg: 2.74 ± 0.233
3.771GlySer: 3.771 ± 0.379
3.677GlyThr: 3.677 ± 0.344
4.59GlyVal: 4.59 ± 0.36
0.749GlyTrp: 0.749 ± 0.158
2.693GlyTyr: 2.693 ± 0.249
0.0GlyXaa: 0.0 ± 0.0
His
1.194HisAla: 1.194 ± 0.176
0.117HisCys: 0.117 ± 0.056
1.312HisAsp: 1.312 ± 0.166
1.358HisGlu: 1.358 ± 0.169
0.913HisPhe: 0.913 ± 0.171
1.124HisGly: 1.124 ± 0.157
0.515HisHis: 0.515 ± 0.106
1.639HisIle: 1.639 ± 0.195
1.475HisLys: 1.475 ± 0.195
1.71HisLeu: 1.71 ± 0.236
0.375HisMet: 0.375 ± 0.095
0.984HisAsn: 0.984 ± 0.16
0.586HisPro: 0.586 ± 0.106
0.328HisGln: 0.328 ± 0.082
0.89HisArg: 0.89 ± 0.125
1.335HisSer: 1.335 ± 0.178
1.101HisThr: 1.101 ± 0.174
1.335HisVal: 1.335 ± 0.174
0.281HisTrp: 0.281 ± 0.087
1.288HisTyr: 1.288 ± 0.183
0.0HisXaa: 0.0 ± 0.0
Ile
4.52IleAla: 4.52 ± 0.301
0.562IleCys: 0.562 ± 0.124
4.989IleAsp: 4.989 ± 0.337
5.152IleGlu: 5.152 ± 0.417
2.155IlePhe: 2.155 ± 0.208
3.583IleGly: 3.583 ± 0.296
1.124IleHis: 1.124 ± 0.144
3.536IleIle: 3.536 ± 0.301
4.473IleLys: 4.473 ± 0.31
4.801IleLeu: 4.801 ± 0.363
1.358IleMet: 1.358 ± 0.171
3.209IleAsn: 3.209 ± 0.234
2.061IlePro: 2.061 ± 0.186
2.576IleGln: 2.576 ± 0.236
3.232IleArg: 3.232 ± 0.265
4.239IleSer: 4.239 ± 0.302
4.403IleThr: 4.403 ± 0.417
4.309IleVal: 4.309 ± 0.265
0.328IleTrp: 0.328 ± 0.071
2.295IleTyr: 2.295 ± 0.219
0.0IleXaa: 0.0 ± 0.0
Lys
5.316LysAla: 5.316 ± 0.442
0.351LysCys: 0.351 ± 0.124
5.059LysAsp: 5.059 ± 0.317
7.565LysGlu: 7.565 ± 0.633
2.951LysPhe: 2.951 ± 0.295
3.841LysGly: 3.841 ± 0.318
1.358LysHis: 1.358 ± 0.196
4.216LysIle: 4.216 ± 0.316
6.042LysLys: 6.042 ± 0.541
5.434LysLeu: 5.434 ± 0.342
2.038LysMet: 2.038 ± 0.237
3.326LysAsn: 3.326 ± 0.29
2.202LysPro: 2.202 ± 0.244
2.998LysGln: 2.998 ± 0.284
3.841LysArg: 3.841 ± 0.378
4.262LysSer: 4.262 ± 0.395
3.443LysThr: 3.443 ± 0.287
4.59LysVal: 4.59 ± 0.33
0.656LysTrp: 0.656 ± 0.122
2.787LysTyr: 2.787 ± 0.257
0.0LysXaa: 0.0 ± 0.0
Leu
5.691LeuAla: 5.691 ± 0.311
0.445LeuCys: 0.445 ± 0.129
5.363LeuAsp: 5.363 ± 0.306
6.932LeuGlu: 6.932 ± 0.557
3.021LeuPhe: 3.021 ± 0.271
4.38LeuGly: 4.38 ± 0.319
2.038LeuHis: 2.038 ± 0.219
4.38LeuIle: 4.38 ± 0.341
6.23LeuLys: 6.23 ± 0.374
6.792LeuLeu: 6.792 ± 0.462
2.225LeuMet: 2.225 ± 0.193
4.684LeuAsn: 4.684 ± 0.412
3.209LeuPro: 3.209 ± 0.247
3.56LeuGln: 3.56 ± 0.286
4.145LeuArg: 4.145 ± 0.319
5.715LeuSer: 5.715 ± 0.309
5.738LeuThr: 5.738 ± 0.42
5.199LeuVal: 5.199 ± 0.343
0.586LeuTrp: 0.586 ± 0.115
3.091LeuTyr: 3.091 ± 0.264
0.0LeuXaa: 0.0 ± 0.0
Met
1.335MetAla: 1.335 ± 0.187
0.117MetCys: 0.117 ± 0.05
1.663MetAsp: 1.663 ± 0.234
1.967MetGlu: 1.967 ± 0.175
0.796MetPhe: 0.796 ± 0.149
1.241MetGly: 1.241 ± 0.176
0.468MetHis: 0.468 ± 0.104
1.663MetIle: 1.663 ± 0.185
2.131MetLys: 2.131 ± 0.247
2.084MetLeu: 2.084 ± 0.227
0.773MetMet: 0.773 ± 0.13
1.71MetAsn: 1.71 ± 0.16
1.03MetPro: 1.03 ± 0.155
1.148MetGln: 1.148 ± 0.182
1.171MetArg: 1.171 ± 0.176
1.85MetSer: 1.85 ± 0.243
1.803MetThr: 1.803 ± 0.165
1.335MetVal: 1.335 ± 0.173
0.375MetTrp: 0.375 ± 0.086
1.218MetTyr: 1.218 ± 0.205
0.0MetXaa: 0.0 ± 0.0
Asn
2.74AsnAla: 2.74 ± 0.247
0.258AsnCys: 0.258 ± 0.069
3.091AsnAsp: 3.091 ± 0.27
3.56AsnGlu: 3.56 ± 0.346
2.038AsnPhe: 2.038 ± 0.216
3.935AsnGly: 3.935 ± 0.404
1.007AsnHis: 1.007 ± 0.135
3.419AsnIle: 3.419 ± 0.324
3.607AsnLys: 3.607 ± 0.288
4.45AsnLeu: 4.45 ± 0.33
1.265AsnMet: 1.265 ± 0.194
3.091AsnAsn: 3.091 ± 0.269
2.6AsnPro: 2.6 ± 0.241
1.874AsnGln: 1.874 ± 0.24
2.717AsnArg: 2.717 ± 0.263
3.583AsnSer: 3.583 ± 0.324
2.928AsnThr: 2.928 ± 0.242
3.255AsnVal: 3.255 ± 0.314
0.539AsnTrp: 0.539 ± 0.111
2.108AsnTyr: 2.108 ± 0.212
0.0AsnXaa: 0.0 ± 0.0
Pro
2.857ProAla: 2.857 ± 0.585
0.117ProCys: 0.117 ± 0.05
1.803ProAsp: 1.803 ± 0.217
2.295ProGlu: 2.295 ± 0.254
1.499ProPhe: 1.499 ± 0.186
0.96ProGly: 0.96 ± 0.149
0.796ProHis: 0.796 ± 0.14
1.944ProIle: 1.944 ± 0.172
1.733ProLys: 1.733 ± 0.167
2.928ProLeu: 2.928 ± 0.284
0.703ProMet: 0.703 ± 0.161
1.757ProAsn: 1.757 ± 0.204
1.124ProPro: 1.124 ± 0.269
1.171ProGln: 1.171 ± 0.17
1.054ProArg: 1.054 ± 0.157
2.108ProSer: 2.108 ± 0.26
2.248ProThr: 2.248 ± 0.252
1.757ProVal: 1.757 ± 0.215
0.117ProTrp: 0.117 ± 0.05
1.686ProTyr: 1.686 ± 0.202
0.0ProXaa: 0.0 ± 0.0
Gln
2.412GlnAla: 2.412 ± 0.219
0.187GlnCys: 0.187 ± 0.06
2.225GlnAsp: 2.225 ± 0.243
3.724GlnGlu: 3.724 ± 0.282
1.382GlnPhe: 1.382 ± 0.157
2.155GlnGly: 2.155 ± 0.262
0.586GlnHis: 0.586 ± 0.111
2.202GlnIle: 2.202 ± 0.265
2.483GlnLys: 2.483 ± 0.282
3.981GlnLeu: 3.981 ± 0.274
1.148GlnMet: 1.148 ± 0.153
1.733GlnAsn: 1.733 ± 0.188
1.124GlnPro: 1.124 ± 0.216
2.014GlnGln: 2.014 ± 0.266
1.639GlnArg: 1.639 ± 0.189
2.459GlnSer: 2.459 ± 0.266
1.874GlnThr: 1.874 ± 0.236
2.459GlnVal: 2.459 ± 0.296
0.492GlnTrp: 0.492 ± 0.097
1.499GlnTyr: 1.499 ± 0.175
0.0GlnXaa: 0.0 ± 0.0
Arg
2.74ArgAla: 2.74 ± 0.241
0.328ArgCys: 0.328 ± 0.088
3.162ArgAsp: 3.162 ± 0.253
3.911ArgGlu: 3.911 ± 0.315
1.827ArgPhe: 1.827 ± 0.182
2.67ArgGly: 2.67 ± 0.271
1.007ArgHis: 1.007 ± 0.141
3.326ArgIle: 3.326 ± 0.287
3.864ArgLys: 3.864 ± 0.324
4.216ArgLeu: 4.216 ± 0.348
1.382ArgMet: 1.382 ± 0.175
2.225ArgAsn: 2.225 ± 0.227
0.984ArgPro: 0.984 ± 0.132
1.827ArgGln: 1.827 ± 0.21
2.225ArgArg: 2.225 ± 0.252
2.529ArgSer: 2.529 ± 0.275
2.529ArgThr: 2.529 ± 0.236
3.63ArgVal: 3.63 ± 0.289
0.281ArgTrp: 0.281 ± 0.078
2.014ArgTyr: 2.014 ± 0.216
0.0ArgXaa: 0.0 ± 0.0
Ser
4.192SerAla: 4.192 ± 0.358
0.539SerCys: 0.539 ± 0.109
3.747SerAsp: 3.747 ± 0.375
4.216SerGlu: 4.216 ± 0.287
3.115SerPhe: 3.115 ± 0.306
4.216SerGly: 4.216 ± 0.392
0.913SerHis: 0.913 ± 0.142
4.169SerIle: 4.169 ± 0.325
4.684SerLys: 4.684 ± 0.291
5.878SerLeu: 5.878 ± 0.333
1.967SerMet: 1.967 ± 0.24
3.443SerAsn: 3.443 ± 0.279
1.944SerPro: 1.944 ± 0.204
2.389SerGln: 2.389 ± 0.259
3.513SerArg: 3.513 ± 0.292
5.082SerSer: 5.082 ± 0.422
4.099SerThr: 4.099 ± 0.362
3.888SerVal: 3.888 ± 0.297
0.703SerTrp: 0.703 ± 0.116
2.6SerTyr: 2.6 ± 0.23
0.0SerXaa: 0.0 ± 0.0
Thr
3.935ThrAla: 3.935 ± 0.32
0.492ThrCys: 0.492 ± 0.123
3.911ThrAsp: 3.911 ± 0.296
4.731ThrGlu: 4.731 ± 0.319
2.389ThrPhe: 2.389 ± 0.257
4.028ThrGly: 4.028 ± 0.42
1.03ThrHis: 1.03 ± 0.173
3.536ThrIle: 3.536 ± 0.364
4.099ThrLys: 4.099 ± 0.312
4.333ThrLeu: 4.333 ± 0.291
1.007ThrMet: 1.007 ± 0.136
2.881ThrAsn: 2.881 ± 0.307
2.272ThrPro: 2.272 ± 0.272
2.342ThrGln: 2.342 ± 0.284
2.693ThrArg: 2.693 ± 0.236
3.841ThrSer: 3.841 ± 0.355
3.56ThrThr: 3.56 ± 0.343
5.504ThrVal: 5.504 ± 0.43
0.398ThrTrp: 0.398 ± 0.093
2.506ThrTyr: 2.506 ± 0.275
0.0ThrXaa: 0.0 ± 0.0
Val
4.052ValAla: 4.052 ± 0.28
0.492ValCys: 0.492 ± 0.102
4.707ValAsp: 4.707 ± 0.345
5.41ValGlu: 5.41 ± 0.421
3.115ValPhe: 3.115 ± 0.323
3.7ValGly: 3.7 ± 0.236
1.569ValHis: 1.569 ± 0.175
4.216ValIle: 4.216 ± 0.279
5.316ValLys: 5.316 ± 0.348
5.246ValLeu: 5.246 ± 0.333
1.405ValMet: 1.405 ± 0.199
3.326ValAsn: 3.326 ± 0.308
2.646ValPro: 2.646 ± 0.28
2.459ValGln: 2.459 ± 0.195
3.349ValArg: 3.349 ± 0.357
5.223ValSer: 5.223 ± 0.417
4.684ValThr: 4.684 ± 0.438
5.106ValVal: 5.106 ± 0.409
0.656ValTrp: 0.656 ± 0.124
3.419ValTyr: 3.419 ± 0.269
0.0ValXaa: 0.0 ± 0.0
Trp
0.632TrpAla: 0.632 ± 0.11
0.047TrpCys: 0.047 ± 0.037
0.679TrpAsp: 0.679 ± 0.125
0.937TrpGlu: 0.937 ± 0.139
0.351TrpPhe: 0.351 ± 0.077
0.609TrpGly: 0.609 ± 0.116
0.281TrpHis: 0.281 ± 0.079
0.562TrpIle: 0.562 ± 0.116
0.867TrpLys: 0.867 ± 0.12
0.82TrpLeu: 0.82 ± 0.12
0.187TrpMet: 0.187 ± 0.061
0.679TrpAsn: 0.679 ± 0.116
0.0TrpPro: 0.0 ± 0.0
0.234TrpGln: 0.234 ± 0.07
0.211TrpArg: 0.211 ± 0.071
0.562TrpSer: 0.562 ± 0.141
0.468TrpThr: 0.468 ± 0.095
0.749TrpVal: 0.749 ± 0.133
0.234TrpTrp: 0.234 ± 0.089
0.515TrpTyr: 0.515 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.693TyrAla: 2.693 ± 0.238
0.375TyrCys: 0.375 ± 0.086
2.857TyrAsp: 2.857 ± 0.258
3.373TyrGlu: 3.373 ± 0.221
1.335TyrPhe: 1.335 ± 0.195
2.857TyrGly: 2.857 ± 0.278
0.796TyrHis: 0.796 ± 0.136
2.81TyrIle: 2.81 ± 0.243
2.67TyrLys: 2.67 ± 0.261
4.075TyrLeu: 4.075 ± 0.28
0.984TyrMet: 0.984 ± 0.143
2.412TyrAsn: 2.412 ± 0.251
1.241TyrPro: 1.241 ± 0.178
1.522TyrGln: 1.522 ± 0.144
2.108TyrArg: 2.108 ± 0.196
2.928TyrSer: 2.928 ± 0.24
2.693TyrThr: 2.693 ± 0.229
2.928TyrVal: 2.928 ± 0.258
0.445TyrTrp: 0.445 ± 0.093
1.616TyrTyr: 1.616 ± 0.233
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 211 proteins (42699 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski