Amino acid dipeptide frequency for Bacillus phage B5S

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
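As an illustration, dipeptide frequencies could be computed along the following lines. This is a minimal sketch, not the database's own code: the function name `dipeptide_per_mille` is hypothetical, and it assumes that overlapping pairs are counted within each protein (never across protein boundaries), that any non-standard residue is mapped to X (Xaa), and that counts are normalized by the total number of dipeptides. The exact convention used for the table on this page is not stated here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs; a pair never spans two different proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences; "U" is non-standard and is therefore counted as X.
freqs = dipeptide_per_mille(["MKKLA", "MKUA"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The toy input yields seven dipeptides in total, two of which are MK, so MK gets a frequency of 2/7 expressed in per mille.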

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
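The arithmetic behind the expected value can be sketched as follows. The function names are hypothetical, and treating a value exactly equal to the expectation as "as expected" is an assumption; the page does not say how such ties are coloured.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, alphabet_size=20):
    """Compare an observed frequency (per mille) with the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"    # would be marked in red
    if value_per_mille < expected:
        return "under"   # would be marked in blue
    return "as expected"  # assumption: ties get no marking


print(expected_per_mille())    # 20 x 20 = 400 dipeptides
print(expected_per_mille(21))  # 21 x 21 = 441 dipeptides when Xaa is included
print(classify(9.343))         # GluGlu value from the table
print(classify(0.205))         # CysCys value from the table
```

With 20 amino acids the expectation is 1000/400 = 2.5‰, so GluGlu (9.343‰) is overrepresented and CysCys (0.205‰) is underrepresented.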

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
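For readers who want the values in other units, the conversion is a simple scaling. The helper names below are hypothetical and only illustrate the unit change.

```python
import math


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 3.114  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
print(math.isclose(per_mille_to_fraction(ala_ala), 0.003114))
```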

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide and columns the second; each cell is frequency ± error in ‰.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.114 ± 0.504 | 0.471 ± 0.108 | 3.668 ± 0.262 | 3.975 ± 0.303 | 2.438 ± 0.219 | 3.524 ± 0.38 | 1.147 ± 0.153 | 3.811 ± 0.297 | 5.163 ± 0.381 | 5.409 ± 0.42 | 2.11 ± 0.218 | 3.012 ± 0.307 | 2.479 ± 0.327 | 2.151 ± 0.221 | 2.561 ± 0.229 | 3.401 ± 0.312 | 3.893 ± 0.371 | 3.852 ± 0.278 | 0.717 ± 0.125 | 2.725 ± 0.229 | 0.0 ± 0.0
Cys | 0.389 ± 0.089 | 0.205 ± 0.066 | 0.615 ± 0.108 | 0.738 ± 0.135 | 0.205 ± 0.055 | 0.676 ± 0.147 | 0.143 ± 0.054 | 0.553 ± 0.11 | 0.676 ± 0.124 | 0.553 ± 0.107 | 0.205 ± 0.068 | 0.41 ± 0.09 | 0.369 ± 0.089 | 0.205 ± 0.071 | 0.348 ± 0.085 | 0.287 ± 0.081 | 0.369 ± 0.077 | 0.492 ± 0.105 | 0.164 ± 0.052 | 0.492 ± 0.108 | 0.0 ± 0.0
Asp | 3.647 ± 0.255 | 0.574 ± 0.13 | 3.893 ± 0.296 | 5.512 ± 0.379 | 3.217 ± 0.273 | 4.2 ± 0.384 | 0.635 ± 0.131 | 5.778 ± 0.323 | 5.717 ± 0.342 | 5.245 ± 0.313 | 1.926 ± 0.235 | 3.196 ± 0.271 | 1.516 ± 0.19 | 0.943 ± 0.138 | 2.725 ± 0.248 | 2.93 ± 0.223 | 3.401 ± 0.266 | 4.344 ± 0.32 | 0.902 ± 0.134 | 3.606 ± 0.297 | 0.0 ± 0.0
Glu | 4.938 ± 0.324 | 0.471 ± 0.089 | 5.573 ± 0.429 | 9.343 ± 0.941 | 3.299 ± 0.228 | 5.122 ± 0.295 | 1.598 ± 0.224 | 5.204 ± 0.359 | 6.639 ± 0.458 | 8.831 ± 0.568 | 2.643 ± 0.254 | 4.159 ± 0.298 | 1.537 ± 0.205 | 3.032 ± 0.245 | 3.36 ± 0.318 | 3.811 ± 0.33 | 3.852 ± 0.261 | 6.29 ± 0.441 | 1.27 ± 0.197 | 3.975 ± 0.284 | 0.0 ± 0.0
Phe | 2.274 ± 0.223 | 0.533 ± 0.094 | 2.971 ± 0.248 | 2.93 ± 0.24 | 1.106 ± 0.211 | 2.028 ± 0.229 | 0.82 ± 0.124 | 2.766 ± 0.228 | 3.196 ± 0.287 | 3.196 ± 0.262 | 1.045 ± 0.137 | 2.377 ± 0.225 | 1.27 ± 0.177 | 1.147 ± 0.165 | 1.455 ± 0.17 | 2.664 ± 0.251 | 2.889 ± 0.241 | 2.787 ± 0.24 | 0.451 ± 0.11 | 1.865 ± 0.221 | 0.0 ± 0.0
Gly | 3.688 ± 0.351 | 0.635 ± 0.127 | 4.057 ± 0.314 | 5.389 ± 0.386 | 2.91 ± 0.214 | 5.758 ± 0.734 | 1.332 ± 0.167 | 4.385 ± 0.369 | 5.368 ± 0.344 | 4.61 ± 0.338 | 1.824 ± 0.206 | 3.319 ± 0.308 | 0.0 ± 0.0 | 2.172 ± 0.318 | 2.438 ± 0.213 | 3.811 ± 0.318 | 4.139 ± 0.454 | 4.549 ± 0.35 | 1.127 ± 0.164 | 3.34 ± 0.305 | 0.0 ± 0.0
His | 0.984 ± 0.164 | 0.164 ± 0.061 | 1.086 ± 0.124 | 1.701 ± 0.209 | 0.799 ± 0.154 | 0.799 ± 0.13 | 0.369 ± 0.095 | 1.291 ± 0.172 | 1.393 ± 0.191 | 1.619 ± 0.18 | 0.41 ± 0.093 | 1.127 ± 0.151 | 0.635 ± 0.12 | 0.512 ± 0.099 | 0.738 ± 0.129 | 0.84 ± 0.16 | 1.332 ± 0.169 | 1.557 ± 0.218 | 0.225 ± 0.073 | 0.84 ± 0.149 | 0.0 ± 0.0
Ile | 4.549 ± 0.311 | 0.594 ± 0.115 | 5.061 ± 0.384 | 6.393 ± 0.398 | 1.742 ± 0.211 | 3.545 ± 0.295 | 1.168 ± 0.155 | 4.159 ± 0.302 | 5.86 ± 0.385 | 4.508 ± 0.282 | 1.578 ± 0.201 | 4.057 ± 0.373 | 2.172 ± 0.205 | 2.008 ± 0.19 | 2.725 ± 0.285 | 3.873 ± 0.25 | 4.59 ± 0.371 | 4.098 ± 0.292 | 0.41 ± 0.083 | 2.869 ± 0.239 | 0.0 ± 0.0
Lys | 5.307 ± 0.357 | 0.553 ± 0.122 | 4.877 ± 0.311 | 8.114 ± 0.591 | 2.705 ± 0.227 | 5.881 ± 0.368 | 1.393 ± 0.174 | 4.467 ± 0.314 | 6.598 ± 0.503 | 6.454 ± 0.401 | 2.541 ± 0.257 | 4.016 ± 0.294 | 2.418 ± 0.289 | 3.278 ± 0.273 | 3.668 ± 0.313 | 4.2 ± 0.376 | 4.385 ± 0.384 | 5.922 ± 0.361 | 1.024 ± 0.131 | 3.647 ± 0.294 | 0.0 ± 0.0
Leu | 4.959 ± 0.373 | 0.635 ± 0.119 | 5.635 ± 0.322 | 7.458 ± 0.514 | 3.135 ± 0.245 | 5.307 ± 0.329 | 1.578 ± 0.186 | 4.815 ± 0.341 | 6.188 ± 0.382 | 5.737 ± 0.449 | 2.274 ± 0.185 | 3.729 ± 0.24 | 2.787 ± 0.238 | 2.664 ± 0.215 | 4.016 ± 0.332 | 4.856 ± 0.328 | 5.717 ± 0.33 | 5.368 ± 0.437 | 0.82 ± 0.124 | 3.36 ± 0.246 | 0.0 ± 0.0
Met | 1.967 ± 0.172 | 0.287 ± 0.071 | 1.557 ± 0.201 | 2.254 ± 0.205 | 1.106 ± 0.164 | 1.619 ± 0.209 | 0.533 ± 0.107 | 1.496 ± 0.196 | 2.828 ± 0.215 | 2.336 ± 0.222 | 0.697 ± 0.149 | 1.557 ± 0.212 | 0.656 ± 0.094 | 0.963 ± 0.114 | 1.373 ± 0.182 | 1.762 ± 0.185 | 2.172 ± 0.222 | 1.639 ± 0.184 | 0.328 ± 0.084 | 1.373 ± 0.165 | 0.0 ± 0.0
Asn | 3.053 ± 0.287 | 0.451 ± 0.106 | 3.053 ± 0.253 | 3.422 ± 0.24 | 1.926 ± 0.213 | 4.241 ± 0.348 | 1.045 ± 0.131 | 3.114 ± 0.264 | 4.549 ± 0.284 | 4.405 ± 0.316 | 1.947 ± 0.188 | 3.237 ± 0.323 | 2.295 ± 0.341 | 1.803 ± 0.178 | 2.725 ± 0.256 | 2.418 ± 0.269 | 3.012 ± 0.299 | 3.524 ± 0.25 | 0.717 ± 0.126 | 2.479 ± 0.223 | 0.0 ± 0.0
Pro | 1.906 ± 0.292 | 0.164 ± 0.059 | 1.783 ± 0.229 | 2.602 ± 0.252 | 1.516 ± 0.198 | 0.881 ± 0.156 | 0.635 ± 0.112 | 1.721 ± 0.188 | 2.069 ± 0.187 | 2.254 ± 0.24 | 0.82 ± 0.143 | 1.496 ± 0.174 | 0.881 ± 0.17 | 1.168 ± 0.264 | 1.168 ± 0.165 | 1.926 ± 0.229 | 2.11 ± 0.255 | 2.418 ± 0.277 | 0.266 ± 0.074 | 1.475 ± 0.182 | 0.0 ± 0.0
Gln | 2.602 ± 0.265 | 0.184 ± 0.063 | 1.967 ± 0.195 | 2.582 ± 0.228 | 1.065 ± 0.172 | 2.295 ± 0.244 | 0.656 ± 0.153 | 2.11 ± 0.228 | 2.213 ± 0.216 | 2.643 ± 0.237 | 1.065 ± 0.116 | 1.762 ± 0.233 | 1.147 ± 0.207 | 1.578 ± 0.273 | 1.475 ± 0.155 | 1.803 ± 0.184 | 2.028 ± 0.23 | 2.192 ± 0.209 | 0.43 ± 0.1 | 1.393 ± 0.137 | 0.0 ± 0.0
Arg | 2.274 ± 0.213 | 0.307 ± 0.074 | 2.5 ± 0.274 | 4.057 ± 0.312 | 2.192 ± 0.226 | 3.073 ± 0.243 | 0.615 ± 0.121 | 3.012 ± 0.3 | 3.053 ± 0.286 | 4.016 ± 0.271 | 1.414 ± 0.168 | 2.151 ± 0.235 | 1.106 ± 0.148 | 1.516 ± 0.184 | 1.557 ± 0.196 | 1.701 ± 0.195 | 2.397 ± 0.247 | 3.217 ± 0.285 | 0.512 ± 0.097 | 2.008 ± 0.223 | 0.0 ± 0.0
Ser | 3.422 ± 0.309 | 0.266 ± 0.071 | 2.951 ± 0.257 | 3.278 ± 0.295 | 2.869 ± 0.262 | 3.729 ± 0.293 | 1.065 ± 0.154 | 4.282 ± 0.305 | 4.426 ± 0.341 | 4.631 ± 0.283 | 1.557 ± 0.167 | 2.951 ± 0.255 | 1.537 ± 0.197 | 1.311 ± 0.166 | 2.172 ± 0.189 | 3.094 ± 0.303 | 3.319 ± 0.302 | 3.196 ± 0.267 | 0.861 ± 0.135 | 2.807 ± 0.232 | 0.0 ± 0.0
Thr | 3.565 ± 0.325 | 0.471 ± 0.111 | 3.319 ± 0.278 | 4.323 ± 0.251 | 2.623 ± 0.273 | 4.364 ± 0.354 | 0.984 ± 0.154 | 4.262 ± 0.369 | 4.528 ± 0.326 | 5.491 ± 0.317 | 1.434 ± 0.199 | 3.688 ± 0.42 | 2.459 ± 0.33 | 1.967 ± 0.201 | 2.828 ± 0.251 | 3.34 ± 0.267 | 3.524 ± 0.321 | 5.04 ± 0.385 | 0.676 ± 0.125 | 3.36 ± 0.211 | 0.0 ± 0.0
Val | 3.975 ± 0.268 | 0.492 ± 0.114 | 4.999 ± 0.301 | 6.085 ± 0.416 | 2.93 ± 0.274 | 4.139 ± 0.299 | 1.291 ± 0.168 | 4.077 ± 0.264 | 5.819 ± 0.395 | 4.733 ± 0.302 | 1.496 ± 0.171 | 3.565 ± 0.294 | 2.459 ± 0.236 | 2.418 ± 0.207 | 2.643 ± 0.216 | 3.688 ± 0.331 | 5.389 ± 0.446 | 4.795 ± 0.362 | 0.635 ± 0.105 | 3.34 ± 0.303 | 0.0 ± 0.0
Trp | 0.574 ± 0.104 | 0.123 ± 0.049 | 0.984 ± 0.135 | 0.922 ± 0.153 | 0.451 ± 0.103 | 0.82 ± 0.135 | 0.369 ± 0.086 | 0.984 ± 0.144 | 0.963 ± 0.121 | 0.861 ± 0.15 | 0.389 ± 0.088 | 0.656 ± 0.132 | 0.0 ± 0.0 | 0.389 ± 0.098 | 0.512 ± 0.098 | 0.799 ± 0.132 | 0.717 ± 0.126 | 0.799 ± 0.133 | 0.184 ± 0.061 | 0.676 ± 0.114 | 0.0 ± 0.0
Tyr | 2.438 ± 0.232 | 0.512 ± 0.114 | 3.36 ± 0.333 | 3.873 ± 0.336 | 1.537 ± 0.182 | 2.848 ± 0.221 | 1.045 ± 0.157 | 3.668 ± 0.263 | 4.2 ± 0.333 | 3.504 ± 0.321 | 1.106 ± 0.15 | 2.951 ± 0.208 | 1.537 ± 0.189 | 1.967 ± 0.24 | 2.192 ± 0.222 | 2.52 ± 0.188 | 3.012 ± 0.248 | 2.91 ± 0.306 | 0.451 ± 0.124 | 2.151 ± 0.259 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 272 proteins (48806 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
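A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the database's implementation: the function names are hypothetical, and it assumes that whole proteins are resampled with replacement, that the frequencies are recomputed for each replicate, and that the reported error is the standard deviation across replicates (here the population form, `statistics.pstdev`).

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins):
    """Dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, replicates=100, seed=0):
    """Estimate the spread of each dipeptide frequency by resampling proteins."""
    if not proteins:
        return {}
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_per_mille(resampled))
    pairs = set().union(*samples)
    # A dipeptide absent from a replicate contributes a frequency of 0.
    return {
        pair: statistics.pstdev([s.get(pair, 0.0) for s in samples])
        for pair in pairs
    }


toy = ["MKKLA", "AAAA", "GGGGG"]
errors = bootstrap_error(toy, replicates=100, seed=1)
print(sorted(errors))
```

Resampling whole proteins rather than individual residues keeps the dipeptides of each protein together, so the estimate reflects variation between proteins.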

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski