Amino acid dipepetide frequency for Klebsiella phage vB_KpnM_KB57

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.0AlaAla: 6.0 ± 0.523
0.83AlaCys: 0.83 ± 0.141
4.458AlaAsp: 4.458 ± 0.336
4.956AlaGlu: 4.956 ± 0.356
2.869AlaPhe: 2.869 ± 0.26
5.573AlaGly: 5.573 ± 0.364
0.996AlaHis: 0.996 ± 0.15
4.34AlaIle: 4.34 ± 0.322
5.383AlaLys: 5.383 ± 0.483
6.095AlaLeu: 6.095 ± 0.391
2.514AlaMet: 2.514 ± 0.23
3.201AlaAsn: 3.201 ± 0.286
2.182AlaPro: 2.182 ± 0.272
2.419AlaGln: 2.419 ± 0.258
3.439AlaArg: 3.439 ± 0.353
4.126AlaSer: 4.126 ± 0.36
4.316AlaThr: 4.316 ± 0.335
4.719AlaVal: 4.719 ± 0.286
1.138AlaTrp: 1.138 ± 0.153
2.727AlaTyr: 2.727 ± 0.267
0.0AlaXaa: 0.0 ± 0.0
Cys
0.735CysAla: 0.735 ± 0.148
0.213CysCys: 0.213 ± 0.062
0.83CysAsp: 0.83 ± 0.162
0.64CysGlu: 0.64 ± 0.126
0.569CysPhe: 0.569 ± 0.12
1.115CysGly: 1.115 ± 0.194
0.379CysHis: 0.379 ± 0.104
0.617CysIle: 0.617 ± 0.1
0.996CysLys: 0.996 ± 0.203
0.901CysLeu: 0.901 ± 0.145
0.308CysMet: 0.308 ± 0.081
0.522CysAsn: 0.522 ± 0.111
0.379CysPro: 0.379 ± 0.131
0.427CysGln: 0.427 ± 0.102
0.759CysArg: 0.759 ± 0.166
0.877CysSer: 0.877 ± 0.198
0.617CysThr: 0.617 ± 0.126
0.617CysVal: 0.617 ± 0.106
0.237CysTrp: 0.237 ± 0.073
0.498CysTyr: 0.498 ± 0.095
0.0CysXaa: 0.0 ± 0.0
Asp
3.937AspAla: 3.937 ± 0.323
0.735AspCys: 0.735 ± 0.113
4.008AspAsp: 4.008 ± 0.349
4.956AspGlu: 4.956 ± 0.355
3.035AspPhe: 3.035 ± 0.279
5.193AspGly: 5.193 ± 0.396
1.352AspHis: 1.352 ± 0.167
3.984AspIle: 3.984 ± 0.349
4.15AspLys: 4.15 ± 0.356
6.095AspLeu: 6.095 ± 0.375
1.755AspMet: 1.755 ± 0.208
3.273AspAsn: 3.273 ± 0.293
2.941AspPro: 2.941 ± 0.245
1.802AspGln: 1.802 ± 0.223
2.846AspArg: 2.846 ± 0.224
3.273AspSer: 3.273 ± 0.294
3.225AspThr: 3.225 ± 0.339
4.577AspVal: 4.577 ± 0.347
1.304AspTrp: 1.304 ± 0.229
2.703AspTyr: 2.703 ± 0.265
0.0AspXaa: 0.0 ± 0.0
Glu
5.834GluAla: 5.834 ± 0.432
0.806GluCys: 0.806 ± 0.165
4.245GluAsp: 4.245 ± 0.322
5.241GluGlu: 5.241 ± 0.377
3.012GluPhe: 3.012 ± 0.247
3.462GluGly: 3.462 ± 0.309
1.257GluHis: 1.257 ± 0.19
4.648GluIle: 4.648 ± 0.338
5.573GluLys: 5.573 ± 0.407
5.407GluLeu: 5.407 ± 0.386
2.182GluMet: 2.182 ± 0.242
3.201GluAsn: 3.201 ± 0.28
1.565GluPro: 1.565 ± 0.178
2.134GluGln: 2.134 ± 0.199
3.439GluArg: 3.439 ± 0.323
3.676GluSer: 3.676 ± 0.365
3.344GluThr: 3.344 ± 0.27
4.814GluVal: 4.814 ± 0.367
1.186GluTrp: 1.186 ± 0.17
2.798GluTyr: 2.798 ± 0.3
0.0GluXaa: 0.0 ± 0.0
Phe
2.585PheAla: 2.585 ± 0.288
0.711PheCys: 0.711 ± 0.139
3.083PheAsp: 3.083 ± 0.308
2.751PheGlu: 2.751 ± 0.283
1.826PhePhe: 1.826 ± 0.223
3.605PheGly: 3.605 ± 0.297
0.64PheHis: 0.64 ± 0.124
2.49PheIle: 2.49 ± 0.226
2.158PheLys: 2.158 ± 0.222
3.178PheLeu: 3.178 ± 0.306
1.067PheMet: 1.067 ± 0.133
2.324PheAsn: 2.324 ± 0.243
1.47PhePro: 1.47 ± 0.165
1.47PheGln: 1.47 ± 0.203
1.897PheArg: 1.897 ± 0.204
2.822PheSer: 2.822 ± 0.272
2.632PheThr: 2.632 ± 0.258
2.893PheVal: 2.893 ± 0.237
0.711PheTrp: 0.711 ± 0.129
1.636PheTyr: 1.636 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
4.719GlyAla: 4.719 ± 0.43
0.925GlyCys: 0.925 ± 0.161
4.672GlyAsp: 4.672 ± 0.362
4.411GlyGlu: 4.411 ± 0.314
3.32GlyPhe: 3.32 ± 0.298
5.099GlyGly: 5.099 ± 0.469
1.304GlyHis: 1.304 ± 0.155
4.008GlyIle: 4.008 ± 0.34
5.549GlyLys: 5.549 ± 0.355
5.407GlyLeu: 5.407 ± 0.393
2.395GlyMet: 2.395 ± 0.188
3.249GlyAsn: 3.249 ± 0.302
1.66GlyPro: 1.66 ± 0.188
2.253GlyGln: 2.253 ± 0.223
3.13GlyArg: 3.13 ± 0.276
4.15GlySer: 4.15 ± 0.399
4.245GlyThr: 4.245 ± 0.455
5.691GlyVal: 5.691 ± 0.423
1.589GlyTrp: 1.589 ± 0.223
3.201GlyTyr: 3.201 ± 0.286
0.0GlyXaa: 0.0 ± 0.0
His
1.02HisAla: 1.02 ± 0.152
0.285HisCys: 0.285 ± 0.072
1.138HisAsp: 1.138 ± 0.158
1.304HisGlu: 1.304 ± 0.193
1.067HisPhe: 1.067 ± 0.154
1.518HisGly: 1.518 ± 0.27
0.427HisHis: 0.427 ± 0.094
1.447HisIle: 1.447 ± 0.191
1.233HisLys: 1.233 ± 0.157
1.423HisLeu: 1.423 ± 0.185
0.522HisMet: 0.522 ± 0.093
0.854HisAsn: 0.854 ± 0.131
0.783HisPro: 0.783 ± 0.122
0.474HisGln: 0.474 ± 0.115
0.949HisArg: 0.949 ± 0.161
1.162HisSer: 1.162 ± 0.17
1.02HisThr: 1.02 ± 0.161
0.949HisVal: 0.949 ± 0.147
0.237HisTrp: 0.237 ± 0.066
0.877HisTyr: 0.877 ± 0.138
0.0HisXaa: 0.0 ± 0.0
Ile
4.292IleAla: 4.292 ± 0.352
0.925IleCys: 0.925 ± 0.15
3.913IleAsp: 3.913 ± 0.277
4.031IleGlu: 4.031 ± 0.322
2.182IlePhe: 2.182 ± 0.254
3.889IleGly: 3.889 ± 0.313
1.399IleHis: 1.399 ± 0.168
3.628IleIle: 3.628 ± 0.304
4.079IleLys: 4.079 ± 0.31
4.506IleLeu: 4.506 ± 0.321
1.826IleMet: 1.826 ± 0.185
2.537IleAsn: 2.537 ± 0.25
2.609IlePro: 2.609 ± 0.211
1.541IleGln: 1.541 ± 0.215
3.581IleArg: 3.581 ± 0.282
4.363IleSer: 4.363 ± 0.309
3.462IleThr: 3.462 ± 0.306
4.34IleVal: 4.34 ± 0.346
0.593IleTrp: 0.593 ± 0.128
2.561IleTyr: 2.561 ± 0.284
0.0IleXaa: 0.0 ± 0.0
Lys
5.834LysAla: 5.834 ± 0.442
0.593LysCys: 0.593 ± 0.128
4.767LysAsp: 4.767 ± 0.384
5.265LysGlu: 5.265 ± 0.388
3.083LysPhe: 3.083 ± 0.289
4.743LysGly: 4.743 ± 0.34
1.494LysHis: 1.494 ± 0.187
4.126LysIle: 4.126 ± 0.312
5.193LysLys: 5.193 ± 0.426
5.027LysLeu: 5.027 ± 0.329
2.087LysMet: 2.087 ± 0.199
3.035LysAsn: 3.035 ± 0.267
2.039LysPro: 2.039 ± 0.211
2.68LysGln: 2.68 ± 0.233
3.462LysArg: 3.462 ± 0.287
3.723LysSer: 3.723 ± 0.317
4.363LysThr: 4.363 ± 0.355
5.241LysVal: 5.241 ± 0.348
1.02LysTrp: 1.02 ± 0.174
2.822LysTyr: 2.822 ± 0.259
0.0LysXaa: 0.0 ± 0.0
Leu
6.711LeuAla: 6.711 ± 0.41
0.925LeuCys: 0.925 ± 0.154
5.62LeuAsp: 5.62 ± 0.388
5.929LeuGlu: 5.929 ± 0.404
2.585LeuPhe: 2.585 ± 0.266
4.648LeuGly: 4.648 ± 0.372
1.613LeuHis: 1.613 ± 0.186
4.221LeuIle: 4.221 ± 0.315
5.502LeuLys: 5.502 ± 0.32
5.644LeuLeu: 5.644 ± 0.425
2.229LeuMet: 2.229 ± 0.246
3.96LeuAsn: 3.96 ± 0.313
3.439LeuPro: 3.439 ± 0.347
2.466LeuGln: 2.466 ± 0.224
4.482LeuArg: 4.482 ± 0.348
5.502LeuSer: 5.502 ± 0.354
4.838LeuThr: 4.838 ± 0.355
5.051LeuVal: 5.051 ± 0.329
1.138LeuTrp: 1.138 ± 0.191
3.178LeuTyr: 3.178 ± 0.305
0.0LeuXaa: 0.0 ± 0.0
Met
2.371MetAla: 2.371 ± 0.231
0.356MetCys: 0.356 ± 0.102
1.352MetAsp: 1.352 ± 0.179
1.826MetGlu: 1.826 ± 0.201
1.589MetPhe: 1.589 ± 0.198
1.707MetGly: 1.707 ± 0.251
0.403MetHis: 0.403 ± 0.084
2.111MetIle: 2.111 ± 0.223
3.154MetLys: 3.154 ± 0.285
1.992MetLeu: 1.992 ± 0.213
1.043MetMet: 1.043 ± 0.175
1.565MetAsn: 1.565 ± 0.191
1.02MetPro: 1.02 ± 0.166
0.735MetGln: 0.735 ± 0.122
1.233MetArg: 1.233 ± 0.156
2.016MetSer: 2.016 ± 0.225
1.873MetThr: 1.873 ± 0.207
1.826MetVal: 1.826 ± 0.229
0.261MetTrp: 0.261 ± 0.076
0.877MetTyr: 0.877 ± 0.117
0.0MetXaa: 0.0 ± 0.0
Asn
3.201AsnAla: 3.201 ± 0.29
0.617AsnCys: 0.617 ± 0.111
2.656AsnAsp: 2.656 ± 0.234
2.158AsnGlu: 2.158 ± 0.232
1.968AsnPhe: 1.968 ± 0.203
4.008AsnGly: 4.008 ± 0.285
0.925AsnHis: 0.925 ± 0.149
2.988AsnIle: 2.988 ± 0.272
3.201AsnLys: 3.201 ± 0.321
3.771AsnLeu: 3.771 ± 0.275
1.565AsnMet: 1.565 ± 0.237
2.443AsnAsn: 2.443 ± 0.256
2.087AsnPro: 2.087 ± 0.198
1.613AsnGln: 1.613 ± 0.172
2.419AsnArg: 2.419 ± 0.229
2.798AsnSer: 2.798 ± 0.321
3.344AsnThr: 3.344 ± 0.327
3.225AsnVal: 3.225 ± 0.269
0.664AsnTrp: 0.664 ± 0.152
1.779AsnTyr: 1.779 ± 0.168
0.0AsnXaa: 0.0 ± 0.0
Pro
2.324ProAla: 2.324 ± 0.264
0.379ProCys: 0.379 ± 0.108
2.775ProAsp: 2.775 ± 0.257
3.367ProGlu: 3.367 ± 0.307
1.589ProPhe: 1.589 ± 0.184
2.703ProGly: 2.703 ± 0.282
0.64ProHis: 0.64 ± 0.118
1.447ProIle: 1.447 ± 0.193
2.063ProLys: 2.063 ± 0.228
2.632ProLeu: 2.632 ± 0.242
0.877ProMet: 0.877 ± 0.154
1.636ProAsn: 1.636 ± 0.195
0.877ProPro: 0.877 ± 0.16
1.47ProGln: 1.47 ± 0.188
1.423ProArg: 1.423 ± 0.173
1.897ProSer: 1.897 ± 0.229
2.063ProThr: 2.063 ± 0.241
3.13ProVal: 3.13 ± 0.267
0.522ProTrp: 0.522 ± 0.106
1.565ProTyr: 1.565 ± 0.208
0.0ProXaa: 0.0 ± 0.0
Gln
2.3GlnAla: 2.3 ± 0.215
0.403GlnCys: 0.403 ± 0.101
1.85GlnAsp: 1.85 ± 0.199
2.277GlnGlu: 2.277 ± 0.247
1.209GlnPhe: 1.209 ± 0.211
2.134GlnGly: 2.134 ± 0.196
0.379GlnHis: 0.379 ± 0.084
2.395GlnIle: 2.395 ± 0.205
2.277GlnLys: 2.277 ± 0.234
2.751GlnLeu: 2.751 ± 0.252
0.83GlnMet: 0.83 ± 0.126
1.802GlnAsn: 1.802 ± 0.203
1.02GlnPro: 1.02 ± 0.17
1.636GlnGln: 1.636 ± 0.224
1.589GlnArg: 1.589 ± 0.177
1.755GlnSer: 1.755 ± 0.202
1.921GlnThr: 1.921 ± 0.222
2.443GlnVal: 2.443 ± 0.249
0.593GlnTrp: 0.593 ± 0.124
1.162GlnTyr: 1.162 ± 0.157
0.0GlnXaa: 0.0 ± 0.0
Arg
2.988ArgAla: 2.988 ± 0.214
0.83ArgCys: 0.83 ± 0.182
3.676ArgAsp: 3.676 ± 0.312
3.154ArgGlu: 3.154 ± 0.281
1.613ArgPhe: 1.613 ± 0.227
3.439ArgGly: 3.439 ± 0.295
0.925ArgHis: 0.925 ± 0.144
2.988ArgIle: 2.988 ± 0.262
3.723ArgLys: 3.723 ± 0.233
4.103ArgLeu: 4.103 ± 0.306
1.328ArgMet: 1.328 ± 0.151
2.656ArgAsn: 2.656 ± 0.236
1.684ArgPro: 1.684 ± 0.196
1.921ArgGln: 1.921 ± 0.216
2.632ArgArg: 2.632 ± 0.302
3.035ArgSer: 3.035 ± 0.26
2.609ArgThr: 2.609 ± 0.281
3.391ArgVal: 3.391 ± 0.308
0.593ArgTrp: 0.593 ± 0.127
1.945ArgTyr: 1.945 ± 0.209
0.0ArgXaa: 0.0 ± 0.0
Ser
4.197SerAla: 4.197 ± 0.315
0.64SerCys: 0.64 ± 0.12
3.889SerAsp: 3.889 ± 0.265
3.605SerGlu: 3.605 ± 0.304
2.585SerPhe: 2.585 ± 0.224
4.885SerGly: 4.885 ± 0.335
0.901SerHis: 0.901 ± 0.158
3.652SerIle: 3.652 ± 0.285
3.937SerLys: 3.937 ± 0.32
5.051SerLeu: 5.051 ± 0.371
1.684SerMet: 1.684 ± 0.209
2.941SerAsn: 2.941 ± 0.243
2.253SerPro: 2.253 ± 0.211
1.897SerGln: 1.897 ± 0.213
3.083SerArg: 3.083 ± 0.313
3.462SerSer: 3.462 ± 0.315
3.581SerThr: 3.581 ± 0.369
4.34SerVal: 4.34 ± 0.308
1.115SerTrp: 1.115 ± 0.151
1.826SerTyr: 1.826 ± 0.19
0.0SerXaa: 0.0 ± 0.0
Thr
4.34ThrAla: 4.34 ± 0.388
0.569ThrCys: 0.569 ± 0.135
3.723ThrAsp: 3.723 ± 0.312
3.628ThrGlu: 3.628 ± 0.334
2.49ThrPhe: 2.49 ± 0.236
5.099ThrGly: 5.099 ± 0.482
1.257ThrHis: 1.257 ± 0.145
3.557ThrIle: 3.557 ± 0.285
3.842ThrLys: 3.842 ± 0.272
5.099ThrLeu: 5.099 ± 0.338
1.375ThrMet: 1.375 ± 0.177
2.229ThrAsn: 2.229 ± 0.245
2.964ThrPro: 2.964 ± 0.311
1.85ThrGln: 1.85 ± 0.202
2.419ThrArg: 2.419 ± 0.235
3.225ThrSer: 3.225 ± 0.258
3.367ThrThr: 3.367 ± 0.424
4.648ThrVal: 4.648 ± 0.448
1.233ThrTrp: 1.233 ± 0.202
2.229ThrTyr: 2.229 ± 0.244
0.0ThrXaa: 0.0 ± 0.0
Val
5.075ValAla: 5.075 ± 0.352
0.901ValCys: 0.901 ± 0.169
4.624ValAsp: 4.624 ± 0.34
4.767ValGlu: 4.767 ± 0.323
3.249ValPhe: 3.249 ± 0.274
4.624ValGly: 4.624 ± 0.346
1.043ValHis: 1.043 ± 0.179
4.648ValIle: 4.648 ± 0.338
4.909ValLys: 4.909 ± 0.398
5.573ValLeu: 5.573 ± 0.304
1.873ValMet: 1.873 ± 0.21
3.059ValAsn: 3.059 ± 0.295
2.253ValPro: 2.253 ± 0.224
2.111ValGln: 2.111 ± 0.259
3.605ValArg: 3.605 ± 0.332
4.292ValSer: 4.292 ± 0.302
4.814ValThr: 4.814 ± 0.409
6.166ValVal: 6.166 ± 0.483
1.067ValTrp: 1.067 ± 0.149
2.775ValTyr: 2.775 ± 0.283
0.0ValXaa: 0.0 ± 0.0
Trp
0.972TrpAla: 0.972 ± 0.162
0.166TrpCys: 0.166 ± 0.069
1.067TrpAsp: 1.067 ± 0.186
1.399TrpGlu: 1.399 ± 0.217
0.688TrpPhe: 0.688 ± 0.114
0.972TrpGly: 0.972 ± 0.163
0.261TrpHis: 0.261 ± 0.083
0.901TrpIle: 0.901 ± 0.175
0.972TrpLys: 0.972 ± 0.148
1.565TrpLeu: 1.565 ± 0.197
0.711TrpMet: 0.711 ± 0.115
0.901TrpAsn: 0.901 ± 0.159
0.356TrpPro: 0.356 ± 0.09
0.522TrpGln: 0.522 ± 0.108
0.593TrpArg: 0.593 ± 0.133
1.02TrpSer: 1.02 ± 0.145
0.83TrpThr: 0.83 ± 0.15
0.83TrpVal: 0.83 ± 0.131
0.498TrpTrp: 0.498 ± 0.115
1.02TrpTyr: 1.02 ± 0.134
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.846TyrAla: 2.846 ± 0.269
0.427TyrCys: 0.427 ± 0.099
2.798TyrAsp: 2.798 ± 0.286
2.158TyrGlu: 2.158 ± 0.221
1.423TyrPhe: 1.423 ± 0.168
2.561TyrGly: 2.561 ± 0.241
1.091TyrHis: 1.091 ± 0.154
2.063TyrIle: 2.063 ± 0.217
2.703TyrLys: 2.703 ± 0.269
3.486TyrLeu: 3.486 ± 0.271
1.115TyrMet: 1.115 ± 0.174
2.039TyrAsn: 2.039 ± 0.21
1.802TyrPro: 1.802 ± 0.18
1.304TyrGln: 1.304 ± 0.164
2.253TyrArg: 2.253 ± 0.218
2.348TyrSer: 2.348 ± 0.276
2.68TyrThr: 2.68 ± 0.266
2.537TyrVal: 2.537 ± 0.248
0.545TyrTrp: 0.545 ± 0.117
1.779TyrTyr: 1.779 ± 0.212
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 245 proteins (42170 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski