Amino acid dipepetide frequency for Klebsiella phage KPN1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.795AlaAla: 5.795 ± 0.587
0.527AlaCys: 0.527 ± 0.127
4.511AlaAsp: 4.511 ± 0.349
5.269AlaGlu: 5.269 ± 0.549
2.667AlaPhe: 2.667 ± 0.304
4.61AlaGly: 4.61 ± 0.541
1.251AlaHis: 1.251 ± 0.203
4.742AlaIle: 4.742 ± 0.447
5.4AlaLys: 5.4 ± 0.483
5.73AlaLeu: 5.73 ± 0.568
1.679AlaMet: 1.679 ± 0.22
3.787AlaAsn: 3.787 ± 0.342
2.996AlaPro: 2.996 ± 0.338
2.766AlaGln: 2.766 ± 0.325
3.392AlaArg: 3.392 ± 0.311
4.182AlaSer: 4.182 ± 0.417
3.984AlaThr: 3.984 ± 0.511
5.005AlaVal: 5.005 ± 0.475
1.152AlaTrp: 1.152 ± 0.226
3.194AlaTyr: 3.194 ± 0.371
0.0AlaXaa: 0.0 ± 0.0
Cys
0.527CysAla: 0.527 ± 0.15
0.263CysCys: 0.263 ± 0.103
0.823CysAsp: 0.823 ± 0.179
0.659CysGlu: 0.659 ± 0.139
0.263CysPhe: 0.263 ± 0.086
0.79CysGly: 0.79 ± 0.187
0.132CysHis: 0.132 ± 0.067
0.395CysIle: 0.395 ± 0.119
0.461CysLys: 0.461 ± 0.135
0.527CysLeu: 0.527 ± 0.163
0.296CysMet: 0.296 ± 0.09
0.428CysAsn: 0.428 ± 0.091
0.428CysPro: 0.428 ± 0.113
0.527CysGln: 0.527 ± 0.134
0.527CysArg: 0.527 ± 0.123
0.626CysSer: 0.626 ± 0.151
0.56CysThr: 0.56 ± 0.126
0.79CysVal: 0.79 ± 0.161
0.198CysTrp: 0.198 ± 0.077
0.494CysTyr: 0.494 ± 0.131
0.0CysXaa: 0.0 ± 0.0
Asp
5.071AspAla: 5.071 ± 0.604
0.527AspCys: 0.527 ± 0.132
4.215AspAsp: 4.215 ± 0.356
4.61AspGlu: 4.61 ± 0.441
2.996AspPhe: 2.996 ± 0.316
5.269AspGly: 5.269 ± 0.424
0.593AspHis: 0.593 ± 0.141
4.972AspIle: 4.972 ± 0.352
4.478AspLys: 4.478 ± 0.383
5.203AspLeu: 5.203 ± 0.452
1.811AspMet: 1.811 ± 0.276
2.634AspAsn: 2.634 ± 0.279
2.173AspPro: 2.173 ± 0.307
1.778AspGln: 1.778 ± 0.274
2.009AspArg: 2.009 ± 0.24
4.314AspSer: 4.314 ± 0.388
3.161AspThr: 3.161 ± 0.321
4.017AspVal: 4.017 ± 0.313
1.35AspTrp: 1.35 ± 0.212
3.128AspTyr: 3.128 ± 0.415
0.0AspXaa: 0.0 ± 0.0
Glu
5.927GluAla: 5.927 ± 0.595
1.054GluCys: 1.054 ± 0.181
3.951GluAsp: 3.951 ± 0.426
5.269GluGlu: 5.269 ± 0.591
3.194GluPhe: 3.194 ± 0.32
4.149GluGly: 4.149 ± 0.412
1.12GluHis: 1.12 ± 0.2
5.334GluIle: 5.334 ± 0.42
4.379GluLys: 4.379 ± 0.407
6.717GluLeu: 6.717 ± 0.553
2.305GluMet: 2.305 ± 0.254
3.787GluAsn: 3.787 ± 0.34
2.272GluPro: 2.272 ± 0.291
2.535GluGln: 2.535 ± 0.297
2.7GluArg: 2.7 ± 0.314
4.149GluSer: 4.149 ± 0.426
3.787GluThr: 3.787 ± 0.349
5.005GluVal: 5.005 ± 0.383
1.251GluTrp: 1.251 ± 0.188
3.26GluTyr: 3.26 ± 0.354
0.0GluXaa: 0.0 ± 0.0
Phe
3.227PheAla: 3.227 ± 0.353
0.494PheCys: 0.494 ± 0.105
2.47PheAsp: 2.47 ± 0.246
3.556PheGlu: 3.556 ± 0.356
1.449PhePhe: 1.449 ± 0.209
3.194PheGly: 3.194 ± 0.301
0.724PheHis: 0.724 ± 0.155
2.535PheIle: 2.535 ± 0.264
3.523PheLys: 3.523 ± 0.394
2.239PheLeu: 2.239 ± 0.261
1.449PheMet: 1.449 ± 0.24
2.601PheAsn: 2.601 ± 0.27
1.021PhePro: 1.021 ± 0.155
1.218PheGln: 1.218 ± 0.162
1.613PheArg: 1.613 ± 0.228
3.556PheSer: 3.556 ± 0.319
2.404PheThr: 2.404 ± 0.292
2.832PheVal: 2.832 ± 0.317
0.56PheTrp: 0.56 ± 0.131
1.745PheTyr: 1.745 ± 0.214
0.0PheXaa: 0.0 ± 0.0
Gly
4.116GlyAla: 4.116 ± 0.467
0.395GlyCys: 0.395 ± 0.122
3.886GlyAsp: 3.886 ± 0.48
4.281GlyGlu: 4.281 ± 0.403
2.634GlyPhe: 2.634 ± 0.304
3.128GlyGly: 3.128 ± 0.394
0.988GlyHis: 0.988 ± 0.216
4.676GlyIle: 4.676 ± 0.327
5.17GlyLys: 5.17 ± 0.38
5.236GlyLeu: 5.236 ± 0.493
2.009GlyMet: 2.009 ± 0.292
3.359GlyAsn: 3.359 ± 0.495
1.679GlyPro: 1.679 ± 0.233
2.47GlyGln: 2.47 ± 0.354
2.7GlyArg: 2.7 ± 0.332
4.511GlySer: 4.511 ± 0.441
3.853GlyThr: 3.853 ± 0.323
4.643GlyVal: 4.643 ± 0.405
0.823GlyTrp: 0.823 ± 0.177
2.898GlyTyr: 2.898 ± 0.337
0.0GlyXaa: 0.0 ± 0.0
His
0.823HisAla: 0.823 ± 0.16
0.099HisCys: 0.099 ± 0.079
1.218HisAsp: 1.218 ± 0.154
1.185HisGlu: 1.185 ± 0.232
0.856HisPhe: 0.856 ± 0.192
1.021HisGly: 1.021 ± 0.213
0.296HisHis: 0.296 ± 0.095
1.383HisIle: 1.383 ± 0.205
1.054HisLys: 1.054 ± 0.227
1.548HisLeu: 1.548 ± 0.239
0.395HisMet: 0.395 ± 0.114
0.626HisAsn: 0.626 ± 0.117
1.251HisPro: 1.251 ± 0.183
0.428HisGln: 0.428 ± 0.108
0.724HisArg: 0.724 ± 0.141
0.757HisSer: 0.757 ± 0.148
0.955HisThr: 0.955 ± 0.173
1.317HisVal: 1.317 ± 0.209
0.296HisTrp: 0.296 ± 0.103
0.56HisTyr: 0.56 ± 0.138
0.0HisXaa: 0.0 ± 0.0
Ile
3.721IleAla: 3.721 ± 0.35
0.659IleCys: 0.659 ± 0.146
4.248IleAsp: 4.248 ± 0.331
5.828IleGlu: 5.828 ± 0.487
2.535IlePhe: 2.535 ± 0.271
3.853IleGly: 3.853 ± 0.37
1.218IleHis: 1.218 ± 0.184
4.215IleIle: 4.215 ± 0.386
5.993IleLys: 5.993 ± 0.483
3.655IleLeu: 3.655 ± 0.307
2.009IleMet: 2.009 ± 0.298
4.412IleAsn: 4.412 ± 0.392
2.601IlePro: 2.601 ± 0.279
2.437IleGln: 2.437 ± 0.279
3.293IleArg: 3.293 ± 0.385
4.116IleSer: 4.116 ± 0.407
4.84IleThr: 4.84 ± 0.421
4.742IleVal: 4.742 ± 0.463
0.428IleTrp: 0.428 ± 0.145
2.239IleTyr: 2.239 ± 0.211
0.0IleXaa: 0.0 ± 0.0
Lys
5.664LysAla: 5.664 ± 0.588
0.626LysCys: 0.626 ± 0.148
4.972LysAsp: 4.972 ± 0.36
5.499LysGlu: 5.499 ± 0.502
3.951LysPhe: 3.951 ± 0.36
4.379LysGly: 4.379 ± 0.363
1.745LysHis: 1.745 ± 0.268
4.248LysIle: 4.248 ± 0.328
4.215LysLys: 4.215 ± 0.421
6.092LysLeu: 6.092 ± 0.503
2.568LysMet: 2.568 ± 0.314
2.832LysAsn: 2.832 ± 0.33
2.14LysPro: 2.14 ± 0.318
2.371LysGln: 2.371 ± 0.264
3.392LysArg: 3.392 ± 0.355
4.577LysSer: 4.577 ± 0.39
4.314LysThr: 4.314 ± 0.412
4.906LysVal: 4.906 ± 0.577
1.251LysTrp: 1.251 ± 0.17
3.194LysTyr: 3.194 ± 0.333
0.0LysXaa: 0.0 ± 0.0
Leu
5.532LeuAla: 5.532 ± 0.499
0.659LeuCys: 0.659 ± 0.152
4.84LeuAsp: 4.84 ± 0.404
5.433LeuGlu: 5.433 ± 0.44
2.667LeuPhe: 2.667 ± 0.266
4.248LeuGly: 4.248 ± 0.377
1.251LeuHis: 1.251 ± 0.188
5.203LeuIle: 5.203 ± 0.427
5.795LeuLys: 5.795 ± 0.545
4.939LeuLeu: 4.939 ± 0.349
2.371LeuMet: 2.371 ± 0.294
3.82LeuAsn: 3.82 ± 0.332
3.293LeuPro: 3.293 ± 0.389
2.733LeuGln: 2.733 ± 0.331
3.754LeuArg: 3.754 ± 0.341
4.775LeuSer: 4.775 ± 0.407
4.083LeuThr: 4.083 ± 0.375
4.05LeuVal: 4.05 ± 0.384
0.79LeuTrp: 0.79 ± 0.169
2.996LeuTyr: 2.996 ± 0.36
0.0LeuXaa: 0.0 ± 0.0
Met
2.404MetAla: 2.404 ± 0.271
0.198MetCys: 0.198 ± 0.075
1.811MetAsp: 1.811 ± 0.217
1.515MetGlu: 1.515 ± 0.253
1.251MetPhe: 1.251 ± 0.177
1.548MetGly: 1.548 ± 0.238
0.593MetHis: 0.593 ± 0.143
1.745MetIle: 1.745 ± 0.272
2.964MetLys: 2.964 ± 0.275
2.173MetLeu: 2.173 ± 0.24
0.988MetMet: 0.988 ± 0.183
1.646MetAsn: 1.646 ± 0.258
0.988MetPro: 0.988 ± 0.153
1.054MetGln: 1.054 ± 0.182
1.416MetArg: 1.416 ± 0.229
2.272MetSer: 2.272 ± 0.252
1.515MetThr: 1.515 ± 0.166
1.416MetVal: 1.416 ± 0.184
0.263MetTrp: 0.263 ± 0.082
1.021MetTyr: 1.021 ± 0.192
0.0MetXaa: 0.0 ± 0.0
Asn
3.688AsnAla: 3.688 ± 0.358
0.527AsnCys: 0.527 ± 0.12
3.26AsnAsp: 3.26 ± 0.308
3.523AsnGlu: 3.523 ± 0.321
2.173AsnPhe: 2.173 ± 0.236
4.017AsnGly: 4.017 ± 0.385
0.955AsnHis: 0.955 ± 0.168
3.622AsnIle: 3.622 ± 0.345
2.865AsnLys: 2.865 ± 0.254
3.918AsnLeu: 3.918 ± 0.372
1.515AsnMet: 1.515 ± 0.273
3.194AsnAsn: 3.194 ± 0.501
2.601AsnPro: 2.601 ± 0.336
1.679AsnGln: 1.679 ± 0.247
2.799AsnArg: 2.799 ± 0.272
3.095AsnSer: 3.095 ± 0.318
3.029AsnThr: 3.029 ± 0.3
3.326AsnVal: 3.326 ± 0.287
0.757AsnTrp: 0.757 ± 0.168
2.338AsnTyr: 2.338 ± 0.247
0.0AsnXaa: 0.0 ± 0.0
Pro
2.799ProAla: 2.799 ± 0.323
0.296ProCys: 0.296 ± 0.09
2.568ProAsp: 2.568 ± 0.246
3.161ProGlu: 3.161 ± 0.402
1.35ProPhe: 1.35 ± 0.19
2.996ProGly: 2.996 ± 0.367
0.659ProHis: 0.659 ± 0.147
2.733ProIle: 2.733 ± 0.297
2.47ProLys: 2.47 ± 0.29
2.042ProLeu: 2.042 ± 0.229
0.955ProMet: 0.955 ± 0.173
1.745ProAsn: 1.745 ± 0.264
1.185ProPro: 1.185 ± 0.224
1.021ProGln: 1.021 ± 0.204
1.581ProArg: 1.581 ± 0.223
2.239ProSer: 2.239 ± 0.231
2.272ProThr: 2.272 ± 0.309
2.601ProVal: 2.601 ± 0.226
0.691ProTrp: 0.691 ± 0.14
1.35ProTyr: 1.35 ± 0.224
0.0ProXaa: 0.0 ± 0.0
Gln
2.305GlnAla: 2.305 ± 0.314
0.165GlnCys: 0.165 ± 0.071
2.206GlnAsp: 2.206 ± 0.286
2.404GlnGlu: 2.404 ± 0.299
1.383GlnPhe: 1.383 ± 0.247
2.206GlnGly: 2.206 ± 0.283
0.659GlnHis: 0.659 ± 0.166
2.206GlnIle: 2.206 ± 0.258
2.173GlnLys: 2.173 ± 0.284
2.964GlnLeu: 2.964 ± 0.305
1.317GlnMet: 1.317 ± 0.195
1.35GlnAsn: 1.35 ± 0.187
1.383GlnPro: 1.383 ± 0.248
1.087GlnGln: 1.087 ± 0.239
1.844GlnArg: 1.844 ± 0.271
1.613GlnSer: 1.613 ± 0.246
2.338GlnThr: 2.338 ± 0.291
2.535GlnVal: 2.535 ± 0.322
0.724GlnTrp: 0.724 ± 0.154
1.745GlnTyr: 1.745 ± 0.279
0.0GlnXaa: 0.0 ± 0.0
Arg
3.128ArgAla: 3.128 ± 0.341
0.593ArgCys: 0.593 ± 0.132
2.832ArgAsp: 2.832 ± 0.305
3.49ArgGlu: 3.49 ± 0.318
1.712ArgPhe: 1.712 ± 0.213
2.898ArgGly: 2.898 ± 0.32
0.724ArgHis: 0.724 ± 0.17
3.227ArgIle: 3.227 ± 0.295
3.49ArgLys: 3.49 ± 0.358
3.622ArgLeu: 3.622 ± 0.355
1.054ArgMet: 1.054 ± 0.155
2.601ArgAsn: 2.601 ± 0.283
1.054ArgPro: 1.054 ± 0.204
1.712ArgGln: 1.712 ± 0.286
2.305ArgArg: 2.305 ± 0.354
2.535ArgSer: 2.535 ± 0.294
2.667ArgThr: 2.667 ± 0.289
2.766ArgVal: 2.766 ± 0.346
1.087ArgTrp: 1.087 ± 0.196
1.943ArgTyr: 1.943 ± 0.249
0.0ArgXaa: 0.0 ± 0.0
Ser
5.038SerAla: 5.038 ± 0.4
0.691SerCys: 0.691 ± 0.149
4.05SerAsp: 4.05 ± 0.369
3.82SerGlu: 3.82 ± 0.388
2.832SerPhe: 2.832 ± 0.265
4.709SerGly: 4.709 ± 0.362
1.021SerHis: 1.021 ± 0.182
3.886SerIle: 3.886 ± 0.389
4.972SerLys: 4.972 ± 0.465
4.215SerLeu: 4.215 ± 0.385
1.548SerMet: 1.548 ± 0.214
3.128SerAsn: 3.128 ± 0.287
2.338SerPro: 2.338 ± 0.272
1.745SerGln: 1.745 ± 0.262
3.194SerArg: 3.194 ± 0.351
4.149SerSer: 4.149 ± 0.481
3.82SerThr: 3.82 ± 0.442
4.412SerVal: 4.412 ± 0.416
0.724SerTrp: 0.724 ± 0.184
2.535SerTyr: 2.535 ± 0.289
0.0SerXaa: 0.0 ± 0.0
Thr
4.314ThrAla: 4.314 ± 0.466
0.428ThrCys: 0.428 ± 0.128
2.964ThrAsp: 2.964 ± 0.374
3.853ThrGlu: 3.853 ± 0.415
2.535ThrPhe: 2.535 ± 0.363
3.655ThrGly: 3.655 ± 0.402
0.922ThrHis: 0.922 ± 0.179
3.886ThrIle: 3.886 ± 0.402
3.886ThrLys: 3.886 ± 0.404
4.347ThrLeu: 4.347 ± 0.497
1.284ThrMet: 1.284 ± 0.243
2.766ThrAsn: 2.766 ± 0.285
2.7ThrPro: 2.7 ± 0.328
2.173ThrGln: 2.173 ± 0.256
3.029ThrArg: 3.029 ± 0.302
3.457ThrSer: 3.457 ± 0.359
3.886ThrThr: 3.886 ± 0.412
5.203ThrVal: 5.203 ± 0.452
0.757ThrTrp: 0.757 ± 0.147
2.107ThrTyr: 2.107 ± 0.244
0.0ThrXaa: 0.0 ± 0.0
Val
4.643ValAla: 4.643 ± 0.49
0.823ValCys: 0.823 ± 0.164
5.269ValAsp: 5.269 ± 0.446
5.466ValGlu: 5.466 ± 0.444
3.425ValPhe: 3.425 ± 0.33
3.688ValGly: 3.688 ± 0.467
1.12ValHis: 1.12 ± 0.17
4.511ValIle: 4.511 ± 0.374
5.499ValLys: 5.499 ± 0.439
4.116ValLeu: 4.116 ± 0.379
1.943ValMet: 1.943 ± 0.199
4.05ValAsn: 4.05 ± 0.319
2.568ValPro: 2.568 ± 0.322
2.667ValGln: 2.667 ± 0.253
2.437ValArg: 2.437 ± 0.266
3.918ValSer: 3.918 ± 0.424
3.787ValThr: 3.787 ± 0.48
4.445ValVal: 4.445 ± 0.406
1.152ValTrp: 1.152 ± 0.158
2.832ValTyr: 2.832 ± 0.271
0.0ValXaa: 0.0 ± 0.0
Trp
0.626TrpAla: 0.626 ± 0.129
0.099TrpCys: 0.099 ± 0.056
1.087TrpAsp: 1.087 ± 0.217
1.054TrpGlu: 1.054 ± 0.196
0.757TrpPhe: 0.757 ± 0.15
0.691TrpGly: 0.691 ± 0.163
0.165TrpHis: 0.165 ± 0.071
0.823TrpIle: 0.823 ± 0.157
1.548TrpLys: 1.548 ± 0.226
1.218TrpLeu: 1.218 ± 0.22
0.428TrpMet: 0.428 ± 0.102
1.021TrpAsn: 1.021 ± 0.212
0.461TrpPro: 0.461 ± 0.115
0.494TrpGln: 0.494 ± 0.126
0.56TrpArg: 0.56 ± 0.128
0.922TrpSer: 0.922 ± 0.155
1.152TrpThr: 1.152 ± 0.185
1.087TrpVal: 1.087 ± 0.154
0.329TrpTrp: 0.329 ± 0.117
0.856TrpTyr: 0.856 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.293TyrAla: 3.293 ± 0.326
0.593TyrCys: 0.593 ± 0.149
3.128TyrAsp: 3.128 ± 0.276
2.305TyrGlu: 2.305 ± 0.314
1.778TyrPhe: 1.778 ± 0.246
2.404TyrGly: 2.404 ± 0.292
0.659TyrHis: 0.659 ± 0.154
2.7TyrIle: 2.7 ± 0.283
2.601TyrLys: 2.601 ± 0.28
2.733TyrLeu: 2.733 ± 0.366
0.823TyrMet: 0.823 ± 0.141
3.095TyrAsn: 3.095 ± 0.331
1.581TyrPro: 1.581 ± 0.244
1.712TyrGln: 1.712 ± 0.233
2.173TyrArg: 2.173 ± 0.272
3.062TyrSer: 3.062 ± 0.349
1.745TyrThr: 1.745 ± 0.215
3.26TyrVal: 3.26 ± 0.367
0.79TyrTrp: 0.79 ± 0.163
1.745TyrTyr: 1.745 ± 0.241
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 153 proteins (30370 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski