Amino acid dipeptide frequency for Klebsiella phage K64-1

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
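The counting described above can be sketched in Python. This is a minimal illustration, not the database's actual pipeline: the function names, the use of overlapping two-residue windows within each protein, the one-letter codes, and the mapping of every non-standard residue to `X` (Xaa) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies for all 441 dipeptides (20 standard + Xaa).

    Assumption: dipeptides are counted as overlapping windows of length 2
    and never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_permille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation (hypothetical helper)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

With this sketch, `classify(9.619)` gives `"over"` (the SerThr value in the table) and `classify(0.102)` gives `"under"` (CysCys). Whether the page colours cells by exactly this threshold is not stated in the source.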

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
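As a small worked example of the unit conversion, the following Python sketch turns a per-mille value into a fraction or a percentage. The helper names are hypothetical and only illustrate the arithmetic.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (1% = 10 per mille)."""
    return value_permille / 10.0


# SerThr is listed as 9.619 per mille, i.e. a fraction of about 0.009619
# or about 0.96% of all dipeptides.
ser_thr_fraction = permille_to_fraction(9.619)
ser_thr_percent = permille_to_percent(9.619)
```

On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.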

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.842 ± 0.402
AlaCys: 0.482 ± 0.119
AlaAsp: 2.183 ± 0.244
AlaGlu: 2.741 ± 0.301
AlaPhe: 1.65 ± 0.196
AlaGly: 3.223 ± 0.453
AlaHis: 0.533 ± 0.12
AlaIle: 3.629 ± 0.357
AlaLys: 2.893 ± 0.346
AlaLeu: 3.578 ± 0.361
AlaMet: 1.294 ± 0.209
AlaAsn: 3.071 ± 0.245
AlaPro: 1.345 ± 0.234
AlaGln: 1.7 ± 0.199
AlaArg: 2.157 ± 0.25
AlaSer: 3.883 ± 0.433
AlaThr: 2.842 ± 0.264
AlaVal: 2.614 ± 0.298
AlaTrp: 0.558 ± 0.12
AlaTyr: 2.259 ± 0.22
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.609 ± 0.147
CysCys: 0.102 ± 0.049
CysAsp: 0.787 ± 0.182
CysGlu: 0.431 ± 0.114
CysPhe: 0.381 ± 0.103
CysGly: 1.041 ± 0.207
CysHis: 0.33 ± 0.126
CysIle: 0.685 ± 0.136
CysLys: 1.041 ± 0.187
CysLeu: 0.609 ± 0.132
CysMet: 0.076 ± 0.043
CysAsn: 0.964 ± 0.205
CysPro: 0.584 ± 0.176
CysGln: 0.355 ± 0.089
CysArg: 0.558 ± 0.135
CysSer: 0.914 ± 0.195
CysThr: 1.041 ± 0.139
CysVal: 0.711 ± 0.139
CysTrp: 0.127 ± 0.059
CysTyr: 0.431 ± 0.136
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.223 ± 0.314
AspCys: 0.787 ± 0.177
AspAsp: 3.959 ± 0.369
AspGlu: 4.111 ± 0.355
AspPhe: 3.477 ± 0.316
AspGly: 5.152 ± 0.402
AspHis: 0.685 ± 0.125
AspIle: 5.71 ± 0.428
AspLys: 3.807 ± 0.334
AspLeu: 4.467 ± 0.386
AspMet: 1.853 ± 0.226
AspAsn: 4.314 ± 0.428
AspPro: 2.817 ± 0.249
AspGln: 1.32 ± 0.193
AspArg: 1.523 ± 0.21
AspSer: 5.482 ± 0.511
AspThr: 4.137 ± 0.355
AspVal: 4.34 ± 0.349
AspTrp: 1.091 ± 0.152
AspTyr: 3.375 ± 0.298
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.716 ± 0.262
GluCys: 0.761 ± 0.173
GluAsp: 3.274 ± 0.373
GluGlu: 3.68 ± 0.383
GluPhe: 3.147 ± 0.318
GluGly: 2.081 ± 0.279
GluHis: 1.041 ± 0.219
GluIle: 5.05 ± 0.348
GluLys: 3.908 ± 0.468
GluLeu: 5.964 ± 0.528
GluMet: 1.497 ± 0.231
GluAsn: 4.086 ± 0.316
GluPro: 1.65 ± 0.159
GluGln: 2.208 ± 0.241
GluArg: 2.183 ± 0.254
GluSer: 3.578 ± 0.429
GluThr: 3.528 ± 0.308
GluVal: 3.401 ± 0.31
GluTrp: 0.939 ± 0.145
GluTyr: 3.096 ± 0.308
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.497 ± 0.176
PheCys: 0.406 ± 0.095
PheAsp: 3.578 ± 0.269
PheGlu: 2.462 ± 0.259
PhePhe: 1.32 ± 0.285
PheGly: 3.045 ± 0.273
PheHis: 0.584 ± 0.12
PheIle: 3.274 ± 0.302
PheLys: 3.071 ± 0.32
PheLeu: 2.233 ± 0.224
PheMet: 0.964 ± 0.151
PheAsn: 4.213 ± 0.321
PhePro: 1.193 ± 0.182
PheGln: 1.472 ± 0.182
PheArg: 1.193 ± 0.158
PheSer: 3.781 ± 0.272
PheThr: 2.868 ± 0.279
PheVal: 2.944 ± 0.255
PheTrp: 0.457 ± 0.107
PheTyr: 2.259 ± 0.248
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.233 ± 0.282
GlyCys: 0.685 ± 0.171
GlyAsp: 3.807 ± 0.392
GlyGlu: 3.248 ± 0.31
GlyPhe: 2.716 ± 0.261
GlyGly: 3.274 ± 0.371
GlyHis: 0.736 ± 0.114
GlyIle: 4.771 ± 0.445
GlyLys: 3.883 ± 0.331
GlyLeu: 4.771 ± 0.325
GlyMet: 1.396 ± 0.198
GlyAsn: 4.847 ± 0.478
GlyPro: 0.508 ± 0.11
GlyGln: 2.233 ± 0.253
GlyArg: 2.589 ± 0.273
GlySer: 6.395 ± 0.558
GlyThr: 6.7 ± 0.747
GlyVal: 4.467 ± 0.398
GlyTrp: 0.711 ± 0.125
GlyTyr: 3.604 ± 0.373
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.634 ± 0.138
HisCys: 0.431 ± 0.153
HisAsp: 0.837 ± 0.143
HisGlu: 0.939 ± 0.193
HisPhe: 0.685 ± 0.138
HisGly: 0.837 ± 0.201
HisHis: 0.203 ± 0.092
HisIle: 0.939 ± 0.198
HisLys: 0.99 ± 0.186
HisLeu: 1.041 ± 0.146
HisMet: 0.406 ± 0.1
HisAsn: 1.193 ± 0.227
HisPro: 0.406 ± 0.092
HisGln: 0.457 ± 0.128
HisArg: 0.558 ± 0.139
HisSer: 0.837 ± 0.133
HisThr: 0.609 ± 0.123
HisVal: 0.939 ± 0.147
HisTrp: 0.102 ± 0.053
HisTyr: 0.863 ± 0.145
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.35 ± 0.382
IleCys: 0.711 ± 0.123
IleAsp: 5.38 ± 0.408
IleGlu: 4.416 ± 0.413
IlePhe: 2.183 ± 0.189
IleGly: 4.289 ± 0.32
IleHis: 1.37 ± 0.188
IleIle: 4.568 ± 0.33
IleLys: 5.786 ± 0.498
IleLeu: 5.786 ± 0.408
IleMet: 1.472 ± 0.19
IleAsn: 7.131 ± 0.686
IlePro: 2.792 ± 0.264
IleGln: 3.35 ± 0.294
IleArg: 3.578 ± 0.304
IleSer: 7.055 ± 0.52
IleThr: 5.482 ± 0.435
IleVal: 3.807 ± 0.31
IleTrp: 0.66 ± 0.124
IleTyr: 2.716 ± 0.278
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.766 ± 0.355
LysCys: 1.142 ± 0.23
LysAsp: 4.061 ± 0.329
LysGlu: 5.482 ± 0.602
LysPhe: 2.995 ± 0.303
LysGly: 2.944 ± 0.265
LysHis: 1.041 ± 0.199
LysIle: 5.431 ± 0.517
LysLys: 4.365 ± 0.565
LysLeu: 5.38 ± 0.437
LysMet: 1.472 ± 0.227
LysAsn: 5.05 ± 0.418
LysPro: 2.056 ± 0.223
LysGln: 2.868 ± 0.306
LysArg: 2.563 ± 0.314
LysSer: 4.695 ± 0.445
LysThr: 4.289 ± 0.322
LysVal: 3.705 ± 0.352
LysTrp: 0.457 ± 0.131
LysTyr: 3.68 ± 0.38
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.248 ± 0.315
LeuCys: 0.761 ± 0.14
LeuAsp: 5.406 ± 0.421
LeuGlu: 4.111 ± 0.421
LeuPhe: 2.868 ± 0.255
LeuGly: 4.061 ± 0.381
LeuHis: 0.837 ± 0.132
LeuIle: 5.177 ± 0.351
LeuLys: 6.192 ± 0.558
LeuLeu: 4.594 ± 0.419
LeuMet: 1.523 ± 0.181
LeuAsn: 5.406 ± 0.345
LeuPro: 3.426 ± 0.281
LeuGln: 3.299 ± 0.296
LeuArg: 3.122 ± 0.302
LeuSer: 5.989 ± 0.406
LeuThr: 4.137 ± 0.329
LeuVal: 4.061 ± 0.371
LeuTrp: 0.736 ± 0.109
LeuTyr: 4.061 ± 0.417
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.447 ± 0.225
MetCys: 0.127 ± 0.057
MetAsp: 1.32 ± 0.181
MetGlu: 1.269 ± 0.186
MetPhe: 1.117 ± 0.168
MetGly: 1.37 ± 0.187
MetHis: 0.254 ± 0.085
MetIle: 1.218 ± 0.158
MetLys: 1.777 ± 0.226
MetLeu: 1.37 ± 0.198
MetMet: 0.508 ± 0.111
MetAsn: 1.472 ± 0.157
MetPro: 0.609 ± 0.133
MetGln: 1.015 ± 0.142
MetArg: 1.066 ± 0.162
MetSer: 1.853 ± 0.22
MetThr: 1.497 ± 0.196
MetVal: 1.447 ± 0.2
MetTrp: 0.305 ± 0.105
MetTyr: 1.117 ± 0.184
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.35 ± 0.326
AsnCys: 1.193 ± 0.194
AsnAsp: 3.959 ± 0.282
AsnGlu: 3.705 ± 0.353
AsnPhe: 3.147 ± 0.282
AsnGly: 6.345 ± 0.451
AsnHis: 0.685 ± 0.138
AsnIle: 7.081 ± 0.478
AsnLys: 5.076 ± 0.503
AsnLeu: 5.101 ± 0.383
AsnMet: 1.726 ± 0.218
AsnAsn: 5.913 ± 0.463
AsnPro: 2.741 ± 0.224
AsnGln: 1.7 ± 0.228
AsnArg: 2.436 ± 0.218
AsnSer: 7.005 ± 0.486
AsnThr: 6.142 ± 0.466
AsnVal: 4.543 ± 0.277
AsnTrp: 0.634 ± 0.126
AsnTyr: 3.578 ± 0.36
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.726 ± 0.148
ProCys: 0.203 ± 0.072
ProAsp: 2.969 ± 0.257
ProGlu: 2.259 ± 0.31
ProPhe: 1.37 ± 0.207
ProGly: 1.802 ± 0.188
ProHis: 0.482 ± 0.118
ProIle: 1.878 ± 0.207
ProLys: 1.853 ± 0.301
ProLeu: 2.233 ± 0.18
ProMet: 0.533 ± 0.114
ProAsn: 2.386 ± 0.243
ProPro: 0.736 ± 0.138
ProGln: 1.244 ± 0.165
ProArg: 1.091 ± 0.173
ProSer: 2.665 ± 0.206
ProThr: 2.716 ± 0.292
ProVal: 2.614 ± 0.263
ProTrp: 0.355 ± 0.09
ProTyr: 1.929 ± 0.203
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.878 ± 0.261
GlnCys: 0.33 ± 0.08
GlnAsp: 2.411 ± 0.259
GlnGlu: 2.081 ± 0.274
GlnPhe: 1.853 ± 0.187
GlnGly: 2.284 ± 0.312
GlnHis: 0.457 ± 0.108
GlnIle: 2.716 ± 0.249
GlnLys: 2.411 ± 0.288
GlnLeu: 2.512 ± 0.249
GlnMet: 0.914 ± 0.154
GlnAsn: 3.071 ± 0.34
GlnPro: 1.218 ± 0.187
GlnGln: 1.599 ± 0.202
GlnArg: 1.269 ± 0.176
GlnSer: 2.741 ± 0.26
GlnThr: 2.005 ± 0.283
GlnVal: 1.878 ± 0.192
GlnTrp: 0.508 ± 0.132
GlnTyr: 2.563 ± 0.313
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.624 ± 0.277
ArgCys: 0.584 ± 0.119
ArgAsp: 2.411 ± 0.246
ArgGlu: 1.777 ± 0.274
ArgPhe: 1.777 ± 0.224
ArgGly: 2.183 ± 0.243
ArgHis: 0.685 ± 0.132
ArgIle: 3.071 ± 0.203
ArgLys: 2.411 ± 0.308
ArgLeu: 2.563 ± 0.229
ArgMet: 1.041 ± 0.164
ArgAsn: 2.132 ± 0.195
ArgPro: 1.193 ± 0.175
ArgGln: 1.447 ± 0.195
ArgArg: 1.193 ± 0.181
ArgSer: 2.614 ± 0.24
ArgThr: 2.233 ± 0.225
ArgVal: 2.208 ± 0.221
ArgTrp: 0.533 ± 0.118
ArgTyr: 1.573 ± 0.269
ArgXaa: 0.025 ± 0.022
Ser
SerAla: 3.604 ± 0.371
SerCys: 1.091 ± 0.186
SerAsp: 5.279 ± 0.433
SerGlu: 3.705 ± 0.331
SerPhe: 3.375 ± 0.237
SerGly: 6.192 ± 0.67
SerHis: 0.66 ± 0.155
SerIle: 6.066 ± 0.351
SerLys: 4.543 ± 0.328
SerLeu: 5.685 ± 0.338
SerMet: 1.548 ± 0.207
SerAsn: 7.081 ± 0.498
SerPro: 2.335 ± 0.18
SerGln: 2.436 ± 0.247
SerArg: 2.309 ± 0.259
SerSer: 7.487 ± 0.625
SerThr: 9.619 ± 0.635
SerVal: 5.583 ± 0.272
SerTrp: 1.142 ± 0.181
SerTyr: 4.061 ± 0.403
SerXaa: 0.025 ± 0.024
Thr
ThrAla: 3.781 ± 0.408
ThrCys: 0.533 ± 0.123
ThrAsp: 4.467 ± 0.348
ThrGlu: 4.213 ± 0.312
ThrPhe: 3.045 ± 0.319
ThrGly: 5.761 ± 0.569
ThrHis: 0.837 ± 0.131
ThrIle: 5.355 ± 0.401
ThrLys: 4.771 ± 0.417
ThrLeu: 5.203 ± 0.455
ThrMet: 1.218 ± 0.15
ThrAsn: 4.746 ± 0.499
ThrPro: 2.893 ± 0.306
ThrGln: 2.259 ± 0.285
ThrArg: 2.081 ± 0.258
ThrSer: 6.827 ± 0.629
ThrThr: 5.127 ± 0.457
ThrVal: 5.253 ± 0.412
ThrTrp: 0.761 ± 0.164
ThrTyr: 3.553 ± 0.339
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.411 ± 0.298
ValCys: 0.508 ± 0.113
ValAsp: 4.568 ± 0.496
ValGlu: 3.578 ± 0.331
ValPhe: 2.893 ± 0.25
ValGly: 3.502 ± 0.262
ValHis: 1.345 ± 0.19
ValIle: 4.213 ± 0.319
ValLys: 3.401 ± 0.288
ValLeu: 6.091 ± 0.39
ValMet: 1.294 ± 0.196
ValAsn: 3.832 ± 0.301
ValPro: 3.071 ± 0.332
ValGln: 3.426 ± 0.348
ValArg: 1.726 ± 0.288
ValSer: 5.279 ± 0.447
ValThr: 3.984 ± 0.338
ValVal: 3.807 ± 0.414
ValTrp: 0.609 ± 0.127
ValTyr: 2.741 ± 0.356
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.381 ± 0.112
TrpCys: 0.228 ± 0.09
TrpAsp: 0.787 ± 0.142
TrpGlu: 0.761 ± 0.107
TrpPhe: 0.33 ± 0.079
TrpGly: 0.609 ± 0.124
TrpHis: 0.33 ± 0.088
TrpIle: 0.711 ± 0.137
TrpLys: 1.015 ± 0.158
TrpLeu: 0.66 ± 0.137
TrpMet: 0.254 ± 0.087
TrpAsn: 0.964 ± 0.149
TrpPro: 0.127 ± 0.055
TrpGln: 0.508 ± 0.118
TrpArg: 0.279 ± 0.079
TrpSer: 1.218 ± 0.168
TrpThr: 0.66 ± 0.128
TrpVal: 0.914 ± 0.142
TrpTrp: 0.102 ± 0.055
TrpTyr: 0.609 ± 0.108
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.157 ± 0.24
TyrCys: 0.711 ± 0.171
TyrAsp: 4.187 ± 0.343
TyrGlu: 2.817 ± 0.343
TyrPhe: 2.487 ± 0.241
TyrGly: 3.553 ± 0.269
TyrHis: 0.812 ± 0.178
TyrIle: 4.34 ± 0.404
TyrLys: 3.223 ± 0.327
TyrLeu: 3.172 ± 0.284
TyrMet: 1.142 ± 0.182
TyrAsn: 4.187 ± 0.317
TyrPro: 1.345 ± 0.211
TyrGln: 1.827 ± 0.268
TyrArg: 1.802 ± 0.194
TyrSer: 3.528 ± 0.39
TyrThr: 3.071 ± 0.339
TyrVal: 3.045 ± 0.229
TyrTrp: 0.66 ± 0.113
TyrTyr: 2.106 ± 0.414
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.025 ± 0.024
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.025 ± 0.022
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (39404 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
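A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the sample standard deviation of the replicate values. The source does not say which spread estimator is used, and the function names are hypothetical.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping windows (assumed scheme)."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Assumption: the reported error is the standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.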

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski