Amino acid dipeptide frequency for Klebsiella phage vB_KpnM_15-38_KLPPOU148

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
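The calculation described above can be sketched in a few lines of Python. This is only an illustration: the function name `dipeptide_permille`, the toy sequences, and the choice to normalise by the total number of overlapping pairs are assumptions, and the database may count or normalise differently.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Expected frequency of each standard dipeptide under a uniform model, in per mille.
UNIFORM_PERMILLE = 1000 / 400  # 2.5

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000 * n / total for dp, n in counts.items()}

# Hypothetical toy input, not taken from the proteome.
toy = ["MAAGK", "AAV"]
freqs = dipeptide_permille(toy)

# Dipeptides above / below the uniform expectation (red / blue in the table).
over = {dp for dp, f in freqs.items() if f > UNIFORM_PERMILLE}
under = {dp for dp, f in freqs.items() if f < UNIFORM_PERMILLE}
```

With such a small toy input every observed dipeptide lands far above 2.5‰; the comparison only becomes meaningful for a whole proteome.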

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.141 ± 0.986
AlaCys: 0.974 ± 0.333
AlaAsp: 4.871 ± 0.46
AlaGlu: 5.567 ± 0.718
AlaPhe: 2.227 ± 0.392
AlaGly: 6.75 ± 0.984
AlaHis: 1.113 ± 0.275
AlaIle: 3.966 ± 0.611
AlaLys: 5.219 ± 0.701
AlaLeu: 7.098 ± 0.63
AlaMet: 2.783 ± 0.325
AlaAsn: 3.41 ± 0.45
AlaPro: 2.227 ± 0.425
AlaGln: 4.105 ± 0.601
AlaArg: 4.871 ± 0.621
AlaSer: 5.149 ± 0.77
AlaThr: 4.732 ± 0.621
AlaVal: 5.984 ± 0.778
AlaTrp: 1.044 ± 0.243
AlaTyr: 3.549 ± 0.459
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.322 ± 0.32
CysCys: 0.139 ± 0.107
CysAsp: 1.113 ± 0.316
CysGlu: 1.253 ± 0.286
CysPhe: 0.209 ± 0.106
CysGly: 0.974 ± 0.3
CysHis: 0.418 ± 0.18
CysIle: 0.905 ± 0.275
CysLys: 0.696 ± 0.228
CysLeu: 0.905 ± 0.212
CysMet: 0.209 ± 0.094
CysAsn: 0.835 ± 0.252
CysPro: 1.113 ± 0.335
CysGln: 0.557 ± 0.203
CysArg: 0.974 ± 0.296
CysSer: 0.835 ± 0.204
CysThr: 1.253 ± 0.351
CysVal: 0.974 ± 0.341
CysTrp: 0.348 ± 0.173
CysTyr: 0.348 ± 0.144
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.332 ± 0.674
AspCys: 0.557 ± 0.262
AspAsp: 3.758 ± 0.589
AspGlu: 4.105 ± 0.502
AspPhe: 3.201 ± 0.552
AspGly: 6.68 ± 0.631
AspHis: 0.974 ± 0.201
AspIle: 3.758 ± 0.502
AspLys: 2.366 ± 0.377
AspLeu: 3.618 ± 0.512
AspMet: 1.461 ± 0.333
AspAsn: 3.062 ± 0.483
AspPro: 2.575 ± 0.32
AspGln: 1.461 ± 0.293
AspArg: 2.783 ± 0.4
AspSer: 3.618 ± 0.39
AspThr: 3.41 ± 0.498
AspVal: 5.01 ± 0.541
AspTrp: 1.461 ± 0.256
AspTyr: 2.366 ± 0.378
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.358 ± 0.728
GluCys: 0.905 ± 0.315
GluAsp: 2.923 ± 0.484
GluGlu: 3.897 ± 0.527
GluPhe: 1.948 ± 0.393
GluGly: 3.549 ± 0.429
GluHis: 1.113 ± 0.268
GluIle: 4.314 ± 0.568
GluLys: 2.853 ± 0.544
GluLeu: 5.428 ± 0.63
GluMet: 2.227 ± 0.493
GluAsn: 2.923 ± 0.399
GluPro: 1.879 ± 0.375
GluGln: 3.131 ± 0.514
GluArg: 3.131 ± 0.435
GluSer: 3.27 ± 0.402
GluThr: 3.41 ± 0.451
GluVal: 3.27 ± 0.43
GluTrp: 1.67 ± 0.434
GluTyr: 2.853 ± 0.46
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.923 ± 0.506
PheCys: 0.905 ± 0.302
PheAsp: 2.366 ± 0.334
PheGlu: 2.088 ± 0.338
PhePhe: 1.461 ± 0.343
PheGly: 2.435 ± 0.369
PheHis: 0.765 ± 0.238
PheIle: 2.088 ± 0.453
PheLys: 1.879 ± 0.288
PheLeu: 1.531 ± 0.323
PheMet: 0.626 ± 0.183
PheAsn: 2.296 ± 0.39
PhePro: 1.531 ± 0.339
PheGln: 1.113 ± 0.265
PheArg: 1.74 ± 0.336
PheSer: 2.088 ± 0.355
PheThr: 2.714 ± 0.421
PheVal: 2.435 ± 0.369
PheTrp: 0.626 ± 0.199
PheTyr: 1.531 ± 0.299
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.288 ± 0.931
GlyCys: 1.531 ± 0.417
GlyAsp: 4.453 ± 0.454
GlyGlu: 4.036 ± 0.526
GlyPhe: 2.575 ± 0.38
GlyGly: 6.263 ± 1.048
GlyHis: 1.67 ± 0.428
GlyIle: 4.871 ± 0.673
GlyLys: 4.593 ± 0.567
GlyLeu: 4.314 ± 0.53
GlyMet: 2.296 ± 0.447
GlyAsn: 3.062 ± 0.462
GlyPro: 2.157 ± 0.372
GlyGln: 3.27 ± 0.315
GlyArg: 4.036 ± 0.449
GlySer: 4.245 ± 0.723
GlyThr: 5.915 ± 0.915
GlyVal: 6.054 ± 0.719
GlyTrp: 1.253 ± 0.283
GlyTyr: 3.27 ± 0.353
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.67 ± 0.354
HisCys: 0.696 ± 0.239
HisAsp: 1.044 ± 0.312
HisGlu: 0.835 ± 0.219
HisPhe: 0.487 ± 0.199
HisGly: 2.018 ± 0.634
HisHis: 0.974 ± 0.232
HisIle: 1.67 ± 0.359
HisLys: 0.974 ± 0.271
HisLeu: 1.531 ± 0.249
HisMet: 0.139 ± 0.086
HisAsn: 0.765 ± 0.225
HisPro: 0.905 ± 0.235
HisGln: 0.487 ± 0.167
HisArg: 0.557 ± 0.168
HisSer: 1.322 ± 0.26
HisThr: 1.113 ± 0.26
HisVal: 1.74 ± 0.363
HisTrp: 0.487 ± 0.176
HisTyr: 0.696 ± 0.239
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.966 ± 0.529
IleCys: 1.183 ± 0.326
IleAsp: 4.036 ± 0.42
IleGlu: 4.453 ± 0.634
IlePhe: 2.227 ± 0.441
IleGly: 3.618 ± 0.456
IleHis: 0.835 ± 0.309
IleIle: 3.618 ± 0.59
IleLys: 4.314 ± 0.524
IleLeu: 3.062 ± 0.456
IleMet: 1.67 ± 0.397
IleAsn: 2.992 ± 0.586
IlePro: 3.27 ± 0.487
IleGln: 1.6 ± 0.39
IleArg: 3.131 ± 0.466
IleSer: 3.41 ± 0.442
IleThr: 4.662 ± 0.592
IleVal: 4.662 ± 0.615
IleTrp: 0.418 ± 0.173
IleTyr: 1.392 ± 0.289
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.149 ± 0.649
LysCys: 0.765 ± 0.241
LysAsp: 3.827 ± 0.637
LysGlu: 2.435 ± 0.463
LysPhe: 2.296 ± 0.343
LysGly: 2.923 ± 0.461
LysHis: 1.461 ± 0.298
LysIle: 3.688 ± 0.466
LysLys: 2.296 ± 0.34
LysLeu: 4.453 ± 0.601
LysMet: 1.948 ± 0.364
LysAsn: 2.366 ± 0.465
LysPro: 2.435 ± 0.326
LysGln: 2.714 ± 0.419
LysArg: 4.245 ± 0.698
LysSer: 3.688 ± 0.478
LysThr: 2.714 ± 0.397
LysVal: 3.062 ± 0.506
LysTrp: 0.905 ± 0.241
LysTyr: 1.392 ± 0.428
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.219 ± 0.589
LeuCys: 1.392 ± 0.345
LeuAsp: 6.263 ± 0.699
LeuGlu: 4.732 ± 0.578
LeuPhe: 2.018 ± 0.339
LeuGly: 4.871 ± 0.501
LeuHis: 1.6 ± 0.317
LeuIle: 4.314 ± 0.572
LeuLys: 4.453 ± 0.658
LeuLeu: 5.358 ± 0.672
LeuMet: 1.879 ± 0.356
LeuAsn: 4.036 ± 0.515
LeuPro: 2.644 ± 0.406
LeuGln: 2.853 ± 0.427
LeuArg: 4.314 ± 0.48
LeuSer: 4.732 ± 0.508
LeuThr: 4.384 ± 0.46
LeuVal: 4.105 ± 0.533
LeuTrp: 1.392 ± 0.307
LeuTyr: 2.505 ± 0.35
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.435 ± 0.423
MetCys: 0.487 ± 0.2
MetAsp: 1.253 ± 0.277
MetGlu: 1.113 ± 0.266
MetPhe: 1.392 ± 0.269
MetGly: 1.74 ± 0.336
MetHis: 0.348 ± 0.164
MetIle: 1.74 ± 0.285
MetLys: 0.835 ± 0.207
MetLeu: 2.157 ± 0.387
MetMet: 0.905 ± 0.235
MetAsn: 0.974 ± 0.254
MetPro: 1.183 ± 0.317
MetGln: 1.183 ± 0.264
MetArg: 2.366 ± 0.483
MetSer: 2.227 ± 0.449
MetThr: 2.575 ± 0.31
MetVal: 1.948 ± 0.322
MetTrp: 0.139 ± 0.108
MetTyr: 0.835 ± 0.188
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.688 ± 0.587
AsnCys: 0.974 ± 0.25
AsnAsp: 2.783 ± 0.379
AsnGlu: 2.366 ± 0.463
AsnPhe: 1.6 ± 0.339
AsnGly: 3.897 ± 0.611
AsnHis: 1.322 ± 0.315
AsnIle: 2.853 ± 0.349
AsnLys: 2.575 ± 0.545
AsnLeu: 2.992 ± 0.438
AsnMet: 1.183 ± 0.283
AsnAsn: 2.992 ± 0.499
AsnPro: 2.575 ± 0.503
AsnGln: 1.461 ± 0.247
AsnArg: 1.809 ± 0.305
AsnSer: 3.618 ± 0.621
AsnThr: 2.992 ± 0.492
AsnVal: 3.201 ± 0.389
AsnTrp: 0.974 ± 0.29
AsnTyr: 1.461 ± 0.277
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.479 ± 0.56
ProCys: 0.557 ± 0.144
ProAsp: 2.575 ± 0.401
ProGlu: 2.714 ± 0.415
ProPhe: 1.253 ± 0.278
ProGly: 3.479 ± 0.609
ProHis: 0.696 ± 0.194
ProIle: 1.74 ± 0.37
ProLys: 1.74 ± 0.334
ProLeu: 3.201 ± 0.414
ProMet: 0.557 ± 0.161
ProAsn: 2.018 ± 0.32
ProPro: 1.948 ± 0.465
ProGln: 1.6 ± 0.264
ProArg: 2.296 ± 0.332
ProSer: 1.809 ± 0.339
ProThr: 2.435 ± 0.418
ProVal: 4.732 ± 0.653
ProTrp: 0.974 ± 0.269
ProTyr: 1.322 ± 0.316
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.783 ± 0.521
GlnCys: 0.278 ± 0.129
GlnAsp: 1.67 ± 0.29
GlnGlu: 2.435 ± 0.338
GlnPhe: 1.879 ± 0.286
GlnGly: 2.366 ± 0.392
GlnHis: 0.835 ± 0.203
GlnIle: 2.435 ± 0.451
GlnLys: 1.392 ± 0.334
GlnLeu: 4.384 ± 0.526
GlnMet: 1.322 ± 0.301
GlnAsn: 1.253 ± 0.294
GlnPro: 1.322 ± 0.299
GlnGln: 2.853 ± 0.503
GlnArg: 2.366 ± 0.455
GlnSer: 2.575 ± 0.471
GlnThr: 2.714 ± 0.406
GlnVal: 2.853 ± 0.416
GlnTrp: 0.487 ± 0.185
GlnTyr: 1.879 ± 0.322
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.966 ± 0.562
ArgCys: 0.765 ± 0.217
ArgAsp: 3.062 ± 0.551
ArgGlu: 3.41 ± 0.503
ArgPhe: 1.6 ± 0.347
ArgGly: 2.992 ± 0.48
ArgHis: 1.6 ± 0.298
ArgIle: 3.34 ± 0.411
ArgLys: 3.897 ± 0.583
ArgLeu: 5.08 ± 0.515
ArgMet: 1.044 ± 0.247
ArgAsn: 2.575 ± 0.343
ArgPro: 1.948 ± 0.343
ArgGln: 2.435 ± 0.461
ArgArg: 3.201 ± 0.607
ArgSer: 2.714 ± 0.382
ArgThr: 2.157 ± 0.295
ArgVal: 3.41 ± 0.538
ArgTrp: 1.461 ± 0.339
ArgTyr: 2.366 ± 0.345
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.358 ± 0.873
SerCys: 0.418 ± 0.142
SerAsp: 3.966 ± 0.454
SerGlu: 3.201 ± 0.576
SerPhe: 2.018 ± 0.41
SerGly: 5.497 ± 0.634
SerHis: 1.322 ± 0.318
SerIle: 2.435 ± 0.444
SerLys: 4.105 ± 0.623
SerLeu: 5.149 ± 0.51
SerMet: 1.948 ± 0.416
SerAsn: 2.992 ± 0.514
SerPro: 2.227 ± 0.449
SerGln: 2.783 ± 0.367
SerArg: 2.366 ± 0.399
SerSer: 3.41 ± 0.677
SerThr: 3.897 ± 0.596
SerVal: 3.897 ± 0.631
SerTrp: 0.835 ± 0.239
SerTyr: 2.505 ± 0.319
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.845 ± 0.788
ThrCys: 0.765 ± 0.238
ThrAsp: 4.175 ± 0.486
ThrGlu: 3.34 ± 0.641
ThrPhe: 2.366 ± 0.453
ThrGly: 6.054 ± 0.681
ThrHis: 1.531 ± 0.377
ThrIle: 4.175 ± 0.525
ThrLys: 2.923 ± 0.518
ThrLeu: 5.08 ± 0.682
ThrMet: 1.809 ± 0.348
ThrAsn: 2.296 ± 0.463
ThrPro: 4.036 ± 0.605
ThrGln: 2.227 ± 0.375
ThrArg: 2.505 ± 0.355
ThrSer: 4.036 ± 0.495
ThrThr: 4.105 ± 0.642
ThrVal: 5.567 ± 0.718
ThrTrp: 1.183 ± 0.341
ThrTyr: 2.366 ± 0.409
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.541 ± 0.793
ValCys: 1.044 ± 0.283
ValAsp: 4.871 ± 0.618
ValGlu: 4.871 ± 0.676
ValPhe: 2.505 ± 0.424
ValGly: 4.732 ± 0.543
ValHis: 0.974 ± 0.378
ValIle: 3.897 ± 0.525
ValLys: 4.384 ± 0.579
ValLeu: 4.105 ± 0.555
ValMet: 2.157 ± 0.384
ValAsn: 3.827 ± 0.388
ValPro: 2.853 ± 0.378
ValGln: 2.018 ± 0.356
ValArg: 3.062 ± 0.455
ValSer: 4.175 ± 0.504
ValThr: 7.654 ± 0.782
ValVal: 3.27 ± 0.437
ValTrp: 0.765 ± 0.205
ValTyr: 2.783 ± 0.431
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.183 ± 0.26
TrpCys: 0.209 ± 0.11
TrpAsp: 1.392 ± 0.252
TrpGlu: 0.765 ± 0.259
TrpPhe: 0.557 ± 0.197
TrpGly: 1.392 ± 0.28
TrpHis: 0.209 ± 0.117
TrpIle: 1.044 ± 0.287
TrpLys: 1.253 ± 0.32
TrpLeu: 1.392 ± 0.294
TrpMet: 0.209 ± 0.112
TrpAsn: 0.835 ± 0.292
TrpPro: 0.626 ± 0.194
TrpGln: 0.835 ± 0.286
TrpArg: 1.322 ± 0.328
TrpSer: 1.253 ± 0.264
TrpThr: 0.835 ± 0.236
TrpVal: 1.322 ± 0.331
TrpTrp: 0.07 ± 0.057
TrpTyr: 0.696 ± 0.193
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.131 ± 0.501
TyrCys: 0.835 ± 0.244
TyrAsp: 2.157 ± 0.349
TyrGlu: 2.435 ± 0.415
TyrPhe: 1.322 ± 0.272
TyrGly: 2.923 ± 0.426
TyrHis: 0.348 ± 0.146
TyrIle: 1.67 ± 0.333
TyrLys: 2.088 ± 0.314
TyrLeu: 2.227 ± 0.398
TyrMet: 1.392 ± 0.279
TyrAsn: 1.809 ± 0.293
TyrPro: 1.67 ± 0.39
TyrGln: 1.392 ± 0.271
TyrArg: 1.948 ± 0.404
TyrSer: 2.157 ± 0.356
TyrThr: 2.714 ± 0.36
TyrVal: 2.992 ± 0.428
TyrTrp: 0.905 ± 0.296
TyrTyr: 1.392 ± 0.318
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (14372 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
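Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. The function names, the toy sequences, the fixed seed, the normalisation by the number of overlapping pairs, and the use of the sample standard deviation as the error measure are all assumptions made for illustration; the note above states only that 100 bootstrap rounds were used at the protein level.

```python
import random
import statistics
from collections import Counter

def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)

# Hypothetical toy proteome (one-letter codes), not taken from the database.
toy = ["MAAGK", "AAV", "MKLV", "GAAL"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein intact, so the estimate reflects variation between proteins.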

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski