Amino acid dipeptide frequency for Klebsiella phage vB_KpnP_SU552A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
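A minimal sketch of how such frequencies could be computed, assuming proteins are given as one-letter sequences and that only pairs within a single protein are counted. The function name and the toy sequences are hypothetical, and the database's own counting procedure may differ (for example in how protein boundaries are handled).

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: dipeptides are counted with a sliding window of width 2
    inside each protein; pairs spanning two proteins are not counted.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the phage proteome.
toy = ["MAAK", "AAL"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

The table on this page uses three-letter codes (AlaAla rather than AA); mapping between the two notations is left out of the sketch.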

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely, each would occur with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
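The uniform expectation and the over/under labelling can be sketched as follows. This is an illustration only: the function names are hypothetical, and the exact threshold used for the red and blue colouring on the page is not stated, so a plain comparison against the uniform expectation is assumed.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)

def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

print(expected_per_mille(20))   # 2.5
print(classify(15.721))         # over  (AlaAla value from the table)
print(classify(0.873))          # under (AlaCys value from the table)
```

With Xaa included (alphabet_size=21) the expectation drops to 1000/441, slightly below 2.5‰.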

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
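As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0

ala_ala = 15.721  # AlaAla frequency from the table, in per mille
print(f"{per_mille_to_fraction(ala_ala):.6f}")  # 0.015721
print(f"{per_mille_to_percent(ala_ala):.4f}")   # 1.5721
```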

For more information see sequence space article on Wikipedia.

Dipeptide frequency ± bootstrap error (‰), grouped by first residue; second residue in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.721 ± 1.541
AlaCys: 0.873 ± 0.258
AlaAsp: 6.259 ± 0.706
AlaGlu: 5.167 ± 0.621
AlaPhe: 3.421 ± 0.405
AlaGly: 8.297 ± 0.975
AlaHis: 1.674 ± 0.516
AlaIle: 4.367 ± 0.599
AlaLys: 5.895 ± 1.044
AlaLeu: 9.607 ± 0.866
AlaMet: 2.984 ± 0.421
AlaAsn: 3.421 ± 0.559
AlaPro: 4.294 ± 1.071
AlaGln: 5.677 ± 0.863
AlaArg: 5.531 ± 0.755
AlaSer: 6.259 ± 0.659
AlaThr: 4.658 ± 0.704
AlaVal: 7.132 ± 0.86
AlaTrp: 1.237 ± 0.357
AlaTyr: 4.076 ± 0.643
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.801 ± 0.275
CysCys: 0.437 ± 0.237
CysAsp: 0.728 ± 0.26
CysGlu: 0.582 ± 0.203
CysPhe: 0.364 ± 0.162
CysGly: 1.019 ± 0.341
CysHis: 0.291 ± 0.157
CysIle: 0.291 ± 0.129
CysLys: 0.437 ± 0.238
CysLeu: 0.873 ± 0.261
CysMet: 0.509 ± 0.176
CysAsn: 0.509 ± 0.188
CysPro: 0.582 ± 0.222
CysGln: 0.291 ± 0.149
CysArg: 0.873 ± 0.234
CysSer: 0.946 ± 0.248
CysThr: 0.873 ± 0.232
CysVal: 0.946 ± 0.2
CysTrp: 0.218 ± 0.13
CysTyr: 0.509 ± 0.159
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.132 ± 0.909
AspCys: 1.092 ± 0.327
AspAsp: 3.275 ± 0.487
AspGlu: 3.712 ± 0.571
AspPhe: 2.402 ± 0.423
AspGly: 4.658 ± 0.595
AspHis: 0.364 ± 0.214
AspIle: 2.984 ± 0.518
AspLys: 2.693 ± 0.401
AspLeu: 5.022 ± 0.548
AspMet: 2.693 ± 0.424
AspAsn: 2.693 ± 0.47
AspPro: 2.547 ± 0.4
AspGln: 1.747 ± 0.312
AspArg: 2.62 ± 0.551
AspSer: 5.386 ± 0.549
AspThr: 4.076 ± 0.53
AspVal: 4.148 ± 0.432
AspTrp: 1.092 ± 0.201
AspTyr: 2.693 ± 0.47
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.949 ± 0.916
GluCys: 0.364 ± 0.153
GluAsp: 2.984 ± 0.423
GluGlu: 4.003 ± 0.902
GluPhe: 2.62 ± 0.406
GluGly: 4.148 ± 0.516
GluHis: 2.475 ± 0.338
GluIle: 2.402 ± 0.373
GluLys: 2.038 ± 0.46
GluLeu: 5.386 ± 0.637
GluMet: 1.965 ± 0.325
GluAsn: 1.82 ± 0.317
GluPro: 1.747 ± 0.312
GluGln: 3.275 ± 0.56
GluArg: 3.857 ± 0.549
GluSer: 2.475 ± 0.464
GluThr: 3.13 ± 0.471
GluVal: 4.731 ± 0.624
GluTrp: 0.946 ± 0.237
GluTyr: 2.402 ± 0.363
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.838 ± 0.46
PheCys: 0.509 ± 0.259
PheAsp: 2.183 ± 0.369
PheGlu: 1.965 ± 0.37
PhePhe: 1.092 ± 0.213
PheGly: 2.038 ± 0.307
PheHis: 0.364 ± 0.195
PheIle: 1.092 ± 0.237
PheLys: 1.747 ± 0.445
PheLeu: 2.402 ± 0.475
PheMet: 0.655 ± 0.212
PheAsn: 1.747 ± 0.362
PhePro: 1.456 ± 0.269
PheGln: 1.237 ± 0.269
PheArg: 1.892 ± 0.507
PheSer: 1.892 ± 0.496
PheThr: 2.111 ± 0.466
PheVal: 2.038 ± 0.452
PheTrp: 0.509 ± 0.144
PheTyr: 1.31 ± 0.259
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.259 ± 0.69
GlyCys: 1.674 ± 0.365
GlyAsp: 4.731 ± 0.503
GlyGlu: 3.93 ± 0.427
GlyPhe: 2.475 ± 0.461
GlyGly: 4.294 ± 0.667
GlyHis: 1.092 ± 0.279
GlyIle: 3.93 ± 0.583
GlyLys: 3.857 ± 0.596
GlyLeu: 6.477 ± 0.716
GlyMet: 1.82 ± 0.414
GlyAsn: 3.566 ± 0.559
GlyPro: 1.528 ± 0.296
GlyGln: 2.693 ± 0.421
GlyArg: 5.022 ± 0.392
GlySer: 5.604 ± 0.707
GlyThr: 5.24 ± 1.007
GlyVal: 6.259 ± 0.783
GlyTrp: 0.655 ± 0.198
GlyTyr: 2.475 ± 0.458
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.601 ± 0.37
HisCys: 0.291 ± 0.152
HisAsp: 1.092 ± 0.29
HisGlu: 1.237 ± 0.348
HisPhe: 0.218 ± 0.102
HisGly: 1.892 ± 0.445
HisHis: 0.073 ± 0.084
HisIle: 1.019 ± 0.289
HisLys: 0.801 ± 0.232
HisLeu: 2.62 ± 0.475
HisMet: 0.291 ± 0.124
HisAsn: 0.728 ± 0.205
HisPro: 0.655 ± 0.303
HisGln: 0.437 ± 0.208
HisArg: 1.528 ± 0.336
HisSer: 1.237 ± 0.339
HisThr: 0.509 ± 0.177
HisVal: 0.655 ± 0.242
HisTrp: 0.218 ± 0.121
HisTyr: 0.655 ± 0.24
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.493 ± 0.533
IleCys: 0.437 ± 0.199
IleAsp: 2.911 ± 0.392
IleGlu: 2.766 ± 0.507
IlePhe: 0.801 ± 0.214
IleGly: 2.693 ± 0.388
IleHis: 0.728 ± 0.215
IleIle: 1.82 ± 0.331
IleLys: 3.057 ± 0.53
IleLeu: 4.294 ± 0.622
IleMet: 1.528 ± 0.247
IleAsn: 1.892 ± 0.341
IlePro: 2.038 ± 0.373
IleGln: 2.547 ± 0.425
IleArg: 2.766 ± 0.453
IleSer: 2.984 ± 0.433
IleThr: 2.766 ± 0.538
IleVal: 2.838 ± 0.425
IleTrp: 0.146 ± 0.103
IleTyr: 1.31 ± 0.293
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.332 ± 0.906
LysCys: 0.437 ± 0.199
LysAsp: 3.13 ± 0.506
LysGlu: 3.348 ± 0.583
LysPhe: 0.946 ± 0.267
LysGly: 2.984 ± 0.556
LysHis: 0.946 ± 0.26
LysIle: 1.383 ± 0.288
LysLys: 2.038 ± 0.422
LysLeu: 4.803 ± 0.568
LysMet: 1.237 ± 0.284
LysAsn: 1.31 ± 0.245
LysPro: 1.456 ± 0.358
LysGln: 3.348 ± 0.589
LysArg: 3.348 ± 0.526
LysSer: 2.911 ± 0.409
LysThr: 2.547 ± 0.412
LysVal: 3.202 ± 0.538
LysTrp: 1.019 ± 0.266
LysTyr: 1.383 ± 0.366
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.37 ± 0.935
LeuCys: 1.092 ± 0.244
LeuAsp: 7.132 ± 0.616
LeuGlu: 5.604 ± 0.517
LeuPhe: 2.693 ± 0.369
LeuGly: 6.696 ± 0.663
LeuHis: 1.528 ± 0.362
LeuIle: 4.658 ± 0.694
LeuLys: 2.911 ± 0.494
LeuLeu: 7.278 ± 0.815
LeuMet: 1.747 ± 0.357
LeuAsn: 3.493 ± 0.508
LeuPro: 3.202 ± 0.44
LeuGln: 4.148 ± 0.529
LeuArg: 6.623 ± 0.567
LeuSer: 4.876 ± 0.662
LeuThr: 5.75 ± 0.607
LeuVal: 5.604 ± 0.633
LeuTrp: 1.237 ± 0.277
LeuTyr: 3.348 ± 0.428
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.566 ± 0.62
MetCys: 0.291 ± 0.128
MetAsp: 1.965 ± 0.447
MetGlu: 1.092 ± 0.204
MetPhe: 0.873 ± 0.248
MetGly: 1.237 ± 0.272
MetHis: 0.873 ± 0.288
MetIle: 0.655 ± 0.205
MetLys: 1.164 ± 0.303
MetLeu: 3.566 ± 0.608
MetMet: 0.509 ± 0.226
MetAsn: 0.946 ± 0.281
MetPro: 1.164 ± 0.24
MetGln: 2.038 ± 0.409
MetArg: 1.892 ± 0.433
MetSer: 2.256 ± 0.459
MetThr: 0.946 ± 0.305
MetVal: 1.965 ± 0.417
MetTrp: 0.291 ± 0.131
MetTyr: 1.092 ± 0.332
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.566 ± 0.424
AsnCys: 0.364 ± 0.19
AsnAsp: 2.62 ± 0.421
AsnGlu: 1.601 ± 0.328
AsnPhe: 1.092 ± 0.334
AsnGly: 3.275 ± 0.424
AsnHis: 0.218 ± 0.122
AsnIle: 2.62 ± 0.444
AsnLys: 1.965 ± 0.296
AsnLeu: 3.057 ± 0.582
AsnMet: 1.237 ± 0.296
AsnAsn: 1.383 ± 0.349
AsnPro: 2.402 ± 0.351
AsnGln: 1.456 ± 0.412
AsnArg: 1.747 ± 0.331
AsnSer: 3.275 ± 0.581
AsnThr: 2.62 ± 0.397
AsnVal: 3.202 ± 0.341
AsnTrp: 0.728 ± 0.285
AsnTyr: 1.456 ± 0.409
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.512 ± 0.97
ProCys: 0.146 ± 0.109
ProAsp: 2.838 ± 0.424
ProGlu: 3.202 ± 0.607
ProPhe: 0.946 ± 0.212
ProGly: 2.547 ± 0.585
ProHis: 0.509 ± 0.216
ProIle: 1.601 ± 0.363
ProLys: 1.456 ± 0.344
ProLeu: 2.766 ± 0.451
ProMet: 1.092 ± 0.256
ProAsn: 1.601 ± 0.357
ProPro: 0.801 ± 0.23
ProGln: 1.383 ± 0.274
ProArg: 1.892 ± 0.353
ProSer: 2.329 ± 0.414
ProThr: 2.329 ± 0.386
ProVal: 2.838 ± 0.417
ProTrp: 0.582 ± 0.203
ProTyr: 1.528 ± 0.316
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.095 ± 0.727
GlnCys: 0.437 ± 0.221
GlnAsp: 2.838 ± 0.469
GlnGlu: 3.639 ± 0.607
GlnPhe: 1.237 ± 0.308
GlnGly: 2.693 ± 0.411
GlnHis: 1.237 ± 0.353
GlnIle: 0.873 ± 0.271
GlnLys: 2.62 ± 0.473
GlnLeu: 4.731 ± 0.616
GlnMet: 1.237 ± 0.234
GlnAsn: 2.038 ± 0.408
GlnPro: 1.601 ± 0.418
GlnGln: 3.057 ± 0.603
GlnArg: 2.838 ± 0.364
GlnSer: 3.712 ± 0.455
GlnThr: 1.456 ± 0.401
GlnVal: 2.838 ± 0.423
GlnTrp: 0.655 ± 0.215
GlnTyr: 2.256 ± 0.604
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.987 ± 1.112
ArgCys: 0.728 ± 0.273
ArgAsp: 3.566 ± 0.536
ArgGlu: 3.348 ± 0.549
ArgPhe: 2.183 ± 0.33
ArgGly: 3.93 ± 0.656
ArgHis: 1.019 ± 0.213
ArgIle: 3.275 ± 0.546
ArgLys: 2.984 ± 0.528
ArgLeu: 4.949 ± 0.485
ArgMet: 2.183 ± 0.373
ArgAsn: 2.547 ± 0.301
ArgPro: 1.674 ± 0.342
ArgGln: 2.475 ± 0.436
ArgArg: 4.44 ± 0.708
ArgSer: 3.057 ± 0.545
ArgThr: 3.493 ± 0.343
ArgVal: 3.639 ± 0.538
ArgTrp: 0.873 ± 0.245
ArgTyr: 2.038 ± 0.348
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.661 ± 0.728
SerCys: 0.582 ± 0.189
SerAsp: 4.148 ± 0.514
SerGlu: 3.202 ± 0.566
SerPhe: 1.965 ± 0.396
SerGly: 6.186 ± 0.984
SerHis: 0.801 ± 0.242
SerIle: 2.766 ± 0.581
SerLys: 3.93 ± 0.446
SerLeu: 4.367 ± 0.778
SerMet: 2.402 ± 0.441
SerAsn: 3.13 ± 0.6
SerPro: 2.402 ± 0.34
SerGln: 1.82 ± 0.336
SerArg: 2.766 ± 0.385
SerSer: 3.712 ± 0.689
SerThr: 4.731 ± 0.663
SerVal: 5.095 ± 0.66
SerTrp: 1.237 ± 0.343
SerTyr: 1.674 ± 0.317
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.75 ± 0.7
ThrCys: 0.509 ± 0.219
ThrAsp: 3.13 ± 0.424
ThrGlu: 2.62 ± 0.54
ThrPhe: 2.111 ± 0.408
ThrGly: 4.876 ± 0.683
ThrHis: 1.092 ± 0.273
ThrIle: 2.693 ± 0.444
ThrLys: 2.547 ± 0.431
ThrLeu: 5.75 ± 0.687
ThrMet: 1.383 ± 0.426
ThrAsn: 1.82 ± 0.465
ThrPro: 2.402 ± 0.323
ThrGln: 2.984 ± 0.562
ThrArg: 2.838 ± 0.452
ThrSer: 4.076 ± 0.686
ThrThr: 3.639 ± 0.553
ThrVal: 4.803 ± 0.674
ThrTrp: 1.019 ± 0.245
ThrTyr: 2.256 ± 0.459
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.623 ± 0.74
ValCys: 0.801 ± 0.281
ValAsp: 4.512 ± 0.612
ValGlu: 4.003 ± 0.615
ValPhe: 1.528 ± 0.329
ValGly: 6.623 ± 0.75
ValHis: 1.674 ± 0.283
ValIle: 2.766 ± 0.504
ValLys: 3.275 ± 0.524
ValLeu: 5.531 ± 0.82
ValMet: 1.82 ± 0.395
ValAsn: 3.057 ± 0.608
ValPro: 3.13 ± 0.462
ValGln: 3.93 ± 0.87
ValArg: 4.003 ± 0.42
ValSer: 5.313 ± 0.804
ValThr: 3.348 ± 0.564
ValVal: 5.677 ± 0.514
ValTrp: 0.655 ± 0.203
ValTyr: 2.838 ± 0.433
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.237 ± 0.244
TrpCys: 0.218 ± 0.116
TrpAsp: 0.728 ± 0.227
TrpGlu: 1.019 ± 0.305
TrpPhe: 0.655 ± 0.353
TrpGly: 0.582 ± 0.253
TrpHis: 0.364 ± 0.145
TrpIle: 0.509 ± 0.224
TrpLys: 0.509 ± 0.216
TrpLeu: 1.383 ± 0.233
TrpMet: 0.146 ± 0.159
TrpAsn: 0.946 ± 0.243
TrpPro: 0.437 ± 0.207
TrpGln: 0.437 ± 0.153
TrpArg: 0.946 ± 0.238
TrpSer: 0.655 ± 0.226
TrpThr: 1.092 ± 0.206
TrpVal: 1.237 ± 0.279
TrpTrp: 0.364 ± 0.182
TrpTyr: 0.801 ± 0.259
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.057 ± 0.452
TyrCys: 0.801 ± 0.258
TyrAsp: 2.256 ± 0.464
TyrGlu: 1.674 ± 0.557
TyrPhe: 1.456 ± 0.286
TyrGly: 2.911 ± 0.509
TyrHis: 0.437 ± 0.176
TyrIle: 2.111 ± 0.462
TyrLys: 2.183 ± 0.363
TyrLeu: 3.13 ± 0.343
TyrMet: 0.873 ± 0.299
TyrAsn: 1.237 ± 0.257
TyrPro: 1.383 ± 0.263
TyrGln: 2.111 ± 0.403
TyrArg: 2.038 ± 0.504
TyrSer: 2.62 ± 0.383
TyrThr: 2.984 ± 0.553
TyrVal: 2.256 ± 0.534
TyrTrp: 0.582 ± 0.194
TyrTyr: 1.31 ± 0.339
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (13741 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
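A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under assumptions, not the database's actual code: "x100" is read as 100 resamples, the error is taken to be the standard deviation across resamples, and the function names, seed and toy sequences are hypothetical.

```python
import random
import statistics

def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. "AA") in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the reported error is the sample standard deviation of
    the per mille frequency over the resampled proteomes.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy input, not taken from the phage proteome.
toy_proteins = ["MAAK", "AAL", "GAAAV", "MKLV"]
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {dipeptide_per_mille(toy_proteins, 'AA'):.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.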

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski