Amino acid dipeptide frequency for Klebsiella phage vB_Kpn_Chronis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent residues.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
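The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function names `dipeptide_per_mille` and `classify`, the sliding-window counting within each protein, and the choice of normalising by the total number of counted dipeptides are all assumptions made for the example.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}

# Uniform expectation for 400 dipeptides: 0.25%, i.e. 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are counted with a sliding window of width 2
    inside each protein (never across protein boundaries), and any
    residue outside the 20 standard ones is mapped to Xaa.
    """
    counts = Counter()
    for seq in proteins:
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(per_mille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if per_mille > UNIFORM_PER_MILLE:
        return "over"
    if per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "as expected"


if __name__ == "__main__":
    # Toy sequences for illustration only, not taken from this proteome.
    toy = ["MKALAAG", "MAAGLS"]
    for dipeptide, freq in sorted(dipeptide_per_mille(toy).items()):
        print(dipeptide, round(freq, 3), classify(freq))
```

With the values from the table, `classify(8.585)` (AlaAla) gives "over" and `classify(0.477)` (AlaCys) gives "under".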

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
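As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper names `per_mille_to_fraction` and `fold_vs_uniform` are hypothetical and introduced only for this illustration; the AlaAla value is the one reported on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value, alphabet_size=20):
    """Ratio of a per mille value to the uniform expectation.

    With 20 amino acids there are 400 dipeptides, so the uniform
    expectation is 1000 / 400 = 2.5 per mille; with Xaa as a 21st
    symbol it would be 1000 / 441.
    """
    return value / (1000.0 / alphabet_size ** 2)


ala_ala = 8.585                              # AlaAla, in per mille
fraction = per_mille_to_fraction(ala_ala)    # about 0.008585
percent = fraction * 100                     # about 0.8585 %
fold = fold_vs_uniform(ala_ala)              # about 3.4 times the expectation
```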

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 8.585 ± 1.103 | 0.477 ± 0.153 | 5.11 ± 1.017 | 6.269 ± 0.946 | 2.998 ± 0.42 | 8.108 ± 0.875 | 1.158 ± 0.258 | 6.678 ± 0.56 | 5.11 ± 0.888 | 7.836 ± 0.753 | 3.679 ± 0.401 | 4.293 ± 0.546 | 2.453 ± 0.349 | 3.748 ± 0.623 | 4.974 ± 0.667 | 5.519 ± 0.802 | 5.11 ± 0.653 | 5.315 ± 0.751 | 1.363 ± 0.292 | 2.521 ± 0.432 | 0.0 ± 0.0
Cys | 1.295 ± 0.259 | 0.068 ± 0.063 | 0.818 ± 0.244 | 1.022 ± 0.282 | 0.136 ± 0.083 | 1.226 ± 0.322 | 0.204 ± 0.132 | 0.273 ± 0.131 | 0.477 ± 0.19 | 0.75 ± 0.24 | 0.136 ± 0.113 | 0.204 ± 0.165 | 0.477 ± 0.223 | 0.545 ± 0.195 | 1.09 ± 0.286 | 1.022 ± 0.261 | 0.341 ± 0.155 | 0.818 ± 0.251 | 0.0 ± 0.0 | 0.477 ± 0.176 | 0.0 ± 0.0
Asp | 5.247 ± 0.819 | 0.818 ± 0.25 | 4.633 ± 0.815 | 3.543 ± 0.663 | 2.657 ± 0.432 | 5.383 ± 0.592 | 1.09 ± 0.277 | 4.088 ± 0.561 | 2.998 ± 0.514 | 4.088 ± 0.418 | 1.84 ± 0.299 | 2.18 ± 0.29 | 2.521 ± 0.442 | 1.908 ± 0.433 | 2.249 ± 0.44 | 4.225 ± 0.638 | 3.203 ± 0.456 | 3.543 ± 0.563 | 1.431 ± 0.333 | 2.657 ± 0.396 | 0.0 ± 0.0
Glu | 5.315 ± 0.723 | 0.613 ± 0.215 | 2.657 ± 0.433 | 3.884 ± 0.652 | 2.453 ± 0.346 | 3.952 ± 0.511 | 1.022 ± 0.246 | 4.02 ± 0.44 | 4.702 ± 0.839 | 4.974 ± 0.668 | 2.317 ± 0.424 | 1.976 ± 0.419 | 1.908 ± 0.407 | 2.657 ± 0.482 | 3.611 ± 0.507 | 3.271 ± 0.416 | 2.589 ± 0.313 | 3.475 ± 0.574 | 1.09 ± 0.263 | 1.635 ± 0.362 | 0.0 ± 0.0
Phe | 1.908 ± 0.3 | 0.681 ± 0.207 | 1.84 ± 0.308 | 1.703 ± 0.325 | 1.431 ± 0.274 | 2.317 ± 0.433 | 0.613 ± 0.194 | 2.521 ± 0.397 | 2.044 ± 0.318 | 2.385 ± 0.355 | 1.09 ± 0.233 | 2.453 ± 0.367 | 1.158 ± 0.308 | 1.226 ± 0.234 | 2.044 ± 0.433 | 2.657 ± 0.45 | 2.93 ± 0.417 | 2.18 ± 0.4 | 0.545 ± 0.142 | 1.363 ± 0.355 | 0.0 ± 0.0
Gly | 6.269 ± 0.909 | 1.226 ± 0.281 | 4.156 ± 0.469 | 4.633 ± 0.488 | 3.475 ± 0.508 | 7.291 ± 1.216 | 1.09 ± 0.242 | 4.361 ± 0.536 | 4.361 ± 0.544 | 6.269 ± 0.649 | 2.044 ± 0.326 | 3.407 ± 0.652 | 1.908 ± 0.356 | 2.385 ± 0.454 | 4.429 ± 0.57 | 5.724 ± 0.853 | 5.587 ± 0.881 | 5.724 ± 0.676 | 1.431 ± 0.276 | 2.044 ± 0.364 | 0.0 ± 0.0
His | 0.681 ± 0.223 | 0.341 ± 0.155 | 0.818 ± 0.237 | 0.886 ± 0.239 | 0.341 ± 0.158 | 1.431 ± 0.267 | 0.341 ± 0.152 | 0.954 ± 0.222 | 1.022 ± 0.275 | 1.226 ± 0.266 | 0.409 ± 0.168 | 0.818 ± 0.265 | 0.477 ± 0.149 | 0.545 ± 0.159 | 1.158 ± 0.267 | 1.022 ± 0.224 | 0.477 ± 0.167 | 0.681 ± 0.217 | 0.273 ± 0.122 | 1.022 ± 0.245 | 0.0 ± 0.0
Ile | 6.064 ± 0.572 | 0.818 ± 0.211 | 4.293 ± 0.435 | 3.884 ± 0.548 | 1.431 ± 0.271 | 3.611 ± 0.397 | 0.477 ± 0.195 | 3.134 ± 0.442 | 2.249 ± 0.365 | 4.156 ± 0.563 | 1.703 ± 0.31 | 3.543 ± 0.452 | 3.407 ± 0.53 | 1.772 ± 0.396 | 3.339 ± 0.397 | 4.838 ± 0.696 | 4.77 ± 0.498 | 3.543 ± 0.452 | 0.545 ± 0.198 | 2.112 ± 0.415 | 0.0 ± 0.0
Lys | 5.519 ± 0.949 | 0.545 ± 0.233 | 2.453 ± 0.482 | 4.02 ± 0.633 | 1.567 ± 0.389 | 3.475 ± 0.449 | 0.681 ± 0.199 | 2.862 ± 0.475 | 2.93 ± 0.511 | 3.543 ± 0.525 | 1.703 ± 0.353 | 2.18 ± 0.422 | 3.543 ± 0.534 | 2.112 ± 0.327 | 3.339 ± 0.581 | 2.998 ± 0.55 | 4.02 ± 0.574 | 2.385 ± 0.365 | 1.022 ± 0.244 | 1.499 ± 0.299 | 0.0 ± 0.0
Leu | 7.563 ± 0.728 | 0.75 ± 0.229 | 4.838 ± 0.541 | 4.497 ± 0.666 | 1.976 ± 0.381 | 5.179 ± 0.822 | 1.499 ± 0.342 | 4.906 ± 0.611 | 4.361 ± 0.616 | 6.132 ± 0.763 | 1.499 ± 0.329 | 4.633 ± 0.531 | 2.93 ± 0.494 | 3.543 ± 0.453 | 5.315 ± 0.702 | 6.541 ± 0.635 | 6.132 ± 0.571 | 3.543 ± 0.468 | 0.613 ± 0.182 | 2.112 ± 0.335 | 0.0 ± 0.0
Met | 2.862 ± 0.418 | 0.204 ± 0.121 | 1.09 ± 0.279 | 1.363 ± 0.319 | 0.818 ± 0.218 | 1.635 ± 0.361 | 0.409 ± 0.177 | 1.84 ± 0.292 | 1.295 ± 0.28 | 2.657 ± 0.385 | 0.954 ± 0.268 | 1.363 ± 0.312 | 1.499 ± 0.307 | 1.431 ± 0.324 | 1.772 ± 0.433 | 2.317 ± 0.346 | 2.589 ± 0.407 | 1.635 ± 0.308 | 0.545 ± 0.16 | 0.818 ± 0.178 | 0.0 ± 0.0
Asn | 4.77 ± 0.478 | 0.681 ± 0.188 | 2.794 ± 0.298 | 2.657 ± 0.44 | 1.772 ± 0.351 | 3.952 ± 0.664 | 0.818 ± 0.206 | 2.726 ± 0.451 | 2.317 ± 0.38 | 3.816 ± 0.555 | 1.09 ± 0.21 | 2.657 ± 0.513 | 2.657 ± 0.473 | 1.703 ± 0.344 | 2.249 ± 0.298 | 2.93 ± 0.456 | 3.271 ± 0.513 | 2.589 ± 0.426 | 0.273 ± 0.151 | 1.84 ± 0.418 | 0.0 ± 0.0
Pro | 4.633 ± 0.605 | 0.341 ± 0.166 | 2.862 ± 0.464 | 3.066 ± 0.494 | 1.703 ± 0.276 | 3.679 ± 0.533 | 0.409 ± 0.155 | 2.044 ± 0.339 | 1.84 ± 0.347 | 3.884 ± 0.602 | 1.635 ± 0.379 | 1.09 ± 0.247 | 1.772 ± 0.321 | 1.772 ± 0.372 | 1.431 ± 0.283 | 2.453 ± 0.463 | 2.385 ± 0.431 | 3.203 ± 0.49 | 0.409 ± 0.145 | 1.295 ± 0.302 | 0.0 ± 0.0
Gln | 4.088 ± 0.641 | 0.613 ± 0.178 | 1.772 ± 0.359 | 2.18 ± 0.358 | 1.022 ± 0.271 | 2.794 ± 0.335 | 0.545 ± 0.211 | 2.112 ± 0.372 | 2.589 ± 0.512 | 3.134 ± 0.499 | 1.158 ± 0.258 | 2.18 ± 0.52 | 2.18 ± 0.407 | 2.317 ± 0.413 | 3.066 ± 0.435 | 2.93 ± 0.392 | 1.226 ± 0.257 | 3.203 ± 0.401 | 1.022 ± 0.265 | 1.703 ± 0.297 | 0.0 ± 0.0
Arg | 4.702 ± 0.666 | 1.022 ± 0.224 | 3.816 ± 0.475 | 3.475 ± 0.555 | 2.112 ± 0.418 | 3.611 ± 0.458 | 0.886 ± 0.204 | 2.998 ± 0.5 | 3.271 ± 0.563 | 4.361 ± 0.492 | 1.908 ± 0.435 | 2.862 ± 0.435 | 1.703 ± 0.347 | 3.339 ± 0.536 | 4.838 ± 0.782 | 3.134 ± 0.494 | 2.453 ± 0.495 | 3.339 ± 0.338 | 1.499 ± 0.32 | 1.908 ± 0.369 | 0.0 ± 0.0
Ser | 7.563 ± 0.785 | 0.613 ± 0.18 | 4.497 ± 0.729 | 2.862 ± 0.51 | 2.589 ± 0.397 | 5.792 ± 0.714 | 1.022 ± 0.277 | 3.679 ± 0.471 | 3.066 ± 0.587 | 5.928 ± 0.618 | 1.908 ± 0.451 | 2.726 ± 0.501 | 3.339 ± 0.476 | 2.453 ± 0.325 | 3.611 ± 0.513 | 4.838 ± 0.607 | 3.543 ± 0.491 | 5.587 ± 0.687 | 0.681 ± 0.247 | 1.908 ± 0.383 | 0.0 ± 0.0
Thr | 5.587 ± 0.639 | 0.409 ± 0.169 | 3.748 ± 0.608 | 3.407 ± 0.538 | 2.862 ± 0.405 | 6.541 ± 0.668 | 0.818 ± 0.215 | 4.156 ± 0.65 | 2.862 ± 0.41 | 4.633 ± 0.517 | 0.818 ± 0.242 | 2.93 ± 0.533 | 3.339 ± 0.486 | 3.066 ± 0.541 | 2.112 ± 0.349 | 3.952 ± 0.465 | 3.475 ± 0.707 | 5.179 ± 0.826 | 1.226 ± 0.313 | 2.112 ± 0.456 | 0.0 ± 0.0
Val | 4.838 ± 0.833 | 0.613 ± 0.218 | 3.884 ± 0.485 | 2.998 ± 0.436 | 2.249 ± 0.495 | 4.293 ± 0.478 | 0.75 ± 0.184 | 4.088 ± 0.561 | 3.407 ± 0.458 | 4.497 ± 0.551 | 1.976 ± 0.275 | 3.339 ± 0.609 | 2.862 ± 0.486 | 1.908 ± 0.421 | 2.862 ± 0.419 | 5.11 ± 0.85 | 6.541 ± 0.794 | 4.361 ± 0.494 | 1.09 ± 0.281 | 1.772 ± 0.398 | 0.0 ± 0.0
Trp | 1.363 ± 0.262 | 0.204 ± 0.108 | 1.295 ± 0.251 | 0.477 ± 0.189 | 0.409 ± 0.178 | 0.818 ± 0.246 | 0.273 ± 0.136 | 0.886 ± 0.219 | 0.545 ± 0.17 | 1.09 ± 0.253 | 0.681 ± 0.231 | 0.75 ± 0.213 | 0.681 ± 0.228 | 1.022 ± 0.259 | 0.886 ± 0.241 | 1.09 ± 0.24 | 0.954 ± 0.289 | 1.635 ± 0.374 | 0.136 ± 0.085 | 0.545 ± 0.197 | 0.0 ± 0.0
Tyr | 2.453 ± 0.464 | 0.341 ± 0.143 | 2.93 ± 0.417 | 1.295 ± 0.235 | 1.09 ± 0.251 | 2.726 ± 0.421 | 0.818 ± 0.254 | 1.363 ± 0.32 | 1.022 ± 0.359 | 3.203 ± 0.399 | 0.477 ± 0.189 | 1.976 ± 0.559 | 1.09 ± 0.249 | 2.385 ± 0.415 | 2.862 ± 0.443 | 1.635 ± 0.333 | 1.635 ± 0.297 | 1.499 ± 0.318 | 0.545 ± 0.172 | 0.818 ± 0.209 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 71 proteins (14677 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
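Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of the replicates is taken as the error. This is only an illustration under stated assumptions. The page does not say which spread statistic is reported, so the use of the sample standard deviation is an assumption, as are the function names `dipeptide_frequency` and `bootstrap_error` and the fixed random seed.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per mille frequency of a one-letter-coded dipeptide (e.g. "AA")."""
    hits = 0
    total = 0
    for seq in proteins:
        # Sliding window of width 2 inside each protein.
        for a, b in zip(seq, seq[1:]):
            total += 1
            if a + b == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual residue.
    Assumption: the error is the sample standard deviation of replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences for illustration only, not taken from this proteome.
    toy = ["MKALAAG", "MAAGLS", "MSTVVKL", "MAAAAK"]
    print(dipeptide_frequency(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than single residues keeps adjacent residues together, so each replicate contains only dipeptides that actually occur in the proteome.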

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski