Amino acid dipeptide frequency for Klebsiella phage 117

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.
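As an illustration, dipeptide frequencies can be computed by counting overlapping pairs of adjacent residues. The sketch below is not the database's own code: the function name `dipeptide_permille`, the one-letter input sequences, and the normalization by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping, ordered, and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Normalize by the total number of dipeptides and scale to per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, one-letter codes).
freqs = dipeptide_permille(["MKAAL", "AAG"])
print(freqs["AA"])
```

Because the pairs are ordered, AlaCys and CysAla are counted separately, which is why the table is not symmetric.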

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
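The uniform baseline described above is simple arithmetic, sketched here together with a comparison against it. The helper names are hypothetical, and the exact rule the database uses for the red and blue marking is not stated on this page; a plain comparison with the uniform expectation is assumed.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of one dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the expectation (assumed rule, see lead-in)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"

expected = uniform_expectation_permille()       # 20 standard amino acids
expected_xaa = uniform_expectation_permille(21)  # with Xaa as a 21st symbol

print(classify(9.142, expected))  # AlaAla value from the table
print(classify(0.163, expected))  # CysCys value from the table
```

With 20 amino acids the baseline is 1000/400 = 2.5‰; including Xaa lowers it to 1000/441, roughly 2.27‰.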

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.142 ± 1.073
AlaCys: 0.735 ± 0.252
AlaAsp: 6.775 ± 0.777
AlaGlu: 5.387 ± 0.599
AlaPhe: 3.836 ± 0.554
AlaGly: 6.53 ± 1.12
AlaHis: 1.306 ± 0.221
AlaIle: 4.326 ± 0.569
AlaLys: 7.02 ± 0.753
AlaLeu: 7.836 ± 0.955
AlaMet: 2.939 ± 0.657
AlaAsn: 4.408 ± 0.519
AlaPro: 2.694 ± 0.573
AlaGln: 3.836 ± 0.607
AlaArg: 5.142 ± 0.64
AlaSer: 5.306 ± 0.682
AlaThr: 3.836 ± 0.515
AlaVal: 5.306 ± 0.569
AlaTrp: 1.306 ± 0.358
AlaTyr: 3.102 ± 0.466
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.653 ± 0.262
CysCys: 0.163 ± 0.121
CysAsp: 0.653 ± 0.34
CysGlu: 0.816 ± 0.293
CysPhe: 0.327 ± 0.184
CysGly: 0.735 ± 0.247
CysHis: 0.163 ± 0.107
CysIle: 0.571 ± 0.213
CysLys: 0.327 ± 0.138
CysLeu: 0.898 ± 0.289
CysMet: 0.082 ± 0.078
CysAsn: 0.408 ± 0.204
CysPro: 0.735 ± 0.234
CysGln: 0.408 ± 0.158
CysArg: 0.735 ± 0.319
CysSer: 0.735 ± 0.283
CysThr: 0.082 ± 0.086
CysVal: 0.735 ± 0.234
CysTrp: 0.245 ± 0.171
CysTyr: 0.653 ± 0.3
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.795 ± 0.673
AspCys: 0.898 ± 0.327
AspAsp: 4.0 ± 0.569
AspGlu: 3.755 ± 0.543
AspPhe: 2.694 ± 0.426
AspGly: 5.959 ± 0.575
AspHis: 0.898 ± 0.256
AspIle: 2.775 ± 0.474
AspLys: 4.245 ± 0.693
AspLeu: 4.0 ± 0.582
AspMet: 2.204 ± 0.376
AspAsn: 2.53 ± 0.39
AspPro: 3.02 ± 0.503
AspGln: 2.122 ± 0.443
AspArg: 2.775 ± 0.497
AspSer: 3.428 ± 0.451
AspThr: 4.245 ± 0.575
AspVal: 4.979 ± 0.59
AspTrp: 0.898 ± 0.315
AspTyr: 2.122 ± 0.418
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.591 ± 0.79
GluCys: 0.653 ± 0.332
GluAsp: 4.571 ± 0.638
GluGlu: 6.122 ± 1.056
GluPhe: 2.612 ± 0.42
GluGly: 5.714 ± 0.768
GluHis: 1.551 ± 0.47
GluIle: 2.775 ± 0.433
GluLys: 2.939 ± 0.637
GluLeu: 6.285 ± 0.883
GluMet: 1.714 ± 0.62
GluAsn: 2.204 ± 0.332
GluPro: 2.53 ± 0.68
GluGln: 3.183 ± 0.767
GluArg: 4.245 ± 0.622
GluSer: 4.0 ± 0.483
GluThr: 3.265 ± 0.555
GluVal: 4.571 ± 0.765
GluTrp: 0.571 ± 0.21
GluTyr: 3.02 ± 0.458
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.612 ± 0.48
PheCys: 0.245 ± 0.154
PheAsp: 2.857 ± 0.501
PheGlu: 2.122 ± 0.337
PhePhe: 0.816 ± 0.22
PheGly: 3.102 ± 0.637
PheHis: 0.816 ± 0.269
PheIle: 1.796 ± 0.392
PheLys: 2.449 ± 0.354
PheLeu: 2.939 ± 0.549
PheMet: 0.98 ± 0.212
PheAsn: 2.367 ± 0.43
PhePro: 1.469 ± 0.427
PheGln: 1.143 ± 0.292
PheArg: 1.551 ± 0.401
PheSer: 2.286 ± 0.375
PheThr: 2.367 ± 0.406
PheVal: 2.449 ± 0.394
PheTrp: 0.327 ± 0.139
PheTyr: 0.98 ± 0.262
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.693 ± 0.954
GlyCys: 0.735 ± 0.224
GlyAsp: 5.551 ± 0.561
GlyGlu: 5.469 ± 0.61
GlyPhe: 2.775 ± 0.432
GlyGly: 5.387 ± 0.762
GlyHis: 1.143 ± 0.286
GlyIle: 4.898 ± 0.722
GlyLys: 5.387 ± 0.705
GlyLeu: 6.938 ± 0.803
GlyMet: 1.796 ± 0.424
GlyAsn: 2.939 ± 0.438
GlyPro: 1.551 ± 0.421
GlyGln: 2.612 ± 0.48
GlyArg: 4.163 ± 0.436
GlySer: 5.224 ± 0.575
GlyThr: 4.245 ± 0.535
GlyVal: 5.387 ± 0.904
GlyTrp: 1.633 ± 0.442
GlyTyr: 3.673 ± 0.482
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.306 ± 0.365
HisCys: 0.571 ± 0.182
HisAsp: 1.224 ± 0.282
HisGlu: 1.469 ± 0.362
HisPhe: 0.653 ± 0.219
HisGly: 1.388 ± 0.334
HisHis: 0.49 ± 0.208
HisIle: 0.816 ± 0.235
HisLys: 0.98 ± 0.25
HisLeu: 1.633 ± 0.469
HisMet: 0.408 ± 0.164
HisAsn: 0.327 ± 0.159
HisPro: 0.816 ± 0.199
HisGln: 0.653 ± 0.225
HisArg: 0.653 ± 0.212
HisSer: 0.98 ± 0.264
HisThr: 0.816 ± 0.242
HisVal: 1.388 ± 0.319
HisTrp: 0.245 ± 0.127
HisTyr: 0.816 ± 0.207
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.326 ± 0.583
IleCys: 0.571 ± 0.188
IleAsp: 2.939 ± 0.54
IleGlu: 3.02 ± 0.59
IlePhe: 0.816 ± 0.273
IleGly: 3.592 ± 0.601
IleHis: 0.735 ± 0.257
IleIle: 2.612 ± 0.543
IleLys: 3.183 ± 0.494
IleLeu: 3.183 ± 0.489
IleMet: 1.306 ± 0.294
IleAsn: 2.286 ± 0.571
IlePro: 2.694 ± 0.447
IleGln: 1.877 ± 0.406
IleArg: 3.428 ± 0.525
IleSer: 3.183 ± 0.436
IleThr: 2.612 ± 0.59
IleVal: 3.51 ± 0.576
IleTrp: 0.571 ± 0.23
IleTyr: 1.633 ± 0.391
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.673 ± 1.033
LysCys: 0.571 ± 0.237
LysAsp: 3.592 ± 0.597
LysGlu: 4.979 ± 0.615
LysPhe: 2.449 ± 0.458
LysGly: 5.795 ± 0.927
LysHis: 1.633 ± 0.337
LysIle: 2.694 ± 0.452
LysLys: 3.347 ± 0.788
LysLeu: 5.632 ± 0.547
LysMet: 1.714 ± 0.429
LysAsn: 2.53 ± 0.383
LysPro: 2.286 ± 0.562
LysGln: 2.204 ± 0.334
LysArg: 3.836 ± 0.618
LysSer: 3.592 ± 0.612
LysThr: 2.939 ± 0.406
LysVal: 5.551 ± 0.731
LysTrp: 0.653 ± 0.286
LysTyr: 2.122 ± 0.376
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.836 ± 1.183
LeuCys: 0.245 ± 0.154
LeuAsp: 4.816 ± 0.621
LeuGlu: 6.857 ± 1.095
LeuPhe: 2.53 ± 0.5
LeuGly: 4.979 ± 0.518
LeuHis: 1.306 ± 0.314
LeuIle: 3.51 ± 0.569
LeuLys: 6.938 ± 0.526
LeuLeu: 5.795 ± 0.924
LeuMet: 2.041 ± 0.29
LeuAsn: 4.571 ± 0.605
LeuPro: 3.02 ± 0.475
LeuGln: 3.673 ± 0.562
LeuArg: 4.653 ± 0.582
LeuSer: 4.816 ± 0.627
LeuThr: 4.489 ± 0.72
LeuVal: 5.387 ± 0.769
LeuTrp: 1.306 ± 0.39
LeuTyr: 2.53 ± 0.465
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.347 ± 0.494
MetCys: 0.163 ± 0.137
MetAsp: 1.633 ± 0.384
MetGlu: 1.306 ± 0.31
MetPhe: 0.898 ± 0.252
MetGly: 1.633 ± 0.281
MetHis: 0.571 ± 0.206
MetIle: 0.98 ± 0.29
MetLys: 1.388 ± 0.316
MetLeu: 2.694 ± 0.486
MetMet: 0.653 ± 0.215
MetAsn: 0.898 ± 0.235
MetPro: 0.653 ± 0.217
MetGln: 2.286 ± 0.435
MetArg: 0.98 ± 0.236
MetSer: 1.551 ± 0.384
MetThr: 1.714 ± 0.456
MetVal: 1.551 ± 0.378
MetTrp: 0.082 ± 0.074
MetTyr: 0.735 ± 0.277
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.673 ± 0.691
AsnCys: 0.408 ± 0.159
AsnAsp: 1.959 ± 0.362
AsnGlu: 2.612 ± 0.485
AsnPhe: 2.041 ± 0.428
AsnGly: 4.489 ± 0.698
AsnHis: 0.245 ± 0.134
AsnIle: 3.02 ± 0.51
AsnLys: 1.959 ± 0.382
AsnLeu: 3.102 ± 0.377
AsnMet: 0.98 ± 0.311
AsnAsn: 1.388 ± 0.328
AsnPro: 2.449 ± 0.368
AsnGln: 1.306 ± 0.257
AsnArg: 2.286 ± 0.57
AsnSer: 2.775 ± 0.538
AsnThr: 2.449 ± 0.351
AsnVal: 3.02 ± 0.523
AsnTrp: 0.735 ± 0.232
AsnTyr: 1.714 ± 0.373
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.102 ± 0.454
ProCys: 0.49 ± 0.199
ProAsp: 2.041 ± 0.379
ProGlu: 4.326 ± 0.658
ProPhe: 1.551 ± 0.301
ProGly: 1.959 ± 0.485
ProHis: 0.327 ± 0.149
ProIle: 1.061 ± 0.3
ProLys: 2.694 ± 0.47
ProLeu: 2.449 ± 0.403
ProMet: 0.735 ± 0.242
ProAsn: 1.959 ± 0.441
ProPro: 0.98 ± 0.328
ProGln: 1.224 ± 0.26
ProArg: 1.714 ± 0.352
ProSer: 2.204 ± 0.363
ProThr: 2.286 ± 0.485
ProVal: 2.857 ± 0.425
ProTrp: 0.816 ± 0.188
ProTyr: 2.122 ± 0.465
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.592 ± 0.499
GlnCys: 0.163 ± 0.126
GlnAsp: 2.775 ± 0.313
GlnGlu: 2.857 ± 0.435
GlnPhe: 1.469 ± 0.285
GlnGly: 2.775 ± 0.516
GlnHis: 0.327 ± 0.193
GlnIle: 1.714 ± 0.495
GlnLys: 3.428 ± 0.566
GlnLeu: 4.081 ± 0.607
GlnMet: 1.388 ± 0.451
GlnAsn: 1.388 ± 0.269
GlnPro: 1.714 ± 0.245
GlnGln: 3.428 ± 0.681
GlnArg: 2.449 ± 0.54
GlnSer: 2.449 ± 0.482
GlnThr: 1.959 ± 0.421
GlnVal: 2.612 ± 0.481
GlnTrp: 0.816 ± 0.261
GlnTyr: 1.469 ± 0.42
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.898 ± 0.856
ArgCys: 0.816 ± 0.298
ArgAsp: 3.183 ± 0.459
ArgGlu: 4.245 ± 0.521
ArgPhe: 1.551 ± 0.35
ArgGly: 3.918 ± 0.561
ArgHis: 0.898 ± 0.267
ArgIle: 2.775 ± 0.498
ArgLys: 3.836 ± 0.65
ArgLeu: 4.571 ± 0.667
ArgMet: 1.061 ± 0.251
ArgAsn: 2.286 ± 0.34
ArgPro: 2.041 ± 0.341
ArgGln: 2.694 ± 0.519
ArgArg: 2.612 ± 0.404
ArgSer: 3.755 ± 0.41
ArgThr: 3.183 ± 0.492
ArgVal: 3.918 ± 0.725
ArgTrp: 1.061 ± 0.309
ArgTyr: 1.143 ± 0.231
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.306 ± 0.837
SerCys: 0.735 ± 0.233
SerAsp: 4.245 ± 0.528
SerGlu: 3.428 ± 0.548
SerPhe: 3.183 ± 0.526
SerGly: 5.387 ± 0.891
SerHis: 1.633 ± 0.402
SerIle: 2.857 ± 0.453
SerLys: 4.0 ± 0.4
SerLeu: 4.898 ± 0.847
SerMet: 1.388 ± 0.361
SerAsn: 1.877 ± 0.446
SerPro: 1.796 ± 0.331
SerGln: 2.775 ± 0.43
SerArg: 3.428 ± 0.497
SerSer: 3.102 ± 0.472
SerThr: 4.081 ± 0.668
SerVal: 4.0 ± 0.558
SerTrp: 0.653 ± 0.213
SerTyr: 1.959 ± 0.49
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.163 ± 0.687
ThrCys: 0.98 ± 0.335
ThrAsp: 3.265 ± 0.464
ThrGlu: 3.836 ± 0.56
ThrPhe: 1.714 ± 0.384
ThrGly: 5.224 ± 0.549
ThrHis: 1.143 ± 0.255
ThrIle: 3.51 ± 0.464
ThrLys: 3.918 ± 0.523
ThrLeu: 4.898 ± 0.694
ThrMet: 1.224 ± 0.322
ThrAsn: 2.122 ± 0.515
ThrPro: 2.775 ± 0.431
ThrGln: 2.286 ± 0.376
ThrArg: 2.612 ± 0.467
ThrSer: 3.51 ± 0.468
ThrThr: 2.775 ± 0.658
ThrVal: 3.836 ± 0.67
ThrTrp: 0.816 ± 0.265
ThrTyr: 1.633 ± 0.411
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.122 ± 0.661
ValCys: 0.408 ± 0.19
ValAsp: 3.673 ± 0.553
ValGlu: 4.326 ± 0.696
ValPhe: 2.204 ± 0.618
ValGly: 5.142 ± 0.601
ValHis: 1.469 ± 0.388
ValIle: 3.51 ± 0.514
ValLys: 4.816 ± 0.598
ValLeu: 5.469 ± 0.796
ValMet: 1.306 ± 0.27
ValAsn: 3.592 ± 0.574
ValPro: 2.286 ± 0.513
ValGln: 2.449 ± 0.399
ValArg: 4.0 ± 0.611
ValSer: 4.898 ± 0.911
ValThr: 5.632 ± 0.808
ValVal: 4.571 ± 0.633
ValTrp: 0.898 ± 0.33
ValTyr: 2.775 ± 0.459
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.571 ± 0.175
TrpCys: 0.327 ± 0.166
TrpAsp: 0.653 ± 0.231
TrpGlu: 1.306 ± 0.272
TrpPhe: 0.327 ± 0.182
TrpGly: 0.816 ± 0.289
TrpHis: 0.408 ± 0.271
TrpIle: 0.653 ± 0.377
TrpLys: 1.143 ± 0.36
TrpLeu: 1.469 ± 0.369
TrpMet: 0.49 ± 0.217
TrpAsn: 0.571 ± 0.209
TrpPro: 0.327 ± 0.137
TrpGln: 0.898 ± 0.276
TrpArg: 0.98 ± 0.254
TrpSer: 1.224 ± 0.385
TrpThr: 0.816 ± 0.281
TrpVal: 1.143 ± 0.316
TrpTrp: 0.408 ± 0.148
TrpTyr: 0.082 ± 0.079
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.367 ± 0.451
TyrCys: 0.245 ± 0.135
TyrAsp: 2.939 ± 0.457
TyrGlu: 1.877 ± 0.456
TyrPhe: 1.224 ± 0.212
TyrGly: 3.673 ± 0.472
TyrHis: 0.571 ± 0.249
TyrIle: 1.224 ± 0.308
TyrLys: 1.796 ± 0.351
TyrLeu: 2.53 ± 0.439
TyrMet: 1.224 ± 0.305
TyrAsn: 1.959 ± 0.454
TyrPro: 1.061 ± 0.271
TyrGln: 1.796 ± 0.421
TyrArg: 2.286 ± 0.372
TyrSer: 1.714 ± 0.501
TyrThr: 2.449 ± 0.49
TyrVal: 2.775 ± 0.623
TyrTrp: 0.571 ± 0.195
TyrTyr: 1.061 ± 0.326
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 47 proteins (12252 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
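A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates is taken as the error. Using the standard deviation as the error measure, the function names, and the fixed seed are assumptions for this example; the page itself states only that bootstrapping (×100) at the protein level was used.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MKAAL", "AAGKA", "MAAAK", "GKLVA"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every resample.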

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski