Amino acid dipepetide frequency for Klebsiella phage CX1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.888AlaAla: 15.888 ± 1.768
0.791AlaCys: 0.791 ± 0.229
6.183AlaAsp: 6.183 ± 0.688
5.32AlaGlu: 5.32 ± 0.817
2.876AlaPhe: 2.876 ± 0.366
8.627AlaGly: 8.627 ± 1.14
1.438AlaHis: 1.438 ± 0.354
4.242AlaIle: 4.242 ± 0.614
5.607AlaLys: 5.607 ± 1.035
8.914AlaLeu: 8.914 ± 0.857
2.948AlaMet: 2.948 ± 0.388
3.163AlaAsn: 3.163 ± 0.371
4.385AlaPro: 4.385 ± 1.002
5.607AlaGln: 5.607 ± 0.939
6.111AlaArg: 6.111 ± 0.727
5.536AlaSer: 5.536 ± 0.658
6.254AlaThr: 6.254 ± 1.075
7.117AlaVal: 7.117 ± 0.731
1.15AlaTrp: 1.15 ± 0.302
4.17AlaTyr: 4.17 ± 0.469
0.0AlaXaa: 0.0 ± 0.0
Cys
1.006CysAla: 1.006 ± 0.346
0.359CysCys: 0.359 ± 0.238
0.503CysAsp: 0.503 ± 0.177
0.503CysGlu: 0.503 ± 0.193
0.359CysPhe: 0.359 ± 0.185
0.647CysGly: 0.647 ± 0.232
0.575CysHis: 0.575 ± 0.232
0.503CysIle: 0.503 ± 0.184
0.431CysLys: 0.431 ± 0.215
0.935CysLeu: 0.935 ± 0.309
0.431CysMet: 0.431 ± 0.176
0.288CysAsn: 0.288 ± 0.149
0.503CysPro: 0.503 ± 0.247
0.288CysGln: 0.288 ± 0.148
0.647CysArg: 0.647 ± 0.213
0.719CysSer: 0.719 ± 0.262
1.078CysThr: 1.078 ± 0.276
0.863CysVal: 0.863 ± 0.258
0.216CysTrp: 0.216 ± 0.133
0.503CysTyr: 0.503 ± 0.19
0.0CysXaa: 0.0 ± 0.0
Asp
7.62AspAla: 7.62 ± 0.964
0.863AspCys: 0.863 ± 0.284
3.163AspAsp: 3.163 ± 0.427
3.523AspGlu: 3.523 ± 0.622
2.372AspPhe: 2.372 ± 0.372
4.457AspGly: 4.457 ± 0.534
0.288AspHis: 0.288 ± 0.146
2.948AspIle: 2.948 ± 0.369
2.804AspLys: 2.804 ± 0.457
5.392AspLeu: 5.392 ± 0.525
2.804AspMet: 2.804 ± 0.383
2.66AspAsn: 2.66 ± 0.449
2.444AspPro: 2.444 ± 0.345
1.582AspGln: 1.582 ± 0.369
2.516AspArg: 2.516 ± 0.568
4.601AspSer: 4.601 ± 0.542
3.882AspThr: 3.882 ± 0.527
4.17AspVal: 4.17 ± 0.419
1.078AspTrp: 1.078 ± 0.206
2.157AspTyr: 2.157 ± 0.339
0.0AspXaa: 0.0 ± 0.0
Glu
6.039GluAla: 6.039 ± 0.787
0.359GluCys: 0.359 ± 0.152
2.948GluAsp: 2.948 ± 0.372
3.81GluGlu: 3.81 ± 0.659
2.372GluPhe: 2.372 ± 0.32
3.81GluGly: 3.81 ± 0.59
2.157GluHis: 2.157 ± 0.521
2.157GluIle: 2.157 ± 0.331
1.869GluLys: 1.869 ± 0.379
5.536GluLeu: 5.536 ± 0.613
2.085GluMet: 2.085 ± 0.357
2.013GluAsn: 2.013 ± 0.465
1.941GluPro: 1.941 ± 0.38
3.451GluGln: 3.451 ± 0.611
3.738GluArg: 3.738 ± 0.601
2.948GluSer: 2.948 ± 0.518
2.876GluThr: 2.876 ± 0.469
5.679GluVal: 5.679 ± 0.524
0.863GluTrp: 0.863 ± 0.204
2.301GluTyr: 2.301 ± 0.374
0.0GluXaa: 0.0 ± 0.0
Phe
2.804PheAla: 2.804 ± 0.378
0.647PheCys: 0.647 ± 0.312
2.013PheAsp: 2.013 ± 0.314
2.013PheGlu: 2.013 ± 0.408
1.15PhePhe: 1.15 ± 0.234
2.013PheGly: 2.013 ± 0.32
0.288PheHis: 0.288 ± 0.137
1.222PheIle: 1.222 ± 0.264
1.725PheLys: 1.725 ± 0.383
2.157PheLeu: 2.157 ± 0.378
0.575PheMet: 0.575 ± 0.23
1.725PheAsn: 1.725 ± 0.376
1.366PhePro: 1.366 ± 0.267
1.438PheGln: 1.438 ± 0.236
2.013PheArg: 2.013 ± 0.442
1.582PheSer: 1.582 ± 0.315
2.013PheThr: 2.013 ± 0.43
1.797PheVal: 1.797 ± 0.434
0.503PheTrp: 0.503 ± 0.183
1.51PheTyr: 1.51 ± 0.258
0.0PheXaa: 0.0 ± 0.0
Gly
6.039GlyAla: 6.039 ± 0.63
1.51GlyCys: 1.51 ± 0.394
5.104GlyAsp: 5.104 ± 0.629
3.954GlyGlu: 3.954 ± 0.516
2.444GlyPhe: 2.444 ± 0.441
4.817GlyGly: 4.817 ± 0.677
1.366GlyHis: 1.366 ± 0.34
4.385GlyIle: 4.385 ± 0.585
3.81GlyLys: 3.81 ± 0.607
6.902GlyLeu: 6.902 ± 0.698
1.725GlyMet: 1.725 ± 0.417
3.882GlyAsn: 3.882 ± 0.691
1.653GlyPro: 1.653 ± 0.337
3.091GlyGln: 3.091 ± 0.439
5.32GlyArg: 5.32 ± 0.564
4.96GlySer: 4.96 ± 0.612
4.673GlyThr: 4.673 ± 0.552
5.679GlyVal: 5.679 ± 0.551
0.719GlyTrp: 0.719 ± 0.216
3.235GlyTyr: 3.235 ± 0.636
0.0GlyXaa: 0.0 ± 0.0
His
1.582HisAla: 1.582 ± 0.381
0.216HisCys: 0.216 ± 0.119
1.15HisAsp: 1.15 ± 0.309
1.294HisGlu: 1.294 ± 0.38
0.144HisPhe: 0.144 ± 0.098
2.157HisGly: 2.157 ± 0.515
0.072HisHis: 0.072 ± 0.078
0.719HisIle: 0.719 ± 0.271
0.935HisLys: 0.935 ± 0.231
2.157HisLeu: 2.157 ± 0.423
0.359HisMet: 0.359 ± 0.135
0.791HisAsn: 0.791 ± 0.238
0.719HisPro: 0.719 ± 0.286
0.431HisGln: 0.431 ± 0.212
1.438HisArg: 1.438 ± 0.307
0.935HisSer: 0.935 ± 0.262
0.935HisThr: 0.935 ± 0.272
0.719HisVal: 0.719 ± 0.236
0.288HisTrp: 0.288 ± 0.135
0.791HisTyr: 0.791 ± 0.257
0.0HisXaa: 0.0 ± 0.0
Ile
3.163IleAla: 3.163 ± 0.493
0.359IleCys: 0.359 ± 0.13
2.876IleAsp: 2.876 ± 0.425
2.732IleGlu: 2.732 ± 0.499
0.575IlePhe: 0.575 ± 0.141
2.516IleGly: 2.516 ± 0.399
0.791IleHis: 0.791 ± 0.222
1.941IleIle: 1.941 ± 0.324
3.307IleLys: 3.307 ± 0.469
4.313IleLeu: 4.313 ± 0.577
1.582IleMet: 1.582 ± 0.218
1.869IleAsn: 1.869 ± 0.431
1.869IlePro: 1.869 ± 0.449
2.588IleGln: 2.588 ± 0.421
2.372IleArg: 2.372 ± 0.335
3.091IleSer: 3.091 ± 0.448
3.163IleThr: 3.163 ± 0.563
2.66IleVal: 2.66 ± 0.443
0.216IleTrp: 0.216 ± 0.125
1.294IleTyr: 1.294 ± 0.271
0.0IleXaa: 0.0 ± 0.0
Lys
6.47LysAla: 6.47 ± 0.894
0.359LysCys: 0.359 ± 0.201
2.372LysAsp: 2.372 ± 0.385
3.163LysGlu: 3.163 ± 0.482
1.222LysPhe: 1.222 ± 0.279
3.379LysGly: 3.379 ± 0.52
1.078LysHis: 1.078 ± 0.295
1.222LysIle: 1.222 ± 0.321
1.869LysLys: 1.869 ± 0.406
4.889LysLeu: 4.889 ± 0.64
1.51LysMet: 1.51 ± 0.277
1.438LysAsn: 1.438 ± 0.297
1.222LysPro: 1.222 ± 0.335
3.091LysGln: 3.091 ± 0.559
2.804LysArg: 2.804 ± 0.536
3.235LysSer: 3.235 ± 0.465
2.588LysThr: 2.588 ± 0.373
3.738LysVal: 3.738 ± 0.647
1.006LysTrp: 1.006 ± 0.254
1.438LysTyr: 1.438 ± 0.375
0.0LysXaa: 0.0 ± 0.0
Leu
8.267LeuAla: 8.267 ± 0.758
1.078LeuCys: 1.078 ± 0.315
6.614LeuAsp: 6.614 ± 0.666
5.536LeuGlu: 5.536 ± 0.581
2.372LeuPhe: 2.372 ± 0.369
6.902LeuGly: 6.902 ± 0.882
1.366LeuHis: 1.366 ± 0.325
4.242LeuIle: 4.242 ± 0.663
2.876LeuLys: 2.876 ± 0.396
6.398LeuLeu: 6.398 ± 0.608
1.725LeuMet: 1.725 ± 0.308
3.81LeuAsn: 3.81 ± 0.517
3.307LeuPro: 3.307 ± 0.509
4.098LeuGln: 4.098 ± 0.51
6.758LeuArg: 6.758 ± 0.65
5.248LeuSer: 5.248 ± 0.805
5.32LeuThr: 5.32 ± 0.616
6.758LeuVal: 6.758 ± 0.753
1.222LeuTrp: 1.222 ± 0.315
3.523LeuTyr: 3.523 ± 0.507
0.0LeuXaa: 0.0 ± 0.0
Met
3.738MetAla: 3.738 ± 0.625
0.288MetCys: 0.288 ± 0.13
1.797MetAsp: 1.797 ± 0.449
1.222MetGlu: 1.222 ± 0.302
0.719MetPhe: 0.719 ± 0.293
1.582MetGly: 1.582 ± 0.296
0.935MetHis: 0.935 ± 0.28
0.719MetIle: 0.719 ± 0.246
1.078MetLys: 1.078 ± 0.287
3.595MetLeu: 3.595 ± 0.496
0.575MetMet: 0.575 ± 0.22
0.935MetAsn: 0.935 ± 0.288
0.935MetPro: 0.935 ± 0.232
1.869MetGln: 1.869 ± 0.444
2.157MetArg: 2.157 ± 0.368
2.372MetSer: 2.372 ± 0.418
0.863MetThr: 0.863 ± 0.292
2.013MetVal: 2.013 ± 0.345
0.575MetTrp: 0.575 ± 0.164
1.294MetTyr: 1.294 ± 0.336
0.0MetXaa: 0.0 ± 0.0
Asn
2.876AsnAla: 2.876 ± 0.443
0.359AsnCys: 0.359 ± 0.183
2.372AsnAsp: 2.372 ± 0.502
1.438AsnGlu: 1.438 ± 0.319
0.791AsnPhe: 0.791 ± 0.181
4.026AsnGly: 4.026 ± 0.606
0.288AsnHis: 0.288 ± 0.165
2.804AsnIle: 2.804 ± 0.426
2.085AsnLys: 2.085 ± 0.368
2.948AsnLeu: 2.948 ± 0.43
1.294AsnMet: 1.294 ± 0.374
1.653AsnAsn: 1.653 ± 0.329
2.804AsnPro: 2.804 ± 0.487
1.653AsnGln: 1.653 ± 0.372
1.941AsnArg: 1.941 ± 0.323
2.732AsnSer: 2.732 ± 0.38
2.804AsnThr: 2.804 ± 0.441
3.307AsnVal: 3.307 ± 0.393
0.791AsnTrp: 0.791 ± 0.225
1.51AsnTyr: 1.51 ± 0.367
0.0AsnXaa: 0.0 ± 0.0
Pro
4.529ProAla: 4.529 ± 0.879
0.144ProCys: 0.144 ± 0.097
2.301ProAsp: 2.301 ± 0.469
3.523ProGlu: 3.523 ± 0.475
1.15ProPhe: 1.15 ± 0.246
2.876ProGly: 2.876 ± 0.636
0.575ProHis: 0.575 ± 0.249
1.582ProIle: 1.582 ± 0.354
1.366ProLys: 1.366 ± 0.384
2.804ProLeu: 2.804 ± 0.409
1.15ProMet: 1.15 ± 0.235
1.294ProAsn: 1.294 ± 0.361
0.575ProPro: 0.575 ± 0.193
1.366ProGln: 1.366 ± 0.273
2.013ProArg: 2.013 ± 0.364
2.157ProSer: 2.157 ± 0.483
3.019ProThr: 3.019 ± 0.404
2.804ProVal: 2.804 ± 0.421
0.647ProTrp: 0.647 ± 0.224
1.438ProTyr: 1.438 ± 0.318
0.0ProXaa: 0.0 ± 0.0
Gln
5.248GlnAla: 5.248 ± 0.785
0.575GlnCys: 0.575 ± 0.209
2.948GlnAsp: 2.948 ± 0.491
3.738GlnGlu: 3.738 ± 0.534
1.438GlnPhe: 1.438 ± 0.4
3.235GlnGly: 3.235 ± 0.52
1.294GlnHis: 1.294 ± 0.316
1.078GlnIle: 1.078 ± 0.338
2.372GlnLys: 2.372 ± 0.518
4.457GlnLeu: 4.457 ± 0.612
1.366GlnMet: 1.366 ± 0.27
2.157GlnAsn: 2.157 ± 0.377
1.438GlnPro: 1.438 ± 0.361
2.876GlnGln: 2.876 ± 0.653
2.948GlnArg: 2.948 ± 0.406
2.948GlnSer: 2.948 ± 0.445
1.653GlnThr: 1.653 ± 0.39
2.588GlnVal: 2.588 ± 0.408
0.719GlnTrp: 0.719 ± 0.245
1.797GlnTyr: 1.797 ± 0.454
0.0GlnXaa: 0.0 ± 0.0
Arg
6.326ArgAla: 6.326 ± 0.915
0.719ArgCys: 0.719 ± 0.247
3.666ArgAsp: 3.666 ± 0.619
3.954ArgGlu: 3.954 ± 0.508
2.66ArgPhe: 2.66 ± 0.452
3.882ArgGly: 3.882 ± 0.667
0.791ArgHis: 0.791 ± 0.195
3.091ArgIle: 3.091 ± 0.556
3.235ArgLys: 3.235 ± 0.623
5.032ArgLeu: 5.032 ± 0.521
1.941ArgMet: 1.941 ± 0.394
2.66ArgAsn: 2.66 ± 0.389
1.797ArgPro: 1.797 ± 0.336
2.588ArgGln: 2.588 ± 0.327
4.385ArgArg: 4.385 ± 0.761
2.588ArgSer: 2.588 ± 0.561
3.523ArgThr: 3.523 ± 0.313
3.738ArgVal: 3.738 ± 0.475
1.006ArgTrp: 1.006 ± 0.243
1.869ArgTyr: 1.869 ± 0.323
0.0ArgXaa: 0.0 ± 0.0
Ser
8.627SerAla: 8.627 ± 0.846
0.647SerCys: 0.647 ± 0.222
4.385SerAsp: 4.385 ± 0.472
3.019SerGlu: 3.019 ± 0.473
2.157SerPhe: 2.157 ± 0.313
5.464SerGly: 5.464 ± 0.782
0.503SerHis: 0.503 ± 0.204
2.876SerIle: 2.876 ± 0.441
4.17SerLys: 4.17 ± 0.513
4.745SerLeu: 4.745 ± 0.606
2.732SerMet: 2.732 ± 0.456
2.804SerAsn: 2.804 ± 0.576
2.516SerPro: 2.516 ± 0.366
1.653SerGln: 1.653 ± 0.389
2.588SerArg: 2.588 ± 0.368
4.313SerSer: 4.313 ± 0.697
3.81SerThr: 3.81 ± 0.434
4.026SerVal: 4.026 ± 0.534
0.863SerTrp: 0.863 ± 0.256
1.51SerTyr: 1.51 ± 0.374
0.0SerXaa: 0.0 ± 0.0
Thr
5.895ThrAla: 5.895 ± 0.78
0.575ThrCys: 0.575 ± 0.234
3.163ThrAsp: 3.163 ± 0.437
2.732ThrGlu: 2.732 ± 0.564
2.372ThrPhe: 2.372 ± 0.561
4.96ThrGly: 4.96 ± 0.787
1.222ThrHis: 1.222 ± 0.302
2.085ThrIle: 2.085 ± 0.397
2.948ThrLys: 2.948 ± 0.411
5.679ThrLeu: 5.679 ± 0.645
1.582ThrMet: 1.582 ± 0.432
2.444ThrAsn: 2.444 ± 0.52
3.019ThrPro: 3.019 ± 0.354
2.588ThrGln: 2.588 ± 0.382
2.588ThrArg: 2.588 ± 0.536
4.457ThrSer: 4.457 ± 0.548
4.17ThrThr: 4.17 ± 0.749
4.313ThrVal: 4.313 ± 0.576
0.935ThrTrp: 0.935 ± 0.206
2.085ThrTyr: 2.085 ± 0.437
0.0ThrXaa: 0.0 ± 0.0
Val
6.398ValAla: 6.398 ± 0.686
0.431ValCys: 0.431 ± 0.176
4.817ValAsp: 4.817 ± 0.665
4.026ValGlu: 4.026 ± 0.629
1.438ValPhe: 1.438 ± 0.287
6.326ValGly: 6.326 ± 0.643
1.797ValHis: 1.797 ± 0.385
2.66ValIle: 2.66 ± 0.389
3.163ValLys: 3.163 ± 0.687
5.823ValLeu: 5.823 ± 0.898
1.941ValMet: 1.941 ± 0.352
2.876ValAsn: 2.876 ± 0.464
3.019ValPro: 3.019 ± 0.522
3.954ValGln: 3.954 ± 0.808
3.954ValArg: 3.954 ± 0.553
5.248ValSer: 5.248 ± 0.631
3.523ValThr: 3.523 ± 0.649
5.967ValVal: 5.967 ± 0.781
0.719ValTrp: 0.719 ± 0.241
2.804ValTyr: 2.804 ± 0.396
0.0ValXaa: 0.0 ± 0.0
Trp
1.366TrpAla: 1.366 ± 0.277
0.288TrpCys: 0.288 ± 0.12
0.575TrpAsp: 0.575 ± 0.19
1.006TrpGlu: 1.006 ± 0.267
0.791TrpPhe: 0.791 ± 0.321
0.719TrpGly: 0.719 ± 0.253
0.431TrpHis: 0.431 ± 0.186
0.647TrpIle: 0.647 ± 0.26
0.647TrpLys: 0.647 ± 0.21
1.15TrpLeu: 1.15 ± 0.24
0.144TrpMet: 0.144 ± 0.097
0.863TrpAsn: 0.863 ± 0.234
0.431TrpPro: 0.431 ± 0.213
0.719TrpGln: 0.719 ± 0.218
0.719TrpArg: 0.719 ± 0.221
0.719TrpSer: 0.719 ± 0.265
1.15TrpThr: 1.15 ± 0.252
1.006TrpVal: 1.006 ± 0.272
0.431TrpTrp: 0.431 ± 0.192
0.791TrpTyr: 0.791 ± 0.258
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.444TyrAla: 2.444 ± 0.479
0.791TyrCys: 0.791 ± 0.292
2.372TyrAsp: 2.372 ± 0.41
2.444TyrGlu: 2.444 ± 0.524
1.15TyrPhe: 1.15 ± 0.245
2.948TyrGly: 2.948 ± 0.581
0.575TyrHis: 0.575 ± 0.21
2.157TyrIle: 2.157 ± 0.46
1.797TyrLys: 1.797 ± 0.327
3.595TyrLeu: 3.595 ± 0.457
0.719TyrMet: 0.719 ± 0.266
1.15TyrAsn: 1.15 ± 0.22
1.366TyrPro: 1.366 ± 0.244
1.941TyrGln: 1.941 ± 0.391
2.444TyrArg: 2.444 ± 0.451
3.019TyrSer: 3.019 ± 0.42
2.516TyrThr: 2.516 ± 0.439
1.941TyrVal: 1.941 ± 0.33
0.647TyrTrp: 0.647 ± 0.217
1.51TyrTyr: 1.51 ± 0.379
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (13911 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski