Amino acid dipepetide frequency for Klebsiella phage NTUH-K2044-K1-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.575AlaAla: 14.575 ± 1.406
0.567AlaCys: 0.567 ± 0.224
5.587AlaAsp: 5.587 ± 0.692
5.263AlaGlu: 5.263 ± 0.649
3.32AlaPhe: 3.32 ± 0.456
8.34AlaGly: 8.34 ± 1.282
1.538AlaHis: 1.538 ± 0.464
4.211AlaIle: 4.211 ± 0.587
4.453AlaLys: 4.453 ± 0.99
9.231AlaLeu: 9.231 ± 0.733
2.996AlaMet: 2.996 ± 0.345
3.158AlaAsn: 3.158 ± 0.509
4.615AlaPro: 4.615 ± 1.136
4.858AlaGln: 4.858 ± 0.97
5.668AlaArg: 5.668 ± 0.794
5.506AlaSer: 5.506 ± 0.637
5.101AlaThr: 5.101 ± 0.655
6.397AlaVal: 6.397 ± 0.698
1.134AlaTrp: 1.134 ± 0.329
4.291AlaTyr: 4.291 ± 0.569
0.0AlaXaa: 0.0 ± 0.0
Cys
0.81CysAla: 0.81 ± 0.287
0.486CysCys: 0.486 ± 0.238
0.486CysAsp: 0.486 ± 0.161
0.648CysGlu: 0.648 ± 0.218
0.324CysPhe: 0.324 ± 0.143
0.486CysGly: 0.486 ± 0.167
0.324CysHis: 0.324 ± 0.155
0.243CysIle: 0.243 ± 0.139
0.567CysLys: 0.567 ± 0.222
1.134CysLeu: 1.134 ± 0.275
0.891CysMet: 0.891 ± 0.249
0.567CysAsn: 0.567 ± 0.249
0.567CysPro: 0.567 ± 0.267
0.243CysGln: 0.243 ± 0.148
1.053CysArg: 1.053 ± 0.219
0.891CysSer: 0.891 ± 0.283
0.81CysThr: 0.81 ± 0.259
1.053CysVal: 1.053 ± 0.311
0.324CysTrp: 0.324 ± 0.163
0.648CysTyr: 0.648 ± 0.194
0.0CysXaa: 0.0 ± 0.0
Asp
7.449AspAla: 7.449 ± 1.144
1.053AspCys: 1.053 ± 0.353
2.996AspAsp: 2.996 ± 0.419
3.32AspGlu: 3.32 ± 0.581
2.429AspPhe: 2.429 ± 0.456
5.101AspGly: 5.101 ± 0.7
0.405AspHis: 0.405 ± 0.18
2.996AspIle: 2.996 ± 0.469
2.915AspLys: 2.915 ± 0.46
5.344AspLeu: 5.344 ± 0.593
2.429AspMet: 2.429 ± 0.4
2.834AspAsn: 2.834 ± 0.442
2.186AspPro: 2.186 ± 0.428
1.781AspGln: 1.781 ± 0.417
2.51AspArg: 2.51 ± 0.671
4.939AspSer: 4.939 ± 0.525
4.13AspThr: 4.13 ± 0.569
4.049AspVal: 4.049 ± 0.558
1.215AspTrp: 1.215 ± 0.248
2.105AspTyr: 2.105 ± 0.347
0.0AspXaa: 0.0 ± 0.0
Glu
5.263GluAla: 5.263 ± 0.834
0.648GluCys: 0.648 ± 0.199
3.077GluAsp: 3.077 ± 0.466
3.806GluGlu: 3.806 ± 0.891
2.672GluPhe: 2.672 ± 0.434
3.887GluGly: 3.887 ± 0.517
2.186GluHis: 2.186 ± 0.408
2.267GluIle: 2.267 ± 0.385
1.619GluLys: 1.619 ± 0.408
5.263GluLeu: 5.263 ± 0.579
2.024GluMet: 2.024 ± 0.413
2.186GluAsn: 2.186 ± 0.539
1.7GluPro: 1.7 ± 0.278
3.482GluGln: 3.482 ± 0.566
3.563GluArg: 3.563 ± 0.577
2.348GluSer: 2.348 ± 0.537
3.077GluThr: 3.077 ± 0.446
5.344GluVal: 5.344 ± 0.669
0.729GluTrp: 0.729 ± 0.206
2.51GluTyr: 2.51 ± 0.36
0.0GluXaa: 0.0 ± 0.0
Phe
2.753PheAla: 2.753 ± 0.491
0.486PheCys: 0.486 ± 0.272
1.781PheAsp: 1.781 ± 0.358
2.105PheGlu: 2.105 ± 0.452
1.296PhePhe: 1.296 ± 0.275
2.024PheGly: 2.024 ± 0.301
0.648PheHis: 0.648 ± 0.272
1.053PheIle: 1.053 ± 0.246
1.538PheLys: 1.538 ± 0.392
2.105PheLeu: 2.105 ± 0.496
0.648PheMet: 0.648 ± 0.228
1.377PheAsn: 1.377 ± 0.341
1.377PhePro: 1.377 ± 0.277
1.619PheGln: 1.619 ± 0.223
1.619PheArg: 1.619 ± 0.449
1.862PheSer: 1.862 ± 0.258
2.105PheThr: 2.105 ± 0.506
2.429PheVal: 2.429 ± 0.553
0.648PheTrp: 0.648 ± 0.179
1.457PheTyr: 1.457 ± 0.282
0.0PheXaa: 0.0 ± 0.0
Gly
5.425GlyAla: 5.425 ± 0.571
1.538GlyCys: 1.538 ± 0.411
5.02GlyAsp: 5.02 ± 0.604
3.806GlyGlu: 3.806 ± 0.436
2.672GlyPhe: 2.672 ± 0.56
4.777GlyGly: 4.777 ± 0.77
1.457GlyHis: 1.457 ± 0.273
4.777GlyIle: 4.777 ± 0.575
3.725GlyLys: 3.725 ± 0.591
6.154GlyLeu: 6.154 ± 0.728
2.105GlyMet: 2.105 ± 0.548
3.725GlyAsn: 3.725 ± 0.533
1.781GlyPro: 1.781 ± 0.31
3.401GlyGln: 3.401 ± 0.428
5.02GlyArg: 5.02 ± 0.435
5.344GlySer: 5.344 ± 0.528
4.939GlyThr: 4.939 ± 0.682
6.154GlyVal: 6.154 ± 0.849
0.891GlyTrp: 0.891 ± 0.243
2.996GlyTyr: 2.996 ± 0.507
0.0GlyXaa: 0.0 ± 0.0
His
1.619HisAla: 1.619 ± 0.374
0.324HisCys: 0.324 ± 0.147
1.296HisAsp: 1.296 ± 0.356
1.215HisGlu: 1.215 ± 0.306
0.324HisPhe: 0.324 ± 0.141
1.943HisGly: 1.943 ± 0.454
0.162HisHis: 0.162 ± 0.107
1.296HisIle: 1.296 ± 0.357
1.053HisLys: 1.053 ± 0.235
2.267HisLeu: 2.267 ± 0.507
0.486HisMet: 0.486 ± 0.167
0.648HisAsn: 0.648 ± 0.246
0.972HisPro: 0.972 ± 0.362
0.486HisGln: 0.486 ± 0.231
1.377HisArg: 1.377 ± 0.299
0.81HisSer: 0.81 ± 0.222
0.648HisThr: 0.648 ± 0.221
0.729HisVal: 0.729 ± 0.247
0.405HisTrp: 0.405 ± 0.269
0.648HisTyr: 0.648 ± 0.214
0.0HisXaa: 0.0 ± 0.0
Ile
2.753IleAla: 2.753 ± 0.481
0.486IleCys: 0.486 ± 0.185
2.996IleAsp: 2.996 ± 0.394
2.591IleGlu: 2.591 ± 0.484
0.567IlePhe: 0.567 ± 0.165
2.996IleGly: 2.996 ± 0.492
1.053IleHis: 1.053 ± 0.226
1.943IleIle: 1.943 ± 0.287
2.591IleLys: 2.591 ± 0.438
3.968IleLeu: 3.968 ± 0.551
1.215IleMet: 1.215 ± 0.237
1.862IleAsn: 1.862 ± 0.294
2.591IlePro: 2.591 ± 0.477
2.915IleGln: 2.915 ± 0.423
3.077IleArg: 3.077 ± 0.444
3.239IleSer: 3.239 ± 0.431
2.591IleThr: 2.591 ± 0.415
2.672IleVal: 2.672 ± 0.323
0.162IleTrp: 0.162 ± 0.123
1.296IleTyr: 1.296 ± 0.209
0.0IleXaa: 0.0 ± 0.0
Lys
5.263LysAla: 5.263 ± 0.8
0.405LysCys: 0.405 ± 0.196
2.51LysAsp: 2.51 ± 0.338
2.996LysGlu: 2.996 ± 0.489
1.296LysPhe: 1.296 ± 0.361
3.077LysGly: 3.077 ± 0.583
0.81LysHis: 0.81 ± 0.212
1.377LysIle: 1.377 ± 0.274
1.781LysLys: 1.781 ± 0.462
4.453LysLeu: 4.453 ± 0.673
1.377LysMet: 1.377 ± 0.345
0.972LysAsn: 0.972 ± 0.289
1.781LysPro: 1.781 ± 0.442
2.834LysGln: 2.834 ± 0.543
2.996LysArg: 2.996 ± 0.512
3.158LysSer: 3.158 ± 0.397
2.429LysThr: 2.429 ± 0.426
2.915LysVal: 2.915 ± 0.523
1.053LysTrp: 1.053 ± 0.25
1.296LysTyr: 1.296 ± 0.399
0.0LysXaa: 0.0 ± 0.0
Leu
8.34LeuAla: 8.34 ± 0.883
1.377LeuCys: 1.377 ± 0.343
6.721LeuAsp: 6.721 ± 0.587
5.02LeuGlu: 5.02 ± 0.553
2.672LeuPhe: 2.672 ± 0.358
6.478LeuGly: 6.478 ± 0.606
1.7LeuHis: 1.7 ± 0.332
3.968LeuIle: 3.968 ± 0.609
2.915LeuLys: 2.915 ± 0.561
6.802LeuLeu: 6.802 ± 0.781
1.943LeuMet: 1.943 ± 0.327
4.049LeuAsn: 4.049 ± 0.582
3.482LeuPro: 3.482 ± 0.544
4.211LeuGln: 4.211 ± 0.581
6.721LeuArg: 6.721 ± 0.664
5.263LeuSer: 5.263 ± 0.74
4.696LeuThr: 4.696 ± 0.629
6.316LeuVal: 6.316 ± 0.586
1.215LeuTrp: 1.215 ± 0.298
3.32LeuTyr: 3.32 ± 0.482
0.0LeuXaa: 0.0 ± 0.0
Met
2.915MetAla: 2.915 ± 0.527
0.162MetCys: 0.162 ± 0.11
2.105MetAsp: 2.105 ± 0.442
1.296MetGlu: 1.296 ± 0.331
0.648MetPhe: 0.648 ± 0.251
1.7MetGly: 1.7 ± 0.288
0.891MetHis: 0.891 ± 0.289
0.486MetIle: 0.486 ± 0.172
1.377MetLys: 1.377 ± 0.384
3.401MetLeu: 3.401 ± 0.571
0.567MetMet: 0.567 ± 0.265
1.053MetAsn: 1.053 ± 0.319
1.134MetPro: 1.134 ± 0.289
2.51MetGln: 2.51 ± 0.473
2.348MetArg: 2.348 ± 0.469
2.024MetSer: 2.024 ± 0.394
0.972MetThr: 0.972 ± 0.251
2.429MetVal: 2.429 ± 0.434
0.486MetTrp: 0.486 ± 0.145
1.134MetTyr: 1.134 ± 0.306
0.0MetXaa: 0.0 ± 0.0
Asn
2.996AsnAla: 2.996 ± 0.45
0.567AsnCys: 0.567 ± 0.332
2.429AsnAsp: 2.429 ± 0.504
1.457AsnGlu: 1.457 ± 0.323
0.891AsnPhe: 0.891 ± 0.272
3.725AsnGly: 3.725 ± 0.407
0.243AsnHis: 0.243 ± 0.141
2.186AsnIle: 2.186 ± 0.38
2.186AsnLys: 2.186 ± 0.361
3.077AsnLeu: 3.077 ± 0.583
1.053AsnMet: 1.053 ± 0.317
1.457AsnAsn: 1.457 ± 0.304
2.348AsnPro: 2.348 ± 0.324
1.7AsnGln: 1.7 ± 0.37
2.348AsnArg: 2.348 ± 0.459
3.077AsnSer: 3.077 ± 0.467
2.753AsnThr: 2.753 ± 0.398
3.077AsnVal: 3.077 ± 0.403
0.567AsnTrp: 0.567 ± 0.25
1.538AsnTyr: 1.538 ± 0.334
0.0AsnXaa: 0.0 ± 0.0
Pro
4.858ProAla: 4.858 ± 1.037
0.405ProCys: 0.405 ± 0.206
2.915ProAsp: 2.915 ± 0.526
3.401ProGlu: 3.401 ± 0.527
0.891ProPhe: 0.891 ± 0.24
2.51ProGly: 2.51 ± 0.489
0.486ProHis: 0.486 ± 0.218
2.348ProIle: 2.348 ± 0.419
1.862ProLys: 1.862 ± 0.422
2.834ProLeu: 2.834 ± 0.446
1.053ProMet: 1.053 ± 0.223
1.457ProAsn: 1.457 ± 0.421
0.729ProPro: 0.729 ± 0.252
1.296ProGln: 1.296 ± 0.281
1.943ProArg: 1.943 ± 0.328
3.077ProSer: 3.077 ± 0.569
2.834ProThr: 2.834 ± 0.457
2.915ProVal: 2.915 ± 0.413
0.648ProTrp: 0.648 ± 0.223
1.296ProTyr: 1.296 ± 0.327
0.0ProXaa: 0.0 ± 0.0
Gln
5.02GlnAla: 5.02 ± 0.792
0.324GlnCys: 0.324 ± 0.191
3.401GlnAsp: 3.401 ± 0.413
4.049GlnGlu: 4.049 ± 0.576
1.134GlnPhe: 1.134 ± 0.31
2.753GlnGly: 2.753 ± 0.496
1.296GlnHis: 1.296 ± 0.386
1.215GlnIle: 1.215 ± 0.329
2.348GlnLys: 2.348 ± 0.521
4.615GlnLeu: 4.615 ± 0.616
1.215GlnMet: 1.215 ± 0.255
2.429GlnAsn: 2.429 ± 0.38
1.538GlnPro: 1.538 ± 0.383
2.591GlnGln: 2.591 ± 0.662
3.32GlnArg: 3.32 ± 0.403
3.077GlnSer: 3.077 ± 0.475
1.862GlnThr: 1.862 ± 0.434
2.834GlnVal: 2.834 ± 0.414
0.648GlnTrp: 0.648 ± 0.225
2.024GlnTyr: 2.024 ± 0.478
0.0GlnXaa: 0.0 ± 0.0
Arg
6.64ArgAla: 6.64 ± 1.022
0.891ArgCys: 0.891 ± 0.282
3.401ArgAsp: 3.401 ± 0.584
3.806ArgGlu: 3.806 ± 0.522
2.348ArgPhe: 2.348 ± 0.354
4.534ArgGly: 4.534 ± 0.782
0.891ArgHis: 0.891 ± 0.208
3.158ArgIle: 3.158 ± 0.616
2.996ArgLys: 2.996 ± 0.583
5.425ArgLeu: 5.425 ± 0.542
2.186ArgMet: 2.186 ± 0.417
2.591ArgAsn: 2.591 ± 0.387
1.619ArgPro: 1.619 ± 0.405
2.429ArgGln: 2.429 ± 0.439
4.13ArgArg: 4.13 ± 0.715
3.239ArgSer: 3.239 ± 0.526
3.644ArgThr: 3.644 ± 0.412
3.806ArgVal: 3.806 ± 0.467
0.972ArgTrp: 0.972 ± 0.202
2.348ArgTyr: 2.348 ± 0.301
0.0ArgXaa: 0.0 ± 0.0
Ser
8.259SerAla: 8.259 ± 0.912
0.891SerCys: 0.891 ± 0.28
4.049SerAsp: 4.049 ± 0.546
2.996SerGlu: 2.996 ± 0.541
2.105SerPhe: 2.105 ± 0.359
5.668SerGly: 5.668 ± 0.592
0.729SerHis: 0.729 ± 0.235
3.239SerIle: 3.239 ± 0.601
3.644SerLys: 3.644 ± 0.645
4.372SerLeu: 4.372 ± 0.657
3.32SerMet: 3.32 ± 0.362
3.32SerAsn: 3.32 ± 0.666
3.077SerPro: 3.077 ± 0.368
1.943SerGln: 1.943 ± 0.341
2.753SerArg: 2.753 ± 0.423
4.13SerSer: 4.13 ± 0.728
4.372SerThr: 4.372 ± 0.781
4.291SerVal: 4.291 ± 0.538
0.81SerTrp: 0.81 ± 0.278
1.862SerTyr: 1.862 ± 0.34
0.0SerXaa: 0.0 ± 0.0
Thr
5.911ThrAla: 5.911 ± 0.839
0.81ThrCys: 0.81 ± 0.3
2.834ThrAsp: 2.834 ± 0.372
2.672ThrGlu: 2.672 ± 0.572
1.862ThrPhe: 1.862 ± 0.421
5.263ThrGly: 5.263 ± 0.692
1.215ThrHis: 1.215 ± 0.26
1.7ThrIle: 1.7 ± 0.345
2.024ThrLys: 2.024 ± 0.447
5.182ThrLeu: 5.182 ± 0.663
1.781ThrMet: 1.781 ± 0.336
1.619ThrAsn: 1.619 ± 0.406
3.158ThrPro: 3.158 ± 0.329
2.672ThrGln: 2.672 ± 0.342
2.753ThrArg: 2.753 ± 0.592
5.02ThrSer: 5.02 ± 0.654
3.32ThrThr: 3.32 ± 0.405
4.696ThrVal: 4.696 ± 0.608
0.81ThrTrp: 0.81 ± 0.205
2.753ThrTyr: 2.753 ± 0.453
0.0ThrXaa: 0.0 ± 0.0
Val
6.802ValAla: 6.802 ± 0.808
0.324ValCys: 0.324 ± 0.136
5.02ValAsp: 5.02 ± 0.591
3.806ValGlu: 3.806 ± 0.584
1.296ValPhe: 1.296 ± 0.358
6.559ValGly: 6.559 ± 0.853
1.862ValHis: 1.862 ± 0.38
2.591ValIle: 2.591 ± 0.46
3.158ValLys: 3.158 ± 0.726
6.235ValLeu: 6.235 ± 0.827
1.457ValMet: 1.457 ± 0.358
2.51ValAsn: 2.51 ± 0.474
3.158ValPro: 3.158 ± 0.496
3.887ValGln: 3.887 ± 0.79
4.211ValArg: 4.211 ± 0.438
5.182ValSer: 5.182 ± 0.83
3.806ValThr: 3.806 ± 0.58
5.425ValVal: 5.425 ± 0.547
0.81ValTrp: 0.81 ± 0.234
2.996ValTyr: 2.996 ± 0.482
0.0ValXaa: 0.0 ± 0.0
Trp
0.972TrpAla: 0.972 ± 0.313
0.324TrpCys: 0.324 ± 0.15
0.81TrpAsp: 0.81 ± 0.259
1.296TrpGlu: 1.296 ± 0.28
0.81TrpPhe: 0.81 ± 0.321
0.81TrpGly: 0.81 ± 0.24
0.324TrpHis: 0.324 ± 0.196
0.405TrpIle: 0.405 ± 0.197
0.405TrpLys: 0.405 ± 0.214
1.377TrpLeu: 1.377 ± 0.222
0.081TrpMet: 0.081 ± 0.077
0.729TrpAsn: 0.729 ± 0.208
0.405TrpPro: 0.405 ± 0.181
0.567TrpGln: 0.567 ± 0.184
1.053TrpArg: 1.053 ± 0.246
0.648TrpSer: 0.648 ± 0.237
1.053TrpThr: 1.053 ± 0.246
1.377TrpVal: 1.377 ± 0.297
0.324TrpTrp: 0.324 ± 0.133
0.81TrpTyr: 0.81 ± 0.266
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.186TyrAla: 2.186 ± 0.498
0.486TyrCys: 0.486 ± 0.232
2.753TyrAsp: 2.753 ± 0.522
2.105TyrGlu: 2.105 ± 0.526
1.377TyrPhe: 1.377 ± 0.276
3.077TyrGly: 3.077 ± 0.575
0.486TyrHis: 0.486 ± 0.157
2.186TyrIle: 2.186 ± 0.425
1.619TyrLys: 1.619 ± 0.405
3.968TyrLeu: 3.968 ± 0.561
0.729TyrMet: 0.729 ± 0.293
1.134TyrAsn: 1.134 ± 0.283
1.538TyrPro: 1.538 ± 0.255
2.267TyrGln: 2.267 ± 0.504
2.591TyrArg: 2.591 ± 0.497
2.996TyrSer: 2.996 ± 0.289
2.915TyrThr: 2.915 ± 0.455
2.186TyrVal: 2.186 ± 0.53
0.729TyrTrp: 0.729 ± 0.258
1.215TyrTyr: 1.215 ± 0.278
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 35 proteins (12351 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski