Amino acid dipeptide frequency for Klebsiella phage ST974-OXA48phi18.2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
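As a rough illustration of what such a statistic involves, the sketch below counts overlapping adjacent pairs within each protein and normalizes by the total number of pairs. The function name `dipeptide_permille`, the one-letter input sequences, and the counting convention (overlapping windows, pairs never spanning two proteins) are assumptions made for this example; the page does not state how Proteome-pI computes its values.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences.

    Assumption: overlapping pairs are counted inside each sequence only,
    and frequencies are normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # A sequence of length n contributes n - 1 overlapping pairs.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not taken from the phage proteome): pairs are MA, AA, AK, AA, AL.
freqs = dipeptide_permille(["MAAK", "AAL"])
print(freqs["AA"])  # 400.0, i.e. 2 of the 5 pairs
```

The table on this page uses three-letter codes (AlaAla rather than AA); mapping between the two notations is left out to keep the sketch short.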

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
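The arithmetic behind the expected value can be checked in a few lines. The sketch below computes the uniform expectation for 20 and for 21 residue types and labels three values from the table as over- or underrepresented. The `classify` helper and the use of the uniform 2.5‰ value as the cut-off are assumptions for illustration; the page does not say exactly which threshold drives the red and blue marking.

```python
# Uniform expectation: every ordered pair equally likely.
uniform_permille = 1000.0 / 20 ** 2   # 400 dipeptides -> 2.5 per mille
uniform_with_xaa = 1000.0 / 21 ** 2   # 441 dipeptides -> about 2.27 per mille


def classify(observed_permille, expected_permille=uniform_permille):
    """Label a dipeptide relative to the expected frequency (hypothetical helper)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "expected"


# Observed values in per mille, copied from the table on this page.
examples = {"AlaAla": 9.467, "CysCys": 0.193, "AspPhe": 2.576}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```

Under this assumption AlaAla and AspPhe come out as overrepresented and CysCys as underrepresented.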

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 9.467 ± 1.039 | 0.902 ± 0.242 | 5.86 ± 0.651 | 5.667 ± 0.771 | 3.349 ± 0.479 | 6.826 ± 0.65 | 1.481 ± 0.293 | 7.148 ± 0.674 | 3.606 ± 0.385 | 9.531 ± 0.905 | 3.22 ± 0.406 | 3.993 ± 0.482 | 3.928 ± 0.477 | 4.637 ± 0.75 | 5.088 ± 0.586 | 6.698 ± 0.572 | 5.088 ± 0.806 | 6.118 ± 0.627 | 1.996 ± 0.346 | 2.318 ± 0.5 | 0.0 ± 0.0
Cys | 0.966 ± 0.261 | 0.193 ± 0.111 | 0.644 ± 0.225 | 0.644 ± 0.219 | 0.258 ± 0.121 | 1.03 ± 0.248 | 0.258 ± 0.142 | 0.644 ± 0.243 | 0.837 ± 0.268 | 0.708 ± 0.19 | 0.193 ± 0.098 | 0.837 ± 0.278 | 0.837 ± 0.165 | 0.58 ± 0.196 | 1.095 ± 0.266 | 0.708 ± 0.181 | 0.451 ± 0.174 | 0.515 ± 0.153 | 0.258 ± 0.154 | 0.451 ± 0.169 | 0.0 ± 0.0
Asp | 5.216 ± 0.695 | 0.902 ± 0.272 | 3.8 ± 0.611 | 4.894 ± 0.545 | 2.576 ± 0.331 | 4.959 ± 0.553 | 0.902 ± 0.248 | 3.284 ± 0.421 | 2.834 ± 0.548 | 3.864 ± 0.619 | 1.224 ± 0.342 | 2.318 ± 0.366 | 2.769 ± 0.403 | 2.318 ± 0.467 | 2.834 ± 0.393 | 2.898 ± 0.414 | 2.705 ± 0.369 | 4.25 ± 0.541 | 1.159 ± 0.277 | 2.447 ± 0.474 | 0.0 ± 0.0
Glu | 5.86 ± 0.684 | 0.837 ± 0.262 | 2.512 ± 0.494 | 3.413 ± 0.487 | 1.674 ± 0.296 | 4.315 ± 0.539 | 0.644 ± 0.167 | 4.25 ± 0.502 | 3.671 ± 0.535 | 6.247 ± 0.538 | 1.481 ± 0.335 | 2.64 ± 0.47 | 1.803 ± 0.371 | 2.962 ± 0.535 | 3.928 ± 0.471 | 2.898 ± 0.454 | 3.091 ± 0.675 | 3.542 ± 0.475 | 1.352 ± 0.334 | 2.254 ± 0.589 | 0.0 ± 0.0
Phe | 3.735 ± 0.484 | 0.708 ± 0.197 | 2.705 ± 0.394 | 1.095 ± 0.258 | 1.352 ± 0.295 | 2.576 ± 0.493 | 0.644 ± 0.167 | 1.352 ± 0.309 | 1.352 ± 0.33 | 3.027 ± 0.438 | 0.966 ± 0.234 | 1.417 ± 0.238 | 1.932 ± 0.381 | 1.546 ± 0.345 | 1.996 ± 0.409 | 2.125 ± 0.474 | 2.576 ± 0.528 | 2.19 ± 0.428 | 0.515 ± 0.166 | 1.417 ± 0.255 | 0.0 ± 0.0
Gly | 6.118 ± 0.729 | 0.837 ± 0.255 | 4.186 ± 0.605 | 4.572 ± 0.522 | 3.22 ± 0.429 | 5.152 ± 0.811 | 1.546 ± 0.292 | 3.993 ± 0.474 | 5.023 ± 0.526 | 6.569 ± 0.624 | 2.125 ± 0.279 | 2.512 ± 0.359 | 1.868 ± 0.407 | 2.576 ± 0.392 | 3.735 ± 0.478 | 4.637 ± 0.642 | 3.928 ± 0.691 | 6.118 ± 0.609 | 1.095 ± 0.311 | 2.64 ± 0.427 | 0.0 ± 0.0
His | 1.352 ± 0.25 | 0.258 ± 0.131 | 1.159 ± 0.296 | 0.773 ± 0.212 | 1.288 ± 0.322 | 1.224 ± 0.295 | 0.58 ± 0.2 | 0.58 ± 0.199 | 0.902 ± 0.265 | 2.061 ± 0.393 | 0.193 ± 0.121 | 0.58 ± 0.233 | 1.224 ± 0.258 | 0.708 ± 0.187 | 1.095 ± 0.306 | 0.837 ± 0.251 | 0.451 ± 0.174 | 1.481 ± 0.354 | 0.386 ± 0.221 | 0.966 ± 0.265 | 0.0 ± 0.0
Ile | 6.376 ± 0.651 | 0.966 ± 0.234 | 3.8 ± 0.509 | 3.027 ± 0.398 | 1.03 ± 0.288 | 3.735 ± 0.494 | 0.708 ± 0.158 | 3.864 ± 0.537 | 3.156 ± 0.444 | 4.057 ± 0.542 | 0.708 ± 0.212 | 2.19 ± 0.367 | 2.318 ± 0.36 | 1.996 ± 0.357 | 3.735 ± 0.451 | 3.606 ± 0.478 | 4.572 ± 0.691 | 3.542 ± 0.554 | 0.58 ± 0.177 | 1.803 ± 0.376 | 0.0 ± 0.0
Lys | 5.281 ± 0.842 | 0.902 ± 0.267 | 2.19 ± 0.405 | 3.091 ± 0.464 | 1.739 ± 0.337 | 3.027 ± 0.482 | 1.674 ± 0.337 | 2.447 ± 0.481 | 3.349 ± 0.621 | 4.186 ± 0.488 | 1.352 ± 0.26 | 1.546 ± 0.315 | 3.22 ± 0.46 | 2.125 ± 0.41 | 3.156 ± 0.49 | 3.156 ± 0.493 | 3.993 ± 0.677 | 3.928 ± 0.481 | 0.58 ± 0.172 | 1.803 ± 0.376 | 0.0 ± 0.0
Leu | 8.179 ± 0.719 | 0.902 ± 0.27 | 5.925 ± 0.583 | 5.474 ± 0.735 | 3.349 ± 0.492 | 7.02 ± 0.83 | 1.674 ± 0.309 | 4.959 ± 0.714 | 4.572 ± 0.684 | 9.016 ± 1.167 | 1.996 ± 0.325 | 4.25 ± 0.435 | 5.088 ± 0.633 | 3.542 ± 0.555 | 4.894 ± 0.562 | 6.44 ± 0.704 | 6.182 ± 0.71 | 5.023 ± 0.656 | 1.095 ± 0.3 | 2.125 ± 0.379 | 0.0 ± 0.0
Met | 2.64 ± 0.332 | 0.129 ± 0.128 | 1.159 ± 0.254 | 0.837 ± 0.207 | 0.708 ± 0.196 | 1.224 ± 0.373 | 0.386 ± 0.165 | 1.224 ± 0.269 | 1.674 ± 0.329 | 2.19 ± 0.421 | 0.451 ± 0.166 | 0.708 ± 0.211 | 1.417 ± 0.312 | 1.288 ± 0.372 | 1.674 ± 0.346 | 2.19 ± 0.439 | 2.254 ± 0.343 | 1.868 ± 0.422 | 0.386 ± 0.163 | 0.644 ± 0.157 | 0.0 ± 0.0
Asn | 3.542 ± 0.504 | 0.451 ± 0.152 | 2.576 ± 0.444 | 1.932 ± 0.385 | 0.773 ± 0.243 | 3.993 ± 0.529 | 0.837 ± 0.255 | 2.254 ± 0.348 | 1.868 ± 0.384 | 3.349 ± 0.388 | 0.966 ± 0.279 | 1.417 ± 0.284 | 2.254 ± 0.408 | 2.061 ± 0.384 | 1.417 ± 0.348 | 1.932 ± 0.407 | 1.61 ± 0.309 | 2.254 ± 0.365 | 0.322 ± 0.136 | 1.095 ± 0.264 | 0.0 ± 0.0
Pro | 4.894 ± 0.69 | 0.193 ± 0.111 | 4.057 ± 0.588 | 3.8 ± 0.52 | 1.739 ± 0.399 | 3.671 ± 0.591 | 1.224 ± 0.283 | 1.417 ± 0.261 | 1.803 ± 0.336 | 3.542 ± 0.519 | 0.708 ± 0.203 | 1.159 ± 0.243 | 1.674 ± 0.345 | 1.674 ± 0.408 | 1.996 ± 0.384 | 2.383 ± 0.42 | 2.576 ± 0.372 | 3.993 ± 0.509 | 0.644 ± 0.204 | 1.352 ± 0.299 | 0.0 ± 0.0
Gln | 4.186 ± 0.553 | 0.129 ± 0.093 | 1.932 ± 0.341 | 2.576 ± 0.491 | 1.03 ± 0.293 | 2.125 ± 0.313 | 0.644 ± 0.211 | 2.962 ± 0.535 | 2.64 ± 0.585 | 4.508 ± 0.589 | 1.417 ± 0.277 | 1.159 ± 0.281 | 2.125 ± 0.414 | 3.156 ± 0.805 | 3.413 ± 0.454 | 2.576 ± 0.519 | 2.64 ± 0.411 | 2.769 ± 0.541 | 0.386 ± 0.154 | 1.224 ± 0.269 | 0.0 ± 0.0
Arg | 6.182 ± 0.749 | 1.03 ± 0.207 | 2.512 ± 0.367 | 3.22 ± 0.417 | 1.674 ± 0.32 | 3.027 ± 0.377 | 1.417 ± 0.318 | 3.542 ± 0.469 | 3.156 ± 0.459 | 7.084 ± 0.878 | 1.868 ± 0.384 | 2.061 ± 0.337 | 1.674 ± 0.347 | 3.027 ± 0.555 | 4.637 ± 0.778 | 2.962 ± 0.39 | 2.769 ± 0.376 | 3.864 ± 0.478 | 0.966 ± 0.253 | 2.898 ± 0.472 | 0.0 ± 0.0
Ser | 5.152 ± 0.801 | 0.708 ± 0.185 | 3.671 ± 0.418 | 3.478 ± 0.481 | 2.447 ± 0.5 | 5.538 ± 0.695 | 0.837 ± 0.179 | 2.898 ± 0.416 | 3.349 ± 0.436 | 5.41 ± 0.581 | 2.512 ± 0.345 | 2.061 ± 0.291 | 2.254 ± 0.397 | 2.061 ± 0.374 | 3.735 ± 0.537 | 4.25 ± 0.655 | 4.315 ± 0.597 | 4.572 ± 0.575 | 1.159 ± 0.301 | 1.803 ± 0.414 | 0.0 ± 0.0
Thr | 7.47 ± 0.797 | 0.451 ± 0.174 | 3.606 ± 0.395 | 4.379 ± 0.48 | 2.576 ± 0.526 | 5.474 ± 0.849 | 0.902 ± 0.199 | 2.705 ± 0.431 | 2.19 ± 0.339 | 5.732 ± 0.558 | 0.966 ± 0.248 | 1.417 ± 0.311 | 2.834 ± 0.531 | 2.318 ± 0.437 | 2.898 ± 0.361 | 3.993 ± 0.434 | 3.091 ± 0.518 | 4.701 ± 0.702 | 1.095 ± 0.267 | 1.288 ± 0.316 | 0.0 ± 0.0
Val | 6.182 ± 0.662 | 0.773 ± 0.214 | 3.413 ± 0.501 | 4.186 ± 0.579 | 2.318 ± 0.416 | 4.766 ± 0.662 | 0.966 ± 0.244 | 4.057 ± 0.617 | 4.572 ± 0.589 | 6.569 ± 0.815 | 1.481 ± 0.287 | 3.091 ± 0.506 | 2.512 ± 0.425 | 2.318 ± 0.446 | 3.993 ± 0.461 | 4.959 ± 0.608 | 5.152 ± 0.82 | 4.379 ± 0.548 | 1.159 ± 0.349 | 1.674 ± 0.26 | 0.0 ± 0.0
Trp | 1.288 ± 0.296 | 0.258 ± 0.124 | 0.708 ± 0.197 | 0.837 ± 0.237 | 1.095 ± 0.277 | 0.773 ± 0.246 | 0.129 ± 0.09 | 0.644 ± 0.254 | 0.773 ± 0.251 | 1.481 ± 0.379 | 0.322 ± 0.151 | 0.58 ± 0.197 | 0.773 ± 0.22 | 0.902 ± 0.277 | 1.481 ± 0.355 | 0.837 ± 0.215 | 0.966 ± 0.29 | 1.739 ± 0.295 | 0.129 ± 0.085 | 0.193 ± 0.112 | 0.0 ± 0.0
Tyr | 2.834 ± 0.415 | 0.58 ± 0.192 | 1.739 ± 0.336 | 1.481 ± 0.291 | 1.03 ± 0.313 | 1.996 ± 0.467 | 0.644 ± 0.347 | 1.546 ± 0.295 | 1.417 ± 0.333 | 2.576 ± 0.329 | 0.837 ± 0.262 | 1.03 ± 0.301 | 1.996 ± 0.488 | 1.803 ± 0.268 | 2.769 ± 0.471 | 2.125 ± 0.341 | 1.674 ± 0.322 | 1.674 ± 0.25 | 0.644 ± 0.204 | 0.708 ± 0.24 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 80 proteins (15529 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
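A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function name `bootstrap_error`, the fixed seed, and the choice of the standard deviation as the reported error are assumptions for this example; the page only states that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`.

    Assumption: proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    # Count overlapping pairs once per protein so each resample is cheap.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in sequences
    ]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not taken from the phage proteome.
mean, err = bootstrap_error(["MAAK", "AAL", "GGAA"], "AA")
print(f"{mean:.3f} ± {err:.3f}")
```

Resampling whole proteins keeps the pairs within a protein together, so the estimate reflects variation between proteins rather than between individual positions.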

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski