Amino acid dipepetide frequency for Klebsiella phage ST11-VIM1phi8.2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.583AlaAla: 8.583 ± 1.12
0.866AlaCys: 0.866 ± 0.283
6.203AlaAsp: 6.203 ± 0.618
5.482AlaGlu: 5.482 ± 0.765
3.174AlaPhe: 3.174 ± 0.628
7.069AlaGly: 7.069 ± 0.744
1.659AlaHis: 1.659 ± 0.359
6.78AlaIle: 6.78 ± 0.69
4.111AlaLys: 4.111 ± 0.57
9.16AlaLeu: 9.16 ± 0.89
3.102AlaMet: 3.102 ± 0.445
3.823AlaAsn: 3.823 ± 0.563
3.679AlaPro: 3.679 ± 0.538
5.121AlaGln: 5.121 ± 0.686
5.41AlaArg: 5.41 ± 0.672
6.492AlaSer: 6.492 ± 0.664
4.833AlaThr: 4.833 ± 0.942
5.77AlaVal: 5.77 ± 0.607
2.236AlaTrp: 2.236 ± 0.39
3.029AlaTyr: 3.029 ± 0.562
0.0AlaXaa: 0.0 ± 0.0
Cys
0.938CysAla: 0.938 ± 0.25
0.072CysCys: 0.072 ± 0.076
0.649CysAsp: 0.649 ± 0.238
0.505CysGlu: 0.505 ± 0.232
0.289CysPhe: 0.289 ± 0.146
1.082CysGly: 1.082 ± 0.289
0.216CysHis: 0.216 ± 0.162
0.505CysIle: 0.505 ± 0.213
0.361CysLys: 0.361 ± 0.164
0.938CysLeu: 0.938 ± 0.251
0.072CysMet: 0.072 ± 0.085
0.505CysAsn: 0.505 ± 0.188
0.938CysPro: 0.938 ± 0.217
0.361CysGln: 0.361 ± 0.168
1.298CysArg: 1.298 ± 0.222
1.01CysSer: 1.01 ± 0.258
0.577CysThr: 0.577 ± 0.243
0.505CysVal: 0.505 ± 0.173
0.216CysTrp: 0.216 ± 0.132
0.144CysTyr: 0.144 ± 0.089
0.0CysXaa: 0.0 ± 0.0
Asp
5.193AspAla: 5.193 ± 0.57
0.289AspCys: 0.289 ± 0.142
3.39AspAsp: 3.39 ± 0.579
3.895AspGlu: 3.895 ± 0.511
2.236AspPhe: 2.236 ± 0.372
4.616AspGly: 4.616 ± 0.616
1.443AspHis: 1.443 ± 0.335
3.39AspIle: 3.39 ± 0.483
3.102AspLys: 3.102 ± 0.546
4.688AspLeu: 4.688 ± 0.541
1.154AspMet: 1.154 ± 0.247
2.092AspAsn: 2.092 ± 0.382
2.669AspPro: 2.669 ± 0.475
1.803AspGln: 1.803 ± 0.417
3.246AspArg: 3.246 ± 0.447
3.102AspSer: 3.102 ± 0.422
2.957AspThr: 2.957 ± 0.47
4.183AspVal: 4.183 ± 0.519
1.154AspTrp: 1.154 ± 0.294
2.308AspTyr: 2.308 ± 0.479
0.0AspXaa: 0.0 ± 0.0
Glu
5.987GluAla: 5.987 ± 0.72
0.577GluCys: 0.577 ± 0.245
1.731GluAsp: 1.731 ± 0.395
2.957GluGlu: 2.957 ± 0.508
2.236GluPhe: 2.236 ± 0.364
3.823GluGly: 3.823 ± 0.532
0.505GluHis: 0.505 ± 0.174
3.967GluIle: 3.967 ± 0.519
3.606GluLys: 3.606 ± 0.753
5.842GluLeu: 5.842 ± 0.727
1.587GluMet: 1.587 ± 0.307
1.947GluAsn: 1.947 ± 0.421
1.803GluPro: 1.803 ± 0.397
2.741GluGln: 2.741 ± 0.579
3.823GluArg: 3.823 ± 0.637
3.679GluSer: 3.679 ± 0.58
2.741GluThr: 2.741 ± 0.415
3.39GluVal: 3.39 ± 0.471
1.443GluTrp: 1.443 ± 0.292
1.659GluTyr: 1.659 ± 0.321
0.0GluXaa: 0.0 ± 0.0
Phe
3.029PheAla: 3.029 ± 0.434
0.793PheCys: 0.793 ± 0.207
2.597PheAsp: 2.597 ± 0.485
1.515PheGlu: 1.515 ± 0.384
1.515PhePhe: 1.515 ± 0.452
2.741PheGly: 2.741 ± 0.445
0.577PheHis: 0.577 ± 0.202
1.226PheIle: 1.226 ± 0.266
1.443PheLys: 1.443 ± 0.333
3.029PheLeu: 3.029 ± 0.476
0.866PheMet: 0.866 ± 0.213
1.947PheAsn: 1.947 ± 0.375
1.587PhePro: 1.587 ± 0.322
1.37PheGln: 1.37 ± 0.284
1.875PheArg: 1.875 ± 0.499
2.525PheSer: 2.525 ± 0.511
3.534PheThr: 3.534 ± 0.495
1.947PheVal: 1.947 ± 0.425
0.505PheTrp: 0.505 ± 0.221
1.37PheTyr: 1.37 ± 0.293
0.0PheXaa: 0.0 ± 0.0
Gly
6.347GlyAla: 6.347 ± 0.968
1.01GlyCys: 1.01 ± 0.3
4.256GlyAsp: 4.256 ± 0.661
4.4GlyGlu: 4.4 ± 0.593
3.102GlyPhe: 3.102 ± 0.614
4.977GlyGly: 4.977 ± 0.847
1.082GlyHis: 1.082 ± 0.303
4.833GlyIle: 4.833 ± 0.588
4.761GlyLys: 4.761 ± 0.592
6.708GlyLeu: 6.708 ± 0.695
1.803GlyMet: 1.803 ± 0.325
2.957GlyAsn: 2.957 ± 0.504
2.236GlyPro: 2.236 ± 0.427
2.669GlyGln: 2.669 ± 0.542
2.669GlyArg: 2.669 ± 0.342
4.833GlySer: 4.833 ± 0.717
3.751GlyThr: 3.751 ± 0.703
5.77GlyVal: 5.77 ± 0.689
1.226GlyTrp: 1.226 ± 0.304
2.308GlyTyr: 2.308 ± 0.497
0.0GlyXaa: 0.0 ± 0.0
His
1.298HisAla: 1.298 ± 0.281
0.289HisCys: 0.289 ± 0.129
0.793HisAsp: 0.793 ± 0.243
0.793HisGlu: 0.793 ± 0.235
0.866HisPhe: 0.866 ± 0.311
1.082HisGly: 1.082 ± 0.22
0.361HisHis: 0.361 ± 0.175
0.649HisIle: 0.649 ± 0.224
0.649HisLys: 0.649 ± 0.215
1.587HisLeu: 1.587 ± 0.35
0.433HisMet: 0.433 ± 0.163
0.721HisAsn: 0.721 ± 0.239
0.938HisPro: 0.938 ± 0.242
0.793HisGln: 0.793 ± 0.228
0.938HisArg: 0.938 ± 0.25
1.082HisSer: 1.082 ± 0.305
0.505HisThr: 0.505 ± 0.218
1.226HisVal: 1.226 ± 0.311
0.577HisTrp: 0.577 ± 0.255
0.505HisTyr: 0.505 ± 0.191
0.0HisXaa: 0.0 ± 0.0
Ile
7.862IleAla: 7.862 ± 0.881
1.226IleCys: 1.226 ± 0.334
4.183IleAsp: 4.183 ± 0.649
3.029IleGlu: 3.029 ± 0.477
1.37IlePhe: 1.37 ± 0.366
3.751IleGly: 3.751 ± 0.485
0.505IleHis: 0.505 ± 0.152
4.544IleIle: 4.544 ± 0.615
3.102IleLys: 3.102 ± 0.518
3.606IleLeu: 3.606 ± 0.582
0.649IleMet: 0.649 ± 0.233
3.029IleAsn: 3.029 ± 0.486
2.38IlePro: 2.38 ± 0.385
1.947IleGln: 1.947 ± 0.342
3.751IleArg: 3.751 ± 0.569
3.462IleSer: 3.462 ± 0.451
4.905IleThr: 4.905 ± 0.691
3.462IleVal: 3.462 ± 0.601
0.938IleTrp: 0.938 ± 0.326
2.092IleTyr: 2.092 ± 0.415
0.0IleXaa: 0.0 ± 0.0
Lys
4.905LysAla: 4.905 ± 0.701
0.938LysCys: 0.938 ± 0.264
3.102LysAsp: 3.102 ± 0.586
3.102LysGlu: 3.102 ± 0.563
1.731LysPhe: 1.731 ± 0.327
3.246LysGly: 3.246 ± 0.625
0.938LysHis: 0.938 ± 0.214
2.741LysIle: 2.741 ± 0.48
3.174LysLys: 3.174 ± 0.588
3.895LysLeu: 3.895 ± 0.552
1.515LysMet: 1.515 ± 0.337
1.443LysAsn: 1.443 ± 0.364
2.452LysPro: 2.452 ± 0.465
2.597LysGln: 2.597 ± 0.555
3.318LysArg: 3.318 ± 0.608
3.679LysSer: 3.679 ± 0.585
3.967LysThr: 3.967 ± 0.728
3.39LysVal: 3.39 ± 0.557
0.361LysTrp: 0.361 ± 0.157
1.37LysTyr: 1.37 ± 0.328
0.0LysXaa: 0.0 ± 0.0
Leu
9.16LeuAla: 9.16 ± 0.906
1.226LeuCys: 1.226 ± 0.295
5.41LeuAsp: 5.41 ± 0.559
4.328LeuGlu: 4.328 ± 0.437
3.246LeuPhe: 3.246 ± 0.602
5.626LeuGly: 5.626 ± 0.765
1.226LeuHis: 1.226 ± 0.292
7.141LeuIle: 7.141 ± 1.208
4.328LeuLys: 4.328 ± 0.927
9.665LeuLeu: 9.665 ± 1.192
1.803LeuMet: 1.803 ± 0.44
3.823LeuAsn: 3.823 ± 0.489
5.338LeuPro: 5.338 ± 0.772
4.256LeuGln: 4.256 ± 0.644
4.4LeuArg: 4.4 ± 0.543
6.42LeuSer: 6.42 ± 0.776
5.554LeuThr: 5.554 ± 0.71
6.492LeuVal: 6.492 ± 0.831
0.793LeuTrp: 0.793 ± 0.274
1.803LeuTyr: 1.803 ± 0.372
0.0LeuXaa: 0.0 ± 0.0
Met
2.236MetAla: 2.236 ± 0.367
0.216MetCys: 0.216 ± 0.158
1.01MetAsp: 1.01 ± 0.288
0.866MetGlu: 0.866 ± 0.267
0.721MetPhe: 0.721 ± 0.205
0.793MetGly: 0.793 ± 0.314
0.577MetHis: 0.577 ± 0.188
1.443MetIle: 1.443 ± 0.277
1.731MetLys: 1.731 ± 0.431
2.092MetLeu: 2.092 ± 0.43
0.649MetMet: 0.649 ± 0.214
1.01MetAsn: 1.01 ± 0.332
1.298MetPro: 1.298 ± 0.331
0.866MetGln: 0.866 ± 0.346
1.298MetArg: 1.298 ± 0.324
2.38MetSer: 2.38 ± 0.565
2.02MetThr: 2.02 ± 0.36
1.947MetVal: 1.947 ± 0.394
0.433MetTrp: 0.433 ± 0.196
0.721MetTyr: 0.721 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
3.39AsnAla: 3.39 ± 0.442
0.216AsnCys: 0.216 ± 0.123
2.308AsnAsp: 2.308 ± 0.442
2.741AsnGlu: 2.741 ± 0.339
0.433AsnPhe: 0.433 ± 0.172
3.967AsnGly: 3.967 ± 0.543
0.938AsnHis: 0.938 ± 0.239
2.525AsnIle: 2.525 ± 0.44
1.947AsnLys: 1.947 ± 0.413
3.751AsnLeu: 3.751 ± 0.618
0.721AsnMet: 0.721 ± 0.207
1.587AsnAsn: 1.587 ± 0.45
1.803AsnPro: 1.803 ± 0.275
1.947AsnGln: 1.947 ± 0.548
2.38AsnArg: 2.38 ± 0.474
1.875AsnSer: 1.875 ± 0.456
2.02AsnThr: 2.02 ± 0.402
2.813AsnVal: 2.813 ± 0.468
0.433AsnTrp: 0.433 ± 0.165
1.443AsnTyr: 1.443 ± 0.292
0.0AsnXaa: 0.0 ± 0.0
Pro
4.183ProAla: 4.183 ± 0.549
0.289ProCys: 0.289 ± 0.148
3.174ProAsp: 3.174 ± 0.462
2.885ProGlu: 2.885 ± 0.453
1.731ProPhe: 1.731 ± 0.369
3.751ProGly: 3.751 ± 0.671
1.298ProHis: 1.298 ± 0.298
1.154ProIle: 1.154 ± 0.256
1.947ProLys: 1.947 ± 0.459
3.462ProLeu: 3.462 ± 0.621
0.649ProMet: 0.649 ± 0.202
1.587ProAsn: 1.587 ± 0.352
1.298ProPro: 1.298 ± 0.376
1.515ProGln: 1.515 ± 0.368
2.236ProArg: 2.236 ± 0.467
2.02ProSer: 2.02 ± 0.421
2.669ProThr: 2.669 ± 0.489
3.823ProVal: 3.823 ± 0.461
0.793ProTrp: 0.793 ± 0.239
1.37ProTyr: 1.37 ± 0.316
0.0ProXaa: 0.0 ± 0.0
Gln
4.111GlnAla: 4.111 ± 0.651
0.289GlnCys: 0.289 ± 0.155
1.875GlnAsp: 1.875 ± 0.408
2.164GlnGlu: 2.164 ± 0.521
1.226GlnPhe: 1.226 ± 0.378
2.452GlnGly: 2.452 ± 0.365
0.577GlnHis: 0.577 ± 0.172
2.741GlnIle: 2.741 ± 0.41
2.741GlnLys: 2.741 ± 0.534
5.049GlnLeu: 5.049 ± 0.831
1.443GlnMet: 1.443 ± 0.35
1.154GlnAsn: 1.154 ± 0.37
1.587GlnPro: 1.587 ± 0.319
3.174GlnGln: 3.174 ± 0.938
3.029GlnArg: 3.029 ± 0.566
2.813GlnSer: 2.813 ± 0.49
2.813GlnThr: 2.813 ± 0.423
2.38GlnVal: 2.38 ± 0.443
0.433GlnTrp: 0.433 ± 0.177
1.37GlnTyr: 1.37 ± 0.288
0.0GlnXaa: 0.0 ± 0.0
Arg
4.833ArgAla: 4.833 ± 0.642
0.793ArgCys: 0.793 ± 0.219
3.318ArgAsp: 3.318 ± 0.607
3.029ArgGlu: 3.029 ± 0.59
1.947ArgPhe: 1.947 ± 0.317
3.39ArgGly: 3.39 ± 0.497
1.01ArgHis: 1.01 ± 0.258
3.39ArgIle: 3.39 ± 0.49
3.029ArgLys: 3.029 ± 0.472
6.42ArgLeu: 6.42 ± 0.745
2.02ArgMet: 2.02 ± 0.373
2.813ArgAsn: 2.813 ± 0.427
1.587ArgPro: 1.587 ± 0.374
3.174ArgGln: 3.174 ± 0.578
5.41ArgArg: 5.41 ± 1.046
3.751ArgSer: 3.751 ± 0.458
3.102ArgThr: 3.102 ± 0.41
3.823ArgVal: 3.823 ± 0.489
1.154ArgTrp: 1.154 ± 0.277
2.669ArgTyr: 2.669 ± 0.603
0.0ArgXaa: 0.0 ± 0.0
Ser
5.554SerAla: 5.554 ± 0.755
0.649SerCys: 0.649 ± 0.183
3.751SerAsp: 3.751 ± 0.481
3.895SerGlu: 3.895 ± 0.583
2.885SerPhe: 2.885 ± 0.407
6.203SerGly: 6.203 ± 0.846
0.938SerHis: 0.938 ± 0.26
3.606SerIle: 3.606 ± 0.484
2.669SerLys: 2.669 ± 0.428
5.41SerLeu: 5.41 ± 0.708
2.02SerMet: 2.02 ± 0.422
2.525SerAsn: 2.525 ± 0.385
2.597SerPro: 2.597 ± 0.394
2.308SerGln: 2.308 ± 0.414
3.823SerArg: 3.823 ± 0.529
3.967SerSer: 3.967 ± 0.683
4.761SerThr: 4.761 ± 0.718
4.544SerVal: 4.544 ± 0.546
1.298SerTrp: 1.298 ± 0.306
2.02SerTyr: 2.02 ± 0.419
0.0SerXaa: 0.0 ± 0.0
Thr
8.656ThrAla: 8.656 ± 0.905
0.144ThrCys: 0.144 ± 0.089
3.39ThrAsp: 3.39 ± 0.501
4.183ThrGlu: 4.183 ± 0.556
2.957ThrPhe: 2.957 ± 0.642
5.554ThrGly: 5.554 ± 0.887
0.721ThrHis: 0.721 ± 0.21
3.102ThrIle: 3.102 ± 0.53
2.525ThrLys: 2.525 ± 0.394
6.492ThrLeu: 6.492 ± 0.602
0.866ThrMet: 0.866 ± 0.224
1.37ThrAsn: 1.37 ± 0.274
2.669ThrPro: 2.669 ± 0.542
2.38ThrGln: 2.38 ± 0.408
3.318ThrArg: 3.318 ± 0.554
3.462ThrSer: 3.462 ± 0.597
3.39ThrThr: 3.39 ± 0.674
4.616ThrVal: 4.616 ± 0.664
1.298ThrTrp: 1.298 ± 0.309
1.587ThrTyr: 1.587 ± 0.331
0.0ThrXaa: 0.0 ± 0.0
Val
6.203ValAla: 6.203 ± 0.549
0.577ValCys: 0.577 ± 0.229
3.462ValAsp: 3.462 ± 0.567
4.183ValGlu: 4.183 ± 0.656
2.02ValPhe: 2.02 ± 0.37
4.905ValGly: 4.905 ± 0.59
0.938ValHis: 0.938 ± 0.267
4.039ValIle: 4.039 ± 0.613
4.544ValLys: 4.544 ± 0.605
5.265ValLeu: 5.265 ± 0.774
1.803ValMet: 1.803 ± 0.387
3.029ValAsn: 3.029 ± 0.455
2.38ValPro: 2.38 ± 0.42
1.875ValGln: 1.875 ± 0.364
3.606ValArg: 3.606 ± 0.539
5.698ValSer: 5.698 ± 0.534
5.482ValThr: 5.482 ± 0.794
4.183ValVal: 4.183 ± 0.544
1.01ValTrp: 1.01 ± 0.319
1.659ValTyr: 1.659 ± 0.315
0.0ValXaa: 0.0 ± 0.0
Trp
1.154TrpAla: 1.154 ± 0.311
0.289TrpCys: 0.289 ± 0.14
0.793TrpAsp: 0.793 ± 0.231
0.938TrpGlu: 0.938 ± 0.262
1.154TrpPhe: 1.154 ± 0.334
0.793TrpGly: 0.793 ± 0.264
0.216TrpHis: 0.216 ± 0.123
0.433TrpIle: 0.433 ± 0.182
0.721TrpLys: 0.721 ± 0.245
2.525TrpLeu: 2.525 ± 0.484
0.361TrpMet: 0.361 ± 0.195
0.938TrpAsn: 0.938 ± 0.28
0.721TrpPro: 0.721 ± 0.201
1.01TrpGln: 1.01 ± 0.28
1.515TrpArg: 1.515 ± 0.396
1.082TrpSer: 1.082 ± 0.265
0.938TrpThr: 0.938 ± 0.254
1.298TrpVal: 1.298 ± 0.302
0.072TrpTrp: 0.072 ± 0.075
0.216TrpTyr: 0.216 ± 0.121
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.957TyrAla: 2.957 ± 0.428
0.433TyrCys: 0.433 ± 0.164
1.515TyrAsp: 1.515 ± 0.427
1.154TyrGlu: 1.154 ± 0.269
1.154TyrPhe: 1.154 ± 0.332
1.875TyrGly: 1.875 ± 0.438
0.361TyrHis: 0.361 ± 0.142
1.587TyrIle: 1.587 ± 0.386
1.443TyrLys: 1.443 ± 0.349
2.452TyrLeu: 2.452 ± 0.423
0.721TyrMet: 0.721 ± 0.267
0.938TyrAsn: 0.938 ± 0.235
1.875TyrPro: 1.875 ± 0.529
1.515TyrGln: 1.515 ± 0.301
3.318TyrArg: 3.318 ± 0.503
2.092TyrSer: 2.092 ± 0.439
2.164TyrThr: 2.164 ± 0.402
1.298TyrVal: 1.298 ± 0.295
0.793TyrTrp: 0.793 ± 0.235
0.577TyrTyr: 0.577 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (13865 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski