Amino acid dipepetide frequency for Klebsiella phage 2b LV-2017

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.914AlaAla: 14.914 ± 2.212
0.944AlaCys: 0.944 ± 0.305
7.079AlaAsp: 7.079 ± 0.995
7.646AlaGlu: 7.646 ± 1.12
2.643AlaPhe: 2.643 ± 0.463
7.835AlaGly: 7.835 ± 0.955
1.605AlaHis: 1.605 ± 0.631
6.324AlaIle: 6.324 ± 0.851
6.324AlaLys: 6.324 ± 1.001
8.967AlaLeu: 8.967 ± 1.043
3.021AlaMet: 3.021 ± 0.558
4.625AlaAsn: 4.625 ± 0.727
2.454AlaPro: 2.454 ± 0.533
6.608AlaGln: 6.608 ± 1.269
6.23AlaArg: 6.23 ± 0.781
7.268AlaSer: 7.268 ± 0.966
5.947AlaThr: 5.947 ± 0.787
6.513AlaVal: 6.513 ± 0.712
1.793AlaTrp: 1.793 ± 0.423
3.021AlaTyr: 3.021 ± 0.591
0.0AlaXaa: 0.0 ± 0.0
Cys
1.51CysAla: 1.51 ± 0.483
0.378CysCys: 0.378 ± 0.188
1.038CysAsp: 1.038 ± 0.358
0.566CysGlu: 0.566 ± 0.26
0.094CysPhe: 0.094 ± 0.099
0.755CysGly: 0.755 ± 0.269
0.378CysHis: 0.378 ± 0.174
0.472CysIle: 0.472 ± 0.2
0.755CysLys: 0.755 ± 0.286
0.661CysLeu: 0.661 ± 0.278
0.283CysMet: 0.283 ± 0.171
0.283CysAsn: 0.283 ± 0.177
0.472CysPro: 0.472 ± 0.188
0.755CysGln: 0.755 ± 0.342
1.322CysArg: 1.322 ± 0.374
0.85CysSer: 0.85 ± 0.338
0.566CysThr: 0.566 ± 0.323
0.755CysVal: 0.755 ± 0.263
0.472CysTrp: 0.472 ± 0.211
0.283CysTyr: 0.283 ± 0.141
0.0CysXaa: 0.0 ± 0.0
Asp
5.947AspAla: 5.947 ± 0.637
0.661AspCys: 0.661 ± 0.247
4.72AspAsp: 4.72 ± 0.745
3.965AspGlu: 3.965 ± 0.65
2.077AspPhe: 2.077 ± 0.443
5.286AspGly: 5.286 ± 0.978
0.661AspHis: 0.661 ± 0.249
2.926AspIle: 2.926 ± 0.536
3.115AspLys: 3.115 ± 0.77
4.248AspLeu: 4.248 ± 0.544
1.51AspMet: 1.51 ± 0.371
2.454AspAsn: 2.454 ± 0.664
2.549AspPro: 2.549 ± 0.487
1.793AspGln: 1.793 ± 0.394
2.454AspArg: 2.454 ± 0.475
4.059AspSer: 4.059 ± 0.599
3.021AspThr: 3.021 ± 0.726
3.681AspVal: 3.681 ± 0.611
1.605AspTrp: 1.605 ± 0.405
2.737AspTyr: 2.737 ± 0.574
0.0AspXaa: 0.0 ± 0.0
Glu
6.419GluAla: 6.419 ± 0.87
0.85GluCys: 0.85 ± 0.32
2.171GluAsp: 2.171 ± 0.449
3.587GluGlu: 3.587 ± 0.729
3.398GluPhe: 3.398 ± 0.588
4.153GluGly: 4.153 ± 0.768
0.944GluHis: 0.944 ± 0.254
4.059GluIle: 4.059 ± 0.667
3.776GluLys: 3.776 ± 0.786
4.814GluLeu: 4.814 ± 0.734
2.549GluMet: 2.549 ± 0.562
2.265GluAsn: 2.265 ± 0.389
2.737GluPro: 2.737 ± 0.52
4.059GluGln: 4.059 ± 0.771
4.531GluArg: 4.531 ± 0.671
3.87GluSer: 3.87 ± 0.737
1.888GluThr: 1.888 ± 0.377
4.059GluVal: 4.059 ± 0.594
1.605GluTrp: 1.605 ± 0.392
2.643GluTyr: 2.643 ± 0.542
0.0GluXaa: 0.0 ± 0.0
Phe
3.398PheAla: 3.398 ± 0.653
0.378PheCys: 0.378 ± 0.181
2.832PheAsp: 2.832 ± 0.527
1.605PheGlu: 1.605 ± 0.427
1.605PhePhe: 1.605 ± 0.577
2.926PheGly: 2.926 ± 0.479
0.755PheHis: 0.755 ± 0.253
1.699PheIle: 1.699 ± 0.422
1.699PheLys: 1.699 ± 0.469
1.793PheLeu: 1.793 ± 0.488
0.85PheMet: 0.85 ± 0.245
2.36PheAsn: 2.36 ± 0.525
0.755PhePro: 0.755 ± 0.386
0.944PheGln: 0.944 ± 0.257
2.454PheArg: 2.454 ± 0.656
2.265PheSer: 2.265 ± 0.374
2.643PheThr: 2.643 ± 0.501
2.077PheVal: 2.077 ± 0.303
0.85PheTrp: 0.85 ± 0.289
0.944PheTyr: 0.944 ± 0.285
0.0PheXaa: 0.0 ± 0.0
Gly
7.174GlyAla: 7.174 ± 0.886
1.699GlyCys: 1.699 ± 0.456
4.531GlyAsp: 4.531 ± 0.634
6.041GlyGlu: 6.041 ± 0.628
2.643GlyPhe: 2.643 ± 0.539
4.531GlyGly: 4.531 ± 0.735
0.661GlyHis: 0.661 ± 0.278
4.342GlyIle: 4.342 ± 0.591
5.569GlyLys: 5.569 ± 0.815
6.608GlyLeu: 6.608 ± 0.792
0.85GlyMet: 0.85 ± 0.229
3.115GlyAsn: 3.115 ± 0.652
1.322GlyPro: 1.322 ± 0.299
3.493GlyGln: 3.493 ± 0.487
3.965GlyArg: 3.965 ± 0.753
2.926GlySer: 2.926 ± 0.546
3.965GlyThr: 3.965 ± 0.662
4.059GlyVal: 4.059 ± 0.57
1.133GlyTrp: 1.133 ± 0.358
2.643GlyTyr: 2.643 ± 0.605
0.0GlyXaa: 0.0 ± 0.0
His
1.133HisAla: 1.133 ± 0.417
0.283HisCys: 0.283 ± 0.161
1.038HisAsp: 1.038 ± 0.28
0.755HisGlu: 0.755 ± 0.442
0.661HisPhe: 0.661 ± 0.287
0.944HisGly: 0.944 ± 0.298
0.472HisHis: 0.472 ± 0.203
1.038HisIle: 1.038 ± 0.346
0.566HisLys: 0.566 ± 0.253
1.322HisLeu: 1.322 ± 0.498
0.189HisMet: 0.189 ± 0.137
0.566HisAsn: 0.566 ± 0.261
0.755HisPro: 0.755 ± 0.24
0.85HisGln: 0.85 ± 0.327
1.133HisArg: 1.133 ± 0.29
0.85HisSer: 0.85 ± 0.319
1.227HisThr: 1.227 ± 0.368
1.51HisVal: 1.51 ± 0.383
0.283HisTrp: 0.283 ± 0.177
0.378HisTyr: 0.378 ± 0.177
0.0HisXaa: 0.0 ± 0.0
Ile
5.475IleAla: 5.475 ± 0.658
0.378IleCys: 0.378 ± 0.155
3.398IleAsp: 3.398 ± 0.407
3.587IleGlu: 3.587 ± 0.684
2.077IlePhe: 2.077 ± 0.498
4.625IleGly: 4.625 ± 0.715
0.85IleHis: 0.85 ± 0.258
2.171IleIle: 2.171 ± 0.42
2.832IleLys: 2.832 ± 0.429
3.021IleLeu: 3.021 ± 0.533
0.566IleMet: 0.566 ± 0.261
1.982IleAsn: 1.982 ± 0.35
2.454IlePro: 2.454 ± 0.526
1.605IleGln: 1.605 ± 0.536
3.021IleArg: 3.021 ± 0.667
5.003IleSer: 5.003 ± 0.707
4.248IleThr: 4.248 ± 0.593
3.209IleVal: 3.209 ± 0.586
0.661IleTrp: 0.661 ± 0.301
1.51IleTyr: 1.51 ± 0.431
0.0IleXaa: 0.0 ± 0.0
Lys
6.419LysAla: 6.419 ± 0.9
1.038LysCys: 1.038 ± 0.306
3.209LysAsp: 3.209 ± 0.589
3.681LysGlu: 3.681 ± 0.685
1.605LysPhe: 1.605 ± 0.263
3.587LysGly: 3.587 ± 0.5
0.944LysHis: 0.944 ± 0.364
2.077LysIle: 2.077 ± 0.444
3.021LysLys: 3.021 ± 0.596
3.87LysLeu: 3.87 ± 0.475
1.416LysMet: 1.416 ± 0.436
1.699LysAsn: 1.699 ± 0.408
2.643LysPro: 2.643 ± 0.645
2.926LysGln: 2.926 ± 0.444
4.248LysArg: 4.248 ± 0.68
3.021LysSer: 3.021 ± 0.498
2.832LysThr: 2.832 ± 0.525
4.625LysVal: 4.625 ± 0.565
0.755LysTrp: 0.755 ± 0.243
1.227LysTyr: 1.227 ± 0.28
0.0LysXaa: 0.0 ± 0.0
Leu
9.062LeuAla: 9.062 ± 1.296
1.038LeuCys: 1.038 ± 0.354
4.342LeuAsp: 4.342 ± 0.626
4.059LeuGlu: 4.059 ± 0.494
2.926LeuPhe: 2.926 ± 0.584
4.342LeuGly: 4.342 ± 0.587
1.227LeuHis: 1.227 ± 0.347
4.153LeuIle: 4.153 ± 0.563
4.248LeuLys: 4.248 ± 0.598
5.852LeuLeu: 5.852 ± 0.676
1.605LeuMet: 1.605 ± 0.412
3.398LeuAsn: 3.398 ± 0.518
3.587LeuPro: 3.587 ± 0.686
3.87LeuGln: 3.87 ± 0.541
4.814LeuArg: 4.814 ± 0.742
5.38LeuSer: 5.38 ± 0.754
4.531LeuThr: 4.531 ± 0.664
3.398LeuVal: 3.398 ± 0.583
0.661LeuTrp: 0.661 ± 0.276
2.832LeuTyr: 2.832 ± 0.458
0.0LeuXaa: 0.0 ± 0.0
Met
2.549MetAla: 2.549 ± 0.635
0.094MetCys: 0.094 ± 0.088
0.944MetAsp: 0.944 ± 0.259
1.416MetGlu: 1.416 ± 0.31
0.566MetPhe: 0.566 ± 0.191
1.227MetGly: 1.227 ± 0.387
0.472MetHis: 0.472 ± 0.205
0.944MetIle: 0.944 ± 0.329
1.699MetLys: 1.699 ± 0.488
1.133MetLeu: 1.133 ± 0.339
0.378MetMet: 0.378 ± 0.235
1.038MetAsn: 1.038 ± 0.276
1.605MetPro: 1.605 ± 0.524
1.038MetGln: 1.038 ± 0.279
1.322MetArg: 1.322 ± 0.478
2.454MetSer: 2.454 ± 0.41
1.605MetThr: 1.605 ± 0.363
1.51MetVal: 1.51 ± 0.44
0.378MetTrp: 0.378 ± 0.168
0.85MetTyr: 0.85 ± 0.246
0.0MetXaa: 0.0 ± 0.0
Asn
5.38AsnAla: 5.38 ± 0.789
0.189AsnCys: 0.189 ± 0.122
2.265AsnAsp: 2.265 ± 0.609
3.021AsnGlu: 3.021 ± 0.528
1.416AsnPhe: 1.416 ± 0.356
3.209AsnGly: 3.209 ± 0.47
0.283AsnHis: 0.283 ± 0.162
2.265AsnIle: 2.265 ± 0.424
1.888AsnLys: 1.888 ± 0.331
3.115AsnLeu: 3.115 ± 0.537
1.51AsnMet: 1.51 ± 0.321
2.171AsnAsn: 2.171 ± 0.496
2.265AsnPro: 2.265 ± 0.388
1.793AsnGln: 1.793 ± 0.455
2.926AsnArg: 2.926 ± 0.491
2.643AsnSer: 2.643 ± 0.565
1.699AsnThr: 1.699 ± 0.448
2.549AsnVal: 2.549 ± 0.474
0.472AsnTrp: 0.472 ± 0.233
1.605AsnTyr: 1.605 ± 0.314
0.0AsnXaa: 0.0 ± 0.0
Pro
3.587ProAla: 3.587 ± 0.669
0.378ProCys: 0.378 ± 0.195
3.115ProAsp: 3.115 ± 0.555
3.304ProGlu: 3.304 ± 0.641
1.227ProPhe: 1.227 ± 0.405
3.965ProGly: 3.965 ± 0.696
0.566ProHis: 0.566 ± 0.249
2.36ProIle: 2.36 ± 0.641
1.322ProLys: 1.322 ± 0.343
3.398ProLeu: 3.398 ± 0.587
0.944ProMet: 0.944 ± 0.322
1.416ProAsn: 1.416 ± 0.402
1.51ProPro: 1.51 ± 0.423
1.322ProGln: 1.322 ± 0.35
1.699ProArg: 1.699 ± 0.431
1.982ProSer: 1.982 ± 0.399
2.643ProThr: 2.643 ± 0.616
3.587ProVal: 3.587 ± 0.705
0.472ProTrp: 0.472 ± 0.213
0.472ProTyr: 0.472 ± 0.257
0.0ProXaa: 0.0 ± 0.0
Gln
5.286GlnAla: 5.286 ± 1.438
0.85GlnCys: 0.85 ± 0.31
2.077GlnAsp: 2.077 ± 0.382
2.549GlnGlu: 2.549 ± 0.563
1.51GlnPhe: 1.51 ± 0.381
2.265GlnGly: 2.265 ± 0.635
0.944GlnHis: 0.944 ± 0.329
2.832GlnIle: 2.832 ± 0.461
3.021GlnLys: 3.021 ± 0.652
4.153GlnLeu: 4.153 ± 0.473
0.85GlnMet: 0.85 ± 0.27
1.699GlnAsn: 1.699 ± 0.428
1.605GlnPro: 1.605 ± 0.434
2.643GlnGln: 2.643 ± 0.598
2.926GlnArg: 2.926 ± 0.559
2.549GlnSer: 2.549 ± 0.488
2.832GlnThr: 2.832 ± 0.521
2.643GlnVal: 2.643 ± 0.451
1.51GlnTrp: 1.51 ± 0.397
1.133GlnTyr: 1.133 ± 0.313
0.0GlnXaa: 0.0 ± 0.0
Arg
7.268ArgAla: 7.268 ± 1.064
0.755ArgCys: 0.755 ± 0.308
3.493ArgAsp: 3.493 ± 0.692
5.192ArgGlu: 5.192 ± 0.978
1.133ArgPhe: 1.133 ± 0.281
4.342ArgGly: 4.342 ± 0.451
1.322ArgHis: 1.322 ± 0.307
2.737ArgIle: 2.737 ± 0.583
4.153ArgLys: 4.153 ± 0.729
5.475ArgLeu: 5.475 ± 0.635
1.416ArgMet: 1.416 ± 0.373
2.454ArgAsn: 2.454 ± 0.478
2.549ArgPro: 2.549 ± 0.514
3.115ArgGln: 3.115 ± 0.551
4.436ArgArg: 4.436 ± 1.023
3.87ArgSer: 3.87 ± 0.616
2.36ArgThr: 2.36 ± 0.467
3.776ArgVal: 3.776 ± 0.566
1.038ArgTrp: 1.038 ± 0.383
2.36ArgTyr: 2.36 ± 0.591
0.0ArgXaa: 0.0 ± 0.0
Ser
7.646SerAla: 7.646 ± 1.195
0.566SerCys: 0.566 ± 0.241
3.209SerAsp: 3.209 ± 0.507
4.436SerGlu: 4.436 ± 0.759
1.982SerPhe: 1.982 ± 0.467
6.324SerGly: 6.324 ± 0.854
1.227SerHis: 1.227 ± 0.413
3.021SerIle: 3.021 ± 0.477
2.926SerLys: 2.926 ± 0.621
5.947SerLeu: 5.947 ± 1.119
2.171SerMet: 2.171 ± 0.508
2.643SerAsn: 2.643 ± 0.639
2.265SerPro: 2.265 ± 0.473
1.888SerGln: 1.888 ± 0.513
3.776SerArg: 3.776 ± 0.567
3.398SerSer: 3.398 ± 0.605
3.021SerThr: 3.021 ± 0.516
4.531SerVal: 4.531 ± 0.593
1.227SerTrp: 1.227 ± 0.331
0.85SerTyr: 0.85 ± 0.295
0.0SerXaa: 0.0 ± 0.0
Thr
7.363ThrAla: 7.363 ± 0.829
0.378ThrCys: 0.378 ± 0.201
3.398ThrAsp: 3.398 ± 0.668
3.209ThrGlu: 3.209 ± 0.525
2.265ThrPhe: 2.265 ± 0.496
4.814ThrGly: 4.814 ± 0.716
0.472ThrHis: 0.472 ± 0.185
3.398ThrIle: 3.398 ± 0.711
2.171ThrLys: 2.171 ± 0.43
4.153ThrLeu: 4.153 ± 0.783
0.566ThrMet: 0.566 ± 0.175
2.265ThrAsn: 2.265 ± 0.399
2.832ThrPro: 2.832 ± 0.38
2.077ThrGln: 2.077 ± 0.53
3.398ThrArg: 3.398 ± 0.551
3.87ThrSer: 3.87 ± 0.71
4.531ThrThr: 4.531 ± 0.726
3.398ThrVal: 3.398 ± 0.695
1.322ThrTrp: 1.322 ± 0.429
1.322ThrTyr: 1.322 ± 0.477
0.0ThrXaa: 0.0 ± 0.0
Val
6.985ValAla: 6.985 ± 1.035
0.85ValCys: 0.85 ± 0.35
3.776ValAsp: 3.776 ± 0.81
3.021ValGlu: 3.021 ± 0.552
2.643ValPhe: 2.643 ± 0.463
3.776ValGly: 3.776 ± 0.598
1.133ValHis: 1.133 ± 0.312
3.681ValIle: 3.681 ± 0.572
3.87ValLys: 3.87 ± 0.675
2.643ValLeu: 2.643 ± 0.491
1.416ValMet: 1.416 ± 0.351
4.153ValAsn: 4.153 ± 1.029
2.926ValPro: 2.926 ± 0.574
2.737ValGln: 2.737 ± 0.534
3.87ValArg: 3.87 ± 0.811
4.059ValSer: 4.059 ± 0.566
4.908ValThr: 4.908 ± 0.816
5.38ValVal: 5.38 ± 0.941
1.227ValTrp: 1.227 ± 0.365
1.322ValTyr: 1.322 ± 0.38
0.0ValXaa: 0.0 ± 0.0
Trp
0.944TrpAla: 0.944 ± 0.286
0.378TrpCys: 0.378 ± 0.169
0.85TrpAsp: 0.85 ± 0.248
1.51TrpGlu: 1.51 ± 0.459
0.755TrpPhe: 0.755 ± 0.244
0.661TrpGly: 0.661 ± 0.235
0.378TrpHis: 0.378 ± 0.169
0.85TrpIle: 0.85 ± 0.323
1.416TrpLys: 1.416 ± 0.467
2.077TrpLeu: 2.077 ± 0.537
0.283TrpMet: 0.283 ± 0.163
0.85TrpAsn: 0.85 ± 0.26
0.566TrpPro: 0.566 ± 0.222
0.661TrpGln: 0.661 ± 0.185
2.265TrpArg: 2.265 ± 0.546
0.85TrpSer: 0.85 ± 0.37
0.85TrpThr: 0.85 ± 0.33
1.322TrpVal: 1.322 ± 0.416
0.189TrpTrp: 0.189 ± 0.135
0.566TrpTyr: 0.566 ± 0.229
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.398TyrAla: 3.398 ± 0.471
0.566TyrCys: 0.566 ± 0.198
1.982TyrAsp: 1.982 ± 0.528
1.227TyrGlu: 1.227 ± 0.408
1.605TyrPhe: 1.605 ± 0.611
1.699TyrGly: 1.699 ± 0.341
0.661TyrHis: 0.661 ± 0.296
1.322TyrIle: 1.322 ± 0.456
0.472TyrLys: 0.472 ± 0.227
2.077TyrLeu: 2.077 ± 0.442
0.755TyrMet: 0.755 ± 0.281
1.227TyrAsn: 1.227 ± 0.38
1.416TyrPro: 1.416 ± 0.367
1.699TyrGln: 1.699 ± 0.353
2.454TyrArg: 2.454 ± 0.466
1.888TyrSer: 1.888 ± 0.455
1.888TyrThr: 1.888 ± 0.485
1.793TyrVal: 1.793 ± 0.443
0.566TyrTrp: 0.566 ± 0.202
1.038TyrTyr: 1.038 ± 0.367
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10595 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski