Amino acid dipepetide frequency for Klebsiella phage 4LV2017

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.783AlaAla: 11.783 ± 2.605
0.938AlaCys: 0.938 ± 0.376
6.361AlaAsp: 6.361 ± 0.867
6.361AlaGlu: 6.361 ± 1.113
3.441AlaPhe: 3.441 ± 0.4
6.048AlaGly: 6.048 ± 1.093
1.564AlaHis: 1.564 ± 0.471
3.858AlaIle: 3.858 ± 0.569
4.588AlaLys: 4.588 ± 0.621
10.845AlaLeu: 10.845 ± 1.805
3.128AlaMet: 3.128 ± 0.923
3.024AlaAsn: 3.024 ± 0.609
3.441AlaPro: 3.441 ± 0.781
3.545AlaGln: 3.545 ± 0.582
5.527AlaArg: 5.527 ± 0.813
6.152AlaSer: 6.152 ± 0.737
5.318AlaThr: 5.318 ± 0.797
8.238AlaVal: 8.238 ± 0.987
1.251AlaTrp: 1.251 ± 0.332
2.607AlaTyr: 2.607 ± 0.547
0.0AlaXaa: 0.0 ± 0.0
Cys
0.834CysAla: 0.834 ± 0.355
0.209CysCys: 0.209 ± 0.124
0.313CysAsp: 0.313 ± 0.192
0.417CysGlu: 0.417 ± 0.188
0.417CysPhe: 0.417 ± 0.226
0.73CysGly: 0.73 ± 0.254
0.104CysHis: 0.104 ± 0.098
0.209CysIle: 0.209 ± 0.138
0.417CysLys: 0.417 ± 0.187
1.251CysLeu: 1.251 ± 0.364
0.313CysMet: 0.313 ± 0.174
0.417CysAsn: 0.417 ± 0.241
0.521CysPro: 0.521 ± 0.214
0.209CysGln: 0.209 ± 0.142
0.73CysArg: 0.73 ± 0.366
0.834CysSer: 0.834 ± 0.375
0.417CysThr: 0.417 ± 0.219
0.834CysVal: 0.834 ± 0.305
0.417CysTrp: 0.417 ± 0.234
0.417CysTyr: 0.417 ± 0.18
0.0CysXaa: 0.0 ± 0.0
Asp
5.839AspAla: 5.839 ± 0.786
0.834AspCys: 0.834 ± 0.287
3.441AspAsp: 3.441 ± 0.564
3.024AspGlu: 3.024 ± 0.549
1.773AspPhe: 1.773 ± 0.469
4.38AspGly: 4.38 ± 0.712
0.938AspHis: 0.938 ± 0.29
3.024AspIle: 3.024 ± 0.448
3.233AspLys: 3.233 ± 0.557
4.588AspLeu: 4.588 ± 0.703
1.564AspMet: 1.564 ± 0.509
2.711AspAsn: 2.711 ± 0.531
1.877AspPro: 1.877 ± 0.533
2.19AspGln: 2.19 ± 0.328
2.086AspArg: 2.086 ± 0.399
2.398AspSer: 2.398 ± 0.444
3.441AspThr: 3.441 ± 0.679
3.65AspVal: 3.65 ± 0.664
0.521AspTrp: 0.521 ± 0.217
2.607AspTyr: 2.607 ± 0.548
0.0AspXaa: 0.0 ± 0.0
Glu
5.109GluAla: 5.109 ± 0.579
0.209GluCys: 0.209 ± 0.151
2.607GluAsp: 2.607 ± 0.479
2.92GluGlu: 2.92 ± 0.525
2.607GluPhe: 2.607 ± 0.618
3.65GluGly: 3.65 ± 0.637
1.043GluHis: 1.043 ± 0.319
3.65GluIle: 3.65 ± 0.61
3.441GluLys: 3.441 ± 0.459
8.655GluLeu: 8.655 ± 0.814
1.877GluMet: 1.877 ± 0.497
2.815GluAsn: 2.815 ± 0.464
2.503GluPro: 2.503 ± 0.527
3.962GluGln: 3.962 ± 0.765
3.858GluArg: 3.858 ± 0.591
3.441GluSer: 3.441 ± 0.58
2.92GluThr: 2.92 ± 0.454
3.65GluVal: 3.65 ± 0.579
1.043GluTrp: 1.043 ± 0.306
1.46GluTyr: 1.46 ± 0.383
0.0GluXaa: 0.0 ± 0.0
Phe
3.024PheAla: 3.024 ± 0.554
0.417PheCys: 0.417 ± 0.205
1.668PheAsp: 1.668 ± 0.388
1.981PheGlu: 1.981 ± 0.321
1.668PhePhe: 1.668 ± 0.53
1.877PheGly: 1.877 ± 0.513
0.626PheHis: 0.626 ± 0.232
1.564PheIle: 1.564 ± 0.691
1.773PheLys: 1.773 ± 0.503
2.086PheLeu: 2.086 ± 0.425
1.356PheMet: 1.356 ± 0.356
2.711PheAsn: 2.711 ± 0.606
1.564PhePro: 1.564 ± 0.369
1.043PheGln: 1.043 ± 0.365
2.503PheArg: 2.503 ± 0.517
3.337PheSer: 3.337 ± 0.536
3.128PheThr: 3.128 ± 0.612
2.19PheVal: 2.19 ± 0.528
0.73PheTrp: 0.73 ± 0.323
0.834PheTyr: 0.834 ± 0.247
0.0PheXaa: 0.0 ± 0.0
Gly
5.839GlyAla: 5.839 ± 1.088
0.521GlyCys: 0.521 ± 0.241
3.545GlyAsp: 3.545 ± 0.466
3.233GlyGlu: 3.233 ± 0.573
2.815GlyPhe: 2.815 ± 0.454
5.318GlyGly: 5.318 ± 1.029
0.626GlyHis: 0.626 ± 0.326
2.607GlyIle: 2.607 ± 0.581
4.38GlyLys: 4.38 ± 0.644
5.631GlyLeu: 5.631 ± 0.8
1.981GlyMet: 1.981 ± 0.41
3.337GlyAsn: 3.337 ± 0.609
2.398GlyPro: 2.398 ± 0.632
1.564GlyGln: 1.564 ± 0.307
2.92GlyArg: 2.92 ± 0.447
3.65GlySer: 3.65 ± 0.764
3.754GlyThr: 3.754 ± 0.538
5.735GlyVal: 5.735 ± 0.737
1.356GlyTrp: 1.356 ± 0.328
1.564GlyTyr: 1.564 ± 0.381
0.0GlyXaa: 0.0 ± 0.0
His
1.564HisAla: 1.564 ± 0.496
0.313HisCys: 0.313 ± 0.191
0.626HisAsp: 0.626 ± 0.332
0.834HisGlu: 0.834 ± 0.306
0.521HisPhe: 0.521 ± 0.23
0.938HisGly: 0.938 ± 0.323
0.73HisHis: 0.73 ± 0.267
0.73HisIle: 0.73 ± 0.242
1.668HisLys: 1.668 ± 0.42
1.773HisLeu: 1.773 ± 0.445
0.73HisMet: 0.73 ± 0.226
0.313HisAsn: 0.313 ± 0.201
1.251HisPro: 1.251 ± 0.301
0.521HisGln: 0.521 ± 0.195
1.251HisArg: 1.251 ± 0.288
1.147HisSer: 1.147 ± 0.398
1.251HisThr: 1.251 ± 0.442
0.73HisVal: 0.73 ± 0.257
0.417HisTrp: 0.417 ± 0.182
0.834HisTyr: 0.834 ± 0.292
0.0HisXaa: 0.0 ± 0.0
Ile
4.067IleAla: 4.067 ± 0.533
0.73IleCys: 0.73 ± 0.208
2.607IleAsp: 2.607 ± 0.526
2.503IleGlu: 2.503 ± 0.533
1.877IlePhe: 1.877 ± 0.54
2.815IleGly: 2.815 ± 0.532
0.521IleHis: 0.521 ± 0.267
3.233IleIle: 3.233 ± 0.689
1.981IleLys: 1.981 ± 0.355
3.024IleLeu: 3.024 ± 0.634
1.251IleMet: 1.251 ± 0.35
4.275IleAsn: 4.275 ± 0.827
2.503IlePro: 2.503 ± 0.532
1.668IleGln: 1.668 ± 0.37
2.815IleArg: 2.815 ± 0.499
4.901IleSer: 4.901 ± 0.97
4.692IleThr: 4.692 ± 0.788
2.815IleVal: 2.815 ± 0.434
0.313IleTrp: 0.313 ± 0.153
0.938IleTyr: 0.938 ± 0.269
0.0IleXaa: 0.0 ± 0.0
Lys
5.527LysAla: 5.527 ± 0.73
0.209LysCys: 0.209 ± 0.138
2.815LysAsp: 2.815 ± 0.431
4.171LysGlu: 4.171 ± 0.611
2.294LysPhe: 2.294 ± 0.564
3.233LysGly: 3.233 ± 0.584
0.938LysHis: 0.938 ± 0.385
2.086LysIle: 2.086 ± 0.507
4.275LysLys: 4.275 ± 0.922
4.484LysLeu: 4.484 ± 0.604
1.251LysMet: 1.251 ± 0.425
1.564LysAsn: 1.564 ± 0.377
2.294LysPro: 2.294 ± 0.738
2.294LysGln: 2.294 ± 0.512
3.545LysArg: 3.545 ± 0.538
4.067LysSer: 4.067 ± 0.536
3.337LysThr: 3.337 ± 0.453
3.233LysVal: 3.233 ± 0.44
0.834LysTrp: 0.834 ± 0.308
2.503LysTyr: 2.503 ± 0.433
0.0LysXaa: 0.0 ± 0.0
Leu
10.845LeuAla: 10.845 ± 1.41
1.043LeuCys: 1.043 ± 0.297
5.318LeuAsp: 5.318 ± 0.623
5.735LeuGlu: 5.735 ± 0.644
2.92LeuPhe: 2.92 ± 0.687
6.361LeuGly: 6.361 ± 1.252
1.46LeuHis: 1.46 ± 0.381
4.901LeuIle: 4.901 ± 0.861
4.692LeuLys: 4.692 ± 0.737
9.385LeuLeu: 9.385 ± 0.887
3.024LeuMet: 3.024 ± 0.594
4.588LeuAsn: 4.588 ± 0.703
4.067LeuPro: 4.067 ± 0.695
4.38LeuGln: 4.38 ± 0.696
6.986LeuArg: 6.986 ± 0.869
8.238LeuSer: 8.238 ± 0.819
6.986LeuThr: 6.986 ± 0.865
5.109LeuVal: 5.109 ± 0.7
1.356LeuTrp: 1.356 ± 0.362
2.607LeuTyr: 2.607 ± 0.409
0.0LeuXaa: 0.0 ± 0.0
Met
3.65MetAla: 3.65 ± 0.546
0.209MetCys: 0.209 ± 0.12
0.938MetAsp: 0.938 ± 0.353
1.668MetGlu: 1.668 ± 0.333
0.834MetPhe: 0.834 ± 0.221
1.251MetGly: 1.251 ± 0.379
0.417MetHis: 0.417 ± 0.199
0.73MetIle: 0.73 ± 0.295
1.564MetLys: 1.564 ± 0.337
3.024MetLeu: 3.024 ± 0.531
1.564MetMet: 1.564 ± 0.348
1.46MetAsn: 1.46 ± 0.396
1.251MetPro: 1.251 ± 0.431
0.938MetGln: 0.938 ± 0.364
1.773MetArg: 1.773 ± 0.495
2.19MetSer: 2.19 ± 0.414
1.251MetThr: 1.251 ± 0.397
1.877MetVal: 1.877 ± 0.281
0.313MetTrp: 0.313 ± 0.181
0.626MetTyr: 0.626 ± 0.274
0.0MetXaa: 0.0 ± 0.0
Asn
4.38AsnAla: 4.38 ± 0.557
0.521AsnCys: 0.521 ± 0.227
1.981AsnAsp: 1.981 ± 0.497
3.233AsnGlu: 3.233 ± 0.557
1.564AsnPhe: 1.564 ± 0.488
2.92AsnGly: 2.92 ± 0.485
0.626AsnHis: 0.626 ± 0.322
2.398AsnIle: 2.398 ± 0.506
2.398AsnLys: 2.398 ± 0.546
3.962AsnLeu: 3.962 ± 0.512
0.626AsnMet: 0.626 ± 0.215
2.503AsnAsn: 2.503 ± 0.557
3.545AsnPro: 3.545 ± 0.645
1.877AsnGln: 1.877 ± 0.473
3.233AsnArg: 3.233 ± 0.681
2.711AsnSer: 2.711 ± 0.677
1.877AsnThr: 1.877 ± 0.391
2.294AsnVal: 2.294 ± 0.488
0.73AsnTrp: 0.73 ± 0.378
1.251AsnTyr: 1.251 ± 0.442
0.0AsnXaa: 0.0 ± 0.0
Pro
4.901ProAla: 4.901 ± 0.867
0.313ProCys: 0.313 ± 0.161
2.607ProAsp: 2.607 ± 0.536
2.503ProGlu: 2.503 ± 0.475
0.938ProPhe: 0.938 ± 0.282
1.668ProGly: 1.668 ± 0.415
1.043ProHis: 1.043 ± 0.425
1.564ProIle: 1.564 ± 0.413
2.086ProLys: 2.086 ± 0.461
4.901ProLeu: 4.901 ± 0.725
0.626ProMet: 0.626 ± 0.201
1.46ProAsn: 1.46 ± 0.412
1.773ProPro: 1.773 ± 0.307
1.356ProGln: 1.356 ± 0.404
2.398ProArg: 2.398 ± 0.467
2.607ProSer: 2.607 ± 0.423
2.19ProThr: 2.19 ± 0.525
3.962ProVal: 3.962 ± 0.562
0.313ProTrp: 0.313 ± 0.176
1.46ProTyr: 1.46 ± 0.477
0.0ProXaa: 0.0 ± 0.0
Gln
3.441GlnAla: 3.441 ± 0.585
0.521GlnCys: 0.521 ± 0.218
1.773GlnAsp: 1.773 ± 0.428
2.503GlnGlu: 2.503 ± 0.442
0.834GlnPhe: 0.834 ± 0.31
2.607GlnGly: 2.607 ± 0.643
0.73GlnHis: 0.73 ± 0.274
1.773GlnIle: 1.773 ± 0.443
1.877GlnLys: 1.877 ± 0.461
4.38GlnLeu: 4.38 ± 0.581
1.043GlnMet: 1.043 ± 0.251
1.564GlnAsn: 1.564 ± 0.416
2.19GlnPro: 2.19 ± 0.512
2.398GlnGln: 2.398 ± 0.612
3.65GlnArg: 3.65 ± 0.63
2.503GlnSer: 2.503 ± 0.471
2.711GlnThr: 2.711 ± 0.643
2.398GlnVal: 2.398 ± 0.492
0.834GlnTrp: 0.834 ± 0.264
1.147GlnTyr: 1.147 ± 0.361
0.0GlnXaa: 0.0 ± 0.0
Arg
3.962ArgAla: 3.962 ± 0.481
0.626ArgCys: 0.626 ± 0.246
3.337ArgAsp: 3.337 ± 0.46
4.692ArgGlu: 4.692 ± 0.655
2.19ArgPhe: 2.19 ± 0.514
3.024ArgGly: 3.024 ± 0.658
2.294ArgHis: 2.294 ± 0.486
3.545ArgIle: 3.545 ± 0.522
4.171ArgLys: 4.171 ± 1.04
7.091ArgLeu: 7.091 ± 1.139
1.356ArgMet: 1.356 ± 0.316
2.607ArgAsn: 2.607 ± 0.532
0.938ArgPro: 0.938 ± 0.258
3.754ArgGln: 3.754 ± 0.631
5.109ArgArg: 5.109 ± 0.798
3.337ArgSer: 3.337 ± 0.631
2.92ArgThr: 2.92 ± 0.535
4.067ArgVal: 4.067 ± 0.632
1.356ArgTrp: 1.356 ± 0.337
1.981ArgTyr: 1.981 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
6.986SerAla: 6.986 ± 0.936
0.313SerCys: 0.313 ± 0.21
3.337SerAsp: 3.337 ± 0.526
3.754SerGlu: 3.754 ± 0.495
3.024SerPhe: 3.024 ± 0.795
5.839SerGly: 5.839 ± 0.83
1.356SerHis: 1.356 ± 0.437
3.441SerIle: 3.441 ± 0.655
3.754SerLys: 3.754 ± 0.797
6.465SerLeu: 6.465 ± 0.753
1.251SerMet: 1.251 ± 0.248
2.92SerAsn: 2.92 ± 0.727
2.19SerPro: 2.19 ± 0.35
2.294SerGln: 2.294 ± 0.69
3.441SerArg: 3.441 ± 0.474
4.588SerSer: 4.588 ± 0.603
4.067SerThr: 4.067 ± 0.429
3.858SerVal: 3.858 ± 0.722
1.46SerTrp: 1.46 ± 0.455
2.19SerTyr: 2.19 ± 0.35
0.0SerXaa: 0.0 ± 0.0
Thr
5.631ThrAla: 5.631 ± 1.12
0.521ThrCys: 0.521 ± 0.35
4.484ThrAsp: 4.484 ± 0.678
3.65ThrGlu: 3.65 ± 0.857
2.398ThrPhe: 2.398 ± 0.523
5.109ThrGly: 5.109 ± 0.844
1.46ThrHis: 1.46 ± 0.472
3.441ThrIle: 3.441 ± 0.677
2.086ThrLys: 2.086 ± 0.467
6.674ThrLeu: 6.674 ± 0.767
1.251ThrMet: 1.251 ± 0.277
1.981ThrAsn: 1.981 ± 0.337
2.607ThrPro: 2.607 ± 0.552
1.981ThrGln: 1.981 ± 0.458
3.754ThrArg: 3.754 ± 0.548
3.545ThrSer: 3.545 ± 0.478
3.962ThrThr: 3.962 ± 0.862
4.38ThrVal: 4.38 ± 0.712
0.73ThrTrp: 0.73 ± 0.26
1.564ThrTyr: 1.564 ± 0.402
0.0ThrXaa: 0.0 ± 0.0
Val
6.465ValAla: 6.465 ± 1.252
0.521ValCys: 0.521 ± 0.212
4.484ValAsp: 4.484 ± 0.771
5.005ValGlu: 5.005 ± 0.806
1.877ValPhe: 1.877 ± 0.436
2.086ValGly: 2.086 ± 0.468
0.73ValHis: 0.73 ± 0.236
3.65ValIle: 3.65 ± 0.538
4.38ValLys: 4.38 ± 0.668
6.048ValLeu: 6.048 ± 0.729
2.711ValMet: 2.711 ± 0.599
3.024ValAsn: 3.024 ± 0.649
2.398ValPro: 2.398 ± 0.521
2.92ValGln: 2.92 ± 0.62
3.024ValArg: 3.024 ± 0.429
4.797ValSer: 4.797 ± 0.771
4.484ValThr: 4.484 ± 0.666
4.692ValVal: 4.692 ± 0.621
0.73ValTrp: 0.73 ± 0.279
2.086ValTyr: 2.086 ± 0.576
0.0ValXaa: 0.0 ± 0.0
Trp
1.668TrpAla: 1.668 ± 0.525
0.209TrpCys: 0.209 ± 0.166
0.521TrpAsp: 0.521 ± 0.297
1.043TrpGlu: 1.043 ± 0.299
0.521TrpPhe: 0.521 ± 0.189
0.521TrpGly: 0.521 ± 0.203
0.313TrpHis: 0.313 ± 0.179
0.938TrpIle: 0.938 ± 0.286
0.834TrpLys: 0.834 ± 0.268
2.398TrpLeu: 2.398 ± 0.491
0.104TrpMet: 0.104 ± 0.121
0.313TrpAsn: 0.313 ± 0.161
0.626TrpPro: 0.626 ± 0.257
0.626TrpGln: 0.626 ± 0.233
1.46TrpArg: 1.46 ± 0.419
0.626TrpSer: 0.626 ± 0.197
0.73TrpThr: 0.73 ± 0.249
1.043TrpVal: 1.043 ± 0.368
0.521TrpTrp: 0.521 ± 0.189
0.521TrpTyr: 0.521 ± 0.234
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.503TyrAla: 2.503 ± 0.409
0.834TyrCys: 0.834 ± 0.272
1.356TyrAsp: 1.356 ± 0.357
2.294TyrGlu: 2.294 ± 0.428
1.564TyrPhe: 1.564 ± 0.542
2.398TyrGly: 2.398 ± 0.413
0.626TyrHis: 0.626 ± 0.291
2.19TyrIle: 2.19 ± 0.517
1.356TyrLys: 1.356 ± 0.425
3.337TyrLeu: 3.337 ± 0.593
0.417TyrMet: 0.417 ± 0.181
1.043TyrAsn: 1.043 ± 0.354
0.73TyrPro: 0.73 ± 0.231
1.356TyrGln: 1.356 ± 0.457
2.294TyrArg: 2.294 ± 0.431
1.46TyrSer: 1.46 ± 0.42
1.773TyrThr: 1.773 ± 0.402
1.356TyrVal: 1.356 ± 0.417
0.313TyrTrp: 0.313 ± 0.147
0.313TyrTyr: 0.313 ± 0.199
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 38 proteins (9591 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski