Amino acid dipepetide frequency for Klebsiella phage vB_KpnP_KpV763

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.439AlaAla: 8.439 ± 1.112
0.723AlaCys: 0.723 ± 0.222
6.189AlaAsp: 6.189 ± 0.761
5.305AlaGlu: 5.305 ± 0.688
3.295AlaPhe: 3.295 ± 0.409
7.073AlaGly: 7.073 ± 1.096
1.286AlaHis: 1.286 ± 0.27
4.421AlaIle: 4.421 ± 0.547
6.43AlaLys: 6.43 ± 0.71
7.877AlaLeu: 7.877 ± 0.92
2.733AlaMet: 2.733 ± 0.63
4.34AlaAsn: 4.34 ± 0.408
2.893AlaPro: 2.893 ± 0.488
3.697AlaGln: 3.697 ± 0.546
5.144AlaArg: 5.144 ± 0.642
5.305AlaSer: 5.305 ± 0.641
4.099AlaThr: 4.099 ± 0.54
5.626AlaVal: 5.626 ± 0.567
1.206AlaTrp: 1.206 ± 0.365
3.215AlaTyr: 3.215 ± 0.488
0.0AlaXaa: 0.0 ± 0.0
Cys
0.723CysAla: 0.723 ± 0.248
0.08CysCys: 0.08 ± 0.068
0.723CysAsp: 0.723 ± 0.276
0.804CysGlu: 0.804 ± 0.275
0.482CysPhe: 0.482 ± 0.217
0.723CysGly: 0.723 ± 0.207
0.161CysHis: 0.161 ± 0.11
0.723CysIle: 0.723 ± 0.237
0.321CysLys: 0.321 ± 0.159
0.884CysLeu: 0.884 ± 0.28
0.08CysMet: 0.08 ± 0.074
0.321CysAsn: 0.321 ± 0.146
0.482CysPro: 0.482 ± 0.197
0.643CysGln: 0.643 ± 0.298
0.723CysArg: 0.723 ± 0.277
0.804CysSer: 0.804 ± 0.269
0.563CysThr: 0.563 ± 0.225
0.643CysVal: 0.643 ± 0.207
0.161CysTrp: 0.161 ± 0.104
0.402CysTyr: 0.402 ± 0.211
0.0CysXaa: 0.0 ± 0.0
Asp
5.305AspAla: 5.305 ± 0.556
0.402AspCys: 0.402 ± 0.187
4.421AspAsp: 4.421 ± 0.484
4.019AspGlu: 4.019 ± 0.607
2.652AspPhe: 2.652 ± 0.481
6.43AspGly: 6.43 ± 0.664
0.884AspHis: 0.884 ± 0.275
2.492AspIle: 2.492 ± 0.344
4.019AspLys: 4.019 ± 0.593
4.019AspLeu: 4.019 ± 0.612
2.25AspMet: 2.25 ± 0.381
2.813AspAsn: 2.813 ± 0.548
2.733AspPro: 2.733 ± 0.441
2.17AspGln: 2.17 ± 0.439
3.215AspArg: 3.215 ± 0.5
3.778AspSer: 3.778 ± 0.421
4.019AspThr: 4.019 ± 0.48
4.179AspVal: 4.179 ± 0.589
0.804AspTrp: 0.804 ± 0.312
2.25AspTyr: 2.25 ± 0.448
0.0AspXaa: 0.0 ± 0.0
Glu
7.877GluAla: 7.877 ± 0.984
0.723GluCys: 0.723 ± 0.289
4.501GluAsp: 4.501 ± 0.717
5.465GluGlu: 5.465 ± 0.95
2.652GluPhe: 2.652 ± 0.434
5.546GluGly: 5.546 ± 0.722
1.527GluHis: 1.527 ± 0.465
2.492GluIle: 2.492 ± 0.366
3.536GluLys: 3.536 ± 0.772
5.706GluLeu: 5.706 ± 0.779
1.607GluMet: 1.607 ± 0.454
2.17GluAsn: 2.17 ± 0.383
2.572GluPro: 2.572 ± 0.672
3.215GluGln: 3.215 ± 0.712
4.019GluArg: 4.019 ± 0.69
4.099GluSer: 4.099 ± 0.563
3.215GluThr: 3.215 ± 0.562
4.581GluVal: 4.581 ± 0.568
0.884GluTrp: 0.884 ± 0.251
3.054GluTyr: 3.054 ± 0.41
0.0GluXaa: 0.0 ± 0.0
Phe
2.492PheAla: 2.492 ± 0.395
0.563PheCys: 0.563 ± 0.214
2.652PheAsp: 2.652 ± 0.492
1.849PheGlu: 1.849 ± 0.311
1.045PhePhe: 1.045 ± 0.307
2.974PheGly: 2.974 ± 0.649
0.723PheHis: 0.723 ± 0.298
1.607PheIle: 1.607 ± 0.492
2.331PheLys: 2.331 ± 0.339
3.215PheLeu: 3.215 ± 0.536
1.045PheMet: 1.045 ± 0.269
2.17PheAsn: 2.17 ± 0.374
1.607PhePro: 1.607 ± 0.388
1.206PheGln: 1.206 ± 0.261
1.929PheArg: 1.929 ± 0.511
2.572PheSer: 2.572 ± 0.434
2.411PheThr: 2.411 ± 0.427
2.652PheVal: 2.652 ± 0.517
0.402PheTrp: 0.402 ± 0.156
1.045PheTyr: 1.045 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
7.314GlyAla: 7.314 ± 1.18
0.804GlyCys: 0.804 ± 0.275
5.305GlyAsp: 5.305 ± 0.584
5.465GlyGlu: 5.465 ± 0.77
2.813GlyPhe: 2.813 ± 0.335
6.189GlyGly: 6.189 ± 0.89
1.366GlyHis: 1.366 ± 0.34
4.581GlyIle: 4.581 ± 0.836
5.706GlyLys: 5.706 ± 0.798
6.832GlyLeu: 6.832 ± 0.869
2.009GlyMet: 2.009 ± 0.466
3.456GlyAsn: 3.456 ± 0.521
1.447GlyPro: 1.447 ± 0.429
2.652GlyGln: 2.652 ± 0.441
4.34GlyArg: 4.34 ± 0.341
5.706GlySer: 5.706 ± 0.731
4.903GlyThr: 4.903 ± 0.659
5.305GlyVal: 5.305 ± 0.82
1.768GlyTrp: 1.768 ± 0.389
2.813GlyTyr: 2.813 ± 0.484
0.0GlyXaa: 0.0 ± 0.0
His
1.366HisAla: 1.366 ± 0.291
0.402HisCys: 0.402 ± 0.161
1.206HisAsp: 1.206 ± 0.307
1.527HisGlu: 1.527 ± 0.414
0.643HisPhe: 0.643 ± 0.218
1.366HisGly: 1.366 ± 0.317
0.482HisHis: 0.482 ± 0.217
0.884HisIle: 0.884 ± 0.227
1.045HisLys: 1.045 ± 0.251
1.366HisLeu: 1.366 ± 0.359
0.643HisMet: 0.643 ± 0.216
0.321HisAsn: 0.321 ± 0.141
0.723HisPro: 0.723 ± 0.245
0.482HisGln: 0.482 ± 0.198
0.643HisArg: 0.643 ± 0.2
0.723HisSer: 0.723 ± 0.211
0.723HisThr: 0.723 ± 0.197
1.849HisVal: 1.849 ± 0.38
0.161HisTrp: 0.161 ± 0.101
0.964HisTyr: 0.964 ± 0.161
0.0HisXaa: 0.0 ± 0.0
Ile
4.421IleAla: 4.421 ± 0.488
0.643IleCys: 0.643 ± 0.219
3.536IleAsp: 3.536 ± 0.561
2.974IleGlu: 2.974 ± 0.497
0.964IlePhe: 0.964 ± 0.283
3.858IleGly: 3.858 ± 0.478
1.045IleHis: 1.045 ± 0.29
2.733IleIle: 2.733 ± 0.491
3.215IleLys: 3.215 ± 0.498
3.456IleLeu: 3.456 ± 0.346
1.125IleMet: 1.125 ± 0.308
2.411IleAsn: 2.411 ± 0.762
2.411IlePro: 2.411 ± 0.409
1.768IleGln: 1.768 ± 0.398
3.938IleArg: 3.938 ± 0.638
3.135IleSer: 3.135 ± 0.394
2.572IleThr: 2.572 ± 0.459
3.215IleVal: 3.215 ± 0.508
0.402IleTrp: 0.402 ± 0.182
1.688IleTyr: 1.688 ± 0.386
0.0IleXaa: 0.0 ± 0.0
Lys
7.635LysAla: 7.635 ± 0.933
0.643LysCys: 0.643 ± 0.233
3.295LysAsp: 3.295 ± 0.58
5.305LysGlu: 5.305 ± 0.572
2.492LysPhe: 2.492 ± 0.485
5.867LysGly: 5.867 ± 0.906
1.527LysHis: 1.527 ± 0.36
2.331LysIle: 2.331 ± 0.473
3.376LysLys: 3.376 ± 0.834
5.706LysLeu: 5.706 ± 0.725
1.849LysMet: 1.849 ± 0.397
2.572LysAsn: 2.572 ± 0.391
2.652LysPro: 2.652 ± 0.573
2.331LysGln: 2.331 ± 0.46
3.135LysArg: 3.135 ± 0.523
3.215LysSer: 3.215 ± 0.474
2.974LysThr: 2.974 ± 0.382
6.028LysVal: 6.028 ± 0.802
0.723LysTrp: 0.723 ± 0.27
1.607LysTyr: 1.607 ± 0.408
0.0LysXaa: 0.0 ± 0.0
Leu
7.635LeuAla: 7.635 ± 0.997
0.402LeuCys: 0.402 ± 0.183
4.742LeuAsp: 4.742 ± 0.561
6.751LeuGlu: 6.751 ± 0.987
2.652LeuPhe: 2.652 ± 0.407
5.144LeuGly: 5.144 ± 0.647
1.125LeuHis: 1.125 ± 0.282
4.179LeuIle: 4.179 ± 0.618
6.832LeuLys: 6.832 ± 0.804
6.108LeuLeu: 6.108 ± 0.867
2.411LeuMet: 2.411 ± 0.359
3.697LeuAsn: 3.697 ± 0.463
3.295LeuPro: 3.295 ± 0.558
3.376LeuGln: 3.376 ± 0.442
5.063LeuArg: 5.063 ± 0.596
4.581LeuSer: 4.581 ± 0.525
5.144LeuThr: 5.144 ± 0.786
5.224LeuVal: 5.224 ± 0.646
1.527LeuTrp: 1.527 ± 0.36
2.572LeuTyr: 2.572 ± 0.491
0.0LeuXaa: 0.0 ± 0.0
Met
3.215MetAla: 3.215 ± 0.44
0.161MetCys: 0.161 ± 0.104
1.929MetAsp: 1.929 ± 0.354
1.125MetGlu: 1.125 ± 0.295
1.286MetPhe: 1.286 ± 0.333
1.768MetGly: 1.768 ± 0.374
0.402MetHis: 0.402 ± 0.189
1.206MetIle: 1.206 ± 0.294
1.125MetLys: 1.125 ± 0.256
2.893MetLeu: 2.893 ± 0.534
0.643MetMet: 0.643 ± 0.242
0.884MetAsn: 0.884 ± 0.215
0.884MetPro: 0.884 ± 0.251
2.009MetGln: 2.009 ± 0.461
1.045MetArg: 1.045 ± 0.255
1.607MetSer: 1.607 ± 0.364
1.929MetThr: 1.929 ± 0.456
1.527MetVal: 1.527 ± 0.381
0.08MetTrp: 0.08 ± 0.084
0.723MetTyr: 0.723 ± 0.21
0.0MetXaa: 0.0 ± 0.0
Asn
3.376AsnAla: 3.376 ± 0.631
0.643AsnCys: 0.643 ± 0.209
1.929AsnAsp: 1.929 ± 0.364
3.054AsnGlu: 3.054 ± 0.538
1.527AsnPhe: 1.527 ± 0.41
4.581AsnGly: 4.581 ± 0.69
0.402AsnHis: 0.402 ± 0.151
2.733AsnIle: 2.733 ± 0.533
2.25AsnLys: 2.25 ± 0.333
3.215AsnLeu: 3.215 ± 0.372
0.884AsnMet: 0.884 ± 0.285
1.768AsnAsn: 1.768 ± 0.33
2.25AsnPro: 2.25 ± 0.371
1.447AsnGln: 1.447 ± 0.279
1.929AsnArg: 1.929 ± 0.491
2.893AsnSer: 2.893 ± 0.516
2.572AsnThr: 2.572 ± 0.396
2.974AsnVal: 2.974 ± 0.448
0.884AsnTrp: 0.884 ± 0.294
1.849AsnTyr: 1.849 ± 0.46
0.0AsnXaa: 0.0 ± 0.0
Pro
3.215ProAla: 3.215 ± 0.45
0.482ProCys: 0.482 ± 0.207
2.09ProAsp: 2.09 ± 0.338
4.179ProGlu: 4.179 ± 0.639
1.366ProPhe: 1.366 ± 0.3
2.652ProGly: 2.652 ± 0.375
0.402ProHis: 0.402 ± 0.165
1.125ProIle: 1.125 ± 0.316
2.572ProLys: 2.572 ± 0.434
2.733ProLeu: 2.733 ± 0.373
0.884ProMet: 0.884 ± 0.236
1.929ProAsn: 1.929 ± 0.446
0.884ProPro: 0.884 ± 0.321
1.286ProGln: 1.286 ± 0.24
1.607ProArg: 1.607 ± 0.384
2.25ProSer: 2.25 ± 0.33
1.849ProThr: 1.849 ± 0.409
2.733ProVal: 2.733 ± 0.316
0.723ProTrp: 0.723 ± 0.195
1.768ProTyr: 1.768 ± 0.451
0.0ProXaa: 0.0 ± 0.0
Gln
3.617GlnAla: 3.617 ± 0.586
0.08GlnCys: 0.08 ± 0.074
2.411GlnAsp: 2.411 ± 0.249
3.054GlnGlu: 3.054 ± 0.403
1.607GlnPhe: 1.607 ± 0.289
2.893GlnGly: 2.893 ± 0.431
0.563GlnHis: 0.563 ± 0.221
1.849GlnIle: 1.849 ± 0.37
2.974GlnLys: 2.974 ± 0.473
4.019GlnLeu: 4.019 ± 0.454
1.286GlnMet: 1.286 ± 0.369
1.527GlnAsn: 1.527 ± 0.3
1.447GlnPro: 1.447 ± 0.224
3.456GlnGln: 3.456 ± 0.605
1.929GlnArg: 1.929 ± 0.494
2.492GlnSer: 2.492 ± 0.468
1.849GlnThr: 1.849 ± 0.434
2.652GlnVal: 2.652 ± 0.418
0.723GlnTrp: 0.723 ± 0.209
1.607GlnTyr: 1.607 ± 0.515
0.0GlnXaa: 0.0 ± 0.0
Arg
5.144ArgAla: 5.144 ± 0.752
0.723ArgCys: 0.723 ± 0.285
3.295ArgAsp: 3.295 ± 0.438
3.778ArgGlu: 3.778 ± 0.447
2.009ArgPhe: 2.009 ± 0.421
3.938ArgGly: 3.938 ± 0.587
0.964ArgHis: 0.964 ± 0.31
3.054ArgIle: 3.054 ± 0.474
3.617ArgLys: 3.617 ± 0.534
4.662ArgLeu: 4.662 ± 0.635
1.286ArgMet: 1.286 ± 0.238
2.572ArgAsn: 2.572 ± 0.374
2.17ArgPro: 2.17 ± 0.356
2.572ArgGln: 2.572 ± 0.458
2.813ArgArg: 2.813 ± 0.485
3.536ArgSer: 3.536 ± 0.45
2.974ArgThr: 2.974 ± 0.504
3.054ArgVal: 3.054 ± 0.471
1.045ArgTrp: 1.045 ± 0.357
1.206ArgTyr: 1.206 ± 0.253
0.0ArgXaa: 0.0 ± 0.0
Ser
4.501SerAla: 4.501 ± 0.615
0.723SerCys: 0.723 ± 0.201
4.581SerAsp: 4.581 ± 0.545
4.019SerGlu: 4.019 ± 0.537
3.054SerPhe: 3.054 ± 0.46
4.983SerGly: 4.983 ± 0.708
1.366SerHis: 1.366 ± 0.328
2.813SerIle: 2.813 ± 0.424
3.778SerLys: 3.778 ± 0.493
4.662SerLeu: 4.662 ± 0.814
1.206SerMet: 1.206 ± 0.325
2.25SerAsn: 2.25 ± 0.485
1.768SerPro: 1.768 ± 0.408
3.054SerGln: 3.054 ± 0.418
3.135SerArg: 3.135 ± 0.521
2.572SerSer: 2.572 ± 0.438
3.858SerThr: 3.858 ± 0.639
4.742SerVal: 4.742 ± 0.56
0.723SerTrp: 0.723 ± 0.199
2.25SerTyr: 2.25 ± 0.46
0.0SerXaa: 0.0 ± 0.0
Thr
4.822ThrAla: 4.822 ± 0.754
0.723ThrCys: 0.723 ± 0.295
3.295ThrAsp: 3.295 ± 0.474
2.974ThrGlu: 2.974 ± 0.418
2.331ThrPhe: 2.331 ± 0.452
5.546ThrGly: 5.546 ± 0.707
1.125ThrHis: 1.125 ± 0.241
4.179ThrIle: 4.179 ± 0.629
4.26ThrLys: 4.26 ± 0.522
4.983ThrLeu: 4.983 ± 0.671
1.366ThrMet: 1.366 ± 0.329
1.607ThrAsn: 1.607 ± 0.39
2.733ThrPro: 2.733 ± 0.473
2.331ThrGln: 2.331 ± 0.425
2.17ThrArg: 2.17 ± 0.331
3.697ThrSer: 3.697 ± 0.573
3.054ThrThr: 3.054 ± 0.715
3.617ThrVal: 3.617 ± 0.606
0.563ThrTrp: 0.563 ± 0.194
1.447ThrTyr: 1.447 ± 0.3
0.0ThrXaa: 0.0 ± 0.0
Val
5.546ValAla: 5.546 ± 0.674
0.804ValCys: 0.804 ± 0.276
3.456ValAsp: 3.456 ± 0.437
4.26ValGlu: 4.26 ± 0.621
2.09ValPhe: 2.09 ± 0.534
5.626ValGly: 5.626 ± 0.641
1.125ValHis: 1.125 ± 0.332
3.938ValIle: 3.938 ± 0.607
4.421ValLys: 4.421 ± 0.496
6.349ValLeu: 6.349 ± 0.782
1.286ValMet: 1.286 ± 0.28
3.536ValAsn: 3.536 ± 0.697
2.331ValPro: 2.331 ± 0.48
2.25ValGln: 2.25 ± 0.374
4.179ValArg: 4.179 ± 0.514
4.501ValSer: 4.501 ± 0.775
5.385ValThr: 5.385 ± 0.652
4.099ValVal: 4.099 ± 0.655
0.643ValTrp: 0.643 ± 0.302
2.492ValTyr: 2.492 ± 0.428
0.0ValXaa: 0.0 ± 0.0
Trp
0.563TrpAla: 0.563 ± 0.168
0.402TrpCys: 0.402 ± 0.19
0.563TrpAsp: 0.563 ± 0.235
1.286TrpGlu: 1.286 ± 0.254
0.402TrpPhe: 0.402 ± 0.158
0.723TrpGly: 0.723 ± 0.234
0.402TrpHis: 0.402 ± 0.206
0.563TrpIle: 0.563 ± 0.268
1.366TrpLys: 1.366 ± 0.37
1.607TrpLeu: 1.607 ± 0.479
0.482TrpMet: 0.482 ± 0.162
0.723TrpAsn: 0.723 ± 0.267
0.241TrpPro: 0.241 ± 0.125
0.643TrpGln: 0.643 ± 0.203
0.884TrpArg: 0.884 ± 0.227
1.045TrpSer: 1.045 ± 0.352
0.643TrpThr: 0.643 ± 0.205
1.206TrpVal: 1.206 ± 0.379
0.241TrpTrp: 0.241 ± 0.134
0.161TrpTyr: 0.161 ± 0.115
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.09TyrAla: 2.09 ± 0.48
0.321TyrCys: 0.321 ± 0.167
2.733TyrAsp: 2.733 ± 0.489
2.25TyrGlu: 2.25 ± 0.48
1.125TyrPhe: 1.125 ± 0.251
2.893TyrGly: 2.893 ± 0.428
0.643TyrHis: 0.643 ± 0.328
1.607TyrIle: 1.607 ± 0.406
2.09TyrLys: 2.09 ± 0.296
2.17TyrLeu: 2.17 ± 0.34
1.286TyrMet: 1.286 ± 0.343
2.09TyrAsn: 2.09 ± 0.453
1.206TyrPro: 1.206 ± 0.311
1.527TyrGln: 1.527 ± 0.45
2.572TyrArg: 2.572 ± 0.426
1.527TyrSer: 1.527 ± 0.395
2.09TyrThr: 2.09 ± 0.442
2.411TyrVal: 2.411 ± 0.482
0.482TyrTrp: 0.482 ± 0.188
0.723TyrTyr: 0.723 ± 0.337
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (12443 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski