Amino acid dipepetide frequency for Klebsiella phage ST147-VIM1phi7.2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.027AlaAla: 11.027 ± 1.467
0.677AlaCys: 0.677 ± 0.269
5.417AlaAsp: 5.417 ± 0.829
5.997AlaGlu: 5.997 ± 0.742
3.676AlaPhe: 3.676 ± 0.608
7.545AlaGly: 7.545 ± 0.681
1.257AlaHis: 1.257 ± 0.349
3.966AlaIle: 3.966 ± 0.698
5.417AlaLys: 5.417 ± 0.787
11.221AlaLeu: 11.221 ± 1.334
3.386AlaMet: 3.386 ± 0.642
3.966AlaAsn: 3.966 ± 0.801
3.192AlaPro: 3.192 ± 0.605
4.159AlaGln: 4.159 ± 0.749
5.804AlaArg: 5.804 ± 0.997
7.545AlaSer: 7.545 ± 1.111
5.707AlaThr: 5.707 ± 0.881
5.03AlaVal: 5.03 ± 0.56
1.064AlaTrp: 1.064 ± 0.274
2.128AlaTyr: 2.128 ± 0.347
0.0AlaXaa: 0.0 ± 0.0
Cys
0.58CysAla: 0.58 ± 0.213
0.097CysCys: 0.097 ± 0.099
0.871CysAsp: 0.871 ± 0.258
0.677CysGlu: 0.677 ± 0.252
0.58CysPhe: 0.58 ± 0.209
0.58CysGly: 0.58 ± 0.221
0.387CysHis: 0.387 ± 0.167
0.193CysIle: 0.193 ± 0.135
0.677CysLys: 0.677 ± 0.277
0.967CysLeu: 0.967 ± 0.353
0.0CysMet: 0.0 ± 0.0
0.58CysAsn: 0.58 ± 0.234
0.484CysPro: 0.484 ± 0.219
0.58CysGln: 0.58 ± 0.252
0.967CysArg: 0.967 ± 0.321
0.387CysSer: 0.387 ± 0.19
0.387CysThr: 0.387 ± 0.238
0.774CysVal: 0.774 ± 0.308
0.58CysTrp: 0.58 ± 0.242
0.484CysTyr: 0.484 ± 0.255
0.0CysXaa: 0.0 ± 0.0
Asp
4.74AspAla: 4.74 ± 0.663
0.29AspCys: 0.29 ± 0.185
4.643AspAsp: 4.643 ± 0.896
3.966AspGlu: 3.966 ± 0.469
2.515AspPhe: 2.515 ± 0.57
4.643AspGly: 4.643 ± 0.663
0.58AspHis: 0.58 ± 0.269
4.256AspIle: 4.256 ± 0.607
2.225AspLys: 2.225 ± 0.309
5.32AspLeu: 5.32 ± 0.591
0.58AspMet: 0.58 ± 0.192
2.225AspAsn: 2.225 ± 0.364
2.128AspPro: 2.128 ± 0.493
2.225AspGln: 2.225 ± 0.606
2.612AspArg: 2.612 ± 0.399
3.579AspSer: 3.579 ± 0.661
3.095AspThr: 3.095 ± 0.624
4.159AspVal: 4.159 ± 0.781
0.967AspTrp: 0.967 ± 0.301
2.031AspTyr: 2.031 ± 0.493
0.0AspXaa: 0.0 ± 0.0
Glu
6.094GluAla: 6.094 ± 0.809
0.58GluCys: 0.58 ± 0.236
2.225GluAsp: 2.225 ± 0.498
3.676GluGlu: 3.676 ± 0.672
1.548GluPhe: 1.548 ± 0.366
3.966GluGly: 3.966 ± 0.665
1.354GluHis: 1.354 ± 0.302
4.256GluIle: 4.256 ± 0.574
3.869GluLys: 3.869 ± 0.582
6.771GluLeu: 6.771 ± 0.896
1.644GluMet: 1.644 ± 0.301
3.289GluAsn: 3.289 ± 0.551
2.999GluPro: 2.999 ± 0.582
3.482GluGln: 3.482 ± 0.639
4.063GluArg: 4.063 ± 0.566
3.869GluSer: 3.869 ± 0.713
3.289GluThr: 3.289 ± 0.62
3.772GluVal: 3.772 ± 0.563
1.644GluTrp: 1.644 ± 0.37
1.644GluTyr: 1.644 ± 0.397
0.0GluXaa: 0.0 ± 0.0
Phe
2.612PheAla: 2.612 ± 0.399
0.677PheCys: 0.677 ± 0.217
1.935PheAsp: 1.935 ± 0.558
1.741PheGlu: 1.741 ± 0.412
1.354PhePhe: 1.354 ± 0.58
2.418PheGly: 2.418 ± 0.556
0.387PheHis: 0.387 ± 0.221
1.741PheIle: 1.741 ± 0.449
1.548PheLys: 1.548 ± 0.36
2.612PheLeu: 2.612 ± 0.42
0.967PheMet: 0.967 ± 0.27
1.935PheAsn: 1.935 ± 0.476
0.967PhePro: 0.967 ± 0.275
0.967PheGln: 0.967 ± 0.308
2.902PheArg: 2.902 ± 0.595
3.386PheSer: 3.386 ± 0.577
2.515PheThr: 2.515 ± 0.482
1.644PheVal: 1.644 ± 0.394
0.58PheTrp: 0.58 ± 0.194
1.064PheTyr: 1.064 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
5.997GlyAla: 5.997 ± 1.401
0.871GlyCys: 0.871 ± 0.31
3.869GlyAsp: 3.869 ± 0.925
3.869GlyGlu: 3.869 ± 0.599
2.515GlyPhe: 2.515 ± 0.466
6.578GlyGly: 6.578 ± 1.105
1.451GlyHis: 1.451 ± 0.359
3.579GlyIle: 3.579 ± 0.69
4.74GlyLys: 4.74 ± 0.866
6.868GlyLeu: 6.868 ± 0.645
2.128GlyMet: 2.128 ± 0.379
2.418GlyAsn: 2.418 ± 0.452
1.548GlyPro: 1.548 ± 0.347
2.612GlyGln: 2.612 ± 0.425
5.127GlyArg: 5.127 ± 0.572
4.74GlySer: 4.74 ± 0.513
3.772GlyThr: 3.772 ± 0.822
5.901GlyVal: 5.901 ± 0.749
1.257GlyTrp: 1.257 ± 0.313
2.418GlyTyr: 2.418 ± 0.422
0.0GlyXaa: 0.0 ± 0.0
His
0.774HisAla: 0.774 ± 0.255
0.29HisCys: 0.29 ± 0.15
1.064HisAsp: 1.064 ± 0.309
0.967HisGlu: 0.967 ± 0.372
0.484HisPhe: 0.484 ± 0.175
1.548HisGly: 1.548 ± 0.395
0.58HisHis: 0.58 ± 0.288
1.161HisIle: 1.161 ± 0.34
0.58HisLys: 0.58 ± 0.249
2.612HisLeu: 2.612 ± 0.48
0.097HisMet: 0.097 ± 0.087
0.774HisAsn: 0.774 ± 0.3
0.677HisPro: 0.677 ± 0.271
1.257HisGln: 1.257 ± 0.42
0.774HisArg: 0.774 ± 0.231
0.774HisSer: 0.774 ± 0.285
0.774HisThr: 0.774 ± 0.281
1.257HisVal: 1.257 ± 0.331
0.387HisTrp: 0.387 ± 0.201
0.387HisTyr: 0.387 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
5.03IleAla: 5.03 ± 0.648
0.774IleCys: 0.774 ± 0.291
3.676IleAsp: 3.676 ± 0.53
3.386IleGlu: 3.386 ± 0.505
1.741IlePhe: 1.741 ± 0.437
2.805IleGly: 2.805 ± 0.505
0.967IleHis: 0.967 ± 0.399
2.708IleIle: 2.708 ± 0.443
2.322IleLys: 2.322 ± 0.479
2.999IleLeu: 2.999 ± 0.566
0.871IleMet: 0.871 ± 0.26
2.902IleAsn: 2.902 ± 0.493
2.322IlePro: 2.322 ± 0.507
1.451IleGln: 1.451 ± 0.441
3.386IleArg: 3.386 ± 0.574
5.03IleSer: 5.03 ± 0.709
3.192IleThr: 3.192 ± 0.509
2.902IleVal: 2.902 ± 0.631
0.677IleTrp: 0.677 ± 0.236
2.031IleTyr: 2.031 ± 0.469
0.0IleXaa: 0.0 ± 0.0
Lys
5.223LysAla: 5.223 ± 0.608
0.29LysCys: 0.29 ± 0.237
2.515LysAsp: 2.515 ± 0.489
3.966LysGlu: 3.966 ± 0.682
1.257LysPhe: 1.257 ± 0.33
2.805LysGly: 2.805 ± 0.599
0.677LysHis: 0.677 ± 0.28
1.548LysIle: 1.548 ± 0.461
3.676LysLys: 3.676 ± 0.943
4.933LysLeu: 4.933 ± 0.8
1.741LysMet: 1.741 ± 0.467
2.031LysAsn: 2.031 ± 0.437
2.515LysPro: 2.515 ± 0.418
2.418LysGln: 2.418 ± 0.614
3.095LysArg: 3.095 ± 0.526
3.482LysSer: 3.482 ± 0.842
4.353LysThr: 4.353 ± 0.689
3.772LysVal: 3.772 ± 0.68
0.774LysTrp: 0.774 ± 0.357
1.741LysTyr: 1.741 ± 0.323
0.0LysXaa: 0.0 ± 0.0
Leu
10.64LeuAla: 10.64 ± 1.128
1.644LeuCys: 1.644 ± 0.465
5.514LeuAsp: 5.514 ± 0.482
5.707LeuGlu: 5.707 ± 0.73
3.482LeuPhe: 3.482 ± 0.557
5.127LeuGly: 5.127 ± 0.888
1.548LeuHis: 1.548 ± 0.547
4.546LeuIle: 4.546 ± 0.865
5.804LeuLys: 5.804 ± 0.711
7.642LeuLeu: 7.642 ± 1.035
2.708LeuMet: 2.708 ± 0.555
4.353LeuAsn: 4.353 ± 0.629
3.579LeuPro: 3.579 ± 0.461
3.579LeuGln: 3.579 ± 0.692
6.191LeuArg: 6.191 ± 0.967
5.03LeuSer: 5.03 ± 0.656
6.674LeuThr: 6.674 ± 0.968
4.837LeuVal: 4.837 ± 0.728
0.677LeuTrp: 0.677 ± 0.247
2.418LeuTyr: 2.418 ± 0.592
0.0LeuXaa: 0.0 ± 0.0
Met
3.289MetAla: 3.289 ± 0.622
0.387MetCys: 0.387 ± 0.21
0.58MetAsp: 0.58 ± 0.211
0.871MetGlu: 0.871 ± 0.255
1.354MetPhe: 1.354 ± 0.355
1.354MetGly: 1.354 ± 0.353
0.29MetHis: 0.29 ± 0.155
1.161MetIle: 1.161 ± 0.273
1.548MetLys: 1.548 ± 0.347
2.515MetLeu: 2.515 ± 0.552
0.677MetMet: 0.677 ± 0.271
0.774MetAsn: 0.774 ± 0.273
1.644MetPro: 1.644 ± 0.351
1.257MetGln: 1.257 ± 0.377
1.548MetArg: 1.548 ± 0.321
1.838MetSer: 1.838 ± 0.344
2.031MetThr: 2.031 ± 0.542
1.257MetVal: 1.257 ± 0.52
0.193MetTrp: 0.193 ± 0.13
0.387MetTyr: 0.387 ± 0.184
0.0MetXaa: 0.0 ± 0.0
Asn
4.643AsnAla: 4.643 ± 0.944
0.484AsnCys: 0.484 ± 0.174
2.128AsnAsp: 2.128 ± 0.468
2.515AsnGlu: 2.515 ± 0.468
1.161AsnPhe: 1.161 ± 0.342
4.063AsnGly: 4.063 ± 0.707
0.58AsnHis: 0.58 ± 0.204
2.805AsnIle: 2.805 ± 0.591
1.741AsnLys: 1.741 ± 0.379
2.805AsnLeu: 2.805 ± 0.589
0.677AsnMet: 0.677 ± 0.275
2.418AsnAsn: 2.418 ± 0.461
2.612AsnPro: 2.612 ± 0.492
1.548AsnGln: 1.548 ± 0.328
1.935AsnArg: 1.935 ± 0.398
2.515AsnSer: 2.515 ± 0.525
1.741AsnThr: 1.741 ± 0.4
2.322AsnVal: 2.322 ± 0.443
0.29AsnTrp: 0.29 ± 0.169
1.161AsnTyr: 1.161 ± 0.397
0.0AsnXaa: 0.0 ± 0.0
Pro
4.256ProAla: 4.256 ± 0.611
0.29ProCys: 0.29 ± 0.157
2.902ProAsp: 2.902 ± 0.477
4.353ProGlu: 4.353 ± 0.899
1.741ProPhe: 1.741 ± 0.376
3.386ProGly: 3.386 ± 0.713
0.967ProHis: 0.967 ± 0.369
1.644ProIle: 1.644 ± 0.464
1.548ProLys: 1.548 ± 0.374
3.966ProLeu: 3.966 ± 0.891
0.58ProMet: 0.58 ± 0.261
0.484ProAsn: 0.484 ± 0.177
1.741ProPro: 1.741 ± 0.476
1.935ProGln: 1.935 ± 0.424
1.548ProArg: 1.548 ± 0.308
3.482ProSer: 3.482 ± 0.785
1.935ProThr: 1.935 ± 0.423
3.289ProVal: 3.289 ± 0.697
0.29ProTrp: 0.29 ± 0.214
1.548ProTyr: 1.548 ± 0.357
0.0ProXaa: 0.0 ± 0.0
Gln
5.127GlnAla: 5.127 ± 0.806
0.484GlnCys: 0.484 ± 0.152
1.741GlnAsp: 1.741 ± 0.53
2.612GlnGlu: 2.612 ± 0.648
1.257GlnPhe: 1.257 ± 0.307
1.935GlnGly: 1.935 ± 0.401
0.967GlnHis: 0.967 ± 0.329
1.838GlnIle: 1.838 ± 0.409
1.935GlnLys: 1.935 ± 0.45
4.546GlnLeu: 4.546 ± 0.785
1.451GlnMet: 1.451 ± 0.337
1.838GlnAsn: 1.838 ± 0.597
1.644GlnPro: 1.644 ± 0.418
2.031GlnGln: 2.031 ± 0.631
2.612GlnArg: 2.612 ± 0.408
2.515GlnSer: 2.515 ± 0.581
2.805GlnThr: 2.805 ± 0.569
3.289GlnVal: 3.289 ± 0.587
0.484GlnTrp: 0.484 ± 0.185
1.354GlnTyr: 1.354 ± 0.309
0.0GlnXaa: 0.0 ± 0.0
Arg
5.03ArgAla: 5.03 ± 0.779
0.774ArgCys: 0.774 ± 0.295
3.192ArgAsp: 3.192 ± 0.577
4.159ArgGlu: 4.159 ± 0.655
1.935ArgPhe: 1.935 ± 0.448
3.966ArgGly: 3.966 ± 0.505
1.935ArgHis: 1.935 ± 0.441
2.999ArgIle: 2.999 ± 0.59
3.192ArgLys: 3.192 ± 0.592
6.965ArgLeu: 6.965 ± 0.962
1.935ArgMet: 1.935 ± 0.428
1.935ArgAsn: 1.935 ± 0.41
2.128ArgPro: 2.128 ± 0.543
2.805ArgGln: 2.805 ± 0.699
5.997ArgArg: 5.997 ± 0.836
2.515ArgSer: 2.515 ± 0.497
3.192ArgThr: 3.192 ± 0.593
3.966ArgVal: 3.966 ± 0.674
1.644ArgTrp: 1.644 ± 0.434
1.838ArgTyr: 1.838 ± 0.598
0.0ArgXaa: 0.0 ± 0.0
Ser
7.448SerAla: 7.448 ± 1.045
0.967SerCys: 0.967 ± 0.373
4.159SerAsp: 4.159 ± 0.438
4.45SerGlu: 4.45 ± 0.63
2.708SerPhe: 2.708 ± 0.434
6.868SerGly: 6.868 ± 1.369
0.58SerHis: 0.58 ± 0.213
2.805SerIle: 2.805 ± 0.461
2.612SerLys: 2.612 ± 0.425
5.03SerLeu: 5.03 ± 0.725
1.548SerMet: 1.548 ± 0.358
2.418SerAsn: 2.418 ± 0.54
2.805SerPro: 2.805 ± 0.53
2.708SerGln: 2.708 ± 0.692
3.772SerArg: 3.772 ± 0.655
5.03SerSer: 5.03 ± 0.978
4.063SerThr: 4.063 ± 0.651
4.353SerVal: 4.353 ± 0.515
1.161SerTrp: 1.161 ± 0.402
1.935SerTyr: 1.935 ± 0.483
0.0SerXaa: 0.0 ± 0.0
Thr
6.384ThrAla: 6.384 ± 1.082
0.29ThrCys: 0.29 ± 0.156
3.095ThrAsp: 3.095 ± 0.486
4.45ThrGlu: 4.45 ± 0.688
1.548ThrPhe: 1.548 ± 0.398
5.997ThrGly: 5.997 ± 0.9
0.871ThrHis: 0.871 ± 0.309
3.579ThrIle: 3.579 ± 0.58
3.289ThrLys: 3.289 ± 0.61
6.287ThrLeu: 6.287 ± 0.65
1.548ThrMet: 1.548 ± 0.365
1.935ThrAsn: 1.935 ± 0.478
2.999ThrPro: 2.999 ± 0.506
2.708ThrGln: 2.708 ± 0.431
2.805ThrArg: 2.805 ± 0.442
3.966ThrSer: 3.966 ± 0.668
3.482ThrThr: 3.482 ± 0.572
4.546ThrVal: 4.546 ± 0.831
1.161ThrTrp: 1.161 ± 0.47
1.644ThrTyr: 1.644 ± 0.555
0.0ThrXaa: 0.0 ± 0.0
Val
5.417ValAla: 5.417 ± 0.837
0.387ValCys: 0.387 ± 0.178
4.45ValAsp: 4.45 ± 0.646
4.353ValGlu: 4.353 ± 0.567
1.257ValPhe: 1.257 ± 0.357
2.999ValGly: 2.999 ± 0.452
1.064ValHis: 1.064 ± 0.34
4.45ValIle: 4.45 ± 0.746
4.063ValLys: 4.063 ± 0.714
4.45ValLeu: 4.45 ± 0.715
1.161ValMet: 1.161 ± 0.388
2.225ValAsn: 2.225 ± 0.396
3.966ValPro: 3.966 ± 0.646
2.225ValGln: 2.225 ± 0.453
3.386ValArg: 3.386 ± 0.591
4.74ValSer: 4.74 ± 0.642
5.997ValThr: 5.997 ± 0.721
4.933ValVal: 4.933 ± 0.8
0.871ValTrp: 0.871 ± 0.346
2.031ValTyr: 2.031 ± 0.367
0.0ValXaa: 0.0 ± 0.0
Trp
1.257TrpAla: 1.257 ± 0.295
0.29TrpCys: 0.29 ± 0.158
0.774TrpAsp: 0.774 ± 0.304
0.677TrpGlu: 0.677 ± 0.243
0.677TrpPhe: 0.677 ± 0.284
0.871TrpGly: 0.871 ± 0.316
0.29TrpHis: 0.29 ± 0.16
0.967TrpIle: 0.967 ± 0.285
0.58TrpLys: 0.58 ± 0.294
1.257TrpLeu: 1.257 ± 0.352
0.677TrpMet: 0.677 ± 0.287
0.871TrpAsn: 0.871 ± 0.302
0.387TrpPro: 0.387 ± 0.189
0.967TrpGln: 0.967 ± 0.307
1.257TrpArg: 1.257 ± 0.346
0.774TrpSer: 0.774 ± 0.322
1.257TrpThr: 1.257 ± 0.346
1.354TrpVal: 1.354 ± 0.265
0.193TrpTrp: 0.193 ± 0.126
0.29TrpTyr: 0.29 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.612TyrAla: 2.612 ± 0.407
0.29TyrCys: 0.29 ± 0.164
2.031TyrAsp: 2.031 ± 0.615
1.644TyrGlu: 1.644 ± 0.383
1.161TyrPhe: 1.161 ± 0.308
2.805TyrGly: 2.805 ± 0.505
0.484TyrHis: 0.484 ± 0.164
0.967TyrIle: 0.967 ± 0.295
1.451TyrLys: 1.451 ± 0.411
1.935TyrLeu: 1.935 ± 0.393
0.484TyrMet: 0.484 ± 0.277
0.774TyrAsn: 0.774 ± 0.232
1.548TyrPro: 1.548 ± 0.474
1.548TyrGln: 1.548 ± 0.426
2.322TyrArg: 2.322 ± 0.52
2.322TyrSer: 2.322 ± 0.51
2.322TyrThr: 2.322 ± 0.594
1.064TyrVal: 1.064 ± 0.281
0.774TyrTrp: 0.774 ± 0.249
0.871TyrTyr: 0.871 ± 0.243
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (10339 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski