Amino acid dipeptide frequency for Klebsiella phage vB_KpnS_FZ10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
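As an illustration, dipeptide frequencies can be counted with a few lines of Python. This is only a minimal sketch under stated assumptions, not the code used to build this page: the function name `dipeptide_per_mille`, the one-letter residue codes, the mapping of every non-standard letter to `X` (Xaa), and the normalisation by the total number of dipeptides are all assumptions, and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    # Report all 21 x 21 = 441 ordered pairs, including those never seen.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MKKAA", "AAXG"]
freqs = dipeptide_per_mille(toy)
print(len(freqs), round(freqs["AA"], 3))
```

The two toy sequences contain 7 dipeptides in total, 2 of which are AA, so AA comes out at 2/7 of 1000 ‰.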

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5 ‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
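The comparison against the uniform expectation can be sketched as follows. The threshold is simply 1000/400 = 2.5 ‰; whether the page colours cells with exactly this cut-off or also takes the error estimate into account is not stated, so the function `classify` is a hypothetical illustration. The three example values are copied from the table.

```python
N_STANDARD = 20
# Uniform expectation over 20 x 20 dipeptides: 2.5 per mille, i.e. 0.25 %.
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD ** 2


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Values in per mille, copied from the table.
examples = {"AlaAla": 9.674, "CysCys": 0.09, "GluMet": 2.687}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```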

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
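For example, the conversion for the AlaAla value from the table looks like this (a small sketch; the helper name `per_mille_to_fraction` is made up for illustration):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


ala_ala = per_mille_to_fraction(9.674)  # AlaAla, copied from the table
percent = ala_ala * 100                 # the same value as a percentage
print(ala_ala, percent)
```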

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.674 ± 1.212
AlaCys: 0.627 ± 0.229
AlaAsp: 5.285 ± 0.762
AlaGlu: 6.897 ± 0.829
AlaPhe: 2.866 ± 0.59
AlaGly: 6.897 ± 0.827
AlaHis: 1.075 ± 0.264
AlaIle: 5.643 ± 0.66
AlaLys: 7.166 ± 0.743
AlaLeu: 7.703 ± 0.896
AlaMet: 3.314 ± 0.376
AlaAsn: 3.135 ± 0.518
AlaPro: 2.418 ± 0.408
AlaGln: 3.404 ± 0.7
AlaArg: 4.3 ± 0.678
AlaSer: 6.27 ± 0.881
AlaThr: 4.479 ± 0.532
AlaVal: 5.374 ± 0.858
AlaTrp: 1.523 ± 0.351
AlaTyr: 2.687 ± 0.446
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.896 ± 0.275
CysCys: 0.09 ± 0.084
CysAsp: 1.702 ± 0.33
CysGlu: 0.896 ± 0.344
CysPhe: 0.179 ± 0.126
CysGly: 1.254 ± 0.378
CysHis: 0.269 ± 0.16
CysIle: 0.985 ± 0.227
CysLys: 0.896 ± 0.312
CysLeu: 0.537 ± 0.214
CysMet: 0.179 ± 0.126
CysAsn: 0.269 ± 0.125
CysPro: 0.806 ± 0.306
CysGln: 0.179 ± 0.118
CysArg: 1.075 ± 0.316
CysSer: 0.269 ± 0.15
CysThr: 0.806 ± 0.261
CysVal: 0.806 ± 0.262
CysTrp: 0.269 ± 0.168
CysTyr: 0.448 ± 0.196
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.539 ± 0.757
AspCys: 0.448 ± 0.173
AspAsp: 4.21 ± 0.763
AspGlu: 4.658 ± 0.576
AspPhe: 3.225 ± 0.542
AspGly: 6.718 ± 1.026
AspHis: 0.806 ± 0.226
AspIle: 4.3 ± 0.648
AspLys: 4.747 ± 0.834
AspLeu: 3.673 ± 0.493
AspMet: 1.164 ± 0.311
AspAsn: 2.329 ± 0.516
AspPro: 2.239 ± 0.368
AspGln: 1.523 ± 0.343
AspArg: 3.404 ± 0.679
AspSer: 4.479 ± 0.612
AspThr: 3.046 ± 0.473
AspVal: 3.762 ± 0.488
AspTrp: 0.896 ± 0.363
AspTyr: 2.239 ± 0.461
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.106 ± 0.716
GluCys: 0.896 ± 0.22
GluAsp: 3.493 ± 0.604
GluGlu: 4.12 ± 0.7
GluPhe: 3.762 ± 0.555
GluGly: 3.673 ± 0.484
GluHis: 0.717 ± 0.257
GluIle: 3.314 ± 0.414
GluLys: 3.493 ± 0.507
GluLeu: 4.747 ± 0.682
GluMet: 2.687 ± 0.383
GluAsn: 3.135 ± 0.46
GluPro: 2.239 ± 0.491
GluGln: 3.673 ± 0.635
GluArg: 3.046 ± 0.593
GluSer: 4.12 ± 0.509
GluThr: 2.418 ± 0.357
GluVal: 4.031 ± 0.698
GluTrp: 1.075 ± 0.236
GluTyr: 3.046 ± 0.543
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.687 ± 0.445
PheCys: 0.806 ± 0.271
PheAsp: 3.225 ± 0.583
PheGlu: 2.06 ± 0.369
PhePhe: 0.985 ± 0.27
PheGly: 4.3 ± 0.613
PheHis: 0.537 ± 0.186
PheIle: 3.046 ± 0.453
PheLys: 2.15 ± 0.365
PheLeu: 2.06 ± 0.388
PheMet: 1.075 ± 0.28
PheAsn: 2.239 ± 0.403
PhePro: 1.523 ± 0.333
PheGln: 1.791 ± 0.464
PheArg: 2.239 ± 0.437
PheSer: 2.687 ± 0.61
PheThr: 2.777 ± 0.411
PheVal: 2.239 ± 0.349
PheTrp: 0.896 ± 0.339
PheTyr: 1.433 ± 0.29
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.837 ± 0.716
GlyCys: 1.075 ± 0.344
GlyAsp: 4.479 ± 0.699
GlyGlu: 4.837 ± 0.641
GlyPhe: 3.225 ± 0.474
GlyGly: 8.151 ± 1.025
GlyHis: 1.164 ± 0.392
GlyIle: 4.479 ± 0.544
GlyLys: 6.449 ± 0.632
GlyLeu: 5.195 ± 0.574
GlyMet: 2.866 ± 0.449
GlyAsn: 4.658 ± 0.422
GlyPro: 1.254 ± 0.313
GlyGln: 2.239 ± 0.442
GlyArg: 4.747 ± 0.55
GlySer: 5.554 ± 0.644
GlyThr: 3.225 ± 0.521
GlyVal: 6.181 ± 0.669
GlyTrp: 0.985 ± 0.294
GlyTyr: 3.046 ± 0.467
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.985 ± 0.303
HisCys: 0.179 ± 0.108
HisAsp: 0.985 ± 0.243
HisGlu: 1.702 ± 0.415
HisPhe: 0.627 ± 0.193
HisGly: 1.254 ± 0.397
HisHis: 0.448 ± 0.163
HisIle: 0.627 ± 0.288
HisLys: 1.164 ± 0.368
HisLeu: 0.806 ± 0.258
HisMet: 0.0 ± 0.0
HisAsn: 0.537 ± 0.279
HisPro: 0.896 ± 0.342
HisGln: 0.448 ± 0.18
HisArg: 0.627 ± 0.237
HisSer: 0.985 ± 0.334
HisThr: 0.985 ± 0.245
HisVal: 1.254 ± 0.31
HisTrp: 0.358 ± 0.139
HisTyr: 0.627 ± 0.292
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.001 ± 0.811
IleCys: 1.254 ± 0.329
IleAsp: 5.285 ± 0.632
IleGlu: 3.314 ± 0.345
IlePhe: 2.15 ± 0.455
IleGly: 3.583 ± 0.494
IleHis: 1.164 ± 0.331
IleIle: 3.046 ± 0.471
IleLys: 4.21 ± 0.58
IleLeu: 3.314 ± 0.481
IleMet: 1.702 ± 0.426
IleAsn: 2.418 ± 0.576
IlePro: 2.777 ± 0.453
IleGln: 2.06 ± 0.401
IleArg: 2.956 ± 0.511
IleSer: 3.135 ± 0.449
IleThr: 5.016 ± 0.687
IleVal: 4.389 ± 0.692
IleTrp: 1.075 ± 0.301
IleTyr: 1.971 ± 0.38
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.643 ± 0.824
LysCys: 0.806 ± 0.318
LysAsp: 4.031 ± 0.586
LysGlu: 4.479 ± 0.533
LysPhe: 2.06 ± 0.28
LysGly: 4.12 ± 0.532
LysHis: 1.433 ± 0.37
LysIle: 4.479 ± 0.631
LysLys: 3.673 ± 0.597
LysLeu: 4.747 ± 0.56
LysMet: 3.225 ± 0.599
LysAsn: 2.956 ± 0.544
LysPro: 2.866 ± 0.504
LysGln: 2.687 ± 0.545
LysArg: 4.21 ± 0.712
LysSer: 3.135 ± 0.573
LysThr: 4.479 ± 0.787
LysVal: 4.747 ± 0.661
LysTrp: 1.344 ± 0.313
LysTyr: 1.881 ± 0.292
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.897 ± 0.787
LeuCys: 0.717 ± 0.24
LeuAsp: 3.225 ± 0.513
LeuGlu: 4.12 ± 0.632
LeuPhe: 1.881 ± 0.338
LeuGly: 5.106 ± 0.694
LeuHis: 0.985 ± 0.399
LeuIle: 3.852 ± 0.519
LeuLys: 5.106 ± 0.762
LeuLeu: 3.852 ± 0.453
LeuMet: 1.612 ± 0.386
LeuAsn: 3.404 ± 0.645
LeuPro: 2.956 ± 0.458
LeuGln: 2.418 ± 0.477
LeuArg: 2.866 ± 0.575
LeuSer: 4.568 ± 0.607
LeuThr: 4.479 ± 0.653
LeuVal: 3.852 ± 0.427
LeuTrp: 0.537 ± 0.16
LeuTyr: 2.508 ± 0.338
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.031 ± 0.624
MetCys: 0.09 ± 0.091
MetAsp: 1.702 ± 0.331
MetGlu: 1.254 ± 0.319
MetPhe: 1.254 ± 0.259
MetGly: 1.702 ± 0.493
MetHis: 0.806 ± 0.228
MetIle: 2.239 ± 0.369
MetLys: 2.15 ± 0.42
MetLeu: 2.239 ± 0.384
MetMet: 0.806 ± 0.303
MetAsn: 1.344 ± 0.345
MetPro: 0.717 ± 0.27
MetGln: 1.523 ± 0.365
MetArg: 2.329 ± 0.435
MetSer: 1.612 ± 0.32
MetThr: 1.881 ± 0.397
MetVal: 1.881 ± 0.33
MetTrp: 0.269 ± 0.137
MetTyr: 0.806 ± 0.268
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.031 ± 0.69
AsnCys: 0.358 ± 0.171
AsnAsp: 2.508 ± 0.347
AsnGlu: 3.135 ± 0.429
AsnPhe: 1.523 ± 0.352
AsnGly: 5.912 ± 0.797
AsnHis: 0.985 ± 0.303
AsnIle: 2.777 ± 0.499
AsnLys: 2.418 ± 0.371
AsnLeu: 2.777 ± 0.562
AsnMet: 1.523 ± 0.399
AsnAsn: 2.598 ± 0.558
AsnPro: 1.702 ± 0.388
AsnGln: 1.881 ± 0.499
AsnArg: 2.15 ± 0.365
AsnSer: 2.418 ± 0.405
AsnThr: 1.702 ± 0.495
AsnVal: 2.687 ± 0.537
AsnTrp: 0.896 ± 0.227
AsnTyr: 1.164 ± 0.352
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.15 ± 0.548
ProCys: 0.448 ± 0.192
ProAsp: 3.762 ± 0.728
ProGlu: 2.956 ± 0.589
ProPhe: 1.612 ± 0.299
ProGly: 1.971 ± 0.299
ProHis: 0.448 ± 0.175
ProIle: 2.06 ± 0.423
ProLys: 2.777 ± 0.45
ProLeu: 1.612 ± 0.393
ProMet: 0.627 ± 0.215
ProAsn: 1.344 ± 0.332
ProPro: 1.075 ± 0.287
ProGln: 1.702 ± 0.409
ProArg: 2.15 ± 0.485
ProSer: 1.881 ± 0.349
ProThr: 2.06 ± 0.418
ProVal: 2.956 ± 0.497
ProTrp: 0.627 ± 0.261
ProTyr: 1.344 ± 0.49
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.673 ± 0.56
GlnCys: 0.806 ± 0.263
GlnAsp: 2.598 ± 0.384
GlnGlu: 2.239 ± 0.533
GlnPhe: 0.985 ± 0.225
GlnGly: 1.612 ± 0.314
GlnHis: 0.448 ± 0.214
GlnIle: 3.314 ± 0.589
GlnLys: 2.418 ± 0.47
GlnLeu: 2.956 ± 0.609
GlnMet: 1.254 ± 0.482
GlnAsn: 1.791 ± 0.482
GlnPro: 1.433 ± 0.397
GlnGln: 2.418 ± 1.262
GlnArg: 2.15 ± 0.458
GlnSer: 2.687 ± 0.464
GlnThr: 1.791 ± 0.348
GlnVal: 3.135 ± 0.638
GlnTrp: 0.448 ± 0.163
GlnTyr: 1.791 ± 0.41
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.747 ± 0.559
ArgCys: 1.344 ± 0.464
ArgAsp: 2.777 ± 0.436
ArgGlu: 3.762 ± 0.589
ArgPhe: 3.225 ± 0.499
ArgGly: 3.493 ± 0.499
ArgHis: 0.806 ± 0.258
ArgIle: 2.15 ± 0.372
ArgLys: 4.12 ± 0.5
ArgLeu: 3.852 ± 0.635
ArgMet: 1.612 ± 0.388
ArgAsn: 1.971 ± 0.432
ArgPro: 1.702 ± 0.4
ArgGln: 1.881 ± 0.527
ArgArg: 3.673 ± 0.499
ArgSer: 2.866 ± 0.481
ArgThr: 1.791 ± 0.43
ArgVal: 4.21 ± 0.688
ArgTrp: 0.717 ± 0.201
ArgTyr: 2.15 ± 0.422
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.181 ± 0.987
SerCys: 0.448 ± 0.21
SerAsp: 4.389 ± 0.611
SerGlu: 4.479 ± 0.582
SerPhe: 2.866 ± 0.458
SerGly: 6.897 ± 0.843
SerHis: 0.806 ± 0.276
SerIle: 3.762 ± 0.658
SerLys: 3.941 ± 0.718
SerLeu: 3.583 ± 0.741
SerMet: 1.523 ± 0.419
SerAsn: 2.329 ± 0.31
SerPro: 2.15 ± 0.476
SerGln: 2.598 ± 0.444
SerArg: 2.777 ± 0.474
SerSer: 3.673 ± 0.544
SerThr: 2.598 ± 0.394
SerVal: 5.374 ± 0.616
SerTrp: 1.075 ± 0.258
SerTyr: 2.15 ± 0.38
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.658 ± 0.598
ThrCys: 0.717 ± 0.258
ThrAsp: 3.135 ± 0.518
ThrGlu: 2.06 ± 0.435
ThrPhe: 3.225 ± 0.456
ThrGly: 4.568 ± 0.621
ThrHis: 0.896 ± 0.239
ThrIle: 3.493 ± 0.579
ThrLys: 3.404 ± 0.693
ThrLeu: 4.031 ± 0.785
ThrMet: 1.791 ± 0.347
ThrAsn: 2.508 ± 0.424
ThrPro: 2.239 ± 0.476
ThrGln: 2.239 ± 0.588
ThrArg: 1.971 ± 0.332
ThrSer: 4.031 ± 0.789
ThrThr: 3.225 ± 0.581
ThrVal: 3.941 ± 0.542
ThrTrp: 0.717 ± 0.254
ThrTyr: 2.06 ± 0.353
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.808 ± 0.968
ValCys: 0.627 ± 0.271
ValAsp: 3.852 ± 0.643
ValGlu: 3.762 ± 0.593
ValPhe: 2.866 ± 0.507
ValGly: 4.12 ± 0.658
ValHis: 0.627 ± 0.228
ValIle: 4.568 ± 0.49
ValLys: 4.031 ± 0.602
ValLeu: 4.568 ± 0.55
ValMet: 2.239 ± 0.377
ValAsn: 4.031 ± 0.452
ValPro: 2.777 ± 0.528
ValGln: 2.598 ± 0.54
ValArg: 2.956 ± 0.453
ValSer: 5.643 ± 0.725
ValThr: 4.568 ± 0.677
ValVal: 5.106 ± 0.848
ValTrp: 1.075 ± 0.28
ValTyr: 1.791 ± 0.358
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.971 ± 0.329
TrpCys: 0.358 ± 0.182
TrpAsp: 0.806 ± 0.281
TrpGlu: 0.358 ± 0.277
TrpPhe: 0.717 ± 0.279
TrpGly: 1.075 ± 0.253
TrpHis: 0.537 ± 0.223
TrpIle: 0.717 ± 0.22
TrpLys: 1.075 ± 0.35
TrpLeu: 0.985 ± 0.263
TrpMet: 0.179 ± 0.121
TrpAsn: 0.448 ± 0.211
TrpPro: 0.448 ± 0.16
TrpGln: 0.717 ± 0.215
TrpArg: 1.254 ± 0.289
TrpSer: 1.612 ± 0.479
TrpThr: 0.806 ± 0.215
TrpVal: 0.896 ± 0.191
TrpTrp: 0.448 ± 0.186
TrpTyr: 0.537 ± 0.234
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.225 ± 0.528
TyrCys: 0.717 ± 0.257
TyrAsp: 2.956 ± 0.427
TyrGlu: 1.702 ± 0.329
TyrPhe: 1.523 ± 0.441
TyrGly: 2.239 ± 0.519
TyrHis: 0.537 ± 0.226
TyrIle: 1.971 ± 0.349
TyrLys: 1.612 ± 0.342
TyrLeu: 1.791 ± 0.349
TyrMet: 1.164 ± 0.277
TyrAsn: 1.702 ± 0.423
TyrPro: 1.344 ± 0.356
TyrGln: 1.971 ± 0.31
TyrArg: 1.971 ± 0.43
TyrSer: 1.971 ± 0.348
TyrThr: 2.777 ± 0.376
TyrVal: 1.791 ± 0.41
TyrTrp: 0.717 ± 0.254
TyrTyr: 0.717 ± 0.258
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 42 proteins (11165 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
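A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies; the page does not spell out either detail, so treat both as assumptions. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_frequency(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, one-letter codes.
toy = ["MKKAAL", "AAGGK", "MLLKA", "GAAKK"]
err = bootstrap_error(toy, "AA")
print(round(err, 3))
```

Resampling at the protein level, rather than resampling individual dipeptides, keeps the residues of each protein together, so the error reflects variation between proteins.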

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski