Amino acid dipepetide frequency for Enterococcus phage vB_EfaP_Zip

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.342AlaAla: 3.342 ± 1.357
0.334AlaCys: 0.334 ± 0.323
4.01AlaAsp: 4.01 ± 1.012
4.678AlaGlu: 4.678 ± 1.17
2.506AlaPhe: 2.506 ± 0.648
3.676AlaGly: 3.676 ± 1.073
0.501AlaHis: 0.501 ± 0.27
3.509AlaIle: 3.509 ± 0.892
4.845AlaLys: 4.845 ± 0.776
6.015AlaLeu: 6.015 ± 1.088
1.17AlaMet: 1.17 ± 0.363
5.347AlaAsn: 5.347 ± 1.099
2.172AlaPro: 2.172 ± 0.51
2.005AlaGln: 2.005 ± 0.609
2.673AlaArg: 2.673 ± 0.655
3.008AlaSer: 3.008 ± 0.514
3.175AlaThr: 3.175 ± 0.719
2.005AlaVal: 2.005 ± 0.59
1.337AlaTrp: 1.337 ± 0.471
5.013AlaTyr: 5.013 ± 0.884
0.0AlaXaa: 0.0 ± 0.0
Cys
0.501CysAla: 0.501 ± 0.485
0.167CysCys: 0.167 ± 0.162
0.334CysAsp: 0.334 ± 0.291
0.167CysGlu: 0.167 ± 0.152
0.501CysPhe: 0.501 ± 0.281
0.334CysGly: 0.334 ± 0.303
0.0CysHis: 0.0 ± 0.0
0.668CysIle: 0.668 ± 0.417
0.334CysLys: 0.334 ± 0.243
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.167CysPro: 0.167 ± 0.161
0.668CysGln: 0.668 ± 0.361
0.334CysArg: 0.334 ± 0.222
0.835CysSer: 0.835 ± 0.309
0.334CysThr: 0.334 ± 0.208
1.17CysVal: 1.17 ± 0.51
0.0CysTrp: 0.0 ± 0.0
0.334CysTyr: 0.334 ± 0.323
0.0CysXaa: 0.0 ± 0.0
Asp
3.175AspAla: 3.175 ± 0.612
0.167AspCys: 0.167 ± 0.186
2.84AspAsp: 2.84 ± 0.678
5.347AspGlu: 5.347 ± 0.879
3.843AspPhe: 3.843 ± 0.643
3.342AspGly: 3.342 ± 0.703
0.334AspHis: 0.334 ± 0.226
4.678AspIle: 4.678 ± 0.925
4.845AspLys: 4.845 ± 0.811
4.678AspLeu: 4.678 ± 0.633
1.17AspMet: 1.17 ± 0.446
5.347AspAsn: 5.347 ± 0.997
2.172AspPro: 2.172 ± 0.736
1.838AspGln: 1.838 ± 0.386
1.671AspArg: 1.671 ± 0.503
1.838AspSer: 1.838 ± 0.459
4.177AspThr: 4.177 ± 0.839
5.514AspVal: 5.514 ± 1.176
0.334AspTrp: 0.334 ± 0.245
2.506AspTyr: 2.506 ± 0.538
0.0AspXaa: 0.0 ± 0.0
Glu
3.342GluAla: 3.342 ± 0.825
0.167GluCys: 0.167 ± 0.152
4.01GluAsp: 4.01 ± 1.124
6.516GluGlu: 6.516 ± 1.73
3.676GluPhe: 3.676 ± 1.079
3.509GluGly: 3.509 ± 0.582
1.337GluHis: 1.337 ± 0.616
5.347GluIle: 5.347 ± 1.073
4.344GluLys: 4.344 ± 0.91
6.85GluLeu: 6.85 ± 1.693
2.005GluMet: 2.005 ± 0.526
4.678GluAsn: 4.678 ± 1.316
1.504GluPro: 1.504 ± 0.61
3.843GluGln: 3.843 ± 0.787
2.506GluArg: 2.506 ± 0.941
3.843GluSer: 3.843 ± 0.866
4.511GluThr: 4.511 ± 0.952
5.013GluVal: 5.013 ± 1.132
0.835GluTrp: 0.835 ± 0.305
3.509GluTyr: 3.509 ± 0.785
0.0GluXaa: 0.0 ± 0.0
Phe
2.506PheAla: 2.506 ± 0.614
0.668PheCys: 0.668 ± 0.325
3.843PheAsp: 3.843 ± 0.961
2.339PheGlu: 2.339 ± 0.737
1.337PhePhe: 1.337 ± 0.459
1.671PheGly: 1.671 ± 0.469
1.504PheHis: 1.504 ± 0.659
3.342PheIle: 3.342 ± 0.634
2.84PheLys: 2.84 ± 0.726
3.843PheLeu: 3.843 ± 0.986
1.337PheMet: 1.337 ± 0.429
3.676PheAsn: 3.676 ± 0.76
2.172PhePro: 2.172 ± 0.564
0.668PheGln: 0.668 ± 0.393
1.337PheArg: 1.337 ± 0.624
2.84PheSer: 2.84 ± 0.696
4.678PheThr: 4.678 ± 0.917
2.172PheVal: 2.172 ± 0.42
0.668PheTrp: 0.668 ± 0.427
2.339PheTyr: 2.339 ± 0.888
0.0PheXaa: 0.0 ± 0.0
Gly
3.175GlyAla: 3.175 ± 0.8
0.835GlyCys: 0.835 ± 0.419
2.506GlyAsp: 2.506 ± 0.556
3.008GlyGlu: 3.008 ± 0.673
3.843GlyPhe: 3.843 ± 0.963
4.344GlyGly: 4.344 ± 0.761
1.337GlyHis: 1.337 ± 0.691
3.843GlyIle: 3.843 ± 1.318
5.18GlyLys: 5.18 ± 1.155
4.01GlyLeu: 4.01 ± 0.673
1.838GlyMet: 1.838 ± 0.561
4.678GlyAsn: 4.678 ± 1.062
0.501GlyPro: 0.501 ± 0.269
2.673GlyGln: 2.673 ± 0.64
1.504GlyArg: 1.504 ± 0.444
5.013GlySer: 5.013 ± 1.557
3.843GlyThr: 3.843 ± 1.157
4.678GlyVal: 4.678 ± 0.625
0.501GlyTrp: 0.501 ± 0.351
2.84GlyTyr: 2.84 ± 0.64
0.0GlyXaa: 0.0 ± 0.0
His
1.003HisAla: 1.003 ± 0.531
0.167HisCys: 0.167 ± 0.152
1.17HisAsp: 1.17 ± 0.347
1.003HisGlu: 1.003 ± 0.362
1.671HisPhe: 1.671 ± 0.564
1.003HisGly: 1.003 ± 0.612
0.501HisHis: 0.501 ± 0.349
0.835HisIle: 0.835 ± 0.317
1.003HisLys: 1.003 ± 0.366
1.003HisLeu: 1.003 ± 0.415
0.501HisMet: 0.501 ± 0.27
1.337HisAsn: 1.337 ± 0.524
0.835HisPro: 0.835 ± 0.312
0.835HisGln: 0.835 ± 0.475
1.504HisArg: 1.504 ± 0.439
1.337HisSer: 1.337 ± 0.482
0.668HisThr: 0.668 ± 0.301
0.835HisVal: 0.835 ± 0.423
0.501HisTrp: 0.501 ± 0.268
2.005HisTyr: 2.005 ± 0.524
0.0HisXaa: 0.0 ± 0.0
Ile
4.678IleAla: 4.678 ± 0.717
0.501IleCys: 0.501 ± 0.375
6.349IleAsp: 6.349 ± 1.451
3.676IleGlu: 3.676 ± 0.633
1.671IlePhe: 1.671 ± 0.782
3.676IleGly: 3.676 ± 0.575
1.504IleHis: 1.504 ± 0.382
3.008IleIle: 3.008 ± 1.054
4.845IleLys: 4.845 ± 0.813
4.678IleLeu: 4.678 ± 0.715
1.003IleMet: 1.003 ± 0.36
3.342IleAsn: 3.342 ± 0.976
3.008IlePro: 3.008 ± 0.67
3.342IleGln: 3.342 ± 0.779
2.84IleArg: 2.84 ± 0.68
5.013IleSer: 5.013 ± 0.95
5.18IleThr: 5.18 ± 0.934
2.506IleVal: 2.506 ± 0.498
0.501IleTrp: 0.501 ± 0.351
3.509IleTyr: 3.509 ± 1.111
0.0IleXaa: 0.0 ± 0.0
Lys
3.509LysAla: 3.509 ± 0.769
0.501LysCys: 0.501 ± 0.265
3.676LysAsp: 3.676 ± 0.692
7.185LysGlu: 7.185 ± 1.266
3.509LysPhe: 3.509 ± 0.91
5.681LysGly: 5.681 ± 0.855
2.172LysHis: 2.172 ± 0.834
5.013LysIle: 5.013 ± 0.912
4.344LysLys: 4.344 ± 1.055
5.681LysLeu: 5.681 ± 1.067
1.17LysMet: 1.17 ± 0.485
4.01LysAsn: 4.01 ± 0.815
4.01LysPro: 4.01 ± 0.804
2.339LysGln: 2.339 ± 0.72
3.175LysArg: 3.175 ± 0.972
3.843LysSer: 3.843 ± 0.631
5.013LysThr: 5.013 ± 1.077
4.678LysVal: 4.678 ± 0.713
0.0LysTrp: 0.0 ± 0.0
4.344LysTyr: 4.344 ± 0.728
0.0LysXaa: 0.0 ± 0.0
Leu
6.182LeuAla: 6.182 ± 0.784
0.501LeuCys: 0.501 ± 0.261
4.01LeuAsp: 4.01 ± 1.101
5.013LeuGlu: 5.013 ± 1.033
2.84LeuPhe: 2.84 ± 0.747
6.015LeuGly: 6.015 ± 0.765
0.668LeuHis: 0.668 ± 0.279
4.511LeuIle: 4.511 ± 1.136
6.683LeuLys: 6.683 ± 1.417
4.678LeuLeu: 4.678 ± 0.711
1.671LeuMet: 1.671 ± 0.523
7.018LeuAsn: 7.018 ± 0.991
3.175LeuPro: 3.175 ± 0.628
4.177LeuGln: 4.177 ± 0.621
3.509LeuArg: 3.509 ± 0.775
4.511LeuSer: 4.511 ± 0.763
7.018LeuThr: 7.018 ± 0.966
3.843LeuVal: 3.843 ± 1.264
1.003LeuTrp: 1.003 ± 0.428
2.339LeuTyr: 2.339 ± 0.695
0.0LeuXaa: 0.0 ± 0.0
Met
2.005MetAla: 2.005 ± 0.549
0.167MetCys: 0.167 ± 0.157
1.17MetAsp: 1.17 ± 0.412
1.003MetGlu: 1.003 ± 0.563
1.17MetPhe: 1.17 ± 0.385
1.504MetGly: 1.504 ± 0.472
0.334MetHis: 0.334 ± 0.224
1.838MetIle: 1.838 ± 0.462
2.172MetLys: 2.172 ± 0.713
2.172MetLeu: 2.172 ± 0.753
0.334MetMet: 0.334 ± 0.2
3.008MetAsn: 3.008 ± 0.781
0.668MetPro: 0.668 ± 0.292
1.337MetGln: 1.337 ± 0.539
0.501MetArg: 0.501 ± 0.292
1.17MetSer: 1.17 ± 0.456
1.671MetThr: 1.671 ± 0.656
0.334MetVal: 0.334 ± 0.314
0.167MetTrp: 0.167 ± 0.184
0.501MetTyr: 0.501 ± 0.261
0.0MetXaa: 0.0 ± 0.0
Asn
5.18AsnAla: 5.18 ± 0.743
0.334AsnCys: 0.334 ± 0.213
3.342AsnAsp: 3.342 ± 0.699
4.344AsnGlu: 4.344 ± 0.915
3.008AsnPhe: 3.008 ± 0.69
5.347AsnGly: 5.347 ± 1.015
2.005AsnHis: 2.005 ± 0.465
5.347AsnIle: 5.347 ± 1.187
4.678AsnLys: 4.678 ± 1.126
6.182AsnLeu: 6.182 ± 0.882
1.504AsnMet: 1.504 ± 0.341
5.514AsnAsn: 5.514 ± 0.887
4.01AsnPro: 4.01 ± 1.006
2.339AsnGln: 2.339 ± 0.505
2.506AsnArg: 2.506 ± 0.546
4.845AsnSer: 4.845 ± 1.2
5.681AsnThr: 5.681 ± 0.766
2.506AsnVal: 2.506 ± 0.675
1.504AsnTrp: 1.504 ± 0.669
3.676AsnTyr: 3.676 ± 0.591
0.0AsnXaa: 0.0 ± 0.0
Pro
3.008ProAla: 3.008 ± 0.736
0.334ProCys: 0.334 ± 0.212
1.504ProAsp: 1.504 ± 0.545
3.342ProGlu: 3.342 ± 0.859
1.003ProPhe: 1.003 ± 0.335
1.337ProGly: 1.337 ± 0.45
0.668ProHis: 0.668 ± 0.307
2.172ProIle: 2.172 ± 0.548
3.342ProLys: 3.342 ± 0.668
2.005ProLeu: 2.005 ± 0.61
1.003ProMet: 1.003 ± 0.445
3.008ProAsn: 3.008 ± 0.65
0.835ProPro: 0.835 ± 0.418
1.337ProGln: 1.337 ± 0.474
0.668ProArg: 0.668 ± 0.435
3.008ProSer: 3.008 ± 0.543
2.84ProThr: 2.84 ± 0.853
2.172ProVal: 2.172 ± 0.647
0.334ProTrp: 0.334 ± 0.213
1.504ProTyr: 1.504 ± 0.469
0.0ProXaa: 0.0 ± 0.0
Gln
2.673GlnAla: 2.673 ± 0.765
0.501GlnCys: 0.501 ± 0.343
1.337GlnAsp: 1.337 ± 0.445
3.175GlnGlu: 3.175 ± 0.703
1.838GlnPhe: 1.838 ± 0.478
2.84GlnGly: 2.84 ± 0.818
1.17GlnHis: 1.17 ± 0.471
3.509GlnIle: 3.509 ± 0.889
2.005GlnLys: 2.005 ± 0.892
2.673GlnLeu: 2.673 ± 0.979
1.337GlnMet: 1.337 ± 0.518
3.342GlnAsn: 3.342 ± 0.692
1.17GlnPro: 1.17 ± 0.497
1.838GlnGln: 1.838 ± 0.624
1.504GlnArg: 1.504 ± 0.454
2.172GlnSer: 2.172 ± 0.69
2.339GlnThr: 2.339 ± 0.626
2.84GlnVal: 2.84 ± 0.691
1.003GlnTrp: 1.003 ± 0.365
1.671GlnTyr: 1.671 ± 0.419
0.0GlnXaa: 0.0 ± 0.0
Arg
2.172ArgAla: 2.172 ± 0.771
0.0ArgCys: 0.0 ± 0.0
2.506ArgAsp: 2.506 ± 0.546
2.673ArgGlu: 2.673 ± 0.406
2.339ArgPhe: 2.339 ± 0.885
1.504ArgGly: 1.504 ± 0.487
0.668ArgHis: 0.668 ± 0.322
2.172ArgIle: 2.172 ± 0.614
1.671ArgLys: 1.671 ± 0.516
3.175ArgLeu: 3.175 ± 0.839
0.835ArgMet: 0.835 ± 0.353
2.673ArgAsn: 2.673 ± 0.853
1.17ArgPro: 1.17 ± 0.388
2.005ArgGln: 2.005 ± 0.576
2.005ArgArg: 2.005 ± 0.649
2.339ArgSer: 2.339 ± 0.475
1.337ArgThr: 1.337 ± 0.462
1.838ArgVal: 1.838 ± 0.537
0.167ArgTrp: 0.167 ± 0.162
2.339ArgTyr: 2.339 ± 0.678
0.0ArgXaa: 0.0 ± 0.0
Ser
3.843SerAla: 3.843 ± 1.034
0.167SerCys: 0.167 ± 0.182
3.843SerAsp: 3.843 ± 0.82
4.177SerGlu: 4.177 ± 0.881
2.84SerPhe: 2.84 ± 0.672
4.511SerGly: 4.511 ± 1.302
1.337SerHis: 1.337 ± 0.489
4.01SerIle: 4.01 ± 0.679
5.347SerLys: 5.347 ± 0.883
5.013SerLeu: 5.013 ± 0.918
1.17SerMet: 1.17 ± 0.44
3.509SerAsn: 3.509 ± 0.89
1.671SerPro: 1.671 ± 0.537
2.506SerGln: 2.506 ± 0.614
1.671SerArg: 1.671 ± 0.56
3.509SerSer: 3.509 ± 0.829
4.177SerThr: 4.177 ± 0.799
2.339SerVal: 2.339 ± 0.485
1.003SerTrp: 1.003 ± 0.322
1.838SerTyr: 1.838 ± 0.563
0.0SerXaa: 0.0 ± 0.0
Thr
3.843ThrAla: 3.843 ± 0.916
0.668ThrCys: 0.668 ± 0.292
5.013ThrAsp: 5.013 ± 0.789
6.182ThrGlu: 6.182 ± 0.966
3.342ThrPhe: 3.342 ± 0.613
4.344ThrGly: 4.344 ± 0.913
0.835ThrHis: 0.835 ± 0.324
4.511ThrIle: 4.511 ± 0.973
5.347ThrLys: 5.347 ± 1.196
7.185ThrLeu: 7.185 ± 1.41
0.835ThrMet: 0.835 ± 0.409
3.509ThrAsn: 3.509 ± 1.142
3.008ThrPro: 3.008 ± 0.907
3.342ThrGln: 3.342 ± 0.72
1.17ThrArg: 1.17 ± 0.35
4.177ThrSer: 4.177 ± 0.694
6.349ThrThr: 6.349 ± 1.086
4.678ThrVal: 4.678 ± 0.495
1.003ThrTrp: 1.003 ± 0.317
2.506ThrTyr: 2.506 ± 0.593
0.0ThrXaa: 0.0 ± 0.0
Val
2.172ValAla: 2.172 ± 0.673
0.334ValCys: 0.334 ± 0.208
4.511ValAsp: 4.511 ± 0.882
3.843ValGlu: 3.843 ± 0.967
2.172ValPhe: 2.172 ± 0.479
2.84ValGly: 2.84 ± 0.912
1.003ValHis: 1.003 ± 0.462
4.177ValIle: 4.177 ± 0.816
6.015ValLys: 6.015 ± 1.189
3.509ValLeu: 3.509 ± 0.762
1.838ValMet: 1.838 ± 0.668
4.678ValAsn: 4.678 ± 0.778
1.838ValPro: 1.838 ± 0.734
2.172ValGln: 2.172 ± 0.637
2.339ValArg: 2.339 ± 0.63
3.008ValSer: 3.008 ± 0.578
4.344ValThr: 4.344 ± 0.683
3.008ValVal: 3.008 ± 0.488
0.501ValTrp: 0.501 ± 0.278
2.506ValTyr: 2.506 ± 0.545
0.0ValXaa: 0.0 ± 0.0
Trp
0.668TrpAla: 0.668 ± 0.39
0.167TrpCys: 0.167 ± 0.162
0.167TrpAsp: 0.167 ± 0.152
1.337TrpGlu: 1.337 ± 0.508
0.501TrpPhe: 0.501 ± 0.341
0.501TrpGly: 0.501 ± 0.351
0.501TrpHis: 0.501 ± 0.261
0.668TrpIle: 0.668 ± 0.233
0.668TrpLys: 0.668 ± 0.273
1.337TrpLeu: 1.337 ± 0.304
0.501TrpMet: 0.501 ± 0.319
0.501TrpAsn: 0.501 ± 0.294
0.0TrpPro: 0.0 ± 0.0
0.167TrpGln: 0.167 ± 0.186
0.668TrpArg: 0.668 ± 0.348
0.501TrpSer: 0.501 ± 0.357
1.671TrpThr: 1.671 ± 0.604
0.835TrpVal: 0.835 ± 0.322
0.167TrpTrp: 0.167 ± 0.162
0.501TrpTyr: 0.501 ± 0.259
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.344TyrAla: 4.344 ± 0.934
0.167TyrCys: 0.167 ± 0.152
3.676TyrAsp: 3.676 ± 0.716
2.339TyrGlu: 2.339 ± 1.071
2.172TyrPhe: 2.172 ± 0.634
2.339TyrGly: 2.339 ± 0.84
1.337TyrHis: 1.337 ± 0.534
1.671TyrIle: 1.671 ± 0.723
3.342TyrLys: 3.342 ± 0.698
4.678TyrLeu: 4.678 ± 0.739
1.838TyrMet: 1.838 ± 0.699
4.678TyrAsn: 4.678 ± 0.881
1.337TyrPro: 1.337 ± 0.499
1.504TyrGln: 1.504 ± 0.451
1.504TyrArg: 1.504 ± 0.422
1.671TyrSer: 1.671 ± 0.518
2.84TyrThr: 2.84 ± 0.647
3.843TyrVal: 3.843 ± 0.754
0.501TyrTrp: 0.501 ± 0.533
3.008TyrTyr: 3.008 ± 0.626
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 22 proteins (5986 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski