Amino acid dipeptide frequency for Enterococcus phage vB_EfaP_IME195

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
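The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the database: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to be one-letter-code strings, overlapping pairs are counted within each protein only, and the counts are normalised by the total number of dipeptides. The exact normalisation and the threshold used for the red/blue marking are not stated on this page, so both are assumptions here.

```python
from collections import Counter

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: frequencies are normalised by the total number of
    overlapping dipeptides counted; pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input (made-up sequences, not this proteome).
toy = dipeptide_permille(["AAC", "AC"])
```

With this assumed threshold, a table value such as AlaAsp (3.253) would be labelled "over" and AlaAla (0.514) "under".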

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
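As a worked example of the unit conversion, the snippet below takes the LeuLys value from the table (8.39 ‰), converts it to a fraction and a percentage, and compares it with the 0.25% expected for a uniform distribution. The helper name `permille_to_fraction` is hypothetical and only serves the illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


leu_lys_permille = 8.39            # LeuLys entry from the table, in per mille
fraction = permille_to_fraction(leu_lys_permille)
percent = fraction * 100           # same value expressed in percent
uniform_percent = 100 / 400        # 0.25 % if all 400 dipeptides were equally likely
fold = percent / uniform_percent   # how many times more frequent than uniform
```

Here LeuLys comes out at about 0.839%, roughly 3.4 times the uniform expectation.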

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.514AlaAla: 0.514 ± 0.3
0.514AlaCys: 0.514 ± 0.252
3.253AlaAsp: 3.253 ± 0.566
3.596AlaGlu: 3.596 ± 0.838
2.74AlaPhe: 2.74 ± 0.546
3.082AlaGly: 3.082 ± 1.115
0.685AlaHis: 0.685 ± 0.384
4.11AlaIle: 4.11 ± 0.822
2.74AlaLys: 2.74 ± 0.595
5.479AlaLeu: 5.479 ± 0.852
1.884AlaMet: 1.884 ± 0.524
3.596AlaAsn: 3.596 ± 0.631
1.884AlaPro: 1.884 ± 0.418
3.082AlaGln: 3.082 ± 0.758
2.055AlaArg: 2.055 ± 0.424
2.397AlaSer: 2.397 ± 0.553
3.767AlaThr: 3.767 ± 0.829
1.884AlaVal: 1.884 ± 0.47
0.685AlaTrp: 0.685 ± 0.412
3.082AlaTyr: 3.082 ± 0.581
0.0AlaXaa: 0.0 ± 0.0
Cys
0.342CysAla: 0.342 ± 0.209
0.0CysCys: 0.0 ± 0.0
0.856CysAsp: 0.856 ± 0.348
0.685CysGlu: 0.685 ± 0.24
0.514CysPhe: 0.514 ± 0.269
0.856CysGly: 0.856 ± 0.387
0.171CysHis: 0.171 ± 0.198
0.171CysIle: 0.171 ± 0.148
0.171CysLys: 0.171 ± 0.162
0.514CysLeu: 0.514 ± 0.294
0.171CysMet: 0.171 ± 0.189
0.685CysAsn: 0.685 ± 0.26
0.171CysPro: 0.171 ± 0.149
0.342CysGln: 0.342 ± 0.196
0.514CysArg: 0.514 ± 0.279
0.514CysSer: 0.514 ± 0.263
0.514CysThr: 0.514 ± 0.331
1.027CysVal: 1.027 ± 0.362
0.0CysTrp: 0.0 ± 0.0
0.342CysTyr: 0.342 ± 0.249
0.0CysXaa: 0.0 ± 0.0
Asp
1.37AspAla: 1.37 ± 0.397
0.856AspCys: 0.856 ± 0.356
2.397AspAsp: 2.397 ± 0.753
4.795AspGlu: 4.795 ± 1.021
3.938AspPhe: 3.938 ± 0.625
3.938AspGly: 3.938 ± 0.715
0.342AspHis: 0.342 ± 0.26
4.966AspIle: 4.966 ± 0.961
4.452AspLys: 4.452 ± 0.907
7.192AspLeu: 7.192 ± 1.138
1.199AspMet: 1.199 ± 0.469
4.795AspAsn: 4.795 ± 0.882
2.397AspPro: 2.397 ± 0.742
0.685AspGln: 0.685 ± 0.428
2.226AspArg: 2.226 ± 0.698
2.74AspSer: 2.74 ± 0.789
3.938AspThr: 3.938 ± 0.858
5.308AspVal: 5.308 ± 0.964
0.514AspTrp: 0.514 ± 0.271
4.11AspTyr: 4.11 ± 0.556
0.0AspXaa: 0.0 ± 0.0
Glu
3.596GluAla: 3.596 ± 0.793
0.856GluCys: 0.856 ± 0.47
3.938GluAsp: 3.938 ± 0.92
6.507GluGlu: 6.507 ± 1.395
3.253GluPhe: 3.253 ± 0.623
3.425GluGly: 3.425 ± 0.911
1.541GluHis: 1.541 ± 0.411
5.993GluIle: 5.993 ± 1.142
5.993GluLys: 5.993 ± 1.347
5.137GluLeu: 5.137 ± 1.193
2.568GluMet: 2.568 ± 0.705
4.623GluAsn: 4.623 ± 0.768
2.226GluPro: 2.226 ± 0.845
3.596GluGln: 3.596 ± 0.542
3.425GluArg: 3.425 ± 0.654
3.767GluSer: 3.767 ± 0.851
5.137GluThr: 5.137 ± 0.702
4.795GluVal: 4.795 ± 1.067
1.541GluTrp: 1.541 ± 0.514
4.11GluTyr: 4.11 ± 1.213
0.0GluXaa: 0.0 ± 0.0
Phe
2.226PheAla: 2.226 ± 0.457
0.342PheCys: 0.342 ± 0.21
3.938PheAsp: 3.938 ± 0.908
2.911PheGlu: 2.911 ± 0.735
1.37PhePhe: 1.37 ± 0.543
3.082PheGly: 3.082 ± 1.374
0.342PheHis: 0.342 ± 0.274
4.623PheIle: 4.623 ± 0.745
2.568PheLys: 2.568 ± 0.653
3.938PheLeu: 3.938 ± 1.061
1.199PheMet: 1.199 ± 0.425
4.795PheAsn: 4.795 ± 1.279
2.055PhePro: 2.055 ± 0.431
0.685PheGln: 0.685 ± 0.324
1.712PheArg: 1.712 ± 0.43
3.082PheSer: 3.082 ± 0.999
3.253PheThr: 3.253 ± 0.885
2.568PheVal: 2.568 ± 0.789
0.342PheTrp: 0.342 ± 0.187
2.055PheTyr: 2.055 ± 0.725
0.0PheXaa: 0.0 ± 0.0
Gly
4.11GlyAla: 4.11 ± 1.788
0.685GlyCys: 0.685 ± 0.287
3.767GlyAsp: 3.767 ± 1.148
4.11GlyGlu: 4.11 ± 0.871
2.397GlyPhe: 2.397 ± 0.789
6.678GlyGly: 6.678 ± 1.171
1.712GlyHis: 1.712 ± 0.498
3.767GlyIle: 3.767 ± 0.917
4.623GlyLys: 4.623 ± 0.728
5.137GlyLeu: 5.137 ± 0.987
1.199GlyMet: 1.199 ± 0.551
3.938GlyAsn: 3.938 ± 1.072
0.342GlyPro: 0.342 ± 0.236
0.856GlyGln: 0.856 ± 0.495
1.541GlyArg: 1.541 ± 0.334
3.596GlySer: 3.596 ± 1.132
4.11GlyThr: 4.11 ± 0.729
4.452GlyVal: 4.452 ± 0.803
0.514GlyTrp: 0.514 ± 0.397
2.74GlyTyr: 2.74 ± 0.726
0.0GlyXaa: 0.0 ± 0.0
His
0.856HisAla: 0.856 ± 0.378
0.171HisCys: 0.171 ± 0.137
0.856HisAsp: 0.856 ± 0.347
1.37HisGlu: 1.37 ± 0.462
1.884HisPhe: 1.884 ± 0.606
0.856HisGly: 0.856 ± 0.562
0.171HisHis: 0.171 ± 0.149
1.199HisIle: 1.199 ± 0.411
0.685HisLys: 0.685 ± 0.344
1.199HisLeu: 1.199 ± 0.29
0.514HisMet: 0.514 ± 0.276
1.541HisAsn: 1.541 ± 0.456
0.514HisPro: 0.514 ± 0.363
0.171HisGln: 0.171 ± 0.137
1.027HisArg: 1.027 ± 0.472
1.027HisSer: 1.027 ± 0.377
0.171HisThr: 0.171 ± 0.162
1.027HisVal: 1.027 ± 0.291
0.514HisTrp: 0.514 ± 0.336
1.884HisTyr: 1.884 ± 0.404
0.0HisXaa: 0.0 ± 0.0
Ile
4.281IleAla: 4.281 ± 0.857
0.856IleCys: 0.856 ± 0.333
5.308IleAsp: 5.308 ± 0.824
4.623IleGlu: 4.623 ± 1.023
1.884IlePhe: 1.884 ± 0.692
3.596IleGly: 3.596 ± 0.745
1.541IleHis: 1.541 ± 0.4
3.596IleIle: 3.596 ± 0.753
5.993IleLys: 5.993 ± 1.161
3.596IleLeu: 3.596 ± 0.823
1.027IleMet: 1.027 ± 0.402
5.308IleAsn: 5.308 ± 0.66
3.082IlePro: 3.082 ± 0.634
2.74IleGln: 2.74 ± 0.753
1.541IleArg: 1.541 ± 0.583
4.281IleSer: 4.281 ± 0.738
4.795IleThr: 4.795 ± 0.767
3.767IleVal: 3.767 ± 0.975
0.685IleTrp: 0.685 ± 0.397
2.911IleTyr: 2.911 ± 0.55
0.0IleXaa: 0.0 ± 0.0
Lys
5.479LysAla: 5.479 ± 1.147
0.514LysCys: 0.514 ± 0.242
5.137LysAsp: 5.137 ± 0.981
6.849LysGlu: 6.849 ± 1.186
4.281LysPhe: 4.281 ± 0.759
4.795LysGly: 4.795 ± 0.774
1.199LysHis: 1.199 ± 0.289
4.623LysIle: 4.623 ± 0.638
5.479LysLys: 5.479 ± 0.849
8.219LysLeu: 8.219 ± 1.121
2.74LysMet: 2.74 ± 0.708
2.568LysAsn: 2.568 ± 0.675
3.425LysPro: 3.425 ± 0.916
2.911LysGln: 2.911 ± 0.796
3.253LysArg: 3.253 ± 0.604
2.568LysSer: 2.568 ± 0.764
4.795LysThr: 4.795 ± 1.235
6.849LysVal: 6.849 ± 0.956
0.856LysTrp: 0.856 ± 0.381
3.938LysTyr: 3.938 ± 0.839
0.0LysXaa: 0.0 ± 0.0
Leu
5.822LeuAla: 5.822 ± 0.781
0.514LeuCys: 0.514 ± 0.347
5.651LeuAsp: 5.651 ± 0.767
7.021LeuGlu: 7.021 ± 1.153
0.856LeuPhe: 0.856 ± 0.383
5.308LeuGly: 5.308 ± 1.097
1.37LeuHis: 1.37 ± 0.441
3.767LeuIle: 3.767 ± 0.689
8.39LeuLys: 8.39 ± 1.41
7.192LeuLeu: 7.192 ± 1.411
2.055LeuMet: 2.055 ± 0.716
6.336LeuAsn: 6.336 ± 1.088
3.425LeuPro: 3.425 ± 0.793
1.541LeuGln: 1.541 ± 0.475
3.253LeuArg: 3.253 ± 0.733
5.479LeuSer: 5.479 ± 0.723
5.137LeuThr: 5.137 ± 0.677
5.137LeuVal: 5.137 ± 0.874
1.199LeuTrp: 1.199 ± 0.439
3.425LeuTyr: 3.425 ± 0.773
0.0LeuXaa: 0.0 ± 0.0
Met
1.37MetAla: 1.37 ± 0.353
0.0MetCys: 0.0 ± 0.0
0.856MetAsp: 0.856 ± 0.337
1.37MetGlu: 1.37 ± 0.414
1.884MetPhe: 1.884 ± 0.475
1.884MetGly: 1.884 ± 0.416
0.171MetHis: 0.171 ± 0.175
2.226MetIle: 2.226 ± 0.598
3.082MetLys: 3.082 ± 0.524
2.911MetLeu: 2.911 ± 0.686
0.514MetMet: 0.514 ± 0.285
2.055MetAsn: 2.055 ± 0.505
0.685MetPro: 0.685 ± 0.299
0.856MetGln: 0.856 ± 0.321
0.856MetArg: 0.856 ± 0.411
1.884MetSer: 1.884 ± 0.562
2.055MetThr: 2.055 ± 0.676
1.199MetVal: 1.199 ± 0.347
0.171MetTrp: 0.171 ± 0.176
1.37MetTyr: 1.37 ± 0.454
0.0MetXaa: 0.0 ± 0.0
Asn
4.281AsnAla: 4.281 ± 0.913
0.0AsnCys: 0.0 ± 0.0
2.911AsnAsp: 2.911 ± 0.721
5.651AsnGlu: 5.651 ± 0.925
3.596AsnPhe: 3.596 ± 0.962
5.137AsnGly: 5.137 ± 0.928
2.226AsnHis: 2.226 ± 0.56
3.938AsnIle: 3.938 ± 1.009
5.822AsnLys: 5.822 ± 0.887
4.452AsnLeu: 4.452 ± 0.792
2.397AsnMet: 2.397 ± 0.696
6.336AsnAsn: 6.336 ± 1.015
4.11AsnPro: 4.11 ± 0.898
2.397AsnGln: 2.397 ± 0.581
1.884AsnArg: 1.884 ± 0.469
4.623AsnSer: 4.623 ± 0.765
3.082AsnThr: 3.082 ± 0.775
3.253AsnVal: 3.253 ± 1.069
1.199AsnTrp: 1.199 ± 0.454
3.938AsnTyr: 3.938 ± 0.78
0.0AsnXaa: 0.0 ± 0.0
Pro
2.055ProAla: 2.055 ± 0.519
0.0ProCys: 0.0 ± 0.0
3.425ProAsp: 3.425 ± 0.594
3.425ProGlu: 3.425 ± 0.528
2.397ProPhe: 2.397 ± 0.498
0.685ProGly: 0.685 ± 0.28
0.342ProHis: 0.342 ± 0.208
2.055ProIle: 2.055 ± 0.687
3.938ProLys: 3.938 ± 1.049
1.884ProLeu: 1.884 ± 0.454
0.171ProMet: 0.171 ± 0.159
2.397ProAsn: 2.397 ± 0.721
0.856ProPro: 0.856 ± 0.747
1.199ProGln: 1.199 ± 0.462
0.856ProArg: 0.856 ± 0.436
2.74ProSer: 2.74 ± 0.678
3.425ProThr: 3.425 ± 0.718
1.37ProVal: 1.37 ± 0.476
0.342ProTrp: 0.342 ± 0.217
1.199ProTyr: 1.199 ± 0.499
0.0ProXaa: 0.0 ± 0.0
Gln
1.541GlnAla: 1.541 ± 0.447
0.171GlnCys: 0.171 ± 0.189
1.027GlnAsp: 1.027 ± 0.412
2.226GlnGlu: 2.226 ± 0.346
2.226GlnPhe: 2.226 ± 0.59
1.884GlnGly: 1.884 ± 0.523
0.685GlnHis: 0.685 ± 0.324
1.884GlnIle: 1.884 ± 0.469
2.911GlnLys: 2.911 ± 0.691
2.568GlnLeu: 2.568 ± 0.508
0.856GlnMet: 0.856 ± 0.291
2.568GlnAsn: 2.568 ± 0.513
0.856GlnPro: 0.856 ± 0.393
1.199GlnGln: 1.199 ± 0.473
0.856GlnArg: 0.856 ± 0.422
1.199GlnSer: 1.199 ± 0.442
3.425GlnThr: 3.425 ± 0.838
1.884GlnVal: 1.884 ± 0.67
1.541GlnTrp: 1.541 ± 0.487
1.37GlnTyr: 1.37 ± 0.514
0.0GlnXaa: 0.0 ± 0.0
Arg
0.856ArgAla: 0.856 ± 0.421
0.856ArgCys: 0.856 ± 0.42
1.884ArgAsp: 1.884 ± 0.441
2.226ArgGlu: 2.226 ± 0.657
2.226ArgPhe: 2.226 ± 0.695
1.712ArgGly: 1.712 ± 0.431
0.514ArgHis: 0.514 ± 0.292
2.911ArgIle: 2.911 ± 0.647
2.397ArgLys: 2.397 ± 0.695
3.253ArgLeu: 3.253 ± 0.718
0.856ArgMet: 0.856 ± 0.481
2.568ArgAsn: 2.568 ± 0.58
1.027ArgPro: 1.027 ± 0.328
2.226ArgGln: 2.226 ± 0.51
0.856ArgArg: 0.856 ± 0.381
1.199ArgSer: 1.199 ± 0.431
1.027ArgThr: 1.027 ± 0.4
2.055ArgVal: 2.055 ± 0.585
0.171ArgTrp: 0.171 ± 0.186
2.74ArgTyr: 2.74 ± 0.695
0.0ArgXaa: 0.0 ± 0.0
Ser
2.568SerAla: 2.568 ± 0.763
0.0SerCys: 0.0 ± 0.0
4.623SerAsp: 4.623 ± 1.07
3.253SerGlu: 3.253 ± 0.513
3.082SerPhe: 3.082 ± 0.567
2.911SerGly: 2.911 ± 0.823
1.027SerHis: 1.027 ± 0.378
4.281SerIle: 4.281 ± 0.821
4.966SerLys: 4.966 ± 0.646
4.966SerLeu: 4.966 ± 1.315
2.055SerMet: 2.055 ± 0.638
3.767SerAsn: 3.767 ± 0.88
1.027SerPro: 1.027 ± 0.508
1.541SerGln: 1.541 ± 0.465
1.541SerArg: 1.541 ± 0.532
2.911SerSer: 2.911 ± 0.587
3.425SerThr: 3.425 ± 0.937
2.74SerVal: 2.74 ± 0.729
1.37SerTrp: 1.37 ± 0.501
3.253SerTyr: 3.253 ± 1.107
0.0SerXaa: 0.0 ± 0.0
Thr
3.253ThrAla: 3.253 ± 0.79
0.171ThrCys: 0.171 ± 0.159
3.938ThrAsp: 3.938 ± 1.129
5.479ThrGlu: 5.479 ± 0.833
3.596ThrPhe: 3.596 ± 0.786
4.452ThrGly: 4.452 ± 0.815
1.37ThrHis: 1.37 ± 0.44
4.795ThrIle: 4.795 ± 0.796
5.651ThrLys: 5.651 ± 0.963
5.308ThrLeu: 5.308 ± 0.914
2.226ThrMet: 2.226 ± 0.727
4.452ThrAsn: 4.452 ± 0.866
2.226ThrPro: 2.226 ± 0.671
2.911ThrGln: 2.911 ± 1.023
1.712ThrArg: 1.712 ± 0.458
4.623ThrSer: 4.623 ± 0.734
5.137ThrThr: 5.137 ± 0.864
4.11ThrVal: 4.11 ± 0.886
0.514ThrTrp: 0.514 ± 0.325
2.74ThrTyr: 2.74 ± 0.445
0.0ThrXaa: 0.0 ± 0.0
Val
3.253ValAla: 3.253 ± 0.835
0.856ValCys: 0.856 ± 0.331
4.452ValAsp: 4.452 ± 1.163
5.651ValGlu: 5.651 ± 1.048
2.226ValPhe: 2.226 ± 0.489
3.082ValGly: 3.082 ± 0.777
0.685ValHis: 0.685 ± 0.573
3.596ValIle: 3.596 ± 0.453
5.822ValLys: 5.822 ± 0.882
3.425ValLeu: 3.425 ± 0.586
1.884ValMet: 1.884 ± 0.498
3.767ValAsn: 3.767 ± 0.862
2.911ValPro: 2.911 ± 0.524
0.342ValGln: 0.342 ± 0.255
2.397ValArg: 2.397 ± 0.688
2.911ValSer: 2.911 ± 0.908
5.137ValThr: 5.137 ± 1.153
2.397ValVal: 2.397 ± 0.603
0.342ValTrp: 0.342 ± 0.209
3.425ValTyr: 3.425 ± 0.782
0.0ValXaa: 0.0 ± 0.0
Trp
0.856TrpAla: 0.856 ± 0.382
0.342TrpCys: 0.342 ± 0.378
1.027TrpAsp: 1.027 ± 0.349
0.514TrpGlu: 0.514 ± 0.266
0.685TrpPhe: 0.685 ± 0.292
0.0TrpGly: 0.0 ± 0.0
0.342TrpHis: 0.342 ± 0.199
0.856TrpIle: 0.856 ± 0.298
0.685TrpLys: 0.685 ± 0.358
2.055TrpLeu: 2.055 ± 0.653
0.342TrpMet: 0.342 ± 0.235
0.514TrpAsn: 0.514 ± 0.382
0.0TrpPro: 0.0 ± 0.0
0.685TrpGln: 0.685 ± 0.32
0.856TrpArg: 0.856 ± 0.42
0.514TrpSer: 0.514 ± 0.305
1.541TrpThr: 1.541 ± 0.43
0.514TrpVal: 0.514 ± 0.409
0.171TrpTrp: 0.171 ± 0.189
0.856TrpTyr: 0.856 ± 0.319
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.397TyrAla: 2.397 ± 0.378
0.856TyrCys: 0.856 ± 0.349
2.911TyrAsp: 2.911 ± 0.91
3.767TyrGlu: 3.767 ± 0.96
2.055TyrPhe: 2.055 ± 0.539
2.911TyrGly: 2.911 ± 0.628
1.199TyrHis: 1.199 ± 0.336
2.055TyrIle: 2.055 ± 0.674
4.11TyrLys: 4.11 ± 0.973
4.623TyrLeu: 4.623 ± 0.755
1.37TyrMet: 1.37 ± 0.367
4.623TyrAsn: 4.623 ± 0.685
1.541TyrPro: 1.541 ± 0.451
2.911TyrGln: 2.911 ± 0.546
1.199TyrArg: 1.199 ± 0.376
3.253TyrSer: 3.253 ± 0.564
4.795TyrThr: 4.795 ± 0.632
2.226TyrVal: 2.226 ± 0.628
0.685TyrTrp: 0.685 ± 0.338
3.253TyrTyr: 3.253 ± 0.69
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 27 proteins (5841 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
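To illustrate what bootstrapping at the protein level means, the sketch below resamples whole proteins with replacement, recomputes the frequency of one dipeptide for each resample, and reports the spread of those estimates. It is an assumption-laden sketch rather than the database's actual procedure: the function names are hypothetical, sequences are assumed to be one-letter-code strings, and the sample standard deviation is assumed to be the reported error (the page does not say which statistic is used).

```python
import random
from statistics import stdev


def dipeptide_frequency_permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, pooled over the given proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        n_pairs = max(len(seq) - 1, 0)
        total += n_pairs
        hits += sum(1 for i in range(n_pairs) if seq[i:i + 2] == dipeptide)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement, so whole
    proteins (not individual residues) are the resampling unit.
    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency_permille(sample, dipeptide))
    return stdev(estimates)
```

Resampling proteins rather than residues keeps each sequence intact, so the estimate reflects how much a frequency depends on which proteins happen to be in the proteome.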

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski