Amino acid dipepetide frequency for Enterococcus phage EF-P29

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.018AlaAla: 6.018 ± 1.21
0.596AlaCys: 0.596 ± 0.196
4.29AlaAsp: 4.29 ± 0.591
5.899AlaGlu: 5.899 ± 0.508
2.741AlaPhe: 2.741 ± 0.448
4.409AlaGly: 4.409 ± 0.669
0.775AlaHis: 0.775 ± 0.249
5.72AlaIle: 5.72 ± 0.676
4.648AlaLys: 4.648 ± 0.695
5.839AlaLeu: 5.839 ± 0.818
2.681AlaMet: 2.681 ± 0.471
3.575AlaAsn: 3.575 ± 0.434
2.026AlaPro: 2.026 ± 0.38
2.145AlaGln: 2.145 ± 0.334
2.86AlaArg: 2.86 ± 0.345
4.826AlaSer: 4.826 ± 0.616
4.171AlaThr: 4.171 ± 0.509
4.469AlaVal: 4.469 ± 0.515
0.655AlaTrp: 0.655 ± 0.167
3.396AlaTyr: 3.396 ± 0.539
0.0AlaXaa: 0.0 ± 0.0
Cys
0.358CysAla: 0.358 ± 0.147
0.358CysCys: 0.358 ± 0.177
0.596CysAsp: 0.596 ± 0.17
0.715CysGlu: 0.715 ± 0.231
0.596CysPhe: 0.596 ± 0.158
0.775CysGly: 0.775 ± 0.222
0.119CysHis: 0.119 ± 0.088
0.298CysIle: 0.298 ± 0.149
0.417CysLys: 0.417 ± 0.187
0.655CysLeu: 0.655 ± 0.208
0.417CysMet: 0.417 ± 0.16
0.298CysAsn: 0.298 ± 0.136
0.06CysPro: 0.06 ± 0.061
0.298CysGln: 0.298 ± 0.121
0.179CysArg: 0.179 ± 0.092
0.477CysSer: 0.477 ± 0.176
0.298CysThr: 0.298 ± 0.131
0.298CysVal: 0.298 ± 0.13
0.06CysTrp: 0.06 ± 0.058
0.179CysTyr: 0.179 ± 0.102
0.0CysXaa: 0.0 ± 0.0
Asp
3.873AspAla: 3.873 ± 0.523
0.655AspCys: 0.655 ± 0.195
3.098AspAsp: 3.098 ± 0.637
4.886AspGlu: 4.886 ± 0.496
3.694AspPhe: 3.694 ± 0.533
4.171AspGly: 4.171 ± 0.53
0.417AspHis: 0.417 ± 0.151
3.873AspIle: 3.873 ± 0.496
4.111AspLys: 4.111 ± 0.495
5.839AspLeu: 5.839 ± 0.578
2.085AspMet: 2.085 ± 0.405
3.337AspAsn: 3.337 ± 0.48
1.192AspPro: 1.192 ± 0.255
1.073AspGln: 1.073 ± 0.297
2.145AspArg: 2.145 ± 0.386
3.277AspSer: 3.277 ± 0.384
4.111AspThr: 4.111 ± 0.64
4.29AspVal: 4.29 ± 0.522
1.073AspTrp: 1.073 ± 0.364
3.694AspTyr: 3.694 ± 0.461
0.0AspXaa: 0.0 ± 0.0
Glu
8.223GluAla: 8.223 ± 0.801
0.417GluCys: 0.417 ± 0.145
6.375GluAsp: 6.375 ± 0.622
10.129GluGlu: 10.129 ± 1.102
3.277GluPhe: 3.277 ± 0.531
5.66GluGly: 5.66 ± 0.604
0.536GluHis: 0.536 ± 0.185
3.933GluIle: 3.933 ± 0.459
5.124GluLys: 5.124 ± 0.668
7.865GluLeu: 7.865 ± 0.776
2.681GluMet: 2.681 ± 0.413
4.469GluAsn: 4.469 ± 0.626
2.085GluPro: 2.085 ± 0.42
2.979GluGln: 2.979 ± 0.429
4.826GluArg: 4.826 ± 0.52
5.184GluSer: 5.184 ± 0.527
4.111GluThr: 4.111 ± 0.56
6.495GluVal: 6.495 ± 0.606
1.609GluTrp: 1.609 ± 0.279
3.575GluTyr: 3.575 ± 0.46
0.0GluXaa: 0.0 ± 0.0
Phe
2.383PheAla: 2.383 ± 0.397
0.358PheCys: 0.358 ± 0.143
2.443PheAsp: 2.443 ± 0.453
3.515PheGlu: 3.515 ± 0.508
1.311PhePhe: 1.311 ± 0.278
3.218PheGly: 3.218 ± 0.35
0.655PheHis: 0.655 ± 0.226
3.158PheIle: 3.158 ± 0.509
3.635PheLys: 3.635 ± 0.368
3.396PheLeu: 3.396 ± 0.446
0.834PheMet: 0.834 ± 0.219
2.443PheAsn: 2.443 ± 0.321
1.43PhePro: 1.43 ± 0.31
1.311PheGln: 1.311 ± 0.283
1.251PheArg: 1.251 ± 0.267
2.443PheSer: 2.443 ± 0.359
2.741PheThr: 2.741 ± 0.339
2.383PheVal: 2.383 ± 0.378
0.596PheTrp: 0.596 ± 0.205
1.966PheTyr: 1.966 ± 0.433
0.0PheXaa: 0.0 ± 0.0
Gly
3.277GlyAla: 3.277 ± 0.398
0.477GlyCys: 0.477 ± 0.195
3.813GlyAsp: 3.813 ± 0.509
4.111GlyGlu: 4.111 ± 0.453
3.635GlyPhe: 3.635 ± 0.47
3.456GlyGly: 3.456 ± 0.798
1.37GlyHis: 1.37 ± 0.252
4.528GlyIle: 4.528 ± 0.616
6.018GlyLys: 6.018 ± 0.694
5.005GlyLeu: 5.005 ± 0.595
1.728GlyMet: 1.728 ± 0.274
3.158GlyAsn: 3.158 ± 0.434
0.119GlyPro: 0.119 ± 0.079
2.026GlyGln: 2.026 ± 0.39
2.562GlyArg: 2.562 ± 0.483
3.575GlySer: 3.575 ± 0.363
4.648GlyThr: 4.648 ± 0.719
4.23GlyVal: 4.23 ± 0.558
1.192GlyTrp: 1.192 ± 0.307
2.86GlyTyr: 2.86 ± 0.374
0.0GlyXaa: 0.0 ± 0.0
His
1.013HisAla: 1.013 ± 0.24
0.179HisCys: 0.179 ± 0.106
0.536HisAsp: 0.536 ± 0.175
1.192HisGlu: 1.192 ± 0.259
0.953HisPhe: 0.953 ± 0.235
1.073HisGly: 1.073 ± 0.23
0.358HisHis: 0.358 ± 0.147
1.251HisIle: 1.251 ± 0.262
1.013HisLys: 1.013 ± 0.302
1.013HisLeu: 1.013 ± 0.255
0.238HisMet: 0.238 ± 0.12
0.953HisAsn: 0.953 ± 0.223
0.775HisPro: 0.775 ± 0.221
0.417HisGln: 0.417 ± 0.171
0.953HisArg: 0.953 ± 0.19
0.596HisSer: 0.596 ± 0.183
0.536HisThr: 0.536 ± 0.195
1.013HisVal: 1.013 ± 0.228
0.06HisTrp: 0.06 ± 0.059
0.715HisTyr: 0.715 ± 0.204
0.0HisXaa: 0.0 ± 0.0
Ile
4.945IleAla: 4.945 ± 0.604
0.596IleCys: 0.596 ± 0.235
3.813IleAsp: 3.813 ± 0.409
5.66IleGlu: 5.66 ± 0.549
1.847IlePhe: 1.847 ± 0.383
3.337IleGly: 3.337 ± 0.371
1.132IleHis: 1.132 ± 0.242
3.158IleIle: 3.158 ± 0.488
5.422IleLys: 5.422 ± 0.555
4.052IleLeu: 4.052 ± 0.48
1.49IleMet: 1.49 ± 0.31
4.29IleAsn: 4.29 ± 0.428
1.966IlePro: 1.966 ± 0.268
2.681IleGln: 2.681 ± 0.541
2.622IleArg: 2.622 ± 0.375
3.575IleSer: 3.575 ± 0.469
3.515IleThr: 3.515 ± 0.571
3.694IleVal: 3.694 ± 0.375
0.417IleTrp: 0.417 ± 0.198
3.098IleTyr: 3.098 ± 0.445
0.0IleXaa: 0.0 ± 0.0
Lys
7.746LysAla: 7.746 ± 0.639
0.238LysCys: 0.238 ± 0.115
5.422LysAsp: 5.422 ± 0.509
7.21LysGlu: 7.21 ± 0.783
2.8LysPhe: 2.8 ± 0.443
4.469LysGly: 4.469 ± 0.478
1.132LysHis: 1.132 ± 0.31
3.754LysIle: 3.754 ± 0.346
6.137LysLys: 6.137 ± 0.711
6.673LysLeu: 6.673 ± 0.59
2.264LysMet: 2.264 ± 0.424
2.622LysAsn: 2.622 ± 0.377
3.218LysPro: 3.218 ± 0.485
2.741LysGln: 2.741 ± 0.487
3.515LysArg: 3.515 ± 0.54
4.528LysSer: 4.528 ± 0.695
3.694LysThr: 3.694 ± 0.496
5.839LysVal: 5.839 ± 0.538
1.132LysTrp: 1.132 ± 0.25
2.562LysTyr: 2.562 ± 0.397
0.0LysXaa: 0.0 ± 0.0
Leu
5.422LeuAla: 5.422 ± 0.572
0.536LeuCys: 0.536 ± 0.182
4.707LeuAsp: 4.707 ± 0.421
9.295LeuGlu: 9.295 ± 0.938
3.098LeuPhe: 3.098 ± 0.5
5.66LeuGly: 5.66 ± 0.794
1.132LeuHis: 1.132 ± 0.212
5.482LeuIle: 5.482 ± 0.636
6.852LeuLys: 6.852 ± 0.722
6.316LeuLeu: 6.316 ± 0.818
2.562LeuMet: 2.562 ± 0.471
4.409LeuAsn: 4.409 ± 0.557
3.098LeuPro: 3.098 ± 0.401
3.277LeuGln: 3.277 ± 0.414
3.396LeuArg: 3.396 ± 0.326
5.243LeuSer: 5.243 ± 0.488
6.554LeuThr: 6.554 ± 0.55
5.66LeuVal: 5.66 ± 0.698
0.775LeuTrp: 0.775 ± 0.2
2.503LeuTyr: 2.503 ± 0.338
0.0LeuXaa: 0.0 ± 0.0
Met
2.86MetAla: 2.86 ± 0.574
0.119MetCys: 0.119 ± 0.079
2.085MetAsp: 2.085 ± 0.337
2.979MetGlu: 2.979 ± 0.42
1.013MetPhe: 1.013 ± 0.307
1.073MetGly: 1.073 ± 0.298
0.358MetHis: 0.358 ± 0.144
1.132MetIle: 1.132 ± 0.297
2.145MetLys: 2.145 ± 0.33
2.8MetLeu: 2.8 ± 0.398
0.596MetMet: 0.596 ± 0.182
1.788MetAsn: 1.788 ± 0.284
1.132MetPro: 1.132 ± 0.226
0.715MetGln: 0.715 ± 0.208
1.37MetArg: 1.37 ± 0.304
1.847MetSer: 1.847 ± 0.318
1.192MetThr: 1.192 ± 0.227
1.609MetVal: 1.609 ± 0.248
0.179MetTrp: 0.179 ± 0.126
0.894MetTyr: 0.894 ± 0.206
0.0MetXaa: 0.0 ± 0.0
Asn
2.8AsnAla: 2.8 ± 0.36
0.238AsnCys: 0.238 ± 0.119
2.979AsnAsp: 2.979 ± 0.46
4.23AsnGlu: 4.23 ± 0.497
1.728AsnPhe: 1.728 ± 0.286
4.052AsnGly: 4.052 ± 0.505
1.37AsnHis: 1.37 ± 0.384
3.218AsnIle: 3.218 ± 0.444
4.707AsnLys: 4.707 ± 0.677
4.469AsnLeu: 4.469 ± 0.547
1.251AsnMet: 1.251 ± 0.263
2.383AsnAsn: 2.383 ± 0.494
3.098AsnPro: 3.098 ± 0.466
1.907AsnGln: 1.907 ± 0.278
1.907AsnArg: 1.907 ± 0.317
3.098AsnSer: 3.098 ± 0.343
2.622AsnThr: 2.622 ± 0.364
3.277AsnVal: 3.277 ± 0.385
0.417AsnTrp: 0.417 ± 0.192
2.264AsnTyr: 2.264 ± 0.398
0.0AsnXaa: 0.0 ± 0.0
Pro
2.026ProAla: 2.026 ± 0.453
0.179ProCys: 0.179 ± 0.096
1.549ProAsp: 1.549 ± 0.261
3.337ProGlu: 3.337 ± 0.52
1.311ProPhe: 1.311 ± 0.307
0.179ProGly: 0.179 ± 0.095
0.477ProHis: 0.477 ± 0.161
2.145ProIle: 2.145 ± 0.375
2.741ProLys: 2.741 ± 0.359
2.562ProLeu: 2.562 ± 0.4
0.834ProMet: 0.834 ± 0.204
1.966ProAsn: 1.966 ± 0.359
0.715ProPro: 0.715 ± 0.205
0.715ProGln: 0.715 ± 0.219
1.073ProArg: 1.073 ± 0.301
2.8ProSer: 2.8 ± 0.418
1.668ProThr: 1.668 ± 0.285
1.788ProVal: 1.788 ± 0.319
0.358ProTrp: 0.358 ± 0.142
1.668ProTyr: 1.668 ± 0.386
0.0ProXaa: 0.0 ± 0.0
Gln
2.145GlnAla: 2.145 ± 0.378
0.179GlnCys: 0.179 ± 0.093
1.847GlnAsp: 1.847 ± 0.289
3.337GlnGlu: 3.337 ± 0.421
1.073GlnPhe: 1.073 ± 0.294
2.503GlnGly: 2.503 ± 0.381
0.477GlnHis: 0.477 ± 0.191
1.668GlnIle: 1.668 ± 0.347
1.966GlnLys: 1.966 ± 0.36
3.575GlnLeu: 3.575 ± 0.437
0.715GlnMet: 0.715 ± 0.256
1.192GlnAsn: 1.192 ± 0.25
0.894GlnPro: 0.894 ± 0.23
1.788GlnGln: 1.788 ± 0.512
1.549GlnArg: 1.549 ± 0.246
2.085GlnSer: 2.085 ± 0.365
2.503GlnThr: 2.503 ± 0.326
2.681GlnVal: 2.681 ± 0.37
0.417GlnTrp: 0.417 ± 0.15
1.311GlnTyr: 1.311 ± 0.267
0.0GlnXaa: 0.0 ± 0.0
Arg
2.562ArgAla: 2.562 ± 0.43
0.536ArgCys: 0.536 ± 0.189
2.145ArgAsp: 2.145 ± 0.274
3.218ArgGlu: 3.218 ± 0.361
1.966ArgPhe: 1.966 ± 0.346
2.503ArgGly: 2.503 ± 0.466
0.775ArgHis: 0.775 ± 0.228
3.754ArgIle: 3.754 ± 0.566
3.098ArgLys: 3.098 ± 0.361
4.469ArgLeu: 4.469 ± 0.541
0.834ArgMet: 0.834 ± 0.23
1.907ArgAsn: 1.907 ± 0.34
1.37ArgPro: 1.37 ± 0.293
1.549ArgGln: 1.549 ± 0.325
2.145ArgArg: 2.145 ± 0.359
2.205ArgSer: 2.205 ± 0.449
1.966ArgThr: 1.966 ± 0.295
2.383ArgVal: 2.383 ± 0.38
0.536ArgTrp: 0.536 ± 0.152
2.085ArgTyr: 2.085 ± 0.323
0.0ArgXaa: 0.0 ± 0.0
Ser
3.218SerAla: 3.218 ± 0.376
0.417SerCys: 0.417 ± 0.154
3.515SerAsp: 3.515 ± 0.425
4.23SerGlu: 4.23 ± 0.446
2.86SerPhe: 2.86 ± 0.377
3.873SerGly: 3.873 ± 0.391
1.132SerHis: 1.132 ± 0.277
4.111SerIle: 4.111 ± 0.489
5.72SerLys: 5.72 ± 0.551
5.363SerLeu: 5.363 ± 0.591
1.609SerMet: 1.609 ± 0.28
3.158SerAsn: 3.158 ± 0.456
1.668SerPro: 1.668 ± 0.293
1.907SerGln: 1.907 ± 0.376
2.562SerArg: 2.562 ± 0.332
3.039SerSer: 3.039 ± 0.503
3.515SerThr: 3.515 ± 0.479
4.29SerVal: 4.29 ± 0.598
0.953SerTrp: 0.953 ± 0.242
2.8SerTyr: 2.8 ± 0.424
0.0SerXaa: 0.0 ± 0.0
Thr
4.886ThrAla: 4.886 ± 0.855
0.358ThrCys: 0.358 ± 0.141
3.575ThrAsp: 3.575 ± 0.478
3.754ThrGlu: 3.754 ± 0.443
2.741ThrPhe: 2.741 ± 0.417
3.396ThrGly: 3.396 ± 0.585
1.013ThrHis: 1.013 ± 0.267
3.933ThrIle: 3.933 ± 0.506
3.635ThrLys: 3.635 ± 0.362
6.256ThrLeu: 6.256 ± 0.674
1.251ThrMet: 1.251 ± 0.251
2.92ThrAsn: 2.92 ± 0.535
2.383ThrPro: 2.383 ± 0.447
1.549ThrGln: 1.549 ± 0.312
1.668ThrArg: 1.668 ± 0.338
3.933ThrSer: 3.933 ± 0.507
2.681ThrThr: 2.681 ± 0.382
4.767ThrVal: 4.767 ± 0.699
0.596ThrTrp: 0.596 ± 0.168
2.622ThrTyr: 2.622 ± 0.364
0.0ThrXaa: 0.0 ± 0.0
Val
4.945ValAla: 4.945 ± 0.569
0.477ValCys: 0.477 ± 0.171
4.588ValAsp: 4.588 ± 0.47
5.839ValGlu: 5.839 ± 0.665
3.098ValPhe: 3.098 ± 0.47
3.873ValGly: 3.873 ± 0.426
0.775ValHis: 0.775 ± 0.204
3.813ValIle: 3.813 ± 0.462
5.72ValLys: 5.72 ± 0.53
5.065ValLeu: 5.065 ± 0.475
1.549ValMet: 1.549 ± 0.26
3.873ValAsn: 3.873 ± 0.489
1.788ValPro: 1.788 ± 0.327
2.443ValGln: 2.443 ± 0.36
3.218ValArg: 3.218 ± 0.426
3.337ValSer: 3.337 ± 0.491
3.992ValThr: 3.992 ± 0.432
3.873ValVal: 3.873 ± 0.504
1.311ValTrp: 1.311 ± 0.23
2.741ValTyr: 2.741 ± 0.474
0.0ValXaa: 0.0 ± 0.0
Trp
0.715TrpAla: 0.715 ± 0.214
0.119TrpCys: 0.119 ± 0.078
0.655TrpAsp: 0.655 ± 0.183
1.43TrpGlu: 1.43 ± 0.285
0.536TrpPhe: 0.536 ± 0.189
1.013TrpGly: 1.013 ± 0.277
0.179TrpHis: 0.179 ± 0.097
1.013TrpIle: 1.013 ± 0.217
1.132TrpLys: 1.132 ± 0.225
0.775TrpLeu: 0.775 ± 0.234
0.06TrpMet: 0.06 ± 0.049
1.192TrpAsn: 1.192 ± 0.244
0.0TrpPro: 0.0 ± 0.0
0.655TrpGln: 0.655 ± 0.222
0.298TrpArg: 0.298 ± 0.133
1.192TrpSer: 1.192 ± 0.331
0.775TrpThr: 0.775 ± 0.212
0.477TrpVal: 0.477 ± 0.168
0.119TrpTrp: 0.119 ± 0.075
0.715TrpTyr: 0.715 ± 0.243
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.324TyrAla: 2.324 ± 0.406
0.417TyrCys: 0.417 ± 0.176
2.86TyrAsp: 2.86 ± 0.392
4.588TyrGlu: 4.588 ± 0.612
1.192TyrPhe: 1.192 ± 0.225
3.098TyrGly: 3.098 ± 0.512
0.655TyrHis: 0.655 ± 0.194
1.37TyrIle: 1.37 ± 0.263
3.456TyrLys: 3.456 ± 0.431
3.992TyrLeu: 3.992 ± 0.511
2.145TyrMet: 2.145 ± 0.415
2.085TyrAsn: 2.085 ± 0.445
1.073TyrPro: 1.073 ± 0.24
1.728TyrGln: 1.728 ± 0.334
1.907TyrArg: 1.907 ± 0.345
2.741TyrSer: 2.741 ± 0.366
2.622TyrThr: 2.622 ± 0.421
2.8TyrVal: 2.8 ± 0.289
0.596TyrTrp: 0.596 ± 0.189
2.503TyrTyr: 2.503 ± 0.373
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 101 proteins (16784 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski