Amino acid dipepetide frequency for Enterococcus phage vB_EfaS_Ef6.1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.73AlaAla: 0.73 ± 0.315
0.162AlaCys: 0.162 ± 0.118
3.57AlaAsp: 3.57 ± 0.485
3.732AlaGlu: 3.732 ± 0.495
2.84AlaPhe: 2.84 ± 0.481
3.326AlaGly: 3.326 ± 0.671
0.974AlaHis: 0.974 ± 0.317
4.787AlaIle: 4.787 ± 1.0
5.841AlaLys: 5.841 ± 0.856
5.517AlaLeu: 5.517 ± 0.779
2.758AlaMet: 2.758 ± 0.584
3.894AlaAsn: 3.894 ± 0.484
1.866AlaPro: 1.866 ± 0.37
1.623AlaGln: 1.623 ± 0.264
1.704AlaArg: 1.704 ± 0.249
3.164AlaSer: 3.164 ± 0.5
4.138AlaThr: 4.138 ± 0.57
4.706AlaVal: 4.706 ± 0.702
0.487AlaTrp: 0.487 ± 0.212
2.758AlaTyr: 2.758 ± 0.359
0.0AlaXaa: 0.0 ± 0.0
Cys
0.487CysAla: 0.487 ± 0.208
0.0CysCys: 0.0 ± 0.0
0.162CysAsp: 0.162 ± 0.12
0.649CysGlu: 0.649 ± 0.266
0.325CysPhe: 0.325 ± 0.185
0.649CysGly: 0.649 ± 0.248
0.243CysHis: 0.243 ± 0.146
0.406CysIle: 0.406 ± 0.2
0.811CysLys: 0.811 ± 0.279
0.568CysLeu: 0.568 ± 0.216
0.162CysMet: 0.162 ± 0.115
0.649CysAsn: 0.649 ± 0.273
0.0CysPro: 0.0 ± 0.0
0.081CysGln: 0.081 ± 0.081
0.162CysArg: 0.162 ± 0.118
0.406CysSer: 0.406 ± 0.194
0.487CysThr: 0.487 ± 0.273
0.243CysVal: 0.243 ± 0.135
0.162CysTrp: 0.162 ± 0.118
0.162CysTyr: 0.162 ± 0.115
0.0CysXaa: 0.0 ± 0.0
Asp
2.84AspAla: 2.84 ± 0.619
0.406AspCys: 0.406 ± 0.217
2.515AspAsp: 2.515 ± 0.621
4.219AspGlu: 4.219 ± 0.501
2.84AspPhe: 2.84 ± 0.589
5.517AspGly: 5.517 ± 0.601
0.649AspHis: 0.649 ± 0.247
4.624AspIle: 4.624 ± 0.637
5.841AspLys: 5.841 ± 0.747
5.436AspLeu: 5.436 ± 0.595
1.785AspMet: 1.785 ± 0.416
4.219AspAsn: 4.219 ± 0.656
2.353AspPro: 2.353 ± 0.436
1.541AspGln: 1.541 ± 0.322
1.866AspArg: 1.866 ± 0.359
3.245AspSer: 3.245 ± 0.468
4.138AspThr: 4.138 ± 0.665
3.894AspVal: 3.894 ± 0.596
0.892AspTrp: 0.892 ± 0.312
3.489AspTyr: 3.489 ± 0.662
0.0AspXaa: 0.0 ± 0.0
Glu
5.192GluAla: 5.192 ± 0.847
0.487GluCys: 0.487 ± 0.216
4.543GluAsp: 4.543 ± 0.69
7.383GluGlu: 7.383 ± 1.563
3.894GluPhe: 3.894 ± 0.632
4.787GluGly: 4.787 ± 0.667
1.217GluHis: 1.217 ± 0.304
4.381GluIle: 4.381 ± 0.662
6.815GluLys: 6.815 ± 0.678
9.086GluLeu: 9.086 ± 0.855
2.677GluMet: 2.677 ± 0.596
3.57GluAsn: 3.57 ± 0.504
2.677GluPro: 2.677 ± 0.631
3.651GluGln: 3.651 ± 0.635
2.758GluArg: 2.758 ± 0.465
3.57GluSer: 3.57 ± 0.681
4.949GluThr: 4.949 ± 0.635
6.085GluVal: 6.085 ± 0.794
1.46GluTrp: 1.46 ± 0.282
3.651GluTyr: 3.651 ± 0.669
0.0GluXaa: 0.0 ± 0.0
Phe
1.947PheAla: 1.947 ± 0.286
0.325PheCys: 0.325 ± 0.161
3.164PheAsp: 3.164 ± 0.728
2.758PheGlu: 2.758 ± 0.682
1.136PhePhe: 1.136 ± 0.296
2.677PheGly: 2.677 ± 0.453
0.325PheHis: 0.325 ± 0.166
4.787PheIle: 4.787 ± 0.739
4.3PheLys: 4.3 ± 0.59
2.272PheLeu: 2.272 ± 0.42
1.055PheMet: 1.055 ± 0.328
3.489PheAsn: 3.489 ± 0.567
0.811PhePro: 0.811 ± 0.243
2.109PheGln: 2.109 ± 0.498
1.704PheArg: 1.704 ± 0.408
1.704PheSer: 1.704 ± 0.391
3.651PheThr: 3.651 ± 0.588
2.353PheVal: 2.353 ± 0.421
0.487PheTrp: 0.487 ± 0.175
1.055PheTyr: 1.055 ± 0.23
0.0PheXaa: 0.0 ± 0.0
Gly
4.949GlyAla: 4.949 ± 1.272
0.406GlyCys: 0.406 ± 0.175
3.164GlyAsp: 3.164 ± 0.41
4.381GlyGlu: 4.381 ± 0.613
3.002GlyPhe: 3.002 ± 0.413
4.381GlyGly: 4.381 ± 1.285
0.811GlyHis: 0.811 ± 0.247
6.085GlyIle: 6.085 ± 0.998
6.409GlyLys: 6.409 ± 0.702
5.598GlyLeu: 5.598 ± 0.687
1.785GlyMet: 1.785 ± 0.351
3.975GlyAsn: 3.975 ± 0.526
0.568GlyPro: 0.568 ± 0.243
2.19GlyGln: 2.19 ± 0.335
2.758GlyArg: 2.758 ± 0.405
3.083GlySer: 3.083 ± 0.481
3.407GlyThr: 3.407 ± 0.682
3.813GlyVal: 3.813 ± 0.701
1.298GlyTrp: 1.298 ± 0.318
3.083GlyTyr: 3.083 ± 0.483
0.0GlyXaa: 0.0 ± 0.0
His
0.974HisAla: 0.974 ± 0.223
0.243HisCys: 0.243 ± 0.156
0.811HisAsp: 0.811 ± 0.248
1.379HisGlu: 1.379 ± 0.369
0.649HisPhe: 0.649 ± 0.27
1.217HisGly: 1.217 ± 0.383
0.406HisHis: 0.406 ± 0.171
0.974HisIle: 0.974 ± 0.226
1.217HisLys: 1.217 ± 0.316
0.892HisLeu: 0.892 ± 0.273
0.162HisMet: 0.162 ± 0.112
0.974HisAsn: 0.974 ± 0.313
0.243HisPro: 0.243 ± 0.153
0.487HisGln: 0.487 ± 0.166
0.974HisArg: 0.974 ± 0.305
0.649HisSer: 0.649 ± 0.252
0.811HisThr: 0.811 ± 0.37
1.055HisVal: 1.055 ± 0.31
0.243HisTrp: 0.243 ± 0.128
0.974HisTyr: 0.974 ± 0.279
0.0HisXaa: 0.0 ± 0.0
Ile
3.975IleAla: 3.975 ± 0.554
0.487IleCys: 0.487 ± 0.202
5.517IleAsp: 5.517 ± 0.561
7.139IleGlu: 7.139 ± 1.086
2.19IlePhe: 2.19 ± 0.48
4.543IleGly: 4.543 ± 0.707
0.974IleHis: 0.974 ± 0.256
3.813IleIle: 3.813 ± 0.426
6.409IleLys: 6.409 ± 0.852
5.436IleLeu: 5.436 ± 0.633
1.46IleMet: 1.46 ± 0.444
4.624IleAsn: 4.624 ± 0.787
2.921IlePro: 2.921 ± 0.472
2.596IleGln: 2.596 ± 0.451
2.272IleArg: 2.272 ± 0.386
3.813IleSer: 3.813 ± 0.551
3.894IleThr: 3.894 ± 0.502
3.894IleVal: 3.894 ± 0.434
0.649IleTrp: 0.649 ± 0.187
1.866IleTyr: 1.866 ± 0.461
0.0IleXaa: 0.0 ± 0.0
Lys
6.977LysAla: 6.977 ± 0.925
0.73LysCys: 0.73 ± 0.281
5.679LysAsp: 5.679 ± 0.622
8.519LysGlu: 8.519 ± 1.141
3.083LysPhe: 3.083 ± 0.389
5.111LysGly: 5.111 ± 0.92
1.379LysHis: 1.379 ± 0.441
4.3LysIle: 4.3 ± 0.524
6.571LysLys: 6.571 ± 0.843
6.571LysLeu: 6.571 ± 0.863
3.57LysMet: 3.57 ± 0.489
5.273LysAsn: 5.273 ± 0.837
2.758LysPro: 2.758 ± 0.558
3.489LysGln: 3.489 ± 0.518
3.894LysArg: 3.894 ± 0.637
3.894LysSer: 3.894 ± 0.714
5.355LysThr: 5.355 ± 0.69
6.653LysVal: 6.653 ± 0.845
1.46LysTrp: 1.46 ± 0.335
3.732LysTyr: 3.732 ± 0.559
0.0LysXaa: 0.0 ± 0.0
Leu
4.056LeuAla: 4.056 ± 0.6
0.73LeuCys: 0.73 ± 0.243
5.922LeuAsp: 5.922 ± 0.828
8.843LeuGlu: 8.843 ± 0.929
3.813LeuPhe: 3.813 ± 0.483
5.03LeuGly: 5.03 ± 0.707
0.811LeuHis: 0.811 ± 0.218
4.868LeuIle: 4.868 ± 0.572
6.49LeuLys: 6.49 ± 0.962
7.626LeuLeu: 7.626 ± 1.013
1.785LeuMet: 1.785 ± 0.358
6.247LeuAsn: 6.247 ± 0.777
2.758LeuPro: 2.758 ± 0.455
4.462LeuGln: 4.462 ± 0.565
2.677LeuArg: 2.677 ± 0.446
3.894LeuSer: 3.894 ± 0.512
3.894LeuThr: 3.894 ± 0.556
5.76LeuVal: 5.76 ± 0.686
1.298LeuTrp: 1.298 ± 0.345
2.515LeuTyr: 2.515 ± 0.483
0.0LeuXaa: 0.0 ± 0.0
Met
1.055MetAla: 1.055 ± 0.286
0.325MetCys: 0.325 ± 0.16
1.623MetAsp: 1.623 ± 0.422
1.947MetGlu: 1.947 ± 0.508
1.217MetPhe: 1.217 ± 0.394
1.866MetGly: 1.866 ± 0.42
0.162MetHis: 0.162 ± 0.112
2.028MetIle: 2.028 ± 0.347
2.596MetLys: 2.596 ± 0.471
1.785MetLeu: 1.785 ± 0.382
0.243MetMet: 0.243 ± 0.21
1.947MetAsn: 1.947 ± 0.372
0.649MetPro: 0.649 ± 0.207
0.974MetGln: 0.974 ± 0.3
1.704MetArg: 1.704 ± 0.389
1.947MetSer: 1.947 ± 0.372
2.109MetThr: 2.109 ± 0.486
2.19MetVal: 2.19 ± 0.455
0.568MetTrp: 0.568 ± 0.251
1.623MetTyr: 1.623 ± 0.402
0.0MetXaa: 0.0 ± 0.0
Asn
5.111AsnAla: 5.111 ± 0.902
0.162AsnCys: 0.162 ± 0.127
3.57AsnAsp: 3.57 ± 0.587
5.436AsnGlu: 5.436 ± 0.685
2.109AsnPhe: 2.109 ± 0.452
6.571AsnGly: 6.571 ± 0.702
0.892AsnHis: 0.892 ± 0.227
3.813AsnIle: 3.813 ± 0.633
5.679AsnLys: 5.679 ± 0.713
4.138AsnLeu: 4.138 ± 0.492
1.704AsnMet: 1.704 ± 0.346
2.84AsnAsn: 2.84 ± 0.517
1.704AsnPro: 1.704 ± 0.401
1.947AsnGln: 1.947 ± 0.345
1.298AsnArg: 1.298 ± 0.304
3.326AsnSer: 3.326 ± 0.587
4.624AsnThr: 4.624 ± 0.775
3.489AsnVal: 3.489 ± 0.681
0.73AsnTrp: 0.73 ± 0.261
2.434AsnTyr: 2.434 ± 0.494
0.0AsnXaa: 0.0 ± 0.0
Pro
1.217ProAla: 1.217 ± 0.348
0.081ProCys: 0.081 ± 0.087
2.758ProAsp: 2.758 ± 0.463
2.921ProGlu: 2.921 ± 0.542
1.217ProPhe: 1.217 ± 0.319
0.0ProGly: 0.0 ± 0.0
0.162ProHis: 0.162 ± 0.11
1.541ProIle: 1.541 ± 0.284
2.353ProLys: 2.353 ± 0.636
2.84ProLeu: 2.84 ± 0.533
0.892ProMet: 0.892 ± 0.247
1.947ProAsn: 1.947 ± 0.429
0.406ProPro: 0.406 ± 0.225
1.704ProGln: 1.704 ± 0.379
0.811ProArg: 0.811 ± 0.229
1.541ProSer: 1.541 ± 0.375
2.19ProThr: 2.19 ± 0.508
2.272ProVal: 2.272 ± 0.316
0.325ProTrp: 0.325 ± 0.162
1.623ProTyr: 1.623 ± 0.474
0.0ProXaa: 0.0 ± 0.0
Gln
2.515GlnAla: 2.515 ± 0.422
0.243GlnCys: 0.243 ± 0.168
1.785GlnAsp: 1.785 ± 0.331
2.434GlnGlu: 2.434 ± 0.397
1.623GlnPhe: 1.623 ± 0.384
1.785GlnGly: 1.785 ± 0.409
0.568GlnHis: 0.568 ± 0.226
2.921GlnIle: 2.921 ± 0.455
2.028GlnLys: 2.028 ± 0.428
3.407GlnLeu: 3.407 ± 0.55
1.379GlnMet: 1.379 ± 0.437
1.46GlnAsn: 1.46 ± 0.308
1.298GlnPro: 1.298 ± 0.368
2.19GlnGln: 2.19 ± 0.377
2.109GlnArg: 2.109 ± 0.576
2.19GlnSer: 2.19 ± 0.461
1.947GlnThr: 1.947 ± 0.309
3.245GlnVal: 3.245 ± 0.48
0.406GlnTrp: 0.406 ± 0.256
2.596GlnTyr: 2.596 ± 0.455
0.0GlnXaa: 0.0 ± 0.0
Arg
1.785ArgAla: 1.785 ± 0.363
0.325ArgCys: 0.325 ± 0.146
2.353ArgAsp: 2.353 ± 0.512
2.19ArgGlu: 2.19 ± 0.475
1.541ArgPhe: 1.541 ± 0.346
1.785ArgGly: 1.785 ± 0.438
0.892ArgHis: 0.892 ± 0.375
2.272ArgIle: 2.272 ± 0.382
3.083ArgLys: 3.083 ± 0.653
3.083ArgLeu: 3.083 ± 0.567
0.892ArgMet: 0.892 ± 0.281
2.758ArgAsn: 2.758 ± 0.487
1.055ArgPro: 1.055 ± 0.353
1.298ArgGln: 1.298 ± 0.368
1.217ArgArg: 1.217 ± 0.279
1.623ArgSer: 1.623 ± 0.434
1.704ArgThr: 1.704 ± 0.396
2.109ArgVal: 2.109 ± 0.402
0.243ArgTrp: 0.243 ± 0.184
1.785ArgTyr: 1.785 ± 0.445
0.0ArgXaa: 0.0 ± 0.0
Ser
3.245SerAla: 3.245 ± 0.643
0.162SerCys: 0.162 ± 0.121
3.002SerAsp: 3.002 ± 0.55
3.651SerGlu: 3.651 ± 0.46
2.515SerPhe: 2.515 ± 0.358
4.543SerGly: 4.543 ± 0.894
1.541SerHis: 1.541 ± 0.345
3.813SerIle: 3.813 ± 0.516
4.624SerLys: 4.624 ± 0.828
3.326SerLeu: 3.326 ± 0.614
1.379SerMet: 1.379 ± 0.362
2.84SerAsn: 2.84 ± 0.48
0.974SerPro: 0.974 ± 0.234
2.19SerGln: 2.19 ± 0.459
0.892SerArg: 0.892 ± 0.29
2.596SerSer: 2.596 ± 0.466
3.57SerThr: 3.57 ± 0.727
2.677SerVal: 2.677 ± 0.433
0.649SerTrp: 0.649 ± 0.214
2.353SerTyr: 2.353 ± 0.628
0.0SerXaa: 0.0 ± 0.0
Thr
2.677ThrAla: 2.677 ± 0.444
0.081ThrCys: 0.081 ± 0.075
3.57ThrAsp: 3.57 ± 0.516
4.624ThrGlu: 4.624 ± 0.616
2.109ThrPhe: 2.109 ± 0.427
4.624ThrGly: 4.624 ± 0.743
1.623ThrHis: 1.623 ± 0.334
4.787ThrIle: 4.787 ± 0.675
7.383ThrLys: 7.383 ± 0.816
5.517ThrLeu: 5.517 ± 0.84
1.704ThrMet: 1.704 ± 0.403
3.164ThrAsn: 3.164 ± 0.524
2.758ThrPro: 2.758 ± 0.51
1.866ThrGln: 1.866 ± 0.441
1.704ThrArg: 1.704 ± 0.374
2.028ThrSer: 2.028 ± 0.444
4.219ThrThr: 4.219 ± 0.834
4.543ThrVal: 4.543 ± 0.6
0.487ThrTrp: 0.487 ± 0.202
2.353ThrTyr: 2.353 ± 0.553
0.0ThrXaa: 0.0 ± 0.0
Val
6.004ValAla: 6.004 ± 0.665
0.487ValCys: 0.487 ± 0.212
4.462ValAsp: 4.462 ± 0.476
4.787ValGlu: 4.787 ± 0.601
3.245ValPhe: 3.245 ± 0.464
4.138ValGly: 4.138 ± 0.684
0.73ValHis: 0.73 ± 0.296
4.138ValIle: 4.138 ± 0.528
6.247ValLys: 6.247 ± 0.987
5.273ValLeu: 5.273 ± 0.601
2.109ValMet: 2.109 ± 0.319
3.732ValAsn: 3.732 ± 0.501
1.704ValPro: 1.704 ± 0.31
2.272ValGln: 2.272 ± 0.501
1.947ValArg: 1.947 ± 0.405
5.111ValSer: 5.111 ± 0.715
3.245ValThr: 3.245 ± 0.572
4.3ValVal: 4.3 ± 0.602
0.892ValTrp: 0.892 ± 0.26
2.353ValTyr: 2.353 ± 0.508
0.0ValXaa: 0.0 ± 0.0
Trp
0.325TrpAla: 0.325 ± 0.164
0.243TrpCys: 0.243 ± 0.154
0.73TrpAsp: 0.73 ± 0.328
1.785TrpGlu: 1.785 ± 0.327
0.811TrpPhe: 0.811 ± 0.256
0.811TrpGly: 0.811 ± 0.242
0.406TrpHis: 0.406 ± 0.196
0.649TrpIle: 0.649 ± 0.238
0.974TrpLys: 0.974 ± 0.305
1.298TrpLeu: 1.298 ± 0.308
0.162TrpMet: 0.162 ± 0.129
0.73TrpAsn: 0.73 ± 0.24
0.0TrpPro: 0.0 ± 0.0
0.325TrpGln: 0.325 ± 0.14
0.649TrpArg: 0.649 ± 0.326
0.568TrpSer: 0.568 ± 0.205
0.649TrpThr: 0.649 ± 0.226
1.46TrpVal: 1.46 ± 0.256
0.325TrpTrp: 0.325 ± 0.132
0.243TrpTyr: 0.243 ± 0.149
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.515TyrAla: 2.515 ± 0.463
0.73TyrCys: 0.73 ± 0.246
3.407TyrAsp: 3.407 ± 0.587
4.056TyrGlu: 4.056 ± 0.701
1.947TyrPhe: 1.947 ± 0.399
1.785TyrGly: 1.785 ± 0.4
0.568TyrHis: 0.568 ± 0.174
3.489TyrIle: 3.489 ± 0.649
3.326TyrLys: 3.326 ± 0.472
3.813TyrLeu: 3.813 ± 0.612
0.811TyrMet: 0.811 ± 0.224
3.245TyrAsn: 3.245 ± 0.522
1.298TyrPro: 1.298 ± 0.35
1.136TyrGln: 1.136 ± 0.31
0.974TyrArg: 0.974 ± 0.303
2.19TyrSer: 2.19 ± 0.559
2.84TyrThr: 2.84 ± 0.697
2.434TyrVal: 2.434 ± 0.46
0.081TyrTrp: 0.081 ± 0.079
1.541TyrTyr: 1.541 ± 0.377
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (12327 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski