Amino acid dipeptide frequency for Enterococcus phage EFC-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
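The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build this page: the function and constant names are hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), every residue outside the 20 standard ones is mapped to X (Xaa), and frequencies are assumed to be normalized by the total number of overlapping residue pairs.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (assumption: any other symbol)

# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for a, b in zip(seq, seq[1:]):
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2 residues)")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def representation(per_mille):
    """Classify a frequency against the uniform expectation of 2.5 per mille."""
    if per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be marked in red
    if per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be marked in blue
    return "expected"


# Toy input, not the phage proteome.
toy = ["MKKLAE", "MKEL"]
freqs = dipeptide_per_mille(toy)
print(freqs["KK"], representation(freqs["KK"]))
```

With real data, the two toy sequences would be replaced by the protein sequences of the proteome, for example read from a FASTA file.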

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
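As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# LysLys is listed in the table as 9.979 per mille.
lys_lys = 9.979
print(per_mille_to_fraction(lys_lys))  # about 0.009979
print(per_mille_to_percent(lys_lys))   # about 0.9979 (percent)
```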

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.643AlaAla: 3.643 ± 0.701
0.396AlaCys: 0.396 ± 0.211
4.197AlaAsp: 4.197 ± 0.515
4.673AlaGlu: 4.673 ± 0.825
2.297AlaPhe: 2.297 ± 0.462
3.801AlaGly: 3.801 ± 0.625
1.267AlaHis: 1.267 ± 0.319
5.306AlaIle: 5.306 ± 1.085
6.098AlaLys: 6.098 ± 0.912
5.069AlaLeu: 5.069 ± 0.663
1.98AlaMet: 1.98 ± 0.48
4.514AlaAsn: 4.514 ± 0.643
1.505AlaPro: 1.505 ± 0.321
2.376AlaGln: 2.376 ± 0.446
2.138AlaArg: 2.138 ± 0.357
3.564AlaSer: 3.564 ± 0.495
4.118AlaThr: 4.118 ± 0.493
4.197AlaVal: 4.197 ± 0.694
1.267AlaTrp: 1.267 ± 0.283
1.742AlaTyr: 1.742 ± 0.317
0.0AlaXaa: 0.0 ± 0.0
Cys
0.396CysAla: 0.396 ± 0.18
0.0CysCys: 0.0 ± 0.0
0.317CysAsp: 0.317 ± 0.177
0.396CysGlu: 0.396 ± 0.19
0.238CysPhe: 0.238 ± 0.122
0.634CysGly: 0.634 ± 0.252
0.158CysHis: 0.158 ± 0.102
0.554CysIle: 0.554 ± 0.245
0.554CysLys: 0.554 ± 0.229
0.475CysLeu: 0.475 ± 0.202
0.317CysMet: 0.317 ± 0.167
0.634CysAsn: 0.634 ± 0.271
0.158CysPro: 0.158 ± 0.11
0.238CysGln: 0.238 ± 0.146
0.317CysArg: 0.317 ± 0.145
0.475CysSer: 0.475 ± 0.19
0.158CysThr: 0.158 ± 0.102
0.158CysVal: 0.158 ± 0.112
0.238CysTrp: 0.238 ± 0.142
0.238CysTyr: 0.238 ± 0.135
0.0CysXaa: 0.0 ± 0.0
Asp
3.722AspAla: 3.722 ± 0.492
0.871AspCys: 0.871 ± 0.268
4.197AspAsp: 4.197 ± 0.669
5.069AspGlu: 5.069 ± 0.52
2.217AspPhe: 2.217 ± 0.435
3.881AspGly: 3.881 ± 0.525
0.475AspHis: 0.475 ± 0.207
4.593AspIle: 4.593 ± 0.646
4.989AspLys: 4.989 ± 0.578
5.385AspLeu: 5.385 ± 0.68
1.346AspMet: 1.346 ± 0.258
3.564AspAsn: 3.564 ± 0.409
2.297AspPro: 2.297 ± 0.482
1.98AspGln: 1.98 ± 0.473
2.297AspArg: 2.297 ± 0.423
4.118AspSer: 4.118 ± 0.641
3.881AspThr: 3.881 ± 0.571
4.197AspVal: 4.197 ± 0.47
0.792AspTrp: 0.792 ± 0.269
3.089AspTyr: 3.089 ± 0.594
0.0AspXaa: 0.0 ± 0.0
Glu
5.544GluAla: 5.544 ± 0.8
0.554GluCys: 0.554 ± 0.184
3.801GluAsp: 3.801 ± 0.663
5.148GluGlu: 5.148 ± 0.956
3.247GluPhe: 3.247 ± 0.623
4.831GluGly: 4.831 ± 0.57
0.871GluHis: 0.871 ± 0.231
5.623GluIle: 5.623 ± 0.632
7.444GluLys: 7.444 ± 0.868
8.632GluLeu: 8.632 ± 0.945
2.851GluMet: 2.851 ± 0.473
5.86GluAsn: 5.86 ± 0.637
2.217GluPro: 2.217 ± 0.528
3.643GluGln: 3.643 ± 0.521
4.118GluArg: 4.118 ± 0.769
4.197GluSer: 4.197 ± 0.565
3.96GluThr: 3.96 ± 0.501
4.197GluVal: 4.197 ± 0.527
0.871GluTrp: 0.871 ± 0.255
3.168GluTyr: 3.168 ± 0.542
0.0GluXaa: 0.0 ± 0.0
Phe
2.534PheAla: 2.534 ± 0.407
0.238PheCys: 0.238 ± 0.133
2.93PheAsp: 2.93 ± 0.464
2.217PheGlu: 2.217 ± 0.439
0.713PhePhe: 0.713 ± 0.221
2.059PheGly: 2.059 ± 0.43
0.238PheHis: 0.238 ± 0.12
2.297PheIle: 2.297 ± 0.44
2.851PheLys: 2.851 ± 0.384
2.613PheLeu: 2.613 ± 0.511
0.95PheMet: 0.95 ± 0.324
2.297PheAsn: 2.297 ± 0.585
0.713PhePro: 0.713 ± 0.235
1.267PheGln: 1.267 ± 0.343
1.346PheArg: 1.346 ± 0.289
3.485PheSer: 3.485 ± 0.511
1.821PheThr: 1.821 ± 0.402
2.297PheVal: 2.297 ± 0.495
0.634PheTrp: 0.634 ± 0.266
1.346PheTyr: 1.346 ± 0.366
0.0PheXaa: 0.0 ± 0.0
Gly
3.722GlyAla: 3.722 ± 0.485
0.475GlyCys: 0.475 ± 0.19
3.881GlyAsp: 3.881 ± 0.599
3.881GlyGlu: 3.881 ± 0.554
2.455GlyPhe: 2.455 ± 0.464
3.485GlyGly: 3.485 ± 1.046
0.871GlyHis: 0.871 ± 0.33
3.801GlyIle: 3.801 ± 0.519
5.306GlyLys: 5.306 ± 0.744
5.464GlyLeu: 5.464 ± 0.679
1.584GlyMet: 1.584 ± 0.38
3.168GlyAsn: 3.168 ± 0.529
0.792GlyPro: 0.792 ± 0.234
2.217GlyGln: 2.217 ± 0.437
1.901GlyArg: 1.901 ± 0.294
3.564GlySer: 3.564 ± 0.474
3.485GlyThr: 3.485 ± 0.514
4.514GlyVal: 4.514 ± 0.851
0.713GlyTrp: 0.713 ± 0.275
3.089GlyTyr: 3.089 ± 0.487
0.0GlyXaa: 0.0 ± 0.0
His
0.317HisAla: 0.317 ± 0.133
0.396HisCys: 0.396 ± 0.178
0.396HisAsp: 0.396 ± 0.151
1.584HisGlu: 1.584 ± 0.349
0.396HisPhe: 0.396 ± 0.183
0.792HisGly: 0.792 ± 0.343
0.792HisHis: 0.792 ± 0.304
0.634HisIle: 0.634 ± 0.232
0.396HisLys: 0.396 ± 0.197
1.109HisLeu: 1.109 ± 0.256
0.079HisMet: 0.079 ± 0.091
0.792HisAsn: 0.792 ± 0.289
0.792HisPro: 0.792 ± 0.224
0.554HisGln: 0.554 ± 0.227
0.317HisArg: 0.317 ± 0.152
1.426HisSer: 1.426 ± 0.48
1.188HisThr: 1.188 ± 0.374
0.634HisVal: 0.634 ± 0.254
0.396HisTrp: 0.396 ± 0.167
0.713HisTyr: 0.713 ± 0.259
0.0HisXaa: 0.0 ± 0.0
Ile
4.514IleAla: 4.514 ± 0.532
0.238IleCys: 0.238 ± 0.119
5.94IleAsp: 5.94 ± 0.818
6.652IleGlu: 6.652 ± 0.692
1.98IlePhe: 1.98 ± 0.472
2.93IleGly: 2.93 ± 0.469
1.109IleHis: 1.109 ± 0.258
4.752IleIle: 4.752 ± 0.717
5.94IleLys: 5.94 ± 0.61
4.197IleLeu: 4.197 ± 0.61
0.634IleMet: 0.634 ± 0.203
4.989IleAsn: 4.989 ± 0.584
2.772IlePro: 2.772 ± 0.518
4.039IleGln: 4.039 ± 0.542
1.742IleArg: 1.742 ± 0.421
5.306IleSer: 5.306 ± 0.732
4.673IleThr: 4.673 ± 0.59
3.564IleVal: 3.564 ± 0.442
0.554IleTrp: 0.554 ± 0.221
2.376IleTyr: 2.376 ± 0.407
0.0IleXaa: 0.0 ± 0.0
Lys
5.544LysAla: 5.544 ± 0.533
0.475LysCys: 0.475 ± 0.206
4.514LysAsp: 4.514 ± 0.771
8.474LysGlu: 8.474 ± 0.831
2.93LysPhe: 2.93 ± 0.452
5.069LysGly: 5.069 ± 0.61
1.346LysHis: 1.346 ± 0.354
5.781LysIle: 5.781 ± 1.031
9.979LysLys: 9.979 ± 1.116
7.524LysLeu: 7.524 ± 0.783
2.772LysMet: 2.772 ± 0.502
4.831LysAsn: 4.831 ± 0.64
2.217LysPro: 2.217 ± 0.429
4.752LysGln: 4.752 ± 0.593
3.96LysArg: 3.96 ± 0.632
5.623LysSer: 5.623 ± 0.787
5.544LysThr: 5.544 ± 0.754
4.91LysVal: 4.91 ± 0.621
1.109LysTrp: 1.109 ± 0.27
3.405LysTyr: 3.405 ± 0.625
0.0LysXaa: 0.0 ± 0.0
Leu
6.573LeuAla: 6.573 ± 0.696
0.554LeuCys: 0.554 ± 0.212
5.544LeuAsp: 5.544 ± 0.637
6.89LeuGlu: 6.89 ± 0.704
2.613LeuPhe: 2.613 ± 0.403
5.227LeuGly: 5.227 ± 0.647
0.554LeuHis: 0.554 ± 0.296
4.356LeuIle: 4.356 ± 0.58
7.682LeuLys: 7.682 ± 0.726
6.336LeuLeu: 6.336 ± 0.804
2.693LeuMet: 2.693 ± 0.496
4.673LeuAsn: 4.673 ± 0.658
3.247LeuPro: 3.247 ± 0.526
2.851LeuGln: 2.851 ± 0.486
2.613LeuArg: 2.613 ± 0.41
6.336LeuSer: 6.336 ± 0.813
4.989LeuThr: 4.989 ± 0.548
5.544LeuVal: 5.544 ± 0.594
0.634LeuTrp: 0.634 ± 0.184
3.168LeuTyr: 3.168 ± 0.549
0.0LeuXaa: 0.0 ± 0.0
Met
2.297MetAla: 2.297 ± 0.475
0.158MetCys: 0.158 ± 0.121
2.059MetAsp: 2.059 ± 0.415
1.742MetGlu: 1.742 ± 0.343
0.713MetPhe: 0.713 ± 0.272
1.188MetGly: 1.188 ± 0.311
0.475MetHis: 0.475 ± 0.209
1.821MetIle: 1.821 ± 0.349
3.009MetLys: 3.009 ± 0.579
3.089MetLeu: 3.089 ± 0.604
0.317MetMet: 0.317 ± 0.161
1.505MetAsn: 1.505 ± 0.356
1.109MetPro: 1.109 ± 0.306
1.267MetGln: 1.267 ± 0.316
1.346MetArg: 1.346 ± 0.296
1.742MetSer: 1.742 ± 0.279
1.426MetThr: 1.426 ± 0.282
1.188MetVal: 1.188 ± 0.246
0.238MetTrp: 0.238 ± 0.139
0.634MetTyr: 0.634 ± 0.225
0.0MetXaa: 0.0 ± 0.0
Asn
4.277AsnAla: 4.277 ± 0.805
0.475AsnCys: 0.475 ± 0.176
3.485AsnAsp: 3.485 ± 0.513
4.514AsnGlu: 4.514 ± 0.656
1.98AsnPhe: 1.98 ± 0.427
4.356AsnGly: 4.356 ± 0.633
1.109AsnHis: 1.109 ± 0.453
4.356AsnIle: 4.356 ± 0.507
5.86AsnLys: 5.86 ± 0.574
3.96AsnLeu: 3.96 ± 0.48
1.426AsnMet: 1.426 ± 0.341
3.405AsnAsn: 3.405 ± 0.785
2.376AsnPro: 2.376 ± 0.468
3.564AsnGln: 3.564 ± 0.652
2.851AsnArg: 2.851 ± 0.494
3.722AsnSer: 3.722 ± 0.769
3.009AsnThr: 3.009 ± 0.465
3.326AsnVal: 3.326 ± 0.485
0.634AsnTrp: 0.634 ± 0.242
1.901AsnTyr: 1.901 ± 0.364
0.0AsnXaa: 0.0 ± 0.0
Pro
1.505ProAla: 1.505 ± 0.33
0.079ProCys: 0.079 ± 0.077
2.217ProAsp: 2.217 ± 0.511
2.534ProGlu: 2.534 ± 0.487
1.109ProPhe: 1.109 ± 0.29
1.426ProGly: 1.426 ± 0.478
0.554ProHis: 0.554 ± 0.223
2.455ProIle: 2.455 ± 0.521
3.009ProLys: 3.009 ± 0.501
2.297ProLeu: 2.297 ± 0.408
0.713ProMet: 0.713 ± 0.26
2.217ProAsn: 2.217 ± 0.412
0.95ProPro: 0.95 ± 0.322
1.188ProGln: 1.188 ± 0.413
0.554ProArg: 0.554 ± 0.212
2.93ProSer: 2.93 ± 0.453
1.584ProThr: 1.584 ± 0.339
2.138ProVal: 2.138 ± 0.498
0.158ProTrp: 0.158 ± 0.127
1.188ProTyr: 1.188 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
3.405GlnAla: 3.405 ± 0.568
0.317GlnCys: 0.317 ± 0.149
1.663GlnAsp: 1.663 ± 0.387
4.752GlnGlu: 4.752 ± 0.628
1.188GlnPhe: 1.188 ± 0.25
2.217GlnGly: 2.217 ± 0.408
0.238GlnHis: 0.238 ± 0.139
2.693GlnIle: 2.693 ± 0.404
3.643GlnLys: 3.643 ± 0.573
3.881GlnLeu: 3.881 ± 0.514
1.426GlnMet: 1.426 ± 0.352
2.217GlnAsn: 2.217 ± 0.345
1.426GlnPro: 1.426 ± 0.464
3.247GlnGln: 3.247 ± 0.81
1.426GlnArg: 1.426 ± 0.405
4.039GlnSer: 4.039 ± 0.576
2.455GlnThr: 2.455 ± 0.334
2.217GlnVal: 2.217 ± 0.379
0.634GlnTrp: 0.634 ± 0.235
1.505GlnTyr: 1.505 ± 0.32
0.0GlnXaa: 0.0 ± 0.0
Arg
2.217ArgAla: 2.217 ± 0.395
0.238ArgCys: 0.238 ± 0.13
1.505ArgAsp: 1.505 ± 0.347
2.851ArgGlu: 2.851 ± 0.447
1.663ArgPhe: 1.663 ± 0.397
1.98ArgGly: 1.98 ± 0.362
0.396ArgHis: 0.396 ± 0.174
2.376ArgIle: 2.376 ± 0.424
3.643ArgLys: 3.643 ± 0.586
4.039ArgLeu: 4.039 ± 0.597
1.584ArgMet: 1.584 ± 0.382
2.059ArgAsn: 2.059 ± 0.417
0.792ArgPro: 0.792 ± 0.272
1.584ArgGln: 1.584 ± 0.418
1.584ArgArg: 1.584 ± 0.395
1.98ArgSer: 1.98 ± 0.388
1.742ArgThr: 1.742 ± 0.343
2.693ArgVal: 2.693 ± 0.445
0.317ArgTrp: 0.317 ± 0.148
1.663ArgTyr: 1.663 ± 0.396
0.0ArgXaa: 0.0 ± 0.0
Ser
3.326SerAla: 3.326 ± 0.631
0.238SerCys: 0.238 ± 0.142
4.356SerAsp: 4.356 ± 0.562
5.306SerGlu: 5.306 ± 0.701
2.534SerPhe: 2.534 ± 0.384
4.435SerGly: 4.435 ± 0.696
0.871SerHis: 0.871 ± 0.406
5.227SerIle: 5.227 ± 0.817
6.098SerLys: 6.098 ± 1.016
5.702SerLeu: 5.702 ± 0.769
2.059SerMet: 2.059 ± 0.431
4.039SerAsn: 4.039 ± 0.577
1.901SerPro: 1.901 ± 0.367
2.93SerGln: 2.93 ± 0.508
2.138SerArg: 2.138 ± 0.38
4.118SerSer: 4.118 ± 0.955
4.989SerThr: 4.989 ± 0.741
3.722SerVal: 3.722 ± 0.459
1.03SerTrp: 1.03 ± 0.245
2.217SerTyr: 2.217 ± 0.396
0.0SerXaa: 0.0 ± 0.0
Thr
5.227ThrAla: 5.227 ± 0.795
0.158ThrCys: 0.158 ± 0.102
3.564ThrAsp: 3.564 ± 0.665
4.831ThrGlu: 4.831 ± 0.684
2.297ThrPhe: 2.297 ± 0.52
4.277ThrGly: 4.277 ± 0.665
0.713ThrHis: 0.713 ± 0.289
4.593ThrIle: 4.593 ± 0.537
4.435ThrLys: 4.435 ± 0.633
4.514ThrLeu: 4.514 ± 0.544
1.505ThrMet: 1.505 ± 0.292
2.613ThrAsn: 2.613 ± 0.408
1.426ThrPro: 1.426 ± 0.338
2.772ThrGln: 2.772 ± 0.576
1.346ThrArg: 1.346 ± 0.29
4.039ThrSer: 4.039 ± 0.648
3.643ThrThr: 3.643 ± 0.519
4.039ThrVal: 4.039 ± 0.733
0.792ThrTrp: 0.792 ± 0.241
1.821ThrTyr: 1.821 ± 0.311
0.0ThrXaa: 0.0 ± 0.0
Val
3.247ValAla: 3.247 ± 0.611
0.475ValCys: 0.475 ± 0.21
4.514ValAsp: 4.514 ± 0.49
4.831ValGlu: 4.831 ± 0.577
2.455ValPhe: 2.455 ± 0.481
3.089ValGly: 3.089 ± 0.59
0.792ValHis: 0.792 ± 0.239
4.197ValIle: 4.197 ± 0.564
5.623ValLys: 5.623 ± 0.681
5.148ValLeu: 5.148 ± 0.684
1.426ValMet: 1.426 ± 0.255
4.039ValAsn: 4.039 ± 0.767
2.059ValPro: 2.059 ± 0.345
2.297ValGln: 2.297 ± 0.431
2.851ValArg: 2.851 ± 0.46
3.881ValSer: 3.881 ± 0.712
3.089ValThr: 3.089 ± 0.592
3.881ValVal: 3.881 ± 0.664
0.475ValTrp: 0.475 ± 0.227
2.297ValTyr: 2.297 ± 0.448
0.0ValXaa: 0.0 ± 0.0
Trp
0.634TrpAla: 0.634 ± 0.184
0.079TrpCys: 0.079 ± 0.08
1.03TrpAsp: 1.03 ± 0.299
1.267TrpGlu: 1.267 ± 0.347
0.713TrpPhe: 0.713 ± 0.236
0.792TrpGly: 0.792 ± 0.276
0.317TrpHis: 0.317 ± 0.165
0.95TrpIle: 0.95 ± 0.292
0.792TrpLys: 0.792 ± 0.273
0.792TrpLeu: 0.792 ± 0.278
0.238TrpMet: 0.238 ± 0.128
0.95TrpAsn: 0.95 ± 0.242
0.238TrpPro: 0.238 ± 0.121
0.396TrpGln: 0.396 ± 0.138
0.475TrpArg: 0.475 ± 0.173
0.634TrpSer: 0.634 ± 0.18
0.713TrpThr: 0.713 ± 0.221
0.792TrpVal: 0.792 ± 0.26
0.079TrpTrp: 0.079 ± 0.093
0.238TrpTyr: 0.238 ± 0.144
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.505TyrAla: 1.505 ± 0.301
0.238TyrCys: 0.238 ± 0.128
2.851TyrAsp: 2.851 ± 0.588
3.168TyrGlu: 3.168 ± 0.517
1.109TyrPhe: 1.109 ± 0.303
1.901TyrGly: 1.901 ± 0.444
0.554TyrHis: 0.554 ± 0.225
2.534TyrIle: 2.534 ± 0.409
3.405TyrLys: 3.405 ± 0.688
2.93TyrLeu: 2.93 ± 0.457
1.346TyrMet: 1.346 ± 0.363
2.217TyrAsn: 2.217 ± 0.467
1.821TyrPro: 1.821 ± 0.418
1.426TyrGln: 1.426 ± 0.359
1.584TyrArg: 1.584 ± 0.387
2.059TyrSer: 2.059 ± 0.399
2.138TyrThr: 2.138 ± 0.377
2.455TyrVal: 2.455 ± 0.501
0.554TyrTrp: 0.554 ± 0.27
1.188TyrTyr: 1.188 ± 0.425
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (12628 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
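Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the statistic is recomputed on each resampled set, and the spread of the recomputed values serves as the error estimate. This is a minimal sketch, not the database's actual code. The function names are hypothetical, and reporting the mean and standard deviation of the replicates is an assumption, since the page does not state which spread measure is used.

```python
import random
from statistics import mean, stdev


def bootstrap_protein_level(proteins, statistic, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of `statistic` over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so residues within a protein are never resampled independently.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(statistic(sample))
    return mean(values), stdev(values)


def lys_lys_per_mille(proteins):
    """Frequency of the Lys-Lys dipeptide (one-letter 'KK') in per mille."""
    pairs = [a + b for seq in proteins for a, b in zip(seq, seq[1:])]
    if not pairs:
        return 0.0
    return 1000.0 * pairs.count("KK") / len(pairs)


# Toy input, not the phage proteome.
toy = ["MKKLAE", "MKEL", "MAKKKV", "MLLE"]
estimate, error = bootstrap_protein_level(toy, lys_lys_per_mille)
print(f"KK: {estimate:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's composition intact, so the error reflects variation between proteins.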

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski