Amino acid dipepetide frequency for Enterococcus phage EF62phi

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.511AlaAla: 1.511 ± 0.415
0.648AlaCys: 0.648 ± 0.322
2.267AlaAsp: 2.267 ± 0.474
4.426AlaGlu: 4.426 ± 0.671
3.671AlaPhe: 3.671 ± 0.473
3.563AlaGly: 3.563 ± 0.975
0.432AlaHis: 0.432 ± 0.226
4.21AlaIle: 4.21 ± 0.817
4.318AlaLys: 4.318 ± 0.634
4.102AlaLeu: 4.102 ± 0.664
0.972AlaMet: 0.972 ± 0.276
2.375AlaAsn: 2.375 ± 0.525
2.159AlaPro: 2.159 ± 0.493
1.511AlaGln: 1.511 ± 0.386
1.511AlaArg: 1.511 ± 0.359
3.023AlaSer: 3.023 ± 0.691
3.131AlaThr: 3.131 ± 0.547
3.023AlaVal: 3.023 ± 0.482
0.648AlaTrp: 0.648 ± 0.238
2.915AlaTyr: 2.915 ± 0.508
0.0AlaXaa: 0.0 ± 0.0
Cys
0.108CysAla: 0.108 ± 0.113
0.108CysCys: 0.108 ± 0.101
0.648CysAsp: 0.648 ± 0.266
0.972CysGlu: 0.972 ± 0.289
0.648CysPhe: 0.648 ± 0.338
0.216CysGly: 0.216 ± 0.145
0.216CysHis: 0.216 ± 0.134
0.432CysIle: 0.432 ± 0.198
0.432CysLys: 0.432 ± 0.177
1.188CysLeu: 1.188 ± 0.421
0.0CysMet: 0.0 ± 0.0
1.188CysAsn: 1.188 ± 0.321
0.216CysPro: 0.216 ± 0.141
0.108CysGln: 0.108 ± 0.109
0.216CysArg: 0.216 ± 0.177
0.216CysSer: 0.216 ± 0.15
0.216CysThr: 0.216 ± 0.145
0.54CysVal: 0.54 ± 0.202
0.0CysTrp: 0.0 ± 0.0
0.216CysTyr: 0.216 ± 0.143
0.0CysXaa: 0.0 ± 0.0
Asp
2.699AspAla: 2.699 ± 0.491
0.972AspCys: 0.972 ± 0.468
2.807AspAsp: 2.807 ± 0.606
4.426AspGlu: 4.426 ± 0.663
3.347AspPhe: 3.347 ± 0.494
4.102AspGly: 4.102 ± 0.554
0.756AspHis: 0.756 ± 0.26
5.182AspIle: 5.182 ± 0.763
4.534AspLys: 4.534 ± 0.707
6.261AspLeu: 6.261 ± 0.86
1.08AspMet: 1.08 ± 0.351
3.994AspAsn: 3.994 ± 0.623
1.727AspPro: 1.727 ± 0.389
1.727AspGln: 1.727 ± 0.407
2.915AspArg: 2.915 ± 0.499
3.023AspSer: 3.023 ± 0.564
3.563AspThr: 3.563 ± 0.544
3.671AspVal: 3.671 ± 0.687
0.756AspTrp: 0.756 ± 0.309
3.886AspTyr: 3.886 ± 0.597
0.0AspXaa: 0.0 ± 0.0
Glu
4.318GluAla: 4.318 ± 0.765
0.432GluCys: 0.432 ± 0.181
4.966GluAsp: 4.966 ± 0.765
9.716GluGlu: 9.716 ± 1.555
4.858GluPhe: 4.858 ± 0.573
4.102GluGly: 4.102 ± 0.717
0.648GluHis: 0.648 ± 0.247
5.722GluIle: 5.722 ± 0.801
7.989GluLys: 7.989 ± 0.753
6.801GluLeu: 6.801 ± 0.695
1.835GluMet: 1.835 ± 0.503
6.154GluAsn: 6.154 ± 0.708
2.159GluPro: 2.159 ± 0.528
3.671GluGln: 3.671 ± 1.005
3.671GluArg: 3.671 ± 0.557
3.239GluSer: 3.239 ± 0.538
5.398GluThr: 5.398 ± 0.612
4.75GluVal: 4.75 ± 0.751
1.295GluTrp: 1.295 ± 0.488
3.886GluTyr: 3.886 ± 0.673
0.0GluXaa: 0.0 ± 0.0
Phe
2.483PheAla: 2.483 ± 0.469
0.324PheCys: 0.324 ± 0.196
2.807PheAsp: 2.807 ± 0.664
4.318PheGlu: 4.318 ± 0.822
1.727PhePhe: 1.727 ± 0.497
2.267PheGly: 2.267 ± 0.408
0.432PheHis: 0.432 ± 0.229
3.239PheIle: 3.239 ± 0.565
4.318PheLys: 4.318 ± 0.688
3.671PheLeu: 3.671 ± 0.841
1.08PheMet: 1.08 ± 0.329
2.807PheAsn: 2.807 ± 0.506
1.727PhePro: 1.727 ± 0.336
2.591PheGln: 2.591 ± 0.611
2.159PheArg: 2.159 ± 0.458
3.239PheSer: 3.239 ± 0.532
1.295PheThr: 1.295 ± 0.335
3.023PheVal: 3.023 ± 0.576
0.432PheTrp: 0.432 ± 0.172
2.807PheTyr: 2.807 ± 0.477
0.0PheXaa: 0.0 ± 0.0
Gly
2.699GlyAla: 2.699 ± 0.642
0.432GlyCys: 0.432 ± 0.196
3.239GlyAsp: 3.239 ± 0.695
2.807GlyGlu: 2.807 ± 0.656
2.159GlyPhe: 2.159 ± 0.504
4.102GlyGly: 4.102 ± 0.786
0.756GlyHis: 0.756 ± 0.255
4.21GlyIle: 4.21 ± 0.706
5.938GlyLys: 5.938 ± 0.738
5.074GlyLeu: 5.074 ± 0.632
1.188GlyMet: 1.188 ± 0.349
5.614GlyAsn: 5.614 ± 0.794
1.08GlyPro: 1.08 ± 0.533
1.403GlyGln: 1.403 ± 0.361
2.159GlyArg: 2.159 ± 0.49
3.023GlySer: 3.023 ± 0.613
2.915GlyThr: 2.915 ± 0.554
4.426GlyVal: 4.426 ± 0.671
0.972GlyTrp: 0.972 ± 0.308
3.455GlyTyr: 3.455 ± 0.588
0.0GlyXaa: 0.0 ± 0.0
His
0.756HisAla: 0.756 ± 0.288
0.216HisCys: 0.216 ± 0.177
0.972HisAsp: 0.972 ± 0.285
1.403HisGlu: 1.403 ± 0.336
0.756HisPhe: 0.756 ± 0.24
0.324HisGly: 0.324 ± 0.194
0.324HisHis: 0.324 ± 0.24
0.54HisIle: 0.54 ± 0.27
0.648HisLys: 0.648 ± 0.269
1.403HisLeu: 1.403 ± 0.427
0.432HisMet: 0.432 ± 0.21
0.648HisAsn: 0.648 ± 0.24
0.54HisPro: 0.54 ± 0.188
0.108HisGln: 0.108 ± 0.101
0.432HisArg: 0.432 ± 0.211
0.54HisSer: 0.54 ± 0.22
0.648HisThr: 0.648 ± 0.287
0.972HisVal: 0.972 ± 0.395
0.108HisTrp: 0.108 ± 0.108
1.295HisTyr: 1.295 ± 0.345
0.0HisXaa: 0.0 ± 0.0
Ile
3.455IleAla: 3.455 ± 0.53
0.54IleCys: 0.54 ± 0.242
4.318IleAsp: 4.318 ± 0.704
6.693IleGlu: 6.693 ± 0.886
2.699IlePhe: 2.699 ± 0.522
3.239IleGly: 3.239 ± 0.672
1.188IleHis: 1.188 ± 0.342
3.671IleIle: 3.671 ± 0.797
6.154IleLys: 6.154 ± 0.668
5.614IleLeu: 5.614 ± 0.721
1.835IleMet: 1.835 ± 0.46
6.477IleAsn: 6.477 ± 1.03
2.699IlePro: 2.699 ± 0.414
3.455IleGln: 3.455 ± 0.606
3.455IleArg: 3.455 ± 0.665
4.858IleSer: 4.858 ± 0.962
4.21IleThr: 4.21 ± 0.59
3.023IleVal: 3.023 ± 0.525
0.648IleTrp: 0.648 ± 0.292
3.886IleTyr: 3.886 ± 0.664
0.0IleXaa: 0.0 ± 0.0
Lys
5.074LysAla: 5.074 ± 0.788
0.648LysCys: 0.648 ± 0.283
5.506LysAsp: 5.506 ± 0.587
8.313LysGlu: 8.313 ± 0.789
2.807LysPhe: 2.807 ± 0.499
4.21LysGly: 4.21 ± 0.72
0.864LysHis: 0.864 ± 0.274
5.83LysIle: 5.83 ± 0.695
8.097LysLys: 8.097 ± 0.919
5.83LysLeu: 5.83 ± 0.822
3.023LysMet: 3.023 ± 0.573
6.693LysAsn: 6.693 ± 0.922
3.455LysPro: 3.455 ± 0.502
4.318LysGln: 4.318 ± 0.901
4.858LysArg: 4.858 ± 0.694
3.994LysSer: 3.994 ± 0.605
4.858LysThr: 4.858 ± 0.714
4.966LysVal: 4.966 ± 0.634
0.432LysTrp: 0.432 ± 0.204
3.563LysTyr: 3.563 ± 0.474
0.0LysXaa: 0.0 ± 0.0
Leu
5.182LeuAla: 5.182 ± 0.622
0.432LeuCys: 0.432 ± 0.198
5.83LeuAsp: 5.83 ± 0.814
6.369LeuGlu: 6.369 ± 0.969
4.318LeuPhe: 4.318 ± 0.893
4.75LeuGly: 4.75 ± 0.9
0.972LeuHis: 0.972 ± 0.443
4.966LeuIle: 4.966 ± 0.583
7.341LeuLys: 7.341 ± 0.74
6.477LeuLeu: 6.477 ± 0.749
2.267LeuMet: 2.267 ± 0.409
5.83LeuAsn: 5.83 ± 0.906
3.886LeuPro: 3.886 ± 0.602
3.347LeuGln: 3.347 ± 0.618
3.239LeuArg: 3.239 ± 0.586
5.83LeuSer: 5.83 ± 0.865
3.994LeuThr: 3.994 ± 0.798
4.318LeuVal: 4.318 ± 0.642
0.324LeuTrp: 0.324 ± 0.142
2.699LeuTyr: 2.699 ± 0.449
0.0LeuXaa: 0.0 ± 0.0
Met
1.835MetAla: 1.835 ± 0.416
0.216MetCys: 0.216 ± 0.145
1.619MetAsp: 1.619 ± 0.42
2.483MetGlu: 2.483 ± 0.452
1.08MetPhe: 1.08 ± 0.368
0.972MetGly: 0.972 ± 0.365
0.216MetHis: 0.216 ± 0.136
2.051MetIle: 2.051 ± 0.394
2.483MetLys: 2.483 ± 0.489
2.159MetLeu: 2.159 ± 0.638
0.54MetMet: 0.54 ± 0.223
1.511MetAsn: 1.511 ± 0.426
0.864MetPro: 0.864 ± 0.307
0.432MetGln: 0.432 ± 0.185
0.756MetArg: 0.756 ± 0.249
1.943MetSer: 1.943 ± 0.354
1.619MetThr: 1.619 ± 0.361
0.972MetVal: 0.972 ± 0.304
0.0MetTrp: 0.0 ± 0.0
0.972MetTyr: 0.972 ± 0.331
0.0MetXaa: 0.0 ± 0.0
Asn
3.671AsnAla: 3.671 ± 0.75
0.432AsnCys: 0.432 ± 0.181
3.994AsnAsp: 3.994 ± 0.658
6.046AsnGlu: 6.046 ± 0.602
2.807AsnPhe: 2.807 ± 0.544
4.858AsnGly: 4.858 ± 0.719
1.188AsnHis: 1.188 ± 0.271
5.074AsnIle: 5.074 ± 0.912
6.585AsnLys: 6.585 ± 0.96
4.858AsnLeu: 4.858 ± 0.677
2.483AsnMet: 2.483 ± 0.433
5.938AsnAsn: 5.938 ± 0.904
3.131AsnPro: 3.131 ± 0.742
3.671AsnGln: 3.671 ± 0.607
1.943AsnArg: 1.943 ± 0.482
3.886AsnSer: 3.886 ± 0.622
3.778AsnThr: 3.778 ± 0.612
3.886AsnVal: 3.886 ± 0.695
0.864AsnTrp: 0.864 ± 0.257
3.886AsnTyr: 3.886 ± 0.73
0.0AsnXaa: 0.0 ± 0.0
Pro
2.159ProAla: 2.159 ± 0.523
0.216ProCys: 0.216 ± 0.147
2.591ProAsp: 2.591 ± 0.417
3.023ProGlu: 3.023 ± 0.555
1.188ProPhe: 1.188 ± 0.382
1.943ProGly: 1.943 ± 0.649
0.108ProHis: 0.108 ± 0.131
2.051ProIle: 2.051 ± 0.403
2.375ProLys: 2.375 ± 0.512
3.239ProLeu: 3.239 ± 0.591
0.756ProMet: 0.756 ± 0.252
2.051ProAsn: 2.051 ± 0.514
1.511ProPro: 1.511 ± 0.709
1.295ProGln: 1.295 ± 0.296
0.648ProArg: 0.648 ± 0.294
2.483ProSer: 2.483 ± 0.453
1.835ProThr: 1.835 ± 0.412
2.267ProVal: 2.267 ± 0.447
0.648ProTrp: 0.648 ± 0.215
1.943ProTyr: 1.943 ± 0.291
0.0ProXaa: 0.0 ± 0.0
Gln
1.943GlnAla: 1.943 ± 0.683
0.324GlnCys: 0.324 ± 0.183
1.943GlnAsp: 1.943 ± 0.602
4.102GlnGlu: 4.102 ± 0.742
1.835GlnPhe: 1.835 ± 0.421
2.483GlnGly: 2.483 ± 0.496
0.432GlnHis: 0.432 ± 0.219
2.807GlnIle: 2.807 ± 0.471
3.347GlnLys: 3.347 ± 0.513
3.886GlnLeu: 3.886 ± 0.597
0.756GlnMet: 0.756 ± 0.285
3.239GlnAsn: 3.239 ± 0.682
1.08GlnPro: 1.08 ± 0.286
1.511GlnGln: 1.511 ± 0.321
1.08GlnArg: 1.08 ± 0.347
2.159GlnSer: 2.159 ± 0.5
2.375GlnThr: 2.375 ± 0.36
1.188GlnVal: 1.188 ± 0.305
0.864GlnTrp: 0.864 ± 0.356
1.511GlnTyr: 1.511 ± 0.328
0.0GlnXaa: 0.0 ± 0.0
Arg
1.727ArgAla: 1.727 ± 0.351
0.216ArgCys: 0.216 ± 0.141
2.267ArgAsp: 2.267 ± 0.406
1.835ArgGlu: 1.835 ± 0.507
1.511ArgPhe: 1.511 ± 0.329
1.511ArgGly: 1.511 ± 0.432
0.648ArgHis: 0.648 ± 0.226
3.023ArgIle: 3.023 ± 0.517
3.778ArgLys: 3.778 ± 0.651
4.426ArgLeu: 4.426 ± 0.841
1.727ArgMet: 1.727 ± 0.415
2.591ArgAsn: 2.591 ± 0.542
0.972ArgPro: 0.972 ± 0.338
1.619ArgGln: 1.619 ± 0.467
1.619ArgArg: 1.619 ± 0.456
1.08ArgSer: 1.08 ± 0.225
2.591ArgThr: 2.591 ± 0.459
2.591ArgVal: 2.591 ± 0.51
0.324ArgTrp: 0.324 ± 0.158
1.08ArgTyr: 1.08 ± 0.324
0.0ArgXaa: 0.0 ± 0.0
Ser
1.835SerAla: 1.835 ± 0.479
0.216SerCys: 0.216 ± 0.163
3.778SerAsp: 3.778 ± 0.586
4.534SerGlu: 4.534 ± 0.7
2.267SerPhe: 2.267 ± 0.574
2.915SerGly: 2.915 ± 0.428
0.864SerHis: 0.864 ± 0.269
3.994SerIle: 3.994 ± 0.575
5.29SerLys: 5.29 ± 0.774
5.614SerLeu: 5.614 ± 0.852
1.188SerMet: 1.188 ± 0.347
4.426SerAsn: 4.426 ± 0.761
1.511SerPro: 1.511 ± 0.336
1.943SerGln: 1.943 ± 0.392
1.835SerArg: 1.835 ± 0.411
3.671SerSer: 3.671 ± 1.08
2.915SerThr: 2.915 ± 0.535
3.131SerVal: 3.131 ± 0.435
0.972SerTrp: 0.972 ± 0.27
3.023SerTyr: 3.023 ± 0.513
0.0SerXaa: 0.0 ± 0.0
Thr
2.699ThrAla: 2.699 ± 0.465
0.108ThrCys: 0.108 ± 0.112
3.023ThrAsp: 3.023 ± 0.622
4.102ThrGlu: 4.102 ± 0.664
2.699ThrPhe: 2.699 ± 0.44
4.426ThrGly: 4.426 ± 0.693
0.756ThrHis: 0.756 ± 0.271
5.722ThrIle: 5.722 ± 0.825
4.75ThrLys: 4.75 ± 0.835
3.778ThrLeu: 3.778 ± 0.478
0.864ThrMet: 0.864 ± 0.263
3.455ThrAsn: 3.455 ± 0.627
2.591ThrPro: 2.591 ± 0.606
1.835ThrGln: 1.835 ± 0.374
0.972ThrArg: 0.972 ± 0.324
3.671ThrSer: 3.671 ± 0.644
3.778ThrThr: 3.778 ± 0.59
3.455ThrVal: 3.455 ± 0.459
0.756ThrTrp: 0.756 ± 0.323
2.807ThrTyr: 2.807 ± 0.605
0.0ThrXaa: 0.0 ± 0.0
Val
2.699ValAla: 2.699 ± 0.574
0.756ValCys: 0.756 ± 0.296
3.778ValAsp: 3.778 ± 0.816
4.534ValGlu: 4.534 ± 0.893
2.591ValPhe: 2.591 ± 0.513
2.915ValGly: 2.915 ± 0.631
0.864ValHis: 0.864 ± 0.297
4.966ValIle: 4.966 ± 0.869
4.642ValLys: 4.642 ± 0.792
4.642ValLeu: 4.642 ± 0.758
1.943ValMet: 1.943 ± 0.364
3.671ValAsn: 3.671 ± 0.522
1.511ValPro: 1.511 ± 0.345
1.835ValGln: 1.835 ± 0.399
1.511ValArg: 1.511 ± 0.316
3.347ValSer: 3.347 ± 0.68
3.455ValThr: 3.455 ± 0.627
3.563ValVal: 3.563 ± 0.624
0.54ValTrp: 0.54 ± 0.296
2.267ValTyr: 2.267 ± 0.455
0.0ValXaa: 0.0 ± 0.0
Trp
0.972TrpAla: 0.972 ± 0.364
0.216TrpCys: 0.216 ± 0.138
1.188TrpAsp: 1.188 ± 0.376
0.972TrpGlu: 0.972 ± 0.324
0.432TrpPhe: 0.432 ± 0.238
1.08TrpGly: 1.08 ± 0.299
0.54TrpHis: 0.54 ± 0.215
1.295TrpIle: 1.295 ± 0.394
0.648TrpLys: 0.648 ± 0.275
0.54TrpLeu: 0.54 ± 0.189
0.0TrpMet: 0.0 ± 0.0
0.54TrpAsn: 0.54 ± 0.249
0.108TrpPro: 0.108 ± 0.105
0.864TrpGln: 0.864 ± 0.303
0.54TrpArg: 0.54 ± 0.252
0.216TrpSer: 0.216 ± 0.15
0.54TrpThr: 0.54 ± 0.23
0.324TrpVal: 0.324 ± 0.164
0.108TrpTrp: 0.108 ± 0.103
0.324TrpTyr: 0.324 ± 0.163
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.591TyrAla: 2.591 ± 0.53
0.432TyrCys: 0.432 ± 0.191
3.886TyrAsp: 3.886 ± 0.673
4.534TyrGlu: 4.534 ± 0.814
3.347TyrPhe: 3.347 ± 0.586
3.671TyrGly: 3.671 ± 0.638
0.864TyrHis: 0.864 ± 0.294
3.239TyrIle: 3.239 ± 0.56
3.671TyrLys: 3.671 ± 0.638
2.807TyrLeu: 2.807 ± 0.637
0.648TyrMet: 0.648 ± 0.255
3.778TyrAsn: 3.778 ± 0.7
1.403TyrPro: 1.403 ± 0.426
1.619TyrGln: 1.619 ± 0.481
1.619TyrArg: 1.619 ± 0.431
2.483TyrSer: 2.483 ± 0.552
3.131TyrThr: 3.131 ± 0.608
1.943TyrVal: 1.943 ± 0.389
0.756TyrTrp: 0.756 ± 0.317
1.295TyrTyr: 1.295 ± 0.337
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (9264 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski