Amino acid dipepetide frequency for Bacteroides phage p00

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.269AlaAla: 6.269 ± 1.307
0.261AlaCys: 0.261 ± 0.193
5.224AlaAsp: 5.224 ± 0.837
6.53AlaGlu: 6.53 ± 1.311
2.351AlaPhe: 2.351 ± 0.584
5.355AlaGly: 5.355 ± 0.962
1.045AlaHis: 1.045 ± 0.526
2.743AlaIle: 2.743 ± 0.605
4.44AlaLys: 4.44 ± 0.701
7.314AlaLeu: 7.314 ± 1.049
1.567AlaMet: 1.567 ± 0.341
2.612AlaAsn: 2.612 ± 0.507
1.959AlaPro: 1.959 ± 0.636
2.612AlaGln: 2.612 ± 0.776
4.571AlaArg: 4.571 ± 1.083
3.918AlaSer: 3.918 ± 0.582
4.179AlaThr: 4.179 ± 0.533
5.093AlaVal: 5.093 ± 1.05
0.784AlaTrp: 0.784 ± 0.346
2.612AlaTyr: 2.612 ± 0.783
0.0AlaXaa: 0.0 ± 0.0
Cys
0.914CysAla: 0.914 ± 0.339
0.261CysCys: 0.261 ± 0.186
0.522CysAsp: 0.522 ± 0.307
0.522CysGlu: 0.522 ± 0.223
0.0CysPhe: 0.0 ± 0.0
0.914CysGly: 0.914 ± 0.543
0.261CysHis: 0.261 ± 0.159
0.784CysIle: 0.784 ± 0.333
0.784CysLys: 0.784 ± 0.45
0.392CysLeu: 0.392 ± 0.237
0.131CysMet: 0.131 ± 0.133
0.261CysAsn: 0.261 ± 0.151
0.392CysPro: 0.392 ± 0.341
0.131CysGln: 0.131 ± 0.127
0.522CysArg: 0.522 ± 0.308
1.045CysSer: 1.045 ± 0.358
0.653CysThr: 0.653 ± 0.323
0.914CysVal: 0.914 ± 0.333
0.261CysTrp: 0.261 ± 0.155
0.653CysTyr: 0.653 ± 0.333
0.0CysXaa: 0.0 ± 0.0
Asp
5.877AspAla: 5.877 ± 1.001
0.653AspCys: 0.653 ± 0.328
4.571AspAsp: 4.571 ± 1.174
5.224AspGlu: 5.224 ± 0.821
2.09AspPhe: 2.09 ± 0.644
6.53AspGly: 6.53 ± 1.657
1.306AspHis: 1.306 ± 0.37
4.832AspIle: 4.832 ± 0.741
4.179AspLys: 4.179 ± 0.514
4.31AspLeu: 4.31 ± 0.457
2.22AspMet: 2.22 ± 0.562
3.526AspAsn: 3.526 ± 0.672
2.612AspPro: 2.612 ± 0.703
0.522AspGln: 0.522 ± 0.43
3.265AspArg: 3.265 ± 0.496
2.743AspSer: 2.743 ± 0.55
3.134AspThr: 3.134 ± 0.543
3.265AspVal: 3.265 ± 0.542
1.306AspTrp: 1.306 ± 0.388
3.004AspTyr: 3.004 ± 0.907
0.0AspXaa: 0.0 ± 0.0
Glu
7.314GluAla: 7.314 ± 1.055
0.784GluCys: 0.784 ± 0.267
5.355GluAsp: 5.355 ± 0.505
8.489GluGlu: 8.489 ± 0.942
3.396GluPhe: 3.396 ± 0.562
4.702GluGly: 4.702 ± 0.731
1.437GluHis: 1.437 ± 0.238
5.093GluIle: 5.093 ± 0.573
8.358GluLys: 8.358 ± 1.364
7.052GluLeu: 7.052 ± 0.971
2.351GluMet: 2.351 ± 0.776
3.918GluAsn: 3.918 ± 0.569
1.959GluPro: 1.959 ± 0.635
2.873GluGln: 2.873 ± 0.578
4.571GluArg: 4.571 ± 0.616
2.481GluSer: 2.481 ± 0.57
4.049GluThr: 4.049 ± 0.864
4.832GluVal: 4.832 ± 0.695
1.175GluTrp: 1.175 ± 0.485
3.134GluTyr: 3.134 ± 0.747
0.0GluXaa: 0.0 ± 0.0
Phe
1.959PheAla: 1.959 ± 0.625
0.261PheCys: 0.261 ± 0.185
2.22PheAsp: 2.22 ± 0.447
3.526PheGlu: 3.526 ± 0.782
1.175PhePhe: 1.175 ± 0.317
3.396PheGly: 3.396 ± 0.798
0.784PheHis: 0.784 ± 0.357
1.698PheIle: 1.698 ± 0.622
2.481PheLys: 2.481 ± 0.616
2.612PheLeu: 2.612 ± 0.709
0.522PheMet: 0.522 ± 0.22
2.22PheAsn: 2.22 ± 0.579
1.959PhePro: 1.959 ± 0.547
1.698PheGln: 1.698 ± 0.382
2.09PheArg: 2.09 ± 0.56
2.743PheSer: 2.743 ± 0.579
2.09PheThr: 2.09 ± 0.439
2.351PheVal: 2.351 ± 0.771
0.784PheTrp: 0.784 ± 0.382
1.698PheTyr: 1.698 ± 0.623
0.0PheXaa: 0.0 ± 0.0
Gly
3.787GlyAla: 3.787 ± 0.811
1.045GlyCys: 1.045 ± 0.341
3.657GlyAsp: 3.657 ± 0.902
6.661GlyGlu: 6.661 ± 0.97
1.828GlyPhe: 1.828 ± 0.444
5.093GlyGly: 5.093 ± 0.552
1.306GlyHis: 1.306 ± 0.365
4.049GlyIle: 4.049 ± 0.546
8.62GlyLys: 8.62 ± 1.171
6.008GlyLeu: 6.008 ± 0.754
1.175GlyMet: 1.175 ± 0.389
4.049GlyAsn: 4.049 ± 1.084
0.653GlyPro: 0.653 ± 0.272
2.743GlyGln: 2.743 ± 0.656
4.179GlyArg: 4.179 ± 0.481
3.787GlySer: 3.787 ± 0.637
5.224GlyThr: 5.224 ± 0.651
4.44GlyVal: 4.44 ± 0.861
1.828GlyTrp: 1.828 ± 0.41
2.873GlyTyr: 2.873 ± 0.586
0.0GlyXaa: 0.0 ± 0.0
His
0.914HisAla: 0.914 ± 0.351
0.131HisCys: 0.131 ± 0.135
0.392HisAsp: 0.392 ± 0.193
0.131HisGlu: 0.131 ± 0.141
0.261HisPhe: 0.261 ± 0.202
1.045HisGly: 1.045 ± 0.298
0.131HisHis: 0.131 ± 0.107
0.784HisIle: 0.784 ± 0.372
1.437HisLys: 1.437 ± 0.613
1.437HisLeu: 1.437 ± 0.314
0.261HisMet: 0.261 ± 0.171
0.261HisAsn: 0.261 ± 0.215
0.653HisPro: 0.653 ± 0.395
0.653HisGln: 0.653 ± 0.428
1.045HisArg: 1.045 ± 0.453
0.914HisSer: 0.914 ± 0.244
0.653HisThr: 0.653 ± 0.25
1.045HisVal: 1.045 ± 0.293
0.392HisTrp: 0.392 ± 0.181
1.045HisTyr: 1.045 ± 0.303
0.0HisXaa: 0.0 ± 0.0
Ile
4.702IleAla: 4.702 ± 0.869
0.261IleCys: 0.261 ± 0.241
3.918IleAsp: 3.918 ± 0.56
5.485IleGlu: 5.485 ± 0.868
1.306IlePhe: 1.306 ± 0.379
2.873IleGly: 2.873 ± 0.435
0.784IleHis: 0.784 ± 0.355
2.22IleIle: 2.22 ± 0.657
3.134IleLys: 3.134 ± 0.819
4.571IleLeu: 4.571 ± 0.982
0.653IleMet: 0.653 ± 0.359
2.351IleAsn: 2.351 ± 0.547
1.959IlePro: 1.959 ± 0.39
1.306IleGln: 1.306 ± 0.306
4.571IleArg: 4.571 ± 0.658
3.918IleSer: 3.918 ± 0.7
4.702IleThr: 4.702 ± 0.712
2.351IleVal: 2.351 ± 0.628
0.914IleTrp: 0.914 ± 0.291
2.09IleTyr: 2.09 ± 0.735
0.0IleXaa: 0.0 ± 0.0
Lys
6.791LysAla: 6.791 ± 1.237
1.045LysCys: 1.045 ± 0.289
5.093LysAsp: 5.093 ± 1.393
5.616LysGlu: 5.616 ± 1.12
2.873LysPhe: 2.873 ± 0.637
6.138LysGly: 6.138 ± 0.862
0.784LysHis: 0.784 ± 0.4
4.049LysIle: 4.049 ± 0.735
6.661LysLys: 6.661 ± 1.177
7.052LysLeu: 7.052 ± 1.012
1.698LysMet: 1.698 ± 0.568
2.873LysAsn: 2.873 ± 0.479
2.873LysPro: 2.873 ± 0.66
2.09LysGln: 2.09 ± 0.572
4.832LysArg: 4.832 ± 1.19
2.873LysSer: 2.873 ± 0.475
4.44LysThr: 4.44 ± 0.883
3.918LysVal: 3.918 ± 0.597
1.306LysTrp: 1.306 ± 0.457
3.918LysTyr: 3.918 ± 0.71
0.0LysXaa: 0.0 ± 0.0
Leu
4.702LeuAla: 4.702 ± 1.087
1.437LeuCys: 1.437 ± 0.481
4.31LeuAsp: 4.31 ± 0.601
6.138LeuGlu: 6.138 ± 0.49
3.657LeuPhe: 3.657 ± 0.663
4.44LeuGly: 4.44 ± 0.702
0.653LeuHis: 0.653 ± 0.275
3.265LeuIle: 3.265 ± 0.636
6.791LeuLys: 6.791 ± 1.01
6.661LeuLeu: 6.661 ± 0.751
2.09LeuMet: 2.09 ± 0.327
4.049LeuAsn: 4.049 ± 0.492
3.134LeuPro: 3.134 ± 0.861
2.481LeuGln: 2.481 ± 1.048
4.571LeuArg: 4.571 ± 0.82
6.922LeuSer: 6.922 ± 1.127
6.53LeuThr: 6.53 ± 0.923
6.791LeuVal: 6.791 ± 0.91
1.045LeuTrp: 1.045 ± 0.474
3.787LeuTyr: 3.787 ± 0.512
0.0LeuXaa: 0.0 ± 0.0
Met
2.351MetAla: 2.351 ± 0.569
0.131MetCys: 0.131 ± 0.15
0.914MetAsp: 0.914 ± 0.254
2.09MetGlu: 2.09 ± 0.433
1.045MetPhe: 1.045 ± 0.504
0.653MetGly: 0.653 ± 0.256
0.261MetHis: 0.261 ± 0.283
1.175MetIle: 1.175 ± 0.33
2.22MetLys: 2.22 ± 0.609
2.09MetLeu: 2.09 ± 0.489
0.522MetMet: 0.522 ± 0.244
1.306MetAsn: 1.306 ± 0.412
1.045MetPro: 1.045 ± 0.463
0.522MetGln: 0.522 ± 0.258
1.175MetArg: 1.175 ± 0.366
1.698MetSer: 1.698 ± 0.499
1.045MetThr: 1.045 ± 0.384
0.914MetVal: 0.914 ± 0.276
0.131MetTrp: 0.131 ± 0.107
1.045MetTyr: 1.045 ± 0.384
0.0MetXaa: 0.0 ± 0.0
Asn
3.526AsnAla: 3.526 ± 0.747
0.392AsnCys: 0.392 ± 0.276
2.481AsnAsp: 2.481 ± 0.785
3.265AsnGlu: 3.265 ± 0.577
1.828AsnPhe: 1.828 ± 0.698
4.571AsnGly: 4.571 ± 0.823
0.261AsnHis: 0.261 ± 0.159
3.265AsnIle: 3.265 ± 0.802
4.049AsnLys: 4.049 ± 0.757
4.832AsnLeu: 4.832 ± 0.475
0.261AsnMet: 0.261 ± 0.159
2.743AsnAsn: 2.743 ± 0.651
2.743AsnPro: 2.743 ± 0.618
0.784AsnGln: 0.784 ± 0.261
2.351AsnArg: 2.351 ± 0.42
3.265AsnSer: 3.265 ± 0.698
2.09AsnThr: 2.09 ± 0.531
2.612AsnVal: 2.612 ± 0.646
0.914AsnTrp: 0.914 ± 0.339
1.959AsnTyr: 1.959 ± 0.417
0.0AsnXaa: 0.0 ± 0.0
Pro
2.743ProAla: 2.743 ± 0.626
0.131ProCys: 0.131 ± 0.16
3.526ProAsp: 3.526 ± 0.84
4.049ProGlu: 4.049 ± 0.769
1.437ProPhe: 1.437 ± 0.494
3.004ProGly: 3.004 ± 0.63
0.261ProHis: 0.261 ± 0.165
0.784ProIle: 0.784 ± 0.334
1.698ProLys: 1.698 ± 0.468
1.698ProLeu: 1.698 ± 0.552
0.522ProMet: 0.522 ± 0.291
1.045ProAsn: 1.045 ± 0.37
0.914ProPro: 0.914 ± 0.334
0.522ProGln: 0.522 ± 0.24
1.828ProArg: 1.828 ± 0.529
1.698ProSer: 1.698 ± 0.503
1.828ProThr: 1.828 ± 0.397
2.743ProVal: 2.743 ± 0.461
0.392ProTrp: 0.392 ± 0.231
1.306ProTyr: 1.306 ± 0.554
0.0ProXaa: 0.0 ± 0.0
Gln
2.873GlnAla: 2.873 ± 0.688
0.0GlnCys: 0.0 ± 0.0
2.612GlnAsp: 2.612 ± 0.729
2.612GlnGlu: 2.612 ± 1.092
1.306GlnPhe: 1.306 ± 0.322
1.959GlnGly: 1.959 ± 0.448
0.131GlnHis: 0.131 ± 0.107
2.351GlnIle: 2.351 ± 0.815
2.873GlnLys: 2.873 ± 0.882
2.612GlnLeu: 2.612 ± 0.74
0.914GlnMet: 0.914 ± 0.328
1.437GlnAsn: 1.437 ± 0.334
1.045GlnPro: 1.045 ± 0.333
2.351GlnGln: 2.351 ± 1.407
1.567GlnArg: 1.567 ± 0.424
1.828GlnSer: 1.828 ± 0.661
1.437GlnThr: 1.437 ± 0.369
1.959GlnVal: 1.959 ± 0.519
0.261GlnTrp: 0.261 ± 0.189
0.914GlnTyr: 0.914 ± 0.488
0.0GlnXaa: 0.0 ± 0.0
Arg
2.873ArgAla: 2.873 ± 0.439
0.784ArgCys: 0.784 ± 0.332
3.134ArgAsp: 3.134 ± 0.704
6.138ArgGlu: 6.138 ± 0.923
2.481ArgPhe: 2.481 ± 0.671
2.22ArgGly: 2.22 ± 0.515
1.045ArgHis: 1.045 ± 0.513
5.093ArgIle: 5.093 ± 0.909
5.224ArgLys: 5.224 ± 0.942
4.832ArgLeu: 4.832 ± 0.694
1.567ArgMet: 1.567 ± 0.349
3.526ArgAsn: 3.526 ± 0.659
1.567ArgPro: 1.567 ± 0.807
3.265ArgGln: 3.265 ± 0.535
5.224ArgArg: 5.224 ± 1.006
3.004ArgSer: 3.004 ± 0.542
2.481ArgThr: 2.481 ± 0.461
3.657ArgVal: 3.657 ± 0.912
0.653ArgTrp: 0.653 ± 0.305
2.743ArgTyr: 2.743 ± 0.628
0.0ArgXaa: 0.0 ± 0.0
Ser
4.702SerAla: 4.702 ± 0.826
0.261SerCys: 0.261 ± 0.171
3.265SerAsp: 3.265 ± 0.594
3.657SerGlu: 3.657 ± 0.812
3.265SerPhe: 3.265 ± 0.6
5.746SerGly: 5.746 ± 1.502
0.653SerHis: 0.653 ± 0.219
3.396SerIle: 3.396 ± 0.521
3.526SerLys: 3.526 ± 0.534
5.224SerLeu: 5.224 ± 0.732
1.698SerMet: 1.698 ± 0.403
3.396SerAsn: 3.396 ± 0.834
1.437SerPro: 1.437 ± 0.414
1.959SerGln: 1.959 ± 0.538
3.265SerArg: 3.265 ± 0.894
3.787SerSer: 3.787 ± 0.683
2.481SerThr: 2.481 ± 0.627
4.832SerVal: 4.832 ± 0.665
1.045SerTrp: 1.045 ± 0.319
0.914SerTyr: 0.914 ± 0.36
0.0SerXaa: 0.0 ± 0.0
Thr
3.918ThrAla: 3.918 ± 0.861
0.653ThrCys: 0.653 ± 0.317
5.485ThrAsp: 5.485 ± 0.761
5.093ThrGlu: 5.093 ± 0.678
1.437ThrPhe: 1.437 ± 0.576
6.008ThrGly: 6.008 ± 0.7
0.392ThrHis: 0.392 ± 0.193
2.481ThrIle: 2.481 ± 0.39
3.134ThrLys: 3.134 ± 0.637
5.093ThrLeu: 5.093 ± 0.78
1.045ThrMet: 1.045 ± 0.395
1.306ThrAsn: 1.306 ± 0.382
2.09ThrPro: 2.09 ± 0.493
2.22ThrGln: 2.22 ± 0.581
3.134ThrArg: 3.134 ± 0.749
3.657ThrSer: 3.657 ± 0.768
2.481ThrThr: 2.481 ± 0.441
3.526ThrVal: 3.526 ± 0.699
0.914ThrTrp: 0.914 ± 0.345
2.09ThrTyr: 2.09 ± 0.521
0.0ThrXaa: 0.0 ± 0.0
Val
3.526ValAla: 3.526 ± 0.841
0.522ValCys: 0.522 ± 0.227
4.049ValAsp: 4.049 ± 0.695
4.44ValGlu: 4.44 ± 0.658
3.918ValPhe: 3.918 ± 1.053
4.049ValGly: 4.049 ± 0.609
1.306ValHis: 1.306 ± 0.395
3.004ValIle: 3.004 ± 0.544
3.787ValLys: 3.787 ± 0.919
4.832ValLeu: 4.832 ± 0.852
1.437ValMet: 1.437 ± 0.344
3.526ValAsn: 3.526 ± 0.667
1.306ValPro: 1.306 ± 0.348
2.09ValGln: 2.09 ± 0.44
4.44ValArg: 4.44 ± 0.651
5.746ValSer: 5.746 ± 0.953
3.918ValThr: 3.918 ± 0.619
3.265ValVal: 3.265 ± 0.909
0.653ValTrp: 0.653 ± 0.284
2.481ValTyr: 2.481 ± 0.526
0.0ValXaa: 0.0 ± 0.0
Trp
0.522TrpAla: 0.522 ± 0.196
0.131TrpCys: 0.131 ± 0.105
0.522TrpAsp: 0.522 ± 0.232
1.045TrpGlu: 1.045 ± 0.288
0.784TrpPhe: 0.784 ± 0.292
0.914TrpGly: 0.914 ± 0.342
0.261TrpHis: 0.261 ± 0.191
0.653TrpIle: 0.653 ± 0.313
1.567TrpLys: 1.567 ± 0.363
1.437TrpLeu: 1.437 ± 0.339
0.653TrpMet: 0.653 ± 0.238
1.698TrpAsn: 1.698 ± 0.33
0.392TrpPro: 0.392 ± 0.194
0.653TrpGln: 0.653 ± 0.276
1.175TrpArg: 1.175 ± 0.406
0.392TrpSer: 0.392 ± 0.181
0.261TrpThr: 0.261 ± 0.215
1.698TrpVal: 1.698 ± 0.372
0.392TrpTrp: 0.392 ± 0.199
0.784TrpTyr: 0.784 ± 0.447
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.175TyrAla: 1.175 ± 0.36
1.175TyrCys: 1.175 ± 0.488
3.918TyrAsp: 3.918 ± 0.945
2.612TyrGlu: 2.612 ± 0.446
1.828TyrPhe: 1.828 ± 0.304
3.396TyrGly: 3.396 ± 0.728
0.522TyrHis: 0.522 ± 0.201
2.481TyrIle: 2.481 ± 0.437
1.959TyrLys: 1.959 ± 0.815
3.396TyrLeu: 3.396 ± 0.788
0.914TyrMet: 0.914 ± 0.33
2.09TyrAsn: 2.09 ± 0.499
1.437TyrPro: 1.437 ± 0.417
1.567TyrGln: 1.567 ± 0.612
3.134TyrArg: 3.134 ± 0.449
2.09TyrSer: 2.09 ± 0.385
2.481TyrThr: 2.481 ± 0.551
2.09TyrVal: 2.09 ± 0.404
0.784TyrTrp: 0.784 ± 0.395
1.306TyrTyr: 1.306 ± 0.421
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 17 proteins (7658 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski