Amino acid dipepetide frequency for Propionibacterium phage Doucette

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.575AlaAla: 18.575 ± 2.247
0.959AlaCys: 0.959 ± 0.309
8.546AlaAsp: 8.546 ± 0.994
6.715AlaGlu: 6.715 ± 0.926
2.616AlaPhe: 2.616 ± 0.629
10.029AlaGly: 10.029 ± 1.07
1.308AlaHis: 1.308 ± 0.286
5.756AlaIle: 5.756 ± 0.948
5.145AlaLys: 5.145 ± 0.518
12.471AlaLeu: 12.471 ± 1.211
3.924AlaMet: 3.924 ± 0.477
2.791AlaAsn: 2.791 ± 0.458
5.756AlaPro: 5.756 ± 0.599
5.843AlaGln: 5.843 ± 0.775
8.459AlaArg: 8.459 ± 1.583
7.936AlaSer: 7.936 ± 0.764
8.11AlaThr: 8.11 ± 0.921
8.11AlaVal: 8.11 ± 0.781
3.837AlaTrp: 3.837 ± 0.551
1.483AlaTyr: 1.483 ± 0.377
0.0AlaXaa: 0.0 ± 0.0
Cys
0.959CysAla: 0.959 ± 0.289
0.0CysCys: 0.0 ± 0.0
0.785CysAsp: 0.785 ± 0.305
0.262CysGlu: 0.262 ± 0.164
0.262CysPhe: 0.262 ± 0.174
1.046CysGly: 1.046 ± 0.341
0.174CysHis: 0.174 ± 0.114
0.262CysIle: 0.262 ± 0.145
0.262CysLys: 0.262 ± 0.139
0.436CysLeu: 0.436 ± 0.19
0.174CysMet: 0.174 ± 0.157
0.087CysAsn: 0.087 ± 0.095
1.046CysPro: 1.046 ± 0.315
0.349CysGln: 0.349 ± 0.165
0.785CysArg: 0.785 ± 0.28
0.436CysSer: 0.436 ± 0.172
0.523CysThr: 0.523 ± 0.211
0.174CysVal: 0.174 ± 0.132
0.349CysTrp: 0.349 ± 0.174
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
8.023AspAla: 8.023 ± 0.9
0.61AspCys: 0.61 ± 0.276
3.924AspAsp: 3.924 ± 0.696
4.273AspGlu: 4.273 ± 0.897
1.831AspPhe: 1.831 ± 0.484
7.151AspGly: 7.151 ± 0.886
1.046AspHis: 1.046 ± 0.289
2.703AspIle: 2.703 ± 0.465
1.57AspLys: 1.57 ± 0.385
5.668AspLeu: 5.668 ± 0.678
1.831AspMet: 1.831 ± 0.389
0.61AspAsn: 0.61 ± 0.179
4.971AspPro: 4.971 ± 0.804
2.18AspGln: 2.18 ± 0.539
5.32AspArg: 5.32 ± 0.681
2.529AspSer: 2.529 ± 0.483
3.75AspThr: 3.75 ± 0.659
4.796AspVal: 4.796 ± 0.605
0.959AspTrp: 0.959 ± 0.369
1.134AspTyr: 1.134 ± 0.32
0.0AspXaa: 0.0 ± 0.0
Glu
8.023GluAla: 8.023 ± 0.806
0.785GluCys: 0.785 ± 0.241
2.965GluAsp: 2.965 ± 0.48
2.006GluGlu: 2.006 ± 0.615
0.959GluPhe: 0.959 ± 0.227
3.139GluGly: 3.139 ± 0.737
1.308GluHis: 1.308 ± 0.278
2.355GluIle: 2.355 ± 0.397
1.395GluLys: 1.395 ± 0.469
5.581GluLeu: 5.581 ± 0.921
0.698GluMet: 0.698 ± 0.241
0.61GluAsn: 0.61 ± 0.209
3.139GluPro: 3.139 ± 0.61
1.831GluGln: 1.831 ± 0.486
3.75GluArg: 3.75 ± 0.717
2.791GluSer: 2.791 ± 0.536
3.314GluThr: 3.314 ± 0.648
3.75GluVal: 3.75 ± 0.606
1.744GluTrp: 1.744 ± 0.357
0.349GluTyr: 0.349 ± 0.198
0.0GluXaa: 0.0 ± 0.0
Phe
3.488PheAla: 3.488 ± 0.582
0.349PheCys: 0.349 ± 0.15
2.006PheAsp: 2.006 ± 0.409
1.395PheGlu: 1.395 ± 0.378
0.436PhePhe: 0.436 ± 0.194
2.791PheGly: 2.791 ± 0.799
0.61PheHis: 0.61 ± 0.214
1.395PheIle: 1.395 ± 0.307
1.046PheLys: 1.046 ± 0.301
1.831PheLeu: 1.831 ± 0.405
0.872PheMet: 0.872 ± 0.244
0.349PheAsn: 0.349 ± 0.217
1.395PhePro: 1.395 ± 0.405
1.744PheGln: 1.744 ± 0.349
1.57PheArg: 1.57 ± 0.292
1.395PheSer: 1.395 ± 0.417
1.919PheThr: 1.919 ± 0.377
1.744PheVal: 1.744 ± 0.343
0.523PheTrp: 0.523 ± 0.239
0.349PheTyr: 0.349 ± 0.174
0.0PheXaa: 0.0 ± 0.0
Gly
10.726GlyAla: 10.726 ± 1.214
0.61GlyCys: 0.61 ± 0.201
5.058GlyAsp: 5.058 ± 0.776
4.971GlyGlu: 4.971 ± 0.557
2.878GlyPhe: 2.878 ± 0.567
7.674GlyGly: 7.674 ± 1.122
1.221GlyHis: 1.221 ± 0.301
4.186GlyIle: 4.186 ± 0.76
3.575GlyLys: 3.575 ± 0.478
8.459GlyLeu: 8.459 ± 1.029
1.744GlyMet: 1.744 ± 0.454
1.657GlyAsn: 1.657 ± 0.338
4.448GlyPro: 4.448 ± 0.627
2.529GlyGln: 2.529 ± 0.903
5.843GlyArg: 5.843 ± 0.96
4.884GlySer: 4.884 ± 0.889
5.145GlyThr: 5.145 ± 0.586
6.366GlyVal: 6.366 ± 1.053
3.227GlyTrp: 3.227 ± 0.489
1.657GlyTyr: 1.657 ± 0.393
0.0GlyXaa: 0.0 ± 0.0
His
1.483HisAla: 1.483 ± 0.392
0.262HisCys: 0.262 ± 0.15
0.785HisAsp: 0.785 ± 0.244
0.959HisGlu: 0.959 ± 0.309
0.698HisPhe: 0.698 ± 0.282
1.395HisGly: 1.395 ± 0.341
0.61HisHis: 0.61 ± 0.23
0.523HisIle: 0.523 ± 0.205
0.436HisLys: 0.436 ± 0.202
1.395HisLeu: 1.395 ± 0.454
0.785HisMet: 0.785 ± 0.271
0.087HisAsn: 0.087 ± 0.091
1.046HisPro: 1.046 ± 0.294
1.134HisGln: 1.134 ± 0.334
1.134HisArg: 1.134 ± 0.333
0.959HisSer: 0.959 ± 0.316
1.744HisThr: 1.744 ± 0.466
1.134HisVal: 1.134 ± 0.279
0.61HisTrp: 0.61 ± 0.204
0.087HisTyr: 0.087 ± 0.092
0.0HisXaa: 0.0 ± 0.0
Ile
4.709IleAla: 4.709 ± 0.72
0.262IleCys: 0.262 ± 0.147
4.709IleAsp: 4.709 ± 0.647
3.401IleGlu: 3.401 ± 0.546
1.744IlePhe: 1.744 ± 0.454
3.488IleGly: 3.488 ± 1.146
0.872IleHis: 0.872 ± 0.251
2.442IleIle: 2.442 ± 0.623
1.483IleLys: 1.483 ± 0.329
2.965IleLeu: 2.965 ± 0.676
0.61IleMet: 0.61 ± 0.188
0.523IleAsn: 0.523 ± 0.21
3.052IlePro: 3.052 ± 0.478
1.744IleGln: 1.744 ± 0.399
2.529IleArg: 2.529 ± 0.533
2.791IleSer: 2.791 ± 0.723
4.273IleThr: 4.273 ± 0.509
3.139IleVal: 3.139 ± 0.548
0.61IleTrp: 0.61 ± 0.208
0.959IleTyr: 0.959 ± 0.277
0.0IleXaa: 0.0 ± 0.0
Lys
4.884LysAla: 4.884 ± 0.796
0.174LysCys: 0.174 ± 0.116
1.831LysAsp: 1.831 ± 0.375
1.657LysGlu: 1.657 ± 0.291
1.134LysPhe: 1.134 ± 0.337
2.616LysGly: 2.616 ± 0.464
0.698LysHis: 0.698 ± 0.289
1.221LysIle: 1.221 ± 0.39
0.959LysLys: 0.959 ± 0.313
2.442LysLeu: 2.442 ± 0.479
0.698LysMet: 0.698 ± 0.233
0.698LysAsn: 0.698 ± 0.226
2.616LysPro: 2.616 ± 0.542
1.744LysGln: 1.744 ± 0.405
3.575LysArg: 3.575 ± 0.631
1.395LysSer: 1.395 ± 0.266
2.791LysThr: 2.791 ± 0.59
2.791LysVal: 2.791 ± 0.522
0.785LysTrp: 0.785 ± 0.26
0.523LysTyr: 0.523 ± 0.223
0.0LysXaa: 0.0 ± 0.0
Leu
11.511LeuAla: 11.511 ± 1.225
0.698LeuCys: 0.698 ± 0.25
5.407LeuAsp: 5.407 ± 0.679
4.099LeuGlu: 4.099 ± 0.848
2.267LeuPhe: 2.267 ± 0.443
8.633LeuGly: 8.633 ± 1.013
1.483LeuHis: 1.483 ± 0.391
3.488LeuIle: 3.488 ± 0.843
2.791LeuLys: 2.791 ± 0.581
5.843LeuLeu: 5.843 ± 0.664
1.831LeuMet: 1.831 ± 0.426
1.831LeuAsn: 1.831 ± 0.473
5.058LeuPro: 5.058 ± 0.646
3.052LeuGln: 3.052 ± 0.534
5.668LeuArg: 5.668 ± 0.762
5.843LeuSer: 5.843 ± 0.987
6.715LeuThr: 6.715 ± 0.835
4.884LeuVal: 4.884 ± 0.9
2.006LeuTrp: 2.006 ± 0.433
0.61LeuTyr: 0.61 ± 0.198
0.0LeuXaa: 0.0 ± 0.0
Met
3.139MetAla: 3.139 ± 0.514
0.0MetCys: 0.0 ± 0.0
1.221MetAsp: 1.221 ± 0.353
0.698MetGlu: 0.698 ± 0.274
0.349MetPhe: 0.349 ± 0.209
1.57MetGly: 1.57 ± 0.466
0.61MetHis: 0.61 ± 0.232
1.221MetIle: 1.221 ± 0.255
1.395MetLys: 1.395 ± 0.422
1.657MetLeu: 1.657 ± 0.421
0.61MetMet: 0.61 ± 0.214
0.785MetAsn: 0.785 ± 0.239
1.919MetPro: 1.919 ± 0.425
0.61MetGln: 0.61 ± 0.198
2.006MetArg: 2.006 ± 0.449
2.616MetSer: 2.616 ± 0.385
1.657MetThr: 1.657 ± 0.375
1.308MetVal: 1.308 ± 0.283
0.262MetTrp: 0.262 ± 0.15
0.262MetTyr: 0.262 ± 0.186
0.0MetXaa: 0.0 ± 0.0
Asn
2.791AsnAla: 2.791 ± 0.603
0.087AsnCys: 0.087 ± 0.08
0.698AsnAsp: 0.698 ± 0.271
0.523AsnGlu: 0.523 ± 0.174
0.349AsnPhe: 0.349 ± 0.146
1.744AsnGly: 1.744 ± 0.381
0.436AsnHis: 0.436 ± 0.177
1.134AsnIle: 1.134 ± 0.263
0.262AsnLys: 0.262 ± 0.153
1.395AsnLeu: 1.395 ± 0.408
0.262AsnMet: 0.262 ± 0.182
0.349AsnAsn: 0.349 ± 0.158
1.919AsnPro: 1.919 ± 0.448
0.61AsnGln: 0.61 ± 0.233
1.919AsnArg: 1.919 ± 0.467
0.523AsnSer: 0.523 ± 0.199
1.046AsnThr: 1.046 ± 0.253
1.831AsnVal: 1.831 ± 0.442
0.349AsnTrp: 0.349 ± 0.289
0.262AsnTyr: 0.262 ± 0.154
0.0AsnXaa: 0.0 ± 0.0
Pro
7.674ProAla: 7.674 ± 0.9
0.523ProCys: 0.523 ± 0.256
4.971ProAsp: 4.971 ± 0.779
3.401ProGlu: 3.401 ± 0.542
2.18ProPhe: 2.18 ± 0.504
5.058ProGly: 5.058 ± 0.735
0.872ProHis: 0.872 ± 0.221
2.529ProIle: 2.529 ± 0.477
2.006ProLys: 2.006 ± 0.414
3.401ProLeu: 3.401 ± 0.615
1.395ProMet: 1.395 ± 0.308
1.134ProAsn: 1.134 ± 0.314
4.012ProPro: 4.012 ± 1.027
2.093ProGln: 2.093 ± 0.536
2.791ProArg: 2.791 ± 0.598
4.36ProSer: 4.36 ± 0.692
3.488ProThr: 3.488 ± 0.601
5.93ProVal: 5.93 ± 0.954
1.134ProTrp: 1.134 ± 0.34
0.61ProTyr: 0.61 ± 0.28
0.0ProXaa: 0.0 ± 0.0
Gln
4.622GlnAla: 4.622 ± 0.638
0.174GlnCys: 0.174 ± 0.152
1.657GlnAsp: 1.657 ± 0.419
1.744GlnGlu: 1.744 ± 0.525
1.046GlnPhe: 1.046 ± 0.289
2.791GlnGly: 2.791 ± 0.502
0.436GlnHis: 0.436 ± 0.178
2.442GlnIle: 2.442 ± 0.505
1.046GlnLys: 1.046 ± 0.231
4.884GlnLeu: 4.884 ± 1.014
0.872GlnMet: 0.872 ± 0.306
0.698GlnAsn: 0.698 ± 0.252
1.831GlnPro: 1.831 ± 0.461
2.18GlnGln: 2.18 ± 0.343
2.093GlnArg: 2.093 ± 0.476
2.355GlnSer: 2.355 ± 0.481
2.18GlnThr: 2.18 ± 0.523
3.837GlnVal: 3.837 ± 0.769
1.57GlnTrp: 1.57 ± 0.352
0.61GlnTyr: 0.61 ± 0.19
0.0GlnXaa: 0.0 ± 0.0
Arg
8.285ArgAla: 8.285 ± 0.945
0.872ArgCys: 0.872 ± 0.347
5.058ArgAsp: 5.058 ± 0.752
3.488ArgGlu: 3.488 ± 0.57
2.093ArgPhe: 2.093 ± 0.513
5.494ArgGly: 5.494 ± 0.825
1.483ArgHis: 1.483 ± 0.423
2.791ArgIle: 2.791 ± 0.447
3.314ArgLys: 3.314 ± 0.771
4.884ArgLeu: 4.884 ± 0.734
1.831ArgMet: 1.831 ± 0.377
1.57ArgAsn: 1.57 ± 0.362
3.837ArgPro: 3.837 ± 0.79
2.355ArgGln: 2.355 ± 0.458
5.145ArgArg: 5.145 ± 1.059
3.75ArgSer: 3.75 ± 0.541
4.186ArgThr: 4.186 ± 0.667
4.36ArgVal: 4.36 ± 0.651
2.006ArgTrp: 2.006 ± 0.499
1.221ArgTyr: 1.221 ± 0.347
0.0ArgXaa: 0.0 ± 0.0
Ser
7.849SerAla: 7.849 ± 0.947
0.436SerCys: 0.436 ± 0.242
4.796SerAsp: 4.796 ± 0.653
1.919SerGlu: 1.919 ± 0.532
1.657SerPhe: 1.657 ± 0.376
5.407SerGly: 5.407 ± 0.793
0.959SerHis: 0.959 ± 0.334
3.139SerIle: 3.139 ± 0.629
1.657SerLys: 1.657 ± 0.393
4.796SerLeu: 4.796 ± 0.859
1.919SerMet: 1.919 ± 0.35
1.134SerAsn: 1.134 ± 0.248
2.442SerPro: 2.442 ± 0.406
2.18SerGln: 2.18 ± 0.507
4.012SerArg: 4.012 ± 0.58
5.32SerSer: 5.32 ± 0.893
4.796SerThr: 4.796 ± 0.659
5.32SerVal: 5.32 ± 0.762
1.395SerTrp: 1.395 ± 0.391
0.872SerTyr: 0.872 ± 0.3
0.0SerXaa: 0.0 ± 0.0
Thr
6.715ThrAla: 6.715 ± 0.618
1.134ThrCys: 1.134 ± 0.44
4.709ThrAsp: 4.709 ± 0.975
2.616ThrGlu: 2.616 ± 0.506
1.57ThrPhe: 1.57 ± 0.406
6.192ThrGly: 6.192 ± 0.822
1.134ThrHis: 1.134 ± 0.375
3.575ThrIle: 3.575 ± 0.604
2.616ThrLys: 2.616 ± 0.498
6.279ThrLeu: 6.279 ± 0.642
1.046ThrMet: 1.046 ± 0.31
1.308ThrAsn: 1.308 ± 0.329
5.232ThrPro: 5.232 ± 0.891
2.442ThrGln: 2.442 ± 0.346
3.575ThrArg: 3.575 ± 0.675
4.535ThrSer: 4.535 ± 0.935
5.32ThrThr: 5.32 ± 0.816
4.971ThrVal: 4.971 ± 0.647
1.744ThrTrp: 1.744 ± 0.381
0.959ThrTyr: 0.959 ± 0.356
0.0ThrXaa: 0.0 ± 0.0
Val
10.552ValAla: 10.552 ± 1.027
0.349ValCys: 0.349 ± 0.161
3.401ValAsp: 3.401 ± 0.499
4.796ValGlu: 4.796 ± 0.721
2.093ValPhe: 2.093 ± 0.697
7.238ValGly: 7.238 ± 1.518
0.872ValHis: 0.872 ± 0.316
3.488ValIle: 3.488 ± 0.712
2.791ValLys: 2.791 ± 0.488
5.843ValLeu: 5.843 ± 0.978
1.744ValMet: 1.744 ± 0.412
1.483ValAsn: 1.483 ± 0.355
4.012ValPro: 4.012 ± 0.52
2.442ValGln: 2.442 ± 0.457
4.273ValArg: 4.273 ± 0.636
5.581ValSer: 5.581 ± 0.833
4.535ValThr: 4.535 ± 0.649
6.279ValVal: 6.279 ± 0.928
1.657ValTrp: 1.657 ± 0.374
0.698ValTyr: 0.698 ± 0.237
0.0ValXaa: 0.0 ± 0.0
Trp
2.616TrpAla: 2.616 ± 0.51
0.0TrpCys: 0.0 ± 0.0
2.093TrpAsp: 2.093 ± 0.402
1.134TrpGlu: 1.134 ± 0.333
0.436TrpPhe: 0.436 ± 0.168
1.744TrpGly: 1.744 ± 0.451
0.61TrpHis: 0.61 ± 0.296
1.395TrpIle: 1.395 ± 0.334
1.046TrpLys: 1.046 ± 0.218
2.267TrpLeu: 2.267 ± 0.501
0.872TrpMet: 0.872 ± 0.228
0.523TrpAsn: 0.523 ± 0.224
1.483TrpPro: 1.483 ± 0.313
1.046TrpGln: 1.046 ± 0.33
3.052TrpArg: 3.052 ± 0.702
1.134TrpSer: 1.134 ± 0.373
1.308TrpThr: 1.308 ± 0.317
2.093TrpVal: 2.093 ± 0.395
0.436TrpTrp: 0.436 ± 0.199
0.174TrpTyr: 0.174 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.744TyrAla: 1.744 ± 0.439
0.174TyrCys: 0.174 ± 0.129
0.262TyrAsp: 0.262 ± 0.164
0.436TyrGlu: 0.436 ± 0.212
0.61TyrPhe: 0.61 ± 0.249
1.657TyrGly: 1.657 ± 0.525
0.523TyrHis: 0.523 ± 0.232
0.262TyrIle: 0.262 ± 0.13
0.436TyrLys: 0.436 ± 0.179
1.221TyrLeu: 1.221 ± 0.331
0.087TyrMet: 0.087 ± 0.098
0.262TyrAsn: 0.262 ± 0.162
0.349TyrPro: 0.349 ± 0.191
0.959TyrGln: 0.959 ± 0.31
0.523TyrArg: 0.523 ± 0.203
0.872TyrSer: 0.872 ± 0.281
0.872TyrThr: 0.872 ± 0.261
1.308TyrVal: 1.308 ± 0.26
0.262TyrTrp: 0.262 ± 0.154
0.174TyrTyr: 0.174 ± 0.134
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (11468 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski