Amino acid dipeptide frequency for Faecalibacterium phage FP_Mushu

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
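As an illustration, a minimal Python sketch of such a calculation is given here. It is not the code used to produce this page: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes of the table, and counting overlapping pairs within each protein (never across protein boundaries) is an assumption.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2 inside one protein;
        # pairs are assumed never to span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MAAK", "AAL"]
freqs = dipeptide_per_mille(toy)
```

With the toy input there are five dipeptides in total (MA, AA, AK, AA, AL), so AA accounts for two of five.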

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
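The uniform expectation can be worked out in a few lines. The sketch below is only an assumption about how the comparison might be done; the exact colouring rule used for the table is not stated on this page, and the function names are hypothetical.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(value_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if value_per_mille > expected_per_mille:
        return "over"    # would be marked in red
    if value_per_mille < expected_per_mille:
        return "under"   # would be marked in blue
    return "expected"

expected_20 = uniform_expectation_per_mille(20)  # 400 dipeptides
expected_21 = uniform_expectation_per_mille(21)  # 441 dipeptides, with Xaa

# Two values from the table: AlaAla (11.678) and AlaCys (0.606).
label_ala_ala = classify(11.678, expected_20)
label_ala_cys = classify(0.606, expected_20)
```

Under this simple rule AlaAla is overrepresented and AlaCys is underrepresented relative to 2.5‰.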

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
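For example, the unit conversion can be written as a short Python sketch (the helper names are hypothetical):

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

# AlaAla is listed as 11.678 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(11.678)
ala_ala_percent = per_mille_to_percent(11.678)
```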

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.678 ± 1.095
AlaCys: 0.606 ± 0.235
AlaAsp: 4.758 ± 0.644
AlaGlu: 7.18 ± 0.929
AlaPhe: 2.768 ± 0.557
AlaGly: 6.055 ± 0.667
AlaHis: 1.125 ± 0.301
AlaIle: 4.585 ± 0.729
AlaLys: 5.19 ± 0.586
AlaLeu: 9.862 ± 1.06
AlaMet: 2.595 ± 0.456
AlaAsn: 3.028 ± 0.589
AlaPro: 3.547 ± 0.642
AlaGln: 4.931 ± 0.778
AlaArg: 5.882 ± 0.648
AlaSer: 5.017 ± 0.701
AlaThr: 6.142 ± 0.887
AlaVal: 8.564 ± 0.884
AlaTrp: 1.125 ± 0.317
AlaTyr: 3.46 ± 0.653
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.346 ± 0.168
CysCys: 0.0 ± 0.0
CysAsp: 0.606 ± 0.199
CysGlu: 0.779 ± 0.308
CysPhe: 0.346 ± 0.175
CysGly: 0.692 ± 0.281
CysHis: 0.087 ± 0.086
CysIle: 0.519 ± 0.201
CysLys: 0.519 ± 0.199
CysLeu: 0.173 ± 0.125
CysMet: 0.173 ± 0.117
CysAsn: 0.26 ± 0.14
CysPro: 0.606 ± 0.296
CysGln: 0.346 ± 0.215
CysArg: 0.606 ± 0.219
CysSer: 0.952 ± 0.337
CysThr: 0.433 ± 0.161
CysVal: 0.26 ± 0.144
CysTrp: 0.087 ± 0.087
CysTyr: 0.346 ± 0.202
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.612 ± 0.706
AspCys: 0.433 ± 0.188
AspAsp: 4.152 ± 0.519
AspGlu: 5.19 ± 0.551
AspPhe: 2.422 ± 0.539
AspGly: 5.277 ± 0.821
AspHis: 0.606 ± 0.212
AspIle: 3.46 ± 0.521
AspLys: 3.547 ± 0.609
AspLeu: 5.277 ± 0.774
AspMet: 1.817 ± 0.383
AspAsn: 1.471 ± 0.325
AspPro: 2.076 ± 0.501
AspGln: 0.952 ± 0.294
AspArg: 3.028 ± 0.478
AspSer: 2.336 ± 0.407
AspThr: 3.893 ± 0.643
AspVal: 4.498 ± 0.711
AspTrp: 0.779 ± 0.2
AspTyr: 1.557 ± 0.388
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.91 ± 0.875
GluCys: 0.606 ± 0.284
GluAsp: 6.142 ± 0.673
GluGlu: 6.228 ± 0.73
GluPhe: 1.903 ± 0.37
GluGly: 6.055 ± 0.555
GluHis: 0.692 ± 0.23
GluIle: 4.239 ± 0.595
GluLys: 4.844 ± 0.521
GluLeu: 8.824 ± 0.939
GluMet: 2.163 ± 0.409
GluAsn: 2.682 ± 0.505
GluPro: 2.682 ± 0.43
GluGln: 3.287 ± 0.656
GluArg: 4.844 ± 0.689
GluSer: 3.374 ± 0.503
GluThr: 4.671 ± 0.574
GluVal: 4.585 ± 0.667
GluTrp: 1.125 ± 0.282
GluTyr: 2.509 ± 0.556
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.114 ± 0.51
PheCys: 0.173 ± 0.13
PheAsp: 1.903 ± 0.429
PheGlu: 3.028 ± 0.484
PhePhe: 1.73 ± 0.334
PheGly: 2.509 ± 0.416
PheHis: 0.173 ± 0.106
PheIle: 1.99 ± 0.409
PheLys: 2.595 ± 0.401
PheLeu: 2.855 ± 0.424
PheMet: 0.779 ± 0.219
PheAsn: 1.211 ± 0.264
PhePro: 1.125 ± 0.308
PheGln: 0.606 ± 0.275
PheArg: 1.903 ± 0.361
PheSer: 2.855 ± 0.551
PheThr: 2.422 ± 0.507
PheVal: 2.509 ± 0.46
PheTrp: 0.433 ± 0.178
PheTyr: 1.384 ± 0.343
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.709 ± 0.634
GlyCys: 0.433 ± 0.169
GlyAsp: 4.412 ± 0.711
GlyGlu: 5.969 ± 0.609
GlyPhe: 3.633 ± 0.568
GlyGly: 6.055 ± 0.881
GlyHis: 1.125 ± 0.323
GlyIle: 3.547 ± 0.752
GlyLys: 5.969 ± 0.663
GlyLeu: 5.882 ± 0.58
GlyMet: 1.557 ± 0.362
GlyAsn: 2.163 ± 0.425
GlyPro: 1.298 ± 0.297
GlyGln: 2.509 ± 0.588
GlyArg: 2.941 ± 0.505
GlySer: 4.498 ± 0.742
GlyThr: 5.104 ± 0.622
GlyVal: 5.19 ± 0.639
GlyTrp: 1.038 ± 0.263
GlyTyr: 3.374 ± 0.575
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.038 ± 0.373
HisCys: 0.26 ± 0.133
HisAsp: 0.606 ± 0.202
HisGlu: 0.606 ± 0.285
HisPhe: 0.173 ± 0.11
HisGly: 0.952 ± 0.383
HisHis: 0.087 ± 0.087
HisIle: 1.038 ± 0.263
HisLys: 0.952 ± 0.292
HisLeu: 1.038 ± 0.276
HisMet: 0.173 ± 0.118
HisAsn: 0.692 ± 0.27
HisPro: 0.952 ± 0.364
HisGln: 0.606 ± 0.245
HisArg: 0.519 ± 0.173
HisSer: 0.606 ± 0.265
HisThr: 0.606 ± 0.218
HisVal: 0.952 ± 0.283
HisTrp: 0.346 ± 0.239
HisTyr: 0.519 ± 0.242
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.066 ± 0.628
IleCys: 0.606 ± 0.248
IleAsp: 3.806 ± 0.48
IleGlu: 4.758 ± 0.522
IlePhe: 1.384 ± 0.321
IleGly: 4.239 ± 0.681
IleHis: 0.692 ± 0.228
IleIle: 4.152 ± 0.603
IleLys: 4.758 ± 0.557
IleLeu: 4.325 ± 0.536
IleMet: 0.865 ± 0.307
IleAsn: 2.076 ± 0.411
IlePro: 2.076 ± 0.58
IleGln: 2.336 ± 0.406
IleArg: 3.72 ± 0.65
IleSer: 3.374 ± 0.564
IleThr: 3.287 ± 0.771
IleVal: 3.028 ± 0.587
IleTrp: 0.519 ± 0.239
IleTyr: 2.249 ± 0.5
IleXaa: 0.0 ± 0.0
Lys
LysAla: 9.343 ± 1.31
LysCys: 0.606 ± 0.181
LysAsp: 3.201 ± 0.454
LysGlu: 3.806 ± 0.549
LysPhe: 2.682 ± 0.484
LysGly: 4.412 ± 0.692
LysHis: 0.865 ± 0.292
LysIle: 3.979 ± 0.62
LysLys: 5.363 ± 0.654
LysLeu: 5.709 ± 0.554
LysMet: 1.644 ± 0.334
LysAsn: 3.028 ± 0.505
LysPro: 2.336 ± 0.391
LysGln: 2.595 ± 0.537
LysArg: 4.498 ± 0.584
LysSer: 4.152 ± 0.744
LysThr: 4.671 ± 0.648
LysVal: 4.498 ± 0.563
LysTrp: 0.606 ± 0.276
LysTyr: 2.336 ± 0.41
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.958 ± 0.947
LeuCys: 0.952 ± 0.288
LeuAsp: 5.19 ± 0.655
LeuGlu: 6.661 ± 0.791
LeuPhe: 1.99 ± 0.415
LeuGly: 5.882 ± 0.703
LeuHis: 1.817 ± 0.354
LeuIle: 3.547 ± 0.74
LeuLys: 6.661 ± 1.035
LeuLeu: 7.093 ± 0.756
LeuMet: 2.249 ± 0.459
LeuAsn: 3.114 ± 0.514
LeuPro: 3.374 ± 0.448
LeuGln: 2.768 ± 0.571
LeuArg: 4.844 ± 0.683
LeuSer: 4.412 ± 0.644
LeuThr: 7.007 ± 0.693
LeuVal: 5.709 ± 0.754
LeuTrp: 0.692 ± 0.237
LeuTyr: 2.682 ± 0.465
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.768 ± 0.386
MetCys: 0.087 ± 0.086
MetAsp: 1.557 ± 0.373
MetGlu: 3.114 ± 0.449
MetPhe: 0.519 ± 0.2
MetGly: 1.038 ± 0.229
MetHis: 0.0 ± 0.0
MetIle: 1.125 ± 0.326
MetLys: 1.99 ± 0.463
MetLeu: 2.076 ± 0.372
MetMet: 0.865 ± 0.303
MetAsn: 1.298 ± 0.246
MetPro: 0.606 ± 0.214
MetGln: 0.952 ± 0.252
MetArg: 1.384 ± 0.377
MetSer: 1.557 ± 0.41
MetThr: 1.471 ± 0.42
MetVal: 0.865 ± 0.233
MetTrp: 0.087 ± 0.088
MetTyr: 0.606 ± 0.229
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.547 ± 0.58
AsnCys: 0.173 ± 0.125
AsnAsp: 2.682 ± 0.533
AsnGlu: 1.817 ± 0.368
AsnPhe: 1.211 ± 0.294
AsnGly: 4.066 ± 0.673
AsnHis: 0.606 ± 0.237
AsnIle: 1.817 ± 0.407
AsnLys: 2.509 ± 0.477
AsnLeu: 2.855 ± 0.492
AsnMet: 1.125 ± 0.331
AsnAsn: 1.557 ± 0.437
AsnPro: 1.384 ± 0.279
AsnGln: 1.038 ± 0.3
AsnArg: 1.99 ± 0.446
AsnSer: 1.99 ± 0.47
AsnThr: 1.903 ± 0.434
AsnVal: 2.595 ± 0.431
AsnTrp: 0.087 ± 0.084
AsnTyr: 0.865 ± 0.26
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.941 ± 0.506
ProCys: 0.26 ± 0.156
ProAsp: 2.509 ± 0.467
ProGlu: 4.239 ± 0.63
ProPhe: 1.298 ± 0.344
ProGly: 2.076 ± 0.456
ProHis: 0.692 ± 0.237
ProIle: 1.903 ± 0.428
ProLys: 1.903 ± 0.387
ProLeu: 2.941 ± 0.443
ProMet: 0.692 ± 0.24
ProAsn: 0.865 ± 0.281
ProPro: 1.73 ± 0.543
ProGln: 1.384 ± 0.374
ProArg: 2.249 ± 0.518
ProSer: 2.422 ± 0.471
ProThr: 1.73 ± 0.6
ProVal: 2.509 ± 0.399
ProTrp: 0.26 ± 0.174
ProTyr: 1.125 ± 0.296
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.336 ± 0.407
GlnCys: 0.087 ± 0.088
GlnAsp: 1.211 ± 0.286
GlnGlu: 3.114 ± 0.503
GlnPhe: 1.125 ± 0.276
GlnGly: 2.422 ± 0.469
GlnHis: 0.952 ± 0.252
GlnIle: 2.768 ± 0.547
GlnLys: 3.633 ± 0.649
GlnLeu: 2.509 ± 0.464
GlnMet: 1.298 ± 0.25
GlnAsn: 1.125 ± 0.294
GlnPro: 0.865 ± 0.239
GlnGln: 1.211 ± 0.264
GlnArg: 2.336 ± 0.479
GlnSer: 1.903 ± 0.435
GlnThr: 2.422 ± 0.542
GlnVal: 1.99 ± 0.35
GlnTrp: 0.433 ± 0.155
GlnTyr: 1.038 ± 0.355
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.536 ± 0.784
ArgCys: 0.779 ± 0.293
ArgAsp: 2.855 ± 0.452
ArgGlu: 6.401 ± 0.909
ArgPhe: 2.682 ± 0.541
ArgGly: 3.46 ± 0.509
ArgHis: 0.606 ± 0.22
ArgIle: 3.287 ± 0.514
ArgLys: 4.239 ± 0.641
ArgLeu: 4.498 ± 0.752
ArgMet: 1.038 ± 0.298
ArgAsn: 1.644 ± 0.377
ArgPro: 1.99 ± 0.409
ArgGln: 1.817 ± 0.406
ArgArg: 4.498 ± 0.862
ArgSer: 3.287 ± 0.382
ArgThr: 3.287 ± 0.608
ArgVal: 4.325 ± 0.61
ArgTrp: 0.606 ± 0.205
ArgTyr: 2.076 ± 0.412
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.46 ± 0.645
SerCys: 0.606 ± 0.205
SerAsp: 3.893 ± 0.797
SerGlu: 4.152 ± 0.561
SerPhe: 2.076 ± 0.459
SerGly: 4.585 ± 0.863
SerHis: 0.519 ± 0.213
SerIle: 3.201 ± 0.36
SerLys: 4.931 ± 0.589
SerLeu: 4.585 ± 0.844
SerMet: 1.384 ± 0.378
SerAsn: 2.076 ± 0.357
SerPro: 2.163 ± 0.463
SerGln: 1.817 ± 0.4
SerArg: 3.46 ± 0.811
SerSer: 3.287 ± 0.551
SerThr: 2.595 ± 0.546
SerVal: 4.671 ± 0.754
SerTrp: 0.519 ± 0.227
SerTyr: 2.076 ± 0.418
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.093 ± 0.925
ThrCys: 0.087 ± 0.084
ThrAsp: 3.201 ± 0.56
ThrGlu: 4.412 ± 0.527
ThrPhe: 3.201 ± 0.46
ThrGly: 5.19 ± 0.688
ThrHis: 0.433 ± 0.178
ThrIle: 4.066 ± 1.006
ThrLys: 4.066 ± 0.71
ThrLeu: 5.277 ± 0.582
ThrMet: 1.125 ± 0.365
ThrAsn: 2.768 ± 0.605
ThrPro: 2.855 ± 0.496
ThrGln: 1.99 ± 0.484
ThrArg: 3.547 ± 0.544
ThrSer: 2.682 ± 0.552
ThrThr: 2.509 ± 0.487
ThrVal: 5.536 ± 0.838
ThrTrp: 0.952 ± 0.393
ThrTyr: 1.903 ± 0.352
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.969 ± 0.695
ValCys: 1.125 ± 0.374
ValAsp: 3.979 ± 0.769
ValGlu: 5.796 ± 0.656
ValPhe: 2.509 ± 0.485
ValGly: 3.806 ± 0.574
ValHis: 0.692 ± 0.195
ValIle: 4.671 ± 0.608
ValLys: 3.979 ± 0.5
ValLeu: 5.017 ± 0.614
ValMet: 1.384 ± 0.348
ValAsn: 3.547 ± 0.534
ValPro: 2.768 ± 0.458
ValGln: 1.817 ± 0.367
ValArg: 4.152 ± 0.47
ValSer: 5.363 ± 0.631
ValThr: 5.017 ± 0.712
ValVal: 3.893 ± 0.77
ValTrp: 0.606 ± 0.179
ValTyr: 2.336 ± 0.389
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.211 ± 0.266
TrpCys: 0.087 ± 0.087
TrpAsp: 1.038 ± 0.353
TrpGlu: 0.865 ± 0.233
TrpPhe: 0.779 ± 0.274
TrpGly: 0.779 ± 0.229
TrpHis: 0.173 ± 0.119
TrpIle: 0.26 ± 0.144
TrpLys: 0.519 ± 0.192
TrpLeu: 1.211 ± 0.322
TrpMet: 0.173 ± 0.119
TrpAsn: 0.087 ± 0.088
TrpPro: 0.26 ± 0.198
TrpGln: 0.433 ± 0.163
TrpArg: 0.606 ± 0.264
TrpSer: 0.346 ± 0.126
TrpThr: 0.606 ± 0.223
TrpVal: 0.865 ± 0.26
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.346 ± 0.149
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.46 ± 0.449
TyrCys: 0.173 ± 0.12
TyrAsp: 2.682 ± 0.466
TyrGlu: 2.249 ± 0.397
TyrPhe: 0.865 ± 0.302
TyrGly: 2.855 ± 0.496
TyrHis: 0.779 ± 0.268
TyrIle: 2.163 ± 0.509
TyrLys: 2.163 ± 0.459
TyrLeu: 2.595 ± 0.628
TyrMet: 0.779 ± 0.231
TyrAsn: 1.211 ± 0.31
TyrPro: 1.038 ± 0.241
TyrGln: 1.384 ± 0.382
TyrArg: 1.903 ± 0.415
TyrSer: 1.644 ± 0.345
TyrThr: 3.114 ± 0.455
TyrVal: 1.384 ± 0.345
TyrTrp: 0.346 ± 0.164
TyrTyr: 1.384 ± 0.32
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (11561 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
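A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions, not the procedure used for this page: the function names are hypothetical, whole proteins are resampled with replacement, pairs are counted within each protein, and the reported error is assumed to be the standard deviation across resamples (the page does not say which statistic is used).

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical one-letter sequences, not from this phage).
toy_proteome = ["AAAA", "KKKK", "AKAK"]
toy_error = bootstrap_error(toy_proteome, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.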

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski