Amino acid dipeptide frequency for Ralstonia phage PE226

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
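The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function name `dipeptide_permille`, the choice to count overlapping pairs within each protein, and the mapping of every non-standard letter to `X` (Xaa) are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumptions (not taken from the source): overlapping pairs are counted
    within each protein only, and non-standard letters are mapped to 'X'.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}

# Uniform expectation over the 400 standard dipeptides, in per mille.
expected = 1000.0 / 400

# Toy input (hypothetical sequences, not the phage proteome).
freqs = dipeptide_permille(["MAAGT", "AAXL"])
over = sorted(p for p, f in freqs.items() if f > expected)
```

With the toy input, `over` lists the dipeptides whose frequency exceeds the uniform expectation, which corresponds to the cells that would be marked in red.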

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.817 ± 3.754
AlaCys: 3.454 ± 1.215
AlaAsp: 7.484 ± 1.513
AlaGlu: 3.454 ± 0.875
AlaPhe: 2.879 ± 1.133
AlaGly: 10.938 ± 2.06
AlaHis: 2.879 ± 1.378
AlaIle: 7.484 ± 2.565
AlaLys: 3.454 ± 1.73
AlaLeu: 12.09 ± 3.538
AlaMet: 2.303 ± 1.163
AlaAsn: 2.879 ± 0.707
AlaPro: 4.606 ± 1.608
AlaGln: 4.03 ± 0.927
AlaArg: 4.606 ± 1.18
AlaSer: 10.363 ± 3.219
AlaThr: 4.606 ± 1.845
AlaVal: 8.636 ± 4.822
AlaTrp: 1.727 ± 1.776
AlaTyr: 4.606 ± 1.34
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 4.606 ± 2.345
CysCys: 0.0 ± 0.0
CysAsp: 2.879 ± 1.17
CysGlu: 1.151 ± 0.732
CysPhe: 0.576 ± 1.035
CysGly: 1.151 ± 0.541
CysHis: 0.576 ± 0.453
CysIle: 0.576 ± 0.644
CysLys: 1.151 ± 0.895
CysLeu: 1.151 ± 0.895
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.727 ± 0.896
CysGln: 1.151 ± 0.541
CysArg: 0.0 ± 0.0
CysSer: 1.151 ± 0.895
CysThr: 2.303 ± 1.135
CysVal: 3.454 ± 0.966
CysTrp: 0.576 ± 0.516
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.636 ± 1.811
AspCys: 0.576 ± 0.447
AspAsp: 0.0 ± 0.0
AspGlu: 4.03 ± 2.757
AspPhe: 1.727 ± 1.041
AspGly: 5.181 ± 1.204
AspHis: 0.576 ± 0.453
AspIle: 0.576 ± 0.447
AspLys: 0.576 ± 0.447
AspLeu: 5.181 ± 1.589
AspMet: 0.0 ± 0.0
AspAsn: 1.727 ± 0.885
AspPro: 4.606 ± 1.432
AspGln: 2.879 ± 0.712
AspArg: 1.151 ± 0.541
AspSer: 2.879 ± 1.018
AspThr: 1.727 ± 1.086
AspVal: 1.727 ± 1.28
AspTrp: 1.727 ± 1.359
AspTyr: 1.151 ± 0.541
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.03 ± 1.62
GluCys: 1.151 ± 1.031
GluAsp: 1.727 ± 0.68
GluGlu: 1.151 ± 0.958
GluPhe: 1.727 ± 1.359
GluGly: 2.303 ± 1.399
GluHis: 1.151 ± 0.55
GluIle: 1.727 ± 1.359
GluLys: 1.727 ± 0.68
GluLeu: 4.03 ± 2.272
GluMet: 0.576 ± 0.748
GluAsn: 2.303 ± 1.025
GluPro: 1.151 ± 0.765
GluGln: 2.879 ± 1.341
GluArg: 2.303 ± 1.267
GluSer: 2.303 ± 0.769
GluThr: 2.303 ± 0.958
GluVal: 4.606 ± 2.396
GluTrp: 0.576 ± 0.644
GluTyr: 1.151 ± 0.765
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.727 ± 1.359
PheCys: 0.576 ± 0.453
PheAsp: 3.454 ± 0.966
PheGlu: 1.151 ± 0.55
PhePhe: 2.879 ± 1.133
PheGly: 4.03 ± 1.692
PheHis: 0.576 ± 0.453
PheIle: 1.151 ± 0.734
PheLys: 1.151 ± 1.031
PheLeu: 2.303 ± 1.897
PheMet: 2.303 ± 1.056
PheAsn: 0.576 ± 0.453
PhePro: 2.303 ± 0.861
PheGln: 1.727 ± 2.102
PheArg: 1.151 ± 0.958
PheSer: 2.303 ± 1.081
PheThr: 0.576 ± 0.453
PheVal: 1.727 ± 1.425
PheTrp: 0.576 ± 0.644
PheTyr: 0.576 ± 0.516
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 14.393 ± 3.572
GlyCys: 1.727 ± 0.866
GlyAsp: 3.454 ± 1.659
GlyGlu: 5.757 ± 2.849
GlyPhe: 4.03 ± 2.25
GlyGly: 9.787 ± 2.497
GlyHis: 0.0 ± 0.0
GlyIle: 4.03 ± 1.788
GlyLys: 5.181 ± 1.387
GlyLeu: 4.03 ± 2.328
GlyMet: 1.151 ± 0.986
GlyAsn: 2.879 ± 1.17
GlyPro: 3.454 ± 2.155
GlyGln: 3.454 ± 0.966
GlyArg: 6.908 ± 2.408
GlySer: 8.06 ± 2.504
GlyThr: 6.908 ± 1.83
GlyVal: 9.211 ± 1.518
GlyTrp: 1.151 ± 0.578
GlyTyr: 2.879 ± 1.325
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.151 ± 1.257
HisCys: 0.0 ± 0.0
HisAsp: 1.151 ± 0.906
HisGlu: 1.151 ± 0.55
HisPhe: 0.0 ± 0.0
HisGly: 1.151 ± 0.55
HisHis: 0.0 ± 0.0
HisIle: 0.576 ± 0.516
HisLys: 1.151 ± 0.55
HisLeu: 3.454 ± 1.377
HisMet: 0.576 ± 0.453
HisAsn: 0.0 ± 0.0
HisPro: 0.576 ± 1.035
HisGln: 0.0 ± 0.0
HisArg: 1.727 ± 0.965
HisSer: 0.0 ± 0.0
HisThr: 1.151 ± 0.578
HisVal: 1.727 ± 0.965
HisTrp: 0.0 ± 0.0
HisTyr: 0.576 ± 0.453
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.908 ± 1.698
IleCys: 1.151 ± 0.578
IleAsp: 1.727 ± 0.639
IleGlu: 1.727 ± 1.359
IlePhe: 0.0 ± 0.0
IleGly: 4.606 ± 1.967
IleHis: 0.0 ± 0.0
IleIle: 0.576 ± 0.516
IleLys: 2.879 ± 1.757
IleLeu: 2.879 ± 0.998
IleMet: 0.576 ± 0.447
IleAsn: 0.576 ± 0.666
IlePro: 4.03 ± 1.265
IleGln: 1.151 ± 1.274
IleArg: 4.03 ± 1.285
IleSer: 1.727 ± 0.965
IleThr: 1.727 ± 0.507
IleVal: 6.333 ± 3.022
IleTrp: 1.151 ± 0.578
IleTyr: 0.576 ± 0.666
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.879 ± 0.712
LysCys: 0.576 ± 0.666
LysAsp: 1.151 ± 0.541
LysGlu: 2.303 ± 0.876
LysPhe: 0.576 ± 0.64
LysGly: 3.454 ± 1.817
LysHis: 0.576 ± 0.453
LysIle: 0.576 ± 0.702
LysLys: 1.151 ± 0.732
LysLeu: 2.879 ± 0.913
LysMet: 2.303 ± 1.593
LysAsn: 0.0 ± 0.0
LysPro: 4.03 ± 1.221
LysGln: 2.303 ± 0.994
LysArg: 3.454 ± 1.86
LysSer: 3.454 ± 1.256
LysThr: 1.727 ± 1.077
LysVal: 2.303 ± 1.522
LysTrp: 0.576 ± 0.644
LysTyr: 1.151 ± 0.55
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.938 ± 3.219
LeuCys: 1.727 ± 1.128
LeuAsp: 4.03 ± 1.539
LeuGlu: 4.606 ± 1.482
LeuPhe: 2.303 ± 1.858
LeuGly: 5.181 ± 1.372
LeuHis: 1.727 ± 0.961
LeuIle: 3.454 ± 1.909
LeuLys: 3.454 ± 0.847
LeuLeu: 7.484 ± 3.484
LeuMet: 3.454 ± 2.248
LeuAsn: 1.151 ± 0.895
LeuPro: 3.454 ± 1.369
LeuGln: 2.879 ± 1.715
LeuArg: 6.908 ± 3.267
LeuSer: 3.454 ± 1.543
LeuThr: 4.03 ± 1.643
LeuVal: 7.484 ± 1.744
LeuTrp: 0.576 ± 0.453
LeuTyr: 0.576 ± 0.516
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.727 ± 0.808
MetCys: 0.576 ± 0.447
MetAsp: 0.0 ± 0.0
MetGlu: 1.151 ± 1.331
MetPhe: 0.0 ± 0.0
MetGly: 0.576 ± 0.447
MetHis: 1.151 ± 1.243
MetIle: 1.727 ± 1.559
MetLys: 1.727 ± 1.084
MetLeu: 2.879 ± 0.814
MetMet: 0.0 ± 0.0
MetAsn: 0.576 ± 0.702
MetPro: 1.727 ± 1.123
MetGln: 2.303 ± 0.958
MetArg: 3.454 ± 1.376
MetSer: 1.727 ± 1.344
MetThr: 0.576 ± 0.516
MetVal: 1.151 ± 0.765
MetTrp: 0.576 ± 0.64
MetTyr: 1.151 ± 0.738
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.454 ± 0.832
AsnCys: 2.303 ± 1.79
AsnAsp: 0.576 ± 0.447
AsnGlu: 1.151 ± 0.541
AsnPhe: 0.0 ± 0.0
AsnGly: 4.03 ± 0.814
AsnHis: 0.576 ± 0.516
AsnIle: 0.576 ± 0.447
AsnLys: 0.576 ± 0.453
AsnLeu: 0.576 ± 0.516
AsnMet: 1.151 ± 0.715
AsnAsn: 0.576 ± 0.666
AsnPro: 1.727 ± 1.086
AsnGln: 1.151 ± 1.031
AsnArg: 0.0 ± 0.0
AsnSer: 1.727 ± 1.086
AsnThr: 0.576 ± 0.447
AsnVal: 3.454 ± 1.307
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.333 ± 2.667
ProCys: 0.0 ± 0.0
ProAsp: 1.151 ± 0.541
ProGlu: 2.879 ± 1.465
ProPhe: 1.727 ± 1.51
ProGly: 4.03 ± 2.378
ProHis: 0.576 ± 0.453
ProIle: 2.879 ± 1.195
ProLys: 1.727 ± 0.507
ProLeu: 1.727 ± 0.852
ProMet: 2.303 ± 1.147
ProAsn: 1.151 ± 0.895
ProPro: 2.303 ± 1.14
ProGln: 2.879 ± 0.721
ProArg: 5.181 ± 1.767
ProSer: 5.757 ± 2.499
ProThr: 5.757 ± 1.825
ProVal: 4.606 ± 1.408
ProTrp: 0.576 ± 0.516
ProTyr: 1.151 ± 0.895
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.727 ± 0.964
GlnCys: 1.151 ± 0.827
GlnAsp: 3.454 ± 1.478
GlnGlu: 1.727 ± 1.547
GlnPhe: 2.303 ± 1.225
GlnGly: 4.03 ± 1.412
GlnHis: 0.576 ± 0.644
GlnIle: 2.303 ± 1.486
GlnLys: 4.03 ± 1.316
GlnLeu: 3.454 ± 1.093
GlnMet: 0.576 ± 0.453
GlnAsn: 1.151 ± 0.578
GlnPro: 3.454 ± 1.529
GlnGln: 3.454 ± 1.334
GlnArg: 2.879 ± 1.13
GlnSer: 1.151 ± 0.895
GlnThr: 2.303 ± 1.293
GlnVal: 3.454 ± 1.256
GlnTrp: 1.727 ± 1.01
GlnTyr: 1.151 ± 0.578
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.181 ± 1.841
ArgCys: 1.727 ± 0.883
ArgAsp: 3.454 ± 2.122
ArgGlu: 1.727 ± 0.866
ArgPhe: 1.151 ± 0.738
ArgGly: 6.908 ± 2.594
ArgHis: 0.576 ± 0.516
ArgIle: 2.879 ± 1.497
ArgLys: 2.879 ± 1.438
ArgLeu: 2.303 ± 1.034
ArgMet: 1.151 ± 0.765
ArgAsn: 0.576 ± 0.516
ArgPro: 3.454 ± 1.368
ArgGln: 2.303 ± 1.1
ArgArg: 4.03 ± 2.155
ArgSer: 3.454 ± 1.632
ArgThr: 2.879 ± 0.951
ArgVal: 8.636 ± 2.468
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.576 ± 0.453
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.211 ± 2.118
SerCys: 1.727 ± 0.965
SerAsp: 2.303 ± 0.98
SerGlu: 0.576 ± 0.447
SerPhe: 1.727 ± 1.342
SerGly: 10.363 ± 3.184
SerHis: 1.151 ± 0.541
SerIle: 4.606 ± 1.201
SerLys: 1.151 ± 0.732
SerLeu: 2.879 ± 1.89
SerMet: 0.576 ± 0.702
SerAsn: 2.303 ± 1.147
SerPro: 2.879 ± 1.394
SerGln: 1.727 ± 0.964
SerArg: 1.727 ± 1.0
SerSer: 6.908 ± 3.519
SerThr: 3.454 ± 0.793
SerVal: 5.757 ± 1.878
SerTrp: 1.727 ± 1.059
SerTyr: 1.727 ± 0.896
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.606 ± 2.432
ThrCys: 2.879 ± 1.876
ThrAsp: 2.303 ± 1.023
ThrGlu: 1.151 ± 0.55
ThrPhe: 2.303 ± 0.769
ThrGly: 8.636 ± 3.08
ThrHis: 1.151 ± 1.031
ThrIle: 4.03 ± 1.769
ThrLys: 0.576 ± 0.516
ThrLeu: 6.908 ± 2.041
ThrMet: 1.727 ± 1.359
ThrAsn: 1.727 ± 0.507
ThrPro: 2.879 ± 1.55
ThrGln: 4.03 ± 1.919
ThrArg: 1.151 ± 0.541
ThrSer: 3.454 ± 1.662
ThrThr: 15.544 ± 8.608
ThrVal: 1.727 ± 0.507
ThrTrp: 1.151 ± 0.732
ThrTyr: 1.727 ± 0.964
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.938 ± 2.464
ValCys: 2.303 ± 1.322
ValAsp: 4.606 ± 1.452
ValGlu: 2.879 ± 1.318
ValPhe: 3.454 ± 1.256
ValGly: 9.211 ± 3.778
ValHis: 1.727 ± 0.965
ValIle: 4.03 ± 1.266
ValLys: 1.727 ± 1.077
ValLeu: 8.06 ± 1.728
ValMet: 1.151 ± 0.732
ValAsn: 2.879 ± 1.394
ValPro: 5.181 ± 1.772
ValGln: 4.03 ± 2.701
ValArg: 4.606 ± 2.2
ValSer: 2.303 ± 2.081
ValThr: 6.908 ± 3.512
ValVal: 5.757 ± 2.465
ValTrp: 0.576 ± 0.914
ValTyr: 2.303 ± 0.551
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.879 ± 1.692
TrpCys: 0.0 ± 0.0
TrpAsp: 0.576 ± 0.644
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.151 ± 0.55
TrpGly: 1.151 ± 1.614
TrpHis: 0.576 ± 0.516
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 2.303 ± 1.475
TrpMet: 1.151 ± 0.69
TrpAsn: 0.576 ± 0.447
TrpPro: 0.0 ± 0.0
TrpGln: 1.151 ± 0.895
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 2.303 ± 1.156
TrpVal: 1.151 ± 1.046
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.576 ± 0.516
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.151 ± 0.738
TyrCys: 0.576 ± 0.447
TyrAsp: 1.151 ± 0.578
TyrGlu: 1.727 ± 1.359
TyrPhe: 2.303 ± 1.307
TyrGly: 1.727 ± 0.507
TyrHis: 0.0 ± 0.0
TyrIle: 0.576 ± 0.644
TyrLys: 1.151 ± 0.55
TyrLeu: 2.303 ± 1.507
TyrMet: 1.151 ± 0.671
TyrAsn: 0.576 ± 0.702
TyrPro: 1.151 ± 0.578
TyrGln: 0.576 ± 0.447
TyrArg: 0.576 ± 0.453
TyrSer: 2.303 ± 1.322
TyrThr: 2.303 ± 1.156
TyrVal: 1.727 ± 0.756
TyrTrp: 0.576 ± 0.447
TyrTyr: 0.576 ± 0.516
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (1738 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
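A protein-level bootstrap of this kind might look like the sketch below. It is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the statistic is recomputed for each replicate, and the sample standard deviation across replicates is reported as the error. The helper names `pair_permille` and `bootstrap_error`, the fixed seed, and the use of the standard deviation are all hypothetical choices.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the error is the sample standard deviation across
    replicates; the source does not state which spread measure it uses.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_permille(sample, pair))
    return statistics.stdev(values)

# Toy input (hypothetical sequences, not the phage proteome).
toy = ["MAAG", "GTTL", "AAAA"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only a handful of proteins the error can be large relative to the frequency itself.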

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski