Amino acid dipeptide frequency for Hydrogenispora ethanolica

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
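
The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual pipeline: it assumes that dipeptides are counted as overlapping residue pairs within each protein, it uses one-letter codes, and the function name `dipeptide_per_mille` and the toy sequences are made up for the example.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted inside each protein only and never span
    the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Expectation if all 400 standard dipeptides were equally likely:
# 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

toy_proteins = ["MALLA", "MKLAL"]  # hypothetical sequences, not real data
freqs = dipeptide_per_mille(toy_proteins)

for pair, value in sorted(freqs.items()):
    status = "over" if value > UNIFORM_PER_MILLE else "under"
    # value * 1e-3 converts per mille back to a plain fraction
    print(f"{pair}: {value:.1f} per mille ({value * 1e-3:.3f}), {status}-represented")
```

With only eight residue pairs in the toy input, every observed dipeptide is far above 2.5‰; on a full proteome the values fall in the range shown in the table below.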

For more information, see the sequence space article on Wikipedia.

Residues: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Each group below is headed by the first residue of the dipeptide; entries give dipeptide: frequency ± error (‰).

Ala
AlaAla: 11.922 ± 0.135
AlaCys: 0.887 ± 0.027
AlaAsp: 4.465 ± 0.065
AlaGlu: 6.678 ± 0.082
AlaPhe: 3.414 ± 0.051
AlaGly: 8.663 ± 0.096
AlaHis: 1.381 ± 0.029
AlaIle: 6.019 ± 0.066
AlaLys: 4.554 ± 0.062
AlaLeu: 9.828 ± 0.094
AlaMet: 2.406 ± 0.046
AlaAsn: 2.825 ± 0.048
AlaPro: 3.355 ± 0.045
AlaGln: 3.135 ± 0.041
AlaArg: 5.078 ± 0.058
AlaSer: 4.097 ± 0.056
AlaThr: 4.196 ± 0.071
AlaVal: 7.61 ± 0.078
AlaTrp: 1.017 ± 0.029
AlaTyr: 2.434 ± 0.037
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.77 ± 0.024
CysCys: 0.207 ± 0.011
CysAsp: 0.525 ± 0.019
CysGlu: 0.48 ± 0.017
CysPhe: 0.53 ± 0.018
CysGly: 1.152 ± 0.024
CysHis: 0.28 ± 0.015
CysIle: 0.624 ± 0.024
CysLys: 0.383 ± 0.017
CysLeu: 1.094 ± 0.028
CysMet: 0.215 ± 0.012
CysAsn: 0.384 ± 0.015
CysPro: 0.623 ± 0.021
CysGln: 0.437 ± 0.015
CysArg: 0.794 ± 0.024
CysSer: 0.611 ± 0.021
CysThr: 0.44 ± 0.017
CysVal: 0.564 ± 0.019
CysTrp: 0.148 ± 0.009
CysTyr: 0.368 ± 0.015
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.673 ± 0.057
AspCys: 0.592 ± 0.02
AspAsp: 2.013 ± 0.039
AspGlu: 3.087 ± 0.048
AspPhe: 2.35 ± 0.038
AspGly: 3.952 ± 0.057
AspHis: 0.916 ± 0.026
AspIle: 2.876 ± 0.05
AspLys: 1.842 ± 0.045
AspLeu: 5.549 ± 0.055
AspMet: 0.868 ± 0.024
AspAsn: 1.514 ± 0.032
AspPro: 2.91 ± 0.043
AspGln: 1.893 ± 0.034
AspArg: 3.293 ± 0.048
AspSer: 2.618 ± 0.042
AspThr: 1.842 ± 0.035
AspVal: 2.576 ± 0.043
AspTrp: 0.804 ± 0.024
AspTyr: 1.981 ± 0.04
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.118 ± 0.076
GluCys: 0.533 ± 0.017
GluAsp: 2.347 ± 0.046
GluGlu: 4.299 ± 0.087
GluPhe: 2.343 ± 0.042
GluGly: 3.535 ± 0.05
GluHis: 1.144 ± 0.026
GluIle: 4.742 ± 0.061
GluLys: 3.635 ± 0.059
GluLeu: 7.517 ± 0.083
GluMet: 1.701 ± 0.036
GluAsn: 2.589 ± 0.042
GluPro: 2.468 ± 0.046
GluGln: 2.77 ± 0.041
GluArg: 4.714 ± 0.071
GluSer: 3.017 ± 0.044
GluThr: 3.259 ± 0.047
GluVal: 4.133 ± 0.048
GluTrp: 0.774 ± 0.024
GluTyr: 2.169 ± 0.038
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.406 ± 0.043
PheCys: 0.572 ± 0.018
PheAsp: 2.246 ± 0.044
PheGlu: 2.256 ± 0.04
PhePhe: 2.12 ± 0.041
PheGly: 3.565 ± 0.04
PheHis: 0.896 ± 0.023
PheIle: 2.791 ± 0.046
PheLys: 1.939 ± 0.035
PheLeu: 4.593 ± 0.06
PheMet: 1.02 ± 0.032
PheAsn: 1.597 ± 0.032
PhePro: 1.9 ± 0.039
PheGln: 1.846 ± 0.034
PheArg: 2.38 ± 0.043
PheSer: 2.757 ± 0.045
PheThr: 2.248 ± 0.038
PheVal: 2.488 ± 0.04
PheTrp: 0.646 ± 0.021
PheTyr: 1.586 ± 0.032
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.794 ± 0.112
GlyCys: 1.005 ± 0.03
GlyAsp: 3.371 ± 0.056
GlyGlu: 4.434 ± 0.052
GlyPhe: 3.566 ± 0.051
GlyGly: 5.69 ± 0.067
GlyHis: 1.331 ± 0.033
GlyIle: 5.939 ± 0.059
GlyLys: 4.343 ± 0.045
GlyLeu: 8.399 ± 0.079
GlyMet: 2.295 ± 0.041
GlyAsn: 2.649 ± 0.047
GlyPro: 2.729 ± 0.05
GlyGln: 2.674 ± 0.047
GlyArg: 4.742 ± 0.056
GlySer: 4.141 ± 0.059
GlyThr: 4.005 ± 0.059
GlyVal: 5.795 ± 0.055
GlyTrp: 1.129 ± 0.031
GlyTyr: 2.933 ± 0.043
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.228 ± 0.028
HisCys: 0.329 ± 0.016
HisAsp: 0.866 ± 0.023
HisGlu: 0.992 ± 0.03
HisPhe: 0.957 ± 0.024
HisGly: 1.515 ± 0.035
HisHis: 0.544 ± 0.017
HisIle: 0.986 ± 0.025
HisLys: 0.592 ± 0.018
HisLeu: 1.912 ± 0.037
HisMet: 0.292 ± 0.015
HisAsn: 0.59 ± 0.019
HisPro: 1.295 ± 0.029
HisGln: 0.895 ± 0.022
HisArg: 1.203 ± 0.034
HisSer: 1.118 ± 0.032
HisThr: 0.762 ± 0.019
HisVal: 0.859 ± 0.024
HisTrp: 0.319 ± 0.015
HisTyr: 0.765 ± 0.022
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.753 ± 0.08
IleCys: 0.781 ± 0.026
IleAsp: 3.455 ± 0.046
IleGlu: 3.744 ± 0.057
IlePhe: 2.72 ± 0.049
IleGly: 5.423 ± 0.074
IleHis: 1.356 ± 0.034
IleIle: 4.334 ± 0.072
IleLys: 3.048 ± 0.053
IleLeu: 6.703 ± 0.08
IleMet: 1.394 ± 0.028
IleAsn: 2.364 ± 0.043
IlePro: 3.467 ± 0.049
IleGln: 2.509 ± 0.037
IleArg: 4.191 ± 0.05
IleSer: 3.662 ± 0.053
IleThr: 3.467 ± 0.057
IleVal: 4.411 ± 0.069
IleTrp: 0.667 ± 0.021
IleTyr: 2.043 ± 0.037
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.331 ± 0.061
LysCys: 0.344 ± 0.017
LysAsp: 2.222 ± 0.041
LysGlu: 3.678 ± 0.06
LysPhe: 1.61 ± 0.03
LysGly: 3.163 ± 0.049
LysHis: 0.774 ± 0.021
LysIle: 3.545 ± 0.053
LysLys: 2.927 ± 0.053
LysLeu: 5.085 ± 0.055
LysMet: 1.35 ± 0.035
LysAsn: 2.225 ± 0.045
LysPro: 2.058 ± 0.034
LysGln: 1.949 ± 0.038
LysArg: 2.63 ± 0.046
LysSer: 2.329 ± 0.04
LysThr: 2.823 ± 0.049
LysVal: 3.672 ± 0.051
LysTrp: 0.49 ± 0.016
LysTyr: 1.687 ± 0.033
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 11.293 ± 0.109
LeuCys: 1.082 ± 0.032
LeuAsp: 5.2 ± 0.053
LeuGlu: 6.948 ± 0.082
LeuPhe: 4.475 ± 0.064
LeuGly: 8.0 ± 0.084
LeuHis: 1.718 ± 0.035
LeuIle: 6.92 ± 0.087
LeuLys: 5.771 ± 0.063
LeuLeu: 11.609 ± 0.129
LeuMet: 2.381 ± 0.034
LeuAsn: 4.167 ± 0.06
LeuPro: 5.135 ± 0.062
LeuGln: 4.108 ± 0.057
LeuArg: 6.403 ± 0.079
LeuSer: 6.425 ± 0.079
LeuThr: 5.857 ± 0.069
LeuVal: 7.138 ± 0.069
LeuTrp: 1.159 ± 0.032
LeuTyr: 3.009 ± 0.045
LeuXaa: 0.001 ± 0.001

Met
MetAla: 2.626 ± 0.042
MetCys: 0.125 ± 0.009
MetAsp: 1.126 ± 0.026
MetGlu: 1.572 ± 0.037
MetPhe: 0.74 ± 0.025
MetGly: 1.712 ± 0.033
MetHis: 0.301 ± 0.014
MetIle: 1.735 ± 0.037
MetLys: 1.6 ± 0.033
MetLeu: 2.439 ± 0.039
MetMet: 0.71 ± 0.024
MetAsn: 1.13 ± 0.027
MetPro: 1.021 ± 0.027
MetGln: 0.705 ± 0.021
MetArg: 1.145 ± 0.026
MetSer: 1.275 ± 0.031
MetThr: 1.328 ± 0.033
MetVal: 1.928 ± 0.034
MetTrp: 0.169 ± 0.01
MetTyr: 0.519 ± 0.018
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.703 ± 0.042
AsnCys: 0.433 ± 0.016
AsnAsp: 1.696 ± 0.038
AsnGlu: 2.048 ± 0.042
AsnPhe: 1.566 ± 0.029
AsnGly: 2.893 ± 0.052
AsnHis: 0.853 ± 0.024
AsnIle: 2.466 ± 0.041
AsnLys: 1.413 ± 0.036
AsnLeu: 4.14 ± 0.054
AsnMet: 0.784 ± 0.023
AsnAsn: 1.295 ± 0.039
AsnPro: 2.375 ± 0.042
AsnGln: 1.723 ± 0.035
AsnArg: 2.391 ± 0.04
AsnSer: 2.03 ± 0.038
AsnThr: 1.679 ± 0.031
AsnVal: 2.084 ± 0.041
AsnTrp: 0.516 ± 0.021
AsnTyr: 1.372 ± 0.031
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.662 ± 0.066
ProCys: 0.394 ± 0.016
ProAsp: 2.698 ± 0.043
ProGlu: 4.097 ± 0.057
ProPhe: 1.954 ± 0.037
ProGly: 4.119 ± 0.056
ProHis: 0.803 ± 0.027
ProIle: 2.216 ± 0.04
ProLys: 1.797 ± 0.036
ProLeu: 4.617 ± 0.061
ProMet: 0.917 ± 0.021
ProAsn: 1.461 ± 0.032
ProPro: 1.823 ± 0.041
ProGln: 1.709 ± 0.038
ProArg: 2.077 ± 0.037
ProSer: 2.071 ± 0.035
ProThr: 1.891 ± 0.045
ProVal: 3.864 ± 0.052
ProTrp: 0.606 ± 0.023
ProTyr: 1.449 ± 0.032
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.797 ± 0.052
GlnCys: 0.346 ± 0.015
GlnAsp: 1.552 ± 0.032
GlnGlu: 2.48 ± 0.04
GlnPhe: 1.636 ± 0.032
GlnGly: 2.638 ± 0.043
GlnHis: 0.581 ± 0.02
GlnIle: 2.615 ± 0.043
GlnLys: 2.278 ± 0.031
GlnLeu: 4.272 ± 0.056
GlnMet: 0.944 ± 0.026
GlnAsn: 1.588 ± 0.031
GlnPro: 1.793 ± 0.032
GlnGln: 1.624 ± 0.031
GlnArg: 2.321 ± 0.04
GlnSer: 2.078 ± 0.04
GlnThr: 2.003 ± 0.037
GlnVal: 2.634 ± 0.042
GlnTrp: 0.561 ± 0.02
GlnTyr: 1.293 ± 0.03
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.417 ± 0.058
ArgCys: 0.623 ± 0.02
ArgAsp: 2.887 ± 0.043
ArgGlu: 4.737 ± 0.067
ArgPhe: 2.823 ± 0.037
ArgGly: 3.613 ± 0.048
ArgHis: 1.121 ± 0.032
ArgIle: 4.469 ± 0.054
ArgLys: 2.937 ± 0.042
ArgLeu: 6.898 ± 0.08
ArgMet: 1.589 ± 0.03
ArgAsn: 2.289 ± 0.04
ArgPro: 2.49 ± 0.04
ArgGln: 2.753 ± 0.044
ArgArg: 3.876 ± 0.053
ArgSer: 3.17 ± 0.045
ArgThr: 2.69 ± 0.037
ArgVal: 3.931 ± 0.051
ArgTrp: 0.759 ± 0.023
ArgTyr: 2.069 ± 0.04
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.442 ± 0.052
SerCys: 0.624 ± 0.02
SerAsp: 2.556 ± 0.044
SerGlu: 3.039 ± 0.043
SerPhe: 2.798 ± 0.044
SerGly: 5.31 ± 0.063
SerHis: 0.985 ± 0.024
SerIle: 3.241 ± 0.057
SerLys: 2.5 ± 0.047
SerLeu: 6.103 ± 0.064
SerMet: 1.265 ± 0.028
SerAsn: 1.761 ± 0.04
SerPro: 2.235 ± 0.044
SerGln: 2.019 ± 0.039
SerArg: 3.142 ± 0.043
SerSer: 2.891 ± 0.051
SerThr: 2.329 ± 0.036
SerVal: 3.388 ± 0.049
SerTrp: 0.681 ± 0.02
SerTyr: 1.866 ± 0.039
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.977 ± 0.059
ThrCys: 0.405 ± 0.018
ThrAsp: 2.27 ± 0.04
ThrGlu: 2.739 ± 0.047
ThrPhe: 1.967 ± 0.032
ThrGly: 4.729 ± 0.09
ThrHis: 0.864 ± 0.024
ThrIle: 3.518 ± 0.051
ThrLys: 2.023 ± 0.037
ThrLeu: 5.327 ± 0.059
ThrMet: 1.146 ± 0.026
ThrAsn: 1.563 ± 0.038
ThrPro: 2.482 ± 0.044
ThrGln: 1.522 ± 0.033
ThrArg: 2.432 ± 0.04
ThrSer: 2.338 ± 0.048
ThrThr: 2.476 ± 0.053
ThrVal: 4.45 ± 0.06
ThrTrp: 0.531 ± 0.018
ThrTyr: 1.459 ± 0.032
ThrXaa: 0.001 ± 0.001

Val
ValAla: 6.77 ± 0.074
ValCys: 0.796 ± 0.022
ValAsp: 3.306 ± 0.05
ValGlu: 3.963 ± 0.055
ValPhe: 3.043 ± 0.048
ValGly: 4.921 ± 0.061
ValHis: 1.112 ± 0.026
ValIle: 4.955 ± 0.058
ValLys: 3.624 ± 0.049
ValLeu: 7.215 ± 0.073
ValMet: 1.727 ± 0.034
ValAsn: 2.631 ± 0.043
ValPro: 3.002 ± 0.042
ValGln: 2.278 ± 0.037
ValArg: 3.951 ± 0.049
ValSer: 3.982 ± 0.052
ValThr: 3.882 ± 0.06
ValVal: 5.017 ± 0.07
ValTrp: 0.815 ± 0.023
ValTyr: 2.093 ± 0.037
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.867 ± 0.026
TrpCys: 0.13 ± 0.009
TrpAsp: 0.617 ± 0.021
TrpGlu: 0.783 ± 0.022
TrpPhe: 0.556 ± 0.02
TrpGly: 0.896 ± 0.028
TrpHis: 0.221 ± 0.011
TrpIle: 0.71 ± 0.023
TrpLys: 0.575 ± 0.018
TrpLeu: 1.555 ± 0.038
TrpMet: 0.337 ± 0.015
TrpAsn: 0.613 ± 0.018
TrpPro: 0.468 ± 0.016
TrpGln: 0.615 ± 0.022
TrpArg: 0.857 ± 0.024
TrpSer: 0.693 ± 0.022
TrpThr: 0.576 ± 0.019
TrpVal: 0.828 ± 0.022
TrpTrp: 0.18 ± 0.011
TrpTyr: 0.37 ± 0.014
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.416 ± 0.041
TyrCys: 0.422 ± 0.017
TyrAsp: 1.732 ± 0.049
TyrGlu: 1.752 ± 0.034
TyrPhe: 1.686 ± 0.03
TyrGly: 2.476 ± 0.04
TyrHis: 0.836 ± 0.025
TyrIle: 1.792 ± 0.038
TyrLys: 1.113 ± 0.029
TyrLeu: 3.945 ± 0.051
TyrMet: 0.563 ± 0.019
TyrAsn: 1.224 ± 0.03
TyrPro: 1.597 ± 0.035
TyrGln: 1.871 ± 0.035
TyrArg: 2.519 ± 0.044
TyrSer: 1.869 ± 0.034
TyrThr: 1.463 ± 0.031
TyrVal: 1.712 ± 0.032
TyrTrp: 0.478 ± 0.016
TyrTyr: 1.277 ± 0.03
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.001 ± 0.001
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.001
XaaSer: 0.0 ± 0.0
XaaThr: 0.001 ± 0.001
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.009 ± 0.006

Statistics based on 5259 proteins (1646465 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
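
As a rough illustration of what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement and recomputes one dipeptide frequency for each replicate. It rests on assumptions the page does not confirm: that the reported error is the standard deviation across replicates, and that dipeptides are counted as overlapping pairs. The names `pair_counts` and `bootstrap_error` and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(sequences) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue pair.
    Assumption: the spread is summarised as a sample standard deviation.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteins = ["MALLA", "MKLAL", "AAAA"]  # hypothetical sequences, not real data
mean_al, err_al = bootstrap_error(toy_proteins, "AL")
print(f"AL: {mean_al:.1f} ± {err_al:.1f} per mille")
```

Fixing the seed keeps the toy run reproducible; with only three proteins the spread is large, whereas several thousand proteins give errors of the size listed in the table.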

You can download the above dipeptide statistics (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski