Amino acid dipeptide frequency for Natronincola ferrireducens

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
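The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual pipeline: sequences are assumed to be given in one-letter code, overlapping pairs are counted within each protein (never across protein boundaries), any non-standard letter is mapped to X (Xaa), and frequencies are normalized by the total number of dipeptides. The names `dipeptide_permille`, `classify` and `UNIFORM_PERMILLE` are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter code

# Uniform expectation over the 400 standard dipeptides: 1000 / 400 = 2.5 per mille
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and the denominator is the total number of pairs in the proteome.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            # Map anything outside the 20 standard residues to X (Xaa)
            first = first if first in STANDARD else "X"
            second = second if second in STANDARD else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked red
    if freq_permille < expected:
        return "under"  # would be marked blue
    return "as expected"


# Toy input, not real proteome data
toy_proteins = ["MKKLAA", "AAKK"]
toy_freqs = dipeptide_permille(toy_proteins)
for pair in sorted(toy_freqs):
    print(pair, toy_freqs[pair], classify(toy_freqs[pair]))
```

Applied to the values in the table, `classify(4.982)` labels AlaAla as overrepresented and `classify(0.16)` labels CysCys as underrepresented relative to the 2.5‰ uniform expectation.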

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
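As a small worked example of the unit conversion, the snippet below turns a per mille value from the table into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical and chosen only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (1 per mille = 0.1 %)."""
    return value_permille * 0.1


ala_ala_permille = 4.982   # AlaAla value taken from the table
uniform_permille = 2.5     # 1/400 expressed in per mille

print("fraction:", permille_to_fraction(ala_ala_permille))
print("percent:", permille_to_percent(ala_ala_permille))
print("ratio to uniform expectation:", ala_ala_permille / uniform_permille)
```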

For more information, see the sequence space article on Wikipedia.

Column order (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.982 ± 0.099
AlaCys: 0.662 ± 0.026
AlaAsp: 2.66 ± 0.056
AlaGlu: 3.98 ± 0.08
AlaPhe: 2.708 ± 0.069
AlaGly: 4.644 ± 0.099
AlaHis: 1.039 ± 0.041
AlaIle: 6.659 ± 0.094
AlaLys: 4.498 ± 0.078
AlaLeu: 6.964 ± 0.105
AlaMet: 2.043 ± 0.053
AlaAsn: 2.617 ± 0.056
AlaPro: 1.723 ± 0.047
AlaGln: 1.724 ± 0.043
AlaArg: 2.267 ± 0.054
AlaSer: 3.32 ± 0.058
AlaThr: 3.586 ± 0.074
AlaVal: 5.067 ± 0.095
AlaTrp: 0.385 ± 0.024
AlaTyr: 2.325 ± 0.049
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.446 ± 0.023
CysCys: 0.16 ± 0.014
CysAsp: 0.527 ± 0.027
CysGlu: 0.555 ± 0.028
CysPhe: 0.375 ± 0.019
CysGly: 0.915 ± 0.033
CysHis: 0.241 ± 0.016
CysIle: 0.806 ± 0.033
CysLys: 0.658 ± 0.026
CysLeu: 0.725 ± 0.03
CysMet: 0.24 ± 0.019
CysAsn: 0.525 ± 0.026
CysPro: 0.47 ± 0.024
CysGln: 0.311 ± 0.019
CysArg: 0.427 ± 0.025
CysSer: 0.669 ± 0.03
CysThr: 0.48 ± 0.025
CysVal: 0.506 ± 0.025
CysTrp: 0.057 ± 0.009
CysTyr: 0.325 ± 0.018
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.945 ± 0.064
AspCys: 0.5 ± 0.024
AspAsp: 2.198 ± 0.054
AspGlu: 4.105 ± 0.066
AspPhe: 2.512 ± 0.063
AspGly: 3.175 ± 0.06
AspHis: 0.86 ± 0.032
AspIle: 5.814 ± 0.095
AspLys: 4.154 ± 0.078
AspLeu: 4.966 ± 0.083
AspMet: 1.523 ± 0.04
AspAsn: 2.302 ± 0.053
AspPro: 1.566 ± 0.048
AspGln: 1.088 ± 0.031
AspArg: 2.111 ± 0.048
AspSer: 2.284 ± 0.054
AspThr: 2.618 ± 0.054
AspVal: 3.491 ± 0.062
AspTrp: 0.427 ± 0.024
AspTyr: 2.403 ± 0.054
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.532 ± 0.093
GluCys: 0.461 ± 0.023
GluAsp: 4.609 ± 0.08
GluGlu: 8.536 ± 0.15
GluPhe: 2.551 ± 0.053
GluGly: 5.528 ± 0.088
GluHis: 1.001 ± 0.037
GluIle: 7.633 ± 0.119
GluLys: 7.598 ± 0.11
GluLeu: 6.608 ± 0.095
GluMet: 2.247 ± 0.05
GluAsn: 4.244 ± 0.077
GluPro: 1.577 ± 0.046
GluGln: 1.688 ± 0.047
GluArg: 2.849 ± 0.058
GluSer: 2.783 ± 0.058
GluThr: 3.424 ± 0.071
GluVal: 5.836 ± 0.089
GluTrp: 0.434 ± 0.024
GluTyr: 2.501 ± 0.051
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.393 ± 0.053
PheCys: 0.39 ± 0.021
PheAsp: 2.123 ± 0.048
PheGlu: 2.385 ± 0.051
PhePhe: 1.905 ± 0.058
PheGly: 2.786 ± 0.068
PheHis: 0.888 ± 0.034
PheIle: 4.303 ± 0.089
PheLys: 2.838 ± 0.068
PheLeu: 3.949 ± 0.086
PheMet: 1.144 ± 0.036
PheAsn: 2.059 ± 0.05
PhePro: 1.377 ± 0.037
PheGln: 1.433 ± 0.048
PheArg: 1.307 ± 0.038
PheSer: 2.653 ± 0.061
PheThr: 2.374 ± 0.054
PheVal: 2.671 ± 0.067
PheTrp: 0.285 ± 0.022
PheTyr: 1.595 ± 0.048
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.663 ± 0.099
GlyCys: 0.892 ± 0.035
GlyAsp: 3.487 ± 0.067
GlyGlu: 5.004 ± 0.079
GlyPhe: 3.377 ± 0.069
GlyGly: 5.357 ± 0.096
GlyHis: 1.278 ± 0.043
GlyIle: 7.407 ± 0.102
GlyLys: 5.38 ± 0.083
GlyLeu: 6.667 ± 0.107
GlyMet: 2.182 ± 0.056
GlyAsn: 3.113 ± 0.074
GlyPro: 1.578 ± 0.04
GlyGln: 1.848 ± 0.04
GlyArg: 2.732 ± 0.055
GlySer: 3.58 ± 0.059
GlyThr: 3.771 ± 0.08
GlyVal: 5.178 ± 0.098
GlyTrp: 0.589 ± 0.026
GlyTyr: 3.063 ± 0.057
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.897 ± 0.035
HisCys: 0.209 ± 0.015
HisAsp: 0.828 ± 0.03
HisGlu: 1.133 ± 0.033
HisPhe: 0.726 ± 0.03
HisGly: 1.296 ± 0.047
HisHis: 0.521 ± 0.027
HisIle: 1.742 ± 0.048
HisLys: 1.229 ± 0.038
HisLeu: 1.73 ± 0.04
HisMet: 0.497 ± 0.023
HisAsn: 0.885 ± 0.033
HisPro: 0.898 ± 0.029
HisGln: 0.726 ± 0.027
HisArg: 0.88 ± 0.039
HisSer: 1.02 ± 0.038
HisThr: 0.836 ± 0.035
HisVal: 1.02 ± 0.036
HisTrp: 0.161 ± 0.015
HisTyr: 0.76 ± 0.028
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.721 ± 0.106
IleCys: 0.982 ± 0.035
IleAsp: 5.475 ± 0.09
IleGlu: 7.55 ± 0.105
IlePhe: 3.86 ± 0.084
IleGly: 6.878 ± 0.11
IleHis: 1.855 ± 0.052
IleIle: 8.907 ± 0.139
IleLys: 7.26 ± 0.099
IleLeu: 9.202 ± 0.129
IleMet: 2.403 ± 0.05
IleAsn: 4.803 ± 0.077
IlePro: 3.761 ± 0.077
IleGln: 2.994 ± 0.052
IleArg: 3.267 ± 0.07
IleSer: 5.364 ± 0.092
IleThr: 5.365 ± 0.084
IleVal: 6.384 ± 0.091
IleTrp: 0.56 ± 0.03
IleTyr: 3.332 ± 0.06
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.911 ± 0.081
LysCys: 0.499 ± 0.027
LysAsp: 4.712 ± 0.077
LysGlu: 7.922 ± 0.114
LysPhe: 2.166 ± 0.054
LysGly: 5.252 ± 0.072
LysHis: 1.251 ± 0.037
LysIle: 7.246 ± 0.109
LysLys: 7.039 ± 0.099
LysLeu: 6.486 ± 0.086
LysMet: 2.166 ± 0.051
LysAsn: 4.61 ± 0.084
LysPro: 2.135 ± 0.051
LysGln: 2.366 ± 0.057
LysArg: 3.01 ± 0.07
LysSer: 3.816 ± 0.068
LysThr: 4.026 ± 0.071
LysVal: 5.215 ± 0.084
LysTrp: 0.532 ± 0.028
LysTyr: 3.058 ± 0.064
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.14 ± 0.107
LeuCys: 0.862 ± 0.029
LeuAsp: 4.794 ± 0.066
LeuGlu: 7.377 ± 0.112
LeuPhe: 3.691 ± 0.076
LeuGly: 7.064 ± 0.112
LeuHis: 1.579 ± 0.047
LeuIle: 7.8 ± 0.113
LeuLys: 7.871 ± 0.109
LeuLeu: 9.468 ± 0.115
LeuMet: 2.74 ± 0.055
LeuAsn: 4.579 ± 0.07
LeuPro: 3.281 ± 0.063
LeuGln: 3.542 ± 0.068
LeuArg: 3.678 ± 0.064
LeuSer: 5.744 ± 0.087
LeuThr: 5.011 ± 0.081
LeuVal: 5.808 ± 0.086
LeuTrp: 0.621 ± 0.024
LeuTyr: 3.123 ± 0.063
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.111 ± 0.054
MetCys: 0.177 ± 0.013
MetAsp: 1.631 ± 0.041
MetGlu: 2.446 ± 0.052
MetPhe: 0.926 ± 0.031
MetGly: 2.279 ± 0.057
MetHis: 0.305 ± 0.021
MetIle: 2.531 ± 0.061
MetLys: 2.65 ± 0.048
MetLeu: 2.417 ± 0.052
MetMet: 1.011 ± 0.039
MetAsn: 1.349 ± 0.034
MetPro: 0.932 ± 0.037
MetGln: 0.684 ± 0.025
MetArg: 0.937 ± 0.03
MetSer: 1.358 ± 0.038
MetThr: 1.543 ± 0.047
MetVal: 2.076 ± 0.052
MetTrp: 0.157 ± 0.013
MetTyr: 0.733 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.515 ± 0.061
AsnCys: 0.557 ± 0.025
AsnAsp: 2.031 ± 0.053
AsnGlu: 3.013 ± 0.066
AsnPhe: 1.953 ± 0.052
AsnGly: 2.701 ± 0.066
AsnHis: 1.102 ± 0.037
AsnIle: 5.474 ± 0.086
AsnLys: 4.182 ± 0.07
AsnLeu: 4.845 ± 0.075
AsnMet: 1.283 ± 0.034
AsnAsn: 2.971 ± 0.075
AsnPro: 2.398 ± 0.055
AsnGln: 1.889 ± 0.049
AsnArg: 2.161 ± 0.051
AsnSer: 2.473 ± 0.055
AsnThr: 2.681 ± 0.056
AsnVal: 2.837 ± 0.06
AsnTrp: 0.392 ± 0.023
AsnTyr: 2.119 ± 0.054
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.839 ± 0.051
ProCys: 0.326 ± 0.02
ProAsp: 1.504 ± 0.043
ProGlu: 2.45 ± 0.059
ProPhe: 1.431 ± 0.035
ProGly: 2.131 ± 0.051
ProHis: 0.742 ± 0.03
ProIle: 3.203 ± 0.059
ProLys: 2.228 ± 0.057
ProLeu: 3.07 ± 0.055
ProMet: 0.859 ± 0.035
ProAsn: 1.569 ± 0.047
ProPro: 0.988 ± 0.039
ProGln: 1.371 ± 0.041
ProArg: 1.124 ± 0.035
ProSer: 1.891 ± 0.044
ProThr: 1.783 ± 0.043
ProVal: 2.377 ± 0.053
ProTrp: 0.286 ± 0.018
ProTyr: 1.383 ± 0.039
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.85 ± 0.046
GlnCys: 0.296 ± 0.021
GlnAsp: 1.516 ± 0.04
GlnGlu: 2.938 ± 0.065
GlnPhe: 1.059 ± 0.037
GlnGly: 2.052 ± 0.049
GlnHis: 0.664 ± 0.027
GlnIle: 2.303 ± 0.054
GlnLys: 2.284 ± 0.053
GlnLeu: 3.162 ± 0.061
GlnMet: 0.885 ± 0.033
GlnAsn: 1.467 ± 0.043
GlnPro: 0.919 ± 0.037
GlnGln: 1.679 ± 0.057
GlnArg: 1.433 ± 0.041
GlnSer: 1.541 ± 0.044
GlnThr: 1.285 ± 0.04
GlnVal: 1.963 ± 0.048
GlnTrp: 0.365 ± 0.021
GlnTyr: 1.333 ± 0.04
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.225 ± 0.056
ArgCys: 0.326 ± 0.021
ArgAsp: 2.051 ± 0.051
ArgGlu: 3.343 ± 0.073
ArgPhe: 1.576 ± 0.041
ArgGly: 2.808 ± 0.065
ArgHis: 0.653 ± 0.025
ArgIle: 3.566 ± 0.06
ArgLys: 2.989 ± 0.058
ArgLeu: 3.492 ± 0.056
ArgMet: 1.106 ± 0.033
ArgAsn: 2.045 ± 0.054
ArgPro: 1.096 ± 0.037
ArgGln: 1.385 ± 0.039
ArgArg: 1.868 ± 0.054
ArgSer: 1.683 ± 0.044
ArgThr: 1.717 ± 0.045
ArgVal: 2.702 ± 0.063
ArgTrp: 0.277 ± 0.019
ArgTyr: 1.461 ± 0.046
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.812 ± 0.064
SerCys: 0.558 ± 0.029
SerAsp: 2.201 ± 0.052
SerGlu: 2.903 ± 0.064
SerPhe: 2.653 ± 0.058
SerGly: 3.652 ± 0.06
SerHis: 1.082 ± 0.034
SerIle: 5.697 ± 0.08
SerLys: 3.829 ± 0.064
SerLeu: 5.372 ± 0.09
SerMet: 1.572 ± 0.042
SerAsn: 2.659 ± 0.062
SerPro: 1.854 ± 0.046
SerGln: 1.785 ± 0.048
SerArg: 2.125 ± 0.051
SerSer: 3.297 ± 0.072
SerThr: 2.891 ± 0.054
SerVal: 3.16 ± 0.065
SerTrp: 0.411 ± 0.022
SerTyr: 2.119 ± 0.047
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.595 ± 0.079
ThrCys: 0.441 ± 0.023
ThrAsp: 2.322 ± 0.06
ThrGlu: 3.352 ± 0.057
ThrPhe: 2.312 ± 0.059
ThrGly: 4.161 ± 0.081
ThrHis: 0.924 ± 0.035
ThrIle: 5.367 ± 0.069
ThrLys: 3.608 ± 0.07
ThrLeu: 5.181 ± 0.071
ThrMet: 1.36 ± 0.035
ThrAsn: 2.404 ± 0.056
ThrPro: 2.18 ± 0.051
ThrGln: 1.437 ± 0.04
ThrArg: 1.871 ± 0.054
ThrSer: 2.948 ± 0.058
ThrThr: 3.209 ± 0.063
ThrVal: 3.741 ± 0.064
ThrTrp: 0.352 ± 0.022
ThrTyr: 1.907 ± 0.042
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.987 ± 0.1
ValCys: 0.606 ± 0.027
ValAsp: 3.909 ± 0.065
ValGlu: 5.539 ± 0.083
ValPhe: 2.979 ± 0.064
ValGly: 4.877 ± 0.092
ValHis: 1.032 ± 0.034
ValIle: 6.305 ± 0.107
ValLys: 5.036 ± 0.08
ValLeu: 6.392 ± 0.087
ValMet: 1.891 ± 0.047
ValAsn: 2.978 ± 0.058
ValPro: 2.256 ± 0.054
ValGln: 1.569 ± 0.044
ValArg: 2.224 ± 0.056
ValSer: 3.602 ± 0.063
ValThr: 3.657 ± 0.072
ValVal: 5.301 ± 0.082
ValTrp: 0.436 ± 0.024
ValTyr: 2.307 ± 0.056
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.384 ± 0.023
TrpCys: 0.068 ± 0.009
TrpAsp: 0.389 ± 0.021
TrpGlu: 0.468 ± 0.023
TrpPhe: 0.332 ± 0.022
TrpGly: 0.635 ± 0.031
TrpHis: 0.14 ± 0.013
TrpIle: 0.658 ± 0.032
TrpLys: 0.444 ± 0.021
TrpLeu: 0.63 ± 0.03
TrpMet: 0.213 ± 0.015
TrpAsn: 0.383 ± 0.024
TrpPro: 0.174 ± 0.014
TrpGln: 0.276 ± 0.018
TrpArg: 0.325 ± 0.023
TrpSer: 0.401 ± 0.022
TrpThr: 0.366 ± 0.023
TrpVal: 0.451 ± 0.025
TrpTrp: 0.086 ± 0.011
TrpTyr: 0.276 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.907 ± 0.047
TyrCys: 0.45 ± 0.023
TyrAsp: 2.073 ± 0.052
TyrGlu: 2.641 ± 0.059
TyrPhe: 1.765 ± 0.045
TyrGly: 2.956 ± 0.056
TyrHis: 0.836 ± 0.032
TyrIle: 3.5 ± 0.062
TyrLys: 2.648 ± 0.057
TyrLeu: 3.447 ± 0.067
TyrMet: 0.886 ± 0.032
TyrAsn: 2.015 ± 0.05
TyrPro: 1.393 ± 0.04
TyrGln: 1.198 ± 0.032
TyrArg: 1.771 ± 0.047
TyrSer: 2.206 ± 0.048
TyrThr: 2.048 ± 0.059
TyrVal: 2.101 ± 0.053
TyrTrp: 0.283 ± 0.018
TyrTyr: 1.771 ± 0.046
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2920 proteins (887233 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
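A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the code used by the database: it assumes that whole proteins are resampled with replacement, that the statistic is recomputed for each replicate, and that the reported error is the standard deviation across replicates (the source does not state which spread measure is used). The function names and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real proteome data
toy_proteins = ["MKKLAA", "AAKK", "MLLKE", "GAAG", "KEEK"]
print(pair_permille(toy_proteins, "AA"), "±", bootstrap_error(toy_proteins, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the correlation between neighbouring residues within a protein intact in every replicate.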

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski