Amino acid dipepetide frequency for Rhizobium oryziradicis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.756AlaAla: 13.756 ± 0.149
1.041AlaCys: 1.041 ± 0.026
6.561AlaAsp: 6.561 ± 0.076
6.398AlaGlu: 6.398 ± 0.081
4.57AlaPhe: 4.57 ± 0.065
8.984AlaGly: 8.984 ± 0.091
2.22AlaHis: 2.22 ± 0.036
6.73AlaIle: 6.73 ± 0.079
5.031AlaLys: 5.031 ± 0.073
12.606AlaLeu: 12.606 ± 0.127
3.549AlaMet: 3.549 ± 0.053
3.426AlaAsn: 3.426 ± 0.049
4.574AlaPro: 4.574 ± 0.063
3.945AlaGln: 3.945 ± 0.061
6.654AlaArg: 6.654 ± 0.074
6.845AlaSer: 6.845 ± 0.079
5.74AlaThr: 5.74 ± 0.067
7.91AlaVal: 7.91 ± 0.08
1.2AlaTrp: 1.2 ± 0.028
2.587AlaTyr: 2.587 ± 0.039
0.0AlaXaa: 0.0 ± 0.0
Cys
0.884CysAla: 0.884 ± 0.025
0.121CysCys: 0.121 ± 0.008
0.558CysAsp: 0.558 ± 0.02
0.448CysGlu: 0.448 ± 0.014
0.358CysPhe: 0.358 ± 0.016
0.917CysGly: 0.917 ± 0.025
0.256CysHis: 0.256 ± 0.014
0.43CysIle: 0.43 ± 0.017
0.234CysLys: 0.234 ± 0.014
0.857CysLeu: 0.857 ± 0.026
0.168CysMet: 0.168 ± 0.011
0.228CysAsn: 0.228 ± 0.014
0.399CysPro: 0.399 ± 0.018
0.287CysGln: 0.287 ± 0.016
0.542CysArg: 0.542 ± 0.022
0.47CysSer: 0.47 ± 0.019
0.392CysThr: 0.392 ± 0.018
0.618CysVal: 0.618 ± 0.022
0.123CysTrp: 0.123 ± 0.01
0.218CysTyr: 0.218 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
6.592AspAla: 6.592 ± 0.079
0.55AspCys: 0.55 ± 0.019
3.251AspAsp: 3.251 ± 0.059
3.34AspGlu: 3.34 ± 0.059
2.374AspPhe: 2.374 ± 0.043
4.98AspGly: 4.98 ± 0.068
1.346AspHis: 1.346 ± 0.037
3.482AspIle: 3.482 ± 0.052
2.147AspLys: 2.147 ± 0.045
5.862AspLeu: 5.862 ± 0.065
1.533AspMet: 1.533 ± 0.034
1.622AspAsn: 1.622 ± 0.036
2.984AspPro: 2.984 ± 0.042
2.029AspGln: 2.029 ± 0.04
3.581AspArg: 3.581 ± 0.059
2.355AspSer: 2.355 ± 0.038
2.667AspThr: 2.667 ± 0.045
4.301AspVal: 4.301 ± 0.057
0.937AspTrp: 0.937 ± 0.027
1.607AspTyr: 1.607 ± 0.037
0.001AspXaa: 0.001 ± 0.001
Glu
6.758GluAla: 6.758 ± 0.078
0.339GluCys: 0.339 ± 0.015
2.711GluAsp: 2.711 ± 0.045
2.972GluGlu: 2.972 ± 0.053
1.833GluPhe: 1.833 ± 0.037
3.754GluGly: 3.754 ± 0.052
1.244GluHis: 1.244 ± 0.028
3.663GluIle: 3.663 ± 0.058
2.879GluLys: 2.879 ± 0.049
4.85GluLeu: 4.85 ± 0.06
1.551GluMet: 1.551 ± 0.029
1.864GluAsn: 1.864 ± 0.036
2.414GluPro: 2.414 ± 0.043
2.294GluGln: 2.294 ± 0.046
4.167GluArg: 4.167 ± 0.059
2.337GluSer: 2.337 ± 0.043
3.417GluThr: 3.417 ± 0.05
3.53GluVal: 3.53 ± 0.052
0.643GluTrp: 0.643 ± 0.02
0.939GluTyr: 0.939 ± 0.027
0.0GluXaa: 0.0 ± 0.0
Phe
4.538PheAla: 4.538 ± 0.059
0.43PheCys: 0.43 ± 0.018
2.819PheAsp: 2.819 ± 0.041
2.205PheGlu: 2.205 ± 0.038
1.632PhePhe: 1.632 ± 0.036
3.766PheGly: 3.766 ± 0.055
0.826PheHis: 0.826 ± 0.022
2.1PheIle: 2.1 ± 0.034
1.422PheLys: 1.422 ± 0.035
3.601PheLeu: 3.601 ± 0.06
1.015PheMet: 1.015 ± 0.029
1.328PheAsn: 1.328 ± 0.034
1.613PhePro: 1.613 ± 0.035
1.218PheGln: 1.218 ± 0.029
2.064PheArg: 2.064 ± 0.037
2.687PheSer: 2.687 ± 0.038
2.12PheThr: 2.12 ± 0.037
2.883PheVal: 2.883 ± 0.048
0.568PheTrp: 0.568 ± 0.021
1.034PheTyr: 1.034 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
7.951GlyAla: 7.951 ± 0.08
0.808GlyCys: 0.808 ± 0.022
4.118GlyAsp: 4.118 ± 0.056
4.233GlyGlu: 4.233 ± 0.064
3.774GlyPhe: 3.774 ± 0.057
6.411GlyGly: 6.411 ± 0.09
1.823GlyHis: 1.823 ± 0.032
4.856GlyIle: 4.856 ± 0.068
3.938GlyLys: 3.938 ± 0.053
8.572GlyLeu: 8.572 ± 0.083
2.324GlyMet: 2.324 ± 0.043
2.355GlyAsn: 2.355 ± 0.044
3.014GlyPro: 3.014 ± 0.05
2.958GlyGln: 2.958 ± 0.045
4.891GlyArg: 4.891 ± 0.065
4.552GlySer: 4.552 ± 0.066
4.369GlyThr: 4.369 ± 0.077
5.773GlyVal: 5.773 ± 0.076
1.199GlyTrp: 1.199 ± 0.033
2.312GlyTyr: 2.312 ± 0.039
0.0GlyXaa: 0.0 ± 0.0
His
2.212HisAla: 2.212 ± 0.038
0.222HisCys: 0.222 ± 0.012
1.295HisAsp: 1.295 ± 0.032
1.015HisGlu: 1.015 ± 0.026
0.985HisPhe: 0.985 ± 0.026
1.885HisGly: 1.885 ± 0.04
0.658HisHis: 0.658 ± 0.024
1.168HisIle: 1.168 ± 0.029
0.627HisLys: 0.627 ± 0.019
2.198HisLeu: 2.198 ± 0.044
0.608HisMet: 0.608 ± 0.021
0.6HisAsn: 0.6 ± 0.02
1.258HisPro: 1.258 ± 0.03
0.745HisGln: 0.745 ± 0.021
1.312HisArg: 1.312 ± 0.031
1.043HisSer: 1.043 ± 0.026
0.894HisThr: 0.894 ± 0.024
1.51HisVal: 1.51 ± 0.031
0.305HisTrp: 0.305 ± 0.015
0.651HisTyr: 0.651 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
7.847IleAla: 7.847 ± 0.103
0.594IleCys: 0.594 ± 0.021
3.847IleAsp: 3.847 ± 0.048
3.589IleGlu: 3.589 ± 0.054
2.158IlePhe: 2.158 ± 0.041
5.18IleGly: 5.18 ± 0.07
1.052IleHis: 1.052 ± 0.027
3.039IleIle: 3.039 ± 0.047
2.18IleLys: 2.18 ± 0.042
4.984IleLeu: 4.984 ± 0.06
1.225IleMet: 1.225 ± 0.032
1.961IleAsn: 1.961 ± 0.036
2.472IlePro: 2.472 ± 0.046
1.371IleGln: 1.371 ± 0.032
3.237IleArg: 3.237 ± 0.051
3.852IleSer: 3.852 ± 0.051
3.246IleThr: 3.246 ± 0.044
4.383IleVal: 4.383 ± 0.061
0.659IleTrp: 0.659 ± 0.022
1.302IleTyr: 1.302 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
5.367LysAla: 5.367 ± 0.063
0.198LysCys: 0.198 ± 0.013
2.328LysAsp: 2.328 ± 0.043
1.959LysGlu: 1.959 ± 0.037
1.131LysPhe: 1.131 ± 0.027
3.187LysGly: 3.187 ± 0.05
0.729LysHis: 0.729 ± 0.021
2.512LysIle: 2.512 ± 0.042
1.695LysLys: 1.695 ± 0.043
3.979LysLeu: 3.979 ± 0.051
1.098LysMet: 1.098 ± 0.027
1.288LysAsn: 1.288 ± 0.033
2.542LysPro: 2.542 ± 0.048
1.42LysGln: 1.42 ± 0.033
2.667LysArg: 2.667 ± 0.042
2.511LysSer: 2.511 ± 0.045
2.695LysThr: 2.695 ± 0.038
2.872LysVal: 2.872 ± 0.049
0.412LysTrp: 0.412 ± 0.017
0.697LysTyr: 0.697 ± 0.024
0.0LysXaa: 0.0 ± 0.0
Leu
12.002LeuAla: 12.002 ± 0.101
0.927LeuCys: 0.927 ± 0.026
5.869LeuAsp: 5.869 ± 0.068
5.191LeuGlu: 5.191 ± 0.062
3.817LeuPhe: 3.817 ± 0.061
7.685LeuGly: 7.685 ± 0.085
1.965LeuHis: 1.965 ± 0.039
5.477LeuIle: 5.477 ± 0.068
4.577LeuLys: 4.577 ± 0.059
9.385LeuLeu: 9.385 ± 0.102
2.693LeuMet: 2.693 ± 0.046
3.178LeuAsn: 3.178 ± 0.046
5.241LeuPro: 5.241 ± 0.065
3.162LeuGln: 3.162 ± 0.05
5.82LeuArg: 5.82 ± 0.076
7.546LeuSer: 7.546 ± 0.109
5.774LeuThr: 5.774 ± 0.078
7.067LeuVal: 7.067 ± 0.074
1.118LeuTrp: 1.118 ± 0.028
2.159LeuTyr: 2.159 ± 0.035
0.0LeuXaa: 0.0 ± 0.0
Met
3.356MetAla: 3.356 ± 0.053
0.155MetCys: 0.155 ± 0.01
1.344MetAsp: 1.344 ± 0.029
1.304MetGlu: 1.304 ± 0.028
0.791MetPhe: 0.791 ± 0.02
1.939MetGly: 1.939 ± 0.039
0.448MetHis: 0.448 ± 0.018
1.631MetIle: 1.631 ± 0.033
1.186MetLys: 1.186 ± 0.027
2.636MetLeu: 2.636 ± 0.046
0.847MetMet: 0.847 ± 0.025
0.909MetAsn: 0.909 ± 0.027
1.555MetPro: 1.555 ± 0.033
0.994MetGln: 0.994 ± 0.027
1.749MetArg: 1.749 ± 0.035
1.851MetSer: 1.851 ± 0.04
2.034MetThr: 2.034 ± 0.035
2.071MetVal: 2.071 ± 0.041
0.206MetTrp: 0.206 ± 0.013
0.331MetTyr: 0.331 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.724AsnAla: 3.724 ± 0.054
0.244AsnCys: 0.244 ± 0.013
1.707AsnAsp: 1.707 ± 0.035
1.408AsnGlu: 1.408 ± 0.032
1.22AsnPhe: 1.22 ± 0.029
2.884AsnGly: 2.884 ± 0.046
0.707AsnHis: 0.707 ± 0.023
1.795AsnIle: 1.795 ± 0.035
1.02AsnLys: 1.02 ± 0.024
2.969AsnLeu: 2.969 ± 0.046
0.759AsnMet: 0.759 ± 0.024
0.959AsnAsn: 0.959 ± 0.03
1.98AsnPro: 1.98 ± 0.038
0.981AsnGln: 0.981 ± 0.028
1.959AsnArg: 1.959 ± 0.041
1.723AsnSer: 1.723 ± 0.032
1.658AsnThr: 1.658 ± 0.04
2.191AsnVal: 2.191 ± 0.037
0.509AsnTrp: 0.509 ± 0.019
0.771AsnTyr: 0.771 ± 0.024
0.0AsnXaa: 0.0 ± 0.0
Pro
5.092ProAla: 5.092 ± 0.065
0.293ProCys: 0.293 ± 0.015
3.353ProAsp: 3.353 ± 0.049
3.206ProGlu: 3.206 ± 0.046
2.008ProPhe: 2.008 ± 0.033
3.593ProGly: 3.593 ± 0.059
1.11ProHis: 1.11 ± 0.03
2.434ProIle: 2.434 ± 0.038
2.023ProLys: 2.023 ± 0.039
4.655ProLeu: 4.655 ± 0.059
1.239ProMet: 1.239 ± 0.029
1.453ProAsn: 1.453 ± 0.031
1.996ProPro: 1.996 ± 0.043
1.79ProGln: 1.79 ± 0.034
2.231ProArg: 2.231 ± 0.039
2.862ProSer: 2.862 ± 0.041
2.396ProThr: 2.396 ± 0.036
3.946ProVal: 3.946 ± 0.055
0.621ProTrp: 0.621 ± 0.02
1.212ProTyr: 1.212 ± 0.028
0.0ProXaa: 0.0 ± 0.0
Gln
4.184GlnAla: 4.184 ± 0.063
0.203GlnCys: 0.203 ± 0.011
1.692GlnAsp: 1.692 ± 0.037
1.693GlnGlu: 1.693 ± 0.036
1.243GlnPhe: 1.243 ± 0.03
2.423GlnGly: 2.423 ± 0.041
0.727GlnHis: 0.727 ± 0.025
2.197GlnIle: 2.197 ± 0.072
1.496GlnLys: 1.496 ± 0.035
3.147GlnLeu: 3.147 ± 0.048
1.062GlnMet: 1.062 ± 0.03
1.156GlnAsn: 1.156 ± 0.028
1.795GlnPro: 1.795 ± 0.03
1.526GlnGln: 1.526 ± 0.041
2.312GlnArg: 2.312 ± 0.047
2.267GlnSer: 2.267 ± 0.04
2.024GlnThr: 2.024 ± 0.033
2.286GlnVal: 2.286 ± 0.05
0.433GlnTrp: 0.433 ± 0.017
0.658GlnTyr: 0.658 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
6.096ArgAla: 6.096 ± 0.066
0.43ArgCys: 0.43 ± 0.018
3.589ArgAsp: 3.589 ± 0.058
3.343ArgGlu: 3.343 ± 0.053
2.755ArgPhe: 2.755 ± 0.046
3.937ArgGly: 3.937 ± 0.057
1.547ArgHis: 1.547 ± 0.029
3.726ArgIle: 3.726 ± 0.059
2.493ArgLys: 2.493 ± 0.046
6.838ArgLeu: 6.838 ± 0.074
1.741ArgMet: 1.741 ± 0.04
1.988ArgAsn: 1.988 ± 0.038
2.718ArgPro: 2.718 ± 0.044
2.519ArgGln: 2.519 ± 0.044
4.09ArgArg: 4.09 ± 0.068
3.388ArgSer: 3.388 ± 0.051
2.839ArgThr: 2.839 ± 0.049
3.894ArgVal: 3.894 ± 0.054
0.817ArgTrp: 0.817 ± 0.026
1.624ArgTyr: 1.624 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
6.463SerAla: 6.463 ± 0.074
0.478SerCys: 0.478 ± 0.02
3.383SerAsp: 3.383 ± 0.054
3.162SerGlu: 3.162 ± 0.054
2.64SerPhe: 2.64 ± 0.047
5.725SerGly: 5.725 ± 0.063
1.285SerHis: 1.285 ± 0.032
3.51SerIle: 3.51 ± 0.056
2.272SerLys: 2.272 ± 0.04
6.241SerLeu: 6.241 ± 0.071
1.571SerMet: 1.571 ± 0.033
1.83SerAsn: 1.83 ± 0.035
2.705SerPro: 2.705 ± 0.045
1.923SerGln: 1.923 ± 0.04
3.488SerArg: 3.488 ± 0.056
3.761SerSer: 3.761 ± 0.062
3.171SerThr: 3.171 ± 0.073
4.521SerVal: 4.521 ± 0.059
0.753SerTrp: 0.753 ± 0.023
1.417SerTyr: 1.417 ± 0.034
0.0SerXaa: 0.0 ± 0.0
Thr
5.977ThrAla: 5.977 ± 0.075
0.43ThrCys: 0.43 ± 0.018
2.93ThrAsp: 2.93 ± 0.049
2.655ThrGlu: 2.655 ± 0.047
2.09ThrPhe: 2.09 ± 0.037
4.748ThrGly: 4.748 ± 0.061
1.083ThrHis: 1.083 ± 0.026
3.38ThrIle: 3.38 ± 0.044
1.912ThrLys: 1.912 ± 0.042
6.045ThrLeu: 6.045 ± 0.07
1.422ThrMet: 1.422 ± 0.031
1.623ThrAsn: 1.623 ± 0.033
3.144ThrPro: 3.144 ± 0.043
1.673ThrGln: 1.673 ± 0.035
2.91ThrArg: 2.91 ± 0.043
3.291ThrSer: 3.291 ± 0.065
3.054ThrThr: 3.054 ± 0.055
4.473ThrVal: 4.473 ± 0.057
0.642ThrTrp: 0.642 ± 0.021
1.243ThrTyr: 1.243 ± 0.03
0.0ThrXaa: 0.0 ± 0.0
Val
8.16ValAla: 8.16 ± 0.08
0.657ValCys: 0.657 ± 0.02
4.118ValAsp: 4.118 ± 0.061
4.36ValGlu: 4.36 ± 0.054
2.928ValPhe: 2.928 ± 0.05
5.315ValGly: 5.315 ± 0.073
1.316ValHis: 1.316 ± 0.029
4.275ValIle: 4.275 ± 0.059
2.797ValLys: 2.797 ± 0.049
7.306ValLeu: 7.306 ± 0.08
2.058ValMet: 2.058 ± 0.036
2.17ValAsn: 2.17 ± 0.037
3.301ValPro: 3.301 ± 0.045
2.15ValGln: 2.15 ± 0.037
4.197ValArg: 4.197 ± 0.054
4.839ValSer: 4.839 ± 0.052
4.351ValThr: 4.351 ± 0.06
5.725ValVal: 5.725 ± 0.082
0.839ValTrp: 0.839 ± 0.028
1.469ValTyr: 1.469 ± 0.031
0.0ValXaa: 0.0 ± 0.0
Trp
1.071TrpAla: 1.071 ± 0.028
0.133TrpCys: 0.133 ± 0.009
0.594TrpAsp: 0.594 ± 0.022
0.504TrpGlu: 0.504 ± 0.019
0.554TrpPhe: 0.554 ± 0.02
0.788TrpGly: 0.788 ± 0.025
0.328TrpHis: 0.328 ± 0.016
0.646TrpIle: 0.646 ± 0.022
0.516TrpLys: 0.516 ± 0.017
1.571TrpLeu: 1.571 ± 0.033
0.367TrpMet: 0.367 ± 0.014
0.508TrpAsn: 0.508 ± 0.018
0.649TrpPro: 0.649 ± 0.024
0.661TrpGln: 0.661 ± 0.021
0.915TrpArg: 0.915 ± 0.025
0.787TrpSer: 0.787 ± 0.023
0.687TrpThr: 0.687 ± 0.021
0.773TrpVal: 0.773 ± 0.022
0.201TrpTrp: 0.201 ± 0.012
0.304TrpTyr: 0.304 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.294TyrAla: 2.294 ± 0.042
0.257TyrCys: 0.257 ± 0.015
1.475TyrAsp: 1.475 ± 0.038
1.174TyrGlu: 1.174 ± 0.028
1.009TyrPhe: 1.009 ± 0.027
2.13TyrGly: 2.13 ± 0.046
0.522TyrHis: 0.522 ± 0.018
1.127TyrIle: 1.127 ± 0.031
0.843TyrLys: 0.843 ± 0.026
2.351TyrLeu: 2.351 ± 0.045
0.507TyrMet: 0.507 ± 0.019
0.72TyrAsn: 0.72 ± 0.027
1.16TyrPro: 1.16 ± 0.03
0.878TyrGln: 0.878 ± 0.025
1.586TyrArg: 1.586 ± 0.034
1.332TyrSer: 1.332 ± 0.029
1.195TyrThr: 1.195 ± 0.028
1.603TyrVal: 1.603 ± 0.032
0.372TyrTrp: 0.372 ± 0.018
0.688TyrTyr: 0.688 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4576 proteins (1498857 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski