Amino acid dipeptide frequency for Rhizobium etli CNPAF512

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
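The comparison against the uniform expectation can be illustrated with a minimal Python sketch. It assumes that dipeptides are counted as overlapping residue pairs within each protein and never across protein boundaries; the page does not state this, and the function names and toy sequences are hypothetical, not taken from the database.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: overlapping pairs inside one protein, none spanning two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())  # pairs with non-standard letters also count here
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freqs, expected=1000.0 / 400):
    """Label each dipeptide relative to the uniform expectation (2.5 per mille)."""
    return {dp: "over" if f > expected else "under" if f < expected else "equal"
            for dp, f in freqs.items()}


toy_proteins = ["MAALAAG", "MKLAA"]  # invented sequences, for illustration only
freqs = dipeptide_per_mille(toy_proteins)
labels = classify(freqs)
print(len(freqs), labels["AA"], labels["WW"])
```

In this sketch "over" corresponds to the cells marked in red and "under" to those marked in blue.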

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
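As a small worked example of the unit conversion, the following Python sketch converts a per mille value to a fraction and to a percentage. The helper names are hypothetical; the AlaAla value is the one listed in the table.

```python
import math


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 16.544             # AlaAla frequency from the table, in per mille
uniform = 1000.0 / 400       # uniform expectation for 400 dipeptides, in per mille

print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(uniform))
# The uniform expectation of 2.5 per mille is the 0.25% quoted in the text.
assert math.isclose(per_mille_to_percent(uniform), 0.25)
```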

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
16.544AlaAla: 16.544 ± 0.145
1.18AlaCys: 1.18 ± 0.03
6.826AlaAsp: 6.826 ± 0.078
7.5AlaGlu: 7.5 ± 0.072
4.52AlaPhe: 4.52 ± 0.05
10.51AlaGly: 10.51 ± 0.096
2.219AlaHis: 2.219 ± 0.04
6.826AlaIle: 6.826 ± 0.074
4.139AlaLys: 4.139 ± 0.061
11.92AlaLeu: 11.92 ± 0.089
3.309AlaMet: 3.309 ± 0.049
2.932AlaAsn: 2.932 ± 0.044
4.764AlaPro: 4.764 ± 0.063
3.381AlaGln: 3.381 ± 0.046
7.885AlaArg: 7.885 ± 0.072
6.599AlaSer: 6.599 ± 0.067
5.299AlaThr: 5.299 ± 0.056
8.552AlaVal: 8.552 ± 0.07
1.321AlaTrp: 1.321 ± 0.028
2.435AlaTyr: 2.435 ± 0.039
0.0AlaXaa: 0.0 ± 0.0
Cys
1.036CysAla: 1.036 ± 0.024
0.19CysCys: 0.19 ± 0.012
0.556CysAsp: 0.556 ± 0.015
0.474CysGlu: 0.474 ± 0.017
0.406CysPhe: 0.406 ± 0.017
1.034CysGly: 1.034 ± 0.026
0.332CysHis: 0.332 ± 0.015
0.44CysIle: 0.44 ± 0.016
0.233CysLys: 0.233 ± 0.011
0.983CysLeu: 0.983 ± 0.023
0.205CysMet: 0.205 ± 0.011
0.259CysAsn: 0.259 ± 0.012
0.499CysPro: 0.499 ± 0.02
0.314CysGln: 0.314 ± 0.013
0.976CysArg: 0.976 ± 0.033
0.628CysSer: 0.628 ± 0.017
0.405CysThr: 0.405 ± 0.016
0.605CysVal: 0.605 ± 0.016
0.131CysTrp: 0.131 ± 0.008
0.204CysTyr: 0.204 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
6.416AspAla: 6.416 ± 0.068
0.535AspCys: 0.535 ± 0.019
3.358AspAsp: 3.358 ± 0.052
3.595AspGlu: 3.595 ± 0.046
2.391AspPhe: 2.391 ± 0.041
5.168AspGly: 5.168 ± 0.06
1.414AspHis: 1.414 ± 0.03
3.499AspIle: 3.499 ± 0.046
1.815AspLys: 1.815 ± 0.037
5.738AspLeu: 5.738 ± 0.065
1.359AspMet: 1.359 ± 0.028
1.422AspAsn: 1.422 ± 0.031
3.124AspPro: 3.124 ± 0.046
1.863AspGln: 1.863 ± 0.038
4.621AspArg: 4.621 ± 0.074
2.158AspSer: 2.158 ± 0.041
2.285AspThr: 2.285 ± 0.034
3.997AspVal: 3.997 ± 0.051
0.873AspTrp: 0.873 ± 0.022
1.44AspTyr: 1.44 ± 0.028
0.0AspXaa: 0.0 ± 0.0
Glu
7.067GluAla: 7.067 ± 0.077
0.409GluCys: 0.409 ± 0.015
2.828GluAsp: 2.828 ± 0.048
3.391GluGlu: 3.391 ± 0.05
1.966GluPhe: 1.966 ± 0.03
4.059GluGly: 4.059 ± 0.053
1.256GluHis: 1.256 ± 0.029
3.954GluIle: 3.954 ± 0.055
2.636GluLys: 2.636 ± 0.039
5.253GluLeu: 5.253 ± 0.058
1.562GluMet: 1.562 ± 0.031
1.835GluAsn: 1.835 ± 0.033
2.563GluPro: 2.563 ± 0.041
2.136GluGln: 2.136 ± 0.039
4.852GluArg: 4.852 ± 0.061
2.438GluSer: 2.438 ± 0.038
3.419GluThr: 3.419 ± 0.041
3.568GluVal: 3.568 ± 0.048
0.649GluTrp: 0.649 ± 0.022
0.996GluTyr: 0.996 ± 0.025
0.0GluXaa: 0.0 ± 0.0
Phe
4.675PheAla: 4.675 ± 0.054
0.45PheCys: 0.45 ± 0.015
2.734PheAsp: 2.734 ± 0.042
2.216PheGlu: 2.216 ± 0.034
1.623PhePhe: 1.623 ± 0.031
3.729PheGly: 3.729 ± 0.053
0.868PheHis: 0.868 ± 0.023
1.884PheIle: 1.884 ± 0.031
1.123PheLys: 1.123 ± 0.022
3.661PheLeu: 3.661 ± 0.051
0.865PheMet: 0.865 ± 0.022
1.14PheAsn: 1.14 ± 0.027
1.726PhePro: 1.726 ± 0.032
1.176PheGln: 1.176 ± 0.029
2.626PheArg: 2.626 ± 0.044
2.576PheSer: 2.576 ± 0.038
1.877PheThr: 1.877 ± 0.033
2.906PheVal: 2.906 ± 0.036
0.555PheTrp: 0.555 ± 0.018
0.985PheTyr: 0.985 ± 0.024
0.0PheXaa: 0.0 ± 0.0
Gly
8.426GlyAla: 8.426 ± 0.085
0.932GlyCys: 0.932 ± 0.024
4.529GlyAsp: 4.529 ± 0.053
4.834GlyGlu: 4.834 ± 0.051
3.669GlyPhe: 3.669 ± 0.052
6.887GlyGly: 6.887 ± 0.081
1.951GlyHis: 1.951 ± 0.041
5.029GlyIle: 5.029 ± 0.063
3.691GlyLys: 3.691 ± 0.051
8.412GlyLeu: 8.412 ± 0.076
2.257GlyMet: 2.257 ± 0.04
2.538GlyAsn: 2.538 ± 0.042
3.142GlyPro: 3.142 ± 0.043
2.748GlyGln: 2.748 ± 0.045
6.305GlyArg: 6.305 ± 0.071
4.835GlySer: 4.835 ± 0.053
4.194GlyThr: 4.194 ± 0.067
5.544GlyVal: 5.544 ± 0.054
1.235GlyTrp: 1.235 ± 0.027
2.187GlyTyr: 2.187 ± 0.036
0.0GlyXaa: 0.0 ± 0.0
His
2.402HisAla: 2.402 ± 0.038
0.311HisCys: 0.311 ± 0.013
1.392HisAsp: 1.392 ± 0.029
1.158HisGlu: 1.158 ± 0.026
1.008HisPhe: 1.008 ± 0.024
2.002HisGly: 2.002 ± 0.036
0.728HisHis: 0.728 ± 0.024
1.052HisIle: 1.052 ± 0.022
0.523HisLys: 0.523 ± 0.015
2.225HisLeu: 2.225 ± 0.038
0.551HisMet: 0.551 ± 0.018
0.49HisAsn: 0.49 ± 0.016
1.362HisPro: 1.362 ± 0.029
0.777HisGln: 0.777 ± 0.023
1.925HisArg: 1.925 ± 0.043
1.057HisSer: 1.057 ± 0.024
0.758HisThr: 0.758 ± 0.021
1.523HisVal: 1.523 ± 0.037
0.305HisTrp: 0.305 ± 0.013
0.541HisTyr: 0.541 ± 0.018
0.0HisXaa: 0.0 ± 0.0
Ile
7.634IleAla: 7.634 ± 0.066
0.613IleCys: 0.613 ± 0.018
3.915IleAsp: 3.915 ± 0.046
3.721IleGlu: 3.721 ± 0.044
2.093IlePhe: 2.093 ± 0.039
5.524IleGly: 5.524 ± 0.054
0.996IleHis: 0.996 ± 0.022
2.748IleIle: 2.748 ± 0.048
1.581IleLys: 1.581 ± 0.03
4.828IleLeu: 4.828 ± 0.056
1.135IleMet: 1.135 ± 0.024
1.499IleAsn: 1.499 ± 0.032
2.496IlePro: 2.496 ± 0.039
1.235IleGln: 1.235 ± 0.029
3.598IleArg: 3.598 ± 0.044
3.642IleSer: 3.642 ± 0.05
2.662IleThr: 2.662 ± 0.04
4.56IleVal: 4.56 ± 0.056
0.646IleTrp: 0.646 ± 0.019
1.194IleTyr: 1.194 ± 0.028
0.0IleXaa: 0.0 ± 0.0
Lys
4.546LysAla: 4.546 ± 0.06
0.207LysCys: 0.207 ± 0.01
1.895LysAsp: 1.895 ± 0.03
1.72LysGlu: 1.72 ± 0.036
0.997LysPhe: 0.997 ± 0.024
2.745LysGly: 2.745 ± 0.041
0.679LysHis: 0.679 ± 0.018
2.076LysIle: 2.076 ± 0.035
1.475LysLys: 1.475 ± 0.037
3.499LysLeu: 3.499 ± 0.051
0.901LysMet: 0.901 ± 0.024
1.03LysAsn: 1.03 ± 0.024
2.222LysPro: 2.222 ± 0.036
1.186LysGln: 1.186 ± 0.025
2.591LysArg: 2.591 ± 0.045
2.265LysSer: 2.265 ± 0.04
2.115LysThr: 2.115 ± 0.039
2.458LysVal: 2.458 ± 0.042
0.376LysTrp: 0.376 ± 0.013
0.664LysTyr: 0.664 ± 0.021
0.0LysXaa: 0.0 ± 0.0
Leu
12.42LeuAla: 12.42 ± 0.094
0.998LeuCys: 0.998 ± 0.025
5.789LeuAsp: 5.789 ± 0.069
5.021LeuGlu: 5.021 ± 0.047
3.803LeuPhe: 3.803 ± 0.05
7.68LeuGly: 7.68 ± 0.076
2.023LeuHis: 2.023 ± 0.036
5.072LeuIle: 5.072 ± 0.06
3.88LeuLys: 3.88 ± 0.052
9.241LeuLeu: 9.241 ± 0.099
2.294LeuMet: 2.294 ± 0.034
2.541LeuAsn: 2.541 ± 0.039
5.378LeuPro: 5.378 ± 0.059
3.146LeuGln: 3.146 ± 0.045
6.725LeuArg: 6.725 ± 0.061
6.782LeuSer: 6.782 ± 0.07
5.256LeuThr: 5.256 ± 0.066
7.055LeuVal: 7.055 ± 0.073
1.018LeuTrp: 1.018 ± 0.027
1.992LeuTyr: 1.992 ± 0.041
0.0LeuXaa: 0.0 ± 0.0
Met
3.025MetAla: 3.025 ± 0.041
0.165MetCys: 0.165 ± 0.01
1.103MetAsp: 1.103 ± 0.026
1.165MetGlu: 1.165 ± 0.028
0.712MetPhe: 0.712 ± 0.023
1.67MetGly: 1.67 ± 0.033
0.473MetHis: 0.473 ± 0.014
1.466MetIle: 1.466 ± 0.026
1.11MetLys: 1.11 ± 0.028
2.505MetLeu: 2.505 ± 0.041
0.668MetMet: 0.668 ± 0.022
0.828MetAsn: 0.828 ± 0.022
1.53MetPro: 1.53 ± 0.029
0.907MetGln: 0.907 ± 0.022
1.939MetArg: 1.939 ± 0.034
1.765MetSer: 1.765 ± 0.029
1.912MetThr: 1.912 ± 0.037
1.674MetVal: 1.674 ± 0.032
0.185MetTrp: 0.185 ± 0.01
0.296MetTyr: 0.296 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
3.197AsnAla: 3.197 ± 0.045
0.266AsnCys: 0.266 ± 0.012
1.564AsnAsp: 1.564 ± 0.035
1.352AsnGlu: 1.352 ± 0.031
1.122AsnPhe: 1.122 ± 0.03
2.564AsnGly: 2.564 ± 0.041
0.602AsnHis: 0.602 ± 0.02
1.548AsnIle: 1.548 ± 0.028
0.788AsnLys: 0.788 ± 0.023
2.738AsnLeu: 2.738 ± 0.04
0.664AsnMet: 0.664 ± 0.02
0.826AsnAsn: 0.826 ± 0.022
1.81AsnPro: 1.81 ± 0.032
0.836AsnGln: 0.836 ± 0.022
2.04AsnArg: 2.04 ± 0.036
1.517AsnSer: 1.517 ± 0.032
1.247AsnThr: 1.247 ± 0.026
2.008AsnVal: 2.008 ± 0.035
0.452AsnTrp: 0.452 ± 0.017
0.664AsnTyr: 0.664 ± 0.02
0.0AsnXaa: 0.0 ± 0.0
Pro
5.897ProAla: 5.897 ± 0.073
0.436ProCys: 0.436 ± 0.016
3.376ProAsp: 3.376 ± 0.045
3.443ProGlu: 3.443 ± 0.051
2.061ProPhe: 2.061 ± 0.036
4.087ProGly: 4.087 ± 0.052
1.132ProHis: 1.132 ± 0.027
2.398ProIle: 2.398 ± 0.038
1.698ProLys: 1.698 ± 0.036
4.653ProLeu: 4.653 ± 0.053
1.145ProMet: 1.145 ± 0.022
1.341ProAsn: 1.341 ± 0.027
2.368ProPro: 2.368 ± 0.045
1.619ProGln: 1.619 ± 0.031
2.954ProArg: 2.954 ± 0.041
2.974ProSer: 2.974 ± 0.037
2.253ProThr: 2.253 ± 0.035
3.985ProVal: 3.985 ± 0.047
0.64ProTrp: 0.64 ± 0.02
1.153ProTyr: 1.153 ± 0.026
0.0ProXaa: 0.0 ± 0.0
Gln
3.959GlnAla: 3.959 ± 0.05
0.284GlnCys: 0.284 ± 0.014
1.461GlnAsp: 1.461 ± 0.028
1.575GlnGlu: 1.575 ± 0.034
1.134GlnPhe: 1.134 ± 0.025
2.208GlnGly: 2.208 ± 0.037
0.837GlnHis: 0.837 ± 0.023
1.877GlnIle: 1.877 ± 0.033
1.218GlnLys: 1.218 ± 0.025
2.909GlnLeu: 2.909 ± 0.043
0.929GlnMet: 0.929 ± 0.022
0.931GlnAsn: 0.931 ± 0.023
1.87GlnPro: 1.87 ± 0.039
1.484GlnGln: 1.484 ± 0.037
2.728GlnArg: 2.728 ± 0.049
2.016GlnSer: 2.016 ± 0.035
1.732GlnThr: 1.732 ± 0.034
2.043GlnVal: 2.043 ± 0.037
0.392GlnTrp: 0.392 ± 0.015
0.614GlnTyr: 0.614 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
7.202ArgAla: 7.202 ± 0.066
0.789ArgCys: 0.789 ± 0.027
3.935ArgAsp: 3.935 ± 0.049
3.936ArgGlu: 3.936 ± 0.051
2.998ArgPhe: 2.998 ± 0.04
4.672ArgGly: 4.672 ± 0.056
2.215ArgHis: 2.215 ± 0.047
4.256ArgIle: 4.256 ± 0.049
2.662ArgLys: 2.662 ± 0.041
8.089ArgLeu: 8.089 ± 0.08
2.016ArgMet: 2.016 ± 0.033
2.196ArgAsn: 2.196 ± 0.034
3.798ArgPro: 3.798 ± 0.059
3.176ArgGln: 3.176 ± 0.043
6.893ArgArg: 6.893 ± 0.097
4.494ArgSer: 4.494 ± 0.062
3.295ArgThr: 3.295 ± 0.04
4.191ArgVal: 4.191 ± 0.051
0.934ArgTrp: 0.934 ± 0.025
1.733ArgTyr: 1.733 ± 0.033
0.0ArgXaa: 0.0 ± 0.0
Ser
6.651SerAla: 6.651 ± 0.069
0.622SerCys: 0.622 ± 0.02
3.167SerAsp: 3.167 ± 0.048
3.052SerGlu: 3.052 ± 0.044
2.554SerPhe: 2.554 ± 0.039
5.97SerGly: 5.97 ± 0.063
1.226SerHis: 1.226 ± 0.031
3.272SerIle: 3.272 ± 0.041
1.879SerLys: 1.879 ± 0.027
5.783SerLeu: 5.783 ± 0.063
1.509SerMet: 1.509 ± 0.027
1.614SerAsn: 1.614 ± 0.031
3.02SerPro: 3.02 ± 0.04
1.73SerGln: 1.73 ± 0.033
4.25SerArg: 4.25 ± 0.051
3.882SerSer: 3.882 ± 0.059
2.863SerThr: 2.863 ± 0.047
4.202SerVal: 4.202 ± 0.051
0.842SerTrp: 0.842 ± 0.025
1.361SerTyr: 1.361 ± 0.029
0.0SerXaa: 0.0 ± 0.0
Thr
5.818ThrAla: 5.818 ± 0.049
0.466ThrCys: 0.466 ± 0.019
2.613ThrAsp: 2.613 ± 0.041
2.572ThrGlu: 2.572 ± 0.042
1.959ThrPhe: 1.959 ± 0.037
4.708ThrGly: 4.708 ± 0.055
0.969ThrHis: 0.969 ± 0.02
3.026ThrIle: 3.026 ± 0.045
1.573ThrLys: 1.573 ± 0.029
5.185ThrLeu: 5.185 ± 0.062
1.22ThrMet: 1.22 ± 0.029
1.339ThrAsn: 1.339 ± 0.031
2.845ThrPro: 2.845 ± 0.039
1.292ThrGln: 1.292 ± 0.028
3.009ThrArg: 3.009 ± 0.041
3.062ThrSer: 3.062 ± 0.044
2.577ThrThr: 2.577 ± 0.048
4.057ThrVal: 4.057 ± 0.052
0.571ThrTrp: 0.571 ± 0.02
1.135ThrTyr: 1.135 ± 0.031
0.0ThrXaa: 0.0 ± 0.0
Val
8.292ValAla: 8.292 ± 0.068
0.655ValCys: 0.655 ± 0.022
4.061ValAsp: 4.061 ± 0.052
4.418ValGlu: 4.418 ± 0.054
2.864ValPhe: 2.864 ± 0.045
5.319ValGly: 5.319 ± 0.062
1.383ValHis: 1.383 ± 0.03
4.216ValIle: 4.216 ± 0.049
2.422ValLys: 2.422 ± 0.039
6.819ValLeu: 6.819 ± 0.073
1.754ValMet: 1.754 ± 0.029
1.955ValAsn: 1.955 ± 0.034
3.369ValPro: 3.369 ± 0.042
1.939ValGln: 1.939 ± 0.037
4.64ValArg: 4.64 ± 0.048
4.688ValSer: 4.688 ± 0.056
4.208ValThr: 4.208 ± 0.049
5.453ValVal: 5.453 ± 0.06
0.775ValTrp: 0.775 ± 0.02
1.403ValTyr: 1.403 ± 0.025
0.0ValXaa: 0.0 ± 0.0
Trp
1.125TrpAla: 1.125 ± 0.026
0.137TrpCys: 0.137 ± 0.009
0.565TrpAsp: 0.565 ± 0.018
0.515TrpGlu: 0.515 ± 0.018
0.52TrpPhe: 0.52 ± 0.019
0.791TrpGly: 0.791 ± 0.023
0.298TrpHis: 0.298 ± 0.013
0.691TrpIle: 0.691 ± 0.018
0.551TrpLys: 0.551 ± 0.019
1.49TrpLeu: 1.49 ± 0.032
0.326TrpMet: 0.326 ± 0.014
0.482TrpAsn: 0.482 ± 0.017
0.657TrpPro: 0.657 ± 0.02
0.529TrpGln: 0.529 ± 0.019
1.101TrpArg: 1.101 ± 0.028
0.817TrpSer: 0.817 ± 0.022
0.689TrpThr: 0.689 ± 0.02
0.709TrpVal: 0.709 ± 0.022
0.217TrpTrp: 0.217 ± 0.012
0.257TrpTyr: 0.257 ± 0.011
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.329TyrAla: 2.329 ± 0.038
0.265TyrCys: 0.265 ± 0.012
1.404TyrAsp: 1.404 ± 0.028
1.181TyrGlu: 1.181 ± 0.027
0.994TyrPhe: 0.994 ± 0.022
2.019TyrGly: 2.019 ± 0.033
0.49TyrHis: 0.49 ± 0.015
0.988TyrIle: 0.988 ± 0.028
0.65TyrLys: 0.65 ± 0.021
2.182TyrLeu: 2.182 ± 0.039
0.429TyrMet: 0.429 ± 0.014
0.606TyrAsn: 0.606 ± 0.018
1.072TyrPro: 1.072 ± 0.024
0.725TyrGln: 0.725 ± 0.019
1.821TyrArg: 1.821 ± 0.033
1.252TyrSer: 1.252 ± 0.028
0.95TyrThr: 0.95 ± 0.022
1.542TyrVal: 1.542 ± 0.03
0.353TyrTrp: 0.353 ± 0.015
0.615TyrTyr: 0.615 ± 0.019
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6543 proteins (1785383 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
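A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 resamples, and the reported error is taken to be the standard deviation of those estimates. The page does not specify the exact estimator, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]  # count once, reuse per round
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation across resamples.
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteins = ["MAALAAG", "MKLAA", "MGGSAAK", "MLLV"]  # invented sequences
mean, err = bootstrap_error(toy_proteins, "AA")
print(f"AlaAla: {mean:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error estimate reflects variation between proteins.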

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski