Amino acid dipeptide frequency for Mesorhizobium sp. Root554

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). Because this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
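
As an illustration of how such per mille values can be obtained, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs within each protein and that any non-standard residue is mapped to X (Xaa); the function name dipeptide_permille, the one-letter alphabet, and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter

# The 20 standard amino acids as one-letter codes
# (the table itself uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: pairs are counted within each protein, never across proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2

proteins = ["MAAL", "AAGA"]  # toy sequences, not real data
freqs = dipeptide_permille(proteins)
for pair, value in sorted(freqs.items()):
    label = "over" if value > EXPECTED_PERMILLE else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented)")
```

With only six pairs in the toy input, every observed dipeptide is far above 2.5‰; the comparison becomes meaningful only on a whole proteome.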

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.09 ± 0.162
AlaCys: 0.958 ± 0.029
AlaAsp: 7.017 ± 0.089
AlaGlu: 7.705 ± 0.104
AlaPhe: 4.735 ± 0.074
AlaGly: 11.265 ± 0.11
AlaHis: 2.051 ± 0.043
AlaIle: 6.78 ± 0.081
AlaLys: 4.528 ± 0.082
AlaLeu: 12.71 ± 0.133
AlaMet: 3.747 ± 0.067
AlaAsn: 3.071 ± 0.057
AlaPro: 4.965 ± 0.085
AlaGln: 3.56 ± 0.069
AlaArg: 8.143 ± 0.091
AlaSer: 6.682 ± 0.091
AlaThr: 5.841 ± 0.074
AlaVal: 9.069 ± 0.102
AlaTrp: 1.4 ± 0.038
AlaTyr: 2.631 ± 0.053
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.905 ± 0.03
CysCys: 0.091 ± 0.01
CysAsp: 0.481 ± 0.022
CysGlu: 0.375 ± 0.021
CysPhe: 0.322 ± 0.017
CysGly: 0.901 ± 0.032
CysHis: 0.206 ± 0.013
CysIle: 0.372 ± 0.018
CysLys: 0.164 ± 0.013
CysLeu: 0.723 ± 0.025
CysMet: 0.152 ± 0.01
CysAsn: 0.194 ± 0.013
CysPro: 0.378 ± 0.02
CysGln: 0.206 ± 0.015
CysArg: 0.524 ± 0.022
CysSer: 0.432 ± 0.019
CysThr: 0.381 ± 0.021
CysVal: 0.572 ± 0.022
CysTrp: 0.119 ± 0.01
CysTyr: 0.188 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.812 ± 0.088
AspCys: 0.445 ± 0.021
AspAsp: 3.085 ± 0.07
AspGlu: 3.511 ± 0.059
AspPhe: 2.423 ± 0.056
AspGly: 5.278 ± 0.076
AspHis: 1.154 ± 0.035
AspIle: 3.44 ± 0.062
AspLys: 2.102 ± 0.046
AspLeu: 5.669 ± 0.083
AspMet: 1.547 ± 0.035
AspAsn: 1.471 ± 0.036
AspPro: 3.317 ± 0.051
AspGln: 1.572 ± 0.044
AspArg: 4.351 ± 0.074
AspSer: 2.257 ± 0.041
AspThr: 2.543 ± 0.049
AspVal: 3.955 ± 0.067
AspTrp: 0.929 ± 0.028
AspTyr: 1.518 ± 0.035
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.641 ± 0.093
GluCys: 0.346 ± 0.017
GluAsp: 2.783 ± 0.051
GluGlu: 3.238 ± 0.056
GluPhe: 1.904 ± 0.039
GluGly: 4.245 ± 0.067
GluHis: 1.15 ± 0.033
GluIle: 3.671 ± 0.056
GluLys: 2.919 ± 0.057
GluLeu: 5.034 ± 0.074
GluMet: 1.591 ± 0.042
GluAsn: 1.751 ± 0.033
GluPro: 2.815 ± 0.06
GluGln: 2.045 ± 0.052
GluArg: 4.635 ± 0.084
GluSer: 2.454 ± 0.044
GluThr: 3.64 ± 0.061
GluVal: 3.671 ± 0.054
GluTrp: 0.702 ± 0.027
GluTyr: 1.025 ± 0.033
GluXaa: 0.001 ± 0.001
Phe
PheAla: 4.857 ± 0.074
PheCys: 0.399 ± 0.017
PheAsp: 2.757 ± 0.052
PheGlu: 2.297 ± 0.049
PhePhe: 1.613 ± 0.04
PheGly: 3.975 ± 0.065
PheHis: 0.797 ± 0.026
PheIle: 1.926 ± 0.048
PheLys: 1.102 ± 0.034
PheLeu: 3.672 ± 0.065
PheMet: 0.888 ± 0.028
PheAsn: 1.043 ± 0.03
PhePro: 1.625 ± 0.042
PheGln: 1.097 ± 0.032
PheArg: 2.322 ± 0.045
PheSer: 2.482 ± 0.053
PheThr: 1.951 ± 0.04
PheVal: 3.059 ± 0.053
PheTrp: 0.602 ± 0.025
PheTyr: 0.972 ± 0.036
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.147 ± 0.098
GlyCys: 0.811 ± 0.03
GlyAsp: 4.518 ± 0.06
GlyGlu: 5.007 ± 0.078
GlyPhe: 3.838 ± 0.057
GlyGly: 7.556 ± 0.138
GlyHis: 1.848 ± 0.045
GlyIle: 5.047 ± 0.072
GlyLys: 4.05 ± 0.062
GlyLeu: 8.841 ± 0.1
GlyMet: 2.473 ± 0.05
GlyAsn: 2.402 ± 0.046
GlyPro: 3.402 ± 0.06
GlyGln: 2.752 ± 0.055
GlyArg: 5.85 ± 0.074
GlySer: 4.848 ± 0.068
GlyThr: 4.596 ± 0.064
GlyVal: 6.232 ± 0.077
GlyTrp: 1.338 ± 0.033
GlyTyr: 2.364 ± 0.051
GlyXaa: 0.001 ± 0.001
His
HisAla: 2.22 ± 0.048
HisCys: 0.204 ± 0.014
HisAsp: 1.194 ± 0.035
HisGlu: 1.052 ± 0.029
HisPhe: 0.827 ± 0.03
HisGly: 1.88 ± 0.046
HisHis: 0.533 ± 0.027
HisIle: 0.998 ± 0.034
HisLys: 0.542 ± 0.021
HisLeu: 1.84 ± 0.038
HisMet: 0.533 ± 0.026
HisAsn: 0.428 ± 0.02
HisPro: 1.216 ± 0.036
HisGln: 0.543 ± 0.024
HisArg: 1.317 ± 0.032
HisSer: 0.908 ± 0.033
HisThr: 0.784 ± 0.029
HisVal: 1.483 ± 0.041
HisTrp: 0.315 ± 0.019
HisTyr: 0.544 ± 0.023
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.552 ± 0.092
IleCys: 0.523 ± 0.022
IleAsp: 3.898 ± 0.066
IleGlu: 3.677 ± 0.059
IlePhe: 1.936 ± 0.049
IleGly: 5.298 ± 0.074
IleHis: 0.955 ± 0.032
IleIle: 2.412 ± 0.048
IleLys: 1.629 ± 0.038
IleLeu: 4.731 ± 0.073
IleMet: 1.112 ± 0.034
IleAsn: 1.432 ± 0.034
IlePro: 2.37 ± 0.053
IleGln: 1.265 ± 0.035
IleArg: 3.315 ± 0.055
IleSer: 3.036 ± 0.054
IleThr: 2.493 ± 0.05
IleVal: 4.896 ± 0.073
IleTrp: 0.602 ± 0.026
IleTyr: 1.203 ± 0.033
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.991 ± 0.071
LysCys: 0.159 ± 0.013
LysAsp: 2.106 ± 0.043
LysGlu: 1.851 ± 0.044
LysPhe: 1.099 ± 0.034
LysGly: 2.975 ± 0.055
LysHis: 0.636 ± 0.026
LysIle: 2.026 ± 0.042
LysLys: 1.784 ± 0.044
LysLeu: 3.828 ± 0.064
LysMet: 0.939 ± 0.029
LysAsn: 1.082 ± 0.031
LysPro: 2.569 ± 0.059
LysGln: 1.141 ± 0.033
LysArg: 2.602 ± 0.042
LysSer: 2.161 ± 0.046
LysThr: 2.275 ± 0.048
LysVal: 2.823 ± 0.055
LysTrp: 0.404 ± 0.02
LysTyr: 0.72 ± 0.027
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.296 ± 0.126
LeuCys: 0.813 ± 0.026
LeuAsp: 5.901 ± 0.083
LeuGlu: 5.08 ± 0.079
LeuPhe: 3.796 ± 0.071
LeuGly: 8.135 ± 0.092
LeuHis: 1.65 ± 0.042
LeuIle: 5.047 ± 0.082
LeuLys: 4.153 ± 0.065
LeuLeu: 9.173 ± 0.124
LeuMet: 2.326 ± 0.042
LeuAsn: 2.513 ± 0.047
LeuPro: 5.309 ± 0.062
LeuGln: 2.618 ± 0.047
LeuArg: 6.047 ± 0.079
LeuSer: 6.422 ± 0.099
LeuThr: 5.325 ± 0.067
LeuVal: 7.548 ± 0.106
LeuTrp: 1.11 ± 0.035
LeuTyr: 2.17 ± 0.047
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.376 ± 0.054
MetCys: 0.134 ± 0.013
MetAsp: 1.241 ± 0.031
MetGlu: 1.219 ± 0.035
MetPhe: 0.772 ± 0.03
MetGly: 1.861 ± 0.046
MetHis: 0.456 ± 0.02
MetIle: 1.424 ± 0.036
MetLys: 1.188 ± 0.035
MetLeu: 2.599 ± 0.042
MetMet: 0.713 ± 0.03
MetAsn: 0.919 ± 0.029
MetPro: 1.615 ± 0.039
MetGln: 0.832 ± 0.029
MetArg: 1.802 ± 0.042
MetSer: 1.697 ± 0.04
MetThr: 1.836 ± 0.042
MetVal: 1.791 ± 0.041
MetTrp: 0.207 ± 0.014
MetTyr: 0.304 ± 0.017
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.313 ± 0.056
AsnCys: 0.236 ± 0.016
AsnAsp: 1.48 ± 0.04
AsnGlu: 1.313 ± 0.033
AsnPhe: 1.04 ± 0.032
AsnGly: 2.545 ± 0.048
AsnHis: 0.54 ± 0.023
AsnIle: 1.498 ± 0.035
AsnLys: 0.842 ± 0.032
AsnLeu: 2.496 ± 0.05
AsnMet: 0.672 ± 0.026
AsnAsn: 0.661 ± 0.026
AsnPro: 1.952 ± 0.042
AsnGln: 0.782 ± 0.028
AsnArg: 1.846 ± 0.041
AsnSer: 1.316 ± 0.037
AsnThr: 1.226 ± 0.036
AsnVal: 2.029 ± 0.043
AsnTrp: 0.425 ± 0.018
AsnTyr: 0.724 ± 0.026
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.244 ± 0.101
ProCys: 0.265 ± 0.014
ProAsp: 3.518 ± 0.064
ProGlu: 3.543 ± 0.064
ProPhe: 2.065 ± 0.042
ProGly: 4.369 ± 0.06
ProHis: 0.982 ± 0.029
ProIle: 2.247 ± 0.045
ProLys: 1.978 ± 0.045
ProLeu: 4.707 ± 0.069
ProMet: 1.222 ± 0.039
ProAsn: 1.371 ± 0.036
ProPro: 2.207 ± 0.066
ProGln: 1.614 ± 0.043
ProArg: 2.748 ± 0.057
ProSer: 2.647 ± 0.05
ProThr: 2.236 ± 0.043
ProVal: 4.323 ± 0.069
ProTrp: 0.662 ± 0.028
ProTyr: 1.221 ± 0.033
ProXaa: 0.001 ± 0.001
Gln
GlnAla: 3.894 ± 0.062
GlnCys: 0.178 ± 0.014
GlnAsp: 1.467 ± 0.038
GlnGlu: 1.512 ± 0.04
GlnPhe: 1.054 ± 0.032
GlnGly: 2.29 ± 0.048
GlnHis: 0.588 ± 0.026
GlnIle: 1.755 ± 0.039
GlnLys: 1.304 ± 0.036
GlnLeu: 2.632 ± 0.052
GlnMet: 0.879 ± 0.028
GlnAsn: 0.855 ± 0.027
GlnPro: 1.718 ± 0.044
GlnGln: 1.075 ± 0.039
GlnArg: 2.182 ± 0.043
GlnSer: 1.713 ± 0.044
GlnThr: 1.708 ± 0.04
GlnVal: 2.07 ± 0.05
GlnTrp: 0.392 ± 0.019
GlnTyr: 0.585 ± 0.025
GlnXaa: 0.001 ± 0.001
Arg
ArgAla: 7.131 ± 0.099
ArgCys: 0.423 ± 0.018
ArgAsp: 3.911 ± 0.065
ArgGlu: 3.904 ± 0.066
ArgPhe: 3.07 ± 0.051
ArgGly: 4.56 ± 0.072
ArgHis: 1.666 ± 0.037
ArgIle: 4.039 ± 0.055
ArgLys: 2.633 ± 0.044
ArgLeu: 7.44 ± 0.1
ArgMet: 1.797 ± 0.039
ArgAsn: 1.907 ± 0.05
ArgPro: 3.291 ± 0.059
ArgGln: 2.535 ± 0.051
ArgArg: 5.256 ± 0.077
ArgSer: 3.553 ± 0.068
ArgThr: 3.274 ± 0.059
ArgVal: 4.325 ± 0.062
ArgTrp: 0.846 ± 0.032
ArgTyr: 1.589 ± 0.034
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.405 ± 0.071
SerCys: 0.368 ± 0.02
SerAsp: 3.038 ± 0.05
SerGlu: 2.798 ± 0.05
SerPhe: 2.385 ± 0.051
SerGly: 5.811 ± 0.077
SerHis: 1.103 ± 0.031
SerIle: 2.965 ± 0.055
SerLys: 1.766 ± 0.045
SerLeu: 5.476 ± 0.07
SerMet: 1.403 ± 0.036
SerAsn: 1.479 ± 0.037
SerPro: 2.712 ± 0.052
SerGln: 1.648 ± 0.047
SerArg: 3.621 ± 0.059
SerSer: 2.988 ± 0.059
SerThr: 2.688 ± 0.052
SerVal: 4.192 ± 0.066
SerTrp: 0.737 ± 0.029
SerTyr: 1.283 ± 0.035
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.068 ± 0.082
ThrCys: 0.361 ± 0.019
ThrAsp: 2.764 ± 0.044
ThrGlu: 2.655 ± 0.045
ThrPhe: 2.002 ± 0.042
ThrGly: 5.253 ± 0.07
ThrHis: 0.958 ± 0.031
ThrIle: 2.967 ± 0.06
ThrLys: 1.583 ± 0.039
ThrLeu: 5.549 ± 0.074
ThrMet: 1.251 ± 0.031
ThrAsn: 1.267 ± 0.035
ThrPro: 3.035 ± 0.065
ThrGln: 1.324 ± 0.039
ThrArg: 3.0 ± 0.054
ThrSer: 2.693 ± 0.045
ThrThr: 2.702 ± 0.05
ThrVal: 4.539 ± 0.07
ThrTrp: 0.589 ± 0.025
ThrTyr: 1.144 ± 0.03
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.312 ± 0.105
ValCys: 0.591 ± 0.025
ValAsp: 4.182 ± 0.064
ValGlu: 4.734 ± 0.074
ValPhe: 3.076 ± 0.055
ValGly: 5.792 ± 0.079
ValHis: 1.314 ± 0.037
ValIle: 4.08 ± 0.065
ValLys: 2.623 ± 0.05
ValLeu: 7.494 ± 0.095
ValMet: 1.888 ± 0.044
ValAsn: 1.994 ± 0.045
ValPro: 3.797 ± 0.069
ValGln: 1.963 ± 0.042
ValArg: 4.819 ± 0.061
ValSer: 4.587 ± 0.064
ValThr: 4.4 ± 0.064
ValVal: 5.872 ± 0.086
ValTrp: 0.862 ± 0.027
ValTyr: 1.568 ± 0.037
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.163 ± 0.033
TrpCys: 0.13 ± 0.01
TrpAsp: 0.564 ± 0.026
TrpGlu: 0.534 ± 0.02
TrpPhe: 0.552 ± 0.023
TrpGly: 0.814 ± 0.03
TrpHis: 0.319 ± 0.017
TrpIle: 0.652 ± 0.023
TrpLys: 0.52 ± 0.022
TrpLeu: 1.61 ± 0.043
TrpMet: 0.362 ± 0.018
TrpAsn: 0.476 ± 0.021
TrpPro: 0.686 ± 0.023
TrpGln: 0.567 ± 0.027
TrpArg: 1.064 ± 0.033
TrpSer: 0.785 ± 0.029
TrpThr: 0.704 ± 0.025
TrpVal: 0.8 ± 0.029
TrpTrp: 0.238 ± 0.017
TrpTyr: 0.279 ± 0.015
TrpXaa: 0.002 ± 0.001
Tyr
TyrAla: 2.53 ± 0.05
TyrCys: 0.249 ± 0.015
TyrAsp: 1.473 ± 0.045
TyrGlu: 1.254 ± 0.033
TyrPhe: 0.927 ± 0.03
TyrGly: 2.119 ± 0.047
TyrHis: 0.454 ± 0.021
TyrIle: 0.989 ± 0.031
TyrLys: 0.763 ± 0.028
TyrLeu: 2.204 ± 0.045
TyrMet: 0.489 ± 0.023
TyrAsn: 0.62 ± 0.026
TyrPro: 1.191 ± 0.031
TyrGln: 0.712 ± 0.026
TyrArg: 1.706 ± 0.041
TyrSer: 1.207 ± 0.034
TyrThr: 1.105 ± 0.032
TyrVal: 1.7 ± 0.04
TyrTrp: 0.342 ± 0.016
TyrTyr: 0.598 ± 0.025
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.001
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.001 ± 0.001
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.001 ± 0.001
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.001
XaaGln: 0.0 ± 0.0
XaaArg: 0.002 ± 0.001
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3514 proteins (1103888 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
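
A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the standard deviation across replicates. The exact procedure used by Proteome-pI is not described here, and the names pair_counts and bootstrap_error are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    so all pairs of a chosen protein enter the replicate together.
    """
    rng = random.Random(seed)
    # Pre-compute per-protein counts once; replicates only re-weight them.
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not real data.
mean, err = bootstrap_error(["MAAL", "AAGA", "GGAA"], "AA")
print(f"AA: {mean:.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs of one protein together, which is what "at the protein level" suggests.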

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski