Amino acid dipeptide frequency for Xanthomonas oryzae pv. oryzae (strain KACC10331 / KXO85)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, with all amino acids equally frequent, each of the 400 dipeptides would therefore be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
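The counting behind such a table can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function name `dipeptide_permille`, the toy sequences, and the choice to count overlapping residue pairs within each protein are all assumptions.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: pairs are counted as overlapping windows of two residues
    inside each protein, and never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAL", "AALX"]
freqs = dipeptide_permille(toy)
print(len(freqs), round(freqs["AA"], 3))
```

On a real proteome the input would be the list of protein sequences, and the 441 values would sum to 1000‰.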

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
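As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400 (2.5‰). The helper names are hypothetical, and it is an assumption that the red/blue marking uses exactly this uniform reference value.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Two values from the table: AlaAla (17.416) and CysCys (0.248).
for name, value in [("AlaAla", 17.416), ("CysCys", 0.248)]:
    print(name, permille_to_fraction(value), classify(value))
```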

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.416 ± 0.172
AlaCys: 1.562 ± 0.036
AlaAsp: 6.784 ± 0.07
AlaGlu: 5.85 ± 0.082
AlaPhe: 3.645 ± 0.057
AlaGly: 10.116 ± 0.118
AlaHis: 3.518 ± 0.058
AlaIle: 5.077 ± 0.078
AlaLys: 3.265 ± 0.065
AlaLeu: 15.164 ± 0.145
AlaMet: 3.18 ± 0.055
AlaAsn: 2.985 ± 0.051
AlaPro: 6.193 ± 0.075
AlaGln: 6.635 ± 0.079
AlaArg: 9.482 ± 0.093
AlaSer: 7.066 ± 0.079
AlaThr: 6.05 ± 0.08
AlaVal: 8.564 ± 0.089
AlaTrp: 1.898 ± 0.04
AlaTyr: 2.525 ± 0.04
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.509 ± 0.039
CysCys: 0.248 ± 0.017
CysAsp: 0.558 ± 0.021
CysGlu: 0.594 ± 0.023
CysPhe: 0.246 ± 0.012
CysGly: 1.038 ± 0.031
CysHis: 0.389 ± 0.019
CysIle: 0.447 ± 0.018
CysLys: 0.28 ± 0.016
CysLeu: 1.007 ± 0.028
CysMet: 0.34 ± 0.013
CysAsn: 0.286 ± 0.013
CysPro: 0.57 ± 0.023
CysGln: 0.593 ± 0.054
CysArg: 0.938 ± 0.03
CysSer: 0.621 ± 0.023
CysThr: 0.638 ± 0.022
CysVal: 0.834 ± 0.027
CysTrp: 0.168 ± 0.01
CysTyr: 0.268 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.226 ± 0.089
AspCys: 0.489 ± 0.022
AspAsp: 3.118 ± 0.052
AspGlu: 2.723 ± 0.052
AspPhe: 1.754 ± 0.037
AspGly: 5.242 ± 0.068
AspHis: 1.345 ± 0.051
AspIle: 2.089 ± 0.042
AspLys: 1.382 ± 0.045
AspLeu: 4.993 ± 0.073
AspMet: 1.027 ± 0.029
AspAsn: 1.43 ± 0.03
AspPro: 3.082 ± 0.054
AspGln: 2.271 ± 0.061
AspArg: 3.675 ± 0.051
AspSer: 2.641 ± 0.051
AspThr: 2.919 ± 0.052
AspVal: 4.05 ± 0.052
AspTrp: 0.968 ± 0.028
AspTyr: 1.424 ± 0.034
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.534 ± 0.076
GluCys: 0.412 ± 0.018
GluAsp: 2.082 ± 0.038
GluGlu: 2.019 ± 0.05
GluPhe: 1.436 ± 0.035
GluGly: 3.231 ± 0.055
GluHis: 1.561 ± 0.037
GluIle: 2.1 ± 0.043
GluLys: 1.4 ± 0.039
GluLeu: 5.272 ± 0.079
GluMet: 0.984 ± 0.032
GluAsn: 1.174 ± 0.031
GluPro: 2.241 ± 0.04
GluGln: 3.112 ± 0.051
GluArg: 5.075 ± 0.07
GluSer: 2.204 ± 0.045
GluThr: 2.412 ± 0.067
GluVal: 3.389 ± 0.055
GluTrp: 0.666 ± 0.021
GluTyr: 0.876 ± 0.031
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.973 ± 0.059
PheCys: 0.326 ± 0.015
PheAsp: 2.266 ± 0.046
PheGlu: 1.574 ± 0.035
PhePhe: 1.115 ± 0.033
PheGly: 3.051 ± 0.051
PheHis: 0.706 ± 0.024
PheIle: 1.102 ± 0.029
PheLys: 1.034 ± 0.028
PheLeu: 2.679 ± 0.049
PheMet: 0.559 ± 0.021
PheAsn: 1.006 ± 0.027
PhePro: 1.313 ± 0.035
PheGln: 1.196 ± 0.03
PheArg: 2.02 ± 0.037
PheSer: 1.841 ± 0.039
PheThr: 1.573 ± 0.036
PheVal: 2.386 ± 0.043
PheTrp: 0.468 ± 0.018
PheTyr: 0.737 ± 0.029
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.434 ± 0.108
GlyCys: 1.011 ± 0.027
GlyAsp: 4.28 ± 0.056
GlyGlu: 4.067 ± 0.058
GlyPhe: 3.238 ± 0.055
GlyGly: 6.79 ± 0.109
GlyHis: 2.067 ± 0.041
GlyIle: 3.602 ± 0.055
GlyLys: 3.236 ± 0.067
GlyLeu: 8.442 ± 0.09
GlyMet: 2.309 ± 0.041
GlyAsn: 2.437 ± 0.051
GlyPro: 2.829 ± 0.048
GlyGln: 3.506 ± 0.056
GlyArg: 6.438 ± 0.078
GlySer: 4.545 ± 0.064
GlyThr: 4.277 ± 0.101
GlyVal: 6.187 ± 0.071
GlyTrp: 1.48 ± 0.039
GlyTyr: 2.133 ± 0.043
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.999 ± 0.056
HisCys: 0.363 ± 0.017
HisAsp: 1.486 ± 0.033
HisGlu: 1.091 ± 0.028
HisPhe: 0.813 ± 0.025
HisGly: 2.672 ± 0.07
HisHis: 0.917 ± 0.034
HisIle: 0.799 ± 0.022
HisLys: 0.523 ± 0.021
HisLeu: 2.552 ± 0.051
HisMet: 0.467 ± 0.016
HisAsn: 0.545 ± 0.016
HisPro: 1.699 ± 0.036
HisGln: 0.98 ± 0.029
HisArg: 2.376 ± 0.052
HisSer: 1.22 ± 0.027
HisThr: 1.229 ± 0.032
HisVal: 1.702 ± 0.036
HisTrp: 0.665 ± 0.025
HisTyr: 0.746 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.369 ± 0.087
IleCys: 0.408 ± 0.019
IleAsp: 3.036 ± 0.053
IleGlu: 2.617 ± 0.051
IlePhe: 1.012 ± 0.027
IleGly: 4.141 ± 0.065
IleHis: 0.805 ± 0.025
IleIle: 1.174 ± 0.031
IleLys: 1.237 ± 0.033
IleLeu: 2.737 ± 0.048
IleMet: 0.552 ± 0.021
IleAsn: 1.213 ± 0.031
IlePro: 1.835 ± 0.036
IleGln: 1.129 ± 0.027
IleArg: 2.271 ± 0.039
IleSer: 2.268 ± 0.041
IleThr: 2.009 ± 0.04
IleVal: 3.208 ± 0.049
IleTrp: 0.412 ± 0.019
IleTyr: 0.762 ± 0.024
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.194 ± 0.06
LysCys: 0.214 ± 0.012
LysAsp: 1.311 ± 0.039
LysGlu: 1.112 ± 0.03
LysPhe: 0.684 ± 0.024
LysGly: 2.1 ± 0.041
LysHis: 0.741 ± 0.025
LysIle: 1.134 ± 0.031
LysLys: 1.181 ± 0.036
LysLeu: 2.925 ± 0.051
LysMet: 0.569 ± 0.018
LysAsn: 0.78 ± 0.026
LysPro: 1.884 ± 0.042
LysGln: 1.669 ± 0.062
LysArg: 2.736 ± 0.055
LysSer: 1.553 ± 0.04
LysThr: 1.749 ± 0.04
LysVal: 2.128 ± 0.048
LysTrp: 0.323 ± 0.022
LysTyr: 0.599 ± 0.023
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.162 ± 0.143
LeuCys: 1.343 ± 0.061
LeuAsp: 6.079 ± 0.08
LeuGlu: 5.323 ± 0.085
LeuPhe: 3.008 ± 0.054
LeuGly: 8.143 ± 0.097
LeuHis: 2.99 ± 0.052
LeuIle: 3.886 ± 0.068
LeuLys: 3.046 ± 0.051
LeuLeu: 11.531 ± 0.125
LeuMet: 2.192 ± 0.04
LeuAsn: 2.385 ± 0.043
LeuPro: 6.868 ± 0.1
LeuGln: 4.919 ± 0.065
LeuArg: 8.934 ± 0.084
LeuSer: 6.484 ± 0.079
LeuThr: 5.067 ± 0.089
LeuVal: 6.868 ± 0.082
LeuTrp: 1.343 ± 0.034
LeuTyr: 2.128 ± 0.04
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.501 ± 0.048
MetCys: 0.209 ± 0.012
MetAsp: 1.144 ± 0.029
MetGlu: 0.905 ± 0.027
MetPhe: 0.582 ± 0.023
MetGly: 1.352 ± 0.036
MetHis: 0.658 ± 0.021
MetIle: 0.839 ± 0.026
MetLys: 0.763 ± 0.022
MetLeu: 2.295 ± 0.042
MetMet: 0.471 ± 0.018
MetAsn: 0.575 ± 0.02
MetPro: 1.556 ± 0.035
MetGln: 1.221 ± 0.029
MetArg: 2.158 ± 0.044
MetSer: 1.612 ± 0.033
MetThr: 1.382 ± 0.03
MetVal: 1.425 ± 0.033
MetTrp: 0.247 ± 0.014
MetTyr: 0.371 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.487 ± 0.058
AsnCys: 0.275 ± 0.013
AsnAsp: 1.351 ± 0.033
AsnGlu: 1.101 ± 0.028
AsnPhe: 0.763 ± 0.028
AsnGly: 2.484 ± 0.053
AsnHis: 0.484 ± 0.022
AsnIle: 0.948 ± 0.028
AsnLys: 0.65 ± 0.021
AsnLeu: 2.409 ± 0.05
AsnMet: 0.388 ± 0.017
AsnAsn: 0.716 ± 0.032
AsnPro: 1.605 ± 0.033
AsnGln: 1.014 ± 0.028
AsnArg: 1.968 ± 0.045
AsnSer: 1.292 ± 0.036
AsnThr: 1.396 ± 0.041
AsnVal: 1.922 ± 0.042
AsnTrp: 0.432 ± 0.018
AsnTyr: 0.655 ± 0.025
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.338 ± 0.091
ProCys: 0.53 ± 0.02
ProAsp: 3.408 ± 0.062
ProGlu: 2.746 ± 0.047
ProPhe: 1.517 ± 0.036
ProGly: 4.726 ± 0.073
ProHis: 1.263 ± 0.029
ProIle: 1.854 ± 0.038
ProLys: 1.443 ± 0.034
ProLeu: 5.235 ± 0.062
ProMet: 1.457 ± 0.033
ProAsn: 1.237 ± 0.029
ProPro: 2.981 ± 0.058
ProGln: 2.492 ± 0.05
ProArg: 3.695 ± 0.063
ProSer: 3.221 ± 0.048
ProThr: 2.694 ± 0.044
ProVal: 4.252 ± 0.071
ProTrp: 0.917 ± 0.03
ProTyr: 1.162 ± 0.029
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.316 ± 0.098
GlnCys: 0.523 ± 0.023
GlnAsp: 1.902 ± 0.057
GlnGlu: 1.575 ± 0.038
GlnPhe: 1.375 ± 0.029
GlnGly: 3.145 ± 0.047
GlnHis: 1.414 ± 0.037
GlnIle: 2.073 ± 0.042
GlnLys: 0.882 ± 0.027
GlnLeu: 5.206 ± 0.071
GlnMet: 0.968 ± 0.029
GlnAsn: 0.803 ± 0.027
GlnPro: 2.726 ± 0.049
GlnGln: 2.604 ± 0.057
GlnArg: 4.997 ± 0.086
GlnSer: 2.207 ± 0.037
GlnThr: 2.331 ± 0.041
GlnVal: 3.752 ± 0.071
GlnTrp: 0.934 ± 0.03
GlnTyr: 0.841 ± 0.028
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.973 ± 0.093
ArgCys: 1.19 ± 0.033
ArgAsp: 4.274 ± 0.052
ArgGlu: 3.974 ± 0.067
ArgPhe: 2.86 ± 0.047
ArgGly: 5.392 ± 0.056
ArgHis: 2.476 ± 0.054
ArgIle: 3.599 ± 0.056
ArgLys: 2.4 ± 0.047
ArgLeu: 8.915 ± 0.106
ArgMet: 2.064 ± 0.039
ArgAsn: 2.053 ± 0.041
ArgPro: 4.088 ± 0.062
ArgGln: 4.028 ± 0.067
ArgArg: 7.068 ± 0.11
ArgSer: 4.757 ± 0.064
ArgThr: 3.931 ± 0.058
ArgVal: 5.168 ± 0.072
ArgTrp: 1.759 ± 0.038
ArgTyr: 2.299 ± 0.047
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.027 ± 0.079
SerCys: 0.618 ± 0.023
SerAsp: 3.075 ± 0.051
SerGlu: 2.614 ± 0.038
SerPhe: 1.722 ± 0.039
SerGly: 5.335 ± 0.078
SerHis: 1.414 ± 0.042
SerIle: 2.046 ± 0.045
SerLys: 1.744 ± 0.042
SerLeu: 5.559 ± 0.075
SerMet: 1.367 ± 0.026
SerAsn: 1.764 ± 0.046
SerPro: 3.064 ± 0.048
SerGln: 1.952 ± 0.036
SerArg: 4.102 ± 0.054
SerSer: 3.41 ± 0.061
SerThr: 3.094 ± 0.054
SerVal: 4.036 ± 0.06
SerTrp: 0.847 ± 0.023
SerTyr: 1.266 ± 0.04
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.298 ± 0.083
ThrCys: 0.502 ± 0.02
ThrAsp: 2.516 ± 0.043
ThrGlu: 2.018 ± 0.039
ThrPhe: 1.447 ± 0.033
ThrGly: 4.275 ± 0.076
ThrHis: 1.344 ± 0.032
ThrIle: 1.918 ± 0.039
ThrLys: 1.23 ± 0.03
ThrLeu: 6.222 ± 0.077
ThrMet: 1.01 ± 0.023
ThrAsn: 1.115 ± 0.035
ThrPro: 3.799 ± 0.07
ThrGln: 2.117 ± 0.042
ThrArg: 3.814 ± 0.048
ThrSer: 2.704 ± 0.049
ThrThr: 2.753 ± 0.053
ThrVal: 3.977 ± 0.079
ThrTrp: 0.709 ± 0.022
ThrTyr: 1.163 ± 0.031
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.203 ± 0.111
ValCys: 0.815 ± 0.029
ValAsp: 4.154 ± 0.067
ValGlu: 3.743 ± 0.061
ValPhe: 2.327 ± 0.046
ValGly: 5.321 ± 0.073
ValHis: 1.864 ± 0.034
ValIle: 3.088 ± 0.05
ValLys: 1.78 ± 0.04
ValLeu: 7.968 ± 0.099
ValMet: 1.583 ± 0.039
ValAsn: 1.771 ± 0.041
ValPro: 3.862 ± 0.059
ValGln: 3.286 ± 0.074
ValArg: 5.482 ± 0.062
ValSer: 4.155 ± 0.06
ValThr: 3.541 ± 0.055
ValVal: 5.754 ± 0.081
ValTrp: 0.981 ± 0.03
ValTyr: 1.335 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.262 ± 0.03
TrpCys: 0.237 ± 0.011
TrpAsp: 0.652 ± 0.023
TrpGlu: 0.559 ± 0.021
TrpPhe: 0.591 ± 0.018
TrpGly: 0.851 ± 0.029
TrpHis: 0.417 ± 0.018
TrpIle: 0.769 ± 0.025
TrpLys: 0.524 ± 0.019
TrpLeu: 2.185 ± 0.042
TrpMet: 0.419 ± 0.015
TrpAsn: 0.446 ± 0.016
TrpPro: 0.903 ± 0.029
TrpGln: 0.948 ± 0.027
TrpArg: 1.859 ± 0.044
TrpSer: 1.043 ± 0.026
TrpThr: 0.694 ± 0.022
TrpVal: 0.945 ± 0.027
TrpTrp: 0.308 ± 0.015
TrpTyr: 0.396 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.753 ± 0.047
TyrCys: 0.297 ± 0.014
TyrAsp: 1.374 ± 0.045
TyrGlu: 0.973 ± 0.03
TyrPhe: 0.791 ± 0.027
TyrGly: 1.9 ± 0.042
TyrHis: 0.469 ± 0.02
TyrIle: 0.647 ± 0.025
TyrLys: 0.588 ± 0.023
TyrLeu: 2.403 ± 0.047
TyrMet: 0.36 ± 0.015
TyrAsn: 0.617 ± 0.024
TyrPro: 1.073 ± 0.033
TyrGln: 0.9 ± 0.022
TyrArg: 2.181 ± 0.045
TyrSer: 1.213 ± 0.035
TyrThr: 1.194 ± 0.034
TyrVal: 1.47 ± 0.033
TyrTrp: 0.487 ± 0.02
TyrTyr: 0.554 ± 0.02
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4382 proteins (1408507 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
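A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement 100 times, and the error is taken to be the standard deviation of the replicate frequencies. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, counting overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            hits += (a + b == dipeptide)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is the spread of the bootstrap
    replicates; the source does not state the exact estimator.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, one-letter codes).
toy = ["MAALAA", "MKLAAG", "MGGAL", "MAAAAK"]
point = pair_permille(toy, "AA")
err = bootstrap_error(toy, "AA")
print(f"AA: {point:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins.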

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski