Amino acid dipeptide frequency for Muribaculaceae bacterium Isolate-002 (NCI)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
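As a rough illustration of what such a frequency means, the sketch below counts adjacent residue pairs in a list of protein sequences and reports them in per mille. It assumes one-letter amino acid codes, overlapping pairs, and no pairs spanning two proteins; the function name and the toy sequences are hypothetical, and the database's exact counting procedure is not stated on this page.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping windows of length 2, counted within each
    protein only (a pair never spans two proteins).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MAAK", "AAL"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

The toy input contains five pairs in total (MA, AA, AK from the first sequence; AA, AL from the second), so AA accounts for two fifths of them.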

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those which are underrepresented are marked in blue in the table.
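The expected value and the over/under-representation call can be sketched as follows. The observed values are copied from the table on this page; the helper names are hypothetical, and the assumption that the colouring threshold is exactly the uniform expectation is mine, since the page does not state the cut-off it uses.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Per mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the expectation (assumed threshold)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


expected_20 = uniform_expectation_permille(20)   # 400 possible dipeptides
expected_21 = uniform_expectation_permille(21)   # 441 when Xaa is included

# Observed values (per mille) taken from the table on this page.
observed = {"AlaAla": 8.309, "LeuLeu": 7.61, "CysCys": 0.207, "TrpTrp": 0.201}
labels = {pair: classify(value, expected_20) for pair, value in observed.items()}

print(expected_20, round(expected_21, 3))
print(labels)
```

With 20 amino acids the expectation is 2.5‰, so AlaAla and LeuLeu come out as overrepresented and CysCys and TrpTrp as underrepresented.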

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
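As a small worked example of this unit conversion, the snippet below turns the AlaAla value from the table into a fraction and a percentage. The function name is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala_fraction = permille_to_fraction(8.309)  # AlaAla value from the table
ala_ala_percent = ala_ala_fraction * 100

print(f"{ala_ala_fraction:.6f}")
print(f"{ala_ala_percent:.4f} %")
```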

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.309 ± 0.138
AlaCys: 0.998 ± 0.036
AlaAsp: 6.021 ± 0.09
AlaGlu: 5.327 ± 0.107
AlaPhe: 3.135 ± 0.061
AlaGly: 6.465 ± 0.094
AlaHis: 1.29 ± 0.036
AlaIle: 5.468 ± 0.09
AlaLys: 4.017 ± 0.079
AlaLeu: 7.816 ± 0.112
AlaMet: 2.463 ± 0.056
AlaAsn: 3.203 ± 0.051
AlaPro: 3.14 ± 0.071
AlaGln: 2.725 ± 0.059
AlaArg: 4.347 ± 0.074
AlaSer: 5.242 ± 0.076
AlaThr: 4.967 ± 0.085
AlaVal: 6.255 ± 0.084
AlaTrp: 0.884 ± 0.035
AlaTyr: 2.803 ± 0.063
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.916 ± 0.032
CysCys: 0.207 ± 0.015
CysAsp: 0.81 ± 0.028
CysGlu: 0.703 ± 0.028
CysPhe: 0.505 ± 0.022
CysGly: 1.113 ± 0.04
CysHis: 0.319 ± 0.019
CysIle: 0.739 ± 0.031
CysLys: 0.53 ± 0.021
CysLeu: 0.882 ± 0.033
CysMet: 0.333 ± 0.021
CysAsn: 0.59 ± 0.029
CysPro: 0.541 ± 0.025
CysGln: 0.338 ± 0.019
CysArg: 0.884 ± 0.029
CysSer: 0.786 ± 0.03
CysThr: 0.628 ± 0.027
CysVal: 0.835 ± 0.033
CysTrp: 0.14 ± 0.013
CysTyr: 0.492 ± 0.023
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.239 ± 0.08
AspCys: 0.794 ± 0.033
AspAsp: 3.908 ± 0.075
AspGlu: 4.048 ± 0.069
AspPhe: 3.089 ± 0.07
AspGly: 5.216 ± 0.095
AspHis: 0.888 ± 0.035
AspIle: 4.889 ± 0.076
AspLys: 3.505 ± 0.081
AspLeu: 4.604 ± 0.076
AspMet: 1.9 ± 0.044
AspAsn: 3.523 ± 0.068
AspPro: 2.19 ± 0.054
AspGln: 1.069 ± 0.036
AspArg: 3.438 ± 0.069
AspSer: 3.684 ± 0.068
AspThr: 3.495 ± 0.06
AspVal: 3.85 ± 0.063
AspTrp: 0.834 ± 0.033
AspTyr: 2.959 ± 0.059
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.486 ± 0.109
GluCys: 0.645 ± 0.028
GluAsp: 2.818 ± 0.065
GluGlu: 3.951 ± 0.088
GluPhe: 2.344 ± 0.043
GluGly: 3.876 ± 0.075
GluHis: 1.157 ± 0.037
GluIle: 4.725 ± 0.081
GluLys: 3.822 ± 0.081
GluLeu: 5.585 ± 0.093
GluMet: 1.842 ± 0.042
GluAsn: 2.986 ± 0.065
GluPro: 1.924 ± 0.051
GluGln: 2.191 ± 0.053
GluArg: 3.531 ± 0.068
GluSer: 3.033 ± 0.057
GluThr: 3.185 ± 0.063
GluVal: 3.941 ± 0.07
GluTrp: 0.782 ± 0.03
GluTyr: 2.541 ± 0.057
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.304 ± 0.057
PheCys: 0.566 ± 0.024
PheAsp: 3.052 ± 0.063
PheGlu: 2.309 ± 0.05
PhePhe: 1.794 ± 0.053
PheGly: 3.195 ± 0.061
PheHis: 0.767 ± 0.024
PheIle: 2.791 ± 0.064
PheLys: 1.974 ± 0.045
PheLeu: 3.131 ± 0.069
PheMet: 1.125 ± 0.036
PheAsn: 2.391 ± 0.054
PhePro: 1.649 ± 0.048
PheGln: 0.95 ± 0.032
PheArg: 2.123 ± 0.05
PheSer: 3.028 ± 0.063
PheThr: 2.651 ± 0.055
PheVal: 2.721 ± 0.069
PheTrp: 0.45 ± 0.021
PheTyr: 1.573 ± 0.048
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.545 ± 0.092
GlyCys: 1.094 ± 0.033
GlyAsp: 4.341 ± 0.083
GlyGlu: 4.146 ± 0.076
GlyPhe: 3.108 ± 0.064
GlyGly: 4.982 ± 0.09
GlyHis: 1.379 ± 0.038
GlyIle: 5.026 ± 0.077
GlyLys: 4.515 ± 0.071
GlyLeu: 5.553 ± 0.089
GlyMet: 2.045 ± 0.048
GlyAsn: 3.841 ± 0.066
GlyPro: 1.385 ± 0.042
GlyGln: 2.067 ± 0.046
GlyArg: 3.88 ± 0.076
GlySer: 4.336 ± 0.081
GlyThr: 4.12 ± 0.072
GlyVal: 5.333 ± 0.092
GlyTrp: 0.981 ± 0.038
GlyTyr: 3.326 ± 0.065
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.332 ± 0.045
HisCys: 0.279 ± 0.017
HisAsp: 1.197 ± 0.037
HisGlu: 0.942 ± 0.036
HisPhe: 0.899 ± 0.035
HisGly: 1.355 ± 0.038
HisHis: 0.438 ± 0.024
HisIle: 1.469 ± 0.048
HisLys: 0.841 ± 0.033
HisLeu: 1.738 ± 0.053
HisMet: 0.294 ± 0.018
HisAsn: 0.959 ± 0.032
HisPro: 1.001 ± 0.033
HisGln: 0.507 ± 0.023
HisArg: 1.09 ± 0.037
HisSer: 1.202 ± 0.035
HisThr: 1.022 ± 0.032
HisVal: 1.02 ± 0.04
HisTrp: 0.229 ± 0.017
HisTyr: 0.82 ± 0.034
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.307 ± 0.102
IleCys: 0.877 ± 0.034
IleAsp: 5.137 ± 0.086
IleGlu: 4.396 ± 0.079
IlePhe: 2.732 ± 0.058
IleGly: 4.661 ± 0.087
IleHis: 1.1 ± 0.04
IleIle: 4.651 ± 0.098
IleLys: 3.604 ± 0.064
IleLeu: 5.188 ± 0.087
IleMet: 1.467 ± 0.039
IleAsn: 3.372 ± 0.063
IlePro: 2.993 ± 0.063
IleGln: 1.534 ± 0.04
IleArg: 3.165 ± 0.071
IleSer: 4.69 ± 0.075
IleThr: 4.025 ± 0.075
IleVal: 4.656 ± 0.085
IleTrp: 0.615 ± 0.03
IleTyr: 2.523 ± 0.051
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.691 ± 0.081
LysCys: 0.513 ± 0.026
LysAsp: 2.876 ± 0.06
LysGlu: 3.954 ± 0.079
LysPhe: 2.05 ± 0.048
LysGly: 3.68 ± 0.074
LysHis: 0.96 ± 0.031
LysIle: 3.67 ± 0.071
LysLys: 3.618 ± 0.095
LysLeu: 4.105 ± 0.069
LysMet: 1.728 ± 0.041
LysAsn: 2.514 ± 0.058
LysPro: 1.856 ± 0.051
LysGln: 1.669 ± 0.046
LysArg: 2.727 ± 0.058
LysSer: 3.405 ± 0.056
LysThr: 2.821 ± 0.061
LysVal: 3.711 ± 0.066
LysTrp: 0.687 ± 0.031
LysTyr: 2.328 ± 0.053
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.122 ± 0.097
LeuCys: 1.234 ± 0.04
LeuAsp: 5.304 ± 0.071
LeuGlu: 4.389 ± 0.076
LeuPhe: 3.405 ± 0.077
LeuGly: 5.655 ± 0.09
LeuHis: 1.684 ± 0.045
LeuIle: 5.107 ± 0.085
LeuLys: 4.597 ± 0.078
LeuLeu: 7.61 ± 0.128
LeuMet: 2.295 ± 0.051
LeuAsn: 3.994 ± 0.063
LeuPro: 4.18 ± 0.065
LeuGln: 2.662 ± 0.056
LeuArg: 5.413 ± 0.083
LeuSer: 6.636 ± 0.101
LeuThr: 5.516 ± 0.074
LeuVal: 4.85 ± 0.08
LeuTrp: 0.951 ± 0.034
LeuTyr: 3.217 ± 0.067
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.454 ± 0.054
MetCys: 0.254 ± 0.016
MetAsp: 1.247 ± 0.035
MetGlu: 1.592 ± 0.041
MetPhe: 1.044 ± 0.039
MetGly: 1.601 ± 0.042
MetHis: 0.498 ± 0.022
MetIle: 1.55 ± 0.039
MetLys: 1.921 ± 0.048
MetLeu: 2.66 ± 0.065
MetMet: 0.842 ± 0.032
MetAsn: 1.182 ± 0.034
MetPro: 1.365 ± 0.039
MetGln: 0.909 ± 0.032
MetArg: 1.627 ± 0.047
MetSer: 1.944 ± 0.048
MetThr: 1.884 ± 0.051
MetVal: 1.562 ± 0.046
MetTrp: 0.272 ± 0.019
MetTyr: 0.705 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.926 ± 0.071
AsnCys: 0.562 ± 0.028
AsnAsp: 2.851 ± 0.06
AsnGlu: 2.666 ± 0.054
AsnPhe: 1.947 ± 0.052
AsnGly: 3.802 ± 0.076
AsnHis: 0.942 ± 0.036
AsnIle: 3.582 ± 0.065
AsnLys: 2.338 ± 0.051
AsnLeu: 3.934 ± 0.055
AsnMet: 1.143 ± 0.033
AsnAsn: 2.393 ± 0.067
AsnPro: 2.739 ± 0.06
AsnGln: 1.229 ± 0.04
AsnArg: 2.634 ± 0.057
AsnSer: 2.84 ± 0.066
AsnThr: 2.448 ± 0.058
AsnVal: 3.1 ± 0.057
AsnTrp: 0.572 ± 0.025
AsnTyr: 1.878 ± 0.046
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.79 ± 0.079
ProCys: 0.404 ± 0.023
ProAsp: 3.229 ± 0.057
ProGlu: 3.293 ± 0.062
ProPhe: 1.607 ± 0.043
ProGly: 2.864 ± 0.057
ProHis: 0.758 ± 0.028
ProIle: 1.994 ± 0.046
ProLys: 1.753 ± 0.044
ProLeu: 3.335 ± 0.065
ProMet: 1.02 ± 0.039
ProAsn: 1.405 ± 0.041
ProPro: 0.985 ± 0.04
ProGln: 1.465 ± 0.039
ProArg: 1.85 ± 0.043
ProSer: 2.552 ± 0.061
ProThr: 2.232 ± 0.05
ProVal: 3.385 ± 0.059
ProTrp: 0.453 ± 0.025
ProTyr: 1.62 ± 0.045
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.384 ± 0.057
GlnCys: 0.318 ± 0.02
GlnAsp: 1.203 ± 0.036
GlnGlu: 1.607 ± 0.048
GlnPhe: 1.275 ± 0.041
GlnGly: 1.757 ± 0.048
GlnHis: 0.598 ± 0.029
GlnIle: 1.989 ± 0.043
GlnLys: 1.674 ± 0.046
GlnLeu: 3.003 ± 0.06
GlnMet: 0.979 ± 0.033
GlnAsn: 1.249 ± 0.042
GlnPro: 1.316 ± 0.043
GlnGln: 1.361 ± 0.058
GlnArg: 1.854 ± 0.046
GlnSer: 2.021 ± 0.048
GlnThr: 1.574 ± 0.046
GlnVal: 1.688 ± 0.045
GlnTrp: 0.504 ± 0.027
GlnTyr: 1.248 ± 0.041
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.733 ± 0.077
ArgCys: 0.644 ± 0.031
ArgAsp: 3.08 ± 0.066
ArgGlu: 3.365 ± 0.073
ArgPhe: 2.47 ± 0.054
ArgGly: 3.239 ± 0.064
ArgHis: 1.487 ± 0.043
ArgIle: 3.848 ± 0.071
ArgLys: 3.118 ± 0.065
ArgLeu: 5.572 ± 0.096
ArgMet: 1.589 ± 0.046
ArgAsn: 2.644 ± 0.052
ArgPro: 2.164 ± 0.052
ArgGln: 2.237 ± 0.058
ArgArg: 3.978 ± 0.072
ArgSer: 2.828 ± 0.057
ArgThr: 2.736 ± 0.055
ArgVal: 3.27 ± 0.064
ArgTrp: 0.709 ± 0.032
ArgTyr: 2.621 ± 0.057
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.293 ± 0.078
SerCys: 0.766 ± 0.03
SerAsp: 4.027 ± 0.062
SerGlu: 3.531 ± 0.071
SerPhe: 2.771 ± 0.062
SerGly: 5.133 ± 0.087
SerHis: 1.305 ± 0.04
SerIle: 4.201 ± 0.08
SerLys: 2.946 ± 0.062
SerLeu: 5.994 ± 0.097
SerMet: 1.627 ± 0.046
SerAsn: 2.587 ± 0.058
SerPro: 2.698 ± 0.06
SerGln: 2.003 ± 0.048
SerArg: 3.585 ± 0.069
SerSer: 4.025 ± 0.085
SerThr: 3.614 ± 0.063
SerVal: 4.771 ± 0.068
SerTrp: 0.727 ± 0.03
SerTyr: 2.473 ± 0.064
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.281 ± 0.081
ThrCys: 0.562 ± 0.027
ThrAsp: 3.987 ± 0.08
ThrGlu: 3.19 ± 0.068
ThrPhe: 2.538 ± 0.052
ThrGly: 4.578 ± 0.082
ThrHis: 0.992 ± 0.033
ThrIle: 3.879 ± 0.07
ThrLys: 2.248 ± 0.055
ThrLeu: 5.481 ± 0.086
ThrMet: 1.25 ± 0.037
ThrAsn: 2.116 ± 0.048
ThrPro: 3.339 ± 0.064
ThrGln: 1.521 ± 0.038
ThrArg: 2.681 ± 0.047
ThrSer: 3.54 ± 0.068
ThrThr: 3.314 ± 0.071
ThrVal: 4.763 ± 0.076
ThrTrp: 0.601 ± 0.028
ThrTyr: 2.157 ± 0.058
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.995 ± 0.086
ValCys: 0.838 ± 0.033
ValAsp: 4.594 ± 0.072
ValGlu: 4.32 ± 0.076
ValPhe: 2.58 ± 0.065
ValGly: 4.093 ± 0.069
ValHis: 0.978 ± 0.033
ValIle: 4.635 ± 0.085
ValLys: 4.001 ± 0.057
ValLeu: 5.046 ± 0.082
ValMet: 1.868 ± 0.048
ValAsn: 3.475 ± 0.064
ValPro: 2.558 ± 0.051
ValGln: 1.575 ± 0.045
ValArg: 3.4 ± 0.062
ValSer: 4.83 ± 0.07
ValThr: 4.735 ± 0.078
ValVal: 4.742 ± 0.089
ValTrp: 0.754 ± 0.028
ValTyr: 2.502 ± 0.052
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.775 ± 0.03
TrpCys: 0.166 ± 0.014
TrpAsp: 0.716 ± 0.027
TrpGlu: 0.671 ± 0.029
TrpPhe: 0.471 ± 0.023
TrpGly: 0.893 ± 0.035
TrpHis: 0.285 ± 0.019
TrpIle: 0.733 ± 0.032
TrpLys: 0.65 ± 0.026
TrpLeu: 1.169 ± 0.042
TrpMet: 0.312 ± 0.017
TrpAsn: 0.691 ± 0.031
TrpPro: 0.292 ± 0.019
TrpGln: 0.467 ± 0.026
TrpArg: 0.728 ± 0.027
TrpSer: 0.784 ± 0.031
TrpThr: 0.681 ± 0.026
TrpVal: 0.693 ± 0.028
TrpTrp: 0.201 ± 0.013
TrpTyr: 0.44 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.996 ± 0.056
TyrCys: 0.572 ± 0.026
TyrAsp: 2.725 ± 0.066
TyrGlu: 1.967 ± 0.046
TyrPhe: 1.778 ± 0.047
TyrGly: 2.847 ± 0.057
TyrHis: 0.808 ± 0.028
TyrIle: 2.745 ± 0.061
TyrLys: 1.888 ± 0.048
TyrLeu: 3.417 ± 0.061
TyrMet: 0.991 ± 0.034
TyrAsn: 2.413 ± 0.064
TyrPro: 1.678 ± 0.047
TyrGln: 1.123 ± 0.038
TyrArg: 2.394 ± 0.052
TyrSer: 2.671 ± 0.068
TyrThr: 2.371 ± 0.062
TyrVal: 2.373 ± 0.048
TyrTrp: 0.471 ± 0.025
TyrTyr: 1.867 ± 0.054
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2861 proteins (936511 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
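A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those values is taken as the error. This is only a minimal sketch under assumptions; the function names and toy sequences are hypothetical, one-letter codes are used, and the use of the standard deviation as the reported error is my assumption, since the page does not say which spread measure is used.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over resamples.

    Resampling unit: whole proteins, drawn with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MAAK", "AAL", "GGG", "KLAA"]
error_aa = bootstrap_error(toy, "AA")
print(round(error_aa, 1))
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.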

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski