Amino acid dipeptide frequency for Eubacterium sp. AM18-10LB-B

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
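As an illustration, the counting behind such a table can be sketched in Python. This is a minimal sketch, not the database's actual pipeline: the function names `dipeptide_per_mille` and `label`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions. The red/blue threshold is taken to be the uniform expectation of 2.5‰ described above.

```python
from collections import Counter

# Uniform expectation: 1 of 400 standard dipeptides = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def label(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # marked in red in the table
    if freq_per_mille < expected:
        return "under"  # marked in blue in the table
    return "expected"


# Made-up sequences in one-letter code, for illustration only.
toy_proteins = ["MKEEL", "MKE"]
freqs = dipeptide_per_mille(toy_proteins)
for pair, freq in sorted(freqs.items()):
    print(pair, round(freq, 3), label(freq))
```

With values from the table, `label(10.159)` (LysGlu) gives "over" and `label(0.492)` (ProPro) gives "under".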

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
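The unit conversion can be written out as a short Python sketch. The helper names are hypothetical; the sample value is the LysGlu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10


lys_glu = 10.159  # LysGlu frequency from the table, in per mille
print(f"{per_mille_to_fraction(lys_glu):.6f}")  # as a fraction
print(f"{per_mille_to_percent(lys_glu):.4f}")   # as a percentage
```

In the same units, the uniform expectation of 0.25% corresponds to 2.5‰.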

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.944 ± 0.09
AlaCys: 1.056 ± 0.038
AlaAsp: 3.193 ± 0.074
AlaGlu: 3.32 ± 0.068
AlaPhe: 2.971 ± 0.076
AlaGly: 4.143 ± 0.099
AlaHis: 1.154 ± 0.044
AlaIle: 5.001 ± 0.098
AlaLys: 4.842 ± 0.087
AlaLeu: 6.405 ± 0.102
AlaMet: 2.144 ± 0.06
AlaAsn: 2.603 ± 0.055
AlaPro: 1.403 ± 0.053
AlaGln: 2.306 ± 0.064
AlaArg: 2.166 ± 0.054
AlaSer: 3.814 ± 0.074
AlaThr: 2.635 ± 0.073
AlaVal: 4.259 ± 0.079
AlaTrp: 0.519 ± 0.029
AlaTyr: 2.81 ± 0.067
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.986 ± 0.039
CysCys: 0.251 ± 0.022
CysAsp: 0.914 ± 0.038
CysGlu: 0.999 ± 0.039
CysPhe: 0.764 ± 0.033
CysGly: 1.201 ± 0.047
CysHis: 0.356 ± 0.022
CysIle: 1.403 ± 0.049
CysLys: 1.023 ± 0.043
CysLeu: 1.21 ± 0.047
CysMet: 0.528 ± 0.029
CysAsn: 0.617 ± 0.032
CysPro: 0.519 ± 0.03
CysGln: 0.345 ± 0.021
CysArg: 0.484 ± 0.026
CysSer: 0.933 ± 0.041
CysThr: 0.656 ± 0.034
CysVal: 1.046 ± 0.042
CysTrp: 0.102 ± 0.011
CysTyr: 0.592 ± 0.033
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.684 ± 0.081
AspCys: 0.768 ± 0.037
AspAsp: 2.951 ± 0.068
AspGlu: 4.916 ± 0.085
AspPhe: 2.809 ± 0.061
AspGly: 3.839 ± 0.093
AspHis: 0.999 ± 0.037
AspIle: 5.233 ± 0.082
AspLys: 4.331 ± 0.078
AspLeu: 4.634 ± 0.075
AspMet: 1.923 ± 0.059
AspAsn: 2.342 ± 0.058
AspPro: 1.567 ± 0.046
AspGln: 1.132 ± 0.039
AspArg: 1.81 ± 0.052
AspSer: 2.984 ± 0.066
AspThr: 3.029 ± 0.068
AspVal: 4.149 ± 0.077
AspTrp: 0.464 ± 0.03
AspTyr: 2.65 ± 0.055
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.327 ± 0.085
GluCys: 0.925 ± 0.038
GluAsp: 4.715 ± 0.086
GluGlu: 7.845 ± 0.129
GluPhe: 2.546 ± 0.054
GluGly: 3.922 ± 0.082
GluHis: 1.568 ± 0.046
GluIle: 6.368 ± 0.089
GluLys: 7.861 ± 0.11
GluLeu: 6.555 ± 0.099
GluMet: 2.61 ± 0.056
GluAsn: 4.845 ± 0.095
GluPro: 1.518 ± 0.052
GluGln: 2.654 ± 0.067
GluArg: 2.858 ± 0.073
GluSer: 3.216 ± 0.068
GluThr: 3.567 ± 0.07
GluVal: 4.866 ± 0.086
GluTrp: 0.673 ± 0.034
GluTyr: 3.057 ± 0.065
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.658 ± 0.066
PheCys: 0.638 ± 0.029
PheAsp: 2.792 ± 0.063
PheGlu: 2.954 ± 0.061
PhePhe: 2.189 ± 0.073
PheGly: 2.679 ± 0.064
PheHis: 1.154 ± 0.038
PheIle: 3.466 ± 0.083
PheLys: 2.663 ± 0.06
PheLeu: 4.757 ± 0.105
PheMet: 1.298 ± 0.041
PheAsn: 1.748 ± 0.05
PhePro: 1.323 ± 0.042
PheGln: 1.49 ± 0.052
PheArg: 1.311 ± 0.044
PheSer: 3.327 ± 0.074
PheThr: 2.395 ± 0.057
PheVal: 3.108 ± 0.064
PheTrp: 0.349 ± 0.02
PheTyr: 2.073 ± 0.06
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.652 ± 0.089
GlyCys: 1.107 ± 0.04
GlyAsp: 2.68 ± 0.065
GlyGlu: 3.611 ± 0.073
GlyPhe: 3.021 ± 0.07
GlyGly: 3.772 ± 0.109
GlyHis: 1.052 ± 0.039
GlyIle: 6.053 ± 0.104
GlyLys: 5.431 ± 0.102
GlyLeu: 4.969 ± 0.099
GlyMet: 2.154 ± 0.061
GlyAsn: 3.357 ± 0.074
GlyPro: 0.986 ± 0.04
GlyGln: 1.499 ± 0.05
GlyArg: 2.116 ± 0.062
GlySer: 3.458 ± 0.077
GlyThr: 3.258 ± 0.08
GlyVal: 4.248 ± 0.089
GlyTrp: 0.561 ± 0.034
GlyTyr: 3.145 ± 0.079
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.331 ± 0.04
HisCys: 0.366 ± 0.023
HisAsp: 1.188 ± 0.042
HisGlu: 1.328 ± 0.046
HisPhe: 0.91 ± 0.039
HisGly: 1.286 ± 0.047
HisHis: 0.571 ± 0.034
HisIle: 1.835 ± 0.044
HisLys: 1.476 ± 0.041
HisLeu: 1.921 ± 0.054
HisMet: 0.657 ± 0.027
HisAsn: 0.82 ± 0.037
HisPro: 0.929 ± 0.037
HisGln: 0.723 ± 0.034
HisArg: 0.801 ± 0.031
HisSer: 1.144 ± 0.038
HisThr: 1.123 ± 0.039
HisVal: 1.485 ± 0.044
HisTrp: 0.166 ± 0.015
HisTyr: 0.868 ± 0.032
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.488 ± 0.088
IleCys: 1.365 ± 0.048
IleAsp: 4.68 ± 0.081
IleGlu: 6.004 ± 0.094
IlePhe: 3.36 ± 0.097
IleGly: 4.991 ± 0.093
IleHis: 1.87 ± 0.057
IleIle: 5.716 ± 0.127
IleLys: 5.613 ± 0.107
IleLeu: 7.976 ± 0.113
IleMet: 2.085 ± 0.048
IleAsn: 3.445 ± 0.077
IlePro: 2.898 ± 0.063
IleGln: 3.187 ± 0.063
IleArg: 3.158 ± 0.079
IleSer: 5.831 ± 0.107
IleThr: 4.226 ± 0.078
IleVal: 5.357 ± 0.094
IleTrp: 0.537 ± 0.03
IleTyr: 3.166 ± 0.07
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.915 ± 0.084
LysCys: 0.725 ± 0.032
LysAsp: 5.735 ± 0.096
LysGlu: 10.159 ± 0.132
LysPhe: 2.036 ± 0.046
LysGly: 4.622 ± 0.091
LysHis: 1.641 ± 0.049
LysIle: 5.86 ± 0.093
LysLys: 8.457 ± 0.129
LysLeu: 6.168 ± 0.085
LysMet: 2.649 ± 0.058
LysAsn: 4.675 ± 0.089
LysPro: 2.067 ± 0.054
LysGln: 3.766 ± 0.092
LysArg: 3.366 ± 0.073
LysSer: 3.807 ± 0.068
LysThr: 4.262 ± 0.081
LysVal: 4.908 ± 0.08
LysTrp: 0.59 ± 0.031
LysTyr: 3.144 ± 0.078
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.576 ± 0.102
LeuCys: 1.73 ± 0.05
LeuAsp: 5.251 ± 0.094
LeuGlu: 6.474 ± 0.093
LeuPhe: 4.509 ± 0.105
LeuGly: 5.436 ± 0.099
LeuHis: 2.306 ± 0.06
LeuIle: 6.324 ± 0.113
LeuLys: 7.635 ± 0.098
LeuLeu: 9.715 ± 0.15
LeuMet: 2.737 ± 0.065
LeuAsn: 4.33 ± 0.08
LeuPro: 3.227 ± 0.062
LeuGln: 3.512 ± 0.085
LeuArg: 3.473 ± 0.067
LeuSer: 6.762 ± 0.104
LeuThr: 4.346 ± 0.081
LeuVal: 5.332 ± 0.09
LeuTrp: 0.736 ± 0.034
LeuTyr: 3.949 ± 0.083
LeuXaa: 0.001 ± 0.002
Met
MetAla: 1.923 ± 0.05
MetCys: 0.393 ± 0.022
MetAsp: 1.855 ± 0.057
MetGlu: 2.668 ± 0.065
MetPhe: 1.247 ± 0.043
MetGly: 1.783 ± 0.053
MetHis: 0.649 ± 0.033
MetIle: 2.515 ± 0.061
MetLys: 3.522 ± 0.07
MetLeu: 2.93 ± 0.06
MetMet: 1.149 ± 0.037
MetAsn: 1.744 ± 0.053
MetPro: 0.946 ± 0.036
MetGln: 1.381 ± 0.045
MetArg: 1.116 ± 0.039
MetSer: 1.779 ± 0.045
MetThr: 1.294 ± 0.046
MetVal: 1.787 ± 0.05
MetTrp: 0.183 ± 0.015
MetTyr: 1.085 ± 0.041
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.263 ± 0.067
AsnCys: 0.678 ± 0.03
AsnAsp: 2.788 ± 0.067
AsnGlu: 3.637 ± 0.081
AsnPhe: 1.872 ± 0.055
AsnGly: 3.421 ± 0.087
AsnHis: 1.042 ± 0.031
AsnIle: 4.172 ± 0.088
AsnLys: 4.013 ± 0.078
AsnLeu: 4.107 ± 0.072
AsnMet: 1.578 ± 0.046
AsnAsn: 2.322 ± 0.065
AsnPro: 1.896 ± 0.05
AsnGln: 1.618 ± 0.055
AsnArg: 1.937 ± 0.054
AsnSer: 2.473 ± 0.058
AsnThr: 2.67 ± 0.061
AsnVal: 3.26 ± 0.072
AsnTrp: 0.441 ± 0.024
AsnTyr: 2.091 ± 0.065
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.604 ± 0.05
ProCys: 0.415 ± 0.025
ProAsp: 1.472 ± 0.046
ProGlu: 2.025 ± 0.048
ProPhe: 1.643 ± 0.05
ProGly: 1.401 ± 0.051
ProHis: 0.608 ± 0.028
ProIle: 2.209 ± 0.054
ProLys: 2.167 ± 0.053
ProLeu: 2.757 ± 0.061
ProMet: 0.876 ± 0.038
ProAsn: 1.55 ± 0.045
ProPro: 0.492 ± 0.026
ProGln: 1.02 ± 0.04
ProArg: 0.811 ± 0.034
ProSer: 1.852 ± 0.049
ProThr: 1.339 ± 0.045
ProVal: 2.272 ± 0.058
ProTrp: 0.275 ± 0.019
ProTyr: 1.563 ± 0.054
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.016 ± 0.059
GlnCys: 0.427 ± 0.025
GlnAsp: 1.814 ± 0.051
GlnGlu: 2.869 ± 0.07
GlnPhe: 1.327 ± 0.04
GlnGly: 1.981 ± 0.057
GlnHis: 0.685 ± 0.033
GlnIle: 2.898 ± 0.063
GlnLys: 3.502 ± 0.077
GlnLeu: 3.226 ± 0.079
GlnMet: 1.166 ± 0.04
GlnAsn: 1.943 ± 0.052
GlnPro: 0.884 ± 0.039
GlnGln: 1.428 ± 0.055
GlnArg: 1.415 ± 0.048
GlnSer: 1.818 ± 0.052
GlnThr: 1.616 ± 0.044
GlnVal: 1.952 ± 0.049
GlnTrp: 0.301 ± 0.02
GlnTyr: 1.546 ± 0.045
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.762 ± 0.046
ArgCys: 0.523 ± 0.029
ArgAsp: 1.676 ± 0.047
ArgGlu: 2.542 ± 0.065
ArgPhe: 1.738 ± 0.048
ArgGly: 1.811 ± 0.054
ArgHis: 0.71 ± 0.031
ArgIle: 3.284 ± 0.065
ArgLys: 3.794 ± 0.081
ArgLeu: 3.297 ± 0.076
ArgMet: 1.355 ± 0.042
ArgAsn: 2.215 ± 0.053
ArgPro: 0.929 ± 0.035
ArgGln: 1.242 ± 0.042
ArgArg: 1.537 ± 0.052
ArgSer: 1.96 ± 0.054
ArgThr: 1.656 ± 0.046
ArgVal: 2.187 ± 0.052
ArgTrp: 0.307 ± 0.019
ArgTyr: 1.871 ± 0.059
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 3.531 ± 0.075
SerCys: 0.888 ± 0.038
SerAsp: 3.157 ± 0.073
SerGlu: 3.706 ± 0.078
SerPhe: 3.499 ± 0.08
SerGly: 3.952 ± 0.086
SerHis: 1.135 ± 0.042
SerIle: 5.118 ± 0.087
SerLys: 4.558 ± 0.079
SerLeu: 5.898 ± 0.102
SerMet: 2.002 ± 0.055
SerAsn: 2.854 ± 0.066
SerPro: 1.324 ± 0.047
SerGln: 1.761 ± 0.052
SerArg: 1.967 ± 0.047
SerSer: 4.155 ± 0.085
SerThr: 2.885 ± 0.065
SerVal: 4.076 ± 0.079
SerTrp: 0.52 ± 0.027
SerTyr: 3.063 ± 0.065
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.968 ± 0.073
ThrCys: 0.698 ± 0.032
ThrAsp: 2.294 ± 0.056
ThrGlu: 2.595 ± 0.066
ThrPhe: 2.345 ± 0.064
ThrGly: 3.219 ± 0.082
ThrHis: 0.989 ± 0.039
ThrIle: 4.468 ± 0.089
ThrLys: 3.7 ± 0.069
ThrLeu: 5.155 ± 0.088
ThrMet: 1.518 ± 0.041
ThrAsn: 2.366 ± 0.058
ThrPro: 1.856 ± 0.048
ThrGln: 1.728 ± 0.05
ThrArg: 1.652 ± 0.048
ThrSer: 3.315 ± 0.077
ThrThr: 2.792 ± 0.069
ThrVal: 3.035 ± 0.078
ThrTrp: 0.484 ± 0.024
ThrTyr: 2.439 ± 0.063
ThrXaa: 0.001 ± 0.002
Val
ValAla: 3.761 ± 0.09
ValCys: 1.196 ± 0.047
ValAsp: 3.775 ± 0.08
ValGlu: 4.944 ± 0.098
ValPhe: 3.157 ± 0.073
ValGly: 3.453 ± 0.088
ValHis: 1.189 ± 0.038
ValIle: 5.057 ± 0.084
ValLys: 5.093 ± 0.076
ValLeu: 6.705 ± 0.098
ValMet: 1.945 ± 0.053
ValAsn: 3.124 ± 0.072
ValPro: 1.802 ± 0.047
ValGln: 1.964 ± 0.053
ValArg: 2.274 ± 0.06
ValSer: 4.567 ± 0.082
ValThr: 3.182 ± 0.087
ValVal: 4.537 ± 0.098
ValTrp: 0.515 ± 0.027
ValTyr: 2.971 ± 0.066
ValXaa: 0.001 ± 0.001
Trp
TrpAla: 0.405 ± 0.022
TrpCys: 0.147 ± 0.015
TrpAsp: 0.479 ± 0.028
TrpGlu: 0.551 ± 0.028
TrpPhe: 0.385 ± 0.022
TrpGly: 0.487 ± 0.029
TrpHis: 0.17 ± 0.017
TrpIle: 0.811 ± 0.035
TrpLys: 0.87 ± 0.034
TrpLeu: 0.76 ± 0.034
TrpMet: 0.299 ± 0.021
TrpAsn: 0.532 ± 0.028
TrpPro: 0.139 ± 0.014
TrpGln: 0.309 ± 0.021
TrpArg: 0.226 ± 0.017
TrpSer: 0.407 ± 0.025
TrpThr: 0.321 ± 0.022
TrpVal: 0.414 ± 0.022
TrpTrp: 0.11 ± 0.013
TrpTyr: 0.377 ± 0.024
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.897 ± 0.066
TyrCys: 0.63 ± 0.033
TyrAsp: 2.808 ± 0.061
TyrGlu: 3.349 ± 0.071
TyrPhe: 2.042 ± 0.06
TyrGly: 2.732 ± 0.065
TyrHis: 1.08 ± 0.039
TyrIle: 3.162 ± 0.062
TyrLys: 2.999 ± 0.071
TyrLeu: 4.216 ± 0.078
TyrMet: 1.277 ± 0.043
TyrAsn: 1.848 ± 0.05
TyrPro: 1.645 ± 0.051
TyrGln: 1.746 ± 0.045
TyrArg: 1.919 ± 0.056
TyrSer: 2.446 ± 0.065
TyrThr: 2.365 ± 0.06
TyrVal: 2.934 ± 0.066
TyrTrp: 0.365 ± 0.026
TyrTyr: 2.036 ± 0.063
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.001 ± 0.001
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.003 ± 0.002
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.001 ± 0.001
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.011 ± 0.006
Statistics based on 2530 proteins (753616 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
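Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled proteome, and the spread of those estimates is reported. This is only a sketch under assumptions. The source does not say which spread measure is used, so the standard deviation here is an assumption, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Std. dev. of the per-mille frequency of `pair` over resampled proteomes."""
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Made-up sequences in one-letter code, for illustration only.
toy_proteins = ["MKEEL", "MKE", "EEKK", "LLMK"]
print(round(bootstrap_error(toy_proteins, "MK"), 3))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.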

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski