Amino acid dipeptide frequency for Clostridiaceae bacterium DONG20-135

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
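The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: it assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries), that any residue outside the 20 standard ones is mapped to Xaa (written here as X), and that "expected" means the uniform 1/400 baseline. The names `dipeptide_permille` and `classify` are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
UNIFORM_EXPECTATION = 1000.0 / 400  # 0.25% expressed in per mille, i.e. 2.5


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything that is not a standard residue becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein: positions (0,1), (1,2), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform baseline (red/blue in the table)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Toy input, not real proteome data.
toy = ["MKLLA", "MKXLL"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With values from the table, `classify(6.471)` (AlaAla) falls above the 2.5‰ baseline and `classify(0.081)` (TrpTrp) falls below it.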

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
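As a quick illustration of the unit conversion, the snippet below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the example values are the AlaAla entry from the table and the 2.5‰ uniform baseline.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


ala_ala = 6.471  # AlaAla frequency from the table, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(2.5))
```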

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.471AlaAla: 6.471 ± 0.135
1.191AlaCys: 1.191 ± 0.04
4.284AlaAsp: 4.284 ± 0.093
4.126AlaGlu: 4.126 ± 0.094
3.368AlaPhe: 3.368 ± 0.073
5.014AlaGly: 5.014 ± 0.107
1.393AlaHis: 1.393 ± 0.053
5.536AlaIle: 5.536 ± 0.098
4.914AlaLys: 4.914 ± 0.097
7.36AlaLeu: 7.36 ± 0.13
2.607AlaMet: 2.607 ± 0.059
2.654AlaAsn: 2.654 ± 0.063
1.708AlaPro: 1.708 ± 0.054
2.498AlaGln: 2.498 ± 0.059
2.65AlaArg: 2.65 ± 0.073
4.297AlaSer: 4.297 ± 0.081
2.812AlaThr: 2.812 ± 0.07
5.916AlaVal: 5.916 ± 0.109
0.637AlaTrp: 0.637 ± 0.037
3.071AlaTyr: 3.071 ± 0.081
0.0AlaXaa: 0.0 ± 0.0
Cys
1.051CysAla: 1.051 ± 0.038
0.262CysCys: 0.262 ± 0.023
0.999CysAsp: 0.999 ± 0.048
0.992CysGlu: 0.992 ± 0.045
0.708CysPhe: 0.708 ± 0.032
1.367CysGly: 1.367 ± 0.056
0.331CysHis: 0.331 ± 0.021
1.151CysIle: 1.151 ± 0.047
0.701CysLys: 0.701 ± 0.038
1.182CysLeu: 1.182 ± 0.042
0.477CysMet: 0.477 ± 0.025
0.47CysAsn: 0.47 ± 0.027
0.561CysPro: 0.561 ± 0.033
0.38CysGln: 0.38 ± 0.023
0.665CysArg: 0.665 ± 0.034
0.851CysSer: 0.851 ± 0.035
0.665CysThr: 0.665 ± 0.03
0.951CysVal: 0.951 ± 0.036
0.112CysTrp: 0.112 ± 0.012
0.562CysTyr: 0.562 ± 0.031
0.0CysXaa: 0.0 ± 0.0
Asp
4.195AspAla: 4.195 ± 0.091
0.802AspCys: 0.802 ± 0.041
3.475AspAsp: 3.475 ± 0.077
4.947AspGlu: 4.947 ± 0.083
2.664AspPhe: 2.664 ± 0.067
3.676AspGly: 3.676 ± 0.092
1.597AspHis: 1.597 ± 0.047
4.738AspIle: 4.738 ± 0.09
3.549AspLys: 3.549 ± 0.066
5.073AspLeu: 5.073 ± 0.103
1.989AspMet: 1.989 ± 0.053
2.168AspAsn: 2.168 ± 0.059
1.974AspPro: 1.974 ± 0.064
2.432AspGln: 2.432 ± 0.058
2.454AspArg: 2.454 ± 0.072
2.872AspSer: 2.872 ± 0.065
2.957AspThr: 2.957 ± 0.068
3.882AspVal: 3.882 ± 0.084
0.505AspTrp: 0.505 ± 0.03
2.725AspTyr: 2.725 ± 0.063
0.0AspXaa: 0.0 ± 0.0
Glu
5.585GluAla: 5.585 ± 0.113
0.707GluCys: 0.707 ± 0.032
3.506GluAsp: 3.506 ± 0.077
5.283GluGlu: 5.283 ± 0.111
2.199GluPhe: 2.199 ± 0.053
3.499GluGly: 3.499 ± 0.068
1.532GluHis: 1.532 ± 0.049
5.47GluIle: 5.47 ± 0.095
5.289GluLys: 5.289 ± 0.087
6.384GluLeu: 6.384 ± 0.111
2.305GluMet: 2.305 ± 0.058
3.515GluAsn: 3.515 ± 0.084
1.722GluPro: 1.722 ± 0.051
2.912GluGln: 2.912 ± 0.077
2.928GluArg: 2.928 ± 0.074
3.078GluSer: 3.078 ± 0.07
3.377GluThr: 3.377 ± 0.071
4.172GluVal: 4.172 ± 0.09
0.584GluTrp: 0.584 ± 0.029
2.585GluTyr: 2.585 ± 0.072
0.0GluXaa: 0.0 ± 0.0
Phe
2.962PheAla: 2.962 ± 0.075
0.631PheCys: 0.631 ± 0.027
2.813PheAsp: 2.813 ± 0.073
2.616PheGlu: 2.616 ± 0.064
2.02PhePhe: 2.02 ± 0.072
2.704PheGly: 2.704 ± 0.07
1.242PheHis: 1.242 ± 0.054
3.442PheIle: 3.442 ± 0.088
2.261PheLys: 2.261 ± 0.063
4.272PheLeu: 4.272 ± 0.09
1.341PheMet: 1.341 ± 0.044
1.603PheAsn: 1.603 ± 0.05
1.292PhePro: 1.292 ± 0.052
1.862PheGln: 1.862 ± 0.051
1.478PheArg: 1.478 ± 0.046
2.72PheSer: 2.72 ± 0.066
2.611PheThr: 2.611 ± 0.061
2.754PheVal: 2.754 ± 0.061
0.353PheTrp: 0.353 ± 0.023
1.824PheTyr: 1.824 ± 0.058
0.0PheXaa: 0.0 ± 0.0
Gly
4.223GlyAla: 4.223 ± 0.095
1.188GlyCys: 1.188 ± 0.045
3.157GlyAsp: 3.157 ± 0.08
3.536GlyGlu: 3.536 ± 0.086
3.059GlyPhe: 3.059 ± 0.072
4.144GlyGly: 4.144 ± 0.101
1.238GlyHis: 1.238 ± 0.048
5.987GlyIle: 5.987 ± 0.112
4.677GlyLys: 4.677 ± 0.092
5.436GlyLeu: 5.436 ± 0.099
2.229GlyMet: 2.229 ± 0.061
2.831GlyAsn: 2.831 ± 0.062
1.275GlyPro: 1.275 ± 0.059
1.671GlyGln: 1.671 ± 0.046
2.547GlyArg: 2.547 ± 0.066
3.902GlySer: 3.902 ± 0.087
3.592GlyThr: 3.592 ± 0.084
4.391GlyVal: 4.391 ± 0.093
0.573GlyTrp: 0.573 ± 0.029
3.202GlyTyr: 3.202 ± 0.062
0.0GlyXaa: 0.0 ± 0.0
His
1.635HisAla: 1.635 ± 0.054
0.349HisCys: 0.349 ± 0.026
1.476HisAsp: 1.476 ± 0.05
1.641HisGlu: 1.641 ± 0.053
1.027HisPhe: 1.027 ± 0.043
1.529HisGly: 1.529 ± 0.049
0.762HisHis: 0.762 ± 0.038
1.892HisIle: 1.892 ± 0.052
1.313HisLys: 1.313 ± 0.044
2.002HisLeu: 2.002 ± 0.058
0.695HisMet: 0.695 ± 0.031
0.919HisAsn: 0.919 ± 0.035
1.044HisPro: 1.044 ± 0.039
1.042HisGln: 1.042 ± 0.039
1.033HisArg: 1.033 ± 0.041
1.232HisSer: 1.232 ± 0.045
1.267HisThr: 1.267 ± 0.052
1.566HisVal: 1.566 ± 0.05
0.16HisTrp: 0.16 ± 0.014
0.991HisTyr: 0.991 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
6.246IleAla: 6.246 ± 0.115
1.316IleCys: 1.316 ± 0.05
5.061IleAsp: 5.061 ± 0.093
4.975IleGlu: 4.975 ± 0.091
3.134IlePhe: 3.134 ± 0.079
5.416IleGly: 5.416 ± 0.11
1.992IleHis: 1.992 ± 0.067
5.778IleIle: 5.778 ± 0.129
4.578IleLys: 4.578 ± 0.09
7.641IleLeu: 7.641 ± 0.125
2.457IleMet: 2.457 ± 0.057
3.216IleAsn: 3.216 ± 0.075
2.982IlePro: 2.982 ± 0.071
3.166IleGln: 3.166 ± 0.079
3.536IleArg: 3.536 ± 0.077
5.206IleSer: 5.206 ± 0.101
4.669IleThr: 4.669 ± 0.1
5.14IleVal: 5.14 ± 0.106
0.601IleTrp: 0.601 ± 0.034
2.94IleTyr: 2.94 ± 0.062
0.0IleXaa: 0.0 ± 0.0
Lys
5.343LysAla: 5.343 ± 0.097
0.623LysCys: 0.623 ± 0.034
4.236LysAsp: 4.236 ± 0.093
5.807LysGlu: 5.807 ± 0.105
1.766LysPhe: 1.766 ± 0.047
3.895LysGly: 3.895 ± 0.08
1.519LysHis: 1.519 ± 0.052
4.628LysIle: 4.628 ± 0.078
5.686LysLys: 5.686 ± 0.108
5.982LysLeu: 5.982 ± 0.101
2.239LysMet: 2.239 ± 0.052
3.417LysAsn: 3.417 ± 0.076
2.177LysPro: 2.177 ± 0.067
3.729LysGln: 3.729 ± 0.084
3.33LysArg: 3.33 ± 0.065
3.28LysSer: 3.28 ± 0.077
3.861LysThr: 3.861 ± 0.08
4.008LysVal: 4.008 ± 0.079
0.512LysTrp: 0.512 ± 0.027
2.335LysTyr: 2.335 ± 0.059
0.0LysXaa: 0.0 ± 0.0
Leu
6.606LeuAla: 6.606 ± 0.102
1.538LeuCys: 1.538 ± 0.053
5.654LeuAsp: 5.654 ± 0.095
5.895LeuGlu: 5.895 ± 0.117
4.223LeuPhe: 4.223 ± 0.091
5.65LeuGly: 5.65 ± 0.105
2.501LeuHis: 2.501 ± 0.068
7.419LeuIle: 7.419 ± 0.133
7.13LeuLys: 7.13 ± 0.101
9.892LeuLeu: 9.892 ± 0.173
2.971LeuMet: 2.971 ± 0.063
4.372LeuAsn: 4.372 ± 0.074
3.518LeuPro: 3.518 ± 0.081
3.493LeuGln: 3.493 ± 0.072
3.739LeuArg: 3.739 ± 0.085
6.617LeuSer: 6.617 ± 0.116
4.956LeuThr: 4.956 ± 0.084
5.274LeuVal: 5.274 ± 0.092
0.651LeuTrp: 0.651 ± 0.034
3.679LeuTyr: 3.679 ± 0.081
0.0LeuXaa: 0.0 ± 0.0
Met
2.065MetAla: 2.065 ± 0.054
0.331MetCys: 0.331 ± 0.022
2.021MetAsp: 2.021 ± 0.05
2.379MetGlu: 2.379 ± 0.058
1.148MetPhe: 1.148 ± 0.045
1.911MetGly: 1.911 ± 0.06
0.687MetHis: 0.687 ± 0.034
2.888MetIle: 2.888 ± 0.064
3.169MetLys: 3.169 ± 0.07
3.182MetLeu: 3.182 ± 0.069
1.267MetMet: 1.267 ± 0.047
1.917MetAsn: 1.917 ± 0.053
1.132MetPro: 1.132 ± 0.038
1.291MetGln: 1.291 ± 0.044
1.428MetArg: 1.428 ± 0.049
1.93MetSer: 1.93 ± 0.057
1.45MetThr: 1.45 ± 0.054
1.955MetVal: 1.955 ± 0.061
0.155MetTrp: 0.155 ± 0.015
1.054MetTyr: 1.054 ± 0.04
0.0MetXaa: 0.0 ± 0.0
Asn
3.207AsnAla: 3.207 ± 0.078
0.53AsnCys: 0.53 ± 0.026
2.626AsnAsp: 2.626 ± 0.066
3.062AsnGlu: 3.062 ± 0.08
1.625AsnPhe: 1.625 ± 0.052
3.156AsnGly: 3.156 ± 0.083
1.234AsnHis: 1.234 ± 0.039
3.331AsnIle: 3.331 ± 0.074
2.604AsnLys: 2.604 ± 0.071
3.78AsnLeu: 3.78 ± 0.076
1.409AsnMet: 1.409 ± 0.044
1.73AsnAsn: 1.73 ± 0.066
1.794AsnPro: 1.794 ± 0.047
1.868AsnGln: 1.868 ± 0.054
2.106AsnArg: 2.106 ± 0.05
2.195AsnSer: 2.195 ± 0.055
2.285AsnThr: 2.285 ± 0.06
2.764AsnVal: 2.764 ± 0.06
0.427AsnTrp: 0.427 ± 0.026
1.828AsnTyr: 1.828 ± 0.058
0.0AsnXaa: 0.0 ± 0.0
Pro
2.133ProAla: 2.133 ± 0.065
0.411ProCys: 0.411 ± 0.029
1.981ProAsp: 1.981 ± 0.056
1.964ProGlu: 1.964 ± 0.065
1.646ProPhe: 1.646 ± 0.057
1.812ProGly: 1.812 ± 0.053
0.665ProHis: 0.665 ± 0.035
2.753ProIle: 2.753 ± 0.073
2.011ProLys: 2.011 ± 0.048
3.087ProLeu: 3.087 ± 0.068
0.949ProMet: 0.949 ± 0.041
1.46ProAsn: 1.46 ± 0.043
0.605ProPro: 0.605 ± 0.031
1.301ProGln: 1.301 ± 0.047
1.001ProArg: 1.001 ± 0.046
1.79ProSer: 1.79 ± 0.055
1.524ProThr: 1.524 ± 0.05
2.268ProVal: 2.268 ± 0.055
0.277ProTrp: 0.277 ± 0.023
1.581ProTyr: 1.581 ± 0.05
0.0ProXaa: 0.0 ± 0.0
Gln
2.918GlnAla: 2.918 ± 0.069
0.406GlnCys: 0.406 ± 0.023
2.162GlnAsp: 2.162 ± 0.069
2.931GlnGlu: 2.931 ± 0.073
1.541GlnPhe: 1.541 ± 0.049
2.118GlnGly: 2.118 ± 0.061
0.811GlnHis: 0.811 ± 0.034
3.115GlnIle: 3.115 ± 0.069
3.047GlnLys: 3.047 ± 0.071
4.001GlnLeu: 4.001 ± 0.083
1.298GlnMet: 1.298 ± 0.048
1.868GlnAsn: 1.868 ± 0.06
1.148GlnPro: 1.148 ± 0.049
1.708GlnGln: 1.708 ± 0.057
1.847GlnArg: 1.847 ± 0.06
2.065GlnSer: 2.065 ± 0.056
1.89GlnThr: 1.89 ± 0.052
2.349GlnVal: 2.349 ± 0.054
0.268GlnTrp: 0.268 ± 0.021
1.706GlnTyr: 1.706 ± 0.053
0.0GlnXaa: 0.0 ± 0.0
Arg
2.299ArgAla: 2.299 ± 0.059
0.67ArgCys: 0.67 ± 0.036
2.282ArgAsp: 2.282 ± 0.065
2.934ArgGlu: 2.934 ± 0.066
1.962ArgPhe: 1.962 ± 0.055
2.184ArgGly: 2.184 ± 0.048
0.954ArgHis: 0.954 ± 0.035
3.729ArgIle: 3.729 ± 0.079
3.163ArgLys: 3.163 ± 0.074
3.867ArgLeu: 3.867 ± 0.09
1.577ArgMet: 1.577 ± 0.045
2.042ArgAsn: 2.042 ± 0.055
1.213ArgPro: 1.213 ± 0.052
1.681ArgGln: 1.681 ± 0.05
2.064ArgArg: 2.064 ± 0.058
2.505ArgSer: 2.505 ± 0.064
2.093ArgThr: 2.093 ± 0.056
2.508ArgVal: 2.508 ± 0.063
0.321ArgTrp: 0.321 ± 0.025
2.117ArgTyr: 2.117 ± 0.064
0.0ArgXaa: 0.0 ± 0.0
Ser
3.986SerAla: 3.986 ± 0.087
0.896SerCys: 0.896 ± 0.041
3.54SerAsp: 3.54 ± 0.078
3.468SerGlu: 3.468 ± 0.074
3.038SerPhe: 3.038 ± 0.068
4.534SerGly: 4.534 ± 0.092
1.217SerHis: 1.217 ± 0.043
4.635SerIle: 4.635 ± 0.106
3.724SerLys: 3.724 ± 0.072
5.658SerLeu: 5.658 ± 0.106
1.995SerMet: 1.995 ± 0.058
2.357SerAsn: 2.357 ± 0.073
1.415SerPro: 1.415 ± 0.041
1.981SerGln: 1.981 ± 0.058
2.511SerArg: 2.511 ± 0.055
3.571SerSer: 3.571 ± 0.086
2.761SerThr: 2.761 ± 0.068
3.932SerVal: 3.932 ± 0.08
0.545SerTrp: 0.545 ± 0.028
2.522SerTyr: 2.522 ± 0.071
0.0SerXaa: 0.0 ± 0.0
Thr
4.158ThrAla: 4.158 ± 0.089
0.721ThrCys: 0.721 ± 0.036
2.669ThrAsp: 2.669 ± 0.065
2.492ThrGlu: 2.492 ± 0.059
2.446ThrPhe: 2.446 ± 0.067
3.468ThrGly: 3.468 ± 0.081
1.072ThrHis: 1.072 ± 0.036
4.41ThrIle: 4.41 ± 0.098
3.3ThrLys: 3.3 ± 0.074
5.195ThrLeu: 5.195 ± 0.091
1.702ThrMet: 1.702 ± 0.046
1.98ThrAsn: 1.98 ± 0.059
1.943ThrPro: 1.943 ± 0.06
1.535ThrGln: 1.535 ± 0.049
1.841ThrArg: 1.841 ± 0.048
3.187ThrSer: 3.187 ± 0.077
2.479ThrThr: 2.479 ± 0.068
3.889ThrVal: 3.889 ± 0.071
0.464ThrTrp: 0.464 ± 0.026
2.305ThrTyr: 2.305 ± 0.067
0.0ThrXaa: 0.0 ± 0.0
Val
4.073ValAla: 4.073 ± 0.099
1.176ValCys: 1.176 ± 0.042
3.611ValAsp: 3.611 ± 0.08
3.899ValGlu: 3.899 ± 0.088
2.885ValPhe: 2.885 ± 0.073
3.596ValGly: 3.596 ± 0.083
1.373ValHis: 1.373 ± 0.051
5.607ValIle: 5.607 ± 0.104
4.64ValLys: 4.64 ± 0.098
6.726ValLeu: 6.726 ± 0.112
2.277ValMet: 2.277 ± 0.063
3.069ValAsn: 3.069 ± 0.067
2.003ValPro: 2.003 ± 0.064
2.083ValGln: 2.083 ± 0.053
2.591ValArg: 2.591 ± 0.067
4.473ValSer: 4.473 ± 0.09
3.497ValThr: 3.497 ± 0.074
4.319ValVal: 4.319 ± 0.093
0.52ValTrp: 0.52 ± 0.027
2.716ValTyr: 2.716 ± 0.063
0.0ValXaa: 0.0 ± 0.0
Trp
0.405TrpAla: 0.405 ± 0.025
0.131TrpCys: 0.131 ± 0.02
0.393TrpAsp: 0.393 ± 0.022
0.493TrpGlu: 0.493 ± 0.024
0.39TrpPhe: 0.39 ± 0.023
0.486TrpGly: 0.486 ± 0.029
0.188TrpHis: 0.188 ± 0.018
0.749TrpIle: 0.749 ± 0.037
0.696TrpLys: 0.696 ± 0.029
0.883TrpLeu: 0.883 ± 0.041
0.356TrpMet: 0.356 ± 0.026
0.461TrpAsn: 0.461 ± 0.027
0.196TrpPro: 0.196 ± 0.016
0.306TrpGln: 0.306 ± 0.022
0.274TrpArg: 0.274 ± 0.022
0.397TrpSer: 0.397 ± 0.024
0.364TrpThr: 0.364 ± 0.028
0.484TrpVal: 0.484 ± 0.026
0.081TrpTrp: 0.081 ± 0.011
0.355TrpTyr: 0.355 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.941TyrAla: 2.941 ± 0.071
0.559TyrCys: 0.559 ± 0.032
2.726TyrAsp: 2.726 ± 0.067
3.191TyrGlu: 3.191 ± 0.064
1.95TyrPhe: 1.95 ± 0.064
2.698TyrGly: 2.698 ± 0.068
1.267TyrHis: 1.267 ± 0.047
2.792TyrIle: 2.792 ± 0.066
1.921TyrLys: 1.921 ± 0.056
4.175TyrLeu: 4.175 ± 0.085
1.22TyrMet: 1.22 ± 0.038
1.515TyrAsn: 1.515 ± 0.054
1.516TyrPro: 1.516 ± 0.053
2.26TyrGln: 2.26 ± 0.065
2.142TyrArg: 2.142 ± 0.059
2.118TyrSer: 2.118 ± 0.056
2.146TyrThr: 2.146 ± 0.058
2.611TyrVal: 2.611 ± 0.067
0.344TyrTrp: 0.344 ± 0.022
1.956TyrTyr: 1.956 ± 0.064
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2223 proteins (679347 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level
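A protein-level bootstrap of this kind can be sketched as follows. The exact procedure behind the table is not described here, so this is an assumption-laden illustration: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation across resamples is taken as the error. The function name `bootstrap_dipeptide_error` and its parameters are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide_error(sequences, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Proteins, not individual residues, are the resampling unit.
    n_resamples must be at least 2 for the standard deviation to be defined.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
mean, err = bootstrap_dipeptide_error(["MKLLA", "MKLLA", "AAAA"], "LL")
print(f"LeuLeu: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins keeps residues from the same protein together, so the error reflects variation between proteins rather than treating every dipeptide as an independent observation.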

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski