Amino acid dipeptide frequency for Firmicutes bacterium CAG:212

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
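As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. The function name, the toy sequences and the use of one-letter codes are assumptions made for this example; the exact counting procedure used by the database is not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Pairs are overlapping and never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (one-letter codes), not real data.
toy_proteome = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

In this toy input there are six pairs in total, and MK occurs twice, so it accounts for one third of all dipeptides.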

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
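The expectation above is simple arithmetic, sketched here together with a comparison against it. The `classify` helper and its strict greater-than/less-than rule are assumptions for illustration; the page does not state the exact threshold used for the colouring. The two observed values are taken from the table for this proteome.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / (20 * 20) of all dipeptides, in per mille.
expected_permille = 1000.0 / N_STANDARD**2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


print(expected_permille)           # uniform expectation in per mille
print("LeuLeu", classify(8.029))   # value from the table
print("TrpTrp", classify(0.085))   # value from the table
```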

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Column order (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.533 ± 0.122
AlaCys: 1.11 ± 0.039
AlaAsp: 4.154 ± 0.074
AlaGlu: 5.246 ± 0.096
AlaPhe: 2.939 ± 0.077
AlaGly: 5.732 ± 0.095
AlaHis: 1.147 ± 0.042
AlaIle: 5.722 ± 0.101
AlaLys: 5.315 ± 0.094
AlaLeu: 6.376 ± 0.1
AlaMet: 2.515 ± 0.063
AlaAsn: 2.775 ± 0.072
AlaPro: 1.765 ± 0.056
AlaGln: 2.265 ± 0.067
AlaArg: 2.622 ± 0.063
AlaSer: 3.68 ± 0.084
AlaThr: 3.453 ± 0.083
AlaVal: 5.85 ± 0.088
AlaTrp: 0.533 ± 0.033
AlaTyr: 2.793 ± 0.067
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.042 ± 0.04
CysCys: 0.241 ± 0.02
CysAsp: 0.798 ± 0.033
CysGlu: 1.069 ± 0.039
CysPhe: 0.596 ± 0.029
CysGly: 1.494 ± 0.046
CysHis: 0.275 ± 0.021
CysIle: 1.166 ± 0.045
CysLys: 0.908 ± 0.038
CysLeu: 1.085 ± 0.039
CysMet: 0.484 ± 0.025
CysAsn: 0.64 ± 0.032
CysPro: 0.608 ± 0.033
CysGln: 0.426 ± 0.024
CysArg: 0.551 ± 0.024
CysSer: 0.827 ± 0.032
CysThr: 0.783 ± 0.032
CysVal: 1.107 ± 0.041
CysTrp: 0.116 ± 0.014
CysTyr: 0.576 ± 0.028
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.266 ± 0.089
AspCys: 0.782 ± 0.04
AspAsp: 2.846 ± 0.075
AspGlu: 4.977 ± 0.087
AspPhe: 2.441 ± 0.05
AspGly: 4.076 ± 0.092
AspHis: 0.804 ± 0.041
AspIle: 4.712 ± 0.094
AspLys: 3.772 ± 0.068
AspLeu: 4.37 ± 0.074
AspMet: 1.866 ± 0.051
AspAsn: 2.088 ± 0.06
AspPro: 1.586 ± 0.045
AspGln: 1.324 ± 0.044
AspArg: 1.924 ± 0.061
AspSer: 3.015 ± 0.079
AspThr: 3.148 ± 0.076
AspVal: 4.182 ± 0.091
AspTrp: 0.514 ± 0.028
AspTyr: 2.664 ± 0.077
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.576 ± 0.108
GluCys: 0.92 ± 0.04
GluAsp: 4.451 ± 0.081
GluGlu: 8.114 ± 0.135
GluPhe: 2.677 ± 0.059
GluGly: 4.394 ± 0.092
GluHis: 1.601 ± 0.052
GluIle: 6.147 ± 0.115
GluLys: 7.337 ± 0.098
GluLeu: 7.093 ± 0.101
GluMet: 2.662 ± 0.067
GluAsn: 4.511 ± 0.089
GluPro: 1.701 ± 0.056
GluGln: 3.388 ± 0.089
GluArg: 3.294 ± 0.068
GluSer: 3.384 ± 0.079
GluThr: 3.775 ± 0.081
GluVal: 5.037 ± 0.11
GluTrp: 0.636 ± 0.03
GluTyr: 3.457 ± 0.072
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.884 ± 0.073
PheCys: 0.73 ± 0.033
PheAsp: 2.466 ± 0.064
PheGlu: 2.787 ± 0.068
PhePhe: 1.726 ± 0.059
PheGly: 3.107 ± 0.085
PheHis: 0.88 ± 0.036
PheIle: 2.901 ± 0.074
PheLys: 2.1 ± 0.057
PheLeu: 3.547 ± 0.083
PheMet: 1.221 ± 0.045
PheAsn: 1.415 ± 0.048
PhePro: 1.416 ± 0.046
PheGln: 1.365 ± 0.044
PheArg: 1.525 ± 0.049
PheSer: 2.563 ± 0.073
PheThr: 2.175 ± 0.065
PheVal: 2.939 ± 0.078
PheTrp: 0.362 ± 0.023
PheTyr: 1.646 ± 0.05
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.887 ± 0.095
GlyCys: 1.244 ± 0.048
GlyAsp: 3.388 ± 0.072
GlyGlu: 4.778 ± 0.088
GlyPhe: 3.06 ± 0.069
GlyGly: 4.718 ± 0.11
GlyHis: 1.243 ± 0.038
GlyIle: 6.47 ± 0.121
GlyLys: 5.552 ± 0.09
GlyLeu: 5.616 ± 0.113
GlyMet: 2.59 ± 0.069
GlyAsn: 3.272 ± 0.077
GlyPro: 1.344 ± 0.04
GlyGln: 2.036 ± 0.054
GlyArg: 2.748 ± 0.07
GlySer: 3.738 ± 0.082
GlyThr: 4.165 ± 0.083
GlyVal: 5.226 ± 0.088
GlyTrp: 0.64 ± 0.038
GlyTyr: 3.189 ± 0.06
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.238 ± 0.049
HisCys: 0.342 ± 0.026
HisAsp: 0.911 ± 0.041
HisGlu: 1.095 ± 0.045
HisPhe: 0.808 ± 0.036
HisGly: 1.319 ± 0.046
HisHis: 0.455 ± 0.035
HisIle: 1.58 ± 0.05
HisLys: 1.035 ± 0.04
HisLeu: 1.534 ± 0.049
HisMet: 0.601 ± 0.025
HisAsn: 0.755 ± 0.033
HisPro: 0.872 ± 0.035
HisGln: 0.549 ± 0.028
HisArg: 0.676 ± 0.031
HisSer: 0.96 ± 0.038
HisThr: 0.976 ± 0.036
HisVal: 1.269 ± 0.041
HisTrp: 0.134 ± 0.016
HisTyr: 0.77 ± 0.034
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.087 ± 0.102
IleCys: 1.492 ± 0.054
IleAsp: 4.179 ± 0.088
IleGlu: 5.56 ± 0.109
IlePhe: 3.105 ± 0.076
IleGly: 5.788 ± 0.102
IleHis: 1.44 ± 0.046
IleIle: 5.729 ± 0.124
IleLys: 4.575 ± 0.086
IleLeu: 7.467 ± 0.104
IleMet: 2.184 ± 0.061
IleAsn: 2.989 ± 0.069
IlePro: 3.216 ± 0.067
IleGln: 2.899 ± 0.074
IleArg: 3.384 ± 0.069
IleSer: 4.952 ± 0.091
IleThr: 4.13 ± 0.089
IleVal: 5.601 ± 0.099
IleTrp: 0.627 ± 0.032
IleTyr: 2.971 ± 0.075
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.972 ± 0.095
LysCys: 0.766 ± 0.034
LysAsp: 3.967 ± 0.075
LysGlu: 7.798 ± 0.12
LysPhe: 2.151 ± 0.057
LysGly: 4.121 ± 0.073
LysHis: 1.151 ± 0.044
LysIle: 5.304 ± 0.091
LysLys: 6.826 ± 0.099
LysLeu: 5.776 ± 0.092
LysMet: 2.563 ± 0.061
LysAsn: 3.818 ± 0.089
LysPro: 2.044 ± 0.053
LysGln: 2.758 ± 0.071
LysArg: 3.12 ± 0.071
LysSer: 3.391 ± 0.067
LysThr: 3.995 ± 0.076
LysVal: 4.871 ± 0.082
LysTrp: 0.618 ± 0.032
LysTyr: 3.052 ± 0.07
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.334 ± 0.099
LeuCys: 1.43 ± 0.045
LeuAsp: 5.12 ± 0.096
LeuGlu: 6.646 ± 0.103
LeuPhe: 3.543 ± 0.095
LeuGly: 5.982 ± 0.113
LeuHis: 1.562 ± 0.053
LeuIle: 6.408 ± 0.112
LeuLys: 6.518 ± 0.091
LeuLeu: 8.029 ± 0.144
LeuMet: 2.687 ± 0.066
LeuAsn: 4.121 ± 0.095
LeuPro: 3.239 ± 0.069
LeuGln: 3.066 ± 0.069
LeuArg: 3.281 ± 0.081
LeuSer: 5.589 ± 0.094
LeuThr: 4.781 ± 0.09
LeuVal: 5.335 ± 0.098
LeuTrp: 0.648 ± 0.032
LeuTyr: 3.2 ± 0.08
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.35 ± 0.057
MetCys: 0.424 ± 0.027
MetAsp: 1.91 ± 0.051
MetGlu: 2.643 ± 0.058
MetPhe: 1.053 ± 0.047
MetGly: 2.175 ± 0.067
MetHis: 0.536 ± 0.032
MetIle: 2.388 ± 0.062
MetLys: 2.848 ± 0.053
MetLeu: 2.993 ± 0.07
MetMet: 1.072 ± 0.046
MetAsn: 1.786 ± 0.055
MetPro: 1.148 ± 0.046
MetGln: 1.199 ± 0.046
MetArg: 1.19 ± 0.047
MetSer: 1.914 ± 0.051
MetThr: 1.923 ± 0.051
MetVal: 1.947 ± 0.057
MetTrp: 0.209 ± 0.018
MetTyr: 1.04 ± 0.044
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.127 ± 0.066
AsnCys: 0.635 ± 0.027
AsnAsp: 2.139 ± 0.059
AsnGlu: 3.158 ± 0.079
AsnPhe: 1.521 ± 0.055
AsnGly: 3.425 ± 0.076
AsnHis: 0.898 ± 0.04
AsnIle: 3.593 ± 0.072
AsnLys: 2.951 ± 0.072
AsnLeu: 4.002 ± 0.079
AsnMet: 1.584 ± 0.046
AsnAsn: 1.829 ± 0.053
AsnPro: 2.086 ± 0.058
AsnGln: 1.595 ± 0.051
AsnArg: 1.982 ± 0.056
AsnSer: 2.256 ± 0.058
AsnThr: 2.425 ± 0.055
AsnVal: 3.132 ± 0.069
AsnTrp: 0.42 ± 0.021
AsnTyr: 1.938 ± 0.063
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.095 ± 0.058
ProCys: 0.399 ± 0.027
ProAsp: 2.039 ± 0.052
ProGlu: 3.091 ± 0.087
ProPhe: 1.455 ± 0.055
ProGly: 2.237 ± 0.055
ProHis: 0.562 ± 0.03
ProIle: 2.365 ± 0.06
ProLys: 2.029 ± 0.056
ProLeu: 2.624 ± 0.066
ProMet: 0.934 ± 0.033
ProAsn: 1.29 ± 0.042
ProPro: 0.664 ± 0.033
ProGln: 0.984 ± 0.036
ProArg: 0.864 ± 0.036
ProSer: 1.739 ± 0.047
ProThr: 1.665 ± 0.054
ProVal: 2.643 ± 0.061
ProTrp: 0.268 ± 0.021
ProTyr: 1.468 ± 0.05
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.387 ± 0.06
GlnCys: 0.383 ± 0.027
GlnAsp: 1.723 ± 0.048
GlnGlu: 2.756 ± 0.064
GlnPhe: 1.235 ± 0.042
GlnGly: 1.988 ± 0.065
GlnHis: 0.557 ± 0.035
GlnIle: 3.076 ± 0.07
GlnLys: 2.957 ± 0.073
GlnLeu: 2.939 ± 0.068
GlnMet: 1.327 ± 0.048
GlnAsn: 1.79 ± 0.06
GlnPro: 0.808 ± 0.038
GlnGln: 1.263 ± 0.055
GlnArg: 1.3 ± 0.044
GlnSer: 1.655 ± 0.053
GlnThr: 1.783 ± 0.059
GlnVal: 2.3 ± 0.059
GlnTrp: 0.299 ± 0.021
GlnTyr: 1.477 ± 0.049
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.541 ± 0.057
ArgCys: 0.515 ± 0.029
ArgAsp: 2.026 ± 0.055
ArgGlu: 3.286 ± 0.078
ArgPhe: 1.705 ± 0.045
ArgGly: 2.26 ± 0.061
ArgHis: 0.73 ± 0.031
ArgIle: 3.208 ± 0.072
ArgLys: 3.282 ± 0.068
ArgLeu: 3.295 ± 0.073
ArgMet: 1.464 ± 0.046
ArgAsn: 1.917 ± 0.05
ArgPro: 1.206 ± 0.048
ArgGln: 1.508 ± 0.053
ArgArg: 1.861 ± 0.065
ArgSer: 1.801 ± 0.053
ArgThr: 2.122 ± 0.06
ArgVal: 2.614 ± 0.068
ArgTrp: 0.308 ± 0.022
ArgTyr: 1.664 ± 0.048
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.936 ± 0.073
SerCys: 0.823 ± 0.037
SerAsp: 3.094 ± 0.075
SerGlu: 3.973 ± 0.077
SerPhe: 2.372 ± 0.059
SerGly: 4.525 ± 0.083
SerHis: 0.936 ± 0.038
SerIle: 4.112 ± 0.084
SerLys: 3.752 ± 0.07
SerLeu: 4.612 ± 0.096
SerMet: 1.764 ± 0.055
SerAsn: 2.189 ± 0.058
SerPro: 1.606 ± 0.05
SerGln: 1.695 ± 0.057
SerArg: 2.257 ± 0.067
SerSer: 3.226 ± 0.098
SerThr: 2.758 ± 0.068
SerVal: 4.139 ± 0.088
SerTrp: 0.504 ± 0.032
SerTyr: 2.34 ± 0.062
SerXaa: 0.001 ± 0.002
Thr
ThrAla: 4.121 ± 0.097
ThrCys: 0.694 ± 0.031
ThrAsp: 3.117 ± 0.064
ThrGlu: 4.056 ± 0.072
ThrPhe: 2.312 ± 0.06
ThrGly: 4.369 ± 0.084
ThrHis: 0.906 ± 0.036
ThrIle: 4.379 ± 0.081
ThrLys: 3.535 ± 0.074
ThrLeu: 4.856 ± 0.082
ThrMet: 1.661 ± 0.048
ThrAsn: 2.175 ± 0.061
ThrPro: 1.997 ± 0.057
ThrGln: 1.424 ± 0.049
ThrArg: 1.855 ± 0.05
ThrSer: 2.824 ± 0.071
ThrThr: 3.004 ± 0.081
ThrVal: 4.03 ± 0.077
ThrTrp: 0.426 ± 0.03
ThrTyr: 2.061 ± 0.062
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.971 ± 0.088
ValCys: 1.134 ± 0.041
ValAsp: 3.986 ± 0.075
ValGlu: 5.282 ± 0.093
ValPhe: 2.883 ± 0.069
ValGly: 4.734 ± 0.087
ValHis: 1.153 ± 0.045
ValIle: 5.523 ± 0.109
ValLys: 4.635 ± 0.097
ValLeu: 6.754 ± 0.116
ValMet: 2.126 ± 0.061
ValAsn: 2.929 ± 0.065
ValPro: 2.466 ± 0.065
ValGln: 2.3 ± 0.062
ValArg: 2.75 ± 0.071
ValSer: 4.472 ± 0.082
ValThr: 4.14 ± 0.079
ValVal: 5.199 ± 0.096
ValTrp: 0.577 ± 0.031
ValTyr: 2.667 ± 0.058
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.501 ± 0.03
TrpCys: 0.141 ± 0.016
TrpAsp: 0.501 ± 0.03
TrpGlu: 0.589 ± 0.029
TrpPhe: 0.343 ± 0.024
TrpGly: 0.567 ± 0.028
TrpHis: 0.15 ± 0.013
TrpIle: 0.62 ± 0.03
TrpLys: 0.714 ± 0.036
TrpLeu: 0.782 ± 0.036
TrpMet: 0.32 ± 0.024
TrpAsn: 0.508 ± 0.029
TrpPro: 0.215 ± 0.019
TrpGln: 0.311 ± 0.023
TrpArg: 0.305 ± 0.021
TrpSer: 0.409 ± 0.025
TrpThr: 0.381 ± 0.025
TrpVal: 0.481 ± 0.028
TrpTrp: 0.085 ± 0.013
TrpTyr: 0.339 ± 0.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.677 ± 0.057
TyrCys: 0.592 ± 0.03
TyrAsp: 2.541 ± 0.067
TyrGlu: 3.25 ± 0.072
TyrPhe: 1.832 ± 0.05
TyrGly: 2.895 ± 0.064
TyrHis: 0.894 ± 0.034
TyrIle: 2.971 ± 0.071
TyrLys: 2.421 ± 0.056
TyrLeu: 3.769 ± 0.084
TyrMet: 1.122 ± 0.042
TyrAsn: 1.896 ± 0.06
TyrPro: 1.424 ± 0.048
TyrGln: 1.651 ± 0.062
TyrArg: 1.824 ± 0.054
TyrSer: 2.217 ± 0.056
TyrThr: 2.217 ± 0.059
TyrVal: 2.78 ± 0.061
TyrTrp: 0.337 ± 0.023
TyrTyr: 1.833 ± 0.058
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.002
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.003 ± 0.003
Statistics based on 2078 proteins (679161 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
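A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. This is a minimal sketch under stated assumptions: the helper names, the fixed seed, the toy sequences and the use of the sample standard deviation as the error measure are all choices made for this example, not details confirmed by the page.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not real data.
toy_proteome = ["MKKLA", "MKA", "AAKK", "MLLKK"]
print(bootstrap_error(toy_proteome, "KK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins contribute a correspondingly larger error.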

You can download the dipeptide statistics above (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski