Amino acid dipepetide frequency for Firmicutes bacterium CAG:194

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.62AlaAla: 7.62 ± 0.16
1.18AlaCys: 1.18 ± 0.045
5.473AlaAsp: 5.473 ± 0.096
5.99AlaGlu: 5.99 ± 0.119
3.254AlaPhe: 3.254 ± 0.079
6.421AlaGly: 6.421 ± 0.108
1.165AlaHis: 1.165 ± 0.04
4.972AlaIle: 4.972 ± 0.097
5.368AlaLys: 5.368 ± 0.096
7.091AlaLeu: 7.091 ± 0.102
2.529AlaMet: 2.529 ± 0.061
2.632AlaAsn: 2.632 ± 0.065
2.013AlaPro: 2.013 ± 0.055
2.767AlaGln: 2.767 ± 0.076
2.808AlaArg: 2.808 ± 0.063
4.032AlaSer: 4.032 ± 0.086
3.649AlaThr: 3.649 ± 0.096
6.434AlaVal: 6.434 ± 0.099
0.559AlaTrp: 0.559 ± 0.025
3.348AlaTyr: 3.348 ± 0.069
0.0AlaXaa: 0.0 ± 0.0
Cys
0.961CysAla: 0.961 ± 0.038
0.291CysCys: 0.291 ± 0.023
0.842CysAsp: 0.842 ± 0.036
0.883CysGlu: 0.883 ± 0.039
0.717CysPhe: 0.717 ± 0.033
1.412CysGly: 1.412 ± 0.043
0.303CysHis: 0.303 ± 0.02
1.183CysIle: 1.183 ± 0.046
0.883CysLys: 0.883 ± 0.035
1.284CysLeu: 1.284 ± 0.043
0.531CysMet: 0.531 ± 0.027
0.587CysAsn: 0.587 ± 0.03
0.557CysPro: 0.557 ± 0.03
0.444CysGln: 0.444 ± 0.025
0.672CysArg: 0.672 ± 0.034
0.891CysSer: 0.891 ± 0.033
0.635CysThr: 0.635 ± 0.028
1.079CysVal: 1.079 ± 0.034
0.109CysTrp: 0.109 ± 0.011
0.6CysTyr: 0.6 ± 0.031
0.0CysXaa: 0.0 ± 0.0
Asp
4.946AspAla: 4.946 ± 0.095
0.84AspCys: 0.84 ± 0.036
2.896AspAsp: 2.896 ± 0.086
4.455AspGlu: 4.455 ± 0.092
2.533AspPhe: 2.533 ± 0.06
4.518AspGly: 4.518 ± 0.102
1.043AspHis: 1.043 ± 0.039
4.285AspIle: 4.285 ± 0.082
3.52AspLys: 3.52 ± 0.084
4.872AspLeu: 4.872 ± 0.079
1.933AspMet: 1.933 ± 0.051
2.054AspAsn: 2.054 ± 0.05
1.931AspPro: 1.931 ± 0.055
1.639AspGln: 1.639 ± 0.053
2.515AspArg: 2.515 ± 0.063
3.027AspSer: 3.027 ± 0.07
3.365AspThr: 3.365 ± 0.076
3.864AspVal: 3.864 ± 0.074
0.532AspTrp: 0.532 ± 0.031
2.878AspTyr: 2.878 ± 0.065
0.0AspXaa: 0.0 ± 0.0
Glu
5.855GluAla: 5.855 ± 0.11
0.711GluCys: 0.711 ± 0.029
4.338GluAsp: 4.338 ± 0.086
7.081GluGlu: 7.081 ± 0.143
2.114GluPhe: 2.114 ± 0.056
4.109GluGly: 4.109 ± 0.076
1.223GluHis: 1.223 ± 0.04
5.564GluIle: 5.564 ± 0.085
7.256GluLys: 7.256 ± 0.122
6.286GluLeu: 6.286 ± 0.094
2.438GluMet: 2.438 ± 0.062
4.095GluAsn: 4.095 ± 0.077
1.838GluPro: 1.838 ± 0.057
3.402GluGln: 3.402 ± 0.087
3.011GluArg: 3.011 ± 0.072
3.297GluSer: 3.297 ± 0.07
4.109GluThr: 4.109 ± 0.082
4.266GluVal: 4.266 ± 0.087
0.524GluTrp: 0.524 ± 0.028
2.714GluTyr: 2.714 ± 0.067
0.0GluXaa: 0.0 ± 0.0
Phe
3.204PheAla: 3.204 ± 0.069
0.744PheCys: 0.744 ± 0.036
2.452PheAsp: 2.452 ± 0.064
2.459PheGlu: 2.459 ± 0.06
1.855PhePhe: 1.855 ± 0.065
2.857PheGly: 2.857 ± 0.068
0.834PheHis: 0.834 ± 0.034
2.569PheIle: 2.569 ± 0.069
1.785PheLys: 1.785 ± 0.054
4.08PheLeu: 4.08 ± 0.098
1.227PheMet: 1.227 ± 0.038
1.239PheAsn: 1.239 ± 0.046
1.255PhePro: 1.255 ± 0.046
1.234PheGln: 1.234 ± 0.047
1.577PheArg: 1.577 ± 0.048
2.713PheSer: 2.713 ± 0.072
2.371PheThr: 2.371 ± 0.064
2.848PheVal: 2.848 ± 0.066
0.375PheTrp: 0.375 ± 0.02
1.793PheTyr: 1.793 ± 0.053
0.0PheXaa: 0.0 ± 0.0
Gly
4.858GlyAla: 4.858 ± 0.103
1.246GlyCys: 1.246 ± 0.044
3.458GlyAsp: 3.458 ± 0.066
4.519GlyGlu: 4.519 ± 0.078
3.006GlyPhe: 3.006 ± 0.067
4.41GlyGly: 4.41 ± 0.114
1.185GlyHis: 1.185 ± 0.04
6.111GlyIle: 6.111 ± 0.109
5.561GlyLys: 5.561 ± 0.096
5.629GlyLeu: 5.629 ± 0.102
2.474GlyMet: 2.474 ± 0.057
3.152GlyAsn: 3.152 ± 0.081
1.166GlyPro: 1.166 ± 0.043
2.515GlyGln: 2.515 ± 0.062
2.945GlyArg: 2.945 ± 0.066
4.206GlySer: 4.206 ± 0.085
4.082GlyThr: 4.082 ± 0.087
4.61GlyVal: 4.61 ± 0.084
0.609GlyTrp: 0.609 ± 0.035
3.366GlyTyr: 3.366 ± 0.071
0.0GlyXaa: 0.0 ± 0.0
His
1.213HisAla: 1.213 ± 0.041
0.288HisCys: 0.288 ± 0.02
0.947HisAsp: 0.947 ± 0.04
1.024HisGlu: 1.024 ± 0.038
0.883HisPhe: 0.883 ± 0.038
1.209HisGly: 1.209 ± 0.044
0.363HisHis: 0.363 ± 0.025
1.385HisIle: 1.385 ± 0.044
1.133HisLys: 1.133 ± 0.043
1.432HisLeu: 1.432 ± 0.041
0.639HisMet: 0.639 ± 0.028
0.681HisAsn: 0.681 ± 0.033
0.802HisPro: 0.802 ± 0.036
0.478HisGln: 0.478 ± 0.022
0.673HisArg: 0.673 ± 0.033
0.913HisSer: 0.913 ± 0.037
1.037HisThr: 1.037 ± 0.038
1.179HisVal: 1.179 ± 0.04
0.126HisTrp: 0.126 ± 0.013
0.783HisTyr: 0.783 ± 0.033
0.0HisXaa: 0.0 ± 0.0
Ile
5.964IleAla: 5.964 ± 0.098
1.449IleCys: 1.449 ± 0.045
4.118IleAsp: 4.118 ± 0.09
4.682IleGlu: 4.682 ± 0.084
2.68IlePhe: 2.68 ± 0.072
5.255IleGly: 5.255 ± 0.114
1.313IleHis: 1.313 ± 0.043
4.713IleIle: 4.713 ± 0.101
3.948IleLys: 3.948 ± 0.079
7.015IleLeu: 7.015 ± 0.131
1.957IleMet: 1.957 ± 0.058
2.664IleAsn: 2.664 ± 0.063
2.985IlePro: 2.985 ± 0.062
2.025IleGln: 2.025 ± 0.053
3.769IleArg: 3.769 ± 0.078
4.678IleSer: 4.678 ± 0.097
4.387IleThr: 4.387 ± 0.08
5.018IleVal: 5.018 ± 0.091
0.633IleTrp: 0.633 ± 0.034
2.937IleTyr: 2.937 ± 0.061
0.001IleXaa: 0.001 ± 0.001
Lys
5.312LysAla: 5.312 ± 0.089
0.74LysCys: 0.74 ± 0.037
3.955LysAsp: 3.955 ± 0.085
7.013LysGlu: 7.013 ± 0.114
1.759LysPhe: 1.759 ± 0.048
4.069LysGly: 4.069 ± 0.081
0.98LysHis: 0.98 ± 0.033
4.703LysIle: 4.703 ± 0.083
6.903LysLys: 6.903 ± 0.14
5.884LysLeu: 5.884 ± 0.094
2.207LysMet: 2.207 ± 0.059
3.618LysAsn: 3.618 ± 0.069
2.106LysPro: 2.106 ± 0.063
3.014LysGln: 3.014 ± 0.072
2.764LysArg: 2.764 ± 0.065
3.197LysSer: 3.197 ± 0.069
3.955LysThr: 3.955 ± 0.083
4.505LysVal: 4.505 ± 0.086
0.595LysTrp: 0.595 ± 0.031
2.793LysTyr: 2.793 ± 0.066
0.0LysXaa: 0.0 ± 0.0
Leu
7.359LeuAla: 7.359 ± 0.118
1.599LeuCys: 1.599 ± 0.05
4.885LeuAsp: 4.885 ± 0.08
5.523LeuGlu: 5.523 ± 0.101
4.062LeuPhe: 4.062 ± 0.113
5.714LeuGly: 5.714 ± 0.099
1.729LeuHis: 1.729 ± 0.051
5.892LeuIle: 5.892 ± 0.108
5.726LeuLys: 5.726 ± 0.09
10.099LeuLeu: 10.099 ± 0.205
2.531LeuMet: 2.531 ± 0.061
3.212LeuAsn: 3.212 ± 0.067
3.703LeuPro: 3.703 ± 0.083
3.733LeuGln: 3.733 ± 0.083
3.549LeuArg: 3.549 ± 0.071
6.481LeuSer: 6.481 ± 0.102
5.724LeuThr: 5.724 ± 0.104
5.726LeuVal: 5.726 ± 0.096
0.73LeuTrp: 0.73 ± 0.026
3.729LeuTyr: 3.729 ± 0.082
0.0LeuXaa: 0.0 ± 0.0
Met
2.591MetAla: 2.591 ± 0.062
0.352MetCys: 0.352 ± 0.022
1.81MetAsp: 1.81 ± 0.044
2.607MetGlu: 2.607 ± 0.063
0.95MetPhe: 0.95 ± 0.041
2.062MetGly: 2.062 ± 0.052
0.58MetHis: 0.58 ± 0.029
2.298MetIle: 2.298 ± 0.057
2.368MetLys: 2.368 ± 0.067
2.933MetLeu: 2.933 ± 0.066
0.913MetMet: 0.913 ± 0.036
1.416MetAsn: 1.416 ± 0.04
1.176MetPro: 1.176 ± 0.047
1.637MetGln: 1.637 ± 0.056
1.231MetArg: 1.231 ± 0.042
1.654MetSer: 1.654 ± 0.044
1.842MetThr: 1.842 ± 0.052
1.899MetVal: 1.899 ± 0.047
0.249MetTrp: 0.249 ± 0.017
0.879MetTyr: 0.879 ± 0.035
0.0MetXaa: 0.0 ± 0.0
Asn
3.372AsnAla: 3.372 ± 0.065
0.616AsnCys: 0.616 ± 0.033
2.011AsnAsp: 2.011 ± 0.054
2.65AsnGlu: 2.65 ± 0.054
1.386AsnPhe: 1.386 ± 0.046
3.521AsnGly: 3.521 ± 0.079
0.722AsnHis: 0.722 ± 0.031
3.061AsnIle: 3.061 ± 0.074
2.557AsnLys: 2.557 ± 0.058
3.504AsnLeu: 3.504 ± 0.07
1.365AsnMet: 1.365 ± 0.044
1.637AsnAsn: 1.637 ± 0.066
1.766AsnPro: 1.766 ± 0.047
1.349AsnGln: 1.349 ± 0.047
1.961AsnArg: 1.961 ± 0.051
2.067AsnSer: 2.067 ± 0.063
2.282AsnThr: 2.282 ± 0.055
2.818AsnVal: 2.818 ± 0.067
0.368AsnTrp: 0.368 ± 0.022
1.723AsnTyr: 1.723 ± 0.052
0.0AsnXaa: 0.0 ± 0.0
Pro
2.662ProAla: 2.662 ± 0.073
0.438ProCys: 0.438 ± 0.027
2.375ProAsp: 2.375 ± 0.06
2.772ProGlu: 2.772 ± 0.071
1.42ProPhe: 1.42 ± 0.045
2.188ProGly: 2.188 ± 0.049
0.534ProHis: 0.534 ± 0.027
2.043ProIle: 2.043 ± 0.05
1.909ProLys: 1.909 ± 0.052
2.639ProLeu: 2.639 ± 0.064
0.787ProMet: 0.787 ± 0.035
1.049ProAsn: 1.049 ± 0.037
0.61ProPro: 0.61 ± 0.03
1.269ProGln: 1.269 ± 0.051
0.903ProArg: 0.903 ± 0.038
1.668ProSer: 1.668 ± 0.06
1.421ProThr: 1.421 ± 0.047
3.031ProVal: 3.031 ± 0.059
0.267ProTrp: 0.267 ± 0.019
1.542ProTyr: 1.542 ± 0.052
0.0ProXaa: 0.0 ± 0.0
Gln
3.022GlnAla: 3.022 ± 0.08
0.317GlnCys: 0.317 ± 0.02
1.77GlnAsp: 1.77 ± 0.049
3.091GlnGlu: 3.091 ± 0.067
1.174GlnPhe: 1.174 ± 0.042
2.024GlnGly: 2.024 ± 0.05
0.54GlnHis: 0.54 ± 0.027
3.124GlnIle: 3.124 ± 0.069
3.349GlnLys: 3.349 ± 0.072
3.325GlnLeu: 3.325 ± 0.073
1.326GlnMet: 1.326 ± 0.049
1.77GlnAsn: 1.77 ± 0.056
1.03GlnPro: 1.03 ± 0.04
1.696GlnGln: 1.696 ± 0.068
1.299GlnArg: 1.299 ± 0.044
1.77GlnSer: 1.77 ± 0.054
2.337GlnThr: 2.337 ± 0.067
2.249GlnVal: 2.249 ± 0.056
0.216GlnTrp: 0.216 ± 0.017
1.517GlnTyr: 1.517 ± 0.049
0.0GlnXaa: 0.0 ± 0.0
Arg
2.643ArgAla: 2.643 ± 0.057
0.551ArgCys: 0.551 ± 0.028
2.055ArgAsp: 2.055 ± 0.058
3.499ArgGlu: 3.499 ± 0.073
1.63ArgPhe: 1.63 ± 0.048
2.254ArgGly: 2.254 ± 0.062
0.656ArgHis: 0.656 ± 0.032
3.467ArgIle: 3.467 ± 0.076
3.425ArgLys: 3.425 ± 0.07
3.674ArgLeu: 3.674 ± 0.077
1.535ArgMet: 1.535 ± 0.047
1.863ArgAsn: 1.863 ± 0.052
1.144ArgPro: 1.144 ± 0.043
1.854ArgGln: 1.854 ± 0.058
1.977ArgArg: 1.977 ± 0.057
2.138ArgSer: 2.138 ± 0.06
2.063ArgThr: 2.063 ± 0.054
2.469ArgVal: 2.469 ± 0.052
0.274ArgTrp: 0.274 ± 0.019
1.775ArgTyr: 1.775 ± 0.051
0.0ArgXaa: 0.0 ± 0.0
Ser
4.472SerAla: 4.472 ± 0.089
0.823SerCys: 0.823 ± 0.035
3.482SerAsp: 3.482 ± 0.079
3.428SerGlu: 3.428 ± 0.078
2.605SerPhe: 2.605 ± 0.062
4.502SerGly: 4.502 ± 0.082
0.998SerHis: 0.998 ± 0.037
3.907SerIle: 3.907 ± 0.084
3.173SerLys: 3.173 ± 0.066
5.192SerLeu: 5.192 ± 0.099
1.99SerMet: 1.99 ± 0.056
2.02SerAsn: 2.02 ± 0.054
1.561SerPro: 1.561 ± 0.049
1.797SerGln: 1.797 ± 0.053
2.316SerArg: 2.316 ± 0.053
3.224SerSer: 3.224 ± 0.072
2.681SerThr: 2.681 ± 0.058
4.446SerVal: 4.446 ± 0.076
0.457SerTrp: 0.457 ± 0.028
2.849SerTyr: 2.849 ± 0.064
0.0SerXaa: 0.0 ± 0.0
Thr
4.693ThrAla: 4.693 ± 0.084
0.694ThrCys: 0.694 ± 0.032
3.808ThrAsp: 3.808 ± 0.084
4.333ThrGlu: 4.333 ± 0.084
2.228ThrPhe: 2.228 ± 0.054
4.923ThrGly: 4.923 ± 0.093
0.875ThrHis: 0.875 ± 0.035
4.1ThrIle: 4.1 ± 0.091
3.651ThrLys: 3.651 ± 0.074
4.796ThrLeu: 4.796 ± 0.097
1.561ThrMet: 1.561 ± 0.047
1.992ThrAsn: 1.992 ± 0.055
1.996ThrPro: 1.996 ± 0.062
1.696ThrGln: 1.696 ± 0.054
1.799ThrArg: 1.799 ± 0.053
2.806ThrSer: 2.806 ± 0.071
3.072ThrThr: 3.072 ± 0.083
4.667ThrVal: 4.667 ± 0.096
0.426ThrTrp: 0.426 ± 0.029
2.392ThrTyr: 2.392 ± 0.059
0.0ThrXaa: 0.0 ± 0.0
Val
4.967ValAla: 4.967 ± 0.079
1.264ValCys: 1.264 ± 0.039
3.797ValAsp: 3.797 ± 0.068
4.397ValGlu: 4.397 ± 0.076
3.006ValPhe: 3.006 ± 0.078
4.099ValGly: 4.099 ± 0.074
1.09ValHis: 1.09 ± 0.035
5.266ValIle: 5.266 ± 0.084
4.527ValLys: 4.527 ± 0.089
7.001ValLeu: 7.001 ± 0.103
2.072ValMet: 2.072 ± 0.058
2.768ValAsn: 2.768 ± 0.06
2.266ValPro: 2.266 ± 0.06
2.233ValGln: 2.233 ± 0.055
2.828ValArg: 2.828 ± 0.058
4.768ValSer: 4.768 ± 0.086
4.611ValThr: 4.611 ± 0.085
4.741ValVal: 4.741 ± 0.088
0.529ValTrp: 0.529 ± 0.03
2.924ValTyr: 2.924 ± 0.069
0.0ValXaa: 0.0 ± 0.0
Trp
0.465TrpAla: 0.465 ± 0.028
0.122TrpCys: 0.122 ± 0.012
0.458TrpAsp: 0.458 ± 0.027
0.532TrpGlu: 0.532 ± 0.028
0.324TrpPhe: 0.324 ± 0.023
0.61TrpGly: 0.61 ± 0.032
0.17TrpHis: 0.17 ± 0.018
0.641TrpIle: 0.641 ± 0.03
0.651TrpLys: 0.651 ± 0.029
0.799TrpLeu: 0.799 ± 0.034
0.233TrpMet: 0.233 ± 0.016
0.465TrpAsn: 0.465 ± 0.025
0.176TrpPro: 0.176 ± 0.015
0.414TrpGln: 0.414 ± 0.023
0.305TrpArg: 0.305 ± 0.021
0.385TrpSer: 0.385 ± 0.022
0.354TrpThr: 0.354 ± 0.022
0.411TrpVal: 0.411 ± 0.023
0.092TrpTrp: 0.092 ± 0.011
0.362TrpTyr: 0.362 ± 0.025
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.124TyrAla: 3.124 ± 0.066
0.597TyrCys: 0.597 ± 0.03
2.714TyrAsp: 2.714 ± 0.07
3.296TyrGlu: 3.296 ± 0.059
1.886TyrPhe: 1.886 ± 0.044
2.951TyrGly: 2.951 ± 0.071
0.885TyrHis: 0.885 ± 0.039
2.907TyrIle: 2.907 ± 0.06
2.379TyrLys: 2.379 ± 0.062
4.171TyrLeu: 4.171 ± 0.087
1.197TyrMet: 1.197 ± 0.041
1.821TyrAsn: 1.821 ± 0.052
1.446TyrPro: 1.446 ± 0.051
1.747TyrGln: 1.747 ± 0.053
2.045TyrArg: 2.045 ± 0.058
2.009TyrSer: 2.009 ± 0.057
2.531TyrThr: 2.531 ± 0.073
2.895TyrVal: 2.895 ± 0.063
0.3TyrTrp: 0.3 ± 0.024
2.16TyrTyr: 2.16 ± 0.055
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2310 proteins (763414 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski