Amino acid dipeptide frequency for Bacteroidales bacterium KA00344

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
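A minimal sketch of how such a frequency could be computed is given here. It assumes that overlapping adjacent pairs are counted within each protein and that pairs never span two proteins; the exact counting procedure used for this page is not stated, and the function name and toy sequences are hypothetical.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; no pair spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: hypothetical sequences in one-letter code.
toy = ["MKLLA", "MALL"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The frequencies of all observed dipeptides add up to 1000‰ by construction, which is a convenient sanity check on any implementation.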

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
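The uniform expectation and the over/under-representation rule can be sketched as follows. The sketch assumes that the colour threshold is the uniform value 1/400; the page does not state the exact cut-off, and the `classify` helper is hypothetical. The example values are taken from the table.

```python
N_STANDARD = 20

# Uniform expectation for one of the 20 * 20 dipeptides, in per mille.
expected_per_mille = 1000.0 / N_STANDARD ** 2

def classify(freq_per_mille, threshold=expected_per_mille):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if freq_per_mille > threshold:
        return "over"    # would be marked in red
    if freq_per_mille < threshold:
        return "under"   # would be marked in blue
    return "as expected"

# Values in per mille, copied from the table.
examples = {"LeuLeu": 8.168, "AlaAla": 5.475, "CysCys": 0.201, "TrpTrp": 0.21}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_per_mille, labels)
```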

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
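As a short worked example of the unit conversion, the snippet below converts the AlaAla value to a fraction and to a percentage. The estimate of the underlying count is only a rough illustration: it assumes that the number of counted pairs equals the number of amino acids minus the number of proteins (880993 - 2869), which the page does not confirm.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0

ala_ala = 5.475  # AlaAla frequency in per mille, from the table
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)

# Assumption: one pair per adjacent residue position inside each protein.
assumed_pairs = 880993 - 2869
approx_count = round(fraction * assumed_pairs)
print(fraction, percent, approx_count)
```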

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency (‰) ± bootstrap error, grouped by first residue.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.475 ± 0.105
AlaCys: 0.901 ± 0.034
AlaAsp: 4.532 ± 0.066
AlaGlu: 4.418 ± 0.082
AlaPhe: 3.124 ± 0.066
AlaGly: 4.77 ± 0.077
AlaHis: 1.352 ± 0.039
AlaIle: 4.729 ± 0.084
AlaLys: 4.526 ± 0.078
AlaLeu: 6.543 ± 0.109
AlaMet: 2.277 ± 0.049
AlaAsn: 3.39 ± 0.072
AlaPro: 2.255 ± 0.052
AlaGln: 2.832 ± 0.048
AlaArg: 3.247 ± 0.066
AlaSer: 4.15 ± 0.082
AlaThr: 4.03 ± 0.068
AlaVal: 4.738 ± 0.082
AlaTrp: 0.868 ± 0.033
AlaTyr: 2.868 ± 0.054
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.806 ± 0.031
CysCys: 0.201 ± 0.014
CysAsp: 0.726 ± 0.03
CysGlu: 0.697 ± 0.028
CysPhe: 0.623 ± 0.024
CysGly: 1.048 ± 0.034
CysHis: 0.344 ± 0.018
CysIle: 0.83 ± 0.034
CysLys: 0.754 ± 0.027
CysLeu: 1.091 ± 0.035
CysMet: 0.316 ± 0.02
CysAsn: 0.68 ± 0.027
CysPro: 0.532 ± 0.03
CysGln: 0.384 ± 0.021
CysArg: 0.656 ± 0.028
CysSer: 0.8 ± 0.031
CysThr: 0.659 ± 0.025
CysVal: 0.838 ± 0.034
CysTrp: 0.176 ± 0.014
CysTyr: 0.578 ± 0.026
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.232 ± 0.078
AspCys: 0.669 ± 0.027
AspAsp: 3.135 ± 0.065
AspGlu: 3.981 ± 0.083
AspPhe: 3.09 ± 0.07
AspGly: 4.398 ± 0.076
AspHis: 1.093 ± 0.037
AspIle: 4.086 ± 0.07
AspLys: 3.973 ± 0.069
AspLeu: 4.376 ± 0.069
AspMet: 1.682 ± 0.05
AspAsn: 3.194 ± 0.064
AspPro: 1.859 ± 0.055
AspGln: 1.376 ± 0.042
AspArg: 2.755 ± 0.056
AspSer: 3.09 ± 0.079
AspThr: 2.907 ± 0.054
AspVal: 3.85 ± 0.066
AspTrp: 0.852 ± 0.033
AspTyr: 2.866 ± 0.064
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.573 ± 0.087
GluCys: 0.569 ± 0.024
GluAsp: 3.258 ± 0.061
GluGlu: 4.26 ± 0.09
GluPhe: 2.219 ± 0.052
GluGly: 3.792 ± 0.062
GluHis: 1.359 ± 0.041
GluIle: 3.986 ± 0.083
GluLys: 4.436 ± 0.089
GluLeu: 5.304 ± 0.089
GluMet: 1.953 ± 0.049
GluAsn: 3.232 ± 0.057
GluPro: 1.691 ± 0.045
GluGln: 2.684 ± 0.057
GluArg: 3.293 ± 0.058
GluSer: 2.824 ± 0.049
GluThr: 3.236 ± 0.055
GluVal: 3.796 ± 0.069
GluTrp: 0.739 ± 0.03
GluTyr: 2.384 ± 0.055
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.095 ± 0.064
PheCys: 0.771 ± 0.031
PheAsp: 2.908 ± 0.063
PheGlu: 2.355 ± 0.052
PhePhe: 2.17 ± 0.063
PheGly: 3.253 ± 0.067
PheHis: 0.938 ± 0.033
PheIle: 2.96 ± 0.068
PheLys: 2.543 ± 0.059
PheLeu: 3.804 ± 0.075
PheMet: 1.297 ± 0.042
PheAsn: 2.347 ± 0.062
PhePro: 1.646 ± 0.042
PheGln: 1.263 ± 0.037
PheArg: 2.153 ± 0.053
PheSer: 3.258 ± 0.068
PheThr: 2.674 ± 0.063
PheVal: 3.181 ± 0.061
PheTrp: 0.546 ± 0.028
PheTyr: 1.949 ± 0.049
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.394 ± 0.089
GlyCys: 0.972 ± 0.032
GlyAsp: 3.581 ± 0.066
GlyGlu: 3.841 ± 0.063
GlyPhe: 3.114 ± 0.056
GlyGly: 4.529 ± 0.096
GlyHis: 1.429 ± 0.043
GlyIle: 4.93 ± 0.083
GlyLys: 5.352 ± 0.079
GlyLeu: 5.649 ± 0.085
GlyMet: 2.171 ± 0.053
GlyAsn: 3.564 ± 0.07
GlyPro: 1.267 ± 0.038
GlyGln: 2.346 ± 0.051
GlyArg: 3.47 ± 0.071
GlySer: 3.918 ± 0.078
GlyThr: 4.086 ± 0.071
GlyVal: 4.809 ± 0.084
GlyTrp: 0.978 ± 0.04
GlyTyr: 3.24 ± 0.074
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.266 ± 0.038
HisCys: 0.321 ± 0.018
HisAsp: 1.17 ± 0.039
HisGlu: 1.149 ± 0.036
HisPhe: 1.072 ± 0.034
HisGly: 1.418 ± 0.041
HisHis: 0.628 ± 0.026
HisIle: 1.615 ± 0.047
HisLys: 1.173 ± 0.038
HisLeu: 1.902 ± 0.05
HisMet: 0.384 ± 0.022
HisAsn: 1.137 ± 0.035
HisPro: 1.04 ± 0.035
HisGln: 0.793 ± 0.028
HisArg: 1.153 ± 0.032
HisSer: 1.196 ± 0.043
HisThr: 1.192 ± 0.037
HisVal: 1.269 ± 0.038
HisTrp: 0.26 ± 0.021
HisTyr: 0.986 ± 0.037
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.991 ± 0.087
IleCys: 0.974 ± 0.033
IleAsp: 4.648 ± 0.077
IleGlu: 3.932 ± 0.075
IlePhe: 2.795 ± 0.06
IleGly: 4.58 ± 0.081
IleHis: 1.282 ± 0.035
IleIle: 4.72 ± 0.094
IleLys: 4.055 ± 0.071
IleLeu: 5.368 ± 0.11
IleMet: 1.712 ± 0.048
IleAsn: 3.565 ± 0.064
IlePro: 2.93 ± 0.055
IleGln: 2.1 ± 0.047
IleArg: 3.303 ± 0.059
IleSer: 4.425 ± 0.078
IleThr: 3.855 ± 0.08
IleVal: 4.57 ± 0.088
IleTrp: 0.72 ± 0.028
IleTyr: 2.618 ± 0.058
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.068 ± 0.086
LysCys: 0.589 ± 0.027
LysAsp: 3.829 ± 0.075
LysGlu: 4.558 ± 0.088
LysPhe: 2.383 ± 0.05
LysGly: 4.375 ± 0.076
LysHis: 1.502 ± 0.042
LysIle: 4.266 ± 0.072
LysLys: 5.009 ± 0.09
LysLeu: 5.448 ± 0.08
LysMet: 2.241 ± 0.056
LysAsn: 3.539 ± 0.065
LysPro: 2.397 ± 0.052
LysGln: 2.867 ± 0.065
LysArg: 3.317 ± 0.06
LysSer: 3.511 ± 0.069
LysThr: 3.936 ± 0.058
LysVal: 4.201 ± 0.072
LysTrp: 0.792 ± 0.032
LysTyr: 2.994 ± 0.057
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.208 ± 0.087
LeuCys: 1.428 ± 0.041
LeuAsp: 4.669 ± 0.078
LeuGlu: 4.247 ± 0.079
LeuPhe: 4.168 ± 0.088
LeuGly: 5.538 ± 0.088
LeuHis: 1.902 ± 0.045
LeuIle: 5.106 ± 0.079
LeuLys: 5.902 ± 0.085
LeuLeu: 8.168 ± 0.144
LeuMet: 2.65 ± 0.066
LeuAsn: 4.585 ± 0.077
LeuPro: 3.822 ± 0.069
LeuGln: 3.43 ± 0.075
LeuArg: 4.494 ± 0.065
LeuSer: 6.548 ± 0.101
LeuThr: 5.106 ± 0.073
LeuVal: 5.076 ± 0.093
LeuTrp: 1.033 ± 0.039
LeuTyr: 3.469 ± 0.064
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.381 ± 0.047
MetCys: 0.283 ± 0.018
MetAsp: 1.506 ± 0.044
MetGlu: 1.774 ± 0.051
MetPhe: 1.145 ± 0.036
MetGly: 2.047 ± 0.054
MetHis: 0.546 ± 0.025
MetIle: 1.638 ± 0.047
MetLys: 2.558 ± 0.056
MetLeu: 2.824 ± 0.071
MetMet: 1.128 ± 0.043
MetAsn: 1.591 ± 0.04
MetPro: 1.296 ± 0.038
MetGln: 1.299 ± 0.035
MetArg: 1.583 ± 0.049
MetSer: 1.738 ± 0.043
MetThr: 1.761 ± 0.04
MetVal: 1.783 ± 0.042
MetTrp: 0.285 ± 0.019
MetTyr: 0.86 ± 0.031
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.667 ± 0.07
AsnCys: 0.575 ± 0.028
AsnAsp: 2.863 ± 0.059
AsnGlu: 2.89 ± 0.052
AsnPhe: 2.31 ± 0.049
AsnGly: 4.07 ± 0.081
AsnHis: 1.098 ± 0.034
AsnIle: 4.091 ± 0.084
AsnLys: 3.39 ± 0.061
AsnLeu: 4.337 ± 0.082
AsnMet: 1.472 ± 0.036
AsnAsn: 2.891 ± 0.066
AsnPro: 2.337 ± 0.051
AsnGln: 1.588 ± 0.04
AsnArg: 2.649 ± 0.062
AsnSer: 2.86 ± 0.068
AsnThr: 2.901 ± 0.066
AsnVal: 3.437 ± 0.078
AsnTrp: 0.662 ± 0.029
AsnTyr: 2.37 ± 0.063
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.454 ± 0.048
ProCys: 0.388 ± 0.02
ProAsp: 2.376 ± 0.059
ProGlu: 2.833 ± 0.055
ProPhe: 1.787 ± 0.046
ProGly: 2.151 ± 0.055
ProHis: 0.773 ± 0.034
ProIle: 2.431 ± 0.058
ProLys: 2.246 ± 0.047
ProLeu: 3.132 ± 0.056
ProMet: 1.103 ± 0.035
ProAsn: 1.905 ± 0.042
ProPro: 0.717 ± 0.029
ProGln: 1.452 ± 0.041
ProArg: 1.419 ± 0.042
ProSer: 2.259 ± 0.05
ProThr: 2.274 ± 0.053
ProVal: 2.566 ± 0.058
ProTrp: 0.423 ± 0.025
ProTyr: 1.709 ± 0.05
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.478 ± 0.053
GlnCys: 0.339 ± 0.018
GlnAsp: 1.745 ± 0.048
GlnGlu: 2.241 ± 0.067
GlnPhe: 1.444 ± 0.037
GlnGly: 2.257 ± 0.049
GlnHis: 0.816 ± 0.031
GlnIle: 2.422 ± 0.057
GlnLys: 2.602 ± 0.062
GlnLeu: 3.423 ± 0.079
GlnMet: 1.168 ± 0.037
GlnAsn: 1.921 ± 0.054
GlnPro: 1.32 ± 0.035
GlnGln: 1.911 ± 0.059
GlnArg: 2.026 ± 0.05
GlnSer: 2.056 ± 0.051
GlnThr: 2.173 ± 0.042
GlnVal: 2.157 ± 0.054
GlnTrp: 0.54 ± 0.026
GlnTyr: 1.555 ± 0.042
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.044 ± 0.054
ArgCys: 0.547 ± 0.023
ArgAsp: 2.479 ± 0.056
ArgGlu: 3.135 ± 0.063
ArgPhe: 2.295 ± 0.05
ArgGly: 2.817 ± 0.057
ArgHis: 1.266 ± 0.04
ArgIle: 3.447 ± 0.065
ArgLys: 3.558 ± 0.071
ArgLeu: 4.804 ± 0.087
ArgMet: 1.724 ± 0.049
ArgAsn: 2.765 ± 0.056
ArgPro: 1.721 ± 0.044
ArgGln: 2.175 ± 0.055
ArgArg: 2.815 ± 0.068
ArgSer: 2.62 ± 0.054
ArgThr: 2.52 ± 0.055
ArgVal: 2.82 ± 0.062
ArgTrp: 0.692 ± 0.029
ArgTyr: 2.426 ± 0.054
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.202 ± 0.079
SerCys: 0.846 ± 0.03
SerAsp: 3.252 ± 0.056
SerGlu: 3.331 ± 0.063
SerPhe: 3.043 ± 0.062
SerGly: 4.245 ± 0.091
SerHis: 1.244 ± 0.039
SerIle: 4.097 ± 0.058
SerLys: 3.689 ± 0.068
SerLeu: 5.548 ± 0.092
SerMet: 1.743 ± 0.041
SerAsn: 2.938 ± 0.056
SerPro: 2.297 ± 0.058
SerGln: 2.044 ± 0.047
SerArg: 2.882 ± 0.063
SerSer: 3.883 ± 0.087
SerThr: 3.385 ± 0.063
SerVal: 4.148 ± 0.071
SerTrp: 0.768 ± 0.029
SerTyr: 2.734 ± 0.071
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.165 ± 0.062
ThrCys: 0.61 ± 0.024
ThrAsp: 3.554 ± 0.064
ThrGlu: 3.129 ± 0.052
ThrPhe: 2.746 ± 0.066
ThrGly: 4.089 ± 0.084
ThrHis: 1.115 ± 0.034
ThrIle: 4.051 ± 0.071
ThrLys: 3.119 ± 0.052
ThrLeu: 5.456 ± 0.081
ThrMet: 1.502 ± 0.043
ThrAsn: 2.632 ± 0.058
ThrPro: 2.756 ± 0.055
ThrGln: 1.865 ± 0.044
ThrArg: 2.292 ± 0.053
ThrSer: 3.42 ± 0.078
ThrThr: 3.43 ± 0.074
ThrVal: 4.03 ± 0.075
ThrTrp: 0.746 ± 0.03
ThrTyr: 2.435 ± 0.065
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.787 ± 0.089
ValCys: 1.015 ± 0.035
ValAsp: 3.781 ± 0.079
ValGlu: 3.9 ± 0.081
ValPhe: 2.946 ± 0.066
ValGly: 4.316 ± 0.086
ValHis: 1.091 ± 0.037
ValIle: 4.126 ± 0.074
ValLys: 4.405 ± 0.075
ValLeu: 5.505 ± 0.083
ValMet: 1.923 ± 0.051
ValAsn: 3.24 ± 0.064
ValPro: 2.52 ± 0.058
ValGln: 2.059 ± 0.048
ValArg: 3.186 ± 0.07
ValSer: 4.521 ± 0.076
ValThr: 3.734 ± 0.071
ValVal: 4.782 ± 0.09
ValTrp: 0.761 ± 0.031
ValTyr: 2.678 ± 0.057
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.751 ± 0.032
TrpCys: 0.168 ± 0.015
TrpAsp: 0.732 ± 0.03
TrpGlu: 0.644 ± 0.029
TrpPhe: 0.53 ± 0.023
TrpGly: 0.96 ± 0.03
TrpHis: 0.276 ± 0.018
TrpIle: 0.753 ± 0.03
TrpLys: 0.874 ± 0.031
TrpLeu: 1.229 ± 0.038
TrpMet: 0.437 ± 0.02
TrpAsn: 0.807 ± 0.034
TrpPro: 0.318 ± 0.02
TrpGln: 0.583 ± 0.025
TrpArg: 0.622 ± 0.029
TrpSer: 0.759 ± 0.03
TrpThr: 0.713 ± 0.032
TrpVal: 0.697 ± 0.028
TrpTrp: 0.21 ± 0.017
TrpTyr: 0.515 ± 0.031
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.988 ± 0.061
TyrCys: 0.573 ± 0.029
TyrAsp: 2.72 ± 0.064
TyrGlu: 2.272 ± 0.049
TyrPhe: 2.111 ± 0.054
TyrGly: 2.989 ± 0.072
TyrHis: 1.073 ± 0.039
TyrIle: 2.856 ± 0.058
TyrLys: 2.658 ± 0.06
TyrLeu: 3.637 ± 0.068
TyrMet: 1.142 ± 0.038
TyrAsn: 2.537 ± 0.063
TyrPro: 1.677 ± 0.041
TyrGln: 1.54 ± 0.04
TyrArg: 2.354 ± 0.053
TyrSer: 2.485 ± 0.065
TyrThr: 2.569 ± 0.065
TyrVal: 2.528 ± 0.059
TyrTrp: 0.527 ± 0.025
TyrTyr: 2.101 ± 0.059
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2869 proteins (880993 amino acids)
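For readers who want to work with the listed values, the following sketch parses entries such as `AlaAla: 5.475 ± 0.105` into a dictionary. The regular expression and the helper name are assumptions made for illustration, not part of Proteome-pI; `search` is used so that any extra characters before the dipeptide name are tolerated, and lines without an entry (such as the residue headers) are skipped.

```python
import re

# Two three-letter residue codes, a value, and its error.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")

def parse_entries(lines):
    """Map (first, second) residue codes to (frequency, error) in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match:
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table

# A few lines copied from the listing above, including a residue header.
sample = ["Ala", "AlaAla: 5.475 ± 0.105", "AlaCys: 0.901 ± 0.034", "CysCys: 0.201 ± 0.014"]
table = parse_entries(sample)

# Sum of the parsed entries whose first residue is Ala.
ala_sum = sum(value for (first, _), (value, _) in table.items() if first == "Ala")
print(table, ala_sum)
```

Applied to the full listing, the parsed frequencies would be expected to add up to roughly 1000‰, up to rounding of the individual values.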

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
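Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values serves as the error. This is an illustration under stated assumptions only; the use of the standard deviation, the fixed seed, the function names, and the toy sequences are all hypothetical, and the database's actual procedure may differ.

```python
import random
import statistics

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins (not residues) with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(resampled, pair))
    return statistics.stdev(estimates)

# Toy proteome: hypothetical sequences in one-letter code.
toy = ["MKLLA", "MALL", "MKKLL", "MAAAK"]
point = pair_frequency(toy, "LL")
error = bootstrap_error(toy, "LL")
print(f"LL: {point:.3f} ± {error:.3f}")
```

Resampling at the protein level keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every replicate.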

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski