Amino acid dipeptide frequency for Dialister micraerophilus DSM 19965

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
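As an illustration of what a dipeptide frequency is, the following is a minimal sketch that counts overlapping pairs of adjacent residues within each protein and normalises by the total number of pairs. The function name `dipeptide_per_mille`, the toy sequences, and the use of one-letter codes are assumptions made for the example; the exact counting procedure used for the table on this page is not described here.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Each sequence is scanned separately, so a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy proteome (one-letter codes), not taken from the real data.
toy_proteome = ["MKKLA", "MAKK"]
freqs = dipeptide_per_mille(toy_proteome)
print(sorted(freqs.items()))
```

In this toy input there are seven pairs in total and "KK" occurs twice, so its frequency is 2/7 of all pairs.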

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins (with all amino acids equally frequent), each of the 400 standard dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
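The arithmetic behind the expected value and the units can be sketched as follows. The three example values are copied from the table on this page; the `classify` helper and the choice of the uniform 2.5‰ baseline as the threshold are assumptions, since the page does not state exactly how the red and blue markings are decided.

```python
N_STANDARD = 20

n_dipeptides = N_STANDARD ** 2             # 400 ordered pairs
n_with_xaa = (N_STANDARD + 1) ** 2         # 441 when Xaa is included
uniform_percent = 100 / n_dipeptides       # 0.25 (%)
uniform_per_mille = 1000 / n_dipeptides    # 2.5 (per mille)

def classify(per_mille, baseline=uniform_per_mille):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    if per_mille > baseline:
        return "over"
    if per_mille < baseline:
        return "under"
    return "expected"

# Values in per mille, taken from the table.
examples = {"LysLys": 9.065, "CysCys": 0.133, "AspAsp": 2.36}
labels = {pair: classify(value) for pair, value in examples.items()}
fractions = {pair: value * 1e-3 for pair, value in examples.items()}
print(labels)
```

Under this assumed baseline, LysLys is overrepresented while CysCys and AspAsp are underrepresented.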

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.257 ± 0.165
AlaCys: 0.736 ± 0.048
AlaAsp: 3.41 ± 0.109
AlaGlu: 4.356 ± 0.107
AlaPhe: 3.03 ± 0.11
AlaGly: 5.05 ± 0.135
AlaHis: 1.068 ± 0.05
AlaIle: 5.897 ± 0.144
AlaLys: 5.151 ± 0.102
AlaLeu: 6.402 ± 0.138
AlaMet: 2.118 ± 0.09
AlaAsn: 2.761 ± 0.096
AlaPro: 1.741 ± 0.067
AlaGln: 2.121 ± 0.082
AlaArg: 2.647 ± 0.084
AlaSer: 3.737 ± 0.121
AlaThr: 3.229 ± 0.099
AlaVal: 5.448 ± 0.155
AlaTrp: 0.478 ± 0.04
AlaTyr: 2.416 ± 0.08
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.739 ± 0.047
CysCys: 0.133 ± 0.021
CysAsp: 0.651 ± 0.044
CysGlu: 0.797 ± 0.045
CysPhe: 0.492 ± 0.037
CysGly: 1.087 ± 0.053
CysHis: 0.229 ± 0.026
CysIle: 1.098 ± 0.063
CysLys: 0.951 ± 0.048
CysLeu: 0.904 ± 0.046
CysMet: 0.351 ± 0.029
CysAsn: 0.609 ± 0.05
CysPro: 0.396 ± 0.032
CysGln: 0.202 ± 0.021
CysArg: 0.449 ± 0.039
CysSer: 0.76 ± 0.052
CysThr: 0.582 ± 0.046
CysVal: 0.776 ± 0.046
CysTrp: 0.085 ± 0.014
CysTyr: 0.431 ± 0.03
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.42 ± 0.104
AspCys: 0.611 ± 0.045
AspAsp: 2.36 ± 0.093
AspGlu: 4.311 ± 0.129
AspPhe: 2.615 ± 0.087
AspGly: 3.606 ± 0.116
AspHis: 0.683 ± 0.037
AspIle: 4.898 ± 0.12
AspLys: 4.481 ± 0.121
AspLeu: 4.627 ± 0.123
AspMet: 1.642 ± 0.07
AspAsn: 2.222 ± 0.083
AspPro: 1.39 ± 0.065
AspGln: 0.959 ± 0.046
AspArg: 1.765 ± 0.073
AspSer: 2.732 ± 0.09
AspThr: 2.466 ± 0.081
AspVal: 3.697 ± 0.121
AspTrp: 0.462 ± 0.032
AspTyr: 2.092 ± 0.086
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.26 ± 0.128
GluCys: 0.718 ± 0.042
GluAsp: 3.508 ± 0.095
GluGlu: 6.171 ± 0.167
GluPhe: 2.761 ± 0.079
GluGly: 4.101 ± 0.109
GluHis: 1.209 ± 0.068
GluIle: 6.987 ± 0.165
GluLys: 8.515 ± 0.2
GluLeu: 6.227 ± 0.152
GluMet: 2.275 ± 0.081
GluAsn: 5.536 ± 0.137
GluPro: 1.52 ± 0.055
GluGln: 1.884 ± 0.072
GluArg: 3.149 ± 0.093
GluSer: 3.474 ± 0.104
GluThr: 3.58 ± 0.101
GluVal: 4.271 ± 0.13
GluTrp: 0.561 ± 0.044
GluTyr: 2.501 ± 0.097
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.812 ± 0.091
PheCys: 0.68 ± 0.047
PheAsp: 2.434 ± 0.079
PheGlu: 2.745 ± 0.079
PhePhe: 2.179 ± 0.108
PheGly: 3.046 ± 0.104
PheHis: 0.765 ± 0.047
PheIle: 4.165 ± 0.116
PheLys: 3.173 ± 0.091
PheLeu: 4.032 ± 0.132
PheMet: 1.316 ± 0.067
PheAsn: 1.972 ± 0.08
PhePro: 1.422 ± 0.059
PheGln: 0.882 ± 0.045
PheArg: 1.488 ± 0.061
PheSer: 3.218 ± 0.112
PheThr: 2.413 ± 0.086
PheVal: 2.838 ± 0.088
PheTrp: 0.425 ± 0.039
PheTyr: 1.757 ± 0.073
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.755 ± 0.127
GlyCys: 0.858 ± 0.055
GlyAsp: 3.24 ± 0.115
GlyGlu: 4.292 ± 0.111
GlyPhe: 2.892 ± 0.097
GlyGly: 4.584 ± 0.133
GlyHis: 1.286 ± 0.067
GlyIle: 6.74 ± 0.18
GlyLys: 6.471 ± 0.136
GlyLeu: 5.501 ± 0.132
GlyMet: 2.137 ± 0.063
GlyAsn: 3.452 ± 0.111
GlyPro: 1.401 ± 0.073
GlyGln: 1.781 ± 0.062
GlyArg: 2.767 ± 0.093
GlySer: 3.875 ± 0.101
GlyThr: 3.912 ± 0.123
GlyVal: 4.662 ± 0.141
GlyTrp: 0.555 ± 0.045
GlyTyr: 2.777 ± 0.084
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.201 ± 0.057
HisCys: 0.231 ± 0.027
HisAsp: 0.824 ± 0.046
HisGlu: 1.215 ± 0.072
HisPhe: 0.755 ± 0.043
HisGly: 1.204 ± 0.055
HisHis: 0.404 ± 0.034
HisIle: 1.573 ± 0.06
HisLys: 1.432 ± 0.071
HisLeu: 1.531 ± 0.066
HisMet: 0.545 ± 0.042
HisAsn: 0.837 ± 0.042
HisPro: 0.773 ± 0.049
HisGln: 0.431 ± 0.029
HisArg: 0.702 ± 0.042
HisSer: 0.957 ± 0.056
HisThr: 0.893 ± 0.044
HisVal: 1.172 ± 0.062
HisTrp: 0.167 ± 0.022
HisTyr: 0.598 ± 0.037
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.325 ± 0.156
IleCys: 1.329 ± 0.064
IleAsp: 4.502 ± 0.141
IleGlu: 5.836 ± 0.135
IlePhe: 3.973 ± 0.145
IleGly: 6.081 ± 0.16
IleHis: 1.704 ± 0.078
IleIle: 7.293 ± 0.186
IleLys: 6.881 ± 0.161
IleLeu: 8.22 ± 0.169
IleMet: 2.291 ± 0.082
IleAsn: 4.499 ± 0.13
IlePro: 3.715 ± 0.098
IleGln: 2.158 ± 0.071
IleArg: 3.346 ± 0.105
IleSer: 6.493 ± 0.146
IleThr: 4.704 ± 0.124
IleVal: 5.414 ± 0.144
IleTrp: 0.611 ± 0.033
IleTyr: 3.327 ± 0.099
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.212 ± 0.138
LysCys: 0.741 ± 0.047
LysAsp: 4.922 ± 0.131
LysGlu: 8.218 ± 0.197
LysPhe: 2.945 ± 0.091
LysGly: 5.074 ± 0.123
LysHis: 1.432 ± 0.067
LysIle: 7.734 ± 0.167
LysLys: 9.065 ± 0.19
LysLeu: 6.926 ± 0.146
LysMet: 2.578 ± 0.096
LysAsn: 6.211 ± 0.18
LysPro: 2.177 ± 0.063
LysGln: 2.416 ± 0.086
LysArg: 3.545 ± 0.106
LysSer: 4.832 ± 0.129
LysThr: 4.866 ± 0.12
LysVal: 5.031 ± 0.137
LysTrp: 0.731 ± 0.044
LysTyr: 3.479 ± 0.104
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.036 ± 0.152
LeuCys: 1.151 ± 0.054
LeuAsp: 4.361 ± 0.118
LeuGlu: 5.871 ± 0.132
LeuPhe: 3.947 ± 0.117
LeuGly: 5.823 ± 0.142
LeuHis: 1.549 ± 0.064
LeuIle: 6.979 ± 0.161
LeuLys: 7.816 ± 0.137
LeuLeu: 8.021 ± 0.172
LeuMet: 2.294 ± 0.086
LeuAsn: 4.377 ± 0.126
LeuPro: 3.362 ± 0.095
LeuGln: 2.618 ± 0.08
LeuArg: 3.418 ± 0.11
LeuSer: 6.509 ± 0.142
LeuThr: 4.749 ± 0.116
LeuVal: 5.286 ± 0.125
LeuTrp: 0.704 ± 0.048
LeuTyr: 3.096 ± 0.092
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.089 ± 0.081
MetCys: 0.271 ± 0.025
MetAsp: 1.35 ± 0.057
MetGlu: 1.951 ± 0.075
MetPhe: 0.997 ± 0.052
MetGly: 1.813 ± 0.073
MetHis: 0.486 ± 0.037
MetIle: 2.331 ± 0.084
MetLys: 3.051 ± 0.092
MetLeu: 2.347 ± 0.086
MetMet: 0.76 ± 0.043
MetAsn: 1.762 ± 0.073
MetPro: 1.047 ± 0.057
MetGln: 1.029 ± 0.051
MetArg: 1.268 ± 0.06
MetSer: 1.871 ± 0.063
MetThr: 1.547 ± 0.072
MetVal: 1.592 ± 0.069
MetTrp: 0.183 ± 0.02
MetTyr: 1.039 ± 0.049
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.46 ± 0.119
AsnCys: 0.585 ± 0.045
AsnAsp: 2.466 ± 0.086
AsnGlu: 4.048 ± 0.122
AsnPhe: 2.145 ± 0.079
AsnGly: 3.415 ± 0.096
AsnHis: 0.832 ± 0.049
AsnIle: 5.275 ± 0.132
AsnLys: 4.733 ± 0.126
AsnLeu: 4.595 ± 0.116
AsnMet: 1.417 ± 0.065
AsnAsn: 2.565 ± 0.096
AsnPro: 2.028 ± 0.079
AsnGln: 1.324 ± 0.054
AsnArg: 1.98 ± 0.074
AsnSer: 2.995 ± 0.105
AsnThr: 2.575 ± 0.093
AsnVal: 3.338 ± 0.106
AsnTrp: 0.526 ± 0.036
AsnTyr: 1.956 ± 0.08
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.898 ± 0.074
ProCys: 0.399 ± 0.036
ProAsp: 1.757 ± 0.074
ProGlu: 2.804 ± 0.091
ProPhe: 1.72 ± 0.068
ProGly: 1.988 ± 0.09
ProHis: 0.643 ± 0.042
ProIle: 2.589 ± 0.077
ProLys: 2.036 ± 0.083
ProLeu: 2.838 ± 0.086
ProMet: 0.901 ± 0.049
ProAsn: 1.169 ± 0.051
ProPro: 0.736 ± 0.046
ProGln: 0.909 ± 0.053
ProArg: 0.896 ± 0.054
ProSer: 1.882 ± 0.062
ProThr: 1.475 ± 0.063
ProVal: 2.769 ± 0.09
ProTrp: 0.263 ± 0.027
ProTyr: 1.406 ± 0.067
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.89 ± 0.091
GlnCys: 0.258 ± 0.023
GlnAsp: 1.225 ± 0.061
GlnGlu: 1.908 ± 0.078
GlnPhe: 1.015 ± 0.05
GlnGly: 1.693 ± 0.072
GlnHis: 0.383 ± 0.034
GlnIle: 2.416 ± 0.081
GlnLys: 2.761 ± 0.083
GlnLeu: 2.27 ± 0.07
GlnMet: 0.85 ± 0.05
GlnAsn: 1.528 ± 0.069
GlnPro: 0.67 ± 0.05
GlnGln: 0.837 ± 0.061
GlnArg: 1.238 ± 0.058
GlnSer: 1.563 ± 0.064
GlnThr: 1.3 ± 0.054
GlnVal: 1.73 ± 0.078
GlnTrp: 0.218 ± 0.026
GlnTyr: 0.983 ± 0.048
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.477 ± 0.083
ArgCys: 0.423 ± 0.035
ArgAsp: 1.735 ± 0.082
ArgGlu: 2.918 ± 0.094
ArgPhe: 1.69 ± 0.074
ArgGly: 2.304 ± 0.09
ArgHis: 0.747 ± 0.043
ArgIle: 3.545 ± 0.094
ArgLys: 4.029 ± 0.125
ArgLeu: 3.293 ± 0.088
ArgMet: 1.329 ± 0.062
ArgAsn: 2.248 ± 0.077
ArgPro: 1.151 ± 0.067
ArgGln: 1.31 ± 0.06
ArgArg: 1.844 ± 0.082
ArgSer: 2.025 ± 0.076
ArgThr: 1.831 ± 0.065
ArgVal: 2.567 ± 0.086
ArgTrp: 0.314 ± 0.03
ArgTyr: 1.531 ± 0.06
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.093 ± 0.116
SerCys: 0.643 ± 0.039
SerAsp: 3.49 ± 0.107
SerGlu: 4.497 ± 0.117
SerPhe: 2.852 ± 0.097
SerGly: 5.047 ± 0.132
SerHis: 1.044 ± 0.059
SerIle: 5.19 ± 0.133
SerLys: 4.513 ± 0.103
SerLeu: 5.464 ± 0.141
SerMet: 1.565 ± 0.071
SerAsn: 2.517 ± 0.085
SerPro: 1.704 ± 0.074
SerGln: 1.597 ± 0.069
SerArg: 2.243 ± 0.08
SerSer: 3.5 ± 0.097
SerThr: 2.982 ± 0.1
SerVal: 4.757 ± 0.128
SerTrp: 0.529 ± 0.036
SerTyr: 2.214 ± 0.076
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.737 ± 0.111
ThrCys: 0.54 ± 0.038
ThrAsp: 2.868 ± 0.092
ThrGlu: 3.593 ± 0.11
ThrPhe: 2.158 ± 0.083
ThrGly: 4.271 ± 0.107
ThrHis: 0.975 ± 0.05
ThrIle: 4.111 ± 0.103
ThrLys: 3.697 ± 0.118
ThrLeu: 4.818 ± 0.12
ThrMet: 1.385 ± 0.061
ThrAsn: 2.222 ± 0.097
ThrPro: 1.879 ± 0.075
ThrGln: 1.409 ± 0.066
ThrArg: 1.874 ± 0.082
ThrSer: 2.969 ± 0.088
ThrThr: 2.578 ± 0.103
ThrVal: 4.361 ± 0.126
ThrTrp: 0.401 ± 0.034
ThrTyr: 1.805 ± 0.068
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.731 ± 0.138
ValCys: 0.874 ± 0.053
ValAsp: 3.346 ± 0.093
ValGlu: 4.531 ± 0.134
ValPhe: 3.12 ± 0.106
ValGly: 4.327 ± 0.133
ValHis: 1.268 ± 0.064
ValIle: 5.788 ± 0.127
ValLys: 5.337 ± 0.128
ValLeu: 6.081 ± 0.139
ValMet: 1.797 ± 0.073
ValAsn: 3.285 ± 0.102
ValPro: 2.403 ± 0.099
ValGln: 1.72 ± 0.073
ValArg: 2.759 ± 0.086
ValSer: 4.452 ± 0.135
ValThr: 3.628 ± 0.106
ValVal: 4.547 ± 0.128
ValTrp: 0.505 ± 0.035
ValTyr: 2.517 ± 0.093
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.476 ± 0.034
TrpCys: 0.096 ± 0.018
TrpAsp: 0.407 ± 0.033
TrpGlu: 0.518 ± 0.035
TrpPhe: 0.399 ± 0.038
TrpGly: 0.542 ± 0.037
TrpHis: 0.154 ± 0.022
TrpIle: 0.789 ± 0.051
TrpLys: 0.821 ± 0.048
TrpLeu: 0.625 ± 0.041
TrpMet: 0.245 ± 0.026
TrpAsn: 0.577 ± 0.047
TrpPro: 0.26 ± 0.028
TrpGln: 0.335 ± 0.031
TrpArg: 0.287 ± 0.027
TrpSer: 0.431 ± 0.032
TrpThr: 0.396 ± 0.036
TrpVal: 0.396 ± 0.038
TrpTrp: 0.098 ± 0.016
TrpTyr: 0.3 ± 0.03
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.187 ± 0.073
TyrCys: 0.444 ± 0.036
TyrAsp: 2.185 ± 0.089
TyrGlu: 2.628 ± 0.1
TyrPhe: 2.081 ± 0.072
TyrGly: 2.884 ± 0.095
TyrHis: 0.582 ± 0.042
TyrIle: 3.306 ± 0.091
TyrLys: 3.218 ± 0.095
TyrLeu: 3.245 ± 0.105
TyrMet: 0.986 ± 0.055
TyrAsn: 1.908 ± 0.08
TyrPro: 1.302 ± 0.06
TyrGln: 0.816 ± 0.045
TyrArg: 1.648 ± 0.076
TyrSer: 2.169 ± 0.077
TyrThr: 1.943 ± 0.086
TyrVal: 2.36 ± 0.069
TyrTrp: 0.332 ± 0.035
TyrTyr: 1.382 ± 0.064
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1242 proteins (376269 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
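A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is only an illustration under assumptions: the helper names, the toy sequences, the fixed seed, and the use of the standard deviation as the reported error are not taken from the source, which does not specify these details.

```python
import random
from collections import Counter
from statistics import stdev

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread (per mille) of one dipeptide's frequency across protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the estimates.
    return stdev(estimates)

# Hypothetical toy proteome (one-letter codes), not the real data.
toy_proteome = ["MKKLA", "MAKK", "GGKKA", "LLLL"]
error = bootstrap_error(toy_proteome, "KK")
print(f"KK bootstrap error: {error:.3f} per mille")
```

Resampling whole proteins rather than individual residue pairs keeps pairs from the same protein together, which is what "at the protein level" suggests.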

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski