Amino acid dipeptide frequency for Clostridium sp. CAG:465

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
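The counting and the comparison against the uniform expectation can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names are hypothetical, dipeptides are assumed to be counted as overlapping pairs within each protein, and any residue outside the 20 standard ones is assumed to be pooled as X (Xaa). The page does not state how its own values were computed.

```python
from collections import Counter

# The 20 standard amino acids as one-letter codes (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs (0,1), (1,2), ...; pairs never span two proteins.
        for first, second in zip(seq, seq[1:]):
            # Assumption: non-standard or unknown residues are pooled as X (Xaa).
            first = first if first in STANDARD else "X"
            second = second if second in STANDARD else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(per_mille):
    """Label a per mille value relative to the uniform expectation."""
    if per_mille > EXPECTED_PER_MILLE:
        return "over"
    if per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


def per_mille_to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3


# Made-up toy sequences, not taken from this proteome.
toy_freqs = dipeptide_per_mille(["MKKLLE", "MKK"])
```

As a worked conversion, the LysLys value of 11.492‰ corresponds to a fraction of about 0.011492, or roughly 1.15%, which is well above the 2.5‰ uniform expectation.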

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid; columns: second amino acid. Each cell: frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.507 ± 0.1 | 0.616 ± 0.048 | 2.343 ± 0.087 | 2.471 ± 0.093 | 1.963 ± 0.1 | 3.039 ± 0.101 | 0.687 ± 0.05 | 5.118 ± 0.15 | 5.139 ± 0.135 | 4.174 ± 0.152 | 1.339 ± 0.079 | 2.856 ± 0.102 | 1.112 ± 0.054 | 1.103 ± 0.064 | 1.709 ± 0.081 | 2.961 ± 0.092 | 2.886 ± 0.119 | 3.173 ± 0.116 | 0.239 ± 0.027 | 1.882 ± 0.079 | 0.0 ± 0.0
Cys | 0.639 ± 0.047 | 0.197 ± 0.03 | 0.816 ± 0.05 | 0.762 ± 0.053 | 0.613 ± 0.039 | 1.031 ± 0.071 | 0.14 ± 0.02 | 1.458 ± 0.068 | 1.207 ± 0.071 | 0.858 ± 0.051 | 0.317 ± 0.034 | 0.834 ± 0.058 | 0.463 ± 0.042 | 0.179 ± 0.024 | 0.287 ± 0.032 | 0.831 ± 0.053 | 0.586 ± 0.05 | 0.816 ± 0.046 | 0.057 ± 0.016 | 0.571 ± 0.04 | 0.0 ± 0.0
Asp | 2.444 ± 0.088 | 0.565 ± 0.044 | 3.457 ± 0.116 | 5.581 ± 0.14 | 2.552 ± 0.078 | 3.275 ± 0.112 | 0.472 ± 0.033 | 6.986 ± 0.16 | 6.421 ± 0.142 | 4.721 ± 0.131 | 1.721 ± 0.067 | 4.338 ± 0.129 | 1.097 ± 0.061 | 0.672 ± 0.048 | 1.754 ± 0.079 | 3.469 ± 0.105 | 3.072 ± 0.111 | 4.198 ± 0.13 | 0.269 ± 0.029 | 2.743 ± 0.093 | 0.0 ± 0.0
Glu | 3.278 ± 0.113 | 0.828 ± 0.052 | 4.177 ± 0.12 | 6.424 ± 0.196 | 3.012 ± 0.091 | 2.91 ± 0.098 | 0.78 ± 0.056 | 7.258 ± 0.149 | 8.907 ± 0.183 | 5.973 ± 0.151 | 1.9 ± 0.082 | 6.783 ± 0.144 | 1.282 ± 0.074 | 1.697 ± 0.077 | 2.226 ± 0.099 | 3.373 ± 0.114 | 2.833 ± 0.086 | 4.667 ± 0.12 | 0.335 ± 0.033 | 4.198 ± 0.121 | 0.0 ± 0.0
Phe | 2.011 ± 0.082 | 0.732 ± 0.049 | 2.928 ± 0.095 | 3.06 ± 0.104 | 2.056 ± 0.106 | 2.447 ± 0.091 | 0.379 ± 0.036 | 3.896 ± 0.155 | 3.854 ± 0.103 | 3.565 ± 0.107 | 1.109 ± 0.064 | 2.979 ± 0.105 | 0.977 ± 0.057 | 0.642 ± 0.043 | 1.159 ± 0.058 | 3.23 ± 0.115 | 2.053 ± 0.084 | 2.937 ± 0.092 | 0.242 ± 0.028 | 1.658 ± 0.077 | 0.0 ± 0.0
Gly | 2.904 ± 0.122 | 0.699 ± 0.057 | 2.701 ± 0.099 | 3.185 ± 0.122 | 2.357 ± 0.085 | 3.033 ± 0.118 | 0.866 ± 0.059 | 6.179 ± 0.163 | 5.716 ± 0.135 | 4.04 ± 0.108 | 1.578 ± 0.076 | 3.505 ± 0.143 | 0.938 ± 0.094 | 1.165 ± 0.06 | 1.879 ± 0.098 | 2.943 ± 0.105 | 3.188 ± 0.114 | 3.696 ± 0.108 | 0.257 ± 0.032 | 2.839 ± 0.102 | 0.0 ± 0.0
His | 0.541 ± 0.046 | 0.134 ± 0.019 | 0.708 ± 0.04 | 0.657 ± 0.051 | 0.52 ± 0.043 | 0.777 ± 0.049 | 0.26 ± 0.027 | 1.24 ± 0.065 | 0.914 ± 0.051 | 0.938 ± 0.061 | 0.326 ± 0.031 | 0.705 ± 0.051 | 0.475 ± 0.04 | 0.281 ± 0.03 | 0.406 ± 0.04 | 0.72 ± 0.049 | 0.618 ± 0.036 | 0.675 ± 0.045 | 0.051 ± 0.012 | 0.457 ± 0.036 | 0.0 ± 0.0
Ile | 5.36 ± 0.154 | 1.59 ± 0.077 | 6.57 ± 0.137 | 7.255 ± 0.162 | 4.431 ± 0.164 | 5.324 ± 0.129 | 1.079 ± 0.05 | 9.612 ± 0.241 | 9.669 ± 0.191 | 9.34 ± 0.223 | 2.241 ± 0.071 | 7.043 ± 0.158 | 3.069 ± 0.093 | 2.217 ± 0.084 | 2.988 ± 0.103 | 7.951 ± 0.179 | 5.489 ± 0.146 | 6.78 ± 0.152 | 0.457 ± 0.041 | 4.572 ± 0.13 | 0.0 ± 0.0
Lys | 4.276 ± 0.125 | 1.085 ± 0.056 | 6.394 ± 0.136 | 9.6 ± 0.2 | 3.726 ± 0.102 | 4.3 ± 0.119 | 1.016 ± 0.048 | 10.816 ± 0.177 | 11.492 ± 0.214 | 8.716 ± 0.187 | 2.907 ± 0.112 | 8.614 ± 0.186 | 1.82 ± 0.083 | 2.516 ± 0.099 | 3.484 ± 0.111 | 5.668 ± 0.124 | 4.757 ± 0.118 | 6.645 ± 0.166 | 0.511 ± 0.039 | 5.671 ± 0.148 | 0.0 ± 0.0
Leu | 4.102 ± 0.111 | 1.1 ± 0.07 | 5.546 ± 0.126 | 6.024 ± 0.143 | 3.544 ± 0.144 | 4.577 ± 0.127 | 0.908 ± 0.05 | 7.165 ± 0.187 | 9.035 ± 0.183 | 6.567 ± 0.177 | 1.882 ± 0.073 | 6.544 ± 0.166 | 2.346 ± 0.079 | 1.841 ± 0.085 | 2.647 ± 0.09 | 6.197 ± 0.158 | 4.093 ± 0.102 | 5.262 ± 0.144 | 0.463 ± 0.038 | 3.409 ± 0.106 | 0.0 ± 0.0
Met | 1.401 ± 0.08 | 0.311 ± 0.028 | 1.392 ± 0.065 | 1.715 ± 0.076 | 1.109 ± 0.062 | 1.21 ± 0.078 | 0.332 ± 0.032 | 2.139 ± 0.085 | 2.611 ± 0.084 | 2.573 ± 0.104 | 0.58 ± 0.043 | 1.643 ± 0.067 | 0.855 ± 0.052 | 0.95 ± 0.055 | 0.648 ± 0.046 | 1.56 ± 0.063 | 1.159 ± 0.048 | 1.425 ± 0.07 | 0.146 ± 0.022 | 1.386 ± 0.067 | 0.0 ± 0.0
Asn | 2.904 ± 0.095 | 0.869 ± 0.057 | 4.147 ± 0.146 | 5.127 ± 0.114 | 3.075 ± 0.1 | 3.923 ± 0.121 | 0.657 ± 0.042 | 9.328 ± 0.192 | 7.858 ± 0.17 | 6.125 ± 0.15 | 1.921 ± 0.08 | 6.316 ± 0.174 | 1.79 ± 0.065 | 1.602 ± 0.067 | 1.918 ± 0.078 | 5.178 ± 0.147 | 3.702 ± 0.119 | 4.993 ± 0.139 | 0.421 ± 0.037 | 3.194 ± 0.109 | 0.0 ± 0.0
Pro | 1.076 ± 0.053 | 0.362 ± 0.034 | 1.47 ± 0.068 | 1.897 ± 0.088 | 1.106 ± 0.057 | 1.339 ± 0.07 | 0.35 ± 0.03 | 2.271 ± 0.077 | 2.268 ± 0.088 | 1.784 ± 0.089 | 0.526 ± 0.044 | 1.581 ± 0.069 | 0.433 ± 0.036 | 0.547 ± 0.042 | 0.726 ± 0.045 | 1.461 ± 0.078 | 1.351 ± 0.106 | 1.766 ± 0.081 | 0.149 ± 0.022 | 1.18 ± 0.057 | 0.0 ± 0.0
Gln | 1.21 ± 0.072 | 0.164 ± 0.027 | 1.324 ± 0.06 | 1.554 ± 0.078 | 0.747 ± 0.054 | 1.162 ± 0.071 | 0.176 ± 0.021 | 2.384 ± 0.082 | 2.555 ± 0.094 | 1.386 ± 0.066 | 0.595 ± 0.043 | 2.047 ± 0.069 | 0.439 ± 0.034 | 0.511 ± 0.049 | 0.84 ± 0.053 | 1.231 ± 0.068 | 1.129 ± 0.061 | 1.524 ± 0.065 | 0.111 ± 0.018 | 0.813 ± 0.051 | 0.0 ± 0.0
Arg | 1.497 ± 0.079 | 0.436 ± 0.039 | 1.643 ± 0.08 | 2.534 ± 0.103 | 1.237 ± 0.063 | 1.634 ± 0.073 | 0.493 ± 0.035 | 2.976 ± 0.107 | 3.541 ± 0.124 | 2.605 ± 0.091 | 0.849 ± 0.053 | 2.101 ± 0.081 | 0.744 ± 0.053 | 0.834 ± 0.05 | 1.231 ± 0.068 | 1.401 ± 0.062 | 1.437 ± 0.069 | 2.041 ± 0.089 | 0.117 ± 0.017 | 1.557 ± 0.074 | 0.0 ± 0.0
Ser | 2.755 ± 0.094 | 0.705 ± 0.053 | 4.049 ± 0.114 | 3.959 ± 0.112 | 2.847 ± 0.1 | 3.609 ± 0.127 | 0.786 ± 0.044 | 6.302 ± 0.149 | 7.099 ± 0.152 | 5.519 ± 0.132 | 1.512 ± 0.065 | 5.13 ± 0.158 | 1.285 ± 0.063 | 1.533 ± 0.069 | 2.047 ± 0.078 | 4.849 ± 0.16 | 3.364 ± 0.112 | 4.019 ± 0.119 | 0.302 ± 0.029 | 3.06 ± 0.1 | 0.0 ± 0.0
Thr | 2.456 ± 0.102 | 0.624 ± 0.039 | 2.919 ± 0.088 | 2.937 ± 0.087 | 2.199 ± 0.077 | 3.544 ± 0.144 | 0.741 ± 0.048 | 4.855 ± 0.144 | 4.948 ± 0.125 | 4.536 ± 0.139 | 1.079 ± 0.057 | 3.481 ± 0.106 | 1.599 ± 0.078 | 1.219 ± 0.067 | 1.613 ± 0.074 | 3.615 ± 0.108 | 3.045 ± 0.134 | 3.323 ± 0.138 | 0.287 ± 0.03 | 2.31 ± 0.098 | 0.0 ± 0.0
Val | 3.606 ± 0.116 | 0.971 ± 0.056 | 4.031 ± 0.13 | 4.625 ± 0.121 | 2.599 ± 0.088 | 3.798 ± 0.117 | 0.675 ± 0.046 | 7.093 ± 0.181 | 6.263 ± 0.14 | 5.381 ± 0.144 | 1.386 ± 0.066 | 4.35 ± 0.137 | 1.727 ± 0.086 | 1.291 ± 0.063 | 1.772 ± 0.088 | 4.694 ± 0.124 | 3.618 ± 0.111 | 4.643 ± 0.125 | 0.308 ± 0.031 | 2.88 ± 0.084 | 0.0 ± 0.0
Trp | 0.257 ± 0.03 | 0.117 ± 0.021 | 0.278 ± 0.03 | 0.269 ± 0.026 | 0.236 ± 0.033 | 0.281 ± 0.028 | 0.093 ± 0.017 | 0.493 ± 0.034 | 0.43 ± 0.041 | 0.436 ± 0.04 | 0.149 ± 0.018 | 0.35 ± 0.034 | 0.108 ± 0.019 | 0.167 ± 0.019 | 0.111 ± 0.016 | 0.284 ± 0.03 | 0.248 ± 0.029 | 0.251 ± 0.028 | 0.06 ± 0.012 | 0.35 ± 0.03 | 0.0 ± 0.0
Tyr | 2.092 ± 0.084 | 0.556 ± 0.046 | 2.913 ± 0.096 | 3.203 ± 0.11 | 1.987 ± 0.082 | 2.456 ± 0.091 | 0.514 ± 0.038 | 5.465 ± 0.14 | 4.084 ± 0.12 | 4.019 ± 0.116 | 1.228 ± 0.064 | 3.851 ± 0.115 | 1.085 ± 0.058 | 0.935 ± 0.06 | 1.461 ± 0.063 | 3.113 ± 0.097 | 2.701 ± 0.109 | 2.88 ± 0.092 | 0.185 ± 0.018 | 1.975 ± 0.084 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 1173 proteins (334682 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
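A protein-level bootstrap of this kind can be sketched as follows. The sketch rests on assumptions that the page does not confirm: the function names are hypothetical, whole proteins are resampled with replacement, dipeptides are counted as overlapping pairs, and the reported error is taken to be the standard deviation over the resampled estimates.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation across resamples.
    return stdev(estimates)
```

Resampling whole proteins keeps the residues of one protein together, so the spread reflects variation between proteins rather than treating every dipeptide as an independent observation.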

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski