Amino acid dipepetide frequency for Veillonella sp. DORA_B_18_19_23

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.362AlaAla: 6.362 ± 0.227
0.822AlaCys: 0.822 ± 0.074
3.964AlaAsp: 3.964 ± 0.15
4.081AlaGlu: 4.081 ± 0.19
3.226AlaPhe: 3.226 ± 0.159
5.233AlaGly: 5.233 ± 0.216
1.414AlaHis: 1.414 ± 0.09
6.412AlaIle: 6.412 ± 0.246
5.177AlaLys: 5.177 ± 0.184
7.961AlaLeu: 7.961 ± 0.253
2.449AlaMet: 2.449 ± 0.133
3.807AlaAsn: 3.807 ± 0.209
2.029AlaPro: 2.029 ± 0.093
2.6AlaGln: 2.6 ± 0.135
2.773AlaArg: 2.773 ± 0.164
4.008AlaSer: 4.008 ± 0.176
4.21AlaThr: 4.21 ± 0.212
5.876AlaVal: 5.876 ± 0.181
0.699AlaTrp: 0.699 ± 0.064
2.717AlaTyr: 2.717 ± 0.132
0.006AlaXaa: 0.006 ± 0.005
Cys
0.688CysAla: 0.688 ± 0.059
0.162CysCys: 0.162 ± 0.032
0.587CysAsp: 0.587 ± 0.066
0.542CysGlu: 0.542 ± 0.06
0.458CysPhe: 0.458 ± 0.046
0.911CysGly: 0.911 ± 0.084
0.268CysHis: 0.268 ± 0.041
0.922CysIle: 0.922 ± 0.083
0.671CysLys: 0.671 ± 0.062
0.872CysLeu: 0.872 ± 0.072
0.313CysMet: 0.313 ± 0.046
0.436CysAsn: 0.436 ± 0.052
0.369CysPro: 0.369 ± 0.052
0.347CysGln: 0.347 ± 0.044
0.375CysArg: 0.375 ± 0.048
0.637CysSer: 0.637 ± 0.063
0.565CysThr: 0.565 ± 0.057
0.66CysVal: 0.66 ± 0.066
0.101CysTrp: 0.101 ± 0.025
0.419CysTyr: 0.419 ± 0.04
0.0CysXaa: 0.0 ± 0.0
Asp
4.439AspAla: 4.439 ± 0.17
0.654AspCys: 0.654 ± 0.073
3.282AspAsp: 3.282 ± 0.174
4.238AspGlu: 4.238 ± 0.169
2.32AspPhe: 2.32 ± 0.123
4.372AspGly: 4.372 ± 0.229
1.163AspHis: 1.163 ± 0.09
4.981AspIle: 4.981 ± 0.183
3.818AspLys: 3.818 ± 0.147
4.702AspLeu: 4.702 ± 0.199
1.795AspMet: 1.795 ± 0.108
2.778AspAsn: 2.778 ± 0.152
1.811AspPro: 1.811 ± 0.109
1.493AspGln: 1.493 ± 0.105
2.018AspArg: 2.018 ± 0.104
3.265AspSer: 3.265 ± 0.131
3.461AspThr: 3.461 ± 0.153
4.433AspVal: 4.433 ± 0.165
0.626AspTrp: 0.626 ± 0.059
2.633AspTyr: 2.633 ± 0.14
0.0AspXaa: 0.0 ± 0.0
Glu
5.087GluAla: 5.087 ± 0.207
0.604GluCys: 0.604 ± 0.067
3.421GluAsp: 3.421 ± 0.148
4.5GluGlu: 4.5 ± 0.207
2.387GluPhe: 2.387 ± 0.14
3.829GluGly: 3.829 ± 0.168
1.448GluHis: 1.448 ± 0.105
4.148GluIle: 4.148 ± 0.175
3.891GluLys: 3.891 ± 0.145
6.552GluLeu: 6.552 ± 0.222
1.526GluMet: 1.526 ± 0.102
2.89GluAsn: 2.89 ± 0.127
1.89GluPro: 1.89 ± 0.122
2.572GluGln: 2.572 ± 0.107
3.119GluArg: 3.119 ± 0.14
3.807GluSer: 3.807 ± 0.175
3.321GluThr: 3.321 ± 0.137
3.964GluVal: 3.964 ± 0.15
0.604GluTrp: 0.604 ± 0.06
2.376GluTyr: 2.376 ± 0.117
0.006GluXaa: 0.006 ± 0.005
Phe
2.398PheAla: 2.398 ± 0.129
0.464PheCys: 0.464 ± 0.057
2.655PheAsp: 2.655 ± 0.139
2.147PheGlu: 2.147 ± 0.117
1.638PhePhe: 1.638 ± 0.113
3.131PheGly: 3.131 ± 0.166
0.867PheHis: 0.867 ± 0.069
2.946PheIle: 2.946 ± 0.166
2.421PheLys: 2.421 ± 0.119
3.338PheLeu: 3.338 ± 0.193
1.152PheMet: 1.152 ± 0.087
2.208PheAsn: 2.208 ± 0.106
1.297PhePro: 1.297 ± 0.095
0.962PheGln: 0.962 ± 0.079
1.28PheArg: 1.28 ± 0.099
2.65PheSer: 2.65 ± 0.15
2.404PheThr: 2.404 ± 0.116
2.639PheVal: 2.639 ± 0.147
0.419PheTrp: 0.419 ± 0.052
1.604PheTyr: 1.604 ± 0.096
0.0PheXaa: 0.0 ± 0.0
Gly
5.775GlyAla: 5.775 ± 0.275
0.771GlyCys: 0.771 ± 0.069
4.036GlyAsp: 4.036 ± 0.211
3.829GlyGlu: 3.829 ± 0.155
3.097GlyPhe: 3.097 ± 0.142
5.3GlyGly: 5.3 ± 0.257
1.761GlyHis: 1.761 ± 0.109
5.758GlyIle: 5.758 ± 0.203
4.813GlyLys: 4.813 ± 0.206
6.72GlyLeu: 6.72 ± 0.213
1.957GlyMet: 1.957 ± 0.119
3.617GlyAsn: 3.617 ± 0.222
2.001GlyPro: 2.001 ± 0.115
2.231GlyGln: 2.231 ± 0.109
3.075GlyArg: 3.075 ± 0.137
4.433GlySer: 4.433 ± 0.21
5.009GlyThr: 5.009 ± 0.239
5.311GlyVal: 5.311 ± 0.195
0.565GlyTrp: 0.565 ± 0.048
2.801GlyTyr: 2.801 ± 0.129
0.0GlyXaa: 0.0 ± 0.0
His
1.241HisAla: 1.241 ± 0.091
0.313HisCys: 0.313 ± 0.044
1.096HisAsp: 1.096 ± 0.056
1.202HisGlu: 1.202 ± 0.083
0.827HisPhe: 0.827 ± 0.075
1.599HisGly: 1.599 ± 0.108
0.559HisHis: 0.559 ± 0.064
2.136HisIle: 2.136 ± 0.129
1.42HisLys: 1.42 ± 0.103
1.795HisLeu: 1.795 ± 0.1
0.727HisMet: 0.727 ± 0.068
1.224HisAsn: 1.224 ± 0.083
0.984HisPro: 0.984 ± 0.086
0.777HisGln: 0.777 ± 0.073
0.85HisArg: 0.85 ± 0.062
1.185HisSer: 1.185 ± 0.081
1.219HisThr: 1.219 ± 0.094
1.431HisVal: 1.431 ± 0.112
0.201HisTrp: 0.201 ± 0.04
0.934HisTyr: 0.934 ± 0.085
0.011HisXaa: 0.011 ± 0.008
Ile
6.038IleAla: 6.038 ± 0.206
0.827IleCys: 0.827 ± 0.07
5.121IleAsp: 5.121 ± 0.189
4.752IleGlu: 4.752 ± 0.166
3.03IlePhe: 3.03 ± 0.163
5.87IleGly: 5.87 ± 0.248
1.431IleHis: 1.431 ± 0.086
5.809IleIle: 5.809 ± 0.227
4.327IleLys: 4.327 ± 0.178
6.642IleLeu: 6.642 ± 0.255
1.839IleMet: 1.839 ± 0.097
4.003IleAsn: 4.003 ± 0.187
3.008IlePro: 3.008 ± 0.137
2.136IleGln: 2.136 ± 0.116
2.734IleArg: 2.734 ± 0.119
4.847IleSer: 4.847 ± 0.208
4.908IleThr: 4.908 ± 0.224
5.272IleVal: 5.272 ± 0.208
0.637IleTrp: 0.637 ± 0.067
2.762IleTyr: 2.762 ± 0.136
0.011IleXaa: 0.011 ± 0.009
Lys
5.289LysAla: 5.289 ± 0.173
0.363LysCys: 0.363 ± 0.051
4.4LysAsp: 4.4 ± 0.234
4.858LysGlu: 4.858 ± 0.203
1.688LysPhe: 1.688 ± 0.101
4.294LysGly: 4.294 ± 0.166
1.325LysHis: 1.325 ± 0.106
3.639LysIle: 3.639 ± 0.164
4.221LysLys: 4.221 ± 0.213
5.238LysLeu: 5.238 ± 0.206
1.375LysMet: 1.375 ± 0.074
3.567LysAsn: 3.567 ± 0.183
2.259LysPro: 2.259 ± 0.113
2.734LysGln: 2.734 ± 0.127
2.957LysArg: 2.957 ± 0.159
3.841LysSer: 3.841 ± 0.15
3.818LysThr: 3.818 ± 0.238
4.383LysVal: 4.383 ± 0.171
0.542LysTrp: 0.542 ± 0.052
2.499LysTyr: 2.499 ± 0.153
0.011LysXaa: 0.011 ± 0.007
Leu
7.167LeuAla: 7.167 ± 0.219
1.006LeuCys: 1.006 ± 0.086
5.272LeuAsp: 5.272 ± 0.203
5.663LeuGlu: 5.663 ± 0.223
3.734LeuPhe: 3.734 ± 0.168
7.329LeuGly: 7.329 ± 0.225
2.035LeuHis: 2.035 ± 0.109
5.652LeuIle: 5.652 ± 0.205
5.372LeuLys: 5.372 ± 0.217
8.157LeuLeu: 8.157 ± 0.308
2.225LeuMet: 2.225 ± 0.122
4.461LeuAsn: 4.461 ± 0.176
3.522LeuPro: 3.522 ± 0.161
3.192LeuGln: 3.192 ± 0.154
3.69LeuArg: 3.69 ± 0.163
6.166LeuSer: 6.166 ± 0.202
5.456LeuThr: 5.456 ± 0.162
6.602LeuVal: 6.602 ± 0.228
0.928LeuTrp: 0.928 ± 0.077
3.069LeuTyr: 3.069 ± 0.134
0.022LeuXaa: 0.022 ± 0.01
Met
2.46MetAla: 2.46 ± 0.126
0.246MetCys: 0.246 ± 0.039
1.526MetAsp: 1.526 ± 0.095
1.465MetGlu: 1.465 ± 0.091
0.799MetPhe: 0.799 ± 0.07
1.985MetGly: 1.985 ± 0.119
0.503MetHis: 0.503 ± 0.054
1.94MetIle: 1.94 ± 0.111
1.945MetLys: 1.945 ± 0.118
2.292MetLeu: 2.292 ± 0.124
0.878MetMet: 0.878 ± 0.067
1.42MetAsn: 1.42 ± 0.078
1.029MetPro: 1.029 ± 0.074
0.872MetGln: 0.872 ± 0.081
1.18MetArg: 1.18 ± 0.085
1.727MetSer: 1.727 ± 0.104
1.688MetThr: 1.688 ± 0.093
2.013MetVal: 2.013 ± 0.118
0.285MetTrp: 0.285 ± 0.038
0.883MetTyr: 0.883 ± 0.072
0.0MetXaa: 0.0 ± 0.0
Asn
3.215AsnAla: 3.215 ± 0.179
0.475AsnCys: 0.475 ± 0.058
2.795AsnAsp: 2.795 ± 0.154
2.98AsnGlu: 2.98 ± 0.115
1.761AsnPhe: 1.761 ± 0.112
4.171AsnGly: 4.171 ± 0.236
1.269AsnHis: 1.269 ± 0.087
4.338AsnIle: 4.338 ± 0.194
3.74AsnLys: 3.74 ± 0.2
3.975AsnLeu: 3.975 ± 0.174
1.152AsnMet: 1.152 ± 0.081
2.974AsnAsn: 2.974 ± 0.207
2.46AsnPro: 2.46 ± 0.108
1.733AsnGln: 1.733 ± 0.105
2.572AsnArg: 2.572 ± 0.116
2.974AsnSer: 2.974 ± 0.174
2.901AsnThr: 2.901 ± 0.166
3.466AsnVal: 3.466 ± 0.154
0.464AsnTrp: 0.464 ± 0.05
1.94AsnTyr: 1.94 ± 0.091
0.0AsnXaa: 0.0 ± 0.0
Pro
2.342ProAla: 2.342 ± 0.117
0.341ProCys: 0.341 ± 0.041
2.057ProAsp: 2.057 ± 0.115
2.56ProGlu: 2.56 ± 0.115
1.47ProPhe: 1.47 ± 0.097
2.119ProGly: 2.119 ± 0.131
0.917ProHis: 0.917 ± 0.068
2.577ProIle: 2.577 ± 0.135
2.136ProLys: 2.136 ± 0.101
3.019ProLeu: 3.019 ± 0.131
1.012ProMet: 1.012 ± 0.078
1.912ProAsn: 1.912 ± 0.125
0.811ProPro: 0.811 ± 0.087
1.314ProGln: 1.314 ± 0.09
1.163ProArg: 1.163 ± 0.085
2.018ProSer: 2.018 ± 0.131
2.348ProThr: 2.348 ± 0.114
2.812ProVal: 2.812 ± 0.131
0.302ProTrp: 0.302 ± 0.043
1.526ProTyr: 1.526 ± 0.093
0.028ProXaa: 0.028 ± 0.012
Gln
2.778GlnAla: 2.778 ± 0.134
0.38GlnCys: 0.38 ± 0.054
1.934GlnAsp: 1.934 ± 0.096
2.119GlnGlu: 2.119 ± 0.11
1.325GlnPhe: 1.325 ± 0.08
1.985GlnGly: 1.985 ± 0.099
0.76GlnHis: 0.76 ± 0.075
2.421GlnIle: 2.421 ± 0.119
1.929GlnLys: 1.929 ± 0.11
3.315GlnLeu: 3.315 ± 0.154
0.878GlnMet: 0.878 ± 0.067
1.599GlnAsn: 1.599 ± 0.098
1.073GlnPro: 1.073 ± 0.085
1.515GlnGln: 1.515 ± 0.107
1.515GlnArg: 1.515 ± 0.114
1.996GlnSer: 1.996 ± 0.121
1.683GlnThr: 1.683 ± 0.094
2.303GlnVal: 2.303 ± 0.123
0.375GlnTrp: 0.375 ± 0.048
1.61GlnTyr: 1.61 ± 0.104
0.0GlnXaa: 0.0 ± 0.0
Arg
2.711ArgAla: 2.711 ± 0.129
0.391ArgCys: 0.391 ± 0.054
2.247ArgAsp: 2.247 ± 0.119
2.717ArgGlu: 2.717 ± 0.134
1.638ArgPhe: 1.638 ± 0.104
2.32ArgGly: 2.32 ± 0.139
0.833ArgHis: 0.833 ± 0.065
3.365ArgIle: 3.365 ± 0.141
2.51ArgLys: 2.51 ± 0.141
4.087ArgLeu: 4.087 ± 0.156
1.286ArgMet: 1.286 ± 0.096
2.186ArgAsn: 2.186 ± 0.116
1.353ArgPro: 1.353 ± 0.092
1.409ArgGln: 1.409 ± 0.098
2.052ArgArg: 2.052 ± 0.143
2.164ArgSer: 2.164 ± 0.134
2.326ArgThr: 2.326 ± 0.103
2.678ArgVal: 2.678 ± 0.121
0.347ArgTrp: 0.347 ± 0.041
1.761ArgTyr: 1.761 ± 0.092
0.017ArgXaa: 0.017 ± 0.009
Ser
4.154SerAla: 4.154 ± 0.187
0.587SerCys: 0.587 ± 0.062
3.259SerAsp: 3.259 ± 0.143
3.472SerGlu: 3.472 ± 0.139
2.37SerPhe: 2.37 ± 0.133
4.69SerGly: 4.69 ± 0.192
1.269SerHis: 1.269 ± 0.083
5.3SerIle: 5.3 ± 0.213
3.679SerLys: 3.679 ± 0.154
6.055SerLeu: 6.055 ± 0.222
1.7SerMet: 1.7 ± 0.096
3.114SerAsn: 3.114 ± 0.155
1.772SerPro: 1.772 ± 0.113
1.929SerGln: 1.929 ± 0.104
2.203SerArg: 2.203 ± 0.113
3.695SerSer: 3.695 ± 0.195
3.623SerThr: 3.623 ± 0.174
4.797SerVal: 4.797 ± 0.148
0.587SerTrp: 0.587 ± 0.049
2.404SerTyr: 2.404 ± 0.116
0.011SerXaa: 0.011 ± 0.008
Thr
4.707ThrAla: 4.707 ± 0.22
0.576ThrCys: 0.576 ± 0.069
3.175ThrAsp: 3.175 ± 0.149
3.343ThrGlu: 3.343 ± 0.197
2.119ThrPhe: 2.119 ± 0.122
4.618ThrGly: 4.618 ± 0.207
1.308ThrHis: 1.308 ± 0.091
5.182ThrIle: 5.182 ± 0.169
3.734ThrLys: 3.734 ± 0.159
5.367ThrLeu: 5.367 ± 0.18
1.627ThrMet: 1.627 ± 0.088
3.108ThrAsn: 3.108 ± 0.213
2.532ThrPro: 2.532 ± 0.116
1.666ThrGln: 1.666 ± 0.103
1.834ThrArg: 1.834 ± 0.097
3.567ThrSer: 3.567 ± 0.173
4.008ThrThr: 4.008 ± 0.218
5.451ThrVal: 5.451 ± 0.259
0.576ThrTrp: 0.576 ± 0.053
2.259ThrTyr: 2.259 ± 0.108
0.017ThrXaa: 0.017 ± 0.01
Val
5.982ValAla: 5.982 ± 0.218
0.794ValCys: 0.794 ± 0.058
4.232ValAsp: 4.232 ± 0.169
4.467ValGlu: 4.467 ± 0.179
2.756ValPhe: 2.756 ± 0.155
5.495ValGly: 5.495 ± 0.188
1.532ValHis: 1.532 ± 0.097
4.763ValIle: 4.763 ± 0.165
4.394ValLys: 4.394 ± 0.179
6.289ValLeu: 6.289 ± 0.221
1.839ValMet: 1.839 ± 0.102
3.589ValAsn: 3.589 ± 0.162
2.946ValPro: 2.946 ± 0.139
2.454ValGln: 2.454 ± 0.124
2.7ValArg: 2.7 ± 0.13
4.741ValSer: 4.741 ± 0.195
5.02ValThr: 5.02 ± 0.209
5.261ValVal: 5.261 ± 0.206
0.671ValTrp: 0.671 ± 0.06
2.594ValTyr: 2.594 ± 0.115
0.011ValXaa: 0.011 ± 0.007
Trp
0.66TrpAla: 0.66 ± 0.062
0.073TrpCys: 0.073 ± 0.019
0.637TrpAsp: 0.637 ± 0.057
0.581TrpGlu: 0.581 ± 0.061
0.425TrpPhe: 0.425 ± 0.055
0.643TrpGly: 0.643 ± 0.062
0.263TrpHis: 0.263 ± 0.034
0.727TrpIle: 0.727 ± 0.069
0.47TrpLys: 0.47 ± 0.053
0.962TrpLeu: 0.962 ± 0.064
0.291TrpMet: 0.291 ± 0.042
0.486TrpAsn: 0.486 ± 0.043
0.28TrpPro: 0.28 ± 0.036
0.403TrpGln: 0.403 ± 0.05
0.503TrpArg: 0.503 ± 0.057
0.598TrpSer: 0.598 ± 0.049
0.509TrpThr: 0.509 ± 0.061
0.458TrpVal: 0.458 ± 0.048
0.134TrpTrp: 0.134 ± 0.024
0.386TrpTyr: 0.386 ± 0.05
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.326TyrAla: 2.326 ± 0.119
0.447TyrCys: 0.447 ± 0.046
2.56TyrAsp: 2.56 ± 0.137
2.426TyrGlu: 2.426 ± 0.109
1.537TyrPhe: 1.537 ± 0.093
2.963TyrGly: 2.963 ± 0.141
0.799TyrHis: 0.799 ± 0.072
2.985TyrIle: 2.985 ± 0.154
2.633TyrLys: 2.633 ± 0.169
3.365TyrLeu: 3.365 ± 0.149
1.113TyrMet: 1.113 ± 0.076
1.962TyrAsn: 1.962 ± 0.123
1.336TyrPro: 1.336 ± 0.097
1.152TyrGln: 1.152 ± 0.076
1.806TyrArg: 1.806 ± 0.102
2.342TyrSer: 2.342 ± 0.12
2.292TyrThr: 2.292 ± 0.123
2.723TyrVal: 2.723 ± 0.124
0.425TyrTrp: 0.425 ± 0.054
1.627TyrTyr: 1.627 ± 0.094
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.006XaaAla: 0.006 ± 0.007
0.006XaaCys: 0.006 ± 0.006
0.022XaaAsp: 0.022 ± 0.01
0.0XaaGlu: 0.0 ± 0.001
0.011XaaPhe: 0.011 ± 0.009
0.006XaaGly: 0.006 ± 0.006
0.006XaaHis: 0.006 ± 0.005
0.0XaaIle: 0.0 ± 0.002
0.0XaaLys: 0.0 ± 0.0
0.022XaaLeu: 0.022 ± 0.01
0.0XaaMet: 0.0 ± 0.006
0.011XaaAsn: 0.011 ± 0.007
0.022XaaPro: 0.022 ± 0.011
0.006XaaGln: 0.006 ± 0.006
0.0XaaArg: 0.0 ± 0.001
0.006XaaSer: 0.006 ± 0.005
0.006XaaThr: 0.006 ± 0.006
0.006XaaVal: 0.006 ± 0.005
0.0XaaTrp: 0.0 ± 0.0
0.017XaaTyr: 0.017 ± 0.009
0.816XaaXaa: 0.816 ± 0.187
Statistics based on 870 proteins (178876 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski