Amino acid dipeptide frequency for Epsilonproteobacteria bacterium SCGC AD-311-E16

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones are marked in blue in the table.
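As a rough illustration of the idea, the sketch below counts overlapping dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 2.5‰. It is not the database's own pipeline: the function name `dipeptide_permille`, the use of one-letter codes, the choice to skip pairs containing non-standard residues, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (assumption: Xaa and other
# non-standard residues are simply skipped in this sketch).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are counted within each protein, never across protein boundaries.
        for a, b in zip(seq, seq[1:]):
            if a in STANDARD and b in STANDARD:
                counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(STANDARD, repeat=2)
    }

# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2

# Toy, made-up sequences purely for illustration.
toy = ["MKLLA", "MKLV"]
freqs = dipeptide_permille(toy)
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED_PERMILLE)
print(over)
```

With only a handful of residues, every observed dipeptide is far above 2.5‰; the comparison becomes meaningful only for a full proteome.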

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
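To make the unit concrete, the short sketch below converts a few per mille values taken from the table on this page into plain fractions and relates them to the uniform expectation of 2.5‰. The helper name `describe` and the use of 2.5‰ as the threshold are assumptions for illustration; the page does not state exactly how the red and blue markings are computed.

```python
# Values copied from the table on this page (per mille).
table_permille = {"LeuLeu": 9.135, "AlaAla": 4.717, "TrpTrp": 0.067}

# 2.5 per mille if all 400 standard dipeptides were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400

def describe(permille):
    """Return (fraction, ratio to the uniform expectation, label)."""
    fraction = permille * 1e-3           # per mille -> plain fraction
    ratio = permille / UNIFORM_PERMILLE  # > 1 means more frequent than uniform
    if ratio > 1:
        label = "overrepresented"
    elif ratio < 1:
        label = "underrepresented"
    else:
        label = "as expected"
    return fraction, ratio, label

for name, value in table_permille.items():
    fraction, ratio, label = describe(value)
    print(f"{name}: fraction={fraction:.6f}, {ratio:.2f}x uniform, {label}")
```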

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue; second residues appear in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 4.717 ± 0.275
AlaCys: 0.464 ± 0.068
AlaAsp: 3.377 ± 0.144
AlaGlu: 3.467 ± 0.193
AlaPhe: 3.048 ± 0.177
AlaGly: 4.111 ± 0.233
AlaHis: 1.348 ± 0.091
AlaIle: 5.968 ± 0.242
AlaLys: 5.578 ± 0.225
AlaLeu: 7.098 ± 0.257
AlaMet: 2.329 ± 0.146
AlaAsn: 3.137 ± 0.16
AlaPro: 1.677 ± 0.114
AlaGln: 2.291 ± 0.141
AlaArg: 2.291 ± 0.154
AlaSer: 4.231 ± 0.215
AlaThr: 3.691 ± 0.183
AlaVal: 4.32 ± 0.217
AlaTrp: 0.629 ± 0.082
AlaTyr: 2.666 ± 0.163
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.427 ± 0.064
CysCys: 0.067 ± 0.025
CysAsp: 0.644 ± 0.07
CysGlu: 0.629 ± 0.076
CysPhe: 0.359 ± 0.055
CysGly: 0.621 ± 0.086
CysHis: 0.24 ± 0.04
CysIle: 0.592 ± 0.072
CysLys: 0.524 ± 0.065
CysLeu: 0.457 ± 0.06
CysMet: 0.18 ± 0.035
CysAsn: 0.329 ± 0.046
CysPro: 0.262 ± 0.052
CysGln: 0.187 ± 0.038
CysArg: 0.24 ± 0.042
CysSer: 0.607 ± 0.067
CysThr: 0.344 ± 0.052
CysVal: 0.479 ± 0.052
CysTrp: 0.052 ± 0.018
CysTyr: 0.277 ± 0.046
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.178 ± 0.217
AspCys: 0.442 ± 0.067
AspAsp: 3.145 ± 0.169
AspGlu: 5.002 ± 0.243
AspPhe: 3.19 ± 0.16
AspGly: 3.647 ± 0.19
AspHis: 0.644 ± 0.079
AspIle: 5.751 ± 0.235
AspLys: 5.137 ± 0.214
AspLeu: 5.152 ± 0.182
AspMet: 1.79 ± 0.144
AspAsn: 2.95 ± 0.148
AspPro: 1.25 ± 0.102
AspGln: 1.295 ± 0.101
AspArg: 1.587 ± 0.127
AspSer: 3.706 ± 0.165
AspThr: 3.145 ± 0.168
AspVal: 3.826 ± 0.178
AspTrp: 0.442 ± 0.064
AspTyr: 2.351 ± 0.156
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.755 ± 0.197
GluCys: 0.524 ± 0.067
GluAsp: 4.051 ± 0.204
GluGlu: 5.421 ± 0.269
GluPhe: 3.13 ± 0.204
GluGly: 2.868 ± 0.163
GluHis: 1.602 ± 0.112
GluIle: 6.484 ± 0.256
GluLys: 6.432 ± 0.244
GluLeu: 7.57 ± 0.278
GluMet: 2.044 ± 0.119
GluAsn: 4.35 ± 0.221
GluPro: 1.355 ± 0.093
GluGln: 2.134 ± 0.136
GluArg: 2.321 ± 0.173
GluSer: 4.253 ± 0.213
GluThr: 2.92 ± 0.159
GluVal: 4.38 ± 0.214
GluTrp: 0.704 ± 0.085
GluTyr: 2.943 ± 0.156
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.04 ± 0.179
PheCys: 0.374 ± 0.06
PheAsp: 3.01 ± 0.129
PheGlu: 3.444 ± 0.161
PhePhe: 2.591 ± 0.181
PheGly: 3.197 ± 0.17
PheHis: 0.801 ± 0.084
PheIle: 3.864 ± 0.216
PheLys: 3.489 ± 0.165
PheLeu: 4.455 ± 0.192
PheMet: 1.318 ± 0.113
PheAsn: 2.748 ± 0.153
PhePro: 1.198 ± 0.094
PheGln: 1.078 ± 0.086
PheArg: 1.183 ± 0.092
PheSer: 3.647 ± 0.192
PheThr: 2.95 ± 0.134
PheVal: 2.793 ± 0.169
PheTrp: 0.382 ± 0.067
PheTyr: 1.969 ± 0.148
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.916 ± 0.201
GlyCys: 0.651 ± 0.074
GlyAsp: 3.025 ± 0.2
GlyGlu: 3.534 ± 0.199
GlyPhe: 3.1 ± 0.162
GlyGly: 4.335 ± 0.27
GlyHis: 1.093 ± 0.098
GlyIle: 5.249 ± 0.198
GlyLys: 4.41 ± 0.21
GlyLeu: 5.241 ± 0.237
GlyMet: 1.662 ± 0.128
GlyAsn: 2.276 ± 0.161
GlyPro: 0.906 ± 0.097
GlyGln: 1.385 ± 0.124
GlyArg: 1.984 ± 0.119
GlySer: 3.587 ± 0.162
GlyThr: 3.212 ± 0.153
GlyVal: 4.35 ± 0.219
GlyTrp: 0.539 ± 0.066
GlyTyr: 2.913 ± 0.169
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.28 ± 0.105
HisCys: 0.172 ± 0.037
HisAsp: 1.101 ± 0.089
HisGlu: 1.258 ± 0.119
HisPhe: 0.771 ± 0.078
HisGly: 1.31 ± 0.105
HisHis: 0.569 ± 0.066
HisIle: 1.782 ± 0.107
HisLys: 1.393 ± 0.087
HisLeu: 1.715 ± 0.123
HisMet: 0.621 ± 0.075
HisAsn: 1.176 ± 0.107
HisPro: 0.884 ± 0.083
HisGln: 0.779 ± 0.069
HisArg: 0.636 ± 0.073
HisSer: 1.325 ± 0.096
HisThr: 1.198 ± 0.101
HisVal: 1.153 ± 0.107
HisTrp: 0.18 ± 0.036
HisTyr: 0.756 ± 0.063
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.23 ± 0.25
IleCys: 0.741 ± 0.08
IleAsp: 5.736 ± 0.226
IleGlu: 6.2 ± 0.254
IlePhe: 4.133 ± 0.2
IleGly: 5.077 ± 0.199
IleHis: 1.46 ± 0.111
IleIle: 6.986 ± 0.319
IleLys: 6.874 ± 0.267
IleLeu: 7.84 ± 0.284
IleMet: 2.007 ± 0.119
IleAsn: 5.144 ± 0.233
IlePro: 2.681 ± 0.156
IleGln: 2.538 ± 0.122
IleArg: 2.688 ± 0.151
IleSer: 6.103 ± 0.215
IleThr: 4.575 ± 0.195
IleVal: 5.616 ± 0.194
IleTrp: 0.584 ± 0.079
IleTyr: 3.137 ± 0.148
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.725 ± 0.187
LysCys: 0.487 ± 0.058
LysAsp: 5.496 ± 0.222
LysGlu: 7.48 ± 0.298
LysPhe: 2.726 ± 0.155
LysGly: 3.617 ± 0.191
LysHis: 1.67 ± 0.113
LysIle: 7.211 ± 0.211
LysLys: 7.825 ± 0.336
LysLeu: 7.031 ± 0.218
LysMet: 2.284 ± 0.134
LysAsn: 5.93 ± 0.201
LysPro: 2.434 ± 0.142
LysGln: 2.651 ± 0.156
LysArg: 2.988 ± 0.144
LysSer: 5.324 ± 0.189
LysThr: 4.253 ± 0.185
LysVal: 5.024 ± 0.19
LysTrp: 0.599 ± 0.067
LysTyr: 3.362 ± 0.169
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.118 ± 0.234
LeuCys: 0.681 ± 0.071
LeuAsp: 5.668 ± 0.224
LeuGlu: 6.866 ± 0.288
LeuPhe: 4.837 ± 0.253
LeuGly: 5.444 ± 0.212
LeuHis: 2.149 ± 0.156
LeuIle: 7.143 ± 0.249
LeuLys: 8.506 ± 0.248
LeuLeu: 9.135 ± 0.323
LeuMet: 2.441 ± 0.149
LeuAsn: 5.069 ± 0.21
LeuPro: 2.95 ± 0.158
LeuGln: 3.205 ± 0.164
LeuArg: 3.122 ± 0.159
LeuSer: 7.638 ± 0.24
LeuThr: 4.927 ± 0.165
LeuVal: 5.511 ± 0.224
LeuTrp: 0.651 ± 0.088
LeuTyr: 3.602 ± 0.172
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.089 ± 0.127
MetCys: 0.195 ± 0.041
MetAsp: 1.595 ± 0.094
MetGlu: 1.445 ± 0.104
MetPhe: 1.056 ± 0.11
MetGly: 1.513 ± 0.119
MetHis: 0.621 ± 0.063
MetIle: 2.276 ± 0.131
MetLys: 2.673 ± 0.154
MetLeu: 2.83 ± 0.136
MetMet: 1.033 ± 0.105
MetAsn: 1.542 ± 0.11
MetPro: 1.086 ± 0.095
MetGln: 1.265 ± 0.095
MetArg: 0.996 ± 0.087
MetSer: 2.336 ± 0.123
MetThr: 1.378 ± 0.112
MetVal: 1.528 ± 0.099
MetTrp: 0.24 ± 0.041
MetTyr: 0.756 ± 0.07
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.549 ± 0.163
AsnCys: 0.329 ± 0.049
AsnAsp: 2.965 ± 0.15
AsnGlu: 3.819 ± 0.185
AsnPhe: 2.419 ± 0.144
AsnGly: 3.137 ± 0.154
AsnHis: 0.996 ± 0.09
AsnIle: 5.751 ± 0.28
AsnLys: 4.59 ± 0.197
AsnLeu: 4.919 ± 0.201
AsnMet: 1.468 ± 0.093
AsnAsn: 3.212 ± 0.197
AsnPro: 2.089 ± 0.125
AsnGln: 1.617 ± 0.109
AsnArg: 1.827 ± 0.131
AsnSer: 3.819 ± 0.185
AsnThr: 3.063 ± 0.148
AsnVal: 2.808 ± 0.154
AsnTrp: 0.427 ± 0.061
AsnTyr: 2.119 ± 0.131
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.572 ± 0.105
ProCys: 0.18 ± 0.042
ProAsp: 1.55 ± 0.107
ProGlu: 1.932 ± 0.106
ProPhe: 1.655 ± 0.104
ProGly: 1.048 ± 0.11
ProHis: 0.644 ± 0.062
ProIle: 2.374 ± 0.161
ProLys: 2.119 ± 0.126
ProLeu: 2.905 ± 0.146
ProMet: 0.816 ± 0.093
ProAsn: 1.647 ± 0.126
ProPro: 0.614 ± 0.077
ProGln: 0.943 ± 0.081
ProArg: 0.861 ± 0.076
ProSer: 1.737 ± 0.109
ProThr: 1.67 ± 0.129
ProVal: 1.797 ± 0.108
ProTrp: 0.307 ± 0.048
ProTyr: 1.086 ± 0.074
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.052 ± 0.131
GlnCys: 0.172 ± 0.032
GlnAsp: 1.67 ± 0.107
GlnGlu: 2.703 ± 0.178
GlnPhe: 1.258 ± 0.096
GlnGly: 1.408 ± 0.113
GlnHis: 0.794 ± 0.078
GlnIle: 2.441 ± 0.132
GlnLys: 2.928 ± 0.175
GlnLeu: 2.823 ± 0.167
GlnMet: 0.973 ± 0.085
GlnAsn: 1.977 ± 0.141
GlnPro: 0.666 ± 0.067
GlnGln: 1.056 ± 0.1
GlnArg: 1.258 ± 0.102
GlnSer: 2.119 ± 0.138
GlnThr: 1.745 ± 0.116
GlnVal: 1.707 ± 0.11
GlnTrp: 0.322 ± 0.052
GlnTyr: 1.131 ± 0.095
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.112 ± 0.125
ArgCys: 0.277 ± 0.059
ArgAsp: 2.112 ± 0.137
ArgGlu: 2.441 ± 0.117
ArgPhe: 1.782 ± 0.121
ArgGly: 1.864 ± 0.134
ArgHis: 0.689 ± 0.082
ArgIle: 2.83 ± 0.154
ArgLys: 2.434 ± 0.147
ArgLeu: 3.085 ± 0.155
ArgMet: 0.831 ± 0.093
ArgAsn: 1.55 ± 0.111
ArgPro: 0.809 ± 0.091
ArgGln: 0.899 ± 0.089
ArgArg: 1.333 ± 0.118
ArgSer: 1.864 ± 0.114
ArgThr: 1.745 ± 0.12
ArgVal: 2.598 ± 0.14
ArgTrp: 0.24 ± 0.041
ArgTyr: 1.535 ± 0.125
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.65 ± 0.213
SerCys: 0.479 ± 0.067
SerAsp: 3.946 ± 0.201
SerGlu: 4.156 ± 0.199
SerPhe: 3.542 ± 0.202
SerGly: 4.403 ± 0.177
SerHis: 1.55 ± 0.104
SerIle: 6.14 ± 0.213
SerLys: 5.489 ± 0.198
SerLeu: 6.889 ± 0.228
SerMet: 2.097 ± 0.13
SerAsn: 3.422 ± 0.177
SerPro: 1.453 ± 0.101
SerGln: 2.501 ± 0.158
SerArg: 2.434 ± 0.139
SerSer: 5.039 ± 0.232
SerThr: 3.197 ± 0.162
SerVal: 4.246 ± 0.169
SerTrp: 0.592 ± 0.061
SerTyr: 2.89 ± 0.152
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.459 ± 0.182
ThrCys: 0.389 ± 0.058
ThrAsp: 2.666 ± 0.207
ThrGlu: 2.711 ± 0.152
ThrPhe: 2.718 ± 0.152
ThrGly: 3.212 ± 0.17
ThrHis: 1.153 ± 0.083
ThrIle: 4.478 ± 0.208
ThrLys: 4.388 ± 0.194
ThrLeu: 5.893 ± 0.229
ThrMet: 1.198 ± 0.097
ThrAsn: 2.726 ± 0.145
ThrPro: 2.037 ± 0.137
ThrGln: 2.074 ± 0.143
ThrArg: 1.58 ± 0.12
ThrSer: 3.355 ± 0.162
ThrThr: 3.152 ± 0.186
ThrVal: 3.25 ± 0.146
ThrTrp: 0.412 ± 0.057
ThrTyr: 2.097 ± 0.131
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.62 ± 0.227
ValCys: 0.494 ± 0.075
ValAsp: 3.901 ± 0.156
ValGlu: 4.208 ± 0.189
ValPhe: 2.651 ± 0.145
ValGly: 3.991 ± 0.197
ValHis: 1.063 ± 0.087
ValIle: 5.114 ± 0.261
ValLys: 4.597 ± 0.212
ValLeu: 5.818 ± 0.237
ValMet: 1.797 ± 0.129
ValAsn: 3.175 ± 0.168
ValPro: 1.767 ± 0.121
ValGln: 1.73 ± 0.116
ValArg: 2.022 ± 0.129
ValSer: 5.047 ± 0.186
ValThr: 3.235 ± 0.165
ValVal: 4.425 ± 0.206
ValTrp: 0.494 ± 0.061
ValTyr: 2.142 ± 0.141
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.569 ± 0.074
TrpCys: 0.067 ± 0.023
TrpAsp: 0.382 ± 0.055
TrpGlu: 0.524 ± 0.072
TrpPhe: 0.434 ± 0.067
TrpGly: 0.607 ± 0.071
TrpHis: 0.195 ± 0.034
TrpIle: 0.689 ± 0.082
TrpLys: 0.479 ± 0.065
TrpLeu: 0.884 ± 0.094
TrpMet: 0.374 ± 0.06
TrpAsn: 0.464 ± 0.066
TrpPro: 0.172 ± 0.045
TrpGln: 0.292 ± 0.047
TrpArg: 0.344 ± 0.047
TrpSer: 0.517 ± 0.064
TrpThr: 0.359 ± 0.053
TrpVal: 0.502 ± 0.074
TrpTrp: 0.067 ± 0.028
TrpTyr: 0.285 ± 0.04
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.389 ± 0.142
TyrCys: 0.329 ± 0.06
TyrAsp: 2.591 ± 0.147
TyrGlu: 3.003 ± 0.148
TyrPhe: 2.134 ± 0.148
TyrGly: 1.962 ± 0.123
TyrHis: 0.779 ± 0.078
TyrIle: 3.227 ± 0.187
TyrLys: 3.25 ± 0.181
TyrLeu: 3.924 ± 0.176
TyrMet: 1.183 ± 0.103
TyrAsn: 2.097 ± 0.148
TyrPro: 1.168 ± 0.099
TyrGln: 1.28 ± 0.096
TyrArg: 1.333 ± 0.095
TyrSer: 2.86 ± 0.155
TyrThr: 2.186 ± 0.153
TyrVal: 1.977 ± 0.118
TyrTrp: 0.344 ± 0.055
TyrTyr: 1.445 ± 0.119
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 456 proteins (133552 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
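One way such an error estimate could be obtained is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those values is reported. This is only a minimal sketch under stated assumptions. The function names, the use of the sample standard deviation as the error, the fixed random seed, and the toy sequences are all hypothetical and not taken from the database's implementation.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def permille(counters, dipeptide):
    """Frequency of one dipeptide (per mille) over a list of per-protein counters."""
    total = sum(sum(c.values()) for c in counters)
    hits = sum(c[dipeptide] for c in counters)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (point estimate, bootstrap standard deviation) in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        estimates.append(permille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the resampled estimates.
    return permille(per_protein, dipeptide), statistics.stdev(estimates)

# Toy, made-up sequences purely for illustration.
toy = ["MKLLA", "MKLV", "ALLK", "GGLL"]
point, err = bootstrap_error(toy, "LL")
print(f"LL: {point:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.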

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski