Amino acid dipeptide frequency for Spirochaetia bacterium

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
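As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping residue pairs within each protein and normalizes by the total number of pairs. Both choices are assumptions: this page does not state how pairs are counted or which denominator is used. The function name `dipeptide_frequencies`, the toy sequences, and the use of one-letter codes are hypothetical and for illustration only.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs (0,1), (1,2), ...; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy proteins in one-letter code, not taken from this proteome.
toy_proteins = ["MAAL", "AAG"]
freqs = dipeptide_frequencies(toy_proteins)
```

Here the toy input yields five pairs in total (MA, AA, AL, AA, AG), so `freqs["AA"]` is two fifths of the total, expressed in per mille.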

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 amino acids equally frequent, each dipeptide would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
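The arithmetic behind the expected value can be sketched as follows. The uniform baseline is 1/400 for 20 amino acids, or 1/441 when Xaa is included. The `mark` helper is a hypothetical illustration of the red/blue rule; the exact threshold used for colouring on this page is not stated, so a plain comparison against the uniform baseline is an assumption.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected dipeptide frequency (per mille) if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def mark(observed_per_mille, expected_per_mille):
    """Hypothetical colouring rule: red if above the baseline, blue if below."""
    if observed_per_mille > expected_per_mille:
        return "red"
    if observed_per_mille < expected_per_mille:
        return "blue"
    return "none"

expected = uniform_expectation_per_mille(20)      # 1000 / 400 = 2.5 per mille
expected_with_xaa = uniform_expectation_per_mille(21)  # 1000 / 441

# Two values taken from the table: AlaAla and CysCys.
ala_ala_mark = mark(13.085, expected)
cys_cys_mark = mark(0.1, expected)
```

Under this assumed rule, AlaAla (13.085‰) falls above the 2.5‰ baseline and CysCys (0.1‰) falls below it.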

All values are given in per mille (‰); multiply a value by 10⁻³ to obtain the corresponding fraction.
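For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0

ala_ala = 13.085  # AlaAla frequency in per mille, from the table
ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_percent = per_mille_to_percent(ala_ala)
```

So 13.085‰ corresponds to a fraction of about 0.013085, or roughly 1.3085%.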

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.085 ± 0.597
AlaCys: 1.115 ± 0.143
AlaAsp: 6.707 ± 0.322
AlaGlu: 5.863 ± 0.326
AlaPhe: 4.276 ± 0.279
AlaGly: 8.551 ± 0.384
AlaHis: 2.031 ± 0.182
AlaIle: 6.464 ± 0.321
AlaLys: 4.133 ± 0.248
AlaLeu: 11.226 ± 0.407
AlaMet: 3.604 ± 0.239
AlaAsn: 3.046 ± 0.212
AlaPro: 4.805 ± 0.216
AlaGln: 3.804 ± 0.21
AlaArg: 6.978 ± 0.299
AlaSer: 7.121 ± 0.357
AlaThr: 6.378 ± 0.334
AlaVal: 8.923 ± 0.329
AlaTrp: 1.43 ± 0.147
AlaTyr: 2.417 ± 0.178
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.015 ± 0.131
CysCys: 0.1 ± 0.037
CysAsp: 0.515 ± 0.088
CysGlu: 0.372 ± 0.08
CysPhe: 0.329 ± 0.071
CysGly: 0.787 ± 0.124
CysHis: 0.329 ± 0.063
CysIle: 0.501 ± 0.096
CysLys: 0.215 ± 0.052
CysLeu: 0.801 ± 0.124
CysMet: 0.2 ± 0.053
CysAsn: 0.257 ± 0.074
CysPro: 0.443 ± 0.086
CysGln: 0.272 ± 0.062
CysArg: 0.586 ± 0.09
CysSer: 0.486 ± 0.074
CysThr: 0.486 ± 0.085
CysVal: 0.701 ± 0.083
CysTrp: 0.157 ± 0.051
CysTyr: 0.257 ± 0.071
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.263 ± 0.359
AspCys: 0.486 ± 0.073
AspAsp: 2.803 ± 0.218
AspGlu: 2.831 ± 0.206
AspPhe: 2.045 ± 0.203
AspGly: 4.862 ± 0.286
AspHis: 1.416 ± 0.147
AspIle: 2.932 ± 0.206
AspLys: 1.816 ± 0.141
AspLeu: 6.221 ± 0.313
AspMet: 1.43 ± 0.141
AspAsn: 1.487 ± 0.165
AspPro: 3.518 ± 0.217
AspGln: 1.931 ± 0.156
AspArg: 3.961 ± 0.263
AspSer: 2.66 ± 0.191
AspThr: 2.889 ± 0.261
AspVal: 4.333 ± 0.256
AspTrp: 0.815 ± 0.112
AspTyr: 1.73 ± 0.154
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.092 ± 0.338
GluCys: 0.429 ± 0.083
GluAsp: 2.517 ± 0.214
GluGlu: 2.574 ± 0.24
GluPhe: 1.373 ± 0.146
GluGly: 3.99 ± 0.237
GluHis: 1.258 ± 0.129
GluIle: 3.089 ± 0.245
GluLys: 2.131 ± 0.184
GluLeu: 5.019 ± 0.299
GluMet: 1.459 ± 0.162
GluAsn: 1.516 ± 0.163
GluPro: 2.116 ± 0.148
GluGln: 1.873 ± 0.163
GluArg: 4.147 ± 0.305
GluSer: 2.46 ± 0.181
GluThr: 3.089 ± 0.232
GluVal: 3.618 ± 0.232
GluTrp: 0.844 ± 0.12
GluTyr: 1.158 ± 0.111
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.462 ± 0.231
PheCys: 0.372 ± 0.076
PheAsp: 2.831 ± 0.186
PheGlu: 1.873 ± 0.193
PhePhe: 1.316 ± 0.125
PheGly: 3.461 ± 0.232
PheHis: 0.787 ± 0.109
PheIle: 1.988 ± 0.2
PheLys: 0.93 ± 0.101
PheLeu: 3.461 ± 0.228
PheMet: 0.944 ± 0.112
PheAsn: 1.201 ± 0.124
PhePro: 1.459 ± 0.13
PheGln: 0.93 ± 0.111
PheArg: 2.317 ± 0.21
PheSer: 2.46 ± 0.187
PheThr: 1.945 ± 0.163
PheVal: 2.774 ± 0.219
PheTrp: 0.658 ± 0.105
PheTyr: 0.93 ± 0.119
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.965 ± 0.418
GlyCys: 0.829 ± 0.104
GlyAsp: 3.933 ± 0.302
GlyGlu: 3.99 ± 0.238
GlyPhe: 3.618 ± 0.26
GlyGly: 6.349 ± 0.409
GlyHis: 1.645 ± 0.153
GlyIle: 5.048 ± 0.321
GlyLys: 3.26 ± 0.232
GlyLeu: 9.61 ± 0.412
GlyMet: 2.188 ± 0.165
GlyAsn: 2.56 ± 0.288
GlyPro: 2.86 ± 0.206
GlyGln: 2.488 ± 0.22
GlyArg: 5.52 ± 0.283
GlySer: 4.633 ± 0.254
GlyThr: 4.562 ± 0.28
GlyVal: 5.32 ± 0.29
GlyTrp: 1.401 ± 0.176
GlyTyr: 2.503 ± 0.172
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.045 ± 0.182
HisCys: 0.257 ± 0.071
HisAsp: 1.073 ± 0.127
HisGlu: 1.001 ± 0.109
HisPhe: 0.815 ± 0.113
HisGly: 1.516 ± 0.157
HisHis: 0.515 ± 0.097
HisIle: 0.872 ± 0.105
HisLys: 0.629 ± 0.082
HisLeu: 2.302 ± 0.171
HisMet: 0.543 ± 0.082
HisAsn: 0.501 ± 0.094
HisPro: 1.53 ± 0.163
HisGln: 0.572 ± 0.091
HisArg: 1.273 ± 0.15
HisSer: 1.101 ± 0.124
HisThr: 1.144 ± 0.141
HisVal: 1.602 ± 0.176
HisTrp: 0.343 ± 0.08
HisTyr: 0.543 ± 0.085
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.264 ± 0.349
IleCys: 0.515 ± 0.083
IleAsp: 4.147 ± 0.248
IleGlu: 3.546 ± 0.23
IlePhe: 1.959 ± 0.168
IleGly: 5.034 ± 0.315
IleHis: 0.772 ± 0.108
IleIle: 2.402 ± 0.191
IleLys: 1.73 ± 0.165
IleLeu: 4.976 ± 0.28
IleMet: 1.173 ± 0.122
IleAsn: 1.602 ± 0.159
IlePro: 2.474 ± 0.203
IleGln: 1.502 ± 0.143
IleArg: 3.546 ± 0.228
IleSer: 3.546 ± 0.231
IleThr: 3.732 ± 0.269
IleVal: 4.905 ± 0.243
IleTrp: 0.729 ± 0.126
IleTyr: 1.101 ± 0.125
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.19 ± 0.233
LysCys: 0.243 ± 0.06
LysAsp: 1.687 ± 0.155
LysGlu: 1.416 ± 0.144
LysPhe: 0.944 ± 0.12
LysGly: 2.617 ± 0.2
LysHis: 0.644 ± 0.094
LysIle: 1.859 ± 0.196
LysLys: 1.273 ± 0.167
LysLeu: 3.704 ± 0.242
LysMet: 0.944 ± 0.124
LysAsn: 0.801 ± 0.113
LysPro: 2.131 ± 0.15
LysGln: 1.23 ± 0.124
LysArg: 2.574 ± 0.182
LysSer: 2.388 ± 0.208
LysThr: 2.345 ± 0.167
LysVal: 2.545 ± 0.187
LysTrp: 0.443 ± 0.085
LysTyr: 0.744 ± 0.11
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.142 ± 0.46
LeuCys: 0.858 ± 0.1
LeuAsp: 5.248 ± 0.249
LeuGlu: 4.833 ± 0.28
LeuPhe: 3.289 ± 0.251
LeuGly: 8.051 ± 0.402
LeuHis: 1.83 ± 0.166
LeuIle: 5.591 ± 0.305
LeuLys: 3.675 ± 0.225
LeuLeu: 9.939 ± 0.441
LeuMet: 2.188 ± 0.166
LeuAsn: 3.017 ± 0.187
LeuPro: 5.205 ± 0.284
LeuGln: 3.289 ± 0.27
LeuArg: 6.464 ± 0.353
LeuSer: 7.836 ± 0.376
LeuThr: 6.349 ± 0.302
LeuVal: 7.465 ± 0.336
LeuTrp: 1.101 ± 0.167
LeuTyr: 2.045 ± 0.162
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.96 ± 0.211
MetCys: 0.2 ± 0.043
MetAsp: 1.144 ± 0.142
MetGlu: 1.144 ± 0.138
MetPhe: 1.073 ± 0.143
MetGly: 1.773 ± 0.161
MetHis: 0.4 ± 0.073
MetIle: 1.487 ± 0.152
MetLys: 1.001 ± 0.118
MetLeu: 2.96 ± 0.26
MetMet: 0.758 ± 0.107
MetAsn: 0.772 ± 0.098
MetPro: 1.559 ± 0.159
MetGln: 0.872 ± 0.126
MetArg: 1.959 ± 0.209
MetSer: 1.959 ± 0.162
MetThr: 1.659 ± 0.148
MetVal: 1.902 ± 0.176
MetTrp: 0.243 ± 0.059
MetTyr: 0.315 ± 0.059
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.132 ± 0.233
AsnCys: 0.272 ± 0.073
AsnAsp: 1.559 ± 0.19
AsnGlu: 1.287 ± 0.128
AsnPhe: 1.073 ± 0.12
AsnGly: 2.932 ± 0.217
AsnHis: 0.644 ± 0.091
AsnIle: 1.673 ± 0.167
AsnLys: 0.972 ± 0.111
AsnLeu: 3.003 ± 0.211
AsnMet: 0.629 ± 0.1
AsnAsn: 0.872 ± 0.108
AsnPro: 1.645 ± 0.15
AsnGln: 0.958 ± 0.122
AsnArg: 2.031 ± 0.172
AsnSer: 1.773 ± 0.179
AsnThr: 1.587 ± 0.157
AsnVal: 2.288 ± 0.181
AsnTrp: 0.486 ± 0.075
AsnTyr: 0.829 ± 0.119
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.334 ± 0.254
ProCys: 0.343 ± 0.057
ProAsp: 3.26 ± 0.205
ProGlu: 3.089 ± 0.216
ProPhe: 2.102 ± 0.177
ProGly: 3.832 ± 0.205
ProHis: 1.158 ± 0.117
ProIle: 2.646 ± 0.192
ProLys: 1.673 ± 0.167
ProLeu: 4.404 ± 0.293
ProMet: 1.101 ± 0.134
ProAsn: 1.645 ± 0.146
ProPro: 2.188 ± 0.213
ProGln: 1.602 ± 0.166
ProArg: 2.545 ± 0.172
ProSer: 3.389 ± 0.209
ProThr: 2.96 ± 0.245
ProVal: 4.219 ± 0.241
ProTrp: 0.686 ± 0.089
ProTyr: 1.344 ± 0.156
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.947 ± 0.258
GlnCys: 0.257 ± 0.066
GlnAsp: 1.702 ± 0.151
GlnGlu: 1.502 ± 0.163
GlnPhe: 1.173 ± 0.128
GlnGly: 2.159 ± 0.18
GlnHis: 0.715 ± 0.113
GlnIle: 2.045 ± 0.195
GlnLys: 1.101 ± 0.138
GlnLeu: 2.889 ± 0.233
GlnMet: 1.144 ± 0.141
GlnAsn: 0.829 ± 0.104
GlnPro: 1.816 ± 0.168
GlnGln: 1.173 ± 0.127
GlnArg: 2.617 ± 0.195
GlnSer: 1.673 ± 0.175
GlnThr: 2.002 ± 0.179
GlnVal: 2.302 ± 0.188
GlnTrp: 0.443 ± 0.076
GlnTyr: 0.972 ± 0.109
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.078 ± 0.33
ArgCys: 0.572 ± 0.096
ArgAsp: 3.632 ± 0.236
ArgGlu: 3.432 ± 0.256
ArgPhe: 2.96 ± 0.202
ArgGly: 4.147 ± 0.224
ArgHis: 1.573 ± 0.147
ArgIle: 4.104 ± 0.245
ArgLys: 2.588 ± 0.19
ArgLeu: 7.665 ± 0.442
ArgMet: 2.131 ± 0.172
ArgAsn: 2.002 ± 0.155
ArgPro: 3.361 ± 0.234
ArgGln: 2.603 ± 0.254
ArgArg: 5.22 ± 0.426
ArgSer: 3.933 ± 0.256
ArgThr: 3.303 ± 0.201
ArgVal: 4.519 ± 0.276
ArgTrp: 1.073 ± 0.128
ArgTyr: 2.116 ± 0.152
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.907 ± 0.258
SerCys: 0.543 ± 0.094
SerAsp: 4.018 ± 0.249
SerGlu: 3.461 ± 0.232
SerPhe: 2.402 ± 0.194
SerGly: 6.078 ± 0.35
SerHis: 1.258 ± 0.132
SerIle: 3.961 ± 0.242
SerLys: 2.031 ± 0.165
SerLeu: 6.306 ± 0.354
SerMet: 1.544 ± 0.158
SerAsn: 1.931 ± 0.185
SerPro: 3.046 ± 0.217
SerGln: 2.217 ± 0.183
SerArg: 3.546 ± 0.232
SerSer: 4.347 ± 0.279
SerThr: 3.232 ± 0.23
SerVal: 4.605 ± 0.273
SerTrp: 0.801 ± 0.108
SerTyr: 1.673 ± 0.174
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.406 ± 0.306
ThrCys: 0.372 ± 0.086
ThrAsp: 3.446 ± 0.208
ThrGlu: 2.302 ± 0.183
ThrPhe: 2.274 ± 0.192
ThrGly: 4.948 ± 0.333
ThrHis: 1.287 ± 0.157
ThrIle: 3.618 ± 0.242
ThrLys: 1.902 ± 0.198
ThrLeu: 5.949 ± 0.286
ThrMet: 1.373 ± 0.127
ThrAsn: 1.745 ± 0.173
ThrPro: 3.432 ± 0.236
ThrGln: 1.759 ± 0.146
ThrArg: 3.218 ± 0.242
ThrSer: 3.961 ± 0.229
ThrThr: 3.332 ± 0.225
ThrVal: 4.948 ± 0.274
ThrTrp: 0.858 ± 0.109
ThrTyr: 1.444 ± 0.148
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.037 ± 0.304
ValCys: 0.629 ± 0.107
ValAsp: 4.047 ± 0.25
ValGlu: 4.304 ± 0.227
ValPhe: 2.474 ± 0.186
ValGly: 5.849 ± 0.244
ValHis: 1.244 ± 0.129
ValIle: 4.347 ± 0.241
ValLys: 2.259 ± 0.199
ValLeu: 7.436 ± 0.387
ValMet: 1.83 ± 0.171
ValAsn: 2.46 ± 0.186
ValPro: 4.09 ± 0.245
ValGln: 2.045 ± 0.18
ValArg: 5.348 ± 0.308
ValSer: 5.691 ± 0.311
ValThr: 5.034 ± 0.275
ValVal: 6.321 ± 0.358
ValTrp: 0.93 ± 0.122
ValTyr: 1.53 ± 0.153
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.387 ± 0.12
TrpCys: 0.143 ± 0.043
TrpAsp: 0.572 ± 0.078
TrpGlu: 0.601 ± 0.099
TrpPhe: 0.672 ± 0.084
TrpGly: 0.93 ± 0.135
TrpHis: 0.257 ± 0.056
TrpIle: 0.944 ± 0.118
TrpLys: 0.543 ± 0.093
TrpLeu: 1.459 ± 0.158
TrpMet: 0.386 ± 0.071
TrpAsn: 0.529 ± 0.096
TrpPro: 0.829 ± 0.103
TrpGln: 0.558 ± 0.075
TrpArg: 1.173 ± 0.134
TrpSer: 0.758 ± 0.113
TrpThr: 1.058 ± 0.118
TrpVal: 0.744 ± 0.1
TrpTrp: 0.215 ± 0.06
TrpTyr: 0.272 ± 0.062
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.245 ± 0.182
TyrCys: 0.272 ± 0.069
TyrAsp: 1.587 ± 0.184
TyrGlu: 1.33 ± 0.142
TyrPhe: 0.93 ± 0.115
TyrGly: 2.417 ± 0.206
TyrHis: 0.415 ± 0.071
TyrIle: 1.087 ± 0.132
TyrLys: 0.858 ± 0.097
TyrLeu: 2.088 ± 0.194
TyrMet: 0.558 ± 0.089
TyrAsn: 0.872 ± 0.112
TyrPro: 1.015 ± 0.137
TyrGln: 0.787 ± 0.107
TyrArg: 1.988 ± 0.159
TyrSer: 1.73 ± 0.172
TyrThr: 1.502 ± 0.156
TyrVal: 1.873 ± 0.142
TyrTrp: 0.372 ± 0.069
TyrTyr: 0.601 ± 0.083
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 325 proteins (69931 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
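A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. This is only an illustration under stated assumptions. The page does not say whether the reported ± is a standard deviation, nor how pairs are counted or normalized; the function names, the toy sequences, and the fixed seed are all hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_frequency(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Sample standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, not individual residues.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(resample, pair))
    # Assumption: the error is summarized as a standard deviation.
    return statistics.stdev(estimates)

# Hypothetical toy proteins in one-letter code, not taken from this proteome.
toy_proteins = ["MAAL", "AAG", "MKV", "GAAA"]
err = bootstrap_error(toy_proteins, "AA")
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects variation between proteins rather than between individual residue pairs.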

You can download the dipeptide statistics above (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski