Amino acid dipepetide frequency for Candidatus Mikella endobia

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.695AlaAla: 4.695 ± 0.264
0.745AlaCys: 0.745 ± 0.103
3.025AlaAsp: 3.025 ± 0.18
3.905AlaGlu: 3.905 ± 0.296
1.873AlaPhe: 1.873 ± 0.144
3.916AlaGly: 3.916 ± 0.236
1.275AlaHis: 1.275 ± 0.112
6.162AlaIle: 6.162 ± 0.264
4.65AlaLys: 4.65 ± 0.245
6.715AlaLeu: 6.715 ± 0.37
1.761AlaMet: 1.761 ± 0.183
2.957AlaAsn: 2.957 ± 0.234
1.975AlaPro: 1.975 ± 0.149
2.381AlaGln: 2.381 ± 0.166
3.657AlaArg: 3.657 ± 0.213
3.104AlaSer: 3.104 ± 0.193
3.059AlaThr: 3.059 ± 0.225
3.792AlaVal: 3.792 ± 0.192
0.418AlaTrp: 0.418 ± 0.058
1.682AlaTyr: 1.682 ± 0.146
0.0AlaXaa: 0.0 ± 0.0
Cys
0.745CysAla: 0.745 ± 0.084
0.226CysCys: 0.226 ± 0.047
0.677CysAsp: 0.677 ± 0.082
0.553CysGlu: 0.553 ± 0.074
0.463CysPhe: 0.463 ± 0.072
1.14CysGly: 1.14 ± 0.118
0.429CysHis: 0.429 ± 0.075
1.162CysIle: 1.162 ± 0.12
1.027CysLys: 1.027 ± 0.121
1.083CysLeu: 1.083 ± 0.101
0.282CysMet: 0.282 ± 0.057
0.892CysAsn: 0.892 ± 0.106
0.576CysPro: 0.576 ± 0.08
0.508CysGln: 0.508 ± 0.07
0.666CysArg: 0.666 ± 0.094
0.846CysSer: 0.846 ± 0.087
0.655CysThr: 0.655 ± 0.08
0.497CysVal: 0.497 ± 0.065
0.181CysTrp: 0.181 ± 0.045
0.598CysTyr: 0.598 ± 0.09
0.0CysXaa: 0.0 ± 0.0
Asp
2.641AspAla: 2.641 ± 0.186
0.553AspCys: 0.553 ± 0.086
1.794AspAsp: 1.794 ± 0.152
2.607AspGlu: 2.607 ± 0.192
2.291AspPhe: 2.291 ± 0.174
2.596AspGly: 2.596 ± 0.163
0.948AspHis: 0.948 ± 0.098
6.015AspIle: 6.015 ± 0.253
3.6AspLys: 3.6 ± 0.195
4.706AspLeu: 4.706 ± 0.269
1.038AspMet: 1.038 ± 0.093
2.991AspAsn: 2.991 ± 0.181
1.693AspPro: 1.693 ± 0.145
1.445AspGln: 1.445 ± 0.119
2.201AspArg: 2.201 ± 0.176
2.607AspSer: 2.607 ± 0.147
2.393AspThr: 2.393 ± 0.159
2.449AspVal: 2.449 ± 0.169
0.497AspTrp: 0.497 ± 0.086
1.84AspTyr: 1.84 ± 0.149
0.0AspXaa: 0.0 ± 0.0
Glu
3.95GluAla: 3.95 ± 0.22
0.655GluCys: 0.655 ± 0.082
2.347GluAsp: 2.347 ± 0.178
3.713GluGlu: 3.713 ± 0.274
2.099GluPhe: 2.099 ± 0.169
2.968GluGly: 2.968 ± 0.191
1.467GluHis: 1.467 ± 0.135
6.602GluIle: 6.602 ± 0.308
4.593GluLys: 4.593 ± 0.274
5.654GluLeu: 5.654 ± 0.266
1.49GluMet: 1.49 ± 0.154
3.194GluAsn: 3.194 ± 0.184
1.557GluPro: 1.557 ± 0.128
2.483GluGln: 2.483 ± 0.171
3.25GluArg: 3.25 ± 0.209
3.092GluSer: 3.092 ± 0.2
2.867GluThr: 2.867 ± 0.217
3.612GluVal: 3.612 ± 0.256
0.463GluTrp: 0.463 ± 0.075
1.828GluTyr: 1.828 ± 0.131
0.0GluXaa: 0.0 ± 0.0
Phe
2.054PheAla: 2.054 ± 0.143
0.767PheCys: 0.767 ± 0.09
2.302PheAsp: 2.302 ± 0.185
1.873PheGlu: 1.873 ± 0.167
1.636PhePhe: 1.636 ± 0.159
2.393PheGly: 2.393 ± 0.184
0.846PheHis: 0.846 ± 0.088
4.097PheIle: 4.097 ± 0.258
2.46PheLys: 2.46 ± 0.177
3.566PheLeu: 3.566 ± 0.208
1.027PheMet: 1.027 ± 0.106
2.596PheAsn: 2.596 ± 0.173
1.49PhePro: 1.49 ± 0.133
1.377PheGln: 1.377 ± 0.11
2.02PheArg: 2.02 ± 0.162
2.867PheSer: 2.867 ± 0.176
2.167PheThr: 2.167 ± 0.155
1.58PheVal: 1.58 ± 0.133
0.372PheTrp: 0.372 ± 0.075
1.715PheTyr: 1.715 ± 0.129
0.0PheXaa: 0.0 ± 0.0
Gly
3.375GlyAla: 3.375 ± 0.222
0.869GlyCys: 0.869 ± 0.102
2.765GlyAsp: 2.765 ± 0.19
3.25GlyGlu: 3.25 ± 0.188
2.788GlyPhe: 2.788 ± 0.17
3.837GlyGly: 3.837 ± 0.25
1.704GlyHis: 1.704 ± 0.153
7.178GlyIle: 7.178 ± 0.284
5.304GlyLys: 5.304 ± 0.231
4.988GlyLeu: 4.988 ± 0.228
1.445GlyMet: 1.445 ± 0.14
3.013GlyAsn: 3.013 ± 0.214
1.783GlyPro: 1.783 ± 0.13
2.122GlyGln: 2.122 ± 0.184
3.454GlyArg: 3.454 ± 0.239
3.442GlySer: 3.442 ± 0.218
3.363GlyThr: 3.363 ± 0.166
3.341GlyVal: 3.341 ± 0.207
0.542GlyTrp: 0.542 ± 0.082
2.11GlyTyr: 2.11 ± 0.147
0.0GlyXaa: 0.0 ± 0.0
His
1.287HisAla: 1.287 ± 0.126
0.406HisCys: 0.406 ± 0.079
0.903HisAsp: 0.903 ± 0.089
0.971HisGlu: 0.971 ± 0.119
0.937HisPhe: 0.937 ± 0.112
1.445HisGly: 1.445 ± 0.12
0.722HisHis: 0.722 ± 0.092
2.189HisIle: 2.189 ± 0.165
1.557HisLys: 1.557 ± 0.15
2.11HisLeu: 2.11 ± 0.154
0.474HisMet: 0.474 ± 0.062
1.478HisAsn: 1.478 ± 0.169
1.185HisPro: 1.185 ± 0.106
1.185HisGln: 1.185 ± 0.125
1.129HisArg: 1.129 ± 0.128
1.377HisSer: 1.377 ± 0.136
1.241HisThr: 1.241 ± 0.119
1.072HisVal: 1.072 ± 0.102
0.316HisTrp: 0.316 ± 0.057
0.846HisTyr: 0.846 ± 0.1
0.0HisXaa: 0.0 ± 0.0
Ile
7.291IleAla: 7.291 ± 0.329
1.433IleCys: 1.433 ± 0.129
5.97IleAsp: 5.97 ± 0.243
6.828IleGlu: 6.828 ± 0.302
3.6IlePhe: 3.6 ± 0.212
6.128IleGly: 6.128 ± 0.255
2.11IleHis: 2.11 ± 0.146
10.011IleIle: 10.011 ± 0.395
7.426IleLys: 7.426 ± 0.335
9.266IleLeu: 9.266 ± 0.377
2.009IleMet: 2.009 ± 0.162
6.93IleAsn: 6.93 ± 0.286
3.612IlePro: 3.612 ± 0.205
3.002IleGln: 3.002 ± 0.198
5.192IleArg: 5.192 ± 0.238
6.456IleSer: 6.456 ± 0.265
5.609IleThr: 5.609 ± 0.244
5.293IleVal: 5.293 ± 0.242
0.722IleTrp: 0.722 ± 0.099
3.284IleTyr: 3.284 ± 0.195
0.0IleXaa: 0.0 ± 0.0
Lys
3.984LysAla: 3.984 ± 0.239
0.677LysCys: 0.677 ± 0.093
2.799LysAsp: 2.799 ± 0.189
4.435LysGlu: 4.435 ± 0.236
2.618LysPhe: 2.618 ± 0.179
3.815LysGly: 3.815 ± 0.231
1.287LysHis: 1.287 ± 0.12
8.544LysIle: 8.544 ± 0.273
6.828LysLys: 6.828 ± 0.281
7.528LysLeu: 7.528 ± 0.298
2.02LysMet: 2.02 ± 0.143
5.88LysAsn: 5.88 ± 0.269
2.156LysPro: 2.156 ± 0.148
3.013LysGln: 3.013 ± 0.172
3.803LysArg: 3.803 ± 0.208
4.074LysSer: 4.074 ± 0.21
3.375LysThr: 3.375 ± 0.204
4.255LysVal: 4.255 ± 0.202
0.474LysTrp: 0.474 ± 0.071
2.539LysTyr: 2.539 ± 0.181
0.0LysXaa: 0.0 ± 0.0
Leu
6.839LeuAla: 6.839 ± 0.285
1.275LeuCys: 1.275 ± 0.129
4.932LeuAsp: 4.932 ± 0.234
6.489LeuGlu: 6.489 ± 0.235
3.736LeuPhe: 3.736 ± 0.206
5.417LeuGly: 5.417 ± 0.262
2.63LeuHis: 2.63 ± 0.172
8.521LeuIle: 8.521 ± 0.361
6.681LeuLys: 6.681 ± 0.247
10.496LeuLeu: 10.496 ± 0.511
2.246LeuMet: 2.246 ± 0.132
6.106LeuAsn: 6.106 ± 0.264
4.334LeuPro: 4.334 ± 0.24
4.221LeuGln: 4.221 ± 0.233
5.677LeuArg: 5.677 ± 0.255
6.173LeuSer: 6.173 ± 0.295
5.304LeuThr: 5.304 ± 0.217
5.688LeuVal: 5.688 ± 0.253
0.734LeuTrp: 0.734 ± 0.093
2.607LeuTyr: 2.607 ± 0.162
0.0LeuXaa: 0.0 ± 0.0
Met
1.58MetAla: 1.58 ± 0.142
0.169MetCys: 0.169 ± 0.048
0.959MetAsp: 0.959 ± 0.111
1.298MetGlu: 1.298 ± 0.123
0.779MetPhe: 0.779 ± 0.096
1.614MetGly: 1.614 ± 0.167
0.587MetHis: 0.587 ± 0.072
2.144MetIle: 2.144 ± 0.149
1.715MetLys: 1.715 ± 0.131
2.686MetLeu: 2.686 ± 0.15
0.587MetMet: 0.587 ± 0.101
1.456MetAsn: 1.456 ± 0.117
0.982MetPro: 0.982 ± 0.107
1.038MetGln: 1.038 ± 0.113
1.275MetArg: 1.275 ± 0.126
1.366MetSer: 1.366 ± 0.139
1.241MetThr: 1.241 ± 0.107
1.512MetVal: 1.512 ± 0.144
0.068MetTrp: 0.068 ± 0.024
0.677MetTyr: 0.677 ± 0.078
0.0MetXaa: 0.0 ± 0.0
Asn
3.092AsnAla: 3.092 ± 0.181
0.88AsnCys: 0.88 ± 0.101
2.81AsnAsp: 2.81 ± 0.179
3.013AsnGlu: 3.013 ± 0.171
2.641AsnPhe: 2.641 ± 0.186
3.352AsnGly: 3.352 ± 0.182
1.162AsnHis: 1.162 ± 0.122
7.37AsnIle: 7.37 ± 0.326
5.857AsnLys: 5.857 ± 0.265
5.666AsnLeu: 5.666 ± 0.221
1.399AsnMet: 1.399 ± 0.119
4.955AsnAsn: 4.955 ± 0.278
2.189AsnPro: 2.189 ± 0.159
2.088AsnGln: 2.088 ± 0.159
2.618AsnArg: 2.618 ± 0.151
3.566AsnSer: 3.566 ± 0.224
3.059AsnThr: 3.059 ± 0.211
2.776AsnVal: 2.776 ± 0.163
0.621AsnTrp: 0.621 ± 0.082
2.539AsnTyr: 2.539 ± 0.182
0.0AsnXaa: 0.0 ± 0.0
Pro
1.986ProAla: 1.986 ± 0.154
0.271ProCys: 0.271 ± 0.064
1.614ProAsp: 1.614 ± 0.125
2.426ProGlu: 2.426 ± 0.181
1.467ProPhe: 1.467 ± 0.121
2.291ProGly: 2.291 ± 0.162
0.79ProHis: 0.79 ± 0.103
3.318ProIle: 3.318 ± 0.196
2.415ProLys: 2.415 ± 0.176
3.352ProLeu: 3.352 ± 0.191
0.869ProMet: 0.869 ± 0.108
1.941ProAsn: 1.941 ± 0.152
1.004ProPro: 1.004 ± 0.103
1.377ProGln: 1.377 ± 0.137
1.433ProArg: 1.433 ± 0.131
2.133ProSer: 2.133 ± 0.163
1.749ProThr: 1.749 ± 0.13
2.585ProVal: 2.585 ± 0.161
0.384ProTrp: 0.384 ± 0.082
1.411ProTyr: 1.411 ± 0.143
0.0ProXaa: 0.0 ± 0.0
Gln
2.528GlnAla: 2.528 ± 0.171
0.519GlnCys: 0.519 ± 0.081
1.196GlnAsp: 1.196 ± 0.109
2.133GlnGlu: 2.133 ± 0.165
1.569GlnPhe: 1.569 ± 0.133
2.212GlnGly: 2.212 ± 0.158
1.095GlnHis: 1.095 ± 0.116
3.533GlnIle: 3.533 ± 0.199
2.46GlnLys: 2.46 ± 0.182
4.774GlnLeu: 4.774 ± 0.254
0.869GlnMet: 0.869 ± 0.093
1.84GlnAsn: 1.84 ± 0.158
1.275GlnPro: 1.275 ± 0.117
2.291GlnGln: 2.291 ± 0.154
2.235GlnArg: 2.235 ± 0.172
1.896GlnSer: 1.896 ± 0.109
1.535GlnThr: 1.535 ± 0.116
2.697GlnVal: 2.697 ± 0.17
0.497GlnTrp: 0.497 ± 0.083
1.456GlnTyr: 1.456 ± 0.112
0.0GlnXaa: 0.0 ± 0.0
Arg
3.228ArgAla: 3.228 ± 0.187
0.666ArgCys: 0.666 ± 0.087
2.393ArgAsp: 2.393 ± 0.159
3.013ArgGlu: 3.013 ± 0.186
2.426ArgPhe: 2.426 ± 0.162
3.375ArgGly: 3.375 ± 0.229
1.072ArgHis: 1.072 ± 0.104
5.293ArgIle: 5.293 ± 0.268
3.578ArgLys: 3.578 ± 0.173
5.237ArgLeu: 5.237 ± 0.249
1.287ArgMet: 1.287 ± 0.127
3.07ArgAsn: 3.07 ± 0.197
1.693ArgPro: 1.693 ± 0.132
2.359ArgGln: 2.359 ± 0.18
3.228ArgArg: 3.228 ± 0.206
2.957ArgSer: 2.957 ± 0.168
2.743ArgThr: 2.743 ± 0.163
2.799ArgVal: 2.799 ± 0.167
0.722ArgTrp: 0.722 ± 0.1
2.088ArgTyr: 2.088 ± 0.165
0.0ArgXaa: 0.0 ± 0.0
Ser
3.566SerAla: 3.566 ± 0.197
0.722SerCys: 0.722 ± 0.089
2.585SerAsp: 2.585 ± 0.153
2.889SerGlu: 2.889 ± 0.169
2.325SerPhe: 2.325 ± 0.156
4.435SerGly: 4.435 ± 0.219
1.162SerHis: 1.162 ± 0.105
5.88SerIle: 5.88 ± 0.289
4.277SerLys: 4.277 ± 0.217
5.699SerLeu: 5.699 ± 0.283
1.49SerMet: 1.49 ± 0.122
3.578SerAsn: 3.578 ± 0.193
1.693SerPro: 1.693 ± 0.135
1.907SerGln: 1.907 ± 0.14
3.149SerArg: 3.149 ± 0.185
3.578SerSer: 3.578 ± 0.203
3.025SerThr: 3.025 ± 0.17
3.397SerVal: 3.397 ± 0.229
0.734SerTrp: 0.734 ± 0.084
2.291SerTyr: 2.291 ± 0.146
0.0SerXaa: 0.0 ± 0.0
Thr
3.228ThrAla: 3.228 ± 0.186
0.655ThrCys: 0.655 ± 0.092
2.743ThrAsp: 2.743 ± 0.184
3.126ThrGlu: 3.126 ± 0.182
2.02ThrPhe: 2.02 ± 0.132
3.691ThrGly: 3.691 ± 0.207
1.106ThrHis: 1.106 ± 0.119
4.819ThrIle: 4.819 ± 0.246
3.341ThrLys: 3.341 ± 0.186
6.14ThrLeu: 6.14 ± 0.25
1.106ThrMet: 1.106 ± 0.112
2.867ThrAsn: 2.867 ± 0.177
1.998ThrPro: 1.998 ± 0.156
1.58ThrGln: 1.58 ± 0.125
2.415ThrArg: 2.415 ± 0.154
3.025ThrSer: 3.025 ± 0.204
2.743ThrThr: 2.743 ± 0.191
3.047ThrVal: 3.047 ± 0.206
0.395ThrTrp: 0.395 ± 0.072
1.467ThrTyr: 1.467 ± 0.128
0.0ThrXaa: 0.0 ± 0.0
Val
3.42ValAla: 3.42 ± 0.257
0.846ValCys: 0.846 ± 0.105
3.104ValAsp: 3.104 ± 0.197
3.329ValGlu: 3.329 ± 0.203
1.828ValPhe: 1.828 ± 0.129
3.341ValGly: 3.341 ± 0.222
1.23ValHis: 1.23 ± 0.12
5.609ValIle: 5.609 ± 0.225
3.849ValLys: 3.849 ± 0.225
5.079ValLeu: 5.079 ± 0.228
1.309ValMet: 1.309 ± 0.119
3.408ValAsn: 3.408 ± 0.19
2.359ValPro: 2.359 ± 0.148
1.772ValGln: 1.772 ± 0.154
3.228ValArg: 3.228 ± 0.197
3.341ValSer: 3.341 ± 0.217
3.521ValThr: 3.521 ± 0.186
3.262ValVal: 3.262 ± 0.215
0.497ValTrp: 0.497 ± 0.088
1.433ValTyr: 1.433 ± 0.14
0.0ValXaa: 0.0 ± 0.0
Trp
0.395TrpAla: 0.395 ± 0.065
0.124TrpCys: 0.124 ± 0.043
0.395TrpAsp: 0.395 ± 0.067
0.406TrpGlu: 0.406 ± 0.076
0.395TrpPhe: 0.395 ± 0.074
0.339TrpGly: 0.339 ± 0.074
0.237TrpHis: 0.237 ± 0.049
0.925TrpIle: 0.925 ± 0.111
0.542TrpLys: 0.542 ± 0.083
1.196TrpLeu: 1.196 ± 0.154
0.316TrpMet: 0.316 ± 0.063
0.508TrpAsn: 0.508 ± 0.069
0.282TrpPro: 0.282 ± 0.052
0.688TrpGln: 0.688 ± 0.102
0.621TrpArg: 0.621 ± 0.092
0.44TrpSer: 0.44 ± 0.069
0.305TrpThr: 0.305 ± 0.054
0.508TrpVal: 0.508 ± 0.075
0.102TrpTrp: 0.102 ± 0.032
0.44TrpTyr: 0.44 ± 0.087
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.862TyrAla: 1.862 ± 0.131
0.745TyrCys: 0.745 ± 0.092
1.591TyrAsp: 1.591 ± 0.148
1.625TyrGlu: 1.625 ± 0.136
1.715TyrPhe: 1.715 ± 0.16
2.37TyrGly: 2.37 ± 0.154
0.971TyrHis: 0.971 ± 0.1
2.799TyrIle: 2.799 ± 0.178
1.941TyrLys: 1.941 ± 0.129
4.165TyrLeu: 4.165 ± 0.197
0.7TyrMet: 0.7 ± 0.086
2.009TyrAsn: 2.009 ± 0.157
0.982TyrPro: 0.982 ± 0.095
1.772TyrGln: 1.772 ± 0.171
1.975TyrArg: 1.975 ± 0.153
2.043TyrSer: 2.043 ± 0.152
1.557TyrThr: 1.557 ± 0.143
1.603TyrVal: 1.603 ± 0.132
0.429TyrTrp: 0.429 ± 0.066
1.445TyrTyr: 1.445 ± 0.127
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 273 proteins (88606 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski