Amino acid dipepetide frequency for Candidatus Walczuchella monophlebidarum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.477AlaAla: 2.477 ± 0.184
0.559AlaCys: 0.559 ± 0.084
1.627AlaAsp: 1.627 ± 0.161
2.501AlaGlu: 2.501 ± 0.159
2.21AlaPhe: 2.21 ± 0.162
2.865AlaGly: 2.865 ± 0.215
1.068AlaHis: 1.068 ± 0.111
5.318AlaIle: 5.318 ± 0.294
3.982AlaLys: 3.982 ± 0.222
4.735AlaLeu: 4.735 ± 0.244
1.178AlaMet: 1.178 ± 0.131
2.404AlaAsn: 2.404 ± 0.2
1.214AlaPro: 1.214 ± 0.095
1.833AlaGln: 1.833 ± 0.144
2.27AlaArg: 2.27 ± 0.174
2.756AlaSer: 2.756 ± 0.179
2.295AlaThr: 2.295 ± 0.174
2.38AlaVal: 2.38 ± 0.198
0.413AlaTrp: 0.413 ± 0.059
1.918AlaTyr: 1.918 ± 0.135
0.0AlaXaa: 0.0 ± 0.0
Cys
0.498CysAla: 0.498 ± 0.08
0.121CysCys: 0.121 ± 0.034
0.498CysAsp: 0.498 ± 0.078
0.656CysGlu: 0.656 ± 0.068
0.801CysPhe: 0.801 ± 0.108
0.729CysGly: 0.729 ± 0.094
0.316CysHis: 0.316 ± 0.085
1.093CysIle: 1.093 ± 0.122
0.862CysLys: 0.862 ± 0.088
1.202CysLeu: 1.202 ± 0.121
0.267CysMet: 0.267 ± 0.061
0.486CysAsn: 0.486 ± 0.076
0.425CysPro: 0.425 ± 0.074
0.413CysGln: 0.413 ± 0.066
0.364CysArg: 0.364 ± 0.069
0.862CysSer: 0.862 ± 0.101
0.607CysThr: 0.607 ± 0.085
0.559CysVal: 0.559 ± 0.082
0.061CysTrp: 0.061 ± 0.024
0.389CysTyr: 0.389 ± 0.065
0.0CysXaa: 0.0 ± 0.0
Asp
1.931AspAla: 1.931 ± 0.141
0.316AspCys: 0.316 ± 0.069
1.882AspAsp: 1.882 ± 0.147
2.963AspGlu: 2.963 ± 0.211
2.793AspPhe: 2.793 ± 0.197
2.331AspGly: 2.331 ± 0.188
1.044AspHis: 1.044 ± 0.093
4.796AspIle: 4.796 ± 0.209
3.861AspLys: 3.861 ± 0.205
4.59AspLeu: 4.59 ± 0.244
0.741AspMet: 0.741 ± 0.108
2.064AspAsn: 2.064 ± 0.178
1.676AspPro: 1.676 ± 0.155
1.323AspGln: 1.323 ± 0.132
2.052AspArg: 2.052 ± 0.149
2.428AspSer: 2.428 ± 0.162
1.979AspThr: 1.979 ± 0.183
2.586AspVal: 2.586 ± 0.149
0.546AspTrp: 0.546 ± 0.075
2.392AspTyr: 2.392 ± 0.174
0.0AspXaa: 0.0 ± 0.0
Glu
3.303GluAla: 3.303 ± 0.207
0.741GluCys: 0.741 ± 0.088
2.562GluAsp: 2.562 ± 0.186
5.197GluGlu: 5.197 ± 0.297
2.489GluPhe: 2.489 ± 0.194
3.57GluGly: 3.57 ± 0.211
1.226GluHis: 1.226 ± 0.126
8.171GluIle: 8.171 ± 0.33
7.856GluLys: 7.856 ± 0.33
5.646GluLeu: 5.646 ± 0.296
1.323GluMet: 1.323 ± 0.124
4.286GluAsn: 4.286 ± 0.212
1.287GluPro: 1.287 ± 0.156
2.04GluGln: 2.04 ± 0.154
3.545GluArg: 3.545 ± 0.197
3.922GluSer: 3.922 ± 0.201
2.987GluThr: 2.987 ± 0.185
3.558GluVal: 3.558 ± 0.204
0.498GluTrp: 0.498 ± 0.068
2.44GluTyr: 2.44 ± 0.17
0.0GluXaa: 0.0 ± 0.0
Phe
1.554PheAla: 1.554 ± 0.132
0.704PheCys: 0.704 ± 0.103
2.525PheAsp: 2.525 ± 0.196
2.89PheGlu: 2.89 ± 0.18
2.89PhePhe: 2.89 ± 0.282
2.829PheGly: 2.829 ± 0.188
1.129PheHis: 1.129 ± 0.102
4.25PheIle: 4.25 ± 0.258
3.691PheLys: 3.691 ± 0.219
5.452PheLeu: 5.452 ± 0.31
1.008PheMet: 1.008 ± 0.12
2.295PheAsn: 2.295 ± 0.153
2.319PhePro: 2.319 ± 0.167
1.603PheGln: 1.603 ± 0.145
2.161PheArg: 2.161 ± 0.169
4.298PheSer: 4.298 ± 0.257
2.052PheThr: 2.052 ± 0.164
2.125PheVal: 2.125 ± 0.2
0.546PheTrp: 0.546 ± 0.105
2.343PheTyr: 2.343 ± 0.216
0.0PheXaa: 0.0 ± 0.0
Gly
3.315GlyAla: 3.315 ± 0.218
0.826GlyCys: 0.826 ± 0.112
2.938GlyAsp: 2.938 ± 0.186
3.278GlyGlu: 3.278 ± 0.201
2.975GlyPhe: 2.975 ± 0.174
3.873GlyGly: 3.873 ± 0.25
1.36GlyHis: 1.36 ± 0.115
6.666GlyIle: 6.666 ± 0.305
6.302GlyLys: 6.302 ± 0.28
5.002GlyLeu: 5.002 ± 0.265
1.542GlyMet: 1.542 ± 0.141
2.878GlyAsn: 2.878 ± 0.241
1.566GlyPro: 1.566 ± 0.14
1.748GlyGln: 1.748 ± 0.166
3.023GlyArg: 3.023 ± 0.193
3.679GlySer: 3.679 ± 0.223
3.412GlyThr: 3.412 ± 0.217
3.715GlyVal: 3.715 ± 0.229
0.559GlyTrp: 0.559 ± 0.106
2.598GlyTyr: 2.598 ± 0.155
0.0GlyXaa: 0.0 ± 0.0
His
1.323HisAla: 1.323 ± 0.106
0.291HisCys: 0.291 ± 0.054
0.692HisAsp: 0.692 ± 0.082
0.886HisGlu: 0.886 ± 0.1
1.251HisPhe: 1.251 ± 0.131
1.421HisGly: 1.421 ± 0.125
0.619HisHis: 0.619 ± 0.082
2.101HisIle: 2.101 ± 0.146
1.639HisLys: 1.639 ± 0.152
2.343HisLeu: 2.343 ± 0.204
0.449HisMet: 0.449 ± 0.079
1.166HisAsn: 1.166 ± 0.118
1.299HisPro: 1.299 ± 0.126
0.716HisGln: 0.716 ± 0.101
1.275HisArg: 1.275 ± 0.119
1.603HisSer: 1.603 ± 0.139
0.911HisThr: 0.911 ± 0.106
1.287HisVal: 1.287 ± 0.134
0.279HisTrp: 0.279 ± 0.053
0.935HisTyr: 0.935 ± 0.098
0.0HisXaa: 0.0 ± 0.0
Ile
4.942IleAla: 4.942 ± 0.312
1.251IleCys: 1.251 ± 0.128
5.379IleAsp: 5.379 ± 0.234
7.686IleGlu: 7.686 ± 0.339
4.699IlePhe: 4.699 ± 0.292
6.581IleGly: 6.581 ± 0.324
2.756IleHis: 2.756 ± 0.165
8.754IleIle: 8.754 ± 0.365
7.795IleLys: 7.795 ± 0.324
9.98IleLeu: 9.98 ± 0.369
1.833IleMet: 1.833 ± 0.144
4.905IleAsn: 4.905 ± 0.226
4.213IlePro: 4.213 ± 0.245
4.152IleGln: 4.152 ± 0.225
4.711IleArg: 4.711 ± 0.243
7.929IleSer: 7.929 ± 0.275
4.82IleThr: 4.82 ± 0.231
4.966IleVal: 4.966 ± 0.264
0.753IleTrp: 0.753 ± 0.098
3.667IleTyr: 3.667 ± 0.254
0.0IleXaa: 0.0 ± 0.0
Lys
3.606LysAla: 3.606 ± 0.202
0.692LysCys: 0.692 ± 0.095
3.727LysAsp: 3.727 ± 0.238
7.977LysGlu: 7.977 ± 0.345
2.938LysPhe: 2.938 ± 0.225
5.464LysGly: 5.464 ± 0.264
1.506LysHis: 1.506 ± 0.135
10.915LysIle: 10.915 ± 0.377
12.834LysLys: 12.834 ± 0.523
6.739LysLeu: 6.739 ± 0.256
2.137LysMet: 2.137 ± 0.189
6.909LysAsn: 6.909 ± 0.318
2.283LysPro: 2.283 ± 0.198
2.768LysGln: 2.768 ± 0.201
3.91LysArg: 3.91 ± 0.237
5.828LysSer: 5.828 ± 0.302
4.905LysThr: 4.905 ± 0.248
4.796LysVal: 4.796 ± 0.248
0.692LysTrp: 0.692 ± 0.084
3.995LysTyr: 3.995 ± 0.219
0.0LysXaa: 0.0 ± 0.0
Leu
4.262LeuAla: 4.262 ± 0.239
1.153LeuCys: 1.153 ± 0.137
4.043LeuAsp: 4.043 ± 0.22
6.277LeuGlu: 6.277 ± 0.266
4.675LeuPhe: 4.675 ± 0.282
5.901LeuGly: 5.901 ± 0.3
2.392LeuHis: 2.392 ± 0.175
8.196LeuIle: 8.196 ± 0.368
8.39LeuLys: 8.39 ± 0.268
9.009LeuLeu: 9.009 ± 0.385
1.943LeuMet: 1.943 ± 0.122
5.257LeuAsn: 5.257 ± 0.236
3.703LeuPro: 3.703 ± 0.195
3.509LeuGln: 3.509 ± 0.199
4.286LeuArg: 4.286 ± 0.252
8.268LeuSer: 8.268 ± 0.305
3.97LeuThr: 3.97 ± 0.213
4.468LeuVal: 4.468 ± 0.231
0.838LeuTrp: 0.838 ± 0.119
3.618LeuTyr: 3.618 ± 0.218
0.0LeuXaa: 0.0 ± 0.0
Met
1.299MetAla: 1.299 ± 0.123
0.182MetCys: 0.182 ± 0.052
0.947MetAsp: 0.947 ± 0.108
1.445MetGlu: 1.445 ± 0.122
0.923MetPhe: 0.923 ± 0.091
1.566MetGly: 1.566 ± 0.155
0.437MetHis: 0.437 ± 0.074
1.748MetIle: 1.748 ± 0.144
1.87MetLys: 1.87 ± 0.171
1.858MetLeu: 1.858 ± 0.161
0.498MetMet: 0.498 ± 0.072
1.603MetAsn: 1.603 ± 0.121
0.789MetPro: 0.789 ± 0.095
0.753MetGln: 0.753 ± 0.104
1.129MetArg: 1.129 ± 0.128
1.384MetSer: 1.384 ± 0.127
0.862MetThr: 0.862 ± 0.109
1.068MetVal: 1.068 ± 0.114
0.146MetTrp: 0.146 ± 0.039
0.765MetTyr: 0.765 ± 0.106
0.0MetXaa: 0.0 ± 0.0
Asn
2.489AsnAla: 2.489 ± 0.146
0.692AsnCys: 0.692 ± 0.107
1.809AsnAsp: 1.809 ± 0.13
3.169AsnGlu: 3.169 ± 0.18
2.878AsnPhe: 2.878 ± 0.176
2.623AsnGly: 2.623 ± 0.173
1.19AsnHis: 1.19 ± 0.116
6.277AsnIle: 6.277 ± 0.283
5.221AsnLys: 5.221 ± 0.284
5.342AsnLeu: 5.342 ± 0.277
1.323AsnMet: 1.323 ± 0.131
3.06AsnAsn: 3.06 ± 0.223
2.416AsnPro: 2.416 ± 0.145
1.748AsnGln: 1.748 ± 0.154
2.598AsnArg: 2.598 ± 0.198
3.157AsnSer: 3.157 ± 0.181
2.635AsnThr: 2.635 ± 0.16
2.926AsnVal: 2.926 ± 0.177
0.668AsnTrp: 0.668 ± 0.083
2.173AsnTyr: 2.173 ± 0.166
0.0AsnXaa: 0.0 ± 0.0
Pro
1.263ProAla: 1.263 ± 0.113
0.352ProCys: 0.352 ± 0.066
1.724ProAsp: 1.724 ± 0.135
2.355ProGlu: 2.355 ± 0.165
1.833ProPhe: 1.833 ± 0.181
2.125ProGly: 2.125 ± 0.161
0.765ProHis: 0.765 ± 0.095
4.019ProIle: 4.019 ± 0.195
3.193ProLys: 3.193 ± 0.215
3.254ProLeu: 3.254 ± 0.237
0.68ProMet: 0.68 ± 0.081
2.198ProAsn: 2.198 ± 0.196
0.886ProPro: 0.886 ± 0.093
0.959ProGln: 0.959 ± 0.09
1.263ProArg: 1.263 ± 0.125
2.659ProSer: 2.659 ± 0.178
1.372ProThr: 1.372 ± 0.132
2.003ProVal: 2.003 ± 0.166
0.425ProTrp: 0.425 ± 0.078
1.323ProTyr: 1.323 ± 0.11
0.0ProXaa: 0.0 ± 0.0
Gln
1.761GlnAla: 1.761 ± 0.144
0.243GlnCys: 0.243 ± 0.051
1.287GlnAsp: 1.287 ± 0.134
2.501GlnGlu: 2.501 ± 0.197
1.566GlnPhe: 1.566 ± 0.138
1.858GlnGly: 1.858 ± 0.157
0.765GlnHis: 0.765 ± 0.091
4.067GlnIle: 4.067 ± 0.209
3.327GlnLys: 3.327 ± 0.207
3.169GlnLeu: 3.169 ± 0.179
0.898GlnMet: 0.898 ± 0.1
1.967GlnAsn: 1.967 ± 0.151
0.789GlnPro: 0.789 ± 0.12
1.068GlnGln: 1.068 ± 0.142
1.724GlnArg: 1.724 ± 0.123
1.906GlnSer: 1.906 ± 0.159
1.214GlnThr: 1.214 ± 0.128
1.651GlnVal: 1.651 ± 0.169
0.449GlnTrp: 0.449 ± 0.089
1.421GlnTyr: 1.421 ± 0.126
0.0GlnXaa: 0.0 ± 0.0
Arg
2.149ArgAla: 2.149 ± 0.166
0.461ArgCys: 0.461 ± 0.065
1.931ArgAsp: 1.931 ± 0.134
3.436ArgGlu: 3.436 ± 0.198
2.465ArgPhe: 2.465 ± 0.163
2.538ArgGly: 2.538 ± 0.172
0.947ArgHis: 0.947 ± 0.116
4.747ArgIle: 4.747 ± 0.28
4.711ArgLys: 4.711 ± 0.253
4.675ArgLeu: 4.675 ± 0.251
1.226ArgMet: 1.226 ± 0.149
2.744ArgAsn: 2.744 ± 0.206
1.299ArgPro: 1.299 ± 0.151
1.481ArgGln: 1.481 ± 0.139
2.04ArgArg: 2.04 ± 0.166
3.084ArgSer: 3.084 ± 0.209
2.149ArgThr: 2.149 ± 0.147
2.538ArgVal: 2.538 ± 0.204
0.401ArgTrp: 0.401 ± 0.075
1.991ArgTyr: 1.991 ± 0.183
0.0ArgXaa: 0.0 ± 0.0
Ser
2.975SerAla: 2.975 ± 0.167
0.935SerCys: 0.935 ± 0.108
3.072SerAsp: 3.072 ± 0.196
4.116SerGlu: 4.116 ± 0.218
3.946SerPhe: 3.946 ± 0.22
5.015SerGly: 5.015 ± 0.251
1.493SerHis: 1.493 ± 0.143
6.678SerIle: 6.678 ± 0.323
5.913SerLys: 5.913 ± 0.323
6.678SerLeu: 6.678 ± 0.295
1.008SerMet: 1.008 ± 0.101
2.963SerAsn: 2.963 ± 0.168
2.598SerPro: 2.598 ± 0.192
2.416SerGln: 2.416 ± 0.176
3.46SerArg: 3.46 ± 0.195
4.917SerSer: 4.917 ± 0.261
3.218SerThr: 3.218 ± 0.2
4.031SerVal: 4.031 ± 0.208
0.68SerTrp: 0.68 ± 0.113
2.805SerTyr: 2.805 ± 0.193
0.0SerXaa: 0.0 ± 0.0
Thr
2.186ThrAla: 2.186 ± 0.176
0.401ThrCys: 0.401 ± 0.082
2.355ThrAsp: 2.355 ± 0.165
2.756ThrGlu: 2.756 ± 0.175
2.27ThrPhe: 2.27 ± 0.179
3.473ThrGly: 3.473 ± 0.213
1.141ThrHis: 1.141 ± 0.131
4.541ThrIle: 4.541 ± 0.225
3.715ThrLys: 3.715 ± 0.183
4.383ThrLeu: 4.383 ± 0.2
1.02ThrMet: 1.02 ± 0.108
2.246ThrAsn: 2.246 ± 0.218
1.991ThrPro: 1.991 ± 0.168
1.457ThrGln: 1.457 ± 0.126
2.016ThrArg: 2.016 ± 0.176
3.303ThrSer: 3.303 ± 0.186
2.453ThrThr: 2.453 ± 0.198
2.78ThrVal: 2.78 ± 0.184
0.34ThrTrp: 0.34 ± 0.071
1.712ThrTyr: 1.712 ± 0.142
0.0ThrXaa: 0.0 ± 0.0
Val
2.465ValAla: 2.465 ± 0.238
0.619ValCys: 0.619 ± 0.083
2.695ValAsp: 2.695 ± 0.183
3.266ValGlu: 3.266 ± 0.204
2.586ValPhe: 2.586 ± 0.187
3.375ValGly: 3.375 ± 0.203
1.166ValHis: 1.166 ± 0.116
4.954ValIle: 4.954 ± 0.234
4.857ValLys: 4.857 ± 0.255
5.452ValLeu: 5.452 ± 0.241
1.166ValMet: 1.166 ± 0.108
2.538ValAsn: 2.538 ± 0.147
1.979ValPro: 1.979 ± 0.164
1.773ValGln: 1.773 ± 0.17
2.683ValArg: 2.683 ± 0.183
3.849ValSer: 3.849 ± 0.204
2.27ValThr: 2.27 ± 0.159
2.878ValVal: 2.878 ± 0.222
0.401ValTrp: 0.401 ± 0.074
1.846ValTyr: 1.846 ± 0.156
0.0ValXaa: 0.0 ± 0.0
Trp
0.401TrpAla: 0.401 ± 0.079
0.146TrpCys: 0.146 ± 0.039
0.546TrpAsp: 0.546 ± 0.082
0.607TrpGlu: 0.607 ± 0.076
0.364TrpPhe: 0.364 ± 0.091
0.486TrpGly: 0.486 ± 0.078
0.194TrpHis: 0.194 ± 0.045
0.971TrpIle: 0.971 ± 0.101
1.056TrpLys: 1.056 ± 0.123
0.85TrpLeu: 0.85 ± 0.111
0.291TrpMet: 0.291 ± 0.057
0.498TrpAsn: 0.498 ± 0.078
0.243TrpPro: 0.243 ± 0.059
0.255TrpGln: 0.255 ± 0.058
0.376TrpArg: 0.376 ± 0.067
0.583TrpSer: 0.583 ± 0.103
0.389TrpThr: 0.389 ± 0.062
0.486TrpVal: 0.486 ± 0.083
0.158TrpTrp: 0.158 ± 0.044
0.401TrpTyr: 0.401 ± 0.069
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.809TyrAla: 1.809 ± 0.171
0.522TyrCys: 0.522 ± 0.072
2.052TyrAsp: 2.052 ± 0.144
2.683TyrGlu: 2.683 ± 0.194
2.113TyrPhe: 2.113 ± 0.168
2.635TyrGly: 2.635 ± 0.184
0.971TyrHis: 0.971 ± 0.104
3.509TyrIle: 3.509 ± 0.251
3.545TyrLys: 3.545 ± 0.188
3.8TyrLeu: 3.8 ± 0.191
0.729TyrMet: 0.729 ± 0.097
1.931TyrAsn: 1.931 ± 0.184
1.676TyrPro: 1.676 ± 0.138
1.615TyrGln: 1.615 ± 0.14
2.125TyrArg: 2.125 ± 0.132
2.513TyrSer: 2.513 ± 0.168
2.04TyrThr: 2.04 ± 0.139
2.016TyrVal: 2.016 ± 0.139
0.449TyrTrp: 0.449 ± 0.084
1.627TyrTyr: 1.627 ± 0.154
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 243 proteins (82362 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski