Amino acid dipeptide frequency for Candidatus Riesia sp. GBBU

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
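The counting described above can be sketched in Python. This is a minimal illustration, not necessarily how Proteome-pI computes its values: it assumes that dipeptides are counted as overlapping pairs within each protein, that any residue outside the 20 standard ones is mapped to Xaa, and that the red/blue marking compares each value with the uniform expectation of 2.5‰. The function names (`dipeptide_counts`, `dipeptide_per_mille`, `classify`) and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_counts(proteins):
    """Count overlapping dipeptides; non-standard residues become Xaa (assumption)."""
    counts = Counter()
    for seq in proteins:
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Pair residues within one sequence only, so no dipeptide spans two proteins.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    return counts


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


def classify(freq, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["MKKILLS", "MSKKXLA"]  # toy sequences, not real Riesia proteins
    freqs = dipeptide_per_mille(toy)
    for name in ("LysLys", "TrpTrp"):
        print(name, round(freqs[name], 3), classify(freqs[name]))
```

With the table values, LysLys (18.534‰) would be labelled "over" and TrpTrp (0.062‰) "under" under this assumed threshold.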

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
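As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the LysLys value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


lys_lys = 18.534  # LysLys frequency from the table, in per mille
print(f"fraction: {per_mille_to_fraction(lys_lys):.6f}")
print(f"percent:  {per_mille_to_percent(lys_lys):.4f}")
```

In the same units, the uniform expectation of 0.25% corresponds to 2.5‰, which is the number to compare the table entries against.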

For more information, see the sequence space article on Wikipedia.

1st\2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 1.616 ± 0.158 | 0.441 ± 0.069 | 1.191 ± 0.111 | 1.709 ± 0.117 | 1.423 ± 0.12 | 1.956 ± 0.153 | 0.541 ± 0.062 | 4.392 ± 0.198 | 3.518 ± 0.17 | 2.644 ± 0.148 | 0.874 ± 0.09 | 1.794 ± 0.136 | 0.619 ± 0.07 | 0.634 ± 0.074 | 1.577 ± 0.126 | 2.181 ± 0.145 | 1.43 ± 0.121 | 1.755 ± 0.133 | 0.186 ± 0.037 | 1.113 ± 0.088 | 0.0 ± 0.0
Cys | 0.41 ± 0.054 | 0.286 ± 0.04 | 0.518 ± 0.067 | 0.626 ± 0.069 | 0.874 ± 0.086 | 0.843 ± 0.085 | 0.24 ± 0.04 | 1.392 ± 0.102 | 1.384 ± 0.11 | 1.214 ± 0.088 | 0.255 ± 0.042 | 0.804 ± 0.093 | 0.271 ± 0.047 | 0.247 ± 0.049 | 0.425 ± 0.067 | 1.26 ± 0.103 | 0.503 ± 0.062 | 0.603 ± 0.067 | 0.162 ± 0.034 | 0.479 ± 0.075 | 0.0 ± 0.0
Asp | 1.168 ± 0.105 | 0.487 ± 0.06 | 1.314 ± 0.091 | 2.32 ± 0.138 | 2.675 ± 0.141 | 2.296 ± 0.142 | 0.688 ± 0.063 | 5.15 ± 0.19 | 3.704 ± 0.181 | 4.593 ± 0.211 | 0.866 ± 0.08 | 2.289 ± 0.134 | 1.307 ± 0.111 | 0.967 ± 0.101 | 1.856 ± 0.106 | 2.923 ± 0.15 | 1.531 ± 0.115 | 2.428 ± 0.155 | 0.379 ± 0.052 | 1.461 ± 0.11 | 0.0 ± 0.0
Glu | 1.825 ± 0.144 | 0.418 ± 0.062 | 2.312 ± 0.158 | 3.773 ± 0.246 | 2.451 ± 0.152 | 2.211 ± 0.137 | 0.727 ± 0.079 | 8.204 ± 0.277 | 8.823 ± 0.334 | 4.26 ± 0.179 | 1.168 ± 0.086 | 4.585 ± 0.201 | 0.804 ± 0.078 | 0.943 ± 0.101 | 2.467 ± 0.154 | 3.789 ± 0.176 | 2.474 ± 0.172 | 3.039 ± 0.223 | 0.371 ± 0.051 | 1.833 ± 0.115 | 0.0 ± 0.0
Phe | 1.253 ± 0.102 | 1.075 ± 0.099 | 2.119 ± 0.121 | 2.436 ± 0.149 | 4.678 ± 0.274 | 3.565 ± 0.199 | 1.291 ± 0.093 | 5.776 ± 0.289 | 4.245 ± 0.192 | 7.214 ± 0.276 | 1.121 ± 0.09 | 3.51 ± 0.158 | 2.065 ± 0.147 | 1.415 ± 0.108 | 2.018 ± 0.127 | 6.449 ± 0.271 | 2.119 ± 0.136 | 2.559 ± 0.134 | 0.534 ± 0.071 | 2.652 ± 0.155 | 0.0 ± 0.0
Gly | 2.103 ± 0.153 | 0.967 ± 0.096 | 2.211 ± 0.166 | 3.031 ± 0.169 | 2.776 ± 0.16 | 3.186 ± 0.188 | 1.268 ± 0.104 | 6.874 ± 0.267 | 6.062 ± 0.223 | 3.936 ± 0.18 | 1.245 ± 0.109 | 3.132 ± 0.18 | 1.16 ± 0.096 | 1.036 ± 0.093 | 2.296 ± 0.141 | 3.866 ± 0.176 | 2.668 ± 0.158 | 3.201 ± 0.166 | 0.441 ± 0.06 | 1.802 ± 0.128 | 0.0 ± 0.0
His | 0.564 ± 0.061 | 0.224 ± 0.038 | 0.626 ± 0.063 | 0.889 ± 0.086 | 0.967 ± 0.096 | 1.43 ± 0.112 | 0.286 ± 0.05 | 2.041 ± 0.125 | 1.585 ± 0.097 | 1.361 ± 0.101 | 0.271 ± 0.045 | 0.881 ± 0.086 | 0.82 ± 0.073 | 0.441 ± 0.07 | 0.82 ± 0.081 | 1.376 ± 0.114 | 0.68 ± 0.067 | 0.858 ± 0.093 | 0.162 ± 0.036 | 0.673 ± 0.069 | 0.0 ± 0.0
Ile | 4.462 ± 0.248 | 1.539 ± 0.103 | 6.224 ± 0.222 | 7.044 ± 0.254 | 7.09 ± 0.331 | 7.322 ± 0.303 | 2.25 ± 0.158 | 12.248 ± 0.376 | 12.665 ± 0.352 | 11.977 ± 0.361 | 2.227 ± 0.144 | 8.018 ± 0.23 | 3.913 ± 0.172 | 3.302 ± 0.166 | 5.103 ± 0.229 | 11.173 ± 0.284 | 4.794 ± 0.181 | 6.163 ± 0.215 | 0.82 ± 0.093 | 3.882 ± 0.177 | 0.008 ± 0.008
Lys | 2.521 ± 0.137 | 0.982 ± 0.092 | 4.717 ± 0.198 | 7.469 ± 0.273 | 6.503 ± 0.222 | 3.441 ± 0.209 | 1.369 ± 0.098 | 16.601 ± 0.448 | 18.534 ± 0.533 | 9.155 ± 0.29 | 2.474 ± 0.139 | 12.14 ± 0.377 | 1.84 ± 0.133 | 2.173 ± 0.129 | 4.933 ± 0.204 | 8.034 ± 0.232 | 4.817 ± 0.193 | 5.351 ± 0.227 | 0.92 ± 0.091 | 4.616 ± 0.198 | 0.0 ± 0.0
Leu | 2.961 ± 0.191 | 1.291 ± 0.104 | 3.99 ± 0.165 | 5.413 ± 0.218 | 4.995 ± 0.247 | 4.083 ± 0.168 | 1.516 ± 0.112 | 10.415 ± 0.315 | 10.81 ± 0.327 | 8.042 ± 0.293 | 1.995 ± 0.111 | 6.263 ± 0.26 | 2.66 ± 0.129 | 1.964 ± 0.122 | 3.797 ± 0.198 | 7.817 ± 0.268 | 3.549 ± 0.154 | 4.802 ± 0.184 | 0.611 ± 0.073 | 3.333 ± 0.167 | 0.0 ± 0.0
Met | 0.758 ± 0.079 | 0.201 ± 0.037 | 0.773 ± 0.075 | 1.083 ± 0.088 | 1.16 ± 0.1 | 0.881 ± 0.09 | 0.309 ± 0.043 | 2.474 ± 0.131 | 2.838 ± 0.138 | 1.887 ± 0.103 | 0.41 ± 0.071 | 1.786 ± 0.14 | 0.557 ± 0.072 | 0.541 ± 0.074 | 0.92 ± 0.081 | 1.577 ± 0.122 | 0.827 ± 0.075 | 1.013 ± 0.107 | 0.139 ± 0.031 | 0.711 ± 0.082 | 0.0 ± 0.0
Asn | 1.701 ± 0.123 | 0.881 ± 0.083 | 2.15 ± 0.12 | 3.742 ± 0.172 | 5.343 ± 0.24 | 3.851 ± 0.195 | 1.098 ± 0.093 | 8.428 ± 0.255 | 8.444 ± 0.31 | 6.727 ± 0.234 | 1.369 ± 0.108 | 4.539 ± 0.212 | 2.196 ± 0.132 | 1.871 ± 0.118 | 3.302 ± 0.154 | 5.32 ± 0.203 | 2.459 ± 0.145 | 3.317 ± 0.161 | 0.572 ± 0.067 | 2.776 ± 0.124 | 0.0 ± 0.0
Pro | 0.657 ± 0.072 | 0.325 ± 0.052 | 1.075 ± 0.092 | 1.554 ± 0.105 | 1.477 ± 0.114 | 1.709 ± 0.132 | 0.41 ± 0.06 | 3.51 ± 0.171 | 3.008 ± 0.182 | 1.949 ± 0.121 | 0.588 ± 0.065 | 1.848 ± 0.127 | 0.518 ± 0.061 | 0.348 ± 0.053 | 0.812 ± 0.076 | 1.894 ± 0.128 | 1.361 ± 0.108 | 1.809 ± 0.116 | 0.271 ± 0.044 | 1.052 ± 0.096 | 0.0 ± 0.0
Gln | 0.835 ± 0.087 | 0.224 ± 0.045 | 0.789 ± 0.08 | 1.485 ± 0.124 | 1.291 ± 0.087 | 0.943 ± 0.091 | 0.425 ± 0.059 | 3.232 ± 0.162 | 2.876 ± 0.162 | 1.786 ± 0.126 | 0.572 ± 0.06 | 1.577 ± 0.124 | 0.379 ± 0.047 | 0.363 ± 0.055 | 0.928 ± 0.077 | 1.477 ± 0.122 | 0.773 ± 0.07 | 1.044 ± 0.089 | 0.178 ± 0.042 | 1.005 ± 0.091 | 0.0 ± 0.0
Arg | 1.284 ± 0.109 | 0.479 ± 0.065 | 1.446 ± 0.113 | 2.374 ± 0.161 | 2.188 ± 0.123 | 1.786 ± 0.142 | 0.696 ± 0.078 | 5.219 ± 0.226 | 6.155 ± 0.216 | 3.108 ± 0.165 | 0.974 ± 0.084 | 3.711 ± 0.177 | 0.951 ± 0.079 | 0.99 ± 0.09 | 2.003 ± 0.134 | 3.17 ± 0.184 | 1.701 ± 0.11 | 2.095 ± 0.166 | 0.41 ± 0.061 | 1.485 ± 0.109 | 0.0 ± 0.0
Ser | 2.358 ± 0.143 | 1.098 ± 0.09 | 3.209 ± 0.152 | 4.09 ± 0.182 | 4.748 ± 0.247 | 4.779 ± 0.203 | 1.183 ± 0.095 | 9.936 ± 0.302 | 9.124 ± 0.299 | 7.864 ± 0.288 | 1.693 ± 0.117 | 4.748 ± 0.214 | 1.84 ± 0.11 | 1.438 ± 0.12 | 3.178 ± 0.159 | 7.044 ± 0.292 | 3.0 ± 0.171 | 4.748 ± 0.215 | 0.657 ± 0.075 | 3.317 ± 0.176 | 0.0 ± 0.0
Thr | 1.562 ± 0.109 | 0.526 ± 0.067 | 1.593 ± 0.116 | 2.389 ± 0.132 | 1.918 ± 0.134 | 2.884 ± 0.16 | 0.719 ± 0.079 | 4.794 ± 0.203 | 4.554 ± 0.172 | 3.518 ± 0.185 | 0.742 ± 0.077 | 2.59 ± 0.139 | 1.284 ± 0.088 | 0.866 ± 0.086 | 1.539 ± 0.108 | 2.498 ± 0.122 | 1.941 ± 0.142 | 2.699 ± 0.153 | 0.363 ± 0.049 | 1.43 ± 0.111 | 0.0 ± 0.0
Val | 2.095 ± 0.14 | 0.696 ± 0.067 | 2.235 ± 0.137 | 2.791 ± 0.172 | 2.876 ± 0.162 | 3.193 ± 0.199 | 1.059 ± 0.092 | 6.108 ± 0.265 | 5.103 ± 0.189 | 4.895 ± 0.172 | 0.959 ± 0.088 | 3.023 ± 0.165 | 1.709 ± 0.117 | 1.345 ± 0.116 | 2.126 ± 0.141 | 4.755 ± 0.204 | 2.181 ± 0.112 | 2.729 ± 0.179 | 0.402 ± 0.064 | 1.771 ± 0.137 | 0.0 ± 0.0
Trp | 0.224 ± 0.044 | 0.101 ± 0.026 | 0.232 ± 0.046 | 0.433 ± 0.063 | 0.348 ± 0.069 | 0.379 ± 0.053 | 0.101 ± 0.029 | 1.477 ± 0.111 | 1.013 ± 0.097 | 0.68 ± 0.095 | 0.294 ± 0.051 | 0.619 ± 0.077 | 0.217 ± 0.041 | 0.217 ± 0.041 | 0.425 ± 0.055 | 0.387 ± 0.067 | 0.302 ± 0.046 | 0.217 ± 0.041 | 0.062 ± 0.024 | 0.302 ± 0.059 | 0.0 ± 0.0
Tyr | 1.237 ± 0.095 | 0.557 ± 0.069 | 1.678 ± 0.117 | 1.825 ± 0.114 | 2.304 ± 0.141 | 2.521 ± 0.156 | 0.781 ± 0.079 | 3.951 ± 0.196 | 4.152 ± 0.216 | 3.495 ± 0.16 | 0.742 ± 0.076 | 2.227 ± 0.143 | 1.067 ± 0.09 | 1.083 ± 0.1 | 1.701 ± 0.116 | 3.062 ± 0.134 | 1.299 ± 0.105 | 1.624 ± 0.103 | 0.387 ± 0.051 | 1.546 ± 0.136 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.008 ± 0.008 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 402 proteins (129329 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
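A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's actual procedure: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. The function names and the `seed` parameter are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_frequency(proteins, dipeptide):
    """Per mille frequency of one dipeptide given in one-letter codes, e.g. 'KK'."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within a single protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKILLS", "MSKKKLA", "GGSGGA", "MLLKIK"]  # toy sequences, not real data
    freq = dipeptide_frequency(toy, "KK")
    err = bootstrap_error(toy, "KK")
    print(f"LysLys: {freq:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects protein-to-protein variation in composition.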

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski