Amino acid dipeptide frequency for Acidilobus sp. SCGC AC-742_E15

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 should be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
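The counting described above can be sketched as follows. This is only an illustration under stated assumptions: the function names, the toy sequences, and the choice to count overlapping residue pairs within each protein (never across protein boundaries) are hypothetical and are not taken from the Proteome-pI pipeline.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters is mapped to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, n_letters=20):
    """Compare a frequency with the uniform expectation (2.5 per mille for 20 letters)."""
    expected = 1000.0 / n_letters ** 2
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not taken from this proteome.
toy = ["MALLAV", "MKLLA"]
freqs = dipeptide_permille(toy)
```

With values from the table, `classify(12.531)` (AlaLeu) gives `"over"` and `classify(0.513)` (AlaCys) gives `"under"`, which corresponds to the red and blue marking described above.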

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
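As a small worked example of the unit conversion, the sketch below converts one table value (AlaLeu, 12.531‰) to a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper name is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_leu = 12.531                          # AlaLeu value from the table, in per mille
fraction = permille_to_fraction(ala_leu)  # about 0.0125
percent = fraction * 100                  # about 1.25 %

uniform_permille = 1000 / 400             # 2.5 per mille if all 400 dipeptides were equally likely
fold = ala_leu / uniform_permille         # about 5 times the uniform expectation
```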

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.435 ± 0.376
AlaCys: 0.513 ± 0.052
AlaAsp: 3.271 ± 0.148
AlaGlu: 6.66 ± 0.206
AlaPhe: 3.389 ± 0.164
AlaGly: 6.677 ± 0.22
AlaHis: 1.122 ± 0.074
AlaIle: 5.2 ± 0.169
AlaLys: 4.337 ± 0.179
AlaLeu: 12.531 ± 0.285
AlaMet: 2.588 ± 0.129
AlaAsn: 1.979 ± 0.119
AlaPro: 3.688 ± 0.141
AlaGln: 2.081 ± 0.113
AlaArg: 6.446 ± 0.188
AlaSer: 7.421 ± 0.242
AlaThr: 3.688 ± 0.131
AlaVal: 8.391 ± 0.274
AlaTrp: 1.269 ± 0.083
AlaTyr: 3.677 ± 0.141
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.406 ± 0.049
CysCys: 0.079 ± 0.02
CysAsp: 0.321 ± 0.041
CysGlu: 0.412 ± 0.05
CysPhe: 0.147 ± 0.031
CysGly: 0.931 ± 0.094
CysHis: 0.124 ± 0.024
CysIle: 0.231 ± 0.039
CysLys: 0.242 ± 0.04
CysLeu: 0.496 ± 0.051
CysMet: 0.09 ± 0.025
CysAsn: 0.203 ± 0.036
CysPro: 0.801 ± 0.075
CysGln: 0.152 ± 0.03
CysArg: 0.547 ± 0.054
CysSer: 0.451 ± 0.057
CysThr: 0.203 ± 0.04
CysVal: 0.485 ± 0.051
CysTrp: 0.068 ± 0.018
CysTyr: 0.192 ± 0.032
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.536 ± 0.159
AspCys: 0.214 ± 0.032
AspAsp: 2.064 ± 0.11
AspGlu: 3.694 ± 0.196
AspPhe: 1.607 ± 0.088
AspGly: 3.006 ± 0.127
AspHis: 0.57 ± 0.06
AspIle: 2.549 ± 0.106
AspLys: 1.912 ± 0.132
AspLeu: 5.933 ± 0.197
AspMet: 1.009 ± 0.083
AspAsn: 1.038 ± 0.077
AspPro: 3.102 ± 0.16
AspGln: 0.727 ± 0.069
AspArg: 2.651 ± 0.132
AspSer: 1.985 ± 0.119
AspThr: 1.331 ± 0.076
AspVal: 5.16 ± 0.159
AspTrp: 0.603 ± 0.056
AspTyr: 1.906 ± 0.108
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.91 ± 0.279
GluCys: 0.316 ± 0.044
GluAsp: 3.102 ± 0.169
GluGlu: 6.136 ± 0.235
GluPhe: 1.94 ± 0.082
GluGly: 6.164 ± 0.199
GluHis: 0.857 ± 0.071
GluIle: 3.068 ± 0.183
GluLys: 2.921 ± 0.144
GluLeu: 7.929 ± 0.263
GluMet: 1.145 ± 0.085
GluAsn: 1.466 ± 0.094
GluPro: 3.147 ± 0.13
GluGln: 1.579 ± 0.1
GluArg: 5.442 ± 0.205
GluSer: 3.181 ± 0.148
GluThr: 2.267 ± 0.133
GluVal: 7.156 ± 0.247
GluTrp: 0.801 ± 0.071
GluTyr: 1.844 ± 0.107
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.504 ± 0.122
PheCys: 0.192 ± 0.039
PheAsp: 1.652 ± 0.095
PheGlu: 2.013 ± 0.117
PhePhe: 1.218 ± 0.105
PheGly: 2.414 ± 0.131
PheHis: 0.586 ± 0.061
PheIle: 2.199 ± 0.114
PheLys: 1.494 ± 0.099
PheLeu: 3.406 ± 0.171
PheMet: 1.049 ± 0.083
PheAsn: 1.325 ± 0.092
PhePro: 1.387 ± 0.085
PheGln: 0.66 ± 0.067
PheArg: 2.137 ± 0.123
PheSer: 2.442 ± 0.134
PheThr: 2.216 ± 0.104
PheVal: 2.617 ± 0.138
PheTrp: 0.406 ± 0.055
PheTyr: 1.489 ± 0.106
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.514 ± 0.205
GlyCys: 0.677 ± 0.073
GlyAsp: 3.389 ± 0.146
GlyGlu: 4.889 ± 0.189
GlyPhe: 3.164 ± 0.136
GlyGly: 6.468 ± 0.214
GlyHis: 1.466 ± 0.097
GlyIle: 3.964 ± 0.152
GlyLys: 3.677 ± 0.171
GlyLeu: 10.354 ± 0.312
GlyMet: 1.776 ± 0.095
GlyAsn: 1.957 ± 0.119
GlyPro: 3.818 ± 0.173
GlyGln: 1.94 ± 0.113
GlyArg: 5.459 ± 0.193
GlySer: 5.391 ± 0.189
GlyThr: 3.491 ± 0.131
GlyVal: 7.619 ± 0.221
GlyTrp: 1.275 ± 0.082
GlyTyr: 3.051 ± 0.141
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.094 ± 0.078
HisCys: 0.135 ± 0.027
HisAsp: 0.756 ± 0.073
HisGlu: 0.885 ± 0.078
HisPhe: 0.468 ± 0.051
HisGly: 1.291 ± 0.081
HisHis: 0.31 ± 0.049
HisIle: 0.761 ± 0.064
HisLys: 0.434 ± 0.049
HisLeu: 1.308 ± 0.069
HisMet: 0.372 ± 0.055
HisAsn: 0.35 ± 0.044
HisPro: 1.043 ± 0.081
HisGln: 0.316 ± 0.051
HisArg: 0.964 ± 0.075
HisSer: 0.665 ± 0.056
HisThr: 0.603 ± 0.06
HisVal: 1.472 ± 0.098
HisTrp: 0.152 ± 0.031
HisTyr: 0.513 ± 0.056
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.498 ± 0.216
IleCys: 0.254 ± 0.037
IleAsp: 2.949 ± 0.123
IleGlu: 3.688 ± 0.137
IlePhe: 1.506 ± 0.092
IleGly: 4.263 ± 0.153
IleHis: 0.722 ± 0.069
IleIle: 3.011 ± 0.185
IleLys: 2.515 ± 0.114
IleLeu: 4.004 ± 0.18
IleMet: 1.32 ± 0.097
IleAsn: 1.94 ± 0.124
IlePro: 2.414 ± 0.127
IleGln: 0.959 ± 0.084
IleArg: 3.344 ± 0.166
IleSer: 3.564 ± 0.128
IleThr: 2.893 ± 0.127
IleVal: 5.273 ± 0.159
IleTrp: 0.553 ± 0.058
IleTyr: 2.154 ± 0.115
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.782 ± 0.154
LysCys: 0.282 ± 0.045
LysAsp: 2.256 ± 0.127
LysGlu: 3.688 ± 0.176
LysPhe: 1.342 ± 0.087
LysGly: 4.263 ± 0.167
LysHis: 0.496 ± 0.045
LysIle: 1.872 ± 0.147
LysLys: 1.579 ± 0.121
LysLeu: 4.348 ± 0.176
LysMet: 0.846 ± 0.079
LysAsn: 0.97 ± 0.082
LysPro: 1.979 ± 0.121
LysGln: 0.778 ± 0.071
LysArg: 2.887 ± 0.141
LysSer: 2.171 ± 0.139
LysThr: 1.675 ± 0.114
LysVal: 5.431 ± 0.199
LysTrp: 0.468 ± 0.052
LysTyr: 1.765 ± 0.092
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.527 ± 0.313
LeuCys: 0.711 ± 0.074
LeuAsp: 4.726 ± 0.168
LeuGlu: 7.681 ± 0.26
LeuPhe: 3.372 ± 0.179
LeuGly: 9.22 ± 0.262
LeuHis: 1.449 ± 0.083
LeuIle: 5.679 ± 0.193
LeuLys: 5.51 ± 0.189
LeuLeu: 10.766 ± 0.343
LeuMet: 2.831 ± 0.12
LeuAsn: 3.045 ± 0.151
LeuPro: 5.013 ± 0.157
LeuGln: 2.295 ± 0.126
LeuArg: 9.243 ± 0.285
LeuSer: 8.549 ± 0.242
LeuThr: 5.668 ± 0.183
LeuVal: 9.226 ± 0.284
LeuTrp: 1.083 ± 0.074
LeuTyr: 3.948 ± 0.173
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.775 ± 0.129
MetCys: 0.113 ± 0.024
MetAsp: 0.942 ± 0.077
MetGlu: 1.263 ± 0.089
MetPhe: 0.716 ± 0.07
MetGly: 1.968 ± 0.096
MetHis: 0.271 ± 0.04
MetIle: 1.308 ± 0.086
MetLys: 1.184 ± 0.099
MetLeu: 1.957 ± 0.109
MetMet: 0.575 ± 0.06
MetAsn: 0.671 ± 0.073
MetPro: 1.5 ± 0.082
MetGln: 0.383 ± 0.042
MetArg: 1.647 ± 0.103
MetSer: 1.895 ± 0.097
MetThr: 1.421 ± 0.088
MetVal: 1.934 ± 0.114
MetTrp: 0.276 ± 0.039
MetTyr: 0.682 ± 0.07
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.239 ± 0.13
AsnCys: 0.164 ± 0.032
AsnAsp: 1.049 ± 0.095
AsnGlu: 1.624 ± 0.096
AsnPhe: 0.784 ± 0.079
AsnGly: 2.019 ± 0.136
AsnHis: 0.226 ± 0.037
AsnIle: 1.72 ± 0.103
AsnLys: 1.117 ± 0.086
AsnLeu: 2.904 ± 0.146
AsnMet: 0.699 ± 0.078
AsnAsn: 0.812 ± 0.071
AsnPro: 1.782 ± 0.1
AsnGln: 0.479 ± 0.057
AsnArg: 1.325 ± 0.088
AsnSer: 1.483 ± 0.111
AsnThr: 1.173 ± 0.08
AsnVal: 3.011 ± 0.16
AsnTrp: 0.305 ± 0.044
AsnTyr: 1.28 ± 0.109
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.536 ± 0.129
ProCys: 0.395 ± 0.046
ProAsp: 2.504 ± 0.138
ProGlu: 4.134 ± 0.156
ProPhe: 1.923 ± 0.128
ProGly: 4.23 ± 0.173
ProHis: 0.795 ± 0.062
ProIle: 2.402 ± 0.132
ProLys: 2.053 ± 0.116
ProLeu: 5.611 ± 0.196
ProMet: 1.173 ± 0.086
ProAsn: 1.331 ± 0.107
ProPro: 3.119 ± 0.13
ProGln: 1.478 ± 0.089
ProArg: 3.181 ± 0.132
ProSer: 3.756 ± 0.151
ProThr: 2.442 ± 0.112
ProVal: 4.269 ± 0.166
ProTrp: 0.953 ± 0.078
ProTyr: 2.256 ± 0.118
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.38 ± 0.131
GlnCys: 0.107 ± 0.025
GlnAsp: 0.919 ± 0.084
GlnGlu: 1.602 ± 0.118
GlnPhe: 0.609 ± 0.061
GlnGly: 2.087 ± 0.112
GlnHis: 0.265 ± 0.04
GlnIle: 0.75 ± 0.069
GlnLys: 0.829 ± 0.073
GlnLeu: 2.684 ± 0.132
GlnMet: 0.508 ± 0.054
GlnAsn: 0.44 ± 0.054
GlnPro: 1.026 ± 0.081
GlnGln: 0.665 ± 0.063
GlnArg: 1.534 ± 0.105
GlnSer: 1.043 ± 0.08
GlnThr: 0.773 ± 0.064
GlnVal: 2.12 ± 0.116
GlnTrp: 0.327 ± 0.05
GlnTyr: 0.835 ± 0.07
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.28 ± 0.241
ArgCys: 0.694 ± 0.08
ArgAsp: 3.22 ± 0.142
ArgGlu: 5.211 ± 0.185
ArgPhe: 2.182 ± 0.109
ArgGly: 6.062 ± 0.238
ArgHis: 0.801 ± 0.071
ArgIle: 2.735 ± 0.147
ArgLys: 2.78 ± 0.133
ArgLeu: 9.237 ± 0.26
ArgMet: 1.252 ± 0.08
ArgAsn: 1.184 ± 0.076
ArgPro: 4.049 ± 0.161
ArgGln: 1.534 ± 0.09
ArgArg: 5.888 ± 0.226
ArgSer: 4.117 ± 0.187
ArgThr: 2.543 ± 0.14
ArgVal: 6.158 ± 0.239
ArgTrp: 0.857 ± 0.063
ArgTyr: 2.464 ± 0.115
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.143 ± 0.183
SerCys: 0.598 ± 0.061
SerAsp: 2.572 ± 0.095
SerGlu: 3.976 ± 0.155
SerPhe: 2.487 ± 0.132
SerGly: 5.386 ± 0.185
SerHis: 0.914 ± 0.07
SerIle: 3.429 ± 0.14
SerLys: 2.746 ± 0.167
SerLeu: 8.234 ± 0.266
SerMet: 1.63 ± 0.089
SerAsn: 1.387 ± 0.1
SerPro: 3.818 ± 0.157
SerGln: 1.681 ± 0.112
SerArg: 5.059 ± 0.177
SerSer: 5.121 ± 0.213
SerThr: 2.949 ± 0.142
SerVal: 5.386 ± 0.173
SerTrp: 1.117 ± 0.084
SerTyr: 2.639 ± 0.113
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.128 ± 0.168
ThrCys: 0.327 ± 0.042
ThrAsp: 1.72 ± 0.092
ThrGlu: 2.166 ± 0.114
ThrPhe: 1.822 ± 0.1
ThrGly: 4.072 ± 0.155
ThrHis: 0.688 ± 0.06
ThrIle: 2.831 ± 0.152
ThrLys: 1.759 ± 0.101
ThrLeu: 5.121 ± 0.177
ThrMet: 1.077 ± 0.079
ThrAsn: 1.291 ± 0.086
ThrPro: 2.966 ± 0.154
ThrGln: 0.88 ± 0.079
ThrArg: 2.329 ± 0.131
ThrSer: 3.158 ± 0.15
ThrThr: 2.825 ± 0.237
ThrVal: 3.936 ± 0.169
ThrTrp: 0.564 ± 0.059
ThrTyr: 2.205 ± 0.128
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.047 ± 0.27
ValCys: 0.485 ± 0.068
ValAsp: 4.624 ± 0.206
ValGlu: 6.322 ± 0.23
ValPhe: 2.769 ± 0.131
ValGly: 6.029 ± 0.207
ValHis: 1.376 ± 0.079
ValIle: 6.401 ± 0.211
ValLys: 5.216 ± 0.194
ValLeu: 9.311 ± 0.259
ValMet: 2.29 ± 0.128
ValAsn: 3.147 ± 0.145
ValPro: 4.495 ± 0.163
ValGln: 1.765 ± 0.104
ValArg: 6.536 ± 0.23
ValSer: 6.361 ± 0.244
ValThr: 5.171 ± 0.205
ValVal: 8.352 ± 0.376
ValTrp: 0.739 ± 0.05
ValTyr: 3.434 ± 0.162
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.15 ± 0.086
TrpCys: 0.09 ± 0.023
TrpAsp: 0.598 ± 0.058
TrpGlu: 0.75 ± 0.068
TrpPhe: 0.451 ± 0.044
TrpGly: 0.976 ± 0.08
TrpHis: 0.242 ± 0.041
TrpIle: 0.564 ± 0.057
TrpLys: 0.412 ± 0.042
TrpLeu: 1.506 ± 0.115
TrpMet: 0.259 ± 0.038
TrpAsn: 0.367 ± 0.054
TrpPro: 0.671 ± 0.055
TrpGln: 0.305 ± 0.046
TrpArg: 0.874 ± 0.064
TrpSer: 0.897 ± 0.068
TrpThr: 0.649 ± 0.056
TrpVal: 1.055 ± 0.09
TrpTrp: 0.259 ± 0.042
TrpTyr: 0.519 ± 0.072
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.48 ± 0.144
TyrCys: 0.237 ± 0.038
TyrAsp: 1.963 ± 0.125
TyrGlu: 2.577 ± 0.133
TyrPhe: 1.511 ± 0.094
TyrGly: 2.82 ± 0.13
TyrHis: 0.615 ± 0.06
TyrIle: 2.149 ± 0.124
TyrLys: 1.162 ± 0.082
TyrLeu: 3.902 ± 0.187
TyrMet: 0.987 ± 0.083
TyrAsn: 1.291 ± 0.104
TyrPro: 1.72 ± 0.091
TyrGln: 0.846 ± 0.075
TyrArg: 2.758 ± 0.143
TyrSer: 2.617 ± 0.118
TyrThr: 1.884 ± 0.134
TyrVal: 3.818 ± 0.156
TyrTrp: 0.519 ± 0.058
TyrTyr: 1.782 ± 0.134
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 672 proteins (177325 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
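A protein-level bootstrap of this kind can be sketched as below. This is a minimal illustration, not the database's implementation: the function names are hypothetical, sequences are taken as one-letter strings, and it is an assumption that the reported error is the standard deviation of the per mille frequency across resamples.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Spread of one dipeptide's per mille frequency over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is summarised as the sample standard deviation.
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the pairs of one protein together, so the estimate reflects variation between proteins rather than between positions.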

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski