Amino acid dipepetide frequency for Halogeometricum sp. wsp3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.203AlaAla: 10.203 ± 0.41
1.331AlaCys: 1.331 ± 0.127
6.965AlaAsp: 6.965 ± 0.311
5.815AlaGlu: 5.815 ± 0.242
2.428AlaPhe: 2.428 ± 0.17
7.934AlaGly: 7.934 ± 0.282
1.64AlaHis: 1.64 ± 0.131
3.344AlaIle: 3.344 ± 0.181
1.331AlaLys: 1.331 ± 0.131
7.466AlaLeu: 7.466 ± 0.327
1.885AlaMet: 1.885 ± 0.138
2.077AlaAsn: 2.077 ± 0.157
3.76AlaPro: 3.76 ± 0.213
1.843AlaGln: 1.843 ± 0.127
7.998AlaArg: 7.998 ± 0.278
7.871AlaSer: 7.871 ± 0.358
7.253AlaThr: 7.253 ± 0.277
9.01AlaVal: 9.01 ± 0.298
0.895AlaTrp: 0.895 ± 0.095
1.299AlaTyr: 1.299 ± 0.118
0.0AlaXaa: 0.0 ± 0.0
Cys
1.033CysAla: 1.033 ± 0.117
0.49CysCys: 0.49 ± 0.069
0.895CysAsp: 0.895 ± 0.092
0.788CysGlu: 0.788 ± 0.082
0.32CysPhe: 0.32 ± 0.052
1.267CysGly: 1.267 ± 0.122
0.426CysHis: 0.426 ± 0.072
0.415CysIle: 0.415 ± 0.068
0.224CysLys: 0.224 ± 0.049
0.948CysLeu: 0.948 ± 0.1
0.362CysMet: 0.362 ± 0.057
0.266CysAsn: 0.266 ± 0.053
1.044CysPro: 1.044 ± 0.103
0.426CysGln: 0.426 ± 0.057
2.79CysArg: 2.79 ± 0.161
1.992CysSer: 1.992 ± 0.135
1.012CysThr: 1.012 ± 0.113
0.959CysVal: 0.959 ± 0.092
0.426CysTrp: 0.426 ± 0.064
0.224CysTyr: 0.224 ± 0.043
0.0CysXaa: 0.0 ± 0.0
Asp
6.869AspAla: 6.869 ± 0.358
0.99AspCys: 0.99 ± 0.097
5.655AspAsp: 5.655 ± 0.319
5.08AspGlu: 5.08 ± 0.282
1.427AspPhe: 1.427 ± 0.135
6.486AspGly: 6.486 ± 0.256
1.235AspHis: 1.235 ± 0.123
3.174AspIle: 3.174 ± 0.215
0.799AspLys: 0.799 ± 0.094
4.175AspLeu: 4.175 ± 0.213
1.182AspMet: 1.182 ± 0.107
1.161AspAsn: 1.161 ± 0.117
2.854AspPro: 2.854 ± 0.186
1.331AspGln: 1.331 ± 0.122
6.145AspArg: 6.145 ± 0.269
4.793AspSer: 4.793 ± 0.213
3.962AspThr: 3.962 ± 0.185
5.016AspVal: 5.016 ± 0.217
0.82AspTrp: 0.82 ± 0.088
1.246AspTyr: 1.246 ± 0.11
0.0AspXaa: 0.0 ± 0.0
Glu
4.782GluAla: 4.782 ± 0.257
0.543GluCys: 0.543 ± 0.078
3.706GluAsp: 3.706 ± 0.221
3.802GluGlu: 3.802 ± 0.245
1.768GluPhe: 1.768 ± 0.133
3.089GluGly: 3.089 ± 0.205
1.672GluHis: 1.672 ± 0.134
2.077GluIle: 2.077 ± 0.138
1.715GluLys: 1.715 ± 0.143
5.464GluLeu: 5.464 ± 0.29
1.438GluMet: 1.438 ± 0.115
1.843GluAsn: 1.843 ± 0.148
2.961GluPro: 2.961 ± 0.19
2.471GluGln: 2.471 ± 0.196
7.423GluArg: 7.423 ± 0.302
4.292GluSer: 4.292 ± 0.221
4.58GluThr: 4.58 ± 0.187
3.536GluVal: 3.536 ± 0.234
0.799GluTrp: 0.799 ± 0.092
1.534GluTyr: 1.534 ± 0.131
0.0GluXaa: 0.0 ± 0.0
Phe
2.354PheAla: 2.354 ± 0.163
0.351PheCys: 0.351 ± 0.058
1.906PheAsp: 1.906 ± 0.151
2.311PheGlu: 2.311 ± 0.171
0.511PhePhe: 0.511 ± 0.081
2.567PheGly: 2.567 ± 0.154
0.362PheHis: 0.362 ± 0.067
0.937PheIle: 0.937 ± 0.101
0.288PheLys: 0.288 ± 0.056
2.098PheLeu: 2.098 ± 0.138
0.362PheMet: 0.362 ± 0.065
0.575PheAsn: 0.575 ± 0.089
1.278PhePro: 1.278 ± 0.131
0.714PheGln: 0.714 ± 0.092
2.354PheArg: 2.354 ± 0.177
2.269PheSer: 2.269 ± 0.152
1.203PheThr: 1.203 ± 0.11
2.503PheVal: 2.503 ± 0.167
0.309PheTrp: 0.309 ± 0.049
0.501PheTyr: 0.501 ± 0.074
0.0PheXaa: 0.0 ± 0.0
Gly
5.698GlyAla: 5.698 ± 0.253
1.15GlyCys: 1.15 ± 0.118
5.006GlyAsp: 5.006 ± 0.239
4.164GlyGlu: 4.164 ± 0.241
2.364GlyPhe: 2.364 ± 0.165
5.219GlyGly: 5.219 ± 0.262
1.704GlyHis: 1.704 ± 0.155
3.216GlyIle: 3.216 ± 0.174
1.385GlyLys: 1.385 ± 0.106
5.101GlyLeu: 5.101 ± 0.239
1.608GlyMet: 1.608 ± 0.142
2.056GlyAsn: 2.056 ± 0.142
3.887GlyPro: 3.887 ± 0.219
2.183GlyGln: 2.183 ± 0.173
8.712GlyArg: 8.712 ± 0.264
5.623GlySer: 5.623 ± 0.257
5.868GlyThr: 5.868 ± 0.252
6.198GlyVal: 6.198 ± 0.267
0.809GlyTrp: 0.809 ± 0.105
1.832GlyTyr: 1.832 ± 0.121
0.0GlyXaa: 0.0 ± 0.0
His
1.981HisAla: 1.981 ± 0.145
0.554HisCys: 0.554 ± 0.089
1.672HisAsp: 1.672 ± 0.137
1.108HisGlu: 1.108 ± 0.102
0.564HisPhe: 0.564 ± 0.077
1.672HisGly: 1.672 ± 0.124
0.884HisHis: 0.884 ± 0.104
0.607HisIle: 0.607 ± 0.081
0.362HisLys: 0.362 ± 0.07
1.704HisLeu: 1.704 ± 0.106
0.351HisMet: 0.351 ± 0.061
0.277HisAsn: 0.277 ± 0.053
1.502HisPro: 1.502 ± 0.123
0.639HisGln: 0.639 ± 0.082
3.685HisArg: 3.685 ± 0.216
1.843HisSer: 1.843 ± 0.127
1.257HisThr: 1.257 ± 0.118
1.757HisVal: 1.757 ± 0.126
0.245HisTrp: 0.245 ± 0.047
0.426HisTyr: 0.426 ± 0.07
0.0HisXaa: 0.0 ± 0.0
Ile
3.93IleAla: 3.93 ± 0.196
0.426IleCys: 0.426 ± 0.059
2.737IleAsp: 2.737 ± 0.163
2.993IleGlu: 2.993 ± 0.201
0.735IlePhe: 0.735 ± 0.098
3.089IleGly: 3.089 ± 0.181
1.022IleHis: 1.022 ± 0.1
1.374IleIle: 1.374 ± 0.146
0.522IleLys: 0.522 ± 0.087
2.513IleLeu: 2.513 ± 0.165
0.533IleMet: 0.533 ± 0.08
0.714IleAsn: 0.714 ± 0.087
1.885IlePro: 1.885 ± 0.141
1.022IleGln: 1.022 ± 0.092
3.451IleArg: 3.451 ± 0.201
3.259IleSer: 3.259 ± 0.212
1.938IleThr: 1.938 ± 0.148
3.014IleVal: 3.014 ± 0.169
0.288IleTrp: 0.288 ± 0.054
0.522IleTyr: 0.522 ± 0.073
0.0IleXaa: 0.0 ± 0.0
Lys
1.512LysAla: 1.512 ± 0.131
0.277LysCys: 0.277 ± 0.049
0.746LysAsp: 0.746 ± 0.098
0.841LysGlu: 0.841 ± 0.103
0.458LysPhe: 0.458 ± 0.069
1.129LysGly: 1.129 ± 0.112
0.586LysHis: 0.586 ± 0.085
0.777LysIle: 0.777 ± 0.108
0.767LysLys: 0.767 ± 0.099
1.395LysLeu: 1.395 ± 0.104
0.383LysMet: 0.383 ± 0.067
0.543LysAsn: 0.543 ± 0.087
1.438LysPro: 1.438 ± 0.123
0.703LysGln: 0.703 ± 0.088
2.396LysArg: 2.396 ± 0.178
1.874LysSer: 1.874 ± 0.144
1.651LysThr: 1.651 ± 0.138
0.948LysVal: 0.948 ± 0.108
0.256LysTrp: 0.256 ± 0.062
0.447LysTyr: 0.447 ± 0.067
0.0LysXaa: 0.0 ± 0.0
Leu
7.86LeuAla: 7.86 ± 0.274
1.086LeuCys: 1.086 ± 0.114
5.229LeuAsp: 5.229 ± 0.247
4.612LeuGlu: 4.612 ± 0.221
2.109LeuPhe: 2.109 ± 0.153
5.538LeuGly: 5.538 ± 0.262
1.629LeuHis: 1.629 ± 0.134
2.737LeuIle: 2.737 ± 0.192
1.566LeuLys: 1.566 ± 0.133
6.358LeuLeu: 6.358 ± 0.308
1.235LeuMet: 1.235 ± 0.115
1.96LeuAsn: 1.96 ± 0.148
4.079LeuPro: 4.079 ± 0.197
1.906LeuGln: 1.906 ± 0.155
7.061LeuArg: 7.061 ± 0.308
6.262LeuSer: 6.262 ± 0.244
4.761LeuThr: 4.761 ± 0.237
6.358LeuVal: 6.358 ± 0.275
0.735LeuTrp: 0.735 ± 0.096
1.406LeuTyr: 1.406 ± 0.121
0.0LeuXaa: 0.0 ± 0.0
Met
1.811MetAla: 1.811 ± 0.154
0.234MetCys: 0.234 ± 0.049
1.001MetAsp: 1.001 ± 0.113
0.841MetGlu: 0.841 ± 0.097
0.458MetPhe: 0.458 ± 0.072
1.076MetGly: 1.076 ± 0.101
0.415MetHis: 0.415 ± 0.062
0.724MetIle: 0.724 ± 0.079
0.66MetLys: 0.66 ± 0.076
1.576MetLeu: 1.576 ± 0.126
0.469MetMet: 0.469 ± 0.078
0.65MetAsn: 0.65 ± 0.093
1.203MetPro: 1.203 ± 0.114
0.394MetGln: 0.394 ± 0.075
2.343MetArg: 2.343 ± 0.178
2.151MetSer: 2.151 ± 0.146
2.087MetThr: 2.087 ± 0.143
1.353MetVal: 1.353 ± 0.126
0.16MetTrp: 0.16 ± 0.045
0.437MetTyr: 0.437 ± 0.073
0.0MetXaa: 0.0 ± 0.0
Asn
2.428AsnAla: 2.428 ± 0.166
0.596AsnCys: 0.596 ± 0.08
1.555AsnAsp: 1.555 ± 0.136
1.534AsnGlu: 1.534 ± 0.124
0.426AsnPhe: 0.426 ± 0.073
2.002AsnGly: 2.002 ± 0.153
0.692AsnHis: 0.692 ± 0.086
0.82AsnIle: 0.82 ± 0.098
0.362AsnLys: 0.362 ± 0.065
1.512AsnLeu: 1.512 ± 0.123
0.341AsnMet: 0.341 ± 0.063
0.522AsnAsn: 0.522 ± 0.081
1.587AsnPro: 1.587 ± 0.128
0.682AsnGln: 0.682 ± 0.086
2.939AsnArg: 2.939 ± 0.158
2.013AsnSer: 2.013 ± 0.139
1.459AsnThr: 1.459 ± 0.119
1.566AsnVal: 1.566 ± 0.152
0.32AsnTrp: 0.32 ± 0.061
0.522AsnTyr: 0.522 ± 0.076
0.0AsnXaa: 0.0 ± 0.0
Pro
5.325ProAla: 5.325 ± 0.285
0.777ProCys: 0.777 ± 0.096
3.866ProAsp: 3.866 ± 0.198
3.227ProGlu: 3.227 ± 0.185
1.342ProPhe: 1.342 ± 0.109
3.493ProGly: 3.493 ± 0.205
1.108ProHis: 1.108 ± 0.105
1.512ProIle: 1.512 ± 0.148
1.129ProLys: 1.129 ± 0.116
3.909ProLeu: 3.909 ± 0.189
1.214ProMet: 1.214 ± 0.106
1.086ProAsn: 1.086 ± 0.11
3.312ProPro: 3.312 ± 0.303
1.502ProGln: 1.502 ± 0.118
5.932ProArg: 5.932 ± 0.251
5.922ProSer: 5.922 ± 0.282
5.07ProThr: 5.07 ± 0.262
4.42ProVal: 4.42 ± 0.206
0.458ProTrp: 0.458 ± 0.065
0.799ProTyr: 0.799 ± 0.094
0.0ProXaa: 0.0 ± 0.0
Gln
2.013GlnAla: 2.013 ± 0.134
0.362GlnCys: 0.362 ± 0.077
1.172GlnAsp: 1.172 ± 0.095
1.076GlnGlu: 1.076 ± 0.099
1.012GlnPhe: 1.012 ± 0.092
1.267GlnGly: 1.267 ± 0.118
0.746GlnHis: 0.746 ± 0.076
0.948GlnIle: 0.948 ± 0.12
0.682GlnLys: 0.682 ± 0.091
2.162GlnLeu: 2.162 ± 0.165
0.586GlnMet: 0.586 ± 0.076
0.714GlnAsn: 0.714 ± 0.074
1.48GlnPro: 1.48 ± 0.118
1.044GlnGln: 1.044 ± 0.111
3.994GlnArg: 3.994 ± 0.198
2.897GlnSer: 2.897 ± 0.177
2.332GlnThr: 2.332 ± 0.16
1.406GlnVal: 1.406 ± 0.123
0.32GlnTrp: 0.32 ± 0.064
0.724GlnTyr: 0.724 ± 0.093
0.0GlnXaa: 0.0 ± 0.0
Arg
7.711ArgAla: 7.711 ± 0.267
2.982ArgCys: 2.982 ± 0.195
5.719ArgAsp: 5.719 ± 0.259
5.559ArgGlu: 5.559 ± 0.266
3.003ArgPhe: 3.003 ± 0.2
6.635ArgGly: 6.635 ± 0.255
3.206ArgHis: 3.206 ± 0.163
3.642ArgIle: 3.642 ± 0.236
2.066ArgLys: 2.066 ± 0.172
8.137ArgLeu: 8.137 ± 0.279
2.854ArgMet: 2.854 ± 0.163
2.652ArgAsn: 2.652 ± 0.155
7.381ArgPro: 7.381 ± 0.284
3.525ArgGln: 3.525 ± 0.225
19.149ArgArg: 19.149 ± 0.613
11.183ArgSer: 11.183 ± 0.355
8.797ArgThr: 8.797 ± 0.326
6.656ArgVal: 6.656 ± 0.265
1.8ArgTrp: 1.8 ± 0.157
2.748ArgTyr: 2.748 ± 0.153
0.0ArgXaa: 0.0 ± 0.0
Ser
8.105SerAla: 8.105 ± 0.302
1.416SerCys: 1.416 ± 0.111
4.91SerAsp: 4.91 ± 0.251
4.707SerGlu: 4.707 ± 0.241
1.874SerPhe: 1.874 ± 0.137
7.551SerGly: 7.551 ± 0.3
1.97SerHis: 1.97 ± 0.154
3.099SerIle: 3.099 ± 0.175
1.906SerLys: 1.906 ± 0.164
6.358SerLeu: 6.358 ± 0.251
2.066SerMet: 2.066 ± 0.143
2.396SerAsn: 2.396 ± 0.148
5.389SerPro: 5.389 ± 0.236
1.992SerGln: 1.992 ± 0.136
10.022SerArg: 10.022 ± 0.349
9.724SerSer: 9.724 ± 0.455
7.189SerThr: 7.189 ± 0.303
6.614SerVal: 6.614 ± 0.308
0.884SerTrp: 0.884 ± 0.09
1.491SerTyr: 1.491 ± 0.119
0.0SerXaa: 0.0 ± 0.0
Thr
7.998ThrAla: 7.998 ± 0.306
0.959ThrCys: 0.959 ± 0.098
4.771ThrAsp: 4.771 ± 0.244
4.335ThrGlu: 4.335 ± 0.214
1.661ThrPhe: 1.661 ± 0.127
5.762ThrGly: 5.762 ± 0.257
1.278ThrHis: 1.278 ± 0.12
2.684ThrIle: 2.684 ± 0.168
1.353ThrLys: 1.353 ± 0.112
4.963ThrLeu: 4.963 ± 0.245
1.331ThrMet: 1.331 ± 0.132
1.736ThrAsn: 1.736 ± 0.14
4.697ThrPro: 4.697 ± 0.223
1.715ThrGln: 1.715 ± 0.144
7.477ThrArg: 7.477 ± 0.28
6.795ThrSer: 6.795 ± 0.287
6.561ThrThr: 6.561 ± 0.278
6.944ThrVal: 6.944 ± 0.274
0.809ThrTrp: 0.809 ± 0.085
1.299ThrTyr: 1.299 ± 0.093
0.0ThrXaa: 0.0 ± 0.0
Val
8.222ValAla: 8.222 ± 0.312
1.14ValCys: 1.14 ± 0.11
5.006ValAsp: 5.006 ± 0.232
4.707ValGlu: 4.707 ± 0.25
2.396ValPhe: 2.396 ± 0.17
5.9ValGly: 5.9 ± 0.24
1.672ValHis: 1.672 ± 0.139
2.663ValIle: 2.663 ± 0.165
1.15ValLys: 1.15 ± 0.12
6.017ValLeu: 6.017 ± 0.238
1.161ValMet: 1.161 ± 0.112
1.779ValAsn: 1.779 ± 0.147
3.898ValPro: 3.898 ± 0.205
1.96ValGln: 1.96 ± 0.139
8.041ValArg: 8.041 ± 0.317
6.422ValSer: 6.422 ± 0.254
6.092ValThr: 6.092 ± 0.228
6.869ValVal: 6.869 ± 0.313
0.82ValTrp: 0.82 ± 0.094
1.512ValTyr: 1.512 ± 0.116
0.0ValXaa: 0.0 ± 0.0
Trp
0.788TrpAla: 0.788 ± 0.1
0.362TrpCys: 0.362 ± 0.061
0.415TrpAsp: 0.415 ± 0.067
0.522TrpGlu: 0.522 ± 0.075
0.298TrpPhe: 0.298 ± 0.052
0.564TrpGly: 0.564 ± 0.07
0.309TrpHis: 0.309 ± 0.057
0.458TrpIle: 0.458 ± 0.073
0.383TrpLys: 0.383 ± 0.068
1.065TrpLeu: 1.065 ± 0.103
0.405TrpMet: 0.405 ± 0.06
0.383TrpAsn: 0.383 ± 0.063
0.788TrpPro: 0.788 ± 0.104
0.298TrpGln: 0.298 ± 0.057
1.129TrpArg: 1.129 ± 0.106
1.108TrpSer: 1.108 ± 0.107
1.161TrpThr: 1.161 ± 0.118
0.671TrpVal: 0.671 ± 0.078
0.192TrpTrp: 0.192 ± 0.042
0.288TrpTyr: 0.288 ± 0.046
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.725TyrAla: 1.725 ± 0.158
0.277TyrCys: 0.277 ± 0.058
1.47TyrAsp: 1.47 ± 0.103
1.47TyrGlu: 1.47 ± 0.12
0.596TyrPhe: 0.596 ± 0.088
1.544TyrGly: 1.544 ± 0.114
0.533TyrHis: 0.533 ± 0.07
0.756TyrIle: 0.756 ± 0.097
0.479TyrLys: 0.479 ± 0.08
1.566TyrLeu: 1.566 ± 0.121
0.266TyrMet: 0.266 ± 0.049
0.49TyrAsn: 0.49 ± 0.071
0.98TyrPro: 0.98 ± 0.114
0.586TyrGln: 0.586 ± 0.078
2.364TyrArg: 2.364 ± 0.156
1.385TyrSer: 1.385 ± 0.114
0.895TyrThr: 0.895 ± 0.101
1.629TyrVal: 1.629 ± 0.129
0.245TyrTrp: 0.245 ± 0.054
0.511TyrTyr: 0.511 ± 0.076
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 845 proteins (93895 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski