Amino acid dipeptide frequency for Desulfobacteraceae bacterium SEEP-SAG10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
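The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_permille`, the use of overlapping windows within each protein, and the normalization by the total number of dipeptides are all assumptions made here, and Proteome-pI may normalize differently.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} (hypothetical helper).

    Assumption: dipeptides are counted as overlapping pairs inside each
    protein and normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2

# Two toy sequences (made up for illustration, not taken from this proteome).
freqs = dipeptide_permille(["MKKLLA", "MAAK"])
for pair, value in sorted(freqs.items()):
    print(pair, value)
```

With 21 symbols (the 20 standard residues plus X) the same function can report up to 441 distinct pairs, which matches the table layout.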

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
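As a small worked example of the unit conversion, the sketch below turns two values from the table (LeuLeu and TrpCys) into fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical, and the simple greater-than or less-than comparison is an assumption about how over- and underrepresentation is decided.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille for each of the 400 dipeptides

def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def fold_vs_uniform(value_permille):
    """Ratio of an observed per-mille value to the uniform expectation."""
    return value_permille / UNIFORM_PERMILLE

# Values copied from the table on this page.
leu_leu = 9.035
trp_cys = 0.112

for name, value in (("LeuLeu", leu_leu), ("TrpCys", trp_cys)):
    # Assumed rule: above 2.5 per mille counts as over, below as under.
    label = "over" if value > UNIFORM_PERMILLE else "under"
    print(f"{name}: fraction={permille_to_fraction(value):.6f}, "
          f"{fold_vs_uniform(value):.2f}x uniform ({label}represented)")
```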

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.039 ± 0.253
AlaCys: 0.973 ± 0.115
AlaAsp: 3.971 ± 0.208
AlaGlu: 4.332 ± 0.211
AlaPhe: 2.808 ± 0.157
AlaGly: 5.306 ± 0.252
AlaHis: 1.318 ± 0.094
AlaIle: 5.004 ± 0.19
AlaLys: 4.505 ± 0.223
AlaLeu: 6.735 ± 0.235
AlaMet: 2.024 ± 0.156
AlaAsn: 2.437 ± 0.152
AlaPro: 1.585 ± 0.133
AlaGln: 2.127 ± 0.137
AlaArg: 3.471 ± 0.188
AlaSer: 3.824 ± 0.196
AlaThr: 3.282 ± 0.166
AlaVal: 4.832 ± 0.254
AlaTrp: 0.99 ± 0.1
AlaTyr: 2.386 ± 0.149
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.698 ± 0.064
CysCys: 0.224 ± 0.041
CysAsp: 0.698 ± 0.082
CysGlu: 0.706 ± 0.072
CysPhe: 0.551 ± 0.067
CysGly: 1.042 ± 0.091
CysHis: 0.258 ± 0.043
CysIle: 0.904 ± 0.089
CysLys: 0.801 ± 0.088
CysLeu: 1.111 ± 0.113
CysMet: 0.319 ± 0.056
CysAsn: 0.534 ± 0.066
CysPro: 0.818 ± 0.088
CysGln: 0.37 ± 0.062
CysArg: 0.775 ± 0.091
CysSer: 0.922 ± 0.082
CysThr: 0.577 ± 0.063
CysVal: 0.775 ± 0.085
CysTrp: 0.181 ± 0.036
CysTyr: 0.465 ± 0.061
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.626 ± 0.211
AspCys: 0.646 ± 0.072
AspAsp: 2.971 ± 0.196
AspGlu: 3.376 ± 0.191
AspPhe: 2.928 ± 0.136
AspGly: 3.798 ± 0.314
AspHis: 0.982 ± 0.094
AspIle: 5.073 ± 0.213
AspLys: 3.686 ± 0.172
AspLeu: 6.107 ± 0.217
AspMet: 1.326 ± 0.102
AspAsn: 2.153 ± 0.155
AspPro: 2.661 ± 0.158
AspGln: 1.947 ± 0.118
AspArg: 3.058 ± 0.176
AspSer: 3.316 ± 0.181
AspThr: 2.911 ± 0.173
AspVal: 3.333 ± 0.185
AspTrp: 0.68 ± 0.09
AspTyr: 2.463 ± 0.153
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.195 ± 0.216
GluCys: 0.568 ± 0.07
GluAsp: 3.437 ± 0.178
GluGlu: 4.315 ± 0.193
GluPhe: 2.704 ± 0.154
GluGly: 3.419 ± 0.185
GluHis: 1.318 ± 0.095
GluIle: 5.848 ± 0.271
GluLys: 6.158 ± 0.222
GluLeu: 5.986 ± 0.225
GluMet: 1.878 ± 0.136
GluAsn: 3.505 ± 0.186
GluPro: 2.058 ± 0.105
GluGln: 2.394 ± 0.166
GluArg: 3.23 ± 0.156
GluSer: 3.531 ± 0.198
GluThr: 3.505 ± 0.186
GluVal: 3.816 ± 0.183
GluTrp: 0.689 ± 0.088
GluTyr: 2.369 ± 0.149
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.679 ± 0.148
PheCys: 0.672 ± 0.084
PheAsp: 2.86 ± 0.15
PheGlu: 2.799 ± 0.154
PhePhe: 2.636 ± 0.177
PheGly: 2.98 ± 0.165
PheHis: 0.973 ± 0.077
PheIle: 3.609 ± 0.199
PheLys: 3.195 ± 0.148
PheLeu: 4.367 ± 0.271
PheMet: 1.163 ± 0.1
PheAsn: 2.17 ± 0.139
PhePro: 1.912 ± 0.12
PheGln: 1.301 ± 0.115
PheArg: 2.214 ± 0.14
PheSer: 3.523 ± 0.224
PheThr: 1.895 ± 0.127
PheVal: 3.066 ± 0.168
PheTrp: 0.534 ± 0.08
PheTyr: 1.654 ± 0.112
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.117 ± 0.222
GlyCys: 0.956 ± 0.083
GlyAsp: 3.29 ± 0.165
GlyGlu: 3.859 ± 0.199
GlyPhe: 3.6 ± 0.186
GlyGly: 4.556 ± 0.3
GlyHis: 1.697 ± 0.126
GlyIle: 5.797 ± 0.229
GlyLys: 5.107 ± 0.212
GlyLeu: 6.253 ± 0.236
GlyMet: 1.886 ± 0.14
GlyAsn: 2.73 ± 0.154
GlyPro: 2.119 ± 0.156
GlyGln: 2.308 ± 0.153
GlyArg: 3.859 ± 0.19
GlySer: 4.108 ± 0.215
GlyThr: 3.394 ± 0.197
GlyVal: 4.332 ± 0.211
GlyTrp: 0.922 ± 0.106
GlyTyr: 2.498 ± 0.17
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.197 ± 0.095
HisCys: 0.319 ± 0.057
HisAsp: 1.154 ± 0.089
HisGlu: 1.059 ± 0.084
HisPhe: 0.999 ± 0.085
HisGly: 1.533 ± 0.12
HisHis: 0.543 ± 0.07
HisIle: 1.516 ± 0.107
HisLys: 1.318 ± 0.099
HisLeu: 2.351 ± 0.136
HisMet: 0.56 ± 0.072
HisAsn: 0.947 ± 0.096
HisPro: 1.085 ± 0.09
HisGln: 0.723 ± 0.075
HisArg: 1.128 ± 0.094
HisSer: 1.369 ± 0.097
HisThr: 1.016 ± 0.092
HisVal: 1.146 ± 0.105
HisTrp: 0.241 ± 0.045
HisTyr: 0.965 ± 0.096
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.314 ± 0.211
IleCys: 1.154 ± 0.11
IleAsp: 4.763 ± 0.201
IleGlu: 5.702 ± 0.21
IlePhe: 3.195 ± 0.187
IleGly: 4.556 ± 0.208
IleHis: 1.809 ± 0.14
IleIle: 6.348 ± 0.32
IleLys: 5.857 ± 0.269
IleLeu: 7.355 ± 0.299
IleMet: 1.826 ± 0.137
IleAsn: 3.505 ± 0.163
IlePro: 3.884 ± 0.154
IleGln: 2.369 ± 0.154
IleArg: 4.315 ± 0.169
IleSer: 5.263 ± 0.222
IleThr: 4.384 ± 0.214
IleVal: 4.668 ± 0.215
IleTrp: 0.612 ± 0.067
IleTyr: 2.472 ± 0.158
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.047 ± 0.211
LysCys: 0.81 ± 0.081
LysAsp: 4.453 ± 0.192
LysGlu: 5.641 ± 0.252
LysPhe: 2.549 ± 0.159
LysGly: 4.806 ± 0.222
LysHis: 1.464 ± 0.121
LysIle: 6.244 ± 0.245
LysLys: 7.554 ± 0.281
LysLeu: 6.081 ± 0.233
LysMet: 1.895 ± 0.137
LysAsn: 3.686 ± 0.173
LysPro: 2.842 ± 0.172
LysGln: 2.343 ± 0.169
LysArg: 4.332 ± 0.21
LysSer: 3.798 ± 0.19
LysThr: 3.979 ± 0.2
LysVal: 4.151 ± 0.207
LysTrp: 0.784 ± 0.084
LysTyr: 2.558 ± 0.151
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.813 ± 0.277
LeuCys: 1.171 ± 0.102
LeuAsp: 5.288 ± 0.206
LeuGlu: 6.494 ± 0.221
LeuPhe: 4.453 ± 0.257
LeuGly: 6.072 ± 0.258
LeuHis: 1.611 ± 0.106
LeuIle: 7.045 ± 0.294
LeuLys: 7.666 ± 0.282
LeuLeu: 9.035 ± 0.358
LeuMet: 2.386 ± 0.151
LeuAsn: 4.487 ± 0.185
LeuPro: 3.695 ± 0.172
LeuGln: 2.722 ± 0.168
LeuArg: 4.634 ± 0.221
LeuSer: 6.692 ± 0.253
LeuThr: 4.815 ± 0.231
LeuVal: 5.917 ± 0.22
LeuTrp: 1.059 ± 0.112
LeuTyr: 2.593 ± 0.152
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.265 ± 0.128
MetCys: 0.189 ± 0.044
MetAsp: 1.568 ± 0.103
MetGlu: 1.955 ± 0.137
MetPhe: 0.741 ± 0.071
MetGly: 1.852 ± 0.15
MetHis: 0.345 ± 0.06
MetIle: 1.602 ± 0.106
MetLys: 2.015 ± 0.13
MetLeu: 2.127 ± 0.151
MetMet: 0.646 ± 0.082
MetAsn: 1.309 ± 0.104
MetPro: 1.197 ± 0.091
MetGln: 0.698 ± 0.084
MetArg: 1.051 ± 0.098
MetSer: 1.404 ± 0.114
MetThr: 1.335 ± 0.107
MetVal: 1.774 ± 0.118
MetTrp: 0.146 ± 0.037
MetTyr: 0.543 ± 0.065
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.437 ± 0.145
AsnCys: 0.655 ± 0.083
AsnAsp: 2.282 ± 0.142
AsnGlu: 2.575 ± 0.157
AsnPhe: 1.99 ± 0.135
AsnGly: 3.049 ± 0.218
AsnHis: 0.93 ± 0.08
AsnIle: 3.979 ± 0.213
AsnLys: 2.877 ± 0.154
AsnLeu: 4.289 ± 0.213
AsnMet: 0.81 ± 0.07
AsnAsn: 2.05 ± 0.182
AsnPro: 2.153 ± 0.129
AsnGln: 1.8 ± 0.135
AsnArg: 2.618 ± 0.149
AsnSer: 2.369 ± 0.157
AsnThr: 2.265 ± 0.131
AsnVal: 2.558 ± 0.149
AsnTrp: 0.56 ± 0.067
AsnTyr: 1.654 ± 0.129
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.644 ± 0.138
ProCys: 0.586 ± 0.077
ProAsp: 3.152 ± 0.151
ProGlu: 3.368 ± 0.166
ProPhe: 2.093 ± 0.139
ProGly: 2.937 ± 0.157
ProHis: 0.861 ± 0.086
ProIle: 2.593 ± 0.151
ProLys: 2.679 ± 0.155
ProLeu: 3.592 ± 0.178
ProMet: 0.913 ± 0.098
ProAsn: 1.714 ± 0.107
ProPro: 1.464 ± 0.113
ProGln: 1.008 ± 0.098
ProArg: 1.611 ± 0.124
ProSer: 2.394 ± 0.142
ProThr: 1.705 ± 0.139
ProVal: 3.385 ± 0.174
ProTrp: 0.379 ± 0.056
ProTyr: 1.352 ± 0.099
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.386 ± 0.144
GlnCys: 0.267 ± 0.051
GlnAsp: 1.568 ± 0.123
GlnGlu: 2.007 ± 0.127
GlnPhe: 1.249 ± 0.099
GlnGly: 2.093 ± 0.157
GlnHis: 0.689 ± 0.075
GlnIle: 2.748 ± 0.159
GlnLys: 2.851 ± 0.172
GlnLeu: 2.696 ± 0.134
GlnMet: 0.835 ± 0.08
GlnAsn: 1.507 ± 0.115
GlnPro: 1.085 ± 0.103
GlnGln: 1.111 ± 0.135
GlnArg: 1.585 ± 0.113
GlnSer: 1.723 ± 0.132
GlnThr: 1.74 ± 0.136
GlnVal: 1.998 ± 0.14
GlnTrp: 0.439 ± 0.076
GlnTyr: 1.102 ± 0.081
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.307 ± 0.174
ArgCys: 0.646 ± 0.078
ArgAsp: 2.575 ± 0.153
ArgGlu: 3.178 ± 0.147
ArgPhe: 2.765 ± 0.157
ArgGly: 2.98 ± 0.163
ArgHis: 1.283 ± 0.101
ArgIle: 4.263 ± 0.162
ArgLys: 4.255 ± 0.258
ArgLeu: 5.323 ± 0.205
ArgMet: 1.318 ± 0.098
ArgAsn: 2.394 ± 0.157
ArgPro: 2.127 ± 0.147
ArgGln: 1.697 ± 0.105
ArgArg: 3.058 ± 0.17
ArgSer: 3.144 ± 0.174
ArgThr: 2.463 ± 0.137
ArgVal: 3.299 ± 0.159
ArgTrp: 0.646 ± 0.068
ArgTyr: 2.239 ± 0.13
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.859 ± 0.154
SerCys: 0.844 ± 0.087
SerAsp: 3.635 ± 0.217
SerGlu: 3.781 ± 0.185
SerPhe: 3.075 ± 0.168
SerGly: 5.349 ± 0.219
SerHis: 1.189 ± 0.093
SerIle: 4.875 ± 0.214
SerLys: 4.151 ± 0.214
SerLeu: 6.124 ± 0.239
SerMet: 1.516 ± 0.116
SerAsn: 2.3 ± 0.153
SerPro: 2.489 ± 0.156
SerGln: 2.041 ± 0.139
SerArg: 3.135 ± 0.147
SerSer: 3.971 ± 0.22
SerThr: 2.928 ± 0.166
SerVal: 3.437 ± 0.193
SerTrp: 0.577 ± 0.074
SerTyr: 2.231 ± 0.148
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.635 ± 0.201
ThrCys: 0.655 ± 0.082
ThrAsp: 2.791 ± 0.158
ThrGlu: 2.825 ± 0.14
ThrPhe: 2.394 ± 0.156
ThrGly: 4.108 ± 0.204
ThrHis: 1.404 ± 0.112
ThrIle: 3.721 ± 0.185
ThrLys: 2.963 ± 0.149
ThrLeu: 4.53 ± 0.213
ThrMet: 1.111 ± 0.103
ThrAsn: 1.912 ± 0.147
ThrPro: 2.446 ± 0.161
ThrGln: 1.395 ± 0.113
ThrArg: 2.946 ± 0.171
ThrSer: 3.049 ± 0.155
ThrThr: 2.437 ± 0.152
ThrVal: 3.884 ± 0.157
ThrTrp: 0.5 ± 0.071
ThrTyr: 1.671 ± 0.139
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.608 ± 0.22
ValCys: 0.749 ± 0.075
ValAsp: 3.738 ± 0.179
ValGlu: 4.1 ± 0.192
ValPhe: 3.101 ± 0.177
ValGly: 3.798 ± 0.183
ValHis: 1.318 ± 0.103
ValIle: 4.884 ± 0.228
ValLys: 4.462 ± 0.212
ValLeu: 6.029 ± 0.215
ValMet: 1.438 ± 0.105
ValAsn: 2.549 ± 0.147
ValPro: 2.696 ± 0.149
ValGln: 1.757 ± 0.129
ValArg: 3.127 ± 0.175
ValSer: 4.462 ± 0.183
ValThr: 3.471 ± 0.181
ValVal: 4.677 ± 0.245
ValTrp: 0.517 ± 0.068
ValTyr: 1.757 ± 0.135
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.775 ± 0.084
TrpCys: 0.112 ± 0.031
TrpAsp: 0.715 ± 0.113
TrpGlu: 0.68 ± 0.083
TrpPhe: 0.482 ± 0.057
TrpGly: 0.698 ± 0.085
TrpHis: 0.276 ± 0.068
TrpIle: 0.818 ± 0.078
TrpLys: 0.801 ± 0.083
TrpLeu: 1.128 ± 0.112
TrpMet: 0.284 ± 0.054
TrpAsn: 0.586 ± 0.068
TrpPro: 0.396 ± 0.059
TrpGln: 0.379 ± 0.054
TrpArg: 0.698 ± 0.076
TrpSer: 0.646 ± 0.069
TrpThr: 0.56 ± 0.069
TrpVal: 0.603 ± 0.067
TrpTrp: 0.146 ± 0.036
TrpTyr: 0.327 ± 0.048
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.308 ± 0.151
TyrCys: 0.534 ± 0.064
TyrAsp: 2.136 ± 0.162
TyrGlu: 2.17 ± 0.153
TyrPhe: 1.895 ± 0.128
TyrGly: 2.455 ± 0.163
TyrHis: 0.904 ± 0.089
TyrIle: 2.481 ± 0.157
TyrLys: 2.119 ± 0.17
TyrLeu: 3.428 ± 0.188
TyrMet: 0.62 ± 0.07
TyrAsn: 1.421 ± 0.114
TyrPro: 1.714 ± 0.131
TyrGln: 1.189 ± 0.105
TyrArg: 2.119 ± 0.157
TyrSer: 1.998 ± 0.136
TyrThr: 1.714 ± 0.123
TyrVal: 1.585 ± 0.136
TyrTrp: 0.508 ± 0.074
TyrTyr: 1.318 ± 0.115
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 614 proteins (116105 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
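One way to read the note above is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The page does not describe the exact procedure, so the function names, the fixed seed, and the use of the sample standard deviation as the error are assumptions.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping windows."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Hypothetical helper: each resample draws len(proteins) whole proteins
    with replacement, so the protein (not the residue) is the unit.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences (made up for illustration, not taken from this proteome).
toy = ["MKKLLA", "MAAK", "LLKKA", "MKLA"]
print(pair_permille(toy, "KK"), "±", bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps the dipeptides within each sequence together, which is presumably what "at the protein level" refers to.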

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski