Amino acid dipepetide frequency for Acidilobus sp. SCGC AC-742_M05

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.711AlaAla: 8.711 ± 0.433
0.506AlaCys: 0.506 ± 0.093
3.511AlaAsp: 3.511 ± 0.236
6.775AlaGlu: 6.775 ± 0.296
2.889AlaPhe: 2.889 ± 0.218
5.764AlaGly: 5.764 ± 0.297
1.055AlaHis: 1.055 ± 0.114
5.475AlaIle: 5.475 ± 0.254
5.114AlaLys: 5.114 ± 0.263
12.323AlaLeu: 12.323 ± 0.493
2.384AlaMet: 2.384 ± 0.189
1.705AlaAsn: 1.705 ± 0.161
3.467AlaPro: 3.467 ± 0.23
2.615AlaGln: 2.615 ± 0.165
7.209AlaArg: 7.209 ± 0.398
7.137AlaSer: 7.137 ± 0.391
3.626AlaThr: 3.626 ± 0.227
7.454AlaVal: 7.454 ± 0.371
1.257AlaTrp: 1.257 ± 0.139
3.092AlaTyr: 3.092 ± 0.196
0.0AlaXaa: 0.0 ± 0.0
Cys
0.26CysAla: 0.26 ± 0.049
0.043CysCys: 0.043 ± 0.023
0.433CysAsp: 0.433 ± 0.079
0.506CysGlu: 0.506 ± 0.093
0.202CysPhe: 0.202 ± 0.062
1.04CysGly: 1.04 ± 0.151
0.101CysHis: 0.101 ± 0.042
0.246CysIle: 0.246 ± 0.057
0.318CysLys: 0.318 ± 0.074
0.347CysLeu: 0.347 ± 0.071
0.116CysMet: 0.116 ± 0.04
0.202CysAsn: 0.202 ± 0.048
0.751CysPro: 0.751 ± 0.12
0.159CysGln: 0.159 ± 0.048
0.477CysArg: 0.477 ± 0.107
0.592CysSer: 0.592 ± 0.113
0.318CysThr: 0.318 ± 0.08
0.737CysVal: 0.737 ± 0.105
0.043CysTrp: 0.043 ± 0.031
0.303CysTyr: 0.303 ± 0.067
0.0CysXaa: 0.0 ± 0.0
Asp
3.843AspAla: 3.843 ± 0.271
0.289AspCys: 0.289 ± 0.064
2.413AspAsp: 2.413 ± 0.209
4.739AspGlu: 4.739 ± 0.275
1.661AspPhe: 1.661 ± 0.137
2.803AspGly: 2.803 ± 0.213
0.766AspHis: 0.766 ± 0.098
2.485AspIle: 2.485 ± 0.216
2.528AspLys: 2.528 ± 0.201
6.125AspLeu: 6.125 ± 0.314
0.968AspMet: 0.968 ± 0.111
1.084AspAsn: 1.084 ± 0.106
3.12AspPro: 3.12 ± 0.226
0.722AspGln: 0.722 ± 0.104
2.788AspArg: 2.788 ± 0.23
2.369AspSer: 2.369 ± 0.189
1.531AspThr: 1.531 ± 0.134
5.678AspVal: 5.678 ± 0.266
0.52AspTrp: 0.52 ± 0.082
2.008AspTyr: 2.008 ± 0.164
0.0AspXaa: 0.0 ± 0.0
Glu
9.506GluAla: 9.506 ± 0.411
0.419GluCys: 0.419 ± 0.086
3.482GluAsp: 3.482 ± 0.247
7.296GluGlu: 7.296 ± 0.417
1.965GluPhe: 1.965 ± 0.181
6.515GluGly: 6.515 ± 0.342
0.939GluHis: 0.939 ± 0.14
3.641GluIle: 3.641 ± 0.284
3.193GluLys: 3.193 ± 0.257
8.841GluLeu: 8.841 ± 0.496
1.214GluMet: 1.214 ± 0.141
1.575GluAsn: 1.575 ± 0.149
3.265GluPro: 3.265 ± 0.203
1.546GluGln: 1.546 ± 0.166
6.01GluArg: 6.01 ± 0.314
3.496GluSer: 3.496 ± 0.253
2.283GluThr: 2.283 ± 0.19
7.758GluVal: 7.758 ± 0.362
1.055GluTrp: 1.055 ± 0.125
1.979GluTyr: 1.979 ± 0.182
0.0GluXaa: 0.0 ± 0.0
Phe
1.893PheAla: 1.893 ± 0.199
0.289PheCys: 0.289 ± 0.07
1.979PheAsp: 1.979 ± 0.183
1.921PheGlu: 1.921 ± 0.178
1.286PhePhe: 1.286 ± 0.121
2.037PheGly: 2.037 ± 0.144
0.549PheHis: 0.549 ± 0.082
1.864PheIle: 1.864 ± 0.163
1.618PheLys: 1.618 ± 0.157
2.774PheLeu: 2.774 ± 0.198
0.78PheMet: 0.78 ± 0.105
1.084PheAsn: 1.084 ± 0.132
1.055PhePro: 1.055 ± 0.131
0.578PheGln: 0.578 ± 0.096
2.037PheArg: 2.037 ± 0.183
2.08PheSer: 2.08 ± 0.19
1.734PheThr: 1.734 ± 0.155
2.441PheVal: 2.441 ± 0.21
0.231PheTrp: 0.231 ± 0.053
1.286PheTyr: 1.286 ± 0.138
0.0PheXaa: 0.0 ± 0.0
Gly
6.53GlyAla: 6.53 ± 0.327
0.737GlyCys: 0.737 ± 0.126
3.944GlyAsp: 3.944 ± 0.255
4.724GlyGlu: 4.724 ± 0.271
2.976GlyPhe: 2.976 ± 0.231
5.952GlyGly: 5.952 ± 0.406
1.344GlyHis: 1.344 ± 0.155
3.698GlyIle: 3.698 ± 0.214
3.698GlyLys: 3.698 ± 0.232
10.17GlyLeu: 10.17 ± 0.415
1.791GlyMet: 1.791 ± 0.144
1.604GlyAsn: 1.604 ± 0.142
3.496GlyPro: 3.496 ± 0.231
1.936GlyGln: 1.936 ± 0.188
5.461GlyArg: 5.461 ± 0.258
4.984GlySer: 4.984 ± 0.261
2.889GlyThr: 2.889 ± 0.222
7.758GlyVal: 7.758 ± 0.398
1.098GlyTrp: 1.098 ± 0.137
2.889GlyTyr: 2.889 ± 0.182
0.0GlyXaa: 0.0 ± 0.0
His
1.04HisAla: 1.04 ± 0.12
0.116HisCys: 0.116 ± 0.039
0.766HisAsp: 0.766 ± 0.105
1.228HisGlu: 1.228 ± 0.148
0.448HisPhe: 0.448 ± 0.074
1.257HisGly: 1.257 ± 0.152
0.405HisHis: 0.405 ± 0.078
0.852HisIle: 0.852 ± 0.122
0.607HisLys: 0.607 ± 0.089
1.271HisLeu: 1.271 ± 0.149
0.274HisMet: 0.274 ± 0.052
0.361HisAsn: 0.361 ± 0.067
0.896HisPro: 0.896 ± 0.122
0.274HisGln: 0.274 ± 0.077
0.925HisArg: 0.925 ± 0.117
0.679HisSer: 0.679 ± 0.11
0.607HisThr: 0.607 ± 0.086
1.344HisVal: 1.344 ± 0.132
0.159HisTrp: 0.159 ± 0.044
0.535HisTyr: 0.535 ± 0.092
0.0HisXaa: 0.0 ± 0.0
Ile
5.504IleAla: 5.504 ± 0.283
0.376IleCys: 0.376 ± 0.078
3.063IleAsp: 3.063 ± 0.23
4.045IleGlu: 4.045 ± 0.263
1.329IlePhe: 1.329 ± 0.144
4.392IleGly: 4.392 ± 0.229
0.592IleHis: 0.592 ± 0.098
3.857IleIle: 3.857 ± 0.237
2.73IleLys: 2.73 ± 0.214
3.539IleLeu: 3.539 ± 0.256
1.156IleMet: 1.156 ± 0.138
1.589IleAsn: 1.589 ± 0.157
2.514IlePro: 2.514 ± 0.201
0.867IleGln: 0.867 ± 0.109
4.074IleArg: 4.074 ± 0.28
3.583IleSer: 3.583 ± 0.257
2.962IleThr: 2.962 ± 0.23
5.808IleVal: 5.808 ± 0.281
0.39IleTrp: 0.39 ± 0.081
2.124IleTyr: 2.124 ± 0.155
0.0IleXaa: 0.0 ± 0.0
Lys
5.634LysAla: 5.634 ± 0.265
0.347LysCys: 0.347 ± 0.075
2.86LysAsp: 2.86 ± 0.222
4.565LysGlu: 4.565 ± 0.324
1.257LysPhe: 1.257 ± 0.126
4.869LysGly: 4.869 ± 0.266
0.535LysHis: 0.535 ± 0.088
2.095LysIle: 2.095 ± 0.215
2.239LysLys: 2.239 ± 0.208
5.071LysLeu: 5.071 ± 0.262
0.881LysMet: 0.881 ± 0.118
0.823LysAsn: 0.823 ± 0.118
2.08LysPro: 2.08 ± 0.167
0.939LysGln: 0.939 ± 0.127
3.496LysArg: 3.496 ± 0.226
2.268LysSer: 2.268 ± 0.166
1.864LysThr: 1.864 ± 0.172
6.4LysVal: 6.4 ± 0.327
0.376LysTrp: 0.376 ± 0.091
1.748LysTyr: 1.748 ± 0.156
0.0LysXaa: 0.0 ± 0.0
Leu
11.196LeuAla: 11.196 ± 0.446
0.881LeuCys: 0.881 ± 0.14
5.244LeuAsp: 5.244 ± 0.347
8.567LeuGlu: 8.567 ± 0.419
2.6LeuPhe: 2.6 ± 0.185
8.711LeuGly: 8.711 ± 0.363
1.416LeuHis: 1.416 ± 0.126
5.764LeuIle: 5.764 ± 0.28
5.894LeuLys: 5.894 ± 0.297
11.196LeuLeu: 11.196 ± 0.472
2.803LeuMet: 2.803 ± 0.201
2.933LeuAsn: 2.933 ± 0.201
4.276LeuPro: 4.276 ± 0.266
2.413LeuGln: 2.413 ± 0.174
9.853LeuArg: 9.853 ± 0.438
8.235LeuSer: 8.235 ± 0.389
5.461LeuThr: 5.461 ± 0.276
9.26LeuVal: 9.26 ± 0.374
1.112LeuTrp: 1.112 ± 0.107
3.092LeuTyr: 3.092 ± 0.24
0.0LeuXaa: 0.0 ± 0.0
Met
2.456MetAla: 2.456 ± 0.173
0.087MetCys: 0.087 ± 0.033
0.968MetAsp: 0.968 ± 0.114
1.329MetGlu: 1.329 ± 0.141
0.592MetPhe: 0.592 ± 0.09
1.893MetGly: 1.893 ± 0.167
0.376MetHis: 0.376 ± 0.067
1.228MetIle: 1.228 ± 0.131
1.141MetLys: 1.141 ± 0.132
1.95MetLeu: 1.95 ± 0.147
0.491MetMet: 0.491 ± 0.079
0.636MetAsn: 0.636 ± 0.099
1.546MetPro: 1.546 ± 0.172
0.491MetGln: 0.491 ± 0.079
1.43MetArg: 1.43 ± 0.127
1.835MetSer: 1.835 ± 0.176
1.271MetThr: 1.271 ± 0.146
1.69MetVal: 1.69 ± 0.157
0.26MetTrp: 0.26 ± 0.063
0.563MetTyr: 0.563 ± 0.091
0.0MetXaa: 0.0 ± 0.0
Asn
2.109AsnAla: 2.109 ± 0.162
0.188AsnCys: 0.188 ± 0.052
1.214AsnAsp: 1.214 ± 0.154
1.474AsnGlu: 1.474 ± 0.153
0.665AsnPhe: 0.665 ± 0.096
1.864AsnGly: 1.864 ± 0.208
0.144AsnHis: 0.144 ± 0.049
1.575AsnIle: 1.575 ± 0.133
1.3AsnLys: 1.3 ± 0.144
2.181AsnLeu: 2.181 ± 0.171
0.766AsnMet: 0.766 ± 0.108
0.708AsnAsn: 0.708 ± 0.115
1.705AsnPro: 1.705 ± 0.151
0.448AsnGln: 0.448 ± 0.085
1.604AsnArg: 1.604 ± 0.165
1.17AsnSer: 1.17 ± 0.134
0.708AsnThr: 0.708 ± 0.124
2.759AsnVal: 2.759 ± 0.217
0.26AsnTrp: 0.26 ± 0.069
1.084AsnTyr: 1.084 ± 0.137
0.0AsnXaa: 0.0 ± 0.0
Pro
3.381ProAla: 3.381 ± 0.214
0.289ProCys: 0.289 ± 0.055
2.499ProAsp: 2.499 ± 0.207
3.987ProGlu: 3.987 ± 0.262
1.69ProPhe: 1.69 ± 0.148
3.583ProGly: 3.583 ± 0.253
0.852ProHis: 0.852 ± 0.104
2.196ProIle: 2.196 ± 0.185
2.514ProLys: 2.514 ± 0.21
5.316ProLeu: 5.316 ± 0.286
0.953ProMet: 0.953 ± 0.104
1.127ProAsn: 1.127 ± 0.158
3.034ProPro: 3.034 ± 0.211
1.589ProGln: 1.589 ± 0.148
3.453ProArg: 3.453 ± 0.229
3.554ProSer: 3.554 ± 0.25
2.514ProThr: 2.514 ± 0.207
4.016ProVal: 4.016 ± 0.222
0.838ProTrp: 0.838 ± 0.113
1.661ProTyr: 1.661 ± 0.16
0.0ProXaa: 0.0 ± 0.0
Gln
2.658GlnAla: 2.658 ± 0.225
0.188GlnCys: 0.188 ± 0.049
0.896GlnAsp: 0.896 ± 0.098
1.618GlnGlu: 1.618 ± 0.188
0.506GlnPhe: 0.506 ± 0.096
1.661GlnGly: 1.661 ± 0.148
0.289GlnHis: 0.289 ± 0.059
1.141GlnIle: 1.141 ± 0.129
0.852GlnLys: 0.852 ± 0.113
2.832GlnLeu: 2.832 ± 0.242
0.419GlnMet: 0.419 ± 0.075
0.318GlnAsn: 0.318 ± 0.064
1.127GlnPro: 1.127 ± 0.146
0.766GlnGln: 0.766 ± 0.112
2.037GlnArg: 2.037 ± 0.179
1.026GlnSer: 1.026 ± 0.134
0.78GlnThr: 0.78 ± 0.11
2.181GlnVal: 2.181 ± 0.157
0.419GlnTrp: 0.419 ± 0.095
0.607GlnTyr: 0.607 ± 0.075
0.0GlnXaa: 0.0 ± 0.0
Arg
7.252ArgAla: 7.252 ± 0.361
0.549ArgCys: 0.549 ± 0.112
3.149ArgAsp: 3.149 ± 0.209
6.097ArgGlu: 6.097 ± 0.351
1.965ArgPhe: 1.965 ± 0.139
6.487ArgGly: 6.487 ± 0.318
1.271ArgHis: 1.271 ± 0.139
3.482ArgIle: 3.482 ± 0.266
3.482ArgLys: 3.482 ± 0.249
9.463ArgLeu: 9.463 ± 0.435
1.069ArgMet: 1.069 ± 0.124
1.676ArgAsn: 1.676 ± 0.15
4.522ArgPro: 4.522 ± 0.269
1.647ArgGln: 1.647 ± 0.148
6.342ArgArg: 6.342 ± 0.397
4.507ArgSer: 4.507 ± 0.257
2.745ArgThr: 2.745 ± 0.205
6.458ArgVal: 6.458 ± 0.369
1.011ArgTrp: 1.011 ± 0.139
2.008ArgTyr: 2.008 ± 0.163
0.0ArgXaa: 0.0 ± 0.0
Ser
4.854SerAla: 4.854 ± 0.27
0.491SerCys: 0.491 ± 0.093
2.73SerAsp: 2.73 ± 0.203
4.854SerGlu: 4.854 ± 0.278
2.08SerPhe: 2.08 ± 0.171
4.999SerGly: 4.999 ± 0.257
0.823SerHis: 0.823 ± 0.1
3.308SerIle: 3.308 ± 0.2
3.308SerLys: 3.308 ± 0.2
8.09SerLeu: 8.09 ± 0.379
1.575SerMet: 1.575 ± 0.132
1.214SerAsn: 1.214 ± 0.159
3.409SerPro: 3.409 ± 0.226
1.878SerGln: 1.878 ± 0.155
5.157SerArg: 5.157 ± 0.276
4.305SerSer: 4.305 ± 0.264
2.687SerThr: 2.687 ± 0.197
5.042SerVal: 5.042 ± 0.29
0.78SerTrp: 0.78 ± 0.115
2.311SerTyr: 2.311 ± 0.161
0.0SerXaa: 0.0 ± 0.0
Thr
3.742ThrAla: 3.742 ± 0.219
0.419ThrCys: 0.419 ± 0.081
1.517ThrAsp: 1.517 ± 0.175
2.528ThrGlu: 2.528 ± 0.178
1.459ThrPhe: 1.459 ± 0.161
4.002ThrGly: 4.002 ± 0.231
0.65ThrHis: 0.65 ± 0.104
2.615ThrIle: 2.615 ± 0.194
1.965ThrLys: 1.965 ± 0.175
4.854ThrLeu: 4.854 ± 0.267
1.228ThrMet: 1.228 ± 0.125
1.185ThrAsn: 1.185 ± 0.134
2.803ThrPro: 2.803 ± 0.191
0.925ThrGln: 0.925 ± 0.128
2.543ThrArg: 2.543 ± 0.175
2.918ThrSer: 2.918 ± 0.219
2.181ThrThr: 2.181 ± 0.177
3.756ThrVal: 3.756 ± 0.194
0.433ThrTrp: 0.433 ± 0.062
1.604ThrTyr: 1.604 ± 0.144
0.0ThrXaa: 0.0 ± 0.0
Val
7.628ValAla: 7.628 ± 0.402
0.766ValCys: 0.766 ± 0.136
5.244ValAsp: 5.244 ± 0.318
6.559ValGlu: 6.559 ± 0.324
2.557ValPhe: 2.557 ± 0.194
6.241ValGly: 6.241 ± 0.333
1.127ValHis: 1.127 ± 0.129
6.53ValIle: 6.53 ± 0.278
5.981ValLys: 5.981 ± 0.293
9.506ValLeu: 9.506 ± 0.462
2.268ValMet: 2.268 ± 0.204
2.99ValAsn: 2.99 ± 0.197
4.06ValPro: 4.06 ± 0.228
1.69ValGln: 1.69 ± 0.169
6.891ValArg: 6.891 ± 0.335
6.718ValSer: 6.718 ± 0.35
5.013ValThr: 5.013 ± 0.272
9.13ValVal: 9.13 ± 0.394
0.693ValTrp: 0.693 ± 0.09
2.817ValTyr: 2.817 ± 0.227
0.0ValXaa: 0.0 ± 0.0
Trp
1.112TrpAla: 1.112 ± 0.14
0.043TrpCys: 0.043 ± 0.026
0.679TrpAsp: 0.679 ± 0.089
0.737TrpGlu: 0.737 ± 0.114
0.289TrpPhe: 0.289 ± 0.062
0.867TrpGly: 0.867 ± 0.118
0.274TrpHis: 0.274 ± 0.072
0.636TrpIle: 0.636 ± 0.09
0.592TrpLys: 0.592 ± 0.091
1.228TrpLeu: 1.228 ± 0.132
0.246TrpMet: 0.246 ± 0.057
0.303TrpAsn: 0.303 ± 0.113
0.535TrpPro: 0.535 ± 0.09
0.303TrpGln: 0.303 ± 0.07
0.925TrpArg: 0.925 ± 0.12
0.679TrpSer: 0.679 ± 0.085
0.52TrpThr: 0.52 ± 0.106
1.069TrpVal: 1.069 ± 0.122
0.202TrpTrp: 0.202 ± 0.05
0.347TrpTyr: 0.347 ± 0.079
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.759TyrAla: 2.759 ± 0.199
0.173TyrCys: 0.173 ± 0.058
1.849TyrAsp: 1.849 ± 0.159
2.427TyrGlu: 2.427 ± 0.199
1.098TyrPhe: 1.098 ± 0.124
2.745TyrGly: 2.745 ± 0.188
0.535TyrHis: 0.535 ± 0.1
1.82TyrIle: 1.82 ± 0.181
1.271TyrLys: 1.271 ± 0.146
3.828TyrLeu: 3.828 ± 0.265
0.838TyrMet: 0.838 ± 0.092
0.809TyrAsn: 0.809 ± 0.111
1.387TyrPro: 1.387 ± 0.145
0.535TyrGln: 0.535 ± 0.11
2.47TyrArg: 2.47 ± 0.219
1.762TyrSer: 1.762 ± 0.156
1.632TyrThr: 1.632 ± 0.163
3.669TyrVal: 3.669 ± 0.214
0.448TyrTrp: 0.448 ± 0.078
1.358TyrTyr: 1.358 ± 0.157
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 281 proteins (69221 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski