Amino acid dipepetide frequency for Camelpox virus (strain CMS)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.161AlaAla: 2.161 ± 0.244
1.064AlaCys: 1.064 ± 0.158
1.89AlaAsp: 1.89 ± 0.185
1.827AlaGlu: 1.827 ± 0.179
1.652AlaPhe: 1.652 ± 0.176
1.541AlaGly: 1.541 ± 0.154
0.556AlaHis: 0.556 ± 0.101
3.749AlaIle: 3.749 ± 0.254
2.701AlaLys: 2.701 ± 0.194
3.225AlaLeu: 3.225 ± 0.236
1.096AlaMet: 1.096 ± 0.124
2.192AlaAsn: 2.192 ± 0.166
1.096AlaPro: 1.096 ± 0.129
0.699AlaGln: 0.699 ± 0.104
1.398AlaArg: 1.398 ± 0.154
3.368AlaSer: 3.368 ± 0.225
2.478AlaThr: 2.478 ± 0.215
2.939AlaVal: 2.939 ± 0.193
0.238AlaTrp: 0.238 ± 0.056
1.589AlaTyr: 1.589 ± 0.137
0.0AlaXaa: 0.0 ± 0.0
Cys
0.874CysAla: 0.874 ± 0.122
0.572CysCys: 0.572 ± 0.083
1.366CysAsp: 1.366 ± 0.139
0.937CysGlu: 0.937 ± 0.142
0.778CysPhe: 0.778 ± 0.115
1.128CysGly: 1.128 ± 0.133
0.492CysHis: 0.492 ± 0.094
2.224CysIle: 2.224 ± 0.206
1.255CysLys: 1.255 ± 0.133
1.716CysLeu: 1.716 ± 0.166
0.699CysMet: 0.699 ± 0.108
1.239CysAsn: 1.239 ± 0.147
0.683CysPro: 0.683 ± 0.102
0.429CysGln: 0.429 ± 0.104
0.906CysArg: 0.906 ± 0.114
1.763CysSer: 1.763 ± 0.202
1.334CysThr: 1.334 ± 0.151
1.414CysVal: 1.414 ± 0.146
0.254CysTrp: 0.254 ± 0.063
1.239CysTyr: 1.239 ± 0.115
0.0CysXaa: 0.0 ± 0.0
Asp
2.478AspAla: 2.478 ± 0.203
0.937AspCys: 0.937 ± 0.134
5.195AspAsp: 5.195 ± 0.368
4.115AspGlu: 4.115 ± 0.267
2.86AspPhe: 2.86 ± 0.196
2.78AspGly: 2.78 ± 0.196
1.112AspHis: 1.112 ± 0.121
7.911AspIle: 7.911 ± 0.386
4.734AspLys: 4.734 ± 0.272
4.575AspLeu: 4.575 ± 0.302
1.779AspMet: 1.779 ± 0.184
4.623AspAsn: 4.623 ± 0.326
1.763AspPro: 1.763 ± 0.173
1.096AspGln: 1.096 ± 0.131
2.224AspArg: 2.224 ± 0.19
4.146AspSer: 4.146 ± 0.252
3.701AspThr: 3.701 ± 0.264
4.496AspVal: 4.496 ± 0.263
0.508AspTrp: 0.508 ± 0.106
3.368AspTyr: 3.368 ± 0.252
0.0AspXaa: 0.0 ± 0.0
Glu
1.922GluAla: 1.922 ± 0.165
1.144GluCys: 1.144 ± 0.136
3.431GluAsp: 3.431 ± 0.241
2.955GluGlu: 2.955 ± 0.273
2.399GluPhe: 2.399 ± 0.193
1.668GluGly: 1.668 ± 0.195
1.048GluHis: 1.048 ± 0.135
4.718GluIle: 4.718 ± 0.31
3.4GluLys: 3.4 ± 0.221
4.893GluLeu: 4.893 ± 0.323
1.43GluMet: 1.43 ± 0.175
3.209GluAsn: 3.209 ± 0.269
1.7GluPro: 1.7 ± 0.175
1.366GluGln: 1.366 ± 0.147
2.335GluArg: 2.335 ± 0.225
3.972GluSer: 3.972 ± 0.242
3.177GluThr: 3.177 ± 0.208
2.446GluVal: 2.446 ± 0.202
0.508GluTrp: 0.508 ± 0.083
3.447GluTyr: 3.447 ± 0.252
0.0GluXaa: 0.0 ± 0.0
Phe
1.541PheAla: 1.541 ± 0.17
1.064PheCys: 1.064 ± 0.127
3.161PheAsp: 3.161 ± 0.246
1.97PheGlu: 1.97 ± 0.162
2.399PhePhe: 2.399 ± 0.179
1.875PheGly: 1.875 ± 0.151
0.763PheHis: 0.763 ± 0.108
4.893PheIle: 4.893 ± 0.254
3.209PheLys: 3.209 ± 0.232
4.273PheLeu: 4.273 ± 0.273
1.334PheMet: 1.334 ± 0.16
3.733PheAsn: 3.733 ± 0.215
1.303PhePro: 1.303 ± 0.157
0.826PheGln: 0.826 ± 0.107
1.906PheArg: 1.906 ± 0.177
3.94PheSer: 3.94 ± 0.242
3.034PheThr: 3.034 ± 0.239
3.002PheVal: 3.002 ± 0.224
0.445PheTrp: 0.445 ± 0.082
2.256PheTyr: 2.256 ± 0.179
0.0PheXaa: 0.0 ± 0.0
Gly
1.859GlyAla: 1.859 ± 0.173
0.763GlyCys: 0.763 ± 0.107
2.526GlyAsp: 2.526 ± 0.168
1.906GlyGlu: 1.906 ± 0.18
1.732GlyPhe: 1.732 ± 0.18
2.161GlyGly: 2.161 ± 0.22
0.921GlyHis: 0.921 ± 0.132
3.781GlyIle: 3.781 ± 0.239
2.891GlyLys: 2.891 ± 0.222
3.257GlyLeu: 3.257 ± 0.2
0.89GlyMet: 0.89 ± 0.124
2.78GlyAsn: 2.78 ± 0.2
0.778GlyPro: 0.778 ± 0.111
0.667GlyGln: 0.667 ± 0.104
1.811GlyArg: 1.811 ± 0.148
3.082GlySer: 3.082 ± 0.248
2.319GlyThr: 2.319 ± 0.225
2.78GlyVal: 2.78 ± 0.218
0.207GlyTrp: 0.207 ± 0.054
2.256GlyTyr: 2.256 ± 0.217
0.0GlyXaa: 0.0 ± 0.0
His
0.921HisAla: 0.921 ± 0.106
0.635HisCys: 0.635 ± 0.111
1.176HisAsp: 1.176 ± 0.114
0.874HisGlu: 0.874 ± 0.101
0.874HisPhe: 0.874 ± 0.116
1.017HisGly: 1.017 ± 0.122
0.667HisHis: 0.667 ± 0.114
2.415HisIle: 2.415 ± 0.201
1.303HisLys: 1.303 ± 0.128
2.065HisLeu: 2.065 ± 0.165
0.62HisMet: 0.62 ± 0.094
1.287HisAsn: 1.287 ± 0.145
0.826HisPro: 0.826 ± 0.095
0.492HisGln: 0.492 ± 0.075
1.176HisArg: 1.176 ± 0.135
1.43HisSer: 1.43 ± 0.144
1.255HisThr: 1.255 ± 0.126
1.414HisVal: 1.414 ± 0.152
0.254HisTrp: 0.254 ± 0.067
0.969HisTyr: 0.969 ± 0.172
0.0HisXaa: 0.0 ± 0.0
Ile
3.066IleAla: 3.066 ± 0.21
1.684IleCys: 1.684 ± 0.187
6.99IleAsp: 6.99 ± 0.331
4.798IleGlu: 4.798 ± 0.248
4.289IlePhe: 4.289 ± 0.291
3.94IleGly: 3.94 ± 0.267
2.272IleHis: 2.272 ± 0.186
8.801IleIle: 8.801 ± 0.445
6.958IleLys: 6.958 ± 0.359
8.706IleLeu: 8.706 ± 0.463
2.399IleMet: 2.399 ± 0.214
7.482IleAsn: 7.482 ± 0.349
3.511IlePro: 3.511 ± 0.264
2.272IleGln: 2.272 ± 0.206
3.86IleArg: 3.86 ± 0.218
8.785IleSer: 8.785 ± 0.414
5.449IleThr: 5.449 ± 0.337
5.846IleVal: 5.846 ± 0.298
0.604IleTrp: 0.604 ± 0.101
4.941IleTyr: 4.941 ± 0.342
0.0IleXaa: 0.0 ± 0.0
Lys
1.811LysAla: 1.811 ± 0.185
1.716LysCys: 1.716 ± 0.199
4.496LysAsp: 4.496 ± 0.262
3.701LysGlu: 3.701 ± 0.246
3.018LysPhe: 3.018 ± 0.236
1.875LysGly: 1.875 ± 0.148
1.763LysHis: 1.763 ± 0.185
6.275LysIle: 6.275 ± 0.311
5.37LysLys: 5.37 ± 0.303
6.529LysLeu: 6.529 ± 0.29
2.018LysMet: 2.018 ± 0.184
4.829LysAsn: 4.829 ± 0.312
2.018LysPro: 2.018 ± 0.144
1.97LysGln: 1.97 ± 0.194
3.67LysArg: 3.67 ± 0.271
5.512LysSer: 5.512 ± 0.33
4.019LysThr: 4.019 ± 0.279
3.797LysVal: 3.797 ± 0.31
0.794LysTrp: 0.794 ± 0.121
4.464LysTyr: 4.464 ± 0.249
0.0LysXaa: 0.0 ± 0.0
Leu
3.4LeuAla: 3.4 ± 0.257
1.779LeuCys: 1.779 ± 0.164
5.481LeuAsp: 5.481 ± 0.293
4.813LeuGlu: 4.813 ± 0.309
4.766LeuPhe: 4.766 ± 0.319
3.241LeuGly: 3.241 ± 0.266
1.954LeuHis: 1.954 ± 0.192
7.26LeuIle: 7.26 ± 0.338
5.957LeuLys: 5.957 ± 0.288
9.643LeuLeu: 9.643 ± 0.402
2.51LeuMet: 2.51 ± 0.206
5.354LeuAsn: 5.354 ± 0.296
3.336LeuPro: 3.336 ± 0.238
1.97LeuGln: 1.97 ± 0.171
3.463LeuArg: 3.463 ± 0.248
8.245LeuSer: 8.245 ± 0.304
5.878LeuThr: 5.878 ± 0.348
5.449LeuVal: 5.449 ± 0.292
0.556LeuTrp: 0.556 ± 0.106
4.972LeuTyr: 4.972 ± 0.306
0.0LeuXaa: 0.0 ± 0.0
Met
1.525MetAla: 1.525 ± 0.165
0.62MetCys: 0.62 ± 0.105
2.018MetAsp: 2.018 ± 0.167
1.509MetGlu: 1.509 ± 0.134
1.382MetPhe: 1.382 ± 0.171
0.906MetGly: 0.906 ± 0.125
0.429MetHis: 0.429 ± 0.077
2.637MetIle: 2.637 ± 0.229
1.875MetLys: 1.875 ± 0.186
2.669MetLeu: 2.669 ± 0.201
0.906MetMet: 0.906 ± 0.134
1.97MetAsn: 1.97 ± 0.201
0.953MetPro: 0.953 ± 0.104
0.508MetGln: 0.508 ± 0.079
1.176MetArg: 1.176 ± 0.137
2.383MetSer: 2.383 ± 0.192
1.525MetThr: 1.525 ± 0.164
1.462MetVal: 1.462 ± 0.152
0.143MetTrp: 0.143 ± 0.048
1.541MetTyr: 1.541 ± 0.141
0.0MetXaa: 0.0 ± 0.0
Asn
2.701AsnAla: 2.701 ± 0.167
1.223AsnCys: 1.223 ± 0.142
4.575AsnAsp: 4.575 ± 0.319
3.574AsnGlu: 3.574 ± 0.233
2.701AsnPhe: 2.701 ± 0.208
3.034AsnGly: 3.034 ± 0.236
1.398AsnHis: 1.398 ± 0.15
7.657AsnIle: 7.657 ± 0.417
5.338AsnLys: 5.338 ± 0.272
5.258AsnLeu: 5.258 ± 0.291
1.954AsnMet: 1.954 ± 0.163
5.624AsnAsn: 5.624 ± 0.354
2.256AsnPro: 2.256 ± 0.201
1.382AsnGln: 1.382 ± 0.146
3.002AsnArg: 3.002 ± 0.2
4.734AsnSer: 4.734 ± 0.296
4.639AsnThr: 4.639 ± 0.28
4.257AsnVal: 4.257 ± 0.271
0.397AsnTrp: 0.397 ± 0.072
3.304AsnTyr: 3.304 ± 0.27
0.0AsnXaa: 0.0 ± 0.0
Pro
1.191ProAla: 1.191 ± 0.122
0.556ProCys: 0.556 ± 0.119
1.795ProAsp: 1.795 ± 0.183
2.192ProGlu: 2.192 ± 0.168
1.573ProPhe: 1.573 ± 0.138
1.446ProGly: 1.446 ± 0.172
0.667ProHis: 0.667 ± 0.116
3.114ProIle: 3.114 ± 0.235
1.906ProLys: 1.906 ± 0.174
3.114ProLeu: 3.114 ± 0.217
1.064ProMet: 1.064 ± 0.125
2.081ProAsn: 2.081 ± 0.181
1.541ProPro: 1.541 ± 0.204
0.699ProGln: 0.699 ± 0.104
1.652ProArg: 1.652 ± 0.199
2.748ProSer: 2.748 ± 0.207
2.24ProThr: 2.24 ± 0.185
2.113ProVal: 2.113 ± 0.184
0.27ProTrp: 0.27 ± 0.067
1.62ProTyr: 1.62 ± 0.163
0.0ProXaa: 0.0 ± 0.0
Gln
0.572GlnAla: 0.572 ± 0.1
0.54GlnCys: 0.54 ± 0.084
1.239GlnAsp: 1.239 ± 0.131
1.191GlnGlu: 1.191 ± 0.158
1.08GlnPhe: 1.08 ± 0.117
0.683GlnGly: 0.683 ± 0.143
0.604GlnHis: 0.604 ± 0.104
1.62GlnIle: 1.62 ± 0.153
1.446GlnLys: 1.446 ± 0.17
2.335GlnLeu: 2.335 ± 0.204
0.715GlnMet: 0.715 ± 0.1
1.43GlnAsn: 1.43 ± 0.156
0.747GlnPro: 0.747 ± 0.152
0.953GlnGln: 0.953 ± 0.143
1.176GlnArg: 1.176 ± 0.148
1.557GlnSer: 1.557 ± 0.155
1.43GlnThr: 1.43 ± 0.17
0.89GlnVal: 0.89 ± 0.13
0.254GlnTrp: 0.254 ± 0.053
1.541GlnTyr: 1.541 ± 0.162
0.0GlnXaa: 0.0 ± 0.0
Arg
1.144ArgAla: 1.144 ± 0.131
1.001ArgCys: 1.001 ± 0.121
2.574ArgAsp: 2.574 ± 0.212
1.89ArgGlu: 1.89 ± 0.193
2.256ArgPhe: 2.256 ± 0.196
1.827ArgGly: 1.827 ± 0.185
1.334ArgHis: 1.334 ± 0.152
3.447ArgIle: 3.447 ± 0.192
2.351ArgLys: 2.351 ± 0.197
4.385ArgLeu: 4.385 ± 0.187
1.096ArgMet: 1.096 ± 0.124
2.939ArgAsn: 2.939 ± 0.223
1.366ArgPro: 1.366 ± 0.15
1.287ArgGln: 1.287 ± 0.166
2.717ArgArg: 2.717 ± 0.222
3.384ArgSer: 3.384 ± 0.266
2.272ArgThr: 2.272 ± 0.196
2.653ArgVal: 2.653 ± 0.181
0.445ArgTrp: 0.445 ± 0.074
2.796ArgTyr: 2.796 ± 0.192
0.0ArgXaa: 0.0 ± 0.0
Ser
3.13SerAla: 3.13 ± 0.214
1.7SerCys: 1.7 ± 0.176
4.798SerAsp: 4.798 ± 0.352
3.511SerGlu: 3.511 ± 0.283
3.844SerPhe: 3.844 ± 0.242
3.273SerGly: 3.273 ± 0.251
1.477SerHis: 1.477 ± 0.172
8.134SerIle: 8.134 ± 0.358
5.862SerLys: 5.862 ± 0.314
7.212SerLeu: 7.212 ± 0.327
2.478SerMet: 2.478 ± 0.212
5.211SerAsn: 5.211 ± 0.315
3.431SerPro: 3.431 ± 0.263
1.922SerGln: 1.922 ± 0.198
3.431SerArg: 3.431 ± 0.237
7.498SerSer: 7.498 ± 0.416
5.465SerThr: 5.465 ± 0.432
5.497SerVal: 5.497 ± 0.273
0.508SerTrp: 0.508 ± 0.086
3.431SerTyr: 3.431 ± 0.265
0.0SerXaa: 0.0 ± 0.0
Thr
2.367ThrAla: 2.367 ± 0.208
1.382ThrCys: 1.382 ± 0.152
4.003ThrAsp: 4.003 ± 0.29
3.336ThrGlu: 3.336 ± 0.241
2.637ThrPhe: 2.637 ± 0.196
2.399ThrGly: 2.399 ± 0.187
1.43ThrHis: 1.43 ± 0.143
5.894ThrIle: 5.894 ± 0.306
4.226ThrLys: 4.226 ± 0.221
5.306ThrLeu: 5.306 ± 0.321
1.843ThrMet: 1.843 ± 0.162
4.099ThrAsn: 4.099 ± 0.241
2.319ThrPro: 2.319 ± 0.263
1.096ThrGln: 1.096 ± 0.154
2.351ThrArg: 2.351 ± 0.182
5.274ThrSer: 5.274 ± 0.29
3.987ThrThr: 3.987 ± 0.268
4.035ThrVal: 4.035 ± 0.228
0.508ThrTrp: 0.508 ± 0.084
2.812ThrTyr: 2.812 ± 0.233
0.0ThrXaa: 0.0 ± 0.0
Val
2.558ValAla: 2.558 ± 0.184
1.43ValCys: 1.43 ± 0.16
4.067ValAsp: 4.067 ± 0.208
3.447ValGlu: 3.447 ± 0.264
3.558ValPhe: 3.558 ± 0.277
1.811ValGly: 1.811 ± 0.162
1.223ValHis: 1.223 ± 0.132
5.751ValIle: 5.751 ± 0.318
4.464ValLys: 4.464 ± 0.316
5.068ValLeu: 5.068 ± 0.289
1.462ValMet: 1.462 ± 0.151
4.575ValAsn: 4.575 ± 0.256
1.843ValPro: 1.843 ± 0.177
1.287ValGln: 1.287 ± 0.143
2.351ValArg: 2.351 ± 0.194
5.385ValSer: 5.385 ± 0.289
3.749ValThr: 3.749 ± 0.224
3.543ValVal: 3.543 ± 0.255
0.286ValTrp: 0.286 ± 0.076
3.463ValTyr: 3.463 ± 0.252
0.0ValXaa: 0.0 ± 0.0
Trp
0.191TrpAla: 0.191 ± 0.058
0.191TrpCys: 0.191 ± 0.047
0.334TrpAsp: 0.334 ± 0.069
0.381TrpGlu: 0.381 ± 0.081
0.524TrpPhe: 0.524 ± 0.091
0.254TrpGly: 0.254 ± 0.071
0.175TrpHis: 0.175 ± 0.043
0.731TrpIle: 0.731 ± 0.118
0.731TrpLys: 0.731 ± 0.1
0.794TrpLeu: 0.794 ± 0.106
0.365TrpMet: 0.365 ± 0.068
0.477TrpAsn: 0.477 ± 0.111
0.27TrpPro: 0.27 ± 0.069
0.159TrpGln: 0.159 ± 0.055
0.27TrpArg: 0.27 ± 0.069
0.461TrpSer: 0.461 ± 0.078
0.477TrpThr: 0.477 ± 0.07
0.381TrpVal: 0.381 ± 0.075
0.016TrpTrp: 0.016 ± 0.015
0.397TrpTyr: 0.397 ± 0.071
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.049TyrAla: 2.049 ± 0.193
1.303TyrCys: 1.303 ± 0.153
3.288TyrAsp: 3.288 ± 0.244
2.24TyrGlu: 2.24 ± 0.202
2.605TyrPhe: 2.605 ± 0.188
2.51TyrGly: 2.51 ± 0.188
1.382TyrHis: 1.382 ± 0.171
5.64TyrIle: 5.64 ± 0.292
3.686TyrLys: 3.686 ± 0.268
5.004TyrLeu: 5.004 ± 0.29
1.462TyrMet: 1.462 ± 0.137
3.876TyrAsn: 3.876 ± 0.353
1.843TyrPro: 1.843 ± 0.17
0.921TyrGln: 0.921 ± 0.122
2.288TyrArg: 2.288 ± 0.193
4.146TyrSer: 4.146 ± 0.275
2.875TyrThr: 2.875 ± 0.213
2.907TyrVal: 2.907 ± 0.191
0.381TyrTrp: 0.381 ± 0.08
3.066TyrTyr: 3.066 ± 0.244
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 261 proteins (62949 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski