Amino acid dipepetide frequency for Deerpox virus (strain Mule deer/United States/W-848-83/1983) (DPV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.35AlaAla: 1.35 ± 0.187
0.45AlaCys: 0.45 ± 0.097
1.546AlaAsp: 1.546 ± 0.175
1.233AlaGlu: 1.233 ± 0.153
1.135AlaPhe: 1.135 ± 0.151
1.037AlaGly: 1.037 ± 0.207
0.509AlaHis: 0.509 ± 0.105
3.503AlaIle: 3.503 ± 0.236
2.114AlaLys: 2.114 ± 0.206
2.642AlaLeu: 2.642 ± 0.233
0.665AlaMet: 0.665 ± 0.108
2.094AlaAsn: 2.094 ± 0.194
0.665AlaPro: 0.665 ± 0.118
0.47AlaGln: 0.47 ± 0.078
1.037AlaArg: 1.037 ± 0.163
2.446AlaSer: 2.446 ± 0.258
1.703AlaThr: 1.703 ± 0.169
1.389AlaVal: 1.389 ± 0.158
0.215AlaTrp: 0.215 ± 0.064
1.507AlaTyr: 1.507 ± 0.16
0.0AlaXaa: 0.0 ± 0.0
Cys
0.568CysAla: 0.568 ± 0.112
0.607CysCys: 0.607 ± 0.121
1.213CysAsp: 1.213 ± 0.16
1.018CysGlu: 1.018 ± 0.142
0.978CysPhe: 0.978 ± 0.141
0.939CysGly: 0.939 ± 0.14
0.235CysHis: 0.235 ± 0.067
2.564CysIle: 2.564 ± 0.215
1.448CysLys: 1.448 ± 0.175
1.526CysLeu: 1.526 ± 0.148
0.548CysMet: 0.548 ± 0.096
2.114CysAsn: 2.114 ± 0.21
0.587CysPro: 0.587 ± 0.108
0.372CysGln: 0.372 ± 0.088
0.45CysArg: 0.45 ± 0.087
1.566CysSer: 1.566 ± 0.174
0.959CysThr: 0.959 ± 0.124
1.057CysVal: 1.057 ± 0.136
0.157CysTrp: 0.157 ± 0.062
1.174CysTyr: 1.174 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
1.487AspAla: 1.487 ± 0.151
0.939AspCys: 0.939 ± 0.163
4.599AspAsp: 4.599 ± 0.276
4.697AspGlu: 4.697 ± 0.27
3.19AspPhe: 3.19 ± 0.261
2.564AspGly: 2.564 ± 0.254
0.705AspHis: 0.705 ± 0.128
8.611AspIle: 8.611 ± 0.44
5.323AspLys: 5.323 ± 0.359
4.54AspLeu: 4.54 ± 0.341
1.624AspMet: 1.624 ± 0.159
4.853AspAsn: 4.853 ± 0.26
1.35AspPro: 1.35 ± 0.145
1.057AspGln: 1.057 ± 0.142
1.663AspArg: 1.663 ± 0.178
3.699AspSer: 3.699 ± 0.262
3.17AspThr: 3.17 ± 0.29
3.894AspVal: 3.894 ± 0.241
0.431AspTrp: 0.431 ± 0.099
3.229AspTyr: 3.229 ± 0.248
0.0AspXaa: 0.0 ± 0.0
Glu
1.292GluAla: 1.292 ± 0.138
1.115GluCys: 1.115 ± 0.146
4.129GluAsp: 4.129 ± 0.288
4.071GluGlu: 4.071 ± 0.347
2.642GluPhe: 2.642 ± 0.254
1.233GluGly: 1.233 ± 0.141
0.959GluHis: 0.959 ± 0.131
5.988GluIle: 5.988 ± 0.303
5.558GluLys: 5.558 ± 0.395
5.108GluLeu: 5.108 ± 0.328
1.429GluMet: 1.429 ± 0.174
5.049GluAsn: 5.049 ± 0.283
1.429GluPro: 1.429 ± 0.158
1.155GluGln: 1.155 ± 0.167
1.663GluArg: 1.663 ± 0.232
4.227GluSer: 4.227 ± 0.251
3.444GluThr: 3.444 ± 0.259
2.153GluVal: 2.153 ± 0.221
0.254GluTrp: 0.254 ± 0.08
3.209GluTyr: 3.209 ± 0.233
0.0GluXaa: 0.0 ± 0.0
Phe
1.115PheAla: 1.115 ± 0.138
1.076PheCys: 1.076 ± 0.136
3.581PheAsp: 3.581 ± 0.231
2.485PheGlu: 2.485 ± 0.228
2.662PhePhe: 2.662 ± 0.21
2.231PheGly: 2.231 ± 0.215
0.861PheHis: 0.861 ± 0.153
6.282PheIle: 6.282 ± 0.335
4.208PheLys: 4.208 ± 0.311
5.01PheLeu: 5.01 ± 0.343
1.605PheMet: 1.605 ± 0.17
4.501PheAsn: 4.501 ± 0.35
1.82PhePro: 1.82 ± 0.157
0.783PheGln: 0.783 ± 0.104
1.487PheArg: 1.487 ± 0.165
4.481PheSer: 4.481 ± 0.306
2.603PheThr: 2.603 ± 0.228
2.818PheVal: 2.818 ± 0.227
0.47PheTrp: 0.47 ± 0.105
2.818PheTyr: 2.818 ± 0.204
0.0PheXaa: 0.0 ± 0.0
Gly
1.429GlyAla: 1.429 ± 0.198
0.626GlyCys: 0.626 ± 0.112
2.231GlyAsp: 2.231 ± 0.201
1.84GlyGlu: 1.84 ± 0.213
2.348GlyPhe: 2.348 ± 0.262
1.781GlyGly: 1.781 ± 0.247
0.431GlyHis: 0.431 ± 0.093
4.481GlyIle: 4.481 ± 0.308
3.483GlyLys: 3.483 ± 0.259
2.564GlyLeu: 2.564 ± 0.201
0.646GlyMet: 0.646 ± 0.126
3.425GlyAsn: 3.425 ± 0.257
0.685GlyPro: 0.685 ± 0.101
0.509GlyGln: 0.509 ± 0.092
1.35GlyArg: 1.35 ± 0.184
2.388GlySer: 2.388 ± 0.198
1.605GlyThr: 1.605 ± 0.187
1.742GlyVal: 1.742 ± 0.185
0.254GlyTrp: 0.254 ± 0.078
2.133GlyTyr: 2.133 ± 0.224
0.0GlyXaa: 0.0 ± 0.0
His
0.45HisAla: 0.45 ± 0.085
0.509HisCys: 0.509 ± 0.122
0.822HisAsp: 0.822 ± 0.113
1.018HisGlu: 1.018 ± 0.137
0.783HisPhe: 0.783 ± 0.126
0.802HisGly: 0.802 ± 0.128
0.313HisHis: 0.313 ± 0.083
2.525HisIle: 2.525 ± 0.195
1.233HisLys: 1.233 ± 0.165
1.526HisLeu: 1.526 ± 0.158
0.489HisMet: 0.489 ± 0.097
1.487HisAsn: 1.487 ± 0.177
0.509HisPro: 0.509 ± 0.093
0.45HisGln: 0.45 ± 0.08
0.548HisArg: 0.548 ± 0.092
0.959HisSer: 0.959 ± 0.118
0.763HisThr: 0.763 ± 0.124
1.018HisVal: 1.018 ± 0.15
0.196HisTrp: 0.196 ± 0.059
0.822HisTyr: 0.822 ± 0.122
0.0HisXaa: 0.0 ± 0.0
Ile
3.112IleAla: 3.112 ± 0.212
2.485IleCys: 2.485 ± 0.204
7.495IleAsp: 7.495 ± 0.373
6.086IleGlu: 6.086 ± 0.385
6.204IlePhe: 6.204 ± 0.403
3.699IleGly: 3.699 ± 0.24
2.231IleHis: 2.231 ± 0.217
12.74IleIle: 12.74 ± 0.589
10.803IleLys: 10.803 ± 0.475
11.429IleLeu: 11.429 ± 0.542
2.192IleMet: 2.192 ± 0.179
11.214IleAsn: 11.214 ± 0.543
3.679IlePro: 3.679 ± 0.264
2.035IleGln: 2.035 ± 0.161
3.327IleArg: 3.327 ± 0.263
9.315IleSer: 9.315 ± 0.393
6.184IleThr: 6.184 ± 0.332
5.46IleVal: 5.46 ± 0.295
0.607IleTrp: 0.607 ± 0.111
5.851IleTyr: 5.851 ± 0.322
0.0IleXaa: 0.0 ± 0.0
Lys
2.114LysAla: 2.114 ± 0.199
1.429LysCys: 1.429 ± 0.14
4.912LysAsp: 4.912 ± 0.276
5.049LysGlu: 5.049 ± 0.28
4.11LysPhe: 4.11 ± 0.283
2.72LysGly: 2.72 ± 0.223
1.683LysHis: 1.683 ± 0.186
10.372LysIle: 10.372 ± 0.468
9.472LysLys: 9.472 ± 0.457
8.043LysLeu: 8.043 ± 0.411
2.525LysMet: 2.525 ± 0.203
8.337LysAsn: 8.337 ± 0.381
2.466LysPro: 2.466 ± 0.212
2.153LysGln: 2.153 ± 0.207
3.601LysArg: 3.601 ± 0.244
6.165LysSer: 6.165 ± 0.334
5.049LysThr: 5.049 ± 0.281
3.914LysVal: 3.914 ± 0.29
0.724LysTrp: 0.724 ± 0.154
6.047LysTyr: 6.047 ± 0.421
0.0LysXaa: 0.0 ± 0.0
Leu
2.74LeuAla: 2.74 ± 0.23
1.683LeuCys: 1.683 ± 0.197
5.303LeuAsp: 5.303 ± 0.33
5.519LeuGlu: 5.519 ± 0.331
5.147LeuPhe: 5.147 ± 0.346
2.916LeuGly: 2.916 ± 0.22
1.546LeuHis: 1.546 ± 0.204
9.746LeuIle: 9.746 ± 0.486
7.574LeuLys: 7.574 ± 0.366
9.922LeuLeu: 9.922 ± 0.543
2.133LeuMet: 2.133 ± 0.194
6.752LeuAsn: 6.752 ± 0.399
2.701LeuPro: 2.701 ± 0.242
2.094LeuGln: 2.094 ± 0.203
2.642LeuArg: 2.642 ± 0.268
8.532LeuSer: 8.532 ± 0.359
5.088LeuThr: 5.088 ± 0.342
3.934LeuVal: 3.934 ± 0.326
0.391LeuTrp: 0.391 ± 0.103
4.755LeuTyr: 4.755 ± 0.306
0.0LeuXaa: 0.0 ± 0.0
Met
1.076MetAla: 1.076 ± 0.144
0.313MetCys: 0.313 ± 0.086
1.683MetAsp: 1.683 ± 0.172
1.644MetGlu: 1.644 ± 0.144
1.546MetPhe: 1.546 ± 0.158
0.998MetGly: 0.998 ± 0.11
0.294MetHis: 0.294 ± 0.074
2.388MetIle: 2.388 ± 0.211
2.055MetLys: 2.055 ± 0.218
2.251MetLeu: 2.251 ± 0.225
0.763MetMet: 0.763 ± 0.106
1.742MetAsn: 1.742 ± 0.18
0.783MetPro: 0.783 ± 0.124
0.528MetGln: 0.528 ± 0.107
0.9MetArg: 0.9 ± 0.125
2.016MetSer: 2.016 ± 0.179
1.057MetThr: 1.057 ± 0.132
1.194MetVal: 1.194 ± 0.127
0.196MetTrp: 0.196 ± 0.066
1.507MetTyr: 1.507 ± 0.179
0.0MetXaa: 0.0 ± 0.0
Asn
1.663AsnAla: 1.663 ± 0.179
1.233AsnCys: 1.233 ± 0.164
5.401AsnAsp: 5.401 ± 0.304
4.403AsnGlu: 4.403 ± 0.374
3.875AsnPhe: 3.875 ± 0.229
3.268AsnGly: 3.268 ± 0.279
1.663AsnHis: 1.663 ± 0.165
10.959AsnIle: 10.959 ± 0.593
7.926AsnLys: 7.926 ± 0.379
6.497AsnLeu: 6.497 ± 0.374
2.525AsnMet: 2.525 ± 0.237
8.669AsnAsn: 8.669 ± 0.561
2.622AsnPro: 2.622 ± 0.23
1.918AsnGln: 1.918 ± 0.202
2.779AsnArg: 2.779 ± 0.262
5.538AsnSer: 5.538 ± 0.359
4.149AsnThr: 4.149 ± 0.316
4.266AsnVal: 4.266 ± 0.309
0.391AsnTrp: 0.391 ± 0.08
4.09AsnTyr: 4.09 ± 0.262
0.0AsnXaa: 0.0 ± 0.0
Pro
0.665ProAla: 0.665 ± 0.111
0.607ProCys: 0.607 ± 0.12
1.663ProAsp: 1.663 ± 0.173
1.8ProGlu: 1.8 ± 0.182
1.898ProPhe: 1.898 ± 0.19
1.018ProGly: 1.018 ± 0.147
0.372ProHis: 0.372 ± 0.088
3.19ProIle: 3.19 ± 0.235
2.662ProLys: 2.662 ± 0.232
2.72ProLeu: 2.72 ± 0.213
0.822ProMet: 0.822 ± 0.125
2.133ProAsn: 2.133 ± 0.205
1.115ProPro: 1.115 ± 0.171
0.626ProGln: 0.626 ± 0.103
0.959ProArg: 0.959 ± 0.145
2.329ProSer: 2.329 ± 0.206
1.624ProThr: 1.624 ± 0.176
1.272ProVal: 1.272 ± 0.179
0.215ProTrp: 0.215 ± 0.064
1.781ProTyr: 1.781 ± 0.16
0.0ProXaa: 0.0 ± 0.0
Gln
0.489GlnAla: 0.489 ± 0.106
0.372GlnCys: 0.372 ± 0.081
0.998GlnAsp: 0.998 ± 0.154
1.135GlnGlu: 1.135 ± 0.144
0.959GlnPhe: 0.959 ± 0.135
0.587GlnGly: 0.587 ± 0.1
0.431GlnHis: 0.431 ± 0.082
2.192GlnIle: 2.192 ± 0.206
2.114GlnLys: 2.114 ± 0.215
2.114GlnLeu: 2.114 ± 0.226
0.548GlnMet: 0.548 ± 0.114
1.272GlnAsn: 1.272 ± 0.156
0.528GlnPro: 0.528 ± 0.101
0.587GlnGln: 0.587 ± 0.112
0.744GlnArg: 0.744 ± 0.15
1.526GlnSer: 1.526 ± 0.175
1.331GlnThr: 1.331 ± 0.167
0.783GlnVal: 0.783 ± 0.119
0.235GlnTrp: 0.235 ± 0.06
1.272GlnTyr: 1.272 ± 0.138
0.0GlnXaa: 0.0 ± 0.0
Arg
0.802ArgAla: 0.802 ± 0.135
0.861ArgCys: 0.861 ± 0.138
1.605ArgAsp: 1.605 ± 0.185
1.546ArgGlu: 1.546 ± 0.161
2.016ArgPhe: 2.016 ± 0.195
1.233ArgGly: 1.233 ± 0.17
0.724ArgHis: 0.724 ± 0.119
3.17ArgIle: 3.17 ± 0.235
2.975ArgLys: 2.975 ± 0.272
2.838ArgLeu: 2.838 ± 0.199
0.705ArgMet: 0.705 ± 0.101
2.309ArgAsn: 2.309 ± 0.241
0.959ArgPro: 0.959 ± 0.135
0.822ArgGln: 0.822 ± 0.109
1.311ArgArg: 1.311 ± 0.162
2.446ArgSer: 2.446 ± 0.234
1.585ArgThr: 1.585 ± 0.149
1.644ArgVal: 1.644 ± 0.174
0.235ArgTrp: 0.235 ± 0.077
2.055ArgTyr: 2.055 ± 0.208
0.0ArgXaa: 0.0 ± 0.0
Ser
2.348SerAla: 2.348 ± 0.24
1.781SerCys: 1.781 ± 0.178
4.755SerAsp: 4.755 ± 0.282
4.286SerGlu: 4.286 ± 0.324
3.973SerPhe: 3.973 ± 0.245
2.74SerGly: 2.74 ± 0.231
1.252SerHis: 1.252 ± 0.17
9.08SerIle: 9.08 ± 0.427
7.182SerLys: 7.182 ± 0.403
7.632SerLeu: 7.632 ± 0.404
1.977SerMet: 1.977 ± 0.197
5.597SerAsn: 5.597 ± 0.357
2.074SerPro: 2.074 ± 0.271
1.703SerGln: 1.703 ± 0.182
2.446SerArg: 2.446 ± 0.187
6.399SerSer: 6.399 ± 0.587
3.934SerThr: 3.934 ± 0.306
3.542SerVal: 3.542 ± 0.305
0.528SerTrp: 0.528 ± 0.104
3.836SerTyr: 3.836 ± 0.261
0.0SerXaa: 0.0 ± 0.0
Thr
1.605ThrAla: 1.605 ± 0.151
1.409ThrCys: 1.409 ± 0.192
3.131ThrAsp: 3.131 ± 0.234
2.759ThrGlu: 2.759 ± 0.254
3.249ThrPhe: 3.249 ± 0.257
1.918ThrGly: 1.918 ± 0.181
1.213ThrHis: 1.213 ± 0.121
6.262ThrIle: 6.262 ± 0.373
4.892ThrLys: 4.892 ± 0.28
4.892ThrLeu: 4.892 ± 0.299
1.35ThrMet: 1.35 ± 0.162
3.366ThrAsn: 3.366 ± 0.237
1.761ThrPro: 1.761 ± 0.161
0.881ThrGln: 0.881 ± 0.127
1.468ThrArg: 1.468 ± 0.191
3.914ThrSer: 3.914 ± 0.213
3.288ThrThr: 3.288 ± 0.27
2.877ThrVal: 2.877 ± 0.229
0.431ThrTrp: 0.431 ± 0.092
2.74ThrTyr: 2.74 ± 0.275
0.0ThrXaa: 0.0 ± 0.0
Val
1.761ValAla: 1.761 ± 0.171
1.252ValCys: 1.252 ± 0.155
2.896ValAsp: 2.896 ± 0.228
2.485ValGlu: 2.485 ± 0.249
2.701ValPhe: 2.701 ± 0.237
1.389ValGly: 1.389 ± 0.181
0.802ValHis: 0.802 ± 0.131
4.716ValIle: 4.716 ± 0.307
4.56ValLys: 4.56 ± 0.259
4.442ValLeu: 4.442 ± 0.318
0.939ValMet: 0.939 ± 0.111
3.894ValAsn: 3.894 ± 0.262
1.566ValPro: 1.566 ± 0.145
0.978ValGln: 0.978 ± 0.13
1.605ValArg: 1.605 ± 0.196
4.247ValSer: 4.247 ± 0.315
2.818ValThr: 2.818 ± 0.246
1.859ValVal: 1.859 ± 0.235
0.215ValTrp: 0.215 ± 0.057
2.916ValTyr: 2.916 ± 0.287
0.0ValXaa: 0.0 ± 0.0
Trp
0.274TrpAla: 0.274 ± 0.064
0.098TrpCys: 0.098 ± 0.051
0.176TrpAsp: 0.176 ± 0.057
0.391TrpGlu: 0.391 ± 0.079
0.509TrpPhe: 0.509 ± 0.095
0.235TrpGly: 0.235 ± 0.066
0.02TrpHis: 0.02 ± 0.018
0.783TrpIle: 0.783 ± 0.119
0.665TrpLys: 0.665 ± 0.115
0.47TrpLeu: 0.47 ± 0.115
0.274TrpMet: 0.274 ± 0.065
0.509TrpAsn: 0.509 ± 0.098
0.176TrpPro: 0.176 ± 0.053
0.078TrpGln: 0.078 ± 0.036
0.372TrpArg: 0.372 ± 0.089
0.646TrpSer: 0.646 ± 0.127
0.254TrpThr: 0.254 ± 0.061
0.294TrpVal: 0.294 ± 0.085
0.02TrpTrp: 0.02 ± 0.019
0.294TrpTyr: 0.294 ± 0.07
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.37TyrAla: 1.37 ± 0.179
1.233TyrCys: 1.233 ± 0.155
3.483TyrAsp: 3.483 ± 0.299
2.642TyrGlu: 2.642 ± 0.21
2.975TyrPhe: 2.975 ± 0.208
2.642TyrGly: 2.642 ± 0.24
0.978TyrHis: 0.978 ± 0.127
6.595TyrIle: 6.595 ± 0.464
4.638TyrLys: 4.638 ± 0.331
4.951TyrLeu: 4.951 ± 0.383
1.155TyrMet: 1.155 ± 0.151
4.521TyrAsn: 4.521 ± 0.284
1.996TyrPro: 1.996 ± 0.196
1.018TyrGln: 1.018 ± 0.165
1.487TyrArg: 1.487 ± 0.148
4.227TyrSer: 4.227 ± 0.29
2.798TyrThr: 2.798 ± 0.249
2.955TyrVal: 2.955 ± 0.259
0.372TyrTrp: 0.372 ± 0.089
2.877TyrTyr: 2.877 ± 0.26
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 165 proteins (51100 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski