Amino acid dipepetide frequency for Taterapox virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.395AlaAla: 2.395 ± 0.233
1.051AlaCys: 1.051 ± 0.154
1.86AlaAsp: 1.86 ± 0.165
2.067AlaGlu: 2.067 ± 0.198
1.585AlaPhe: 1.585 ± 0.176
1.568AlaGly: 1.568 ± 0.185
0.465AlaHis: 0.465 ± 0.074
3.738AlaIle: 3.738 ± 0.265
2.911AlaLys: 2.911 ± 0.216
3.256AlaLeu: 3.256 ± 0.238
1.085AlaMet: 1.085 ± 0.131
2.532AlaAsn: 2.532 ± 0.25
1.034AlaPro: 1.034 ± 0.133
0.706AlaGln: 0.706 ± 0.109
1.464AlaArg: 1.464 ± 0.195
3.221AlaSer: 3.221 ± 0.282
2.429AlaThr: 2.429 ± 0.237
2.687AlaVal: 2.687 ± 0.173
0.224AlaTrp: 0.224 ± 0.056
1.74AlaTyr: 1.74 ± 0.155
0.0AlaXaa: 0.0 ± 0.0
Cys
0.879CysAla: 0.879 ± 0.121
0.62CysCys: 0.62 ± 0.124
1.516CysAsp: 1.516 ± 0.166
0.947CysGlu: 0.947 ± 0.148
0.672CysPhe: 0.672 ± 0.1
1.016CysGly: 1.016 ± 0.119
0.465CysHis: 0.465 ± 0.082
2.102CysIle: 2.102 ± 0.208
1.258CysLys: 1.258 ± 0.162
1.74CysLeu: 1.74 ± 0.139
0.637CysMet: 0.637 ± 0.092
1.326CysAsn: 1.326 ± 0.17
0.689CysPro: 0.689 ± 0.104
0.465CysGln: 0.465 ± 0.081
0.913CysArg: 0.913 ± 0.115
1.55CysSer: 1.55 ± 0.176
1.413CysThr: 1.413 ± 0.183
1.395CysVal: 1.395 ± 0.166
0.241CysTrp: 0.241 ± 0.074
1.292CysTyr: 1.292 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
2.532AspAla: 2.532 ± 0.208
1.068AspCys: 1.068 ± 0.152
6.219AspAsp: 6.219 ± 0.65
4.479AspGlu: 4.479 ± 0.216
2.808AspPhe: 2.808 ± 0.243
2.86AspGly: 2.86 ± 0.209
1.258AspHis: 1.258 ± 0.152
8.493AspIle: 8.493 ± 0.711
4.961AspLys: 4.961 ± 0.276
4.841AspLeu: 4.841 ± 0.259
1.757AspMet: 1.757 ± 0.189
4.824AspAsn: 4.824 ± 0.307
1.74AspPro: 1.74 ± 0.131
1.189AspGln: 1.189 ± 0.15
2.153AspArg: 2.153 ± 0.178
4.617AspSer: 4.617 ± 0.474
3.755AspThr: 3.755 ± 0.29
4.548AspVal: 4.548 ± 0.272
0.482AspTrp: 0.482 ± 0.104
3.566AspTyr: 3.566 ± 0.246
0.0AspXaa: 0.0 ± 0.0
Glu
2.119GluAla: 2.119 ± 0.159
1.24GluCys: 1.24 ± 0.151
3.669GluAsp: 3.669 ± 0.273
3.325GluGlu: 3.325 ± 0.238
2.412GluPhe: 2.412 ± 0.21
1.705GluGly: 1.705 ± 0.168
1.137GluHis: 1.137 ± 0.137
4.755GluIle: 4.755 ± 0.36
3.652GluLys: 3.652 ± 0.23
5.203GluLeu: 5.203 ± 0.335
1.55GluMet: 1.55 ± 0.17
3.6GluAsn: 3.6 ± 0.293
1.688GluPro: 1.688 ± 0.191
1.482GluGln: 1.482 ± 0.196
2.429GluArg: 2.429 ± 0.21
4.031GluSer: 4.031 ± 0.247
3.497GluThr: 3.497 ± 0.261
2.722GluVal: 2.722 ± 0.214
0.568GluTrp: 0.568 ± 0.102
3.549GluTyr: 3.549 ± 0.248
0.0GluXaa: 0.0 ± 0.0
Phe
1.568PheAla: 1.568 ± 0.174
0.913PheCys: 0.913 ± 0.136
3.066PheAsp: 3.066 ± 0.216
2.016PheGlu: 2.016 ± 0.171
2.033PhePhe: 2.033 ± 0.195
1.947PheGly: 1.947 ± 0.168
0.792PheHis: 0.792 ± 0.105
4.686PheIle: 4.686 ± 0.29
3.342PheLys: 3.342 ± 0.235
4.014PheLeu: 4.014 ± 0.289
1.447PheMet: 1.447 ± 0.138
3.48PheAsn: 3.48 ± 0.241
1.275PhePro: 1.275 ± 0.148
0.792PheGln: 0.792 ± 0.102
1.654PheArg: 1.654 ± 0.176
3.669PheSer: 3.669 ± 0.271
2.911PheThr: 2.911 ± 0.245
2.808PheVal: 2.808 ± 0.254
0.396PheTrp: 0.396 ± 0.09
2.257PheTyr: 2.257 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
1.929GlyAla: 1.929 ± 0.184
0.81GlyCys: 0.81 ± 0.101
2.515GlyAsp: 2.515 ± 0.196
2.084GlyGlu: 2.084 ± 0.189
1.74GlyPhe: 1.74 ± 0.164
2.274GlyGly: 2.274 ± 0.293
0.965GlyHis: 0.965 ± 0.149
3.773GlyIle: 3.773 ± 0.288
2.911GlyLys: 2.911 ± 0.251
2.98GlyLeu: 2.98 ± 0.206
0.947GlyMet: 0.947 ± 0.124
2.894GlyAsn: 2.894 ± 0.224
0.792GlyPro: 0.792 ± 0.109
0.741GlyGln: 0.741 ± 0.115
1.981GlyArg: 1.981 ± 0.172
2.911GlySer: 2.911 ± 0.202
2.308GlyThr: 2.308 ± 0.227
2.653GlyVal: 2.653 ± 0.253
0.258GlyTrp: 0.258 ± 0.071
2.274GlyTyr: 2.274 ± 0.224
0.0GlyXaa: 0.0 ± 0.0
His
0.947HisAla: 0.947 ± 0.131
0.568HisCys: 0.568 ± 0.117
1.171HisAsp: 1.171 ± 0.121
0.999HisGlu: 0.999 ± 0.139
0.861HisPhe: 0.861 ± 0.118
1.12HisGly: 1.12 ± 0.157
0.603HisHis: 0.603 ± 0.111
2.446HisIle: 2.446 ± 0.245
1.275HisLys: 1.275 ± 0.145
2.016HisLeu: 2.016 ± 0.196
0.568HisMet: 0.568 ± 0.091
1.12HisAsn: 1.12 ± 0.142
0.775HisPro: 0.775 ± 0.113
0.517HisGln: 0.517 ± 0.09
0.982HisArg: 0.982 ± 0.12
1.43HisSer: 1.43 ± 0.159
1.24HisThr: 1.24 ± 0.139
1.482HisVal: 1.482 ± 0.179
0.172HisTrp: 0.172 ± 0.052
0.947HisTyr: 0.947 ± 0.135
0.0HisXaa: 0.0 ± 0.0
Ile
3.514IleAla: 3.514 ± 0.228
1.464IleCys: 1.464 ± 0.151
7.942IleAsp: 7.942 ± 0.627
5.065IleGlu: 5.065 ± 0.284
4.152IlePhe: 4.152 ± 0.256
3.618IleGly: 3.618 ± 0.239
2.05IleHis: 2.05 ± 0.222
8.458IleIle: 8.458 ± 0.491
7.321IleLys: 7.321 ± 0.372
8.7IleLeu: 8.7 ± 0.468
2.343IleMet: 2.343 ± 0.212
7.563IleAsn: 7.563 ± 0.367
3.566IlePro: 3.566 ± 0.23
2.102IleGln: 2.102 ± 0.204
3.893IleArg: 3.893 ± 0.291
8.234IleSer: 8.234 ± 0.367
4.979IleThr: 4.979 ± 0.31
5.668IleVal: 5.668 ± 0.365
0.568IleTrp: 0.568 ± 0.089
4.668IleTyr: 4.668 ± 0.303
0.0IleXaa: 0.0 ± 0.0
Lys
1.878LysAla: 1.878 ± 0.185
1.688LysCys: 1.688 ± 0.146
4.979LysAsp: 4.979 ± 0.321
3.876LysGlu: 3.876 ± 0.245
3.084LysPhe: 3.084 ± 0.274
2.05LysGly: 2.05 ± 0.167
1.843LysHis: 1.843 ± 0.169
6.684LysIle: 6.684 ± 0.351
5.495LysLys: 5.495 ± 0.331
6.805LysLeu: 6.805 ± 0.296
2.102LysMet: 2.102 ± 0.193
5.254LysAsn: 5.254 ± 0.318
2.136LysPro: 2.136 ± 0.193
2.05LysGln: 2.05 ± 0.168
3.6LysArg: 3.6 ± 0.243
5.581LysSer: 5.581 ± 0.329
4.255LysThr: 4.255 ± 0.29
3.859LysVal: 3.859 ± 0.271
0.637LysTrp: 0.637 ± 0.117
4.513LysTyr: 4.513 ± 0.263
0.0LysXaa: 0.0 ± 0.0
Leu
3.187LeuAla: 3.187 ± 0.227
1.654LeuCys: 1.654 ± 0.165
5.65LeuAsp: 5.65 ± 0.32
5.099LeuGlu: 5.099 ± 0.375
4.824LeuPhe: 4.824 ± 0.401
3.204LeuGly: 3.204 ± 0.264
1.998LeuHis: 1.998 ± 0.246
6.718LeuIle: 6.718 ± 0.354
5.96LeuLys: 5.96 ± 0.325
9.044LeuLeu: 9.044 ± 0.437
2.687LeuMet: 2.687 ± 0.22
5.358LeuAsn: 5.358 ± 0.296
3.359LeuPro: 3.359 ± 0.261
1.878LeuGln: 1.878 ± 0.161
3.549LeuArg: 3.549 ± 0.282
7.7LeuSer: 7.7 ± 0.39
5.599LeuThr: 5.599 ± 0.289
5.409LeuVal: 5.409 ± 0.325
0.465LeuTrp: 0.465 ± 0.101
4.634LeuTyr: 4.634 ± 0.282
0.0LeuXaa: 0.0 ± 0.0
Met
1.533MetAla: 1.533 ± 0.166
0.689MetCys: 0.689 ± 0.106
2.084MetAsp: 2.084 ± 0.178
1.637MetGlu: 1.637 ± 0.176
1.24MetPhe: 1.24 ± 0.14
0.947MetGly: 0.947 ± 0.149
0.465MetHis: 0.465 ± 0.086
2.601MetIle: 2.601 ± 0.252
1.809MetLys: 1.809 ± 0.176
2.55MetLeu: 2.55 ± 0.196
0.965MetMet: 0.965 ± 0.142
1.878MetAsn: 1.878 ± 0.19
0.982MetPro: 0.982 ± 0.118
0.534MetGln: 0.534 ± 0.089
1.223MetArg: 1.223 ± 0.138
2.343MetSer: 2.343 ± 0.21
1.688MetThr: 1.688 ± 0.158
1.464MetVal: 1.464 ± 0.158
0.241MetTrp: 0.241 ± 0.066
1.602MetTyr: 1.602 ± 0.165
0.0MetXaa: 0.0 ± 0.0
Asn
2.842AsnAla: 2.842 ± 0.24
1.258AsnCys: 1.258 ± 0.151
4.565AsnAsp: 4.565 ± 0.293
3.687AsnGlu: 3.687 ± 0.259
2.429AsnPhe: 2.429 ± 0.227
3.273AsnGly: 3.273 ± 0.294
1.585AsnHis: 1.585 ± 0.175
7.787AsnIle: 7.787 ± 0.41
5.599AsnLys: 5.599 ± 0.261
4.841AsnLeu: 4.841 ± 0.335
2.188AsnMet: 2.188 ± 0.199
5.978AsnAsn: 5.978 ± 0.385
2.36AsnPro: 2.36 ± 0.209
1.344AsnGln: 1.344 ± 0.143
2.98AsnArg: 2.98 ± 0.222
4.41AsnSer: 4.41 ± 0.26
4.358AsnThr: 4.358 ± 0.295
4.41AsnVal: 4.41 ± 0.247
0.431AsnTrp: 0.431 ± 0.094
3.411AsnTyr: 3.411 ± 0.289
0.0AsnXaa: 0.0 ± 0.0
Pro
1.12ProAla: 1.12 ± 0.15
0.568ProCys: 0.568 ± 0.104
1.895ProAsp: 1.895 ± 0.165
2.326ProGlu: 2.326 ± 0.166
1.55ProPhe: 1.55 ± 0.177
1.413ProGly: 1.413 ± 0.16
0.689ProHis: 0.689 ± 0.116
3.049ProIle: 3.049 ± 0.212
1.912ProLys: 1.912 ± 0.17
2.929ProLeu: 2.929 ± 0.231
1.016ProMet: 1.016 ± 0.136
2.084ProAsn: 2.084 ± 0.175
1.499ProPro: 1.499 ± 0.236
0.758ProGln: 0.758 ± 0.129
1.671ProArg: 1.671 ± 0.263
2.567ProSer: 2.567 ± 0.212
2.326ProThr: 2.326 ± 0.217
1.981ProVal: 1.981 ± 0.194
0.258ProTrp: 0.258 ± 0.062
1.568ProTyr: 1.568 ± 0.161
0.0ProXaa: 0.0 ± 0.0
Gln
0.568GlnAla: 0.568 ± 0.102
0.551GlnCys: 0.551 ± 0.097
1.258GlnAsp: 1.258 ± 0.133
1.223GlnGlu: 1.223 ± 0.136
0.861GlnPhe: 0.861 ± 0.127
0.655GlnGly: 0.655 ± 0.106
0.689GlnHis: 0.689 ± 0.109
1.602GlnIle: 1.602 ± 0.181
1.516GlnLys: 1.516 ± 0.181
2.55GlnLeu: 2.55 ± 0.22
0.689GlnMet: 0.689 ± 0.119
1.43GlnAsn: 1.43 ± 0.161
0.758GlnPro: 0.758 ± 0.18
0.965GlnGln: 0.965 ± 0.145
1.189GlnArg: 1.189 ± 0.154
1.671GlnSer: 1.671 ± 0.172
1.413GlnThr: 1.413 ± 0.183
0.861GlnVal: 0.861 ± 0.114
0.224GlnTrp: 0.224 ± 0.059
1.585GlnTyr: 1.585 ± 0.177
0.0GlnXaa: 0.0 ± 0.0
Arg
1.223ArgAla: 1.223 ± 0.165
1.016ArgCys: 1.016 ± 0.13
2.567ArgAsp: 2.567 ± 0.209
2.171ArgGlu: 2.171 ± 0.26
2.188ArgPhe: 2.188 ± 0.215
1.929ArgGly: 1.929 ± 0.2
1.223ArgHis: 1.223 ± 0.143
3.549ArgIle: 3.549 ± 0.255
2.618ArgLys: 2.618 ± 0.206
4.083ArgLeu: 4.083 ± 0.246
1.016ArgMet: 1.016 ± 0.125
3.032ArgAsn: 3.032 ± 0.257
1.361ArgPro: 1.361 ± 0.189
1.447ArgGln: 1.447 ± 0.165
2.601ArgArg: 2.601 ± 0.278
3.17ArgSer: 3.17 ± 0.256
2.239ArgThr: 2.239 ± 0.19
2.532ArgVal: 2.532 ± 0.221
0.362ArgTrp: 0.362 ± 0.086
2.567ArgTyr: 2.567 ± 0.21
0.0ArgXaa: 0.0 ± 0.0
Ser
2.929SerAla: 2.929 ± 0.257
1.464SerCys: 1.464 ± 0.182
5.426SerAsp: 5.426 ± 0.5
3.6SerGlu: 3.6 ± 0.267
3.6SerPhe: 3.6 ± 0.254
3.204SerGly: 3.204 ± 0.24
1.482SerHis: 1.482 ± 0.171
7.631SerIle: 7.631 ± 0.416
5.978SerLys: 5.978 ± 0.314
6.753SerLeu: 6.753 ± 0.334
2.463SerMet: 2.463 ± 0.237
4.858SerAsn: 4.858 ± 0.267
2.911SerPro: 2.911 ± 0.237
1.86SerGln: 1.86 ± 0.205
3.48SerArg: 3.48 ± 0.283
6.753SerSer: 6.753 ± 0.492
4.841SerThr: 4.841 ± 0.426
5.065SerVal: 5.065 ± 0.296
0.396SerTrp: 0.396 ± 0.08
3.48SerTyr: 3.48 ± 0.248
0.0SerXaa: 0.0 ± 0.0
Thr
2.343ThrAla: 2.343 ± 0.208
1.344ThrCys: 1.344 ± 0.17
4.117ThrAsp: 4.117 ± 0.278
3.394ThrGlu: 3.394 ± 0.235
2.756ThrPhe: 2.756 ± 0.209
2.412ThrGly: 2.412 ± 0.21
1.309ThrHis: 1.309 ± 0.161
6.064ThrIle: 6.064 ± 0.375
4.324ThrLys: 4.324 ± 0.246
4.996ThrLeu: 4.996 ± 0.269
1.74ThrMet: 1.74 ± 0.167
3.79ThrAsn: 3.79 ± 0.227
2.395ThrPro: 2.395 ± 0.294
0.965ThrGln: 0.965 ± 0.129
2.515ThrArg: 2.515 ± 0.182
4.961ThrSer: 4.961 ± 0.279
3.824ThrThr: 3.824 ± 0.312
3.979ThrVal: 3.979 ± 0.313
0.448ThrTrp: 0.448 ± 0.079
2.774ThrTyr: 2.774 ± 0.205
0.0ThrXaa: 0.0 ± 0.0
Val
2.343ValAla: 2.343 ± 0.185
1.585ValCys: 1.585 ± 0.142
4.186ValAsp: 4.186 ± 0.233
3.549ValGlu: 3.549 ± 0.277
3.29ValPhe: 3.29 ± 0.279
1.774ValGly: 1.774 ± 0.184
1.068ValHis: 1.068 ± 0.143
5.478ValIle: 5.478 ± 0.315
4.789ValLys: 4.789 ± 0.304
5.03ValLeu: 5.03 ± 0.309
1.464ValMet: 1.464 ± 0.153
4.289ValAsn: 4.289 ± 0.257
1.86ValPro: 1.86 ± 0.175
1.24ValGln: 1.24 ± 0.144
2.463ValArg: 2.463 ± 0.193
4.892ValSer: 4.892 ± 0.313
3.704ValThr: 3.704 ± 0.264
3.721ValVal: 3.721 ± 0.275
0.31ValTrp: 0.31 ± 0.073
3.583ValTyr: 3.583 ± 0.272
0.0ValXaa: 0.0 ± 0.0
Trp
0.172TrpAla: 0.172 ± 0.055
0.189TrpCys: 0.189 ± 0.047
0.327TrpAsp: 0.327 ± 0.06
0.362TrpGlu: 0.362 ± 0.079
0.413TrpPhe: 0.413 ± 0.087
0.276TrpGly: 0.276 ± 0.065
0.172TrpHis: 0.172 ± 0.056
0.586TrpIle: 0.586 ± 0.111
0.724TrpLys: 0.724 ± 0.108
0.758TrpLeu: 0.758 ± 0.127
0.362TrpMet: 0.362 ± 0.07
0.482TrpAsn: 0.482 ± 0.096
0.31TrpPro: 0.31 ± 0.071
0.121TrpGln: 0.121 ± 0.052
0.224TrpArg: 0.224 ± 0.079
0.448TrpSer: 0.448 ± 0.084
0.431TrpThr: 0.431 ± 0.079
0.379TrpVal: 0.379 ± 0.077
0.0TrpTrp: 0.0 ± 0.0
0.345TrpTyr: 0.345 ± 0.071
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.998TyrAla: 1.998 ± 0.187
1.395TyrCys: 1.395 ± 0.151
3.135TyrAsp: 3.135 ± 0.238
2.429TyrGlu: 2.429 ± 0.206
2.567TyrPhe: 2.567 ± 0.172
2.498TyrGly: 2.498 ± 0.21
0.982TyrHis: 0.982 ± 0.114
5.668TyrIle: 5.668 ± 0.304
4.031TyrLys: 4.031 ± 0.237
4.892TyrLeu: 4.892 ± 0.328
1.43TyrMet: 1.43 ± 0.155
3.962TyrAsn: 3.962 ± 0.297
1.688TyrPro: 1.688 ± 0.16
1.103TyrGln: 1.103 ± 0.15
1.929TyrArg: 1.929 ± 0.176
4.014TyrSer: 4.014 ± 0.295
3.239TyrThr: 3.239 ± 0.268
2.997TyrVal: 2.997 ± 0.172
0.396TyrTrp: 0.396 ± 0.085
2.877TyrTyr: 2.877 ± 0.228
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 220 proteins (58050 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski