Amino acid dipepetide frequency for Yokapox virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.466AlaAla: 1.466 ± 0.201
0.52AlaCys: 0.52 ± 0.084
1.448AlaAsp: 1.448 ± 0.167
1.188AlaGlu: 1.188 ± 0.184
1.299AlaPhe: 1.299 ± 0.137
0.965AlaGly: 0.965 ± 0.144
0.278AlaHis: 0.278 ± 0.076
3.044AlaIle: 3.044 ± 0.228
1.782AlaLys: 1.782 ± 0.199
2.357AlaLeu: 2.357 ± 0.194
0.613AlaMet: 0.613 ± 0.104
1.578AlaAsn: 1.578 ± 0.156
0.594AlaPro: 0.594 ± 0.106
0.278AlaGln: 0.278 ± 0.065
1.151AlaArg: 1.151 ± 0.148
2.32AlaSer: 2.32 ± 0.23
1.578AlaThr: 1.578 ± 0.217
2.06AlaVal: 2.06 ± 0.165
0.148AlaTrp: 0.148 ± 0.058
1.466AlaTyr: 1.466 ± 0.172
0.0AlaXaa: 0.0 ± 0.0
Cys
0.483CysAla: 0.483 ± 0.093
0.538CysCys: 0.538 ± 0.094
1.299CysAsp: 1.299 ± 0.152
0.817CysGlu: 0.817 ± 0.111
0.742CysPhe: 0.742 ± 0.129
0.891CysGly: 0.891 ± 0.11
0.371CysHis: 0.371 ± 0.072
2.524CysIle: 2.524 ± 0.257
1.689CysLys: 1.689 ± 0.182
1.726CysLeu: 1.726 ± 0.197
0.316CysMet: 0.316 ± 0.069
1.708CysAsn: 1.708 ± 0.179
0.613CysPro: 0.613 ± 0.123
0.297CysGln: 0.297 ± 0.069
0.705CysArg: 0.705 ± 0.113
1.615CysSer: 1.615 ± 0.189
1.077CysThr: 1.077 ± 0.164
1.429CysVal: 1.429 ± 0.195
0.167CysTrp: 0.167 ± 0.058
1.318CysTyr: 1.318 ± 0.152
0.0CysXaa: 0.0 ± 0.0
Asp
1.615AspAla: 1.615 ± 0.159
0.835AspCys: 0.835 ± 0.121
5.383AspAsp: 5.383 ± 0.48
4.009AspGlu: 4.009 ± 0.328
2.636AspPhe: 2.636 ± 0.216
2.524AspGly: 2.524 ± 0.211
0.78AspHis: 0.78 ± 0.128
9.782AspIle: 9.782 ± 0.547
5.772AspLys: 5.772 ± 0.328
4.677AspLeu: 4.677 ± 0.246
1.689AspMet: 1.689 ± 0.18
6.051AspAsn: 6.051 ± 0.313
1.392AspPro: 1.392 ± 0.15
0.872AspGln: 0.872 ± 0.129
1.949AspArg: 1.949 ± 0.217
3.601AspSer: 3.601 ± 0.246
3.675AspThr: 3.675 ± 0.371
4.083AspVal: 4.083 ± 0.272
0.52AspTrp: 0.52 ± 0.099
3.174AspTyr: 3.174 ± 0.239
0.0AspXaa: 0.0 ± 0.0
Glu
1.318GluAla: 1.318 ± 0.16
1.151GluCys: 1.151 ± 0.14
3.285GluAsp: 3.285 ± 0.257
3.341GluGlu: 3.341 ± 0.354
2.84GluPhe: 2.84 ± 0.2
1.132GluGly: 1.132 ± 0.156
0.984GluHis: 0.984 ± 0.118
5.438GluIle: 5.438 ± 0.371
4.065GluLys: 4.065 ± 0.335
4.937GluLeu: 4.937 ± 0.285
1.244GluMet: 1.244 ± 0.15
4.306GluAsn: 4.306 ± 0.608
1.522GluPro: 1.522 ± 0.186
1.077GluGln: 1.077 ± 0.153
1.763GluArg: 1.763 ± 0.244
3.749GluSer: 3.749 ± 0.231
2.951GluThr: 2.951 ± 0.21
1.967GluVal: 1.967 ± 0.177
0.408GluTrp: 0.408 ± 0.084
3.545GluTyr: 3.545 ± 0.235
0.0GluXaa: 0.0 ± 0.0
Phe
0.891PheAla: 0.891 ± 0.169
1.077PheCys: 1.077 ± 0.163
2.951PheAsp: 2.951 ± 0.258
2.116PheGlu: 2.116 ± 0.189
2.246PhePhe: 2.246 ± 0.232
2.079PheGly: 2.079 ± 0.19
0.724PheHis: 0.724 ± 0.12
6.107PheIle: 6.107 ± 0.334
3.656PheLys: 3.656 ± 0.277
3.935PheLeu: 3.935 ± 0.291
1.318PheMet: 1.318 ± 0.162
4.213PheAsn: 4.213 ± 0.303
1.373PhePro: 1.373 ± 0.163
0.854PheGln: 0.854 ± 0.117
1.763PheArg: 1.763 ± 0.193
3.935PheSer: 3.935 ± 0.279
3.025PheThr: 3.025 ± 0.227
3.025PheVal: 3.025 ± 0.225
0.408PheTrp: 0.408 ± 0.094
2.302PheTyr: 2.302 ± 0.21
0.0PheXaa: 0.0 ± 0.0
Gly
1.244GlyAla: 1.244 ± 0.144
0.668GlyCys: 0.668 ± 0.106
2.079GlyAsp: 2.079 ± 0.168
2.023GlyGlu: 2.023 ± 0.194
1.782GlyPhe: 1.782 ± 0.159
1.949GlyGly: 1.949 ± 0.202
0.594GlyHis: 0.594 ± 0.097
4.12GlyIle: 4.12 ± 0.326
3.471GlyLys: 3.471 ± 0.255
2.617GlyLeu: 2.617 ± 0.253
0.483GlyMet: 0.483 ± 0.099
3.1GlyAsn: 3.1 ± 0.249
0.705GlyPro: 0.705 ± 0.113
0.65GlyGln: 0.65 ± 0.115
1.411GlyArg: 1.411 ± 0.157
2.691GlySer: 2.691 ± 0.227
1.893GlyThr: 1.893 ± 0.211
2.302GlyVal: 2.302 ± 0.238
0.204GlyTrp: 0.204 ± 0.058
2.116GlyTyr: 2.116 ± 0.207
0.0GlyXaa: 0.0 ± 0.0
His
0.52HisAla: 0.52 ± 0.092
0.52HisCys: 0.52 ± 0.103
0.947HisAsp: 0.947 ± 0.121
0.909HisGlu: 0.909 ± 0.124
0.872HisPhe: 0.872 ± 0.124
0.798HisGly: 0.798 ± 0.116
0.464HisHis: 0.464 ± 0.09
2.58HisIle: 2.58 ± 0.248
1.373HisLys: 1.373 ± 0.186
1.633HisLeu: 1.633 ± 0.156
0.538HisMet: 0.538 ± 0.106
1.355HisAsn: 1.355 ± 0.158
0.687HisPro: 0.687 ± 0.112
0.445HisGln: 0.445 ± 0.086
0.613HisArg: 0.613 ± 0.12
1.355HisSer: 1.355 ± 0.149
0.817HisThr: 0.817 ± 0.146
1.299HisVal: 1.299 ± 0.152
0.186HisTrp: 0.186 ± 0.053
0.761HisTyr: 0.761 ± 0.121
0.0HisXaa: 0.0 ± 0.0
Ile
2.895IleAla: 2.895 ± 0.249
2.005IleCys: 2.005 ± 0.178
8.204IleAsp: 8.204 ± 0.359
5.42IleGlu: 5.42 ± 0.322
5.271IlePhe: 5.271 ± 0.341
4.195IleGly: 4.195 ± 0.315
2.394IleHis: 2.394 ± 0.189
12.9IleIle: 12.9 ± 0.663
9.8IleLys: 9.8 ± 0.509
9.911IleLeu: 9.911 ± 0.503
2.58IleMet: 2.58 ± 0.233
11.099IleAsn: 11.099 ± 0.539
3.601IlePro: 3.601 ± 0.259
2.227IleGln: 2.227 ± 0.235
3.786IleArg: 3.786 ± 0.259
9.559IleSer: 9.559 ± 0.405
6.255IleThr: 6.255 ± 0.424
5.791IleVal: 5.791 ± 0.375
0.594IleTrp: 0.594 ± 0.11
6.793IleTyr: 6.793 ± 0.389
0.0IleXaa: 0.0 ± 0.0
Lys
1.745LysAla: 1.745 ± 0.189
1.967LysCys: 1.967 ± 0.203
5.438LysAsp: 5.438 ± 0.439
4.288LysGlu: 4.288 ± 0.278
3.527LysPhe: 3.527 ± 0.247
2.172LysGly: 2.172 ± 0.208
2.005LysHis: 2.005 ± 0.234
8.891LysIle: 8.891 ± 0.502
7.925LysLys: 7.925 ± 0.467
7.721LysLeu: 7.721 ± 0.408
2.172LysMet: 2.172 ± 0.187
7.332LysAsn: 7.332 ± 0.44
1.967LysPro: 1.967 ± 0.167
2.172LysGln: 2.172 ± 0.233
3.044LysArg: 3.044 ± 0.217
6.199LysSer: 6.199 ± 0.317
4.677LysThr: 4.677 ± 0.307
3.935LysVal: 3.935 ± 0.398
0.872LysTrp: 0.872 ± 0.135
6.236LysTyr: 6.236 ± 0.402
0.0LysXaa: 0.0 ± 0.0
Leu
2.32LeuAla: 2.32 ± 0.22
1.559LeuCys: 1.559 ± 0.182
5.513LeuAsp: 5.513 ± 0.356
4.547LeuGlu: 4.547 ± 0.302
4.844LeuPhe: 4.844 ± 0.341
2.747LeuGly: 2.747 ± 0.256
1.708LeuHis: 1.708 ± 0.22
7.517LeuIle: 7.517 ± 0.376
6.478LeuLys: 6.478 ± 0.354
8.872LeuLeu: 8.872 ± 0.462
2.172LeuMet: 2.172 ± 0.188
6.329LeuAsn: 6.329 ± 0.398
2.895LeuPro: 2.895 ± 0.204
1.912LeuGln: 1.912 ± 0.187
2.561LeuArg: 2.561 ± 0.254
8.204LeuSer: 8.204 ± 0.39
4.844LeuThr: 4.844 ± 0.274
4.603LeuVal: 4.603 ± 0.316
0.408LeuTrp: 0.408 ± 0.103
5.995LeuTyr: 5.995 ± 0.327
0.0LeuXaa: 0.0 ± 0.0
Met
1.151MetAla: 1.151 ± 0.131
0.353MetCys: 0.353 ± 0.079
1.967MetAsp: 1.967 ± 0.197
1.411MetGlu: 1.411 ± 0.163
1.392MetPhe: 1.392 ± 0.161
0.854MetGly: 0.854 ± 0.115
0.223MetHis: 0.223 ± 0.064
2.524MetIle: 2.524 ± 0.208
1.708MetLys: 1.708 ± 0.209
2.636MetLeu: 2.636 ± 0.262
0.65MetMet: 0.65 ± 0.091
2.005MetAsn: 2.005 ± 0.204
0.78MetPro: 0.78 ± 0.141
0.353MetGln: 0.353 ± 0.091
1.095MetArg: 1.095 ± 0.146
1.949MetSer: 1.949 ± 0.184
1.318MetThr: 1.318 ± 0.19
1.132MetVal: 1.132 ± 0.137
0.148MetTrp: 0.148 ± 0.053
1.578MetTyr: 1.578 ± 0.157
0.0MetXaa: 0.0 ± 0.0
Asn
1.93AsnAla: 1.93 ± 0.216
1.355AsnCys: 1.355 ± 0.168
5.717AsnAsp: 5.717 ± 0.378
4.64AsnGlu: 4.64 ± 0.344
3.007AsnPhe: 3.007 ± 0.231
3.192AsnGly: 3.192 ± 0.263
1.392AsnHis: 1.392 ± 0.14
12.139AsnIle: 12.139 ± 0.646
8.445AsnLys: 8.445 ± 0.432
4.993AsnLeu: 4.993 ± 0.305
2.394AsnMet: 2.394 ± 0.23
9.076AsnAsn: 9.076 ± 0.537
2.172AsnPro: 2.172 ± 0.175
1.466AsnGln: 1.466 ± 0.16
2.543AsnArg: 2.543 ± 0.205
5.772AsnSer: 5.772 ± 0.34
5.234AsnThr: 5.234 ± 0.325
4.64AsnVal: 4.64 ± 0.291
0.39AsnTrp: 0.39 ± 0.086
4.547AsnTyr: 4.547 ± 0.289
0.0AsnXaa: 0.0 ± 0.0
Pro
0.705ProAla: 0.705 ± 0.113
0.557ProCys: 0.557 ± 0.104
1.8ProAsp: 1.8 ± 0.182
1.875ProGlu: 1.875 ± 0.183
1.596ProPhe: 1.596 ± 0.161
1.151ProGly: 1.151 ± 0.169
0.52ProHis: 0.52 ± 0.113
3.192ProIle: 3.192 ± 0.29
1.912ProLys: 1.912 ± 0.188
2.766ProLeu: 2.766 ± 0.262
0.65ProMet: 0.65 ± 0.147
1.986ProAsn: 1.986 ± 0.176
1.169ProPro: 1.169 ± 0.25
0.557ProGln: 0.557 ± 0.098
1.151ProArg: 1.151 ± 0.18
2.134ProSer: 2.134 ± 0.217
1.559ProThr: 1.559 ± 0.172
1.782ProVal: 1.782 ± 0.188
0.297ProTrp: 0.297 ± 0.067
1.689ProTyr: 1.689 ± 0.174
0.0ProXaa: 0.0 ± 0.0
Gln
0.427GlnAla: 0.427 ± 0.1
0.464GlnCys: 0.464 ± 0.082
0.891GlnAsp: 0.891 ± 0.158
0.909GlnGlu: 0.909 ± 0.16
0.761GlnPhe: 0.761 ± 0.103
0.631GlnGly: 0.631 ± 0.124
0.594GlnHis: 0.594 ± 0.106
1.633GlnIle: 1.633 ± 0.18
1.541GlnLys: 1.541 ± 0.197
2.079GlnLeu: 2.079 ± 0.199
0.65GlnMet: 0.65 ± 0.088
1.262GlnAsn: 1.262 ± 0.142
0.594GlnPro: 0.594 ± 0.125
0.575GlnGln: 0.575 ± 0.13
0.78GlnArg: 0.78 ± 0.13
1.392GlnSer: 1.392 ± 0.138
1.114GlnThr: 1.114 ± 0.157
0.835GlnVal: 0.835 ± 0.132
0.241GlnTrp: 0.241 ± 0.076
1.633GlnTyr: 1.633 ± 0.15
0.0GlnXaa: 0.0 ± 0.0
Arg
0.613ArgAla: 0.613 ± 0.097
0.872ArgCys: 0.872 ± 0.13
2.042ArgAsp: 2.042 ± 0.199
1.633ArgGlu: 1.633 ± 0.221
2.264ArgPhe: 2.264 ± 0.159
1.411ArgGly: 1.411 ± 0.156
0.909ArgHis: 0.909 ± 0.141
3.378ArgIle: 3.378 ± 0.243
2.617ArgLys: 2.617 ± 0.205
3.397ArgLeu: 3.397 ± 0.216
0.835ArgMet: 0.835 ± 0.107
2.524ArgAsn: 2.524 ± 0.213
0.965ArgPro: 0.965 ± 0.142
0.909ArgGln: 0.909 ± 0.139
1.782ArgArg: 1.782 ± 0.201
2.264ArgSer: 2.264 ± 0.207
1.8ArgThr: 1.8 ± 0.195
1.949ArgVal: 1.949 ± 0.207
0.26ArgTrp: 0.26 ± 0.068
2.227ArgTyr: 2.227 ± 0.238
0.0ArgXaa: 0.0 ± 0.0
Ser
1.856SerAla: 1.856 ± 0.187
1.689SerCys: 1.689 ± 0.208
4.789SerAsp: 4.789 ± 0.317
3.786SerGlu: 3.786 ± 0.344
3.879SerPhe: 3.879 ± 0.288
2.877SerGly: 2.877 ± 0.205
1.188SerHis: 1.188 ± 0.142
9.336SerIle: 9.336 ± 0.4
6.756SerLys: 6.756 ± 0.35
6.775SerLeu: 6.775 ± 0.373
2.19SerMet: 2.19 ± 0.174
5.791SerAsn: 5.791 ± 0.377
2.357SerPro: 2.357 ± 0.242
1.689SerGln: 1.689 ± 0.191
2.543SerArg: 2.543 ± 0.2
6.812SerSer: 6.812 ± 0.397
4.213SerThr: 4.213 ± 0.306
4.844SerVal: 4.844 ± 0.325
0.353SerTrp: 0.353 ± 0.088
4.417SerTyr: 4.417 ± 0.314
0.0SerXaa: 0.0 ± 0.0
Thr
1.578ThrAla: 1.578 ± 0.181
1.411ThrCys: 1.411 ± 0.204
3.656ThrAsp: 3.656 ± 0.259
2.617ThrGlu: 2.617 ± 0.228
2.858ThrPhe: 2.858 ± 0.235
2.153ThrGly: 2.153 ± 0.215
1.485ThrHis: 1.485 ± 0.161
5.253ThrIle: 5.253 ± 0.305
4.585ThrLys: 4.585 ± 0.29
4.77ThrLeu: 4.77 ± 0.288
1.429ThrMet: 1.429 ± 0.162
4.102ThrAsn: 4.102 ± 0.333
2.32ThrPro: 2.32 ± 0.241
1.002ThrGln: 1.002 ± 0.15
2.153ThrArg: 2.153 ± 0.212
4.733ThrSer: 4.733 ± 0.386
3.211ThrThr: 3.211 ± 0.318
3.36ThrVal: 3.36 ± 0.268
0.464ThrTrp: 0.464 ± 0.079
2.914ThrTyr: 2.914 ± 0.232
0.0ThrXaa: 0.0 ± 0.0
Val
1.67ValAla: 1.67 ± 0.142
1.336ValCys: 1.336 ± 0.15
3.322ValAsp: 3.322 ± 0.237
2.933ValGlu: 2.933 ± 0.391
2.933ValPhe: 2.933 ± 0.269
1.522ValGly: 1.522 ± 0.203
0.909ValHis: 0.909 ± 0.128
6.144ValIle: 6.144 ± 0.315
5.141ValLys: 5.141 ± 0.328
4.789ValLeu: 4.789 ± 0.331
1.225ValMet: 1.225 ± 0.154
4.789ValAsn: 4.789 ± 0.337
1.466ValPro: 1.466 ± 0.192
0.705ValGln: 0.705 ± 0.1
1.949ValArg: 1.949 ± 0.192
5.16ValSer: 5.16 ± 0.354
2.84ValThr: 2.84 ± 0.24
3.174ValVal: 3.174 ± 0.261
0.241ValTrp: 0.241 ± 0.063
3.267ValTyr: 3.267 ± 0.282
0.0ValXaa: 0.0 ± 0.0
Trp
0.148TrpAla: 0.148 ± 0.05
0.148TrpCys: 0.148 ± 0.054
0.371TrpAsp: 0.371 ± 0.093
0.39TrpGlu: 0.39 ± 0.087
0.39TrpPhe: 0.39 ± 0.088
0.204TrpGly: 0.204 ± 0.069
0.056TrpHis: 0.056 ± 0.03
0.835TrpIle: 0.835 ± 0.118
0.761TrpLys: 0.761 ± 0.128
0.594TrpLeu: 0.594 ± 0.089
0.186TrpMet: 0.186 ± 0.061
0.501TrpAsn: 0.501 ± 0.114
0.241TrpPro: 0.241 ± 0.065
0.056TrpGln: 0.056 ± 0.042
0.241TrpArg: 0.241 ± 0.069
0.483TrpSer: 0.483 ± 0.089
0.353TrpThr: 0.353 ± 0.077
0.353TrpVal: 0.353 ± 0.078
0.0TrpTrp: 0.0 ± 0.0
0.353TrpTyr: 0.353 ± 0.073
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.559TyrAla: 1.559 ± 0.176
1.299TyrCys: 1.299 ± 0.172
3.916TyrAsp: 3.916 ± 0.259
2.19TyrGlu: 2.19 ± 0.195
2.858TyrPhe: 2.858 ± 0.234
2.654TyrGly: 2.654 ± 0.236
1.095TyrHis: 1.095 ± 0.149
7.944TyrIle: 7.944 ± 0.374
4.807TyrLys: 4.807 ± 0.333
4.844TyrLeu: 4.844 ± 0.334
1.875TyrMet: 1.875 ± 0.209
5.939TyrAsn: 5.939 ± 0.443
1.633TyrPro: 1.633 ± 0.181
0.909TyrGln: 0.909 ± 0.133
1.67TyrArg: 1.67 ± 0.196
4.213TyrSer: 4.213 ± 0.242
3.619TyrThr: 3.619 ± 0.305
2.951TyrVal: 2.951 ± 0.239
0.353TyrTrp: 0.353 ± 0.07
3.452TyrTyr: 3.452 ± 0.266
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 184 proteins (53878 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski