Amino acid dipeptide frequency for Turkeypox virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
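As a rough illustration of what such a frequency is, the sketch below counts overlapping pairs of adjacent residues and normalises by the total number of pairs. It is a minimal sketch under assumptions: the function name `dipeptide_frequencies` and the toy sequences are made up for this example, one-letter codes are used instead of the three-letter codes shown in the table, and the exact counting procedure used by the database (for instance how non-standard residues are mapped to Xaa) is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and frequencies are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        # Pairs (0,1), (1,2), ... never span two different proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy "proteome" in one-letter code, for illustration only.
toy_proteome = ["MKKL", "MKL"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

Counting pairs per protein, rather than on one concatenated string, avoids creating artificial dipeptides at protein boundaries.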

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
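The arithmetic behind the expected value and the unit conversion can be sketched as follows. The three example values are taken from the table; the helper names and the plain comparison against the uniform expectation are assumptions, since the page does not say whether the colouring also takes the error estimate into account.

```python
N_AMINO_ACIDS = 20
# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / N_AMINO_ACIDS ** 2  # 2.5


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values copied from the table (in per mille).
examples = {"AspIle": 9.88, "AlaAla": 1.18, "TrpHis": 0.018}
for name, value in examples.items():
    fraction = per_mille_to_fraction(value)
    print(f"{name}: {value} per mille = {fraction:.6f} ({classify(value)})")
```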

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell shows frequency ± error in ‰.

| First residue | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 1.18 ± 0.16 | 0.835 ± 0.12 | 2.561 ± 0.288 | 1.344 ± 0.173 | 1.162 ± 0.133 | 1.09 ± 0.173 | 0.581 ± 0.098 | 4.195 ± 0.259 | 2.07 ± 0.213 | 2.997 ± 0.227 | 0.817 ± 0.118 | 2.688 ± 0.236 | 0.963 ± 0.165 | 0.509 ± 0.105 | 1.635 ± 0.198 | 2.724 ± 0.233 | 1.816 ± 0.182 | 2.47 ± 0.274 | 0.218 ± 0.079 | 1.707 ± 0.21 | 0.0 ± 0.0 |
| Cys | 0.745 ± 0.105 | 0.563 ± 0.14 | 1.489 ± 0.166 | 1.072 ± 0.138 | 0.89 ± 0.106 | 1.053 ± 0.154 | 0.527 ± 0.104 | 2.379 ± 0.21 | 1.598 ± 0.18 | 1.743 ± 0.219 | 0.672 ± 0.107 | 1.798 ± 0.174 | 0.581 ± 0.116 | 0.291 ± 0.078 | 0.854 ± 0.142 | 1.816 ± 0.204 | 0.926 ± 0.14 | 1.144 ± 0.144 | 0.182 ± 0.065 | 1.526 ± 0.189 | 0.0 ± 0.0 |
| Asp | 2.361 ± 0.245 | 1.126 ± 0.148 | 4.45 ± 0.336 | 3.215 ± 0.233 | 2.561 ± 0.226 | 2.597 ± 0.213 | 0.835 ± 0.134 | 9.88 ± 0.453 | 5.14 ± 0.33 | 4.068 ± 0.296 | 1.925 ± 0.194 | 5.503 ± 0.288 | 1.635 ± 0.192 | 0.781 ± 0.125 | 2.306 ± 0.207 | 4.577 ± 0.279 | 4.195 ± 0.302 | 3.669 ± 0.284 | 0.436 ± 0.078 | 3.287 ± 0.244 | 0.0 ± 0.0 |
| Glu | 1.235 ± 0.156 | 0.926 ± 0.131 | 3.196 ± 0.249 | 2.942 ± 0.237 | 2.688 ± 0.258 | 1.271 ± 0.167 | 1.053 ± 0.124 | 5.485 ± 0.383 | 4.322 ± 0.283 | 5.249 ± 0.316 | 1.235 ± 0.122 | 3.977 ± 0.238 | 1.616 ± 0.237 | 1.289 ± 0.157 | 1.834 ± 0.209 | 3.759 ± 0.25 | 2.869 ± 0.246 | 1.798 ± 0.169 | 0.527 ± 0.121 | 3.995 ± 0.293 | 0.0 ± 0.0 |
| Phe | 1.144 ± 0.13 | 0.944 ± 0.128 | 2.325 ± 0.244 | 2.125 ± 0.249 | 1.925 ± 0.231 | 1.762 ± 0.207 | 0.872 ± 0.137 | 4.958 ± 0.314 | 3.523 ± 0.264 | 4.104 ± 0.28 | 1.344 ± 0.174 | 3.487 ± 0.228 | 1.38 ± 0.165 | 0.763 ± 0.102 | 1.489 ± 0.163 | 4.159 ± 0.29 | 2.706 ± 0.227 | 2.506 ± 0.212 | 0.381 ± 0.086 | 2.543 ± 0.214 | 0.0 ± 0.0 |
| Gly | 2.216 ± 0.373 | 0.799 ± 0.115 | 2.089 ± 0.191 | 1.816 ± 0.162 | 1.707 ± 0.181 | 1.58 ± 0.185 | 0.563 ± 0.098 | 4.958 ± 0.309 | 3.015 ± 0.255 | 2.688 ± 0.207 | 0.835 ± 0.146 | 3.36 ± 0.211 | 0.817 ± 0.11 | 0.581 ± 0.098 | 1.689 ± 0.155 | 2.761 ± 0.198 | 1.598 ± 0.169 | 1.907 ± 0.219 | 0.236 ± 0.068 | 2.597 ± 0.19 | 0.0 ± 0.0 |
| His | 0.636 ± 0.103 | 0.454 ± 0.088 | 1.217 ± 0.152 | 0.944 ± 0.124 | 0.708 ± 0.124 | 1.017 ± 0.171 | 0.436 ± 0.102 | 2.506 ± 0.212 | 1.289 ± 0.147 | 1.798 ± 0.143 | 0.581 ± 0.103 | 1.562 ± 0.187 | 0.49 ± 0.097 | 0.291 ± 0.074 | 1.053 ± 0.136 | 1.017 ± 0.147 | 0.908 ± 0.129 | 1.271 ± 0.192 | 0.182 ± 0.054 | 1.144 ± 0.157 | 0.0 ± 0.0 |
| Ile | 4.032 ± 0.295 | 2.652 ± 0.244 | 8.427 ± 0.363 | 5.775 ± 0.366 | 4.649 ± 0.345 | 3.378 ± 0.22 | 2.143 ± 0.212 | 10.879 ± 0.475 | 8.863 ± 0.422 | 10.534 ± 0.429 | 2.797 ± 0.263 | 9.171 ± 0.405 | 3.741 ± 0.287 | 2.325 ± 0.191 | 4.885 ± 0.337 | 9.335 ± 0.423 | 6.611 ± 0.313 | 5.848 ± 0.326 | 0.599 ± 0.1 | 5.739 ± 0.392 | 0.0 ± 0.0 |
| Lys | 2.325 ± 0.194 | 1.707 ± 0.175 | 5.43 ± 0.405 | 4.922 ± 0.272 | 3.287 ± 0.251 | 2.579 ± 0.22 | 1.889 ± 0.22 | 8.5 ± 0.447 | 6.32 ± 0.398 | 7.664 ± 0.335 | 1.725 ± 0.181 | 6.048 ± 0.33 | 2.125 ± 0.231 | 2.07 ± 0.195 | 2.997 ± 0.256 | 5.031 ± 0.344 | 4.032 ± 0.286 | 3.233 ± 0.23 | 0.581 ± 0.1 | 5.775 ± 0.336 | 0.0 ± 0.0 |
| Leu | 2.742 ± 0.262 | 2.234 ± 0.227 | 6.356 ± 0.341 | 5.485 ± 0.389 | 4.395 ± 0.343 | 3.705 ± 0.279 | 2.434 ± 0.379 | 7.864 ± 0.396 | 5.83 ± 0.374 | 11.569 ± 0.712 | 2.107 ± 0.17 | 5.158 ± 0.274 | 3.106 ± 0.241 | 1.889 ± 0.196 | 3.451 ± 0.28 | 8.064 ± 0.454 | 4.195 ± 0.332 | 5.503 ± 0.322 | 0.345 ± 0.079 | 5.703 ± 0.344 | 0.0 ± 0.0 |
| Met | 1.235 ± 0.15 | 0.4 ± 0.08 | 1.798 ± 0.15 | 1.78 ± 0.197 | 1.762 ± 0.213 | 0.999 ± 0.107 | 0.581 ± 0.109 | 2.07 ± 0.21 | 1.743 ± 0.172 | 2.815 ± 0.205 | 0.672 ± 0.127 | 1.616 ± 0.159 | 0.963 ± 0.136 | 0.509 ± 0.086 | 0.763 ± 0.142 | 1.889 ± 0.198 | 1.09 ± 0.134 | 1.489 ± 0.172 | 0.145 ± 0.043 | 1.743 ± 0.173 | 0.0 ± 0.0 |
| Asn | 2.524 ± 0.256 | 1.38 ± 0.156 | 4.776 ± 0.31 | 3.65 ± 0.276 | 3.069 ± 0.274 | 2.96 ± 0.199 | 1.344 ± 0.151 | 11.46 ± 0.573 | 6.992 ± 0.346 | 5.194 ± 0.32 | 2.47 ± 0.252 | 7.918 ± 0.54 | 1.78 ± 0.193 | 1.271 ± 0.145 | 3.36 ± 0.241 | 5.14 ± 0.333 | 5.031 ± 0.304 | 3.596 ± 0.246 | 0.4 ± 0.079 | 3.941 ± 0.225 | 0.0 ± 0.0 |
| Pro | 0.981 ± 0.12 | 0.69 ± 0.123 | 2.252 ± 0.219 | 1.961 ± 0.212 | 1.362 ± 0.158 | 1.18 ± 0.146 | 0.49 ± 0.094 | 3.142 ± 0.228 | 2.034 ± 0.245 | 3.414 ± 0.382 | 0.745 ± 0.103 | 2.125 ± 0.228 | 0.999 ± 0.204 | 0.545 ± 0.114 | 1.235 ± 0.152 | 2.379 ± 0.218 | 1.38 ± 0.182 | 1.925 ± 0.214 | 0.272 ± 0.067 | 1.635 ± 0.197 | 0.0 ± 0.0 |
| Gln | 0.581 ± 0.115 | 0.454 ± 0.084 | 0.908 ± 0.14 | 0.872 ± 0.131 | 0.563 ± 0.097 | 0.527 ± 0.106 | 0.509 ± 0.091 | 1.689 ± 0.157 | 1.362 ± 0.143 | 2.379 ± 0.231 | 0.363 ± 0.082 | 1.253 ± 0.178 | 0.599 ± 0.138 | 0.708 ± 0.143 | 0.872 ± 0.137 | 1.417 ± 0.177 | 0.89 ± 0.123 | 0.817 ± 0.119 | 0.163 ± 0.055 | 1.308 ± 0.154 | 0.0 ± 0.0 |
| Arg | 1.09 ± 0.149 | 1.144 ± 0.14 | 2.234 ± 0.205 | 1.852 ± 0.189 | 1.816 ± 0.177 | 1.58 ± 0.215 | 0.981 ± 0.161 | 3.85 ± 0.301 | 3.342 ± 0.268 | 4.159 ± 0.267 | 0.817 ± 0.124 | 2.978 ± 0.215 | 1.18 ± 0.179 | 0.745 ± 0.109 | 2.234 ± 0.262 | 2.724 ± 0.216 | 2.252 ± 0.214 | 2.179 ± 0.19 | 0.218 ± 0.05 | 2.924 ± 0.24 | 0.0 ± 0.0 |
| Ser | 2.688 ± 0.232 | 1.471 ± 0.157 | 4.558 ± 0.307 | 3.215 ± 0.205 | 3.632 ± 0.279 | 2.906 ± 0.221 | 1.38 ± 0.153 | 8.954 ± 0.472 | 6.302 ± 0.378 | 6.502 ± 0.437 | 1.925 ± 0.197 | 5.648 ± 0.337 | 2.506 ± 0.256 | 1.144 ± 0.14 | 3.124 ± 0.241 | 6.629 ± 0.551 | 4.431 ± 0.306 | 4.177 ± 0.286 | 0.4 ± 0.09 | 4.322 ± 0.261 | 0.0 ± 0.0 |
| Thr | 1.907 ± 0.193 | 1.326 ± 0.174 | 3.523 ± 0.297 | 2.797 ± 0.235 | 2.107 ± 0.202 | 2.506 ± 0.185 | 0.726 ± 0.112 | 6.139 ± 0.315 | 4.486 ± 0.251 | 4.486 ± 0.267 | 1.289 ± 0.179 | 3.669 ± 0.267 | 2.652 ± 0.323 | 0.835 ± 0.123 | 2.415 ± 0.173 | 3.905 ± 0.295 | 3.36 ± 0.279 | 3.596 ± 0.249 | 0.49 ± 0.089 | 2.67 ± 0.217 | 0.0 ± 0.0 |
| Val | 1.58 ± 0.158 | 1.271 ± 0.177 | 3.087 ± 0.286 | 2.706 ± 0.226 | 2.761 ± 0.223 | 1.743 ± 0.181 | 0.781 ± 0.12 | 6.248 ± 0.338 | 5.013 ± 0.303 | 5.285 ± 0.246 | 1.435 ± 0.156 | 4.413 ± 0.273 | 1.489 ± 0.168 | 0.708 ± 0.118 | 1.78 ± 0.183 | 4.141 ± 0.298 | 2.815 ± 0.238 | 2.942 ± 0.208 | 0.291 ± 0.071 | 3.269 ± 0.28 | 0.0 ± 0.0 |
| Trp | 0.163 ± 0.059 | 0.145 ± 0.054 | 0.272 ± 0.063 | 0.327 ± 0.076 | 0.545 ± 0.107 | 0.182 ± 0.06 | 0.018 ± 0.018 | 0.654 ± 0.111 | 0.327 ± 0.081 | 0.781 ± 0.126 | 0.363 ± 0.07 | 0.454 ± 0.094 | 0.163 ± 0.059 | 0.109 ± 0.045 | 0.272 ± 0.059 | 0.363 ± 0.08 | 0.327 ± 0.073 | 0.509 ± 0.106 | 0.036 ± 0.024 | 0.454 ± 0.095 | 0.0 ± 0.0 |
| Tyr | 2.198 ± 0.164 | 1.326 ± 0.148 | 3.596 ± 0.253 | 2.47 ± 0.283 | 2.851 ± 0.235 | 3.196 ± 0.304 | 1.362 ± 0.181 | 6.593 ± 0.338 | 4.667 ± 0.284 | 4.704 ± 0.305 | 1.925 ± 0.176 | 5.43 ± 0.32 | 1.961 ± 0.179 | 0.872 ± 0.115 | 2.052 ± 0.17 | 4.086 ± 0.261 | 3.451 ± 0.221 | 3.178 ± 0.259 | 0.363 ± 0.084 | 3.36 ± 0.256 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 170 proteins (55063 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
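Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic for each resampled proteome. The sketch below illustrates the idea for a single dipeptide. It rests on assumptions: the function names, the toy sequences in one-letter code and the fixed seed are invented for this example, and taking the standard deviation across replicates as the reported error is a guess, as the page does not specify which spread measure is used.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency (in per mille).

    Whole proteins are resampled with replacement; the spread is taken as
    the standard deviation across replicates (an assumption).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_replicates):
        # One replicate: draw as many proteins as the proteome contains.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Made-up toy sequences, for illustration only.
toy_proteome = ["MKKL", "MKL", "AKKA", "MLLK"]
print(f"KK error: {bootstrap_error(toy_proteome, 'KK'):.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.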

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski