Amino acid dipepetide frequency for Variola virus (isolate Human/India/Ind3/1967) (VARV) (Smallpox virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.071AlaAla: 2.071 ± 0.253
0.998AlaCys: 0.998 ± 0.119
1.775AlaAsp: 1.775 ± 0.162
1.941AlaGlu: 1.941 ± 0.212
1.701AlaPhe: 1.701 ± 0.179
1.442AlaGly: 1.442 ± 0.215
0.462AlaHis: 0.462 ± 0.096
3.735AlaIle: 3.735 ± 0.261
2.903AlaLys: 2.903 ± 0.251
3.236AlaLeu: 3.236 ± 0.263
1.128AlaMet: 1.128 ± 0.151
2.385AlaAsn: 2.385 ± 0.247
1.035AlaPro: 1.035 ± 0.152
0.795AlaGln: 0.795 ± 0.126
1.498AlaArg: 1.498 ± 0.18
3.273AlaSer: 3.273 ± 0.308
2.496AlaThr: 2.496 ± 0.235
2.81AlaVal: 2.81 ± 0.193
0.24AlaTrp: 0.24 ± 0.07
1.609AlaTyr: 1.609 ± 0.168
0.0AlaXaa: 0.0 ± 0.0
Cys
0.832CysAla: 0.832 ± 0.12
0.592CysCys: 0.592 ± 0.109
1.294CysAsp: 1.294 ± 0.149
0.961CysGlu: 0.961 ± 0.139
0.814CysPhe: 0.814 ± 0.118
1.128CysGly: 1.128 ± 0.118
0.37CysHis: 0.37 ± 0.084
2.034CysIle: 2.034 ± 0.241
1.313CysLys: 1.313 ± 0.154
1.701CysLeu: 1.701 ± 0.164
0.592CysMet: 0.592 ± 0.095
1.442CysAsn: 1.442 ± 0.208
0.684CysPro: 0.684 ± 0.137
0.462CysGln: 0.462 ± 0.104
0.851CysArg: 0.851 ± 0.105
1.646CysSer: 1.646 ± 0.199
1.368CysThr: 1.368 ± 0.178
1.424CysVal: 1.424 ± 0.158
0.222CysTrp: 0.222 ± 0.071
1.276CysTyr: 1.276 ± 0.138
0.0CysXaa: 0.0 ± 0.0
Asp
2.681AspAla: 2.681 ± 0.227
0.924AspCys: 0.924 ± 0.132
5.51AspAsp: 5.51 ± 0.434
4.049AspGlu: 4.049 ± 0.24
2.736AspPhe: 2.736 ± 0.22
2.866AspGly: 2.866 ± 0.217
1.091AspHis: 1.091 ± 0.126
7.729AspIle: 7.729 ± 0.423
5.011AspLys: 5.011 ± 0.33
4.807AspLeu: 4.807 ± 0.274
1.609AspMet: 1.609 ± 0.163
4.918AspAsn: 4.918 ± 0.273
1.646AspPro: 1.646 ± 0.16
1.239AspGln: 1.239 ± 0.162
2.348AspArg: 2.348 ± 0.214
4.345AspSer: 4.345 ± 0.276
3.624AspThr: 3.624 ± 0.317
4.456AspVal: 4.456 ± 0.257
0.462AspTrp: 0.462 ± 0.09
3.494AspTyr: 3.494 ± 0.234
0.0AspXaa: 0.0 ± 0.0
Glu
2.108GluAla: 2.108 ± 0.205
1.072GluCys: 1.072 ± 0.122
3.494GluAsp: 3.494 ± 0.263
3.273GluGlu: 3.273 ± 0.329
2.533GluPhe: 2.533 ± 0.174
1.627GluGly: 1.627 ± 0.184
1.202GluHis: 1.202 ± 0.13
4.807GluIle: 4.807 ± 0.352
3.55GluLys: 3.55 ± 0.24
5.306GluLeu: 5.306 ± 0.407
1.331GluMet: 1.331 ± 0.183
3.402GluAsn: 3.402 ± 0.283
1.738GluPro: 1.738 ± 0.227
1.424GluGln: 1.424 ± 0.174
2.441GluArg: 2.441 ± 0.271
3.753GluSer: 3.753 ± 0.243
3.55GluThr: 3.55 ± 0.269
2.552GluVal: 2.552 ± 0.258
0.518GluTrp: 0.518 ± 0.088
3.568GluTyr: 3.568 ± 0.212
0.0GluXaa: 0.0 ± 0.0
Phe
1.59PheAla: 1.59 ± 0.189
0.943PheCys: 0.943 ± 0.133
3.088PheAsp: 3.088 ± 0.279
2.126PheGlu: 2.126 ± 0.182
2.2PhePhe: 2.2 ± 0.202
1.941PheGly: 1.941 ± 0.156
0.721PheHis: 0.721 ± 0.095
4.696PheIle: 4.696 ± 0.269
3.624PheLys: 3.624 ± 0.232
3.994PheLeu: 3.994 ± 0.296
1.424PheMet: 1.424 ± 0.151
3.698PheAsn: 3.698 ± 0.198
1.294PhePro: 1.294 ± 0.155
0.924PheGln: 0.924 ± 0.108
1.812PheArg: 1.812 ± 0.169
3.957PheSer: 3.957 ± 0.283
2.995PheThr: 2.995 ± 0.243
3.014PheVal: 3.014 ± 0.235
0.388PheTrp: 0.388 ± 0.097
2.293PheTyr: 2.293 ± 0.212
0.0PheXaa: 0.0 ± 0.0
Gly
1.867GlyAla: 1.867 ± 0.194
0.869GlyCys: 0.869 ± 0.114
2.533GlyAsp: 2.533 ± 0.192
2.071GlyGlu: 2.071 ± 0.222
1.849GlyPhe: 1.849 ± 0.198
1.997GlyGly: 1.997 ± 0.208
0.869GlyHis: 0.869 ± 0.122
3.642GlyIle: 3.642 ± 0.248
2.958GlyLys: 2.958 ± 0.292
3.069GlyLeu: 3.069 ± 0.213
0.887GlyMet: 0.887 ± 0.136
3.051GlyAsn: 3.051 ± 0.214
0.906GlyPro: 0.906 ± 0.123
0.721GlyGln: 0.721 ± 0.111
1.886GlyArg: 1.886 ± 0.166
2.977GlySer: 2.977 ± 0.279
2.293GlyThr: 2.293 ± 0.23
2.699GlyVal: 2.699 ± 0.25
0.203GlyTrp: 0.203 ± 0.062
2.311GlyTyr: 2.311 ± 0.218
0.0GlyXaa: 0.0 ± 0.0
His
0.832HisAla: 0.832 ± 0.109
0.573HisCys: 0.573 ± 0.113
1.054HisAsp: 1.054 ± 0.137
0.869HisGlu: 0.869 ± 0.123
0.906HisPhe: 0.906 ± 0.12
1.146HisGly: 1.146 ± 0.179
0.555HisHis: 0.555 ± 0.115
2.459HisIle: 2.459 ± 0.234
1.294HisLys: 1.294 ± 0.127
1.867HisLeu: 1.867 ± 0.197
0.592HisMet: 0.592 ± 0.091
1.294HisAsn: 1.294 ± 0.146
0.74HisPro: 0.74 ± 0.098
0.518HisGln: 0.518 ± 0.096
0.943HisArg: 0.943 ± 0.14
1.294HisSer: 1.294 ± 0.142
1.257HisThr: 1.257 ± 0.144
1.368HisVal: 1.368 ± 0.167
0.222HisTrp: 0.222 ± 0.058
0.924HisTyr: 0.924 ± 0.144
0.0HisXaa: 0.0 ± 0.0
Ile
3.476IleAla: 3.476 ± 0.237
1.609IleCys: 1.609 ± 0.164
7.192IleAsp: 7.192 ± 0.364
4.826IleGlu: 4.826 ± 0.321
4.086IlePhe: 4.086 ± 0.278
3.513IleGly: 3.513 ± 0.252
2.052IleHis: 2.052 ± 0.201
8.135IleIle: 8.135 ± 0.456
7.266IleLys: 7.266 ± 0.398
7.895IleLeu: 7.895 ± 0.379
2.219IleMet: 2.219 ± 0.215
7.729IleAsn: 7.729 ± 0.371
3.421IlePro: 3.421 ± 0.26
2.163IleGln: 2.163 ± 0.188
3.901IleArg: 3.901 ± 0.291
8.394IleSer: 8.394 ± 0.355
5.288IleThr: 5.288 ± 0.3
5.75IleVal: 5.75 ± 0.351
0.536IleTrp: 0.536 ± 0.094
4.585IleTyr: 4.585 ± 0.311
0.0IleXaa: 0.0 ± 0.0
Lys
2.071LysAla: 2.071 ± 0.198
1.793LysCys: 1.793 ± 0.185
5.233LysAsp: 5.233 ± 0.324
3.901LysGlu: 3.901 ± 0.236
3.402LysPhe: 3.402 ± 0.271
2.182LysGly: 2.182 ± 0.173
1.701LysHis: 1.701 ± 0.174
6.49LysIle: 6.49 ± 0.385
5.787LysLys: 5.787 ± 0.343
6.749LysLeu: 6.749 ± 0.339
1.941LysMet: 1.941 ± 0.186
5.122LysAsn: 5.122 ± 0.338
2.219LysPro: 2.219 ± 0.21
2.052LysGln: 2.052 ± 0.178
3.79LysArg: 3.79 ± 0.246
5.898LysSer: 5.898 ± 0.333
4.474LysThr: 4.474 ± 0.313
4.253LysVal: 4.253 ± 0.33
0.61LysTrp: 0.61 ± 0.108
4.604LysTyr: 4.604 ± 0.295
0.0LysXaa: 0.0 ± 0.0
Leu
3.254LeuAla: 3.254 ± 0.241
1.553LeuCys: 1.553 ± 0.183
5.658LeuAsp: 5.658 ± 0.359
4.974LeuGlu: 4.974 ± 0.362
4.826LeuPhe: 4.826 ± 0.337
3.162LeuGly: 3.162 ± 0.279
1.812LeuHis: 1.812 ± 0.263
6.49LeuIle: 6.49 ± 0.343
6.416LeuLys: 6.416 ± 0.392
9.097LeuLeu: 9.097 ± 0.535
2.57LeuMet: 2.57 ± 0.182
5.676LeuAsn: 5.676 ± 0.346
3.31LeuPro: 3.31 ± 0.24
2.071LeuGln: 2.071 ± 0.211
3.347LeuArg: 3.347 ± 0.275
7.451LeuSer: 7.451 ± 0.361
5.972LeuThr: 5.972 ± 0.328
5.417LeuVal: 5.417 ± 0.334
0.518LeuTrp: 0.518 ± 0.099
4.715LeuTyr: 4.715 ± 0.269
0.0LeuXaa: 0.0 ± 0.0
Met
1.479MetAla: 1.479 ± 0.165
0.647MetCys: 0.647 ± 0.102
2.052MetAsp: 2.052 ± 0.186
1.59MetGlu: 1.59 ± 0.158
1.257MetPhe: 1.257 ± 0.139
0.906MetGly: 0.906 ± 0.121
0.407MetHis: 0.407 ± 0.07
2.441MetIle: 2.441 ± 0.195
1.664MetLys: 1.664 ± 0.146
2.589MetLeu: 2.589 ± 0.232
0.98MetMet: 0.98 ± 0.125
1.83MetAsn: 1.83 ± 0.18
0.887MetPro: 0.887 ± 0.124
0.499MetGln: 0.499 ± 0.098
1.128MetArg: 1.128 ± 0.144
2.348MetSer: 2.348 ± 0.199
1.72MetThr: 1.72 ± 0.156
1.516MetVal: 1.516 ± 0.178
0.148MetTrp: 0.148 ± 0.049
1.609MetTyr: 1.609 ± 0.158
0.0MetXaa: 0.0 ± 0.0
Asn
2.662AsnAla: 2.662 ± 0.223
1.22AsnCys: 1.22 ± 0.154
4.77AsnAsp: 4.77 ± 0.267
3.716AsnGlu: 3.716 ± 0.254
2.644AsnPhe: 2.644 ± 0.213
3.18AsnGly: 3.18 ± 0.314
1.609AsnHis: 1.609 ± 0.153
7.821AsnIle: 7.821 ± 0.386
6.102AsnLys: 6.102 ± 0.355
4.789AsnLeu: 4.789 ± 0.345
2.052AsnMet: 2.052 ± 0.204
5.769AsnAsn: 5.769 ± 0.324
2.404AsnPro: 2.404 ± 0.197
1.368AsnGln: 1.368 ± 0.132
2.958AsnArg: 2.958 ± 0.217
4.327AsnSer: 4.327 ± 0.277
4.715AsnThr: 4.715 ± 0.297
4.548AsnVal: 4.548 ± 0.243
0.388AsnTrp: 0.388 ± 0.091
3.347AsnTyr: 3.347 ± 0.243
0.0AsnXaa: 0.0 ± 0.0
Pro
1.294ProAla: 1.294 ± 0.165
0.573ProCys: 0.573 ± 0.106
1.775ProAsp: 1.775 ± 0.183
2.256ProGlu: 2.256 ± 0.172
1.516ProPhe: 1.516 ± 0.193
1.35ProGly: 1.35 ± 0.16
0.703ProHis: 0.703 ± 0.114
3.069ProIle: 3.069 ± 0.226
2.015ProLys: 2.015 ± 0.175
2.866ProLeu: 2.866 ± 0.191
0.924ProMet: 0.924 ± 0.126
2.219ProAsn: 2.219 ± 0.165
1.72ProPro: 1.72 ± 0.219
0.666ProGln: 0.666 ± 0.11
1.627ProArg: 1.627 ± 0.193
2.533ProSer: 2.533 ± 0.238
2.367ProThr: 2.367 ± 0.219
2.071ProVal: 2.071 ± 0.182
0.296ProTrp: 0.296 ± 0.066
1.572ProTyr: 1.572 ± 0.192
0.0ProXaa: 0.0 ± 0.0
Gln
0.555GlnAla: 0.555 ± 0.112
0.499GlnCys: 0.499 ± 0.092
1.294GlnAsp: 1.294 ± 0.121
1.22GlnGlu: 1.22 ± 0.138
0.887GlnPhe: 0.887 ± 0.121
0.777GlnGly: 0.777 ± 0.137
0.666GlnHis: 0.666 ± 0.11
1.609GlnIle: 1.609 ± 0.153
1.535GlnLys: 1.535 ± 0.193
2.755GlnLeu: 2.755 ± 0.238
0.703GlnMet: 0.703 ± 0.113
1.461GlnAsn: 1.461 ± 0.198
0.721GlnPro: 0.721 ± 0.146
0.961GlnGln: 0.961 ± 0.149
1.091GlnArg: 1.091 ± 0.141
1.738GlnSer: 1.738 ± 0.188
1.59GlnThr: 1.59 ± 0.194
0.98GlnVal: 0.98 ± 0.133
0.203GlnTrp: 0.203 ± 0.058
1.646GlnTyr: 1.646 ± 0.182
0.0GlnXaa: 0.0 ± 0.0
Arg
1.146ArgAla: 1.146 ± 0.158
1.072ArgCys: 1.072 ± 0.137
2.625ArgAsp: 2.625 ± 0.239
2.163ArgGlu: 2.163 ± 0.247
2.182ArgPhe: 2.182 ± 0.195
1.812ArgGly: 1.812 ± 0.172
1.387ArgHis: 1.387 ± 0.184
3.531ArgIle: 3.531 ± 0.224
2.589ArgLys: 2.589 ± 0.226
4.068ArgLeu: 4.068 ± 0.281
1.165ArgMet: 1.165 ± 0.15
2.903ArgAsn: 2.903 ± 0.249
1.442ArgPro: 1.442 ± 0.189
1.368ArgGln: 1.368 ± 0.19
2.589ArgArg: 2.589 ± 0.239
3.032ArgSer: 3.032 ± 0.24
2.163ArgThr: 2.163 ± 0.226
2.385ArgVal: 2.385 ± 0.172
0.425ArgTrp: 0.425 ± 0.085
2.552ArgTyr: 2.552 ± 0.193
0.0ArgXaa: 0.0 ± 0.0
Ser
2.773SerAla: 2.773 ± 0.252
1.683SerCys: 1.683 ± 0.216
4.585SerAsp: 4.585 ± 0.336
3.531SerGlu: 3.531 ± 0.29
3.809SerPhe: 3.809 ± 0.309
3.069SerGly: 3.069 ± 0.28
1.572SerHis: 1.572 ± 0.203
7.248SerIle: 7.248 ± 0.343
6.175SerLys: 6.175 ± 0.363
7.174SerLeu: 7.174 ± 0.347
2.367SerMet: 2.367 ± 0.232
4.863SerAsn: 4.863 ± 0.291
2.921SerPro: 2.921 ± 0.245
2.052SerGln: 2.052 ± 0.208
3.347SerArg: 3.347 ± 0.268
6.989SerSer: 6.989 ± 0.51
5.029SerThr: 5.029 ± 0.385
5.011SerVal: 5.011 ± 0.335
0.407SerTrp: 0.407 ± 0.078
3.513SerTyr: 3.513 ± 0.242
0.0SerXaa: 0.0 ± 0.0
Thr
2.311ThrAla: 2.311 ± 0.217
1.276ThrCys: 1.276 ± 0.173
4.29ThrAsp: 4.29 ± 0.302
3.402ThrGlu: 3.402 ± 0.26
2.921ThrPhe: 2.921 ± 0.229
2.662ThrGly: 2.662 ± 0.207
1.35ThrHis: 1.35 ± 0.152
6.009ThrIle: 6.009 ± 0.34
4.53ThrLys: 4.53 ± 0.28
5.048ThrLeu: 5.048 ± 0.297
1.886ThrMet: 1.886 ± 0.155
3.846ThrAsn: 3.846 ± 0.268
2.496ThrPro: 2.496 ± 0.285
1.035ThrGln: 1.035 ± 0.156
2.589ThrArg: 2.589 ± 0.182
4.955ThrSer: 4.955 ± 0.312
4.012ThrThr: 4.012 ± 0.39
4.197ThrVal: 4.197 ± 0.295
0.499ThrTrp: 0.499 ± 0.094
2.847ThrTyr: 2.847 ± 0.235
0.0ThrXaa: 0.0 ± 0.0
Val
2.33ValAla: 2.33 ± 0.178
1.535ValCys: 1.535 ± 0.151
3.975ValAsp: 3.975 ± 0.268
3.587ValGlu: 3.587 ± 0.269
3.254ValPhe: 3.254 ± 0.3
1.941ValGly: 1.941 ± 0.208
1.054ValHis: 1.054 ± 0.141
5.861ValIle: 5.861 ± 0.293
4.918ValLys: 4.918 ± 0.288
5.343ValLeu: 5.343 ± 0.324
1.516ValMet: 1.516 ± 0.158
4.511ValAsn: 4.511 ± 0.292
1.886ValPro: 1.886 ± 0.208
1.331ValGln: 1.331 ± 0.158
2.293ValArg: 2.293 ± 0.225
4.974ValSer: 4.974 ± 0.33
3.827ValThr: 3.827 ± 0.324
3.698ValVal: 3.698 ± 0.269
0.24ValTrp: 0.24 ± 0.066
3.55ValTyr: 3.55 ± 0.321
0.0ValXaa: 0.0 ± 0.0
Trp
0.203TrpAla: 0.203 ± 0.059
0.166TrpCys: 0.166 ± 0.049
0.314TrpAsp: 0.314 ± 0.064
0.407TrpGlu: 0.407 ± 0.09
0.462TrpPhe: 0.462 ± 0.086
0.277TrpGly: 0.277 ± 0.072
0.129TrpHis: 0.129 ± 0.053
0.629TrpIle: 0.629 ± 0.104
0.592TrpLys: 0.592 ± 0.09
0.758TrpLeu: 0.758 ± 0.132
0.37TrpMet: 0.37 ± 0.08
0.444TrpAsn: 0.444 ± 0.112
0.24TrpPro: 0.24 ± 0.062
0.166TrpGln: 0.166 ± 0.054
0.166TrpArg: 0.166 ± 0.061
0.407TrpSer: 0.407 ± 0.066
0.444TrpThr: 0.444 ± 0.089
0.333TrpVal: 0.333 ± 0.076
0.0TrpTrp: 0.0 ± 0.0
0.314TrpTyr: 0.314 ± 0.073
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.997TyrAla: 1.997 ± 0.19
1.405TyrCys: 1.405 ± 0.183
3.032TyrAsp: 3.032 ± 0.289
2.385TyrGlu: 2.385 ± 0.199
2.736TyrPhe: 2.736 ± 0.246
2.681TyrGly: 2.681 ± 0.218
0.998TyrHis: 0.998 ± 0.118
5.602TyrIle: 5.602 ± 0.3
4.031TyrLys: 4.031 ± 0.259
5.177TyrLeu: 5.177 ± 0.342
1.424TyrMet: 1.424 ± 0.156
3.827TyrAsn: 3.827 ± 0.339
1.701TyrPro: 1.701 ± 0.151
1.072TyrGln: 1.072 ± 0.155
1.997TyrArg: 1.997 ± 0.203
3.846TyrSer: 3.846 ± 0.212
2.977TyrThr: 2.977 ± 0.235
3.143TyrVal: 3.143 ± 0.201
0.296TyrTrp: 0.296 ± 0.084
2.94TyrTyr: 2.94 ± 0.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 198 proteins (54086 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski