Amino acid dipeptide frequency for Penguinpox virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
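
The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, pairs are assumed to be overlapping and counted within each protein only, and the frequency is assumed to be normalised by the total number of pairs (the exact denominator used for the table is not stated here).

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids (Xaa is left out of this sketch).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are counted inside each protein, never across protein boundaries.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    # Assumption: normalise by the total number of observed pairs.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(STANDARD, repeat=2)
    }

# Toy input for illustration only, not Penguinpox virus data.
toy = ["MKKLLA", "MKLA"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
uniform = 1000.0 / 400
```

With the toy input there are 8 pairs in total, two of which are `MK`, so `freqs["MK"]` is 250 ‰, far above the uniform expectation of 2.5 ‰.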

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
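
As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and compares them with the uniform expectation of 1/400 (2.5 ‰). The helper names are hypothetical, and using 2.5 ‰ as the over/under threshold is an assumption; the page does not state which cut-off drives the red/blue colouring.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille for 400 equally likely dipeptides

def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"

# Values copied from the table (first residue, then second residue).
examples = {"LeuLeu": 11.16, "AlaAsp": 2.547, "TrpTrp": 0.062}
labels = {name: classify(value) for name, value in examples.items()}
fractions = {name: per_mille_to_fraction(value) for name, value in examples.items()}
```

Here LeuLeu (11.16 ‰, a fraction of about 0.01116) comes out as overrepresented and TrpTrp (0.062 ‰) as underrepresented under this assumed threshold.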

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is the frequency in ‰ ± estimated error.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 1.635 ± 0.167 | 0.961 ± 0.113 | 2.547 ± 0.218 | 1.511 ± 0.13 | 1.286 ± 0.115 | 1.298 ± 0.15 | 0.512 ± 0.067 | 4.269 ± 0.278 | 2.26 ± 0.171 | 3.021 ± 0.209 | 0.899 ± 0.109 | 2.597 ± 0.204 | 0.624 ± 0.094 | 0.449 ± 0.073 | 1.124 ± 0.107 | 2.609 ± 0.175 | 2.122 ± 0.145 | 2.659 ± 0.224 | 0.225 ± 0.06 | 1.623 ± 0.164 | 0.0 ± 0.0 |
| Cys | 0.724 ± 0.103 | 0.524 ± 0.084 | 1.198 ± 0.127 | 1.136 ± 0.1 | 1.011 ± 0.123 | 1.111 ± 0.139 | 0.337 ± 0.065 | 2.372 ± 0.184 | 1.922 ± 0.19 | 1.498 ± 0.15 | 0.749 ± 0.089 | 1.71 ± 0.14 | 0.724 ± 0.093 | 0.237 ± 0.051 | 0.936 ± 0.102 | 1.548 ± 0.131 | 1.049 ± 0.108 | 1.173 ± 0.121 | 0.162 ± 0.05 | 1.648 ± 0.143 | 0.0 ± 0.0 |
| Asp | 2.06 ± 0.145 | 1.086 ± 0.122 | 4.544 ± 0.363 | 3.483 ± 0.215 | 2.634 ± 0.209 | 2.172 ± 0.164 | 1.136 ± 0.128 | 8.339 ± 0.316 | 5.48 ± 0.286 | 4.457 ± 0.245 | 1.698 ± 0.153 | 5.006 ± 0.26 | 1.972 ± 0.141 | 0.874 ± 0.103 | 2.359 ± 0.243 | 4.232 ± 0.229 | 3.845 ± 0.217 | 3.77 ± 0.231 | 0.375 ± 0.067 | 3.783 ± 0.216 | 0.0 ± 0.0 |
| Glu | 1.823 ± 0.151 | 1.211 ± 0.11 | 3.495 ± 0.211 | 3.882 ± 0.242 | 2.147 ± 0.174 | 1.997 ± 0.146 | 0.986 ± 0.113 | 5.755 ± 0.339 | 4.731 ± 0.282 | 6.367 ± 0.3 | 1.336 ± 0.119 | 3.845 ± 0.238 | 1.411 ± 0.141 | 1.248 ± 0.137 | 2.16 ± 0.21 | 3.745 ± 0.211 | 3.271 ± 0.165 | 3.108 ± 0.199 | 0.412 ± 0.077 | 3.87 ± 0.231 | 0.0 ± 0.0 |
| Phe | 0.986 ± 0.114 | 0.936 ± 0.111 | 2.347 ± 0.16 | 2.122 ± 0.186 | 1.885 ± 0.179 | 1.548 ± 0.15 | 0.749 ± 0.109 | 4.631 ± 0.271 | 3.533 ± 0.201 | 3.695 ± 0.277 | 1.124 ± 0.112 | 3.308 ± 0.188 | 1.473 ± 0.115 | 0.687 ± 0.098 | 1.386 ± 0.113 | 3.595 ± 0.194 | 2.759 ± 0.187 | 2.547 ± 0.184 | 0.35 ± 0.063 | 2.309 ± 0.172 | 0.0 ± 0.0 |
| Gly | 2.809 ± 0.297 | 0.886 ± 0.102 | 2.409 ± 0.193 | 1.86 ± 0.135 | 1.81 ± 0.168 | 1.498 ± 0.201 | 0.687 ± 0.089 | 4.07 ± 0.243 | 3.595 ± 0.158 | 2.547 ± 0.182 | 0.899 ± 0.1 | 3.233 ± 0.203 | 0.799 ± 0.111 | 0.587 ± 0.076 | 1.685 ± 0.151 | 2.709 ± 0.239 | 1.947 ± 0.158 | 2.147 ± 0.182 | 0.25 ± 0.053 | 2.597 ± 0.183 | 0.0 ± 0.0 |
| His | 0.762 ± 0.094 | 0.449 ± 0.065 | 1.311 ± 0.128 | 0.836 ± 0.119 | 0.699 ± 0.09 | 1.136 ± 0.111 | 0.512 ± 0.077 | 2.272 ± 0.16 | 1.585 ± 0.136 | 1.56 ± 0.145 | 0.387 ± 0.062 | 1.348 ± 0.132 | 0.824 ± 0.102 | 0.412 ± 0.067 | 0.886 ± 0.106 | 1.336 ± 0.121 | 1.049 ± 0.103 | 1.124 ± 0.125 | 0.162 ± 0.044 | 1.511 ± 0.155 | 0.0 ± 0.0 |
| Ile | 3.52 ± 0.198 | 2.122 ± 0.2 | 6.741 ± 0.277 | 6.254 ± 0.262 | 3.982 ± 0.247 | 3.495 ± 0.183 | 2.097 ± 0.17 | 9.275 ± 0.427 | 9.35 ± 0.382 | 9.438 ± 0.438 | 2.622 ± 0.172 | 9.038 ± 0.37 | 3.52 ± 0.203 | 1.773 ± 0.156 | 3.982 ± 0.237 | 8.501 ± 0.299 | 6.317 ± 0.311 | 5.281 ± 0.222 | 0.412 ± 0.077 | 5.093 ± 0.244 | 0.0 ± 0.0 |
| Lys | 2.272 ± 0.177 | 1.735 ± 0.146 | 5.792 ± 0.264 | 5.493 ± 0.265 | 3.433 ± 0.243 | 3.058 ± 0.199 | 2.047 ± 0.152 | 7.428 ± 0.389 | 7.328 ± 0.408 | 7.453 ± 0.361 | 1.985 ± 0.176 | 6.479 ± 0.33 | 2.197 ± 0.18 | 2.247 ± 0.167 | 3.221 ± 0.213 | 5.643 ± 0.28 | 4.744 ± 0.243 | 4.557 ± 0.241 | 0.612 ± 0.097 | 5.867 ± 0.286 | 0.0 ± 0.0 |
| Leu | 3.058 ± 0.215 | 1.985 ± 0.177 | 6.104 ± 0.247 | 6.179 ± 0.322 | 3.87 ± 0.256 | 3.296 ± 0.211 | 2.946 ± 0.304 | 7.965 ± 0.317 | 7.29 ± 0.303 | 11.16 ± 0.54 | 2.135 ± 0.152 | 5.056 ± 0.274 | 3.133 ± 0.182 | 2.21 ± 0.166 | 3.346 ± 0.194 | 7.328 ± 0.305 | 4.507 ± 0.26 | 4.993 ± 0.277 | 0.35 ± 0.061 | 5.093 ± 0.232 | 0.0 ± 0.0 |
| Met | 1.111 ± 0.117 | 0.524 ± 0.082 | 1.835 ± 0.157 | 1.873 ± 0.171 | 1.336 ± 0.116 | 1.024 ± 0.113 | 0.487 ± 0.076 | 1.898 ± 0.148 | 2.222 ± 0.155 | 2.734 ± 0.219 | 0.724 ± 0.083 | 1.461 ± 0.147 | 0.599 ± 0.09 | 0.462 ± 0.06 | 0.824 ± 0.105 | 2.047 ± 0.136 | 1.161 ± 0.123 | 1.423 ± 0.13 | 0.125 ± 0.038 | 1.648 ± 0.175 | 0.0 ± 0.0 |
| Asn | 2.372 ± 0.18 | 1.348 ± 0.118 | 4.282 ± 0.228 | 3.72 ± 0.2 | 2.784 ± 0.248 | 3.071 ± 0.189 | 1.461 ± 0.13 | 9.962 ± 0.456 | 6.379 ± 0.334 | 4.993 ± 0.252 | 2.185 ± 0.19 | 6.454 ± 0.335 | 2.11 ± 0.163 | 1.361 ± 0.135 | 3.233 ± 0.189 | 5.268 ± 0.281 | 5.318 ± 0.278 | 4.244 ± 0.234 | 0.512 ± 0.085 | 3.67 ± 0.234 | 0.0 ± 0.0 |
| Pro | 0.574 ± 0.081 | 0.662 ± 0.096 | 2.16 ± 0.169 | 1.972 ± 0.151 | 1.373 ± 0.14 | 1.161 ± 0.129 | 0.424 ± 0.08 | 3.158 ± 0.18 | 2.322 ± 0.21 | 4.257 ± 0.357 | 0.612 ± 0.088 | 2.11 ± 0.16 | 1.111 ± 0.22 | 0.612 ± 0.094 | 1.211 ± 0.138 | 2.322 ± 0.171 | 1.186 ± 0.14 | 1.798 ± 0.18 | 0.3 ± 0.053 | 1.473 ± 0.137 | 0.0 ± 0.0 |
| Gln | 0.774 ± 0.092 | 0.499 ± 0.075 | 1.236 ± 0.111 | 1.198 ± 0.135 | 0.649 ± 0.097 | 0.724 ± 0.123 | 0.424 ± 0.096 | 1.573 ± 0.144 | 1.548 ± 0.149 | 2.197 ± 0.161 | 0.549 ± 0.069 | 1.011 ± 0.108 | 0.487 ± 0.087 | 0.737 ± 0.11 | 0.799 ± 0.095 | 1.623 ± 0.14 | 0.849 ± 0.091 | 0.974 ± 0.105 | 0.1 ± 0.036 | 1.136 ± 0.142 | 0.0 ± 0.0 |
| Arg | 1.211 ± 0.142 | 1.061 ± 0.108 | 1.96 ± 0.172 | 1.885 ± 0.19 | 1.685 ± 0.156 | 1.698 ± 0.125 | 1.024 ± 0.124 | 3.495 ± 0.192 | 3.108 ± 0.222 | 3.545 ± 0.214 | 0.874 ± 0.104 | 3.183 ± 0.219 | 1.036 ± 0.144 | 0.999 ± 0.123 | 1.885 ± 0.214 | 3.321 ± 0.217 | 2.297 ± 0.176 | 2.147 ± 0.175 | 0.462 ± 0.071 | 2.796 ± 0.209 | 0.0 ± 0.0 |
| Ser | 2.222 ± 0.201 | 1.598 ± 0.118 | 4.432 ± 0.237 | 3.72 ± 0.215 | 3.383 ± 0.217 | 3.271 ± 0.274 | 1.248 ± 0.132 | 8.701 ± 0.301 | 6.866 ± 0.287 | 7.091 ± 0.288 | 1.798 ± 0.143 | 5.368 ± 0.299 | 2.509 ± 0.194 | 1.323 ± 0.132 | 3.421 ± 0.214 | 7.44 ± 0.53 | 3.982 ± 0.265 | 4.656 ± 0.251 | 0.424 ± 0.077 | 4.132 ± 0.243 | 0.0 ± 0.0 |
| Thr | 2.285 ± 0.161 | 1.598 ± 0.178 | 3.658 ± 0.203 | 3.096 ± 0.199 | 2.384 ± 0.155 | 2.322 ± 0.155 | 0.861 ± 0.1 | 5.63 ± 0.26 | 4.319 ± 0.281 | 4.669 ± 0.255 | 1.361 ± 0.142 | 3.733 ± 0.27 | 3.009 ± 0.317 | 0.849 ± 0.119 | 2.285 ± 0.178 | 4.631 ± 0.26 | 2.921 ± 0.209 | 3.608 ± 0.233 | 0.537 ± 0.079 | 2.584 ± 0.198 | 0.0 ± 0.0 |
| Val | 1.76 ± 0.166 | 1.211 ± 0.135 | 3.608 ± 0.207 | 3.058 ± 0.193 | 2.809 ± 0.232 | 1.673 ± 0.168 | 0.949 ± 0.12 | 5.505 ± 0.263 | 4.706 ± 0.248 | 5.643 ± 0.243 | 1.548 ± 0.171 | 4.669 ± 0.342 | 1.723 ± 0.156 | 0.811 ± 0.106 | 2.309 ± 0.167 | 5.043 ± 0.236 | 3.258 ± 0.202 | 3.034 ± 0.232 | 0.25 ± 0.051 | 3.046 ± 0.204 | 0.0 ± 0.0 |
| Trp | 0.162 ± 0.059 | 0.225 ± 0.054 | 0.262 ± 0.059 | 0.437 ± 0.073 | 0.35 ± 0.063 | 0.112 ± 0.043 | 0.062 ± 0.028 | 0.824 ± 0.104 | 0.537 ± 0.089 | 0.649 ± 0.088 | 0.387 ± 0.061 | 0.399 ± 0.068 | 0.187 ± 0.052 | 0.137 ± 0.043 | 0.25 ± 0.065 | 0.3 ± 0.061 | 0.399 ± 0.072 | 0.362 ± 0.07 | 0.062 ± 0.033 | 0.325 ± 0.059 | 0.0 ± 0.0 |
| Tyr | 2.11 ± 0.139 | 1.148 ± 0.111 | 3.383 ± 0.219 | 3.083 ± 0.182 | 2.459 ± 0.185 | 3.358 ± 0.23 | 1.211 ± 0.122 | 5.693 ± 0.245 | 4.507 ± 0.237 | 5.031 ± 0.242 | 1.635 ± 0.148 | 4.719 ± 0.29 | 1.361 ± 0.13 | 1.074 ± 0.105 | 2.372 ± 0.173 | 4.344 ± 0.208 | 3.358 ± 0.216 | 3.009 ± 0.186 | 0.35 ± 0.062 | 3.358 ± 0.244 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 240 proteins (80106 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
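
Bootstrapping at the protein level can be sketched as below: whole proteins are resampled with replacement, the statistic is recomputed for each resample, and the spread of the resampled values is taken as the error. This is only an illustration under assumptions. The function names are hypothetical, the error is assumed to be the standard deviation across replicates, and the frequency is normalised by the total number of pairs, none of which is confirmed by the page.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy input for illustration only, not Penguinpox virus data.
toy = ["MKKLLA", "MKLA", "AKKA", "LLLL"]
error = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins contribute a correspondingly larger error.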

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski