Amino acid dipepetide frequency for White spot syndrome virus (isolate Shrimp/China/Tongan/1996) (WSSV) (White spot bacilliform virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.746AlaAla: 6.746 ± 0.585
0.869AlaCys: 0.869 ± 0.089
2.326AlaAsp: 2.326 ± 0.156
3.389AlaGlu: 3.389 ± 0.176
2.439AlaPhe: 2.439 ± 0.153
2.777AlaGly: 2.777 ± 0.354
0.99AlaHis: 0.99 ± 0.086
3.638AlaIle: 3.638 ± 0.238
3.445AlaLys: 3.445 ± 0.214
5.023AlaLeu: 5.023 ± 0.291
1.481AlaMet: 1.481 ± 0.124
2.648AlaAsn: 2.648 ± 0.173
2.914AlaPro: 2.914 ± 0.235
1.36AlaGln: 1.36 ± 0.122
2.721AlaArg: 2.721 ± 0.195
5.933AlaSer: 5.933 ± 0.264
3.349AlaThr: 3.349 ± 0.216
4.21AlaVal: 4.21 ± 0.254
0.362AlaTrp: 0.362 ± 0.055
1.143AlaTyr: 1.143 ± 0.089
0.0AlaXaa: 0.0 ± 0.0
Cys
0.837CysAla: 0.837 ± 0.083
0.741CysCys: 0.741 ± 0.132
0.91CysAsp: 0.91 ± 0.091
0.765CysGlu: 0.765 ± 0.073
1.256CysPhe: 1.256 ± 0.106
0.99CysGly: 0.99 ± 0.091
0.322CysHis: 0.322 ± 0.044
1.191CysIle: 1.191 ± 0.098
1.103CysLys: 1.103 ± 0.103
1.988CysLeu: 1.988 ± 0.133
0.572CysMet: 0.572 ± 0.073
0.781CysAsn: 0.781 ± 0.074
1.022CysPro: 1.022 ± 0.092
0.443CysGln: 0.443 ± 0.054
0.958CysArg: 0.958 ± 0.079
2.012CysSer: 2.012 ± 0.138
1.28CysThr: 1.28 ± 0.117
1.224CysVal: 1.224 ± 0.115
0.306CysTrp: 0.306 ± 0.055
0.539CysTyr: 0.539 ± 0.092
0.0CysXaa: 0.0 ± 0.0
Asp
3.027AspAla: 3.027 ± 0.21
0.813AspCys: 0.813 ± 0.088
4.282AspAsp: 4.282 ± 0.356
4.387AspGlu: 4.387 ± 0.356
2.367AspPhe: 2.367 ± 0.136
3.155AspGly: 3.155 ± 0.41
0.668AspHis: 0.668 ± 0.077
3.494AspIle: 3.494 ± 0.163
3.196AspLys: 3.196 ± 0.174
3.429AspLeu: 3.429 ± 0.186
1.594AspMet: 1.594 ± 0.134
2.825AspAsn: 2.825 ± 0.173
1.851AspPro: 1.851 ± 0.142
1.127AspGln: 1.127 ± 0.103
2.069AspArg: 2.069 ± 0.135
3.695AspSer: 3.695 ± 0.243
3.3AspThr: 3.3 ± 0.166
3.735AspVal: 3.735 ± 0.223
0.507AspTrp: 0.507 ± 0.062
1.61AspTyr: 1.61 ± 0.101
0.0AspXaa: 0.0 ± 0.0
Glu
3.099GluAla: 3.099 ± 0.22
0.877GluCys: 0.877 ± 0.087
5.015GluAsp: 5.015 ± 0.418
10.593GluGlu: 10.593 ± 0.863
2.125GluPhe: 2.125 ± 0.132
3.824GluGly: 3.824 ± 0.199
1.079GluHis: 1.079 ± 0.101
3.292GluIle: 3.292 ± 0.189
5.224GluLys: 5.224 ± 0.279
3.655GluLeu: 3.655 ± 0.172
2.085GluMet: 2.085 ± 0.143
4.009GluAsn: 4.009 ± 0.18
1.602GluPro: 1.602 ± 0.108
2.125GluGln: 2.125 ± 0.191
4.025GluArg: 4.025 ± 0.23
3.574GluSer: 3.574 ± 0.192
3.735GluThr: 3.735 ± 0.226
2.938GluVal: 2.938 ± 0.193
0.58GluTrp: 0.58 ± 0.067
1.779GluTyr: 1.779 ± 0.104
0.0GluXaa: 0.0 ± 0.0
Phe
2.061PheAla: 2.061 ± 0.141
1.046PheCys: 1.046 ± 0.103
2.004PheAsp: 2.004 ± 0.122
2.141PheGlu: 2.141 ± 0.129
3.606PhePhe: 3.606 ± 0.248
1.996PheGly: 1.996 ± 0.125
1.095PheHis: 1.095 ± 0.08
3.067PheIle: 3.067 ± 0.201
2.713PheLys: 2.713 ± 0.158
6.061PheLeu: 6.061 ± 0.269
1.312PheMet: 1.312 ± 0.097
2.439PheAsn: 2.439 ± 0.12
2.487PhePro: 2.487 ± 0.192
1.079PheGln: 1.079 ± 0.095
2.028PheArg: 2.028 ± 0.138
5.989PheSer: 5.989 ± 0.289
2.359PheThr: 2.359 ± 0.128
3.172PheVal: 3.172 ± 0.196
0.741PheTrp: 0.741 ± 0.099
1.385PheTyr: 1.385 ± 0.148
0.0PheXaa: 0.0 ± 0.0
Gly
3.324GlyAla: 3.324 ± 0.424
0.877GlyCys: 0.877 ± 0.088
2.97GlyAsp: 2.97 ± 0.148
4.194GlyGlu: 4.194 ± 0.524
1.94GlyPhe: 1.94 ± 0.128
5.409GlyGly: 5.409 ± 0.416
0.894GlyHis: 0.894 ± 0.07
2.833GlyIle: 2.833 ± 0.162
3.445GlyLys: 3.445 ± 0.222
3.614GlyLeu: 3.614 ± 0.193
1.256GlyMet: 1.256 ± 0.094
2.672GlyAsn: 2.672 ± 0.13
2.487GlyPro: 2.487 ± 0.898
1.272GlyGln: 1.272 ± 0.115
3.276GlyArg: 3.276 ± 0.448
4.411GlySer: 4.411 ± 0.221
2.938GlyThr: 2.938 ± 0.182
3.832GlyVal: 3.832 ± 0.185
0.475GlyTrp: 0.475 ± 0.065
0.99GlyTyr: 0.99 ± 0.088
0.0GlyXaa: 0.0 ± 0.0
His
0.773HisAla: 0.773 ± 0.068
0.459HisCys: 0.459 ± 0.07
0.781HisAsp: 0.781 ± 0.084
0.942HisGlu: 0.942 ± 0.099
1.328HisPhe: 1.328 ± 0.117
0.773HisGly: 0.773 ± 0.082
0.716HisHis: 0.716 ± 0.095
1.417HisIle: 1.417 ± 0.093
1.079HisLys: 1.079 ± 0.087
2.592HisLeu: 2.592 ± 0.3
0.523HisMet: 0.523 ± 0.068
0.805HisAsn: 0.805 ± 0.076
1.03HisPro: 1.03 ± 0.082
0.765HisGln: 0.765 ± 0.091
0.99HisArg: 0.99 ± 0.082
1.884HisSer: 1.884 ± 0.099
1.143HisThr: 1.143 ± 0.1
1.183HisVal: 1.183 ± 0.088
0.185HisTrp: 0.185 ± 0.044
0.596HisTyr: 0.596 ± 0.077
0.0HisXaa: 0.0 ± 0.0
Ile
3.18IleAla: 3.18 ± 0.174
1.159IleCys: 1.159 ± 0.103
3.123IleAsp: 3.123 ± 0.163
3.26IleGlu: 3.26 ± 0.158
3.502IlePhe: 3.502 ± 0.208
2.753IleGly: 2.753 ± 0.253
1.199IleHis: 1.199 ± 0.092
3.928IleIle: 3.928 ± 0.178
4.009IleLys: 4.009 ± 0.201
5.989IleLeu: 5.989 ± 0.246
1.513IleMet: 1.513 ± 0.112
3.341IleAsn: 3.341 ± 0.169
2.64IlePro: 2.64 ± 0.124
1.658IleGln: 1.658 ± 0.102
2.922IleArg: 2.922 ± 0.159
6.045IleSer: 6.045 ± 0.25
3.606IleThr: 3.606 ± 0.198
4.178IleVal: 4.178 ± 0.201
0.636IleTrp: 0.636 ± 0.079
1.143IleTyr: 1.143 ± 0.082
0.0IleXaa: 0.0 ± 0.0
Lys
2.793LysAla: 2.793 ± 0.158
1.304LysCys: 1.304 ± 0.114
3.494LysAsp: 3.494 ± 0.22
4.524LysGlu: 4.524 ± 0.241
2.27LysPhe: 2.27 ± 0.108
3.292LysGly: 3.292 ± 0.207
1.658LysHis: 1.658 ± 0.119
4.154LysIle: 4.154 ± 0.232
6.939LysLys: 6.939 ± 0.383
4.814LysLeu: 4.814 ± 0.215
2.367LysMet: 2.367 ± 0.14
4.604LysAsn: 4.604 ± 0.23
2.004LysPro: 2.004 ± 0.125
2.206LysGln: 2.206 ± 0.201
4.999LysArg: 4.999 ± 0.265
5.297LysSer: 5.297 ± 0.265
4.226LysThr: 4.226 ± 0.186
3.3LysVal: 3.3 ± 0.198
0.708LysTrp: 0.708 ± 0.069
2.23LysTyr: 2.23 ± 0.137
0.0LysXaa: 0.0 ± 0.0
Leu
5.176LeuAla: 5.176 ± 0.216
1.892LeuCys: 1.892 ± 0.147
4.178LeuAsp: 4.178 ± 0.208
4.95LeuGlu: 4.95 ± 0.18
5.417LeuPhe: 5.417 ± 0.269
3.912LeuGly: 3.912 ± 0.213
2.149LeuHis: 2.149 ± 0.153
4.749LeuIle: 4.749 ± 0.248
5.78LeuLys: 5.78 ± 0.292
11.535LeuLeu: 11.535 ± 0.61
2.705LeuMet: 2.705 ± 0.171
3.968LeuAsn: 3.968 ± 0.183
4.87LeuPro: 4.87 ± 0.261
2.737LeuGln: 2.737 ± 0.153
4.137LeuArg: 4.137 ± 0.231
8.38LeuSer: 8.38 ± 0.351
4.556LeuThr: 4.556 ± 0.263
5.256LeuVal: 5.256 ± 0.205
0.692LeuTrp: 0.692 ± 0.084
2.962LeuTyr: 2.962 ± 0.152
0.0LeuXaa: 0.0 ± 0.0
Met
2.318MetAla: 2.318 ± 0.154
0.708MetCys: 0.708 ± 0.089
1.868MetAsp: 1.868 ± 0.112
2.02MetGlu: 2.02 ± 0.153
1.24MetPhe: 1.24 ± 0.097
1.513MetGly: 1.513 ± 0.131
0.483MetHis: 0.483 ± 0.063
1.264MetIle: 1.264 ± 0.111
1.988MetLys: 1.988 ± 0.192
2.173MetLeu: 2.173 ± 0.121
1.095MetMet: 1.095 ± 0.097
1.095MetAsn: 1.095 ± 0.099
0.934MetPro: 0.934 ± 0.095
0.692MetGln: 0.692 ± 0.079
1.465MetArg: 1.465 ± 0.096
2.681MetSer: 2.681 ± 0.163
1.707MetThr: 1.707 ± 0.109
1.65MetVal: 1.65 ± 0.123
0.346MetTrp: 0.346 ± 0.049
0.958MetTyr: 0.958 ± 0.088
0.0MetXaa: 0.0 ± 0.0
Asn
2.801AsnAla: 2.801 ± 0.205
1.03AsnCys: 1.03 ± 0.084
2.536AsnAsp: 2.536 ± 0.182
2.721AsnGlu: 2.721 ± 0.174
2.415AsnPhe: 2.415 ± 0.148
2.737AsnGly: 2.737 ± 0.202
0.861AsnHis: 0.861 ± 0.078
3.912AsnIle: 3.912 ± 0.204
4.677AsnLys: 4.677 ± 0.239
4.315AsnLeu: 4.315 ± 0.177
1.489AsnMet: 1.489 ± 0.117
4.999AsnAsn: 4.999 ± 0.301
2.101AsnPro: 2.101 ± 0.151
1.199AsnGln: 1.199 ± 0.119
2.584AsnArg: 2.584 ± 0.127
4.508AsnSer: 4.508 ± 0.23
3.743AsnThr: 3.743 ± 0.261
3.679AsnVal: 3.679 ± 0.202
0.563AsnTrp: 0.563 ± 0.07
1.264AsnTyr: 1.264 ± 0.098
0.0AsnXaa: 0.0 ± 0.0
Pro
2.471ProAla: 2.471 ± 0.243
0.757ProCys: 0.757 ± 0.083
1.481ProAsp: 1.481 ± 0.105
2.608ProGlu: 2.608 ± 0.176
2.487ProPhe: 2.487 ± 0.196
1.835ProGly: 1.835 ± 0.363
1.183ProHis: 1.183 ± 0.168
2.898ProIle: 2.898 ± 0.141
2.27ProLys: 2.27 ± 0.134
4.814ProLeu: 4.814 ± 0.256
0.998ProMet: 0.998 ± 0.09
1.69ProAsn: 1.69 ± 0.114
4.604ProPro: 4.604 ± 0.599
1.449ProGln: 1.449 ± 0.273
2.093ProArg: 2.093 ± 0.177
5.594ProSer: 5.594 ± 0.244
2.801ProThr: 2.801 ± 0.167
3.155ProVal: 3.155 ± 0.156
0.362ProTrp: 0.362 ± 0.066
1.095ProTyr: 1.095 ± 0.1
0.0ProXaa: 0.0 ± 0.0
Gln
1.28GlnAla: 1.28 ± 0.111
0.419GlnCys: 0.419 ± 0.051
1.022GlnAsp: 1.022 ± 0.086
2.053GlnGlu: 2.053 ± 0.144
1.207GlnPhe: 1.207 ± 0.092
1.465GlnGly: 1.465 ± 0.368
0.805GlnHis: 0.805 ± 0.076
1.489GlnIle: 1.489 ± 0.115
2.222GlnLys: 2.222 ± 0.143
2.536GlnLeu: 2.536 ± 0.206
0.837GlnMet: 0.837 ± 0.094
1.393GlnAsn: 1.393 ± 0.132
1.159GlnPro: 1.159 ± 0.1
2.866GlnGln: 2.866 ± 0.607
1.715GlnArg: 1.715 ± 0.201
1.892GlnSer: 1.892 ± 0.118
1.546GlnThr: 1.546 ± 0.104
1.529GlnVal: 1.529 ± 0.101
0.209GlnTrp: 0.209 ± 0.045
1.071GlnTyr: 1.071 ± 0.09
0.0GlnXaa: 0.0 ± 0.0
Arg
2.986ArgAla: 2.986 ± 0.233
0.95ArgCys: 0.95 ± 0.1
2.705ArgAsp: 2.705 ± 0.209
3.324ArgGlu: 3.324 ± 0.298
2.012ArgPhe: 2.012 ± 0.123
3.461ArgGly: 3.461 ± 0.475
1.207ArgHis: 1.207 ± 0.108
3.139ArgIle: 3.139 ± 0.169
3.928ArgLys: 3.928 ± 0.215
4.29ArgLeu: 4.29 ± 0.218
1.594ArgMet: 1.594 ± 0.255
2.809ArgAsn: 2.809 ± 0.136
2.367ArgPro: 2.367 ± 0.15
1.843ArgGln: 1.843 ± 0.139
4.073ArgArg: 4.073 ± 0.254
3.687ArgSer: 3.687 ± 0.179
2.85ArgThr: 2.85 ± 0.146
2.986ArgVal: 2.986 ± 0.151
0.419ArgTrp: 0.419 ± 0.067
1.344ArgTyr: 1.344 ± 0.109
0.0ArgXaa: 0.0 ± 0.0
Ser
5.651SerAla: 5.651 ± 0.312
1.787SerCys: 1.787 ± 0.124
4.572SerAsp: 4.572 ± 0.27
4.097SerGlu: 4.097 ± 0.175
5.055SerPhe: 5.055 ± 0.249
4.524SerGly: 4.524 ± 0.234
1.497SerHis: 1.497 ± 0.102
6.118SerIle: 6.118 ± 0.208
5.329SerLys: 5.329 ± 0.227
9.209SerLeu: 9.209 ± 0.351
2.479SerMet: 2.479 ± 0.146
4.886SerAsn: 4.886 ± 0.296
4.975SerPro: 4.975 ± 0.243
1.803SerGln: 1.803 ± 0.124
4.21SerArg: 4.21 ± 0.186
18.047SerSer: 18.047 ± 1.003
6.214SerThr: 6.214 ± 0.291
6.005SerVal: 6.005 ± 0.282
0.845SerTrp: 0.845 ± 0.09
2.198SerTyr: 2.198 ± 0.143
0.0SerXaa: 0.0 ± 0.0
Thr
3.936ThrAla: 3.936 ± 0.234
1.199ThrCys: 1.199 ± 0.093
2.938ThrAsp: 2.938 ± 0.192
3.147ThrGlu: 3.147 ± 0.173
2.938ThrPhe: 2.938 ± 0.168
3.437ThrGly: 3.437 ± 0.384
1.207ThrHis: 1.207 ± 0.103
3.558ThrIle: 3.558 ± 0.146
3.349ThrLys: 3.349 ± 0.144
5.039ThrLeu: 5.039 ± 0.2
1.481ThrMet: 1.481 ± 0.11
3.542ThrAsn: 3.542 ± 0.187
3.405ThrPro: 3.405 ± 0.205
1.393ThrGln: 1.393 ± 0.097
2.801ThrArg: 2.801 ± 0.15
6.585ThrSer: 6.585 ± 0.28
4.846ThrThr: 4.846 ± 0.267
3.461ThrVal: 3.461 ± 0.201
0.491ThrTrp: 0.491 ± 0.061
1.417ThrTyr: 1.417 ± 0.108
0.0ThrXaa: 0.0 ± 0.0
Val
3.518ValAla: 3.518 ± 0.179
1.368ValCys: 1.368 ± 0.099
3.123ValAsp: 3.123 ± 0.172
3.92ValGlu: 3.92 ± 0.198
3.3ValPhe: 3.3 ± 0.161
3.308ValGly: 3.308 ± 0.247
1.312ValHis: 1.312 ± 0.115
3.494ValIle: 3.494 ± 0.178
4.057ValLys: 4.057 ± 0.213
5.949ValLeu: 5.949 ± 0.283
1.731ValMet: 1.731 ± 0.112
3.019ValAsn: 3.019 ± 0.164
2.954ValPro: 2.954 ± 0.211
1.779ValGln: 1.779 ± 0.108
2.713ValArg: 2.713 ± 0.17
6.013ValSer: 6.013 ± 0.267
3.558ValThr: 3.558 ± 0.166
4.572ValVal: 4.572 ± 0.218
0.547ValTrp: 0.547 ± 0.068
1.956ValTyr: 1.956 ± 0.116
0.0ValXaa: 0.0 ± 0.0
Trp
0.499TrpAla: 0.499 ± 0.06
0.209TrpCys: 0.209 ± 0.044
0.306TrpAsp: 0.306 ± 0.054
0.523TrpGlu: 0.523 ± 0.067
0.411TrpPhe: 0.411 ± 0.052
0.531TrpGly: 0.531 ± 0.068
0.105TrpHis: 0.105 ± 0.032
0.547TrpIle: 0.547 ± 0.064
0.724TrpLys: 0.724 ± 0.096
0.837TrpLeu: 0.837 ± 0.075
0.362TrpMet: 0.362 ± 0.051
0.733TrpAsn: 0.733 ± 0.077
0.298TrpPro: 0.298 ± 0.064
0.201TrpGln: 0.201 ± 0.039
0.668TrpArg: 0.668 ± 0.085
0.773TrpSer: 0.773 ± 0.081
0.733TrpThr: 0.733 ± 0.098
0.491TrpVal: 0.491 ± 0.063
0.185TrpTrp: 0.185 ± 0.046
0.193TrpTyr: 0.193 ± 0.032
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.296TyrAla: 1.296 ± 0.095
0.773TyrCys: 0.773 ± 0.088
1.497TyrAsp: 1.497 ± 0.116
1.674TyrGlu: 1.674 ± 0.133
1.465TyrPhe: 1.465 ± 0.11
1.417TyrGly: 1.417 ± 0.107
0.531TyrHis: 0.531 ± 0.065
1.602TyrIle: 1.602 ± 0.123
1.707TyrLys: 1.707 ± 0.135
2.141TyrLeu: 2.141 ± 0.148
0.716TyrMet: 0.716 ± 0.078
1.876TyrAsn: 1.876 ± 0.138
0.942TyrPro: 0.942 ± 0.089
0.716TyrGln: 0.716 ± 0.083
1.529TyrArg: 1.529 ± 0.128
2.503TyrSer: 2.503 ± 0.147
1.682TyrThr: 1.682 ± 0.116
1.634TyrVal: 1.634 ± 0.127
0.169TyrTrp: 0.169 ± 0.039
1.079TyrTyr: 1.079 ± 0.115
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 524 proteins (124231 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski