Amino acid dipepetide frequency for White spot syndrome virus (WSSV) (White spot bacilliform virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.724AlaAla: 6.724 ± 0.598
0.863AlaCys: 0.863 ± 0.095
2.342AlaAsp: 2.342 ± 0.152
3.374AlaGlu: 3.374 ± 0.172
2.43AlaPhe: 2.43 ± 0.178
2.758AlaGly: 2.758 ± 0.292
0.991AlaHis: 0.991 ± 0.09
3.614AlaIle: 3.614 ± 0.242
3.446AlaLys: 3.446 ± 0.247
5.245AlaLeu: 5.245 ± 0.346
1.487AlaMet: 1.487 ± 0.1
2.646AlaAsn: 2.646 ± 0.162
2.902AlaPro: 2.902 ± 0.244
1.399AlaGln: 1.399 ± 0.132
2.726AlaArg: 2.726 ± 0.185
5.924AlaSer: 5.924 ± 0.259
3.278AlaThr: 3.278 ± 0.219
4.205AlaVal: 4.205 ± 0.228
0.368AlaTrp: 0.368 ± 0.055
1.143AlaTyr: 1.143 ± 0.086
0.0AlaXaa: 0.0 ± 0.0
Cys
0.831CysAla: 0.831 ± 0.095
0.728CysCys: 0.728 ± 0.139
0.903CysAsp: 0.903 ± 0.084
0.767CysGlu: 0.767 ± 0.081
1.247CysPhe: 1.247 ± 0.107
0.975CysGly: 0.975 ± 0.101
0.336CysHis: 0.336 ± 0.053
1.183CysIle: 1.183 ± 0.114
1.095CysLys: 1.095 ± 0.105
1.959CysLeu: 1.959 ± 0.164
0.56CysMet: 0.56 ± 0.065
0.783CysAsn: 0.783 ± 0.081
1.031CysPro: 1.031 ± 0.09
0.448CysGln: 0.448 ± 0.065
0.935CysArg: 0.935 ± 0.085
2.007CysSer: 2.007 ± 0.16
1.279CysThr: 1.279 ± 0.115
1.279CysVal: 1.279 ± 0.125
0.312CysTrp: 0.312 ± 0.056
0.528CysTyr: 0.528 ± 0.085
0.0CysXaa: 0.0 ± 0.0
Asp
3.038AspAla: 3.038 ± 0.192
0.815AspCys: 0.815 ± 0.096
4.245AspAsp: 4.245 ± 0.3
4.277AspGlu: 4.277 ± 0.267
2.358AspPhe: 2.358 ± 0.14
3.102AspGly: 3.102 ± 0.369
0.68AspHis: 0.68 ± 0.087
3.478AspIle: 3.478 ± 0.163
3.158AspLys: 3.158 ± 0.179
3.478AspLeu: 3.478 ± 0.181
1.583AspMet: 1.583 ± 0.106
2.822AspAsn: 2.822 ± 0.178
1.831AspPro: 1.831 ± 0.131
1.151AspGln: 1.151 ± 0.112
2.095AspArg: 2.095 ± 0.113
3.678AspSer: 3.678 ± 0.223
3.302AspThr: 3.302 ± 0.149
3.734AspVal: 3.734 ± 0.221
0.52AspTrp: 0.52 ± 0.06
1.567AspTyr: 1.567 ± 0.109
0.0AspXaa: 0.0 ± 0.0
Glu
3.142GluAla: 3.142 ± 0.269
0.871GluCys: 0.871 ± 0.092
4.901GluAsp: 4.901 ± 0.347
10.657GluGlu: 10.657 ± 0.889
2.055GluPhe: 2.055 ± 0.138
3.798GluGly: 3.798 ± 0.191
1.047GluHis: 1.047 ± 0.093
3.286GluIle: 3.286 ± 0.168
5.62GluLys: 5.62 ± 0.404
3.678GluLeu: 3.678 ± 0.193
2.103GluMet: 2.103 ± 0.159
4.005GluAsn: 4.005 ± 0.185
1.607GluPro: 1.607 ± 0.131
2.287GluGln: 2.287 ± 0.23
4.029GluArg: 4.029 ± 0.247
3.598GluSer: 3.598 ± 0.166
3.71GluThr: 3.71 ± 0.224
2.926GluVal: 2.926 ± 0.178
0.576GluTrp: 0.576 ± 0.067
1.759GluTyr: 1.759 ± 0.113
0.0GluXaa: 0.0 ± 0.0
Phe
2.039PheAla: 2.039 ± 0.154
1.047PheCys: 1.047 ± 0.105
2.031PheAsp: 2.031 ± 0.139
2.127PheGlu: 2.127 ± 0.131
3.63PhePhe: 3.63 ± 0.27
1.999PheGly: 1.999 ± 0.116
1.087PheHis: 1.087 ± 0.099
3.086PheIle: 3.086 ± 0.183
2.678PheLys: 2.678 ± 0.153
6.028PheLeu: 6.028 ± 0.325
1.327PheMet: 1.327 ± 0.11
2.43PheAsn: 2.43 ± 0.136
2.47PhePro: 2.47 ± 0.155
1.079PheGln: 1.079 ± 0.109
2.015PheArg: 2.015 ± 0.146
5.948PheSer: 5.948 ± 0.267
2.342PheThr: 2.342 ± 0.153
3.19PheVal: 3.19 ± 0.21
0.744PheTrp: 0.744 ± 0.089
1.327PheTyr: 1.327 ± 0.13
0.0PheXaa: 0.0 ± 0.0
Gly
3.382GlyAla: 3.382 ± 0.411
0.855GlyCys: 0.855 ± 0.092
2.926GlyAsp: 2.926 ± 0.155
4.413GlyGlu: 4.413 ± 0.542
1.919GlyPhe: 1.919 ± 0.128
5.404GlyGly: 5.404 ± 0.397
0.879GlyHis: 0.879 ± 0.087
2.822GlyIle: 2.822 ± 0.152
3.454GlyLys: 3.454 ± 0.222
3.59GlyLeu: 3.59 ± 0.194
1.263GlyMet: 1.263 ± 0.113
2.678GlyAsn: 2.678 ± 0.138
2.534GlyPro: 2.534 ± 0.866
1.271GlyGln: 1.271 ± 0.133
3.334GlyArg: 3.334 ± 0.465
4.381GlySer: 4.381 ± 0.182
2.934GlyThr: 2.934 ± 0.161
3.79GlyVal: 3.79 ± 0.211
0.488GlyTrp: 0.488 ± 0.068
1.023GlyTyr: 1.023 ± 0.093
0.0GlyXaa: 0.0 ± 0.0
His
0.791HisAla: 0.791 ± 0.078
0.464HisCys: 0.464 ± 0.074
0.799HisAsp: 0.799 ± 0.091
0.943HisGlu: 0.943 ± 0.107
1.319HisPhe: 1.319 ± 0.117
0.791HisGly: 0.791 ± 0.075
0.704HisHis: 0.704 ± 0.116
1.383HisIle: 1.383 ± 0.11
1.071HisLys: 1.071 ± 0.098
2.542HisLeu: 2.542 ± 0.288
0.528HisMet: 0.528 ± 0.068
0.799HisAsn: 0.799 ± 0.074
1.023HisPro: 1.023 ± 0.1
0.752HisGln: 0.752 ± 0.086
0.983HisArg: 0.983 ± 0.082
1.879HisSer: 1.879 ± 0.115
1.119HisThr: 1.119 ± 0.109
1.175HisVal: 1.175 ± 0.096
0.176HisTrp: 0.176 ± 0.038
0.592HisTyr: 0.592 ± 0.07
0.0HisXaa: 0.0 ± 0.0
Ile
3.182IleAla: 3.182 ± 0.184
1.167IleCys: 1.167 ± 0.112
3.102IleAsp: 3.102 ± 0.148
3.246IleGlu: 3.246 ± 0.158
3.486IlePhe: 3.486 ± 0.196
2.758IleGly: 2.758 ± 0.217
1.167IleHis: 1.167 ± 0.102
3.893IleIle: 3.893 ± 0.189
4.013IleLys: 4.013 ± 0.172
5.972IleLeu: 5.972 ± 0.22
1.527IleMet: 1.527 ± 0.103
3.302IleAsn: 3.302 ± 0.16
2.63IlePro: 2.63 ± 0.146
1.647IleGln: 1.647 ± 0.111
2.926IleArg: 2.926 ± 0.142
6.02IleSer: 6.02 ± 0.253
3.614IleThr: 3.614 ± 0.188
4.133IleVal: 4.133 ± 0.199
0.624IleTrp: 0.624 ± 0.082
1.151IleTyr: 1.151 ± 0.103
0.0IleXaa: 0.0 ± 0.0
Lys
2.798LysAla: 2.798 ± 0.185
1.287LysCys: 1.287 ± 0.114
3.462LysAsp: 3.462 ± 0.21
4.685LysGlu: 4.685 ± 0.248
2.279LysPhe: 2.279 ± 0.12
3.582LysGly: 3.582 ± 0.333
1.663LysHis: 1.663 ± 0.116
4.117LysIle: 4.117 ± 0.214
6.804LysLys: 6.804 ± 0.392
4.861LysLeu: 4.861 ± 0.189
2.374LysMet: 2.374 ± 0.126
4.581LysAsn: 4.581 ± 0.236
2.015LysPro: 2.015 ± 0.122
2.263LysGln: 2.263 ± 0.231
4.925LysArg: 4.925 ± 0.309
5.245LysSer: 5.245 ± 0.25
4.213LysThr: 4.213 ± 0.204
3.31LysVal: 3.31 ± 0.243
0.696LysTrp: 0.696 ± 0.065
2.215LysTyr: 2.215 ± 0.139
0.0LysXaa: 0.0 ± 0.0
Leu
5.293LeuAla: 5.293 ± 0.253
1.847LeuCys: 1.847 ± 0.151
4.253LeuAsp: 4.253 ± 0.224
4.941LeuGlu: 4.941 ± 0.223
5.388LeuPhe: 5.388 ± 0.294
4.037LeuGly: 4.037 ± 0.252
2.119LeuHis: 2.119 ± 0.156
4.757LeuIle: 4.757 ± 0.226
5.908LeuLys: 5.908 ± 0.295
11.496LeuLeu: 11.496 ± 0.563
2.71LeuMet: 2.71 ± 0.146
3.941LeuAsn: 3.941 ± 0.183
4.893LeuPro: 4.893 ± 0.26
2.822LeuGln: 2.822 ± 0.166
4.093LeuArg: 4.093 ± 0.186
8.386LeuSer: 8.386 ± 0.315
4.565LeuThr: 4.565 ± 0.265
5.293LeuVal: 5.293 ± 0.227
0.696LeuTrp: 0.696 ± 0.079
2.934LeuTyr: 2.934 ± 0.147
0.0LeuXaa: 0.0 ± 0.0
Met
2.302MetAla: 2.302 ± 0.14
0.696MetCys: 0.696 ± 0.08
1.887MetAsp: 1.887 ± 0.113
2.087MetGlu: 2.087 ± 0.171
1.247MetPhe: 1.247 ± 0.091
1.527MetGly: 1.527 ± 0.134
0.48MetHis: 0.48 ± 0.062
1.247MetIle: 1.247 ± 0.095
1.927MetLys: 1.927 ± 0.137
2.167MetLeu: 2.167 ± 0.147
1.055MetMet: 1.055 ± 0.088
1.087MetAsn: 1.087 ± 0.113
0.943MetPro: 0.943 ± 0.085
0.696MetGln: 0.696 ± 0.078
1.447MetArg: 1.447 ± 0.108
2.638MetSer: 2.638 ± 0.139
1.703MetThr: 1.703 ± 0.114
1.703MetVal: 1.703 ± 0.103
0.352MetTrp: 0.352 ± 0.049
0.959MetTyr: 0.959 ± 0.094
0.0MetXaa: 0.0 ± 0.0
Asn
2.79AsnAla: 2.79 ± 0.228
1.031AsnCys: 1.031 ± 0.096
2.542AsnAsp: 2.542 ± 0.19
2.702AsnGlu: 2.702 ± 0.159
2.398AsnPhe: 2.398 ± 0.129
2.71AsnGly: 2.71 ± 0.211
0.831AsnHis: 0.831 ± 0.076
3.909AsnIle: 3.909 ± 0.21
4.621AsnLys: 4.621 ± 0.213
4.293AsnLeu: 4.293 ± 0.181
1.487AsnMet: 1.487 ± 0.126
4.941AsnAsn: 4.941 ± 0.321
2.095AsnPro: 2.095 ± 0.127
1.191AsnGln: 1.191 ± 0.105
2.614AsnArg: 2.614 ± 0.132
4.493AsnSer: 4.493 ± 0.217
3.758AsnThr: 3.758 ± 0.264
3.726AsnVal: 3.726 ± 0.209
0.568AsnTrp: 0.568 ± 0.071
1.271AsnTyr: 1.271 ± 0.109
0.0AsnXaa: 0.0 ± 0.0
Pro
2.446ProAla: 2.446 ± 0.254
0.783ProCys: 0.783 ± 0.094
1.455ProAsp: 1.455 ± 0.104
2.542ProGlu: 2.542 ± 0.163
2.47ProPhe: 2.47 ± 0.185
1.823ProGly: 1.823 ± 0.335
1.159ProHis: 1.159 ± 0.164
2.934ProIle: 2.934 ± 0.137
2.302ProLys: 2.302 ± 0.134
4.853ProLeu: 4.853 ± 0.273
1.047ProMet: 1.047 ± 0.081
1.695ProAsn: 1.695 ± 0.098
4.541ProPro: 4.541 ± 0.577
1.415ProGln: 1.415 ± 0.263
2.103ProArg: 2.103 ± 0.172
5.58ProSer: 5.58 ± 0.239
2.798ProThr: 2.798 ± 0.154
3.166ProVal: 3.166 ± 0.138
0.392ProTrp: 0.392 ± 0.069
1.063ProTyr: 1.063 ± 0.104
0.0ProXaa: 0.0 ± 0.0
Gln
1.343GlnAla: 1.343 ± 0.115
0.408GlnCys: 0.408 ± 0.057
1.031GlnAsp: 1.031 ± 0.088
2.223GlnGlu: 2.223 ± 0.189
1.207GlnPhe: 1.207 ± 0.093
1.447GlnGly: 1.447 ± 0.351
0.791GlnHis: 0.791 ± 0.08
1.487GlnIle: 1.487 ± 0.126
2.223GlnLys: 2.223 ± 0.14
2.71GlnLeu: 2.71 ± 0.243
0.831GlnMet: 0.831 ± 0.094
1.407GlnAsn: 1.407 ± 0.129
1.151GlnPro: 1.151 ± 0.114
2.886GlnGln: 2.886 ± 0.641
1.719GlnArg: 1.719 ± 0.212
1.887GlnSer: 1.887 ± 0.137
1.527GlnThr: 1.527 ± 0.122
1.551GlnVal: 1.551 ± 0.112
0.208GlnTrp: 0.208 ± 0.036
1.087GlnTyr: 1.087 ± 0.094
0.0GlnXaa: 0.0 ± 0.0
Arg
2.958ArgAla: 2.958 ± 0.194
0.951ArgCys: 0.951 ± 0.097
2.678ArgAsp: 2.678 ± 0.196
3.398ArgGlu: 3.398 ± 0.352
2.015ArgPhe: 2.015 ± 0.112
3.534ArgGly: 3.534 ± 0.442
1.223ArgHis: 1.223 ± 0.103
3.11ArgIle: 3.11 ± 0.164
3.941ArgLys: 3.941 ± 0.246
4.293ArgLeu: 4.293 ± 0.212
1.519ArgMet: 1.519 ± 0.141
2.846ArgAsn: 2.846 ± 0.16
2.366ArgPro: 2.366 ± 0.149
1.871ArgGln: 1.871 ± 0.147
4.125ArgArg: 4.125 ± 0.275
3.654ArgSer: 3.654 ± 0.173
2.862ArgThr: 2.862 ± 0.156
2.974ArgVal: 2.974 ± 0.157
0.424ArgTrp: 0.424 ± 0.065
1.311ArgTyr: 1.311 ± 0.1
0.0ArgXaa: 0.0 ± 0.0
Ser
5.644SerAla: 5.644 ± 0.29
1.783SerCys: 1.783 ± 0.111
4.533SerAsp: 4.533 ± 0.245
4.021SerGlu: 4.021 ± 0.191
5.093SerPhe: 5.093 ± 0.249
4.501SerGly: 4.501 ± 0.232
1.503SerHis: 1.503 ± 0.098
6.124SerIle: 6.124 ± 0.244
5.325SerLys: 5.325 ± 0.228
9.122SerLeu: 9.122 ± 0.384
2.486SerMet: 2.486 ± 0.131
4.853SerAsn: 4.853 ± 0.263
5.029SerPro: 5.029 ± 0.237
1.799SerGln: 1.799 ± 0.122
4.221SerArg: 4.221 ± 0.207
17.876SerSer: 17.876 ± 1.063
6.116SerThr: 6.116 ± 0.23
5.98SerVal: 5.98 ± 0.249
0.831SerTrp: 0.831 ± 0.076
2.199SerTyr: 2.199 ± 0.125
0.0SerXaa: 0.0 ± 0.0
Thr
3.845ThrAla: 3.845 ± 0.231
1.207ThrCys: 1.207 ± 0.108
2.966ThrAsp: 2.966 ± 0.169
3.142ThrGlu: 3.142 ± 0.174
2.918ThrPhe: 2.918 ± 0.134
3.406ThrGly: 3.406 ± 0.329
1.207ThrHis: 1.207 ± 0.092
3.518ThrIle: 3.518 ± 0.148
3.334ThrLys: 3.334 ± 0.13
5.045ThrLeu: 5.045 ± 0.224
1.463ThrMet: 1.463 ± 0.11
3.542ThrAsn: 3.542 ± 0.187
3.382ThrPro: 3.382 ± 0.218
1.407ThrGln: 1.407 ± 0.106
2.798ThrArg: 2.798 ± 0.153
6.524ThrSer: 6.524 ± 0.272
4.845ThrThr: 4.845 ± 0.293
3.478ThrVal: 3.478 ± 0.199
0.48ThrTrp: 0.48 ± 0.064
1.415ThrTyr: 1.415 ± 0.107
0.0ThrXaa: 0.0 ± 0.0
Val
3.478ValAla: 3.478 ± 0.184
1.383ValCys: 1.383 ± 0.093
3.078ValAsp: 3.078 ± 0.171
3.925ValGlu: 3.925 ± 0.179
3.294ValPhe: 3.294 ± 0.202
3.246ValGly: 3.246 ± 0.234
1.343ValHis: 1.343 ± 0.151
3.478ValIle: 3.478 ± 0.169
4.021ValLys: 4.021 ± 0.204
6.036ValLeu: 6.036 ± 0.274
1.727ValMet: 1.727 ± 0.115
3.03ValAsn: 3.03 ± 0.177
2.894ValPro: 2.894 ± 0.166
1.807ValGln: 1.807 ± 0.114
2.822ValArg: 2.822 ± 0.185
6.068ValSer: 6.068 ± 0.247
3.534ValThr: 3.534 ± 0.183
4.541ValVal: 4.541 ± 0.235
0.56ValTrp: 0.56 ± 0.079
1.991ValTyr: 1.991 ± 0.138
0.0ValXaa: 0.0 ± 0.0
Trp
0.52TrpAla: 0.52 ± 0.07
0.216TrpCys: 0.216 ± 0.045
0.312TrpAsp: 0.312 ± 0.048
0.52TrpGlu: 0.52 ± 0.057
0.408TrpPhe: 0.408 ± 0.056
0.528TrpGly: 0.528 ± 0.059
0.104TrpHis: 0.104 ± 0.029
0.544TrpIle: 0.544 ± 0.069
0.72TrpLys: 0.72 ± 0.082
0.855TrpLeu: 0.855 ± 0.091
0.36TrpMet: 0.36 ± 0.048
0.712TrpAsn: 0.712 ± 0.083
0.296TrpPro: 0.296 ± 0.052
0.192TrpGln: 0.192 ± 0.036
0.68TrpArg: 0.68 ± 0.086
0.775TrpSer: 0.775 ± 0.088
0.744TrpThr: 0.744 ± 0.086
0.496TrpVal: 0.496 ± 0.061
0.184TrpTrp: 0.184 ± 0.041
0.208TrpTyr: 0.208 ± 0.037
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.319TyrAla: 1.319 ± 0.094
0.783TyrCys: 0.783 ± 0.093
1.463TyrAsp: 1.463 ± 0.101
1.663TyrGlu: 1.663 ± 0.128
1.463TyrPhe: 1.463 ± 0.117
1.415TyrGly: 1.415 ± 0.11
0.52TyrHis: 0.52 ± 0.064
1.591TyrIle: 1.591 ± 0.125
1.711TyrLys: 1.711 ± 0.13
2.151TyrLeu: 2.151 ± 0.145
0.704TyrMet: 0.704 ± 0.077
1.871TyrAsn: 1.871 ± 0.116
0.935TyrPro: 0.935 ± 0.087
0.744TyrGln: 0.744 ± 0.07
1.463TyrArg: 1.463 ± 0.095
2.478TyrSer: 2.478 ± 0.135
1.679TyrThr: 1.679 ± 0.103
1.615TyrVal: 1.615 ± 0.131
0.176TyrTrp: 0.176 ± 0.038
1.079TyrTyr: 1.079 ± 0.1
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 532 proteins (125083 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski