Amino acid dipepetide frequency for Ostreid herpesvirus 1 (isolate France) (OsHV-1) (Pacific oyster herpesvirus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.889AlaAla: 3.889 ± 0.303
1.069AlaCys: 1.069 ± 0.168
2.956AlaAsp: 2.956 ± 0.214
3.753AlaGlu: 3.753 ± 0.31
1.769AlaPhe: 1.769 ± 0.148
2.858AlaGly: 2.858 ± 0.282
0.836AlaHis: 0.836 ± 0.127
3.792AlaIle: 3.792 ± 0.28
3.733AlaLys: 3.733 ± 0.36
3.733AlaLeu: 3.733 ± 0.344
1.886AlaMet: 1.886 ± 0.225
2.528AlaAsn: 2.528 ± 0.208
2.217AlaPro: 2.217 ± 0.321
1.594AlaGln: 1.594 ± 0.221
2.761AlaArg: 2.761 ± 0.213
2.936AlaSer: 2.936 ± 0.245
3.131AlaThr: 3.131 ± 0.243
3.558AlaVal: 3.558 ± 0.231
0.35AlaTrp: 0.35 ± 0.07
1.594AlaTyr: 1.594 ± 0.167
0.0AlaXaa: 0.0 ± 0.0
Cys
0.797CysAla: 0.797 ± 0.125
0.583CysCys: 0.583 ± 0.14
1.108CysAsp: 1.108 ± 0.139
1.4CysGlu: 1.4 ± 0.203
0.758CysPhe: 0.758 ± 0.117
1.225CysGly: 1.225 ± 0.179
0.35CysHis: 0.35 ± 0.081
1.186CysIle: 1.186 ± 0.139
1.75CysLys: 1.75 ± 0.196
1.147CysLeu: 1.147 ± 0.158
0.661CysMet: 0.661 ± 0.113
1.186CysAsn: 1.186 ± 0.171
0.797CysPro: 0.797 ± 0.137
0.583CysGln: 0.583 ± 0.119
1.069CysArg: 1.069 ± 0.194
1.4CysSer: 1.4 ± 0.184
0.914CysThr: 0.914 ± 0.168
1.361CysVal: 1.361 ± 0.199
0.156CysTrp: 0.156 ± 0.058
0.797CysTyr: 0.797 ± 0.156
0.0CysXaa: 0.0 ± 0.0
Asp
2.781AspAla: 2.781 ± 0.197
0.953AspCys: 0.953 ± 0.174
5.892AspAsp: 5.892 ± 0.419
6.261AspGlu: 6.261 ± 0.368
3.072AspPhe: 3.072 ± 0.28
3.461AspGly: 3.461 ± 0.274
1.089AspHis: 1.089 ± 0.16
4.861AspIle: 4.861 ± 0.337
4.92AspLys: 4.92 ± 0.335
6.086AspLeu: 6.086 ± 0.316
3.053AspMet: 3.053 ± 0.247
3.383AspAsn: 3.383 ± 0.277
2.372AspPro: 2.372 ± 0.286
1.556AspGln: 1.556 ± 0.164
3.228AspArg: 3.228 ± 0.293
3.656AspSer: 3.656 ± 0.308
3.422AspThr: 3.422 ± 0.237
4.492AspVal: 4.492 ± 0.257
0.875AspTrp: 0.875 ± 0.12
3.092AspTyr: 3.092 ± 0.243
0.0AspXaa: 0.0 ± 0.0
Glu
2.975GluAla: 2.975 ± 0.25
1.186GluCys: 1.186 ± 0.152
5.6GluAsp: 5.6 ± 0.425
9.1GluGlu: 9.1 ± 0.867
3.228GluPhe: 3.228 ± 0.309
3.111GluGly: 3.111 ± 0.228
1.342GluHis: 1.342 ± 0.168
5.464GluIle: 5.464 ± 0.375
5.814GluLys: 5.814 ± 0.524
7.136GluLeu: 7.136 ± 0.416
2.431GluMet: 2.431 ± 0.224
3.85GluAsn: 3.85 ± 0.261
2.314GluPro: 2.314 ± 0.315
1.867GluGln: 1.867 ± 0.179
3.675GluArg: 3.675 ± 0.254
4.278GluSer: 4.278 ± 0.32
3.714GluThr: 3.714 ± 0.267
4.2GluVal: 4.2 ± 0.266
0.758GluTrp: 0.758 ± 0.109
3.345GluTyr: 3.345 ± 0.293
0.0GluXaa: 0.0 ± 0.0
Phe
1.769PheAla: 1.769 ± 0.213
0.661PheCys: 0.661 ± 0.106
3.15PheAsp: 3.15 ± 0.252
2.411PheGlu: 2.411 ± 0.245
2.081PhePhe: 2.081 ± 0.246
1.769PheGly: 1.769 ± 0.216
0.856PheHis: 0.856 ± 0.141
3.247PheIle: 3.247 ± 0.26
3.325PheLys: 3.325 ± 0.238
3.383PheLeu: 3.383 ± 0.264
1.322PheMet: 1.322 ± 0.145
2.431PheAsn: 2.431 ± 0.233
1.419PhePro: 1.419 ± 0.171
0.817PheGln: 0.817 ± 0.111
1.769PheArg: 1.769 ± 0.18
3.111PheSer: 3.111 ± 0.29
2.936PheThr: 2.936 ± 0.32
2.217PheVal: 2.217 ± 0.212
0.525PheTrp: 0.525 ± 0.095
2.022PheTyr: 2.022 ± 0.212
0.0PheXaa: 0.0 ± 0.0
Gly
2.217GlyAla: 2.217 ± 0.22
1.108GlyCys: 1.108 ± 0.141
3.617GlyAsp: 3.617 ± 0.264
3.461GlyGlu: 3.461 ± 0.261
2.022GlyPhe: 2.022 ± 0.211
3.461GlyGly: 3.461 ± 0.297
1.031GlyHis: 1.031 ± 0.124
3.87GlyIle: 3.87 ± 0.316
4.161GlyLys: 4.161 ± 0.3
3.695GlyLeu: 3.695 ± 0.271
1.886GlyMet: 1.886 ± 0.192
2.625GlyAsn: 2.625 ± 0.229
1.283GlyPro: 1.283 ± 0.169
1.439GlyGln: 1.439 ± 0.173
2.547GlyArg: 2.547 ± 0.213
2.917GlySer: 2.917 ± 0.252
2.256GlyThr: 2.256 ± 0.193
3.695GlyVal: 3.695 ± 0.204
0.428GlyTrp: 0.428 ± 0.09
2.022GlyTyr: 2.022 ± 0.157
0.0GlyXaa: 0.0 ± 0.0
His
0.953HisAla: 0.953 ± 0.12
0.7HisCys: 0.7 ± 0.107
0.875HisAsp: 0.875 ± 0.094
1.128HisGlu: 1.128 ± 0.156
0.856HisPhe: 0.856 ± 0.138
1.167HisGly: 1.167 ± 0.162
0.467HisHis: 0.467 ± 0.094
1.342HisIle: 1.342 ± 0.156
1.536HisLys: 1.536 ± 0.198
1.322HisLeu: 1.322 ± 0.17
0.836HisMet: 0.836 ± 0.132
1.108HisAsn: 1.108 ± 0.121
0.836HisPro: 0.836 ± 0.118
0.156HisGln: 0.156 ± 0.057
1.128HisArg: 1.128 ± 0.135
0.972HisSer: 0.972 ± 0.135
1.361HisThr: 1.361 ± 0.175
1.458HisVal: 1.458 ± 0.17
0.156HisTrp: 0.156 ± 0.056
0.894HisTyr: 0.894 ± 0.118
0.0HisXaa: 0.0 ± 0.0
Ile
3.5IleAla: 3.5 ± 0.247
1.633IleCys: 1.633 ± 0.2
4.842IleAsp: 4.842 ± 0.362
4.997IleGlu: 4.997 ± 0.292
2.664IlePhe: 2.664 ± 0.208
2.995IleGly: 2.995 ± 0.222
1.614IleHis: 1.614 ± 0.188
4.608IleIle: 4.608 ± 0.352
5.717IleLys: 5.717 ± 0.291
4.997IleLeu: 4.997 ± 0.284
2.314IleMet: 2.314 ± 0.234
5.367IleAsn: 5.367 ± 0.304
3.87IlePro: 3.87 ± 0.272
1.653IleGln: 1.653 ± 0.219
2.878IleArg: 2.878 ± 0.242
4.745IleSer: 4.745 ± 0.335
5.853IleThr: 5.853 ± 0.364
3.811IleVal: 3.811 ± 0.324
0.486IleTrp: 0.486 ± 0.098
2.683IleTyr: 2.683 ± 0.235
0.0IleXaa: 0.0 ± 0.0
Lys
3.286LysAla: 3.286 ± 0.27
1.089LysCys: 1.089 ± 0.167
5.095LysAsp: 5.095 ± 0.324
6.475LysGlu: 6.475 ± 0.435
2.82LysPhe: 2.82 ± 0.235
3.131LysGly: 3.131 ± 0.231
1.672LysHis: 1.672 ± 0.157
5.931LysIle: 5.931 ± 0.358
7.603LysLys: 7.603 ± 0.644
6.65LysLeu: 6.65 ± 0.339
3.053LysMet: 3.053 ± 0.285
4.375LysAsn: 4.375 ± 0.361
3.325LysPro: 3.325 ± 0.308
2.586LysGln: 2.586 ± 0.253
5.347LysArg: 5.347 ± 0.364
5.133LysSer: 5.133 ± 0.337
4.92LysThr: 4.92 ± 0.341
4.103LysVal: 4.103 ± 0.316
0.506LysTrp: 0.506 ± 0.121
2.722LysTyr: 2.722 ± 0.203
0.0LysXaa: 0.0 ± 0.0
Leu
4.881LeuAla: 4.881 ± 0.294
1.264LeuCys: 1.264 ± 0.189
4.9LeuAsp: 4.9 ± 0.26
5.425LeuGlu: 5.425 ± 0.393
4.297LeuPhe: 4.297 ± 0.265
3.247LeuGly: 3.247 ± 0.228
1.614LeuHis: 1.614 ± 0.191
5.678LeuIle: 5.678 ± 0.331
6.339LeuLys: 6.339 ± 0.407
7.117LeuLeu: 7.117 ± 0.527
2.178LeuMet: 2.178 ± 0.195
4.064LeuAsn: 4.064 ± 0.298
3.889LeuPro: 3.889 ± 0.309
2.12LeuGln: 2.12 ± 0.191
3.539LeuArg: 3.539 ± 0.238
5.659LeuSer: 5.659 ± 0.322
5.367LeuThr: 5.367 ± 0.382
4.064LeuVal: 4.064 ± 0.276
0.583LeuTrp: 0.583 ± 0.112
3.442LeuTyr: 3.442 ± 0.262
0.0LeuXaa: 0.0 ± 0.0
Met
2.897MetAla: 2.897 ± 0.235
0.7MetCys: 0.7 ± 0.131
3.053MetAsp: 3.053 ± 0.207
2.761MetGlu: 2.761 ± 0.231
1.575MetPhe: 1.575 ± 0.144
1.653MetGly: 1.653 ± 0.179
0.389MetHis: 0.389 ± 0.076
2.392MetIle: 2.392 ± 0.222
2.858MetLys: 2.858 ± 0.24
2.178MetLeu: 2.178 ± 0.183
1.167MetMet: 1.167 ± 0.152
1.828MetAsn: 1.828 ± 0.19
1.186MetPro: 1.186 ± 0.126
0.661MetGln: 0.661 ± 0.111
1.478MetArg: 1.478 ± 0.193
2.586MetSer: 2.586 ± 0.227
2.47MetThr: 2.47 ± 0.223
2.236MetVal: 2.236 ± 0.185
0.194MetTrp: 0.194 ± 0.071
1.089MetTyr: 1.089 ± 0.132
0.0MetXaa: 0.0 ± 0.0
Asn
2.411AsnAla: 2.411 ± 0.187
0.758AsnCys: 0.758 ± 0.117
3.617AsnAsp: 3.617 ± 0.308
3.578AsnGlu: 3.578 ± 0.205
1.964AsnPhe: 1.964 ± 0.157
3.014AsnGly: 3.014 ± 0.272
0.992AsnHis: 0.992 ± 0.134
4.336AsnIle: 4.336 ± 0.279
4.997AsnLys: 4.997 ± 0.306
4.453AsnLeu: 4.453 ± 0.299
2.314AsnMet: 2.314 ± 0.207
4.22AsnAsn: 4.22 ± 0.32
2.178AsnPro: 2.178 ± 0.187
1.633AsnGln: 1.633 ± 0.201
2.956AsnArg: 2.956 ± 0.231
3.345AsnSer: 3.345 ± 0.242
3.5AsnThr: 3.5 ± 0.306
3.636AsnVal: 3.636 ± 0.251
0.506AsnTrp: 0.506 ± 0.093
2.178AsnTyr: 2.178 ± 0.2
0.0AsnXaa: 0.0 ± 0.0
Pro
2.256ProAla: 2.256 ± 0.268
0.836ProCys: 0.836 ± 0.158
2.547ProAsp: 2.547 ± 0.239
3.247ProGlu: 3.247 ± 0.289
1.789ProPhe: 1.789 ± 0.21
2.217ProGly: 2.217 ± 0.233
0.583ProHis: 0.583 ± 0.107
3.597ProIle: 3.597 ± 0.308
3.014ProLys: 3.014 ± 0.26
3.092ProLeu: 3.092 ± 0.243
1.478ProMet: 1.478 ± 0.15
1.75ProAsn: 1.75 ± 0.209
3.714ProPro: 3.714 ± 0.647
1.303ProGln: 1.303 ± 0.224
1.925ProArg: 1.925 ± 0.242
2.45ProSer: 2.45 ± 0.235
2.742ProThr: 2.742 ± 0.297
2.839ProVal: 2.839 ± 0.28
0.331ProTrp: 0.331 ± 0.085
1.342ProTyr: 1.342 ± 0.185
0.0ProXaa: 0.0 ± 0.0
Gln
1.089GlnAla: 1.089 ± 0.169
0.564GlnCys: 0.564 ± 0.107
1.206GlnAsp: 1.206 ± 0.119
1.886GlnGlu: 1.886 ± 0.215
1.128GlnPhe: 1.128 ± 0.172
1.536GlnGly: 1.536 ± 0.182
0.447GlnHis: 0.447 ± 0.091
1.167GlnIle: 1.167 ± 0.146
1.983GlnLys: 1.983 ± 0.174
2.236GlnLeu: 2.236 ± 0.244
1.147GlnMet: 1.147 ± 0.149
1.342GlnAsn: 1.342 ± 0.132
1.711GlnPro: 1.711 ± 0.267
1.381GlnGln: 1.381 ± 0.194
1.342GlnArg: 1.342 ± 0.184
1.536GlnSer: 1.536 ± 0.164
1.789GlnThr: 1.789 ± 0.169
1.439GlnVal: 1.439 ± 0.185
0.253GlnTrp: 0.253 ± 0.071
1.206GlnTyr: 1.206 ± 0.181
0.0GlnXaa: 0.0 ± 0.0
Arg
2.489ArgAla: 2.489 ± 0.304
1.186ArgCys: 1.186 ± 0.175
3.403ArgAsp: 3.403 ± 0.314
3.636ArgGlu: 3.636 ± 0.266
1.964ArgPhe: 1.964 ± 0.199
2.625ArgGly: 2.625 ± 0.204
1.128ArgHis: 1.128 ± 0.176
3.325ArgIle: 3.325 ± 0.247
4.025ArgLys: 4.025 ± 0.278
4.511ArgLeu: 4.511 ± 0.29
1.672ArgMet: 1.672 ± 0.227
2.586ArgAsn: 2.586 ± 0.243
1.867ArgPro: 1.867 ± 0.221
1.206ArgGln: 1.206 ± 0.129
3.131ArgArg: 3.131 ± 0.369
3.053ArgSer: 3.053 ± 0.272
2.256ArgThr: 2.256 ± 0.204
2.917ArgVal: 2.917 ± 0.243
0.447ArgTrp: 0.447 ± 0.089
2.217ArgTyr: 2.217 ± 0.176
0.0ArgXaa: 0.0 ± 0.0
Ser
4.064SerAla: 4.064 ± 0.27
1.225SerCys: 1.225 ± 0.204
4.161SerAsp: 4.161 ± 0.334
4.278SerGlu: 4.278 ± 0.298
2.645SerPhe: 2.645 ± 0.221
3.267SerGly: 3.267 ± 0.272
1.322SerHis: 1.322 ± 0.149
4.336SerIle: 4.336 ± 0.309
4.958SerLys: 4.958 ± 0.34
4.783SerLeu: 4.783 ± 0.337
1.692SerMet: 1.692 ± 0.191
3.053SerAsn: 3.053 ± 0.251
2.45SerPro: 2.45 ± 0.273
1.633SerGln: 1.633 ± 0.162
2.897SerArg: 2.897 ± 0.226
4.22SerSer: 4.22 ± 0.377
4.531SerThr: 4.531 ± 0.338
4.025SerVal: 4.025 ± 0.29
0.428SerTrp: 0.428 ± 0.088
2.372SerTyr: 2.372 ± 0.198
0.0SerXaa: 0.0 ± 0.0
Thr
3.52ThrAla: 3.52 ± 0.234
1.264ThrCys: 1.264 ± 0.21
4.083ThrAsp: 4.083 ± 0.287
4.278ThrGlu: 4.278 ± 0.312
2.236ThrPhe: 2.236 ± 0.179
3.5ThrGly: 3.5 ± 0.25
1.458ThrHis: 1.458 ± 0.201
4.433ThrIle: 4.433 ± 0.255
3.792ThrLys: 3.792 ± 0.246
4.958ThrLeu: 4.958 ± 0.363
2.664ThrMet: 2.664 ± 0.192
3.558ThrAsn: 3.558 ± 0.32
3.636ThrPro: 3.636 ± 0.361
1.769ThrGln: 1.769 ± 0.189
2.683ThrArg: 2.683 ± 0.231
3.617ThrSer: 3.617 ± 0.28
4.647ThrThr: 4.647 ± 0.393
3.481ThrVal: 3.481 ± 0.244
0.583ThrTrp: 0.583 ± 0.103
1.964ThrTyr: 1.964 ± 0.207
0.0ThrXaa: 0.0 ± 0.0
Val
2.936ValAla: 2.936 ± 0.226
1.633ValCys: 1.633 ± 0.174
4.414ValAsp: 4.414 ± 0.283
4.258ValGlu: 4.258 ± 0.278
2.489ValPhe: 2.489 ± 0.195
2.761ValGly: 2.761 ± 0.213
0.933ValHis: 0.933 ± 0.115
4.045ValIle: 4.045 ± 0.268
5.561ValLys: 5.561 ± 0.292
4.628ValLeu: 4.628 ± 0.279
1.867ValMet: 1.867 ± 0.203
4.2ValAsn: 4.2 ± 0.309
2.567ValPro: 2.567 ± 0.235
1.225ValGln: 1.225 ± 0.168
2.761ValArg: 2.761 ± 0.33
3.558ValSer: 3.558 ± 0.327
3.131ValThr: 3.131 ± 0.27
4.686ValVal: 4.686 ± 0.291
0.389ValTrp: 0.389 ± 0.078
2.936ValTyr: 2.936 ± 0.264
0.0ValXaa: 0.0 ± 0.0
Trp
0.467TrpAla: 0.467 ± 0.119
0.136TrpCys: 0.136 ± 0.043
0.525TrpAsp: 0.525 ± 0.094
0.7TrpGlu: 0.7 ± 0.129
0.292TrpPhe: 0.292 ± 0.088
0.389TrpGly: 0.389 ± 0.081
0.136TrpHis: 0.136 ± 0.047
0.506TrpIle: 0.506 ± 0.115
0.681TrpLys: 0.681 ± 0.119
0.622TrpLeu: 0.622 ± 0.098
0.389TrpMet: 0.389 ± 0.091
0.428TrpAsn: 0.428 ± 0.1
0.369TrpPro: 0.369 ± 0.075
0.331TrpGln: 0.331 ± 0.091
0.467TrpArg: 0.467 ± 0.102
0.564TrpSer: 0.564 ± 0.097
0.447TrpThr: 0.447 ± 0.099
0.583TrpVal: 0.583 ± 0.111
0.117TrpTrp: 0.117 ± 0.047
0.272TrpTyr: 0.272 ± 0.086
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.769TyrAla: 1.769 ± 0.185
0.681TyrCys: 0.681 ± 0.12
3.461TyrAsp: 3.461 ± 0.281
2.353TyrGlu: 2.353 ± 0.209
1.575TyrPhe: 1.575 ± 0.17
2.353TyrGly: 2.353 ± 0.206
0.992TyrHis: 0.992 ± 0.134
3.111TyrIle: 3.111 ± 0.204
3.053TyrLys: 3.053 ± 0.216
2.936TyrLeu: 2.936 ± 0.23
1.322TyrMet: 1.322 ± 0.167
2.703TyrAsn: 2.703 ± 0.208
1.05TyrPro: 1.05 ± 0.123
0.894TyrGln: 0.894 ± 0.135
2.061TyrArg: 2.061 ± 0.194
2.606TyrSer: 2.606 ± 0.216
2.742TyrThr: 2.742 ± 0.253
2.178TyrVal: 2.178 ± 0.18
0.35TyrTrp: 0.35 ± 0.088
1.925TyrTyr: 1.925 ± 0.202
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 116 proteins (51428 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski