Amino acid dipeptide frequency for Cyprinid herpesvirus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
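As a minimal sketch of how such frequencies could be computed (not necessarily the procedure used for this table), the snippet below counts overlapping dipeptides in a list of protein sequences and compares them with the uniform expectation of 2.5‰. The function name `dipeptide_per_mille`, the toy sequences, the use of one-letter codes, and the choice of denominator (total number of dipeptides counted) are all assumptions made for illustration.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Dipeptides are counted with overlap inside each protein and never
    across protein boundaries. The denominator is the total number of
    dipeptides counted (an assumption of this sketch).
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation for 400 equally likely dipeptides: 1/400 = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

toy = ["AAAL", "ALLA"]  # hypothetical toy proteins, one-letter codes
freqs = dipeptide_per_mille(toy)

# Dipeptides above the uniform expectation (would be "red" in the table).
over = sorted(p for p, f in freqs.items() if f > EXPECTED_PER_MILLE)
# Dipeptides below it (would be "blue"); absent pairs count as 0 per mille.
under = sorted(p for p, f in freqs.items() if f < EXPECTED_PER_MILLE)
```

With only two tiny sequences every observed dipeptide is far above 2.5‰; the comparison becomes meaningful only for a full proteome.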

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
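For example, the unit conversion can be written as follows. This is only a small illustration; the helper name `per_mille_to_fraction` is hypothetical, and the input is the AlaAla value from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

ala_ala_fraction = per_mille_to_fraction(11.319)  # AlaAla value from the table
ala_ala_percent = ala_ala_fraction * 100          # the same value as a percentage
```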

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.319 ± 0.635
AlaCys: 1.759 ± 0.162
AlaAsp: 4.451 ± 0.257
AlaGlu: 5.277 ± 0.317
AlaPhe: 2.943 ± 0.206
AlaGly: 5.277 ± 0.317
AlaHis: 2.154 ± 0.16
AlaIle: 2.572 ± 0.223
AlaLys: 3.35 ± 0.285
AlaLeu: 7.897 ± 0.314
AlaMet: 2.13 ± 0.164
AlaAsn: 2.417 ± 0.222
AlaPro: 5.277 ± 0.34
AlaGln: 3.362 ± 0.31
AlaArg: 5.265 ± 0.316
AlaSer: 7.382 ± 0.463
AlaThr: 5.073 ± 0.281
AlaVal: 8.1 ± 0.342
AlaTrp: 0.945 ± 0.098
AlaTyr: 2.225 ± 0.168
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.879 ± 0.168
CysCys: 0.766 ± 0.127
CysAsp: 1.304 ± 0.149
CysGlu: 1.316 ± 0.14
CysPhe: 0.718 ± 0.092
CysGly: 1.663 ± 0.221
CysHis: 0.455 ± 0.069
CysIle: 0.67 ± 0.109
CysLys: 0.682 ± 0.077
CysLeu: 2.273 ± 0.19
CysMet: 0.67 ± 0.103
CysAsn: 0.574 ± 0.101
CysPro: 1.508 ± 0.192
CysGln: 0.538 ± 0.086
CysArg: 1.28 ± 0.122
CysSer: 1.448 ± 0.145
CysThr: 1.208 ± 0.106
CysVal: 1.639 ± 0.167
CysTrp: 0.215 ± 0.054
CysTyr: 0.634 ± 0.098
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.48 ± 0.325
AspCys: 1.017 ± 0.121
AspAsp: 5.049 ± 0.286
AspGlu: 4.69 ± 0.238
AspPhe: 2.094 ± 0.182
AspGly: 4.343 ± 0.262
AspHis: 1.232 ± 0.11
AspIle: 1.831 ± 0.167
AspLys: 2.106 ± 0.189
AspLeu: 5.169 ± 0.245
AspMet: 1.555 ± 0.146
AspAsn: 1.675 ± 0.178
AspPro: 3.219 ± 0.223
AspGln: 1.735 ± 0.163
AspArg: 3.039 ± 0.217
AspSer: 4.032 ± 0.253
AspThr: 3.027 ± 0.23
AspVal: 4.607 ± 0.284
AspTrp: 0.802 ± 0.089
AspTyr: 1.759 ± 0.138
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.241 ± 0.295
GluCys: 0.993 ± 0.112
GluAsp: 5.001 ± 0.282
GluGlu: 5.743 ± 0.44
GluPhe: 1.627 ± 0.137
GluGly: 3.362 ± 0.268
GluHis: 1.448 ± 0.156
GluIle: 1.938 ± 0.17
GluLys: 2.309 ± 0.223
GluLeu: 5.863 ± 0.291
GluMet: 1.555 ± 0.137
GluAsn: 1.52 ± 0.141
GluPro: 3.41 ± 0.23
GluGln: 2.273 ± 0.201
GluArg: 4.224 ± 0.284
GluSer: 3.889 ± 0.244
GluThr: 3.637 ± 0.347
GluVal: 3.685 ± 0.241
GluTrp: 0.873 ± 0.131
GluTyr: 1.747 ± 0.17
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.752 ± 0.2
PheCys: 0.85 ± 0.095
PheAsp: 2.142 ± 0.173
PheGlu: 2.106 ± 0.158
PhePhe: 1.615 ± 0.145
PheGly: 2.118 ± 0.171
PheHis: 0.754 ± 0.107
PheIle: 1.029 ± 0.116
PheLys: 1.663 ± 0.14
PheLeu: 2.465 ± 0.19
PheMet: 0.981 ± 0.119
PheAsn: 1.543 ± 0.181
PhePro: 1.244 ± 0.128
PheGln: 1.113 ± 0.125
PheArg: 1.938 ± 0.156
PheSer: 2.393 ± 0.193
PheThr: 2.345 ± 0.186
PheVal: 2.824 ± 0.218
PheTrp: 0.467 ± 0.081
PheTyr: 1.017 ± 0.111
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.683 ± 0.329
GlyCys: 1.173 ± 0.129
GlyAsp: 3.362 ± 0.196
GlyGlu: 3.601 ± 0.263
GlyPhe: 2.237 ± 0.199
GlyGly: 5.923 ± 0.507
GlyHis: 1.448 ± 0.178
GlyIle: 1.962 ± 0.169
GlyLys: 2.214 ± 0.206
GlyLeu: 5.576 ± 0.317
GlyMet: 1.34 ± 0.131
GlyAsn: 1.89 ± 0.224
GlyPro: 3.422 ± 0.268
GlyGln: 2.106 ± 0.182
GlyArg: 3.805 ± 0.264
GlySer: 4.355 ± 0.296
GlyThr: 3.661 ± 0.217
GlyVal: 4.906 ± 0.308
GlyTrp: 0.933 ± 0.095
GlyTyr: 1.807 ± 0.162
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.082 ± 0.183
HisCys: 0.562 ± 0.083
HisAsp: 1.077 ± 0.113
HisGlu: 1.113 ± 0.119
HisPhe: 0.838 ± 0.105
HisGly: 1.687 ± 0.19
HisHis: 1.22 ± 0.162
HisIle: 1.005 ± 0.143
HisLys: 1.244 ± 0.143
HisLeu: 2.357 ± 0.155
HisMet: 0.706 ± 0.099
HisAsn: 0.945 ± 0.105
HisPro: 1.567 ± 0.212
HisGln: 0.993 ± 0.135
HisArg: 2.01 ± 0.186
HisSer: 1.484 ± 0.166
HisThr: 1.639 ± 0.198
HisVal: 1.532 ± 0.145
HisTrp: 0.275 ± 0.062
HisTyr: 0.957 ± 0.125
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.345 ± 0.211
IleCys: 0.55 ± 0.08
IleAsp: 1.986 ± 0.169
IleGlu: 2.082 ± 0.174
IlePhe: 1.4 ± 0.129
IleGly: 1.89 ± 0.137
IleHis: 0.694 ± 0.082
IleIle: 1.244 ± 0.128
IleLys: 2.166 ± 0.194
IleLeu: 2.728 ± 0.167
IleMet: 0.981 ± 0.142
IleAsn: 1.52 ± 0.166
IlePro: 1.675 ± 0.188
IleGln: 1.065 ± 0.119
IleArg: 1.962 ± 0.153
IleSer: 2.297 ± 0.202
IleThr: 2.202 ± 0.193
IleVal: 2.297 ± 0.164
IleTrp: 0.371 ± 0.059
IleTyr: 1.065 ± 0.119
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.59 ± 0.244
LysCys: 0.885 ± 0.108
LysAsp: 2.489 ± 0.232
LysGlu: 2.058 ± 0.213
LysPhe: 1.113 ± 0.142
LysGly: 2.07 ± 0.182
LysHis: 1.077 ± 0.125
LysIle: 1.268 ± 0.134
LysLys: 3.111 ± 0.384
LysLeu: 4.02 ± 0.269
LysMet: 1.113 ± 0.122
LysAsn: 1.651 ± 0.138
LysPro: 3.075 ± 0.231
LysGln: 1.974 ± 0.172
LysArg: 3.721 ± 0.285
LysSer: 2.489 ± 0.237
LysThr: 2.74 ± 0.203
LysVal: 2.608 ± 0.237
LysTrp: 0.491 ± 0.087
LysTyr: 1.268 ± 0.132
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.658 ± 0.374
LeuCys: 2.333 ± 0.209
LeuAsp: 5.133 ± 0.291
LeuGlu: 4.93 ± 0.31
LeuPhe: 3.051 ± 0.201
LeuGly: 5.301 ± 0.339
LeuHis: 2.214 ± 0.187
LeuIle: 2.884 ± 0.199
LeuLys: 4.068 ± 0.24
LeuLeu: 8.543 ± 0.39
LeuMet: 2.931 ± 0.198
LeuAsn: 3.326 ± 0.232
LeuPro: 5.145 ± 0.284
LeuGln: 3.195 ± 0.254
LeuArg: 5.875 ± 0.233
LeuSer: 6.018 ± 0.317
LeuThr: 5.875 ± 0.267
LeuVal: 5.803 ± 0.263
LeuTrp: 1.089 ± 0.127
LeuTyr: 2.848 ± 0.198
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.561 ± 0.185
MetCys: 0.67 ± 0.082
MetAsp: 1.663 ± 0.151
MetGlu: 1.316 ± 0.139
MetPhe: 0.993 ± 0.116
MetGly: 1.567 ± 0.156
MetHis: 0.61 ± 0.084
MetIle: 0.838 ± 0.102
MetLys: 0.993 ± 0.12
MetLeu: 2.417 ± 0.197
MetMet: 0.885 ± 0.125
MetAsn: 0.921 ± 0.122
MetPro: 1.555 ± 0.146
MetGln: 0.909 ± 0.117
MetArg: 1.663 ± 0.156
MetSer: 1.879 ± 0.187
MetThr: 1.651 ± 0.164
MetVal: 2.07 ± 0.166
MetTrp: 0.359 ± 0.064
MetTyr: 0.742 ± 0.098
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.788 ± 0.224
AsnCys: 0.479 ± 0.075
AsnAsp: 1.508 ± 0.144
AsnGlu: 1.615 ± 0.145
AsnPhe: 1.173 ± 0.141
AsnGly: 2.237 ± 0.173
AsnHis: 0.981 ± 0.109
AsnIle: 1.472 ± 0.136
AsnLys: 1.627 ± 0.178
AsnLeu: 2.991 ± 0.214
AsnMet: 0.873 ± 0.11
AsnAsn: 1.424 ± 0.145
AsnPro: 2.225 ± 0.215
AsnGln: 1.161 ± 0.115
AsnArg: 1.986 ± 0.186
AsnSer: 2.489 ± 0.163
AsnThr: 2.345 ± 0.243
AsnVal: 2.369 ± 0.18
AsnTrp: 0.311 ± 0.065
AsnTyr: 0.933 ± 0.122
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.492 ± 0.344
ProCys: 1.101 ± 0.13
ProAsp: 3.243 ± 0.219
ProGlu: 3.613 ± 0.208
ProPhe: 1.843 ± 0.146
ProGly: 3.147 ± 0.255
ProHis: 1.723 ± 0.151
ProIle: 2.07 ± 0.172
ProLys: 2.549 ± 0.206
ProLeu: 4.559 ± 0.313
ProMet: 1.113 ± 0.125
ProAsn: 2.058 ± 0.203
ProPro: 5.899 ± 0.514
ProGln: 3.087 ± 0.274
ProArg: 3.661 ± 0.226
ProSer: 6.605 ± 0.474
ProThr: 4.906 ± 0.699
ProVal: 4.834 ± 0.325
ProTrp: 0.526 ± 0.087
ProTyr: 1.663 ± 0.126
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.518 ± 0.252
GlnCys: 0.574 ± 0.078
GlnAsp: 1.412 ± 0.127
GlnGlu: 1.819 ± 0.217
GlnPhe: 1.113 ± 0.162
GlnGly: 2.034 ± 0.2
GlnHis: 1.304 ± 0.154
GlnIle: 1.089 ± 0.137
GlnLys: 1.472 ± 0.17
GlnLeu: 3.673 ± 0.305
GlnMet: 1.053 ± 0.102
GlnAsn: 0.945 ± 0.097
GlnPro: 3.231 ± 0.307
GlnGln: 4.487 ± 0.563
GlnArg: 2.931 ± 0.243
GlnSer: 2.441 ± 0.207
GlnThr: 2.178 ± 0.194
GlnVal: 2.345 ± 0.193
GlnTrp: 0.55 ± 0.091
GlnTyr: 0.873 ± 0.125
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.93 ± 0.29
ArgCys: 1.651 ± 0.157
ArgAsp: 3.925 ± 0.264
ArgGlu: 3.96 ± 0.279
ArgPhe: 1.998 ± 0.191
ArgGly: 3.853 ± 0.304
ArgHis: 1.879 ± 0.17
ArgIle: 2.058 ± 0.152
ArgLys: 2.608 ± 0.217
ArgLeu: 6.341 ± 0.358
ArgMet: 1.747 ± 0.175
ArgAsn: 1.759 ± 0.155
ArgPro: 3.769 ± 0.259
ArgGln: 2.632 ± 0.191
ArgArg: 6.688 ± 0.565
ArgSer: 4.846 ± 0.291
ArgThr: 3.685 ± 0.233
ArgVal: 4.714 ± 0.284
ArgTrp: 1.005 ± 0.121
ArgTyr: 1.723 ± 0.17
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.174 ± 0.357
SerCys: 1.627 ± 0.159
SerAsp: 4.618 ± 0.282
SerGlu: 4.642 ± 0.359
SerPhe: 2.022 ± 0.14
SerGly: 4.463 ± 0.28
SerHis: 1.555 ± 0.136
SerIle: 2.393 ± 0.177
SerLys: 2.967 ± 0.273
SerLeu: 5.994 ± 0.242
SerMet: 1.723 ± 0.169
SerAsn: 2.309 ± 0.21
SerPro: 4.726 ± 0.309
SerGln: 2.644 ± 0.223
SerArg: 4.654 ± 0.266
SerSer: 8.902 ± 0.835
SerThr: 5.983 ± 0.532
SerVal: 5.875 ± 0.286
SerTrp: 0.885 ± 0.092
SerTyr: 1.807 ± 0.149
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.114 ± 0.296
ThrCys: 1.173 ± 0.142
ThrAsp: 3.111 ± 0.212
ThrGlu: 3.302 ± 0.408
ThrPhe: 2.393 ± 0.21
ThrGly: 3.613 ± 0.264
ThrHis: 1.687 ± 0.19
ThrIle: 2.453 ± 0.211
ThrLys: 2.357 ± 0.191
ThrLeu: 6.03 ± 0.298
ThrMet: 1.639 ± 0.16
ThrAsn: 2.225 ± 0.24
ThrPro: 5.289 ± 0.79
ThrGln: 1.938 ± 0.21
ThrArg: 3.147 ± 0.209
ThrSer: 4.774 ± 0.371
ThrThr: 7.43 ± 1.207
ThrVal: 5.839 ± 0.341
ThrTrp: 0.885 ± 0.128
ThrTyr: 1.579 ± 0.151
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.545 ± 0.337
ValCys: 2.19 ± 0.237
ValAsp: 4.511 ± 0.235
ValGlu: 4.894 ± 0.252
ValPhe: 2.728 ± 0.225
ValGly: 4.02 ± 0.239
ValHis: 1.902 ± 0.155
ValIle: 2.417 ± 0.198
ValLys: 3.434 ± 0.24
ValLeu: 5.827 ± 0.275
ValMet: 1.879 ± 0.165
ValAsn: 2.513 ± 0.184
ValPro: 5.253 ± 0.3
ValGln: 2.465 ± 0.19
ValArg: 5.324 ± 0.327
ValSer: 5.336 ± 0.3
ValThr: 4.08 ± 0.276
ValVal: 6.114 ± 0.404
ValTrp: 1.077 ± 0.133
ValTyr: 2.261 ± 0.196
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.921 ± 0.104
TrpCys: 0.407 ± 0.062
TrpAsp: 0.694 ± 0.098
TrpGlu: 0.503 ± 0.079
TrpPhe: 0.503 ± 0.079
TrpGly: 0.682 ± 0.096
TrpHis: 0.287 ± 0.045
TrpIle: 0.479 ± 0.08
TrpLys: 0.491 ± 0.075
TrpLeu: 1.28 ± 0.131
TrpMet: 0.491 ± 0.076
TrpAsn: 0.383 ± 0.066
TrpPro: 0.586 ± 0.094
TrpGln: 0.299 ± 0.065
TrpArg: 0.957 ± 0.131
TrpSer: 1.125 ± 0.12
TrpThr: 1.077 ± 0.139
TrpVal: 0.838 ± 0.122
TrpTrp: 0.275 ± 0.069
TrpTyr: 0.514 ± 0.075
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.106 ± 0.163
TyrCys: 0.682 ± 0.088
TyrAsp: 1.783 ± 0.148
TyrGlu: 1.759 ± 0.143
TyrPhe: 0.969 ± 0.115
TyrGly: 2.154 ± 0.168
TyrHis: 0.67 ± 0.095
TyrIle: 0.85 ± 0.099
TyrLys: 1.4 ± 0.125
TyrLeu: 2.214 ± 0.167
TyrMet: 0.885 ± 0.11
TyrAsn: 1.388 ± 0.142
TyrPro: 1.615 ± 0.194
TyrGln: 1.065 ± 0.103
TyrArg: 1.639 ± 0.146
TyrSer: 1.783 ± 0.134
TyrThr: 2.237 ± 0.173
TyrVal: 1.771 ± 0.148
TyrTrp: 0.455 ± 0.082
TyrTyr: 1.292 ± 0.12
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 169 proteins (83578 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
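A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported. This is an illustrative sketch, not the database's actual code; the function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are assumptions.

```python
import random
import statistics

def dipeptide_freq(sequences, pair):
    """Per mille frequency of one dipeptide among all overlapping dipeptides."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_freq(sample, pair))
    return statistics.stdev(estimates)

proteins = ["AAAL", "ALLA", "LLLL", "AALA"]  # hypothetical toy proteome
err = bootstrap_error(proteins, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a correspondingly larger error estimate.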

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski