Amino acid dipepetide frequency for Elephant endotheliotropic herpesvirus 1A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.668AlaAla: 3.668 ± 0.383
1.365AlaCys: 1.365 ± 0.159
2.466AlaAsp: 2.466 ± 0.211
2.099AlaGlu: 2.099 ± 0.22
2.609AlaPhe: 2.609 ± 0.253
2.283AlaGly: 2.283 ± 0.192
1.447AlaHis: 1.447 ± 0.197
2.547AlaIle: 2.547 ± 0.241
1.936AlaLys: 1.936 ± 0.179
4.912AlaLeu: 4.912 ± 0.375
1.467AlaMet: 1.467 ± 0.189
2.201AlaAsn: 2.201 ± 0.184
2.425AlaPro: 2.425 ± 0.316
1.549AlaGln: 1.549 ± 0.177
2.446AlaArg: 2.446 ± 0.275
4.667AlaSer: 4.667 ± 0.384
3.831AlaThr: 3.831 ± 0.255
3.994AlaVal: 3.994 ± 0.38
0.489AlaTrp: 0.489 ± 0.124
1.895AlaTyr: 1.895 ± 0.193
0.0AlaXaa: 0.0 ± 0.0
Cys
1.467CysAla: 1.467 ± 0.183
0.836CysCys: 0.836 ± 0.15
1.406CysAsp: 1.406 ± 0.219
0.795CysGlu: 0.795 ± 0.146
1.447CysPhe: 1.447 ± 0.2
1.08CysGly: 1.08 ± 0.151
0.509CysHis: 0.509 ± 0.093
2.099CysIle: 2.099 ± 0.258
1.488CysLys: 1.488 ± 0.21
2.364CysLeu: 2.364 ± 0.281
1.039CysMet: 1.039 ± 0.153
1.386CysAsn: 1.386 ± 0.18
0.917CysPro: 0.917 ± 0.128
0.795CysGln: 0.795 ± 0.159
1.141CysArg: 1.141 ± 0.135
2.283CysSer: 2.283 ± 0.284
2.038CysThr: 2.038 ± 0.217
2.425CysVal: 2.425 ± 0.227
0.204CysTrp: 0.204 ± 0.068
1.162CysTyr: 1.162 ± 0.169
0.0CysXaa: 0.0 ± 0.0
Asp
2.283AspAla: 2.283 ± 0.243
0.754AspCys: 0.754 ± 0.114
4.219AspAsp: 4.219 ± 0.422
3.403AspGlu: 3.403 ± 0.318
2.201AspPhe: 2.201 ± 0.224
2.975AspGly: 2.975 ± 0.276
1.365AspHis: 1.365 ± 0.159
4.056AspIle: 4.056 ± 0.362
2.629AspLys: 2.629 ± 0.255
4.667AspLeu: 4.667 ± 0.358
2.058AspMet: 2.058 ± 0.21
2.711AspAsn: 2.711 ± 0.293
2.527AspPro: 2.527 ± 0.219
1.406AspGln: 1.406 ± 0.147
2.405AspArg: 2.405 ± 0.258
3.913AspSer: 3.913 ± 0.343
3.893AspThr: 3.893 ± 0.364
3.913AspVal: 3.913 ± 0.324
0.469AspTrp: 0.469 ± 0.094
1.916AspTyr: 1.916 ± 0.209
0.0AspXaa: 0.0 ± 0.0
Glu
2.364GluAla: 2.364 ± 0.273
1.08GluCys: 1.08 ± 0.159
3.465GluAsp: 3.465 ± 0.359
3.179GluGlu: 3.179 ± 0.287
1.977GluPhe: 1.977 ± 0.207
2.018GluGly: 2.018 ± 0.261
1.447GluHis: 1.447 ± 0.208
3.139GluIle: 3.139 ± 0.238
2.751GluLys: 2.751 ± 0.289
4.3GluLeu: 4.3 ± 0.288
0.713GluMet: 0.713 ± 0.14
3.098GluAsn: 3.098 ± 0.253
1.671GluPro: 1.671 ± 0.183
1.773GluGln: 1.773 ± 0.238
2.384GluArg: 2.384 ± 0.249
3.546GluSer: 3.546 ± 0.432
3.546GluThr: 3.546 ± 0.246
2.344GluVal: 2.344 ± 0.2
0.265GluTrp: 0.265 ± 0.058
2.12GluTyr: 2.12 ± 0.175
0.0GluXaa: 0.0 ± 0.0
Phe
2.018PheAla: 2.018 ± 0.197
1.508PheCys: 1.508 ± 0.176
2.446PheAsp: 2.446 ± 0.22
1.753PheGlu: 1.753 ± 0.256
2.405PhePhe: 2.405 ± 0.229
2.201PheGly: 2.201 ± 0.26
1.223PheHis: 1.223 ± 0.146
3.75PheIle: 3.75 ± 0.265
2.466PheLys: 2.466 ± 0.266
5.604PheLeu: 5.604 ± 0.519
1.508PheMet: 1.508 ± 0.229
2.507PheAsn: 2.507 ± 0.285
1.895PhePro: 1.895 ± 0.174
1.202PheGln: 1.202 ± 0.161
1.895PheArg: 1.895 ± 0.169
4.035PheSer: 4.035 ± 0.325
3.179PheThr: 3.179 ± 0.267
3.2PheVal: 3.2 ± 0.28
0.387PheTrp: 0.387 ± 0.087
2.242PheTyr: 2.242 ± 0.268
0.0PheXaa: 0.0 ± 0.0
Gly
2.69GlyAla: 2.69 ± 0.313
1.039GlyCys: 1.039 ± 0.169
2.547GlyAsp: 2.547 ± 0.319
2.14GlyGlu: 2.14 ± 0.236
2.12GlyPhe: 2.12 ± 0.24
3.505GlyGly: 3.505 ± 0.499
1.467GlyHis: 1.467 ± 0.205
3.2GlyIle: 3.2 ± 0.245
1.773GlyLys: 1.773 ± 0.174
4.606GlyLeu: 4.606 ± 0.347
1.182GlyMet: 1.182 ± 0.141
2.425GlyAsn: 2.425 ± 0.205
2.384GlyPro: 2.384 ± 0.302
1.447GlyGln: 1.447 ± 0.231
2.323GlyArg: 2.323 ± 0.303
4.28GlySer: 4.28 ± 0.527
3.668GlyThr: 3.668 ± 0.27
3.037GlyVal: 3.037 ± 0.219
0.285GlyTrp: 0.285 ± 0.061
1.488GlyTyr: 1.488 ± 0.213
0.0GlyXaa: 0.0 ± 0.0
His
1.63HisAla: 1.63 ± 0.204
0.632HisCys: 0.632 ± 0.101
1.325HisAsp: 1.325 ± 0.147
1.304HisGlu: 1.304 ± 0.181
1.162HisPhe: 1.162 ± 0.13
1.773HisGly: 1.773 ± 0.246
1.06HisHis: 1.06 ± 0.195
1.753HisIle: 1.753 ± 0.168
1.569HisLys: 1.569 ± 0.18
2.507HisLeu: 2.507 ± 0.242
0.815HisMet: 0.815 ± 0.117
1.162HisAsn: 1.162 ± 0.152
1.141HisPro: 1.141 ± 0.146
0.917HisGln: 0.917 ± 0.149
1.671HisArg: 1.671 ± 0.211
1.753HisSer: 1.753 ± 0.208
1.773HisThr: 1.773 ± 0.174
2.547HisVal: 2.547 ± 0.288
0.183HisTrp: 0.183 ± 0.061
0.937HisTyr: 0.937 ± 0.143
0.0HisXaa: 0.0 ± 0.0
Ile
3.098IleAla: 3.098 ± 0.257
1.997IleCys: 1.997 ± 0.224
3.383IleAsp: 3.383 ± 0.261
2.67IleGlu: 2.67 ± 0.227
3.566IlePhe: 3.566 ± 0.278
2.466IleGly: 2.466 ± 0.231
1.61IleHis: 1.61 ± 0.17
4.769IleIle: 4.769 ± 0.388
2.996IleLys: 2.996 ± 0.276
7.765IleLeu: 7.765 ± 0.541
1.406IleMet: 1.406 ± 0.18
3.424IleAsn: 3.424 ± 0.269
2.894IlePro: 2.894 ± 0.278
2.018IleGln: 2.018 ± 0.217
2.935IleArg: 2.935 ± 0.254
5.441IleSer: 5.441 ± 0.319
4.687IleThr: 4.687 ± 0.359
4.178IleVal: 4.178 ± 0.344
0.693IleTrp: 0.693 ± 0.136
3.444IleTyr: 3.444 ± 0.375
0.0IleXaa: 0.0 ± 0.0
Lys
1.59LysAla: 1.59 ± 0.163
1.223LysCys: 1.223 ± 0.192
2.303LysAsp: 2.303 ± 0.227
2.405LysGlu: 2.405 ± 0.255
1.569LysPhe: 1.569 ± 0.184
1.63LysGly: 1.63 ± 0.194
2.181LysHis: 2.181 ± 0.194
3.546LysIle: 3.546 ± 0.3
3.424LysLys: 3.424 ± 0.35
4.524LysLeu: 4.524 ± 0.326
0.999LysMet: 0.999 ± 0.128
3.383LysAsn: 3.383 ± 0.307
2.201LysPro: 2.201 ± 0.258
1.855LysGln: 1.855 ± 0.228
3.139LysArg: 3.139 ± 0.312
3.852LysSer: 3.852 ± 0.299
3.668LysThr: 3.668 ± 0.266
2.853LysVal: 2.853 ± 0.253
0.53LysTrp: 0.53 ± 0.107
2.181LysTyr: 2.181 ± 0.199
0.0LysXaa: 0.0 ± 0.0
Leu
4.443LeuAla: 4.443 ± 0.329
3.342LeuCys: 3.342 ± 0.359
4.728LeuAsp: 4.728 ± 0.334
4.382LeuGlu: 4.382 ± 0.301
5.788LeuPhe: 5.788 ± 0.538
3.709LeuGly: 3.709 ± 0.25
2.466LeuHis: 2.466 ± 0.219
6.073LeuIle: 6.073 ± 0.541
4.932LeuLys: 4.932 ± 0.342
10.903LeuLeu: 10.903 ± 0.752
2.283LeuMet: 2.283 ± 0.259
5.462LeuAsn: 5.462 ± 0.362
4.117LeuPro: 4.117 ± 0.338
3.689LeuGln: 3.689 ± 0.347
4.769LeuArg: 4.769 ± 0.391
7.683LeuSer: 7.683 ± 0.489
6.725LeuThr: 6.725 ± 0.468
5.706LeuVal: 5.706 ± 0.432
1.141LeuTrp: 1.141 ± 0.16
4.973LeuTyr: 4.973 ± 0.45
0.0LeuXaa: 0.0 ± 0.0
Met
1.59MetAla: 1.59 ± 0.151
0.937MetCys: 0.937 ± 0.135
1.325MetAsp: 1.325 ± 0.167
1.08MetGlu: 1.08 ± 0.164
1.671MetPhe: 1.671 ± 0.204
1.019MetGly: 1.019 ± 0.174
0.571MetHis: 0.571 ± 0.112
1.61MetIle: 1.61 ± 0.18
1.243MetLys: 1.243 ± 0.175
2.486MetLeu: 2.486 ± 0.266
0.958MetMet: 0.958 ± 0.119
1.06MetAsn: 1.06 ± 0.155
0.673MetPro: 0.673 ± 0.104
0.836MetGln: 0.836 ± 0.144
1.121MetArg: 1.121 ± 0.168
2.16MetSer: 2.16 ± 0.211
1.773MetThr: 1.773 ± 0.189
1.365MetVal: 1.365 ± 0.147
0.346MetTrp: 0.346 ± 0.088
1.365MetTyr: 1.365 ± 0.214
0.0MetXaa: 0.0 ± 0.0
Asn
2.853AsnAla: 2.853 ± 0.286
1.202AsnCys: 1.202 ± 0.171
2.833AsnAsp: 2.833 ± 0.257
2.609AsnGlu: 2.609 ± 0.264
2.242AsnPhe: 2.242 ± 0.251
2.751AsnGly: 2.751 ± 0.25
1.121AsnHis: 1.121 ± 0.142
4.524AsnIle: 4.524 ± 0.339
2.67AsnLys: 2.67 ± 0.283
4.341AsnLeu: 4.341 ± 0.268
1.488AsnMet: 1.488 ± 0.201
3.526AsnAsn: 3.526 ± 0.374
2.201AsnPro: 2.201 ± 0.253
1.569AsnGln: 1.569 ± 0.216
2.405AsnArg: 2.405 ± 0.239
3.118AsnSer: 3.118 ± 0.273
5.013AsnThr: 5.013 ± 0.414
4.545AsnVal: 4.545 ± 0.32
0.387AsnTrp: 0.387 ± 0.091
2.079AsnTyr: 2.079 ± 0.264
0.0AsnXaa: 0.0 ± 0.0
Pro
2.364ProAla: 2.364 ± 0.244
1.06ProCys: 1.06 ± 0.153
2.262ProAsp: 2.262 ± 0.19
2.466ProGlu: 2.466 ± 0.319
1.855ProPhe: 1.855 ± 0.187
1.814ProGly: 1.814 ± 0.214
1.202ProHis: 1.202 ± 0.153
2.731ProIle: 2.731 ± 0.219
1.814ProLys: 1.814 ± 0.199
4.259ProLeu: 4.259 ± 0.342
0.937ProMet: 0.937 ± 0.138
2.16ProAsn: 2.16 ± 0.221
4.402ProPro: 4.402 ± 0.707
1.936ProGln: 1.936 ± 0.297
2.568ProArg: 2.568 ± 0.337
3.852ProSer: 3.852 ± 0.488
3.098ProThr: 3.098 ± 0.264
3.444ProVal: 3.444 ± 0.273
0.428ProTrp: 0.428 ± 0.092
2.099ProTyr: 2.099 ± 0.198
0.0ProXaa: 0.0 ± 0.0
Gln
1.732GlnAla: 1.732 ± 0.237
0.856GlnCys: 0.856 ± 0.157
1.732GlnAsp: 1.732 ± 0.257
1.508GlnGlu: 1.508 ± 0.17
1.264GlnPhe: 1.264 ± 0.126
1.467GlnGly: 1.467 ± 0.218
1.386GlnHis: 1.386 ± 0.147
1.855GlnIle: 1.855 ± 0.204
1.895GlnLys: 1.895 ± 0.203
2.975GlnLeu: 2.975 ± 0.253
0.897GlnMet: 0.897 ± 0.156
1.773GlnAsn: 1.773 ± 0.171
1.936GlnPro: 1.936 ± 0.387
2.323GlnGln: 2.323 ± 0.479
1.814GlnArg: 1.814 ± 0.34
2.772GlnSer: 2.772 ± 0.303
2.588GlnThr: 2.588 ± 0.343
1.753GlnVal: 1.753 ± 0.21
0.245GlnTrp: 0.245 ± 0.062
1.182GlnTyr: 1.182 ± 0.154
0.0GlnXaa: 0.0 ± 0.0
Arg
2.14ArgAla: 2.14 ± 0.283
1.202ArgCys: 1.202 ± 0.172
3.2ArgAsp: 3.2 ± 0.272
2.649ArgGlu: 2.649 ± 0.339
2.12ArgPhe: 2.12 ± 0.229
2.527ArgGly: 2.527 ± 0.368
1.671ArgHis: 1.671 ± 0.213
2.221ArgIle: 2.221 ± 0.238
2.853ArgLys: 2.853 ± 0.333
4.484ArgLeu: 4.484 ± 0.337
0.999ArgMet: 0.999 ± 0.144
2.466ArgAsn: 2.466 ± 0.256
2.792ArgPro: 2.792 ± 0.379
2.099ArgGln: 2.099 ± 0.372
4.056ArgArg: 4.056 ± 0.454
4.198ArgSer: 4.198 ± 0.392
2.955ArgThr: 2.955 ± 0.254
2.975ArgVal: 2.975 ± 0.255
0.469ArgTrp: 0.469 ± 0.103
2.466ArgTyr: 2.466 ± 0.237
0.0ArgXaa: 0.0 ± 0.0
Ser
4.83SerAla: 4.83 ± 0.414
2.384SerCys: 2.384 ± 0.247
4.361SerAsp: 4.361 ± 0.426
4.035SerGlu: 4.035 ± 0.4
3.322SerPhe: 3.322 ± 0.283
4.749SerGly: 4.749 ± 0.516
1.977SerHis: 1.977 ± 0.179
5.034SerIle: 5.034 ± 0.31
3.383SerLys: 3.383 ± 0.279
7.398SerLeu: 7.398 ± 0.459
1.569SerMet: 1.569 ± 0.165
4.504SerAsn: 4.504 ± 0.289
3.566SerPro: 3.566 ± 0.326
2.344SerGln: 2.344 ± 0.3
4.484SerArg: 4.484 ± 0.409
10.088SerSer: 10.088 ± 1.269
6.094SerThr: 6.094 ± 0.864
6.094SerVal: 6.094 ± 0.377
0.632SerTrp: 0.632 ± 0.121
2.955SerTyr: 2.955 ± 0.221
0.0SerXaa: 0.0 ± 0.0
Thr
3.913ThrAla: 3.913 ± 0.339
2.221ThrCys: 2.221 ± 0.226
3.587ThrAsp: 3.587 ± 0.263
3.098ThrGlu: 3.098 ± 0.23
3.424ThrPhe: 3.424 ± 0.353
4.117ThrGly: 4.117 ± 0.333
1.916ThrHis: 1.916 ± 0.169
4.239ThrIle: 4.239 ± 0.296
2.894ThrLys: 2.894 ± 0.231
7.214ThrLeu: 7.214 ± 0.517
1.569ThrMet: 1.569 ± 0.171
3.73ThrAsn: 3.73 ± 0.365
4.035ThrPro: 4.035 ± 0.348
2.772ThrGln: 2.772 ± 0.355
3.342ThrArg: 3.342 ± 0.3
6.318ThrSer: 6.318 ± 0.618
6.175ThrThr: 6.175 ± 0.768
5.34ThrVal: 5.34 ± 0.249
1.039ThrTrp: 1.039 ± 0.17
2.792ThrTyr: 2.792 ± 0.346
0.0ThrXaa: 0.0 ± 0.0
Val
3.465ValAla: 3.465 ± 0.233
1.936ValCys: 1.936 ± 0.215
3.566ValAsp: 3.566 ± 0.261
3.22ValGlu: 3.22 ± 0.253
4.239ValPhe: 4.239 ± 0.306
2.996ValGly: 2.996 ± 0.234
1.651ValHis: 1.651 ± 0.172
4.28ValIle: 4.28 ± 0.299
3.383ValLys: 3.383 ± 0.304
6.522ValLeu: 6.522 ± 0.519
1.549ValMet: 1.549 ± 0.209
3.363ValAsn: 3.363 ± 0.238
3.383ValPro: 3.383 ± 0.295
1.936ValGln: 1.936 ± 0.165
3.098ValArg: 3.098 ± 0.23
6.277ValSer: 6.277 ± 0.388
5.788ValThr: 5.788 ± 0.337
4.219ValVal: 4.219 ± 0.348
0.571ValTrp: 0.571 ± 0.136
3.016ValTyr: 3.016 ± 0.294
0.0ValXaa: 0.0 ± 0.0
Trp
0.306TrpAla: 0.306 ± 0.078
0.265TrpCys: 0.265 ± 0.088
0.285TrpAsp: 0.285 ± 0.087
0.428TrpGlu: 0.428 ± 0.093
0.591TrpPhe: 0.591 ± 0.121
0.408TrpGly: 0.408 ± 0.103
0.285TrpHis: 0.285 ± 0.079
0.632TrpIle: 0.632 ± 0.108
0.509TrpLys: 0.509 ± 0.113
1.121TrpLeu: 1.121 ± 0.189
0.326TrpMet: 0.326 ± 0.081
0.693TrpAsn: 0.693 ± 0.117
0.285TrpPro: 0.285 ± 0.079
0.367TrpGln: 0.367 ± 0.09
0.346TrpArg: 0.346 ± 0.083
0.734TrpSer: 0.734 ± 0.106
0.611TrpThr: 0.611 ± 0.107
0.632TrpVal: 0.632 ± 0.107
0.102TrpTrp: 0.102 ± 0.055
0.346TrpTyr: 0.346 ± 0.079
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.855TyrAla: 1.855 ± 0.179
1.039TyrCys: 1.039 ± 0.156
2.323TyrAsp: 2.323 ± 0.231
1.956TyrGlu: 1.956 ± 0.19
1.855TyrPhe: 1.855 ± 0.243
2.14TyrGly: 2.14 ± 0.19
1.06TyrHis: 1.06 ± 0.167
3.383TyrIle: 3.383 ± 0.38
2.446TyrLys: 2.446 ± 0.282
4.361TyrLeu: 4.361 ± 0.363
1.406TyrMet: 1.406 ± 0.207
2.446TyrAsn: 2.446 ± 0.288
1.304TyrPro: 1.304 ± 0.209
1.121TyrGln: 1.121 ± 0.154
2.201TyrArg: 2.201 ± 0.266
2.833TyrSer: 2.833 ± 0.257
2.629TyrThr: 2.629 ± 0.26
3.893TyrVal: 3.893 ± 0.333
0.408TyrTrp: 0.408 ± 0.095
1.59TyrTyr: 1.59 ± 0.179
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 120 proteins (49069 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski