Amino acid dipepetide frequency for Equine herpesvirus 1 (strain Ab4p) (EHV-1) (Equine abortion virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.039AlaAla: 13.039 ± 1.429
1.887AlaCys: 1.887 ± 0.222
4.272AlaAsp: 4.272 ± 0.402
4.893AlaGlu: 4.893 ± 0.404
3.477AlaPhe: 3.477 ± 0.316
5.613AlaGly: 5.613 ± 0.408
1.614AlaHis: 1.614 ± 0.195
3.974AlaIle: 3.974 ± 0.318
3.402AlaLys: 3.402 ± 0.549
9.164AlaLeu: 9.164 ± 0.525
2.31AlaMet: 2.31 ± 0.243
2.732AlaAsn: 2.732 ± 0.204
6.557AlaPro: 6.557 ± 0.741
3.229AlaGln: 3.229 ± 0.309
6.979AlaArg: 6.979 ± 0.382
8.643AlaSer: 8.643 ± 0.502
6.581AlaThr: 6.581 ± 0.707
7.55AlaVal: 7.55 ± 0.681
1.242AlaTrp: 1.242 ± 0.207
2.707AlaTyr: 2.707 ± 0.273
0.0AlaXaa: 0.0 ± 0.0
Cys
1.515CysAla: 1.515 ± 0.321
0.546CysCys: 0.546 ± 0.108
1.291CysAsp: 1.291 ± 0.18
1.043CysGlu: 1.043 ± 0.182
0.844CysPhe: 0.844 ± 0.154
1.391CysGly: 1.391 ± 0.192
0.497CysHis: 0.497 ± 0.096
0.745CysIle: 0.745 ± 0.134
0.522CysLys: 0.522 ± 0.101
1.714CysLeu: 1.714 ± 0.215
0.447CysMet: 0.447 ± 0.116
0.844CysAsn: 0.844 ± 0.171
1.192CysPro: 1.192 ± 0.191
0.472CysGln: 0.472 ± 0.119
1.142CysArg: 1.142 ± 0.178
1.267CysSer: 1.267 ± 0.195
1.192CysThr: 1.192 ± 0.206
1.49CysVal: 1.49 ± 0.189
0.248CysTrp: 0.248 ± 0.074
0.546CysTyr: 0.546 ± 0.117
0.0CysXaa: 0.0 ± 0.0
Asp
5.29AspAla: 5.29 ± 0.412
0.919AspCys: 0.919 ± 0.194
3.353AspAsp: 3.353 ± 0.399
3.849AspGlu: 3.849 ± 0.322
1.714AspPhe: 1.714 ± 0.186
3.75AspGly: 3.75 ± 0.286
1.043AspHis: 1.043 ± 0.169
1.863AspIle: 1.863 ± 0.216
1.093AspLys: 1.093 ± 0.24
4.744AspLeu: 4.744 ± 0.338
1.54AspMet: 1.54 ± 0.194
1.589AspAsn: 1.589 ± 0.183
3.427AspPro: 3.427 ± 0.293
1.863AspGln: 1.863 ± 0.249
2.707AspArg: 2.707 ± 0.245
4.545AspSer: 4.545 ± 0.448
3.154AspThr: 3.154 ± 0.256
3.974AspVal: 3.974 ± 0.396
0.72AspTrp: 0.72 ± 0.142
1.763AspTyr: 1.763 ± 0.239
0.0AspXaa: 0.0 ± 0.0
Glu
5.513GluAla: 5.513 ± 0.531
1.068GluCys: 1.068 ± 0.21
3.303GluAsp: 3.303 ± 0.304
4.197GluGlu: 4.197 ± 0.495
2.012GluPhe: 2.012 ± 0.174
2.931GluGly: 2.931 ± 0.348
1.267GluHis: 1.267 ± 0.179
2.335GluIle: 2.335 ± 0.255
1.44GluLys: 1.44 ± 0.182
5.389GluLeu: 5.389 ± 0.405
1.366GluMet: 1.366 ± 0.162
2.086GluAsn: 2.086 ± 0.234
2.831GluPro: 2.831 ± 0.305
1.291GluGln: 1.291 ± 0.191
4.073GluArg: 4.073 ± 0.289
3.949GluSer: 3.949 ± 0.376
2.98GluThr: 2.98 ± 0.293
3.924GluVal: 3.924 ± 0.308
0.248GluTrp: 0.248 ± 0.075
1.912GluTyr: 1.912 ± 0.236
0.0GluXaa: 0.0 ± 0.0
Phe
3.303PheAla: 3.303 ± 0.29
0.869PheCys: 0.869 ± 0.167
2.335PheAsp: 2.335 ± 0.29
1.987PheGlu: 1.987 ± 0.315
1.689PhePhe: 1.689 ± 0.224
2.558PheGly: 2.558 ± 0.277
0.72PheHis: 0.72 ± 0.161
1.614PheIle: 1.614 ± 0.227
1.887PheLys: 1.887 ± 0.269
3.204PheLeu: 3.204 ± 0.319
0.944PheMet: 0.944 ± 0.181
1.416PheAsn: 1.416 ± 0.248
1.838PhePro: 1.838 ± 0.218
0.77PheGln: 0.77 ± 0.11
1.565PheArg: 1.565 ± 0.179
2.98PheSer: 2.98 ± 0.26
2.335PheThr: 2.335 ± 0.221
3.055PheVal: 3.055 ± 0.308
0.397PheTrp: 0.397 ± 0.112
1.167PheTyr: 1.167 ± 0.182
0.0PheXaa: 0.0 ± 0.0
Gly
6.755GlyAla: 6.755 ± 0.518
1.118GlyCys: 1.118 ± 0.186
3.452GlyAsp: 3.452 ± 0.293
4.048GlyGlu: 4.048 ± 0.375
2.608GlyPhe: 2.608 ± 0.28
5.439GlyGly: 5.439 ± 0.556
1.242GlyHis: 1.242 ± 0.179
2.906GlyIle: 2.906 ± 0.323
1.962GlyLys: 1.962 ± 0.227
5.563GlyLeu: 5.563 ± 0.456
1.167GlyMet: 1.167 ± 0.162
2.136GlyAsn: 2.136 ± 0.265
3.104GlyPro: 3.104 ± 0.322
2.037GlyGln: 2.037 ± 0.206
4.595GlyArg: 4.595 ± 0.336
5.066GlySer: 5.066 ± 0.402
3.154GlyThr: 3.154 ± 0.298
4.793GlyVal: 4.793 ± 0.361
0.546GlyTrp: 0.546 ± 0.137
1.714GlyTyr: 1.714 ± 0.192
0.0GlyXaa: 0.0 ± 0.0
His
2.037HisAla: 2.037 ± 0.255
0.323HisCys: 0.323 ± 0.096
0.993HisAsp: 0.993 ± 0.167
0.77HisGlu: 0.77 ± 0.153
0.795HisPhe: 0.795 ± 0.144
1.763HisGly: 1.763 ± 0.197
0.497HisHis: 0.497 ± 0.151
0.82HisIle: 0.82 ± 0.145
0.72HisLys: 0.72 ± 0.137
2.285HisLeu: 2.285 ± 0.276
0.447HisMet: 0.447 ± 0.092
0.621HisAsn: 0.621 ± 0.115
1.49HisPro: 1.49 ± 0.169
0.82HisGln: 0.82 ± 0.138
1.614HisArg: 1.614 ± 0.165
1.664HisSer: 1.664 ± 0.201
1.465HisThr: 1.465 ± 0.204
1.54HisVal: 1.54 ± 0.232
0.05HisTrp: 0.05 ± 0.044
0.77HisTyr: 0.77 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
3.402IleAla: 3.402 ± 0.309
0.621IleCys: 0.621 ± 0.119
2.21IleAsp: 2.21 ± 0.228
2.26IleGlu: 2.26 ± 0.242
1.515IlePhe: 1.515 ± 0.179
2.757IleGly: 2.757 ± 0.269
0.695IleHis: 0.695 ± 0.106
2.012IleIle: 2.012 ± 0.239
1.291IleLys: 1.291 ± 0.172
3.8IleLeu: 3.8 ± 0.316
0.869IleMet: 0.869 ± 0.195
1.962IleAsn: 1.962 ± 0.193
2.757IlePro: 2.757 ± 0.295
1.565IleGln: 1.565 ± 0.229
2.26IleArg: 2.26 ± 0.241
3.328IleSer: 3.328 ± 0.273
3.179IleThr: 3.179 ± 0.261
3.03IleVal: 3.03 ± 0.283
0.224IleTrp: 0.224 ± 0.071
1.54IleTyr: 1.54 ± 0.189
0.0IleXaa: 0.0 ± 0.0
Lys
2.757LysAla: 2.757 ± 0.351
0.447LysCys: 0.447 ± 0.113
1.366LysAsp: 1.366 ± 0.29
1.291LysGlu: 1.291 ± 0.176
1.267LysPhe: 1.267 ± 0.19
1.565LysGly: 1.565 ± 0.221
1.043LysHis: 1.043 ± 0.151
1.242LysIle: 1.242 ± 0.192
1.068LysLys: 1.068 ± 0.18
3.179LysLeu: 3.179 ± 0.356
0.894LysMet: 0.894 ± 0.139
1.118LysAsn: 1.118 ± 0.215
1.689LysPro: 1.689 ± 0.283
1.639LysGln: 1.639 ± 0.262
3.204LysArg: 3.204 ± 0.33
2.558LysSer: 2.558 ± 0.512
2.558LysThr: 2.558 ± 0.328
1.689LysVal: 1.689 ± 0.248
0.397LysTrp: 0.397 ± 0.086
1.068LysTyr: 1.068 ± 0.144
0.0LysXaa: 0.0 ± 0.0
Leu
9.313LeuAla: 9.313 ± 0.572
1.763LeuCys: 1.763 ± 0.259
4.793LeuAsp: 4.793 ± 0.432
5.042LeuGlu: 5.042 ± 0.332
4.023LeuPhe: 4.023 ± 0.33
5.687LeuGly: 5.687 ± 0.438
2.061LeuHis: 2.061 ± 0.255
3.974LeuIle: 3.974 ± 0.357
2.806LeuLys: 2.806 ± 0.296
9.611LeuLeu: 9.611 ± 0.559
2.359LeuMet: 2.359 ± 0.264
3.427LeuAsn: 3.427 ± 0.387
5.638LeuPro: 5.638 ± 0.387
3.08LeuGln: 3.08 ± 0.245
6.209LeuArg: 6.209 ± 0.29
7.177LeuSer: 7.177 ± 0.386
5.24LeuThr: 5.24 ± 0.385
6.929LeuVal: 6.929 ± 0.512
0.894LeuTrp: 0.894 ± 0.16
2.881LeuTyr: 2.881 ± 0.292
0.0LeuXaa: 0.0 ± 0.0
Met
2.682MetAla: 2.682 ± 0.281
0.447MetCys: 0.447 ± 0.089
1.316MetAsp: 1.316 ± 0.151
1.267MetGlu: 1.267 ± 0.166
0.869MetPhe: 0.869 ± 0.147
1.416MetGly: 1.416 ± 0.197
0.397MetHis: 0.397 ± 0.106
0.671MetIle: 0.671 ± 0.117
0.596MetLys: 0.596 ± 0.104
2.484MetLeu: 2.484 ± 0.208
0.571MetMet: 0.571 ± 0.119
0.671MetAsn: 0.671 ± 0.115
1.192MetPro: 1.192 ± 0.191
0.422MetGln: 0.422 ± 0.104
1.664MetArg: 1.664 ± 0.225
1.738MetSer: 1.738 ± 0.157
0.869MetThr: 0.869 ± 0.154
1.242MetVal: 1.242 ± 0.182
0.149MetTrp: 0.149 ± 0.091
0.82MetTyr: 0.82 ± 0.192
0.0MetXaa: 0.0 ± 0.0
Asn
3.129AsnAla: 3.129 ± 0.313
0.621AsnCys: 0.621 ± 0.132
1.54AsnAsp: 1.54 ± 0.179
1.465AsnGlu: 1.465 ± 0.219
1.54AsnPhe: 1.54 ± 0.238
2.508AsnGly: 2.508 ± 0.367
0.745AsnHis: 0.745 ± 0.153
1.689AsnIle: 1.689 ± 0.173
1.242AsnLys: 1.242 ± 0.167
3.303AsnLeu: 3.303 ± 0.275
0.745AsnMet: 0.745 ± 0.131
1.565AsnAsn: 1.565 ± 0.219
2.657AsnPro: 2.657 ± 0.295
0.993AsnGln: 0.993 ± 0.155
1.788AsnArg: 1.788 ± 0.187
3.402AsnSer: 3.402 ± 0.364
2.31AsnThr: 2.31 ± 0.251
2.235AsnVal: 2.235 ± 0.25
0.373AsnTrp: 0.373 ± 0.103
1.118AsnTyr: 1.118 ± 0.195
0.0AsnXaa: 0.0 ± 0.0
Pro
5.985ProAla: 5.985 ± 0.682
1.068ProCys: 1.068 ± 0.156
3.204ProAsp: 3.204 ± 0.249
3.676ProGlu: 3.676 ± 0.33
1.589ProPhe: 1.589 ± 0.197
4.247ProGly: 4.247 ± 0.501
1.465ProHis: 1.465 ± 0.19
2.732ProIle: 2.732 ± 0.225
2.434ProLys: 2.434 ± 0.428
4.868ProLeu: 4.868 ± 0.407
1.217ProMet: 1.217 ± 0.19
1.937ProAsn: 1.937 ± 0.248
6.979ProPro: 6.979 ± 0.767
2.806ProGln: 2.806 ± 0.375
4.421ProArg: 4.421 ± 0.556
6.06ProSer: 6.06 ± 0.493
4.768ProThr: 4.768 ± 0.584
4.421ProVal: 4.421 ± 0.331
0.447ProTrp: 0.447 ± 0.119
1.291ProTyr: 1.291 ± 0.184
0.0ProXaa: 0.0 ± 0.0
Gln
3.626GlnAla: 3.626 ± 0.451
0.497GlnCys: 0.497 ± 0.114
1.217GlnAsp: 1.217 ± 0.172
1.49GlnGlu: 1.49 ± 0.174
1.316GlnPhe: 1.316 ± 0.162
1.416GlnGly: 1.416 ± 0.197
0.844GlnHis: 0.844 ± 0.122
1.465GlnIle: 1.465 ± 0.154
1.217GlnLys: 1.217 ± 0.189
2.906GlnLeu: 2.906 ± 0.266
0.596GlnMet: 0.596 ± 0.124
1.217GlnAsn: 1.217 ± 0.194
2.409GlnPro: 2.409 ± 0.321
1.565GlnGln: 1.565 ± 0.204
3.104GlnArg: 3.104 ± 0.307
2.657GlnSer: 2.657 ± 0.326
2.484GlnThr: 2.484 ± 0.25
1.44GlnVal: 1.44 ± 0.173
0.373GlnTrp: 0.373 ± 0.088
0.919GlnTyr: 0.919 ± 0.167
0.0GlnXaa: 0.0 ± 0.0
Arg
6.457ArgAla: 6.457 ± 0.458
1.44ArgCys: 1.44 ± 0.24
3.551ArgAsp: 3.551 ± 0.343
3.502ArgGlu: 3.502 ± 0.324
2.26ArgPhe: 2.26 ± 0.271
4.57ArgGly: 4.57 ± 0.428
1.589ArgHis: 1.589 ± 0.198
2.21ArgIle: 2.21 ± 0.256
2.285ArgLys: 2.285 ± 0.207
7.028ArgLeu: 7.028 ± 0.488
0.919ArgMet: 0.919 ± 0.161
2.434ArgAsn: 2.434 ± 0.227
4.595ArgPro: 4.595 ± 0.368
2.161ArgGln: 2.161 ± 0.286
7.227ArgArg: 7.227 ± 0.69
5.017ArgSer: 5.017 ± 0.422
3.7ArgThr: 3.7 ± 0.318
5.017ArgVal: 5.017 ± 0.326
0.695ArgTrp: 0.695 ± 0.12
1.614ArgTyr: 1.614 ± 0.247
0.0ArgXaa: 0.0 ± 0.0
Ser
8.146SerAla: 8.146 ± 0.454
1.589SerCys: 1.589 ± 0.23
4.868SerAsp: 4.868 ± 0.494
4.545SerGlu: 4.545 ± 0.377
2.409SerPhe: 2.409 ± 0.254
5.911SerGly: 5.911 ± 0.467
1.714SerHis: 1.714 ± 0.223
3.229SerIle: 3.229 ± 0.274
2.608SerLys: 2.608 ± 0.281
6.73SerLeu: 6.73 ± 0.5
1.589SerMet: 1.589 ± 0.221
2.732SerAsn: 2.732 ± 0.28
5.315SerPro: 5.315 ± 0.457
2.508SerGln: 2.508 ± 0.318
4.868SerArg: 4.868 ± 0.422
9.09SerSer: 9.09 ± 0.82
7.277SerThr: 7.277 ± 1.061
6.333SerVal: 6.333 ± 0.443
1.018SerTrp: 1.018 ± 0.165
2.061SerTyr: 2.061 ± 0.217
0.0SerXaa: 0.0 ± 0.0
Thr
6.383ThrAla: 6.383 ± 0.778
1.341ThrCys: 1.341 ± 0.179
2.856ThrAsp: 2.856 ± 0.331
3.303ThrGlu: 3.303 ± 0.28
2.186ThrPhe: 2.186 ± 0.253
3.502ThrGly: 3.502 ± 0.295
1.838ThrHis: 1.838 ± 0.203
2.732ThrIle: 2.732 ± 0.197
2.285ThrLys: 2.285 ± 0.225
6.234ThrLeu: 6.234 ± 0.372
1.043ThrMet: 1.043 ± 0.174
2.26ThrAsn: 2.26 ± 0.303
5.513ThrPro: 5.513 ± 0.443
2.732ThrGln: 2.732 ± 0.269
3.949ThrArg: 3.949 ± 0.259
5.439ThrSer: 5.439 ± 0.883
6.432ThrThr: 6.432 ± 2.138
4.619ThrVal: 4.619 ± 0.272
0.72ThrTrp: 0.72 ± 0.143
1.763ThrTyr: 1.763 ± 0.215
0.0ThrXaa: 0.0 ± 0.0
Val
7.177ValAla: 7.177 ± 0.522
1.714ValCys: 1.714 ± 0.256
4.47ValAsp: 4.47 ± 0.304
3.75ValGlu: 3.75 ± 0.346
2.955ValPhe: 2.955 ± 0.345
3.924ValGly: 3.924 ± 0.312
1.341ValHis: 1.341 ± 0.155
3.104ValIle: 3.104 ± 0.325
1.664ValLys: 1.664 ± 0.24
7.153ValLeu: 7.153 ± 0.47
1.291ValMet: 1.291 ± 0.25
2.782ValAsn: 2.782 ± 0.324
4.545ValPro: 4.545 ± 0.425
1.54ValGln: 1.54 ± 0.201
4.098ValArg: 4.098 ± 0.326
6.631ValSer: 6.631 ± 0.435
4.197ValThr: 4.197 ± 0.277
5.315ValVal: 5.315 ± 0.342
0.745ValTrp: 0.745 ± 0.135
3.08ValTyr: 3.08 ± 0.383
0.0ValXaa: 0.0 ± 0.0
Trp
0.844TrpAla: 0.844 ± 0.121
0.124TrpCys: 0.124 ± 0.064
0.646TrpAsp: 0.646 ± 0.103
0.522TrpGlu: 0.522 ± 0.09
0.472TrpPhe: 0.472 ± 0.133
0.596TrpGly: 0.596 ± 0.109
0.248TrpHis: 0.248 ± 0.068
0.323TrpIle: 0.323 ± 0.099
0.447TrpLys: 0.447 ± 0.112
0.844TrpLeu: 0.844 ± 0.129
0.224TrpMet: 0.224 ± 0.059
0.348TrpAsn: 0.348 ± 0.088
0.522TrpPro: 0.522 ± 0.152
0.522TrpGln: 0.522 ± 0.116
0.795TrpArg: 0.795 ± 0.139
0.671TrpSer: 0.671 ± 0.129
0.745TrpThr: 0.745 ± 0.142
0.571TrpVal: 0.571 ± 0.118
0.174TrpTrp: 0.174 ± 0.067
0.348TrpTyr: 0.348 ± 0.085
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.508TyrAla: 2.508 ± 0.211
0.546TyrCys: 0.546 ± 0.12
1.863TyrAsp: 1.863 ± 0.185
1.217TyrGlu: 1.217 ± 0.144
1.093TyrPhe: 1.093 ± 0.172
1.863TyrGly: 1.863 ± 0.176
0.621TyrHis: 0.621 ± 0.11
1.639TyrIle: 1.639 ± 0.243
1.068TyrLys: 1.068 ± 0.184
2.707TyrLeu: 2.707 ± 0.268
0.919TyrMet: 0.919 ± 0.177
1.093TyrAsn: 1.093 ± 0.14
1.366TyrPro: 1.366 ± 0.241
0.745TyrGln: 0.745 ± 0.102
2.037TyrArg: 2.037 ± 0.165
2.657TyrSer: 2.657 ± 0.306
2.508TyrThr: 2.508 ± 0.237
2.31TyrVal: 2.31 ± 0.212
0.323TyrTrp: 0.323 ± 0.101
1.118TyrTyr: 1.118 ± 0.168
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (40266 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski