Amino acid dipepetide frequency for Equid alphaherpesvirus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.823AlaAla: 21.823 ± 1.899
2.433AlaCys: 2.433 ± 0.278
6.346AlaAsp: 6.346 ± 0.376
7.249AlaGlu: 7.249 ± 0.654
4.013AlaPhe: 4.013 ± 0.343
9.783AlaGly: 9.783 ± 0.649
2.709AlaHis: 2.709 ± 0.307
3.361AlaIle: 3.361 ± 0.313
2.483AlaLys: 2.483 ± 0.387
12.717AlaLeu: 12.717 ± 0.761
2.157AlaMet: 2.157 ± 0.183
2.96AlaAsn: 2.96 ± 0.218
10.961AlaPro: 10.961 ± 1.001
2.659AlaGln: 2.659 ± 0.244
12.617AlaArg: 12.617 ± 0.751
8.88AlaSer: 8.88 ± 0.603
6.472AlaThr: 6.472 ± 0.48
9.256AlaVal: 9.256 ± 0.666
1.38AlaTrp: 1.38 ± 0.17
3.261AlaTyr: 3.261 ± 0.309
0.0AlaXaa: 0.0 ± 0.0
Cys
2.483CysAla: 2.483 ± 0.341
0.627CysCys: 0.627 ± 0.113
1.154CysAsp: 1.154 ± 0.189
1.304CysGlu: 1.304 ± 0.228
0.828CysPhe: 0.828 ± 0.156
1.58CysGly: 1.58 ± 0.167
0.552CysHis: 0.552 ± 0.104
0.602CysIle: 0.602 ± 0.127
0.426CysLys: 0.426 ± 0.107
1.781CysLeu: 1.781 ± 0.221
0.251CysMet: 0.251 ± 0.079
0.527CysAsn: 0.527 ± 0.157
1.254CysPro: 1.254 ± 0.214
0.452CysGln: 0.452 ± 0.111
1.455CysArg: 1.455 ± 0.169
0.727CysSer: 0.727 ± 0.153
0.677CysThr: 0.677 ± 0.142
1.48CysVal: 1.48 ± 0.201
0.276CysTrp: 0.276 ± 0.087
0.376CysTyr: 0.376 ± 0.087
0.0CysXaa: 0.0 ± 0.0
Asp
7.5AspAla: 7.5 ± 0.704
0.677AspCys: 0.677 ± 0.161
3.436AspAsp: 3.436 ± 0.367
4.089AspGlu: 4.089 ± 0.322
1.58AspPhe: 1.58 ± 0.168
4.39AspGly: 4.39 ± 0.361
0.903AspHis: 0.903 ± 0.145
1.254AspIle: 1.254 ± 0.186
0.702AspLys: 0.702 ± 0.148
5.518AspLeu: 5.518 ± 0.428
0.903AspMet: 0.903 ± 0.143
0.928AspAsn: 0.928 ± 0.179
3.712AspPro: 3.712 ± 0.29
0.727AspGln: 0.727 ± 0.143
3.637AspArg: 3.637 ± 0.247
3.085AspSer: 3.085 ± 0.308
1.982AspThr: 1.982 ± 0.257
4.038AspVal: 4.038 ± 0.32
0.702AspTrp: 0.702 ± 0.109
1.505AspTyr: 1.505 ± 0.2
0.0AspXaa: 0.0 ± 0.0
Glu
7.726GluAla: 7.726 ± 0.561
1.079GluCys: 1.079 ± 0.205
3.863GluAsp: 3.863 ± 0.338
3.938GluGlu: 3.938 ± 0.443
2.207GluPhe: 2.207 ± 0.223
4.339GluGly: 4.339 ± 0.386
1.329GluHis: 1.329 ± 0.146
1.605GluIle: 1.605 ± 0.222
1.329GluLys: 1.329 ± 0.207
5.418GluLeu: 5.418 ± 0.377
1.179GluMet: 1.179 ± 0.228
1.731GluAsn: 1.731 ± 0.175
3.336GluPro: 3.336 ± 0.297
1.38GluGln: 1.38 ± 0.236
5.493GluArg: 5.493 ± 0.417
3.336GluSer: 3.336 ± 0.261
3.135GluThr: 3.135 ± 0.301
3.612GluVal: 3.612 ± 0.229
0.401GluTrp: 0.401 ± 0.096
1.555GluTyr: 1.555 ± 0.192
0.0GluXaa: 0.0 ± 0.0
Phe
3.512PheAla: 3.512 ± 0.305
0.878PheCys: 0.878 ± 0.16
2.634PheAsp: 2.634 ± 0.24
1.906PheGlu: 1.906 ± 0.227
1.706PhePhe: 1.706 ± 0.228
2.834PheGly: 2.834 ± 0.304
0.828PheHis: 0.828 ± 0.183
1.229PheIle: 1.229 ± 0.246
1.58PheLys: 1.58 ± 0.27
3.211PheLeu: 3.211 ± 0.326
0.903PheMet: 0.903 ± 0.171
1.028PheAsn: 1.028 ± 0.117
1.856PhePro: 1.856 ± 0.266
0.702PheGln: 0.702 ± 0.124
2.082PheArg: 2.082 ± 0.234
2.784PheSer: 2.784 ± 0.254
1.781PheThr: 1.781 ± 0.228
3.186PheVal: 3.186 ± 0.286
0.251PheTrp: 0.251 ± 0.081
1.129PheTyr: 1.129 ± 0.154
0.0PheXaa: 0.0 ± 0.0
Gly
10.911GlyAla: 10.911 ± 0.742
1.254GlyCys: 1.254 ± 0.204
4.666GlyAsp: 4.666 ± 0.309
5.242GlyGlu: 5.242 ± 0.373
2.659GlyPhe: 2.659 ± 0.243
8.227GlyGly: 8.227 ± 0.545
1.104GlyHis: 1.104 ± 0.164
1.856GlyIle: 1.856 ± 0.289
1.605GlyLys: 1.605 ± 0.2
6.547GlyLeu: 6.547 ± 0.408
1.028GlyMet: 1.028 ± 0.148
1.63GlyAsn: 1.63 ± 0.26
5.393GlyPro: 5.393 ± 0.471
1.957GlyGln: 1.957 ± 0.208
6.547GlyArg: 6.547 ± 0.422
5.117GlySer: 5.117 ± 0.366
3.462GlyThr: 3.462 ± 0.378
5.343GlyVal: 5.343 ± 0.371
0.727GlyTrp: 0.727 ± 0.142
1.706GlyTyr: 1.706 ± 0.213
0.0GlyXaa: 0.0 ± 0.0
His
2.383HisAla: 2.383 ± 0.228
0.351HisCys: 0.351 ± 0.089
0.727HisAsp: 0.727 ± 0.103
0.652HisGlu: 0.652 ± 0.137
0.702HisPhe: 0.702 ± 0.155
1.605HisGly: 1.605 ± 0.223
0.527HisHis: 0.527 ± 0.136
0.652HisIle: 0.652 ± 0.142
0.326HisLys: 0.326 ± 0.086
2.508HisLeu: 2.508 ± 0.252
0.301HisMet: 0.301 ± 0.107
0.727HisAsn: 0.727 ± 0.136
2.157HisPro: 2.157 ± 0.266
0.727HisGln: 0.727 ± 0.118
1.982HisArg: 1.982 ± 0.22
1.355HisSer: 1.355 ± 0.168
0.953HisThr: 0.953 ± 0.145
1.706HisVal: 1.706 ± 0.246
0.075HisTrp: 0.075 ± 0.049
0.803HisTyr: 0.803 ± 0.176
0.0HisXaa: 0.0 ± 0.0
Ile
3.361IleAla: 3.361 ± 0.313
0.426IleCys: 0.426 ± 0.101
1.706IleAsp: 1.706 ± 0.185
1.656IleGlu: 1.656 ± 0.183
1.254IlePhe: 1.254 ± 0.171
1.881IleGly: 1.881 ± 0.253
0.702IleHis: 0.702 ± 0.116
1.028IleIle: 1.028 ± 0.161
0.677IleLys: 0.677 ± 0.139
2.659IleLeu: 2.659 ± 0.262
0.426IleMet: 0.426 ± 0.113
1.179IleAsn: 1.179 ± 0.223
1.53IlePro: 1.53 ± 0.223
0.878IleGln: 0.878 ± 0.162
1.831IleArg: 1.831 ± 0.216
1.555IleSer: 1.555 ± 0.169
1.756IleThr: 1.756 ± 0.216
2.007IleVal: 2.007 ± 0.211
0.276IleTrp: 0.276 ± 0.099
1.054IleTyr: 1.054 ± 0.184
0.0IleXaa: 0.0 ± 0.0
Lys
2.258LysAla: 2.258 ± 0.294
0.276LysCys: 0.276 ± 0.077
0.828LysAsp: 0.828 ± 0.162
1.129LysGlu: 1.129 ± 0.135
1.204LysPhe: 1.204 ± 0.223
1.054LysGly: 1.054 ± 0.151
0.652LysHis: 0.652 ± 0.146
0.727LysIle: 0.727 ± 0.173
0.953LysLys: 0.953 ± 0.188
1.982LysLeu: 1.982 ± 0.33
0.803LysMet: 0.803 ± 0.146
0.928LysAsn: 0.928 ± 0.175
1.405LysPro: 1.405 ± 0.214
0.928LysGln: 0.928 ± 0.134
2.659LysArg: 2.659 ± 0.23
1.355LysSer: 1.355 ± 0.173
1.756LysThr: 1.756 ± 0.174
1.229LysVal: 1.229 ± 0.206
0.276LysTrp: 0.276 ± 0.084
0.953LysTyr: 0.953 ± 0.147
0.0LysXaa: 0.0 ± 0.0
Leu
13.319LeuAla: 13.319 ± 0.821
1.931LeuCys: 1.931 ± 0.262
5.017LeuAsp: 5.017 ± 0.51
5.594LeuGlu: 5.594 ± 0.417
3.863LeuPhe: 3.863 ± 0.335
6.572LeuGly: 6.572 ± 0.426
1.957LeuHis: 1.957 ± 0.226
2.684LeuIle: 2.684 ± 0.273
2.358LeuLys: 2.358 ± 0.255
9.783LeuLeu: 9.783 ± 0.629
2.107LeuMet: 2.107 ± 0.28
2.358LeuAsn: 2.358 ± 0.315
6.446LeuPro: 6.446 ± 0.449
2.759LeuGln: 2.759 ± 0.228
7.751LeuArg: 7.751 ± 0.363
6.446LeuSer: 6.446 ± 0.366
4.239LeuThr: 4.239 ± 0.26
7.274LeuVal: 7.274 ± 0.523
1.179LeuTrp: 1.179 ± 0.18
2.609LeuTyr: 2.609 ± 0.229
0.0LeuXaa: 0.0 ± 0.0
Met
2.283MetAla: 2.283 ± 0.234
0.376MetCys: 0.376 ± 0.079
1.229MetAsp: 1.229 ± 0.176
1.304MetGlu: 1.304 ± 0.162
0.702MetPhe: 0.702 ± 0.136
1.179MetGly: 1.179 ± 0.204
0.301MetHis: 0.301 ± 0.081
0.577MetIle: 0.577 ± 0.146
0.376MetLys: 0.376 ± 0.088
2.383MetLeu: 2.383 ± 0.233
0.376MetMet: 0.376 ± 0.117
0.401MetAsn: 0.401 ± 0.076
1.054MetPro: 1.054 ± 0.187
0.376MetGln: 0.376 ± 0.089
1.405MetArg: 1.405 ± 0.216
1.329MetSer: 1.329 ± 0.165
0.778MetThr: 0.778 ± 0.142
1.129MetVal: 1.129 ± 0.152
0.05MetTrp: 0.05 ± 0.029
0.477MetTyr: 0.477 ± 0.097
0.0MetXaa: 0.0 ± 0.0
Asn
3.186AsnAla: 3.186 ± 0.281
0.552AsnCys: 0.552 ± 0.14
0.828AsnAsp: 0.828 ± 0.133
1.38AsnGlu: 1.38 ± 0.254
1.054AsnPhe: 1.054 ± 0.193
2.007AsnGly: 2.007 ± 0.292
0.602AsnHis: 0.602 ± 0.116
0.803AsnIle: 0.803 ± 0.143
0.727AsnLys: 0.727 ± 0.136
2.684AsnLeu: 2.684 ± 0.27
0.577AsnMet: 0.577 ± 0.149
0.853AsnAsn: 0.853 ± 0.152
1.806AsnPro: 1.806 ± 0.239
0.602AsnGln: 0.602 ± 0.104
1.505AsnArg: 1.505 ± 0.225
2.082AsnSer: 2.082 ± 0.276
1.329AsnThr: 1.329 ± 0.179
1.58AsnVal: 1.58 ± 0.242
0.226AsnTrp: 0.226 ± 0.082
1.028AsnTyr: 1.028 ± 0.185
0.0AsnXaa: 0.0 ± 0.0
Pro
11.112ProAla: 11.112 ± 0.957
1.304ProCys: 1.304 ± 0.198
3.186ProAsp: 3.186 ± 0.282
4.666ProGlu: 4.666 ± 0.392
1.806ProPhe: 1.806 ± 0.217
6.873ProGly: 6.873 ± 0.666
1.555ProHis: 1.555 ± 0.249
1.656ProIle: 1.656 ± 0.198
2.182ProLys: 2.182 ± 0.318
6.196ProLeu: 6.196 ± 0.453
1.003ProMet: 1.003 ± 0.169
1.329ProAsn: 1.329 ± 0.188
10.535ProPro: 10.535 ± 1.183
2.082ProGln: 2.082 ± 0.241
5.844ProArg: 5.844 ± 0.438
5.619ProSer: 5.619 ± 0.454
3.637ProThr: 3.637 ± 0.385
4.139ProVal: 4.139 ± 0.438
0.753ProTrp: 0.753 ± 0.156
1.254ProTyr: 1.254 ± 0.175
0.0ProXaa: 0.0 ± 0.0
Gln
2.985GlnAla: 2.985 ± 0.346
0.477GlnCys: 0.477 ± 0.113
0.928GlnAsp: 0.928 ± 0.128
1.079GlnGlu: 1.079 ± 0.161
0.878GlnPhe: 0.878 ± 0.166
1.38GlnGly: 1.38 ± 0.141
0.753GlnHis: 0.753 ± 0.136
1.154GlnIle: 1.154 ± 0.156
0.878GlnLys: 0.878 ± 0.156
2.659GlnLeu: 2.659 ± 0.317
0.702GlnMet: 0.702 ± 0.133
0.753GlnAsn: 0.753 ± 0.139
2.157GlnPro: 2.157 ± 0.356
1.129GlnGln: 1.129 ± 0.194
2.634GlnArg: 2.634 ± 0.294
1.304GlnSer: 1.304 ± 0.164
2.007GlnThr: 2.007 ± 0.255
1.179GlnVal: 1.179 ± 0.171
0.151GlnTrp: 0.151 ± 0.061
0.753GlnTyr: 0.753 ± 0.119
0.0GlnXaa: 0.0 ± 0.0
Arg
11.764ArgAla: 11.764 ± 0.898
1.781ArgCys: 1.781 ± 0.232
3.763ArgAsp: 3.763 ± 0.336
4.816ArgGlu: 4.816 ± 0.352
2.759ArgPhe: 2.759 ± 0.261
6.572ArgGly: 6.572 ± 0.448
2.207ArgHis: 2.207 ± 0.231
1.756ArgIle: 1.756 ± 0.191
1.756ArgLys: 1.756 ± 0.186
8.353ArgLeu: 8.353 ± 0.48
1.003ArgMet: 1.003 ± 0.179
1.931ArgAsn: 1.931 ± 0.222
6.296ArgPro: 6.296 ± 0.464
2.559ArgGln: 2.559 ± 0.243
10.309ArgArg: 10.309 ± 0.681
4.992ArgSer: 4.992 ± 0.387
3.487ArgThr: 3.487 ± 0.323
6.02ArgVal: 6.02 ± 0.34
0.928ArgTrp: 0.928 ± 0.145
1.931ArgTyr: 1.931 ± 0.242
0.0ArgXaa: 0.0 ± 0.0
Ser
7.901SerAla: 7.901 ± 0.641
1.455SerCys: 1.455 ± 0.195
3.411SerAsp: 3.411 ± 0.234
4.089SerGlu: 4.089 ± 0.355
2.082SerPhe: 2.082 ± 0.225
6.346SerGly: 6.346 ± 0.38
1.279SerHis: 1.279 ± 0.188
1.731SerIle: 1.731 ± 0.211
1.63SerLys: 1.63 ± 0.2
6.12SerLeu: 6.12 ± 0.33
1.254SerMet: 1.254 ± 0.184
1.304SerAsn: 1.304 ± 0.217
5.493SerPro: 5.493 ± 0.545
1.706SerGln: 1.706 ± 0.358
5.393SerArg: 5.393 ± 0.308
5.995SerSer: 5.995 ± 0.464
3.035SerThr: 3.035 ± 0.288
4.766SerVal: 4.766 ± 0.362
0.853SerTrp: 0.853 ± 0.153
1.656SerTyr: 1.656 ± 0.24
0.0SerXaa: 0.0 ± 0.0
Thr
6.522ThrAla: 6.522 ± 0.434
0.828ThrCys: 0.828 ± 0.138
1.756ThrAsp: 1.756 ± 0.206
2.132ThrGlu: 2.132 ± 0.197
1.881ThrPhe: 1.881 ± 0.221
3.11ThrGly: 3.11 ± 0.298
1.129ThrHis: 1.129 ± 0.167
1.906ThrIle: 1.906 ± 0.216
0.928ThrLys: 0.928 ± 0.182
5.242ThrLeu: 5.242 ± 0.335
0.753ThrMet: 0.753 ± 0.141
1.455ThrAsn: 1.455 ± 0.243
4.967ThrPro: 4.967 ± 0.416
1.656ThrGln: 1.656 ± 0.213
3.737ThrArg: 3.737 ± 0.236
3.085ThrSer: 3.085 ± 0.319
3.161ThrThr: 3.161 ± 0.326
3.587ThrVal: 3.587 ± 0.372
0.527ThrTrp: 0.527 ± 0.147
1.555ThrTyr: 1.555 ± 0.235
0.0ThrXaa: 0.0 ± 0.0
Val
8.252ValAla: 8.252 ± 0.499
1.605ValCys: 1.605 ± 0.231
3.386ValAsp: 3.386 ± 0.29
3.687ValGlu: 3.687 ± 0.39
3.386ValPhe: 3.386 ± 0.338
4.54ValGly: 4.54 ± 0.343
1.154ValHis: 1.154 ± 0.18
2.007ValIle: 2.007 ± 0.228
1.304ValLys: 1.304 ± 0.194
6.848ValLeu: 6.848 ± 0.367
1.43ValMet: 1.43 ± 0.255
2.358ValAsn: 2.358 ± 0.327
4.289ValPro: 4.289 ± 0.312
1.881ValGln: 1.881 ± 0.224
5.042ValArg: 5.042 ± 0.33
6.02ValSer: 6.02 ± 0.417
3.662ValThr: 3.662 ± 0.308
5.343ValVal: 5.343 ± 0.42
0.753ValTrp: 0.753 ± 0.13
2.935ValTyr: 2.935 ± 0.399
0.0ValXaa: 0.0 ± 0.0
Trp
0.878TrpAla: 0.878 ± 0.158
0.125TrpCys: 0.125 ± 0.075
0.702TrpAsp: 0.702 ± 0.123
0.477TrpGlu: 0.477 ± 0.107
0.351TrpPhe: 0.351 ± 0.107
0.928TrpGly: 0.928 ± 0.126
0.351TrpHis: 0.351 ± 0.103
0.351TrpIle: 0.351 ± 0.103
0.401TrpLys: 0.401 ± 0.088
0.903TrpLeu: 0.903 ± 0.127
0.226TrpMet: 0.226 ± 0.073
0.201TrpAsn: 0.201 ± 0.075
0.627TrpPro: 0.627 ± 0.164
0.251TrpGln: 0.251 ± 0.082
0.878TrpArg: 0.878 ± 0.161
0.502TrpSer: 0.502 ± 0.094
1.003TrpThr: 1.003 ± 0.146
0.702TrpVal: 0.702 ± 0.157
0.176TrpTrp: 0.176 ± 0.064
0.276TrpTyr: 0.276 ± 0.106
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.361TyrAla: 3.361 ± 0.331
0.376TyrCys: 0.376 ± 0.091
1.706TyrAsp: 1.706 ± 0.205
1.605TyrGlu: 1.605 ± 0.223
1.104TyrPhe: 1.104 ± 0.166
1.781TyrGly: 1.781 ± 0.222
0.627TyrHis: 0.627 ± 0.132
0.903TyrIle: 0.903 ± 0.141
0.727TyrLys: 0.727 ± 0.178
2.734TyrLeu: 2.734 ± 0.268
0.677TyrMet: 0.677 ± 0.131
0.828TyrAsn: 0.828 ± 0.154
1.48TyrPro: 1.48 ± 0.281
0.527TyrGln: 0.527 ± 0.114
2.182TyrArg: 2.182 ± 0.175
1.931TyrSer: 1.931 ± 0.268
1.605TyrThr: 1.605 ± 0.278
2.308TyrVal: 2.308 ± 0.216
0.351TyrTrp: 0.351 ± 0.092
1.179TyrTyr: 1.179 ± 0.225
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (39868 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski