Amino acid dipeptide frequency for Epstein-Barr virus (strain B95-8) (HHV-4) (Human herpesvirus 4)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
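The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function names, the one-letter alphabet, the choice to normalise by the total number of dipeptides, and the toy sequences are all hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"            # X stands in for Xaa

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400      # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard letters to X.
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one sequence of length >= 2")
    # Assumption: normalise by the total number of dipeptides counted.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def representation(freq_permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "as expected"


toy = ["MAAL", "AALP"]               # made-up sequences, not real EBV proteins
freqs = dipeptide_permille(toy)
```

Applied to the table values, `representation(11.884)` (AlaAla) gives "over" and `representation(0.151)` (TrpTrp) gives "under", which matches the red and blue marking described in the text.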

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

| 1st ↓ / 2nd → | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 11.884 ± 1.537 | 1.553 ± 0.198 | 4.184 ± 0.943 | 3.904 ± 0.406 | 3.127 ± 0.308 | 8.519 ± 2.004 | 2.027 ± 0.189 | 2.955 ± 0.297 | 1.898 ± 0.23 | 7.872 ± 0.724 | 1.467 ± 0.203 | 1.833 ± 0.277 | 8.843 ± 1.042 | 3.817 ± 0.374 | 6.039 ± 0.586 | 8.735 ± 0.756 | 5.457 ± 0.611 | 5.91 ± 0.509 | 1.488 ± 0.17 | 2.027 ± 0.243 | 0.0 ± 0.0 |
| Cys | 1.251 ± 0.199 | 0.41 ± 0.11 | 0.733 ± 0.154 | 0.776 ± 0.138 | 0.69 ± 0.129 | 1.208 ± 0.171 | 0.518 ± 0.121 | 0.647 ± 0.14 | 0.733 ± 0.118 | 2.825 ± 0.356 | 0.302 ± 0.093 | 0.604 ± 0.112 | 2.351 ± 0.413 | 0.755 ± 0.115 | 1.337 ± 0.222 | 1.1 ± 0.171 | 1.143 ± 0.163 | 1.122 ± 0.151 | 0.324 ± 0.09 | 0.625 ± 0.113 | 0.0 ± 0.0 |
| Asp | 3.559 ± 0.316 | 0.625 ± 0.138 | 2.006 ± 0.347 | 3.127 ± 0.355 | 1.596 ± 0.161 | 2.545 ± 0.272 | 0.82 ± 0.144 | 1.833 ± 0.217 | 1.1 ± 0.173 | 4.572 ± 0.405 | 0.949 ± 0.176 | 1.639 ± 0.255 | 5.241 ± 0.981 | 1.1 ± 0.164 | 2.868 ± 0.279 | 2.761 ± 0.24 | 2.221 ± 0.216 | 2.868 ± 0.292 | 0.431 ± 0.112 | 1.445 ± 0.159 | 0.0 ± 0.0 |
| Glu | 5.543 ± 0.647 | 0.733 ± 0.129 | 3.214 ± 0.302 | 4.378 ± 0.472 | 1.316 ± 0.164 | 4.572 ± 0.56 | 1.229 ± 0.163 | 2.178 ± 0.217 | 1.359 ± 0.198 | 4.421 ± 0.425 | 1.186 ± 0.17 | 1.747 ± 0.227 | 3.58 ± 0.475 | 1.812 ± 0.245 | 2.739 ± 0.292 | 3.494 ± 0.272 | 3.365 ± 0.346 | 3.192 ± 0.269 | 0.388 ± 0.089 | 0.927 ± 0.184 | 0.0 ± 0.0 |
| Phe | 1.661 ± 0.195 | 0.971 ± 0.158 | 1.445 ± 0.183 | 1.574 ± 0.181 | 1.51 ± 0.194 | 1.963 ± 0.185 | 0.712 ± 0.108 | 1.833 ± 0.232 | 1.229 ± 0.188 | 4.529 ± 0.415 | 1.1 ± 0.139 | 1.057 ± 0.149 | 2.07 ± 0.179 | 1.359 ± 0.166 | 1.445 ± 0.174 | 3.149 ± 0.243 | 1.769 ± 0.204 | 2.459 ± 0.306 | 0.367 ± 0.087 | 1.639 ± 0.193 | 0.0 ± 0.0 |
| Gly | 9.468 ± 2.103 | 1.488 ± 0.273 | 3.559 ± 0.339 | 3.623 ± 0.331 | 1.79 ± 0.2 | 9.015 ± 2.171 | 2.804 ± 0.55 | 1.833 ± 0.206 | 2.178 ± 0.224 | 7.333 ± 0.433 | 0.906 ± 0.14 | 1.963 ± 0.293 | 8.778 ± 1.252 | 3.861 ± 0.442 | 5.586 ± 0.612 | 4.917 ± 0.359 | 4.098 ± 0.328 | 3.494 ± 0.295 | 1.035 ± 0.172 | 1.337 ± 0.201 | 0.0 ± 0.0 |
| His | 1.812 ± 0.232 | 0.431 ± 0.086 | 0.906 ± 0.161 | 1.078 ± 0.154 | 0.669 ± 0.135 | 1.963 ± 0.216 | 0.798 ± 0.171 | 0.755 ± 0.124 | 0.69 ± 0.127 | 3.235 ± 0.305 | 0.367 ± 0.097 | 0.518 ± 0.099 | 3.3 ± 0.573 | 1.014 ± 0.154 | 1.941 ± 0.279 | 1.618 ± 0.203 | 1.445 ± 0.202 | 1.855 ± 0.181 | 0.216 ± 0.068 | 0.582 ± 0.113 | 0.0 ± 0.0 |
| Ile | 2.308 ± 0.248 | 0.863 ± 0.137 | 1.165 ± 0.17 | 1.725 ± 0.223 | 1.769 ± 0.253 | 1.272 ± 0.228 | 0.776 ± 0.154 | 1.596 ± 0.242 | 1.51 ± 0.193 | 3.99 ± 0.398 | 0.776 ± 0.126 | 1.402 ± 0.206 | 2.523 ± 0.321 | 1.337 ± 0.144 | 1.661 ± 0.213 | 2.523 ± 0.266 | 2.092 ± 0.259 | 2.135 ± 0.228 | 0.453 ± 0.125 | 1.229 ± 0.149 | 0.0 ± 0.0 |
| Lys | 2.523 ± 0.256 | 0.518 ± 0.114 | 1.596 ± 0.216 | 1.876 ± 0.231 | 0.927 ± 0.14 | 1.618 ± 0.195 | 0.906 ± 0.142 | 1.272 ± 0.177 | 1.531 ± 0.269 | 2.545 ± 0.237 | 0.539 ± 0.12 | 1.078 ± 0.162 | 1.876 ± 0.249 | 1.553 ± 0.19 | 2.027 ± 0.238 | 2.049 ± 0.222 | 1.92 ± 0.248 | 1.79 ± 0.206 | 0.237 ± 0.069 | 0.539 ± 0.112 | 0.0 ± 0.0 |
| Leu | 8.972 ± 0.715 | 2.696 ± 0.333 | 3.861 ± 0.416 | 5.112 ± 0.535 | 4.098 ± 0.387 | 7.247 ± 0.463 | 2.394 ± 0.243 | 3.3 ± 0.452 | 2.955 ± 0.288 | 11.819 ± 0.889 | 2.092 ± 0.222 | 2.761 ± 0.243 | 7.613 ± 0.501 | 4.745 ± 0.513 | 7.635 ± 0.535 | 7.462 ± 0.459 | 5.996 ± 0.464 | 5.608 ± 0.385 | 1.143 ± 0.196 | 2.998 ± 0.275 | 0.0 ± 0.0 |
| Met | 2.135 ± 0.241 | 0.431 ± 0.103 | 0.884 ± 0.157 | 1.208 ± 0.201 | 1.035 ± 0.21 | 0.949 ± 0.162 | 0.518 ± 0.116 | 0.625 ± 0.129 | 0.474 ± 0.106 | 1.898 ± 0.224 | 0.302 ± 0.078 | 0.41 ± 0.089 | 1.122 ± 0.174 | 0.798 ± 0.16 | 1.143 ± 0.175 | 1.165 ± 0.179 | 1.186 ± 0.181 | 0.971 ± 0.166 | 0.216 ± 0.067 | 0.518 ± 0.141 | 0.0 ± 0.0 |
| Asn | 2.459 ± 0.349 | 0.539 ± 0.097 | 0.863 ± 0.16 | 1.035 ± 0.127 | 1.143 ± 0.165 | 1.359 ± 0.204 | 0.733 ± 0.129 | 1.79 ± 0.232 | 1.337 ± 0.222 | 3.106 ± 0.314 | 0.582 ± 0.148 | 1.035 ± 0.153 | 1.941 ± 0.224 | 0.971 ± 0.163 | 1.639 ± 0.237 | 2.135 ± 0.223 | 1.704 ± 0.201 | 1.984 ± 0.269 | 0.194 ± 0.062 | 0.863 ± 0.137 | 0.0 ± 0.0 |
| Pro | 9.274 ± 1.418 | 1.488 ± 0.182 | 2.868 ± 0.256 | 5.629 ± 0.517 | 2.006 ± 0.19 | 9.684 ± 1.253 | 2.135 ± 0.284 | 2.049 ± 0.222 | 2.006 ± 0.236 | 7.505 ± 0.538 | 1.1 ± 0.192 | 1.661 ± 0.261 | 12.099 ± 1.475 | 3.839 ± 0.447 | 8.929 ± 1.436 | 8.907 ± 0.925 | 5.974 ± 0.816 | 6.686 ± 0.581 | 1.359 ± 0.189 | 1.402 ± 0.263 | 0.0 ± 0.0 |
| Gln | 4.615 ± 0.391 | 0.518 ± 0.116 | 2.221 ± 0.239 | 2.092 ± 0.306 | 1.251 ± 0.202 | 3.235 ± 0.375 | 0.712 ± 0.133 | 1.251 ± 0.158 | 1.251 ± 0.184 | 3.063 ± 0.302 | 0.733 ± 0.154 | 1.186 ± 0.155 | 4.486 ± 0.629 | 2.157 ± 0.343 | 2.912 ± 0.413 | 2.998 ± 0.302 | 2.653 ± 0.229 | 2.329 ± 0.276 | 0.453 ± 0.103 | 0.841 ± 0.172 | 0.0 ± 0.0 |
| Arg | 6.47 ± 0.587 | 1.186 ± 0.176 | 3.494 ± 0.334 | 3.429 ± 0.363 | 1.488 ± 0.174 | 6.211 ± 0.675 | 1.812 ± 0.275 | 1.574 ± 0.189 | 1.876 ± 0.286 | 6.578 ± 0.635 | 0.906 ± 0.122 | 1.574 ± 0.178 | 7.872 ± 0.941 | 2.912 ± 0.298 | 7.656 ± 0.864 | 5.133 ± 0.659 | 3.365 ± 0.486 | 5.262 ± 0.493 | 0.712 ± 0.179 | 1.488 ± 0.169 | 0.0 ± 0.0 |
| Ser | 5.845 ± 0.627 | 1.423 ± 0.201 | 3.321 ± 0.32 | 3.192 ± 0.311 | 2.545 ± 0.289 | 7.764 ± 0.789 | 2.114 ± 0.277 | 2.286 ± 0.236 | 1.79 ± 0.257 | 8.843 ± 0.557 | 1.488 ± 0.165 | 1.963 ± 0.216 | 8.325 ± 0.9 | 3.127 ± 0.282 | 5.413 ± 0.518 | 5.974 ± 0.487 | 4.443 ± 0.462 | 4.508 ± 0.308 | 0.798 ± 0.129 | 1.812 ± 0.257 | 0.0 ± 0.0 |
| Thr | 4.917 ± 0.398 | 0.992 ± 0.166 | 2.588 ± 0.252 | 2.523 ± 0.3 | 2.243 ± 0.265 | 3.904 ± 0.364 | 1.359 ± 0.198 | 1.812 ± 0.237 | 1.467 ± 0.237 | 6.47 ± 0.414 | 1.078 ± 0.175 | 1.79 ± 0.282 | 6.535 ± 0.697 | 1.812 ± 0.216 | 4.378 ± 0.591 | 4.961 ± 0.891 | 3.904 ± 0.714 | 4.421 ± 0.284 | 0.841 ± 0.158 | 2.049 ± 0.262 | 0.0 ± 0.0 |
| Val | 5.543 ± 0.419 | 1.682 ± 0.193 | 2.61 ± 0.272 | 3.257 ± 0.292 | 2.718 ± 0.28 | 3.817 ± 0.516 | 1.747 ± 0.216 | 1.941 ± 0.232 | 2.286 ± 0.237 | 6.147 ± 0.491 | 1.423 ± 0.201 | 1.725 ± 0.274 | 5.672 ± 0.459 | 2.61 ± 0.256 | 3.17 ± 0.344 | 5.543 ± 0.396 | 4.788 ± 0.53 | 3.882 ± 0.321 | 0.669 ± 0.111 | 2.07 ± 0.265 | 0.0 ± 0.0 |
| Trp | 0.971 ± 0.16 | 0.194 ± 0.064 | 0.561 ± 0.105 | 0.604 ± 0.107 | 0.561 ± 0.11 | 0.647 ± 0.129 | 0.367 ± 0.096 | 0.41 ± 0.077 | 0.302 ± 0.081 | 1.229 ± 0.178 | 0.324 ± 0.091 | 0.388 ± 0.086 | 1.122 ± 0.153 | 0.518 ± 0.091 | 1.186 ± 0.212 | 0.539 ± 0.126 | 0.884 ± 0.156 | 0.625 ± 0.15 | 0.151 ± 0.068 | 0.259 ± 0.075 | 0.0 ± 0.0 |
| Tyr | 2.351 ± 0.23 | 0.712 ± 0.102 | 1.229 ± 0.173 | 1.229 ± 0.128 | 1.359 ± 0.188 | 1.574 ± 0.196 | 0.712 ± 0.131 | 1.165 ± 0.159 | 0.949 ± 0.151 | 2.523 ± 0.269 | 0.345 ± 0.108 | 1.078 ± 0.186 | 1.251 ± 0.153 | 0.863 ± 0.181 | 1.294 ± 0.209 | 1.812 ± 0.247 | 1.639 ± 0.201 | 2.2 ± 0.26 | 0.367 ± 0.088 | 0.927 ± 0.164 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 95 proteins (46367 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
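A protein-level bootstrap of this kind might look roughly like the sketch below. The page states only that bootstrapping was done 100 times at the protein level; reading the error as the standard deviation across resamples, and all function names and the fixed seed, are assumptions made for illustration.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def pair_permille(proteins, pair):
    """Frequency of `pair` in per mille of all dipeptides in `proteins`."""
    total = sum(max(len(p) - 1, 0) for p in proteins)
    hits = sum(count_pair(p, pair) for p in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the pair frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, keeping the sample size fixed.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    # Assumption: report the standard deviation of the resampled estimates.
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so a few unusual proteins (for example, long repeat-rich ones) show up as a larger error on the affected dipeptides.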

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski