Amino acid dipepetide frequency for Epstein-Barr virus (strain AG876) (HHV-4) (Human herpesvirus 4)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.175AlaAla: 10.175 ± 1.581
1.797AlaCys: 1.797 ± 0.2
3.108AlaAsp: 3.108 ± 0.302
4.128AlaGlu: 4.128 ± 0.447
3.351AlaPhe: 3.351 ± 0.314
8.329AlaGly: 8.329 ± 2.163
2.307AlaHis: 2.307 ± 0.236
3.302AlaIle: 3.302 ± 0.286
2.137AlaLys: 2.137 ± 0.237
8.766AlaLeu: 8.766 ± 0.774
1.724AlaMet: 1.724 ± 0.198
2.04AlaAsn: 2.04 ± 0.271
7.601AlaPro: 7.601 ± 1.048
3.254AlaGln: 3.254 ± 0.245
6.144AlaArg: 6.144 ± 0.582
7.892AlaSer: 7.892 ± 0.658
5.294AlaThr: 5.294 ± 0.509
6.338AlaVal: 6.338 ± 0.525
1.408AlaTrp: 1.408 ± 0.186
2.185AlaTyr: 2.185 ± 0.296
0.0AlaXaa: 0.0 ± 0.0
Cys
1.36CysAla: 1.36 ± 0.206
0.437CysCys: 0.437 ± 0.122
0.728CysAsp: 0.728 ± 0.146
0.971CysGlu: 0.971 ± 0.161
0.777CysPhe: 0.777 ± 0.141
1.336CysGly: 1.336 ± 0.216
0.631CysHis: 0.631 ± 0.141
0.68CysIle: 0.68 ± 0.133
0.631CysLys: 0.631 ± 0.124
2.914CysLeu: 2.914 ± 0.415
0.389CysMet: 0.389 ± 0.097
0.68CysAsn: 0.68 ± 0.154
1.481CysPro: 1.481 ± 0.215
0.898CysGln: 0.898 ± 0.149
1.433CysArg: 1.433 ± 0.233
1.238CysSer: 1.238 ± 0.179
0.971CysThr: 0.971 ± 0.139
1.263CysVal: 1.263 ± 0.168
0.17CysTrp: 0.17 ± 0.064
0.704CysTyr: 0.704 ± 0.12
0.0CysXaa: 0.0 ± 0.0
Asp
3.91AspAla: 3.91 ± 0.328
0.777AspCys: 0.777 ± 0.178
2.331AspAsp: 2.331 ± 0.345
3.4AspGlu: 3.4 ± 0.373
1.554AspPhe: 1.554 ± 0.171
2.89AspGly: 2.89 ± 0.321
0.898AspHis: 0.898 ± 0.16
2.137AspIle: 2.137 ± 0.179
1.287AspLys: 1.287 ± 0.2
5.124AspLeu: 5.124 ± 0.364
1.336AspMet: 1.336 ± 0.239
1.748AspAsn: 1.748 ± 0.345
3.448AspPro: 3.448 ± 0.304
1.166AspGln: 1.166 ± 0.194
2.501AspArg: 2.501 ± 0.266
2.963AspSer: 2.963 ± 0.295
2.501AspThr: 2.501 ± 0.206
3.011AspVal: 3.011 ± 0.232
0.486AspTrp: 0.486 ± 0.143
1.603AspTyr: 1.603 ± 0.163
0.0AspXaa: 0.0 ± 0.0
Glu
5.974GluAla: 5.974 ± 0.587
0.631GluCys: 0.631 ± 0.128
3.642GluAsp: 3.642 ± 0.308
4.711GluGlu: 4.711 ± 0.517
1.481GluPhe: 1.481 ± 0.151
3.91GluGly: 3.91 ± 0.423
1.408GluHis: 1.408 ± 0.177
2.453GluIle: 2.453 ± 0.254
1.433GluLys: 1.433 ± 0.18
4.832GluLeu: 4.832 ± 0.395
1.238GluMet: 1.238 ± 0.18
2.064GluAsn: 2.064 ± 0.215
2.987GluPro: 2.987 ± 0.367
2.04GluGln: 2.04 ± 0.244
3.084GluArg: 3.084 ± 0.34
3.642GluSer: 3.642 ± 0.303
3.837GluThr: 3.837 ± 0.297
3.497GluVal: 3.497 ± 0.358
0.437GluTrp: 0.437 ± 0.092
1.044GluTyr: 1.044 ± 0.201
0.0GluXaa: 0.0 ± 0.0
Phe
1.87PheAla: 1.87 ± 0.251
0.898PheCys: 0.898 ± 0.176
1.627PheAsp: 1.627 ± 0.193
1.846PheGlu: 1.846 ± 0.243
1.7PhePhe: 1.7 ± 0.218
2.21PheGly: 2.21 ± 0.206
0.753PheHis: 0.753 ± 0.119
2.015PheIle: 2.015 ± 0.285
1.457PheLys: 1.457 ± 0.232
5.172PheLeu: 5.172 ± 0.399
0.996PheMet: 0.996 ± 0.169
1.19PheAsn: 1.19 ± 0.163
1.967PhePro: 1.967 ± 0.174
1.481PheGln: 1.481 ± 0.178
1.651PheArg: 1.651 ± 0.175
3.278PheSer: 3.278 ± 0.278
2.015PheThr: 2.015 ± 0.203
2.744PheVal: 2.744 ± 0.29
0.413PheTrp: 0.413 ± 0.105
1.87PheTyr: 1.87 ± 0.21
0.0PheXaa: 0.0 ± 0.0
Gly
8.013GlyAla: 8.013 ± 2.175
1.19GlyCys: 1.19 ± 0.2
3.327GlyAsp: 3.327 ± 0.329
3.861GlyGlu: 3.861 ± 0.362
2.015GlyPhe: 2.015 ± 0.227
9.495GlyGly: 9.495 ± 2.422
1.967GlyHis: 1.967 ± 0.224
1.991GlyIle: 1.991 ± 0.224
2.088GlyLys: 2.088 ± 0.214
7.771GlyLeu: 7.771 ± 0.413
0.971GlyMet: 0.971 ± 0.159
2.21GlyAsn: 2.21 ± 0.28
6.435GlyPro: 6.435 ± 0.915
3.351GlyGln: 3.351 ± 0.362
5.075GlyArg: 5.075 ± 0.526
4.832GlySer: 4.832 ± 0.394
3.812GlyThr: 3.812 ± 0.322
3.594GlyVal: 3.594 ± 0.33
0.826GlyTrp: 0.826 ± 0.141
1.36GlyTyr: 1.36 ± 0.216
0.0GlyXaa: 0.0 ± 0.0
His
1.918HisAla: 1.918 ± 0.252
0.461HisCys: 0.461 ± 0.104
0.874HisAsp: 0.874 ± 0.151
1.238HisGlu: 1.238 ± 0.154
0.753HisPhe: 0.753 ± 0.134
1.87HisGly: 1.87 ± 0.223
0.85HisHis: 0.85 ± 0.176
0.826HisIle: 0.826 ± 0.148
0.826HisLys: 0.826 ± 0.134
3.254HisLeu: 3.254 ± 0.362
0.437HisMet: 0.437 ± 0.108
0.583HisAsn: 0.583 ± 0.098
2.137HisPro: 2.137 ± 0.296
0.874HisGln: 0.874 ± 0.18
1.603HisArg: 1.603 ± 0.225
1.821HisSer: 1.821 ± 0.21
1.748HisThr: 1.748 ± 0.192
1.967HisVal: 1.967 ± 0.213
0.243HisTrp: 0.243 ± 0.075
0.68HisTyr: 0.68 ± 0.123
0.0HisXaa: 0.0 ± 0.0
Ile
2.598IleAla: 2.598 ± 0.267
0.947IleCys: 0.947 ± 0.171
1.214IleAsp: 1.214 ± 0.178
1.894IleGlu: 1.894 ± 0.222
1.991IlePhe: 1.991 ± 0.278
1.408IleGly: 1.408 ± 0.234
0.801IleHis: 0.801 ± 0.159
1.967IleIle: 1.967 ± 0.278
1.797IleLys: 1.797 ± 0.249
4.371IleLeu: 4.371 ± 0.361
0.85IleMet: 0.85 ± 0.126
1.554IleAsn: 1.554 ± 0.232
2.793IlePro: 2.793 ± 0.33
1.554IleGln: 1.554 ± 0.175
1.846IleArg: 1.846 ± 0.274
2.987IleSer: 2.987 ± 0.273
2.38IleThr: 2.38 ± 0.329
2.404IleVal: 2.404 ± 0.229
0.486IleTrp: 0.486 ± 0.108
1.384IleTyr: 1.384 ± 0.187
0.0IleXaa: 0.0 ± 0.0
Lys
2.793LysAla: 2.793 ± 0.268
0.607LysCys: 0.607 ± 0.126
1.821LysAsp: 1.821 ± 0.223
1.846LysGlu: 1.846 ± 0.257
1.093LysPhe: 1.093 ± 0.158
1.457LysGly: 1.457 ± 0.206
1.044LysHis: 1.044 ± 0.151
1.408LysIle: 1.408 ± 0.187
1.7LysLys: 1.7 ± 0.255
2.865LysLeu: 2.865 ± 0.243
0.607LysMet: 0.607 ± 0.124
1.238LysAsn: 1.238 ± 0.15
1.846LysPro: 1.846 ± 0.241
1.554LysGln: 1.554 ± 0.209
2.283LysArg: 2.283 ± 0.244
2.283LysSer: 2.283 ± 0.251
2.137LysThr: 2.137 ± 0.254
1.846LysVal: 1.846 ± 0.232
0.291LysTrp: 0.291 ± 0.105
0.607LysTyr: 0.607 ± 0.129
0.0LysXaa: 0.0 ± 0.0
Leu
9.689LeuAla: 9.689 ± 0.73
2.234LeuCys: 2.234 ± 0.326
4.322LeuAsp: 4.322 ± 0.403
5.804LeuGlu: 5.804 ± 0.498
4.614LeuPhe: 4.614 ± 0.433
7.139LeuGly: 7.139 ± 0.521
2.355LeuHis: 2.355 ± 0.29
3.837LeuIle: 3.837 ± 0.452
3.181LeuLys: 3.181 ± 0.31
12.409LeuLeu: 12.409 ± 1.079
2.283LeuMet: 2.283 ± 0.229
3.108LeuAsn: 3.108 ± 0.281
7.139LeuPro: 7.139 ± 0.553
3.958LeuGln: 3.958 ± 0.315
7.382LeuArg: 7.382 ± 0.463
7.941LeuSer: 7.941 ± 0.45
6.629LeuThr: 6.629 ± 0.484
6.216LeuVal: 6.216 ± 0.353
1.287LeuTrp: 1.287 ± 0.232
3.448LeuTyr: 3.448 ± 0.373
0.0LeuXaa: 0.0 ± 0.0
Met
2.404MetAla: 2.404 ± 0.255
0.486MetCys: 0.486 ± 0.115
1.02MetAsp: 1.02 ± 0.152
1.336MetGlu: 1.336 ± 0.209
1.166MetPhe: 1.166 ± 0.216
1.068MetGly: 1.068 ± 0.164
0.559MetHis: 0.559 ± 0.107
0.704MetIle: 0.704 ± 0.142
0.486MetLys: 0.486 ± 0.104
2.088MetLeu: 2.088 ± 0.184
0.243MetMet: 0.243 ± 0.061
0.437MetAsn: 0.437 ± 0.093
1.214MetPro: 1.214 ± 0.152
0.923MetGln: 0.923 ± 0.183
1.238MetArg: 1.238 ± 0.201
1.384MetSer: 1.384 ± 0.186
1.433MetThr: 1.433 ± 0.215
0.947MetVal: 0.947 ± 0.155
0.291MetTrp: 0.291 ± 0.074
0.559MetTyr: 0.559 ± 0.134
0.0MetXaa: 0.0 ± 0.0
Asn
2.744AsnAla: 2.744 ± 0.332
0.607AsnCys: 0.607 ± 0.096
0.947AsnAsp: 0.947 ± 0.148
1.117AsnGlu: 1.117 ± 0.142
1.287AsnPhe: 1.287 ± 0.177
1.554AsnGly: 1.554 ± 0.24
0.777AsnHis: 0.777 ± 0.146
2.015AsnIle: 2.015 ± 0.251
1.457AsnLys: 1.457 ± 0.238
3.472AsnLeu: 3.472 ± 0.327
0.68AsnMet: 0.68 ± 0.166
1.19AsnAsn: 1.19 ± 0.159
2.113AsnPro: 2.113 ± 0.245
1.093AsnGln: 1.093 ± 0.188
1.846AsnArg: 1.846 ± 0.284
2.428AsnSer: 2.428 ± 0.285
1.967AsnThr: 1.967 ± 0.222
2.064AsnVal: 2.064 ± 0.215
0.219AsnTrp: 0.219 ± 0.083
0.947AsnTyr: 0.947 ± 0.146
0.0AsnXaa: 0.0 ± 0.0
Pro
7.091ProAla: 7.091 ± 0.759
1.53ProCys: 1.53 ± 0.193
3.035ProAsp: 3.035 ± 0.277
4.395ProGlu: 4.395 ± 0.353
2.04ProPhe: 2.04 ± 0.169
6.338ProGly: 6.338 ± 0.561
1.748ProHis: 1.748 ± 0.264
2.258ProIle: 2.258 ± 0.265
1.967ProLys: 1.967 ± 0.279
6.386ProLeu: 6.386 ± 0.51
1.214ProMet: 1.214 ± 0.196
1.797ProAsn: 1.797 ± 0.255
9.203ProPro: 9.203 ± 1.336
3.06ProGln: 3.06 ± 0.469
5.974ProArg: 5.974 ± 0.611
7.188ProSer: 7.188 ± 0.83
5.488ProThr: 5.488 ± 0.709
5.318ProVal: 5.318 ± 0.342
1.19ProTrp: 1.19 ± 0.191
1.506ProTyr: 1.506 ± 0.231
0.0ProXaa: 0.0 ± 0.0
Gln
4.104GlnAla: 4.104 ± 0.387
0.559GlnCys: 0.559 ± 0.127
2.477GlnAsp: 2.477 ± 0.263
2.185GlnGlu: 2.185 ± 0.31
1.457GlnPhe: 1.457 ± 0.247
3.011GlnGly: 3.011 ± 0.301
0.801GlnHis: 0.801 ± 0.143
1.36GlnIle: 1.36 ± 0.152
1.554GlnLys: 1.554 ± 0.198
3.23GlnLeu: 3.23 ± 0.335
0.826GlnMet: 0.826 ± 0.15
1.311GlnAsn: 1.311 ± 0.168
2.865GlnPro: 2.865 ± 0.399
2.453GlnGln: 2.453 ± 0.403
2.525GlnArg: 2.525 ± 0.266
2.671GlnSer: 2.671 ± 0.344
2.331GlnThr: 2.331 ± 0.26
2.234GlnVal: 2.234 ± 0.254
0.486GlnTrp: 0.486 ± 0.116
0.996GlnTyr: 0.996 ± 0.169
0.0GlnXaa: 0.0 ± 0.0
Arg
5.682ArgAla: 5.682 ± 0.439
1.263ArgCys: 1.263 ± 0.176
3.545ArgAsp: 3.545 ± 0.299
3.885ArgGlu: 3.885 ± 0.402
1.724ArgPhe: 1.724 ± 0.199
5.367ArgGly: 5.367 ± 0.657
1.991ArgHis: 1.991 ± 0.239
1.676ArgIle: 1.676 ± 0.236
2.04ArgLys: 2.04 ± 0.26
6.168ArgLeu: 6.168 ± 0.499
0.971ArgMet: 0.971 ± 0.168
1.7ArgAsn: 1.7 ± 0.158
5.391ArgPro: 5.391 ± 0.499
2.428ArgGln: 2.428 ± 0.278
6.896ArgArg: 6.896 ± 0.85
4.492ArgSer: 4.492 ± 0.467
2.987ArgThr: 2.987 ± 0.24
4.711ArgVal: 4.711 ± 0.355
0.826ArgTrp: 0.826 ± 0.185
1.651ArgTyr: 1.651 ± 0.181
0.0ArgXaa: 0.0 ± 0.0
Ser
6.022SerAla: 6.022 ± 0.695
1.481SerCys: 1.481 ± 0.221
3.618SerAsp: 3.618 ± 0.398
3.594SerGlu: 3.594 ± 0.315
2.695SerPhe: 2.695 ± 0.294
6.751SerGly: 6.751 ± 0.598
2.161SerHis: 2.161 ± 0.265
2.695SerIle: 2.695 ± 0.215
2.04SerLys: 2.04 ± 0.226
8.305SerLeu: 8.305 ± 0.493
1.676SerMet: 1.676 ± 0.196
2.04SerAsn: 2.04 ± 0.205
7.115SerPro: 7.115 ± 0.777
3.4SerGln: 3.4 ± 0.355
4.225SerArg: 4.225 ± 0.377
6.556SerSer: 6.556 ± 0.6
5.197SerThr: 5.197 ± 0.595
4.589SerVal: 4.589 ± 0.356
0.874SerTrp: 0.874 ± 0.15
2.064SerTyr: 2.064 ± 0.265
0.0SerXaa: 0.0 ± 0.0
Thr
5.415ThrAla: 5.415 ± 0.377
1.117ThrCys: 1.117 ± 0.166
2.793ThrAsp: 2.793 ± 0.285
2.817ThrGlu: 2.817 ± 0.29
2.501ThrPhe: 2.501 ± 0.251
4.104ThrGly: 4.104 ± 0.381
1.433ThrHis: 1.433 ± 0.158
1.991ThrIle: 1.991 ± 0.215
1.603ThrLys: 1.603 ± 0.216
6.945ThrLeu: 6.945 ± 0.438
1.166ThrMet: 1.166 ± 0.183
2.088ThrAsn: 2.088 ± 0.274
5.561ThrPro: 5.561 ± 0.577
2.185ThrGln: 2.185 ± 0.252
3.837ThrArg: 3.837 ± 0.32
5.464ThrSer: 5.464 ± 0.947
4.347ThrThr: 4.347 ± 0.858
4.638ThrVal: 4.638 ± 0.346
0.826ThrTrp: 0.826 ± 0.161
2.355ThrTyr: 2.355 ± 0.24
0.0ThrXaa: 0.0 ± 0.0
Val
5.876ValAla: 5.876 ± 0.467
1.651ValCys: 1.651 ± 0.214
2.72ValAsp: 2.72 ± 0.325
3.327ValGlu: 3.327 ± 0.334
2.987ValPhe: 2.987 ± 0.344
3.521ValGly: 3.521 ± 0.296
1.311ValHis: 1.311 ± 0.161
2.21ValIle: 2.21 ± 0.256
2.258ValLys: 2.258 ± 0.244
6.654ValLeu: 6.654 ± 0.513
1.578ValMet: 1.578 ± 0.224
1.918ValAsn: 1.918 ± 0.272
4.978ValPro: 4.978 ± 0.426
2.501ValGln: 2.501 ± 0.295
3.424ValArg: 3.424 ± 0.315
5.391ValSer: 5.391 ± 0.335
5.367ValThr: 5.367 ± 0.502
4.201ValVal: 4.201 ± 0.333
0.559ValTrp: 0.559 ± 0.116
2.404ValTyr: 2.404 ± 0.262
0.0ValXaa: 0.0 ± 0.0
Trp
0.996TrpAla: 0.996 ± 0.143
0.219TrpCys: 0.219 ± 0.064
0.559TrpAsp: 0.559 ± 0.11
0.534TrpGlu: 0.534 ± 0.101
0.437TrpPhe: 0.437 ± 0.101
0.607TrpGly: 0.607 ± 0.126
0.413TrpHis: 0.413 ± 0.096
0.461TrpIle: 0.461 ± 0.097
0.34TrpLys: 0.34 ± 0.093
1.336TrpLeu: 1.336 ± 0.163
0.389TrpMet: 0.389 ± 0.122
0.437TrpAsn: 0.437 ± 0.097
1.044TrpPro: 1.044 ± 0.172
0.389TrpGln: 0.389 ± 0.095
0.947TrpArg: 0.947 ± 0.164
0.631TrpSer: 0.631 ± 0.143
0.947TrpThr: 0.947 ± 0.162
0.704TrpVal: 0.704 ± 0.151
0.146TrpTrp: 0.146 ± 0.062
0.316TrpTyr: 0.316 ± 0.074
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.647TyrAla: 2.647 ± 0.216
0.801TyrCys: 0.801 ± 0.133
1.36TyrAsp: 1.36 ± 0.197
1.457TyrGlu: 1.457 ± 0.139
1.53TyrPhe: 1.53 ± 0.223
1.821TyrGly: 1.821 ± 0.228
0.753TyrHis: 0.753 ± 0.13
1.238TyrIle: 1.238 ± 0.18
1.044TyrLys: 1.044 ± 0.16
2.914TyrLeu: 2.914 ± 0.343
0.389TyrMet: 0.389 ± 0.109
1.19TyrAsn: 1.19 ± 0.205
1.408TyrPro: 1.408 ± 0.138
0.874TyrGln: 0.874 ± 0.136
1.481TyrArg: 1.481 ± 0.189
2.015TyrSer: 2.015 ± 0.25
1.797TyrThr: 1.797 ± 0.192
2.574TyrVal: 2.574 ± 0.3
0.389TyrTrp: 0.389 ± 0.089
1.068TyrTyr: 1.068 ± 0.177
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (41182 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski