Amino acid dipepetide frequency for Human herpesvirus 6A (strain Uganda-1102) (HHV-6 variant A) (Human B lymphotropic virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.334AlaAla: 3.334 ± 0.359
1.279AlaCys: 1.279 ± 0.274
2.39AlaAsp: 2.39 ± 0.275
2.663AlaGlu: 2.663 ± 0.266
2.852AlaPhe: 2.852 ± 0.198
2.202AlaGly: 2.202 ± 0.254
1.363AlaHis: 1.363 ± 0.167
3.376AlaIle: 3.376 ± 0.282
2.516AlaLys: 2.516 ± 0.23
4.655AlaLeu: 4.655 ± 0.344
1.342AlaMet: 1.342 ± 0.15
2.097AlaAsn: 2.097 ± 0.2
1.95AlaPro: 1.95 ± 0.256
1.719AlaGln: 1.719 ± 0.163
3.627AlaArg: 3.627 ± 0.428
4.906AlaSer: 4.906 ± 0.966
3.271AlaThr: 3.271 ± 0.242
3.376AlaVal: 3.376 ± 0.279
0.44AlaTrp: 0.44 ± 0.115
1.426AlaTyr: 1.426 ± 0.151
0.0AlaXaa: 0.0 ± 0.0
Cys
1.719CysAla: 1.719 ± 0.422
0.566CysCys: 0.566 ± 0.092
1.447CysAsp: 1.447 ± 0.198
1.51CysGlu: 1.51 ± 0.194
1.153CysPhe: 1.153 ± 0.186
1.405CysGly: 1.405 ± 0.176
0.545CysHis: 0.545 ± 0.108
1.719CysIle: 1.719 ± 0.202
1.405CysLys: 1.405 ± 0.221
3.25CysLeu: 3.25 ± 0.603
1.761CysMet: 1.761 ± 1.073
1.237CysAsn: 1.237 ± 0.168
0.776CysPro: 0.776 ± 0.147
0.881CysGln: 0.881 ± 0.144
1.573CysArg: 1.573 ± 0.206
1.992CysSer: 1.992 ± 0.226
1.09CysThr: 1.09 ± 0.159
3.292CysVal: 3.292 ± 1.362
0.294CysTrp: 0.294 ± 0.076
0.839CysTyr: 0.839 ± 0.14
0.0CysXaa: 0.0 ± 0.0
Asp
2.306AspAla: 2.306 ± 0.229
1.195AspCys: 1.195 ± 0.16
2.956AspAsp: 2.956 ± 0.277
3.439AspGlu: 3.439 ± 0.318
2.852AspPhe: 2.852 ± 0.257
2.411AspGly: 2.411 ± 0.286
1.069AspHis: 1.069 ± 0.16
4.11AspIle: 4.11 ± 0.359
2.789AspLys: 2.789 ± 0.28
5.409AspLeu: 5.409 ± 0.363
1.342AspMet: 1.342 ± 0.188
2.726AspAsn: 2.726 ± 0.275
2.348AspPro: 2.348 ± 0.21
1.132AspGln: 1.132 ± 0.142
1.782AspArg: 1.782 ± 0.205
3.543AspSer: 3.543 ± 0.404
3.376AspThr: 3.376 ± 0.287
3.963AspVal: 3.963 ± 0.289
0.482AspTrp: 0.482 ± 0.091
1.866AspTyr: 1.866 ± 0.244
0.0AspXaa: 0.0 ± 0.0
Glu
2.369GluAla: 2.369 ± 0.185
1.174GluCys: 1.174 ± 0.17
2.768GluAsp: 2.768 ± 0.249
3.711GluGlu: 3.711 ± 0.351
2.411GluPhe: 2.411 ± 0.267
2.16GluGly: 2.16 ± 0.246
1.321GluHis: 1.321 ± 0.151
3.963GluIle: 3.963 ± 0.288
4.571GluLys: 4.571 ± 0.42
4.969GluLeu: 4.969 ± 0.36
1.782GluMet: 1.782 ± 0.198
3.963GluAsn: 3.963 ± 0.281
1.908GluPro: 1.908 ± 0.18
2.474GluGln: 2.474 ± 0.258
3.166GluArg: 3.166 ± 0.277
4.822GluSer: 4.822 ± 0.426
4.11GluThr: 4.11 ± 0.328
2.747GluVal: 2.747 ± 0.339
0.419GluTrp: 0.419 ± 0.098
1.656GluTyr: 1.656 ± 0.189
0.0GluXaa: 0.0 ± 0.0
Phe
2.642PheAla: 2.642 ± 0.217
1.698PheCys: 1.698 ± 0.207
2.537PheAsp: 2.537 ± 0.234
2.348PheGlu: 2.348 ± 0.218
3.418PhePhe: 3.418 ± 0.324
2.139PheGly: 2.139 ± 0.296
1.342PheHis: 1.342 ± 0.175
3.481PheIle: 3.481 ± 0.384
2.768PheLys: 2.768 ± 0.226
6.08PheLeu: 6.08 ± 0.474
1.195PheMet: 1.195 ± 0.158
2.998PheAsn: 2.998 ± 0.255
2.202PhePro: 2.202 ± 0.181
1.614PheGln: 1.614 ± 0.222
2.055PheArg: 2.055 ± 0.214
4.466PheSer: 4.466 ± 0.287
3.208PheThr: 3.208 ± 0.319
3.292PheVal: 3.292 ± 0.296
0.377PheTrp: 0.377 ± 0.095
2.076PheTyr: 2.076 ± 0.201
0.0PheXaa: 0.0 ± 0.0
Gly
2.16GlyAla: 2.16 ± 0.261
0.902GlyCys: 0.902 ± 0.17
2.642GlyAsp: 2.642 ± 0.29
2.495GlyGlu: 2.495 ± 0.241
1.929GlyPhe: 1.929 ± 0.215
2.285GlyGly: 2.285 ± 0.338
1.048GlyHis: 1.048 ± 0.16
2.285GlyIle: 2.285 ± 0.293
2.935GlyLys: 2.935 ± 0.228
4.403GlyLeu: 4.403 ± 0.297
1.027GlyMet: 1.027 ± 0.158
2.558GlyAsn: 2.558 ± 0.325
1.552GlyPro: 1.552 ± 0.226
1.719GlyGln: 1.719 ± 0.186
2.747GlyArg: 2.747 ± 0.384
2.789GlySer: 2.789 ± 0.257
2.537GlyThr: 2.537 ± 0.279
2.768GlyVal: 2.768 ± 0.272
0.503GlyTrp: 0.503 ± 0.097
1.573GlyTyr: 1.573 ± 0.207
0.0GlyXaa: 0.0 ± 0.0
His
1.426HisAla: 1.426 ± 0.181
0.65HisCys: 0.65 ± 0.139
1.489HisAsp: 1.489 ± 0.204
1.195HisGlu: 1.195 ± 0.153
1.489HisPhe: 1.489 ± 0.178
1.3HisGly: 1.3 ± 0.197
0.692HisHis: 0.692 ± 0.142
1.614HisIle: 1.614 ± 0.201
1.174HisLys: 1.174 ± 0.154
2.495HisLeu: 2.495 ± 0.261
0.734HisMet: 0.734 ± 0.103
1.195HisAsn: 1.195 ± 0.17
1.048HisPro: 1.048 ± 0.145
0.776HisGln: 0.776 ± 0.147
1.782HisArg: 1.782 ± 0.214
1.593HisSer: 1.593 ± 0.164
1.51HisThr: 1.51 ± 0.172
2.013HisVal: 2.013 ± 0.212
0.168HisTrp: 0.168 ± 0.054
0.734HisTyr: 0.734 ± 0.143
0.0HisXaa: 0.0 ± 0.0
Ile
3.292IleAla: 3.292 ± 0.311
1.614IleCys: 1.614 ± 0.205
3.187IleAsp: 3.187 ± 0.295
3.418IleGlu: 3.418 ± 0.404
2.977IlePhe: 2.977 ± 0.291
2.537IleGly: 2.537 ± 0.274
1.384IleHis: 1.384 ± 0.192
3.984IleIle: 3.984 ± 0.369
4.361IleLys: 4.361 ± 0.335
6.584IleLeu: 6.584 ± 0.499
1.698IleMet: 1.698 ± 0.239
3.439IleAsn: 3.439 ± 0.339
3.564IlePro: 3.564 ± 0.236
2.432IleGln: 2.432 ± 0.236
2.977IleArg: 2.977 ± 0.268
5.703IleSer: 5.703 ± 0.344
4.089IleThr: 4.089 ± 0.357
4.172IleVal: 4.172 ± 0.323
0.398IleTrp: 0.398 ± 0.086
3.271IleTyr: 3.271 ± 0.311
0.0IleXaa: 0.0 ± 0.0
Lys
2.747LysAla: 2.747 ± 0.302
1.531LysCys: 1.531 ± 0.191
3.25LysAsp: 3.25 ± 0.291
3.9LysGlu: 3.9 ± 0.327
2.935LysPhe: 2.935 ± 0.293
1.51LysGly: 1.51 ± 0.19
1.845LysHis: 1.845 ± 0.189
4.843LysIle: 4.843 ± 0.418
5.179LysLys: 5.179 ± 0.449
5.472LysLeu: 5.472 ± 0.416
1.51LysMet: 1.51 ± 0.181
4.277LysAsn: 4.277 ± 0.338
2.264LysPro: 2.264 ± 0.167
3.208LysGln: 3.208 ± 0.267
2.956LysArg: 2.956 ± 0.292
4.487LysSer: 4.487 ± 0.322
5.179LysThr: 5.179 ± 0.388
2.474LysVal: 2.474 ± 0.267
0.377LysTrp: 0.377 ± 0.089
1.992LysTyr: 1.992 ± 0.197
0.0LysXaa: 0.0 ± 0.0
Leu
4.55LeuAla: 4.55 ± 0.368
3.9LeuCys: 3.9 ± 0.665
4.403LeuAsp: 4.403 ± 0.395
4.634LeuGlu: 4.634 ± 0.361
5.598LeuPhe: 5.598 ± 0.384
4.256LeuGly: 4.256 ± 0.384
2.747LeuHis: 2.747 ± 0.245
5.556LeuIle: 5.556 ± 0.387
5.745LeuLys: 5.745 ± 0.336
10.316LeuLeu: 10.316 ± 0.634
2.327LeuMet: 2.327 ± 0.236
5.116LeuAsn: 5.116 ± 0.36
4.99LeuPro: 4.99 ± 0.43
4.068LeuGln: 4.068 ± 0.284
5.137LeuArg: 5.137 ± 0.378
9.184LeuSer: 9.184 ± 0.537
6.605LeuThr: 6.605 ± 0.399
4.068LeuVal: 4.068 ± 0.375
1.195LeuTrp: 1.195 ± 0.185
3.648LeuTyr: 3.648 ± 0.397
0.0LeuXaa: 0.0 ± 0.0
Met
1.635MetAla: 1.635 ± 0.226
1.656MetCys: 1.656 ± 1.004
1.321MetAsp: 1.321 ± 0.185
1.719MetGlu: 1.719 ± 0.187
1.384MetPhe: 1.384 ± 0.167
1.048MetGly: 1.048 ± 0.151
0.524MetHis: 0.524 ± 0.086
1.573MetIle: 1.573 ± 0.188
1.552MetLys: 1.552 ± 0.176
2.558MetLeu: 2.558 ± 0.254
0.692MetMet: 0.692 ± 0.171
1.153MetAsn: 1.153 ± 0.186
0.881MetPro: 0.881 ± 0.131
0.944MetGln: 0.944 ± 0.13
1.132MetArg: 1.132 ± 0.163
2.034MetSer: 2.034 ± 0.232
1.321MetThr: 1.321 ± 0.167
0.944MetVal: 0.944 ± 0.128
0.273MetTrp: 0.273 ± 0.081
1.09MetTyr: 1.09 ± 0.144
0.0MetXaa: 0.0 ± 0.0
Asn
2.495AsnAla: 2.495 ± 0.257
1.048AsnCys: 1.048 ± 0.162
3.145AsnAsp: 3.145 ± 0.246
3.187AsnGlu: 3.187 ± 0.249
2.726AsnPhe: 2.726 ± 0.223
2.223AsnGly: 2.223 ± 0.225
1.237AsnHis: 1.237 ± 0.163
3.837AsnIle: 3.837 ± 0.346
3.418AsnLys: 3.418 ± 0.306
5.64AsnLeu: 5.64 ± 0.32
1.279AsnMet: 1.279 ± 0.271
3.46AsnAsn: 3.46 ± 0.327
2.411AsnPro: 2.411 ± 0.295
1.426AsnGln: 1.426 ± 0.16
2.474AsnArg: 2.474 ± 0.212
4.801AsnSer: 4.801 ± 0.409
3.648AsnThr: 3.648 ± 0.353
3.837AsnVal: 3.837 ± 0.351
0.524AsnTrp: 0.524 ± 0.108
1.929AsnTyr: 1.929 ± 0.226
0.0AsnXaa: 0.0 ± 0.0
Pro
2.243ProAla: 2.243 ± 0.224
1.027ProCys: 1.027 ± 0.154
2.118ProAsp: 2.118 ± 0.192
2.579ProGlu: 2.579 ± 0.192
2.243ProPhe: 2.243 ± 0.249
1.992ProGly: 1.992 ± 0.233
0.818ProHis: 0.818 ± 0.126
3.229ProIle: 3.229 ± 0.3
3.082ProLys: 3.082 ± 0.247
4.277ProLeu: 4.277 ± 0.389
1.09ProMet: 1.09 ± 0.147
2.306ProAsn: 2.306 ± 0.252
2.684ProPro: 2.684 ± 0.463
1.447ProGln: 1.447 ± 0.186
2.39ProArg: 2.39 ± 0.265
3.816ProSer: 3.816 ± 0.518
2.453ProThr: 2.453 ± 0.263
2.852ProVal: 2.852 ± 0.312
0.545ProTrp: 0.545 ± 0.124
1.51ProTyr: 1.51 ± 0.183
0.0ProXaa: 0.0 ± 0.0
Gln
1.174GlnAla: 1.174 ± 0.158
0.776GlnCys: 0.776 ± 0.125
1.698GlnAsp: 1.698 ± 0.196
2.097GlnGlu: 2.097 ± 0.206
1.677GlnPhe: 1.677 ± 0.221
1.531GlnGly: 1.531 ± 0.185
0.797GlnHis: 0.797 ± 0.119
2.768GlnIle: 2.768 ± 0.281
2.6GlnLys: 2.6 ± 0.291
3.355GlnLeu: 3.355 ± 0.252
0.713GlnMet: 0.713 ± 0.147
2.621GlnAsn: 2.621 ± 0.3
1.3GlnPro: 1.3 ± 0.191
1.824GlnGln: 1.824 ± 0.233
1.992GlnArg: 1.992 ± 0.212
2.831GlnSer: 2.831 ± 0.264
2.537GlnThr: 2.537 ± 0.219
1.866GlnVal: 1.866 ± 0.187
0.252GlnTrp: 0.252 ± 0.073
1.489GlnTyr: 1.489 ± 0.225
0.0GlnXaa: 0.0 ± 0.0
Arg
3.669ArgAla: 3.669 ± 0.776
1.195ArgCys: 1.195 ± 0.155
3.019ArgAsp: 3.019 ± 0.372
2.852ArgGlu: 2.852 ± 0.257
2.327ArgPhe: 2.327 ± 0.247
3.04ArgGly: 3.04 ± 0.319
2.139ArgHis: 2.139 ± 0.24
2.789ArgIle: 2.789 ± 0.309
3.187ArgLys: 3.187 ± 0.264
4.864ArgLeu: 4.864 ± 0.327
1.342ArgMet: 1.342 ± 0.191
2.642ArgAsn: 2.642 ± 0.221
2.39ArgPro: 2.39 ± 0.318
2.243ArgGln: 2.243 ± 0.231
3.795ArgArg: 3.795 ± 0.463
3.564ArgSer: 3.564 ± 0.316
2.097ArgThr: 2.097 ± 0.231
4.969ArgVal: 4.969 ± 1.446
0.566ArgTrp: 0.566 ± 0.133
1.635ArgTyr: 1.635 ± 0.173
0.0ArgXaa: 0.0 ± 0.0
Ser
4.361SerAla: 4.361 ± 0.401
2.097SerCys: 2.097 ± 0.188
4.403SerAsp: 4.403 ± 0.315
4.906SerGlu: 4.906 ± 0.369
4.634SerPhe: 4.634 ± 0.394
3.585SerGly: 3.585 ± 0.341
1.782SerHis: 1.782 ± 0.205
5.074SerIle: 5.074 ± 0.335
4.948SerLys: 4.948 ± 0.465
7.842SerLeu: 7.842 ± 0.416
1.782SerMet: 1.782 ± 0.2
4.151SerAsn: 4.151 ± 0.426
4.256SerPro: 4.256 ± 0.455
2.16SerGln: 2.16 ± 0.207
4.592SerArg: 4.592 ± 1.066
8.429SerSer: 8.429 ± 1.518
5.158SerThr: 5.158 ± 0.381
5.472SerVal: 5.472 ± 0.568
0.797SerTrp: 0.797 ± 0.128
2.223SerTyr: 2.223 ± 0.215
0.0SerXaa: 0.0 ± 0.0
Thr
3.46ThrAla: 3.46 ± 0.28
1.761ThrCys: 1.761 ± 0.182
2.998ThrAsp: 2.998 ± 0.277
4.277ThrGlu: 4.277 ± 0.324
3.187ThrPhe: 3.187 ± 0.317
2.726ThrGly: 2.726 ± 0.264
2.097ThrHis: 2.097 ± 0.234
3.543ThrIle: 3.543 ± 0.298
3.439ThrLys: 3.439 ± 0.278
5.892ThrLeu: 5.892 ± 0.412
1.447ThrMet: 1.447 ± 0.195
3.313ThrAsn: 3.313 ± 0.315
3.187ThrPro: 3.187 ± 0.301
2.327ThrGln: 2.327 ± 0.237
2.558ThrArg: 2.558 ± 0.279
4.864ThrSer: 4.864 ± 0.399
4.403ThrThr: 4.403 ± 0.428
4.822ThrVal: 4.822 ± 0.349
0.398ThrTrp: 0.398 ± 0.1
2.097ThrTyr: 2.097 ± 0.214
0.0ThrXaa: 0.0 ± 0.0
Val
2.935ValAla: 2.935 ± 0.223
3.25ValCys: 3.25 ± 1.593
2.914ValAsp: 2.914 ± 0.302
2.977ValGlu: 2.977 ± 0.287
3.732ValPhe: 3.732 ± 0.316
2.306ValGly: 2.306 ± 0.262
1.552ValHis: 1.552 ± 0.189
4.005ValIle: 4.005 ± 0.277
3.271ValLys: 3.271 ± 0.347
6.059ValLeu: 6.059 ± 0.427
1.552ValMet: 1.552 ± 0.229
2.977ValAsn: 2.977 ± 0.289
2.852ValPro: 2.852 ± 0.248
2.097ValGln: 2.097 ± 0.198
5.053ValArg: 5.053 ± 1.39
5.451ValSer: 5.451 ± 0.531
3.837ValThr: 3.837 ± 0.324
3.501ValVal: 3.501 ± 0.346
0.503ValTrp: 0.503 ± 0.104
2.747ValTyr: 2.747 ± 0.249
0.0ValXaa: 0.0 ± 0.0
Trp
0.294TrpAla: 0.294 ± 0.08
0.294TrpCys: 0.294 ± 0.083
0.315TrpAsp: 0.315 ± 0.077
0.566TrpGlu: 0.566 ± 0.114
0.566TrpPhe: 0.566 ± 0.098
0.44TrpGly: 0.44 ± 0.114
0.105TrpHis: 0.105 ± 0.043
0.776TrpIle: 0.776 ± 0.116
0.44TrpLys: 0.44 ± 0.101
0.985TrpLeu: 0.985 ± 0.162
0.189TrpMet: 0.189 ± 0.078
0.377TrpAsn: 0.377 ± 0.09
0.734TrpPro: 0.734 ± 0.184
0.335TrpGln: 0.335 ± 0.077
0.461TrpArg: 0.461 ± 0.137
0.587TrpSer: 0.587 ± 0.116
0.692TrpThr: 0.692 ± 0.142
0.545TrpVal: 0.545 ± 0.117
0.105TrpTrp: 0.105 ± 0.051
0.273TrpTyr: 0.273 ± 0.062
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.971TyrAla: 1.971 ± 0.207
0.86TyrCys: 0.86 ± 0.116
2.097TyrAsp: 2.097 ± 0.216
1.992TyrGlu: 1.992 ± 0.192
1.845TyrPhe: 1.845 ± 0.197
1.761TyrGly: 1.761 ± 0.209
0.671TyrHis: 0.671 ± 0.119
2.558TyrIle: 2.558 ± 0.277
2.558TyrLys: 2.558 ± 0.277
2.935TyrLeu: 2.935 ± 0.34
0.608TyrMet: 0.608 ± 0.117
2.139TyrAsn: 2.139 ± 0.236
1.405TyrPro: 1.405 ± 0.155
1.069TyrGln: 1.069 ± 0.187
2.076TyrArg: 2.076 ± 0.229
2.789TyrSer: 2.789 ± 0.227
1.677TyrThr: 1.677 ± 0.177
2.6TyrVal: 2.6 ± 0.243
0.461TyrTrp: 0.461 ± 0.089
1.006TyrTyr: 1.006 ± 0.158
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 112 proteins (47695 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski