Amino acid dipepetide frequency for Bovine gammaherpesvirus 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.615AlaAla: 5.615 ± 0.646
2.269AlaCys: 2.269 ± 0.242
2.354AlaAsp: 2.354 ± 0.256
3.119AlaGlu: 3.119 ± 0.271
3.148AlaPhe: 3.148 ± 0.296
3.119AlaGly: 3.119 ± 0.402
1.56AlaHis: 1.56 ± 0.188
3.403AlaIle: 3.403 ± 0.254
3.942AlaLys: 3.942 ± 0.379
6.749AlaLeu: 6.749 ± 0.513
1.219AlaMet: 1.219 ± 0.206
2.722AlaAsn: 2.722 ± 0.252
4.367AlaPro: 4.367 ± 0.615
2.666AlaGln: 2.666 ± 0.316
2.013AlaArg: 2.013 ± 0.278
5.53AlaSer: 5.53 ± 0.435
4.367AlaThr: 4.367 ± 0.401
3.97AlaVal: 3.97 ± 0.336
0.567AlaTrp: 0.567 ± 0.132
2.41AlaTyr: 2.41 ± 0.263
0.0AlaXaa: 0.0 ± 0.0
Cys
1.418CysAla: 1.418 ± 0.21
0.737CysCys: 0.737 ± 0.143
0.992CysAsp: 0.992 ± 0.172
1.191CysGlu: 1.191 ± 0.212
1.616CysPhe: 1.616 ± 0.245
1.021CysGly: 1.021 ± 0.184
0.681CysHis: 0.681 ± 0.125
1.588CysIle: 1.588 ± 0.221
1.588CysLys: 1.588 ± 0.237
3.148CysLeu: 3.148 ± 0.455
0.567CysMet: 0.567 ± 0.149
1.304CysAsn: 1.304 ± 0.219
1.163CysPro: 1.163 ± 0.231
1.078CysGln: 1.078 ± 0.165
0.709CysArg: 0.709 ± 0.145
2.467CysSer: 2.467 ± 0.32
1.588CysThr: 1.588 ± 0.188
1.645CysVal: 1.645 ± 0.25
0.255CysTrp: 0.255 ± 0.069
1.248CysTyr: 1.248 ± 0.181
0.0CysXaa: 0.0 ± 0.0
Asp
2.722AspAla: 2.722 ± 0.266
0.964AspCys: 0.964 ± 0.155
2.127AspAsp: 2.127 ± 0.297
2.864AspGlu: 2.864 ± 0.33
2.637AspPhe: 2.637 ± 0.295
2.155AspGly: 2.155 ± 0.291
0.652AspHis: 0.652 ± 0.155
2.666AspIle: 2.666 ± 0.297
2.297AspLys: 2.297 ± 0.277
4.565AspLeu: 4.565 ± 0.343
1.134AspMet: 1.134 ± 0.211
1.815AspAsn: 1.815 ± 0.222
2.552AspPro: 2.552 ± 0.248
1.56AspGln: 1.56 ± 0.257
1.475AspArg: 1.475 ± 0.178
3.46AspSer: 3.46 ± 0.291
2.751AspThr: 2.751 ± 0.284
3.063AspVal: 3.063 ± 0.328
0.482AspTrp: 0.482 ± 0.118
1.645AspTyr: 1.645 ± 0.199
0.0AspXaa: 0.0 ± 0.0
Glu
4.083GluAla: 4.083 ± 0.352
1.418GluCys: 1.418 ± 0.255
3.204GluAsp: 3.204 ± 0.332
6.324GluGlu: 6.324 ± 2.377
2.722GluPhe: 2.722 ± 0.3
1.701GluGly: 1.701 ± 0.194
1.56GluHis: 1.56 ± 0.213
3.204GluIle: 3.204 ± 0.336
2.751GluLys: 2.751 ± 0.251
5.359GluLeu: 5.359 ± 0.452
1.475GluMet: 1.475 ± 0.182
3.148GluAsn: 3.148 ± 0.334
1.928GluPro: 1.928 ± 0.275
2.637GluGln: 2.637 ± 0.332
2.127GluArg: 2.127 ± 0.305
4.792GluSer: 4.792 ± 0.438
3.686GluThr: 3.686 ± 0.465
3.119GluVal: 3.119 ± 0.303
0.595GluTrp: 0.595 ± 0.142
1.616GluTyr: 1.616 ± 0.241
0.0GluXaa: 0.0 ± 0.0
Phe
2.807PheAla: 2.807 ± 0.329
1.304PheCys: 1.304 ± 0.217
2.694PheAsp: 2.694 ± 0.29
2.694PheGlu: 2.694 ± 0.29
3.233PhePhe: 3.233 ± 0.488
2.183PheGly: 2.183 ± 0.235
1.163PheHis: 1.163 ± 0.196
2.977PheIle: 2.977 ± 0.272
3.913PheLys: 3.913 ± 0.357
5.927PheLeu: 5.927 ± 0.443
1.475PheMet: 1.475 ± 0.243
3.601PheAsn: 3.601 ± 0.34
2.382PhePro: 2.382 ± 0.269
2.098PheGln: 2.098 ± 0.287
1.163PheArg: 1.163 ± 0.193
3.743PheSer: 3.743 ± 0.359
2.637PheThr: 2.637 ± 0.328
3.403PheVal: 3.403 ± 0.332
0.652PheTrp: 0.652 ± 0.139
2.467PheTyr: 2.467 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
2.722GlyAla: 2.722 ± 0.254
0.936GlyCys: 0.936 ± 0.158
2.042GlyAsp: 2.042 ± 0.298
2.013GlyGlu: 2.013 ± 0.282
2.098GlyPhe: 2.098 ± 0.249
1.985GlyGly: 1.985 ± 0.381
1.021GlyHis: 1.021 ± 0.156
2.694GlyIle: 2.694 ± 0.319
2.127GlyLys: 2.127 ± 0.272
4.877GlyLeu: 4.877 ± 0.531
0.737GlyMet: 0.737 ± 0.135
2.212GlyAsn: 2.212 ± 0.221
2.524GlyPro: 2.524 ± 0.421
1.957GlyGln: 1.957 ± 0.255
2.013GlyArg: 2.013 ± 0.449
3.97GlySer: 3.97 ± 0.447
2.24GlyThr: 2.24 ± 0.237
3.289GlyVal: 3.289 ± 0.252
0.397GlyTrp: 0.397 ± 0.097
1.219GlyTyr: 1.219 ± 0.234
0.0GlyXaa: 0.0 ± 0.0
His
1.191HisAla: 1.191 ± 0.164
0.624HisCys: 0.624 ± 0.163
0.709HisAsp: 0.709 ± 0.118
1.078HisGlu: 1.078 ± 0.161
1.163HisPhe: 1.163 ± 0.187
1.56HisGly: 1.56 ± 0.216
0.936HisHis: 0.936 ± 0.163
1.588HisIle: 1.588 ± 0.266
1.73HisLys: 1.73 ± 0.16
2.439HisLeu: 2.439 ± 0.209
0.822HisMet: 0.822 ± 0.166
1.219HisAsn: 1.219 ± 0.192
1.106HisPro: 1.106 ± 0.215
1.134HisGln: 1.134 ± 0.2
0.879HisArg: 0.879 ± 0.153
2.269HisSer: 2.269 ± 0.247
1.786HisThr: 1.786 ± 0.27
2.042HisVal: 2.042 ± 0.232
0.34HisTrp: 0.34 ± 0.109
1.106HisTyr: 1.106 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
3.091IleAla: 3.091 ± 0.326
1.616IleCys: 1.616 ± 0.285
2.325IleAsp: 2.325 ± 0.276
3.289IleGlu: 3.289 ± 0.405
3.601IlePhe: 3.601 ± 0.322
1.531IleGly: 1.531 ± 0.21
1.56IleHis: 1.56 ± 0.215
3.261IleIle: 3.261 ± 0.35
4.225IleLys: 4.225 ± 0.361
5.501IleLeu: 5.501 ± 0.391
1.361IleMet: 1.361 ± 0.222
3.828IleAsn: 3.828 ± 0.325
2.751IlePro: 2.751 ± 0.239
2.722IleGln: 2.722 ± 0.277
1.758IleArg: 1.758 ± 0.178
4.537IleSer: 4.537 ± 0.393
3.176IleThr: 3.176 ± 0.293
3.346IleVal: 3.346 ± 0.319
0.992IleTrp: 0.992 ± 0.163
2.269IleTyr: 2.269 ± 0.299
0.0IleXaa: 0.0 ± 0.0
Lys
3.97LysAla: 3.97 ± 0.399
1.588LysCys: 1.588 ± 0.24
3.204LysAsp: 3.204 ± 0.25
4.651LysGlu: 4.651 ± 0.437
2.921LysPhe: 2.921 ± 0.254
1.872LysGly: 1.872 ± 0.266
2.127LysHis: 2.127 ± 0.253
3.913LysIle: 3.913 ± 0.354
4.906LysLys: 4.906 ± 0.497
6.607LysLeu: 6.607 ± 0.503
1.333LysMet: 1.333 ± 0.211
4.225LysAsn: 4.225 ± 0.333
2.864LysPro: 2.864 ± 0.264
3.63LysGln: 3.63 ± 0.327
2.949LysArg: 2.949 ± 0.305
4.48LysSer: 4.48 ± 0.384
3.403LysThr: 3.403 ± 0.303
3.686LysVal: 3.686 ± 0.371
0.482LysTrp: 0.482 ± 0.119
1.928LysTyr: 1.928 ± 0.279
0.0LysXaa: 0.0 ± 0.0
Leu
7.061LeuAla: 7.061 ± 0.514
2.24LeuCys: 2.24 ± 0.335
4.509LeuAsp: 4.509 ± 0.389
5.898LeuGlu: 5.898 ± 0.529
5.473LeuPhe: 5.473 ± 0.439
4.197LeuGly: 4.197 ± 0.328
2.779LeuHis: 2.779 ± 0.264
4.906LeuIle: 4.906 ± 0.358
8.053LeuLys: 8.053 ± 0.629
10.605LeuLeu: 10.605 ± 0.513
1.985LeuMet: 1.985 ± 0.205
5.955LeuAsn: 5.955 ± 0.492
4.849LeuPro: 4.849 ± 0.347
5.813LeuGln: 5.813 ± 0.519
3.573LeuArg: 3.573 ± 0.282
9.046LeuSer: 9.046 ± 0.667
5.586LeuThr: 5.586 ± 0.385
5.671LeuVal: 5.671 ± 0.343
1.106LeuTrp: 1.106 ± 0.212
3.715LeuTyr: 3.715 ± 0.394
0.0LeuXaa: 0.0 ± 0.0
Met
2.183MetAla: 2.183 ± 0.254
0.51MetCys: 0.51 ± 0.13
1.163MetAsp: 1.163 ± 0.192
0.851MetGlu: 0.851 ± 0.152
1.616MetPhe: 1.616 ± 0.23
1.049MetGly: 1.049 ± 0.186
0.652MetHis: 0.652 ± 0.12
0.822MetIle: 0.822 ± 0.193
0.766MetLys: 0.766 ± 0.114
2.354MetLeu: 2.354 ± 0.271
0.454MetMet: 0.454 ± 0.116
0.737MetAsn: 0.737 ± 0.166
1.078MetPro: 1.078 ± 0.144
1.078MetGln: 1.078 ± 0.147
0.851MetArg: 0.851 ± 0.19
1.957MetSer: 1.957 ± 0.233
1.248MetThr: 1.248 ± 0.225
1.418MetVal: 1.418 ± 0.234
0.284MetTrp: 0.284 ± 0.087
1.134MetTyr: 1.134 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
3.346AsnAla: 3.346 ± 0.308
1.276AsnCys: 1.276 ± 0.158
1.503AsnAsp: 1.503 ± 0.217
2.58AsnGlu: 2.58 ± 0.315
2.751AsnPhe: 2.751 ± 0.345
2.269AsnGly: 2.269 ± 0.254
1.389AsnHis: 1.389 ± 0.205
3.857AsnIle: 3.857 ± 0.337
4.197AsnLys: 4.197 ± 0.277
5.501AsnLeu: 5.501 ± 0.484
1.503AsnMet: 1.503 ± 0.279
3.289AsnAsn: 3.289 ± 0.272
1.928AsnPro: 1.928 ± 0.216
2.439AsnGln: 2.439 ± 0.274
2.013AsnArg: 2.013 ± 0.331
4.821AsnSer: 4.821 ± 0.425
3.034AsnThr: 3.034 ± 0.304
3.658AsnVal: 3.658 ± 0.309
0.567AsnTrp: 0.567 ± 0.115
2.127AsnTyr: 2.127 ± 0.223
0.0AsnXaa: 0.0 ± 0.0
Pro
2.977ProAla: 2.977 ± 0.416
1.446ProCys: 1.446 ± 0.255
2.269ProAsp: 2.269 ± 0.293
3.119ProGlu: 3.119 ± 0.362
1.957ProPhe: 1.957 ± 0.249
2.609ProGly: 2.609 ± 0.358
1.361ProHis: 1.361 ± 0.21
2.807ProIle: 2.807 ± 0.265
2.779ProLys: 2.779 ± 0.335
4.027ProLeu: 4.027 ± 0.281
0.936ProMet: 0.936 ± 0.157
2.07ProAsn: 2.07 ± 0.257
2.949ProPro: 2.949 ± 0.371
2.666ProGln: 2.666 ± 0.543
2.127ProArg: 2.127 ± 0.368
4.821ProSer: 4.821 ± 0.814
3.006ProThr: 3.006 ± 0.31
3.97ProVal: 3.97 ± 0.369
0.681ProTrp: 0.681 ± 0.16
1.219ProTyr: 1.219 ± 0.17
0.0ProXaa: 0.0 ± 0.0
Gln
2.666GlnAla: 2.666 ± 0.304
1.078GlnCys: 1.078 ± 0.171
2.127GlnAsp: 2.127 ± 0.225
2.921GlnGlu: 2.921 ± 0.38
1.786GlnPhe: 1.786 ± 0.236
2.41GlnGly: 2.41 ± 0.416
1.304GlnHis: 1.304 ± 0.226
2.58GlnIle: 2.58 ± 0.209
3.063GlnLys: 3.063 ± 0.328
4.452GlnLeu: 4.452 ± 0.402
0.709GlnMet: 0.709 ± 0.165
2.637GlnAsn: 2.637 ± 0.248
3.006GlnPro: 3.006 ± 0.623
3.233GlnGln: 3.233 ± 0.647
1.531GlnArg: 1.531 ± 0.187
3.346GlnSer: 3.346 ± 0.349
2.439GlnThr: 2.439 ± 0.302
2.694GlnVal: 2.694 ± 0.27
0.681GlnTrp: 0.681 ± 0.159
1.503GlnTyr: 1.503 ± 0.232
0.0GlnXaa: 0.0 ± 0.0
Arg
2.779ArgAla: 2.779 ± 0.382
1.276ArgCys: 1.276 ± 0.241
1.872ArgAsp: 1.872 ± 0.293
2.269ArgGlu: 2.269 ± 0.278
1.276ArgPhe: 1.276 ± 0.159
2.666ArgGly: 2.666 ± 0.729
0.964ArgHis: 0.964 ± 0.161
1.616ArgIle: 1.616 ± 0.185
2.155ArgLys: 2.155 ± 0.286
3.686ArgLeu: 3.686 ± 0.396
0.936ArgMet: 0.936 ± 0.182
1.701ArgAsn: 1.701 ± 0.272
1.928ArgPro: 1.928 ± 0.377
1.248ArgGln: 1.248 ± 0.182
2.098ArgArg: 2.098 ± 0.325
2.127ArgSer: 2.127 ± 0.354
1.134ArgThr: 1.134 ± 0.16
2.127ArgVal: 2.127 ± 0.287
0.539ArgTrp: 0.539 ± 0.126
0.879ArgTyr: 0.879 ± 0.202
0.0ArgXaa: 0.0 ± 0.0
Ser
5.416SerAla: 5.416 ± 0.397
1.815SerCys: 1.815 ± 0.188
3.318SerAsp: 3.318 ± 0.297
4.424SerGlu: 4.424 ± 0.418
4.707SerPhe: 4.707 ± 0.376
3.942SerGly: 3.942 ± 0.339
2.439SerHis: 2.439 ± 0.271
4.395SerIle: 4.395 ± 0.313
5.416SerLys: 5.416 ± 0.455
8.961SerLeu: 8.961 ± 0.5
1.9SerMet: 1.9 ± 0.244
4.821SerAsn: 4.821 ± 0.451
4.452SerPro: 4.452 ± 0.597
3.743SerGln: 3.743 ± 0.365
3.204SerArg: 3.204 ± 0.842
9.613SerSer: 9.613 ± 0.725
4.991SerThr: 4.991 ± 0.418
5.473SerVal: 5.473 ± 0.452
0.595SerTrp: 0.595 ± 0.149
2.921SerTyr: 2.921 ± 0.296
0.0SerXaa: 0.0 ± 0.0
Thr
3.857ThrAla: 3.857 ± 0.39
1.475ThrCys: 1.475 ± 0.225
2.977ThrAsp: 2.977 ± 0.279
3.119ThrGlu: 3.119 ± 0.338
3.346ThrPhe: 3.346 ± 0.291
2.439ThrGly: 2.439 ± 0.3
0.992ThrHis: 0.992 ± 0.23
3.119ThrIle: 3.119 ± 0.343
3.148ThrLys: 3.148 ± 0.376
6.692ThrLeu: 6.692 ± 0.473
1.333ThrMet: 1.333 ± 0.188
2.807ThrAsn: 2.807 ± 0.312
2.467ThrPro: 2.467 ± 0.262
2.183ThrGln: 2.183 ± 0.276
1.503ThrArg: 1.503 ± 0.208
5.643ThrSer: 5.643 ± 0.428
3.204ThrThr: 3.204 ± 0.599
4.083ThrVal: 4.083 ± 0.405
0.595ThrTrp: 0.595 ± 0.132
2.183ThrTyr: 2.183 ± 0.224
0.0ThrXaa: 0.0 ± 0.0
Val
4.452ValAla: 4.452 ± 0.307
1.985ValCys: 1.985 ± 0.26
2.439ValAsp: 2.439 ± 0.212
2.949ValGlu: 2.949 ± 0.287
4.027ValPhe: 4.027 ± 0.362
2.183ValGly: 2.183 ± 0.275
1.333ValHis: 1.333 ± 0.215
4.282ValIle: 4.282 ± 0.416
4.339ValLys: 4.339 ± 0.443
6.267ValLeu: 6.267 ± 0.511
1.106ValMet: 1.106 ± 0.17
3.545ValAsn: 3.545 ± 0.344
3.119ValPro: 3.119 ± 0.286
2.58ValGln: 2.58 ± 0.215
1.786ValArg: 1.786 ± 0.227
6.068ValSer: 6.068 ± 0.408
3.913ValThr: 3.913 ± 0.374
3.686ValVal: 3.686 ± 0.388
0.482ValTrp: 0.482 ± 0.105
2.807ValTyr: 2.807 ± 0.396
0.0ValXaa: 0.0 ± 0.0
Trp
0.794TrpAla: 0.794 ± 0.133
0.17TrpCys: 0.17 ± 0.07
0.369TrpAsp: 0.369 ± 0.1
0.595TrpGlu: 0.595 ± 0.107
0.397TrpPhe: 0.397 ± 0.104
0.454TrpGly: 0.454 ± 0.128
0.312TrpHis: 0.312 ± 0.104
0.595TrpIle: 0.595 ± 0.11
0.737TrpLys: 0.737 ± 0.15
1.56TrpLeu: 1.56 ± 0.208
0.113TrpMet: 0.113 ± 0.064
0.624TrpAsn: 0.624 ± 0.108
0.964TrpPro: 0.964 ± 0.155
0.482TrpGln: 0.482 ± 0.134
0.255TrpArg: 0.255 ± 0.102
0.737TrpSer: 0.737 ± 0.113
0.766TrpThr: 0.766 ± 0.159
0.51TrpVal: 0.51 ± 0.122
0.057TrpTrp: 0.057 ± 0.04
0.312TrpTyr: 0.312 ± 0.1
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.957TyrAla: 1.957 ± 0.237
1.276TyrCys: 1.276 ± 0.174
1.389TyrAsp: 1.389 ± 0.217
1.503TyrGlu: 1.503 ± 0.226
2.07TyrPhe: 2.07 ± 0.249
1.73TyrGly: 1.73 ± 0.229
0.936TyrHis: 0.936 ± 0.15
2.467TyrIle: 2.467 ± 0.317
2.609TyrLys: 2.609 ± 0.367
3.743TyrLeu: 3.743 ± 0.276
1.078TyrMet: 1.078 ± 0.155
1.786TyrAsn: 1.786 ± 0.226
1.418TyrPro: 1.418 ± 0.269
1.219TyrGln: 1.219 ± 0.166
1.248TyrArg: 1.248 ± 0.182
3.119TyrSer: 3.119 ± 0.283
2.212TyrThr: 2.212 ± 0.295
2.495TyrVal: 2.495 ± 0.346
0.454TyrTrp: 0.454 ± 0.108
1.985TyrTyr: 1.985 ± 0.257
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (35266 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski