Amino acid dipepetide frequency for Bufonid herpesvirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.325AlaAla: 5.325 ± 0.361
1.968AlaCys: 1.968 ± 0.162
2.74AlaAsp: 2.74 ± 0.231
4.361AlaGlu: 4.361 ± 0.339
3.415AlaPhe: 3.415 ± 0.26
2.817AlaGly: 2.817 ± 0.246
1.737AlaHis: 1.737 ± 0.157
3.666AlaIle: 3.666 ± 0.237
3.627AlaLys: 3.627 ± 0.349
7.409AlaLeu: 7.409 ± 0.407
1.428AlaMet: 1.428 ± 0.169
3.145AlaAsn: 3.145 ± 0.212
2.45AlaPro: 2.45 ± 0.223
2.875AlaGln: 2.875 ± 0.239
2.412AlaArg: 2.412 ± 0.205
5.402AlaSer: 5.402 ± 0.331
3.782AlaThr: 3.782 ± 0.267
4.669AlaVal: 4.669 ± 0.31
0.54AlaTrp: 0.54 ± 0.088
2.335AlaTyr: 2.335 ± 0.202
0.0AlaXaa: 0.0 ± 0.0
Cys
1.949CysAla: 1.949 ± 0.198
0.926CysCys: 0.926 ± 0.132
1.08CysAsp: 1.08 ± 0.141
1.254CysGlu: 1.254 ± 0.153
1.737CysPhe: 1.737 ± 0.168
1.505CysGly: 1.505 ± 0.185
0.328CysHis: 0.328 ± 0.086
1.447CysIle: 1.447 ± 0.182
1.891CysLys: 1.891 ± 0.192
2.991CysLeu: 2.991 ± 0.265
0.752CysMet: 0.752 ± 0.111
1.563CysAsn: 1.563 ± 0.178
1.37CysPro: 1.37 ± 0.187
1.138CysGln: 1.138 ± 0.143
1.254CysArg: 1.254 ± 0.139
2.335CysSer: 2.335 ± 0.278
1.852CysThr: 1.852 ± 0.198
2.894CysVal: 2.894 ± 0.235
0.405CysTrp: 0.405 ± 0.104
1.216CysTyr: 1.216 ± 0.154
0.0CysXaa: 0.0 ± 0.0
Asp
2.836AspAla: 2.836 ± 0.254
1.196AspCys: 1.196 ± 0.151
2.547AspAsp: 2.547 ± 0.388
2.2AspGlu: 2.2 ± 0.273
2.18AspPhe: 2.18 ± 0.204
2.489AspGly: 2.489 ± 0.237
0.984AspHis: 0.984 ± 0.152
3.184AspIle: 3.184 ± 0.25
2.47AspLys: 2.47 ± 0.222
4.245AspLeu: 4.245 ± 0.305
1.389AspMet: 1.389 ± 0.187
2.856AspAsn: 2.856 ± 0.333
2.257AspPro: 2.257 ± 0.228
1.428AspGln: 1.428 ± 0.18
1.698AspArg: 1.698 ± 0.178
3.396AspSer: 3.396 ± 0.319
3.762AspThr: 3.762 ± 0.406
3.01AspVal: 3.01 ± 0.24
0.54AspTrp: 0.54 ± 0.094
1.64AspTyr: 1.64 ± 0.175
0.0AspXaa: 0.0 ± 0.0
Glu
3.762GluAla: 3.762 ± 0.376
1.409GluCys: 1.409 ± 0.171
3.299GluAsp: 3.299 ± 0.268
4.129GluGlu: 4.129 ± 0.579
1.872GluPhe: 1.872 ± 0.175
2.103GluGly: 2.103 ± 0.206
1.351GluHis: 1.351 ± 0.173
2.894GluIle: 2.894 ± 0.283
3.859GluLys: 3.859 ± 0.336
5.557GluLeu: 5.557 ± 0.491
1.466GluMet: 1.466 ± 0.182
2.894GluAsn: 2.894 ± 0.323
2.836GluPro: 2.836 ± 0.248
1.949GluGln: 1.949 ± 0.227
2.084GluArg: 2.084 ± 0.189
4.168GluSer: 4.168 ± 0.272
4.341GluThr: 4.341 ± 0.329
2.277GluVal: 2.277 ± 0.144
0.579GluTrp: 0.579 ± 0.084
1.775GluTyr: 1.775 ± 0.176
0.0GluXaa: 0.0 ± 0.0
Phe
3.01PheAla: 3.01 ± 0.224
1.601PheCys: 1.601 ± 0.185
2.528PheAsp: 2.528 ± 0.218
3.01PheGlu: 3.01 ± 0.26
2.682PhePhe: 2.682 ± 0.256
2.528PheGly: 2.528 ± 0.21
0.637PheHis: 0.637 ± 0.135
2.798PheIle: 2.798 ± 0.293
2.952PheLys: 2.952 ± 0.254
4.38PheLeu: 4.38 ± 0.299
1.042PheMet: 1.042 ± 0.141
2.431PheAsn: 2.431 ± 0.222
1.814PhePro: 1.814 ± 0.152
1.64PheGln: 1.64 ± 0.197
2.026PheArg: 2.026 ± 0.195
3.898PheSer: 3.898 ± 0.315
2.836PheThr: 2.836 ± 0.199
3.531PheVal: 3.531 ± 0.243
0.463PheTrp: 0.463 ± 0.096
2.026PheTyr: 2.026 ± 0.202
0.0PheXaa: 0.0 ± 0.0
Gly
2.547GlyAla: 2.547 ± 0.227
1.466GlyCys: 1.466 ± 0.174
2.045GlyAsp: 2.045 ± 0.186
1.775GlyGlu: 1.775 ± 0.182
2.759GlyPhe: 2.759 ± 0.213
2.373GlyGly: 2.373 ± 0.295
0.945GlyHis: 0.945 ± 0.135
2.026GlyIle: 2.026 ± 0.174
2.817GlyLys: 2.817 ± 0.226
5.036GlyLeu: 5.036 ± 0.301
0.926GlyMet: 0.926 ± 0.114
1.891GlyAsn: 1.891 ± 0.189
1.737GlyPro: 1.737 ± 0.19
1.717GlyGln: 1.717 ± 0.193
1.968GlyArg: 1.968 ± 0.197
3.473GlySer: 3.473 ± 0.245
2.663GlyThr: 2.663 ± 0.242
3.608GlyVal: 3.608 ± 0.272
0.54GlyTrp: 0.54 ± 0.097
1.679GlyTyr: 1.679 ± 0.193
0.0GlyXaa: 0.0 ± 0.0
His
1.428HisAla: 1.428 ± 0.161
0.752HisCys: 0.752 ± 0.113
0.695HisAsp: 0.695 ± 0.096
1.119HisGlu: 1.119 ± 0.139
0.926HisPhe: 0.926 ± 0.122
1.042HisGly: 1.042 ± 0.129
0.521HisHis: 0.521 ± 0.154
1.447HisIle: 1.447 ± 0.141
1.698HisLys: 1.698 ± 0.193
2.18HisLeu: 2.18 ± 0.191
0.56HisMet: 0.56 ± 0.103
1.717HisAsn: 1.717 ± 0.186
0.965HisPro: 0.965 ± 0.127
0.965HisGln: 0.965 ± 0.133
1.138HisArg: 1.138 ± 0.18
1.659HisSer: 1.659 ± 0.173
1.698HisThr: 1.698 ± 0.162
1.698HisVal: 1.698 ± 0.205
0.328HisTrp: 0.328 ± 0.078
0.54HisTyr: 0.54 ± 0.121
0.0HisXaa: 0.0 ± 0.0
Ile
3.608IleAla: 3.608 ± 0.256
1.293IleCys: 1.293 ± 0.166
2.142IleAsp: 2.142 ± 0.178
3.126IleGlu: 3.126 ± 0.367
2.566IlePhe: 2.566 ± 0.248
1.872IleGly: 1.872 ± 0.176
1.235IleHis: 1.235 ± 0.154
2.875IleIle: 2.875 ± 0.28
3.82IleLys: 3.82 ± 0.279
4.959IleLeu: 4.959 ± 0.451
1.273IleMet: 1.273 ± 0.18
3.396IleAsn: 3.396 ± 0.299
3.049IlePro: 3.049 ± 0.244
2.277IleGln: 2.277 ± 0.226
2.296IleArg: 2.296 ± 0.198
3.492IleSer: 3.492 ± 0.247
3.705IleThr: 3.705 ± 0.248
3.512IleVal: 3.512 ± 0.318
0.405IleTrp: 0.405 ± 0.087
1.814IleTyr: 1.814 ± 0.181
0.0IleXaa: 0.0 ± 0.0
Lys
5.229LysAla: 5.229 ± 0.342
1.794LysCys: 1.794 ± 0.155
3.087LysAsp: 3.087 ± 0.312
4.168LysGlu: 4.168 ± 0.458
1.621LysPhe: 1.621 ± 0.17
2.257LysGly: 2.257 ± 0.213
2.335LysHis: 2.335 ± 0.2
2.991LysIle: 2.991 ± 0.24
4.611LysLys: 4.611 ± 0.41
6.001LysLeu: 6.001 ± 0.456
1.563LysMet: 1.563 ± 0.186
3.55LysAsn: 3.55 ± 0.218
3.55LysPro: 3.55 ± 0.393
3.608LysGln: 3.608 ± 0.327
3.859LysArg: 3.859 ± 0.31
4.631LysSer: 4.631 ± 0.275
6.194LysThr: 6.194 ± 0.487
2.836LysVal: 2.836 ± 0.221
0.502LysTrp: 0.502 ± 0.096
2.065LysTyr: 2.065 ± 0.252
0.0LysXaa: 0.0 ± 0.0
Leu
6.251LeuAla: 6.251 ± 0.305
3.319LeuCys: 3.319 ± 0.325
3.955LeuAsp: 3.955 ± 0.284
5.248LeuGlu: 5.248 ± 0.366
5.499LeuPhe: 5.499 ± 0.333
4.303LeuGly: 4.303 ± 0.272
2.354LeuHis: 2.354 ± 0.214
4.322LeuIle: 4.322 ± 0.238
7.699LeuLys: 7.699 ± 0.594
9.512LeuLeu: 9.512 ± 0.602
2.2LeuMet: 2.2 ± 0.214
5.21LeuAsn: 5.21 ± 0.261
4.226LeuPro: 4.226 ± 0.364
4.38LeuGln: 4.38 ± 0.353
3.917LeuArg: 3.917 ± 0.268
7.93LeuSer: 7.93 ± 0.423
6.676LeuThr: 6.676 ± 0.433
6.136LeuVal: 6.136 ± 0.344
1.061LeuTrp: 1.061 ± 0.152
4.11LeuTyr: 4.11 ± 0.262
0.0LeuXaa: 0.0 ± 0.0
Met
1.524MetAla: 1.524 ± 0.153
1.023MetCys: 1.023 ± 0.141
1.023MetAsp: 1.023 ± 0.13
1.042MetGlu: 1.042 ± 0.14
1.216MetPhe: 1.216 ± 0.14
1.37MetGly: 1.37 ± 0.175
0.617MetHis: 0.617 ± 0.109
0.907MetIle: 0.907 ± 0.155
1.273MetLys: 1.273 ± 0.168
2.721MetLeu: 2.721 ± 0.218
0.579MetMet: 0.579 ± 0.189
0.888MetAsn: 0.888 ± 0.143
0.965MetPro: 0.965 ± 0.159
1.023MetGln: 1.023 ± 0.148
0.945MetArg: 0.945 ± 0.128
1.756MetSer: 1.756 ± 0.193
1.486MetThr: 1.486 ± 0.2
1.486MetVal: 1.486 ± 0.168
0.174MetTrp: 0.174 ± 0.057
1.409MetTyr: 1.409 ± 0.171
0.0MetXaa: 0.0 ± 0.0
Asn
3.454AsnAla: 3.454 ± 0.268
1.447AsnCys: 1.447 ± 0.16
1.775AsnAsp: 1.775 ± 0.203
2.508AsnGlu: 2.508 ± 0.236
1.659AsnPhe: 1.659 ± 0.147
2.142AsnGly: 2.142 ± 0.233
1.1AsnHis: 1.1 ± 0.193
3.859AsnIle: 3.859 ± 0.271
3.994AsnLys: 3.994 ± 0.257
4.554AsnLeu: 4.554 ± 0.328
1.563AsnMet: 1.563 ± 0.214
3.627AsnAsn: 3.627 ± 0.31
2.875AsnPro: 2.875 ± 0.198
1.64AsnGln: 1.64 ± 0.215
2.952AsnArg: 2.952 ± 0.302
4.611AsnSer: 4.611 ± 0.408
4.669AsnThr: 4.669 ± 0.294
3.145AsnVal: 3.145 ± 0.217
0.54AsnTrp: 0.54 ± 0.1
1.91AsnTyr: 1.91 ± 0.215
0.0AsnXaa: 0.0 ± 0.0
Pro
2.798ProAla: 2.798 ± 0.254
1.409ProCys: 1.409 ± 0.197
2.296ProAsp: 2.296 ± 0.183
2.161ProGlu: 2.161 ± 0.191
2.412ProPhe: 2.412 ± 0.202
2.18ProGly: 2.18 ± 0.248
1.061ProHis: 1.061 ± 0.151
2.296ProIle: 2.296 ± 0.19
2.798ProLys: 2.798 ± 0.25
4.476ProLeu: 4.476 ± 0.327
0.868ProMet: 0.868 ± 0.132
2.547ProAsn: 2.547 ± 0.215
2.663ProPro: 2.663 ± 0.273
1.891ProGln: 1.891 ± 0.213
2.2ProArg: 2.2 ± 0.202
3.357ProSer: 3.357 ± 0.286
3.801ProThr: 3.801 ± 0.391
3.705ProVal: 3.705 ± 0.319
0.328ProTrp: 0.328 ± 0.082
1.119ProTyr: 1.119 ± 0.151
0.0ProXaa: 0.0 ± 0.0
Gln
2.528GlnAla: 2.528 ± 0.204
1.003GlnCys: 1.003 ± 0.147
2.103GlnAsp: 2.103 ± 0.262
2.393GlnGlu: 2.393 ± 0.228
1.601GlnPhe: 1.601 ± 0.187
1.486GlnGly: 1.486 ± 0.15
0.984GlnHis: 0.984 ± 0.136
2.219GlnIle: 2.219 ± 0.197
3.087GlnLys: 3.087 ± 0.309
3.705GlnLeu: 3.705 ± 0.308
1.1GlnMet: 1.1 ± 0.101
2.759GlnAsn: 2.759 ± 0.298
2.065GlnPro: 2.065 ± 0.213
1.949GlnGln: 1.949 ± 0.209
2.084GlnArg: 2.084 ± 0.205
2.47GlnSer: 2.47 ± 0.214
3.145GlnThr: 3.145 ± 0.258
1.621GlnVal: 1.621 ± 0.186
0.405GlnTrp: 0.405 ± 0.086
1.254GlnTyr: 1.254 ± 0.137
0.0GlnXaa: 0.0 ± 0.0
Arg
2.991ArgAla: 2.991 ± 0.267
1.196ArgCys: 1.196 ± 0.161
2.122ArgAsp: 2.122 ± 0.185
2.18ArgGlu: 2.18 ± 0.269
2.238ArgPhe: 2.238 ± 0.209
1.929ArgGly: 1.929 ± 0.18
1.119ArgHis: 1.119 ± 0.15
2.257ArgIle: 2.257 ± 0.227
3.01ArgLys: 3.01 ± 0.263
4.592ArgLeu: 4.592 ± 0.307
0.945ArgMet: 0.945 ± 0.122
2.2ArgAsn: 2.2 ± 0.208
1.756ArgPro: 1.756 ± 0.201
2.103ArgGln: 2.103 ± 0.181
2.2ArgArg: 2.2 ± 0.267
3.357ArgSer: 3.357 ± 0.299
3.28ArgThr: 3.28 ± 0.277
2.894ArgVal: 2.894 ± 0.243
0.405ArgTrp: 0.405 ± 0.077
1.659ArgTyr: 1.659 ± 0.184
0.0ArgXaa: 0.0 ± 0.0
Ser
5.074SerAla: 5.074 ± 0.337
2.2SerCys: 2.2 ± 0.234
3.743SerAsp: 3.743 ± 0.364
4.168SerGlu: 4.168 ± 0.45
4.168SerPhe: 4.168 ± 0.318
3.377SerGly: 3.377 ± 0.288
1.389SerHis: 1.389 ± 0.167
4.168SerIle: 4.168 ± 0.288
5.152SerLys: 5.152 ± 0.289
7.428SerLeu: 7.428 ± 0.419
1.621SerMet: 1.621 ± 0.172
3.627SerAsn: 3.627 ± 0.293
3.569SerPro: 3.569 ± 0.301
3.068SerGln: 3.068 ± 0.249
3.647SerArg: 3.647 ± 0.366
6.85SerSer: 6.85 ± 0.495
5.055SerThr: 5.055 ± 0.506
6.039SerVal: 6.039 ± 0.413
0.888SerTrp: 0.888 ± 0.124
2.238SerTyr: 2.238 ± 0.213
0.0SerXaa: 0.0 ± 0.0
Thr
4.843ThrAla: 4.843 ± 0.265
1.659ThrCys: 1.659 ± 0.175
3.955ThrAsp: 3.955 ± 0.394
4.399ThrGlu: 4.399 ± 0.437
2.913ThrPhe: 2.913 ± 0.259
2.971ThrGly: 2.971 ± 0.232
1.987ThrHis: 1.987 ± 0.202
4.013ThrIle: 4.013 ± 0.398
4.438ThrLys: 4.438 ± 0.365
6.599ThrLeu: 6.599 ± 0.386
1.466ThrMet: 1.466 ± 0.179
4.283ThrAsn: 4.283 ± 0.288
3.492ThrPro: 3.492 ± 0.323
2.875ThrGln: 2.875 ± 0.19
3.164ThrArg: 3.164 ± 0.268
5.557ThrSer: 5.557 ± 0.593
5.962ThrThr: 5.962 ± 1.071
5.518ThrVal: 5.518 ± 0.32
0.579ThrTrp: 0.579 ± 0.123
2.103ThrTyr: 2.103 ± 0.196
0.0ThrXaa: 0.0 ± 0.0
Val
4.226ValAla: 4.226 ± 0.233
2.547ValCys: 2.547 ± 0.241
2.952ValAsp: 2.952 ± 0.28
3.434ValGlu: 3.434 ± 0.266
3.955ValPhe: 3.955 ± 0.307
3.01ValGly: 3.01 ± 0.224
1.466ValHis: 1.466 ± 0.15
3.164ValIle: 3.164 ± 0.245
3.975ValLys: 3.975 ± 0.297
7.911ValLeu: 7.911 ± 0.385
1.505ValMet: 1.505 ± 0.141
2.663ValAsn: 2.663 ± 0.246
3.164ValPro: 3.164 ± 0.239
2.18ValGln: 2.18 ± 0.183
2.47ValArg: 2.47 ± 0.267
5.595ValSer: 5.595 ± 0.365
4.033ValThr: 4.033 ± 0.241
5.21ValVal: 5.21 ± 0.419
0.849ValTrp: 0.849 ± 0.13
2.643ValTyr: 2.643 ± 0.284
0.0ValXaa: 0.0 ± 0.0
Trp
0.579TrpAla: 0.579 ± 0.116
0.444TrpCys: 0.444 ± 0.093
0.386TrpAsp: 0.386 ± 0.103
0.347TrpGlu: 0.347 ± 0.102
0.714TrpPhe: 0.714 ± 0.118
0.347TrpGly: 0.347 ± 0.065
0.212TrpHis: 0.212 ± 0.067
0.405TrpIle: 0.405 ± 0.111
0.482TrpLys: 0.482 ± 0.089
1.409TrpLeu: 1.409 ± 0.17
0.193TrpMet: 0.193 ± 0.06
0.309TrpAsn: 0.309 ± 0.081
0.289TrpPro: 0.289 ± 0.096
0.463TrpGln: 0.463 ± 0.112
0.424TrpArg: 0.424 ± 0.096
0.849TrpSer: 0.849 ± 0.132
0.907TrpThr: 0.907 ± 0.139
0.752TrpVal: 0.752 ± 0.096
0.27TrpTrp: 0.27 ± 0.081
0.386TrpTyr: 0.386 ± 0.093
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.142TyrAla: 2.142 ± 0.184
1.235TyrCys: 1.235 ± 0.198
1.833TyrAsp: 1.833 ± 0.184
1.582TyrGlu: 1.582 ± 0.161
1.698TyrPhe: 1.698 ± 0.176
1.891TyrGly: 1.891 ± 0.196
0.714TyrHis: 0.714 ± 0.104
1.775TyrIle: 1.775 ± 0.183
2.798TyrLys: 2.798 ± 0.245
2.643TyrLeu: 2.643 ± 0.196
0.965TyrMet: 0.965 ± 0.163
2.47TyrAsn: 2.47 ± 0.218
1.273TyrPro: 1.273 ± 0.162
0.791TyrGln: 0.791 ± 0.16
1.698TyrArg: 1.698 ± 0.182
2.682TyrSer: 2.682 ± 0.213
2.933TyrThr: 2.933 ± 0.249
2.431TyrVal: 2.431 ± 0.197
0.367TyrTrp: 0.367 ± 0.087
1.235TyrTyr: 1.235 ± 0.158
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 152 proteins (51829 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski