Amino acid dipepetide frequency for Ateline herpesvirus 3 (AtHV-3) (Herpesvirus ateles)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.895AlaAla: 2.895 ± 0.401
1.355AlaCys: 1.355 ± 0.192
1.786AlaAsp: 1.786 ± 0.217
2.34AlaGlu: 2.34 ± 0.26
1.971AlaPhe: 1.971 ± 0.227
1.817AlaGly: 1.817 ± 0.262
1.324AlaHis: 1.324 ± 0.187
4.126AlaIle: 4.126 ± 0.397
3.295AlaLys: 3.295 ± 0.349
5.42AlaLeu: 5.42 ± 0.428
1.201AlaMet: 1.201 ± 0.221
2.309AlaAsn: 2.309 ± 0.24
2.402AlaPro: 2.402 ± 0.26
2.094AlaGln: 2.094 ± 0.258
1.54AlaArg: 1.54 ± 0.216
4.465AlaSer: 4.465 ± 0.435
4.065AlaThr: 4.065 ± 0.271
3.51AlaVal: 3.51 ± 0.437
0.616AlaTrp: 0.616 ± 0.157
1.416AlaTyr: 1.416 ± 0.204
0.0AlaXaa: 0.0 ± 0.0
Cys
1.324CysAla: 1.324 ± 0.262
0.739CysCys: 0.739 ± 0.19
1.078CysAsp: 1.078 ± 0.185
1.478CysGlu: 1.478 ± 0.196
1.263CysPhe: 1.263 ± 0.178
1.263CysGly: 1.263 ± 0.255
0.647CysHis: 0.647 ± 0.138
1.509CysIle: 1.509 ± 0.283
1.54CysLys: 1.54 ± 0.218
2.71CysLeu: 2.71 ± 0.337
0.647CysMet: 0.647 ± 0.126
1.355CysAsn: 1.355 ± 0.251
1.478CysPro: 1.478 ± 0.241
1.263CysGln: 1.263 ± 0.23
0.585CysArg: 0.585 ± 0.16
1.94CysSer: 1.94 ± 0.37
1.355CysThr: 1.355 ± 0.217
1.724CysVal: 1.724 ± 0.213
0.246CysTrp: 0.246 ± 0.089
1.109CysTyr: 1.109 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
2.032AspAla: 2.032 ± 0.289
0.955AspCys: 0.955 ± 0.142
1.878AspAsp: 1.878 ± 0.269
3.541AspGlu: 3.541 ± 1.107
2.094AspPhe: 2.094 ± 0.195
3.264AspGly: 3.264 ± 1.637
0.985AspHis: 0.985 ± 0.177
4.342AspIle: 4.342 ± 0.427
2.525AspLys: 2.525 ± 0.248
4.742AspLeu: 4.742 ± 0.302
1.109AspMet: 1.109 ± 0.181
1.848AspAsn: 1.848 ± 0.251
2.494AspPro: 2.494 ± 0.294
1.232AspGln: 1.232 ± 0.128
1.663AspArg: 1.663 ± 0.24
3.233AspSer: 3.233 ± 0.429
3.264AspThr: 3.264 ± 0.304
2.371AspVal: 2.371 ± 0.267
0.585AspTrp: 0.585 ± 0.131
1.663AspTyr: 1.663 ± 0.229
0.0AspXaa: 0.0 ± 0.0
Glu
3.664GluAla: 3.664 ± 0.331
1.047GluCys: 1.047 ± 0.172
4.496GluAsp: 4.496 ± 1.158
6.528GluGlu: 6.528 ± 2.467
2.002GluPhe: 2.002 ± 0.31
1.94GluGly: 1.94 ± 0.338
1.57GluHis: 1.57 ± 0.164
4.003GluIle: 4.003 ± 0.424
4.065GluLys: 4.065 ± 0.385
5.173GluLeu: 5.173 ± 0.389
1.139GluMet: 1.139 ± 0.148
3.849GluAsn: 3.849 ± 0.31
2.063GluPro: 2.063 ± 0.28
1.663GluGln: 1.663 ± 0.209
1.109GluArg: 1.109 ± 0.181
3.849GluSer: 3.849 ± 0.394
4.065GluThr: 4.065 ± 0.367
2.925GluVal: 2.925 ± 0.286
0.4GluTrp: 0.4 ± 0.094
1.694GluTyr: 1.694 ± 0.284
0.0GluXaa: 0.0 ± 0.0
Phe
1.94PheAla: 1.94 ± 0.258
1.293PheCys: 1.293 ± 0.251
2.063PheAsp: 2.063 ± 0.265
2.032PheGlu: 2.032 ± 0.257
2.402PhePhe: 2.402 ± 0.317
2.063PheGly: 2.063 ± 0.311
0.893PheHis: 0.893 ± 0.163
4.126PheIle: 4.126 ± 0.353
3.603PheLys: 3.603 ± 0.368
5.943PheLeu: 5.943 ± 0.497
1.386PheMet: 1.386 ± 0.208
2.309PheAsn: 2.309 ± 0.286
1.94PhePro: 1.94 ± 0.215
1.848PheGln: 1.848 ± 0.224
1.94PheArg: 1.94 ± 0.226
4.065PheSer: 4.065 ± 0.341
2.679PheThr: 2.679 ± 0.316
2.741PheVal: 2.741 ± 0.328
0.647PheTrp: 0.647 ± 0.165
2.648PheTyr: 2.648 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
2.217GlyAla: 2.217 ± 0.274
0.801GlyCys: 0.801 ± 0.202
3.233GlyAsp: 3.233 ± 1.489
2.771GlyGlu: 2.771 ± 0.505
2.217GlyPhe: 2.217 ± 0.282
2.802GlyGly: 2.802 ± 1.267
0.893GlyHis: 0.893 ± 0.181
3.418GlyIle: 3.418 ± 0.286
2.125GlyLys: 2.125 ± 0.283
3.356GlyLeu: 3.356 ± 0.32
0.801GlyMet: 0.801 ± 0.151
2.494GlyAsn: 2.494 ± 0.276
2.186GlyPro: 2.186 ± 0.288
1.909GlyGln: 1.909 ± 0.232
1.909GlyArg: 1.909 ± 0.312
3.264GlySer: 3.264 ± 0.315
2.771GlyThr: 2.771 ± 0.332
2.248GlyVal: 2.248 ± 0.328
0.431GlyTrp: 0.431 ± 0.113
1.447GlyTyr: 1.447 ± 0.22
0.0GlyXaa: 0.0 ± 0.0
His
0.862HisAla: 0.862 ± 0.182
0.462HisCys: 0.462 ± 0.115
1.139HisAsp: 1.139 ± 0.153
1.016HisGlu: 1.016 ± 0.171
1.447HisPhe: 1.447 ± 0.235
1.416HisGly: 1.416 ± 0.162
0.523HisHis: 0.523 ± 0.112
2.556HisIle: 2.556 ± 0.328
1.663HisLys: 1.663 ± 0.187
3.141HisLeu: 3.141 ± 0.32
0.523HisMet: 0.523 ± 0.13
1.509HisAsn: 1.509 ± 0.265
1.478HisPro: 1.478 ± 0.271
1.047HisGln: 1.047 ± 0.157
1.078HisArg: 1.078 ± 0.201
1.263HisSer: 1.263 ± 0.201
2.125HisThr: 2.125 ± 0.312
2.525HisVal: 2.525 ± 0.319
0.185HisTrp: 0.185 ± 0.072
0.985HisTyr: 0.985 ± 0.169
0.0HisXaa: 0.0 ± 0.0
Ile
3.51IleAla: 3.51 ± 0.309
2.094IleCys: 2.094 ± 0.229
3.664IleAsp: 3.664 ± 0.396
3.788IleGlu: 3.788 ± 0.331
4.249IlePhe: 4.249 ± 0.455
2.864IleGly: 2.864 ± 0.356
1.94IleHis: 1.94 ± 0.203
6.035IleIle: 6.035 ± 0.543
4.711IleLys: 4.711 ± 0.473
7.052IleLeu: 7.052 ± 0.507
1.663IleMet: 1.663 ± 0.187
4.681IleAsn: 4.681 ± 0.402
3.972IlePro: 3.972 ± 0.494
2.987IleGln: 2.987 ± 0.497
2.125IleArg: 2.125 ± 0.26
6.251IleSer: 6.251 ± 0.426
4.896IleThr: 4.896 ± 0.43
3.818IleVal: 3.818 ± 0.431
0.739IleTrp: 0.739 ± 0.165
3.356IleTyr: 3.356 ± 0.316
0.0IleXaa: 0.0 ± 0.0
Lys
3.202LysAla: 3.202 ± 0.332
1.786LysCys: 1.786 ± 0.248
3.11LysAsp: 3.11 ± 0.368
3.449LysGlu: 3.449 ± 0.433
2.71LysPhe: 2.71 ± 0.298
1.57LysGly: 1.57 ± 0.199
2.217LysHis: 2.217 ± 0.244
5.081LysIle: 5.081 ± 0.406
4.958LysLys: 4.958 ± 0.496
7.175LysLeu: 7.175 ± 0.548
1.54LysMet: 1.54 ± 0.221
4.496LysAsn: 4.496 ± 0.437
3.634LysPro: 3.634 ± 0.521
2.956LysGln: 2.956 ± 0.276
2.525LysArg: 2.525 ± 0.249
4.834LysSer: 4.834 ± 0.355
5.481LysThr: 5.481 ± 0.479
3.48LysVal: 3.48 ± 0.327
0.616LysTrp: 0.616 ± 0.14
2.371LysTyr: 2.371 ± 0.244
0.0LysXaa: 0.0 ± 0.0
Leu
5.204LeuAla: 5.204 ± 0.431
2.279LeuCys: 2.279 ± 0.292
3.818LeuAsp: 3.818 ± 0.338
5.481LeuGlu: 5.481 ± 0.539
5.45LeuPhe: 5.45 ± 0.568
4.496LeuGly: 4.496 ± 0.41
3.018LeuHis: 3.018 ± 0.398
5.789LeuIle: 5.789 ± 0.424
7.421LeuLys: 7.421 ± 0.549
10.685LeuLeu: 10.685 ± 0.628
2.34LeuMet: 2.34 ± 0.336
5.974LeuAsn: 5.974 ± 0.456
5.604LeuPro: 5.604 ± 0.472
5.081LeuGln: 5.081 ± 0.49
3.326LeuArg: 3.326 ± 0.33
8.807LeuSer: 8.807 ± 0.683
6.959LeuThr: 6.959 ± 0.474
5.081LeuVal: 5.081 ± 0.419
0.77LeuTrp: 0.77 ± 0.128
3.541LeuTyr: 3.541 ± 0.26
0.0LeuXaa: 0.0 ± 0.0
Met
1.663MetAla: 1.663 ± 0.258
0.831MetCys: 0.831 ± 0.195
0.955MetAsp: 0.955 ± 0.139
1.016MetGlu: 1.016 ± 0.165
1.601MetPhe: 1.601 ± 0.17
0.862MetGly: 0.862 ± 0.156
0.493MetHis: 0.493 ± 0.115
0.924MetIle: 0.924 ± 0.191
0.862MetLys: 0.862 ± 0.129
3.141MetLeu: 3.141 ± 0.326
0.431MetMet: 0.431 ± 0.166
1.047MetAsn: 1.047 ± 0.163
1.047MetPro: 1.047 ± 0.174
1.016MetGln: 1.016 ± 0.201
0.831MetArg: 0.831 ± 0.135
1.878MetSer: 1.878 ± 0.182
1.663MetThr: 1.663 ± 0.235
1.047MetVal: 1.047 ± 0.214
0.277MetTrp: 0.277 ± 0.089
0.831MetTyr: 0.831 ± 0.165
0.0MetXaa: 0.0 ± 0.0
Asn
2.217AsnAla: 2.217 ± 0.264
1.54AsnCys: 1.54 ± 0.244
1.601AsnAsp: 1.601 ± 0.245
2.402AsnGlu: 2.402 ± 0.229
2.864AsnPhe: 2.864 ± 0.344
2.71AsnGly: 2.71 ± 0.34
1.201AsnHis: 1.201 ± 0.169
5.881AsnIle: 5.881 ± 0.592
4.095AsnLys: 4.095 ± 0.374
5.912AsnLeu: 5.912 ± 0.463
1.509AsnMet: 1.509 ± 0.208
4.28AsnAsn: 4.28 ± 0.301
2.156AsnPro: 2.156 ± 0.234
1.355AsnGln: 1.355 ± 0.335
1.57AsnArg: 1.57 ± 0.238
4.927AsnSer: 4.927 ± 0.489
3.295AsnThr: 3.295 ± 0.348
3.941AsnVal: 3.941 ± 0.372
0.708AsnTrp: 0.708 ± 0.155
2.156AsnTyr: 2.156 ± 0.242
0.0AsnXaa: 0.0 ± 0.0
Pro
2.279ProAla: 2.279 ± 0.26
1.447ProCys: 1.447 ± 0.249
2.094ProAsp: 2.094 ± 0.3
2.279ProGlu: 2.279 ± 0.242
2.094ProPhe: 2.094 ± 0.236
2.34ProGly: 2.34 ± 0.314
1.601ProHis: 1.601 ± 0.186
3.757ProIle: 3.757 ± 0.433
3.972ProLys: 3.972 ± 0.724
4.527ProLeu: 4.527 ± 0.353
0.893ProMet: 0.893 ± 0.197
2.094ProAsn: 2.094 ± 0.365
3.51ProPro: 3.51 ± 0.52
1.971ProGln: 1.971 ± 0.282
1.909ProArg: 1.909 ± 0.327
4.342ProSer: 4.342 ± 0.427
3.788ProThr: 3.788 ± 0.483
3.51ProVal: 3.51 ± 0.301
0.647ProTrp: 0.647 ± 0.163
1.94ProTyr: 1.94 ± 0.281
0.0ProXaa: 0.0 ± 0.0
Gln
1.94GlnAla: 1.94 ± 0.286
0.955GlnCys: 0.955 ± 0.21
2.063GlnAsp: 2.063 ± 0.235
2.525GlnGlu: 2.525 ± 0.379
1.817GlnPhe: 1.817 ± 0.301
1.694GlnGly: 1.694 ± 0.365
1.416GlnHis: 1.416 ± 0.181
2.771GlnIle: 2.771 ± 0.319
3.356GlnLys: 3.356 ± 0.407
3.572GlnLeu: 3.572 ± 0.49
0.924GlnMet: 0.924 ± 0.157
2.279GlnAsn: 2.279 ± 0.275
1.848GlnPro: 1.848 ± 0.287
1.601GlnGln: 1.601 ± 0.231
1.232GlnArg: 1.232 ± 0.221
2.925GlnSer: 2.925 ± 0.326
2.833GlnThr: 2.833 ± 0.402
1.416GlnVal: 1.416 ± 0.249
0.554GlnTrp: 0.554 ± 0.112
1.324GlnTyr: 1.324 ± 0.282
0.0GlnXaa: 0.0 ± 0.0
Arg
2.063ArgAla: 2.063 ± 0.283
0.985ArgCys: 0.985 ± 0.188
2.032ArgAsp: 2.032 ± 0.26
1.601ArgGlu: 1.601 ± 0.238
1.109ArgPhe: 1.109 ± 0.196
1.601ArgGly: 1.601 ± 0.278
1.324ArgHis: 1.324 ± 0.175
1.724ArgIle: 1.724 ± 0.242
2.248ArgLys: 2.248 ± 0.301
2.771ArgLeu: 2.771 ± 0.359
0.985ArgMet: 0.985 ± 0.19
1.509ArgAsn: 1.509 ± 0.23
1.694ArgPro: 1.694 ± 0.206
1.478ArgGln: 1.478 ± 0.231
1.447ArgArg: 1.447 ± 0.19
2.032ArgSer: 2.032 ± 0.283
2.34ArgThr: 2.34 ± 0.273
1.601ArgVal: 1.601 ± 0.21
0.493ArgTrp: 0.493 ± 0.14
1.139ArgTyr: 1.139 ± 0.184
0.0ArgXaa: 0.0 ± 0.0
Ser
3.972SerAla: 3.972 ± 0.308
2.002SerCys: 2.002 ± 0.287
3.911SerAsp: 3.911 ± 0.441
4.896SerGlu: 4.896 ± 0.425
3.387SerPhe: 3.387 ± 0.329
3.695SerGly: 3.695 ± 0.402
1.909SerHis: 1.909 ± 0.283
6.005SerIle: 6.005 ± 0.428
5.019SerLys: 5.019 ± 0.488
8.283SerLeu: 8.283 ± 0.499
1.724SerMet: 1.724 ± 0.202
3.603SerAsn: 3.603 ± 0.302
3.634SerPro: 3.634 ± 0.376
3.726SerGln: 3.726 ± 0.415
2.556SerArg: 2.556 ± 0.342
7.575SerSer: 7.575 ± 0.671
6.313SerThr: 6.313 ± 0.605
6.005SerVal: 6.005 ± 0.543
0.708SerTrp: 0.708 ± 0.145
2.741SerTyr: 2.741 ± 0.308
0.0SerXaa: 0.0 ± 0.0
Thr
3.264ThrAla: 3.264 ± 0.406
1.632ThrCys: 1.632 ± 0.245
3.202ThrAsp: 3.202 ± 0.349
4.373ThrGlu: 4.373 ± 0.35
3.387ThrPhe: 3.387 ± 0.363
2.617ThrGly: 2.617 ± 0.352
2.217ThrHis: 2.217 ± 0.293
5.019ThrIle: 5.019 ± 0.478
4.465ThrLys: 4.465 ± 0.387
6.682ThrLeu: 6.682 ± 0.652
1.263ThrMet: 1.263 ± 0.165
4.311ThrAsn: 4.311 ± 0.396
3.634ThrPro: 3.634 ± 0.449
2.433ThrGln: 2.433 ± 0.323
1.663ThrArg: 1.663 ± 0.223
6.898ThrSer: 6.898 ± 0.605
5.327ThrThr: 5.327 ± 0.767
3.972ThrVal: 3.972 ± 0.325
0.893ThrTrp: 0.893 ± 0.152
3.202ThrTyr: 3.202 ± 0.498
0.0ThrXaa: 0.0 ± 0.0
Val
3.264ValAla: 3.264 ± 0.409
1.601ValCys: 1.601 ± 0.245
2.125ValAsp: 2.125 ± 0.305
3.541ValGlu: 3.541 ± 0.345
3.911ValPhe: 3.911 ± 0.413
2.34ValGly: 2.34 ± 0.332
1.755ValHis: 1.755 ± 0.263
3.449ValIle: 3.449 ± 0.258
3.664ValLys: 3.664 ± 0.367
5.789ValLeu: 5.789 ± 0.427
1.201ValMet: 1.201 ± 0.212
3.172ValAsn: 3.172 ± 0.354
3.911ValPro: 3.911 ± 0.315
1.694ValGln: 1.694 ± 0.215
1.755ValArg: 1.755 ± 0.243
5.42ValSer: 5.42 ± 0.483
3.541ValThr: 3.541 ± 0.36
3.356ValVal: 3.356 ± 0.424
0.647ValTrp: 0.647 ± 0.147
2.463ValTyr: 2.463 ± 0.34
0.0ValXaa: 0.0 ± 0.0
Trp
0.677TrpAla: 0.677 ± 0.153
0.308TrpCys: 0.308 ± 0.102
0.431TrpAsp: 0.431 ± 0.128
0.493TrpGlu: 0.493 ± 0.125
0.523TrpPhe: 0.523 ± 0.134
0.431TrpGly: 0.431 ± 0.128
0.308TrpHis: 0.308 ± 0.076
0.616TrpIle: 0.616 ± 0.111
0.616TrpLys: 0.616 ± 0.181
1.078TrpLeu: 1.078 ± 0.157
0.246TrpMet: 0.246 ± 0.099
0.647TrpAsn: 0.647 ± 0.13
0.585TrpPro: 0.585 ± 0.109
0.493TrpGln: 0.493 ± 0.125
0.277TrpArg: 0.277 ± 0.114
0.924TrpSer: 0.924 ± 0.179
0.955TrpThr: 0.955 ± 0.174
0.554TrpVal: 0.554 ± 0.149
0.062TrpTrp: 0.062 ± 0.047
0.523TrpTyr: 0.523 ± 0.125
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.755TyrAla: 1.755 ± 0.232
1.139TyrCys: 1.139 ± 0.219
1.139TyrAsp: 1.139 ± 0.18
1.694TyrGlu: 1.694 ± 0.161
2.094TyrPhe: 2.094 ± 0.245
1.509TyrGly: 1.509 ± 0.199
0.708TyrHis: 0.708 ± 0.142
3.264TyrIle: 3.264 ± 0.296
2.987TyrLys: 2.987 ± 0.319
3.972TyrLeu: 3.972 ± 0.355
0.862TyrMet: 0.862 ± 0.134
2.556TyrAsn: 2.556 ± 0.276
1.724TyrPro: 1.724 ± 0.201
1.201TyrGln: 1.201 ± 0.158
1.139TyrArg: 1.139 ± 0.181
2.771TyrSer: 2.771 ± 0.331
2.741TyrThr: 2.741 ± 0.304
2.771TyrVal: 2.771 ± 0.323
0.523TyrTrp: 0.523 ± 0.157
1.263TyrTyr: 1.263 ± 0.195
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (32476 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski