Amino acid dipeptide frequency for Hyphantria cunea nuclear polyhedrosis virus (HcNPV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
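As a rough illustration of the idea, the sketch below counts overlapping adjacent residue pairs in a list of protein sequences and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of one-letter codes, counting within each protein only (no pairs across protein boundaries), and normalizing by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not taken from the source): one-letter residue codes,
    overlapping windows, and normalization by the total dipeptide count.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein separately.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "AAC" yields AA and AC, "CA" yields CA.
freqs = dipeptide_per_mille(["AAC", "CA"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Sequences shorter than two residues contribute nothing, and an empty input returns an empty dictionary rather than dividing by zero.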

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
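The arithmetic behind the expected value can be sketched as follows. The baseline is 1/400 of all dipeptides, which is 2.5‰. The `classify` helper and its labels are hypothetical names for this example, and using exactly the uniform baseline as the colouring threshold is an assumption; the three observed values are taken from the table on this page.

```python
N_STANDARD = 20  # standard amino acids; Xaa is left out of the baseline

# Uniform expectation: 1 / (20 * 20) of all dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"      # would be marked red
    if freq_per_mille < expected:
        return "under"     # would be marked blue
    return "expected"


# Three values copied from the table (per mille).
observed = {"LeuLeu": 10.304, "AlaAla": 6.727, "TrpTrp": 0.126}
labels = {pair: classify(value) for pair, value in observed.items()}
print(EXPECTED_PER_MILLE, labels)
```

A uniform baseline ignores the fact that single amino acids are themselves unequally frequent, so a rare residue such as Trp will appear underrepresented in most of its dipeptides.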

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.727 ± 0.571 | 1.688 ± 0.233 | 4.132 ± 0.372 | 3.477 ± 0.275 | 3.527 ± 0.299 | 3.174 ± 0.287 | 2.015 ± 0.184 | 3.275 ± 0.287 | 3.225 ± 0.306 | 7.382 ± 0.44 | 1.663 ± 0.177 | 3.804 ± 0.333 | 3.93 ± 0.37 | 3.275 ± 0.245 | 4.636 ± 0.362 | 4.862 ± 0.406 | 4.358 ± 0.363 | 5.794 ± 0.432 | 0.529 ± 0.128 | 2.469 ± 0.254 | 0.0 ± 0.0 |
| Cys | 2.242 ± 0.242 | 0.655 ± 0.153 | 1.764 ± 0.213 | 1.184 ± 0.16 | 1.083 ± 0.164 | 1.033 ± 0.2 | 0.504 ± 0.104 | 1.033 ± 0.131 | 1.512 ± 0.226 | 2.116 ± 0.225 | 0.63 ± 0.112 | 1.688 ± 0.227 | 1.184 ± 0.195 | 0.731 ± 0.135 | 1.587 ± 0.206 | 1.587 ± 0.191 | 1.36 ± 0.183 | 2.343 ± 0.26 | 0.176 ± 0.073 | 1.033 ± 0.158 | 0.0 ± 0.0 |
| Asp | 5.568 ± 0.368 | 1.335 ± 0.18 | 4.837 ± 0.449 | 3.93 ± 0.296 | 2.545 ± 0.264 | 2.948 ± 0.291 | 1.058 ± 0.176 | 2.545 ± 0.254 | 3.703 ± 0.348 | 5.165 ± 0.412 | 1.335 ± 0.166 | 3.678 ± 0.337 | 1.89 ± 0.238 | 1.612 ± 0.211 | 2.595 ± 0.258 | 3.174 ± 0.242 | 3.376 ± 0.253 | 3.981 ± 0.371 | 0.579 ± 0.135 | 2.796 ± 0.233 | 0.0 ± 0.0 |
| Glu | 2.948 ± 0.294 | 1.587 ± 0.223 | 2.066 ± 0.22 | 2.822 ± 0.291 | 2.872 ± 0.249 | 1.386 ± 0.183 | 1.612 ± 0.228 | 2.822 ± 0.27 | 2.192 ± 0.24 | 5.669 ± 0.339 | 1.008 ± 0.151 | 3.426 ± 0.28 | 2.343 ± 0.377 | 2.67 ± 0.321 | 3.779 ± 0.311 | 2.998 ± 0.251 | 3.225 ± 0.326 | 2.368 ± 0.232 | 0.554 ± 0.101 | 1.965 ± 0.245 | 0.0 ± 0.0 |
| Phe | 3.703 ± 0.318 | 1.36 ± 0.197 | 3.678 ± 0.314 | 3.074 ± 0.284 | 1.688 ± 0.234 | 2.242 ± 0.234 | 0.806 ± 0.144 | 2.494 ± 0.216 | 3.754 ± 0.388 | 4.157 ± 0.309 | 1.234 ± 0.17 | 3.729 ± 0.304 | 1.31 ± 0.202 | 1.234 ± 0.18 | 1.864 ± 0.221 | 2.167 ± 0.218 | 2.419 ± 0.241 | 4.636 ± 0.378 | 0.252 ± 0.079 | 2.091 ± 0.174 | 0.0 ± 0.0 |
| Gly | 3.048 ± 0.228 | 0.756 ± 0.13 | 2.796 ± 0.323 | 2.066 ± 0.198 | 1.738 ± 0.236 | 2.217 ± 0.287 | 1.234 ± 0.15 | 1.008 ± 0.164 | 2.015 ± 0.197 | 3.3 ± 0.302 | 0.605 ± 0.11 | 2.343 ± 0.222 | 1.134 ± 0.16 | 1.713 ± 0.2 | 2.015 ± 0.276 | 1.814 ± 0.187 | 1.99 ± 0.221 | 4.132 ± 0.38 | 0.378 ± 0.083 | 1.512 ± 0.165 | 0.0 ± 0.0 |
| His | 1.94 ± 0.231 | 0.605 ± 0.109 | 1.234 ± 0.147 | 1.26 ± 0.217 | 1.285 ± 0.165 | 1.033 ± 0.177 | 0.831 ± 0.156 | 1.26 ± 0.164 | 1.713 ± 0.156 | 1.89 ± 0.211 | 0.63 ± 0.147 | 1.965 ± 0.211 | 0.907 ± 0.164 | 0.756 ± 0.136 | 1.033 ± 0.144 | 1.058 ± 0.125 | 1.159 ± 0.169 | 2.419 ± 0.235 | 0.428 ± 0.128 | 1.234 ± 0.175 | 0.0 ± 0.0 |
| Ile | 3.552 ± 0.349 | 1.537 ± 0.193 | 3.905 ± 0.339 | 2.872 ± 0.259 | 1.99 ± 0.205 | 1.764 ± 0.244 | 0.705 ± 0.153 | 2.998 ± 0.287 | 4.535 ± 0.362 | 4.132 ± 0.311 | 1.486 ± 0.199 | 4.484 ± 0.333 | 1.335 ± 0.181 | 1.537 ± 0.208 | 2.066 ± 0.217 | 2.645 ± 0.262 | 2.595 ± 0.263 | 4.61 ± 0.327 | 0.277 ± 0.09 | 1.688 ± 0.245 | 0.0 ± 0.0 |
| Lys | 2.267 ± 0.215 | 1.965 ± 0.252 | 2.242 ± 0.274 | 2.57 ± 0.276 | 3.351 ± 0.322 | 1.436 ± 0.261 | 1.839 ± 0.22 | 3.981 ± 0.396 | 3.628 ± 0.324 | 6.122 ± 0.356 | 2.041 ± 0.267 | 4.232 ± 0.345 | 2.141 ± 0.261 | 2.67 ± 0.291 | 4.51 ± 0.385 | 3.2 ± 0.268 | 3.981 ± 0.321 | 2.922 ± 0.302 | 0.403 ± 0.11 | 2.897 ± 0.295 | 0.0 ± 0.0 |
| Leu | 5.92 ± 0.438 | 2.343 ± 0.27 | 4.938 ± 0.317 | 4.384 ± 0.338 | 4.484 ± 0.349 | 2.998 ± 0.273 | 2.595 ± 0.257 | 5.996 ± 0.414 | 6.097 ± 0.371 | 10.304 ± 0.679 | 2.696 ± 0.221 | 7.18 ± 0.442 | 4.081 ± 0.375 | 5.92 ± 0.461 | 5.442 ± 0.377 | 5.366 ± 0.304 | 5.971 ± 0.334 | 5.92 ± 0.347 | 0.605 ± 0.113 | 3.628 ± 0.332 | 0.0 ± 0.0 |
| Met | 1.612 ± 0.203 | 0.882 ± 0.138 | 1.486 ± 0.181 | 1.209 ± 0.173 | 1.461 ± 0.165 | 0.806 ± 0.161 | 0.831 ± 0.148 | 1.26 ± 0.165 | 0.831 ± 0.135 | 2.595 ± 0.264 | 0.479 ± 0.106 | 1.612 ± 0.179 | 1.134 ± 0.159 | 1.486 ± 0.195 | 1.512 ± 0.181 | 1.915 ± 0.229 | 1.109 ± 0.194 | 1.436 ± 0.21 | 0.302 ± 0.085 | 1.436 ± 0.185 | 0.0 ± 0.0 |
| Asn | 5.92 ± 0.378 | 1.764 ± 0.204 | 4.207 ± 0.328 | 3.628 ± 0.321 | 3.149 ± 0.303 | 2.847 ± 0.297 | 1.008 ± 0.173 | 3.174 ± 0.277 | 4.787 ± 0.359 | 5.744 ± 0.39 | 1.738 ± 0.168 | 6.298 ± 0.466 | 1.764 ± 0.229 | 1.688 ± 0.192 | 3.351 ± 0.27 | 4.056 ± 0.373 | 4.207 ± 0.326 | 5.769 ± 0.348 | 0.731 ± 0.141 | 2.973 ± 0.243 | 0.0 ± 0.0 |
| Pro | 3.25 ± 0.339 | 0.731 ± 0.139 | 2.721 ± 0.25 | 1.864 ± 0.34 | 1.94 ± 0.217 | 1.411 ± 0.209 | 1.159 ± 0.194 | 1.814 ± 0.225 | 1.335 ± 0.182 | 3.502 ± 0.274 | 1.109 ± 0.159 | 2.368 ± 0.25 | 4.182 ± 0.865 | 2.519 ± 0.309 | 2.192 ± 0.269 | 2.746 ± 0.272 | 2.645 ± 0.347 | 2.897 ± 0.259 | 0.302 ± 0.1 | 1.411 ± 0.18 | 0.0 ± 0.0 |
| Gln | 2.091 ± 0.207 | 0.831 ± 0.151 | 1.26 ± 0.2 | 1.89 ± 0.259 | 2.343 ± 0.198 | 0.806 ± 0.133 | 1.209 ± 0.194 | 2.57 ± 0.302 | 2.494 ± 0.265 | 5.694 ± 0.377 | 1.436 ± 0.187 | 2.645 ± 0.325 | 1.738 ± 0.225 | 3.804 ± 0.672 | 2.771 ± 0.276 | 1.99 ± 0.312 | 2.948 ± 0.329 | 2.293 ± 0.25 | 0.302 ± 0.102 | 1.764 ± 0.182 | 0.0 ± 0.0 |
| Arg | 4.736 ± 0.41 | 1.638 ± 0.183 | 3.426 ± 0.309 | 2.771 ± 0.296 | 2.545 ± 0.264 | 2.393 ± 0.262 | 1.461 ± 0.209 | 2.595 ± 0.21 | 2.62 ± 0.27 | 5.391 ± 0.34 | 1.285 ± 0.168 | 2.922 ± 0.266 | 2.444 ± 0.222 | 2.469 ± 0.242 | 5.341 ± 0.63 | 3.2 ± 0.344 | 2.57 ± 0.263 | 4.585 ± 0.339 | 0.579 ± 0.125 | 1.99 ± 0.194 | 0.0 ± 0.0 |
| Ser | 4.56 ± 0.436 | 1.159 ± 0.196 | 3.804 ± 0.341 | 2.872 ± 0.283 | 2.393 ± 0.238 | 2.897 ± 0.287 | 0.756 ± 0.132 | 2.822 ± 0.277 | 2.847 ± 0.288 | 5.769 ± 0.431 | 1.31 ± 0.172 | 3.804 ± 0.258 | 2.519 ± 0.307 | 1.94 ± 0.208 | 2.368 ± 0.252 | 4.157 ± 0.479 | 3.124 ± 0.28 | 4.61 ± 0.322 | 0.504 ± 0.096 | 1.965 ± 0.212 | 0.0 ± 0.0 |
| Thr | 4.409 ± 0.303 | 1.134 ± 0.16 | 3.099 ± 0.291 | 2.57 ± 0.246 | 3.376 ± 0.279 | 2.519 ± 0.221 | 1.436 ± 0.192 | 3.174 ± 0.303 | 2.696 ± 0.284 | 5.845 ± 0.382 | 1.335 ± 0.164 | 4.056 ± 0.264 | 3.3 ± 0.408 | 2.141 ± 0.229 | 3.074 ± 0.222 | 3.174 ± 0.282 | 4.585 ± 0.4 | 4.56 ± 0.36 | 0.605 ± 0.131 | 2.293 ± 0.217 | 0.0 ± 0.0 |
| Val | 6.223 ± 0.381 | 2.066 ± 0.216 | 4.333 ± 0.308 | 3.477 ± 0.264 | 3.653 ± 0.327 | 2.192 ± 0.226 | 2.419 ± 0.226 | 3.577 ± 0.343 | 4.888 ± 0.344 | 7.659 ± 0.481 | 1.99 ± 0.21 | 4.711 ± 0.367 | 3.401 ± 0.299 | 3.149 ± 0.28 | 4.157 ± 0.354 | 3.628 ± 0.282 | 4.207 ± 0.309 | 5.24 ± 0.5 | 0.579 ± 0.129 | 3.048 ± 0.255 | 0.0 ± 0.0 |
| Trp | 0.529 ± 0.113 | 0.252 ± 0.078 | 0.605 ± 0.114 | 0.277 ± 0.102 | 0.403 ± 0.097 | 0.328 ± 0.098 | 0.202 ± 0.072 | 0.302 ± 0.075 | 0.378 ± 0.101 | 0.907 ± 0.138 | 0.176 ± 0.074 | 0.655 ± 0.167 | 0.453 ± 0.11 | 0.453 ± 0.111 | 0.857 ± 0.152 | 0.353 ± 0.084 | 0.705 ± 0.138 | 0.403 ± 0.099 | 0.126 ± 0.051 | 0.277 ± 0.083 | 0.0 ± 0.0 |
| Tyr | 2.696 ± 0.268 | 1.159 ± 0.167 | 2.116 ± 0.215 | 2.116 ± 0.208 | 2.066 ± 0.188 | 1.386 ± 0.2 | 0.957 ± 0.14 | 2.015 ± 0.255 | 3.174 ± 0.261 | 3.527 ± 0.342 | 1.184 ± 0.163 | 3.2 ± 0.264 | 0.907 ± 0.136 | 1.109 ± 0.155 | 1.814 ± 0.243 | 2.091 ± 0.269 | 2.847 ± 0.3 | 3.678 ± 0.258 | 0.428 ± 0.101 | 1.915 ± 0.241 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 148 proteins (39694 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
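One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. The exact estimator used by Proteome-pI is not stated here, so taking the sample standard deviation is an assumption, as are the helper names `pair_per_mille` and `bootstrap_error`, the fixed seed, and the toy sequences.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) over all adjacent pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome in one-letter codes (hypothetical sequences).
proteins = ["LLAK", "MLLV", "AKLL", "GGGG"]
print(pair_per_mille(proteins, "LL"), bootstrap_error(proteins, "LL"))
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so a single protein with an unusual composition (for example a proline-rich one) shows up as a larger error for the affected dipeptides.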

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski