Amino acid dipepetide frequency for Helicoverpa armigera granulovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.058AlaAla: 3.058 ± 0.279
1.092AlaCys: 1.092 ± 0.158
3.118AlaAsp: 3.118 ± 0.274
2.78AlaGlu: 2.78 ± 0.294
1.966AlaPhe: 1.966 ± 0.198
2.264AlaGly: 2.264 ± 0.225
1.052AlaHis: 1.052 ± 0.178
2.74AlaIle: 2.74 ± 0.212
2.919AlaLys: 2.919 ± 0.263
4.388AlaLeu: 4.388 ± 0.265
1.191AlaMet: 1.191 ± 0.146
2.859AlaAsn: 2.859 ± 0.289
1.986AlaPro: 1.986 ± 0.215
1.728AlaGln: 1.728 ± 0.187
2.303AlaArg: 2.303 ± 0.212
3.435AlaSer: 3.435 ± 0.211
2.939AlaThr: 2.939 ± 0.249
3.475AlaVal: 3.475 ± 0.229
0.556AlaTrp: 0.556 ± 0.114
2.244AlaTyr: 2.244 ± 0.198
0.0AlaXaa: 0.0 ± 0.0
Cys
1.112CysAla: 1.112 ± 0.15
0.556CysCys: 0.556 ± 0.103
1.787CysAsp: 1.787 ± 0.223
1.35CysGlu: 1.35 ± 0.16
0.854CysPhe: 0.854 ± 0.123
1.271CysGly: 1.271 ± 0.175
0.794CysHis: 0.794 ± 0.131
1.311CysIle: 1.311 ± 0.178
1.489CysLys: 1.489 ± 0.176
1.966CysLeu: 1.966 ± 0.224
0.576CysMet: 0.576 ± 0.103
1.43CysAsn: 1.43 ± 0.168
0.973CysPro: 0.973 ± 0.166
0.774CysGln: 0.774 ± 0.133
1.072CysArg: 1.072 ± 0.141
1.33CysSer: 1.33 ± 0.184
1.43CysThr: 1.43 ± 0.173
2.681CysVal: 2.681 ± 0.258
0.04CysTrp: 0.04 ± 0.029
1.41CysTyr: 1.41 ± 0.18
0.0CysXaa: 0.0 ± 0.0
Asp
3.654AspAla: 3.654 ± 0.237
1.35AspCys: 1.35 ± 0.172
5.342AspAsp: 5.342 ± 0.453
4.031AspGlu: 4.031 ± 0.287
2.701AspPhe: 2.701 ± 0.255
3.455AspGly: 3.455 ± 0.273
1.529AspHis: 1.529 ± 0.223
3.118AspIle: 3.118 ± 0.264
4.369AspLys: 4.369 ± 0.345
5.779AspLeu: 5.779 ± 0.35
1.529AspMet: 1.529 ± 0.19
4.627AspAsn: 4.627 ± 0.339
1.986AspPro: 1.986 ± 0.214
2.105AspGln: 2.105 ± 0.186
2.701AspArg: 2.701 ± 0.25
4.13AspSer: 4.13 ± 0.343
3.991AspThr: 3.991 ± 0.313
4.944AspVal: 4.944 ± 0.324
0.695AspTrp: 0.695 ± 0.118
3.376AspTyr: 3.376 ± 0.316
0.0AspXaa: 0.0 ± 0.0
Glu
2.403GluAla: 2.403 ± 0.2
1.549GluCys: 1.549 ± 0.206
2.959GluAsp: 2.959 ± 0.219
4.23GluGlu: 4.23 ± 0.417
2.661GluPhe: 2.661 ± 0.228
1.489GluGly: 1.489 ± 0.172
1.45GluHis: 1.45 ± 0.155
3.594GluIle: 3.594 ± 0.308
3.554GluLys: 3.554 ± 0.315
5.659GluLeu: 5.659 ± 0.315
1.767GluMet: 1.767 ± 0.203
3.892GluAsn: 3.892 ± 0.289
1.986GluPro: 1.986 ± 0.225
2.343GluGln: 2.343 ± 0.249
2.74GluArg: 2.74 ± 0.282
3.197GluSer: 3.197 ± 0.248
2.998GluThr: 2.998 ± 0.208
2.72GluVal: 2.72 ± 0.219
0.536GluTrp: 0.536 ± 0.103
2.74GluTyr: 2.74 ± 0.235
0.0GluXaa: 0.0 ± 0.0
Phe
2.045PheAla: 2.045 ± 0.22
1.191PheCys: 1.191 ± 0.176
3.813PheAsp: 3.813 ± 0.296
2.879PheGlu: 2.879 ± 0.287
1.708PhePhe: 1.708 ± 0.186
1.668PheGly: 1.668 ± 0.183
0.774PheHis: 0.774 ± 0.121
2.899PheIle: 2.899 ± 0.253
3.177PheLys: 3.177 ± 0.249
3.435PheLeu: 3.435 ± 0.261
0.993PheMet: 0.993 ± 0.136
2.899PheAsn: 2.899 ± 0.263
1.291PhePro: 1.291 ± 0.186
1.37PheGln: 1.37 ± 0.209
2.006PheArg: 2.006 ± 0.199
2.323PheSer: 2.323 ± 0.229
2.8PheThr: 2.8 ± 0.273
4.369PheVal: 4.369 ± 0.347
0.298PheTrp: 0.298 ± 0.079
2.323PheTyr: 2.323 ± 0.206
0.0PheXaa: 0.0 ± 0.0
Gly
1.827GlyAla: 1.827 ± 0.215
0.794GlyCys: 0.794 ± 0.136
2.899GlyAsp: 2.899 ± 0.276
1.688GlyGlu: 1.688 ± 0.182
1.45GlyPhe: 1.45 ± 0.194
2.303GlyGly: 2.303 ± 0.264
0.854GlyHis: 0.854 ± 0.137
1.767GlyIle: 1.767 ± 0.176
2.125GlyLys: 2.125 ± 0.218
3.118GlyLeu: 3.118 ± 0.215
0.874GlyMet: 0.874 ± 0.139
1.966GlyAsn: 1.966 ± 0.216
1.33GlyPro: 1.33 ± 0.202
1.37GlyGln: 1.37 ± 0.208
1.847GlyArg: 1.847 ± 0.191
2.184GlySer: 2.184 ± 0.207
2.224GlyThr: 2.224 ± 0.21
3.753GlyVal: 3.753 ± 0.328
0.397GlyTrp: 0.397 ± 0.095
2.085GlyTyr: 2.085 ± 0.234
0.0GlyXaa: 0.0 ± 0.0
His
1.33HisAla: 1.33 ± 0.173
0.616HisCys: 0.616 ± 0.115
1.509HisAsp: 1.509 ± 0.16
1.231HisGlu: 1.231 ± 0.151
0.953HisPhe: 0.953 ± 0.135
1.033HisGly: 1.033 ± 0.131
0.675HisHis: 0.675 ± 0.123
1.211HisIle: 1.211 ± 0.182
1.589HisLys: 1.589 ± 0.207
2.184HisLeu: 2.184 ± 0.211
0.596HisMet: 0.596 ± 0.117
1.668HisAsn: 1.668 ± 0.188
0.774HisPro: 0.774 ± 0.129
0.913HisGln: 0.913 ± 0.137
0.973HisArg: 0.973 ± 0.125
1.608HisSer: 1.608 ± 0.166
1.668HisThr: 1.668 ± 0.226
1.728HisVal: 1.728 ± 0.184
0.258HisTrp: 0.258 ± 0.061
1.092HisTyr: 1.092 ± 0.135
0.0HisXaa: 0.0 ± 0.0
Ile
3.038IleAla: 3.038 ± 0.319
1.033IleCys: 1.033 ± 0.136
4.388IleAsp: 4.388 ± 0.318
3.634IleGlu: 3.634 ± 0.275
2.284IlePhe: 2.284 ± 0.23
1.926IleGly: 1.926 ± 0.185
1.231IleHis: 1.231 ± 0.175
3.614IleIle: 3.614 ± 0.261
4.984IleLys: 4.984 ± 0.317
4.706IleLeu: 4.706 ± 0.296
1.668IleMet: 1.668 ± 0.193
5.481IleAsn: 5.481 ± 0.328
1.966IlePro: 1.966 ± 0.242
2.006IleGln: 2.006 ± 0.198
2.78IleArg: 2.78 ± 0.238
3.237IleSer: 3.237 ± 0.275
3.455IleThr: 3.455 ± 0.246
4.468IleVal: 4.468 ± 0.305
0.457IleTrp: 0.457 ± 0.084
2.303IleTyr: 2.303 ± 0.237
0.0IleXaa: 0.0 ± 0.0
Lys
2.184LysAla: 2.184 ± 0.218
1.728LysCys: 1.728 ± 0.25
3.376LysAsp: 3.376 ± 0.315
3.356LysGlu: 3.356 ± 0.284
3.356LysPhe: 3.356 ± 0.271
1.549LysGly: 1.549 ± 0.192
2.145LysHis: 2.145 ± 0.233
5.223LysIle: 5.223 ± 0.373
5.342LysLys: 5.342 ± 0.496
6.97LysLeu: 6.97 ± 0.498
1.787LysMet: 1.787 ± 0.226
4.944LysAsn: 4.944 ± 0.288
2.403LysPro: 2.403 ± 0.25
3.038LysGln: 3.038 ± 0.266
4.587LysArg: 4.587 ± 0.351
4.13LysSer: 4.13 ± 0.315
3.634LysThr: 3.634 ± 0.283
3.455LysVal: 3.455 ± 0.294
0.755LysTrp: 0.755 ± 0.101
3.693LysTyr: 3.693 ± 0.286
0.0LysXaa: 0.0 ± 0.0
Leu
4.706LeuAla: 4.706 ± 0.318
2.522LeuCys: 2.522 ± 0.202
5.719LeuAsp: 5.719 ± 0.362
4.508LeuGlu: 4.508 ± 0.32
4.19LeuPhe: 4.19 ± 0.221
2.82LeuGly: 2.82 ± 0.247
2.323LeuHis: 2.323 ± 0.243
5.262LeuIle: 5.262 ± 0.334
6.97LeuLys: 6.97 ± 0.483
9.79LeuLeu: 9.79 ± 0.463
2.363LeuMet: 2.363 ± 0.23
5.977LeuAsn: 5.977 ± 0.346
4.17LeuPro: 4.17 ± 0.269
4.488LeuGln: 4.488 ± 0.403
4.508LeuArg: 4.508 ± 0.281
5.421LeuSer: 5.421 ± 0.362
5.501LeuThr: 5.501 ± 0.336
6.493LeuVal: 6.493 ± 0.358
0.894LeuTrp: 0.894 ± 0.122
5.322LeuTyr: 5.322 ± 0.332
0.0LeuXaa: 0.0 ± 0.0
Met
1.41MetAla: 1.41 ± 0.149
0.477MetCys: 0.477 ± 0.112
1.807MetAsp: 1.807 ± 0.173
1.251MetGlu: 1.251 ± 0.184
1.529MetPhe: 1.529 ± 0.187
0.834MetGly: 0.834 ± 0.112
0.477MetHis: 0.477 ± 0.087
1.311MetIle: 1.311 ± 0.182
1.628MetLys: 1.628 ± 0.19
2.661MetLeu: 2.661 ± 0.225
0.695MetMet: 0.695 ± 0.116
1.688MetAsn: 1.688 ± 0.161
0.735MetPro: 0.735 ± 0.107
0.874MetGln: 0.874 ± 0.14
1.132MetArg: 1.132 ± 0.12
1.946MetSer: 1.946 ± 0.205
1.469MetThr: 1.469 ± 0.169
2.224MetVal: 2.224 ± 0.219
0.238MetTrp: 0.238 ± 0.07
1.37MetTyr: 1.37 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
2.959AsnAla: 2.959 ± 0.261
1.708AsnCys: 1.708 ± 0.206
4.488AsnAsp: 4.488 ± 0.349
3.773AsnGlu: 3.773 ± 0.333
3.018AsnPhe: 3.018 ± 0.212
2.581AsnGly: 2.581 ± 0.198
1.172AsnHis: 1.172 ± 0.179
4.706AsnIle: 4.706 ± 0.317
4.984AsnLys: 4.984 ± 0.384
5.699AsnLeu: 5.699 ± 0.291
1.906AsnMet: 1.906 ± 0.241
5.779AsnAsn: 5.779 ± 0.401
2.323AsnPro: 2.323 ± 0.254
2.363AsnGln: 2.363 ± 0.203
2.74AsnArg: 2.74 ± 0.251
4.766AsnSer: 4.766 ± 0.279
4.925AsnThr: 4.925 ± 0.292
5.163AsnVal: 5.163 ± 0.314
0.735AsnTrp: 0.735 ± 0.131
3.316AsnTyr: 3.316 ± 0.257
0.0AsnXaa: 0.0 ± 0.0
Pro
2.065ProAla: 2.065 ± 0.251
0.596ProCys: 0.596 ± 0.117
2.76ProAsp: 2.76 ± 0.23
2.025ProGlu: 2.025 ± 0.231
1.668ProPhe: 1.668 ± 0.21
1.39ProGly: 1.39 ± 0.216
0.814ProHis: 0.814 ± 0.141
2.184ProIle: 2.184 ± 0.196
1.469ProLys: 1.469 ± 0.186
3.316ProLeu: 3.316 ± 0.272
0.814ProMet: 0.814 ± 0.123
2.383ProAsn: 2.383 ± 0.218
3.098ProPro: 3.098 ± 0.858
1.569ProGln: 1.569 ± 0.213
1.589ProArg: 1.589 ± 0.173
2.661ProSer: 2.661 ± 0.266
2.562ProThr: 2.562 ± 0.264
3.515ProVal: 3.515 ± 0.339
0.377ProTrp: 0.377 ± 0.087
1.986ProTyr: 1.986 ± 0.184
0.0ProXaa: 0.0 ± 0.0
Gln
1.668GlnAla: 1.668 ± 0.205
1.33GlnCys: 1.33 ± 0.162
1.648GlnAsp: 1.648 ± 0.199
1.966GlnGlu: 1.966 ± 0.259
1.847GlnPhe: 1.847 ± 0.178
1.052GlnGly: 1.052 ± 0.139
1.251GlnHis: 1.251 ± 0.148
2.303GlnIle: 2.303 ± 0.26
2.323GlnLys: 2.323 ± 0.245
4.885GlnLeu: 4.885 ± 0.337
0.894GlnMet: 0.894 ± 0.135
2.502GlnAsn: 2.502 ± 0.247
1.668GlnPro: 1.668 ± 0.193
2.462GlnGln: 2.462 ± 0.276
1.966GlnArg: 1.966 ± 0.226
2.343GlnSer: 2.343 ± 0.233
2.502GlnThr: 2.502 ± 0.242
2.025GlnVal: 2.025 ± 0.196
0.298GlnTrp: 0.298 ± 0.081
2.045GlnTyr: 2.045 ± 0.198
0.0GlnXaa: 0.0 ± 0.0
Arg
1.906ArgAla: 1.906 ± 0.174
1.191ArgCys: 1.191 ± 0.14
2.879ArgAsp: 2.879 ± 0.234
2.403ArgGlu: 2.403 ± 0.224
2.224ArgPhe: 2.224 ± 0.225
1.767ArgGly: 1.767 ± 0.196
1.41ArgHis: 1.41 ± 0.173
2.82ArgIle: 2.82 ± 0.264
3.137ArgLys: 3.137 ± 0.254
5.52ArgLeu: 5.52 ± 0.338
1.191ArgMet: 1.191 ± 0.153
2.959ArgAsn: 2.959 ± 0.298
2.025ArgPro: 2.025 ± 0.224
2.045ArgGln: 2.045 ± 0.208
2.998ArgArg: 2.998 ± 0.349
3.177ArgSer: 3.177 ± 0.38
2.462ArgThr: 2.462 ± 0.27
4.051ArgVal: 4.051 ± 0.322
0.397ArgTrp: 0.397 ± 0.081
2.224ArgTyr: 2.224 ± 0.224
0.0ArgXaa: 0.0 ± 0.0
Ser
2.581SerAla: 2.581 ± 0.266
1.469SerCys: 1.469 ± 0.196
3.952SerAsp: 3.952 ± 0.253
3.118SerGlu: 3.118 ± 0.228
3.118SerPhe: 3.118 ± 0.254
2.661SerGly: 2.661 ± 0.224
1.41SerHis: 1.41 ± 0.143
3.614SerIle: 3.614 ± 0.296
3.991SerLys: 3.991 ± 0.336
5.798SerLeu: 5.798 ± 0.343
1.509SerMet: 1.509 ± 0.153
3.991SerAsn: 3.991 ± 0.242
2.562SerPro: 2.562 ± 0.26
2.244SerGln: 2.244 ± 0.186
3.217SerArg: 3.217 ± 0.283
5.421SerSer: 5.421 ± 0.468
4.547SerThr: 4.547 ± 0.348
4.905SerVal: 4.905 ± 0.354
0.596SerTrp: 0.596 ± 0.127
2.8SerTyr: 2.8 ± 0.265
0.0SerXaa: 0.0 ± 0.0
Thr
2.701ThrAla: 2.701 ± 0.247
1.271ThrCys: 1.271 ± 0.166
3.813ThrAsp: 3.813 ± 0.257
3.038ThrGlu: 3.038 ± 0.309
2.601ThrPhe: 2.601 ± 0.225
2.125ThrGly: 2.125 ± 0.207
1.291ThrHis: 1.291 ± 0.163
3.793ThrIle: 3.793 ± 0.342
4.408ThrLys: 4.408 ± 0.278
5.779ThrLeu: 5.779 ± 0.323
1.708ThrMet: 1.708 ± 0.175
4.845ThrAsn: 4.845 ± 0.335
2.423ThrPro: 2.423 ± 0.217
2.303ThrGln: 2.303 ± 0.227
3.257ThrArg: 3.257 ± 0.228
4.309ThrSer: 4.309 ± 0.285
5.044ThrThr: 5.044 ± 0.484
4.607ThrVal: 4.607 ± 0.27
0.695ThrTrp: 0.695 ± 0.136
2.661ThrTyr: 2.661 ± 0.215
0.0ThrXaa: 0.0 ± 0.0
Val
4.21ValAla: 4.21 ± 0.285
2.125ValCys: 2.125 ± 0.206
5.143ValAsp: 5.143 ± 0.291
3.852ValGlu: 3.852 ± 0.325
3.376ValPhe: 3.376 ± 0.258
2.403ValGly: 2.403 ± 0.257
1.589ValHis: 1.589 ± 0.159
4.11ValIle: 4.11 ± 0.299
5.103ValLys: 5.103 ± 0.375
6.414ValLeu: 6.414 ± 0.372
2.065ValMet: 2.065 ± 0.221
4.647ValAsn: 4.647 ± 0.279
3.257ValPro: 3.257 ± 0.282
2.78ValGln: 2.78 ± 0.256
3.574ValArg: 3.574 ± 0.287
4.19ValSer: 4.19 ± 0.329
5.044ValThr: 5.044 ± 0.298
5.659ValVal: 5.659 ± 0.401
0.695ValTrp: 0.695 ± 0.107
4.369ValTyr: 4.369 ± 0.295
0.0ValXaa: 0.0 ± 0.0
Trp
0.417TrpAla: 0.417 ± 0.085
0.318TrpCys: 0.318 ± 0.091
0.457TrpAsp: 0.457 ± 0.106
0.516TrpGlu: 0.516 ± 0.123
0.338TrpPhe: 0.338 ± 0.073
0.417TrpGly: 0.417 ± 0.11
0.179TrpHis: 0.179 ± 0.061
0.258TrpIle: 0.258 ± 0.072
0.496TrpLys: 0.496 ± 0.101
1.251TrpLeu: 1.251 ± 0.191
0.218TrpMet: 0.218 ± 0.071
0.596TrpAsn: 0.596 ± 0.122
0.457TrpPro: 0.457 ± 0.093
0.417TrpGln: 0.417 ± 0.093
0.675TrpArg: 0.675 ± 0.084
0.774TrpSer: 0.774 ± 0.132
0.496TrpThr: 0.496 ± 0.095
0.457TrpVal: 0.457 ± 0.082
0.298TrpTrp: 0.298 ± 0.073
0.655TrpTyr: 0.655 ± 0.116
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.82TyrAla: 2.82 ± 0.222
1.291TyrCys: 1.291 ± 0.171
3.554TyrAsp: 3.554 ± 0.29
2.84TyrGlu: 2.84 ± 0.242
2.423TyrPhe: 2.423 ± 0.229
1.847TyrGly: 1.847 ± 0.213
1.191TyrHis: 1.191 ± 0.145
2.82TyrIle: 2.82 ± 0.317
3.594TyrLys: 3.594 ± 0.231
4.647TyrLeu: 4.647 ± 0.354
1.39TyrMet: 1.39 ± 0.159
3.971TyrAsn: 3.971 ± 0.328
1.43TyrPro: 1.43 ± 0.162
1.787TyrGln: 1.787 ± 0.182
2.125TyrArg: 2.125 ± 0.18
2.919TyrSer: 2.919 ± 0.254
2.979TyrThr: 2.979 ± 0.271
3.912TyrVal: 3.912 ± 0.272
0.477TyrTrp: 0.477 ± 0.098
2.879TyrTyr: 2.879 ± 0.216
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 179 proteins (50360 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski