Amino acid dipeptide frequency for Mythimna unipuncta granulovirus B

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
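As a minimal sketch of how such frequencies could be computed (not necessarily how Proteome-pI does it), the Python function below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the mapping of non-standard letters to X (Xaa), the normalization by the total number of pairs, and the toy sequences are all assumptions made for illustration.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(ch if ch in STANDARD else "X" for ch in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input, not taken from the proteome.
toy = ["MALLV", "MLLK"]
freqs = dipeptide_permille(toy)
```

Here the two toy sequences contain seven pairs in total, two of which are LL, so `freqs["LL"]` is 2/7 expressed in per mille.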

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
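The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the function names are hypothetical, the threshold is the plain 1/400 expectation described above, and the error margins are ignored, which may differ from how the colors on the original page were assigned.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation: every ordered pair equally likely, in per mille."""
    return 1000.0 / (n_residue_types ** 2)

def classify(observed_permille, expected=None):
    """Label a dipeptide as over- or underrepresented relative to `expected`."""
    if expected is None:
        expected = expected_permille()
    if observed_permille > expected:
        return "over"      # would be shown in red
    if observed_permille < expected:
        return "under"     # would be shown in blue
    return "expected"

# Values copied from the table (per mille).
examples = {"LeuLeu": 9.798, "ValVal": 7.553, "CysTrp": 0.094, "GlyGly": 2.572}
labels = {name: classify(value) for name, value in examples.items()}
```

With 20 residue types the expectation is 2.5‰, so LeuLeu and ValVal come out as overrepresented and CysTrp as underrepresented; passing `expected_permille(21)` instead would use the 441-pair expectation that includes Xaa.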

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.42 ± 0.495
AlaCys: 1.169 ± 0.197
AlaAsp: 3.858 ± 0.383
AlaGlu: 3.391 ± 0.356
AlaPhe: 2.221 ± 0.262
AlaGly: 2.619 ± 0.287
AlaHis: 1.38 ± 0.172
AlaIle: 2.689 ± 0.236
AlaLys: 2.642 ± 0.296
AlaLeu: 5.074 ± 0.336
AlaMet: 1.473 ± 0.209
AlaAsn: 3.18 ± 0.345
AlaPro: 2.315 ± 0.269
AlaGln: 2.198 ± 0.232
AlaArg: 2.876 ± 0.24
AlaSer: 3.812 ± 0.34
AlaThr: 3.274 ± 0.312
AlaVal: 4.209 ± 0.316
AlaTrp: 0.444 ± 0.12
AlaTyr: 2.479 ± 0.283
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.239 ± 0.194
CysCys: 0.491 ± 0.117
CysAsp: 1.801 ± 0.202
CysGlu: 1.286 ± 0.174
CysPhe: 0.842 ± 0.177
CysGly: 1.426 ± 0.218
CysHis: 0.631 ± 0.153
CysIle: 1.263 ± 0.204
CysLys: 1.146 ± 0.222
CysLeu: 2.081 ± 0.237
CysMet: 0.608 ± 0.109
CysAsn: 1.239 ± 0.217
CysPro: 0.982 ± 0.152
CysGln: 0.631 ± 0.137
CysArg: 1.333 ± 0.18
CysSer: 1.333 ± 0.159
CysThr: 1.567 ± 0.221
CysVal: 2.455 ± 0.264
CysTrp: 0.094 ± 0.047
CysTyr: 1.146 ± 0.192
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.396 ± 0.382
AspCys: 1.52 ± 0.191
AspAsp: 5.285 ± 0.417
AspGlu: 4.209 ± 0.288
AspPhe: 2.315 ± 0.234
AspGly: 3.367 ± 0.333
AspHis: 1.59 ± 0.218
AspIle: 2.9 ± 0.268
AspLys: 3.765 ± 0.293
AspLeu: 5.238 ± 0.37
AspMet: 1.684 ± 0.228
AspAsn: 3.671 ± 0.308
AspPro: 2.011 ± 0.2
AspGln: 1.754 ± 0.249
AspArg: 2.806 ± 0.213
AspSer: 4.396 ± 0.26
AspThr: 3.648 ± 0.308
AspVal: 6.197 ± 0.373
AspTrp: 0.772 ± 0.143
AspTyr: 3.157 ± 0.277
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.484 ± 0.329
GluCys: 1.45 ± 0.199
GluAsp: 3.11 ± 0.287
GluGlu: 4.981 ± 0.662
GluPhe: 2.502 ± 0.232
GluGly: 2.058 ± 0.249
GluHis: 1.824 ± 0.259
GluIle: 3.25 ± 0.282
GluLys: 2.806 ± 0.336
GluLeu: 5.729 ± 0.416
GluMet: 1.543 ± 0.198
GluAsn: 3.414 ± 0.258
GluPro: 2.666 ± 0.313
GluGln: 1.777 ± 0.221
GluArg: 3.648 ± 0.32
GluSer: 2.993 ± 0.235
GluThr: 3.344 ± 0.31
GluVal: 3.18 ± 0.31
GluTrp: 0.585 ± 0.112
GluTyr: 1.964 ± 0.212
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.385 ± 0.219
PheCys: 0.795 ± 0.177
PheAsp: 3.718 ± 0.307
PheGlu: 2.97 ± 0.27
PhePhe: 2.034 ± 0.346
PheGly: 1.894 ± 0.196
PheHis: 1.099 ± 0.18
PheIle: 2.292 ± 0.223
PheLys: 2.596 ± 0.26
PheLeu: 3.461 ± 0.323
PheMet: 0.912 ± 0.15
PheAsn: 2.993 ± 0.276
PhePro: 1.099 ± 0.183
PheGln: 1.473 ± 0.173
PheArg: 2.479 ± 0.181
PheSer: 2.455 ± 0.309
PheThr: 2.385 ± 0.278
PheVal: 4.747 ± 0.37
PheTrp: 0.187 ± 0.068
PheTyr: 1.964 ± 0.257
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.432 ± 0.262
GlyCys: 0.865 ± 0.152
GlyAsp: 2.9 ± 0.278
GlyGlu: 2.151 ± 0.231
GlyPhe: 1.66 ± 0.204
GlyGly: 2.572 ± 0.286
GlyHis: 0.935 ± 0.147
GlyIle: 1.543 ± 0.215
GlyLys: 2.011 ± 0.259
GlyLeu: 3.648 ± 0.323
GlyMet: 0.912 ± 0.146
GlyAsn: 1.801 ± 0.209
GlyPro: 1.567 ± 0.175
GlyGln: 1.099 ± 0.153
GlyArg: 2.525 ± 0.259
GlySer: 2.245 ± 0.255
GlyThr: 2.198 ± 0.235
GlyVal: 4.233 ± 0.334
GlyTrp: 0.398 ± 0.091
GlyTyr: 1.824 ± 0.254
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.356 ± 0.191
HisCys: 0.561 ± 0.14
HisAsp: 2.011 ± 0.215
HisGlu: 1.473 ± 0.194
HisPhe: 1.426 ± 0.178
HisGly: 1.169 ± 0.187
HisHis: 0.795 ± 0.136
HisIle: 1.216 ± 0.172
HisLys: 1.52 ± 0.204
HisLeu: 2.292 ± 0.224
HisMet: 0.561 ± 0.121
HisAsn: 2.198 ± 0.236
HisPro: 1.099 ± 0.16
HisGln: 0.842 ± 0.152
HisArg: 1.707 ± 0.257
HisSer: 1.567 ± 0.212
HisThr: 1.66 ± 0.228
HisVal: 2.292 ± 0.322
HisTrp: 0.164 ± 0.063
HisTyr: 1.169 ± 0.179
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.11 ± 0.303
IleCys: 1.076 ± 0.152
IleAsp: 4.396 ± 0.307
IleGlu: 3.461 ± 0.303
IlePhe: 2.198 ± 0.281
IleGly: 1.871 ± 0.215
IleHis: 1.216 ± 0.158
IleIle: 2.806 ± 0.29
IleLys: 3.531 ± 0.359
IleLeu: 3.625 ± 0.35
IleMet: 1.871 ± 0.227
IleAsn: 3.461 ± 0.281
IlePro: 2.175 ± 0.355
IleGln: 1.31 ± 0.195
IleArg: 2.736 ± 0.219
IleSer: 2.619 ± 0.239
IleThr: 3.087 ± 0.395
IleVal: 4.466 ± 0.368
IleTrp: 0.444 ± 0.111
IleTyr: 2.175 ± 0.217
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.058 ± 0.206
LysCys: 1.45 ± 0.216
LysAsp: 2.689 ± 0.35
LysGlu: 2.736 ± 0.293
LysPhe: 2.572 ± 0.289
LysGly: 1.45 ± 0.201
LysHis: 2.432 ± 0.257
LysIle: 3.765 ± 0.307
LysLys: 4.279 ± 0.481
LysLeu: 5.612 ± 0.413
LysMet: 1.543 ± 0.182
LysAsn: 3.531 ± 0.356
LysPro: 2.525 ± 0.283
LysGln: 2.432 ± 0.283
LysArg: 4.864 ± 0.378
LysSer: 3.25 ± 0.349
LysThr: 2.993 ± 0.287
LysVal: 2.9 ± 0.303
LysTrp: 0.421 ± 0.088
LysTyr: 2.666 ± 0.229
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.215 ± 0.393
LeuCys: 2.151 ± 0.213
LeuAsp: 5.355 ± 0.319
LeuGlu: 4.7 ± 0.333
LeuPhe: 4.233 ± 0.302
LeuGly: 3.321 ± 0.259
LeuHis: 2.362 ± 0.293
LeuIle: 5.098 ± 0.338
LeuLys: 6.173 ± 0.494
LeuLeu: 9.798 ± 0.531
LeuMet: 2.315 ± 0.293
LeuAsn: 5.402 ± 0.419
LeuPro: 4.022 ± 0.41
LeuGln: 4.233 ± 0.32
LeuArg: 5.776 ± 0.371
LeuSer: 5.589 ± 0.455
LeuThr: 5.332 ± 0.294
LeuVal: 6.922 ± 0.389
LeuTrp: 0.912 ± 0.157
LeuTyr: 5.191 ± 0.446
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.52 ± 0.158
MetCys: 1.052 ± 0.155
MetAsp: 1.66 ± 0.174
MetGlu: 1.169 ± 0.17
MetPhe: 1.239 ± 0.146
MetGly: 0.912 ± 0.145
MetHis: 0.561 ± 0.111
MetIle: 1.193 ± 0.164
MetLys: 1.193 ± 0.179
MetLeu: 3.18 ± 0.298
MetMet: 0.678 ± 0.129
MetAsn: 1.356 ± 0.182
MetPro: 0.631 ± 0.125
MetGln: 1.052 ± 0.158
MetArg: 1.45 ± 0.184
MetSer: 1.847 ± 0.197
MetThr: 1.356 ± 0.184
MetVal: 2.198 ± 0.225
MetTrp: 0.187 ± 0.068
MetTyr: 1.356 ± 0.176
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.993 ± 0.27
AsnCys: 1.216 ± 0.194
AsnAsp: 3.929 ± 0.331
AsnGlu: 3.741 ± 0.316
AsnPhe: 2.572 ± 0.259
AsnGly: 2.385 ± 0.232
AsnHis: 1.356 ± 0.159
AsnIle: 3.741 ± 0.288
AsnLys: 3.367 ± 0.323
AsnLeu: 5.028 ± 0.325
AsnMet: 1.426 ± 0.192
AsnAsn: 4.653 ± 0.436
AsnPro: 2.105 ± 0.256
AsnGln: 2.128 ± 0.254
AsnArg: 3.133 ± 0.3
AsnSer: 3.952 ± 0.361
AsnThr: 3.882 ± 0.297
AsnVal: 5.098 ± 0.32
AsnTrp: 0.702 ± 0.106
AsnTyr: 2.479 ± 0.214
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.409 ± 0.202
ProCys: 0.935 ± 0.158
ProAsp: 2.596 ± 0.268
ProGlu: 2.292 ± 0.305
ProPhe: 1.473 ± 0.181
ProGly: 1.543 ± 0.219
ProHis: 1.31 ± 0.182
ProIle: 2.409 ± 0.356
ProLys: 1.684 ± 0.186
ProLeu: 3.297 ± 0.323
ProMet: 0.912 ± 0.122
ProAsn: 1.964 ± 0.232
ProPro: 5.846 ± 1.695
ProGln: 1.356 ± 0.214
ProArg: 2.385 ± 0.282
ProSer: 2.806 ± 0.314
ProThr: 3.157 ± 0.67
ProVal: 3.508 ± 0.429
ProTrp: 0.281 ± 0.107
ProTyr: 1.988 ± 0.225
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.871 ± 0.294
GlnCys: 1.099 ± 0.144
GlnAsp: 1.45 ± 0.227
GlnGlu: 1.497 ± 0.207
GlnPhe: 1.918 ± 0.217
GlnGly: 1.099 ± 0.17
GlnHis: 1.31 ± 0.187
GlnIle: 1.824 ± 0.255
GlnLys: 2.128 ± 0.229
GlnLeu: 4.233 ± 0.38
GlnMet: 1.052 ± 0.169
GlnAsn: 1.497 ± 0.187
GlnPro: 1.684 ± 0.231
GlnGln: 2.268 ± 0.278
GlnArg: 2.292 ± 0.224
GlnSer: 2.198 ± 0.182
GlnThr: 2.198 ± 0.231
GlnVal: 1.964 ± 0.16
GlnTrp: 0.257 ± 0.073
GlnTyr: 1.777 ± 0.215
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.713 ± 0.237
ArgCys: 1.497 ± 0.176
ArgAsp: 3.437 ± 0.278
ArgGlu: 2.619 ± 0.314
ArgPhe: 2.993 ± 0.242
ArgGly: 2.292 ± 0.281
ArgHis: 2.221 ± 0.252
ArgIle: 2.9 ± 0.273
ArgLys: 2.97 ± 0.267
ArgLeu: 7.085 ± 0.442
ArgMet: 1.286 ± 0.166
ArgAsn: 3.882 ± 0.256
ArgPro: 2.034 ± 0.273
ArgGln: 2.596 ± 0.309
ArgArg: 4.303 ± 0.566
ArgSer: 3.461 ± 0.494
ArgThr: 3.133 ± 0.308
ArgVal: 4.957 ± 0.404
ArgTrp: 0.608 ± 0.117
ArgTyr: 3.133 ± 0.309
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.11 ± 0.289
SerCys: 1.239 ± 0.186
SerAsp: 3.929 ± 0.332
SerGlu: 2.736 ± 0.27
SerPhe: 3.227 ± 0.296
SerGly: 2.666 ± 0.243
SerHis: 1.356 ± 0.192
SerIle: 3.437 ± 0.256
SerLys: 3.087 ± 0.296
SerLeu: 6.173 ± 0.383
SerMet: 1.38 ± 0.176
SerAsn: 4.022 ± 0.387
SerPro: 2.362 ± 0.234
SerGln: 2.011 ± 0.225
SerArg: 3.788 ± 0.42
SerSer: 5.636 ± 0.583
SerThr: 4.279 ± 0.34
SerVal: 5.519 ± 0.342
SerTrp: 0.421 ± 0.099
SerTyr: 2.432 ± 0.224
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.367 ± 0.379
ThrCys: 1.216 ± 0.162
ThrAsp: 3.414 ± 0.321
ThrGlu: 2.97 ± 0.326
ThrPhe: 2.455 ± 0.291
ThrGly: 2.292 ± 0.238
ThrHis: 1.38 ± 0.198
ThrIle: 3.344 ± 0.332
ThrLys: 2.993 ± 0.291
ThrLeu: 5.472 ± 0.313
ThrMet: 1.707 ± 0.191
ThrAsn: 4.186 ± 0.377
ThrPro: 3.695 ± 0.796
ThrGln: 1.847 ± 0.186
ThrArg: 3.999 ± 0.317
ThrSer: 4.279 ± 0.35
ThrThr: 5.449 ± 0.798
ThrVal: 4.864 ± 0.353
ThrTrp: 0.398 ± 0.103
ThrTyr: 2.058 ± 0.231
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.77 ± 0.335
ValCys: 2.245 ± 0.249
ValAsp: 5.332 ± 0.331
ValGlu: 4.887 ± 0.365
ValPhe: 3.601 ± 0.318
ValGly: 2.876 ± 0.253
ValHis: 1.988 ± 0.196
ValIle: 3.695 ± 0.33
ValLys: 4.911 ± 0.368
ValLeu: 7.296 ± 0.425
ValMet: 2.385 ± 0.259
ValAsn: 3.741 ± 0.291
ValPro: 3.648 ± 0.364
ValGln: 2.9 ± 0.275
ValArg: 4.583 ± 0.375
ValSer: 4.934 ± 0.345
ValThr: 5.051 ± 0.321
ValVal: 7.553 ± 0.521
ValTrp: 0.959 ± 0.16
ValTyr: 4.396 ± 0.324
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.327 ± 0.086
TrpCys: 0.444 ± 0.119
TrpAsp: 0.398 ± 0.106
TrpGlu: 0.351 ± 0.091
TrpPhe: 0.398 ± 0.097
TrpGly: 0.304 ± 0.079
TrpHis: 0.257 ± 0.083
TrpIle: 0.234 ± 0.063
TrpLys: 0.304 ± 0.077
TrpLeu: 1.169 ± 0.171
TrpMet: 0.187 ± 0.058
TrpAsn: 0.561 ± 0.124
TrpPro: 0.234 ± 0.084
TrpGln: 0.514 ± 0.133
TrpArg: 0.702 ± 0.125
TrpSer: 0.865 ± 0.165
TrpThr: 0.608 ± 0.116
TrpVal: 0.444 ± 0.095
TrpTrp: 0.281 ± 0.097
TrpTyr: 0.398 ± 0.103
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.549 ± 0.258
TyrCys: 1.122 ± 0.162
TyrAsp: 3.414 ± 0.324
TyrGlu: 2.689 ± 0.259
TyrPhe: 2.058 ± 0.21
TyrGly: 1.403 ± 0.213
TyrHis: 1.076 ± 0.151
TyrIle: 2.034 ± 0.254
TyrLys: 3.063 ± 0.295
TyrLeu: 4.607 ± 0.396
TyrMet: 1.333 ± 0.189
TyrAsn: 3.133 ± 0.248
TyrPro: 1.45 ± 0.177
TyrGln: 1.216 ± 0.169
TyrArg: 2.736 ± 0.284
TyrSer: 2.572 ± 0.284
TyrThr: 2.783 ± 0.266
TyrVal: 4.045 ± 0.327
TyrTrp: 0.468 ± 0.109
TyrTyr: 3.017 ± 0.315
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 153 proteins (42765 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
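A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 replicates is taken as the error. The function names, the fixed seed, the toy sequences, and the choice of the sample standard deviation as the error measure are assumptions; the source does not state which statistic is reported.

```python
import random
import statistics
from collections import Counter

def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Spread of the frequency over resamples of whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_boot):
        # Resample proteins (not residues) with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        replicates.append(pair_permille(sample, pair))
    # Assumption: report the sample standard deviation of the replicates.
    return statistics.stdev(replicates)

# Hypothetical toy proteome, not taken from the database.
toy = ["MALLV", "MLLK", "GLLLA", "MKTAY"]
err = bootstrap_error(toy, "LL")
```

Resampling at the protein level keeps each protein's residues together, so proteins with unusual composition (for example proline-rich ones) contribute to the estimated spread as a unit.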

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski