Amino acid dipepetide frequency for Mamestra configurata nucleopolyhedrovirus A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.664AlaAla: 4.664 ± 0.435
1.145AlaCys: 1.145 ± 0.167
3.625AlaAsp: 3.625 ± 0.284
2.735AlaGlu: 2.735 ± 0.286
2.099AlaPhe: 2.099 ± 0.226
2.268AlaGly: 2.268 ± 0.246
1.442AlaHis: 1.442 ± 0.202
3.583AlaIle: 3.583 ± 0.291
3.243AlaLys: 3.243 ± 0.292
5.448AlaLeu: 5.448 ± 0.33
1.399AlaMet: 1.399 ± 0.186
3.667AlaAsn: 3.667 ± 0.275
2.183AlaPro: 2.183 ± 0.219
2.332AlaGln: 2.332 ± 0.24
2.586AlaArg: 2.586 ± 0.258
3.243AlaSer: 3.243 ± 0.269
3.18AlaThr: 3.18 ± 0.249
4.346AlaVal: 4.346 ± 0.37
0.424AlaTrp: 0.424 ± 0.121
2.141AlaTyr: 2.141 ± 0.221
0.0AlaXaa: 0.0 ± 0.0
Cys
2.099CysAla: 2.099 ± 0.213
0.636CysCys: 0.636 ± 0.136
1.42CysAsp: 1.42 ± 0.178
1.166CysGlu: 1.166 ± 0.178
0.933CysPhe: 0.933 ± 0.147
1.102CysGly: 1.102 ± 0.172
0.488CysHis: 0.488 ± 0.084
1.569CysIle: 1.569 ± 0.185
1.781CysLys: 1.781 ± 0.212
1.929CysLeu: 1.929 ± 0.187
0.594CysMet: 0.594 ± 0.124
1.696CysAsn: 1.696 ± 0.195
1.251CysPro: 1.251 ± 0.199
0.89CysGln: 0.89 ± 0.157
1.124CysArg: 1.124 ± 0.186
1.569CysSer: 1.569 ± 0.154
1.336CysThr: 1.336 ± 0.187
1.993CysVal: 1.993 ± 0.204
0.17CysTrp: 0.17 ± 0.066
0.933CysTyr: 0.933 ± 0.15
0.0CysXaa: 0.0 ± 0.0
Asp
3.583AspAla: 3.583 ± 0.283
1.738AspCys: 1.738 ± 0.219
5.575AspAsp: 5.575 ± 0.488
4.579AspGlu: 4.579 ± 0.346
2.459AspPhe: 2.459 ± 0.295
2.692AspGly: 2.692 ± 0.269
1.484AspHis: 1.484 ± 0.178
4.303AspIle: 4.303 ± 0.338
4.452AspLys: 4.452 ± 0.269
5.385AspLeu: 5.385 ± 0.367
1.802AspMet: 1.802 ± 0.201
4.812AspAsn: 4.812 ± 0.26
1.59AspPro: 1.59 ± 0.183
1.76AspGln: 1.76 ± 0.232
2.925AspArg: 2.925 ± 0.22
3.519AspSer: 3.519 ± 0.282
3.667AspThr: 3.667 ± 0.296
3.858AspVal: 3.858 ± 0.275
0.551AspTrp: 0.551 ± 0.101
3.561AspTyr: 3.561 ± 0.248
0.0AspXaa: 0.0 ± 0.0
Glu
2.692GluAla: 2.692 ± 0.31
1.632GluCys: 1.632 ± 0.205
2.629GluAsp: 2.629 ± 0.231
4.028GluGlu: 4.028 ± 0.709
2.862GluPhe: 2.862 ± 0.27
1.654GluGly: 1.654 ± 0.223
1.293GluHis: 1.293 ± 0.171
3.561GluIle: 3.561 ± 0.28
3.392GluLys: 3.392 ± 0.26
5.088GluLeu: 5.088 ± 0.359
1.59GluMet: 1.59 ± 0.234
3.434GluAsn: 3.434 ± 0.28
2.035GluPro: 2.035 ± 0.212
2.48GluGln: 2.48 ± 0.292
2.925GluArg: 2.925 ± 0.24
3.307GluSer: 3.307 ± 0.285
3.54GluThr: 3.54 ± 0.313
2.374GluVal: 2.374 ± 0.239
0.424GluTrp: 0.424 ± 0.092
2.692GluTyr: 2.692 ± 0.302
0.0GluXaa: 0.0 ± 0.0
Phe
2.417PheAla: 2.417 ± 0.207
1.23PheCys: 1.23 ± 0.187
4.197PheAsp: 4.197 ± 0.308
2.586PheGlu: 2.586 ± 0.226
1.929PhePhe: 1.929 ± 0.281
1.866PheGly: 1.866 ± 0.227
0.954PheHis: 0.954 ± 0.141
3.201PheIle: 3.201 ± 0.232
3.243PheLys: 3.243 ± 0.259
3.71PheLeu: 3.71 ± 0.271
1.378PheMet: 1.378 ± 0.155
3.392PheAsn: 3.392 ± 0.283
1.102PhePro: 1.102 ± 0.176
1.569PheGln: 1.569 ± 0.184
1.696PheArg: 1.696 ± 0.233
2.289PheSer: 2.289 ± 0.276
2.353PheThr: 2.353 ± 0.223
4.643PheVal: 4.643 ± 0.327
0.233PheTrp: 0.233 ± 0.068
1.993PheTyr: 1.993 ± 0.233
0.0PheXaa: 0.0 ± 0.0
Gly
1.972GlyAla: 1.972 ± 0.241
0.742GlyCys: 0.742 ± 0.13
2.544GlyAsp: 2.544 ± 0.266
1.569GlyGlu: 1.569 ± 0.189
1.484GlyPhe: 1.484 ± 0.205
2.162GlyGly: 2.162 ± 0.251
0.848GlyHis: 0.848 ± 0.13
2.311GlyIle: 2.311 ± 0.249
1.929GlyLys: 1.929 ± 0.199
2.883GlyLeu: 2.883 ± 0.273
0.975GlyMet: 0.975 ± 0.168
2.289GlyAsn: 2.289 ± 0.279
0.954GlyPro: 0.954 ± 0.142
1.23GlyGln: 1.23 ± 0.192
1.654GlyArg: 1.654 ± 0.23
2.162GlySer: 2.162 ± 0.191
2.311GlyThr: 2.311 ± 0.278
2.989GlyVal: 2.989 ± 0.319
0.424GlyTrp: 0.424 ± 0.093
1.632GlyTyr: 1.632 ± 0.223
0.0GlyXaa: 0.0 ± 0.0
His
1.505HisAla: 1.505 ± 0.17
0.424HisCys: 0.424 ± 0.078
1.632HisAsp: 1.632 ± 0.2
1.336HisGlu: 1.336 ± 0.167
1.039HisPhe: 1.039 ± 0.149
0.869HisGly: 0.869 ± 0.151
0.763HisHis: 0.763 ± 0.132
1.208HisIle: 1.208 ± 0.175
1.76HisLys: 1.76 ± 0.186
1.993HisLeu: 1.993 ± 0.213
0.678HisMet: 0.678 ± 0.141
2.099HisAsn: 2.099 ± 0.237
0.89HisPro: 0.89 ± 0.145
0.933HisGln: 0.933 ± 0.136
1.124HisArg: 1.124 ± 0.142
1.59HisSer: 1.59 ± 0.195
1.187HisThr: 1.187 ± 0.167
2.162HisVal: 2.162 ± 0.236
0.254HisTrp: 0.254 ± 0.069
1.484HisTyr: 1.484 ± 0.184
0.0HisXaa: 0.0 ± 0.0
Ile
3.731IleAla: 3.731 ± 0.318
1.272IleCys: 1.272 ± 0.193
5.385IleAsp: 5.385 ± 0.353
4.049IleGlu: 4.049 ± 0.309
3.031IlePhe: 3.031 ± 0.242
1.95IleGly: 1.95 ± 0.222
1.378IleHis: 1.378 ± 0.166
4.303IleIle: 4.303 ± 0.33
5.724IleLys: 5.724 ± 0.417
5.13IleLeu: 5.13 ± 0.364
2.205IleMet: 2.205 ± 0.207
5.999IleAsn: 5.999 ± 0.357
2.12IlePro: 2.12 ± 0.231
2.056IleGln: 2.056 ± 0.247
2.586IleArg: 2.586 ± 0.2
3.689IleSer: 3.689 ± 0.237
3.583IleThr: 3.583 ± 0.306
5.491IleVal: 5.491 ± 0.425
0.382IleTrp: 0.382 ± 0.08
2.735IleTyr: 2.735 ± 0.275
0.0IleXaa: 0.0 ± 0.0
Lys
2.141LysAla: 2.141 ± 0.231
2.099LysCys: 2.099 ± 0.221
2.989LysAsp: 2.989 ± 0.385
2.819LysGlu: 2.819 ± 0.334
3.434LysPhe: 3.434 ± 0.231
1.908LysGly: 1.908 ± 0.207
2.226LysHis: 2.226 ± 0.218
4.77LysIle: 4.77 ± 0.334
4.727LysLys: 4.727 ± 0.467
6.996LysLeu: 6.996 ± 0.501
2.162LysMet: 2.162 ± 0.237
4.918LysAsn: 4.918 ± 0.414
2.671LysPro: 2.671 ± 0.27
2.713LysGln: 2.713 ± 0.234
4.155LysArg: 4.155 ± 0.292
3.773LysSer: 3.773 ± 0.289
3.858LysThr: 3.858 ± 0.299
3.413LysVal: 3.413 ± 0.265
0.572LysTrp: 0.572 ± 0.119
3.71LysTyr: 3.71 ± 0.295
0.0LysXaa: 0.0 ± 0.0
Leu
5.173LeuAla: 5.173 ± 0.319
2.12LeuCys: 2.12 ± 0.254
4.918LeuAsp: 4.918 ± 0.25
4.452LeuGlu: 4.452 ± 0.349
4.537LeuPhe: 4.537 ± 0.362
2.586LeuGly: 2.586 ± 0.294
2.183LeuHis: 2.183 ± 0.234
5.766LeuIle: 5.766 ± 0.435
6.339LeuLys: 6.339 ± 0.369
9.073LeuLeu: 9.073 ± 0.471
2.332LeuMet: 2.332 ± 0.211
6.275LeuAsn: 6.275 ± 0.386
3.604LeuPro: 3.604 ± 0.253
4.494LeuGln: 4.494 ± 0.398
4.982LeuArg: 4.982 ± 0.33
5.491LeuSer: 5.491 ± 0.389
5.173LeuThr: 5.173 ± 0.384
5.469LeuVal: 5.469 ± 0.398
0.975LeuTrp: 0.975 ± 0.118
4.643LeuTyr: 4.643 ± 0.312
0.0LeuXaa: 0.0 ± 0.0
Met
1.526MetAla: 1.526 ± 0.187
0.89MetCys: 0.89 ± 0.135
1.124MetAsp: 1.124 ± 0.153
1.187MetGlu: 1.187 ± 0.157
1.611MetPhe: 1.611 ± 0.18
1.06MetGly: 1.06 ± 0.19
0.954MetHis: 0.954 ± 0.174
1.866MetIle: 1.866 ± 0.201
1.314MetLys: 1.314 ± 0.22
2.374MetLeu: 2.374 ± 0.227
0.827MetMet: 0.827 ± 0.133
1.717MetAsn: 1.717 ± 0.206
1.166MetPro: 1.166 ± 0.137
1.399MetGln: 1.399 ± 0.174
1.378MetArg: 1.378 ± 0.191
2.395MetSer: 2.395 ± 0.24
1.42MetThr: 1.42 ± 0.187
1.399MetVal: 1.399 ± 0.21
0.254MetTrp: 0.254 ± 0.07
1.632MetTyr: 1.632 ± 0.17
0.0MetXaa: 0.0 ± 0.0
Asn
4.176AsnAla: 4.176 ± 0.26
1.314AsnCys: 1.314 ± 0.191
5.194AsnAsp: 5.194 ± 0.328
4.749AsnGlu: 4.749 ± 0.384
3.222AsnPhe: 3.222 ± 0.268
2.862AsnGly: 2.862 ± 0.251
1.187AsnHis: 1.187 ± 0.136
5.491AsnIle: 5.491 ± 0.337
5.342AsnLys: 5.342 ± 0.386
5.13AsnLeu: 5.13 ± 0.32
1.866AsnMet: 1.866 ± 0.181
6.317AsnAsn: 6.317 ± 0.491
1.887AsnPro: 1.887 ± 0.196
1.738AsnGln: 1.738 ± 0.193
3.434AsnArg: 3.434 ± 0.267
4.579AsnSer: 4.579 ± 0.31
4.621AsnThr: 4.621 ± 0.461
6.021AsnVal: 6.021 ± 0.389
0.318AsnTrp: 0.318 ± 0.09
3.54AsnTyr: 3.54 ± 0.313
0.0AsnXaa: 0.0 ± 0.0
Pro
2.014ProAla: 2.014 ± 0.279
0.721ProCys: 0.721 ± 0.145
2.183ProAsp: 2.183 ± 0.228
1.929ProGlu: 1.929 ± 0.179
1.42ProPhe: 1.42 ± 0.165
1.166ProGly: 1.166 ± 0.2
0.848ProHis: 0.848 ± 0.136
2.395ProIle: 2.395 ± 0.272
1.526ProLys: 1.526 ± 0.187
3.328ProLeu: 3.328 ± 0.271
0.7ProMet: 0.7 ± 0.135
2.078ProAsn: 2.078 ± 0.202
2.141ProPro: 2.141 ± 0.559
1.569ProGln: 1.569 ± 0.206
1.654ProArg: 1.654 ± 0.203
2.395ProSer: 2.395 ± 0.245
2.883ProThr: 2.883 ± 0.301
2.904ProVal: 2.904 ± 0.274
0.339ProTrp: 0.339 ± 0.078
1.526ProTyr: 1.526 ± 0.23
0.0ProXaa: 0.0 ± 0.0
Gln
1.76GlnAla: 1.76 ± 0.274
1.081GlnCys: 1.081 ± 0.151
1.696GlnAsp: 1.696 ± 0.179
1.866GlnGlu: 1.866 ± 0.205
2.12GlnPhe: 2.12 ± 0.242
0.572GlnGly: 0.572 ± 0.112
1.314GlnHis: 1.314 ± 0.193
2.501GlnIle: 2.501 ± 0.221
2.374GlnLys: 2.374 ± 0.312
4.833GlnLeu: 4.833 ± 0.36
1.208GlnMet: 1.208 ± 0.185
2.289GlnAsn: 2.289 ± 0.283
1.378GlnPro: 1.378 ± 0.213
2.501GlnGln: 2.501 ± 0.334
2.289GlnArg: 2.289 ± 0.205
2.289GlnSer: 2.289 ± 0.216
2.014GlnThr: 2.014 ± 0.256
2.162GlnVal: 2.162 ± 0.211
0.191GlnTrp: 0.191 ± 0.059
1.972GlnTyr: 1.972 ± 0.222
0.0GlnXaa: 0.0 ± 0.0
Arg
2.735ArgAla: 2.735 ± 0.253
1.251ArgCys: 1.251 ± 0.181
3.18ArgAsp: 3.18 ± 0.267
2.078ArgGlu: 2.078 ± 0.207
1.993ArgPhe: 1.993 ± 0.235
1.484ArgGly: 1.484 ± 0.196
1.717ArgHis: 1.717 ± 0.205
3.222ArgIle: 3.222 ± 0.292
3.031ArgLys: 3.031 ± 0.305
5.003ArgLeu: 5.003 ± 0.266
1.314ArgMet: 1.314 ± 0.168
3.392ArgAsn: 3.392 ± 0.298
1.972ArgPro: 1.972 ± 0.208
2.332ArgGln: 2.332 ± 0.253
3.413ArgArg: 3.413 ± 0.41
2.989ArgSer: 2.989 ± 0.327
2.798ArgThr: 2.798 ± 0.253
3.413ArgVal: 3.413 ± 0.244
0.466ArgTrp: 0.466 ± 0.118
2.268ArgTyr: 2.268 ± 0.208
0.0ArgXaa: 0.0 ± 0.0
Ser
3.328SerAla: 3.328 ± 0.286
1.272SerCys: 1.272 ± 0.143
4.113SerAsp: 4.113 ± 0.336
2.777SerGlu: 2.777 ± 0.263
2.904SerPhe: 2.904 ± 0.22
2.607SerGly: 2.607 ± 0.278
1.442SerHis: 1.442 ± 0.184
3.689SerIle: 3.689 ± 0.315
3.689SerLys: 3.689 ± 0.344
5.533SerLeu: 5.533 ± 0.337
1.378SerMet: 1.378 ± 0.177
4.621SerAsn: 4.621 ± 0.336
1.929SerPro: 1.929 ± 0.261
2.12SerGln: 2.12 ± 0.223
2.777SerArg: 2.777 ± 0.268
4.833SerSer: 4.833 ± 0.491
4.091SerThr: 4.091 ± 0.315
4.664SerVal: 4.664 ± 0.299
0.445SerTrp: 0.445 ± 0.091
2.777SerTyr: 2.777 ± 0.243
0.0SerXaa: 0.0 ± 0.0
Thr
3.159ThrAla: 3.159 ± 0.276
1.314ThrCys: 1.314 ± 0.212
3.583ThrAsp: 3.583 ± 0.263
2.65ThrGlu: 2.65 ± 0.222
3.031ThrPhe: 3.031 ± 0.272
1.972ThrGly: 1.972 ± 0.196
1.23ThrHis: 1.23 ± 0.156
5.13ThrIle: 5.13 ± 0.407
3.455ThrLys: 3.455 ± 0.312
5.915ThrLeu: 5.915 ± 0.35
2.014ThrMet: 2.014 ± 0.181
4.833ThrAsn: 4.833 ± 0.408
2.438ThrPro: 2.438 ± 0.25
2.226ThrGln: 2.226 ± 0.235
2.586ThrArg: 2.586 ± 0.255
3.222ThrSer: 3.222 ± 0.275
4.155ThrThr: 4.155 ± 0.358
4.219ThrVal: 4.219 ± 0.304
0.488ThrTrp: 0.488 ± 0.103
1.95ThrTyr: 1.95 ± 0.269
0.0ThrXaa: 0.0 ± 0.0
Val
4.176ValAla: 4.176 ± 0.323
2.099ValCys: 2.099 ± 0.3
4.409ValAsp: 4.409 ± 0.331
3.922ValGlu: 3.922 ± 0.285
3.371ValPhe: 3.371 ± 0.277
2.332ValGly: 2.332 ± 0.261
1.802ValHis: 1.802 ± 0.219
4.664ValIle: 4.664 ± 0.341
4.303ValLys: 4.303 ± 0.278
6.254ValLeu: 6.254 ± 0.41
1.548ValMet: 1.548 ± 0.167
4.812ValAsn: 4.812 ± 0.327
3.265ValPro: 3.265 ± 0.354
2.438ValGln: 2.438 ± 0.236
3.752ValArg: 3.752 ± 0.315
4.579ValSer: 4.579 ± 0.288
4.155ValThr: 4.155 ± 0.305
4.961ValVal: 4.961 ± 0.413
0.403ValTrp: 0.403 ± 0.106
3.752ValTyr: 3.752 ± 0.325
0.0ValXaa: 0.0 ± 0.0
Trp
0.445TrpAla: 0.445 ± 0.107
0.254TrpCys: 0.254 ± 0.067
0.318TrpAsp: 0.318 ± 0.091
0.445TrpGlu: 0.445 ± 0.102
0.254TrpPhe: 0.254 ± 0.076
0.233TrpGly: 0.233 ± 0.068
0.297TrpHis: 0.297 ± 0.076
0.36TrpIle: 0.36 ± 0.087
0.424TrpLys: 0.424 ± 0.103
0.7TrpLeu: 0.7 ± 0.122
0.17TrpMet: 0.17 ± 0.057
0.53TrpAsn: 0.53 ± 0.113
0.318TrpPro: 0.318 ± 0.084
0.382TrpGln: 0.382 ± 0.091
0.572TrpArg: 0.572 ± 0.115
0.615TrpSer: 0.615 ± 0.133
0.551TrpThr: 0.551 ± 0.103
0.339TrpVal: 0.339 ± 0.085
0.106TrpTrp: 0.106 ± 0.044
0.424TrpTyr: 0.424 ± 0.09
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.459TyrAla: 2.459 ± 0.212
1.442TyrCys: 1.442 ± 0.185
3.583TyrAsp: 3.583 ± 0.258
2.713TyrGlu: 2.713 ± 0.239
2.247TyrPhe: 2.247 ± 0.256
1.717TyrGly: 1.717 ± 0.221
1.081TyrHis: 1.081 ± 0.144
2.904TyrIle: 2.904 ± 0.314
4.007TyrLys: 4.007 ± 0.291
3.901TyrLeu: 3.901 ± 0.358
1.336TyrMet: 1.336 ± 0.181
3.816TyrAsn: 3.816 ± 0.366
0.827TyrPro: 0.827 ± 0.133
1.251TyrGln: 1.251 ± 0.159
2.438TyrArg: 2.438 ± 0.212
2.459TyrSer: 2.459 ± 0.271
2.692TyrThr: 2.692 ± 0.245
4.176TyrVal: 4.176 ± 0.332
0.318TyrTrp: 0.318 ± 0.079
2.904TyrTyr: 2.904 ± 0.269
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 168 proteins (47173 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski