Amino acid dipepetide frequency for Heliothis virescens ascovirus 3e (HvAV-3e)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.536AlaAla: 4.536 ± 0.28
1.699AlaCys: 1.699 ± 0.172
3.584AlaAsp: 3.584 ± 0.236
3.416AlaGlu: 3.416 ± 0.322
1.885AlaPhe: 1.885 ± 0.169
2.818AlaGly: 2.818 ± 0.205
1.512AlaHis: 1.512 ± 0.165
3.528AlaIle: 3.528 ± 0.229
3.602AlaLys: 3.602 ± 0.242
5.973AlaLeu: 5.973 ± 0.43
1.885AlaMet: 1.885 ± 0.174
3.341AlaAsn: 3.341 ± 0.301
2.24AlaPro: 2.24 ± 0.221
1.643AlaGln: 1.643 ± 0.195
4.61AlaArg: 4.61 ± 0.359
4.853AlaSer: 4.853 ± 0.357
4.33AlaThr: 4.33 ± 0.293
5.17AlaVal: 5.17 ± 0.334
0.616AlaTrp: 0.616 ± 0.107
2.576AlaTyr: 2.576 ± 0.214
0.0AlaXaa: 0.0 ± 0.0
Cys
1.997CysAla: 1.997 ± 0.16
0.784CysCys: 0.784 ± 0.104
2.501CysAsp: 2.501 ± 0.223
1.381CysGlu: 1.381 ± 0.15
0.485CysPhe: 0.485 ± 0.104
1.736CysGly: 1.736 ± 0.231
0.56CysHis: 0.56 ± 0.106
1.307CysIle: 1.307 ± 0.157
1.419CysLys: 1.419 ± 0.173
1.978CysLeu: 1.978 ± 0.194
0.691CysMet: 0.691 ± 0.106
1.4CysAsn: 1.4 ± 0.152
0.933CysPro: 0.933 ± 0.136
0.672CysGln: 0.672 ± 0.11
2.202CysArg: 2.202 ± 0.253
1.978CysSer: 1.978 ± 0.221
1.661CysThr: 1.661 ± 0.176
2.128CysVal: 2.128 ± 0.236
0.243CysTrp: 0.243 ± 0.06
0.784CysTyr: 0.784 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
4.424AspAla: 4.424 ± 0.366
1.549AspCys: 1.549 ± 0.171
6.402AspAsp: 6.402 ± 0.384
4.61AspGlu: 4.61 ± 0.344
2.165AspPhe: 2.165 ± 0.2
4.2AspGly: 4.2 ± 0.222
1.419AspHis: 1.419 ± 0.15
3.621AspIle: 3.621 ± 0.276
2.874AspLys: 2.874 ± 0.231
4.965AspLeu: 4.965 ± 0.311
2.109AspMet: 2.109 ± 0.217
2.93AspAsn: 2.93 ± 0.271
2.072AspPro: 2.072 ± 0.179
1.288AspGln: 1.288 ± 0.146
4.386AspArg: 4.386 ± 0.301
4.088AspSer: 4.088 ± 0.247
4.088AspThr: 4.088 ± 0.276
5.6AspVal: 5.6 ± 0.333
0.821AspTrp: 0.821 ± 0.11
2.333AspTyr: 2.333 ± 0.214
0.0AspXaa: 0.0 ± 0.0
Glu
3.098GluAla: 3.098 ± 0.238
1.661GluCys: 1.661 ± 0.187
3.042GluAsp: 3.042 ± 0.235
2.725GluGlu: 2.725 ± 0.283
2.09GluPhe: 2.09 ± 0.192
1.549GluGly: 1.549 ± 0.159
1.755GluHis: 1.755 ± 0.208
2.968GluIle: 2.968 ± 0.238
2.24GluLys: 2.24 ± 0.227
5.73GluLeu: 5.73 ± 0.382
1.717GluMet: 1.717 ± 0.198
2.52GluAsn: 2.52 ± 0.239
1.419GluPro: 1.419 ± 0.19
1.829GluGln: 1.829 ± 0.184
4.069GluArg: 4.069 ± 0.333
4.088GluSer: 4.088 ± 0.293
3.136GluThr: 3.136 ± 0.246
2.874GluVal: 2.874 ± 0.232
0.803GluTrp: 0.803 ± 0.123
2.37GluTyr: 2.37 ± 0.221
0.0GluXaa: 0.0 ± 0.0
Phe
2.426PheAla: 2.426 ± 0.174
0.859PheCys: 0.859 ± 0.131
2.688PheAsp: 2.688 ± 0.27
2.37PheGlu: 2.37 ± 0.207
0.989PhePhe: 0.989 ± 0.138
2.221PheGly: 2.221 ± 0.218
0.803PheHis: 0.803 ± 0.122
1.941PheIle: 1.941 ± 0.193
2.221PheLys: 2.221 ± 0.215
2.352PheLeu: 2.352 ± 0.192
0.859PheMet: 0.859 ± 0.125
1.941PheAsn: 1.941 ± 0.148
0.933PhePro: 0.933 ± 0.143
1.176PheGln: 1.176 ± 0.152
1.96PheArg: 1.96 ± 0.205
1.867PheSer: 1.867 ± 0.199
1.941PheThr: 1.941 ± 0.182
3.472PheVal: 3.472 ± 0.241
0.243PheTrp: 0.243 ± 0.065
1.12PheTyr: 1.12 ± 0.151
0.0PheXaa: 0.0 ± 0.0
Gly
3.098GlyAla: 3.098 ± 0.248
1.083GlyCys: 1.083 ± 0.14
3.994GlyAsp: 3.994 ± 0.303
2.314GlyGlu: 2.314 ± 0.206
1.587GlyPhe: 1.587 ± 0.194
3.154GlyGly: 3.154 ± 0.269
1.176GlyHis: 1.176 ± 0.151
2.352GlyIle: 2.352 ± 0.197
1.978GlyLys: 1.978 ± 0.163
3.602GlyLeu: 3.602 ± 0.304
1.083GlyMet: 1.083 ± 0.136
1.978GlyAsn: 1.978 ± 0.234
1.12GlyPro: 1.12 ± 0.156
1.045GlyGln: 1.045 ± 0.15
3.154GlyArg: 3.154 ± 0.25
3.92GlySer: 3.92 ± 0.291
3.024GlyThr: 3.024 ± 0.255
5.73GlyVal: 5.73 ± 0.428
0.597GlyTrp: 0.597 ± 0.101
1.904GlyTyr: 1.904 ± 0.172
0.0GlyXaa: 0.0 ± 0.0
His
1.699HisAla: 1.699 ± 0.159
0.635HisCys: 0.635 ± 0.088
1.456HisAsp: 1.456 ± 0.175
1.549HisGlu: 1.549 ± 0.147
0.952HisPhe: 0.952 ± 0.153
1.475HisGly: 1.475 ± 0.179
0.709HisHis: 0.709 ± 0.126
1.344HisIle: 1.344 ± 0.185
1.139HisLys: 1.139 ± 0.156
1.643HisLeu: 1.643 ± 0.189
0.541HisMet: 0.541 ± 0.099
1.288HisAsn: 1.288 ± 0.151
0.747HisPro: 0.747 ± 0.13
0.691HisGln: 0.691 ± 0.105
1.885HisArg: 1.885 ± 0.202
2.072HisSer: 2.072 ± 0.221
1.792HisThr: 1.792 ± 0.168
1.904HisVal: 1.904 ± 0.206
0.28HisTrp: 0.28 ± 0.061
1.064HisTyr: 1.064 ± 0.13
0.0HisXaa: 0.0 ± 0.0
Ile
3.677IleAla: 3.677 ± 0.316
1.381IleCys: 1.381 ± 0.182
3.864IleAsp: 3.864 ± 0.288
3.136IleGlu: 3.136 ± 0.253
1.848IlePhe: 1.848 ± 0.207
2.408IleGly: 2.408 ± 0.221
1.288IleHis: 1.288 ± 0.164
2.557IleIle: 2.557 ± 0.248
2.594IleLys: 2.594 ± 0.215
3.826IleLeu: 3.826 ± 0.268
1.288IleMet: 1.288 ± 0.179
3.173IleAsn: 3.173 ± 0.197
2.202IlePro: 2.202 ± 0.225
1.587IleGln: 1.587 ± 0.161
4.013IleArg: 4.013 ± 0.245
4.536IleSer: 4.536 ± 0.285
4.517IleThr: 4.517 ± 0.35
4.536IleVal: 4.536 ± 0.304
0.261IleTrp: 0.261 ± 0.07
1.475IleTyr: 1.475 ± 0.16
0.0IleXaa: 0.0 ± 0.0
Lys
2.538LysAla: 2.538 ± 0.205
1.829LysCys: 1.829 ± 0.177
2.464LysAsp: 2.464 ± 0.258
2.24LysGlu: 2.24 ± 0.222
2.613LysPhe: 2.613 ± 0.183
1.475LysGly: 1.475 ± 0.17
1.531LysHis: 1.531 ± 0.142
2.389LysIle: 2.389 ± 0.251
2.93LysLys: 2.93 ± 0.25
5.17LysLeu: 5.17 ± 0.323
1.437LysMet: 1.437 ± 0.18
2.258LysAsn: 2.258 ± 0.208
2.053LysPro: 2.053 ± 0.246
1.699LysGln: 1.699 ± 0.182
4.946LysArg: 4.946 ± 0.288
4.778LysSer: 4.778 ± 0.326
3.472LysThr: 3.472 ± 0.252
2.986LysVal: 2.986 ± 0.282
0.56LysTrp: 0.56 ± 0.107
2.856LysTyr: 2.856 ± 0.185
0.0LysXaa: 0.0 ± 0.0
Leu
4.946LeuAla: 4.946 ± 0.312
2.389LeuCys: 2.389 ± 0.198
4.872LeuAsp: 4.872 ± 0.313
3.882LeuGlu: 3.882 ± 0.275
2.762LeuPhe: 2.762 ± 0.256
3.154LeuGly: 3.154 ± 0.236
2.333LeuHis: 2.333 ± 0.224
3.733LeuIle: 3.733 ± 0.237
5.488LeuLys: 5.488 ± 0.257
7.522LeuLeu: 7.522 ± 0.392
2.762LeuMet: 2.762 ± 0.208
4.722LeuAsn: 4.722 ± 0.351
3.752LeuPro: 3.752 ± 0.259
3.136LeuGln: 3.136 ± 0.353
6.869LeuArg: 6.869 ± 0.469
6.514LeuSer: 6.514 ± 0.391
4.741LeuThr: 4.741 ± 0.281
5.04LeuVal: 5.04 ± 0.278
0.877LeuTrp: 0.877 ± 0.109
3.21LeuTyr: 3.21 ± 0.269
0.0LeuXaa: 0.0 ± 0.0
Met
1.997MetAla: 1.997 ± 0.211
0.84MetCys: 0.84 ± 0.15
1.624MetAsp: 1.624 ± 0.19
0.971MetGlu: 0.971 ± 0.134
1.213MetPhe: 1.213 ± 0.171
1.307MetGly: 1.307 ± 0.167
0.709MetHis: 0.709 ± 0.126
1.363MetIle: 1.363 ± 0.163
1.493MetLys: 1.493 ± 0.147
2.034MetLeu: 2.034 ± 0.186
1.027MetMet: 1.027 ± 0.133
1.848MetAsn: 1.848 ± 0.151
0.784MetPro: 0.784 ± 0.101
0.765MetGln: 0.765 ± 0.106
1.848MetArg: 1.848 ± 0.21
2.52MetSer: 2.52 ± 0.251
1.923MetThr: 1.923 ± 0.193
2.072MetVal: 2.072 ± 0.194
0.28MetTrp: 0.28 ± 0.064
1.68MetTyr: 1.68 ± 0.151
0.0MetXaa: 0.0 ± 0.0
Asn
3.994AsnAla: 3.994 ± 0.29
1.064AsnCys: 1.064 ± 0.108
4.088AsnAsp: 4.088 ± 0.303
3.341AsnGlu: 3.341 ± 0.264
1.736AsnPhe: 1.736 ± 0.162
3.584AsnGly: 3.584 ± 0.275
1.232AsnHis: 1.232 ± 0.143
3.397AsnIle: 3.397 ± 0.248
2.65AsnLys: 2.65 ± 0.249
3.36AsnLeu: 3.36 ± 0.257
1.437AsnMet: 1.437 ± 0.191
2.949AsnAsn: 2.949 ± 0.266
1.661AsnPro: 1.661 ± 0.173
0.952AsnGln: 0.952 ± 0.148
3.528AsnArg: 3.528 ± 0.295
4.237AsnSer: 4.237 ± 0.322
3.248AsnThr: 3.248 ± 0.274
4.554AsnVal: 4.554 ± 0.287
0.467AsnTrp: 0.467 ± 0.092
2.632AsnTyr: 2.632 ± 0.196
0.0AsnXaa: 0.0 ± 0.0
Pro
2.408ProAla: 2.408 ± 0.267
0.915ProCys: 0.915 ± 0.158
1.848ProAsp: 1.848 ± 0.172
1.755ProGlu: 1.755 ± 0.197
1.027ProPhe: 1.027 ± 0.11
1.605ProGly: 1.605 ± 0.161
1.045ProHis: 1.045 ± 0.125
2.184ProIle: 2.184 ± 0.218
1.885ProLys: 1.885 ± 0.163
2.781ProLeu: 2.781 ± 0.182
1.251ProMet: 1.251 ± 0.146
2.146ProAsn: 2.146 ± 0.175
2.352ProPro: 2.352 ± 0.263
1.307ProGln: 1.307 ± 0.14
1.941ProArg: 1.941 ± 0.196
3.882ProSer: 3.882 ± 0.336
2.538ProThr: 2.538 ± 0.221
2.688ProVal: 2.688 ± 0.218
0.448ProTrp: 0.448 ± 0.074
1.325ProTyr: 1.325 ± 0.166
0.0ProXaa: 0.0 ± 0.0
Gln
1.363GlnAla: 1.363 ± 0.179
0.896GlnCys: 0.896 ± 0.14
1.045GlnAsp: 1.045 ± 0.117
1.176GlnGlu: 1.176 ± 0.135
1.139GlnPhe: 1.139 ± 0.146
0.728GlnGly: 0.728 ± 0.124
0.728GlnHis: 0.728 ± 0.11
1.792GlnIle: 1.792 ± 0.184
1.288GlnLys: 1.288 ± 0.137
3.658GlnLeu: 3.658 ± 0.351
0.896GlnMet: 0.896 ± 0.139
1.363GlnAsn: 1.363 ± 0.162
1.419GlnPro: 1.419 ± 0.167
1.101GlnGln: 1.101 ± 0.187
2.613GlnArg: 2.613 ± 0.27
2.24GlnSer: 2.24 ± 0.247
1.587GlnThr: 1.587 ± 0.146
1.493GlnVal: 1.493 ± 0.183
0.149GlnTrp: 0.149 ± 0.057
1.475GlnTyr: 1.475 ± 0.163
0.0GlnXaa: 0.0 ± 0.0
Arg
3.565ArgAla: 3.565 ± 0.281
1.923ArgCys: 1.923 ± 0.195
4.685ArgAsp: 4.685 ± 0.334
3.322ArgGlu: 3.322 ± 0.259
2.482ArgPhe: 2.482 ± 0.23
2.8ArgGly: 2.8 ± 0.245
2.034ArgHis: 2.034 ± 0.191
4.517ArgIle: 4.517 ± 0.301
3.621ArgLys: 3.621 ± 0.354
6.738ArgLeu: 6.738 ± 0.499
2.072ArgMet: 2.072 ± 0.226
4.256ArgAsn: 4.256 ± 0.364
2.837ArgPro: 2.837 ± 0.22
2.445ArgGln: 2.445 ± 0.204
6.533ArgArg: 6.533 ± 0.602
6.439ArgSer: 6.439 ± 0.805
4.256ArgThr: 4.256 ± 0.248
4.853ArgVal: 4.853 ± 0.324
0.597ArgTrp: 0.597 ± 0.098
2.968ArgTyr: 2.968 ± 0.243
0.0ArgXaa: 0.0 ± 0.0
Ser
4.685SerAla: 4.685 ± 0.292
2.202SerCys: 2.202 ± 0.262
5.152SerAsp: 5.152 ± 0.314
3.808SerGlu: 3.808 ± 0.302
2.333SerPhe: 2.333 ± 0.187
4.293SerGly: 4.293 ± 0.303
1.829SerHis: 1.829 ± 0.198
4.088SerIle: 4.088 ± 0.227
4.349SerLys: 4.349 ± 0.25
6.421SerLeu: 6.421 ± 0.347
1.923SerMet: 1.923 ± 0.195
4.946SerAsn: 4.946 ± 0.264
3.584SerPro: 3.584 ± 0.622
1.717SerGln: 1.717 ± 0.164
5.786SerArg: 5.786 ± 0.671
6.738SerSer: 6.738 ± 0.449
5.394SerThr: 5.394 ± 0.329
7.279SerVal: 7.279 ± 0.415
0.579SerTrp: 0.579 ± 0.111
3.192SerTyr: 3.192 ± 0.207
0.0SerXaa: 0.0 ± 0.0
Thr
4.554ThrAla: 4.554 ± 0.313
1.643ThrCys: 1.643 ± 0.19
3.938ThrAsp: 3.938 ± 0.288
3.08ThrGlu: 3.08 ± 0.248
2.445ThrPhe: 2.445 ± 0.226
2.949ThrGly: 2.949 ± 0.273
1.251ThrHis: 1.251 ± 0.144
4.106ThrIle: 4.106 ± 0.257
3.546ThrLys: 3.546 ± 0.33
5.693ThrLeu: 5.693 ± 0.341
1.811ThrMet: 1.811 ± 0.166
3.546ThrAsn: 3.546 ± 0.206
2.725ThrPro: 2.725 ± 0.23
1.624ThrGln: 1.624 ± 0.169
3.77ThrArg: 3.77 ± 0.241
5.357ThrSer: 5.357 ± 0.279
4.853ThrThr: 4.853 ± 0.362
5.656ThrVal: 5.656 ± 0.386
0.747ThrTrp: 0.747 ± 0.133
2.52ThrTyr: 2.52 ± 0.206
0.0ThrXaa: 0.0 ± 0.0
Val
5.6ValAla: 5.6 ± 0.338
2.24ValCys: 2.24 ± 0.236
4.89ValAsp: 4.89 ± 0.313
3.882ValGlu: 3.882 ± 0.254
2.576ValPhe: 2.576 ± 0.203
3.826ValGly: 3.826 ± 0.251
1.512ValHis: 1.512 ± 0.165
4.424ValIle: 4.424 ± 0.319
4.088ValLys: 4.088 ± 0.306
5.935ValLeu: 5.935 ± 0.418
2.053ValMet: 2.053 ± 0.238
4.536ValAsn: 4.536 ± 0.28
2.968ValPro: 2.968 ± 0.225
2.37ValGln: 2.37 ± 0.188
5.581ValArg: 5.581 ± 0.413
6.29ValSer: 6.29 ± 0.325
5.338ValThr: 5.338 ± 0.269
6.253ValVal: 6.253 ± 0.355
0.635ValTrp: 0.635 ± 0.093
3.136ValTyr: 3.136 ± 0.265
0.0ValXaa: 0.0 ± 0.0
Trp
0.485TrpAla: 0.485 ± 0.102
0.448TrpCys: 0.448 ± 0.094
0.355TrpAsp: 0.355 ± 0.067
0.261TrpGlu: 0.261 ± 0.061
0.392TrpPhe: 0.392 ± 0.095
0.392TrpGly: 0.392 ± 0.085
0.411TrpHis: 0.411 ± 0.092
0.709TrpIle: 0.709 ± 0.101
0.579TrpLys: 0.579 ± 0.11
1.045TrpLeu: 1.045 ± 0.142
0.392TrpMet: 0.392 ± 0.088
0.672TrpAsn: 0.672 ± 0.107
0.261TrpPro: 0.261 ± 0.064
0.224TrpGln: 0.224 ± 0.069
0.635TrpArg: 0.635 ± 0.116
0.747TrpSer: 0.747 ± 0.115
0.616TrpThr: 0.616 ± 0.093
0.392TrpVal: 0.392 ± 0.084
0.224TrpTrp: 0.224 ± 0.084
0.56TrpTyr: 0.56 ± 0.103
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.725TyrAla: 2.725 ± 0.247
0.803TyrCys: 0.803 ± 0.122
3.453TyrAsp: 3.453 ± 0.286
2.464TyrGlu: 2.464 ± 0.252
1.661TyrPhe: 1.661 ± 0.186
2.221TyrGly: 2.221 ± 0.237
1.027TyrHis: 1.027 ± 0.143
1.848TyrIle: 1.848 ± 0.191
2.24TyrLys: 2.24 ± 0.203
2.277TyrLeu: 2.277 ± 0.183
1.045TyrMet: 1.045 ± 0.156
2.296TyrAsn: 2.296 ± 0.156
1.269TyrPro: 1.269 ± 0.145
0.877TyrGln: 0.877 ± 0.127
2.613TyrArg: 2.613 ± 0.249
3.173TyrSer: 3.173 ± 0.236
3.36TyrThr: 3.36 ± 0.26
3.453TyrVal: 3.453 ± 0.252
0.373TyrTrp: 0.373 ± 0.079
1.437TyrTyr: 1.437 ± 0.154
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 178 proteins (53577 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski