Amino acid dipepetide frequency for Spodoptera exigua multiple nucleopolyhedrovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.992AlaAla: 4.992 ± 0.586
1.135AlaCys: 1.135 ± 0.196
3.605AlaAsp: 3.605 ± 0.382
2.799AlaGlu: 2.799 ± 0.305
2.496AlaPhe: 2.496 ± 0.27
2.168AlaGly: 2.168 ± 0.272
0.832AlaHis: 0.832 ± 0.13
3.505AlaIle: 3.505 ± 0.362
2.622AlaLys: 2.622 ± 0.272
4.816AlaLeu: 4.816 ± 0.462
1.412AlaMet: 1.412 ± 0.181
2.925AlaAsn: 2.925 ± 0.228
1.916AlaPro: 1.916 ± 0.192
1.79AlaGln: 1.79 ± 0.216
1.967AlaArg: 1.967 ± 0.249
3.076AlaSer: 3.076 ± 0.285
3.0AlaThr: 3.0 ± 0.295
4.059AlaVal: 4.059 ± 0.338
0.378AlaTrp: 0.378 ± 0.094
1.664AlaTyr: 1.664 ± 0.186
0.0AlaXaa: 0.0 ± 0.0
Cys
1.135CysAla: 1.135 ± 0.161
0.504CysCys: 0.504 ± 0.13
1.412CysAsp: 1.412 ± 0.202
0.908CysGlu: 0.908 ± 0.146
1.16CysPhe: 1.16 ± 0.204
1.084CysGly: 1.084 ± 0.2
0.328CysHis: 0.328 ± 0.078
1.538CysIle: 1.538 ± 0.189
1.74CysLys: 1.74 ± 0.235
1.714CysLeu: 1.714 ± 0.185
0.529CysMet: 0.529 ± 0.115
1.235CysAsn: 1.235 ± 0.186
1.135CysPro: 1.135 ± 0.228
0.656CysGln: 0.656 ± 0.148
1.412CysArg: 1.412 ± 0.171
1.538CysSer: 1.538 ± 0.244
0.983CysThr: 0.983 ± 0.131
2.395CysVal: 2.395 ± 0.247
0.202CysTrp: 0.202 ± 0.084
0.63CysTyr: 0.63 ± 0.156
0.0CysXaa: 0.0 ± 0.0
Asp
3.807AspAla: 3.807 ± 0.338
1.261AspCys: 1.261 ± 0.21
11.068AspAsp: 11.068 ± 1.446
5.572AspGlu: 5.572 ± 0.366
2.975AspPhe: 2.975 ± 0.227
3.202AspGly: 3.202 ± 0.308
1.538AspHis: 1.538 ± 0.164
4.463AspIle: 4.463 ± 0.347
4.387AspLys: 4.387 ± 0.342
5.269AspLeu: 5.269 ± 0.337
1.361AspMet: 1.361 ± 0.157
5.698AspAsn: 5.698 ± 0.567
1.538AspPro: 1.538 ± 0.18
1.538AspGln: 1.538 ± 0.211
3.454AspArg: 3.454 ± 0.276
3.832AspSer: 3.832 ± 0.316
3.429AspThr: 3.429 ± 0.285
4.74AspVal: 4.74 ± 0.347
0.529AspTrp: 0.529 ± 0.112
3.53AspTyr: 3.53 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
2.042GluAla: 2.042 ± 0.223
1.336GluCys: 1.336 ± 0.23
2.723GluAsp: 2.723 ± 0.237
2.244GluGlu: 2.244 ± 0.228
2.748GluPhe: 2.748 ± 0.252
1.311GluGly: 1.311 ± 0.202
1.513GluHis: 1.513 ± 0.196
3.782GluIle: 3.782 ± 0.321
3.58GluLys: 3.58 ± 0.265
4.664GluLeu: 4.664 ± 0.382
1.689GluMet: 1.689 ± 0.204
4.488GluAsn: 4.488 ± 0.37
1.689GluPro: 1.689 ± 0.291
1.714GluGln: 1.714 ± 0.234
3.908GluArg: 3.908 ± 0.348
3.328GluSer: 3.328 ± 0.294
3.252GluThr: 3.252 ± 0.291
1.967GluVal: 1.967 ± 0.206
0.378GluTrp: 0.378 ± 0.1
2.294GluTyr: 2.294 ± 0.245
0.0GluXaa: 0.0 ± 0.0
Phe
2.093PheAla: 2.093 ± 0.251
0.958PheCys: 0.958 ± 0.173
5.345PheAsp: 5.345 ± 0.384
3.152PheGlu: 3.152 ± 0.236
1.765PhePhe: 1.765 ± 0.26
1.916PheGly: 1.916 ± 0.207
0.756PheHis: 0.756 ± 0.16
3.278PheIle: 3.278 ± 0.277
3.429PheLys: 3.429 ± 0.279
3.605PheLeu: 3.605 ± 0.394
1.387PheMet: 1.387 ± 0.174
4.059PheAsn: 4.059 ± 0.325
1.235PhePro: 1.235 ± 0.174
1.664PheGln: 1.664 ± 0.197
1.992PheArg: 1.992 ± 0.197
2.849PheSer: 2.849 ± 0.222
2.32PheThr: 2.32 ± 0.271
4.034PheVal: 4.034 ± 0.365
0.202PheTrp: 0.202 ± 0.065
2.521PheTyr: 2.521 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
1.513GlyAla: 1.513 ± 0.228
0.504GlyCys: 0.504 ± 0.11
2.799GlyAsp: 2.799 ± 0.336
1.941GlyGlu: 1.941 ± 0.221
1.664GlyPhe: 1.664 ± 0.174
3.0GlyGly: 3.0 ± 0.392
0.782GlyHis: 0.782 ± 0.135
2.496GlyIle: 2.496 ± 0.265
2.042GlyLys: 2.042 ± 0.246
2.345GlyLeu: 2.345 ± 0.258
0.706GlyMet: 0.706 ± 0.155
2.471GlyAsn: 2.471 ± 0.317
0.958GlyPro: 0.958 ± 0.205
1.336GlyGln: 1.336 ± 0.175
1.79GlyArg: 1.79 ± 0.23
2.118GlySer: 2.118 ± 0.19
1.765GlyThr: 1.765 ± 0.245
2.723GlyVal: 2.723 ± 0.338
0.328GlyTrp: 0.328 ± 0.103
1.488GlyTyr: 1.488 ± 0.257
0.0GlyXaa: 0.0 ± 0.0
His
0.908HisAla: 0.908 ± 0.145
0.555HisCys: 0.555 ± 0.115
1.261HisAsp: 1.261 ± 0.189
0.983HisGlu: 0.983 ± 0.161
1.16HisPhe: 1.16 ± 0.148
0.933HisGly: 0.933 ± 0.146
1.689HisHis: 1.689 ± 0.396
1.387HisIle: 1.387 ± 0.217
1.261HisLys: 1.261 ± 0.15
2.143HisLeu: 2.143 ± 0.235
0.605HisMet: 0.605 ± 0.108
1.614HisAsn: 1.614 ± 0.22
0.882HisPro: 0.882 ± 0.18
0.882HisGln: 0.882 ± 0.176
1.109HisArg: 1.109 ± 0.183
1.261HisSer: 1.261 ± 0.175
1.109HisThr: 1.109 ± 0.176
1.941HisVal: 1.941 ± 0.235
0.126HisTrp: 0.126 ± 0.055
1.462HisTyr: 1.462 ± 0.181
0.0HisXaa: 0.0 ± 0.0
Ile
4.034IleAla: 4.034 ± 0.348
1.286IleCys: 1.286 ± 0.153
5.799IleAsp: 5.799 ± 0.354
4.084IleGlu: 4.084 ± 0.292
3.303IlePhe: 3.303 ± 0.286
1.664IleGly: 1.664 ± 0.184
1.563IleHis: 1.563 ± 0.174
4.74IleIle: 4.74 ± 0.448
5.496IleLys: 5.496 ± 0.399
5.522IleLeu: 5.522 ± 0.45
1.866IleMet: 1.866 ± 0.201
5.748IleAsn: 5.748 ± 0.42
2.32IlePro: 2.32 ± 0.272
2.093IleGln: 2.093 ± 0.229
2.471IleArg: 2.471 ± 0.209
3.731IleSer: 3.731 ± 0.315
3.177IleThr: 3.177 ± 0.297
5.849IleVal: 5.849 ± 0.382
0.403IleTrp: 0.403 ± 0.109
3.227IleTyr: 3.227 ± 0.349
0.0IleXaa: 0.0 ± 0.0
Lys
2.143LysAla: 2.143 ± 0.234
2.042LysCys: 2.042 ± 0.271
2.799LysAsp: 2.799 ± 0.25
2.219LysGlu: 2.219 ± 0.273
3.757LysPhe: 3.757 ± 0.364
1.689LysGly: 1.689 ± 0.28
1.689LysHis: 1.689 ± 0.208
5.042LysIle: 5.042 ± 0.391
5.017LysLys: 5.017 ± 0.439
6.833LysLeu: 6.833 ± 0.411
2.193LysMet: 2.193 ± 0.23
5.194LysAsn: 5.194 ± 0.434
2.193LysPro: 2.193 ± 0.19
2.546LysGln: 2.546 ± 0.228
4.412LysArg: 4.412 ± 0.298
4.538LysSer: 4.538 ± 0.394
4.236LysThr: 4.236 ± 0.286
3.152LysVal: 3.152 ± 0.291
0.378LysTrp: 0.378 ± 0.09
3.857LysTyr: 3.857 ± 0.301
0.0LysXaa: 0.0 ± 0.0
Leu
4.463LeuAla: 4.463 ± 0.352
2.294LeuCys: 2.294 ± 0.254
4.614LeuAsp: 4.614 ± 0.296
4.437LeuGlu: 4.437 ± 0.351
4.816LeuPhe: 4.816 ± 0.391
2.42LeuGly: 2.42 ± 0.294
1.664LeuHis: 1.664 ± 0.197
6.227LeuIle: 6.227 ± 0.407
6.127LeuLys: 6.127 ± 0.458
8.623LeuLeu: 8.623 ± 0.553
2.899LeuMet: 2.899 ± 0.295
6.53LeuAsn: 6.53 ± 0.357
3.101LeuPro: 3.101 ± 0.287
3.857LeuGln: 3.857 ± 0.404
4.261LeuArg: 4.261 ± 0.34
5.799LeuSer: 5.799 ± 0.374
4.614LeuThr: 4.614 ± 0.364
5.244LeuVal: 5.244 ± 0.362
0.807LeuTrp: 0.807 ± 0.14
4.463LeuTyr: 4.463 ± 0.38
0.0LeuXaa: 0.0 ± 0.0
Met
1.563MetAla: 1.563 ± 0.23
0.908MetCys: 0.908 ± 0.154
1.059MetAsp: 1.059 ± 0.157
1.311MetGlu: 1.311 ± 0.175
1.462MetPhe: 1.462 ± 0.167
0.807MetGly: 0.807 ± 0.16
0.58MetHis: 0.58 ± 0.107
1.916MetIle: 1.916 ± 0.234
1.74MetLys: 1.74 ± 0.23
2.622MetLeu: 2.622 ± 0.281
0.882MetMet: 0.882 ± 0.164
1.664MetAsn: 1.664 ± 0.173
1.21MetPro: 1.21 ± 0.169
0.832MetGln: 0.832 ± 0.146
1.513MetArg: 1.513 ± 0.197
2.37MetSer: 2.37 ± 0.208
1.891MetThr: 1.891 ± 0.191
1.387MetVal: 1.387 ± 0.175
0.353MetTrp: 0.353 ± 0.092
1.462MetTyr: 1.462 ± 0.189
0.0MetXaa: 0.0 ± 0.0
Asn
3.631AsnAla: 3.631 ± 0.339
1.336AsnCys: 1.336 ± 0.198
6.681AsnAsp: 6.681 ± 0.543
4.286AsnGlu: 4.286 ± 0.318
3.681AsnPhe: 3.681 ± 0.33
3.53AsnGly: 3.53 ± 0.25
1.261AsnHis: 1.261 ± 0.145
5.219AsnIle: 5.219 ± 0.421
5.37AsnLys: 5.37 ± 0.376
5.295AsnLeu: 5.295 ± 0.328
1.563AsnMet: 1.563 ± 0.23
7.791AsnAsn: 7.791 ± 0.738
1.639AsnPro: 1.639 ± 0.189
1.866AsnGln: 1.866 ± 0.213
3.757AsnArg: 3.757 ± 0.391
4.185AsnSer: 4.185 ± 0.34
4.009AsnThr: 4.009 ± 0.328
6.101AsnVal: 6.101 ± 0.404
0.378AsnTrp: 0.378 ± 0.109
4.009AsnTyr: 4.009 ± 0.307
0.0AsnXaa: 0.0 ± 0.0
Pro
1.714ProAla: 1.714 ± 0.224
0.479ProCys: 0.479 ± 0.127
1.941ProAsp: 1.941 ± 0.202
1.563ProGlu: 1.563 ± 0.282
1.563ProPhe: 1.563 ± 0.185
1.034ProGly: 1.034 ± 0.193
0.832ProHis: 0.832 ± 0.159
2.572ProIle: 2.572 ± 0.266
1.866ProLys: 1.866 ± 0.223
2.95ProLeu: 2.95 ± 0.266
1.084ProMet: 1.084 ± 0.182
2.345ProAsn: 2.345 ± 0.241
2.849ProPro: 2.849 ± 0.873
1.311ProGln: 1.311 ± 0.18
1.689ProArg: 1.689 ± 0.267
2.975ProSer: 2.975 ± 0.507
2.42ProThr: 2.42 ± 0.296
2.244ProVal: 2.244 ± 0.293
0.353ProTrp: 0.353 ± 0.088
1.361ProTyr: 1.361 ± 0.192
0.0ProXaa: 0.0 ± 0.0
Gln
1.135GlnAla: 1.135 ± 0.212
0.857GlnCys: 0.857 ± 0.136
1.387GlnAsp: 1.387 ± 0.203
1.261GlnGlu: 1.261 ± 0.206
1.941GlnPhe: 1.941 ± 0.206
0.529GlnGly: 0.529 ± 0.124
1.008GlnHis: 1.008 ± 0.149
2.32GlnIle: 2.32 ± 0.248
2.017GlnLys: 2.017 ± 0.228
4.11GlnLeu: 4.11 ± 0.345
1.336GlnMet: 1.336 ± 0.195
2.345GlnAsn: 2.345 ± 0.183
1.261GlnPro: 1.261 ± 0.186
2.849GlnGln: 2.849 ± 0.572
2.143GlnArg: 2.143 ± 0.237
2.395GlnSer: 2.395 ± 0.344
1.891GlnThr: 1.891 ± 0.268
1.689GlnVal: 1.689 ± 0.196
0.252GlnTrp: 0.252 ± 0.078
1.841GlnTyr: 1.841 ± 0.167
0.0GlnXaa: 0.0 ± 0.0
Arg
2.017ArgAla: 2.017 ± 0.256
1.059ArgCys: 1.059 ± 0.196
3.605ArgAsp: 3.605 ± 0.275
2.471ArgGlu: 2.471 ± 0.244
2.799ArgPhe: 2.799 ± 0.288
1.412ArgGly: 1.412 ± 0.166
1.664ArgHis: 1.664 ± 0.225
3.53ArgIle: 3.53 ± 0.275
2.899ArgLys: 2.899 ± 0.328
4.715ArgLeu: 4.715 ± 0.309
1.361ArgMet: 1.361 ± 0.175
3.807ArgAsn: 3.807 ± 0.304
1.967ArgPro: 1.967 ± 0.203
2.219ArgGln: 2.219 ± 0.22
4.362ArgArg: 4.362 ± 0.584
3.227ArgSer: 3.227 ± 0.38
2.546ArgThr: 2.546 ± 0.339
3.555ArgVal: 3.555 ± 0.288
0.504ArgTrp: 0.504 ± 0.127
2.345ArgTyr: 2.345 ± 0.267
0.0ArgXaa: 0.0 ± 0.0
Ser
3.555SerAla: 3.555 ± 0.358
1.109SerCys: 1.109 ± 0.179
3.706SerAsp: 3.706 ± 0.319
2.799SerGlu: 2.799 ± 0.222
2.723SerPhe: 2.723 ± 0.261
2.673SerGly: 2.673 ± 0.276
1.361SerHis: 1.361 ± 0.18
4.16SerIle: 4.16 ± 0.293
4.084SerLys: 4.084 ± 0.34
6.127SerLeu: 6.127 ± 0.442
1.614SerMet: 1.614 ± 0.194
4.362SerAsn: 4.362 ± 0.301
2.748SerPro: 2.748 ± 0.356
1.815SerGln: 1.815 ± 0.254
2.597SerArg: 2.597 ± 0.223
7.816SerSer: 7.816 ± 0.893
4.614SerThr: 4.614 ± 0.386
5.169SerVal: 5.169 ± 0.293
0.454SerTrp: 0.454 ± 0.127
3.0SerTyr: 3.0 ± 0.255
0.0SerXaa: 0.0 ± 0.0
Thr
2.925ThrAla: 2.925 ± 0.316
1.109ThrCys: 1.109 ± 0.174
3.404ThrAsp: 3.404 ± 0.295
2.042ThrGlu: 2.042 ± 0.23
2.597ThrPhe: 2.597 ± 0.274
1.815ThrGly: 1.815 ± 0.214
1.034ThrHis: 1.034 ± 0.159
4.69ThrIle: 4.69 ± 0.409
3.454ThrLys: 3.454 ± 0.279
5.446ThrLeu: 5.446 ± 0.441
1.841ThrMet: 1.841 ± 0.246
3.984ThrAsn: 3.984 ± 0.3
2.471ThrPro: 2.471 ± 0.303
2.219ThrGln: 2.219 ± 0.262
2.799ThrArg: 2.799 ± 0.255
3.656ThrSer: 3.656 ± 0.39
5.118ThrThr: 5.118 ± 0.583
4.034ThrVal: 4.034 ± 0.386
0.328ThrTrp: 0.328 ± 0.105
1.992ThrTyr: 1.992 ± 0.261
0.0ThrXaa: 0.0 ± 0.0
Val
4.437ValAla: 4.437 ± 0.332
1.992ValCys: 1.992 ± 0.282
5.975ValAsp: 5.975 ± 0.345
3.757ValGlu: 3.757 ± 0.312
3.227ValPhe: 3.227 ± 0.276
1.992ValGly: 1.992 ± 0.206
1.588ValHis: 1.588 ± 0.194
4.261ValIle: 4.261 ± 0.346
4.639ValLys: 4.639 ± 0.294
5.673ValLeu: 5.673 ± 0.314
1.714ValMet: 1.714 ± 0.186
4.387ValAsn: 4.387 ± 0.27
2.899ValPro: 2.899 ± 0.354
2.017ValGln: 2.017 ± 0.233
3.807ValArg: 3.807 ± 0.289
4.614ValSer: 4.614 ± 0.336
3.429ValThr: 3.429 ± 0.253
6.253ValVal: 6.253 ± 0.454
0.403ValTrp: 0.403 ± 0.107
3.857ValTyr: 3.857 ± 0.31
0.0ValXaa: 0.0 ± 0.0
Trp
0.403TrpAla: 0.403 ± 0.108
0.126TrpCys: 0.126 ± 0.061
0.303TrpAsp: 0.303 ± 0.081
0.479TrpGlu: 0.479 ± 0.132
0.252TrpPhe: 0.252 ± 0.093
0.126TrpGly: 0.126 ± 0.049
0.277TrpHis: 0.277 ± 0.078
0.454TrpIle: 0.454 ± 0.097
0.529TrpLys: 0.529 ± 0.102
0.504TrpLeu: 0.504 ± 0.099
0.126TrpMet: 0.126 ± 0.055
0.63TrpAsn: 0.63 ± 0.126
0.328TrpPro: 0.328 ± 0.094
0.227TrpGln: 0.227 ± 0.087
0.378TrpArg: 0.378 ± 0.109
0.656TrpSer: 0.656 ± 0.153
0.303TrpThr: 0.303 ± 0.089
0.328TrpVal: 0.328 ± 0.082
0.227TrpTrp: 0.227 ± 0.084
0.454TrpTyr: 0.454 ± 0.107
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.647TyrAla: 2.647 ± 0.243
1.387TyrCys: 1.387 ± 0.189
3.706TyrAsp: 3.706 ± 0.349
2.345TyrGlu: 2.345 ± 0.238
2.471TyrPhe: 2.471 ± 0.215
1.462TyrGly: 1.462 ± 0.19
1.311TyrHis: 1.311 ± 0.173
2.799TyrIle: 2.799 ± 0.24
3.631TyrLys: 3.631 ± 0.287
4.311TyrLeu: 4.311 ± 0.4
1.336TyrMet: 1.336 ± 0.193
4.311TyrAsn: 4.311 ± 0.32
0.807TyrPro: 0.807 ± 0.129
1.135TyrGln: 1.135 ± 0.161
2.244TyrArg: 2.244 ± 0.257
2.521TyrSer: 2.521 ± 0.21
2.849TyrThr: 2.849 ± 0.277
4.059TyrVal: 4.059 ± 0.343
0.126TyrTrp: 0.126 ± 0.064
2.773TyrTyr: 2.773 ± 0.299
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 135 proteins (39664 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski