Amino acid dipeptide frequency for Spodoptera frugiperda ascovirus 1a (SfAV-1a)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
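As a minimal sketch of what such a calculation could look like, the snippet below counts overlapping adjacent residue pairs within each protein. The function name `dipeptide_frequencies`, the use of one-letter codes, and the normalisation by the number of counted pairs are assumptions made for illustration; the database may normalise differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all adjacent residue pairs.

    Pairs are counted with overlap and never span two proteins.
    Normalising by the number of counted pairs is an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every overlapping adjacent pair
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter codes)
freqs = dipeptide_frequencies(["MALLA", "MAL"])
```

Here "MALLA" contributes MA, AL, LL and LA, and "MAL" contributes MA and AL, so six pairs are counted in total and no pair is formed across the boundary between the two sequences.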

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
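The comparison against the uniform expectation can be sketched as follows. The threshold of 1/400 (0.25%, i.e. 2.5‰) follows from the text; the function name `classify` and the assumption that the colouring compares each value directly with that threshold are illustrative only.

```python
N_STANDARD = 20  # standard amino acids; use 21 if Xaa is counted

# Uniform expectation: each of the 20 * 20 ordered pairs is equally likely.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille = 0.25 %


def classify(per_mille, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"


# Values taken from the table on this page (per mille).
table_values = {"LeuLeu": 8.043, "AlaAla": 4.421, "TrpTrp": 0.24}
labels = {name: classify(value) for name, value in table_values.items()}
```

With these inputs, LeuLeu and AlaAla fall above the 2.5‰ expectation and TrpTrp falls below it.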

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
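The unit conversion is simple arithmetic; a small sketch, with hypothetical helper names, is given below using the AlaAla value from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 4.421  # AlaAla entry from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.004421
percent = per_mille_to_percent(ala_ala)    # about 0.4421 %
```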

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.421 ± 0.296
AlaCys: 1.971 ± 0.243
AlaAsp: 3.116 ± 0.271
AlaGlu: 2.636 ± 0.293
AlaPhe: 2.503 ± 0.286
AlaGly: 3.036 ± 0.298
AlaHis: 1.172 ± 0.152
AlaIle: 3.675 ± 0.321
AlaLys: 3.302 ± 0.355
AlaLeu: 5.699 ± 0.407
AlaMet: 1.838 ± 0.22
AlaAsn: 2.796 ± 0.285
AlaPro: 2.104 ± 0.232
AlaGln: 1.411 ± 0.162
AlaArg: 5.806 ± 0.481
AlaSer: 4.634 ± 0.348
AlaThr: 4.208 ± 0.354
AlaVal: 4.847 ± 0.316
AlaTrp: 0.692 ± 0.15
AlaTyr: 2.69 ± 0.258
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.731 ± 0.214
CysCys: 0.746 ± 0.139
CysAsp: 1.917 ± 0.264
CysGlu: 1.491 ± 0.242
CysPhe: 0.826 ± 0.164
CysGly: 1.518 ± 0.216
CysHis: 0.586 ± 0.129
CysIle: 1.278 ± 0.173
CysLys: 1.332 ± 0.243
CysLeu: 2.184 ± 0.268
CysMet: 0.852 ± 0.161
CysAsn: 1.571 ± 0.208
CysPro: 0.959 ± 0.147
CysGln: 0.639 ± 0.128
CysArg: 2.45 ± 0.311
CysSer: 2.77 ± 0.31
CysThr: 1.997 ± 0.197
CysVal: 2.264 ± 0.273
CysTrp: 0.24 ± 0.071
CysTyr: 0.772 ± 0.187
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.527 ± 0.395
AspCys: 1.278 ± 0.196
AspAsp: 6.525 ± 0.446
AspGlu: 4.9 ± 0.359
AspPhe: 1.811 ± 0.26
AspGly: 4.581 ± 0.337
AspHis: 1.625 ± 0.198
AspIle: 3.142 ± 0.275
AspLys: 1.944 ± 0.218
AspLeu: 4.154 ± 0.352
AspMet: 2.077 ± 0.263
AspAsn: 2.903 ± 0.231
AspPro: 2.237 ± 0.263
AspGln: 1.012 ± 0.173
AspArg: 4.607 ± 0.352
AspSer: 4.314 ± 0.376
AspThr: 4.74 ± 0.401
AspVal: 6.578 ± 0.375
AspTrp: 0.586 ± 0.122
AspTyr: 2.636 ± 0.287
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.823 ± 0.306
GluCys: 1.758 ± 0.244
GluAsp: 2.663 ± 0.216
GluGlu: 2.13 ± 0.305
GluPhe: 1.838 ± 0.245
GluGly: 1.305 ± 0.152
GluHis: 1.411 ± 0.213
GluIle: 2.663 ± 0.264
GluLys: 1.704 ± 0.241
GluLeu: 5.593 ± 0.593
GluMet: 1.438 ± 0.194
GluAsn: 2.69 ± 0.316
GluPro: 1.571 ± 0.212
GluGln: 1.518 ± 0.206
GluArg: 4.501 ± 0.49
GluSer: 4.048 ± 0.314
GluThr: 3.515 ± 0.31
GluVal: 2.61 ± 0.274
GluTrp: 0.639 ± 0.111
GluTyr: 2.61 ± 0.305
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.077 ± 0.203
PheCys: 0.799 ± 0.134
PheAsp: 2.929 ± 0.358
PheGlu: 2.024 ± 0.279
PhePhe: 0.905 ± 0.117
PheGly: 2.37 ± 0.241
PheHis: 0.559 ± 0.135
PheIle: 1.758 ± 0.204
PheLys: 1.811 ± 0.201
PheLeu: 2.024 ± 0.274
PheMet: 0.959 ± 0.143
PheAsn: 2.184 ± 0.285
PhePro: 1.039 ± 0.153
PheGln: 0.879 ± 0.158
PheArg: 2.37 ± 0.286
PheSer: 2.264 ± 0.278
PheThr: 2.317 ± 0.243
PheVal: 3.915 ± 0.286
PheTrp: 0.133 ± 0.055
PheTyr: 0.852 ± 0.141
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.063 ± 0.325
GlyCys: 1.092 ± 0.194
GlyAsp: 4.261 ± 0.333
GlyGlu: 2.61 ± 0.23
GlyPhe: 1.678 ± 0.195
GlyGly: 4.181 ± 0.544
GlyHis: 1.385 ± 0.185
GlyIle: 1.784 ± 0.241
GlyLys: 2.237 ± 0.232
GlyLeu: 2.85 ± 0.306
GlyMet: 1.225 ± 0.174
GlyAsn: 2.024 ± 0.219
GlyPro: 1.278 ± 0.167
GlyGln: 0.959 ± 0.166
GlyArg: 4.021 ± 0.317
GlySer: 3.462 ± 0.312
GlyThr: 3.542 ± 0.291
GlyVal: 5.859 ± 0.448
GlyTrp: 0.506 ± 0.127
GlyTyr: 1.598 ± 0.231
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.518 ± 0.161
HisCys: 0.559 ± 0.136
HisAsp: 2.077 ± 0.231
HisGlu: 1.145 ± 0.142
HisPhe: 0.905 ± 0.126
HisGly: 1.225 ± 0.159
HisHis: 0.826 ± 0.244
HisIle: 1.198 ± 0.192
HisLys: 1.065 ± 0.157
HisLeu: 1.784 ± 0.178
HisMet: 0.586 ± 0.124
HisAsn: 0.826 ± 0.152
HisPro: 0.852 ± 0.192
HisGln: 0.905 ± 0.146
HisArg: 2.317 ± 0.29
HisSer: 1.651 ± 0.221
HisThr: 1.651 ± 0.243
HisVal: 2.69 ± 0.276
HisTrp: 0.213 ± 0.085
HisTyr: 0.985 ± 0.151
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.755 ± 0.338
IleCys: 0.852 ± 0.169
IleAsp: 3.941 ± 0.379
IleGlu: 3.196 ± 0.291
IlePhe: 1.252 ± 0.2
IleGly: 2.823 ± 0.326
IleHis: 1.358 ± 0.236
IleIle: 2.317 ± 0.318
IleLys: 1.545 ± 0.214
IleLeu: 3.036 ± 0.279
IleMet: 1.092 ± 0.175
IleAsn: 2.876 ± 0.337
IlePro: 2.157 ± 0.283
IleGln: 1.545 ± 0.187
IleArg: 4.128 ± 0.355
IleSer: 3.409 ± 0.308
IleThr: 3.063 ± 0.287
IleVal: 4.9 ± 0.401
IleTrp: 0.16 ± 0.07
IleTyr: 1.358 ± 0.191
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.13 ± 0.286
LysCys: 1.864 ± 0.204
LysAsp: 2.051 ± 0.277
LysGlu: 1.518 ± 0.254
LysPhe: 2.077 ± 0.234
LysGly: 1.784 ± 0.271
LysHis: 1.278 ± 0.191
LysIle: 1.997 ± 0.254
LysLys: 2.104 ± 0.293
LysLeu: 4.075 ± 0.339
LysMet: 1.145 ± 0.216
LysAsn: 2.13 ± 0.295
LysPro: 1.784 ± 0.216
LysGln: 1.411 ± 0.214
LysArg: 5.806 ± 0.352
LysSer: 4.075 ± 0.363
LysThr: 2.45 ± 0.291
LysVal: 2.503 ± 0.252
LysTrp: 0.506 ± 0.108
LysTyr: 2.264 ± 0.254
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.3 ± 0.439
LeuCys: 2.503 ± 0.264
LeuAsp: 5.087 ± 0.428
LeuGlu: 3.489 ± 0.359
LeuPhe: 2.823 ± 0.265
LeuGly: 2.636 ± 0.231
LeuHis: 2.423 ± 0.314
LeuIle: 3.116 ± 0.331
LeuLys: 4.847 ± 0.382
LeuLeu: 8.043 ± 0.61
LeuMet: 2.77 ± 0.249
LeuAsn: 3.915 ± 0.255
LeuPro: 3.622 ± 0.277
LeuGln: 2.69 ± 0.358
LeuArg: 7.457 ± 0.527
LeuSer: 5.832 ± 0.502
LeuThr: 5.113 ± 0.388
LeuVal: 6.365 ± 0.432
LeuTrp: 0.905 ± 0.163
LeuTyr: 3.702 ± 0.275
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.104 ± 0.254
MetCys: 1.039 ± 0.203
MetAsp: 1.438 ± 0.18
MetGlu: 1.145 ± 0.165
MetPhe: 1.225 ± 0.221
MetGly: 1.012 ± 0.156
MetHis: 0.506 ± 0.124
MetIle: 1.491 ± 0.233
MetLys: 1.145 ± 0.146
MetLeu: 2.636 ± 0.316
MetMet: 0.905 ± 0.183
MetAsn: 1.731 ± 0.227
MetPro: 0.586 ± 0.103
MetGln: 0.639 ± 0.126
MetArg: 1.864 ± 0.193
MetSer: 2.876 ± 0.301
MetThr: 1.891 ± 0.195
MetVal: 2.051 ± 0.251
MetTrp: 0.266 ± 0.079
MetTyr: 1.518 ± 0.218
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.808 ± 0.318
AsnCys: 1.065 ± 0.157
AsnAsp: 4.021 ± 0.336
AsnGlu: 3.196 ± 0.335
AsnPhe: 1.651 ± 0.219
AsnGly: 3.728 ± 0.409
AsnHis: 1.065 ± 0.176
AsnIle: 2.37 ± 0.258
AsnLys: 2.184 ± 0.292
AsnLeu: 3.116 ± 0.269
AsnMet: 1.411 ± 0.199
AsnAsn: 3.063 ± 0.409
AsnPro: 1.411 ± 0.19
AsnGln: 0.772 ± 0.134
AsnArg: 3.622 ± 0.339
AsnSer: 2.983 ± 0.301
AsnThr: 3.249 ± 0.329
AsnVal: 5.113 ± 0.321
AsnTrp: 0.533 ± 0.103
AsnTyr: 2.21 ± 0.242
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.077 ± 0.204
ProCys: 0.905 ± 0.177
ProAsp: 2.21 ± 0.231
ProGlu: 1.465 ± 0.222
ProPhe: 1.039 ± 0.15
ProGly: 1.252 ± 0.206
ProHis: 1.065 ± 0.183
ProIle: 2.024 ± 0.228
ProLys: 1.838 ± 0.21
ProLeu: 3.329 ± 0.269
ProMet: 1.119 ± 0.165
ProAsn: 2.157 ± 0.259
ProPro: 2.53 ± 0.486
ProGln: 0.719 ± 0.162
ProArg: 2.024 ± 0.225
ProSer: 4.314 ± 0.458
ProThr: 2.876 ± 0.262
ProVal: 3.009 ± 0.276
ProTrp: 0.399 ± 0.118
ProTyr: 1.491 ± 0.16
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.491 ± 0.196
GlnCys: 1.039 ± 0.166
GlnAsp: 1.225 ± 0.163
GlnGlu: 1.065 ± 0.165
GlnPhe: 0.985 ± 0.122
GlnGly: 0.772 ± 0.125
GlnHis: 0.692 ± 0.151
GlnIle: 1.305 ± 0.197
GlnLys: 1.065 ± 0.192
GlnLeu: 3.196 ± 0.347
GlnMet: 0.719 ± 0.149
GlnAsn: 0.959 ± 0.154
GlnPro: 0.719 ± 0.145
GlnGln: 1.039 ± 0.214
GlnArg: 2.716 ± 0.333
GlnSer: 2.077 ± 0.264
GlnThr: 1.491 ± 0.197
GlnVal: 1.571 ± 0.22
GlnTrp: 0.24 ± 0.081
GlnTyr: 1.092 ± 0.162
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.527 ± 0.352
ArgCys: 2.423 ± 0.264
ArgAsp: 4.687 ± 0.406
ArgGlu: 3.435 ± 0.32
ArgPhe: 2.929 ± 0.265
ArgGly: 3.329 ± 0.307
ArgHis: 2.104 ± 0.223
ArgIle: 4.714 ± 0.383
ArgLys: 3.462 ± 0.296
ArgLeu: 7.696 ± 0.525
ArgMet: 2.45 ± 0.259
ArgAsn: 4.208 ± 0.381
ArgPro: 2.956 ± 0.29
ArgGln: 2.29 ± 0.247
ArgArg: 8.682 ± 0.982
ArgSer: 7.483 ± 0.877
ArgThr: 5.353 ± 0.35
ArgVal: 6.125 ± 0.41
ArgTrp: 0.559 ± 0.103
ArgTyr: 3.276 ± 0.352
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.527 ± 0.414
SerCys: 2.13 ± 0.317
SerAsp: 4.181 ± 0.339
SerGlu: 3.755 ± 0.296
SerPhe: 2.903 ± 0.298
SerGly: 3.835 ± 0.323
SerHis: 1.651 ± 0.21
SerIle: 4.048 ± 0.306
SerLys: 4.234 ± 0.356
SerLeu: 6.471 ± 0.412
SerMet: 2.423 ± 0.26
SerAsn: 3.968 ± 0.348
SerPro: 3.595 ± 0.593
SerGln: 1.971 ± 0.298
SerArg: 6.285 ± 0.607
SerSer: 8.708 ± 0.915
SerThr: 5.939 ± 0.343
SerVal: 7.377 ± 0.415
SerTrp: 0.719 ± 0.131
SerTyr: 2.503 ± 0.234
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.048 ± 0.378
ThrCys: 1.784 ± 0.185
ThrAsp: 4.128 ± 0.365
ThrGlu: 2.61 ± 0.288
ThrPhe: 2.583 ± 0.307
ThrGly: 3.515 ± 0.312
ThrHis: 1.598 ± 0.203
ThrIle: 3.968 ± 0.363
ThrLys: 3.009 ± 0.348
ThrLeu: 6.525 ± 0.393
ThrMet: 1.917 ± 0.27
ThrAsn: 2.876 ± 0.246
ThrPro: 3.382 ± 0.371
ThrGln: 1.438 ± 0.228
ThrArg: 4.554 ± 0.297
ThrSer: 6.232 ± 0.538
ThrThr: 5.646 ± 0.487
ThrVal: 5.779 ± 0.426
ThrTrp: 0.692 ± 0.131
ThrTyr: 2.45 ± 0.287
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.273 ± 0.402
ValCys: 2.903 ± 0.362
ValAsp: 6.045 ± 0.39
ValGlu: 4.874 ± 0.479
ValPhe: 2.636 ± 0.318
ValGly: 3.995 ± 0.293
ValHis: 2.423 ± 0.294
ValIle: 3.808 ± 0.361
ValLys: 4.021 ± 0.336
ValLeu: 6.631 ± 0.403
ValMet: 1.704 ± 0.205
ValAsn: 4.767 ± 0.367
ValPro: 3.462 ± 0.337
ValGln: 2.477 ± 0.264
ValArg: 6.258 ± 0.353
ValSer: 6.418 ± 0.397
ValThr: 5.752 ± 0.364
ValVal: 7.244 ± 0.528
ValTrp: 0.905 ± 0.161
ValTyr: 3.435 ± 0.343
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.533 ± 0.105
TrpCys: 0.479 ± 0.103
TrpAsp: 0.32 ± 0.097
TrpGlu: 0.266 ± 0.085
TrpPhe: 0.32 ± 0.091
TrpGly: 0.32 ± 0.113
TrpHis: 0.373 ± 0.121
TrpIle: 0.346 ± 0.099
TrpLys: 0.586 ± 0.118
TrpLeu: 1.145 ± 0.192
TrpMet: 0.293 ± 0.085
TrpAsn: 0.666 ± 0.158
TrpPro: 0.266 ± 0.099
TrpGln: 0.186 ± 0.069
TrpArg: 0.533 ± 0.133
TrpSer: 0.799 ± 0.136
TrpThr: 0.746 ± 0.152
TrpVal: 0.533 ± 0.111
TrpTrp: 0.24 ± 0.082
TrpTyr: 0.613 ± 0.146
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.823 ± 0.242
TyrCys: 0.932 ± 0.155
TyrAsp: 3.356 ± 0.282
TyrGlu: 2.077 ± 0.262
TyrPhe: 1.278 ± 0.179
TyrGly: 2.397 ± 0.258
TyrHis: 0.879 ± 0.165
TyrIle: 1.758 ± 0.207
TyrLys: 1.518 ± 0.213
TyrLeu: 2.45 ± 0.214
TyrMet: 1.119 ± 0.189
TyrAsn: 2.264 ± 0.222
TyrPro: 1.332 ± 0.204
TyrGln: 0.959 ± 0.159
TyrArg: 2.716 ± 0.231
TyrSer: 2.876 ± 0.272
TyrThr: 3.196 ± 0.352
TyrVal: 3.702 ± 0.349
TyrTrp: 0.426 ± 0.098
TyrTyr: 1.758 ± 0.257
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 123 proteins (37551 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
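A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the fixed seed, the use of the population standard deviation as the error measure, and the toy sequences are all assumptions of this sketch; the note above does not specify these details.

```python
import random
import statistics
from collections import Counter


def dipeptide_fraction(proteins, pair):
    """Fraction of adjacent residue pairs in `proteins` equal to `pair`."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the pair's fraction over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_fraction(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes)
toy_proteome = ["MALLA", "MAL", "LLLL", "MKV"]
error = bootstrap_error(toy_proteome, "LL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much a frequency depends on which proteins happen to be in the proteome.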

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski