Amino acid dipepetide frequency for African swine fever virus (strain Badajoz 1971 Vero-adapted) (Ba71V) (ASFV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.923AlaAla: 3.923 ± 0.416
1.361AlaCys: 1.361 ± 0.17
2.762AlaAsp: 2.762 ± 0.269
3.522AlaGlu: 3.522 ± 0.29
2.542AlaPhe: 2.542 ± 0.227
2.382AlaGly: 2.382 ± 0.237
1.361AlaHis: 1.361 ± 0.178
4.503AlaIle: 4.503 ± 0.324
2.662AlaLys: 2.662 ± 0.293
5.384AlaLeu: 5.384 ± 0.387
1.381AlaMet: 1.381 ± 0.18
2.662AlaAsn: 2.662 ± 0.235
1.781AlaPro: 1.781 ± 0.204
2.001AlaGln: 2.001 ± 0.221
2.562AlaArg: 2.562 ± 0.274
3.342AlaSer: 3.342 ± 0.382
2.362AlaThr: 2.362 ± 0.228
3.502AlaVal: 3.502 ± 0.288
0.42AlaTrp: 0.42 ± 0.102
1.961AlaTyr: 1.961 ± 0.25
0.0AlaXaa: 0.0 ± 0.0
Cys
1.181CysAla: 1.181 ± 0.411
0.66CysCys: 0.66 ± 0.13
0.5CysAsp: 0.5 ± 0.098
0.861CysGlu: 0.861 ± 0.122
1.181CysPhe: 1.181 ± 0.173
1.201CysGly: 1.201 ± 0.171
0.921CysHis: 0.921 ± 0.162
1.621CysIle: 1.621 ± 0.219
1.401CysLys: 1.401 ± 0.174
1.861CysLeu: 1.861 ± 0.209
0.68CysMet: 0.68 ± 0.145
0.921CysAsn: 0.921 ± 0.138
0.921CysPro: 0.921 ± 0.203
0.68CysGln: 0.68 ± 0.133
0.921CysArg: 0.921 ± 0.142
1.341CysSer: 1.341 ± 0.179
1.141CysThr: 1.141 ± 0.143
0.881CysVal: 0.881 ± 0.146
0.3CysTrp: 0.3 ± 0.093
0.7CysTyr: 0.7 ± 0.111
0.0CysXaa: 0.0 ± 0.0
Asp
2.642AspAla: 2.642 ± 0.21
0.821AspCys: 0.821 ± 0.148
2.362AspAsp: 2.362 ± 0.259
3.022AspGlu: 3.022 ± 0.237
2.342AspPhe: 2.342 ± 0.215
1.861AspGly: 1.861 ± 0.224
1.081AspHis: 1.081 ± 0.171
4.283AspIle: 4.283 ± 0.356
3.022AspLys: 3.022 ± 0.241
5.024AspLeu: 5.024 ± 0.348
1.401AspMet: 1.401 ± 0.168
2.342AspAsn: 2.342 ± 0.203
2.322AspPro: 2.322 ± 0.239
1.261AspGln: 1.261 ± 0.157
1.401AspArg: 1.401 ± 0.164
2.762AspSer: 2.762 ± 0.248
2.822AspThr: 2.822 ± 0.348
2.422AspVal: 2.422 ± 0.179
0.48AspTrp: 0.48 ± 0.083
1.801AspTyr: 1.801 ± 0.212
0.0AspXaa: 0.0 ± 0.0
Glu
3.502GluAla: 3.502 ± 0.242
1.161GluCys: 1.161 ± 0.187
3.262GluAsp: 3.262 ± 0.269
5.104GluGlu: 5.104 ± 0.454
2.822GluPhe: 2.822 ± 0.243
2.262GluGly: 2.262 ± 0.26
1.481GluHis: 1.481 ± 0.177
4.663GluIle: 4.663 ± 0.303
5.904GluLys: 5.904 ± 0.432
6.404GluLeu: 6.404 ± 0.338
1.761GluMet: 1.761 ± 0.171
3.883GluAsn: 3.883 ± 0.322
2.242GluPro: 2.242 ± 0.218
2.842GluGln: 2.842 ± 0.275
2.482GluArg: 2.482 ± 0.198
2.802GluSer: 2.802 ± 0.247
4.283GluThr: 4.283 ± 0.435
2.522GluVal: 2.522 ± 0.214
0.961GluTrp: 0.961 ± 0.152
2.902GluTyr: 2.902 ± 0.223
0.0GluXaa: 0.0 ± 0.0
Phe
1.721PheAla: 1.721 ± 0.211
1.061PheCys: 1.061 ± 0.143
1.921PheAsp: 1.921 ± 0.202
2.602PheGlu: 2.602 ± 0.191
2.382PhePhe: 2.382 ± 0.228
1.541PheGly: 1.541 ± 0.18
1.061PheHis: 1.061 ± 0.15
4.603PheIle: 4.603 ± 0.274
3.863PheLys: 3.863 ± 0.317
5.044PheLeu: 5.044 ± 0.332
1.561PheMet: 1.561 ± 0.187
3.362PheAsn: 3.362 ± 0.287
1.841PhePro: 1.841 ± 0.197
1.761PheGln: 1.761 ± 0.238
1.481PheArg: 1.481 ± 0.185
3.803PheSer: 3.803 ± 0.283
2.762PheThr: 2.762 ± 0.26
2.362PheVal: 2.362 ± 0.214
0.54PheTrp: 0.54 ± 0.105
2.682PheTyr: 2.682 ± 0.252
0.0PheXaa: 0.0 ± 0.0
Gly
2.662GlyAla: 2.662 ± 0.268
0.66GlyCys: 0.66 ± 0.135
1.961GlyAsp: 1.961 ± 0.217
2.242GlyGlu: 2.242 ± 0.233
1.981GlyPhe: 1.981 ± 0.194
3.022GlyGly: 3.022 ± 0.328
1.141GlyHis: 1.141 ± 0.126
3.763GlyIle: 3.763 ± 0.346
3.502GlyLys: 3.502 ± 0.251
4.683GlyLeu: 4.683 ± 0.305
1.041GlyMet: 1.041 ± 0.13
2.202GlyAsn: 2.202 ± 0.223
1.581GlyPro: 1.581 ± 0.163
1.381GlyGln: 1.381 ± 0.142
1.941GlyArg: 1.941 ± 0.219
3.122GlySer: 3.122 ± 0.254
2.061GlyThr: 2.061 ± 0.203
2.262GlyVal: 2.262 ± 0.241
0.3GlyTrp: 0.3 ± 0.071
2.061GlyTyr: 2.061 ± 0.173
0.0GlyXaa: 0.0 ± 0.0
His
1.281HisAla: 1.281 ± 0.14
0.52HisCys: 0.52 ± 0.109
1.341HisAsp: 1.341 ± 0.16
1.801HisGlu: 1.801 ± 0.176
1.581HisPhe: 1.581 ± 0.189
1.501HisGly: 1.501 ± 0.156
0.881HisHis: 0.881 ± 0.132
2.422HisIle: 2.422 ± 0.207
2.001HisLys: 2.001 ± 0.23
3.162HisLeu: 3.162 ± 0.288
0.56HisMet: 0.56 ± 0.117
1.361HisAsn: 1.361 ± 0.168
1.141HisPro: 1.141 ± 0.172
1.241HisGln: 1.241 ± 0.156
1.261HisArg: 1.261 ± 0.154
1.681HisSer: 1.681 ± 0.17
1.541HisThr: 1.541 ± 0.162
1.621HisVal: 1.621 ± 0.209
0.24HisTrp: 0.24 ± 0.067
1.601HisTyr: 1.601 ± 0.173
0.0HisXaa: 0.0 ± 0.0
Ile
3.903IleAla: 3.903 ± 0.244
1.781IleCys: 1.781 ± 0.196
3.402IleAsp: 3.402 ± 0.247
4.763IleGlu: 4.763 ± 0.31
4.423IlePhe: 4.423 ± 0.265
3.002IleGly: 3.002 ± 0.268
2.822IleHis: 2.822 ± 0.24
6.965IleIle: 6.965 ± 0.482
6.264IleLys: 6.264 ± 0.398
8.826IleLeu: 8.826 ± 0.364
2.041IleMet: 2.041 ± 0.173
5.184IleAsn: 5.184 ± 0.411
3.983IlePro: 3.983 ± 0.371
3.883IleGln: 3.883 ± 0.324
3.663IleArg: 3.663 ± 0.351
5.564IleSer: 5.564 ± 0.353
4.303IleThr: 4.303 ± 0.266
4.043IleVal: 4.043 ± 0.3
0.5IleTrp: 0.5 ± 0.118
3.823IleTyr: 3.823 ± 0.361
0.0IleXaa: 0.0 ± 0.0
Lys
3.562LysAla: 3.562 ± 0.276
1.181LysCys: 1.181 ± 0.205
3.643LysAsp: 3.643 ± 0.32
5.264LysGlu: 5.264 ± 0.388
2.382LysPhe: 2.382 ± 0.219
2.762LysGly: 2.762 ± 0.23
2.722LysHis: 2.722 ± 0.268
5.964LysIle: 5.964 ± 0.381
7.805LysLys: 7.805 ± 0.472
5.804LysLeu: 5.804 ± 0.351
2.182LysMet: 2.182 ± 0.222
6.485LysAsn: 6.485 ± 0.417
2.942LysPro: 2.942 ± 0.365
3.482LysGln: 3.482 ± 0.234
2.982LysArg: 2.982 ± 0.227
2.962LysSer: 2.962 ± 0.26
5.064LysThr: 5.064 ± 0.321
2.882LysVal: 2.882 ± 0.202
0.62LysTrp: 0.62 ± 0.116
3.943LysTyr: 3.943 ± 0.321
0.0LysXaa: 0.0 ± 0.0
Leu
5.364LeuAla: 5.364 ± 0.367
2.001LeuCys: 2.001 ± 0.197
3.983LeuAsp: 3.983 ± 0.24
5.984LeuGlu: 5.984 ± 0.391
4.843LeuPhe: 4.843 ± 0.381
4.223LeuGly: 4.223 ± 0.326
2.862LeuHis: 2.862 ± 0.249
8.166LeuIle: 8.166 ± 0.514
7.665LeuLys: 7.665 ± 0.472
11.108LeuLeu: 11.108 ± 0.622
2.722LeuMet: 2.722 ± 0.261
6.364LeuAsn: 6.364 ± 0.362
3.943LeuPro: 3.943 ± 0.266
5.484LeuGln: 5.484 ± 0.326
4.063LeuArg: 4.063 ± 0.312
6.505LeuSer: 6.505 ± 0.388
5.604LeuThr: 5.604 ± 0.316
4.943LeuVal: 4.943 ± 0.29
1.101LeuTrp: 1.101 ± 0.163
4.503LeuTyr: 4.503 ± 0.304
0.0LeuXaa: 0.0 ± 0.0
Met
1.721MetAla: 1.721 ± 0.174
0.32MetCys: 0.32 ± 0.07
1.481MetAsp: 1.481 ± 0.185
1.881MetGlu: 1.881 ± 0.193
1.501MetPhe: 1.501 ± 0.175
1.301MetGly: 1.301 ± 0.182
0.6MetHis: 0.6 ± 0.112
1.741MetIle: 1.741 ± 0.177
1.181MetLys: 1.181 ± 0.203
3.703MetLeu: 3.703 ± 0.296
0.761MetMet: 0.761 ± 0.125
1.121MetAsn: 1.121 ± 0.157
1.161MetPro: 1.161 ± 0.15
1.081MetGln: 1.081 ± 0.153
1.261MetArg: 1.261 ± 0.177
1.341MetSer: 1.341 ± 0.155
1.021MetThr: 1.021 ± 0.148
1.701MetVal: 1.701 ± 0.195
0.28MetTrp: 0.28 ± 0.063
1.281MetTyr: 1.281 ± 0.135
0.0MetXaa: 0.0 ± 0.0
Asn
2.962AsnAla: 2.962 ± 0.22
1.101AsnCys: 1.101 ± 0.188
2.782AsnAsp: 2.782 ± 0.25
3.062AsnGlu: 3.062 ± 0.227
3.042AsnPhe: 3.042 ± 0.252
2.222AsnGly: 2.222 ± 0.22
1.901AsnHis: 1.901 ± 0.235
6.765AsnIle: 6.765 ± 0.45
3.883AsnLys: 3.883 ± 0.284
5.364AsnLeu: 5.364 ± 0.419
1.661AsnMet: 1.661 ± 0.198
4.123AsnAsn: 4.123 ± 0.415
2.782AsnPro: 2.782 ± 0.269
2.242AsnGln: 2.242 ± 0.236
2.282AsnArg: 2.282 ± 0.205
2.922AsnSer: 2.922 ± 0.271
3.763AsnThr: 3.763 ± 0.266
3.402AsnVal: 3.402 ± 0.314
0.58AsnTrp: 0.58 ± 0.113
3.182AsnTyr: 3.182 ± 0.28
0.0AsnXaa: 0.0 ± 0.0
Pro
2.061ProAla: 2.061 ± 0.26
0.881ProCys: 0.881 ± 0.25
1.961ProAsp: 1.961 ± 0.227
3.663ProGlu: 3.663 ± 0.269
1.841ProPhe: 1.841 ± 0.195
2.141ProGly: 2.141 ± 0.236
0.881ProHis: 0.881 ± 0.185
3.102ProIle: 3.102 ± 0.253
2.862ProLys: 2.862 ± 0.303
4.203ProLeu: 4.203 ± 0.26
0.761ProMet: 0.761 ± 0.122
2.362ProAsn: 2.362 ± 0.216
2.742ProPro: 2.742 ± 0.46
1.701ProGln: 1.701 ± 0.22
1.541ProArg: 1.541 ± 0.171
3.522ProSer: 3.522 ± 0.248
2.622ProThr: 2.622 ± 0.237
2.422ProVal: 2.422 ± 0.252
0.42ProTrp: 0.42 ± 0.091
1.841ProTyr: 1.841 ± 0.19
0.0ProXaa: 0.0 ± 0.0
Gln
2.282GlnAla: 2.282 ± 0.2
0.62GlnCys: 0.62 ± 0.105
2.101GlnAsp: 2.101 ± 0.216
3.022GlnGlu: 3.022 ± 0.213
1.461GlnPhe: 1.461 ± 0.158
1.881GlnGly: 1.881 ± 0.206
1.721GlnHis: 1.721 ± 0.191
3.042GlnIle: 3.042 ± 0.228
3.803GlnLys: 3.803 ± 0.272
3.943GlnLeu: 3.943 ± 0.26
1.061GlnMet: 1.061 ± 0.143
2.422GlnAsn: 2.422 ± 0.264
1.901GlnPro: 1.901 ± 0.225
2.462GlnGln: 2.462 ± 0.245
2.101GlnArg: 2.101 ± 0.205
2.242GlnSer: 2.242 ± 0.225
2.382GlnThr: 2.382 ± 0.19
2.141GlnVal: 2.141 ± 0.177
0.54GlnTrp: 0.54 ± 0.085
1.961GlnTyr: 1.961 ± 0.189
0.0GlnXaa: 0.0 ± 0.0
Arg
2.141ArgAla: 2.141 ± 0.213
0.761ArgCys: 0.761 ± 0.13
1.661ArgAsp: 1.661 ± 0.185
2.782ArgGlu: 2.782 ± 0.242
2.322ArgPhe: 2.322 ± 0.245
1.741ArgGly: 1.741 ± 0.215
1.401ArgHis: 1.401 ± 0.15
3.242ArgIle: 3.242 ± 0.233
3.142ArgLys: 3.142 ± 0.252
4.223ArgLeu: 4.223 ± 0.344
1.101ArgMet: 1.101 ± 0.146
2.121ArgAsn: 2.121 ± 0.221
1.981ArgPro: 1.981 ± 0.218
1.781ArgGln: 1.781 ± 0.182
1.721ArgArg: 1.721 ± 0.201
2.182ArgSer: 2.182 ± 0.245
1.841ArgThr: 1.841 ± 0.184
2.202ArgVal: 2.202 ± 0.256
0.48ArgTrp: 0.48 ± 0.102
1.681ArgTyr: 1.681 ± 0.185
0.0ArgXaa: 0.0 ± 0.0
Ser
2.522SerAla: 2.522 ± 0.229
1.221SerCys: 1.221 ± 0.183
2.021SerAsp: 2.021 ± 0.194
3.663SerGlu: 3.663 ± 0.245
3.042SerPhe: 3.042 ± 0.263
2.922SerGly: 2.922 ± 0.24
1.381SerHis: 1.381 ± 0.182
5.464SerIle: 5.464 ± 0.365
4.343SerLys: 4.343 ± 0.319
6.645SerLeu: 6.645 ± 0.383
1.921SerMet: 1.921 ± 0.191
3.182SerAsn: 3.182 ± 0.295
3.122SerPro: 3.122 ± 0.293
2.382SerGln: 2.382 ± 0.246
2.402SerArg: 2.402 ± 0.208
4.583SerSer: 4.583 ± 0.32
4.003SerThr: 4.003 ± 0.401
3.122SerVal: 3.122 ± 0.283
0.6SerTrp: 0.6 ± 0.112
2.822SerTyr: 2.822 ± 0.215
0.0SerXaa: 0.0 ± 0.0
Thr
2.962ThrAla: 2.962 ± 0.26
1.481ThrCys: 1.481 ± 0.418
2.802ThrAsp: 2.802 ± 0.209
3.202ThrGlu: 3.202 ± 0.228
2.962ThrPhe: 2.962 ± 0.188
2.522ThrGly: 2.522 ± 0.235
1.601ThrHis: 1.601 ± 0.189
4.703ThrIle: 4.703 ± 0.285
3.442ThrLys: 3.442 ± 0.267
6.064ThrLeu: 6.064 ± 0.339
1.341ThrMet: 1.341 ± 0.142
3.322ThrAsn: 3.322 ± 0.273
2.822ThrPro: 2.822 ± 0.248
2.582ThrGln: 2.582 ± 0.207
2.222ThrArg: 2.222 ± 0.212
3.502ThrSer: 3.502 ± 0.296
3.162ThrThr: 3.162 ± 0.273
2.722ThrVal: 2.722 ± 0.208
0.6ThrTrp: 0.6 ± 0.108
2.322ThrTyr: 2.322 ± 0.232
0.0ThrXaa: 0.0 ± 0.0
Val
2.842ValAla: 2.842 ± 0.263
0.861ValCys: 0.861 ± 0.143
2.982ValAsp: 2.982 ± 0.332
3.102ValGlu: 3.102 ± 0.227
2.702ValPhe: 2.702 ± 0.244
2.121ValGly: 2.121 ± 0.195
1.261ValHis: 1.261 ± 0.161
3.883ValIle: 3.883 ± 0.295
4.203ValLys: 4.203 ± 0.354
5.544ValLeu: 5.544 ± 0.289
1.061ValMet: 1.061 ± 0.151
2.642ValAsn: 2.642 ± 0.225
2.001ValPro: 2.001 ± 0.18
2.582ValGln: 2.582 ± 0.237
2.322ValArg: 2.322 ± 0.239
3.262ValSer: 3.262 ± 0.257
2.362ValThr: 2.362 ± 0.196
3.242ValVal: 3.242 ± 0.229
0.44ValTrp: 0.44 ± 0.081
2.021ValTyr: 2.021 ± 0.254
0.0ValXaa: 0.0 ± 0.0
Trp
0.7TrpAla: 0.7 ± 0.117
0.32TrpCys: 0.32 ± 0.081
0.52TrpAsp: 0.52 ± 0.106
1.021TrpGlu: 1.021 ± 0.136
0.42TrpPhe: 0.42 ± 0.086
0.42TrpGly: 0.42 ± 0.095
0.34TrpHis: 0.34 ± 0.075
0.861TrpIle: 0.861 ± 0.133
0.821TrpLys: 0.821 ± 0.125
0.861TrpLeu: 0.861 ± 0.126
0.28TrpMet: 0.28 ± 0.059
0.52TrpAsn: 0.52 ± 0.1
0.38TrpPro: 0.38 ± 0.107
0.34TrpGln: 0.34 ± 0.071
0.4TrpArg: 0.4 ± 0.082
0.5TrpSer: 0.5 ± 0.127
0.5TrpThr: 0.5 ± 0.126
0.64TrpVal: 0.64 ± 0.11
0.28TrpTrp: 0.28 ± 0.067
0.48TrpTyr: 0.48 ± 0.086
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.442TyrAla: 2.442 ± 0.244
1.101TyrCys: 1.101 ± 0.158
2.021TyrAsp: 2.021 ± 0.225
2.682TyrGlu: 2.682 ± 0.246
2.182TyrPhe: 2.182 ± 0.219
2.542TyrGly: 2.542 ± 0.194
1.321TyrHis: 1.321 ± 0.17
3.382TyrIle: 3.382 ± 0.235
2.862TyrLys: 2.862 ± 0.222
3.583TyrLeu: 3.583 ± 0.266
1.141TyrMet: 1.141 ± 0.157
3.442TyrAsn: 3.442 ± 0.256
1.941TyrPro: 1.941 ± 0.175
1.921TyrGln: 1.921 ± 0.202
1.501TyrArg: 1.501 ± 0.172
3.422TyrSer: 3.422 ± 0.264
2.742TyrThr: 2.742 ± 0.221
2.422TyrVal: 2.422 ± 0.213
0.921TyrTrp: 0.921 ± 0.132
2.662TyrTyr: 2.662 ± 0.304
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 150 proteins (49966 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski