Amino acid dipepetide frequency for African swine fever virus (ASFV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.77AlaAla: 3.77 ± 0.352
1.416AlaCys: 1.416 ± 0.181
2.759AlaAsp: 2.759 ± 0.223
3.347AlaGlu: 3.347 ± 0.236
2.446AlaPhe: 2.446 ± 0.194
2.299AlaGly: 2.299 ± 0.223
1.416AlaHis: 1.416 ± 0.159
4.616AlaIle: 4.616 ± 0.321
2.74AlaLys: 2.74 ± 0.251
5.591AlaLeu: 5.591 ± 0.341
1.398AlaMet: 1.398 ± 0.161
2.685AlaAsn: 2.685 ± 0.205
1.729AlaPro: 1.729 ± 0.163
1.894AlaGln: 1.894 ± 0.181
2.428AlaArg: 2.428 ± 0.236
3.145AlaSer: 3.145 ± 0.332
2.262AlaThr: 2.262 ± 0.22
3.66AlaVal: 3.66 ± 0.29
0.405AlaTrp: 0.405 ± 0.091
1.949AlaTyr: 1.949 ± 0.195
0.0AlaXaa: 0.0 ± 0.0
Cys
1.269CysAla: 1.269 ± 0.385
0.644CysCys: 0.644 ± 0.114
0.607CysAsp: 0.607 ± 0.111
0.846CysGlu: 0.846 ± 0.126
1.379CysPhe: 1.379 ± 0.197
1.177CysGly: 1.177 ± 0.16
1.03CysHis: 1.03 ± 0.158
1.802CysIle: 1.802 ± 0.193
1.582CysLys: 1.582 ± 0.191
1.986CysLeu: 1.986 ± 0.205
0.662CysMet: 0.662 ± 0.106
0.901CysAsn: 0.901 ± 0.146
0.883CysPro: 0.883 ± 0.193
0.57CysGln: 0.57 ± 0.106
0.975CysArg: 0.975 ± 0.157
1.343CysSer: 1.343 ± 0.196
1.14CysThr: 1.14 ± 0.163
0.901CysVal: 0.901 ± 0.132
0.294CysTrp: 0.294 ± 0.085
0.846CysTyr: 0.846 ± 0.133
0.0CysXaa: 0.0 ± 0.0
Asp
2.52AspAla: 2.52 ± 0.199
0.883AspCys: 0.883 ± 0.154
2.372AspAsp: 2.372 ± 0.272
3.016AspGlu: 3.016 ± 0.251
2.391AspPhe: 2.391 ± 0.217
1.802AspGly: 1.802 ± 0.203
1.177AspHis: 1.177 ± 0.153
4.377AspIle: 4.377 ± 0.294
2.832AspLys: 2.832 ± 0.232
5.517AspLeu: 5.517 ± 0.304
1.471AspMet: 1.471 ± 0.176
2.372AspAsn: 2.372 ± 0.2
2.299AspPro: 2.299 ± 0.207
1.195AspGln: 1.195 ± 0.141
1.379AspArg: 1.379 ± 0.159
2.722AspSer: 2.722 ± 0.276
2.777AspThr: 2.777 ± 0.355
2.446AspVal: 2.446 ± 0.187
0.515AspTrp: 0.515 ± 0.087
1.986AspTyr: 1.986 ± 0.214
0.0AspXaa: 0.0 ± 0.0
Glu
3.457GluAla: 3.457 ± 0.215
1.14GluCys: 1.14 ± 0.144
3.163GluAsp: 3.163 ± 0.283
4.947GluGlu: 4.947 ± 0.426
2.795GluPhe: 2.795 ± 0.206
2.262GluGly: 2.262 ± 0.203
1.49GluHis: 1.49 ± 0.171
4.855GluIle: 4.855 ± 0.299
5.903GluLys: 5.903 ± 0.39
6.29GluLeu: 6.29 ± 0.397
1.766GluMet: 1.766 ± 0.197
3.844GluAsn: 3.844 ± 0.252
2.207GluPro: 2.207 ± 0.201
2.759GluGln: 2.759 ± 0.228
2.446GluArg: 2.446 ± 0.202
2.63GluSer: 2.63 ± 0.246
4.193GluThr: 4.193 ± 0.375
2.483GluVal: 2.483 ± 0.201
0.975GluTrp: 0.975 ± 0.129
2.869GluTyr: 2.869 ± 0.215
0.0GluXaa: 0.0 ± 0.0
Phe
1.802PheAla: 1.802 ± 0.194
1.214PheCys: 1.214 ± 0.181
2.041PheAsp: 2.041 ± 0.168
2.483PheGlu: 2.483 ± 0.197
2.483PhePhe: 2.483 ± 0.242
1.508PheGly: 1.508 ± 0.177
1.067PheHis: 1.067 ± 0.136
4.671PheIle: 4.671 ± 0.279
3.715PheLys: 3.715 ± 0.295
5.002PheLeu: 5.002 ± 0.298
1.471PheMet: 1.471 ± 0.163
3.255PheAsn: 3.255 ± 0.214
1.766PhePro: 1.766 ± 0.193
1.692PheGln: 1.692 ± 0.225
1.453PheArg: 1.453 ± 0.174
3.954PheSer: 3.954 ± 0.261
2.685PheThr: 2.685 ± 0.224
2.409PheVal: 2.409 ± 0.217
0.57PheTrp: 0.57 ± 0.089
2.887PheTyr: 2.887 ± 0.239
0.0PheXaa: 0.0 ± 0.0
Gly
2.979GlyAla: 2.979 ± 0.281
0.717GlyCys: 0.717 ± 0.121
1.894GlyAsp: 1.894 ± 0.199
2.244GlyGlu: 2.244 ± 0.223
1.913GlyPhe: 1.913 ± 0.221
3.016GlyGly: 3.016 ± 0.324
1.067GlyHis: 1.067 ± 0.124
3.623GlyIle: 3.623 ± 0.325
3.329GlyLys: 3.329 ± 0.241
4.726GlyLeu: 4.726 ± 0.291
1.067GlyMet: 1.067 ± 0.128
2.317GlyAsn: 2.317 ± 0.219
1.545GlyPro: 1.545 ± 0.168
1.343GlyGln: 1.343 ± 0.15
1.839GlyArg: 1.839 ± 0.193
2.943GlySer: 2.943 ± 0.224
1.986GlyThr: 1.986 ± 0.195
2.133GlyVal: 2.133 ± 0.227
0.294GlyTrp: 0.294 ± 0.079
2.041GlyTyr: 2.041 ± 0.172
0.0GlyXaa: 0.0 ± 0.0
His
1.361HisAla: 1.361 ± 0.155
0.589HisCys: 0.589 ± 0.118
1.416HisAsp: 1.416 ± 0.15
1.931HisGlu: 1.931 ± 0.176
1.545HisPhe: 1.545 ± 0.178
1.453HisGly: 1.453 ± 0.175
0.975HisHis: 0.975 ± 0.139
2.391HisIle: 2.391 ± 0.21
2.097HisLys: 2.097 ± 0.202
3.126HisLeu: 3.126 ± 0.234
0.57HisMet: 0.57 ± 0.119
1.526HisAsn: 1.526 ± 0.18
1.177HisPro: 1.177 ± 0.164
1.177HisGln: 1.177 ± 0.143
1.251HisArg: 1.251 ± 0.146
1.802HisSer: 1.802 ± 0.179
1.508HisThr: 1.508 ± 0.175
1.618HisVal: 1.618 ± 0.193
0.239HisTrp: 0.239 ± 0.065
1.6HisTyr: 1.6 ± 0.165
0.0HisXaa: 0.0 ± 0.0
Ile
3.972IleAla: 3.972 ± 0.247
1.876IleCys: 1.876 ± 0.181
3.641IleAsp: 3.641 ± 0.241
4.69IleGlu: 4.69 ± 0.288
4.451IlePhe: 4.451 ± 0.247
3.016IleGly: 3.016 ± 0.22
2.832IleHis: 2.832 ± 0.252
6.952IleIle: 6.952 ± 0.417
6.345IleLys: 6.345 ± 0.417
9.195IleLeu: 9.195 ± 0.428
2.152IleMet: 2.152 ± 0.198
5.26IleAsn: 5.26 ± 0.349
3.77IlePro: 3.77 ± 0.279
3.917IleGln: 3.917 ± 0.284
3.733IleArg: 3.733 ± 0.341
5.554IleSer: 5.554 ± 0.378
4.175IleThr: 4.175 ± 0.28
3.917IleVal: 3.917 ± 0.285
0.589IleTrp: 0.589 ± 0.122
3.917IleTyr: 3.917 ± 0.307
0.0IleXaa: 0.0 ± 0.0
Lys
3.476LysAla: 3.476 ± 0.285
1.287LysCys: 1.287 ± 0.199
3.605LysAsp: 3.605 ± 0.25
5.315LysGlu: 5.315 ± 0.406
2.446LysPhe: 2.446 ± 0.245
2.648LysGly: 2.648 ± 0.238
2.851LysHis: 2.851 ± 0.204
6.106LysIle: 6.106 ± 0.321
7.871LysLys: 7.871 ± 0.448
5.811LysLeu: 5.811 ± 0.312
2.225LysMet: 2.225 ± 0.204
6.529LysAsn: 6.529 ± 0.403
2.759LysPro: 2.759 ± 0.32
3.439LysGln: 3.439 ± 0.261
2.832LysArg: 2.832 ± 0.217
2.998LysSer: 2.998 ± 0.255
4.984LysThr: 4.984 ± 0.352
2.961LysVal: 2.961 ± 0.217
0.607LysTrp: 0.607 ± 0.107
4.083LysTyr: 4.083 ± 0.329
0.0LysXaa: 0.0 ± 0.0
Leu
5.315LeuAla: 5.315 ± 0.385
2.28LeuCys: 2.28 ± 0.199
4.285LeuAsp: 4.285 ± 0.269
6.051LeuGlu: 6.051 ± 0.396
4.984LeuPhe: 4.984 ± 0.334
4.524LeuGly: 4.524 ± 0.303
2.924LeuHis: 2.924 ± 0.254
8.386LeuIle: 8.386 ± 0.411
7.743LeuLys: 7.743 ± 0.427
11.218LeuLeu: 11.218 ± 0.531
2.685LeuMet: 2.685 ± 0.298
6.382LeuAsn: 6.382 ± 0.348
4.009LeuPro: 4.009 ± 0.227
5.425LeuGln: 5.425 ± 0.317
4.12LeuArg: 4.12 ± 0.338
6.805LeuSer: 6.805 ± 0.387
5.683LeuThr: 5.683 ± 0.279
4.91LeuVal: 4.91 ± 0.314
1.232LeuTrp: 1.232 ± 0.164
4.579LeuTyr: 4.579 ± 0.329
0.0LeuXaa: 0.0 ± 0.0
Met
1.618MetAla: 1.618 ± 0.193
0.349MetCys: 0.349 ± 0.069
1.49MetAsp: 1.49 ± 0.178
1.839MetGlu: 1.839 ± 0.161
1.582MetPhe: 1.582 ± 0.171
1.324MetGly: 1.324 ± 0.181
0.644MetHis: 0.644 ± 0.118
1.747MetIle: 1.747 ± 0.138
1.379MetLys: 1.379 ± 0.179
3.807MetLeu: 3.807 ± 0.336
0.864MetMet: 0.864 ± 0.116
1.14MetAsn: 1.14 ± 0.135
1.085MetPro: 1.085 ± 0.145
1.067MetGln: 1.067 ± 0.139
1.232MetArg: 1.232 ± 0.148
1.324MetSer: 1.324 ± 0.132
0.975MetThr: 0.975 ± 0.142
1.655MetVal: 1.655 ± 0.164
0.276MetTrp: 0.276 ± 0.073
1.287MetTyr: 1.287 ± 0.131
0.0MetXaa: 0.0 ± 0.0
Asn
2.887AsnAla: 2.887 ± 0.236
1.14AsnCys: 1.14 ± 0.179
2.722AsnAsp: 2.722 ± 0.209
2.961AsnGlu: 2.961 ± 0.198
2.906AsnPhe: 2.906 ± 0.249
2.133AsnGly: 2.133 ± 0.274
1.968AsnHis: 1.968 ± 0.223
6.841AsnIle: 6.841 ± 0.398
3.844AsnLys: 3.844 ± 0.271
5.683AsnLeu: 5.683 ± 0.341
1.692AsnMet: 1.692 ± 0.193
4.211AsnAsn: 4.211 ± 0.384
2.759AsnPro: 2.759 ± 0.195
2.152AsnGln: 2.152 ± 0.177
2.372AsnArg: 2.372 ± 0.209
2.814AsnSer: 2.814 ± 0.233
3.789AsnThr: 3.789 ± 0.297
3.402AsnVal: 3.402 ± 0.284
0.57AsnTrp: 0.57 ± 0.097
3.366AsnTyr: 3.366 ± 0.27
0.0AsnXaa: 0.0 ± 0.0
Pro
1.949ProAla: 1.949 ± 0.241
0.754ProCys: 0.754 ± 0.236
1.931ProAsp: 1.931 ± 0.206
3.531ProGlu: 3.531 ± 0.25
1.784ProPhe: 1.784 ± 0.171
1.986ProGly: 1.986 ± 0.209
0.938ProHis: 0.938 ± 0.171
3.255ProIle: 3.255 ± 0.219
2.759ProLys: 2.759 ± 0.298
4.138ProLeu: 4.138 ± 0.325
0.791ProMet: 0.791 ± 0.108
2.299ProAsn: 2.299 ± 0.201
2.575ProPro: 2.575 ± 0.421
1.618ProGln: 1.618 ± 0.205
1.582ProArg: 1.582 ± 0.195
3.31ProSer: 3.31 ± 0.236
2.446ProThr: 2.446 ± 0.205
2.299ProVal: 2.299 ± 0.229
0.368ProTrp: 0.368 ± 0.083
1.747ProTyr: 1.747 ± 0.187
0.0ProXaa: 0.0 ± 0.0
Gln
2.189GlnAla: 2.189 ± 0.187
0.699GlnCys: 0.699 ± 0.107
2.041GlnAsp: 2.041 ± 0.187
2.924GlnGlu: 2.924 ± 0.221
1.306GlnPhe: 1.306 ± 0.159
1.747GlnGly: 1.747 ± 0.204
1.692GlnHis: 1.692 ± 0.17
2.943GlnIle: 2.943 ± 0.218
3.678GlnLys: 3.678 ± 0.24
3.862GlnLeu: 3.862 ± 0.227
0.956GlnMet: 0.956 ± 0.126
2.483GlnAsn: 2.483 ± 0.245
1.766GlnPro: 1.766 ± 0.211
2.428GlnGln: 2.428 ± 0.21
2.078GlnArg: 2.078 ± 0.174
2.225GlnSer: 2.225 ± 0.201
2.354GlnThr: 2.354 ± 0.215
2.041GlnVal: 2.041 ± 0.187
0.515GlnTrp: 0.515 ± 0.094
1.949GlnTyr: 1.949 ± 0.174
0.0GlnXaa: 0.0 ± 0.0
Arg
2.17ArgAla: 2.17 ± 0.227
0.754ArgCys: 0.754 ± 0.127
1.747ArgAsp: 1.747 ± 0.166
2.814ArgGlu: 2.814 ± 0.214
2.354ArgPhe: 2.354 ± 0.219
1.747ArgGly: 1.747 ± 0.181
1.361ArgHis: 1.361 ± 0.146
3.274ArgIle: 3.274 ± 0.242
3.016ArgLys: 3.016 ± 0.307
4.377ArgLeu: 4.377 ± 0.308
1.195ArgMet: 1.195 ± 0.144
2.06ArgAsn: 2.06 ± 0.208
1.913ArgPro: 1.913 ± 0.235
1.729ArgGln: 1.729 ± 0.187
1.71ArgArg: 1.71 ± 0.198
2.005ArgSer: 2.005 ± 0.185
1.821ArgThr: 1.821 ± 0.165
2.133ArgVal: 2.133 ± 0.236
0.423ArgTrp: 0.423 ± 0.078
1.692ArgTyr: 1.692 ± 0.184
0.0ArgXaa: 0.0 ± 0.0
Ser
2.354SerAla: 2.354 ± 0.233
1.269SerCys: 1.269 ± 0.156
2.041SerAsp: 2.041 ± 0.195
3.513SerGlu: 3.513 ± 0.263
2.979SerPhe: 2.979 ± 0.283
2.814SerGly: 2.814 ± 0.217
1.398SerHis: 1.398 ± 0.171
5.72SerIle: 5.72 ± 0.33
4.267SerLys: 4.267 ± 0.304
6.86SerLeu: 6.86 ± 0.379
1.821SerMet: 1.821 ± 0.165
3.2SerAsn: 3.2 ± 0.284
2.961SerPro: 2.961 ± 0.273
2.299SerGln: 2.299 ± 0.2
2.207SerArg: 2.207 ± 0.196
4.432SerSer: 4.432 ± 0.277
3.825SerThr: 3.825 ± 0.33
2.998SerVal: 2.998 ± 0.273
0.589SerTrp: 0.589 ± 0.109
2.795SerTyr: 2.795 ± 0.198
0.0SerXaa: 0.0 ± 0.0
Thr
2.906ThrAla: 2.906 ± 0.273
1.343ThrCys: 1.343 ± 0.36
2.759ThrAsp: 2.759 ± 0.224
3.218ThrGlu: 3.218 ± 0.214
2.943ThrPhe: 2.943 ± 0.194
2.409ThrGly: 2.409 ± 0.244
1.49ThrHis: 1.49 ± 0.191
4.579ThrIle: 4.579 ± 0.266
3.623ThrLys: 3.623 ± 0.244
6.032ThrLeu: 6.032 ± 0.32
1.343ThrMet: 1.343 ± 0.142
3.182ThrAsn: 3.182 ± 0.262
2.685ThrPro: 2.685 ± 0.232
2.464ThrGln: 2.464 ± 0.218
2.262ThrArg: 2.262 ± 0.211
3.384ThrSer: 3.384 ± 0.273
3.071ThrThr: 3.071 ± 0.277
2.575ThrVal: 2.575 ± 0.192
0.644ThrTrp: 0.644 ± 0.102
2.446ThrTyr: 2.446 ± 0.222
0.0ThrXaa: 0.0 ± 0.0
Val
2.795ValAla: 2.795 ± 0.223
0.864ValCys: 0.864 ± 0.114
2.924ValAsp: 2.924 ± 0.3
2.887ValGlu: 2.887 ± 0.241
2.759ValPhe: 2.759 ± 0.23
2.005ValGly: 2.005 ± 0.202
1.232ValHis: 1.232 ± 0.121
3.752ValIle: 3.752 ± 0.254
4.395ValLys: 4.395 ± 0.32
5.775ValLeu: 5.775 ± 0.326
1.085ValMet: 1.085 ± 0.161
2.575ValAsn: 2.575 ± 0.212
1.894ValPro: 1.894 ± 0.165
2.464ValGln: 2.464 ± 0.219
2.336ValArg: 2.336 ± 0.22
3.016ValSer: 3.016 ± 0.22
2.336ValThr: 2.336 ± 0.2
3.163ValVal: 3.163 ± 0.223
0.46ValTrp: 0.46 ± 0.091
1.968ValTyr: 1.968 ± 0.185
0.0ValXaa: 0.0 ± 0.0
Trp
0.644TrpAla: 0.644 ± 0.111
0.349TrpCys: 0.349 ± 0.089
0.533TrpAsp: 0.533 ± 0.093
1.122TrpGlu: 1.122 ± 0.154
0.405TrpPhe: 0.405 ± 0.078
0.625TrpGly: 0.625 ± 0.123
0.405TrpHis: 0.405 ± 0.092
0.809TrpIle: 0.809 ± 0.134
0.791TrpLys: 0.791 ± 0.116
0.828TrpLeu: 0.828 ± 0.112
0.276TrpMet: 0.276 ± 0.074
0.46TrpAsn: 0.46 ± 0.092
0.368TrpPro: 0.368 ± 0.083
0.313TrpGln: 0.313 ± 0.083
0.533TrpArg: 0.533 ± 0.092
0.478TrpSer: 0.478 ± 0.109
0.478TrpThr: 0.478 ± 0.099
0.589TrpVal: 0.589 ± 0.105
0.405TrpTrp: 0.405 ± 0.081
0.552TrpTyr: 0.552 ± 0.101
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.52TyrAla: 2.52 ± 0.218
1.269TyrCys: 1.269 ± 0.207
2.078TyrAsp: 2.078 ± 0.23
2.795TyrGlu: 2.795 ± 0.214
2.28TyrPhe: 2.28 ± 0.211
2.52TyrGly: 2.52 ± 0.213
1.398TyrHis: 1.398 ± 0.152
3.255TyrIle: 3.255 ± 0.215
2.924TyrLys: 2.924 ± 0.21
3.715TyrLeu: 3.715 ± 0.29
1.177TyrMet: 1.177 ± 0.155
3.549TyrAsn: 3.549 ± 0.253
1.913TyrPro: 1.913 ± 0.177
1.931TyrGln: 1.931 ± 0.201
1.526TyrArg: 1.526 ± 0.157
3.457TyrSer: 3.457 ± 0.262
2.869TyrThr: 2.869 ± 0.213
2.391TyrVal: 2.391 ± 0.194
0.993TyrTrp: 0.993 ± 0.116
2.832TyrTyr: 2.832 ± 0.308
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 163 proteins (54376 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski