Amino acid dipepetide frequency for African swine fever virus (isolate Warthog/Namibia/Wart80/1980) (ASFV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.847AlaAla: 3.847 ± 0.346
1.344AlaCys: 1.344 ± 0.154
2.687AlaAsp: 2.687 ± 0.243
3.332AlaGlu: 3.332 ± 0.277
2.375AlaPhe: 2.375 ± 0.212
2.319AlaGly: 2.319 ± 0.239
1.436AlaHis: 1.436 ± 0.183
4.62AlaIle: 4.62 ± 0.291
2.945AlaLys: 2.945 ± 0.236
5.596AlaLeu: 5.596 ± 0.346
1.288AlaMet: 1.288 ± 0.141
2.614AlaAsn: 2.614 ± 0.209
1.693AlaPro: 1.693 ± 0.179
1.97AlaGln: 1.97 ± 0.218
2.411AlaArg: 2.411 ± 0.26
3.037AlaSer: 3.037 ± 0.282
2.264AlaThr: 2.264 ± 0.211
3.497AlaVal: 3.497 ± 0.262
0.423AlaTrp: 0.423 ± 0.092
1.785AlaTyr: 1.785 ± 0.184
0.0AlaXaa: 0.0 ± 0.0
Cys
1.031CysAla: 1.031 ± 0.188
0.552CysCys: 0.552 ± 0.098
0.626CysAsp: 0.626 ± 0.12
0.92CysGlu: 0.92 ± 0.131
1.307CysPhe: 1.307 ± 0.163
1.123CysGly: 1.123 ± 0.164
0.994CysHis: 0.994 ± 0.145
1.804CysIle: 1.804 ± 0.212
1.712CysLys: 1.712 ± 0.216
1.97CysLeu: 1.97 ± 0.19
0.699CysMet: 0.699 ± 0.111
0.755CysAsn: 0.755 ± 0.13
0.755CysPro: 0.755 ± 0.188
0.718CysGln: 0.718 ± 0.139
1.104CysArg: 1.104 ± 0.183
1.491CysSer: 1.491 ± 0.163
1.473CysThr: 1.473 ± 0.204
0.828CysVal: 0.828 ± 0.122
0.405CysTrp: 0.405 ± 0.093
0.828CysTyr: 0.828 ± 0.126
0.0CysXaa: 0.0 ± 0.0
Asp
2.724AspAla: 2.724 ± 0.21
0.976AspCys: 0.976 ± 0.158
2.227AspAsp: 2.227 ± 0.219
3.166AspGlu: 3.166 ± 0.278
2.393AspPhe: 2.393 ± 0.199
1.914AspGly: 1.914 ± 0.21
1.252AspHis: 1.252 ± 0.168
4.27AspIle: 4.27 ± 0.287
2.779AspLys: 2.779 ± 0.232
5.633AspLeu: 5.633 ± 0.349
1.454AspMet: 1.454 ± 0.167
2.356AspAsn: 2.356 ± 0.185
2.503AspPro: 2.503 ± 0.231
1.16AspGln: 1.16 ± 0.15
1.509AspArg: 1.509 ± 0.175
2.614AspSer: 2.614 ± 0.29
2.614AspThr: 2.614 ± 0.273
2.467AspVal: 2.467 ± 0.209
0.515AspTrp: 0.515 ± 0.083
2.098AspTyr: 2.098 ± 0.206
0.0AspXaa: 0.0 ± 0.0
Glu
3.461GluAla: 3.461 ± 0.243
1.215GluCys: 1.215 ± 0.157
3.368GluAsp: 3.368 ± 0.258
5.117GluGlu: 5.117 ± 0.539
2.761GluPhe: 2.761 ± 0.25
2.338GluGly: 2.338 ± 0.21
1.509GluHis: 1.509 ± 0.187
4.841GluIle: 4.841 ± 0.284
5.725GluLys: 5.725 ± 0.368
6.424GluLeu: 6.424 ± 0.386
1.804GluMet: 1.804 ± 0.198
4.068GluAsn: 4.068 ± 0.236
1.988GluPro: 1.988 ± 0.188
2.743GluGln: 2.743 ± 0.249
2.485GluArg: 2.485 ± 0.223
2.687GluSer: 2.687 ± 0.227
4.105GluThr: 4.105 ± 0.377
2.467GluVal: 2.467 ± 0.191
1.068GluTrp: 1.068 ± 0.15
2.872GluTyr: 2.872 ± 0.227
0.0GluXaa: 0.0 ± 0.0
Phe
1.712PheAla: 1.712 ± 0.179
1.307PheCys: 1.307 ± 0.163
2.098PheAsp: 2.098 ± 0.177
2.467PheGlu: 2.467 ± 0.2
2.282PhePhe: 2.282 ± 0.209
1.473PheGly: 1.473 ± 0.155
1.049PheHis: 1.049 ± 0.129
4.399PheIle: 4.399 ± 0.275
3.773PheLys: 3.773 ± 0.308
4.915PheLeu: 4.915 ± 0.31
1.454PheMet: 1.454 ± 0.183
3.35PheAsn: 3.35 ± 0.267
1.785PhePro: 1.785 ± 0.165
1.638PheGln: 1.638 ± 0.183
1.381PheArg: 1.381 ± 0.2
3.958PheSer: 3.958 ± 0.277
2.632PheThr: 2.632 ± 0.209
2.411PheVal: 2.411 ± 0.226
0.552PheTrp: 0.552 ± 0.102
2.779PheTyr: 2.779 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
2.927GlyAla: 2.927 ± 0.264
0.626GlyCys: 0.626 ± 0.123
1.878GlyAsp: 1.878 ± 0.23
2.19GlyGlu: 2.19 ± 0.2
1.878GlyPhe: 1.878 ± 0.184
3.0GlyGly: 3.0 ± 0.32
1.178GlyHis: 1.178 ± 0.144
3.737GlyIle: 3.737 ± 0.353
3.276GlyLys: 3.276 ± 0.263
4.896GlyLeu: 4.896 ± 0.301
0.976GlyMet: 0.976 ± 0.132
2.338GlyAsn: 2.338 ± 0.22
1.546GlyPro: 1.546 ± 0.161
1.381GlyGln: 1.381 ± 0.129
1.749GlyArg: 1.749 ± 0.178
2.908GlySer: 2.908 ± 0.21
1.988GlyThr: 1.988 ± 0.177
2.098GlyVal: 2.098 ± 0.253
0.331GlyTrp: 0.331 ± 0.078
2.154GlyTyr: 2.154 ± 0.196
0.0GlyXaa: 0.0 ± 0.0
His
1.344HisAla: 1.344 ± 0.148
0.663HisCys: 0.663 ± 0.123
1.381HisAsp: 1.381 ± 0.168
1.822HisGlu: 1.822 ± 0.21
1.546HisPhe: 1.546 ± 0.152
1.436HisGly: 1.436 ± 0.16
1.012HisHis: 1.012 ± 0.147
2.393HisIle: 2.393 ± 0.242
1.97HisLys: 1.97 ± 0.213
3.203HisLeu: 3.203 ± 0.264
0.607HisMet: 0.607 ± 0.107
1.546HisAsn: 1.546 ± 0.196
1.27HisPro: 1.27 ± 0.161
1.288HisGln: 1.288 ± 0.143
1.233HisArg: 1.233 ± 0.135
1.675HisSer: 1.675 ± 0.171
1.491HisThr: 1.491 ± 0.14
1.657HisVal: 1.657 ± 0.165
0.258HisTrp: 0.258 ± 0.069
1.62HisTyr: 1.62 ± 0.168
0.0HisXaa: 0.0 ± 0.0
Ile
4.086IleAla: 4.086 ± 0.238
1.767IleCys: 1.767 ± 0.158
3.663IleAsp: 3.663 ± 0.246
4.62IleGlu: 4.62 ± 0.295
4.51IlePhe: 4.51 ± 0.295
2.89IleGly: 2.89 ± 0.251
2.816IleHis: 2.816 ± 0.246
6.995IleIle: 6.995 ± 0.391
6.7IleLys: 6.7 ± 0.378
9.259IleLeu: 9.259 ± 0.485
1.988IleMet: 1.988 ± 0.17
5.228IleAsn: 5.228 ± 0.372
3.589IlePro: 3.589 ± 0.298
4.031IleGln: 4.031 ± 0.343
3.589IleArg: 3.589 ± 0.36
5.356IleSer: 5.356 ± 0.307
4.197IleThr: 4.197 ± 0.266
3.902IleVal: 3.902 ± 0.304
0.552IleTrp: 0.552 ± 0.095
4.013IleTyr: 4.013 ± 0.287
0.0IleXaa: 0.0 ± 0.0
Lys
3.387LysAla: 3.387 ± 0.232
1.252LysCys: 1.252 ± 0.172
3.737LysAsp: 3.737 ± 0.207
5.485LysGlu: 5.485 ± 0.389
2.393LysPhe: 2.393 ± 0.192
2.687LysGly: 2.687 ± 0.249
2.945LysHis: 2.945 ± 0.259
5.817LysIle: 5.817 ± 0.271
7.768LysLys: 7.768 ± 0.458
6.056LysLeu: 6.056 ± 0.317
2.043LysMet: 2.043 ± 0.176
6.645LysAsn: 6.645 ± 0.375
2.706LysPro: 2.706 ± 0.36
3.442LysGln: 3.442 ± 0.248
2.669LysArg: 2.669 ± 0.196
3.092LysSer: 3.092 ± 0.242
4.915LysThr: 4.915 ± 0.304
3.092LysVal: 3.092 ± 0.205
0.571LysTrp: 0.571 ± 0.105
4.086LysTyr: 4.086 ± 0.337
0.0LysXaa: 0.0 ± 0.0
Leu
5.393LeuAla: 5.393 ± 0.336
2.356LeuCys: 2.356 ± 0.213
4.27LeuAsp: 4.27 ± 0.24
6.148LeuGlu: 6.148 ± 0.391
4.915LeuPhe: 4.915 ± 0.307
4.51LeuGly: 4.51 ± 0.3
2.872LeuHis: 2.872 ± 0.228
8.302LeuIle: 8.302 ± 0.447
7.749LeuLys: 7.749 ± 0.425
11.523LeuLeu: 11.523 ± 0.568
2.761LeuMet: 2.761 ± 0.211
6.258LeuAsn: 6.258 ± 0.28
3.994LeuPro: 3.994 ± 0.261
5.412LeuGln: 5.412 ± 0.304
4.013LeuArg: 4.013 ± 0.313
6.7LeuSer: 6.7 ± 0.408
5.725LeuThr: 5.725 ± 0.283
5.044LeuVal: 5.044 ± 0.289
1.196LeuTrp: 1.196 ± 0.166
4.547LeuTyr: 4.547 ± 0.3
0.0LeuXaa: 0.0 ± 0.0
Met
1.491MetAla: 1.491 ± 0.15
0.35MetCys: 0.35 ± 0.074
1.509MetAsp: 1.509 ± 0.197
1.841MetGlu: 1.841 ± 0.182
1.546MetPhe: 1.546 ± 0.151
1.233MetGly: 1.233 ± 0.167
0.626MetHis: 0.626 ± 0.108
1.712MetIle: 1.712 ± 0.152
1.344MetLys: 1.344 ± 0.177
3.571MetLeu: 3.571 ± 0.29
0.847MetMet: 0.847 ± 0.14
1.215MetAsn: 1.215 ± 0.157
1.16MetPro: 1.16 ± 0.166
1.049MetGln: 1.049 ± 0.138
1.325MetArg: 1.325 ± 0.155
1.344MetSer: 1.344 ± 0.155
0.976MetThr: 0.976 ± 0.134
1.601MetVal: 1.601 ± 0.162
0.295MetTrp: 0.295 ± 0.067
1.325MetTyr: 1.325 ± 0.12
0.0MetXaa: 0.0 ± 0.0
Asn
2.872AsnAla: 2.872 ± 0.268
1.215AsnCys: 1.215 ± 0.189
2.798AsnAsp: 2.798 ± 0.247
3.056AsnGlu: 3.056 ± 0.197
2.927AsnPhe: 2.927 ± 0.232
2.135AsnGly: 2.135 ± 0.204
1.859AsnHis: 1.859 ± 0.196
6.682AsnIle: 6.682 ± 0.427
3.884AsnLys: 3.884 ± 0.311
5.725AsnLeu: 5.725 ± 0.443
1.804AsnMet: 1.804 ± 0.205
4.418AsnAsn: 4.418 ± 0.342
2.964AsnPro: 2.964 ± 0.216
2.135AsnGln: 2.135 ± 0.201
2.375AsnArg: 2.375 ± 0.182
2.89AsnSer: 2.89 ± 0.24
3.645AsnThr: 3.645 ± 0.277
3.203AsnVal: 3.203 ± 0.291
0.552AsnTrp: 0.552 ± 0.094
3.276AsnTyr: 3.276 ± 0.292
0.0AsnXaa: 0.0 ± 0.0
Pro
1.804ProAla: 1.804 ± 0.236
0.699ProCys: 0.699 ± 0.193
1.97ProAsp: 1.97 ± 0.209
3.534ProGlu: 3.534 ± 0.251
1.841ProPhe: 1.841 ± 0.158
2.135ProGly: 2.135 ± 0.23
0.939ProHis: 0.939 ± 0.151
3.332ProIle: 3.332 ± 0.248
2.743ProLys: 2.743 ± 0.304
4.013ProLeu: 4.013 ± 0.315
0.81ProMet: 0.81 ± 0.148
2.246ProAsn: 2.246 ± 0.191
2.798ProPro: 2.798 ± 0.449
1.638ProGln: 1.638 ± 0.201
1.509ProArg: 1.509 ± 0.167
3.258ProSer: 3.258 ± 0.226
2.503ProThr: 2.503 ± 0.255
2.264ProVal: 2.264 ± 0.203
0.387ProTrp: 0.387 ± 0.096
1.785ProTyr: 1.785 ± 0.17
0.0ProXaa: 0.0 ± 0.0
Gln
2.282GlnAla: 2.282 ± 0.176
0.755GlnCys: 0.755 ± 0.1
2.246GlnAsp: 2.246 ± 0.218
2.853GlnGlu: 2.853 ± 0.23
1.362GlnPhe: 1.362 ± 0.155
1.804GlnGly: 1.804 ± 0.203
1.657GlnHis: 1.657 ± 0.165
3.184GlnIle: 3.184 ± 0.224
3.589GlnLys: 3.589 ± 0.237
3.921GlnLeu: 3.921 ± 0.251
0.939GlnMet: 0.939 ± 0.135
2.467GlnAsn: 2.467 ± 0.218
1.859GlnPro: 1.859 ± 0.218
2.338GlnGln: 2.338 ± 0.234
2.006GlnArg: 2.006 ± 0.172
2.172GlnSer: 2.172 ± 0.21
2.356GlnThr: 2.356 ± 0.217
1.822GlnVal: 1.822 ± 0.176
0.515GlnTrp: 0.515 ± 0.088
1.988GlnTyr: 1.988 ± 0.187
0.0GlnXaa: 0.0 ± 0.0
Arg
2.043ArgAla: 2.043 ± 0.188
0.773ArgCys: 0.773 ± 0.122
1.73ArgAsp: 1.73 ± 0.176
2.872ArgGlu: 2.872 ± 0.207
2.246ArgPhe: 2.246 ± 0.264
1.693ArgGly: 1.693 ± 0.202
1.325ArgHis: 1.325 ± 0.148
3.35ArgIle: 3.35 ± 0.25
3.148ArgLys: 3.148 ± 0.228
4.142ArgLeu: 4.142 ± 0.322
1.068ArgMet: 1.068 ± 0.137
2.098ArgAsn: 2.098 ± 0.201
1.822ArgPro: 1.822 ± 0.232
1.693ArgGln: 1.693 ± 0.179
1.693ArgArg: 1.693 ± 0.183
1.988ArgSer: 1.988 ± 0.226
1.896ArgThr: 1.896 ± 0.188
2.282ArgVal: 2.282 ± 0.235
0.534ArgTrp: 0.534 ± 0.104
1.657ArgTyr: 1.657 ± 0.195
0.0ArgXaa: 0.0 ± 0.0
Ser
2.393SerAla: 2.393 ± 0.21
1.307SerCys: 1.307 ± 0.171
2.043SerAsp: 2.043 ± 0.181
3.424SerGlu: 3.424 ± 0.259
2.853SerPhe: 2.853 ± 0.24
2.743SerGly: 2.743 ± 0.243
1.399SerHis: 1.399 ± 0.175
5.669SerIle: 5.669 ± 0.328
4.142SerLys: 4.142 ± 0.288
6.792SerLeu: 6.792 ± 0.37
1.804SerMet: 1.804 ± 0.192
3.056SerAsn: 3.056 ± 0.218
3.0SerPro: 3.0 ± 0.264
2.485SerGln: 2.485 ± 0.233
2.43SerArg: 2.43 ± 0.185
4.436SerSer: 4.436 ± 0.351
3.884SerThr: 3.884 ± 0.334
3.037SerVal: 3.037 ± 0.218
0.515SerTrp: 0.515 ± 0.1
2.816SerTyr: 2.816 ± 0.194
0.0SerXaa: 0.0 ± 0.0
Thr
3.0ThrAla: 3.0 ± 0.281
1.27ThrCys: 1.27 ± 0.217
2.816ThrAsp: 2.816 ± 0.207
3.405ThrGlu: 3.405 ± 0.215
2.816ThrPhe: 2.816 ± 0.212
2.43ThrGly: 2.43 ± 0.248
1.344ThrHis: 1.344 ± 0.146
4.712ThrIle: 4.712 ± 0.314
3.516ThrLys: 3.516 ± 0.215
5.761ThrLeu: 5.761 ± 0.293
1.436ThrMet: 1.436 ± 0.172
3.166ThrAsn: 3.166 ± 0.257
2.503ThrPro: 2.503 ± 0.222
2.338ThrGln: 2.338 ± 0.211
2.375ThrArg: 2.375 ± 0.231
3.479ThrSer: 3.479 ± 0.263
3.0ThrThr: 3.0 ± 0.252
2.485ThrVal: 2.485 ± 0.19
0.644ThrTrp: 0.644 ± 0.102
2.595ThrTyr: 2.595 ± 0.221
0.0ThrXaa: 0.0 ± 0.0
Val
2.632ValAla: 2.632 ± 0.241
0.976ValCys: 0.976 ± 0.115
2.89ValAsp: 2.89 ± 0.343
2.872ValGlu: 2.872 ± 0.205
2.724ValPhe: 2.724 ± 0.245
2.117ValGly: 2.117 ± 0.205
1.362ValHis: 1.362 ± 0.141
3.81ValIle: 3.81 ± 0.262
4.197ValLys: 4.197 ± 0.318
5.89ValLeu: 5.89 ± 0.303
1.104ValMet: 1.104 ± 0.15
2.411ValAsn: 2.411 ± 0.197
1.933ValPro: 1.933 ± 0.158
2.375ValGln: 2.375 ± 0.214
2.209ValArg: 2.209 ± 0.246
3.184ValSer: 3.184 ± 0.231
2.246ValThr: 2.246 ± 0.203
3.056ValVal: 3.056 ± 0.231
0.423ValTrp: 0.423 ± 0.089
1.914ValTyr: 1.914 ± 0.188
0.0ValXaa: 0.0 ± 0.0
Trp
0.589TrpAla: 0.589 ± 0.109
0.442TrpCys: 0.442 ± 0.091
0.497TrpAsp: 0.497 ± 0.092
1.16TrpGlu: 1.16 ± 0.148
0.423TrpPhe: 0.423 ± 0.082
0.571TrpGly: 0.571 ± 0.102
0.46TrpHis: 0.46 ± 0.092
0.81TrpIle: 0.81 ± 0.098
0.884TrpLys: 0.884 ± 0.121
0.884TrpLeu: 0.884 ± 0.123
0.258TrpMet: 0.258 ± 0.062
0.589TrpAsn: 0.589 ± 0.102
0.331TrpPro: 0.331 ± 0.079
0.313TrpGln: 0.313 ± 0.07
0.497TrpArg: 0.497 ± 0.096
0.497TrpSer: 0.497 ± 0.108
0.46TrpThr: 0.46 ± 0.109
0.626TrpVal: 0.626 ± 0.105
0.387TrpTrp: 0.387 ± 0.084
0.571TrpTyr: 0.571 ± 0.11
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.467TyrAla: 2.467 ± 0.209
1.252TyrCys: 1.252 ± 0.176
2.19TyrAsp: 2.19 ± 0.265
2.761TyrGlu: 2.761 ± 0.224
2.319TyrPhe: 2.319 ± 0.179
2.503TyrGly: 2.503 ± 0.218
1.381TyrHis: 1.381 ± 0.149
3.313TyrIle: 3.313 ± 0.209
2.945TyrLys: 2.945 ± 0.236
3.516TyrLeu: 3.516 ± 0.261
1.196TyrMet: 1.196 ± 0.147
3.608TyrAsn: 3.608 ± 0.291
2.006TyrPro: 2.006 ± 0.169
1.97TyrGln: 1.97 ± 0.222
1.491TyrArg: 1.491 ± 0.138
3.461TyrSer: 3.461 ± 0.25
2.724TyrThr: 2.724 ± 0.22
2.485TyrVal: 2.485 ± 0.187
1.123TyrTrp: 1.123 ± 0.14
2.89TyrTyr: 2.89 ± 0.229
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 159 proteins (54328 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski