Amino acid dipepetide frequency for Cetacean poxvirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.411AlaAla: 1.411 ± 0.31
0.461AlaCys: 0.461 ± 0.104
1.384AlaAsp: 1.384 ± 0.211
1.248AlaGlu: 1.248 ± 0.178
1.248AlaPhe: 1.248 ± 0.153
1.031AlaGly: 1.031 ± 0.203
0.57AlaHis: 0.57 ± 0.118
3.663AlaIle: 3.663 ± 0.292
2.198AlaLys: 2.198 ± 0.224
3.094AlaLeu: 3.094 ± 0.296
0.733AlaMet: 0.733 ± 0.132
2.062AlaAsn: 2.062 ± 0.269
0.651AlaPro: 0.651 ± 0.134
0.597AlaGln: 0.597 ± 0.136
0.95AlaArg: 0.95 ± 0.168
2.497AlaSer: 2.497 ± 0.279
1.682AlaThr: 1.682 ± 0.205
1.872AlaVal: 1.872 ± 0.278
0.081AlaTrp: 0.081 ± 0.047
1.655AlaTyr: 1.655 ± 0.239
0.0AlaXaa: 0.0 ± 0.0
Cys
0.516CysAla: 0.516 ± 0.111
0.814CysCys: 0.814 ± 0.202
1.33CysAsp: 1.33 ± 0.19
1.194CysGlu: 1.194 ± 0.158
0.895CysPhe: 0.895 ± 0.14
0.895CysGly: 0.895 ± 0.152
0.353CysHis: 0.353 ± 0.095
2.904CysIle: 2.904 ± 0.373
2.089CysLys: 2.089 ± 0.241
1.845CysLeu: 1.845 ± 0.222
0.733CysMet: 0.733 ± 0.141
2.551CysAsn: 2.551 ± 0.276
0.57CysPro: 0.57 ± 0.142
0.543CysGln: 0.543 ± 0.137
0.733CysArg: 0.733 ± 0.122
2.008CysSer: 2.008 ± 0.233
1.085CysThr: 1.085 ± 0.18
1.303CysVal: 1.303 ± 0.178
0.244CysTrp: 0.244 ± 0.088
1.601CysTyr: 1.601 ± 0.208
0.0CysXaa: 0.0 ± 0.0
Asp
1.981AspAla: 1.981 ± 0.257
0.977AspCys: 0.977 ± 0.165
4.857AspAsp: 4.857 ± 0.658
3.908AspGlu: 3.908 ± 0.294
2.605AspPhe: 2.605 ± 0.265
2.171AspGly: 2.171 ± 0.32
0.814AspHis: 0.814 ± 0.153
7.652AspIle: 7.652 ± 0.439
3.908AspLys: 3.908 ± 0.355
4.993AspLeu: 4.993 ± 0.384
1.492AspMet: 1.492 ± 0.207
5.156AspAsn: 5.156 ± 0.424
1.303AspPro: 1.303 ± 0.19
1.058AspGln: 1.058 ± 0.165
1.167AspArg: 1.167 ± 0.201
4.125AspSer: 4.125 ± 0.306
3.175AspThr: 3.175 ± 0.347
4.477AspVal: 4.477 ± 0.352
0.461AspTrp: 0.461 ± 0.115
2.659AspTyr: 2.659 ± 0.267
0.0AspXaa: 0.0 ± 0.0
Glu
1.547GluAla: 1.547 ± 0.197
1.682GluCys: 1.682 ± 0.228
3.283GluAsp: 3.283 ± 0.284
3.745GluGlu: 3.745 ± 0.395
2.551GluPhe: 2.551 ± 0.264
1.357GluGly: 1.357 ± 0.195
1.167GluHis: 1.167 ± 0.206
4.233GluIle: 4.233 ± 0.438
3.88GluLys: 3.88 ± 0.278
6.377GluLeu: 6.377 ± 0.492
0.977GluMet: 0.977 ± 0.174
3.962GluAsn: 3.962 ± 0.364
1.465GluPro: 1.465 ± 0.204
1.845GluGln: 1.845 ± 0.243
2.361GluArg: 2.361 ± 0.288
3.799GluSer: 3.799 ± 0.337
3.039GluThr: 3.039 ± 0.326
2.632GluVal: 2.632 ± 0.229
0.407GluTrp: 0.407 ± 0.092
3.663GluTyr: 3.663 ± 0.307
0.0GluXaa: 0.0 ± 0.0
Phe
1.384PheAla: 1.384 ± 0.174
1.194PheCys: 1.194 ± 0.179
2.741PheAsp: 2.741 ± 0.314
2.225PheGlu: 2.225 ± 0.244
2.225PhePhe: 2.225 ± 0.25
1.764PheGly: 1.764 ± 0.195
1.058PheHis: 1.058 ± 0.193
5.671PheIle: 5.671 ± 0.315
3.311PheLys: 3.311 ± 0.291
4.288PheLeu: 4.288 ± 0.368
1.194PheMet: 1.194 ± 0.197
4.206PheAsn: 4.206 ± 0.382
1.465PhePro: 1.465 ± 0.179
1.248PheGln: 1.248 ± 0.178
1.601PheArg: 1.601 ± 0.203
4.233PheSer: 4.233 ± 0.322
2.876PheThr: 2.876 ± 0.283
2.876PheVal: 2.876 ± 0.271
0.38PheTrp: 0.38 ± 0.112
2.334PheTyr: 2.334 ± 0.223
0.0PheXaa: 0.0 ± 0.0
Gly
1.167GlyAla: 1.167 ± 0.208
0.76GlyCys: 0.76 ± 0.15
1.927GlyAsp: 1.927 ± 0.198
1.981GlyGlu: 1.981 ± 0.236
1.682GlyPhe: 1.682 ± 0.232
1.492GlyGly: 1.492 ± 0.255
0.57GlyHis: 0.57 ± 0.099
3.582GlyIle: 3.582 ± 0.316
2.686GlyLys: 2.686 ± 0.294
2.958GlyLeu: 2.958 ± 0.255
0.624GlyMet: 0.624 ± 0.122
2.089GlyAsn: 2.089 ± 0.258
0.516GlyPro: 0.516 ± 0.126
0.543GlyGln: 0.543 ± 0.116
1.682GlyArg: 1.682 ± 0.225
2.578GlySer: 2.578 ± 0.29
1.764GlyThr: 1.764 ± 0.225
1.9GlyVal: 1.9 ± 0.23
0.217GlyTrp: 0.217 ± 0.077
2.035GlyTyr: 2.035 ± 0.235
0.0GlyXaa: 0.0 ± 0.0
His
0.733HisAla: 0.733 ± 0.122
0.434HisCys: 0.434 ± 0.125
0.868HisAsp: 0.868 ± 0.139
0.95HisGlu: 0.95 ± 0.143
1.085HisPhe: 1.085 ± 0.171
0.706HisGly: 0.706 ± 0.13
0.516HisHis: 0.516 ± 0.139
2.334HisIle: 2.334 ± 0.236
1.33HisLys: 1.33 ± 0.214
1.845HisLeu: 1.845 ± 0.218
0.733HisMet: 0.733 ± 0.126
1.764HisAsn: 1.764 ± 0.234
0.516HisPro: 0.516 ± 0.124
0.597HisGln: 0.597 ± 0.111
0.706HisArg: 0.706 ± 0.127
1.275HisSer: 1.275 ± 0.152
1.085HisThr: 1.085 ± 0.186
1.547HisVal: 1.547 ± 0.176
0.217HisTrp: 0.217 ± 0.072
0.95HisTyr: 0.95 ± 0.157
0.0HisXaa: 0.0 ± 0.0
Ile
3.012IleAla: 3.012 ± 0.346
2.415IleCys: 2.415 ± 0.261
6.838IleAsp: 6.838 ± 0.482
5.97IleGlu: 5.97 ± 0.513
5.02IlePhe: 5.02 ± 0.416
2.931IleGly: 2.931 ± 0.292
2.252IleHis: 2.252 ± 0.272
11.859IleIle: 11.859 ± 0.55
9.661IleLys: 9.661 ± 0.607
9.389IleLeu: 9.389 ± 0.558
2.578IleMet: 2.578 ± 0.251
9.796IleAsn: 9.796 ± 0.398
3.392IlePro: 3.392 ± 0.346
2.524IleGln: 2.524 ± 0.229
3.799IleArg: 3.799 ± 0.353
9.091IleSer: 9.091 ± 0.449
6.486IleThr: 6.486 ± 0.418
6.024IleVal: 6.024 ± 0.368
0.488IleTrp: 0.488 ± 0.098
4.749IleTyr: 4.749 ± 0.415
0.0IleXaa: 0.0 ± 0.0
Lys
2.117LysAla: 2.117 ± 0.26
1.927LysCys: 1.927 ± 0.218
4.885LysAsp: 4.885 ± 0.423
4.939LysGlu: 4.939 ± 0.348
3.772LysPhe: 3.772 ± 0.359
2.442LysGly: 2.442 ± 0.228
2.388LysHis: 2.388 ± 0.292
8.141LysIle: 8.141 ± 0.488
6.974LysLys: 6.974 ± 0.576
7.87LysLeu: 7.87 ± 0.503
1.737LysMet: 1.737 ± 0.204
5.861LysAsn: 5.861 ± 0.408
2.198LysPro: 2.198 ± 0.195
2.876LysGln: 2.876 ± 0.358
2.849LysArg: 2.849 ± 0.263
5.889LysSer: 5.889 ± 0.381
4.288LysThr: 4.288 ± 0.408
4.179LysVal: 4.179 ± 0.302
0.461LysTrp: 0.461 ± 0.122
5.916LysTyr: 5.916 ± 0.399
0.0LysXaa: 0.0 ± 0.0
Leu
2.307LeuAla: 2.307 ± 0.293
2.361LeuCys: 2.361 ± 0.278
5.264LeuAsp: 5.264 ± 0.367
5.047LeuGlu: 5.047 ± 0.466
5.454LeuPhe: 5.454 ± 0.445
2.714LeuGly: 2.714 ± 0.304
2.198LeuHis: 2.198 ± 0.257
8.711LeuIle: 8.711 ± 0.479
7.435LeuLys: 7.435 ± 0.522
10.366LeuLeu: 10.366 ± 0.596
2.089LeuMet: 2.089 ± 0.256
5.889LeuAsn: 5.889 ± 0.332
2.958LeuPro: 2.958 ± 0.295
2.334LeuGln: 2.334 ± 0.275
3.718LeuArg: 3.718 ± 0.304
9.253LeuSer: 9.253 ± 0.382
5.644LeuThr: 5.644 ± 0.387
4.667LeuVal: 4.667 ± 0.335
0.244LeuTrp: 0.244 ± 0.073
5.047LeuTyr: 5.047 ± 0.396
0.0LeuXaa: 0.0 ± 0.0
Met
0.814MetAla: 0.814 ± 0.212
0.57MetCys: 0.57 ± 0.115
1.9MetAsp: 1.9 ± 0.209
0.977MetGlu: 0.977 ± 0.143
1.52MetPhe: 1.52 ± 0.253
0.977MetGly: 0.977 ± 0.162
0.57MetHis: 0.57 ± 0.157
1.981MetIle: 1.981 ± 0.274
1.547MetLys: 1.547 ± 0.222
2.307MetLeu: 2.307 ± 0.244
0.434MetMet: 0.434 ± 0.105
2.008MetAsn: 2.008 ± 0.257
0.706MetPro: 0.706 ± 0.127
0.57MetGln: 0.57 ± 0.118
1.031MetArg: 1.031 ± 0.152
2.117MetSer: 2.117 ± 0.225
1.303MetThr: 1.303 ± 0.176
0.95MetVal: 0.95 ± 0.17
0.217MetTrp: 0.217 ± 0.089
1.601MetTyr: 1.601 ± 0.216
0.0MetXaa: 0.0 ± 0.0
Asn
2.415AsnAla: 2.415 ± 0.229
1.547AsnCys: 1.547 ± 0.212
5.264AsnAsp: 5.264 ± 0.414
4.396AsnGlu: 4.396 ± 0.36
2.958AsnPhe: 2.958 ± 0.247
2.497AsnGly: 2.497 ± 0.297
1.194AsnHis: 1.194 ± 0.195
10.746AsnIle: 10.746 ± 0.528
7.897AsnLys: 7.897 ± 0.483
5.319AsnLeu: 5.319 ± 0.345
2.252AsnMet: 2.252 ± 0.267
9.362AsnAsn: 9.362 ± 0.797
1.9AsnPro: 1.9 ± 0.229
1.547AsnGln: 1.547 ± 0.191
2.198AsnArg: 2.198 ± 0.272
4.722AsnSer: 4.722 ± 0.354
4.939AsnThr: 4.939 ± 0.503
5.427AsnVal: 5.427 ± 0.336
0.434AsnTrp: 0.434 ± 0.096
4.288AsnTyr: 4.288 ± 0.351
0.0AsnXaa: 0.0 ± 0.0
Pro
0.434ProAla: 0.434 ± 0.107
0.841ProCys: 0.841 ± 0.156
1.547ProAsp: 1.547 ± 0.246
1.737ProGlu: 1.737 ± 0.225
1.465ProPhe: 1.465 ± 0.18
1.14ProGly: 1.14 ± 0.18
0.57ProHis: 0.57 ± 0.131
3.148ProIle: 3.148 ± 0.276
2.279ProLys: 2.279 ± 0.281
2.551ProLeu: 2.551 ± 0.232
0.488ProMet: 0.488 ± 0.119
2.089ProAsn: 2.089 ± 0.272
1.167ProPro: 1.167 ± 0.18
0.543ProGln: 0.543 ± 0.127
0.841ProArg: 0.841 ± 0.132
2.171ProSer: 2.171 ± 0.256
1.411ProThr: 1.411 ± 0.192
1.845ProVal: 1.845 ± 0.21
0.163ProTrp: 0.163 ± 0.076
1.384ProTyr: 1.384 ± 0.181
0.0ProXaa: 0.0 ± 0.0
Gln
0.57GlnAla: 0.57 ± 0.137
0.461GlnCys: 0.461 ± 0.107
0.923GlnAsp: 0.923 ± 0.137
1.764GlnGlu: 1.764 ± 0.217
0.977GlnPhe: 0.977 ± 0.169
0.516GlnGly: 0.516 ± 0.14
0.516GlnHis: 0.516 ± 0.118
1.845GlnIle: 1.845 ± 0.223
1.9GlnLys: 1.9 ± 0.209
2.551GlnLeu: 2.551 ± 0.222
0.678GlnMet: 0.678 ± 0.142
1.71GlnAsn: 1.71 ± 0.211
0.706GlnPro: 0.706 ± 0.123
1.33GlnGln: 1.33 ± 0.192
1.004GlnArg: 1.004 ± 0.185
1.737GlnSer: 1.737 ± 0.203
1.438GlnThr: 1.438 ± 0.226
1.303GlnVal: 1.303 ± 0.213
0.326GlnTrp: 0.326 ± 0.094
1.601GlnTyr: 1.601 ± 0.213
0.0GlnXaa: 0.0 ± 0.0
Arg
0.76ArgAla: 0.76 ± 0.126
1.031ArgCys: 1.031 ± 0.195
1.737ArgAsp: 1.737 ± 0.243
1.411ArgGlu: 1.411 ± 0.23
1.601ArgPhe: 1.601 ± 0.214
1.167ArgGly: 1.167 ± 0.17
1.004ArgHis: 1.004 ± 0.163
3.745ArgIle: 3.745 ± 0.256
3.256ArgLys: 3.256 ± 0.324
3.365ArgLeu: 3.365 ± 0.284
0.868ArgMet: 0.868 ± 0.133
2.171ArgAsn: 2.171 ± 0.216
0.895ArgPro: 0.895 ± 0.225
0.814ArgGln: 0.814 ± 0.144
1.601ArgArg: 1.601 ± 0.193
2.497ArgSer: 2.497 ± 0.312
1.655ArgThr: 1.655 ± 0.215
2.849ArgVal: 2.849 ± 0.261
0.054ArgTrp: 0.054 ± 0.033
2.198ArgTyr: 2.198 ± 0.223
0.0ArgXaa: 0.0 ± 0.0
Ser
2.469SerAla: 2.469 ± 0.287
1.845SerCys: 1.845 ± 0.216
3.989SerAsp: 3.989 ± 0.388
3.419SerGlu: 3.419 ± 0.322
3.88SerPhe: 3.88 ± 0.292
2.442SerGly: 2.442 ± 0.245
1.357SerHis: 1.357 ± 0.186
9.579SerIle: 9.579 ± 0.463
7.544SerLys: 7.544 ± 0.457
7.3SerLeu: 7.3 ± 0.378
1.954SerMet: 1.954 ± 0.226
6.54SerAsn: 6.54 ± 0.491
2.008SerPro: 2.008 ± 0.286
1.628SerGln: 1.628 ± 0.239
2.741SerArg: 2.741 ± 0.294
7.137SerSer: 7.137 ± 0.58
5.102SerThr: 5.102 ± 0.369
4.07SerVal: 4.07 ± 0.298
0.407SerTrp: 0.407 ± 0.113
4.152SerTyr: 4.152 ± 0.347
0.0SerXaa: 0.0 ± 0.0
Thr
1.791ThrAla: 1.791 ± 0.221
1.303ThrCys: 1.303 ± 0.19
3.609ThrAsp: 3.609 ± 0.358
2.931ThrGlu: 2.931 ± 0.314
2.876ThrPhe: 2.876 ± 0.254
2.008ThrGly: 2.008 ± 0.252
0.977ThrHis: 0.977 ± 0.155
5.943ThrIle: 5.943 ± 0.399
4.369ThrLys: 4.369 ± 0.314
5.78ThrLeu: 5.78 ± 0.372
1.275ThrMet: 1.275 ± 0.178
4.342ThrAsn: 4.342 ± 0.359
1.954ThrPro: 1.954 ± 0.232
1.14ThrGln: 1.14 ± 0.17
1.628ThrArg: 1.628 ± 0.207
5.346ThrSer: 5.346 ± 0.358
4.125ThrThr: 4.125 ± 0.392
3.338ThrVal: 3.338 ± 0.261
0.326ThrTrp: 0.326 ± 0.09
2.795ThrTyr: 2.795 ± 0.284
0.0ThrXaa: 0.0 ± 0.0
Val
1.872ValAla: 1.872 ± 0.186
1.872ValCys: 1.872 ± 0.245
2.931ValAsp: 2.931 ± 0.286
2.822ValGlu: 2.822 ± 0.293
3.256ValPhe: 3.256 ± 0.397
1.764ValGly: 1.764 ± 0.256
0.977ValHis: 0.977 ± 0.177
5.753ValIle: 5.753 ± 0.447
5.102ValLys: 5.102 ± 0.413
5.59ValLeu: 5.59 ± 0.396
1.357ValMet: 1.357 ± 0.192
4.857ValAsn: 4.857 ± 0.34
1.71ValPro: 1.71 ± 0.177
0.868ValGln: 0.868 ± 0.122
2.361ValArg: 2.361 ± 0.212
4.885ValSer: 4.885 ± 0.404
3.718ValThr: 3.718 ± 0.386
3.338ValVal: 3.338 ± 0.358
0.271ValTrp: 0.271 ± 0.084
3.555ValTyr: 3.555 ± 0.388
0.0ValXaa: 0.0 ± 0.0
Trp
0.136TrpAla: 0.136 ± 0.061
0.19TrpCys: 0.19 ± 0.075
0.298TrpAsp: 0.298 ± 0.083
0.38TrpGlu: 0.38 ± 0.098
0.461TrpPhe: 0.461 ± 0.105
0.19TrpGly: 0.19 ± 0.081
0.054TrpHis: 0.054 ± 0.038
0.57TrpIle: 0.57 ± 0.127
0.407TrpLys: 0.407 ± 0.12
0.597TrpLeu: 0.597 ± 0.148
0.244TrpMet: 0.244 ± 0.093
0.298TrpAsn: 0.298 ± 0.098
0.136TrpPro: 0.136 ± 0.067
0.19TrpGln: 0.19 ± 0.082
0.19TrpArg: 0.19 ± 0.071
0.38TrpSer: 0.38 ± 0.117
0.38TrpThr: 0.38 ± 0.105
0.163TrpVal: 0.163 ± 0.057
0.0TrpTrp: 0.0 ± 0.0
0.298TrpTyr: 0.298 ± 0.084
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.655TyrAla: 1.655 ± 0.21
1.52TyrCys: 1.52 ± 0.205
3.365TyrAsp: 3.365 ± 0.323
2.931TyrGlu: 2.931 ± 0.273
2.741TyrPhe: 2.741 ± 0.269
2.551TyrGly: 2.551 ± 0.273
1.085TyrHis: 1.085 ± 0.151
6.214TyrIle: 6.214 ± 0.486
3.935TyrLys: 3.935 ± 0.262
5.319TyrLeu: 5.319 ± 0.4
1.628TyrMet: 1.628 ± 0.237
4.586TyrAsn: 4.586 ± 0.295
1.601TyrPro: 1.601 ± 0.211
1.058TyrGln: 1.058 ± 0.167
1.52TyrArg: 1.52 ± 0.216
3.826TyrSer: 3.826 ± 0.34
2.659TyrThr: 2.659 ± 0.242
4.098TyrVal: 4.098 ± 0.326
0.19TyrTrp: 0.19 ± 0.062
2.822TyrTyr: 2.822 ± 0.328
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 115 proteins (36852 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski