Amino acid dipepetide frequency for Campylobacter virus CP21

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.407AlaAla: 2.407 ± 0.264
0.802AlaCys: 0.802 ± 0.134
2.7AlaAsp: 2.7 ± 0.246
2.622AlaGlu: 2.622 ± 0.222
1.683AlaPhe: 1.683 ± 0.181
2.113AlaGly: 2.113 ± 0.196
0.548AlaHis: 0.548 ± 0.114
3.64AlaIle: 3.64 ± 0.234
4.031AlaLys: 4.031 ± 0.321
3.855AlaLeu: 3.855 ± 0.302
1.018AlaMet: 1.018 ± 0.125
3.561AlaAsn: 3.561 ± 0.265
1.037AlaPro: 1.037 ± 0.182
1.233AlaGln: 1.233 ± 0.186
1.624AlaArg: 1.624 ± 0.168
3.385AlaSer: 3.385 ± 0.309
2.192AlaThr: 2.192 ± 0.236
2.485AlaVal: 2.485 ± 0.229
0.235AlaTrp: 0.235 ± 0.07
1.859AlaTyr: 1.859 ± 0.213
0.0AlaXaa: 0.0 ± 0.0
Cys
0.45CysAla: 0.45 ± 0.126
0.489CysCys: 0.489 ± 0.132
1.154CysAsp: 1.154 ± 0.198
1.35CysGlu: 1.35 ± 0.375
0.959CysPhe: 0.959 ± 0.165
0.783CysGly: 0.783 ± 0.154
0.235CysHis: 0.235 ± 0.075
1.565CysIle: 1.565 ± 0.291
1.409CysLys: 1.409 ± 0.285
1.291CysLeu: 1.291 ± 0.171
0.372CysMet: 0.372 ± 0.071
1.468CysAsn: 1.468 ± 0.286
0.352CysPro: 0.352 ± 0.112
0.45CysGln: 0.45 ± 0.105
0.528CysArg: 0.528 ± 0.129
1.076CysSer: 1.076 ± 0.196
1.018CysThr: 1.018 ± 0.242
1.037CysVal: 1.037 ± 0.2
0.098CysTrp: 0.098 ± 0.043
1.076CysTyr: 1.076 ± 0.189
0.0CysXaa: 0.0 ± 0.0
Asp
2.955AspAla: 2.955 ± 0.247
1.878AspCys: 1.878 ± 0.593
3.326AspAsp: 3.326 ± 0.329
3.072AspGlu: 3.072 ± 0.25
5.029AspPhe: 5.029 ± 0.34
2.759AspGly: 2.759 ± 0.258
0.841AspHis: 0.841 ± 0.12
6.653AspIle: 6.653 ± 0.355
3.874AspLys: 3.874 ± 0.264
8.199AspLeu: 8.199 ± 0.375
0.92AspMet: 0.92 ± 0.119
4.501AspAsn: 4.501 ± 0.317
1.898AspPro: 1.898 ± 0.193
1.057AspGln: 1.057 ± 0.142
1.781AspArg: 1.781 ± 0.185
6.868AspSer: 6.868 ± 0.483
4.168AspThr: 4.168 ± 0.298
2.935AspVal: 2.935 ± 0.237
0.509AspTrp: 0.509 ± 0.085
4.638AspTyr: 4.638 ± 0.367
0.0AspXaa: 0.0 ± 0.0
Glu
3.855GluAla: 3.855 ± 0.301
1.389GluCys: 1.389 ± 0.357
3.366GluAsp: 3.366 ± 0.287
3.15GluGlu: 3.15 ± 0.3
3.816GluPhe: 3.816 ± 0.237
2.446GluGly: 2.446 ± 0.264
0.822GluHis: 0.822 ± 0.15
6.379GluIle: 6.379 ± 0.35
4.187GluLys: 4.187 ± 0.357
6.673GluLeu: 6.673 ± 0.399
0.998GluMet: 0.998 ± 0.13
4.814GluAsn: 4.814 ± 0.319
2.329GluPro: 2.329 ± 0.242
1.389GluGln: 1.389 ± 0.187
1.683GluArg: 1.683 ± 0.176
5.577GluSer: 5.577 ± 0.355
3.972GluThr: 3.972 ± 0.318
4.148GluVal: 4.148 ± 0.37
0.372GluTrp: 0.372 ± 0.083
3.777GluTyr: 3.777 ± 0.301
0.0GluXaa: 0.0 ± 0.0
Phe
1.468PheAla: 1.468 ± 0.195
0.939PheCys: 0.939 ± 0.149
3.972PheAsp: 3.972 ± 0.271
3.953PheGlu: 3.953 ± 0.337
1.937PhePhe: 1.937 ± 0.185
2.348PheGly: 2.348 ± 0.214
0.528PheHis: 0.528 ± 0.094
4.383PheIle: 4.383 ± 0.313
6.946PheLys: 6.946 ± 0.377
4.559PheLeu: 4.559 ± 0.312
1.115PheMet: 1.115 ± 0.15
5.068PheAsn: 5.068 ± 0.291
0.92PhePro: 0.92 ± 0.152
1.389PheGln: 1.389 ± 0.176
1.605PheArg: 1.605 ± 0.148
3.268PheSer: 3.268 ± 0.264
2.602PheThr: 2.602 ± 0.228
2.544PheVal: 2.544 ± 0.21
0.235PheTrp: 0.235 ± 0.062
2.446PheTyr: 2.446 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
2.348GlyAla: 2.348 ± 0.239
0.763GlyCys: 0.763 ± 0.165
2.955GlyAsp: 2.955 ± 0.216
2.485GlyGlu: 2.485 ± 0.227
2.485GlyPhe: 2.485 ± 0.25
1.781GlyGly: 1.781 ± 0.207
0.509GlyHis: 0.509 ± 0.112
4.285GlyIle: 4.285 ± 0.267
3.033GlyLys: 3.033 ± 0.227
3.894GlyLeu: 3.894 ± 0.284
0.822GlyMet: 0.822 ± 0.122
3.287GlyAsn: 3.287 ± 0.253
0.646GlyPro: 0.646 ± 0.111
1.018GlyGln: 1.018 ± 0.148
1.037GlyArg: 1.037 ± 0.135
4.246GlySer: 4.246 ± 0.278
2.681GlyThr: 2.681 ± 0.239
3.072GlyVal: 3.072 ± 0.259
0.294GlyTrp: 0.294 ± 0.071
2.7GlyTyr: 2.7 ± 0.245
0.0GlyXaa: 0.0 ± 0.0
His
0.352HisAla: 0.352 ± 0.069
0.274HisCys: 0.274 ± 0.079
0.587HisAsp: 0.587 ± 0.111
0.411HisGlu: 0.411 ± 0.092
0.978HisPhe: 0.978 ± 0.159
0.391HisGly: 0.391 ± 0.094
0.254HisHis: 0.254 ± 0.084
1.213HisIle: 1.213 ± 0.132
1.683HisLys: 1.683 ± 0.177
1.272HisLeu: 1.272 ± 0.183
0.196HisMet: 0.196 ± 0.073
1.057HisAsn: 1.057 ± 0.158
0.587HisPro: 0.587 ± 0.117
0.333HisGln: 0.333 ± 0.075
0.47HisArg: 0.47 ± 0.099
1.037HisSer: 1.037 ± 0.152
0.763HisThr: 0.763 ± 0.132
0.333HisVal: 0.333 ± 0.085
0.059HisTrp: 0.059 ± 0.033
0.744HisTyr: 0.744 ± 0.1
0.0HisXaa: 0.0 ± 0.0
Ile
3.248IleAla: 3.248 ± 0.265
1.448IleCys: 1.448 ± 0.206
7.514IleAsp: 7.514 ± 0.395
6.183IleGlu: 6.183 ± 0.414
4.266IlePhe: 4.266 ± 0.357
3.444IleGly: 3.444 ± 0.263
1.331IleHis: 1.331 ± 0.149
6.653IleIle: 6.653 ± 0.419
10.156IleLys: 10.156 ± 0.475
7.925IleLeu: 7.925 ± 0.428
1.291IleMet: 1.291 ± 0.15
7.788IleAsn: 7.788 ± 0.405
2.348IlePro: 2.348 ± 0.256
3.248IleGln: 3.248 ± 0.256
2.563IleArg: 2.563 ± 0.209
5.303IleSer: 5.303 ± 0.391
4.461IleThr: 4.461 ± 0.371
4.129IleVal: 4.129 ± 0.26
0.587IleTrp: 0.587 ± 0.109
3.444IleTyr: 3.444 ± 0.273
0.0IleXaa: 0.0 ± 0.0
Lys
4.187LysAla: 4.187 ± 0.247
1.428LysCys: 1.428 ± 0.241
7.534LysAsp: 7.534 ± 0.421
6.907LysGlu: 6.907 ± 0.341
4.481LysPhe: 4.481 ± 0.309
3.777LysGly: 3.777 ± 0.301
1.428LysHis: 1.428 ± 0.164
8.551LysIle: 8.551 ± 0.427
7.847LysLys: 7.847 ± 0.488
8.492LysLeu: 8.492 ± 0.464
1.957LysMet: 1.957 ± 0.252
7.338LysAsn: 7.338 ± 0.375
3.092LysPro: 3.092 ± 0.258
2.955LysGln: 2.955 ± 0.291
2.681LysArg: 2.681 ± 0.296
7.083LysSer: 7.083 ± 0.448
5.479LysThr: 5.479 ± 0.364
4.461LysVal: 4.461 ± 0.268
0.9LysTrp: 0.9 ± 0.153
6.203LysTyr: 6.203 ± 0.376
0.0LysXaa: 0.0 ± 0.0
Leu
3.737LeuAla: 3.737 ± 0.221
0.881LeuCys: 0.881 ± 0.123
7.083LeuAsp: 7.083 ± 0.43
6.418LeuGlu: 6.418 ± 0.361
3.62LeuPhe: 3.62 ± 0.286
4.09LeuGly: 4.09 ± 0.335
1.018LeuHis: 1.018 ± 0.152
6.692LeuIle: 6.692 ± 0.33
10.625LeuLys: 10.625 ± 0.486
5.498LeuLeu: 5.498 ± 0.313
1.761LeuMet: 1.761 ± 0.195
8.218LeuAsn: 8.218 ± 0.426
2.485LeuPro: 2.485 ± 0.218
2.27LeuGln: 2.27 ± 0.208
2.994LeuArg: 2.994 ± 0.228
6.086LeuSer: 6.086 ± 0.377
4.794LeuThr: 4.794 ± 0.264
4.031LeuVal: 4.031 ± 0.307
0.665LeuTrp: 0.665 ± 0.118
4.324LeuTyr: 4.324 ± 0.301
0.0LeuXaa: 0.0 ± 0.0
Met
0.861MetAla: 0.861 ± 0.124
0.274MetCys: 0.274 ± 0.079
0.92MetAsp: 0.92 ± 0.142
0.939MetGlu: 0.939 ± 0.132
1.057MetPhe: 1.057 ± 0.148
0.704MetGly: 0.704 ± 0.116
0.235MetHis: 0.235 ± 0.063
1.565MetIle: 1.565 ± 0.162
1.761MetLys: 1.761 ± 0.2
1.918MetLeu: 1.918 ± 0.203
0.313MetMet: 0.313 ± 0.069
1.683MetAsn: 1.683 ± 0.167
0.587MetPro: 0.587 ± 0.102
0.685MetGln: 0.685 ± 0.126
0.763MetArg: 0.763 ± 0.134
1.291MetSer: 1.291 ± 0.188
1.076MetThr: 1.076 ± 0.139
0.939MetVal: 0.939 ± 0.132
0.098MetTrp: 0.098 ± 0.04
1.037MetTyr: 1.037 ± 0.14
0.0MetXaa: 0.0 ± 0.0
Asn
3.424AsnAla: 3.424 ± 0.247
1.389AsnCys: 1.389 ± 0.296
4.657AsnAsp: 4.657 ± 0.317
5.146AsnGlu: 5.146 ± 0.321
4.774AsnPhe: 4.774 ± 0.307
3.522AsnGly: 3.522 ± 0.256
1.272AsnHis: 1.272 ± 0.152
8.747AsnIle: 8.747 ± 0.494
7.651AsnLys: 7.651 ± 0.401
6.614AsnLeu: 6.614 ± 0.361
1.605AsnMet: 1.605 ± 0.158
6.086AsnAsn: 6.086 ± 0.411
3.229AsnPro: 3.229 ± 0.287
2.524AsnGln: 2.524 ± 0.26
2.211AsnArg: 2.211 ± 0.222
4.657AsnSer: 4.657 ± 0.363
4.735AsnThr: 4.735 ± 0.31
4.09AsnVal: 4.09 ± 0.309
0.391AsnTrp: 0.391 ± 0.074
4.461AsnTyr: 4.461 ± 0.309
0.0AsnXaa: 0.0 ± 0.0
Pro
1.037ProAla: 1.037 ± 0.136
0.274ProCys: 0.274 ± 0.069
2.172ProAsp: 2.172 ± 0.199
2.916ProGlu: 2.916 ± 0.283
1.213ProPhe: 1.213 ± 0.178
1.898ProGly: 1.898 ± 0.213
0.313ProHis: 0.313 ± 0.072
1.937ProIle: 1.937 ± 0.189
2.896ProLys: 2.896 ± 0.302
1.957ProLeu: 1.957 ± 0.192
0.45ProMet: 0.45 ± 0.102
1.859ProAsn: 1.859 ± 0.196
0.45ProPro: 0.45 ± 0.111
0.646ProGln: 0.646 ± 0.116
1.076ProArg: 1.076 ± 0.138
2.172ProSer: 2.172 ± 0.225
1.35ProThr: 1.35 ± 0.149
1.663ProVal: 1.663 ± 0.22
0.235ProTrp: 0.235 ± 0.065
1.272ProTyr: 1.272 ± 0.15
0.0ProXaa: 0.0 ± 0.0
Gln
2.152GlnAla: 2.152 ± 0.252
0.352GlnCys: 0.352 ± 0.094
2.035GlnAsp: 2.035 ± 0.233
2.329GlnGlu: 2.329 ± 0.228
1.291GlnPhe: 1.291 ± 0.157
1.702GlnGly: 1.702 ± 0.194
0.352GlnHis: 0.352 ± 0.092
2.348GlnIle: 2.348 ± 0.213
2.368GlnLys: 2.368 ± 0.233
2.426GlnLeu: 2.426 ± 0.254
0.43GlnMet: 0.43 ± 0.083
2.446GlnAsn: 2.446 ± 0.205
0.528GlnPro: 0.528 ± 0.101
0.45GlnGln: 0.45 ± 0.09
0.861GlnArg: 0.861 ± 0.13
1.565GlnSer: 1.565 ± 0.144
1.605GlnThr: 1.605 ± 0.203
1.644GlnVal: 1.644 ± 0.189
0.196GlnTrp: 0.196 ± 0.059
1.565GlnTyr: 1.565 ± 0.193
0.0GlnXaa: 0.0 ± 0.0
Arg
1.252ArgAla: 1.252 ± 0.144
0.724ArgCys: 0.724 ± 0.206
1.644ArgAsp: 1.644 ± 0.166
1.859ArgGlu: 1.859 ± 0.183
2.505ArgPhe: 2.505 ± 0.252
1.487ArgGly: 1.487 ± 0.175
0.45ArgHis: 0.45 ± 0.087
3.013ArgIle: 3.013 ± 0.224
2.133ArgLys: 2.133 ± 0.209
3.092ArgLeu: 3.092 ± 0.257
0.528ArgMet: 0.528 ± 0.101
2.152ArgAsn: 2.152 ± 0.208
0.646ArgPro: 0.646 ± 0.112
0.959ArgGln: 0.959 ± 0.101
1.057ArgArg: 1.057 ± 0.151
1.918ArgSer: 1.918 ± 0.194
1.82ArgThr: 1.82 ± 0.194
1.585ArgVal: 1.585 ± 0.199
0.176ArgTrp: 0.176 ± 0.056
1.878ArgTyr: 1.878 ± 0.206
0.0ArgXaa: 0.0 ± 0.0
Ser
2.779SerAla: 2.779 ± 0.247
0.841SerCys: 0.841 ± 0.126
4.187SerAsp: 4.187 ± 0.306
4.579SerGlu: 4.579 ± 0.286
3.777SerPhe: 3.777 ± 0.26
3.914SerGly: 3.914 ± 0.303
0.802SerHis: 0.802 ± 0.118
5.929SerIle: 5.929 ± 0.31
8.786SerLys: 8.786 ± 0.499
6.007SerLeu: 6.007 ± 0.295
1.722SerMet: 1.722 ± 0.198
6.046SerAsn: 6.046 ± 0.372
1.468SerPro: 1.468 ± 0.181
2.387SerGln: 2.387 ± 0.229
2.055SerArg: 2.055 ± 0.195
4.501SerSer: 4.501 ± 0.351
3.483SerThr: 3.483 ± 0.287
3.777SerVal: 3.777 ± 0.387
0.411SerTrp: 0.411 ± 0.093
3.053SerTyr: 3.053 ± 0.248
0.0SerXaa: 0.0 ± 0.0
Thr
2.289ThrAla: 2.289 ± 0.28
0.822ThrCys: 0.822 ± 0.189
3.542ThrAsp: 3.542 ± 0.29
3.953ThrGlu: 3.953 ± 0.277
2.837ThrPhe: 2.837 ± 0.245
2.602ThrGly: 2.602 ± 0.235
0.685ThrHis: 0.685 ± 0.135
4.794ThrIle: 4.794 ± 0.296
6.418ThrLys: 6.418 ± 0.407
4.461ThrLeu: 4.461 ± 0.258
1.057ThrMet: 1.057 ± 0.121
4.442ThrAsn: 4.442 ± 0.341
1.976ThrPro: 1.976 ± 0.206
2.113ThrGln: 2.113 ± 0.25
2.133ThrArg: 2.133 ± 0.195
3.307ThrSer: 3.307 ± 0.274
3.111ThrThr: 3.111 ± 0.281
2.681ThrVal: 2.681 ± 0.317
0.47ThrTrp: 0.47 ± 0.089
2.779ThrTyr: 2.779 ± 0.296
0.0ThrXaa: 0.0 ± 0.0
Val
2.035ValAla: 2.035 ± 0.249
0.744ValCys: 0.744 ± 0.151
4.168ValAsp: 4.168 ± 0.264
3.385ValGlu: 3.385 ± 0.277
2.661ValPhe: 2.661 ± 0.244
2.27ValGly: 2.27 ± 0.235
0.607ValHis: 0.607 ± 0.108
3.561ValIle: 3.561 ± 0.294
4.403ValLys: 4.403 ± 0.304
4.833ValLeu: 4.833 ± 0.366
0.881ValMet: 0.881 ± 0.133
3.933ValAsn: 3.933 ± 0.312
1.624ValPro: 1.624 ± 0.232
1.683ValGln: 1.683 ± 0.158
1.761ValArg: 1.761 ± 0.164
3.581ValSer: 3.581 ± 0.371
2.974ValThr: 2.974 ± 0.293
3.111ValVal: 3.111 ± 0.273
0.313ValTrp: 0.313 ± 0.093
3.033ValTyr: 3.033 ± 0.237
0.0ValXaa: 0.0 ± 0.0
Trp
0.254TrpAla: 0.254 ± 0.071
0.196TrpCys: 0.196 ± 0.054
0.626TrpAsp: 0.626 ± 0.099
0.47TrpGlu: 0.47 ± 0.086
0.274TrpPhe: 0.274 ± 0.078
0.352TrpGly: 0.352 ± 0.079
0.039TrpHis: 0.039 ± 0.027
0.47TrpIle: 0.47 ± 0.082
0.391TrpLys: 0.391 ± 0.098
0.45TrpLeu: 0.45 ± 0.104
0.215TrpMet: 0.215 ± 0.069
0.665TrpAsn: 0.665 ± 0.135
0.157TrpPro: 0.157 ± 0.049
0.294TrpGln: 0.294 ± 0.079
0.176TrpArg: 0.176 ± 0.057
0.489TrpSer: 0.489 ± 0.104
0.43TrpThr: 0.43 ± 0.108
0.45TrpVal: 0.45 ± 0.094
0.059TrpTrp: 0.059 ± 0.035
0.313TrpTyr: 0.313 ± 0.074
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.937TyrAla: 1.937 ± 0.193
1.252TyrCys: 1.252 ± 0.193
3.718TyrAsp: 3.718 ± 0.249
3.013TyrGlu: 3.013 ± 0.233
2.583TyrPhe: 2.583 ± 0.238
1.957TyrGly: 1.957 ± 0.215
0.724TyrHis: 0.724 ± 0.121
4.97TyrIle: 4.97 ± 0.347
6.183TyrLys: 6.183 ± 0.355
3.874TyrLeu: 3.874 ± 0.258
1.037TyrMet: 1.037 ± 0.136
4.774TyrAsn: 4.774 ± 0.289
1.428TyrPro: 1.428 ± 0.197
1.565TyrGln: 1.565 ± 0.164
1.82TyrArg: 1.82 ± 0.217
3.092TyrSer: 3.092 ± 0.251
3.757TyrThr: 3.757 ± 0.273
2.446TyrVal: 2.446 ± 0.286
0.47TyrTrp: 0.47 ± 0.095
2.739TyrTyr: 2.739 ± 0.21
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 257 proteins (51106 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski