Amino acid dipepetide frequency for Mycobacterium phage Bxz1 (Mycobacteriophage Bxz1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.776AlaAla: 10.776 ± 0.674
0.618AlaCys: 0.618 ± 0.12
6.985AlaAsp: 6.985 ± 0.544
6.261AlaGlu: 6.261 ± 0.452
2.577AlaPhe: 2.577 ± 0.256
7.986AlaGly: 7.986 ± 0.71
1.768AlaHis: 1.768 ± 0.219
4.323AlaIle: 4.323 ± 0.381
4.259AlaLys: 4.259 ± 0.303
8.391AlaLeu: 8.391 ± 0.529
2.556AlaMet: 2.556 ± 0.22
3.344AlaAsn: 3.344 ± 0.295
5.409AlaPro: 5.409 ± 0.488
4.579AlaGln: 4.579 ± 0.362
6.474AlaArg: 6.474 ± 0.426
5.601AlaSer: 5.601 ± 0.462
5.409AlaThr: 5.409 ± 0.424
6.517AlaVal: 6.517 ± 0.485
1.384AlaTrp: 1.384 ± 0.156
2.683AlaTyr: 2.683 ± 0.245
0.0AlaXaa: 0.0 ± 0.0
Cys
0.639CysAla: 0.639 ± 0.144
0.106CysCys: 0.106 ± 0.044
0.532CysAsp: 0.532 ± 0.096
0.469CysGlu: 0.469 ± 0.091
0.319CysPhe: 0.319 ± 0.092
1.001CysGly: 1.001 ± 0.193
0.341CysHis: 0.341 ± 0.088
0.362CysIle: 0.362 ± 0.085
0.17CysLys: 0.17 ± 0.069
0.681CysLeu: 0.681 ± 0.127
0.213CysMet: 0.213 ± 0.071
0.17CysAsn: 0.17 ± 0.061
0.66CysPro: 0.66 ± 0.151
0.447CysGln: 0.447 ± 0.105
0.575CysArg: 0.575 ± 0.118
0.66CysSer: 0.66 ± 0.142
0.362CysThr: 0.362 ± 0.101
0.362CysVal: 0.362 ± 0.085
0.234CysTrp: 0.234 ± 0.082
0.149CysTyr: 0.149 ± 0.055
0.0CysXaa: 0.0 ± 0.0
Asp
6.368AspAla: 6.368 ± 0.405
0.532AspCys: 0.532 ± 0.118
5.282AspAsp: 5.282 ± 0.395
5.622AspGlu: 5.622 ± 0.379
2.534AspPhe: 2.534 ± 0.207
5.132AspGly: 5.132 ± 0.371
1.895AspHis: 1.895 ± 0.219
2.854AspIle: 2.854 ± 0.302
2.385AspLys: 2.385 ± 0.193
5.878AspLeu: 5.878 ± 0.377
2.002AspMet: 2.002 ± 0.245
1.789AspAsn: 1.789 ± 0.196
5.345AspPro: 5.345 ± 0.414
2.683AspGln: 2.683 ± 0.243
4.685AspArg: 4.685 ± 0.394
3.557AspSer: 3.557 ± 0.355
3.429AspThr: 3.429 ± 0.309
4.302AspVal: 4.302 ± 0.279
1.619AspTrp: 1.619 ± 0.216
2.811AspTyr: 2.811 ± 0.239
0.0AspXaa: 0.0 ± 0.0
Glu
6.432GluAla: 6.432 ± 0.433
0.575GluCys: 0.575 ± 0.139
5.196GluAsp: 5.196 ± 0.482
4.366GluGlu: 4.366 ± 0.374
1.917GluPhe: 1.917 ± 0.211
3.897GluGly: 3.897 ± 0.289
2.044GluHis: 2.044 ± 0.216
3.258GluIle: 3.258 ± 0.259
2.343GluLys: 2.343 ± 0.224
5.537GluLeu: 5.537 ± 0.379
1.661GluMet: 1.661 ± 0.191
1.576GluAsn: 1.576 ± 0.196
3.258GluPro: 3.258 ± 0.248
2.875GluGln: 2.875 ± 0.261
4.77GluArg: 4.77 ± 0.412
3.237GluSer: 3.237 ± 0.288
3.663GluThr: 3.663 ± 0.267
4.941GluVal: 4.941 ± 0.32
1.363GluTrp: 1.363 ± 0.164
1.853GluTyr: 1.853 ± 0.189
0.0GluXaa: 0.0 ± 0.0
Phe
2.832PheAla: 2.832 ± 0.252
0.277PheCys: 0.277 ± 0.094
2.747PheAsp: 2.747 ± 0.229
2.066PheGlu: 2.066 ± 0.184
0.703PhePhe: 0.703 ± 0.136
2.47PheGly: 2.47 ± 0.256
0.745PheHis: 0.745 ± 0.124
1.363PheIle: 1.363 ± 0.201
1.171PheLys: 1.171 ± 0.161
2.364PheLeu: 2.364 ± 0.241
0.596PheMet: 0.596 ± 0.128
0.98PheAsn: 0.98 ± 0.142
1.491PhePro: 1.491 ± 0.183
1.406PheGln: 1.406 ± 0.179
2.087PheArg: 2.087 ± 0.212
1.576PheSer: 1.576 ± 0.181
2.172PheThr: 2.172 ± 0.251
2.236PheVal: 2.236 ± 0.176
0.511PheTrp: 0.511 ± 0.111
1.129PheTyr: 1.129 ± 0.166
0.0PheXaa: 0.0 ± 0.0
Gly
6.495GlyAla: 6.495 ± 0.564
0.49GlyCys: 0.49 ± 0.105
4.983GlyAsp: 4.983 ± 0.33
4.728GlyGlu: 4.728 ± 0.31
2.407GlyPhe: 2.407 ± 0.235
8.54GlyGly: 8.54 ± 1.057
2.066GlyHis: 2.066 ± 0.231
3.706GlyIle: 3.706 ± 0.354
3.642GlyLys: 3.642 ± 0.372
6.133GlyLeu: 6.133 ± 0.421
2.044GlyMet: 2.044 ± 0.265
2.726GlyAsn: 2.726 ± 0.302
3.982GlyPro: 3.982 ± 0.421
3.535GlyGln: 3.535 ± 0.316
4.856GlyArg: 4.856 ± 0.304
4.685GlySer: 4.685 ± 0.347
5.431GlyThr: 5.431 ± 0.423
5.218GlyVal: 5.218 ± 0.374
1.917GlyTrp: 1.917 ± 0.191
2.492GlyTyr: 2.492 ± 0.252
0.0GlyXaa: 0.0 ± 0.0
His
2.044HisAla: 2.044 ± 0.209
0.17HisCys: 0.17 ± 0.06
1.682HisAsp: 1.682 ± 0.252
1.661HisGlu: 1.661 ± 0.179
0.809HisPhe: 0.809 ± 0.131
2.13HisGly: 2.13 ± 0.22
0.958HisHis: 0.958 ± 0.174
1.022HisIle: 1.022 ± 0.146
0.767HisLys: 0.767 ± 0.128
2.087HisLeu: 2.087 ± 0.284
0.447HisMet: 0.447 ± 0.102
0.596HisAsn: 0.596 ± 0.106
2.066HisPro: 2.066 ± 0.241
1.022HisGln: 1.022 ± 0.145
1.874HisArg: 1.874 ± 0.249
1.15HisSer: 1.15 ± 0.174
1.448HisThr: 1.448 ± 0.164
1.789HisVal: 1.789 ± 0.208
0.319HisTrp: 0.319 ± 0.079
0.894HisTyr: 0.894 ± 0.155
0.0HisXaa: 0.0 ± 0.0
Ile
4.749IleAla: 4.749 ± 0.364
0.426IleCys: 0.426 ± 0.096
3.045IleAsp: 3.045 ± 0.242
2.875IleGlu: 2.875 ± 0.238
1.171IlePhe: 1.171 ± 0.146
3.344IleGly: 3.344 ± 0.279
0.937IleHis: 0.937 ± 0.139
1.342IleIle: 1.342 ± 0.18
1.406IleLys: 1.406 ± 0.187
2.534IleLeu: 2.534 ± 0.263
0.937IleMet: 0.937 ± 0.145
1.746IleAsn: 1.746 ± 0.205
2.556IlePro: 2.556 ± 0.217
1.619IleGln: 1.619 ± 0.203
3.429IleArg: 3.429 ± 0.325
2.215IleSer: 2.215 ± 0.221
3.258IleThr: 3.258 ± 0.284
3.322IleVal: 3.322 ± 0.285
0.767IleTrp: 0.767 ± 0.127
1.214IleTyr: 1.214 ± 0.162
0.0IleXaa: 0.0 ± 0.0
Lys
4.344LysAla: 4.344 ± 0.365
0.298LysCys: 0.298 ± 0.073
2.087LysAsp: 2.087 ± 0.202
2.194LysGlu: 2.194 ± 0.228
1.384LysPhe: 1.384 ± 0.162
2.662LysGly: 2.662 ± 0.238
0.873LysHis: 0.873 ± 0.14
1.81LysIle: 1.81 ± 0.201
1.789LysLys: 1.789 ± 0.249
3.088LysLeu: 3.088 ± 0.268
1.107LysMet: 1.107 ± 0.185
1.342LysAsn: 1.342 ± 0.165
1.853LysPro: 1.853 ± 0.23
1.256LysGln: 1.256 ± 0.142
3.003LysArg: 3.003 ± 0.282
1.789LysSer: 1.789 ± 0.201
2.215LysThr: 2.215 ± 0.193
3.301LysVal: 3.301 ± 0.243
0.511LysTrp: 0.511 ± 0.107
1.15LysTyr: 1.15 ± 0.152
0.0LysXaa: 0.0 ± 0.0
Leu
8.242LeuAla: 8.242 ± 0.474
0.809LeuCys: 0.809 ± 0.124
6.495LeuAsp: 6.495 ± 0.426
4.217LeuGlu: 4.217 ± 0.342
1.917LeuPhe: 1.917 ± 0.23
5.196LeuGly: 5.196 ± 0.302
1.597LeuHis: 1.597 ± 0.197
3.258LeuIle: 3.258 ± 0.335
3.088LeuLys: 3.088 ± 0.276
5.132LeuLeu: 5.132 ± 0.408
2.194LeuMet: 2.194 ± 0.215
3.258LeuAsn: 3.258 ± 0.293
3.961LeuPro: 3.961 ± 0.273
2.726LeuGln: 2.726 ± 0.238
5.473LeuArg: 5.473 ± 0.406
4.195LeuSer: 4.195 ± 0.351
4.664LeuThr: 4.664 ± 0.386
5.473LeuVal: 5.473 ± 0.348
1.15LeuTrp: 1.15 ± 0.155
2.321LeuTyr: 2.321 ± 0.286
0.0LeuXaa: 0.0 ± 0.0
Met
2.769MetAla: 2.769 ± 0.272
0.128MetCys: 0.128 ± 0.047
1.597MetAsp: 1.597 ± 0.183
1.278MetGlu: 1.278 ± 0.152
0.703MetPhe: 0.703 ± 0.117
1.917MetGly: 1.917 ± 0.248
0.49MetHis: 0.49 ± 0.11
0.958MetIle: 0.958 ± 0.149
1.107MetLys: 1.107 ± 0.13
1.533MetLeu: 1.533 ± 0.18
0.745MetMet: 0.745 ± 0.119
0.98MetAsn: 0.98 ± 0.137
1.491MetPro: 1.491 ± 0.185
0.98MetGln: 0.98 ± 0.152
1.832MetArg: 1.832 ± 0.222
2.194MetSer: 2.194 ± 0.202
2.215MetThr: 2.215 ± 0.222
1.342MetVal: 1.342 ± 0.177
0.405MetTrp: 0.405 ± 0.101
0.469MetTyr: 0.469 ± 0.105
0.0MetXaa: 0.0 ± 0.0
Asn
3.194AsnAla: 3.194 ± 0.254
0.234AsnCys: 0.234 ± 0.075
1.959AsnAsp: 1.959 ± 0.223
1.682AsnGlu: 1.682 ± 0.169
1.171AsnPhe: 1.171 ± 0.139
3.173AsnGly: 3.173 ± 0.303
0.852AsnHis: 0.852 ± 0.142
1.406AsnIle: 1.406 ± 0.177
1.512AsnLys: 1.512 ± 0.179
2.108AsnLeu: 2.108 ± 0.174
0.873AsnMet: 0.873 ± 0.156
1.469AsnAsn: 1.469 ± 0.182
2.918AsnPro: 2.918 ± 0.286
1.363AsnGln: 1.363 ± 0.17
2.726AsnArg: 2.726 ± 0.228
1.682AsnSer: 1.682 ± 0.179
2.108AsnThr: 2.108 ± 0.228
1.959AsnVal: 1.959 ± 0.191
0.724AsnTrp: 0.724 ± 0.137
0.873AsnTyr: 0.873 ± 0.129
0.0AsnXaa: 0.0 ± 0.0
Pro
4.983ProAla: 4.983 ± 0.427
0.362ProCys: 0.362 ± 0.087
5.132ProAsp: 5.132 ± 0.401
4.813ProGlu: 4.813 ± 0.41
1.555ProPhe: 1.555 ± 0.185
5.793ProGly: 5.793 ± 0.503
1.406ProHis: 1.406 ± 0.201
1.981ProIle: 1.981 ± 0.195
1.768ProLys: 1.768 ± 0.197
3.237ProLeu: 3.237 ± 0.245
1.363ProMet: 1.363 ± 0.168
2.3ProAsn: 2.3 ± 0.207
3.365ProPro: 3.365 ± 0.463
2.343ProGln: 2.343 ± 0.204
2.747ProArg: 2.747 ± 0.241
2.875ProSer: 2.875 ± 0.304
3.833ProThr: 3.833 ± 0.319
4.6ProVal: 4.6 ± 0.382
1.171ProTrp: 1.171 ± 0.156
1.64ProTyr: 1.64 ± 0.189
0.0ProXaa: 0.0 ± 0.0
Gln
4.685GlnAla: 4.685 ± 0.359
0.277GlnCys: 0.277 ± 0.075
2.257GlnAsp: 2.257 ± 0.206
2.492GlnGlu: 2.492 ± 0.277
1.64GlnPhe: 1.64 ± 0.175
2.641GlnGly: 2.641 ± 0.255
1.278GlnHis: 1.278 ± 0.166
2.321GlnIle: 2.321 ± 0.17
0.894GlnLys: 0.894 ± 0.146
3.557GlnLeu: 3.557 ± 0.262
1.256GlnMet: 1.256 ± 0.182
1.406GlnAsn: 1.406 ± 0.18
1.895GlnPro: 1.895 ± 0.166
1.917GlnGln: 1.917 ± 0.237
3.578GlnArg: 3.578 ± 0.259
1.981GlnSer: 1.981 ± 0.207
2.47GlnThr: 2.47 ± 0.248
2.939GlnVal: 2.939 ± 0.273
0.703GlnTrp: 0.703 ± 0.113
1.171GlnTyr: 1.171 ± 0.149
0.0GlnXaa: 0.0 ± 0.0
Arg
6.219ArgAla: 6.219 ± 0.385
0.703ArgCys: 0.703 ± 0.125
3.961ArgAsp: 3.961 ± 0.336
5.537ArgGlu: 5.537 ± 0.364
2.385ArgPhe: 2.385 ± 0.291
4.259ArgGly: 4.259 ± 0.256
1.789ArgHis: 1.789 ± 0.23
2.918ArgIle: 2.918 ± 0.258
3.684ArgLys: 3.684 ± 0.29
5.132ArgLeu: 5.132 ± 0.42
1.895ArgMet: 1.895 ± 0.212
2.321ArgAsn: 2.321 ± 0.242
3.131ArgPro: 3.131 ± 0.304
3.237ArgGln: 3.237 ± 0.224
5.324ArgArg: 5.324 ± 0.45
3.684ArgSer: 3.684 ± 0.266
4.025ArgThr: 4.025 ± 0.321
5.132ArgVal: 5.132 ± 0.301
1.256ArgTrp: 1.256 ± 0.159
2.172ArgTyr: 2.172 ± 0.25
0.0ArgXaa: 0.0 ± 0.0
Ser
4.856SerAla: 4.856 ± 0.406
0.511SerCys: 0.511 ± 0.097
3.812SerAsp: 3.812 ± 0.401
2.982SerGlu: 2.982 ± 0.203
1.789SerPhe: 1.789 ± 0.213
5.303SerGly: 5.303 ± 0.416
0.852SerHis: 0.852 ± 0.138
2.215SerIle: 2.215 ± 0.217
2.066SerLys: 2.066 ± 0.205
4.302SerLeu: 4.302 ± 0.296
1.448SerMet: 1.448 ± 0.157
1.682SerAsn: 1.682 ± 0.175
3.003SerPro: 3.003 ± 0.245
2.151SerGln: 2.151 ± 0.222
3.344SerArg: 3.344 ± 0.252
3.557SerSer: 3.557 ± 0.326
3.876SerThr: 3.876 ± 0.294
3.727SerVal: 3.727 ± 0.328
1.256SerTrp: 1.256 ± 0.188
1.874SerTyr: 1.874 ± 0.193
0.0SerXaa: 0.0 ± 0.0
Thr
5.92ThrAla: 5.92 ± 0.476
0.724ThrCys: 0.724 ± 0.168
4.217ThrAsp: 4.217 ± 0.337
3.94ThrGlu: 3.94 ± 0.325
2.321ThrPhe: 2.321 ± 0.268
5.75ThrGly: 5.75 ± 0.349
1.895ThrHis: 1.895 ± 0.231
3.067ThrIle: 3.067 ± 0.319
2.236ThrLys: 2.236 ± 0.202
4.557ThrLeu: 4.557 ± 0.339
1.342ThrMet: 1.342 ± 0.162
2.343ThrAsn: 2.343 ± 0.289
4.281ThrPro: 4.281 ± 0.38
2.194ThrGln: 2.194 ± 0.25
3.258ThrArg: 3.258 ± 0.261
3.578ThrSer: 3.578 ± 0.288
4.43ThrThr: 4.43 ± 0.422
4.664ThrVal: 4.664 ± 0.379
1.32ThrTrp: 1.32 ± 0.167
2.108ThrTyr: 2.108 ± 0.209
0.0ThrXaa: 0.0 ± 0.0
Val
7.603ValAla: 7.603 ± 0.48
0.66ValCys: 0.66 ± 0.129
4.941ValAsp: 4.941 ± 0.261
4.877ValGlu: 4.877 ± 0.304
2.023ValPhe: 2.023 ± 0.23
5.218ValGly: 5.218 ± 0.383
1.789ValHis: 1.789 ± 0.2
3.067ValIle: 3.067 ± 0.282
2.3ValLys: 2.3 ± 0.235
5.324ValLeu: 5.324 ± 0.423
1.64ValMet: 1.64 ± 0.165
2.13ValAsn: 2.13 ± 0.191
4.281ValPro: 4.281 ± 0.301
2.832ValGln: 2.832 ± 0.306
4.494ValArg: 4.494 ± 0.394
3.791ValSer: 3.791 ± 0.252
5.239ValThr: 5.239 ± 0.407
5.899ValVal: 5.899 ± 0.526
1.171ValTrp: 1.171 ± 0.146
1.959ValTyr: 1.959 ± 0.276
0.0ValXaa: 0.0 ± 0.0
Trp
1.619TrpAla: 1.619 ± 0.167
0.192TrpCys: 0.192 ± 0.056
1.406TrpAsp: 1.406 ± 0.149
1.107TrpGlu: 1.107 ± 0.154
0.49TrpPhe: 0.49 ± 0.148
1.661TrpGly: 1.661 ± 0.204
0.426TrpHis: 0.426 ± 0.086
0.681TrpIle: 0.681 ± 0.116
0.681TrpLys: 0.681 ± 0.121
1.193TrpLeu: 1.193 ± 0.146
0.383TrpMet: 0.383 ± 0.079
0.618TrpAsn: 0.618 ± 0.122
0.894TrpPro: 0.894 ± 0.129
0.852TrpGln: 0.852 ± 0.145
1.597TrpArg: 1.597 ± 0.196
1.15TrpSer: 1.15 ± 0.145
1.555TrpThr: 1.555 ± 0.186
1.278TrpVal: 1.278 ± 0.154
0.554TrpTrp: 0.554 ± 0.112
0.511TrpTyr: 0.511 ± 0.092
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.131TyrAla: 3.131 ± 0.254
0.362TyrCys: 0.362 ± 0.097
2.385TyrAsp: 2.385 ± 0.269
1.597TyrGlu: 1.597 ± 0.192
1.107TyrPhe: 1.107 ± 0.187
2.002TyrGly: 2.002 ± 0.218
0.937TyrHis: 0.937 ± 0.155
0.98TyrIle: 0.98 ± 0.136
0.788TyrLys: 0.788 ± 0.134
2.598TyrLeu: 2.598 ± 0.267
0.405TyrMet: 0.405 ± 0.082
1.214TyrAsn: 1.214 ± 0.182
1.427TyrPro: 1.427 ± 0.149
1.342TyrGln: 1.342 ± 0.192
2.662TyrArg: 2.662 ± 0.249
1.448TyrSer: 1.448 ± 0.147
2.343TyrThr: 2.343 ± 0.221
2.236TyrVal: 2.236 ± 0.241
0.511TyrTrp: 0.511 ± 0.097
1.107TyrTyr: 1.107 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 225 proteins (46957 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski