Amino acid dipepetide frequency for Mycobacterium phage Redno2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.231AlaAla: 11.231 ± 0.855
1.016AlaCys: 1.016 ± 0.199
6.81AlaAsp: 6.81 ± 0.482
7.766AlaGlu: 7.766 ± 0.528
3.286AlaPhe: 3.286 ± 0.329
6.84AlaGly: 6.84 ± 0.601
2.24AlaHis: 2.24 ± 0.293
5.257AlaIle: 5.257 ± 0.393
4.301AlaLys: 4.301 ± 0.392
8.782AlaLeu: 8.782 ± 0.513
3.375AlaMet: 3.375 ± 0.428
2.748AlaAsn: 2.748 ± 0.34
4.182AlaPro: 4.182 ± 0.323
2.688AlaGln: 2.688 ± 0.318
6.96AlaArg: 6.96 ± 0.532
4.779AlaSer: 4.779 ± 0.411
4.809AlaThr: 4.809 ± 0.385
6.303AlaVal: 6.303 ± 0.406
2.001AlaTrp: 2.001 ± 0.252
2.688AlaTyr: 2.688 ± 0.328
0.0AlaXaa: 0.0 ± 0.0
Cys
1.016CysAla: 1.016 ± 0.201
0.149CysCys: 0.149 ± 0.066
1.165CysAsp: 1.165 ± 0.183
1.344CysGlu: 1.344 ± 0.191
0.418CysPhe: 0.418 ± 0.139
1.732CysGly: 1.732 ± 0.226
0.508CysHis: 0.508 ± 0.126
0.568CysIle: 0.568 ± 0.126
0.627CysLys: 0.627 ± 0.14
1.075CysLeu: 1.075 ± 0.201
0.388CysMet: 0.388 ± 0.113
0.627CysAsn: 0.627 ± 0.153
0.657CysPro: 0.657 ± 0.146
0.478CysGln: 0.478 ± 0.131
0.986CysArg: 0.986 ± 0.192
0.508CysSer: 0.508 ± 0.126
0.777CysThr: 0.777 ± 0.162
0.836CysVal: 0.836 ± 0.155
0.179CysTrp: 0.179 ± 0.075
0.568CysTyr: 0.568 ± 0.12
0.0CysXaa: 0.0 ± 0.0
Asp
6.123AspAla: 6.123 ± 0.423
1.045AspCys: 1.045 ± 0.157
4.66AspAsp: 4.66 ± 0.38
5.078AspGlu: 5.078 ± 0.499
2.21AspPhe: 2.21 ± 0.212
7.049AspGly: 7.049 ± 0.479
1.703AspHis: 1.703 ± 0.224
3.674AspIle: 3.674 ± 0.311
2.688AspLys: 2.688 ± 0.267
5.586AspLeu: 5.586 ± 0.332
0.896AspMet: 0.896 ± 0.137
2.658AspAsn: 2.658 ± 0.267
3.525AspPro: 3.525 ± 0.322
1.942AspGln: 1.942 ± 0.262
3.913AspArg: 3.913 ± 0.344
2.897AspSer: 2.897 ± 0.308
3.047AspThr: 3.047 ± 0.326
4.062AspVal: 4.062 ± 0.402
2.24AspTrp: 2.24 ± 0.266
2.33AspTyr: 2.33 ± 0.261
0.0AspXaa: 0.0 ± 0.0
Glu
7.169GluAla: 7.169 ± 0.455
1.404GluCys: 1.404 ± 0.24
4.54GluAsp: 4.54 ± 0.43
5.347GluGlu: 5.347 ± 0.444
2.927GluPhe: 2.927 ± 0.289
4.152GluGly: 4.152 ± 0.349
1.105GluHis: 1.105 ± 0.17
4.032GluIle: 4.032 ± 0.342
2.509GluLys: 2.509 ± 0.303
6.781GluLeu: 6.781 ± 0.491
2.419GluMet: 2.419 ± 0.345
1.374GluAsn: 1.374 ± 0.222
3.256GluPro: 3.256 ± 0.361
2.33GluGln: 2.33 ± 0.256
4.929GluArg: 4.929 ± 0.488
3.375GluSer: 3.375 ± 0.298
3.375GluThr: 3.375 ± 0.3
4.57GluVal: 4.57 ± 0.351
1.195GluTrp: 1.195 ± 0.2
2.569GluTyr: 2.569 ± 0.258
0.0GluXaa: 0.0 ± 0.0
Phe
2.509PheAla: 2.509 ± 0.272
0.448PheCys: 0.448 ± 0.112
2.509PheAsp: 2.509 ± 0.277
2.39PheGlu: 2.39 ± 0.234
0.986PhePhe: 0.986 ± 0.197
3.166PheGly: 3.166 ± 0.403
0.568PheHis: 0.568 ± 0.149
1.314PheIle: 1.314 ± 0.19
1.016PheLys: 1.016 ± 0.163
2.569PheLeu: 2.569 ± 0.269
0.806PheMet: 0.806 ± 0.123
1.434PheAsn: 1.434 ± 0.234
1.912PhePro: 1.912 ± 0.252
1.045PheGln: 1.045 ± 0.191
2.091PheArg: 2.091 ± 0.253
1.762PheSer: 1.762 ± 0.234
1.822PheThr: 1.822 ± 0.213
2.151PheVal: 2.151 ± 0.253
0.568PheTrp: 0.568 ± 0.134
0.806PheTyr: 0.806 ± 0.146
0.0PheXaa: 0.0 ± 0.0
Gly
6.99GlyAla: 6.99 ± 0.586
1.494GlyCys: 1.494 ± 0.196
5.765GlyAsp: 5.765 ± 0.408
5.825GlyGlu: 5.825 ± 0.469
3.286GlyPhe: 3.286 ± 0.338
7.946GlyGly: 7.946 ± 1.544
2.36GlyHis: 2.36 ± 0.25
3.973GlyIle: 3.973 ± 0.407
3.584GlyLys: 3.584 ± 0.369
7.139GlyLeu: 7.139 ± 0.537
2.061GlyMet: 2.061 ± 0.261
3.017GlyAsn: 3.017 ± 0.31
3.973GlyPro: 3.973 ± 0.377
2.748GlyGln: 2.748 ± 0.437
5.407GlyArg: 5.407 ± 0.331
4.899GlySer: 4.899 ± 0.386
4.6GlyThr: 4.6 ± 0.416
4.988GlyVal: 4.988 ± 0.389
1.882GlyTrp: 1.882 ± 0.257
3.136GlyTyr: 3.136 ± 0.265
0.0GlyXaa: 0.0 ± 0.0
His
1.583HisAla: 1.583 ± 0.225
0.269HisCys: 0.269 ± 0.097
1.673HisAsp: 1.673 ± 0.213
1.464HisGlu: 1.464 ± 0.211
0.508HisPhe: 0.508 ± 0.146
2.21HisGly: 2.21 ± 0.26
0.956HisHis: 0.956 ± 0.164
0.926HisIle: 0.926 ± 0.214
0.717HisLys: 0.717 ± 0.146
2.748HisLeu: 2.748 ± 0.306
0.597HisMet: 0.597 ± 0.127
0.508HisAsn: 0.508 ± 0.129
1.404HisPro: 1.404 ± 0.205
0.866HisGln: 0.866 ± 0.124
1.673HisArg: 1.673 ± 0.233
0.836HisSer: 0.836 ± 0.151
1.016HisThr: 1.016 ± 0.166
1.314HisVal: 1.314 ± 0.222
0.717HisTrp: 0.717 ± 0.13
0.717HisTyr: 0.717 ± 0.142
0.0HisXaa: 0.0 ± 0.0
Ile
5.855IleAla: 5.855 ± 0.408
0.568IleCys: 0.568 ± 0.131
3.764IleAsp: 3.764 ± 0.307
3.973IleGlu: 3.973 ± 0.342
1.045IlePhe: 1.045 ± 0.163
4.032IleGly: 4.032 ± 0.427
1.374IleHis: 1.374 ± 0.234
2.091IleIle: 2.091 ± 0.258
1.703IleLys: 1.703 ± 0.205
3.704IleLeu: 3.704 ± 0.361
0.956IleMet: 0.956 ± 0.167
1.643IleAsn: 1.643 ± 0.189
3.435IlePro: 3.435 ± 0.315
1.822IleGln: 1.822 ± 0.183
2.838IleArg: 2.838 ± 0.288
2.3IleSer: 2.3 ± 0.25
2.897IleThr: 2.897 ± 0.306
3.495IleVal: 3.495 ± 0.346
0.777IleTrp: 0.777 ± 0.136
1.344IleTyr: 1.344 ± 0.22
0.0IleXaa: 0.0 ± 0.0
Lys
4.152LysAla: 4.152 ± 0.416
0.687LysCys: 0.687 ± 0.152
2.061LysAsp: 2.061 ± 0.235
2.001LysGlu: 2.001 ± 0.3
1.075LysPhe: 1.075 ± 0.166
3.077LysGly: 3.077 ± 0.287
0.956LysHis: 0.956 ± 0.166
1.494LysIle: 1.494 ± 0.22
1.852LysLys: 1.852 ± 0.288
3.525LysLeu: 3.525 ± 0.332
1.494LysMet: 1.494 ± 0.204
0.747LysAsn: 0.747 ± 0.149
2.748LysPro: 2.748 ± 0.292
1.225LysGln: 1.225 ± 0.175
3.017LysArg: 3.017 ± 0.322
2.031LysSer: 2.031 ± 0.33
1.553LysThr: 1.553 ± 0.219
3.256LysVal: 3.256 ± 0.322
1.135LysTrp: 1.135 ± 0.172
1.434LysTyr: 1.434 ± 0.211
0.0LysXaa: 0.0 ± 0.0
Leu
9.14LeuAla: 9.14 ± 0.405
0.836LeuCys: 0.836 ± 0.181
5.645LeuAsp: 5.645 ± 0.324
5.108LeuGlu: 5.108 ± 0.415
2.449LeuPhe: 2.449 ± 0.239
6.661LeuGly: 6.661 ± 0.505
1.912LeuHis: 1.912 ± 0.229
3.196LeuIle: 3.196 ± 0.353
3.256LeuLys: 3.256 ± 0.292
6.213LeuLeu: 6.213 ± 0.533
2.001LeuMet: 2.001 ± 0.239
3.435LeuAsn: 3.435 ± 0.316
4.57LeuPro: 4.57 ± 0.373
2.509LeuGln: 2.509 ± 0.281
5.616LeuArg: 5.616 ± 0.442
5.138LeuSer: 5.138 ± 0.42
5.436LeuThr: 5.436 ± 0.387
4.988LeuVal: 4.988 ± 0.391
1.523LeuTrp: 1.523 ± 0.185
2.031LeuTyr: 2.031 ± 0.272
0.0LeuXaa: 0.0 ± 0.0
Met
2.539MetAla: 2.539 ± 0.244
0.269MetCys: 0.269 ± 0.095
1.404MetAsp: 1.404 ± 0.203
1.195MetGlu: 1.195 ± 0.203
0.717MetPhe: 0.717 ± 0.155
1.822MetGly: 1.822 ± 0.213
0.358MetHis: 0.358 ± 0.097
1.523MetIle: 1.523 ± 0.22
1.165MetLys: 1.165 ± 0.174
1.434MetLeu: 1.434 ± 0.216
0.627MetMet: 0.627 ± 0.146
1.075MetAsn: 1.075 ± 0.143
1.284MetPro: 1.284 ± 0.187
0.687MetGln: 0.687 ± 0.155
1.255MetArg: 1.255 ± 0.172
2.658MetSer: 2.658 ± 0.256
2.27MetThr: 2.27 ± 0.242
1.165MetVal: 1.165 ± 0.175
0.448MetTrp: 0.448 ± 0.119
0.478MetTyr: 0.478 ± 0.12
0.0MetXaa: 0.0 ± 0.0
Asn
3.286AsnAla: 3.286 ± 0.309
0.329AsnCys: 0.329 ± 0.091
2.121AsnAsp: 2.121 ± 0.214
1.613AsnGlu: 1.613 ± 0.204
1.135AsnPhe: 1.135 ± 0.187
3.584AsnGly: 3.584 ± 0.297
0.806AsnHis: 0.806 ± 0.149
1.374AsnIle: 1.374 ± 0.211
1.284AsnLys: 1.284 ± 0.188
2.688AsnLeu: 2.688 ± 0.297
0.538AsnMet: 0.538 ± 0.128
0.687AsnAsn: 0.687 ± 0.131
2.658AsnPro: 2.658 ± 0.257
1.016AsnGln: 1.016 ± 0.181
2.419AsnArg: 2.419 ± 0.261
1.613AsnSer: 1.613 ± 0.22
1.494AsnThr: 1.494 ± 0.248
2.36AsnVal: 2.36 ± 0.26
0.806AsnTrp: 0.806 ± 0.148
1.016AsnTyr: 1.016 ± 0.193
0.0AsnXaa: 0.0 ± 0.0
Pro
5.108ProAla: 5.108 ± 0.412
0.777ProCys: 0.777 ± 0.135
3.644ProAsp: 3.644 ± 0.323
4.391ProGlu: 4.391 ± 0.405
1.822ProPhe: 1.822 ± 0.25
5.526ProGly: 5.526 ± 0.575
0.806ProHis: 0.806 ± 0.148
2.33ProIle: 2.33 ± 0.233
2.21ProLys: 2.21 ± 0.28
4.212ProLeu: 4.212 ± 0.382
1.105ProMet: 1.105 ± 0.165
2.569ProAsn: 2.569 ± 0.238
2.927ProPro: 2.927 ± 0.385
1.344ProGln: 1.344 ± 0.179
3.166ProArg: 3.166 ± 0.333
2.151ProSer: 2.151 ± 0.248
2.927ProThr: 2.927 ± 0.325
4.152ProVal: 4.152 ± 0.344
1.434ProTrp: 1.434 ± 0.211
1.434ProTyr: 1.434 ± 0.18
0.0ProXaa: 0.0 ± 0.0
Gln
3.256GlnAla: 3.256 ± 0.298
0.388GlnCys: 0.388 ± 0.109
1.374GlnAsp: 1.374 ± 0.203
2.181GlnGlu: 2.181 ± 0.267
1.344GlnPhe: 1.344 ± 0.207
2.181GlnGly: 2.181 ± 0.239
0.597GlnHis: 0.597 ± 0.133
1.792GlnIle: 1.792 ± 0.227
1.732GlnLys: 1.732 ± 0.319
2.539GlnLeu: 2.539 ± 0.261
1.016GlnMet: 1.016 ± 0.161
0.866GlnAsn: 0.866 ± 0.162
1.792GlnPro: 1.792 ± 0.232
1.165GlnGln: 1.165 ± 0.232
2.181GlnArg: 2.181 ± 0.261
1.583GlnSer: 1.583 ± 0.215
1.613GlnThr: 1.613 ± 0.219
2.27GlnVal: 2.27 ± 0.243
0.717GlnTrp: 0.717 ± 0.138
0.747GlnTyr: 0.747 ± 0.119
0.0GlnXaa: 0.0 ± 0.0
Arg
6.601ArgAla: 6.601 ± 0.523
1.255ArgCys: 1.255 ± 0.225
4.242ArgAsp: 4.242 ± 0.387
4.182ArgGlu: 4.182 ± 0.384
2.121ArgPhe: 2.121 ± 0.226
5.078ArgGly: 5.078 ± 0.395
1.494ArgHis: 1.494 ± 0.252
3.823ArgIle: 3.823 ± 0.42
2.838ArgLys: 2.838 ± 0.279
4.63ArgLeu: 4.63 ± 0.462
2.001ArgMet: 2.001 ± 0.233
1.942ArgAsn: 1.942 ± 0.245
3.196ArgPro: 3.196 ± 0.345
2.718ArgGln: 2.718 ± 0.32
5.138ArgArg: 5.138 ± 0.446
2.868ArgSer: 2.868 ± 0.297
3.375ArgThr: 3.375 ± 0.418
4.988ArgVal: 4.988 ± 0.361
2.21ArgTrp: 2.21 ± 0.261
2.33ArgTyr: 2.33 ± 0.341
0.0ArgXaa: 0.0 ± 0.0
Ser
5.257SerAla: 5.257 ± 0.399
0.866SerCys: 0.866 ± 0.175
3.166SerAsp: 3.166 ± 0.331
3.973SerGlu: 3.973 ± 0.339
1.434SerPhe: 1.434 ± 0.19
4.839SerGly: 4.839 ± 0.486
1.045SerHis: 1.045 ± 0.187
3.136SerIle: 3.136 ± 0.319
1.822SerLys: 1.822 ± 0.245
4.361SerLeu: 4.361 ± 0.32
1.135SerMet: 1.135 ± 0.177
1.792SerAsn: 1.792 ± 0.257
2.629SerPro: 2.629 ± 0.24
1.225SerGln: 1.225 ± 0.185
3.375SerArg: 3.375 ± 0.364
3.047SerSer: 3.047 ± 0.368
2.688SerThr: 2.688 ± 0.333
3.584SerVal: 3.584 ± 0.322
1.553SerTrp: 1.553 ± 0.207
1.523SerTyr: 1.523 ± 0.214
0.0SerXaa: 0.0 ± 0.0
Thr
4.63ThrAla: 4.63 ± 0.417
0.956ThrCys: 0.956 ± 0.173
3.614ThrAsp: 3.614 ± 0.372
2.897ThrGlu: 2.897 ± 0.281
1.971ThrPhe: 1.971 ± 0.247
4.929ThrGly: 4.929 ± 0.391
1.135ThrHis: 1.135 ± 0.186
2.927ThrIle: 2.927 ± 0.317
1.852ThrLys: 1.852 ± 0.277
4.63ThrLeu: 4.63 ± 0.341
0.866ThrMet: 0.866 ± 0.14
1.613ThrAsn: 1.613 ± 0.221
4.062ThrPro: 4.062 ± 0.323
1.374ThrGln: 1.374 ± 0.209
2.688ThrArg: 2.688 ± 0.288
2.927ThrSer: 2.927 ± 0.416
2.957ThrThr: 2.957 ± 0.394
4.481ThrVal: 4.481 ± 0.423
1.643ThrTrp: 1.643 ± 0.239
2.001ThrTyr: 2.001 ± 0.278
0.0ThrXaa: 0.0 ± 0.0
Val
6.721ValAla: 6.721 ± 0.46
0.836ValCys: 0.836 ± 0.153
5.377ValAsp: 5.377 ± 0.464
5.317ValGlu: 5.317 ± 0.419
1.732ValPhe: 1.732 ± 0.234
5.257ValGly: 5.257 ± 0.462
1.434ValHis: 1.434 ± 0.196
4.032ValIle: 4.032 ± 0.292
2.509ValLys: 2.509 ± 0.274
4.749ValLeu: 4.749 ± 0.38
1.165ValMet: 1.165 ± 0.21
2.33ValAsn: 2.33 ± 0.306
3.435ValPro: 3.435 ± 0.291
2.509ValGln: 2.509 ± 0.26
4.63ValArg: 4.63 ± 0.42
4.271ValSer: 4.271 ± 0.318
4.271ValThr: 4.271 ± 0.37
5.884ValVal: 5.884 ± 0.543
0.986ValTrp: 0.986 ± 0.168
2.031ValTyr: 2.031 ± 0.267
0.0ValXaa: 0.0 ± 0.0
Trp
1.882TrpAla: 1.882 ± 0.228
0.478TrpCys: 0.478 ± 0.117
1.523TrpAsp: 1.523 ± 0.233
1.225TrpGlu: 1.225 ± 0.205
0.687TrpPhe: 0.687 ± 0.14
1.673TrpGly: 1.673 ± 0.205
0.777TrpHis: 0.777 ± 0.159
1.016TrpIle: 1.016 ± 0.173
0.806TrpLys: 0.806 ± 0.16
1.942TrpLeu: 1.942 ± 0.229
0.508TrpMet: 0.508 ± 0.121
0.806TrpAsn: 0.806 ± 0.138
0.896TrpPro: 0.896 ± 0.139
0.657TrpGln: 0.657 ± 0.13
1.942TrpArg: 1.942 ± 0.271
1.523TrpSer: 1.523 ± 0.204
1.523TrpThr: 1.523 ± 0.195
2.001TrpVal: 2.001 ± 0.228
0.836TrpTrp: 0.836 ± 0.162
0.836TrpTyr: 0.836 ± 0.174
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.017TyrAla: 3.017 ± 0.312
0.538TyrCys: 0.538 ± 0.122
2.27TyrAsp: 2.27 ± 0.247
2.091TyrGlu: 2.091 ± 0.234
0.717TyrPhe: 0.717 ± 0.154
3.345TyrGly: 3.345 ± 0.325
0.568TyrHis: 0.568 ± 0.135
1.135TyrIle: 1.135 ± 0.203
1.045TyrLys: 1.045 ± 0.17
2.36TyrLeu: 2.36 ± 0.262
0.388TyrMet: 0.388 ± 0.104
0.956TyrAsn: 0.956 ± 0.166
1.494TyrPro: 1.494 ± 0.212
0.986TyrGln: 0.986 ± 0.171
2.808TyrArg: 2.808 ± 0.301
1.284TyrSer: 1.284 ± 0.172
1.703TyrThr: 1.703 ± 0.209
2.629TyrVal: 2.629 ± 0.294
0.687TyrTrp: 0.687 ± 0.148
0.956TyrTyr: 0.956 ± 0.18
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 231 proteins (33479 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski