Amino acid dipepetide frequency for Shigella phage Shf125875

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.991AlaAla: 4.991 ± 0.372
0.414AlaCys: 0.414 ± 0.082
3.842AlaAsp: 3.842 ± 0.309
5.311AlaGlu: 5.311 ± 0.364
2.26AlaPhe: 2.26 ± 0.188
4.294AlaGly: 4.294 ± 0.344
1.036AlaHis: 1.036 ± 0.156
4.803AlaIle: 4.803 ± 0.329
5.236AlaLys: 5.236 ± 0.342
5.801AlaLeu: 5.801 ± 0.321
1.413AlaMet: 1.413 ± 0.166
3.371AlaAsn: 3.371 ± 0.221
2.562AlaPro: 2.562 ± 0.237
2.467AlaGln: 2.467 ± 0.207
3.07AlaArg: 3.07 ± 0.221
4.577AlaSer: 4.577 ± 0.315
3.767AlaThr: 3.767 ± 0.355
4.163AlaVal: 4.163 ± 0.279
1.017AlaTrp: 1.017 ± 0.144
2.693AlaTyr: 2.693 ± 0.237
0.0AlaXaa: 0.0 ± 0.0
Cys
0.64CysAla: 0.64 ± 0.106
0.207CysCys: 0.207 ± 0.066
0.659CysAsp: 0.659 ± 0.112
0.753CysGlu: 0.753 ± 0.143
0.414CysPhe: 0.414 ± 0.089
0.753CysGly: 0.753 ± 0.117
0.226CysHis: 0.226 ± 0.065
0.678CysIle: 0.678 ± 0.108
0.81CysLys: 0.81 ± 0.113
0.64CysLeu: 0.64 ± 0.116
0.264CysMet: 0.264 ± 0.064
0.565CysAsn: 0.565 ± 0.093
0.603CysPro: 0.603 ± 0.114
0.358CysGln: 0.358 ± 0.081
0.659CysArg: 0.659 ± 0.111
0.923CysSer: 0.923 ± 0.149
0.509CysThr: 0.509 ± 0.091
0.584CysVal: 0.584 ± 0.098
0.113CysTrp: 0.113 ± 0.034
0.546CysTyr: 0.546 ± 0.091
0.0CysXaa: 0.0 ± 0.0
Asp
4.577AspAla: 4.577 ± 0.265
0.603AspCys: 0.603 ± 0.112
3.974AspAsp: 3.974 ± 0.286
4.558AspGlu: 4.558 ± 0.346
3.371AspPhe: 3.371 ± 0.253
4.897AspGly: 4.897 ± 0.293
0.772AspHis: 0.772 ± 0.125
4.878AspIle: 4.878 ± 0.262
3.842AspLys: 3.842 ± 0.28
4.784AspLeu: 4.784 ± 0.326
1.544AspMet: 1.544 ± 0.2
2.938AspAsn: 2.938 ± 0.202
2.223AspPro: 2.223 ± 0.214
1.676AspGln: 1.676 ± 0.171
2.204AspArg: 2.204 ± 0.2
3.993AspSer: 3.993 ± 0.269
3.164AspThr: 3.164 ± 0.293
4.464AspVal: 4.464 ± 0.266
1.262AspTrp: 1.262 ± 0.168
3.164AspTyr: 3.164 ± 0.286
0.0AspXaa: 0.0 ± 0.0
Glu
5.236GluAla: 5.236 ± 0.332
1.092GluCys: 1.092 ± 0.16
4.389GluAsp: 4.389 ± 0.288
4.991GluGlu: 4.991 ± 0.305
3.654GluPhe: 3.654 ± 0.301
3.729GluGly: 3.729 ± 0.249
1.111GluHis: 1.111 ± 0.14
6.404GluIle: 6.404 ± 0.31
4.859GluLys: 4.859 ± 0.354
6.253GluLeu: 6.253 ± 0.368
2.204GluMet: 2.204 ± 0.217
3.729GluAsn: 3.729 ± 0.275
1.865GluPro: 1.865 ± 0.215
2.467GluGln: 2.467 ± 0.238
2.938GluArg: 2.938 ± 0.251
3.993GluSer: 3.993 ± 0.285
4.219GluThr: 4.219 ± 0.275
5.198GluVal: 5.198 ± 0.331
0.904GluTrp: 0.904 ± 0.118
3.541GluTyr: 3.541 ± 0.288
0.0GluXaa: 0.0 ± 0.0
Phe
2.806PheAla: 2.806 ± 0.206
0.452PheCys: 0.452 ± 0.09
3.089PheAsp: 3.089 ± 0.27
3.748PheGlu: 3.748 ± 0.259
1.375PhePhe: 1.375 ± 0.159
2.731PheGly: 2.731 ± 0.206
0.735PheHis: 0.735 ± 0.12
3.164PheIle: 3.164 ± 0.227
3.88PheLys: 3.88 ± 0.325
2.524PheLeu: 2.524 ± 0.236
1.149PheMet: 1.149 ± 0.146
3.108PheAsn: 3.108 ± 0.264
0.961PhePro: 0.961 ± 0.129
1.413PheGln: 1.413 ± 0.185
1.827PheArg: 1.827 ± 0.19
3.014PheSer: 3.014 ± 0.173
2.712PheThr: 2.712 ± 0.195
2.938PheVal: 2.938 ± 0.229
0.546PheTrp: 0.546 ± 0.102
1.883PheTyr: 1.883 ± 0.179
0.0PheXaa: 0.0 ± 0.0
Gly
3.108GlyAla: 3.108 ± 0.233
0.527GlyCys: 0.527 ± 0.095
4.031GlyAsp: 4.031 ± 0.312
3.71GlyGlu: 3.71 ± 0.252
2.769GlyPhe: 2.769 ± 0.207
4.257GlyGly: 4.257 ± 0.598
1.055GlyHis: 1.055 ± 0.152
4.52GlyIle: 4.52 ± 0.303
4.313GlyLys: 4.313 ± 0.298
5.311GlyLeu: 5.311 ± 0.273
1.62GlyMet: 1.62 ± 0.194
3.334GlyAsn: 3.334 ± 0.367
1.883GlyPro: 1.883 ± 0.165
2.185GlyGln: 2.185 ± 0.256
2.844GlyArg: 2.844 ± 0.236
3.786GlySer: 3.786 ± 0.264
4.464GlyThr: 4.464 ± 0.351
3.974GlyVal: 3.974 ± 0.327
0.961GlyTrp: 0.961 ± 0.129
3.014GlyTyr: 3.014 ± 0.264
0.0GlyXaa: 0.0 ± 0.0
His
0.942HisAla: 0.942 ± 0.126
0.32HisCys: 0.32 ± 0.089
1.036HisAsp: 1.036 ± 0.14
1.055HisGlu: 1.055 ± 0.15
0.753HisPhe: 0.753 ± 0.128
1.017HisGly: 1.017 ± 0.156
0.414HisHis: 0.414 ± 0.093
1.318HisIle: 1.318 ± 0.132
1.205HisLys: 1.205 ± 0.162
1.262HisLeu: 1.262 ± 0.146
0.471HisMet: 0.471 ± 0.097
0.829HisAsn: 0.829 ± 0.106
0.848HisPro: 0.848 ± 0.108
0.584HisGln: 0.584 ± 0.095
0.829HisArg: 0.829 ± 0.147
1.055HisSer: 1.055 ± 0.146
0.885HisThr: 0.885 ± 0.13
1.168HisVal: 1.168 ± 0.135
0.264HisTrp: 0.264 ± 0.074
0.64HisTyr: 0.64 ± 0.086
0.0HisXaa: 0.0 ± 0.0
Ile
5.236IleAla: 5.236 ± 0.387
0.753IleCys: 0.753 ± 0.13
4.916IleAsp: 4.916 ± 0.325
5.142IleGlu: 5.142 ± 0.347
2.524IlePhe: 2.524 ± 0.213
3.88IleGly: 3.88 ± 0.236
1.036IleHis: 1.036 ± 0.135
4.991IleIle: 4.991 ± 0.352
6.837IleLys: 6.837 ± 0.379
4.539IleLeu: 4.539 ± 0.285
1.601IleMet: 1.601 ± 0.172
4.935IleAsn: 4.935 ± 0.3
2.731IlePro: 2.731 ± 0.197
2.524IleGln: 2.524 ± 0.214
3.409IleArg: 3.409 ± 0.271
4.784IleSer: 4.784 ± 0.311
4.822IleThr: 4.822 ± 0.295
4.502IleVal: 4.502 ± 0.285
0.64IleTrp: 0.64 ± 0.114
2.505IleTyr: 2.505 ± 0.225
0.0IleXaa: 0.0 ± 0.0
Lys
5.952LysAla: 5.952 ± 0.41
0.678LysCys: 0.678 ± 0.108
4.483LysAsp: 4.483 ± 0.262
5.989LysGlu: 5.989 ± 0.375
3.466LysPhe: 3.466 ± 0.269
4.502LysGly: 4.502 ± 0.268
1.526LysHis: 1.526 ± 0.163
5.01LysIle: 5.01 ± 0.358
4.765LysLys: 4.765 ± 0.398
6.423LysLeu: 6.423 ± 0.369
2.26LysMet: 2.26 ± 0.198
4.219LysAsn: 4.219 ± 0.265
2.279LysPro: 2.279 ± 0.213
2.223LysGln: 2.223 ± 0.226
3.07LysArg: 3.07 ± 0.218
4.407LysSer: 4.407 ± 0.236
4.294LysThr: 4.294 ± 0.28
4.803LysVal: 4.803 ± 0.295
1.243LysTrp: 1.243 ± 0.134
3.334LysTyr: 3.334 ± 0.273
0.0LysXaa: 0.0 ± 0.0
Leu
5.274LeuAla: 5.274 ± 0.319
0.904LeuCys: 0.904 ± 0.127
5.142LeuAsp: 5.142 ± 0.318
4.954LeuGlu: 4.954 ± 0.29
3.258LeuPhe: 3.258 ± 0.27
4.313LeuGly: 4.313 ± 0.291
1.243LeuHis: 1.243 ± 0.156
4.746LeuIle: 4.746 ± 0.348
5.952LeuLys: 5.952 ± 0.354
5.387LeuLeu: 5.387 ± 0.369
2.128LeuMet: 2.128 ± 0.2
4.935LeuAsn: 4.935 ± 0.331
2.976LeuPro: 2.976 ± 0.221
2.75LeuGln: 2.75 ± 0.253
3.767LeuArg: 3.767 ± 0.229
4.445LeuSer: 4.445 ± 0.321
4.69LeuThr: 4.69 ± 0.301
4.841LeuVal: 4.841 ± 0.283
0.697LeuTrp: 0.697 ± 0.124
3.183LeuTyr: 3.183 ± 0.229
0.0LeuXaa: 0.0 ± 0.0
Met
2.091MetAla: 2.091 ± 0.218
0.358MetCys: 0.358 ± 0.071
1.337MetAsp: 1.337 ± 0.136
1.846MetGlu: 1.846 ± 0.209
1.318MetPhe: 1.318 ± 0.158
1.187MetGly: 1.187 ± 0.148
0.283MetHis: 0.283 ± 0.074
1.676MetIle: 1.676 ± 0.172
2.298MetLys: 2.298 ± 0.242
2.11MetLeu: 2.11 ± 0.18
0.848MetMet: 0.848 ± 0.145
1.544MetAsn: 1.544 ± 0.163
0.791MetPro: 0.791 ± 0.113
0.753MetGln: 0.753 ± 0.121
0.998MetArg: 0.998 ± 0.127
2.015MetSer: 2.015 ± 0.183
1.62MetThr: 1.62 ± 0.163
1.526MetVal: 1.526 ± 0.17
0.188MetTrp: 0.188 ± 0.069
0.942MetTyr: 0.942 ± 0.133
0.0MetXaa: 0.0 ± 0.0
Asn
3.729AsnAla: 3.729 ± 0.278
0.452AsnCys: 0.452 ± 0.109
3.089AsnAsp: 3.089 ± 0.255
4.163AsnGlu: 4.163 ± 0.21
2.938AsnPhe: 2.938 ± 0.191
3.993AsnGly: 3.993 ± 0.351
0.961AsnHis: 0.961 ± 0.125
4.049AsnIle: 4.049 ± 0.295
4.181AsnLys: 4.181 ± 0.34
4.049AsnLeu: 4.049 ± 0.241
1.695AsnMet: 1.695 ± 0.174
3.315AsnAsn: 3.315 ± 0.254
2.637AsnPro: 2.637 ± 0.229
1.413AsnGln: 1.413 ± 0.192
2.223AsnArg: 2.223 ± 0.199
3.447AsnSer: 3.447 ± 0.255
3.108AsnThr: 3.108 ± 0.245
3.428AsnVal: 3.428 ± 0.245
0.678AsnTrp: 0.678 ± 0.117
2.128AsnTyr: 2.128 ± 0.18
0.0AsnXaa: 0.0 ± 0.0
Pro
2.128ProAla: 2.128 ± 0.186
0.414ProCys: 0.414 ± 0.083
2.354ProAsp: 2.354 ± 0.209
3.127ProGlu: 3.127 ± 0.292
1.733ProPhe: 1.733 ± 0.171
2.336ProGly: 2.336 ± 0.212
0.527ProHis: 0.527 ± 0.104
2.185ProIle: 2.185 ± 0.184
2.392ProLys: 2.392 ± 0.237
2.317ProLeu: 2.317 ± 0.199
0.753ProMet: 0.753 ± 0.098
1.978ProAsn: 1.978 ± 0.186
1.017ProPro: 1.017 ± 0.16
1.074ProGln: 1.074 ± 0.146
1.337ProArg: 1.337 ± 0.199
2.392ProSer: 2.392 ± 0.173
2.373ProThr: 2.373 ± 0.203
2.806ProVal: 2.806 ± 0.2
0.753ProTrp: 0.753 ± 0.112
1.375ProTyr: 1.375 ± 0.165
0.0ProXaa: 0.0 ± 0.0
Gln
2.354GlnAla: 2.354 ± 0.257
0.339GlnCys: 0.339 ± 0.081
1.714GlnAsp: 1.714 ± 0.199
2.486GlnGlu: 2.486 ± 0.216
1.544GlnPhe: 1.544 ± 0.168
2.053GlnGly: 2.053 ± 0.201
0.584GlnHis: 0.584 ± 0.123
2.844GlnIle: 2.844 ± 0.195
2.317GlnLys: 2.317 ± 0.227
2.882GlnLeu: 2.882 ± 0.255
0.998GlnMet: 0.998 ± 0.134
1.526GlnAsn: 1.526 ± 0.172
1.092GlnPro: 1.092 ± 0.131
1.036GlnGln: 1.036 ± 0.128
1.563GlnArg: 1.563 ± 0.162
1.413GlnSer: 1.413 ± 0.162
1.865GlnThr: 1.865 ± 0.177
2.204GlnVal: 2.204 ± 0.22
0.753GlnTrp: 0.753 ± 0.114
1.469GlnTyr: 1.469 ± 0.151
0.0GlnXaa: 0.0 ± 0.0
Arg
2.75ArgAla: 2.75 ± 0.223
0.565ArgCys: 0.565 ± 0.103
2.769ArgAsp: 2.769 ± 0.206
3.277ArgGlu: 3.277 ± 0.253
2.128ArgPhe: 2.128 ± 0.213
2.637ArgGly: 2.637 ± 0.226
0.716ArgHis: 0.716 ± 0.111
3.108ArgIle: 3.108 ± 0.218
3.409ArgLys: 3.409 ± 0.288
3.635ArgLeu: 3.635 ± 0.267
1.111ArgMet: 1.111 ± 0.159
2.053ArgAsn: 2.053 ± 0.169
1.375ArgPro: 1.375 ± 0.17
1.733ArgGln: 1.733 ± 0.164
2.072ArgArg: 2.072 ± 0.204
2.656ArgSer: 2.656 ± 0.204
2.223ArgThr: 2.223 ± 0.199
3.108ArgVal: 3.108 ± 0.258
0.546ArgTrp: 0.546 ± 0.124
1.676ArgTyr: 1.676 ± 0.198
0.0ArgXaa: 0.0 ± 0.0
Ser
3.371SerAla: 3.371 ± 0.258
0.622SerCys: 0.622 ± 0.112
4.068SerAsp: 4.068 ± 0.324
4.426SerGlu: 4.426 ± 0.299
2.806SerPhe: 2.806 ± 0.2
4.125SerGly: 4.125 ± 0.332
1.13SerHis: 1.13 ± 0.145
4.897SerIle: 4.897 ± 0.279
4.558SerLys: 4.558 ± 0.309
5.085SerLeu: 5.085 ± 0.307
1.318SerMet: 1.318 ± 0.14
3.032SerAsn: 3.032 ± 0.233
2.467SerPro: 2.467 ± 0.225
2.204SerGln: 2.204 ± 0.235
3.07SerArg: 3.07 ± 0.328
4.728SerSer: 4.728 ± 0.37
3.748SerThr: 3.748 ± 0.326
4.163SerVal: 4.163 ± 0.248
0.961SerTrp: 0.961 ± 0.15
2.769SerTyr: 2.769 ± 0.212
0.0SerXaa: 0.0 ± 0.0
Thr
3.899ThrAla: 3.899 ± 0.321
0.603ThrCys: 0.603 ± 0.11
3.767ThrAsp: 3.767 ± 0.289
4.163ThrGlu: 4.163 ± 0.301
2.392ThrPhe: 2.392 ± 0.202
4.2ThrGly: 4.2 ± 0.344
1.262ThrHis: 1.262 ± 0.165
4.351ThrIle: 4.351 ± 0.297
4.163ThrLys: 4.163 ± 0.281
4.276ThrLeu: 4.276 ± 0.3
1.149ThrMet: 1.149 ± 0.152
2.938ThrAsn: 2.938 ± 0.227
2.788ThrPro: 2.788 ± 0.256
1.846ThrGln: 1.846 ± 0.237
2.599ThrArg: 2.599 ± 0.249
3.466ThrSer: 3.466 ± 0.314
3.673ThrThr: 3.673 ± 0.308
4.539ThrVal: 4.539 ± 0.372
0.904ThrTrp: 0.904 ± 0.132
2.675ThrTyr: 2.675 ± 0.179
0.0ThrXaa: 0.0 ± 0.0
Val
4.238ValAla: 4.238 ± 0.27
0.904ValCys: 0.904 ± 0.117
4.426ValAsp: 4.426 ± 0.306
5.575ValGlu: 5.575 ± 0.315
2.731ValPhe: 2.731 ± 0.213
3.654ValGly: 3.654 ± 0.323
1.187ValHis: 1.187 ± 0.137
4.746ValIle: 4.746 ± 0.291
5.745ValLys: 5.745 ± 0.339
4.633ValLeu: 4.633 ± 0.299
1.507ValMet: 1.507 ± 0.16
3.748ValAsn: 3.748 ± 0.291
2.11ValPro: 2.11 ± 0.17
2.279ValGln: 2.279 ± 0.217
3.032ValArg: 3.032 ± 0.246
4.615ValSer: 4.615 ± 0.24
4.031ValThr: 4.031 ± 0.291
4.633ValVal: 4.633 ± 0.294
0.791ValTrp: 0.791 ± 0.134
2.788ValTyr: 2.788 ± 0.207
0.0ValXaa: 0.0 ± 0.0
Trp
0.848TrpAla: 0.848 ± 0.117
0.151TrpCys: 0.151 ± 0.046
0.866TrpAsp: 0.866 ± 0.116
0.716TrpGlu: 0.716 ± 0.113
0.697TrpPhe: 0.697 ± 0.107
0.622TrpGly: 0.622 ± 0.116
0.17TrpHis: 0.17 ± 0.058
1.074TrpIle: 1.074 ± 0.136
1.356TrpLys: 1.356 ± 0.142
0.923TrpLeu: 0.923 ± 0.135
0.49TrpMet: 0.49 ± 0.096
0.961TrpAsn: 0.961 ± 0.126
0.49TrpPro: 0.49 ± 0.097
0.527TrpGln: 0.527 ± 0.103
0.452TrpArg: 0.452 ± 0.089
0.81TrpSer: 0.81 ± 0.124
0.904TrpThr: 0.904 ± 0.114
1.055TrpVal: 1.055 ± 0.128
0.17TrpTrp: 0.17 ± 0.057
0.772TrpTyr: 0.772 ± 0.11
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.919TyrAla: 2.919 ± 0.205
0.546TyrCys: 0.546 ± 0.109
2.938TyrAsp: 2.938 ± 0.226
2.788TyrGlu: 2.788 ± 0.256
1.94TyrPhe: 1.94 ± 0.179
2.486TyrGly: 2.486 ± 0.26
0.923TyrHis: 0.923 ± 0.115
2.863TyrIle: 2.863 ± 0.226
2.806TyrLys: 2.806 ± 0.266
2.75TyrLeu: 2.75 ± 0.277
1.187TyrMet: 1.187 ± 0.142
2.618TyrAsn: 2.618 ± 0.219
1.676TyrPro: 1.676 ± 0.168
1.526TyrGln: 1.526 ± 0.162
1.582TyrArg: 1.582 ± 0.18
3.032TyrSer: 3.032 ± 0.245
2.58TyrThr: 2.58 ± 0.191
3.277TyrVal: 3.277 ± 0.237
0.659TyrTrp: 0.659 ± 0.09
1.827TyrTyr: 1.827 ± 0.193
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 267 proteins (53094 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski