Amino acid dipepetide frequency for Salmonella phage SSE121

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.833AlaAla: 5.833 ± 0.484
0.658AlaCys: 0.658 ± 0.124
4.471AlaAsp: 4.471 ± 0.315
4.471AlaGlu: 4.471 ± 0.397
3.018AlaPhe: 3.018 ± 0.286
5.538AlaGly: 5.538 ± 0.411
1.157AlaHis: 1.157 ± 0.155
3.972AlaIle: 3.972 ± 0.283
5.242AlaLys: 5.242 ± 0.418
5.719AlaLeu: 5.719 ± 0.4
2.496AlaMet: 2.496 ± 0.279
3.359AlaAsn: 3.359 ± 0.287
2.36AlaPro: 2.36 ± 0.224
2.519AlaGln: 2.519 ± 0.242
3.858AlaArg: 3.858 ± 0.34
3.835AlaSer: 3.835 ± 0.269
3.767AlaThr: 3.767 ± 0.359
4.902AlaVal: 4.902 ± 0.29
1.271AlaTrp: 1.271 ± 0.187
2.882AlaTyr: 2.882 ± 0.29
0.0AlaXaa: 0.0 ± 0.0
Cys
0.953CysAla: 0.953 ± 0.165
0.182CysCys: 0.182 ± 0.064
0.908CysAsp: 0.908 ± 0.15
0.704CysGlu: 0.704 ± 0.149
0.749CysPhe: 0.749 ± 0.133
0.976CysGly: 0.976 ± 0.163
0.318CysHis: 0.318 ± 0.08
0.613CysIle: 0.613 ± 0.11
0.772CysLys: 0.772 ± 0.157
1.135CysLeu: 1.135 ± 0.149
0.272CysMet: 0.272 ± 0.08
0.817CysAsn: 0.817 ± 0.146
0.454CysPro: 0.454 ± 0.106
0.318CysGln: 0.318 ± 0.093
0.658CysArg: 0.658 ± 0.141
0.817CysSer: 0.817 ± 0.157
0.431CysThr: 0.431 ± 0.1
0.749CysVal: 0.749 ± 0.135
0.25CysTrp: 0.25 ± 0.08
0.545CysTyr: 0.545 ± 0.118
0.0CysXaa: 0.0 ± 0.0
Asp
4.539AspAla: 4.539 ± 0.308
0.522AspCys: 0.522 ± 0.124
4.085AspAsp: 4.085 ± 0.346
4.289AspGlu: 4.289 ± 0.363
3.155AspPhe: 3.155 ± 0.225
6.218AspGly: 6.218 ± 0.325
1.339AspHis: 1.339 ± 0.168
3.858AspIle: 3.858 ± 0.271
3.631AspLys: 3.631 ± 0.326
5.719AspLeu: 5.719 ± 0.347
1.793AspMet: 1.793 ± 0.187
3.359AspAsn: 3.359 ± 0.295
3.291AspPro: 3.291 ± 0.251
2.111AspGln: 2.111 ± 0.26
2.633AspArg: 2.633 ± 0.239
3.631AspSer: 3.631 ± 0.299
3.404AspThr: 3.404 ± 0.304
4.698AspVal: 4.698 ± 0.388
1.498AspTrp: 1.498 ± 0.196
2.542AspTyr: 2.542 ± 0.23
0.0AspXaa: 0.0 ± 0.0
Glu
5.084GluAla: 5.084 ± 0.386
1.021GluCys: 1.021 ± 0.15
3.722GluAsp: 3.722 ± 0.293
5.56GluGlu: 5.56 ± 0.438
2.837GluPhe: 2.837 ± 0.329
4.425GluGly: 4.425 ± 0.343
1.407GluHis: 1.407 ± 0.222
3.949GluIle: 3.949 ± 0.333
5.538GluLys: 5.538 ± 0.462
5.424GluLeu: 5.424 ± 0.329
2.338GluMet: 2.338 ± 0.198
3.086GluAsn: 3.086 ± 0.252
1.77GluPro: 1.77 ± 0.218
2.496GluGln: 2.496 ± 0.284
3.313GluArg: 3.313 ± 0.271
3.404GluSer: 3.404 ± 0.32
3.291GluThr: 3.291 ± 0.268
4.13GluVal: 4.13 ± 0.346
1.294GluTrp: 1.294 ± 0.179
2.496GluTyr: 2.496 ± 0.258
0.0GluXaa: 0.0 ± 0.0
Phe
2.837PheAla: 2.837 ± 0.289
0.726PheCys: 0.726 ± 0.148
3.291PheAsp: 3.291 ± 0.261
2.133PheGlu: 2.133 ± 0.252
1.702PhePhe: 1.702 ± 0.227
3.45PheGly: 3.45 ± 0.259
0.749PheHis: 0.749 ± 0.114
2.565PheIle: 2.565 ± 0.233
2.542PheLys: 2.542 ± 0.231
2.746PheLeu: 2.746 ± 0.23
1.475PheMet: 1.475 ± 0.137
2.201PheAsn: 2.201 ± 0.24
1.498PhePro: 1.498 ± 0.18
1.384PheGln: 1.384 ± 0.165
1.952PheArg: 1.952 ± 0.211
2.837PheSer: 2.837 ± 0.227
2.791PheThr: 2.791 ± 0.269
3.359PheVal: 3.359 ± 0.255
0.704PheTrp: 0.704 ± 0.134
1.43PheTyr: 1.43 ± 0.173
0.0PheXaa: 0.0 ± 0.0
Gly
4.289GlyAla: 4.289 ± 0.385
0.908GlyCys: 0.908 ± 0.155
5.174GlyAsp: 5.174 ± 0.69
4.925GlyGlu: 4.925 ± 0.326
3.677GlyPhe: 3.677 ± 0.325
5.81GlyGly: 5.81 ± 0.66
1.203GlyHis: 1.203 ± 0.181
4.153GlyIle: 4.153 ± 0.313
5.742GlyLys: 5.742 ± 0.362
5.379GlyLeu: 5.379 ± 0.325
1.77GlyMet: 1.77 ± 0.177
3.949GlyAsn: 3.949 ± 0.326
2.428GlyPro: 2.428 ± 0.814
2.769GlyGln: 2.769 ± 0.25
3.45GlyArg: 3.45 ± 0.269
4.04GlySer: 4.04 ± 0.364
4.38GlyThr: 4.38 ± 0.446
5.583GlyVal: 5.583 ± 0.346
1.634GlyTrp: 1.634 ± 0.198
3.382GlyTyr: 3.382 ± 0.285
0.0GlyXaa: 0.0 ± 0.0
His
0.953HisAla: 0.953 ± 0.163
0.34HisCys: 0.34 ± 0.08
0.908HisAsp: 0.908 ± 0.145
1.203HisGlu: 1.203 ± 0.175
0.84HisPhe: 0.84 ± 0.155
1.589HisGly: 1.589 ± 0.246
0.522HisHis: 0.522 ± 0.118
1.362HisIle: 1.362 ± 0.174
1.271HisLys: 1.271 ± 0.162
1.135HisLeu: 1.135 ± 0.134
0.749HisMet: 0.749 ± 0.145
0.84HisAsn: 0.84 ± 0.134
1.021HisPro: 1.021 ± 0.169
0.522HisGln: 0.522 ± 0.105
0.84HisArg: 0.84 ± 0.149
0.885HisSer: 0.885 ± 0.137
1.294HisThr: 1.294 ± 0.167
0.817HisVal: 0.817 ± 0.15
0.386HisTrp: 0.386 ± 0.087
0.999HisTyr: 0.999 ± 0.155
0.0HisXaa: 0.0 ± 0.0
Ile
4.221IleAla: 4.221 ± 0.34
0.635IleCys: 0.635 ± 0.111
4.267IleAsp: 4.267 ± 0.364
4.13IleGlu: 4.13 ± 0.312
2.247IlePhe: 2.247 ± 0.204
3.518IleGly: 3.518 ± 0.323
1.316IleHis: 1.316 ± 0.159
3.722IleIle: 3.722 ± 0.28
3.427IleLys: 3.427 ± 0.274
4.471IleLeu: 4.471 ± 0.276
1.203IleMet: 1.203 ± 0.163
2.655IleAsn: 2.655 ± 0.274
2.86IlePro: 2.86 ± 0.256
1.974IleGln: 1.974 ± 0.226
2.928IleArg: 2.928 ± 0.299
3.767IleSer: 3.767 ± 0.309
3.177IleThr: 3.177 ± 0.271
3.79IleVal: 3.79 ± 0.27
0.613IleTrp: 0.613 ± 0.11
2.292IleTyr: 2.292 ± 0.243
0.0IleXaa: 0.0 ± 0.0
Lys
5.742LysAla: 5.742 ± 0.409
0.658LysCys: 0.658 ± 0.155
4.993LysAsp: 4.993 ± 0.41
5.038LysGlu: 5.038 ± 0.329
2.542LysPhe: 2.542 ± 0.233
4.857LysGly: 4.857 ± 0.502
1.657LysHis: 1.657 ± 0.234
3.79LysIle: 3.79 ± 0.337
4.902LysLys: 4.902 ± 0.387
4.925LysLeu: 4.925 ± 0.31
2.224LysMet: 2.224 ± 0.246
3.2LysAsn: 3.2 ± 0.303
2.224LysPro: 2.224 ± 0.297
2.02LysGln: 2.02 ± 0.224
3.472LysArg: 3.472 ± 0.349
3.404LysSer: 3.404 ± 0.271
3.972LysThr: 3.972 ± 0.251
5.447LysVal: 5.447 ± 0.355
1.044LysTrp: 1.044 ± 0.133
2.428LysTyr: 2.428 ± 0.207
0.0LysXaa: 0.0 ± 0.0
Leu
6.082LeuAla: 6.082 ± 0.398
1.157LeuCys: 1.157 ± 0.18
5.242LeuAsp: 5.242 ± 0.304
5.991LeuGlu: 5.991 ± 0.382
2.791LeuPhe: 2.791 ± 0.312
4.584LeuGly: 4.584 ± 0.302
1.203LeuHis: 1.203 ± 0.162
4.04LeuIle: 4.04 ± 0.272
6.037LeuLys: 6.037 ± 0.389
6.128LeuLeu: 6.128 ± 0.467
2.315LeuMet: 2.315 ± 0.234
3.699LeuAsn: 3.699 ± 0.331
3.268LeuPro: 3.268 ± 0.239
2.565LeuGln: 2.565 ± 0.238
4.108LeuArg: 4.108 ± 0.336
5.22LeuSer: 5.22 ± 0.356
4.879LeuThr: 4.879 ± 0.364
4.902LeuVal: 4.902 ± 0.336
1.271LeuTrp: 1.271 ± 0.176
3.268LeuTyr: 3.268 ± 0.254
0.0LeuXaa: 0.0 ± 0.0
Met
2.36MetAla: 2.36 ± 0.224
0.113MetCys: 0.113 ± 0.052
1.611MetAsp: 1.611 ± 0.193
1.747MetGlu: 1.747 ± 0.221
1.18MetPhe: 1.18 ± 0.16
1.793MetGly: 1.793 ± 0.218
0.272MetHis: 0.272 ± 0.064
1.884MetIle: 1.884 ± 0.223
2.837MetLys: 2.837 ± 0.261
2.292MetLeu: 2.292 ± 0.214
0.953MetMet: 0.953 ± 0.188
1.294MetAsn: 1.294 ± 0.182
0.885MetPro: 0.885 ± 0.14
1.112MetGln: 1.112 ± 0.16
1.157MetArg: 1.157 ± 0.151
2.111MetSer: 2.111 ± 0.209
2.043MetThr: 2.043 ± 0.211
1.838MetVal: 1.838 ± 0.213
0.454MetTrp: 0.454 ± 0.088
1.203MetTyr: 1.203 ± 0.156
0.0MetXaa: 0.0 ± 0.0
Asn
3.563AsnAla: 3.563 ± 0.29
0.545AsnCys: 0.545 ± 0.115
2.746AsnAsp: 2.746 ± 0.259
2.36AsnGlu: 2.36 ± 0.25
2.043AsnPhe: 2.043 ± 0.173
4.38AsnGly: 4.38 ± 0.416
0.908AsnHis: 0.908 ± 0.151
3.155AsnIle: 3.155 ± 0.27
2.95AsnLys: 2.95 ± 0.241
3.926AsnLeu: 3.926 ± 0.344
1.498AsnMet: 1.498 ± 0.205
2.133AsnAsn: 2.133 ± 0.276
2.383AsnPro: 2.383 ± 0.224
1.906AsnGln: 1.906 ± 0.202
1.906AsnArg: 1.906 ± 0.181
2.655AsnSer: 2.655 ± 0.297
2.678AsnThr: 2.678 ± 0.316
3.291AsnVal: 3.291 ± 0.322
0.885AsnTrp: 0.885 ± 0.159
1.861AsnTyr: 1.861 ± 0.234
0.0AsnXaa: 0.0 ± 0.0
Pro
2.383ProAla: 2.383 ± 0.266
0.295ProCys: 0.295 ± 0.069
2.723ProAsp: 2.723 ± 0.259
3.404ProGlu: 3.404 ± 0.255
1.838ProPhe: 1.838 ± 0.184
2.269ProGly: 2.269 ± 0.24
0.613ProHis: 0.613 ± 0.138
1.452ProIle: 1.452 ± 0.187
2.451ProLys: 2.451 ± 0.262
2.633ProLeu: 2.633 ± 0.258
0.885ProMet: 0.885 ± 0.134
1.77ProAsn: 1.77 ± 0.203
0.976ProPro: 0.976 ± 0.187
2.02ProGln: 2.02 ± 0.437
1.498ProArg: 1.498 ± 0.182
2.065ProSer: 2.065 ± 0.204
2.633ProThr: 2.633 ± 0.254
3.109ProVal: 3.109 ± 0.258
0.363ProTrp: 0.363 ± 0.087
1.634ProTyr: 1.634 ± 0.2
0.0ProXaa: 0.0 ± 0.0
Gln
2.905GlnAla: 2.905 ± 0.294
0.34GlnCys: 0.34 ± 0.078
1.952GlnAsp: 1.952 ± 0.198
2.201GlnGlu: 2.201 ± 0.205
1.634GlnPhe: 1.634 ± 0.161
2.95GlnGly: 2.95 ± 0.718
0.499GlnHis: 0.499 ± 0.117
1.997GlnIle: 1.997 ± 0.222
2.088GlnLys: 2.088 ± 0.238
2.678GlnLeu: 2.678 ± 0.237
1.248GlnMet: 1.248 ± 0.167
1.543GlnAsn: 1.543 ± 0.156
1.044GlnPro: 1.044 ± 0.147
1.589GlnGln: 1.589 ± 0.236
1.997GlnArg: 1.997 ± 0.221
2.156GlnSer: 2.156 ± 0.214
1.861GlnThr: 1.861 ± 0.189
2.383GlnVal: 2.383 ± 0.23
0.522GlnTrp: 0.522 ± 0.098
1.702GlnTyr: 1.702 ± 0.183
0.0GlnXaa: 0.0 ± 0.0
Arg
2.814ArgAla: 2.814 ± 0.294
0.93ArgCys: 0.93 ± 0.169
3.563ArgAsp: 3.563 ± 0.274
3.109ArgGlu: 3.109 ± 0.258
1.906ArgPhe: 1.906 ± 0.205
3.654ArgGly: 3.654 ± 0.298
0.862ArgHis: 0.862 ± 0.134
2.723ArgIle: 2.723 ± 0.248
3.54ArgLys: 3.54 ± 0.322
4.448ArgLeu: 4.448 ± 0.367
1.362ArgMet: 1.362 ± 0.16
2.451ArgAsn: 2.451 ± 0.234
1.498ArgPro: 1.498 ± 0.191
1.929ArgGln: 1.929 ± 0.234
2.769ArgArg: 2.769 ± 0.313
2.882ArgSer: 2.882 ± 0.289
2.406ArgThr: 2.406 ± 0.225
2.95ArgVal: 2.95 ± 0.279
0.84ArgTrp: 0.84 ± 0.114
1.634ArgTyr: 1.634 ± 0.196
0.0ArgXaa: 0.0 ± 0.0
Ser
4.221SerAla: 4.221 ± 0.276
0.794SerCys: 0.794 ± 0.166
4.176SerAsp: 4.176 ± 0.305
3.132SerGlu: 3.132 ± 0.304
2.769SerPhe: 2.769 ± 0.267
4.879SerGly: 4.879 ± 0.381
0.953SerHis: 0.953 ± 0.148
3.699SerIle: 3.699 ± 0.29
3.699SerLys: 3.699 ± 0.317
4.562SerLeu: 4.562 ± 0.346
1.566SerMet: 1.566 ± 0.155
2.428SerAsn: 2.428 ± 0.24
2.088SerPro: 2.088 ± 0.19
1.974SerGln: 1.974 ± 0.177
2.814SerArg: 2.814 ± 0.23
3.858SerSer: 3.858 ± 0.371
3.177SerThr: 3.177 ± 0.302
4.494SerVal: 4.494 ± 0.393
1.044SerTrp: 1.044 ± 0.156
2.133SerTyr: 2.133 ± 0.215
0.0SerXaa: 0.0 ± 0.0
Thr
4.199ThrAla: 4.199 ± 0.389
0.726ThrCys: 0.726 ± 0.133
3.518ThrAsp: 3.518 ± 0.34
3.631ThrGlu: 3.631 ± 0.272
2.723ThrPhe: 2.723 ± 0.213
5.242ThrGly: 5.242 ± 0.53
0.976ThrHis: 0.976 ± 0.126
3.336ThrIle: 3.336 ± 0.31
3.745ThrLys: 3.745 ± 0.351
4.766ThrLeu: 4.766 ± 0.287
1.362ThrMet: 1.362 ± 0.168
2.882ThrAsn: 2.882 ± 0.319
2.565ThrPro: 2.565 ± 0.218
2.088ThrGln: 2.088 ± 0.224
2.428ThrArg: 2.428 ± 0.199
2.86ThrSer: 2.86 ± 0.246
3.767ThrThr: 3.767 ± 0.417
4.698ThrVal: 4.698 ± 0.433
1.021ThrTrp: 1.021 ± 0.17
2.043ThrTyr: 2.043 ± 0.172
0.0ThrXaa: 0.0 ± 0.0
Val
4.63ValAla: 4.63 ± 0.336
1.089ValCys: 1.089 ± 0.163
5.061ValAsp: 5.061 ± 0.339
5.152ValGlu: 5.152 ± 0.38
2.655ValPhe: 2.655 ± 0.251
4.335ValGly: 4.335 ± 0.389
1.226ValHis: 1.226 ± 0.169
3.926ValIle: 3.926 ± 0.266
4.97ValLys: 4.97 ± 0.411
5.129ValLeu: 5.129 ± 0.311
1.952ValMet: 1.952 ± 0.206
3.563ValAsn: 3.563 ± 0.322
2.224ValPro: 2.224 ± 0.233
1.906ValGln: 1.906 ± 0.192
3.608ValArg: 3.608 ± 0.329
4.516ValSer: 4.516 ± 0.273
5.016ValThr: 5.016 ± 0.467
6.627ValVal: 6.627 ± 0.56
1.226ValTrp: 1.226 ± 0.162
2.928ValTyr: 2.928 ± 0.246
0.0ValXaa: 0.0 ± 0.0
Trp
1.089TrpAla: 1.089 ± 0.156
0.295TrpCys: 0.295 ± 0.084
1.157TrpAsp: 1.157 ± 0.195
1.248TrpGlu: 1.248 ± 0.144
0.681TrpPhe: 0.681 ± 0.153
1.067TrpGly: 1.067 ± 0.162
0.409TrpHis: 0.409 ± 0.085
0.908TrpIle: 0.908 ± 0.142
0.953TrpLys: 0.953 ± 0.168
1.679TrpLeu: 1.679 ± 0.192
0.749TrpMet: 0.749 ± 0.118
0.772TrpAsn: 0.772 ± 0.152
0.477TrpPro: 0.477 ± 0.1
0.567TrpGln: 0.567 ± 0.115
0.704TrpArg: 0.704 ± 0.126
1.157TrpSer: 1.157 ± 0.181
1.089TrpThr: 1.089 ± 0.17
1.271TrpVal: 1.271 ± 0.168
0.386TrpTrp: 0.386 ± 0.095
0.772TrpTyr: 0.772 ± 0.118
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.633TyrAla: 2.633 ± 0.285
0.681TyrCys: 0.681 ± 0.128
2.746TyrAsp: 2.746 ± 0.246
2.156TyrGlu: 2.156 ± 0.201
1.384TyrPhe: 1.384 ± 0.173
3.132TyrGly: 3.132 ± 0.27
0.885TyrHis: 0.885 ± 0.141
2.111TyrIle: 2.111 ± 0.21
2.201TyrLys: 2.201 ± 0.25
3.858TyrLeu: 3.858 ± 0.324
0.908TyrMet: 0.908 ± 0.148
1.77TyrAsn: 1.77 ± 0.2
1.702TyrPro: 1.702 ± 0.202
1.521TyrGln: 1.521 ± 0.163
2.179TyrArg: 2.179 ± 0.224
2.36TyrSer: 2.36 ± 0.227
2.519TyrThr: 2.519 ± 0.252
2.701TyrVal: 2.701 ± 0.214
0.726TyrTrp: 0.726 ± 0.119
1.294TyrTyr: 1.294 ± 0.192
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 242 proteins (44064 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski