Amino acid dipepetide frequency for Serratia phage 2050HW

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.313AlaAla: 6.313 ± 0.365
0.451AlaCys: 0.451 ± 0.068
4.379AlaAsp: 4.379 ± 0.3
4.118AlaGlu: 4.118 ± 0.256
2.943AlaPhe: 2.943 ± 0.189
4.865AlaGly: 4.865 ± 0.301
0.997AlaHis: 0.997 ± 0.113
4.913AlaIle: 4.913 ± 0.222
4.343AlaLys: 4.343 ± 0.227
6.55AlaLeu: 6.55 ± 0.293
2.183AlaMet: 2.183 ± 0.14
4.236AlaAsn: 4.236 ± 0.262
2.48AlaPro: 2.48 ± 0.177
2.088AlaGln: 2.088 ± 0.163
2.907AlaArg: 2.907 ± 0.214
3.785AlaSer: 3.785 ± 0.215
4.711AlaThr: 4.711 ± 0.281
5.019AlaVal: 5.019 ± 0.273
0.831AlaTrp: 0.831 ± 0.127
2.266AlaTyr: 2.266 ± 0.149
0.0AlaXaa: 0.0 ± 0.0
Cys
0.593CysAla: 0.593 ± 0.08
0.119CysCys: 0.119 ± 0.037
0.463CysAsp: 0.463 ± 0.079
0.605CysGlu: 0.605 ± 0.094
0.487CysPhe: 0.487 ± 0.077
0.593CysGly: 0.593 ± 0.093
0.095CysHis: 0.095 ± 0.037
0.558CysIle: 0.558 ± 0.09
0.546CysLys: 0.546 ± 0.09
0.641CysLeu: 0.641 ± 0.086
0.225CysMet: 0.225 ± 0.049
0.593CysAsn: 0.593 ± 0.088
0.392CysPro: 0.392 ± 0.075
0.237CysGln: 0.237 ± 0.056
0.629CysArg: 0.629 ± 0.086
0.7CysSer: 0.7 ± 0.094
0.736CysThr: 0.736 ± 0.094
0.581CysVal: 0.581 ± 0.072
0.107CysTrp: 0.107 ± 0.038
0.498CysTyr: 0.498 ± 0.076
0.0CysXaa: 0.0 ± 0.0
Asp
4.141AspAla: 4.141 ± 0.224
0.641AspCys: 0.641 ± 0.098
3.726AspAsp: 3.726 ± 0.242
3.951AspGlu: 3.951 ± 0.219
3.026AspPhe: 3.026 ± 0.176
4.557AspGly: 4.557 ± 0.235
0.949AspHis: 0.949 ± 0.106
4.26AspIle: 4.26 ± 0.206
3.797AspLys: 3.797 ± 0.232
5.66AspLeu: 5.66 ± 0.258
1.531AspMet: 1.531 ± 0.133
3.595AspAsn: 3.595 ± 0.204
2.812AspPro: 2.812 ± 0.201
2.29AspGln: 2.29 ± 0.148
3.145AspArg: 3.145 ± 0.234
3.726AspSer: 3.726 ± 0.206
3.584AspThr: 3.584 ± 0.193
4.426AspVal: 4.426 ± 0.204
0.819AspTrp: 0.819 ± 0.081
3.061AspTyr: 3.061 ± 0.222
0.0AspXaa: 0.0 ± 0.0
Glu
4.616GluAla: 4.616 ± 0.312
0.463GluCys: 0.463 ± 0.067
3.845GluAsp: 3.845 ± 0.23
4.865GluGlu: 4.865 ± 0.324
2.895GluPhe: 2.895 ± 0.234
4.236GluGly: 4.236 ± 0.213
1.317GluHis: 1.317 ± 0.134
4.497GluIle: 4.497 ± 0.241
3.334GluLys: 3.334 ± 0.226
5.791GluLeu: 5.791 ± 0.361
2.005GluMet: 2.005 ± 0.16
2.99GluAsn: 2.99 ± 0.176
1.768GluPro: 1.768 ± 0.153
2.243GluGln: 2.243 ± 0.171
3.358GluArg: 3.358 ± 0.203
3.773GluSer: 3.773 ± 0.227
3.275GluThr: 3.275 ± 0.221
4.758GluVal: 4.758 ± 0.308
0.902GluTrp: 0.902 ± 0.108
2.599GluTyr: 2.599 ± 0.201
0.0GluXaa: 0.0 ± 0.0
Phe
2.812PheAla: 2.812 ± 0.153
0.427PheCys: 0.427 ± 0.067
3.085PheAsp: 3.085 ± 0.193
2.48PheGlu: 2.48 ± 0.183
1.839PhePhe: 1.839 ± 0.139
2.848PheGly: 2.848 ± 0.201
0.854PheHis: 0.854 ± 0.096
3.073PheIle: 3.073 ± 0.192
2.694PheLys: 2.694 ± 0.189
2.907PheLeu: 2.907 ± 0.199
1.127PheMet: 1.127 ± 0.13
2.967PheAsn: 2.967 ± 0.175
1.673PhePro: 1.673 ± 0.146
1.139PheGln: 1.139 ± 0.113
2.231PheArg: 2.231 ± 0.151
3.441PheSer: 3.441 ± 0.176
3.109PheThr: 3.109 ± 0.2
3.014PheVal: 3.014 ± 0.186
0.439PheTrp: 0.439 ± 0.071
1.685PheTyr: 1.685 ± 0.154
0.0PheXaa: 0.0 ± 0.0
Gly
3.904GlyAla: 3.904 ± 0.266
0.902GlyCys: 0.902 ± 0.113
3.987GlyAsp: 3.987 ± 0.258
4.284GlyGlu: 4.284 ± 0.262
2.99GlyPhe: 2.99 ± 0.193
4.26GlyGly: 4.26 ± 0.395
1.02GlyHis: 1.02 ± 0.108
4.853GlyIle: 4.853 ± 0.268
5.126GlyLys: 5.126 ± 0.227
4.972GlyLeu: 4.972 ± 0.232
1.875GlyMet: 1.875 ± 0.15
4.011GlyAsn: 4.011 ± 0.232
1.709GlyPro: 1.709 ± 0.137
1.946GlyGln: 1.946 ± 0.159
3.18GlyArg: 3.18 ± 0.233
4.794GlySer: 4.794 ± 0.264
4.948GlyThr: 4.948 ± 0.292
4.77GlyVal: 4.77 ± 0.232
0.973GlyTrp: 0.973 ± 0.105
3.073GlyTyr: 3.073 ± 0.174
0.0GlyXaa: 0.0 ± 0.0
His
0.854HisAla: 0.854 ± 0.109
0.297HisCys: 0.297 ± 0.064
0.842HisAsp: 0.842 ± 0.095
0.878HisGlu: 0.878 ± 0.126
0.997HisPhe: 0.997 ± 0.098
1.092HisGly: 1.092 ± 0.112
0.427HisHis: 0.427 ± 0.069
0.831HisIle: 0.831 ± 0.108
0.878HisLys: 0.878 ± 0.104
1.59HisLeu: 1.59 ± 0.181
0.427HisMet: 0.427 ± 0.07
0.807HisAsn: 0.807 ± 0.097
0.949HisPro: 0.949 ± 0.107
0.427HisGln: 0.427 ± 0.077
1.009HisArg: 1.009 ± 0.11
0.914HisSer: 0.914 ± 0.084
0.902HisThr: 0.902 ± 0.093
1.068HisVal: 1.068 ± 0.122
0.297HisTrp: 0.297 ± 0.054
0.985HisTyr: 0.985 ± 0.102
0.0HisXaa: 0.0 ± 0.0
Ile
4.533IleAla: 4.533 ± 0.232
0.522IleCys: 0.522 ± 0.076
4.901IleAsp: 4.901 ± 0.226
4.414IleGlu: 4.414 ± 0.241
2.326IlePhe: 2.326 ± 0.169
3.655IleGly: 3.655 ± 0.217
1.151IleHis: 1.151 ± 0.115
3.429IleIle: 3.429 ± 0.199
3.809IleLys: 3.809 ± 0.189
4.604IleLeu: 4.604 ± 0.213
1.4IleMet: 1.4 ± 0.145
3.999IleAsn: 3.999 ± 0.209
2.919IlePro: 2.919 ± 0.186
2.148IleGln: 2.148 ± 0.159
3.904IleArg: 3.904 ± 0.254
4.592IleSer: 4.592 ± 0.234
4.355IleThr: 4.355 ± 0.262
4.165IleVal: 4.165 ± 0.238
0.712IleTrp: 0.712 ± 0.092
2.136IleTyr: 2.136 ± 0.165
0.0IleXaa: 0.0 ± 0.0
Lys
4.272LysAla: 4.272 ± 0.216
0.534LysCys: 0.534 ± 0.081
3.857LysAsp: 3.857 ± 0.199
4.414LysGlu: 4.414 ± 0.264
2.504LysPhe: 2.504 ± 0.153
3.951LysGly: 3.951 ± 0.252
1.127LysHis: 1.127 ± 0.128
3.323LysIle: 3.323 ± 0.193
2.919LysLys: 2.919 ± 0.198
4.83LysLeu: 4.83 ± 0.261
1.768LysMet: 1.768 ± 0.153
2.255LysAsn: 2.255 ± 0.179
2.207LysPro: 2.207 ± 0.175
1.661LysGln: 1.661 ± 0.127
3.251LysArg: 3.251 ± 0.203
3.88LysSer: 3.88 ± 0.21
3.56LysThr: 3.56 ± 0.188
4.96LysVal: 4.96 ± 0.24
0.724LysTrp: 0.724 ± 0.083
2.35LysTyr: 2.35 ± 0.19
0.0LysXaa: 0.0 ± 0.0
Leu
6.194LeuAla: 6.194 ± 0.262
0.712LeuCys: 0.712 ± 0.091
5.245LeuAsp: 5.245 ± 0.251
5.53LeuGlu: 5.53 ± 0.263
2.978LeuPhe: 2.978 ± 0.221
5.304LeuGly: 5.304 ± 0.272
1.317LeuHis: 1.317 ± 0.123
5.079LeuIle: 5.079 ± 0.265
4.96LeuLys: 4.96 ± 0.296
6.337LeuLeu: 6.337 ± 0.286
2.255LeuMet: 2.255 ± 0.174
4.996LeuAsn: 4.996 ± 0.256
3.868LeuPro: 3.868 ± 0.208
2.373LeuGln: 2.373 ± 0.201
4.806LeuArg: 4.806 ± 0.21
6.182LeuSer: 6.182 ± 0.282
5.186LeuThr: 5.186 ± 0.266
5.482LeuVal: 5.482 ± 0.24
0.878LeuTrp: 0.878 ± 0.13
2.694LeuTyr: 2.694 ± 0.21
0.0LeuXaa: 0.0 ± 0.0
Met
2.255MetAla: 2.255 ± 0.142
0.261MetCys: 0.261 ± 0.05
1.887MetAsp: 1.887 ± 0.138
1.626MetGlu: 1.626 ± 0.156
1.032MetPhe: 1.032 ± 0.118
1.721MetGly: 1.721 ± 0.14
0.368MetHis: 0.368 ± 0.069
1.626MetIle: 1.626 ± 0.135
1.732MetLys: 1.732 ± 0.164
1.994MetLeu: 1.994 ± 0.177
0.665MetMet: 0.665 ± 0.099
1.507MetAsn: 1.507 ± 0.138
0.866MetPro: 0.866 ± 0.107
0.546MetGln: 0.546 ± 0.078
1.412MetArg: 1.412 ± 0.129
2.468MetSer: 2.468 ± 0.168
1.804MetThr: 1.804 ± 0.132
2.172MetVal: 2.172 ± 0.182
0.237MetTrp: 0.237 ± 0.059
0.819MetTyr: 0.819 ± 0.103
0.0MetXaa: 0.0 ± 0.0
Asn
3.963AsnAla: 3.963 ± 0.23
0.641AsnCys: 0.641 ± 0.099
3.572AsnAsp: 3.572 ± 0.239
3.382AsnGlu: 3.382 ± 0.221
2.516AsnPhe: 2.516 ± 0.195
4.58AsnGly: 4.58 ± 0.297
0.665AsnHis: 0.665 ± 0.091
2.895AsnIle: 2.895 ± 0.185
2.777AsnLys: 2.777 ± 0.169
4.663AsnLeu: 4.663 ± 0.219
1.543AsnMet: 1.543 ± 0.14
2.599AsnAsn: 2.599 ± 0.194
3.121AsnPro: 3.121 ± 0.227
1.863AsnGln: 1.863 ± 0.173
3.156AsnArg: 3.156 ± 0.216
3.441AsnSer: 3.441 ± 0.219
2.967AsnThr: 2.967 ± 0.176
3.631AsnVal: 3.631 ± 0.189
0.676AsnTrp: 0.676 ± 0.075
2.1AsnTyr: 2.1 ± 0.169
0.0AsnXaa: 0.0 ± 0.0
Pro
2.646ProAla: 2.646 ± 0.169
0.356ProCys: 0.356 ± 0.066
2.943ProAsp: 2.943 ± 0.195
3.251ProGlu: 3.251 ± 0.208
1.721ProPhe: 1.721 ± 0.16
3.097ProGly: 3.097 ± 0.2
0.665ProHis: 0.665 ± 0.08
2.385ProIle: 2.385 ± 0.184
2.077ProLys: 2.077 ± 0.168
2.67ProLeu: 2.67 ± 0.17
0.878ProMet: 0.878 ± 0.102
2.231ProAsn: 2.231 ± 0.17
1.222ProPro: 1.222 ± 0.115
0.961ProGln: 0.961 ± 0.111
1.875ProArg: 1.875 ± 0.15
2.504ProSer: 2.504 ± 0.159
2.955ProThr: 2.955 ± 0.215
3.453ProVal: 3.453 ± 0.22
0.51ProTrp: 0.51 ± 0.089
1.412ProTyr: 1.412 ± 0.127
0.0ProXaa: 0.0 ± 0.0
Gln
2.278GlnAla: 2.278 ± 0.181
0.32GlnCys: 0.32 ± 0.056
1.448GlnAsp: 1.448 ± 0.114
1.709GlnGlu: 1.709 ± 0.154
1.495GlnPhe: 1.495 ± 0.122
1.97GlnGly: 1.97 ± 0.164
0.427GlnHis: 0.427 ± 0.073
2.017GlnIle: 2.017 ± 0.159
1.353GlnLys: 1.353 ± 0.121
2.777GlnLeu: 2.777 ± 0.197
0.949GlnMet: 0.949 ± 0.079
1.293GlnAsn: 1.293 ± 0.14
1.21GlnPro: 1.21 ± 0.118
1.127GlnGln: 1.127 ± 0.124
2.077GlnArg: 2.077 ± 0.164
2.053GlnSer: 2.053 ± 0.157
1.97GlnThr: 1.97 ± 0.152
2.421GlnVal: 2.421 ± 0.19
0.51GlnTrp: 0.51 ± 0.09
1.483GlnTyr: 1.483 ± 0.131
0.0GlnXaa: 0.0 ± 0.0
Arg
3.406ArgAla: 3.406 ± 0.229
0.487ArgCys: 0.487 ± 0.077
3.311ArgAsp: 3.311 ± 0.233
3.263ArgGlu: 3.263 ± 0.205
2.599ArgPhe: 2.599 ± 0.166
3.619ArgGly: 3.619 ± 0.204
0.985ArgHis: 0.985 ± 0.1
3.453ArgIle: 3.453 ± 0.224
3.18ArgLys: 3.18 ± 0.199
4.604ArgLeu: 4.604 ± 0.25
1.436ArgMet: 1.436 ± 0.143
3.251ArgAsn: 3.251 ± 0.201
1.934ArgPro: 1.934 ± 0.162
1.875ArgGln: 1.875 ± 0.175
3.334ArgArg: 3.334 ± 0.274
3.263ArgSer: 3.263 ± 0.205
3.228ArgThr: 3.228 ± 0.193
3.809ArgVal: 3.809 ± 0.184
0.866ArgTrp: 0.866 ± 0.088
2.468ArgTyr: 2.468 ± 0.152
0.0ArgXaa: 0.0 ± 0.0
Ser
4.782SerAla: 4.782 ± 0.221
0.629SerCys: 0.629 ± 0.094
4.177SerAsp: 4.177 ± 0.276
3.821SerGlu: 3.821 ± 0.242
2.883SerPhe: 2.883 ± 0.202
4.853SerGly: 4.853 ± 0.298
1.092SerHis: 1.092 ± 0.1
4.284SerIle: 4.284 ± 0.244
3.987SerLys: 3.987 ± 0.201
5.423SerLeu: 5.423 ± 0.233
1.839SerMet: 1.839 ± 0.156
3.762SerAsn: 3.762 ± 0.212
2.563SerPro: 2.563 ± 0.22
1.887SerGln: 1.887 ± 0.153
3.429SerArg: 3.429 ± 0.204
4.212SerSer: 4.212 ± 0.245
4.189SerThr: 4.189 ± 0.231
4.723SerVal: 4.723 ± 0.249
0.914SerTrp: 0.914 ± 0.101
2.385SerTyr: 2.385 ± 0.16
0.0SerXaa: 0.0 ± 0.0
Thr
4.485ThrAla: 4.485 ± 0.221
0.522ThrCys: 0.522 ± 0.089
3.773ThrAsp: 3.773 ± 0.187
3.572ThrGlu: 3.572 ± 0.247
3.168ThrPhe: 3.168 ± 0.201
4.782ThrGly: 4.782 ± 0.349
1.02ThrHis: 1.02 ± 0.096
4.296ThrIle: 4.296 ± 0.221
3.868ThrLys: 3.868 ± 0.21
5.85ThrLeu: 5.85 ± 0.22
1.412ThrMet: 1.412 ± 0.118
3.014ThrAsn: 3.014 ± 0.233
3.026ThrPro: 3.026 ± 0.188
1.958ThrGln: 1.958 ± 0.128
3.145ThrArg: 3.145 ± 0.217
3.477ThrSer: 3.477 ± 0.219
3.904ThrThr: 3.904 ± 0.223
4.604ThrVal: 4.604 ± 0.204
0.973ThrTrp: 0.973 ± 0.131
2.385ThrTyr: 2.385 ± 0.203
0.0ThrXaa: 0.0 ± 0.0
Val
4.96ValAla: 4.96 ± 0.244
0.534ValCys: 0.534 ± 0.068
4.746ValAsp: 4.746 ± 0.247
4.604ValGlu: 4.604 ± 0.241
2.978ValPhe: 2.978 ± 0.211
4.177ValGly: 4.177 ± 0.255
1.104ValHis: 1.104 ± 0.131
5.019ValIle: 5.019 ± 0.277
4.616ValLys: 4.616 ± 0.238
5.411ValLeu: 5.411 ± 0.244
1.863ValMet: 1.863 ± 0.142
4.165ValAsn: 4.165 ± 0.239
3.085ValPro: 3.085 ± 0.221
2.231ValGln: 2.231 ± 0.168
3.738ValArg: 3.738 ± 0.208
5.031ValSer: 5.031 ± 0.223
4.948ValThr: 4.948 ± 0.239
5.518ValVal: 5.518 ± 0.27
0.771ValTrp: 0.771 ± 0.119
2.729ValTyr: 2.729 ± 0.189
0.0ValXaa: 0.0 ± 0.0
Trp
0.878TrpAla: 0.878 ± 0.093
0.154TrpCys: 0.154 ± 0.042
0.7TrpAsp: 0.7 ± 0.095
0.819TrpGlu: 0.819 ± 0.097
0.641TrpPhe: 0.641 ± 0.087
0.807TrpGly: 0.807 ± 0.096
0.202TrpHis: 0.202 ± 0.042
0.831TrpIle: 0.831 ± 0.103
0.676TrpLys: 0.676 ± 0.07
1.341TrpLeu: 1.341 ± 0.14
0.51TrpMet: 0.51 ± 0.078
0.522TrpAsn: 0.522 ± 0.085
0.261TrpPro: 0.261 ± 0.053
0.297TrpGln: 0.297 ± 0.055
0.914TrpArg: 0.914 ± 0.113
0.807TrpSer: 0.807 ± 0.102
0.712TrpThr: 0.712 ± 0.109
1.104TrpVal: 1.104 ± 0.128
0.261TrpTrp: 0.261 ± 0.059
0.487TrpTyr: 0.487 ± 0.07
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.492TyrAla: 2.492 ± 0.177
0.427TyrCys: 0.427 ± 0.076
2.812TyrAsp: 2.812 ± 0.175
1.946TyrGlu: 1.946 ± 0.168
1.839TyrPhe: 1.839 ± 0.143
2.611TyrGly: 2.611 ± 0.179
0.688TyrHis: 0.688 ± 0.102
2.053TyrIle: 2.053 ± 0.162
1.673TyrLys: 1.673 ± 0.133
3.762TyrLeu: 3.762 ± 0.223
1.092TyrMet: 1.092 ± 0.097
1.958TyrAsn: 1.958 ± 0.132
1.744TyrPro: 1.744 ± 0.16
1.59TyrGln: 1.59 ± 0.124
2.872TyrArg: 2.872 ± 0.183
2.789TyrSer: 2.789 ± 0.176
2.195TyrThr: 2.195 ± 0.161
2.551TyrVal: 2.551 ± 0.179
0.522TyrTrp: 0.522 ± 0.091
2.053TyrTyr: 2.053 ± 0.18
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 363 proteins (84274 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski