Amino acid dipepetide frequency for Serratia phage PCH45

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.687AlaAla: 5.687 ± 0.434
0.599AlaCys: 0.599 ± 0.095
4.52AlaAsp: 4.52 ± 0.253
4.43AlaGlu: 4.43 ± 0.28
2.754AlaPhe: 2.754 ± 0.212
4.894AlaGly: 4.894 ± 0.386
1.078AlaHis: 1.078 ± 0.121
4.699AlaIle: 4.699 ± 0.27
4.071AlaLys: 4.071 ± 0.255
6.406AlaLeu: 6.406 ± 0.304
2.155AlaMet: 2.155 ± 0.179
3.577AlaAsn: 3.577 ± 0.245
2.559AlaPro: 2.559 ± 0.198
3.068AlaGln: 3.068 ± 0.247
3.861AlaArg: 3.861 ± 0.276
3.906AlaSer: 3.906 ± 0.326
4.565AlaThr: 4.565 ± 0.314
4.744AlaVal: 4.744 ± 0.237
0.988AlaTrp: 0.988 ± 0.131
2.604AlaTyr: 2.604 ± 0.22
0.0AlaXaa: 0.0 ± 0.0
Cys
0.449CysAla: 0.449 ± 0.086
0.105CysCys: 0.105 ± 0.053
0.524CysAsp: 0.524 ± 0.082
0.479CysGlu: 0.479 ± 0.09
0.344CysPhe: 0.344 ± 0.075
0.629CysGly: 0.629 ± 0.099
0.254CysHis: 0.254 ± 0.064
0.524CysIle: 0.524 ± 0.095
0.434CysLys: 0.434 ± 0.083
0.688CysLeu: 0.688 ± 0.102
0.21CysMet: 0.21 ± 0.056
0.494CysAsn: 0.494 ± 0.105
0.344CysPro: 0.344 ± 0.07
0.329CysGln: 0.329 ± 0.066
0.449CysArg: 0.449 ± 0.081
0.404CysSer: 0.404 ± 0.077
0.479CysThr: 0.479 ± 0.071
0.539CysVal: 0.539 ± 0.084
0.135CysTrp: 0.135 ± 0.047
0.419CysTyr: 0.419 ± 0.086
0.0CysXaa: 0.0 ± 0.0
Asp
4.071AspAla: 4.071 ± 0.271
0.449AspCys: 0.449 ± 0.082
4.64AspAsp: 4.64 ± 0.383
4.565AspGlu: 4.565 ± 0.356
3.427AspPhe: 3.427 ± 0.233
5.163AspGly: 5.163 ± 0.355
1.048AspHis: 1.048 ± 0.137
4.415AspIle: 4.415 ± 0.258
4.295AspLys: 4.295 ± 0.244
5.852AspLeu: 5.852 ± 0.234
1.691AspMet: 1.691 ± 0.172
3.682AspAsn: 3.682 ± 0.257
3.113AspPro: 3.113 ± 0.229
2.17AspGln: 2.17 ± 0.179
3.757AspArg: 3.757 ± 0.361
3.742AspSer: 3.742 ± 0.237
2.918AspThr: 2.918 ± 0.194
4.071AspVal: 4.071 ± 0.242
0.958AspTrp: 0.958 ± 0.108
3.023AspTyr: 3.023 ± 0.247
0.0AspXaa: 0.0 ± 0.0
Glu
4.55GluAla: 4.55 ± 0.281
0.524GluCys: 0.524 ± 0.092
4.236GluAsp: 4.236 ± 0.335
5.328GluGlu: 5.328 ± 0.369
2.903GluPhe: 2.903 ± 0.231
4.071GluGly: 4.071 ± 0.249
1.407GluHis: 1.407 ± 0.156
4.864GluIle: 4.864 ± 0.334
4.699GluLys: 4.699 ± 0.28
5.717GluLeu: 5.717 ± 0.319
1.751GluMet: 1.751 ± 0.148
2.814GluAsn: 2.814 ± 0.2
2.095GluPro: 2.095 ± 0.189
2.634GluGln: 2.634 ± 0.195
3.772GluArg: 3.772 ± 0.293
3.652GluSer: 3.652 ± 0.277
3.083GluThr: 3.083 ± 0.215
4.714GluVal: 4.714 ± 0.281
1.048GluTrp: 1.048 ± 0.13
2.29GluTyr: 2.29 ± 0.15
0.0GluXaa: 0.0 ± 0.0
Phe
2.604PheAla: 2.604 ± 0.174
0.434PheCys: 0.434 ± 0.087
3.038PheAsp: 3.038 ± 0.177
2.484PheGlu: 2.484 ± 0.209
1.407PhePhe: 1.407 ± 0.136
2.664PheGly: 2.664 ± 0.212
0.838PheHis: 0.838 ± 0.116
2.784PheIle: 2.784 ± 0.223
2.425PheLys: 2.425 ± 0.186
3.218PheLeu: 3.218 ± 0.248
1.212PheMet: 1.212 ± 0.142
2.679PheAsn: 2.679 ± 0.183
1.691PhePro: 1.691 ± 0.142
1.362PheGln: 1.362 ± 0.133
2.095PheArg: 2.095 ± 0.137
2.874PheSer: 2.874 ± 0.207
2.754PheThr: 2.754 ± 0.249
2.874PheVal: 2.874 ± 0.229
0.599PheTrp: 0.599 ± 0.086
1.527PheTyr: 1.527 ± 0.137
0.0PheXaa: 0.0 ± 0.0
Gly
4.31GlyAla: 4.31 ± 0.334
0.434GlyCys: 0.434 ± 0.095
4.61GlyAsp: 4.61 ± 0.384
4.28GlyGlu: 4.28 ± 0.311
2.814GlyPhe: 2.814 ± 0.191
5.268GlyGly: 5.268 ± 0.589
1.257GlyHis: 1.257 ± 0.152
4.565GlyIle: 4.565 ± 0.219
5.104GlyLys: 5.104 ± 0.405
4.954GlyLeu: 4.954 ± 0.313
1.886GlyMet: 1.886 ± 0.192
3.816GlyAsn: 3.816 ± 0.26
1.916GlyPro: 1.916 ± 0.257
2.799GlyGln: 2.799 ± 0.215
3.637GlyArg: 3.637 ± 0.334
4.565GlySer: 4.565 ± 0.298
4.116GlyThr: 4.116 ± 0.287
4.58GlyVal: 4.58 ± 0.286
1.093GlyTrp: 1.093 ± 0.123
2.963GlyTyr: 2.963 ± 0.2
0.0GlyXaa: 0.0 ± 0.0
His
0.973HisAla: 0.973 ± 0.12
0.224HisCys: 0.224 ± 0.057
0.988HisAsp: 0.988 ± 0.11
0.808HisGlu: 0.808 ± 0.128
0.793HisPhe: 0.793 ± 0.101
1.108HisGly: 1.108 ± 0.131
0.434HisHis: 0.434 ± 0.079
1.093HisIle: 1.093 ± 0.139
0.793HisLys: 0.793 ± 0.107
1.242HisLeu: 1.242 ± 0.139
0.524HisMet: 0.524 ± 0.102
0.868HisAsn: 0.868 ± 0.108
1.108HisPro: 1.108 ± 0.146
0.614HisGln: 0.614 ± 0.092
1.093HisArg: 1.093 ± 0.149
1.078HisSer: 1.078 ± 0.115
0.793HisThr: 0.793 ± 0.102
1.093HisVal: 1.093 ± 0.13
0.18HisTrp: 0.18 ± 0.05
0.943HisTyr: 0.943 ± 0.107
0.0HisXaa: 0.0 ± 0.0
Ile
4.55IleAla: 4.55 ± 0.286
0.644IleCys: 0.644 ± 0.099
5.163IleAsp: 5.163 ± 0.285
4.46IleGlu: 4.46 ± 0.278
2.395IlePhe: 2.395 ± 0.227
4.026IleGly: 4.026 ± 0.267
1.063IleHis: 1.063 ± 0.124
3.801IleIle: 3.801 ± 0.241
3.293IleLys: 3.293 ± 0.267
4.625IleLeu: 4.625 ± 0.236
1.407IleMet: 1.407 ± 0.153
3.637IleAsn: 3.637 ± 0.239
3.427IlePro: 3.427 ± 0.24
2.29IleGln: 2.29 ± 0.196
3.427IleArg: 3.427 ± 0.26
4.041IleSer: 4.041 ± 0.249
4.011IleThr: 4.011 ± 0.244
3.906IleVal: 3.906 ± 0.249
0.883IleTrp: 0.883 ± 0.135
2.2IleTyr: 2.2 ± 0.161
0.0IleXaa: 0.0 ± 0.0
Lys
4.325LysAla: 4.325 ± 0.319
0.389LysCys: 0.389 ± 0.085
4.744LysAsp: 4.744 ± 0.224
4.714LysGlu: 4.714 ± 0.287
2.41LysPhe: 2.41 ± 0.221
4.714LysGly: 4.714 ± 0.328
0.958LysHis: 0.958 ± 0.129
3.637LysIle: 3.637 ± 0.22
3.682LysLys: 3.682 ± 0.251
5.059LysLeu: 5.059 ± 0.321
1.571LysMet: 1.571 ± 0.137
2.739LysAsn: 2.739 ± 0.269
1.886LysPro: 1.886 ± 0.158
1.482LysGln: 1.482 ± 0.125
3.218LysArg: 3.218 ± 0.232
3.128LysSer: 3.128 ± 0.229
2.963LysThr: 2.963 ± 0.196
4.25LysVal: 4.25 ± 0.236
0.943LysTrp: 0.943 ± 0.129
2.514LysTyr: 2.514 ± 0.222
0.0LysXaa: 0.0 ± 0.0
Leu
6.825LeuAla: 6.825 ± 0.363
0.853LeuCys: 0.853 ± 0.116
5.612LeuAsp: 5.612 ± 0.268
5.807LeuGlu: 5.807 ± 0.36
3.143LeuPhe: 3.143 ± 0.213
5.687LeuGly: 5.687 ± 0.328
1.347LeuHis: 1.347 ± 0.144
4.774LeuIle: 4.774 ± 0.26
5.298LeuLys: 5.298 ± 0.318
6.226LeuLeu: 6.226 ± 0.335
2.215LeuMet: 2.215 ± 0.154
4.625LeuAsn: 4.625 ± 0.272
4.011LeuPro: 4.011 ± 0.27
2.32LeuGln: 2.32 ± 0.174
4.565LeuArg: 4.565 ± 0.273
5.852LeuSer: 5.852 ± 0.266
5.223LeuThr: 5.223 ± 0.315
4.954LeuVal: 4.954 ± 0.312
1.018LeuTrp: 1.018 ± 0.109
3.188LeuTyr: 3.188 ± 0.23
0.0LeuXaa: 0.0 ± 0.0
Met
2.305MetAla: 2.305 ± 0.198
0.15MetCys: 0.15 ± 0.047
1.631MetAsp: 1.631 ± 0.133
1.976MetGlu: 1.976 ± 0.158
1.242MetPhe: 1.242 ± 0.141
1.512MetGly: 1.512 ± 0.116
0.314MetHis: 0.314 ± 0.075
1.467MetIle: 1.467 ± 0.161
1.302MetLys: 1.302 ± 0.101
2.574MetLeu: 2.574 ± 0.192
0.838MetMet: 0.838 ± 0.118
1.302MetAsn: 1.302 ± 0.154
1.302MetPro: 1.302 ± 0.165
1.287MetGln: 1.287 ± 0.11
1.557MetArg: 1.557 ± 0.149
2.29MetSer: 2.29 ± 0.186
1.811MetThr: 1.811 ± 0.171
1.796MetVal: 1.796 ± 0.167
0.314MetTrp: 0.314 ± 0.06
0.883MetTyr: 0.883 ± 0.11
0.0MetXaa: 0.0 ± 0.0
Asn
3.502AsnAla: 3.502 ± 0.282
0.284AsnCys: 0.284 ± 0.067
3.038AsnAsp: 3.038 ± 0.208
2.978AsnGlu: 2.978 ± 0.201
2.29AsnPhe: 2.29 ± 0.172
3.981AsnGly: 3.981 ± 0.234
1.048AsnHis: 1.048 ± 0.119
3.338AsnIle: 3.338 ± 0.247
2.859AsnLys: 2.859 ± 0.203
4.161AsnLeu: 4.161 ± 0.264
1.272AsnMet: 1.272 ± 0.163
2.739AsnAsn: 2.739 ± 0.211
3.113AsnPro: 3.113 ± 0.242
1.991AsnGln: 1.991 ± 0.146
2.829AsnArg: 2.829 ± 0.191
2.918AsnSer: 2.918 ± 0.231
3.083AsnThr: 3.083 ± 0.252
3.532AsnVal: 3.532 ± 0.22
0.584AsnTrp: 0.584 ± 0.092
2.05AsnTyr: 2.05 ± 0.176
0.0AsnXaa: 0.0 ± 0.0
Pro
3.098ProAla: 3.098 ± 0.217
0.284ProCys: 0.284 ± 0.068
3.113ProAsp: 3.113 ± 0.178
3.457ProGlu: 3.457 ± 0.232
2.08ProPhe: 2.08 ± 0.201
3.173ProGly: 3.173 ± 0.264
0.883ProHis: 0.883 ± 0.153
2.44ProIle: 2.44 ± 0.171
2.365ProLys: 2.365 ± 0.201
2.963ProLeu: 2.963 ± 0.179
1.093ProMet: 1.093 ± 0.103
2.455ProAsn: 2.455 ± 0.19
1.407ProPro: 1.407 ± 0.143
1.137ProGln: 1.137 ± 0.138
2.095ProArg: 2.095 ± 0.19
2.814ProSer: 2.814 ± 0.192
2.634ProThr: 2.634 ± 0.185
3.772ProVal: 3.772 ± 0.237
0.494ProTrp: 0.494 ± 0.086
1.497ProTyr: 1.497 ± 0.158
0.0ProXaa: 0.0 ± 0.0
Gln
2.739GlnAla: 2.739 ± 0.244
0.254GlnCys: 0.254 ± 0.059
1.452GlnAsp: 1.452 ± 0.151
2.2GlnGlu: 2.2 ± 0.197
1.392GlnPhe: 1.392 ± 0.145
2.185GlnGly: 2.185 ± 0.308
0.509GlnHis: 0.509 ± 0.089
2.26GlnIle: 2.26 ± 0.211
1.961GlnLys: 1.961 ± 0.151
3.517GlnLeu: 3.517 ± 0.206
1.182GlnMet: 1.182 ± 0.142
1.377GlnAsn: 1.377 ± 0.165
1.467GlnPro: 1.467 ± 0.172
1.616GlnGln: 1.616 ± 0.153
1.856GlnArg: 1.856 ± 0.138
2.065GlnSer: 2.065 ± 0.162
2.215GlnThr: 2.215 ± 0.177
2.23GlnVal: 2.23 ± 0.211
0.539GlnTrp: 0.539 ± 0.106
1.482GlnTyr: 1.482 ± 0.125
0.0GlnXaa: 0.0 ± 0.0
Arg
3.951ArgAla: 3.951 ± 0.232
0.359ArgCys: 0.359 ± 0.076
3.637ArgAsp: 3.637 ± 0.315
3.352ArgGlu: 3.352 ± 0.264
2.589ArgPhe: 2.589 ± 0.212
3.667ArgGly: 3.667 ± 0.335
0.868ArgHis: 0.868 ± 0.11
3.352ArgIle: 3.352 ± 0.241
3.203ArgLys: 3.203 ± 0.205
4.804ArgLeu: 4.804 ± 0.286
1.452ArgMet: 1.452 ± 0.157
3.008ArgAsn: 3.008 ± 0.229
2.2ArgPro: 2.2 ± 0.203
1.856ArgGln: 1.856 ± 0.152
4.236ArgArg: 4.236 ± 0.469
3.532ArgSer: 3.532 ± 0.315
2.918ArgThr: 2.918 ± 0.204
3.083ArgVal: 3.083 ± 0.211
0.868ArgTrp: 0.868 ± 0.115
2.709ArgTyr: 2.709 ± 0.188
0.0ArgXaa: 0.0 ± 0.0
Ser
4.415SerAla: 4.415 ± 0.268
0.494SerCys: 0.494 ± 0.081
3.801SerAsp: 3.801 ± 0.201
3.607SerGlu: 3.607 ± 0.207
2.918SerPhe: 2.918 ± 0.196
4.43SerGly: 4.43 ± 0.244
0.853SerHis: 0.853 ± 0.113
3.876SerIle: 3.876 ± 0.191
3.263SerLys: 3.263 ± 0.186
5.657SerLeu: 5.657 ± 0.299
1.856SerMet: 1.856 ± 0.151
2.619SerAsn: 2.619 ± 0.19
3.113SerPro: 3.113 ± 0.259
2.02SerGln: 2.02 ± 0.144
3.427SerArg: 3.427 ± 0.275
4.31SerSer: 4.31 ± 0.306
3.906SerThr: 3.906 ± 0.246
4.445SerVal: 4.445 ± 0.294
0.838SerTrp: 0.838 ± 0.12
2.335SerTyr: 2.335 ± 0.189
0.0SerXaa: 0.0 ± 0.0
Thr
4.58ThrAla: 4.58 ± 0.324
0.494ThrCys: 0.494 ± 0.076
3.876ThrAsp: 3.876 ± 0.284
3.487ThrGlu: 3.487 ± 0.233
2.499ThrPhe: 2.499 ± 0.224
4.026ThrGly: 4.026 ± 0.235
0.808ThrHis: 0.808 ± 0.111
3.981ThrIle: 3.981 ± 0.244
3.338ThrLys: 3.338 ± 0.218
5.104ThrLeu: 5.104 ± 0.284
1.616ThrMet: 1.616 ± 0.158
2.769ThrAsn: 2.769 ± 0.195
3.023ThrPro: 3.023 ± 0.213
1.736ThrGln: 1.736 ± 0.157
2.963ThrArg: 2.963 ± 0.213
3.308ThrSer: 3.308 ± 0.207
3.263ThrThr: 3.263 ± 0.27
4.25ThrVal: 4.25 ± 0.263
0.778ThrTrp: 0.778 ± 0.112
2.08ThrTyr: 2.08 ± 0.184
0.0ThrXaa: 0.0 ± 0.0
Val
4.834ValAla: 4.834 ± 0.266
0.629ValCys: 0.629 ± 0.097
4.655ValAsp: 4.655 ± 0.247
4.744ValGlu: 4.744 ± 0.274
2.245ValPhe: 2.245 ± 0.161
4.086ValGly: 4.086 ± 0.296
0.898ValHis: 0.898 ± 0.114
4.52ValIle: 4.52 ± 0.253
4.236ValLys: 4.236 ± 0.237
5.493ValLeu: 5.493 ± 0.235
2.11ValMet: 2.11 ± 0.156
3.577ValAsn: 3.577 ± 0.286
2.829ValPro: 2.829 ± 0.211
2.11ValGln: 2.11 ± 0.17
3.412ValArg: 3.412 ± 0.189
4.385ValSer: 4.385 ± 0.276
4.265ValThr: 4.265 ± 0.27
4.655ValVal: 4.655 ± 0.245
0.883ValTrp: 0.883 ± 0.116
2.619ValTyr: 2.619 ± 0.225
0.0ValXaa: 0.0 ± 0.0
Trp
1.003TrpAla: 1.003 ± 0.139
0.18TrpCys: 0.18 ± 0.051
1.018TrpAsp: 1.018 ± 0.14
0.928TrpGlu: 0.928 ± 0.138
0.629TrpPhe: 0.629 ± 0.09
0.853TrpGly: 0.853 ± 0.105
0.195TrpHis: 0.195 ± 0.056
0.778TrpIle: 0.778 ± 0.113
0.778TrpLys: 0.778 ± 0.102
1.287TrpLeu: 1.287 ± 0.152
0.494TrpMet: 0.494 ± 0.089
0.644TrpAsn: 0.644 ± 0.09
0.389TrpPro: 0.389 ± 0.065
0.449TrpGln: 0.449 ± 0.075
0.853TrpArg: 0.853 ± 0.097
0.913TrpSer: 0.913 ± 0.107
0.868TrpThr: 0.868 ± 0.136
0.928TrpVal: 0.928 ± 0.115
0.359TrpTrp: 0.359 ± 0.081
0.509TrpTyr: 0.509 ± 0.076
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.395TyrAla: 2.395 ± 0.2
0.449TyrCys: 0.449 ± 0.087
2.754TyrAsp: 2.754 ± 0.213
2.065TyrGlu: 2.065 ± 0.189
1.287TyrPhe: 1.287 ± 0.144
2.829TyrGly: 2.829 ± 0.168
0.688TyrHis: 0.688 ± 0.114
2.08TyrIle: 2.08 ± 0.183
1.946TyrLys: 1.946 ± 0.151
3.996TyrLeu: 3.996 ± 0.195
1.302TyrMet: 1.302 ± 0.15
2.17TyrAsn: 2.17 ± 0.197
2.26TyrPro: 2.26 ± 0.183
1.227TyrGln: 1.227 ± 0.122
2.544TyrArg: 2.544 ± 0.198
2.41TyrSer: 2.41 ± 0.181
2.11TyrThr: 2.11 ± 0.199
2.769TyrVal: 2.769 ± 0.211
0.554TyrTrp: 0.554 ± 0.087
1.931TyrTyr: 1.931 ± 0.182
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 225 proteins (66817 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski