Amino acid dipeptide frequency for Cronobacter phage CR5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
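The counting described above can be sketched as follows. This is a minimal illustration, not the code used to build the database: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), and any letter outside the 20 standard ones is assumed to map to X (Xaa). Pairs are counted as overlapping windows within each protein, never across protein boundaries.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    alphabet = STANDARD + "X"
    counts = Counter()
    for seq in proteins:
        # Assumption: everything outside the standard alphabet becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, kept inside a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Hypothetical toy input: "B" is non-standard and is counted as X.
toy = ["MAAK", "AAB"]
freqs = dipeptide_per_mille(toy)
print(len(freqs))   # number of ordered pairs over 21 symbols
print(freqs["AA"])  # AA occurs in 2 of the 5 pairs
```

With a uniform 20-letter alphabet, each of the 400 pairs would come out at 1000/400 = 2.5‰, which is the 0.25% baseline quoted above.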

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
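As a small worked example of the unit conversion, the sketch below turns two values from the table (AlaAla and CysCys) into fractions and compares them with the 0.25% baseline, which is 2.5‰. The helper names are hypothetical, and the comparison rule (strictly above or below 2.5‰) is an assumption made for illustration; the page does not state the exact threshold behind its red and blue colouring.

```python
# Uniform baseline: 1 in 400 dipeptides, expressed in per mille.
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_EXPECTED_PER_MILLE):
    """Label a per mille value relative to the baseline (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


ala_ala = 8.35   # AlaAla, per mille, taken from the table
cys_cys = 0.043  # CysCys, per mille, taken from the table

print(per_mille_to_fraction(ala_ala), classify(ala_ala))
print(per_mille_to_fraction(cys_cys), classify(cys_cys))
```

Under this assumed rule AlaAla, at roughly 3.3 times the baseline, counts as overrepresented, while CysCys falls far below it.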

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.35 ± 0.539
AlaCys: 0.756 ± 0.12
AlaAsp: 4.724 ± 0.226
AlaGlu: 4.196 ± 0.277
AlaPhe: 3.183 ± 0.214
AlaGly: 4.653 ± 0.278
AlaHis: 1.27 ± 0.159
AlaIle: 5.367 ± 0.289
AlaLys: 4.625 ± 0.354
AlaLeu: 7.408 ± 0.319
AlaMet: 2.669 ± 0.215
AlaAsn: 3.982 ± 0.285
AlaPro: 2.84 ± 0.22
AlaGln: 3.326 ± 0.244
AlaArg: 4.096 ± 0.268
AlaSer: 4.667 ± 0.274
AlaThr: 4.996 ± 0.338
AlaVal: 5.452 ± 0.293
AlaTrp: 0.942 ± 0.122
AlaTyr: 3.097 ± 0.23
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.557 ± 0.097
CysCys: 0.043 ± 0.026
CysAsp: 0.514 ± 0.081
CysGlu: 0.5 ± 0.082
CysPhe: 0.371 ± 0.078
CysGly: 0.514 ± 0.09
CysHis: 0.243 ± 0.065
CysIle: 0.671 ± 0.09
CysLys: 0.471 ± 0.086
CysLeu: 0.585 ± 0.088
CysMet: 0.243 ± 0.055
CysAsn: 0.371 ± 0.081
CysPro: 0.343 ± 0.065
CysGln: 0.4 ± 0.087
CysArg: 0.414 ± 0.068
CysSer: 0.514 ± 0.078
CysThr: 0.528 ± 0.084
CysVal: 0.542 ± 0.094
CysTrp: 0.114 ± 0.04
CysTyr: 0.328 ± 0.068
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.138 ± 0.261
AspCys: 0.514 ± 0.079
AspAsp: 4.296 ± 0.31
AspGlu: 4.496 ± 0.294
AspPhe: 3.069 ± 0.246
AspGly: 4.824 ± 0.267
AspHis: 0.971 ± 0.116
AspIle: 4.211 ± 0.217
AspLys: 3.625 ± 0.238
AspLeu: 5.424 ± 0.252
AspMet: 1.798 ± 0.162
AspAsn: 3.054 ± 0.254
AspPro: 2.869 ± 0.209
AspGln: 1.87 ± 0.151
AspArg: 2.798 ± 0.203
AspSer: 2.74 ± 0.153
AspThr: 3.597 ± 0.293
AspVal: 5.138 ± 0.267
AspTrp: 0.842 ± 0.1
AspTyr: 3.126 ± 0.214
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.368 ± 0.316
GluCys: 0.585 ± 0.1
GluAsp: 3.982 ± 0.263
GluGlu: 4.196 ± 0.257
GluPhe: 2.683 ± 0.188
GluGly: 3.768 ± 0.238
GluHis: 1.584 ± 0.157
GluIle: 3.611 ± 0.207
GluLys: 3.283 ± 0.246
GluLeu: 6.98 ± 0.373
GluMet: 2.013 ± 0.207
GluAsn: 3.169 ± 0.22
GluPro: 2.127 ± 0.18
GluGln: 2.798 ± 0.186
GluArg: 3.483 ± 0.286
GluSer: 3.454 ± 0.246
GluThr: 3.311 ± 0.244
GluVal: 4.353 ± 0.274
GluTrp: 0.999 ± 0.125
GluTyr: 2.369 ± 0.247
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.955 ± 0.194
PheCys: 0.343 ± 0.067
PheAsp: 3.14 ± 0.209
PheGlu: 2.441 ± 0.177
PhePhe: 1.756 ± 0.154
PheGly: 2.926 ± 0.222
PheHis: 0.614 ± 0.1
PheIle: 2.555 ± 0.203
PheLys: 2.269 ± 0.173
PheLeu: 2.626 ± 0.213
PheMet: 1.142 ± 0.136
PheAsn: 2.683 ± 0.222
PhePro: 1.713 ± 0.15
PheGln: 1.128 ± 0.121
PheArg: 2.155 ± 0.166
PheSer: 2.698 ± 0.235
PheThr: 3.069 ± 0.249
PheVal: 2.84 ± 0.223
PheTrp: 0.471 ± 0.073
PheTyr: 1.684 ± 0.161
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.482 ± 0.317
GlyCys: 0.514 ± 0.09
GlyAsp: 4.011 ± 0.277
GlyGlu: 4.125 ± 0.267
GlyPhe: 2.541 ± 0.186
GlyGly: 4.653 ± 0.566
GlyHis: 1.028 ± 0.12
GlyIle: 3.897 ± 0.255
GlyLys: 4.282 ± 0.248
GlyLeu: 5.495 ± 0.295
GlyMet: 1.813 ± 0.17
GlyAsn: 3.154 ± 0.242
GlyPro: 1.827 ± 0.159
GlyGln: 2.484 ± 0.202
GlyArg: 3.668 ± 0.242
GlySer: 3.583 ± 0.218
GlyThr: 4.325 ± 0.292
GlyVal: 4.81 ± 0.261
GlyTrp: 1.185 ± 0.13
GlyTyr: 2.327 ± 0.194
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.413 ± 0.157
HisCys: 0.214 ± 0.057
HisAsp: 1.228 ± 0.17
HisGlu: 0.999 ± 0.119
HisPhe: 0.856 ± 0.122
HisGly: 1.199 ± 0.12
HisHis: 0.485 ± 0.079
HisIle: 1.213 ± 0.142
HisLys: 0.913 ± 0.126
HisLeu: 2.084 ± 0.201
HisMet: 0.5 ± 0.084
HisAsn: 0.828 ± 0.112
HisPro: 1.199 ± 0.13
HisGln: 0.442 ± 0.076
HisArg: 1.513 ± 0.175
HisSer: 0.771 ± 0.111
HisThr: 1.113 ± 0.138
HisVal: 1.499 ± 0.142
HisTrp: 0.186 ± 0.051
HisTyr: 1.099 ± 0.142
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.596 ± 0.255
IleCys: 0.371 ± 0.072
IleAsp: 4.439 ± 0.295
IleGlu: 3.897 ± 0.234
IlePhe: 1.784 ± 0.159
IleGly: 3.069 ± 0.203
IleHis: 1.085 ± 0.141
IleIle: 2.869 ± 0.229
IleLys: 3.368 ± 0.237
IleLeu: 4.082 ± 0.249
IleMet: 1.142 ± 0.132
IleAsn: 3.269 ± 0.217
IlePro: 3.183 ± 0.195
IleGln: 1.927 ± 0.158
IleArg: 3.497 ± 0.228
IleSer: 3.269 ± 0.263
IleThr: 3.982 ± 0.265
IleVal: 3.668 ± 0.229
IleTrp: 0.585 ± 0.085
IleTyr: 2.17 ± 0.179
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.51 ± 0.289
LysCys: 0.328 ± 0.059
LysAsp: 3.326 ± 0.224
LysGlu: 3.64 ± 0.303
LysPhe: 2.312 ± 0.17
LysGly: 3.368 ± 0.326
LysHis: 1.427 ± 0.15
LysIle: 2.569 ± 0.172
LysLys: 2.94 ± 0.286
LysLeu: 5.938 ± 0.249
LysMet: 1.599 ± 0.175
LysAsn: 2.269 ± 0.19
LysPro: 2.769 ± 0.232
LysGln: 1.827 ± 0.164
LysArg: 2.94 ± 0.294
LysSer: 3.183 ± 0.244
LysThr: 3.44 ± 0.206
LysVal: 3.597 ± 0.233
LysTrp: 0.642 ± 0.103
LysTyr: 1.998 ± 0.162
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.122 ± 0.273
LeuCys: 0.528 ± 0.083
LeuAsp: 5.509 ± 0.235
LeuGlu: 5.752 ± 0.353
LeuPhe: 3.083 ± 0.166
LeuGly: 5.096 ± 0.298
LeuHis: 1.627 ± 0.161
LeuIle: 4.196 ± 0.323
LeuLys: 5.038 ± 0.265
LeuLeu: 7.379 ± 0.341
LeuMet: 2.541 ± 0.166
LeuAsn: 4.981 ± 0.307
LeuPro: 4.196 ± 0.225
LeuGln: 3.097 ± 0.224
LeuArg: 5.481 ± 0.296
LeuSer: 6.394 ± 0.279
LeuThr: 6.694 ± 0.311
LeuVal: 5.681 ± 0.285
LeuTrp: 0.899 ± 0.093
LeuTyr: 3.468 ± 0.275
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.255 ± 0.161
MetCys: 0.285 ± 0.058
MetAsp: 1.47 ± 0.147
MetGlu: 1.613 ± 0.172
MetPhe: 1.285 ± 0.137
MetGly: 1.927 ± 0.198
MetHis: 0.599 ± 0.088
MetIle: 1.256 ± 0.123
MetLys: 1.327 ± 0.136
MetLeu: 2.698 ± 0.172
MetMet: 0.599 ± 0.09
MetAsn: 1.228 ± 0.184
MetPro: 1.085 ± 0.123
MetGln: 1.113 ± 0.131
MetArg: 1.713 ± 0.158
MetSer: 2.298 ± 0.183
MetThr: 1.641 ± 0.155
MetVal: 2.155 ± 0.199
MetTrp: 0.257 ± 0.059
MetTyr: 0.628 ± 0.095
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.41 ± 0.267
AsnCys: 0.457 ± 0.074
AsnAsp: 3.097 ± 0.242
AsnGlu: 2.869 ± 0.166
AsnPhe: 2.112 ± 0.201
AsnGly: 3.911 ± 0.234
AsnHis: 0.942 ± 0.112
AsnIle: 2.726 ± 0.226
AsnLys: 2.641 ± 0.205
AsnLeu: 3.882 ± 0.212
AsnMet: 1.37 ± 0.154
AsnAsn: 2.541 ± 0.236
AsnPro: 3.126 ± 0.224
AsnGln: 1.556 ± 0.156
AsnArg: 2.869 ± 0.184
AsnSer: 2.812 ± 0.235
AsnThr: 3.04 ± 0.202
AsnVal: 3.283 ± 0.224
AsnTrp: 0.714 ± 0.106
AsnTyr: 2.041 ± 0.201
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.568 ± 0.235
ProCys: 0.2 ± 0.053
ProAsp: 3.483 ± 0.257
ProGlu: 3.654 ± 0.266
ProPhe: 1.97 ± 0.19
ProGly: 2.626 ± 0.196
ProHis: 0.999 ± 0.135
ProIle: 2.312 ± 0.183
ProLys: 2.184 ± 0.211
ProLeu: 3.426 ± 0.212
ProMet: 0.999 ± 0.114
ProAsn: 1.67 ± 0.154
ProPro: 1.385 ± 0.13
ProGln: 1.741 ± 0.181
ProArg: 1.77 ± 0.149
ProSer: 2.669 ± 0.185
ProThr: 3.34 ± 0.207
ProVal: 3.54 ± 0.244
ProTrp: 0.471 ± 0.078
ProTyr: 1.641 ± 0.186
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.969 ± 0.229
GlnCys: 0.442 ± 0.082
GlnAsp: 1.57 ± 0.147
GlnGlu: 2.227 ± 0.219
GlnPhe: 1.313 ± 0.141
GlnGly: 1.941 ± 0.192
GlnHis: 0.728 ± 0.097
GlnIle: 1.699 ± 0.152
GlnLys: 1.741 ± 0.181
GlnLeu: 3.911 ± 0.218
GlnMet: 0.985 ± 0.125
GlnAsn: 1.57 ± 0.142
GlnPro: 1.399 ± 0.137
GlnGln: 1.841 ± 0.205
GlnArg: 2.541 ± 0.222
GlnSer: 2.07 ± 0.177
GlnThr: 2.227 ± 0.179
GlnVal: 2.184 ± 0.174
GlnTrp: 0.557 ± 0.095
GlnTyr: 1.599 ± 0.156
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.868 ± 0.268
ArgCys: 0.557 ± 0.092
ArgAsp: 3.511 ± 0.241
ArgGlu: 3.64 ± 0.25
ArgPhe: 2.298 ± 0.184
ArgGly: 3.24 ± 0.242
ArgHis: 1.256 ± 0.143
ArgIle: 3.511 ± 0.218
ArgLys: 3.054 ± 0.252
ArgLeu: 5.481 ± 0.292
ArgMet: 1.699 ± 0.185
ArgAsn: 3.169 ± 0.191
ArgPro: 2.17 ± 0.186
ArgGln: 1.97 ± 0.209
ArgArg: 3.583 ± 0.281
ArgSer: 2.883 ± 0.224
ArgThr: 3.126 ± 0.205
ArgVal: 3.74 ± 0.237
ArgTrp: 0.999 ± 0.124
ArgTyr: 2.412 ± 0.18
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.753 ± 0.255
SerCys: 0.4 ± 0.086
SerAsp: 3.54 ± 0.221
SerGlu: 3.511 ± 0.274
SerPhe: 2.355 ± 0.192
SerGly: 4.054 ± 0.32
SerHis: 1.085 ± 0.134
SerIle: 3.283 ± 0.175
SerLys: 3.169 ± 0.204
SerLeu: 5.381 ± 0.294
SerMet: 1.542 ± 0.15
SerAsn: 2.926 ± 0.258
SerPro: 2.669 ± 0.201
SerGln: 1.713 ± 0.139
SerArg: 3.197 ± 0.226
SerSer: 3.383 ± 0.264
SerThr: 3.939 ± 0.274
SerVal: 3.925 ± 0.24
SerTrp: 0.999 ± 0.132
SerTyr: 2.112 ± 0.158
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.809 ± 0.34
ThrCys: 0.514 ± 0.09
ThrAsp: 3.939 ± 0.262
ThrGlu: 3.954 ± 0.271
ThrPhe: 3.112 ± 0.209
ThrGly: 4.81 ± 0.26
ThrHis: 1.142 ± 0.12
ThrIle: 3.625 ± 0.247
ThrLys: 3.097 ± 0.2
ThrLeu: 5.795 ± 0.332
ThrMet: 1.599 ± 0.158
ThrAsn: 2.669 ± 0.215
ThrPro: 3.254 ± 0.223
ThrGln: 2.212 ± 0.216
ThrArg: 3.197 ± 0.204
ThrSer: 3.583 ± 0.247
ThrThr: 4.339 ± 0.284
ThrVal: 4.981 ± 0.349
ThrTrp: 0.971 ± 0.115
ThrTyr: 2.227 ± 0.227
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.724 ± 0.389
ValCys: 0.685 ± 0.117
ValAsp: 4.939 ± 0.254
ValGlu: 4.653 ± 0.313
ValPhe: 2.855 ± 0.198
ValGly: 4.296 ± 0.237
ValHis: 1.37 ± 0.141
ValIle: 3.939 ± 0.221
ValLys: 3.982 ± 0.235
ValLeu: 5.01 ± 0.253
ValMet: 1.97 ± 0.146
ValAsn: 3.683 ± 0.222
ValPro: 3.311 ± 0.214
ValGln: 2.17 ± 0.192
ValArg: 4.039 ± 0.235
ValSer: 3.925 ± 0.232
ValThr: 5.038 ± 0.258
ValVal: 5.267 ± 0.304
ValTrp: 0.842 ± 0.112
ValTyr: 2.512 ± 0.242
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.828 ± 0.098
TrpCys: 0.114 ± 0.038
TrpAsp: 1.07 ± 0.128
TrpGlu: 0.842 ± 0.117
TrpPhe: 0.685 ± 0.109
TrpGly: 0.728 ± 0.104
TrpHis: 0.271 ± 0.07
TrpIle: 0.714 ± 0.095
TrpLys: 0.799 ± 0.12
TrpLeu: 1.527 ± 0.152
TrpMet: 0.328 ± 0.077
TrpAsn: 0.756 ± 0.117
TrpPro: 0.442 ± 0.085
TrpGln: 0.485 ± 0.082
TrpArg: 0.728 ± 0.098
TrpSer: 0.671 ± 0.085
TrpThr: 0.714 ± 0.093
TrpVal: 0.885 ± 0.114
TrpTrp: 0.243 ± 0.049
TrpTyr: 0.542 ± 0.107
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.926 ± 0.224
TyrCys: 0.414 ± 0.077
TyrAsp: 2.612 ± 0.192
TyrGlu: 1.913 ± 0.178
TyrPhe: 1.584 ± 0.175
TyrGly: 2.598 ± 0.21
TyrHis: 1.028 ± 0.124
TyrIle: 2.027 ± 0.192
TyrLys: 1.827 ± 0.133
TyrLeu: 3.654 ± 0.196
TyrMet: 0.871 ± 0.115
TyrAsn: 2.426 ± 0.177
TyrPro: 1.741 ± 0.187
TyrGln: 1.413 ± 0.149
TyrArg: 2.484 ± 0.191
TyrSer: 2.369 ± 0.219
TyrThr: 2.369 ± 0.239
TyrVal: 2.669 ± 0.205
TyrTrp: 0.457 ± 0.079
TyrTyr: 1.927 ± 0.202
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 231 proteins (70062 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
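One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption for illustration only. The page does not describe the exact procedure, and the function names, the fixed seed, and the use of the sample standard deviation are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter codes.
toy = ["MAAK", "AAGL", "KLAA", "GGGG"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so a single unusual protein can widen the error estimate instead of being averaged away.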

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski