Amino acid dipeptide frequency for Cronobacter phage CR3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
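As a minimal sketch of the calculation described above, the following Python snippet counts overlapping dipeptides in a set of protein sequences and compares them with the uniform expectation of 2.5‰. It is an illustration only: the function name `dipeptide_per_mille`, the use of one-letter codes (the table uses three-letter codes), the normalisation by the total number of dipeptide positions, and the toy sequences are all assumptions, not the procedure used by Proteome-pI.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: frequencies are normalised by the total number of dipeptide
    positions; the database may normalise differently.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows (residues i and i+1), never spanning two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_UNIFORM = 1000.0 / (len(STANDARD) ** 2)

# Made-up toy sequences, not taken from the CR3 proteome.
toy_proteins = ["MAAGLK", "MKAAXL"]
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides that are more frequent than the uniform expectation.
overrepresented = sorted(dp for dp, f in freqs.items() if f > EXPECTED_UNIFORM)
```

With only a handful of residues every observed dipeptide exceeds 2.5‰, so the comparison becomes meaningful only for a full proteome.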

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
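The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the AlaAla value is the one listed in the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value (‰) to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value (‰) to percent."""
    return value / 10.0


ala_ala = 6.915  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)

# Ratio to the uniform expectation of 0.25% per standard dipeptide.
ratio_to_uniform = percent / 0.25
```

Under these assumptions AlaAla is roughly 2.8 times more frequent than the uniform expectation.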

For more information, see the sequence space article on Wikipedia.

First residue (group heading) followed by second residue, in the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.915 ± 0.557
AlaCys: 0.936 ± 0.15
AlaAsp: 4.427 ± 0.331
AlaGlu: 5.317 ± 0.428
AlaPhe: 2.944 ± 0.279
AlaGly: 6.618 ± 0.511
AlaHis: 1.301 ± 0.195
AlaIle: 4.108 ± 0.284
AlaLys: 4.701 ± 0.354
AlaLeu: 6.504 ± 0.424
AlaMet: 2.351 ± 0.231
AlaAsn: 3.081 ± 0.272
AlaPro: 2.442 ± 0.335
AlaGln: 2.944 ± 0.275
AlaArg: 3.994 ± 0.363
AlaSer: 3.674 ± 0.293
AlaThr: 4.222 ± 0.405
AlaVal: 4.998 ± 0.317
AlaTrp: 1.392 ± 0.19
AlaTyr: 2.533 ± 0.262
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.753 ± 0.136
CysCys: 0.251 ± 0.077
CysAsp: 1.073 ± 0.18
CysGlu: 0.867 ± 0.145
CysPhe: 0.639 ± 0.134
CysGly: 1.141 ± 0.2
CysHis: 0.434 ± 0.107
CysIle: 0.297 ± 0.076
CysLys: 0.981 ± 0.157
CysLeu: 0.936 ± 0.147
CysMet: 0.456 ± 0.094
CysAsn: 0.502 ± 0.102
CysPro: 0.616 ± 0.13
CysGln: 0.32 ± 0.091
CysArg: 0.616 ± 0.111
CysSer: 0.32 ± 0.109
CysThr: 0.525 ± 0.121
CysVal: 0.571 ± 0.111
CysTrp: 0.068 ± 0.042
CysTyr: 0.593 ± 0.101
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.427 ± 0.333
AspCys: 0.73 ± 0.123
AspAsp: 3.834 ± 0.342
AspGlu: 4.062 ± 0.327
AspPhe: 3.035 ± 0.238
AspGly: 4.975 ± 0.391
AspHis: 1.392 ± 0.185
AspIle: 4.222 ± 0.367
AspLys: 4.176 ± 0.367
AspLeu: 6.39 ± 0.384
AspMet: 2.351 ± 0.26
AspAsn: 2.83 ± 0.29
AspPro: 3.172 ± 0.301
AspGln: 2.647 ± 0.276
AspArg: 3.012 ± 0.27
AspSer: 2.967 ± 0.27
AspThr: 2.761 ± 0.301
AspVal: 4.29 ± 0.412
AspTrp: 1.461 ± 0.191
AspTyr: 2.465 ± 0.25
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.861 ± 0.366
GluCys: 0.73 ± 0.114
GluAsp: 4.793 ± 0.328
GluGlu: 5.934 ± 0.469
GluPhe: 3.012 ± 0.269
GluGly: 4.701 ± 0.344
GluHis: 1.438 ± 0.184
GluIle: 4.405 ± 0.369
GluLys: 4.701 ± 0.3
GluLeu: 5.637 ± 0.388
GluMet: 2.396 ± 0.256
GluAsn: 2.693 ± 0.209
GluPro: 2.031 ± 0.234
GluGln: 2.579 ± 0.269
GluArg: 3.492 ± 0.324
GluSer: 3.104 ± 0.275
GluThr: 3.651 ± 0.34
GluVal: 5.135 ± 0.362
GluTrp: 1.095 ± 0.133
GluTyr: 2.419 ± 0.273
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.396 ± 0.261
PheCys: 0.593 ± 0.114
PheAsp: 2.876 ± 0.294
PheGlu: 3.035 ± 0.263
PhePhe: 1.917 ± 0.209
PheGly: 3.766 ± 0.352
PheHis: 0.844 ± 0.146
PheIle: 2.624 ± 0.261
PheLys: 2.488 ± 0.273
PheLeu: 3.149 ± 0.264
PheMet: 1.05 ± 0.159
PheAsn: 1.917 ± 0.212
PhePro: 1.689 ± 0.204
PheGln: 1.461 ± 0.171
PheArg: 1.712 ± 0.167
PheSer: 2.442 ± 0.232
PheThr: 2.784 ± 0.265
PheVal: 2.83 ± 0.249
PheTrp: 1.118 ± 0.186
PheTyr: 1.689 ± 0.219
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.614 ± 0.513
GlyCys: 0.913 ± 0.154
GlyAsp: 6.116 ± 0.877
GlyGlu: 4.975 ± 0.421
GlyPhe: 3.72 ± 0.313
GlyGly: 6.025 ± 0.448
GlyHis: 1.164 ± 0.175
GlyIle: 3.971 ± 0.254
GlyLys: 5.979 ± 0.396
GlyLeu: 5.637 ± 0.369
GlyMet: 2.351 ± 0.229
GlyAsn: 3.651 ± 0.38
GlyPro: 3.766 ± 1.109
GlyGln: 2.716 ± 0.233
GlyArg: 3.674 ± 0.323
GlySer: 4.952 ± 0.433
GlyThr: 4.861 ± 0.519
GlyVal: 5.34 ± 0.418
GlyTrp: 1.643 ± 0.194
GlyTyr: 3.309 ± 0.27
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.073 ± 0.148
HisCys: 0.365 ± 0.089
HisAsp: 1.095 ± 0.15
HisGlu: 1.05 ± 0.176
HisPhe: 0.844 ± 0.136
HisGly: 1.483 ± 0.152
HisHis: 0.297 ± 0.083
HisIle: 1.324 ± 0.147
HisLys: 0.753 ± 0.14
HisLeu: 1.575 ± 0.171
HisMet: 0.593 ± 0.13
HisAsn: 1.095 ± 0.168
HisPro: 1.324 ± 0.177
HisGln: 0.502 ± 0.123
HisArg: 1.095 ± 0.159
HisSer: 0.799 ± 0.137
HisThr: 1.004 ± 0.14
HisVal: 1.438 ± 0.173
HisTrp: 0.342 ± 0.08
HisTyr: 0.685 ± 0.127
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.062 ± 0.316
IleCys: 0.548 ± 0.106
IleAsp: 3.948 ± 0.288
IleGlu: 4.359 ± 0.318
IlePhe: 2.1 ± 0.223
IleGly: 3.971 ± 0.323
IleHis: 1.05 ± 0.139
IleIle: 2.898 ± 0.276
IleLys: 3.788 ± 0.304
IleLeu: 4.085 ± 0.269
IleMet: 1.506 ± 0.175
IleAsn: 2.739 ± 0.258
IlePro: 2.465 ± 0.313
IleGln: 1.689 ± 0.182
IleArg: 3.537 ± 0.313
IleSer: 3.127 ± 0.27
IleThr: 3.446 ± 0.283
IleVal: 3.651 ± 0.297
IleTrp: 0.502 ± 0.102
IleTyr: 1.963 ± 0.22
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.865 ± 0.414
LysCys: 0.707 ± 0.139
LysAsp: 4.062 ± 0.284
LysGlu: 4.587 ± 0.397
LysPhe: 2.784 ± 0.266
LysGly: 5.705 ± 0.673
LysHis: 1.369 ± 0.192
LysIle: 3.241 ± 0.236
LysLys: 4.313 ± 0.35
LysLeu: 4.427 ± 0.298
LysMet: 2.693 ± 0.268
LysAsn: 2.533 ± 0.271
LysPro: 2.67 ± 0.262
LysGln: 2.351 ± 0.231
LysArg: 2.921 ± 0.227
LysSer: 3.263 ± 0.277
LysThr: 3.766 ± 0.294
LysVal: 4.861 ± 0.36
LysTrp: 0.799 ± 0.133
LysTyr: 2.191 ± 0.224
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.39 ± 0.434
LeuCys: 0.913 ± 0.147
LeuAsp: 5.135 ± 0.338
LeuGlu: 5.911 ± 0.375
LeuPhe: 3.332 ± 0.327
LeuGly: 5.112 ± 0.298
LeuHis: 1.803 ± 0.207
LeuIle: 4.176 ± 0.282
LeuLys: 5.158 ± 0.281
LeuLeu: 4.747 ± 0.36
LeuMet: 2.739 ± 0.278
LeuAsn: 3.948 ± 0.253
LeuPro: 3.834 ± 0.286
LeuGln: 2.419 ± 0.247
LeuArg: 4.268 ± 0.354
LeuSer: 4.952 ± 0.303
LeuThr: 4.176 ± 0.451
LeuVal: 5.066 ± 0.373
LeuTrp: 1.141 ± 0.14
LeuTyr: 2.716 ± 0.241
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.442 ± 0.24
MetCys: 0.274 ± 0.082
MetAsp: 1.552 ± 0.213
MetGlu: 1.894 ± 0.184
MetPhe: 1.598 ± 0.183
MetGly: 2.237 ± 0.272
MetHis: 0.365 ± 0.087
MetIle: 2.328 ± 0.247
MetLys: 2.624 ± 0.233
MetLeu: 2.305 ± 0.207
MetMet: 0.776 ± 0.128
MetAsn: 1.369 ± 0.164
MetPro: 1.21 ± 0.158
MetGln: 0.959 ± 0.161
MetArg: 1.483 ± 0.176
MetSer: 2.488 ± 0.223
MetThr: 1.849 ± 0.216
MetVal: 1.917 ± 0.223
MetTrp: 0.228 ± 0.081
MetTyr: 0.981 ± 0.166
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.355 ± 0.247
AsnCys: 0.456 ± 0.096
AsnAsp: 2.465 ± 0.199
AsnGlu: 2.419 ± 0.254
AsnPhe: 1.643 ± 0.188
AsnGly: 4.039 ± 0.353
AsnHis: 0.707 ± 0.113
AsnIle: 2.488 ± 0.263
AsnLys: 2.556 ± 0.21
AsnLeu: 3.903 ± 0.305
AsnMet: 0.981 ± 0.143
AsnAsn: 2.008 ± 0.214
AsnPro: 1.917 ± 0.212
AsnGln: 1.598 ± 0.198
AsnArg: 2.259 ± 0.229
AsnSer: 2.077 ± 0.214
AsnThr: 2.83 ± 0.263
AsnVal: 3.241 ± 0.324
AsnTrp: 0.707 ± 0.133
AsnTyr: 1.575 ± 0.186
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.743 ± 0.369
ProCys: 0.525 ± 0.113
ProAsp: 3.058 ± 0.285
ProGlu: 3.651 ± 0.37
ProPhe: 1.712 ± 0.191
ProGly: 3.492 ± 0.293
ProHis: 1.004 ± 0.147
ProIle: 1.666 ± 0.211
ProLys: 2.419 ± 0.271
ProLeu: 2.168 ± 0.193
ProMet: 1.21 ± 0.164
ProAsn: 1.392 ± 0.16
ProPro: 1.575 ± 0.189
ProGln: 2.1 ± 0.451
ProArg: 1.871 ± 0.228
ProSer: 2.122 ± 0.233
ProThr: 2.419 ± 0.232
ProVal: 3.081 ± 0.263
ProTrp: 0.753 ± 0.127
ProTyr: 1.461 ± 0.204
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.898 ± 0.26
GlnCys: 0.297 ± 0.084
GlnAsp: 1.849 ± 0.205
GlnGlu: 2.191 ± 0.253
GlnPhe: 1.461 ± 0.213
GlnGly: 3.583 ± 0.827
GlnHis: 0.434 ± 0.101
GlnIle: 2.579 ± 0.198
GlnLys: 1.871 ± 0.183
GlnLeu: 2.784 ± 0.275
GlnMet: 1.164 ± 0.195
GlnAsn: 1.552 ± 0.158
GlnPro: 0.913 ± 0.129
GlnGln: 1.575 ± 0.244
GlnArg: 1.871 ± 0.217
GlnSer: 1.94 ± 0.212
GlnThr: 1.643 ± 0.201
GlnVal: 2.967 ± 0.28
GlnTrp: 0.753 ± 0.12
GlnTyr: 1.871 ± 0.208
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.766 ± 0.313
ArgCys: 0.707 ± 0.146
ArgAsp: 3.378 ± 0.302
ArgGlu: 3.743 ± 0.337
ArgPhe: 2.008 ± 0.23
ArgGly: 4.017 ± 0.343
ArgHis: 0.936 ± 0.138
ArgIle: 2.898 ± 0.342
ArgLys: 3.88 ± 0.357
ArgLeu: 4.336 ± 0.347
ArgMet: 1.62 ± 0.199
ArgAsn: 2.077 ± 0.24
ArgPro: 1.575 ± 0.178
ArgGln: 2.1 ± 0.213
ArgArg: 3.081 ± 0.285
ArgSer: 2.693 ± 0.222
ArgThr: 2.624 ± 0.253
ArgVal: 3.423 ± 0.229
ArgTrp: 0.685 ± 0.127
ArgTyr: 1.894 ± 0.232
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.72 ± 0.308
SerCys: 0.662 ± 0.111
SerAsp: 3.537 ± 0.332
SerGlu: 2.898 ± 0.296
SerPhe: 1.871 ± 0.199
SerGly: 4.838 ± 0.395
SerHis: 0.799 ± 0.117
SerIle: 2.122 ± 0.245
SerLys: 3.492 ± 0.259
SerLeu: 4.313 ± 0.329
SerMet: 1.483 ± 0.168
SerAsn: 2.145 ± 0.22
SerPro: 2.419 ± 0.201
SerGln: 1.94 ± 0.218
SerArg: 2.761 ± 0.264
SerSer: 3.286 ± 0.494
SerThr: 3.469 ± 0.335
SerVal: 4.017 ± 0.405
SerTrp: 0.959 ± 0.153
SerTyr: 2.442 ± 0.275
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.154 ± 0.352
ThrCys: 0.525 ± 0.106
ThrAsp: 2.967 ± 0.317
ThrGlu: 3.081 ± 0.293
ThrPhe: 2.533 ± 0.268
ThrGly: 6.116 ± 0.784
ThrHis: 1.05 ± 0.159
ThrIle: 3.286 ± 0.291
ThrLys: 3.332 ± 0.257
ThrLeu: 5.066 ± 0.348
ThrMet: 1.575 ± 0.186
ThrAsn: 2.214 ± 0.308
ThrPro: 3.012 ± 0.255
ThrGln: 1.689 ± 0.157
ThrArg: 2.99 ± 0.275
ThrSer: 2.465 ± 0.253
ThrThr: 3.674 ± 0.469
ThrVal: 4.519 ± 0.376
ThrTrp: 0.936 ± 0.137
ThrTyr: 1.963 ± 0.211
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.089 ± 0.33
ValCys: 1.073 ± 0.188
ValAsp: 5.044 ± 0.365
ValGlu: 5.317 ± 0.372
ValPhe: 2.67 ± 0.248
ValGly: 4.724 ± 0.34
ValHis: 0.89 ± 0.137
ValIle: 4.29 ± 0.281
ValLys: 4.793 ± 0.315
ValLeu: 5.112 ± 0.386
ValMet: 2.145 ± 0.183
ValAsn: 3.172 ± 0.393
ValPro: 2.693 ± 0.229
ValGln: 2.624 ± 0.239
ValArg: 3.515 ± 0.328
ValSer: 3.697 ± 0.322
ValThr: 4.633 ± 0.455
ValVal: 5.386 ± 0.415
ValTrp: 1.21 ± 0.172
ValTyr: 2.876 ± 0.283
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.438 ± 0.181
TrpCys: 0.228 ± 0.084
TrpAsp: 1.095 ± 0.15
TrpGlu: 1.301 ± 0.158
TrpPhe: 0.753 ± 0.12
TrpGly: 0.959 ± 0.119
TrpHis: 0.32 ± 0.073
TrpIle: 0.844 ± 0.126
TrpLys: 1.027 ± 0.177
TrpLeu: 1.278 ± 0.182
TrpMet: 0.525 ± 0.106
TrpAsn: 0.662 ± 0.118
TrpPro: 0.479 ± 0.086
TrpGln: 0.479 ± 0.098
TrpArg: 0.913 ± 0.125
TrpSer: 0.913 ± 0.139
TrpThr: 0.89 ± 0.131
TrpVal: 1.346 ± 0.201
TrpTrp: 0.274 ± 0.076
TrpTyr: 0.822 ± 0.133
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.442 ± 0.226
TyrCys: 0.525 ± 0.115
TyrAsp: 2.898 ± 0.294
TyrGlu: 2.214 ± 0.228
TyrPhe: 1.803 ± 0.205
TyrGly: 2.876 ± 0.267
TyrHis: 1.141 ± 0.167
TyrIle: 1.666 ± 0.212
TyrLys: 2.1 ± 0.237
TyrLeu: 3.583 ± 0.309
TyrMet: 0.89 ± 0.145
TyrAsn: 1.643 ± 0.206
TyrPro: 1.62 ± 0.198
TyrGln: 1.506 ± 0.179
TyrArg: 2.373 ± 0.256
TyrSer: 1.894 ± 0.212
TyrThr: 1.917 ± 0.229
TyrVal: 2.876 ± 0.292
TyrTrp: 0.479 ± 0.113
TyrTyr: 1.232 ± 0.165
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 265 proteins (43819 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
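A protein-level bootstrap of this kind can be sketched as follows. This is a generic nonparametric bootstrap written under stated assumptions, not the actual Proteome-pI code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is taken as the error. The function names, the fixed seed, the normalisation by dipeptide positions, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    keeping the number of proteins equal to the original set.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Made-up toy sequences, not taken from the CR3 proteome.
toy_proteins = ["MAAAAK", "MKGLVL", "MGAALK", "MSTVWY"]
mean_aa, err_aa = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins.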

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
This proteome is also available in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski