Amino acid dipeptide frequency for Salmonella phage SE5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
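The calculation described above can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names `dipeptide_permille` and `classify` are hypothetical, and counting overlapping residue pairs within each protein and normalising by the total number of pairs are assumptions about how such a table could be built.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (assumption).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input (made-up sequences, not taken from the phage proteome).
toy = ["MKAAL", "AAK"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With this convention, a table value such as AlaAla: 7.079 would be labelled "over" and CysCys: 0.202 "under".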

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
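As a worked example of the unit conversion, the AlaAla value from the table and the uniform expectation for 400 dipeptides can be put on the same scale (the symbols $f_{\mathrm{AlaAla}}$ and $f_{\mathrm{expected}}$ are introduced here only for illustration):

```latex
\[
f_{\mathrm{AlaAla}} = 7.079 \times 10^{-3} = 0.7079\,\%
\qquad
f_{\mathrm{expected}} = \frac{1}{400} = 2.5 \times 10^{-3} = 0.25\,\%
\]
```

On the scale used in the table, the uniform expectation therefore corresponds to a value of 2.5.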

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.079 ± 0.882
AlaCys: 1.092 ± 0.217
AlaAsp: 4.085 ± 0.429
AlaGlu: 5.663 ± 0.467
AlaPhe: 2.589 ± 0.282
AlaGly: 6.067 ± 0.725
AlaHis: 1.497 ± 0.254
AlaIle: 4.894 ± 0.496
AlaLys: 5.703 ± 0.516
AlaLeu: 6.108 ± 0.643
AlaMet: 2.831 ± 0.38
AlaAsn: 3.64 ± 0.414
AlaPro: 1.861 ± 0.244
AlaGln: 2.751 ± 0.403
AlaArg: 3.236 ± 0.449
AlaSer: 4.935 ± 0.541
AlaThr: 4.935 ± 0.666
AlaVal: 5.663 ± 0.513
AlaTrp: 1.011 ± 0.206
AlaTyr: 3.398 ± 0.368
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.688 ± 0.178
CysCys: 0.202 ± 0.083
CysAsp: 0.688 ± 0.197
CysGlu: 0.93 ± 0.208
CysPhe: 0.647 ± 0.148
CysGly: 0.688 ± 0.172
CysHis: 0.404 ± 0.112
CysIle: 0.647 ± 0.167
CysLys: 0.769 ± 0.153
CysLeu: 0.728 ± 0.163
CysMet: 0.202 ± 0.084
CysAsn: 0.566 ± 0.165
CysPro: 0.526 ± 0.154
CysGln: 0.607 ± 0.16
CysArg: 0.364 ± 0.137
CysSer: 0.728 ± 0.157
CysThr: 0.404 ± 0.156
CysVal: 0.647 ± 0.165
CysTrp: 0.04 ± 0.046
CysTyr: 0.849 ± 0.208
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.38 ± 0.42
AspCys: 0.607 ± 0.145
AspAsp: 4.288 ± 0.536
AspGlu: 4.733 ± 0.532
AspPhe: 3.236 ± 0.337
AspGly: 5.097 ± 0.519
AspHis: 0.849 ± 0.186
AspIle: 3.56 ± 0.336
AspLys: 4.005 ± 0.355
AspLeu: 4.733 ± 0.476
AspMet: 1.578 ± 0.239
AspAsn: 3.883 ± 0.467
AspPro: 1.699 ± 0.249
AspGln: 1.213 ± 0.201
AspArg: 2.022 ± 0.315
AspSer: 4.854 ± 0.527
AspThr: 3.357 ± 0.39
AspVal: 4.409 ± 0.423
AspTrp: 1.052 ± 0.251
AspTyr: 2.791 ± 0.364
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.611 ± 0.529
GluCys: 0.566 ± 0.185
GluAsp: 3.681 ± 0.385
GluGlu: 4.369 ± 0.505
GluPhe: 3.398 ± 0.299
GluGly: 4.085 ± 0.37
GluHis: 1.618 ± 0.271
GluIle: 4.247 ± 0.538
GluLys: 4.652 ± 0.51
GluLeu: 5.218 ± 0.447
GluMet: 1.982 ± 0.278
GluAsn: 3.236 ± 0.377
GluPro: 1.618 ± 0.299
GluGln: 2.387 ± 0.334
GluArg: 3.276 ± 0.342
GluSer: 4.733 ± 0.521
GluThr: 3.236 ± 0.397
GluVal: 4.207 ± 0.492
GluTrp: 0.809 ± 0.186
GluTyr: 2.022 ± 0.265
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.6 ± 0.385
PheCys: 0.324 ± 0.14
PheAsp: 3.034 ± 0.314
PheGlu: 2.831 ± 0.405
PhePhe: 2.144 ± 0.29
PheGly: 2.791 ± 0.347
PheHis: 0.324 ± 0.113
PheIle: 2.71 ± 0.347
PheLys: 2.831 ± 0.321
PheLeu: 2.791 ± 0.332
PheMet: 1.375 ± 0.285
PheAsn: 2.022 ± 0.326
PhePro: 1.78 ± 0.271
PheGln: 1.294 ± 0.217
PheArg: 1.699 ± 0.252
PheSer: 2.831 ± 0.319
PheThr: 3.276 ± 0.414
PheVal: 2.225 ± 0.337
PheTrp: 0.445 ± 0.135
PheTyr: 1.416 ± 0.222
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.461 ± 0.543
GlyCys: 1.092 ± 0.185
GlyAsp: 4.409 ± 0.412
GlyGlu: 4.247 ± 0.441
GlyPhe: 3.155 ± 0.336
GlyGly: 3.843 ± 0.535
GlyHis: 1.416 ± 0.205
GlyIle: 4.328 ± 0.381
GlyLys: 5.744 ± 0.476
GlyLeu: 6.351 ± 0.574
GlyMet: 1.618 ± 0.281
GlyAsn: 3.196 ± 0.375
GlyPro: 0.202 ± 0.099
GlyGln: 2.548 ± 0.348
GlyArg: 2.872 ± 0.328
GlySer: 4.692 ± 0.487
GlyThr: 3.802 ± 0.508
GlyVal: 5.744 ± 0.409
GlyTrp: 0.89 ± 0.195
GlyTyr: 3.519 ± 0.382
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.254 ± 0.203
HisCys: 0.202 ± 0.097
HisAsp: 1.456 ± 0.269
HisGlu: 1.133 ± 0.196
HisPhe: 0.688 ± 0.175
HisGly: 1.497 ± 0.235
HisHis: 0.404 ± 0.144
HisIle: 0.971 ± 0.212
HisLys: 1.658 ± 0.219
HisLeu: 1.335 ± 0.268
HisMet: 0.607 ± 0.149
HisAsn: 1.052 ± 0.205
HisPro: 0.849 ± 0.215
HisGln: 0.809 ± 0.172
HisArg: 1.133 ± 0.23
HisSer: 1.497 ± 0.333
HisThr: 1.011 ± 0.194
HisVal: 1.133 ± 0.191
HisTrp: 0.202 ± 0.09
HisTyr: 0.89 ± 0.199
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.097 ± 0.449
IleCys: 0.647 ± 0.161
IleAsp: 4.005 ± 0.367
IleGlu: 4.126 ± 0.415
IlePhe: 1.982 ± 0.257
IleGly: 4.166 ± 0.394
IleHis: 1.133 ± 0.21
IleIle: 3.64 ± 0.395
IleLys: 4.49 ± 0.472
IleLeu: 4.652 ± 0.49
IleMet: 1.335 ± 0.232
IleAsn: 4.045 ± 0.437
IlePro: 1.82 ± 0.316
IleGln: 2.467 ± 0.347
IleArg: 2.387 ± 0.311
IleSer: 3.883 ± 0.343
IleThr: 3.479 ± 0.353
IleVal: 3.883 ± 0.378
IleTrp: 0.728 ± 0.152
IleTyr: 1.82 ± 0.253
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.998 ± 0.607
LysCys: 0.607 ± 0.193
LysAsp: 4.247 ± 0.357
LysGlu: 4.288 ± 0.499
LysPhe: 2.387 ± 0.281
LysGly: 5.663 ± 0.481
LysHis: 0.809 ± 0.166
LysIle: 3.64 ± 0.419
LysLys: 4.854 ± 0.465
LysLeu: 5.056 ± 0.385
LysMet: 1.78 ± 0.253
LysAsn: 3.56 ± 0.38
LysPro: 2.306 ± 0.351
LysGln: 2.589 ± 0.349
LysArg: 3.6 ± 0.401
LysSer: 4.692 ± 0.367
LysThr: 4.692 ± 0.452
LysVal: 5.258 ± 0.442
LysTrp: 0.688 ± 0.148
LysTyr: 2.548 ± 0.345
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.755 ± 0.425
LeuCys: 0.849 ± 0.237
LeuAsp: 5.784 ± 0.473
LeuGlu: 5.38 ± 0.476
LeuPhe: 3.196 ± 0.307
LeuGly: 3.64 ± 0.498
LeuHis: 1.537 ± 0.249
LeuIle: 4.328 ± 0.37
LeuLys: 5.663 ± 0.535
LeuLeu: 5.663 ± 0.457
LeuMet: 2.71 ± 0.393
LeuAsn: 4.733 ± 0.412
LeuPro: 3.519 ± 0.352
LeuGln: 2.589 ± 0.352
LeuArg: 3.64 ± 0.47
LeuSer: 5.339 ± 0.472
LeuThr: 5.218 ± 0.438
LeuVal: 4.773 ± 0.437
LeuTrp: 1.335 ± 0.22
LeuTyr: 2.548 ± 0.276
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.467 ± 0.317
MetCys: 0.243 ± 0.1
MetAsp: 1.497 ± 0.269
MetGlu: 1.578 ± 0.215
MetPhe: 0.688 ± 0.171
MetGly: 1.861 ± 0.274
MetHis: 0.202 ± 0.088
MetIle: 1.78 ± 0.224
MetLys: 2.184 ± 0.272
MetLeu: 2.184 ± 0.275
MetMet: 0.647 ± 0.159
MetAsn: 2.022 ± 0.269
MetPro: 0.849 ± 0.182
MetGln: 1.254 ± 0.234
MetArg: 1.294 ± 0.259
MetSer: 2.103 ± 0.249
MetThr: 2.225 ± 0.244
MetVal: 1.578 ± 0.266
MetTrp: 0.04 ± 0.042
MetTyr: 1.052 ± 0.189
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.398 ± 0.443
AsnCys: 0.526 ± 0.149
AsnAsp: 2.467 ± 0.299
AsnGlu: 2.467 ± 0.305
AsnPhe: 2.103 ± 0.276
AsnGly: 4.652 ± 0.438
AsnHis: 1.254 ± 0.244
AsnIle: 3.762 ± 0.368
AsnLys: 3.196 ± 0.316
AsnLeu: 4.814 ± 0.382
AsnMet: 1.335 ± 0.247
AsnAsn: 2.387 ± 0.276
AsnPro: 3.155 ± 0.326
AsnGln: 1.861 ± 0.29
AsnArg: 2.548 ± 0.327
AsnSer: 3.64 ± 0.346
AsnThr: 2.831 ± 0.336
AsnVal: 2.953 ± 0.347
AsnTrp: 0.849 ± 0.194
AsnTyr: 1.861 ± 0.325
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.346 ± 0.323
ProCys: 0.202 ± 0.087
ProAsp: 2.751 ± 0.352
ProGlu: 2.953 ± 0.426
ProPhe: 2.022 ± 0.339
ProGly: 0.364 ± 0.155
ProHis: 0.647 ± 0.184
ProIle: 1.456 ± 0.224
ProLys: 1.942 ± 0.298
ProLeu: 2.629 ± 0.318
ProMet: 0.647 ± 0.162
ProAsn: 1.861 ± 0.283
ProPro: 0.849 ± 0.167
ProGln: 0.93 ± 0.215
ProArg: 1.335 ± 0.219
ProSer: 1.861 ± 0.319
ProThr: 2.346 ± 0.343
ProVal: 2.67 ± 0.398
ProTrp: 0.404 ± 0.135
ProTyr: 1.294 ± 0.222
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.872 ± 0.424
GlnCys: 0.445 ± 0.141
GlnAsp: 1.416 ± 0.259
GlnGlu: 1.82 ± 0.305
GlnPhe: 1.861 ± 0.292
GlnGly: 2.427 ± 0.373
GlnHis: 0.769 ± 0.199
GlnIle: 2.548 ± 0.315
GlnLys: 2.467 ± 0.299
GlnLeu: 3.357 ± 0.413
GlnMet: 1.294 ± 0.259
GlnAsn: 1.901 ± 0.322
GlnPro: 1.375 ± 0.233
GlnGln: 1.537 ± 0.277
GlnArg: 1.456 ± 0.209
GlnSer: 2.184 ± 0.306
GlnThr: 2.548 ± 0.328
GlnVal: 1.861 ± 0.296
GlnTrp: 0.607 ± 0.14
GlnTyr: 1.537 ± 0.226
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.155 ± 0.356
ArgCys: 0.607 ± 0.157
ArgAsp: 2.467 ± 0.385
ArgGlu: 2.467 ± 0.334
ArgPhe: 1.861 ± 0.304
ArgGly: 2.751 ± 0.358
ArgHis: 1.375 ± 0.238
ArgIle: 2.831 ± 0.297
ArgLys: 3.357 ± 0.389
ArgLeu: 3.196 ± 0.386
ArgMet: 1.618 ± 0.288
ArgAsn: 2.063 ± 0.286
ArgPro: 1.011 ± 0.186
ArgGln: 1.982 ± 0.275
ArgArg: 2.306 ± 0.354
ArgSer: 2.144 ± 0.319
ArgThr: 2.184 ± 0.292
ArgVal: 3.56 ± 0.396
ArgTrp: 1.011 ± 0.218
ArgTyr: 2.144 ± 0.345
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.854 ± 0.606
SerCys: 0.769 ± 0.173
SerAsp: 4.449 ± 0.368
SerGlu: 4.085 ± 0.451
SerPhe: 2.912 ± 0.326
SerGly: 5.825 ± 0.548
SerHis: 1.618 ± 0.349
SerIle: 3.155 ± 0.356
SerLys: 4.49 ± 0.483
SerLeu: 5.056 ± 0.422
SerMet: 1.942 ± 0.282
SerAsn: 3.034 ± 0.472
SerPro: 2.022 ± 0.266
SerGln: 2.831 ± 0.34
SerArg: 2.71 ± 0.396
SerSer: 4.854 ± 0.507
SerThr: 3.438 ± 0.449
SerVal: 5.461 ± 0.443
SerTrp: 0.607 ± 0.152
SerTyr: 2.953 ± 0.39
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.097 ± 0.456
ThrCys: 0.607 ± 0.138
ThrAsp: 4.045 ± 0.424
ThrGlu: 3.074 ± 0.357
ThrPhe: 2.427 ± 0.34
ThrGly: 5.42 ± 0.572
ThrHis: 1.294 ± 0.229
ThrIle: 4.126 ± 0.355
ThrLys: 3.115 ± 0.315
ThrLeu: 5.784 ± 0.556
ThrMet: 0.89 ± 0.162
ThrAsn: 2.71 ± 0.377
ThrPro: 2.225 ± 0.299
ThrGln: 2.265 ± 0.292
ThrArg: 2.387 ± 0.301
ThrSer: 4.085 ± 0.456
ThrThr: 4.369 ± 0.596
ThrVal: 5.056 ± 0.536
ThrTrp: 0.647 ± 0.208
ThrTyr: 2.346 ± 0.319
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.773 ± 0.54
ValCys: 1.011 ± 0.214
ValAsp: 4.733 ± 0.407
ValGlu: 4.733 ± 0.547
ValPhe: 2.306 ± 0.225
ValGly: 4.166 ± 0.495
ValHis: 1.133 ± 0.203
ValIle: 4.53 ± 0.413
ValLys: 5.056 ± 0.436
ValLeu: 4.814 ± 0.502
ValMet: 2.144 ± 0.324
ValAsn: 3.479 ± 0.4
ValPro: 2.225 ± 0.278
ValGln: 2.387 ± 0.291
ValArg: 3.276 ± 0.395
ValSer: 4.611 ± 0.464
ValThr: 5.097 ± 0.538
ValVal: 5.218 ± 0.643
ValTrp: 0.647 ± 0.21
ValTyr: 2.831 ± 0.316
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.93 ± 0.211
TrpCys: 0.121 ± 0.075
TrpAsp: 1.011 ± 0.225
TrpGlu: 0.728 ± 0.162
TrpPhe: 0.526 ± 0.134
TrpGly: 0.566 ± 0.165
TrpHis: 0.324 ± 0.119
TrpIle: 0.607 ± 0.226
TrpLys: 1.375 ± 0.253
TrpLeu: 1.254 ± 0.246
TrpMet: 0.324 ± 0.104
TrpAsn: 0.809 ± 0.202
TrpPro: 0.243 ± 0.107
TrpGln: 0.364 ± 0.115
TrpArg: 0.404 ± 0.133
TrpSer: 0.809 ± 0.215
TrpThr: 0.728 ± 0.191
TrpVal: 0.89 ± 0.172
TrpTrp: 0.364 ± 0.115
TrpTyr: 0.404 ± 0.11
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.225 ± 0.278
TyrCys: 0.607 ± 0.16
TyrAsp: 2.831 ± 0.313
TyrGlu: 2.346 ± 0.338
TyrPhe: 1.618 ± 0.266
TyrGly: 3.56 ± 0.332
TyrHis: 1.173 ± 0.174
TyrIle: 2.022 ± 0.369
TyrLys: 2.71 ± 0.319
TyrLeu: 3.479 ± 0.342
TyrMet: 0.849 ± 0.218
TyrAsn: 1.861 ± 0.241
TyrPro: 1.375 ± 0.238
TyrGln: 1.618 ± 0.265
TyrArg: 2.144 ± 0.309
TyrSer: 2.589 ± 0.352
TyrThr: 2.791 ± 0.301
TyrVal: 2.022 ± 0.325
TyrTrp: 0.404 ± 0.117
TyrTyr: 1.537 ± 0.295
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 115 proteins (24723 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
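Protein-level bootstrapping can be sketched as below. This is an illustrative sketch only: the function names `pair_counts` and `bootstrap_permille` are hypothetical, and reporting the mean and sample standard deviation over the replicates is an assumption about what the "±" values represent, not a description of the Proteome-pI code.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, pair, n_boot=100, seed=0):
    """Estimate the frequency of `pair` (per mille) and its bootstrap spread.

    Whole proteins are resampled with replacement, so residue pairs from
    the same protein always stay together in a replicate.
    """
    rng = random.Random(seed)
    # Pre-compute (hits, number of pairs) once per protein.
    per_protein = [(pair_counts(s)[pair], max(len(s) - 1, 0)) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(n for _, n in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (made-up sequences, not taken from the phage proteome).
proteins = ["MKAAL", "AAK", "GGAAV", "MLLK"]
mean_aa, sd_aa = bootstrap_permille(proteins, "AA")
mean_xx, sd_xx = bootstrap_permille(proteins, "XX")
print(f"AA: {mean_aa:.3f} ± {sd_aa:.3f}")
print(f"XX: {mean_xx:.1f} ± {sd_xx:.1f}")
```

A dipeptide that never occurs yields zero in every replicate, which is consistent with the 0.0 ± 0.0 entries in the Xaa rows and columns.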

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski