Amino acid dipepetide frequency for Serratia phage MTx

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.821AlaAla: 10.821 ± 0.821
0.851AlaCys: 0.851 ± 0.191
6.19AlaAsp: 6.19 ± 0.509
5.765AlaGlu: 5.765 ± 0.765
3.072AlaPhe: 3.072 ± 0.352
6.001AlaGly: 6.001 ± 0.493
1.937AlaHis: 1.937 ± 0.301
6.143AlaIle: 6.143 ± 0.544
6.143AlaLys: 6.143 ± 0.763
7.183AlaLeu: 7.183 ± 0.583
2.221AlaMet: 2.221 ± 0.315
4.395AlaAsn: 4.395 ± 0.51
3.213AlaPro: 3.213 ± 0.29
3.45AlaGln: 3.45 ± 0.45
4.678AlaArg: 4.678 ± 0.503
5.671AlaSer: 5.671 ± 0.644
5.245AlaThr: 5.245 ± 0.494
5.907AlaVal: 5.907 ± 0.533
1.04AlaTrp: 1.04 ± 0.2
3.119AlaTyr: 3.119 ± 0.327
0.0AlaXaa: 0.0 ± 0.0
Cys
0.756CysAla: 0.756 ± 0.199
0.236CysCys: 0.236 ± 0.11
0.945CysAsp: 0.945 ± 0.266
0.473CysGlu: 0.473 ± 0.147
0.567CysPhe: 0.567 ± 0.161
0.662CysGly: 0.662 ± 0.175
0.331CysHis: 0.331 ± 0.141
0.378CysIle: 0.378 ± 0.125
0.614CysLys: 0.614 ± 0.171
0.803CysLeu: 0.803 ± 0.218
0.425CysMet: 0.425 ± 0.113
0.473CysAsn: 0.473 ± 0.151
0.709CysPro: 0.709 ± 0.157
0.378CysGln: 0.378 ± 0.117
0.756CysArg: 0.756 ± 0.205
0.52CysSer: 0.52 ± 0.134
0.945CysThr: 0.945 ± 0.209
0.803CysVal: 0.803 ± 0.201
0.284CysTrp: 0.284 ± 0.094
0.662CysTyr: 0.662 ± 0.163
0.0CysXaa: 0.0 ± 0.0
Asp
5.86AspAla: 5.86 ± 0.442
0.662AspCys: 0.662 ± 0.184
3.591AspAsp: 3.591 ± 0.377
4.442AspGlu: 4.442 ± 0.597
2.788AspPhe: 2.788 ± 0.336
5.86AspGly: 5.86 ± 0.575
0.945AspHis: 0.945 ± 0.196
3.875AspIle: 3.875 ± 0.404
3.024AspLys: 3.024 ± 0.426
5.009AspLeu: 5.009 ± 0.423
1.701AspMet: 1.701 ± 0.263
2.788AspAsn: 2.788 ± 0.339
2.552AspPro: 2.552 ± 0.383
1.89AspGln: 1.89 ± 0.283
2.552AspArg: 2.552 ± 0.377
3.591AspSer: 3.591 ± 0.401
3.213AspThr: 3.213 ± 0.383
4.962AspVal: 4.962 ± 0.489
0.898AspTrp: 0.898 ± 0.236
1.985AspTyr: 1.985 ± 0.305
0.0AspXaa: 0.0 ± 0.0
Glu
6.427GluAla: 6.427 ± 0.864
0.473GluCys: 0.473 ± 0.188
3.355GluAsp: 3.355 ± 0.432
4.206GluGlu: 4.206 ± 0.651
2.552GluPhe: 2.552 ± 0.384
3.45GluGly: 3.45 ± 0.37
1.418GluHis: 1.418 ± 0.257
3.213GluIle: 3.213 ± 0.332
3.213GluLys: 3.213 ± 0.525
5.056GluLeu: 5.056 ± 0.532
1.985GluMet: 1.985 ± 0.28
2.41GluAsn: 2.41 ± 0.324
2.363GluPro: 2.363 ± 0.397
2.977GluGln: 2.977 ± 0.427
3.497GluArg: 3.497 ± 0.628
4.3GluSer: 4.3 ± 0.544
2.694GluThr: 2.694 ± 0.379
3.969GluVal: 3.969 ± 0.453
1.04GluTrp: 1.04 ± 0.222
2.315GluTyr: 2.315 ± 0.321
0.0GluXaa: 0.0 ± 0.0
Phe
3.213PheAla: 3.213 ± 0.401
0.425PheCys: 0.425 ± 0.14
3.072PheAsp: 3.072 ± 0.424
2.788PheGlu: 2.788 ± 0.41
1.323PhePhe: 1.323 ± 0.288
3.969PheGly: 3.969 ± 0.422
0.52PheHis: 0.52 ± 0.156
2.694PheIle: 2.694 ± 0.326
2.883PheLys: 2.883 ± 0.45
2.363PheLeu: 2.363 ± 0.396
1.323PheMet: 1.323 ± 0.255
2.788PheAsn: 2.788 ± 0.301
1.276PhePro: 1.276 ± 0.246
1.843PheGln: 1.843 ± 0.284
1.89PheArg: 1.89 ± 0.284
2.552PheSer: 2.552 ± 0.37
2.363PheThr: 2.363 ± 0.419
2.504PheVal: 2.504 ± 0.333
0.662PheTrp: 0.662 ± 0.19
1.559PheTyr: 1.559 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
6.238GlyAla: 6.238 ± 0.593
1.087GlyCys: 1.087 ± 0.211
5.103GlyAsp: 5.103 ± 0.472
4.82GlyGlu: 4.82 ± 0.45
3.639GlyPhe: 3.639 ± 0.387
5.482GlyGly: 5.482 ± 0.725
1.087GlyHis: 1.087 ± 0.25
5.056GlyIle: 5.056 ± 0.503
5.009GlyLys: 5.009 ± 0.625
4.631GlyLeu: 4.631 ± 0.476
2.504GlyMet: 2.504 ± 0.371
3.119GlyAsn: 3.119 ± 0.579
1.512GlyPro: 1.512 ± 0.297
2.599GlyGln: 2.599 ± 0.371
4.064GlyArg: 4.064 ± 0.497
3.213GlySer: 3.213 ± 0.552
3.969GlyThr: 3.969 ± 0.672
5.387GlyVal: 5.387 ± 0.509
1.37GlyTrp: 1.37 ± 0.238
2.788GlyTyr: 2.788 ± 0.425
0.0GlyXaa: 0.0 ± 0.0
His
1.229HisAla: 1.229 ± 0.246
0.378HisCys: 0.378 ± 0.15
0.945HisAsp: 0.945 ± 0.163
1.134HisGlu: 1.134 ± 0.218
0.851HisPhe: 0.851 ± 0.212
1.465HisGly: 1.465 ± 0.289
0.473HisHis: 0.473 ± 0.189
1.04HisIle: 1.04 ± 0.213
1.134HisLys: 1.134 ± 0.254
1.512HisLeu: 1.512 ± 0.295
0.662HisMet: 0.662 ± 0.165
0.709HisAsn: 0.709 ± 0.155
0.662HisPro: 0.662 ± 0.176
0.709HisGln: 0.709 ± 0.185
0.945HisArg: 0.945 ± 0.197
1.087HisSer: 1.087 ± 0.232
0.803HisThr: 0.803 ± 0.195
0.803HisVal: 0.803 ± 0.207
0.284HisTrp: 0.284 ± 0.108
0.709HisTyr: 0.709 ± 0.17
0.0HisXaa: 0.0 ± 0.0
Ile
4.725IleAla: 4.725 ± 0.478
0.945IleCys: 0.945 ± 0.239
5.954IleAsp: 5.954 ± 0.61
3.686IleGlu: 3.686 ± 0.36
2.126IlePhe: 2.126 ± 0.308
3.639IleGly: 3.639 ± 0.369
1.134IleHis: 1.134 ± 0.242
4.489IleIle: 4.489 ± 0.482
4.442IleLys: 4.442 ± 0.406
3.969IleLeu: 3.969 ± 0.42
1.985IleMet: 1.985 ± 0.301
4.064IleAsn: 4.064 ± 0.514
2.174IlePro: 2.174 ± 0.342
2.835IleGln: 2.835 ± 0.331
2.41IleArg: 2.41 ± 0.275
3.828IleSer: 3.828 ± 0.48
4.206IleThr: 4.206 ± 0.511
4.631IleVal: 4.631 ± 0.482
0.614IleTrp: 0.614 ± 0.16
2.032IleTyr: 2.032 ± 0.389
0.0IleXaa: 0.0 ± 0.0
Lys
7.513LysAla: 7.513 ± 0.8
0.662LysCys: 0.662 ± 0.181
3.875LysAsp: 3.875 ± 0.409
4.3LysGlu: 4.3 ± 0.868
2.694LysPhe: 2.694 ± 0.288
4.489LysGly: 4.489 ± 0.45
1.229LysHis: 1.229 ± 0.21
3.875LysIle: 3.875 ± 0.472
3.733LysLys: 3.733 ± 0.515
4.206LysLeu: 4.206 ± 0.38
2.032LysMet: 2.032 ± 0.255
2.41LysAsn: 2.41 ± 0.366
2.315LysPro: 2.315 ± 0.357
2.079LysGln: 2.079 ± 0.323
3.591LysArg: 3.591 ± 0.356
3.686LysSer: 3.686 ± 0.396
2.977LysThr: 2.977 ± 0.442
4.111LysVal: 4.111 ± 0.509
0.425LysTrp: 0.425 ± 0.134
2.174LysTyr: 2.174 ± 0.389
0.0LysXaa: 0.0 ± 0.0
Leu
6.71LeuAla: 6.71 ± 0.661
1.323LeuCys: 1.323 ± 0.317
3.875LeuAsp: 3.875 ± 0.43
4.158LeuGlu: 4.158 ± 0.461
2.93LeuPhe: 2.93 ± 0.373
4.631LeuGly: 4.631 ± 0.516
0.851LeuHis: 0.851 ± 0.186
4.631LeuIle: 4.631 ± 0.458
5.245LeuLys: 5.245 ± 0.642
4.82LeuLeu: 4.82 ± 0.484
2.552LeuMet: 2.552 ± 0.393
4.017LeuAsn: 4.017 ± 0.494
3.45LeuPro: 3.45 ± 0.397
3.024LeuGln: 3.024 ± 0.335
3.402LeuArg: 3.402 ± 0.448
4.914LeuSer: 4.914 ± 0.576
4.489LeuThr: 4.489 ± 0.561
4.584LeuVal: 4.584 ± 0.592
0.851LeuTrp: 0.851 ± 0.182
2.174LeuTyr: 2.174 ± 0.265
0.0LeuXaa: 0.0 ± 0.0
Met
3.072MetAla: 3.072 ± 0.421
0.142MetCys: 0.142 ± 0.077
1.465MetAsp: 1.465 ± 0.255
2.126MetGlu: 2.126 ± 0.316
0.898MetPhe: 0.898 ± 0.199
1.276MetGly: 1.276 ± 0.283
0.709MetHis: 0.709 ± 0.183
0.992MetIle: 0.992 ± 0.21
2.126MetLys: 2.126 ± 0.347
2.504MetLeu: 2.504 ± 0.311
0.709MetMet: 0.709 ± 0.174
1.465MetAsn: 1.465 ± 0.285
1.937MetPro: 1.937 ± 0.336
1.276MetGln: 1.276 ± 0.286
1.418MetArg: 1.418 ± 0.207
2.552MetSer: 2.552 ± 0.319
2.174MetThr: 2.174 ± 0.317
1.89MetVal: 1.89 ± 0.325
0.236MetTrp: 0.236 ± 0.098
0.992MetTyr: 0.992 ± 0.213
0.0MetXaa: 0.0 ± 0.0
Asn
4.489AsnAla: 4.489 ± 0.506
0.803AsnCys: 0.803 ± 0.193
2.599AsnAsp: 2.599 ± 0.381
1.985AsnGlu: 1.985 ± 0.279
2.41AsnPhe: 2.41 ± 0.337
4.867AsnGly: 4.867 ± 0.518
0.851AsnHis: 0.851 ± 0.161
3.355AsnIle: 3.355 ± 0.447
3.072AsnLys: 3.072 ± 0.418
3.166AsnLeu: 3.166 ± 0.426
1.229AsnMet: 1.229 ± 0.246
2.032AsnAsn: 2.032 ± 0.374
2.883AsnPro: 2.883 ± 0.371
1.985AsnGln: 1.985 ± 0.315
2.363AsnArg: 2.363 ± 0.388
2.835AsnSer: 2.835 ± 0.398
2.552AsnThr: 2.552 ± 0.307
3.544AsnVal: 3.544 ± 0.375
0.709AsnTrp: 0.709 ± 0.175
1.701AsnTyr: 1.701 ± 0.29
0.0AsnXaa: 0.0 ± 0.0
Pro
2.977ProAla: 2.977 ± 0.363
0.378ProCys: 0.378 ± 0.146
3.166ProAsp: 3.166 ± 0.373
2.835ProGlu: 2.835 ± 0.402
2.079ProPhe: 2.079 ± 0.246
2.977ProGly: 2.977 ± 0.461
0.709ProHis: 0.709 ± 0.181
2.646ProIle: 2.646 ± 0.344
2.268ProLys: 2.268 ± 0.265
2.504ProLeu: 2.504 ± 0.251
1.418ProMet: 1.418 ± 0.311
2.315ProAsn: 2.315 ± 0.368
0.851ProPro: 0.851 ± 0.217
1.37ProGln: 1.37 ± 0.274
1.229ProArg: 1.229 ± 0.253
2.552ProSer: 2.552 ± 0.467
2.126ProThr: 2.126 ± 0.317
2.835ProVal: 2.835 ± 0.338
0.756ProTrp: 0.756 ± 0.22
1.323ProTyr: 1.323 ± 0.246
0.0ProXaa: 0.0 ± 0.0
Gln
3.591GlnAla: 3.591 ± 0.517
0.236GlnCys: 0.236 ± 0.086
1.748GlnAsp: 1.748 ± 0.325
2.363GlnGlu: 2.363 ± 0.406
2.599GlnPhe: 2.599 ± 0.305
2.504GlnGly: 2.504 ± 0.292
0.473GlnHis: 0.473 ± 0.12
2.268GlnIle: 2.268 ± 0.354
2.079GlnLys: 2.079 ± 0.344
3.308GlnLeu: 3.308 ± 0.384
1.229GlnMet: 1.229 ± 0.198
1.181GlnAsn: 1.181 ± 0.231
1.559GlnPro: 1.559 ± 0.309
1.937GlnGln: 1.937 ± 0.378
2.268GlnArg: 2.268 ± 0.339
2.504GlnSer: 2.504 ± 0.357
2.93GlnThr: 2.93 ± 0.424
2.552GlnVal: 2.552 ± 0.304
0.425GlnTrp: 0.425 ± 0.142
1.181GlnTyr: 1.181 ± 0.219
0.0GlnXaa: 0.0 ± 0.0
Arg
4.962ArgAla: 4.962 ± 0.55
0.614ArgCys: 0.614 ± 0.186
2.646ArgAsp: 2.646 ± 0.386
3.308ArgGlu: 3.308 ± 0.432
2.268ArgPhe: 2.268 ± 0.339
2.835ArgGly: 2.835 ± 0.376
1.181ArgHis: 1.181 ± 0.236
3.308ArgIle: 3.308 ± 0.439
3.024ArgLys: 3.024 ± 0.414
4.111ArgLeu: 4.111 ± 0.439
1.323ArgMet: 1.323 ± 0.239
2.268ArgAsn: 2.268 ± 0.35
2.126ArgPro: 2.126 ± 0.273
2.174ArgGln: 2.174 ± 0.475
2.883ArgArg: 2.883 ± 0.438
2.221ArgSer: 2.221 ± 0.324
2.646ArgThr: 2.646 ± 0.394
3.213ArgVal: 3.213 ± 0.341
0.898ArgTrp: 0.898 ± 0.219
1.843ArgTyr: 1.843 ± 0.326
0.0ArgXaa: 0.0 ± 0.0
Ser
5.529SerAla: 5.529 ± 0.711
0.756SerCys: 0.756 ± 0.178
3.544SerAsp: 3.544 ± 0.448
3.733SerGlu: 3.733 ± 0.48
2.504SerPhe: 2.504 ± 0.402
5.056SerGly: 5.056 ± 0.467
0.945SerHis: 0.945 ± 0.261
4.442SerIle: 4.442 ± 0.543
3.639SerLys: 3.639 ± 0.331
4.253SerLeu: 4.253 ± 0.514
1.607SerMet: 1.607 ± 0.267
3.45SerAsn: 3.45 ± 0.427
2.174SerPro: 2.174 ± 0.277
1.843SerGln: 1.843 ± 0.285
2.363SerArg: 2.363 ± 0.341
4.111SerSer: 4.111 ± 0.436
3.355SerThr: 3.355 ± 0.507
4.678SerVal: 4.678 ± 0.541
1.04SerTrp: 1.04 ± 0.203
1.748SerTyr: 1.748 ± 0.264
0.0SerXaa: 0.0 ± 0.0
Thr
4.773ThrAla: 4.773 ± 0.423
0.473ThrCys: 0.473 ± 0.162
2.41ThrAsp: 2.41 ± 0.284
2.646ThrGlu: 2.646 ± 0.387
2.079ThrPhe: 2.079 ± 0.3
5.009ThrGly: 5.009 ± 0.583
1.04ThrHis: 1.04 ± 0.209
4.064ThrIle: 4.064 ± 0.43
3.166ThrLys: 3.166 ± 0.337
4.82ThrLeu: 4.82 ± 0.481
1.607ThrMet: 1.607 ± 0.34
2.599ThrAsn: 2.599 ± 0.394
3.072ThrPro: 3.072 ± 0.377
2.126ThrGln: 2.126 ± 0.352
3.213ThrArg: 3.213 ± 0.437
3.828ThrSer: 3.828 ± 0.366
4.206ThrThr: 4.206 ± 0.701
3.78ThrVal: 3.78 ± 0.514
0.945ThrTrp: 0.945 ± 0.249
1.748ThrTyr: 1.748 ± 0.326
0.0ThrXaa: 0.0 ± 0.0
Val
6.427ValAla: 6.427 ± 0.594
0.851ValCys: 0.851 ± 0.191
4.442ValAsp: 4.442 ± 0.434
3.497ValGlu: 3.497 ± 0.539
2.315ValPhe: 2.315 ± 0.293
5.103ValGly: 5.103 ± 0.502
0.898ValHis: 0.898 ± 0.184
4.347ValIle: 4.347 ± 0.434
4.725ValLys: 4.725 ± 0.521
5.482ValLeu: 5.482 ± 0.542
1.654ValMet: 1.654 ± 0.251
3.639ValAsn: 3.639 ± 0.345
2.93ValPro: 2.93 ± 0.383
2.41ValGln: 2.41 ± 0.349
3.544ValArg: 3.544 ± 0.406
3.875ValSer: 3.875 ± 0.421
4.064ValThr: 4.064 ± 0.552
4.82ValVal: 4.82 ± 0.569
0.473ValTrp: 0.473 ± 0.159
2.032ValTyr: 2.032 ± 0.303
0.0ValXaa: 0.0 ± 0.0
Trp
1.229TrpAla: 1.229 ± 0.21
0.095TrpCys: 0.095 ± 0.055
0.803TrpAsp: 0.803 ± 0.203
0.662TrpGlu: 0.662 ± 0.157
0.567TrpPhe: 0.567 ± 0.155
0.803TrpGly: 0.803 ± 0.193
0.047TrpHis: 0.047 ± 0.051
0.898TrpIle: 0.898 ± 0.19
0.662TrpLys: 0.662 ± 0.159
1.134TrpLeu: 1.134 ± 0.232
0.52TrpMet: 0.52 ± 0.139
0.851TrpAsn: 0.851 ± 0.177
0.425TrpPro: 0.425 ± 0.149
0.614TrpGln: 0.614 ± 0.179
1.087TrpArg: 1.087 ± 0.232
0.662TrpSer: 0.662 ± 0.183
1.087TrpThr: 1.087 ± 0.21
0.709TrpVal: 0.709 ± 0.198
0.284TrpTrp: 0.284 ± 0.12
0.709TrpTyr: 0.709 ± 0.188
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.504TyrAla: 2.504 ± 0.326
0.331TyrCys: 0.331 ± 0.142
2.457TyrAsp: 2.457 ± 0.327
1.985TyrGlu: 1.985 ± 0.323
1.748TyrPhe: 1.748 ± 0.306
2.599TyrGly: 2.599 ± 0.333
0.756TyrHis: 0.756 ± 0.171
2.174TyrIle: 2.174 ± 0.332
2.174TyrLys: 2.174 ± 0.316
1.796TyrLeu: 1.796 ± 0.282
1.229TyrMet: 1.229 ± 0.239
2.457TyrAsn: 2.457 ± 0.283
1.134TyrPro: 1.134 ± 0.272
1.418TyrGln: 1.418 ± 0.273
1.748TyrArg: 1.748 ± 0.277
2.268TyrSer: 2.268 ± 0.274
1.559TyrThr: 1.559 ± 0.256
1.843TyrVal: 1.843 ± 0.3
0.614TyrTrp: 0.614 ± 0.161
0.992TyrTyr: 0.992 ± 0.24
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 103 proteins (21163 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski