Amino acid dipepetide frequency for Staphylococcus phage phiSP38-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.027AlaAla: 3.027 ± 1.183
0.222AlaCys: 0.222 ± 0.133
3.397AlaAsp: 3.397 ± 0.655
3.84AlaGlu: 3.84 ± 0.479
1.846AlaPhe: 1.846 ± 0.26
3.47AlaGly: 3.47 ± 0.527
1.108AlaHis: 1.108 ± 0.291
4.947AlaIle: 4.947 ± 0.69
5.169AlaLys: 5.169 ± 0.711
4.726AlaLeu: 4.726 ± 0.64
1.403AlaMet: 1.403 ± 0.324
3.618AlaAsn: 3.618 ± 0.397
1.181AlaPro: 1.181 ± 0.234
2.067AlaGln: 2.067 ± 0.36
2.437AlaArg: 2.437 ± 0.417
3.692AlaSer: 3.692 ± 0.943
3.47AlaThr: 3.47 ± 0.575
3.47AlaVal: 3.47 ± 0.499
0.591AlaTrp: 0.591 ± 0.302
1.994AlaTyr: 1.994 ± 0.425
0.0AlaXaa: 0.0 ± 0.0
Cys
0.369CysAla: 0.369 ± 0.175
0.0CysCys: 0.0 ± 0.0
0.148CysAsp: 0.148 ± 0.098
0.295CysGlu: 0.295 ± 0.159
0.074CysPhe: 0.074 ± 0.083
0.295CysGly: 0.295 ± 0.143
0.295CysHis: 0.295 ± 0.172
0.517CysIle: 0.517 ± 0.203
0.369CysLys: 0.369 ± 0.155
0.222CysLeu: 0.222 ± 0.142
0.148CysMet: 0.148 ± 0.118
0.369CysAsn: 0.369 ± 0.258
0.148CysPro: 0.148 ± 0.166
0.148CysGln: 0.148 ± 0.104
0.222CysArg: 0.222 ± 0.121
0.443CysSer: 0.443 ± 0.163
0.222CysThr: 0.222 ± 0.125
0.222CysVal: 0.222 ± 0.125
0.0CysTrp: 0.0 ± 0.0
0.295CysTyr: 0.295 ± 0.151
0.0CysXaa: 0.0 ± 0.0
Asp
2.806AspAla: 2.806 ± 0.409
0.074AspCys: 0.074 ± 0.065
5.169AspAsp: 5.169 ± 0.821
6.793AspGlu: 6.793 ± 0.844
3.249AspPhe: 3.249 ± 0.629
4.061AspGly: 4.061 ± 0.489
0.812AspHis: 0.812 ± 0.247
4.947AspIle: 4.947 ± 0.73
5.538AspLys: 5.538 ± 0.642
4.873AspLeu: 4.873 ± 0.615
1.551AspMet: 1.551 ± 0.315
3.101AspAsn: 3.101 ± 0.43
1.92AspPro: 1.92 ± 0.357
1.181AspGln: 1.181 ± 0.355
2.511AspArg: 2.511 ± 0.59
3.47AspSer: 3.47 ± 0.493
4.283AspThr: 4.283 ± 0.565
5.169AspVal: 5.169 ± 0.537
0.812AspTrp: 0.812 ± 0.221
4.061AspTyr: 4.061 ± 0.505
0.0AspXaa: 0.0 ± 0.0
Glu
4.283GluAla: 4.283 ± 0.486
0.517GluCys: 0.517 ± 0.195
3.987GluAsp: 3.987 ± 0.422
6.941GluGlu: 6.941 ± 1.206
2.732GluPhe: 2.732 ± 0.468
2.363GluGly: 2.363 ± 0.409
1.108GluHis: 1.108 ± 0.348
6.202GluIle: 6.202 ± 1.017
6.202GluLys: 6.202 ± 0.898
7.901GluLeu: 7.901 ± 0.903
3.323GluMet: 3.323 ± 0.534
4.578GluAsn: 4.578 ± 0.724
1.846GluPro: 1.846 ± 0.364
3.175GluGln: 3.175 ± 0.559
3.175GluArg: 3.175 ± 0.62
5.021GluSer: 5.021 ± 0.747
3.766GluThr: 3.766 ± 0.521
5.243GluVal: 5.243 ± 0.92
0.96GluTrp: 0.96 ± 0.234
3.766GluTyr: 3.766 ± 0.712
0.0GluXaa: 0.0 ± 0.0
Phe
1.551PheAla: 1.551 ± 0.361
0.0PheCys: 0.0 ± 0.0
2.806PheAsp: 2.806 ± 0.56
2.88PheGlu: 2.88 ± 0.588
1.108PhePhe: 1.108 ± 0.298
3.027PheGly: 3.027 ± 0.466
0.517PheHis: 0.517 ± 0.149
2.732PheIle: 2.732 ± 0.55
4.578PheLys: 4.578 ± 0.576
2.732PheLeu: 2.732 ± 0.669
1.181PheMet: 1.181 ± 0.364
3.397PheAsn: 3.397 ± 0.531
0.591PhePro: 0.591 ± 0.173
1.477PheGln: 1.477 ± 0.277
1.92PheArg: 1.92 ± 0.386
2.732PheSer: 2.732 ± 0.442
2.363PheThr: 2.363 ± 0.425
2.88PheVal: 2.88 ± 0.398
0.0PheTrp: 0.0 ± 0.0
1.772PheTyr: 1.772 ± 0.463
0.0PheXaa: 0.0 ± 0.0
Gly
3.101GlyAla: 3.101 ± 0.809
0.295GlyCys: 0.295 ± 0.138
3.47GlyAsp: 3.47 ± 0.533
3.47GlyGlu: 3.47 ± 0.5
2.954GlyPhe: 2.954 ± 0.458
4.209GlyGly: 4.209 ± 0.796
0.812GlyHis: 0.812 ± 0.269
4.578GlyIle: 4.578 ± 0.684
4.8GlyLys: 4.8 ± 0.687
5.316GlyLeu: 5.316 ± 1.176
1.255GlyMet: 1.255 ± 0.346
2.954GlyAsn: 2.954 ± 0.564
1.108GlyPro: 1.108 ± 0.38
1.624GlyGln: 1.624 ± 0.375
2.511GlyArg: 2.511 ± 0.736
3.101GlySer: 3.101 ± 0.568
3.323GlyThr: 3.323 ± 0.411
3.101GlyVal: 3.101 ± 0.474
1.034GlyTrp: 1.034 ± 0.405
3.249GlyTyr: 3.249 ± 0.566
0.0GlyXaa: 0.0 ± 0.0
His
0.812HisAla: 0.812 ± 0.263
0.222HisCys: 0.222 ± 0.143
1.034HisAsp: 1.034 ± 0.255
1.108HisGlu: 1.108 ± 0.294
0.886HisPhe: 0.886 ± 0.312
0.517HisGly: 0.517 ± 0.168
0.148HisHis: 0.148 ± 0.104
1.255HisIle: 1.255 ± 0.403
1.92HisLys: 1.92 ± 0.439
1.551HisLeu: 1.551 ± 0.342
0.443HisMet: 0.443 ± 0.178
0.96HisAsn: 0.96 ± 0.283
0.148HisPro: 0.148 ± 0.102
0.665HisGln: 0.665 ± 0.222
0.812HisArg: 0.812 ± 0.239
0.886HisSer: 0.886 ± 0.266
0.96HisThr: 0.96 ± 0.289
1.108HisVal: 1.108 ± 0.409
0.074HisTrp: 0.074 ± 0.081
0.591HisTyr: 0.591 ± 0.219
0.0HisXaa: 0.0 ± 0.0
Ile
5.095IleAla: 5.095 ± 0.602
0.517IleCys: 0.517 ± 0.208
6.202IleAsp: 6.202 ± 0.786
6.055IleGlu: 6.055 ± 0.903
2.658IlePhe: 2.658 ± 0.52
3.987IleGly: 3.987 ± 1.056
1.551IleHis: 1.551 ± 0.264
4.061IleIle: 4.061 ± 0.707
7.975IleLys: 7.975 ± 0.63
4.061IleLeu: 4.061 ± 0.59
1.255IleMet: 1.255 ± 0.339
5.538IleAsn: 5.538 ± 0.655
2.067IlePro: 2.067 ± 0.458
2.067IleGln: 2.067 ± 0.448
3.84IleArg: 3.84 ± 0.725
4.652IleSer: 4.652 ± 0.697
4.061IleThr: 4.061 ± 0.509
4.356IleVal: 4.356 ± 0.576
0.665IleTrp: 0.665 ± 0.336
2.584IleTyr: 2.584 ± 0.585
0.0IleXaa: 0.0 ± 0.0
Lys
5.169LysAla: 5.169 ± 0.62
0.148LysCys: 0.148 ± 0.108
6.498LysAsp: 6.498 ± 0.679
8.344LysGlu: 8.344 ± 1.29
3.692LysPhe: 3.692 ± 0.542
5.686LysGly: 5.686 ± 0.561
1.772LysHis: 1.772 ± 0.388
6.055LysIle: 6.055 ± 0.582
5.833LysLys: 5.833 ± 0.822
7.679LysLeu: 7.679 ± 0.878
2.141LysMet: 2.141 ± 0.48
5.612LysAsn: 5.612 ± 0.702
3.101LysPro: 3.101 ± 0.561
3.84LysGln: 3.84 ± 0.734
5.243LysArg: 5.243 ± 0.629
5.021LysSer: 5.021 ± 0.647
6.202LysThr: 6.202 ± 0.727
4.8LysVal: 4.8 ± 0.507
0.665LysTrp: 0.665 ± 0.276
3.397LysTyr: 3.397 ± 0.542
0.0LysXaa: 0.0 ± 0.0
Leu
4.43LeuAla: 4.43 ± 0.83
0.443LeuCys: 0.443 ± 0.235
4.209LeuAsp: 4.209 ± 0.643
6.129LeuGlu: 6.129 ± 0.785
2.88LeuPhe: 2.88 ± 0.371
4.504LeuGly: 4.504 ± 0.751
1.181LeuHis: 1.181 ± 0.305
6.202LeuIle: 6.202 ± 0.926
9.008LeuLys: 9.008 ± 1.168
6.129LeuLeu: 6.129 ± 0.669
1.772LeuMet: 1.772 ± 0.457
5.39LeuAsn: 5.39 ± 0.569
2.511LeuPro: 2.511 ± 0.493
3.249LeuGln: 3.249 ± 0.411
3.766LeuArg: 3.766 ± 0.578
5.981LeuSer: 5.981 ± 0.773
5.316LeuThr: 5.316 ± 0.705
3.618LeuVal: 3.618 ± 0.546
0.665LeuTrp: 0.665 ± 0.268
2.954LeuTyr: 2.954 ± 0.426
0.0LeuXaa: 0.0 ± 0.0
Met
1.255MetAla: 1.255 ± 0.294
0.0MetCys: 0.0 ± 0.0
1.551MetAsp: 1.551 ± 0.369
2.289MetGlu: 2.289 ± 0.497
0.886MetPhe: 0.886 ± 0.242
0.812MetGly: 0.812 ± 0.302
0.369MetHis: 0.369 ± 0.166
1.994MetIle: 1.994 ± 0.453
2.658MetLys: 2.658 ± 0.487
2.363MetLeu: 2.363 ± 0.35
0.665MetMet: 0.665 ± 0.216
1.477MetAsn: 1.477 ± 0.39
1.108MetPro: 1.108 ± 0.278
1.034MetGln: 1.034 ± 0.272
1.772MetArg: 1.772 ± 0.545
1.772MetSer: 1.772 ± 0.451
1.772MetThr: 1.772 ± 0.404
1.255MetVal: 1.255 ± 0.355
0.517MetTrp: 0.517 ± 0.186
1.034MetTyr: 1.034 ± 0.307
0.0MetXaa: 0.0 ± 0.0
Asn
3.397AsnAla: 3.397 ± 0.516
0.369AsnCys: 0.369 ± 0.193
4.135AsnAsp: 4.135 ± 0.659
4.578AsnGlu: 4.578 ± 0.66
2.289AsnPhe: 2.289 ± 0.384
5.021AsnGly: 5.021 ± 0.845
0.665AsnHis: 0.665 ± 0.277
3.84AsnIle: 3.84 ± 0.496
6.202AsnLys: 6.202 ± 0.738
4.504AsnLeu: 4.504 ± 0.479
1.698AsnMet: 1.698 ± 0.396
4.356AsnAsn: 4.356 ± 0.822
2.584AsnPro: 2.584 ± 0.407
2.215AsnGln: 2.215 ± 0.484
2.954AsnArg: 2.954 ± 0.431
3.913AsnSer: 3.913 ± 0.503
3.249AsnThr: 3.249 ± 0.75
4.061AsnVal: 4.061 ± 0.581
1.034AsnTrp: 1.034 ± 0.306
1.994AsnTyr: 1.994 ± 0.39
0.0AsnXaa: 0.0 ± 0.0
Pro
1.551ProAla: 1.551 ± 0.306
0.0ProCys: 0.0 ± 0.0
1.551ProAsp: 1.551 ± 0.371
2.511ProGlu: 2.511 ± 0.538
1.624ProPhe: 1.624 ± 0.316
1.181ProGly: 1.181 ± 0.238
0.591ProHis: 0.591 ± 0.191
2.067ProIle: 2.067 ± 0.451
2.141ProLys: 2.141 ± 0.432
2.437ProLeu: 2.437 ± 0.562
0.591ProMet: 0.591 ± 0.201
1.551ProAsn: 1.551 ± 0.329
0.96ProPro: 0.96 ± 0.276
1.624ProGln: 1.624 ± 0.433
0.665ProArg: 0.665 ± 0.245
1.403ProSer: 1.403 ± 0.389
1.403ProThr: 1.403 ± 0.289
1.329ProVal: 1.329 ± 0.278
0.148ProTrp: 0.148 ± 0.101
1.108ProTyr: 1.108 ± 0.286
0.0ProXaa: 0.0 ± 0.0
Gln
3.766GlnAla: 3.766 ± 0.494
0.222GlnCys: 0.222 ± 0.135
1.403GlnAsp: 1.403 ± 0.306
2.954GlnGlu: 2.954 ± 0.641
1.403GlnPhe: 1.403 ± 0.245
1.255GlnGly: 1.255 ± 0.319
0.591GlnHis: 0.591 ± 0.246
2.215GlnIle: 2.215 ± 0.409
2.954GlnLys: 2.954 ± 0.52
3.323GlnLeu: 3.323 ± 0.493
1.108GlnMet: 1.108 ± 0.321
1.92GlnAsn: 1.92 ± 0.398
1.329GlnPro: 1.329 ± 0.367
1.92GlnGln: 1.92 ± 0.427
1.624GlnArg: 1.624 ± 0.285
2.806GlnSer: 2.806 ± 0.521
2.289GlnThr: 2.289 ± 0.383
1.698GlnVal: 1.698 ± 0.398
0.517GlnTrp: 0.517 ± 0.157
1.92GlnTyr: 1.92 ± 0.315
0.0GlnXaa: 0.0 ± 0.0
Arg
1.624ArgAla: 1.624 ± 0.329
0.369ArgCys: 0.369 ± 0.219
2.88ArgAsp: 2.88 ± 0.447
3.397ArgGlu: 3.397 ± 0.526
2.067ArgPhe: 2.067 ± 0.394
2.732ArgGly: 2.732 ± 0.487
0.812ArgHis: 0.812 ± 0.369
3.397ArgIle: 3.397 ± 0.581
4.061ArgLys: 4.061 ± 0.537
4.8ArgLeu: 4.8 ± 0.604
1.846ArgMet: 1.846 ± 0.395
2.732ArgAsn: 2.732 ± 0.46
1.255ArgPro: 1.255 ± 0.281
1.994ArgGln: 1.994 ± 0.329
2.067ArgArg: 2.067 ± 0.379
2.437ArgSer: 2.437 ± 0.399
3.544ArgThr: 3.544 ± 0.525
2.954ArgVal: 2.954 ± 0.579
0.148ArgTrp: 0.148 ± 0.132
2.437ArgTyr: 2.437 ± 0.436
0.0ArgXaa: 0.0 ± 0.0
Ser
3.544SerAla: 3.544 ± 0.707
0.148SerCys: 0.148 ± 0.1
4.947SerAsp: 4.947 ± 0.754
4.43SerGlu: 4.43 ± 0.672
2.954SerPhe: 2.954 ± 0.643
4.135SerGly: 4.135 ± 0.538
1.255SerHis: 1.255 ± 0.313
4.209SerIle: 4.209 ± 0.854
6.129SerLys: 6.129 ± 0.627
4.947SerLeu: 4.947 ± 0.758
1.477SerMet: 1.477 ± 0.351
3.913SerAsn: 3.913 ± 0.544
0.886SerPro: 0.886 ± 0.213
2.215SerGln: 2.215 ± 0.52
3.249SerArg: 3.249 ± 0.543
3.175SerSer: 3.175 ± 0.46
3.101SerThr: 3.101 ± 0.386
4.061SerVal: 4.061 ± 0.458
0.665SerTrp: 0.665 ± 0.223
1.994SerTyr: 1.994 ± 0.488
0.0SerXaa: 0.0 ± 0.0
Thr
3.249ThrAla: 3.249 ± 0.417
0.295ThrCys: 0.295 ± 0.162
4.652ThrAsp: 4.652 ± 0.424
3.84ThrGlu: 3.84 ± 0.664
2.511ThrPhe: 2.511 ± 0.45
3.47ThrGly: 3.47 ± 0.559
0.96ThrHis: 0.96 ± 0.241
4.873ThrIle: 4.873 ± 0.641
5.538ThrLys: 5.538 ± 0.664
5.169ThrLeu: 5.169 ± 0.584
1.255ThrMet: 1.255 ± 0.277
3.397ThrAsn: 3.397 ± 0.445
1.698ThrPro: 1.698 ± 0.369
2.511ThrGln: 2.511 ± 0.54
3.544ThrArg: 3.544 ± 0.408
3.397ThrSer: 3.397 ± 0.488
4.209ThrThr: 4.209 ± 0.53
3.84ThrVal: 3.84 ± 0.56
0.517ThrTrp: 0.517 ± 0.231
3.101ThrTyr: 3.101 ± 0.675
0.0ThrXaa: 0.0 ± 0.0
Val
4.135ValAla: 4.135 ± 0.6
0.369ValCys: 0.369 ± 0.179
4.283ValAsp: 4.283 ± 0.663
4.578ValGlu: 4.578 ± 0.673
2.289ValPhe: 2.289 ± 0.419
3.101ValGly: 3.101 ± 0.602
0.665ValHis: 0.665 ± 0.223
4.504ValIle: 4.504 ± 0.595
4.726ValLys: 4.726 ± 0.668
4.356ValLeu: 4.356 ± 0.491
1.846ValMet: 1.846 ± 0.339
4.726ValAsn: 4.726 ± 0.918
0.665ValPro: 0.665 ± 0.243
1.477ValGln: 1.477 ± 0.358
3.249ValArg: 3.249 ± 0.464
4.356ValSer: 4.356 ± 0.403
4.43ValThr: 4.43 ± 0.596
5.316ValVal: 5.316 ± 0.717
0.665ValTrp: 0.665 ± 0.21
2.141ValTyr: 2.141 ± 0.406
0.074ValXaa: 0.074 ± 0.07
Trp
0.591TrpAla: 0.591 ± 0.202
0.148TrpCys: 0.148 ± 0.118
0.886TrpAsp: 0.886 ± 0.285
0.295TrpGlu: 0.295 ± 0.141
0.295TrpPhe: 0.295 ± 0.15
0.369TrpGly: 0.369 ± 0.188
0.222TrpHis: 0.222 ± 0.115
0.96TrpIle: 0.96 ± 0.21
0.517TrpLys: 0.517 ± 0.246
0.665TrpLeu: 0.665 ± 0.223
0.222TrpMet: 0.222 ± 0.126
1.034TrpAsn: 1.034 ± 0.351
0.148TrpPro: 0.148 ± 0.097
0.591TrpGln: 0.591 ± 0.201
0.517TrpArg: 0.517 ± 0.195
0.96TrpSer: 0.96 ± 0.264
0.96TrpThr: 0.96 ± 0.292
0.886TrpVal: 0.886 ± 0.327
0.074TrpTrp: 0.074 ± 0.071
0.222TrpTyr: 0.222 ± 0.13
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.92TyrAla: 1.92 ± 0.386
0.443TyrCys: 0.443 ± 0.173
3.175TyrAsp: 3.175 ± 0.516
2.437TyrGlu: 2.437 ± 0.416
1.846TyrPhe: 1.846 ± 0.383
2.067TyrGly: 2.067 ± 0.497
0.665TyrHis: 0.665 ± 0.204
3.692TyrIle: 3.692 ± 0.611
4.578TyrLys: 4.578 ± 0.712
2.511TyrLeu: 2.511 ± 0.435
1.329TyrMet: 1.329 ± 0.318
2.437TyrAsn: 2.437 ± 0.415
1.181TyrPro: 1.181 ± 0.305
2.141TyrGln: 2.141 ± 0.34
1.477TyrArg: 1.477 ± 0.304
2.215TyrSer: 2.215 ± 0.434
3.101TyrThr: 3.101 ± 0.65
2.584TyrVal: 2.584 ± 0.443
0.812TyrTrp: 0.812 ± 0.263
1.477TyrTyr: 1.477 ± 0.411
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.074XaaLeu: 0.074 ± 0.07
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (13544 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski