Amino acid dipepetide frequency for Staphylococcus phage phiMR25

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.47AlaAla: 1.47 ± 0.437
0.221AlaCys: 0.221 ± 0.118
2.794AlaAsp: 2.794 ± 0.453
3.676AlaGlu: 3.676 ± 0.515
2.5AlaPhe: 2.5 ± 0.419
3.088AlaGly: 3.088 ± 0.377
1.47AlaHis: 1.47 ± 0.314
5.22AlaIle: 5.22 ± 1.003
5.147AlaLys: 5.147 ± 0.678
3.897AlaLeu: 3.897 ± 0.61
1.838AlaMet: 1.838 ± 0.419
3.823AlaAsn: 3.823 ± 0.558
1.838AlaPro: 1.838 ± 0.389
2.132AlaGln: 2.132 ± 0.402
2.867AlaArg: 2.867 ± 0.438
3.309AlaSer: 3.309 ± 0.64
4.485AlaThr: 4.485 ± 0.708
3.603AlaVal: 3.603 ± 0.608
1.25AlaTrp: 1.25 ± 0.339
2.279AlaTyr: 2.279 ± 0.389
0.0AlaXaa: 0.0 ± 0.0
Cys
0.147CysAla: 0.147 ± 0.112
0.074CysCys: 0.074 ± 0.07
0.294CysAsp: 0.294 ± 0.146
0.588CysGlu: 0.588 ± 0.213
0.294CysPhe: 0.294 ± 0.138
0.147CysGly: 0.147 ± 0.092
0.0CysHis: 0.0 ± 0.0
0.147CysIle: 0.147 ± 0.108
0.515CysLys: 0.515 ± 0.179
0.221CysLeu: 0.221 ± 0.115
0.221CysMet: 0.221 ± 0.2
0.368CysAsn: 0.368 ± 0.161
0.294CysPro: 0.294 ± 0.177
0.294CysGln: 0.294 ± 0.128
0.294CysArg: 0.294 ± 0.137
0.588CysSer: 0.588 ± 0.212
0.441CysThr: 0.441 ± 0.174
0.147CysVal: 0.147 ± 0.097
0.147CysTrp: 0.147 ± 0.11
0.368CysTyr: 0.368 ± 0.156
0.0CysXaa: 0.0 ± 0.0
Asp
3.382AspAla: 3.382 ± 0.474
0.294AspCys: 0.294 ± 0.169
4.411AspAsp: 4.411 ± 0.955
5.514AspGlu: 5.514 ± 1.002
3.676AspPhe: 3.676 ± 0.502
3.823AspGly: 3.823 ± 0.479
0.147AspHis: 0.147 ± 0.1
4.264AspIle: 4.264 ± 0.529
5.073AspLys: 5.073 ± 0.636
5.22AspLeu: 5.22 ± 0.681
1.912AspMet: 1.912 ± 0.346
3.75AspAsn: 3.75 ± 0.401
1.47AspPro: 1.47 ± 0.32
1.25AspGln: 1.25 ± 0.298
2.5AspArg: 2.5 ± 0.44
3.75AspSer: 3.75 ± 0.524
3.603AspThr: 3.603 ± 0.665
4.558AspVal: 4.558 ± 0.594
0.515AspTrp: 0.515 ± 0.271
3.088AspTyr: 3.088 ± 0.59
0.0AspXaa: 0.0 ± 0.0
Glu
5.367GluAla: 5.367 ± 0.675
0.662GluCys: 0.662 ± 0.25
3.603GluAsp: 3.603 ± 0.558
6.029GluGlu: 6.029 ± 0.769
3.456GluPhe: 3.456 ± 0.494
2.867GluGly: 2.867 ± 0.419
1.544GluHis: 1.544 ± 0.375
5.22GluIle: 5.22 ± 0.732
5.0GluLys: 5.0 ± 0.624
6.985GluLeu: 6.985 ± 0.91
2.279GluMet: 2.279 ± 0.478
5.441GluAsn: 5.441 ± 0.855
1.691GluPro: 1.691 ± 0.342
4.044GluGln: 4.044 ± 0.539
3.823GluArg: 3.823 ± 0.603
4.044GluSer: 4.044 ± 0.535
3.088GluThr: 3.088 ± 0.433
5.073GluVal: 5.073 ± 0.756
1.029GluTrp: 1.029 ± 0.257
5.147GluTyr: 5.147 ± 0.767
0.0GluXaa: 0.0 ± 0.0
Phe
1.912PheAla: 1.912 ± 0.343
0.294PheCys: 0.294 ± 0.13
3.529PheAsp: 3.529 ± 0.547
3.823PheGlu: 3.823 ± 0.482
1.397PhePhe: 1.397 ± 0.284
2.794PheGly: 2.794 ± 0.403
0.515PheHis: 0.515 ± 0.237
3.603PheIle: 3.603 ± 0.573
4.779PheLys: 4.779 ± 0.516
2.573PheLeu: 2.573 ± 0.35
1.029PheMet: 1.029 ± 0.306
3.456PheAsn: 3.456 ± 0.482
0.809PhePro: 0.809 ± 0.266
1.25PheGln: 1.25 ± 0.361
1.618PheArg: 1.618 ± 0.303
3.235PheSer: 3.235 ± 0.45
3.162PheThr: 3.162 ± 0.402
2.72PheVal: 2.72 ± 0.52
0.221PheTrp: 0.221 ± 0.119
1.544PheTyr: 1.544 ± 0.294
0.0PheXaa: 0.0 ± 0.0
Gly
2.72GlyAla: 2.72 ± 0.517
0.294GlyCys: 0.294 ± 0.122
3.382GlyAsp: 3.382 ± 0.446
3.235GlyGlu: 3.235 ± 0.52
3.088GlyPhe: 3.088 ± 0.387
3.088GlyGly: 3.088 ± 0.568
1.103GlyHis: 1.103 ± 0.4
4.117GlyIle: 4.117 ± 0.534
4.338GlyLys: 4.338 ± 0.481
5.808GlyLeu: 5.808 ± 0.673
1.47GlyMet: 1.47 ± 0.296
3.235GlyAsn: 3.235 ± 0.395
0.588GlyPro: 0.588 ± 0.274
1.765GlyGln: 1.765 ± 0.309
1.985GlyArg: 1.985 ± 0.403
2.941GlySer: 2.941 ± 0.48
3.603GlyThr: 3.603 ± 0.514
4.411GlyVal: 4.411 ± 0.57
1.176GlyTrp: 1.176 ± 0.404
2.867GlyTyr: 2.867 ± 0.514
0.0GlyXaa: 0.0 ± 0.0
His
1.397HisAla: 1.397 ± 0.327
0.074HisCys: 0.074 ± 0.075
1.029HisAsp: 1.029 ± 0.25
1.176HisGlu: 1.176 ± 0.314
0.882HisPhe: 0.882 ± 0.245
1.25HisGly: 1.25 ± 0.273
0.662HisHis: 0.662 ± 0.208
0.956HisIle: 0.956 ± 0.26
0.882HisLys: 0.882 ± 0.294
0.809HisLeu: 0.809 ± 0.228
0.294HisMet: 0.294 ± 0.188
1.029HisAsn: 1.029 ± 0.292
0.588HisPro: 0.588 ± 0.166
0.588HisGln: 0.588 ± 0.217
0.368HisArg: 0.368 ± 0.173
0.882HisSer: 0.882 ± 0.216
0.882HisThr: 0.882 ± 0.262
1.029HisVal: 1.029 ± 0.255
0.074HisTrp: 0.074 ± 0.081
0.735HisTyr: 0.735 ± 0.331
0.0HisXaa: 0.0 ± 0.0
Ile
4.779IleAla: 4.779 ± 0.745
0.147IleCys: 0.147 ± 0.113
5.147IleAsp: 5.147 ± 0.623
6.176IleGlu: 6.176 ± 0.722
2.867IlePhe: 2.867 ± 0.505
3.897IleGly: 3.897 ± 0.479
1.029IleHis: 1.029 ± 0.19
4.191IleIle: 4.191 ± 0.515
6.47IleLys: 6.47 ± 0.7
4.853IleLeu: 4.853 ± 0.615
2.059IleMet: 2.059 ± 0.39
4.853IleAsn: 4.853 ± 0.562
2.647IlePro: 2.647 ± 0.423
2.206IleGln: 2.206 ± 0.478
3.309IleArg: 3.309 ± 0.592
4.117IleSer: 4.117 ± 0.55
5.441IleThr: 5.441 ± 0.768
4.117IleVal: 4.117 ± 0.599
1.176IleTrp: 1.176 ± 0.597
3.162IleTyr: 3.162 ± 0.468
0.0IleXaa: 0.0 ± 0.0
Lys
5.808LysAla: 5.808 ± 0.72
0.294LysCys: 0.294 ± 0.153
5.514LysAsp: 5.514 ± 0.696
8.235LysGlu: 8.235 ± 1.113
3.456LysPhe: 3.456 ± 0.452
5.294LysGly: 5.294 ± 0.533
1.176LysHis: 1.176 ± 0.333
6.764LysIle: 6.764 ± 0.731
9.338LysLys: 9.338 ± 0.735
6.47LysLeu: 6.47 ± 0.678
2.279LysMet: 2.279 ± 0.424
5.441LysAsn: 5.441 ± 0.606
2.794LysPro: 2.794 ± 0.505
4.632LysGln: 4.632 ± 0.528
3.823LysArg: 3.823 ± 0.495
5.0LysSer: 5.0 ± 0.552
4.853LysThr: 4.853 ± 0.719
5.808LysVal: 5.808 ± 0.62
0.735LysTrp: 0.735 ± 0.198
4.264LysTyr: 4.264 ± 0.695
0.0LysXaa: 0.0 ± 0.0
Leu
3.75LeuAla: 3.75 ± 0.643
0.662LeuCys: 0.662 ± 0.27
4.338LeuAsp: 4.338 ± 0.517
5.735LeuGlu: 5.735 ± 0.71
3.897LeuPhe: 3.897 ± 0.563
3.823LeuGly: 3.823 ± 0.516
0.882LeuHis: 0.882 ± 0.307
4.117LeuIle: 4.117 ± 0.451
7.352LeuLys: 7.352 ± 0.657
5.882LeuLeu: 5.882 ± 0.724
1.544LeuMet: 1.544 ± 0.301
4.779LeuAsn: 4.779 ± 0.494
2.72LeuPro: 2.72 ± 0.408
3.603LeuGln: 3.603 ± 0.632
2.941LeuArg: 2.941 ± 0.46
5.514LeuSer: 5.514 ± 0.564
5.367LeuThr: 5.367 ± 0.573
4.338LeuVal: 4.338 ± 0.793
0.735LeuTrp: 0.735 ± 0.247
3.456LeuTyr: 3.456 ± 0.54
0.0LeuXaa: 0.0 ± 0.0
Met
1.103MetAla: 1.103 ± 0.264
0.0MetCys: 0.0 ± 0.0
1.103MetAsp: 1.103 ± 0.238
1.618MetGlu: 1.618 ± 0.344
0.662MetPhe: 0.662 ± 0.231
0.956MetGly: 0.956 ± 0.287
0.221MetHis: 0.221 ± 0.107
1.397MetIle: 1.397 ± 0.308
2.206MetLys: 2.206 ± 0.378
2.867MetLeu: 2.867 ± 0.4
0.515MetMet: 0.515 ± 0.173
1.691MetAsn: 1.691 ± 0.372
1.176MetPro: 1.176 ± 0.307
1.323MetGln: 1.323 ± 0.349
1.176MetArg: 1.176 ± 0.337
1.765MetSer: 1.765 ± 0.461
2.72MetThr: 2.72 ± 0.582
1.029MetVal: 1.029 ± 0.261
0.441MetTrp: 0.441 ± 0.156
1.323MetTyr: 1.323 ± 0.311
0.0MetXaa: 0.0 ± 0.0
Asn
4.853AsnAla: 4.853 ± 0.785
0.294AsnCys: 0.294 ± 0.159
4.117AsnAsp: 4.117 ± 0.708
4.706AsnGlu: 4.706 ± 0.55
2.72AsnPhe: 2.72 ± 0.559
4.779AsnGly: 4.779 ± 0.644
0.735AsnHis: 0.735 ± 0.237
4.191AsnIle: 4.191 ± 0.54
7.058AsnLys: 7.058 ± 0.644
4.264AsnLeu: 4.264 ± 0.635
1.618AsnMet: 1.618 ± 0.287
4.853AsnAsn: 4.853 ± 0.801
2.573AsnPro: 2.573 ± 0.472
2.426AsnGln: 2.426 ± 0.364
2.5AsnArg: 2.5 ± 0.449
2.867AsnSer: 2.867 ± 0.519
4.411AsnThr: 4.411 ± 0.452
4.264AsnVal: 4.264 ± 0.562
0.809AsnTrp: 0.809 ± 0.229
2.941AsnTyr: 2.941 ± 0.519
0.0AsnXaa: 0.0 ± 0.0
Pro
0.956ProAla: 0.956 ± 0.285
0.221ProCys: 0.221 ± 0.121
1.397ProAsp: 1.397 ± 0.23
2.132ProGlu: 2.132 ± 0.295
1.47ProPhe: 1.47 ± 0.335
1.838ProGly: 1.838 ± 0.44
0.441ProHis: 0.441 ± 0.163
2.132ProIle: 2.132 ± 0.343
3.162ProLys: 3.162 ± 0.624
1.765ProLeu: 1.765 ± 0.347
0.882ProMet: 0.882 ± 0.252
2.647ProAsn: 2.647 ± 0.427
0.441ProPro: 0.441 ± 0.236
1.103ProGln: 1.103 ± 0.299
1.176ProArg: 1.176 ± 0.278
1.103ProSer: 1.103 ± 0.251
2.353ProThr: 2.353 ± 0.367
1.838ProVal: 1.838 ± 0.416
0.147ProTrp: 0.147 ± 0.11
1.397ProTyr: 1.397 ± 0.358
0.0ProXaa: 0.0 ± 0.0
Gln
2.5GlnAla: 2.5 ± 0.488
0.441GlnCys: 0.441 ± 0.181
2.132GlnAsp: 2.132 ± 0.39
2.573GlnGlu: 2.573 ± 0.406
1.47GlnPhe: 1.47 ± 0.335
1.912GlnGly: 1.912 ± 0.328
0.662GlnHis: 0.662 ± 0.219
2.647GlnIle: 2.647 ± 0.369
3.088GlnLys: 3.088 ± 0.524
3.676GlnLeu: 3.676 ± 0.623
1.103GlnMet: 1.103 ± 0.288
1.765GlnAsn: 1.765 ± 0.347
1.912GlnPro: 1.912 ± 0.417
1.838GlnGln: 1.838 ± 0.514
1.838GlnArg: 1.838 ± 0.423
2.206GlnSer: 2.206 ± 0.355
2.279GlnThr: 2.279 ± 0.486
2.206GlnVal: 2.206 ± 0.377
0.515GlnTrp: 0.515 ± 0.173
1.323GlnTyr: 1.323 ± 0.331
0.0GlnXaa: 0.0 ± 0.0
Arg
1.176ArgAla: 1.176 ± 0.266
0.368ArgCys: 0.368 ± 0.152
2.279ArgAsp: 2.279 ± 0.464
3.456ArgGlu: 3.456 ± 0.486
2.059ArgPhe: 2.059 ± 0.404
2.426ArgGly: 2.426 ± 0.473
1.25ArgHis: 1.25 ± 0.295
3.676ArgIle: 3.676 ± 0.493
4.264ArgLys: 4.264 ± 0.552
3.382ArgLeu: 3.382 ± 0.409
0.735ArgMet: 0.735 ± 0.208
2.941ArgAsn: 2.941 ± 0.494
0.882ArgPro: 0.882 ± 0.222
1.544ArgGln: 1.544 ± 0.301
1.912ArgArg: 1.912 ± 0.438
2.279ArgSer: 2.279 ± 0.322
1.691ArgThr: 1.691 ± 0.348
2.5ArgVal: 2.5 ± 0.482
0.588ArgTrp: 0.588 ± 0.207
2.132ArgTyr: 2.132 ± 0.524
0.0ArgXaa: 0.0 ± 0.0
Ser
4.191SerAla: 4.191 ± 0.645
0.294SerCys: 0.294 ± 0.19
4.706SerAsp: 4.706 ± 0.653
3.382SerGlu: 3.382 ± 0.474
2.279SerPhe: 2.279 ± 0.426
3.088SerGly: 3.088 ± 0.56
0.956SerHis: 0.956 ± 0.367
5.367SerIle: 5.367 ± 0.602
5.147SerLys: 5.147 ± 0.65
3.456SerLeu: 3.456 ± 0.489
1.397SerMet: 1.397 ± 0.352
4.632SerAsn: 4.632 ± 0.513
1.029SerPro: 1.029 ± 0.302
1.838SerGln: 1.838 ± 0.343
3.014SerArg: 3.014 ± 0.315
3.088SerSer: 3.088 ± 0.476
3.897SerThr: 3.897 ± 0.473
3.235SerVal: 3.235 ± 0.495
0.221SerTrp: 0.221 ± 0.117
2.353SerTyr: 2.353 ± 0.38
0.0SerXaa: 0.0 ± 0.0
Thr
4.558ThrAla: 4.558 ± 0.637
0.147ThrCys: 0.147 ± 0.092
3.75ThrAsp: 3.75 ± 0.642
4.558ThrGlu: 4.558 ± 0.614
3.088ThrPhe: 3.088 ± 0.529
3.897ThrGly: 3.897 ± 0.544
0.735ThrHis: 0.735 ± 0.197
5.661ThrIle: 5.661 ± 0.964
5.808ThrLys: 5.808 ± 0.723
4.558ThrLeu: 4.558 ± 0.549
0.882ThrMet: 0.882 ± 0.273
3.823ThrAsn: 3.823 ± 0.667
1.765ThrPro: 1.765 ± 0.325
2.72ThrGln: 2.72 ± 0.501
2.353ThrArg: 2.353 ± 0.328
3.823ThrSer: 3.823 ± 0.622
4.264ThrThr: 4.264 ± 0.718
4.558ThrVal: 4.558 ± 0.661
0.662ThrTrp: 0.662 ± 0.243
3.014ThrTyr: 3.014 ± 0.457
0.0ThrXaa: 0.0 ± 0.0
Val
3.529ValAla: 3.529 ± 0.855
0.515ValCys: 0.515 ± 0.183
5.514ValAsp: 5.514 ± 0.722
5.294ValGlu: 5.294 ± 0.685
2.353ValPhe: 2.353 ± 0.45
2.426ValGly: 2.426 ± 0.501
0.735ValHis: 0.735 ± 0.234
4.926ValIle: 4.926 ± 0.585
6.102ValLys: 6.102 ± 0.581
4.558ValLeu: 4.558 ± 0.643
1.618ValMet: 1.618 ± 0.353
4.338ValAsn: 4.338 ± 0.578
1.985ValPro: 1.985 ± 0.441
1.323ValGln: 1.323 ± 0.305
2.206ValArg: 2.206 ± 0.399
3.603ValSer: 3.603 ± 0.514
4.264ValThr: 4.264 ± 0.427
3.897ValVal: 3.897 ± 0.661
1.029ValTrp: 1.029 ± 0.316
2.5ValTyr: 2.5 ± 0.565
0.0ValXaa: 0.0 ± 0.0
Trp
1.323TrpAla: 1.323 ± 0.427
0.147TrpCys: 0.147 ± 0.104
0.368TrpAsp: 0.368 ± 0.167
0.662TrpGlu: 0.662 ± 0.246
0.441TrpPhe: 0.441 ± 0.2
0.662TrpGly: 0.662 ± 0.355
0.221TrpHis: 0.221 ± 0.138
0.735TrpIle: 0.735 ± 0.228
0.882TrpLys: 0.882 ± 0.266
0.956TrpLeu: 0.956 ± 0.33
0.221TrpMet: 0.221 ± 0.145
1.544TrpAsn: 1.544 ± 0.856
0.147TrpPro: 0.147 ± 0.113
0.515TrpGln: 0.515 ± 0.229
0.221TrpArg: 0.221 ± 0.125
0.882TrpSer: 0.882 ± 0.265
0.956TrpThr: 0.956 ± 0.274
0.662TrpVal: 0.662 ± 0.267
0.0TrpTrp: 0.0 ± 0.0
0.735TrpTyr: 0.735 ± 0.257
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.985TyrAla: 1.985 ± 0.312
0.147TyrCys: 0.147 ± 0.111
2.794TyrAsp: 2.794 ± 0.496
3.823TyrGlu: 3.823 ± 0.591
2.206TyrPhe: 2.206 ± 0.46
3.162TyrGly: 3.162 ± 0.627
1.103TyrHis: 1.103 ± 0.306
3.382TyrIle: 3.382 ± 0.476
5.367TyrLys: 5.367 ± 0.703
3.235TyrLeu: 3.235 ± 0.485
1.323TyrMet: 1.323 ± 0.339
2.573TyrAsn: 2.573 ± 0.369
1.25TyrPro: 1.25 ± 0.321
1.691TyrGln: 1.691 ± 0.347
1.618TyrArg: 1.618 ± 0.42
2.573TyrSer: 2.573 ± 0.451
2.867TyrThr: 2.867 ± 0.366
2.72TyrVal: 2.72 ± 0.496
0.809TyrTrp: 0.809 ± 0.261
1.47TyrTyr: 1.47 ± 0.39
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13602 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski