Amino acid dipeptide frequency for Streptococcus phage CHPC1083

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
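As an illustration of what such a calculation can look like, here is a minimal Python sketch. It assumes that overlapping adjacent pairs are counted within each protein (never across protein boundaries) and that counts are normalized by the total number of pairs. The exact normalization used by the database is not stated on this page, and the function name and toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and
    normalized by the total number of pairs over all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (one-letter codes), not taken from the real proteome.
toy = ["MKAAL", "AAK"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In the toy input there are six pairs in total, two of which are AA, so AA accounts for one third of all pairs.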

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
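The expected uniform frequency and the over/under classification can be sketched as follows. This is only an illustration: it assumes the threshold is the uniform expectation for 400 dipeptides, which may differ from the exact rule used to color the table. The `classify` helper is hypothetical; the example values are copied from the table.

```python
# Uniform expectation in per mille for 20 and 21 residue types.
expected_400 = 1000.0 / (20 ** 2)   # 2.5 per mille = 0.25%
expected_441 = 1000.0 / (21 ** 2)   # about 2.268 per mille


def classify(value_permille, expected=expected_400):
    """Label a frequency relative to the assumed uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# A few values taken from the table (per mille).
examples = {"AlaAla": 5.917, "AlaSer": 7.1, "AlaPro": 2.458, "CysCys": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
for name, label in labels.items():
    print(name, examples[name], label)
```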

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
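For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical and only illustrate the arithmetic.

```python
import math


def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


# AlaAla from the table: 5.917 per mille.
ala_ala_fraction = permille_to_fraction(5.917)
ala_ala_percent = permille_to_percent(5.917)
print(ala_ala_fraction, ala_ala_percent)
```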

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.917AlaAla: 5.917 ± 2.562
0.273AlaCys: 0.273 ± 0.16
5.553AlaAsp: 5.553 ± 1.173
3.823AlaGlu: 3.823 ± 0.697
2.64AlaPhe: 2.64 ± 1.311
5.553AlaGly: 5.553 ± 1.541
1.001AlaHis: 1.001 ± 0.302
5.826AlaIle: 5.826 ± 1.666
5.553AlaLys: 5.553 ± 0.719
6.008AlaLeu: 6.008 ± 1.243
2.367AlaMet: 2.367 ± 1.158
4.005AlaAsn: 4.005 ± 0.716
2.458AlaPro: 2.458 ± 0.545
2.913AlaGln: 2.913 ± 1.018
3.459AlaArg: 3.459 ± 0.676
7.1AlaSer: 7.1 ± 1.477
4.733AlaThr: 4.733 ± 0.873
4.46AlaVal: 4.46 ± 1.456
0.637AlaTrp: 0.637 ± 0.256
2.822AlaTyr: 2.822 ± 0.542
0.0AlaXaa: 0.0 ± 0.0
Cys
0.182CysAla: 0.182 ± 0.131
0.0CysCys: 0.0 ± 0.0
0.728CysAsp: 0.728 ± 0.296
0.455CysGlu: 0.455 ± 0.197
0.182CysPhe: 0.182 ± 0.136
0.364CysGly: 0.364 ± 0.239
0.091CysHis: 0.091 ± 0.089
0.273CysIle: 0.273 ± 0.136
0.364CysLys: 0.364 ± 0.18
0.273CysLeu: 0.273 ± 0.246
0.182CysMet: 0.182 ± 0.124
0.455CysAsn: 0.455 ± 0.257
0.182CysPro: 0.182 ± 0.133
0.091CysGln: 0.091 ± 0.086
0.182CysArg: 0.182 ± 0.12
0.455CysSer: 0.455 ± 0.203
0.0CysThr: 0.0 ± 0.0
0.273CysVal: 0.273 ± 0.155
0.091CysTrp: 0.091 ± 0.097
0.364CysTyr: 0.364 ± 0.183
0.0CysXaa: 0.0 ± 0.0
Asp
2.822AspAla: 2.822 ± 0.436
0.728AspCys: 0.728 ± 0.246
4.278AspAsp: 4.278 ± 0.572
4.278AspGlu: 4.278 ± 0.823
3.459AspPhe: 3.459 ± 0.619
5.644AspGly: 5.644 ± 1.167
0.364AspHis: 0.364 ± 0.236
3.459AspIle: 3.459 ± 0.753
4.369AspLys: 4.369 ± 0.819
4.824AspLeu: 4.824 ± 0.711
1.547AspMet: 1.547 ± 0.361
4.824AspAsn: 4.824 ± 0.874
0.91AspPro: 0.91 ± 0.297
1.456AspGln: 1.456 ± 0.401
2.913AspArg: 2.913 ± 0.636
3.823AspSer: 3.823 ± 0.529
3.823AspThr: 3.823 ± 0.718
3.732AspVal: 3.732 ± 0.593
1.001AspTrp: 1.001 ± 0.374
3.641AspTyr: 3.641 ± 0.805
0.0AspXaa: 0.0 ± 0.0
Glu
5.006GluAla: 5.006 ± 0.834
0.273GluCys: 0.273 ± 0.173
2.458GluAsp: 2.458 ± 0.569
3.641GluGlu: 3.641 ± 0.85
2.822GluPhe: 2.822 ± 0.53
3.459GluGly: 3.459 ± 0.467
0.819GluHis: 0.819 ± 0.273
4.733GluIle: 4.733 ± 0.772
5.279GluLys: 5.279 ± 1.042
7.464GluLeu: 7.464 ± 1.475
2.731GluMet: 2.731 ± 0.728
4.46GluAsn: 4.46 ± 0.623
2.094GluPro: 2.094 ± 0.653
2.458GluGln: 2.458 ± 0.427
3.641GluArg: 3.641 ± 0.789
2.458GluSer: 2.458 ± 0.629
3.55GluThr: 3.55 ± 0.678
5.461GluVal: 5.461 ± 0.874
1.092GluTrp: 1.092 ± 0.37
2.913GluTyr: 2.913 ± 0.85
0.0GluXaa: 0.0 ± 0.0
Phe
2.549PheAla: 2.549 ± 0.437
0.182PheCys: 0.182 ± 0.147
3.004PheAsp: 3.004 ± 0.648
3.55PheGlu: 3.55 ± 0.683
1.365PhePhe: 1.365 ± 0.352
3.368PheGly: 3.368 ± 0.814
0.455PheHis: 0.455 ± 0.178
2.913PheIle: 2.913 ± 0.529
4.824PheLys: 4.824 ± 0.675
2.367PheLeu: 2.367 ± 0.636
0.546PheMet: 0.546 ± 0.232
3.55PheAsn: 3.55 ± 0.578
0.637PhePro: 0.637 ± 0.373
1.274PheGln: 1.274 ± 0.327
1.274PheArg: 1.274 ± 0.272
3.823PheSer: 3.823 ± 0.827
2.458PheThr: 2.458 ± 0.605
2.003PheVal: 2.003 ± 0.384
0.637PheTrp: 0.637 ± 0.254
1.092PheTyr: 1.092 ± 0.286
0.0PheXaa: 0.0 ± 0.0
Gly
5.826GlyAla: 5.826 ± 1.022
0.364GlyCys: 0.364 ± 0.167
3.277GlyAsp: 3.277 ± 0.478
3.095GlyGlu: 3.095 ± 0.503
3.368GlyPhe: 3.368 ± 0.542
3.277GlyGly: 3.277 ± 0.627
0.637GlyHis: 0.637 ± 0.248
6.372GlyIle: 6.372 ± 2.053
6.19GlyLys: 6.19 ± 0.948
6.281GlyLeu: 6.281 ± 1.077
1.547GlyMet: 1.547 ± 0.796
4.005GlyAsn: 4.005 ± 0.701
0.728GlyPro: 0.728 ± 0.449
2.822GlyGln: 2.822 ± 0.462
3.186GlyArg: 3.186 ± 0.732
4.005GlySer: 4.005 ± 0.719
4.278GlyThr: 4.278 ± 1.006
4.733GlyVal: 4.733 ± 0.639
1.092GlyTrp: 1.092 ± 0.471
2.913GlyTyr: 2.913 ± 0.588
0.0GlyXaa: 0.0 ± 0.0
His
0.728HisAla: 0.728 ± 0.265
0.0HisCys: 0.0 ± 0.0
0.728HisAsp: 0.728 ± 0.303
0.637HisGlu: 0.637 ± 0.221
0.91HisPhe: 0.91 ± 0.312
0.728HisGly: 0.728 ± 0.311
0.546HisHis: 0.546 ± 0.259
1.274HisIle: 1.274 ± 0.331
0.546HisLys: 0.546 ± 0.211
1.274HisLeu: 1.274 ± 0.369
0.455HisMet: 0.455 ± 0.228
0.455HisAsn: 0.455 ± 0.232
0.637HisPro: 0.637 ± 0.268
0.182HisGln: 0.182 ± 0.155
0.728HisArg: 0.728 ± 0.244
1.001HisSer: 1.001 ± 0.339
0.728HisThr: 0.728 ± 0.275
1.001HisVal: 1.001 ± 0.377
0.273HisTrp: 0.273 ± 0.188
0.455HisTyr: 0.455 ± 0.186
0.0HisXaa: 0.0 ± 0.0
Ile
4.824IleAla: 4.824 ± 1.139
0.455IleCys: 0.455 ± 0.192
5.644IleAsp: 5.644 ± 0.654
3.823IleGlu: 3.823 ± 0.769
1.82IlePhe: 1.82 ± 0.406
5.735IleGly: 5.735 ± 1.237
1.092IleHis: 1.092 ± 0.316
3.914IleIle: 3.914 ± 0.79
5.279IleLys: 5.279 ± 0.723
3.095IleLeu: 3.095 ± 0.546
1.912IleMet: 1.912 ± 0.369
3.004IleAsn: 3.004 ± 0.608
2.913IlePro: 2.913 ± 0.663
2.822IleGln: 2.822 ± 0.65
3.186IleArg: 3.186 ± 0.704
6.372IleSer: 6.372 ± 1.856
4.824IleThr: 4.824 ± 0.743
4.005IleVal: 4.005 ± 0.754
0.637IleTrp: 0.637 ± 0.237
3.277IleTyr: 3.277 ± 0.692
0.0IleXaa: 0.0 ± 0.0
Lys
6.918LysAla: 6.918 ± 1.041
0.273LysCys: 0.273 ± 0.156
3.914LysAsp: 3.914 ± 0.588
7.282LysGlu: 7.282 ± 1.34
2.367LysPhe: 2.367 ± 0.493
5.553LysGly: 5.553 ± 0.534
1.092LysHis: 1.092 ± 0.383
5.006LysIle: 5.006 ± 0.698
6.736LysLys: 6.736 ± 1.322
6.554LysLeu: 6.554 ± 1.034
1.456LysMet: 1.456 ± 0.471
3.641LysAsn: 3.641 ± 0.681
3.186LysPro: 3.186 ± 0.631
2.822LysGln: 2.822 ± 0.638
4.733LysArg: 4.733 ± 0.807
4.46LysSer: 4.46 ± 0.542
5.644LysThr: 5.644 ± 0.817
3.914LysVal: 3.914 ± 0.756
1.092LysTrp: 1.092 ± 0.286
2.913LysTyr: 2.913 ± 0.645
0.0LysXaa: 0.0 ± 0.0
Leu
6.281LeuAla: 6.281 ± 1.169
0.364LeuCys: 0.364 ± 0.193
5.097LeuAsp: 5.097 ± 0.762
6.281LeuGlu: 6.281 ± 1.118
3.004LeuPhe: 3.004 ± 0.412
5.279LeuGly: 5.279 ± 0.925
0.455LeuHis: 0.455 ± 0.218
4.005LeuIle: 4.005 ± 0.552
6.008LeuLys: 6.008 ± 0.968
4.46LeuLeu: 4.46 ± 0.696
1.638LeuMet: 1.638 ± 0.443
5.826LeuAsn: 5.826 ± 0.734
2.458LeuPro: 2.458 ± 0.598
3.004LeuGln: 3.004 ± 0.525
3.55LeuArg: 3.55 ± 0.858
5.006LeuSer: 5.006 ± 0.749
6.099LeuThr: 6.099 ± 0.891
5.097LeuVal: 5.097 ± 0.63
0.364LeuTrp: 0.364 ± 0.29
2.367LeuTyr: 2.367 ± 0.467
0.0LeuXaa: 0.0 ± 0.0
Met
3.368MetAla: 3.368 ± 1.108
0.0MetCys: 0.0 ± 0.0
0.91MetAsp: 0.91 ± 0.238
1.183MetGlu: 1.183 ± 0.386
1.183MetPhe: 1.183 ± 0.321
1.092MetGly: 1.092 ± 0.379
0.182MetHis: 0.182 ± 0.157
0.91MetIle: 0.91 ± 0.434
2.185MetLys: 2.185 ± 0.516
1.274MetLeu: 1.274 ± 0.346
0.728MetMet: 0.728 ± 0.552
1.092MetAsn: 1.092 ± 0.344
0.546MetPro: 0.546 ± 0.221
1.638MetGln: 1.638 ± 0.513
0.819MetArg: 0.819 ± 0.258
1.912MetSer: 1.912 ± 0.55
1.092MetThr: 1.092 ± 0.313
2.367MetVal: 2.367 ± 0.504
0.0MetTrp: 0.0 ± 0.0
0.728MetTyr: 0.728 ± 0.278
0.0MetXaa: 0.0 ± 0.0
Asn
4.096AsnAla: 4.096 ± 0.493
0.364AsnCys: 0.364 ± 0.182
3.732AsnAsp: 3.732 ± 0.703
4.642AsnGlu: 4.642 ± 0.901
2.367AsnPhe: 2.367 ± 0.486
5.644AsnGly: 5.644 ± 1.102
1.183AsnHis: 1.183 ± 0.425
3.186AsnIle: 3.186 ± 0.573
4.369AsnLys: 4.369 ± 0.688
4.551AsnLeu: 4.551 ± 0.668
0.91AsnMet: 0.91 ± 0.279
3.55AsnAsn: 3.55 ± 0.727
2.458AsnPro: 2.458 ± 0.559
2.276AsnGln: 2.276 ± 0.413
1.912AsnArg: 1.912 ± 0.554
3.823AsnSer: 3.823 ± 0.763
3.277AsnThr: 3.277 ± 0.595
3.459AsnVal: 3.459 ± 0.523
1.183AsnTrp: 1.183 ± 0.328
2.276AsnTyr: 2.276 ± 0.556
0.0AsnXaa: 0.0 ± 0.0
Pro
1.274ProAla: 1.274 ± 0.311
0.182ProCys: 0.182 ± 0.18
1.82ProAsp: 1.82 ± 0.527
2.367ProGlu: 2.367 ± 0.581
1.183ProPhe: 1.183 ± 0.309
1.183ProGly: 1.183 ± 0.357
0.364ProHis: 0.364 ± 0.171
1.82ProIle: 1.82 ± 0.48
3.368ProLys: 3.368 ± 0.59
1.729ProLeu: 1.729 ± 0.447
0.182ProMet: 0.182 ± 0.131
2.367ProAsn: 2.367 ± 0.614
1.001ProPro: 1.001 ± 0.275
1.729ProGln: 1.729 ± 0.501
1.092ProArg: 1.092 ± 0.334
2.185ProSer: 2.185 ± 0.427
1.365ProThr: 1.365 ± 0.472
2.003ProVal: 2.003 ± 0.445
0.364ProTrp: 0.364 ± 0.173
0.91ProTyr: 0.91 ± 0.437
0.0ProXaa: 0.0 ± 0.0
Gln
4.369GlnAla: 4.369 ± 0.881
0.273GlnCys: 0.273 ± 0.178
2.276GlnAsp: 2.276 ± 0.441
2.64GlnGlu: 2.64 ± 0.625
2.094GlnPhe: 2.094 ± 0.583
2.731GlnGly: 2.731 ± 0.71
0.546GlnHis: 0.546 ± 0.224
2.185GlnIle: 2.185 ± 0.666
2.458GlnLys: 2.458 ± 0.547
4.005GlnLeu: 4.005 ± 0.506
1.183GlnMet: 1.183 ± 0.325
1.82GlnAsn: 1.82 ± 0.317
0.637GlnPro: 0.637 ± 0.28
1.547GlnGln: 1.547 ± 0.377
1.092GlnArg: 1.092 ± 0.384
3.277GlnSer: 3.277 ± 0.849
2.458GlnThr: 2.458 ± 0.468
2.367GlnVal: 2.367 ± 0.433
0.364GlnTrp: 0.364 ± 0.174
1.183GlnTyr: 1.183 ± 0.394
0.0GlnXaa: 0.0 ± 0.0
Arg
3.641ArgAla: 3.641 ± 0.538
0.364ArgCys: 0.364 ± 0.222
2.367ArgAsp: 2.367 ± 0.376
3.004ArgGlu: 3.004 ± 0.677
1.82ArgPhe: 1.82 ± 0.436
2.64ArgGly: 2.64 ± 0.449
0.637ArgHis: 0.637 ± 0.258
3.095ArgIle: 3.095 ± 0.643
3.368ArgLys: 3.368 ± 0.823
3.732ArgLeu: 3.732 ± 0.563
1.274ArgMet: 1.274 ± 0.356
2.094ArgAsn: 2.094 ± 0.539
0.637ArgPro: 0.637 ± 0.236
1.547ArgGln: 1.547 ± 0.453
1.456ArgArg: 1.456 ± 0.368
2.003ArgSer: 2.003 ± 0.376
2.185ArgThr: 2.185 ± 0.629
2.822ArgVal: 2.822 ± 0.644
0.546ArgTrp: 0.546 ± 0.26
2.367ArgTyr: 2.367 ± 0.52
0.0ArgXaa: 0.0 ± 0.0
Ser
7.191SerAla: 7.191 ± 3.425
0.455SerCys: 0.455 ± 0.249
4.642SerAsp: 4.642 ± 0.681
3.732SerGlu: 3.732 ± 0.875
2.913SerPhe: 2.913 ± 0.396
4.551SerGly: 4.551 ± 0.591
0.728SerHis: 0.728 ± 0.266
6.008SerIle: 6.008 ± 0.669
4.46SerLys: 4.46 ± 0.689
5.006SerLeu: 5.006 ± 0.948
1.456SerMet: 1.456 ± 0.368
3.55SerAsn: 3.55 ± 0.559
1.912SerPro: 1.912 ± 0.364
3.55SerGln: 3.55 ± 1.287
2.003SerArg: 2.003 ± 0.415
4.551SerSer: 4.551 ± 0.987
4.46SerThr: 4.46 ± 0.835
5.461SerVal: 5.461 ± 0.776
0.455SerTrp: 0.455 ± 0.237
1.638SerTyr: 1.638 ± 0.253
0.0SerXaa: 0.0 ± 0.0
Thr
4.551ThrAla: 4.551 ± 1.596
0.091ThrCys: 0.091 ± 0.086
3.186ThrAsp: 3.186 ± 0.658
4.005ThrGlu: 4.005 ± 0.723
3.732ThrPhe: 3.732 ± 0.657
3.368ThrGly: 3.368 ± 0.561
1.456ThrHis: 1.456 ± 0.382
5.188ThrIle: 5.188 ± 0.789
6.463ThrLys: 6.463 ± 0.811
5.553ThrLeu: 5.553 ± 0.665
1.183ThrMet: 1.183 ± 0.838
3.55ThrAsn: 3.55 ± 0.645
2.185ThrPro: 2.185 ± 0.521
3.095ThrGln: 3.095 ± 0.445
1.547ThrArg: 1.547 ± 0.381
3.277ThrSer: 3.277 ± 0.93
4.096ThrThr: 4.096 ± 0.551
5.188ThrVal: 5.188 ± 0.518
0.273ThrTrp: 0.273 ± 0.238
2.367ThrTyr: 2.367 ± 0.65
0.0ThrXaa: 0.0 ± 0.0
Val
4.187ValAla: 4.187 ± 1.138
0.182ValCys: 0.182 ± 0.109
4.642ValAsp: 4.642 ± 0.862
5.097ValGlu: 5.097 ± 0.818
2.549ValPhe: 2.549 ± 0.448
4.46ValGly: 4.46 ± 0.637
1.001ValHis: 1.001 ± 0.356
5.188ValIle: 5.188 ± 0.573
4.733ValLys: 4.733 ± 0.632
4.187ValLeu: 4.187 ± 0.607
1.092ValMet: 1.092 ± 0.342
4.278ValAsn: 4.278 ± 1.002
1.82ValPro: 1.82 ± 0.447
2.549ValGln: 2.549 ± 0.68
2.003ValArg: 2.003 ± 0.376
5.461ValSer: 5.461 ± 0.8
5.461ValThr: 5.461 ± 0.68
5.097ValVal: 5.097 ± 0.677
0.728ValTrp: 0.728 ± 0.291
1.638ValTyr: 1.638 ± 0.456
0.0ValXaa: 0.0 ± 0.0
Trp
0.455TrpAla: 0.455 ± 0.17
0.091TrpCys: 0.091 ± 0.089
0.819TrpAsp: 0.819 ± 0.338
1.274TrpGlu: 1.274 ± 0.32
0.546TrpPhe: 0.546 ± 0.24
0.91TrpGly: 0.91 ± 0.311
0.273TrpHis: 0.273 ± 0.143
0.455TrpIle: 0.455 ± 0.194
0.637TrpLys: 0.637 ± 0.199
1.092TrpLeu: 1.092 ± 0.338
0.182TrpMet: 0.182 ± 0.128
0.455TrpAsn: 0.455 ± 0.223
0.091TrpPro: 0.091 ± 0.101
0.364TrpGln: 0.364 ± 0.174
0.455TrpArg: 0.455 ± 0.201
1.274TrpSer: 1.274 ± 0.643
0.819TrpThr: 0.819 ± 0.284
0.91TrpVal: 0.91 ± 0.212
0.273TrpTrp: 0.273 ± 0.19
0.182TrpTyr: 0.182 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.913TyrAla: 2.913 ± 0.454
0.273TyrCys: 0.273 ± 0.146
2.822TyrAsp: 2.822 ± 0.821
2.003TyrGlu: 2.003 ± 0.353
1.547TyrPhe: 1.547 ± 0.408
2.458TyrGly: 2.458 ± 0.495
0.364TyrHis: 0.364 ± 0.204
2.822TyrIle: 2.822 ± 0.731
2.367TyrLys: 2.367 ± 0.438
2.913TyrLeu: 2.913 ± 0.632
0.546TyrMet: 0.546 ± 0.219
2.276TyrAsn: 2.276 ± 0.647
1.001TyrPro: 1.001 ± 0.32
1.365TyrGln: 1.365 ± 0.327
2.458TyrArg: 2.458 ± 0.762
2.458TyrSer: 2.458 ± 0.42
3.004TyrThr: 3.004 ± 0.97
2.003TyrVal: 2.003 ± 0.445
0.455TyrTrp: 0.455 ± 0.2
1.912TyrTyr: 1.912 ± 0.653
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (10987 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
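A protein-level bootstrap of this kind can be sketched as below. This is a hedged illustration, not the database's actual code: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation over replicates. Function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed for reproducibility
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (one-letter codes), not taken from the real proteome.
toy = ["MKAAL", "AAK", "MKKLV", "GAAS"]
err = bootstrap_error(toy, "AA")
print(f"AA: {pair_permille(toy, 'AA'):.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps adjacent pairs intact, which is presumably why the error is estimated at the protein level.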

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski