Amino acid dipeptide frequency for Streptococcus phage CHPC1073

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
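A minimal sketch of how such frequencies could be computed, not the database's actual code. It assumes overlapping pairs are counted within each protein (never across protein boundaries), that any non-standard letter is mapped to X (Xaa), and that the denominator is the total number of counted dipeptides. The function name `dipeptide_frequencies` is hypothetical.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    Assumptions (not confirmed by the source): non-standard letters
    become 'X', pairs never span two proteins, and the denominator is
    the total number of dipeptides counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard residue to the placeholder 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Slide a window of width 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: the pairs are MA, AA, AK, AA, AG, so AA is 2 of 5.
freqs = dipeptide_frequencies(["MAAK", "AAG"])
print(freqs["AA"])
```

The keys use one-letter codes (AA for AlaAla); translating to the three-letter labels used in the table is left out for brevity.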

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
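The arithmetic behind the expected value, and the resulting over/under classification, can be sketched as follows. This assumes the colouring threshold is exactly the uniform expectation (1/400, or 1/441 with Xaa); the page does not state the rule it actually applies, and the function names are hypothetical.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, expected=None):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: 'over' corresponds to red and 'under' to blue.
    """
    if expected is None:
        expected = uniform_expectation_per_mille()
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


print(uniform_expectation_per_mille())    # 20 x 20 = 400 dipeptides
print(uniform_expectation_per_mille(21))  # 21 x 21 = 441 with Xaa
print(classify(6.392))                    # AlaLys value from the table
print(classify(0.188))                    # AlaCys value from the table
```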

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
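As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction and then into an approximate raw count. The denominator of 10640 is an assumption taken from the amino acid total reported under the table; the page does not say which denominator was used, although the listed values do look like multiples of roughly 0.094‰, which is consistent with 1/10640.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def approx_count(value_per_mille, total=10640):
    """Estimate the raw count behind a per-mille value.

    Assumption: `total` is the denominator used by the database.
    """
    return round(per_mille_to_fraction(value_per_mille) * total)


print(per_mille_to_fraction(3.008))  # AlaAla as a fraction
print(approx_count(3.008))           # approximate AlaAla occurrences
print(approx_count(0.188))           # approximate AlaCys occurrences
```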

For more information see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.008 ± 1.027
AlaCys: 0.188 ± 0.141
AlaAsp: 4.23 ± 0.724
AlaGlu: 3.102 ± 0.552
AlaPhe: 2.256 ± 0.575
AlaGly: 3.948 ± 0.763
AlaHis: 0.752 ± 0.328
AlaIle: 4.794 ± 0.855
AlaLys: 6.392 ± 1.075
AlaLeu: 5.922 ± 0.737
AlaMet: 1.316 ± 0.427
AlaAsn: 4.888 ± 0.914
AlaPro: 1.88 ± 0.392
AlaGln: 2.538 ± 0.484
AlaArg: 2.538 ± 0.542
AlaSer: 4.794 ± 0.708
AlaThr: 4.794 ± 0.767
AlaVal: 3.854 ± 0.52
AlaTrp: 0.846 ± 0.248
AlaTyr: 2.35 ± 0.425
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.188 ± 0.123
CysCys: 0.0 ± 0.0
CysAsp: 0.846 ± 0.317
CysGlu: 0.376 ± 0.169
CysPhe: 0.564 ± 0.346
CysGly: 0.188 ± 0.151
CysHis: 0.188 ± 0.131
CysIle: 0.282 ± 0.242
CysLys: 0.376 ± 0.204
CysLeu: 0.376 ± 0.214
CysMet: 0.094 ± 0.104
CysAsn: 0.282 ± 0.151
CysPro: 0.282 ± 0.176
CysGln: 0.188 ± 0.148
CysArg: 0.376 ± 0.237
CysSer: 0.564 ± 0.246
CysThr: 0.376 ± 0.176
CysVal: 0.282 ± 0.134
CysTrp: 0.188 ± 0.134
CysTyr: 0.188 ± 0.124
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.854 ± 0.677
AspCys: 0.376 ± 0.223
AspAsp: 4.418 ± 0.575
AspGlu: 4.606 ± 0.679
AspPhe: 3.478 ± 0.578
AspGly: 5.076 ± 0.71
AspHis: 0.94 ± 0.324
AspIle: 5.264 ± 0.817
AspLys: 4.7 ± 0.671
AspLeu: 3.854 ± 0.769
AspMet: 2.444 ± 0.531
AspAsn: 4.324 ± 0.914
AspPro: 2.162 ± 0.446
AspGln: 1.692 ± 0.391
AspArg: 3.008 ± 0.509
AspSer: 3.384 ± 0.548
AspThr: 4.512 ± 0.653
AspVal: 3.572 ± 0.586
AspTrp: 0.846 ± 0.292
AspTyr: 2.914 ± 0.565
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.23 ± 0.615
GluCys: 0.376 ± 0.161
GluAsp: 3.666 ± 0.712
GluGlu: 4.23 ± 0.797
GluPhe: 2.538 ± 0.466
GluGly: 3.196 ± 0.446
GluHis: 1.222 ± 0.384
GluIle: 5.828 ± 0.754
GluLys: 4.136 ± 0.959
GluLeu: 6.674 ± 0.925
GluMet: 2.35 ± 0.496
GluAsn: 4.324 ± 0.753
GluPro: 1.974 ± 0.573
GluGln: 2.82 ± 0.436
GluArg: 3.29 ± 0.633
GluSer: 3.102 ± 0.423
GluThr: 3.196 ± 0.543
GluVal: 4.418 ± 0.658
GluTrp: 1.222 ± 0.268
GluTyr: 3.384 ± 0.571
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.29 ± 0.542
PheCys: 0.376 ± 0.205
PheAsp: 3.478 ± 0.617
PheGlu: 1.88 ± 0.489
PhePhe: 1.974 ± 0.554
PheGly: 3.384 ± 0.629
PheHis: 0.47 ± 0.191
PheIle: 2.914 ± 0.677
PheLys: 4.042 ± 0.582
PheLeu: 3.478 ± 0.654
PheMet: 0.47 ± 0.211
PheAsn: 3.76 ± 0.657
PhePro: 0.658 ± 0.247
PheGln: 1.316 ± 0.307
PheArg: 1.504 ± 0.392
PheSer: 2.538 ± 0.464
PheThr: 2.632 ± 0.453
PheVal: 2.444 ± 0.361
PheTrp: 0.658 ± 0.247
PheTyr: 1.786 ± 0.41
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.914 ± 0.599
GlyCys: 0.376 ± 0.18
GlyAsp: 3.76 ± 0.443
GlyGlu: 3.854 ± 0.646
GlyPhe: 3.102 ± 0.468
GlyGly: 4.136 ± 0.869
GlyHis: 0.752 ± 0.291
GlyIle: 4.794 ± 0.797
GlyLys: 6.768 ± 0.878
GlyLeu: 6.392 ± 0.779
GlyMet: 1.316 ± 0.313
GlyAsn: 4.042 ± 0.679
GlyPro: 0.752 ± 0.325
GlyGln: 3.102 ± 0.593
GlyArg: 2.82 ± 0.479
GlySer: 4.418 ± 0.934
GlyThr: 4.136 ± 0.715
GlyVal: 4.042 ± 0.686
GlyTrp: 1.222 ± 0.392
GlyTyr: 3.102 ± 0.512
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.376 ± 0.155
HisCys: 0.188 ± 0.148
HisAsp: 0.94 ± 0.352
HisGlu: 0.564 ± 0.228
HisPhe: 0.564 ± 0.223
HisGly: 0.752 ± 0.271
HisHis: 0.47 ± 0.187
HisIle: 1.222 ± 0.357
HisLys: 1.128 ± 0.352
HisLeu: 1.316 ± 0.279
HisMet: 0.658 ± 0.319
HisAsn: 0.47 ± 0.198
HisPro: 0.658 ± 0.177
HisGln: 0.564 ± 0.225
HisArg: 0.846 ± 0.265
HisSer: 0.94 ± 0.215
HisThr: 0.564 ± 0.202
HisVal: 1.41 ± 0.312
HisTrp: 0.094 ± 0.097
HisTyr: 0.94 ± 0.346
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.7 ± 0.883
IleCys: 0.47 ± 0.236
IleAsp: 5.358 ± 0.787
IleGlu: 5.076 ± 0.714
IlePhe: 1.692 ± 0.519
IleGly: 4.512 ± 0.691
IleHis: 0.94 ± 0.291
IleIle: 3.854 ± 0.687
IleLys: 6.862 ± 0.725
IleLeu: 4.7 ± 0.681
IleMet: 1.88 ± 0.42
IleAsn: 3.666 ± 0.541
IlePro: 3.196 ± 0.618
IleGln: 2.632 ± 0.418
IleArg: 2.914 ± 0.559
IleSer: 4.512 ± 0.56
IleThr: 3.76 ± 0.606
IleVal: 3.384 ± 0.642
IleTrp: 0.752 ± 0.189
IleTyr: 2.35 ± 0.586
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.922 ± 0.636
LysCys: 0.47 ± 0.285
LysAsp: 4.888 ± 0.817
LysGlu: 6.768 ± 0.932
LysPhe: 3.478 ± 0.691
LysGly: 5.734 ± 0.742
LysHis: 1.222 ± 0.439
LysIle: 5.17 ± 0.747
LysLys: 6.768 ± 1.175
LysLeu: 6.956 ± 0.742
LysMet: 1.974 ± 0.451
LysAsn: 4.982 ± 0.713
LysPro: 3.196 ± 0.407
LysGln: 3.666 ± 0.597
LysArg: 3.854 ± 0.479
LysSer: 4.512 ± 0.599
LysThr: 5.922 ± 0.72
LysVal: 4.7 ± 0.656
LysTrp: 1.034 ± 0.277
LysTyr: 2.914 ± 0.68
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.392 ± 0.818
LeuCys: 0.658 ± 0.284
LeuAsp: 5.922 ± 0.875
LeuGlu: 6.58 ± 1.027
LeuPhe: 3.196 ± 0.449
LeuGly: 5.734 ± 0.996
LeuHis: 1.034 ± 0.344
LeuIle: 3.948 ± 0.66
LeuLys: 6.956 ± 0.702
LeuLeu: 5.17 ± 0.702
LeuMet: 2.35 ± 0.479
LeuAsn: 6.204 ± 0.967
LeuPro: 2.82 ± 0.382
LeuGln: 2.82 ± 0.517
LeuArg: 3.572 ± 0.783
LeuSer: 4.888 ± 0.736
LeuThr: 6.486 ± 0.849
LeuVal: 4.136 ± 0.634
LeuTrp: 0.564 ± 0.306
LeuTyr: 1.786 ± 0.476
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.974 ± 0.322
MetCys: 0.0 ± 0.0
MetAsp: 0.752 ± 0.238
MetGlu: 1.504 ± 0.371
MetPhe: 1.222 ± 0.288
MetGly: 0.94 ± 0.289
MetHis: 0.282 ± 0.149
MetIle: 1.41 ± 0.356
MetLys: 2.914 ± 0.554
MetLeu: 1.598 ± 0.288
MetMet: 0.188 ± 0.153
MetAsn: 1.222 ± 0.277
MetPro: 1.128 ± 0.284
MetGln: 0.846 ± 0.268
MetArg: 0.752 ± 0.228
MetSer: 2.068 ± 0.509
MetThr: 1.504 ± 0.301
MetVal: 2.162 ± 0.468
MetTrp: 0.094 ± 0.076
MetTyr: 0.94 ± 0.324
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.888 ± 1.105
AsnCys: 0.376 ± 0.198
AsnAsp: 3.666 ± 0.533
AsnGlu: 4.23 ± 0.87
AsnPhe: 2.538 ± 0.602
AsnGly: 6.862 ± 1.322
AsnHis: 1.034 ± 0.288
AsnIle: 4.23 ± 0.606
AsnLys: 3.854 ± 0.532
AsnLeu: 5.17 ± 0.591
AsnMet: 1.41 ± 0.349
AsnAsn: 4.23 ± 0.779
AsnPro: 3.102 ± 0.575
AsnGln: 2.726 ± 0.432
AsnArg: 2.256 ± 0.416
AsnSer: 3.854 ± 0.8
AsnThr: 3.29 ± 0.555
AsnVal: 3.666 ± 0.574
AsnTrp: 1.504 ± 0.282
AsnTyr: 1.88 ± 0.546
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.41 ± 0.289
ProCys: 0.188 ± 0.18
ProAsp: 1.598 ± 0.436
ProGlu: 2.538 ± 0.526
ProPhe: 1.222 ± 0.292
ProGly: 1.222 ± 0.302
ProHis: 0.376 ± 0.166
ProIle: 1.598 ± 0.343
ProLys: 3.478 ± 0.565
ProLeu: 2.82 ± 0.498
ProMet: 0.188 ± 0.159
ProAsn: 2.444 ± 0.421
ProPro: 0.752 ± 0.317
ProGln: 1.222 ± 0.381
ProArg: 0.846 ± 0.363
ProSer: 2.726 ± 0.469
ProThr: 2.35 ± 0.411
ProVal: 1.692 ± 0.482
ProTrp: 0.47 ± 0.182
ProTyr: 1.034 ± 0.392
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.512 ± 0.647
GlnCys: 0.188 ± 0.119
GlnAsp: 1.786 ± 0.297
GlnGlu: 2.538 ± 0.57
GlnPhe: 1.41 ± 0.314
GlnGly: 3.29 ± 0.647
GlnHis: 0.47 ± 0.22
GlnIle: 2.444 ± 0.555
GlnLys: 3.384 ± 0.484
GlnLeu: 3.102 ± 0.444
GlnMet: 1.316 ± 0.361
GlnAsn: 2.632 ± 0.48
GlnPro: 0.282 ± 0.153
GlnGln: 2.538 ± 0.455
GlnArg: 1.786 ± 0.378
GlnSer: 2.632 ± 0.459
GlnThr: 2.82 ± 0.478
GlnVal: 1.88 ± 0.448
GlnTrp: 0.47 ± 0.235
GlnTyr: 1.786 ± 0.381
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.068 ± 0.429
ArgCys: 0.094 ± 0.109
ArgAsp: 2.632 ± 0.449
ArgGlu: 2.538 ± 0.547
ArgPhe: 2.444 ± 0.477
ArgGly: 2.914 ± 0.535
ArgHis: 0.846 ± 0.301
ArgIle: 3.29 ± 0.591
ArgLys: 3.008 ± 0.532
ArgLeu: 3.572 ± 0.594
ArgMet: 1.034 ± 0.298
ArgAsn: 2.538 ± 0.419
ArgPro: 1.128 ± 0.292
ArgGln: 1.974 ± 0.36
ArgArg: 1.504 ± 0.345
ArgSer: 1.598 ± 0.337
ArgThr: 2.726 ± 0.663
ArgVal: 2.726 ± 0.476
ArgTrp: 1.034 ± 0.294
ArgTyr: 2.162 ± 0.458
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.572 ± 0.505
SerCys: 0.47 ± 0.229
SerAsp: 4.606 ± 0.555
SerGlu: 3.572 ± 0.502
SerPhe: 2.632 ± 0.537
SerGly: 3.948 ± 0.634
SerHis: 0.47 ± 0.177
SerIle: 4.606 ± 0.636
SerLys: 5.264 ± 0.73
SerLeu: 4.606 ± 0.65
SerMet: 1.692 ± 0.358
SerAsn: 4.7 ± 0.604
SerPro: 1.598 ± 0.285
SerGln: 3.008 ± 0.573
SerArg: 2.538 ± 0.641
SerSer: 3.478 ± 0.624
SerThr: 3.76 ± 0.635
SerVal: 5.264 ± 0.665
SerTrp: 0.94 ± 0.358
SerTyr: 1.504 ± 0.378
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.23 ± 0.753
ThrCys: 0.376 ± 0.186
ThrAsp: 4.23 ± 0.638
ThrGlu: 3.854 ± 0.498
ThrPhe: 3.196 ± 0.62
ThrGly: 3.478 ± 0.451
ThrHis: 1.41 ± 0.339
ThrIle: 4.324 ± 0.706
ThrLys: 5.452 ± 0.627
ThrLeu: 6.768 ± 0.988
ThrMet: 0.846 ± 0.241
ThrAsn: 4.042 ± 0.686
ThrPro: 1.504 ± 0.465
ThrGln: 2.632 ± 0.46
ThrArg: 1.786 ± 0.326
ThrSer: 3.948 ± 0.572
ThrThr: 3.384 ± 0.61
ThrVal: 4.324 ± 0.574
ThrTrp: 0.658 ± 0.245
ThrTyr: 3.29 ± 0.592
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.854 ± 0.692
ValCys: 0.47 ± 0.2
ValAsp: 5.17 ± 0.581
ValGlu: 4.324 ± 0.558
ValPhe: 2.726 ± 0.617
ValGly: 4.042 ± 0.558
ValHis: 0.564 ± 0.16
ValIle: 4.23 ± 0.589
ValLys: 5.264 ± 0.516
ValLeu: 3.854 ± 0.678
ValMet: 1.128 ± 0.278
ValAsn: 3.572 ± 0.599
ValPro: 1.786 ± 0.376
ValGln: 1.974 ± 0.387
ValArg: 2.162 ± 0.603
ValSer: 4.512 ± 0.714
ValThr: 5.17 ± 0.811
ValVal: 3.478 ± 0.719
ValTrp: 1.222 ± 0.33
ValTyr: 1.692 ± 0.411
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.564 ± 0.199
TrpCys: 0.094 ± 0.097
TrpAsp: 1.034 ± 0.36
TrpGlu: 0.94 ± 0.213
TrpPhe: 0.752 ± 0.247
TrpGly: 0.658 ± 0.256
TrpHis: 0.47 ± 0.211
TrpIle: 0.658 ± 0.177
TrpLys: 0.658 ± 0.271
TrpLeu: 1.316 ± 0.331
TrpMet: 0.094 ± 0.083
TrpAsn: 0.94 ± 0.359
TrpPro: 0.094 ± 0.094
TrpGln: 0.752 ± 0.227
TrpArg: 0.846 ± 0.225
TrpSer: 1.504 ± 0.625
TrpThr: 0.846 ± 0.253
TrpVal: 1.41 ± 0.29
TrpTrp: 0.282 ± 0.208
TrpTyr: 0.376 ± 0.216
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.444 ± 0.383
TyrCys: 0.47 ± 0.277
TyrAsp: 2.538 ± 0.494
TyrGlu: 3.102 ± 0.554
TyrPhe: 2.068 ± 0.436
TyrGly: 1.786 ± 0.385
TyrHis: 0.752 ± 0.249
TyrIle: 2.35 ± 0.395
TyrLys: 2.726 ± 0.424
TyrLeu: 3.572 ± 0.422
TyrMet: 0.658 ± 0.244
TyrAsn: 1.504 ± 0.365
TyrPro: 1.222 ± 0.365
TyrGln: 2.256 ± 0.349
TyrArg: 2.632 ± 0.631
TyrSer: 2.256 ± 0.575
TyrThr: 1.504 ± 0.327
TyrVal: 2.35 ± 0.475
TyrTrp: 0.188 ± 0.135
TyrTyr: 2.444 ± 0.51
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (10640 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
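One way a protein-level bootstrap could be carried out is sketched below. This is an illustration under assumptions, not the database's procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the reported error is taken to be the standard deviation across 100 resamples. The function names and the fixed seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes) in a protein set."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the error shown in the table is a standard deviation;
    the source only says it comes from bootstrapping at the protein level.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy_proteome = ["MAAK", "AAG", "MKV"]
print(pair_per_mille(toy_proteome, "AA"), bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins.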

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski