Amino acid dipepetide frequency for Streptococcus phage CHPC1152

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.549AlaAla: 6.549 ± 2.524
0.285AlaCys: 0.285 ± 0.155
4.935AlaAsp: 4.935 ± 0.975
4.651AlaGlu: 4.651 ± 0.671
2.847AlaPhe: 2.847 ± 1.231
5.885AlaGly: 5.885 ± 1.657
0.759AlaHis: 0.759 ± 0.236
6.739AlaIle: 6.739 ± 1.591
5.125AlaLys: 5.125 ± 0.66
6.929AlaLeu: 6.929 ± 1.331
2.468AlaMet: 2.468 ± 0.969
3.986AlaAsn: 3.986 ± 0.686
2.658AlaPro: 2.658 ± 0.556
3.132AlaGln: 3.132 ± 0.984
3.322AlaArg: 3.322 ± 0.716
6.549AlaSer: 6.549 ± 1.726
4.271AlaThr: 4.271 ± 1.013
4.556AlaVal: 4.556 ± 1.303
0.475AlaTrp: 0.475 ± 0.177
2.373AlaTyr: 2.373 ± 0.611
0.0AlaXaa: 0.0 ± 0.0
Cys
0.285CysAla: 0.285 ± 0.137
0.0CysCys: 0.0 ± 0.0
0.569CysAsp: 0.569 ± 0.284
0.475CysGlu: 0.475 ± 0.198
0.095CysPhe: 0.095 ± 0.081
0.38CysGly: 0.38 ± 0.224
0.095CysHis: 0.095 ± 0.098
0.475CysIle: 0.475 ± 0.191
0.38CysLys: 0.38 ± 0.193
0.285CysLeu: 0.285 ± 0.227
0.095CysMet: 0.095 ± 0.087
0.285CysAsn: 0.285 ± 0.174
0.19CysPro: 0.19 ± 0.133
0.19CysGln: 0.19 ± 0.145
0.285CysArg: 0.285 ± 0.149
0.475CysSer: 0.475 ± 0.205
0.0CysThr: 0.0 ± 0.0
0.285CysVal: 0.285 ± 0.13
0.095CysTrp: 0.095 ± 0.097
0.19CysTyr: 0.19 ± 0.108
0.0CysXaa: 0.0 ± 0.0
Asp
2.942AspAla: 2.942 ± 0.495
0.285AspCys: 0.285 ± 0.16
4.651AspAsp: 4.651 ± 0.541
3.702AspGlu: 3.702 ± 0.618
3.132AspPhe: 3.132 ± 0.559
6.264AspGly: 6.264 ± 0.946
0.569AspHis: 0.569 ± 0.286
3.037AspIle: 3.037 ± 0.512
4.081AspLys: 4.081 ± 0.758
4.841AspLeu: 4.841 ± 0.684
1.519AspMet: 1.519 ± 0.35
4.176AspAsn: 4.176 ± 0.648
0.759AspPro: 0.759 ± 0.273
1.424AspGln: 1.424 ± 0.381
2.563AspArg: 2.563 ± 0.625
4.366AspSer: 4.366 ± 0.629
3.417AspThr: 3.417 ± 0.677
4.651AspVal: 4.651 ± 0.807
1.044AspTrp: 1.044 ± 0.299
3.512AspTyr: 3.512 ± 0.736
0.0AspXaa: 0.0 ± 0.0
Glu
5.03GluAla: 5.03 ± 0.795
0.285GluCys: 0.285 ± 0.163
2.373GluAsp: 2.373 ± 0.605
3.986GluGlu: 3.986 ± 0.854
2.752GluPhe: 2.752 ± 0.533
3.417GluGly: 3.417 ± 0.43
1.044GluHis: 1.044 ± 0.341
5.979GluIle: 5.979 ± 0.844
5.885GluLys: 5.885 ± 1.18
7.308GluLeu: 7.308 ± 1.213
2.942GluMet: 2.942 ± 0.74
4.081GluAsn: 4.081 ± 0.632
1.803GluPro: 1.803 ± 0.653
2.658GluGln: 2.658 ± 0.436
3.986GluArg: 3.986 ± 0.645
2.373GluSer: 2.373 ± 0.68
3.891GluThr: 3.891 ± 0.652
5.505GluVal: 5.505 ± 0.921
1.044GluTrp: 1.044 ± 0.302
3.322GluTyr: 3.322 ± 0.997
0.0GluXaa: 0.0 ± 0.0
Phe
2.278PheAla: 2.278 ± 0.345
0.19PheCys: 0.19 ± 0.128
3.227PheAsp: 3.227 ± 0.512
3.702PheGlu: 3.702 ± 0.664
1.424PhePhe: 1.424 ± 0.395
3.797PheGly: 3.797 ± 0.66
0.38PheHis: 0.38 ± 0.161
2.752PheIle: 2.752 ± 0.54
5.22PheLys: 5.22 ± 0.602
2.278PheLeu: 2.278 ± 0.565
0.475PheMet: 0.475 ± 0.192
2.847PheAsn: 2.847 ± 0.473
0.38PhePro: 0.38 ± 0.232
0.759PheGln: 0.759 ± 0.282
0.949PheArg: 0.949 ± 0.296
3.417PheSer: 3.417 ± 0.763
2.468PheThr: 2.468 ± 0.588
2.278PheVal: 2.278 ± 0.4
0.569PheTrp: 0.569 ± 0.213
1.139PheTyr: 1.139 ± 0.404
0.0PheXaa: 0.0 ± 0.0
Gly
5.315GlyAla: 5.315 ± 1.158
0.475GlyCys: 0.475 ± 0.199
3.512GlyAsp: 3.512 ± 0.469
3.512GlyGlu: 3.512 ± 0.471
3.037GlyPhe: 3.037 ± 0.5
3.322GlyGly: 3.322 ± 0.516
0.475GlyHis: 0.475 ± 0.2
7.878GlyIle: 7.878 ± 1.979
5.885GlyLys: 5.885 ± 0.955
6.549GlyLeu: 6.549 ± 1.016
1.708GlyMet: 1.708 ± 0.703
3.797GlyAsn: 3.797 ± 0.548
0.664GlyPro: 0.664 ± 0.291
2.563GlyGln: 2.563 ± 0.574
3.132GlyArg: 3.132 ± 0.596
4.556GlySer: 4.556 ± 0.798
4.081GlyThr: 4.081 ± 0.78
4.271GlyVal: 4.271 ± 0.713
0.664GlyTrp: 0.664 ± 0.218
3.037GlyTyr: 3.037 ± 0.648
0.0GlyXaa: 0.0 ± 0.0
His
0.664HisAla: 0.664 ± 0.235
0.0HisCys: 0.0 ± 0.0
0.854HisAsp: 0.854 ± 0.26
0.664HisGlu: 0.664 ± 0.235
0.664HisPhe: 0.664 ± 0.225
1.044HisGly: 1.044 ± 0.359
0.475HisHis: 0.475 ± 0.174
1.234HisIle: 1.234 ± 0.367
0.949HisLys: 0.949 ± 0.274
1.044HisLeu: 1.044 ± 0.332
0.38HisMet: 0.38 ± 0.163
0.569HisAsn: 0.569 ± 0.272
0.475HisPro: 0.475 ± 0.188
0.285HisGln: 0.285 ± 0.18
0.949HisArg: 0.949 ± 0.339
0.854HisSer: 0.854 ± 0.257
0.664HisThr: 0.664 ± 0.181
1.139HisVal: 1.139 ± 0.363
0.19HisTrp: 0.19 ± 0.134
0.569HisTyr: 0.569 ± 0.229
0.0HisXaa: 0.0 ± 0.0
Ile
5.41IleAla: 5.41 ± 1.271
0.475IleCys: 0.475 ± 0.229
5.41IleAsp: 5.41 ± 0.496
4.176IleGlu: 4.176 ± 0.768
1.898IlePhe: 1.898 ± 0.336
5.505IleGly: 5.505 ± 1.079
1.044IleHis: 1.044 ± 0.277
4.651IleIle: 4.651 ± 0.954
5.41IleLys: 5.41 ± 0.62
3.607IleLeu: 3.607 ± 0.519
1.993IleMet: 1.993 ± 0.37
3.132IleAsn: 3.132 ± 0.695
3.132IlePro: 3.132 ± 0.703
3.132IleGln: 3.132 ± 0.58
3.132IleArg: 3.132 ± 0.624
5.979IleSer: 5.979 ± 1.666
3.986IleThr: 3.986 ± 0.454
4.461IleVal: 4.461 ± 0.845
0.38IleTrp: 0.38 ± 0.164
3.227IleTyr: 3.227 ± 0.812
0.0IleXaa: 0.0 ± 0.0
Lys
7.593LysAla: 7.593 ± 0.898
0.38LysCys: 0.38 ± 0.183
4.366LysAsp: 4.366 ± 0.737
7.593LysGlu: 7.593 ± 1.317
2.373LysPhe: 2.373 ± 0.542
5.6LysGly: 5.6 ± 0.588
1.424LysHis: 1.424 ± 0.488
4.556LysIle: 4.556 ± 0.784
6.834LysLys: 6.834 ± 1.608
6.454LysLeu: 6.454 ± 0.924
1.803LysMet: 1.803 ± 0.485
4.176LysAsn: 4.176 ± 0.801
2.942LysPro: 2.942 ± 0.491
2.373LysGln: 2.373 ± 0.578
4.366LysArg: 4.366 ± 0.695
4.271LysSer: 4.271 ± 0.574
5.695LysThr: 5.695 ± 0.8
3.607LysVal: 3.607 ± 0.584
0.949LysTrp: 0.949 ± 0.312
3.607LysTyr: 3.607 ± 0.767
0.0LysXaa: 0.0 ± 0.0
Leu
7.118LeuAla: 7.118 ± 0.941
0.38LeuCys: 0.38 ± 0.186
4.081LeuAsp: 4.081 ± 0.758
6.169LeuGlu: 6.169 ± 1.031
2.373LeuPhe: 2.373 ± 0.356
5.979LeuGly: 5.979 ± 1.171
0.475LeuHis: 0.475 ± 0.239
3.512LeuIle: 3.512 ± 0.555
6.644LeuLys: 6.644 ± 1.092
4.841LeuLeu: 4.841 ± 0.704
1.898LeuMet: 1.898 ± 0.462
5.505LeuAsn: 5.505 ± 0.765
2.658LeuPro: 2.658 ± 0.595
2.942LeuGln: 2.942 ± 0.457
3.322LeuArg: 3.322 ± 0.723
5.79LeuSer: 5.79 ± 0.545
5.979LeuThr: 5.979 ± 1.032
5.22LeuVal: 5.22 ± 0.55
0.285LeuTrp: 0.285 ± 0.293
2.468LeuTyr: 2.468 ± 0.439
0.0LeuXaa: 0.0 ± 0.0
Met
3.037MetAla: 3.037 ± 1.008
0.0MetCys: 0.0 ± 0.0
0.854MetAsp: 0.854 ± 0.266
1.329MetGlu: 1.329 ± 0.482
1.139MetPhe: 1.139 ± 0.265
1.234MetGly: 1.234 ± 0.394
0.095MetHis: 0.095 ± 0.086
1.234MetIle: 1.234 ± 0.366
2.088MetLys: 2.088 ± 0.482
1.614MetLeu: 1.614 ± 0.372
0.949MetMet: 0.949 ± 0.607
1.614MetAsn: 1.614 ± 0.337
0.569MetPro: 0.569 ± 0.207
1.708MetGln: 1.708 ± 0.546
1.044MetArg: 1.044 ± 0.268
2.088MetSer: 2.088 ± 0.539
1.044MetThr: 1.044 ± 0.306
2.563MetVal: 2.563 ± 0.576
0.0MetTrp: 0.0 ± 0.0
0.759MetTyr: 0.759 ± 0.304
0.0MetXaa: 0.0 ± 0.0
Asn
4.176AsnAla: 4.176 ± 0.69
0.19AsnCys: 0.19 ± 0.104
3.607AsnAsp: 3.607 ± 0.714
4.746AsnGlu: 4.746 ± 0.837
2.752AsnPhe: 2.752 ± 0.483
5.505AsnGly: 5.505 ± 0.959
1.234AsnHis: 1.234 ± 0.448
3.417AsnIle: 3.417 ± 0.625
4.366AsnLys: 4.366 ± 0.813
4.366AsnLeu: 4.366 ± 0.545
0.664AsnMet: 0.664 ± 0.269
3.227AsnAsn: 3.227 ± 0.615
2.563AsnPro: 2.563 ± 0.436
2.183AsnGln: 2.183 ± 0.453
1.993AsnArg: 1.993 ± 0.443
3.037AsnSer: 3.037 ± 0.618
3.037AsnThr: 3.037 ± 0.516
3.322AsnVal: 3.322 ± 0.439
1.139AsnTrp: 1.139 ± 0.369
1.803AsnTyr: 1.803 ± 0.534
0.0AsnXaa: 0.0 ± 0.0
Pro
1.424ProAla: 1.424 ± 0.384
0.19ProCys: 0.19 ± 0.166
1.803ProAsp: 1.803 ± 0.481
2.468ProGlu: 2.468 ± 0.68
1.234ProPhe: 1.234 ± 0.306
1.044ProGly: 1.044 ± 0.275
0.38ProHis: 0.38 ± 0.171
1.803ProIle: 1.803 ± 0.453
2.942ProLys: 2.942 ± 0.505
1.993ProLeu: 1.993 ± 0.469
0.095ProMet: 0.095 ± 0.086
1.993ProAsn: 1.993 ± 0.473
1.234ProPro: 1.234 ± 0.28
1.424ProGln: 1.424 ± 0.383
1.234ProArg: 1.234 ± 0.412
2.373ProSer: 2.373 ± 0.397
1.519ProThr: 1.519 ± 0.498
1.898ProVal: 1.898 ± 0.415
0.38ProTrp: 0.38 ± 0.187
0.949ProTyr: 0.949 ± 0.318
0.0ProXaa: 0.0 ± 0.0
Gln
3.986GlnAla: 3.986 ± 1.091
0.095GlnCys: 0.095 ± 0.08
1.993GlnAsp: 1.993 ± 0.412
2.563GlnGlu: 2.563 ± 0.678
2.183GlnPhe: 2.183 ± 0.558
2.278GlnGly: 2.278 ± 0.728
0.475GlnHis: 0.475 ± 0.209
2.752GlnIle: 2.752 ± 0.683
2.563GlnLys: 2.563 ± 0.538
3.702GlnLeu: 3.702 ± 0.534
1.424GlnMet: 1.424 ± 0.389
1.614GlnAsn: 1.614 ± 0.296
0.759GlnPro: 0.759 ± 0.185
1.234GlnGln: 1.234 ± 0.319
1.234GlnArg: 1.234 ± 0.329
2.942GlnSer: 2.942 ± 0.624
2.373GlnThr: 2.373 ± 0.358
2.183GlnVal: 2.183 ± 0.333
0.285GlnTrp: 0.285 ± 0.18
1.234GlnTyr: 1.234 ± 0.376
0.0GlnXaa: 0.0 ± 0.0
Arg
3.797ArgAla: 3.797 ± 0.581
0.569ArgCys: 0.569 ± 0.245
2.752ArgAsp: 2.752 ± 0.469
3.417ArgGlu: 3.417 ± 0.656
1.708ArgPhe: 1.708 ± 0.511
2.563ArgGly: 2.563 ± 0.369
0.569ArgHis: 0.569 ± 0.222
3.037ArgIle: 3.037 ± 0.693
3.322ArgLys: 3.322 ± 0.799
3.417ArgLeu: 3.417 ± 0.579
1.614ArgMet: 1.614 ± 0.311
1.898ArgAsn: 1.898 ± 0.459
1.044ArgPro: 1.044 ± 0.246
1.519ArgGln: 1.519 ± 0.436
1.519ArgArg: 1.519 ± 0.496
2.373ArgSer: 2.373 ± 0.394
1.803ArgThr: 1.803 ± 0.467
2.847ArgVal: 2.847 ± 0.554
0.949ArgTrp: 0.949 ± 0.306
2.088ArgTyr: 2.088 ± 0.491
0.0ArgXaa: 0.0 ± 0.0
Ser
6.264SerAla: 6.264 ± 2.835
0.475SerCys: 0.475 ± 0.235
4.271SerAsp: 4.271 ± 0.63
3.132SerGlu: 3.132 ± 0.662
2.942SerPhe: 2.942 ± 0.421
4.081SerGly: 4.081 ± 0.558
1.139SerHis: 1.139 ± 0.328
5.6SerIle: 5.6 ± 0.915
4.935SerLys: 4.935 ± 0.667
5.03SerLeu: 5.03 ± 0.892
1.803SerMet: 1.803 ± 0.365
3.891SerAsn: 3.891 ± 0.666
1.424SerPro: 1.424 ± 0.327
3.607SerGln: 3.607 ± 1.18
1.898SerArg: 1.898 ± 0.418
3.986SerSer: 3.986 ± 1.052
4.461SerThr: 4.461 ± 0.632
5.79SerVal: 5.79 ± 0.856
0.759SerTrp: 0.759 ± 0.248
1.614SerTyr: 1.614 ± 0.333
0.0SerXaa: 0.0 ± 0.0
Thr
4.461ThrAla: 4.461 ± 1.526
0.095ThrCys: 0.095 ± 0.08
2.752ThrAsp: 2.752 ± 0.476
3.891ThrGlu: 3.891 ± 0.642
3.512ThrPhe: 3.512 ± 0.509
3.891ThrGly: 3.891 ± 0.49
1.424ThrHis: 1.424 ± 0.41
4.081ThrIle: 4.081 ± 0.651
6.169ThrLys: 6.169 ± 0.838
4.746ThrLeu: 4.746 ± 0.673
1.519ThrMet: 1.519 ± 0.943
3.227ThrAsn: 3.227 ± 0.593
1.993ThrPro: 1.993 ± 0.377
2.658ThrGln: 2.658 ± 0.424
1.993ThrArg: 1.993 ± 0.474
2.847ThrSer: 2.847 ± 0.822
3.417ThrThr: 3.417 ± 0.582
5.41ThrVal: 5.41 ± 0.668
0.19ThrTrp: 0.19 ± 0.158
2.183ThrTyr: 2.183 ± 0.586
0.0ThrXaa: 0.0 ± 0.0
Val
4.746ValAla: 4.746 ± 1.293
0.285ValCys: 0.285 ± 0.169
5.125ValAsp: 5.125 ± 0.758
6.074ValGlu: 6.074 ± 0.939
2.563ValPhe: 2.563 ± 0.493
3.702ValGly: 3.702 ± 0.647
1.044ValHis: 1.044 ± 0.324
4.271ValIle: 4.271 ± 0.66
5.41ValLys: 5.41 ± 0.466
4.746ValLeu: 4.746 ± 0.546
0.854ValMet: 0.854 ± 0.31
4.366ValAsn: 4.366 ± 0.841
1.898ValPro: 1.898 ± 0.388
2.278ValGln: 2.278 ± 0.64
2.373ValArg: 2.373 ± 0.415
5.79ValSer: 5.79 ± 0.842
5.03ValThr: 5.03 ± 0.643
4.461ValVal: 4.461 ± 0.579
0.949ValTrp: 0.949 ± 0.276
1.329ValTyr: 1.329 ± 0.375
0.0ValXaa: 0.0 ± 0.0
Trp
0.38TrpAla: 0.38 ± 0.141
0.095TrpCys: 0.095 ± 0.098
0.664TrpAsp: 0.664 ± 0.307
1.234TrpGlu: 1.234 ± 0.289
0.569TrpPhe: 0.569 ± 0.198
0.759TrpGly: 0.759 ± 0.243
0.19TrpHis: 0.19 ± 0.127
0.475TrpIle: 0.475 ± 0.228
0.664TrpLys: 0.664 ± 0.185
0.949TrpLeu: 0.949 ± 0.275
0.095TrpMet: 0.095 ± 0.097
0.759TrpAsn: 0.759 ± 0.359
0.095TrpPro: 0.095 ± 0.113
0.569TrpGln: 0.569 ± 0.21
0.569TrpArg: 0.569 ± 0.209
0.759TrpSer: 0.759 ± 0.293
0.854TrpThr: 0.854 ± 0.296
0.854TrpVal: 0.854 ± 0.204
0.285TrpTrp: 0.285 ± 0.202
0.19TrpTyr: 0.19 ± 0.121
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.037TyrAla: 3.037 ± 0.43
0.38TyrCys: 0.38 ± 0.168
3.132TyrAsp: 3.132 ± 0.7
2.183TyrGlu: 2.183 ± 0.594
1.329TyrPhe: 1.329 ± 0.425
2.183TyrGly: 2.183 ± 0.491
0.38TyrHis: 0.38 ± 0.191
2.658TyrIle: 2.658 ± 0.668
2.088TyrLys: 2.088 ± 0.362
3.037TyrLeu: 3.037 ± 0.704
0.569TyrMet: 0.569 ± 0.212
2.183TyrAsn: 2.183 ± 0.57
1.139TyrPro: 1.139 ± 0.348
1.234TyrGln: 1.234 ± 0.323
2.942TyrArg: 2.942 ± 0.9
2.278TyrSer: 2.278 ± 0.473
2.468TyrThr: 2.468 ± 0.555
1.993TyrVal: 1.993 ± 0.342
0.38TyrTrp: 0.38 ± 0.176
1.614TyrTyr: 1.614 ± 0.659
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (10537 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski