Amino acid dipeptide frequency for Streptococcus satellite phage Javan96

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
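As a rough illustration of how such a frequency could be computed, the sketch below counts overlapping residue pairs in one-letter protein sequences and reports them in per mille. The function name, the toy sequences, and the choice to count pairs only within a protein (never across two proteins) are assumptions made for this example; the page does not describe the exact procedure used by Proteome-pI.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein strings.

    Hypothetical helper: pairs are counted with a sliding window of length 2
    inside each protein, so no pair spans two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not the Javan96 proteome): 4 + 3 = 7 dipeptides in total.
toy = ["MKLLE", "MLLK"]
freqs = dipeptide_frequencies(toy)
```

Here "LL" occurs twice among seven pairs, so its frequency is 2/7 of the total, and all frequencies together sum to 1000 per mille.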

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
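The uniform expectation and the red/blue marking can be sketched as follows. The function names, the "neutral" label for an exact tie, and the default of 20 amino acids (which gives the 0.25% stated above) are assumptions for illustration; the page does not say which threshold its colouring actually uses.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide as over- or underrepresented (hypothetical helper)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "red"      # more frequent than the uniform expectation
    if observed_per_mille < expected:
        return "blue"     # less frequent than the uniform expectation
    return "neutral"      # assumption: exact ties get no colour


leu_leu = classify(12.921)  # LeuLeu value taken from the table
ala_ala = classify(0.538)   # AlaAla value taken from the table
```

With 20 amino acids the expectation is 2.5‰, so LeuLeu (12.921‰) would be marked red and AlaAla (0.538‰) blue. Passing alphabet_size=21 gives the expectation when Xaa is included, 1000/441 per mille.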

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
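The unit conversion is simple arithmetic; a minimal sketch is shown below. The helper names are made up for this example, and the input is the LeuLeu value from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille * 0.1


leu_leu_fraction = per_mille_to_fraction(12.921)
leu_leu_percent = per_mille_to_percent(12.921)
```

So a table entry of 12.921‰ corresponds to a fraction of about 0.0129, or about 1.29%.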

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.538 ± 0.371
AlaCys: 1.346 ± 0.553
AlaAsp: 2.961 ± 0.929
AlaGlu: 5.384 ± 1.135
AlaPhe: 2.961 ± 0.575
AlaGly: 3.23 ± 0.774
AlaHis: 1.077 ± 0.765
AlaIle: 6.191 ± 0.995
AlaLys: 4.576 ± 1.143
AlaLeu: 4.307 ± 1.034
AlaMet: 2.423 ± 0.627
AlaAsn: 4.038 ± 0.814
AlaPro: 1.346 ± 0.553
AlaGln: 2.153 ± 0.687
AlaArg: 1.884 ± 0.623
AlaSer: 4.038 ± 1.023
AlaThr: 3.499 ± 0.862
AlaVal: 3.23 ± 0.943
AlaTrp: 0.269 ± 0.259
AlaTyr: 1.615 ± 0.557
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.538 ± 0.352
CysCys: 0.0 ± 0.0
CysAsp: 0.538 ± 0.354
CysGlu: 0.538 ± 0.311
CysPhe: 0.0 ± 0.0
CysGly: 0.269 ± 0.248
CysHis: 0.538 ± 0.391
CysIle: 0.269 ± 0.284
CysLys: 0.538 ± 0.312
CysLeu: 0.538 ± 0.329
CysMet: 0.0 ± 0.0
CysAsn: 0.808 ± 0.477
CysPro: 0.269 ± 0.248
CysGln: 0.538 ± 0.497
CysArg: 1.077 ± 0.779
CysSer: 1.077 ± 0.602
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.269 ± 0.297
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.346 ± 0.537
AspCys: 1.077 ± 0.582
AspAsp: 3.23 ± 1.154
AspGlu: 3.769 ± 0.905
AspPhe: 3.769 ± 0.945
AspGly: 1.884 ± 0.564
AspHis: 0.269 ± 0.258
AspIle: 5.114 ± 1.156
AspLys: 4.307 ± 1.001
AspLeu: 6.729 ± 1.091
AspMet: 2.423 ± 0.848
AspAsn: 4.038 ± 1.545
AspPro: 1.615 ± 0.535
AspGln: 1.077 ± 0.626
AspArg: 4.576 ± 1.039
AspSer: 2.961 ± 0.879
AspThr: 4.038 ± 1.031
AspVal: 1.346 ± 0.652
AspTrp: 0.538 ± 0.33
AspTyr: 4.038 ± 0.741
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.114 ± 0.855
GluCys: 1.346 ± 0.731
GluAsp: 3.769 ± 0.906
GluGlu: 4.576 ± 0.923
GluPhe: 3.499 ± 1.009
GluGly: 2.423 ± 0.871
GluHis: 1.884 ± 0.561
GluIle: 5.384 ± 1.21
GluLys: 8.614 ± 1.326
GluLeu: 11.306 ± 1.588
GluMet: 1.346 ± 0.375
GluAsn: 3.23 ± 1.212
GluPro: 0.808 ± 0.366
GluGln: 6.191 ± 1.454
GluArg: 4.038 ± 0.991
GluSer: 4.307 ± 0.854
GluThr: 6.191 ± 1.076
GluVal: 1.884 ± 0.473
GluTrp: 1.077 ± 0.387
GluTyr: 3.769 ± 0.771
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.423 ± 0.727
PheCys: 0.269 ± 0.218
PheAsp: 2.961 ± 0.582
PheGlu: 4.576 ± 1.21
PhePhe: 1.077 ± 0.551
PheGly: 1.615 ± 0.58
PheHis: 1.346 ± 0.508
PheIle: 3.769 ± 0.736
PheLys: 4.576 ± 0.925
PheLeu: 3.499 ± 0.791
PheMet: 0.0 ± 0.0
PheAsn: 3.769 ± 0.877
PhePro: 1.346 ± 0.658
PheGln: 1.346 ± 0.52
PheArg: 2.153 ± 0.715
PheSer: 3.23 ± 0.723
PheThr: 2.423 ± 0.687
PheVal: 1.346 ± 0.506
PheTrp: 0.269 ± 0.218
PheTyr: 1.615 ± 0.527
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.961 ± 1.039
GlyCys: 0.538 ± 0.348
GlyAsp: 3.769 ± 0.895
GlyGlu: 3.499 ± 0.649
GlyPhe: 1.615 ± 0.685
GlyGly: 1.884 ± 0.554
GlyHis: 1.077 ± 0.602
GlyIle: 3.499 ± 0.752
GlyLys: 3.499 ± 0.784
GlyLeu: 5.922 ± 1.53
GlyMet: 1.077 ± 0.418
GlyAsn: 2.153 ± 0.725
GlyPro: 0.269 ± 0.239
GlyGln: 1.077 ± 0.681
GlyArg: 2.153 ± 0.565
GlySer: 0.538 ± 0.329
GlyThr: 3.23 ± 1.075
GlyVal: 2.153 ± 0.815
GlyTrp: 0.538 ± 0.437
GlyTyr: 2.692 ± 0.758
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.153 ± 1.007
HisCys: 0.0 ± 0.0
HisAsp: 0.808 ± 0.533
HisGlu: 0.808 ± 0.477
HisPhe: 0.269 ± 0.218
HisGly: 1.346 ± 0.501
HisHis: 0.269 ± 0.255
HisIle: 1.077 ± 0.575
HisLys: 1.077 ± 0.468
HisLeu: 1.346 ± 0.483
HisMet: 0.269 ± 0.234
HisAsn: 2.153 ± 0.677
HisPro: 0.269 ± 0.242
HisGln: 0.808 ± 0.444
HisArg: 1.077 ± 0.649
HisSer: 1.884 ± 0.796
HisThr: 1.615 ± 0.694
HisVal: 0.808 ± 0.524
HisTrp: 0.538 ± 0.497
HisTyr: 1.884 ± 0.615
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.384 ± 1.073
IleCys: 0.808 ± 0.37
IleAsp: 5.922 ± 1.185
IleGlu: 6.191 ± 1.035
IlePhe: 3.769 ± 0.645
IleGly: 2.961 ± 0.762
IleHis: 1.077 ± 0.567
IleIle: 6.191 ± 1.086
IleLys: 7.806 ± 1.228
IleLeu: 4.576 ± 0.95
IleMet: 1.346 ± 0.523
IleAsn: 5.384 ± 1.627
IlePro: 4.038 ± 0.86
IleGln: 2.153 ± 0.914
IleArg: 2.423 ± 0.613
IleSer: 4.845 ± 1.286
IleThr: 4.038 ± 0.993
IleVal: 2.423 ± 0.67
IleTrp: 0.0 ± 0.0
IleTyr: 3.23 ± 0.845
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.46 ± 1.452
LysCys: 0.0 ± 0.0
LysAsp: 4.576 ± 1.21
LysGlu: 9.69 ± 1.434
LysPhe: 2.423 ± 0.588
LysGly: 4.038 ± 0.98
LysHis: 2.692 ± 0.845
LysIle: 5.384 ± 1.494
LysLys: 5.114 ± 1.061
LysLeu: 9.421 ± 1.446
LysMet: 1.884 ± 0.513
LysAsn: 3.769 ± 0.717
LysPro: 5.653 ± 1.247
LysGln: 3.23 ± 1.112
LysArg: 4.307 ± 1.026
LysSer: 4.307 ± 0.926
LysThr: 4.307 ± 0.932
LysVal: 5.384 ± 0.845
LysTrp: 0.538 ± 0.35
LysTyr: 3.499 ± 0.797
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.845 ± 1.233
LeuCys: 0.538 ± 0.311
LeuAsp: 8.345 ± 0.876
LeuGlu: 11.575 ± 1.553
LeuPhe: 4.307 ± 0.899
LeuGly: 4.845 ± 1.171
LeuHis: 1.346 ± 0.663
LeuIle: 8.614 ± 1.554
LeuLys: 9.69 ± 1.306
LeuLeu: 12.921 ± 1.951
LeuMet: 3.23 ± 0.898
LeuAsn: 4.576 ± 1.157
LeuPro: 3.769 ± 1.03
LeuGln: 2.153 ± 0.764
LeuArg: 4.307 ± 1.256
LeuSer: 6.46 ± 1.448
LeuThr: 4.307 ± 0.866
LeuVal: 4.307 ± 0.864
LeuTrp: 1.077 ± 0.398
LeuTyr: 3.769 ± 0.619
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.423 ± 0.881
MetCys: 0.0 ± 0.0
MetAsp: 0.808 ± 0.44
MetGlu: 1.884 ± 0.5
MetPhe: 0.0 ± 0.0
MetGly: 0.269 ± 0.218
MetHis: 0.269 ± 0.237
MetIle: 0.808 ± 0.386
MetLys: 2.692 ± 0.575
MetLeu: 2.423 ± 0.777
MetMet: 0.0 ± 0.0
MetAsn: 1.615 ± 0.668
MetPro: 0.538 ± 0.274
MetGln: 0.538 ± 0.35
MetArg: 1.346 ± 0.465
MetSer: 0.808 ± 0.424
MetThr: 3.23 ± 1.229
MetVal: 1.615 ± 0.465
MetTrp: 0.0 ± 0.0
MetTyr: 0.269 ± 0.242
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.576 ± 0.889
AsnCys: 0.0 ± 0.0
AsnAsp: 2.961 ± 0.977
AsnGlu: 2.423 ± 0.608
AsnPhe: 1.077 ± 0.411
AsnGly: 4.845 ± 1.005
AsnHis: 2.153 ± 0.427
AsnIle: 4.038 ± 1.056
AsnLys: 4.576 ± 0.763
AsnLeu: 4.576 ± 0.953
AsnMet: 1.346 ± 0.546
AsnAsn: 3.499 ± 0.879
AsnPro: 2.961 ± 0.776
AsnGln: 3.769 ± 1.2
AsnArg: 5.114 ± 1.041
AsnSer: 2.423 ± 0.881
AsnThr: 2.692 ± 0.914
AsnVal: 2.961 ± 0.713
AsnTrp: 0.538 ± 0.358
AsnTyr: 1.884 ± 0.593
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.346 ± 0.612
ProCys: 0.0 ± 0.0
ProAsp: 1.346 ± 0.679
ProGlu: 3.769 ± 0.884
ProPhe: 2.423 ± 0.806
ProGly: 0.538 ± 0.352
ProHis: 0.538 ± 0.358
ProIle: 1.615 ± 0.746
ProLys: 6.46 ± 0.879
ProLeu: 2.153 ± 0.824
ProMet: 0.808 ± 0.412
ProAsn: 2.153 ± 1.067
ProPro: 1.077 ± 0.435
ProGln: 1.884 ± 0.804
ProArg: 2.961 ± 0.743
ProSer: 1.346 ± 0.513
ProThr: 1.884 ± 0.49
ProVal: 1.615 ± 0.632
ProTrp: 0.269 ± 0.218
ProTyr: 0.808 ± 0.501
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.961 ± 0.668
GlnCys: 0.0 ± 0.0
GlnAsp: 1.615 ± 0.729
GlnGlu: 3.769 ± 0.816
GlnPhe: 1.884 ± 0.51
GlnGly: 1.884 ± 0.702
GlnHis: 0.269 ± 0.253
GlnIle: 2.961 ± 0.767
GlnLys: 3.769 ± 0.666
GlnLeu: 5.922 ± 1.454
GlnMet: 0.808 ± 0.522
GlnAsn: 2.423 ± 0.704
GlnPro: 1.346 ± 0.561
GlnGln: 1.346 ± 0.445
GlnArg: 2.153 ± 0.658
GlnSer: 1.346 ± 0.561
GlnThr: 1.615 ± 0.58
GlnVal: 3.23 ± 1.04
GlnTrp: 0.808 ± 0.469
GlnTyr: 1.615 ± 0.815
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.692 ± 1.002
ArgCys: 1.077 ± 0.453
ArgAsp: 1.884 ± 0.686
ArgGlu: 5.114 ± 0.946
ArgPhe: 3.499 ± 0.792
ArgGly: 1.615 ± 0.67
ArgHis: 1.077 ± 0.491
ArgIle: 4.307 ± 0.951
ArgLys: 2.961 ± 0.694
ArgLeu: 5.384 ± 1.357
ArgMet: 0.538 ± 0.384
ArgAsn: 2.961 ± 0.736
ArgPro: 1.077 ± 0.496
ArgGln: 3.499 ± 0.863
ArgArg: 2.423 ± 0.575
ArgSer: 2.153 ± 0.631
ArgThr: 2.961 ± 0.704
ArgVal: 3.769 ± 0.752
ArgTrp: 0.538 ± 0.425
ArgTyr: 3.769 ± 1.021
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.153 ± 0.679
SerCys: 0.269 ± 0.248
SerAsp: 5.384 ± 1.067
SerGlu: 3.499 ± 0.804
SerPhe: 1.884 ± 0.784
SerGly: 2.692 ± 0.95
SerHis: 0.808 ± 0.508
SerIle: 3.769 ± 0.741
SerLys: 5.384 ± 0.942
SerLeu: 5.384 ± 0.915
SerMet: 0.538 ± 0.3
SerAsn: 2.961 ± 0.744
SerPro: 1.077 ± 0.414
SerGln: 2.153 ± 0.901
SerArg: 2.153 ± 0.886
SerSer: 1.884 ± 0.846
SerThr: 4.038 ± 0.653
SerVal: 3.23 ± 1.141
SerTrp: 0.808 ± 0.431
SerTyr: 2.692 ± 1.141
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.769 ± 1.152
ThrCys: 0.0 ± 0.0
ThrAsp: 2.423 ± 0.647
ThrGlu: 2.961 ± 0.622
ThrPhe: 4.307 ± 1.642
ThrGly: 3.23 ± 0.554
ThrHis: 1.346 ± 0.415
ThrIle: 5.114 ± 1.362
ThrLys: 4.038 ± 0.899
ThrLeu: 7.806 ± 1.329
ThrMet: 1.077 ± 0.567
ThrAsn: 1.346 ± 0.723
ThrPro: 3.499 ± 0.829
ThrGln: 2.423 ± 0.803
ThrArg: 2.692 ± 0.715
ThrSer: 1.884 ± 0.782
ThrThr: 2.153 ± 0.856
ThrVal: 2.692 ± 0.754
ThrTrp: 1.077 ± 0.416
ThrTyr: 3.23 ± 1.192
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.23 ± 0.586
ValCys: 0.269 ± 0.218
ValAsp: 1.884 ± 0.637
ValGlu: 3.23 ± 1.05
ValPhe: 2.423 ± 0.705
ValGly: 1.884 ± 0.606
ValHis: 0.538 ± 0.497
ValIle: 3.499 ± 0.961
ValLys: 3.499 ± 0.591
ValLeu: 5.384 ± 1.101
ValMet: 0.538 ± 0.36
ValAsn: 4.307 ± 0.772
ValPro: 1.884 ± 0.95
ValGln: 2.423 ± 0.737
ValArg: 2.153 ± 0.737
ValSer: 4.307 ± 1.209
ValThr: 2.692 ± 0.893
ValVal: 2.153 ± 0.913
ValTrp: 0.0 ± 0.0
ValTyr: 1.077 ± 0.391
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.269 ± 0.251
TrpCys: 0.0 ± 0.0
TrpAsp: 0.538 ± 0.328
TrpGlu: 0.538 ± 0.364
TrpPhe: 0.0 ± 0.0
TrpGly: 0.269 ± 0.242
TrpHis: 0.538 ± 0.347
TrpIle: 0.538 ± 0.339
TrpLys: 0.269 ± 0.248
TrpLeu: 1.346 ± 0.453
TrpMet: 0.0 ± 0.0
TrpAsn: 0.269 ± 0.251
TrpPro: 0.538 ± 0.33
TrpGln: 0.538 ± 0.334
TrpArg: 0.808 ± 0.416
TrpSer: 0.538 ± 0.317
TrpThr: 0.269 ± 0.362
TrpVal: 1.346 ± 0.487
TrpTrp: 0.269 ± 0.251
TrpTyr: 0.538 ± 0.347
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.884 ± 0.733
TyrCys: 0.269 ± 0.239
TyrAsp: 2.423 ± 0.645
TyrGlu: 2.961 ± 0.818
TyrPhe: 2.423 ± 0.728
TyrGly: 2.423 ± 0.622
TyrHis: 1.346 ± 0.577
TyrIle: 2.692 ± 0.77
TyrLys: 3.23 ± 0.986
TyrLeu: 4.038 ± 0.983
TyrMet: 1.615 ± 0.702
TyrAsn: 2.961 ± 0.925
TyrPro: 1.615 ± 0.811
TyrGln: 2.423 ± 0.797
TyrArg: 3.499 ± 0.924
TyrSer: 2.423 ± 0.622
TyrThr: 1.884 ± 0.657
TyrVal: 1.884 ± 0.544
TyrTrp: 0.269 ± 0.248
TyrTyr: 2.961 ± 1.102
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 22 proteins (3716 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
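A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those values is reported. The function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the reported error are assumptions for this example; the note above does not state which spread measure Proteome-pI uses.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates.

    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (not the Javan96 proteome).
toy = ["MKLLE", "MLLK", "MAEK", "MLLLA"]
err = bootstrap_error(toy, "LL")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much a frequency depends on which proteins happen to be in the set. With only 22 proteins, as here, that dependence can be large relative to the frequencies themselves.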

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski