Amino acid dipeptide frequency for Streptococcus satellite phage Javan593

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue, in the table.
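The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, sequences are assumed to use one-letter codes, and pairs are counted within each protein only, which may differ from how the database itself computes the table.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; assumption: pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
EXPECTED = 1000.0 / len(AMINO_ACIDS) ** 2

toy_proteins = ["MKLLEE", "MLEKK"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides above the uniform expectation (these would be the "red" cells).
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED)
print(EXPECTED, over)
```

With such short toy input every observed dipeptide lands above 2.5‰; on a real proteome the same comparison separates overrepresented from underrepresented pairs.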

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
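As a small worked example of the unit conversion, the snippet below converts the LeuGlu entry from the table (15.119‰) to a fraction and to a percentage, and expresses the 0.25% uniform expectation in per mille. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


leu_glu = 15.119  # LeuGlu entry from the table, in per mille
fraction = per_mille_to_fraction(leu_glu)  # about 0.015119
percent = fraction * 100                   # about 1.5119 %

# The uniform expectation of 0.25 % corresponds to 2.5 per mille.
uniform_per_mille = 0.25 * 10

print(fraction, percent, uniform_per_mille)
```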

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.234 ± 0.519
AlaCys: 0.0 ± 0.0
AlaAsp: 2.468 ± 0.9
AlaGlu: 4.937 ± 1.844
AlaPhe: 2.468 ± 0.873
AlaGly: 2.468 ± 1.046
AlaHis: 0.0 ± 0.0
AlaIle: 4.628 ± 1.018
AlaLys: 5.862 ± 1.046
AlaLeu: 5.245 ± 0.924
AlaMet: 1.851 ± 0.786
AlaAsn: 2.16 ± 0.492
AlaPro: 2.16 ± 0.512
AlaGln: 2.777 ± 1.067
AlaArg: 1.851 ± 0.564
AlaSer: 3.085 ± 0.873
AlaThr: 3.394 ± 0.784
AlaVal: 1.851 ± 0.58
AlaTrp: 0.309 ± 0.338
AlaTyr: 2.777 ± 0.652
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.926 ± 0.477
CysCys: 0.309 ± 0.279
CysAsp: 0.0 ± 0.0
CysGlu: 0.309 ± 0.293
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.617 ± 0.392
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.309 ± 0.279
CysGln: 0.0 ± 0.0
CysArg: 0.309 ± 0.293
CysSer: 0.309 ± 0.279
CysThr: 0.617 ± 0.424
CysVal: 0.309 ± 0.299
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.468 ± 0.939
AspCys: 0.0 ± 0.0
AspAsp: 3.085 ± 1.003
AspGlu: 4.011 ± 1.3
AspPhe: 4.011 ± 0.87
AspGly: 2.468 ± 0.717
AspHis: 1.234 ± 0.904
AspIle: 6.171 ± 0.918
AspLys: 4.32 ± 1.201
AspLeu: 5.862 ± 1.333
AspMet: 0.309 ± 0.293
AspAsn: 1.234 ± 0.71
AspPro: 0.309 ± 0.234
AspGln: 1.543 ± 0.685
AspArg: 4.937 ± 1.268
AspSer: 4.937 ± 1.28
AspThr: 3.085 ± 0.877
AspVal: 3.394 ± 0.944
AspTrp: 0.617 ± 0.381
AspTyr: 1.543 ± 0.618
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.32 ± 0.846
GluCys: 1.234 ± 0.695
GluAsp: 4.937 ± 1.063
GluGlu: 9.256 ± 2.246
GluPhe: 4.937 ± 1.442
GluGly: 2.468 ± 0.815
GluHis: 2.777 ± 1.101
GluIle: 9.256 ± 1.836
GluLys: 8.331 ± 0.986
GluLeu: 11.416 ± 2.631
GluMet: 1.851 ± 0.72
GluAsn: 4.628 ± 1.03
GluPro: 1.234 ± 0.555
GluGln: 4.32 ± 1.271
GluArg: 4.32 ± 1.507
GluSer: 2.468 ± 0.54
GluThr: 4.628 ± 0.944
GluVal: 3.085 ± 0.945
GluTrp: 0.309 ± 0.293
GluTyr: 2.468 ± 0.579
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.468 ± 0.907
PheCys: 0.617 ± 0.401
PheAsp: 2.777 ± 0.861
PheGlu: 5.862 ± 1.48
PhePhe: 1.234 ± 0.542
PheGly: 1.851 ± 0.937
PheHis: 0.617 ± 0.424
PheIle: 3.394 ± 0.8
PheLys: 4.628 ± 1.145
PheLeu: 3.703 ± 0.832
PheMet: 1.543 ± 0.646
PheAsn: 1.851 ± 0.561
PhePro: 1.851 ± 0.714
PheGln: 2.777 ± 0.625
PheArg: 1.543 ± 0.857
PheSer: 2.16 ± 0.743
PheThr: 1.234 ± 0.663
PheVal: 0.926 ± 0.382
PheTrp: 0.309 ± 0.234
PheTyr: 1.234 ± 0.72
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.468 ± 0.684
GlyCys: 0.0 ± 0.0
GlyAsp: 2.468 ± 0.979
GlyGlu: 2.16 ± 0.96
GlyPhe: 1.851 ± 0.608
GlyGly: 1.851 ± 0.693
GlyHis: 1.234 ± 0.545
GlyIle: 2.777 ± 0.602
GlyLys: 4.32 ± 0.931
GlyLeu: 5.245 ± 1.743
GlyMet: 1.234 ± 0.453
GlyAsn: 3.085 ± 0.824
GlyPro: 0.309 ± 0.234
GlyGln: 1.851 ± 0.737
GlyArg: 2.468 ± 0.859
GlySer: 2.468 ± 0.697
GlyThr: 2.16 ± 0.943
GlyVal: 1.851 ± 0.72
GlyTrp: 0.926 ± 0.571
GlyTyr: 2.777 ± 0.854
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.543 ± 0.732
HisCys: 0.309 ± 0.293
HisAsp: 0.617 ± 0.359
HisGlu: 0.926 ± 0.518
HisPhe: 0.926 ± 0.366
HisGly: 1.234 ± 0.807
HisHis: 0.926 ± 0.477
HisIle: 1.543 ± 0.627
HisLys: 0.617 ± 0.461
HisLeu: 1.851 ± 0.911
HisMet: 0.0 ± 0.0
HisAsn: 2.16 ± 0.755
HisPro: 0.617 ± 0.41
HisGln: 0.926 ± 0.51
HisArg: 0.309 ± 0.293
HisSer: 1.234 ± 0.655
HisThr: 2.16 ± 0.779
HisVal: 0.617 ± 0.41
HisTrp: 0.0 ± 0.0
HisTyr: 1.543 ± 0.792
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.777 ± 0.827
IleCys: 0.309 ± 0.299
IleAsp: 4.011 ± 0.972
IleGlu: 7.097 ± 1.797
IlePhe: 3.085 ± 0.943
IleGly: 3.085 ± 1.024
IleHis: 1.543 ± 0.625
IleIle: 2.777 ± 0.801
IleLys: 9.565 ± 1.779
IleLeu: 8.022 ± 1.244
IleMet: 1.543 ± 0.539
IleAsn: 5.862 ± 1.275
IlePro: 2.777 ± 0.997
IleGln: 4.628 ± 1.463
IleArg: 2.777 ± 0.774
IleSer: 3.085 ± 0.765
IleThr: 5.245 ± 0.689
IleVal: 3.394 ± 0.782
IleTrp: 0.309 ± 0.234
IleTyr: 2.777 ± 1.181
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.097 ± 1.176
LysCys: 0.617 ± 0.398
LysAsp: 6.171 ± 1.296
LysGlu: 8.022 ± 2.057
LysPhe: 3.394 ± 1.006
LysGly: 4.32 ± 1.274
LysHis: 1.851 ± 0.602
LysIle: 6.788 ± 1.435
LysLys: 8.022 ± 1.379
LysLeu: 8.639 ± 1.308
LysMet: 1.234 ± 0.763
LysAsn: 8.331 ± 1.881
LysPro: 3.085 ± 0.749
LysGln: 7.097 ± 1.671
LysArg: 4.32 ± 1.226
LysSer: 4.937 ± 1.04
LysThr: 4.32 ± 1.085
LysVal: 3.394 ± 0.999
LysTrp: 0.309 ± 0.293
LysTyr: 2.777 ± 0.892
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.245 ± 1.562
LeuCys: 0.0 ± 0.0
LeuAsp: 7.714 ± 1.233
LeuGlu: 15.119 ± 2.631
LeuPhe: 3.703 ± 0.921
LeuGly: 4.628 ± 1.069
LeuHis: 1.234 ± 0.571
LeuIle: 6.788 ± 1.837
LeuLys: 7.714 ± 0.957
LeuLeu: 12.65 ± 1.909
LeuMet: 2.16 ± 0.718
LeuAsn: 6.788 ± 1.652
LeuPro: 3.085 ± 1.417
LeuGln: 3.085 ± 1.148
LeuArg: 5.554 ± 1.368
LeuSer: 8.331 ± 1.374
LeuThr: 4.937 ± 1.182
LeuVal: 4.011 ± 1.017
LeuTrp: 0.617 ± 0.424
LeuTyr: 3.085 ± 0.898
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.16 ± 0.875
MetCys: 0.0 ± 0.0
MetAsp: 1.234 ± 0.543
MetGlu: 1.851 ± 0.804
MetPhe: 1.543 ± 0.588
MetGly: 0.309 ± 0.299
MetHis: 0.309 ± 0.234
MetIle: 0.617 ± 0.38
MetLys: 3.394 ± 0.895
MetLeu: 1.543 ± 0.667
MetMet: 0.0 ± 0.0
MetAsn: 1.851 ± 0.696
MetPro: 0.617 ± 0.418
MetGln: 0.926 ± 0.45
MetArg: 2.468 ± 0.624
MetSer: 0.0 ± 0.0
MetThr: 2.468 ± 0.793
MetVal: 0.309 ± 0.293
MetTrp: 0.0 ± 0.0
MetTyr: 1.234 ± 0.657
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.085 ± 1.019
AsnCys: 0.309 ± 0.299
AsnAsp: 2.777 ± 1.059
AsnGlu: 1.851 ± 0.867
AsnPhe: 3.085 ± 0.742
AsnGly: 3.394 ± 0.735
AsnHis: 1.234 ± 0.658
AsnIle: 4.937 ± 1.066
AsnLys: 4.32 ± 0.849
AsnLeu: 7.097 ± 1.472
AsnMet: 3.085 ± 1.184
AsnAsn: 3.394 ± 1.184
AsnPro: 4.628 ± 1.207
AsnGln: 2.16 ± 0.851
AsnArg: 1.851 ± 0.603
AsnSer: 3.703 ± 1.064
AsnThr: 4.32 ± 0.998
AsnVal: 2.468 ± 0.884
AsnTrp: 1.234 ± 0.56
AsnTyr: 3.394 ± 1.051
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.309 ± 0.296
ProCys: 0.0 ± 0.0
ProAsp: 2.468 ± 0.721
ProGlu: 1.851 ± 0.693
ProPhe: 2.468 ± 0.796
ProGly: 1.234 ± 0.616
ProHis: 0.617 ± 0.359
ProIle: 1.234 ± 0.749
ProLys: 5.554 ± 1.688
ProLeu: 1.234 ± 0.606
ProMet: 0.617 ± 0.316
ProAsn: 1.851 ± 0.589
ProPro: 1.234 ± 0.559
ProGln: 1.234 ± 0.733
ProArg: 1.851 ± 0.776
ProSer: 1.851 ± 0.584
ProThr: 1.234 ± 0.618
ProVal: 1.543 ± 0.634
ProTrp: 0.309 ± 0.299
ProTyr: 1.234 ± 0.527
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.703 ± 1.23
GlnCys: 0.0 ± 0.0
GlnAsp: 1.234 ± 0.471
GlnGlu: 3.703 ± 0.831
GlnPhe: 1.543 ± 0.614
GlnGly: 1.851 ± 0.579
GlnHis: 1.234 ± 0.644
GlnIle: 3.394 ± 1.011
GlnLys: 5.245 ± 2.079
GlnLeu: 4.011 ± 0.791
GlnMet: 1.543 ± 0.65
GlnAsn: 2.468 ± 0.769
GlnPro: 1.543 ± 0.848
GlnGln: 2.16 ± 0.627
GlnArg: 1.234 ± 0.663
GlnSer: 3.703 ± 1.37
GlnThr: 2.777 ± 0.95
GlnVal: 1.543 ± 0.618
GlnTrp: 0.617 ± 0.439
GlnTyr: 2.777 ± 0.827
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.926 ± 0.642
ArgCys: 0.0 ± 0.0
ArgAsp: 2.16 ± 0.525
ArgGlu: 4.937 ± 1.324
ArgPhe: 1.543 ± 0.538
ArgGly: 1.234 ± 0.692
ArgHis: 0.926 ± 0.453
ArgIle: 3.703 ± 1.177
ArgLys: 5.245 ± 1.487
ArgLeu: 4.937 ± 1.232
ArgMet: 2.16 ± 0.653
ArgAsn: 3.703 ± 0.764
ArgPro: 1.543 ± 0.725
ArgGln: 1.543 ± 0.724
ArgArg: 2.468 ± 1.032
ArgSer: 2.468 ± 0.851
ArgThr: 1.543 ± 0.754
ArgVal: 4.628 ± 1.266
ArgTrp: 1.234 ± 0.554
ArgTyr: 2.468 ± 0.721
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.777 ± 1.005
SerCys: 0.309 ± 0.279
SerAsp: 4.011 ± 0.699
SerGlu: 5.862 ± 1.036
SerPhe: 2.468 ± 0.958
SerGly: 3.085 ± 0.863
SerHis: 1.543 ± 0.499
SerIle: 5.554 ± 1.187
SerLys: 3.703 ± 0.775
SerLeu: 5.862 ± 0.983
SerMet: 0.617 ± 0.414
SerAsn: 4.011 ± 1.15
SerPro: 1.543 ± 0.63
SerGln: 2.777 ± 0.93
SerArg: 1.543 ± 0.423
SerSer: 3.703 ± 1.344
SerThr: 3.085 ± 0.749
SerVal: 3.085 ± 0.697
SerTrp: 0.309 ± 0.293
SerTyr: 3.085 ± 0.915
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.468 ± 0.938
ThrCys: 0.0 ± 0.0
ThrAsp: 2.468 ± 0.758
ThrGlu: 3.703 ± 1.356
ThrPhe: 1.234 ± 0.585
ThrGly: 3.085 ± 1.024
ThrHis: 0.926 ± 0.509
ThrIle: 4.32 ± 0.885
ThrLys: 4.011 ± 1.207
ThrLeu: 6.171 ± 1.369
ThrMet: 1.234 ± 0.512
ThrAsn: 4.011 ± 1.168
ThrPro: 1.234 ± 0.435
ThrGln: 1.851 ± 0.9
ThrArg: 5.245 ± 1.385
ThrSer: 3.394 ± 1.07
ThrThr: 2.777 ± 0.903
ThrVal: 3.085 ± 1.021
ThrTrp: 0.617 ± 0.415
ThrTyr: 2.777 ± 0.6
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.32 ± 1.033
ValCys: 0.0 ± 0.0
ValAsp: 2.16 ± 0.523
ValGlu: 2.777 ± 0.837
ValPhe: 1.234 ± 0.658
ValGly: 2.16 ± 0.783
ValHis: 0.617 ± 0.359
ValIle: 2.777 ± 0.989
ValLys: 3.394 ± 0.806
ValLeu: 4.32 ± 1.303
ValMet: 0.617 ± 0.458
ValAsn: 2.16 ± 0.845
ValPro: 1.543 ± 0.619
ValGln: 1.543 ± 0.539
ValArg: 2.468 ± 0.664
ValSer: 2.777 ± 0.715
ValThr: 2.777 ± 0.955
ValVal: 1.543 ± 0.76
ValTrp: 0.0 ± 0.0
ValTyr: 3.085 ± 0.833
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.309 ± 0.305
TrpGlu: 1.851 ± 0.613
TrpPhe: 0.0 ± 0.0
TrpGly: 0.309 ± 0.305
TrpHis: 0.309 ± 0.293
TrpIle: 0.309 ± 0.234
TrpLys: 0.926 ± 0.468
TrpLeu: 1.234 ± 0.519
TrpMet: 0.0 ± 0.0
TrpAsn: 0.309 ± 0.279
TrpPro: 0.0 ± 0.0
TrpGln: 0.309 ± 0.279
TrpArg: 0.0 ± 0.0
TrpSer: 0.926 ± 0.473
TrpThr: 0.926 ± 0.535
TrpVal: 0.309 ± 0.315
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.234 ± 0.618
TyrCys: 0.0 ± 0.0
TyrAsp: 2.468 ± 0.96
TyrGlu: 2.16 ± 0.909
TyrPhe: 1.851 ± 0.784
TyrGly: 2.468 ± 0.847
TyrHis: 0.926 ± 0.593
TyrIle: 3.085 ± 0.876
TyrLys: 5.554 ± 0.978
TyrLeu: 7.714 ± 1.396
TyrMet: 0.926 ± 0.477
TyrAsn: 2.468 ± 0.605
TyrPro: 0.309 ± 0.296
TyrGln: 2.777 ± 0.962
TyrArg: 1.851 ± 0.602
TyrSer: 3.394 ± 0.774
TyrThr: 0.926 ± 0.688
TyrVal: 0.926 ± 0.413
TyrTrp: 0.0 ± 0.0
TyrTyr: 3.085 ± 1.168
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (3242 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
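One way to picture protein-level bootstrapping is the sketch below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is an illustration only. The function names, the toy sequences, and the use of the sample standard deviation as the error measure are assumptions; the source does not state which statistic the database reports.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement; return the spread of estimates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: the error is taken as the sample standard deviation.
    return statistics.stdev(estimates)


toy_proteins = ["MKLLEE", "MLEKK", "LELELE"]  # hypothetical sequences
err = bootstrap_error(toy_proteins, "LE")
print(err)
```

Resampling at the protein level, rather than at the level of individual residues, keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the set.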

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski