Amino acid dipepetide frequency for Streptococcus satellite phage Javan256

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.242AlaAla: 4.242 ± 1.031
1.212AlaCys: 1.212 ± 0.623
3.636AlaAsp: 3.636 ± 1.23
5.152AlaGlu: 5.152 ± 1.153
3.03AlaPhe: 3.03 ± 0.997
4.545AlaGly: 4.545 ± 0.676
0.303AlaHis: 0.303 ± 0.27
6.061AlaIle: 6.061 ± 1.329
4.848AlaLys: 4.848 ± 1.274
4.848AlaLeu: 4.848 ± 0.969
0.909AlaMet: 0.909 ± 0.476
3.03AlaAsn: 3.03 ± 0.85
0.606AlaPro: 0.606 ± 0.459
2.424AlaGln: 2.424 ± 0.822
1.212AlaArg: 1.212 ± 0.622
3.636AlaSer: 3.636 ± 0.849
3.939AlaThr: 3.939 ± 1.364
3.333AlaVal: 3.333 ± 0.959
0.606AlaTrp: 0.606 ± 0.54
3.636AlaTyr: 3.636 ± 0.819
0.0AlaXaa: 0.0 ± 0.0
Cys
0.606CysAla: 0.606 ± 0.33
0.0CysCys: 0.0 ± 0.0
0.303CysAsp: 0.303 ± 0.244
0.606CysGlu: 0.606 ± 0.367
0.303CysPhe: 0.303 ± 0.244
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.909CysLeu: 0.909 ± 0.733
0.0CysMet: 0.0 ± 0.0
0.303CysAsn: 0.303 ± 0.287
1.212CysPro: 1.212 ± 0.666
0.303CysGln: 0.303 ± 0.298
0.303CysArg: 0.303 ± 0.287
0.303CysSer: 0.303 ± 0.335
0.606CysThr: 0.606 ± 0.33
0.303CysVal: 0.303 ± 0.25
0.303CysTrp: 0.303 ± 0.344
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.818AspAla: 1.818 ± 0.778
0.0AspCys: 0.0 ± 0.0
2.121AspAsp: 2.121 ± 0.677
5.758AspGlu: 5.758 ± 1.474
1.818AspPhe: 1.818 ± 0.694
1.818AspGly: 1.818 ± 0.636
0.303AspHis: 0.303 ± 0.244
5.758AspIle: 5.758 ± 1.627
7.879AspLys: 7.879 ± 1.522
4.545AspLeu: 4.545 ± 1.145
1.212AspMet: 1.212 ± 0.662
2.121AspAsn: 2.121 ± 0.985
1.212AspPro: 1.212 ± 0.639
2.121AspGln: 2.121 ± 0.799
2.424AspArg: 2.424 ± 0.692
3.939AspSer: 3.939 ± 1.104
1.212AspThr: 1.212 ± 0.577
3.03AspVal: 3.03 ± 1.233
0.303AspTrp: 0.303 ± 0.25
3.939AspTyr: 3.939 ± 0.705
0.0AspXaa: 0.0 ± 0.0
Glu
3.333GluAla: 3.333 ± 0.981
0.0GluCys: 0.0 ± 0.0
3.939GluAsp: 3.939 ± 0.835
6.364GluGlu: 6.364 ± 1.511
2.727GluPhe: 2.727 ± 1.04
1.212GluGly: 1.212 ± 0.526
2.424GluHis: 2.424 ± 0.842
5.455GluIle: 5.455 ± 1.491
6.667GluLys: 6.667 ± 1.314
9.394GluLeu: 9.394 ± 2.49
1.818GluMet: 1.818 ± 0.676
5.152GluAsn: 5.152 ± 1.275
2.424GluPro: 2.424 ± 0.82
8.485GluGln: 8.485 ± 1.703
4.545GluArg: 4.545 ± 1.0
2.424GluSer: 2.424 ± 0.612
4.848GluThr: 4.848 ± 1.382
4.848GluVal: 4.848 ± 1.341
0.606GluTrp: 0.606 ± 0.454
2.424GluTyr: 2.424 ± 0.88
0.0GluXaa: 0.0 ± 0.0
Phe
2.424PheAla: 2.424 ± 0.792
0.0PheCys: 0.0 ± 0.0
1.818PheAsp: 1.818 ± 0.858
3.939PheGlu: 3.939 ± 1.077
1.515PhePhe: 1.515 ± 0.864
2.727PheGly: 2.727 ± 0.626
1.515PheHis: 1.515 ± 0.733
3.03PheIle: 3.03 ± 1.024
3.636PheLys: 3.636 ± 1.134
3.03PheLeu: 3.03 ± 1.1
1.515PheMet: 1.515 ± 0.756
1.212PheAsn: 1.212 ± 0.507
0.606PhePro: 0.606 ± 0.489
0.909PheGln: 0.909 ± 0.485
1.212PheArg: 1.212 ± 0.513
3.333PheSer: 3.333 ± 0.765
2.424PheThr: 2.424 ± 1.14
0.606PheVal: 0.606 ± 0.41
0.303PheTrp: 0.303 ± 0.302
0.909PheTyr: 0.909 ± 0.733
0.0PheXaa: 0.0 ± 0.0
Gly
2.121GlyAla: 2.121 ± 0.655
0.606GlyCys: 0.606 ± 0.403
0.909GlyAsp: 0.909 ± 0.527
2.424GlyGlu: 2.424 ± 0.987
1.818GlyPhe: 1.818 ± 0.667
1.212GlyGly: 1.212 ± 0.652
1.818GlyHis: 1.818 ± 0.783
3.636GlyIle: 3.636 ± 1.168
4.545GlyLys: 4.545 ± 1.191
4.848GlyLeu: 4.848 ± 1.028
1.515GlyMet: 1.515 ± 0.543
3.03GlyAsn: 3.03 ± 0.996
0.0GlyPro: 0.0 ± 0.0
3.03GlyGln: 3.03 ± 0.905
3.333GlyArg: 3.333 ± 0.835
0.606GlySer: 0.606 ± 0.441
2.424GlyThr: 2.424 ± 0.727
3.939GlyVal: 3.939 ± 1.244
0.909GlyTrp: 0.909 ± 0.541
3.333GlyTyr: 3.333 ± 0.76
0.0GlyXaa: 0.0 ± 0.0
His
0.909HisAla: 0.909 ± 0.47
0.0HisCys: 0.0 ± 0.0
1.515HisAsp: 1.515 ± 0.473
1.818HisGlu: 1.818 ± 0.787
0.909HisPhe: 0.909 ± 0.5
1.515HisGly: 1.515 ± 0.427
0.606HisHis: 0.606 ± 0.4
1.515HisIle: 1.515 ± 0.759
1.818HisLys: 1.818 ± 0.969
2.121HisLeu: 2.121 ± 0.678
0.303HisMet: 0.303 ± 0.27
1.515HisAsn: 1.515 ± 0.598
0.606HisPro: 0.606 ± 0.351
1.212HisGln: 1.212 ± 0.475
0.909HisArg: 0.909 ± 0.425
0.303HisSer: 0.303 ± 0.287
2.121HisThr: 2.121 ± 1.062
1.212HisVal: 1.212 ± 0.514
0.0HisTrp: 0.0 ± 0.0
0.909HisTyr: 0.909 ± 0.52
0.0HisXaa: 0.0 ± 0.0
Ile
3.636IleAla: 3.636 ± 1.191
0.303IleCys: 0.303 ± 0.244
4.545IleAsp: 4.545 ± 0.917
4.848IleGlu: 4.848 ± 1.6
2.121IlePhe: 2.121 ± 0.685
3.333IleGly: 3.333 ± 1.284
1.212IleHis: 1.212 ± 0.705
4.545IleIle: 4.545 ± 1.555
6.364IleLys: 6.364 ± 1.219
6.061IleLeu: 6.061 ± 1.716
1.212IleMet: 1.212 ± 0.665
2.424IleAsn: 2.424 ± 1.138
2.121IlePro: 2.121 ± 0.629
3.333IleGln: 3.333 ± 0.817
3.333IleArg: 3.333 ± 1.157
3.636IleSer: 3.636 ± 1.084
5.152IleThr: 5.152 ± 1.287
2.424IleVal: 2.424 ± 0.917
0.303IleTrp: 0.303 ± 0.244
4.242IleTyr: 4.242 ± 1.069
0.0IleXaa: 0.0 ± 0.0
Lys
7.273LysAla: 7.273 ± 1.363
0.606LysCys: 0.606 ± 0.412
4.242LysAsp: 4.242 ± 1.007
9.091LysGlu: 9.091 ± 1.277
1.818LysPhe: 1.818 ± 0.783
4.242LysGly: 4.242 ± 1.025
1.515LysHis: 1.515 ± 0.693
4.848LysIle: 4.848 ± 0.89
10.606LysLys: 10.606 ± 1.608
8.788LysLeu: 8.788 ± 1.562
2.121LysMet: 2.121 ± 0.772
6.364LysAsn: 6.364 ± 2.015
2.424LysPro: 2.424 ± 0.853
5.455LysGln: 5.455 ± 0.892
3.939LysArg: 3.939 ± 1.173
3.939LysSer: 3.939 ± 1.169
6.061LysThr: 6.061 ± 1.366
6.364LysVal: 6.364 ± 1.199
1.515LysTrp: 1.515 ± 0.641
2.727LysTyr: 2.727 ± 0.956
0.0LysXaa: 0.0 ± 0.0
Leu
8.182LeuAla: 8.182 ± 1.742
0.909LeuCys: 0.909 ± 0.528
8.182LeuAsp: 8.182 ± 1.68
6.97LeuGlu: 6.97 ± 1.317
4.545LeuPhe: 4.545 ± 1.029
6.061LeuGly: 6.061 ± 1.603
1.212LeuHis: 1.212 ± 0.651
2.424LeuIle: 2.424 ± 0.928
6.97LeuLys: 6.97 ± 1.272
8.182LeuLeu: 8.182 ± 1.228
2.121LeuMet: 2.121 ± 0.861
6.364LeuAsn: 6.364 ± 0.99
2.424LeuPro: 2.424 ± 0.886
4.242LeuGln: 4.242 ± 1.017
4.848LeuArg: 4.848 ± 1.247
5.152LeuSer: 5.152 ± 0.961
6.667LeuThr: 6.667 ± 0.98
5.455LeuVal: 5.455 ± 1.319
0.909LeuTrp: 0.909 ± 0.483
3.939LeuTyr: 3.939 ± 1.071
0.0LeuXaa: 0.0 ± 0.0
Met
1.212MetAla: 1.212 ± 0.848
0.0MetCys: 0.0 ± 0.0
1.515MetAsp: 1.515 ± 0.852
1.818MetGlu: 1.818 ± 0.585
0.606MetPhe: 0.606 ± 0.37
0.909MetGly: 0.909 ± 0.517
0.606MetHis: 0.606 ± 0.384
0.909MetIle: 0.909 ± 0.637
2.121MetLys: 2.121 ± 0.549
3.636MetLeu: 3.636 ± 0.923
1.818MetMet: 1.818 ± 0.703
1.818MetAsn: 1.818 ± 0.878
0.606MetPro: 0.606 ± 0.384
0.303MetGln: 0.303 ± 0.25
1.818MetArg: 1.818 ± 0.76
1.212MetSer: 1.212 ± 0.639
2.424MetThr: 2.424 ± 0.888
0.606MetVal: 0.606 ± 0.494
0.0MetTrp: 0.0 ± 0.0
0.303MetTyr: 0.303 ± 0.335
0.0MetXaa: 0.0 ± 0.0
Asn
3.939AsnAla: 3.939 ± 1.217
0.0AsnCys: 0.0 ± 0.0
1.818AsnAsp: 1.818 ± 0.629
3.03AsnGlu: 3.03 ± 1.147
1.515AsnPhe: 1.515 ± 0.651
2.727AsnGly: 2.727 ± 1.008
1.212AsnHis: 1.212 ± 0.453
2.121AsnIle: 2.121 ± 0.816
4.545AsnLys: 4.545 ± 1.492
4.545AsnLeu: 4.545 ± 0.62
1.515AsnMet: 1.515 ± 0.63
2.424AsnAsn: 2.424 ± 1.146
3.03AsnPro: 3.03 ± 1.036
3.333AsnGln: 3.333 ± 0.709
2.727AsnArg: 2.727 ± 0.625
1.818AsnSer: 1.818 ± 0.745
2.727AsnThr: 2.727 ± 0.775
2.424AsnVal: 2.424 ± 0.67
1.818AsnTrp: 1.818 ± 0.545
3.03AsnTyr: 3.03 ± 1.005
0.0AsnXaa: 0.0 ± 0.0
Pro
1.515ProAla: 1.515 ± 0.686
0.0ProCys: 0.0 ± 0.0
1.515ProAsp: 1.515 ± 0.588
1.818ProGlu: 1.818 ± 0.838
1.818ProPhe: 1.818 ± 0.64
0.303ProGly: 0.303 ± 0.25
0.606ProHis: 0.606 ± 0.326
3.03ProIle: 3.03 ± 1.202
4.242ProLys: 4.242 ± 0.819
1.818ProLeu: 1.818 ± 0.823
0.303ProMet: 0.303 ± 0.27
2.121ProAsn: 2.121 ± 0.623
0.303ProPro: 0.303 ± 0.302
0.303ProGln: 0.303 ± 0.364
1.515ProArg: 1.515 ± 0.495
2.727ProSer: 2.727 ± 0.784
2.424ProThr: 2.424 ± 1.048
1.818ProVal: 1.818 ± 0.696
0.0ProTrp: 0.0 ± 0.0
1.212ProTyr: 1.212 ± 0.649
0.0ProXaa: 0.0 ± 0.0
Gln
4.242GlnAla: 4.242 ± 0.734
0.606GlnCys: 0.606 ± 0.416
2.121GlnAsp: 2.121 ± 0.783
4.848GlnGlu: 4.848 ± 1.038
1.818GlnPhe: 1.818 ± 0.714
1.818GlnGly: 1.818 ± 0.731
1.212GlnHis: 1.212 ± 0.551
2.424GlnIle: 2.424 ± 0.954
3.939GlnLys: 3.939 ± 1.123
5.455GlnLeu: 5.455 ± 1.615
1.515GlnMet: 1.515 ± 0.841
0.909GlnAsn: 0.909 ± 0.533
1.212GlnPro: 1.212 ± 0.697
1.212GlnGln: 1.212 ± 0.448
1.818GlnArg: 1.818 ± 0.809
4.848GlnSer: 4.848 ± 0.91
4.242GlnThr: 4.242 ± 1.088
3.03GlnVal: 3.03 ± 0.884
0.303GlnTrp: 0.303 ± 0.343
2.121GlnTyr: 2.121 ± 0.868
0.0GlnXaa: 0.0 ± 0.0
Arg
3.636ArgAla: 3.636 ± 0.781
0.0ArgCys: 0.0 ± 0.0
3.636ArgAsp: 3.636 ± 0.542
3.03ArgGlu: 3.03 ± 1.132
0.303ArgPhe: 0.303 ± 0.244
1.212ArgGly: 1.212 ± 0.546
1.818ArgHis: 1.818 ± 0.545
4.545ArgIle: 4.545 ± 1.538
3.03ArgLys: 3.03 ± 1.097
3.939ArgLeu: 3.939 ± 0.797
0.303ArgMet: 0.303 ± 0.322
0.909ArgAsn: 0.909 ± 0.551
2.424ArgPro: 2.424 ± 1.005
3.636ArgGln: 3.636 ± 1.278
3.03ArgArg: 3.03 ± 0.794
2.727ArgSer: 2.727 ± 0.829
2.121ArgThr: 2.121 ± 0.842
3.03ArgVal: 3.03 ± 0.841
1.212ArgTrp: 1.212 ± 0.576
3.03ArgTyr: 3.03 ± 0.824
0.0ArgXaa: 0.0 ± 0.0
Ser
1.818SerAla: 1.818 ± 0.661
0.606SerCys: 0.606 ± 0.326
3.03SerAsp: 3.03 ± 0.864
4.242SerGlu: 4.242 ± 1.063
2.121SerPhe: 2.121 ± 0.763
3.03SerGly: 3.03 ± 1.098
0.606SerHis: 0.606 ± 0.404
2.727SerIle: 2.727 ± 1.026
6.364SerLys: 6.364 ± 1.744
6.667SerLeu: 6.667 ± 1.044
1.818SerMet: 1.818 ± 0.861
2.424SerAsn: 2.424 ± 0.683
2.424SerPro: 2.424 ± 1.065
1.818SerGln: 1.818 ± 0.917
2.424SerArg: 2.424 ± 0.884
2.727SerSer: 2.727 ± 0.735
2.727SerThr: 2.727 ± 0.928
4.848SerVal: 4.848 ± 1.205
0.606SerTrp: 0.606 ± 0.397
3.333SerTyr: 3.333 ± 0.63
0.0SerXaa: 0.0 ± 0.0
Thr
3.333ThrAla: 3.333 ± 0.953
0.606ThrCys: 0.606 ± 0.489
3.03ThrAsp: 3.03 ± 0.613
5.758ThrGlu: 5.758 ± 1.256
1.212ThrPhe: 1.212 ± 0.654
3.636ThrGly: 3.636 ± 0.989
1.818ThrHis: 1.818 ± 0.651
4.545ThrIle: 4.545 ± 1.138
4.242ThrLys: 4.242 ± 0.821
6.364ThrLeu: 6.364 ± 1.23
0.909ThrMet: 0.909 ± 0.48
3.03ThrAsn: 3.03 ± 1.087
3.03ThrPro: 3.03 ± 0.94
2.121ThrGln: 2.121 ± 0.751
3.03ThrArg: 3.03 ± 0.992
3.939ThrSer: 3.939 ± 1.433
4.848ThrThr: 4.848 ± 1.438
5.758ThrVal: 5.758 ± 1.122
0.606ThrTrp: 0.606 ± 0.421
2.727ThrTyr: 2.727 ± 1.091
0.0ThrXaa: 0.0 ± 0.0
Val
2.424ValAla: 2.424 ± 0.807
0.606ValCys: 0.606 ± 0.5
2.424ValAsp: 2.424 ± 0.579
4.848ValGlu: 4.848 ± 1.749
3.03ValPhe: 3.03 ± 1.076
2.727ValGly: 2.727 ± 0.794
1.212ValHis: 1.212 ± 0.419
3.939ValIle: 3.939 ± 1.156
5.152ValLys: 5.152 ± 1.212
3.636ValLeu: 3.636 ± 0.765
1.818ValMet: 1.818 ± 0.907
3.636ValAsn: 3.636 ± 1.178
2.121ValPro: 2.121 ± 0.526
1.212ValGln: 1.212 ± 0.664
3.03ValArg: 3.03 ± 0.945
4.545ValSer: 4.545 ± 1.189
3.636ValThr: 3.636 ± 0.769
2.727ValVal: 2.727 ± 0.786
0.303ValTrp: 0.303 ± 0.287
4.242ValTyr: 4.242 ± 1.259
0.0ValXaa: 0.0 ± 0.0
Trp
0.909TrpAla: 0.909 ± 0.423
0.0TrpCys: 0.0 ± 0.0
0.303TrpAsp: 0.303 ± 0.287
0.606TrpGlu: 0.606 ± 0.445
0.909TrpPhe: 0.909 ± 0.552
0.303TrpGly: 0.303 ± 0.343
0.606TrpHis: 0.606 ± 0.373
0.606TrpIle: 0.606 ± 0.441
0.909TrpLys: 0.909 ± 0.536
2.727TrpLeu: 2.727 ± 1.408
0.303TrpMet: 0.303 ± 0.244
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.515TrpGln: 1.515 ± 0.537
0.606TrpArg: 0.606 ± 0.528
0.606TrpSer: 0.606 ± 0.397
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.303TrpTrp: 0.303 ± 0.287
0.606TrpTyr: 0.606 ± 0.377
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.636TyrAla: 3.636 ± 1.153
0.303TyrCys: 0.303 ± 0.302
2.727TyrAsp: 2.727 ± 0.953
2.121TyrGlu: 2.121 ± 0.736
2.727TyrPhe: 2.727 ± 1.164
2.727TyrGly: 2.727 ± 0.815
1.212TyrHis: 1.212 ± 0.598
3.03TyrIle: 3.03 ± 1.348
6.061TyrLys: 6.061 ± 1.08
4.545TyrLeu: 4.545 ± 1.003
0.909TyrMet: 0.909 ± 0.418
1.515TyrAsn: 1.515 ± 0.724
0.606TyrPro: 0.606 ± 0.41
2.727TyrGln: 2.727 ± 0.648
1.515TyrArg: 1.515 ± 0.809
3.939TyrSer: 3.939 ± 1.143
3.939TyrThr: 3.939 ± 1.137
1.515TyrVal: 1.515 ± 0.488
0.909TyrTrp: 0.909 ± 0.585
2.727TyrTyr: 2.727 ± 0.934
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (3301 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski