Amino acid dipeptide frequency for Streptococcus satellite phage Javan759

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
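As a rough illustration of the calculation, the following minimal sketch counts overlapping pairs of adjacent residues within each protein and normalises by the total number of pairs. The function names, the one-letter input alphabet, the mapping of non-standard residues to X (Xaa), and the choice of denominator are all assumptions made for this example; the page does not state exactly how its values are normalised. (The smallest non-zero value in the table, 0.211, is close to 1000/4743, so the database may divide by the number of amino acids rather than by the number of pairs.)

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs, never spanning two proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    return counts


def dipeptide_frequencies(proteins):
    """Return each observed dipeptide's share of all counted pairs (as a fraction)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}
```

For example, `dipeptide_frequencies(["AAC", "CA"])` counts the pairs AA, AC and CA once each, so each gets a share of one third.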

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
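The uniform baseline and the per mille scaling can be checked with a short sketch. A frequency of 0.25% corresponds to 2.5‰, which is the value to compare the table entries against. The function names are hypothetical, and the simple greater-than or less-than comparison is an assumption; the exact rule the page uses for its red and blue marking is not stated.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def per_mille_to_fraction(value_per_mille):
    """Convert a table value in per mille to a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"
```

With the values from the table, `classify(11.177)` (GluLeu) gives "over" and `classify(0.211)` (AlaAla) gives "under". Passing `alphabet_size=21` uses the 441-dipeptide baseline that includes Xaa.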

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.211 ± 0.242
AlaCys: 0.422 ± 0.386
AlaAsp: 3.585 ± 0.766
AlaGlu: 6.326 ± 1.133
AlaPhe: 3.585 ± 1.012
AlaGly: 2.32 ± 0.699
AlaHis: 0.633 ± 0.333
AlaIle: 4.007 ± 0.792
AlaLys: 5.905 ± 0.931
AlaLeu: 6.959 ± 1.01
AlaMet: 1.265 ± 0.552
AlaAsn: 2.952 ± 0.773
AlaPro: 0.844 ± 0.387
AlaGln: 2.109 ± 0.629
AlaArg: 4.218 ± 0.826
AlaSer: 1.687 ± 0.576
AlaThr: 4.218 ± 0.968
AlaVal: 4.218 ± 0.859
AlaTrp: 0.422 ± 0.293
AlaTyr: 2.109 ± 0.723
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.633 ± 0.302
CysCys: 0.0 ± 0.0
CysAsp: 0.422 ± 0.294
CysGlu: 0.0 ± 0.0
CysPhe: 0.422 ± 0.276
CysGly: 0.844 ± 0.56
CysHis: 0.422 ± 0.241
CysIle: 0.633 ± 0.384
CysLys: 0.633 ± 0.374
CysLeu: 1.054 ± 0.404
CysMet: 0.422 ± 0.342
CysAsn: 0.422 ± 0.272
CysPro: 0.844 ± 0.54
CysGln: 0.211 ± 0.192
CysArg: 0.211 ± 0.21
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.211 ± 0.193
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.054 ± 0.418
AspCys: 0.844 ± 0.384
AspAsp: 3.585 ± 0.668
AspGlu: 5.483 ± 1.128
AspPhe: 3.796 ± 0.618
AspGly: 3.585 ± 0.698
AspHis: 0.422 ± 0.289
AspIle: 5.483 ± 0.816
AspLys: 5.905 ± 1.0
AspLeu: 5.061 ± 0.776
AspMet: 1.687 ± 0.615
AspAsn: 2.741 ± 0.73
AspPro: 1.054 ± 0.534
AspGln: 1.054 ± 0.415
AspArg: 3.163 ± 0.849
AspSer: 2.32 ± 0.712
AspThr: 3.374 ± 0.713
AspVal: 2.32 ± 0.665
AspTrp: 0.422 ± 0.275
AspTyr: 3.796 ± 1.263
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.272 ± 1.121
GluCys: 0.844 ± 0.389
GluAsp: 3.585 ± 0.864
GluGlu: 6.116 ± 1.083
GluPhe: 2.952 ± 0.81
GluGly: 2.741 ± 0.629
GluHis: 1.265 ± 0.465
GluIle: 7.803 ± 1.084
GluLys: 6.326 ± 1.165
GluLeu: 11.177 ± 1.595
GluMet: 2.531 ± 0.691
GluAsn: 6.116 ± 1.163
GluPro: 1.898 ± 0.85
GluGln: 4.007 ± 0.985
GluArg: 5.694 ± 1.032
GluSer: 5.483 ± 1.157
GluThr: 4.639 ± 0.868
GluVal: 4.007 ± 1.087
GluTrp: 0.422 ± 0.241
GluTyr: 3.585 ± 0.694
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.844 ± 0.367
PheCys: 0.844 ± 0.591
PheAsp: 2.952 ± 0.608
PheGlu: 2.32 ± 0.79
PhePhe: 1.898 ± 0.942
PheGly: 1.687 ± 0.57
PheHis: 0.633 ± 0.336
PheIle: 3.163 ± 0.829
PheLys: 5.061 ± 1.098
PheLeu: 4.007 ± 0.84
PheMet: 0.422 ± 0.28
PheAsn: 2.32 ± 0.592
PhePro: 0.422 ± 0.352
PheGln: 1.898 ± 0.587
PheArg: 1.898 ± 0.716
PheSer: 4.218 ± 0.965
PheThr: 1.898 ± 0.463
PheVal: 1.265 ± 0.475
PheTrp: 1.054 ± 0.439
PheTyr: 2.32 ± 0.613
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.741 ± 0.971
GlyCys: 0.633 ± 0.353
GlyAsp: 2.32 ± 0.726
GlyGlu: 2.531 ± 0.808
GlyPhe: 2.109 ± 0.517
GlyGly: 1.687 ± 0.973
GlyHis: 0.844 ± 0.405
GlyIle: 3.585 ± 0.849
GlyLys: 5.061 ± 1.021
GlyLeu: 5.061 ± 1.149
GlyMet: 1.054 ± 0.394
GlyAsn: 2.741 ± 0.746
GlyPro: 0.422 ± 0.285
GlyGln: 2.531 ± 0.685
GlyArg: 2.531 ± 0.67
GlySer: 1.898 ± 0.591
GlyThr: 2.741 ± 0.852
GlyVal: 4.007 ± 0.827
GlyTrp: 0.633 ± 0.502
GlyTyr: 1.898 ± 0.594
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.687 ± 0.619
HisCys: 0.0 ± 0.0
HisAsp: 0.422 ± 0.283
HisGlu: 0.844 ± 0.398
HisPhe: 1.054 ± 0.431
HisGly: 1.687 ± 0.676
HisHis: 0.422 ± 0.259
HisIle: 0.844 ± 0.351
HisLys: 1.265 ± 0.443
HisLeu: 1.476 ± 0.598
HisMet: 0.211 ± 0.188
HisAsn: 0.844 ± 0.433
HisPro: 0.422 ± 0.285
HisGln: 0.633 ± 0.372
HisArg: 0.422 ± 0.317
HisSer: 0.633 ± 0.323
HisThr: 0.633 ± 0.397
HisVal: 0.211 ± 0.167
HisTrp: 0.0 ± 0.0
HisTyr: 0.633 ± 0.305
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.639 ± 1.247
IleCys: 0.422 ± 0.258
IleAsp: 4.639 ± 1.159
IleGlu: 6.537 ± 1.247
IlePhe: 2.741 ± 0.667
IleGly: 3.585 ± 0.794
IleHis: 0.633 ± 0.299
IleIle: 4.639 ± 0.965
IleLys: 7.381 ± 1.25
IleLeu: 5.061 ± 1.058
IleMet: 0.633 ± 0.334
IleAsn: 2.531 ± 0.71
IlePro: 3.163 ± 0.898
IleGln: 2.741 ± 0.603
IleArg: 3.374 ± 0.677
IleSer: 6.116 ± 1.283
IleThr: 3.585 ± 0.678
IleVal: 3.374 ± 0.738
IleTrp: 1.054 ± 0.512
IleTyr: 1.898 ± 0.728
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.224 ± 1.537
LysCys: 0.211 ± 0.188
LysAsp: 4.007 ± 0.91
LysGlu: 8.857 ± 1.212
LysPhe: 2.531 ± 0.744
LysGly: 3.585 ± 0.785
LysHis: 2.531 ± 0.69
LysIle: 4.429 ± 1.036
LysLys: 7.381 ± 1.417
LysLeu: 10.122 ± 1.729
LysMet: 1.687 ± 0.624
LysAsn: 6.537 ± 1.194
LysPro: 2.531 ± 0.909
LysGln: 4.007 ± 0.835
LysArg: 5.905 ± 0.996
LysSer: 5.061 ± 0.887
LysThr: 7.17 ± 1.16
LysVal: 4.218 ± 0.927
LysTrp: 1.265 ± 0.505
LysTyr: 2.32 ± 0.878
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.959 ± 1.302
LeuCys: 1.054 ± 0.53
LeuAsp: 9.701 ± 1.419
LeuGlu: 12.442 ± 1.733
LeuPhe: 4.007 ± 1.191
LeuGly: 4.429 ± 1.026
LeuHis: 1.054 ± 0.496
LeuIle: 5.905 ± 1.335
LeuLys: 8.013 ± 1.238
LeuLeu: 9.49 ± 1.418
LeuMet: 2.531 ± 0.698
LeuAsn: 5.905 ± 1.289
LeuPro: 2.741 ± 0.796
LeuGln: 3.796 ± 0.888
LeuArg: 3.796 ± 0.928
LeuSer: 5.483 ± 0.793
LeuThr: 7.381 ± 1.336
LeuVal: 5.694 ± 1.081
LeuTrp: 0.422 ± 0.259
LeuTyr: 3.796 ± 0.815
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.109 ± 0.63
MetCys: 0.211 ± 0.218
MetAsp: 1.265 ± 0.552
MetGlu: 2.952 ± 0.833
MetPhe: 0.633 ± 0.288
MetGly: 1.265 ± 0.553
MetHis: 0.211 ± 0.167
MetIle: 1.476 ± 0.585
MetLys: 2.32 ± 0.616
MetLeu: 1.898 ± 0.544
MetMet: 0.422 ± 0.305
MetAsn: 1.687 ± 0.568
MetPro: 0.422 ± 0.265
MetGln: 0.211 ± 0.206
MetArg: 1.265 ± 0.708
MetSer: 1.054 ± 0.416
MetThr: 3.374 ± 0.902
MetVal: 1.054 ± 0.458
MetTrp: 0.211 ± 0.167
MetTyr: 0.633 ± 0.344
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.796 ± 0.914
AsnCys: 0.211 ± 0.233
AsnAsp: 3.374 ± 0.939
AsnGlu: 2.531 ± 0.653
AsnPhe: 1.898 ± 0.631
AsnGly: 4.218 ± 1.054
AsnHis: 1.265 ± 0.528
AsnIle: 3.374 ± 1.029
AsnLys: 6.116 ± 1.09
AsnLeu: 4.639 ± 0.936
AsnMet: 1.687 ± 0.589
AsnAsn: 2.109 ± 0.69
AsnPro: 2.741 ± 0.688
AsnGln: 1.898 ± 0.773
AsnArg: 1.054 ± 0.413
AsnSer: 2.741 ± 0.791
AsnThr: 4.429 ± 0.925
AsnVal: 2.32 ± 0.833
AsnTrp: 0.633 ± 0.446
AsnTyr: 2.109 ± 0.673
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.054 ± 0.438
ProCys: 0.211 ± 0.167
ProAsp: 1.898 ± 0.605
ProGlu: 2.741 ± 0.784
ProPhe: 1.476 ± 0.704
ProGly: 0.0 ± 0.0
ProHis: 0.211 ± 0.188
ProIle: 1.687 ± 0.498
ProLys: 1.898 ± 0.574
ProLeu: 1.265 ± 0.527
ProMet: 1.054 ± 0.42
ProAsn: 1.687 ± 0.637
ProPro: 1.265 ± 0.528
ProGln: 1.054 ± 0.448
ProArg: 3.796 ± 0.899
ProSer: 1.898 ± 0.791
ProThr: 0.844 ± 0.414
ProVal: 2.531 ± 0.722
ProTrp: 0.0 ± 0.0
ProTyr: 1.476 ± 0.446
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.585 ± 0.636
GlnCys: 0.0 ± 0.0
GlnAsp: 1.476 ± 0.534
GlnGlu: 2.531 ± 0.646
GlnPhe: 1.265 ± 0.541
GlnGly: 2.109 ± 0.817
GlnHis: 0.422 ± 0.259
GlnIle: 1.898 ± 0.575
GlnLys: 3.796 ± 0.663
GlnLeu: 4.218 ± 1.211
GlnMet: 0.422 ± 0.279
GlnAsn: 1.687 ± 0.916
GlnPro: 1.054 ± 0.503
GlnGln: 1.898 ± 0.813
GlnArg: 1.898 ± 0.586
GlnSer: 1.898 ± 0.675
GlnThr: 2.32 ± 0.762
GlnVal: 2.952 ± 0.503
GlnTrp: 1.054 ± 0.417
GlnTyr: 0.633 ± 0.357
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.163 ± 0.771
ArgCys: 0.211 ± 0.19
ArgAsp: 1.898 ± 0.662
ArgGlu: 4.85 ± 1.02
ArgPhe: 2.531 ± 0.909
ArgGly: 2.32 ± 0.847
ArgHis: 0.422 ± 0.241
ArgIle: 5.272 ± 1.132
ArgLys: 5.694 ± 1.01
ArgLeu: 6.959 ± 0.982
ArgMet: 2.109 ± 0.61
ArgAsn: 2.531 ± 0.64
ArgPro: 1.054 ± 0.441
ArgGln: 2.109 ± 0.728
ArgArg: 1.476 ± 0.581
ArgSer: 2.952 ± 0.835
ArgThr: 2.741 ± 0.699
ArgVal: 2.741 ± 0.71
ArgTrp: 0.211 ± 0.167
ArgTyr: 2.531 ± 0.731
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.741 ± 0.738
SerCys: 0.211 ± 0.21
SerAsp: 2.952 ± 0.612
SerGlu: 5.905 ± 1.246
SerPhe: 2.109 ± 0.778
SerGly: 3.163 ± 0.789
SerHis: 0.844 ± 0.554
SerIle: 4.429 ± 0.881
SerLys: 4.639 ± 0.884
SerLeu: 6.748 ± 1.112
SerMet: 2.109 ± 0.603
SerAsn: 3.796 ± 0.901
SerPro: 1.054 ± 0.425
SerGln: 1.265 ± 0.477
SerArg: 2.952 ± 0.856
SerSer: 3.585 ± 0.906
SerThr: 2.531 ± 0.819
SerVal: 2.741 ± 0.699
SerTrp: 1.054 ± 0.634
SerTyr: 2.741 ± 0.651
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.952 ± 0.715
ThrCys: 0.422 ± 0.272
ThrAsp: 3.163 ± 1.031
ThrGlu: 3.374 ± 0.767
ThrPhe: 2.952 ± 0.761
ThrGly: 3.374 ± 0.648
ThrHis: 1.054 ± 0.348
ThrIle: 3.163 ± 0.584
ThrLys: 6.116 ± 1.198
ThrLeu: 6.326 ± 1.134
ThrMet: 1.687 ± 0.566
ThrAsn: 1.265 ± 0.706
ThrPro: 3.163 ± 0.909
ThrGln: 1.898 ± 0.611
ThrArg: 2.32 ± 0.666
ThrSer: 2.741 ± 0.703
ThrThr: 4.429 ± 1.032
ThrVal: 5.061 ± 1.251
ThrTrp: 0.422 ± 0.318
ThrTyr: 5.061 ± 1.045
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.007 ± 1.271
ValCys: 0.211 ± 0.167
ValAsp: 2.952 ± 0.841
ValGlu: 5.694 ± 1.158
ValPhe: 1.687 ± 0.704
ValGly: 3.163 ± 1.013
ValHis: 0.211 ± 0.202
ValIle: 3.796 ± 1.037
ValLys: 4.218 ± 0.901
ValLeu: 5.483 ± 1.252
ValMet: 1.476 ± 0.574
ValAsn: 3.585 ± 0.815
ValPro: 1.476 ± 0.622
ValGln: 1.476 ± 0.613
ValArg: 2.741 ± 0.678
ValSer: 3.585 ± 0.706
ValThr: 2.741 ± 0.736
ValVal: 4.007 ± 1.203
ValTrp: 0.844 ± 0.386
ValTyr: 2.32 ± 0.578
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.422 ± 0.25
TrpCys: 0.0 ± 0.0
TrpAsp: 0.211 ± 0.228
TrpGlu: 1.476 ± 0.512
TrpPhe: 0.211 ± 0.167
TrpGly: 0.633 ± 0.365
TrpHis: 0.211 ± 0.167
TrpIle: 0.633 ± 0.376
TrpLys: 0.211 ± 0.221
TrpLeu: 1.054 ± 0.606
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.054 ± 0.399
TrpArg: 1.265 ± 0.581
TrpSer: 0.844 ± 0.399
TrpThr: 0.211 ± 0.226
TrpVal: 1.054 ± 0.472
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.633 ± 0.322
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.898 ± 0.657
TyrCys: 0.422 ± 0.27
TyrAsp: 2.741 ± 0.794
TyrGlu: 2.952 ± 0.874
TyrPhe: 1.265 ± 0.544
TyrGly: 0.633 ± 0.397
TyrHis: 0.633 ± 0.305
TyrIle: 2.531 ± 0.949
TyrLys: 4.639 ± 1.314
TyrLeu: 6.537 ± 1.181
TyrMet: 1.054 ± 0.504
TyrAsn: 1.687 ± 0.473
TyrPro: 1.476 ± 0.752
TyrGln: 1.265 ± 0.646
TyrArg: 3.796 ± 0.585
TyrSer: 3.374 ± 0.63
TyrThr: 1.898 ± 0.605
TyrVal: 1.898 ± 0.638
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.844 ± 0.415
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 36 proteins (4743 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
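A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only an illustration under stated assumptions; the function names, the fixed seed, the normalisation by number of pairs, and the use of the sample standard deviation as the error measure are not taken from the source, which gives only the replicate count (100) and the resampling level (proteins).

```python
import random
import statistics
from collections import Counter


def pair_frequencies(proteins):
    """Fraction of each adjacent residue pair over all pairs in the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of one pair's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the same set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequencies(sample).get(pair, 0.0))
    return statistics.stdev(estimates)
```

A dipeptide that never occurs has a frequency of zero in every resample, so its estimated error is also zero, which matches the "0.0 ± 0.0" entries in the table.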

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski