Amino acid dipeptide frequency for Streptococcus satellite phage Javan745

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
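A minimal sketch of how such frequencies could be computed is shown here. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and that any residue outside the 20 standard one-letter codes is collapsed to Xaa (written X); the page does not state the exact procedure, and the name `dipeptide_frequencies` is purely illustrative.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumptions (not confirmed by the source): overlapping adjacent pairs are
    counted within each protein, and non-standard residues become 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # zip pairs each residue with its successor: positions (0,1), (1,2), ...
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: two made-up sequences, the second containing a non-standard 'B'.
freqs = dipeptide_frequencies(["MKLLE", "MKB"])
```

In this toy input there are six dipeptides in total, two of which are MK, so `freqs["MK"]` is one third of 1000‰.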

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
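The uniform expectation and the over/under comparison can be sketched as follows. This assumes a plain comparison of each observed value against 1/400; the page may use a different rule for its colouring (for example one that takes the error into account), and the names `expected_per_mille` and `classify` are hypothetical.

```python
ALPHABET_SIZE = 20  # standard amino acids; use 21 to include Xaa

# Uniform expectation: every ordered pair is equally likely.
expected_per_mille = 1000.0 / ALPHABET_SIZE ** 2  # 2.5 per mille = 0.25 %


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"   # marked in red on the page
    if observed_per_mille < expected:
        return "under"  # marked in blue on the page
    return "expected"


# Three values taken from the table below.
observed = {"LysLeu": 11.45, "AlaCys": 0.347, "IleLys": 10.062}
labels = {name: classify(value) for name, value in observed.items()}
```

With these inputs LysLeu and IleLys are labelled "over" and AlaCys "under".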

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
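As a small worked example of this conversion, the sketch below turns two table values into plain fractions. The reciprocal step is only a rough estimate, limited by the three-decimal rounding of the table, and it assumes the smallest nonzero value corresponds to a single occurrence; the function name is illustrative.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10**-3)."""
    return value_per_mille * 1e-3


lys_leu = per_mille_to_fraction(11.45)   # about 0.01145, i.e. 1.145 %
smallest = per_mille_to_fraction(0.347)  # smallest nonzero value in the table

# Reciprocal of the smallest nonzero fraction: roughly how many dipeptides
# a single occurrence is spread over (assumption: 0.347 means one occurrence).
approx_total_dipeptides = round(1 / smallest)
```

The estimate is of the same order as the 2883 amino acids reported for this proteome.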

For more information, see the sequence space article on Wikipedia.

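Rows give the first amino acid of the dipeptide and columns the second. Each entry lists the frequency, followed by the dipeptide with its frequency ± error (‰).
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa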
Ala
0.0AlaAla: 0.0 ± 0.0
0.347AlaCys: 0.347 ± 0.33
1.735AlaAsp: 1.735 ± 1.098
3.47AlaGlu: 3.47 ± 1.29
2.082AlaPhe: 2.082 ± 0.722
3.123AlaGly: 3.123 ± 0.824
0.694AlaHis: 0.694 ± 0.468
3.817AlaIle: 3.817 ± 1.385
7.634AlaLys: 7.634 ± 1.76
4.511AlaLeu: 4.511 ± 1.2
1.735AlaMet: 1.735 ± 0.818
2.776AlaAsn: 2.776 ± 1.021
0.694AlaPro: 0.694 ± 0.51
2.776AlaGln: 2.776 ± 0.986
3.47AlaArg: 3.47 ± 1.069
2.082AlaSer: 2.082 ± 0.865
4.164AlaThr: 4.164 ± 1.213
2.776AlaVal: 2.776 ± 0.972
0.347AlaTrp: 0.347 ± 0.321
2.776AlaTyr: 2.776 ± 0.849
0.0AlaXaa: 0.0 ± 0.0
Cys
0.694CysAla: 0.694 ± 0.458
0.0CysCys: 0.0 ± 0.0
0.347CysAsp: 0.347 ± 0.357
0.347CysGlu: 0.347 ± 0.338
0.347CysPhe: 0.347 ± 0.374
1.041CysGly: 1.041 ± 0.54
0.347CysHis: 0.347 ± 0.338
0.694CysIle: 0.694 ± 0.395
0.347CysLys: 0.347 ± 0.372
0.694CysLeu: 0.694 ± 0.464
0.347CysMet: 0.347 ± 0.349
0.347CysAsn: 0.347 ± 0.3
1.041CysPro: 1.041 ± 0.471
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.347CysSer: 0.347 ± 0.33
0.0CysThr: 0.0 ± 0.0
0.694CysVal: 0.694 ± 0.421
0.0CysTrp: 0.0 ± 0.0
0.347CysTyr: 0.347 ± 0.321
0.0CysXaa: 0.0 ± 0.0
Asp
1.388AspAla: 1.388 ± 0.589
0.694AspCys: 0.694 ± 0.434
5.552AspAsp: 5.552 ± 1.306
2.776AspGlu: 2.776 ± 0.952
3.47AspPhe: 3.47 ± 0.652
3.123AspGly: 3.123 ± 0.966
0.0AspHis: 0.0 ± 0.0
6.94AspIle: 6.94 ± 1.168
3.817AspLys: 3.817 ± 1.207
7.981AspLeu: 7.981 ± 1.852
1.041AspMet: 1.041 ± 0.586
6.593AspAsn: 6.593 ± 1.161
1.041AspPro: 1.041 ± 0.547
1.041AspGln: 1.041 ± 0.563
1.735AspArg: 1.735 ± 0.701
3.47AspSer: 3.47 ± 1.741
2.776AspThr: 2.776 ± 0.927
2.776AspVal: 2.776 ± 0.965
0.694AspTrp: 0.694 ± 0.464
3.123AspTyr: 3.123 ± 1.621
0.0AspXaa: 0.0 ± 0.0
Glu
2.429GluAla: 2.429 ± 1.094
1.735GluCys: 1.735 ± 0.469
4.511GluAsp: 4.511 ± 1.272
3.817GluGlu: 3.817 ± 0.863
3.817GluPhe: 3.817 ± 1.157
2.429GluGly: 2.429 ± 1.001
1.735GluHis: 1.735 ± 0.811
8.328GluIle: 8.328 ± 1.158
9.715GluLys: 9.715 ± 2.043
7.634GluLeu: 7.634 ± 1.792
2.429GluMet: 2.429 ± 0.85
6.94GluAsn: 6.94 ± 1.749
1.388GluPro: 1.388 ± 0.749
4.164GluGln: 4.164 ± 1.076
2.429GluArg: 2.429 ± 0.81
6.94GluSer: 6.94 ± 1.598
5.205GluThr: 5.205 ± 0.848
3.123GluVal: 3.123 ± 1.0
0.694GluTrp: 0.694 ± 0.395
3.47GluTyr: 3.47 ± 1.602
0.0GluXaa: 0.0 ± 0.0
Phe
1.388PheAla: 1.388 ± 0.8
0.694PheCys: 0.694 ± 0.601
2.776PheAsp: 2.776 ± 1.015
3.817PheGlu: 3.817 ± 1.534
1.735PhePhe: 1.735 ± 0.63
1.388PheGly: 1.388 ± 0.576
0.347PheHis: 0.347 ± 0.338
2.429PheIle: 2.429 ± 0.769
4.511PheLys: 4.511 ± 1.394
3.47PheLeu: 3.47 ± 1.02
0.694PheMet: 0.694 ± 0.411
2.082PheAsn: 2.082 ± 0.819
1.041PhePro: 1.041 ± 0.61
1.041PheGln: 1.041 ± 0.609
1.388PheArg: 1.388 ± 0.599
2.082PheSer: 2.082 ± 0.729
1.041PheThr: 1.041 ± 0.461
2.082PheVal: 2.082 ± 1.023
0.347PheTrp: 0.347 ± 0.276
1.735PheTyr: 1.735 ± 0.694
0.0PheXaa: 0.0 ± 0.0
Gly
2.776GlyAla: 2.776 ± 0.966
0.0GlyCys: 0.0 ± 0.0
0.694GlyAsp: 0.694 ± 0.526
4.511GlyGlu: 4.511 ± 0.98
0.347GlyPhe: 0.347 ± 0.349
1.735GlyGly: 1.735 ± 0.592
1.041GlyHis: 1.041 ± 0.457
3.123GlyIle: 3.123 ± 0.86
3.817GlyLys: 3.817 ± 1.151
4.164GlyLeu: 4.164 ± 0.897
2.429GlyMet: 2.429 ± 0.934
2.429GlyAsn: 2.429 ± 1.248
0.0GlyPro: 0.0 ± 0.0
2.429GlyGln: 2.429 ± 0.724
2.082GlyArg: 2.082 ± 0.694
2.429GlySer: 2.429 ± 0.803
2.429GlyThr: 2.429 ± 0.668
2.082GlyVal: 2.082 ± 0.871
0.347GlyTrp: 0.347 ± 0.3
4.164GlyTyr: 4.164 ± 1.15
0.0GlyXaa: 0.0 ± 0.0
His
2.082HisAla: 2.082 ± 1.073
0.0HisCys: 0.0 ± 0.0
1.041HisAsp: 1.041 ± 0.535
1.041HisGlu: 1.041 ± 0.589
0.347HisPhe: 0.347 ± 0.359
0.347HisGly: 0.347 ± 0.357
0.0HisHis: 0.0 ± 0.0
0.347HisIle: 0.347 ± 0.365
0.347HisLys: 0.347 ± 0.276
2.429HisLeu: 2.429 ± 0.887
0.0HisMet: 0.0 ± 0.0
0.694HisAsn: 0.694 ± 0.468
0.0HisPro: 0.0 ± 0.0
0.694HisGln: 0.694 ± 0.421
1.735HisArg: 1.735 ± 0.881
0.694HisSer: 0.694 ± 0.729
1.041HisThr: 1.041 ± 0.706
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.347HisTyr: 0.347 ± 0.321
0.0HisXaa: 0.0 ± 0.0
Ile
3.123IleAla: 3.123 ± 0.951
0.0IleCys: 0.0 ± 0.0
5.899IleAsp: 5.899 ± 1.173
5.552IleGlu: 5.552 ± 1.698
3.47IlePhe: 3.47 ± 1.171
2.082IleGly: 2.082 ± 1.074
0.694IleHis: 0.694 ± 0.421
5.205IleIle: 5.205 ± 1.194
10.062IleLys: 10.062 ± 1.749
7.981IleLeu: 7.981 ± 1.311
1.041IleMet: 1.041 ± 0.639
4.164IleAsn: 4.164 ± 1.077
3.47IlePro: 3.47 ± 0.917
3.123IleGln: 3.123 ± 0.74
1.735IleArg: 1.735 ± 0.624
5.899IleSer: 5.899 ± 1.236
2.429IleThr: 2.429 ± 0.707
2.082IleVal: 2.082 ± 0.692
0.347IleTrp: 0.347 ± 0.357
3.123IleTyr: 3.123 ± 0.858
0.0IleXaa: 0.0 ± 0.0
Lys
6.246LysAla: 6.246 ± 1.475
1.041LysCys: 1.041 ± 0.523
5.552LysAsp: 5.552 ± 1.542
10.409LysGlu: 10.409 ± 2.147
3.123LysPhe: 3.123 ± 0.974
5.205LysGly: 5.205 ± 1.159
2.429LysHis: 2.429 ± 1.361
5.552LysIle: 5.552 ± 1.116
8.328LysLys: 8.328 ± 1.27
11.45LysLeu: 11.45 ± 2.547
2.082LysMet: 2.082 ± 0.897
7.981LysAsn: 7.981 ± 1.437
1.388LysPro: 1.388 ± 0.66
4.511LysGln: 4.511 ± 0.862
6.593LysArg: 6.593 ± 1.328
8.675LysSer: 8.675 ± 1.416
7.634LysThr: 7.634 ± 1.684
3.817LysVal: 3.817 ± 0.948
1.041LysTrp: 1.041 ± 0.565
4.511LysTyr: 4.511 ± 0.816
0.0LysXaa: 0.0 ± 0.0
Leu
6.593LeuAla: 6.593 ± 1.134
0.694LeuCys: 0.694 ± 0.481
7.634LeuAsp: 7.634 ± 1.364
11.45LeuGlu: 11.45 ± 1.514
2.429LeuPhe: 2.429 ± 0.821
4.858LeuGly: 4.858 ± 1.124
0.0LeuHis: 0.0 ± 0.0
6.94LeuIle: 6.94 ± 1.496
10.756LeuLys: 10.756 ± 1.731
8.675LeuLeu: 8.675 ± 1.425
2.082LeuMet: 2.082 ± 0.603
7.981LeuAsn: 7.981 ± 1.584
2.776LeuPro: 2.776 ± 0.973
4.858LeuGln: 4.858 ± 1.457
3.817LeuArg: 3.817 ± 0.799
5.899LeuSer: 5.899 ± 0.836
9.022LeuThr: 9.022 ± 1.47
4.164LeuVal: 4.164 ± 0.66
0.347LeuTrp: 0.347 ± 0.276
3.123LeuTyr: 3.123 ± 0.892
0.0LeuXaa: 0.0 ± 0.0
Met
2.429MetAla: 2.429 ± 0.896
0.0MetCys: 0.0 ± 0.0
2.082MetAsp: 2.082 ± 0.956
1.735MetGlu: 1.735 ± 0.687
0.347MetPhe: 0.347 ± 0.357
0.347MetGly: 0.347 ± 0.312
0.0MetHis: 0.0 ± 0.0
1.041MetIle: 1.041 ± 0.497
2.776MetLys: 2.776 ± 0.834
1.388MetLeu: 1.388 ± 0.448
0.0MetMet: 0.0 ± 0.0
1.735MetAsn: 1.735 ± 0.742
0.694MetPro: 0.694 ± 0.435
0.347MetGln: 0.347 ± 0.329
2.429MetArg: 2.429 ± 0.951
2.082MetSer: 2.082 ± 0.762
2.429MetThr: 2.429 ± 0.962
0.694MetVal: 0.694 ± 0.458
0.347MetTrp: 0.347 ± 0.37
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.817AsnAla: 3.817 ± 0.847
0.347AsnCys: 0.347 ± 0.338
1.735AsnAsp: 1.735 ± 0.895
3.123AsnGlu: 3.123 ± 1.056
2.082AsnPhe: 2.082 ± 0.499
2.776AsnGly: 2.776 ± 0.755
1.388AsnHis: 1.388 ± 0.924
4.164AsnIle: 4.164 ± 1.79
7.287AsnLys: 7.287 ± 1.905
5.899AsnLeu: 5.899 ± 1.477
1.735AsnMet: 1.735 ± 0.846
3.817AsnAsn: 3.817 ± 1.882
2.082AsnPro: 2.082 ± 0.783
4.858AsnGln: 4.858 ± 1.509
2.776AsnArg: 2.776 ± 0.962
6.246AsnSer: 6.246 ± 1.412
5.552AsnThr: 5.552 ± 1.431
1.388AsnVal: 1.388 ± 0.563
1.041AsnTrp: 1.041 ± 0.541
1.735AsnTyr: 1.735 ± 0.54
0.0AsnXaa: 0.0 ± 0.0
Pro
1.388ProAla: 1.388 ± 0.747
0.0ProCys: 0.0 ± 0.0
3.817ProAsp: 3.817 ± 0.953
1.735ProGlu: 1.735 ± 0.72
1.041ProPhe: 1.041 ± 0.471
0.347ProGly: 0.347 ± 0.276
0.347ProHis: 0.347 ± 0.33
2.082ProIle: 2.082 ± 0.727
4.164ProLys: 4.164 ± 1.254
0.347ProLeu: 0.347 ± 0.359
0.0ProMet: 0.0 ± 0.0
0.694ProAsn: 0.694 ± 0.401
1.388ProPro: 1.388 ± 0.81
0.694ProGln: 0.694 ± 0.394
2.082ProArg: 2.082 ± 1.118
1.041ProSer: 1.041 ± 0.612
1.041ProThr: 1.041 ± 0.593
1.735ProVal: 1.735 ± 0.515
0.0ProTrp: 0.0 ± 0.0
0.694ProTyr: 0.694 ± 0.447
0.0ProXaa: 0.0 ± 0.0
Gln
4.164GlnAla: 4.164 ± 0.939
0.0GlnCys: 0.0 ± 0.0
2.776GlnAsp: 2.776 ± 1.208
2.776GlnGlu: 2.776 ± 0.517
1.041GlnPhe: 1.041 ± 0.649
1.388GlnGly: 1.388 ± 0.709
0.0GlnHis: 0.0 ± 0.0
3.123GlnIle: 3.123 ± 0.631
5.205GlnLys: 5.205 ± 1.181
3.817GlnLeu: 3.817 ± 1.403
1.041GlnMet: 1.041 ± 0.677
2.776GlnAsn: 2.776 ± 0.985
1.041GlnPro: 1.041 ± 0.531
1.735GlnGln: 1.735 ± 0.767
1.041GlnArg: 1.041 ± 0.706
1.388GlnSer: 1.388 ± 0.765
1.388GlnThr: 1.388 ± 0.601
3.817GlnVal: 3.817 ± 0.917
0.694GlnTrp: 0.694 ± 0.418
1.041GlnTyr: 1.041 ± 0.808
0.0GlnXaa: 0.0 ± 0.0
Arg
1.735ArgAla: 1.735 ± 0.787
0.347ArgCys: 0.347 ± 0.293
2.082ArgAsp: 2.082 ± 0.723
5.552ArgGlu: 5.552 ± 1.22
1.388ArgPhe: 1.388 ± 0.89
0.694ArgGly: 0.694 ± 0.418
1.041ArgHis: 1.041 ± 0.562
1.735ArgIle: 1.735 ± 0.592
4.858ArgLys: 4.858 ± 1.31
6.94ArgLeu: 6.94 ± 1.776
0.694ArgMet: 0.694 ± 0.453
1.735ArgAsn: 1.735 ± 0.836
0.347ArgPro: 0.347 ± 0.276
2.429ArgGln: 2.429 ± 1.139
2.429ArgArg: 2.429 ± 1.025
1.388ArgSer: 1.388 ± 0.717
2.429ArgThr: 2.429 ± 0.564
2.429ArgVal: 2.429 ± 0.692
0.347ArgTrp: 0.347 ± 0.365
3.123ArgTyr: 3.123 ± 0.849
0.0ArgXaa: 0.0 ± 0.0
Ser
3.123SerAla: 3.123 ± 1.888
0.347SerCys: 0.347 ± 0.276
3.817SerAsp: 3.817 ± 0.827
7.634SerGlu: 7.634 ± 1.706
2.082SerPhe: 2.082 ± 0.831
5.205SerGly: 5.205 ± 1.135
0.694SerHis: 0.694 ± 0.401
3.817SerIle: 3.817 ± 0.885
7.287SerLys: 7.287 ± 1.501
6.246SerLeu: 6.246 ± 0.978
1.735SerMet: 1.735 ± 0.767
4.164SerAsn: 4.164 ± 1.372
1.388SerPro: 1.388 ± 0.473
1.388SerGln: 1.388 ± 0.6
2.429SerArg: 2.429 ± 1.068
4.164SerSer: 4.164 ± 1.307
3.47SerThr: 3.47 ± 1.021
3.123SerVal: 3.123 ± 0.738
0.347SerTrp: 0.347 ± 0.349
2.082SerTyr: 2.082 ± 0.913
0.0SerXaa: 0.0 ± 0.0
Thr
2.429ThrAla: 2.429 ± 0.944
0.694ThrCys: 0.694 ± 0.493
1.735ThrAsp: 1.735 ± 0.678
3.817ThrGlu: 3.817 ± 1.062
2.429ThrPhe: 2.429 ± 0.829
3.123ThrGly: 3.123 ± 0.859
1.388ThrHis: 1.388 ± 0.656
4.858ThrIle: 4.858 ± 1.237
6.246ThrLys: 6.246 ± 1.646
6.593ThrLeu: 6.593 ± 1.363
1.735ThrMet: 1.735 ± 0.572
3.123ThrAsn: 3.123 ± 1.009
1.735ThrPro: 1.735 ± 1.039
0.694ThrGln: 0.694 ± 0.521
1.388ThrArg: 1.388 ± 0.65
3.817ThrSer: 3.817 ± 0.928
5.552ThrThr: 5.552 ± 1.684
6.593ThrVal: 6.593 ± 1.676
1.388ThrTrp: 1.388 ± 0.652
4.164ThrTyr: 4.164 ± 1.512
0.0ThrXaa: 0.0 ± 0.0
Val
3.123ValAla: 3.123 ± 1.27
0.347ValCys: 0.347 ± 0.357
3.123ValAsp: 3.123 ± 1.161
4.858ValGlu: 4.858 ± 1.218
1.041ValPhe: 1.041 ± 0.486
2.776ValGly: 2.776 ± 0.943
0.347ValHis: 0.347 ± 0.321
4.164ValIle: 4.164 ± 1.229
4.511ValLys: 4.511 ± 1.215
5.205ValLeu: 5.205 ± 1.053
0.347ValMet: 0.347 ± 0.45
2.082ValAsn: 2.082 ± 0.748
1.388ValPro: 1.388 ± 0.551
2.082ValGln: 2.082 ± 0.9
2.082ValArg: 2.082 ± 0.736
3.123ValSer: 3.123 ± 1.186
2.429ValThr: 2.429 ± 0.794
2.429ValVal: 2.429 ± 1.256
0.347ValTrp: 0.347 ± 0.37
2.082ValTyr: 2.082 ± 1.003
0.0ValXaa: 0.0 ± 0.0
Trp
1.041TrpAla: 1.041 ± 0.471
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
2.082TrpGlu: 2.082 ± 0.842
0.0TrpPhe: 0.0 ± 0.0
0.347TrpGly: 0.347 ± 0.365
0.347TrpHis: 0.347 ± 0.321
0.694TrpIle: 0.694 ± 0.517
1.041TrpLys: 1.041 ± 0.494
1.041TrpLeu: 1.041 ± 0.644
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.347TrpPro: 0.347 ± 0.359
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.041TrpSer: 1.041 ± 0.608
0.0TrpThr: 0.0 ± 0.0
1.041TrpVal: 1.041 ± 0.538
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.041TyrAla: 1.041 ± 0.723
0.694TyrCys: 0.694 ± 0.418
3.123TyrAsp: 3.123 ± 1.079
2.429TyrGlu: 2.429 ± 0.788
3.47TyrPhe: 3.47 ± 0.967
0.694TyrGly: 0.694 ± 0.447
0.347TyrHis: 0.347 ± 0.321
2.429TyrIle: 2.429 ± 0.848
4.858TyrLys: 4.858 ± 2.071
7.634TyrLeu: 7.634 ± 1.421
1.388TyrMet: 1.388 ± 0.692
1.735TyrAsn: 1.735 ± 0.53
1.388TyrPro: 1.388 ± 0.584
1.388TyrGln: 1.388 ± 0.635
2.082TyrArg: 2.082 ± 0.84
1.735TyrSer: 1.735 ± 0.965
3.47TyrThr: 3.47 ± 0.701
1.388TyrVal: 1.388 ± 0.882
0.347TyrTrp: 0.347 ± 0.3
0.694TyrTyr: 0.694 ± 0.407
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 20 proteins (2883 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
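A protein-level bootstrap of this kind can be sketched as below. The source only states that 100 bootstrap replicates were drawn at the protein level; taking the standard deviation of the replicate frequencies as the error, and all function names and toy sequences, are assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


toy_proteome = ["MKLLEK", "MKKLAG", "MALEKL", "MGKLLS"]  # made-up sequences
error = bootstrap_error(toy_proteome, "KL")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the set.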

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski