Amino acid dipeptide frequency for Streptococcus satellite phage Javan142

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
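The counting described above can be sketched in Python. This is a minimal illustration, not the database's actual pipeline: the function names, the toy sequences, and the choices to skip pairs containing non-standard letters and to normalize by the number of counted dipeptides are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies for all 400 standard dipeptides.

    Dipeptides are overlapping residue pairs counted within each protein,
    never across protein boundaries. Pairs containing a non-standard
    letter are skipped (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            if first in AMINO_ACIDS and second in AMINO_ACIDS:
                counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_permille, expected_permille=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_permille > expected_permille:
        return "over"   # would be marked red in the table
    if freq_permille < expected_permille:
        return "under"  # would be marked blue in the table
    return "expected"


# Hypothetical toy sequences, not taken from the Javan142 proteome.
toy_proteins = ["MKLEELLK", "MAEL"]
freqs = dipeptide_frequencies(toy_proteins)
print(freqs["EL"], classify(freqs["EL"]))
```

With real data the input would be the protein sequences of the proteome; the three-letter labels used in the table (for example GluLeu) correspond to the one-letter pairs used here (EL).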

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
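As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction and compares it with the uniform expectation of 1/400. The function names are hypothetical; the GluLeu value is the one listed in the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_over_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed per-mille value to the uniform expectation."""
    expected_permille = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    return value_permille / expected_permille


glu_leu = 12.361  # GluLeu frequency from the table, in per mille
print(permille_to_fraction(glu_leu))  # fraction of all dipeptides
print(fold_over_uniform(glu_leu))     # how many times the uniform expectation
```

GluLeu, the most frequent dipeptide in the table, comes out at roughly five times the uniform expectation.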

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.854 ± 0.894
AlaCys: 1.545 ± 0.629
AlaAsp: 1.854 ± 0.82
AlaGlu: 5.562 ± 1.83
AlaPhe: 1.854 ± 0.651
AlaGly: 2.472 ± 0.867
AlaHis: 1.545 ± 0.772
AlaIle: 2.781 ± 1.149
AlaLys: 4.326 ± 1.278
AlaLeu: 5.253 ± 1.165
AlaMet: 1.236 ± 0.564
AlaAsn: 2.163 ± 1.006
AlaPro: 0.618 ± 0.501
AlaGln: 3.09 ± 0.87
AlaArg: 3.09 ± 0.967
AlaSer: 2.472 ± 0.758
AlaThr: 5.253 ± 0.943
AlaVal: 4.944 ± 1.13
AlaTrp: 1.236 ± 0.664
AlaTyr: 3.399 ± 0.899
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.309 ± 0.346
CysCys: 0.309 ± 0.31
CysAsp: 0.309 ± 0.326
CysGlu: 0.309 ± 0.342
CysPhe: 0.309 ± 0.342
CysGly: 0.618 ± 0.383
CysHis: 0.309 ± 0.337
CysIle: 0.309 ± 0.255
CysLys: 0.309 ± 0.272
CysLeu: 0.309 ± 0.272
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.309 ± 0.272
CysSer: 0.927 ± 0.511
CysThr: 0.0 ± 0.0
CysVal: 0.309 ± 0.31
CysTrp: 0.0 ± 0.0
CysTyr: 0.927 ± 0.537
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.618 ± 0.428
AspCys: 0.618 ± 0.383
AspAsp: 4.944 ± 1.164
AspGlu: 4.944 ± 1.416
AspPhe: 2.781 ± 1.07
AspGly: 1.854 ± 0.806
AspHis: 0.927 ± 0.563
AspIle: 8.035 ± 1.456
AspLys: 5.562 ± 1.505
AspLeu: 7.417 ± 1.107
AspMet: 2.781 ± 0.969
AspAsn: 1.545 ± 0.628
AspPro: 0.0 ± 0.0
AspGln: 1.545 ± 0.669
AspArg: 2.163 ± 1.004
AspSer: 3.399 ± 1.234
AspThr: 2.163 ± 0.9
AspVal: 3.09 ± 1.112
AspTrp: 0.618 ± 0.438
AspTyr: 4.635 ± 1.121
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.562 ± 1.514
GluCys: 0.618 ± 0.693
GluAsp: 3.399 ± 1.345
GluGlu: 7.726 ± 2.683
GluPhe: 1.854 ± 1.047
GluGly: 3.399 ± 0.887
GluHis: 2.781 ± 0.709
GluIle: 5.562 ± 1.09
GluLys: 7.726 ± 2.27
GluLeu: 12.361 ± 2.24
GluMet: 4.017 ± 1.026
GluAsn: 4.017 ± 1.289
GluPro: 0.618 ± 0.453
GluGln: 4.944 ± 1.766
GluArg: 3.399 ± 1.479
GluSer: 2.781 ± 0.868
GluThr: 4.635 ± 1.102
GluVal: 4.944 ± 1.041
GluTrp: 0.927 ± 0.437
GluTyr: 2.781 ± 0.754
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.545 ± 0.636
PheCys: 0.309 ± 0.31
PheAsp: 3.399 ± 0.827
PheGlu: 4.017 ± 0.911
PhePhe: 3.09 ± 0.956
PheGly: 1.854 ± 0.546
PheHis: 1.545 ± 0.691
PheIle: 2.781 ± 1.016
PheLys: 3.708 ± 0.844
PheLeu: 3.399 ± 0.78
PheMet: 0.618 ± 0.442
PheAsn: 2.163 ± 0.787
PhePro: 0.618 ± 0.418
PheGln: 0.927 ± 0.43
PheArg: 2.163 ± 0.72
PheSer: 2.472 ± 0.692
PheThr: 4.017 ± 1.051
PheVal: 1.236 ± 0.518
PheTrp: 0.618 ± 0.494
PheTyr: 0.927 ± 0.51
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.163 ± 0.589
GlyCys: 0.618 ± 0.409
GlyAsp: 3.09 ± 0.801
GlyGlu: 3.708 ± 0.979
GlyPhe: 1.545 ± 0.813
GlyGly: 1.545 ± 1.009
GlyHis: 0.927 ± 0.462
GlyIle: 4.635 ± 1.147
GlyLys: 3.09 ± 0.91
GlyLeu: 5.253 ± 1.51
GlyMet: 0.309 ± 0.337
GlyAsn: 3.399 ± 1.029
GlyPro: 0.0 ± 0.0
GlyGln: 1.545 ± 0.604
GlyArg: 2.781 ± 1.506
GlySer: 0.927 ± 0.442
GlyThr: 2.781 ± 0.683
GlyVal: 3.708 ± 1.233
GlyTrp: 0.309 ± 0.31
GlyTyr: 3.708 ± 0.965
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.545 ± 0.668
HisCys: 0.0 ± 0.0
HisAsp: 0.618 ± 0.324
HisGlu: 0.618 ± 0.439
HisPhe: 0.927 ± 0.541
HisGly: 0.618 ± 0.345
HisHis: 0.309 ± 0.255
HisIle: 1.545 ± 0.576
HisLys: 1.545 ± 0.502
HisLeu: 0.927 ± 0.701
HisMet: 0.927 ± 0.508
HisAsn: 1.236 ± 0.575
HisPro: 0.927 ± 0.563
HisGln: 1.236 ± 0.612
HisArg: 1.545 ± 0.594
HisSer: 0.618 ± 0.452
HisThr: 0.927 ± 0.54
HisVal: 1.236 ± 0.619
HisTrp: 0.0 ± 0.0
HisTyr: 0.927 ± 0.596
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.562 ± 1.359
IleCys: 0.618 ± 0.459
IleAsp: 4.326 ± 0.934
IleGlu: 6.799 ± 1.303
IlePhe: 3.708 ± 0.942
IleGly: 3.708 ± 1.17
IleHis: 0.618 ± 0.511
IleIle: 4.017 ± 1.19
IleLys: 6.18 ± 1.502
IleLeu: 5.253 ± 0.931
IleMet: 1.236 ± 0.599
IleAsn: 2.781 ± 0.725
IlePro: 1.854 ± 0.67
IleGln: 3.399 ± 1.118
IleArg: 2.472 ± 0.878
IleSer: 3.708 ± 0.906
IleThr: 4.944 ± 1.475
IleVal: 2.781 ± 0.973
IleTrp: 0.0 ± 0.0
IleTyr: 1.854 ± 0.801
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.108 ± 1.36
LysCys: 0.0 ± 0.0
LysAsp: 4.326 ± 1.034
LysGlu: 8.344 ± 1.374
LysPhe: 4.017 ± 1.059
LysGly: 4.944 ± 1.283
LysHis: 2.163 ± 0.857
LysIle: 4.635 ± 1.191
LysLys: 8.035 ± 1.344
LysLeu: 9.58 ± 1.634
LysMet: 2.163 ± 0.95
LysAsn: 4.944 ± 0.863
LysPro: 3.09 ± 1.012
LysGln: 4.326 ± 0.914
LysArg: 5.253 ± 1.317
LysSer: 4.017 ± 1.192
LysThr: 3.708 ± 1.113
LysVal: 6.18 ± 1.398
LysTrp: 1.236 ± 0.554
LysTyr: 2.781 ± 0.644
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.944 ± 1.273
LeuCys: 0.309 ± 0.255
LeuAsp: 8.344 ± 0.936
LeuGlu: 10.816 ± 2.11
LeuPhe: 3.399 ± 1.247
LeuGly: 5.871 ± 1.479
LeuHis: 1.545 ± 0.715
LeuIle: 5.253 ± 1.548
LeuLys: 8.035 ± 1.245
LeuLeu: 8.035 ± 1.806
LeuMet: 2.163 ± 0.734
LeuAsn: 5.562 ± 1.099
LeuPro: 1.854 ± 0.585
LeuGln: 3.708 ± 0.898
LeuArg: 3.09 ± 0.801
LeuSer: 10.198 ± 1.542
LeuThr: 3.708 ± 0.816
LeuVal: 4.635 ± 1.441
LeuTrp: 0.927 ± 0.563
LeuTyr: 6.489 ± 1.16
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.472 ± 0.755
MetCys: 0.0 ± 0.0
MetAsp: 1.236 ± 0.639
MetGlu: 2.781 ± 1.127
MetPhe: 0.618 ± 0.364
MetGly: 0.309 ± 0.37
MetHis: 0.309 ± 0.31
MetIle: 1.545 ± 0.753
MetLys: 2.163 ± 0.796
MetLeu: 1.854 ± 0.736
MetMet: 0.0 ± 0.0
MetAsn: 1.545 ± 0.53
MetPro: 0.0 ± 0.0
MetGln: 1.854 ± 0.765
MetArg: 1.854 ± 0.759
MetSer: 1.236 ± 0.543
MetThr: 2.781 ± 1.103
MetVal: 1.545 ± 0.903
MetTrp: 0.309 ± 0.282
MetTyr: 0.927 ± 0.406
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.708 ± 1.039
AsnCys: 0.0 ± 0.0
AsnAsp: 1.854 ± 0.679
AsnGlu: 1.545 ± 1.027
AsnPhe: 2.163 ± 0.711
AsnGly: 3.09 ± 0.983
AsnHis: 0.618 ± 0.438
AsnIle: 4.017 ± 1.136
AsnLys: 4.944 ± 1.171
AsnLeu: 2.781 ± 0.891
AsnMet: 0.927 ± 0.608
AsnAsn: 4.326 ± 1.587
AsnPro: 2.472 ± 0.668
AsnGln: 3.708 ± 0.945
AsnArg: 3.399 ± 0.91
AsnSer: 4.326 ± 1.048
AsnThr: 3.399 ± 0.852
AsnVal: 1.854 ± 0.781
AsnTrp: 0.927 ± 0.602
AsnTyr: 2.472 ± 0.881
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.927 ± 0.563
ProCys: 0.309 ± 0.319
ProAsp: 2.163 ± 0.587
ProGlu: 1.545 ± 0.57
ProPhe: 1.236 ± 0.611
ProGly: 0.309 ± 0.255
ProHis: 0.0 ± 0.0
ProIle: 0.927 ± 0.512
ProLys: 3.399 ± 1.066
ProLeu: 1.854 ± 0.649
ProMet: 0.618 ± 0.32
ProAsn: 1.545 ± 0.622
ProPro: 0.927 ± 0.564
ProGln: 0.618 ± 0.492
ProArg: 1.236 ± 0.737
ProSer: 1.545 ± 0.669
ProThr: 2.472 ± 0.823
ProVal: 0.618 ± 0.383
ProTrp: 0.0 ± 0.0
ProTyr: 0.927 ± 0.616
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.781 ± 0.805
GlnCys: 0.0 ± 0.0
GlnAsp: 2.163 ± 0.869
GlnGlu: 3.399 ± 0.752
GlnPhe: 3.09 ± 0.858
GlnGly: 1.854 ± 0.975
GlnHis: 0.618 ± 0.447
GlnIle: 1.854 ± 0.978
GlnLys: 4.635 ± 1.016
GlnLeu: 4.326 ± 1.213
GlnMet: 0.309 ± 0.45
GlnAsn: 3.09 ± 0.968
GlnPro: 1.854 ± 0.679
GlnGln: 2.163 ± 1.019
GlnArg: 1.236 ± 0.578
GlnSer: 2.472 ± 0.923
GlnThr: 3.708 ± 1.064
GlnVal: 4.944 ± 1.154
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.854 ± 0.81
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.09 ± 0.873
ArgCys: 0.309 ± 0.37
ArgAsp: 4.326 ± 1.045
ArgGlu: 2.781 ± 0.858
ArgPhe: 1.236 ± 0.576
ArgGly: 0.618 ± 0.48
ArgHis: 0.618 ± 0.342
ArgIle: 2.472 ± 0.816
ArgLys: 5.562 ± 1.113
ArgLeu: 5.562 ± 1.287
ArgMet: 0.927 ± 0.549
ArgAsn: 1.545 ± 0.875
ArgPro: 0.618 ± 0.511
ArgGln: 4.944 ± 0.812
ArgArg: 1.236 ± 0.622
ArgSer: 3.09 ± 0.754
ArgThr: 4.635 ± 1.183
ArgVal: 2.472 ± 0.979
ArgTrp: 0.927 ± 0.531
ArgTyr: 2.163 ± 0.828
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.854 ± 0.983
SerCys: 0.309 ± 0.342
SerAsp: 3.09 ± 0.881
SerGlu: 4.635 ± 0.918
SerPhe: 3.399 ± 0.895
SerGly: 3.399 ± 1.335
SerHis: 0.0 ± 0.0
SerIle: 3.708 ± 0.885
SerLys: 6.18 ± 1.516
SerLeu: 4.326 ± 1.069
SerMet: 2.472 ± 0.798
SerAsn: 3.399 ± 1.086
SerPro: 2.472 ± 0.764
SerGln: 2.472 ± 0.825
SerArg: 3.399 ± 1.054
SerSer: 2.472 ± 0.859
SerThr: 1.545 ± 0.556
SerVal: 3.09 ± 1.011
SerTrp: 1.236 ± 0.601
SerTyr: 3.09 ± 0.766
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.399 ± 1.174
ThrCys: 0.0 ± 0.0
ThrAsp: 2.472 ± 0.892
ThrGlu: 4.944 ± 1.246
ThrPhe: 2.781 ± 1.48
ThrGly: 4.326 ± 1.208
ThrHis: 1.545 ± 0.605
ThrIle: 4.017 ± 1.447
ThrLys: 4.017 ± 1.629
ThrLeu: 6.799 ± 1.263
ThrMet: 1.236 ± 0.555
ThrAsn: 2.472 ± 1.172
ThrPro: 3.09 ± 0.965
ThrGln: 1.236 ± 0.988
ThrArg: 2.163 ± 0.947
ThrSer: 2.472 ± 0.642
ThrThr: 4.326 ± 1.391
ThrVal: 4.326 ± 0.996
ThrTrp: 0.0 ± 0.0
ThrTyr: 5.562 ± 1.059
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.781 ± 0.811
ValCys: 0.0 ± 0.0
ValAsp: 4.326 ± 1.078
ValGlu: 5.253 ± 1.27
ValPhe: 1.545 ± 0.713
ValGly: 2.472 ± 1.268
ValHis: 0.309 ± 0.272
ValIle: 5.253 ± 0.995
ValLys: 5.562 ± 1.246
ValLeu: 7.108 ± 1.253
ValMet: 1.236 ± 0.547
ValAsn: 4.326 ± 0.947
ValPro: 1.545 ± 0.754
ValGln: 0.309 ± 0.419
ValArg: 2.781 ± 0.715
ValSer: 5.253 ± 1.43
ValThr: 4.326 ± 1.883
ValVal: 3.09 ± 1.124
ValTrp: 0.618 ± 0.39
ValTyr: 1.236 ± 0.488
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.927 ± 0.788
TrpCys: 0.0 ± 0.0
TrpAsp: 1.236 ± 0.689
TrpGlu: 1.854 ± 0.736
TrpPhe: 0.309 ± 0.283
TrpGly: 0.309 ± 0.255
TrpHis: 0.0 ± 0.0
TrpIle: 0.618 ± 0.484
TrpLys: 0.618 ± 0.408
TrpLeu: 0.618 ± 0.458
TrpMet: 0.0 ± 0.0
TrpAsn: 0.309 ± 0.283
TrpPro: 0.0 ± 0.0
TrpGln: 0.927 ± 0.507
TrpArg: 0.618 ± 0.395
TrpSer: 0.309 ± 0.272
TrpThr: 0.309 ± 0.255
TrpVal: 0.927 ± 0.615
TrpTrp: 0.309 ± 0.272
TrpTyr: 0.309 ± 0.346
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.399 ± 1.146
TyrCys: 0.0 ± 0.0
TyrAsp: 2.163 ± 0.859
TyrGlu: 2.472 ± 0.784
TyrPhe: 1.545 ± 0.582
TyrGly: 2.472 ± 0.65
TyrHis: 1.545 ± 0.556
TyrIle: 1.854 ± 0.861
TyrLys: 5.253 ± 1.332
TyrLeu: 5.871 ± 0.968
TyrMet: 1.854 ± 0.802
TyrAsn: 2.163 ± 0.918
TyrPro: 0.618 ± 0.412
TyrGln: 3.708 ± 0.886
TyrArg: 4.944 ± 1.211
TyrSer: 1.854 ± 0.702
TyrThr: 1.545 ± 0.775
TyrVal: 3.399 ± 0.838
TyrTrp: 0.309 ± 0.255
TyrTyr: 1.545 ± 0.739
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (3237 amino acids)
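Because the proteome is small, each table value corresponds to only a handful of occurrences. The sketch below converts a per-mille value back to an approximate count. It assumes that frequencies are normalized by the total number of amino acids (3237); the smallest nonzero value in the table, 0.309‰, is close to 1/3237 ≈ 0.3089‰, which is consistent with that assumption but does not confirm it. The function name is hypothetical.

```python
TOTAL_AMINO_ACIDS = 3237  # from the statistics line of this page


def approx_count(value_permille, total=TOTAL_AMINO_ACIDS):
    """Estimate the number of occurrences behind a per-mille value.

    Assumes the value was normalized by `total`; this normalization is
    an assumption of the sketch, not something stated on the page.
    """
    return round(value_permille * 1e-3 * total)


for name, value in [("CysAla", 0.309), ("CysGly", 0.618), ("GluLeu", 12.361)]:
    print(name, approx_count(value))
```

Under this assumption a value of 0.309‰ reflects a single occurrence, which explains why the bootstrap errors of rare dipeptides are as large as the values themselves.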

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
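Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the statistic is recomputed on each resample, and the spread of the replicates is taken as the error. The page only states that 100 bootstrap replicates were drawn at the protein level; using the standard deviation as the error measure, the function names, the fixed seed, and the toy sequences are assumptions of this example.

```python
import random
import statistics


def bootstrap_error(proteins, statistic, n_replicates=100, seed=0):
    """Standard deviation of `statistic` over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = [rng.choice(proteins) for _ in proteins]
        estimates.append(statistic(resample))
    return statistics.stdev(estimates)


def glu_leu_permille(proteins):
    """Per-mille frequency of the dipeptide EL (GluLeu) among all dipeptides."""
    pairs = [a + b for seq in proteins for a, b in zip(seq, seq[1:])]
    return 1000.0 * pairs.count("EL") / len(pairs) if pairs else 0.0


# Hypothetical toy sequences, not taken from the Javan142 proteome.
toy_proteins = ["MKLEELLK", "MAEL", "MELK"]
print(bootstrap_error(toy_proteins, glu_leu_permille))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in one protein contribute to the error estimate together.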

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski