Amino acid dipepetide frequency for Streptococcus satellite phage Javan138

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.854AlaAla: 1.854 ± 0.962
1.545AlaCys: 1.545 ± 0.553
1.854AlaAsp: 1.854 ± 0.734
5.562AlaGlu: 5.562 ± 1.541
1.854AlaPhe: 1.854 ± 0.59
2.472AlaGly: 2.472 ± 0.86
1.545AlaHis: 1.545 ± 0.874
2.781AlaIle: 2.781 ± 0.969
4.326AlaLys: 4.326 ± 1.276
5.253AlaLeu: 5.253 ± 1.267
1.236AlaMet: 1.236 ± 0.65
2.163AlaAsn: 2.163 ± 0.998
0.618AlaPro: 0.618 ± 0.397
3.09AlaGln: 3.09 ± 0.992
3.09AlaArg: 3.09 ± 0.901
2.472AlaSer: 2.472 ± 0.578
5.253AlaThr: 5.253 ± 1.015
4.944AlaVal: 4.944 ± 1.196
1.236AlaTrp: 1.236 ± 0.685
3.399AlaTyr: 3.399 ± 0.745
0.0AlaXaa: 0.0 ± 0.0
Cys
0.309CysAla: 0.309 ± 0.331
0.309CysCys: 0.309 ± 0.293
0.309CysAsp: 0.309 ± 0.331
0.309CysGlu: 0.309 ± 0.369
0.309CysPhe: 0.309 ± 0.369
0.618CysGly: 0.618 ± 0.353
0.309CysHis: 0.309 ± 0.311
0.309CysIle: 0.309 ± 0.256
0.309CysLys: 0.309 ± 0.285
0.309CysLeu: 0.309 ± 0.285
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.309CysArg: 0.309 ± 0.285
0.927CysSer: 0.927 ± 0.521
0.0CysThr: 0.0 ± 0.0
0.309CysVal: 0.309 ± 0.293
0.0CysTrp: 0.0 ± 0.0
0.927CysTyr: 0.927 ± 0.549
0.0CysXaa: 0.0 ± 0.0
Asp
0.618AspAla: 0.618 ± 0.393
0.618AspCys: 0.618 ± 0.388
4.944AspAsp: 4.944 ± 1.032
4.944AspGlu: 4.944 ± 1.456
2.781AspPhe: 2.781 ± 0.975
1.854AspGly: 1.854 ± 0.882
0.927AspHis: 0.927 ± 0.57
8.035AspIle: 8.035 ± 1.462
5.562AspLys: 5.562 ± 1.597
7.417AspLeu: 7.417 ± 1.151
2.781AspMet: 2.781 ± 0.923
1.545AspAsn: 1.545 ± 0.703
0.0AspPro: 0.0 ± 0.0
1.545AspGln: 1.545 ± 0.613
2.163AspArg: 2.163 ± 0.831
3.399AspSer: 3.399 ± 1.001
2.163AspThr: 2.163 ± 0.857
3.09AspVal: 3.09 ± 0.98
0.618AspTrp: 0.618 ± 0.385
4.635AspTyr: 4.635 ± 1.001
0.0AspXaa: 0.0 ± 0.0
Glu
5.562GluAla: 5.562 ± 1.495
0.618GluCys: 0.618 ± 0.663
3.399GluAsp: 3.399 ± 1.126
7.726GluGlu: 7.726 ± 2.207
1.854GluPhe: 1.854 ± 1.023
3.399GluGly: 3.399 ± 0.936
2.781GluHis: 2.781 ± 0.67
5.562GluIle: 5.562 ± 1.099
7.726GluLys: 7.726 ± 2.135
12.361GluLeu: 12.361 ± 2.204
4.017GluMet: 4.017 ± 0.913
4.017GluAsn: 4.017 ± 1.226
0.618GluPro: 0.618 ± 0.423
4.944GluGln: 4.944 ± 1.546
3.399GluArg: 3.399 ± 1.329
2.781GluSer: 2.781 ± 0.929
4.635GluThr: 4.635 ± 0.921
4.944GluVal: 4.944 ± 1.014
0.927GluTrp: 0.927 ± 0.442
2.781GluTyr: 2.781 ± 0.603
0.0GluXaa: 0.0 ± 0.0
Phe
1.545PheAla: 1.545 ± 0.587
0.309PheCys: 0.309 ± 0.293
3.399PheAsp: 3.399 ± 0.939
4.017PheGlu: 4.017 ± 0.886
3.09PhePhe: 3.09 ± 1.005
1.854PheGly: 1.854 ± 0.487
1.545PheHis: 1.545 ± 0.59
2.781PheIle: 2.781 ± 1.05
3.708PheLys: 3.708 ± 0.804
3.399PheLeu: 3.399 ± 0.79
0.618PheMet: 0.618 ± 0.408
2.163PheAsn: 2.163 ± 0.84
0.618PhePro: 0.618 ± 0.352
0.927PheGln: 0.927 ± 0.472
2.163PheArg: 2.163 ± 0.881
2.472PheSer: 2.472 ± 0.687
4.017PheThr: 4.017 ± 0.825
1.236PheVal: 1.236 ± 0.486
0.618PheTrp: 0.618 ± 0.434
0.927PheTyr: 0.927 ± 0.444
0.0PheXaa: 0.0 ± 0.0
Gly
2.163GlyAla: 2.163 ± 0.629
0.618GlyCys: 0.618 ± 0.416
3.09GlyAsp: 3.09 ± 0.668
3.708GlyGlu: 3.708 ± 0.884
1.545GlyPhe: 1.545 ± 0.751
1.545GlyGly: 1.545 ± 1.028
0.927GlyHis: 0.927 ± 0.483
4.635GlyIle: 4.635 ± 1.036
3.09GlyLys: 3.09 ± 0.901
5.253GlyLeu: 5.253 ± 1.125
0.309GlyMet: 0.309 ± 0.354
3.399GlyAsn: 3.399 ± 0.942
0.0GlyPro: 0.0 ± 0.0
1.545GlyGln: 1.545 ± 0.631
2.781GlyArg: 2.781 ± 1.398
0.927GlySer: 0.927 ± 0.507
2.781GlyThr: 2.781 ± 0.6
3.708GlyVal: 3.708 ± 1.143
0.309GlyTrp: 0.309 ± 0.293
3.708GlyTyr: 3.708 ± 0.863
0.0GlyXaa: 0.0 ± 0.0
His
1.545HisAla: 1.545 ± 0.615
0.0HisCys: 0.0 ± 0.0
0.618HisAsp: 0.618 ± 0.352
0.618HisGlu: 0.618 ± 0.462
0.927HisPhe: 0.927 ± 0.429
0.618HisGly: 0.618 ± 0.394
0.309HisHis: 0.309 ± 0.256
1.545HisIle: 1.545 ± 0.519
1.545HisLys: 1.545 ± 0.486
0.927HisLeu: 0.927 ± 0.664
0.927HisMet: 0.927 ± 0.393
1.236HisAsn: 1.236 ± 0.597
0.927HisPro: 0.927 ± 0.524
1.236HisGln: 1.236 ± 0.584
1.545HisArg: 1.545 ± 0.598
0.618HisSer: 0.618 ± 0.375
0.927HisThr: 0.927 ± 0.539
1.236HisVal: 1.236 ± 0.661
0.0HisTrp: 0.0 ± 0.0
0.927HisTyr: 0.927 ± 0.531
0.0HisXaa: 0.0 ± 0.0
Ile
5.562IleAla: 5.562 ± 1.085
0.618IleCys: 0.618 ± 0.439
4.326IleAsp: 4.326 ± 0.986
6.799IleGlu: 6.799 ± 1.17
3.708IlePhe: 3.708 ± 0.911
3.708IleGly: 3.708 ± 1.013
0.618IleHis: 0.618 ± 0.512
4.017IleIle: 4.017 ± 1.003
6.18IleLys: 6.18 ± 1.328
5.253IleLeu: 5.253 ± 1.017
1.236IleMet: 1.236 ± 0.565
2.781IleAsn: 2.781 ± 0.657
1.854IlePro: 1.854 ± 0.596
3.399IleGln: 3.399 ± 1.048
2.472IleArg: 2.472 ± 0.874
3.708IleSer: 3.708 ± 0.804
4.944IleThr: 4.944 ± 1.354
2.781IleVal: 2.781 ± 0.805
0.0IleTrp: 0.0 ± 0.0
1.854IleTyr: 1.854 ± 0.792
0.0IleXaa: 0.0 ± 0.0
Lys
7.108LysAla: 7.108 ± 1.415
0.0LysCys: 0.0 ± 0.0
4.326LysAsp: 4.326 ± 0.916
8.344LysGlu: 8.344 ± 1.231
4.017LysPhe: 4.017 ± 1.047
4.944LysGly: 4.944 ± 1.351
2.163LysHis: 2.163 ± 0.864
4.635LysIle: 4.635 ± 1.236
8.035LysLys: 8.035 ± 1.52
9.58LysLeu: 9.58 ± 1.587
2.163LysMet: 2.163 ± 0.901
4.944LysAsn: 4.944 ± 0.909
3.09LysPro: 3.09 ± 0.957
4.326LysGln: 4.326 ± 0.917
5.253LysArg: 5.253 ± 1.326
4.017LysSer: 4.017 ± 1.112
3.708LysThr: 3.708 ± 0.996
6.18LysVal: 6.18 ± 1.321
1.236LysTrp: 1.236 ± 0.502
2.781LysTyr: 2.781 ± 0.584
0.0LysXaa: 0.0 ± 0.0
Leu
4.944LeuAla: 4.944 ± 1.28
0.309LeuCys: 0.309 ± 0.256
8.344LeuAsp: 8.344 ± 0.952
10.816LeuGlu: 10.816 ± 2.116
3.399LeuPhe: 3.399 ± 1.173
5.871LeuGly: 5.871 ± 1.519
1.545LeuHis: 1.545 ± 0.543
5.253LeuIle: 5.253 ± 1.267
8.035LeuLys: 8.035 ± 1.101
8.035LeuLeu: 8.035 ± 1.728
1.854LeuMet: 1.854 ± 0.744
5.562LeuAsn: 5.562 ± 1.152
1.854LeuPro: 1.854 ± 0.553
3.708LeuGln: 3.708 ± 0.943
3.09LeuArg: 3.09 ± 0.731
10.198LeuSer: 10.198 ± 1.406
3.708LeuThr: 3.708 ± 0.938
4.635LeuVal: 4.635 ± 1.226
0.927LeuTrp: 0.927 ± 0.543
6.489LeuTyr: 6.489 ± 1.006
0.0LeuXaa: 0.0 ± 0.0
Met
2.472MetAla: 2.472 ± 0.823
0.0MetCys: 0.0 ± 0.0
1.236MetAsp: 1.236 ± 0.545
2.781MetGlu: 2.781 ± 1.011
0.618MetPhe: 0.618 ± 0.369
0.309MetGly: 0.309 ± 0.302
0.309MetHis: 0.309 ± 0.293
1.545MetIle: 1.545 ± 0.833
2.163MetLys: 2.163 ± 0.72
1.854MetLeu: 1.854 ± 0.672
0.0MetMet: 0.0 ± 0.0
1.545MetAsn: 1.545 ± 0.445
0.0MetPro: 0.0 ± 0.0
1.854MetGln: 1.854 ± 0.845
1.854MetArg: 1.854 ± 0.811
1.236MetSer: 1.236 ± 0.515
2.781MetThr: 2.781 ± 0.889
1.545MetVal: 1.545 ± 0.985
0.309MetTrp: 0.309 ± 0.298
0.927MetTyr: 0.927 ± 0.401
0.0MetXaa: 0.0 ± 0.0
Asn
3.708AsnAla: 3.708 ± 0.909
0.0AsnCys: 0.0 ± 0.0
1.854AsnAsp: 1.854 ± 0.682
1.545AsnGlu: 1.545 ± 1.014
2.163AsnPhe: 2.163 ± 0.765
3.09AsnGly: 3.09 ± 1.111
0.618AsnHis: 0.618 ± 0.378
4.017AsnIle: 4.017 ± 1.08
4.944AsnLys: 4.944 ± 1.095
2.781AsnLeu: 2.781 ± 0.821
0.927AsnMet: 0.927 ± 0.548
4.326AsnAsn: 4.326 ± 1.351
2.472AsnPro: 2.472 ± 0.609
3.708AsnGln: 3.708 ± 0.99
3.399AsnArg: 3.399 ± 0.893
4.326AsnSer: 4.326 ± 1.067
3.399AsnThr: 3.399 ± 0.868
1.854AsnVal: 1.854 ± 0.842
0.927AsnTrp: 0.927 ± 0.56
2.472AsnTyr: 2.472 ± 0.778
0.0AsnXaa: 0.0 ± 0.0
Pro
0.927ProAla: 0.927 ± 0.583
0.309ProCys: 0.309 ± 0.329
2.163ProAsp: 2.163 ± 0.557
1.545ProGlu: 1.545 ± 0.539
1.236ProPhe: 1.236 ± 0.597
0.309ProGly: 0.309 ± 0.256
0.0ProHis: 0.0 ± 0.0
0.927ProIle: 0.927 ± 0.507
3.399ProLys: 3.399 ± 0.92
1.854ProLeu: 1.854 ± 0.617
0.618ProMet: 0.618 ± 0.355
1.545ProAsn: 1.545 ± 0.632
0.927ProPro: 0.927 ± 0.57
0.618ProGln: 0.618 ± 0.456
1.236ProArg: 1.236 ± 0.607
1.545ProSer: 1.545 ± 0.504
2.472ProThr: 2.472 ± 0.899
0.618ProVal: 0.618 ± 0.388
0.0ProTrp: 0.0 ± 0.0
0.927ProTyr: 0.927 ± 0.72
0.0ProXaa: 0.0 ± 0.0
Gln
2.781GlnAla: 2.781 ± 0.869
0.0GlnCys: 0.0 ± 0.0
2.163GlnAsp: 2.163 ± 0.826
3.399GlnGlu: 3.399 ± 0.802
3.09GlnPhe: 3.09 ± 0.825
1.854GlnGly: 1.854 ± 1.044
0.618GlnHis: 0.618 ± 0.44
1.854GlnIle: 1.854 ± 0.78
4.635GlnLys: 4.635 ± 1.037
4.326GlnLeu: 4.326 ± 1.1
0.618GlnMet: 0.618 ± 0.458
3.09GlnAsn: 3.09 ± 0.945
1.854GlnPro: 1.854 ± 0.674
2.163GlnGln: 2.163 ± 1.048
1.236GlnArg: 1.236 ± 0.517
2.472GlnSer: 2.472 ± 0.884
3.708GlnThr: 3.708 ± 1.059
4.944GlnVal: 4.944 ± 1.278
0.0GlnTrp: 0.0 ± 0.0
1.854GlnTyr: 1.854 ± 0.862
0.0GlnXaa: 0.0 ± 0.0
Arg
3.09ArgAla: 3.09 ± 0.773
0.309ArgCys: 0.309 ± 0.302
4.326ArgAsp: 4.326 ± 0.965
2.781ArgGlu: 2.781 ± 0.767
1.236ArgPhe: 1.236 ± 0.585
0.618ArgGly: 0.618 ± 0.388
0.618ArgHis: 0.618 ± 0.348
2.472ArgIle: 2.472 ± 0.779
5.562ArgLys: 5.562 ± 1.014
5.562ArgLeu: 5.562 ± 1.102
0.927ArgMet: 0.927 ± 0.479
1.545ArgAsn: 1.545 ± 0.798
0.618ArgPro: 0.618 ± 0.512
4.944ArgGln: 4.944 ± 0.819
1.236ArgArg: 1.236 ± 0.602
3.09ArgSer: 3.09 ± 0.717
4.635ArgThr: 4.635 ± 1.325
2.472ArgVal: 2.472 ± 1.06
0.927ArgTrp: 0.927 ± 0.507
2.163ArgTyr: 2.163 ± 0.772
0.0ArgXaa: 0.0 ± 0.0
Ser
1.854SerAla: 1.854 ± 1.181
0.309SerCys: 0.309 ± 0.369
3.09SerAsp: 3.09 ± 0.811
4.635SerGlu: 4.635 ± 1.094
3.399SerPhe: 3.399 ± 0.867
3.399SerGly: 3.399 ± 1.275
0.0SerHis: 0.0 ± 0.0
3.708SerIle: 3.708 ± 0.935
6.18SerLys: 6.18 ± 1.408
4.326SerLeu: 4.326 ± 0.991
2.472SerMet: 2.472 ± 0.622
3.399SerAsn: 3.399 ± 0.94
2.472SerPro: 2.472 ± 0.853
2.472SerGln: 2.472 ± 0.975
3.399SerArg: 3.399 ± 0.992
2.472SerSer: 2.472 ± 0.792
1.545SerThr: 1.545 ± 0.493
3.09SerVal: 3.09 ± 0.877
1.236SerTrp: 1.236 ± 0.474
3.09SerTyr: 3.09 ± 0.601
0.0SerXaa: 0.0 ± 0.0
Thr
3.399ThrAla: 3.399 ± 1.054
0.0ThrCys: 0.0 ± 0.0
2.472ThrAsp: 2.472 ± 0.886
4.944ThrGlu: 4.944 ± 1.133
2.781ThrPhe: 2.781 ± 1.55
4.326ThrGly: 4.326 ± 0.931
1.545ThrHis: 1.545 ± 0.566
4.017ThrIle: 4.017 ± 1.483
4.017ThrLys: 4.017 ± 1.598
6.799ThrLeu: 6.799 ± 1.137
1.236ThrMet: 1.236 ± 0.512
2.472ThrAsn: 2.472 ± 1.03
3.09ThrPro: 3.09 ± 1.004
1.236ThrGln: 1.236 ± 0.846
2.163ThrArg: 2.163 ± 0.738
2.472ThrSer: 2.472 ± 0.605
4.326ThrThr: 4.326 ± 1.335
4.326ThrVal: 4.326 ± 0.948
0.0ThrTrp: 0.0 ± 0.0
5.562ThrTyr: 5.562 ± 0.977
0.0ThrXaa: 0.0 ± 0.0
Val
2.781ValAla: 2.781 ± 0.917
0.0ValCys: 0.0 ± 0.0
4.326ValAsp: 4.326 ± 1.049
5.253ValGlu: 5.253 ± 1.357
1.545ValPhe: 1.545 ± 0.585
2.472ValGly: 2.472 ± 1.107
0.309ValHis: 0.309 ± 0.285
5.253ValIle: 5.253 ± 0.9
5.562ValLys: 5.562 ± 1.112
7.108ValLeu: 7.108 ± 1.045
1.236ValMet: 1.236 ± 0.561
4.326ValAsn: 4.326 ± 0.97
1.545ValPro: 1.545 ± 0.631
0.309ValGln: 0.309 ± 0.328
2.781ValArg: 2.781 ± 0.702
5.253ValSer: 5.253 ± 1.243
4.326ValThr: 4.326 ± 1.721
3.09ValVal: 3.09 ± 1.024
0.618ValTrp: 0.618 ± 0.418
1.236ValTyr: 1.236 ± 0.496
0.0ValXaa: 0.0 ± 0.0
Trp
0.927TrpAla: 0.927 ± 0.74
0.0TrpCys: 0.0 ± 0.0
1.236TrpAsp: 1.236 ± 0.598
1.854TrpGlu: 1.854 ± 0.744
0.309TrpPhe: 0.309 ± 0.325
0.309TrpGly: 0.309 ± 0.256
0.0TrpHis: 0.0 ± 0.0
0.618TrpIle: 0.618 ± 0.409
0.618TrpLys: 0.618 ± 0.45
0.618TrpLeu: 0.618 ± 0.412
0.0TrpMet: 0.0 ± 0.0
0.309TrpAsn: 0.309 ± 0.325
0.0TrpPro: 0.0 ± 0.0
0.927TrpGln: 0.927 ± 0.476
0.618TrpArg: 0.618 ± 0.403
0.309TrpSer: 0.309 ± 0.285
0.309TrpThr: 0.309 ± 0.256
0.927TrpVal: 0.927 ± 0.558
0.309TrpTrp: 0.309 ± 0.285
0.309TrpTyr: 0.309 ± 0.331
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.399TyrAla: 3.399 ± 1.098
0.0TyrCys: 0.0 ± 0.0
2.163TyrAsp: 2.163 ± 0.709
2.472TyrGlu: 2.472 ± 0.755
1.545TyrPhe: 1.545 ± 0.577
2.472TyrGly: 2.472 ± 0.638
1.545TyrHis: 1.545 ± 0.637
1.854TyrIle: 1.854 ± 0.663
5.253TyrLys: 5.253 ± 1.23
5.871TyrLeu: 5.871 ± 0.878
1.854TyrMet: 1.854 ± 0.935
2.163TyrAsn: 2.163 ± 0.732
0.618TyrPro: 0.618 ± 0.406
3.708TyrGln: 3.708 ± 0.888
4.944TyrArg: 4.944 ± 1.251
1.854TyrSer: 1.854 ± 0.642
1.545TyrThr: 1.545 ± 0.753
3.399TyrVal: 3.399 ± 0.733
0.309TyrTrp: 0.309 ± 0.256
1.545TyrTyr: 1.545 ± 0.599
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (3237 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski