Amino acid dipepetide frequency for Streptococcus satellite phage Javan328

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.389AlaAla: 0.389 ± 0.439
0.389AlaCys: 0.389 ± 0.275
3.505AlaAsp: 3.505 ± 0.835
3.894AlaGlu: 3.894 ± 1.628
2.726AlaPhe: 2.726 ± 0.71
2.726AlaGly: 2.726 ± 1.081
1.168AlaHis: 1.168 ± 0.672
4.283AlaIle: 4.283 ± 0.967
6.62AlaLys: 6.62 ± 1.399
7.399AlaLeu: 7.399 ± 2.168
0.779AlaMet: 0.779 ± 0.571
2.726AlaAsn: 2.726 ± 0.863
0.779AlaPro: 0.779 ± 0.398
3.505AlaGln: 3.505 ± 1.158
2.726AlaArg: 2.726 ± 0.971
3.505AlaSer: 3.505 ± 0.982
2.726AlaThr: 2.726 ± 1.373
3.505AlaVal: 3.505 ± 0.796
0.389AlaTrp: 0.389 ± 0.344
1.947AlaTyr: 1.947 ± 0.709
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.779CysAsp: 0.779 ± 0.432
0.389CysGlu: 0.389 ± 0.418
0.389CysPhe: 0.389 ± 0.318
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.389CysLys: 0.389 ± 0.418
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.389CysArg: 0.389 ± 0.344
0.0CysSer: 0.0 ± 0.0
0.389CysThr: 0.389 ± 0.275
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.389AspAla: 0.389 ± 0.393
0.389AspCys: 0.389 ± 0.318
4.673AspAsp: 4.673 ± 1.759
6.62AspGlu: 6.62 ± 2.052
3.894AspPhe: 3.894 ± 1.274
4.673AspGly: 4.673 ± 1.64
0.779AspHis: 0.779 ± 0.481
5.062AspIle: 5.062 ± 1.127
5.062AspLys: 5.062 ± 1.161
7.788AspLeu: 7.788 ± 1.443
0.779AspMet: 0.779 ± 0.476
5.062AspAsn: 5.062 ± 1.155
1.947AspPro: 1.947 ± 0.881
1.947AspGln: 1.947 ± 0.785
1.947AspArg: 1.947 ± 0.741
2.726AspSer: 2.726 ± 0.71
2.726AspThr: 2.726 ± 1.412
1.947AspVal: 1.947 ± 0.837
0.779AspTrp: 0.779 ± 0.547
3.894AspTyr: 3.894 ± 1.087
0.0AspXaa: 0.0 ± 0.0
Glu
4.673GluAla: 4.673 ± 1.213
0.389GluCys: 0.389 ± 0.353
5.062GluAsp: 5.062 ± 1.04
5.062GluGlu: 5.062 ± 2.184
1.947GluPhe: 1.947 ± 1.053
1.168GluGly: 1.168 ± 0.673
0.389GluHis: 0.389 ± 0.318
7.788GluIle: 7.788 ± 1.337
9.735GluLys: 9.735 ± 2.397
10.125GluLeu: 10.125 ± 1.716
1.947GluMet: 1.947 ± 0.832
5.841GluAsn: 5.841 ± 1.078
1.947GluPro: 1.947 ± 0.929
2.336GluGln: 2.336 ± 0.986
7.009GluArg: 7.009 ± 1.526
5.062GluSer: 5.062 ± 1.029
3.115GluThr: 3.115 ± 0.841
5.841GluVal: 5.841 ± 1.635
0.389GluTrp: 0.389 ± 0.318
3.505GluTyr: 3.505 ± 1.134
0.0GluXaa: 0.0 ± 0.0
Phe
1.558PheAla: 1.558 ± 0.74
0.0PheCys: 0.0 ± 0.0
1.947PheAsp: 1.947 ± 0.943
3.115PheGlu: 3.115 ± 1.003
0.779PhePhe: 0.779 ± 0.432
1.947PheGly: 1.947 ± 0.565
0.779PheHis: 0.779 ± 0.524
3.115PheIle: 3.115 ± 1.087
3.115PheLys: 3.115 ± 1.371
3.894PheLeu: 3.894 ± 1.304
1.947PheMet: 1.947 ± 0.832
1.558PheAsn: 1.558 ± 0.677
0.0PhePro: 0.0 ± 0.0
1.947PheGln: 1.947 ± 0.752
1.558PheArg: 1.558 ± 0.547
3.115PheSer: 3.115 ± 0.735
1.558PheThr: 1.558 ± 0.872
1.558PheVal: 1.558 ± 0.684
0.0PheTrp: 0.0 ± 0.0
2.336PheTyr: 2.336 ± 0.805
0.0PheXaa: 0.0 ± 0.0
Gly
3.115GlyAla: 3.115 ± 1.361
0.779GlyCys: 0.779 ± 0.519
1.947GlyAsp: 1.947 ± 1.013
1.947GlyGlu: 1.947 ± 0.812
1.947GlyPhe: 1.947 ± 0.776
2.726GlyGly: 2.726 ± 1.132
1.947GlyHis: 1.947 ± 0.858
4.283GlyIle: 4.283 ± 1.083
2.726GlyLys: 2.726 ± 1.017
5.452GlyLeu: 5.452 ± 1.315
1.558GlyMet: 1.558 ± 1.106
2.336GlyAsn: 2.336 ± 0.793
0.389GlyPro: 0.389 ± 0.318
1.558GlyGln: 1.558 ± 0.648
2.726GlyArg: 2.726 ± 1.127
4.673GlySer: 4.673 ± 1.169
2.726GlyThr: 2.726 ± 1.061
3.115GlyVal: 3.115 ± 0.923
0.0GlyTrp: 0.0 ± 0.0
3.115GlyTyr: 3.115 ± 0.929
0.0GlyXaa: 0.0 ± 0.0
His
1.558HisAla: 1.558 ± 0.703
0.389HisCys: 0.389 ± 0.318
1.168HisAsp: 1.168 ± 0.844
1.558HisGlu: 1.558 ± 0.529
1.947HisPhe: 1.947 ± 0.787
1.947HisGly: 1.947 ± 0.668
0.0HisHis: 0.0 ± 0.0
0.779HisIle: 0.779 ± 0.423
0.389HisLys: 0.389 ± 0.318
2.336HisLeu: 2.336 ± 0.749
1.168HisMet: 1.168 ± 0.615
0.389HisAsn: 0.389 ± 0.318
0.0HisPro: 0.0 ± 0.0
0.779HisGln: 0.779 ± 0.55
0.779HisArg: 0.779 ± 0.526
1.947HisSer: 1.947 ± 0.829
0.389HisThr: 0.389 ± 0.344
1.558HisVal: 1.558 ± 0.468
0.0HisTrp: 0.0 ± 0.0
0.779HisTyr: 0.779 ± 0.539
0.0HisXaa: 0.0 ± 0.0
Ile
3.115IleAla: 3.115 ± 0.808
0.389IleCys: 0.389 ± 0.418
7.009IleAsp: 7.009 ± 1.502
2.726IleGlu: 2.726 ± 0.857
2.336IlePhe: 2.336 ± 0.817
3.505IleGly: 3.505 ± 0.965
1.558IleHis: 1.558 ± 0.729
3.894IleIle: 3.894 ± 1.172
7.399IleLys: 7.399 ± 1.807
5.841IleLeu: 5.841 ± 0.983
0.389IleMet: 0.389 ± 0.51
5.452IleAsn: 5.452 ± 1.258
2.336IlePro: 2.336 ± 0.655
2.726IleGln: 2.726 ± 0.983
2.726IleArg: 2.726 ± 1.184
5.452IleSer: 5.452 ± 1.438
4.673IleThr: 4.673 ± 1.306
2.336IleVal: 2.336 ± 1.139
0.779IleTrp: 0.779 ± 0.481
2.336IleTyr: 2.336 ± 0.534
0.0IleXaa: 0.0 ± 0.0
Lys
6.231LysAla: 6.231 ± 2.274
0.0LysCys: 0.0 ± 0.0
4.283LysAsp: 4.283 ± 0.919
10.903LysGlu: 10.903 ± 2.554
3.115LysPhe: 3.115 ± 0.838
3.894LysGly: 3.894 ± 1.061
1.947LysHis: 1.947 ± 0.845
5.841LysIle: 5.841 ± 1.219
6.62LysLys: 6.62 ± 1.751
8.956LysLeu: 8.956 ± 0.854
3.115LysMet: 3.115 ± 0.924
7.399LysAsn: 7.399 ± 1.353
1.558LysPro: 1.558 ± 0.665
2.336LysGln: 2.336 ± 0.816
5.062LysArg: 5.062 ± 1.449
4.673LysSer: 4.673 ± 1.492
5.452LysThr: 5.452 ± 1.183
6.231LysVal: 6.231 ± 1.222
0.779LysTrp: 0.779 ± 0.58
1.947LysTyr: 1.947 ± 0.824
0.0LysXaa: 0.0 ± 0.0
Leu
8.178LeuAla: 8.178 ± 1.414
0.0LeuCys: 0.0 ± 0.0
6.62LeuAsp: 6.62 ± 1.096
12.072LeuGlu: 12.072 ± 2.476
2.726LeuPhe: 2.726 ± 0.997
5.452LeuGly: 5.452 ± 1.385
1.168LeuHis: 1.168 ± 0.481
6.231LeuIle: 6.231 ± 1.341
10.125LeuLys: 10.125 ± 1.433
12.85LeuLeu: 12.85 ± 1.683
1.947LeuMet: 1.947 ± 0.858
5.452LeuAsn: 5.452 ± 0.923
3.894LeuPro: 3.894 ± 1.49
5.062LeuGln: 5.062 ± 1.424
3.505LeuArg: 3.505 ± 0.908
7.009LeuSer: 7.009 ± 1.584
8.178LeuThr: 8.178 ± 1.678
2.726LeuVal: 2.726 ± 0.925
1.168LeuTrp: 1.168 ± 0.486
3.505LeuTyr: 3.505 ± 0.836
0.0LeuXaa: 0.0 ± 0.0
Met
2.726MetAla: 2.726 ± 1.398
0.0MetCys: 0.0 ± 0.0
2.336MetAsp: 2.336 ± 1.097
2.336MetGlu: 2.336 ± 1.085
0.389MetPhe: 0.389 ± 0.402
1.168MetGly: 1.168 ± 0.666
0.389MetHis: 0.389 ± 0.275
1.168MetIle: 1.168 ± 0.501
1.947MetLys: 1.947 ± 0.894
1.558MetLeu: 1.558 ± 0.666
0.0MetMet: 0.0 ± 0.0
1.947MetAsn: 1.947 ± 1.064
0.389MetPro: 0.389 ± 0.275
1.558MetGln: 1.558 ± 0.506
0.0MetArg: 0.0 ± 0.0
0.779MetSer: 0.779 ± 0.553
3.505MetThr: 3.505 ± 1.094
1.558MetVal: 1.558 ± 1.05
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.726AsnAla: 2.726 ± 0.833
0.0AsnCys: 0.0 ± 0.0
3.894AsnAsp: 3.894 ± 1.258
5.062AsnGlu: 5.062 ± 1.367
1.558AsnPhe: 1.558 ± 0.965
1.558AsnGly: 1.558 ± 0.752
2.726AsnHis: 2.726 ± 0.778
1.947AsnIle: 1.947 ± 1.002
4.673AsnLys: 4.673 ± 1.345
5.452AsnLeu: 5.452 ± 1.205
1.558AsnMet: 1.558 ± 0.767
5.452AsnAsn: 5.452 ± 1.362
1.947AsnPro: 1.947 ± 0.476
1.947AsnGln: 1.947 ± 0.495
4.673AsnArg: 4.673 ± 1.302
4.283AsnSer: 4.283 ± 1.103
2.726AsnThr: 2.726 ± 1.01
3.115AsnVal: 3.115 ± 1.276
1.168AsnTrp: 1.168 ± 0.59
2.726AsnTyr: 2.726 ± 1.073
0.0AsnXaa: 0.0 ± 0.0
Pro
2.336ProAla: 2.336 ± 0.735
0.0ProCys: 0.0 ± 0.0
1.558ProAsp: 1.558 ± 0.979
1.947ProGlu: 1.947 ± 0.886
0.779ProPhe: 0.779 ± 0.573
0.779ProGly: 0.779 ± 0.398
0.0ProHis: 0.0 ± 0.0
1.947ProIle: 1.947 ± 0.671
2.726ProLys: 2.726 ± 0.796
2.336ProLeu: 2.336 ± 0.752
0.0ProMet: 0.0 ± 0.0
2.336ProAsn: 2.336 ± 0.796
1.558ProPro: 1.558 ± 0.739
0.779ProGln: 0.779 ± 0.478
1.947ProArg: 1.947 ± 0.63
1.947ProSer: 1.947 ± 0.717
1.168ProThr: 1.168 ± 0.679
0.389ProVal: 0.389 ± 0.275
0.389ProTrp: 0.389 ± 0.427
1.558ProTyr: 1.558 ± 0.684
0.0ProXaa: 0.0 ± 0.0
Gln
2.336GlnAla: 2.336 ± 1.2
0.0GlnCys: 0.0 ± 0.0
1.947GlnAsp: 1.947 ± 0.731
3.894GlnGlu: 3.894 ± 1.054
1.558GlnPhe: 1.558 ± 0.643
1.168GlnGly: 1.168 ± 0.544
0.779GlnHis: 0.779 ± 0.478
1.558GlnIle: 1.558 ± 0.796
5.452GlnLys: 5.452 ± 2.164
2.726GlnLeu: 2.726 ± 1.293
1.168GlnMet: 1.168 ± 0.796
1.947GlnAsn: 1.947 ± 0.865
0.779GlnPro: 0.779 ± 0.629
1.947GlnGln: 1.947 ± 1.307
1.558GlnArg: 1.558 ± 0.889
1.947GlnSer: 1.947 ± 0.724
2.726GlnThr: 2.726 ± 0.977
3.894GlnVal: 3.894 ± 1.201
0.389GlnTrp: 0.389 ± 0.318
2.336GlnTyr: 2.336 ± 0.966
0.0GlnXaa: 0.0 ± 0.0
Arg
2.726ArgAla: 2.726 ± 1.043
0.0ArgCys: 0.0 ± 0.0
3.115ArgAsp: 3.115 ± 1.015
3.894ArgGlu: 3.894 ± 0.915
1.558ArgPhe: 1.558 ± 0.594
2.726ArgGly: 2.726 ± 1.005
1.168ArgHis: 1.168 ± 0.704
3.115ArgIle: 3.115 ± 1.029
3.894ArgLys: 3.894 ± 1.116
7.009ArgLeu: 7.009 ± 1.442
0.779ArgMet: 0.779 ± 0.436
1.558ArgAsn: 1.558 ± 0.441
1.947ArgPro: 1.947 ± 0.841
1.947ArgGln: 1.947 ± 1.026
1.947ArgArg: 1.947 ± 0.669
1.558ArgSer: 1.558 ± 0.607
3.115ArgThr: 3.115 ± 0.769
3.115ArgVal: 3.115 ± 1.018
0.389ArgTrp: 0.389 ± 0.49
3.505ArgTyr: 3.505 ± 1.041
0.0ArgXaa: 0.0 ± 0.0
Ser
3.894SerAla: 3.894 ± 1.355
0.0SerCys: 0.0 ± 0.0
5.062SerAsp: 5.062 ± 1.01
4.673SerGlu: 4.673 ± 1.412
1.947SerPhe: 1.947 ± 0.61
3.115SerGly: 3.115 ± 0.93
1.947SerHis: 1.947 ± 0.565
3.894SerIle: 3.894 ± 1.08
3.894SerLys: 3.894 ± 1.054
7.009SerLeu: 7.009 ± 1.749
1.947SerMet: 1.947 ± 1.142
1.947SerAsn: 1.947 ± 1.225
2.336SerPro: 2.336 ± 0.719
0.779SerGln: 0.779 ± 0.597
3.115SerArg: 3.115 ± 0.636
3.505SerSer: 3.505 ± 1.302
4.673SerThr: 4.673 ± 1.495
2.726SerVal: 2.726 ± 0.587
0.389SerTrp: 0.389 ± 0.344
3.505SerTyr: 3.505 ± 1.041
0.0SerXaa: 0.0 ± 0.0
Thr
3.894ThrAla: 3.894 ± 0.943
0.0ThrCys: 0.0 ± 0.0
3.115ThrAsp: 3.115 ± 1.254
5.841ThrGlu: 5.841 ± 1.297
3.894ThrPhe: 3.894 ± 1.507
3.115ThrGly: 3.115 ± 1.062
1.558ThrHis: 1.558 ± 0.655
3.115ThrIle: 3.115 ± 0.814
3.894ThrLys: 3.894 ± 0.931
7.009ThrLeu: 7.009 ± 1.128
0.779ThrMet: 0.779 ± 0.688
3.505ThrAsn: 3.505 ± 1.204
1.168ThrPro: 1.168 ± 0.793
3.894ThrGln: 3.894 ± 1.367
1.947ThrArg: 1.947 ± 0.661
2.336ThrSer: 2.336 ± 0.913
4.283ThrThr: 4.283 ± 1.179
3.505ThrVal: 3.505 ± 1.286
0.389ThrTrp: 0.389 ± 0.275
1.947ThrTyr: 1.947 ± 1.379
0.0ThrXaa: 0.0 ± 0.0
Val
4.283ValAla: 4.283 ± 1.363
0.0ValCys: 0.0 ± 0.0
2.726ValAsp: 2.726 ± 1.312
3.894ValGlu: 3.894 ± 1.279
1.558ValPhe: 1.558 ± 0.457
3.894ValGly: 3.894 ± 1.233
0.779ValHis: 0.779 ± 0.365
3.505ValIle: 3.505 ± 1.188
5.452ValLys: 5.452 ± 1.479
5.062ValLeu: 5.062 ± 1.012
2.336ValMet: 2.336 ± 1.015
1.558ValAsn: 1.558 ± 0.658
1.558ValPro: 1.558 ± 0.715
1.558ValGln: 1.558 ± 0.975
1.558ValArg: 1.558 ± 0.691
2.726ValSer: 2.726 ± 1.116
3.115ValThr: 3.115 ± 1.344
5.841ValVal: 5.841 ± 1.314
0.779ValTrp: 0.779 ± 0.636
1.558ValTyr: 1.558 ± 0.835
0.0ValXaa: 0.0 ± 0.0
Trp
0.389TrpAla: 0.389 ± 0.318
0.0TrpCys: 0.0 ± 0.0
0.389TrpAsp: 0.389 ± 0.318
1.558TrpGlu: 1.558 ± 0.749
0.0TrpPhe: 0.0 ± 0.0
1.168TrpGly: 1.168 ± 0.619
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.168TrpLys: 1.168 ± 0.419
1.558TrpLeu: 1.558 ± 0.506
0.0TrpMet: 0.0 ± 0.0
0.389TrpAsn: 0.389 ± 0.49
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.779TrpArg: 0.779 ± 0.48
0.779TrpSer: 0.779 ± 0.423
0.389TrpThr: 0.389 ± 0.481
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.389TrpTyr: 0.389 ± 0.481
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.168TyrAla: 1.168 ± 0.586
0.0TyrCys: 0.0 ± 0.0
1.947TyrAsp: 1.947 ± 0.631
1.947TyrGlu: 1.947 ± 1.035
0.779TyrPhe: 0.779 ± 0.463
2.336TyrGly: 2.336 ± 1.215
0.779TyrHis: 0.779 ± 0.398
5.841TyrIle: 5.841 ± 1.395
5.062TyrLys: 5.062 ± 1.855
3.505TyrLeu: 3.505 ± 0.622
1.168TyrMet: 1.168 ± 0.633
2.336TyrAsn: 2.336 ± 1.133
1.947TyrPro: 1.947 ± 0.789
3.505TyrGln: 3.505 ± 1.439
3.115TyrArg: 3.115 ± 1.579
2.336TyrSer: 2.336 ± 1.024
1.558TyrThr: 1.558 ± 0.634
0.779TyrVal: 0.779 ± 0.581
0.779TyrTrp: 0.779 ± 0.398
3.505TyrTyr: 3.505 ± 1.214
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 17 proteins (2569 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski