Amino acid dipepetide frequency for Streptococcus satellite phage Javan309

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.581AlaAla: 0.581 ± 0.556
0.581AlaCys: 0.581 ± 0.32
3.488AlaAsp: 3.488 ± 0.62
4.942AlaGlu: 4.942 ± 1.463
3.488AlaPhe: 3.488 ± 1.086
3.488AlaGly: 3.488 ± 0.803
0.291AlaHis: 0.291 ± 0.291
5.233AlaIle: 5.233 ± 1.315
4.651AlaLys: 4.651 ± 0.899
4.651AlaLeu: 4.651 ± 1.382
1.744AlaMet: 1.744 ± 1.186
4.36AlaAsn: 4.36 ± 0.875
1.453AlaPro: 1.453 ± 0.447
2.326AlaGln: 2.326 ± 0.875
2.616AlaArg: 2.616 ± 0.739
3.488AlaSer: 3.488 ± 0.959
4.36AlaThr: 4.36 ± 1.16
3.488AlaVal: 3.488 ± 0.521
0.872AlaTrp: 0.872 ± 0.423
1.453AlaTyr: 1.453 ± 0.639
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.291CysCys: 0.291 ± 0.278
1.163CysAsp: 1.163 ± 0.649
0.291CysGlu: 0.291 ± 0.278
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.291CysIle: 0.291 ± 0.282
0.291CysLys: 0.291 ± 0.239
0.0CysLeu: 0.0 ± 0.0
0.291CysMet: 0.291 ± 0.364
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.872CysGln: 0.872 ± 0.482
0.581CysArg: 0.581 ± 0.374
0.291CysSer: 0.291 ± 0.278
0.872CysThr: 0.872 ± 0.423
0.291CysVal: 0.291 ± 0.305
0.0CysTrp: 0.0 ± 0.0
0.872CysTyr: 0.872 ± 0.574
0.0CysXaa: 0.0 ± 0.0
Asp
0.872AspAla: 0.872 ± 0.418
1.163AspCys: 1.163 ± 0.748
4.07AspAsp: 4.07 ± 1.173
4.942AspGlu: 4.942 ± 1.009
3.198AspPhe: 3.198 ± 1.026
3.488AspGly: 3.488 ± 1.074
1.453AspHis: 1.453 ± 0.841
5.814AspIle: 5.814 ± 1.329
4.942AspLys: 4.942 ± 1.126
4.651AspLeu: 4.651 ± 0.891
1.744AspMet: 1.744 ± 0.528
3.779AspAsn: 3.779 ± 1.146
1.744AspPro: 1.744 ± 1.109
2.035AspGln: 2.035 ± 0.611
2.907AspArg: 2.907 ± 0.981
4.07AspSer: 4.07 ± 1.15
3.198AspThr: 3.198 ± 0.737
2.035AspVal: 2.035 ± 0.686
0.291AspTrp: 0.291 ± 0.307
3.488AspTyr: 3.488 ± 0.896
0.0AspXaa: 0.0 ± 0.0
Glu
4.36GluAla: 4.36 ± 1.032
0.581GluCys: 0.581 ± 0.391
4.36GluAsp: 4.36 ± 0.883
7.849GluGlu: 7.849 ± 1.626
1.453GluPhe: 1.453 ± 0.687
2.326GluGly: 2.326 ± 0.971
0.872GluHis: 0.872 ± 0.507
6.977GluIle: 6.977 ± 1.235
6.686GluLys: 6.686 ± 0.97
9.012GluLeu: 9.012 ± 1.138
2.616GluMet: 2.616 ± 0.817
7.267GluAsn: 7.267 ± 1.245
2.326GluPro: 2.326 ± 0.799
2.326GluGln: 2.326 ± 1.258
6.686GluArg: 6.686 ± 1.215
3.198GluSer: 3.198 ± 0.791
4.942GluThr: 4.942 ± 1.188
3.779GluVal: 3.779 ± 1.319
1.163GluTrp: 1.163 ± 0.904
3.779GluTyr: 3.779 ± 0.714
0.0GluXaa: 0.0 ± 0.0
Phe
2.326PheAla: 2.326 ± 0.875
0.581PheCys: 0.581 ± 0.387
3.488PheAsp: 3.488 ± 0.747
4.07PheGlu: 4.07 ± 1.194
1.453PhePhe: 1.453 ± 0.59
3.198PheGly: 3.198 ± 0.597
0.581PheHis: 0.581 ± 0.321
2.616PheIle: 2.616 ± 0.721
3.198PheLys: 3.198 ± 0.828
4.942PheLeu: 4.942 ± 1.282
0.291PheMet: 0.291 ± 0.358
2.035PheAsn: 2.035 ± 0.921
0.291PhePro: 0.291 ± 0.369
1.744PheGln: 1.744 ± 0.728
2.326PheArg: 2.326 ± 0.569
2.616PheSer: 2.616 ± 0.51
2.035PheThr: 2.035 ± 0.743
2.035PheVal: 2.035 ± 0.816
0.581PheTrp: 0.581 ± 0.443
2.907PheTyr: 2.907 ± 0.718
0.0PheXaa: 0.0 ± 0.0
Gly
2.616GlyAla: 2.616 ± 1.097
0.291GlyCys: 0.291 ± 0.239
3.198GlyAsp: 3.198 ± 1.107
3.779GlyGlu: 3.779 ± 0.958
3.198GlyPhe: 3.198 ± 0.893
2.035GlyGly: 2.035 ± 0.557
1.163GlyHis: 1.163 ± 0.41
3.488GlyIle: 3.488 ± 0.946
3.488GlyLys: 3.488 ± 1.086
6.977GlyLeu: 6.977 ± 1.439
1.744GlyMet: 1.744 ± 0.662
1.744GlyAsn: 1.744 ± 0.55
0.291GlyPro: 0.291 ± 0.278
2.616GlyGln: 2.616 ± 0.919
1.744GlyArg: 1.744 ± 0.6
3.488GlySer: 3.488 ± 1.095
2.907GlyThr: 2.907 ± 0.934
3.198GlyVal: 3.198 ± 0.863
0.581GlyTrp: 0.581 ± 0.324
3.488GlyTyr: 3.488 ± 0.887
0.0GlyXaa: 0.0 ± 0.0
His
1.453HisAla: 1.453 ± 0.657
0.0HisCys: 0.0 ± 0.0
0.291HisAsp: 0.291 ± 0.291
2.326HisGlu: 2.326 ± 0.55
0.872HisPhe: 0.872 ± 0.522
2.326HisGly: 2.326 ± 0.816
1.163HisHis: 1.163 ± 0.446
0.291HisIle: 0.291 ± 0.239
1.744HisLys: 1.744 ± 0.797
2.035HisLeu: 2.035 ± 0.619
0.581HisMet: 0.581 ± 0.468
0.0HisAsn: 0.0 ± 0.0
0.581HisPro: 0.581 ± 0.397
1.163HisGln: 1.163 ± 0.593
1.163HisArg: 1.163 ± 0.536
1.744HisSer: 1.744 ± 0.636
1.453HisThr: 1.453 ± 0.669
1.453HisVal: 1.453 ± 0.476
0.291HisTrp: 0.291 ± 0.268
1.163HisTyr: 1.163 ± 0.562
0.0HisXaa: 0.0 ± 0.0
Ile
4.07IleAla: 4.07 ± 0.979
0.581IleCys: 0.581 ± 0.399
4.07IleAsp: 4.07 ± 1.117
4.942IleGlu: 4.942 ± 1.212
2.616IlePhe: 2.616 ± 0.76
2.907IleGly: 2.907 ± 0.716
1.744IleHis: 1.744 ± 0.727
3.488IleIle: 3.488 ± 0.901
6.105IleLys: 6.105 ± 1.511
5.523IleLeu: 5.523 ± 0.822
0.872IleMet: 0.872 ± 0.526
3.488IleAsn: 3.488 ± 1.187
3.198IlePro: 3.198 ± 0.636
2.907IleGln: 2.907 ± 0.567
4.07IleArg: 4.07 ± 0.96
3.198IleSer: 3.198 ± 0.913
4.36IleThr: 4.36 ± 1.436
2.907IleVal: 2.907 ± 0.818
1.453IleTrp: 1.453 ± 0.561
2.616IleTyr: 2.616 ± 0.703
0.0IleXaa: 0.0 ± 0.0
Lys
6.395LysAla: 6.395 ± 1.78
0.0LysCys: 0.0 ± 0.0
4.36LysAsp: 4.36 ± 0.826
8.721LysGlu: 8.721 ± 1.287
2.907LysPhe: 2.907 ± 0.599
4.07LysGly: 4.07 ± 1.406
2.616LysHis: 2.616 ± 0.746
7.267LysIle: 7.267 ± 1.243
5.814LysLys: 5.814 ± 1.397
6.105LysLeu: 6.105 ± 1.66
2.035LysMet: 2.035 ± 0.7
3.198LysAsn: 3.198 ± 1.179
4.07LysPro: 4.07 ± 1.136
3.779LysGln: 3.779 ± 0.83
5.814LysArg: 5.814 ± 1.286
4.07LysSer: 4.07 ± 1.504
5.814LysThr: 5.814 ± 1.708
3.488LysVal: 3.488 ± 0.779
0.872LysTrp: 0.872 ± 0.542
3.198LysTyr: 3.198 ± 0.853
0.0LysXaa: 0.0 ± 0.0
Leu
6.395LeuAla: 6.395 ± 1.165
0.291LeuCys: 0.291 ± 0.278
8.14LeuAsp: 8.14 ± 1.804
7.849LeuGlu: 7.849 ± 1.39
2.907LeuPhe: 2.907 ± 1.001
4.942LeuGly: 4.942 ± 1.321
1.744LeuHis: 1.744 ± 0.511
4.07LeuIle: 4.07 ± 0.829
9.012LeuLys: 9.012 ± 1.917
10.174LeuLeu: 10.174 ± 1.586
0.872LeuMet: 0.872 ± 0.546
5.233LeuAsn: 5.233 ± 1.577
4.07LeuPro: 4.07 ± 1.416
3.488LeuGln: 3.488 ± 0.685
3.779LeuArg: 3.779 ± 0.945
5.814LeuSer: 5.814 ± 1.785
5.814LeuThr: 5.814 ± 0.731
6.686LeuVal: 6.686 ± 0.962
1.453LeuTrp: 1.453 ± 0.596
2.907LeuTyr: 2.907 ± 0.929
0.0LeuXaa: 0.0 ± 0.0
Met
2.326MetAla: 2.326 ± 0.896
0.0MetCys: 0.0 ± 0.0
2.035MetAsp: 2.035 ± 0.741
1.744MetGlu: 1.744 ± 0.71
1.453MetPhe: 1.453 ± 0.617
1.453MetGly: 1.453 ± 0.648
0.581MetHis: 0.581 ± 0.32
1.163MetIle: 1.163 ± 0.461
1.163MetLys: 1.163 ± 0.772
1.744MetLeu: 1.744 ± 0.583
0.581MetMet: 0.581 ± 0.366
2.326MetAsn: 2.326 ± 0.632
0.291MetPro: 0.291 ± 0.234
0.872MetGln: 0.872 ± 0.459
1.163MetArg: 1.163 ± 0.487
1.453MetSer: 1.453 ± 0.73
4.651MetThr: 4.651 ± 0.982
0.872MetVal: 0.872 ± 0.646
0.291MetTrp: 0.291 ± 0.278
0.291MetTyr: 0.291 ± 0.307
0.0MetXaa: 0.0 ± 0.0
Asn
2.907AsnAla: 2.907 ± 0.726
0.0AsnCys: 0.0 ± 0.0
2.035AsnAsp: 2.035 ± 0.834
3.198AsnGlu: 3.198 ± 0.866
1.744AsnPhe: 1.744 ± 0.599
3.779AsnGly: 3.779 ± 1.013
1.744AsnHis: 1.744 ± 0.561
1.453AsnIle: 1.453 ± 0.69
4.36AsnLys: 4.36 ± 1.096
5.233AsnLeu: 5.233 ± 1.294
2.035AsnMet: 2.035 ± 0.673
2.326AsnAsn: 2.326 ± 0.988
2.907AsnPro: 2.907 ± 1.228
2.035AsnGln: 2.035 ± 0.874
2.907AsnArg: 2.907 ± 1.14
4.07AsnSer: 4.07 ± 1.036
2.035AsnThr: 2.035 ± 0.95
2.616AsnVal: 2.616 ± 0.821
0.872AsnTrp: 0.872 ± 0.498
1.744AsnTyr: 1.744 ± 0.494
0.0AsnXaa: 0.0 ± 0.0
Pro
2.035ProAla: 2.035 ± 0.526
0.0ProCys: 0.0 ± 0.0
1.744ProAsp: 1.744 ± 0.825
4.942ProGlu: 4.942 ± 1.204
2.035ProPhe: 2.035 ± 0.946
0.581ProGly: 0.581 ± 0.32
0.0ProHis: 0.0 ± 0.0
2.035ProIle: 2.035 ± 0.56
2.616ProLys: 2.616 ± 0.75
2.907ProLeu: 2.907 ± 0.669
0.291ProMet: 0.291 ± 0.274
2.326ProAsn: 2.326 ± 0.861
1.163ProPro: 1.163 ± 0.587
0.0ProGln: 0.0 ± 0.0
2.326ProArg: 2.326 ± 0.841
1.744ProSer: 1.744 ± 0.827
1.744ProThr: 1.744 ± 0.75
1.744ProVal: 1.744 ± 0.636
0.0ProTrp: 0.0 ± 0.0
3.488ProTyr: 3.488 ± 1.161
0.0ProXaa: 0.0 ± 0.0
Gln
3.488GlnAla: 3.488 ± 1.208
0.0GlnCys: 0.0 ± 0.0
1.453GlnAsp: 1.453 ± 0.557
3.779GlnGlu: 3.779 ± 1.001
2.035GlnPhe: 2.035 ± 0.588
2.035GlnGly: 2.035 ± 0.771
0.872GlnHis: 0.872 ± 0.396
2.907GlnIle: 2.907 ± 0.8
4.651GlnLys: 4.651 ± 1.849
3.198GlnLeu: 3.198 ± 0.952
0.872GlnMet: 0.872 ± 0.425
1.163GlnAsn: 1.163 ± 0.506
0.581GlnPro: 0.581 ± 0.547
2.616GlnGln: 2.616 ± 1.042
1.744GlnArg: 1.744 ± 0.681
2.907GlnSer: 2.907 ± 0.7
1.163GlnThr: 1.163 ± 0.636
2.035GlnVal: 2.035 ± 0.724
0.291GlnTrp: 0.291 ± 0.305
2.035GlnTyr: 2.035 ± 0.755
0.0GlnXaa: 0.0 ± 0.0
Arg
3.198ArgAla: 3.198 ± 0.98
0.291ArgCys: 0.291 ± 0.278
3.779ArgAsp: 3.779 ± 0.949
4.942ArgGlu: 4.942 ± 0.986
3.198ArgPhe: 3.198 ± 0.96
2.326ArgGly: 2.326 ± 0.494
1.453ArgHis: 1.453 ± 0.785
2.907ArgIle: 2.907 ± 0.681
4.651ArgLys: 4.651 ± 1.111
4.651ArgLeu: 4.651 ± 1.138
2.035ArgMet: 2.035 ± 0.668
2.616ArgAsn: 2.616 ± 0.846
1.744ArgPro: 1.744 ± 0.569
2.616ArgGln: 2.616 ± 0.777
2.035ArgArg: 2.035 ± 0.586
2.326ArgSer: 2.326 ± 0.702
2.035ArgThr: 2.035 ± 0.51
1.453ArgVal: 1.453 ± 0.706
0.0ArgTrp: 0.0 ± 0.0
3.198ArgTyr: 3.198 ± 0.901
0.0ArgXaa: 0.0 ± 0.0
Ser
3.198SerAla: 3.198 ± 0.868
0.291SerCys: 0.291 ± 0.239
3.779SerAsp: 3.779 ± 0.965
4.07SerGlu: 4.07 ± 1.065
2.907SerPhe: 2.907 ± 0.646
2.907SerGly: 2.907 ± 0.792
1.453SerHis: 1.453 ± 0.516
3.779SerIle: 3.779 ± 0.788
4.07SerLys: 4.07 ± 0.747
5.814SerLeu: 5.814 ± 1.052
1.744SerMet: 1.744 ± 0.905
2.035SerAsn: 2.035 ± 0.69
1.744SerPro: 1.744 ± 0.549
2.616SerGln: 2.616 ± 0.873
2.326SerArg: 2.326 ± 0.724
4.36SerSer: 4.36 ± 1.125
3.198SerThr: 3.198 ± 0.538
3.198SerVal: 3.198 ± 0.683
0.581SerTrp: 0.581 ± 0.321
4.07SerTyr: 4.07 ± 0.978
0.0SerXaa: 0.0 ± 0.0
Thr
3.488ThrAla: 3.488 ± 0.704
0.291ThrCys: 0.291 ± 0.364
3.198ThrAsp: 3.198 ± 1.254
3.198ThrGlu: 3.198 ± 0.98
4.36ThrPhe: 4.36 ± 1.459
5.233ThrGly: 5.233 ± 1.381
0.872ThrHis: 0.872 ± 0.522
3.779ThrIle: 3.779 ± 0.889
5.233ThrLys: 5.233 ± 1.557
7.558ThrLeu: 7.558 ± 1.555
2.035ThrMet: 2.035 ± 0.902
1.744ThrAsn: 1.744 ± 0.962
2.326ThrPro: 2.326 ± 0.896
2.035ThrGln: 2.035 ± 0.933
2.326ThrArg: 2.326 ± 0.598
3.779ThrSer: 3.779 ± 0.778
2.326ThrThr: 2.326 ± 0.946
3.488ThrVal: 3.488 ± 0.854
1.163ThrTrp: 1.163 ± 0.472
2.326ThrTyr: 2.326 ± 1.149
0.0ThrXaa: 0.0 ± 0.0
Val
5.523ValAla: 5.523 ± 1.186
0.0ValCys: 0.0 ± 0.0
2.616ValAsp: 2.616 ± 1.47
3.779ValGlu: 3.779 ± 1.217
0.872ValPhe: 0.872 ± 0.35
3.198ValGly: 3.198 ± 1.145
1.163ValHis: 1.163 ± 0.522
3.779ValIle: 3.779 ± 1.499
3.779ValLys: 3.779 ± 0.874
4.07ValLeu: 4.07 ± 1.113
1.453ValMet: 1.453 ± 0.596
1.453ValAsn: 1.453 ± 0.618
1.453ValPro: 1.453 ± 0.632
1.453ValGln: 1.453 ± 0.681
2.035ValArg: 2.035 ± 0.687
1.744ValSer: 1.744 ± 0.604
5.233ValThr: 5.233 ± 1.33
4.36ValVal: 4.36 ± 0.974
0.291ValTrp: 0.291 ± 0.268
2.616ValTyr: 2.616 ± 0.745
0.0ValXaa: 0.0 ± 0.0
Trp
0.291TrpAla: 0.291 ± 0.278
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.453TrpGlu: 1.453 ± 0.574
0.291TrpPhe: 0.291 ± 0.278
0.581TrpGly: 0.581 ± 0.468
0.291TrpHis: 0.291 ± 0.307
0.581TrpIle: 0.581 ± 0.425
2.326TrpLys: 2.326 ± 0.551
2.326TrpLeu: 2.326 ± 0.788
0.291TrpMet: 0.291 ± 0.307
0.291TrpAsn: 0.291 ± 0.305
0.581TrpPro: 0.581 ± 0.324
0.291TrpGln: 0.291 ± 0.358
0.0TrpArg: 0.0 ± 0.0
0.872TrpSer: 0.872 ± 0.404
0.291TrpThr: 0.291 ± 0.278
0.581TrpVal: 0.581 ± 0.398
0.291TrpTrp: 0.291 ± 0.239
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.035TyrAla: 2.035 ± 1.032
1.163TyrCys: 1.163 ± 0.541
2.616TyrAsp: 2.616 ± 0.947
1.744TyrGlu: 1.744 ± 0.703
2.326TyrPhe: 2.326 ± 0.974
1.163TyrGly: 1.163 ± 0.777
2.035TyrHis: 2.035 ± 0.855
3.198TyrIle: 3.198 ± 0.944
5.814TyrLys: 5.814 ± 1.489
4.36TyrLeu: 4.36 ± 1.31
2.035TyrMet: 2.035 ± 0.831
2.616TyrAsn: 2.616 ± 1.014
2.907TyrPro: 2.907 ± 0.873
2.035TyrGln: 2.035 ± 0.805
2.907TyrArg: 2.907 ± 0.991
2.907TyrSer: 2.907 ± 0.771
2.326TyrThr: 2.326 ± 0.598
1.163TyrVal: 1.163 ± 0.53
0.291TyrTrp: 0.291 ± 0.234
2.326TyrTyr: 2.326 ± 0.746
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (3441 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski