Amino acid dipeptide frequency for Streptococcus satellite phage Javan453

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
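As a minimal sketch of how such frequencies could be computed, the Python snippet below counts overlapping adjacent residue pairs within each protein and derives the uniform expectation of 1/400. The function name `dipeptide_frequencies`, the optional `denominator` parameter, and the use of one-letter residue codes are assumptions made for illustration; the page does not describe how the database performs the calculation.

```python
from collections import Counter

# Uniform expectation for 20 standard amino acids: 1 / (20 * 20) = 0.0025 = 0.25 %
UNIFORM_FREQUENCY = 1.0 / (20 * 20)


def dipeptide_frequencies(proteins, denominator=None):
    """Return {dipeptide: fraction} for a list of protein sequences.

    Pairs are counted with overlap and never across protein boundaries.
    By default the counts are divided by the total number of pairs;
    `denominator` (hypothetical option) allows another normalisation.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = denominator if denominator is not None else sum(counts.values())
    if not total:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input, not taken from the Javan453 proteome
toy = ["AAK", "KA"]
freqs = dipeptide_frequencies(toy)
```

The smallest non-zero entry in the table, 0.349, equals 1000/2868 to three decimals, which hints that the page may normalise by the total residue count rather than by the number of pairs. This is an inference from the numbers, not something the page states; passing `denominator` reproduces that variant.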

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
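The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names `permille_to_fraction` and `classify` are hypothetical, and the assumption that the colour marking uses the uniform value of 2.5‰ (0.25%) as a plain threshold, without taking the error into account, is not confirmed by the page.

```python
# 0.25 % expressed in per mille: 1000 / 400 = 2.5
UNIFORM_PERMILLE = 1000.0 / 400


def permille_to_fraction(value_permille):
    """Convert a per-mille value from the table into a plain fraction."""
    return value_permille * 1e-3


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a table value relative to the expected per-mille frequency."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: LeuLeu (11.859) and AlaCys (1.395)
leu_leu = classify(11.859)
ala_cys = classify(1.395)
```

Under this assumption LeuLeu would fall on the overrepresented side and AlaCys on the underrepresented side.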

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 1.395 ± 0.682
AlaAsp: 2.79 ± 0.919
AlaGlu: 3.837 ± 0.936
AlaPhe: 3.837 ± 0.855
AlaGly: 1.046 ± 0.748
AlaHis: 0.698 ± 0.59
AlaIle: 4.186 ± 1.059
AlaLys: 4.883 ± 1.01
AlaLeu: 5.232 ± 1.442
AlaMet: 1.395 ± 0.969
AlaAsn: 3.488 ± 0.831
AlaPro: 1.744 ± 0.628
AlaGln: 2.79 ± 0.713
AlaArg: 3.488 ± 0.939
AlaSer: 3.488 ± 1.335
AlaThr: 3.139 ± 0.747
AlaVal: 2.79 ± 0.923
AlaTrp: 0.698 ± 0.783
AlaTyr: 1.744 ± 0.6
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.698 ± 0.475
CysCys: 0.0 ± 0.0
CysAsp: 1.046 ± 0.585
CysGlu: 0.698 ± 0.463
CysPhe: 0.349 ± 0.295
CysGly: 1.395 ± 0.846
CysHis: 0.349 ± 0.337
CysIle: 0.349 ± 0.259
CysLys: 0.349 ± 0.295
CysLeu: 1.744 ± 0.693
CysMet: 0.0 ± 0.0
CysAsn: 0.349 ± 0.369
CysPro: 0.698 ± 0.553
CysGln: 0.698 ± 0.682
CysArg: 0.698 ± 0.434
CysSer: 1.046 ± 0.589
CysThr: 0.0 ± 0.0
CysVal: 0.349 ± 0.259
CysTrp: 0.0 ± 0.0
CysTyr: 0.698 ± 0.537
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.744 ± 0.689
AspCys: 1.744 ± 0.837
AspAsp: 3.139 ± 1.196
AspGlu: 3.837 ± 1.708
AspPhe: 3.837 ± 1.642
AspGly: 1.395 ± 0.699
AspHis: 1.046 ± 0.72
AspIle: 4.883 ± 1.279
AspLys: 4.186 ± 1.379
AspLeu: 5.232 ± 1.064
AspMet: 1.744 ± 0.757
AspAsn: 2.79 ± 0.888
AspPro: 1.046 ± 0.568
AspGln: 2.093 ± 0.893
AspArg: 1.744 ± 0.733
AspSer: 2.79 ± 1.466
AspThr: 4.186 ± 1.286
AspVal: 2.093 ± 0.915
AspTrp: 0.349 ± 0.369
AspTyr: 6.278 ± 1.385
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.186 ± 1.276
GluCys: 1.395 ± 0.935
GluAsp: 4.534 ± 1.074
GluGlu: 5.232 ± 1.468
GluPhe: 2.442 ± 1.458
GluGly: 2.093 ± 0.778
GluHis: 1.744 ± 0.608
GluIle: 6.627 ± 1.076
GluLys: 5.232 ± 0.984
GluLeu: 9.766 ± 2.036
GluMet: 1.395 ± 0.839
GluAsn: 2.79 ± 1.625
GluPro: 2.093 ± 0.901
GluGln: 4.186 ± 1.518
GluArg: 3.837 ± 1.261
GluSer: 2.79 ± 1.175
GluThr: 3.488 ± 0.894
GluVal: 2.79 ± 0.844
GluTrp: 0.698 ± 0.405
GluTyr: 3.488 ± 1.01
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.093 ± 0.784
PheCys: 0.0 ± 0.0
PheAsp: 3.837 ± 1.518
PheGlu: 3.139 ± 0.907
PhePhe: 2.093 ± 0.851
PheGly: 1.395 ± 0.561
PheHis: 1.046 ± 0.535
PheIle: 4.186 ± 1.174
PheLys: 3.139 ± 1.234
PheLeu: 3.488 ± 1.034
PheMet: 0.349 ± 0.367
PheAsn: 3.139 ± 0.91
PhePro: 1.395 ± 0.602
PheGln: 1.046 ± 0.559
PheArg: 2.79 ± 1.013
PheSer: 1.744 ± 0.726
PheThr: 3.139 ± 0.78
PheVal: 1.046 ± 0.526
PheTrp: 0.698 ± 0.754
PheTyr: 1.744 ± 0.816
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.837 ± 1.573
GlyCys: 1.395 ± 0.479
GlyAsp: 2.093 ± 0.784
GlyGlu: 2.442 ± 1.104
GlyPhe: 2.093 ± 0.799
GlyGly: 2.442 ± 0.774
GlyHis: 1.046 ± 0.678
GlyIle: 4.534 ± 1.208
GlyLys: 4.883 ± 1.803
GlyLeu: 5.232 ± 1.719
GlyMet: 0.698 ± 0.442
GlyAsn: 1.744 ± 0.744
GlyPro: 0.698 ± 0.465
GlyGln: 1.744 ± 0.854
GlyArg: 1.046 ± 0.369
GlySer: 2.442 ± 0.989
GlyThr: 4.186 ± 0.988
GlyVal: 1.395 ± 0.65
GlyTrp: 1.046 ± 0.593
GlyTyr: 2.093 ± 1.089
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.093 ± 1.115
HisCys: 0.349 ± 0.259
HisAsp: 0.698 ± 0.465
HisGlu: 0.0 ± 0.0
HisPhe: 0.698 ± 0.526
HisGly: 1.744 ± 0.683
HisHis: 0.0 ± 0.0
HisIle: 1.744 ± 0.665
HisLys: 2.442 ± 0.844
HisLeu: 2.093 ± 1.058
HisMet: 0.0 ± 0.0
HisAsn: 1.046 ± 0.52
HisPro: 1.395 ± 0.847
HisGln: 1.046 ± 0.742
HisArg: 1.046 ± 0.662
HisSer: 0.349 ± 0.295
HisThr: 2.093 ± 0.734
HisVal: 0.349 ± 0.392
HisTrp: 0.698 ± 0.682
HisTyr: 1.744 ± 0.867
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.232 ± 1.398
IleCys: 0.349 ± 0.337
IleAsp: 6.278 ± 1.335
IleGlu: 6.278 ± 1.417
IlePhe: 2.79 ± 0.756
IleGly: 2.79 ± 1.104
IleHis: 1.744 ± 0.649
IleIle: 5.581 ± 1.719
IleLys: 6.278 ± 1.675
IleLeu: 5.232 ± 1.01
IleMet: 0.349 ± 0.295
IleAsn: 4.534 ± 1.447
IlePro: 4.186 ± 1.583
IleGln: 2.093 ± 1.072
IleArg: 2.442 ± 0.719
IleSer: 5.581 ± 1.518
IleThr: 5.232 ± 1.339
IleVal: 1.744 ± 0.537
IleTrp: 0.0 ± 0.0
IleTyr: 2.093 ± 0.722
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.232 ± 1.229
LysCys: 0.349 ± 0.341
LysAsp: 3.837 ± 1.431
LysGlu: 9.766 ± 1.677
LysPhe: 2.79 ± 0.647
LysGly: 5.232 ± 1.826
LysHis: 2.442 ± 0.724
LysIle: 5.93 ± 1.674
LysLys: 5.581 ± 1.564
LysLeu: 6.627 ± 1.637
LysMet: 1.744 ± 0.713
LysAsn: 4.534 ± 0.894
LysPro: 6.278 ± 1.739
LysGln: 2.093 ± 0.634
LysArg: 5.93 ± 1.259
LysSer: 2.79 ± 1.032
LysThr: 4.534 ± 1.194
LysVal: 6.278 ± 0.963
LysTrp: 1.046 ± 0.559
LysTyr: 2.79 ± 0.925
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.976 ± 1.709
LeuCys: 1.395 ± 0.597
LeuAsp: 5.93 ± 0.905
LeuGlu: 8.371 ± 2.554
LeuPhe: 3.488 ± 1.098
LeuGly: 4.883 ± 1.208
LeuHis: 1.395 ± 0.616
LeuIle: 6.278 ± 1.72
LeuLys: 8.72 ± 1.539
LeuLeu: 11.859 ± 2.183
LeuMet: 3.139 ± 0.649
LeuAsn: 4.883 ± 1.224
LeuPro: 4.186 ± 1.342
LeuGln: 2.79 ± 0.797
LeuArg: 1.046 ± 0.565
LeuSer: 7.325 ± 1.521
LeuThr: 5.232 ± 1.215
LeuVal: 3.837 ± 1.156
LeuTrp: 1.046 ± 0.369
LeuTyr: 3.837 ± 0.934
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.046 ± 0.469
MetCys: 0.0 ± 0.0
MetAsp: 0.698 ± 0.475
MetGlu: 0.698 ± 0.308
MetPhe: 0.349 ± 0.392
MetGly: 0.698 ± 0.544
MetHis: 0.349 ± 0.419
MetIle: 0.698 ± 0.451
MetLys: 2.79 ± 0.914
MetLeu: 1.046 ± 0.612
MetMet: 0.349 ± 0.378
MetAsn: 1.395 ± 0.627
MetPro: 0.0 ± 0.0
MetGln: 0.349 ± 0.378
MetArg: 2.79 ± 1.15
MetSer: 1.744 ± 0.687
MetThr: 3.139 ± 0.988
MetVal: 1.395 ± 0.631
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.488 ± 0.96
AsnCys: 0.0 ± 0.0
AsnAsp: 2.442 ± 0.84
AsnGlu: 3.139 ± 0.988
AsnPhe: 1.395 ± 0.582
AsnGly: 3.837 ± 1.191
AsnHis: 2.442 ± 0.723
AsnIle: 3.488 ± 1.582
AsnLys: 3.488 ± 0.869
AsnLeu: 4.186 ± 1.042
AsnMet: 1.046 ± 0.52
AsnAsn: 3.139 ± 1.0
AsnPro: 3.488 ± 0.926
AsnGln: 4.883 ± 1.115
AsnArg: 3.837 ± 0.859
AsnSer: 1.046 ± 0.492
AsnThr: 2.093 ± 0.892
AsnVal: 3.488 ± 0.785
AsnTrp: 0.349 ± 0.381
AsnTyr: 3.488 ± 1.186
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.093 ± 0.739
ProCys: 0.698 ± 0.514
ProAsp: 0.698 ± 0.494
ProGlu: 3.837 ± 1.173
ProPhe: 1.395 ± 0.747
ProGly: 0.698 ± 0.526
ProHis: 0.698 ± 0.421
ProIle: 2.442 ± 1.073
ProLys: 4.534 ± 1.815
ProLeu: 3.488 ± 0.995
ProMet: 1.395 ± 0.542
ProAsn: 3.488 ± 1.451
ProPro: 1.395 ± 0.695
ProGln: 2.093 ± 1.046
ProArg: 3.139 ± 0.956
ProSer: 2.093 ± 0.729
ProThr: 3.488 ± 0.776
ProVal: 2.442 ± 0.743
ProTrp: 0.349 ± 0.259
ProTyr: 2.442 ± 0.787
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.744 ± 0.671
GlnCys: 0.349 ± 0.414
GlnAsp: 1.744 ± 0.904
GlnGlu: 4.186 ± 1.109
GlnPhe: 1.744 ± 0.735
GlnGly: 2.093 ± 0.821
GlnHis: 1.046 ± 0.496
GlnIle: 3.139 ± 0.801
GlnLys: 3.837 ± 1.011
GlnLeu: 5.93 ± 1.351
GlnMet: 0.349 ± 0.295
GlnAsn: 2.442 ± 0.874
GlnPro: 2.79 ± 1.128
GlnGln: 2.442 ± 1.164
GlnArg: 4.186 ± 1.192
GlnSer: 2.79 ± 1.026
GlnThr: 2.79 ± 0.801
GlnVal: 2.79 ± 0.86
GlnTrp: 0.349 ± 0.369
GlnTyr: 2.442 ± 0.813
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.395 ± 0.75
ArgCys: 0.698 ± 0.434
ArgAsp: 3.139 ± 1.237
ArgGlu: 2.093 ± 0.737
ArgPhe: 2.093 ± 0.703
ArgGly: 2.79 ± 0.808
ArgHis: 2.093 ± 0.745
ArgIle: 2.79 ± 0.948
ArgLys: 4.534 ± 1.322
ArgLeu: 5.232 ± 0.775
ArgMet: 0.349 ± 0.354
ArgAsn: 3.139 ± 1.212
ArgPro: 1.744 ± 1.009
ArgGln: 3.139 ± 0.865
ArgArg: 2.79 ± 0.742
ArgSer: 3.488 ± 1.15
ArgThr: 3.139 ± 1.12
ArgVal: 3.837 ± 0.798
ArgTrp: 1.046 ± 0.764
ArgTyr: 3.488 ± 0.98
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.093 ± 1.004
SerCys: 0.349 ± 0.341
SerAsp: 4.534 ± 1.143
SerGlu: 2.79 ± 1.28
SerPhe: 2.442 ± 0.772
SerGly: 2.093 ± 0.714
SerHis: 0.349 ± 0.341
SerIle: 3.139 ± 1.503
SerLys: 6.278 ± 1.462
SerLeu: 4.883 ± 1.202
SerMet: 0.349 ± 0.337
SerAsn: 2.093 ± 1.213
SerPro: 1.046 ± 0.486
SerGln: 5.581 ± 1.22
SerArg: 2.093 ± 0.886
SerSer: 1.744 ± 0.553
SerThr: 3.488 ± 0.829
SerVal: 2.093 ± 1.196
SerTrp: 0.698 ± 0.546
SerTyr: 3.837 ± 1.137
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.139 ± 1.026
ThrCys: 0.0 ± 0.0
ThrAsp: 2.442 ± 0.948
ThrGlu: 2.093 ± 0.885
ThrPhe: 2.442 ± 1.326
ThrGly: 5.581 ± 1.126
ThrHis: 0.349 ± 0.295
ThrIle: 5.232 ± 1.752
ThrLys: 5.93 ± 1.63
ThrLeu: 6.627 ± 1.371
ThrMet: 1.046 ± 0.696
ThrAsn: 1.395 ± 0.752
ThrPro: 4.883 ± 1.102
ThrGln: 3.488 ± 1.184
ThrArg: 3.488 ± 0.941
ThrSer: 2.442 ± 0.944
ThrThr: 3.488 ± 1.629
ThrVal: 2.79 ± 0.822
ThrTrp: 0.349 ± 0.316
ThrTyr: 5.581 ± 0.955
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.488 ± 1.084
ValCys: 0.0 ± 0.0
ValAsp: 2.442 ± 0.959
ValGlu: 3.139 ± 1.194
ValPhe: 2.79 ± 0.741
ValGly: 2.442 ± 0.685
ValHis: 1.046 ± 0.647
ValIle: 3.837 ± 1.032
ValLys: 4.186 ± 1.001
ValLeu: 3.837 ± 1.22
ValMet: 1.395 ± 0.658
ValAsn: 3.139 ± 0.963
ValPro: 2.093 ± 0.802
ValGln: 1.395 ± 0.892
ValArg: 0.698 ± 0.405
ValSer: 3.488 ± 0.905
ValThr: 3.837 ± 1.517
ValVal: 2.442 ± 1.143
ValTrp: 0.698 ± 0.518
ValTyr: 1.046 ± 0.519
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.349 ± 0.377
TrpCys: 0.0 ± 0.0
TrpAsp: 1.395 ± 0.735
TrpGlu: 1.046 ± 0.62
TrpPhe: 0.349 ± 0.259
TrpGly: 0.349 ± 0.369
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.698 ± 0.401
TrpLeu: 1.046 ± 0.416
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.046 ± 0.825
TrpArg: 0.349 ± 0.341
TrpSer: 1.046 ± 0.369
TrpThr: 0.698 ± 0.479
TrpVal: 1.744 ± 0.698
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.349 ± 0.426
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.744 ± 0.793
TyrCys: 1.046 ± 0.519
TyrAsp: 2.442 ± 1.183
TyrGlu: 4.534 ± 1.202
TyrPhe: 2.093 ± 0.671
TyrGly: 2.093 ± 0.755
TyrHis: 1.744 ± 0.762
TyrIle: 1.744 ± 0.858
TyrLys: 3.837 ± 1.251
TyrLeu: 3.837 ± 0.798
TyrMet: 2.093 ± 0.878
TyrAsn: 5.232 ± 1.232
TyrPro: 1.744 ± 0.955
TyrGln: 4.186 ± 1.23
TyrArg: 5.581 ± 1.195
TyrSer: 1.744 ± 0.889
TyrThr: 1.395 ± 0.635
TyrVal: 1.744 ± 0.742
TyrTrp: 0.349 ± 0.341
TyrTyr: 4.186 ± 1.277
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 19 proteins (2868 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
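One way to read this note is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names, the fixed seed, the use of one-letter residue codes, and the choice of the sample standard deviation as the error measure are all assumptions; the page does not say which statistic the ± value represents.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and return the standard
    deviation of the per-mille frequency across the replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Protein-level resampling: draw len(proteins) proteins with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not taken from the Javan453 proteome
toy = ["AAKA", "KKAA", "AKAK"]
err_aa = bootstrap_error(toy, "AA")
```

A dipeptide that never occurs yields the same estimate of zero in every replicate, so its error is also zero, which is in line with the `0.0 ± 0.0` entries in the table.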

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski