Amino acid dipeptide frequency for Streptococcus satellite phage Javan50

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
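As an illustration of what such a calculation can look like, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of one-letter codes, the choice to count overlapping pairs within each protein only, and the normalization by the total number of counted pairs are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins
        # (an assumption of this sketch, not a documented rule).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the Javan50 proteome.
toy = ["MKKLA", "MKA"]
freqs = dipeptide_permille(toy)
```

For the toy input, six pairs are counted in total and "MK" occurs twice, so `freqs["MK"]` is 2/6 expressed in per mille.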

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
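The comparison against the uniform expectation can be sketched as follows. The four frequencies are copied from the table on this page; the function name `classify` and the choice of the 400-dipeptide expectation (2.5‰) as the threshold are assumptions of this example, since the page does not state which cutoff its coloring uses.

```python
# Uniform expectation in per mille.
EXPECTED_20 = 1000.0 / (20 * 20)  # 2.5 for the 400 standard dipeptides
EXPECTED_21 = 1000.0 / (21 * 21)  # about 2.268 when Xaa is included


def classify(freq_permille, expected=EXPECTED_20):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked in red
    if freq_permille < expected:
        return "under"  # would be marked in blue
    return "expected"


# Values (in per mille) taken from the table for this proteome.
examples = {"GluLeu": 9.33, "LysGlu": 9.605, "CysAla": 0.0, "AlaGln": 2.47}
labels = {name: classify(value) for name, value in examples.items()}
```

Note that a value such as AlaGln (2.47‰) falls below the 400-dipeptide expectation but above the 441-dipeptide one, so the label depends on which expectation is chosen.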

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
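The unit conversion is simple arithmetic; a short sketch is given here only to make the scale explicit. The helper names are hypothetical, and the example value 9.33‰ is the GluLeu entry from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


glu_leu_fraction = permille_to_fraction(9.33)  # GluLeu from the table
uniform_percent = permille_to_percent(2.5)     # uniform expectation, 1/400
```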

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.274 ± 0.293
AlaCys: 1.098 ± 0.45
AlaAsp: 4.665 ± 1.209
AlaGlu: 4.94 ± 1.174
AlaPhe: 2.744 ± 0.727
AlaGly: 1.647 ± 0.577
AlaHis: 0.274 ± 0.216
AlaIle: 5.763 ± 1.231
AlaLys: 3.842 ± 0.829
AlaLeu: 3.293 ± 0.924
AlaMet: 1.098 ± 0.671
AlaAsn: 4.665 ± 1.207
AlaPro: 1.372 ± 0.576
AlaGln: 2.47 ± 0.759
AlaArg: 1.921 ± 0.592
AlaSer: 3.293 ± 0.917
AlaThr: 3.568 ± 1.011
AlaVal: 3.019 ± 1.025
AlaTrp: 0.823 ± 0.422
AlaTyr: 2.744 ± 0.588
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.274 ± 0.212
CysAsp: 0.549 ± 0.357
CysGlu: 0.274 ± 0.212
CysPhe: 0.0 ± 0.0
CysGly: 0.274 ± 0.212
CysHis: 0.549 ± 0.338
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.823 ± 0.406
CysMet: 0.0 ± 0.0
CysAsn: 0.274 ± 0.26
CysPro: 0.549 ± 0.328
CysGln: 0.549 ± 0.286
CysArg: 0.549 ± 0.315
CysSer: 0.549 ± 0.41
CysThr: 0.0 ± 0.0
CysVal: 0.274 ± 0.26
CysTrp: 0.549 ± 0.366
CysTyr: 0.274 ± 0.212
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.098 ± 0.667
AspCys: 0.549 ± 0.307
AspAsp: 2.47 ± 0.927
AspGlu: 4.391 ± 0.965
AspPhe: 4.116 ± 0.953
AspGly: 1.647 ± 0.653
AspHis: 0.823 ± 0.375
AspIle: 6.037 ± 1.041
AspLys: 7.958 ± 1.026
AspLeu: 5.763 ± 0.958
AspMet: 1.647 ± 0.81
AspAsn: 3.568 ± 0.791
AspPro: 1.098 ± 0.597
AspGln: 1.098 ± 0.663
AspArg: 4.116 ± 0.611
AspSer: 2.47 ± 1.02
AspThr: 3.568 ± 0.973
AspVal: 2.195 ± 0.77
AspTrp: 0.0 ± 0.0
AspTyr: 3.568 ± 0.959
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.037 ± 1.262
GluCys: 0.549 ± 0.286
GluAsp: 4.94 ± 1.383
GluGlu: 4.391 ± 1.276
GluPhe: 2.195 ± 0.728
GluGly: 2.744 ± 0.885
GluHis: 1.647 ± 0.646
GluIle: 5.488 ± 1.091
GluLys: 6.861 ± 0.875
GluLeu: 9.33 ± 1.018
GluMet: 1.921 ± 0.708
GluAsn: 3.293 ± 0.586
GluPro: 1.647 ± 0.862
GluGln: 4.116 ± 1.416
GluArg: 2.47 ± 0.823
GluSer: 2.195 ± 0.625
GluThr: 5.488 ± 1.078
GluVal: 4.665 ± 1.287
GluTrp: 1.372 ± 0.725
GluTyr: 3.019 ± 0.811
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.372 ± 0.557
PheCys: 0.274 ± 0.312
PheAsp: 2.195 ± 0.705
PheGlu: 3.293 ± 0.787
PhePhe: 1.921 ± 0.62
PheGly: 2.195 ± 0.647
PheHis: 1.098 ± 0.41
PheIle: 5.214 ± 1.34
PheLys: 5.488 ± 0.858
PheLeu: 2.744 ± 0.764
PheMet: 0.274 ± 0.257
PheAsn: 3.842 ± 0.795
PhePro: 1.372 ± 0.638
PheGln: 1.647 ± 0.599
PheArg: 3.019 ± 0.7
PheSer: 2.744 ± 0.606
PheThr: 1.098 ± 0.492
PheVal: 2.195 ± 0.728
PheTrp: 0.274 ± 0.216
PheTyr: 1.647 ± 0.732
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.47 ± 1.116
GlyCys: 0.549 ± 0.359
GlyAsp: 3.842 ± 1.631
GlyGlu: 3.293 ± 0.855
GlyPhe: 3.293 ± 0.826
GlyGly: 2.195 ± 0.718
GlyHis: 0.274 ± 0.254
GlyIle: 2.744 ± 0.957
GlyLys: 3.842 ± 1.015
GlyLeu: 6.586 ± 1.425
GlyMet: 1.098 ± 0.444
GlyAsn: 1.372 ± 0.639
GlyPro: 0.0 ± 0.0
GlyGln: 3.293 ± 1.025
GlyArg: 1.647 ± 0.493
GlySer: 1.098 ± 0.452
GlyThr: 2.744 ± 0.738
GlyVal: 4.665 ± 0.878
GlyTrp: 0.274 ± 0.216
GlyTyr: 3.293 ± 0.843
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.647 ± 0.746
HisCys: 0.0 ± 0.0
HisAsp: 1.098 ± 0.488
HisGlu: 0.0 ± 0.0
HisPhe: 1.098 ± 0.51
HisGly: 0.823 ± 0.408
HisHis: 0.0 ± 0.0
HisIle: 0.274 ± 0.299
HisLys: 2.195 ± 0.831
HisLeu: 0.823 ± 0.558
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 1.647 ± 0.801
HisArg: 0.549 ± 0.325
HisSer: 1.098 ± 0.606
HisThr: 1.098 ± 0.609
HisVal: 0.549 ± 0.508
HisTrp: 0.549 ± 0.424
HisTyr: 2.744 ± 0.751
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.214 ± 1.105
IleCys: 0.549 ± 0.338
IleAsp: 4.94 ± 0.964
IleGlu: 5.488 ± 1.338
IlePhe: 3.293 ± 0.635
IleGly: 4.116 ± 0.846
IleHis: 1.098 ± 0.502
IleIle: 4.391 ± 0.872
IleLys: 8.507 ± 1.638
IleLeu: 6.037 ± 1.061
IleMet: 2.195 ± 0.806
IleAsn: 4.94 ± 1.054
IlePro: 3.293 ± 0.958
IleGln: 2.744 ± 0.74
IleArg: 2.744 ± 0.862
IleSer: 5.488 ± 1.469
IleThr: 5.488 ± 0.795
IleVal: 1.647 ± 0.499
IleTrp: 0.549 ± 0.432
IleTyr: 3.568 ± 1.081
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.684 ± 1.426
LysCys: 0.274 ± 0.212
LysAsp: 6.312 ± 1.396
LysGlu: 9.605 ± 1.932
LysPhe: 2.47 ± 0.937
LysGly: 4.94 ± 1.019
LysHis: 2.195 ± 0.609
LysIle: 5.488 ± 1.112
LysLys: 6.586 ± 1.7
LysLeu: 6.861 ± 1.676
LysMet: 3.019 ± 0.84
LysAsn: 2.744 ± 0.801
LysPro: 4.94 ± 1.152
LysGln: 4.665 ± 1.223
LysArg: 5.488 ± 1.127
LysSer: 6.037 ± 1.491
LysThr: 4.665 ± 1.005
LysVal: 4.116 ± 1.344
LysTrp: 0.549 ± 0.397
LysTyr: 4.665 ± 1.256
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.684 ± 1.476
LeuCys: 0.549 ± 0.341
LeuAsp: 4.94 ± 1.087
LeuGlu: 7.135 ± 1.359
LeuPhe: 5.214 ± 1.064
LeuGly: 4.94 ± 1.083
LeuHis: 1.921 ± 0.611
LeuIle: 6.312 ± 1.398
LeuLys: 7.958 ± 1.31
LeuLeu: 8.233 ± 1.478
LeuMet: 1.921 ± 0.739
LeuAsn: 8.233 ± 1.836
LeuPro: 3.842 ± 0.965
LeuGln: 2.744 ± 0.699
LeuArg: 1.921 ± 0.758
LeuSer: 6.586 ± 1.301
LeuThr: 4.391 ± 1.166
LeuVal: 5.214 ± 1.024
LeuTrp: 0.549 ± 0.33
LeuTyr: 4.391 ± 0.94
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.921 ± 0.715
MetCys: 0.0 ± 0.0
MetAsp: 1.647 ± 0.631
MetGlu: 1.372 ± 0.533
MetPhe: 1.098 ± 0.519
MetGly: 0.549 ± 0.322
MetHis: 0.0 ± 0.0
MetIle: 1.372 ± 0.755
MetLys: 2.47 ± 0.617
MetLeu: 2.47 ± 0.604
MetMet: 0.0 ± 0.0
MetAsn: 1.647 ± 0.656
MetPro: 0.274 ± 0.312
MetGln: 1.372 ± 0.533
MetArg: 0.823 ± 0.396
MetSer: 0.823 ± 0.469
MetThr: 3.568 ± 0.827
MetVal: 0.549 ± 0.386
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.116 ± 0.717
AsnCys: 0.0 ± 0.0
AsnAsp: 2.195 ± 0.895
AsnGlu: 3.568 ± 1.136
AsnPhe: 1.098 ± 0.535
AsnGly: 4.116 ± 1.002
AsnHis: 0.823 ± 0.398
AsnIle: 3.568 ± 0.895
AsnLys: 4.94 ± 1.124
AsnLeu: 4.665 ± 0.893
AsnMet: 1.647 ± 0.556
AsnAsn: 1.647 ± 0.819
AsnPro: 3.568 ± 1.062
AsnGln: 3.842 ± 1.081
AsnArg: 3.842 ± 0.897
AsnSer: 2.744 ± 0.868
AsnThr: 3.842 ± 1.31
AsnVal: 2.47 ± 0.622
AsnTrp: 1.098 ± 0.518
AsnTyr: 2.744 ± 0.762
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.372 ± 0.497
ProCys: 0.274 ± 0.299
ProAsp: 1.372 ± 0.578
ProGlu: 3.568 ± 0.908
ProPhe: 1.647 ± 0.62
ProGly: 0.823 ± 0.445
ProHis: 0.0 ± 0.0
ProIle: 2.47 ± 0.943
ProLys: 4.94 ± 1.048
ProLeu: 3.293 ± 0.713
ProMet: 0.549 ± 0.369
ProAsn: 2.744 ± 0.935
ProPro: 1.098 ± 0.399
ProGln: 0.274 ± 0.26
ProArg: 2.195 ± 0.58
ProSer: 2.195 ± 0.614
ProThr: 1.921 ± 0.586
ProVal: 1.372 ± 0.511
ProTrp: 0.0 ± 0.0
ProTyr: 1.372 ± 0.537
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.195 ± 0.568
GlnCys: 0.0 ± 0.0
GlnAsp: 2.47 ± 0.856
GlnGlu: 4.94 ± 1.317
GlnPhe: 1.098 ± 0.516
GlnGly: 2.47 ± 0.859
GlnHis: 0.823 ± 0.42
GlnIle: 2.195 ± 0.632
GlnLys: 4.116 ± 1.149
GlnLeu: 6.312 ± 1.407
GlnMet: 0.549 ± 0.33
GlnAsn: 2.744 ± 0.907
GlnPro: 1.921 ± 0.68
GlnGln: 2.195 ± 0.581
GlnArg: 2.195 ± 0.687
GlnSer: 3.842 ± 1.23
GlnThr: 1.372 ± 0.45
GlnVal: 3.019 ± 0.679
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.195 ± 0.697
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.195 ± 0.831
ArgCys: 0.274 ± 0.212
ArgAsp: 3.019 ± 0.861
ArgGlu: 2.195 ± 0.705
ArgPhe: 2.195 ± 0.682
ArgGly: 2.47 ± 0.92
ArgHis: 1.098 ± 0.503
ArgIle: 3.842 ± 0.936
ArgLys: 4.665 ± 1.046
ArgLeu: 4.665 ± 0.891
ArgMet: 0.823 ± 0.415
ArgAsn: 2.744 ± 0.905
ArgPro: 1.098 ± 0.462
ArgGln: 2.195 ± 0.51
ArgArg: 2.195 ± 0.602
ArgSer: 2.195 ± 0.771
ArgThr: 4.116 ± 1.148
ArgVal: 2.744 ± 0.836
ArgTrp: 0.274 ± 0.251
ArgTyr: 2.195 ± 0.715
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.647 ± 0.786
SerCys: 0.274 ± 0.212
SerAsp: 4.665 ± 0.901
SerGlu: 5.488 ± 1.548
SerPhe: 2.195 ± 0.563
SerGly: 1.921 ± 0.83
SerHis: 0.823 ± 0.473
SerIle: 6.312 ± 1.085
SerLys: 4.116 ± 0.852
SerLeu: 6.312 ± 1.178
SerMet: 1.098 ± 0.588
SerAsn: 2.47 ± 0.687
SerPro: 0.823 ± 0.39
SerGln: 3.019 ± 1.061
SerArg: 3.019 ± 0.859
SerSer: 1.647 ± 0.533
SerThr: 3.568 ± 0.992
SerVal: 2.195 ± 0.74
SerTrp: 0.549 ± 0.422
SerTyr: 3.019 ± 0.971
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.47 ± 0.793
ThrCys: 0.0 ± 0.0
ThrAsp: 1.921 ± 0.53
ThrGlu: 3.293 ± 0.786
ThrPhe: 3.019 ± 1.128
ThrGly: 5.763 ± 1.365
ThrHis: 1.098 ± 0.662
ThrIle: 4.665 ± 1.207
ThrLys: 5.488 ± 1.517
ThrLeu: 6.312 ± 1.135
ThrMet: 1.372 ± 0.668
ThrAsn: 1.647 ± 0.628
ThrPro: 3.293 ± 0.986
ThrGln: 3.842 ± 0.956
ThrArg: 2.47 ± 0.658
ThrSer: 3.568 ± 0.774
ThrThr: 5.488 ± 1.423
ThrVal: 3.019 ± 1.111
ThrTrp: 0.823 ± 0.345
ThrTyr: 4.116 ± 0.891
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.47 ± 0.478
ValCys: 0.0 ± 0.0
ValAsp: 2.744 ± 0.796
ValGlu: 3.293 ± 0.878
ValPhe: 2.744 ± 1.208
ValGly: 1.921 ± 0.967
ValHis: 0.823 ± 0.532
ValIle: 6.312 ± 0.914
ValLys: 4.116 ± 0.85
ValLeu: 5.763 ± 1.178
ValMet: 0.823 ± 0.491
ValAsn: 2.47 ± 0.856
ValPro: 1.098 ± 0.528
ValGln: 2.47 ± 1.112
ValArg: 1.098 ± 0.495
ValSer: 2.744 ± 1.01
ValThr: 4.94 ± 1.646
ValVal: 3.568 ± 0.948
ValTrp: 0.0 ± 0.0
ValTyr: 0.823 ± 0.396
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.274 ± 0.275
TrpCys: 0.0 ± 0.0
TrpAsp: 0.823 ± 0.455
TrpGlu: 1.098 ± 0.476
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.549 ± 0.414
TrpLys: 1.098 ± 0.446
TrpLeu: 1.647 ± 0.699
TrpMet: 0.0 ± 0.0
TrpAsn: 0.549 ± 0.328
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.274 ± 0.212
TrpSer: 0.823 ± 0.441
TrpThr: 0.274 ± 0.312
TrpVal: 0.823 ± 0.455
TrpTrp: 0.549 ± 0.356
TrpTyr: 0.549 ± 0.334
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.372 ± 0.633
TyrCys: 0.823 ± 0.386
TyrAsp: 1.372 ± 0.346
TyrGlu: 2.744 ± 0.804
TyrPhe: 2.47 ± 0.611
TyrGly: 3.019 ± 1.005
TyrHis: 0.823 ± 0.415
TyrIle: 3.293 ± 0.857
TyrLys: 4.116 ± 1.389
TyrLeu: 2.744 ± 0.795
TyrMet: 1.647 ± 0.684
TyrAsn: 4.94 ± 0.882
TyrPro: 2.195 ± 0.711
TyrGln: 2.47 ± 0.65
TyrArg: 4.665 ± 1.241
TyrSer: 3.019 ± 1.305
TyrThr: 2.47 ± 0.798
TyrVal: 2.195 ± 0.686
TyrTrp: 0.549 ± 0.286
TyrTyr: 3.568 ± 1.113
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3645 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
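One way such a protein-level bootstrap could be carried out is sketched below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each of 100 resamples, and the sample standard deviation of those estimates is reported as the error. The names `pair_permille` and `bootstrap_error`, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy one-letter sequences, not the 23 Javan50 proteins.
toy = ["MKKLAEL", "MKAELK", "MELLKKA", "MAKEL"]
err = bootstrap_error(toy, "EL")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the set. A dipeptide that never occurs yields 0.0 in every resample, which matches the "0.0 ± 0.0" entries in the table.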

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski