Amino acid dipeptide frequency for Streptococcus satellite phage Javan461

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred in proteins at random and with equal probability, each of the 400 dipeptides would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
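The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to build the table: the function names (`dipeptide_permille`, `classify`) are hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), any residue outside the 20 standard amino acids is mapped to `X` (Xaa), and pairs are counted only within a protein and normalised by the total number of pairs, which is an assumption about the normalisation.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Expected frequency of each dipeptide under a uniform model: 1000 / 400 = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Overlapping pairs are counted within each protein only; non-standard
    residues are collapsed to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"
```

For example, `classify(10.931)` returns `"over"` and `classify(0.312)` returns `"under"`, which corresponds to the red and blue marking described above.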

All values are given in per mille (‰); multiply a value by 10⁻³ to obtain the corresponding fraction.
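As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and exist only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0
```

With these helpers, a table value of 10.931‰ corresponds to a fraction of about 0.010931, and the uniform expectation of 2.5‰ corresponds to 0.25%.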

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error, grouped by first residue; second residue in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 0.937 ± 0.664
AlaCys: 0.937 ± 0.438
AlaAsp: 2.186 ± 0.759
AlaGlu: 4.372 ± 1.154
AlaPhe: 5.621 ± 1.533
AlaGly: 1.874 ± 0.737
AlaHis: 0.937 ± 0.658
AlaIle: 7.495 ± 1.49
AlaLys: 4.372 ± 1.446
AlaLeu: 5.309 ± 1.484
AlaMet: 1.249 ± 0.618
AlaAsn: 4.997 ± 1.423
AlaPro: 1.874 ± 0.674
AlaGln: 3.435 ± 0.97
AlaArg: 2.498 ± 0.634
AlaSer: 4.997 ± 1.509
AlaThr: 4.372 ± 0.746
AlaVal: 2.811 ± 0.901
AlaTrp: 0.937 ± 0.463
AlaTyr: 2.498 ± 0.697
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.625 ± 0.43
CysCys: 0.312 ± 0.331
CysAsp: 0.625 ± 0.455
CysGlu: 0.312 ± 0.331
CysPhe: 0.312 ± 0.269
CysGly: 0.312 ± 0.331
CysHis: 0.312 ± 0.283
CysIle: 0.0 ± 0.0
CysLys: 0.625 ± 0.403
CysLeu: 0.937 ± 0.477
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.312 ± 0.331
CysGln: 0.625 ± 0.378
CysArg: 0.625 ± 0.379
CysSer: 0.312 ± 0.269
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.625 ± 0.421
CysXaa: 0.0 ± 0.0

Asp
AspAla: 0.937 ± 0.452
AspCys: 1.249 ± 0.722
AspAsp: 2.498 ± 0.731
AspGlu: 4.685 ± 1.583
AspPhe: 2.498 ± 0.99
AspGly: 1.874 ± 0.687
AspHis: 0.625 ± 0.327
AspIle: 6.558 ± 1.467
AspLys: 5.621 ± 2.128
AspLeu: 5.934 ± 0.968
AspMet: 1.874 ± 0.769
AspAsn: 1.562 ± 0.658
AspPro: 0.312 ± 0.331
AspGln: 1.249 ± 0.549
AspArg: 2.811 ± 0.907
AspSer: 1.249 ± 0.788
AspThr: 4.685 ± 1.249
AspVal: 1.249 ± 0.652
AspTrp: 0.312 ± 0.343
AspTyr: 3.748 ± 1.362
AspXaa: 0.0 ± 0.0

Glu
GluAla: 8.12 ± 1.349
GluCys: 1.249 ± 0.678
GluAsp: 5.309 ± 1.24
GluGlu: 5.934 ± 1.341
GluPhe: 1.874 ± 1.072
GluGly: 2.186 ± 0.685
GluHis: 2.186 ± 0.773
GluIle: 7.183 ± 1.418
GluLys: 5.621 ± 0.9
GluLeu: 10.931 ± 1.543
GluMet: 1.874 ± 0.593
GluAsn: 2.811 ± 0.999
GluPro: 1.249 ± 0.61
GluGln: 4.372 ± 1.49
GluArg: 4.06 ± 1.243
GluSer: 1.562 ± 0.697
GluThr: 2.498 ± 0.738
GluVal: 2.186 ± 0.734
GluTrp: 1.249 ± 0.819
GluTyr: 3.123 ± 1.005
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.874 ± 0.667
PheCys: 0.0 ± 0.0
PheAsp: 2.811 ± 0.747
PheGlu: 3.123 ± 0.857
PhePhe: 1.874 ± 0.85
PheGly: 2.498 ± 0.749
PheHis: 1.874 ± 0.52
PheIle: 2.186 ± 0.696
PheLys: 4.685 ± 1.207
PheLeu: 3.123 ± 1.092
PheMet: 0.0 ± 0.0
PheAsn: 2.186 ± 0.922
PhePro: 0.937 ± 0.473
PheGln: 0.937 ± 0.753
PheArg: 1.562 ± 0.503
PheSer: 2.186 ± 0.614
PheThr: 3.435 ± 0.625
PheVal: 2.498 ± 0.541
PheTrp: 0.312 ± 0.229
PheTyr: 1.249 ± 0.55
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.498 ± 1.336
GlyCys: 0.312 ± 0.269
GlyAsp: 3.123 ± 1.031
GlyGlu: 2.498 ± 0.864
GlyPhe: 2.186 ± 0.765
GlyGly: 3.748 ± 1.19
GlyHis: 0.937 ± 0.533
GlyIle: 1.874 ± 0.88
GlyLys: 4.372 ± 1.04
GlyLeu: 6.246 ± 1.794
GlyMet: 1.249 ± 0.512
GlyAsn: 1.562 ± 0.502
GlyPro: 0.312 ± 0.331
GlyGln: 2.811 ± 1.143
GlyArg: 3.123 ± 1.03
GlySer: 1.874 ± 0.728
GlyThr: 2.811 ± 0.694
GlyVal: 3.748 ± 1.037
GlyTrp: 0.312 ± 0.229
GlyTyr: 3.435 ± 1.077
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.811 ± 0.87
HisCys: 0.0 ± 0.0
HisAsp: 0.312 ± 0.291
HisGlu: 0.312 ± 0.229
HisPhe: 0.312 ± 0.351
HisGly: 0.937 ± 0.46
HisHis: 0.312 ± 0.229
HisIle: 1.874 ± 0.762
HisLys: 2.186 ± 0.929
HisLeu: 1.562 ± 0.7
HisMet: 0.937 ± 0.655
HisAsn: 1.249 ± 0.698
HisPro: 0.625 ± 0.378
HisGln: 0.625 ± 0.473
HisArg: 1.249 ± 0.656
HisSer: 0.312 ± 0.269
HisThr: 1.562 ± 0.673
HisVal: 0.312 ± 0.229
HisTrp: 0.625 ± 0.378
HisTyr: 1.874 ± 0.827
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.871 ± 1.785
IleCys: 0.312 ± 0.283
IleAsp: 6.558 ± 1.258
IleGlu: 4.685 ± 0.817
IlePhe: 2.811 ± 0.774
IleGly: 2.186 ± 0.514
IleHis: 1.249 ± 0.673
IleIle: 6.558 ± 1.413
IleLys: 10.306 ± 1.84
IleLeu: 4.372 ± 0.949
IleMet: 1.874 ± 0.695
IleAsn: 4.372 ± 1.205
IlePro: 2.498 ± 0.831
IleGln: 1.874 ± 0.751
IleArg: 3.435 ± 0.953
IleSer: 5.309 ± 1.32
IleThr: 4.685 ± 1.255
IleVal: 1.874 ± 0.645
IleTrp: 0.0 ± 0.0
IleTyr: 1.874 ± 0.71
IleXaa: 0.0 ± 0.0

Lys
LysAla: 9.057 ± 1.657
LysCys: 0.0 ± 0.0
LysAsp: 4.997 ± 1.324
LysGlu: 10.618 ± 1.702
LysPhe: 2.186 ± 0.707
LysGly: 4.997 ± 1.568
LysHis: 2.811 ± 0.804
LysIle: 3.748 ± 1.064
LysLys: 5.934 ± 1.639
LysLeu: 5.934 ± 1.526
LysMet: 2.498 ± 0.755
LysAsn: 4.997 ± 1.187
LysPro: 6.871 ± 1.271
LysGln: 5.309 ± 1.02
LysArg: 4.997 ± 0.952
LysSer: 4.997 ± 1.185
LysThr: 4.997 ± 1.412
LysVal: 4.06 ± 0.849
LysTrp: 0.312 ± 0.331
LysTyr: 2.811 ± 1.029
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.495 ± 1.495
LeuCys: 0.0 ± 0.0
LeuAsp: 4.997 ± 1.276
LeuGlu: 10.618 ± 1.314
LeuPhe: 3.435 ± 1.064
LeuGly: 5.309 ± 0.949
LeuHis: 1.562 ± 0.636
LeuIle: 8.432 ± 1.598
LeuLys: 10.931 ± 0.989
LeuLeu: 11.243 ± 1.986
LeuMet: 2.186 ± 0.845
LeuAsn: 4.685 ± 1.467
LeuPro: 4.06 ± 1.328
LeuGln: 3.435 ± 0.882
LeuArg: 2.186 ± 0.738
LeuSer: 6.246 ± 1.722
LeuThr: 5.934 ± 0.851
LeuVal: 4.685 ± 1.209
LeuTrp: 0.312 ± 0.269
LeuTyr: 4.997 ± 0.723
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.186 ± 1.038
MetCys: 0.0 ± 0.0
MetAsp: 1.249 ± 0.537
MetGlu: 0.625 ± 0.366
MetPhe: 0.312 ± 0.317
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 1.562 ± 0.708
MetLys: 2.498 ± 0.83
MetLeu: 2.186 ± 0.785
MetMet: 0.0 ± 0.0
MetAsn: 2.498 ± 0.765
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 2.186 ± 0.82
MetSer: 1.874 ± 0.771
MetThr: 4.997 ± 1.327
MetVal: 0.625 ± 0.438
MetTrp: 0.0 ± 0.0
MetTyr: 0.312 ± 0.297
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.435 ± 0.689
AsnCys: 0.0 ± 0.0
AsnAsp: 1.562 ± 0.913
AsnGlu: 2.498 ± 1.147
AsnPhe: 0.625 ± 0.349
AsnGly: 4.997 ± 1.018
AsnHis: 1.249 ± 0.434
AsnIle: 2.186 ± 0.719
AsnLys: 3.435 ± 1.058
AsnLeu: 5.621 ± 1.208
AsnMet: 1.874 ± 0.672
AsnAsn: 3.435 ± 1.002
AsnPro: 2.811 ± 0.714
AsnGln: 2.498 ± 1.085
AsnArg: 5.309 ± 0.923
AsnSer: 2.186 ± 0.745
AsnThr: 1.874 ± 0.804
AsnVal: 2.811 ± 0.855
AsnTrp: 0.312 ± 0.384
AsnTyr: 2.186 ± 0.717
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.625 ± 0.408
ProCys: 0.0 ± 0.0
ProAsp: 1.874 ± 0.826
ProGlu: 4.372 ± 1.054
ProPhe: 1.562 ± 0.744
ProGly: 0.312 ± 0.331
ProHis: 0.0 ± 0.0
ProIle: 1.249 ± 0.714
ProLys: 5.621 ± 0.899
ProLeu: 2.811 ± 0.995
ProMet: 0.312 ± 0.331
ProAsn: 2.186 ± 0.672
ProPro: 0.625 ± 0.45
ProGln: 0.937 ± 0.563
ProArg: 2.498 ± 0.923
ProSer: 1.562 ± 0.772
ProThr: 3.123 ± 0.732
ProVal: 2.186 ± 0.648
ProTrp: 0.0 ± 0.0
ProTyr: 1.562 ± 0.616
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.498 ± 0.85
GlnCys: 0.312 ± 0.291
GlnAsp: 2.186 ± 0.759
GlnGlu: 3.123 ± 1.035
GlnPhe: 0.937 ± 0.473
GlnGly: 2.498 ± 1.117
GlnHis: 0.937 ± 0.419
GlnIle: 4.06 ± 0.816
GlnLys: 4.372 ± 1.225
GlnLeu: 7.495 ± 1.012
GlnMet: 0.937 ± 0.783
GlnAsn: 1.874 ± 0.821
GlnPro: 1.562 ± 0.831
GlnGln: 2.186 ± 0.561
GlnArg: 2.811 ± 0.756
GlnSer: 2.186 ± 0.837
GlnThr: 1.562 ± 0.621
GlnVal: 3.123 ± 1.15
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.562 ± 0.846
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.186 ± 0.806
ArgCys: 0.625 ± 0.379
ArgAsp: 2.186 ± 0.927
ArgGlu: 3.748 ± 0.86
ArgPhe: 3.123 ± 1.022
ArgGly: 2.811 ± 1.019
ArgHis: 1.249 ± 0.53
ArgIle: 2.811 ± 0.691
ArgLys: 5.621 ± 0.94
ArgLeu: 5.621 ± 0.975
ArgMet: 0.937 ± 0.484
ArgAsn: 2.186 ± 0.82
ArgPro: 1.562 ± 0.652
ArgGln: 3.748 ± 0.765
ArgArg: 3.123 ± 1.042
ArgSer: 2.811 ± 0.792
ArgThr: 3.748 ± 0.981
ArgVal: 3.435 ± 0.801
ArgTrp: 0.625 ± 0.453
ArgTyr: 3.123 ± 1.128
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.186 ± 0.706
SerCys: 0.312 ± 0.331
SerAsp: 3.123 ± 0.828
SerGlu: 2.811 ± 0.927
SerPhe: 1.874 ± 0.526
SerGly: 2.186 ± 0.736
SerHis: 0.625 ± 0.489
SerIle: 6.246 ± 1.306
SerLys: 5.621 ± 1.13
SerLeu: 4.685 ± 1.062
SerMet: 0.937 ± 0.556
SerAsn: 1.874 ± 0.975
SerPro: 0.937 ± 0.433
SerGln: 3.123 ± 1.105
SerArg: 3.123 ± 1.07
SerSer: 1.562 ± 0.546
SerThr: 3.123 ± 0.955
SerVal: 4.06 ± 1.466
SerTrp: 0.937 ± 0.573
SerTyr: 2.811 ± 0.851
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.06 ± 1.076
ThrCys: 0.0 ± 0.0
ThrAsp: 2.498 ± 0.889
ThrGlu: 4.372 ± 1.249
ThrPhe: 2.811 ± 1.126
ThrGly: 5.621 ± 0.893
ThrHis: 1.249 ± 0.524
ThrIle: 3.435 ± 1.085
ThrLys: 3.435 ± 1.147
ThrLeu: 8.432 ± 1.692
ThrMet: 1.249 ± 0.542
ThrAsn: 0.625 ± 0.477
ThrPro: 4.06 ± 1.09
ThrGln: 2.811 ± 1.158
ThrArg: 3.123 ± 0.823
ThrSer: 3.123 ± 0.858
ThrThr: 2.186 ± 1.059
ThrVal: 2.811 ± 0.847
ThrTrp: 0.937 ± 0.482
ThrTyr: 4.06 ± 1.149
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.06 ± 0.683
ValCys: 0.625 ± 0.507
ValAsp: 0.937 ± 0.541
ValGlu: 2.498 ± 1.108
ValPhe: 2.186 ± 0.829
ValGly: 2.811 ± 0.852
ValHis: 0.312 ± 0.331
ValIle: 4.685 ± 1.181
ValLys: 2.811 ± 0.764
ValLeu: 4.997 ± 1.192
ValMet: 1.249 ± 0.482
ValAsn: 3.123 ± 1.039
ValPro: 0.937 ± 0.578
ValGln: 2.498 ± 0.829
ValArg: 1.874 ± 0.715
ValSer: 4.06 ± 1.21
ValThr: 3.123 ± 1.276
ValVal: 1.562 ± 0.85
ValTrp: 0.625 ± 0.458
ValTyr: 1.874 ± 0.595
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.312 ± 0.369
TrpCys: 0.0 ± 0.0
TrpAsp: 0.312 ± 0.353
TrpGlu: 0.937 ± 0.528
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.625 ± 0.458
TrpLeu: 1.874 ± 0.792
TrpMet: 0.0 ± 0.0
TrpAsn: 0.312 ± 0.369
TrpPro: 0.312 ± 0.229
TrpGln: 0.625 ± 0.421
TrpArg: 0.625 ± 0.473
TrpSer: 0.937 ± 0.529
TrpThr: 0.0 ± 0.0
TrpVal: 1.249 ± 0.763
TrpTrp: 0.312 ± 0.369
TrpTyr: 0.312 ± 0.309
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.937 ± 0.548
TyrCys: 0.625 ± 0.412
TyrAsp: 2.186 ± 0.672
TyrGlu: 3.435 ± 1.095
TyrPhe: 2.811 ± 0.792
TyrGly: 1.874 ± 0.738
TyrHis: 1.562 ± 0.585
TyrIle: 1.562 ± 0.692
TyrLys: 3.435 ± 1.208
TyrLeu: 3.435 ± 0.636
TyrMet: 1.562 ± 0.705
TyrAsn: 4.372 ± 1.002
TyrPro: 1.562 ± 0.917
TyrGln: 3.123 ± 0.966
TyrArg: 4.06 ± 1.345
TyrSer: 2.811 ± 0.809
TyrThr: 2.498 ± 0.83
TyrVal: 1.562 ± 0.721
TyrTrp: 0.625 ± 0.662
TyrTyr: 3.748 ± 0.997
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 22 proteins (3203 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
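One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the standard deviation over the replicates is reported as the error. The exact procedure used for the table is not described here, so the function names, the use of the sample standard deviation, the fixed seed, and the one-letter sequence alphabet are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins):
    """Dipeptide frequencies in per mille, counting pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Each round draws len(proteins) proteins with replacement and recomputes
    the frequency; the standard deviation over all rounds is returned.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    samples = []
    for _ in range(rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_permille(resampled).get(pair, 0.0))
    return statistics.stdev(samples)
```

Under this scheme a dipeptide that never occurs in any protein always has a frequency of 0.0 in every replicate, so its error is 0.0 as well, which is consistent with the `0.0 ± 0.0` entries in the table.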

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski