Amino acid dipeptide frequency for Streptococcus satellite phage Javan343

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25%. This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
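The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, pairs are counted as overlapping windows within each protein (never across protein boundaries), and the result is normalised by the total number of pairs. The exact normalisation used by the database is not stated on this page.

```python
from collections import Counter
from itertools import product

# One-letter codes in the column order of the table; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; no pair spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy sequences, not taken from the Javan343 proteome.
toy = ["MKKLLE", "MKLE"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform = 1000.0 / 400
print(len(freqs), freqs["MK"], uniform)
```

The dictionary always has 441 entries, so dipeptides that never occur (such as every Xaa pair in the table) show up explicitly with a frequency of zero.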

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions. On this scale the 0.25% expected for random dipeptides corresponds to 2.5‰.
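As a small worked example of the unit conversion, the Python sketch below turns per mille values into fractions and compares them with the 0.25% uniform expectation. The helper names are hypothetical, and using the uniform expectation as the red/blue threshold is an assumption based on the description of the table, not a documented rule of the database.

```python
# Uniform expectation for 400 equally likely dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the assumed expectation."""
    if value > expected:
        return "over-represented"
    if value < expected:
        return "under-represented"
    return "as expected"


# Two values copied from the table (LeuGlu and AlaAla).
for name, value in [("LeuGlu", 13.272), ("AlaAla", 1.373)]:
    print(name, per_mille_to_fraction(value), classify(value))
```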

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency in ‰ (value ± error); rows give the first residue, columns the second.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 1.373 ± 0.705 | 0.0 ± 0.0 | 5.034 ± 1.782 | 4.577 ± 1.611 | 1.373 ± 0.806 | 3.661 ± 1.452 | 0.0 ± 0.0 | 6.407 ± 2.664 | 5.95 ± 1.824 | 6.865 ± 2.381 | 0.458 ± 0.544 | 1.831 ± 0.988 | 1.373 ± 0.621 | 4.577 ± 1.483 | 3.204 ± 1.106 | 3.204 ± 1.0 | 2.746 ± 0.887 | 2.288 ± 0.992 | 0.458 ± 0.354 | 2.746 ± 1.001 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.458 ± 0.434 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.458 ± 0.39 | 0.0 ± 0.0 | 1.373 ± 0.937 | 0.0 ± 0.0 | 0.915 ± 0.525 | 0.458 ± 0.496 | 0.0 ± 0.0 | 0.458 ± 0.39 | 0.458 ± 0.354 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 1.831 ± 0.961 | 1.373 ± 0.735 | 3.204 ± 1.193 | 4.577 ± 1.312 | 2.288 ± 1.321 | 2.746 ± 1.18 | 0.458 ± 0.522 | 4.577 ± 1.333 | 6.865 ± 1.682 | 6.407 ± 1.188 | 1.831 ± 0.715 | 3.204 ± 1.055 | 0.915 ± 0.727 | 1.831 ± 1.088 | 1.831 ± 0.877 | 2.746 ± 1.243 | 6.407 ± 1.637 | 1.373 ± 0.868 | 0.915 ± 0.539 | 4.119 ± 1.053 | 0.0 ± 0.0
Glu | 4.119 ± 1.406 | 0.915 ± 0.486 | 6.407 ± 2.355 | 8.696 ± 2.093 | 3.661 ± 1.117 | 3.204 ± 1.414 | 2.288 ± 0.523 | 5.492 ± 1.467 | 7.78 ± 1.873 | 8.238 ± 2.314 | 2.288 ± 0.945 | 5.034 ± 1.63 | 1.831 ± 0.946 | 0.915 ± 0.796 | 5.034 ± 2.262 | 4.119 ± 1.407 | 3.204 ± 0.907 | 5.492 ± 2.418 | 0.458 ± 0.39 | 4.577 ± 0.925 | 0.0 ± 0.0
Phe | 1.373 ± 0.78 | 0.0 ± 0.0 | 3.204 ± 0.97 | 4.119 ± 1.239 | 0.915 ± 0.554 | 3.661 ± 0.711 | 0.458 ± 0.39 | 2.288 ± 1.036 | 2.288 ± 0.943 | 3.204 ± 0.842 | 0.915 ± 0.895 | 0.915 ± 0.868 | 0.915 ± 0.635 | 2.288 ± 0.895 | 1.373 ± 0.475 | 2.288 ± 0.859 | 0.915 ± 0.709 | 1.373 ± 0.644 | 0.458 ± 0.354 | 2.746 ± 0.937 | 0.0 ± 0.0
Gly | 1.831 ± 0.959 | 0.915 ± 0.637 | 3.204 ± 1.248 | 1.831 ± 0.785 | 3.204 ± 1.205 | 1.831 ± 0.673 | 0.458 ± 0.39 | 2.746 ± 0.966 | 4.119 ± 1.26 | 8.238 ± 1.577 | 3.661 ± 1.169 | 1.373 ± 0.544 | 0.458 ± 0.526 | 3.204 ± 1.593 | 3.204 ± 0.684 | 2.288 ± 1.162 | 2.746 ± 1.244 | 4.119 ± 1.185 | 0.915 ± 0.554 | 4.577 ± 1.25 | 0.0 ± 0.0
His | 1.373 ± 0.895 | 0.458 ± 0.434 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.458 ± 0.354 | 0.915 ± 0.762 | 0.458 ± 0.354 | 0.458 ± 0.354 | 0.915 ± 0.709 | 2.288 ± 0.763 | 0.458 ± 0.519 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.458 ± 0.354 | 1.373 ± 0.825 | 0.915 ± 0.78 | 0.458 ± 0.354 | 0.458 ± 0.354 | 0.458 ± 0.503 | 0.0 ± 0.0
Ile | 4.577 ± 1.141 | 0.0 ± 0.0 | 3.204 ± 0.904 | 6.865 ± 1.702 | 2.746 ± 1.563 | 4.577 ± 1.493 | 0.458 ± 0.354 | 3.204 ± 0.856 | 7.78 ± 2.12 | 7.78 ± 1.838 | 0.915 ± 0.709 | 2.288 ± 1.514 | 0.915 ± 0.709 | 3.204 ± 1.222 | 4.119 ± 1.39 | 5.95 ± 1.398 | 2.746 ± 1.23 | 1.831 ± 1.167 | 0.458 ± 0.354 | 3.204 ± 0.891 | 0.0 ± 0.0
Lys | 7.78 ± 2.584 | 0.458 ± 0.526 | 5.034 ± 1.753 | 7.78 ± 2.044 | 1.831 ± 0.742 | 6.407 ± 1.927 | 2.288 ± 1.0 | 7.323 ± 1.738 | 10.069 ± 3.502 | 7.323 ± 2.396 | 2.288 ± 1.019 | 3.661 ± 1.079 | 5.95 ± 1.405 | 5.492 ± 1.889 | 5.034 ± 1.454 | 6.407 ± 1.827 | 5.95 ± 2.136 | 6.407 ± 1.456 | 1.373 ± 0.893 | 3.661 ± 1.636 | 0.0 ± 0.0
Leu | 5.034 ± 1.315 | 0.0 ± 0.0 | 10.526 ± 3.04 | 13.272 ± 3.51 | 4.577 ± 1.505 | 4.577 ± 1.818 | 0.915 ± 0.486 | 5.034 ± 1.535 | 13.272 ± 3.911 | 8.238 ± 2.076 | 0.915 ± 0.539 | 4.119 ± 1.418 | 5.034 ± 1.447 | 6.407 ± 1.477 | 3.204 ± 1.188 | 5.034 ± 2.295 | 5.492 ± 1.597 | 5.034 ± 1.218 | 0.0 ± 0.0 | 5.034 ± 1.098 | 0.0 ± 0.0
Met | 2.288 ± 1.048 | 0.0 ± 0.0 | 0.458 ± 0.39 | 1.831 ± 0.918 | 0.915 ± 0.538 | 0.458 ± 0.354 | 0.0 ± 0.0 | 0.915 ± 0.544 | 1.831 ± 0.922 | 2.288 ± 0.818 | 0.0 ± 0.0 | 1.831 ± 0.68 | 0.458 ± 0.511 | 0.458 ± 0.503 | 2.288 ± 0.896 | 0.458 ± 0.519 | 5.034 ± 1.103 | 1.373 ± 0.896 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 2.288 ± 0.761 | 0.0 ± 0.0 | 1.831 ± 0.977 | 2.288 ± 0.689 | 0.458 ± 0.39 | 4.577 ± 1.01 | 0.458 ± 0.354 | 1.373 ± 0.88 | 3.204 ± 0.661 | 5.492 ± 1.193 | 0.915 ± 0.62 | 2.288 ± 1.411 | 4.119 ± 1.482 | 0.915 ± 0.677 | 2.746 ± 1.023 | 3.661 ± 1.097 | 2.288 ± 0.96 | 1.831 ± 0.848 | 0.0 ± 0.0 | 2.288 ± 0.831 | 0.0 ± 0.0
Pro | 2.746 ± 1.088 | 0.0 ± 0.0 | 0.915 ± 0.424 | 5.034 ± 1.308 | 2.288 ± 1.18 | 0.458 ± 0.354 | 0.458 ± 0.561 | 2.746 ± 0.884 | 2.288 ± 0.763 | 2.746 ± 0.942 | 0.0 ± 0.0 | 2.746 ± 1.244 | 1.373 ± 0.536 | 0.458 ± 0.526 | 3.204 ± 1.12 | 1.831 ± 1.051 | 2.288 ± 0.933 | 1.831 ± 0.911 | 0.0 ± 0.0 | 0.915 ± 0.424 | 0.0 ± 0.0
Gln | 4.577 ± 1.302 | 0.0 ± 0.0 | 1.373 ± 0.622 | 3.661 ± 1.19 | 1.831 ± 1.052 | 1.373 ± 0.65 | 0.0 ± 0.0 | 2.746 ± 1.524 | 6.407 ± 1.86 | 4.577 ± 1.69 | 0.915 ± 0.62 | 2.288 ± 1.258 | 0.458 ± 0.506 | 2.746 ± 1.271 | 2.288 ± 1.278 | 2.288 ± 1.225 | 1.831 ± 0.894 | 2.746 ± 1.369 | 0.0 ± 0.0 | 1.831 ± 0.951 | 0.0 ± 0.0
Arg | 2.288 ± 1.528 | 0.0 ± 0.0 | 2.288 ± 0.64 | 3.661 ± 1.306 | 2.746 ± 1.144 | 2.288 ± 0.836 | 0.915 ± 0.78 | 3.204 ± 0.857 | 5.492 ± 1.984 | 6.865 ± 1.515 | 0.915 ± 0.518 | 2.746 ± 1.16 | 1.831 ± 1.054 | 1.831 ± 0.635 | 3.204 ± 1.656 | 2.288 ± 0.772 | 2.746 ± 1.246 | 2.746 ± 1.571 | 0.458 ± 0.519 | 0.915 ± 1.012 | 0.0 ± 0.0
Ser | 3.204 ± 1.111 | 0.0 ± 0.0 | 3.661 ± 1.713 | 3.204 ± 0.779 | 1.373 ± 0.671 | 3.661 ± 1.558 | 1.373 ± 0.707 | 5.95 ± 1.32 | 3.661 ± 1.146 | 6.407 ± 1.18 | 1.831 ± 0.834 | 1.831 ± 0.785 | 0.915 ± 0.601 | 2.746 ± 1.01 | 1.831 ± 1.114 | 5.034 ± 1.284 | 4.119 ± 1.345 | 4.119 ± 1.088 | 0.915 ± 0.539 | 3.661 ± 1.516 | 0.0 ± 0.0
Thr | 4.577 ± 2.005 | 0.0 ± 0.0 | 2.288 ± 1.022 | 4.119 ± 1.046 | 1.831 ± 0.737 | 5.95 ± 1.533 | 0.458 ± 0.39 | 4.577 ± 1.364 | 4.119 ± 1.163 | 8.238 ± 2.572 | 1.373 ± 0.816 | 1.831 ± 1.17 | 3.204 ± 0.948 | 1.831 ± 0.96 | 2.288 ± 0.611 | 2.746 ± 1.008 | 3.661 ± 1.022 | 2.746 ± 1.164 | 0.458 ± 0.434 | 1.373 ± 0.816 | 0.0 ± 0.0
Val | 5.95 ± 1.88 | 0.0 ± 0.0 | 3.661 ± 2.009 | 4.119 ± 1.702 | 1.831 ± 0.889 | 1.831 ± 0.68 | 0.0 ± 0.0 | 4.119 ± 2.078 | 5.95 ± 1.947 | 2.288 ± 0.909 | 1.831 ± 0.752 | 1.831 ± 0.933 | 1.373 ± 0.88 | 1.373 ± 0.719 | 0.458 ± 0.354 | 4.119 ± 1.945 | 3.661 ± 2.036 | 3.661 ± 1.624 | 0.458 ± 0.354 | 2.746 ± 1.068 | 0.0 ± 0.0
Trp | 0.458 ± 0.39 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.373 ± 0.789 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.458 ± 0.522 | 1.373 ± 0.782 | 1.831 ± 1.08 | 0.458 ± 0.463 | 0.0 ± 0.0 | 0.458 ± 0.354 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.915 ± 0.525 | 0.0 ± 0.0 | 0.458 ± 0.434 | 0.458 ± 0.39 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 0.458 ± 0.354 | 0.458 ± 0.434 | 2.746 ± 1.741 | 2.746 ± 1.157 | 1.373 ± 0.715 | 2.288 ± 1.218 | 0.458 ± 0.354 | 1.831 ± 0.73 | 9.153 ± 3.362 | 5.492 ± 1.546 | 0.0 ± 0.528 | 3.204 ± 1.693 | 2.288 ± 1.18 | 3.204 ± 1.316 | 3.661 ± 1.388 | 2.746 ± 0.924 | 0.915 ± 0.653 | 1.373 ± 0.79 | 0.0 ± 0.0 | 1.831 ± 0.949 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 14 proteins (2186 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
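A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is a minimal illustration, not the database's actual code: the function names, the toy sequences and the fixed seed are hypothetical, and reporting the standard deviation of the replicate estimates as the error is an assumption. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread over 100 rounds is returned.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, given in one-letter code."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from Javan343.
toy = ["MKKLLE", "MKLE", "AAKK"]
print(dipeptide_freq(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the statistic depends on which proteins happen to be in a small proteome such as this one (14 proteins).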

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski