Amino acid dipeptide frequency for Streptococcus satellite phage Javan391

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
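As a rough illustration of how such frequencies can be computed, here is a minimal Python sketch. It assumes that dipeptides are overlapping adjacent residue pairs counted within each protein and normalized by the total number of pairs; the exact counting procedure used by the database is not stated on this page. The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: dipeptides are overlapping adjacent pairs, counted within
    each protein (no pairs spanning two proteins).
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

freqs = dipeptide_per_mille(["MKKLE", "KKE"])  # toy sequences, not Javan391 data
```

The returned dictionary always has 441 entries (21 × 21), so dipeptides that never occur show up with a frequency of 0.0, as in the table.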

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
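The unit conversion and the comparison against the uniform expectation can be sketched in a few lines of Python. The helper names are hypothetical, and using the uniform value of 2.5‰ as the threshold for "over" and "under" is an assumption based on the description above, not a documented rule of the database.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation: 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Two values taken from the table: LysGlu and CysCys.
lys_glu = 13.575
cys_cys = 0.0
print(per_mille_to_fraction(lys_glu), representation(lys_glu))
print(per_mille_to_fraction(cys_cys), representation(cys_cys))
```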

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± bootstrap error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 0.348 ± 0.338
AlaCys: 1.044 ± 0.468
AlaAsp: 2.436 ± 0.559
AlaGlu: 4.525 ± 1.478
AlaPhe: 1.74 ± 0.721
AlaGly: 1.74 ± 0.619
AlaHis: 0.0 ± 0.0
AlaIle: 6.265 ± 0.888
AlaLys: 3.481 ± 1.32
AlaLeu: 5.917 ± 1.227
AlaMet: 2.088 ± 0.599
AlaAsn: 2.436 ± 0.801
AlaPro: 1.392 ± 0.638
AlaGln: 1.74 ± 0.899
AlaArg: 2.088 ± 0.719
AlaSer: 1.74 ± 0.666
AlaThr: 2.088 ± 0.695
AlaVal: 1.74 ± 0.873
AlaTrp: 0.348 ± 0.295
AlaTyr: 3.133 ± 0.836
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.348 ± 0.295
CysCys: 0.0 ± 0.0
CysAsp: 0.348 ± 0.333
CysGlu: 0.348 ± 0.261
CysPhe: 0.348 ± 0.338
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.044 ± 0.517
CysLys: 0.0 ± 0.0
CysLeu: 0.348 ± 0.295
CysMet: 0.348 ± 0.356
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.348 ± 0.338
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.696 ± 0.391
CysTrp: 0.0 ± 0.0
CysTyr: 0.348 ± 0.374
CysXaa: 0.0 ± 0.0

Asp
AspAla: 0.696 ± 0.444
AspCys: 0.0 ± 0.0
AspAsp: 5.917 ± 2.224
AspGlu: 2.785 ± 1.014
AspPhe: 2.436 ± 0.721
AspGly: 2.436 ± 0.692
AspHis: 0.696 ± 0.441
AspIle: 7.658 ± 0.847
AspLys: 6.265 ± 1.244
AspLeu: 8.006 ± 1.75
AspMet: 2.088 ± 0.869
AspAsn: 2.436 ± 0.863
AspPro: 0.348 ± 0.261
AspGln: 1.392 ± 0.793
AspArg: 2.088 ± 0.765
AspSer: 3.829 ± 1.837
AspThr: 5.917 ± 2.421
AspVal: 1.392 ± 0.437
AspTrp: 0.348 ± 0.341
AspTyr: 5.917 ± 1.374
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.658 ± 1.489
GluCys: 0.0 ± 0.0
GluAsp: 3.133 ± 1.45
GluGlu: 3.829 ± 1.26
GluPhe: 3.481 ± 0.913
GluGly: 1.044 ± 0.532
GluHis: 2.088 ± 1.069
GluIle: 5.917 ± 1.909
GluLys: 5.569 ± 1.683
GluLeu: 11.138 ± 1.866
GluMet: 2.436 ± 0.663
GluAsn: 4.873 ± 1.301
GluPro: 3.133 ± 0.995
GluGln: 3.133 ± 1.205
GluArg: 3.481 ± 0.962
GluSer: 2.436 ± 0.826
GluThr: 5.221 ± 0.921
GluVal: 4.177 ± 1.656
GluTrp: 0.348 ± 0.29
GluTyr: 5.569 ± 0.974
GluXaa: 0.0 ± 0.0

Phe
PheAla: 0.696 ± 0.733
PheCys: 0.0 ± 0.0
PheAsp: 3.481 ± 0.896
PheGlu: 3.829 ± 2.046
PhePhe: 1.392 ± 0.675
PheGly: 2.088 ± 1.047
PheHis: 0.348 ± 0.295
PheIle: 3.481 ± 0.914
PheLys: 4.177 ± 1.521
PheLeu: 3.133 ± 1.199
PheMet: 1.044 ± 0.529
PheAsn: 3.133 ± 0.958
PhePro: 0.348 ± 0.333
PheGln: 0.696 ± 0.588
PheArg: 1.74 ± 0.649
PheSer: 3.133 ± 0.76
PheThr: 3.829 ± 1.207
PheVal: 1.74 ± 0.578
PheTrp: 0.348 ± 0.261
PheTyr: 2.088 ± 0.968
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.392 ± 0.641
GlyCys: 0.0 ± 0.0
GlyAsp: 3.481 ± 0.996
GlyGlu: 1.392 ± 0.555
GlyPhe: 2.436 ± 0.8
GlyGly: 2.088 ± 0.793
GlyHis: 0.348 ± 0.295
GlyIle: 2.436 ± 0.736
GlyLys: 4.177 ± 0.846
GlyLeu: 4.873 ± 1.345
GlyMet: 1.74 ± 0.643
GlyAsn: 3.481 ± 1.063
GlyPro: 0.348 ± 0.341
GlyGln: 1.74 ± 0.792
GlyArg: 3.133 ± 0.935
GlySer: 1.74 ± 0.883
GlyThr: 3.481 ± 0.892
GlyVal: 1.74 ± 0.845
GlyTrp: 0.696 ± 0.415
GlyTyr: 2.785 ± 0.605
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.044 ± 0.604
HisCys: 0.0 ± 0.0
HisAsp: 0.696 ± 0.363
HisGlu: 1.392 ± 0.719
HisPhe: 0.696 ± 0.474
HisGly: 0.348 ± 0.353
HisHis: 0.348 ± 0.261
HisIle: 1.044 ± 0.524
HisLys: 0.0 ± 0.0
HisLeu: 1.74 ± 0.953
HisMet: 0.696 ± 0.572
HisAsn: 0.696 ± 0.577
HisPro: 0.0 ± 0.0
HisGln: 0.348 ± 0.29
HisArg: 0.348 ± 0.261
HisSer: 1.392 ± 0.636
HisThr: 1.74 ± 0.525
HisVal: 0.348 ± 0.261
HisTrp: 0.348 ± 0.29
HisTyr: 1.044 ± 0.422
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.481 ± 1.062
IleCys: 0.348 ± 0.261
IleAsp: 3.829 ± 1.232
IleGlu: 7.309 ± 1.639
IlePhe: 2.436 ± 0.641
IleGly: 4.177 ± 1.346
IleHis: 1.044 ± 0.563
IleIle: 6.265 ± 0.968
IleLys: 10.79 ± 1.771
IleLeu: 5.569 ± 1.158
IleMet: 2.785 ± 0.923
IleAsn: 5.221 ± 1.241
IlePro: 3.829 ± 0.902
IleGln: 4.177 ± 0.883
IleArg: 2.088 ± 0.793
IleSer: 3.829 ± 0.835
IleThr: 4.873 ± 1.272
IleVal: 1.74 ± 0.658
IleTrp: 0.348 ± 0.338
IleTyr: 3.829 ± 1.092
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.265 ± 1.35
LysCys: 0.0 ± 0.0
LysAsp: 5.917 ± 1.256
LysGlu: 13.575 ± 2.727
LysPhe: 2.436 ± 0.797
LysGly: 4.177 ± 1.305
LysHis: 2.436 ± 0.815
LysIle: 7.309 ± 1.83
LysLys: 11.834 ± 3.074
LysLeu: 6.961 ± 2.285
LysMet: 2.785 ± 1.02
LysAsn: 7.658 ± 2.078
LysPro: 4.873 ± 1.635
LysGln: 3.133 ± 1.093
LysArg: 3.481 ± 1.267
LysSer: 7.309 ± 1.306
LysThr: 3.481 ± 1.071
LysVal: 3.133 ± 0.934
LysTrp: 1.044 ± 0.438
LysTyr: 3.829 ± 0.936
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.221 ± 1.161
LeuCys: 0.0 ± 0.0
LeuAsp: 9.05 ± 2.171
LeuGlu: 6.961 ± 1.317
LeuPhe: 1.74 ± 0.722
LeuGly: 5.569 ± 1.081
LeuHis: 1.392 ± 0.559
LeuIle: 8.702 ± 1.328
LeuLys: 7.658 ± 1.362
LeuLeu: 10.442 ± 1.894
LeuMet: 2.088 ± 0.75
LeuAsn: 8.006 ± 1.916
LeuPro: 4.525 ± 1.442
LeuGln: 4.873 ± 0.918
LeuArg: 3.133 ± 1.057
LeuSer: 6.961 ± 1.39
LeuThr: 7.309 ± 1.645
LeuVal: 4.177 ± 0.83
LeuTrp: 0.696 ± 0.441
LeuTyr: 5.221 ± 1.12
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.133 ± 1.098
MetCys: 0.348 ± 0.338
MetAsp: 1.392 ± 0.565
MetGlu: 0.696 ± 0.543
MetPhe: 0.696 ± 0.327
MetGly: 0.696 ± 0.579
MetHis: 0.0 ± 0.0
MetIle: 1.044 ± 0.524
MetLys: 5.569 ± 1.242
MetLeu: 3.133 ± 1.258
MetMet: 0.348 ± 0.261
MetAsn: 2.088 ± 0.704
MetPro: 0.0 ± 0.0
MetGln: 0.696 ± 0.378
MetArg: 0.696 ± 0.462
MetSer: 0.348 ± 0.295
MetThr: 3.481 ± 0.902
MetVal: 1.74 ± 0.877
MetTrp: 0.0 ± 0.0
MetTyr: 0.348 ± 0.261
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.829 ± 1.244
AsnCys: 2.088 ± 0.625
AsnAsp: 5.221 ± 1.169
AsnGlu: 6.613 ± 1.448
AsnPhe: 2.436 ± 0.744
AsnGly: 4.177 ± 0.903
AsnHis: 1.392 ± 0.695
AsnIle: 3.133 ± 1.004
AsnLys: 6.613 ± 0.859
AsnLeu: 5.569 ± 1.661
AsnMet: 2.088 ± 0.691
AsnAsn: 4.177 ± 0.817
AsnPro: 3.133 ± 0.965
AsnGln: 2.436 ± 0.871
AsnArg: 2.785 ± 0.979
AsnSer: 3.133 ± 0.811
AsnThr: 3.133 ± 0.907
AsnVal: 3.829 ± 1.215
AsnTrp: 0.696 ± 0.432
AsnTyr: 3.481 ± 0.974
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.696 ± 0.327
ProCys: 0.0 ± 0.0
ProAsp: 3.133 ± 0.735
ProGlu: 3.481 ± 1.238
ProPhe: 1.74 ± 0.813
ProGly: 0.348 ± 0.456
ProHis: 0.348 ± 0.261
ProIle: 0.348 ± 0.261
ProLys: 4.177 ± 1.189
ProLeu: 3.481 ± 1.105
ProMet: 0.348 ± 0.333
ProAsn: 2.436 ± 0.649
ProPro: 1.392 ± 0.793
ProGln: 1.044 ± 0.773
ProArg: 1.74 ± 0.762
ProSer: 0.696 ± 0.421
ProThr: 2.436 ± 1.088
ProVal: 1.74 ± 0.804
ProTrp: 0.0 ± 0.0
ProTyr: 2.088 ± 0.817
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.088 ± 0.66
GlnCys: 0.0 ± 0.0
GlnAsp: 1.044 ± 0.448
GlnGlu: 3.481 ± 1.611
GlnPhe: 2.436 ± 0.778
GlnGly: 1.74 ± 0.858
GlnHis: 0.348 ± 0.353
GlnIle: 1.392 ± 0.956
GlnLys: 3.133 ± 0.986
GlnLeu: 5.221 ± 1.478
GlnMet: 1.044 ± 0.622
GlnAsn: 1.392 ± 0.638
GlnPro: 0.348 ± 0.333
GlnGln: 2.088 ± 0.883
GlnArg: 1.392 ± 0.767
GlnSer: 2.088 ± 0.845
GlnThr: 1.392 ± 0.726
GlnVal: 1.392 ± 0.933
GlnTrp: 0.348 ± 0.374
GlnTyr: 1.74 ± 0.739
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.044 ± 0.51
ArgCys: 0.348 ± 0.338
ArgAsp: 1.392 ± 0.5
ArgGlu: 2.436 ± 0.713
ArgPhe: 1.044 ± 0.58
ArgGly: 3.829 ± 0.931
ArgHis: 1.044 ± 0.639
ArgIle: 3.481 ± 1.029
ArgLys: 5.569 ± 1.11
ArgLeu: 6.961 ± 1.2
ArgMet: 1.044 ± 0.54
ArgAsn: 1.74 ± 0.988
ArgPro: 0.348 ± 0.432
ArgGln: 1.392 ± 0.611
ArgArg: 1.044 ± 0.61
ArgSer: 1.74 ± 0.675
ArgThr: 1.392 ± 0.655
ArgVal: 3.481 ± 0.952
ArgTrp: 0.348 ± 0.314
ArgTyr: 1.392 ± 0.584
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 1.392 ± 0.808
SerCys: 0.0 ± 0.0
SerAsp: 5.221 ± 1.722
SerGlu: 3.829 ± 1.101
SerPhe: 3.829 ± 0.668
SerGly: 1.392 ± 0.742
SerHis: 0.348 ± 0.295
SerIle: 3.481 ± 0.913
SerLys: 5.917 ± 1.618
SerLeu: 5.221 ± 0.912
SerMet: 0.696 ± 0.531
SerAsn: 4.873 ± 1.565
SerPro: 2.436 ± 1.028
SerGln: 1.74 ± 0.692
SerArg: 3.133 ± 1.072
SerSer: 3.133 ± 0.826
SerThr: 1.392 ± 0.914
SerVal: 3.133 ± 1.096
SerTrp: 0.696 ± 0.487
SerTyr: 3.829 ± 1.683
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.088 ± 0.723
ThrCys: 0.0 ± 0.0
ThrAsp: 3.133 ± 0.975
ThrGlu: 5.221 ± 1.019
ThrPhe: 4.177 ± 1.299
ThrGly: 3.481 ± 1.177
ThrHis: 0.696 ± 0.432
ThrIle: 6.961 ± 1.841
ThrLys: 4.525 ± 1.303
ThrLeu: 5.917 ± 1.625
ThrMet: 0.696 ± 0.378
ThrAsn: 4.525 ± 1.35
ThrPro: 3.133 ± 0.704
ThrGln: 0.696 ± 0.489
ThrArg: 2.436 ± 0.928
ThrSer: 2.088 ± 0.843
ThrThr: 3.133 ± 0.972
ThrVal: 5.221 ± 1.392
ThrTrp: 0.696 ± 0.489
ThrTyr: 2.088 ± 1.279
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.133 ± 1.027
ValCys: 0.0 ± 0.0
ValAsp: 1.044 ± 0.5
ValGlu: 2.088 ± 1.135
ValPhe: 3.133 ± 0.836
ValGly: 0.696 ± 0.372
ValHis: 0.696 ± 0.391
ValIle: 3.829 ± 1.003
ValLys: 6.265 ± 1.291
ValLeu: 4.525 ± 0.951
ValMet: 0.0 ± 0.0
ValAsn: 5.221 ± 1.296
ValPro: 0.348 ± 0.341
ValGln: 0.348 ± 0.338
ValArg: 2.436 ± 1.071
ValSer: 4.873 ± 1.128
ValThr: 3.481 ± 1.066
ValVal: 2.785 ± 1.059
ValTrp: 0.696 ± 0.521
ValTyr: 0.696 ± 0.516
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.348 ± 0.29
TrpCys: 0.0 ± 0.0
TrpAsp: 0.348 ± 0.295
TrpGlu: 1.044 ± 0.52
TrpPhe: 0.348 ± 0.338
TrpGly: 0.348 ± 0.314
TrpHis: 0.348 ± 0.261
TrpIle: 0.696 ± 0.449
TrpLys: 1.044 ± 0.513
TrpLeu: 1.392 ± 0.68
TrpMet: 0.0 ± 0.0
TrpAsn: 0.696 ± 0.377
TrpPro: 0.0 ± 0.0
TrpGln: 0.348 ± 0.369
TrpArg: 0.348 ± 0.338
TrpSer: 0.696 ± 0.378
TrpThr: 0.0 ± 0.0
TrpVal: 0.348 ± 0.29
TrpTrp: 0.696 ± 0.441
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.392 ± 0.492
TyrCys: 0.348 ± 0.333
TyrAsp: 2.436 ± 1.029
TyrGlu: 3.829 ± 1.367
TyrPhe: 1.74 ± 0.778
TyrGly: 3.133 ± 1.064
TyrHis: 0.348 ± 0.353
TyrIle: 3.481 ± 1.232
TyrLys: 4.873 ± 1.802
TyrLeu: 4.177 ± 1.062
TyrMet: 1.74 ± 0.65
TyrAsn: 5.221 ± 1.485
TyrPro: 1.74 ± 0.66
TyrGln: 1.74 ± 0.648
TyrArg: 3.481 ± 1.308
TyrSer: 4.525 ± 1.055
TyrThr: 3.133 ± 1.197
TyrVal: 1.74 ± 0.885
TyrTrp: 0.348 ± 0.261
TyrTyr: 1.392 ± 0.658
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 21 proteins (2874 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
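A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the standard deviation across replicates. The page does not say which spread statistic is reported, and the function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide (overlapping pairs within proteins)."""
    total = hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(replicates)


toy = ["MKKLE", "KKE", "MLEKK"]  # toy sequences, not Javan391 data
print(pair_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so a dipeptide that never occurs in the proteome gets an error of exactly 0.0, which matches the 0.0 ± 0.0 entries in the table.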

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski