Amino acid dipepetide frequency for Streptococcus satellite phage Javan752

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.972AlaAla: 0.972 ± 0.596
0.324AlaCys: 0.324 ± 0.41
3.238AlaAsp: 3.238 ± 1.005
5.181AlaGlu: 5.181 ± 1.497
2.591AlaPhe: 2.591 ± 1.038
1.943AlaGly: 1.943 ± 0.702
0.324AlaHis: 0.324 ± 0.314
3.562AlaIle: 3.562 ± 0.844
2.915AlaLys: 2.915 ± 0.948
6.153AlaLeu: 6.153 ± 1.297
1.619AlaMet: 1.619 ± 0.751
2.267AlaAsn: 2.267 ± 0.592
0.324AlaPro: 0.324 ± 0.319
2.267AlaGln: 2.267 ± 0.899
5.829AlaArg: 5.829 ± 1.192
1.619AlaSer: 1.619 ± 0.727
3.562AlaThr: 3.562 ± 1.188
3.562AlaVal: 3.562 ± 0.904
0.324AlaTrp: 0.324 ± 0.314
1.619AlaTyr: 1.619 ± 0.78
0.0AlaXaa: 0.0 ± 0.0
Cys
0.648CysAla: 0.648 ± 0.334
0.0CysCys: 0.0 ± 0.0
0.972CysAsp: 0.972 ± 0.537
0.324CysGlu: 0.324 ± 0.41
0.324CysPhe: 0.324 ± 0.263
0.972CysGly: 0.972 ± 0.756
0.648CysHis: 0.648 ± 0.357
0.972CysIle: 0.972 ± 0.654
0.648CysLys: 0.648 ± 0.534
1.943CysLeu: 1.943 ± 0.648
0.0CysMet: 0.0 ± 0.0
0.324CysAsn: 0.324 ± 0.283
0.648CysPro: 0.648 ± 0.423
0.648CysGln: 0.648 ± 0.52
0.324CysArg: 0.324 ± 0.366
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.324CysVal: 0.324 ± 0.314
0.0CysTrp: 0.0 ± 0.0
0.648CysTyr: 0.648 ± 0.415
0.0CysXaa: 0.0 ± 0.0
Asp
1.295AspAla: 1.295 ± 0.482
0.972AspCys: 0.972 ± 0.817
4.21AspAsp: 4.21 ± 1.144
3.562AspGlu: 3.562 ± 0.97
3.562AspPhe: 3.562 ± 1.378
3.238AspGly: 3.238 ± 0.754
0.0AspHis: 0.0 ± 0.0
6.801AspIle: 6.801 ± 1.221
5.181AspLys: 5.181 ± 1.23
7.448AspLeu: 7.448 ± 2.306
1.295AspMet: 1.295 ± 0.665
3.562AspAsn: 3.562 ± 1.042
1.295AspPro: 1.295 ± 0.773
1.619AspGln: 1.619 ± 0.606
3.562AspArg: 3.562 ± 1.212
2.591AspSer: 2.591 ± 0.749
2.267AspThr: 2.267 ± 0.705
2.267AspVal: 2.267 ± 1.099
0.0AspTrp: 0.0 ± 0.0
4.534AspTyr: 4.534 ± 1.311
0.0AspXaa: 0.0 ± 0.0
Glu
6.801GluAla: 6.801 ± 1.365
1.295GluCys: 1.295 ± 0.588
4.21GluAsp: 4.21 ± 1.163
5.829GluGlu: 5.829 ± 1.695
2.267GluPhe: 2.267 ± 0.998
2.915GluGly: 2.915 ± 0.902
1.619GluHis: 1.619 ± 0.687
6.153GluIle: 6.153 ± 1.204
8.096GluLys: 8.096 ± 1.922
9.715GluLeu: 9.715 ± 1.261
2.915GluMet: 2.915 ± 0.915
6.477GluAsn: 6.477 ± 1.421
2.267GluPro: 2.267 ± 1.091
2.591GluGln: 2.591 ± 0.886
6.153GluArg: 6.153 ± 1.383
4.534GluSer: 4.534 ± 1.473
2.915GluThr: 2.915 ± 0.842
4.534GluVal: 4.534 ± 1.066
0.648GluTrp: 0.648 ± 0.357
3.238GluTyr: 3.238 ± 0.861
0.0GluXaa: 0.0 ± 0.0
Phe
1.295PheAla: 1.295 ± 0.713
0.324PheCys: 0.324 ± 0.353
4.21PheAsp: 4.21 ± 1.042
2.591PheGlu: 2.591 ± 1.005
2.915PhePhe: 2.915 ± 1.037
2.915PheGly: 2.915 ± 0.682
0.648PheHis: 0.648 ± 0.357
3.886PheIle: 3.886 ± 1.128
3.886PheLys: 3.886 ± 1.15
4.21PheLeu: 4.21 ± 1.423
0.972PheMet: 0.972 ± 0.619
2.267PheAsn: 2.267 ± 0.649
0.648PhePro: 0.648 ± 0.455
2.915PheGln: 2.915 ± 0.716
2.267PheArg: 2.267 ± 0.811
3.562PheSer: 3.562 ± 0.67
1.619PheThr: 1.619 ± 0.592
0.648PheVal: 0.648 ± 0.386
0.324PheTrp: 0.324 ± 0.366
2.591PheTyr: 2.591 ± 0.806
0.0PheXaa: 0.0 ± 0.0
Gly
1.943GlyAla: 1.943 ± 0.788
0.648GlyCys: 0.648 ± 0.434
3.886GlyAsp: 3.886 ± 1.019
3.886GlyGlu: 3.886 ± 1.043
2.267GlyPhe: 2.267 ± 0.807
2.591GlyGly: 2.591 ± 1.346
0.972GlyHis: 0.972 ± 0.498
4.21GlyIle: 4.21 ± 1.186
2.915GlyLys: 2.915 ± 1.187
4.858GlyLeu: 4.858 ± 1.515
1.295GlyMet: 1.295 ± 0.51
2.267GlyAsn: 2.267 ± 0.689
0.324GlyPro: 0.324 ± 0.269
2.915GlyGln: 2.915 ± 0.585
2.915GlyArg: 2.915 ± 0.913
1.619GlySer: 1.619 ± 0.606
4.858GlyThr: 4.858 ± 1.235
4.534GlyVal: 4.534 ± 1.491
1.619GlyTrp: 1.619 ± 0.81
2.267GlyTyr: 2.267 ± 0.977
0.0GlyXaa: 0.0 ± 0.0
His
1.619HisAla: 1.619 ± 0.864
0.0HisCys: 0.0 ± 0.0
0.972HisAsp: 0.972 ± 0.564
1.295HisGlu: 1.295 ± 0.65
0.648HisPhe: 0.648 ± 0.388
0.648HisGly: 0.648 ± 0.448
0.0HisHis: 0.0 ± 0.0
0.324HisIle: 0.324 ± 0.269
1.295HisLys: 1.295 ± 0.733
1.295HisLeu: 1.295 ± 0.715
0.324HisMet: 0.324 ± 0.393
0.972HisAsn: 0.972 ± 0.631
0.324HisPro: 0.324 ± 0.344
0.324HisGln: 0.324 ± 0.314
1.619HisArg: 1.619 ± 0.704
1.295HisSer: 1.295 ± 0.829
0.972HisThr: 0.972 ± 0.576
0.648HisVal: 0.648 ± 0.334
0.0HisTrp: 0.0 ± 0.0
1.295HisTyr: 1.295 ± 0.614
0.0HisXaa: 0.0 ± 0.0
Ile
4.21IleAla: 4.21 ± 1.38
1.295IleCys: 1.295 ± 0.615
6.801IleAsp: 6.801 ± 2.971
7.124IleGlu: 7.124 ± 1.739
4.858IlePhe: 4.858 ± 1.251
3.886IleGly: 3.886 ± 1.028
1.295IleHis: 1.295 ± 0.725
3.562IleIle: 3.562 ± 0.853
5.829IleLys: 5.829 ± 1.338
3.886IleLeu: 3.886 ± 0.909
0.324IleMet: 0.324 ± 0.419
2.591IleAsn: 2.591 ± 1.027
3.562IlePro: 3.562 ± 1.073
2.591IleGln: 2.591 ± 0.713
3.238IleArg: 3.238 ± 0.992
8.744IleSer: 8.744 ± 2.33
2.915IleThr: 2.915 ± 0.763
2.915IleVal: 2.915 ± 0.67
0.324IleTrp: 0.324 ± 0.283
2.267IleTyr: 2.267 ± 0.824
0.0IleXaa: 0.0 ± 0.0
Lys
4.858LysAla: 4.858 ± 1.227
0.324LysCys: 0.324 ± 0.263
1.943LysAsp: 1.943 ± 0.88
10.039LysGlu: 10.039 ± 1.681
3.562LysPhe: 3.562 ± 0.772
4.534LysGly: 4.534 ± 1.267
2.591LysHis: 2.591 ± 0.852
4.858LysIle: 4.858 ± 1.156
6.477LysLys: 6.477 ± 1.705
7.772LysLeu: 7.772 ± 1.402
3.562LysMet: 3.562 ± 1.242
5.829LysAsn: 5.829 ± 1.225
3.238LysPro: 3.238 ± 1.283
4.534LysGln: 4.534 ± 1.348
5.829LysArg: 5.829 ± 1.704
4.534LysSer: 4.534 ± 0.914
5.829LysThr: 5.829 ± 1.323
4.534LysVal: 4.534 ± 0.796
0.648LysTrp: 0.648 ± 0.448
4.21LysTyr: 4.21 ± 1.257
0.0LysXaa: 0.0 ± 0.0
Leu
5.829LeuAla: 5.829 ± 1.416
0.972LeuCys: 0.972 ± 0.591
11.01LeuAsp: 11.01 ± 1.435
10.363LeuGlu: 10.363 ± 2.053
2.591LeuPhe: 2.591 ± 1.002
3.886LeuGly: 3.886 ± 1.406
0.324LeuHis: 0.324 ± 0.269
6.153LeuIle: 6.153 ± 1.944
7.448LeuLys: 7.448 ± 1.523
9.391LeuLeu: 9.391 ± 1.654
2.267LeuMet: 2.267 ± 0.9
6.477LeuAsn: 6.477 ± 1.415
4.858LeuPro: 4.858 ± 1.055
3.562LeuGln: 3.562 ± 1.266
3.886LeuArg: 3.886 ± 1.131
5.181LeuSer: 5.181 ± 1.401
5.505LeuThr: 5.505 ± 1.24
3.238LeuVal: 3.238 ± 1.021
1.295LeuTrp: 1.295 ± 0.663
4.858LeuTyr: 4.858 ± 1.031
0.0LeuXaa: 0.0 ± 0.0
Met
1.943MetAla: 1.943 ± 0.821
0.0MetCys: 0.0 ± 0.0
1.619MetAsp: 1.619 ± 0.709
3.238MetGlu: 3.238 ± 0.949
1.295MetPhe: 1.295 ± 0.558
2.267MetGly: 2.267 ± 0.815
0.0MetHis: 0.0 ± 0.0
0.972MetIle: 0.972 ± 0.569
2.267MetLys: 2.267 ± 1.151
1.295MetLeu: 1.295 ± 0.646
0.324MetMet: 0.324 ± 0.314
2.591MetAsn: 2.591 ± 0.841
0.648MetPro: 0.648 ± 0.386
0.648MetGln: 0.648 ± 0.518
1.943MetArg: 1.943 ± 0.86
1.619MetSer: 1.619 ± 0.587
2.915MetThr: 2.915 ± 1.175
0.972MetVal: 0.972 ± 0.636
0.324MetTrp: 0.324 ± 0.283
0.324MetTyr: 0.324 ± 0.353
0.0MetXaa: 0.0 ± 0.0
Asn
1.619AsnAla: 1.619 ± 0.915
0.324AsnCys: 0.324 ± 0.41
2.915AsnAsp: 2.915 ± 1.417
3.238AsnGlu: 3.238 ± 0.797
2.267AsnPhe: 2.267 ± 0.767
5.505AsnGly: 5.505 ± 1.099
1.295AsnHis: 1.295 ± 0.661
3.562AsnIle: 3.562 ± 1.177
7.124AsnLys: 7.124 ± 1.858
3.562AsnLeu: 3.562 ± 0.639
1.619AsnMet: 1.619 ± 0.742
2.267AsnAsn: 2.267 ± 0.864
2.591AsnPro: 2.591 ± 0.788
2.591AsnGln: 2.591 ± 0.929
1.619AsnArg: 1.619 ± 0.624
3.562AsnSer: 3.562 ± 0.88
3.238AsnThr: 3.238 ± 0.936
1.295AsnVal: 1.295 ± 0.712
0.324AsnTrp: 0.324 ± 0.269
1.295AsnTyr: 1.295 ± 0.513
0.0AsnXaa: 0.0 ± 0.0
Pro
2.267ProAla: 2.267 ± 1.056
0.324ProCys: 0.324 ± 0.283
2.267ProAsp: 2.267 ± 0.885
3.886ProGlu: 3.886 ± 1.121
1.943ProPhe: 1.943 ± 0.906
0.972ProGly: 0.972 ± 0.762
0.324ProHis: 0.324 ± 0.41
2.267ProIle: 2.267 ± 0.498
2.915ProLys: 2.915 ± 0.975
1.295ProLeu: 1.295 ± 0.62
0.648ProMet: 0.648 ± 0.402
1.619ProAsn: 1.619 ± 0.844
1.943ProPro: 1.943 ± 0.759
0.648ProGln: 0.648 ± 0.334
4.858ProArg: 4.858 ± 1.032
1.619ProSer: 1.619 ± 0.586
0.972ProThr: 0.972 ± 0.893
2.267ProVal: 2.267 ± 0.933
0.0ProTrp: 0.0 ± 0.0
1.295ProTyr: 1.295 ± 0.434
0.0ProXaa: 0.0 ± 0.0
Gln
3.238GlnAla: 3.238 ± 1.108
0.0GlnCys: 0.0 ± 0.0
1.295GlnAsp: 1.295 ± 0.771
3.562GlnGlu: 3.562 ± 1.161
0.972GlnPhe: 0.972 ± 0.653
3.562GlnGly: 3.562 ± 1.166
0.324GlnHis: 0.324 ± 0.263
2.267GlnIle: 2.267 ± 0.935
4.534GlnLys: 4.534 ± 0.981
5.181GlnLeu: 5.181 ± 1.419
0.972GlnMet: 0.972 ± 0.471
0.648GlnAsn: 0.648 ± 0.517
1.295GlnPro: 1.295 ± 0.604
1.295GlnGln: 1.295 ± 0.745
0.972GlnArg: 0.972 ± 0.584
1.295GlnSer: 1.295 ± 0.568
1.943GlnThr: 1.943 ± 0.588
3.886GlnVal: 3.886 ± 0.763
0.648GlnTrp: 0.648 ± 0.357
0.648GlnTyr: 0.648 ± 0.5
0.0GlnXaa: 0.0 ± 0.0
Arg
3.238ArgAla: 3.238 ± 1.104
0.648ArgCys: 0.648 ± 0.455
1.943ArgAsp: 1.943 ± 0.583
4.858ArgGlu: 4.858 ± 1.15
3.238ArgPhe: 3.238 ± 1.053
3.238ArgGly: 3.238 ± 1.272
1.619ArgHis: 1.619 ± 0.678
5.181ArgIle: 5.181 ± 0.841
6.477ArgLys: 6.477 ± 1.434
6.477ArgLeu: 6.477 ± 1.555
1.295ArgMet: 1.295 ± 0.764
2.915ArgAsn: 2.915 ± 0.917
0.972ArgPro: 0.972 ± 0.546
2.267ArgGln: 2.267 ± 0.878
2.267ArgArg: 2.267 ± 0.883
1.619ArgSer: 1.619 ± 0.559
2.915ArgThr: 2.915 ± 0.884
1.943ArgVal: 1.943 ± 0.683
0.648ArgTrp: 0.648 ± 0.385
3.238ArgTyr: 3.238 ± 1.329
0.0ArgXaa: 0.0 ± 0.0
Ser
2.267SerAla: 2.267 ± 1.097
0.648SerCys: 0.648 ± 0.446
2.591SerAsp: 2.591 ± 0.821
3.562SerGlu: 3.562 ± 1.329
2.915SerPhe: 2.915 ± 0.959
3.238SerGly: 3.238 ± 0.785
0.648SerHis: 0.648 ± 0.37
4.534SerIle: 4.534 ± 1.427
5.181SerLys: 5.181 ± 1.353
6.153SerLeu: 6.153 ± 1.773
2.267SerMet: 2.267 ± 0.739
3.886SerAsn: 3.886 ± 1.682
2.915SerPro: 2.915 ± 1.03
2.267SerGln: 2.267 ± 0.864
1.943SerArg: 1.943 ± 0.841
2.591SerSer: 2.591 ± 0.86
2.591SerThr: 2.591 ± 0.603
3.238SerVal: 3.238 ± 0.643
0.0SerTrp: 0.0 ± 0.0
2.591SerTyr: 2.591 ± 0.866
0.0SerXaa: 0.0 ± 0.0
Thr
1.943ThrAla: 1.943 ± 0.809
0.648ThrCys: 0.648 ± 0.471
1.295ThrAsp: 1.295 ± 0.661
2.267ThrGlu: 2.267 ± 0.749
2.591ThrPhe: 2.591 ± 0.737
2.591ThrGly: 2.591 ± 0.975
1.619ThrHis: 1.619 ± 0.654
4.534ThrIle: 4.534 ± 1.603
5.181ThrLys: 5.181 ± 1.403
6.477ThrLeu: 6.477 ± 1.318
2.267ThrMet: 2.267 ± 0.717
0.0ThrAsn: 0.0 ± 0.0
1.943ThrPro: 1.943 ± 0.612
1.943ThrGln: 1.943 ± 0.783
1.619ThrArg: 1.619 ± 0.499
3.886ThrSer: 3.886 ± 0.916
2.591ThrThr: 2.591 ± 0.964
4.21ThrVal: 4.21 ± 1.561
0.648ThrTrp: 0.648 ± 0.402
4.21ThrTyr: 4.21 ± 1.563
0.0ThrXaa: 0.0 ± 0.0
Val
2.915ValAla: 2.915 ± 1.173
0.648ValCys: 0.648 ± 0.422
1.943ValAsp: 1.943 ± 0.918
5.829ValGlu: 5.829 ± 1.348
1.295ValPhe: 1.295 ± 0.704
2.591ValGly: 2.591 ± 1.058
0.0ValHis: 0.0 ± 0.0
4.21ValIle: 4.21 ± 1.425
3.562ValLys: 3.562 ± 0.903
5.181ValLeu: 5.181 ± 1.195
1.619ValMet: 1.619 ± 0.847
2.591ValAsn: 2.591 ± 0.836
2.591ValPro: 2.591 ± 0.794
0.972ValGln: 0.972 ± 0.48
2.267ValArg: 2.267 ± 0.927
3.886ValSer: 3.886 ± 0.936
2.915ValThr: 2.915 ± 0.854
3.238ValVal: 3.238 ± 1.202
0.972ValTrp: 0.972 ± 0.587
1.619ValTyr: 1.619 ± 0.56
0.0ValXaa: 0.0 ± 0.0
Trp
0.324TrpAla: 0.324 ± 0.269
0.0TrpCys: 0.0 ± 0.0
0.648TrpAsp: 0.648 ± 0.559
1.295TrpGlu: 1.295 ± 0.592
0.648TrpPhe: 0.648 ± 0.451
0.648TrpGly: 0.648 ± 0.37
0.324TrpHis: 0.324 ± 0.283
0.324TrpIle: 0.324 ± 0.295
0.324TrpLys: 0.324 ± 0.314
1.295TrpLeu: 1.295 ± 0.467
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.324TrpGln: 0.324 ± 0.283
0.324TrpArg: 0.324 ± 0.283
0.972TrpSer: 0.972 ± 0.503
0.324TrpThr: 0.324 ± 0.263
0.972TrpVal: 0.972 ± 0.598
0.0TrpTrp: 0.0 ± 0.0
0.324TrpTyr: 0.324 ± 0.283
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.972TyrAla: 0.972 ± 0.654
0.972TyrCys: 0.972 ± 0.443
1.295TyrAsp: 1.295 ± 0.579
3.238TyrGlu: 3.238 ± 1.078
1.943TyrPhe: 1.943 ± 1.035
0.648TyrGly: 0.648 ± 0.558
1.295TyrHis: 1.295 ± 0.614
3.886TyrIle: 3.886 ± 1.259
7.448TyrLys: 7.448 ± 1.945
5.829TyrLeu: 5.829 ± 1.249
1.295TyrMet: 1.295 ± 0.858
2.267TyrAsn: 2.267 ± 0.74
1.943TyrPro: 1.943 ± 1.041
1.619TyrGln: 1.619 ± 0.844
3.562TyrArg: 3.562 ± 0.716
1.295TyrSer: 1.295 ± 0.461
1.619TyrThr: 1.619 ± 0.538
1.619TyrVal: 1.619 ± 0.606
0.324TyrTrp: 0.324 ± 0.366
0.324TyrTyr: 0.324 ± 0.366
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3089 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski