Amino acid dipeptide frequency for Streptococcus satellite phage Javan198

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
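As an illustration of the quantities described above, here is a minimal Python sketch that counts overlapping dipeptides and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille`, the toy sequences, and the choice to count pairs only within each protein and to map any non-standard letter to `X` (Xaa) are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X stands in for Xaa (assumption of this sketch)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs within this protein only (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    pairs = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {p: 0.0 for p in pairs}
    return {p: 1000.0 * counts[p] / total for p in pairs}


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400

# Hypothetical toy input, not the Javan198 proteome.
toy_proteins = ["MKKLE", "AKKE"]
freqs = dipeptide_permille(toy_proteins)

# Dipeptides above the uniform expectation (the ones the table would mark in red).
over = sorted(p for p, f in freqs.items() if f > UNIFORM_PERMILLE)

print("KK per mille:", freqs["KK"])
print("KK as a fraction:", freqs["KK"] * 1e-3)  # per mille times 10^-3
print("overrepresented:", over)
```

The uniform 2.5‰ baseline is the simple expectation stated in the text; a baseline built from the single amino acid frequencies of the proteome would be a different, stricter comparison.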

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.631 ± 0.947
AlaCys: 1.089 ± 0.534
AlaAsp: 3.994 ± 0.899
AlaGlu: 5.084 ± 1.494
AlaPhe: 2.905 ± 1.153
AlaGly: 2.542 ± 1.214
AlaHis: 0.0 ± 0.0
AlaIle: 3.994 ± 1.194
AlaLys: 4.357 ± 1.355
AlaLeu: 2.542 ± 1.111
AlaMet: 1.816 ± 0.853
AlaAsn: 3.631 ± 1.341
AlaPro: 1.089 ± 0.588
AlaGln: 2.542 ± 0.696
AlaArg: 2.905 ± 1.233
AlaSer: 4.357 ± 0.94
AlaThr: 4.357 ± 1.244
AlaVal: 3.268 ± 0.864
AlaTrp: 0.726 ± 0.481
AlaTyr: 2.542 ± 0.791
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.363 ± 0.33
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.363 ± 0.35
CysLys: 0.363 ± 0.288
CysLeu: 0.0 ± 0.0
CysMet: 0.363 ± 0.346
CysAsn: 1.089 ± 0.479
CysPro: 0.363 ± 0.33
CysGln: 1.452 ± 0.703
CysArg: 0.363 ± 0.347
CysSer: 0.363 ± 0.288
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.905 ± 0.896
AspCys: 0.363 ± 0.288
AspAsp: 2.542 ± 1.059
AspGlu: 3.994 ± 1.437
AspPhe: 2.542 ± 1.051
AspGly: 2.542 ± 0.873
AspHis: 0.0 ± 0.0
AspIle: 6.899 ± 1.6
AspLys: 5.084 ± 0.996
AspLeu: 3.994 ± 0.922
AspMet: 2.542 ± 0.727
AspAsn: 5.81 ± 1.085
AspPro: 0.363 ± 0.288
AspGln: 1.089 ± 0.724
AspArg: 1.452 ± 0.776
AspSer: 3.994 ± 0.977
AspThr: 3.268 ± 1.121
AspVal: 3.994 ± 1.005
AspTrp: 0.726 ± 0.458
AspTyr: 2.905 ± 1.056
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.631 ± 1.284
GluCys: 0.0 ± 0.0
GluAsp: 4.357 ± 1.141
GluGlu: 7.988 ± 1.812
GluPhe: 3.268 ± 1.306
GluGly: 3.268 ± 1.051
GluHis: 1.089 ± 0.58
GluIle: 6.899 ± 1.959
GluLys: 8.351 ± 1.798
GluLeu: 12.346 ± 2.075
GluMet: 1.816 ± 0.8
GluAsn: 6.536 ± 1.183
GluPro: 1.816 ± 0.965
GluGln: 4.357 ± 1.023
GluArg: 2.542 ± 0.959
GluSer: 2.179 ± 0.718
GluThr: 3.994 ± 0.88
GluVal: 4.357 ± 1.111
GluTrp: 1.816 ± 0.875
GluTyr: 3.268 ± 1.55
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.816 ± 0.696
PheCys: 0.0 ± 0.0
PheAsp: 3.268 ± 0.969
PheGlu: 1.452 ± 0.801
PhePhe: 2.542 ± 1.181
PheGly: 1.452 ± 0.501
PheHis: 1.089 ± 0.42
PheIle: 3.268 ± 1.259
PheLys: 2.542 ± 0.672
PheLeu: 3.631 ± 0.927
PheMet: 1.816 ± 0.763
PheAsn: 1.089 ± 0.565
PhePro: 0.726 ± 0.46
PheGln: 0.726 ± 0.455
PheArg: 1.816 ± 0.667
PheSer: 3.994 ± 0.83
PheThr: 1.452 ± 0.529
PheVal: 3.268 ± 1.041
PheTrp: 0.0 ± 0.0
PheTyr: 0.363 ± 0.375
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.816 ± 0.652
GlyCys: 0.726 ± 0.466
GlyAsp: 1.089 ± 0.42
GlyGlu: 3.994 ± 1.088
GlyPhe: 1.452 ± 0.61
GlyGly: 2.542 ± 0.92
GlyHis: 1.452 ± 0.519
GlyIle: 3.994 ± 1.201
GlyLys: 5.084 ± 1.746
GlyLeu: 2.905 ± 1.03
GlyMet: 1.089 ± 0.468
GlyAsn: 5.084 ± 1.655
GlyPro: 1.089 ± 0.632
GlyGln: 1.452 ± 0.557
GlyArg: 0.726 ± 0.56
GlySer: 1.089 ± 0.593
GlyThr: 2.542 ± 1.277
GlyVal: 5.084 ± 1.618
GlyTrp: 0.363 ± 0.33
GlyTyr: 2.179 ± 0.662
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.363 ± 0.288
HisGlu: 1.452 ± 0.618
HisPhe: 1.089 ± 0.531
HisGly: 1.816 ± 0.673
HisHis: 0.726 ± 0.464
HisIle: 0.726 ± 0.439
HisLys: 0.726 ± 0.505
HisLeu: 1.089 ± 0.525
HisMet: 0.0 ± 0.0
HisAsn: 1.452 ± 0.62
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 1.089 ± 0.558
HisSer: 0.726 ± 0.383
HisThr: 2.179 ± 1.077
HisVal: 0.726 ± 0.52
HisTrp: 0.363 ± 0.39
HisTyr: 0.363 ± 0.381
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.905 ± 0.795
IleCys: 0.363 ± 0.475
IleAsp: 4.72 ± 1.352
IleGlu: 7.262 ± 1.468
IlePhe: 1.816 ± 0.719
IleGly: 4.357 ± 0.916
IleHis: 0.726 ± 0.536
IleIle: 6.536 ± 2.466
IleLys: 8.351 ± 1.305
IleLeu: 6.899 ± 1.687
IleMet: 1.452 ± 0.612
IleAsn: 4.72 ± 1.3
IlePro: 2.542 ± 0.618
IleGln: 2.542 ± 0.7
IleArg: 2.542 ± 0.938
IleSer: 6.899 ± 2.912
IleThr: 4.357 ± 0.841
IleVal: 4.357 ± 1.103
IleTrp: 0.726 ± 0.464
IleTyr: 1.816 ± 0.651
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.988 ± 2.158
LysCys: 0.363 ± 0.359
LysAsp: 5.447 ± 1.525
LysGlu: 10.53 ± 1.724
LysPhe: 1.452 ± 0.539
LysGly: 3.994 ± 0.862
LysHis: 2.179 ± 0.649
LysIle: 5.447 ± 1.299
LysLys: 9.078 ± 1.423
LysLeu: 7.625 ± 1.297
LysMet: 1.452 ± 0.842
LysAsn: 6.173 ± 1.265
LysPro: 1.452 ± 0.74
LysGln: 6.899 ± 1.179
LysArg: 3.994 ± 1.11
LysSer: 3.268 ± 1.225
LysThr: 6.899 ± 1.58
LysVal: 5.447 ± 1.222
LysTrp: 1.452 ± 0.815
LysTyr: 5.81 ± 1.293
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.625 ± 1.351
LeuCys: 0.363 ± 0.33
LeuAsp: 7.988 ± 1.622
LeuGlu: 9.441 ± 1.919
LeuPhe: 4.357 ± 0.818
LeuGly: 3.994 ± 1.168
LeuHis: 1.089 ± 0.539
LeuIle: 3.268 ± 1.178
LeuLys: 8.351 ± 1.419
LeuLeu: 8.351 ± 1.368
LeuMet: 1.452 ± 0.987
LeuAsn: 6.173 ± 0.834
LeuPro: 2.179 ± 0.526
LeuGln: 3.631 ± 0.8
LeuArg: 3.268 ± 1.058
LeuSer: 6.173 ± 1.059
LeuThr: 5.81 ± 1.146
LeuVal: 3.268 ± 1.037
LeuTrp: 0.726 ± 0.5
LeuTyr: 3.631 ± 0.923
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.452 ± 0.69
MetCys: 0.0 ± 0.0
MetAsp: 2.179 ± 0.956
MetGlu: 2.179 ± 1.013
MetPhe: 0.363 ± 0.343
MetGly: 1.089 ± 0.663
MetHis: 0.363 ± 0.375
MetIle: 2.179 ± 0.818
MetLys: 2.542 ± 0.79
MetLeu: 3.994 ± 1.09
MetMet: 2.542 ± 1.057
MetAsn: 1.816 ± 0.527
MetPro: 0.726 ± 0.545
MetGln: 1.452 ± 0.819
MetArg: 2.905 ± 0.829
MetSer: 0.363 ± 0.383
MetThr: 2.179 ± 0.598
MetVal: 0.363 ± 0.381
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.084 ± 1.12
AsnCys: 0.0 ± 0.0
AsnAsp: 5.81 ± 1.172
AsnGlu: 3.631 ± 1.304
AsnPhe: 0.726 ± 0.431
AsnGly: 3.994 ± 1.174
AsnHis: 0.0 ± 0.0
AsnIle: 1.452 ± 0.605
AsnLys: 6.536 ± 1.56
AsnLeu: 6.173 ± 1.163
AsnMet: 2.179 ± 0.705
AsnAsn: 2.905 ± 0.804
AsnPro: 3.268 ± 0.922
AsnGln: 3.994 ± 0.789
AsnArg: 3.268 ± 0.909
AsnSer: 3.268 ± 1.016
AsnThr: 3.631 ± 0.925
AsnVal: 2.179 ± 0.7
AsnTrp: 1.452 ± 0.68
AsnTyr: 2.542 ± 1.131
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.363 ± 0.347
ProCys: 0.0 ± 0.0
ProAsp: 0.363 ± 0.381
ProGlu: 3.268 ± 0.943
ProPhe: 1.816 ± 0.812
ProGly: 0.726 ± 0.499
ProHis: 0.0 ± 0.0
ProIle: 2.905 ± 0.968
ProLys: 3.631 ± 1.045
ProLeu: 1.816 ± 0.664
ProMet: 0.0 ± 0.0
ProAsn: 2.542 ± 0.979
ProPro: 1.452 ± 0.809
ProGln: 0.726 ± 0.526
ProArg: 0.363 ± 0.347
ProSer: 0.363 ± 0.288
ProThr: 2.179 ± 1.057
ProVal: 2.179 ± 0.732
ProTrp: 0.363 ± 0.288
ProTyr: 1.452 ± 0.721
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.631 ± 1.338
GlnCys: 0.363 ± 0.33
GlnAsp: 2.542 ± 0.865
GlnGlu: 3.268 ± 1.079
GlnPhe: 1.452 ± 0.743
GlnGly: 1.452 ± 0.663
GlnHis: 0.726 ± 0.458
GlnIle: 3.268 ± 1.016
GlnLys: 5.81 ± 1.159
GlnLeu: 4.357 ± 1.579
GlnMet: 1.452 ± 0.652
GlnAsn: 0.726 ± 0.607
GlnPro: 1.089 ± 0.544
GlnGln: 3.631 ± 1.406
GlnArg: 2.179 ± 0.607
GlnSer: 6.173 ± 1.02
GlnThr: 1.452 ± 0.803
GlnVal: 1.816 ± 1.116
GlnTrp: 0.726 ± 0.533
GlnTyr: 1.816 ± 0.758
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.179 ± 1.008
ArgCys: 0.0 ± 0.0
ArgAsp: 2.542 ± 0.685
ArgGlu: 4.72 ± 1.204
ArgPhe: 0.726 ± 0.516
ArgGly: 1.089 ± 0.485
ArgHis: 0.726 ± 0.383
ArgIle: 3.268 ± 1.255
ArgLys: 4.72 ± 1.159
ArgLeu: 3.994 ± 0.821
ArgMet: 2.179 ± 0.728
ArgAsn: 1.452 ± 0.637
ArgPro: 0.363 ± 0.375
ArgGln: 2.179 ± 1.135
ArgArg: 2.905 ± 1.099
ArgSer: 0.0 ± 0.0
ArgThr: 2.542 ± 0.826
ArgVal: 3.994 ± 0.993
ArgTrp: 0.363 ± 0.35
ArgTyr: 1.452 ± 1.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.542 ± 0.56
SerCys: 0.0 ± 0.0
SerAsp: 2.542 ± 0.951
SerGlu: 4.72 ± 1.569
SerPhe: 3.268 ± 0.845
SerGly: 1.816 ± 0.878
SerHis: 0.363 ± 0.33
SerIle: 6.899 ± 1.283
SerLys: 7.988 ± 1.652
SerLeu: 4.72 ± 1.299
SerMet: 1.089 ± 0.604
SerAsn: 3.631 ± 0.866
SerPro: 3.268 ± 1.166
SerGln: 2.542 ± 1.016
SerArg: 1.452 ± 0.802
SerSer: 3.631 ± 0.727
SerThr: 0.726 ± 0.387
SerVal: 3.268 ± 0.871
SerTrp: 0.363 ± 0.347
SerTyr: 3.268 ± 0.945
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.994 ± 1.199
ThrCys: 0.0 ± 0.0
ThrAsp: 2.905 ± 0.985
ThrGlu: 3.994 ± 1.337
ThrPhe: 0.726 ± 0.436
ThrGly: 5.084 ± 1.228
ThrHis: 2.542 ± 0.728
ThrIle: 3.994 ± 0.943
ThrLys: 3.268 ± 1.264
ThrLeu: 5.084 ± 1.168
ThrMet: 1.452 ± 0.665
ThrAsn: 2.179 ± 0.497
ThrPro: 2.179 ± 0.626
ThrGln: 3.268 ± 1.243
ThrArg: 1.816 ± 0.833
ThrSer: 2.179 ± 0.668
ThrThr: 5.447 ± 1.461
ThrVal: 4.357 ± 1.249
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.905 ± 0.752
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.816 ± 0.775
ValCys: 0.0 ± 0.0
ValAsp: 2.542 ± 0.822
ValGlu: 2.542 ± 1.063
ValPhe: 2.179 ± 0.955
ValGly: 1.816 ± 0.762
ValHis: 0.726 ± 0.433
ValIle: 5.81 ± 1.085
ValLys: 4.72 ± 1.606
ValLeu: 5.81 ± 1.318
ValMet: 2.179 ± 0.963
ValAsn: 2.542 ± 0.744
ValPro: 2.542 ± 0.848
ValGln: 2.179 ± 0.812
ValArg: 4.357 ± 0.745
ValSer: 5.447 ± 1.427
ValThr: 2.179 ± 0.717
ValVal: 5.447 ± 1.069
ValTrp: 0.726 ± 0.492
ValTyr: 3.994 ± 1.302
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.726 ± 0.466
TrpCys: 0.363 ± 0.376
TrpAsp: 0.726 ± 0.419
TrpGlu: 1.816 ± 0.61
TrpPhe: 0.363 ± 0.343
TrpGly: 0.0 ± 0.0
TrpHis: 0.363 ± 0.347
TrpIle: 1.452 ± 0.604
TrpLys: 1.452 ± 0.66
TrpLeu: 1.452 ± 0.711
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.363 ± 0.347
TrpArg: 0.363 ± 0.376
TrpSer: 0.726 ± 0.533
TrpThr: 0.363 ± 0.347
TrpVal: 0.0 ± 0.0
TrpTrp: 0.363 ± 0.347
TrpTyr: 0.726 ± 0.536
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.268 ± 1.218
TyrCys: 1.089 ± 0.597
TyrAsp: 1.089 ± 1.026
TyrGlu: 2.179 ± 0.938
TyrPhe: 3.268 ± 0.937
TyrGly: 2.179 ± 0.688
TyrHis: 0.726 ± 0.527
TyrIle: 3.268 ± 1.107
TyrLys: 4.357 ± 1.467
TyrLeu: 4.357 ± 1.004
TyrMet: 1.452 ± 0.581
TyrAsn: 2.542 ± 0.638
TyrPro: 0.0 ± 0.0
TyrGln: 2.905 ± 0.551
TyrArg: 1.089 ± 0.695
TyrSer: 2.905 ± 1.069
TyrThr: 1.816 ± 0.685
TyrVal: 2.179 ± 0.9
TyrTrp: 0.363 ± 0.347
TyrTyr: 1.452 ± 0.673
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 17 proteins (2755 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
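One plausible reading of a protein-level bootstrap is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The names `pair_permille` and `bootstrap_error`, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error measure are assumptions of this sketch; the source states only that bootstrapping was done 100 times at the protein level.

```python
import random
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the per mille estimate over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return stdev(estimates)


# Hypothetical toy input, not the Javan198 proteome.
toy_proteins = ["MKKLE", "AKKE", "GSTAV", "KKKK"]
value = pair_permille(toy_proteins, "KK")
error = bootstrap_error(toy_proteins, "KK")
print(f"LysLys: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the set. With only 17 proteins, as here, such errors can be large relative to the values.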

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski