Amino acid dipepetide frequency for Streptococcus satellite phage Javan619

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.935AlaAla: 0.935 ± 0.537
0.935AlaCys: 0.935 ± 0.449
3.742AlaAsp: 3.742 ± 0.752
3.742AlaGlu: 3.742 ± 1.906
2.495AlaPhe: 2.495 ± 0.731
2.183AlaGly: 2.183 ± 0.703
0.624AlaHis: 0.624 ± 0.445
3.118AlaIle: 3.118 ± 0.772
5.613AlaLys: 5.613 ± 1.522
4.989AlaLeu: 4.989 ± 1.32
2.183AlaMet: 2.183 ± 0.86
2.495AlaAsn: 2.495 ± 0.781
0.0AlaPro: 0.0 ± 0.0
2.183AlaGln: 2.183 ± 0.657
1.871AlaArg: 1.871 ± 0.775
4.054AlaSer: 4.054 ± 1.007
3.118AlaThr: 3.118 ± 0.867
2.183AlaVal: 2.183 ± 0.661
0.624AlaTrp: 0.624 ± 0.483
2.495AlaTyr: 2.495 ± 0.736
0.0AlaXaa: 0.0 ± 0.0
Cys
0.312CysAla: 0.312 ± 0.316
0.0CysCys: 0.0 ± 0.0
0.312CysAsp: 0.312 ± 0.329
0.935CysGlu: 0.935 ± 0.987
0.935CysPhe: 0.935 ± 0.442
0.312CysGly: 0.312 ± 0.316
0.0CysHis: 0.0 ± 0.0
1.871CysIle: 1.871 ± 0.822
0.0CysLys: 0.0 ± 0.0
0.935CysLeu: 0.935 ± 0.469
0.0CysMet: 0.0 ± 0.0
0.624CysAsn: 0.624 ± 0.347
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.624CysArg: 0.624 ± 0.376
0.312CysSer: 0.312 ± 0.316
0.312CysThr: 0.312 ± 0.316
0.312CysVal: 0.312 ± 0.242
0.0CysTrp: 0.0 ± 0.0
0.935CysTyr: 0.935 ± 0.405
0.0CysXaa: 0.0 ± 0.0
Asp
3.43AspAla: 3.43 ± 1.078
0.312AspCys: 0.312 ± 0.329
5.613AspAsp: 5.613 ± 1.564
3.43AspGlu: 3.43 ± 1.289
1.247AspPhe: 1.247 ± 0.571
1.871AspGly: 1.871 ± 0.66
0.0AspHis: 0.0 ± 0.0
6.548AspIle: 6.548 ± 1.621
7.795AspLys: 7.795 ± 1.522
6.86AspLeu: 6.86 ± 1.231
3.118AspMet: 3.118 ± 0.92
4.677AspAsn: 4.677 ± 0.873
0.624AspPro: 0.624 ± 0.483
0.935AspGln: 0.935 ± 0.414
0.935AspArg: 0.935 ± 0.51
3.118AspSer: 3.118 ± 0.885
1.871AspThr: 1.871 ± 0.663
4.989AspVal: 4.989 ± 1.089
1.247AspTrp: 1.247 ± 0.557
4.677AspTyr: 4.677 ± 0.819
0.0AspXaa: 0.0 ± 0.0
Glu
2.495GluAla: 2.495 ± 0.838
0.624GluCys: 0.624 ± 0.369
4.989GluAsp: 4.989 ± 1.268
6.236GluGlu: 6.236 ± 1.931
4.365GluPhe: 4.365 ± 0.989
1.559GluGly: 1.559 ± 0.681
2.183GluHis: 2.183 ± 1.048
7.484GluIle: 7.484 ± 1.621
9.355GluLys: 9.355 ± 2.287
12.785GluLeu: 12.785 ± 1.664
1.871GluMet: 1.871 ± 0.967
4.989GluAsn: 4.989 ± 1.373
3.118GluPro: 3.118 ± 0.569
3.43GluGln: 3.43 ± 0.982
3.742GluArg: 3.742 ± 1.877
4.677GluSer: 4.677 ± 1.29
3.118GluThr: 3.118 ± 0.861
4.054GluVal: 4.054 ± 0.866
0.935GluTrp: 0.935 ± 0.487
3.742GluTyr: 3.742 ± 1.035
0.0GluXaa: 0.0 ± 0.0
Phe
2.806PheAla: 2.806 ± 0.84
0.312PheCys: 0.312 ± 0.272
2.806PheAsp: 2.806 ± 1.015
4.365PheGlu: 4.365 ± 1.284
1.871PhePhe: 1.871 ± 0.595
1.247PheGly: 1.247 ± 0.542
1.247PheHis: 1.247 ± 0.58
2.495PheIle: 2.495 ± 0.792
3.118PheLys: 3.118 ± 0.632
5.301PheLeu: 5.301 ± 1.19
0.935PheMet: 0.935 ± 0.566
1.871PheAsn: 1.871 ± 0.649
1.559PhePro: 1.559 ± 0.713
0.312PheGln: 0.312 ± 0.329
2.495PheArg: 2.495 ± 1.138
3.118PheSer: 3.118 ± 0.824
2.495PheThr: 2.495 ± 0.83
3.742PheVal: 3.742 ± 0.941
0.312PheTrp: 0.312 ± 0.242
0.935PheTyr: 0.935 ± 0.535
0.0PheXaa: 0.0 ± 0.0
Gly
3.43GlyAla: 3.43 ± 1.0
1.559GlyCys: 1.559 ± 0.594
0.935GlyAsp: 0.935 ± 0.364
1.871GlyGlu: 1.871 ± 0.736
2.806GlyPhe: 2.806 ± 0.834
2.495GlyGly: 2.495 ± 0.757
0.312GlyHis: 0.312 ± 0.276
5.613GlyIle: 5.613 ± 1.162
3.43GlyLys: 3.43 ± 0.887
4.054GlyLeu: 4.054 ± 1.015
1.871GlyMet: 1.871 ± 0.662
1.247GlyAsn: 1.247 ± 0.477
0.0GlyPro: 0.0 ± 0.0
1.559GlyGln: 1.559 ± 0.539
0.312GlyArg: 0.312 ± 0.242
2.806GlySer: 2.806 ± 1.412
1.247GlyThr: 1.247 ± 0.774
3.43GlyVal: 3.43 ± 1.031
0.624GlyTrp: 0.624 ± 0.408
1.247GlyTyr: 1.247 ± 0.736
0.0GlyXaa: 0.0 ± 0.0
His
0.624HisAla: 0.624 ± 0.553
0.0HisCys: 0.0 ± 0.0
1.559HisAsp: 1.559 ± 0.703
0.312HisGlu: 0.312 ± 0.242
2.495HisPhe: 2.495 ± 0.723
0.935HisGly: 0.935 ± 0.443
0.0HisHis: 0.0 ± 0.0
2.183HisIle: 2.183 ± 0.996
1.247HisLys: 1.247 ± 0.63
1.247HisLeu: 1.247 ± 0.523
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.624HisPro: 0.624 ± 0.38
0.312HisGln: 0.312 ± 0.316
0.312HisArg: 0.312 ± 0.276
1.247HisSer: 1.247 ± 0.686
0.624HisThr: 0.624 ± 0.376
0.0HisVal: 0.0 ± 0.0
0.312HisTrp: 0.312 ± 0.273
0.935HisTyr: 0.935 ± 0.515
0.0HisXaa: 0.0 ± 0.0
Ile
4.989IleAla: 4.989 ± 1.352
0.312IleCys: 0.312 ± 0.316
5.925IleAsp: 5.925 ± 1.222
6.236IleGlu: 6.236 ± 1.535
3.118IlePhe: 3.118 ± 0.967
3.43IleGly: 3.43 ± 1.168
1.247IleHis: 1.247 ± 0.662
5.925IleIle: 5.925 ± 1.382
7.172IleLys: 7.172 ± 1.0
5.613IleLeu: 5.613 ± 1.69
0.935IleMet: 0.935 ± 0.545
3.742IleAsn: 3.742 ± 1.113
1.559IlePro: 1.559 ± 0.661
4.054IleGln: 4.054 ± 0.913
2.183IleArg: 2.183 ± 0.607
6.236IleSer: 6.236 ± 1.446
3.43IleThr: 3.43 ± 0.899
2.495IleVal: 2.495 ± 0.738
0.0IleTrp: 0.0 ± 0.0
4.054IleTyr: 4.054 ± 0.947
0.0IleXaa: 0.0 ± 0.0
Lys
5.925LysAla: 5.925 ± 1.533
0.0LysCys: 0.0 ± 0.0
4.365LysAsp: 4.365 ± 1.218
14.032LysGlu: 14.032 ± 1.915
1.247LysPhe: 1.247 ± 0.802
2.806LysGly: 2.806 ± 0.904
1.247LysHis: 1.247 ± 0.6
6.548LysIle: 6.548 ± 1.201
11.225LysLys: 11.225 ± 1.442
8.731LysLeu: 8.731 ± 2.041
2.495LysMet: 2.495 ± 0.952
7.795LysAsn: 7.795 ± 1.947
2.183LysPro: 2.183 ± 0.703
5.301LysGln: 5.301 ± 0.951
4.989LysArg: 4.989 ± 1.341
7.795LysSer: 7.795 ± 1.678
8.107LysThr: 8.107 ± 1.476
5.925LysVal: 5.925 ± 0.981
0.624LysTrp: 0.624 ± 0.373
4.677LysTyr: 4.677 ± 1.044
0.0LysXaa: 0.0 ± 0.0
Leu
5.301LeuAla: 5.301 ± 1.578
0.312LeuCys: 0.312 ± 0.242
5.613LeuAsp: 5.613 ± 1.273
7.484LeuGlu: 7.484 ± 1.916
4.677LeuPhe: 4.677 ± 1.134
4.054LeuGly: 4.054 ± 1.522
0.935LeuHis: 0.935 ± 0.412
8.107LeuIle: 8.107 ± 1.561
11.537LeuLys: 11.537 ± 1.889
12.473LeuLeu: 12.473 ± 2.25
3.118LeuMet: 3.118 ± 0.913
9.978LeuAsn: 9.978 ± 2.054
2.183LeuPro: 2.183 ± 0.489
6.236LeuGln: 6.236 ± 1.595
2.806LeuArg: 2.806 ± 0.721
3.742LeuSer: 3.742 ± 0.926
4.989LeuThr: 4.989 ± 1.417
5.613LeuVal: 5.613 ± 1.462
0.624LeuTrp: 0.624 ± 0.386
1.247LeuTyr: 1.247 ± 0.657
0.0LeuXaa: 0.0 ± 0.0
Met
1.247MetAla: 1.247 ± 0.628
0.0MetCys: 0.0 ± 0.0
2.183MetAsp: 2.183 ± 0.886
3.43MetGlu: 3.43 ± 1.119
1.559MetPhe: 1.559 ± 0.694
2.183MetGly: 2.183 ± 1.065
0.0MetHis: 0.0 ± 0.0
1.247MetIle: 1.247 ± 0.707
3.118MetLys: 3.118 ± 0.753
2.495MetLeu: 2.495 ± 0.733
0.935MetMet: 0.935 ± 0.479
0.935MetAsn: 0.935 ± 0.459
0.312MetPro: 0.312 ± 0.333
1.559MetGln: 1.559 ± 0.535
0.624MetArg: 0.624 ± 0.476
0.624MetSer: 0.624 ± 0.438
1.559MetThr: 1.559 ± 0.733
1.559MetVal: 1.559 ± 0.639
0.0MetTrp: 0.0 ± 0.0
0.312MetTyr: 0.312 ± 0.323
0.0MetXaa: 0.0 ± 0.0
Asn
2.806AsnAla: 2.806 ± 0.783
1.247AsnCys: 1.247 ± 0.779
4.989AsnAsp: 4.989 ± 1.151
9.355AsnGlu: 9.355 ± 1.663
2.183AsnPhe: 2.183 ± 0.891
3.742AsnGly: 3.742 ± 0.83
1.559AsnHis: 1.559 ± 0.519
2.806AsnIle: 2.806 ± 0.981
7.484AsnLys: 7.484 ± 1.668
5.925AsnLeu: 5.925 ± 1.806
0.624AsnMet: 0.624 ± 0.494
4.054AsnAsn: 4.054 ± 1.321
0.935AsnPro: 0.935 ± 0.411
1.871AsnGln: 1.871 ± 0.878
2.806AsnArg: 2.806 ± 0.931
4.365AsnSer: 4.365 ± 0.923
2.183AsnThr: 2.183 ± 0.865
2.495AsnVal: 2.495 ± 0.763
0.312AsnTrp: 0.312 ± 0.276
3.43AsnTyr: 3.43 ± 0.942
0.0AsnXaa: 0.0 ± 0.0
Pro
0.624ProAla: 0.624 ± 0.37
0.0ProCys: 0.0 ± 0.0
1.559ProAsp: 1.559 ± 0.639
1.559ProGlu: 1.559 ± 0.731
1.559ProPhe: 1.559 ± 0.552
0.0ProGly: 0.0 ± 0.0
0.312ProHis: 0.312 ± 0.316
0.624ProIle: 0.624 ± 0.392
2.495ProLys: 2.495 ± 0.993
2.183ProLeu: 2.183 ± 0.73
0.0ProMet: 0.0 ± 0.0
2.183ProAsn: 2.183 ± 0.924
0.935ProPro: 0.935 ± 0.459
0.312ProGln: 0.312 ± 0.316
1.247ProArg: 1.247 ± 0.544
0.624ProSer: 0.624 ± 0.435
1.871ProThr: 1.871 ± 0.65
1.559ProVal: 1.559 ± 0.75
0.0ProTrp: 0.0 ± 0.0
0.935ProTyr: 0.935 ± 0.457
0.0ProXaa: 0.0 ± 0.0
Gln
3.742GlnAla: 3.742 ± 0.919
0.312GlnCys: 0.312 ± 0.316
1.247GlnAsp: 1.247 ± 0.555
2.495GlnGlu: 2.495 ± 0.616
1.559GlnPhe: 1.559 ± 0.579
1.871GlnGly: 1.871 ± 0.744
0.312GlnHis: 0.312 ± 0.276
3.43GlnIle: 3.43 ± 1.014
2.183GlnLys: 2.183 ± 0.658
4.989GlnLeu: 4.989 ± 1.164
1.247GlnMet: 1.247 ± 0.593
2.495GlnAsn: 2.495 ± 0.981
0.935GlnPro: 0.935 ± 0.575
2.183GlnGln: 2.183 ± 0.629
2.183GlnArg: 2.183 ± 0.868
1.559GlnSer: 1.559 ± 0.652
1.559GlnThr: 1.559 ± 0.912
2.495GlnVal: 2.495 ± 0.661
0.312GlnTrp: 0.312 ± 0.329
2.183GlnTyr: 2.183 ± 0.686
0.0GlnXaa: 0.0 ± 0.0
Arg
2.806ArgAla: 2.806 ± 1.151
0.312ArgCys: 0.312 ± 0.316
2.806ArgAsp: 2.806 ± 0.957
3.43ArgGlu: 3.43 ± 1.257
2.495ArgPhe: 2.495 ± 1.103
0.312ArgGly: 0.312 ± 0.242
1.247ArgHis: 1.247 ± 0.573
2.495ArgIle: 2.495 ± 0.996
5.301ArgLys: 5.301 ± 1.546
4.365ArgLeu: 4.365 ± 1.002
0.935ArgMet: 0.935 ± 0.649
3.43ArgAsn: 3.43 ± 1.007
0.935ArgPro: 0.935 ± 0.575
1.871ArgGln: 1.871 ± 0.688
1.247ArgArg: 1.247 ± 0.738
0.624ArgSer: 0.624 ± 0.392
2.495ArgThr: 2.495 ± 0.864
2.495ArgVal: 2.495 ± 0.687
0.624ArgTrp: 0.624 ± 0.392
1.871ArgTyr: 1.871 ± 0.611
0.0ArgXaa: 0.0 ± 0.0
Ser
2.183SerAla: 2.183 ± 1.056
0.935SerCys: 0.935 ± 0.408
7.172SerAsp: 7.172 ± 1.529
6.236SerGlu: 6.236 ± 1.43
2.183SerPhe: 2.183 ± 0.681
3.43SerGly: 3.43 ± 1.085
0.624SerHis: 0.624 ± 0.331
3.43SerIle: 3.43 ± 1.057
7.172SerLys: 7.172 ± 1.539
3.118SerLeu: 3.118 ± 1.035
1.559SerMet: 1.559 ± 0.611
3.43SerAsn: 3.43 ± 0.839
0.935SerPro: 0.935 ± 0.568
1.871SerGln: 1.871 ± 0.653
1.559SerArg: 1.559 ± 0.63
4.054SerSer: 4.054 ± 1.96
3.742SerThr: 3.742 ± 1.227
2.183SerVal: 2.183 ± 0.734
0.624SerTrp: 0.624 ± 0.443
4.365SerTyr: 4.365 ± 0.895
0.0SerXaa: 0.0 ± 0.0
Thr
1.559ThrAla: 1.559 ± 0.767
0.0ThrCys: 0.0 ± 0.0
2.495ThrAsp: 2.495 ± 1.154
4.054ThrGlu: 4.054 ± 1.08
2.183ThrPhe: 2.183 ± 0.897
3.118ThrGly: 3.118 ± 0.71
0.935ThrHis: 0.935 ± 0.612
2.495ThrIle: 2.495 ± 0.92
5.925ThrLys: 5.925 ± 1.225
4.677ThrLeu: 4.677 ± 1.063
1.247ThrMet: 1.247 ± 0.588
4.054ThrAsn: 4.054 ± 1.069
1.247ThrPro: 1.247 ± 0.691
1.871ThrGln: 1.871 ± 0.675
2.806ThrArg: 2.806 ± 0.84
2.495ThrSer: 2.495 ± 1.079
2.806ThrThr: 2.806 ± 0.816
4.054ThrVal: 4.054 ± 1.777
0.312ThrTrp: 0.312 ± 0.351
2.806ThrTyr: 2.806 ± 0.669
0.0ThrXaa: 0.0 ± 0.0
Val
3.118ValAla: 3.118 ± 0.605
0.312ValCys: 0.312 ± 0.242
1.871ValAsp: 1.871 ± 0.523
3.43ValGlu: 3.43 ± 0.928
1.871ValPhe: 1.871 ± 0.954
2.183ValGly: 2.183 ± 0.585
0.935ValHis: 0.935 ± 0.638
4.054ValIle: 4.054 ± 0.971
4.989ValLys: 4.989 ± 0.918
5.301ValLeu: 5.301 ± 0.892
1.559ValMet: 1.559 ± 0.641
4.677ValAsn: 4.677 ± 0.955
1.871ValPro: 1.871 ± 0.664
1.559ValGln: 1.559 ± 0.689
3.43ValArg: 3.43 ± 1.49
5.301ValSer: 5.301 ± 1.094
2.806ValThr: 2.806 ± 0.912
4.054ValVal: 4.054 ± 0.84
0.624ValTrp: 0.624 ± 0.303
1.559ValTyr: 1.559 ± 0.909
0.0ValXaa: 0.0 ± 0.0
Trp
0.312TrpAla: 0.312 ± 0.276
0.0TrpCys: 0.0 ± 0.0
0.624TrpAsp: 0.624 ± 0.434
0.624TrpGlu: 0.624 ± 0.392
0.624TrpPhe: 0.624 ± 0.632
0.935TrpGly: 0.935 ± 0.526
0.0TrpHis: 0.0 ± 0.0
0.312TrpIle: 0.312 ± 0.273
0.312TrpLys: 0.312 ± 0.273
0.935TrpLeu: 0.935 ± 0.552
0.312TrpMet: 0.312 ± 0.329
0.624TrpAsn: 0.624 ± 0.341
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.935TrpArg: 0.935 ± 0.427
0.624TrpSer: 0.624 ± 0.331
0.312TrpThr: 0.312 ± 0.276
0.312TrpVal: 0.312 ± 0.333
0.312TrpTrp: 0.312 ± 0.276
0.624TrpTyr: 0.624 ± 0.373
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.935TyrAla: 0.935 ± 0.523
1.247TyrCys: 1.247 ± 0.631
2.495TyrAsp: 2.495 ± 0.723
2.495TyrGlu: 2.495 ± 0.696
1.871TyrPhe: 1.871 ± 0.64
2.183TyrGly: 2.183 ± 0.757
1.247TyrHis: 1.247 ± 0.654
1.871TyrIle: 1.871 ± 0.661
6.548TyrLys: 6.548 ± 1.509
4.365TyrLeu: 4.365 ± 0.83
0.624TyrMet: 0.624 ± 0.334
1.871TyrAsn: 1.871 ± 0.813
0.624TyrPro: 0.624 ± 0.373
1.871TyrGln: 1.871 ± 0.96
4.989TyrArg: 4.989 ± 0.867
3.43TyrSer: 3.43 ± 1.003
2.495TyrThr: 2.495 ± 1.17
1.559TyrVal: 1.559 ± 0.676
0.312TyrTrp: 0.312 ± 0.273
1.559TyrTyr: 1.559 ± 0.575
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 20 proteins (3208 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski