Amino acid dipeptide frequency for Streptococcus satellite phage Javan437

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
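The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: this page does not say exactly how pairs are counted or normalized, so the sketch counts overlapping adjacent pairs within each protein and divides by the total number of pairs counted. The names `dipeptide_per_mille`, `classify` and the toy sequences are hypothetical and not part of Proteome-pI.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
UNIFORM_PER_MILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs.

    Assumption: pairs never span two proteins, and the denominator is the
    total number of pairs counted across all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown letter to X (the "Xaa" class).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input, not real phage proteins.
toy = ["MKLLAE", "MKEL"]
freqs = dipeptide_per_mille(toy)
fraction_mk = freqs["MK"] * 1e-3  # per mille -> fraction
```

In the toy input, MK occurs in 2 of the 8 counted pairs, so its frequency is 250‰, which corresponds to a fraction of 0.25 and would be labeled "over" relative to the 2.5‰ uniform expectation.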

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.998 ± 0.682
AlaCys: 0.0 ± 0.0
AlaAsp: 5.653 ± 1.982
AlaGlu: 4.656 ± 1.164
AlaPhe: 3.658 ± 1.108
AlaGly: 1.663 ± 0.641
AlaHis: 1.663 ± 0.697
AlaIle: 5.986 ± 1.119
AlaLys: 3.991 ± 0.912
AlaLeu: 4.988 ± 1.032
AlaMet: 2.66 ± 1.134
AlaAsn: 1.33 ± 0.658
AlaPro: 1.663 ± 0.6
AlaGln: 2.328 ± 0.704
AlaArg: 2.328 ± 0.627
AlaSer: 2.993 ± 0.797
AlaThr: 4.323 ± 1.346
AlaVal: 3.658 ± 1.045
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.995 ± 0.775
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.333 ± 0.312
CysCys: 0.0 ± 0.0
CysAsp: 0.333 ± 0.373
CysGlu: 0.665 ± 0.415
CysPhe: 0.333 ± 0.466
CysGly: 0.0 ± 0.0
CysHis: 0.333 ± 0.296
CysIle: 0.0 ± 0.0
CysLys: 0.333 ± 0.289
CysLeu: 0.665 ± 0.422
CysMet: 0.333 ± 0.389
CysAsn: 0.665 ± 0.528
CysPro: 0.333 ± 0.296
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.333 ± 0.466
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.663 ± 0.473
AspCys: 0.0 ± 0.0
AspAsp: 3.991 ± 0.877
AspGlu: 4.988 ± 1.908
AspPhe: 3.991 ± 0.993
AspGly: 2.993 ± 0.979
AspHis: 0.333 ± 0.357
AspIle: 4.988 ± 0.903
AspLys: 5.986 ± 1.557
AspLeu: 6.651 ± 1.907
AspMet: 1.663 ± 0.756
AspAsn: 3.658 ± 1.047
AspPro: 0.665 ± 0.381
AspGln: 0.333 ± 0.289
AspArg: 3.326 ± 0.891
AspSer: 2.993 ± 1.264
AspThr: 4.323 ± 1.446
AspVal: 1.995 ± 0.528
AspTrp: 0.0 ± 0.0
AspTyr: 2.66 ± 0.72
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.326 ± 1.207
GluCys: 0.998 ± 0.614
GluAsp: 3.991 ± 1.327
GluGlu: 7.316 ± 2.12
GluPhe: 2.328 ± 0.924
GluGly: 2.328 ± 0.527
GluHis: 0.998 ± 0.523
GluIle: 9.977 ± 1.986
GluLys: 7.981 ± 1.577
GluLeu: 10.974 ± 1.844
GluMet: 2.328 ± 0.877
GluAsn: 3.658 ± 0.781
GluPro: 2.993 ± 1.109
GluGln: 5.653 ± 1.635
GluArg: 4.656 ± 1.087
GluSer: 3.658 ± 0.922
GluThr: 5.321 ± 1.121
GluVal: 7.649 ± 2.504
GluTrp: 0.333 ± 0.348
GluTyr: 5.321 ± 1.495
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.665 ± 0.505
PheAsp: 2.993 ± 1.045
PheGlu: 5.653 ± 2.115
PhePhe: 0.998 ± 0.442
PheGly: 0.998 ± 0.462
PheHis: 1.33 ± 0.469
PheIle: 1.663 ± 0.612
PheLys: 2.993 ± 1.03
PheLeu: 4.656 ± 1.066
PheMet: 0.998 ± 0.566
PheAsn: 1.995 ± 0.858
PhePro: 0.0 ± 0.0
PheGln: 1.33 ± 0.531
PheArg: 1.33 ± 0.677
PheSer: 3.991 ± 0.98
PheThr: 1.663 ± 0.862
PheVal: 2.328 ± 0.966
PheTrp: 0.998 ± 0.537
PheTyr: 2.993 ± 0.794
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.665 ± 0.503
GlyCys: 0.333 ± 0.466
GlyAsp: 1.995 ± 0.818
GlyGlu: 2.66 ± 0.677
GlyPhe: 2.328 ± 0.833
GlyGly: 1.995 ± 0.965
GlyHis: 0.998 ± 0.522
GlyIle: 4.323 ± 1.041
GlyLys: 4.988 ± 1.033
GlyLeu: 7.649 ± 1.51
GlyMet: 0.998 ± 0.503
GlyAsn: 2.328 ± 0.854
GlyPro: 0.0 ± 0.0
GlyGln: 1.663 ± 0.663
GlyArg: 4.656 ± 1.152
GlySer: 1.33 ± 0.723
GlyThr: 1.995 ± 0.635
GlyVal: 3.326 ± 0.981
GlyTrp: 0.998 ± 0.717
GlyTyr: 2.993 ± 0.879
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.663 ± 0.771
HisCys: 0.333 ± 0.466
HisAsp: 0.998 ± 0.666
HisGlu: 1.995 ± 0.895
HisPhe: 0.333 ± 0.373
HisGly: 0.333 ± 0.35
HisHis: 0.0 ± 0.0
HisIle: 0.998 ± 0.576
HisLys: 0.998 ± 0.47
HisLeu: 0.998 ± 0.635
HisMet: 0.333 ± 0.333
HisAsn: 0.665 ± 0.486
HisPro: 0.998 ± 0.602
HisGln: 0.665 ± 0.409
HisArg: 1.33 ± 0.693
HisSer: 0.998 ± 0.443
HisThr: 0.998 ± 0.684
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.998 ± 0.81
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.326 ± 1.117
IleCys: 0.333 ± 0.362
IleAsp: 4.323 ± 1.322
IleGlu: 7.316 ± 1.281
IlePhe: 2.66 ± 0.852
IleGly: 4.988 ± 1.091
IleHis: 0.333 ± 0.293
IleIle: 5.653 ± 1.433
IleLys: 7.316 ± 1.601
IleLeu: 6.651 ± 1.362
IleMet: 0.665 ± 0.414
IleAsn: 2.328 ± 1.108
IlePro: 1.663 ± 0.526
IleGln: 4.323 ± 1.291
IleArg: 2.993 ± 0.967
IleSer: 5.321 ± 1.432
IleThr: 3.991 ± 0.752
IleVal: 2.66 ± 0.645
IleTrp: 0.665 ± 0.433
IleTyr: 3.991 ± 1.295
IleXaa: 0.0 ± 0.0
Lys
LysAla: 9.644 ± 1.758
LysCys: 0.333 ± 0.348
LysAsp: 6.651 ± 1.163
LysGlu: 9.312 ± 1.482
LysPhe: 2.328 ± 0.849
LysGly: 5.321 ± 1.706
LysHis: 1.663 ± 0.732
LysIle: 5.321 ± 1.02
LysLys: 7.316 ± 1.628
LysLeu: 7.316 ± 1.71
LysMet: 2.328 ± 0.777
LysAsn: 6.651 ± 1.368
LysPro: 2.328 ± 0.919
LysGln: 5.321 ± 1.242
LysArg: 5.321 ± 1.34
LysSer: 7.316 ± 1.481
LysThr: 5.986 ± 1.846
LysVal: 2.993 ± 0.819
LysTrp: 0.998 ± 0.501
LysTyr: 1.663 ± 0.593
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.309 ± 1.539
LeuCys: 0.0 ± 0.0
LeuAsp: 6.984 ± 2.118
LeuGlu: 8.979 ± 1.952
LeuPhe: 4.656 ± 1.527
LeuGly: 4.656 ± 1.068
LeuHis: 1.33 ± 0.838
LeuIle: 4.323 ± 1.229
LeuLys: 9.977 ± 1.803
LeuLeu: 9.312 ± 1.29
LeuMet: 1.995 ± 0.814
LeuAsn: 6.984 ± 1.743
LeuPro: 3.991 ± 1.142
LeuGln: 4.323 ± 0.873
LeuArg: 4.323 ± 1.179
LeuSer: 3.658 ± 1.077
LeuThr: 7.316 ± 1.409
LeuVal: 3.991 ± 0.934
LeuTrp: 0.665 ± 0.481
LeuTyr: 3.991 ± 1.309
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.328 ± 1.031
MetCys: 0.0 ± 0.0
MetAsp: 2.993 ± 1.181
MetGlu: 0.998 ± 0.415
MetPhe: 0.665 ± 0.433
MetGly: 1.663 ± 0.641
MetHis: 0.333 ± 0.357
MetIle: 1.995 ± 0.774
MetLys: 3.326 ± 0.955
MetLeu: 0.998 ± 0.591
MetMet: 0.333 ± 0.348
MetAsn: 1.995 ± 0.828
MetPro: 0.333 ± 0.339
MetGln: 0.333 ± 0.312
MetArg: 1.33 ± 0.639
MetSer: 0.998 ± 0.687
MetThr: 2.328 ± 1.027
MetVal: 0.998 ± 0.647
MetTrp: 0.0 ± 0.0
MetTyr: 0.333 ± 0.293
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.326 ± 0.835
AsnCys: 0.333 ± 0.296
AsnAsp: 3.326 ± 1.146
AsnGlu: 3.658 ± 0.818
AsnPhe: 0.998 ± 0.715
AsnGly: 4.656 ± 1.405
AsnHis: 1.995 ± 0.932
AsnIle: 1.663 ± 0.696
AsnLys: 4.656 ± 1.204
AsnLeu: 2.993 ± 0.909
AsnMet: 1.33 ± 0.623
AsnAsn: 0.998 ± 0.512
AsnPro: 0.998 ± 0.453
AsnGln: 2.66 ± 0.656
AsnArg: 1.995 ± 0.693
AsnSer: 1.663 ± 0.734
AsnThr: 3.658 ± 1.079
AsnVal: 1.995 ± 0.783
AsnTrp: 1.33 ± 0.421
AsnTyr: 3.326 ± 1.171
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.663 ± 0.65
ProCys: 0.0 ± 0.0
ProAsp: 2.328 ± 0.792
ProGlu: 2.66 ± 1.137
ProPhe: 1.995 ± 0.79
ProGly: 0.0 ± 0.0
ProHis: 0.665 ± 0.486
ProIle: 1.33 ± 0.781
ProLys: 1.33 ± 0.661
ProLeu: 1.33 ± 0.863
ProMet: 0.665 ± 0.453
ProAsn: 1.33 ± 0.582
ProPro: 1.33 ± 0.789
ProGln: 0.333 ± 0.296
ProArg: 1.995 ± 0.747
ProSer: 1.995 ± 1.147
ProThr: 2.328 ± 0.833
ProVal: 0.998 ± 0.392
ProTrp: 0.0 ± 0.0
ProTyr: 2.993 ± 0.723
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.991 ± 1.172
GlnCys: 0.0 ± 0.0
GlnAsp: 1.663 ± 0.65
GlnGlu: 5.653 ± 1.674
GlnPhe: 1.33 ± 0.662
GlnGly: 2.328 ± 0.678
GlnHis: 0.665 ± 0.375
GlnIle: 2.66 ± 0.776
GlnLys: 3.658 ± 0.936
GlnLeu: 4.656 ± 0.858
GlnMet: 1.33 ± 0.877
GlnAsn: 1.995 ± 0.868
GlnPro: 0.665 ± 0.433
GlnGln: 3.991 ± 1.173
GlnArg: 2.66 ± 1.035
GlnSer: 1.663 ± 0.82
GlnThr: 2.328 ± 0.837
GlnVal: 2.66 ± 1.001
GlnTrp: 0.998 ± 0.529
GlnTyr: 0.665 ± 0.483
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.663 ± 0.427
ArgCys: 0.333 ± 0.357
ArgAsp: 1.995 ± 0.721
ArgGlu: 6.319 ± 1.126
ArgPhe: 1.33 ± 0.84
ArgGly: 2.66 ± 0.825
ArgHis: 0.665 ± 0.434
ArgIle: 3.991 ± 1.393
ArgLys: 5.321 ± 1.113
ArgLeu: 7.981 ± 1.54
ArgMet: 2.993 ± 0.694
ArgAsn: 1.663 ± 0.998
ArgPro: 1.663 ± 0.893
ArgGln: 3.658 ± 0.97
ArgArg: 2.328 ± 0.775
ArgSer: 1.995 ± 0.727
ArgThr: 2.66 ± 0.739
ArgVal: 1.33 ± 0.745
ArgTrp: 0.665 ± 0.39
ArgTyr: 2.66 ± 1.015
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.993 ± 1.132
SerCys: 0.0 ± 0.0
SerAsp: 1.995 ± 0.739
SerGlu: 6.319 ± 1.937
SerPhe: 1.663 ± 0.611
SerGly: 2.328 ± 1.164
SerHis: 0.333 ± 0.296
SerIle: 4.656 ± 0.952
SerLys: 6.319 ± 1.227
SerLeu: 5.321 ± 1.288
SerMet: 0.998 ± 0.835
SerAsn: 3.326 ± 1.155
SerPro: 2.328 ± 0.901
SerGln: 1.995 ± 0.726
SerArg: 2.328 ± 0.974
SerSer: 4.656 ± 1.122
SerThr: 2.993 ± 1.005
SerVal: 1.995 ± 1.021
SerTrp: 0.333 ± 0.289
SerTyr: 3.326 ± 1.003
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.326 ± 1.138
ThrCys: 0.333 ± 0.389
ThrAsp: 1.33 ± 0.602
ThrGlu: 3.326 ± 1.002
ThrPhe: 2.66 ± 0.783
ThrGly: 4.656 ± 1.01
ThrHis: 0.665 ± 0.387
ThrIle: 5.653 ± 1.503
ThrLys: 5.986 ± 1.164
ThrLeu: 4.988 ± 1.05
ThrMet: 0.998 ± 0.56
ThrAsn: 0.665 ± 0.48
ThrPro: 2.993 ± 0.976
ThrGln: 3.326 ± 1.065
ThrArg: 3.326 ± 0.884
ThrSer: 3.658 ± 0.853
ThrThr: 1.663 ± 0.656
ThrVal: 5.986 ± 1.614
ThrTrp: 0.665 ± 0.559
ThrTyr: 2.993 ± 1.381
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.326 ± 1.153
ValCys: 0.333 ± 0.289
ValAsp: 2.328 ± 1.002
ValGlu: 3.991 ± 1.735
ValPhe: 1.663 ± 0.643
ValGly: 1.995 ± 0.63
ValHis: 0.0 ± 0.0
ValIle: 3.326 ± 1.026
ValLys: 6.319 ± 1.879
ValLeu: 5.321 ± 1.684
ValMet: 0.0 ± 0.0
ValAsn: 2.66 ± 0.835
ValPro: 0.665 ± 0.415
ValGln: 1.995 ± 0.713
ValArg: 3.326 ± 0.812
ValSer: 2.66 ± 0.662
ValThr: 2.66 ± 0.764
ValVal: 2.66 ± 1.275
ValTrp: 0.665 ± 0.513
ValTyr: 1.995 ± 0.921
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.665 ± 0.387
TrpCys: 0.0 ± 0.0
TrpAsp: 0.333 ± 0.373
TrpGlu: 2.328 ± 0.784
TrpPhe: 0.333 ± 0.466
TrpGly: 0.333 ± 0.348
TrpHis: 0.0 ± 0.0
TrpIle: 0.998 ± 0.61
TrpLys: 1.33 ± 0.682
TrpLeu: 0.998 ± 0.602
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.665 ± 0.485
TrpSer: 0.998 ± 0.475
TrpThr: 0.333 ± 0.348
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.33 ± 0.666
TyrCys: 0.333 ± 0.296
TyrAsp: 1.33 ± 0.661
TyrGlu: 2.993 ± 0.744
TyrPhe: 3.326 ± 1.187
TyrGly: 2.328 ± 1.082
TyrHis: 1.33 ± 0.615
TyrIle: 1.995 ± 0.861
TyrLys: 5.986 ± 1.914
TyrLeu: 7.316 ± 1.603
TyrMet: 0.998 ± 0.529
TyrAsn: 2.66 ± 0.721
TyrPro: 1.663 ± 0.652
TyrGln: 1.33 ± 0.672
TyrArg: 3.658 ± 0.95
TyrSer: 3.326 ± 0.906
TyrThr: 2.328 ± 0.561
TyrVal: 0.665 ± 0.513
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.998 ± 0.497
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (3008 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
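A protein-level bootstrap of this kind might look like the sketch below. It is an assumption-laden illustration, not the database's code: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each replicate, and the reported error is taken to be the sample standard deviation of the replicates (this page does not state which statistic is used). The function names, the seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy = ["MKLLAE", "MKEL", "AEKLM", "LLKE"]
err_kl = bootstrap_error(toy, "KL")
err_ww = bootstrap_error(toy, "WW")  # absent dipeptide: every replicate is 0
```

Under this scheme a dipeptide that never occurs in any protein yields 0 in every replicate, so its error is 0, which matches the 0.0 ± 0.0 entries in the table.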

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski