Amino acid dipepetide frequency for Streptococcus satellite phage Javan62

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.976AlaAla: 0.976 ± 0.464
0.325AlaCys: 0.325 ± 0.316
4.878AlaAsp: 4.878 ± 1.416
6.179AlaGlu: 6.179 ± 1.027
2.602AlaPhe: 2.602 ± 0.875
2.927AlaGly: 2.927 ± 1.028
0.65AlaHis: 0.65 ± 0.389
4.878AlaIle: 4.878 ± 1.365
3.577AlaLys: 3.577 ± 1.216
4.228AlaLeu: 4.228 ± 1.09
2.602AlaMet: 2.602 ± 0.93
2.276AlaAsn: 2.276 ± 0.799
2.276AlaPro: 2.276 ± 0.747
3.577AlaGln: 3.577 ± 0.863
3.252AlaArg: 3.252 ± 1.008
2.927AlaSer: 2.927 ± 1.123
3.577AlaThr: 3.577 ± 1.594
3.252AlaVal: 3.252 ± 1.278
0.325AlaTrp: 0.325 ± 0.277
2.602AlaTyr: 2.602 ± 0.963
0.0AlaXaa: 0.0 ± 0.0
Cys
0.65CysAla: 0.65 ± 0.36
0.0CysCys: 0.0 ± 0.0
1.626CysAsp: 1.626 ± 0.695
0.0CysGlu: 0.0 ± 0.0
0.325CysPhe: 0.325 ± 0.304
0.65CysGly: 0.65 ± 0.705
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.65CysLys: 0.65 ± 0.407
0.65CysLeu: 0.65 ± 0.468
0.325CysMet: 0.325 ± 0.351
0.976CysAsn: 0.976 ± 0.624
0.65CysPro: 0.65 ± 0.459
1.301CysGln: 1.301 ± 0.854
0.65CysArg: 0.65 ± 0.432
0.0CysSer: 0.0 ± 0.0
0.325CysThr: 0.325 ± 0.277
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.325CysTyr: 0.325 ± 0.274
0.0CysXaa: 0.0 ± 0.0
Asp
1.301AspAla: 1.301 ± 0.615
1.301AspCys: 1.301 ± 0.593
1.951AspAsp: 1.951 ± 0.731
2.276AspGlu: 2.276 ± 0.692
5.203AspPhe: 5.203 ± 1.206
1.951AspGly: 1.951 ± 0.62
0.325AspHis: 0.325 ± 0.283
4.878AspIle: 4.878 ± 1.1
5.203AspLys: 5.203 ± 1.336
6.504AspLeu: 6.504 ± 1.553
2.276AspMet: 2.276 ± 0.776
2.602AspAsn: 2.602 ± 0.831
1.951AspPro: 1.951 ± 0.705
1.626AspGln: 1.626 ± 0.773
3.252AspArg: 3.252 ± 0.816
2.927AspSer: 2.927 ± 0.923
1.951AspThr: 1.951 ± 0.744
2.602AspVal: 2.602 ± 1.052
0.325AspTrp: 0.325 ± 0.316
4.878AspTyr: 4.878 ± 1.293
0.0AspXaa: 0.0 ± 0.0
Glu
6.179GluAla: 6.179 ± 1.049
0.325GluCys: 0.325 ± 0.356
2.927GluAsp: 2.927 ± 0.919
3.902GluGlu: 3.902 ± 1.13
1.626GluPhe: 1.626 ± 0.726
3.577GluGly: 3.577 ± 1.088
1.301GluHis: 1.301 ± 0.549
3.577GluIle: 3.577 ± 0.919
5.854GluLys: 5.854 ± 1.147
8.78GluLeu: 8.78 ± 1.458
1.301GluMet: 1.301 ± 0.577
4.228GluAsn: 4.228 ± 1.641
2.276GluPro: 2.276 ± 0.984
2.276GluGln: 2.276 ± 0.898
5.854GluArg: 5.854 ± 1.629
4.228GluSer: 4.228 ± 1.046
1.951GluThr: 1.951 ± 0.629
2.602GluVal: 2.602 ± 0.648
0.65GluTrp: 0.65 ± 0.381
4.228GluTyr: 4.228 ± 1.222
0.0GluXaa: 0.0 ± 0.0
Phe
0.976PheAla: 0.976 ± 0.639
0.976PheCys: 0.976 ± 0.464
4.878PheAsp: 4.878 ± 1.203
3.252PheGlu: 3.252 ± 0.891
1.951PhePhe: 1.951 ± 0.769
3.577PheGly: 3.577 ± 0.942
0.325PheHis: 0.325 ± 0.285
6.179PheIle: 6.179 ± 1.453
2.927PheLys: 2.927 ± 0.714
4.878PheLeu: 4.878 ± 1.59
0.0PheMet: 0.0 ± 0.0
4.228PheAsn: 4.228 ± 1.124
0.976PhePro: 0.976 ± 0.641
1.951PheGln: 1.951 ± 0.752
1.951PheArg: 1.951 ± 0.865
2.927PheSer: 2.927 ± 0.819
1.951PheThr: 1.951 ± 0.749
1.301PheVal: 1.301 ± 0.706
0.976PheTrp: 0.976 ± 0.645
1.951PheTyr: 1.951 ± 0.673
0.0PheXaa: 0.0 ± 0.0
Gly
2.927GlyAla: 2.927 ± 1.314
0.65GlyCys: 0.65 ± 0.432
2.602GlyAsp: 2.602 ± 1.019
1.951GlyGlu: 1.951 ± 0.601
2.276GlyPhe: 2.276 ± 1.037
3.252GlyGly: 3.252 ± 1.286
1.301GlyHis: 1.301 ± 0.571
4.553GlyIle: 4.553 ± 0.764
4.553GlyLys: 4.553 ± 1.09
3.577GlyLeu: 3.577 ± 1.088
1.301GlyMet: 1.301 ± 0.733
3.252GlyAsn: 3.252 ± 1.113
0.325GlyPro: 0.325 ± 0.285
1.951GlyGln: 1.951 ± 0.688
3.902GlyArg: 3.902 ± 1.107
3.252GlySer: 3.252 ± 1.132
2.927GlyThr: 2.927 ± 0.729
3.252GlyVal: 3.252 ± 1.34
0.976GlyTrp: 0.976 ± 0.657
3.902GlyTyr: 3.902 ± 0.983
0.0GlyXaa: 0.0 ± 0.0
His
3.252HisAla: 3.252 ± 1.175
0.0HisCys: 0.0 ± 0.0
1.626HisAsp: 1.626 ± 0.877
1.301HisGlu: 1.301 ± 0.729
0.976HisPhe: 0.976 ± 0.452
2.927HisGly: 2.927 ± 0.9
0.0HisHis: 0.0 ± 0.0
0.325HisIle: 0.325 ± 0.361
1.626HisLys: 1.626 ± 0.795
2.276HisLeu: 2.276 ± 0.665
0.0HisMet: 0.0 ± 0.0
1.301HisAsn: 1.301 ± 0.653
0.325HisPro: 0.325 ± 0.419
0.325HisGln: 0.325 ± 0.304
0.0HisArg: 0.0 ± 0.0
0.976HisSer: 0.976 ± 0.568
1.301HisThr: 1.301 ± 0.412
0.325HisVal: 0.325 ± 0.304
0.0HisTrp: 0.0 ± 0.0
1.626HisTyr: 1.626 ± 0.612
0.0HisXaa: 0.0 ± 0.0
Ile
4.878IleAla: 4.878 ± 0.922
1.626IleCys: 1.626 ± 0.659
4.228IleAsp: 4.228 ± 1.064
3.902IleGlu: 3.902 ± 1.025
2.276IlePhe: 2.276 ± 1.055
1.301IleGly: 1.301 ± 0.634
1.626IleHis: 1.626 ± 0.741
4.553IleIle: 4.553 ± 0.879
5.203IleLys: 5.203 ± 1.002
6.179IleLeu: 6.179 ± 1.507
0.976IleMet: 0.976 ± 0.455
3.252IleAsn: 3.252 ± 0.923
3.252IlePro: 3.252 ± 0.549
1.626IleGln: 1.626 ± 0.672
2.276IleArg: 2.276 ± 0.826
5.528IleSer: 5.528 ± 1.113
5.854IleThr: 5.854 ± 1.022
2.276IleVal: 2.276 ± 0.85
0.976IleTrp: 0.976 ± 0.538
2.276IleTyr: 2.276 ± 0.648
0.0IleXaa: 0.0 ± 0.0
Lys
6.504LysAla: 6.504 ± 1.934
0.325LysCys: 0.325 ± 0.304
3.902LysAsp: 3.902 ± 0.958
7.154LysGlu: 7.154 ± 1.565
1.951LysPhe: 1.951 ± 0.539
5.854LysGly: 5.854 ± 1.623
3.252LysHis: 3.252 ± 1.013
6.179LysIle: 6.179 ± 1.247
4.878LysLys: 4.878 ± 1.509
8.13LysLeu: 8.13 ± 1.186
0.65LysMet: 0.65 ± 0.36
5.203LysAsn: 5.203 ± 0.893
5.854LysPro: 5.854 ± 1.385
2.927LysGln: 2.927 ± 1.012
7.154LysArg: 7.154 ± 1.83
4.553LysSer: 4.553 ± 0.884
6.179LysThr: 6.179 ± 1.261
3.252LysVal: 3.252 ± 0.859
0.65LysTrp: 0.65 ± 0.411
2.602LysTyr: 2.602 ± 0.892
0.0LysXaa: 0.0 ± 0.0
Leu
6.179LeuAla: 6.179 ± 1.484
0.325LeuCys: 0.325 ± 0.353
7.48LeuAsp: 7.48 ± 0.887
10.407LeuGlu: 10.407 ± 1.789
4.553LeuPhe: 4.553 ± 1.288
6.504LeuGly: 6.504 ± 1.651
1.301LeuHis: 1.301 ± 0.905
3.902LeuIle: 3.902 ± 1.106
8.455LeuLys: 8.455 ± 1.889
11.707LeuLeu: 11.707 ± 1.907
2.276LeuMet: 2.276 ± 0.699
4.228LeuAsn: 4.228 ± 1.336
3.577LeuPro: 3.577 ± 0.995
3.577LeuGln: 3.577 ± 0.819
3.577LeuArg: 3.577 ± 1.204
6.504LeuSer: 6.504 ± 1.475
7.154LeuThr: 7.154 ± 1.4
6.504LeuVal: 6.504 ± 1.225
0.65LeuTrp: 0.65 ± 0.432
4.228LeuTyr: 4.228 ± 1.059
0.0LeuXaa: 0.0 ± 0.0
Met
2.927MetAla: 2.927 ± 0.913
0.0MetCys: 0.0 ± 0.0
2.276MetAsp: 2.276 ± 0.758
1.951MetGlu: 1.951 ± 0.618
0.65MetPhe: 0.65 ± 0.482
0.325MetGly: 0.325 ± 0.316
0.0MetHis: 0.0 ± 0.0
0.65MetIle: 0.65 ± 0.412
1.951MetLys: 1.951 ± 0.923
1.951MetLeu: 1.951 ± 0.701
0.325MetMet: 0.325 ± 0.361
2.276MetAsn: 2.276 ± 0.744
0.325MetPro: 0.325 ± 0.316
0.325MetGln: 0.325 ± 0.328
0.65MetArg: 0.65 ± 0.468
1.301MetSer: 1.301 ± 0.559
3.577MetThr: 3.577 ± 1.06
2.927MetVal: 2.927 ± 0.639
0.325MetTrp: 0.325 ± 0.274
0.325MetTyr: 0.325 ± 0.356
0.0MetXaa: 0.0 ± 0.0
Asn
1.626AsnAla: 1.626 ± 0.613
0.0AsnCys: 0.0 ± 0.0
2.602AsnAsp: 2.602 ± 1.035
2.276AsnGlu: 2.276 ± 0.849
2.602AsnPhe: 2.602 ± 1.57
3.902AsnGly: 3.902 ± 1.073
2.276AsnHis: 2.276 ± 1.024
3.577AsnIle: 3.577 ± 1.494
4.878AsnLys: 4.878 ± 0.95
3.577AsnLeu: 3.577 ± 1.192
0.65AsnMet: 0.65 ± 0.57
2.927AsnAsn: 2.927 ± 1.448
3.577AsnPro: 3.577 ± 1.02
1.626AsnGln: 1.626 ± 0.639
4.228AsnArg: 4.228 ± 1.666
3.902AsnSer: 3.902 ± 1.08
2.276AsnThr: 2.276 ± 1.036
2.276AsnVal: 2.276 ± 0.619
0.65AsnTrp: 0.65 ± 0.389
2.927AsnTyr: 2.927 ± 0.731
0.0AsnXaa: 0.0 ± 0.0
Pro
2.602ProAla: 2.602 ± 0.608
0.0ProCys: 0.0 ± 0.0
1.626ProAsp: 1.626 ± 0.714
3.252ProGlu: 3.252 ± 0.939
3.577ProPhe: 3.577 ± 1.213
0.65ProGly: 0.65 ± 0.705
0.0ProHis: 0.0 ± 0.0
2.276ProIle: 2.276 ± 0.837
5.203ProLys: 5.203 ± 1.353
1.301ProLeu: 1.301 ± 0.527
0.976ProMet: 0.976 ± 0.51
2.276ProAsn: 2.276 ± 1.613
0.976ProPro: 0.976 ± 0.568
1.951ProGln: 1.951 ± 0.883
2.276ProArg: 2.276 ± 0.656
1.626ProSer: 1.626 ± 0.752
2.276ProThr: 2.276 ± 0.717
2.602ProVal: 2.602 ± 0.895
0.325ProTrp: 0.325 ± 0.316
1.301ProTyr: 1.301 ± 0.792
0.0ProXaa: 0.0 ± 0.0
Gln
2.602GlnAla: 2.602 ± 1.158
0.325GlnCys: 0.325 ± 0.304
1.301GlnAsp: 1.301 ± 0.719
2.927GlnGlu: 2.927 ± 0.809
0.65GlnPhe: 0.65 ± 0.412
1.951GlnGly: 1.951 ± 0.84
0.325GlnHis: 0.325 ± 0.285
1.951GlnIle: 1.951 ± 0.747
4.553GlnLys: 4.553 ± 1.594
5.854GlnLeu: 5.854 ± 1.109
1.626GlnMet: 1.626 ± 0.672
1.951GlnAsn: 1.951 ± 0.978
0.976GlnPro: 0.976 ± 0.808
2.276GlnGln: 2.276 ± 0.86
3.252GlnArg: 3.252 ± 0.762
1.951GlnSer: 1.951 ± 0.594
1.626GlnThr: 1.626 ± 0.58
1.626GlnVal: 1.626 ± 0.608
0.0GlnTrp: 0.0 ± 0.0
0.976GlnTyr: 0.976 ± 0.44
0.0GlnXaa: 0.0 ± 0.0
Arg
3.902ArgAla: 3.902 ± 1.142
0.976ArgCys: 0.976 ± 0.509
1.301ArgAsp: 1.301 ± 0.496
4.553ArgGlu: 4.553 ± 0.882
2.602ArgPhe: 2.602 ± 0.739
2.927ArgGly: 2.927 ± 0.935
1.301ArgHis: 1.301 ± 0.432
1.951ArgIle: 1.951 ± 0.634
5.528ArgLys: 5.528 ± 1.758
6.504ArgLeu: 6.504 ± 1.83
2.602ArgMet: 2.602 ± 1.243
2.927ArgAsn: 2.927 ± 0.963
2.276ArgPro: 2.276 ± 0.763
2.927ArgGln: 2.927 ± 0.968
2.276ArgArg: 2.276 ± 0.668
1.951ArgSer: 1.951 ± 0.673
2.927ArgThr: 2.927 ± 0.986
3.252ArgVal: 3.252 ± 0.917
0.325ArgTrp: 0.325 ± 0.347
3.577ArgTyr: 3.577 ± 1.484
0.0ArgXaa: 0.0 ± 0.0
Ser
1.951SerAla: 1.951 ± 0.968
0.65SerCys: 0.65 ± 0.468
3.577SerAsp: 3.577 ± 0.929
4.553SerGlu: 4.553 ± 1.212
2.276SerPhe: 2.276 ± 0.732
2.602SerGly: 2.602 ± 1.122
1.301SerHis: 1.301 ± 0.893
5.528SerIle: 5.528 ± 0.972
4.228SerLys: 4.228 ± 1.172
9.106SerLeu: 9.106 ± 2.001
1.951SerMet: 1.951 ± 0.779
2.602SerAsn: 2.602 ± 0.868
1.301SerPro: 1.301 ± 0.466
1.301SerGln: 1.301 ± 0.712
2.602SerArg: 2.602 ± 0.982
1.951SerSer: 1.951 ± 0.86
2.276SerThr: 2.276 ± 0.7
3.902SerVal: 3.902 ± 1.071
0.325SerTrp: 0.325 ± 0.316
2.927SerTyr: 2.927 ± 1.353
0.0SerXaa: 0.0 ± 0.0
Thr
2.276ThrAla: 2.276 ± 0.836
0.0ThrCys: 0.0 ± 0.0
2.602ThrAsp: 2.602 ± 0.939
1.951ThrGlu: 1.951 ± 0.811
7.805ThrPhe: 7.805 ± 1.723
3.252ThrGly: 3.252 ± 0.885
1.951ThrHis: 1.951 ± 0.617
2.927ThrIle: 2.927 ± 0.573
3.577ThrLys: 3.577 ± 1.31
7.805ThrLeu: 7.805 ± 1.215
1.626ThrMet: 1.626 ± 0.491
1.626ThrAsn: 1.626 ± 0.902
1.626ThrPro: 1.626 ± 0.571
1.626ThrGln: 1.626 ± 0.706
3.252ThrArg: 3.252 ± 0.904
2.927ThrSer: 2.927 ± 0.983
3.577ThrThr: 3.577 ± 1.18
4.553ThrVal: 4.553 ± 1.359
0.325ThrTrp: 0.325 ± 0.33
2.602ThrTyr: 2.602 ± 1.156
0.0ThrXaa: 0.0 ± 0.0
Val
4.228ValAla: 4.228 ± 1.131
0.65ValCys: 0.65 ± 0.433
2.276ValAsp: 2.276 ± 0.781
2.602ValGlu: 2.602 ± 0.967
1.301ValPhe: 1.301 ± 0.578
2.602ValGly: 2.602 ± 0.764
1.301ValHis: 1.301 ± 0.831
2.602ValIle: 2.602 ± 0.75
6.179ValLys: 6.179 ± 1.658
4.228ValLeu: 4.228 ± 0.939
1.626ValMet: 1.626 ± 0.901
1.626ValAsn: 1.626 ± 0.624
3.252ValPro: 3.252 ± 1.283
0.976ValGln: 0.976 ± 0.702
3.577ValArg: 3.577 ± 1.214
2.602ValSer: 2.602 ± 0.898
3.577ValThr: 3.577 ± 0.984
3.252ValVal: 3.252 ± 1.103
0.325ValTrp: 0.325 ± 0.361
3.252ValTyr: 3.252 ± 0.835
0.0ValXaa: 0.0 ± 0.0
Trp
0.325TrpAla: 0.325 ± 0.277
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.301TrpGlu: 1.301 ± 0.688
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.65TrpHis: 0.65 ± 0.407
0.65TrpIle: 0.65 ± 0.433
0.325TrpLys: 0.325 ± 0.408
1.626TrpLeu: 1.626 ± 0.555
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.325TrpPro: 0.325 ± 0.316
0.65TrpGln: 0.65 ± 0.407
0.0TrpArg: 0.0 ± 0.0
0.976TrpSer: 0.976 ± 0.481
0.325TrpThr: 0.325 ± 0.361
0.976TrpVal: 0.976 ± 0.455
0.0TrpTrp: 0.0 ± 0.0
0.325TrpTyr: 0.325 ± 0.316
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.626TyrAla: 1.626 ± 0.724
0.65TyrCys: 0.65 ± 0.401
1.626TyrAsp: 1.626 ± 0.609
1.951TyrGlu: 1.951 ± 0.782
2.927TyrPhe: 2.927 ± 1.181
1.951TyrGly: 1.951 ± 0.585
1.301TyrHis: 1.301 ± 0.715
2.927TyrIle: 2.927 ± 0.694
7.805TyrLys: 7.805 ± 1.856
4.228TyrLeu: 4.228 ± 1.51
1.626TyrMet: 1.626 ± 0.751
2.602TyrAsn: 2.602 ± 0.907
0.976TyrPro: 0.976 ± 0.685
3.902TyrGln: 3.902 ± 0.826
2.927TyrArg: 2.927 ± 1.006
3.902TyrSer: 3.902 ± 0.92
2.276TyrThr: 2.276 ± 0.955
1.301TyrVal: 1.301 ± 0.446
0.325TyrTrp: 0.325 ± 0.353
2.602TyrTyr: 2.602 ± 1.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 20 proteins (3076 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski