Amino acid dipepetide frequency for Streptococcus satellite phage Javan547

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.579AlaAla: 3.579 ± 1.071
0.358AlaCys: 0.358 ± 0.297
5.011AlaAsp: 5.011 ± 1.265
6.084AlaGlu: 6.084 ± 1.357
4.295AlaPhe: 4.295 ± 1.013
3.937AlaGly: 3.937 ± 1.016
0.716AlaHis: 0.716 ± 0.545
4.295AlaIle: 4.295 ± 0.686
6.442AlaLys: 6.442 ± 2.374
5.369AlaLeu: 5.369 ± 1.241
1.79AlaMet: 1.79 ± 0.756
3.579AlaAsn: 3.579 ± 1.018
2.863AlaPro: 2.863 ± 0.878
3.221AlaGln: 3.221 ± 0.882
2.863AlaArg: 2.863 ± 0.687
3.221AlaSer: 3.221 ± 1.011
4.295AlaThr: 4.295 ± 1.889
3.221AlaVal: 3.221 ± 0.97
0.0AlaTrp: 0.0 ± 0.0
1.79AlaTyr: 1.79 ± 0.803
0.0AlaXaa: 0.0 ± 0.0
Cys
0.358CysAla: 0.358 ± 0.325
0.0CysCys: 0.0 ± 0.0
1.432CysAsp: 1.432 ± 0.736
0.358CysGlu: 0.358 ± 0.429
0.0CysPhe: 0.0 ± 0.0
0.358CysGly: 0.358 ± 0.387
0.0CysHis: 0.0 ± 0.0
0.358CysIle: 0.358 ± 0.325
1.074CysLys: 1.074 ± 0.492
0.716CysLeu: 0.716 ± 0.745
0.0CysMet: 0.0 ± 0.0
0.716CysAsn: 0.716 ± 0.595
0.0CysPro: 0.0 ± 0.0
0.358CysGln: 0.358 ± 0.419
0.716CysArg: 0.716 ± 0.508
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.074AspAla: 1.074 ± 0.648
1.074AspCys: 1.074 ± 0.647
2.863AspAsp: 2.863 ± 1.271
4.295AspGlu: 4.295 ± 0.917
2.505AspPhe: 2.505 ± 0.512
3.221AspGly: 3.221 ± 0.977
0.716AspHis: 0.716 ± 0.635
5.369AspIle: 5.369 ± 1.064
3.579AspLys: 3.579 ± 0.874
4.653AspLeu: 4.653 ± 1.525
2.147AspMet: 2.147 ± 0.734
1.432AspAsn: 1.432 ± 0.57
1.074AspPro: 1.074 ± 0.511
0.716AspGln: 0.716 ± 0.508
3.579AspArg: 3.579 ± 1.145
2.505AspSer: 2.505 ± 0.652
2.147AspThr: 2.147 ± 0.627
1.79AspVal: 1.79 ± 0.948
0.358AspTrp: 0.358 ± 0.297
4.653AspTyr: 4.653 ± 0.878
0.0AspXaa: 0.0 ± 0.0
Glu
4.653GluAla: 4.653 ± 1.022
1.432GluCys: 1.432 ± 0.832
3.937GluAsp: 3.937 ± 1.019
7.158GluGlu: 7.158 ± 2.566
2.863GluPhe: 2.863 ± 0.856
2.863GluGly: 2.863 ± 1.023
1.432GluHis: 1.432 ± 0.587
6.442GluIle: 6.442 ± 1.777
9.306GluLys: 9.306 ± 2.285
8.948GluLeu: 8.948 ± 1.497
1.432GluMet: 1.432 ± 0.751
3.579GluAsn: 3.579 ± 1.193
2.863GluPro: 2.863 ± 1.276
3.221GluGln: 3.221 ± 1.241
3.579GluArg: 3.579 ± 1.515
5.369GluSer: 5.369 ± 1.317
5.011GluThr: 5.011 ± 1.761
2.147GluVal: 2.147 ± 0.866
1.074GluTrp: 1.074 ± 0.577
4.653GluTyr: 4.653 ± 1.452
0.0GluXaa: 0.0 ± 0.0
Phe
2.147PheAla: 2.147 ± 0.783
0.0PheCys: 0.0 ± 0.0
2.505PheAsp: 2.505 ± 0.711
4.295PheGlu: 4.295 ± 1.193
1.79PhePhe: 1.79 ± 1.312
2.863PheGly: 2.863 ± 1.33
0.716PheHis: 0.716 ± 0.485
3.579PheIle: 3.579 ± 1.045
2.863PheLys: 2.863 ± 1.038
3.579PheLeu: 3.579 ± 1.181
0.358PheMet: 0.358 ± 0.307
3.579PheAsn: 3.579 ± 1.242
0.0PhePro: 0.0 ± 0.0
1.432PheGln: 1.432 ± 0.97
2.505PheArg: 2.505 ± 1.045
2.147PheSer: 2.147 ± 0.995
3.579PheThr: 3.579 ± 1.276
1.79PheVal: 1.79 ± 0.781
0.358PheTrp: 0.358 ± 0.297
2.505PheTyr: 2.505 ± 0.762
0.0PheXaa: 0.0 ± 0.0
Gly
3.221GlyAla: 3.221 ± 1.045
0.716GlyCys: 0.716 ± 0.508
1.79GlyAsp: 1.79 ± 0.897
2.863GlyGlu: 2.863 ± 0.793
3.221GlyPhe: 3.221 ± 1.109
1.79GlyGly: 1.79 ± 0.72
0.716GlyHis: 0.716 ± 0.529
3.937GlyIle: 3.937 ± 1.114
5.011GlyLys: 5.011 ± 1.096
6.084GlyLeu: 6.084 ± 1.547
0.716GlyMet: 0.716 ± 0.605
2.863GlyAsn: 2.863 ± 1.034
0.358GlyPro: 0.358 ± 0.348
1.432GlyGln: 1.432 ± 0.751
3.579GlyArg: 3.579 ± 0.882
2.505GlySer: 2.505 ± 1.294
0.716GlyThr: 0.716 ± 0.46
5.369GlyVal: 5.369 ± 1.826
0.716GlyTrp: 0.716 ± 0.595
1.79GlyTyr: 1.79 ± 1.12
0.0GlyXaa: 0.0 ± 0.0
His
3.579HisAla: 3.579 ± 1.305
0.358HisCys: 0.358 ± 0.372
0.716HisAsp: 0.716 ± 0.462
1.432HisGlu: 1.432 ± 0.736
1.074HisPhe: 1.074 ± 0.498
0.358HisGly: 0.358 ± 0.372
0.358HisHis: 0.358 ± 0.325
0.716HisIle: 0.716 ± 0.412
1.79HisLys: 1.79 ± 0.969
3.221HisLeu: 3.221 ± 1.026
0.0HisMet: 0.0 ± 0.0
1.432HisAsn: 1.432 ± 0.736
0.716HisPro: 0.716 ± 0.462
0.358HisGln: 0.358 ± 0.419
0.358HisArg: 0.358 ± 0.429
1.432HisSer: 1.432 ± 0.7
1.432HisThr: 1.432 ± 0.635
0.358HisVal: 0.358 ± 0.429
0.0HisTrp: 0.0 ± 0.0
0.716HisTyr: 0.716 ± 0.595
0.0HisXaa: 0.0 ± 0.0
Ile
4.653IleAla: 4.653 ± 1.16
0.358IleCys: 0.358 ± 0.419
2.863IleAsp: 2.863 ± 1.05
5.369IleGlu: 5.369 ± 1.494
2.505IlePhe: 2.505 ± 0.896
3.221IleGly: 3.221 ± 0.978
1.432IleHis: 1.432 ± 0.67
4.295IleIle: 4.295 ± 1.138
6.084IleLys: 6.084 ± 1.53
7.158IleLeu: 7.158 ± 1.213
0.716IleMet: 0.716 ± 0.484
3.221IleAsn: 3.221 ± 1.112
3.937IlePro: 3.937 ± 1.462
3.579IleGln: 3.579 ± 2.043
1.79IleArg: 1.79 ± 0.852
5.011IleSer: 5.011 ± 1.615
3.579IleThr: 3.579 ± 0.799
2.505IleVal: 2.505 ± 0.651
0.358IleTrp: 0.358 ± 0.448
1.79IleTyr: 1.79 ± 0.596
0.0IleXaa: 0.0 ± 0.0
Lys
6.442LysAla: 6.442 ± 1.816
0.0LysCys: 0.0 ± 0.0
3.221LysAsp: 3.221 ± 0.919
11.811LysGlu: 11.811 ± 2.807
1.79LysPhe: 1.79 ± 0.625
3.937LysGly: 3.937 ± 1.48
2.505LysHis: 2.505 ± 0.851
3.579LysIle: 3.579 ± 1.074
6.084LysLys: 6.084 ± 1.913
7.158LysLeu: 7.158 ± 1.832
2.505LysMet: 2.505 ± 1.093
6.084LysAsn: 6.084 ± 1.199
5.369LysPro: 5.369 ± 1.174
3.579LysGln: 3.579 ± 1.216
3.937LysArg: 3.937 ± 1.321
4.653LysSer: 4.653 ± 1.402
9.306LysThr: 9.306 ± 1.746
6.084LysVal: 6.084 ± 1.629
0.358LysTrp: 0.358 ± 0.372
2.505LysTyr: 2.505 ± 0.95
0.0LysXaa: 0.0 ± 0.0
Leu
9.306LeuAla: 9.306 ± 1.973
0.358LeuCys: 0.358 ± 0.297
7.516LeuAsp: 7.516 ± 1.427
7.516LeuGlu: 7.516 ± 1.838
3.579LeuPhe: 3.579 ± 1.103
4.295LeuGly: 4.295 ± 1.642
0.358LeuHis: 0.358 ± 0.325
6.442LeuIle: 6.442 ± 1.468
8.232LeuLys: 8.232 ± 1.59
11.095LeuLeu: 11.095 ± 1.878
2.505LeuMet: 2.505 ± 0.844
5.727LeuAsn: 5.727 ± 1.799
2.863LeuPro: 2.863 ± 1.123
4.295LeuGln: 4.295 ± 0.985
1.79LeuArg: 1.79 ± 0.976
6.084LeuSer: 6.084 ± 1.484
5.727LeuThr: 5.727 ± 1.171
6.442LeuVal: 6.442 ± 1.541
0.716LeuTrp: 0.716 ± 0.492
2.863LeuTyr: 2.863 ± 0.674
0.0LeuXaa: 0.0 ± 0.0
Met
3.221MetAla: 3.221 ± 1.097
0.0MetCys: 0.0 ± 0.0
1.074MetAsp: 1.074 ± 0.584
2.147MetGlu: 2.147 ± 0.643
0.358MetPhe: 0.358 ± 0.332
0.716MetGly: 0.716 ± 0.46
0.358MetHis: 0.358 ± 0.334
1.432MetIle: 1.432 ± 0.707
2.147MetLys: 2.147 ± 0.788
1.432MetLeu: 1.432 ± 0.871
0.358MetMet: 0.358 ± 0.334
2.147MetAsn: 2.147 ± 0.784
0.716MetPro: 0.716 ± 0.438
0.0MetGln: 0.0 ± 0.0
1.432MetArg: 1.432 ± 1.018
1.432MetSer: 1.432 ± 0.57
3.221MetThr: 3.221 ± 1.111
1.432MetVal: 1.432 ± 0.585
0.0MetTrp: 0.0 ± 0.0
0.358MetTyr: 0.358 ± 0.329
0.0MetXaa: 0.0 ± 0.0
Asn
2.863AsnAla: 2.863 ± 1.046
0.0AsnCys: 0.0 ± 0.0
3.579AsnAsp: 3.579 ± 1.585
2.147AsnGlu: 2.147 ± 0.85
0.358AsnPhe: 0.358 ± 0.456
6.084AsnGly: 6.084 ± 1.017
1.79AsnHis: 1.79 ± 0.969
3.579AsnIle: 3.579 ± 1.172
4.295AsnLys: 4.295 ± 0.894
4.653AsnLeu: 4.653 ± 1.769
2.147AsnMet: 2.147 ± 0.986
2.147AsnAsn: 2.147 ± 1.051
1.79AsnPro: 1.79 ± 0.797
3.221AsnGln: 3.221 ± 1.226
2.863AsnArg: 2.863 ± 1.05
3.579AsnSer: 3.579 ± 0.804
2.147AsnThr: 2.147 ± 0.908
2.147AsnVal: 2.147 ± 0.928
0.716AsnTrp: 0.716 ± 0.911
1.79AsnTyr: 1.79 ± 0.762
0.0AsnXaa: 0.0 ± 0.0
Pro
1.432ProAla: 1.432 ± 0.599
0.0ProCys: 0.0 ± 0.0
2.147ProAsp: 2.147 ± 0.855
3.937ProGlu: 3.937 ± 1.48
2.863ProPhe: 2.863 ± 1.041
0.716ProGly: 0.716 ± 0.517
0.358ProHis: 0.358 ± 0.4
0.716ProIle: 0.716 ± 0.596
4.295ProLys: 4.295 ± 1.661
2.863ProLeu: 2.863 ± 1.044
0.716ProMet: 0.716 ± 0.468
2.147ProAsn: 2.147 ± 1.591
1.432ProPro: 1.432 ± 0.518
2.147ProGln: 2.147 ± 0.786
4.295ProArg: 4.295 ± 1.068
1.79ProSer: 1.79 ± 0.779
2.147ProThr: 2.147 ± 0.507
2.863ProVal: 2.863 ± 0.798
0.358ProTrp: 0.358 ± 0.297
1.432ProTyr: 1.432 ± 1.137
0.0ProXaa: 0.0 ± 0.0
Gln
2.863GlnAla: 2.863 ± 0.901
0.358GlnCys: 0.358 ± 0.387
2.505GlnAsp: 2.505 ± 0.834
2.863GlnGlu: 2.863 ± 1.055
2.863GlnPhe: 2.863 ± 1.166
2.147GlnGly: 2.147 ± 0.867
1.074GlnHis: 1.074 ± 0.563
3.221GlnIle: 3.221 ± 2.021
3.221GlnLys: 3.221 ± 1.325
4.295GlnLeu: 4.295 ± 1.255
0.716GlnMet: 0.716 ± 0.561
2.505GlnAsn: 2.505 ± 1.022
1.432GlnPro: 1.432 ± 0.806
2.863GlnGln: 2.863 ± 1.072
1.79GlnArg: 1.79 ± 0.535
2.505GlnSer: 2.505 ± 1.025
2.863GlnThr: 2.863 ± 1.068
2.505GlnVal: 2.505 ± 0.85
0.716GlnTrp: 0.716 ± 0.492
0.358GlnTyr: 0.358 ± 0.348
0.0GlnXaa: 0.0 ± 0.0
Arg
3.579ArgAla: 3.579 ± 0.732
0.358ArgCys: 0.358 ± 0.348
1.79ArgAsp: 1.79 ± 0.546
3.579ArgGlu: 3.579 ± 1.343
2.147ArgPhe: 2.147 ± 0.825
2.147ArgGly: 2.147 ± 0.948
0.716ArgHis: 0.716 ± 0.372
3.221ArgIle: 3.221 ± 0.966
3.579ArgLys: 3.579 ± 0.871
5.369ArgLeu: 5.369 ± 1.445
1.79ArgMet: 1.79 ± 0.9
2.505ArgAsn: 2.505 ± 0.92
1.79ArgPro: 1.79 ± 0.887
2.505ArgGln: 2.505 ± 1.232
1.432ArgArg: 1.432 ± 0.47
1.432ArgSer: 1.432 ± 0.767
4.653ArgThr: 4.653 ± 1.024
3.221ArgVal: 3.221 ± 0.845
0.358ArgTrp: 0.358 ± 0.332
1.074ArgTyr: 1.074 ± 0.58
0.0ArgXaa: 0.0 ± 0.0
Ser
2.147SerAla: 2.147 ± 0.715
0.0SerCys: 0.0 ± 0.0
2.147SerAsp: 2.147 ± 0.915
6.084SerGlu: 6.084 ± 1.644
2.147SerPhe: 2.147 ± 1.307
2.863SerGly: 2.863 ± 0.744
2.147SerHis: 2.147 ± 1.183
5.369SerIle: 5.369 ± 1.608
6.084SerLys: 6.084 ± 1.514
7.874SerLeu: 7.874 ± 1.442
0.716SerMet: 0.716 ± 0.512
1.79SerAsn: 1.79 ± 0.904
2.505SerPro: 2.505 ± 1.031
2.147SerGln: 2.147 ± 0.878
0.716SerArg: 0.716 ± 0.458
4.653SerSer: 4.653 ± 2.411
3.579SerThr: 3.579 ± 0.973
1.79SerVal: 1.79 ± 1.16
0.716SerTrp: 0.716 ± 0.435
5.011SerTyr: 5.011 ± 1.667
0.0SerXaa: 0.0 ± 0.0
Thr
5.011ThrAla: 5.011 ± 1.44
0.0ThrCys: 0.0 ± 0.0
1.432ThrAsp: 1.432 ± 0.743
3.937ThrGlu: 3.937 ± 1.177
5.011ThrPhe: 5.011 ± 1.798
3.221ThrGly: 3.221 ± 1.062
1.79ThrHis: 1.79 ± 0.643
3.579ThrIle: 3.579 ± 0.686
4.653ThrLys: 4.653 ± 1.417
5.011ThrLeu: 5.011 ± 1.166
1.432ThrMet: 1.432 ± 0.555
1.79ThrAsn: 1.79 ± 0.57
5.011ThrPro: 5.011 ± 1.294
1.79ThrGln: 1.79 ± 0.985
3.221ThrArg: 3.221 ± 0.662
4.653ThrSer: 4.653 ± 0.961
3.579ThrThr: 3.579 ± 1.444
4.653ThrVal: 4.653 ± 1.44
0.358ThrTrp: 0.358 ± 0.372
2.863ThrTyr: 2.863 ± 1.046
0.0ThrXaa: 0.0 ± 0.0
Val
4.653ValAla: 4.653 ± 1.136
0.716ValCys: 0.716 ± 0.548
1.79ValAsp: 1.79 ± 0.892
3.221ValGlu: 3.221 ± 0.944
2.863ValPhe: 2.863 ± 0.856
2.863ValGly: 2.863 ± 0.928
1.074ValHis: 1.074 ± 1.045
1.79ValIle: 1.79 ± 0.564
6.442ValLys: 6.442 ± 1.735
3.579ValLeu: 3.579 ± 0.94
1.074ValMet: 1.074 ± 0.638
2.147ValAsn: 2.147 ± 0.658
2.147ValPro: 2.147 ± 1.008
3.221ValGln: 3.221 ± 0.933
3.221ValArg: 3.221 ± 0.873
3.937ValSer: 3.937 ± 1.25
3.221ValThr: 3.221 ± 1.152
2.505ValVal: 2.505 ± 0.827
0.358ValTrp: 0.358 ± 0.334
2.147ValTyr: 2.147 ± 1.11
0.0ValXaa: 0.0 ± 0.0
Trp
0.716TrpAla: 0.716 ± 0.412
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.716TrpGlu: 0.716 ± 0.559
0.0TrpPhe: 0.0 ± 0.0
0.358TrpGly: 0.358 ± 0.325
1.074TrpHis: 1.074 ± 0.627
0.358TrpIle: 0.358 ± 0.297
0.358TrpLys: 0.358 ± 0.448
0.716TrpLeu: 0.716 ± 0.372
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.716TrpPro: 0.716 ± 0.485
0.358TrpGln: 0.358 ± 0.297
0.358TrpArg: 0.358 ± 0.297
0.716TrpSer: 0.716 ± 0.458
0.0TrpThr: 0.0 ± 0.0
0.716TrpVal: 0.716 ± 0.411
0.0TrpTrp: 0.0 ± 0.0
0.358TrpTyr: 0.358 ± 0.297
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.432TyrAla: 1.432 ± 0.714
0.358TyrCys: 0.358 ± 0.372
1.432TyrAsp: 1.432 ± 0.684
2.147TyrGlu: 2.147 ± 0.893
0.358TyrPhe: 0.358 ± 0.348
1.79TyrGly: 1.79 ± 0.618
1.074TyrHis: 1.074 ± 0.51
2.147TyrIle: 2.147 ± 0.69
5.369TyrLys: 5.369 ± 1.686
3.937TyrLeu: 3.937 ± 1.092
2.505TyrMet: 2.505 ± 0.856
2.505TyrAsn: 2.505 ± 1.366
1.074TyrPro: 1.074 ± 0.777
3.221TyrGln: 3.221 ± 1.007
3.221TyrArg: 3.221 ± 0.642
2.863TyrSer: 2.863 ± 1.241
1.79TyrThr: 1.79 ± 0.711
1.432TyrVal: 1.432 ± 0.541
0.0TyrTrp: 0.0 ± 0.0
1.432TyrTyr: 1.432 ± 0.759
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 16 proteins (2795 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski