Amino acid dipeptide frequency for Cellulophaga phage phi12a:1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
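As an illustration of the counting described above, the following minimal Python sketch computes dipeptide frequencies in per mille for a list of protein sequences and compares them with the uniform expectation of 1/400. It is not the Proteome-pI implementation: the function names (dipeptide_permille, classify), the use of one-letter codes, and the choice to count overlapping pairs within each protein separately are assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids as one-letter codes (assumption: input uses one-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Pairs are counted inside each protein only, so no pair spans two proteins.
    Any residue outside the 20 standard letters is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"
```

For instance, classify(10.521) (the LeuLeu value in the table) gives "over", while classify(0.501) gives "under".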

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
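The unit conversion is simple arithmetic; a small Python sketch makes it explicit. The helper names are hypothetical and exist only for this example.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0
```

With these helpers, the LeuLeu value of 10.521‰ corresponds to a fraction of about 0.010521, or about 1.0521%, and the uniform expectation of 2.5‰ corresponds to 0.25%.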

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is frequency ± error (‰).

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.004 ± 0.892 | 0.0 ± 0.0 | 1.503 ± 1.031 | 2.004 ± 0.843 | 3.507 ± 1.895 | 1.002 ± 0.74 | 1.503 ± 0.858 | 5.511 ± 1.752 | 7.014 ± 2.929 | 4.509 ± 2.13 | 1.503 ± 0.959 | 3.507 ± 1.304 | 0.0 ± 0.0 | 3.006 ± 1.153 | 2.004 ± 1.023 | 2.505 ± 0.932 | 3.507 ± 1.632 | 4.509 ± 1.445 | 1.503 ± 0.751 | 1.002 ± 0.63 | 0.0 ± 0.0
Cys | 1.503 ± 1.293 | 0.501 ± 0.481 | 1.002 ± 0.647 | 0.501 ± 0.539 | 0.501 ± 0.539 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.503 ± 0.67 | 0.0 ± 0.0 | 1.503 ± 0.773 | 0.0 ± 0.0 | 0.501 ± 0.39 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.501 ± 0.39 | 1.503 ± 0.81 | 0.0 ± 0.0 | 0.501 ± 0.593 | 0.0 ± 0.0 | 0.501 ± 0.639 | 0.0 ± 0.0
Asp | 4.008 ± 1.688 | 1.002 ± 1.204 | 0.0 ± 0.0 | 3.006 ± 0.94 | 3.507 ± 1.937 | 3.006 ± 1.209 | 0.0 ± 0.0 | 2.505 ± 1.528 | 1.503 ± 0.827 | 5.01 ± 2.054 | 0.0 ± 0.0 | 2.004 ± 1.224 | 2.004 ± 1.057 | 2.004 ± 1.002 | 1.002 ± 0.608 | 2.004 ± 1.002 | 3.006 ± 1.504 | 5.01 ± 2.325 | 1.503 ± 0.809 | 2.505 ± 1.148 | 0.0 ± 0.0
Glu | 2.004 ± 0.791 | 0.0 ± 0.0 | 2.505 ± 1.175 | 0.501 ± 0.495 | 3.006 ± 1.162 | 2.505 ± 1.309 | 0.501 ± 0.602 | 6.012 ± 1.788 | 3.006 ± 1.127 | 7.014 ± 2.132 | 1.002 ± 0.779 | 3.006 ± 0.731 | 0.0 ± 0.0 | 3.006 ± 1.291 | 1.503 ± 0.938 | 4.509 ± 1.864 | 3.006 ± 1.407 | 1.503 ± 0.667 | 0.501 ± 0.39 | 1.002 ± 0.566 | 0.0 ± 0.0
Phe | 1.503 ± 1.299 | 1.002 ± 0.506 | 3.006 ± 1.393 | 2.505 ± 1.145 | 2.505 ± 0.858 | 2.505 ± 0.78 | 0.501 ± 0.593 | 4.008 ± 1.84 | 9.018 ± 1.772 | 2.505 ± 0.98 | 2.004 ± 0.831 | 5.01 ± 2.122 | 2.004 ± 0.864 | 2.505 ± 1.53 | 4.008 ± 1.328 | 2.004 ± 0.912 | 3.507 ± 1.621 | 2.505 ± 1.55 | 1.002 ± 0.638 | 2.505 ± 1.195 | 0.0 ± 0.0
Gly | 2.004 ± 0.977 | 0.0 ± 0.0 | 0.501 ± 0.39 | 2.505 ± 1.81 | 2.505 ± 0.871 | 3.006 ± 1.49 | 0.501 ± 0.539 | 3.507 ± 1.317 | 4.509 ± 1.167 | 6.012 ± 1.216 | 3.006 ± 1.204 | 3.507 ± 1.162 | 0.501 ± 0.481 | 2.004 ± 1.03 | 2.505 ± 0.825 | 4.008 ± 1.586 | 6.012 ± 1.484 | 6.513 ± 1.628 | 0.0 ± 0.0 | 4.008 ± 1.155 | 0.0 ± 0.0
His | 2.004 ± 0.956 | 0.0 ± 0.0 | 0.501 ± 0.593 | 2.004 ± 0.886 | 2.505 ± 1.509 | 1.503 ± 0.751 | 0.0 ± 0.0 | 1.503 ± 0.975 | 1.503 ± 1.279 | 1.002 ± 0.74 | 0.0 ± 0.0 | 1.002 ± 0.647 | 0.501 ± 0.495 | 0.0 ± 0.0 | 1.503 ± 1.09 | 0.501 ± 0.593 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.501 ± 0.593 | 0.501 ± 0.39 | 0.0 ± 0.0
Ile | 5.511 ± 1.299 | 0.501 ± 0.593 | 4.509 ± 1.917 | 4.008 ± 1.574 | 4.008 ± 1.222 | 3.006 ± 1.339 | 0.501 ± 0.593 | 5.01 ± 1.351 | 9.519 ± 2.252 | 7.014 ± 1.664 | 0.501 ± 0.713 | 4.509 ± 1.283 | 3.006 ± 1.529 | 2.004 ± 0.752 | 2.505 ± 0.966 | 7.014 ± 2.205 | 5.01 ± 2.108 | 5.01 ± 1.612 | 1.503 ± 0.999 | 2.505 ± 0.796 | 0.0 ± 0.0
Lys | 3.507 ± 1.339 | 0.501 ± 0.539 | 3.507 ± 1.098 | 4.509 ± 1.522 | 3.507 ± 1.574 | 7.014 ± 1.808 | 0.501 ± 0.39 | 7.014 ± 1.522 | 8.016 ± 2.665 | 9.018 ± 2.015 | 4.008 ± 1.293 | 7.014 ± 2.385 | 3.507 ± 1.123 | 2.505 ± 1.136 | 5.01 ± 2.391 | 6.012 ± 1.445 | 7.515 ± 1.652 | 3.006 ± 0.745 | 2.004 ± 1.049 | 4.509 ± 1.779 | 0.0 ± 0.0
Leu | 5.01 ± 1.185 | 1.503 ± 1.112 | 4.509 ± 1.661 | 3.507 ± 1.356 | 8.016 ± 2.221 | 3.006 ± 1.186 | 1.503 ± 0.651 | 8.517 ± 1.946 | 7.515 ± 2.068 | 10.521 ± 1.784 | 3.507 ± 1.085 | 5.01 ± 1.742 | 3.507 ± 1.153 | 2.505 ± 0.969 | 1.503 ± 0.761 | 9.519 ± 1.647 | 7.014 ± 2.503 | 5.511 ± 1.78 | 0.0 ± 0.0 | 5.01 ± 1.849 | 0.0 ± 0.0
Met | 0.501 ± 0.39 | 0.501 ± 0.539 | 0.0 ± 0.0 | 1.503 ± 0.662 | 1.002 ± 0.779 | 1.503 ± 0.933 | 0.0 ± 0.0 | 2.505 ± 1.025 | 2.505 ± 1.31 | 1.002 ± 0.661 | 1.002 ± 0.665 | 2.004 ± 0.895 | 0.501 ± 0.39 | 2.004 ± 0.722 | 1.503 ± 0.895 | 2.004 ± 1.183 | 0.0 ± 0.0 | 3.507 ± 1.247 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 4.509 ± 1.875 | 1.002 ± 1.186 | 2.505 ± 0.958 | 3.507 ± 1.786 | 2.004 ± 1.037 | 8.016 ± 1.641 | 2.004 ± 0.701 | 4.509 ± 1.109 | 7.014 ± 2.437 | 4.509 ± 1.77 | 1.002 ± 0.63 | 5.01 ± 2.135 | 0.0 ± 0.0 | 2.505 ± 1.415 | 3.006 ± 1.109 | 3.507 ± 1.156 | 5.511 ± 1.18 | 4.509 ± 0.989 | 0.501 ± 0.44 | 3.507 ± 1.437 | 0.0 ± 0.0
Pro | 0.0 ± 0.0 | 0.501 ± 0.539 | 0.501 ± 0.481 | 0.0 ± 0.0 | 1.002 ± 0.779 | 0.501 ± 0.39 | 1.002 ± 0.816 | 3.507 ± 1.31 | 0.0 ± 0.0 | 4.008 ± 1.178 | 0.501 ± 0.39 | 3.507 ± 1.949 | 0.0 ± 0.0 | 0.501 ± 0.539 | 1.503 ± 0.839 | 1.002 ± 0.589 | 1.002 ± 0.628 | 2.004 ± 1.017 | 0.501 ± 0.42 | 1.503 ± 0.716 | 0.0 ± 0.0
Gln | 2.505 ± 1.095 | 0.0 ± 0.0 | 1.503 ± 0.7 | 1.002 ± 0.963 | 2.004 ± 0.937 | 1.503 ± 0.853 | 0.501 ± 0.42 | 2.004 ± 1.193 | 3.507 ± 1.569 | 5.01 ± 1.799 | 1.002 ± 0.665 | 3.507 ± 1.703 | 0.501 ± 0.539 | 2.505 ± 0.976 | 2.505 ± 1.243 | 1.002 ± 0.63 | 2.505 ± 0.989 | 1.002 ± 0.698 | 0.0 ± 0.0 | 2.004 ± 0.932 | 0.0 ± 0.0
Arg | 0.501 ± 0.478 | 0.501 ± 0.39 | 3.006 ± 1.029 | 1.503 ± 0.853 | 2.004 ± 0.975 | 1.503 ± 0.667 | 1.002 ± 0.885 | 3.006 ± 1.484 | 4.509 ± 1.118 | 4.509 ± 1.561 | 1.002 ± 0.7 | 3.006 ± 1.36 | 1.002 ± 0.661 | 1.002 ± 0.753 | 2.004 ± 1.128 | 4.008 ± 1.076 | 2.505 ± 0.827 | 2.505 ± 1.512 | 0.501 ± 0.602 | 2.505 ± 0.965 | 0.0 ± 0.0
Ser | 2.505 ± 1.121 | 2.004 ± 0.896 | 5.01 ± 1.529 | 7.014 ± 1.598 | 2.505 ± 1.017 | 4.008 ± 1.749 | 3.507 ± 1.738 | 3.507 ± 1.204 | 4.509 ± 0.992 | 4.509 ± 1.431 | 0.501 ± 0.44 | 5.511 ± 2.239 | 3.507 ± 1.641 | 3.507 ± 1.532 | 3.507 ± 1.428 | 3.507 ± 1.418 | 5.511 ± 1.802 | 4.008 ± 1.232 | 0.501 ± 0.42 | 0.501 ± 0.593 | 0.0 ± 0.0
Thr | 4.509 ± 1.01 | 0.501 ± 0.39 | 3.006 ± 1.13 | 1.503 ± 0.741 | 3.507 ± 0.907 | 5.511 ± 1.168 | 2.004 ± 1.214 | 4.509 ± 1.154 | 6.012 ± 2.133 | 7.515 ± 1.423 | 0.0 ± 0.0 | 4.008 ± 0.961 | 0.0 ± 0.0 | 1.002 ± 0.598 | 3.006 ± 1.284 | 7.014 ± 1.861 | 1.002 ± 0.733 | 3.006 ± 0.993 | 0.501 ± 0.478 | 3.507 ± 1.927 | 0.0 ± 0.0
Val | 3.507 ± 1.399 | 0.0 ± 0.0 | 5.01 ± 1.014 | 3.006 ± 1.139 | 2.004 ± 1.253 | 4.509 ± 1.167 | 0.501 ± 0.39 | 3.006 ± 1.345 | 8.016 ± 2.009 | 9.018 ± 1.408 | 0.0 ± 0.0 | 4.008 ± 1.228 | 1.503 ± 0.992 | 2.505 ± 0.914 | 0.0 ± 0.0 | 3.006 ± 1.237 | 3.507 ± 1.151 | 10.02 ± 3.848 | 1.503 ± 0.896 | 3.507 ± 1.664 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.501 ± 0.44 | 2.004 ± 1.354 | 1.503 ± 1.062 | 0.501 ± 0.495 | 1.002 ± 0.647 | 0.501 ± 0.495 | 1.002 ± 0.881 | 0.0 ± 0.0 | 0.501 ± 0.539 | 0.0 ± 0.0 | 0.501 ± 0.481 | 1.002 ± 0.779 | 1.503 ± 0.847 | 0.0 ± 0.0 | 1.002 ± 0.789 | 0.501 ± 0.495 | 1.503 ± 0.971 | 0.0 ± 0.0
Tyr | 4.509 ± 1.504 | 0.501 ± 0.39 | 2.505 ± 1.001 | 1.503 ± 0.662 | 4.509 ± 1.496 | 2.004 ± 0.618 | 0.501 ± 0.539 | 4.509 ± 1.632 | 4.008 ± 2.03 | 1.503 ± 0.895 | 2.505 ± 1.052 | 2.505 ± 1.075 | 1.002 ± 0.647 | 0.501 ± 0.39 | 2.004 ± 1.058 | 3.507 ± 1.449 | 1.503 ± 0.891 | 2.505 ± 0.809 | 0.501 ± 0.42 | 2.004 ± 0.932 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 13 proteins (1997 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
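A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions, not the Proteome-pI code: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names, the one-letter-code input, the fixed seed, and the choice of the population standard deviation are all assumptions.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples.

    Each resample draws len(proteins) whole proteins with replacement.
    Using the population standard deviation is an assumption of this sketch.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)
```

Under this scheme, a dipeptide that is absent from every protein is absent from every resample too, which is consistent with the 0.0 ± 0.0 entries in the table.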

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski