Amino acid dipepetide frequency for Xanthomonas phage Xf409

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.444AlaAla: 15.444 ± 2.566
1.931AlaCys: 1.931 ± 0.767
6.95AlaAsp: 6.95 ± 1.039
4.633AlaGlu: 4.633 ± 1.062
4.633AlaPhe: 4.633 ± 1.87
7.722AlaGly: 7.722 ± 1.429
1.158AlaHis: 1.158 ± 0.539
5.792AlaIle: 5.792 ± 1.875
6.178AlaLys: 6.178 ± 1.404
15.83AlaLeu: 15.83 ± 4.267
5.792AlaMet: 5.792 ± 0.985
2.317AlaAsn: 2.317 ± 0.905
3.475AlaPro: 3.475 ± 0.968
5.792AlaGln: 5.792 ± 1.544
9.653AlaArg: 9.653 ± 1.756
6.564AlaSer: 6.564 ± 0.795
4.247AlaThr: 4.247 ± 1.052
6.564AlaVal: 6.564 ± 2.467
4.247AlaTrp: 4.247 ± 1.147
4.633AlaTyr: 4.633 ± 1.515
0.0AlaXaa: 0.0 ± 0.0
Cys
3.475CysAla: 3.475 ± 1.295
0.0CysCys: 0.0 ± 0.0
1.931CysAsp: 1.931 ± 1.511
0.772CysGlu: 0.772 ± 0.423
0.0CysPhe: 0.0 ± 0.0
1.544CysGly: 1.544 ± 0.599
0.0CysHis: 0.0 ± 0.0
1.158CysIle: 1.158 ± 0.618
0.772CysLys: 0.772 ± 0.604
1.544CysLeu: 1.544 ± 0.837
1.158CysMet: 1.158 ± 0.435
0.772CysAsn: 0.772 ± 0.604
1.931CysPro: 1.931 ± 1.511
0.386CysGln: 0.386 ± 0.302
1.544CysArg: 1.544 ± 0.521
2.317CysSer: 2.317 ± 0.98
1.158CysThr: 1.158 ± 0.57
1.158CysVal: 1.158 ± 0.607
0.772CysTrp: 0.772 ± 0.537
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
8.494AspAla: 8.494 ± 1.063
1.931AspCys: 1.931 ± 1.256
2.317AspAsp: 2.317 ± 0.83
1.931AspGlu: 1.931 ± 0.735
1.544AspPhe: 1.544 ± 0.637
8.494AspGly: 8.494 ± 2.636
0.772AspHis: 0.772 ± 0.595
1.158AspIle: 1.158 ± 0.468
1.931AspLys: 1.931 ± 0.579
3.861AspLeu: 3.861 ± 1.028
0.0AspMet: 0.0 ± 0.0
0.772AspAsn: 0.772 ± 0.571
3.861AspPro: 3.861 ± 0.867
2.703AspGln: 2.703 ± 0.836
2.317AspArg: 2.317 ± 0.585
1.544AspSer: 1.544 ± 0.599
3.089AspThr: 3.089 ± 1.045
3.861AspVal: 3.861 ± 0.917
0.772AspTrp: 0.772 ± 0.423
1.544AspTyr: 1.544 ± 0.848
0.0AspXaa: 0.0 ± 0.0
Glu
5.405GluAla: 5.405 ± 1.352
0.386GluCys: 0.386 ± 0.302
0.772GluAsp: 0.772 ± 0.477
1.158GluGlu: 1.158 ± 0.718
1.158GluPhe: 1.158 ± 0.644
2.703GluGly: 2.703 ± 1.363
0.386GluHis: 0.386 ± 0.327
1.931GluIle: 1.931 ± 0.72
2.317GluLys: 2.317 ± 0.992
5.792GluLeu: 5.792 ± 1.403
0.772GluMet: 0.772 ± 0.524
0.772GluAsn: 0.772 ± 0.885
2.703GluPro: 2.703 ± 1.291
1.931GluGln: 1.931 ± 0.799
3.089GluArg: 3.089 ± 1.774
3.089GluSer: 3.089 ± 0.918
0.772GluThr: 0.772 ± 0.423
1.158GluVal: 1.158 ± 0.553
0.386GluTrp: 0.386 ± 0.442
1.158GluTyr: 1.158 ± 0.567
0.0GluXaa: 0.0 ± 0.0
Phe
3.475PheAla: 3.475 ± 1.02
0.772PheCys: 0.772 ± 0.384
2.703PheAsp: 2.703 ± 0.822
0.386PheGlu: 0.386 ± 0.302
0.772PhePhe: 0.772 ± 0.512
6.95PheGly: 6.95 ± 1.528
0.386PheHis: 0.386 ± 0.367
0.772PheIle: 0.772 ± 0.471
1.931PheLys: 1.931 ± 0.999
1.931PheLeu: 1.931 ± 0.962
0.0PheMet: 0.0 ± 0.0
0.386PheAsn: 0.386 ± 0.287
2.317PhePro: 2.317 ± 0.887
0.772PheGln: 0.772 ± 0.544
4.247PheArg: 4.247 ± 1.202
1.158PheSer: 1.158 ± 0.587
1.544PheThr: 1.544 ± 0.758
2.317PheVal: 2.317 ± 1.093
0.386PheTrp: 0.386 ± 0.287
0.772PheTyr: 0.772 ± 0.526
0.0PheXaa: 0.0 ± 0.0
Gly
7.336GlyAla: 7.336 ± 1.186
1.931GlyCys: 1.931 ± 1.511
5.019GlyAsp: 5.019 ± 1.923
5.019GlyGlu: 5.019 ± 1.401
3.089GlyPhe: 3.089 ± 0.973
9.653GlyGly: 9.653 ± 2.17
1.931GlyHis: 1.931 ± 0.619
3.861GlyIle: 3.861 ± 0.694
3.861GlyLys: 3.861 ± 1.202
8.494GlyLeu: 8.494 ± 1.795
1.931GlyMet: 1.931 ± 0.927
2.703GlyAsn: 2.703 ± 0.96
2.317GlyPro: 2.317 ± 0.747
5.792GlyGln: 5.792 ± 2.54
6.95GlyArg: 6.95 ± 1.593
7.722GlySer: 7.722 ± 1.387
7.722GlyThr: 7.722 ± 2.321
4.247GlyVal: 4.247 ± 1.139
2.703GlyTrp: 2.703 ± 0.76
4.633GlyTyr: 4.633 ± 1.941
0.0GlyXaa: 0.0 ± 0.0
His
1.931HisAla: 1.931 ± 0.885
0.0HisCys: 0.0 ± 0.0
1.158HisAsp: 1.158 ± 0.462
0.386HisGlu: 0.386 ± 0.287
1.158HisPhe: 1.158 ± 0.501
1.158HisGly: 1.158 ± 0.665
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.772HisLys: 0.772 ± 0.422
1.158HisLeu: 1.158 ± 0.709
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.158HisPro: 1.158 ± 0.578
0.386HisGln: 0.386 ± 0.353
1.931HisArg: 1.931 ± 1.094
0.386HisSer: 0.386 ± 0.444
1.158HisThr: 1.158 ± 0.718
1.931HisVal: 1.931 ± 0.896
0.386HisTrp: 0.386 ± 0.367
1.158HisTyr: 1.158 ± 0.861
0.0HisXaa: 0.0 ± 0.0
Ile
5.792IleAla: 5.792 ± 1.131
1.158IleCys: 1.158 ± 0.553
3.475IleAsp: 3.475 ± 0.846
1.544IleGlu: 1.544 ± 0.668
1.931IlePhe: 1.931 ± 0.937
4.247IleGly: 4.247 ± 1.17
0.386IleHis: 0.386 ± 0.442
0.772IleIle: 0.772 ± 0.584
1.544IleLys: 1.544 ± 0.844
1.158IleLeu: 1.158 ± 0.666
1.544IleMet: 1.544 ± 0.917
0.772IleAsn: 0.772 ± 0.59
1.931IlePro: 1.931 ± 0.703
1.931IleGln: 1.931 ± 0.741
3.475IleArg: 3.475 ± 0.913
0.772IleSer: 0.772 ± 0.566
2.703IleThr: 2.703 ± 0.818
1.158IleVal: 1.158 ± 0.572
0.386IleTrp: 0.386 ± 0.512
0.386IleTyr: 0.386 ± 0.442
0.0IleXaa: 0.0 ± 0.0
Lys
3.861LysAla: 3.861 ± 1.006
0.772LysCys: 0.772 ± 0.604
3.089LysAsp: 3.089 ± 0.864
0.386LysGlu: 0.386 ± 0.327
1.544LysPhe: 1.544 ± 0.587
2.703LysGly: 2.703 ± 0.919
1.158LysHis: 1.158 ± 0.688
0.772LysIle: 0.772 ± 0.607
1.544LysLys: 1.544 ± 0.784
1.544LysLeu: 1.544 ± 1.016
0.772LysMet: 0.772 ± 0.448
1.158LysAsn: 1.158 ± 0.465
1.931LysPro: 1.931 ± 0.721
0.772LysGln: 0.772 ± 0.512
4.633LysArg: 4.633 ± 1.088
3.475LysSer: 3.475 ± 0.708
2.317LysThr: 2.317 ± 0.95
3.089LysVal: 3.089 ± 0.997
1.544LysTrp: 1.544 ± 0.852
0.772LysTyr: 0.772 ± 0.468
0.0LysXaa: 0.0 ± 0.0
Leu
13.127LeuAla: 13.127 ± 2.171
1.931LeuCys: 1.931 ± 0.97
5.019LeuAsp: 5.019 ± 1.281
3.089LeuGlu: 3.089 ± 1.121
2.317LeuPhe: 2.317 ± 1.377
6.178LeuGly: 6.178 ± 1.267
2.317LeuHis: 2.317 ± 0.71
3.475LeuIle: 3.475 ± 1.023
1.931LeuLys: 1.931 ± 0.771
5.792LeuLeu: 5.792 ± 1.909
2.317LeuMet: 2.317 ± 0.491
1.158LeuAsn: 1.158 ± 0.709
5.019LeuPro: 5.019 ± 1.481
3.089LeuGln: 3.089 ± 0.603
6.178LeuArg: 6.178 ± 1.315
3.475LeuSer: 3.475 ± 1.28
5.405LeuThr: 5.405 ± 2.063
8.88LeuVal: 8.88 ± 2.289
1.931LeuTrp: 1.931 ± 0.918
1.931LeuTyr: 1.931 ± 0.82
0.0LeuXaa: 0.0 ± 0.0
Met
3.475MetAla: 3.475 ± 1.003
0.386MetCys: 0.386 ± 0.302
0.386MetAsp: 0.386 ± 0.353
0.772MetGlu: 0.772 ± 0.527
0.0MetPhe: 0.0 ± 0.0
1.158MetGly: 1.158 ± 0.679
0.0MetHis: 0.0 ± 0.0
0.772MetIle: 0.772 ± 0.512
0.386MetLys: 0.386 ± 0.389
1.931MetLeu: 1.931 ± 0.69
1.544MetMet: 1.544 ± 0.871
0.0MetAsn: 0.0 ± 0.0
2.317MetPro: 2.317 ± 0.666
1.931MetGln: 1.931 ± 0.74
1.544MetArg: 1.544 ± 0.768
3.089MetSer: 3.089 ± 0.857
3.089MetThr: 3.089 ± 0.858
1.931MetVal: 1.931 ± 0.961
0.0MetTrp: 0.0 ± 0.0
0.772MetTyr: 0.772 ± 0.486
0.0MetXaa: 0.0 ± 0.0
Asn
3.089AsnAla: 3.089 ± 0.951
0.772AsnCys: 0.772 ± 0.384
1.544AsnAsp: 1.544 ± 0.635
1.544AsnGlu: 1.544 ± 0.642
0.386AsnPhe: 0.386 ± 0.376
5.019AsnGly: 5.019 ± 1.307
0.386AsnHis: 0.386 ± 0.353
0.386AsnIle: 0.386 ± 0.302
1.158AsnLys: 1.158 ± 0.591
0.386AsnLeu: 0.386 ± 0.367
0.0AsnMet: 0.0 ± 0.0
1.544AsnAsn: 1.544 ± 0.784
1.158AsnPro: 1.158 ± 0.434
0.386AsnGln: 0.386 ± 0.327
1.158AsnArg: 1.158 ± 0.703
0.772AsnSer: 0.772 ± 0.604
0.772AsnThr: 0.772 ± 0.518
1.544AsnVal: 1.544 ± 0.734
0.0AsnTrp: 0.0 ± 0.0
0.772AsnTyr: 0.772 ± 0.498
0.0AsnXaa: 0.0 ± 0.0
Pro
4.633ProAla: 4.633 ± 1.329
0.772ProCys: 0.772 ± 0.562
3.089ProAsp: 3.089 ± 1.415
2.703ProGlu: 2.703 ± 1.061
1.158ProPhe: 1.158 ± 0.558
4.247ProGly: 4.247 ± 1.147
0.772ProHis: 0.772 ± 0.574
1.544ProIle: 1.544 ± 0.855
2.703ProLys: 2.703 ± 0.725
2.703ProLeu: 2.703 ± 1.298
1.158ProMet: 1.158 ± 0.548
1.931ProAsn: 1.931 ± 0.765
2.317ProPro: 2.317 ± 1.096
1.931ProGln: 1.931 ± 0.959
5.019ProArg: 5.019 ± 1.701
3.475ProSer: 3.475 ± 0.993
4.633ProThr: 4.633 ± 1.111
3.861ProVal: 3.861 ± 1.499
2.317ProTrp: 2.317 ± 0.597
1.158ProTyr: 1.158 ± 0.587
0.0ProXaa: 0.0 ± 0.0
Gln
2.703GlnAla: 2.703 ± 0.801
0.772GlnCys: 0.772 ± 0.394
0.772GlnAsp: 0.772 ± 0.385
1.931GlnGlu: 1.931 ± 0.647
1.158GlnPhe: 1.158 ± 0.679
5.019GlnGly: 5.019 ± 1.569
1.158GlnHis: 1.158 ± 0.488
1.544GlnIle: 1.544 ± 0.599
0.772GlnLys: 0.772 ± 0.512
3.861GlnLeu: 3.861 ± 1.286
0.0GlnMet: 0.0 ± 0.0
1.158GlnAsn: 1.158 ± 0.587
4.633GlnPro: 4.633 ± 1.224
3.475GlnGln: 3.475 ± 2.152
3.475GlnArg: 3.475 ± 1.303
1.931GlnSer: 1.931 ± 0.704
1.931GlnThr: 1.931 ± 0.743
1.544GlnVal: 1.544 ± 0.725
1.931GlnTrp: 1.931 ± 1.018
0.772GlnTyr: 0.772 ± 0.518
0.0GlnXaa: 0.0 ± 0.0
Arg
7.722ArgAla: 7.722 ± 1.992
1.158ArgCys: 1.158 ± 0.665
5.792ArgAsp: 5.792 ± 1.494
5.019ArgGlu: 5.019 ± 1.706
1.931ArgPhe: 1.931 ± 0.711
7.722ArgGly: 7.722 ± 0.994
0.772ArgHis: 0.772 ± 0.46
4.247ArgIle: 4.247 ± 0.851
3.089ArgLys: 3.089 ± 1.219
8.88ArgLeu: 8.88 ± 1.275
1.158ArgMet: 1.158 ± 0.665
2.703ArgAsn: 2.703 ± 0.99
2.703ArgPro: 2.703 ± 0.983
1.158ArgGln: 1.158 ± 0.981
9.266ArgArg: 9.266 ± 3.071
6.178ArgSer: 6.178 ± 2.519
3.089ArgThr: 3.089 ± 1.046
3.861ArgVal: 3.861 ± 1.247
2.317ArgTrp: 2.317 ± 0.856
1.931ArgTyr: 1.931 ± 0.971
0.0ArgXaa: 0.0 ± 0.0
Ser
12.355SerAla: 12.355 ± 2.986
3.089SerCys: 3.089 ± 1.428
3.861SerAsp: 3.861 ± 1.177
1.544SerGlu: 1.544 ± 0.644
2.317SerPhe: 2.317 ± 1.113
5.792SerGly: 5.792 ± 1.746
1.544SerHis: 1.544 ± 0.607
1.931SerIle: 1.931 ± 0.696
2.317SerLys: 2.317 ± 0.92
3.861SerLeu: 3.861 ± 0.724
1.158SerMet: 1.158 ± 0.772
0.386SerAsn: 0.386 ± 0.353
4.247SerPro: 4.247 ± 1.08
1.931SerGln: 1.931 ± 1.216
3.089SerArg: 3.089 ± 0.748
3.475SerSer: 3.475 ± 0.99
2.317SerThr: 2.317 ± 1.083
3.475SerVal: 3.475 ± 1.394
1.158SerTrp: 1.158 ± 0.935
0.772SerTyr: 0.772 ± 0.575
0.0SerXaa: 0.0 ± 0.0
Thr
8.88ThrAla: 8.88 ± 1.7
3.089ThrCys: 3.089 ± 1.035
1.158ThrAsp: 1.158 ± 0.686
2.317ThrGlu: 2.317 ± 1.047
1.544ThrPhe: 1.544 ± 0.743
5.405ThrGly: 5.405 ± 1.905
1.158ThrHis: 1.158 ± 0.473
2.703ThrIle: 2.703 ± 0.766
0.772ThrLys: 0.772 ± 0.654
3.089ThrLeu: 3.089 ± 0.848
1.931ThrMet: 1.931 ± 0.646
0.772ThrAsn: 0.772 ± 0.489
2.703ThrPro: 2.703 ± 1.767
3.089ThrGln: 3.089 ± 1.133
4.633ThrArg: 4.633 ± 1.185
2.317ThrSer: 2.317 ± 0.917
3.089ThrThr: 3.089 ± 1.919
3.475ThrVal: 3.475 ± 1.112
0.772ThrTrp: 0.772 ± 0.478
1.158ThrTyr: 1.158 ± 0.468
0.0ThrXaa: 0.0 ± 0.0
Val
7.336ValAla: 7.336 ± 1.509
1.158ValCys: 1.158 ± 0.687
3.089ValAsp: 3.089 ± 0.891
1.158ValGlu: 1.158 ± 0.476
3.089ValPhe: 3.089 ± 0.97
7.722ValGly: 7.722 ± 1.758
1.544ValHis: 1.544 ± 0.645
2.317ValIle: 2.317 ± 0.896
0.772ValLys: 0.772 ± 0.566
8.88ValLeu: 8.88 ± 3.868
2.703ValMet: 2.703 ± 0.698
0.386ValAsn: 0.386 ± 0.302
2.703ValPro: 2.703 ± 0.819
1.544ValGln: 1.544 ± 0.882
3.861ValArg: 3.861 ± 1.125
5.405ValSer: 5.405 ± 2.117
1.931ValThr: 1.931 ± 0.821
6.178ValVal: 6.178 ± 1.75
3.089ValTrp: 3.089 ± 1.12
0.772ValTyr: 0.772 ± 0.532
0.0ValXaa: 0.0 ± 0.0
Trp
2.317TrpAla: 2.317 ± 1.397
0.386TrpCys: 0.386 ± 0.302
0.0TrpAsp: 0.0 ± 0.0
0.772TrpGlu: 0.772 ± 0.422
2.317TrpPhe: 2.317 ± 1.161
1.544TrpGly: 1.544 ± 0.725
0.0TrpHis: 0.0 ± 0.0
1.544TrpIle: 1.544 ± 0.943
1.931TrpLys: 1.931 ± 0.961
2.317TrpLeu: 2.317 ± 1.178
1.158TrpMet: 1.158 ± 0.474
1.544TrpAsn: 1.544 ± 0.612
1.544TrpPro: 1.544 ± 0.701
0.772TrpGln: 0.772 ± 0.562
1.544TrpArg: 1.544 ± 0.641
1.544TrpSer: 1.544 ± 0.658
1.544TrpThr: 1.544 ± 0.745
1.544TrpVal: 1.544 ± 0.741
1.544TrpTrp: 1.544 ± 0.7
1.544TrpTyr: 1.544 ± 0.447
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.703TyrAla: 2.703 ± 0.938
0.0TyrCys: 0.0 ± 0.0
0.772TyrAsp: 0.772 ± 0.498
0.772TyrGlu: 0.772 ± 0.512
1.931TyrPhe: 1.931 ± 0.914
2.317TyrGly: 2.317 ± 0.763
0.386TyrHis: 0.386 ± 0.287
0.386TyrIle: 0.386 ± 0.389
1.158TyrLys: 1.158 ± 0.543
1.544TyrLeu: 1.544 ± 0.599
0.0TyrMet: 0.0 ± 0.0
1.544TyrAsn: 1.544 ± 0.752
1.158TyrPro: 1.158 ± 0.528
0.772TyrGln: 0.772 ± 0.523
3.089TyrArg: 3.089 ± 0.901
1.931TyrSer: 1.931 ± 0.952
1.544TyrThr: 1.544 ± 0.651
3.861TyrVal: 3.861 ± 1.197
0.772TyrTrp: 0.772 ± 0.544
0.772TyrTyr: 0.772 ± 0.415
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 14 proteins (2591 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski