Amino acid dipepetide frequency for Ralstonia phage RSM3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.072AlaAla: 18.072 ± 3.169
1.606AlaCys: 1.606 ± 0.851
5.622AlaAsp: 5.622 ± 1.623
4.819AlaGlu: 4.819 ± 2.35
4.819AlaPhe: 4.819 ± 1.64
9.237AlaGly: 9.237 ± 2.287
1.205AlaHis: 1.205 ± 0.74
7.229AlaIle: 7.229 ± 1.856
5.221AlaLys: 5.221 ± 1.906
11.647AlaLeu: 11.647 ± 1.998
5.622AlaMet: 5.622 ± 1.265
3.614AlaAsn: 3.614 ± 1.472
5.221AlaPro: 5.221 ± 2.748
8.032AlaGln: 8.032 ± 1.26
8.835AlaArg: 8.835 ± 2.427
6.426AlaSer: 6.426 ± 1.49
8.835AlaThr: 8.835 ± 2.243
11.245AlaVal: 11.245 ± 1.679
4.418AlaTrp: 4.418 ± 1.21
2.41AlaTyr: 2.41 ± 0.686
0.0AlaXaa: 0.0 ± 0.0
Cys
1.606CysAla: 1.606 ± 0.747
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.803CysGlu: 0.803 ± 0.624
0.402CysPhe: 0.402 ± 0.385
0.402CysGly: 0.402 ± 0.385
0.803CysHis: 0.803 ± 0.466
1.205CysIle: 1.205 ± 0.534
0.402CysLys: 0.402 ± 0.385
1.205CysLeu: 1.205 ± 0.797
0.803CysMet: 0.803 ± 0.587
0.803CysAsn: 0.803 ± 0.588
1.205CysPro: 1.205 ± 0.707
1.205CysGln: 1.205 ± 0.642
0.0CysArg: 0.0 ± 0.0
1.205CysSer: 1.205 ± 0.736
0.0CysThr: 0.0 ± 0.0
0.803CysVal: 0.803 ± 0.515
0.402CysTrp: 0.402 ± 0.419
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
7.229AspAla: 7.229 ± 1.866
0.0AspCys: 0.0 ± 0.0
2.41AspAsp: 2.41 ± 0.832
3.213AspGlu: 3.213 ± 1.2
1.205AspPhe: 1.205 ± 0.724
3.213AspGly: 3.213 ± 0.793
0.803AspHis: 0.803 ± 0.672
2.008AspIle: 2.008 ± 0.839
0.803AspLys: 0.803 ± 0.565
4.418AspLeu: 4.418 ± 1.639
0.402AspMet: 0.402 ± 0.327
1.205AspAsn: 1.205 ± 0.42
1.205AspPro: 1.205 ± 0.736
2.008AspGln: 2.008 ± 0.707
4.016AspArg: 4.016 ± 1.349
2.811AspSer: 2.811 ± 0.99
2.41AspThr: 2.41 ± 1.181
2.41AspVal: 2.41 ± 0.953
1.205AspTrp: 1.205 ± 0.786
2.008AspTyr: 2.008 ± 0.953
0.0AspXaa: 0.0 ± 0.0
Glu
7.631GluAla: 7.631 ± 2.834
0.803GluCys: 0.803 ± 0.466
2.41GluAsp: 2.41 ± 1.075
2.008GluGlu: 2.008 ± 0.837
3.213GluPhe: 3.213 ± 1.229
2.008GluGly: 2.008 ± 0.951
0.402GluHis: 0.402 ± 0.336
1.205GluIle: 1.205 ± 0.627
1.606GluLys: 1.606 ± 0.944
4.418GluLeu: 4.418 ± 1.832
0.0GluMet: 0.0 ± 0.0
1.606GluAsn: 1.606 ± 0.83
1.205GluPro: 1.205 ± 0.551
2.811GluGln: 2.811 ± 1.317
3.213GluArg: 3.213 ± 1.409
1.205GluSer: 1.205 ± 0.601
2.41GluThr: 2.41 ± 0.66
1.606GluVal: 1.606 ± 0.682
1.205GluTrp: 1.205 ± 0.476
0.803GluTyr: 0.803 ± 0.607
0.0GluXaa: 0.0 ± 0.0
Phe
2.811PheAla: 2.811 ± 1.17
0.0PheCys: 0.0 ± 0.0
1.606PheAsp: 1.606 ± 0.67
1.205PheGlu: 1.205 ± 0.54
4.418PhePhe: 4.418 ± 1.731
3.614PheGly: 3.614 ± 1.322
0.803PheHis: 0.803 ± 0.509
1.205PheIle: 1.205 ± 0.741
0.803PheLys: 0.803 ± 0.504
2.008PheLeu: 2.008 ± 0.85
1.205PheMet: 1.205 ± 0.619
2.008PheAsn: 2.008 ± 1.039
0.402PhePro: 0.402 ± 0.327
0.803PheGln: 0.803 ± 0.615
1.606PheArg: 1.606 ± 0.751
2.41PheSer: 2.41 ± 0.999
2.008PheThr: 2.008 ± 1.389
2.41PheVal: 2.41 ± 0.652
0.803PheTrp: 0.803 ± 0.463
0.803PheTyr: 0.803 ± 0.654
0.0PheXaa: 0.0 ± 0.0
Gly
9.639GlyAla: 9.639 ± 2.16
0.803GlyCys: 0.803 ± 0.654
3.213GlyAsp: 3.213 ± 0.835
2.008GlyGlu: 2.008 ± 1.124
3.614GlyPhe: 3.614 ± 1.693
9.639GlyGly: 9.639 ± 2.195
2.41GlyHis: 2.41 ± 0.683
2.41GlyIle: 2.41 ± 1.017
2.41GlyLys: 2.41 ± 1.213
5.622GlyLeu: 5.622 ± 2.149
2.008GlyMet: 2.008 ± 0.742
2.41GlyAsn: 2.41 ± 1.628
1.606GlyPro: 1.606 ± 0.776
2.008GlyGln: 2.008 ± 0.716
4.016GlyArg: 4.016 ± 1.308
5.221GlySer: 5.221 ± 1.771
4.418GlyThr: 4.418 ± 1.214
11.245GlyVal: 11.245 ± 2.256
1.205GlyTrp: 1.205 ± 0.535
3.614GlyTyr: 3.614 ± 1.26
0.0GlyXaa: 0.0 ± 0.0
His
1.205HisAla: 1.205 ± 0.713
0.402HisCys: 0.402 ± 0.419
0.803HisAsp: 0.803 ± 0.535
2.008HisGlu: 2.008 ± 1.121
1.205HisPhe: 1.205 ± 0.585
1.205HisGly: 1.205 ± 0.618
0.0HisHis: 0.0 ± 0.0
1.205HisIle: 1.205 ± 0.838
0.402HisLys: 0.402 ± 0.336
0.402HisLeu: 0.402 ± 0.44
0.803HisMet: 0.803 ± 0.77
0.803HisAsn: 0.803 ± 0.552
0.803HisPro: 0.803 ± 0.508
0.402HisGln: 0.402 ± 0.336
1.606HisArg: 1.606 ± 0.724
0.402HisSer: 0.402 ± 0.385
0.402HisThr: 0.402 ± 0.419
2.41HisVal: 2.41 ± 0.87
0.402HisTrp: 0.402 ± 0.419
0.803HisTyr: 0.803 ± 0.509
0.0HisXaa: 0.0 ± 0.0
Ile
6.426IleAla: 6.426 ± 1.451
0.0IleCys: 0.0 ± 0.0
1.205IleAsp: 1.205 ± 0.627
3.213IleGlu: 3.213 ± 1.441
1.205IlePhe: 1.205 ± 0.936
4.819IleGly: 4.819 ± 1.749
0.0IleHis: 0.0 ± 0.0
0.803IleIle: 0.803 ± 0.622
2.811IleLys: 2.811 ± 0.92
2.41IleLeu: 2.41 ± 0.927
0.803IleMet: 0.803 ± 0.624
2.811IleAsn: 2.811 ± 0.696
1.205IlePro: 1.205 ± 0.746
2.41IleGln: 2.41 ± 1.004
3.213IleArg: 3.213 ± 1.156
1.606IleSer: 1.606 ± 0.635
2.008IleThr: 2.008 ± 0.66
4.016IleVal: 4.016 ± 0.86
0.0IleTrp: 0.0 ± 0.0
0.402IleTyr: 0.402 ± 0.419
0.0IleXaa: 0.0 ± 0.0
Lys
4.016LysAla: 4.016 ± 1.161
0.402LysCys: 0.402 ± 0.419
2.41LysAsp: 2.41 ± 1.106
0.803LysGlu: 0.803 ± 0.485
0.402LysPhe: 0.402 ± 0.336
3.213LysGly: 3.213 ± 1.143
0.0LysHis: 0.0 ± 0.0
2.008LysIle: 2.008 ± 0.979
1.606LysLys: 1.606 ± 1.315
4.418LysLeu: 4.418 ± 1.5
0.0LysMet: 0.0 ± 0.0
0.0LysAsn: 0.0 ± 0.0
3.213LysPro: 3.213 ± 1.984
2.811LysGln: 2.811 ± 1.063
4.418LysArg: 4.418 ± 1.706
1.205LysSer: 1.205 ± 0.551
4.016LysThr: 4.016 ± 1.035
4.418LysVal: 4.418 ± 1.117
1.205LysTrp: 1.205 ± 0.75
0.803LysTyr: 0.803 ± 0.503
0.0LysXaa: 0.0 ± 0.0
Leu
11.245LeuAla: 11.245 ± 1.938
1.205LeuCys: 1.205 ± 0.955
7.229LeuAsp: 7.229 ± 1.607
3.213LeuGlu: 3.213 ± 1.536
3.213LeuPhe: 3.213 ± 1.615
4.016LeuGly: 4.016 ± 1.771
2.41LeuHis: 2.41 ± 1.077
2.811LeuIle: 2.811 ± 0.782
4.016LeuLys: 4.016 ± 1.442
8.835LeuLeu: 8.835 ± 3.045
1.606LeuMet: 1.606 ± 0.686
2.008LeuAsn: 2.008 ± 0.851
2.811LeuPro: 2.811 ± 1.001
2.811LeuGln: 2.811 ± 1.318
4.016LeuArg: 4.016 ± 1.804
3.213LeuSer: 3.213 ± 1.34
4.819LeuThr: 4.819 ± 1.506
6.426LeuVal: 6.426 ± 1.772
2.008LeuTrp: 2.008 ± 0.771
1.205LeuTyr: 1.205 ± 0.588
0.0LeuXaa: 0.0 ± 0.0
Met
3.614MetAla: 3.614 ± 1.011
0.803MetCys: 0.803 ± 0.503
0.803MetAsp: 0.803 ± 0.612
0.402MetGlu: 0.402 ± 0.419
0.402MetPhe: 0.402 ± 0.401
1.205MetGly: 1.205 ± 0.712
1.205MetHis: 1.205 ± 0.648
1.606MetIle: 1.606 ± 0.683
1.205MetLys: 1.205 ± 0.819
2.008MetLeu: 2.008 ± 0.669
0.402MetMet: 0.402 ± 0.327
1.205MetAsn: 1.205 ± 0.958
0.803MetPro: 0.803 ± 0.363
1.205MetGln: 1.205 ± 0.665
0.803MetArg: 0.803 ± 0.663
3.614MetSer: 3.614 ± 1.113
2.41MetThr: 2.41 ± 1.409
1.205MetVal: 1.205 ± 0.896
0.402MetTrp: 0.402 ± 0.408
0.402MetTyr: 0.402 ± 0.327
0.0MetXaa: 0.0 ± 0.0
Asn
4.418AsnAla: 4.418 ± 1.058
0.402AsnCys: 0.402 ± 0.385
0.803AsnAsp: 0.803 ± 0.485
1.606AsnGlu: 1.606 ± 0.958
1.205AsnPhe: 1.205 ± 0.476
2.41AsnGly: 2.41 ± 0.947
0.0AsnHis: 0.0 ± 0.0
0.803AsnIle: 0.803 ± 0.543
1.606AsnLys: 1.606 ± 0.679
2.41AsnLeu: 2.41 ± 0.992
0.0AsnMet: 0.0 ± 0.0
0.402AsnAsn: 0.402 ± 0.336
3.614AsnPro: 3.614 ± 1.381
1.606AsnGln: 1.606 ± 0.756
1.606AsnArg: 1.606 ± 1.042
2.008AsnSer: 2.008 ± 0.724
3.614AsnThr: 3.614 ± 1.06
3.213AsnVal: 3.213 ± 0.977
0.402AsnTrp: 0.402 ± 0.327
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
7.229ProAla: 7.229 ± 1.443
0.803ProCys: 0.803 ± 0.432
2.41ProAsp: 2.41 ± 0.679
1.606ProGlu: 1.606 ± 1.17
0.0ProPhe: 0.0 ± 0.0
2.811ProGly: 2.811 ± 0.671
0.803ProHis: 0.803 ± 0.508
2.008ProIle: 2.008 ± 0.779
1.205ProLys: 1.205 ± 0.642
1.205ProLeu: 1.205 ± 0.657
0.0ProMet: 0.0 ± 0.0
1.205ProAsn: 1.205 ± 0.709
2.811ProPro: 2.811 ± 0.818
2.811ProGln: 2.811 ± 1.069
1.205ProArg: 1.205 ± 0.464
4.418ProSer: 4.418 ± 1.515
3.213ProThr: 3.213 ± 2.394
6.024ProVal: 6.024 ± 1.474
0.803ProTrp: 0.803 ± 0.502
0.803ProTyr: 0.803 ± 0.504
0.0ProXaa: 0.0 ± 0.0
Gln
5.221GlnAla: 5.221 ± 1.461
1.205GlnCys: 1.205 ± 0.718
1.606GlnAsp: 1.606 ± 0.927
0.803GlnGlu: 0.803 ± 0.815
0.803GlnPhe: 0.803 ± 0.815
2.811GlnGly: 2.811 ± 0.986
0.0GlnHis: 0.0 ± 0.0
2.811GlnIle: 2.811 ± 1.142
3.614GlnLys: 3.614 ± 0.814
3.614GlnLeu: 3.614 ± 1.361
2.008GlnMet: 2.008 ± 0.902
1.205GlnAsn: 1.205 ± 0.631
3.213GlnPro: 3.213 ± 1.277
2.811GlnGln: 2.811 ± 1.273
4.418GlnArg: 4.418 ± 1.609
2.811GlnSer: 2.811 ± 1.048
2.811GlnThr: 2.811 ± 1.101
2.41GlnVal: 2.41 ± 1.045
2.41GlnTrp: 2.41 ± 1.034
0.803GlnTyr: 0.803 ± 0.432
0.0GlnXaa: 0.0 ± 0.0
Arg
6.426ArgAla: 6.426 ± 1.853
1.205ArgCys: 1.205 ± 0.849
5.221ArgAsp: 5.221 ± 1.941
2.41ArgGlu: 2.41 ± 1.201
0.803ArgPhe: 0.803 ± 0.587
4.819ArgGly: 4.819 ± 1.783
1.606ArgHis: 1.606 ± 0.825
2.41ArgIle: 2.41 ± 1.188
1.606ArgLys: 1.606 ± 0.501
4.819ArgLeu: 4.819 ± 1.333
2.811ArgMet: 2.811 ± 1.453
0.803ArgAsn: 0.803 ± 0.547
2.008ArgPro: 2.008 ± 1.174
3.614ArgGln: 3.614 ± 1.033
4.418ArgArg: 4.418 ± 2.038
3.614ArgSer: 3.614 ± 1.618
2.41ArgThr: 2.41 ± 0.981
6.426ArgVal: 6.426 ± 1.409
1.606ArgTrp: 1.606 ± 1.042
0.402ArgTyr: 0.402 ± 0.336
0.0ArgXaa: 0.0 ± 0.0
Ser
8.835SerAla: 8.835 ± 1.173
1.205SerCys: 1.205 ± 0.647
2.41SerAsp: 2.41 ± 0.776
2.41SerGlu: 2.41 ± 0.973
1.205SerPhe: 1.205 ± 0.69
9.639SerGly: 9.639 ± 3.169
0.803SerHis: 0.803 ± 0.672
2.008SerIle: 2.008 ± 0.573
2.41SerLys: 2.41 ± 1.062
4.819SerLeu: 4.819 ± 2.258
2.811SerMet: 2.811 ± 0.911
2.008SerAsn: 2.008 ± 0.914
1.205SerPro: 1.205 ± 0.672
2.811SerGln: 2.811 ± 0.893
3.213SerArg: 3.213 ± 1.671
5.221SerSer: 5.221 ± 1.828
5.221SerThr: 5.221 ± 2.356
4.016SerVal: 4.016 ± 1.214
0.402SerTrp: 0.402 ± 0.327
1.205SerTyr: 1.205 ± 0.789
0.0SerXaa: 0.0 ± 0.0
Thr
7.631ThrAla: 7.631 ± 1.788
0.0ThrCys: 0.0 ± 0.0
2.008ThrAsp: 2.008 ± 1.037
4.016ThrGlu: 4.016 ± 1.151
1.606ThrPhe: 1.606 ± 0.681
4.819ThrGly: 4.819 ± 2.161
1.205ThrHis: 1.205 ± 0.838
2.811ThrIle: 2.811 ± 1.162
2.811ThrLys: 2.811 ± 0.792
4.418ThrLeu: 4.418 ± 1.209
2.008ThrMet: 2.008 ± 0.669
2.811ThrAsn: 2.811 ± 1.5
4.016ThrPro: 4.016 ± 1.429
2.41ThrGln: 2.41 ± 0.899
2.008ThrArg: 2.008 ± 0.481
3.213ThrSer: 3.213 ± 1.45
10.442ThrThr: 10.442 ± 6.022
5.221ThrVal: 5.221 ± 1.505
0.803ThrTrp: 0.803 ± 0.502
1.205ThrTyr: 1.205 ± 0.763
0.0ThrXaa: 0.0 ± 0.0
Val
16.064ValAla: 16.064 ± 2.314
1.205ValCys: 1.205 ± 0.692
0.803ValAsp: 0.803 ± 0.363
3.213ValGlu: 3.213 ± 1.499
2.41ValPhe: 2.41 ± 1.347
7.229ValGly: 7.229 ± 1.92
2.008ValHis: 2.008 ± 0.985
3.213ValIle: 3.213 ± 1.004
3.614ValLys: 3.614 ± 1.014
6.024ValLeu: 6.024 ± 1.336
1.205ValMet: 1.205 ± 0.596
2.41ValAsn: 2.41 ± 0.922
4.819ValPro: 4.819 ± 1.117
2.811ValGln: 2.811 ± 0.812
4.418ValArg: 4.418 ± 1.314
10.442ValSer: 10.442 ± 1.149
2.41ValThr: 2.41 ± 0.908
12.851ValVal: 12.851 ± 2.945
2.008ValTrp: 2.008 ± 1.053
2.41ValTyr: 2.41 ± 0.76
0.0ValXaa: 0.0 ± 0.0
Trp
2.41TrpAla: 2.41 ± 0.733
0.0TrpCys: 0.0 ± 0.0
0.803TrpAsp: 0.803 ± 0.547
1.205TrpGlu: 1.205 ± 0.627
0.0TrpPhe: 0.0 ± 0.0
0.402TrpGly: 0.402 ± 0.336
0.803TrpHis: 0.803 ± 0.485
0.0TrpIle: 0.0 ± 0.0
1.205TrpLys: 1.205 ± 0.674
3.213TrpLeu: 3.213 ± 1.11
0.803TrpMet: 0.803 ± 0.558
1.606TrpAsn: 1.606 ± 0.67
1.606TrpPro: 1.606 ± 0.687
1.205TrpGln: 1.205 ± 0.522
1.606TrpArg: 1.606 ± 0.763
0.803TrpSer: 0.803 ± 0.463
1.205TrpThr: 1.205 ± 0.583
1.205TrpVal: 1.205 ± 0.473
0.0TrpTrp: 0.0 ± 0.0
2.41TrpTyr: 2.41 ± 0.851
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.41TyrAla: 2.41 ± 0.839
1.205TyrCys: 1.205 ± 0.851
1.205TyrAsp: 1.205 ± 0.746
0.803TyrGlu: 0.803 ± 0.502
0.803TyrPhe: 0.803 ± 0.485
2.008TyrGly: 2.008 ± 0.573
0.402TyrHis: 0.402 ± 0.419
1.606TyrIle: 1.606 ± 1.094
2.008TyrLys: 2.008 ± 0.623
1.205TyrLeu: 1.205 ± 0.504
0.0TyrMet: 0.0 ± 0.0
1.205TyrAsn: 1.205 ± 0.642
0.402TyrPro: 0.402 ± 0.44
0.803TyrGln: 0.803 ± 0.363
1.205TyrArg: 1.205 ± 0.565
2.008TyrSer: 2.008 ± 0.792
0.402TyrThr: 0.402 ± 0.419
2.008TyrVal: 2.008 ± 0.719
0.803TyrTrp: 0.803 ± 0.624
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (2491 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski