Amino acid dipepetide frequency for Ralstonia phage RS611

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
22.031AlaAla: 22.031 ± 4.207
4.299AlaCys: 4.299 ± 0.994
6.985AlaAsp: 6.985 ± 1.752
6.448AlaGlu: 6.448 ± 2.343
2.687AlaPhe: 2.687 ± 1.275
13.971AlaGly: 13.971 ± 3.188
0.537AlaHis: 0.537 ± 0.579
4.299AlaIle: 4.299 ± 1.909
3.761AlaLys: 3.761 ± 1.034
10.747AlaLeu: 10.747 ± 2.047
5.373AlaMet: 5.373 ± 1.43
2.687AlaAsn: 2.687 ± 0.773
5.911AlaPro: 5.911 ± 1.914
5.911AlaGln: 5.911 ± 2.15
5.373AlaArg: 5.373 ± 2.304
5.911AlaSer: 5.911 ± 2.965
12.359AlaThr: 12.359 ± 4.855
11.284AlaVal: 11.284 ± 2.219
1.612AlaTrp: 1.612 ± 1.003
4.299AlaTyr: 4.299 ± 0.906
0.0AlaXaa: 0.0 ± 0.0
Cys
1.075CysAla: 1.075 ± 0.48
0.537CysCys: 0.537 ± 0.753
1.612CysAsp: 1.612 ± 0.67
1.075CysGlu: 1.075 ± 1.036
0.537CysPhe: 0.537 ± 0.634
0.537CysGly: 0.537 ± 0.475
0.537CysHis: 0.537 ± 0.433
0.537CysIle: 0.537 ± 0.433
0.537CysLys: 0.537 ± 0.508
1.612CysLeu: 1.612 ± 0.914
0.537CysMet: 0.537 ± 0.486
0.0CysAsn: 0.0 ± 0.0
1.075CysPro: 1.075 ± 0.674
0.537CysGln: 0.537 ± 0.475
1.075CysArg: 1.075 ± 0.519
1.075CysSer: 1.075 ± 0.866
3.224CysThr: 3.224 ± 0.815
2.149CysVal: 2.149 ± 0.947
0.537CysTrp: 0.537 ± 0.433
0.537CysTyr: 0.537 ± 0.391
0.0CysXaa: 0.0 ± 0.0
Asp
5.373AspAla: 5.373 ± 1.196
0.0AspCys: 0.0 ± 0.0
1.612AspAsp: 1.612 ± 0.743
1.075AspGlu: 1.075 ± 0.519
1.612AspPhe: 1.612 ± 1.2
6.448AspGly: 6.448 ± 2.632
1.075AspHis: 1.075 ± 0.828
0.537AspIle: 0.537 ± 0.475
0.537AspLys: 0.537 ± 0.634
5.373AspLeu: 5.373 ± 2.34
2.149AspMet: 2.149 ± 0.882
1.075AspAsn: 1.075 ± 0.674
4.836AspPro: 4.836 ± 1.319
2.149AspGln: 2.149 ± 0.78
2.149AspArg: 2.149 ± 1.079
2.687AspSer: 2.687 ± 0.694
2.687AspThr: 2.687 ± 1.134
3.224AspVal: 3.224 ± 1.289
2.687AspTrp: 2.687 ± 1.215
0.537AspTyr: 0.537 ± 0.475
0.0AspXaa: 0.0 ± 0.0
Glu
2.687GluAla: 2.687 ± 1.006
2.149GluCys: 2.149 ± 0.763
1.612GluAsp: 1.612 ± 0.697
2.149GluGlu: 2.149 ± 0.91
1.612GluPhe: 1.612 ± 1.014
2.149GluGly: 2.149 ± 0.853
0.537GluHis: 0.537 ± 0.475
2.687GluIle: 2.687 ± 1.215
2.687GluLys: 2.687 ± 0.642
3.761GluLeu: 3.761 ± 1.625
1.612GluMet: 1.612 ± 1.07
1.075GluAsn: 1.075 ± 0.572
2.149GluPro: 2.149 ± 1.106
2.687GluGln: 2.687 ± 0.812
4.299GluArg: 4.299 ± 2.202
2.687GluSer: 2.687 ± 1.223
3.761GluThr: 3.761 ± 1.011
3.224GluVal: 3.224 ± 0.987
1.612GluTrp: 1.612 ± 0.787
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.612PheAla: 1.612 ± 0.67
1.075PheCys: 1.075 ± 0.717
0.537PheAsp: 0.537 ± 0.391
1.075PheGlu: 1.075 ± 0.519
0.0PhePhe: 0.0 ± 0.0
1.612PheGly: 1.612 ± 0.916
0.537PheHis: 0.537 ± 0.475
2.149PheIle: 2.149 ± 0.798
2.149PheLys: 2.149 ± 1.342
2.687PheLeu: 2.687 ± 1.228
0.537PheMet: 0.537 ± 0.475
1.612PheAsn: 1.612 ± 0.44
1.612PhePro: 1.612 ± 0.942
0.0PheGln: 0.0 ± 0.0
2.149PheArg: 2.149 ± 0.729
1.612PheSer: 1.612 ± 0.994
1.612PheThr: 1.612 ± 0.821
2.687PheVal: 2.687 ± 1.308
0.537PheTrp: 0.537 ± 0.746
0.537PheTyr: 0.537 ± 0.433
0.0PheXaa: 0.0 ± 0.0
Gly
12.896GlyAla: 12.896 ± 2.619
1.612GlyCys: 1.612 ± 0.885
2.687GlyAsp: 2.687 ± 1.084
4.299GlyGlu: 4.299 ± 1.764
2.149GlyPhe: 2.149 ± 0.994
6.985GlyGly: 6.985 ± 1.548
0.537GlyHis: 0.537 ± 0.433
3.761GlyIle: 3.761 ± 1.382
5.373GlyLys: 5.373 ± 1.776
6.448GlyLeu: 6.448 ± 2.302
3.761GlyMet: 3.761 ± 1.406
3.224GlyAsn: 3.224 ± 1.575
3.224GlyPro: 3.224 ± 0.913
3.761GlyGln: 3.761 ± 1.018
6.985GlyArg: 6.985 ± 2.252
3.761GlySer: 3.761 ± 1.321
6.448GlyThr: 6.448 ± 1.328
4.836GlyVal: 4.836 ± 0.938
2.687GlyTrp: 2.687 ± 1.69
1.075GlyTyr: 1.075 ± 0.866
0.0GlyXaa: 0.0 ± 0.0
His
0.537HisAla: 0.537 ± 0.579
0.0HisCys: 0.0 ± 0.0
0.537HisAsp: 0.537 ± 0.475
1.612HisGlu: 1.612 ± 0.692
0.0HisPhe: 0.0 ± 0.0
1.612HisGly: 1.612 ± 0.914
0.0HisHis: 0.0 ± 0.0
1.612HisIle: 1.612 ± 0.787
0.537HisLys: 0.537 ± 0.475
1.612HisLeu: 1.612 ± 0.598
0.0HisMet: 0.0 ± 0.0
0.537HisAsn: 0.537 ± 0.634
0.537HisPro: 0.537 ± 0.753
0.0HisGln: 0.0 ± 0.0
1.075HisArg: 1.075 ± 1.506
1.075HisSer: 1.075 ± 1.016
0.537HisThr: 0.537 ± 0.391
2.149HisVal: 2.149 ± 1.039
0.0HisTrp: 0.0 ± 0.0
1.075HisTyr: 1.075 ± 0.637
0.0HisXaa: 0.0 ± 0.0
Ile
7.523IleAla: 7.523 ± 1.235
0.0IleCys: 0.0 ± 0.0
1.075IleAsp: 1.075 ± 1.286
2.149IleGlu: 2.149 ± 0.91
0.537IlePhe: 0.537 ± 0.746
3.761IleGly: 3.761 ± 1.263
1.075IleHis: 1.075 ± 0.695
3.224IleIle: 3.224 ± 1.4
2.687IleLys: 2.687 ± 0.977
2.687IleLeu: 2.687 ± 1.067
0.537IleMet: 0.537 ± 0.508
1.075IleAsn: 1.075 ± 0.63
4.299IlePro: 4.299 ± 1.405
1.075IleGln: 1.075 ± 0.51
3.224IleArg: 3.224 ± 1.256
2.149IleSer: 2.149 ± 0.668
3.761IleThr: 3.761 ± 2.244
3.761IleVal: 3.761 ± 0.994
0.0IleTrp: 0.0 ± 0.0
0.537IleTyr: 0.537 ± 0.634
0.0IleXaa: 0.0 ± 0.0
Lys
7.523LysAla: 7.523 ± 1.635
1.075LysCys: 1.075 ± 0.695
1.612LysAsp: 1.612 ± 0.908
0.537LysGlu: 0.537 ± 0.433
1.075LysPhe: 1.075 ± 0.51
4.299LysGly: 4.299 ± 1.524
0.537LysHis: 0.537 ± 0.475
1.612LysIle: 1.612 ± 0.869
2.149LysLys: 2.149 ± 1.111
3.224LysLeu: 3.224 ± 0.994
1.075LysMet: 1.075 ± 0.717
1.612LysAsn: 1.612 ± 1.031
4.299LysPro: 4.299 ± 1.251
1.612LysGln: 1.612 ± 0.969
4.299LysArg: 4.299 ± 0.788
3.761LysSer: 3.761 ± 1.511
3.224LysThr: 3.224 ± 1.078
2.687LysVal: 2.687 ± 1.004
0.537LysTrp: 0.537 ± 0.391
1.612LysTyr: 1.612 ± 0.732
0.0LysXaa: 0.0 ± 0.0
Leu
12.359LeuAla: 12.359 ± 3.306
1.075LeuCys: 1.075 ± 0.637
4.299LeuAsp: 4.299 ± 1.306
3.224LeuGlu: 3.224 ± 1.899
1.612LeuPhe: 1.612 ± 0.989
5.373LeuGly: 5.373 ± 1.688
2.149LeuHis: 2.149 ± 0.982
5.373LeuIle: 5.373 ± 1.525
5.373LeuLys: 5.373 ± 1.101
5.911LeuLeu: 5.911 ± 2.251
2.149LeuMet: 2.149 ± 1.159
0.537LeuAsn: 0.537 ± 0.508
5.373LeuPro: 5.373 ± 1.654
2.149LeuGln: 2.149 ± 1.224
6.448LeuArg: 6.448 ± 1.412
3.224LeuSer: 3.224 ± 1.46
4.836LeuThr: 4.836 ± 1.946
5.911LeuVal: 5.911 ± 1.716
1.075LeuTrp: 1.075 ± 0.954
1.075LeuTyr: 1.075 ± 0.519
0.0LeuXaa: 0.0 ± 0.0
Met
5.373MetAla: 5.373 ± 1.758
0.0MetCys: 0.0 ± 0.0
1.612MetAsp: 1.612 ± 0.869
2.687MetGlu: 2.687 ± 1.368
1.075MetPhe: 1.075 ± 0.68
1.612MetGly: 1.612 ± 0.725
1.075MetHis: 1.075 ± 0.892
1.075MetIle: 1.075 ± 0.931
2.687MetLys: 2.687 ± 0.792
2.687MetLeu: 2.687 ± 0.832
1.612MetMet: 1.612 ± 1.027
0.0MetAsn: 0.0 ± 0.0
2.149MetPro: 2.149 ± 1.262
1.075MetGln: 1.075 ± 0.828
2.687MetArg: 2.687 ± 1.071
1.612MetSer: 1.612 ± 0.732
1.612MetThr: 1.612 ± 0.884
1.612MetVal: 1.612 ± 0.908
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.149AsnAla: 2.149 ± 1.115
0.0AsnCys: 0.0 ± 0.0
1.075AsnAsp: 1.075 ± 0.637
0.537AsnGlu: 0.537 ± 0.508
1.075AsnPhe: 1.075 ± 0.637
4.299AsnGly: 4.299 ± 1.926
0.0AsnHis: 0.0 ± 0.0
1.075AsnIle: 1.075 ± 0.866
1.075AsnLys: 1.075 ± 0.637
0.537AsnLeu: 0.537 ± 0.433
1.075AsnMet: 1.075 ± 0.859
0.0AsnAsn: 0.0 ± 0.0
3.224AsnPro: 3.224 ± 1.151
0.537AsnGln: 0.537 ± 0.475
1.075AsnArg: 1.075 ± 1.158
1.612AsnSer: 1.612 ± 0.743
1.075AsnThr: 1.075 ± 0.48
1.612AsnVal: 1.612 ± 0.974
0.537AsnTrp: 0.537 ± 0.753
0.537AsnTyr: 0.537 ± 0.433
0.0AsnXaa: 0.0 ± 0.0
Pro
10.747ProAla: 10.747 ± 1.917
1.075ProCys: 1.075 ± 0.637
3.224ProAsp: 3.224 ± 1.489
3.761ProGlu: 3.761 ± 2.242
1.612ProPhe: 1.612 ± 1.266
4.299ProGly: 4.299 ± 1.224
1.075ProHis: 1.075 ± 0.889
3.224ProIle: 3.224 ± 1.181
2.149ProLys: 2.149 ± 0.798
4.836ProLeu: 4.836 ± 1.531
0.537ProMet: 0.537 ± 0.441
2.149ProAsn: 2.149 ± 0.947
2.687ProPro: 2.687 ± 1.211
1.612ProGln: 1.612 ± 0.707
1.075ProArg: 1.075 ± 1.158
3.224ProSer: 3.224 ± 1.441
4.836ProThr: 4.836 ± 1.729
6.448ProVal: 6.448 ± 2.061
0.0ProTrp: 0.0 ± 0.0
2.149ProTyr: 2.149 ± 1.11
0.0ProXaa: 0.0 ± 0.0
Gln
3.224GlnAla: 3.224 ± 0.835
1.075GlnCys: 1.075 ± 0.48
2.149GlnAsp: 2.149 ± 1.326
2.149GlnGlu: 2.149 ± 0.886
0.537GlnPhe: 0.537 ± 0.475
3.761GlnGly: 3.761 ± 1.351
1.075GlnHis: 1.075 ± 0.572
0.537GlnIle: 0.537 ± 0.391
2.149GlnLys: 2.149 ± 1.066
6.448GlnLeu: 6.448 ± 1.92
1.075GlnMet: 1.075 ± 0.519
0.537GlnAsn: 0.537 ± 0.391
1.612GlnPro: 1.612 ± 0.989
2.149GlnGln: 2.149 ± 0.964
3.224GlnArg: 3.224 ± 1.217
0.537GlnSer: 0.537 ± 0.753
1.612GlnThr: 1.612 ± 0.828
1.612GlnVal: 1.612 ± 0.67
0.0GlnTrp: 0.0 ± 0.0
0.537GlnTyr: 0.537 ± 0.475
0.0GlnXaa: 0.0 ± 0.0
Arg
5.911ArgAla: 5.911 ± 1.999
2.687ArgCys: 2.687 ± 1.192
2.687ArgAsp: 2.687 ± 0.8
3.761ArgGlu: 3.761 ± 1.551
0.537ArgPhe: 0.537 ± 0.508
4.836ArgGly: 4.836 ± 2.216
1.612ArgHis: 1.612 ± 0.598
2.687ArgIle: 2.687 ± 1.39
4.836ArgLys: 4.836 ± 1.685
2.687ArgLeu: 2.687 ± 1.166
2.149ArgMet: 2.149 ± 0.936
0.0ArgAsn: 0.0 ± 0.0
3.224ArgPro: 3.224 ± 0.929
3.761ArgGln: 3.761 ± 1.87
6.448ArgArg: 6.448 ± 2.782
3.224ArgSer: 3.224 ± 0.958
2.687ArgThr: 2.687 ± 1.56
4.299ArgVal: 4.299 ± 0.938
0.537ArgTrp: 0.537 ± 0.391
2.149ArgTyr: 2.149 ± 0.914
0.0ArgXaa: 0.0 ± 0.0
Ser
8.06SerAla: 8.06 ± 1.437
0.537SerCys: 0.537 ± 0.433
4.836SerAsp: 4.836 ± 1.901
2.149SerGlu: 2.149 ± 0.914
1.612SerPhe: 1.612 ± 1.301
2.687SerGly: 2.687 ± 1.081
1.612SerHis: 1.612 ± 1.009
3.224SerIle: 3.224 ± 1.221
3.761SerLys: 3.761 ± 1.109
5.373SerLeu: 5.373 ± 1.778
1.612SerMet: 1.612 ± 0.98
2.149SerAsn: 2.149 ± 1.329
2.687SerPro: 2.687 ± 1.17
1.075SerGln: 1.075 ± 0.718
1.075SerArg: 1.075 ± 1.506
4.299SerSer: 4.299 ± 1.333
2.687SerThr: 2.687 ± 0.708
2.149SerVal: 2.149 ± 1.026
0.537SerTrp: 0.537 ± 0.433
1.612SerTyr: 1.612 ± 0.818
0.0SerXaa: 0.0 ± 0.0
Thr
11.822ThrAla: 11.822 ± 3.188
1.075ThrCys: 1.075 ± 0.69
4.836ThrAsp: 4.836 ± 1.435
1.612ThrGlu: 1.612 ± 0.44
3.761ThrPhe: 3.761 ± 1.623
9.135ThrGly: 9.135 ± 2.893
0.537ThrHis: 0.537 ± 0.475
2.687ThrIle: 2.687 ± 1.007
1.612ThrLys: 1.612 ± 1.213
3.224ThrLeu: 3.224 ± 1.183
3.224ThrMet: 3.224 ± 0.965
1.612ThrAsn: 1.612 ± 0.779
4.836ThrPro: 4.836 ± 2.087
2.687ThrGln: 2.687 ± 0.908
1.612ThrArg: 1.612 ± 0.44
4.836ThrSer: 4.836 ± 2.186
3.224ThrThr: 3.224 ± 1.085
5.373ThrVal: 5.373 ± 1.553
1.612ThrTrp: 1.612 ± 0.866
2.149ThrTyr: 2.149 ± 0.77
0.0ThrXaa: 0.0 ± 0.0
Val
11.284ValAla: 11.284 ± 2.294
0.537ValCys: 0.537 ± 0.433
4.299ValAsp: 4.299 ± 1.356
1.612ValGlu: 1.612 ± 1.411
2.149ValPhe: 2.149 ± 0.824
6.985ValGly: 6.985 ± 2.557
0.537ValHis: 0.537 ± 0.391
3.761ValIle: 3.761 ± 0.884
3.224ValLys: 3.224 ± 1.017
6.448ValLeu: 6.448 ± 2.125
2.149ValMet: 2.149 ± 1.151
1.612ValAsn: 1.612 ± 1.299
5.373ValPro: 5.373 ± 1.602
2.149ValGln: 2.149 ± 0.772
2.149ValArg: 2.149 ± 0.969
4.299ValSer: 4.299 ± 1.341
8.598ValThr: 8.598 ± 3.381
3.224ValVal: 3.224 ± 1.67
0.0ValTrp: 0.0 ± 0.0
1.612ValTyr: 1.612 ± 0.916
0.0ValXaa: 0.0 ± 0.0
Trp
2.149TrpAla: 2.149 ± 1.147
0.537TrpCys: 0.537 ± 0.433
0.0TrpAsp: 0.0 ± 0.0
1.075TrpGlu: 1.075 ± 0.727
0.537TrpPhe: 0.537 ± 0.475
1.612TrpGly: 1.612 ± 1.003
0.0TrpHis: 0.0 ± 0.0
0.537TrpIle: 0.537 ± 0.391
0.0TrpLys: 0.0 ± 0.0
1.075TrpLeu: 1.075 ± 0.784
0.0TrpMet: 0.0 ± 0.0
1.075TrpAsn: 1.075 ± 0.866
1.075TrpPro: 1.075 ± 0.519
0.537TrpGln: 0.537 ± 0.475
0.0TrpArg: 0.0 ± 0.0
1.075TrpSer: 1.075 ± 0.48
0.537TrpThr: 0.537 ± 0.391
1.612TrpVal: 1.612 ± 0.908
0.537TrpTrp: 0.537 ± 0.433
1.075TrpTyr: 1.075 ± 0.828
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.149TyrAla: 2.149 ± 0.78
0.0TyrCys: 0.0 ± 0.0
0.537TyrAsp: 0.537 ± 0.475
2.687TyrGlu: 2.687 ± 1.215
1.612TyrPhe: 1.612 ± 0.916
0.537TyrGly: 0.537 ± 0.475
0.0TyrHis: 0.0 ± 0.0
0.537TyrIle: 0.537 ± 0.391
1.075TyrLys: 1.075 ± 0.519
2.149TyrLeu: 2.149 ± 1.1
0.537TyrMet: 0.537 ± 0.563
0.537TyrAsn: 0.537 ± 0.433
0.537TyrPro: 0.537 ± 0.433
0.537TyrGln: 0.537 ± 0.634
4.299TyrArg: 4.299 ± 1.132
1.075TyrSer: 1.075 ± 0.718
2.149TyrThr: 2.149 ± 0.847
2.149TyrVal: 2.149 ± 1.281
0.0TyrTrp: 0.0 ± 0.0
2.149TyrTyr: 2.149 ± 0.903
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (1862 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski