Amino acid dipepetide frequency for Rhodococcus phage RRH1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
26.071AlaAla: 26.071 ± 5.535
0.698AlaCys: 0.698 ± 0.474
11.872AlaAsp: 11.872 ± 1.487
4.888AlaGlu: 4.888 ± 1.055
3.957AlaPhe: 3.957 ± 0.931
9.544AlaGly: 9.544 ± 2.256
1.629AlaHis: 1.629 ± 0.677
6.983AlaIle: 6.983 ± 2.125
2.793AlaLys: 2.793 ± 1.033
10.009AlaLeu: 10.009 ± 1.753
4.19AlaMet: 4.19 ± 0.845
4.423AlaAsn: 4.423 ± 0.943
11.173AlaPro: 11.173 ± 2.043
6.052AlaGln: 6.052 ± 1.634
7.216AlaArg: 7.216 ± 1.676
3.259AlaSer: 3.259 ± 0.71
10.708AlaThr: 10.708 ± 1.419
14.432AlaVal: 14.432 ± 2.146
2.793AlaTrp: 2.793 ± 0.713
4.19AlaTyr: 4.19 ± 0.924
0.0AlaXaa: 0.0 ± 0.0
Cys
0.233CysAla: 0.233 ± 0.208
0.0CysCys: 0.0 ± 0.0
0.233CysAsp: 0.233 ± 0.255
0.233CysGlu: 0.233 ± 0.266
0.0CysPhe: 0.0 ± 0.0
0.698CysGly: 0.698 ± 0.436
0.466CysHis: 0.466 ± 0.302
0.0CysIle: 0.0 ± 0.0
0.233CysLys: 0.233 ± 0.202
0.233CysLeu: 0.233 ± 0.218
0.233CysMet: 0.233 ± 0.224
0.466CysAsn: 0.466 ± 0.33
0.0CysPro: 0.0 ± 0.0
0.233CysGln: 0.233 ± 0.202
1.164CysArg: 1.164 ± 0.681
0.466CysSer: 0.466 ± 0.311
0.233CysThr: 0.233 ± 0.218
0.466CysVal: 0.466 ± 0.388
0.233CysTrp: 0.233 ± 0.223
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
10.475AspAla: 10.475 ± 1.503
0.233AspCys: 0.233 ± 0.224
3.492AspAsp: 3.492 ± 0.997
1.164AspGlu: 1.164 ± 0.653
0.931AspPhe: 0.931 ± 0.401
7.682AspGly: 7.682 ± 0.868
1.397AspHis: 1.397 ± 0.576
3.259AspIle: 3.259 ± 0.67
0.931AspLys: 0.931 ± 0.462
4.19AspLeu: 4.19 ± 0.826
1.397AspMet: 1.397 ± 0.69
2.095AspAsn: 2.095 ± 0.782
9.777AspPro: 9.777 ± 2.035
3.724AspGln: 3.724 ± 1.12
5.819AspArg: 5.819 ± 1.13
2.095AspSer: 2.095 ± 0.69
4.19AspThr: 4.19 ± 1.335
6.518AspVal: 6.518 ± 1.171
1.397AspTrp: 1.397 ± 0.706
1.629AspTyr: 1.629 ± 0.536
0.0AspXaa: 0.0 ± 0.0
Glu
5.121GluAla: 5.121 ± 1.149
1.164GluCys: 1.164 ± 0.704
2.095GluAsp: 2.095 ± 0.419
0.698GluGlu: 0.698 ± 0.409
0.931GluPhe: 0.931 ± 0.482
3.724GluGly: 3.724 ± 0.932
0.931GluHis: 0.931 ± 0.418
0.698GluIle: 0.698 ± 0.3
0.698GluLys: 0.698 ± 0.349
3.259GluLeu: 3.259 ± 0.84
0.233GluMet: 0.233 ± 0.258
0.698GluAsn: 0.698 ± 0.376
2.095GluPro: 2.095 ± 0.703
2.793GluGln: 2.793 ± 0.702
3.026GluArg: 3.026 ± 0.833
2.793GluSer: 2.793 ± 0.9
1.629GluThr: 1.629 ± 0.667
2.328GluVal: 2.328 ± 0.854
0.466GluTrp: 0.466 ± 0.317
0.931GluTyr: 0.931 ± 0.506
0.0GluXaa: 0.0 ± 0.0
Phe
3.492PheAla: 3.492 ± 0.844
0.0PheCys: 0.0 ± 0.0
3.492PheAsp: 3.492 ± 0.879
0.466PheGlu: 0.466 ± 0.298
0.466PhePhe: 0.466 ± 0.31
2.561PheGly: 2.561 ± 0.67
0.698PheHis: 0.698 ± 0.314
0.698PheIle: 0.698 ± 0.453
0.466PheLys: 0.466 ± 0.31
2.095PheLeu: 2.095 ± 0.624
0.466PheMet: 0.466 ± 0.309
0.233PheAsn: 0.233 ± 0.229
2.561PhePro: 2.561 ± 0.559
1.164PheGln: 1.164 ± 0.459
1.629PheArg: 1.629 ± 0.466
0.931PheSer: 0.931 ± 0.392
3.026PheThr: 3.026 ± 0.753
1.397PheVal: 1.397 ± 0.498
0.466PheTrp: 0.466 ± 0.39
1.164PheTyr: 1.164 ± 0.569
0.0PheXaa: 0.0 ± 0.0
Gly
10.94GlyAla: 10.94 ± 1.822
0.233GlyCys: 0.233 ± 0.255
5.587GlyAsp: 5.587 ± 1.2
2.328GlyGlu: 2.328 ± 0.688
3.259GlyPhe: 3.259 ± 0.807
6.75GlyGly: 6.75 ± 1.065
2.095GlyHis: 2.095 ± 0.685
5.354GlyIle: 5.354 ± 1.245
0.931GlyLys: 0.931 ± 0.495
5.587GlyLeu: 5.587 ± 1.028
1.397GlyMet: 1.397 ± 0.553
2.561GlyAsn: 2.561 ± 0.758
3.259GlyPro: 3.259 ± 1.475
0.466GlyGln: 0.466 ± 0.31
6.285GlyArg: 6.285 ± 1.158
9.544GlySer: 9.544 ± 1.254
6.983GlyThr: 6.983 ± 1.315
4.423GlyVal: 4.423 ± 1.238
2.561GlyTrp: 2.561 ± 0.95
0.698GlyTyr: 0.698 ± 0.442
0.0GlyXaa: 0.0 ± 0.0
His
2.328HisAla: 2.328 ± 0.753
0.466HisCys: 0.466 ± 0.404
1.164HisAsp: 1.164 ± 0.504
0.931HisGlu: 0.931 ± 0.336
0.0HisPhe: 0.0 ± 0.0
1.629HisGly: 1.629 ± 0.579
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.466HisLys: 0.466 ± 0.335
2.095HisLeu: 2.095 ± 0.907
0.0HisMet: 0.0 ± 0.0
0.466HisAsn: 0.466 ± 0.276
1.164HisPro: 1.164 ± 0.542
0.466HisGln: 0.466 ± 0.293
2.328HisArg: 2.328 ± 0.721
1.397HisSer: 1.397 ± 0.668
1.397HisThr: 1.397 ± 0.53
1.164HisVal: 1.164 ± 0.518
0.698HisTrp: 0.698 ± 0.392
0.233HisTyr: 0.233 ± 0.258
0.0HisXaa: 0.0 ± 0.0
Ile
5.121IleAla: 5.121 ± 1.528
0.233IleCys: 0.233 ± 0.266
4.423IleAsp: 4.423 ± 1.104
3.724IleGlu: 3.724 ± 1.229
0.698IlePhe: 0.698 ± 0.437
5.121IleGly: 5.121 ± 1.265
1.164IleHis: 1.164 ± 0.513
0.233IleIle: 0.233 ± 0.202
0.466IleLys: 0.466 ± 0.458
3.492IleLeu: 3.492 ± 0.889
0.233IleMet: 0.233 ± 0.202
1.164IleAsn: 1.164 ± 0.466
3.957IlePro: 3.957 ± 0.989
1.397IleGln: 1.397 ± 0.461
5.819IleArg: 5.819 ± 0.928
2.095IleSer: 2.095 ± 1.068
1.862IleThr: 1.862 ± 0.586
4.888IleVal: 4.888 ± 1.253
0.466IleTrp: 0.466 ± 0.27
0.931IleTyr: 0.931 ± 0.381
0.0IleXaa: 0.0 ± 0.0
Lys
2.561LysAla: 2.561 ± 0.506
0.0LysCys: 0.0 ± 0.0
0.466LysAsp: 0.466 ± 0.305
0.233LysGlu: 0.233 ± 0.229
0.233LysPhe: 0.233 ± 0.193
1.164LysGly: 1.164 ± 0.66
0.0LysHis: 0.0 ± 0.0
0.931LysIle: 0.931 ± 0.426
1.397LysLys: 1.397 ± 0.849
2.095LysLeu: 2.095 ± 0.657
0.0LysMet: 0.0 ± 0.0
0.698LysAsn: 0.698 ± 0.36
0.931LysPro: 0.931 ± 0.677
0.931LysGln: 0.931 ± 0.439
2.095LysArg: 2.095 ± 0.538
0.698LysSer: 0.698 ± 0.372
1.164LysThr: 1.164 ± 0.533
0.931LysVal: 0.931 ± 0.39
0.931LysTrp: 0.931 ± 0.409
0.698LysTyr: 0.698 ± 0.327
0.0LysXaa: 0.0 ± 0.0
Leu
11.872LeuAla: 11.872 ± 1.119
0.233LeuCys: 0.233 ± 0.238
4.655LeuAsp: 4.655 ± 1.163
3.957LeuGlu: 3.957 ± 0.802
2.561LeuPhe: 2.561 ± 0.684
6.052LeuGly: 6.052 ± 0.875
1.164LeuHis: 1.164 ± 0.512
5.587LeuIle: 5.587 ± 1.366
1.629LeuLys: 1.629 ± 0.511
4.655LeuLeu: 4.655 ± 0.742
0.466LeuMet: 0.466 ± 0.359
2.095LeuAsn: 2.095 ± 0.872
3.259LeuPro: 3.259 ± 0.699
1.862LeuGln: 1.862 ± 0.616
4.423LeuArg: 4.423 ± 1.005
3.026LeuSer: 3.026 ± 0.775
3.492LeuThr: 3.492 ± 1.306
4.888LeuVal: 4.888 ± 1.023
0.931LeuTrp: 0.931 ± 0.443
0.466LeuTyr: 0.466 ± 0.456
0.0LeuXaa: 0.0 ± 0.0
Met
2.095MetAla: 2.095 ± 0.932
0.233MetCys: 0.233 ± 0.258
0.698MetAsp: 0.698 ± 0.479
0.466MetGlu: 0.466 ± 0.388
0.698MetPhe: 0.698 ± 0.296
1.164MetGly: 1.164 ± 0.547
0.466MetHis: 0.466 ± 0.345
0.931MetIle: 0.931 ± 0.565
0.466MetLys: 0.466 ± 0.35
1.397MetLeu: 1.397 ± 0.523
0.233MetMet: 0.233 ± 0.229
0.931MetAsn: 0.931 ± 0.619
1.164MetPro: 1.164 ± 0.454
0.233MetGln: 0.233 ± 0.202
0.466MetArg: 0.466 ± 0.331
2.793MetSer: 2.793 ± 0.784
1.629MetThr: 1.629 ± 0.57
0.466MetVal: 0.466 ± 0.244
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.655AsnAla: 4.655 ± 1.024
0.0AsnCys: 0.0 ± 0.0
1.397AsnAsp: 1.397 ± 0.605
0.0AsnGlu: 0.0 ± 0.0
0.466AsnPhe: 0.466 ± 0.244
3.492AsnGly: 3.492 ± 0.962
1.397AsnHis: 1.397 ± 0.686
1.629AsnIle: 1.629 ± 0.586
1.164AsnLys: 1.164 ± 0.404
1.397AsnLeu: 1.397 ± 0.56
0.233AsnMet: 0.233 ± 0.229
0.0AsnAsn: 0.0 ± 0.0
2.793AsnPro: 2.793 ± 0.507
0.931AsnGln: 0.931 ± 0.291
1.862AsnArg: 1.862 ± 0.6
1.862AsnSer: 1.862 ± 0.751
3.026AsnThr: 3.026 ± 0.859
1.862AsnVal: 1.862 ± 0.6
0.466AsnTrp: 0.466 ± 0.435
1.629AsnTyr: 1.629 ± 0.488
0.0AsnXaa: 0.0 ± 0.0
Pro
14.199ProAla: 14.199 ± 2.582
0.233ProCys: 0.233 ± 0.29
5.354ProAsp: 5.354 ± 1.009
1.397ProGlu: 1.397 ± 0.684
1.397ProPhe: 1.397 ± 0.536
4.888ProGly: 4.888 ± 1.012
0.931ProHis: 0.931 ± 0.402
4.19ProIle: 4.19 ± 1.329
0.466ProLys: 0.466 ± 0.318
2.793ProLeu: 2.793 ± 0.697
0.698ProMet: 0.698 ± 0.511
2.328ProAsn: 2.328 ± 0.928
2.328ProPro: 2.328 ± 0.696
1.164ProGln: 1.164 ± 0.602
4.19ProArg: 4.19 ± 1.171
4.888ProSer: 4.888 ± 1.08
7.216ProThr: 7.216 ± 1.301
7.449ProVal: 7.449 ± 1.532
2.095ProTrp: 2.095 ± 0.749
0.931ProTyr: 0.931 ± 0.521
0.0ProXaa: 0.0 ± 0.0
Gln
4.655GlnAla: 4.655 ± 1.436
0.0GlnCys: 0.0 ± 0.0
1.397GlnAsp: 1.397 ± 0.551
0.466GlnGlu: 0.466 ± 0.458
0.233GlnPhe: 0.233 ± 0.218
0.698GlnGly: 0.698 ± 0.373
0.466GlnHis: 0.466 ± 0.333
0.466GlnIle: 0.466 ± 0.276
0.233GlnLys: 0.233 ± 0.218
4.655GlnLeu: 4.655 ± 0.979
0.233GlnMet: 0.233 ± 0.251
1.164GlnAsn: 1.164 ± 0.447
2.328GlnPro: 2.328 ± 0.824
1.164GlnGln: 1.164 ± 0.558
2.793GlnArg: 2.793 ± 0.984
2.561GlnSer: 2.561 ± 0.549
2.095GlnThr: 2.095 ± 0.792
1.397GlnVal: 1.397 ± 0.441
0.698GlnTrp: 0.698 ± 0.282
0.698GlnTyr: 0.698 ± 0.343
0.0GlnXaa: 0.0 ± 0.0
Arg
6.983ArgAla: 6.983 ± 1.258
0.233ArgCys: 0.233 ± 0.224
5.354ArgAsp: 5.354 ± 1.227
3.026ArgGlu: 3.026 ± 1.222
2.561ArgPhe: 2.561 ± 0.694
5.121ArgGly: 5.121 ± 0.887
2.095ArgHis: 2.095 ± 0.746
6.285ArgIle: 6.285 ± 1.333
1.164ArgLys: 1.164 ± 0.471
4.423ArgLeu: 4.423 ± 1.127
2.793ArgMet: 2.793 ± 0.639
3.957ArgAsn: 3.957 ± 0.898
4.19ArgPro: 4.19 ± 1.104
0.931ArgGln: 0.931 ± 0.516
7.216ArgArg: 7.216 ± 2.382
2.793ArgSer: 2.793 ± 0.759
6.518ArgThr: 6.518 ± 0.913
6.518ArgVal: 6.518 ± 1.669
0.931ArgTrp: 0.931 ± 0.508
1.629ArgTyr: 1.629 ± 0.697
0.0ArgXaa: 0.0 ± 0.0
Ser
7.914SerAla: 7.914 ± 2.007
0.698SerCys: 0.698 ± 0.431
3.492SerAsp: 3.492 ± 0.897
3.492SerGlu: 3.492 ± 0.784
1.862SerPhe: 1.862 ± 0.666
4.888SerGly: 4.888 ± 0.792
1.629SerHis: 1.629 ± 0.584
2.095SerIle: 2.095 ± 0.553
0.233SerLys: 0.233 ± 0.224
4.19SerLeu: 4.19 ± 0.915
0.698SerMet: 0.698 ± 0.352
2.095SerAsn: 2.095 ± 0.766
5.587SerPro: 5.587 ± 0.934
0.931SerGln: 0.931 ± 0.458
4.19SerArg: 4.19 ± 1.192
4.888SerSer: 4.888 ± 0.663
3.492SerThr: 3.492 ± 0.835
5.354SerVal: 5.354 ± 1.071
2.328SerTrp: 2.328 ± 0.673
0.698SerTyr: 0.698 ± 0.472
0.0SerXaa: 0.0 ± 0.0
Thr
9.777ThrAla: 9.777 ± 1.369
0.466ThrCys: 0.466 ± 0.358
5.354ThrAsp: 5.354 ± 1.12
2.793ThrGlu: 2.793 ± 0.844
2.095ThrPhe: 2.095 ± 0.688
6.052ThrGly: 6.052 ± 0.933
0.466ThrHis: 0.466 ± 0.318
1.862ThrIle: 1.862 ± 0.474
1.629ThrLys: 1.629 ± 0.405
3.724ThrLeu: 3.724 ± 0.863
0.698ThrMet: 0.698 ± 0.306
1.862ThrAsn: 1.862 ± 0.609
5.587ThrPro: 5.587 ± 1.477
0.466ThrGln: 0.466 ± 0.258
5.819ThrArg: 5.819 ± 1.164
6.052ThrSer: 6.052 ± 0.848
6.285ThrThr: 6.285 ± 1.586
9.777ThrVal: 9.777 ± 1.554
1.397ThrTrp: 1.397 ± 0.491
1.629ThrTyr: 1.629 ± 0.729
0.0ThrXaa: 0.0 ± 0.0
Val
13.734ValAla: 13.734 ± 2.76
0.698ValCys: 0.698 ± 0.422
10.242ValAsp: 10.242 ± 1.919
4.19ValGlu: 4.19 ± 0.678
3.957ValPhe: 3.957 ± 0.927
5.354ValGly: 5.354 ± 1.519
0.466ValHis: 0.466 ± 0.355
4.19ValIle: 4.19 ± 0.875
2.095ValLys: 2.095 ± 0.659
3.957ValLeu: 3.957 ± 0.961
1.629ValMet: 1.629 ± 0.561
1.862ValAsn: 1.862 ± 0.412
4.655ValPro: 4.655 ± 1.047
1.862ValGln: 1.862 ± 0.513
5.121ValArg: 5.121 ± 1.098
4.19ValSer: 4.19 ± 1.12
6.285ValThr: 6.285 ± 1.47
7.449ValVal: 7.449 ± 1.01
1.164ValTrp: 1.164 ± 0.494
1.629ValTyr: 1.629 ± 0.764
0.0ValXaa: 0.0 ± 0.0
Trp
1.862TrpAla: 1.862 ± 0.736
0.0TrpCys: 0.0 ± 0.0
0.466TrpAsp: 0.466 ± 0.42
0.698TrpGlu: 0.698 ± 0.42
0.698TrpPhe: 0.698 ± 0.299
1.629TrpGly: 1.629 ± 0.816
0.466TrpHis: 0.466 ± 0.275
1.397TrpIle: 1.397 ± 0.45
0.698TrpLys: 0.698 ± 0.381
1.164TrpLeu: 1.164 ± 0.293
0.466TrpMet: 0.466 ± 0.304
0.698TrpAsn: 0.698 ± 0.353
0.698TrpPro: 0.698 ± 0.468
0.698TrpGln: 0.698 ± 0.436
2.561TrpArg: 2.561 ± 0.647
3.259TrpSer: 3.259 ± 0.827
0.931TrpThr: 0.931 ± 0.443
1.862TrpVal: 1.862 ± 0.635
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.793TyrAla: 2.793 ± 0.859
0.0TyrCys: 0.0 ± 0.0
1.164TyrAsp: 1.164 ± 0.371
1.164TyrGlu: 1.164 ± 0.407
0.466TyrPhe: 0.466 ± 0.288
3.026TyrGly: 3.026 ± 0.817
0.466TyrHis: 0.466 ± 0.293
0.698TyrIle: 0.698 ± 0.49
0.0TyrLys: 0.0 ± 0.0
1.862TyrLeu: 1.862 ± 0.73
0.0TyrMet: 0.0 ± 0.0
0.466TyrAsn: 0.466 ± 0.266
1.629TyrPro: 1.629 ± 0.596
0.931TyrGln: 0.931 ± 0.37
0.698TyrArg: 0.698 ± 0.447
1.164TyrSer: 1.164 ± 0.49
1.629TyrThr: 1.629 ± 0.63
1.164TyrVal: 1.164 ± 0.591
0.466TyrTrp: 0.466 ± 0.338
0.233TyrTyr: 0.233 ± 0.235
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 20 proteins (4297 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski