Amino acid dipepetide frequency for Paenibacillus sp. BC26

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.109AlaAla: 10.109 ± 0.105
0.76AlaCys: 0.76 ± 0.019
4.634AlaAsp: 4.634 ± 0.059
5.69AlaGlu: 5.69 ± 0.078
3.577AlaPhe: 3.577 ± 0.048
7.044AlaGly: 7.044 ± 0.071
1.541AlaHis: 1.541 ± 0.032
5.776AlaIle: 5.776 ± 0.05
4.361AlaLys: 4.361 ± 0.051
8.366AlaLeu: 8.366 ± 0.068
2.376AlaMet: 2.376 ± 0.036
3.076AlaAsn: 3.076 ± 0.045
2.881AlaPro: 2.881 ± 0.041
2.78AlaGln: 2.78 ± 0.038
3.513AlaArg: 3.513 ± 0.047
5.507AlaSer: 5.507 ± 0.064
3.99AlaThr: 3.99 ± 0.059
6.755AlaVal: 6.755 ± 0.064
1.087AlaTrp: 1.087 ± 0.022
2.852AlaTyr: 2.852 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
0.582CysAla: 0.582 ± 0.018
0.121CysCys: 0.121 ± 0.009
0.415CysAsp: 0.415 ± 0.016
0.45CysGlu: 0.45 ± 0.014
0.338CysPhe: 0.338 ± 0.013
0.772CysGly: 0.772 ± 0.017
0.191CysHis: 0.191 ± 0.011
0.494CysIle: 0.494 ± 0.016
0.315CysLys: 0.315 ± 0.013
0.717CysLeu: 0.717 ± 0.019
0.222CysMet: 0.222 ± 0.009
0.259CysAsn: 0.259 ± 0.012
0.324CysPro: 0.324 ± 0.014
0.219CysGln: 0.219 ± 0.01
0.413CysArg: 0.413 ± 0.015
0.574CysSer: 0.574 ± 0.018
0.41CysThr: 0.41 ± 0.013
0.449CysVal: 0.449 ± 0.014
0.108CysTrp: 0.108 ± 0.007
0.294CysTyr: 0.294 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
4.506AspAla: 4.506 ± 0.057
0.38AspCys: 0.38 ± 0.014
2.547AspAsp: 2.547 ± 0.038
3.769AspGlu: 3.769 ± 0.05
2.134AspPhe: 2.134 ± 0.031
4.344AspGly: 4.344 ± 0.054
1.092AspHis: 1.092 ± 0.022
3.379AspIle: 3.379 ± 0.042
2.642AspLys: 2.642 ± 0.042
4.645AspLeu: 4.645 ± 0.049
1.391AspMet: 1.391 ± 0.026
2.004AspAsn: 2.004 ± 0.033
2.412AspPro: 2.412 ± 0.033
1.894AspGln: 1.894 ± 0.033
2.788AspArg: 2.788 ± 0.044
2.891AspSer: 2.891 ± 0.041
2.603AspThr: 2.603 ± 0.04
3.637AspVal: 3.637 ± 0.047
0.958AspTrp: 0.958 ± 0.023
2.195AspTyr: 2.195 ± 0.038
0.0AspXaa: 0.0 ± 0.0
Glu
6.182GluAla: 6.182 ± 0.074
0.361GluCys: 0.361 ± 0.013
2.927GluAsp: 2.927 ± 0.045
4.699GluGlu: 4.699 ± 0.067
2.062GluPhe: 2.062 ± 0.038
4.321GluGly: 4.321 ± 0.048
1.501GluHis: 1.501 ± 0.026
3.878GluIle: 3.878 ± 0.052
3.343GluLys: 3.343 ± 0.049
7.133GluLeu: 7.133 ± 0.084
1.883GluMet: 1.883 ± 0.032
2.209GluAsn: 2.209 ± 0.035
2.316GluPro: 2.316 ± 0.03
3.588GluGln: 3.588 ± 0.055
3.86GluArg: 3.86 ± 0.055
3.533GluSer: 3.533 ± 0.045
3.26GluThr: 3.26 ± 0.044
4.254GluVal: 4.254 ± 0.05
1.01GluTrp: 1.01 ± 0.02
1.789GluTyr: 1.789 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
3.376PheAla: 3.376 ± 0.042
0.344PheCys: 0.344 ± 0.012
2.395PheAsp: 2.395 ± 0.033
2.427PheGlu: 2.427 ± 0.035
1.807PhePhe: 1.807 ± 0.036
3.34PheGly: 3.34 ± 0.049
0.958PheHis: 0.958 ± 0.022
2.943PheIle: 2.943 ± 0.045
2.002PheLys: 2.002 ± 0.032
3.722PheLeu: 3.722 ± 0.054
1.193PheMet: 1.193 ± 0.024
1.715PheAsn: 1.715 ± 0.033
1.614PhePro: 1.614 ± 0.032
1.403PheGln: 1.403 ± 0.024
2.052PheArg: 2.052 ± 0.035
2.783PheSer: 2.783 ± 0.037
2.607PheThr: 2.607 ± 0.045
3.001PheVal: 3.001 ± 0.04
0.565PheTrp: 0.565 ± 0.016
1.556PheTyr: 1.556 ± 0.029
0.0PheXaa: 0.0 ± 0.0
Gly
5.973GlyAla: 5.973 ± 0.077
0.723GlyCys: 0.723 ± 0.019
3.757GlyAsp: 3.757 ± 0.049
4.463GlyGlu: 4.463 ± 0.049
3.316GlyPhe: 3.316 ± 0.047
5.812GlyGly: 5.812 ± 0.077
1.53GlyHis: 1.53 ± 0.031
5.675GlyIle: 5.675 ± 0.064
4.354GlyLys: 4.354 ± 0.051
7.098GlyLeu: 7.098 ± 0.063
2.386GlyMet: 2.386 ± 0.036
2.944GlyAsn: 2.944 ± 0.046
1.984GlyPro: 1.984 ± 0.032
2.642GlyGln: 2.642 ± 0.04
3.398GlyArg: 3.398 ± 0.044
5.08GlySer: 5.08 ± 0.067
4.653GlyThr: 4.653 ± 0.071
5.178GlyVal: 5.178 ± 0.055
1.162GlyTrp: 1.162 ± 0.026
3.019GlyTyr: 3.019 ± 0.041
0.0GlyXaa: 0.0 ± 0.0
His
1.786HisAla: 1.786 ± 0.031
0.214HisCys: 0.214 ± 0.011
1.07HisAsp: 1.07 ± 0.021
1.322HisGlu: 1.322 ± 0.025
1.061HisPhe: 1.061 ± 0.023
1.573HisGly: 1.573 ± 0.031
0.599HisHis: 0.599 ± 0.018
1.366HisIle: 1.366 ± 0.027
0.797HisLys: 0.797 ± 0.021
2.036HisLeu: 2.036 ± 0.037
0.574HisMet: 0.574 ± 0.017
0.74HisAsn: 0.74 ± 0.018
1.186HisPro: 1.186 ± 0.025
0.762HisGln: 0.762 ± 0.018
1.101HisArg: 1.101 ± 0.023
1.172HisSer: 1.172 ± 0.024
1.112HisThr: 1.112 ± 0.022
1.526HisVal: 1.526 ± 0.028
0.342HisTrp: 0.342 ± 0.013
0.939HisTyr: 0.939 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
6.19IleAla: 6.19 ± 0.063
0.599IleCys: 0.599 ± 0.018
3.898IleAsp: 3.898 ± 0.045
4.221IleGlu: 4.221 ± 0.051
2.325IlePhe: 2.325 ± 0.041
5.448IleGly: 5.448 ± 0.063
1.492IleHis: 1.492 ± 0.029
4.213IleIle: 4.213 ± 0.055
2.783IleLys: 2.783 ± 0.042
5.342IleLeu: 5.342 ± 0.069
1.729IleMet: 1.729 ± 0.03
2.448IleAsn: 2.448 ± 0.036
3.128IlePro: 3.128 ± 0.05
2.312IleGln: 2.312 ± 0.035
3.672IleArg: 3.672 ± 0.05
4.47IleSer: 4.47 ± 0.045
4.005IleThr: 4.005 ± 0.048
5.229IleVal: 5.229 ± 0.053
0.764IleTrp: 0.764 ± 0.019
2.099IleTyr: 2.099 ± 0.035
0.0IleXaa: 0.0 ± 0.0
Lys
4.203LysAla: 4.203 ± 0.055
0.227LysCys: 0.227 ± 0.01
2.775LysAsp: 2.775 ± 0.041
3.603LysGlu: 3.603 ± 0.044
1.567LysPhe: 1.567 ± 0.03
3.375LysGly: 3.375 ± 0.04
1.12LysHis: 1.12 ± 0.024
2.648LysIle: 2.648 ± 0.037
2.813LysLys: 2.813 ± 0.042
5.697LysLeu: 5.697 ± 0.064
1.472LysMet: 1.472 ± 0.028
1.799LysAsn: 1.799 ± 0.032
2.404LysPro: 2.404 ± 0.037
2.604LysGln: 2.604 ± 0.037
2.833LysArg: 2.833 ± 0.04
2.961LysSer: 2.961 ± 0.039
2.616LysThr: 2.616 ± 0.035
3.388LysVal: 3.388 ± 0.042
0.752LysTrp: 0.752 ± 0.018
1.599LysTyr: 1.599 ± 0.031
0.0LysXaa: 0.0 ± 0.0
Leu
8.464LeuAla: 8.464 ± 0.079
0.828LeuCys: 0.828 ± 0.02
4.872LeuAsp: 4.872 ± 0.049
5.718LeuGlu: 5.718 ± 0.071
4.552LeuPhe: 4.552 ± 0.062
6.577LeuGly: 6.577 ± 0.062
2.203LeuHis: 2.203 ± 0.032
6.362LeuIle: 6.362 ± 0.066
4.959LeuLys: 4.959 ± 0.05
10.976LeuLeu: 10.976 ± 0.109
2.657LeuMet: 2.657 ± 0.039
3.748LeuAsn: 3.748 ± 0.045
4.568LeuPro: 4.568 ± 0.058
4.076LeuGln: 4.076 ± 0.054
5.045LeuArg: 5.045 ± 0.062
6.893LeuSer: 6.893 ± 0.063
5.792LeuThr: 5.792 ± 0.064
6.311LeuVal: 6.311 ± 0.066
1.099LeuTrp: 1.099 ± 0.025
3.227LeuTyr: 3.227 ± 0.041
0.0LeuXaa: 0.0 ± 0.0
Met
2.191MetAla: 2.191 ± 0.035
0.171MetCys: 0.171 ± 0.01
1.476MetAsp: 1.476 ± 0.031
1.758MetGlu: 1.758 ± 0.031
1.068MetPhe: 1.068 ± 0.023
1.74MetGly: 1.74 ± 0.032
0.523MetHis: 0.523 ± 0.016
1.883MetIle: 1.883 ± 0.033
1.988MetLys: 1.988 ± 0.03
3.049MetLeu: 3.049 ± 0.038
0.873MetMet: 0.873 ± 0.025
1.487MetAsn: 1.487 ± 0.025
1.242MetPro: 1.242 ± 0.027
1.064MetGln: 1.064 ± 0.023
1.441MetArg: 1.441 ± 0.028
1.806MetSer: 1.806 ± 0.031
1.727MetThr: 1.727 ± 0.029
1.669MetVal: 1.669 ± 0.026
0.269MetTrp: 0.269 ± 0.012
0.8MetTyr: 0.8 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
3.187AsnAla: 3.187 ± 0.037
0.265AsnCys: 0.265 ± 0.011
2.041AsnAsp: 2.041 ± 0.036
2.657AsnGlu: 2.657 ± 0.036
1.371AsnPhe: 1.371 ± 0.025
3.545AsnGly: 3.545 ± 0.049
0.858AsnHis: 0.858 ± 0.02
2.273AsnIle: 2.273 ± 0.038
1.985AsnLys: 1.985 ± 0.036
3.262AsnLeu: 3.262 ± 0.04
1.021AsnMet: 1.021 ± 0.022
1.839AsnAsn: 1.839 ± 0.037
1.999AsnPro: 1.999 ± 0.032
1.492AsnGln: 1.492 ± 0.027
2.187AsnArg: 2.187 ± 0.032
2.183AsnSer: 2.183 ± 0.041
2.155AsnThr: 2.155 ± 0.04
2.786AsnVal: 2.786 ± 0.04
0.623AsnTrp: 0.623 ± 0.018
1.489AsnTyr: 1.489 ± 0.033
0.0AsnXaa: 0.0 ± 0.0
Pro
3.585ProAla: 3.585 ± 0.057
0.244ProCys: 0.244 ± 0.011
2.559ProAsp: 2.559 ± 0.036
3.101ProGlu: 3.101 ± 0.041
1.973ProPhe: 1.973 ± 0.035
2.968ProGly: 2.968 ± 0.049
0.928ProHis: 0.928 ± 0.023
2.87ProIle: 2.87 ± 0.035
1.715ProLys: 1.715 ± 0.03
3.971ProLeu: 3.971 ± 0.044
1.034ProMet: 1.034 ± 0.026
1.66ProAsn: 1.66 ± 0.033
1.279ProPro: 1.279 ± 0.031
1.424ProGln: 1.424 ± 0.028
1.476ProArg: 1.476 ± 0.028
2.645ProSer: 2.645 ± 0.034
2.106ProThr: 2.106 ± 0.035
3.164ProVal: 3.164 ± 0.045
0.55ProTrp: 0.55 ± 0.018
1.538ProTyr: 1.538 ± 0.026
0.0ProXaa: 0.0 ± 0.0
Gln
3.465GlnAla: 3.465 ± 0.043
0.204GlnCys: 0.204 ± 0.01
1.689GlnAsp: 1.689 ± 0.031
2.439GlnGlu: 2.439 ± 0.04
1.654GlnPhe: 1.654 ± 0.028
2.571GlnGly: 2.571 ± 0.043
0.845GlnHis: 0.845 ± 0.022
2.403GlnIle: 2.403 ± 0.036
1.64GlnLys: 1.64 ± 0.033
4.479GlnLeu: 4.479 ± 0.061
1.075GlnMet: 1.075 ± 0.023
1.239GlnAsn: 1.239 ± 0.024
1.596GlnPro: 1.596 ± 0.029
1.907GlnGln: 1.907 ± 0.037
1.885GlnArg: 1.885 ± 0.029
2.4GlnSer: 2.4 ± 0.035
2.126GlnThr: 2.126 ± 0.032
2.544GlnVal: 2.544 ± 0.033
0.56GlnTrp: 0.56 ± 0.015
1.283GlnTyr: 1.283 ± 0.026
0.0GlnXaa: 0.0 ± 0.0
Arg
3.515ArgAla: 3.515 ± 0.042
0.355ArgCys: 0.355 ± 0.014
2.488ArgAsp: 2.488 ± 0.043
3.595ArgGlu: 3.595 ± 0.055
2.291ArgPhe: 2.291 ± 0.039
3.143ArgGly: 3.143 ± 0.04
1.046ArgHis: 1.046 ± 0.025
3.812ArgIle: 3.812 ± 0.047
2.834ArgLys: 2.834 ± 0.041
5.067ArgLeu: 5.067 ± 0.056
1.733ArgMet: 1.733 ± 0.029
2.112ArgAsn: 2.112 ± 0.034
1.643ArgPro: 1.643 ± 0.031
1.838ArgGln: 1.838 ± 0.031
2.66ArgArg: 2.66 ± 0.048
3.172ArgSer: 3.172 ± 0.038
2.688ArgThr: 2.688 ± 0.035
3.175ArgVal: 3.175 ± 0.047
0.676ArgTrp: 0.676 ± 0.018
1.976ArgTyr: 1.976 ± 0.031
0.0ArgXaa: 0.0 ± 0.0
Ser
5.283SerAla: 5.283 ± 0.059
0.481SerCys: 0.481 ± 0.017
3.336SerAsp: 3.336 ± 0.047
3.81SerGlu: 3.81 ± 0.05
3.031SerPhe: 3.031 ± 0.046
5.583SerGly: 5.583 ± 0.067
1.274SerHis: 1.274 ± 0.028
4.381SerIle: 4.381 ± 0.045
3.172SerLys: 3.172 ± 0.042
6.204SerLeu: 6.204 ± 0.055
1.773SerMet: 1.773 ± 0.032
2.451SerAsn: 2.451 ± 0.044
2.523SerPro: 2.523 ± 0.042
2.023SerGln: 2.023 ± 0.032
2.989SerArg: 2.989 ± 0.043
4.389SerSer: 4.389 ± 0.064
3.323SerThr: 3.323 ± 0.044
4.579SerVal: 4.579 ± 0.058
0.943SerTrp: 0.943 ± 0.023
2.374SerTyr: 2.374 ± 0.041
0.0SerXaa: 0.0 ± 0.0
Thr
5.056ThrAla: 5.056 ± 0.062
0.358ThrCys: 0.358 ± 0.013
2.992ThrAsp: 2.992 ± 0.045
3.142ThrGlu: 3.142 ± 0.041
2.432ThrPhe: 2.432 ± 0.035
4.665ThrGly: 4.665 ± 0.07
1.02ThrHis: 1.02 ± 0.021
4.138ThrIle: 4.138 ± 0.05
2.503ThrLys: 2.503 ± 0.039
5.358ThrLeu: 5.358 ± 0.058
1.452ThrMet: 1.452 ± 0.026
2.268ThrAsn: 2.268 ± 0.037
2.563ThrPro: 2.563 ± 0.04
1.549ThrGln: 1.549 ± 0.029
2.226ThrArg: 2.226 ± 0.036
3.479ThrSer: 3.479 ± 0.056
3.188ThrThr: 3.188 ± 0.057
4.52ThrVal: 4.52 ± 0.064
0.679ThrTrp: 0.679 ± 0.02
1.97ThrTyr: 1.97 ± 0.042
0.0ThrXaa: 0.0 ± 0.0
Val
5.319ValAla: 5.319 ± 0.051
0.62ValCys: 0.62 ± 0.016
3.66ValAsp: 3.66 ± 0.046
4.221ValGlu: 4.221 ± 0.051
2.959ValPhe: 2.959 ± 0.04
4.603ValGly: 4.603 ± 0.048
1.492ValHis: 1.492 ± 0.026
4.891ValIle: 4.891 ± 0.056
3.801ValLys: 3.801 ± 0.05
6.982ValLeu: 6.982 ± 0.055
1.927ValMet: 1.927 ± 0.032
2.96ValAsn: 2.96 ± 0.047
3.115ValPro: 3.115 ± 0.041
2.634ValGln: 2.634 ± 0.035
3.515ValArg: 3.515 ± 0.049
4.843ValSer: 4.843 ± 0.046
4.544ValThr: 4.544 ± 0.081
5.127ValVal: 5.127 ± 0.057
0.914ValTrp: 0.914 ± 0.023
2.482ValTyr: 2.482 ± 0.041
0.0ValXaa: 0.0 ± 0.0
Trp
0.903TrpAla: 0.903 ± 0.023
0.114TrpCys: 0.114 ± 0.008
0.754TrpAsp: 0.754 ± 0.019
0.756TrpGlu: 0.756 ± 0.019
0.618TrpPhe: 0.618 ± 0.019
0.953TrpGly: 0.953 ± 0.023
0.322TrpHis: 0.322 ± 0.012
0.926TrpIle: 0.926 ± 0.02
0.784TrpLys: 0.784 ± 0.018
1.469TrpLeu: 1.469 ± 0.029
0.458TrpMet: 0.458 ± 0.014
0.747TrpAsn: 0.747 ± 0.021
0.46TrpPro: 0.46 ± 0.017
0.594TrpGln: 0.594 ± 0.018
0.696TrpArg: 0.696 ± 0.017
0.921TrpSer: 0.921 ± 0.021
0.785TrpThr: 0.785 ± 0.021
0.894TrpVal: 0.894 ± 0.021
0.227TrpTrp: 0.227 ± 0.011
0.465TrpTyr: 0.465 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.801TyrAla: 2.801 ± 0.043
0.298TyrCys: 0.298 ± 0.012
1.925TyrAsp: 1.925 ± 0.031
2.266TyrGlu: 2.266 ± 0.035
1.667TyrPhe: 1.667 ± 0.029
2.771TyrGly: 2.771 ± 0.035
0.743TyrHis: 0.743 ± 0.021
2.055TyrIle: 2.055 ± 0.036
1.669TyrLys: 1.669 ± 0.033
3.353TyrLeu: 3.353 ± 0.045
0.991TyrMet: 0.991 ± 0.023
1.514TyrAsn: 1.514 ± 0.03
1.556TyrPro: 1.556 ± 0.028
1.197TyrGln: 1.197 ± 0.022
2.068TyrArg: 2.068 ± 0.036
2.223TyrSer: 2.223 ± 0.038
1.868TyrThr: 1.868 ± 0.036
2.422TyrVal: 2.422 ± 0.039
0.558TyrTrp: 0.558 ± 0.017
1.433TyrTyr: 1.433 ± 0.031
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6316 proteins (2117263 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski