Amino acid dipepetide frequency for Aliifodinibius sp. WN023

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.766AlaAla: 4.766 ± 0.089
0.47AlaCys: 0.47 ± 0.023
4.238AlaAsp: 4.238 ± 0.06
4.949AlaGlu: 4.949 ± 0.075
2.982AlaPhe: 2.982 ± 0.053
4.91AlaGly: 4.91 ± 0.086
1.213AlaHis: 1.213 ± 0.04
4.845AlaIle: 4.845 ± 0.083
3.788AlaLys: 3.788 ± 0.067
6.385AlaLeu: 6.385 ± 0.087
1.776AlaMet: 1.776 ± 0.042
2.952AlaAsn: 2.952 ± 0.056
2.14AlaPro: 2.14 ± 0.044
2.815AlaGln: 2.815 ± 0.066
2.862AlaArg: 2.862 ± 0.053
4.152AlaSer: 4.152 ± 0.065
3.621AlaThr: 3.621 ± 0.066
4.509AlaVal: 4.509 ± 0.073
0.728AlaTrp: 0.728 ± 0.029
2.196AlaTyr: 2.196 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.387CysAla: 0.387 ± 0.02
0.095CysCys: 0.095 ± 0.01
0.371CysAsp: 0.371 ± 0.021
0.4CysGlu: 0.4 ± 0.019
0.298CysPhe: 0.298 ± 0.018
0.546CysGly: 0.546 ± 0.025
0.166CysHis: 0.166 ± 0.014
0.402CysIle: 0.402 ± 0.018
0.307CysLys: 0.307 ± 0.021
0.479CysLeu: 0.479 ± 0.023
0.115CysMet: 0.115 ± 0.011
0.29CysAsn: 0.29 ± 0.018
0.281CysPro: 0.281 ± 0.02
0.2CysGln: 0.2 ± 0.013
0.29CysArg: 0.29 ± 0.018
0.484CysSer: 0.484 ± 0.021
0.325CysThr: 0.325 ± 0.018
0.359CysVal: 0.359 ± 0.02
0.06CysTrp: 0.06 ± 0.008
0.213CysTyr: 0.213 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
3.672AspAla: 3.672 ± 0.067
0.316AspCys: 0.316 ± 0.018
3.895AspAsp: 3.895 ± 0.074
5.413AspGlu: 5.413 ± 0.082
3.031AspPhe: 3.031 ± 0.056
4.397AspGly: 4.397 ± 0.083
1.363AspHis: 1.363 ± 0.04
4.885AspIle: 4.885 ± 0.071
3.572AspLys: 3.572 ± 0.062
5.987AspLeu: 5.987 ± 0.088
1.359AspMet: 1.359 ± 0.038
3.006AspAsn: 3.006 ± 0.061
2.718AspPro: 2.718 ± 0.05
2.772AspGln: 2.772 ± 0.052
3.017AspArg: 3.017 ± 0.054
3.777AspSer: 3.777 ± 0.07
3.171AspThr: 3.171 ± 0.053
3.902AspVal: 3.902 ± 0.063
0.876AspTrp: 0.876 ± 0.028
2.564AspTyr: 2.564 ± 0.054
0.0AspXaa: 0.0 ± 0.0
Glu
5.224GluAla: 5.224 ± 0.083
0.351GluCys: 0.351 ± 0.018
4.76GluAsp: 4.76 ± 0.074
7.306GluGlu: 7.306 ± 0.128
2.923GluPhe: 2.923 ± 0.054
4.757GluGly: 4.757 ± 0.074
1.565GluHis: 1.565 ± 0.041
5.265GluIle: 5.265 ± 0.08
5.361GluLys: 5.361 ± 0.099
7.224GluLeu: 7.224 ± 0.088
2.047GluMet: 2.047 ± 0.046
4.041GluAsn: 4.041 ± 0.075
2.294GluPro: 2.294 ± 0.053
3.64GluGln: 3.64 ± 0.065
3.543GluArg: 3.543 ± 0.064
4.253GluSer: 4.253 ± 0.064
3.74GluThr: 3.74 ± 0.059
5.127GluVal: 5.127 ± 0.071
0.914GluTrp: 0.914 ± 0.029
2.63GluTyr: 2.63 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
2.9PheAla: 2.9 ± 0.052
0.324PheCys: 0.324 ± 0.016
3.188PheAsp: 3.188 ± 0.06
3.472PheGlu: 3.472 ± 0.068
2.12PhePhe: 2.12 ± 0.057
3.415PheGly: 3.415 ± 0.068
0.772PheHis: 0.772 ± 0.028
3.039PheIle: 3.039 ± 0.061
2.331PheLys: 2.331 ± 0.049
4.047PheLeu: 4.047 ± 0.081
0.989PheMet: 0.989 ± 0.031
2.28PheAsn: 2.28 ± 0.056
1.529PhePro: 1.529 ± 0.041
1.412PheGln: 1.412 ± 0.041
1.834PheArg: 1.834 ± 0.048
3.589PheSer: 3.589 ± 0.076
2.582PheThr: 2.582 ± 0.061
2.857PheVal: 2.857 ± 0.061
0.654PheTrp: 0.654 ± 0.026
1.701PheTyr: 1.701 ± 0.041
0.0PheXaa: 0.0 ± 0.0
Gly
4.553GlyAla: 4.553 ± 0.076
0.548GlyCys: 0.548 ± 0.023
4.077GlyAsp: 4.077 ± 0.073
4.536GlyGlu: 4.536 ± 0.082
3.394GlyPhe: 3.394 ± 0.065
5.094GlyGly: 5.094 ± 0.088
1.356GlyHis: 1.356 ± 0.037
5.43GlyIle: 5.43 ± 0.082
3.961GlyLys: 3.961 ± 0.069
6.195GlyLeu: 6.195 ± 0.087
1.899GlyMet: 1.899 ± 0.046
3.383GlyAsn: 3.383 ± 0.059
1.897GlyPro: 1.897 ± 0.047
2.473GlyGln: 2.473 ± 0.052
2.824GlyArg: 2.824 ± 0.052
4.662GlySer: 4.662 ± 0.086
4.232GlyThr: 4.232 ± 0.081
4.422GlyVal: 4.422 ± 0.084
1.037GlyTrp: 1.037 ± 0.036
3.012GlyTyr: 3.012 ± 0.061
0.0GlyXaa: 0.0 ± 0.0
His
1.157HisAla: 1.157 ± 0.033
0.18HisCys: 0.18 ± 0.014
1.091HisAsp: 1.091 ± 0.032
1.244HisGlu: 1.244 ± 0.041
1.029HisPhe: 1.029 ± 0.03
1.299HisGly: 1.299 ± 0.039
0.581HisHis: 0.581 ± 0.026
1.4HisIle: 1.4 ± 0.034
1.025HisLys: 1.025 ± 0.03
1.898HisLeu: 1.898 ± 0.042
0.359HisMet: 0.359 ± 0.022
0.922HisAsn: 0.922 ± 0.032
1.192HisPro: 1.192 ± 0.034
0.855HisGln: 0.855 ± 0.029
0.969HisArg: 0.969 ± 0.028
1.258HisSer: 1.258 ± 0.033
0.959HisThr: 0.959 ± 0.029
1.076HisVal: 1.076 ± 0.035
0.288HisTrp: 0.288 ± 0.018
0.811HisTyr: 0.811 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
5.099IleAla: 5.099 ± 0.079
0.53IleCys: 0.53 ± 0.026
4.875IleAsp: 4.875 ± 0.073
5.42IleGlu: 5.42 ± 0.076
3.025IlePhe: 3.025 ± 0.058
4.944IleGly: 4.944 ± 0.084
1.368IleHis: 1.368 ± 0.04
4.787IleIle: 4.787 ± 0.086
3.932IleLys: 3.932 ± 0.058
5.996IleLeu: 5.996 ± 0.095
1.306IleMet: 1.306 ± 0.042
3.652IleAsn: 3.652 ± 0.071
3.188IlePro: 3.188 ± 0.062
2.619IleGln: 2.619 ± 0.048
3.163IleArg: 3.163 ± 0.053
5.372IleSer: 5.372 ± 0.084
4.433IleThr: 4.433 ± 0.076
4.146IleVal: 4.146 ± 0.067
0.753IleTrp: 0.753 ± 0.03
2.316IleTyr: 2.316 ± 0.049
0.0IleXaa: 0.0 ± 0.0
Lys
4.23LysAla: 4.23 ± 0.075
0.204LysCys: 0.204 ± 0.013
3.308LysAsp: 3.308 ± 0.063
5.617LysGlu: 5.617 ± 0.097
1.905LysPhe: 1.905 ± 0.046
3.601LysGly: 3.601 ± 0.057
1.222LysHis: 1.222 ± 0.037
3.834LysIle: 3.834 ± 0.06
4.778LysLys: 4.778 ± 0.1
5.36LysLeu: 5.36 ± 0.075
1.62LysMet: 1.62 ± 0.042
3.09LysAsn: 3.09 ± 0.061
2.073LysPro: 2.073 ± 0.041
2.642LysGln: 2.642 ± 0.053
2.869LysArg: 2.869 ± 0.062
3.708LysSer: 3.708 ± 0.074
3.141LysThr: 3.141 ± 0.063
3.852LysVal: 3.852 ± 0.063
0.673LysTrp: 0.673 ± 0.026
2.177LysTyr: 2.177 ± 0.048
0.0LysXaa: 0.0 ± 0.0
Leu
6.362LeuAla: 6.362 ± 0.09
0.549LeuCys: 0.549 ± 0.024
5.759LeuAsp: 5.759 ± 0.08
6.558LeuGlu: 6.558 ± 0.105
4.468LeuPhe: 4.468 ± 0.079
6.065LeuGly: 6.065 ± 0.096
1.65LeuHis: 1.65 ± 0.041
6.276LeuIle: 6.276 ± 0.095
5.594LeuLys: 5.594 ± 0.088
8.776LeuLeu: 8.776 ± 0.13
2.185LeuMet: 2.185 ± 0.047
4.624LeuAsn: 4.624 ± 0.083
3.668LeuPro: 3.668 ± 0.056
3.781LeuGln: 3.781 ± 0.06
3.927LeuArg: 3.927 ± 0.065
6.897LeuSer: 6.897 ± 0.09
5.081LeuThr: 5.081 ± 0.078
5.45LeuVal: 5.45 ± 0.074
0.958LeuTrp: 0.958 ± 0.036
2.96LeuTyr: 2.96 ± 0.054
0.0LeuXaa: 0.0 ± 0.0
Met
1.856MetAla: 1.856 ± 0.049
0.092MetCys: 0.092 ± 0.009
1.602MetAsp: 1.602 ± 0.043
1.732MetGlu: 1.732 ± 0.043
0.838MetPhe: 0.838 ± 0.036
1.8MetGly: 1.8 ± 0.048
0.412MetHis: 0.412 ± 0.02
1.573MetIle: 1.573 ± 0.038
1.679MetLys: 1.679 ± 0.046
2.141MetLeu: 2.141 ± 0.046
0.759MetMet: 0.759 ± 0.03
1.311MetAsn: 1.311 ± 0.036
1.017MetPro: 1.017 ± 0.032
0.952MetGln: 0.952 ± 0.028
1.087MetArg: 1.087 ± 0.031
1.645MetSer: 1.645 ± 0.042
1.186MetThr: 1.186 ± 0.033
1.576MetVal: 1.576 ± 0.044
0.196MetTrp: 0.196 ± 0.012
0.649MetTyr: 0.649 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
3.094AsnAla: 3.094 ± 0.055
0.288AsnCys: 0.288 ± 0.016
2.718AsnAsp: 2.718 ± 0.05
3.401AsnGlu: 3.401 ± 0.059
2.202AsnPhe: 2.202 ± 0.056
3.343AsnGly: 3.343 ± 0.061
0.961AsnHis: 0.961 ± 0.03
4.01AsnIle: 4.01 ± 0.069
2.96AsnLys: 2.96 ± 0.062
4.282AsnLeu: 4.282 ± 0.072
1.372AsnMet: 1.372 ± 0.033
2.777AsnAsn: 2.777 ± 0.065
2.737AsnPro: 2.737 ± 0.06
2.067AsnGln: 2.067 ± 0.051
2.449AsnArg: 2.449 ± 0.05
3.164AsnSer: 3.164 ± 0.06
2.769AsnThr: 2.769 ± 0.062
3.012AsnVal: 3.012 ± 0.057
0.638AsnTrp: 0.638 ± 0.025
2.037AsnTyr: 2.037 ± 0.049
0.0AsnXaa: 0.0 ± 0.0
Pro
2.315ProAla: 2.315 ± 0.048
0.199ProCys: 0.199 ± 0.015
3.12ProAsp: 3.12 ± 0.054
3.539ProGlu: 3.539 ± 0.066
1.813ProPhe: 1.813 ± 0.047
2.594ProGly: 2.594 ± 0.058
0.784ProHis: 0.784 ± 0.031
2.68ProIle: 2.68 ± 0.048
2.169ProLys: 2.169 ± 0.051
3.071ProLeu: 3.071 ± 0.053
0.818ProMet: 0.818 ± 0.028
1.99ProAsn: 1.99 ± 0.045
1.254ProPro: 1.254 ± 0.036
1.522ProGln: 1.522 ± 0.04
1.294ProArg: 1.294 ± 0.035
2.382ProSer: 2.382 ± 0.05
2.015ProThr: 2.015 ± 0.047
2.733ProVal: 2.733 ± 0.061
0.41ProTrp: 0.41 ± 0.02
1.398ProTyr: 1.398 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
2.682GlnAla: 2.682 ± 0.06
0.156GlnCys: 0.156 ± 0.012
2.261GlnAsp: 2.261 ± 0.05
3.158GlnGlu: 3.158 ± 0.063
1.743GlnPhe: 1.743 ± 0.042
2.121GlnGly: 2.121 ± 0.045
0.785GlnHis: 0.785 ± 0.025
2.66GlnIle: 2.66 ± 0.056
3.034GlnLys: 3.034 ± 0.059
3.947GlnLeu: 3.947 ± 0.069
1.053GlnMet: 1.053 ± 0.038
2.263GlnAsn: 2.263 ± 0.046
1.506GlnPro: 1.506 ± 0.045
2.533GlnGln: 2.533 ± 0.065
1.914GlnArg: 1.914 ± 0.047
2.708GlnSer: 2.708 ± 0.059
2.195GlnThr: 2.195 ± 0.046
2.306GlnVal: 2.306 ± 0.049
0.511GlnTrp: 0.511 ± 0.023
1.447GlnTyr: 1.447 ± 0.038
0.0GlnXaa: 0.0 ± 0.0
Arg
2.667ArgAla: 2.667 ± 0.059
0.222ArgCys: 0.222 ± 0.015
2.62ArgAsp: 2.62 ± 0.048
3.424ArgGlu: 3.424 ± 0.063
2.145ArgPhe: 2.145 ± 0.041
2.566ArgGly: 2.566 ± 0.053
0.867ArgHis: 0.867 ± 0.028
3.282ArgIle: 3.282 ± 0.056
3.001ArgLys: 3.001 ± 0.058
3.998ArgLeu: 3.998 ± 0.066
1.144ArgMet: 1.144 ± 0.031
2.248ArgAsn: 2.248 ± 0.043
1.482ArgPro: 1.482 ± 0.04
1.795ArgGln: 1.795 ± 0.041
2.001ArgArg: 2.001 ± 0.052
2.868ArgSer: 2.868 ± 0.064
2.232ArgThr: 2.232 ± 0.046
2.646ArgVal: 2.646 ± 0.048
0.598ArgTrp: 0.598 ± 0.025
1.92ArgTyr: 1.92 ± 0.049
0.0ArgXaa: 0.0 ± 0.0
Ser
4.227SerAla: 4.227 ± 0.07
0.464SerCys: 0.464 ± 0.022
4.468SerAsp: 4.468 ± 0.073
4.759SerGlu: 4.759 ± 0.073
3.253SerPhe: 3.253 ± 0.064
5.219SerGly: 5.219 ± 0.098
1.186SerHis: 1.186 ± 0.042
4.86SerIle: 4.86 ± 0.075
3.748SerLys: 3.748 ± 0.068
6.128SerLeu: 6.128 ± 0.086
1.47SerMet: 1.47 ± 0.037
3.339SerAsn: 3.339 ± 0.066
2.306SerPro: 2.306 ± 0.045
2.444SerGln: 2.444 ± 0.05
2.731SerArg: 2.731 ± 0.062
4.862SerSer: 4.862 ± 0.086
3.753SerThr: 3.753 ± 0.065
4.297SerVal: 4.297 ± 0.072
0.905SerTrp: 0.905 ± 0.031
2.584SerTyr: 2.584 ± 0.058
0.0SerXaa: 0.0 ± 0.0
Thr
3.775ThrAla: 3.775 ± 0.07
0.293ThrCys: 0.293 ± 0.017
3.777ThrAsp: 3.777 ± 0.06
3.746ThrGlu: 3.746 ± 0.06
2.777ThrPhe: 2.777 ± 0.058
4.318ThrGly: 4.318 ± 0.071
1.057ThrHis: 1.057 ± 0.031
4.169ThrIle: 4.169 ± 0.073
2.739ThrLys: 2.739 ± 0.051
5.178ThrLeu: 5.178 ± 0.079
1.156ThrMet: 1.156 ± 0.038
2.554ThrAsn: 2.554 ± 0.052
2.321ThrPro: 2.321 ± 0.051
1.931ThrGln: 1.931 ± 0.053
1.841ThrArg: 1.841 ± 0.038
3.448ThrSer: 3.448 ± 0.065
3.154ThrThr: 3.154 ± 0.059
3.955ThrVal: 3.955 ± 0.07
0.537ThrTrp: 0.537 ± 0.022
1.985ThrTyr: 1.985 ± 0.048
0.0ThrXaa: 0.0 ± 0.0
Val
4.374ValAla: 4.374 ± 0.073
0.398ValCys: 0.398 ± 0.02
4.421ValAsp: 4.421 ± 0.071
4.89ValGlu: 4.89 ± 0.073
2.789ValPhe: 2.789 ± 0.054
4.577ValGly: 4.577 ± 0.077
1.121ValHis: 1.121 ± 0.034
4.462ValIle: 4.462 ± 0.071
3.214ValLys: 3.214 ± 0.061
5.788ValLeu: 5.788 ± 0.087
1.558ValMet: 1.558 ± 0.037
3.037ValAsn: 3.037 ± 0.058
2.547ValPro: 2.547 ± 0.049
2.297ValGln: 2.297 ± 0.048
2.715ValArg: 2.715 ± 0.053
4.438ValSer: 4.438 ± 0.069
3.638ValThr: 3.638 ± 0.071
4.321ValVal: 4.321 ± 0.071
0.66ValTrp: 0.66 ± 0.026
2.03ValTyr: 2.03 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
0.769TrpAla: 0.769 ± 0.025
0.083TrpCys: 0.083 ± 0.008
0.805TrpAsp: 0.805 ± 0.031
0.786TrpGlu: 0.786 ± 0.031
0.535TrpPhe: 0.535 ± 0.026
0.822TrpGly: 0.822 ± 0.029
0.285TrpHis: 0.285 ± 0.018
0.861TrpIle: 0.861 ± 0.031
0.745TrpLys: 0.745 ± 0.03
1.189TrpLeu: 1.189 ± 0.037
0.389TrpMet: 0.389 ± 0.021
0.629TrpAsn: 0.629 ± 0.025
0.339TrpPro: 0.339 ± 0.019
0.534TrpGln: 0.534 ± 0.025
0.51TrpArg: 0.51 ± 0.025
0.804TrpSer: 0.804 ± 0.03
0.62TrpThr: 0.62 ± 0.024
0.722TrpVal: 0.722 ± 0.027
0.201TrpTrp: 0.201 ± 0.015
0.465TrpTyr: 0.465 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.187TyrAla: 2.187 ± 0.048
0.305TyrCys: 0.305 ± 0.018
2.337TyrAsp: 2.337 ± 0.049
2.615TyrGlu: 2.615 ± 0.055
1.766TyrPhe: 1.766 ± 0.041
2.597TyrGly: 2.597 ± 0.056
0.877TyrHis: 0.877 ± 0.026
2.202TyrIle: 2.202 ± 0.049
1.971TyrLys: 1.971 ± 0.048
3.523TyrLeu: 3.523 ± 0.061
0.727TyrMet: 0.727 ± 0.026
1.895TyrAsn: 1.895 ± 0.05
1.608TyrPro: 1.608 ± 0.044
1.701TyrGln: 1.701 ± 0.047
1.888TyrArg: 1.888 ± 0.047
2.531TyrSer: 2.531 ± 0.052
1.864TyrThr: 1.864 ± 0.045
2.007TyrVal: 2.007 ± 0.045
0.495TyrTrp: 0.495 ± 0.022
1.58TyrTyr: 1.58 ± 0.041
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3115 proteins (1052852 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski