Amino acid dipeptide frequency for Hymenobacter daecheongensis DSM 21074

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
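The arithmetic behind the uniform expectation can be checked with a short Python sketch. The function names are hypothetical, and the simple greater-than/less-than comparison is an assumption made for illustration; the page does not state the exact rule used for the red and blue marking. The sample values are taken from the table on this page.

```python
def expected_uniform_permille(n_residue_types: int = 20) -> float:
    """Expected frequency (per mille) if all ordered residue pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_permille: float, expected_permille: float) -> str:
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


# 20 standard amino acids -> 400 dipeptides -> 2.5 per mille (0.25%) each.
expected = expected_uniform_permille()

# A few values copied from the table on this page (per mille).
sample = {"AlaAla": 15.201, "LeuLeu": 14.025, "CysCys": 0.113, "TrpTrp": 0.178}
labels = {pair: classify(value, expected) for pair, value in sample.items()}
```

With Xaa counted as a 21st residue type, `expected_uniform_permille(21)` gives roughly 2.27‰ instead of 2.5‰.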

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
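As a rough illustration of how such per mille values could be obtained, the following Python sketch counts dipeptides in a list of protein sequences. It is not the code used by Proteome-pI. It assumes that dipeptides are counted as overlapping windows of two residues, that pairs never span two proteins, and that sequences use one-letter codes (the table uses three-letter codes); the function name `dipeptide_permille` and the toy sequences are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable


def dipeptide_permille(proteins: Iterable[str]) -> Dict[str, float]:
    """Return dipeptide frequencies in per mille over all given sequences."""
    counts: Counter = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never cross protein boundaries.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter codes).
toy = ["MAAL", "AAL"]
freqs = dipeptide_permille(toy)

# Converting a per mille value back to a plain fraction: multiply by 10^-3.
fraction_aa = freqs["AA"] * 1e-3
```

In the toy input there are five dipeptides in total (MA, AA, AL, AA, AL), so AA and AL each account for 400‰ and MA for 200‰.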

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.201 ± 0.154
AlaCys: 0.77 ± 0.024
AlaAsp: 5.71 ± 0.069
AlaGlu: 5.912 ± 0.078
AlaPhe: 3.779 ± 0.063
AlaGly: 9.294 ± 0.109
AlaHis: 2.018 ± 0.046
AlaIle: 3.972 ± 0.061
AlaLys: 3.684 ± 0.079
AlaLeu: 11.585 ± 0.144
AlaMet: 1.871 ± 0.042
AlaAsn: 3.445 ± 0.071
AlaPro: 5.784 ± 0.115
AlaGln: 5.059 ± 0.069
AlaArg: 6.323 ± 0.09
AlaSer: 5.217 ± 0.08
AlaThr: 7.174 ± 0.131
AlaVal: 7.267 ± 0.089
AlaTrp: 1.163 ± 0.039
AlaTyr: 3.094 ± 0.058
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.66 ± 0.024
CysCys: 0.113 ± 0.01
CysAsp: 0.333 ± 0.017
CysGlu: 0.319 ± 0.016
CysPhe: 0.314 ± 0.015
CysGly: 0.652 ± 0.026
CysHis: 0.209 ± 0.015
CysIle: 0.326 ± 0.019
CysLys: 0.181 ± 0.013
CysLeu: 0.759 ± 0.025
CysMet: 0.111 ± 0.009
CysAsn: 0.23 ± 0.014
CysPro: 0.411 ± 0.023
CysGln: 0.306 ± 0.017
CysArg: 0.456 ± 0.018
CysSer: 0.414 ± 0.018
CysThr: 0.417 ± 0.017
CysVal: 0.44 ± 0.018
CysTrp: 0.083 ± 0.008
CysTyr: 0.271 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.009 ± 0.068
AspCys: 0.332 ± 0.017
AspAsp: 2.359 ± 0.051
AspGlu: 3.055 ± 0.057
AspPhe: 2.667 ± 0.043
AspGly: 3.731 ± 0.068
AspHis: 0.91 ± 0.03
AspIle: 2.354 ± 0.045
AspLys: 2.264 ± 0.049
AspLeu: 4.954 ± 0.069
AspMet: 0.97 ± 0.03
AspAsn: 1.885 ± 0.036
AspPro: 2.415 ± 0.04
AspGln: 1.989 ± 0.041
AspArg: 2.41 ± 0.045
AspSer: 2.537 ± 0.049
AspThr: 2.62 ± 0.048
AspVal: 3.485 ± 0.058
AspTrp: 0.675 ± 0.021
AspTyr: 2.101 ± 0.04
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.823 ± 0.085
GluCys: 0.269 ± 0.016
GluAsp: 2.105 ± 0.049
GluGlu: 3.109 ± 0.07
GluPhe: 2.103 ± 0.045
GluGly: 3.288 ± 0.054
GluHis: 1.102 ± 0.03
GluIle: 2.715 ± 0.056
GluLys: 2.862 ± 0.062
GluLeu: 6.121 ± 0.091
GluMet: 1.313 ± 0.027
GluAsn: 2.028 ± 0.041
GluPro: 2.251 ± 0.05
GluGln: 2.951 ± 0.061
GluArg: 3.168 ± 0.064
GluSer: 2.228 ± 0.043
GluThr: 2.873 ± 0.046
GluVal: 3.747 ± 0.067
GluTrp: 0.617 ± 0.024
GluTyr: 1.694 ± 0.036
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.844 ± 0.061
PheCys: 0.348 ± 0.016
PheAsp: 2.444 ± 0.048
PheGlu: 2.231 ± 0.047
PhePhe: 1.909 ± 0.043
PheGly: 3.577 ± 0.057
PheHis: 0.775 ± 0.029
PheIle: 1.745 ± 0.04
PheLys: 1.44 ± 0.039
PheLeu: 3.92 ± 0.068
PheMet: 0.755 ± 0.027
PheAsn: 1.684 ± 0.038
PhePro: 1.736 ± 0.037
PheGln: 1.597 ± 0.04
PheArg: 2.764 ± 0.055
PheSer: 2.709 ± 0.054
PheThr: 2.797 ± 0.063
PheVal: 2.946 ± 0.048
PheTrp: 0.529 ± 0.025
PheTyr: 1.5 ± 0.035
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.661 ± 0.087
GlyCys: 0.671 ± 0.026
GlyAsp: 3.036 ± 0.057
GlyGlu: 3.489 ± 0.054
GlyPhe: 3.361 ± 0.056
GlyGly: 6.241 ± 0.112
GlyHis: 1.605 ± 0.039
GlyIle: 3.825 ± 0.059
GlyLys: 3.441 ± 0.06
GlyLeu: 8.451 ± 0.097
GlyMet: 1.536 ± 0.041
GlyAsn: 2.743 ± 0.061
GlyPro: 3.189 ± 0.064
GlyGln: 3.878 ± 0.063
GlyArg: 4.951 ± 0.069
GlySer: 4.701 ± 0.092
GlyThr: 5.63 ± 0.125
GlyVal: 5.089 ± 0.065
GlyTrp: 1.044 ± 0.033
GlyTyr: 2.832 ± 0.058
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.828 ± 0.043
HisCys: 0.201 ± 0.013
HisAsp: 1.096 ± 0.033
HisGlu: 1.131 ± 0.031
HisPhe: 1.021 ± 0.024
HisGly: 1.548 ± 0.038
HisHis: 0.587 ± 0.026
HisIle: 0.976 ± 0.034
HisLys: 0.656 ± 0.025
HisLeu: 2.386 ± 0.054
HisMet: 0.318 ± 0.017
HisAsn: 0.711 ± 0.024
HisPro: 1.294 ± 0.031
HisGln: 0.919 ± 0.03
HisArg: 1.214 ± 0.038
HisSer: 0.946 ± 0.026
HisThr: 1.202 ± 0.034
HisVal: 1.169 ± 0.032
HisTrp: 0.299 ± 0.017
HisTyr: 0.863 ± 0.027
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.915 ± 0.068
IleCys: 0.408 ± 0.018
IleAsp: 2.505 ± 0.05
IleGlu: 2.586 ± 0.054
IlePhe: 1.744 ± 0.049
IleGly: 3.75 ± 0.064
IleHis: 0.811 ± 0.025
IleIle: 2.462 ± 0.061
IleLys: 1.823 ± 0.045
IleLeu: 3.936 ± 0.064
IleMet: 0.853 ± 0.024
IleAsn: 1.767 ± 0.042
IlePro: 2.183 ± 0.046
IleGln: 1.691 ± 0.036
IleArg: 2.904 ± 0.056
IleSer: 2.998 ± 0.051
IleThr: 3.016 ± 0.057
IleVal: 3.035 ± 0.048
IleTrp: 0.465 ± 0.018
IleTyr: 1.376 ± 0.034
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.201 ± 0.07
LysCys: 0.174 ± 0.014
LysAsp: 2.048 ± 0.048
LysGlu: 2.292 ± 0.056
LysPhe: 1.426 ± 0.037
LysGly: 2.725 ± 0.058
LysHis: 0.762 ± 0.028
LysIle: 1.971 ± 0.053
LysLys: 2.307 ± 0.063
LysLeu: 4.206 ± 0.071
LysMet: 1.071 ± 0.031
LysAsn: 1.603 ± 0.043
LysPro: 2.189 ± 0.047
LysGln: 1.754 ± 0.039
LysArg: 2.019 ± 0.048
LysSer: 2.107 ± 0.042
LysThr: 2.502 ± 0.05
LysVal: 2.829 ± 0.064
LysTrp: 0.393 ± 0.02
LysTyr: 1.355 ± 0.037
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.22 ± 0.139
LeuCys: 0.744 ± 0.027
LeuAsp: 5.473 ± 0.076
LeuGlu: 5.097 ± 0.079
LeuPhe: 4.117 ± 0.058
LeuGly: 7.903 ± 0.112
LeuHis: 2.391 ± 0.047
LeuIle: 4.326 ± 0.065
LeuLys: 4.063 ± 0.065
LeuLeu: 14.025 ± 0.176
LeuMet: 1.948 ± 0.045
LeuAsn: 4.289 ± 0.068
LeuPro: 6.385 ± 0.089
LeuGln: 4.133 ± 0.067
LeuArg: 8.018 ± 0.113
LeuSer: 6.596 ± 0.078
LeuThr: 7.491 ± 0.088
LeuVal: 7.211 ± 0.097
LeuTrp: 1.069 ± 0.035
LeuTyr: 3.197 ± 0.053
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.0 ± 0.048
MetCys: 0.105 ± 0.008
MetAsp: 0.787 ± 0.028
MetGlu: 0.896 ± 0.031
MetPhe: 0.585 ± 0.024
MetGly: 1.294 ± 0.039
MetHis: 0.398 ± 0.018
MetIle: 0.746 ± 0.031
MetLys: 1.159 ± 0.033
MetLeu: 2.179 ± 0.042
MetMet: 0.449 ± 0.02
MetAsn: 0.782 ± 0.027
MetPro: 1.263 ± 0.032
MetGln: 0.899 ± 0.032
MetArg: 1.262 ± 0.028
MetSer: 1.142 ± 0.029
MetThr: 1.103 ± 0.033
MetVal: 1.234 ± 0.034
MetTrp: 0.146 ± 0.009
MetTyr: 0.43 ± 0.019
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.494 ± 0.059
AsnCys: 0.256 ± 0.016
AsnAsp: 1.799 ± 0.045
AsnGlu: 1.762 ± 0.043
AsnPhe: 1.605 ± 0.045
AsnGly: 3.116 ± 0.065
AsnHis: 0.655 ± 0.023
AsnIle: 1.822 ± 0.039
AsnLys: 1.421 ± 0.038
AsnLeu: 3.85 ± 0.061
AsnMet: 0.717 ± 0.022
AsnAsn: 1.509 ± 0.053
AsnPro: 2.559 ± 0.05
AsnGln: 1.627 ± 0.038
AsnArg: 2.145 ± 0.045
AsnSer: 2.096 ± 0.043
AsnThr: 2.315 ± 0.061
AsnVal: 2.524 ± 0.05
AsnTrp: 0.441 ± 0.02
AsnTyr: 1.371 ± 0.034
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.045 ± 0.127
ProCys: 0.266 ± 0.014
ProAsp: 3.35 ± 0.057
ProGlu: 3.515 ± 0.063
ProPhe: 1.86 ± 0.039
ProGly: 4.065 ± 0.063
ProHis: 1.024 ± 0.033
ProIle: 2.022 ± 0.041
ProLys: 1.835 ± 0.043
ProLeu: 5.147 ± 0.081
ProMet: 0.789 ± 0.024
ProAsn: 2.044 ± 0.047
ProPro: 2.242 ± 0.058
ProGln: 2.024 ± 0.035
ProArg: 2.628 ± 0.058
ProSer: 2.311 ± 0.047
ProThr: 3.581 ± 0.068
ProVal: 4.124 ± 0.071
ProTrp: 0.548 ± 0.025
ProTyr: 1.565 ± 0.035
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.874 ± 0.078
GlnCys: 0.22 ± 0.012
GlnAsp: 1.879 ± 0.04
GlnGlu: 2.413 ± 0.052
GlnPhe: 1.651 ± 0.039
GlnGly: 2.821 ± 0.051
GlnHis: 1.196 ± 0.032
GlnIle: 1.868 ± 0.041
GlnLys: 1.795 ± 0.041
GlnLeu: 5.565 ± 0.074
GlnMet: 0.929 ± 0.03
GlnAsn: 1.682 ± 0.038
GlnPro: 3.011 ± 0.059
GlnGln: 3.137 ± 0.071
GlnArg: 3.144 ± 0.061
GlnSer: 2.038 ± 0.042
GlnThr: 2.51 ± 0.053
GlnVal: 3.149 ± 0.048
GlnTrp: 0.511 ± 0.022
GlnTyr: 1.508 ± 0.037
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.654 ± 0.091
ArgCys: 0.396 ± 0.019
ArgAsp: 2.639 ± 0.052
ArgGlu: 3.378 ± 0.051
ArgPhe: 2.78 ± 0.049
ArgGly: 3.727 ± 0.059
ArgHis: 1.613 ± 0.04
ArgIle: 2.966 ± 0.048
ArgLys: 2.326 ± 0.054
ArgLeu: 7.562 ± 0.103
ArgMet: 1.298 ± 0.033
ArgAsn: 2.205 ± 0.048
ArgPro: 3.494 ± 0.07
ArgGln: 3.719 ± 0.069
ArgArg: 4.778 ± 0.085
ArgSer: 2.647 ± 0.047
ArgThr: 3.646 ± 0.053
ArgVal: 4.315 ± 0.073
ArgTrp: 0.868 ± 0.027
ArgTyr: 2.5 ± 0.052
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.553 ± 0.079
SerCys: 0.469 ± 0.022
SerAsp: 2.38 ± 0.047
SerGlu: 2.39 ± 0.05
SerPhe: 2.746 ± 0.054
SerGly: 4.851 ± 0.098
SerHis: 0.971 ± 0.032
SerIle: 2.667 ± 0.053
SerLys: 2.005 ± 0.047
SerLeu: 5.713 ± 0.065
SerMet: 0.995 ± 0.034
SerAsn: 1.924 ± 0.045
SerPro: 2.85 ± 0.052
SerGln: 2.101 ± 0.044
SerArg: 3.156 ± 0.048
SerSer: 3.32 ± 0.071
SerThr: 3.413 ± 0.076
SerVal: 4.035 ± 0.079
SerTrp: 0.691 ± 0.026
SerTyr: 2.064 ± 0.046
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.152 ± 0.125
ThrCys: 0.365 ± 0.016
ThrAsp: 3.401 ± 0.048
ThrGlu: 3.016 ± 0.05
ThrPhe: 2.63 ± 0.055
ThrGly: 5.44 ± 0.104
ThrHis: 1.108 ± 0.035
ThrIle: 2.879 ± 0.056
ThrLys: 2.147 ± 0.05
ThrLeu: 6.936 ± 0.098
ThrMet: 0.99 ± 0.03
ThrAsn: 2.274 ± 0.053
ThrPro: 3.927 ± 0.057
ThrGln: 2.386 ± 0.055
ThrArg: 3.172 ± 0.053
ThrSer: 3.56 ± 0.079
ThrThr: 4.521 ± 0.103
ThrVal: 4.805 ± 0.095
ThrTrp: 0.672 ± 0.023
ThrTyr: 2.232 ± 0.054
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.914 ± 0.096
ValCys: 0.54 ± 0.023
ValAsp: 2.972 ± 0.055
ValGlu: 3.551 ± 0.059
ValPhe: 2.796 ± 0.05
ValGly: 5.089 ± 0.073
ValHis: 1.171 ± 0.036
ValIle: 2.92 ± 0.049
ValLys: 2.817 ± 0.059
ValLeu: 8.107 ± 0.089
ValMet: 1.186 ± 0.031
ValAsn: 2.417 ± 0.048
ValPro: 3.751 ± 0.057
ValGln: 3.15 ± 0.055
ValArg: 4.828 ± 0.069
ValSer: 4.137 ± 0.067
ValThr: 4.104 ± 0.095
ValVal: 5.677 ± 0.089
ValTrp: 0.759 ± 0.024
ValTyr: 2.166 ± 0.052
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.036 ± 0.031
TrpCys: 0.091 ± 0.008
TrpAsp: 0.487 ± 0.022
TrpGlu: 0.555 ± 0.02
TrpPhe: 0.485 ± 0.02
TrpGly: 0.707 ± 0.025
TrpHis: 0.304 ± 0.015
TrpIle: 0.413 ± 0.02
TrpLys: 0.447 ± 0.022
TrpLeu: 1.686 ± 0.043
TrpMet: 0.239 ± 0.013
TrpAsn: 0.46 ± 0.024
TrpPro: 0.479 ± 0.019
TrpGln: 0.82 ± 0.029
TrpArg: 0.786 ± 0.026
TrpSer: 0.606 ± 0.026
TrpThr: 0.627 ± 0.023
TrpVal: 0.76 ± 0.027
TrpTrp: 0.178 ± 0.013
TrpTyr: 0.392 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.238 ± 0.057
TyrCys: 0.27 ± 0.016
TyrAsp: 1.918 ± 0.038
TyrGlu: 1.67 ± 0.041
TyrPhe: 1.673 ± 0.036
TyrGly: 2.554 ± 0.05
TyrHis: 0.756 ± 0.024
TyrIle: 1.254 ± 0.035
TyrLys: 1.26 ± 0.038
TyrLeu: 3.607 ± 0.058
TyrMet: 0.521 ± 0.021
TyrAsn: 1.398 ± 0.035
TyrPro: 1.571 ± 0.037
TyrGln: 1.76 ± 0.045
TyrArg: 2.385 ± 0.047
TyrSer: 1.986 ± 0.046
TyrThr: 2.073 ± 0.048
TyrVal: 2.21 ± 0.045
TyrTrp: 0.407 ± 0.019
TyrTyr: 1.371 ± 0.039
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3657 proteins (1260074 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
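One way to read "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is an illustration only, not the Proteome-pI implementation. Taking the ± value to be the standard deviation of the bootstrap estimates is an assumption, and the function names, the seed, and the one-letter sequences are hypothetical.

```python
import random
import statistics
from typing import Sequence


def pair_permille(proteins: Sequence[str], pair: str) -> float:
    """Frequency (per mille) of one dipeptide over overlapping windows."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins: Sequence[str], pair: str,
                    n_rounds: int = 100, seed: int = 0) -> float:
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MAAL", "AAL", "MKKL", "GAAG"]
error_aa = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.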

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski