Amino acid dipeptide frequency for Polaribacter sp. Hel1_33_78

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
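As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries) and that any residue outside the 20 standard ones is mapped to Xaa; the page does not state either detail. The function name dipeptide_permille and the two toy sequences are hypothetical.

```python
from collections import Counter

# One-letter codes in the same order as the table columns (Ala, Cys, Asp, ...).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein: positions (0,1), (1,2), ...
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the proteome.
freqs = dipeptide_permille(["MKKLLA", "MKLA"])
```

With real data, the input would be the protein sequences of the proteome, and the resulting values would sum to 1000‰.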

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
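The expected value and the over/under labelling can be sketched in a few lines of Python. This is only an illustration: the helper names expected_permille and classify are hypothetical, and the comparison against the uniform expectation is an assumption about how the red/blue marking is decided. The three observed values are copied from the table.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

expected = expected_permille(20)      # 400 dipeptides: 2.5 per mille (0.25 %)
expected_xaa = expected_permille(21)  # 441 dipeptides: about 2.27 per mille

# Three values taken from the table (per mille).
observed = {"LysLys": 8.695, "AlaAla": 4.14, "CysCys": 0.099}
labels = {name: classify(value, expected) for name, value in observed.items()}
```

Under this assumption LysLys and AlaAla fall above the expectation and CysCys below it.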

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
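For readers who prefer fractions or percentages, the unit conversion is a one-liner each way. The sketch below uses hypothetical helper names and the AlaAla value from the table as an example.

```python
import math

def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def permille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 4.14  # AlaAla frequency from the table, in per mille
fraction = permille_to_fraction(ala_ala)  # roughly 0.00414
percent = permille_to_percent(ala_ala)    # roughly 0.414
```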

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.14 ± 0.1
AlaCys: 0.503 ± 0.029
AlaAsp: 3.218 ± 0.091
AlaGlu: 3.62 ± 0.068
AlaPhe: 3.179 ± 0.064
AlaGly: 3.998 ± 0.079
AlaHis: 1.032 ± 0.036
AlaIle: 5.68 ± 0.082
AlaLys: 4.95 ± 0.094
AlaLeu: 5.51 ± 0.092
AlaMet: 1.406 ± 0.044
AlaAsn: 3.494 ± 0.092
AlaPro: 1.656 ± 0.048
AlaGln: 2.087 ± 0.053
AlaArg: 1.843 ± 0.048
AlaSer: 4.229 ± 0.077
AlaThr: 3.606 ± 0.091
AlaVal: 3.891 ± 0.068
AlaTrp: 0.569 ± 0.029
AlaTyr: 2.102 ± 0.049
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.458 ± 0.039
CysCys: 0.099 ± 0.011
CysAsp: 0.408 ± 0.036
CysGlu: 0.412 ± 0.021
CysPhe: 0.396 ± 0.022
CysGly: 0.582 ± 0.03
CysHis: 0.157 ± 0.014
CysIle: 0.58 ± 0.029
CysLys: 0.531 ± 0.027
CysLeu: 0.586 ± 0.028
CysMet: 0.13 ± 0.01
CysAsn: 0.427 ± 0.023
CysPro: 0.275 ± 0.019
CysGln: 0.203 ± 0.014
CysArg: 0.174 ± 0.013
CysSer: 0.531 ± 0.025
CysThr: 0.406 ± 0.024
CysVal: 0.392 ± 0.022
CysTrp: 0.066 ± 0.007
CysTyr: 0.236 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.559 ± 0.074
AspCys: 0.393 ± 0.03
AspAsp: 2.737 ± 0.083
AspGlu: 3.539 ± 0.063
AspPhe: 3.733 ± 0.07
AspGly: 3.518 ± 0.116
AspHis: 0.764 ± 0.029
AspIle: 4.576 ± 0.077
AspLys: 4.316 ± 0.085
AspLeu: 5.179 ± 0.074
AspMet: 0.932 ± 0.035
AspAsn: 3.168 ± 0.075
AspPro: 1.396 ± 0.047
AspGln: 1.243 ± 0.039
AspArg: 1.637 ± 0.038
AspSer: 3.065 ± 0.064
AspThr: 2.75 ± 0.071
AspVal: 3.717 ± 0.114
AspTrp: 0.722 ± 0.028
AspTyr: 2.459 ± 0.049
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.883 ± 0.079
GluCys: 0.276 ± 0.017
GluAsp: 3.376 ± 0.066
GluGlu: 4.628 ± 0.094
GluPhe: 3.058 ± 0.064
GluGly: 3.479 ± 0.064
GluHis: 1.011 ± 0.033
GluIle: 6.173 ± 0.086
GluLys: 6.334 ± 0.105
GluLeu: 5.875 ± 0.092
GluMet: 1.564 ± 0.042
GluAsn: 5.321 ± 0.086
GluPro: 1.317 ± 0.043
GluGln: 1.953 ± 0.046
GluArg: 2.202 ± 0.058
GluSer: 3.254 ± 0.055
GluThr: 3.73 ± 0.061
GluVal: 4.124 ± 0.066
GluTrp: 0.574 ± 0.026
GluTyr: 2.326 ± 0.05
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.876 ± 0.057
PheCys: 0.43 ± 0.021
PheAsp: 3.386 ± 0.064
PheGlu: 3.257 ± 0.065
PhePhe: 2.992 ± 0.08
PheGly: 3.921 ± 0.065
PheHis: 0.854 ± 0.028
PheIle: 4.52 ± 0.081
PheLys: 4.414 ± 0.082
PheLeu: 5.216 ± 0.115
PheMet: 1.127 ± 0.035
PheAsn: 3.561 ± 0.072
PhePro: 1.751 ± 0.046
PheGln: 1.603 ± 0.045
PheArg: 1.555 ± 0.045
PheSer: 4.54 ± 0.08
PheThr: 3.286 ± 0.064
PheVal: 3.087 ± 0.07
PheTrp: 0.585 ± 0.024
PheTyr: 2.314 ± 0.05
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.151 ± 0.089
GlyCys: 0.529 ± 0.027
GlyAsp: 3.22 ± 0.08
GlyGlu: 3.367 ± 0.075
GlyPhe: 3.915 ± 0.084
GlyGly: 4.737 ± 0.161
GlyHis: 1.08 ± 0.033
GlyIle: 5.576 ± 0.074
GlyLys: 5.232 ± 0.09
GlyLeu: 5.543 ± 0.08
GlyMet: 1.51 ± 0.047
GlyAsn: 3.938 ± 0.085
GlyPro: 1.22 ± 0.033
GlyGln: 1.662 ± 0.051
GlyArg: 1.993 ± 0.053
GlySer: 4.045 ± 0.125
GlyThr: 3.914 ± 0.121
GlyVal: 4.271 ± 0.089
GlyTrp: 0.729 ± 0.032
GlyTyr: 2.56 ± 0.058
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.901 ± 0.032
HisCys: 0.147 ± 0.011
HisAsp: 0.725 ± 0.025
HisGlu: 0.899 ± 0.036
HisPhe: 1.151 ± 0.035
HisGly: 0.996 ± 0.03
HisHis: 0.482 ± 0.028
HisIle: 1.46 ± 0.044
HisLys: 1.407 ± 0.039
HisLeu: 1.637 ± 0.048
HisMet: 0.281 ± 0.018
HisAsn: 0.945 ± 0.031
HisPro: 0.846 ± 0.03
HisGln: 0.766 ± 0.03
HisArg: 0.631 ± 0.025
HisSer: 0.976 ± 0.033
HisThr: 0.931 ± 0.03
HisVal: 0.8 ± 0.033
HisTrp: 0.201 ± 0.017
HisTyr: 0.742 ± 0.03
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.823 ± 0.08
IleCys: 0.64 ± 0.029
IleAsp: 5.191 ± 0.074
IleGlu: 5.777 ± 0.095
IlePhe: 4.176 ± 0.084
IleGly: 5.336 ± 0.091
IleHis: 1.493 ± 0.04
IleIle: 7.319 ± 0.111
IleLys: 7.055 ± 0.098
IleLeu: 7.752 ± 0.113
IleMet: 1.449 ± 0.046
IleAsn: 5.593 ± 0.1
IlePro: 3.155 ± 0.063
IleGln: 2.676 ± 0.054
IleArg: 2.617 ± 0.05
IleSer: 6.444 ± 0.078
IleThr: 5.297 ± 0.101
IleVal: 4.878 ± 0.072
IleTrp: 0.738 ± 0.029
IleTyr: 3.006 ± 0.064
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.765 ± 0.077
LysCys: 0.348 ± 0.021
LysAsp: 4.573 ± 0.068
LysGlu: 7.378 ± 0.129
LysPhe: 3.411 ± 0.066
LysGly: 4.959 ± 0.08
LysHis: 1.449 ± 0.043
LysIle: 7.758 ± 0.108
LysLys: 8.695 ± 0.147
LysLeu: 7.024 ± 0.097
LysMet: 2.215 ± 0.056
LysAsn: 6.688 ± 0.095
LysPro: 2.382 ± 0.055
LysGln: 2.843 ± 0.057
LysArg: 3.044 ± 0.068
LysSer: 4.978 ± 0.088
LysThr: 5.159 ± 0.078
LysVal: 5.183 ± 0.088
LysTrp: 0.808 ± 0.037
LysTyr: 3.299 ± 0.071
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.402 ± 0.1
LeuCys: 0.599 ± 0.027
LeuAsp: 4.911 ± 0.081
LeuGlu: 6.256 ± 0.093
LeuPhe: 5.199 ± 0.094
LeuGly: 5.745 ± 0.094
LeuHis: 1.46 ± 0.042
LeuIle: 7.407 ± 0.128
LeuLys: 8.371 ± 0.123
LeuLeu: 8.291 ± 0.145
LeuMet: 1.872 ± 0.044
LeuAsn: 5.762 ± 0.085
LeuPro: 3.234 ± 0.061
LeuGln: 3.057 ± 0.069
LeuArg: 2.936 ± 0.066
LeuSer: 6.596 ± 0.09
LeuThr: 4.929 ± 0.082
LeuVal: 5.173 ± 0.086
LeuTrp: 0.729 ± 0.029
LeuTyr: 2.934 ± 0.064
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.416 ± 0.043
MetCys: 0.135 ± 0.012
MetAsp: 1.009 ± 0.033
MetGlu: 1.186 ± 0.039
MetPhe: 0.917 ± 0.033
MetGly: 1.277 ± 0.045
MetHis: 0.408 ± 0.019
MetIle: 1.71 ± 0.044
MetLys: 2.238 ± 0.05
MetLeu: 1.839 ± 0.05
MetMet: 0.58 ± 0.028
MetAsn: 1.392 ± 0.042
MetPro: 0.755 ± 0.031
MetGln: 0.792 ± 0.03
MetArg: 0.78 ± 0.03
MetSer: 1.396 ± 0.039
MetThr: 1.065 ± 0.033
MetVal: 1.154 ± 0.036
MetTrp: 0.157 ± 0.014
MetTyr: 0.69 ± 0.028
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.779 ± 0.073
AsnCys: 0.505 ± 0.027
AsnAsp: 3.326 ± 0.08
AsnGlu: 3.703 ± 0.064
AsnPhe: 3.575 ± 0.069
AsnGly: 4.112 ± 0.121
AsnHis: 1.163 ± 0.039
AsnIle: 5.615 ± 0.085
AsnLys: 5.421 ± 0.089
AsnLeu: 5.972 ± 0.096
AsnMet: 1.268 ± 0.039
AsnAsn: 4.523 ± 0.117
AsnPro: 2.82 ± 0.052
AsnGln: 2.421 ± 0.054
AsnArg: 2.122 ± 0.052
AsnSer: 4.509 ± 0.087
AsnThr: 3.946 ± 0.115
AsnVal: 3.522 ± 0.063
AsnTrp: 0.847 ± 0.03
AsnTyr: 3.037 ± 0.057
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.745 ± 0.069
ProCys: 0.195 ± 0.016
ProAsp: 1.693 ± 0.049
ProGlu: 2.355 ± 0.048
ProPhe: 1.895 ± 0.046
ProGly: 1.668 ± 0.05
ProHis: 0.565 ± 0.025
ProIle: 2.723 ± 0.061
ProLys: 2.752 ± 0.056
ProLeu: 2.689 ± 0.055
ProMet: 0.657 ± 0.027
ProAsn: 2.27 ± 0.055
ProPro: 0.692 ± 0.027
ProGln: 0.895 ± 0.028
ProArg: 0.851 ± 0.028
ProSer: 2.041 ± 0.045
ProThr: 1.955 ± 0.052
ProVal: 1.934 ± 0.058
ProTrp: 0.353 ± 0.022
ProTyr: 1.185 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.622 ± 0.041
GlnCys: 0.139 ± 0.012
GlnAsp: 1.493 ± 0.039
GlnGlu: 2.316 ± 0.054
GlnPhe: 1.773 ± 0.044
GlnGly: 1.65 ± 0.047
GlnHis: 0.562 ± 0.028
GlnIle: 2.749 ± 0.061
GlnLys: 3.164 ± 0.053
GlnLeu: 3.109 ± 0.058
GlnMet: 0.724 ± 0.025
GlnAsn: 2.223 ± 0.048
GlnPro: 0.897 ± 0.035
GlnGln: 1.376 ± 0.044
GlnArg: 1.102 ± 0.039
GlnSer: 1.758 ± 0.039
GlnThr: 1.835 ± 0.061
GlnVal: 1.752 ± 0.04
GlnTrp: 0.295 ± 0.017
GlnTyr: 1.208 ± 0.034
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.96 ± 0.051
ArgCys: 0.176 ± 0.013
ArgAsp: 1.578 ± 0.046
ArgGlu: 2.054 ± 0.048
ArgPhe: 1.804 ± 0.054
ArgGly: 1.885 ± 0.045
ArgHis: 0.549 ± 0.026
ArgIle: 2.85 ± 0.056
ArgLys: 2.968 ± 0.057
ArgLeu: 2.878 ± 0.058
ArgMet: 0.804 ± 0.031
ArgAsn: 2.13 ± 0.05
ArgPro: 0.935 ± 0.031
ArgGln: 0.959 ± 0.03
ArgArg: 1.187 ± 0.039
ArgSer: 1.725 ± 0.044
ArgThr: 1.74 ± 0.046
ArgVal: 1.938 ± 0.043
ArgTrp: 0.307 ± 0.018
ArgTyr: 1.34 ± 0.047
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.753 ± 0.068
SerCys: 0.65 ± 0.028
SerAsp: 3.512 ± 0.072
SerGlu: 4.262 ± 0.071
SerPhe: 4.241 ± 0.079
SerGly: 4.702 ± 0.128
SerHis: 1.011 ± 0.035
SerIle: 5.862 ± 0.101
SerLys: 5.772 ± 0.08
SerLeu: 6.182 ± 0.106
SerMet: 1.278 ± 0.033
SerAsn: 4.196 ± 0.083
SerPro: 1.907 ± 0.047
SerGln: 1.948 ± 0.046
SerArg: 1.854 ± 0.05
SerSer: 4.593 ± 0.087
SerThr: 3.502 ± 0.077
SerVal: 3.82 ± 0.072
SerTrp: 0.795 ± 0.031
SerTyr: 2.765 ± 0.058
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.779 ± 0.108
ThrCys: 0.345 ± 0.022
ThrAsp: 3.148 ± 0.097
ThrGlu: 3.383 ± 0.067
ThrPhe: 3.337 ± 0.062
ThrGly: 3.872 ± 0.092
ThrHis: 0.952 ± 0.037
ThrIle: 5.457 ± 0.113
ThrLys: 4.547 ± 0.066
ThrLeu: 5.153 ± 0.078
ThrMet: 0.929 ± 0.033
ThrAsn: 3.599 ± 0.083
ThrPro: 2.372 ± 0.11
ThrGln: 1.646 ± 0.044
ThrArg: 1.59 ± 0.037
ThrSer: 4.099 ± 0.096
ThrThr: 3.509 ± 0.143
ThrVal: 3.538 ± 0.087
ThrTrp: 0.633 ± 0.037
ThrTyr: 2.204 ± 0.067
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.991 ± 0.076
ValCys: 0.523 ± 0.037
ValAsp: 3.334 ± 0.061
ValGlu: 3.46 ± 0.066
ValPhe: 3.454 ± 0.071
ValGly: 3.736 ± 0.065
ValHis: 1.006 ± 0.031
ValIle: 4.845 ± 0.086
ValLys: 4.558 ± 0.073
ValLeu: 5.889 ± 0.086
ValMet: 1.182 ± 0.039
ValAsn: 3.542 ± 0.065
ValPro: 1.875 ± 0.044
ValGln: 1.67 ± 0.044
ValArg: 1.811 ± 0.047
ValSer: 4.524 ± 0.082
ValThr: 3.553 ± 0.137
ValVal: 3.826 ± 0.084
ValTrp: 0.611 ± 0.029
ValTyr: 2.12 ± 0.049
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.54 ± 0.028
TrpCys: 0.097 ± 0.01
TrpAsp: 0.581 ± 0.027
TrpGlu: 0.614 ± 0.023
TrpPhe: 0.611 ± 0.026
TrpGly: 0.667 ± 0.029
TrpHis: 0.198 ± 0.015
TrpIle: 0.793 ± 0.03
TrpLys: 0.854 ± 0.036
TrpLeu: 0.932 ± 0.036
TrpMet: 0.312 ± 0.021
TrpAsn: 0.731 ± 0.029
TrpPro: 0.21 ± 0.017
TrpGln: 0.382 ± 0.025
TrpArg: 0.407 ± 0.023
TrpSer: 0.737 ± 0.045
TrpThr: 0.519 ± 0.026
TrpVal: 0.563 ± 0.029
TrpTrp: 0.12 ± 0.011
TrpTyr: 0.419 ± 0.025
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.171 ± 0.05
TyrCys: 0.319 ± 0.021
TyrAsp: 1.983 ± 0.05
TyrGlu: 1.994 ± 0.052
TyrPhe: 2.458 ± 0.066
TyrGly: 2.352 ± 0.058
TyrHis: 0.728 ± 0.029
TyrIle: 2.761 ± 0.058
TyrLys: 3.38 ± 0.063
TyrLeu: 3.608 ± 0.067
TyrMet: 0.675 ± 0.028
TyrAsn: 2.684 ± 0.052
TyrPro: 1.419 ± 0.035
TyrGln: 1.565 ± 0.049
TyrArg: 1.431 ± 0.036
TyrSer: 2.564 ± 0.058
TyrThr: 2.459 ± 0.094
TyrVal: 1.949 ± 0.047
TyrTrp: 0.446 ± 0.024
TyrTyr: 1.6 ± 0.048
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2741 proteins (952477 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
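A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is not the database's own code: it assumes that whole proteins are resampled with replacement, that dipeptides are counted as overlapping pairs within each protein, and that the reported error is the standard deviation across resamples. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Standard deviation of a dipeptide's per-mille frequency across
    `rounds` resamples of whole proteins (drawn with replacement)."""
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(rounds):
        # Resample proteins, not individual residues or dipeptides.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

# Toy input, not taken from the proteome.
toy = ["MKKLLA", "MKLA", "AAKK", "KKKK"]
error_kk = bootstrap_error(toy, "KK")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the spread reflects variation between proteins.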

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski