Amino acid dipepetide frequency for Cryobacterium sp. Hh4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.094AlaAla: 20.094 ± 0.239
0.689AlaCys: 0.689 ± 0.029
7.646AlaAsp: 7.646 ± 0.111
7.642AlaGlu: 7.642 ± 0.108
3.815AlaPhe: 3.815 ± 0.072
13.015AlaGly: 13.015 ± 0.127
2.332AlaHis: 2.332 ± 0.06
5.788AlaIle: 5.788 ± 0.071
2.796AlaLys: 2.796 ± 0.054
13.763AlaLeu: 13.763 ± 0.155
2.596AlaMet: 2.596 ± 0.049
2.566AlaAsn: 2.566 ± 0.053
6.0AlaPro: 6.0 ± 0.103
3.571AlaGln: 3.571 ± 0.06
9.028AlaArg: 9.028 ± 0.11
7.111AlaSer: 7.111 ± 0.088
7.581AlaThr: 7.581 ± 0.087
11.217AlaVal: 11.217 ± 0.126
1.802AlaTrp: 1.802 ± 0.044
2.219AlaTyr: 2.219 ± 0.054
0.001AlaXaa: 0.001 ± 0.001
Cys
0.688CysAla: 0.688 ± 0.03
0.058CysCys: 0.058 ± 0.009
0.332CysAsp: 0.332 ± 0.019
0.273CysGlu: 0.273 ± 0.017
0.189CysPhe: 0.189 ± 0.014
0.635CysGly: 0.635 ± 0.024
0.119CysHis: 0.119 ± 0.011
0.21CysIle: 0.21 ± 0.014
0.09CysLys: 0.09 ± 0.009
0.507CysLeu: 0.507 ± 0.023
0.09CysMet: 0.09 ± 0.008
0.132CysAsn: 0.132 ± 0.014
0.328CysPro: 0.328 ± 0.022
0.142CysGln: 0.142 ± 0.011
0.384CysArg: 0.384 ± 0.018
0.37CysSer: 0.37 ± 0.023
0.355CysThr: 0.355 ± 0.022
0.466CysVal: 0.466 ± 0.024
0.086CysTrp: 0.086 ± 0.009
0.121CysTyr: 0.121 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
7.883AspAla: 7.883 ± 0.099
0.292AspCys: 0.292 ± 0.019
3.541AspAsp: 3.541 ± 0.068
3.657AspGlu: 3.657 ± 0.059
1.929AspPhe: 1.929 ± 0.048
5.302AspGly: 5.302 ± 0.08
1.222AspHis: 1.222 ± 0.04
2.371AspIle: 2.371 ± 0.052
1.175AspLys: 1.175 ± 0.042
6.347AspLeu: 6.347 ± 0.099
0.771AspMet: 0.771 ± 0.029
1.12AspAsn: 1.12 ± 0.037
3.868AspPro: 3.868 ± 0.065
1.712AspGln: 1.712 ± 0.039
4.315AspArg: 4.315 ± 0.075
2.873AspSer: 2.873 ± 0.058
3.066AspThr: 3.066 ± 0.062
4.842AspVal: 4.842 ± 0.07
0.983AspTrp: 0.983 ± 0.031
1.409AspTyr: 1.409 ± 0.044
0.0AspXaa: 0.0 ± 0.0
Glu
6.307GluAla: 6.307 ± 0.099
0.251GluCys: 0.251 ± 0.016
2.357GluAsp: 2.357 ± 0.061
2.597GluGlu: 2.597 ± 0.06
1.8GluPhe: 1.8 ± 0.041
3.395GluGly: 3.395 ± 0.068
1.362GluHis: 1.362 ± 0.041
2.728GluIle: 2.728 ± 0.052
1.467GluLys: 1.467 ± 0.049
6.355GluLeu: 6.355 ± 0.089
0.937GluMet: 0.937 ± 0.03
1.298GluAsn: 1.298 ± 0.033
2.891GluPro: 2.891 ± 0.059
1.896GluGln: 1.896 ± 0.05
4.354GluArg: 4.354 ± 0.078
2.975GluSer: 2.975 ± 0.058
3.056GluThr: 3.056 ± 0.054
4.25GluVal: 4.25 ± 0.07
0.738GluTrp: 0.738 ± 0.029
1.095GluTyr: 1.095 ± 0.03
0.0GluXaa: 0.0 ± 0.0
Phe
4.148PheAla: 4.148 ± 0.072
0.195PheCys: 0.195 ± 0.015
2.382PheAsp: 2.382 ± 0.053
1.765PheGlu: 1.765 ± 0.043
1.108PhePhe: 1.108 ± 0.038
3.579PheGly: 3.579 ± 0.066
0.593PheHis: 0.593 ± 0.022
1.335PheIle: 1.335 ± 0.043
0.578PheLys: 0.578 ± 0.026
3.05PheLeu: 3.05 ± 0.062
0.478PheMet: 0.478 ± 0.025
0.782PheAsn: 0.782 ± 0.032
1.558PhePro: 1.558 ± 0.039
0.827PheGln: 0.827 ± 0.03
1.831PheArg: 1.831 ± 0.04
1.94PheSer: 1.94 ± 0.046
2.213PheThr: 2.213 ± 0.044
2.848PheVal: 2.848 ± 0.059
0.481PheTrp: 0.481 ± 0.025
0.753PheTyr: 0.753 ± 0.033
0.0PheXaa: 0.0 ± 0.0
Gly
10.572GlyAla: 10.572 ± 0.122
0.615GlyCys: 0.615 ± 0.028
4.746GlyAsp: 4.746 ± 0.073
4.47GlyGlu: 4.47 ± 0.08
3.32GlyPhe: 3.32 ± 0.061
7.59GlyGly: 7.59 ± 0.113
1.967GlyHis: 1.967 ± 0.047
4.992GlyIle: 4.992 ± 0.08
2.4GlyLys: 2.4 ± 0.055
9.574GlyLeu: 9.574 ± 0.119
1.906GlyMet: 1.906 ± 0.046
1.971GlyAsn: 1.971 ± 0.048
4.027GlyPro: 4.027 ± 0.067
2.757GlyGln: 2.757 ± 0.052
6.468GlyArg: 6.468 ± 0.088
5.781GlySer: 5.781 ± 0.084
5.816GlyThr: 5.816 ± 0.09
7.709GlyVal: 7.709 ± 0.088
1.58GlyTrp: 1.58 ± 0.044
2.256GlyTyr: 2.256 ± 0.05
0.0GlyXaa: 0.0 ± 0.0
His
2.196HisAla: 2.196 ± 0.053
0.153HisCys: 0.153 ± 0.013
1.214HisAsp: 1.214 ± 0.034
1.069HisGlu: 1.069 ± 0.041
0.559HisPhe: 0.559 ± 0.025
2.009HisGly: 2.009 ± 0.047
0.54HisHis: 0.54 ± 0.028
0.716HisIle: 0.716 ± 0.03
0.342HisLys: 0.342 ± 0.021
2.094HisLeu: 2.094 ± 0.059
0.283HisMet: 0.283 ± 0.017
0.44HisAsn: 0.44 ± 0.02
1.512HisPro: 1.512 ± 0.039
0.577HisGln: 0.577 ± 0.024
1.544HisArg: 1.544 ± 0.04
1.123HisSer: 1.123 ± 0.036
1.056HisThr: 1.056 ± 0.033
1.52HisVal: 1.52 ± 0.044
0.279HisTrp: 0.279 ± 0.017
0.432HisTyr: 0.432 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
6.46IleAla: 6.46 ± 0.083
0.272IleCys: 0.272 ± 0.015
3.535IleAsp: 3.535 ± 0.069
2.782IleGlu: 2.782 ± 0.063
1.371IlePhe: 1.371 ± 0.043
4.741IleGly: 4.741 ± 0.081
0.736IleHis: 0.736 ± 0.027
2.005IleIle: 2.005 ± 0.049
0.97IleLys: 0.97 ± 0.037
4.232IleLeu: 4.232 ± 0.08
0.702IleMet: 0.702 ± 0.028
1.122IleAsn: 1.122 ± 0.036
2.527IlePro: 2.527 ± 0.051
1.087IleGln: 1.087 ± 0.033
3.01IleArg: 3.01 ± 0.05
2.517IleSer: 2.517 ± 0.05
2.983IleThr: 2.983 ± 0.057
4.713IleVal: 4.713 ± 0.077
0.478IleTrp: 0.478 ± 0.021
0.839IleTyr: 0.839 ± 0.032
0.002IleXaa: 0.002 ± 0.001
Lys
2.82LysAla: 2.82 ± 0.064
0.09LysCys: 0.09 ± 0.01
1.163LysAsp: 1.163 ± 0.037
0.981LysGlu: 0.981 ± 0.037
0.614LysPhe: 0.614 ± 0.026
1.708LysGly: 1.708 ± 0.055
0.443LysHis: 0.443 ± 0.021
1.137LysIle: 1.137 ± 0.037
0.89LysLys: 0.89 ± 0.035
2.123LysLeu: 2.123 ± 0.049
0.484LysMet: 0.484 ± 0.021
0.659LysAsn: 0.659 ± 0.026
1.361LysPro: 1.361 ± 0.043
0.744LysGln: 0.744 ± 0.028
1.688LysArg: 1.688 ± 0.044
1.389LysSer: 1.389 ± 0.042
1.561LysThr: 1.561 ± 0.044
1.877LysVal: 1.877 ± 0.048
0.29LysTrp: 0.29 ± 0.02
0.482LysTyr: 0.482 ± 0.026
0.0LysXaa: 0.0 ± 0.0
Leu
15.43LeuAla: 15.43 ± 0.162
0.58LeuCys: 0.58 ± 0.023
6.537LeuAsp: 6.537 ± 0.08
4.995LeuGlu: 4.995 ± 0.079
3.076LeuPhe: 3.076 ± 0.066
9.766LeuGly: 9.766 ± 0.108
2.036LeuHis: 2.036 ± 0.051
4.832LeuIle: 4.832 ± 0.079
2.058LeuLys: 2.058 ± 0.057
10.849LeuLeu: 10.849 ± 0.152
1.734LeuMet: 1.734 ± 0.044
2.182LeuAsn: 2.182 ± 0.051
5.676LeuPro: 5.676 ± 0.082
2.432LeuGln: 2.432 ± 0.051
7.416LeuArg: 7.416 ± 0.1
6.187LeuSer: 6.187 ± 0.071
6.715LeuThr: 6.715 ± 0.083
9.643LeuVal: 9.643 ± 0.124
1.23LeuTrp: 1.23 ± 0.039
1.754LeuTyr: 1.754 ± 0.045
0.0LeuXaa: 0.0 ± 0.0
Met
2.172MetAla: 2.172 ± 0.053
0.095MetCys: 0.095 ± 0.009
0.81MetAsp: 0.81 ± 0.029
0.648MetGlu: 0.648 ± 0.026
0.544MetPhe: 0.544 ± 0.025
1.314MetGly: 1.314 ± 0.044
0.33MetHis: 0.33 ± 0.019
0.952MetIle: 0.952 ± 0.032
0.55MetLys: 0.55 ± 0.023
1.921MetLeu: 1.921 ± 0.044
0.343MetMet: 0.343 ± 0.023
0.535MetAsn: 0.535 ± 0.021
1.054MetPro: 1.054 ± 0.034
0.529MetGln: 0.529 ± 0.02
1.222MetArg: 1.222 ± 0.033
1.445MetSer: 1.445 ± 0.032
1.645MetThr: 1.645 ± 0.039
1.485MetVal: 1.485 ± 0.039
0.195MetTrp: 0.195 ± 0.015
0.273MetTyr: 0.273 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
2.61AsnAla: 2.61 ± 0.047
0.132AsnCys: 0.132 ± 0.011
1.291AsnAsp: 1.291 ± 0.043
1.059AsnGlu: 1.059 ± 0.036
0.762AsnPhe: 0.762 ± 0.032
2.135AsnGly: 2.135 ± 0.053
0.398AsnHis: 0.398 ± 0.021
0.974AsnIle: 0.974 ± 0.033
0.567AsnLys: 0.567 ± 0.025
2.287AsnLeu: 2.287 ± 0.049
0.363AsnMet: 0.363 ± 0.02
0.589AsnAsn: 0.589 ± 0.025
1.755AsnPro: 1.755 ± 0.041
0.765AsnGln: 0.765 ± 0.045
1.506AsnArg: 1.506 ± 0.039
1.219AsnSer: 1.219 ± 0.032
1.333AsnThr: 1.333 ± 0.038
1.765AsnVal: 1.765 ± 0.043
0.35AsnTrp: 0.35 ± 0.019
0.546AsnTyr: 0.546 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
7.559ProAla: 7.559 ± 0.104
0.224ProCys: 0.224 ± 0.015
3.691ProAsp: 3.691 ± 0.067
3.407ProGlu: 3.407 ± 0.068
1.688ProPhe: 1.688 ± 0.041
5.494ProGly: 5.494 ± 0.094
0.998ProHis: 0.998 ± 0.034
2.257ProIle: 2.257 ± 0.041
1.136ProLys: 1.136 ± 0.04
4.901ProLeu: 4.901 ± 0.079
0.962ProMet: 0.962 ± 0.032
1.197ProAsn: 1.197 ± 0.038
2.232ProPro: 2.232 ± 0.061
1.386ProGln: 1.386 ± 0.036
3.282ProArg: 3.282 ± 0.07
3.088ProSer: 3.088 ± 0.061
3.361ProThr: 3.361 ± 0.071
5.057ProVal: 5.057 ± 0.079
0.776ProTrp: 0.776 ± 0.024
0.971ProTyr: 0.971 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
3.847GlnAla: 3.847 ± 0.075
0.126GlnCys: 0.126 ± 0.014
1.313GlnAsp: 1.313 ± 0.031
1.182GlnGlu: 1.182 ± 0.038
0.899GlnPhe: 0.899 ± 0.033
2.158GlnGly: 2.158 ± 0.051
0.535GlnHis: 0.535 ± 0.023
1.507GlnIle: 1.507 ± 0.04
0.706GlnLys: 0.706 ± 0.028
3.177GlnLeu: 3.177 ± 0.062
0.52GlnMet: 0.52 ± 0.023
0.669GlnAsn: 0.669 ± 0.042
1.477GlnPro: 1.477 ± 0.037
1.017GlnGln: 1.017 ± 0.032
2.143GlnArg: 2.143 ± 0.055
1.592GlnSer: 1.592 ± 0.044
1.569GlnThr: 1.569 ± 0.042
2.651GlnVal: 2.651 ± 0.048
0.391GlnTrp: 0.391 ± 0.018
0.568GlnTyr: 0.568 ± 0.027
0.001GlnXaa: 0.001 ± 0.001
Arg
8.272ArgAla: 8.272 ± 0.107
0.333ArgCys: 0.333 ± 0.019
3.959ArgAsp: 3.959 ± 0.062
3.79ArgGlu: 3.79 ± 0.061
2.507ArgPhe: 2.507 ± 0.051
5.248ArgGly: 5.248 ± 0.078
1.564ArgHis: 1.564 ± 0.043
3.581ArgIle: 3.581 ± 0.065
1.651ArgLys: 1.651 ± 0.044
7.859ArgLeu: 7.859 ± 0.1
1.655ArgMet: 1.655 ± 0.042
1.509ArgAsn: 1.509 ± 0.043
3.756ArgPro: 3.756 ± 0.076
2.161ArgGln: 2.161 ± 0.047
5.982ArgArg: 5.982 ± 0.107
4.39ArgSer: 4.39 ± 0.069
4.102ArgThr: 4.102 ± 0.051
5.665ArgVal: 5.665 ± 0.086
1.046ArgTrp: 1.046 ± 0.034
1.545ArgTyr: 1.545 ± 0.04
0.0ArgXaa: 0.0 ± 0.0
Ser
7.237SerAla: 7.237 ± 0.089
0.334SerCys: 0.334 ± 0.019
2.945SerAsp: 2.945 ± 0.059
2.626SerGlu: 2.626 ± 0.05
2.083SerPhe: 2.083 ± 0.043
5.969SerGly: 5.969 ± 0.088
1.061SerHis: 1.061 ± 0.034
2.813SerIle: 2.813 ± 0.053
1.233SerLys: 1.233 ± 0.043
5.948SerLeu: 5.948 ± 0.076
1.201SerMet: 1.201 ± 0.038
1.235SerAsn: 1.235 ± 0.036
3.293SerPro: 3.293 ± 0.06
1.539SerGln: 1.539 ± 0.039
4.011SerArg: 4.011 ± 0.066
3.421SerSer: 3.421 ± 0.063
3.905SerThr: 3.905 ± 0.073
5.242SerVal: 5.242 ± 0.074
0.864SerTrp: 0.864 ± 0.027
1.323SerTyr: 1.323 ± 0.044
0.0SerXaa: 0.0 ± 0.0
Thr
7.854ThrAla: 7.854 ± 0.099
0.317ThrCys: 0.317 ± 0.02
3.836ThrAsp: 3.836 ± 0.065
3.194ThrGlu: 3.194 ± 0.063
1.871ThrPhe: 1.871 ± 0.047
6.344ThrGly: 6.344 ± 0.081
1.115ThrHis: 1.115 ± 0.038
3.035ThrIle: 3.035 ± 0.058
1.29ThrLys: 1.29 ± 0.04
6.196ThrLeu: 6.196 ± 0.079
1.075ThrMet: 1.075 ± 0.035
1.384ThrAsn: 1.384 ± 0.038
3.821ThrPro: 3.821 ± 0.077
1.451ThrGln: 1.451 ± 0.04
4.006ThrArg: 4.006 ± 0.063
3.401ThrSer: 3.401 ± 0.062
3.822ThrThr: 3.822 ± 0.081
6.137ThrVal: 6.137 ± 0.089
0.807ThrTrp: 0.807 ± 0.033
1.055ThrTyr: 1.055 ± 0.032
0.0ThrXaa: 0.0 ± 0.0
Val
11.289ValAla: 11.289 ± 0.119
0.579ValCys: 0.579 ± 0.025
5.409ValAsp: 5.409 ± 0.071
4.39ValGlu: 4.39 ± 0.078
2.995ValPhe: 2.995 ± 0.064
7.288ValGly: 7.288 ± 0.104
1.658ValHis: 1.658 ± 0.041
4.49ValIle: 4.49 ± 0.07
1.86ValLys: 1.86 ± 0.051
9.672ValLeu: 9.672 ± 0.126
1.418ValMet: 1.418 ± 0.038
2.088ValAsn: 2.088 ± 0.047
4.71ValPro: 4.71 ± 0.08
2.225ValGln: 2.225 ± 0.04
5.647ValArg: 5.647 ± 0.077
5.333ValSer: 5.333 ± 0.084
5.871ValThr: 5.871 ± 0.08
8.204ValVal: 8.204 ± 0.106
1.074ValTrp: 1.074 ± 0.031
1.634ValTyr: 1.634 ± 0.039
0.001ValXaa: 0.001 ± 0.001
Trp
1.579TrpAla: 1.579 ± 0.044
0.095TrpCys: 0.095 ± 0.01
0.716TrpAsp: 0.716 ± 0.03
0.53TrpGlu: 0.53 ± 0.024
0.548TrpPhe: 0.548 ± 0.023
0.948TrpGly: 0.948 ± 0.034
0.307TrpHis: 0.307 ± 0.016
0.663TrpIle: 0.663 ± 0.024
0.343TrpLys: 0.343 ± 0.018
1.84TrpLeu: 1.84 ± 0.046
0.287TrpMet: 0.287 ± 0.018
0.434TrpAsn: 0.434 ± 0.023
0.717TrpPro: 0.717 ± 0.029
0.604TrpGln: 0.604 ± 0.028
1.082TrpArg: 1.082 ± 0.036
0.848TrpSer: 0.848 ± 0.035
0.866TrpThr: 0.866 ± 0.032
1.044TrpVal: 1.044 ± 0.032
0.316TrpTrp: 0.316 ± 0.018
0.301TrpTyr: 0.301 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.255TyrAla: 2.255 ± 0.059
0.148TyrCys: 0.148 ± 0.012
1.254TyrAsp: 1.254 ± 0.044
1.051TyrGlu: 1.051 ± 0.037
0.764TyrPhe: 0.764 ± 0.034
1.877TyrGly: 1.877 ± 0.048
0.344TyrHis: 0.344 ± 0.021
0.695TyrIle: 0.695 ± 0.029
0.402TyrLys: 0.402 ± 0.023
2.434TyrLeu: 2.434 ± 0.057
0.258TyrMet: 0.258 ± 0.016
0.495TyrAsn: 0.495 ± 0.025
1.1TyrPro: 1.1 ± 0.038
0.629TyrGln: 0.629 ± 0.026
1.655TyrArg: 1.655 ± 0.046
1.282TyrSer: 1.282 ± 0.04
1.132TyrThr: 1.132 ± 0.037
1.498TyrVal: 1.498 ± 0.044
0.304TyrTrp: 0.304 ± 0.019
0.487TyrTyr: 0.487 ± 0.024
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.001
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.001
0.001XaaPro: 0.001 ± 0.001
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.001XaaSer: 0.001 ± 0.001
0.001XaaThr: 0.001 ± 0.001
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3130 proteins (988464 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski