Amino acid dipeptide frequency for Clostridium sp. HMP27

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
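As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the pooling of counts over all proteins, and the toy sequences are assumptions made for this example; the page does not state the exact counting procedure used by the database.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of overlapping dipeptides pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Count every pair of adjacent residues; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale to per mille so the numbers are comparable with the table's units.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: two short, made-up sequences in one-letter code.
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

With real data the input would be the protein sequences of the proteome, for example read from a FASTA file.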

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
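The uniform expectation quoted above is simply 1/400, which is 0.25% or 2.5‰ in the units of the table. The sketch below makes that arithmetic explicit and compares a few table values against it. The helper names and the strict comparison without any tolerance are assumptions for illustration; the page's actual colouring rule may differ in detail.

```python
def expected_permille(n_amino_acids=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_amino_acids ** 2)


def classify(observed_permille, expected):
    """Label a dipeptide as over- or under-represented relative to the expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


expected = expected_permille()  # 20 standard amino acids -> 400 dipeptides

# A few values taken from the table on this page (in per mille).
observed = {"IleIle": 9.53, "AlaAsp": 2.599, "TrpTrp": 0.077}
for pair, value in observed.items():
    print(pair, value, classify(value, expected))
```

Passing `n_amino_acids=21` gives the expectation when Xaa is counted as a 21st amino acid (1/441).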

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
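For readers who prefer fractions or percentages, the unit conversion can be written out as below. The function names are hypothetical and only illustrate the arithmetic.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 4.756  # AlaAla value from the table, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```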

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is the frequency in ‰ ± estimated error.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.756 ± 0.095 | 0.786 ± 0.033 | 2.599 ± 0.046 | 3.845 ± 0.068 | 2.742 ± 0.058 | 4.146 ± 0.078 | 0.917 ± 0.031 | 5.984 ± 0.085 | 4.645 ± 0.079 | 6.771 ± 0.095 | 1.776 ± 0.049 | 2.508 ± 0.055 | 1.641 ± 0.038 | 1.61 ± 0.04 | 2.011 ± 0.043 | 3.692 ± 0.071 | 2.747 ± 0.066 | 4.582 ± 0.075 | 0.401 ± 0.022 | 2.191 ± 0.041 | 0.0 ± 0.0 |
| Cys | 0.658 ± 0.032 | 0.219 ± 0.015 | 0.658 ± 0.029 | 0.741 ± 0.029 | 0.506 ± 0.021 | 1.174 ± 0.034 | 0.226 ± 0.013 | 1.19 ± 0.035 | 0.923 ± 0.032 | 0.872 ± 0.027 | 0.307 ± 0.018 | 0.655 ± 0.026 | 0.53 ± 0.027 | 0.244 ± 0.016 | 0.429 ± 0.02 | 0.818 ± 0.037 | 0.613 ± 0.026 | 0.631 ± 0.022 | 0.068 ± 0.009 | 0.432 ± 0.021 | 0.0 ± 0.0 |
| Asp | 2.821 ± 0.057 | 0.59 ± 0.025 | 2.288 ± 0.046 | 4.434 ± 0.073 | 2.542 ± 0.054 | 3.226 ± 0.065 | 0.571 ± 0.026 | 6.159 ± 0.083 | 5.251 ± 0.083 | 4.622 ± 0.071 | 1.558 ± 0.041 | 2.891 ± 0.056 | 1.289 ± 0.033 | 0.778 ± 0.03 | 1.809 ± 0.045 | 2.801 ± 0.051 | 2.496 ± 0.041 | 3.385 ± 0.054 | 0.343 ± 0.018 | 2.414 ± 0.046 | 0.0 ± 0.0 |
| Glu | 4.51 ± 0.066 | 0.759 ± 0.029 | 4.354 ± 0.071 | 7.307 ± 0.115 | 2.956 ± 0.048 | 4.417 ± 0.069 | 1.004 ± 0.033 | 6.897 ± 0.096 | 7.379 ± 0.101 | 6.769 ± 0.087 | 2.066 ± 0.047 | 5.07 ± 0.076 | 1.433 ± 0.038 | 1.851 ± 0.041 | 2.793 ± 0.062 | 3.626 ± 0.066 | 2.972 ± 0.061 | 5.348 ± 0.079 | 0.437 ± 0.021 | 2.872 ± 0.058 | 0.0 ± 0.0 |
| Phe | 2.389 ± 0.056 | 0.52 ± 0.023 | 2.353 ± 0.057 | 2.609 ± 0.052 | 1.959 ± 0.047 | 2.968 ± 0.057 | 0.598 ± 0.025 | 4.74 ± 0.087 | 3.824 ± 0.065 | 3.923 ± 0.071 | 1.218 ± 0.038 | 2.743 ± 0.054 | 1.266 ± 0.034 | 1.081 ± 0.035 | 1.185 ± 0.031 | 3.063 ± 0.052 | 2.319 ± 0.045 | 2.649 ± 0.051 | 0.352 ± 0.018 | 1.746 ± 0.039 | 0.0 ± 0.0 |
| Gly | 4.463 ± 0.094 | 1.011 ± 0.037 | 3.17 ± 0.063 | 4.539 ± 0.077 | 3.167 ± 0.051 | 4.572 ± 0.088 | 1.048 ± 0.036 | 6.936 ± 0.094 | 5.626 ± 0.078 | 5.734 ± 0.085 | 1.974 ± 0.054 | 3.278 ± 0.058 | 1.374 ± 0.038 | 1.52 ± 0.039 | 2.403 ± 0.055 | 3.844 ± 0.071 | 3.676 ± 0.06 | 4.972 ± 0.079 | 0.495 ± 0.024 | 2.924 ± 0.057 | 0.0 ± 0.0 |
| His | 0.771 ± 0.031 | 0.207 ± 0.016 | 0.728 ± 0.029 | 0.957 ± 0.032 | 0.621 ± 0.025 | 1.026 ± 0.032 | 0.28 ± 0.018 | 1.381 ± 0.04 | 1.137 ± 0.033 | 1.211 ± 0.035 | 0.429 ± 0.02 | 0.813 ± 0.029 | 0.695 ± 0.026 | 0.358 ± 0.018 | 0.525 ± 0.024 | 0.899 ± 0.031 | 0.698 ± 0.027 | 0.821 ± 0.025 | 0.097 ± 0.01 | 0.554 ± 0.027 | 0.0 ± 0.0 |
| Ile | 6.036 ± 0.095 | 1.215 ± 0.036 | 5.5 ± 0.087 | 7.097 ± 0.092 | 4.175 ± 0.075 | 6.422 ± 0.099 | 1.326 ± 0.04 | 9.53 ± 0.129 | 8.44 ± 0.101 | 8.942 ± 0.107 | 2.58 ± 0.055 | 5.981 ± 0.089 | 3.495 ± 0.063 | 2.3 ± 0.04 | 3.14 ± 0.058 | 6.901 ± 0.089 | 5.151 ± 0.075 | 6.212 ± 0.083 | 0.553 ± 0.025 | 3.71 ± 0.071 | 0.0 ± 0.0 |
| Lys | 5.011 ± 0.084 | 0.852 ± 0.037 | 5.729 ± 0.078 | 8.335 ± 0.109 | 3.136 ± 0.053 | 5.29 ± 0.079 | 1.161 ± 0.034 | 7.716 ± 0.079 | 7.263 ± 0.096 | 7.521 ± 0.097 | 2.4 ± 0.044 | 5.735 ± 0.076 | 2.132 ± 0.046 | 2.12 ± 0.049 | 3.137 ± 0.059 | 5.167 ± 0.074 | 3.922 ± 0.057 | 6.117 ± 0.088 | 0.624 ± 0.025 | 3.767 ± 0.064 | 0.0 ± 0.0 |
| Leu | 5.477 ± 0.085 | 1.055 ± 0.031 | 4.803 ± 0.065 | 6.194 ± 0.089 | 3.802 ± 0.071 | 6.493 ± 0.092 | 1.145 ± 0.033 | 8.426 ± 0.097 | 8.443 ± 0.116 | 7.871 ± 0.101 | 2.523 ± 0.051 | 5.722 ± 0.087 | 2.908 ± 0.056 | 2.416 ± 0.05 | 3.3 ± 0.059 | 6.842 ± 0.098 | 4.669 ± 0.073 | 5.358 ± 0.074 | 0.59 ± 0.027 | 3.145 ± 0.064 | 0.0 ± 0.0 |
| Met | 1.984 ± 0.044 | 0.311 ± 0.017 | 1.654 ± 0.036 | 2.082 ± 0.046 | 1.138 ± 0.031 | 2.019 ± 0.049 | 0.382 ± 0.024 | 2.309 ± 0.056 | 2.523 ± 0.05 | 2.488 ± 0.042 | 0.785 ± 0.028 | 1.705 ± 0.039 | 1.036 ± 0.033 | 0.754 ± 0.028 | 0.883 ± 0.029 | 1.757 ± 0.042 | 1.27 ± 0.037 | 1.726 ± 0.048 | 0.157 ± 0.012 | 0.896 ± 0.028 | 0.0 ± 0.0 |
| Asn | 3.025 ± 0.057 | 0.661 ± 0.026 | 2.566 ± 0.046 | 4.081 ± 0.055 | 2.478 ± 0.047 | 3.415 ± 0.067 | 0.773 ± 0.029 | 6.773 ± 0.099 | 5.736 ± 0.089 | 5.488 ± 0.089 | 1.672 ± 0.042 | 3.86 ± 0.081 | 2.173 ± 0.047 | 1.268 ± 0.038 | 1.803 ± 0.042 | 3.702 ± 0.066 | 2.868 ± 0.055 | 3.458 ± 0.059 | 0.411 ± 0.02 | 2.431 ± 0.051 | 0.0 ± 0.0 |
| Pro | 1.599 ± 0.044 | 0.36 ± 0.02 | 1.448 ± 0.039 | 2.437 ± 0.05 | 1.409 ± 0.037 | 1.974 ± 0.049 | 0.536 ± 0.024 | 2.789 ± 0.05 | 2.214 ± 0.05 | 2.728 ± 0.053 | 0.826 ± 0.026 | 1.457 ± 0.041 | 0.681 ± 0.024 | 0.819 ± 0.031 | 0.914 ± 0.029 | 1.907 ± 0.046 | 1.498 ± 0.042 | 2.261 ± 0.049 | 0.244 ± 0.016 | 1.342 ± 0.035 | 0.0 ± 0.0 |
| Gln | 1.567 ± 0.046 | 0.286 ± 0.016 | 1.302 ± 0.038 | 1.801 ± 0.044 | 0.942 ± 0.031 | 1.712 ± 0.044 | 0.354 ± 0.021 | 2.151 ± 0.044 | 1.996 ± 0.044 | 2.029 ± 0.056 | 0.755 ± 0.028 | 1.459 ± 0.038 | 0.62 ± 0.027 | 0.723 ± 0.03 | 1.024 ± 0.033 | 1.482 ± 0.041 | 1.062 ± 0.035 | 1.617 ± 0.041 | 0.257 ± 0.014 | 0.99 ± 0.031 | 0.0 ± 0.0 |
| Arg | 2.051 ± 0.049 | 0.437 ± 0.019 | 1.858 ± 0.048 | 3.137 ± 0.059 | 1.442 ± 0.037 | 2.151 ± 0.047 | 0.507 ± 0.024 | 3.128 ± 0.067 | 2.897 ± 0.049 | 2.93 ± 0.054 | 0.982 ± 0.033 | 1.924 ± 0.046 | 0.925 ± 0.03 | 0.951 ± 0.032 | 1.522 ± 0.04 | 1.635 ± 0.039 | 1.642 ± 0.043 | 2.388 ± 0.05 | 0.271 ± 0.016 | 1.457 ± 0.039 | 0.0 ± 0.0 |
| Ser | 3.322 ± 0.061 | 0.665 ± 0.028 | 2.829 ± 0.058 | 4.111 ± 0.074 | 3.134 ± 0.056 | 4.322 ± 0.063 | 0.956 ± 0.035 | 6.729 ± 0.078 | 5.561 ± 0.079 | 6.061 ± 0.084 | 1.788 ± 0.042 | 3.645 ± 0.063 | 1.777 ± 0.043 | 1.597 ± 0.046 | 2.092 ± 0.05 | 4.232 ± 0.069 | 3.229 ± 0.069 | 3.857 ± 0.058 | 0.426 ± 0.022 | 2.589 ± 0.053 | 0.0 ± 0.0 |
| Thr | 3.224 ± 0.061 | 0.513 ± 0.022 | 2.23 ± 0.052 | 3.067 ± 0.051 | 2.213 ± 0.047 | 3.839 ± 0.071 | 0.82 ± 0.029 | 4.751 ± 0.066 | 3.663 ± 0.057 | 4.994 ± 0.078 | 1.242 ± 0.036 | 2.516 ± 0.044 | 1.914 ± 0.047 | 1.1 ± 0.035 | 1.604 ± 0.043 | 3.231 ± 0.064 | 2.579 ± 0.054 | 3.467 ± 0.067 | 0.377 ± 0.02 | 1.787 ± 0.045 | 0.0 ± 0.0 |
| Val | 4.136 ± 0.069 | 0.864 ± 0.029 | 3.578 ± 0.057 | 4.705 ± 0.069 | 2.931 ± 0.059 | 4.474 ± 0.071 | 0.89 ± 0.028 | 6.519 ± 0.081 | 5.485 ± 0.077 | 6.213 ± 0.078 | 1.724 ± 0.042 | 3.643 ± 0.053 | 2.126 ± 0.049 | 1.674 ± 0.036 | 2.035 ± 0.042 | 4.318 ± 0.066 | 3.413 ± 0.063 | 4.821 ± 0.073 | 0.444 ± 0.022 | 2.405 ± 0.052 | 0.0 ± 0.0 |
| Trp | 0.409 ± 0.017 | 0.091 ± 0.01 | 0.367 ± 0.018 | 0.448 ± 0.02 | 0.316 ± 0.019 | 0.495 ± 0.028 | 0.127 ± 0.012 | 0.683 ± 0.027 | 0.559 ± 0.022 | 0.589 ± 0.024 | 0.211 ± 0.016 | 0.449 ± 0.023 | 0.184 ± 0.014 | 0.211 ± 0.016 | 0.255 ± 0.017 | 0.433 ± 0.022 | 0.316 ± 0.02 | 0.428 ± 0.021 | 0.077 ± 0.008 | 0.234 ± 0.016 | 0.0 ± 0.0 |
| Tyr | 2.141 ± 0.051 | 0.492 ± 0.023 | 2.253 ± 0.054 | 2.893 ± 0.058 | 1.894 ± 0.044 | 2.589 ± 0.05 | 0.587 ± 0.025 | 3.916 ± 0.078 | 3.574 ± 0.066 | 3.424 ± 0.061 | 1.041 ± 0.029 | 2.588 ± 0.065 | 1.244 ± 0.034 | 0.755 ± 0.024 | 1.373 ± 0.039 | 2.573 ± 0.05 | 1.989 ± 0.039 | 2.301 ± 0.047 | 0.26 ± 0.014 | 1.679 ± 0.045 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 3451 proteins (1021203 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
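One plausible reading of bootstrapping at the protein level is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is a minimal sketch under stated assumptions, not the database's actual code. In particular, the function names, the fixed random seed, and the use of the standard deviation as the reported error are assumptions.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide(sequences, pair, n_rounds=100, seed=0):
    """Estimate the per-mille frequency of `pair` and its bootstrap spread."""
    rng = random.Random(seed)  # fixed seed so the example is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the estimates.
    return mean(estimates), stdev(estimates)


# Toy input: three short, made-up sequences in one-letter code.
m, s = bootstrap_dipeptide(["MKKLA", "AKK", "GGGG"], "KK")
print(f"KK: {m:.1f} ± {s:.1f}")
```

Resampling proteins rather than individual dipeptides keeps the residues of one protein together, so correlations within a protein are reflected in the error estimate.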

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski