Amino acid dipepetide frequency for Hucho hucho (huchen)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.213AlaAla: 5.213 ± 0.019
1.304AlaCys: 1.304 ± 0.005
2.959AlaAsp: 2.959 ± 0.009
4.245AlaGlu: 4.245 ± 0.013
2.351AlaPhe: 2.351 ± 0.008
4.158AlaGly: 4.158 ± 0.013
1.465AlaHis: 1.465 ± 0.006
2.83AlaIle: 2.83 ± 0.007
3.389AlaLys: 3.389 ± 0.012
6.332AlaLeu: 6.332 ± 0.015
1.715AlaMet: 1.715 ± 0.006
2.194AlaAsn: 2.194 ± 0.007
3.245AlaPro: 3.245 ± 0.011
2.845AlaGln: 2.845 ± 0.01
3.14AlaArg: 3.14 ± 0.009
4.921AlaSer: 4.921 ± 0.013
3.481AlaThr: 3.481 ± 0.01
4.659AlaVal: 4.659 ± 0.011
0.657AlaTrp: 0.657 ± 0.004
1.569AlaTyr: 1.569 ± 0.006
0.001AlaXaa: 0.001 ± 0.0
Cys
1.157CysAla: 1.157 ± 0.006
0.657CysCys: 0.657 ± 0.005
1.177CysAsp: 1.177 ± 0.008
1.248CysGlu: 1.248 ± 0.008
0.943CysPhe: 0.943 ± 0.005
1.676CysGly: 1.676 ± 0.01
0.682CysHis: 0.682 ± 0.004
1.065CysIle: 1.065 ± 0.006
1.156CysLys: 1.156 ± 0.006
2.307CysLeu: 2.307 ± 0.009
0.53CysMet: 0.53 ± 0.004
0.865CysAsn: 0.865 ± 0.005
1.303CysPro: 1.303 ± 0.008
1.032CysGln: 1.032 ± 0.006
1.251CysArg: 1.251 ± 0.006
2.153CysSer: 2.153 ± 0.01
1.232CysThr: 1.232 ± 0.007
1.826CysVal: 1.826 ± 0.01
0.292CysTrp: 0.292 ± 0.003
0.661CysTyr: 0.661 ± 0.005
0.0CysXaa: 0.0 ± 0.0
Asp
2.762AspAla: 2.762 ± 0.008
1.206AspCys: 1.206 ± 0.007
2.985AspAsp: 2.985 ± 0.011
3.553AspGlu: 3.553 ± 0.011
2.106AspPhe: 2.106 ± 0.007
3.549AspGly: 3.549 ± 0.012
1.26AspHis: 1.26 ± 0.006
2.777AspIle: 2.777 ± 0.009
2.789AspLys: 2.789 ± 0.009
5.085AspLeu: 5.085 ± 0.011
1.405AspMet: 1.405 ± 0.006
2.118AspAsn: 2.118 ± 0.009
2.913AspPro: 2.913 ± 0.009
2.047AspGln: 2.047 ± 0.006
2.776AspArg: 2.776 ± 0.009
4.212AspSer: 4.212 ± 0.011
2.802AspThr: 2.802 ± 0.008
3.202AspVal: 3.202 ± 0.01
0.715AspTrp: 0.715 ± 0.004
1.635AspTyr: 1.635 ± 0.007
0.001AspXaa: 0.001 ± 0.0
Glu
4.468GluAla: 4.468 ± 0.014
1.272GluCys: 1.272 ± 0.008
4.319GluAsp: 4.319 ± 0.011
7.573GluGlu: 7.573 ± 0.028
1.95GluPhe: 1.95 ± 0.008
4.473GluGly: 4.473 ± 0.012
1.46GluHis: 1.46 ± 0.006
2.835GluIle: 2.835 ± 0.009
4.65GluLys: 4.65 ± 0.017
6.076GluLeu: 6.076 ± 0.017
1.816GluMet: 1.816 ± 0.008
2.664GluAsn: 2.664 ± 0.008
2.67GluPro: 2.67 ± 0.01
2.953GluGln: 2.953 ± 0.012
4.462GluArg: 4.462 ± 0.018
4.07GluSer: 4.07 ± 0.01
3.35GluThr: 3.35 ± 0.01
4.442GluVal: 4.442 ± 0.011
0.742GluTrp: 0.742 ± 0.004
1.645GluTyr: 1.645 ± 0.007
0.001GluXaa: 0.001 ± 0.0
Phe
1.9PheAla: 1.9 ± 0.007
0.955PheCys: 0.955 ± 0.005
1.794PheAsp: 1.794 ± 0.007
1.874PheGlu: 1.874 ± 0.007
1.614PhePhe: 1.614 ± 0.008
2.261PheGly: 2.261 ± 0.009
1.051PheHis: 1.051 ± 0.005
1.95PheIle: 1.95 ± 0.008
1.818PheLys: 1.818 ± 0.007
3.893PheLeu: 3.893 ± 0.013
0.853PheMet: 0.853 ± 0.005
1.578PheAsn: 1.578 ± 0.006
1.817PhePro: 1.817 ± 0.006
1.682PheGln: 1.682 ± 0.005
1.845PheArg: 1.845 ± 0.007
3.381PheSer: 3.381 ± 0.009
2.328PheThr: 2.328 ± 0.008
2.235PheVal: 2.235 ± 0.008
0.473PheTrp: 0.473 ± 0.004
1.241PheTyr: 1.241 ± 0.006
0.0PheXaa: 0.0 ± 0.0
Gly
3.985GlyAla: 3.985 ± 0.013
1.318GlyCys: 1.318 ± 0.007
3.415GlyAsp: 3.415 ± 0.01
4.443GlyGlu: 4.443 ± 0.014
2.413GlyPhe: 2.413 ± 0.009
5.752GlyGly: 5.752 ± 0.025
1.758GlyHis: 1.758 ± 0.008
2.653GlyIle: 2.653 ± 0.009
3.86GlyLys: 3.86 ± 0.011
5.64GlyLeu: 5.64 ± 0.014
1.697GlyMet: 1.697 ± 0.007
2.554GlyAsn: 2.554 ± 0.008
3.297GlyPro: 3.297 ± 0.016
2.895GlyGln: 2.895 ± 0.009
3.684GlyArg: 3.684 ± 0.014
5.501GlySer: 5.501 ± 0.016
3.592GlyThr: 3.592 ± 0.01
4.212GlyVal: 4.212 ± 0.012
0.815GlyTrp: 0.815 ± 0.005
1.971GlyTyr: 1.971 ± 0.009
0.001GlyXaa: 0.001 ± 0.0
His
1.319HisAla: 1.319 ± 0.006
0.841HisCys: 0.841 ± 0.005
1.03HisAsp: 1.03 ± 0.005
1.215HisGlu: 1.215 ± 0.006
1.07HisPhe: 1.07 ± 0.005
1.62HisGly: 1.62 ± 0.007
1.09HisHis: 1.09 ± 0.007
1.386HisIle: 1.386 ± 0.006
1.302HisLys: 1.302 ± 0.006
2.754HisLeu: 2.754 ± 0.009
0.705HisMet: 0.705 ± 0.005
1.084HisAsn: 1.084 ± 0.005
1.651HisPro: 1.651 ± 0.007
1.39HisGln: 1.39 ± 0.008
1.646HisArg: 1.646 ± 0.006
2.441HisSer: 2.441 ± 0.007
2.041HisThr: 2.041 ± 0.011
1.476HisVal: 1.476 ± 0.005
0.341HisTrp: 0.341 ± 0.003
0.913HisTyr: 0.913 ± 0.005
0.001HisXaa: 0.001 ± 0.0
Ile
2.597IleAla: 2.597 ± 0.008
1.08IleCys: 1.08 ± 0.006
2.227IleAsp: 2.227 ± 0.008
2.51IleGlu: 2.51 ± 0.007
1.817IlePhe: 1.817 ± 0.008
2.447IleGly: 2.447 ± 0.008
1.404IleHis: 1.404 ± 0.006
2.442IleIle: 2.442 ± 0.008
2.582IleLys: 2.582 ± 0.008
4.424IleLeu: 4.424 ± 0.012
1.161IleMet: 1.161 ± 0.006
2.003IleAsn: 2.003 ± 0.006
2.501IlePro: 2.501 ± 0.008
2.194IleGln: 2.194 ± 0.007
2.443IleArg: 2.443 ± 0.007
3.623IleSer: 3.623 ± 0.009
2.906IleThr: 2.906 ± 0.008
2.719IleVal: 2.719 ± 0.009
0.494IleTrp: 0.494 ± 0.004
1.437IleTyr: 1.437 ± 0.008
0.001IleXaa: 0.001 ± 0.0
Lys
3.72LysAla: 3.72 ± 0.011
1.068LysCys: 1.068 ± 0.006
3.205LysAsp: 3.205 ± 0.009
4.641LysGlu: 4.641 ± 0.016
1.635LysPhe: 1.635 ± 0.007
3.332LysGly: 3.332 ± 0.01
1.48LysHis: 1.48 ± 0.006
2.542LysIle: 2.542 ± 0.008
4.245LysLys: 4.245 ± 0.017
4.928LysLeu: 4.928 ± 0.013
1.556LysMet: 1.556 ± 0.006
2.265LysAsn: 2.265 ± 0.007
2.908LysPro: 2.908 ± 0.011
2.462LysGln: 2.462 ± 0.008
3.426LysArg: 3.426 ± 0.009
3.67LysSer: 3.67 ± 0.012
3.217LysThr: 3.217 ± 0.011
3.678LysVal: 3.678 ± 0.011
0.604LysTrp: 0.604 ± 0.004
1.553LysTyr: 1.553 ± 0.006
0.001LysXaa: 0.001 ± 0.0
Leu
6.021LeuAla: 6.021 ± 0.013
2.347LeuCys: 2.347 ± 0.008
4.907LeuAsp: 4.907 ± 0.011
6.444LeuGlu: 6.444 ± 0.017
3.583LeuPhe: 3.583 ± 0.011
5.448LeuGly: 5.448 ± 0.013
2.749LeuHis: 2.749 ± 0.009
3.836LeuIle: 3.836 ± 0.009
5.644LeuLys: 5.644 ± 0.014
9.864LeuLeu: 9.864 ± 0.022
2.172LeuMet: 2.172 ± 0.008
3.687LeuAsn: 3.687 ± 0.01
5.2LeuPro: 5.2 ± 0.014
5.264LeuGln: 5.264 ± 0.017
5.552LeuArg: 5.552 ± 0.012
8.406LeuSer: 8.406 ± 0.019
5.42LeuThr: 5.42 ± 0.012
5.623LeuVal: 5.623 ± 0.013
1.105LeuTrp: 1.105 ± 0.005
2.743LeuTyr: 2.743 ± 0.009
0.002LeuXaa: 0.002 ± 0.0
Met
2.049MetAla: 2.049 ± 0.007
0.542MetCys: 0.542 ± 0.004
1.579MetAsp: 1.579 ± 0.006
2.145MetGlu: 2.145 ± 0.008
0.913MetPhe: 0.913 ± 0.005
1.651MetGly: 1.651 ± 0.007
0.536MetHis: 0.536 ± 0.004
0.939MetIle: 0.939 ± 0.005
1.51MetLys: 1.51 ± 0.006
2.177MetLeu: 2.177 ± 0.007
0.749MetMet: 0.749 ± 0.005
0.959MetAsn: 0.959 ± 0.005
1.148MetPro: 1.148 ± 0.006
1.022MetGln: 1.022 ± 0.005
1.267MetArg: 1.267 ± 0.006
1.912MetSer: 1.912 ± 0.008
1.353MetThr: 1.353 ± 0.006
1.744MetVal: 1.744 ± 0.007
0.276MetTrp: 0.276 ± 0.003
0.716MetTyr: 0.716 ± 0.004
0.0MetXaa: 0.0 ± 0.0
Asn
2.165AsnAla: 2.165 ± 0.008
0.89AsnCys: 0.89 ± 0.006
1.775AsnAsp: 1.775 ± 0.007
2.06AsnGlu: 2.06 ± 0.007
1.39AsnPhe: 1.39 ± 0.005
2.835AsnGly: 2.835 ± 0.011
1.059AsnHis: 1.059 ± 0.005
2.2AsnIle: 2.2 ± 0.007
2.208AsnLys: 2.208 ± 0.008
3.681AsnLeu: 3.681 ± 0.01
1.127AsnMet: 1.127 ± 0.005
1.851AsnAsn: 1.851 ± 0.008
2.313AsnPro: 2.313 ± 0.008
1.724AsnGln: 1.724 ± 0.006
2.016AsnArg: 2.016 ± 0.007
3.126AsnSer: 3.126 ± 0.009
2.427AsnThr: 2.427 ± 0.008
2.421AsnVal: 2.421 ± 0.008
0.471AsnTrp: 0.471 ± 0.003
1.192AsnTyr: 1.192 ± 0.005
0.001AsnXaa: 0.001 ± 0.0
Pro
3.676ProAla: 3.676 ± 0.013
1.103ProCys: 1.103 ± 0.007
2.839ProAsp: 2.839 ± 0.009
3.638ProGlu: 3.638 ± 0.009
1.819ProPhe: 1.819 ± 0.008
4.062ProGly: 4.062 ± 0.02
1.533ProHis: 1.533 ± 0.008
2.009ProIle: 2.009 ± 0.007
2.514ProLys: 2.514 ± 0.009
4.905ProLeu: 4.905 ± 0.013
1.171ProMet: 1.171 ± 0.006
1.955ProAsn: 1.955 ± 0.007
4.974ProPro: 4.974 ± 0.021
2.601ProGln: 2.601 ± 0.011
2.718ProArg: 2.718 ± 0.009
5.3ProSer: 5.3 ± 0.017
3.178ProThr: 3.178 ± 0.012
3.602ProVal: 3.602 ± 0.012
0.569ProTrp: 0.569 ± 0.004
1.559ProTyr: 1.559 ± 0.007
0.002ProXaa: 0.002 ± 0.0
Gln
3.184GlnAla: 3.184 ± 0.012
1.117GlnCys: 1.117 ± 0.007
2.325GlnAsp: 2.325 ± 0.008
3.39GlnGlu: 3.39 ± 0.011
1.399GlnPhe: 1.399 ± 0.005
2.993GlnGly: 2.993 ± 0.012
1.389GlnHis: 1.389 ± 0.007
1.9GlnIle: 1.9 ± 0.007
2.403GlnLys: 2.403 ± 0.009
4.433GlnLeu: 4.433 ± 0.014
1.142GlnMet: 1.142 ± 0.005
1.694GlnAsn: 1.694 ± 0.006
2.471GlnPro: 2.471 ± 0.011
3.091GlnGln: 3.091 ± 0.024
3.087GlnArg: 3.087 ± 0.01
3.322GlnSer: 3.322 ± 0.01
2.501GlnThr: 2.501 ± 0.007
2.843GlnVal: 2.843 ± 0.009
0.559GlnTrp: 0.559 ± 0.004
1.319GlnTyr: 1.319 ± 0.006
0.001GlnXaa: 0.001 ± 0.0
Arg
3.362ArgAla: 3.362 ± 0.008
1.226ArgCys: 1.226 ± 0.007
3.01ArgAsp: 3.01 ± 0.01
4.076ArgGlu: 4.076 ± 0.014
1.942ArgPhe: 1.942 ± 0.007
3.63ArgGly: 3.63 ± 0.012
1.585ArgHis: 1.585 ± 0.007
2.447ArgIle: 2.447 ± 0.008
3.488ArgLys: 3.488 ± 0.01
5.26ArgLeu: 5.26 ± 0.012
1.401ArgMet: 1.401 ± 0.006
2.162ArgAsn: 2.162 ± 0.007
2.868ArgPro: 2.868 ± 0.01
2.66ArgGln: 2.66 ± 0.01
4.123ArgArg: 4.123 ± 0.014
4.155ArgSer: 4.155 ± 0.012
2.975ArgThr: 2.975 ± 0.009
3.45ArgVal: 3.45 ± 0.009
0.66ArgTrp: 0.66 ± 0.004
1.594ArgTyr: 1.594 ± 0.006
0.001ArgXaa: 0.001 ± 0.0
Ser
4.93SerAla: 4.93 ± 0.011
1.883SerCys: 1.883 ± 0.01
4.051SerAsp: 4.051 ± 0.011
4.564SerGlu: 4.564 ± 0.012
3.078SerPhe: 3.078 ± 0.009
5.399SerGly: 5.399 ± 0.014
2.262SerHis: 2.262 ± 0.008
3.4SerIle: 3.4 ± 0.009
3.805SerLys: 3.805 ± 0.01
8.466SerLeu: 8.466 ± 0.021
1.896SerMet: 1.896 ± 0.007
2.931SerAsn: 2.931 ± 0.009
5.651SerPro: 5.651 ± 0.02
3.733SerGln: 3.733 ± 0.01
4.176SerArg: 4.176 ± 0.012
8.833SerSer: 8.833 ± 0.028
4.731SerThr: 4.731 ± 0.014
5.297SerVal: 5.297 ± 0.012
0.945SerTrp: 0.945 ± 0.005
2.167SerTyr: 2.167 ± 0.007
0.002SerXaa: 0.002 ± 0.0
Thr
3.906ThrAla: 3.906 ± 0.009
1.374ThrCys: 1.374 ± 0.008
2.934ThrAsp: 2.934 ± 0.009
3.699ThrGlu: 3.699 ± 0.01
2.151ThrPhe: 2.151 ± 0.008
3.98ThrGly: 3.98 ± 0.01
1.776ThrHis: 1.776 ± 0.01
2.613ThrIle: 2.613 ± 0.008
2.785ThrLys: 2.785 ± 0.01
5.61ThrLeu: 5.61 ± 0.014
1.365ThrMet: 1.365 ± 0.005
2.005ThrAsn: 2.005 ± 0.007
3.687ThrPro: 3.687 ± 0.012
2.438ThrGln: 2.438 ± 0.007
2.583ThrArg: 2.583 ± 0.009
4.602ThrSer: 4.602 ± 0.013
3.63ThrThr: 3.63 ± 0.02
4.315ThrVal: 4.315 ± 0.012
0.683ThrTrp: 0.683 ± 0.004
1.537ThrTyr: 1.537 ± 0.006
0.001ThrXaa: 0.001 ± 0.0
Val
4.124ValAla: 4.124 ± 0.01
1.975ValCys: 1.975 ± 0.012
3.279ValAsp: 3.279 ± 0.009
4.186ValGlu: 4.186 ± 0.011
2.694ValPhe: 2.694 ± 0.009
3.717ValGly: 3.717 ± 0.01
1.63ValHis: 1.63 ± 0.006
3.08ValIle: 3.08 ± 0.009
3.737ValLys: 3.737 ± 0.01
6.213ValLeu: 6.213 ± 0.014
1.616ValMet: 1.616 ± 0.006
2.581ValAsn: 2.581 ± 0.007
3.256ValPro: 3.256 ± 0.01
2.74ValGln: 2.74 ± 0.009
3.362ValArg: 3.362 ± 0.01
5.191ValSer: 5.191 ± 0.011
4.039ValThr: 4.039 ± 0.012
4.566ValVal: 4.566 ± 0.011
0.825ValTrp: 0.825 ± 0.005
1.93ValTyr: 1.93 ± 0.008
0.001ValXaa: 0.001 ± 0.0
Trp
0.689TrpAla: 0.689 ± 0.004
0.268TrpCys: 0.268 ± 0.003
0.652TrpAsp: 0.652 ± 0.004
0.767TrpGlu: 0.767 ± 0.004
0.458TrpPhe: 0.458 ± 0.003
0.689TrpGly: 0.689 ± 0.004
0.282TrpHis: 0.282 ± 0.002
0.553TrpIle: 0.553 ± 0.004
0.724TrpLys: 0.724 ± 0.005
1.172TrpLeu: 1.172 ± 0.006
0.37TrpMet: 0.37 ± 0.003
0.536TrpAsn: 0.536 ± 0.004
0.462TrpPro: 0.462 ± 0.003
0.486TrpGln: 0.486 ± 0.003
0.771TrpArg: 0.771 ± 0.005
0.936TrpSer: 0.936 ± 0.005
0.7TrpThr: 0.7 ± 0.005
0.753TrpVal: 0.753 ± 0.004
0.204TrpTrp: 0.204 ± 0.002
0.36TrpTyr: 0.36 ± 0.003
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.444TyrAla: 1.444 ± 0.006
0.788TyrCys: 0.788 ± 0.005
1.436TyrAsp: 1.436 ± 0.007
1.619TyrGlu: 1.619 ± 0.007
1.223TyrPhe: 1.223 ± 0.006
1.783TyrGly: 1.783 ± 0.007
0.863TyrHis: 0.863 ± 0.006
1.553TyrIle: 1.553 ± 0.008
1.524TyrLys: 1.524 ± 0.007
2.797TyrLeu: 2.797 ± 0.009
0.737TyrMet: 0.737 ± 0.004
1.244TyrAsn: 1.244 ± 0.006
1.388TyrPro: 1.388 ± 0.006
1.306TyrGln: 1.306 ± 0.006
1.745TyrArg: 1.745 ± 0.007
2.445TyrSer: 2.445 ± 0.01
1.769TyrThr: 1.769 ± 0.008
1.677TyrVal: 1.677 ± 0.007
0.403TyrTrp: 0.403 ± 0.004
1.034TyrTyr: 1.034 ± 0.007
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.002XaaLeu: 0.002 ± 0.0
0.003XaaMet: 0.003 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.002XaaPro: 0.002 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.12XaaXaa: 0.12 ± 0.02
Statistics based on 90503 proteins (47467216 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski