Amino acid dipepetide frequency for Hydrogenophaga crassostreae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.033AlaAla: 15.033 ± 0.142
1.335AlaCys: 1.335 ± 0.034
6.048AlaAsp: 6.048 ± 0.063
6.267AlaGlu: 6.267 ± 0.07
4.217AlaPhe: 4.217 ± 0.056
9.753AlaGly: 9.753 ± 0.088
2.762AlaHis: 2.762 ± 0.044
5.156AlaIle: 5.156 ± 0.067
4.138AlaLys: 4.138 ± 0.067
14.496AlaLeu: 14.496 ± 0.134
3.916AlaMet: 3.916 ± 0.062
3.197AlaAsn: 3.197 ± 0.054
5.64AlaPro: 5.64 ± 0.071
5.646AlaGln: 5.646 ± 0.071
7.433AlaArg: 7.433 ± 0.083
7.22AlaSer: 7.22 ± 0.09
6.047AlaThr: 6.047 ± 0.075
8.908AlaVal: 8.908 ± 0.1
1.965AlaTrp: 1.965 ± 0.039
2.421AlaTyr: 2.421 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
1.14CysAla: 1.14 ± 0.024
0.117CysCys: 0.117 ± 0.01
0.557CysAsp: 0.557 ± 0.019
0.557CysGlu: 0.557 ± 0.019
0.341CysPhe: 0.341 ± 0.016
0.989CysGly: 0.989 ± 0.025
0.289CysHis: 0.289 ± 0.014
0.434CysIle: 0.434 ± 0.019
0.274CysLys: 0.274 ± 0.014
0.875CysLeu: 0.875 ± 0.025
0.238CysMet: 0.238 ± 0.013
0.244CysAsn: 0.244 ± 0.014
0.503CysPro: 0.503 ± 0.019
0.297CysGln: 0.297 ± 0.015
0.505CysArg: 0.505 ± 0.02
0.542CysSer: 0.542 ± 0.021
0.503CysThr: 0.503 ± 0.02
0.754CysVal: 0.754 ± 0.023
0.129CysTrp: 0.129 ± 0.009
0.19CysTyr: 0.19 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
6.558AspAla: 6.558 ± 0.064
0.433AspCys: 0.433 ± 0.016
2.567AspAsp: 2.567 ± 0.054
3.025AspGlu: 3.025 ± 0.048
2.0AspPhe: 2.0 ± 0.041
4.545AspGly: 4.545 ± 0.09
1.221AspHis: 1.221 ± 0.032
2.298AspIle: 2.298 ± 0.041
1.736AspLys: 1.736 ± 0.04
5.385AspLeu: 5.385 ± 0.062
1.38AspMet: 1.38 ± 0.034
1.372AspAsn: 1.372 ± 0.036
2.86AspPro: 2.86 ± 0.106
1.823AspGln: 1.823 ± 0.031
3.11AspArg: 3.11 ± 0.05
2.405AspSer: 2.405 ± 0.044
2.64AspThr: 2.64 ± 0.056
3.797AspVal: 3.797 ± 0.061
1.031AspTrp: 1.031 ± 0.03
1.227AspTyr: 1.227 ± 0.035
0.0AspXaa: 0.0 ± 0.0
Glu
6.914GluAla: 6.914 ± 0.082
0.418GluCys: 0.418 ± 0.018
2.189GluAsp: 2.189 ± 0.04
2.507GluGlu: 2.507 ± 0.052
1.691GluPhe: 1.691 ± 0.041
4.078GluGly: 4.078 ± 0.05
1.358GluHis: 1.358 ± 0.031
2.515GluIle: 2.515 ± 0.054
2.1GluLys: 2.1 ± 0.047
5.69GluLeu: 5.69 ± 0.073
1.4GluMet: 1.4 ± 0.029
1.31GluAsn: 1.31 ± 0.034
2.606GluPro: 2.606 ± 0.04
2.554GluGln: 2.554 ± 0.044
4.363GluArg: 4.363 ± 0.062
2.819GluSer: 2.819 ± 0.041
2.821GluThr: 2.821 ± 0.044
4.207GluVal: 4.207 ± 0.052
0.776GluTrp: 0.776 ± 0.021
0.927GluTyr: 0.927 ± 0.027
0.0GluXaa: 0.0 ± 0.0
Phe
4.061PheAla: 4.061 ± 0.058
0.426PheCys: 0.426 ± 0.019
2.539PheAsp: 2.539 ± 0.043
2.309PheGlu: 2.309 ± 0.041
1.492PhePhe: 1.492 ± 0.037
3.266PheGly: 3.266 ± 0.052
0.791PheHis: 0.791 ± 0.024
1.477PheIle: 1.477 ± 0.033
1.304PheLys: 1.304 ± 0.035
3.111PheLeu: 3.111 ± 0.056
0.94PheMet: 0.94 ± 0.027
1.298PheAsn: 1.298 ± 0.035
1.569PhePro: 1.569 ± 0.043
1.219PheGln: 1.219 ± 0.025
1.737PheArg: 1.737 ± 0.039
2.353PheSer: 2.353 ± 0.044
2.018PheThr: 2.018 ± 0.051
2.841PheVal: 2.841 ± 0.052
0.561PheTrp: 0.561 ± 0.023
0.959PheTyr: 0.959 ± 0.028
0.0PheXaa: 0.0 ± 0.0
Gly
8.888GlyAla: 8.888 ± 0.09
0.896GlyCys: 0.896 ± 0.023
4.002GlyAsp: 4.002 ± 0.063
4.521GlyGlu: 4.521 ± 0.059
3.436GlyPhe: 3.436 ± 0.046
6.697GlyGly: 6.697 ± 0.096
1.997GlyHis: 1.997 ± 0.033
3.865GlyIle: 3.865 ± 0.052
3.443GlyLys: 3.443 ± 0.057
9.421GlyLeu: 9.421 ± 0.098
2.558GlyMet: 2.558 ± 0.046
2.259GlyAsn: 2.259 ± 0.042
3.106GlyPro: 3.106 ± 0.06
3.731GlyGln: 3.731 ± 0.051
4.686GlyArg: 4.686 ± 0.062
4.85GlySer: 4.85 ± 0.065
4.179GlyThr: 4.179 ± 0.067
6.778GlyVal: 6.778 ± 0.071
1.527GlyTrp: 1.527 ± 0.034
2.221GlyTyr: 2.221 ± 0.044
0.0GlyXaa: 0.0 ± 0.0
His
2.771HisAla: 2.771 ± 0.042
0.303HisCys: 0.303 ± 0.015
1.095HisAsp: 1.095 ± 0.029
1.106HisGlu: 1.106 ± 0.026
0.989HisPhe: 0.989 ± 0.027
2.117HisGly: 2.117 ± 0.04
0.705HisHis: 0.705 ± 0.025
1.042HisIle: 1.042 ± 0.029
0.64HisLys: 0.64 ± 0.021
2.417HisLeu: 2.417 ± 0.041
0.601HisMet: 0.601 ± 0.021
0.614HisAsn: 0.614 ± 0.023
1.54HisPro: 1.54 ± 0.033
0.854HisGln: 0.854 ± 0.029
1.482HisArg: 1.482 ± 0.034
1.16HisSer: 1.16 ± 0.031
1.207HisThr: 1.207 ± 0.03
1.524HisVal: 1.524 ± 0.039
0.487HisTrp: 0.487 ± 0.019
0.565HisTyr: 0.565 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
5.658IleAla: 5.658 ± 0.075
0.441IleCys: 0.441 ± 0.02
3.017IleAsp: 3.017 ± 0.049
3.221IleGlu: 3.221 ± 0.047
1.342IlePhe: 1.342 ± 0.034
4.003IleGly: 4.003 ± 0.056
0.887IleHis: 0.887 ± 0.027
1.468IleIle: 1.468 ± 0.038
1.628IleLys: 1.628 ± 0.042
3.32IleLeu: 3.32 ± 0.052
0.828IleMet: 0.828 ± 0.023
1.547IleAsn: 1.547 ± 0.037
2.076IlePro: 2.076 ± 0.042
1.506IleGln: 1.506 ± 0.032
2.431IleArg: 2.431 ± 0.049
2.54IleSer: 2.54 ± 0.043
2.611IleThr: 2.611 ± 0.069
3.198IleVal: 3.198 ± 0.059
0.608IleTrp: 0.608 ± 0.019
1.026IleTyr: 1.026 ± 0.028
0.0IleXaa: 0.0 ± 0.0
Lys
4.643LysAla: 4.643 ± 0.08
0.184LysCys: 0.184 ± 0.013
1.798LysAsp: 1.798 ± 0.043
1.665LysGlu: 1.665 ± 0.041
0.954LysPhe: 0.954 ± 0.028
2.906LysGly: 2.906 ± 0.051
0.714LysHis: 0.714 ± 0.024
1.477LysIle: 1.477 ± 0.037
1.523LysLys: 1.523 ± 0.051
3.537LysLeu: 3.537 ± 0.058
0.877LysMet: 0.877 ± 0.029
1.071LysAsn: 1.071 ± 0.029
2.176LysPro: 2.176 ± 0.042
1.312LysGln: 1.312 ± 0.03
2.367LysArg: 2.367 ± 0.044
1.999LysSer: 1.999 ± 0.044
2.156LysThr: 2.156 ± 0.044
2.689LysVal: 2.689 ± 0.048
0.378LysTrp: 0.378 ± 0.017
0.647LysTyr: 0.647 ± 0.022
0.0LysXaa: 0.0 ± 0.0
Leu
13.817LeuAla: 13.817 ± 0.137
1.024LeuCys: 1.024 ± 0.027
5.594LeuAsp: 5.594 ± 0.07
5.163LeuGlu: 5.163 ± 0.071
3.508LeuPhe: 3.508 ± 0.058
8.735LeuGly: 8.735 ± 0.103
2.3LeuHis: 2.3 ± 0.043
4.671LeuIle: 4.671 ± 0.069
4.061LeuLys: 4.061 ± 0.059
11.024LeuLeu: 11.024 ± 0.13
2.901LeuMet: 2.901 ± 0.053
3.229LeuAsn: 3.229 ± 0.05
6.136LeuPro: 6.136 ± 0.072
4.178LeuGln: 4.178 ± 0.059
6.831LeuArg: 6.831 ± 0.076
6.599LeuSer: 6.599 ± 0.076
5.76LeuThr: 5.76 ± 0.092
7.868LeuVal: 7.868 ± 0.076
1.371LeuTrp: 1.371 ± 0.038
1.912LeuTyr: 1.912 ± 0.041
0.0LeuXaa: 0.0 ± 0.0
Met
3.625MetAla: 3.625 ± 0.049
0.202MetCys: 0.202 ± 0.012
1.286MetAsp: 1.286 ± 0.034
1.161MetGlu: 1.161 ± 0.03
0.777MetPhe: 0.777 ± 0.024
2.418MetGly: 2.418 ± 0.048
0.632MetHis: 0.632 ± 0.023
1.025MetIle: 1.025 ± 0.028
1.133MetLys: 1.133 ± 0.031
2.834MetLeu: 2.834 ± 0.047
0.641MetMet: 0.641 ± 0.023
1.019MetAsn: 1.019 ± 0.024
1.679MetPro: 1.679 ± 0.034
1.131MetGln: 1.131 ± 0.026
1.758MetArg: 1.758 ± 0.036
1.788MetSer: 1.788 ± 0.032
1.605MetThr: 1.605 ± 0.039
2.197MetVal: 2.197 ± 0.044
0.263MetTrp: 0.263 ± 0.014
0.389MetTyr: 0.389 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
3.635AsnAla: 3.635 ± 0.052
0.285AsnCys: 0.285 ± 0.015
1.447AsnAsp: 1.447 ± 0.042
1.366AsnGlu: 1.366 ± 0.032
1.0AsnPhe: 1.0 ± 0.025
2.49AsnGly: 2.49 ± 0.043
0.653AsnHis: 0.653 ± 0.021
1.261AsnIle: 1.261 ± 0.026
0.962AsnLys: 0.962 ± 0.026
2.864AsnLeu: 2.864 ± 0.042
0.731AsnMet: 0.731 ± 0.024
0.925AsnAsn: 0.925 ± 0.031
2.027AsnPro: 2.027 ± 0.053
1.102AsnGln: 1.102 ± 0.028
1.697AsnArg: 1.697 ± 0.041
1.4AsnSer: 1.4 ± 0.033
1.693AsnThr: 1.693 ± 0.035
1.978AsnVal: 1.978 ± 0.044
0.482AsnTrp: 0.482 ± 0.02
0.67AsnTyr: 0.67 ± 0.024
0.0AsnXaa: 0.0 ± 0.0
Pro
6.206ProAla: 6.206 ± 0.077
0.386ProCys: 0.386 ± 0.017
3.12ProAsp: 3.12 ± 0.065
3.516ProGlu: 3.516 ± 0.05
1.909ProPhe: 1.909 ± 0.037
4.626ProGly: 4.626 ± 0.068
1.171ProHis: 1.171 ± 0.028
1.912ProIle: 1.912 ± 0.044
1.726ProLys: 1.726 ± 0.042
5.203ProLeu: 5.203 ± 0.059
1.585ProMet: 1.585 ± 0.034
1.463ProAsn: 1.463 ± 0.031
2.441ProPro: 2.441 ± 0.056
2.007ProGln: 2.007 ± 0.04
2.562ProArg: 2.562 ± 0.045
3.071ProSer: 3.071 ± 0.044
2.676ProThr: 2.676 ± 0.056
4.421ProVal: 4.421 ± 0.116
0.827ProTrp: 0.827 ± 0.026
1.05ProTyr: 1.05 ± 0.027
0.0ProXaa: 0.0 ± 0.0
Gln
5.737GlnAla: 5.737 ± 0.079
0.32GlnCys: 0.32 ± 0.016
1.645GlnAsp: 1.645 ± 0.037
1.591GlnGlu: 1.591 ± 0.033
1.295GlnPhe: 1.295 ± 0.033
3.183GlnGly: 3.183 ± 0.056
0.974GlnHis: 0.974 ± 0.028
1.849GlnIle: 1.849 ± 0.038
1.251GlnLys: 1.251 ± 0.031
4.273GlnLeu: 4.273 ± 0.055
1.056GlnMet: 1.056 ± 0.029
0.957GlnAsn: 0.957 ± 0.023
2.346GlnPro: 2.346 ± 0.044
1.879GlnGln: 1.879 ± 0.035
3.48GlnArg: 3.48 ± 0.056
2.318GlnSer: 2.318 ± 0.044
2.236GlnThr: 2.236 ± 0.044
3.087GlnVal: 3.087 ± 0.046
0.71GlnTrp: 0.71 ± 0.024
0.743GlnTyr: 0.743 ± 0.024
0.0GlnXaa: 0.0 ± 0.0
Arg
6.684ArgAla: 6.684 ± 0.074
0.613ArgCys: 0.613 ± 0.023
3.204ArgAsp: 3.204 ± 0.05
3.816ArgGlu: 3.816 ± 0.06
2.792ArgPhe: 2.792 ± 0.045
4.075ArgGly: 4.075 ± 0.063
1.643ArgHis: 1.643 ± 0.039
3.175ArgIle: 3.175 ± 0.05
2.029ArgLys: 2.029 ± 0.037
7.31ArgLeu: 7.31 ± 0.076
1.817ArgMet: 1.817 ± 0.037
1.724ArgAsn: 1.724 ± 0.033
2.784ArgPro: 2.784 ± 0.039
2.726ArgGln: 2.726 ± 0.041
4.026ArgArg: 4.026 ± 0.063
3.573ArgSer: 3.573 ± 0.058
2.949ArgThr: 2.949 ± 0.05
4.778ArgVal: 4.778 ± 0.057
1.184ArgTrp: 1.184 ± 0.033
1.677ArgTyr: 1.677 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
6.897SerAla: 6.897 ± 0.081
0.459SerCys: 0.459 ± 0.02
2.908SerAsp: 2.908 ± 0.05
3.066SerGlu: 3.066 ± 0.051
2.172SerPhe: 2.172 ± 0.043
5.453SerGly: 5.453 ± 0.067
1.337SerHis: 1.337 ± 0.028
2.451SerIle: 2.451 ± 0.046
1.795SerLys: 1.795 ± 0.04
6.067SerLeu: 6.067 ± 0.074
1.605SerMet: 1.605 ± 0.031
1.651SerAsn: 1.651 ± 0.035
3.177SerPro: 3.177 ± 0.055
2.189SerGln: 2.189 ± 0.043
3.565SerArg: 3.565 ± 0.059
3.414SerSer: 3.414 ± 0.068
3.143SerThr: 3.143 ± 0.05
4.301SerVal: 4.301 ± 0.066
0.789SerTrp: 0.789 ± 0.027
1.222SerTyr: 1.222 ± 0.032
0.0SerXaa: 0.0 ± 0.0
Thr
6.065ThrAla: 6.065 ± 0.068
0.461ThrCys: 0.461 ± 0.019
2.663ThrAsp: 2.663 ± 0.081
2.661ThrGlu: 2.661 ± 0.047
1.8ThrPhe: 1.8 ± 0.038
4.886ThrGly: 4.886 ± 0.08
1.3ThrHis: 1.3 ± 0.031
2.203ThrIle: 2.203 ± 0.073
1.326ThrLys: 1.326 ± 0.03
6.306ThrLeu: 6.306 ± 0.084
1.278ThrMet: 1.278 ± 0.028
1.309ThrAsn: 1.309 ± 0.038
3.524ThrPro: 3.524 ± 0.068
2.21ThrGln: 2.21 ± 0.056
3.079ThrArg: 3.079 ± 0.049
2.941ThrSer: 2.941 ± 0.045
2.996ThrThr: 2.996 ± 0.061
4.386ThrVal: 4.386 ± 0.08
0.718ThrTrp: 0.718 ± 0.023
1.138ThrTyr: 1.138 ± 0.031
0.0ThrXaa: 0.0 ± 0.0
Val
9.336ValAla: 9.336 ± 0.101
0.772ValCys: 0.772 ± 0.026
4.084ValAsp: 4.084 ± 0.09
4.021ValGlu: 4.021 ± 0.065
2.993ValPhe: 2.993 ± 0.051
5.881ValGly: 5.881 ± 0.084
1.653ValHis: 1.653 ± 0.038
3.586ValIle: 3.586 ± 0.05
2.798ValLys: 2.798 ± 0.056
8.051ValLeu: 8.051 ± 0.09
2.228ValMet: 2.228 ± 0.046
2.351ValAsn: 2.351 ± 0.058
3.868ValPro: 3.868 ± 0.077
2.941ValGln: 2.941 ± 0.043
4.73ValArg: 4.73 ± 0.06
4.556ValSer: 4.556 ± 0.063
3.941ValThr: 3.941 ± 0.076
6.506ValVal: 6.506 ± 0.089
1.153ValTrp: 1.153 ± 0.029
1.579ValTyr: 1.579 ± 0.035
0.0ValXaa: 0.0 ± 0.0
Trp
1.562TrpAla: 1.562 ± 0.037
0.164TrpCys: 0.164 ± 0.009
0.621TrpAsp: 0.621 ± 0.023
0.578TrpGlu: 0.578 ± 0.02
0.607TrpPhe: 0.607 ± 0.019
1.072TrpGly: 1.072 ± 0.024
0.397TrpHis: 0.397 ± 0.017
0.67TrpIle: 0.67 ± 0.021
0.492TrpLys: 0.492 ± 0.017
2.188TrpLeu: 2.188 ± 0.051
0.505TrpMet: 0.505 ± 0.021
0.485TrpAsn: 0.485 ± 0.019
0.78TrpPro: 0.78 ± 0.027
0.731TrpGln: 0.731 ± 0.025
1.254TrpArg: 1.254 ± 0.033
0.922TrpSer: 0.922 ± 0.025
0.807TrpThr: 0.807 ± 0.029
1.203TrpVal: 1.203 ± 0.034
0.326TrpTrp: 0.326 ± 0.019
0.282TrpTyr: 0.282 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.369TyrAla: 2.369 ± 0.042
0.239TyrCys: 0.239 ± 0.013
1.031TyrAsp: 1.031 ± 0.029
1.124TyrGlu: 1.124 ± 0.03
0.959TyrPhe: 0.959 ± 0.028
1.888TyrGly: 1.888 ± 0.039
0.435TyrHis: 0.435 ± 0.017
0.822TyrIle: 0.822 ± 0.025
0.714TyrLys: 0.714 ± 0.024
2.307TyrLeu: 2.307 ± 0.043
0.479TyrMet: 0.479 ± 0.018
0.644TyrAsn: 0.644 ± 0.027
1.041TyrPro: 1.041 ± 0.024
0.895TyrGln: 0.895 ± 0.028
1.484TyrArg: 1.484 ± 0.033
1.191TyrSer: 1.191 ± 0.031
1.261TyrThr: 1.261 ± 0.035
1.605TyrVal: 1.605 ± 0.029
0.363TyrTrp: 0.363 ± 0.016
0.552TyrTyr: 0.552 ± 0.02
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4456 proteins (1430289 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski