Amino acid dipepetide frequency for Hydrogenimonas sp.

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.933AlaAla: 7.933 ± 0.145
0.697AlaCys: 0.697 ± 0.043
4.151AlaAsp: 4.151 ± 0.083
5.643AlaGlu: 5.643 ± 0.101
3.248AlaPhe: 3.248 ± 0.074
5.706AlaGly: 5.706 ± 0.107
1.395AlaHis: 1.395 ± 0.052
5.241AlaIle: 5.241 ± 0.099
5.882AlaLys: 5.882 ± 0.101
9.081AlaLeu: 9.081 ± 0.121
2.417AlaMet: 2.417 ± 0.058
2.299AlaAsn: 2.299 ± 0.062
2.405AlaPro: 2.405 ± 0.067
2.12AlaGln: 2.12 ± 0.056
3.46AlaArg: 3.46 ± 0.069
4.335AlaSer: 4.335 ± 0.092
3.271AlaThr: 3.271 ± 0.07
6.268AlaVal: 6.268 ± 0.104
0.63AlaTrp: 0.63 ± 0.034
2.695AlaTyr: 2.695 ± 0.061
0.0AlaXaa: 0.0 ± 0.0
Cys
0.674CysAla: 0.674 ± 0.033
0.084CysCys: 0.084 ± 0.013
0.708CysAsp: 0.708 ± 0.033
0.762CysGlu: 0.762 ± 0.037
0.333CysPhe: 0.333 ± 0.024
0.946CysGly: 0.946 ± 0.04
0.399CysHis: 0.399 ± 0.045
0.546CysIle: 0.546 ± 0.029
0.469CysLys: 0.469 ± 0.027
0.518CysLeu: 0.518 ± 0.033
0.187CysMet: 0.187 ± 0.017
0.337CysAsn: 0.337 ± 0.024
0.441CysPro: 0.441 ± 0.034
0.155CysGln: 0.155 ± 0.016
0.661CysArg: 0.661 ± 0.029
0.652CysSer: 0.652 ± 0.034
0.438CysThr: 0.438 ± 0.031
0.461CysVal: 0.461 ± 0.027
0.066CysTrp: 0.066 ± 0.011
0.356CysTyr: 0.356 ± 0.025
0.0CysXaa: 0.0 ± 0.0
Asp
4.361AspAla: 4.361 ± 0.08
0.435AspCys: 0.435 ± 0.026
2.752AspAsp: 2.752 ± 0.068
5.089AspGlu: 5.089 ± 0.098
2.887AspPhe: 2.887 ± 0.067
3.733AspGly: 3.733 ± 0.084
0.849AspHis: 0.849 ± 0.037
4.904AspIle: 4.904 ± 0.085
2.753AspLys: 2.753 ± 0.074
5.488AspLeu: 5.488 ± 0.098
1.476AspMet: 1.476 ± 0.045
1.703AspAsn: 1.703 ± 0.058
2.569AspPro: 2.569 ± 0.063
0.871AspGln: 0.871 ± 0.04
3.349AspArg: 3.349 ± 0.069
2.992AspSer: 2.992 ± 0.082
2.839AspThr: 2.839 ± 0.068
3.184AspVal: 3.184 ± 0.077
0.446AspTrp: 0.446 ± 0.03
2.117AspTyr: 2.117 ± 0.072
0.0AspXaa: 0.0 ± 0.0
Glu
6.674GluAla: 6.674 ± 0.109
0.739GluCys: 0.739 ± 0.037
3.61GluAsp: 3.61 ± 0.084
7.378GluGlu: 7.378 ± 0.154
2.759GluPhe: 2.759 ± 0.068
5.056GluGly: 5.056 ± 0.094
1.548GluHis: 1.548 ± 0.05
6.066GluIle: 6.066 ± 0.109
6.878GluLys: 6.878 ± 0.118
7.442GluLeu: 7.442 ± 0.117
2.023GluMet: 2.023 ± 0.056
3.334GluAsn: 3.334 ± 0.066
2.575GluPro: 2.575 ± 0.068
1.997GluGln: 1.997 ± 0.057
5.719GluArg: 5.719 ± 0.097
5.359GluSer: 5.359 ± 0.098
3.497GluThr: 3.497 ± 0.077
4.749GluVal: 4.749 ± 0.098
0.868GluTrp: 0.868 ± 0.037
2.615GluTyr: 2.615 ± 0.062
0.0GluXaa: 0.0 ± 0.0
Phe
3.642PheAla: 3.642 ± 0.078
0.46PheCys: 0.46 ± 0.027
3.282PheAsp: 3.282 ± 0.071
3.678PheGlu: 3.678 ± 0.075
2.474PhePhe: 2.474 ± 0.073
3.739PheGly: 3.739 ± 0.084
0.803PheHis: 0.803 ± 0.038
3.17PheIle: 3.17 ± 0.072
2.779PheLys: 2.779 ± 0.065
4.255PheLeu: 4.255 ± 0.11
1.274PheMet: 1.274 ± 0.045
1.708PheAsn: 1.708 ± 0.056
1.366PhePro: 1.366 ± 0.041
0.829PheGln: 0.829 ± 0.033
2.275PheArg: 2.275 ± 0.065
3.166PheSer: 3.166 ± 0.073
2.378PheThr: 2.378 ± 0.064
2.933PheVal: 2.933 ± 0.076
0.478PheTrp: 0.478 ± 0.029
1.737PheTyr: 1.737 ± 0.049
0.0PheXaa: 0.0 ± 0.0
Gly
5.241GlyAla: 5.241 ± 0.091
0.955GlyCys: 0.955 ± 0.043
3.69GlyAsp: 3.69 ± 0.079
5.111GlyGlu: 5.111 ± 0.096
3.537GlyPhe: 3.537 ± 0.072
5.183GlyGly: 5.183 ± 0.103
1.361GlyHis: 1.361 ± 0.053
5.054GlyIle: 5.054 ± 0.097
4.84GlyLys: 4.84 ± 0.094
5.883GlyLeu: 5.883 ± 0.099
2.114GlyMet: 2.114 ± 0.063
2.232GlyAsn: 2.232 ± 0.056
1.538GlyPro: 1.538 ± 0.051
1.321GlyGln: 1.321 ± 0.051
4.243GlyArg: 4.243 ± 0.094
4.464GlySer: 4.464 ± 0.084
3.261GlyThr: 3.261 ± 0.075
5.396GlyVal: 5.396 ± 0.104
0.835GlyTrp: 0.835 ± 0.036
3.083GlyTyr: 3.083 ± 0.077
0.0GlyXaa: 0.0 ± 0.0
His
1.266HisAla: 1.266 ± 0.049
0.215HisCys: 0.215 ± 0.018
0.969HisAsp: 0.969 ± 0.044
1.312HisGlu: 1.312 ± 0.043
0.992HisPhe: 0.992 ± 0.037
1.328HisGly: 1.328 ± 0.052
0.474HisHis: 0.474 ± 0.031
1.551HisIle: 1.551 ± 0.051
1.125HisLys: 1.125 ± 0.043
1.912HisLeu: 1.912 ± 0.06
0.537HisMet: 0.537 ± 0.03
0.747HisAsn: 0.747 ± 0.037
1.128HisPro: 1.128 ± 0.049
0.463HisGln: 0.463 ± 0.026
1.118HisArg: 1.118 ± 0.044
1.067HisSer: 1.067 ± 0.039
1.052HisThr: 1.052 ± 0.042
0.877HisVal: 0.877 ± 0.038
0.185HisTrp: 0.185 ± 0.017
0.839HisTyr: 0.839 ± 0.038
0.0HisXaa: 0.0 ± 0.0
Ile
6.143IleAla: 6.143 ± 0.113
0.661IleCys: 0.661 ± 0.033
5.054IleAsp: 5.054 ± 0.087
6.86IleGlu: 6.86 ± 0.114
3.624IlePhe: 3.624 ± 0.086
5.266IleGly: 5.266 ± 0.096
1.231IleHis: 1.231 ± 0.041
4.209IleIle: 4.209 ± 0.09
4.568IleLys: 4.568 ± 0.097
6.394IleLeu: 6.394 ± 0.11
1.401IleMet: 1.401 ± 0.044
2.422IleAsn: 2.422 ± 0.064
2.787IlePro: 2.787 ± 0.067
1.338IleGln: 1.338 ± 0.047
3.454IleArg: 3.454 ± 0.07
4.496IleSer: 4.496 ± 0.094
3.281IleThr: 3.281 ± 0.07
5.396IleVal: 5.396 ± 0.095
0.507IleTrp: 0.507 ± 0.027
2.634IleTyr: 2.634 ± 0.06
0.0IleXaa: 0.0 ± 0.0
Lys
4.942LysAla: 4.942 ± 0.082
0.478LysCys: 0.478 ± 0.029
3.123LysAsp: 3.123 ± 0.078
7.303LysGlu: 7.303 ± 0.12
1.964LysPhe: 1.964 ± 0.062
3.966LysGly: 3.966 ± 0.09
1.11LysHis: 1.11 ± 0.042
5.629LysIle: 5.629 ± 0.102
5.692LysLys: 5.692 ± 0.117
5.781LysLeu: 5.781 ± 0.097
2.083LysMet: 2.083 ± 0.052
2.868LysAsn: 2.868 ± 0.069
2.733LysPro: 2.733 ± 0.081
1.651LysGln: 1.651 ± 0.053
5.342LysArg: 5.342 ± 0.106
4.766LysSer: 4.766 ± 0.092
3.106LysThr: 3.106 ± 0.084
3.92LysVal: 3.92 ± 0.074
0.553LysTrp: 0.553 ± 0.03
1.997LysTyr: 1.997 ± 0.065
0.0LysXaa: 0.0 ± 0.0
Leu
7.435LeuAla: 7.435 ± 0.117
0.874LeuCys: 0.874 ± 0.039
5.445LeuAsp: 5.445 ± 0.092
7.446LeuGlu: 7.446 ± 0.127
5.71LeuPhe: 5.71 ± 0.122
6.231LeuGly: 6.231 ± 0.106
2.112LeuHis: 2.112 ± 0.059
6.097LeuIle: 6.097 ± 0.126
7.979LeuLys: 7.979 ± 0.136
10.603LeuLeu: 10.603 ± 0.192
2.465LeuMet: 2.465 ± 0.058
3.365LeuAsn: 3.365 ± 0.07
4.159LeuPro: 4.159 ± 0.094
2.954LeuGln: 2.954 ± 0.074
4.642LeuArg: 4.642 ± 0.092
6.44LeuSer: 6.44 ± 0.104
4.323LeuThr: 4.323 ± 0.079
5.664LeuVal: 5.664 ± 0.111
0.88LeuTrp: 0.88 ± 0.042
3.927LeuTyr: 3.927 ± 0.089
0.0LeuXaa: 0.0 ± 0.0
Met
2.31MetAla: 2.31 ± 0.063
0.17MetCys: 0.17 ± 0.017
1.34MetAsp: 1.34 ± 0.043
2.069MetGlu: 2.069 ± 0.062
0.911MetPhe: 0.911 ± 0.042
1.932MetGly: 1.932 ± 0.052
0.537MetHis: 0.537 ± 0.031
1.971MetIle: 1.971 ± 0.061
2.306MetLys: 2.306 ± 0.059
2.637MetLeu: 2.637 ± 0.056
0.851MetMet: 0.851 ± 0.04
0.943MetAsn: 0.943 ± 0.041
1.213MetPro: 1.213 ± 0.046
0.972MetGln: 0.972 ± 0.039
1.738MetArg: 1.738 ± 0.053
1.481MetSer: 1.481 ± 0.052
1.16MetThr: 1.16 ± 0.038
1.945MetVal: 1.945 ± 0.046
0.193MetTrp: 0.193 ± 0.017
0.587MetTyr: 0.587 ± 0.035
0.0MetXaa: 0.0 ± 0.0
Asn
2.578AsnAla: 2.578 ± 0.064
0.317AsnCys: 0.317 ± 0.023
1.844AsnAsp: 1.844 ± 0.056
2.473AsnGlu: 2.473 ± 0.072
1.59AsnPhe: 1.59 ± 0.048
2.753AsnGly: 2.753 ± 0.085
0.624AsnHis: 0.624 ± 0.031
3.084AsnIle: 3.084 ± 0.074
1.426AsnLys: 1.426 ± 0.056
3.452AsnLeu: 3.452 ± 0.07
0.806AsnMet: 0.806 ± 0.039
0.998AsnAsn: 0.998 ± 0.046
2.01AsnPro: 2.01 ± 0.049
0.702AsnGln: 0.702 ± 0.035
2.92AsnArg: 2.92 ± 0.066
2.074AsnSer: 2.074 ± 0.056
1.476AsnThr: 1.476 ± 0.05
2.091AsnVal: 2.091 ± 0.061
0.254AsnTrp: 0.254 ± 0.022
1.321AsnTyr: 1.321 ± 0.042
0.0AsnXaa: 0.0 ± 0.0
Pro
2.678ProAla: 2.678 ± 0.069
0.294ProCys: 0.294 ± 0.022
2.394ProAsp: 2.394 ± 0.062
3.379ProGlu: 3.379 ± 0.077
2.037ProPhe: 2.037 ± 0.059
2.416ProGly: 2.416 ± 0.062
0.866ProHis: 0.866 ± 0.037
2.41ProIle: 2.41 ± 0.06
2.687ProLys: 2.687 ± 0.062
3.958ProLeu: 3.958 ± 0.077
1.045ProMet: 1.045 ± 0.042
1.288ProAsn: 1.288 ± 0.044
1.347ProPro: 1.347 ± 0.049
1.091ProGln: 1.091 ± 0.039
1.444ProArg: 1.444 ± 0.052
2.097ProSer: 2.097 ± 0.057
1.637ProThr: 1.637 ± 0.053
3.124ProVal: 3.124 ± 0.056
0.385ProTrp: 0.385 ± 0.023
1.57ProTyr: 1.57 ± 0.052
0.0ProXaa: 0.0 ± 0.0
Gln
1.686GlnAla: 1.686 ± 0.054
0.215GlnCys: 0.215 ± 0.019
0.852GlnAsp: 0.852 ± 0.038
1.603GlnGlu: 1.603 ± 0.054
0.888GlnPhe: 0.888 ± 0.039
1.234GlnGly: 1.234 ± 0.048
0.414GlnHis: 0.414 ± 0.027
2.2GlnIle: 2.2 ± 0.057
2.433GlnLys: 2.433 ± 0.06
2.14GlnLeu: 2.14 ± 0.061
0.883GlnMet: 0.883 ± 0.04
1.295GlnAsn: 1.295 ± 0.044
0.9GlnPro: 0.9 ± 0.034
0.73GlnGln: 0.73 ± 0.037
1.41GlnArg: 1.41 ± 0.044
1.55GlnSer: 1.55 ± 0.046
1.18GlnThr: 1.18 ± 0.044
1.246GlnVal: 1.246 ± 0.044
0.244GlnTrp: 0.244 ± 0.017
0.717GlnTyr: 0.717 ± 0.032
0.0GlnXaa: 0.0 ± 0.0
Arg
3.923ArgAla: 3.923 ± 0.087
0.561ArgCys: 0.561 ± 0.031
2.672ArgAsp: 2.672 ± 0.062
4.751ArgGlu: 4.751 ± 0.095
3.282ArgPhe: 3.282 ± 0.08
3.34ArgGly: 3.34 ± 0.076
1.289ArgHis: 1.289 ± 0.046
4.047ArgIle: 4.047 ± 0.078
3.697ArgLys: 3.697 ± 0.081
6.046ArgLeu: 6.046 ± 0.09
1.654ArgMet: 1.654 ± 0.052
2.022ArgAsn: 2.022 ± 0.06
1.918ArgPro: 1.918 ± 0.053
1.334ArgGln: 1.334 ± 0.05
3.587ArgArg: 3.587 ± 0.082
3.497ArgSer: 3.497 ± 0.064
2.091ArgThr: 2.091 ± 0.061
4.041ArgVal: 4.041 ± 0.091
0.602ArgTrp: 0.602 ± 0.032
3.026ArgTyr: 3.026 ± 0.076
0.0ArgXaa: 0.0 ± 0.0
Ser
4.84SerAla: 4.84 ± 0.098
0.593SerCys: 0.593 ± 0.033
3.529SerAsp: 3.529 ± 0.079
4.239SerGlu: 4.239 ± 0.092
3.369SerPhe: 3.369 ± 0.079
5.212SerGly: 5.212 ± 0.089
1.119SerHis: 1.119 ± 0.041
4.22SerIle: 4.22 ± 0.085
3.684SerLys: 3.684 ± 0.076
6.362SerLeu: 6.362 ± 0.11
1.709SerMet: 1.709 ± 0.053
1.824SerAsn: 1.824 ± 0.056
2.172SerPro: 2.172 ± 0.057
1.397SerGln: 1.397 ± 0.041
3.599SerArg: 3.599 ± 0.086
3.78SerSer: 3.78 ± 0.088
2.733SerThr: 2.733 ± 0.062
4.317SerVal: 4.317 ± 0.084
0.575SerTrp: 0.575 ± 0.034
2.321SerTyr: 2.321 ± 0.062
0.0SerXaa: 0.0 ± 0.0
Thr
3.716ThrAla: 3.716 ± 0.079
0.316ThrCys: 0.316 ± 0.023
2.548ThrAsp: 2.548 ± 0.069
2.805ThrGlu: 2.805 ± 0.069
2.068ThrPhe: 2.068 ± 0.052
3.598ThrGly: 3.598 ± 0.074
0.869ThrHis: 0.869 ± 0.035
3.432ThrIle: 3.432 ± 0.077
2.601ThrLys: 2.601 ± 0.069
5.571ThrLeu: 5.571 ± 0.082
1.154ThrMet: 1.154 ± 0.039
1.335ThrAsn: 1.335 ± 0.045
2.477ThrPro: 2.477 ± 0.062
1.193ThrGln: 1.193 ± 0.041
1.875ThrArg: 1.875 ± 0.054
2.31ThrSer: 2.31 ± 0.066
2.177ThrThr: 2.177 ± 0.064
3.762ThrVal: 3.762 ± 0.083
0.323ThrTrp: 0.323 ± 0.019
1.502ThrTyr: 1.502 ± 0.056
0.0ThrXaa: 0.0 ± 0.0
Val
5.531ValAla: 5.531 ± 0.098
0.704ValCys: 0.704 ± 0.036
4.052ValAsp: 4.052 ± 0.083
5.673ValGlu: 5.673 ± 0.087
2.401ValPhe: 2.401 ± 0.057
4.326ValGly: 4.326 ± 0.097
1.205ValHis: 1.205 ± 0.043
4.565ValIle: 4.565 ± 0.096
4.709ValLys: 4.709 ± 0.091
6.208ValLeu: 6.208 ± 0.117
1.951ValMet: 1.951 ± 0.054
2.318ValAsn: 2.318 ± 0.061
2.623ValPro: 2.623 ± 0.068
1.528ValGln: 1.528 ± 0.043
3.17ValArg: 3.17 ± 0.068
4.314ValSer: 4.314 ± 0.078
3.707ValThr: 3.707 ± 0.087
5.174ValVal: 5.174 ± 0.102
0.652ValTrp: 0.652 ± 0.036
2.135ValTyr: 2.135 ± 0.057
0.0ValXaa: 0.0 ± 0.0
Trp
0.555TrpAla: 0.555 ± 0.033
0.101TrpCys: 0.101 ± 0.012
0.428TrpAsp: 0.428 ± 0.028
0.569TrpGlu: 0.569 ± 0.032
0.52TrpPhe: 0.52 ± 0.031
0.592TrpGly: 0.592 ± 0.03
0.222TrpHis: 0.222 ± 0.018
0.717TrpIle: 0.717 ± 0.038
0.491TrpLys: 0.491 ± 0.032
1.174TrpLeu: 1.174 ± 0.046
0.35TrpMet: 0.35 ± 0.025
0.33TrpAsn: 0.33 ± 0.023
0.276TrpPro: 0.276 ± 0.02
0.348TrpGln: 0.348 ± 0.021
0.572TrpArg: 0.572 ± 0.03
0.529TrpSer: 0.529 ± 0.031
0.293TrpThr: 0.293 ± 0.021
0.604TrpVal: 0.604 ± 0.035
0.139TrpTrp: 0.139 ± 0.016
0.402TrpTyr: 0.402 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.75TyrAla: 2.75 ± 0.061
0.323TyrCys: 0.323 ± 0.025
2.356TyrAsp: 2.356 ± 0.064
2.747TyrGlu: 2.747 ± 0.072
1.833TyrPhe: 1.833 ± 0.057
2.787TyrGly: 2.787 ± 0.068
0.727TyrHis: 0.727 ± 0.032
2.611TyrIle: 2.611 ± 0.059
2.01TyrLys: 2.01 ± 0.065
3.845TyrLeu: 3.845 ± 0.1
0.881TyrMet: 0.881 ± 0.034
1.427TyrAsn: 1.427 ± 0.054
1.511TyrPro: 1.511 ± 0.052
0.826TyrGln: 0.826 ± 0.04
2.801TyrArg: 2.801 ± 0.071
2.226TyrSer: 2.226 ± 0.062
1.734TyrThr: 1.734 ± 0.056
1.794TyrVal: 1.794 ± 0.059
0.392TyrTrp: 0.392 ± 0.025
1.464TyrTyr: 1.464 ± 0.058
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2173 proteins (652339 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski