Amino acid dipeptide frequency for Chromatium okenii

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
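As a rough illustration of what such a frequency means, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the choice to pool counts over all proteins are assumptions made for this example; the page does not state how the database computes its values.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs within one protein: positions (0,1), (1,2), ...
        # Pairs never span two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter amino acid codes).
toy = ["MAAL", "ALA"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

Note that the order of the two residues matters here: "AL" and "LA" are counted separately, just as AlaLeu and LeuAla are separate entries in the table.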

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
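The comparison against the uniform expectation can be sketched as follows. The four sample values are copied from the table on this page; the function name `classify` and the simple greater-than/less-than rule are assumptions for illustration, since the page does not say whether the coloring also takes the estimated error into account.

```python
# Uniform expectation for 20 x 20 dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide as over- or underrepresented relative to `expected`."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "as expected"


# Values (per mille) taken from the table for Chromatium okenii.
sample = {"AlaAla": 13.55, "LeuLeu": 12.006, "CysCys": 0.231, "TrpTrp": 0.243}
labels = {name: classify(value) for name, value in sample.items()}
for name, label in labels.items():
    print(name, label)
```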

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
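A small sketch of the unit conversion, using the AlaAla value from the table as input. The helper names are hypothetical and exist only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 13.55  # AlaAla frequency in per mille, from the table
fraction = per_mille_to_fraction(ala_ala)  # about 0.01355
percent = per_mille_to_percent(ala_ala)    # about 1.355
print(f"{ala_ala} per mille = {fraction:.5f} = {percent:.3f}%")
```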

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.55 ± 0.233
AlaCys: 0.979 ± 0.038
AlaAsp: 5.801 ± 0.098
AlaGlu: 6.764 ± 0.114
AlaPhe: 3.439 ± 0.077
AlaGly: 7.545 ± 0.132
AlaHis: 2.325 ± 0.063
AlaIle: 6.157 ± 0.085
AlaLys: 3.467 ± 0.081
AlaLeu: 12.556 ± 0.195
AlaMet: 2.495 ± 0.054
AlaAsn: 3.893 ± 0.143
AlaPro: 4.749 ± 0.148
AlaGln: 5.582 ± 0.108
AlaArg: 6.046 ± 0.101
AlaSer: 4.896 ± 0.095
AlaThr: 6.279 ± 0.156
AlaVal: 7.551 ± 0.096
AlaTrp: 1.221 ± 0.038
AlaTyr: 2.074 ± 0.051
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.256 ± 0.042
CysCys: 0.231 ± 0.018
CysAsp: 0.578 ± 0.029
CysGlu: 0.614 ± 0.032
CysPhe: 0.436 ± 0.025
CysGly: 1.108 ± 0.039
CysHis: 0.383 ± 0.031
CysIle: 0.557 ± 0.025
CysLys: 0.344 ± 0.021
CysLeu: 0.892 ± 0.035
CysMet: 0.22 ± 0.015
CysAsn: 0.403 ± 0.021
CysPro: 0.579 ± 0.032
CysGln: 0.49 ± 0.025
CysArg: 0.668 ± 0.03
CysSer: 0.806 ± 0.092
CysThr: 0.655 ± 0.052
CysVal: 0.747 ± 0.039
CysTrp: 0.162 ± 0.015
CysTyr: 0.351 ± 0.023
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.429 ± 0.097
AspCys: 0.777 ± 0.053
AspAsp: 3.357 ± 0.073
AspGlu: 3.075 ± 0.071
AspPhe: 2.409 ± 0.059
AspGly: 4.238 ± 0.105
AspHis: 1.159 ± 0.04
AspIle: 3.07 ± 0.065
AspLys: 1.589 ± 0.056
AspLeu: 5.909 ± 0.087
AspMet: 1.157 ± 0.044
AspAsn: 1.785 ± 0.058
AspPro: 2.787 ± 0.065
AspGln: 2.163 ± 0.056
AspArg: 3.026 ± 0.073
AspSer: 2.904 ± 0.057
AspThr: 2.846 ± 0.071
AspVal: 3.723 ± 0.076
AspTrp: 0.902 ± 0.034
AspTyr: 1.874 ± 0.057
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.09 ± 0.092
GluCys: 0.5 ± 0.026
GluAsp: 2.357 ± 0.062
GluGlu: 2.698 ± 0.072
GluPhe: 2.03 ± 0.051
GluGly: 2.836 ± 0.069
GluHis: 1.458 ± 0.046
GluIle: 3.948 ± 0.077
GluLys: 2.36 ± 0.07
GluLeu: 7.059 ± 0.108
GluMet: 1.582 ± 0.045
GluAsn: 1.971 ± 0.052
GluPro: 2.332 ± 0.1
GluGln: 3.336 ± 0.076
GluArg: 4.929 ± 0.106
GluSer: 2.808 ± 0.076
GluThr: 3.647 ± 0.128
GluVal: 3.655 ± 0.076
GluTrp: 0.763 ± 0.03
GluTyr: 1.202 ± 0.037
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.863 ± 0.074
PheCys: 0.497 ± 0.025
PheAsp: 2.56 ± 0.057
PheGlu: 2.148 ± 0.048
PhePhe: 1.534 ± 0.053
PheGly: 2.947 ± 0.067
PheHis: 0.808 ± 0.033
PheIle: 2.24 ± 0.057
PheLys: 1.376 ± 0.045
PheLeu: 3.331 ± 0.073
PheMet: 0.781 ± 0.036
PheAsn: 1.819 ± 0.048
PhePro: 1.512 ± 0.046
PheGln: 1.403 ± 0.048
PheArg: 1.847 ± 0.05
PheSer: 2.528 ± 0.067
PheThr: 2.512 ± 0.09
PheVal: 2.401 ± 0.063
PheTrp: 0.507 ± 0.029
PheTyr: 1.059 ± 0.034
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.786 ± 0.108
GlyCys: 0.939 ± 0.036
GlyAsp: 3.89 ± 0.071
GlyGlu: 3.811 ± 0.075
GlyPhe: 3.016 ± 0.075
GlyGly: 5.14 ± 0.123
GlyHis: 1.567 ± 0.048
GlyIle: 4.476 ± 0.073
GlyLys: 3.006 ± 0.077
GlyLeu: 6.962 ± 0.116
GlyMet: 1.824 ± 0.054
GlyAsn: 2.771 ± 0.086
GlyPro: 1.213 ± 0.041
GlyGln: 2.821 ± 0.072
GlyArg: 4.064 ± 0.095
GlySer: 3.987 ± 0.096
GlyThr: 4.026 ± 0.117
GlyVal: 4.973 ± 0.089
GlyTrp: 1.118 ± 0.042
GlyTyr: 2.415 ± 0.063
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.106 ± 0.056
HisCys: 0.449 ± 0.026
HisAsp: 1.246 ± 0.043
HisGlu: 1.086 ± 0.037
HisPhe: 1.037 ± 0.037
HisGly: 1.709 ± 0.052
HisHis: 0.811 ± 0.038
HisIle: 1.192 ± 0.039
HisLys: 0.616 ± 0.03
HisLeu: 2.737 ± 0.066
HisMet: 0.389 ± 0.021
HisAsn: 0.706 ± 0.031
HisPro: 1.47 ± 0.044
HisGln: 1.083 ± 0.036
HisArg: 1.592 ± 0.052
HisSer: 1.35 ± 0.045
HisThr: 1.068 ± 0.038
HisVal: 1.251 ± 0.037
HisTrp: 0.447 ± 0.023
HisTyr: 0.763 ± 0.03
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.118 ± 0.121
IleCys: 0.706 ± 0.029
IleAsp: 4.118 ± 0.075
IleGlu: 4.173 ± 0.116
IlePhe: 1.848 ± 0.052
IleGly: 4.64 ± 0.08
IleHis: 1.351 ± 0.038
IleIle: 3.029 ± 0.072
IleLys: 2.359 ± 0.091
IleLeu: 5.018 ± 0.09
IleMet: 0.945 ± 0.04
IleAsn: 2.602 ± 0.071
IlePro: 2.8 ± 0.06
IleGln: 2.234 ± 0.078
IleArg: 3.314 ± 0.07
IleSer: 3.432 ± 0.068
IleThr: 3.845 ± 0.092
IleVal: 3.605 ± 0.072
IleTrp: 0.646 ± 0.029
IleTyr: 1.437 ± 0.044
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.195 ± 0.076
LysCys: 0.277 ± 0.019
LysAsp: 1.794 ± 0.057
LysGlu: 1.844 ± 0.06
LysPhe: 1.019 ± 0.041
LysGly: 2.157 ± 0.055
LysHis: 0.826 ± 0.032
LysIle: 2.31 ± 0.063
LysLys: 1.825 ± 0.07
LysLeu: 3.503 ± 0.075
LysMet: 0.849 ± 0.03
LysAsn: 1.602 ± 0.05
LysPro: 2.229 ± 0.12
LysGln: 1.747 ± 0.049
LysArg: 2.465 ± 0.065
LysSer: 2.069 ± 0.058
LysThr: 2.623 ± 0.072
LysVal: 2.188 ± 0.072
LysTrp: 0.46 ± 0.025
LysTyr: 0.733 ± 0.035
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.343 ± 0.188
LeuCys: 1.027 ± 0.041
LeuAsp: 6.092 ± 0.086
LeuGlu: 6.131 ± 0.092
LeuPhe: 4.047 ± 0.077
LeuGly: 6.927 ± 0.106
LeuHis: 2.339 ± 0.066
LeuIle: 6.525 ± 0.108
LeuLys: 4.14 ± 0.079
LeuLeu: 12.006 ± 0.221
LeuMet: 2.479 ± 0.059
LeuAsn: 4.497 ± 0.089
LeuPro: 5.775 ± 0.119
LeuGln: 4.142 ± 0.089
LeuArg: 7.383 ± 0.122
LeuSer: 6.139 ± 0.098
LeuThr: 7.445 ± 0.113
LeuVal: 6.773 ± 0.103
LeuTrp: 1.124 ± 0.048
LeuTyr: 2.187 ± 0.05
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.037 ± 0.055
MetCys: 0.189 ± 0.015
MetAsp: 1.002 ± 0.036
MetGlu: 1.053 ± 0.037
MetPhe: 0.67 ± 0.028
MetGly: 1.303 ± 0.037
MetHis: 0.43 ± 0.024
MetIle: 1.291 ± 0.042
MetLys: 1.003 ± 0.032
MetLeu: 2.402 ± 0.055
MetMet: 0.583 ± 0.027
MetAsn: 1.123 ± 0.035
MetPro: 1.348 ± 0.045
MetGln: 1.01 ± 0.039
MetArg: 1.613 ± 0.048
MetSer: 1.425 ± 0.043
MetThr: 1.776 ± 0.047
MetVal: 1.414 ± 0.037
MetTrp: 0.172 ± 0.015
MetTyr: 0.355 ± 0.023
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.05 ± 0.095
AsnCys: 0.63 ± 0.064
AsnAsp: 2.392 ± 0.076
AsnGlu: 2.014 ± 0.049
AsnPhe: 1.421 ± 0.076
AsnGly: 3.038 ± 0.081
AsnHis: 0.954 ± 0.036
AsnIle: 1.764 ± 0.041
AsnLys: 1.383 ± 0.053
AsnLeu: 3.861 ± 0.072
AsnMet: 0.779 ± 0.032
AsnAsn: 1.543 ± 0.06
AsnPro: 2.288 ± 0.089
AsnGln: 1.798 ± 0.047
AsnArg: 2.224 ± 0.055
AsnSer: 2.166 ± 0.063
AsnThr: 2.213 ± 0.064
AsnVal: 2.247 ± 0.079
AsnTrp: 0.675 ± 0.029
AsnTyr: 1.122 ± 0.045
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.355 ± 0.115
ProCys: 0.393 ± 0.026
ProAsp: 2.805 ± 0.061
ProGlu: 3.244 ± 0.087
ProPhe: 1.679 ± 0.044
ProGly: 2.542 ± 0.056
ProHis: 1.002 ± 0.038
ProIle: 2.779 ± 0.154
ProLys: 1.388 ± 0.048
ProLeu: 5.214 ± 0.104
ProMet: 0.943 ± 0.034
ProAsn: 2.017 ± 0.088
ProPro: 2.603 ± 0.087
ProGln: 1.815 ± 0.049
ProArg: 2.266 ± 0.062
ProSer: 2.439 ± 0.06
ProThr: 2.868 ± 0.07
ProVal: 3.677 ± 0.155
ProTrp: 0.556 ± 0.024
ProTyr: 1.113 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.411 ± 0.092
GlnCys: 0.477 ± 0.028
GlnAsp: 1.744 ± 0.05
GlnGlu: 2.017 ± 0.053
GlnPhe: 1.772 ± 0.047
GlnGly: 2.265 ± 0.05
GlnHis: 1.317 ± 0.04
GlnIle: 2.825 ± 0.067
GlnLys: 1.359 ± 0.046
GlnLeu: 5.788 ± 0.112
GlnMet: 1.154 ± 0.04
GlnAsn: 1.246 ± 0.045
GlnPro: 2.402 ± 0.099
GlnGln: 2.767 ± 0.071
GlnArg: 3.826 ± 0.073
GlnSer: 2.287 ± 0.116
GlnThr: 2.525 ± 0.057
GlnVal: 3.075 ± 0.077
GlnTrp: 0.612 ± 0.028
GlnTyr: 0.844 ± 0.033
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.527 ± 0.101
ArgCys: 0.798 ± 0.034
ArgAsp: 3.434 ± 0.07
ArgGlu: 4.101 ± 0.096
ArgPhe: 2.775 ± 0.059
ArgGly: 3.671 ± 0.078
ArgHis: 1.691 ± 0.056
ArgIle: 4.165 ± 0.073
ArgLys: 2.112 ± 0.05
ArgLeu: 7.508 ± 0.131
ArgMet: 1.588 ± 0.047
ArgAsn: 2.11 ± 0.052
ArgPro: 2.44 ± 0.061
ArgGln: 3.249 ± 0.08
ArgArg: 4.223 ± 0.089
ArgSer: 3.095 ± 0.076
ArgThr: 2.879 ± 0.064
ArgVal: 4.258 ± 0.093
ArgTrp: 1.011 ± 0.041
ArgTyr: 1.974 ± 0.051
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.916 ± 0.088
SerCys: 0.676 ± 0.049
SerAsp: 2.969 ± 0.069
SerGlu: 2.933 ± 0.067
SerPhe: 2.117 ± 0.054
SerGly: 5.245 ± 0.143
SerHis: 1.108 ± 0.041
SerIle: 3.242 ± 0.081
SerLys: 2.008 ± 0.058
SerLeu: 5.404 ± 0.106
SerMet: 1.129 ± 0.037
SerAsn: 2.219 ± 0.067
SerPro: 2.425 ± 0.068
SerGln: 2.047 ± 0.074
SerArg: 3.089 ± 0.071
SerSer: 3.469 ± 0.085
SerThr: 3.23 ± 0.078
SerVal: 3.753 ± 0.094
SerTrp: 0.719 ± 0.035
SerTyr: 1.353 ± 0.037
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.96 ± 0.238
ThrCys: 0.63 ± 0.04
ThrAsp: 3.237 ± 0.083
ThrGlu: 3.35 ± 0.079
ThrPhe: 2.139 ± 0.06
ThrGly: 4.868 ± 0.112
ThrHis: 1.221 ± 0.044
ThrIle: 3.186 ± 0.08
ThrLys: 1.688 ± 0.062
ThrLeu: 7.264 ± 0.117
ThrMet: 0.963 ± 0.038
ThrAsn: 2.029 ± 0.072
ThrPro: 3.494 ± 0.093
ThrGln: 2.395 ± 0.061
ThrArg: 3.119 ± 0.065
ThrSer: 2.96 ± 0.075
ThrThr: 3.659 ± 0.092
ThrVal: 4.613 ± 0.115
ThrTrp: 0.652 ± 0.033
ThrTyr: 1.305 ± 0.066
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.179 ± 0.121
ValCys: 0.706 ± 0.029
ValAsp: 3.864 ± 0.083
ValGlu: 3.798 ± 0.071
ValPhe: 2.533 ± 0.059
ValGly: 4.203 ± 0.09
ValHis: 1.366 ± 0.042
ValIle: 4.641 ± 0.088
ValLys: 2.522 ± 0.067
ValLeu: 7.223 ± 0.092
ValMet: 1.592 ± 0.043
ValAsn: 2.903 ± 0.083
ValPro: 2.606 ± 0.056
ValGln: 2.35 ± 0.096
ValArg: 3.955 ± 0.087
ValSer: 3.762 ± 0.07
ValThr: 4.654 ± 0.145
ValVal: 4.711 ± 0.096
ValTrp: 0.744 ± 0.033
ValTyr: 1.544 ± 0.045
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.948 ± 0.038
TrpCys: 0.193 ± 0.019
TrpAsp: 0.528 ± 0.027
TrpGlu: 0.616 ± 0.029
TrpPhe: 0.601 ± 0.028
TrpGly: 0.653 ± 0.028
TrpHis: 0.33 ± 0.021
TrpIle: 0.77 ± 0.033
TrpLys: 0.502 ± 0.024
TrpLeu: 1.907 ± 0.058
TrpMet: 0.351 ± 0.02
TrpAsn: 0.53 ± 0.026
TrpPro: 0.428 ± 0.026
TrpGln: 0.825 ± 0.032
TrpArg: 1.063 ± 0.047
TrpSer: 0.939 ± 0.041
TrpThr: 0.63 ± 0.027
TrpVal: 0.779 ± 0.035
TrpTrp: 0.243 ± 0.019
TrpTyr: 0.319 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.252 ± 0.058
TyrCys: 0.406 ± 0.018
TyrAsp: 1.413 ± 0.044
TyrGlu: 1.094 ± 0.038
TyrPhe: 1.149 ± 0.044
TyrGly: 1.771 ± 0.053
TyrHis: 0.662 ± 0.026
TyrIle: 1.053 ± 0.039
TyrLys: 0.666 ± 0.035
TyrLeu: 2.978 ± 0.065
TyrMet: 0.393 ± 0.022
TyrAsn: 0.795 ± 0.036
TyrPro: 1.216 ± 0.045
TyrGln: 1.457 ± 0.047
TyrArg: 1.9 ± 0.051
TyrSer: 1.591 ± 0.057
TyrThr: 1.348 ± 0.056
TyrVal: 1.462 ± 0.043
TyrTrp: 0.42 ± 0.023
TyrTyr: 0.77 ± 0.031
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2811 proteins (814741 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
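A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the statistic is recomputed for each replicate, and the spread of the replicates is taken as the error. This is only an illustrative sketch. The function names, the toy sequences, the fixed seed, and the use of the standard deviation across replicates as the reported error are all assumptions; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of the frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resampled = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(resampled, pair))
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter amino acid codes).
proteins = ["MAALAV", "MKAAL", "MLLAAG", "MGAV"]
mean, err = bootstrap_error(proteins, "AA")
print(f"AA: {mean:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the dipeptides inside one protein stay together in every replicate.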

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski