Amino acid dipepetide frequency for Melghiribacillus thermohalophilus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.153AlaAla: 5.153 ± 0.1
0.524AlaCys: 0.524 ± 0.028
3.273AlaAsp: 3.273 ± 0.073
4.567AlaGlu: 4.567 ± 0.089
3.103AlaPhe: 3.103 ± 0.066
5.275AlaGly: 5.275 ± 0.085
1.359AlaHis: 1.359 ± 0.047
5.383AlaIle: 5.383 ± 0.09
3.919AlaLys: 3.919 ± 0.073
6.788AlaLeu: 6.788 ± 0.099
1.907AlaMet: 1.907 ± 0.056
2.286AlaAsn: 2.286 ± 0.053
1.881AlaPro: 1.881 ± 0.055
2.272AlaGln: 2.272 ± 0.109
3.396AlaArg: 3.396 ± 0.06
3.656AlaSer: 3.656 ± 0.066
2.828AlaThr: 2.828 ± 0.064
5.222AlaVal: 5.222 ± 0.082
0.513AlaTrp: 0.513 ± 0.027
2.179AlaTyr: 2.179 ± 0.053
0.0AlaXaa: 0.0 ± 0.0
Cys
0.348CysAla: 0.348 ± 0.023
0.092CysCys: 0.092 ± 0.013
0.372CysAsp: 0.372 ± 0.026
0.414CysGlu: 0.414 ± 0.026
0.286CysPhe: 0.286 ± 0.021
0.589CysGly: 0.589 ± 0.027
0.181CysHis: 0.181 ± 0.014
0.39CysIle: 0.39 ± 0.019
0.321CysLys: 0.321 ± 0.019
0.573CysLeu: 0.573 ± 0.028
0.178CysMet: 0.178 ± 0.016
0.242CysAsn: 0.242 ± 0.018
0.352CysPro: 0.352 ± 0.021
0.247CysGln: 0.247 ± 0.019
0.331CysArg: 0.331 ± 0.019
0.426CysSer: 0.426 ± 0.026
0.345CysThr: 0.345 ± 0.024
0.368CysVal: 0.368 ± 0.027
0.059CysTrp: 0.059 ± 0.009
0.265CysTyr: 0.265 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
3.127AspAla: 3.127 ± 0.072
0.354AspCys: 0.354 ± 0.02
2.945AspAsp: 2.945 ± 0.059
4.999AspGlu: 4.999 ± 0.093
2.39AspPhe: 2.39 ± 0.054
3.519AspGly: 3.519 ± 0.088
1.689AspHis: 1.689 ± 0.053
4.183AspIle: 4.183 ± 0.074
2.496AspLys: 2.496 ± 0.049
5.306AspLeu: 5.306 ± 0.073
1.486AspMet: 1.486 ± 0.04
1.47AspAsn: 1.47 ± 0.04
2.465AspPro: 2.465 ± 0.056
2.992AspGln: 2.992 ± 0.064
2.75AspArg: 2.75 ± 0.054
2.441AspSer: 2.441 ± 0.053
2.364AspThr: 2.364 ± 0.048
4.006AspVal: 4.006 ± 0.07
0.662AspTrp: 0.662 ± 0.03
2.218AspTyr: 2.218 ± 0.055
0.0AspXaa: 0.0 ± 0.0
Glu
5.116GluAla: 5.116 ± 0.089
0.389GluCys: 0.389 ± 0.029
4.152GluAsp: 4.152 ± 0.083
7.638GluGlu: 7.638 ± 0.122
2.433GluPhe: 2.433 ± 0.058
4.603GluGly: 4.603 ± 0.09
1.983GluHis: 1.983 ± 0.051
5.613GluIle: 5.613 ± 0.079
7.045GluLys: 7.045 ± 0.116
6.868GluLeu: 6.868 ± 0.098
2.518GluMet: 2.518 ± 0.057
3.81GluAsn: 3.81 ± 0.075
2.274GluPro: 2.274 ± 0.049
3.876GluGln: 3.876 ± 0.074
3.846GluArg: 3.846 ± 0.069
3.438GluSer: 3.438 ± 0.07
4.026GluThr: 4.026 ± 0.078
5.145GluVal: 5.145 ± 0.078
0.998GluTrp: 0.998 ± 0.036
2.428GluTyr: 2.428 ± 0.056
0.0GluXaa: 0.0 ± 0.0
Phe
2.799PheAla: 2.799 ± 0.061
0.308PheCys: 0.308 ± 0.023
2.387PheAsp: 2.387 ± 0.058
2.892PheGlu: 2.892 ± 0.054
2.594PhePhe: 2.594 ± 0.073
3.255PheGly: 3.255 ± 0.071
1.265PheHis: 1.265 ± 0.04
3.677PheIle: 3.677 ± 0.077
2.248PheLys: 2.248 ± 0.051
4.676PheLeu: 4.676 ± 0.112
1.326PheMet: 1.326 ± 0.047
1.564PheAsn: 1.564 ± 0.043
1.719PhePro: 1.719 ± 0.052
1.948PheGln: 1.948 ± 0.047
1.915PheArg: 1.915 ± 0.05
3.139PheSer: 3.139 ± 0.068
2.359PheThr: 2.359 ± 0.054
3.151PheVal: 3.151 ± 0.069
0.503PheTrp: 0.503 ± 0.027
1.826PheTyr: 1.826 ± 0.057
0.0PheXaa: 0.0 ± 0.0
Gly
4.65GlyAla: 4.65 ± 0.101
0.585GlyCys: 0.585 ± 0.029
3.441GlyAsp: 3.441 ± 0.062
4.767GlyGlu: 4.767 ± 0.082
3.367GlyPhe: 3.367 ± 0.072
4.629GlyGly: 4.629 ± 0.103
1.478GlyHis: 1.478 ± 0.054
5.77GlyIle: 5.77 ± 0.091
5.191GlyLys: 5.191 ± 0.082
6.464GlyLeu: 6.464 ± 0.082
2.228GlyMet: 2.228 ± 0.052
2.668GlyAsn: 2.668 ± 0.069
1.917GlyPro: 1.917 ± 0.063
2.224GlyGln: 2.224 ± 0.053
2.826GlyArg: 2.826 ± 0.059
3.768GlySer: 3.768 ± 0.075
3.932GlyThr: 3.932 ± 0.087
4.87GlyVal: 4.87 ± 0.097
0.769GlyTrp: 0.769 ± 0.03
3.01GlyTyr: 3.01 ± 0.068
0.0GlyXaa: 0.0 ± 0.0
His
1.529HisAla: 1.529 ± 0.041
0.179HisCys: 0.179 ± 0.016
1.274HisAsp: 1.274 ± 0.04
1.67HisGlu: 1.67 ± 0.047
1.196HisPhe: 1.196 ± 0.041
1.483HisGly: 1.483 ± 0.041
0.926HisHis: 0.926 ± 0.033
1.849HisIle: 1.849 ± 0.051
1.138HisLys: 1.138 ± 0.038
2.578HisLeu: 2.578 ± 0.056
0.652HisMet: 0.652 ± 0.028
0.732HisAsn: 0.732 ± 0.031
1.495HisPro: 1.495 ± 0.044
1.162HisGln: 1.162 ± 0.038
1.082HisArg: 1.082 ± 0.039
1.353HisSer: 1.353 ± 0.045
1.188HisThr: 1.188 ± 0.036
1.702HisVal: 1.702 ± 0.054
0.208HisTrp: 0.208 ± 0.016
1.013HisTyr: 1.013 ± 0.035
0.0HisXaa: 0.0 ± 0.0
Ile
5.459IleAla: 5.459 ± 0.102
0.557IleCys: 0.557 ± 0.031
4.38IleAsp: 4.38 ± 0.074
5.633IleGlu: 5.633 ± 0.084
3.381IlePhe: 3.381 ± 0.084
5.608IleGly: 5.608 ± 0.096
2.114IleHis: 2.114 ± 0.059
5.365IleIle: 5.365 ± 0.102
4.143IleLys: 4.143 ± 0.072
7.232IleLeu: 7.232 ± 0.117
1.807IleMet: 1.807 ± 0.054
2.69IleAsn: 2.69 ± 0.061
3.637IlePro: 3.637 ± 0.072
3.667IleGln: 3.667 ± 0.071
3.562IleArg: 3.562 ± 0.07
4.835IleSer: 4.835 ± 0.078
4.078IleThr: 4.078 ± 0.072
5.21IleVal: 5.21 ± 0.08
0.713IleTrp: 0.713 ± 0.031
2.753IleTyr: 2.753 ± 0.053
0.0IleXaa: 0.0 ± 0.0
Lys
4.08LysAla: 4.08 ± 0.073
0.296LysCys: 0.296 ± 0.022
3.905LysAsp: 3.905 ± 0.081
6.705LysGlu: 6.705 ± 0.098
1.758LysPhe: 1.758 ± 0.048
4.508LysGly: 4.508 ± 0.081
1.64LysHis: 1.64 ± 0.046
4.486LysIle: 4.486 ± 0.082
6.032LysLys: 6.032 ± 0.099
5.166LysLeu: 5.166 ± 0.09
2.098LysMet: 2.098 ± 0.049
3.243LysAsn: 3.243 ± 0.067
2.416LysPro: 2.416 ± 0.049
3.376LysGln: 3.376 ± 0.075
3.59LysArg: 3.59 ± 0.069
3.079LysSer: 3.079 ± 0.062
3.555LysThr: 3.555 ± 0.068
4.063LysVal: 4.063 ± 0.082
0.842LysTrp: 0.842 ± 0.034
2.102LysTyr: 2.102 ± 0.06
0.0LysXaa: 0.0 ± 0.0
Leu
6.678LeuAla: 6.678 ± 0.1
0.524LeuCys: 0.524 ± 0.028
4.933LeuAsp: 4.933 ± 0.088
6.744LeuGlu: 6.744 ± 0.098
4.966LeuPhe: 4.966 ± 0.107
6.219LeuGly: 6.219 ± 0.112
2.082LeuHis: 2.082 ± 0.052
7.192LeuIle: 7.192 ± 0.118
7.122LeuLys: 7.122 ± 0.109
9.564LeuLeu: 9.564 ± 0.163
2.617LeuMet: 2.617 ± 0.053
4.402LeuAsn: 4.402 ± 0.082
3.826LeuPro: 3.826 ± 0.07
3.277LeuGln: 3.277 ± 0.067
3.699LeuArg: 3.699 ± 0.072
6.521LeuSer: 6.521 ± 0.103
5.653LeuThr: 5.653 ± 0.091
5.67LeuVal: 5.67 ± 0.085
0.863LeuTrp: 0.863 ± 0.033
3.309LeuTyr: 3.309 ± 0.071
0.0LeuXaa: 0.0 ± 0.0
Met
2.245MetAla: 2.245 ± 0.066
0.137MetCys: 0.137 ± 0.013
1.755MetAsp: 1.755 ± 0.055
2.357MetGlu: 2.357 ± 0.047
1.226MetPhe: 1.226 ± 0.04
1.854MetGly: 1.854 ± 0.053
0.413MetHis: 0.413 ± 0.019
2.493MetIle: 2.493 ± 0.059
2.777MetLys: 2.777 ± 0.063
2.546MetLeu: 2.546 ± 0.062
1.021MetMet: 1.021 ± 0.043
1.698MetAsn: 1.698 ± 0.048
1.038MetPro: 1.038 ± 0.04
0.833MetGln: 0.833 ± 0.032
1.079MetArg: 1.079 ± 0.037
1.496MetSer: 1.496 ± 0.039
1.705MetThr: 1.705 ± 0.043
1.833MetVal: 1.833 ± 0.05
0.187MetTrp: 0.187 ± 0.015
0.782MetTyr: 0.782 ± 0.029
0.0MetXaa: 0.0 ± 0.0
Asn
2.263AsnAla: 2.263 ± 0.053
0.26AsnCys: 0.26 ± 0.018
2.075AsnAsp: 2.075 ± 0.055
3.252AsnGlu: 3.252 ± 0.068
1.416AsnPhe: 1.416 ± 0.045
2.97AsnGly: 2.97 ± 0.067
1.2AsnHis: 1.2 ± 0.043
3.266AsnIle: 3.266 ± 0.064
2.302AsnLys: 2.302 ± 0.052
3.584AsnLeu: 3.584 ± 0.069
1.266AsnMet: 1.266 ± 0.039
1.502AsnAsn: 1.502 ± 0.048
2.089AsnPro: 2.089 ± 0.049
2.285AsnGln: 2.285 ± 0.046
2.355AsnArg: 2.355 ± 0.053
1.765AsnSer: 1.765 ± 0.047
1.791AsnThr: 1.791 ± 0.045
2.731AsnVal: 2.731 ± 0.059
0.551AsnTrp: 0.551 ± 0.029
1.354AsnTyr: 1.354 ± 0.041
0.0AsnXaa: 0.0 ± 0.0
Pro
2.369ProAla: 2.369 ± 0.058
0.217ProCys: 0.217 ± 0.016
2.828ProAsp: 2.828 ± 0.058
3.793ProGlu: 3.793 ± 0.071
2.137ProPhe: 2.137 ± 0.047
2.92ProGly: 2.92 ± 0.065
0.882ProHis: 0.882 ± 0.033
2.603ProIle: 2.603 ± 0.054
2.05ProLys: 2.05 ± 0.06
3.564ProLeu: 3.564 ± 0.086
0.804ProMet: 0.804 ± 0.034
1.362ProAsn: 1.362 ± 0.039
1.115ProPro: 1.115 ± 0.039
1.037ProGln: 1.037 ± 0.04
1.299ProArg: 1.299 ± 0.042
2.194ProSer: 2.194 ± 0.057
1.569ProThr: 1.569 ± 0.044
3.351ProVal: 3.351 ± 0.068
0.393ProTrp: 0.393 ± 0.025
1.632ProTyr: 1.632 ± 0.05
0.0ProXaa: 0.0 ± 0.0
Gln
2.857GlnAla: 2.857 ± 0.061
0.158GlnCys: 0.158 ± 0.014
1.862GlnAsp: 1.862 ± 0.048
3.305GlnGlu: 3.305 ± 0.079
1.624GlnPhe: 1.624 ± 0.044
2.381GlnGly: 2.381 ± 0.064
0.92GlnHis: 0.92 ± 0.032
2.916GlnIle: 2.916 ± 0.062
3.316GlnLys: 3.316 ± 0.068
4.184GlnLeu: 4.184 ± 0.072
1.462GlnMet: 1.462 ± 0.042
1.84GlnAsn: 1.84 ± 0.049
1.346GlnPro: 1.346 ± 0.04
1.84GlnGln: 1.84 ± 0.068
1.617GlnArg: 1.617 ± 0.057
2.306GlnSer: 2.306 ± 0.06
2.324GlnThr: 2.324 ± 0.057
2.602GlnVal: 2.602 ± 0.063
0.463GlnTrp: 0.463 ± 0.023
1.508GlnTyr: 1.508 ± 0.052
0.0GlnXaa: 0.0 ± 0.0
Arg
2.596ArgAla: 2.596 ± 0.053
0.257ArgCys: 0.257 ± 0.02
2.33ArgAsp: 2.33 ± 0.057
3.898ArgGlu: 3.898 ± 0.068
2.108ArgPhe: 2.108 ± 0.041
2.683ArgGly: 2.683 ± 0.055
1.036ArgHis: 1.036 ± 0.034
3.323ArgIle: 3.323 ± 0.073
3.806ArgLys: 3.806 ± 0.077
4.444ArgLeu: 4.444 ± 0.093
1.543ArgMet: 1.543 ± 0.042
2.116ArgAsn: 2.116 ± 0.059
1.604ArgPro: 1.604 ± 0.047
1.841ArgGln: 1.841 ± 0.047
2.235ArgArg: 2.235 ± 0.055
2.389ArgSer: 2.389 ± 0.066
2.355ArgThr: 2.355 ± 0.057
2.893ArgVal: 2.893 ± 0.056
0.432ArgTrp: 0.432 ± 0.023
1.829ArgTyr: 1.829 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
3.371SerAla: 3.371 ± 0.067
0.372SerCys: 0.372 ± 0.02
2.867SerAsp: 2.867 ± 0.055
3.726SerGlu: 3.726 ± 0.077
3.153SerPhe: 3.153 ± 0.057
4.274SerGly: 4.274 ± 0.08
1.314SerHis: 1.314 ± 0.04
4.622SerIle: 4.622 ± 0.084
3.04SerLys: 3.04 ± 0.065
5.906SerLeu: 5.906 ± 0.088
1.748SerMet: 1.748 ± 0.047
1.925SerAsn: 1.925 ± 0.05
2.146SerPro: 2.146 ± 0.052
1.917SerGln: 1.917 ± 0.049
2.834SerArg: 2.834 ± 0.064
3.73SerSer: 3.73 ± 0.076
2.64SerThr: 2.64 ± 0.063
3.89SerVal: 3.89 ± 0.068
0.653SerTrp: 0.653 ± 0.03
2.134SerTyr: 2.134 ± 0.049
0.0SerXaa: 0.0 ± 0.0
Thr
3.867ThrAla: 3.867 ± 0.104
0.325ThrCys: 0.325 ± 0.019
2.803ThrAsp: 2.803 ± 0.065
3.423ThrGlu: 3.423 ± 0.072
2.661ThrPhe: 2.661 ± 0.058
4.376ThrGly: 4.376 ± 0.078
1.056ThrHis: 1.056 ± 0.038
4.38ThrIle: 4.38 ± 0.077
2.807ThrLys: 2.807 ± 0.056
4.875ThrLeu: 4.875 ± 0.083
1.337ThrMet: 1.337 ± 0.045
1.963ThrAsn: 1.963 ± 0.045
2.166ThrPro: 2.166 ± 0.05
1.247ThrGln: 1.247 ± 0.041
2.166ThrArg: 2.166 ± 0.046
3.023ThrSer: 3.023 ± 0.053
2.504ThrThr: 2.504 ± 0.067
3.954ThrVal: 3.954 ± 0.081
0.523ThrTrp: 0.523 ± 0.029
2.055ThrTyr: 2.055 ± 0.059
0.0ThrXaa: 0.0 ± 0.0
Val
4.328ValAla: 4.328 ± 0.076
0.535ValCys: 0.535 ± 0.029
3.526ValAsp: 3.526 ± 0.069
4.845ValGlu: 4.845 ± 0.086
3.245ValPhe: 3.245 ± 0.08
4.147ValGly: 4.147 ± 0.082
1.578ValHis: 1.578 ± 0.04
5.762ValIle: 5.762 ± 0.09
4.506ValLys: 4.506 ± 0.076
6.814ValLeu: 6.814 ± 0.097
2.098ValMet: 2.098 ± 0.057
2.896ValAsn: 2.896 ± 0.055
2.748ValPro: 2.748 ± 0.061
2.629ValGln: 2.629 ± 0.053
2.917ValArg: 2.917 ± 0.06
4.165ValSer: 4.165 ± 0.066
3.987ValThr: 3.987 ± 0.083
4.737ValVal: 4.737 ± 0.077
0.616ValTrp: 0.616 ± 0.027
2.439ValTyr: 2.439 ± 0.054
0.0ValXaa: 0.0 ± 0.0
Trp
0.515TrpAla: 0.515 ± 0.026
0.064TrpCys: 0.064 ± 0.008
0.54TrpAsp: 0.54 ± 0.027
0.665TrpGlu: 0.665 ± 0.028
0.568TrpPhe: 0.568 ± 0.034
0.683TrpGly: 0.683 ± 0.034
0.204TrpHis: 0.204 ± 0.014
0.972TrpIle: 0.972 ± 0.038
0.822TrpLys: 0.822 ± 0.031
1.194TrpLeu: 1.194 ± 0.034
0.414TrpMet: 0.414 ± 0.027
0.557TrpAsn: 0.557 ± 0.027
0.319TrpPro: 0.319 ± 0.024
0.315TrpGln: 0.315 ± 0.017
0.38TrpArg: 0.38 ± 0.02
0.559TrpSer: 0.559 ± 0.025
0.505TrpThr: 0.505 ± 0.028
0.704TrpVal: 0.704 ± 0.03
0.139TrpTrp: 0.139 ± 0.013
0.387TrpTyr: 0.387 ± 0.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.135TyrAla: 2.135 ± 0.05
0.244TyrCys: 0.244 ± 0.017
2.213TyrAsp: 2.213 ± 0.086
2.904TyrGlu: 2.904 ± 0.061
1.94TyrPhe: 1.94 ± 0.055
2.579TyrGly: 2.579 ± 0.061
1.091TyrHis: 1.091 ± 0.041
2.579TyrIle: 2.579 ± 0.066
1.859TyrLys: 1.859 ± 0.053
3.534TyrLeu: 3.534 ± 0.075
0.962TyrMet: 0.962 ± 0.037
1.313TyrAsn: 1.313 ± 0.041
1.568TyrPro: 1.568 ± 0.037
1.846TyrGln: 1.846 ± 0.047
1.84TyrArg: 1.84 ± 0.044
1.978TyrSer: 1.978 ± 0.054
1.801TyrThr: 1.801 ± 0.043
2.416TyrVal: 2.416 ± 0.057
0.418TyrTrp: 0.418 ± 0.025
1.495TyrTyr: 1.495 ± 0.042
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3015 proteins (846811 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski