Amino acid dipeptide frequency for Halobacillus salinus

In addition to single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
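As an illustration of how such a frequency can be computed, here is a minimal Python sketch. It is not the code used by Proteome-pI; the function name `dipeptide_frequencies`, the one-letter alphabet, the mapping of non-standard residues to `X`, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter

# One-letter codes for the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all given sequences.

    Overlapping pairs are counted within each sequence; a pair never spans
    two proteins. Residues outside the 20 standard ones are mapped to 'X'
    (an assumption that mirrors the Xaa column of the table).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real Halobacillus salinus sequences.
freqs = dipeptide_frequencies(["MKLLA", "MKA"])
```

In the toy input there are six dipeptide positions in total, and `MK` occupies two of them.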

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
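The expected value and the over/under classification can be sketched in a few lines of Python. The function names and the strict comparison against the uniform expectation are assumptions for illustration; the page does not state the exact threshold used for the colouring.

```python
def expected_per_mille(alphabet_size=20):
    """Expected frequency (in per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)

def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (hypothetical rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"

# Values taken from the table: LeuLeu = 9.822, CysCys = 0.089 (per mille).
leu_leu = classify(9.822)
cys_cys = classify(0.089)
```

With 20 amino acids the expectation is 1000/400 = 2.5‰; with Xaa included it drops to 1000/441, roughly 2.27‰.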

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
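For readers who prefer fractions or percentages, the unit conversion is a one-liner each. The helper names below are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

# AlaAla is listed as 5.375 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(5.375)
ala_ala_percent = per_mille_to_percent(5.375)
```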

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.375 ± 0.082
AlaCys: 0.541 ± 0.021
AlaAsp: 3.656 ± 0.055
AlaGlu: 4.885 ± 0.072
AlaPhe: 3.508 ± 0.064
AlaGly: 5.214 ± 0.081
AlaHis: 1.315 ± 0.035
AlaIle: 5.273 ± 0.072
AlaLys: 4.032 ± 0.071
AlaLeu: 7.329 ± 0.096
AlaMet: 2.209 ± 0.046
AlaAsn: 2.42 ± 0.05
AlaPro: 2.134 ± 0.051
AlaGln: 2.229 ± 0.036
AlaArg: 2.753 ± 0.058
AlaSer: 4.448 ± 0.071
AlaThr: 3.826 ± 0.188
AlaVal: 5.588 ± 0.084
AlaTrp: 0.69 ± 0.028
AlaTyr: 2.404 ± 0.053
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.354 ± 0.018
CysCys: 0.089 ± 0.01
CysAsp: 0.359 ± 0.017
CysGlu: 0.4 ± 0.022
CysPhe: 0.276 ± 0.018
CysGly: 0.54 ± 0.024
CysHis: 0.191 ± 0.014
CysIle: 0.377 ± 0.017
CysLys: 0.272 ± 0.018
CysLeu: 0.569 ± 0.022
CysMet: 0.172 ± 0.012
CysAsn: 0.212 ± 0.013
CysPro: 0.303 ± 0.017
CysGln: 0.227 ± 0.016
CysArg: 0.255 ± 0.015
CysSer: 0.415 ± 0.02
CysThr: 0.325 ± 0.017
CysVal: 0.35 ± 0.019
CysTrp: 0.049 ± 0.007
CysTyr: 0.23 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.615 ± 0.064
AspCys: 0.319 ± 0.019
AspAsp: 2.976 ± 0.06
AspGlu: 5.005 ± 0.069
AspPhe: 2.477 ± 0.05
AspGly: 3.719 ± 0.074
AspHis: 1.52 ± 0.039
AspIle: 3.7 ± 0.059
AspLys: 2.946 ± 0.058
AspLeu: 5.44 ± 0.089
AspMet: 1.491 ± 0.037
AspAsn: 1.691 ± 0.038
AspPro: 2.27 ± 0.044
AspGln: 2.881 ± 0.064
AspArg: 2.824 ± 0.054
AspSer: 2.815 ± 0.046
AspThr: 2.633 ± 0.048
AspVal: 4.383 ± 0.07
AspTrp: 0.771 ± 0.028
AspTyr: 2.314 ± 0.049
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.253 ± 0.096
GluCys: 0.358 ± 0.021
GluAsp: 4.96 ± 0.08
GluGlu: 8.941 ± 0.133
GluPhe: 2.513 ± 0.047
GluGly: 5.145 ± 0.08
GluHis: 1.741 ± 0.038
GluIle: 4.624 ± 0.074
GluLys: 6.131 ± 0.092
GluLeu: 7.042 ± 0.086
GluMet: 2.532 ± 0.05
GluAsn: 3.332 ± 0.065
GluPro: 2.315 ± 0.049
GluGln: 4.019 ± 0.069
GluArg: 4.205 ± 0.076
GluSer: 4.164 ± 0.074
GluThr: 4.061 ± 0.059
GluVal: 5.767 ± 0.09
GluTrp: 1.067 ± 0.032
GluTyr: 2.251 ± 0.049
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.006 ± 0.054
PheCys: 0.297 ± 0.017
PheAsp: 2.575 ± 0.049
PheGlu: 3.096 ± 0.057
PhePhe: 2.458 ± 0.056
PheGly: 3.413 ± 0.07
PheHis: 1.104 ± 0.033
PheIle: 3.354 ± 0.071
PheLys: 2.026 ± 0.042
PheLeu: 4.642 ± 0.092
PheMet: 1.199 ± 0.039
PheAsn: 1.535 ± 0.033
PhePro: 1.672 ± 0.039
PheGln: 1.799 ± 0.045
PheArg: 1.738 ± 0.045
PheSer: 3.142 ± 0.059
PheThr: 2.677 ± 0.045
PheVal: 3.23 ± 0.057
PheTrp: 0.501 ± 0.025
PheTyr: 1.711 ± 0.044
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.359 ± 0.2
GlyCys: 0.543 ± 0.025
GlyAsp: 3.642 ± 0.068
GlyGlu: 4.973 ± 0.075
GlyPhe: 3.415 ± 0.06
GlyGly: 5.035 ± 0.094
GlyHis: 1.45 ± 0.036
GlyIle: 5.301 ± 0.082
GlyLys: 4.227 ± 0.072
GlyLeu: 6.805 ± 0.092
GlyMet: 2.292 ± 0.051
GlyAsn: 2.429 ± 0.05
GlyPro: 1.88 ± 0.051
GlyGln: 2.33 ± 0.049
GlyArg: 2.896 ± 0.055
GlySer: 4.091 ± 0.061
GlyThr: 4.057 ± 0.063
GlyVal: 5.662 ± 0.103
GlyTrp: 0.872 ± 0.031
GlyTyr: 2.854 ± 0.044
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.352 ± 0.034
HisCys: 0.182 ± 0.015
HisAsp: 1.174 ± 0.036
HisGlu: 1.597 ± 0.046
HisPhe: 1.107 ± 0.033
HisGly: 1.462 ± 0.038
HisHis: 0.797 ± 0.034
HisIle: 1.477 ± 0.04
HisLys: 1.092 ± 0.031
HisLeu: 2.336 ± 0.045
HisMet: 0.608 ± 0.024
HisAsn: 0.814 ± 0.03
HisPro: 1.24 ± 0.032
HisGln: 1.034 ± 0.032
HisArg: 1.03 ± 0.034
HisSer: 1.288 ± 0.036
HisThr: 1.103 ± 0.028
HisVal: 1.548 ± 0.041
HisTrp: 0.258 ± 0.015
HisTyr: 0.927 ± 0.034
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.511 ± 0.073
IleCys: 0.462 ± 0.02
IleAsp: 4.105 ± 0.058
IleGlu: 5.404 ± 0.076
IlePhe: 2.867 ± 0.058
IleGly: 5.711 ± 0.081
IleHis: 1.681 ± 0.045
IleIle: 4.658 ± 0.087
IleLys: 3.262 ± 0.062
IleLeu: 6.141 ± 0.101
IleMet: 1.653 ± 0.04
IleAsn: 2.432 ± 0.052
IlePro: 3.02 ± 0.056
IleGln: 2.842 ± 0.052
IleArg: 2.993 ± 0.058
IleSer: 4.321 ± 0.056
IleThr: 3.747 ± 0.06
IleVal: 5.048 ± 0.08
IleTrp: 0.562 ± 0.025
IleTyr: 2.108 ± 0.046
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.149 ± 0.068
LysCys: 0.234 ± 0.015
LysAsp: 3.468 ± 0.062
LysGlu: 6.526 ± 0.089
LysPhe: 1.605 ± 0.046
LysGly: 4.176 ± 0.067
LysHis: 1.316 ± 0.038
LysIle: 2.935 ± 0.051
LysLys: 4.701 ± 0.077
LysLeu: 4.857 ± 0.068
LysMet: 1.803 ± 0.042
LysAsn: 2.484 ± 0.05
LysPro: 2.12 ± 0.045
LysGln: 3.302 ± 0.059
LysArg: 3.417 ± 0.063
LysSer: 3.183 ± 0.058
LysThr: 2.963 ± 0.053
LysVal: 4.248 ± 0.062
LysTrp: 0.777 ± 0.03
LysTyr: 1.732 ± 0.04
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.095 ± 0.081
LeuCys: 0.515 ± 0.027
LeuAsp: 5.177 ± 0.074
LeuGlu: 7.097 ± 0.081
LeuPhe: 4.947 ± 0.089
LeuGly: 6.495 ± 0.089
LeuHis: 2.124 ± 0.044
LeuIle: 6.576 ± 0.093
LeuLys: 5.622 ± 0.086
LeuLeu: 9.822 ± 0.139
LeuMet: 2.743 ± 0.058
LeuAsn: 3.909 ± 0.057
LeuPro: 3.963 ± 0.062
LeuGln: 3.588 ± 0.065
LeuArg: 3.784 ± 0.065
LeuSer: 6.931 ± 0.087
LeuThr: 5.455 ± 0.074
LeuVal: 6.684 ± 0.099
LeuTrp: 0.899 ± 0.028
LeuTyr: 3.13 ± 0.058
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.023 ± 0.036
MetCys: 0.12 ± 0.011
MetAsp: 1.701 ± 0.037
MetGlu: 2.299 ± 0.05
MetPhe: 1.105 ± 0.032
MetGly: 1.865 ± 0.052
MetHis: 0.441 ± 0.017
MetIle: 2.102 ± 0.041
MetLys: 2.546 ± 0.052
MetLeu: 2.515 ± 0.051
MetMet: 0.994 ± 0.032
MetAsn: 1.707 ± 0.04
MetPro: 1.082 ± 0.029
MetGln: 1.03 ± 0.03
MetArg: 1.2 ± 0.036
MetSer: 1.819 ± 0.042
MetThr: 1.89 ± 0.039
MetVal: 2.091 ± 0.051
MetTrp: 0.205 ± 0.014
MetTyr: 0.744 ± 0.026
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.424 ± 0.048
AsnCys: 0.195 ± 0.013
AsnAsp: 2.19 ± 0.041
AsnGlu: 3.12 ± 0.057
AsnPhe: 1.407 ± 0.033
AsnGly: 2.893 ± 0.062
AsnHis: 0.943 ± 0.03
AsnIle: 2.654 ± 0.057
AsnLys: 2.178 ± 0.042
AsnLeu: 3.243 ± 0.052
AsnMet: 1.095 ± 0.031
AsnAsn: 1.42 ± 0.039
AsnPro: 1.912 ± 0.042
AsnGln: 2.056 ± 0.048
AsnArg: 2.001 ± 0.044
AsnSer: 1.902 ± 0.043
AsnThr: 1.881 ± 0.045
AsnVal: 2.861 ± 0.055
AsnTrp: 0.464 ± 0.018
AsnTyr: 1.309 ± 0.036
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.382 ± 0.052
ProCys: 0.202 ± 0.015
ProAsp: 2.381 ± 0.041
ProGlu: 3.42 ± 0.059
ProPhe: 2.073 ± 0.05
ProGly: 2.374 ± 0.054
ProHis: 0.863 ± 0.03
ProIle: 2.526 ± 0.054
ProLys: 2.02 ± 0.046
ProLeu: 3.565 ± 0.059
ProMet: 0.965 ± 0.033
ProAsn: 1.506 ± 0.039
ProPro: 1.007 ± 0.033
ProGln: 1.055 ± 0.032
ProArg: 1.115 ± 0.032
ProSer: 2.497 ± 0.042
ProThr: 1.939 ± 0.052
ProVal: 3.158 ± 0.061
ProTrp: 0.422 ± 0.022
ProTyr: 1.482 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.039 ± 0.047
GlnCys: 0.162 ± 0.013
GlnAsp: 2.16 ± 0.048
GlnGlu: 3.505 ± 0.066
GlnPhe: 1.646 ± 0.04
GlnGly: 2.524 ± 0.049
GlnHis: 0.829 ± 0.029
GlnIle: 2.366 ± 0.049
GlnLys: 2.621 ± 0.048
GlnLeu: 4.26 ± 0.066
GlnMet: 1.387 ± 0.04
GlnAsn: 1.542 ± 0.037
GlnPro: 1.491 ± 0.039
GlnGln: 2.105 ± 0.058
GlnArg: 1.661 ± 0.046
GlnSer: 2.659 ± 0.055
GlnThr: 2.299 ± 0.045
GlnVal: 2.865 ± 0.049
GlnTrp: 0.569 ± 0.02
GlnTyr: 1.282 ± 0.034
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.554 ± 0.056
ArgCys: 0.202 ± 0.012
ArgAsp: 2.309 ± 0.053
ArgGlu: 3.708 ± 0.062
ArgPhe: 2.076 ± 0.048
ArgGly: 2.553 ± 0.044
ArgHis: 0.91 ± 0.03
ArgIle: 3.021 ± 0.058
ArgLys: 3.2 ± 0.053
ArgLeu: 4.343 ± 0.077
ArgMet: 1.512 ± 0.037
ArgAsn: 1.839 ± 0.044
ArgPro: 1.444 ± 0.035
ArgGln: 1.749 ± 0.043
ArgArg: 2.065 ± 0.046
ArgSer: 2.654 ± 0.051
ArgThr: 2.356 ± 0.051
ArgVal: 2.984 ± 0.05
ArgTrp: 0.478 ± 0.022
ArgTyr: 1.636 ± 0.034
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.637 ± 0.059
SerCys: 0.339 ± 0.018
SerAsp: 3.156 ± 0.057
SerGlu: 4.513 ± 0.075
SerPhe: 3.299 ± 0.064
SerGly: 4.471 ± 0.06
SerHis: 1.352 ± 0.036
SerIle: 4.715 ± 0.075
SerLys: 3.383 ± 0.06
SerLeu: 6.378 ± 0.074
SerMet: 1.95 ± 0.047
SerAsn: 2.304 ± 0.049
SerPro: 2.153 ± 0.047
SerGln: 2.215 ± 0.045
SerArg: 2.448 ± 0.053
SerSer: 4.152 ± 0.073
SerThr: 3.255 ± 0.059
SerVal: 4.425 ± 0.064
SerTrp: 0.69 ± 0.029
SerTyr: 2.382 ± 0.047
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.756 ± 0.059
ThrCys: 0.332 ± 0.018
ThrAsp: 2.878 ± 0.057
ThrGlu: 3.769 ± 0.062
ThrPhe: 2.755 ± 0.053
ThrGly: 4.46 ± 0.299
ThrHis: 1.076 ± 0.03
ThrIle: 4.305 ± 0.06
ThrLys: 2.98 ± 0.055
ThrLeu: 5.263 ± 0.068
ThrMet: 1.479 ± 0.034
ThrAsn: 2.094 ± 0.049
ThrPro: 2.292 ± 0.046
ThrGln: 1.57 ± 0.04
ThrArg: 1.96 ± 0.045
ThrSer: 3.335 ± 0.058
ThrThr: 2.989 ± 0.061
ThrVal: 4.205 ± 0.064
ThrTrp: 0.583 ± 0.024
ThrTyr: 2.074 ± 0.043
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.151 ± 0.081
ValCys: 0.565 ± 0.023
ValAsp: 4.235 ± 0.06
ValGlu: 5.734 ± 0.078
ValPhe: 3.246 ± 0.063
ValGly: 4.853 ± 0.068
ValHis: 1.631 ± 0.044
ValIle: 5.419 ± 0.077
ValLys: 4.162 ± 0.07
ValLeu: 7.425 ± 0.091
ValMet: 2.081 ± 0.047
ValAsn: 2.752 ± 0.056
ValPro: 2.895 ± 0.05
ValGln: 2.797 ± 0.048
ValArg: 3.033 ± 0.057
ValSer: 4.73 ± 0.072
ValThr: 4.351 ± 0.111
ValVal: 5.607 ± 0.084
ValTrp: 0.739 ± 0.026
ValTyr: 2.401 ± 0.044
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.634 ± 0.027
TrpCys: 0.073 ± 0.007
TrpAsp: 0.555 ± 0.019
TrpGlu: 0.691 ± 0.028
TrpPhe: 0.574 ± 0.023
TrpGly: 0.748 ± 0.027
TrpHis: 0.201 ± 0.016
TrpIle: 0.878 ± 0.03
TrpLys: 0.771 ± 0.026
TrpLeu: 1.268 ± 0.041
TrpMet: 0.444 ± 0.021
TrpAsn: 0.534 ± 0.021
TrpPro: 0.281 ± 0.018
TrpGln: 0.41 ± 0.018
TrpArg: 0.489 ± 0.021
TrpSer: 0.703 ± 0.026
TrpThr: 0.591 ± 0.024
TrpVal: 0.795 ± 0.025
TrpTrp: 0.174 ± 0.013
TrpTyr: 0.367 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.162 ± 0.048
TyrCys: 0.238 ± 0.015
TyrAsp: 2.13 ± 0.043
TyrGlu: 2.738 ± 0.054
TyrPhe: 1.825 ± 0.043
TyrGly: 2.46 ± 0.047
TyrHis: 0.941 ± 0.029
TyrIle: 2.268 ± 0.05
TyrLys: 1.82 ± 0.045
TyrLeu: 3.31 ± 0.061
TyrMet: 0.87 ± 0.028
TyrAsn: 1.193 ± 0.037
TyrPro: 1.478 ± 0.038
TyrGln: 1.733 ± 0.043
TyrArg: 1.728 ± 0.041
TyrSer: 1.969 ± 0.046
TyrThr: 1.756 ± 0.045
TyrVal: 2.307 ± 0.048
TyrTrp: 0.413 ± 0.019
TyrTyr: 1.365 ± 0.033
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3800 proteins (1081736 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
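A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled proteome, and the spread of those estimates is reported. This is only an illustration under stated assumptions: the function names are hypothetical, one-letter sequences are assumed as input, and using the standard deviation as the reported error is a guess, since the page does not say which statistic is shown.

```python
import random
from collections import Counter
from statistics import stdev

def count_pairs(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of a dipeptide's per-mille frequency over resampled proteomes.

    Resampling is done at the protein level: each resample draws
    len(sequences) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [count_pairs(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)

# Toy proteome, not real data.
err = bootstrap_error(["MKLLA", "MKA", "LLLL"], "LL")
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.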

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski