Amino acid dipeptide frequency for bacterium 1XD8-76

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
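As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping adjacent pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter sequence encoding, and the toy sequences are assumptions made for illustration; the exact counting procedure used for this page is not stated in the text.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Count overlapping adjacent pairs; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the 1XD8-76 proteome.
toy_proteome = ["MAAK", "MAL"]
freqs = dipeptide_per_mille(toy_proteome)
```

With these two toy sequences there are five dipeptides in total (MA, AA, AK, MA, AL), so the frequencies sum to 1000‰.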

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
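The arithmetic behind the expected value can be sketched as follows. The `classify` helper and the choice of the uniform 1/400 baseline as the threshold are assumptions for illustration; the page does not say exactly how the red and blue marking is decided, and this sketch ignores the error estimates. The four observed values are copied from the table.

```python
N_STANDARD = 20

# Uniform expectation if all 20 x 20 ordered pairs were equally likely.
uniform_percent = 100.0 / N_STANDARD ** 2       # 0.25 %
uniform_per_mille = 1000.0 / N_STANDARD ** 2    # 2.5 per mille
# Same expectation when Xaa is counted as a 21st residue (21 x 21 = 441 pairs).
uniform_per_mille_with_xaa = 1000.0 / 21 ** 2   # about 2.27 per mille


def classify(observed_per_mille, expected_per_mille=uniform_per_mille):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


# Values in per mille, copied from the table.
observed = {"AlaAla": 8.296, "LeuLeu": 9.131, "CysCys": 0.328, "TrpTrp": 0.128}
labels = {pair: classify(value) for pair, value in observed.items()}
```

Under this baseline AlaAla and LeuLeu come out as overrepresented, while CysCys and TrpTrp come out as underrepresented.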

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
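A small sketch of the unit conversion, using the AlaAla value from the table; the helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala_per_mille = 8.296  # AlaAla entry from the table
ala_ala_fraction = per_mille_to_fraction(ala_ala_per_mille)
ala_ala_percent = per_mille_to_percent(ala_ala_per_mille)
```

So 8.296‰ corresponds to a fraction of about 0.0083, or about 0.83%.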

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.296 ± 0.128
AlaCys: 1.195 ± 0.033
AlaAsp: 4.696 ± 0.071
AlaGlu: 6.716 ± 0.1
AlaPhe: 3.163 ± 0.066
AlaGly: 7.128 ± 0.086
AlaHis: 1.063 ± 0.031
AlaIle: 4.482 ± 0.067
AlaLys: 4.453 ± 0.073
AlaLeu: 7.26 ± 0.089
AlaMet: 2.288 ± 0.042
AlaAsn: 2.317 ± 0.045
AlaPro: 2.193 ± 0.046
AlaGln: 2.276 ± 0.046
AlaArg: 3.129 ± 0.057
AlaSer: 3.761 ± 0.063
AlaThr: 2.725 ± 0.058
AlaVal: 6.794 ± 0.096
AlaTrp: 0.663 ± 0.022
AlaTyr: 2.794 ± 0.046
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.134 ± 0.029
CysCys: 0.328 ± 0.019
CysAsp: 0.848 ± 0.03
CysGlu: 0.954 ± 0.028
CysPhe: 0.742 ± 0.027
CysGly: 1.746 ± 0.042
CysHis: 0.322 ± 0.018
CysIle: 1.157 ± 0.034
CysLys: 0.754 ± 0.027
CysLeu: 1.339 ± 0.033
CysMet: 0.518 ± 0.02
CysAsn: 0.579 ± 0.022
CysPro: 0.637 ± 0.028
CysGln: 0.38 ± 0.019
CysArg: 0.988 ± 0.031
CysSer: 0.93 ± 0.031
CysThr: 0.647 ± 0.029
CysVal: 1.073 ± 0.027
CysTrp: 0.124 ± 0.01
CysTyr: 0.662 ± 0.024
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.152 ± 0.075
AspCys: 0.898 ± 0.03
AspAsp: 2.653 ± 0.053
AspGlu: 4.32 ± 0.073
AspPhe: 2.592 ± 0.047
AspGly: 4.335 ± 0.082
AspHis: 0.849 ± 0.025
AspIle: 4.524 ± 0.068
AspLys: 3.191 ± 0.059
AspLeu: 4.418 ± 0.07
AspMet: 1.944 ± 0.036
AspAsn: 2.079 ± 0.046
AspPro: 1.764 ± 0.044
AspGln: 1.211 ± 0.034
AspArg: 3.016 ± 0.054
AspSer: 2.833 ± 0.051
AspThr: 2.868 ± 0.047
AspVal: 3.62 ± 0.058
AspTrp: 0.613 ± 0.026
AspTyr: 2.783 ± 0.052
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.275 ± 0.093
GluCys: 0.868 ± 0.031
GluAsp: 4.55 ± 0.079
GluGlu: 9.044 ± 0.125
GluPhe: 2.719 ± 0.045
GluGly: 4.915 ± 0.072
GluHis: 1.236 ± 0.034
GluIle: 5.926 ± 0.079
GluLys: 7.57 ± 0.094
GluLeu: 7.119 ± 0.089
GluMet: 2.77 ± 0.048
GluAsn: 4.267 ± 0.062
GluPro: 2.123 ± 0.053
GluGln: 2.933 ± 0.052
GluArg: 4.476 ± 0.08
GluSer: 3.703 ± 0.07
GluThr: 3.954 ± 0.059
GluVal: 4.542 ± 0.065
GluTrp: 0.805 ± 0.027
GluTyr: 3.381 ± 0.06
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.025 ± 0.053
PheCys: 0.884 ± 0.031
PheAsp: 2.469 ± 0.045
PheGlu: 2.688 ± 0.045
PhePhe: 2.022 ± 0.052
PheGly: 3.189 ± 0.059
PheHis: 0.791 ± 0.027
PheIle: 2.597 ± 0.055
PheLys: 1.686 ± 0.042
PheLeu: 4.443 ± 0.076
PheMet: 1.252 ± 0.032
PheAsn: 1.301 ± 0.036
PhePro: 1.491 ± 0.038
PheGln: 1.312 ± 0.032
PheArg: 2.075 ± 0.043
PheSer: 2.987 ± 0.057
PheThr: 2.15 ± 0.045
PheVal: 2.929 ± 0.058
PheTrp: 0.511 ± 0.022
PheTyr: 1.97 ± 0.047
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.403 ± 0.081
GlyCys: 1.301 ± 0.041
GlyAsp: 3.621 ± 0.058
GlyGlu: 5.833 ± 0.081
GlyPhe: 3.229 ± 0.045
GlyGly: 5.492 ± 0.085
GlyHis: 1.226 ± 0.031
GlyIle: 6.278 ± 0.075
GlyLys: 5.485 ± 0.064
GlyLeu: 5.918 ± 0.065
GlyMet: 2.73 ± 0.053
GlyAsn: 3.175 ± 0.061
GlyPro: 1.205 ± 0.037
GlyGln: 2.143 ± 0.039
GlyArg: 3.871 ± 0.056
GlySer: 4.133 ± 0.061
GlyThr: 3.998 ± 0.063
GlyVal: 5.117 ± 0.082
GlyTrp: 0.718 ± 0.025
GlyTyr: 3.365 ± 0.057
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.059 ± 0.03
HisCys: 0.293 ± 0.016
HisAsp: 0.767 ± 0.025
HisGlu: 1.031 ± 0.03
HisPhe: 0.794 ± 0.026
HisGly: 1.167 ± 0.032
HisHis: 0.367 ± 0.019
HisIle: 1.317 ± 0.032
HisLys: 0.849 ± 0.028
HisLeu: 1.478 ± 0.034
HisMet: 0.53 ± 0.019
HisAsn: 0.684 ± 0.023
HisPro: 0.782 ± 0.026
HisGln: 0.479 ± 0.019
HisArg: 0.888 ± 0.03
HisSer: 0.827 ± 0.026
HisThr: 0.86 ± 0.028
HisVal: 1.026 ± 0.027
HisTrp: 0.16 ± 0.012
HisTyr: 0.772 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.442 ± 0.072
IleCys: 1.433 ± 0.035
IleAsp: 3.835 ± 0.056
IleGlu: 4.703 ± 0.068
IlePhe: 3.04 ± 0.061
IleGly: 4.965 ± 0.064
IleHis: 1.197 ± 0.034
IleIle: 4.546 ± 0.082
IleLys: 3.635 ± 0.061
IleLeu: 7.12 ± 0.089
IleMet: 1.959 ± 0.042
IleAsn: 2.535 ± 0.047
IlePro: 2.952 ± 0.055
IleGln: 2.034 ± 0.042
IleArg: 3.971 ± 0.063
IleSer: 4.718 ± 0.066
IleThr: 3.898 ± 0.057
IleVal: 4.808 ± 0.066
IleTrp: 0.698 ± 0.027
IleTyr: 2.852 ± 0.055
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.75 ± 0.074
LysCys: 0.731 ± 0.027
LysAsp: 3.431 ± 0.063
LysGlu: 6.41 ± 0.085
LysPhe: 1.79 ± 0.04
LysGly: 3.897 ± 0.058
LysHis: 0.864 ± 0.028
LysIle: 4.443 ± 0.062
LysLys: 5.951 ± 0.086
LysLeu: 5.186 ± 0.073
LysMet: 2.094 ± 0.044
LysAsn: 3.29 ± 0.055
LysPro: 1.929 ± 0.038
LysGln: 2.045 ± 0.044
LysArg: 3.344 ± 0.057
LysSer: 3.088 ± 0.053
LysThr: 3.333 ± 0.054
LysVal: 3.934 ± 0.065
LysTrp: 0.637 ± 0.022
LysTyr: 2.693 ± 0.053
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.951 ± 0.084
LeuCys: 1.679 ± 0.047
LeuAsp: 4.7 ± 0.069
LeuGlu: 6.814 ± 0.086
LeuPhe: 4.322 ± 0.09
LeuGly: 5.806 ± 0.066
LeuHis: 1.54 ± 0.04
LeuIle: 5.771 ± 0.081
LeuLys: 5.556 ± 0.08
LeuLeu: 9.131 ± 0.139
LeuMet: 2.611 ± 0.049
LeuAsn: 3.445 ± 0.049
LeuPro: 3.58 ± 0.064
LeuGln: 3.09 ± 0.056
LeuArg: 4.491 ± 0.069
LeuSer: 6.408 ± 0.086
LeuThr: 4.835 ± 0.066
LeuVal: 5.257 ± 0.068
LeuTrp: 0.889 ± 0.033
LeuTyr: 3.648 ± 0.059
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.624 ± 0.05
MetCys: 0.363 ± 0.018
MetAsp: 1.858 ± 0.041
MetGlu: 2.959 ± 0.046
MetPhe: 0.984 ± 0.033
MetGly: 2.274 ± 0.044
MetHis: 0.464 ± 0.017
MetIle: 2.081 ± 0.039
MetLys: 2.394 ± 0.048
MetLeu: 2.913 ± 0.054
MetMet: 1.023 ± 0.031
MetAsn: 1.438 ± 0.035
MetPro: 1.142 ± 0.028
MetGln: 1.138 ± 0.035
MetArg: 1.664 ± 0.037
MetSer: 1.747 ± 0.045
MetThr: 1.739 ± 0.041
MetVal: 1.858 ± 0.042
MetTrp: 0.259 ± 0.013
MetTyr: 0.906 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.937 ± 0.052
AsnCys: 0.671 ± 0.025
AsnAsp: 1.901 ± 0.042
AsnGlu: 2.612 ± 0.056
AsnPhe: 1.601 ± 0.039
AsnGly: 3.296 ± 0.065
AsnHis: 0.658 ± 0.023
AsnIle: 3.245 ± 0.055
AsnLys: 2.02 ± 0.048
AsnLeu: 3.517 ± 0.054
AsnMet: 1.362 ± 0.033
AsnAsn: 1.684 ± 0.042
AsnPro: 1.749 ± 0.04
AsnGln: 1.137 ± 0.032
AsnArg: 2.299 ± 0.047
AsnSer: 2.157 ± 0.042
AsnThr: 2.136 ± 0.048
AsnVal: 2.733 ± 0.049
AsnTrp: 0.437 ± 0.022
AsnTyr: 1.834 ± 0.045
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.79 ± 0.055
ProCys: 0.469 ± 0.02
ProAsp: 2.316 ± 0.047
ProGlu: 3.667 ± 0.066
ProPhe: 1.557 ± 0.033
ProGly: 2.501 ± 0.054
ProHis: 0.535 ± 0.023
ProIle: 1.89 ± 0.035
ProLys: 1.61 ± 0.034
ProLeu: 2.666 ± 0.047
ProMet: 0.826 ± 0.031
ProAsn: 1.052 ± 0.029
ProPro: 0.805 ± 0.033
ProGln: 0.966 ± 0.03
ProArg: 1.052 ± 0.03
ProSer: 1.54 ± 0.037
ProThr: 1.393 ± 0.04
ProVal: 3.019 ± 0.056
ProTrp: 0.353 ± 0.017
ProTyr: 1.453 ± 0.036
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.42 ± 0.05
GlnCys: 0.325 ± 0.017
GlnAsp: 1.57 ± 0.033
GlnGlu: 2.947 ± 0.05
GlnPhe: 1.118 ± 0.029
GlnGly: 2.085 ± 0.044
GlnHis: 0.433 ± 0.019
GlnIle: 2.406 ± 0.042
GlnLys: 2.394 ± 0.051
GlnLeu: 2.384 ± 0.046
GlnMet: 1.206 ± 0.033
GlnAsn: 1.518 ± 0.038
GlnPro: 0.735 ± 0.027
GlnGln: 1.157 ± 0.037
GlnArg: 1.519 ± 0.036
GlnSer: 1.497 ± 0.033
GlnThr: 1.577 ± 0.037
GlnVal: 1.968 ± 0.04
GlnTrp: 0.304 ± 0.016
GlnTyr: 1.318 ± 0.034
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.311 ± 0.062
ArgCys: 0.69 ± 0.02
ArgAsp: 2.612 ± 0.054
ArgGlu: 5.427 ± 0.085
ArgPhe: 1.995 ± 0.041
ArgGly: 3.161 ± 0.059
ArgHis: 0.846 ± 0.028
ArgIle: 3.919 ± 0.063
ArgLys: 3.94 ± 0.066
ArgLeu: 4.376 ± 0.074
ArgMet: 1.852 ± 0.043
ArgAsn: 2.077 ± 0.043
ArgPro: 1.412 ± 0.038
ArgGln: 2.042 ± 0.04
ArgArg: 3.119 ± 0.061
ArgSer: 2.412 ± 0.049
ArgThr: 2.394 ± 0.047
ArgVal: 2.968 ± 0.054
ArgTrp: 0.466 ± 0.021
ArgTyr: 2.197 ± 0.049
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.375 ± 0.064
SerCys: 0.878 ± 0.026
SerAsp: 3.065 ± 0.052
SerGlu: 4.151 ± 0.056
SerPhe: 2.816 ± 0.05
SerGly: 5.428 ± 0.076
SerHis: 0.905 ± 0.028
SerIle: 3.791 ± 0.054
SerLys: 2.643 ± 0.054
SerLeu: 5.061 ± 0.071
SerMet: 1.824 ± 0.036
SerAsn: 1.872 ± 0.042
SerPro: 1.714 ± 0.043
SerGln: 1.453 ± 0.035
SerArg: 2.873 ± 0.059
SerSer: 3.267 ± 0.059
SerThr: 2.482 ± 0.05
SerVal: 4.331 ± 0.064
SerTrp: 0.558 ± 0.022
SerTyr: 2.424 ± 0.05
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.467 ± 0.071
ThrCys: 0.663 ± 0.022
ThrAsp: 3.042 ± 0.052
ThrGlu: 3.982 ± 0.061
ThrPhe: 1.985 ± 0.045
ThrGly: 4.726 ± 0.076
ThrHis: 0.698 ± 0.021
ThrIle: 3.342 ± 0.063
ThrLys: 2.749 ± 0.051
ThrLeu: 4.474 ± 0.062
ThrMet: 1.354 ± 0.033
ThrAsn: 1.724 ± 0.041
ThrPro: 1.965 ± 0.038
ThrGln: 1.278 ± 0.034
ThrArg: 1.918 ± 0.043
ThrSer: 2.448 ± 0.047
ThrThr: 2.388 ± 0.054
ThrVal: 4.175 ± 0.072
ThrTrp: 0.471 ± 0.021
ThrTyr: 1.93 ± 0.043
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.485 ± 0.071
ValCys: 1.308 ± 0.029
ValAsp: 3.546 ± 0.056
ValGlu: 4.851 ± 0.073
ValPhe: 3.014 ± 0.057
ValGly: 4.17 ± 0.063
ValHis: 1.075 ± 0.03
ValIle: 5.052 ± 0.078
ValLys: 4.216 ± 0.059
ValLeu: 6.767 ± 0.091
ValMet: 2.171 ± 0.044
ValAsn: 2.697 ± 0.05
ValPro: 2.542 ± 0.054
ValGln: 1.901 ± 0.035
ValArg: 3.433 ± 0.055
ValSer: 4.621 ± 0.067
ValThr: 3.774 ± 0.059
ValVal: 4.653 ± 0.07
ValTrp: 0.755 ± 0.028
ValTyr: 2.794 ± 0.057
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.594 ± 0.025
TrpCys: 0.183 ± 0.011
TrpAsp: 0.577 ± 0.024
TrpGlu: 0.871 ± 0.029
TrpPhe: 0.394 ± 0.02
TrpGly: 0.728 ± 0.026
TrpHis: 0.207 ± 0.013
TrpIle: 0.708 ± 0.028
TrpLys: 0.769 ± 0.028
TrpLeu: 0.948 ± 0.032
TrpMet: 0.307 ± 0.016
TrpAsn: 0.541 ± 0.023
TrpPro: 0.221 ± 0.014
TrpGln: 0.435 ± 0.023
TrpArg: 0.479 ± 0.022
TrpSer: 0.509 ± 0.025
TrpThr: 0.451 ± 0.02
TrpVal: 0.467 ± 0.022
TrpTrp: 0.128 ± 0.009
TrpTyr: 0.439 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.903 ± 0.051
TyrCys: 0.699 ± 0.023
TyrAsp: 2.606 ± 0.053
TyrGlu: 3.197 ± 0.051
TyrPhe: 1.896 ± 0.047
TyrGly: 3.232 ± 0.057
TyrHis: 0.857 ± 0.026
TyrIle: 2.865 ± 0.054
TyrLys: 1.972 ± 0.043
TyrLeu: 4.004 ± 0.072
TyrMet: 1.207 ± 0.036
TyrAsn: 1.709 ± 0.038
TyrPro: 1.517 ± 0.036
TyrGln: 1.474 ± 0.031
TyrArg: 2.615 ± 0.048
TyrSer: 2.321 ± 0.047
TyrThr: 2.179 ± 0.044
TyrVal: 2.553 ± 0.047
TyrTrp: 0.405 ± 0.021
TyrTyr: 2.007 ± 0.048
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3768 proteins (1202740 amino acids)
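For further processing, the table entries above can be read into a dictionary. The following is a minimal sketch; the regular expression and the `parse_entries` helper are assumptions made for illustration and are not part of Proteome-pI. The pattern looks for two three-letter residue codes followed by `value ± error` and ignores any other text on the line, so row-label lines such as "Ala" are skipped.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error".
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")


def parse_entries(lines):
    """Map (first_residue, second_residue) -> (per mille value, error)."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match is None:
            continue  # row labels and other non-entry lines
        first, second, value, error = match.groups()
        table[(first, second)] = (float(value), float(error))
    return table


# A few lines copied from the table.
sample_lines = ["Ala", "AlaAla: 8.296 ± 0.128", "AlaLeu: 7.26 ± 0.089", "CysCys: 0.328 ± 0.019"]
parsed = parse_entries(sample_lines)
```

Keying the dictionary by an ordered pair keeps AlaCys and CysAla distinct, which matters because the table is not symmetric.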

Note: The error has been estimated by bootstrapping (×100) at the protein level.
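A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicates is taken as the error. The function names, the one-letter toy sequences, and the use of the sample standard deviation as the reported "±" value are assumptions; the note above only states that 100 bootstrap rounds were done at the protein level.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation (per mille) of the pair frequency over resampled proteomes."""
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in sequences]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        resample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in resample)
        hits = sum(counts[pair] for counts in resample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Made-up toy sequences, not taken from the 1XD8-76 proteome.
toy_proteome = ["MAAK", "MAL", "MKKAA", "MLLA"]
error_aa = bootstrap_error(toy_proteome, "AA")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so differences between proteins are reflected in the error estimate.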

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski