Amino acid dipeptide frequency for Prevotellaceae bacterium KH2P17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
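As a minimal sketch of what a dipeptide frequency is, the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille` and the toy sequences are hypothetical, and one-letter codes are used for brevity (the table uses three-letter codes); this is not necessarily how the database computes its values.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKKLA", "MKA"]
freqs = dipeptide_permille(toy)
```

Here "MKKLA" contributes MK, KK, KL and LA, and "MKA" contributes MK and KA, so MK accounts for 2 of the 6 observed pairs.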

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
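The uniform expectation can be checked with a few lines of arithmetic. The sketch below compares some values from the table with the 1/400 baseline; the `classify` helper is hypothetical, and the page does not state its exact colouring rule (for example, whether the error estimates are taken into account), so the simple comparison here is an assumption.

```python
N_STANDARD = 20

# 1/400 of all dipeptides, expressed in per mille (equals 0.25 %).
expected_permille = 1000 / N_STANDARD ** 2

# With Xaa counted as a 21st residue the baseline is slightly lower.
expected_permille_with_xaa = 1000 / 21 ** 2

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Observed values (per mille) copied from the table for this proteome.
examples = {"LeuLeu": 9.056, "AlaAla": 7.785, "CysCys": 0.206, "TrpTrp": 0.226}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this rule LeuLeu and AlaAla fall above the 2.5‰ baseline, while CysCys and TrpTrp fall below it.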

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
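For illustration, the conversion from per mille to a plain fraction or a percentage is shown below for the LeuLeu value from the table; the helper name is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a fraction of all dipeptides."""
    return value_permille * 1e-3

leu_leu_fraction = permille_to_fraction(9.056)  # LeuLeu value from the table
leu_leu_percent = leu_leu_fraction * 100
```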

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.785 ± 0.113
AlaCys: 1.134 ± 0.039
AlaAsp: 5.48 ± 0.085
AlaGlu: 5.16 ± 0.084
AlaPhe: 3.22 ± 0.055
AlaGly: 6.152 ± 0.09
AlaHis: 1.549 ± 0.046
AlaIle: 4.485 ± 0.087
AlaLys: 4.137 ± 0.068
AlaLeu: 7.241 ± 0.102
AlaMet: 2.195 ± 0.056
AlaAsn: 3.501 ± 0.063
AlaPro: 2.631 ± 0.059
AlaGln: 3.203 ± 0.059
AlaArg: 4.289 ± 0.08
AlaSer: 4.508 ± 0.082
AlaThr: 4.428 ± 0.07
AlaVal: 5.747 ± 0.084
AlaTrp: 0.955 ± 0.038
AlaTyr: 3.229 ± 0.056
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.837 ± 0.03
CysCys: 0.206 ± 0.015
CysAsp: 0.674 ± 0.027
CysGlu: 0.58 ± 0.024
CysPhe: 0.534 ± 0.023
CysGly: 1.043 ± 0.039
CysHis: 0.341 ± 0.02
CysIle: 0.796 ± 0.03
CysLys: 0.61 ± 0.025
CysLeu: 1.12 ± 0.041
CysMet: 0.332 ± 0.02
CysAsn: 0.576 ± 0.025
CysPro: 0.505 ± 0.026
CysGln: 0.38 ± 0.02
CysArg: 0.792 ± 0.032
CysSer: 0.8 ± 0.03
CysThr: 0.681 ± 0.03
CysVal: 0.784 ± 0.031
CysTrp: 0.181 ± 0.012
CysTyr: 0.499 ± 0.021
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.699 ± 0.074
AspCys: 0.691 ± 0.027
AspAsp: 3.109 ± 0.064
AspGlu: 4.126 ± 0.084
AspPhe: 2.952 ± 0.061
AspGly: 4.61 ± 0.085
AspHis: 0.958 ± 0.03
AspIle: 3.975 ± 0.068
AspLys: 3.638 ± 0.076
AspLeu: 4.379 ± 0.065
AspMet: 1.678 ± 0.043
AspAsn: 2.826 ± 0.05
AspPro: 1.855 ± 0.046
AspGln: 1.303 ± 0.035
AspArg: 2.922 ± 0.058
AspSer: 3.134 ± 0.067
AspThr: 2.78 ± 0.059
AspVal: 3.806 ± 0.066
AspTrp: 0.966 ± 0.034
AspTyr: 2.987 ± 0.058
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.175 ± 0.09
GluCys: 0.578 ± 0.023
GluAsp: 2.965 ± 0.065
GluGlu: 3.989 ± 0.094
GluPhe: 2.221 ± 0.047
GluGly: 3.91 ± 0.068
GluHis: 1.377 ± 0.043
GluIle: 3.659 ± 0.069
GluLys: 3.846 ± 0.089
GluLeu: 5.639 ± 0.095
GluMet: 1.8 ± 0.048
GluAsn: 2.795 ± 0.061
GluPro: 1.984 ± 0.05
GluGln: 2.87 ± 0.062
GluArg: 3.427 ± 0.066
GluSer: 2.699 ± 0.049
GluThr: 3.192 ± 0.063
GluVal: 3.822 ± 0.062
GluTrp: 0.778 ± 0.031
GluTyr: 2.372 ± 0.049
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.233 ± 0.061
PheCys: 0.684 ± 0.025
PheAsp: 2.791 ± 0.054
PheGlu: 2.165 ± 0.048
PhePhe: 1.936 ± 0.053
PheGly: 3.434 ± 0.066
PheHis: 0.862 ± 0.034
PheIle: 2.463 ± 0.052
PheLys: 1.978 ± 0.049
PheLeu: 3.548 ± 0.069
PheMet: 1.257 ± 0.039
PheAsn: 2.168 ± 0.051
PhePro: 1.566 ± 0.041
PheGln: 1.097 ± 0.029
PheArg: 2.399 ± 0.052
PheSer: 3.181 ± 0.06
PheThr: 2.748 ± 0.057
PheVal: 2.85 ± 0.065
PheTrp: 0.546 ± 0.027
PheTyr: 1.878 ± 0.045
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.969 ± 0.086
GlyCys: 0.934 ± 0.033
GlyAsp: 3.638 ± 0.06
GlyGlu: 3.934 ± 0.071
GlyPhe: 3.19 ± 0.063
GlyGly: 5.165 ± 0.094
GlyHis: 1.569 ± 0.042
GlyIle: 4.88 ± 0.082
GlyLys: 4.701 ± 0.071
GlyLeu: 6.126 ± 0.092
GlyMet: 2.124 ± 0.053
GlyAsn: 3.682 ± 0.065
GlyPro: 1.46 ± 0.046
GlyGln: 2.729 ± 0.051
GlyArg: 4.146 ± 0.072
GlySer: 4.6 ± 0.084
GlyThr: 4.565 ± 0.084
GlyVal: 4.989 ± 0.088
GlyTrp: 1.118 ± 0.042
GlyTyr: 3.406 ± 0.075
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.51 ± 0.046
HisCys: 0.336 ± 0.02
HisAsp: 1.148 ± 0.036
HisGlu: 1.178 ± 0.03
HisPhe: 0.999 ± 0.034
HisGly: 1.517 ± 0.043
HisHis: 0.615 ± 0.029
HisIle: 1.38 ± 0.042
HisLys: 0.995 ± 0.03
HisLeu: 1.914 ± 0.045
HisMet: 0.421 ± 0.019
HisAsn: 1.011 ± 0.037
HisPro: 1.183 ± 0.039
HisGln: 0.726 ± 0.025
HisArg: 1.168 ± 0.034
HisSer: 1.144 ± 0.038
HisThr: 1.123 ± 0.035
HisVal: 1.386 ± 0.038
HisTrp: 0.299 ± 0.018
HisTyr: 0.938 ± 0.032
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.985 ± 0.088
IleCys: 0.78 ± 0.032
IleAsp: 4.116 ± 0.076
IleGlu: 3.6 ± 0.071
IlePhe: 2.227 ± 0.055
IleGly: 4.373 ± 0.075
IleHis: 1.196 ± 0.035
IleIle: 3.722 ± 0.074
IleLys: 3.162 ± 0.062
IleLeu: 4.781 ± 0.081
IleMet: 1.388 ± 0.043
IleAsn: 2.951 ± 0.054
IlePro: 2.663 ± 0.049
IleGln: 1.756 ± 0.045
IleArg: 3.362 ± 0.069
IleSer: 3.857 ± 0.071
IleThr: 3.698 ± 0.067
IleVal: 4.046 ± 0.07
IleTrp: 0.56 ± 0.025
IleTyr: 2.378 ± 0.047
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.924 ± 0.089
LysCys: 0.492 ± 0.023
LysAsp: 3.447 ± 0.066
LysGlu: 4.001 ± 0.087
LysPhe: 2.001 ± 0.042
LysGly: 3.927 ± 0.065
LysHis: 1.137 ± 0.037
LysIle: 3.222 ± 0.071
LysLys: 3.703 ± 0.077
LysLeu: 4.96 ± 0.07
LysMet: 1.779 ± 0.046
LysAsn: 2.673 ± 0.072
LysPro: 2.29 ± 0.048
LysGln: 2.445 ± 0.06
LysArg: 2.903 ± 0.068
LysSer: 2.937 ± 0.059
LysThr: 3.31 ± 0.066
LysVal: 3.836 ± 0.067
LysTrp: 0.76 ± 0.029
LysTyr: 2.503 ± 0.056
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.183 ± 0.103
LeuCys: 1.391 ± 0.04
LeuAsp: 4.793 ± 0.078
LeuGlu: 4.295 ± 0.084
LeuPhe: 3.979 ± 0.085
LeuGly: 5.745 ± 0.1
LeuHis: 2.046 ± 0.049
LeuIle: 4.647 ± 0.077
LeuLys: 5.38 ± 0.079
LeuLeu: 9.056 ± 0.139
LeuMet: 2.737 ± 0.059
LeuAsn: 4.306 ± 0.079
LeuPro: 4.185 ± 0.067
LeuGln: 3.842 ± 0.075
LeuArg: 5.248 ± 0.076
LeuSer: 6.187 ± 0.083
LeuThr: 5.585 ± 0.075
LeuVal: 5.225 ± 0.089
LeuTrp: 1.046 ± 0.034
LeuTyr: 3.571 ± 0.069
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.565 ± 0.055
MetCys: 0.21 ± 0.016
MetAsp: 1.589 ± 0.046
MetGlu: 1.793 ± 0.044
MetPhe: 0.925 ± 0.036
MetGly: 1.914 ± 0.05
MetHis: 0.497 ± 0.024
MetIle: 1.325 ± 0.038
MetLys: 2.355 ± 0.051
MetLeu: 2.641 ± 0.064
MetMet: 0.931 ± 0.035
MetAsn: 1.509 ± 0.039
MetPro: 1.259 ± 0.039
MetGln: 1.234 ± 0.038
MetArg: 1.425 ± 0.042
MetSer: 1.609 ± 0.043
MetThr: 1.564 ± 0.037
MetVal: 1.629 ± 0.042
MetTrp: 0.251 ± 0.016
MetTyr: 0.819 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.702 ± 0.069
AsnCys: 0.506 ± 0.023
AsnAsp: 2.667 ± 0.057
AsnGlu: 2.614 ± 0.054
AsnPhe: 2.022 ± 0.048
AsnGly: 3.979 ± 0.081
AsnHis: 0.976 ± 0.03
AsnIle: 3.283 ± 0.062
AsnLys: 2.666 ± 0.065
AsnLeu: 4.134 ± 0.07
AsnMet: 1.236 ± 0.037
AsnAsn: 2.275 ± 0.061
AsnPro: 2.364 ± 0.054
AsnGln: 1.567 ± 0.048
AsnArg: 2.59 ± 0.056
AsnSer: 2.647 ± 0.063
AsnThr: 2.586 ± 0.06
AsnVal: 3.245 ± 0.061
AsnTrp: 0.663 ± 0.029
AsnTyr: 2.231 ± 0.057
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.542 ± 0.064
ProCys: 0.366 ± 0.021
ProAsp: 2.563 ± 0.058
ProGlu: 2.981 ± 0.062
ProPhe: 1.801 ± 0.041
ProGly: 2.584 ± 0.054
ProHis: 0.727 ± 0.026
ProIle: 1.998 ± 0.046
ProLys: 1.884 ± 0.042
ProLeu: 3.461 ± 0.054
ProMet: 1.04 ± 0.033
ProAsn: 1.632 ± 0.039
ProPro: 0.832 ± 0.029
ProGln: 1.742 ± 0.049
ProArg: 1.674 ± 0.047
ProSer: 2.124 ± 0.044
ProThr: 2.169 ± 0.053
ProVal: 3.039 ± 0.059
ProTrp: 0.509 ± 0.024
ProTyr: 1.761 ± 0.035
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.211 ± 0.058
GlnCys: 0.355 ± 0.018
GlnAsp: 1.735 ± 0.047
GlnGlu: 2.311 ± 0.064
GlnPhe: 1.471 ± 0.04
GlnGly: 2.499 ± 0.061
GlnHis: 0.854 ± 0.032
GlnIle: 2.149 ± 0.046
GlnLys: 2.306 ± 0.051
GlnLeu: 3.913 ± 0.08
GlnMet: 1.133 ± 0.035
GlnAsn: 1.778 ± 0.046
GlnPro: 1.661 ± 0.043
GlnGln: 2.265 ± 0.062
GlnArg: 2.054 ± 0.049
GlnSer: 2.096 ± 0.05
GlnThr: 2.261 ± 0.049
GlnVal: 2.371 ± 0.048
GlnTrp: 0.528 ± 0.024
GlnTyr: 1.591 ± 0.044
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.458 ± 0.068
ArgCys: 0.582 ± 0.027
ArgAsp: 2.593 ± 0.058
ArgGlu: 3.374 ± 0.073
ArgPhe: 2.409 ± 0.052
ArgGly: 3.073 ± 0.052
ArgHis: 1.468 ± 0.047
ArgIle: 3.447 ± 0.06
ArgLys: 3.457 ± 0.064
ArgLeu: 5.371 ± 0.078
ArgMet: 1.812 ± 0.045
ArgAsn: 2.752 ± 0.067
ArgPro: 2.009 ± 0.053
ArgGln: 2.91 ± 0.059
ArgArg: 3.566 ± 0.065
ArgSer: 3.0 ± 0.063
ArgThr: 2.838 ± 0.056
ArgVal: 3.02 ± 0.062
ArgTrp: 0.768 ± 0.028
ArgTyr: 2.62 ± 0.06
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.733 ± 0.076
SerCys: 0.773 ± 0.033
SerAsp: 3.071 ± 0.055
SerGlu: 3.155 ± 0.061
SerPhe: 2.934 ± 0.059
SerGly: 4.519 ± 0.084
SerHis: 1.342 ± 0.041
SerIle: 3.575 ± 0.064
SerLys: 3.009 ± 0.068
SerLeu: 5.631 ± 0.084
SerMet: 1.506 ± 0.042
SerAsn: 2.661 ± 0.059
SerPro: 2.364 ± 0.044
SerGln: 2.019 ± 0.048
SerArg: 3.151 ± 0.056
SerSer: 3.543 ± 0.072
SerThr: 3.12 ± 0.065
SerVal: 4.213 ± 0.072
SerTrp: 0.873 ± 0.032
SerTyr: 2.925 ± 0.07
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.083 ± 0.076
ThrCys: 0.615 ± 0.029
ThrAsp: 3.778 ± 0.069
ThrGlu: 3.11 ± 0.059
ThrPhe: 2.578 ± 0.056
ThrGly: 4.563 ± 0.083
ThrHis: 1.101 ± 0.038
ThrIle: 3.441 ± 0.056
ThrLys: 2.589 ± 0.053
ThrLeu: 5.6 ± 0.09
ThrMet: 1.322 ± 0.036
ThrAsn: 2.441 ± 0.059
ThrPro: 2.774 ± 0.062
ThrGln: 1.886 ± 0.038
ThrArg: 2.571 ± 0.061
ThrSer: 3.117 ± 0.063
ThrThr: 3.405 ± 0.073
ThrVal: 4.305 ± 0.078
ThrTrp: 0.69 ± 0.027
ThrTyr: 2.505 ± 0.061
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.296 ± 0.105
ValCys: 0.951 ± 0.03
ValAsp: 3.847 ± 0.07
ValGlu: 3.733 ± 0.079
ValPhe: 2.854 ± 0.069
ValGly: 4.586 ± 0.079
ValHis: 1.114 ± 0.037
ValIle: 3.93 ± 0.068
ValLys: 3.787 ± 0.07
ValLeu: 5.843 ± 0.097
ValMet: 1.814 ± 0.043
ValAsn: 3.296 ± 0.065
ValPro: 2.715 ± 0.059
ValGln: 2.205 ± 0.049
ValArg: 3.521 ± 0.066
ValSer: 4.611 ± 0.072
ValThr: 3.99 ± 0.079
ValVal: 4.891 ± 0.083
ValTrp: 0.807 ± 0.031
ValTyr: 2.712 ± 0.054
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.861 ± 0.031
TrpCys: 0.176 ± 0.013
TrpAsp: 0.709 ± 0.028
TrpGlu: 0.665 ± 0.028
TrpPhe: 0.575 ± 0.027
TrpGly: 0.957 ± 0.035
TrpHis: 0.321 ± 0.019
TrpIle: 0.711 ± 0.027
TrpLys: 0.808 ± 0.03
TrpLeu: 1.305 ± 0.043
TrpMet: 0.427 ± 0.022
TrpAsn: 0.803 ± 0.031
TrpPro: 0.369 ± 0.019
TrpGln: 0.717 ± 0.031
TrpArg: 0.711 ± 0.029
TrpSer: 0.726 ± 0.027
TrpThr: 0.782 ± 0.03
TrpVal: 0.702 ± 0.025
TrpTrp: 0.226 ± 0.016
TrpTyr: 0.511 ± 0.025
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.276 ± 0.065
TyrCys: 0.511 ± 0.023
TyrAsp: 2.678 ± 0.064
TyrGlu: 2.332 ± 0.058
TyrPhe: 1.957 ± 0.043
TyrGly: 3.375 ± 0.075
TyrHis: 0.943 ± 0.035
TyrIle: 2.514 ± 0.05
TyrLys: 2.236 ± 0.053
TyrLeu: 3.783 ± 0.06
TyrMet: 1.112 ± 0.039
TyrAsn: 2.32 ± 0.062
TyrPro: 1.757 ± 0.046
TyrGln: 1.648 ± 0.049
TyrArg: 2.609 ± 0.056
TyrSer: 2.597 ± 0.06
TyrThr: 2.648 ± 0.065
TyrVal: 2.6 ± 0.061
TyrTrp: 0.54 ± 0.023
TyrTyr: 2.124 ± 0.057
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2620 proteins (960076 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
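A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are all assumptions for illustration; the page does not specify these details.

```python
import random
from collections import Counter
from statistics import stdev

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of one dipeptide frequency over bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return stdev(estimates)

# Hypothetical toy sequences in one-letter code, not taken from the proteome.
toy = ["MKKLA", "MKA", "AKKLM", "LLKKA"]
kk_error = bootstrap_error(toy, "KK")
```

Resampling at the protein level keeps each sequence intact, so dipeptides that cluster within particular proteins contribute to the estimated error.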

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski