Amino acid dipeptide frequency for Prevotella sp. 885

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
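The arithmetic behind the 0.25% baseline can be sketched in a few lines of Python. This is only an illustration: the `classify` helper is hypothetical, and it assumes that "expected" means the uniform 1/400 baseline described above (the exact threshold used for the page's colouring is not stated). The example values are taken from the table and are in per mille.

```python
# Uniform baseline: 20 standard residues give 20 * 20 = 400 ordered dipeptides.
N_STANDARD = 20
N_DIPEPTIDES = N_STANDARD ** 2              # 400
expected_percent = 100 / N_DIPEPTIDES       # 0.25 (%)
expected_permille = 1000 / N_DIPEPTIDES     # 2.5 (per mille)


def classify(observed_permille, expected=expected_permille):
    """Hypothetical helper: compare an observed value with the uniform baseline."""
    if observed_permille > expected:
        return "over-represented"
    if observed_permille < expected:
        return "under-represented"
    return "as expected"


# A few values copied from the table (per mille).
examples = {"LeuLeu": 7.514, "AlaAla": 6.55, "ProPro": 0.704, "TrpTrp": 0.199}
for name, value in examples.items():
    print(f"{name}: {value} -> {classify(value)}")
```

Note that the 0.25% baseline corresponds to 2.5‰ in the units used by the table.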

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
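As a rough sketch of how such per mille values could be obtained from protein sequences, the following Python function counts overlapping residue pairs and scales the counts to ‰. The function name `dipeptide_permille`, the use of one-letter codes (the table uses three-letter codes), the mapping of any non-standard letter to `X` (Xaa), and the choice not to count pairs across protein boundaries are all assumptions made for illustration, not a description of the database's actual pipeline.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids, one-letter codes


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Toy input: two short made-up sequences, the second ending in a non-standard residue.
freqs = dipeptide_permille(["MKLLA", "MKU"])
mk_fraction = freqs["MK"] * 1e-3  # per mille -> fraction
print(freqs["MK"], mk_fraction)
```

Multiplying a table value by 1e-3 in the same way converts it to a plain fraction, for example 7.514‰ for LeuLeu corresponds to about 0.0075.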

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 6.55 ± 0.1 | 1.014 ± 0.03 | 5.406 ± 0.074 | 5.081 ± 0.075 | 3.207 ± 0.061 | 4.672 ± 0.079 | 1.299 ± 0.038 | 4.68 ± 0.081 | 4.908 ± 0.081 | 6.422 ± 0.095 | 2.313 ± 0.05 | 3.528 ± 0.066 | 2.399 ± 0.051 | 2.86 ± 0.055 | 3.213 ± 0.068 | 4.747 ± 0.07 | 4.445 ± 0.079 | 5.212 ± 0.094 | 0.867 ± 0.028 | 3.091 ± 0.054 | 0.0 ± 0.0
Cys | 0.842 ± 0.028 | 0.223 ± 0.016 | 0.845 ± 0.034 | 0.75 ± 0.033 | 0.593 ± 0.028 | 1.116 ± 0.039 | 0.325 ± 0.02 | 0.85 ± 0.028 | 0.773 ± 0.027 | 1.163 ± 0.032 | 0.376 ± 0.02 | 0.623 ± 0.027 | 0.56 ± 0.023 | 0.464 ± 0.02 | 0.765 ± 0.028 | 0.859 ± 0.034 | 0.667 ± 0.026 | 0.829 ± 0.029 | 0.175 ± 0.015 | 0.602 ± 0.023 | 0.0 ± 0.0
Asp | 4.69 ± 0.076 | 0.81 ± 0.027 | 3.735 ± 0.07 | 4.355 ± 0.071 | 3.134 ± 0.054 | 5.048 ± 0.083 | 0.999 ± 0.032 | 4.571 ± 0.072 | 4.081 ± 0.064 | 4.463 ± 0.077 | 1.838 ± 0.045 | 3.557 ± 0.07 | 1.802 ± 0.035 | 1.396 ± 0.038 | 2.779 ± 0.056 | 3.267 ± 0.052 | 3.108 ± 0.06 | 4.289 ± 0.067 | 0.853 ± 0.028 | 2.938 ± 0.048 | 0.0 ± 0.0
Glu | 4.958 ± 0.086 | 0.757 ± 0.03 | 3.571 ± 0.07 | 4.896 ± 0.088 | 2.345 ± 0.049 | 4.395 ± 0.066 | 1.34 ± 0.041 | 3.874 ± 0.075 | 4.653 ± 0.078 | 5.104 ± 0.08 | 2.035 ± 0.047 | 3.229 ± 0.058 | 1.823 ± 0.044 | 2.668 ± 0.054 | 3.463 ± 0.073 | 3.073 ± 0.053 | 3.373 ± 0.057 | 3.868 ± 0.057 | 0.891 ± 0.029 | 2.754 ± 0.058 | 0.0 ± 0.0
Phe | 3.206 ± 0.056 | 0.75 ± 0.027 | 2.932 ± 0.053 | 2.328 ± 0.041 | 2.023 ± 0.051 | 3.162 ± 0.066 | 0.897 ± 0.033 | 2.536 ± 0.054 | 2.333 ± 0.048 | 3.384 ± 0.067 | 1.212 ± 0.033 | 2.185 ± 0.053 | 1.462 ± 0.033 | 1.093 ± 0.034 | 2.1 ± 0.044 | 3.215 ± 0.063 | 2.769 ± 0.063 | 3.044 ± 0.057 | 0.516 ± 0.026 | 1.801 ± 0.048 | 0.0 ± 0.0
Gly | 4.576 ± 0.078 | 1.01 ± 0.037 | 3.87 ± 0.066 | 4.157 ± 0.067 | 2.953 ± 0.057 | 4.806 ± 0.119 | 1.257 ± 0.038 | 4.785 ± 0.073 | 5.454 ± 0.074 | 5.276 ± 0.078 | 2.192 ± 0.052 | 3.592 ± 0.074 | 1.231 ± 0.037 | 2.05 ± 0.048 | 3.059 ± 0.066 | 4.027 ± 0.077 | 4.237 ± 0.071 | 4.863 ± 0.068 | 1.004 ± 0.035 | 3.184 ± 0.061 | 0.0 ± 0.0
His | 1.236 ± 0.035 | 0.306 ± 0.017 | 1.216 ± 0.037 | 1.059 ± 0.034 | 1.001 ± 0.031 | 1.314 ± 0.037 | 0.554 ± 0.031 | 1.419 ± 0.038 | 1.169 ± 0.036 | 1.629 ± 0.044 | 0.433 ± 0.019 | 1.077 ± 0.034 | 0.932 ± 0.032 | 0.57 ± 0.022 | 0.952 ± 0.03 | 1.177 ± 0.034 | 1.059 ± 0.035 | 1.221 ± 0.037 | 0.248 ± 0.016 | 0.949 ± 0.034 | 0.0 ± 0.0
Ile | 5.044 ± 0.073 | 0.99 ± 0.034 | 4.474 ± 0.066 | 4.027 ± 0.064 | 2.429 ± 0.05 | 4.311 ± 0.066 | 1.216 ± 0.035 | 4.179 ± 0.08 | 4.123 ± 0.069 | 4.733 ± 0.079 | 1.59 ± 0.043 | 3.273 ± 0.057 | 2.535 ± 0.054 | 1.872 ± 0.043 | 3.045 ± 0.052 | 4.234 ± 0.063 | 3.94 ± 0.059 | 4.474 ± 0.075 | 0.571 ± 0.025 | 2.482 ± 0.05 | 0.0 ± 0.0
Lys | 5.384 ± 0.085 | 0.636 ± 0.025 | 4.186 ± 0.062 | 5.233 ± 0.089 | 2.206 ± 0.049 | 4.391 ± 0.064 | 1.318 ± 0.036 | 3.713 ± 0.063 | 5.005 ± 0.081 | 5.106 ± 0.071 | 2.253 ± 0.037 | 3.342 ± 0.055 | 2.346 ± 0.056 | 2.691 ± 0.058 | 3.318 ± 0.068 | 3.385 ± 0.055 | 4.081 ± 0.07 | 4.319 ± 0.07 | 0.816 ± 0.028 | 2.87 ± 0.06 | 0.0 ± 0.0
Leu | 6.432 ± 0.098 | 1.304 ± 0.041 | 4.723 ± 0.08 | 4.219 ± 0.068 | 3.635 ± 0.069 | 5.294 ± 0.086 | 1.796 ± 0.05 | 4.476 ± 0.073 | 5.626 ± 0.071 | 7.514 ± 0.107 | 2.383 ± 0.049 | 4.21 ± 0.067 | 3.605 ± 0.062 | 3.051 ± 0.063 | 4.562 ± 0.072 | 6.239 ± 0.069 | 5.461 ± 0.074 | 4.955 ± 0.081 | 0.965 ± 0.034 | 3.355 ± 0.061 | 0.0 ± 0.0
Met | 2.544 ± 0.049 | 0.307 ± 0.017 | 1.575 ± 0.041 | 1.933 ± 0.046 | 1.141 ± 0.04 | 1.852 ± 0.047 | 0.511 ± 0.02 | 1.489 ± 0.043 | 2.434 ± 0.047 | 2.729 ± 0.058 | 0.955 ± 0.03 | 1.488 ± 0.037 | 1.377 ± 0.035 | 1.22 ± 0.035 | 1.539 ± 0.04 | 1.823 ± 0.041 | 1.807 ± 0.04 | 1.701 ± 0.043 | 0.256 ± 0.018 | 0.898 ± 0.029 | 0.0 ± 0.0
Asn | 3.948 ± 0.066 | 0.564 ± 0.024 | 2.928 ± 0.059 | 2.938 ± 0.061 | 2.019 ± 0.05 | 4.035 ± 0.08 | 0.959 ± 0.03 | 3.801 ± 0.063 | 3.121 ± 0.059 | 4.052 ± 0.061 | 1.404 ± 0.037 | 2.778 ± 0.067 | 2.26 ± 0.049 | 1.413 ± 0.035 | 2.306 ± 0.05 | 2.794 ± 0.058 | 2.846 ± 0.058 | 3.678 ± 0.072 | 0.598 ± 0.028 | 2.158 ± 0.047 | 0.0 ± 0.0
Pro | 2.603 ± 0.052 | 0.397 ± 0.019 | 2.35 ± 0.046 | 2.878 ± 0.06 | 1.662 ± 0.036 | 2.107 ± 0.049 | 0.678 ± 0.03 | 2.076 ± 0.049 | 2.123 ± 0.046 | 2.926 ± 0.048 | 1.043 ± 0.032 | 1.684 ± 0.042 | 0.704 ± 0.029 | 1.399 ± 0.033 | 1.326 ± 0.035 | 2.332 ± 0.056 | 2.339 ± 0.05 | 2.559 ± 0.057 | 0.444 ± 0.024 | 1.667 ± 0.04 | 0.0 ± 0.0
Gln | 2.367 ± 0.049 | 0.359 ± 0.018 | 1.601 ± 0.045 | 2.022 ± 0.056 | 1.323 ± 0.03 | 1.888 ± 0.047 | 0.702 ± 0.029 | 2.162 ± 0.049 | 2.417 ± 0.059 | 3.251 ± 0.061 | 1.163 ± 0.035 | 1.788 ± 0.045 | 1.34 ± 0.037 | 1.742 ± 0.053 | 1.928 ± 0.056 | 1.887 ± 0.041 | 2.399 ± 0.048 | 1.931 ± 0.045 | 0.468 ± 0.022 | 1.508 ± 0.041 | 0.0 ± 0.0
Arg | 2.777 ± 0.058 | 0.594 ± 0.024 | 2.458 ± 0.051 | 2.852 ± 0.065 | 2.267 ± 0.045 | 2.591 ± 0.063 | 1.176 ± 0.034 | 3.525 ± 0.056 | 3.358 ± 0.065 | 4.733 ± 0.079 | 1.718 ± 0.042 | 2.502 ± 0.047 | 1.644 ± 0.038 | 2.216 ± 0.05 | 2.838 ± 0.061 | 2.589 ± 0.052 | 2.708 ± 0.053 | 2.815 ± 0.056 | 0.668 ± 0.029 | 2.27 ± 0.054 | 0.0 ± 0.0
Ser | 4.701 ± 0.068 | 0.822 ± 0.029 | 3.825 ± 0.064 | 3.55 ± 0.058 | 3.055 ± 0.063 | 4.228 ± 0.079 | 1.28 ± 0.038 | 3.894 ± 0.069 | 3.644 ± 0.066 | 5.879 ± 0.085 | 1.684 ± 0.037 | 2.736 ± 0.056 | 2.241 ± 0.046 | 2.103 ± 0.046 | 2.707 ± 0.054 | 3.94 ± 0.071 | 3.359 ± 0.07 | 4.319 ± 0.072 | 0.807 ± 0.032 | 2.687 ± 0.056 | 0.0 ± 0.0
Thr | 4.753 ± 0.074 | 0.673 ± 0.021 | 4.072 ± 0.075 | 3.468 ± 0.061 | 2.776 ± 0.055 | 4.091 ± 0.073 | 1.025 ± 0.035 | 4.219 ± 0.07 | 3.325 ± 0.058 | 5.485 ± 0.081 | 1.523 ± 0.037 | 2.52 ± 0.053 | 2.714 ± 0.047 | 1.749 ± 0.035 | 2.347 ± 0.048 | 3.78 ± 0.073 | 3.762 ± 0.068 | 4.275 ± 0.081 | 0.775 ± 0.034 | 2.602 ± 0.061 | 0.0 ± 0.0
Val | 5.213 ± 0.081 | 1.083 ± 0.038 | 4.143 ± 0.07 | 4.267 ± 0.072 | 2.892 ± 0.059 | 4.35 ± 0.071 | 1.034 ± 0.034 | 3.982 ± 0.056 | 4.648 ± 0.071 | 5.377 ± 0.082 | 1.905 ± 0.045 | 3.31 ± 0.058 | 2.485 ± 0.05 | 1.773 ± 0.043 | 3.265 ± 0.058 | 4.681 ± 0.065 | 4.107 ± 0.079 | 4.916 ± 0.088 | 0.739 ± 0.024 | 2.731 ± 0.061 | 0.0 ± 0.0
Trp | 0.832 ± 0.03 | 0.189 ± 0.013 | 0.702 ± 0.029 | 0.668 ± 0.025 | 0.5 ± 0.021 | 0.888 ± 0.029 | 0.321 ± 0.017 | 0.698 ± 0.026 | 0.813 ± 0.027 | 1.189 ± 0.039 | 0.43 ± 0.019 | 0.78 ± 0.031 | 0.267 ± 0.019 | 0.576 ± 0.024 | 0.631 ± 0.028 | 0.689 ± 0.024 | 0.855 ± 0.036 | 0.678 ± 0.033 | 0.199 ± 0.014 | 0.506 ± 0.024 | 0.0 ± 0.0
Tyr | 3.259 ± 0.06 | 0.612 ± 0.026 | 3.099 ± 0.065 | 2.392 ± 0.048 | 1.788 ± 0.049 | 3.045 ± 0.077 | 0.834 ± 0.026 | 2.613 ± 0.042 | 2.589 ± 0.056 | 3.387 ± 0.065 | 1.129 ± 0.035 | 2.398 ± 0.051 | 1.569 ± 0.042 | 1.34 ± 0.041 | 2.157 ± 0.044 | 2.727 ± 0.047 | 2.612 ± 0.067 | 2.956 ± 0.057 | 0.548 ± 0.022 | 2.132 ± 0.056 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 2874 proteins (1060643 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
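A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of the replicates is reported. The function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the "±" value are assumptions for illustration; the page does not specify these details.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return mean(estimates), stdev(estimates)


# Toy proteome of made-up sequences (one-letter codes), not real data.
toy_proteome = ["MKLLAG", "MLLK", "AGLLLA", "MKT", "MAAGLK"]
boot_mean, boot_sd = bootstrap_error(toy_proteome, "LL")
print(f"LL: {boot_mean:.1f} ± {boot_sd:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps residue pairs from the same protein together, so the estimate reflects variation between proteins.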

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski