Amino acid dipeptide frequency for Romboutsia weinsteinii

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
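As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and one-letter codes are assumed for brevity (the table on this page uses three-letter codes). This is an illustration, not the pipeline used to build this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies (in per mille) for an iterable of sequences."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 over the sequence; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter amino acid codes).
toy = ["MKKLA", "AKKM"]
freqs = dipeptide_frequencies(toy)
```

Counting per protein keeps the last residue of one sequence from pairing with the first residue of the next.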

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
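The arithmetic behind the expected value can be sketched as follows. The uniform expectation for 20 amino acids is 1000/400 = 2.5‰. The `classify` helper and its simple threshold at that value are assumptions made for illustration; the page does not state the exact rule used for the red and blue marking.

```python
N_STANDARD = 20
# Uniform expectation in per mille: 1000 / (20 * 20) = 2.5
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD ** 2


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Three values taken from the table on this page.
examples = {"IleIle": 9.567, "AlaAla": 3.106, "TrpTrp": 0.101}
labels = {pair: classify(value) for pair, value in examples.items()}
```

If Xaa is counted as a 21st amino acid, the same calculation gives 1000/441, or about 2.27‰.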

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
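For illustration, the unit conversion can be written as two small helpers. The function names are hypothetical, and the AlaAla value is taken from the table on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1


ala_ala = 3.106  # AlaAla frequency in per mille, from the table
ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_percent = per_mille_to_percent(ala_ala)
```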

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.106 ± 0.072
AlaCys: 0.728 ± 0.03
AlaAsp: 2.346 ± 0.049
AlaGlu: 2.677 ± 0.055
AlaPhe: 2.263 ± 0.053
AlaGly: 3.601 ± 0.062
AlaHis: 0.796 ± 0.027
AlaIle: 5.917 ± 0.082
AlaLys: 4.603 ± 0.081
AlaLeu: 5.522 ± 0.068
AlaMet: 1.803 ± 0.042
AlaAsn: 2.84 ± 0.053
AlaPro: 1.385 ± 0.039
AlaGln: 1.466 ± 0.037
AlaArg: 1.779 ± 0.039
AlaSer: 3.536 ± 0.061
AlaThr: 3.081 ± 0.058
AlaVal: 3.698 ± 0.077
AlaTrp: 0.372 ± 0.02
AlaTyr: 2.092 ± 0.038
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.629 ± 0.023
CysCys: 0.232 ± 0.021
CysAsp: 0.741 ± 0.027
CysGlu: 0.85 ± 0.028
CysPhe: 0.467 ± 0.019
CysGly: 1.042 ± 0.031
CysHis: 0.191 ± 0.016
CysIle: 1.234 ± 0.034
CysLys: 1.025 ± 0.037
CysLeu: 0.841 ± 0.03
CysMet: 0.356 ± 0.019
CysAsn: 0.736 ± 0.032
CysPro: 0.423 ± 0.022
CysGln: 0.232 ± 0.015
CysArg: 0.366 ± 0.018
CysSer: 0.838 ± 0.028
CysThr: 0.581 ± 0.023
CysVal: 0.722 ± 0.03
CysTrp: 0.058 ± 0.006
CysTyr: 0.399 ± 0.019
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.753 ± 0.052
AspCys: 0.64 ± 0.028
AspAsp: 3.134 ± 0.057
AspGlu: 4.879 ± 0.074
AspPhe: 2.648 ± 0.053
AspGly: 3.485 ± 0.062
AspHis: 0.601 ± 0.023
AspIle: 7.059 ± 0.085
AspLys: 5.752 ± 0.087
AspLeu: 5.23 ± 0.072
AspMet: 1.701 ± 0.038
AspAsn: 3.696 ± 0.053
AspPro: 1.309 ± 0.037
AspGln: 0.918 ± 0.03
AspArg: 1.915 ± 0.042
AspSer: 3.439 ± 0.056
AspThr: 2.876 ± 0.055
AspVal: 3.726 ± 0.063
AspTrp: 0.407 ± 0.018
AspTyr: 2.858 ± 0.049
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.786 ± 0.076
GluCys: 0.73 ± 0.027
GluAsp: 4.94 ± 0.079
GluGlu: 6.735 ± 0.102
GluPhe: 3.003 ± 0.056
GluGly: 4.016 ± 0.063
GluHis: 0.969 ± 0.027
GluIle: 7.19 ± 0.089
GluLys: 6.68 ± 0.087
GluLeu: 6.657 ± 0.086
GluMet: 1.761 ± 0.045
GluAsn: 5.383 ± 0.077
GluPro: 1.272 ± 0.037
GluGln: 1.624 ± 0.034
GluArg: 2.272 ± 0.052
GluSer: 4.039 ± 0.058
GluThr: 2.715 ± 0.051
GluVal: 5.182 ± 0.074
GluTrp: 0.349 ± 0.019
GluTyr: 3.24 ± 0.059
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.347 ± 0.053
PheCys: 0.471 ± 0.019
PheAsp: 2.641 ± 0.05
PheGlu: 2.776 ± 0.05
PhePhe: 1.769 ± 0.05
PheGly: 2.835 ± 0.056
PheHis: 0.392 ± 0.021
PheIle: 4.729 ± 0.084
PheLys: 3.496 ± 0.058
PheLeu: 3.668 ± 0.07
PheMet: 1.223 ± 0.031
PheAsn: 2.832 ± 0.055
PhePro: 0.959 ± 0.029
PheGln: 0.694 ± 0.023
PheArg: 1.177 ± 0.028
PheSer: 2.968 ± 0.06
PheThr: 2.394 ± 0.046
PheVal: 2.818 ± 0.051
PheTrp: 0.285 ± 0.014
PheTyr: 1.656 ± 0.043
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.029 ± 0.066
GlyCys: 1.023 ± 0.034
GlyAsp: 3.23 ± 0.061
GlyGlu: 4.015 ± 0.057
GlyPhe: 3.102 ± 0.058
GlyGly: 4.134 ± 0.085
GlyHis: 0.981 ± 0.031
GlyIle: 6.707 ± 0.08
GlyLys: 4.832 ± 0.068
GlyLeu: 5.592 ± 0.082
GlyMet: 1.9 ± 0.041
GlyAsn: 3.306 ± 0.052
GlyPro: 1.267 ± 0.046
GlyGln: 1.471 ± 0.034
GlyArg: 1.906 ± 0.042
GlySer: 4.024 ± 0.066
GlyThr: 3.27 ± 0.063
GlyVal: 4.937 ± 0.074
GlyTrp: 0.461 ± 0.024
GlyTyr: 3.129 ± 0.053
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.696 ± 0.024
HisCys: 0.218 ± 0.014
HisAsp: 0.783 ± 0.024
HisGlu: 0.912 ± 0.026
HisPhe: 0.565 ± 0.021
HisGly: 0.956 ± 0.031
HisHis: 0.273 ± 0.016
HisIle: 1.446 ± 0.03
HisLys: 1.038 ± 0.029
HisLeu: 1.167 ± 0.029
HisMet: 0.378 ± 0.02
HisAsn: 0.888 ± 0.024
HisPro: 0.54 ± 0.026
HisGln: 0.334 ± 0.014
HisArg: 0.492 ± 0.023
HisSer: 0.837 ± 0.028
HisThr: 0.733 ± 0.024
HisVal: 0.779 ± 0.024
HisTrp: 0.108 ± 0.011
HisTyr: 0.543 ± 0.022
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.827 ± 0.079
IleCys: 1.351 ± 0.036
IleAsp: 6.797 ± 0.075
IleGlu: 7.408 ± 0.086
IlePhe: 4.253 ± 0.073
IleGly: 6.652 ± 0.09
IleHis: 1.325 ± 0.034
IleIle: 9.567 ± 0.142
IleLys: 8.594 ± 0.091
IleLeu: 9.465 ± 0.122
IleMet: 2.463 ± 0.055
IleAsn: 6.738 ± 0.101
IlePro: 3.25 ± 0.057
IleGln: 2.152 ± 0.041
IleArg: 2.881 ± 0.055
IleSer: 7.716 ± 0.101
IleThr: 5.012 ± 0.066
IleVal: 6.709 ± 0.083
IleTrp: 0.541 ± 0.024
IleTyr: 3.902 ± 0.071
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.073 ± 0.063
LysCys: 0.828 ± 0.035
LysAsp: 5.948 ± 0.085
LysGlu: 8.349 ± 0.102
LysPhe: 3.19 ± 0.056
LysGly: 4.508 ± 0.067
LysHis: 1.241 ± 0.028
LysIle: 8.144 ± 0.086
LysLys: 7.524 ± 0.091
LysLeu: 7.473 ± 0.086
LysMet: 2.321 ± 0.05
LysAsn: 6.211 ± 0.078
LysPro: 1.913 ± 0.042
LysGln: 2.056 ± 0.045
LysArg: 2.641 ± 0.045
LysSer: 5.6 ± 0.074
LysThr: 3.749 ± 0.055
LysVal: 5.603 ± 0.067
LysTrp: 0.556 ± 0.023
LysTyr: 4.367 ± 0.065
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.219 ± 0.071
LeuCys: 1.017 ± 0.031
LeuAsp: 5.821 ± 0.069
LeuGlu: 6.593 ± 0.093
LeuPhe: 3.755 ± 0.072
LeuGly: 6.293 ± 0.08
LeuHis: 1.039 ± 0.033
LeuIle: 8.197 ± 0.125
LeuLys: 7.637 ± 0.097
LeuLeu: 7.747 ± 0.118
LeuMet: 2.331 ± 0.048
LeuAsn: 5.796 ± 0.078
LeuPro: 2.664 ± 0.05
LeuGln: 2.028 ± 0.042
LeuArg: 2.948 ± 0.057
LeuSer: 6.883 ± 0.086
LeuThr: 4.336 ± 0.066
LeuVal: 6.055 ± 0.076
LeuTrp: 0.513 ± 0.022
LeuTyr: 3.151 ± 0.052
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.73 ± 0.042
MetCys: 0.294 ± 0.017
MetAsp: 1.657 ± 0.036
MetGlu: 1.675 ± 0.037
MetPhe: 1.143 ± 0.029
MetGly: 1.831 ± 0.042
MetHis: 0.335 ± 0.017
MetIle: 2.62 ± 0.048
MetLys: 2.429 ± 0.049
MetLeu: 2.373 ± 0.043
MetMet: 0.855 ± 0.028
MetAsn: 1.751 ± 0.039
MetPro: 0.896 ± 0.027
MetGln: 0.674 ± 0.025
MetArg: 0.863 ± 0.029
MetSer: 1.882 ± 0.045
MetThr: 1.361 ± 0.037
MetVal: 1.646 ± 0.043
MetTrp: 0.165 ± 0.012
MetTyr: 1.008 ± 0.033
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.707 ± 0.05
AsnCys: 0.673 ± 0.032
AsnAsp: 3.225 ± 0.06
AsnGlu: 4.483 ± 0.072
AsnPhe: 2.434 ± 0.044
AsnGly: 3.32 ± 0.058
AsnHis: 0.907 ± 0.027
AsnIle: 7.991 ± 0.108
AsnLys: 6.474 ± 0.081
AsnLeu: 5.965 ± 0.08
AsnMet: 1.763 ± 0.045
AsnAsn: 4.877 ± 0.091
AsnPro: 2.111 ± 0.05
AsnGln: 1.593 ± 0.041
AsnArg: 1.942 ± 0.042
AsnSer: 4.005 ± 0.07
AsnThr: 3.363 ± 0.069
AsnVal: 3.475 ± 0.061
AsnTrp: 0.433 ± 0.023
AsnTyr: 2.804 ± 0.054
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.451 ± 0.046
ProCys: 0.325 ± 0.018
ProAsp: 1.328 ± 0.037
ProGlu: 1.882 ± 0.047
ProPhe: 1.277 ± 0.031
ProGly: 1.644 ± 0.042
ProHis: 0.469 ± 0.019
ProIle: 2.798 ± 0.045
ProLys: 2.087 ± 0.043
ProLeu: 2.209 ± 0.046
ProMet: 0.783 ± 0.024
ProAsn: 1.633 ± 0.046
ProPro: 0.529 ± 0.023
ProGln: 0.803 ± 0.028
ProArg: 0.771 ± 0.024
ProSer: 1.825 ± 0.041
ProThr: 1.572 ± 0.043
ProVal: 1.878 ± 0.05
ProTrp: 0.237 ± 0.02
ProTyr: 1.166 ± 0.035
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.349 ± 0.03
GlnCys: 0.226 ± 0.015
GlnAsp: 1.295 ± 0.035
GlnGlu: 1.796 ± 0.039
GlnPhe: 0.868 ± 0.025
GlnGly: 1.636 ± 0.043
GlnHis: 0.305 ± 0.017
GlnIle: 2.108 ± 0.04
GlnLys: 1.827 ± 0.038
GlnLeu: 1.96 ± 0.038
GlnMet: 0.642 ± 0.023
GlnAsn: 1.469 ± 0.035
GlnPro: 0.59 ± 0.021
GlnGln: 0.656 ± 0.029
GlnArg: 0.856 ± 0.028
GlnSer: 1.463 ± 0.041
GlnThr: 1.037 ± 0.033
GlnVal: 1.563 ± 0.038
GlnTrp: 0.174 ± 0.014
GlnTyr: 0.978 ± 0.031
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.652 ± 0.036
ArgCys: 0.417 ± 0.02
ArgAsp: 1.882 ± 0.039
ArgGlu: 2.676 ± 0.05
ArgPhe: 1.346 ± 0.035
ArgGly: 1.95 ± 0.047
ArgHis: 0.453 ± 0.017
ArgIle: 2.825 ± 0.048
ArgLys: 2.629 ± 0.053
ArgLeu: 2.795 ± 0.051
ArgMet: 0.826 ± 0.025
ArgAsn: 1.896 ± 0.04
ArgPro: 0.823 ± 0.029
ArgGln: 0.783 ± 0.023
ArgArg: 1.215 ± 0.036
ArgSer: 1.632 ± 0.04
ArgThr: 1.404 ± 0.035
ArgVal: 2.291 ± 0.048
ArgTrp: 0.202 ± 0.014
ArgTyr: 1.449 ± 0.036
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.203 ± 0.06
SerCys: 0.722 ± 0.028
SerAsp: 3.432 ± 0.049
SerGlu: 3.994 ± 0.063
SerPhe: 2.949 ± 0.052
SerGly: 4.255 ± 0.069
SerHis: 0.995 ± 0.027
SerIle: 7.49 ± 0.102
SerLys: 6.319 ± 0.071
SerLeu: 6.149 ± 0.081
SerMet: 1.826 ± 0.038
SerAsn: 4.433 ± 0.081
SerPro: 1.634 ± 0.035
SerGln: 1.756 ± 0.042
SerArg: 2.126 ± 0.041
SerSer: 4.802 ± 0.084
SerThr: 3.663 ± 0.057
SerVal: 4.256 ± 0.06
SerTrp: 0.423 ± 0.021
SerTyr: 2.789 ± 0.059
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.656 ± 0.048
ThrCys: 0.541 ± 0.023
ThrAsp: 2.567 ± 0.054
ThrGlu: 2.757 ± 0.053
ThrPhe: 2.117 ± 0.053
ThrGly: 3.51 ± 0.074
ThrHis: 0.809 ± 0.027
ThrIle: 5.155 ± 0.06
ThrLys: 3.869 ± 0.062
ThrLeu: 4.778 ± 0.066
ThrMet: 1.267 ± 0.032
ThrAsn: 2.957 ± 0.061
ThrPro: 1.869 ± 0.049
ThrGln: 1.165 ± 0.034
ThrArg: 1.46 ± 0.039
ThrSer: 3.543 ± 0.057
ThrThr: 2.911 ± 0.073
ThrVal: 3.495 ± 0.069
ThrTrp: 0.366 ± 0.018
ThrTyr: 2.111 ± 0.046
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.119 ± 0.066
ValCys: 0.927 ± 0.032
ValAsp: 4.268 ± 0.063
ValGlu: 4.616 ± 0.072
ValPhe: 2.867 ± 0.053
ValGly: 4.705 ± 0.069
ValHis: 0.868 ± 0.025
ValIle: 6.226 ± 0.081
ValLys: 5.027 ± 0.069
ValLeu: 6.188 ± 0.078
ValMet: 1.696 ± 0.042
ValAsn: 3.772 ± 0.062
ValPro: 1.878 ± 0.043
ValGln: 1.319 ± 0.038
ValArg: 1.924 ± 0.044
ValSer: 4.822 ± 0.061
ValThr: 3.322 ± 0.072
ValVal: 5.099 ± 0.085
ValTrp: 0.411 ± 0.018
ValTyr: 2.587 ± 0.041
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.365 ± 0.019
TrpCys: 0.091 ± 0.01
TrpAsp: 0.418 ± 0.022
TrpGlu: 0.413 ± 0.018
TrpPhe: 0.291 ± 0.016
TrpGly: 0.467 ± 0.021
TrpHis: 0.108 ± 0.01
TrpIle: 0.69 ± 0.027
TrpLys: 0.432 ± 0.02
TrpLeu: 0.53 ± 0.025
TrpMet: 0.195 ± 0.014
TrpAsn: 0.403 ± 0.02
TrpPro: 0.159 ± 0.011
TrpGln: 0.187 ± 0.013
TrpArg: 0.188 ± 0.014
TrpSer: 0.416 ± 0.02
TrpThr: 0.302 ± 0.019
TrpVal: 0.42 ± 0.021
TrpTrp: 0.101 ± 0.011
TrpTyr: 0.26 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.913 ± 0.033
TyrCys: 0.511 ± 0.02
TyrAsp: 2.571 ± 0.045
TyrGlu: 3.059 ± 0.054
TyrPhe: 1.824 ± 0.045
TyrGly: 2.475 ± 0.04
TyrHis: 0.629 ± 0.029
TyrIle: 4.509 ± 0.07
TyrLys: 4.024 ± 0.066
TyrLeu: 3.716 ± 0.064
TyrMet: 1.098 ± 0.034
TyrAsn: 2.928 ± 0.053
TyrPro: 1.198 ± 0.035
TyrGln: 0.886 ± 0.026
TyrArg: 1.423 ± 0.034
TyrSer: 2.945 ± 0.053
TyrThr: 2.221 ± 0.047
TyrVal: 2.289 ± 0.041
TyrTrp: 0.272 ± 0.016
TyrTyr: 1.783 ± 0.042
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3979 proteins (1204381 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
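A protein-level bootstrap of this kind could be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names, the fixed seed, the one-letter codes, and the use of the standard deviation as the reported error are all assumptions for illustration; the page states only that bootstrapping (×100) was done at the protein level.

```python
import random
from statistics import mean, stdev


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of the frequency over bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return mean(estimates), stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
toy_proteome = ["MKKLA", "AKKM", "MLLKA", "MAKK"]
kk_mean, kk_error = bootstrap_error(toy_proteome, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the variation between resamples reflects differences between proteins.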

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski