Amino acid dipepetide frequency for Terrisporobacter othiniensis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.411AlaAla: 3.411 ± 0.068
0.85AlaCys: 0.85 ± 0.033
2.564AlaAsp: 2.564 ± 0.058
2.847AlaGlu: 2.847 ± 0.064
2.387AlaPhe: 2.387 ± 0.054
3.812AlaGly: 3.812 ± 0.068
0.806AlaHis: 0.806 ± 0.026
5.94AlaIle: 5.94 ± 0.094
4.766AlaLys: 4.766 ± 0.079
5.595AlaLeu: 5.595 ± 0.085
1.783AlaMet: 1.783 ± 0.048
2.807AlaAsn: 2.807 ± 0.054
1.369AlaPro: 1.369 ± 0.04
1.422AlaGln: 1.422 ± 0.035
1.747AlaArg: 1.747 ± 0.041
3.293AlaSer: 3.293 ± 0.063
3.047AlaThr: 3.047 ± 0.072
3.769AlaVal: 3.769 ± 0.077
0.37AlaTrp: 0.37 ± 0.022
2.283AlaTyr: 2.283 ± 0.051
0.0AlaXaa: 0.0 ± 0.0
Cys
0.762CysAla: 0.762 ± 0.031
0.248CysCys: 0.248 ± 0.02
0.841CysAsp: 0.841 ± 0.033
1.084CysGlu: 1.084 ± 0.03
0.496CysPhe: 0.496 ± 0.023
1.232CysGly: 1.232 ± 0.034
0.224CysHis: 0.224 ± 0.016
1.262CysIle: 1.262 ± 0.04
1.157CysLys: 1.157 ± 0.038
0.958CysLeu: 0.958 ± 0.03
0.363CysMet: 0.363 ± 0.021
0.856CysAsn: 0.856 ± 0.028
0.547CysPro: 0.547 ± 0.027
0.26CysGln: 0.26 ± 0.017
0.409CysArg: 0.409 ± 0.021
0.816CysSer: 0.816 ± 0.029
0.639CysThr: 0.639 ± 0.026
0.837CysVal: 0.837 ± 0.031
0.065CysTrp: 0.065 ± 0.008
0.435CysTyr: 0.435 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
2.782AspAla: 2.782 ± 0.065
0.728AspCys: 0.728 ± 0.029
3.196AspAsp: 3.196 ± 0.063
5.074AspGlu: 5.074 ± 0.079
2.799AspPhe: 2.799 ± 0.062
3.122AspGly: 3.122 ± 0.06
0.623AspHis: 0.623 ± 0.028
6.139AspIle: 6.139 ± 0.094
5.808AspLys: 5.808 ± 0.083
5.459AspLeu: 5.459 ± 0.073
1.678AspMet: 1.678 ± 0.041
3.651AspAsn: 3.651 ± 0.063
1.235AspPro: 1.235 ± 0.033
0.86AspGln: 0.86 ± 0.029
1.762AspArg: 1.762 ± 0.042
3.112AspSer: 3.112 ± 0.062
2.554AspThr: 2.554 ± 0.049
3.716AspVal: 3.716 ± 0.058
0.308AspTrp: 0.308 ± 0.02
2.852AspTyr: 2.852 ± 0.063
0.0AspXaa: 0.0 ± 0.0
Glu
3.777GluAla: 3.777 ± 0.067
0.782GluCys: 0.782 ± 0.03
4.92GluAsp: 4.92 ± 0.076
7.476GluGlu: 7.476 ± 0.119
2.932GluPhe: 2.932 ± 0.056
4.135GluGly: 4.135 ± 0.061
0.896GluHis: 0.896 ± 0.03
7.313GluIle: 7.313 ± 0.089
7.377GluLys: 7.377 ± 0.095
6.415GluLeu: 6.415 ± 0.091
1.999GluMet: 1.999 ± 0.047
5.838GluAsn: 5.838 ± 0.084
1.311GluPro: 1.311 ± 0.034
1.534GluGln: 1.534 ± 0.04
2.308GluArg: 2.308 ± 0.052
3.689GluSer: 3.689 ± 0.064
2.778GluThr: 2.778 ± 0.058
5.27GluVal: 5.27 ± 0.077
0.334GluTrp: 0.334 ± 0.019
3.125GluTyr: 3.125 ± 0.059
0.0GluXaa: 0.0 ± 0.0
Phe
2.481PheAla: 2.481 ± 0.055
0.527PheCys: 0.527 ± 0.02
2.585PheAsp: 2.585 ± 0.054
2.746PheGlu: 2.746 ± 0.057
1.829PhePhe: 1.829 ± 0.042
3.005PheGly: 3.005 ± 0.065
0.456PheHis: 0.456 ± 0.022
4.576PheIle: 4.576 ± 0.082
3.507PheLys: 3.507 ± 0.056
3.862PheLeu: 3.862 ± 0.076
1.3PheMet: 1.3 ± 0.044
2.798PheAsn: 2.798 ± 0.053
1.082PhePro: 1.082 ± 0.031
0.782PheGln: 0.782 ± 0.029
1.154PheArg: 1.154 ± 0.034
2.875PheSer: 2.875 ± 0.061
2.384PheThr: 2.384 ± 0.054
2.934PheVal: 2.934 ± 0.057
0.274PheTrp: 0.274 ± 0.015
1.765PheTyr: 1.765 ± 0.05
0.0PheXaa: 0.0 ± 0.0
Gly
4.169GlyAla: 4.169 ± 0.089
1.108GlyCys: 1.108 ± 0.035
3.254GlyAsp: 3.254 ± 0.054
4.089GlyGlu: 4.089 ± 0.072
3.145GlyPhe: 3.145 ± 0.067
4.44GlyGly: 4.44 ± 0.089
0.98GlyHis: 0.98 ± 0.029
6.728GlyIle: 6.728 ± 0.089
5.307GlyLys: 5.307 ± 0.079
5.567GlyLeu: 5.567 ± 0.089
1.941GlyMet: 1.941 ± 0.047
3.104GlyAsn: 3.104 ± 0.059
1.325GlyPro: 1.325 ± 0.036
1.477GlyGln: 1.477 ± 0.038
2.105GlyArg: 2.105 ± 0.051
3.809GlySer: 3.809 ± 0.071
3.253GlyThr: 3.253 ± 0.061
4.863GlyVal: 4.863 ± 0.077
0.429GlyTrp: 0.429 ± 0.025
3.034GlyTyr: 3.034 ± 0.06
0.0GlyXaa: 0.0 ± 0.0
His
0.675HisAla: 0.675 ± 0.027
0.228HisCys: 0.228 ± 0.017
0.737HisAsp: 0.737 ± 0.031
0.954HisGlu: 0.954 ± 0.03
0.567HisPhe: 0.567 ± 0.024
0.972HisGly: 0.972 ± 0.036
0.262HisHis: 0.262 ± 0.017
1.346HisIle: 1.346 ± 0.033
1.093HisLys: 1.093 ± 0.03
1.194HisLeu: 1.194 ± 0.037
0.372HisMet: 0.372 ± 0.019
0.791HisAsn: 0.791 ± 0.027
0.58HisPro: 0.58 ± 0.022
0.295HisGln: 0.295 ± 0.019
0.467HisArg: 0.467 ± 0.022
0.822HisSer: 0.822 ± 0.029
0.668HisThr: 0.668 ± 0.027
0.813HisVal: 0.813 ± 0.033
0.103HisTrp: 0.103 ± 0.01
0.503HisTyr: 0.503 ± 0.02
0.0HisXaa: 0.0 ± 0.0
Ile
5.637IleAla: 5.637 ± 0.087
1.507IleCys: 1.507 ± 0.042
6.176IleAsp: 6.176 ± 0.093
7.078IleGlu: 7.078 ± 0.105
4.18IlePhe: 4.18 ± 0.076
6.438IleGly: 6.438 ± 0.093
1.293IleHis: 1.293 ± 0.037
9.69IleIle: 9.69 ± 0.122
8.648IleLys: 8.648 ± 0.099
9.192IleLeu: 9.192 ± 0.108
2.458IleMet: 2.458 ± 0.049
6.567IleAsn: 6.567 ± 0.091
3.213IlePro: 3.213 ± 0.066
1.969IleGln: 1.969 ± 0.049
2.69IleArg: 2.69 ± 0.057
7.104IleSer: 7.104 ± 0.093
4.831IleThr: 4.831 ± 0.072
6.611IleVal: 6.611 ± 0.076
0.489IleTrp: 0.489 ± 0.025
3.844IleTyr: 3.844 ± 0.078
0.0IleXaa: 0.0 ± 0.0
Lys
4.427LysAla: 4.427 ± 0.065
1.0LysCys: 1.0 ± 0.037
6.026LysAsp: 6.026 ± 0.103
8.947LysGlu: 8.947 ± 0.126
3.231LysPhe: 3.231 ± 0.06
4.747LysGly: 4.747 ± 0.076
1.067LysHis: 1.067 ± 0.033
8.312LysIle: 8.312 ± 0.088
8.061LysLys: 8.061 ± 0.101
7.493LysLeu: 7.493 ± 0.086
2.535LysMet: 2.535 ± 0.045
6.573LysAsn: 6.573 ± 0.106
1.886LysPro: 1.886 ± 0.04
1.989LysGln: 1.989 ± 0.051
2.73LysArg: 2.73 ± 0.066
5.535LysSer: 5.535 ± 0.075
4.068LysThr: 4.068 ± 0.065
5.923LysVal: 5.923 ± 0.075
0.536LysTrp: 0.536 ± 0.025
4.425LysTyr: 4.425 ± 0.079
0.0LysXaa: 0.0 ± 0.0
Leu
5.265LeuAla: 5.265 ± 0.082
1.225LeuCys: 1.225 ± 0.037
5.496LeuAsp: 5.496 ± 0.093
6.408LeuGlu: 6.408 ± 0.086
3.928LeuPhe: 3.928 ± 0.078
6.376LeuGly: 6.376 ± 0.097
1.084LeuHis: 1.084 ± 0.036
8.224LeuIle: 8.224 ± 0.128
7.783LeuLys: 7.783 ± 0.083
7.836LeuLeu: 7.836 ± 0.12
2.328LeuMet: 2.328 ± 0.047
5.943LeuAsn: 5.943 ± 0.084
2.731LeuPro: 2.731 ± 0.052
2.262LeuGln: 2.262 ± 0.047
2.866LeuArg: 2.866 ± 0.059
6.639LeuSer: 6.639 ± 0.11
4.291LeuThr: 4.291 ± 0.064
5.861LeuVal: 5.861 ± 0.091
0.497LeuTrp: 0.497 ± 0.024
3.271LeuTyr: 3.271 ± 0.058
0.0LeuXaa: 0.0 ± 0.0
Met
1.875MetAla: 1.875 ± 0.048
0.32MetCys: 0.32 ± 0.015
1.674MetAsp: 1.674 ± 0.043
2.019MetGlu: 2.019 ± 0.051
1.014MetPhe: 1.014 ± 0.032
2.005MetGly: 2.005 ± 0.044
0.327MetHis: 0.327 ± 0.016
2.566MetIle: 2.566 ± 0.054
2.668MetLys: 2.668 ± 0.047
2.256MetLeu: 2.256 ± 0.054
0.85MetMet: 0.85 ± 0.03
1.782MetAsn: 1.782 ± 0.042
0.878MetPro: 0.878 ± 0.026
0.646MetGln: 0.646 ± 0.03
0.854MetArg: 0.854 ± 0.033
1.866MetSer: 1.866 ± 0.045
1.348MetThr: 1.348 ± 0.04
1.691MetVal: 1.691 ± 0.037
0.168MetTrp: 0.168 ± 0.012
0.941MetTyr: 0.941 ± 0.026
0.0MetXaa: 0.0 ± 0.0
Asn
2.824AsnAla: 2.824 ± 0.056
0.795AsnCys: 0.795 ± 0.031
3.076AsnAsp: 3.076 ± 0.062
4.529AsnGlu: 4.529 ± 0.066
2.515AsnPhe: 2.515 ± 0.052
3.218AsnGly: 3.218 ± 0.058
0.933AsnHis: 0.933 ± 0.033
7.385AsnIle: 7.385 ± 0.098
6.845AsnLys: 6.845 ± 0.1
6.017AsnLeu: 6.017 ± 0.089
1.863AsnMet: 1.863 ± 0.043
4.751AsnAsn: 4.751 ± 0.099
2.173AsnPro: 2.173 ± 0.055
1.507AsnGln: 1.507 ± 0.041
1.876AsnArg: 1.876 ± 0.044
3.684AsnSer: 3.684 ± 0.072
2.988AsnThr: 2.988 ± 0.052
3.664AsnVal: 3.664 ± 0.069
0.399AsnTrp: 0.399 ± 0.021
2.821AsnTyr: 2.821 ± 0.058
0.0AsnXaa: 0.0 ± 0.0
Pro
1.452ProAla: 1.452 ± 0.046
0.364ProCys: 0.364 ± 0.02
1.356ProAsp: 1.356 ± 0.041
1.933ProGlu: 1.933 ± 0.044
1.296ProPhe: 1.296 ± 0.035
1.736ProGly: 1.736 ± 0.044
0.495ProHis: 0.495 ± 0.018
2.717ProIle: 2.717 ± 0.053
2.116ProLys: 2.116 ± 0.047
2.349ProLeu: 2.349 ± 0.054
0.763ProMet: 0.763 ± 0.026
1.512ProAsn: 1.512 ± 0.045
0.577ProPro: 0.577 ± 0.023
0.824ProGln: 0.824 ± 0.027
0.829ProArg: 0.829 ± 0.027
1.764ProSer: 1.764 ± 0.044
1.583ProThr: 1.583 ± 0.042
2.083ProVal: 2.083 ± 0.051
0.218ProTrp: 0.218 ± 0.015
1.26ProTyr: 1.26 ± 0.032
0.0ProXaa: 0.0 ± 0.0
Gln
1.299GlnAla: 1.299 ± 0.04
0.257GlnCys: 0.257 ± 0.017
1.152GlnAsp: 1.152 ± 0.038
1.616GlnGlu: 1.616 ± 0.047
0.885GlnPhe: 0.885 ± 0.032
1.52GlnGly: 1.52 ± 0.04
0.288GlnHis: 0.288 ± 0.017
2.193GlnIle: 2.193 ± 0.051
1.79GlnLys: 1.79 ± 0.041
2.122GlnLeu: 2.122 ± 0.044
0.67GlnMet: 0.67 ± 0.024
1.344GlnAsn: 1.344 ± 0.039
0.6GlnPro: 0.6 ± 0.023
0.608GlnGln: 0.608 ± 0.027
0.878GlnArg: 0.878 ± 0.033
1.404GlnSer: 1.404 ± 0.037
1.026GlnThr: 1.026 ± 0.033
1.575GlnVal: 1.575 ± 0.041
0.169GlnTrp: 0.169 ± 0.012
0.993GlnTyr: 0.993 ± 0.033
0.0GlnXaa: 0.0 ± 0.0
Arg
1.708ArgAla: 1.708 ± 0.046
0.433ArgCys: 0.433 ± 0.022
1.8ArgAsp: 1.8 ± 0.038
2.655ArgGlu: 2.655 ± 0.062
1.277ArgPhe: 1.277 ± 0.034
1.904ArgGly: 1.904 ± 0.041
0.427ArgHis: 0.427 ± 0.02
2.763ArgIle: 2.763 ± 0.06
2.869ArgLys: 2.869 ± 0.052
2.607ArgLeu: 2.607 ± 0.05
0.902ArgMet: 0.902 ± 0.033
1.84ArgAsn: 1.84 ± 0.04
0.824ArgPro: 0.824 ± 0.029
0.917ArgGln: 0.917 ± 0.032
1.3ArgArg: 1.3 ± 0.041
1.475ArgSer: 1.475 ± 0.042
1.433ArgThr: 1.433 ± 0.037
2.228ArgVal: 2.228 ± 0.05
0.208ArgTrp: 0.208 ± 0.014
1.312ArgTyr: 1.312 ± 0.038
0.0ArgXaa: 0.0 ± 0.0
Ser
3.229SerAla: 3.229 ± 0.061
0.782SerCys: 0.782 ± 0.03
3.116SerAsp: 3.116 ± 0.059
3.771SerGlu: 3.771 ± 0.067
3.031SerPhe: 3.031 ± 0.053
4.158SerGly: 4.158 ± 0.078
0.989SerHis: 0.989 ± 0.035
6.482SerIle: 6.482 ± 0.091
5.958SerLys: 5.958 ± 0.075
6.093SerLeu: 6.093 ± 0.083
1.738SerMet: 1.738 ± 0.043
3.894SerAsn: 3.894 ± 0.067
1.69SerPro: 1.69 ± 0.042
1.693SerGln: 1.693 ± 0.038
1.989SerArg: 1.989 ± 0.046
4.476SerSer: 4.476 ± 0.08
3.316SerThr: 3.316 ± 0.06
4.037SerVal: 4.037 ± 0.07
0.381SerTrp: 0.381 ± 0.021
2.69SerTyr: 2.69 ± 0.049
0.0SerXaa: 0.0 ± 0.0
Thr
2.7ThrAla: 2.7 ± 0.047
0.677ThrCys: 0.677 ± 0.028
2.372ThrAsp: 2.372 ± 0.047
2.601ThrGlu: 2.601 ± 0.052
2.165ThrPhe: 2.165 ± 0.051
3.514ThrGly: 3.514 ± 0.066
0.766ThrHis: 0.766 ± 0.025
4.847ThrIle: 4.847 ± 0.068
3.824ThrLys: 3.824 ± 0.064
4.863ThrLeu: 4.863 ± 0.067
1.211ThrMet: 1.211 ± 0.037
2.71ThrAsn: 2.71 ± 0.062
1.795ThrPro: 1.795 ± 0.045
1.071ThrGln: 1.071 ± 0.035
1.421ThrArg: 1.421 ± 0.037
3.413ThrSer: 3.413 ± 0.07
2.775ThrThr: 2.775 ± 0.066
3.476ThrVal: 3.476 ± 0.059
0.321ThrTrp: 0.321 ± 0.016
2.205ThrTyr: 2.205 ± 0.051
0.0ThrXaa: 0.0 ± 0.0
Val
4.054ValAla: 4.054 ± 0.071
1.025ValCys: 1.025 ± 0.033
4.127ValAsp: 4.127 ± 0.068
4.814ValGlu: 4.814 ± 0.075
3.0ValPhe: 3.0 ± 0.066
4.9ValGly: 4.9 ± 0.084
0.839ValHis: 0.839 ± 0.031
6.289ValIle: 6.289 ± 0.098
5.536ValLys: 5.536 ± 0.084
6.007ValLeu: 6.007 ± 0.086
1.72ValMet: 1.72 ± 0.04
3.766ValAsn: 3.766 ± 0.064
2.012ValPro: 2.012 ± 0.035
1.327ValGln: 1.327 ± 0.039
1.978ValArg: 1.978 ± 0.046
4.584ValSer: 4.584 ± 0.074
3.314ValThr: 3.314 ± 0.063
4.973ValVal: 4.973 ± 0.103
0.365ValTrp: 0.365 ± 0.019
2.593ValTyr: 2.593 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
0.356TrpAla: 0.356 ± 0.02
0.082TrpCys: 0.082 ± 0.009
0.336TrpAsp: 0.336 ± 0.021
0.398TrpGlu: 0.398 ± 0.023
0.283TrpPhe: 0.283 ± 0.018
0.404TrpGly: 0.404 ± 0.02
0.112TrpHis: 0.112 ± 0.011
0.607TrpIle: 0.607 ± 0.025
0.409TrpLys: 0.409 ± 0.023
0.512TrpLeu: 0.512 ± 0.022
0.182TrpMet: 0.182 ± 0.013
0.38TrpAsn: 0.38 ± 0.021
0.149TrpPro: 0.149 ± 0.014
0.185TrpGln: 0.185 ± 0.014
0.205TrpArg: 0.205 ± 0.014
0.375TrpSer: 0.375 ± 0.018
0.294TrpThr: 0.294 ± 0.019
0.386TrpVal: 0.386 ± 0.024
0.067TrpTrp: 0.067 ± 0.008
0.245TrpTyr: 0.245 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.987TyrAla: 1.987 ± 0.045
0.554TyrCys: 0.554 ± 0.023
2.653TyrAsp: 2.653 ± 0.054
3.168TyrGlu: 3.168 ± 0.065
1.962TyrPhe: 1.962 ± 0.05
2.489TyrGly: 2.489 ± 0.05
0.575TyrHis: 0.575 ± 0.025
4.235TyrIle: 4.235 ± 0.076
3.99TyrLys: 3.99 ± 0.069
3.922TyrLeu: 3.922 ± 0.066
1.09TyrMet: 1.09 ± 0.029
2.954TyrAsn: 2.954 ± 0.057
1.249TyrPro: 1.249 ± 0.033
0.763TyrGln: 0.763 ± 0.03
1.313TyrArg: 1.313 ± 0.037
2.778TyrSer: 2.778 ± 0.051
2.138TyrThr: 2.138 ± 0.051
2.513TyrVal: 2.513 ± 0.05
0.263TyrTrp: 0.263 ± 0.017
1.906TyrTyr: 1.906 ± 0.053
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3449 proteins (1029502 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski