Amino acid dipepetide frequency for Brevibacillus sp. SYP-B805

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.636AlaAla: 10.636 ± 0.152
0.887AlaCys: 0.887 ± 0.029
4.795AlaAsp: 4.795 ± 0.064
6.726AlaGlu: 6.726 ± 0.088
3.508AlaPhe: 3.508 ± 0.063
7.576AlaGly: 7.576 ± 0.095
1.818AlaHis: 1.818 ± 0.04
6.046AlaIle: 6.046 ± 0.078
4.847AlaLys: 4.847 ± 0.075
9.552AlaLeu: 9.552 ± 0.099
2.543AlaMet: 2.543 ± 0.051
2.742AlaAsn: 2.742 ± 0.048
3.117AlaPro: 3.117 ± 0.064
3.282AlaGln: 3.282 ± 0.058
5.051AlaArg: 5.051 ± 0.076
4.732AlaSer: 4.732 ± 0.069
4.005AlaThr: 4.005 ± 0.097
7.408AlaVal: 7.408 ± 0.088
1.096AlaTrp: 1.096 ± 0.036
2.931AlaTyr: 2.931 ± 0.05
0.0AlaXaa: 0.0 ± 0.0
Cys
0.624CysAla: 0.624 ± 0.024
0.117CysCys: 0.117 ± 0.012
0.442CysAsp: 0.442 ± 0.019
0.488CysGlu: 0.488 ± 0.021
0.319CysPhe: 0.319 ± 0.019
0.832CysGly: 0.832 ± 0.029
0.226CysHis: 0.226 ± 0.015
0.459CysIle: 0.459 ± 0.019
0.318CysLys: 0.318 ± 0.017
0.816CysLeu: 0.816 ± 0.03
0.199CysMet: 0.199 ± 0.012
0.251CysAsn: 0.251 ± 0.013
0.392CysPro: 0.392 ± 0.018
0.264CysGln: 0.264 ± 0.015
0.552CysArg: 0.552 ± 0.023
0.527CysSer: 0.527 ± 0.024
0.419CysThr: 0.419 ± 0.021
0.471CysVal: 0.471 ± 0.019
0.102CysTrp: 0.102 ± 0.01
0.28CysTyr: 0.28 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
4.239AspAla: 4.239 ± 0.058
0.388AspCys: 0.388 ± 0.019
2.494AspAsp: 2.494 ± 0.068
3.746AspGlu: 3.746 ± 0.06
1.916AspPhe: 1.916 ± 0.043
3.81AspGly: 3.81 ± 0.071
1.06AspHis: 1.06 ± 0.033
3.06AspIle: 3.06 ± 0.053
2.348AspLys: 2.348 ± 0.051
4.909AspLeu: 4.909 ± 0.073
1.219AspMet: 1.219 ± 0.037
1.295AspAsn: 1.295 ± 0.037
2.517AspPro: 2.517 ± 0.048
1.871AspGln: 1.871 ± 0.044
3.078AspArg: 3.078 ± 0.05
2.076AspSer: 2.076 ± 0.052
2.273AspThr: 2.273 ± 0.044
3.983AspVal: 3.983 ± 0.065
0.684AspTrp: 0.684 ± 0.023
1.621AspTyr: 1.621 ± 0.036
0.0AspXaa: 0.0 ± 0.0
Glu
6.849GluAla: 6.849 ± 0.093
0.384GluCys: 0.384 ± 0.019
2.758GluAsp: 2.758 ± 0.051
6.486GluGlu: 6.486 ± 0.095
1.855GluPhe: 1.855 ± 0.044
4.398GluGly: 4.398 ± 0.074
1.506GluHis: 1.506 ± 0.038
4.846GluIle: 4.846 ± 0.078
4.756GluLys: 4.756 ± 0.078
6.806GluLeu: 6.806 ± 0.079
2.261GluMet: 2.261 ± 0.046
2.35GluAsn: 2.35 ± 0.046
2.357GluPro: 2.357 ± 0.045
3.653GluGln: 3.653 ± 0.066
5.056GluArg: 5.056 ± 0.074
2.971GluSer: 2.971 ± 0.062
3.697GluThr: 3.697 ± 0.053
4.871GluVal: 4.871 ± 0.064
0.99GluTrp: 0.99 ± 0.033
1.809GluTyr: 1.809 ± 0.042
0.0GluXaa: 0.0 ± 0.0
Phe
3.73PheAla: 3.73 ± 0.061
0.395PheCys: 0.395 ± 0.017
2.128PheAsp: 2.128 ± 0.042
2.059PheGlu: 2.059 ± 0.046
1.847PhePhe: 1.847 ± 0.045
3.131PheGly: 3.131 ± 0.051
1.028PheHis: 1.028 ± 0.031
2.118PheIle: 2.118 ± 0.048
1.204PheLys: 1.204 ± 0.036
4.389PheLeu: 4.389 ± 0.074
0.84PheMet: 0.84 ± 0.028
1.079PheAsn: 1.079 ± 0.036
1.81PhePro: 1.81 ± 0.037
1.4PheGln: 1.4 ± 0.034
2.153PheArg: 2.153 ± 0.041
2.376PheSer: 2.376 ± 0.052
2.274PheThr: 2.274 ± 0.049
2.951PheVal: 2.951 ± 0.052
0.49PheTrp: 0.49 ± 0.021
1.28PheTyr: 1.28 ± 0.031
0.0PheXaa: 0.0 ± 0.0
Gly
5.915GlyAla: 5.915 ± 0.111
0.776GlyCys: 0.776 ± 0.025
3.395GlyAsp: 3.395 ± 0.064
5.057GlyGlu: 5.057 ± 0.068
3.151GlyPhe: 3.151 ± 0.053
5.58GlyGly: 5.58 ± 0.088
1.501GlyHis: 1.501 ± 0.035
5.709GlyIle: 5.709 ± 0.069
4.731GlyLys: 4.731 ± 0.07
7.213GlyLeu: 7.213 ± 0.087
2.405GlyMet: 2.405 ± 0.053
2.307GlyAsn: 2.307 ± 0.054
2.2GlyPro: 2.2 ± 0.09
2.696GlyGln: 2.696 ± 0.054
4.216GlyArg: 4.216 ± 0.057
4.208GlySer: 4.208 ± 0.07
4.438GlyThr: 4.438 ± 0.061
5.704GlyVal: 5.704 ± 0.076
1.061GlyTrp: 1.061 ± 0.03
2.874GlyTyr: 2.874 ± 0.052
0.0GlyXaa: 0.0 ± 0.0
His
1.964HisAla: 1.964 ± 0.046
0.204HisCys: 0.204 ± 0.014
1.058HisAsp: 1.058 ± 0.03
1.372HisGlu: 1.372 ± 0.035
1.021HisPhe: 1.021 ± 0.029
1.667HisGly: 1.667 ± 0.04
0.689HisHis: 0.689 ± 0.028
1.224HisIle: 1.224 ± 0.037
0.773HisLys: 0.773 ± 0.028
2.503HisLeu: 2.503 ± 0.05
0.51HisMet: 0.51 ± 0.022
0.595HisAsn: 0.595 ± 0.024
1.534HisPro: 1.534 ± 0.038
0.87HisGln: 0.87 ± 0.026
1.318HisArg: 1.318 ± 0.037
1.071HisSer: 1.071 ± 0.031
1.124HisThr: 1.124 ± 0.031
1.803HisVal: 1.803 ± 0.044
0.271HisTrp: 0.271 ± 0.015
0.791HisTyr: 0.791 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
6.672IleAla: 6.672 ± 0.087
0.593IleCys: 0.593 ± 0.023
3.547IleAsp: 3.547 ± 0.057
4.029IleGlu: 4.029 ± 0.061
2.045IlePhe: 2.045 ± 0.043
5.482IleGly: 5.482 ± 0.071
1.471IleHis: 1.471 ± 0.039
3.475IleIle: 3.475 ± 0.061
2.68IleLys: 2.68 ± 0.051
5.986IleLeu: 5.986 ± 0.081
1.349IleMet: 1.349 ± 0.037
2.042IleAsn: 2.042 ± 0.052
3.421IlePro: 3.421 ± 0.06
2.226IleGln: 2.226 ± 0.037
4.059IleArg: 4.059 ± 0.062
3.546IleSer: 3.546 ± 0.067
3.577IleThr: 3.577 ± 0.064
4.863IleVal: 4.863 ± 0.063
0.577IleTrp: 0.577 ± 0.02
1.794IleTyr: 1.794 ± 0.042
0.0IleXaa: 0.0 ± 0.0
Lys
4.662LysAla: 4.662 ± 0.065
0.254LysCys: 0.254 ± 0.016
2.195LysAsp: 2.195 ± 0.049
4.627LysGlu: 4.627 ± 0.072
1.098LysPhe: 1.098 ± 0.033
3.625LysGly: 3.625 ± 0.055
1.025LysHis: 1.025 ± 0.03
2.959LysIle: 2.959 ± 0.051
3.162LysLys: 3.162 ± 0.064
4.668LysLeu: 4.668 ± 0.068
1.562LysMet: 1.562 ± 0.039
1.682LysAsn: 1.682 ± 0.041
2.292LysPro: 2.292 ± 0.048
2.506LysGln: 2.506 ± 0.044
3.556LysArg: 3.556 ± 0.048
2.21LysSer: 2.21 ± 0.043
2.779LysThr: 2.779 ± 0.05
3.659LysVal: 3.659 ± 0.061
0.726LysTrp: 0.726 ± 0.029
1.402LysTyr: 1.402 ± 0.041
0.0LysXaa: 0.0 ± 0.0
Leu
10.664LeuAla: 10.664 ± 0.119
0.864LeuCys: 0.864 ± 0.029
5.033LeuAsp: 5.033 ± 0.078
6.441LeuGlu: 6.441 ± 0.09
4.558LeuPhe: 4.558 ± 0.078
7.188LeuGly: 7.188 ± 0.087
2.419LeuHis: 2.419 ± 0.051
6.123LeuIle: 6.123 ± 0.08
4.636LeuLys: 4.636 ± 0.066
11.739LeuLeu: 11.739 ± 0.144
2.426LeuMet: 2.426 ± 0.044
2.951LeuAsn: 2.951 ± 0.052
5.056LeuPro: 5.056 ± 0.069
4.454LeuGln: 4.454 ± 0.067
5.798LeuArg: 5.798 ± 0.067
6.139LeuSer: 6.139 ± 0.083
5.608LeuThr: 5.608 ± 0.079
7.204LeuVal: 7.204 ± 0.088
1.031LeuTrp: 1.031 ± 0.034
3.174LeuTyr: 3.174 ± 0.063
0.0LeuXaa: 0.0 ± 0.0
Met
2.658MetAla: 2.658 ± 0.049
0.151MetCys: 0.151 ± 0.011
1.257MetAsp: 1.257 ± 0.036
1.958MetGlu: 1.958 ± 0.037
0.85MetPhe: 0.85 ± 0.028
1.899MetGly: 1.899 ± 0.048
0.473MetHis: 0.473 ± 0.02
1.78MetIle: 1.78 ± 0.034
1.761MetLys: 1.761 ± 0.043
2.676MetLeu: 2.676 ± 0.052
0.781MetMet: 0.781 ± 0.028
1.104MetAsn: 1.104 ± 0.029
1.117MetPro: 1.117 ± 0.034
1.006MetGln: 1.006 ± 0.032
1.535MetArg: 1.535 ± 0.044
1.441MetSer: 1.441 ± 0.034
1.577MetThr: 1.577 ± 0.03
1.894MetVal: 1.894 ± 0.043
0.214MetTrp: 0.214 ± 0.014
0.647MetTyr: 0.647 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
2.55AsnAla: 2.55 ± 0.042
0.263AsnCys: 0.263 ± 0.016
1.41AsnAsp: 1.41 ± 0.037
1.967AsnGlu: 1.967 ± 0.047
0.972AsnPhe: 0.972 ± 0.029
2.835AsnGly: 2.835 ± 0.056
0.706AsnHis: 0.706 ± 0.026
1.933AsnIle: 1.933 ± 0.045
1.466AsnLys: 1.466 ± 0.04
3.092AsnLeu: 3.092 ± 0.05
0.792AsnMet: 0.792 ± 0.028
1.057AsnAsn: 1.057 ± 0.034
2.065AsnPro: 2.065 ± 0.042
1.382AsnGln: 1.382 ± 0.032
2.148AsnArg: 2.148 ± 0.044
1.313AsnSer: 1.313 ± 0.037
1.431AsnThr: 1.431 ± 0.041
2.355AsnVal: 2.355 ± 0.055
0.366AsnTrp: 0.366 ± 0.018
0.884AsnTyr: 0.884 ± 0.031
0.0AsnXaa: 0.0 ± 0.0
Pro
4.199ProAla: 4.199 ± 0.066
0.259ProCys: 0.259 ± 0.015
2.879ProAsp: 2.879 ± 0.057
3.4ProGlu: 3.4 ± 0.058
2.044ProPhe: 2.044 ± 0.036
3.328ProGly: 3.328 ± 0.072
1.18ProHis: 1.18 ± 0.035
2.493ProIle: 2.493 ± 0.046
1.806ProLys: 1.806 ± 0.04
4.548ProLeu: 4.548 ± 0.076
0.966ProMet: 0.966 ± 0.034
1.355ProAsn: 1.355 ± 0.038
1.838ProPro: 1.838 ± 0.044
1.677ProGln: 1.677 ± 0.036
1.947ProArg: 1.947 ± 0.041
2.345ProSer: 2.345 ± 0.043
2.22ProThr: 2.22 ± 0.088
3.868ProVal: 3.868 ± 0.057
0.488ProTrp: 0.488 ± 0.02
1.656ProTyr: 1.656 ± 0.041
0.0ProXaa: 0.0 ± 0.0
Gln
4.33GlnAla: 4.33 ± 0.07
0.214GlnCys: 0.214 ± 0.014
1.52GlnAsp: 1.52 ± 0.033
3.149GlnGlu: 3.149 ± 0.057
1.418GlnPhe: 1.418 ± 0.035
2.55GlnGly: 2.55 ± 0.044
0.938GlnHis: 0.938 ± 0.032
2.501GlnIle: 2.501 ± 0.052
2.234GlnLys: 2.234 ± 0.043
4.225GlnLeu: 4.225 ± 0.069
1.146GlnMet: 1.146 ± 0.034
1.128GlnAsn: 1.128 ± 0.033
1.907GlnPro: 1.907 ± 0.045
2.169GlnGln: 2.169 ± 0.05
2.266GlnArg: 2.266 ± 0.044
1.799GlnSer: 1.799 ± 0.04
2.139GlnThr: 2.139 ± 0.045
2.983GlnVal: 2.983 ± 0.054
0.475GlnTrp: 0.475 ± 0.02
1.145GlnTyr: 1.145 ± 0.035
0.0GlnXaa: 0.0 ± 0.0
Arg
4.212ArgAla: 4.212 ± 0.061
0.435ArgCys: 0.435 ± 0.023
2.753ArgAsp: 2.753 ± 0.049
5.098ArgGlu: 5.098 ± 0.072
2.721ArgPhe: 2.721 ± 0.049
3.369ArgGly: 3.369 ± 0.054
1.405ArgHis: 1.405 ± 0.033
4.073ArgIle: 4.073 ± 0.065
3.27ArgLys: 3.27 ± 0.055
6.641ArgLeu: 6.641 ± 0.091
1.912ArgMet: 1.912 ± 0.04
1.874ArgAsn: 1.874 ± 0.039
2.231ArgPro: 2.231 ± 0.045
2.768ArgGln: 2.768 ± 0.051
3.676ArgArg: 3.676 ± 0.068
2.862ArgSer: 2.862 ± 0.047
2.985ArgThr: 2.985 ± 0.056
4.054ArgVal: 4.054 ± 0.055
0.755ArgTrp: 0.755 ± 0.028
2.197ArgTyr: 2.197 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
4.38SerAla: 4.38 ± 0.056
0.435SerCys: 0.435 ± 0.02
2.356SerAsp: 2.356 ± 0.056
2.862SerGlu: 2.862 ± 0.049
2.527SerPhe: 2.527 ± 0.054
4.424SerGly: 4.424 ± 0.077
1.153SerHis: 1.153 ± 0.033
3.388SerIle: 3.388 ± 0.059
2.208SerLys: 2.208 ± 0.048
5.855SerLeu: 5.855 ± 0.077
1.482SerMet: 1.482 ± 0.04
1.425SerAsn: 1.425 ± 0.031
2.514SerPro: 2.514 ± 0.047
1.831SerGln: 1.831 ± 0.04
3.17SerArg: 3.17 ± 0.053
2.905SerSer: 2.905 ± 0.069
2.583SerThr: 2.583 ± 0.053
3.895SerVal: 3.895 ± 0.072
0.658SerTrp: 0.658 ± 0.026
1.769SerTyr: 1.769 ± 0.037
0.0SerXaa: 0.0 ± 0.0
Thr
4.948ThrAla: 4.948 ± 0.075
0.408ThrCys: 0.408 ± 0.02
2.573ThrAsp: 2.573 ± 0.046
3.097ThrGlu: 3.097 ± 0.06
2.284ThrPhe: 2.284 ± 0.051
4.871ThrGly: 4.871 ± 0.176
1.081ThrHis: 1.081 ± 0.032
3.654ThrIle: 3.654 ± 0.065
2.219ThrLys: 2.219 ± 0.043
5.531ThrLeu: 5.531 ± 0.067
1.336ThrMet: 1.336 ± 0.033
1.642ThrAsn: 1.642 ± 0.037
2.714ThrPro: 2.714 ± 0.05
1.576ThrGln: 1.576 ± 0.041
2.626ThrArg: 2.626 ± 0.049
2.671ThrSer: 2.671 ± 0.053
2.739ThrThr: 2.739 ± 0.063
4.526ThrVal: 4.526 ± 0.085
0.564ThrTrp: 0.564 ± 0.024
1.796ThrTyr: 1.796 ± 0.049
0.0ThrXaa: 0.0 ± 0.0
Val
6.506ValAla: 6.506 ± 0.085
0.69ValCys: 0.69 ± 0.023
3.725ValAsp: 3.725 ± 0.062
5.263ValGlu: 5.263 ± 0.076
2.74ValPhe: 2.74 ± 0.051
5.237ValGly: 5.237 ± 0.07
1.571ValHis: 1.571 ± 0.044
5.173ValIle: 5.173 ± 0.075
4.184ValLys: 4.184 ± 0.068
7.322ValLeu: 7.322 ± 0.08
1.981ValMet: 1.981 ± 0.041
2.645ValAsn: 2.645 ± 0.055
3.479ValPro: 3.479 ± 0.057
2.555ValGln: 2.555 ± 0.055
4.106ValArg: 4.106 ± 0.061
4.482ValSer: 4.482 ± 0.066
4.779ValThr: 4.779 ± 0.093
5.753ValVal: 5.753 ± 0.079
0.851ValTrp: 0.851 ± 0.02
2.267ValTyr: 2.267 ± 0.046
0.0ValXaa: 0.0 ± 0.0
Trp
0.844TrpAla: 0.844 ± 0.028
0.086TrpCys: 0.086 ± 0.009
0.518TrpAsp: 0.518 ± 0.02
0.828TrpGlu: 0.828 ± 0.028
0.517TrpPhe: 0.517 ± 0.023
0.847TrpGly: 0.847 ± 0.03
0.255TrpHis: 0.255 ± 0.016
0.763TrpIle: 0.763 ± 0.029
0.74TrpLys: 0.74 ± 0.025
1.48TrpLeu: 1.48 ± 0.043
0.384TrpMet: 0.384 ± 0.019
0.483TrpAsn: 0.483 ± 0.022
0.368TrpPro: 0.368 ± 0.019
0.608TrpGln: 0.608 ± 0.027
0.732TrpArg: 0.732 ± 0.025
0.61TrpSer: 0.61 ± 0.024
0.556TrpThr: 0.556 ± 0.023
0.776TrpVal: 0.776 ± 0.025
0.177TrpTrp: 0.177 ± 0.013
0.381TrpTyr: 0.381 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.717TyrAla: 2.717 ± 0.047
0.295TyrCys: 0.295 ± 0.017
1.749TyrAsp: 1.749 ± 0.043
2.004TyrGlu: 2.004 ± 0.047
1.29TyrPhe: 1.29 ± 0.032
2.49TyrGly: 2.49 ± 0.045
0.858TyrHis: 0.858 ± 0.024
1.659TyrIle: 1.659 ± 0.039
1.305TyrLys: 1.305 ± 0.039
3.555TyrLeu: 3.555 ± 0.058
0.684TyrMet: 0.684 ± 0.023
0.984TyrAsn: 0.984 ± 0.032
1.565TyrPro: 1.565 ± 0.04
1.439TyrGln: 1.439 ± 0.036
2.248TyrArg: 2.248 ± 0.049
1.536TyrSer: 1.536 ± 0.039
1.654TyrThr: 1.654 ± 0.039
2.289TyrVal: 2.289 ± 0.045
0.379TyrTrp: 0.379 ± 0.016
1.157TyrTyr: 1.157 ± 0.035
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3961 proteins (1184873 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski