Amino acid dipeptide frequency for Bacillus sp. AFS018417

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
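As a minimal sketch of how such frequencies can be computed, the Python snippet below counts every overlapping pair of adjacent residues within each protein and normalises by the total number of pairs. The function name `dipeptide_frequencies`, the one-letter input format, and the choice not to count pairs across protein boundaries are assumptions made for illustration; the pipeline actually used for this table is not described here.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all adjacent residue pairs.

    `proteins` is an iterable of one-letter amino acid strings (assumed format).
    Pairs are counted within each protein only, never across two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every overlapping pair of neighbours.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
freqs = dipeptide_frequencies(["MKLLA", "MKV"])
```

Multiplying each fraction by 1000 would give values on the same per mille scale as the table.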

There are 400 possibilities (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
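The comparison against the random expectation can be sketched as follows. The snippet works directly in per mille, where the uniform expectation for 400 dipeptides is 2.5‰, and also shows the conversion to a plain fraction. The function names and the simple above/below rule are assumptions for illustration; the exact criterion behind the red and blue marking is not stated. The example values (LeuLeu, AlaAsp, CysCys) are taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille (‰) value to a plain fraction."""
    return value_per_mille * 1e-3


def uniform_expectation_per_mille(n_residue_types=20):
    """Per mille frequency of each dipeptide if all n*n pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def label(value_per_mille, n_residue_types=20):
    """Return 'red' above the uniform expectation, 'blue' below it, else 'none'."""
    expected = uniform_expectation_per_mille(n_residue_types)
    if value_per_mille > expected:
        return "red"
    if value_per_mille < expected:
        return "blue"
    return "none"


# Values copied from the table (in ‰).
examples = {"LeuLeu": 10.263, "AlaAsp": 2.949, "CysCys": 0.137}
labels = {pair: label(value) for pair, value in examples.items()}
```

Passing `n_residue_types=21` gives the expectation for the 441-dipeptide case that includes Xaa.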

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.609 ± 0.09
AlaCys: 0.674 ± 0.024
AlaAsp: 2.949 ± 0.048
AlaGlu: 4.358 ± 0.065
AlaPhe: 3.323 ± 0.057
AlaGly: 4.909 ± 0.08
AlaHis: 1.439 ± 0.035
AlaIle: 5.805 ± 0.081
AlaLys: 4.958 ± 0.075
AlaLeu: 7.171 ± 0.088
AlaMet: 2.089 ± 0.043
AlaAsn: 2.829 ± 0.054
AlaPro: 2.068 ± 0.045
AlaGln: 2.223 ± 0.047
AlaArg: 2.455 ± 0.047
AlaSer: 3.815 ± 0.058
AlaThr: 3.59 ± 0.057
AlaVal: 5.317 ± 0.074
AlaTrp: 0.648 ± 0.024
AlaTyr: 2.524 ± 0.043
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.503 ± 0.021
CysCys: 0.137 ± 0.011
CysAsp: 0.412 ± 0.017
CysGlu: 0.534 ± 0.024
CysPhe: 0.465 ± 0.018
CysGly: 0.764 ± 0.028
CysHis: 0.229 ± 0.013
CysIle: 0.772 ± 0.029
CysLys: 0.475 ± 0.019
CysLeu: 0.807 ± 0.025
CysMet: 0.251 ± 0.014
CysAsn: 0.373 ± 0.017
CysPro: 0.352 ± 0.018
CysGln: 0.264 ± 0.014
CysArg: 0.319 ± 0.015
CysSer: 0.567 ± 0.024
CysThr: 0.499 ± 0.022
CysVal: 0.55 ± 0.022
CysTrp: 0.082 ± 0.01
CysTyr: 0.323 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.162 ± 0.061
AspCys: 0.404 ± 0.02
AspAsp: 2.053 ± 0.047
AspGlu: 3.881 ± 0.064
AspPhe: 2.217 ± 0.049
AspGly: 3.149 ± 0.058
AspHis: 0.959 ± 0.028
AspIle: 4.074 ± 0.058
AspLys: 3.061 ± 0.057
AspLeu: 4.289 ± 0.062
AspMet: 1.298 ± 0.033
AspAsn: 1.737 ± 0.035
AspPro: 1.66 ± 0.039
AspGln: 1.494 ± 0.036
AspArg: 1.874 ± 0.04
AspSer: 2.285 ± 0.039
AspThr: 2.321 ± 0.04
AspVal: 3.827 ± 0.066
AspTrp: 0.582 ± 0.025
AspTyr: 1.972 ± 0.044
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.082 ± 0.076
GluCys: 0.459 ± 0.021
GluAsp: 3.41 ± 0.055
GluGlu: 7.08 ± 0.104
GluPhe: 2.402 ± 0.048
GluGly: 4.217 ± 0.065
GluHis: 1.643 ± 0.038
GluIle: 5.377 ± 0.077
GluLys: 7.016 ± 0.098
GluLeu: 6.742 ± 0.074
GluMet: 2.279 ± 0.043
GluAsn: 3.523 ± 0.056
GluPro: 1.768 ± 0.035
GluGln: 3.835 ± 0.064
GluArg: 3.547 ± 0.07
GluSer: 3.144 ± 0.051
GluThr: 3.928 ± 0.066
GluVal: 5.137 ± 0.072
GluTrp: 0.803 ± 0.026
GluTyr: 2.42 ± 0.045
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.274 ± 0.052
PheCys: 0.418 ± 0.018
PheAsp: 2.065 ± 0.04
PheGlu: 2.688 ± 0.047
PhePhe: 2.484 ± 0.061
PheGly: 3.343 ± 0.058
PheHis: 1.186 ± 0.034
PheIle: 4.015 ± 0.065
PheLys: 2.202 ± 0.042
PheLeu: 4.784 ± 0.072
PheMet: 1.233 ± 0.038
PheAsn: 1.654 ± 0.039
PhePro: 1.687 ± 0.036
PheGln: 1.858 ± 0.041
PheArg: 1.619 ± 0.039
PheSer: 3.299 ± 0.058
PheThr: 2.762 ± 0.046
PheVal: 3.559 ± 0.059
PheTrp: 0.528 ± 0.025
PheTyr: 1.802 ± 0.04
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.855 ± 0.074
GlyCys: 0.71 ± 0.024
GlyAsp: 3.004 ± 0.057
GlyGlu: 4.369 ± 0.068
GlyPhe: 3.328 ± 0.053
GlyGly: 4.706 ± 0.072
GlyHis: 1.348 ± 0.034
GlyIle: 5.982 ± 0.087
GlyLys: 5.234 ± 0.065
GlyLeu: 6.195 ± 0.078
GlyMet: 2.154 ± 0.05
GlyAsn: 2.768 ± 0.058
GlyPro: 1.621 ± 0.035
GlyGln: 2.152 ± 0.043
GlyArg: 2.472 ± 0.054
GlySer: 3.619 ± 0.054
GlyThr: 4.145 ± 0.058
GlyVal: 5.034 ± 0.08
GlyTrp: 0.845 ± 0.027
GlyTyr: 2.77 ± 0.052
GlyXaa: 0.002 ± 0.001
His
HisAla: 1.399 ± 0.032
HisCys: 0.219 ± 0.014
HisAsp: 1.09 ± 0.029
HisGlu: 1.522 ± 0.032
HisPhe: 1.143 ± 0.031
HisGly: 1.389 ± 0.038
HisHis: 0.712 ± 0.026
HisIle: 1.926 ± 0.048
HisLys: 1.189 ± 0.028
HisLeu: 2.131 ± 0.042
HisMet: 0.616 ± 0.022
HisAsn: 0.909 ± 0.028
HisPro: 1.166 ± 0.034
HisGln: 0.769 ± 0.021
HisArg: 0.957 ± 0.028
HisSer: 1.235 ± 0.032
HisThr: 1.338 ± 0.038
HisVal: 1.677 ± 0.036
HisTrp: 0.23 ± 0.015
HisTyr: 0.904 ± 0.027
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.042 ± 0.084
IleCys: 0.824 ± 0.027
IleAsp: 4.007 ± 0.056
IleGlu: 5.752 ± 0.083
IlePhe: 3.511 ± 0.062
IleGly: 6.199 ± 0.079
IleHis: 1.834 ± 0.037
IleIle: 6.088 ± 0.09
IleLys: 4.249 ± 0.068
IleLeu: 7.235 ± 0.092
IleMet: 1.948 ± 0.037
IleAsn: 2.947 ± 0.053
IlePro: 3.484 ± 0.063
IleGln: 3.111 ± 0.05
IleArg: 3.096 ± 0.049
IleSer: 4.875 ± 0.065
IleThr: 4.734 ± 0.069
IleVal: 6.119 ± 0.079
IleTrp: 0.723 ± 0.03
IleTyr: 2.472 ± 0.041
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.456 ± 0.069
LysCys: 0.404 ± 0.02
LysAsp: 3.667 ± 0.063
LysGlu: 7.34 ± 0.103
LysPhe: 2.092 ± 0.042
LysGly: 4.492 ± 0.065
LysHis: 1.51 ± 0.032
LysIle: 4.712 ± 0.06
LysLys: 6.232 ± 0.087
LysLeu: 5.869 ± 0.078
LysMet: 2.425 ± 0.041
LysAsn: 3.292 ± 0.059
LysPro: 2.221 ± 0.05
LysGln: 4.015 ± 0.068
LysArg: 3.574 ± 0.063
LysSer: 3.323 ± 0.058
LysThr: 3.763 ± 0.055
LysVal: 4.707 ± 0.065
LysTrp: 0.865 ± 0.027
LysTyr: 2.45 ± 0.047
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.926 ± 0.092
LeuCys: 0.934 ± 0.027
LeuAsp: 4.31 ± 0.06
LeuGlu: 6.478 ± 0.095
LeuPhe: 4.987 ± 0.079
LeuGly: 6.388 ± 0.082
LeuHis: 2.313 ± 0.048
LeuIle: 6.716 ± 0.094
LeuLys: 6.105 ± 0.069
LeuLeu: 10.263 ± 0.122
LeuMet: 2.354 ± 0.042
LeuAsn: 3.746 ± 0.055
LeuPro: 3.889 ± 0.05
LeuGln: 4.7 ± 0.067
LeuArg: 3.687 ± 0.066
LeuSer: 6.245 ± 0.078
LeuThr: 5.39 ± 0.06
LeuVal: 6.22 ± 0.087
LeuTrp: 0.854 ± 0.029
LeuTyr: 3.498 ± 0.062
LeuXaa: 0.001 ± 0.001
Met
MetAla: 1.815 ± 0.035
MetCys: 0.188 ± 0.013
MetAsp: 1.26 ± 0.031
MetGlu: 2.013 ± 0.045
MetPhe: 1.179 ± 0.033
MetGly: 1.712 ± 0.043
MetHis: 0.516 ± 0.021
MetIle: 2.22 ± 0.041
MetLys: 2.898 ± 0.048
MetLeu: 2.789 ± 0.051
MetMet: 1.002 ± 0.03
MetAsn: 1.666 ± 0.036
MetPro: 1.058 ± 0.033
MetGln: 1.149 ± 0.032
MetArg: 1.199 ± 0.032
MetSer: 1.618 ± 0.04
MetThr: 1.582 ± 0.033
MetVal: 1.672 ± 0.032
MetTrp: 0.222 ± 0.014
MetTyr: 0.956 ± 0.028
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.695 ± 0.049
AsnCys: 0.357 ± 0.016
AsnAsp: 2.009 ± 0.043
AsnGlu: 3.489 ± 0.057
AsnPhe: 1.68 ± 0.035
AsnGly: 3.279 ± 0.051
AsnHis: 1.039 ± 0.031
AsnIle: 3.68 ± 0.049
AsnLys: 3.087 ± 0.06
AsnLeu: 3.638 ± 0.055
AsnMet: 1.299 ± 0.035
AsnAsn: 1.979 ± 0.059
AsnPro: 2.023 ± 0.041
AsnGln: 1.716 ± 0.035
AsnArg: 1.94 ± 0.037
AsnSer: 2.14 ± 0.044
AsnThr: 2.232 ± 0.046
AsnVal: 3.227 ± 0.057
AsnTrp: 0.533 ± 0.018
AsnTyr: 1.563 ± 0.038
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.234 ± 0.047
ProCys: 0.24 ± 0.013
ProAsp: 1.677 ± 0.039
ProGlu: 2.591 ± 0.044
ProPhe: 2.015 ± 0.043
ProGly: 2.161 ± 0.05
ProHis: 0.861 ± 0.025
ProIle: 2.911 ± 0.055
ProLys: 2.35 ± 0.04
ProLeu: 3.473 ± 0.049
ProMet: 0.804 ± 0.025
ProAsn: 1.788 ± 0.04
ProPro: 0.958 ± 0.032
ProGln: 1.12 ± 0.03
ProArg: 1.082 ± 0.029
ProSer: 2.228 ± 0.045
ProThr: 2.004 ± 0.04
ProVal: 2.727 ± 0.048
ProTrp: 0.377 ± 0.017
ProTyr: 1.518 ± 0.043
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.751 ± 0.059
GlnCys: 0.251 ± 0.015
GlnAsp: 1.728 ± 0.039
GlnGlu: 3.172 ± 0.06
GlnPhe: 1.732 ± 0.035
GlnGly: 2.338 ± 0.044
GlnHis: 0.976 ± 0.026
GlnIle: 2.814 ± 0.045
GlnLys: 3.289 ± 0.059
GlnLeu: 3.982 ± 0.064
GlnMet: 1.174 ± 0.033
GlnAsn: 1.88 ± 0.039
GlnPro: 1.295 ± 0.036
GlnGln: 2.08 ± 0.06
GlnArg: 1.525 ± 0.035
GlnSer: 2.156 ± 0.043
GlnThr: 2.202 ± 0.04
GlnVal: 2.547 ± 0.045
GlnTrp: 0.44 ± 0.021
GlnTyr: 1.59 ± 0.038
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.449 ± 0.047
ArgCys: 0.297 ± 0.018
ArgAsp: 2.0 ± 0.04
ArgGlu: 3.15 ± 0.053
ArgPhe: 1.948 ± 0.047
ArgGly: 2.294 ± 0.049
ArgHis: 0.847 ± 0.03
ArgIle: 3.081 ± 0.057
ArgLys: 3.316 ± 0.053
ArgLeu: 3.754 ± 0.06
ArgMet: 1.299 ± 0.029
ArgAsn: 1.982 ± 0.045
ArgPro: 1.216 ± 0.03
ArgGln: 1.525 ± 0.038
ArgArg: 1.688 ± 0.043
ArgSer: 2.064 ± 0.044
ArgThr: 2.1 ± 0.042
ArgVal: 2.606 ± 0.049
ArgTrp: 0.453 ± 0.019
ArgTyr: 1.655 ± 0.041
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 3.545 ± 0.066
SerCys: 0.552 ± 0.024
SerAsp: 2.38 ± 0.04
SerGlu: 3.385 ± 0.055
SerPhe: 3.361 ± 0.064
SerGly: 3.883 ± 0.067
SerHis: 1.304 ± 0.03
SerIle: 4.892 ± 0.075
SerLys: 3.705 ± 0.054
SerLeu: 5.874 ± 0.08
SerMet: 1.691 ± 0.034
SerAsn: 2.46 ± 0.052
SerPro: 2.021 ± 0.044
SerGln: 1.928 ± 0.039
SerArg: 2.118 ± 0.042
SerSer: 3.615 ± 0.068
SerThr: 2.955 ± 0.053
SerVal: 3.944 ± 0.062
SerTrp: 0.621 ± 0.022
SerTyr: 2.364 ± 0.05
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 3.899 ± 0.068
ThrCys: 0.464 ± 0.02
ThrAsp: 2.469 ± 0.048
ThrGlu: 3.576 ± 0.063
ThrPhe: 2.862 ± 0.051
ThrGly: 4.041 ± 0.067
ThrHis: 1.191 ± 0.034
ThrIle: 4.778 ± 0.061
ThrLys: 3.783 ± 0.056
ThrLeu: 5.503 ± 0.056
ThrMet: 1.454 ± 0.037
ThrAsn: 2.746 ± 0.048
ThrPro: 2.211 ± 0.048
ThrGln: 1.545 ± 0.035
ThrArg: 1.781 ± 0.041
ThrSer: 3.219 ± 0.049
ThrThr: 3.076 ± 0.059
ThrVal: 4.395 ± 0.063
ThrTrp: 0.58 ± 0.025
ThrTyr: 2.171 ± 0.041
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.124 ± 0.081
ValCys: 0.693 ± 0.023
ValAsp: 3.37 ± 0.061
ValGlu: 4.85 ± 0.078
ValPhe: 3.256 ± 0.058
ValGly: 4.703 ± 0.071
ValHis: 1.526 ± 0.037
ValIle: 5.637 ± 0.077
ValLys: 4.952 ± 0.081
ValLeu: 7.05 ± 0.072
ValMet: 1.946 ± 0.041
ValAsn: 3.031 ± 0.055
ValPro: 2.79 ± 0.057
ValGln: 2.741 ± 0.051
ValArg: 2.727 ± 0.048
ValSer: 4.501 ± 0.062
ValThr: 4.43 ± 0.066
ValVal: 5.387 ± 0.076
ValTrp: 0.724 ± 0.026
ValTyr: 2.528 ± 0.047
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.584 ± 0.024
TrpCys: 0.106 ± 0.009
TrpAsp: 0.511 ± 0.021
TrpGlu: 0.694 ± 0.023
TrpPhe: 0.559 ± 0.024
TrpGly: 0.731 ± 0.026
TrpHis: 0.199 ± 0.012
TrpIle: 0.948 ± 0.028
TrpLys: 0.879 ± 0.025
TrpLeu: 1.198 ± 0.034
TrpMet: 0.343 ± 0.019
TrpAsn: 0.618 ± 0.023
TrpPro: 0.238 ± 0.013
TrpGln: 0.363 ± 0.018
TrpArg: 0.436 ± 0.019
TrpSer: 0.573 ± 0.023
TrpThr: 0.516 ± 0.021
TrpVal: 0.65 ± 0.025
TrpTrp: 0.14 ± 0.009
TrpTyr: 0.373 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.346 ± 0.048
TyrCys: 0.35 ± 0.017
TyrAsp: 1.924 ± 0.042
TyrGlu: 2.886 ± 0.059
TyrPhe: 1.916 ± 0.04
TyrGly: 2.616 ± 0.043
TyrHis: 0.891 ± 0.03
TyrIle: 2.822 ± 0.05
TyrLys: 2.429 ± 0.047
TyrLeu: 3.261 ± 0.051
TyrMet: 1.056 ± 0.031
TyrAsn: 1.676 ± 0.046
TyrPro: 1.394 ± 0.035
TyrGln: 1.333 ± 0.034
TyrArg: 1.627 ± 0.043
TyrSer: 2.076 ± 0.044
TyrThr: 2.178 ± 0.048
TyrVal: 2.665 ± 0.045
TyrTrp: 0.409 ± 0.021
TyrTyr: 1.588 ± 0.042
TyrXaa: 0.001 ± 0.001
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.001 ± 0.001
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.001 ± 0.001
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.001 ± 0.001
XaaArg: 0.001 ± 0.001
XaaSer: 0.001 ± 0.001
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.002 ± 0.001
Statistics based on 4515 proteins (1237217 amino acids)
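For readers who want to work with the listed values programmatically, each entry above contains the pattern `AlaAla: 5.609 ± 0.09` (first residue, second residue, value in ‰, error). A minimal parsing sketch follows; the function name `parse_entry` and the regular expression are assumptions derived from that pattern, not part of any Proteome-pI tooling.

```python
import re

# Two three-letter residue codes, then "value ± error".
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)


def parse_entry(line):
    """Return (first, second, value_per_mille, error) or None for other lines."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


# Example lines in the style of the table; the row label yields None.
rows = [parse_entry(text) for text in ["Leu", "LeuLeu: 10.263 ± 0.122"]]
```

Because the pattern is searched for anywhere in the line, any text preceding the dipeptide name is ignored.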

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
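A minimal sketch of protein-level bootstrapping, illustrating the general technique rather than the exact procedure used for this table: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates serves as the error estimate. The function names, the fixed seed, the one-letter input format, and the use of the standard deviation as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of one dipeptide frequency in ‰.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
mean_ll, err_ll = bootstrap_error(["MKLLA", "MLLLV", "MKV", "MAAGLL"], "LL")
```

Resampling whole proteins keeps the pairs from one protein together in every replicate, which is what distinguishes this from resampling individual dipeptides.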

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski