Amino acid dipepetide frequency for Bacillus licheniformis (strain ATCC 14580 / DSM 13 / JCM 2505 / NBRC 12200 / NCIMB 9375 / NRRL NRS-1264 / Gibson 46)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.57AlaAla: 8.57 ± 0.135
0.678AlaCys: 0.678 ± 0.027
4.325AlaAsp: 4.325 ± 0.063
6.155AlaGlu: 6.155 ± 0.082
3.822AlaPhe: 3.822 ± 0.067
6.817AlaGly: 6.817 ± 0.087
1.353AlaHis: 1.353 ± 0.034
5.16AlaIle: 5.16 ± 0.086
5.293AlaLys: 5.293 ± 0.075
8.055AlaLeu: 8.055 ± 0.092
2.085AlaMet: 2.085 ± 0.052
2.601AlaAsn: 2.601 ± 0.05
2.391AlaPro: 2.391 ± 0.058
2.203AlaGln: 2.203 ± 0.044
2.919AlaArg: 2.919 ± 0.055
4.587AlaSer: 4.587 ± 0.07
2.995AlaThr: 2.995 ± 0.147
7.251AlaVal: 7.251 ± 0.091
0.659AlaTrp: 0.659 ± 0.026
2.555AlaTyr: 2.555 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.52CysAla: 0.52 ± 0.02
0.112CysCys: 0.112 ± 0.011
0.373CysAsp: 0.373 ± 0.017
0.47CysGlu: 0.47 ± 0.023
0.38CysPhe: 0.38 ± 0.019
0.754CysGly: 0.754 ± 0.028
0.211CysHis: 0.211 ± 0.013
0.544CysIle: 0.544 ± 0.022
0.335CysLys: 0.335 ± 0.017
0.831CysLeu: 0.831 ± 0.026
0.201CysMet: 0.201 ± 0.013
0.245CysAsn: 0.245 ± 0.014
0.35CysPro: 0.35 ± 0.019
0.235CysGln: 0.235 ± 0.013
0.385CysArg: 0.385 ± 0.018
0.534CysSer: 0.534 ± 0.024
0.407CysThr: 0.407 ± 0.018
0.44CysVal: 0.44 ± 0.019
0.072CysTrp: 0.072 ± 0.008
0.283CysTyr: 0.283 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
3.66AspAla: 3.66 ± 0.059
0.398AspCys: 0.398 ± 0.023
2.692AspAsp: 2.692 ± 0.052
4.474AspGlu: 4.474 ± 0.067
2.277AspPhe: 2.277 ± 0.042
3.738AspGly: 3.738 ± 0.06
1.284AspHis: 1.284 ± 0.03
4.127AspIle: 4.127 ± 0.061
3.019AspLys: 3.019 ± 0.059
4.957AspLeu: 4.957 ± 0.065
1.49AspMet: 1.49 ± 0.033
1.384AspAsn: 1.384 ± 0.037
2.13AspPro: 2.13 ± 0.042
2.222AspGln: 2.222 ± 0.047
2.557AspArg: 2.557 ± 0.05
2.561AspSer: 2.561 ± 0.06
2.298AspThr: 2.298 ± 0.046
3.624AspVal: 3.624 ± 0.055
0.609AspTrp: 0.609 ± 0.028
2.04AspTyr: 2.04 ± 0.043
0.0AspXaa: 0.0 ± 0.0
Glu
6.216GluAla: 6.216 ± 0.081
0.445GluCys: 0.445 ± 0.021
3.835GluAsp: 3.835 ± 0.061
6.675GluGlu: 6.675 ± 0.095
2.352GluPhe: 2.352 ± 0.048
4.321GluGly: 4.321 ± 0.067
1.757GluHis: 1.757 ± 0.041
4.795GluIle: 4.795 ± 0.074
6.989GluLys: 6.989 ± 0.079
7.032GluLeu: 7.032 ± 0.093
2.115GluMet: 2.115 ± 0.046
3.233GluAsn: 3.233 ± 0.06
2.191GluPro: 2.191 ± 0.043
3.28GluGln: 3.28 ± 0.059
3.953GluArg: 3.953 ± 0.065
3.453GluSer: 3.453 ± 0.051
4.159GluThr: 4.159 ± 0.064
4.252GluVal: 4.252 ± 0.072
0.862GluTrp: 0.862 ± 0.027
2.065GluTyr: 2.065 ± 0.05
0.0GluXaa: 0.0 ± 0.0
Phe
3.499PheAla: 3.499 ± 0.066
0.381PheCys: 0.381 ± 0.018
2.198PheAsp: 2.198 ± 0.036
2.794PheGlu: 2.794 ± 0.055
2.455PhePhe: 2.455 ± 0.055
3.346PheGly: 3.346 ± 0.063
1.094PheHis: 1.094 ± 0.032
3.684PheIle: 3.684 ± 0.071
2.571PheLys: 2.571 ± 0.045
4.609PheLeu: 4.609 ± 0.078
1.17PheMet: 1.17 ± 0.032
1.58PheAsn: 1.58 ± 0.041
1.66PhePro: 1.66 ± 0.037
1.722PheGln: 1.722 ± 0.035
1.755PheArg: 1.755 ± 0.04
3.377PheSer: 3.377 ± 0.067
2.487PheThr: 2.487 ± 0.051
2.941PheVal: 2.941 ± 0.065
0.505PheTrp: 0.505 ± 0.02
1.681PheTyr: 1.681 ± 0.038
0.0PheXaa: 0.0 ± 0.0
Gly
5.83GlyAla: 5.83 ± 0.172
0.675GlyCys: 0.675 ± 0.024
3.381GlyAsp: 3.381 ± 0.059
4.645GlyGlu: 4.645 ± 0.068
3.579GlyPhe: 3.579 ± 0.062
5.337GlyGly: 5.337 ± 0.087
1.481GlyHis: 1.481 ± 0.041
5.836GlyIle: 5.836 ± 0.073
5.154GlyLys: 5.154 ± 0.077
6.868GlyLeu: 6.868 ± 0.083
2.23GlyMet: 2.23 ± 0.048
2.438GlyAsn: 2.438 ± 0.048
2.032GlyPro: 2.032 ± 0.102
2.254GlyGln: 2.254 ± 0.048
3.352GlyArg: 3.352 ± 0.056
4.249GlySer: 4.249 ± 0.061
4.165GlyThr: 4.165 ± 0.066
5.0GlyVal: 5.0 ± 0.073
0.907GlyTrp: 0.907 ± 0.03
2.711GlyTyr: 2.711 ± 0.043
0.0GlyXaa: 0.0 ± 0.0
His
1.478HisAla: 1.478 ± 0.034
0.196HisCys: 0.196 ± 0.013
1.174HisAsp: 1.174 ± 0.032
1.562HisGlu: 1.562 ± 0.037
1.102HisPhe: 1.102 ± 0.033
1.473HisGly: 1.473 ± 0.038
0.742HisHis: 0.742 ± 0.027
1.598HisIle: 1.598 ± 0.04
1.027HisLys: 1.027 ± 0.03
2.203HisLeu: 2.203 ± 0.045
0.557HisMet: 0.557 ± 0.022
0.692HisAsn: 0.692 ± 0.024
1.27HisPro: 1.27 ± 0.033
0.918HisGln: 0.918 ± 0.029
0.946HisArg: 0.946 ± 0.029
1.28HisSer: 1.28 ± 0.036
1.079HisThr: 1.079 ± 0.028
1.48HisVal: 1.48 ± 0.034
0.229HisTrp: 0.229 ± 0.013
0.848HisTyr: 0.848 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
6.106IleAla: 6.106 ± 0.083
0.648IleCys: 0.648 ± 0.023
4.045IleAsp: 4.045 ± 0.059
5.3IleGlu: 5.3 ± 0.07
2.971IlePhe: 2.971 ± 0.061
5.842IleGly: 5.842 ± 0.081
1.657IleHis: 1.657 ± 0.036
4.753IleIle: 4.753 ± 0.066
4.326IleLys: 4.326 ± 0.063
6.403IleLeu: 6.403 ± 0.09
1.624IleMet: 1.624 ± 0.038
2.577IleAsn: 2.577 ± 0.049
3.15IlePro: 3.15 ± 0.055
2.523IleGln: 2.523 ± 0.047
3.165IleArg: 3.165 ± 0.049
4.813IleSer: 4.813 ± 0.08
3.741IleThr: 3.741 ± 0.066
5.068IleVal: 5.068 ± 0.073
0.648IleTrp: 0.648 ± 0.026
2.193IleTyr: 2.193 ± 0.049
0.0IleXaa: 0.0 ± 0.0
Lys
5.692LysAla: 5.692 ± 0.085
0.306LysCys: 0.306 ± 0.018
4.032LysAsp: 4.032 ± 0.064
6.843LysGlu: 6.843 ± 0.088
1.616LysPhe: 1.616 ± 0.033
4.965LysGly: 4.965 ± 0.073
1.44LysHis: 1.44 ± 0.035
4.251LysIle: 4.251 ± 0.064
6.348LysLys: 6.348 ± 0.098
5.811LysLeu: 5.811 ± 0.073
2.084LysMet: 2.084 ± 0.043
3.225LysAsn: 3.225 ± 0.058
2.601LysPro: 2.601 ± 0.046
3.253LysGln: 3.253 ± 0.058
3.993LysArg: 3.993 ± 0.061
3.517LysSer: 3.517 ± 0.06
4.176LysThr: 4.176 ± 0.06
4.083LysVal: 4.083 ± 0.072
0.834LysTrp: 0.834 ± 0.029
1.822LysTyr: 1.822 ± 0.042
0.0LysXaa: 0.0 ± 0.0
Leu
8.024LeuAla: 8.024 ± 0.1
0.741LeuCys: 0.741 ± 0.024
4.867LeuAsp: 4.867 ± 0.068
6.405LeuGlu: 6.405 ± 0.084
5.009LeuPhe: 5.009 ± 0.083
6.338LeuGly: 6.338 ± 0.084
2.002LeuHis: 2.002 ± 0.051
7.055LeuIle: 7.055 ± 0.092
7.413LeuLys: 7.413 ± 0.082
9.96LeuLeu: 9.96 ± 0.12
2.484LeuMet: 2.484 ± 0.049
4.003LeuAsn: 4.003 ± 0.074
3.94LeuPro: 3.94 ± 0.069
3.193LeuGln: 3.193 ± 0.054
3.661LeuArg: 3.661 ± 0.063
7.037LeuSer: 7.037 ± 0.083
5.657LeuThr: 5.657 ± 0.071
5.64LeuVal: 5.64 ± 0.079
0.832LeuTrp: 0.832 ± 0.03
3.22LeuTyr: 3.22 ± 0.057
0.001LeuXaa: 0.001 ± 0.001
Met
2.245MetAla: 2.245 ± 0.053
0.164MetCys: 0.164 ± 0.013
1.265MetAsp: 1.265 ± 0.03
1.682MetGlu: 1.682 ± 0.033
1.147MetPhe: 1.147 ± 0.032
1.617MetGly: 1.617 ± 0.041
0.394MetHis: 0.394 ± 0.019
2.154MetIle: 2.154 ± 0.048
2.744MetLys: 2.744 ± 0.047
2.726MetLeu: 2.726 ± 0.05
0.93MetMet: 0.93 ± 0.031
1.517MetAsn: 1.517 ± 0.038
1.087MetPro: 1.087 ± 0.03
0.832MetGln: 0.832 ± 0.029
1.087MetArg: 1.087 ± 0.034
1.687MetSer: 1.687 ± 0.033
1.785MetThr: 1.785 ± 0.035
1.487MetVal: 1.487 ± 0.032
0.196MetTrp: 0.196 ± 0.013
0.695MetTyr: 0.695 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
2.768AsnAla: 2.768 ± 0.056
0.291AsnCys: 0.291 ± 0.014
1.877AsnAsp: 1.877 ± 0.043
3.02AsnGlu: 3.02 ± 0.064
1.26AsnPhe: 1.26 ± 0.035
3.396AsnGly: 3.396 ± 0.058
0.858AsnHis: 0.858 ± 0.03
2.917AsnIle: 2.917 ± 0.049
2.376AsnLys: 2.376 ± 0.049
3.127AsnLeu: 3.127 ± 0.053
1.081AsnMet: 1.081 ± 0.03
1.349AsnAsn: 1.349 ± 0.036
1.891AsnPro: 1.891 ± 0.039
1.6AsnGln: 1.6 ± 0.043
2.144AsnArg: 2.144 ± 0.048
1.894AsnSer: 1.894 ± 0.046
1.83AsnThr: 1.83 ± 0.041
2.658AsnVal: 2.658 ± 0.048
0.45AsnTrp: 0.45 ± 0.024
1.174AsnTyr: 1.174 ± 0.031
0.0AsnXaa: 0.0 ± 0.0
Pro
3.022ProAla: 3.022 ± 0.056
0.249ProCys: 0.249 ± 0.018
2.477ProAsp: 2.477 ± 0.049
3.367ProGlu: 3.367 ± 0.061
2.063ProPhe: 2.063 ± 0.041
2.591ProGly: 2.591 ± 0.052
0.861ProHis: 0.861 ± 0.026
2.308ProIle: 2.308 ± 0.053
2.241ProLys: 2.241 ± 0.041
3.738ProLeu: 3.738 ± 0.063
0.758ProMet: 0.758 ± 0.025
1.443ProAsn: 1.443 ± 0.034
1.098ProPro: 1.098 ± 0.034
1.088ProGln: 1.088 ± 0.032
1.161ProArg: 1.161 ± 0.032
2.313ProSer: 2.313 ± 0.046
1.645ProThr: 1.645 ± 0.1
3.232ProVal: 3.232 ± 0.065
0.355ProTrp: 0.355 ± 0.02
1.495ProTyr: 1.495 ± 0.041
0.0ProXaa: 0.0 ± 0.0
Gln
2.914GlnAla: 2.914 ± 0.051
0.206GlnCys: 0.206 ± 0.014
1.625GlnAsp: 1.625 ± 0.036
2.503GlnGlu: 2.503 ± 0.048
1.495GlnPhe: 1.495 ± 0.029
1.933GlnGly: 1.933 ± 0.043
0.828GlnHis: 0.828 ± 0.025
2.463GlnIle: 2.463 ± 0.05
3.029GlnLys: 3.029 ± 0.051
3.739GlnLeu: 3.739 ± 0.062
1.135GlnMet: 1.135 ± 0.032
1.649GlnAsn: 1.649 ± 0.045
1.275GlnPro: 1.275 ± 0.036
1.552GlnGln: 1.552 ± 0.05
1.389GlnArg: 1.389 ± 0.032
2.154GlnSer: 2.154 ± 0.047
2.246GlnThr: 2.246 ± 0.051
2.016GlnVal: 2.016 ± 0.048
0.353GlnTrp: 0.353 ± 0.019
1.344GlnTyr: 1.344 ± 0.036
0.0GlnXaa: 0.0 ± 0.0
Arg
2.775ArgAla: 2.775 ± 0.052
0.298ArgCys: 0.298 ± 0.017
2.176ArgAsp: 2.176 ± 0.042
3.451ArgGlu: 3.451 ± 0.059
2.282ArgPhe: 2.282 ± 0.049
2.477ArgGly: 2.477 ± 0.056
1.079ArgHis: 1.079 ± 0.029
3.133ArgIle: 3.133 ± 0.061
3.449ArgLys: 3.449 ± 0.061
4.766ArgLeu: 4.766 ± 0.076
1.39ArgMet: 1.39 ± 0.033
1.76ArgAsn: 1.76 ± 0.04
1.54ArgPro: 1.54 ± 0.042
1.76ArgGln: 1.76 ± 0.044
2.238ArgArg: 2.238 ± 0.055
2.43ArgSer: 2.43 ± 0.055
2.294ArgThr: 2.294 ± 0.049
2.564ArgVal: 2.564 ± 0.05
0.417ArgTrp: 0.417 ± 0.019
1.679ArgTyr: 1.679 ± 0.043
0.0ArgXaa: 0.0 ± 0.0
Ser
4.813SerAla: 4.813 ± 0.071
0.485SerCys: 0.485 ± 0.021
2.819SerAsp: 2.819 ± 0.055
3.917SerGlu: 3.917 ± 0.07
3.456SerPhe: 3.456 ± 0.044
5.06SerGly: 5.06 ± 0.077
1.26SerHis: 1.26 ± 0.028
4.504SerIle: 4.504 ± 0.063
3.641SerLys: 3.641 ± 0.055
6.237SerLeu: 6.237 ± 0.078
1.757SerMet: 1.757 ± 0.039
1.909SerAsn: 1.909 ± 0.039
2.228SerPro: 2.228 ± 0.043
1.846SerGln: 1.846 ± 0.039
2.586SerArg: 2.586 ± 0.053
3.969SerSer: 3.969 ± 0.073
2.582SerThr: 2.582 ± 0.051
4.368SerVal: 4.368 ± 0.06
0.653SerTrp: 0.653 ± 0.026
2.189SerTyr: 2.189 ± 0.04
0.0SerXaa: 0.0 ± 0.0
Thr
4.922ThrAla: 4.922 ± 0.058
0.365ThrCys: 0.365 ± 0.017
2.807ThrAsp: 2.807 ± 0.047
3.58ThrGlu: 3.58 ± 0.063
2.559ThrPhe: 2.559 ± 0.055
4.882ThrGly: 4.882 ± 0.288
0.971ThrHis: 0.971 ± 0.027
3.852ThrIle: 3.852 ± 0.06
3.211ThrLys: 3.211 ± 0.061
4.853ThrLeu: 4.853 ± 0.061
1.208ThrMet: 1.208 ± 0.03
1.973ThrAsn: 1.973 ± 0.042
2.187ThrPro: 2.187 ± 0.046
1.151ThrGln: 1.151 ± 0.034
1.732ThrArg: 1.732 ± 0.036
3.03ThrSer: 3.03 ± 0.055
2.383ThrThr: 2.383 ± 0.047
4.256ThrVal: 4.256 ± 0.058
0.508ThrTrp: 0.508 ± 0.022
1.825ThrTyr: 1.825 ± 0.04
0.0ThrXaa: 0.0 ± 0.0
Val
4.492ValAla: 4.492 ± 0.071
0.662ValCys: 0.662 ± 0.022
3.123ValAsp: 3.123 ± 0.052
3.971ValGlu: 3.971 ± 0.064
3.381ValPhe: 3.381 ± 0.06
4.071ValGly: 4.071 ± 0.059
1.478ValHis: 1.478 ± 0.033
5.403ValIle: 5.403 ± 0.074
4.977ValLys: 4.977 ± 0.07
7.167ValLeu: 7.167 ± 0.078
1.935ValMet: 1.935 ± 0.04
2.717ValAsn: 2.717 ± 0.047
2.773ValPro: 2.773 ± 0.048
2.361ValGln: 2.361 ± 0.04
2.77ValArg: 2.77 ± 0.053
4.795ValSer: 4.795 ± 0.054
3.908ValThr: 3.908 ± 0.063
4.39ValVal: 4.39 ± 0.065
0.685ValTrp: 0.685 ± 0.024
2.417ValTyr: 2.417 ± 0.043
0.0ValXaa: 0.0 ± 0.0
Trp
0.667TrpAla: 0.667 ± 0.024
0.086TrpCys: 0.086 ± 0.01
0.511TrpAsp: 0.511 ± 0.021
0.634TrpGlu: 0.634 ± 0.024
0.536TrpPhe: 0.536 ± 0.021
0.673TrpGly: 0.673 ± 0.025
0.229TrpHis: 0.229 ± 0.014
0.771TrpIle: 0.771 ± 0.027
0.754TrpLys: 0.754 ± 0.025
1.217TrpLeu: 1.217 ± 0.035
0.316TrpMet: 0.316 ± 0.016
0.506TrpAsn: 0.506 ± 0.022
0.282TrpPro: 0.282 ± 0.015
0.336TrpGln: 0.336 ± 0.016
0.458TrpArg: 0.458 ± 0.021
0.612TrpSer: 0.612 ± 0.025
0.605TrpThr: 0.605 ± 0.024
0.614TrpVal: 0.614 ± 0.023
0.137TrpTrp: 0.137 ± 0.013
0.352TrpTyr: 0.352 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.261TyrAla: 2.261 ± 0.043
0.297TyrCys: 0.297 ± 0.015
1.935TyrAsp: 1.935 ± 0.046
2.533TyrGlu: 2.533 ± 0.047
1.767TyrPhe: 1.767 ± 0.04
2.493TyrGly: 2.493 ± 0.046
0.874TyrHis: 0.874 ± 0.032
2.244TyrIle: 2.244 ± 0.047
1.995TyrLys: 1.995 ± 0.043
3.184TyrLeu: 3.184 ± 0.055
0.891TyrMet: 0.891 ± 0.026
1.16TyrAsn: 1.16 ± 0.028
1.429TyrPro: 1.429 ± 0.037
1.518TyrGln: 1.518 ± 0.042
1.706TyrArg: 1.706 ± 0.04
1.988TyrSer: 1.988 ± 0.044
1.816TyrThr: 1.816 ± 0.047
2.11TyrVal: 2.11 ± 0.04
0.386TyrTrp: 0.386 ± 0.02
1.295TyrTyr: 1.295 ± 0.037
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.001
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4164 proteins (1201246 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski