Amino acid dipepetide frequency for Bacillus selenitireducens (strain ATCC 700615 / DSM 15326 / MLS10)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.619AlaAla: 6.619 ± 0.113
0.695AlaCys: 0.695 ± 0.03
4.595AlaAsp: 4.595 ± 0.08
5.512AlaGlu: 5.512 ± 0.091
3.799AlaPhe: 3.799 ± 0.065
6.517AlaGly: 6.517 ± 0.102
1.451AlaHis: 1.451 ± 0.039
5.453AlaIle: 5.453 ± 0.092
3.957AlaLys: 3.957 ± 0.083
7.802AlaLeu: 7.802 ± 0.089
2.707AlaMet: 2.707 ± 0.054
2.594AlaAsn: 2.594 ± 0.057
2.288AlaPro: 2.288 ± 0.053
2.038AlaGln: 2.038 ± 0.047
3.354AlaArg: 3.354 ± 0.06
4.632AlaSer: 4.632 ± 0.067
3.549AlaThr: 3.549 ± 0.071
6.685AlaVal: 6.685 ± 0.095
0.73AlaTrp: 0.73 ± 0.029
2.438AlaTyr: 2.438 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.391CysAla: 0.391 ± 0.024
0.066CysCys: 0.066 ± 0.009
0.341CysAsp: 0.341 ± 0.02
0.396CysGlu: 0.396 ± 0.023
0.221CysPhe: 0.221 ± 0.016
0.64CysGly: 0.64 ± 0.025
0.198CysHis: 0.198 ± 0.016
0.338CysIle: 0.338 ± 0.018
0.289CysLys: 0.289 ± 0.017
0.463CysLeu: 0.463 ± 0.022
0.148CysMet: 0.148 ± 0.014
0.214CysAsn: 0.214 ± 0.016
0.374CysPro: 0.374 ± 0.022
0.234CysGln: 0.234 ± 0.014
0.361CysArg: 0.361 ± 0.02
0.417CysSer: 0.417 ± 0.021
0.322CysThr: 0.322 ± 0.019
0.35CysVal: 0.35 ± 0.019
0.039CysTrp: 0.039 ± 0.006
0.186CysTyr: 0.186 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
4.631AspAla: 4.631 ± 0.077
0.305AspCys: 0.305 ± 0.019
3.904AspAsp: 3.904 ± 0.078
5.929AspGlu: 5.929 ± 0.109
2.526AspPhe: 2.526 ± 0.052
4.772AspGly: 4.772 ± 0.087
1.738AspHis: 1.738 ± 0.052
3.923AspIle: 3.923 ± 0.062
2.419AspLys: 2.419 ± 0.053
5.818AspLeu: 5.818 ± 0.077
1.756AspMet: 1.756 ± 0.048
1.673AspAsn: 1.673 ± 0.045
2.71AspPro: 2.71 ± 0.063
2.932AspGln: 2.932 ± 0.06
3.387AspArg: 3.387 ± 0.053
2.819AspSer: 2.819 ± 0.057
3.042AspThr: 3.042 ± 0.055
4.974AspVal: 4.974 ± 0.073
0.781AspTrp: 0.781 ± 0.03
2.269AspTyr: 2.269 ± 0.043
0.0AspXaa: 0.0 ± 0.0
Glu
7.273GluAla: 7.273 ± 0.107
0.275GluCys: 0.275 ± 0.019
5.051GluAsp: 5.051 ± 0.105
8.092GluGlu: 8.092 ± 0.153
2.162GluPhe: 2.162 ± 0.049
5.262GluGly: 5.262 ± 0.085
1.589GluHis: 1.589 ± 0.041
4.744GluIle: 4.744 ± 0.074
4.593GluLys: 4.593 ± 0.077
7.111GluLeu: 7.111 ± 0.102
2.56GluMet: 2.56 ± 0.058
2.944GluAsn: 2.944 ± 0.066
2.646GluPro: 2.646 ± 0.063
3.661GluGln: 3.661 ± 0.072
4.792GluArg: 4.792 ± 0.087
3.97GluSer: 3.97 ± 0.068
4.754GluThr: 4.754 ± 0.073
5.181GluVal: 5.181 ± 0.077
1.011GluTrp: 1.011 ± 0.03
1.783GluTyr: 1.783 ± 0.041
0.0GluXaa: 0.0 ± 0.0
Phe
3.105PheAla: 3.105 ± 0.064
0.279PheCys: 0.279 ± 0.014
2.89PheAsp: 2.89 ± 0.055
3.034PheGlu: 3.034 ± 0.057
2.208PhePhe: 2.208 ± 0.056
3.207PheGly: 3.207 ± 0.066
1.068PheHis: 1.068 ± 0.036
3.11PheIle: 3.11 ± 0.071
1.794PheLys: 1.794 ± 0.044
4.152PheLeu: 4.152 ± 0.088
1.278PheMet: 1.278 ± 0.037
1.516PheAsn: 1.516 ± 0.041
1.648PhePro: 1.648 ± 0.042
1.548PheGln: 1.548 ± 0.045
2.007PheArg: 2.007 ± 0.046
3.057PheSer: 3.057 ± 0.064
2.721PheThr: 2.721 ± 0.053
2.942PheVal: 2.942 ± 0.064
0.42PheTrp: 0.42 ± 0.022
1.454PheTyr: 1.454 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
5.349GlyAla: 5.349 ± 0.09
0.534GlyCys: 0.534 ± 0.025
4.309GlyAsp: 4.309 ± 0.066
5.358GlyGlu: 5.358 ± 0.083
3.576GlyPhe: 3.576 ± 0.064
5.182GlyGly: 5.182 ± 0.095
1.564GlyHis: 1.564 ± 0.043
5.539GlyIle: 5.539 ± 0.078
4.085GlyLys: 4.085 ± 0.064
6.77GlyLeu: 6.77 ± 0.089
2.497GlyMet: 2.497 ± 0.048
2.592GlyAsn: 2.592 ± 0.056
1.894GlyPro: 1.894 ± 0.039
2.425GlyGln: 2.425 ± 0.049
3.358GlyArg: 3.358 ± 0.061
4.316GlySer: 4.316 ± 0.065
4.369GlyThr: 4.369 ± 0.065
5.327GlyVal: 5.327 ± 0.074
0.756GlyTrp: 0.756 ± 0.029
2.766GlyTyr: 2.766 ± 0.053
0.0GlyXaa: 0.0 ± 0.0
His
1.708HisAla: 1.708 ± 0.045
0.212HisCys: 0.212 ± 0.016
1.487HisAsp: 1.487 ± 0.05
1.774HisGlu: 1.774 ± 0.042
1.059HisPhe: 1.059 ± 0.033
1.532HisGly: 1.532 ± 0.042
0.835HisHis: 0.835 ± 0.034
1.435HisIle: 1.435 ± 0.039
0.901HisLys: 0.901 ± 0.03
2.317HisLeu: 2.317 ± 0.05
0.615HisMet: 0.615 ± 0.026
0.704HisAsn: 0.704 ± 0.031
1.263HisPro: 1.263 ± 0.042
0.977HisGln: 0.977 ± 0.033
1.13HisArg: 1.13 ± 0.03
1.13HisSer: 1.13 ± 0.037
1.123HisThr: 1.123 ± 0.036
1.71HisVal: 1.71 ± 0.042
0.243HisTrp: 0.243 ± 0.016
0.949HisTyr: 0.949 ± 0.035
0.0HisXaa: 0.0 ± 0.0
Ile
5.432IleAla: 5.432 ± 0.08
0.427IleCys: 0.427 ± 0.024
4.436IleAsp: 4.436 ± 0.076
5.113IleGlu: 5.113 ± 0.07
2.445IlePhe: 2.445 ± 0.064
5.341IleGly: 5.341 ± 0.091
1.709IleHis: 1.709 ± 0.04
4.304IleIle: 4.304 ± 0.077
2.729IleLys: 2.729 ± 0.058
6.086IleLeu: 6.086 ± 0.095
1.762IleMet: 1.762 ± 0.043
2.357IleAsn: 2.357 ± 0.051
3.138IlePro: 3.138 ± 0.051
2.696IleGln: 2.696 ± 0.057
3.952IleArg: 3.952 ± 0.072
4.285IleSer: 4.285 ± 0.075
4.037IleThr: 4.037 ± 0.067
4.692IleVal: 4.692 ± 0.078
0.571IleTrp: 0.571 ± 0.024
1.938IleTyr: 1.938 ± 0.047
0.0IleXaa: 0.0 ± 0.0
Lys
4.26LysAla: 4.26 ± 0.081
0.232LysCys: 0.232 ± 0.016
3.166LysAsp: 3.166 ± 0.064
5.174LysGlu: 5.174 ± 0.083
1.127LysPhe: 1.127 ± 0.034
3.75LysGly: 3.75 ± 0.076
1.136LysHis: 1.136 ± 0.035
2.721LysIle: 2.721 ± 0.056
3.939LysLys: 3.939 ± 0.08
4.207LysLeu: 4.207 ± 0.07
1.568LysMet: 1.568 ± 0.037
1.885LysAsn: 1.885 ± 0.051
1.975LysPro: 1.975 ± 0.049
2.54LysGln: 2.54 ± 0.051
3.263LysArg: 3.263 ± 0.067
2.708LysSer: 2.708 ± 0.05
3.015LysThr: 3.015 ± 0.062
3.282LysVal: 3.282 ± 0.071
0.577LysTrp: 0.577 ± 0.023
1.223LysTyr: 1.223 ± 0.044
0.0LysXaa: 0.0 ± 0.0
Leu
7.205LeuAla: 7.205 ± 0.089
0.429LeuCys: 0.429 ± 0.024
5.67LeuAsp: 5.67 ± 0.081
6.323LeuGlu: 6.323 ± 0.094
4.699LeuPhe: 4.699 ± 0.098
5.875LeuGly: 5.875 ± 0.086
2.082LeuHis: 2.082 ± 0.058
6.855LeuIle: 6.855 ± 0.104
5.25LeuLys: 5.25 ± 0.08
8.848LeuLeu: 8.848 ± 0.132
2.842LeuMet: 2.842 ± 0.052
3.846LeuAsn: 3.846 ± 0.053
3.778LeuPro: 3.778 ± 0.064
3.145LeuGln: 3.145 ± 0.052
4.011LeuArg: 4.011 ± 0.067
6.779LeuSer: 6.779 ± 0.093
5.994LeuThr: 5.994 ± 0.077
5.868LeuVal: 5.868 ± 0.101
0.747LeuTrp: 0.747 ± 0.03
3.032LeuTyr: 3.032 ± 0.056
0.0LeuXaa: 0.0 ± 0.0
Met
2.638MetAla: 2.638 ± 0.048
0.117MetCys: 0.117 ± 0.01
1.863MetAsp: 1.863 ± 0.046
2.027MetGlu: 2.027 ± 0.047
1.096MetPhe: 1.096 ± 0.034
1.847MetGly: 1.847 ± 0.043
0.582MetHis: 0.582 ± 0.026
2.539MetIle: 2.539 ± 0.057
2.432MetLys: 2.432 ± 0.053
2.536MetLeu: 2.536 ± 0.058
1.256MetMet: 1.256 ± 0.04
1.776MetAsn: 1.776 ± 0.04
1.158MetPro: 1.158 ± 0.033
1.081MetGln: 1.081 ± 0.035
1.313MetArg: 1.313 ± 0.039
1.802MetSer: 1.802 ± 0.045
2.429MetThr: 2.429 ± 0.048
1.87MetVal: 1.87 ± 0.051
0.22MetTrp: 0.22 ± 0.016
0.774MetTyr: 0.774 ± 0.029
0.0MetXaa: 0.0 ± 0.0
Asn
2.72AsnAla: 2.72 ± 0.056
0.181AsnCys: 0.181 ± 0.014
2.305AsnAsp: 2.305 ± 0.052
3.264AsnGlu: 3.264 ± 0.066
1.132AsnPhe: 1.132 ± 0.033
3.031AsnGly: 3.031 ± 0.069
0.998AsnHis: 0.998 ± 0.034
2.269AsnIle: 2.269 ± 0.049
1.621AsnLys: 1.621 ± 0.043
3.093AsnLeu: 3.093 ± 0.064
1.029AsnMet: 1.029 ± 0.037
1.352AsnAsn: 1.352 ± 0.047
1.9AsnPro: 1.9 ± 0.042
1.739AsnGln: 1.739 ± 0.045
2.465AsnArg: 2.465 ± 0.048
1.666AsnSer: 1.666 ± 0.045
1.811AsnThr: 1.811 ± 0.05
2.556AsnVal: 2.556 ± 0.053
0.433AsnTrp: 0.433 ± 0.022
1.047AsnTyr: 1.047 ± 0.035
0.0AsnXaa: 0.0 ± 0.0
Pro
2.838ProAla: 2.838 ± 0.054
0.234ProCys: 0.234 ± 0.017
3.055ProAsp: 3.055 ± 0.06
3.715ProGlu: 3.715 ± 0.08
2.019ProPhe: 2.019 ± 0.045
2.842ProGly: 2.842 ± 0.067
0.869ProHis: 0.869 ± 0.03
2.061ProIle: 2.061 ± 0.044
1.709ProLys: 1.709 ± 0.052
3.497ProLeu: 3.497 ± 0.06
0.964ProMet: 0.964 ± 0.031
1.286ProAsn: 1.286 ± 0.038
1.108ProPro: 1.108 ± 0.031
1.017ProGln: 1.017 ± 0.033
1.213ProArg: 1.213 ± 0.036
2.154ProSer: 2.154 ± 0.043
1.572ProThr: 1.572 ± 0.035
3.825ProVal: 3.825 ± 0.063
0.428ProTrp: 0.428 ± 0.021
1.312ProTyr: 1.312 ± 0.04
0.0ProXaa: 0.0 ± 0.0
Gln
3.205GlnAla: 3.205 ± 0.059
0.159GlnCys: 0.159 ± 0.013
2.068GlnAsp: 2.068 ± 0.049
3.174GlnGlu: 3.174 ± 0.067
1.533GlnPhe: 1.533 ± 0.041
2.475GlnGly: 2.475 ± 0.05
0.759GlnHis: 0.759 ± 0.026
2.344GlnIle: 2.344 ± 0.048
2.057GlnLys: 2.057 ± 0.047
3.585GlnLeu: 3.585 ± 0.067
1.291GlnMet: 1.291 ± 0.034
1.343GlnAsn: 1.343 ± 0.036
1.249GlnPro: 1.249 ± 0.038
1.401GlnGln: 1.401 ± 0.053
1.739GlnArg: 1.739 ± 0.041
2.22GlnSer: 2.22 ± 0.049
2.284GlnThr: 2.284 ± 0.054
2.7GlnVal: 2.7 ± 0.061
0.402GlnTrp: 0.402 ± 0.02
1.109GlnTyr: 1.109 ± 0.028
0.0GlnXaa: 0.0 ± 0.0
Arg
3.202ArgAla: 3.202 ± 0.069
0.273ArgCys: 0.273 ± 0.015
2.986ArgAsp: 2.986 ± 0.057
3.871ArgGlu: 3.871 ± 0.076
2.46ArgPhe: 2.46 ± 0.059
2.871ArgGly: 2.871 ± 0.059
1.248ArgHis: 1.248 ± 0.04
3.692ArgIle: 3.692 ± 0.051
3.061ArgLys: 3.061 ± 0.06
4.764ArgLeu: 4.764 ± 0.074
1.76ArgMet: 1.76 ± 0.043
1.987ArgAsn: 1.987 ± 0.047
1.623ArgPro: 1.623 ± 0.039
2.201ArgGln: 2.201 ± 0.047
2.582ArgArg: 2.582 ± 0.06
2.905ArgSer: 2.905 ± 0.051
2.665ArgThr: 2.665 ± 0.044
3.211ArgVal: 3.211 ± 0.059
0.551ArgTrp: 0.551 ± 0.025
1.82ArgTyr: 1.82 ± 0.039
0.0ArgXaa: 0.0 ± 0.0
Ser
4.225SerAla: 4.225 ± 0.07
0.41SerCys: 0.41 ± 0.021
3.581SerAsp: 3.581 ± 0.065
4.35SerGlu: 4.35 ± 0.077
3.254SerPhe: 3.254 ± 0.071
5.092SerGly: 5.092 ± 0.074
1.301SerHis: 1.301 ± 0.035
3.921SerIle: 3.921 ± 0.074
2.676SerLys: 2.676 ± 0.059
5.765SerLeu: 5.765 ± 0.097
1.941SerMet: 1.941 ± 0.037
1.873SerAsn: 1.873 ± 0.045
2.148SerPro: 2.148 ± 0.05
1.886SerGln: 1.886 ± 0.041
2.823SerArg: 2.823 ± 0.057
3.656SerSer: 3.656 ± 0.069
2.818SerThr: 2.818 ± 0.06
4.611SerVal: 4.611 ± 0.071
0.672SerTrp: 0.672 ± 0.029
2.004SerTyr: 2.004 ± 0.053
0.0SerXaa: 0.0 ± 0.0
Thr
4.705ThrAla: 4.705 ± 0.074
0.358ThrCys: 0.358 ± 0.018
3.697ThrAsp: 3.697 ± 0.069
4.08ThrGlu: 4.08 ± 0.067
2.633ThrPhe: 2.633 ± 0.049
5.247ThrGly: 5.247 ± 0.076
1.125ThrHis: 1.125 ± 0.033
4.023ThrIle: 4.023 ± 0.07
2.52ThrLys: 2.52 ± 0.053
5.329ThrLeu: 5.329 ± 0.068
1.659ThrMet: 1.659 ± 0.047
1.965ThrAsn: 1.965 ± 0.052
2.262ThrPro: 2.262 ± 0.05
1.308ThrGln: 1.308 ± 0.04
2.301ThrArg: 2.301 ± 0.05
3.096ThrSer: 3.096 ± 0.063
2.709ThrThr: 2.709 ± 0.051
5.107ThrVal: 5.107 ± 0.08
0.586ThrTrp: 0.586 ± 0.025
2.009ThrTyr: 2.009 ± 0.054
0.0ThrXaa: 0.0 ± 0.0
Val
5.063ValAla: 5.063 ± 0.087
0.531ValCys: 0.531 ± 0.024
4.227ValAsp: 4.227 ± 0.067
4.967ValGlu: 4.967 ± 0.08
3.353ValPhe: 3.353 ± 0.063
4.109ValGly: 4.109 ± 0.072
1.73ValHis: 1.73 ± 0.042
5.593ValIle: 5.593 ± 0.085
3.881ValLys: 3.881 ± 0.071
6.76ValLeu: 6.76 ± 0.096
2.46ValMet: 2.46 ± 0.05
3.123ValAsn: 3.123 ± 0.061
2.891ValPro: 2.891 ± 0.054
2.43ValGln: 2.43 ± 0.055
3.374ValArg: 3.374 ± 0.056
5.001ValSer: 5.001 ± 0.076
5.083ValThr: 5.083 ± 0.088
4.862ValVal: 4.862 ± 0.087
0.688ValTrp: 0.688 ± 0.031
2.425ValTyr: 2.425 ± 0.04
0.0ValXaa: 0.0 ± 0.0
Trp
0.608TrpAla: 0.608 ± 0.027
0.058TrpCys: 0.058 ± 0.008
0.53TrpAsp: 0.53 ± 0.025
0.638TrpGlu: 0.638 ± 0.027
0.546TrpPhe: 0.546 ± 0.023
0.646TrpGly: 0.646 ± 0.028
0.265TrpHis: 0.265 ± 0.018
0.769TrpIle: 0.769 ± 0.032
0.519TrpLys: 0.519 ± 0.028
1.231TrpLeu: 1.231 ± 0.041
0.403TrpMet: 0.403 ± 0.019
0.452TrpAsn: 0.452 ± 0.023
0.312TrpPro: 0.312 ± 0.02
0.458TrpGln: 0.458 ± 0.021
0.505TrpArg: 0.505 ± 0.023
0.646TrpSer: 0.646 ± 0.026
0.625TrpThr: 0.625 ± 0.028
0.626TrpVal: 0.626 ± 0.027
0.131TrpTrp: 0.131 ± 0.011
0.364TrpTyr: 0.364 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.239TyrAla: 2.239 ± 0.049
0.211TyrCys: 0.211 ± 0.014
2.146TyrAsp: 2.146 ± 0.056
2.664TyrGlu: 2.664 ± 0.057
1.587TyrPhe: 1.587 ± 0.044
2.327TyrGly: 2.327 ± 0.045
0.858TyrHis: 0.858 ± 0.028
1.817TyrIle: 1.817 ± 0.045
1.316TyrLys: 1.316 ± 0.041
3.142TyrLeu: 3.142 ± 0.054
0.86TyrMet: 0.86 ± 0.03
1.127TyrAsn: 1.127 ± 0.039
1.356TyrPro: 1.356 ± 0.04
1.38TyrGln: 1.38 ± 0.038
1.804TyrArg: 1.804 ± 0.044
1.747TyrSer: 1.747 ± 0.047
1.797TyrThr: 1.797 ± 0.041
2.173TyrVal: 2.173 ± 0.048
0.35TyrTrp: 0.35 ± 0.019
1.219TyrTyr: 1.219 ± 0.038
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3231 proteins (1028036 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski