Amino acid dipepetide frequency for Azospira oryzae (strain ATCC BAA-33 / DSM 13638 / PS) (Dechlorosoma suillum)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.893AlaAla: 17.893 ± 0.198
1.281AlaCys: 1.281 ± 0.04
6.428AlaAsp: 6.428 ± 0.076
7.968AlaGlu: 7.968 ± 0.098
3.807AlaPhe: 3.807 ± 0.065
10.482AlaGly: 10.482 ± 0.118
2.333AlaHis: 2.333 ± 0.047
5.227AlaIle: 5.227 ± 0.078
4.447AlaLys: 4.447 ± 0.1
14.308AlaLeu: 14.308 ± 0.156
3.132AlaMet: 3.132 ± 0.05
3.013AlaAsn: 3.013 ± 0.06
5.819AlaPro: 5.819 ± 0.085
4.75AlaGln: 4.75 ± 0.083
8.348AlaArg: 8.348 ± 0.11
5.683AlaSer: 5.683 ± 0.084
5.266AlaThr: 5.266 ± 0.084
8.53AlaVal: 8.53 ± 0.101
1.726AlaTrp: 1.726 ± 0.052
2.65AlaTyr: 2.65 ± 0.05
0.0AlaXaa: 0.0 ± 0.0
Cys
1.046CysAla: 1.046 ± 0.032
0.154CysCys: 0.154 ± 0.011
0.525CysAsp: 0.525 ± 0.024
0.416CysGlu: 0.416 ± 0.02
0.33CysPhe: 0.33 ± 0.019
1.025CysGly: 1.025 ± 0.036
0.356CysHis: 0.356 ± 0.02
0.442CysIle: 0.442 ± 0.022
0.272CysLys: 0.272 ± 0.019
1.063CysLeu: 1.063 ± 0.032
0.176CysMet: 0.176 ± 0.014
0.286CysAsn: 0.286 ± 0.017
0.626CysPro: 0.626 ± 0.026
0.352CysGln: 0.352 ± 0.02
0.738CysArg: 0.738 ± 0.029
0.518CysSer: 0.518 ± 0.024
0.445CysThr: 0.445 ± 0.018
0.574CysVal: 0.574 ± 0.024
0.12CysTrp: 0.12 ± 0.009
0.238CysTyr: 0.238 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
5.595AspAla: 5.595 ± 0.073
0.507AspCys: 0.507 ± 0.024
2.615AspAsp: 2.615 ± 0.053
3.228AspGlu: 3.228 ± 0.054
2.348AspPhe: 2.348 ± 0.052
4.684AspGly: 4.684 ± 0.072
1.02AspHis: 1.02 ± 0.031
2.626AspIle: 2.626 ± 0.049
1.994AspLys: 1.994 ± 0.045
5.797AspLeu: 5.797 ± 0.075
1.146AspMet: 1.146 ± 0.034
1.413AspAsn: 1.413 ± 0.041
2.754AspPro: 2.754 ± 0.052
1.807AspGln: 1.807 ± 0.043
3.025AspArg: 3.025 ± 0.057
2.542AspSer: 2.542 ± 0.048
2.418AspThr: 2.418 ± 0.059
3.374AspVal: 3.374 ± 0.057
0.884AspTrp: 0.884 ± 0.029
1.543AspTyr: 1.543 ± 0.041
0.0AspXaa: 0.0 ± 0.0
Glu
8.095GluAla: 8.095 ± 0.114
0.455GluCys: 0.455 ± 0.021
2.569GluAsp: 2.569 ± 0.05
3.823GluGlu: 3.823 ± 0.08
1.996GluPhe: 1.996 ± 0.041
4.264GluGly: 4.264 ± 0.065
1.339GluHis: 1.339 ± 0.039
3.137GluIle: 3.137 ± 0.063
2.865GluLys: 2.865 ± 0.055
6.478GluLeu: 6.478 ± 0.096
1.559GluMet: 1.559 ± 0.037
1.717GluAsn: 1.717 ± 0.039
2.408GluPro: 2.408 ± 0.052
2.936GluGln: 2.936 ± 0.063
4.951GluArg: 4.951 ± 0.079
2.933GluSer: 2.933 ± 0.051
2.946GluThr: 2.946 ± 0.054
4.612GluVal: 4.612 ± 0.077
0.847GluTrp: 0.847 ± 0.027
1.246GluTyr: 1.246 ± 0.033
0.0GluXaa: 0.0 ± 0.0
Phe
4.211PheAla: 4.211 ± 0.063
0.417PheCys: 0.417 ± 0.019
2.282PheAsp: 2.282 ± 0.048
1.925PheGlu: 1.925 ± 0.048
1.481PhePhe: 1.481 ± 0.046
3.02PheGly: 3.02 ± 0.059
0.845PheHis: 0.845 ± 0.028
1.628PheIle: 1.628 ± 0.034
1.233PheLys: 1.233 ± 0.034
3.578PheLeu: 3.578 ± 0.058
0.727PheMet: 0.727 ± 0.024
1.197PheAsn: 1.197 ± 0.031
1.639PhePro: 1.639 ± 0.037
1.179PheGln: 1.179 ± 0.031
2.069PheArg: 2.069 ± 0.041
2.269PheSer: 2.269 ± 0.051
1.743PheThr: 1.743 ± 0.044
2.529PheVal: 2.529 ± 0.056
0.506PheTrp: 0.506 ± 0.023
0.951PheTyr: 0.951 ± 0.033
0.0PheXaa: 0.0 ± 0.0
Gly
8.344GlyAla: 8.344 ± 0.109
1.013GlyCys: 1.013 ± 0.034
4.135GlyAsp: 4.135 ± 0.065
5.228GlyGlu: 5.228 ± 0.072
3.139GlyPhe: 3.139 ± 0.06
6.949GlyGly: 6.949 ± 0.114
1.961GlyHis: 1.961 ± 0.047
4.271GlyIle: 4.271 ± 0.068
3.939GlyLys: 3.939 ± 0.067
9.162GlyLeu: 9.162 ± 0.103
2.167GlyMet: 2.167 ± 0.047
2.43GlyAsn: 2.43 ± 0.082
2.82GlyPro: 2.82 ± 0.049
3.404GlyGln: 3.404 ± 0.064
5.56GlyArg: 5.56 ± 0.067
4.464GlySer: 4.464 ± 0.081
3.847GlyThr: 3.847 ± 0.086
5.868GlyVal: 5.868 ± 0.081
1.261GlyTrp: 1.261 ± 0.04
2.332GlyTyr: 2.332 ± 0.048
0.0GlyXaa: 0.0 ± 0.0
His
2.258HisAla: 2.258 ± 0.047
0.308HisCys: 0.308 ± 0.017
1.093HisAsp: 1.093 ± 0.033
1.138HisGlu: 1.138 ± 0.032
0.976HisPhe: 0.976 ± 0.031
2.022HisGly: 2.022 ± 0.049
0.665HisHis: 0.665 ± 0.028
0.982HisIle: 0.982 ± 0.03
0.654HisLys: 0.654 ± 0.023
2.636HisLeu: 2.636 ± 0.049
0.436HisMet: 0.436 ± 0.019
0.583HisAsn: 0.583 ± 0.022
1.526HisPro: 1.526 ± 0.042
0.834HisGln: 0.834 ± 0.028
1.444HisArg: 1.444 ± 0.039
1.081HisSer: 1.081 ± 0.034
0.968HisThr: 0.968 ± 0.028
1.289HisVal: 1.289 ± 0.032
0.354HisTrp: 0.354 ± 0.019
0.656HisTyr: 0.656 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
5.712IleAla: 5.712 ± 0.075
0.402IleCys: 0.402 ± 0.017
2.706IleAsp: 2.706 ± 0.05
2.915IleGlu: 2.915 ± 0.054
1.456IlePhe: 1.456 ± 0.04
4.084IleGly: 4.084 ± 0.066
0.982IleHis: 0.982 ± 0.031
1.844IleIle: 1.844 ± 0.046
1.726IleLys: 1.726 ± 0.043
4.279IleLeu: 4.279 ± 0.068
0.778IleMet: 0.778 ± 0.027
1.561IleAsn: 1.561 ± 0.035
2.299IlePro: 2.299 ± 0.044
1.499IleGln: 1.499 ± 0.039
2.895IleArg: 2.895 ± 0.054
2.666IleSer: 2.666 ± 0.051
2.384IleThr: 2.384 ± 0.051
3.306IleVal: 3.306 ± 0.062
0.423IleTrp: 0.423 ± 0.02
1.082IleTyr: 1.082 ± 0.028
0.0IleXaa: 0.0 ± 0.0
Lys
4.557LysAla: 4.557 ± 0.086
0.206LysCys: 0.206 ± 0.015
1.918LysAsp: 1.918 ± 0.06
2.319LysGlu: 2.319 ± 0.061
1.053LysPhe: 1.053 ± 0.034
2.858LysGly: 2.858 ± 0.059
0.771LysHis: 0.771 ± 0.028
1.738LysIle: 1.738 ± 0.042
1.849LysLys: 1.849 ± 0.056
3.836LysLeu: 3.836 ± 0.059
0.91LysMet: 0.91 ± 0.034
1.123LysAsn: 1.123 ± 0.032
2.254LysPro: 2.254 ± 0.061
1.478LysGln: 1.478 ± 0.041
2.3LysArg: 2.3 ± 0.042
1.885LysSer: 1.885 ± 0.048
1.95LysThr: 1.95 ± 0.047
2.969LysVal: 2.969 ± 0.058
0.375LysTrp: 0.375 ± 0.018
0.791LysTyr: 0.791 ± 0.029
0.0LysXaa: 0.0 ± 0.0
Leu
16.195LeuAla: 16.195 ± 0.166
1.074LeuCys: 1.074 ± 0.033
5.843LeuAsp: 5.843 ± 0.081
7.03LeuGlu: 7.03 ± 0.097
3.838LeuPhe: 3.838 ± 0.062
9.069LeuGly: 9.069 ± 0.098
2.344LeuHis: 2.344 ± 0.045
4.475LeuIle: 4.475 ± 0.061
4.308LeuLys: 4.308 ± 0.069
13.313LeuLeu: 13.313 ± 0.165
2.279LeuMet: 2.279 ± 0.049
3.007LeuAsn: 3.007 ± 0.061
6.595LeuPro: 6.595 ± 0.088
4.563LeuGln: 4.563 ± 0.078
7.165LeuArg: 7.165 ± 0.087
6.334LeuSer: 6.334 ± 0.081
5.575LeuThr: 5.575 ± 0.092
7.754LeuVal: 7.754 ± 0.103
1.361LeuTrp: 1.361 ± 0.043
2.255LeuTyr: 2.255 ± 0.053
0.0LeuXaa: 0.0 ± 0.0
Met
2.903MetAla: 2.903 ± 0.059
0.129MetCys: 0.129 ± 0.011
1.139MetAsp: 1.139 ± 0.03
1.322MetGlu: 1.322 ± 0.031
0.633MetPhe: 0.633 ± 0.024
1.763MetGly: 1.763 ± 0.043
0.467MetHis: 0.467 ± 0.018
0.881MetIle: 0.881 ± 0.029
1.011MetLys: 1.011 ± 0.031
2.302MetLeu: 2.302 ± 0.049
0.508MetMet: 0.508 ± 0.024
0.788MetAsn: 0.788 ± 0.027
1.361MetPro: 1.361 ± 0.037
0.908MetGln: 0.908 ± 0.026
1.356MetArg: 1.356 ± 0.036
1.515MetSer: 1.515 ± 0.033
1.281MetThr: 1.281 ± 0.038
1.589MetVal: 1.589 ± 0.038
0.177MetTrp: 0.177 ± 0.013
0.334MetTyr: 0.334 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.103AsnAla: 3.103 ± 0.065
0.3AsnCys: 0.3 ± 0.016
1.401AsnAsp: 1.401 ± 0.051
1.373AsnGlu: 1.373 ± 0.035
0.959AsnPhe: 0.959 ± 0.031
2.39AsnGly: 2.39 ± 0.051
0.557AsnHis: 0.557 ± 0.024
1.314AsnIle: 1.314 ± 0.036
0.918AsnLys: 0.918 ± 0.031
3.334AsnLeu: 3.334 ± 0.063
0.567AsnMet: 0.567 ± 0.023
0.829AsnAsn: 0.829 ± 0.028
1.986AsnPro: 1.986 ± 0.039
1.071AsnGln: 1.071 ± 0.029
1.987AsnArg: 1.987 ± 0.048
1.313AsnSer: 1.313 ± 0.041
1.363AsnThr: 1.363 ± 0.046
1.886AsnVal: 1.886 ± 0.043
0.364AsnTrp: 0.364 ± 0.02
0.739AsnTyr: 0.739 ± 0.028
0.0AsnXaa: 0.0 ± 0.0
Pro
7.058ProAla: 7.058 ± 0.099
0.396ProCys: 0.396 ± 0.018
2.928ProAsp: 2.928 ± 0.053
4.022ProGlu: 4.022 ± 0.066
1.769ProPhe: 1.769 ± 0.042
4.545ProGly: 4.545 ± 0.076
1.075ProHis: 1.075 ± 0.032
1.951ProIle: 1.951 ± 0.044
1.644ProLys: 1.644 ± 0.045
5.69ProLeu: 5.69 ± 0.087
1.116ProMet: 1.116 ± 0.034
1.248ProAsn: 1.248 ± 0.031
2.748ProPro: 2.748 ± 0.066
1.957ProGln: 1.957 ± 0.041
3.071ProArg: 3.071 ± 0.068
2.374ProSer: 2.374 ± 0.051
2.215ProThr: 2.215 ± 0.049
4.085ProVal: 4.085 ± 0.064
0.767ProTrp: 0.767 ± 0.032
1.19ProTyr: 1.19 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
5.709GlnAla: 5.709 ± 0.086
0.293GlnCys: 0.293 ± 0.019
1.75GlnAsp: 1.75 ± 0.039
2.403GlnGlu: 2.403 ± 0.055
1.205GlnPhe: 1.205 ± 0.03
3.254GlnGly: 3.254 ± 0.057
0.876GlnHis: 0.876 ± 0.03
1.745GlnIle: 1.745 ± 0.048
1.38GlnLys: 1.38 ± 0.038
4.305GlnLeu: 4.305 ± 0.066
0.963GlnMet: 0.963 ± 0.031
0.962GlnAsn: 0.962 ± 0.031
2.174GlnPro: 2.174 ± 0.044
2.058GlnGln: 2.058 ± 0.046
3.203GlnArg: 3.203 ± 0.066
2.021GlnSer: 2.021 ± 0.048
1.769GlnThr: 1.769 ± 0.042
3.276GlnVal: 3.276 ± 0.07
0.552GlnTrp: 0.552 ± 0.025
0.776GlnTyr: 0.776 ± 0.025
0.0GlnXaa: 0.0 ± 0.0
Arg
6.328ArgAla: 6.328 ± 0.092
0.655ArgCys: 0.655 ± 0.028
3.451ArgAsp: 3.451 ± 0.055
4.587ArgGlu: 4.587 ± 0.067
2.777ArgPhe: 2.777 ± 0.054
4.288ArgGly: 4.288 ± 0.067
1.991ArgHis: 1.991 ± 0.045
3.542ArgIle: 3.542 ± 0.056
2.131ArgLys: 2.131 ± 0.048
8.895ArgLeu: 8.895 ± 0.097
1.484ArgMet: 1.484 ± 0.038
1.969ArgAsn: 1.969 ± 0.047
3.238ArgPro: 3.238 ± 0.063
3.872ArgGln: 3.872 ± 0.074
5.323ArgArg: 5.323 ± 0.082
3.308ArgSer: 3.308 ± 0.058
2.639ArgThr: 2.639 ± 0.048
4.431ArgVal: 4.431 ± 0.065
0.996ArgTrp: 0.996 ± 0.029
1.897ArgTyr: 1.897 ± 0.039
0.0ArgXaa: 0.0 ± 0.0
Ser
5.71SerAla: 5.71 ± 0.088
0.52SerCys: 0.52 ± 0.021
2.499SerAsp: 2.499 ± 0.047
2.656SerGlu: 2.656 ± 0.054
1.911SerPhe: 1.911 ± 0.049
5.097SerGly: 5.097 ± 0.085
1.229SerHis: 1.229 ± 0.033
2.441SerIle: 2.441 ± 0.047
1.608SerLys: 1.608 ± 0.043
6.529SerLeu: 6.529 ± 0.09
1.133SerMet: 1.133 ± 0.038
1.38SerAsn: 1.38 ± 0.039
2.84SerPro: 2.84 ± 0.05
1.947SerGln: 1.947 ± 0.047
3.906SerArg: 3.906 ± 0.067
2.938SerSer: 2.938 ± 0.066
2.411SerThr: 2.411 ± 0.06
3.599SerVal: 3.599 ± 0.063
0.668SerTrp: 0.668 ± 0.025
1.209SerTyr: 1.209 ± 0.033
0.0SerXaa: 0.0 ± 0.0
Thr
5.498ThrAla: 5.498 ± 0.084
0.438ThrCys: 0.438 ± 0.02
2.254ThrAsp: 2.254 ± 0.049
2.29ThrGlu: 2.29 ± 0.048
1.691ThrPhe: 1.691 ± 0.038
4.083ThrGly: 4.083 ± 0.093
0.949ThrHis: 0.949 ± 0.029
2.001ThrIle: 2.001 ± 0.043
1.33ThrLys: 1.33 ± 0.038
6.117ThrLeu: 6.117 ± 0.093
0.899ThrMet: 0.899 ± 0.034
1.159ThrAsn: 1.159 ± 0.034
3.314ThrPro: 3.314 ± 0.05
1.743ThrGln: 1.743 ± 0.037
2.962ThrArg: 2.962 ± 0.05
2.272ThrSer: 2.272 ± 0.063
2.376ThrThr: 2.376 ± 0.08
3.883ThrVal: 3.883 ± 0.079
0.59ThrTrp: 0.59 ± 0.022
1.131ThrTyr: 1.131 ± 0.039
0.0ThrXaa: 0.0 ± 0.0
Val
9.301ValAla: 9.301 ± 0.098
0.737ValCys: 0.737 ± 0.025
3.889ValAsp: 3.889 ± 0.06
4.644ValGlu: 4.644 ± 0.068
2.583ValPhe: 2.583 ± 0.047
5.452ValGly: 5.452 ± 0.081
1.333ValHis: 1.333 ± 0.035
3.31ValIle: 3.31 ± 0.058
2.576ValLys: 2.576 ± 0.057
7.962ValLeu: 7.962 ± 0.101
1.671ValMet: 1.671 ± 0.043
2.015ValAsn: 2.015 ± 0.044
3.571ValPro: 3.571 ± 0.056
2.483ValGln: 2.483 ± 0.047
4.486ValArg: 4.486 ± 0.05
4.15ValSer: 4.15 ± 0.059
3.754ValThr: 3.754 ± 0.063
5.969ValVal: 5.969 ± 0.083
0.842ValTrp: 0.842 ± 0.029
1.408ValTyr: 1.408 ± 0.03
0.0ValXaa: 0.0 ± 0.0
Trp
1.117TrpAla: 1.117 ± 0.036
0.148TrpCys: 0.148 ± 0.011
0.594TrpAsp: 0.594 ± 0.025
0.698TrpGlu: 0.698 ± 0.027
0.506TrpPhe: 0.506 ± 0.02
0.926TrpGly: 0.926 ± 0.032
0.318TrpHis: 0.318 ± 0.019
0.594TrpIle: 0.594 ± 0.025
0.471TrpLys: 0.471 ± 0.022
2.166TrpLeu: 2.166 ± 0.052
0.299TrpMet: 0.299 ± 0.016
0.407TrpAsn: 0.407 ± 0.021
0.589TrpPro: 0.589 ± 0.024
0.844TrpGln: 0.844 ± 0.033
1.115TrpArg: 1.115 ± 0.037
0.679TrpSer: 0.679 ± 0.023
0.494TrpThr: 0.494 ± 0.021
0.965TrpVal: 0.965 ± 0.034
0.241TrpTrp: 0.241 ± 0.015
0.259TrpTyr: 0.259 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.461TyrAla: 2.461 ± 0.047
0.271TyrCys: 0.271 ± 0.015
1.3TyrAsp: 1.3 ± 0.037
1.19TyrGlu: 1.19 ± 0.033
0.966TyrPhe: 0.966 ± 0.033
2.042TyrGly: 2.042 ± 0.046
0.492TyrHis: 0.492 ± 0.019
0.85TyrIle: 0.85 ± 0.028
0.784TyrLys: 0.784 ± 0.029
2.631TyrLeu: 2.631 ± 0.053
0.416TyrMet: 0.416 ± 0.02
0.669TyrAsn: 0.669 ± 0.025
1.288TyrPro: 1.288 ± 0.036
0.987TyrGln: 0.987 ± 0.029
1.899TyrArg: 1.899 ± 0.043
1.284TyrSer: 1.284 ± 0.037
1.136TyrThr: 1.136 ± 0.046
1.649TyrVal: 1.649 ± 0.045
0.373TyrTrp: 0.373 ± 0.018
0.666TyrTyr: 0.666 ± 0.028
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3432 proteins (1138217 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski