Amino acid dipeptide frequency for Mycobacterium sp. 1423905.2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
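
As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping residue pairs within each protein and reports them in per mille. It assumes one-letter codes (the table uses three-letter codes), maps any non-standard residue to X (Xaa), and never counts a pair that spans two proteins. The function name and the toy sequences are illustrative and are not taken from Proteome-pI.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Normalise case and replace non-standard residues with X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input, not taken from the proteome.
freqs = dipeptide_permille(["MAAG", "AAL"])
```

Here the toy input yields five pairs (MA, AA, AG, AA, AL), so AA accounts for two of the five.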

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
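
The uniform expectation and the unit conversion can be illustrated with a short sketch. It assumes that the red/blue marking simply compares each value with the uniform expectation of 1/400 (2.5‰); the exact rule used by the site is not stated here, so the `classify` helper is hypothetical. The three example values are copied from the table.

```python
N_STANDARD = 20
# Uniform expectation over 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / N_STANDARD**2

def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a value relative to the assumed uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"

# Values copied from the table (per mille).
examples = {"AlaAla": 20.995, "CysCys": 0.094, "GluGlu": 2.538}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this assumption AlaAla and GluGlu fall above the 2.5‰ threshold and CysCys falls below it.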

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
20.995AlaAla: 20.995 ± 0.258
0.973AlaCys: 0.973 ± 0.027
7.953AlaAsp: 7.953 ± 0.08
7.489AlaGlu: 7.489 ± 0.08
3.43AlaPhe: 3.43 ± 0.049
12.52AlaGly: 12.52 ± 0.315
2.619AlaHis: 2.619 ± 0.042
5.118AlaIle: 5.118 ± 0.058
3.049AlaLys: 3.049 ± 0.055
12.901AlaLeu: 12.901 ± 0.138
2.839AlaMet: 2.839 ± 0.046
2.919AlaAsn: 2.919 ± 0.067
6.501AlaPro: 6.501 ± 0.1
4.553AlaGln: 4.553 ± 0.062
8.465AlaArg: 8.465 ± 0.105
6.079AlaSer: 6.079 ± 0.081
7.228AlaThr: 7.228 ± 0.078
11.449AlaVal: 11.449 ± 0.106
1.583AlaTrp: 1.583 ± 0.037
2.396AlaTyr: 2.396 ± 0.038
0.0AlaXaa: 0.0 ± 0.0
Cys
1.086CysAla: 1.086 ± 0.028
0.094CysCys: 0.094 ± 0.01
0.571CysAsp: 0.571 ± 0.022
0.392CysGlu: 0.392 ± 0.016
0.217CysPhe: 0.217 ± 0.013
0.948CysGly: 0.948 ± 0.025
0.199CysHis: 0.199 ± 0.011
0.274CysIle: 0.274 ± 0.014
0.128CysLys: 0.128 ± 0.01
0.657CysLeu: 0.657 ± 0.021
0.13CysMet: 0.13 ± 0.009
0.186CysAsn: 0.186 ± 0.011
0.52CysPro: 0.52 ± 0.021
0.239CysGln: 0.239 ± 0.011
0.574CysArg: 0.574 ± 0.018
0.487CysSer: 0.487 ± 0.019
0.465CysThr: 0.465 ± 0.017
0.658CysVal: 0.658 ± 0.021
0.147CysTrp: 0.147 ± 0.009
0.205CysTyr: 0.205 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
7.398AspAla: 7.398 ± 0.086
0.466AspCys: 0.466 ± 0.017
4.082AspAsp: 4.082 ± 0.064
3.834AspGlu: 3.834 ± 0.053
1.831AspPhe: 1.831 ± 0.038
5.684AspGly: 5.684 ± 0.084
1.415AspHis: 1.415 ± 0.029
2.658AspIle: 2.658 ± 0.044
1.302AspLys: 1.302 ± 0.035
5.93AspLeu: 5.93 ± 0.065
0.981AspMet: 0.981 ± 0.026
1.293AspAsn: 1.293 ± 0.033
4.195AspPro: 4.195 ± 0.058
1.871AspGln: 1.871 ± 0.041
4.372AspArg: 4.372 ± 0.056
2.832AspSer: 2.832 ± 0.05
2.991AspThr: 2.991 ± 0.046
5.183AspVal: 5.183 ± 0.061
0.991AspTrp: 0.991 ± 0.024
1.459AspTyr: 1.459 ± 0.032
0.0AspXaa: 0.0 ± 0.0
Glu
5.912GluAla: 5.912 ± 0.071
0.359GluCys: 0.359 ± 0.014
2.401GluAsp: 2.401 ± 0.047
2.538GluGlu: 2.538 ± 0.046
1.757GluPhe: 1.757 ± 0.036
3.155GluGly: 3.155 ± 0.051
1.486GluHis: 1.486 ± 0.031
2.418GluIle: 2.418 ± 0.046
1.268GluLys: 1.268 ± 0.028
6.353GluLeu: 6.353 ± 0.085
1.015GluMet: 1.015 ± 0.024
1.055GluAsn: 1.055 ± 0.026
2.94GluPro: 2.94 ± 0.063
2.268GluGln: 2.268 ± 0.039
4.257GluArg: 4.257 ± 0.064
2.572GluSer: 2.572 ± 0.045
2.457GluThr: 2.457 ± 0.041
4.391GluVal: 4.391 ± 0.049
0.71GluTrp: 0.71 ± 0.022
1.11GluTyr: 1.11 ± 0.025
0.0GluXaa: 0.0 ± 0.0
Phe
3.915PheAla: 3.915 ± 0.06
0.329PheCys: 0.329 ± 0.016
2.358PheAsp: 2.358 ± 0.049
1.526PheGlu: 1.526 ± 0.035
1.016PhePhe: 1.016 ± 0.023
3.593PheGly: 3.593 ± 0.062
0.669PheHis: 0.669 ± 0.022
1.089PheIle: 1.089 ± 0.026
0.552PheLys: 0.552 ± 0.02
2.504PheLeu: 2.504 ± 0.045
0.496PheMet: 0.496 ± 0.02
0.817PheAsn: 0.817 ± 0.032
1.502PhePro: 1.502 ± 0.04
0.765PheGln: 0.765 ± 0.023
1.707PheArg: 1.707 ± 0.037
1.76PheSer: 1.76 ± 0.041
2.065PheThr: 2.065 ± 0.04
2.576PheVal: 2.576 ± 0.044
0.432PheTrp: 0.432 ± 0.018
0.733PheTyr: 0.733 ± 0.026
0.0PheXaa: 0.0 ± 0.0
Gly
10.841GlyAla: 10.841 ± 0.288
0.793GlyCys: 0.793 ± 0.029
4.924GlyAsp: 4.924 ± 0.069
4.141GlyGlu: 4.141 ± 0.056
3.068GlyPhe: 3.068 ± 0.051
9.698GlyGly: 9.698 ± 0.619
2.124GlyHis: 2.124 ± 0.043
3.967GlyIle: 3.967 ± 0.055
2.354GlyLys: 2.354 ± 0.053
8.69GlyLeu: 8.69 ± 0.104
2.16GlyMet: 2.16 ± 0.042
2.625GlyAsn: 2.625 ± 0.196
4.606GlyPro: 4.606 ± 0.062
3.171GlyGln: 3.171 ± 0.065
6.132GlyArg: 6.132 ± 0.07
5.446GlySer: 5.446 ± 0.104
5.349GlyThr: 5.349 ± 0.164
7.504GlyVal: 7.504 ± 0.09
1.685GlyTrp: 1.685 ± 0.033
2.416GlyTyr: 2.416 ± 0.046
0.0GlyXaa: 0.0 ± 0.0
His
2.496HisAla: 2.496 ± 0.044
0.21HisCys: 0.21 ± 0.013
1.408HisAsp: 1.408 ± 0.034
1.042HisGlu: 1.042 ± 0.03
0.692HisPhe: 0.692 ± 0.023
2.247HisGly: 2.247 ± 0.044
0.718HisHis: 0.718 ± 0.024
0.894HisIle: 0.894 ± 0.021
0.387HisLys: 0.387 ± 0.018
2.18HisLeu: 2.18 ± 0.042
0.355HisMet: 0.355 ± 0.017
0.516HisAsn: 0.516 ± 0.018
1.698HisPro: 1.698 ± 0.028
0.764HisGln: 0.764 ± 0.024
1.896HisArg: 1.896 ± 0.031
1.131HisSer: 1.131 ± 0.026
1.256HisThr: 1.256 ± 0.031
1.742HisVal: 1.742 ± 0.032
0.39HisTrp: 0.39 ± 0.016
0.611HisTyr: 0.611 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
6.022IleAla: 6.022 ± 0.073
0.405IleCys: 0.405 ± 0.016
3.26IleAsp: 3.26 ± 0.058
2.494IleGlu: 2.494 ± 0.045
1.057IlePhe: 1.057 ± 0.023
4.38IleGly: 4.38 ± 0.062
0.809IleHis: 0.809 ± 0.027
1.398IleIle: 1.398 ± 0.035
0.919IleLys: 0.919 ± 0.027
2.877IleLeu: 2.877 ± 0.043
0.617IleMet: 0.617 ± 0.019
1.279IleAsn: 1.279 ± 0.037
2.387IlePro: 2.387 ± 0.043
0.995IleGln: 0.995 ± 0.027
2.612IleArg: 2.612 ± 0.046
2.33IleSer: 2.33 ± 0.043
2.79IleThr: 2.79 ± 0.046
3.586IleVal: 3.586 ± 0.058
0.489IleTrp: 0.489 ± 0.019
0.817IleTyr: 0.817 ± 0.023
0.0IleXaa: 0.0 ± 0.0
Lys
2.615LysAla: 2.615 ± 0.049
0.128LysCys: 0.128 ± 0.009
1.125LysAsp: 1.125 ± 0.031
0.943LysGlu: 0.943 ± 0.029
0.638LysPhe: 0.638 ± 0.022
1.524LysGly: 1.524 ± 0.043
0.507LysHis: 0.507 ± 0.017
0.969LysIle: 0.969 ± 0.026
0.649LysLys: 0.649 ± 0.027
2.286LysLeu: 2.286 ± 0.041
0.473LysMet: 0.473 ± 0.022
0.532LysAsn: 0.532 ± 0.022
1.497LysPro: 1.497 ± 0.036
0.731LysGln: 0.731 ± 0.021
1.622LysArg: 1.622 ± 0.037
1.258LysSer: 1.258 ± 0.034
1.342LysThr: 1.342 ± 0.034
1.954LysVal: 1.954 ± 0.038
0.337LysTrp: 0.337 ± 0.015
0.475LysTyr: 0.475 ± 0.019
0.0LysXaa: 0.0 ± 0.0
Leu
13.945LeuAla: 13.945 ± 0.151
0.826LeuCys: 0.826 ± 0.023
6.27LeuAsp: 6.27 ± 0.082
4.137LeuGlu: 4.137 ± 0.067
2.779LeuPhe: 2.779 ± 0.046
8.533LeuGly: 8.533 ± 0.094
2.17LeuHis: 2.17 ± 0.036
4.121LeuIle: 4.121 ± 0.064
1.785LeuLys: 1.785 ± 0.038
9.783LeuLeu: 9.783 ± 0.107
1.689LeuMet: 1.689 ± 0.038
2.35LeuAsn: 2.35 ± 0.057
6.096LeuPro: 6.096 ± 0.07
2.806LeuGln: 2.806 ± 0.05
7.542LeuArg: 7.542 ± 0.076
5.785LeuSer: 5.785 ± 0.065
6.306LeuThr: 6.306 ± 0.064
8.255LeuVal: 8.255 ± 0.093
1.213LeuTrp: 1.213 ± 0.032
1.709LeuTyr: 1.709 ± 0.038
0.0LeuXaa: 0.0 ± 0.0
Met
2.572MetAla: 2.572 ± 0.042
0.19MetCys: 0.19 ± 0.011
0.882MetAsp: 0.882 ± 0.023
0.668MetGlu: 0.668 ± 0.02
0.636MetPhe: 0.636 ± 0.022
1.472MetGly: 1.472 ± 0.035
0.444MetHis: 0.444 ± 0.017
0.847MetIle: 0.847 ± 0.023
0.458MetLys: 0.458 ± 0.017
2.044MetLeu: 2.044 ± 0.038
0.426MetMet: 0.426 ± 0.018
0.475MetAsn: 0.475 ± 0.018
1.235MetPro: 1.235 ± 0.029
0.568MetGln: 0.568 ± 0.017
1.478MetArg: 1.478 ± 0.034
1.547MetSer: 1.547 ± 0.025
1.768MetThr: 1.768 ± 0.031
1.639MetVal: 1.639 ± 0.033
0.287MetTrp: 0.287 ± 0.013
0.352MetTyr: 0.352 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
2.944AsnAla: 2.944 ± 0.057
0.21AsnCys: 0.21 ± 0.012
1.362AsnAsp: 1.362 ± 0.033
1.063AsnGlu: 1.063 ± 0.031
0.791AsnPhe: 0.791 ± 0.039
2.821AsnGly: 2.821 ± 0.216
0.494AsnHis: 0.494 ± 0.018
1.017AsnIle: 1.017 ± 0.029
0.483AsnLys: 0.483 ± 0.017
2.264AsnLeu: 2.264 ± 0.076
0.468AsnMet: 0.468 ± 0.015
0.666AsnAsn: 0.666 ± 0.028
1.797AsnPro: 1.797 ± 0.034
0.791AsnGln: 0.791 ± 0.024
1.625AsnArg: 1.625 ± 0.034
1.299AsnSer: 1.299 ± 0.04
1.387AsnThr: 1.387 ± 0.037
1.839AsnVal: 1.839 ± 0.034
0.418AsnTrp: 0.418 ± 0.017
0.613AsnTyr: 0.613 ± 0.021
0.0AsnXaa: 0.0 ± 0.0
Pro
7.506ProAla: 7.506 ± 0.11
0.303ProCys: 0.303 ± 0.014
4.266ProAsp: 4.266 ± 0.064
3.659ProGlu: 3.659 ± 0.064
1.697ProPhe: 1.697 ± 0.037
5.698ProGly: 5.698 ± 0.074
1.259ProHis: 1.259 ± 0.027
2.201ProIle: 2.201 ± 0.043
1.316ProLys: 1.316 ± 0.033
4.985ProLeu: 4.985 ± 0.056
1.222ProMet: 1.222 ± 0.03
1.453ProAsn: 1.453 ± 0.041
4.117ProPro: 4.117 ± 0.11
2.059ProGln: 2.059 ± 0.036
3.431ProArg: 3.431 ± 0.055
3.176ProSer: 3.176 ± 0.055
3.476ProThr: 3.476 ± 0.061
5.046ProVal: 5.046 ± 0.065
0.879ProTrp: 0.879 ± 0.025
1.211ProTyr: 1.211 ± 0.032
0.0ProXaa: 0.0 ± 0.0
Gln
4.077GlnAla: 4.077 ± 0.075
0.231GlnCys: 0.231 ± 0.012
1.303GlnAsp: 1.303 ± 0.03
1.243GlnGlu: 1.243 ± 0.029
1.055GlnPhe: 1.055 ± 0.029
2.121GlnGly: 2.121 ± 0.046
0.852GlnHis: 0.852 ± 0.021
1.517GlnIle: 1.517 ± 0.03
0.586GlnLys: 0.586 ± 0.019
3.979GlnLeu: 3.979 ± 0.061
0.692GlnMet: 0.692 ± 0.024
0.699GlnAsn: 0.699 ± 0.03
2.197GlnPro: 2.197 ± 0.048
1.5GlnGln: 1.5 ± 0.034
3.11GlnArg: 3.11 ± 0.053
1.609GlnSer: 1.609 ± 0.038
1.773GlnThr: 1.773 ± 0.039
2.714GlnVal: 2.714 ± 0.042
0.616GlnTrp: 0.616 ± 0.022
0.707GlnTyr: 0.707 ± 0.023
0.0GlnXaa: 0.0 ± 0.0
Arg
8.241ArgAla: 8.241 ± 0.09
0.613ArgCys: 0.613 ± 0.022
4.082ArgAsp: 4.082 ± 0.062
3.609ArgGlu: 3.609 ± 0.057
2.445ArgPhe: 2.445 ± 0.043
5.128ArgGly: 5.128 ± 0.063
1.822ArgHis: 1.822 ± 0.04
3.325ArgIle: 3.325 ± 0.051
1.562ArgLys: 1.562 ± 0.038
7.521ArgLeu: 7.521 ± 0.085
1.784ArgMet: 1.784 ± 0.035
1.688ArgAsn: 1.688 ± 0.034
4.063ArgPro: 4.063 ± 0.063
2.466ArgGln: 2.466 ± 0.043
6.524ArgArg: 6.524 ± 0.097
4.036ArgSer: 4.036 ± 0.056
3.958ArgThr: 3.958 ± 0.053
5.522ArgVal: 5.522 ± 0.072
1.391ArgTrp: 1.391 ± 0.033
1.917ArgTyr: 1.917 ± 0.038
0.0ArgXaa: 0.0 ± 0.0
Ser
7.262SerAla: 7.262 ± 0.079
0.404SerCys: 0.404 ± 0.017
3.213SerAsp: 3.213 ± 0.046
2.472SerGlu: 2.472 ± 0.041
1.716SerPhe: 1.716 ± 0.045
5.955SerGly: 5.955 ± 0.101
1.07SerHis: 1.07 ± 0.026
2.156SerIle: 2.156 ± 0.039
1.163SerLys: 1.163 ± 0.027
4.78SerLeu: 4.78 ± 0.069
1.315SerMet: 1.315 ± 0.028
1.271SerAsn: 1.271 ± 0.034
3.095SerPro: 3.095 ± 0.049
1.629SerGln: 1.629 ± 0.036
3.723SerArg: 3.723 ± 0.047
3.177SerSer: 3.177 ± 0.043
3.252SerThr: 3.252 ± 0.045
4.571SerVal: 4.571 ± 0.061
0.948SerTrp: 0.948 ± 0.026
1.306SerTyr: 1.306 ± 0.03
0.0SerXaa: 0.0 ± 0.0
Thr
7.87ThrAla: 7.87 ± 0.082
0.447ThrCys: 0.447 ± 0.019
3.585ThrAsp: 3.585 ± 0.052
2.988ThrGlu: 2.988 ± 0.043
1.749ThrPhe: 1.749 ± 0.036
6.088ThrGly: 6.088 ± 0.125
1.187ThrHis: 1.187 ± 0.029
2.314ThrIle: 2.314 ± 0.043
1.299ThrLys: 1.299 ± 0.031
5.447ThrLeu: 5.447 ± 0.058
1.149ThrMet: 1.149 ± 0.027
1.402ThrAsn: 1.402 ± 0.036
3.734ThrPro: 3.734 ± 0.053
1.647ThrGln: 1.647 ± 0.035
3.575ThrArg: 3.575 ± 0.048
3.139ThrSer: 3.139 ± 0.048
3.896ThrThr: 3.896 ± 0.072
5.637ThrVal: 5.637 ± 0.061
0.83ThrTrp: 0.83 ± 0.021
1.313ThrTyr: 1.313 ± 0.032
0.0ThrXaa: 0.0 ± 0.0
Val
11.25ValAla: 11.25 ± 0.102
0.774ValCys: 0.774 ± 0.023
5.555ValAsp: 5.555 ± 0.07
4.428ValGlu: 4.428 ± 0.06
2.522ValPhe: 2.522 ± 0.044
6.995ValGly: 6.995 ± 0.076
1.874ValHis: 1.874 ± 0.041
3.982ValIle: 3.982 ± 0.056
1.688ValLys: 1.688 ± 0.038
8.742ValLeu: 8.742 ± 0.085
1.564ValMet: 1.564 ± 0.033
2.218ValAsn: 2.218 ± 0.041
4.638ValPro: 4.638 ± 0.067
2.275ValGln: 2.275 ± 0.041
5.846ValArg: 5.846 ± 0.068
4.735ValSer: 4.735 ± 0.059
5.481ValThr: 5.481 ± 0.064
8.371ValVal: 8.371 ± 0.104
1.007ValTrp: 1.007 ± 0.027
1.539ValTyr: 1.539 ± 0.03
0.0ValXaa: 0.0 ± 0.0
Trp
1.553TrpAla: 1.553 ± 0.029
0.167TrpCys: 0.167 ± 0.012
0.786TrpAsp: 0.786 ± 0.024
0.663TrpGlu: 0.663 ± 0.021
0.524TrpPhe: 0.524 ± 0.02
1.015TrpGly: 1.015 ± 0.023
0.391TrpHis: 0.391 ± 0.017
0.627TrpIle: 0.627 ± 0.021
0.317TrpLys: 0.317 ± 0.016
1.799TrpLeu: 1.799 ± 0.039
0.321TrpMet: 0.321 ± 0.013
0.425TrpAsn: 0.425 ± 0.017
0.864TrpPro: 0.864 ± 0.025
0.668TrpGln: 0.668 ± 0.022
1.286TrpArg: 1.286 ± 0.031
0.949TrpSer: 0.949 ± 0.025
0.884TrpThr: 0.884 ± 0.023
1.159TrpVal: 1.159 ± 0.032
0.345TrpTrp: 0.345 ± 0.016
0.321TrpTyr: 0.321 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.557TyrAla: 2.557 ± 0.04
0.253TyrCys: 0.253 ± 0.013
1.385TyrAsp: 1.385 ± 0.03
1.093TyrGlu: 1.093 ± 0.027
0.786TyrPhe: 0.786 ± 0.026
2.122TyrGly: 2.122 ± 0.044
0.525TyrHis: 0.525 ± 0.02
0.631TyrIle: 0.631 ± 0.022
0.339TyrLys: 0.339 ± 0.016
2.378TyrLeu: 2.378 ± 0.048
0.29TyrMet: 0.29 ± 0.012
0.481TyrAsn: 0.481 ± 0.02
1.276TyrPro: 1.276 ± 0.031
0.816TyrGln: 0.816 ± 0.021
1.92TyrArg: 1.92 ± 0.036
1.131TyrSer: 1.131 ± 0.027
1.172TyrThr: 1.172 ± 0.027
1.687TyrVal: 1.687 ± 0.033
0.367TyrTrp: 0.367 ± 0.015
0.501TyrTyr: 0.501 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4711 proteins (1535493 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
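
A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. Treating the reported ± value as the standard deviation across replicates is an assumption, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation (per mille) of one dipeptide frequency over replicates."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

# Hypothetical toy proteome, not taken from the database.
toy = ["MAAG", "AAL", "GAAA", "LLK"]
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residue pairs keeps each protein's internal composition intact, so the estimate reflects variation between proteins.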

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski