Amino acid dipeptide frequency for Crenothrix polyspora

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
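As an illustration only, dipeptide frequencies can be sketched as counts of overlapping adjacent residue pairs, expressed in per mille. The function name `dipeptide_per_mille`, the use of one-letter codes, and the choice not to count pairs across protein boundaries are assumptions made for this example, not a description of how the database itself computes its values.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Hypothetical helper: counts overlapping adjacent pairs within each
    sequence (one-letter codes) and never pairs residues across proteins.
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields every overlapping pair of neighbouring residues
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "MAAL" gives MA, AA, AL and "AAK" gives AA, AK (5 pairs in total)
freqs = dipeptide_per_mille(["MAAL", "AAK"])
```

Here `freqs["AA"]` is 2 out of 5 pairs, and all frequencies together sum to 1000 per mille.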

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
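The arithmetic behind the 0.25% figure, and a comparison of table values against it, can be sketched as follows. The exact rule the page uses for colouring cells is not stated here, so the plain "above or below the uniform expectation" test and the name `classify` are assumptions for illustration.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# 1 / 400 of all dipeptides, expressed in per mille (equals 0.25 %)
uniform_per_mille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_per_mille, expected_per_mille=uniform_per_mille):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


# Two values taken from the table below: AlaAla and CysCys
ala_ala = classify(8.524)
cys_cys = classify(0.176)
```

Under this simple rule AlaAla (8.524‰) counts as overrepresented and CysCys (0.176‰) as underrepresented.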

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.524AlaAla: 8.524 ± 0.127
1.064AlaCys: 1.064 ± 0.031
4.944AlaAsp: 4.944 ± 0.071
5.319AlaGlu: 5.319 ± 0.09
3.427AlaPhe: 3.427 ± 0.063
6.456AlaGly: 6.456 ± 0.093
2.028AlaHis: 2.028 ± 0.047
6.298AlaIle: 6.298 ± 0.075
5.049AlaLys: 5.049 ± 0.077
10.555AlaLeu: 10.555 ± 0.126
2.346AlaMet: 2.346 ± 0.045
3.779AlaAsn: 3.779 ± 0.063
3.048AlaPro: 3.048 ± 0.058
3.835AlaGln: 3.835 ± 0.065
3.679AlaArg: 3.679 ± 0.067
5.312AlaSer: 5.312 ± 0.077
5.146AlaThr: 5.146 ± 0.089
6.364AlaVal: 6.364 ± 0.083
1.136AlaTrp: 1.136 ± 0.032
2.64AlaTyr: 2.64 ± 0.054
0.0AlaXaa: 0.0 ± 0.0
Cys
0.937CysAla: 0.937 ± 0.026
0.176CysCys: 0.176 ± 0.013
0.549CysAsp: 0.549 ± 0.026
0.501CysGlu: 0.501 ± 0.02
0.501CysPhe: 0.501 ± 0.023
0.903CysGly: 0.903 ± 0.033
0.383CysHis: 0.383 ± 0.022
0.604CysIle: 0.604 ± 0.024
0.486CysLys: 0.486 ± 0.025
1.189CysLeu: 1.189 ± 0.032
0.217CysMet: 0.217 ± 0.013
0.459CysAsn: 0.459 ± 0.022
0.53CysPro: 0.53 ± 0.021
0.482CysGln: 0.482 ± 0.019
0.525CysArg: 0.525 ± 0.022
0.724CysSer: 0.724 ± 0.029
0.649CysThr: 0.649 ± 0.032
0.849CysVal: 0.849 ± 0.03
0.131CysTrp: 0.131 ± 0.01
0.387CysTyr: 0.387 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
4.676AspAla: 4.676 ± 0.07
0.618AspCys: 0.618 ± 0.023
3.065AspAsp: 3.065 ± 0.064
2.943AspGlu: 2.943 ± 0.053
2.647AspPhe: 2.647 ± 0.053
3.789AspGly: 3.789 ± 0.096
1.091AspHis: 1.091 ± 0.033
4.085AspIle: 4.085 ± 0.063
3.273AspLys: 3.273 ± 0.054
4.994AspLeu: 4.994 ± 0.077
1.346AspMet: 1.346 ± 0.034
2.738AspAsn: 2.738 ± 0.051
2.072AspPro: 2.072 ± 0.047
1.779AspGln: 1.779 ± 0.045
2.272AspArg: 2.272 ± 0.048
3.293AspSer: 3.293 ± 0.061
3.222AspThr: 3.222 ± 0.081
3.761AspVal: 3.761 ± 0.071
0.876AspTrp: 0.876 ± 0.027
2.016AspTyr: 2.016 ± 0.046
0.002AspXaa: 0.002 ± 0.001
Glu
4.676GluAla: 4.676 ± 0.076
0.47GluCys: 0.47 ± 0.024
2.351GluAsp: 2.351 ± 0.048
2.543GluGlu: 2.543 ± 0.057
2.032GluPhe: 2.032 ± 0.046
2.774GluGly: 2.774 ± 0.055
1.327GluHis: 1.327 ± 0.037
3.637GluIle: 3.637 ± 0.06
3.404GluLys: 3.404 ± 0.068
5.694GluLeu: 5.694 ± 0.082
1.285GluMet: 1.285 ± 0.038
2.374GluAsn: 2.374 ± 0.049
1.8GluPro: 1.8 ± 0.043
3.164GluGln: 3.164 ± 0.059
2.89GluArg: 2.89 ± 0.068
2.723GluSer: 2.723 ± 0.048
2.924GluThr: 2.924 ± 0.055
3.149GluVal: 3.149 ± 0.065
0.63GluTrp: 0.63 ± 0.026
1.519GluTyr: 1.519 ± 0.038
0.001GluXaa: 0.001 ± 0.001
Phe
3.322PheAla: 3.322 ± 0.053
0.522PheCys: 0.522 ± 0.024
2.634PheAsp: 2.634 ± 0.051
2.021PheGlu: 2.021 ± 0.044
1.957PhePhe: 1.957 ± 0.047
2.861PheGly: 2.861 ± 0.058
0.9PheHis: 0.9 ± 0.031
2.77PheIle: 2.77 ± 0.046
2.212PheLys: 2.212 ± 0.046
3.765PheLeu: 3.765 ± 0.062
1.07PheMet: 1.07 ± 0.034
2.121PheAsn: 2.121 ± 0.042
1.469PhePro: 1.469 ± 0.039
1.302PheGln: 1.302 ± 0.035
1.605PheArg: 1.605 ± 0.041
3.222PheSer: 3.222 ± 0.053
2.419PheThr: 2.419 ± 0.048
2.68PheVal: 2.68 ± 0.049
0.598PheTrp: 0.598 ± 0.022
1.455PheTyr: 1.455 ± 0.041
0.0PheXaa: 0.0 ± 0.0
Gly
5.285GlyAla: 5.285 ± 0.101
0.982GlyCys: 0.982 ± 0.033
3.626GlyAsp: 3.626 ± 0.11
3.307GlyGlu: 3.307 ± 0.058
3.268GlyPhe: 3.268 ± 0.057
5.037GlyGly: 5.037 ± 0.104
1.544GlyHis: 1.544 ± 0.04
4.829GlyIle: 4.829 ± 0.077
4.309GlyLys: 4.309 ± 0.065
7.07GlyLeu: 7.07 ± 0.092
1.802GlyMet: 1.802 ± 0.044
3.079GlyAsn: 3.079 ± 0.083
1.32GlyPro: 1.32 ± 0.035
2.767GlyGln: 2.767 ± 0.052
3.04GlyArg: 3.04 ± 0.058
4.115GlySer: 4.115 ± 0.078
3.667GlyThr: 3.667 ± 0.087
5.018GlyVal: 5.018 ± 0.075
0.992GlyTrp: 0.992 ± 0.033
2.519GlyTyr: 2.519 ± 0.051
0.0GlyXaa: 0.0 ± 0.0
His
2.097HisAla: 2.097 ± 0.045
0.332HisCys: 0.332 ± 0.02
1.292HisAsp: 1.292 ± 0.037
1.14HisGlu: 1.14 ± 0.032
1.188HisPhe: 1.188 ± 0.029
1.639HisGly: 1.639 ± 0.037
0.747HisHis: 0.747 ± 0.031
1.688HisIle: 1.688 ± 0.045
1.195HisLys: 1.195 ± 0.035
2.324HisLeu: 2.324 ± 0.053
0.482HisMet: 0.482 ± 0.022
1.166HisAsn: 1.166 ± 0.035
1.276HisPro: 1.276 ± 0.037
0.992HisGln: 0.992 ± 0.03
1.023HisArg: 1.023 ± 0.032
1.465HisSer: 1.465 ± 0.04
1.342HisThr: 1.342 ± 0.037
1.408HisVal: 1.408 ± 0.038
0.418HisTrp: 0.418 ± 0.02
1.033HisTyr: 1.033 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
6.638IleAla: 6.638 ± 0.079
0.642IleCys: 0.642 ± 0.025
4.225IleAsp: 4.225 ± 0.07
3.898IleGlu: 3.898 ± 0.073
2.382IlePhe: 2.382 ± 0.06
4.472IleGly: 4.472 ± 0.071
1.405IleHis: 1.405 ± 0.038
4.293IleIle: 4.293 ± 0.074
4.029IleLys: 4.029 ± 0.068
5.582IleLeu: 5.582 ± 0.09
1.388IleMet: 1.388 ± 0.037
3.638IleAsn: 3.638 ± 0.064
2.91IlePro: 2.91 ± 0.058
2.23IleGln: 2.23 ± 0.046
2.876IleArg: 2.876 ± 0.049
4.402IleSer: 4.402 ± 0.057
4.239IleThr: 4.239 ± 0.072
4.167IleVal: 4.167 ± 0.072
0.633IleTrp: 0.633 ± 0.022
1.853IleTyr: 1.853 ± 0.043
0.0IleXaa: 0.0 ± 0.0
Lys
4.969LysAla: 4.969 ± 0.063
0.393LysCys: 0.393 ± 0.018
2.857LysAsp: 2.857 ± 0.053
2.589LysGlu: 2.589 ± 0.061
1.67LysPhe: 1.67 ± 0.034
3.168LysGly: 3.168 ± 0.056
1.451LysHis: 1.451 ± 0.043
4.037LysIle: 4.037 ± 0.069
3.508LysLys: 3.508 ± 0.068
5.561LysLeu: 5.561 ± 0.075
1.453LysMet: 1.453 ± 0.044
2.98LysAsn: 2.98 ± 0.063
2.781LysPro: 2.781 ± 0.059
2.871LysGln: 2.871 ± 0.057
2.53LysArg: 2.53 ± 0.053
3.175LysSer: 3.175 ± 0.058
3.81LysThr: 3.81 ± 0.062
3.214LysVal: 3.214 ± 0.063
0.509LysTrp: 0.509 ± 0.02
1.356LysTyr: 1.356 ± 0.037
0.0LysXaa: 0.0 ± 0.0
Leu
10.573LeuAla: 10.573 ± 0.118
1.193LeuCys: 1.193 ± 0.035
5.555LeuAsp: 5.555 ± 0.07
5.246LeuGlu: 5.246 ± 0.081
4.155LeuPhe: 4.155 ± 0.077
6.763LeuGly: 6.763 ± 0.091
2.454LeuHis: 2.454 ± 0.058
6.561LeuIle: 6.561 ± 0.094
5.844LeuLys: 5.844 ± 0.084
11.459LeuLeu: 11.459 ± 0.147
2.522LeuMet: 2.522 ± 0.048
4.753LeuAsn: 4.753 ± 0.068
5.124LeuPro: 5.124 ± 0.072
4.341LeuGln: 4.341 ± 0.076
4.779LeuArg: 4.779 ± 0.07
7.639LeuSer: 7.639 ± 0.083
6.323LeuThr: 6.323 ± 0.08
6.452LeuVal: 6.452 ± 0.072
1.219LeuTrp: 1.219 ± 0.036
2.869LeuTyr: 2.869 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
2.554MetAla: 2.554 ± 0.054
0.173MetCys: 0.173 ± 0.013
1.133MetAsp: 1.133 ± 0.032
1.053MetGlu: 1.053 ± 0.031
0.685MetPhe: 0.685 ± 0.026
1.63MetGly: 1.63 ± 0.039
0.536MetHis: 0.536 ± 0.022
1.371MetIle: 1.371 ± 0.036
1.319MetLys: 1.319 ± 0.035
2.536MetLeu: 2.536 ± 0.047
0.62MetMet: 0.62 ± 0.024
1.064MetAsn: 1.064 ± 0.03
1.302MetPro: 1.302 ± 0.033
1.136MetGln: 1.136 ± 0.034
1.163MetArg: 1.163 ± 0.033
1.596MetSer: 1.596 ± 0.033
1.581MetThr: 1.581 ± 0.038
1.516MetVal: 1.516 ± 0.034
0.174MetTrp: 0.174 ± 0.014
0.488MetTyr: 0.488 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
4.137AsnAla: 4.137 ± 0.069
0.486AsnCys: 0.486 ± 0.022
2.458AsnAsp: 2.458 ± 0.075
1.975AsnGlu: 1.975 ± 0.044
1.694AsnPhe: 1.694 ± 0.037
3.244AsnGly: 3.244 ± 0.087
1.07AsnHis: 1.07 ± 0.034
2.935AsnIle: 2.935 ± 0.057
2.486AsnLys: 2.486 ± 0.055
4.223AsnLeu: 4.223 ± 0.07
0.948AsnMet: 0.948 ± 0.028
2.438AsnAsn: 2.438 ± 0.069
2.479AsnPro: 2.479 ± 0.052
1.98AsnGln: 1.98 ± 0.051
2.081AsnArg: 2.081 ± 0.039
2.779AsnSer: 2.779 ± 0.059
3.106AsnThr: 3.106 ± 0.069
2.726AsnVal: 2.726 ± 0.058
0.71AsnTrp: 0.71 ± 0.028
1.527AsnTyr: 1.527 ± 0.042
0.0AsnXaa: 0.0 ± 0.0
Pro
3.624ProAla: 3.624 ± 0.057
0.428ProCys: 0.428 ± 0.02
2.747ProAsp: 2.747 ± 0.048
2.809ProGlu: 2.809 ± 0.057
1.706ProPhe: 1.706 ± 0.046
2.49ProGly: 2.49 ± 0.044
0.938ProHis: 0.938 ± 0.033
2.622ProIle: 2.622 ± 0.055
2.163ProLys: 2.163 ± 0.051
4.269ProLeu: 4.269 ± 0.065
0.987ProMet: 0.987 ± 0.031
1.651ProAsn: 1.651 ± 0.036
1.619ProPro: 1.619 ± 0.055
1.569ProGln: 1.569 ± 0.038
1.461ProArg: 1.461 ± 0.04
2.494ProSer: 2.494 ± 0.051
2.398ProThr: 2.398 ± 0.055
3.537ProVal: 3.537 ± 0.065
0.517ProTrp: 0.517 ± 0.022
1.295ProTyr: 1.295 ± 0.032
0.0ProXaa: 0.0 ± 0.0
Gln
4.379GlnAla: 4.379 ± 0.08
0.456GlnCys: 0.456 ± 0.019
1.991GlnAsp: 1.991 ± 0.042
2.021GlnGlu: 2.021 ± 0.052
1.686GlnPhe: 1.686 ± 0.039
2.715GlnGly: 2.715 ± 0.049
1.532GlnHis: 1.532 ± 0.041
2.629GlnIle: 2.629 ± 0.046
2.275GlnLys: 2.275 ± 0.046
4.969GlnLeu: 4.969 ± 0.088
0.992GlnMet: 0.992 ± 0.028
1.698GlnAsn: 1.698 ± 0.043
1.822GlnPro: 1.822 ± 0.044
3.223GlnGln: 3.223 ± 0.086
2.459GlnArg: 2.459 ± 0.052
2.427GlnSer: 2.427 ± 0.048
2.343GlnThr: 2.343 ± 0.049
2.701GlnVal: 2.701 ± 0.046
0.681GlnTrp: 0.681 ± 0.027
1.284GlnTyr: 1.284 ± 0.034
0.0GlnXaa: 0.0 ± 0.0
Arg
3.357ArgAla: 3.357 ± 0.058
0.514ArgCys: 0.514 ± 0.022
2.399ArgAsp: 2.399 ± 0.05
2.407ArgGlu: 2.407 ± 0.052
2.153ArgPhe: 2.153 ± 0.048
2.5ArgGly: 2.5 ± 0.052
1.252ArgHis: 1.252 ± 0.031
3.118ArgIle: 3.118 ± 0.055
2.232ArgLys: 2.232 ± 0.047
5.368ArgLeu: 5.368 ± 0.082
1.154ArgMet: 1.154 ± 0.031
1.827ArgAsn: 1.827 ± 0.04
1.69ArgPro: 1.69 ± 0.046
2.252ArgGln: 2.252 ± 0.054
2.209ArgArg: 2.209 ± 0.055
2.536ArgSer: 2.536 ± 0.055
2.183ArgThr: 2.183 ± 0.04
3.091ArgVal: 3.091 ± 0.051
0.72ArgTrp: 0.72 ± 0.029
1.995ArgTyr: 1.995 ± 0.05
0.001ArgXaa: 0.001 ± 0.001
Ser
5.857SerAla: 5.857 ± 0.077
0.767SerCys: 0.767 ± 0.036
3.219SerAsp: 3.219 ± 0.058
2.955SerGlu: 2.955 ± 0.051
2.658SerPhe: 2.658 ± 0.05
4.952SerGly: 4.952 ± 0.075
1.536SerHis: 1.536 ± 0.039
3.981SerIle: 3.981 ± 0.06
3.124SerLys: 3.124 ± 0.05
6.583SerLeu: 6.583 ± 0.083
1.325SerMet: 1.325 ± 0.038
2.741SerAsn: 2.741 ± 0.062
2.538SerPro: 2.538 ± 0.05
2.53SerGln: 2.53 ± 0.047
2.527SerArg: 2.527 ± 0.048
4.027SerSer: 4.027 ± 0.073
3.549SerThr: 3.549 ± 0.069
4.741SerVal: 4.741 ± 0.07
0.825SerTrp: 0.825 ± 0.031
1.972SerTyr: 1.972 ± 0.04
0.0SerXaa: 0.0 ± 0.0
Thr
5.594ThrAla: 5.594 ± 0.1
0.563ThrCys: 0.563 ± 0.024
3.283ThrAsp: 3.283 ± 0.061
3.113ThrGlu: 3.113 ± 0.052
1.95ThrPhe: 1.95 ± 0.042
4.772ThrGly: 4.772 ± 0.102
1.524ThrHis: 1.524 ± 0.04
3.379ThrIle: 3.379 ± 0.066
2.418ThrLys: 2.418 ± 0.05
7.152ThrLeu: 7.152 ± 0.096
1.044ThrMet: 1.044 ± 0.03
2.163ThrAsn: 2.163 ± 0.052
3.156ThrPro: 3.156 ± 0.062
2.764ThrGln: 2.764 ± 0.052
2.443ThrArg: 2.443 ± 0.046
3.158ThrSer: 3.158 ± 0.058
3.33ThrThr: 3.33 ± 0.077
4.627ThrVal: 4.627 ± 0.08
0.676ThrTrp: 0.676 ± 0.025
1.575ThrTyr: 1.575 ± 0.045
0.0ThrXaa: 0.0 ± 0.0
Val
6.113ValAla: 6.113 ± 0.082
0.851ValCys: 0.851 ± 0.032
3.868ValAsp: 3.868 ± 0.062
3.534ValGlu: 3.534 ± 0.061
3.104ValPhe: 3.104 ± 0.052
4.384ValGly: 4.384 ± 0.075
1.366ValHis: 1.366 ± 0.035
4.801ValIle: 4.801 ± 0.075
3.491ValLys: 3.491 ± 0.067
7.331ValLeu: 7.331 ± 0.095
1.748ValMet: 1.748 ± 0.042
2.989ValAsn: 2.989 ± 0.061
2.476ValPro: 2.476 ± 0.045
2.287ValGln: 2.287 ± 0.044
2.899ValArg: 2.899 ± 0.055
4.552ValSer: 4.552 ± 0.071
4.101ValThr: 4.101 ± 0.086
4.88ValVal: 4.88 ± 0.079
0.816ValTrp: 0.816 ± 0.03
1.991ValTyr: 1.991 ± 0.039
0.0ValXaa: 0.0 ± 0.0
Trp
0.924TrpAla: 0.924 ± 0.032
0.148TrpCys: 0.148 ± 0.013
0.668TrpAsp: 0.668 ± 0.029
0.54TrpGlu: 0.54 ± 0.02
0.512TrpPhe: 0.512 ± 0.024
0.769TrpGly: 0.769 ± 0.028
0.414TrpHis: 0.414 ± 0.018
0.675TrpIle: 0.675 ± 0.028
0.573TrpLys: 0.573 ± 0.027
1.854TrpLeu: 1.854 ± 0.048
0.308TrpMet: 0.308 ± 0.016
0.489TrpAsn: 0.489 ± 0.023
0.522TrpPro: 0.522 ± 0.021
1.001TrpGln: 1.001 ± 0.033
0.779TrpArg: 0.779 ± 0.03
0.728TrpSer: 0.728 ± 0.029
0.577TrpThr: 0.577 ± 0.022
0.866TrpVal: 0.866 ± 0.031
0.195TrpTrp: 0.195 ± 0.013
0.413TrpTyr: 0.413 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.711TyrAla: 2.711 ± 0.049
0.403TyrCys: 0.403 ± 0.02
1.695TyrAsp: 1.695 ± 0.042
1.455TyrGlu: 1.455 ± 0.038
1.531TyrPhe: 1.531 ± 0.041
2.258TyrGly: 2.258 ± 0.052
0.75TyrHis: 0.75 ± 0.028
1.592TyrIle: 1.592 ± 0.044
1.493TyrLys: 1.493 ± 0.034
3.341TyrLeu: 3.341 ± 0.062
0.591TyrMet: 0.591 ± 0.022
1.336TyrAsn: 1.336 ± 0.034
1.387TyrPro: 1.387 ± 0.036
1.779TyrGln: 1.779 ± 0.04
1.721TyrArg: 1.721 ± 0.043
2.041TyrSer: 2.041 ± 0.044
1.772TyrThr: 1.772 ± 0.044
1.836TyrVal: 1.836 ± 0.041
0.499TyrTrp: 0.499 ± 0.021
1.099TyrTyr: 1.099 ± 0.038
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.001
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.001
0.001XaaGln: 0.001 ± 0.001
0.0XaaArg: 0.0 ± 0.0
0.001XaaSer: 0.001 ± 0.001
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3943 proteins (1122853 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
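A minimal sketch of what protein-level bootstrapping might look like: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions for this example; the note above does not specify these details.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Hypothetical helper: resamples whole proteins with replacement,
    so residues within a protein always stay together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a Counter returns 0 for missing pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy proteome of three short sequences
err = bootstrap_error(["AAAA", "KKKK", "AAKK"], "AA")
```

Resampling at the protein level, rather than shuffling individual residues, keeps each protein's composition intact, so the estimate reflects variation between proteins.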

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski