Amino acid dipeptide frequency for bacterium D16-50

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
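
As a rough illustration of what such a calculation could look like, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the normalization by the total number of pairs are assumptions made for this example; the exact procedure used to build this page is not stated.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Hypothetical helper: sequences are assumed to be one-letter-code strings.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real data): the pairs are MK, KK, KL, AA, AK.
freqs = dipeptide_frequencies(["MKKL", "AAK"])
```

With this toy input each of the five observed pairs accounts for one fifth of all pairs.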

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
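
The expected value and the over/under-representation rule described above can be sketched as follows. The function names are hypothetical, and the assumption that the cut-off is exactly the uniform expectation (1/400 of all dipeptides) is taken from the text rather than from any published implementation.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency (per mille) if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed cut-off)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values taken from the table: AlaAla = 8.616, CysCys = 0.329 (per mille).
ala_ala = classify(8.616)
cys_cys = classify(0.329)
```

With 20 amino acids the expectation is 2.5‰; counting Xaa as a 21st residue lowers it to 1000/441, roughly 2.27‰.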

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
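
For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are made up for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla from the table: 8.616 per mille.
fraction = per_mille_to_fraction(8.616)
percent = per_mille_to_percent(8.616)
```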

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) with estimated errors, grouped by first residue (residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa)
Ala
AlaAla: 8.616 ± 0.121
AlaCys: 1.182 ± 0.032
AlaAsp: 4.618 ± 0.055
AlaGlu: 6.339 ± 0.083
AlaPhe: 3.175 ± 0.052
AlaGly: 7.201 ± 0.075
AlaHis: 1.178 ± 0.029
AlaIle: 4.08 ± 0.068
AlaLys: 4.285 ± 0.062
AlaLeu: 7.709 ± 0.095
AlaMet: 2.453 ± 0.046
AlaAsn: 2.314 ± 0.044
AlaPro: 2.378 ± 0.048
AlaGln: 2.79 ± 0.057
AlaArg: 3.671 ± 0.057
AlaSer: 3.999 ± 0.058
AlaThr: 2.747 ± 0.053
AlaVal: 6.803 ± 0.079
AlaTrp: 0.785 ± 0.023
AlaTyr: 3.043 ± 0.05
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.114 ± 0.033
CysCys: 0.329 ± 0.016
CysAsp: 0.822 ± 0.023
CysGlu: 0.864 ± 0.025
CysPhe: 0.734 ± 0.027
CysGly: 1.761 ± 0.038
CysHis: 0.352 ± 0.015
CysIle: 1.065 ± 0.029
CysLys: 0.676 ± 0.022
CysLeu: 1.376 ± 0.033
CysMet: 0.466 ± 0.02
CysAsn: 0.511 ± 0.02
CysPro: 0.612 ± 0.025
CysGln: 0.495 ± 0.019
CysArg: 1.188 ± 0.033
CysSer: 0.935 ± 0.029
CysThr: 0.727 ± 0.024
CysVal: 1.075 ± 0.031
CysTrp: 0.157 ± 0.011
CysTyr: 0.631 ± 0.022
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.114 ± 0.059
AspCys: 0.883 ± 0.027
AspAsp: 2.567 ± 0.053
AspGlu: 4.05 ± 0.056
AspPhe: 2.751 ± 0.044
AspGly: 4.831 ± 0.072
AspHis: 0.694 ± 0.025
AspIle: 4.162 ± 0.064
AspLys: 3.4 ± 0.052
AspLeu: 4.399 ± 0.058
AspMet: 2.07 ± 0.045
AspAsn: 2.006 ± 0.037
AspPro: 1.834 ± 0.05
AspGln: 1.348 ± 0.032
AspArg: 2.938 ± 0.046
AspSer: 3.169 ± 0.051
AspThr: 2.847 ± 0.044
AspVal: 3.578 ± 0.057
AspTrp: 0.654 ± 0.026
AspTyr: 2.758 ± 0.046
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.392 ± 0.067
GluCys: 0.925 ± 0.033
GluAsp: 4.663 ± 0.057
GluGlu: 7.982 ± 0.099
GluPhe: 2.517 ± 0.041
GluGly: 5.984 ± 0.084
GluHis: 1.292 ± 0.033
GluIle: 5.085 ± 0.074
GluLys: 6.213 ± 0.081
GluLeu: 7.098 ± 0.083
GluMet: 2.341 ± 0.043
GluAsn: 3.862 ± 0.058
GluPro: 2.154 ± 0.042
GluGln: 2.974 ± 0.056
GluArg: 4.641 ± 0.071
GluSer: 3.824 ± 0.09
GluThr: 3.65 ± 0.051
GluVal: 4.184 ± 0.061
GluTrp: 0.817 ± 0.027
GluTyr: 3.429 ± 0.055
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.969 ± 0.044
PheCys: 0.923 ± 0.031
PheAsp: 2.407 ± 0.042
PheGlu: 2.455 ± 0.043
PhePhe: 1.899 ± 0.043
PheGly: 3.133 ± 0.053
PheHis: 0.818 ± 0.028
PheIle: 2.318 ± 0.043
PheLys: 1.575 ± 0.037
PheLeu: 4.28 ± 0.069
PheMet: 1.124 ± 0.029
PheAsn: 1.28 ± 0.031
PhePro: 1.441 ± 0.03
PheGln: 1.444 ± 0.032
PheArg: 2.267 ± 0.041
PheSer: 2.933 ± 0.048
PheThr: 1.991 ± 0.036
PheVal: 2.749 ± 0.052
PheTrp: 0.537 ± 0.02
PheTyr: 1.829 ± 0.038
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.742 ± 0.087
GlyCys: 1.404 ± 0.04
GlyAsp: 3.923 ± 0.062
GlyGlu: 5.975 ± 0.075
GlyPhe: 3.201 ± 0.051
GlyGly: 5.955 ± 0.089
GlyHis: 1.398 ± 0.03
GlyIle: 5.845 ± 0.076
GlyLys: 5.292 ± 0.066
GlyLeu: 6.382 ± 0.072
GlyMet: 2.783 ± 0.055
GlyAsn: 3.301 ± 0.068
GlyPro: 1.443 ± 0.054
GlyGln: 2.774 ± 0.057
GlyArg: 4.875 ± 0.07
GlySer: 4.694 ± 0.065
GlyThr: 4.126 ± 0.066
GlyVal: 4.888 ± 0.069
GlyTrp: 0.838 ± 0.029
GlyTyr: 3.466 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.038 ± 0.029
HisCys: 0.294 ± 0.017
HisAsp: 0.898 ± 0.025
HisGlu: 1.133 ± 0.029
HisPhe: 0.811 ± 0.027
HisGly: 1.397 ± 0.033
HisHis: 0.37 ± 0.025
HisIle: 1.323 ± 0.032
HisLys: 0.901 ± 0.027
HisLeu: 1.375 ± 0.034
HisMet: 0.584 ± 0.02
HisAsn: 0.635 ± 0.024
HisPro: 0.827 ± 0.023
HisGln: 0.522 ± 0.019
HisArg: 1.025 ± 0.029
HisSer: 0.955 ± 0.021
HisThr: 0.849 ± 0.028
HisVal: 1.092 ± 0.031
HisTrp: 0.194 ± 0.013
HisTyr: 0.788 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.932 ± 0.069
IleCys: 1.285 ± 0.031
IleAsp: 3.374 ± 0.052
IleGlu: 3.984 ± 0.062
IlePhe: 2.621 ± 0.051
IleGly: 4.632 ± 0.068
IleHis: 1.153 ± 0.029
IleIle: 3.729 ± 0.067
IleLys: 3.164 ± 0.056
IleLeu: 6.529 ± 0.077
IleMet: 1.741 ± 0.036
IleAsn: 2.237 ± 0.046
IlePro: 2.765 ± 0.048
IleGln: 2.049 ± 0.046
IleArg: 3.822 ± 0.052
IleSer: 4.343 ± 0.059
IleThr: 3.258 ± 0.049
IleVal: 4.12 ± 0.064
IleTrp: 0.709 ± 0.025
IleTyr: 2.588 ± 0.041
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.958 ± 0.075
LysCys: 0.73 ± 0.023
LysAsp: 3.412 ± 0.058
LysGlu: 5.841 ± 0.081
LysPhe: 1.52 ± 0.034
LysGly: 4.323 ± 0.058
LysHis: 0.924 ± 0.027
LysIle: 3.745 ± 0.057
LysLys: 4.723 ± 0.065
LysLeu: 5.0 ± 0.062
LysMet: 1.766 ± 0.036
LysAsn: 2.828 ± 0.045
LysPro: 1.854 ± 0.036
LysGln: 1.994 ± 0.047
LysArg: 3.36 ± 0.053
LysSer: 2.952 ± 0.053
LysThr: 3.139 ± 0.057
LysVal: 3.618 ± 0.06
LysTrp: 0.665 ± 0.022
LysTyr: 2.552 ± 0.041
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.517 ± 0.094
LeuCys: 1.721 ± 0.032
LeuAsp: 5.041 ± 0.069
LeuGlu: 7.242 ± 0.084
LeuPhe: 3.984 ± 0.072
LeuGly: 6.449 ± 0.086
LeuHis: 1.595 ± 0.039
LeuIle: 5.032 ± 0.067
LeuLys: 5.412 ± 0.06
LeuLeu: 9.563 ± 0.129
LeuMet: 2.682 ± 0.044
LeuAsn: 3.399 ± 0.044
LeuPro: 3.761 ± 0.059
LeuGln: 3.17 ± 0.052
LeuArg: 4.806 ± 0.067
LeuSer: 6.296 ± 0.071
LeuThr: 4.935 ± 0.062
LeuVal: 5.4 ± 0.069
LeuTrp: 1.075 ± 0.032
LeuTyr: 3.822 ± 0.057
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.689 ± 0.049
MetCys: 0.356 ± 0.017
MetAsp: 1.903 ± 0.036
MetGlu: 2.942 ± 0.051
MetPhe: 0.957 ± 0.033
MetGly: 2.439 ± 0.049
MetHis: 0.458 ± 0.018
MetIle: 1.811 ± 0.035
MetLys: 2.179 ± 0.038
MetLeu: 2.79 ± 0.044
MetMet: 0.834 ± 0.027
MetAsn: 1.384 ± 0.034
MetPro: 1.181 ± 0.033
MetGln: 1.025 ± 0.028
MetArg: 1.563 ± 0.033
MetSer: 1.67 ± 0.038
MetThr: 1.549 ± 0.041
MetVal: 1.908 ± 0.041
MetTrp: 0.25 ± 0.013
MetTyr: 0.925 ± 0.029
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.984 ± 0.052
AsnCys: 0.566 ± 0.021
AsnAsp: 1.839 ± 0.038
AsnGlu: 2.381 ± 0.042
AsnPhe: 1.377 ± 0.033
AsnGly: 3.4 ± 0.052
AsnHis: 0.704 ± 0.023
AsnIle: 2.79 ± 0.05
AsnLys: 2.004 ± 0.037
AsnLeu: 3.54 ± 0.051
AsnMet: 1.23 ± 0.03
AsnAsn: 1.488 ± 0.035
AsnPro: 1.849 ± 0.041
AsnGln: 1.313 ± 0.032
AsnArg: 2.269 ± 0.042
AsnSer: 2.107 ± 0.042
AsnThr: 1.945 ± 0.04
AsnVal: 2.726 ± 0.047
AsnTrp: 0.394 ± 0.017
AsnTyr: 1.664 ± 0.029
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.75 ± 0.046
ProCys: 0.46 ± 0.019
ProAsp: 2.356 ± 0.045
ProGlu: 3.81 ± 0.065
ProPhe: 1.466 ± 0.033
ProGly: 2.473 ± 0.041
ProHis: 0.599 ± 0.019
ProIle: 1.977 ± 0.036
ProLys: 1.859 ± 0.038
ProLeu: 2.921 ± 0.048
ProMet: 1.013 ± 0.028
ProAsn: 1.071 ± 0.028
ProPro: 0.957 ± 0.028
ProGln: 1.252 ± 0.029
ProArg: 1.296 ± 0.035
ProSer: 1.781 ± 0.039
ProThr: 1.485 ± 0.048
ProVal: 2.606 ± 0.049
ProTrp: 0.351 ± 0.017
ProTyr: 1.53 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.658 ± 0.051
GlnCys: 0.386 ± 0.017
GlnAsp: 1.675 ± 0.035
GlnGlu: 3.472 ± 0.052
GlnPhe: 1.133 ± 0.029
GlnGly: 2.621 ± 0.05
GlnHis: 0.465 ± 0.019
GlnIle: 2.38 ± 0.046
GlnLys: 2.535 ± 0.048
GlnLeu: 2.889 ± 0.057
GlnMet: 1.09 ± 0.03
GlnAsn: 1.597 ± 0.036
GlnPro: 0.984 ± 0.028
GlnGln: 1.235 ± 0.033
GlnArg: 1.908 ± 0.044
GlnSer: 1.688 ± 0.037
GlnThr: 1.669 ± 0.034
GlnVal: 2.122 ± 0.043
GlnTrp: 0.377 ± 0.017
GlnTyr: 1.321 ± 0.034
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.556 ± 0.058
ArgCys: 0.736 ± 0.023
ArgAsp: 2.753 ± 0.047
ArgGlu: 5.06 ± 0.075
ArgPhe: 2.274 ± 0.039
ArgGly: 3.375 ± 0.054
ArgHis: 1.019 ± 0.028
ArgIle: 3.917 ± 0.056
ArgLys: 3.909 ± 0.062
ArgLeu: 5.439 ± 0.068
ArgMet: 1.996 ± 0.039
ArgAsn: 2.375 ± 0.046
ArgPro: 1.784 ± 0.035
ArgGln: 2.39 ± 0.046
ArgArg: 3.882 ± 0.074
ArgSer: 2.718 ± 0.041
ArgThr: 2.527 ± 0.042
ArgVal: 3.001 ± 0.042
ArgTrp: 0.53 ± 0.018
ArgTyr: 2.292 ± 0.047
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.526 ± 0.074
SerCys: 0.921 ± 0.028
SerAsp: 3.134 ± 0.049
SerGlu: 3.923 ± 0.086
SerPhe: 2.779 ± 0.045
SerGly: 5.539 ± 0.09
SerHis: 1.07 ± 0.028
SerIle: 3.502 ± 0.053
SerLys: 2.787 ± 0.05
SerLeu: 5.362 ± 0.072
SerMet: 1.856 ± 0.034
SerAsn: 1.855 ± 0.042
SerPro: 1.969 ± 0.036
SerGln: 1.95 ± 0.037
SerArg: 3.291 ± 0.047
SerSer: 3.324 ± 0.063
SerThr: 2.449 ± 0.068
SerVal: 4.292 ± 0.058
SerTrp: 0.557 ± 0.021
SerTyr: 2.405 ± 0.045
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.555 ± 0.07
ThrCys: 0.594 ± 0.023
ThrAsp: 2.888 ± 0.063
ThrGlu: 3.788 ± 0.052
ThrPhe: 1.881 ± 0.039
ThrGly: 4.177 ± 0.067
ThrHis: 0.81 ± 0.024
ThrIle: 2.987 ± 0.048
ThrLys: 2.498 ± 0.048
ThrLeu: 4.507 ± 0.06
ThrMet: 1.313 ± 0.034
ThrAsn: 1.598 ± 0.042
ThrPro: 1.95 ± 0.042
ThrGln: 1.455 ± 0.035
ThrArg: 1.948 ± 0.036
ThrSer: 2.522 ± 0.065
ThrThr: 2.15 ± 0.048
ThrVal: 4.282 ± 0.076
ThrTrp: 0.474 ± 0.018
ThrTyr: 1.909 ± 0.048
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.606 ± 0.064
ValCys: 1.28 ± 0.032
ValAsp: 3.527 ± 0.05
ValGlu: 4.747 ± 0.063
ValPhe: 2.979 ± 0.048
ValGly: 4.223 ± 0.073
ValHis: 1.055 ± 0.029
ValIle: 4.166 ± 0.059
ValLys: 3.709 ± 0.06
ValLeu: 6.683 ± 0.087
ValMet: 1.992 ± 0.043
ValAsn: 2.638 ± 0.047
ValPro: 2.536 ± 0.045
ValGln: 2.043 ± 0.044
ValArg: 3.686 ± 0.048
ValSer: 4.658 ± 0.072
ValThr: 3.682 ± 0.07
ValVal: 4.45 ± 0.073
ValTrp: 0.827 ± 0.027
ValTyr: 2.803 ± 0.051
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.662 ± 0.021
TrpCys: 0.195 ± 0.01
TrpAsp: 0.615 ± 0.021
TrpGlu: 0.932 ± 0.025
TrpPhe: 0.417 ± 0.016
TrpGly: 0.873 ± 0.025
TrpHis: 0.209 ± 0.013
TrpIle: 0.616 ± 0.022
TrpLys: 0.773 ± 0.024
TrpLeu: 1.006 ± 0.028
TrpMet: 0.353 ± 0.018
TrpAsn: 0.576 ± 0.019
TrpPro: 0.24 ± 0.014
TrpGln: 0.49 ± 0.017
TrpArg: 0.589 ± 0.023
TrpSer: 0.51 ± 0.019
TrpThr: 0.473 ± 0.019
TrpVal: 0.621 ± 0.021
TrpTrp: 0.135 ± 0.009
TrpTyr: 0.471 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.987 ± 0.052
TyrCys: 0.717 ± 0.025
TyrAsp: 2.639 ± 0.069
TyrGlu: 3.108 ± 0.047
TyrPhe: 1.895 ± 0.044
TyrGly: 3.518 ± 0.058
TyrHis: 0.849 ± 0.025
TyrIle: 2.505 ± 0.042
TyrLys: 2.009 ± 0.045
TyrLeu: 4.051 ± 0.061
TyrMet: 1.124 ± 0.029
TyrAsn: 1.62 ± 0.033
TyrPro: 1.497 ± 0.03
TyrGln: 1.491 ± 0.031
TyrArg: 2.506 ± 0.041
TyrSer: 2.414 ± 0.047
TyrThr: 2.089 ± 0.044
TyrVal: 2.777 ± 0.043
TyrTrp: 0.431 ± 0.021
TyrTyr: 1.945 ± 0.046
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4338 proteins (1406950 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
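
A minimal sketch of a protein-level bootstrap is given below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. Treating the reported ± value as the standard deviation across replicates is an assumption, as are the function names and the fixed random seed.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides in one protein (one-letter codes assumed)."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency across
    bootstrap replicates, resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input (not real data).
err = bootstrap_error(["AAKK", "KKAA", "MKLA"], "AA")
```

Resampling proteins rather than individual residue pairs keeps the pairs from one protein together, so proteins with unusual composition contribute to the estimated spread.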

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski