Amino acid dipepetide frequency for Sphingobacterium sp. ML3W

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.132AlaAla: 5.132 ± 0.082
0.579AlaCys: 0.579 ± 0.023
3.804AlaAsp: 3.804 ± 0.055
3.96AlaGlu: 3.96 ± 0.058
3.215AlaPhe: 3.215 ± 0.055
4.709AlaGly: 4.709 ± 0.062
1.272AlaHis: 1.272 ± 0.028
5.792AlaIle: 5.792 ± 0.079
4.665AlaLys: 4.665 ± 0.067
6.765AlaLeu: 6.765 ± 0.084
1.646AlaMet: 1.646 ± 0.034
3.566AlaAsn: 3.566 ± 0.056
2.021AlaPro: 2.021 ± 0.034
2.821AlaGln: 2.821 ± 0.046
2.291AlaArg: 2.291 ± 0.045
4.457AlaSer: 4.457 ± 0.06
3.857AlaThr: 3.857 ± 0.054
4.486AlaVal: 4.486 ± 0.064
0.713AlaTrp: 0.713 ± 0.022
2.82AlaTyr: 2.82 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.467CysAla: 0.467 ± 0.019
0.124CysCys: 0.124 ± 0.008
0.365CysAsp: 0.365 ± 0.017
0.361CysGlu: 0.361 ± 0.017
0.381CysPhe: 0.381 ± 0.018
0.561CysGly: 0.561 ± 0.022
0.196CysHis: 0.196 ± 0.01
0.604CysIle: 0.604 ± 0.024
0.432CysLys: 0.432 ± 0.019
0.726CysLeu: 0.726 ± 0.026
0.171CysMet: 0.171 ± 0.011
0.359CysAsn: 0.359 ± 0.016
0.288CysPro: 0.288 ± 0.016
0.287CysGln: 0.287 ± 0.015
0.251CysArg: 0.251 ± 0.013
0.538CysSer: 0.538 ± 0.022
0.418CysThr: 0.418 ± 0.018
0.389CysVal: 0.389 ± 0.016
0.065CysTrp: 0.065 ± 0.006
0.331CysTyr: 0.331 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
3.775AspAla: 3.775 ± 0.061
0.342AspCys: 0.342 ± 0.017
2.619AspAsp: 2.619 ± 0.048
3.533AspGlu: 3.533 ± 0.055
3.326AspPhe: 3.326 ± 0.046
3.74AspGly: 3.74 ± 0.06
1.121AspHis: 1.121 ± 0.032
4.353AspIle: 4.353 ± 0.06
4.136AspLys: 4.136 ± 0.058
5.595AspLeu: 5.595 ± 0.078
1.289AspMet: 1.289 ± 0.031
2.88AspAsn: 2.88 ± 0.053
1.971AspPro: 1.971 ± 0.041
2.204AspGln: 2.204 ± 0.038
2.365AspArg: 2.365 ± 0.039
3.003AspSer: 3.003 ± 0.056
2.386AspThr: 2.386 ± 0.044
3.431AspVal: 3.431 ± 0.047
0.844AspTrp: 0.844 ± 0.026
2.746AspTyr: 2.746 ± 0.049
0.0AspXaa: 0.0 ± 0.0
Glu
4.07GluAla: 4.07 ± 0.068
0.321GluCys: 0.321 ± 0.017
3.157GluAsp: 3.157 ± 0.048
4.37GluGlu: 4.37 ± 0.071
2.524GluPhe: 2.524 ± 0.04
3.475GluGly: 3.475 ± 0.06
1.149GluHis: 1.149 ± 0.029
5.003GluIle: 5.003 ± 0.064
5.133GluLys: 5.133 ± 0.074
5.93GluLeu: 5.93 ± 0.066
1.548GluMet: 1.548 ± 0.036
3.846GluAsn: 3.846 ± 0.051
1.516GluPro: 1.516 ± 0.036
2.685GluGln: 2.685 ± 0.052
2.77GluArg: 2.77 ± 0.044
3.372GluSer: 3.372 ± 0.052
3.06GluThr: 3.06 ± 0.051
4.145GluVal: 4.145 ± 0.05
0.669GluTrp: 0.669 ± 0.025
2.185GluTyr: 2.185 ± 0.041
0.0GluXaa: 0.0 ± 0.0
Phe
3.175PheAla: 3.175 ± 0.039
0.419PheCys: 0.419 ± 0.019
3.138PheAsp: 3.138 ± 0.049
3.131PheGlu: 3.131 ± 0.047
2.53PhePhe: 2.53 ± 0.058
3.387PheGly: 3.387 ± 0.046
0.897PheHis: 0.897 ± 0.029
3.614PheIle: 3.614 ± 0.065
3.363PheLys: 3.363 ± 0.052
4.34PheLeu: 4.34 ± 0.068
1.162PheMet: 1.162 ± 0.031
2.998PheAsn: 2.998 ± 0.055
1.768PhePro: 1.768 ± 0.035
1.696PheGln: 1.696 ± 0.035
1.8PheArg: 1.8 ± 0.04
3.764PheSer: 3.764 ± 0.057
2.755PheThr: 2.755 ± 0.05
2.909PheVal: 2.909 ± 0.051
0.603PheTrp: 0.603 ± 0.022
2.053PheTyr: 2.053 ± 0.048
0.0PheXaa: 0.0 ± 0.0
Gly
4.343GlyAla: 4.343 ± 0.066
0.514GlyCys: 0.514 ± 0.02
3.295GlyAsp: 3.295 ± 0.052
3.4GlyGlu: 3.4 ± 0.057
3.421GlyPhe: 3.421 ± 0.054
4.481GlyGly: 4.481 ± 0.074
1.197GlyHis: 1.197 ± 0.029
5.4GlyIle: 5.4 ± 0.068
5.061GlyLys: 5.061 ± 0.058
6.156GlyLeu: 6.156 ± 0.074
1.74GlyMet: 1.74 ± 0.038
3.636GlyAsn: 3.636 ± 0.055
1.428GlyPro: 1.428 ± 0.038
2.361GlyGln: 2.361 ± 0.043
2.483GlyArg: 2.483 ± 0.05
4.175GlySer: 4.175 ± 0.069
3.886GlyThr: 3.886 ± 0.059
4.325GlyVal: 4.325 ± 0.065
0.869GlyTrp: 0.869 ± 0.026
3.118GlyTyr: 3.118 ± 0.047
0.0GlyXaa: 0.0 ± 0.0
His
1.225HisAla: 1.225 ± 0.033
0.181HisCys: 0.181 ± 0.012
1.034HisAsp: 1.034 ± 0.03
1.069HisGlu: 1.069 ± 0.03
1.11HisPhe: 1.11 ± 0.031
1.111HisGly: 1.111 ± 0.031
0.538HisHis: 0.538 ± 0.02
1.621HisIle: 1.621 ± 0.035
1.052HisLys: 1.052 ± 0.025
1.85HisLeu: 1.85 ± 0.045
0.401HisMet: 0.401 ± 0.016
0.97HisAsn: 0.97 ± 0.029
0.889HisPro: 0.889 ± 0.024
0.837HisGln: 0.837 ± 0.027
0.759HisArg: 0.759 ± 0.025
1.036HisSer: 1.036 ± 0.028
1.002HisThr: 1.002 ± 0.032
1.093HisVal: 1.093 ± 0.029
0.261HisTrp: 0.261 ± 0.014
0.963HisTyr: 0.963 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
6.083IleAla: 6.083 ± 0.077
0.704IleCys: 0.704 ± 0.023
4.71IleAsp: 4.71 ± 0.055
4.78IleGlu: 4.78 ± 0.072
3.423IlePhe: 3.423 ± 0.062
5.245IleGly: 5.245 ± 0.069
1.44IleHis: 1.44 ± 0.032
5.546IleIle: 5.546 ± 0.089
5.346IleLys: 5.346 ± 0.075
6.974IleLeu: 6.974 ± 0.088
1.533IleMet: 1.533 ± 0.036
4.331IleAsn: 4.331 ± 0.068
3.306IlePro: 3.306 ± 0.052
2.923IleGln: 2.923 ± 0.049
3.013IleArg: 3.013 ± 0.048
5.492IleSer: 5.492 ± 0.067
4.24IleThr: 4.24 ± 0.063
4.648IleVal: 4.648 ± 0.069
0.811IleTrp: 0.811 ± 0.027
2.758IleTyr: 2.758 ± 0.05
0.0IleXaa: 0.0 ± 0.0
Lys
4.715LysAla: 4.715 ± 0.07
0.329LysCys: 0.329 ± 0.018
4.315LysAsp: 4.315 ± 0.06
5.425LysGlu: 5.425 ± 0.073
2.818LysPhe: 2.818 ± 0.042
4.61LysGly: 4.61 ± 0.059
1.319LysHis: 1.319 ± 0.032
5.517LysIle: 5.517 ± 0.067
5.662LysLys: 5.662 ± 0.085
6.347LysLeu: 6.347 ± 0.067
1.983LysMet: 1.983 ± 0.038
4.652LysAsn: 4.652 ± 0.057
2.4LysPro: 2.4 ± 0.039
2.857LysGln: 2.857 ± 0.05
2.874LysArg: 2.874 ± 0.042
4.59LysSer: 4.59 ± 0.064
4.016LysThr: 4.016 ± 0.051
4.321LysVal: 4.321 ± 0.053
0.796LysTrp: 0.796 ± 0.025
2.871LysTyr: 2.871 ± 0.045
0.0LysXaa: 0.0 ± 0.0
Leu
6.68LeuAla: 6.68 ± 0.078
0.754LeuCys: 0.754 ± 0.026
5.33LeuAsp: 5.33 ± 0.063
5.478LeuGlu: 5.478 ± 0.067
4.855LeuPhe: 4.855 ± 0.081
5.965LeuGly: 5.965 ± 0.078
1.762LeuHis: 1.762 ± 0.039
6.89LeuIle: 6.89 ± 0.09
7.363LeuLys: 7.363 ± 0.079
9.533LeuLeu: 9.533 ± 0.109
2.258LeuMet: 2.258 ± 0.043
5.556LeuAsn: 5.556 ± 0.077
3.826LeuPro: 3.826 ± 0.051
3.677LeuGln: 3.677 ± 0.053
3.562LeuArg: 3.562 ± 0.052
7.069LeuSer: 7.069 ± 0.078
5.351LeuThr: 5.351 ± 0.067
5.43LeuVal: 5.43 ± 0.066
0.906LeuTrp: 0.906 ± 0.025
3.457LeuTyr: 3.457 ± 0.057
0.0LeuXaa: 0.0 ± 0.0
Met
1.678MetAla: 1.678 ± 0.036
0.135MetCys: 0.135 ± 0.008
1.344MetAsp: 1.344 ± 0.033
1.483MetGlu: 1.483 ± 0.032
0.851MetPhe: 0.851 ± 0.025
1.571MetGly: 1.571 ± 0.033
0.424MetHis: 0.424 ± 0.019
1.67MetIle: 1.67 ± 0.038
2.011MetLys: 2.011 ± 0.035
2.192MetLeu: 2.192 ± 0.04
0.659MetMet: 0.659 ± 0.023
1.433MetAsn: 1.433 ± 0.031
0.938MetPro: 0.938 ± 0.025
0.969MetGln: 0.969 ± 0.028
0.966MetArg: 0.966 ± 0.027
1.492MetSer: 1.492 ± 0.03
1.262MetThr: 1.262 ± 0.029
1.376MetVal: 1.376 ± 0.033
0.214MetTrp: 0.214 ± 0.014
0.694MetTyr: 0.694 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
3.735AsnAla: 3.735 ± 0.064
0.342AsnCys: 0.342 ± 0.016
2.901AsnAsp: 2.901 ± 0.05
3.213AsnGlu: 3.213 ± 0.051
2.849AsnPhe: 2.849 ± 0.053
3.81AsnGly: 3.81 ± 0.068
1.046AsnHis: 1.046 ± 0.031
4.379AsnIle: 4.379 ± 0.074
4.183AsnLys: 4.183 ± 0.062
5.345AsnLeu: 5.345 ± 0.06
1.316AsnMet: 1.316 ± 0.033
3.512AsnAsn: 3.512 ± 0.071
2.685AsnPro: 2.685 ± 0.04
2.316AsnGln: 2.316 ± 0.048
2.448AsnArg: 2.448 ± 0.046
3.584AsnSer: 3.584 ± 0.058
3.278AsnThr: 3.278 ± 0.06
3.111AsnVal: 3.111 ± 0.051
0.821AsnTrp: 0.821 ± 0.031
2.626AsnTyr: 2.626 ± 0.058
0.0AsnXaa: 0.0 ± 0.0
Pro
2.327ProAla: 2.327 ± 0.048
0.201ProCys: 0.201 ± 0.013
2.084ProAsp: 2.084 ± 0.043
2.621ProGlu: 2.621 ± 0.048
1.882ProPhe: 1.882 ± 0.037
2.031ProGly: 2.031 ± 0.04
0.671ProHis: 0.671 ± 0.026
2.91ProIle: 2.91 ± 0.05
2.32ProLys: 2.32 ± 0.046
3.173ProLeu: 3.173 ± 0.052
0.742ProMet: 0.742 ± 0.024
2.103ProAsn: 2.103 ± 0.035
0.779ProPro: 0.779 ± 0.023
1.306ProGln: 1.306 ± 0.036
1.049ProArg: 1.049 ± 0.029
2.242ProSer: 2.242 ± 0.044
2.168ProThr: 2.168 ± 0.036
2.378ProVal: 2.378 ± 0.041
0.393ProTrp: 0.393 ± 0.019
1.539ProTyr: 1.539 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
2.6GlnAla: 2.6 ± 0.049
0.191GlnCys: 0.191 ± 0.012
2.071GlnAsp: 2.071 ± 0.041
2.57GlnGlu: 2.57 ± 0.046
1.813GlnPhe: 1.813 ± 0.035
2.092GlnGly: 2.092 ± 0.038
0.885GlnHis: 0.885 ± 0.024
2.976GlnIle: 2.976 ± 0.051
2.824GlnLys: 2.824 ± 0.048
4.113GlnLeu: 4.113 ± 0.058
0.886GlnMet: 0.886 ± 0.028
2.183GlnAsn: 2.183 ± 0.045
1.174GlnPro: 1.174 ± 0.032
1.975GlnGln: 1.975 ± 0.043
1.483GlnArg: 1.483 ± 0.029
2.264GlnSer: 2.264 ± 0.041
1.928GlnThr: 1.928 ± 0.042
2.623GlnVal: 2.623 ± 0.046
0.425GlnTrp: 0.425 ± 0.018
1.723GlnTyr: 1.723 ± 0.041
0.0GlnXaa: 0.0 ± 0.0
Arg
2.488ArgAla: 2.488 ± 0.048
0.214ArgCys: 0.214 ± 0.012
2.08ArgAsp: 2.08 ± 0.045
2.264ArgGlu: 2.264 ± 0.041
2.013ArgPhe: 2.013 ± 0.039
2.133ArgGly: 2.133 ± 0.042
0.688ArgHis: 0.688 ± 0.023
3.188ArgIle: 3.188 ± 0.055
2.916ArgLys: 2.916 ± 0.048
3.696ArgLeu: 3.696 ± 0.056
1.038ArgMet: 1.038 ± 0.027
2.318ArgAsn: 2.318 ± 0.038
1.275ArgPro: 1.275 ± 0.032
1.348ArgGln: 1.348 ± 0.029
1.432ArgArg: 1.432 ± 0.037
2.343ArgSer: 2.343 ± 0.042
2.067ArgThr: 2.067 ± 0.041
2.418ArgVal: 2.418 ± 0.038
0.521ArgTrp: 0.521 ± 0.021
1.866ArgTyr: 1.866 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
4.137SerAla: 4.137 ± 0.059
0.651SerCys: 0.651 ± 0.022
3.497SerAsp: 3.497 ± 0.052
3.484SerGlu: 3.484 ± 0.05
3.802SerPhe: 3.802 ± 0.064
4.648SerGly: 4.648 ± 0.059
1.147SerHis: 1.147 ± 0.03
5.269SerIle: 5.269 ± 0.069
4.528SerLys: 4.528 ± 0.056
6.42SerLeu: 6.42 ± 0.063
1.341SerMet: 1.341 ± 0.031
3.57SerAsn: 3.57 ± 0.052
2.218SerPro: 2.218 ± 0.039
2.18SerGln: 2.18 ± 0.038
2.39SerArg: 2.39 ± 0.042
4.568SerSer: 4.568 ± 0.068
3.802SerThr: 3.802 ± 0.058
3.913SerVal: 3.913 ± 0.06
0.758SerTrp: 0.758 ± 0.025
3.037SerTyr: 3.037 ± 0.054
0.0SerXaa: 0.0 ± 0.0
Thr
4.131ThrAla: 4.131 ± 0.059
0.371ThrCys: 0.371 ± 0.017
3.217ThrAsp: 3.217 ± 0.054
3.059ThrGlu: 3.059 ± 0.057
2.763ThrPhe: 2.763 ± 0.049
4.143ThrGly: 4.143 ± 0.06
0.999ThrHis: 0.999 ± 0.026
4.365ThrIle: 4.365 ± 0.055
3.551ThrLys: 3.551 ± 0.048
5.37ThrLeu: 5.37 ± 0.063
1.022ThrMet: 1.022 ± 0.026
2.849ThrAsn: 2.849 ± 0.055
2.257ThrPro: 2.257 ± 0.045
1.783ThrGln: 1.783 ± 0.035
1.751ThrArg: 1.751 ± 0.036
3.456ThrSer: 3.456 ± 0.049
3.099ThrThr: 3.099 ± 0.053
3.652ThrVal: 3.652 ± 0.053
0.619ThrTrp: 0.619 ± 0.022
2.389ThrTyr: 2.389 ± 0.046
0.0ThrXaa: 0.0 ± 0.0
Val
4.328ValAla: 4.328 ± 0.07
0.515ValCys: 0.515 ± 0.021
3.613ValAsp: 3.613 ± 0.05
3.706ValGlu: 3.706 ± 0.059
3.048ValPhe: 3.048 ± 0.053
3.924ValGly: 3.924 ± 0.064
1.109ValHis: 1.109 ± 0.028
4.616ValIle: 4.616 ± 0.068
4.267ValLys: 4.267 ± 0.062
6.029ValLeu: 6.029 ± 0.071
1.39ValMet: 1.39 ± 0.035
3.538ValAsn: 3.538 ± 0.045
2.228ValPro: 2.228 ± 0.041
2.226ValGln: 2.226 ± 0.042
2.308ValArg: 2.308 ± 0.04
4.316ValSer: 4.316 ± 0.054
3.308ValThr: 3.308 ± 0.052
4.132ValVal: 4.132 ± 0.065
0.66ValTrp: 0.66 ± 0.022
2.333ValTyr: 2.333 ± 0.039
0.0ValXaa: 0.0 ± 0.0
Trp
0.698TrpAla: 0.698 ± 0.026
0.103TrpCys: 0.103 ± 0.009
0.681TrpAsp: 0.681 ± 0.024
0.717TrpGlu: 0.717 ± 0.025
0.512TrpPhe: 0.512 ± 0.022
0.792TrpGly: 0.792 ± 0.026
0.233TrpHis: 0.233 ± 0.012
0.797TrpIle: 0.797 ± 0.024
0.911TrpLys: 0.911 ± 0.031
1.089TrpLeu: 1.089 ± 0.03
0.344TrpMet: 0.344 ± 0.016
0.804TrpAsn: 0.804 ± 0.024
0.283TrpPro: 0.283 ± 0.015
0.486TrpGln: 0.486 ± 0.022
0.501TrpArg: 0.501 ± 0.018
0.752TrpSer: 0.752 ± 0.027
0.645TrpThr: 0.645 ± 0.024
0.663TrpVal: 0.663 ± 0.022
0.16TrpTrp: 0.16 ± 0.014
0.494TrpTyr: 0.494 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.782TyrAla: 2.782 ± 0.05
0.323TyrCys: 0.323 ± 0.016
2.403TyrAsp: 2.403 ± 0.045
2.305TyrGlu: 2.305 ± 0.046
2.365TyrPhe: 2.365 ± 0.04
2.863TyrGly: 2.863 ± 0.051
0.911TyrHis: 0.911 ± 0.029
2.792TyrIle: 2.792 ± 0.046
2.693TyrLys: 2.693 ± 0.043
4.058TyrLeu: 4.058 ± 0.067
0.893TyrMet: 0.893 ± 0.027
2.396TyrAsn: 2.396 ± 0.055
1.669TyrPro: 1.669 ± 0.036
1.864TyrGln: 1.864 ± 0.039
1.81TyrArg: 1.81 ± 0.038
2.846TyrSer: 2.846 ± 0.046
2.32TyrThr: 2.32 ± 0.044
2.153TyrVal: 2.153 ± 0.035
0.555TyrTrp: 0.555 ± 0.021
1.964TyrTyr: 1.964 ± 0.043
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4058 proteins (1400076 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski