Amino acid dipepetide frequency for Caenorhabditis elegans

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.796AlaAla: 5.796 ± 0.047
1.113AlaCys: 1.113 ± 0.013
3.294AlaAsp: 3.294 ± 0.025
4.298AlaGlu: 4.298 ± 0.038
2.587AlaPhe: 2.587 ± 0.02
3.454AlaGly: 3.454 ± 0.027
1.462AlaHis: 1.462 ± 0.012
3.793AlaIle: 3.793 ± 0.021
3.831AlaLys: 3.831 ± 0.032
5.252AlaLeu: 5.252 ± 0.033
1.637AlaMet: 1.637 ± 0.012
2.764AlaAsn: 2.764 ± 0.017
3.948AlaPro: 3.948 ± 0.04
2.888AlaGln: 2.888 ± 0.025
3.195AlaArg: 3.195 ± 0.022
5.249AlaSer: 5.249 ± 0.031
4.042AlaThr: 4.042 ± 0.026
4.32AlaVal: 4.32 ± 0.027
0.573AlaTrp: 0.573 ± 0.008
1.693AlaTyr: 1.693 ± 0.014
0.0AlaXaa: 0.0 ± 0.0
Cys
1.201CysAla: 1.201 ± 0.015
0.534CysCys: 0.534 ± 0.012
1.109CysAsp: 1.109 ± 0.014
1.211CysGlu: 1.211 ± 0.014
0.885CysPhe: 0.885 ± 0.011
1.224CysGly: 1.224 ± 0.014
0.477CysHis: 0.477 ± 0.007
1.123CysIle: 1.123 ± 0.011
1.047CysLys: 1.047 ± 0.012
1.619CysLeu: 1.619 ± 0.014
0.463CysMet: 0.463 ± 0.006
0.856CysAsn: 0.856 ± 0.012
1.034CysPro: 1.034 ± 0.019
0.874CysGln: 0.874 ± 0.017
1.052CysArg: 1.052 ± 0.012
1.687CysSer: 1.687 ± 0.02
1.088CysThr: 1.088 ± 0.012
1.237CysVal: 1.237 ± 0.015
0.212CysTrp: 0.212 ± 0.005
0.581CysTyr: 0.581 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
3.509AspAla: 3.509 ± 0.034
0.962AspCys: 0.962 ± 0.013
4.244AspAsp: 4.244 ± 0.033
4.746AspGlu: 4.746 ± 0.033
2.44AspPhe: 2.44 ± 0.014
3.603AspGly: 3.603 ± 0.052
1.19AspHis: 1.19 ± 0.012
2.989AspIle: 2.989 ± 0.019
2.909AspLys: 2.909 ± 0.03
4.372AspLeu: 4.372 ± 0.026
1.3AspMet: 1.3 ± 0.011
2.252AspAsn: 2.252 ± 0.014
2.434AspPro: 2.434 ± 0.018
2.056AspGln: 2.056 ± 0.017
2.691AspArg: 2.691 ± 0.023
4.112AspSer: 4.112 ± 0.026
2.597AspThr: 2.597 ± 0.014
3.683AspVal: 3.683 ± 0.026
0.619AspTrp: 0.619 ± 0.007
1.728AspTyr: 1.728 ± 0.014
0.0AspXaa: 0.0 ± 0.0
Glu
4.239GluAla: 4.239 ± 0.04
1.158GluCys: 1.158 ± 0.016
4.045GluAsp: 4.045 ± 0.028
6.365GluGlu: 6.365 ± 0.053
2.453GluPhe: 2.453 ± 0.017
2.773GluGly: 2.773 ± 0.024
1.621GluHis: 1.621 ± 0.013
4.076GluIle: 4.076 ± 0.025
5.937GluLys: 5.937 ± 0.049
5.364GluLeu: 5.364 ± 0.033
1.965GluMet: 1.965 ± 0.015
3.706GluAsn: 3.706 ± 0.017
2.827GluPro: 2.827 ± 0.034
2.962GluGln: 2.962 ± 0.024
3.559GluArg: 3.559 ± 0.03
4.528GluSer: 4.528 ± 0.033
3.9GluThr: 3.9 ± 0.038
3.709GluVal: 3.709 ± 0.031
0.722GluTrp: 0.722 ± 0.008
1.928GluTyr: 1.928 ± 0.013
0.0GluXaa: 0.0 ± 0.0
Phe
2.691PheAla: 2.691 ± 0.019
0.987PheCys: 0.987 ± 0.011
2.605PheAsp: 2.605 ± 0.016
2.756PheGlu: 2.756 ± 0.019
2.261PhePhe: 2.261 ± 0.019
2.869PheGly: 2.869 ± 0.021
1.055PheHis: 1.055 ± 0.009
2.58PheIle: 2.58 ± 0.018
2.156PheLys: 2.156 ± 0.016
4.084PheLeu: 4.084 ± 0.023
1.118PheMet: 1.118 ± 0.01
2.053PheAsn: 2.053 ± 0.014
1.804PhePro: 1.804 ± 0.015
1.701PheGln: 1.701 ± 0.015
2.086PheArg: 2.086 ± 0.014
3.496PheSer: 3.496 ± 0.021
2.335PheThr: 2.335 ± 0.015
2.984PheVal: 2.984 ± 0.018
0.531PheTrp: 0.531 ± 0.007
1.576PheTyr: 1.576 ± 0.014
0.0PheXaa: 0.0 ± 0.0
Gly
3.603GlyAla: 3.603 ± 0.028
1.056GlyCys: 1.056 ± 0.013
2.907GlyAsp: 2.907 ± 0.021
3.34GlyGlu: 3.34 ± 0.031
2.492GlyPhe: 2.492 ± 0.02
4.459GlyGly: 4.459 ± 0.049
1.239GlyHis: 1.239 ± 0.011
3.04GlyIle: 3.04 ± 0.017
3.32GlyLys: 3.32 ± 0.026
3.909GlyLeu: 3.909 ± 0.024
1.314GlyMet: 1.314 ± 0.013
2.715GlyAsn: 2.715 ± 0.025
2.352GlyPro: 2.352 ± 0.039
2.151GlyGln: 2.151 ± 0.023
2.744GlyArg: 2.744 ± 0.02
4.665GlySer: 4.665 ± 0.033
3.148GlyThr: 3.148 ± 0.025
3.278GlyVal: 3.278 ± 0.019
0.637GlyTrp: 0.637 ± 0.01
1.948GlyTyr: 1.948 ± 0.018
0.0GlyXaa: 0.0 ± 0.0
His
1.253HisAla: 1.253 ± 0.011
0.5HisCys: 0.5 ± 0.007
1.139HisAsp: 1.139 ± 0.01
1.345HisGlu: 1.345 ± 0.011
1.194HisPhe: 1.194 ± 0.011
1.308HisGly: 1.308 ± 0.014
0.99HisHis: 0.99 ± 0.016
1.283HisIle: 1.283 ± 0.012
1.135HisLys: 1.135 ± 0.009
2.135HisLeu: 2.135 ± 0.017
0.569HisMet: 0.569 ± 0.007
1.046HisAsn: 1.046 ± 0.01
1.221HisPro: 1.221 ± 0.012
1.184HisGln: 1.184 ± 0.014
1.397HisArg: 1.397 ± 0.012
1.816HisSer: 1.816 ± 0.012
1.143HisThr: 1.143 ± 0.01
1.571HisVal: 1.571 ± 0.012
0.277HisTrp: 0.277 ± 0.005
0.797HisTyr: 0.797 ± 0.009
0.0HisXaa: 0.0 ± 0.0
Ile
3.862IleAla: 3.862 ± 0.022
1.265IleCys: 1.265 ± 0.013
3.495IleAsp: 3.495 ± 0.023
3.898IleGlu: 3.898 ± 0.02
2.899IlePhe: 2.899 ± 0.023
3.251IleGly: 3.251 ± 0.022
1.352IleHis: 1.352 ± 0.01
3.465IleIle: 3.465 ± 0.021
2.966IleLys: 2.966 ± 0.019
5.157IleLeu: 5.157 ± 0.027
1.329IleMet: 1.329 ± 0.013
2.565IleAsn: 2.565 ± 0.016
3.159IlePro: 3.159 ± 0.022
2.315IleGln: 2.315 ± 0.013
3.143IleArg: 3.143 ± 0.019
4.786IleSer: 4.786 ± 0.024
3.209IleThr: 3.209 ± 0.018
3.974IleVal: 3.974 ± 0.022
0.632IleTrp: 0.632 ± 0.008
1.818IleTyr: 1.818 ± 0.016
0.0IleXaa: 0.0 ± 0.0
Lys
3.454LysAla: 3.454 ± 0.026
1.286LysCys: 1.286 ± 0.016
3.234LysAsp: 3.234 ± 0.039
4.538LysGlu: 4.538 ± 0.044
2.438LysPhe: 2.438 ± 0.016
2.414LysGly: 2.414 ± 0.019
1.328LysHis: 1.328 ± 0.012
3.772LysIle: 3.772 ± 0.021
5.813LysLys: 5.813 ± 0.063
5.417LysLeu: 5.417 ± 0.051
1.87LysMet: 1.87 ± 0.013
3.306LysAsn: 3.306 ± 0.018
2.844LysPro: 2.844 ± 0.031
2.42LysGln: 2.42 ± 0.029
3.519LysArg: 3.519 ± 0.02
4.722LysSer: 4.722 ± 0.032
3.674LysThr: 3.674 ± 0.022
3.432LysVal: 3.432 ± 0.021
0.748LysTrp: 0.748 ± 0.009
1.934LysTyr: 1.934 ± 0.014
0.0LysXaa: 0.0 ± 0.0
Leu
5.595LeuAla: 5.595 ± 0.028
1.585LeuCys: 1.585 ± 0.014
4.366LeuAsp: 4.366 ± 0.026
5.647LeuGlu: 5.647 ± 0.034
3.868LeuPhe: 3.868 ± 0.025
3.885LeuGly: 3.885 ± 0.02
2.02LeuHis: 2.02 ± 0.015
5.05LeuIle: 5.05 ± 0.029
5.563LeuLys: 5.563 ± 0.039
7.919LeuLeu: 7.919 ± 0.043
2.147LeuMet: 2.147 ± 0.017
4.044LeuAsn: 4.044 ± 0.024
4.355LeuPro: 4.355 ± 0.038
3.615LeuGln: 3.615 ± 0.026
4.598LeuArg: 4.598 ± 0.027
6.54LeuSer: 6.54 ± 0.028
4.74LeuThr: 4.74 ± 0.021
4.999LeuVal: 4.999 ± 0.02
0.807LeuTrp: 0.807 ± 0.009
2.299LeuTyr: 2.299 ± 0.017
0.0LeuXaa: 0.0 ± 0.0
Met
1.765MetAla: 1.765 ± 0.013
0.507MetCys: 0.507 ± 0.007
1.421MetAsp: 1.421 ± 0.011
1.769MetGlu: 1.769 ± 0.013
1.15MetPhe: 1.15 ± 0.011
1.183MetGly: 1.183 ± 0.011
0.556MetHis: 0.556 ± 0.006
1.551MetIle: 1.551 ± 0.012
1.744MetLys: 1.744 ± 0.013
2.154MetLeu: 2.154 ± 0.017
0.883MetMet: 0.883 ± 0.01
1.355MetAsn: 1.355 ± 0.011
1.197MetPro: 1.197 ± 0.012
1.046MetGln: 1.046 ± 0.012
1.41MetArg: 1.41 ± 0.011
2.268MetSer: 2.268 ± 0.016
1.553MetThr: 1.553 ± 0.012
1.424MetVal: 1.424 ± 0.011
0.256MetTrp: 0.256 ± 0.004
0.741MetTyr: 0.741 ± 0.008
0.0MetXaa: 0.0 ± 0.0
Asn
2.928AsnAla: 2.928 ± 0.016
0.958AsnCys: 0.958 ± 0.011
2.616AsnAsp: 2.616 ± 0.015
3.147AsnGlu: 3.147 ± 0.016
2.129AsnPhe: 2.129 ± 0.015
3.454AsnGly: 3.454 ± 0.025
1.097AsnHis: 1.097 ± 0.01
2.697AsnIle: 2.697 ± 0.016
2.422AsnLys: 2.422 ± 0.018
3.993AsnLeu: 3.993 ± 0.025
1.247AsnMet: 1.247 ± 0.011
2.511AsnAsn: 2.511 ± 0.019
2.25AsnPro: 2.25 ± 0.015
2.104AsnGln: 2.104 ± 0.017
2.499AsnArg: 2.499 ± 0.015
3.989AsnSer: 3.989 ± 0.022
2.446AsnThr: 2.446 ± 0.016
3.152AsnVal: 3.152 ± 0.017
0.523AsnTrp: 0.523 ± 0.007
1.621AsnTyr: 1.621 ± 0.018
0.0AsnXaa: 0.0 ± 0.0
Pro
3.588ProAla: 3.588 ± 0.034
0.713ProCys: 0.713 ± 0.01
2.399ProAsp: 2.399 ± 0.029
3.304ProGlu: 3.304 ± 0.028
1.932ProPhe: 1.932 ± 0.014
2.796ProGly: 2.796 ± 0.045
1.01ProHis: 1.01 ± 0.01
2.962ProIle: 2.962 ± 0.024
2.917ProLys: 2.917 ± 0.024
3.776ProLeu: 3.776 ± 0.033
1.266ProMet: 1.266 ± 0.013
2.29ProAsn: 2.29 ± 0.017
4.442ProPro: 4.442 ± 0.045
2.266ProGln: 2.266 ± 0.021
2.404ProArg: 2.404 ± 0.017
4.734ProSer: 4.734 ± 0.034
3.73ProThr: 3.73 ± 0.048
3.359ProVal: 3.359 ± 0.033
0.394ProTrp: 0.394 ± 0.006
1.359ProTyr: 1.359 ± 0.014
0.0ProXaa: 0.0 ± 0.0
Gln
2.568GlnAla: 2.568 ± 0.025
0.899GlnCys: 0.899 ± 0.018
1.664GlnAsp: 1.664 ± 0.013
2.508GlnGlu: 2.508 ± 0.028
1.866GlnPhe: 1.866 ± 0.014
1.766GlnGly: 1.766 ± 0.018
1.149GlnHis: 1.149 ± 0.013
2.456GlnIle: 2.456 ± 0.015
2.99GlnLys: 2.99 ± 0.028
4.045GlnLeu: 4.045 ± 0.027
1.392GlnMet: 1.392 ± 0.014
2.297GlnAsn: 2.297 ± 0.021
2.286GlnPro: 2.286 ± 0.024
3.561GlnGln: 3.561 ± 0.058
2.314GlnArg: 2.314 ± 0.016
3.009GlnSer: 3.009 ± 0.018
2.266GlnThr: 2.266 ± 0.016
2.309GlnVal: 2.309 ± 0.015
0.5GlnTrp: 0.5 ± 0.007
1.346GlnTyr: 1.346 ± 0.013
0.0GlnXaa: 0.0 ± 0.0
Arg
3.106ArgAla: 3.106 ± 0.018
0.972ArgCys: 0.972 ± 0.012
2.715ArgAsp: 2.715 ± 0.018
3.424ArgGlu: 3.424 ± 0.029
2.201ArgPhe: 2.201 ± 0.016
2.585ArgGly: 2.585 ± 0.022
1.364ArgHis: 1.364 ± 0.013
3.11ArgIle: 3.11 ± 0.019
3.873ArgLys: 3.873 ± 0.023
4.433ArgLeu: 4.433 ± 0.026
1.372ArgMet: 1.372 ± 0.01
2.744ArgAsn: 2.744 ± 0.017
2.417ArgPro: 2.417 ± 0.02
2.371ArgGln: 2.371 ± 0.018
4.146ArgArg: 4.146 ± 0.033
4.121ArgSer: 4.121 ± 0.027
2.729ArgThr: 2.729 ± 0.015
3.052ArgVal: 3.052 ± 0.018
0.554ArgTrp: 0.554 ± 0.007
1.531ArgTyr: 1.531 ± 0.014
0.0ArgXaa: 0.0 ± 0.0
Ser
5.467SerAla: 5.467 ± 0.025
1.523SerCys: 1.523 ± 0.02
4.361SerAsp: 4.361 ± 0.025
5.048SerGlu: 5.048 ± 0.035
3.396SerPhe: 3.396 ± 0.018
4.7SerGly: 4.7 ± 0.048
1.731SerHis: 1.731 ± 0.013
4.711SerIle: 4.711 ± 0.024
4.433SerLys: 4.433 ± 0.028
6.398SerLeu: 6.398 ± 0.029
2.019SerMet: 2.019 ± 0.015
3.83SerAsn: 3.83 ± 0.022
4.369SerPro: 4.369 ± 0.033
3.251SerGln: 3.251 ± 0.02
4.155SerArg: 4.155 ± 0.026
9.568SerSer: 9.568 ± 0.073
6.032SerThr: 6.032 ± 0.046
4.862SerVal: 4.862 ± 0.026
0.778SerTrp: 0.778 ± 0.008
2.182SerTyr: 2.182 ± 0.017
0.0SerXaa: 0.0 ± 0.0
Thr
3.983ThrAla: 3.983 ± 0.025
1.285ThrCys: 1.285 ± 0.018
2.946ThrAsp: 2.946 ± 0.046
3.613ThrGlu: 3.613 ± 0.036
2.491ThrPhe: 2.491 ± 0.018
3.163ThrGly: 3.163 ± 0.022
1.15ThrHis: 1.15 ± 0.01
3.74ThrIle: 3.74 ± 0.023
3.065ThrLys: 3.065 ± 0.019
4.676ThrLeu: 4.676 ± 0.022
1.409ThrMet: 1.409 ± 0.012
2.583ThrAsn: 2.583 ± 0.016
3.665ThrPro: 3.665 ± 0.031
2.062ThrGln: 2.062 ± 0.016
2.629ThrArg: 2.629 ± 0.015
5.679ThrSer: 5.679 ± 0.04
5.329ThrThr: 5.329 ± 0.084
4.265ThrVal: 4.265 ± 0.033
0.633ThrTrp: 0.633 ± 0.007
1.682ThrTyr: 1.682 ± 0.015
0.0ThrXaa: 0.0 ± 0.0
Val
4.274ValAla: 4.274 ± 0.017
1.278ValCys: 1.278 ± 0.014
3.577ValAsp: 3.577 ± 0.018
4.458ValGlu: 4.458 ± 0.054
2.933ValPhe: 2.933 ± 0.017
3.083ValGly: 3.083 ± 0.018
1.471ValHis: 1.471 ± 0.012
3.743ValIle: 3.743 ± 0.024
3.654ValLys: 3.654 ± 0.023
5.343ValLeu: 5.343 ± 0.025
1.49ValMet: 1.49 ± 0.013
2.768ValAsn: 2.768 ± 0.017
3.266ValPro: 3.266 ± 0.028
2.587ValGln: 2.587 ± 0.016
3.001ValArg: 3.001 ± 0.019
4.687ValSer: 4.687 ± 0.02
3.768ValThr: 3.768 ± 0.03
4.347ValVal: 4.347 ± 0.025
0.647ValTrp: 0.647 ± 0.007
1.883ValTyr: 1.883 ± 0.013
0.0ValXaa: 0.0 ± 0.0
Trp
0.593TrpAla: 0.593 ± 0.008
0.216TrpCys: 0.216 ± 0.005
0.543TrpAsp: 0.543 ± 0.008
0.558TrpGlu: 0.558 ± 0.007
0.53TrpPhe: 0.53 ± 0.007
0.443TrpGly: 0.443 ± 0.008
0.245TrpHis: 0.245 ± 0.004
0.729TrpIle: 0.729 ± 0.009
0.837TrpLys: 0.837 ± 0.009
0.978TrpLeu: 0.978 ± 0.009
0.356TrpMet: 0.356 ± 0.005
0.634TrpAsn: 0.634 ± 0.009
0.393TrpPro: 0.393 ± 0.007
0.427TrpGln: 0.427 ± 0.007
0.59TrpArg: 0.59 ± 0.007
0.793TrpSer: 0.793 ± 0.009
0.677TrpThr: 0.677 ± 0.009
0.535TrpVal: 0.535 ± 0.008
0.161TrpTrp: 0.161 ± 0.004
0.377TrpTyr: 0.377 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.714TyrAla: 1.714 ± 0.015
0.719TyrCys: 0.719 ± 0.009
1.754TyrAsp: 1.754 ± 0.013
1.866TyrGlu: 1.866 ± 0.013
1.582TyrPhe: 1.582 ± 0.015
1.902TyrGly: 1.902 ± 0.016
0.793TyrHis: 0.793 ± 0.008
1.672TyrIle: 1.672 ± 0.016
1.553TyrLys: 1.553 ± 0.014
2.619TyrLeu: 2.619 ± 0.018
0.786TyrMet: 0.786 ± 0.008
1.461TyrAsn: 1.461 ± 0.013
1.377TyrPro: 1.377 ± 0.021
1.351TyrGln: 1.351 ± 0.015
1.687TyrArg: 1.687 ± 0.013
2.363TyrSer: 2.363 ± 0.016
1.645TyrThr: 1.645 ± 0.012
1.767TyrVal: 1.767 ± 0.014
0.412TyrTrp: 0.412 ± 0.006
1.198TyrTyr: 1.198 ± 0.014
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 28314 proteins (13466555 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski