Amino acid dipepetide frequency for Daucus carota subsp. sativus (Carrot)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.076AlaAla: 6.076 ± 0.036
1.204AlaCys: 1.204 ± 0.011
3.304AlaAsp: 3.304 ± 0.02
4.263AlaGlu: 4.263 ± 0.024
2.674AlaPhe: 2.674 ± 0.016
4.097AlaGly: 4.097 ± 0.023
1.3AlaHis: 1.3 ± 0.02
3.685AlaIle: 3.685 ± 0.019
3.885AlaLys: 3.885 ± 0.022
6.342AlaLeu: 6.342 ± 0.029
1.707AlaMet: 1.707 ± 0.012
2.673AlaAsn: 2.673 ± 0.015
2.818AlaPro: 2.818 ± 0.021
2.125AlaGln: 2.125 ± 0.015
3.248AlaArg: 3.248 ± 0.019
5.844AlaSer: 5.844 ± 0.025
3.596AlaThr: 3.596 ± 0.018
4.758AlaVal: 4.758 ± 0.023
0.734AlaTrp: 0.734 ± 0.009
1.851AlaTyr: 1.851 ± 0.013
0.001AlaXaa: 0.001 ± 0.0
Cys
1.005CysAla: 1.005 ± 0.009
0.499CysCys: 0.499 ± 0.007
0.918CysAsp: 0.918 ± 0.01
0.92CysGlu: 0.92 ± 0.011
0.873CysPhe: 0.873 ± 0.008
1.346CysGly: 1.346 ± 0.014
0.43CysHis: 0.43 ± 0.007
1.038CysIle: 1.038 ± 0.011
1.135CysLys: 1.135 ± 0.012
1.827CysLeu: 1.827 ± 0.013
0.442CysMet: 0.442 ± 0.006
0.919CysAsn: 0.919 ± 0.009
0.867CysPro: 0.867 ± 0.01
0.597CysGln: 0.597 ± 0.007
1.006CysArg: 1.006 ± 0.01
1.791CysSer: 1.791 ± 0.014
0.915CysThr: 0.915 ± 0.009
1.086CysVal: 1.086 ± 0.009
0.231CysTrp: 0.231 ± 0.005
0.573CysTyr: 0.573 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
3.434AspAla: 3.434 ± 0.019
0.979AspCys: 0.979 ± 0.01
3.821AspAsp: 3.821 ± 0.024
4.174AspGlu: 4.174 ± 0.023
2.367AspPhe: 2.367 ± 0.016
3.724AspGly: 3.724 ± 0.02
1.297AspHis: 1.297 ± 0.009
3.322AspIle: 3.322 ± 0.017
2.98AspLys: 2.98 ± 0.016
5.113AspLeu: 5.113 ± 0.024
1.464AspMet: 1.464 ± 0.011
2.326AspAsn: 2.326 ± 0.015
2.528AspPro: 2.528 ± 0.015
1.897AspGln: 1.897 ± 0.012
2.36AspArg: 2.36 ± 0.016
4.411AspSer: 4.411 ± 0.023
2.422AspThr: 2.422 ± 0.014
3.768AspVal: 3.768 ± 0.02
0.698AspTrp: 0.698 ± 0.008
1.707AspTyr: 1.707 ± 0.013
0.001AspXaa: 0.001 ± 0.0
Glu
4.698GluAla: 4.698 ± 0.028
0.935GluCys: 0.935 ± 0.01
4.091GluAsp: 4.091 ± 0.023
5.864GluGlu: 5.864 ± 0.035
2.421GluPhe: 2.421 ± 0.015
3.617GluGly: 3.617 ± 0.018
1.295GluHis: 1.295 ± 0.011
3.76GluIle: 3.76 ± 0.019
4.801GluLys: 4.801 ± 0.028
5.873GluLeu: 5.873 ± 0.031
1.802GluMet: 1.802 ± 0.014
3.243GluAsn: 3.243 ± 0.021
2.039GluPro: 2.039 ± 0.015
2.161GluGln: 2.161 ± 0.018
3.175GluArg: 3.175 ± 0.021
4.618GluSer: 4.618 ± 0.021
3.094GluThr: 3.094 ± 0.017
4.28GluVal: 4.28 ± 0.022
0.718GluTrp: 0.718 ± 0.007
1.752GluTyr: 1.752 ± 0.012
0.001GluXaa: 0.001 ± 0.0
Phe
2.464PheAla: 2.464 ± 0.015
0.879PheCys: 0.879 ± 0.009
2.424PheAsp: 2.424 ± 0.015
2.335PheGlu: 2.335 ± 0.015
1.841PhePhe: 1.841 ± 0.015
2.984PheGly: 2.984 ± 0.02
1.041PheHis: 1.041 ± 0.008
2.089PheIle: 2.089 ± 0.015
2.202PheLys: 2.202 ± 0.013
4.1PheLeu: 4.1 ± 0.024
0.981PheMet: 0.981 ± 0.009
1.827PheAsn: 1.827 ± 0.013
2.004PhePro: 2.004 ± 0.016
1.522PheGln: 1.522 ± 0.01
2.025PheArg: 2.025 ± 0.015
4.01PheSer: 4.01 ± 0.022
2.08PheThr: 2.08 ± 0.015
2.818PheVal: 2.818 ± 0.016
0.541PheTrp: 0.541 ± 0.007
1.332PheTyr: 1.332 ± 0.013
0.001PheXaa: 0.001 ± 0.0
Gly
3.836GlyAla: 3.836 ± 0.021
1.233GlyCys: 1.233 ± 0.011
3.486GlyAsp: 3.486 ± 0.017
3.663GlyGlu: 3.663 ± 0.023
3.011GlyPhe: 3.011 ± 0.017
5.324GlyGly: 5.324 ± 0.059
1.5GlyHis: 1.5 ± 0.011
3.577GlyIle: 3.577 ± 0.019
4.161GlyLys: 4.161 ± 0.02
5.821GlyLeu: 5.821 ± 0.025
1.541GlyMet: 1.541 ± 0.013
3.209GlyAsn: 3.209 ± 0.02
2.367GlyPro: 2.367 ± 0.02
2.066GlyGln: 2.066 ± 0.016
3.442GlyArg: 3.442 ± 0.021
5.93GlySer: 5.93 ± 0.027
3.3GlyThr: 3.3 ± 0.018
4.246GlyVal: 4.246 ± 0.022
0.879GlyTrp: 0.879 ± 0.009
2.083GlyTyr: 2.083 ± 0.017
0.002GlyXaa: 0.002 ± 0.0
His
1.353HisAla: 1.353 ± 0.009
0.479HisCys: 0.479 ± 0.006
1.182HisAsp: 1.182 ± 0.008
1.265HisGlu: 1.265 ± 0.01
1.033HisPhe: 1.033 ± 0.009
1.599HisGly: 1.599 ± 0.015
0.867HisHis: 0.867 ± 0.011
1.268HisIle: 1.268 ± 0.009
1.266HisLys: 1.266 ± 0.013
2.342HisLeu: 2.342 ± 0.013
0.576HisMet: 0.576 ± 0.007
1.026HisAsn: 1.026 ± 0.009
1.328HisPro: 1.328 ± 0.026
1.013HisGln: 1.013 ± 0.01
1.268HisArg: 1.268 ± 0.01
1.94HisSer: 1.94 ± 0.012
1.024HisThr: 1.024 ± 0.01
1.547HisVal: 1.547 ± 0.01
0.295HisTrp: 0.295 ± 0.005
0.731HisTyr: 0.731 ± 0.008
0.0HisXaa: 0.0 ± 0.0
Ile
3.529IleAla: 3.529 ± 0.019
1.14IleCys: 1.14 ± 0.01
3.015IleAsp: 3.015 ± 0.017
3.187IleGlu: 3.187 ± 0.02
2.307IlePhe: 2.307 ± 0.017
3.369IleGly: 3.369 ± 0.018
1.302IleHis: 1.302 ± 0.011
2.987IleIle: 2.987 ± 0.016
3.117IleLys: 3.117 ± 0.018
5.284IleLeu: 5.284 ± 0.026
1.226IleMet: 1.226 ± 0.011
2.388IleAsn: 2.388 ± 0.016
2.997IlePro: 2.997 ± 0.021
2.047IleGln: 2.047 ± 0.012
2.675IleArg: 2.675 ± 0.014
5.083IleSer: 5.083 ± 0.023
2.764IleThr: 2.764 ± 0.015
3.579IleVal: 3.579 ± 0.022
0.7IleTrp: 0.7 ± 0.009
1.652IleTyr: 1.652 ± 0.012
0.001IleXaa: 0.001 ± 0.0
Lys
4.048LysAla: 4.048 ± 0.02
1.025LysCys: 1.025 ± 0.012
3.479LysAsp: 3.479 ± 0.019
4.523LysGlu: 4.523 ± 0.028
2.351LysPhe: 2.351 ± 0.014
3.616LysGly: 3.616 ± 0.019
1.393LysHis: 1.393 ± 0.011
3.423LysIle: 3.423 ± 0.018
5.034LysLys: 5.034 ± 0.033
6.098LysLeu: 6.098 ± 0.026
1.563LysMet: 1.563 ± 0.012
3.0LysAsn: 3.0 ± 0.017
2.697LysPro: 2.697 ± 0.017
2.336LysGln: 2.336 ± 0.016
3.55LysArg: 3.55 ± 0.021
4.878LysSer: 4.878 ± 0.023
2.997LysThr: 2.997 ± 0.016
3.931LysVal: 3.931 ± 0.016
0.783LysTrp: 0.783 ± 0.008
1.778LysTyr: 1.778 ± 0.013
0.002LysXaa: 0.002 ± 0.0
Leu
6.318LeuAla: 6.318 ± 0.027
1.775LeuCys: 1.775 ± 0.014
5.102LeuAsp: 5.102 ± 0.025
6.131LeuGlu: 6.131 ± 0.029
3.749LeuPhe: 3.749 ± 0.022
5.668LeuGly: 5.668 ± 0.027
2.481LeuHis: 2.481 ± 0.017
4.73LeuIle: 4.73 ± 0.024
6.191LeuLys: 6.191 ± 0.027
9.392LeuLeu: 9.392 ± 0.037
2.175LeuMet: 2.175 ± 0.014
4.135LeuAsn: 4.135 ± 0.019
5.013LeuPro: 5.013 ± 0.023
4.161LeuGln: 4.161 ± 0.023
5.013LeuArg: 5.013 ± 0.021
8.461LeuSer: 8.461 ± 0.037
4.403LeuThr: 4.403 ± 0.021
6.345LeuVal: 6.345 ± 0.027
1.088LeuTrp: 1.088 ± 0.011
2.602LeuTyr: 2.602 ± 0.016
0.002LeuXaa: 0.002 ± 0.0
Met
2.0MetAla: 2.0 ± 0.013
0.335MetCys: 0.335 ± 0.005
1.479MetAsp: 1.479 ± 0.011
1.899MetGlu: 1.899 ± 0.015
0.893MetPhe: 0.893 ± 0.01
1.577MetGly: 1.577 ± 0.012
0.564MetHis: 0.564 ± 0.008
1.308MetIle: 1.308 ± 0.01
1.685MetLys: 1.685 ± 0.012
2.219MetLeu: 2.219 ± 0.016
0.708MetMet: 0.708 ± 0.009
1.149MetAsn: 1.149 ± 0.009
1.06MetPro: 1.06 ± 0.011
0.98MetGln: 0.98 ± 0.01
1.172MetArg: 1.172 ± 0.01
1.91MetSer: 1.91 ± 0.013
1.117MetThr: 1.117 ± 0.009
1.648MetVal: 1.648 ± 0.012
0.268MetTrp: 0.268 ± 0.004
0.652MetTyr: 0.652 ± 0.008
0.0MetXaa: 0.0 ± 0.0
Asn
2.745AsnAla: 2.745 ± 0.016
0.918AsnCys: 0.918 ± 0.009
2.256AsnAsp: 2.256 ± 0.016
2.617AsnGlu: 2.617 ± 0.018
2.062AsnPhe: 2.062 ± 0.012
3.199AsnGly: 3.199 ± 0.024
1.145AsnHis: 1.145 ± 0.01
2.75AsnIle: 2.75 ± 0.016
2.64AsnLys: 2.64 ± 0.017
4.813AsnLeu: 4.813 ± 0.029
1.246AsnMet: 1.246 ± 0.01
2.5AsnAsn: 2.5 ± 0.019
2.299AsnPro: 2.299 ± 0.014
1.812AsnGln: 1.812 ± 0.014
2.052AsnArg: 2.052 ± 0.012
4.178AsnSer: 4.178 ± 0.024
2.191AsnThr: 2.191 ± 0.015
3.064AsnVal: 3.064 ± 0.017
0.572AsnTrp: 0.572 ± 0.006
1.468AsnTyr: 1.468 ± 0.011
0.0AsnXaa: 0.0 ± 0.0
Pro
3.066ProAla: 3.066 ± 0.026
0.753ProCys: 0.753 ± 0.008
2.537ProAsp: 2.537 ± 0.015
3.079ProGlu: 3.079 ± 0.019
1.904ProPhe: 1.904 ± 0.014
2.764ProGly: 2.764 ± 0.019
1.099ProHis: 1.099 ± 0.011
2.264ProIle: 2.264 ± 0.014
2.71ProLys: 2.71 ± 0.015
4.173ProLeu: 4.173 ± 0.021
0.952ProMet: 0.952 ± 0.01
2.208ProAsn: 2.208 ± 0.014
3.8ProPro: 3.8 ± 0.077
1.822ProGln: 1.822 ± 0.017
2.313ProArg: 2.313 ± 0.016
4.917ProSer: 4.917 ± 0.058
2.538ProThr: 2.538 ± 0.017
3.328ProVal: 3.328 ± 0.02
0.584ProTrp: 0.584 ± 0.008
1.345ProTyr: 1.345 ± 0.013
0.001ProXaa: 0.001 ± 0.0
Gln
2.403GlnAla: 2.403 ± 0.016
0.586GlnCys: 0.586 ± 0.007
1.77GlnAsp: 1.77 ± 0.013
2.429GlnGlu: 2.429 ± 0.019
1.416GlnPhe: 1.416 ± 0.011
2.139GlnGly: 2.139 ± 0.015
0.927GlnHis: 0.927 ± 0.009
2.049GlnIle: 2.049 ± 0.011
2.389GlnLys: 2.389 ± 0.017
3.57GlnLeu: 3.57 ± 0.021
0.98GlnMet: 0.98 ± 0.009
1.889GlnAsn: 1.889 ± 0.012
1.712GlnPro: 1.712 ± 0.017
1.989GlnGln: 1.989 ± 0.024
1.946GlnArg: 1.946 ± 0.015
2.901GlnSer: 2.901 ± 0.016
1.814GlnThr: 1.814 ± 0.011
2.431GlnVal: 2.431 ± 0.013
0.429GlnTrp: 0.429 ± 0.006
1.005GlnTyr: 1.005 ± 0.01
0.001GlnXaa: 0.001 ± 0.0
Arg
3.108ArgAla: 3.108 ± 0.016
0.928ArgCys: 0.928 ± 0.01
2.614ArgAsp: 2.614 ± 0.015
3.133ArgGlu: 3.133 ± 0.019
2.1ArgPhe: 2.1 ± 0.013
3.171ArgGly: 3.171 ± 0.02
1.191ArgHis: 1.191 ± 0.01
2.835ArgIle: 2.835 ± 0.016
3.778ArgLys: 3.778 ± 0.022
4.681ArgLeu: 4.681 ± 0.023
1.295ArgMet: 1.295 ± 0.011
2.511ArgAsn: 2.511 ± 0.014
2.237ArgPro: 2.237 ± 0.015
1.773ArgGln: 1.773 ± 0.013
3.611ArgArg: 3.611 ± 0.022
4.233ArgSer: 4.233 ± 0.023
2.421ArgThr: 2.421 ± 0.013
3.263ArgVal: 3.263 ± 0.019
0.674ArgTrp: 0.674 ± 0.008
1.455ArgTyr: 1.455 ± 0.011
0.002ArgXaa: 0.002 ± 0.0
Ser
5.388SerAla: 5.388 ± 0.022
1.732SerCys: 1.732 ± 0.013
4.608SerAsp: 4.608 ± 0.021
4.942SerGlu: 4.942 ± 0.025
3.881SerPhe: 3.881 ± 0.019
6.12SerGly: 6.12 ± 0.027
1.966SerHis: 1.966 ± 0.013
4.517SerIle: 4.517 ± 0.022
4.965SerLys: 4.965 ± 0.025
8.435SerLeu: 8.435 ± 0.034
2.037SerMet: 2.037 ± 0.013
4.197SerAsn: 4.197 ± 0.024
4.612SerPro: 4.612 ± 0.053
3.001SerGln: 3.001 ± 0.019
4.419SerArg: 4.419 ± 0.02
10.656SerSer: 10.656 ± 0.054
4.87SerThr: 4.87 ± 0.024
5.48SerVal: 5.48 ± 0.024
1.144SerTrp: 1.144 ± 0.012
2.448SerTyr: 2.448 ± 0.014
0.001SerXaa: 0.001 ± 0.0
Thr
3.392ThrAla: 3.392 ± 0.016
0.965ThrCys: 0.965 ± 0.009
2.497ThrAsp: 2.497 ± 0.015
2.94ThrGlu: 2.94 ± 0.019
2.056ThrPhe: 2.056 ± 0.013
3.371ThrGly: 3.371 ± 0.019
1.041ThrHis: 1.041 ± 0.011
2.826ThrIle: 2.826 ± 0.017
2.815ThrLys: 2.815 ± 0.017
4.618ThrLeu: 4.618 ± 0.021
1.169ThrMet: 1.169 ± 0.008
2.289ThrAsn: 2.289 ± 0.014
2.687ThrPro: 2.687 ± 0.019
1.597ThrGln: 1.597 ± 0.013
2.454ThrArg: 2.454 ± 0.014
4.789ThrSer: 4.789 ± 0.026
3.017ThrThr: 3.017 ± 0.017
3.389ThrVal: 3.389 ± 0.02
0.628ThrTrp: 0.628 ± 0.007
1.486ThrTyr: 1.486 ± 0.01
0.001ThrXaa: 0.001 ± 0.0
Val
4.761ValAla: 4.761 ± 0.024
1.184ValCys: 1.184 ± 0.01
3.872ValAsp: 3.872 ± 0.019
4.509ValGlu: 4.509 ± 0.024
2.716ValPhe: 2.716 ± 0.017
4.106ValGly: 4.106 ± 0.021
1.563ValHis: 1.563 ± 0.01
3.609ValIle: 3.609 ± 0.02
4.111ValLys: 4.111 ± 0.017
6.305ValLeu: 6.305 ± 0.028
1.614ValMet: 1.614 ± 0.012
2.813ValAsn: 2.813 ± 0.016
3.308ValPro: 3.308 ± 0.018
2.406ValGln: 2.406 ± 0.015
3.056ValArg: 3.056 ± 0.016
5.493ValSer: 5.493 ± 0.024
3.36ValThr: 3.36 ± 0.02
5.032ValVal: 5.032 ± 0.025
0.769ValTrp: 0.769 ± 0.009
2.041ValTyr: 2.041 ± 0.014
0.001ValXaa: 0.001 ± 0.0
Trp
0.737TrpAla: 0.737 ± 0.008
0.232TrpCys: 0.232 ± 0.005
0.69TrpAsp: 0.69 ± 0.008
0.72TrpGlu: 0.72 ± 0.008
0.528TrpPhe: 0.528 ± 0.007
0.719TrpGly: 0.719 ± 0.009
0.266TrpHis: 0.266 ± 0.005
0.717TrpIle: 0.717 ± 0.007
0.926TrpLys: 0.926 ± 0.01
1.175TrpLeu: 1.175 ± 0.01
0.327TrpMet: 0.327 ± 0.006
0.713TrpAsn: 0.713 ± 0.009
0.481TrpPro: 0.481 ± 0.007
0.437TrpGln: 0.437 ± 0.006
0.775TrpArg: 0.775 ± 0.008
0.942TrpSer: 0.942 ± 0.01
0.657TrpThr: 0.657 ± 0.008
0.751TrpVal: 0.751 ± 0.008
0.222TrpTrp: 0.222 ± 0.005
0.36TrpTyr: 0.36 ± 0.006
0.001TrpXaa: 0.001 ± 0.0
Tyr
1.822TyrAla: 1.822 ± 0.015
0.638TyrCys: 0.638 ± 0.007
1.648TyrAsp: 1.648 ± 0.012
1.643TyrGlu: 1.643 ± 0.013
1.318TyrPhe: 1.318 ± 0.012
2.169TyrGly: 2.169 ± 0.018
0.734TyrHis: 0.734 ± 0.008
1.596TyrIle: 1.596 ± 0.012
1.71TyrLys: 1.71 ± 0.013
2.751TyrLeu: 2.751 ± 0.017
0.816TyrMet: 0.816 ± 0.009
1.512TyrAsn: 1.512 ± 0.012
1.306TyrPro: 1.306 ± 0.013
1.029TyrGln: 1.029 ± 0.008
1.451TyrArg: 1.451 ± 0.012
2.432TyrSer: 2.432 ± 0.016
1.445TyrThr: 1.445 ± 0.013
1.884TyrVal: 1.884 ± 0.015
0.417TyrTrp: 0.417 ± 0.006
1.069TyrTyr: 1.069 ± 0.011
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.002XaaLeu: 0.002 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.002XaaSer: 0.002 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.037XaaXaa: 0.037 ± 0.011
Statistics based on 31771 proteins (12603374 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski