Amino acid dipepetide frequency for Lactuca sativa (Garden lettuce)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.13AlaAla: 5.13 ± 0.027
1.11AlaCys: 1.11 ± 0.009
2.744AlaAsp: 2.744 ± 0.016
3.323AlaGlu: 3.323 ± 0.02
2.582AlaPhe: 2.582 ± 0.015
3.511AlaGly: 3.511 ± 0.021
1.202AlaHis: 1.202 ± 0.009
3.772AlaIle: 3.772 ± 0.019
3.599AlaLys: 3.599 ± 0.017
5.835AlaLeu: 5.835 ± 0.024
1.731AlaMet: 1.731 ± 0.012
2.481AlaAsn: 2.481 ± 0.013
2.456AlaPro: 2.456 ± 0.016
1.779AlaGln: 1.779 ± 0.013
2.831AlaArg: 2.831 ± 0.016
5.202AlaSer: 5.202 ± 0.022
3.612AlaThr: 3.612 ± 0.017
4.164AlaVal: 4.164 ± 0.021
0.676AlaTrp: 0.676 ± 0.007
1.772AlaTyr: 1.772 ± 0.011
0.004AlaXaa: 0.004 ± 0.001
Cys
0.856CysAla: 0.856 ± 0.01
0.515CysCys: 0.515 ± 0.007
0.919CysAsp: 0.919 ± 0.009
0.905CysGlu: 0.905 ± 0.009
0.887CysPhe: 0.887 ± 0.008
1.385CysGly: 1.385 ± 0.012
0.467CysHis: 0.467 ± 0.006
1.01CysIle: 1.01 ± 0.009
1.131CysLys: 1.131 ± 0.011
1.841CysLeu: 1.841 ± 0.014
0.459CysMet: 0.459 ± 0.006
0.88CysAsn: 0.88 ± 0.009
0.847CysPro: 0.847 ± 0.012
0.522CysGln: 0.522 ± 0.007
0.953CysArg: 0.953 ± 0.009
1.674CysSer: 1.674 ± 0.014
0.809CysThr: 0.809 ± 0.009
1.117CysVal: 1.117 ± 0.009
0.247CysTrp: 0.247 ± 0.004
0.576CysTyr: 0.576 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
3.17AspAla: 3.17 ± 0.019
0.934AspCys: 0.934 ± 0.009
4.167AspAsp: 4.167 ± 0.027
4.284AspGlu: 4.284 ± 0.021
2.433AspPhe: 2.433 ± 0.015
3.667AspGly: 3.667 ± 0.02
1.397AspHis: 1.397 ± 0.009
3.19AspIle: 3.19 ± 0.018
2.783AspLys: 2.783 ± 0.016
5.234AspLeu: 5.234 ± 0.022
1.478AspMet: 1.478 ± 0.012
2.296AspAsn: 2.296 ± 0.014
2.578AspPro: 2.578 ± 0.014
1.755AspGln: 1.755 ± 0.013
2.158AspArg: 2.158 ± 0.015
4.098AspSer: 4.098 ± 0.019
2.401AspThr: 2.401 ± 0.013
3.918AspVal: 3.918 ± 0.019
0.706AspTrp: 0.706 ± 0.008
1.635AspTyr: 1.635 ± 0.012
0.001AspXaa: 0.001 ± 0.0
Glu
4.265GluAla: 4.265 ± 0.022
0.922GluCys: 0.922 ± 0.009
3.991GluAsp: 3.991 ± 0.021
6.031GluGlu: 6.031 ± 0.037
2.476GluPhe: 2.476 ± 0.015
3.665GluGly: 3.665 ± 0.017
1.275GluHis: 1.275 ± 0.01
3.986GluIle: 3.986 ± 0.021
4.953GluLys: 4.953 ± 0.028
5.802GluLeu: 5.802 ± 0.023
1.91GluMet: 1.91 ± 0.012
3.213GluAsn: 3.213 ± 0.017
1.947GluPro: 1.947 ± 0.013
1.961GluGln: 1.961 ± 0.013
3.075GluArg: 3.075 ± 0.019
4.773GluSer: 4.773 ± 0.022
3.207GluThr: 3.207 ± 0.018
4.246GluVal: 4.246 ± 0.023
0.756GluTrp: 0.756 ± 0.008
1.732GluTyr: 1.732 ± 0.012
0.002GluXaa: 0.002 ± 0.0
Phe
2.32PheAla: 2.32 ± 0.014
0.879PheCys: 0.879 ± 0.009
2.493PheAsp: 2.493 ± 0.013
2.392PheGlu: 2.392 ± 0.013
2.072PhePhe: 2.072 ± 0.014
3.177PheGly: 3.177 ± 0.017
1.181PheHis: 1.181 ± 0.012
2.321PheIle: 2.321 ± 0.015
2.301PheLys: 2.301 ± 0.014
4.381PheLeu: 4.381 ± 0.024
1.092PheMet: 1.092 ± 0.009
1.916PheAsn: 1.916 ± 0.012
2.075PhePro: 2.075 ± 0.013
1.579PheGln: 1.579 ± 0.011
2.004PheArg: 2.004 ± 0.013
3.989PheSer: 3.989 ± 0.019
2.209PheThr: 2.209 ± 0.015
2.854PheVal: 2.854 ± 0.016
0.562PheTrp: 0.562 ± 0.008
1.341PheTyr: 1.341 ± 0.011
0.001PheXaa: 0.001 ± 0.0
Gly
3.419GlyAla: 3.419 ± 0.021
1.302GlyCys: 1.302 ± 0.012
3.509GlyAsp: 3.509 ± 0.018
3.613GlyGlu: 3.613 ± 0.017
3.275GlyPhe: 3.275 ± 0.015
6.0GlyGly: 6.0 ± 0.047
1.464GlyHis: 1.464 ± 0.011
3.725GlyIle: 3.725 ± 0.019
4.103GlyLys: 4.103 ± 0.02
5.656GlyLeu: 5.656 ± 0.022
1.577GlyMet: 1.577 ± 0.011
3.253GlyAsn: 3.253 ± 0.021
2.247GlyPro: 2.247 ± 0.015
1.914GlyGln: 1.914 ± 0.014
3.372GlyArg: 3.372 ± 0.017
5.685GlySer: 5.685 ± 0.026
3.067GlyThr: 3.067 ± 0.018
4.527GlyVal: 4.527 ± 0.02
0.894GlyTrp: 0.894 ± 0.009
2.162GlyTyr: 2.162 ± 0.016
0.004GlyXaa: 0.004 ± 0.001
His
1.273HisAla: 1.273 ± 0.009
0.465HisCys: 0.465 ± 0.006
1.302HisAsp: 1.302 ± 0.01
1.483HisGlu: 1.483 ± 0.011
1.035HisPhe: 1.035 ± 0.009
1.754HisGly: 1.754 ± 0.015
1.103HisHis: 1.103 ± 0.012
1.293HisIle: 1.293 ± 0.011
1.323HisLys: 1.323 ± 0.01
2.515HisLeu: 2.515 ± 0.016
0.615HisMet: 0.615 ± 0.007
1.108HisAsn: 1.108 ± 0.01
1.374HisPro: 1.374 ± 0.01
1.067HisGln: 1.067 ± 0.009
1.391HisArg: 1.391 ± 0.01
1.837HisSer: 1.837 ± 0.014
1.099HisThr: 1.099 ± 0.01
1.63HisVal: 1.63 ± 0.011
0.301HisTrp: 0.301 ± 0.006
0.72HisTyr: 0.72 ± 0.009
0.001HisXaa: 0.001 ± 0.0
Ile
3.492IleAla: 3.492 ± 0.018
1.145IleCys: 1.145 ± 0.011
3.228IleAsp: 3.228 ± 0.016
3.456IleGlu: 3.456 ± 0.017
2.399IlePhe: 2.399 ± 0.015
3.763IleGly: 3.763 ± 0.018
1.49IleHis: 1.49 ± 0.011
3.236IleIle: 3.236 ± 0.018
3.406IleLys: 3.406 ± 0.016
5.533IleLeu: 5.533 ± 0.024
1.321IleMet: 1.321 ± 0.011
2.556IleAsn: 2.556 ± 0.015
3.294IlePro: 3.294 ± 0.023
2.14IleGln: 2.14 ± 0.014
2.73IleArg: 2.73 ± 0.015
5.084IleSer: 5.084 ± 0.022
3.046IleThr: 3.046 ± 0.016
3.739IleVal: 3.739 ± 0.015
0.771IleTrp: 0.771 ± 0.008
1.674IleTyr: 1.674 ± 0.012
0.002IleXaa: 0.002 ± 0.0
Lys
3.778LysAla: 3.778 ± 0.021
0.992LysCys: 0.992 ± 0.008
3.44LysAsp: 3.44 ± 0.018
4.841LysGlu: 4.841 ± 0.03
2.279LysPhe: 2.279 ± 0.012
3.706LysGly: 3.706 ± 0.017
1.481LysHis: 1.481 ± 0.011
3.694LysIle: 3.694 ± 0.019
5.39LysLys: 5.39 ± 0.029
6.219LysLeu: 6.219 ± 0.025
1.732LysMet: 1.732 ± 0.012
3.081LysAsn: 3.081 ± 0.017
2.757LysPro: 2.757 ± 0.017
2.302LysGln: 2.302 ± 0.015
3.561LysArg: 3.561 ± 0.019
4.974LysSer: 4.974 ± 0.026
3.271LysThr: 3.271 ± 0.017
4.067LysVal: 4.067 ± 0.021
0.866LysTrp: 0.866 ± 0.009
1.702LysTyr: 1.702 ± 0.012
0.002LysXaa: 0.002 ± 0.0
Leu
5.729LeuAla: 5.729 ± 0.025
1.771LeuCys: 1.771 ± 0.014
5.184LeuAsp: 5.184 ± 0.024
6.277LeuGlu: 6.277 ± 0.031
3.915LeuPhe: 3.915 ± 0.017
5.553LeuGly: 5.553 ± 0.025
2.728LeuHis: 2.728 ± 0.016
5.028LeuIle: 5.028 ± 0.022
6.538LeuLys: 6.538 ± 0.03
9.642LeuLeu: 9.642 ± 0.04
2.333LeuMet: 2.333 ± 0.012
4.17LeuAsn: 4.17 ± 0.018
4.991LeuPro: 4.991 ± 0.021
4.086LeuGln: 4.086 ± 0.022
4.86LeuArg: 4.86 ± 0.024
8.392LeuSer: 8.392 ± 0.04
4.793LeuThr: 4.793 ± 0.023
6.293LeuVal: 6.293 ± 0.024
1.137LeuTrp: 1.137 ± 0.012
2.548LeuTyr: 2.548 ± 0.016
0.003LeuXaa: 0.003 ± 0.001
Met
2.108MetAla: 2.108 ± 0.012
0.35MetCys: 0.35 ± 0.005
1.525MetAsp: 1.525 ± 0.011
2.226MetGlu: 2.226 ± 0.014
0.944MetPhe: 0.944 ± 0.008
1.707MetGly: 1.707 ± 0.013
0.547MetHis: 0.547 ± 0.006
1.476MetIle: 1.476 ± 0.011
1.985MetLys: 1.985 ± 0.015
2.294MetLeu: 2.294 ± 0.014
0.845MetMet: 0.845 ± 0.009
1.245MetAsn: 1.245 ± 0.01
1.007MetPro: 1.007 ± 0.009
0.865MetGln: 0.865 ± 0.008
1.173MetArg: 1.173 ± 0.009
1.929MetSer: 1.929 ± 0.012
1.167MetThr: 1.167 ± 0.009
1.899MetVal: 1.899 ± 0.011
0.288MetTrp: 0.288 ± 0.005
0.658MetTyr: 0.658 ± 0.008
0.001MetXaa: 0.001 ± 0.0
Asn
2.423AsnAla: 2.423 ± 0.013
0.823AsnCys: 0.823 ± 0.009
2.383AsnAsp: 2.383 ± 0.015
2.846AsnGlu: 2.846 ± 0.018
2.027AsnPhe: 2.027 ± 0.013
3.457AsnGly: 3.457 ± 0.016
1.336AsnHis: 1.336 ± 0.009
2.822AsnIle: 2.822 ± 0.015
2.737AsnLys: 2.737 ± 0.016
5.038AsnLeu: 5.038 ± 0.027
1.288AsnMet: 1.288 ± 0.01
2.845AsnAsn: 2.845 ± 0.02
2.542AsnPro: 2.542 ± 0.013
1.886AsnGln: 1.886 ± 0.013
2.125AsnArg: 2.125 ± 0.015
3.987AsnSer: 3.987 ± 0.019
2.334AsnThr: 2.334 ± 0.015
2.918AsnVal: 2.918 ± 0.015
0.614AsnTrp: 0.614 ± 0.007
1.374AsnTyr: 1.374 ± 0.011
0.001AsnXaa: 0.001 ± 0.0
Pro
2.446ProAla: 2.446 ± 0.016
0.699ProCys: 0.699 ± 0.008
2.365ProAsp: 2.365 ± 0.016
3.013ProGlu: 3.013 ± 0.017
2.021ProPhe: 2.021 ± 0.013
2.418ProGly: 2.418 ± 0.016
1.135ProHis: 1.135 ± 0.01
2.608ProIle: 2.608 ± 0.016
2.838ProLys: 2.838 ± 0.018
4.164ProLeu: 4.164 ± 0.018
1.043ProMet: 1.043 ± 0.009
2.474ProAsn: 2.474 ± 0.017
4.211ProPro: 4.211 ± 0.068
1.748ProGln: 1.748 ± 0.015
2.145ProArg: 2.145 ± 0.013
4.788ProSer: 4.788 ± 0.026
2.821ProThr: 2.821 ± 0.017
3.018ProVal: 3.018 ± 0.019
0.585ProTrp: 0.585 ± 0.007
1.315ProTyr: 1.315 ± 0.013
0.003ProXaa: 0.003 ± 0.0
Gln
1.95GlnAla: 1.95 ± 0.013
0.52GlnCys: 0.52 ± 0.007
1.64GlnAsp: 1.64 ± 0.011
2.374GlnGlu: 2.374 ± 0.017
1.365GlnPhe: 1.365 ± 0.012
1.945GlnGly: 1.945 ± 0.013
0.925GlnHis: 0.925 ± 0.01
2.105GlnIle: 2.105 ± 0.011
2.393GlnLys: 2.393 ± 0.015
3.438GlnLeu: 3.438 ± 0.019
0.997GlnMet: 0.997 ± 0.009
1.798GlnAsn: 1.798 ± 0.013
1.691GlnPro: 1.691 ± 0.014
1.952GlnGln: 1.952 ± 0.025
1.875GlnArg: 1.875 ± 0.012
2.724GlnSer: 2.724 ± 0.014
1.808GlnThr: 1.808 ± 0.012
2.243GlnVal: 2.243 ± 0.012
0.463GlnTrp: 0.463 ± 0.006
0.886GlnTyr: 0.886 ± 0.008
0.0GlnXaa: 0.0 ± 0.0
Arg
2.679ArgAla: 2.679 ± 0.015
0.932ArgCys: 0.932 ± 0.01
2.419ArgAsp: 2.419 ± 0.013
2.928ArgGlu: 2.928 ± 0.016
2.292ArgPhe: 2.292 ± 0.013
2.922ArgGly: 2.922 ± 0.018
1.209ArgHis: 1.209 ± 0.01
2.878ArgIle: 2.878 ± 0.015
3.679ArgLys: 3.679 ± 0.018
4.797ArgLeu: 4.797 ± 0.022
1.316ArgMet: 1.316 ± 0.01
2.431ArgAsn: 2.431 ± 0.017
2.016ArgPro: 2.016 ± 0.012
1.588ArgGln: 1.588 ± 0.013
3.541ArgArg: 3.541 ± 0.02
4.1ArgSer: 4.1 ± 0.025
2.276ArgThr: 2.276 ± 0.016
3.205ArgVal: 3.205 ± 0.015
0.732ArgTrp: 0.732 ± 0.008
1.447ArgTyr: 1.447 ± 0.01
0.002ArgXaa: 0.002 ± 0.0
Ser
4.364SerAla: 4.364 ± 0.019
1.647SerCys: 1.647 ± 0.012
4.371SerAsp: 4.371 ± 0.022
4.537SerGlu: 4.537 ± 0.022
4.171SerPhe: 4.171 ± 0.021
5.751SerGly: 5.751 ± 0.027
2.007SerHis: 2.007 ± 0.013
4.984SerIle: 4.984 ± 0.019
5.04SerLys: 5.04 ± 0.024
8.489SerLeu: 8.489 ± 0.033
2.265SerMet: 2.265 ± 0.013
4.37SerAsn: 4.37 ± 0.019
4.323SerPro: 4.323 ± 0.028
2.869SerGln: 2.869 ± 0.018
4.136SerArg: 4.136 ± 0.02
10.482SerSer: 10.482 ± 0.043
4.818SerThr: 4.818 ± 0.022
5.015SerVal: 5.015 ± 0.021
1.169SerTrp: 1.169 ± 0.01
2.448SerTyr: 2.448 ± 0.014
0.002SerXaa: 0.002 ± 0.0
Thr
3.018ThrAla: 3.018 ± 0.017
0.966ThrCys: 0.966 ± 0.01
2.383ThrAsp: 2.383 ± 0.015
2.792ThrGlu: 2.792 ± 0.016
2.208ThrPhe: 2.208 ± 0.014
3.322ThrGly: 3.322 ± 0.019
1.205ThrHis: 1.205 ± 0.009
3.2ThrIle: 3.2 ± 0.019
3.056ThrLys: 3.056 ± 0.017
4.83ThrLeu: 4.83 ± 0.02
1.325ThrMet: 1.325 ± 0.01
2.526ThrAsn: 2.526 ± 0.014
2.881ThrPro: 2.881 ± 0.019
1.685ThrGln: 1.685 ± 0.011
2.426ThrArg: 2.426 ± 0.012
4.922ThrSer: 4.922 ± 0.021
3.797ThrThr: 3.797 ± 0.024
3.154ThrVal: 3.154 ± 0.018
0.685ThrTrp: 0.685 ± 0.007
1.501ThrTyr: 1.501 ± 0.013
0.002ThrXaa: 0.002 ± 0.0
Val
4.642ValAla: 4.642 ± 0.023
1.153ValCys: 1.153 ± 0.009
3.918ValAsp: 3.918 ± 0.019
4.398ValGlu: 4.398 ± 0.019
2.87ValPhe: 2.87 ± 0.016
4.172ValGly: 4.172 ± 0.021
1.5ValHis: 1.5 ± 0.011
3.765ValIle: 3.765 ± 0.019
4.176ValLys: 4.176 ± 0.023
6.171ValLeu: 6.171 ± 0.024
1.701ValMet: 1.701 ± 0.013
2.948ValAsn: 2.948 ± 0.015
2.889ValPro: 2.889 ± 0.015
2.044ValGln: 2.044 ± 0.013
2.857ValArg: 2.857 ± 0.016
5.385ValSer: 5.385 ± 0.022
3.268ValThr: 3.268 ± 0.016
5.404ValVal: 5.404 ± 0.026
0.797ValTrp: 0.797 ± 0.008
2.083ValTyr: 2.083 ± 0.012
0.002ValXaa: 0.002 ± 0.0
Trp
0.717TrpAla: 0.717 ± 0.008
0.265TrpCys: 0.265 ± 0.004
0.682TrpAsp: 0.682 ± 0.008
0.766TrpGlu: 0.766 ± 0.007
0.577TrpPhe: 0.577 ± 0.007
0.754TrpGly: 0.754 ± 0.008
0.279TrpHis: 0.279 ± 0.005
0.76TrpIle: 0.76 ± 0.007
0.991TrpLys: 0.991 ± 0.008
1.237TrpLeu: 1.237 ± 0.012
0.368TrpMet: 0.368 ± 0.005
0.741TrpAsn: 0.741 ± 0.008
0.466TrpPro: 0.466 ± 0.006
0.41TrpGln: 0.41 ± 0.005
0.807TrpArg: 0.807 ± 0.008
0.993TrpSer: 0.993 ± 0.011
0.621TrpThr: 0.621 ± 0.007
0.876TrpVal: 0.876 ± 0.01
0.247TrpTrp: 0.247 ± 0.005
0.359TrpTyr: 0.359 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.732TyrAla: 1.732 ± 0.012
0.61TyrCys: 0.61 ± 0.006
1.622TyrAsp: 1.622 ± 0.011
1.701TyrGlu: 1.701 ± 0.013
1.31TyrPhe: 1.31 ± 0.01
2.136TyrGly: 2.136 ± 0.017
0.756TyrHis: 0.756 ± 0.008
1.649TyrIle: 1.649 ± 0.011
1.712TyrLys: 1.712 ± 0.012
2.852TyrLeu: 2.852 ± 0.017
0.843TyrMet: 0.843 ± 0.008
1.469TyrAsn: 1.469 ± 0.012
1.262TyrPro: 1.262 ± 0.01
0.94TyrGln: 0.94 ± 0.009
1.367TyrArg: 1.367 ± 0.011
2.238TyrSer: 2.238 ± 0.014
1.458TyrThr: 1.458 ± 0.012
1.855TyrVal: 1.855 ± 0.014
0.422TyrTrp: 0.422 ± 0.005
1.014TyrTyr: 1.014 ± 0.009
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.003XaaAla: 0.003 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.002XaaPhe: 0.002 ± 0.0
0.004XaaGly: 0.004 ± 0.001
0.001XaaHis: 0.001 ± 0.0
0.002XaaIle: 0.002 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.003XaaLeu: 0.003 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.003XaaPro: 0.003 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.002XaaSer: 0.002 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.002XaaVal: 0.002 ± 0.0
0.001XaaTrp: 0.001 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.23XaaXaa: 0.23 ± 0.034
Statistics based on 37927 proteins (13441818 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski