Amino acid dipepetide frequency for Elaeis guineensis var. tenera (Oil palm)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.342AlaAla: 8.342 ± 0.042
1.414AlaCys: 1.414 ± 0.01
3.564AlaAsp: 3.564 ± 0.018
4.699AlaGlu: 4.699 ± 0.023
2.893AlaPhe: 2.893 ± 0.018
5.037AlaGly: 5.037 ± 0.024
1.465AlaHis: 1.465 ± 0.01
3.864AlaIle: 3.864 ± 0.018
3.823AlaLys: 3.823 ± 0.018
7.248AlaLeu: 7.248 ± 0.03
1.964AlaMet: 1.964 ± 0.014
2.568AlaAsn: 2.568 ± 0.014
3.432AlaPro: 3.432 ± 0.022
2.247AlaGln: 2.247 ± 0.014
4.083AlaArg: 4.083 ± 0.02
6.855AlaSer: 6.855 ± 0.024
3.83AlaThr: 3.83 ± 0.018
5.411AlaVal: 5.411 ± 0.022
0.841AlaTrp: 0.841 ± 0.008
1.885AlaTyr: 1.885 ± 0.012
0.002AlaXaa: 0.002 ± 0.0
Cys
1.06CysAla: 1.06 ± 0.011
0.53CysCys: 0.53 ± 0.006
0.861CysAsp: 0.861 ± 0.009
0.866CysGlu: 0.866 ± 0.007
0.82CysPhe: 0.82 ± 0.007
1.415CysGly: 1.415 ± 0.011
0.509CysHis: 0.509 ± 0.006
0.984CysIle: 0.984 ± 0.009
1.056CysLys: 1.056 ± 0.01
1.861CysLeu: 1.861 ± 0.012
0.471CysMet: 0.471 ± 0.006
0.784CysAsn: 0.784 ± 0.007
0.976CysPro: 0.976 ± 0.01
0.627CysGln: 0.627 ± 0.007
1.169CysArg: 1.169 ± 0.011
1.919CysSer: 1.919 ± 0.014
0.869CysThr: 0.869 ± 0.008
0.976CysVal: 0.976 ± 0.008
0.247CysTrp: 0.247 ± 0.004
0.527CysTyr: 0.527 ± 0.007
0.001CysXaa: 0.001 ± 0.0
Asp
3.969AspAla: 3.969 ± 0.02
0.933AspCys: 0.933 ± 0.01
3.52AspAsp: 3.52 ± 0.021
3.9AspGlu: 3.9 ± 0.019
2.187AspPhe: 2.187 ± 0.013
4.23AspGly: 4.23 ± 0.021
1.29AspHis: 1.29 ± 0.01
2.841AspIle: 2.841 ± 0.013
2.5AspLys: 2.5 ± 0.015
5.153AspLeu: 5.153 ± 0.021
1.363AspMet: 1.363 ± 0.01
1.837AspAsn: 1.837 ± 0.013
2.853AspPro: 2.853 ± 0.016
1.69AspGln: 1.69 ± 0.012
2.729AspArg: 2.729 ± 0.017
4.141AspSer: 4.141 ± 0.018
2.064AspThr: 2.064 ± 0.012
3.583AspVal: 3.583 ± 0.02
0.712AspTrp: 0.712 ± 0.008
1.437AspTyr: 1.437 ± 0.011
0.002AspXaa: 0.002 ± 0.0
Glu
5.206GluAla: 5.206 ± 0.024
0.916GluCys: 0.916 ± 0.009
3.89GluAsp: 3.89 ± 0.021
6.469GluGlu: 6.469 ± 0.035
2.226GluPhe: 2.226 ± 0.014
4.086GluGly: 4.086 ± 0.019
1.346GluHis: 1.346 ± 0.01
3.536GluIle: 3.536 ± 0.018
4.523GluLys: 4.523 ± 0.027
5.939GluLeu: 5.939 ± 0.028
1.819GluMet: 1.819 ± 0.013
2.801GluAsn: 2.801 ± 0.016
2.288GluPro: 2.288 ± 0.014
2.24GluGln: 2.24 ± 0.013
3.787GluArg: 3.787 ± 0.021
4.454GluSer: 4.454 ± 0.02
2.847GluThr: 2.847 ± 0.015
4.223GluVal: 4.223 ± 0.019
0.753GluTrp: 0.753 ± 0.007
1.565GluTyr: 1.565 ± 0.011
0.002GluXaa: 0.002 ± 0.0
Phe
2.574PheAla: 2.574 ± 0.015
0.839PheCys: 0.839 ± 0.007
2.222PheAsp: 2.222 ± 0.013
2.123PheGlu: 2.123 ± 0.013
1.843PhePhe: 1.843 ± 0.012
2.988PheGly: 2.988 ± 0.022
1.131PheHis: 1.131 ± 0.009
1.863PheIle: 1.863 ± 0.013
1.77PheLys: 1.77 ± 0.013
4.2PheLeu: 4.2 ± 0.02
0.9PheMet: 0.9 ± 0.008
1.455PheAsn: 1.455 ± 0.01
2.103PhePro: 2.103 ± 0.014
1.485PheGln: 1.485 ± 0.01
2.104PheArg: 2.104 ± 0.013
3.89PheSer: 3.89 ± 0.02
1.735PheThr: 1.735 ± 0.012
2.545PheVal: 2.545 ± 0.015
0.549PheTrp: 0.549 ± 0.007
1.158PheTyr: 1.158 ± 0.01
0.002PheXaa: 0.002 ± 0.0
Gly
4.547GlyAla: 4.547 ± 0.021
1.338GlyCys: 1.338 ± 0.013
3.63GlyAsp: 3.63 ± 0.016
3.974GlyGlu: 3.974 ± 0.018
3.052GlyPhe: 3.052 ± 0.018
6.393GlyGly: 6.393 ± 0.048
1.671GlyHis: 1.671 ± 0.011
3.544GlyIle: 3.544 ± 0.017
3.946GlyLys: 3.946 ± 0.018
6.155GlyLeu: 6.155 ± 0.023
1.678GlyMet: 1.678 ± 0.012
2.967GlyAsn: 2.967 ± 0.016
2.787GlyPro: 2.787 ± 0.015
2.176GlyGln: 2.176 ± 0.014
4.313GlyArg: 4.313 ± 0.022
6.438GlySer: 6.438 ± 0.029
3.372GlyThr: 3.372 ± 0.016
4.208GlyVal: 4.208 ± 0.021
0.995GlyTrp: 0.995 ± 0.009
2.049GlyTyr: 2.049 ± 0.016
0.004GlyXaa: 0.004 ± 0.001
His
1.678HisAla: 1.678 ± 0.011
0.539HisCys: 0.539 ± 0.007
1.222HisAsp: 1.222 ± 0.01
1.337HisGlu: 1.337 ± 0.009
1.004HisPhe: 1.004 ± 0.008
1.941HisGly: 1.941 ± 0.014
1.033HisHis: 1.033 ± 0.012
1.165HisIle: 1.165 ± 0.01
1.104HisLys: 1.104 ± 0.009
2.584HisLeu: 2.584 ± 0.015
0.583HisMet: 0.583 ± 0.006
0.842HisAsn: 0.842 ± 0.008
1.578HisPro: 1.578 ± 0.011
1.074HisGln: 1.074 ± 0.009
1.566HisArg: 1.566 ± 0.011
2.017HisSer: 2.017 ± 0.014
0.915HisThr: 0.915 ± 0.008
1.582HisVal: 1.582 ± 0.011
0.304HisTrp: 0.304 ± 0.004
0.68HisTyr: 0.68 ± 0.007
0.001HisXaa: 0.001 ± 0.0
Ile
3.648IleAla: 3.648 ± 0.016
1.053IleCys: 1.053 ± 0.01
2.692IleAsp: 2.692 ± 0.014
2.95IleGlu: 2.95 ± 0.015
2.05IlePhe: 2.05 ± 0.014
3.269IleGly: 3.269 ± 0.015
1.287IleHis: 1.287 ± 0.011
2.533IleIle: 2.533 ± 0.016
2.622IleLys: 2.622 ± 0.013
5.006IleLeu: 5.006 ± 0.021
1.084IleMet: 1.084 ± 0.009
1.91IleAsn: 1.91 ± 0.011
2.945IlePro: 2.945 ± 0.022
1.902IleGln: 1.902 ± 0.012
2.638IleArg: 2.638 ± 0.014
4.652IleSer: 4.652 ± 0.019
2.373IleThr: 2.373 ± 0.013
3.102IleVal: 3.102 ± 0.014
0.661IleTrp: 0.661 ± 0.007
1.417IleTyr: 1.417 ± 0.011
0.002IleXaa: 0.002 ± 0.0
Lys
4.049LysAla: 4.049 ± 0.02
0.878LysCys: 0.878 ± 0.009
3.067LysAsp: 3.067 ± 0.018
4.421LysGlu: 4.421 ± 0.024
1.873LysPhe: 1.873 ± 0.012
3.461LysGly: 3.461 ± 0.014
1.298LysHis: 1.298 ± 0.011
2.864LysIle: 2.864 ± 0.015
4.223LysLys: 4.223 ± 0.025
5.382LysLeu: 5.382 ± 0.023
1.408LysMet: 1.408 ± 0.011
2.311LysAsn: 2.311 ± 0.015
2.668LysPro: 2.668 ± 0.014
2.119LysGln: 2.119 ± 0.014
3.46LysArg: 3.46 ± 0.019
4.156LysSer: 4.156 ± 0.022
2.495LysThr: 2.495 ± 0.014
3.536LysVal: 3.536 ± 0.018
0.701LysTrp: 0.701 ± 0.007
1.443LysTyr: 1.443 ± 0.011
0.003LysXaa: 0.003 ± 0.001
Leu
7.192LeuAla: 7.192 ± 0.025
1.846LeuCys: 1.846 ± 0.012
5.179LeuAsp: 5.179 ± 0.022
6.378LeuGlu: 6.378 ± 0.025
3.749LeuPhe: 3.749 ± 0.021
6.026LeuGly: 6.026 ± 0.026
2.772LeuHis: 2.772 ± 0.015
4.212LeuIle: 4.212 ± 0.02
5.583LeuLys: 5.583 ± 0.024
10.451LeuLeu: 10.451 ± 0.038
2.134LeuMet: 2.134 ± 0.014
3.484LeuAsn: 3.484 ± 0.015
5.622LeuPro: 5.622 ± 0.025
4.433LeuGln: 4.433 ± 0.022
5.891LeuArg: 5.891 ± 0.021
8.664LeuSer: 8.664 ± 0.037
4.154LeuThr: 4.154 ± 0.02
6.203LeuVal: 6.203 ± 0.025
1.146LeuTrp: 1.146 ± 0.009
2.413LeuTyr: 2.413 ± 0.013
0.003LeuXaa: 0.003 ± 0.0
Met
2.336MetAla: 2.336 ± 0.013
0.302MetCys: 0.302 ± 0.005
1.509MetAsp: 1.509 ± 0.01
2.106MetGlu: 2.106 ± 0.013
0.743MetPhe: 0.743 ± 0.008
1.708MetGly: 1.708 ± 0.013
0.579MetHis: 0.579 ± 0.006
1.129MetIle: 1.129 ± 0.01
1.444MetLys: 1.444 ± 0.01
2.247MetLeu: 2.247 ± 0.012
0.675MetMet: 0.675 ± 0.007
0.931MetAsn: 0.931 ± 0.008
1.219MetPro: 1.219 ± 0.011
0.984MetGln: 0.984 ± 0.009
1.294MetArg: 1.294 ± 0.01
1.721MetSer: 1.721 ± 0.013
1.034MetThr: 1.034 ± 0.01
1.685MetVal: 1.685 ± 0.012
0.261MetTrp: 0.261 ± 0.005
0.579MetTyr: 0.579 ± 0.007
0.001MetXaa: 0.001 ± 0.0
Asn
2.682AsnAla: 2.682 ± 0.013
0.784AsnCys: 0.784 ± 0.009
1.865AsnAsp: 1.865 ± 0.01
2.209AsnGlu: 2.209 ± 0.014
1.617AsnPhe: 1.617 ± 0.011
3.021AsnGly: 3.021 ± 0.015
1.016AsnHis: 1.016 ± 0.009
2.194AsnIle: 2.194 ± 0.013
2.004AsnLys: 2.004 ± 0.012
4.16AsnLeu: 4.16 ± 0.024
1.02AsnMet: 1.02 ± 0.01
1.782AsnAsn: 1.782 ± 0.015
2.277AsnPro: 2.277 ± 0.014
1.524AsnGln: 1.524 ± 0.01
1.891AsnArg: 1.891 ± 0.011
3.57AsnSer: 3.57 ± 0.019
1.676AsnThr: 1.676 ± 0.011
2.432AsnVal: 2.432 ± 0.012
0.521AsnTrp: 0.521 ± 0.007
1.125AsnTyr: 1.125 ± 0.01
0.001AsnXaa: 0.001 ± 0.0
Pro
3.975ProAla: 3.975 ± 0.024
0.833ProCys: 0.833 ± 0.009
2.687ProAsp: 2.687 ± 0.016
3.283ProGlu: 3.283 ± 0.016
2.076ProPhe: 2.076 ± 0.015
3.057ProGly: 3.057 ± 0.016
1.213ProHis: 1.213 ± 0.01
2.277ProIle: 2.277 ± 0.011
2.585ProLys: 2.585 ± 0.015
4.678ProLeu: 4.678 ± 0.022
1.049ProMet: 1.049 ± 0.009
2.135ProAsn: 2.135 ± 0.014
4.763ProPro: 4.763 ± 0.046
1.872ProGln: 1.872 ± 0.015
2.784ProArg: 2.784 ± 0.016
5.889ProSer: 5.889 ± 0.029
2.698ProThr: 2.698 ± 0.016
3.263ProVal: 3.263 ± 0.019
0.644ProTrp: 0.644 ± 0.007
1.321ProTyr: 1.321 ± 0.009
0.004ProXaa: 0.004 ± 0.001
Gln
2.624GlnAla: 2.624 ± 0.014
0.602GlnCys: 0.602 ± 0.007
1.645GlnAsp: 1.645 ± 0.011
2.48GlnGlu: 2.48 ± 0.015
1.285GlnPhe: 1.285 ± 0.01
2.146GlnGly: 2.146 ± 0.013
0.98GlnHis: 0.98 ± 0.01
1.925GlnIle: 1.925 ± 0.012
2.304GlnLys: 2.304 ± 0.015
3.674GlnLeu: 3.674 ± 0.017
1.017GlnMet: 1.017 ± 0.009
1.613GlnAsn: 1.613 ± 0.011
1.856GlnPro: 1.856 ± 0.016
2.256GlnGln: 2.256 ± 0.036
2.275GlnArg: 2.275 ± 0.015
2.834GlnSer: 2.834 ± 0.016
1.642GlnThr: 1.642 ± 0.011
2.261GlnVal: 2.261 ± 0.013
0.458GlnTrp: 0.458 ± 0.005
0.905GlnTyr: 0.905 ± 0.009
0.001GlnXaa: 0.001 ± 0.0
Arg
3.88ArgAla: 3.88 ± 0.02
1.115ArgCys: 1.115 ± 0.01
2.834ArgAsp: 2.834 ± 0.017
3.635ArgGlu: 3.635 ± 0.018
2.254ArgPhe: 2.254 ± 0.013
3.634ArgGly: 3.634 ± 0.018
1.453ArgHis: 1.453 ± 0.011
2.888ArgIle: 2.888 ± 0.014
3.75ArgLys: 3.75 ± 0.02
5.483ArgLeu: 5.483 ± 0.021
1.455ArgMet: 1.455 ± 0.01
2.307ArgAsn: 2.307 ± 0.014
2.879ArgPro: 2.879 ± 0.017
2.004ArgGln: 2.004 ± 0.013
4.871ArgArg: 4.871 ± 0.028
5.006ArgSer: 5.006 ± 0.025
2.575ArgThr: 2.575 ± 0.014
3.374ArgVal: 3.374 ± 0.016
0.874ArgTrp: 0.874 ± 0.009
1.518ArgTyr: 1.518 ± 0.011
0.002ArgXaa: 0.002 ± 0.0
Ser
6.093SerAla: 6.093 ± 0.024
1.742SerCys: 1.742 ± 0.012
4.476SerAsp: 4.476 ± 0.018
4.742SerGlu: 4.742 ± 0.022
3.772SerPhe: 3.772 ± 0.016
6.333SerGly: 6.333 ± 0.027
2.113SerHis: 2.113 ± 0.014
4.35SerIle: 4.35 ± 0.018
4.568SerLys: 4.568 ± 0.019
8.783SerLeu: 8.783 ± 0.033
2.173SerMet: 2.173 ± 0.013
3.707SerAsn: 3.707 ± 0.018
5.137SerPro: 5.137 ± 0.029
3.07SerGln: 3.07 ± 0.019
4.85SerArg: 4.85 ± 0.023
11.394SerSer: 11.394 ± 0.053
4.568SerThr: 4.568 ± 0.02
5.097SerVal: 5.097 ± 0.021
1.178SerTrp: 1.178 ± 0.011
2.221SerTyr: 2.221 ± 0.015
0.004SerXaa: 0.004 ± 0.001
Thr
3.694ThrAla: 3.694 ± 0.017
0.892ThrCys: 0.892 ± 0.008
2.24ThrAsp: 2.24 ± 0.014
2.654ThrGlu: 2.654 ± 0.015
1.872ThrPhe: 1.872 ± 0.012
3.37ThrGly: 3.37 ± 0.017
0.951ThrHis: 0.951 ± 0.009
2.492ThrIle: 2.492 ± 0.015
2.369ThrLys: 2.369 ± 0.014
4.335ThrLeu: 4.335 ± 0.019
1.128ThrMet: 1.128 ± 0.008
1.844ThrAsn: 1.844 ± 0.012
2.567ThrPro: 2.567 ± 0.016
1.405ThrGln: 1.405 ± 0.01
2.306ThrArg: 2.306 ± 0.012
4.435ThrSer: 4.435 ± 0.021
2.696ThrThr: 2.696 ± 0.016
3.204ThrVal: 3.204 ± 0.017
0.599ThrTrp: 0.599 ± 0.007
1.283ThrTyr: 1.283 ± 0.01
0.002ThrXaa: 0.002 ± 0.0
Val
5.245ValAla: 5.245 ± 0.02
1.14ValCys: 1.14 ± 0.011
3.671ValAsp: 3.671 ± 0.018
4.365ValGlu: 4.365 ± 0.019
2.5ValPhe: 2.5 ± 0.015
4.292ValGly: 4.292 ± 0.023
1.566ValHis: 1.566 ± 0.011
3.145ValIle: 3.145 ± 0.016
3.415ValLys: 3.415 ± 0.017
6.305ValLeu: 6.305 ± 0.023
1.516ValMet: 1.516 ± 0.011
2.256ValAsn: 2.256 ± 0.014
3.356ValPro: 3.356 ± 0.017
2.288ValGln: 2.288 ± 0.014
3.353ValArg: 3.353 ± 0.016
5.158ValSer: 5.158 ± 0.022
3.024ValThr: 3.024 ± 0.017
4.749ValVal: 4.749 ± 0.023
0.763ValTrp: 0.763 ± 0.009
1.781ValTyr: 1.781 ± 0.011
0.002ValXaa: 0.002 ± 0.0
Trp
0.843TrpAla: 0.843 ± 0.009
0.235TrpCys: 0.235 ± 0.004
0.71TrpAsp: 0.71 ± 0.009
0.786TrpGlu: 0.786 ± 0.008
0.504TrpPhe: 0.504 ± 0.006
0.749TrpGly: 0.749 ± 0.008
0.326TrpHis: 0.326 ± 0.005
0.667TrpIle: 0.667 ± 0.007
0.879TrpLys: 0.879 ± 0.008
1.244TrpLeu: 1.244 ± 0.009
0.352TrpMet: 0.352 ± 0.005
0.684TrpAsn: 0.684 ± 0.009
0.526TrpPro: 0.526 ± 0.006
0.462TrpGln: 0.462 ± 0.006
0.918TrpArg: 0.918 ± 0.009
1.011TrpSer: 1.011 ± 0.009
0.623TrpThr: 0.623 ± 0.006
0.772TrpVal: 0.772 ± 0.008
0.234TrpTrp: 0.234 ± 0.005
0.319TrpTyr: 0.319 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.827TyrAla: 1.827 ± 0.013
0.596TyrCys: 0.596 ± 0.007
1.446TyrAsp: 1.446 ± 0.011
1.539TyrGlu: 1.539 ± 0.011
1.155TyrPhe: 1.155 ± 0.01
2.076TyrGly: 2.076 ± 0.015
0.747TyrHis: 0.747 ± 0.009
1.323TyrIle: 1.323 ± 0.01
1.346TyrLys: 1.346 ± 0.011
2.645TyrLeu: 2.645 ± 0.016
0.697TyrMet: 0.697 ± 0.007
1.15TyrAsn: 1.15 ± 0.01
1.251TyrPro: 1.251 ± 0.01
0.926TyrGln: 0.926 ± 0.009
1.505TyrArg: 1.505 ± 0.012
2.166TyrSer: 2.166 ± 0.014
1.146TyrThr: 1.146 ± 0.01
1.683TyrVal: 1.683 ± 0.011
0.402TyrTrp: 0.402 ± 0.005
0.9TyrTyr: 0.9 ± 0.01
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.003XaaAla: 0.003 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.002XaaPhe: 0.002 ± 0.0
0.004XaaGly: 0.004 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.003XaaLys: 0.003 ± 0.001
0.003XaaLeu: 0.003 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.005XaaPro: 0.005 ± 0.001
0.001XaaGln: 0.001 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.003XaaSer: 0.003 ± 0.0
0.002XaaThr: 0.002 ± 0.0
0.002XaaVal: 0.002 ± 0.0
0.001XaaTrp: 0.001 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.009XaaXaa: 0.009 ± 0.002
Statistics based on 30667 proteins (14051939 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski