Amino acid dipepetide frequency for Artemisia annua (Sweet wormwood)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.867AlaAla: 4.867 ± 0.023
1.168AlaCys: 1.168 ± 0.008
2.917AlaAsp: 2.917 ± 0.012
3.554AlaGlu: 3.554 ± 0.013
2.501AlaPhe: 2.501 ± 0.012
3.409AlaGly: 3.409 ± 0.015
1.253AlaHis: 1.253 ± 0.007
3.58AlaIle: 3.58 ± 0.014
3.891AlaLys: 3.891 ± 0.014
5.684AlaLeu: 5.684 ± 0.02
1.647AlaMet: 1.647 ± 0.009
2.71AlaAsn: 2.71 ± 0.011
2.412AlaPro: 2.412 ± 0.011
1.897AlaGln: 1.897 ± 0.009
3.063AlaArg: 3.063 ± 0.013
5.136AlaSer: 5.136 ± 0.016
3.512AlaThr: 3.512 ± 0.015
4.027AlaVal: 4.027 ± 0.015
0.715AlaTrp: 0.715 ± 0.006
1.844AlaTyr: 1.844 ± 0.009
0.002AlaXaa: 0.002 ± 0.0
Cys
0.917CysAla: 0.917 ± 0.007
0.543CysCys: 0.543 ± 0.005
1.045CysAsp: 1.045 ± 0.007
0.967CysGlu: 0.967 ± 0.006
0.943CysPhe: 0.943 ± 0.006
1.46CysGly: 1.46 ± 0.009
0.489CysHis: 0.489 ± 0.005
1.073CysIle: 1.073 ± 0.008
1.278CysLys: 1.278 ± 0.008
1.924CysLeu: 1.924 ± 0.012
0.485CysMet: 0.485 ± 0.004
0.976CysAsn: 0.976 ± 0.008
0.922CysPro: 0.922 ± 0.007
0.565CysGln: 0.565 ± 0.005
0.993CysArg: 0.993 ± 0.007
1.684CysSer: 1.684 ± 0.011
0.913CysThr: 0.913 ± 0.007
1.179CysVal: 1.179 ± 0.007
0.265CysTrp: 0.265 ± 0.003
0.654CysTyr: 0.654 ± 0.005
0.001CysXaa: 0.001 ± 0.0
Asp
3.284AspAla: 3.284 ± 0.013
0.999AspCys: 0.999 ± 0.007
4.106AspAsp: 4.106 ± 0.019
4.421AspGlu: 4.421 ± 0.017
2.371AspPhe: 2.371 ± 0.01
3.763AspGly: 3.763 ± 0.016
1.382AspHis: 1.382 ± 0.008
3.418AspIle: 3.418 ± 0.013
3.129AspLys: 3.129 ± 0.013
5.203AspLeu: 5.203 ± 0.017
1.538AspMet: 1.538 ± 0.008
2.468AspAsn: 2.468 ± 0.01
2.483AspPro: 2.483 ± 0.01
1.867AspGln: 1.867 ± 0.01
2.315AspArg: 2.315 ± 0.012
4.054AspSer: 4.054 ± 0.016
2.595AspThr: 2.595 ± 0.01
4.174AspVal: 4.174 ± 0.014
0.726AspTrp: 0.726 ± 0.006
1.676AspTyr: 1.676 ± 0.01
0.002AspXaa: 0.002 ± 0.0
Glu
4.241GluAla: 4.241 ± 0.015
1.066GluCys: 1.066 ± 0.007
4.018GluAsp: 4.018 ± 0.016
5.594GluGlu: 5.594 ± 0.022
2.442GluPhe: 2.442 ± 0.01
3.562GluGly: 3.562 ± 0.013
1.316GluHis: 1.316 ± 0.008
3.859GluIle: 3.859 ± 0.015
4.823GluLys: 4.823 ± 0.019
5.87GluLeu: 5.87 ± 0.021
1.822GluMet: 1.822 ± 0.009
3.318GluAsn: 3.318 ± 0.015
2.13GluPro: 2.13 ± 0.01
2.088GluGln: 2.088 ± 0.01
3.087GluArg: 3.087 ± 0.012
4.616GluSer: 4.616 ± 0.015
3.256GluThr: 3.256 ± 0.013
4.388GluVal: 4.388 ± 0.015
0.796GluTrp: 0.796 ± 0.006
1.879GluTyr: 1.879 ± 0.011
0.001GluXaa: 0.001 ± 0.0
Phe
2.358PheAla: 2.358 ± 0.01
0.895PheCys: 0.895 ± 0.006
2.496PheAsp: 2.496 ± 0.012
2.44PheGlu: 2.44 ± 0.011
1.851PhePhe: 1.851 ± 0.01
3.068PheGly: 3.068 ± 0.015
1.115PheHis: 1.115 ± 0.007
2.289PheIle: 2.289 ± 0.011
2.549PheLys: 2.549 ± 0.011
4.035PheLeu: 4.035 ± 0.017
1.067PheMet: 1.067 ± 0.007
1.93PheAsn: 1.93 ± 0.011
1.915PhePro: 1.915 ± 0.011
1.541PheGln: 1.541 ± 0.009
2.002PheArg: 2.002 ± 0.01
3.741PheSer: 3.741 ± 0.014
2.212PheThr: 2.212 ± 0.01
2.909PheVal: 2.909 ± 0.014
0.59PheTrp: 0.59 ± 0.005
1.384PheTyr: 1.384 ± 0.009
0.002PheXaa: 0.002 ± 0.0
Gly
3.268GlyAla: 3.268 ± 0.015
1.351GlyCys: 1.351 ± 0.009
3.41GlyAsp: 3.41 ± 0.013
3.443GlyGlu: 3.443 ± 0.013
3.11GlyPhe: 3.11 ± 0.013
4.783GlyGly: 4.783 ± 0.028
1.502GlyHis: 1.502 ± 0.008
3.497GlyIle: 3.497 ± 0.012
4.151GlyLys: 4.151 ± 0.015
5.652GlyLeu: 5.652 ± 0.019
1.492GlyMet: 1.492 ± 0.008
3.212GlyAsn: 3.212 ± 0.014
2.294GlyPro: 2.294 ± 0.012
1.946GlyGln: 1.946 ± 0.01
3.255GlyArg: 3.255 ± 0.015
5.59GlySer: 5.59 ± 0.018
3.152GlyThr: 3.152 ± 0.012
4.297GlyVal: 4.297 ± 0.014
0.911GlyTrp: 0.911 ± 0.006
2.162GlyTyr: 2.162 ± 0.01
0.002GlyXaa: 0.002 ± 0.0
His
1.335HisAla: 1.335 ± 0.008
0.526HisCys: 0.526 ± 0.005
1.302HisAsp: 1.302 ± 0.007
1.445HisGlu: 1.445 ± 0.008
1.065HisPhe: 1.065 ± 0.007
1.652HisGly: 1.652 ± 0.009
0.874HisHis: 0.874 ± 0.007
1.369HisIle: 1.369 ± 0.008
1.377HisLys: 1.377 ± 0.008
2.462HisLeu: 2.462 ± 0.012
0.634HisMet: 0.634 ± 0.005
1.143HisAsn: 1.143 ± 0.007
1.261HisPro: 1.261 ± 0.008
0.955HisGln: 0.955 ± 0.006
1.247HisArg: 1.247 ± 0.007
1.774HisSer: 1.774 ± 0.009
1.135HisThr: 1.135 ± 0.007
1.684HisVal: 1.684 ± 0.01
0.322HisTrp: 0.322 ± 0.004
0.75HisTyr: 0.75 ± 0.005
0.001HisXaa: 0.001 ± 0.0
Ile
3.382IleAla: 3.382 ± 0.014
1.208IleCys: 1.208 ± 0.008
3.258IleAsp: 3.258 ± 0.012
3.416IleGlu: 3.416 ± 0.012
2.341IlePhe: 2.341 ± 0.011
3.523IleGly: 3.523 ± 0.013
1.422IleHis: 1.422 ± 0.008
3.152IleIle: 3.152 ± 0.013
3.474IleLys: 3.474 ± 0.013
5.258IleLeu: 5.258 ± 0.018
1.305IleMet: 1.305 ± 0.008
2.575IleAsn: 2.575 ± 0.01
3.042IlePro: 3.042 ± 0.013
2.056IleGln: 2.056 ± 0.01
2.752IleArg: 2.752 ± 0.012
4.83IleSer: 4.83 ± 0.015
2.945IleThr: 2.945 ± 0.011
3.738IleVal: 3.738 ± 0.012
0.808IleTrp: 0.808 ± 0.006
1.678IleTyr: 1.678 ± 0.009
0.002IleXaa: 0.002 ± 0.0
Lys
3.942LysAla: 3.942 ± 0.015
1.156LysCys: 1.156 ± 0.009
3.74LysAsp: 3.74 ± 0.014
4.911LysGlu: 4.911 ± 0.017
2.404LysPhe: 2.404 ± 0.011
3.803LysGly: 3.803 ± 0.015
1.583LysHis: 1.583 ± 0.009
3.688LysIle: 3.688 ± 0.013
5.542LysLys: 5.542 ± 0.022
6.354LysLeu: 6.354 ± 0.02
1.759LysMet: 1.759 ± 0.009
3.223LysAsn: 3.223 ± 0.012
2.787LysPro: 2.787 ± 0.013
2.448LysGln: 2.448 ± 0.012
3.837LysArg: 3.837 ± 0.016
4.879LysSer: 4.879 ± 0.016
3.365LysThr: 3.365 ± 0.014
4.471LysVal: 4.471 ± 0.016
0.982LysTrp: 0.982 ± 0.006
1.91LysTyr: 1.91 ± 0.009
0.002LysXaa: 0.002 ± 0.0
Leu
5.796LeuAla: 5.796 ± 0.018
1.842LeuCys: 1.842 ± 0.01
5.29LeuAsp: 5.29 ± 0.02
6.335LeuGlu: 6.335 ± 0.023
3.838LeuPhe: 3.838 ± 0.015
5.41LeuGly: 5.41 ± 0.018
2.506LeuHis: 2.506 ± 0.011
4.764LeuIle: 4.764 ± 0.017
6.768LeuLys: 6.768 ± 0.02
9.066LeuLeu: 9.066 ± 0.024
2.253LeuMet: 2.253 ± 0.01
4.265LeuAsn: 4.265 ± 0.016
4.631LeuPro: 4.631 ± 0.013
3.878LeuGln: 3.878 ± 0.012
4.892LeuArg: 4.892 ± 0.018
7.945LeuSer: 7.945 ± 0.029
4.755LeuThr: 4.755 ± 0.015
6.289LeuVal: 6.289 ± 0.018
1.226LeuTrp: 1.226 ± 0.008
2.594LeuTyr: 2.594 ± 0.012
0.003LeuXaa: 0.003 ± 0.0
Met
1.921MetAla: 1.921 ± 0.009
0.413MetCys: 0.413 ± 0.004
1.568MetAsp: 1.568 ± 0.009
1.962MetGlu: 1.962 ± 0.011
0.996MetPhe: 0.996 ± 0.007
1.629MetGly: 1.629 ± 0.009
0.584MetHis: 0.584 ± 0.005
1.434MetIle: 1.434 ± 0.009
1.871MetLys: 1.871 ± 0.009
2.304MetLeu: 2.304 ± 0.011
0.801MetMet: 0.801 ± 0.006
1.223MetAsn: 1.223 ± 0.007
1.013MetPro: 1.013 ± 0.007
0.904MetGln: 0.904 ± 0.006
1.154MetArg: 1.154 ± 0.007
1.964MetSer: 1.964 ± 0.01
1.26MetThr: 1.26 ± 0.008
1.889MetVal: 1.889 ± 0.01
0.306MetTrp: 0.306 ± 0.004
0.681MetTyr: 0.681 ± 0.005
0.001MetXaa: 0.001 ± 0.0
Asn
2.658AsnAla: 2.658 ± 0.011
0.879AsnCys: 0.879 ± 0.007
2.664AsnAsp: 2.664 ± 0.01
3.125AsnGlu: 3.125 ± 0.012
1.985AsnPhe: 1.985 ± 0.01
3.533AsnGly: 3.533 ± 0.015
1.268AsnHis: 1.268 ± 0.007
2.941AsnIle: 2.941 ± 0.013
3.068AsnLys: 3.068 ± 0.013
4.903AsnLeu: 4.903 ± 0.022
1.358AsnMet: 1.358 ± 0.008
2.907AsnAsn: 2.907 ± 0.016
2.301AsnPro: 2.301 ± 0.012
1.856AsnGln: 1.856 ± 0.01
2.187AsnArg: 2.187 ± 0.01
3.792AsnSer: 3.792 ± 0.013
2.417AsnThr: 2.417 ± 0.011
3.391AsnVal: 3.391 ± 0.013
0.624AsnTrp: 0.624 ± 0.004
1.416AsnTyr: 1.416 ± 0.009
0.001AsnXaa: 0.001 ± 0.0
Pro
2.448ProAla: 2.448 ± 0.012
0.68ProCys: 0.68 ± 0.006
2.366ProAsp: 2.366 ± 0.009
2.872ProGlu: 2.872 ± 0.011
1.895ProPhe: 1.895 ± 0.01
2.304ProGly: 2.304 ± 0.011
1.056ProHis: 1.056 ± 0.007
2.398ProIle: 2.398 ± 0.01
2.898ProLys: 2.898 ± 0.014
3.935ProLeu: 3.935 ± 0.014
1.037ProMet: 1.037 ± 0.007
2.446ProAsn: 2.446 ± 0.011
3.199ProPro: 3.199 ± 0.031
1.702ProGln: 1.702 ± 0.01
2.117ProArg: 2.117 ± 0.011
4.425ProSer: 4.425 ± 0.018
2.785ProThr: 2.785 ± 0.013
3.047ProVal: 3.047 ± 0.013
0.565ProTrp: 0.565 ± 0.005
1.333ProTyr: 1.333 ± 0.009
0.001ProXaa: 0.001 ± 0.0
Gln
2.144GlnAla: 2.144 ± 0.011
0.569GlnCys: 0.569 ± 0.005
1.737GlnAsp: 1.737 ± 0.008
2.381GlnGlu: 2.381 ± 0.011
1.354GlnPhe: 1.354 ± 0.009
1.978GlnGly: 1.978 ± 0.01
0.919GlnHis: 0.919 ± 0.007
1.956GlnIle: 1.956 ± 0.01
2.426GlnLys: 2.426 ± 0.01
3.4GlnLeu: 3.4 ± 0.014
0.999GlnMet: 0.999 ± 0.007
1.855GlnAsn: 1.855 ± 0.01
1.714GlnPro: 1.714 ± 0.012
1.898GlnGln: 1.898 ± 0.017
1.95GlnArg: 1.95 ± 0.009
2.6GlnSer: 2.6 ± 0.012
1.832GlnThr: 1.832 ± 0.011
2.37GlnVal: 2.37 ± 0.009
0.464GlnTrp: 0.464 ± 0.005
0.956GlnTyr: 0.956 ± 0.007
0.001GlnXaa: 0.001 ± 0.0
Arg
2.777ArgAla: 2.777 ± 0.013
1.013ArgCys: 1.013 ± 0.007
2.591ArgAsp: 2.591 ± 0.011
2.93ArgGlu: 2.93 ± 0.012
2.294ArgPhe: 2.294 ± 0.011
2.918ArgGly: 2.918 ± 0.013
1.236ArgHis: 1.236 ± 0.008
2.817ArgIle: 2.817 ± 0.012
3.757ArgLys: 3.757 ± 0.015
4.93ArgLeu: 4.93 ± 0.016
1.344ArgMet: 1.344 ± 0.008
2.53ArgAsn: 2.53 ± 0.011
2.097ArgPro: 2.097 ± 0.011
1.644ArgGln: 1.644 ± 0.009
3.359ArgArg: 3.359 ± 0.016
3.99ArgSer: 3.99 ± 0.015
2.359ArgThr: 2.359 ± 0.011
3.352ArgVal: 3.352 ± 0.012
0.737ArgTrp: 0.737 ± 0.006
1.532ArgTyr: 1.532 ± 0.008
0.002ArgXaa: 0.002 ± 0.0
Ser
4.366SerAla: 4.366 ± 0.017
1.674SerCys: 1.674 ± 0.008
4.372SerAsp: 4.372 ± 0.016
4.438SerGlu: 4.438 ± 0.015
3.897SerPhe: 3.897 ± 0.014
5.443SerGly: 5.443 ± 0.019
1.953SerHis: 1.953 ± 0.011
4.573SerIle: 4.573 ± 0.014
5.189SerLys: 5.189 ± 0.018
8.215SerLeu: 8.215 ± 0.023
2.113SerMet: 2.113 ± 0.009
4.224SerAsn: 4.224 ± 0.016
3.808SerPro: 3.808 ± 0.02
2.836SerGln: 2.836 ± 0.014
4.061SerArg: 4.061 ± 0.014
9.419SerSer: 9.419 ± 0.034
4.64SerThr: 4.64 ± 0.015
5.19SerVal: 5.19 ± 0.018
1.188SerTrp: 1.188 ± 0.008
2.481SerTyr: 2.481 ± 0.01
0.003SerXaa: 0.003 ± 0.0
Thr
2.937ThrAla: 2.937 ± 0.013
1.056ThrCys: 1.056 ± 0.007
2.558ThrAsp: 2.558 ± 0.011
2.856ThrGlu: 2.856 ± 0.012
2.191ThrPhe: 2.191 ± 0.009
3.213ThrGly: 3.213 ± 0.013
1.222ThrHis: 1.222 ± 0.007
3.125ThrIle: 3.125 ± 0.011
3.211ThrLys: 3.211 ± 0.013
4.813ThrLeu: 4.813 ± 0.017
1.31ThrMet: 1.31 ± 0.007
2.673ThrAsn: 2.673 ± 0.012
2.745ThrPro: 2.745 ± 0.015
1.745ThrGln: 1.745 ± 0.009
2.642ThrArg: 2.642 ± 0.01
4.954ThrSer: 4.954 ± 0.012
3.517ThrThr: 3.517 ± 0.017
3.286ThrVal: 3.286 ± 0.013
0.739ThrTrp: 0.739 ± 0.005
1.575ThrTyr: 1.575 ± 0.009
0.002ThrXaa: 0.002 ± 0.0
Val
4.549ValAla: 4.549 ± 0.015
1.246ValCys: 1.246 ± 0.009
4.052ValAsp: 4.052 ± 0.014
4.524ValGlu: 4.524 ± 0.017
2.925ValPhe: 2.925 ± 0.012
4.139ValGly: 4.139 ± 0.014
1.535ValHis: 1.535 ± 0.007
3.749ValIle: 3.749 ± 0.014
4.45ValLys: 4.45 ± 0.014
6.226ValLeu: 6.226 ± 0.019
1.677ValMet: 1.677 ± 0.008
3.238ValAsn: 3.238 ± 0.014
3.02ValPro: 3.02 ± 0.013
2.241ValGln: 2.241 ± 0.011
3.012ValArg: 3.012 ± 0.011
5.435ValSer: 5.435 ± 0.016
3.543ValThr: 3.543 ± 0.011
5.212ValVal: 5.212 ± 0.018
0.881ValTrp: 0.881 ± 0.006
2.133ValTyr: 2.133 ± 0.01
0.002ValXaa: 0.002 ± 0.0
Trp
0.729TrpAla: 0.729 ± 0.006
0.322TrpCys: 0.322 ± 0.004
0.723TrpAsp: 0.723 ± 0.006
0.802TrpGlu: 0.802 ± 0.006
0.627TrpPhe: 0.627 ± 0.005
0.719TrpGly: 0.719 ± 0.007
0.302TrpHis: 0.302 ± 0.004
0.794TrpIle: 0.794 ± 0.006
1.06TrpLys: 1.06 ± 0.006
1.264TrpLeu: 1.264 ± 0.008
0.383TrpMet: 0.383 ± 0.004
0.781TrpAsn: 0.781 ± 0.006
0.467TrpPro: 0.467 ± 0.005
0.424TrpGln: 0.424 ± 0.005
0.811TrpArg: 0.811 ± 0.007
1.032TrpSer: 1.032 ± 0.006
0.675TrpThr: 0.675 ± 0.005
0.931TrpVal: 0.931 ± 0.007
0.266TrpTrp: 0.266 ± 0.004
0.419TrpTyr: 0.419 ± 0.004
0.001TrpXaa: 0.001 ± 0.0
Tyr
1.869TyrAla: 1.869 ± 0.01
0.672TyrCys: 0.672 ± 0.005
1.758TyrAsp: 1.758 ± 0.008
1.751TyrGlu: 1.751 ± 0.009
1.358TyrPhe: 1.358 ± 0.008
2.172TyrGly: 2.172 ± 0.011
0.794TyrHis: 0.794 ± 0.006
1.687TyrIle: 1.687 ± 0.008
1.872TyrLys: 1.872 ± 0.009
2.844TyrLeu: 2.844 ± 0.013
0.854TyrMet: 0.854 ± 0.006
1.578TyrAsn: 1.578 ± 0.009
1.266TyrPro: 1.266 ± 0.009
1.011TyrGln: 1.011 ± 0.007
1.471TyrArg: 1.471 ± 0.009
2.219TyrSer: 2.219 ± 0.008
1.501TyrThr: 1.501 ± 0.009
1.96TyrVal: 1.96 ± 0.008
0.423TyrTrp: 0.423 ± 0.004
1.079TyrTyr: 1.079 ± 0.008
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.002XaaPhe: 0.002 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.002XaaIle: 0.002 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.003XaaLeu: 0.003 ± 0.0
0.002XaaMet: 0.002 ± 0.0
0.002XaaAsn: 0.002 ± 0.0
0.002XaaPro: 0.002 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.003XaaSer: 0.003 ± 0.0
0.002XaaThr: 0.002 ± 0.0
0.002XaaVal: 0.002 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66068 proteins (25287345 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski