Amino acid dipepetide frequency for Aquilegia coerulea (Rocky mountain columbine)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.299AlaAla: 5.299 ± 0.028
1.179AlaCys: 1.179 ± 0.01
2.929AlaAsp: 2.929 ± 0.015
3.901AlaGlu: 3.901 ± 0.02
2.62AlaPhe: 2.62 ± 0.014
3.826AlaGly: 3.826 ± 0.017
1.269AlaHis: 1.269 ± 0.009
3.755AlaIle: 3.755 ± 0.019
3.746AlaLys: 3.746 ± 0.018
6.344AlaLeu: 6.344 ± 0.023
1.683AlaMet: 1.683 ± 0.012
2.505AlaAsn: 2.505 ± 0.013
2.418AlaPro: 2.418 ± 0.015
2.019AlaGln: 2.019 ± 0.013
3.102AlaArg: 3.102 ± 0.015
5.504AlaSer: 5.504 ± 0.024
3.392AlaThr: 3.392 ± 0.016
4.467AlaVal: 4.467 ± 0.019
0.73AlaTrp: 0.73 ± 0.007
1.822AlaTyr: 1.822 ± 0.01
0.0AlaXaa: 0.0 ± 0.0
Cys
0.935CysAla: 0.935 ± 0.008
0.517CysCys: 0.517 ± 0.006
0.862CysAsp: 0.862 ± 0.009
0.911CysGlu: 0.911 ± 0.007
0.922CysPhe: 0.922 ± 0.008
1.314CysGly: 1.314 ± 0.01
0.465CysHis: 0.465 ± 0.006
1.097CysIle: 1.097 ± 0.009
1.208CysLys: 1.208 ± 0.009
1.913CysLeu: 1.913 ± 0.013
0.471CysMet: 0.471 ± 0.006
0.88CysAsn: 0.88 ± 0.007
0.872CysPro: 0.872 ± 0.008
0.615CysGln: 0.615 ± 0.008
1.012CysArg: 1.012 ± 0.009
1.804CysSer: 1.804 ± 0.011
0.914CysThr: 0.914 ± 0.008
1.066CysVal: 1.066 ± 0.009
0.256CysTrp: 0.256 ± 0.005
0.57CysTyr: 0.57 ± 0.006
0.0CysXaa: 0.0 ± 0.0
Asp
3.248AspAla: 3.248 ± 0.014
0.945AspCys: 0.945 ± 0.009
3.556AspAsp: 3.556 ± 0.022
3.944AspGlu: 3.944 ± 0.021
2.388AspPhe: 2.388 ± 0.012
3.716AspGly: 3.716 ± 0.017
1.202AspHis: 1.202 ± 0.009
3.261AspIle: 3.261 ± 0.016
2.806AspLys: 2.806 ± 0.015
5.172AspLeu: 5.172 ± 0.021
1.362AspMet: 1.362 ± 0.009
2.226AspAsn: 2.226 ± 0.013
2.501AspPro: 2.501 ± 0.015
1.766AspGln: 1.766 ± 0.01
2.403AspArg: 2.403 ± 0.015
4.225AspSer: 4.225 ± 0.02
2.331AspThr: 2.331 ± 0.013
3.775AspVal: 3.775 ± 0.015
0.729AspTrp: 0.729 ± 0.007
1.648AspTyr: 1.648 ± 0.011
0.0AspXaa: 0.0 ± 0.0
Glu
4.455GluAla: 4.455 ± 0.021
0.929GluCys: 0.929 ± 0.009
4.037GluAsp: 4.037 ± 0.022
6.296GluGlu: 6.296 ± 0.04
2.381GluPhe: 2.381 ± 0.013
3.623GluGly: 3.623 ± 0.015
1.262GluHis: 1.262 ± 0.009
3.808GluIle: 3.808 ± 0.017
4.858GluLys: 4.858 ± 0.024
5.964GluLeu: 5.964 ± 0.027
1.767GluMet: 1.767 ± 0.012
3.208GluAsn: 3.208 ± 0.016
2.179GluPro: 2.179 ± 0.031
2.208GluGln: 2.208 ± 0.012
3.355GluArg: 3.355 ± 0.018
4.491GluSer: 4.491 ± 0.019
3.126GluThr: 3.126 ± 0.014
4.326GluVal: 4.326 ± 0.018
0.735GluTrp: 0.735 ± 0.007
1.703GluTyr: 1.703 ± 0.01
0.0GluXaa: 0.0 ± 0.0
Phe
2.402PheAla: 2.402 ± 0.012
0.928PheCys: 0.928 ± 0.008
2.355PheAsp: 2.355 ± 0.013
2.375PheGlu: 2.375 ± 0.012
2.114PhePhe: 2.114 ± 0.016
3.047PheGly: 3.047 ± 0.017
1.106PheHis: 1.106 ± 0.009
2.297PheIle: 2.297 ± 0.014
2.208PheLys: 2.208 ± 0.012
4.33PheLeu: 4.33 ± 0.022
0.985PheMet: 0.985 ± 0.008
1.901PheAsn: 1.901 ± 0.012
1.993PhePro: 1.993 ± 0.012
1.621PheGln: 1.621 ± 0.012
1.984PheArg: 1.984 ± 0.013
4.106PheSer: 4.106 ± 0.018
2.079PheThr: 2.079 ± 0.013
2.796PheVal: 2.796 ± 0.015
0.559PheTrp: 0.559 ± 0.007
1.356PheTyr: 1.356 ± 0.01
0.0PheXaa: 0.0 ± 0.0
Gly
3.634GlyAla: 3.634 ± 0.02
1.235GlyCys: 1.235 ± 0.01
3.302GlyAsp: 3.302 ± 0.016
3.64GlyGlu: 3.64 ± 0.027
3.08GlyPhe: 3.08 ± 0.015
5.19GlyGly: 5.19 ± 0.035
1.501GlyHis: 1.501 ± 0.013
3.79GlyIle: 3.79 ± 0.016
4.068GlyLys: 4.068 ± 0.02
5.888GlyLeu: 5.888 ± 0.023
1.561GlyMet: 1.561 ± 0.011
3.088GlyAsn: 3.088 ± 0.02
2.323GlyPro: 2.323 ± 0.016
2.086GlyGln: 2.086 ± 0.012
3.564GlyArg: 3.564 ± 0.023
5.821GlySer: 5.821 ± 0.024
3.277GlyThr: 3.277 ± 0.016
4.209GlyVal: 4.209 ± 0.018
0.924GlyTrp: 0.924 ± 0.008
2.157GlyTyr: 2.157 ± 0.014
0.0GlyXaa: 0.0 ± 0.0
His
1.341HisAla: 1.341 ± 0.011
0.502HisCys: 0.502 ± 0.005
1.131HisAsp: 1.131 ± 0.009
1.278HisGlu: 1.278 ± 0.009
1.083HisPhe: 1.083 ± 0.008
1.659HisGly: 1.659 ± 0.01
0.935HisHis: 0.935 ± 0.011
1.345HisIle: 1.345 ± 0.009
1.168HisLys: 1.168 ± 0.009
2.46HisLeu: 2.46 ± 0.014
0.573HisMet: 0.573 ± 0.006
1.002HisAsn: 1.002 ± 0.008
1.359HisPro: 1.359 ± 0.01
1.063HisGln: 1.063 ± 0.011
1.293HisArg: 1.293 ± 0.009
1.968HisSer: 1.968 ± 0.011
1.087HisThr: 1.087 ± 0.009
1.527HisVal: 1.527 ± 0.01
0.308HisTrp: 0.308 ± 0.004
0.766HisTyr: 0.766 ± 0.008
0.0HisXaa: 0.0 ± 0.0
Ile
3.597IleAla: 3.597 ± 0.017
1.17IleCys: 1.17 ± 0.01
3.092IleAsp: 3.092 ± 0.014
3.412IleGlu: 3.412 ± 0.017
2.445IlePhe: 2.445 ± 0.016
3.521IleGly: 3.521 ± 0.017
1.42IleHis: 1.42 ± 0.01
3.12IleIle: 3.12 ± 0.02
3.169IleLys: 3.169 ± 0.016
5.552IleLeu: 5.552 ± 0.024
1.254IleMet: 1.254 ± 0.01
2.41IleAsn: 2.41 ± 0.011
3.045IlePro: 3.045 ± 0.018
2.163IleGln: 2.163 ± 0.014
2.661IleArg: 2.661 ± 0.014
5.268IleSer: 5.268 ± 0.021
2.899IleThr: 2.899 ± 0.014
3.704IleVal: 3.704 ± 0.016
0.734IleTrp: 0.734 ± 0.008
1.716IleTyr: 1.716 ± 0.012
0.0IleXaa: 0.0 ± 0.0
Lys
3.915LysAla: 3.915 ± 0.018
1.006LysCys: 1.006 ± 0.007
3.47LysAsp: 3.47 ± 0.016
4.817LysGlu: 4.817 ± 0.023
2.262LysPhe: 2.262 ± 0.014
3.618LysGly: 3.618 ± 0.02
1.369LysHis: 1.369 ± 0.009
3.406LysIle: 3.406 ± 0.014
4.973LysLys: 4.973 ± 0.026
6.042LysLeu: 6.042 ± 0.02
1.53LysMet: 1.53 ± 0.01
2.849LysAsn: 2.849 ± 0.013
2.661LysPro: 2.661 ± 0.013
2.384LysGln: 2.384 ± 0.014
3.518LysArg: 3.518 ± 0.018
4.696LysSer: 4.696 ± 0.018
3.063LysThr: 3.063 ± 0.015
4.03LysVal: 4.03 ± 0.016
0.791LysTrp: 0.791 ± 0.007
1.721LysTyr: 1.721 ± 0.011
0.0LysXaa: 0.0 ± 0.0
Leu
6.149LeuAla: 6.149 ± 0.026
1.856LeuCys: 1.856 ± 0.013
5.127LeuAsp: 5.127 ± 0.019
6.372LeuGlu: 6.372 ± 0.023
4.016LeuPhe: 4.016 ± 0.021
5.768LeuGly: 5.768 ± 0.025
2.636LeuHis: 2.636 ± 0.015
4.921LeuIle: 4.921 ± 0.019
6.347LeuLys: 6.347 ± 0.024
10.108LeuLeu: 10.108 ± 0.04
2.221LeuMet: 2.221 ± 0.01
4.147LeuAsn: 4.147 ± 0.015
5.079LeuPro: 5.079 ± 0.018
4.439LeuGln: 4.439 ± 0.02
5.143LeuArg: 5.143 ± 0.022
8.553LeuSer: 8.553 ± 0.035
4.444LeuThr: 4.444 ± 0.018
6.392LeuVal: 6.392 ± 0.021
1.131LeuTrp: 1.131 ± 0.01
2.688LeuTyr: 2.688 ± 0.015
0.0LeuXaa: 0.0 ± 0.0
Met
1.959MetAla: 1.959 ± 0.013
0.351MetCys: 0.351 ± 0.005
1.504MetAsp: 1.504 ± 0.011
2.004MetGlu: 2.004 ± 0.011
0.893MetPhe: 0.893 ± 0.008
1.597MetGly: 1.597 ± 0.01
0.558MetHis: 0.558 ± 0.005
1.278MetIle: 1.278 ± 0.01
1.699MetLys: 1.699 ± 0.011
2.236MetLeu: 2.236 ± 0.011
0.729MetMet: 0.729 ± 0.008
1.071MetAsn: 1.071 ± 0.009
1.029MetPro: 1.029 ± 0.01
0.959MetGln: 0.959 ± 0.007
1.171MetArg: 1.171 ± 0.008
1.798MetSer: 1.798 ± 0.011
1.085MetThr: 1.085 ± 0.009
1.75MetVal: 1.75 ± 0.01
0.268MetTrp: 0.268 ± 0.004
0.649MetTyr: 0.649 ± 0.006
0.0MetXaa: 0.0 ± 0.0
Asn
2.581AsnAla: 2.581 ± 0.015
0.853AsnCys: 0.853 ± 0.009
2.237AsnAsp: 2.237 ± 0.014
2.668AsnGlu: 2.668 ± 0.014
1.945AsnPhe: 1.945 ± 0.013
3.335AsnGly: 3.335 ± 0.019
1.169AsnHis: 1.169 ± 0.009
2.739AsnIle: 2.739 ± 0.014
2.631AsnLys: 2.631 ± 0.014
4.684AsnLeu: 4.684 ± 0.023
1.164AsnMet: 1.164 ± 0.009
2.553AsnAsn: 2.553 ± 0.016
2.358AsnPro: 2.358 ± 0.014
1.812AsnGln: 1.812 ± 0.011
2.043AsnArg: 2.043 ± 0.012
4.002AsnSer: 4.002 ± 0.016
2.195AsnThr: 2.195 ± 0.012
2.956AsnVal: 2.956 ± 0.013
0.565AsnTrp: 0.565 ± 0.007
1.406AsnTyr: 1.406 ± 0.009
0.0AsnXaa: 0.0 ± 0.0
Pro
2.568ProAla: 2.568 ± 0.016
0.734ProCys: 0.734 ± 0.007
2.403ProAsp: 2.403 ± 0.014
2.962ProGlu: 2.962 ± 0.017
1.994ProPhe: 1.994 ± 0.012
2.755ProGly: 2.755 ± 0.043
1.084ProHis: 1.084 ± 0.008
2.494ProIle: 2.494 ± 0.014
2.664ProLys: 2.664 ± 0.014
4.247ProLeu: 4.247 ± 0.02
0.974ProMet: 0.974 ± 0.008
2.278ProAsn: 2.278 ± 0.014
3.288ProPro: 3.288 ± 0.031
1.739ProGln: 1.739 ± 0.012
2.141ProArg: 2.141 ± 0.012
4.992ProSer: 4.992 ± 0.028
2.645ProThr: 2.645 ± 0.015
2.931ProVal: 2.931 ± 0.015
0.575ProTrp: 0.575 ± 0.006
1.347ProTyr: 1.347 ± 0.012
0.0ProXaa: 0.0 ± 0.0
Gln
2.332GlnAla: 2.332 ± 0.014
0.614GlnCys: 0.614 ± 0.008
1.737GlnAsp: 1.737 ± 0.012
2.424GlnGlu: 2.424 ± 0.015
1.474GlnPhe: 1.474 ± 0.009
2.162GlnGly: 2.162 ± 0.014
0.99GlnHis: 0.99 ± 0.008
2.124GlnIle: 2.124 ± 0.014
2.414GlnLys: 2.414 ± 0.013
3.698GlnLeu: 3.698 ± 0.017
0.97GlnMet: 0.97 ± 0.008
1.851GlnAsn: 1.851 ± 0.012
1.645GlnPro: 1.645 ± 0.013
2.418GlnGln: 2.418 ± 0.039
2.101GlnArg: 2.101 ± 0.013
2.921GlnSer: 2.921 ± 0.017
1.842GlnThr: 1.842 ± 0.012
2.51GlnVal: 2.51 ± 0.012
0.45GlnTrp: 0.45 ± 0.006
1.023GlnTyr: 1.023 ± 0.008
0.0GlnXaa: 0.0 ± 0.0
Arg
2.923ArgAla: 2.923 ± 0.016
0.962ArgCys: 0.962 ± 0.008
2.615ArgAsp: 2.615 ± 0.015
3.219ArgGlu: 3.219 ± 0.017
2.164ArgPhe: 2.164 ± 0.012
3.043ArgGly: 3.043 ± 0.018
1.214ArgHis: 1.214 ± 0.009
2.907ArgIle: 2.907 ± 0.015
3.706ArgLys: 3.706 ± 0.017
4.847ArgLeu: 4.847 ± 0.02
1.3ArgMet: 1.3 ± 0.009
2.516ArgAsn: 2.516 ± 0.013
2.277ArgPro: 2.277 ± 0.032
1.81ArgGln: 1.81 ± 0.013
3.524ArgArg: 3.524 ± 0.019
4.254ArgSer: 4.254 ± 0.024
2.428ArgThr: 2.428 ± 0.013
3.181ArgVal: 3.181 ± 0.014
0.695ArgTrp: 0.695 ± 0.007
1.461ArgTyr: 1.461 ± 0.012
0.0ArgXaa: 0.0 ± 0.0
Ser
4.945SerAla: 4.945 ± 0.019
1.708SerCys: 1.708 ± 0.011
4.373SerAsp: 4.373 ± 0.022
4.831SerGlu: 4.831 ± 0.022
4.003SerPhe: 4.003 ± 0.019
5.728SerGly: 5.728 ± 0.025
1.984SerHis: 1.984 ± 0.013
4.956SerIle: 4.956 ± 0.018
5.031SerLys: 5.031 ± 0.019
8.698SerLeu: 8.698 ± 0.026
2.094SerMet: 2.094 ± 0.012
4.254SerAsn: 4.254 ± 0.018
4.209SerPro: 4.209 ± 0.023
3.075SerGln: 3.075 ± 0.016
4.363SerArg: 4.363 ± 0.022
11.569SerSer: 11.569 ± 0.046
4.994SerThr: 4.994 ± 0.019
5.331SerVal: 5.331 ± 0.021
1.17SerTrp: 1.17 ± 0.009
2.403SerTyr: 2.403 ± 0.014
0.0SerXaa: 0.0 ± 0.0
Thr
3.164ThrAla: 3.164 ± 0.014
0.969ThrCys: 0.969 ± 0.009
2.394ThrAsp: 2.394 ± 0.011
2.841ThrGlu: 2.841 ± 0.015
2.101ThrPhe: 2.101 ± 0.012
3.269ThrGly: 3.269 ± 0.017
1.095ThrHis: 1.095 ± 0.009
3.009ThrIle: 3.009 ± 0.015
2.908ThrLys: 2.908 ± 0.016
4.788ThrLeu: 4.788 ± 0.017
1.249ThrMet: 1.249 ± 0.008
2.305ThrAsn: 2.305 ± 0.011
2.616ThrPro: 2.616 ± 0.017
1.644ThrGln: 1.644 ± 0.011
2.34ThrArg: 2.34 ± 0.014
4.896ThrSer: 4.896 ± 0.02
3.467ThrThr: 3.467 ± 0.021
3.359ThrVal: 3.359 ± 0.017
0.644ThrTrp: 0.644 ± 0.007
1.49ThrTyr: 1.49 ± 0.012
0.0ThrXaa: 0.0 ± 0.0
Val
4.567ValAla: 4.567 ± 0.02
1.251ValCys: 1.251 ± 0.01
3.802ValAsp: 3.802 ± 0.017
4.448ValGlu: 4.448 ± 0.02
2.747ValPhe: 2.747 ± 0.013
4.21ValGly: 4.21 ± 0.02
1.551ValHis: 1.551 ± 0.01
3.644ValIle: 3.644 ± 0.018
3.927ValLys: 3.927 ± 0.016
6.45ValLeu: 6.45 ± 0.024
1.576ValMet: 1.576 ± 0.009
2.689ValAsn: 2.689 ± 0.015
3.153ValPro: 3.153 ± 0.016
2.412ValGln: 2.412 ± 0.011
3.062ValArg: 3.062 ± 0.015
5.481ValSer: 5.481 ± 0.018
3.214ValThr: 3.214 ± 0.015
5.009ValVal: 5.009 ± 0.024
0.764ValTrp: 0.764 ± 0.007
1.98ValTyr: 1.98 ± 0.011
0.0ValXaa: 0.0 ± 0.0
Trp
0.733TrpAla: 0.733 ± 0.007
0.24TrpCys: 0.24 ± 0.004
0.699TrpAsp: 0.699 ± 0.007
0.763TrpGlu: 0.763 ± 0.008
0.551TrpPhe: 0.551 ± 0.006
0.727TrpGly: 0.727 ± 0.009
0.277TrpHis: 0.277 ± 0.004
0.764TrpIle: 0.764 ± 0.008
0.948TrpLys: 0.948 ± 0.008
1.209TrpLeu: 1.209 ± 0.01
0.347TrpMet: 0.347 ± 0.005
0.716TrpAsn: 0.716 ± 0.008
0.506TrpPro: 0.506 ± 0.007
0.423TrpGln: 0.423 ± 0.004
0.78TrpArg: 0.78 ± 0.007
0.964TrpSer: 0.964 ± 0.008
0.652TrpThr: 0.652 ± 0.006
0.778TrpVal: 0.778 ± 0.007
0.235TrpTrp: 0.235 ± 0.004
0.36TrpTyr: 0.36 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.759TyrAla: 1.759 ± 0.011
0.659TyrCys: 0.659 ± 0.008
1.577TyrAsp: 1.577 ± 0.011
1.608TyrGlu: 1.608 ± 0.01
1.36TyrPhe: 1.36 ± 0.01
2.233TyrGly: 2.233 ± 0.015
0.762TyrHis: 0.762 ± 0.008
1.637TyrIle: 1.637 ± 0.011
1.639TyrLys: 1.639 ± 0.01
2.906TyrLeu: 2.906 ± 0.016
0.779TyrMet: 0.779 ± 0.008
1.446TyrAsn: 1.446 ± 0.011
1.306TyrPro: 1.306 ± 0.009
1.043TyrGln: 1.043 ± 0.009
1.486TyrArg: 1.486 ± 0.01
2.405TyrSer: 2.405 ± 0.013
1.415TyrThr: 1.415 ± 0.009
1.839TyrVal: 1.839 ± 0.012
0.41TyrTrp: 0.41 ± 0.005
1.038TyrTyr: 1.038 ± 0.01
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.006XaaXaa: 0.006 ± 0.003
Statistics based on 37018 proteins (15996891 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski