Amino acid dipeptide frequency for Coffea arabica (Arabian coffee)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each one would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
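As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping residue pairs and reports them in per mille. The function names, the mapping of every non-standard letter to `X` (Xaa), and the use of the uniform 2.5‰ value as the threshold are assumptions made for this example; the database's own pipeline may differ.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X'.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are taken within a protein, never across two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def label(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Classify a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"
```

With this threshold, the AlaAla value from the table (6.341‰) would be labelled "over" and the AlaCys value (1.255‰) "under".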

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
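The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical and exist only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# Example with the LeuLeu entry from the table (10.214 per mille).
leu_leu_fraction = per_mille_to_fraction(10.214)
leu_leu_percent = per_mille_to_percent(10.214)
```

For instance, the uniform expectation of 2.5‰ corresponds to 0.25%, which matches the figure quoted in the text.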

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.341 ± 0.024
AlaCys: 1.255 ± 0.007
AlaAsp: 3.256 ± 0.01
AlaGlu: 4.381 ± 0.017
AlaPhe: 2.786 ± 0.011
AlaGly: 4.124 ± 0.016
AlaHis: 1.297 ± 0.009
AlaIle: 3.835 ± 0.013
AlaLys: 3.965 ± 0.012
AlaLeu: 6.676 ± 0.021
AlaMet: 1.692 ± 0.009
AlaAsn: 2.646 ± 0.011
AlaPro: 2.686 ± 0.011
AlaGln: 2.185 ± 0.009
AlaArg: 3.36 ± 0.012
AlaSer: 6.043 ± 0.02
AlaThr: 3.488 ± 0.013
AlaVal: 4.709 ± 0.017
AlaTrp: 0.818 ± 0.006
AlaTyr: 1.891 ± 0.01
AlaXaa: 0.004 ± 0.0
Cys
CysAla: 0.97 ± 0.007
CysCys: 0.569 ± 0.005
CysAsp: 0.879 ± 0.006
CysGlu: 0.907 ± 0.007
CysPhe: 0.932 ± 0.006
CysGly: 1.413 ± 0.007
CysHis: 0.489 ± 0.004
CysIle: 1.06 ± 0.007
CysLys: 1.186 ± 0.007
CysLeu: 1.954 ± 0.01
CysMet: 0.455 ± 0.004
CysAsn: 0.901 ± 0.007
CysPro: 1.041 ± 0.008
CysGln: 0.696 ± 0.005
CysArg: 1.097 ± 0.007
CysSer: 1.896 ± 0.008
CysThr: 0.906 ± 0.006
CysVal: 1.003 ± 0.007
CysTrp: 0.272 ± 0.003
CysTyr: 0.573 ± 0.005
CysXaa: 0.002 ± 0.0
Asp
AspAla: 3.545 ± 0.013
AspCys: 1.032 ± 0.007
AspAsp: 3.554 ± 0.016
AspGlu: 3.955 ± 0.013
AspPhe: 2.476 ± 0.011
AspGly: 3.72 ± 0.015
AspHis: 1.283 ± 0.007
AspIle: 3.104 ± 0.013
AspLys: 2.72 ± 0.013
AspLeu: 5.248 ± 0.018
AspMet: 1.351 ± 0.007
AspAsn: 2.062 ± 0.01
AspPro: 2.522 ± 0.011
AspGln: 1.886 ± 0.01
AspArg: 2.443 ± 0.012
AspSer: 4.175 ± 0.015
AspThr: 2.102 ± 0.009
AspVal: 3.563 ± 0.012
AspTrp: 0.734 ± 0.006
AspTyr: 1.578 ± 0.009
AspXaa: 0.004 ± 0.0
Glu
GluAla: 4.753 ± 0.015
GluCys: 0.896 ± 0.007
GluAsp: 3.983 ± 0.016
GluGlu: 6.084 ± 0.025
GluPhe: 2.459 ± 0.01
GluGly: 3.821 ± 0.014
GluHis: 1.304 ± 0.009
GluIle: 3.981 ± 0.015
GluLys: 4.758 ± 0.019
GluLeu: 6.229 ± 0.021
GluMet: 1.76 ± 0.009
GluAsn: 3.157 ± 0.015
GluPro: 2.066 ± 0.01
GluGln: 2.233 ± 0.011
GluArg: 3.334 ± 0.015
GluSer: 4.525 ± 0.018
GluThr: 2.986 ± 0.013
GluVal: 4.345 ± 0.015
GluTrp: 0.762 ± 0.006
GluTyr: 1.648 ± 0.008
GluXaa: 0.004 ± 0.001
Phe
PheAla: 2.56 ± 0.01
PheCys: 0.971 ± 0.006
PheAsp: 2.381 ± 0.012
PheGlu: 2.272 ± 0.01
PhePhe: 1.94 ± 0.01
PheGly: 3.074 ± 0.014
PheHis: 1.134 ± 0.007
PheIle: 2.084 ± 0.011
PheLys: 2.112 ± 0.008
PheLeu: 4.545 ± 0.017
PheMet: 0.939 ± 0.006
PheAsn: 1.8 ± 0.009
PhePro: 2.214 ± 0.01
PheGln: 1.678 ± 0.007
PheArg: 2.13 ± 0.008
PheSer: 4.021 ± 0.014
PheThr: 1.959 ± 0.011
PheVal: 2.692 ± 0.012
PheTrp: 0.606 ± 0.005
PheTyr: 1.276 ± 0.007
PheXaa: 0.004 ± 0.0
Gly
GlyAla: 3.819 ± 0.018
GlyCys: 1.292 ± 0.009
GlyAsp: 3.255 ± 0.01
GlyGlu: 3.674 ± 0.013
GlyPhe: 3.097 ± 0.015
GlyGly: 5.264 ± 0.037
GlyHis: 1.556 ± 0.008
GlyIle: 3.719 ± 0.013
GlyLys: 4.123 ± 0.015
GlyLeu: 5.969 ± 0.019
GlyMet: 1.504 ± 0.007
GlyAsn: 3.13 ± 0.013
GlyPro: 2.42 ± 0.011
GlyGln: 2.174 ± 0.009
GlyArg: 3.66 ± 0.016
GlySer: 5.743 ± 0.019
GlyThr: 3.142 ± 0.011
GlyVal: 3.94 ± 0.014
GlyTrp: 0.885 ± 0.006
GlyTyr: 2.049 ± 0.012
GlyXaa: 0.008 ± 0.001
His
HisAla: 1.43 ± 0.007
HisCys: 0.552 ± 0.004
HisAsp: 1.173 ± 0.007
HisGlu: 1.329 ± 0.007
HisPhe: 1.13 ± 0.007
HisGly: 1.659 ± 0.008
HisHis: 0.952 ± 0.01
HisIle: 1.248 ± 0.007
HisLys: 1.184 ± 0.008
HisLeu: 2.657 ± 0.011
HisMet: 0.545 ± 0.005
HisAsn: 0.955 ± 0.006
HisPro: 1.399 ± 0.008
HisGln: 1.1 ± 0.006
HisArg: 1.398 ± 0.009
HisSer: 1.905 ± 0.01
HisThr: 0.916 ± 0.006
HisVal: 1.536 ± 0.008
HisTrp: 0.328 ± 0.004
HisTyr: 0.718 ± 0.006
HisXaa: 0.002 ± 0.0
Ile
IleAla: 3.598 ± 0.013
IleCys: 1.167 ± 0.007
IleAsp: 2.938 ± 0.011
IleGlu: 3.224 ± 0.012
IlePhe: 2.336 ± 0.01
IleGly: 3.428 ± 0.014
IleHis: 1.358 ± 0.008
IleIle: 2.798 ± 0.012
IleLys: 3.002 ± 0.012
IleLeu: 5.538 ± 0.018
IleMet: 1.137 ± 0.006
IleAsn: 2.183 ± 0.01
IlePro: 3.144 ± 0.015
IleGln: 2.138 ± 0.01
IleArg: 2.78 ± 0.011
IleSer: 4.991 ± 0.014
IleThr: 2.519 ± 0.011
IleVal: 3.407 ± 0.013
IleTrp: 0.759 ± 0.006
IleTyr: 1.481 ± 0.008
IleXaa: 0.003 ± 0.0
Lys
LysAla: 3.983 ± 0.015
LysCys: 1.042 ± 0.007
LysAsp: 3.326 ± 0.014
LysGlu: 4.636 ± 0.022
LysPhe: 2.355 ± 0.011
LysGly: 3.49 ± 0.013
LysHis: 1.362 ± 0.008
LysIle: 3.306 ± 0.013
LysLys: 4.576 ± 0.02
LysLeu: 6.383 ± 0.017
LysMet: 1.514 ± 0.009
LysAsn: 2.687 ± 0.01
LysPro: 2.586 ± 0.013
LysGln: 2.285 ± 0.012
LysArg: 3.592 ± 0.013
LysSer: 4.585 ± 0.016
LysThr: 2.742 ± 0.009
LysVal: 3.91 ± 0.015
LysTrp: 0.806 ± 0.006
LysTyr: 1.706 ± 0.011
LysXaa: 0.005 ± 0.0
Leu
LeuAla: 6.738 ± 0.02
LeuCys: 1.933 ± 0.009
LeuAsp: 5.367 ± 0.017
LeuGlu: 6.678 ± 0.025
LeuPhe: 3.768 ± 0.015
LeuGly: 5.817 ± 0.018
LeuHis: 2.742 ± 0.011
LeuIle: 4.764 ± 0.014
LeuLys: 6.478 ± 0.019
LeuLeu: 10.214 ± 0.026
LeuMet: 2.141 ± 0.009
LeuAsn: 4.088 ± 0.013
LeuPro: 5.457 ± 0.019
LeuGln: 4.683 ± 0.018
LeuArg: 5.739 ± 0.018
LeuSer: 8.815 ± 0.026
LeuThr: 4.466 ± 0.016
LeuVal: 6.49 ± 0.018
LeuTrp: 1.233 ± 0.008
LeuTyr: 2.596 ± 0.01
LeuXaa: 0.007 ± 0.001
Met
MetAla: 2.076 ± 0.01
MetCys: 0.34 ± 0.004
MetAsp: 1.436 ± 0.008
MetGlu: 1.896 ± 0.008
MetPhe: 0.778 ± 0.006
MetGly: 1.489 ± 0.008
MetHis: 0.554 ± 0.004
MetIle: 1.215 ± 0.007
MetLys: 1.603 ± 0.009
MetLeu: 2.215 ± 0.01
MetMet: 0.66 ± 0.006
MetAsn: 0.983 ± 0.007
MetPro: 1.104 ± 0.007
MetGln: 0.959 ± 0.007
MetArg: 1.202 ± 0.007
MetSer: 1.662 ± 0.008
MetThr: 1.056 ± 0.006
MetVal: 1.624 ± 0.008
MetTrp: 0.261 ± 0.003
MetTyr: 0.561 ± 0.005
MetXaa: 0.001 ± 0.0
Asn
AsnAla: 2.721 ± 0.01
AsnCys: 0.923 ± 0.007
AsnAsp: 2.095 ± 0.01
AsnGlu: 2.553 ± 0.012
AsnPhe: 2.071 ± 0.01
AsnGly: 3.147 ± 0.013
AsnHis: 1.114 ± 0.008
AsnIle: 2.447 ± 0.012
AsnLys: 2.329 ± 0.011
AsnLeu: 5.074 ± 0.02
AsnMet: 1.087 ± 0.007
AsnAsn: 2.144 ± 0.013
AsnPro: 2.372 ± 0.011
AsnGln: 1.782 ± 0.01
AsnArg: 2.054 ± 0.009
AsnSer: 3.9 ± 0.015
AsnThr: 1.821 ± 0.009
AsnVal: 2.751 ± 0.011
AsnTrp: 0.591 ± 0.005
AsnTyr: 1.288 ± 0.008
AsnXaa: 0.003 ± 0.0
Pro
ProAla: 3.031 ± 0.012
ProCys: 0.805 ± 0.006
ProAsp: 2.484 ± 0.011
ProGlu: 3.232 ± 0.013
ProPhe: 2.009 ± 0.01
ProGly: 2.714 ± 0.011
ProHis: 1.104 ± 0.008
ProIle: 2.323 ± 0.01
ProLys: 2.707 ± 0.012
ProLeu: 4.287 ± 0.014
ProMet: 0.914 ± 0.006
ProAsn: 2.265 ± 0.011
ProPro: 3.778 ± 0.042
ProGln: 1.833 ± 0.01
ProArg: 2.448 ± 0.012
ProSer: 4.977 ± 0.017
ProThr: 2.523 ± 0.011
ProVal: 3.067 ± 0.013
ProTrp: 0.616 ± 0.005
ProTyr: 1.361 ± 0.01
ProXaa: 0.006 ± 0.0
Gln
GlnAla: 2.422 ± 0.01
GlnCys: 0.58 ± 0.004
GlnAsp: 1.743 ± 0.009
GlnGlu: 2.571 ± 0.012
GlnPhe: 1.485 ± 0.008
GlnGly: 2.252 ± 0.01
GlnHis: 0.991 ± 0.007
GlnIle: 2.157 ± 0.009
GlnLys: 2.515 ± 0.011
GlnLeu: 3.997 ± 0.016
GlnMet: 1.027 ± 0.006
GlnAsn: 1.913 ± 0.009
GlnPro: 1.774 ± 0.011
GlnGln: 2.218 ± 0.028
GlnArg: 2.191 ± 0.009
GlnSer: 2.859 ± 0.011
GlnThr: 1.71 ± 0.008
GlnVal: 2.481 ± 0.009
GlnTrp: 0.471 ± 0.004
GlnTyr: 0.964 ± 0.007
GlnXaa: 0.002 ± 0.0
Arg
ArgAla: 3.285 ± 0.013
ArgCys: 1.027 ± 0.007
ArgAsp: 2.692 ± 0.012
ArgGlu: 3.409 ± 0.016
ArgPhe: 2.231 ± 0.01
ArgGly: 3.234 ± 0.015
ArgHis: 1.321 ± 0.007
ArgIle: 3.008 ± 0.012
ArgLys: 3.82 ± 0.013
ArgLeu: 5.295 ± 0.017
ArgMet: 1.328 ± 0.006
ArgAsn: 2.549 ± 0.011
ArgPro: 2.311 ± 0.012
ArgGln: 1.982 ± 0.01
ArgArg: 3.951 ± 0.016
ArgSer: 4.28 ± 0.016
ArgThr: 2.486 ± 0.01
ArgVal: 3.252 ± 0.011
ArgTrp: 0.766 ± 0.006
ArgTyr: 1.525 ± 0.007
ArgXaa: 0.004 ± 0.0
Ser
SerAla: 5.416 ± 0.019
SerCys: 1.826 ± 0.009
SerAsp: 4.285 ± 0.016
SerGlu: 4.763 ± 0.018
SerPhe: 4.039 ± 0.014
SerGly: 5.834 ± 0.018
SerHis: 1.96 ± 0.009
SerIle: 4.594 ± 0.014
SerLys: 4.796 ± 0.016
SerLeu: 8.739 ± 0.024
SerMet: 2.017 ± 0.01
SerAsn: 4.046 ± 0.014
SerPro: 4.289 ± 0.021
SerGln: 3.133 ± 0.012
SerArg: 4.593 ± 0.016
SerSer: 10.723 ± 0.035
SerThr: 4.623 ± 0.014
SerVal: 4.991 ± 0.017
SerTrp: 1.199 ± 0.008
SerTyr: 2.261 ± 0.01
SerXaa: 0.005 ± 0.001
Thr
ThrAla: 3.395 ± 0.012
ThrCys: 0.947 ± 0.006
ThrAsp: 2.27 ± 0.01
ThrGlu: 2.751 ± 0.011
ThrPhe: 2.035 ± 0.01
ThrGly: 3.15 ± 0.011
ThrHis: 0.973 ± 0.006
ThrIle: 2.688 ± 0.011
ThrLys: 2.598 ± 0.012
ThrLeu: 4.557 ± 0.016
ThrMet: 1.092 ± 0.007
ThrAsn: 2.029 ± 0.01
ThrPro: 2.443 ± 0.01
ThrGln: 1.464 ± 0.007
ThrArg: 2.323 ± 0.011
ThrSer: 4.584 ± 0.016
ThrThr: 2.794 ± 0.013
ThrVal: 3.086 ± 0.013
ThrTrp: 0.634 ± 0.005
ThrTyr: 1.372 ± 0.009
ThrXaa: 0.004 ± 0.0
Val
ValAla: 4.769 ± 0.019
ValCys: 1.097 ± 0.006
ValAsp: 3.688 ± 0.014
ValGlu: 4.353 ± 0.016
ValPhe: 2.614 ± 0.012
ValGly: 4.021 ± 0.014
ValHis: 1.548 ± 0.007
ValIle: 3.431 ± 0.013
ValLys: 3.874 ± 0.014
ValLeu: 6.424 ± 0.018
ValMet: 1.443 ± 0.008
ValAsn: 2.635 ± 0.01
ValPro: 3.081 ± 0.012
ValGln: 2.408 ± 0.009
ValArg: 3.089 ± 0.011
ValSer: 5.184 ± 0.015
ValThr: 3.007 ± 0.011
ValVal: 4.694 ± 0.016
ValTrp: 0.767 ± 0.006
ValTyr: 1.92 ± 0.01
ValXaa: 0.004 ± 0.0
Trp
TrpAla: 0.783 ± 0.006
TrpCys: 0.257 ± 0.003
TrpAsp: 0.713 ± 0.007
TrpGlu: 0.821 ± 0.006
TrpPhe: 0.521 ± 0.005
TrpGly: 0.733 ± 0.007
TrpHis: 0.301 ± 0.003
TrpIle: 0.77 ± 0.006
TrpLys: 0.991 ± 0.007
TrpLeu: 1.269 ± 0.008
TrpMet: 0.36 ± 0.004
TrpAsn: 0.745 ± 0.006
TrpPro: 0.51 ± 0.005
TrpGln: 0.473 ± 0.004
TrpArg: 0.866 ± 0.006
TrpSer: 0.986 ± 0.007
TrpThr: 0.681 ± 0.006
TrpVal: 0.801 ± 0.006
TrpTrp: 0.254 ± 0.004
TrpTyr: 0.342 ± 0.004
TrpXaa: 0.002 ± 0.0
Tyr
TyrAla: 1.8 ± 0.009
TyrCys: 0.69 ± 0.006
TyrAsp: 1.536 ± 0.008
TyrGlu: 1.59 ± 0.008
TyrPhe: 1.325 ± 0.008
TyrGly: 2.033 ± 0.011
TyrHis: 0.77 ± 0.006
TyrIle: 1.396 ± 0.008
TyrLys: 1.535 ± 0.009
TyrLeu: 2.936 ± 0.012
TyrMet: 0.706 ± 0.006
TyrAsn: 1.346 ± 0.008
TyrPro: 1.28 ± 0.008
TyrGln: 1.01 ± 0.007
TyrArg: 1.501 ± 0.008
TyrSer: 2.272 ± 0.011
TyrThr: 1.256 ± 0.008
TyrVal: 1.707 ± 0.009
TyrTrp: 0.417 ± 0.005
TyrTyr: 1.019 ± 0.007
TyrXaa: 0.002 ± 0.0
Xaa
XaaAla: 0.004 ± 0.001
XaaCys: 0.002 ± 0.0
XaaAsp: 0.004 ± 0.0
XaaGlu: 0.006 ± 0.001
XaaPhe: 0.004 ± 0.0
XaaGly: 0.009 ± 0.001
XaaHis: 0.002 ± 0.0
XaaIle: 0.003 ± 0.0
XaaLys: 0.005 ± 0.001
XaaLeu: 0.007 ± 0.001
XaaMet: 0.001 ± 0.0
XaaAsn: 0.003 ± 0.0
XaaPro: 0.006 ± 0.001
XaaGln: 0.002 ± 0.0
XaaArg: 0.004 ± 0.0
XaaSer: 0.004 ± 0.0
XaaThr: 0.003 ± 0.0
XaaVal: 0.004 ± 0.0
XaaTrp: 0.001 ± 0.0
XaaTyr: 0.002 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 53197 proteins (25389828 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
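A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions made for illustration; the note above does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Resampling is done over whole proteins, not over individual residues.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling proteins rather than single residue pairs keeps each protein's internal composition intact, so proteins with long repeats contribute to the estimated spread as a unit.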

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski