Amino acid dipeptide frequency for Capsicum baccatum (Peruvian pepper)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
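As a minimal sketch of how such frequencies could be computed (this is not the Proteome-pI implementation), the Python below counts overlapping residue pairs within each protein and divides by the total number of pairs. The function name `dipeptide_frequencies`, the use of one-letter codes, the mapping of every non-standard letter to `X`, and the toy sequences are all assumptions made for illustration.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping pairs in all proteins.

    Pairs never span two proteins. Any non-standard letter is mapped to 'X'
    (an assumption standing in for Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping window of width 2: residues i and i+1.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}


# Toy input, not real Capsicum baccatum data.
freqs = dipeptide_frequencies(["MALLS", "LLU"])
uniform = 1 / 400  # expectation if all 400 standard dipeptides were equally likely
print(freqs["LL"], uniform)
```

Counting within each protein separately avoids creating artificial pairs across protein boundaries. Whether the database normalizes in exactly this way is not stated on the page.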

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
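To make the unit conversion concrete, the sketch below converts a per mille value to a fraction and a percentage and compares it with the uniform expectation of 2.5‰ (0.25%). The helper names are hypothetical, and the over/under rule is inferred from the description on this page rather than taken from the site's own code.

```python
UNIFORM_PER_MILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def label(value_per_mille):
    """Classify a dipeptide relative to the uniform expectation."""
    if value_per_mille > UNIFORM_PER_MILLE:
        return "over-represented"
    if value_per_mille < UNIFORM_PER_MILLE:
        return "under-represented"
    return "as expected"


# Two values taken from the table: LeuLeu and TrpTrp.
for name, value in [("LeuLeu", 9.667), ("TrpTrp", 0.247)]:
    fraction = per_mille_to_fraction(value)
    print(f"{name}: {value} per mille = {fraction * 100:.4f} % -> {label(value)}")
```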

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.588 ± 0.033
AlaCys: 1.175 ± 0.011
AlaAsp: 3.0 ± 0.017
AlaGlu: 4.045 ± 0.022
AlaPhe: 2.626 ± 0.016
AlaGly: 3.887 ± 0.019
AlaHis: 1.216 ± 0.011
AlaIle: 3.617 ± 0.02
AlaLys: 3.832 ± 0.022
AlaLeu: 6.243 ± 0.024
AlaMet: 1.803 ± 0.013
AlaAsn: 2.611 ± 0.017
AlaPro: 2.777 ± 0.021
AlaGln: 1.913 ± 0.013
AlaArg: 3.183 ± 0.019
AlaSer: 5.424 ± 0.024
AlaThr: 3.604 ± 0.021
AlaVal: 4.363 ± 0.022
AlaTrp: 0.762 ± 0.008
AlaTyr: 1.876 ± 0.014
AlaXaa: 0.004 ± 0.001
Cys
CysAla: 0.987 ± 0.009
CysCys: 0.545 ± 0.007
CysAsp: 0.957 ± 0.008
CysGlu: 0.876 ± 0.01
CysPhe: 0.915 ± 0.008
CysGly: 1.453 ± 0.013
CysHis: 0.602 ± 0.01
CysIle: 1.048 ± 0.008
CysLys: 1.18 ± 0.011
CysLeu: 1.962 ± 0.014
CysMet: 0.442 ± 0.006
CysAsn: 0.842 ± 0.008
CysPro: 0.969 ± 0.01
CysGln: 0.628 ± 0.007
CysArg: 1.03 ± 0.01
CysSer: 1.834 ± 0.012
CysThr: 0.916 ± 0.008
CysVal: 1.044 ± 0.008
CysTrp: 0.247 ± 0.004
CysTyr: 0.593 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.343 ± 0.018
AspCys: 0.991 ± 0.008
AspAsp: 3.779 ± 0.026
AspGlu: 4.167 ± 0.022
AspPhe: 2.407 ± 0.014
AspGly: 3.643 ± 0.022
AspHis: 1.195 ± 0.011
AspIle: 3.288 ± 0.019
AspLys: 2.879 ± 0.015
AspLeu: 5.343 ± 0.022
AspMet: 1.437 ± 0.01
AspAsn: 2.361 ± 0.016
AspPro: 2.424 ± 0.014
AspGln: 1.706 ± 0.012
AspArg: 2.302 ± 0.022
AspSer: 4.169 ± 0.019
AspThr: 2.279 ± 0.015
AspVal: 3.977 ± 0.018
AspTrp: 0.687 ± 0.007
AspTyr: 1.598 ± 0.012
AspXaa: 0.002 ± 0.0
Glu
GluAla: 4.326 ± 0.02
GluCys: 0.96 ± 0.01
GluAsp: 3.902 ± 0.021
GluGlu: 6.002 ± 0.051
GluPhe: 2.507 ± 0.013
GluGly: 3.61 ± 0.02
GluHis: 1.267 ± 0.009
GluIle: 4.056 ± 0.021
GluLys: 4.891 ± 0.033
GluLeu: 6.09 ± 0.027
GluMet: 1.837 ± 0.012
GluAsn: 3.24 ± 0.019
GluPro: 2.015 ± 0.015
GluGln: 2.118 ± 0.014
GluArg: 3.287 ± 0.019
GluSer: 4.595 ± 0.021
GluThr: 2.86 ± 0.015
GluVal: 4.317 ± 0.022
GluTrp: 0.737 ± 0.009
GluTyr: 1.753 ± 0.013
GluXaa: 0.002 ± 0.0
Phe
PheAla: 2.333 ± 0.014
PheCys: 0.913 ± 0.008
PheAsp: 2.46 ± 0.015
PheGlu: 2.401 ± 0.015
PhePhe: 1.833 ± 0.013
PheGly: 3.058 ± 0.019
PheHis: 1.14 ± 0.01
PheIle: 2.151 ± 0.014
PheLys: 2.294 ± 0.015
PheLeu: 4.267 ± 0.021
PheMet: 1.009 ± 0.009
PheAsn: 1.832 ± 0.012
PhePro: 2.182 ± 0.014
PheGln: 1.533 ± 0.011
PheArg: 2.026 ± 0.015
PheSer: 4.031 ± 0.018
PheThr: 2.047 ± 0.013
PheVal: 2.674 ± 0.016
PheTrp: 0.51 ± 0.007
PheTyr: 1.314 ± 0.011
PheXaa: 0.003 ± 0.0
Gly
GlyAla: 3.971 ± 0.026
GlyCys: 1.296 ± 0.011
GlyAsp: 3.261 ± 0.022
GlyGlu: 3.594 ± 0.019
GlyPhe: 3.003 ± 0.016
GlyGly: 4.847 ± 0.044
GlyHis: 1.595 ± 0.012
GlyIle: 3.611 ± 0.017
GlyLys: 4.121 ± 0.018
GlyLeu: 5.801 ± 0.025
GlyMet: 1.498 ± 0.01
GlyAsn: 3.162 ± 0.017
GlyPro: 2.366 ± 0.015
GlyGln: 1.972 ± 0.014
GlyArg: 3.593 ± 0.019
GlySer: 5.76 ± 0.025
GlyThr: 3.3 ± 0.018
GlyVal: 4.034 ± 0.018
GlyTrp: 0.839 ± 0.014
GlyTyr: 2.056 ± 0.016
GlyXaa: 0.004 ± 0.001
His
HisAla: 1.472 ± 0.011
HisCys: 0.516 ± 0.005
HisAsp: 1.206 ± 0.01
HisGlu: 1.264 ± 0.01
HisPhe: 1.097 ± 0.01
HisGly: 1.778 ± 0.017
HisHis: 0.957 ± 0.011
HisIle: 1.256 ± 0.01
HisLys: 1.152 ± 0.01
HisLeu: 2.605 ± 0.016
HisMet: 0.556 ± 0.006
HisAsn: 1.037 ± 0.011
HisPro: 1.241 ± 0.012
HisGln: 0.966 ± 0.01
HisArg: 1.301 ± 0.01
HisSer: 1.893 ± 0.014
HisThr: 0.956 ± 0.008
HisVal: 1.543 ± 0.012
HisTrp: 0.333 ± 0.005
HisTyr: 0.74 ± 0.007
HisXaa: 0.002 ± 0.0
Ile
IleAla: 3.574 ± 0.018
IleCys: 1.136 ± 0.01
IleAsp: 3.19 ± 0.018
IleGlu: 3.427 ± 0.019
IlePhe: 2.441 ± 0.015
IleGly: 3.629 ± 0.023
IleHis: 1.414 ± 0.011
IleIle: 3.011 ± 0.018
IleLys: 3.095 ± 0.015
IleLeu: 5.585 ± 0.022
IleMet: 1.232 ± 0.01
IleAsn: 2.36 ± 0.015
IlePro: 3.274 ± 0.022
IleGln: 2.033 ± 0.015
IleArg: 2.691 ± 0.015
IleSer: 5.061 ± 0.023
IleThr: 2.721 ± 0.015
IleVal: 3.696 ± 0.016
IleTrp: 0.741 ± 0.008
IleTyr: 1.559 ± 0.013
IleXaa: 0.003 ± 0.0
Lys
LysAla: 3.903 ± 0.021
LysCys: 1.084 ± 0.011
LysAsp: 3.425 ± 0.019
LysGlu: 4.79 ± 0.026
LysPhe: 2.342 ± 0.014
LysGly: 3.734 ± 0.016
LysHis: 1.37 ± 0.012
LysIle: 3.548 ± 0.018
LysLys: 5.151 ± 0.032
LysLeu: 6.327 ± 0.026
LysMet: 1.646 ± 0.012
LysAsn: 2.93 ± 0.02
LysPro: 2.501 ± 0.021
LysGln: 2.235 ± 0.014
LysArg: 3.618 ± 0.02
LysSer: 4.87 ± 0.023
LysThr: 2.83 ± 0.018
LysVal: 4.105 ± 0.02
LysTrp: 0.775 ± 0.008
LysTyr: 1.839 ± 0.012
LysXaa: 0.003 ± 0.0
Leu
LeuAla: 6.203 ± 0.025
LeuCys: 1.845 ± 0.013
LeuAsp: 5.332 ± 0.024
LeuGlu: 6.393 ± 0.033
LeuPhe: 3.859 ± 0.019
LeuGly: 5.891 ± 0.029
LeuHis: 2.492 ± 0.013
LeuIle: 4.859 ± 0.024
LeuLys: 6.453 ± 0.025
LeuLeu: 9.667 ± 0.039
LeuMet: 2.218 ± 0.014
LeuAsn: 4.159 ± 0.022
LeuPro: 5.21 ± 0.022
LeuGln: 4.211 ± 0.02
LeuArg: 5.504 ± 0.025
LeuSer: 8.722 ± 0.032
LeuThr: 4.635 ± 0.021
LeuVal: 6.531 ± 0.027
LeuTrp: 1.17 ± 0.009
LeuTyr: 2.626 ± 0.017
LeuXaa: 0.004 ± 0.0
Met
MetAla: 1.946 ± 0.011
MetCys: 0.364 ± 0.005
MetAsp: 1.514 ± 0.016
MetGlu: 1.917 ± 0.012
MetPhe: 0.834 ± 0.009
MetGly: 1.555 ± 0.011
MetHis: 0.569 ± 0.006
MetIle: 1.352 ± 0.011
MetLys: 1.752 ± 0.012
MetLeu: 2.278 ± 0.014
MetMet: 0.76 ± 0.011
MetAsn: 1.182 ± 0.01
MetPro: 1.11 ± 0.01
MetGln: 0.943 ± 0.009
MetArg: 1.213 ± 0.011
MetSer: 1.871 ± 0.012
MetThr: 1.175 ± 0.01
MetVal: 1.721 ± 0.013
MetTrp: 0.272 ± 0.004
MetTyr: 0.643 ± 0.008
MetXaa: 0.001 ± 0.0
Asn
AsnAla: 2.673 ± 0.015
AsnCys: 0.897 ± 0.008
AsnAsp: 2.406 ± 0.016
AsnGlu: 2.747 ± 0.015
AsnPhe: 2.131 ± 0.014
AsnGly: 3.137 ± 0.019
AsnHis: 1.096 ± 0.01
AsnIle: 2.805 ± 0.019
AsnLys: 2.595 ± 0.014
AsnLeu: 5.003 ± 0.028
AsnMet: 1.199 ± 0.011
AsnAsn: 2.705 ± 0.023
AsnPro: 2.33 ± 0.015
AsnGln: 1.755 ± 0.014
AsnArg: 1.96 ± 0.013
AsnSer: 3.955 ± 0.02
AsnThr: 1.973 ± 0.011
AsnVal: 3.11 ± 0.016
AsnTrp: 0.566 ± 0.007
AsnTyr: 1.393 ± 0.011
AsnXaa: 0.002 ± 0.0
Pro
ProAla: 2.802 ± 0.017
ProCys: 0.797 ± 0.008
ProAsp: 2.43 ± 0.013
ProGlu: 3.012 ± 0.02
ProPhe: 1.929 ± 0.011
ProGly: 2.49 ± 0.016
ProHis: 1.077 ± 0.01
ProIle: 2.455 ± 0.014
ProLys: 2.82 ± 0.018
ProLeu: 4.19 ± 0.018
ProMet: 0.964 ± 0.009
ProAsn: 2.396 ± 0.013
ProPro: 3.744 ± 0.062
ProGln: 1.836 ± 0.015
ProArg: 2.471 ± 0.017
ProSer: 5.048 ± 0.028
ProThr: 2.703 ± 0.017
ProVal: 2.952 ± 0.018
ProTrp: 0.849 ± 0.015
ProTyr: 1.406 ± 0.013
ProXaa: 0.004 ± 0.001
Gln
GlnAla: 2.226 ± 0.013
GlnCys: 0.616 ± 0.007
GlnAsp: 1.662 ± 0.013
GlnGlu: 2.286 ± 0.015
GlnPhe: 1.329 ± 0.009
GlnGly: 2.004 ± 0.013
GlnHis: 0.939 ± 0.009
GlnIle: 2.004 ± 0.011
GlnLys: 2.449 ± 0.016
GlnLeu: 3.68 ± 0.019
GlnMet: 0.933 ± 0.01
GlnAsn: 1.823 ± 0.014
GlnPro: 1.57 ± 0.012
GlnGln: 1.948 ± 0.023
GlnArg: 1.976 ± 0.012
GlnSer: 2.722 ± 0.015
GlnThr: 1.646 ± 0.011
GlnVal: 2.325 ± 0.013
GlnTrp: 0.461 ± 0.006
GlnTyr: 0.949 ± 0.008
GlnXaa: 0.002 ± 0.0
Arg
ArgAla: 3.134 ± 0.018
ArgCys: 1.097 ± 0.011
ArgAsp: 2.642 ± 0.019
ArgGlu: 3.192 ± 0.015
ArgPhe: 2.154 ± 0.014
ArgGly: 3.207 ± 0.021
ArgHis: 1.384 ± 0.011
ArgIle: 3.043 ± 0.016
ArgLys: 3.726 ± 0.015
ArgLeu: 4.845 ± 0.026
ArgMet: 1.348 ± 0.011
ArgAsn: 2.455 ± 0.015
ArgPro: 2.331 ± 0.018
ArgGln: 1.736 ± 0.013
ArgArg: 3.906 ± 0.022
ArgSer: 4.189 ± 0.021
ArgThr: 2.532 ± 0.016
ArgVal: 3.118 ± 0.017
ArgTrp: 0.747 ± 0.007
ArgTyr: 1.529 ± 0.012
ArgXaa: 0.005 ± 0.001
Ser
SerAla: 4.808 ± 0.023
SerCys: 1.725 ± 0.012
SerAsp: 4.34 ± 0.02
SerGlu: 4.679 ± 0.022
SerPhe: 4.002 ± 0.018
SerGly: 5.541 ± 0.024
SerHis: 1.944 ± 0.013
SerIle: 4.851 ± 0.02
SerLys: 5.136 ± 0.021
SerLeu: 8.631 ± 0.032
SerMet: 2.094 ± 0.012
SerAsn: 4.152 ± 0.022
SerPro: 4.525 ± 0.038
SerGln: 2.944 ± 0.016
SerArg: 4.392 ± 0.021
SerSer: 10.348 ± 0.044
SerThr: 4.819 ± 0.021
SerVal: 5.124 ± 0.022
SerTrp: 1.201 ± 0.012
SerTyr: 2.514 ± 0.014
SerXaa: 0.004 ± 0.001
Thr
ThrAla: 3.081 ± 0.017
ThrCys: 0.983 ± 0.01
ThrAsp: 2.35 ± 0.014
ThrGlu: 2.724 ± 0.017
ThrPhe: 2.043 ± 0.012
ThrGly: 3.336 ± 0.017
ThrHis: 1.001 ± 0.01
ThrIle: 3.003 ± 0.015
ThrLys: 2.749 ± 0.017
ThrLeu: 4.733 ± 0.018
ThrMet: 1.251 ± 0.01
ThrAsn: 2.239 ± 0.014
ThrPro: 2.559 ± 0.02
ThrGln: 1.45 ± 0.012
ThrArg: 2.461 ± 0.014
ThrSer: 4.884 ± 0.022
ThrThr: 3.11 ± 0.018
ThrVal: 3.276 ± 0.016
ThrTrp: 0.602 ± 0.007
ThrTyr: 1.457 ± 0.014
ThrXaa: 0.003 ± 0.001
Val
ValAla: 4.647 ± 0.022
ValCys: 1.138 ± 0.009
ValAsp: 3.832 ± 0.017
ValGlu: 4.513 ± 0.024
ValPhe: 2.671 ± 0.014
ValGly: 4.027 ± 0.018
ValHis: 1.567 ± 0.012
ValIle: 3.544 ± 0.018
ValLys: 4.089 ± 0.021
ValLeu: 6.361 ± 0.026
ValMet: 1.608 ± 0.013
ValAsn: 2.78 ± 0.019
ValPro: 3.424 ± 0.023
ValGln: 2.283 ± 0.015
ValArg: 3.057 ± 0.017
ValSer: 5.088 ± 0.022
ValThr: 3.226 ± 0.016
ValVal: 4.964 ± 0.026
ValTrp: 0.731 ± 0.008
ValTyr: 1.985 ± 0.012
ValXaa: 0.003 ± 0.0
Trp
TrpAla: 0.713 ± 0.008
TrpCys: 0.35 ± 0.009
TrpAsp: 0.67 ± 0.008
TrpGlu: 0.741 ± 0.007
TrpPhe: 0.549 ± 0.007
TrpGly: 0.693 ± 0.008
TrpHis: 0.346 ± 0.007
TrpIle: 0.789 ± 0.009
TrpLys: 0.993 ± 0.009
TrpLeu: 1.24 ± 0.01
TrpMet: 0.39 ± 0.013
TrpAsn: 0.744 ± 0.008
TrpPro: 0.456 ± 0.007
TrpGln: 0.394 ± 0.005
TrpArg: 0.877 ± 0.01
TrpSer: 0.964 ± 0.009
TrpThr: 0.642 ± 0.008
TrpVal: 0.774 ± 0.008
TrpTrp: 0.247 ± 0.005
TrpTyr: 0.356 ± 0.005
TrpXaa: 0.001 ± 0.0
Tyr
TyrAla: 1.812 ± 0.012
TyrCys: 0.64 ± 0.007
TyrAsp: 1.617 ± 0.011
TyrGlu: 1.6 ± 0.012
TyrPhe: 1.333 ± 0.011
TyrGly: 2.161 ± 0.014
TyrHis: 0.699 ± 0.008
TyrIle: 1.579 ± 0.012
TyrLys: 1.666 ± 0.018
TyrLeu: 3.129 ± 0.015
TyrMet: 0.747 ± 0.008
TyrAsn: 1.428 ± 0.013
TyrPro: 1.362 ± 0.012
TyrGln: 0.942 ± 0.009
TyrArg: 1.47 ± 0.011
TyrSer: 2.34 ± 0.014
TyrThr: 1.32 ± 0.011
TyrVal: 1.887 ± 0.018
TyrTrp: 0.453 ± 0.006
TyrTyr: 1.067 ± 0.016
TyrXaa: 0.001 ± 0.0
Xaa
XaaAla: 0.004 ± 0.001
XaaCys: 0.002 ± 0.0
XaaAsp: 0.002 ± 0.0
XaaGlu: 0.002 ± 0.0
XaaPhe: 0.002 ± 0.0
XaaGly: 0.005 ± 0.001
XaaHis: 0.001 ± 0.0
XaaIle: 0.002 ± 0.0
XaaLys: 0.002 ± 0.0
XaaLeu: 0.005 ± 0.001
XaaMet: 0.001 ± 0.0
XaaAsn: 0.003 ± 0.0
XaaPro: 0.004 ± 0.001
XaaGln: 0.001 ± 0.0
XaaArg: 0.004 ± 0.001
XaaSer: 0.005 ± 0.001
XaaThr: 0.003 ± 0.0
XaaVal: 0.002 ± 0.0
XaaTrp: 0.001 ± 0.0
XaaTyr: 0.001 ± 0.0
XaaXaa: 0.089 ± 0.015
Statistics based on 35641 proteins (13376091 amino acids)

Note: The error was estimated by bootstrapping (100 resamplings) at the protein level.
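Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is a minimal illustration under stated assumptions, not the Proteome-pI code: the function names, the use of the sample standard deviation as the error, the one-letter codes, and the toy sequences are all hypothetical.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the spread of a dipeptide frequency (per mille) by resampling
    whole proteins with replacement, n_rounds times."""
    rng = random.Random(seed)
    # Precompute per-protein (hits, total pairs) so each round is cheap.
    per_protein = [(count_pair(p, pair), max(len(p) - 1, 0)) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not real proteome data.
toy = ["MALLSLL", "MKLLA", "MSSSA", "MLLLL"]
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so pairs that cluster in a few proteins (repeats, for example) widen the error estimate accordingly.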

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski