Amino acid dipeptide frequency for Capsicum chinense (Scotch bonnet) (Bonnet pepper)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
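As a minimal sketch of such a calculation (not the Proteome-pI implementation), the snippet below counts overlapping adjacent residue pairs in one-letter protein sequences and reports them in per mille (‰). The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and it is an assumption that pairs never span two proteins.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Pairs are overlapping and counted within each protein only
    (an assumption made for this sketch).
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up sequences, for illustration only
toy_proteome = ["MKLLS", "MSSLL"]
freqs = dipeptide_frequencies(toy_proteome)
```

Here `freqs` maps each observed pair (one-letter codes, e.g. "LL") to its share of all pairs in the toy input.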

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
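The arithmetic behind the expected value can be sketched as follows. The observed values are copied from the table on this page. The simple comparison against the uniform expectation is an assumption made for illustration, since the exact rule used for the red and blue marking is not stated here, and the name `classify` is hypothetical.

```python
N_STANDARD = 20

# Uniform expectation: 1 / 400 of all dipeptides, i.e. 2.5 per mille (0.25 %)
expected_permille = 1000.0 / (N_STANDARD ** 2)

# A few observed values (per mille) copied from the table
observed = {
    "LeuLeu": 9.721,
    "SerSer": 10.295,
    "AlaAla": 5.546,
    "TrpTrp": 0.23,
    "CysCys": 0.537,
}


def classify(value, expected=expected_permille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


labels = {pair: classify(value) for pair, value in observed.items()}
```

Under this assumed rule, LeuLeu, SerSer and AlaAla come out as overrepresented, while TrpTrp and CysCys come out as underrepresented.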

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
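As a small worked example of this conversion, the sketch below turns one table value into a fraction and a rough occurrence count. The protein and amino acid totals are the ones quoted on this page. The number of dipeptides is an assumption (overlapping pairs within each protein, so a protein of length n contributes n - 1 pairs), and `permille_to_fraction` is a hypothetical helper.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3


# Values quoted on this page
leu_leu_permille = 9.721
n_amino_acids = 12_923_466
n_proteins = 34_757

# Assumption: each protein of length n contributes n - 1 overlapping pairs
n_dipeptides = n_amino_acids - n_proteins

fraction = permille_to_fraction(leu_leu_permille)
approx_count = round(fraction * n_dipeptides)
```

Under these assumptions, LeuLeu corresponds to roughly 125,000 occurrences in the proteome.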

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.546 ± 0.032
AlaCys: 1.173 ± 0.011
AlaAsp: 2.911 ± 0.016
AlaGlu: 4.016 ± 0.021
AlaPhe: 2.62 ± 0.016
AlaGly: 3.76 ± 0.02
AlaHis: 1.223 ± 0.011
AlaIle: 3.721 ± 0.017
AlaLys: 3.926 ± 0.019
AlaLeu: 6.166 ± 0.026
AlaMet: 1.681 ± 0.012
AlaAsn: 2.686 ± 0.014
AlaPro: 2.64 ± 0.018
AlaGln: 1.956 ± 0.012
AlaArg: 3.067 ± 0.018
AlaSer: 5.442 ± 0.021
AlaThr: 3.479 ± 0.018
AlaVal: 4.312 ± 0.021
AlaTrp: 0.715 ± 0.008
AlaTyr: 1.867 ± 0.013
AlaXaa: 0.003 ± 0.0
Cys
CysAla: 0.975 ± 0.01
CysCys: 0.537 ± 0.007
CysAsp: 0.981 ± 0.009
CysGlu: 0.884 ± 0.009
CysPhe: 0.945 ± 0.009
CysGly: 1.408 ± 0.012
CysHis: 0.495 ± 0.007
CysIle: 1.073 ± 0.009
CysLys: 1.196 ± 0.011
CysLeu: 1.892 ± 0.014
CysMet: 0.437 ± 0.006
CysAsn: 0.861 ± 0.009
CysPro: 0.935 ± 0.01
CysGln: 0.633 ± 0.008
CysArg: 0.998 ± 0.009
CysSer: 1.823 ± 0.012
CysThr: 0.903 ± 0.009
CysVal: 1.046 ± 0.009
CysTrp: 0.252 ± 0.005
CysTyr: 0.602 ± 0.007
CysXaa: 0.001 ± 0.0
Asp
AspAla: 3.319 ± 0.016
AspCys: 1.018 ± 0.009
AspAsp: 3.764 ± 0.024
AspGlu: 4.219 ± 0.023
AspPhe: 2.499 ± 0.016
AspGly: 3.625 ± 0.02
AspHis: 1.24 ± 0.011
AspIle: 3.317 ± 0.02
AspLys: 2.939 ± 0.014
AspLeu: 5.381 ± 0.026
AspMet: 1.435 ± 0.01
AspAsn: 2.358 ± 0.016
AspPro: 2.486 ± 0.015
AspGln: 1.745 ± 0.013
AspArg: 2.246 ± 0.014
AspSer: 4.138 ± 0.018
AspThr: 2.287 ± 0.015
AspVal: 3.992 ± 0.018
AspTrp: 0.704 ± 0.007
AspTyr: 1.681 ± 0.011
AspXaa: 0.002 ± 0.0
Glu
GluAla: 4.355 ± 0.021
GluCys: 0.984 ± 0.008
GluAsp: 3.965 ± 0.022
GluGlu: 6.147 ± 0.054
GluPhe: 2.526 ± 0.016
GluGly: 3.558 ± 0.016
GluHis: 1.291 ± 0.01
GluIle: 4.087 ± 0.021
GluLys: 4.971 ± 0.035
GluLeu: 6.097 ± 0.026
GluMet: 1.856 ± 0.013
GluAsn: 3.315 ± 0.019
GluPro: 1.974 ± 0.018
GluGln: 2.146 ± 0.016
GluArg: 3.211 ± 0.02
GluSer: 4.597 ± 0.021
GluThr: 2.881 ± 0.017
GluVal: 4.355 ± 0.021
GluTrp: 0.732 ± 0.006
GluTyr: 1.835 ± 0.012
GluXaa: 0.003 ± 0.001
Phe
PheAla: 2.371 ± 0.013
PheCys: 0.895 ± 0.009
PheAsp: 2.533 ± 0.015
PheGlu: 2.411 ± 0.017
PhePhe: 1.917 ± 0.016
PheGly: 3.121 ± 0.019
PheHis: 1.137 ± 0.009
PheIle: 2.216 ± 0.014
PheLys: 2.323 ± 0.014
PheLeu: 4.361 ± 0.024
PheMet: 1.034 ± 0.008
PheAsn: 1.89 ± 0.013
PhePro: 2.138 ± 0.014
PheGln: 1.549 ± 0.011
PheArg: 1.949 ± 0.012
PheSer: 4.045 ± 0.021
PheThr: 2.024 ± 0.013
PheVal: 2.813 ± 0.017
PheTrp: 0.536 ± 0.007
PheTyr: 1.341 ± 0.011
PheXaa: 0.002 ± 0.0
Gly
GlyAla: 3.677 ± 0.02
GlyCys: 1.248 ± 0.011
GlyAsp: 3.288 ± 0.018
GlyGlu: 3.588 ± 0.021
GlyPhe: 3.023 ± 0.017
GlyGly: 4.935 ± 0.053
GlyHis: 1.492 ± 0.011
GlyIle: 3.671 ± 0.019
GlyLys: 4.168 ± 0.02
GlyLeu: 5.858 ± 0.025
GlyMet: 1.522 ± 0.012
GlyAsn: 3.133 ± 0.019
GlyPro: 2.319 ± 0.014
GlyGln: 1.927 ± 0.013
GlyArg: 3.338 ± 0.019
GlySer: 5.534 ± 0.023
GlyThr: 3.18 ± 0.016
GlyVal: 4.168 ± 0.021
GlyTrp: 0.823 ± 0.008
GlyTyr: 2.126 ± 0.014
GlyXaa: 0.004 ± 0.001
His
HisAla: 1.329 ± 0.011
HisCys: 0.514 ± 0.006
HisAsp: 1.207 ± 0.01
HisGlu: 1.254 ± 0.01
HisPhe: 1.12 ± 0.009
HisGly: 1.651 ± 0.013
HisHis: 0.911 ± 0.012
HisIle: 1.287 ± 0.009
HisLys: 1.221 ± 0.011
HisLeu: 2.505 ± 0.015
HisMet: 0.567 ± 0.007
HisAsn: 1.069 ± 0.01
HisPro: 1.273 ± 0.012
HisGln: 0.97 ± 0.01
HisArg: 1.267 ± 0.011
HisSer: 1.842 ± 0.014
HisThr: 0.963 ± 0.009
HisVal: 1.543 ± 0.011
HisTrp: 0.338 ± 0.005
HisTyr: 0.779 ± 0.008
HisXaa: 0.001 ± 0.0
Ile
IleAla: 3.654 ± 0.019
IleCys: 1.153 ± 0.01
IleAsp: 3.244 ± 0.017
IleGlu: 3.418 ± 0.02
IlePhe: 2.48 ± 0.018
IleGly: 3.581 ± 0.021
IleHis: 1.407 ± 0.01
IleIle: 3.094 ± 0.018
IleLys: 3.252 ± 0.014
IleLeu: 5.666 ± 0.023
IleMet: 1.271 ± 0.01
IleAsn: 2.528 ± 0.014
IlePro: 3.275 ± 0.02
IleGln: 2.085 ± 0.014
IleArg: 2.677 ± 0.014
IleSer: 5.126 ± 0.021
IleThr: 2.78 ± 0.016
IleVal: 3.731 ± 0.018
IleTrp: 0.762 ± 0.008
IleTyr: 1.619 ± 0.013
IleXaa: 0.003 ± 0.001
Lys
LysAla: 3.96 ± 0.019
LysCys: 1.125 ± 0.01
LysAsp: 3.496 ± 0.017
LysGlu: 4.921 ± 0.027
LysPhe: 2.427 ± 0.014
LysGly: 3.759 ± 0.017
LysHis: 1.416 ± 0.012
LysIle: 3.689 ± 0.017
LysLys: 5.272 ± 0.031
LysLeu: 6.431 ± 0.03
LysMet: 1.734 ± 0.012
LysAsn: 3.045 ± 0.015
LysPro: 2.566 ± 0.019
LysGln: 2.269 ± 0.014
LysArg: 3.639 ± 0.017
LysSer: 4.926 ± 0.022
LysThr: 2.903 ± 0.017
LysVal: 4.156 ± 0.02
LysTrp: 0.803 ± 0.009
LysTyr: 1.912 ± 0.013
LysXaa: 0.003 ± 0.0
Leu
LeuAla: 6.152 ± 0.022
LeuCys: 1.844 ± 0.013
LeuAsp: 5.362 ± 0.024
LeuGlu: 6.404 ± 0.03
LeuPhe: 3.914 ± 0.02
LeuGly: 5.703 ± 0.021
LeuHis: 2.528 ± 0.016
LeuIle: 4.978 ± 0.022
LeuLys: 6.601 ± 0.027
LeuLeu: 9.721 ± 0.037
LeuMet: 2.278 ± 0.015
LeuAsn: 4.195 ± 0.019
LeuPro: 5.059 ± 0.026
LeuGln: 4.19 ± 0.019
LeuArg: 5.294 ± 0.022
LeuSer: 8.685 ± 0.037
LeuThr: 4.613 ± 0.02
LeuVal: 6.46 ± 0.024
LeuTrp: 1.188 ± 0.011
LeuTyr: 2.695 ± 0.016
LeuXaa: 0.004 ± 0.001
Met
MetAla: 1.905 ± 0.013
MetCys: 0.351 ± 0.005
MetAsp: 1.516 ± 0.011
MetGlu: 1.955 ± 0.014
MetPhe: 0.881 ± 0.007
MetGly: 1.559 ± 0.01
MetHis: 0.562 ± 0.007
MetIle: 1.318 ± 0.012
MetLys: 1.799 ± 0.012
MetLeu: 2.317 ± 0.015
MetMet: 0.766 ± 0.01
MetAsn: 1.151 ± 0.01
MetPro: 1.106 ± 0.01
MetGln: 0.95 ± 0.009
MetArg: 1.208 ± 0.01
MetSer: 1.878 ± 0.012
MetThr: 1.206 ± 0.01
MetVal: 1.727 ± 0.013
MetTrp: 0.268 ± 0.005
MetTyr: 0.669 ± 0.007
MetXaa: 0.001 ± 0.0
Asn
AsnAla: 2.721 ± 0.015
AsnCys: 0.938 ± 0.009
AsnAsp: 2.428 ± 0.017
AsnGlu: 2.815 ± 0.015
AsnPhe: 2.138 ± 0.015
AsnGly: 3.197 ± 0.018
AsnHis: 1.137 ± 0.01
AsnIle: 2.88 ± 0.016
AsnLys: 2.698 ± 0.016
AsnLeu: 5.052 ± 0.027
AsnMet: 1.217 ± 0.01
AsnAsn: 2.784 ± 0.021
AsnPro: 2.295 ± 0.017
AsnGln: 1.705 ± 0.012
AsnArg: 1.991 ± 0.014
AsnSer: 4.041 ± 0.02
AsnThr: 2.016 ± 0.014
AsnVal: 3.079 ± 0.016
AsnTrp: 0.596 ± 0.008
AsnTyr: 1.486 ± 0.01
AsnXaa: 0.003 ± 0.0
Pro
ProAla: 2.674 ± 0.015
ProCys: 0.747 ± 0.008
ProAsp: 2.314 ± 0.016
ProGlu: 2.948 ± 0.019
ProPhe: 1.947 ± 0.014
ProGly: 2.433 ± 0.016
ProHis: 1.056 ± 0.01
ProIle: 2.461 ± 0.014
ProLys: 2.901 ± 0.017
ProLeu: 4.157 ± 0.018
ProMet: 0.973 ± 0.009
ProAsn: 2.408 ± 0.015
ProPro: 3.583 ± 0.058
ProGln: 1.709 ± 0.014
ProArg: 2.28 ± 0.015
ProSer: 4.928 ± 0.024
ProThr: 2.584 ± 0.017
ProVal: 2.9 ± 0.019
ProTrp: 0.672 ± 0.009
ProTyr: 1.358 ± 0.013
ProXaa: 0.003 ± 0.0
Gln
GlnAla: 2.158 ± 0.014
GlnCys: 0.613 ± 0.007
GlnAsp: 1.684 ± 0.013
GlnGlu: 2.343 ± 0.016
GlnPhe: 1.356 ± 0.01
GlnGly: 1.967 ± 0.014
GlnHis: 0.969 ± 0.009
GlnIle: 2.033 ± 0.012
GlnLys: 2.447 ± 0.016
GlnLeu: 3.655 ± 0.019
GlnMet: 0.96 ± 0.009
GlnAsn: 1.835 ± 0.013
GlnPro: 1.56 ± 0.014
GlnGln: 2.022 ± 0.026
GlnArg: 1.925 ± 0.013
GlnSer: 2.674 ± 0.016
GlnThr: 1.627 ± 0.011
GlnVal: 2.382 ± 0.015
GlnTrp: 0.433 ± 0.007
GlnTyr: 0.983 ± 0.008
GlnXaa: 0.002 ± 0.0
Arg
ArgAla: 2.915 ± 0.017
ArgCys: 0.968 ± 0.008
ArgAsp: 2.637 ± 0.013
ArgGlu: 3.167 ± 0.019
ArgPhe: 2.105 ± 0.014
ArgGly: 3.022 ± 0.019
ArgHis: 1.218 ± 0.011
ArgIle: 2.961 ± 0.015
ArgLys: 3.814 ± 0.019
ArgLeu: 4.682 ± 0.021
ArgMet: 1.318 ± 0.009
ArgAsn: 2.462 ± 0.014
ArgPro: 2.124 ± 0.016
ArgGln: 1.676 ± 0.013
ArgArg: 3.678 ± 0.022
ArgSer: 4.02 ± 0.023
ArgThr: 2.392 ± 0.015
ArgVal: 3.146 ± 0.017
ArgTrp: 0.699 ± 0.008
ArgTyr: 1.53 ± 0.011
ArgXaa: 0.003 ± 0.001
Ser
SerAla: 4.849 ± 0.022
SerCys: 1.776 ± 0.014
SerAsp: 4.367 ± 0.019
SerGlu: 4.705 ± 0.028
SerPhe: 3.937 ± 0.018
SerGly: 5.591 ± 0.028
SerHis: 1.925 ± 0.014
SerIle: 4.87 ± 0.024
SerLys: 5.187 ± 0.022
SerLeu: 8.564 ± 0.037
SerMet: 2.117 ± 0.014
SerAsn: 4.15 ± 0.02
SerPro: 4.263 ± 0.033
SerGln: 2.914 ± 0.016
SerArg: 4.246 ± 0.023
SerSer: 10.295 ± 0.042
SerThr: 4.771 ± 0.023
SerVal: 5.083 ± 0.02
SerTrp: 1.201 ± 0.012
SerTyr: 2.493 ± 0.015
SerXaa: 0.005 ± 0.001
Thr
ThrAla: 3.089 ± 0.017
ThrCys: 0.923 ± 0.009
ThrAsp: 2.398 ± 0.015
ThrGlu: 2.749 ± 0.016
ThrPhe: 2.091 ± 0.012
ThrGly: 3.17 ± 0.018
ThrHis: 1.005 ± 0.009
ThrIle: 3.043 ± 0.016
ThrLys: 2.842 ± 0.017
ThrLeu: 4.689 ± 0.02
ThrMet: 1.2 ± 0.01
ThrAsn: 2.276 ± 0.014
ThrPro: 2.515 ± 0.017
ThrGln: 1.495 ± 0.01
ThrArg: 2.334 ± 0.013
ThrSer: 4.73 ± 0.021
ThrThr: 3.085 ± 0.017
ThrVal: 3.255 ± 0.017
ThrTrp: 0.614 ± 0.007
ThrTyr: 1.469 ± 0.011
ThrXaa: 0.003 ± 0.0
Val
ValAla: 4.633 ± 0.023
ValCys: 1.145 ± 0.011
ValAsp: 3.912 ± 0.019
ValGlu: 4.516 ± 0.021
ValPhe: 2.695 ± 0.015
ValGly: 4.072 ± 0.02
ValHis: 1.547 ± 0.01
ValIle: 3.708 ± 0.017
ValLys: 4.172 ± 0.019
ValLeu: 6.339 ± 0.026
ValMet: 1.598 ± 0.012
ValAsn: 2.868 ± 0.018
ValPro: 3.106 ± 0.023
ValGln: 2.296 ± 0.012
ValArg: 2.942 ± 0.017
ValSer: 5.178 ± 0.023
ValThr: 3.27 ± 0.016
ValVal: 5.08 ± 0.024
ValTrp: 0.767 ± 0.008
ValTyr: 2.06 ± 0.016
ValXaa: 0.003 ± 0.0
Trp
TrpAla: 0.747 ± 0.008
TrpCys: 0.249 ± 0.006
TrpAsp: 0.686 ± 0.008
TrpGlu: 0.737 ± 0.008
TrpPhe: 0.57 ± 0.008
TrpGly: 0.721 ± 0.009
TrpHis: 0.324 ± 0.006
TrpIle: 0.798 ± 0.008
TrpLys: 1.012 ± 0.01
TrpLeu: 1.198 ± 0.01
TrpMet: 0.356 ± 0.005
TrpAsn: 0.738 ± 0.009
TrpPro: 0.465 ± 0.007
TrpGln: 0.407 ± 0.005
TrpArg: 0.785 ± 0.008
TrpSer: 0.983 ± 0.009
TrpThr: 0.636 ± 0.007
TrpVal: 0.766 ± 0.007
TrpTrp: 0.23 ± 0.005
TrpTyr: 0.36 ± 0.005
TrpXaa: 0.001 ± 0.0
Tyr
TyrAla: 1.881 ± 0.013
TyrCys: 0.672 ± 0.008
TyrAsp: 1.699 ± 0.012
TyrGlu: 1.685 ± 0.013
TyrPhe: 1.413 ± 0.012
TyrGly: 2.185 ± 0.017
TyrHis: 0.728 ± 0.007
TyrIle: 1.596 ± 0.011
TyrLys: 1.708 ± 0.017
TyrLeu: 3.134 ± 0.019
TyrMet: 0.77 ± 0.007
TyrAsn: 1.461 ± 0.012
TyrPro: 1.352 ± 0.011
TyrGln: 0.983 ± 0.008
TyrArg: 1.459 ± 0.012
TyrSer: 2.417 ± 0.014
TyrThr: 1.375 ± 0.01
TyrVal: 1.91 ± 0.013
TyrTrp: 0.436 ± 0.006
TyrTyr: 1.114 ± 0.018
TyrXaa: 0.002 ± 0.0
Xaa
XaaAla: 0.002 ± 0.0
XaaCys: 0.001 ± 0.0
XaaAsp: 0.002 ± 0.0
XaaGlu: 0.002 ± 0.0
XaaPhe: 0.002 ± 0.0
XaaGly: 0.004 ± 0.0
XaaHis: 0.001 ± 0.0
XaaIle: 0.004 ± 0.0
XaaLys: 0.003 ± 0.0
XaaLeu: 0.005 ± 0.001
XaaMet: 0.002 ± 0.0
XaaAsn: 0.003 ± 0.0
XaaPro: 0.004 ± 0.001
XaaGln: 0.001 ± 0.0
XaaArg: 0.003 ± 0.001
XaaSer: 0.004 ± 0.001
XaaThr: 0.003 ± 0.0
XaaVal: 0.002 ± 0.0
XaaTrp: 0.001 ± 0.0
XaaTyr: 0.001 ± 0.0
XaaXaa: 0.077 ± 0.015
Statistics based on 34757 proteins (12923466 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
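A protein-level bootstrap of this kind could be sketched as below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement 100 times, and the standard deviation of the resulting per mille frequencies is taken as the error. The names `pair_counts` and `bootstrap_error` and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Std. dev. of the per mille frequency of `pair` over protein-level resamples."""
    rng = random.Random(seed)
    # Precompute (hits, total pairs) once per protein
    per_protein = [(pair_counts(s)[pair], max(len(s) - 1, 0)) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Made-up sequences, for illustration only
toy = ["MKLLSLL", "MSSLLA", "MAAAK", "MLLLL", "MKSTV"]
err = bootstrap_error(toy, "LL")
```

Resampling whole proteins rather than individual pairs keeps residues from the same protein together, which is what "at the protein level" is taken to mean here.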

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski