Amino acid dipeptide frequency for Penicillium subrubescens

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
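The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build this page: dipeptides are counted as overlapping adjacent pairs, sequences use one-letter codes, any non-standard letter is mapped to X (Xaa), and the "expected" level is the uniform 1/400. The function names and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

# Uniform expectation over the 20 x 20 standard dipeptides: 1/400 = 0.25%.
EXPECTED = 1 / (len(STANDARD) ** 2)


def dipeptide_frequencies(sequences):
    """Return {dipeptide: fraction} over all overlapping adjacent pairs.

    Assumption: pairs are counted within each protein only,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map any non-standard or unknown residue to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


def classify(freq, expected=EXPECTED):
    """Label a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"   # would be marked red
    if freq < expected:
        return "under"  # would be marked blue
    return "expected"


# Hypothetical toy input, not taken from the proteome.
toy_proteins = ["MAALLK", "AAXL"]
freqs = dipeptide_frequencies(toy_proteins)
print(freqs["AA"], classify(freqs["AA"]))
```

On a real proteome the same function would be fed all protein sequences at once, so that long proteins contribute proportionally more dipeptides than short ones.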

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
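As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and expresses the uniform expectation of 1/400 in per mille. The helper names are hypothetical; the AlaAla value is the one reported in the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille (‰) value to a plain fraction."""
    return value * 1e-3


def fraction_to_per_mille(fraction):
    """Convert a plain fraction to per mille (‰)."""
    return fraction * 1e3


ala_ala = per_mille_to_fraction(8.837)      # AlaAla: 8.837‰ is about 0.008837
uniform = fraction_to_per_mille(1 / 400)    # uniform expectation, about 2.5‰
print(ala_ala, uniform)
```

In these units, a table entry above roughly 2.5 is more frequent than the uniform expectation, and an entry below it is less frequent.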

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 8.837 ± 0.064 | 1.064 ± 0.015 | 4.285 ± 0.029 | 5.123 ± 0.036 | 3.171 ± 0.028 | 5.872 ± 0.038 | 1.809 ± 0.018 | 4.404 ± 0.033 | 3.93 ± 0.028 | 7.862 ± 0.042 | 2.048 ± 0.021 | 2.993 ± 0.025 | 4.718 ± 0.044 | 3.459 ± 0.027 | 4.875 ± 0.034 | 7.232 ± 0.043 | 5.309 ± 0.033 | 5.401 ± 0.035 | 1.189 ± 0.014 | 2.176 ± 0.02 | 0.001 ± 0.001
Cys | 0.938 ± 0.014 | 0.248 ± 0.008 | 0.677 ± 0.012 | 0.605 ± 0.011 | 0.551 ± 0.011 | 0.952 ± 0.015 | 0.327 ± 0.008 | 0.677 ± 0.011 | 0.488 ± 0.009 | 1.287 ± 0.015 | 0.278 ± 0.008 | 0.431 ± 0.009 | 0.647 ± 0.012 | 0.479 ± 0.009 | 0.748 ± 0.014 | 0.895 ± 0.013 | 0.692 ± 0.013 | 0.827 ± 0.013 | 0.206 ± 0.006 | 0.364 ± 0.008 | 0.0 ± 0.0
Asp | 4.612 ± 0.033 | 0.628 ± 0.011 | 3.899 ± 0.038 | 4.326 ± 0.039 | 2.194 ± 0.02 | 3.925 ± 0.035 | 1.318 ± 0.015 | 3.129 ± 0.027 | 2.221 ± 0.023 | 5.21 ± 0.035 | 1.273 ± 0.015 | 1.863 ± 0.018 | 3.373 ± 0.024 | 2.07 ± 0.021 | 3.118 ± 0.028 | 4.197 ± 0.029 | 2.982 ± 0.028 | 3.586 ± 0.029 | 0.907 ± 0.012 | 1.63 ± 0.017 | 0.001 ± 0.0
Glu | 5.266 ± 0.038 | 0.631 ± 0.011 | 4.134 ± 0.033 | 5.247 ± 0.062 | 2.014 ± 0.019 | 3.642 ± 0.028 | 1.359 ± 0.018 | 3.237 ± 0.027 | 3.542 ± 0.029 | 5.143 ± 0.037 | 1.541 ± 0.018 | 2.371 ± 0.02 | 2.811 ± 0.039 | 2.429 ± 0.024 | 3.764 ± 0.035 | 4.399 ± 0.031 | 3.571 ± 0.026 | 3.5 ± 0.029 | 0.911 ± 0.014 | 1.695 ± 0.019 | 0.0 ± 0.0
Phe | 3.104 ± 0.028 | 0.559 ± 0.011 | 2.342 ± 0.02 | 2.146 ± 0.021 | 1.694 ± 0.022 | 2.921 ± 0.032 | 0.945 ± 0.012 | 1.868 ± 0.02 | 1.479 ± 0.015 | 3.565 ± 0.031 | 0.831 ± 0.012 | 1.452 ± 0.015 | 1.972 ± 0.018 | 1.476 ± 0.015 | 1.947 ± 0.017 | 2.98 ± 0.025 | 2.161 ± 0.021 | 2.437 ± 0.021 | 0.684 ± 0.011 | 1.161 ± 0.015 | 0.0 ± 0.0
Gly | 5.364 ± 0.037 | 0.892 ± 0.014 | 3.596 ± 0.026 | 3.531 ± 0.026 | 2.861 ± 0.024 | 5.498 ± 0.047 | 1.763 ± 0.021 | 3.681 ± 0.03 | 3.28 ± 0.025 | 6.249 ± 0.043 | 1.638 ± 0.016 | 2.542 ± 0.026 | 3.386 ± 0.03 | 2.642 ± 0.023 | 4.011 ± 0.031 | 5.829 ± 0.039 | 3.954 ± 0.031 | 4.433 ± 0.032 | 1.208 ± 0.016 | 2.226 ± 0.02 | 0.0 ± 0.0
His | 1.86 ± 0.019 | 0.333 ± 0.007 | 1.346 ± 0.016 | 1.387 ± 0.017 | 0.948 ± 0.014 | 1.738 ± 0.019 | 0.852 ± 0.016 | 1.228 ± 0.015 | 0.849 ± 0.012 | 2.296 ± 0.021 | 0.505 ± 0.011 | 0.846 ± 0.014 | 1.667 ± 0.02 | 1.018 ± 0.016 | 1.578 ± 0.019 | 1.859 ± 0.02 | 1.288 ± 0.015 | 1.418 ± 0.016 | 0.38 ± 0.009 | 0.714 ± 0.011 | 0.001 ± 0.0
Ile | 4.343 ± 0.033 | 0.789 ± 0.011 | 2.935 ± 0.023 | 2.939 ± 0.022 | 2.109 ± 0.022 | 3.349 ± 0.033 | 1.258 ± 0.016 | 2.625 ± 0.027 | 2.109 ± 0.02 | 4.717 ± 0.037 | 1.073 ± 0.015 | 1.832 ± 0.018 | 3.189 ± 0.025 | 2.033 ± 0.021 | 2.818 ± 0.022 | 4.009 ± 0.026 | 2.927 ± 0.027 | 3.208 ± 0.027 | 0.76 ± 0.011 | 1.497 ± 0.018 | 0.0 ± 0.0
Lys | 4.065 ± 0.03 | 0.49 ± 0.011 | 2.659 ± 0.022 | 3.164 ± 0.028 | 1.47 ± 0.015 | 2.821 ± 0.022 | 1.069 ± 0.012 | 2.25 ± 0.021 | 3.023 ± 0.042 | 3.911 ± 0.026 | 1.012 ± 0.014 | 1.746 ± 0.02 | 2.589 ± 0.027 | 1.79 ± 0.016 | 3.164 ± 0.031 | 3.389 ± 0.025 | 2.701 ± 0.025 | 2.698 ± 0.023 | 0.654 ± 0.01 | 1.342 ± 0.017 | 0.0 ± 0.0
Leu | 7.933 ± 0.039 | 1.204 ± 0.013 | 5.252 ± 0.036 | 5.562 ± 0.037 | 3.396 ± 0.029 | 6.085 ± 0.038 | 2.281 ± 0.022 | 4.082 ± 0.033 | 4.059 ± 0.031 | 8.51 ± 0.06 | 1.874 ± 0.018 | 3.241 ± 0.025 | 5.402 ± 0.032 | 3.925 ± 0.032 | 5.755 ± 0.04 | 7.378 ± 0.036 | 4.792 ± 0.025 | 5.504 ± 0.032 | 1.269 ± 0.016 | 2.444 ± 0.022 | 0.0 ± 0.0
Met | 2.262 ± 0.021 | 0.249 ± 0.007 | 1.294 ± 0.014 | 1.342 ± 0.015 | 0.763 ± 0.012 | 1.539 ± 0.02 | 0.507 ± 0.009 | 1.078 ± 0.015 | 1.019 ± 0.014 | 1.864 ± 0.02 | 0.599 ± 0.01 | 0.841 ± 0.012 | 1.274 ± 0.016 | 0.901 ± 0.014 | 1.26 ± 0.016 | 1.909 ± 0.015 | 1.377 ± 0.016 | 1.371 ± 0.015 | 0.273 ± 0.006 | 0.526 ± 0.01 | 0.0 ± 0.0
Asn | 3.166 ± 0.026 | 0.435 ± 0.009 | 1.964 ± 0.019 | 2.062 ± 0.02 | 1.404 ± 0.014 | 3.008 ± 0.029 | 0.865 ± 0.012 | 2.073 ± 0.021 | 1.459 ± 0.018 | 3.343 ± 0.025 | 0.868 ± 0.013 | 1.443 ± 0.018 | 2.53 ± 0.023 | 1.402 ± 0.016 | 1.929 ± 0.018 | 2.741 ± 0.023 | 2.196 ± 0.021 | 2.356 ± 0.02 | 0.605 ± 0.01 | 1.086 ± 0.015 | 0.0 ± 0.0
Pro | 5.223 ± 0.05 | 0.537 ± 0.01 | 3.205 ± 0.026 | 3.926 ± 0.039 | 2.091 ± 0.019 | 3.941 ± 0.031 | 1.293 ± 0.016 | 2.629 ± 0.022 | 2.5 ± 0.024 | 4.667 ± 0.03 | 1.097 ± 0.015 | 2.18 ± 0.021 | 4.643 ± 0.068 | 2.423 ± 0.027 | 3.394 ± 0.031 | 5.968 ± 0.045 | 3.973 ± 0.028 | 3.691 ± 0.033 | 0.804 ± 0.012 | 1.509 ± 0.019 | 0.001 ± 0.0
Gln | 3.557 ± 0.026 | 0.473 ± 0.011 | 2.071 ± 0.017 | 2.405 ± 0.022 | 1.378 ± 0.015 | 2.488 ± 0.023 | 1.054 ± 0.015 | 2.038 ± 0.017 | 1.982 ± 0.019 | 3.525 ± 0.028 | 0.938 ± 0.014 | 1.597 ± 0.019 | 2.553 ± 0.03 | 2.304 ± 0.046 | 2.614 ± 0.024 | 3.29 ± 0.03 | 2.383 ± 0.021 | 2.281 ± 0.021 | 0.626 ± 0.01 | 1.197 ± 0.015 | 0.0 ± 0.0
Arg | 4.644 ± 0.029 | 0.711 ± 0.013 | 3.378 ± 0.03 | 3.754 ± 0.032 | 2.145 ± 0.021 | 3.709 ± 0.029 | 1.54 ± 0.018 | 2.917 ± 0.024 | 3.25 ± 0.027 | 5.5 ± 0.039 | 1.29 ± 0.015 | 2.211 ± 0.018 | 3.452 ± 0.03 | 2.606 ± 0.022 | 4.93 ± 0.04 | 4.682 ± 0.039 | 3.234 ± 0.027 | 3.406 ± 0.026 | 0.963 ± 0.015 | 1.699 ± 0.016 | 0.001 ± 0.0
Ser | 6.699 ± 0.038 | 0.877 ± 0.014 | 4.239 ± 0.028 | 4.253 ± 0.031 | 3.029 ± 0.025 | 5.644 ± 0.036 | 2.017 ± 0.018 | 4.046 ± 0.029 | 3.619 ± 0.029 | 7.315 ± 0.043 | 1.8 ± 0.016 | 3.037 ± 0.026 | 5.428 ± 0.051 | 3.391 ± 0.024 | 4.946 ± 0.037 | 8.685 ± 0.072 | 5.756 ± 0.041 | 4.765 ± 0.032 | 1.185 ± 0.015 | 2.126 ± 0.021 | 0.0 ± 0.0
Thr | 5.218 ± 0.03 | 0.73 ± 0.012 | 2.916 ± 0.027 | 3.196 ± 0.028 | 2.232 ± 0.019 | 4.319 ± 0.037 | 1.28 ± 0.014 | 3.167 ± 0.027 | 2.52 ± 0.024 | 5.25 ± 0.032 | 1.189 ± 0.016 | 2.138 ± 0.021 | 4.284 ± 0.035 | 2.158 ± 0.02 | 3.118 ± 0.025 | 5.373 ± 0.039 | 4.286 ± 0.046 | 3.889 ± 0.033 | 0.901 ± 0.011 | 1.638 ± 0.018 | 0.0 ± 0.0
Val | 5.227 ± 0.035 | 0.844 ± 0.012 | 3.744 ± 0.027 | 3.82 ± 0.033 | 2.492 ± 0.024 | 4.013 ± 0.032 | 1.413 ± 0.017 | 3.136 ± 0.027 | 2.765 ± 0.026 | 5.595 ± 0.035 | 1.371 ± 0.016 | 2.354 ± 0.022 | 3.607 ± 0.028 | 2.467 ± 0.02 | 3.438 ± 0.027 | 4.803 ± 0.035 | 3.62 ± 0.034 | 4.271 ± 0.031 | 0.872 ± 0.013 | 1.777 ± 0.02 | 0.001 ± 0.0
Trp | 1.201 ± 0.014 | 0.208 ± 0.006 | 0.909 ± 0.014 | 0.877 ± 0.012 | 0.558 ± 0.01 | 0.976 ± 0.014 | 0.372 ± 0.009 | 0.825 ± 0.012 | 0.85 ± 0.012 | 1.399 ± 0.017 | 0.394 ± 0.008 | 0.662 ± 0.012 | 0.633 ± 0.011 | 0.597 ± 0.01 | 0.955 ± 0.015 | 1.121 ± 0.014 | 0.945 ± 0.013 | 0.919 ± 0.013 | 0.307 ± 0.007 | 0.46 ± 0.009 | 0.0 ± 0.0
Tyr | 2.238 ± 0.02 | 0.416 ± 0.007 | 1.615 ± 0.02 | 1.546 ± 0.016 | 1.226 ± 0.015 | 2.144 ± 0.025 | 0.789 ± 0.011 | 1.428 ± 0.016 | 1.065 ± 0.013 | 2.74 ± 0.027 | 0.63 ± 0.011 | 1.156 ± 0.016 | 1.535 ± 0.017 | 1.186 ± 0.016 | 1.658 ± 0.016 | 2.115 ± 0.021 | 1.651 ± 0.017 | 1.668 ± 0.016 | 0.463 ± 0.009 | 0.968 ± 0.014 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 14038 proteins (5856265 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
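A protein-level bootstrap can be sketched as follows. The exact procedure used for this page is not described here, so this is only one plausible reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is taken as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over overlapping adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates.

    Assumption: each replicate draws len(proteins) whole proteins
    with replacement; residues within a protein are never shuffled.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy input, not taken from the proteome.
toy_proteins = ["MAALLK", "AAGL", "MKLLAA", "GGSA", "LLAAL"]
print(dipeptide_per_mille(toy_proteins, "AA"), bootstrap_error(toy_proteins, "AA"))
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the error reflects variation between proteins.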

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski