Amino acid dipepetide frequency for Penicillium sp. occitanis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.624AlaAla: 8.624 ± 0.065
0.988AlaCys: 0.988 ± 0.013
4.209AlaAsp: 4.209 ± 0.032
4.978AlaGlu: 4.978 ± 0.041
3.181AlaPhe: 3.181 ± 0.028
5.618AlaGly: 5.618 ± 0.04
1.687AlaHis: 1.687 ± 0.018
4.697AlaIle: 4.697 ± 0.028
3.889AlaLys: 3.889 ± 0.031
7.576AlaLeu: 7.576 ± 0.043
1.946AlaMet: 1.946 ± 0.017
3.121AlaAsn: 3.121 ± 0.025
4.118AlaPro: 4.118 ± 0.038
3.255AlaGln: 3.255 ± 0.027
4.436AlaArg: 4.436 ± 0.031
6.884AlaSer: 6.884 ± 0.042
5.377AlaThr: 5.377 ± 0.03
5.404AlaVal: 5.404 ± 0.034
1.166AlaTrp: 1.166 ± 0.015
2.279AlaTyr: 2.279 ± 0.02
0.0AlaXaa: 0.0 ± 0.0
Cys
0.89CysAla: 0.89 ± 0.016
0.188CysCys: 0.188 ± 0.007
0.618CysAsp: 0.618 ± 0.011
0.548CysGlu: 0.548 ± 0.01
0.536CysPhe: 0.536 ± 0.01
0.838CysGly: 0.838 ± 0.016
0.299CysHis: 0.299 ± 0.008
0.711CysIle: 0.711 ± 0.013
0.428CysLys: 0.428 ± 0.01
1.204CysLeu: 1.204 ± 0.013
0.247CysMet: 0.247 ± 0.006
0.402CysAsn: 0.402 ± 0.009
0.565CysPro: 0.565 ± 0.011
0.42CysGln: 0.42 ± 0.008
0.633CysArg: 0.633 ± 0.012
0.79CysSer: 0.79 ± 0.013
0.634CysThr: 0.634 ± 0.011
0.771CysVal: 0.771 ± 0.011
0.194CysTrp: 0.194 ± 0.006
0.339CysTyr: 0.339 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
4.563AspAla: 4.563 ± 0.031
0.591AspCys: 0.591 ± 0.01
4.188AspAsp: 4.188 ± 0.045
4.478AspGlu: 4.478 ± 0.035
2.289AspPhe: 2.289 ± 0.022
3.878AspGly: 3.878 ± 0.027
1.235AspHis: 1.235 ± 0.016
3.455AspIle: 3.455 ± 0.028
2.355AspLys: 2.355 ± 0.023
5.192AspLeu: 5.192 ± 0.035
1.302AspMet: 1.302 ± 0.016
2.087AspAsn: 2.087 ± 0.02
3.133AspPro: 3.133 ± 0.025
1.889AspGln: 1.889 ± 0.018
2.904AspArg: 2.904 ± 0.026
4.146AspSer: 4.146 ± 0.03
3.119AspThr: 3.119 ± 0.025
3.835AspVal: 3.835 ± 0.027
0.894AspTrp: 0.894 ± 0.012
1.789AspTyr: 1.789 ± 0.016
0.0AspXaa: 0.0 ± 0.0
Glu
5.109GluAla: 5.109 ± 0.039
0.565GluCys: 0.565 ± 0.01
4.21GluAsp: 4.21 ± 0.041
5.425GluGlu: 5.425 ± 0.051
2.049GluPhe: 2.049 ± 0.019
3.508GluGly: 3.508 ± 0.025
1.391GluHis: 1.391 ± 0.015
3.486GluIle: 3.486 ± 0.028
3.819GluLys: 3.819 ± 0.033
5.22GluLeu: 5.22 ± 0.032
1.444GluMet: 1.444 ± 0.02
2.565GluAsn: 2.565 ± 0.026
2.525GluPro: 2.525 ± 0.024
2.522GluGln: 2.522 ± 0.022
3.688GluArg: 3.688 ± 0.032
4.376GluSer: 4.376 ± 0.03
3.7GluThr: 3.7 ± 0.028
3.563GluVal: 3.563 ± 0.026
0.893GluTrp: 0.893 ± 0.012
1.832GluTyr: 1.832 ± 0.02
0.0GluXaa: 0.0 ± 0.0
Phe
3.127PheAla: 3.127 ± 0.025
0.536PheCys: 0.536 ± 0.012
2.368PheAsp: 2.368 ± 0.018
2.263PheGlu: 2.263 ± 0.022
1.709PhePhe: 1.709 ± 0.021
2.928PheGly: 2.928 ± 0.03
0.918PheHis: 0.918 ± 0.013
2.017PheIle: 2.017 ± 0.022
1.518PheLys: 1.518 ± 0.018
3.683PheLeu: 3.683 ± 0.032
0.849PheMet: 0.849 ± 0.012
1.54PheAsn: 1.54 ± 0.017
1.948PhePro: 1.948 ± 0.019
1.474PheGln: 1.474 ± 0.015
1.934PheArg: 1.934 ± 0.018
3.117PheSer: 3.117 ± 0.024
2.235PheThr: 2.235 ± 0.02
2.519PheVal: 2.519 ± 0.024
0.664PheTrp: 0.664 ± 0.012
1.215PheTyr: 1.215 ± 0.014
0.0PheXaa: 0.0 ± 0.0
Gly
5.087GlyAla: 5.087 ± 0.036
0.795GlyCys: 0.795 ± 0.012
3.468GlyAsp: 3.468 ± 0.023
3.372GlyGlu: 3.372 ± 0.024
2.896GlyPhe: 2.896 ± 0.025
5.45GlyGly: 5.45 ± 0.044
1.654GlyHis: 1.654 ± 0.019
3.852GlyIle: 3.852 ± 0.032
3.328GlyLys: 3.328 ± 0.027
6.141GlyLeu: 6.141 ± 0.035
1.522GlyMet: 1.522 ± 0.018
2.604GlyAsn: 2.604 ± 0.024
3.026GlyPro: 3.026 ± 0.025
2.498GlyGln: 2.498 ± 0.022
3.741GlyArg: 3.741 ± 0.031
5.478GlySer: 5.478 ± 0.038
3.981GlyThr: 3.981 ± 0.029
4.368GlyVal: 4.368 ± 0.028
1.157GlyTrp: 1.157 ± 0.017
2.321GlyTyr: 2.321 ± 0.026
0.0GlyXaa: 0.0 ± 0.0
His
1.755HisAla: 1.755 ± 0.02
0.285HisCys: 0.285 ± 0.008
1.34HisAsp: 1.34 ± 0.014
1.38HisGlu: 1.38 ± 0.018
0.94HisPhe: 0.94 ± 0.014
1.681HisGly: 1.681 ± 0.019
0.83HisHis: 0.83 ± 0.015
1.304HisIle: 1.304 ± 0.015
0.903HisLys: 0.903 ± 0.012
2.209HisLeu: 2.209 ± 0.019
0.475HisMet: 0.475 ± 0.009
0.903HisAsn: 0.903 ± 0.014
1.519HisPro: 1.519 ± 0.018
0.958HisGln: 0.958 ± 0.015
1.427HisArg: 1.427 ± 0.018
1.756HisSer: 1.756 ± 0.018
1.273HisThr: 1.273 ± 0.016
1.447HisVal: 1.447 ± 0.017
0.348HisTrp: 0.348 ± 0.008
0.717HisTyr: 0.717 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
4.6IleAla: 4.6 ± 0.029
0.783IleCys: 0.783 ± 0.012
3.257IleAsp: 3.257 ± 0.024
3.229IleGlu: 3.229 ± 0.026
2.25IlePhe: 2.25 ± 0.022
3.559IleGly: 3.559 ± 0.027
1.295IleHis: 1.295 ± 0.014
2.996IleIle: 2.996 ± 0.027
2.382IleLys: 2.382 ± 0.022
5.035IleLeu: 5.035 ± 0.034
1.099IleMet: 1.099 ± 0.015
2.129IleAsn: 2.129 ± 0.02
3.293IlePro: 3.293 ± 0.024
2.131IleGln: 2.131 ± 0.02
2.936IleArg: 2.936 ± 0.023
4.313IleSer: 4.313 ± 0.025
3.194IleThr: 3.194 ± 0.027
3.542IleVal: 3.542 ± 0.03
0.779IleTrp: 0.779 ± 0.014
1.654IleTyr: 1.654 ± 0.019
0.0IleXaa: 0.0 ± 0.0
Lys
4.034LysAla: 4.034 ± 0.03
0.441LysCys: 0.441 ± 0.009
2.802LysAsp: 2.802 ± 0.024
3.419LysGlu: 3.419 ± 0.032
1.526LysPhe: 1.526 ± 0.018
2.823LysGly: 2.823 ± 0.029
1.085LysHis: 1.085 ± 0.013
2.49LysIle: 2.49 ± 0.021
3.149LysLys: 3.149 ± 0.04
4.05LysLeu: 4.05 ± 0.028
0.981LysMet: 0.981 ± 0.012
1.847LysAsn: 1.847 ± 0.019
2.526LysPro: 2.526 ± 0.024
1.837LysGln: 1.837 ± 0.018
3.212LysArg: 3.212 ± 0.033
3.573LysSer: 3.573 ± 0.028
2.861LysThr: 2.861 ± 0.022
2.821LysVal: 2.821 ± 0.021
0.703LysTrp: 0.703 ± 0.01
1.495LysTyr: 1.495 ± 0.019
0.0LysXaa: 0.0 ± 0.0
Leu
7.625LeuAla: 7.625 ± 0.043
1.151LeuCys: 1.151 ± 0.015
5.3LeuAsp: 5.3 ± 0.032
5.561LeuGlu: 5.561 ± 0.039
3.47LeuPhe: 3.47 ± 0.029
5.915LeuGly: 5.915 ± 0.035
2.237LeuHis: 2.237 ± 0.023
4.469LeuIle: 4.469 ± 0.034
4.254LeuLys: 4.254 ± 0.027
8.617LeuLeu: 8.617 ± 0.057
1.822LeuMet: 1.822 ± 0.018
3.454LeuAsn: 3.454 ± 0.028
5.286LeuPro: 5.286 ± 0.031
3.975LeuGln: 3.975 ± 0.028
5.527LeuArg: 5.527 ± 0.033
7.516LeuSer: 7.516 ± 0.042
4.973LeuThr: 4.973 ± 0.03
5.481LeuVal: 5.481 ± 0.035
1.223LeuTrp: 1.223 ± 0.017
2.532LeuTyr: 2.532 ± 0.025
0.0LeuXaa: 0.0 ± 0.0
Met
2.144MetAla: 2.144 ± 0.02
0.231MetCys: 0.231 ± 0.007
1.207MetAsp: 1.207 ± 0.013
1.261MetGlu: 1.261 ± 0.016
0.785MetPhe: 0.785 ± 0.012
1.428MetGly: 1.428 ± 0.016
0.476MetHis: 0.476 ± 0.01
1.147MetIle: 1.147 ± 0.015
0.979MetLys: 0.979 ± 0.013
1.857MetLeu: 1.857 ± 0.022
0.567MetMet: 0.567 ± 0.01
0.859MetAsn: 0.859 ± 0.013
1.171MetPro: 1.171 ± 0.016
0.845MetGln: 0.845 ± 0.013
1.187MetArg: 1.187 ± 0.016
1.84MetSer: 1.84 ± 0.018
1.376MetThr: 1.376 ± 0.015
1.28MetVal: 1.28 ± 0.015
0.26MetTrp: 0.26 ± 0.007
0.536MetTyr: 0.536 ± 0.009
0.0MetXaa: 0.0 ± 0.0
Asn
3.311AsnAla: 3.311 ± 0.024
0.433AsnCys: 0.433 ± 0.01
2.217AsnAsp: 2.217 ± 0.023
2.246AsnGlu: 2.246 ± 0.022
1.489AsnPhe: 1.489 ± 0.015
3.161AsnGly: 3.161 ± 0.026
0.925AsnHis: 0.925 ± 0.012
2.388AsnIle: 2.388 ± 0.02
1.674AsnLys: 1.674 ± 0.016
3.492AsnLeu: 3.492 ± 0.022
0.892AsnMet: 0.892 ± 0.014
1.861AsnAsn: 1.861 ± 0.021
2.442AsnPro: 2.442 ± 0.022
1.438AsnGln: 1.438 ± 0.015
1.953AsnArg: 1.953 ± 0.019
2.898AsnSer: 2.898 ± 0.025
2.49AsnThr: 2.49 ± 0.024
2.561AsnVal: 2.561 ± 0.02
0.608AsnTrp: 0.608 ± 0.011
1.182AsnTyr: 1.182 ± 0.014
0.0AsnXaa: 0.0 ± 0.0
Pro
4.649ProAla: 4.649 ± 0.039
0.452ProCys: 0.452 ± 0.01
3.059ProAsp: 3.059 ± 0.023
3.714ProGlu: 3.714 ± 0.026
2.055ProPhe: 2.055 ± 0.021
3.504ProGly: 3.504 ± 0.026
1.188ProHis: 1.188 ± 0.017
2.649ProIle: 2.649 ± 0.023
2.409ProLys: 2.409 ± 0.021
4.583ProLeu: 4.583 ± 0.029
0.975ProMet: 0.975 ± 0.015
2.195ProAsn: 2.195 ± 0.021
4.185ProPro: 4.185 ± 0.051
2.255ProGln: 2.255 ± 0.023
2.958ProArg: 2.958 ± 0.027
5.508ProSer: 5.508 ± 0.043
3.737ProThr: 3.737 ± 0.03
3.477ProVal: 3.477 ± 0.025
0.757ProTrp: 0.757 ± 0.012
1.562ProTyr: 1.562 ± 0.018
0.0ProXaa: 0.0 ± 0.0
Gln
3.3GlnAla: 3.3 ± 0.028
0.403GlnCys: 0.403 ± 0.009
2.03GlnAsp: 2.03 ± 0.021
2.431GlnGlu: 2.431 ± 0.024
1.368GlnPhe: 1.368 ± 0.015
2.32GlnGly: 2.32 ± 0.02
1.04GlnHis: 1.04 ± 0.013
2.1GlnIle: 2.1 ± 0.021
2.048GlnLys: 2.048 ± 0.02
3.501GlnLeu: 3.501 ± 0.027
0.857GlnMet: 0.857 ± 0.014
1.745GlnAsn: 1.745 ± 0.017
2.378GlnPro: 2.378 ± 0.027
2.417GlnGln: 2.417 ± 0.034
2.54GlnArg: 2.54 ± 0.025
3.187GlnSer: 3.187 ± 0.025
2.426GlnThr: 2.426 ± 0.021
2.171GlnVal: 2.171 ± 0.021
0.594GlnTrp: 0.594 ± 0.011
1.26GlnTyr: 1.26 ± 0.013
0.0GlnXaa: 0.0 ± 0.0
Arg
4.211ArgAla: 4.211 ± 0.031
0.602ArgCys: 0.602 ± 0.011
3.239ArgAsp: 3.239 ± 0.03
3.617ArgGlu: 3.617 ± 0.03
2.118ArgPhe: 2.118 ± 0.02
3.339ArgGly: 3.339 ± 0.027
1.437ArgHis: 1.437 ± 0.017
3.031ArgIle: 3.031 ± 0.024
3.311ArgLys: 3.311 ± 0.026
5.368ArgLeu: 5.368 ± 0.034
1.201ArgMet: 1.201 ± 0.014
2.31ArgAsn: 2.31 ± 0.021
3.016ArgPro: 3.016 ± 0.028
2.475ArgGln: 2.475 ± 0.026
4.577ArgArg: 4.577 ± 0.043
4.36ArgSer: 4.36 ± 0.037
3.075ArgThr: 3.075 ± 0.021
3.227ArgVal: 3.227 ± 0.023
0.899ArgTrp: 0.899 ± 0.013
1.699ArgTyr: 1.699 ± 0.021
0.0ArgXaa: 0.0 ± 0.0
Ser
6.446SerAla: 6.446 ± 0.038
0.766SerCys: 0.766 ± 0.015
4.28SerAsp: 4.28 ± 0.031
4.183SerGlu: 4.183 ± 0.03
3.126SerPhe: 3.126 ± 0.027
5.327SerGly: 5.327 ± 0.033
1.91SerHis: 1.91 ± 0.018
4.389SerIle: 4.389 ± 0.033
3.688SerLys: 3.688 ± 0.028
7.435SerLeu: 7.435 ± 0.038
1.69SerMet: 1.69 ± 0.018
3.248SerAsn: 3.248 ± 0.025
4.985SerPro: 4.985 ± 0.039
3.287SerGln: 3.287 ± 0.028
4.66SerArg: 4.66 ± 0.034
8.756SerSer: 8.756 ± 0.065
5.895SerThr: 5.895 ± 0.042
4.709SerVal: 4.709 ± 0.03
1.151SerTrp: 1.151 ± 0.015
2.213SerTyr: 2.213 ± 0.019
0.0SerXaa: 0.0 ± 0.0
Thr
5.328ThrAla: 5.328 ± 0.033
0.687ThrCys: 0.687 ± 0.012
3.031ThrAsp: 3.031 ± 0.024
3.391ThrGlu: 3.391 ± 0.027
2.36ThrPhe: 2.36 ± 0.025
4.312ThrGly: 4.312 ± 0.034
1.271ThrHis: 1.271 ± 0.016
3.433ThrIle: 3.433 ± 0.027
2.655ThrLys: 2.655 ± 0.024
5.32ThrLeu: 5.32 ± 0.033
1.205ThrMet: 1.205 ± 0.015
2.377ThrAsn: 2.377 ± 0.021
4.089ThrPro: 4.089 ± 0.039
2.18ThrGln: 2.18 ± 0.022
2.937ThrArg: 2.937 ± 0.022
5.586ThrSer: 5.586 ± 0.038
4.809ThrThr: 4.809 ± 0.047
3.947ThrVal: 3.947 ± 0.026
0.925ThrTrp: 0.925 ± 0.014
1.779ThrTyr: 1.779 ± 0.019
0.0ThrXaa: 0.0 ± 0.0
Val
5.214ValAla: 5.214 ± 0.034
0.768ValCys: 0.768 ± 0.012
3.805ValAsp: 3.805 ± 0.026
3.877ValGlu: 3.877 ± 0.035
2.544ValPhe: 2.544 ± 0.024
3.991ValGly: 3.991 ± 0.031
1.408ValHis: 1.408 ± 0.014
3.371ValIle: 3.371 ± 0.025
2.872ValLys: 2.872 ± 0.025
5.673ValLeu: 5.673 ± 0.039
1.312ValMet: 1.312 ± 0.017
2.441ValAsn: 2.441 ± 0.023
3.484ValPro: 3.484 ± 0.025
2.443ValGln: 2.443 ± 0.019
3.284ValArg: 3.284 ± 0.028
4.81ValSer: 4.81 ± 0.034
3.747ValThr: 3.747 ± 0.027
4.462ValVal: 4.462 ± 0.042
0.876ValTrp: 0.876 ± 0.013
1.845ValTyr: 1.845 ± 0.019
0.0ValXaa: 0.0 ± 0.0
Trp
1.148TrpAla: 1.148 ± 0.016
0.191TrpCys: 0.191 ± 0.006
0.912TrpAsp: 0.912 ± 0.015
0.839TrpGlu: 0.839 ± 0.012
0.565TrpPhe: 0.565 ± 0.01
0.915TrpGly: 0.915 ± 0.014
0.368TrpHis: 0.368 ± 0.008
0.854TrpIle: 0.854 ± 0.011
0.842TrpLys: 0.842 ± 0.013
1.385TrpLeu: 1.385 ± 0.018
0.381TrpMet: 0.381 ± 0.008
0.681TrpAsn: 0.681 ± 0.011
0.601TrpPro: 0.601 ± 0.01
0.595TrpGln: 0.595 ± 0.01
0.929TrpArg: 0.929 ± 0.012
1.079TrpSer: 1.079 ± 0.016
0.919TrpThr: 0.919 ± 0.015
0.894TrpVal: 0.894 ± 0.013
0.278TrpTrp: 0.278 ± 0.007
0.473TrpTyr: 0.473 ± 0.009
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.266TyrAla: 2.266 ± 0.023
0.401TyrCys: 0.401 ± 0.009
1.781TyrAsp: 1.781 ± 0.017
1.678TyrGlu: 1.678 ± 0.018
1.307TyrPhe: 1.307 ± 0.017
2.241TyrGly: 2.241 ± 0.027
0.798TyrHis: 0.798 ± 0.012
1.618TyrIle: 1.618 ± 0.017
1.173TyrLys: 1.173 ± 0.016
2.848TyrLeu: 2.848 ± 0.024
0.671TyrMet: 0.671 ± 0.012
1.292TyrAsn: 1.292 ± 0.016
1.579TyrPro: 1.579 ± 0.019
1.223TyrGln: 1.223 ± 0.015
1.648TyrArg: 1.648 ± 0.018
2.171TyrSer: 2.171 ± 0.023
1.798TyrThr: 1.798 ± 0.018
1.748TyrVal: 1.748 ± 0.02
0.481TyrTrp: 0.481 ± 0.009
1.065TyrTyr: 1.065 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11231 proteins (5743013 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski