Amino acid dipeptide frequency for Wickerhamomyces ciferrii (strain F-60-10 / ATCC 14091 / CBS 111 / JCM 3599 / NBRC 0793 / NRRL Y-1031) (Yeast) (Pichia ciferrii)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
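As a minimal sketch of how such frequencies might be computed, the snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input format, and the choice to map every non-standard letter to `X` are assumptions made for illustration, not a description of the Proteome-pI pipeline.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Pairs are overlapping and never span two proteins. Any letter outside
    the 20 standard residues is mapped to 'X' (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
freqs = dipeptide_frequencies(["MSSLK", "MSS"])
```

Because pairs are counted per protein, the last residue of one sequence is never paired with the first residue of the next.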

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random and all amino acids were equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
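The arithmetic behind this baseline, and a comparison against it, can be sketched as follows. The snippet assumes the comparison is made against the uniform expectation over the 20 standard amino acids; the function name `classify` is hypothetical, and the three observed values are copied from the table on this page.

```python
N_STANDARD = 20

# Uniform expectation: each of the 20 * 20 dipeptides is equally likely.
expected_fraction = 1 / N_STANDARD**2      # 0.0025, i.e. 0.25%
expected_per_mille = 1000 / N_STANDARD**2  # 2.5 per mille


def classify(observed_per_mille, baseline=expected_per_mille):
    """Label a dipeptide relative to the uniform baseline."""
    if observed_per_mille > baseline:
        return "over"
    if observed_per_mille < baseline:
        return "under"
    return "as expected"


# Observed values (per mille) taken from the table on this page.
observed = {"SerSer": 11.057, "LeuLeu": 8.644, "TrpTrp": 0.161}
labels = {name: classify(value) for name, value in observed.items()}
```

Under this baseline SerSer and LeuLeu come out as overrepresented and TrpTrp as underrepresented.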

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
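A short sketch of that conversion, using the LeuLeu value from the table and the protein and residue totals quoted in the statistics line of this page. The estimate of the total number of dipeptides (residues minus proteins) assumes that pairs are overlapping and never span two proteins; the source does not state how the denominator was defined.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Totals quoted in the statistics line of this page.
N_PROTEINS = 6698
N_RESIDUES = 3252096

# Assumption: a protein of length n contributes n - 1 overlapping pairs.
n_pairs = N_RESIDUES - N_PROTEINS

leu_leu_fraction = per_mille_to_fraction(8.644)    # about 0.008644
leu_leu_count = round(leu_leu_fraction * n_pairs)  # rough estimate only
```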

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency in ‰ (value ± error). Rows give the first residue of the dipeptide, columns the second.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.555 ± 0.06 | 0.499 ± 0.015 | 2.266 ± 0.03 | 2.712 ± 0.033 | 2.183 ± 0.034 | 2.838 ± 0.045 | 0.913 ± 0.018 | 3.617 ± 0.04 | 3.726 ± 0.039 | 4.937 ± 0.048 | 0.837 ± 0.016 | 2.634 ± 0.035 | 2.402 ± 0.053 | 2.116 ± 0.034 | 1.904 ± 0.026 | 4.31 ± 0.06 | 2.93 ± 0.034 | 2.772 ± 0.035 | 0.44 ± 0.014 | 1.556 ± 0.024 | 0.0 ± 0.0
Cys | 0.465 ± 0.013 | 0.17 ± 0.008 | 0.534 ± 0.014 | 0.465 ± 0.013 | 0.585 ± 0.015 | 0.628 ± 0.016 | 0.22 ± 0.01 | 0.659 ± 0.014 | 0.509 ± 0.012 | 0.946 ± 0.022 | 0.165 ± 0.007 | 0.381 ± 0.011 | 0.373 ± 0.013 | 0.294 ± 0.01 | 0.288 ± 0.011 | 0.757 ± 0.017 | 0.451 ± 0.013 | 0.503 ± 0.015 | 0.11 ± 0.007 | 0.345 ± 0.01 | 0.0 ± 0.0
Asp | 2.674 ± 0.034 | 0.472 ± 0.013 | 5.621 ± 0.085 | 5.926 ± 0.07 | 2.958 ± 0.036 | 2.894 ± 0.039 | 1.322 ± 0.022 | 4.326 ± 0.041 | 3.76 ± 0.04 | 6.197 ± 0.051 | 0.76 ± 0.014 | 3.203 ± 0.035 | 2.548 ± 0.03 | 2.689 ± 0.034 | 1.795 ± 0.024 | 5.065 ± 0.044 | 2.675 ± 0.033 | 3.398 ± 0.032 | 0.614 ± 0.015 | 2.456 ± 0.03 | 0.0 ± 0.0
Glu | 3.216 ± 0.041 | 0.451 ± 0.012 | 4.785 ± 0.056 | 6.203 ± 0.09 | 3.197 ± 0.033 | 2.737 ± 0.03 | 1.215 ± 0.02 | 5.197 ± 0.05 | 5.121 ± 0.06 | 6.929 ± 0.066 | 1.039 ± 0.018 | 4.348 ± 0.044 | 2.171 ± 0.03 | 2.8 ± 0.037 | 2.621 ± 0.032 | 5.552 ± 0.054 | 3.604 ± 0.04 | 3.424 ± 0.036 | 0.595 ± 0.013 | 2.24 ± 0.029 | 0.0 ± 0.0
Phe | 2.342 ± 0.03 | 0.445 ± 0.013 | 3.004 ± 0.037 | 3.177 ± 0.033 | 2.206 ± 0.029 | 2.948 ± 0.047 | 0.999 ± 0.018 | 3.71 ± 0.038 | 3.873 ± 0.032 | 4.154 ± 0.041 | 0.785 ± 0.018 | 3.392 ± 0.031 | 1.822 ± 0.028 | 2.205 ± 0.027 | 1.452 ± 0.022 | 3.265 ± 0.036 | 2.582 ± 0.03 | 2.343 ± 0.029 | 0.564 ± 0.015 | 1.636 ± 0.025 | 0.0 ± 0.0
Gly | 2.906 ± 0.049 | 0.544 ± 0.014 | 2.92 ± 0.035 | 3.025 ± 0.033 | 2.834 ± 0.036 | 3.577 ± 0.063 | 1.013 ± 0.02 | 3.923 ± 0.041 | 3.491 ± 0.042 | 5.007 ± 0.044 | 0.745 ± 0.014 | 2.807 ± 0.032 | 1.719 ± 0.027 | 1.725 ± 0.029 | 1.887 ± 0.029 | 4.791 ± 0.057 | 2.741 ± 0.031 | 3.105 ± 0.037 | 0.648 ± 0.014 | 2.078 ± 0.027 | 0.0 ± 0.0
His | 0.841 ± 0.016 | 0.206 ± 0.008 | 1.221 ± 0.02 | 1.282 ± 0.025 | 0.93 ± 0.018 | 1.133 ± 0.02 | 0.716 ± 0.02 | 1.418 ± 0.021 | 1.372 ± 0.025 | 2.01 ± 0.021 | 0.286 ± 0.01 | 1.18 ± 0.025 | 1.035 ± 0.019 | 1.099 ± 0.026 | 0.815 ± 0.017 | 1.7 ± 0.025 | 0.985 ± 0.017 | 0.985 ± 0.017 | 0.215 ± 0.008 | 0.822 ± 0.016 | 0.0 ± 0.0
Ile | 3.604 ± 0.038 | 0.719 ± 0.019 | 4.945 ± 0.04 | 4.919 ± 0.043 | 3.281 ± 0.033 | 3.821 ± 0.047 | 1.566 ± 0.022 | 5.421 ± 0.052 | 5.65 ± 0.048 | 6.744 ± 0.057 | 1.166 ± 0.02 | 5.123 ± 0.051 | 3.902 ± 0.038 | 2.921 ± 0.032 | 2.648 ± 0.03 | 6.445 ± 0.056 | 4.267 ± 0.047 | 3.732 ± 0.04 | 0.807 ± 0.016 | 2.451 ± 0.03 | 0.0 ± 0.0
Lys | 3.633 ± 0.036 | 0.564 ± 0.017 | 4.737 ± 0.052 | 5.427 ± 0.062 | 3.79 ± 0.036 | 3.077 ± 0.037 | 1.475 ± 0.022 | 6.006 ± 0.058 | 6.491 ± 0.07 | 7.588 ± 0.059 | 1.12 ± 0.02 | 4.927 ± 0.047 | 3.332 ± 0.046 | 3.089 ± 0.034 | 3.428 ± 0.038 | 6.383 ± 0.053 | 4.264 ± 0.038 | 3.797 ± 0.04 | 0.706 ± 0.015 | 2.616 ± 0.031 | 0.0 ± 0.0
Leu | 4.721 ± 0.048 | 0.847 ± 0.02 | 5.539 ± 0.048 | 6.273 ± 0.056 | 4.159 ± 0.046 | 4.693 ± 0.045 | 1.765 ± 0.022 | 7.104 ± 0.056 | 8.328 ± 0.065 | 8.644 ± 0.067 | 1.557 ± 0.024 | 6.841 ± 0.058 | 4.282 ± 0.044 | 3.994 ± 0.041 | 3.78 ± 0.041 | 8.32 ± 0.059 | 5.296 ± 0.045 | 4.86 ± 0.046 | 0.827 ± 0.017 | 3.013 ± 0.036 | 0.0 ± 0.0
Met | 0.906 ± 0.019 | 0.166 ± 0.007 | 0.881 ± 0.016 | 0.875 ± 0.018 | 0.776 ± 0.016 | 0.872 ± 0.017 | 0.174 ± 0.007 | 1.193 ± 0.017 | 1.178 ± 0.022 | 1.274 ± 0.02 | 0.343 ± 0.01 | 1.085 ± 0.021 | 0.486 ± 0.014 | 0.385 ± 0.01 | 0.55 ± 0.013 | 1.644 ± 0.023 | 0.882 ± 0.017 | 0.855 ± 0.017 | 0.123 ± 0.007 | 0.418 ± 0.013 | 0.0 ± 0.0
Asn | 2.668 ± 0.033 | 0.55 ± 0.015 | 4.581 ± 0.042 | 4.742 ± 0.045 | 3.084 ± 0.031 | 3.663 ± 0.038 | 1.553 ± 0.025 | 4.366 ± 0.048 | 4.654 ± 0.042 | 6.089 ± 0.056 | 0.879 ± 0.017 | 5.239 ± 0.071 | 2.754 ± 0.035 | 3.172 ± 0.042 | 1.96 ± 0.024 | 6.049 ± 0.059 | 3.28 ± 0.039 | 3.222 ± 0.032 | 0.689 ± 0.015 | 2.473 ± 0.029 | 0.0 ± 0.0
Pro | 1.998 ± 0.038 | 0.258 ± 0.01 | 2.139 ± 0.03 | 3.01 ± 0.035 | 1.946 ± 0.026 | 2.0 ± 0.031 | 0.822 ± 0.019 | 3.397 ± 0.033 | 3.584 ± 0.038 | 3.87 ± 0.036 | 0.582 ± 0.016 | 2.906 ± 0.035 | 2.517 ± 0.071 | 2.323 ± 0.048 | 1.541 ± 0.022 | 4.262 ± 0.058 | 2.754 ± 0.036 | 2.314 ± 0.031 | 0.389 ± 0.011 | 1.41 ± 0.022 | 0.0 ± 0.0
Gln | 2.067 ± 0.031 | 0.299 ± 0.009 | 2.53 ± 0.028 | 2.722 ± 0.036 | 1.94 ± 0.026 | 1.976 ± 0.03 | 1.01 ± 0.024 | 3.05 ± 0.029 | 2.94 ± 0.032 | 4.045 ± 0.041 | 0.599 ± 0.014 | 3.064 ± 0.038 | 2.046 ± 0.048 | 4.001 ± 0.13 | 1.824 ± 0.028 | 3.769 ± 0.049 | 2.216 ± 0.029 | 2.22 ± 0.027 | 0.383 ± 0.011 | 1.496 ± 0.021 | 0.0 ± 0.0
Arg | 1.958 ± 0.031 | 0.367 ± 0.01 | 2.13 ± 0.027 | 2.314 ± 0.029 | 1.864 ± 0.029 | 1.866 ± 0.034 | 0.802 ± 0.016 | 2.615 ± 0.028 | 2.916 ± 0.033 | 3.626 ± 0.038 | 0.582 ± 0.013 | 2.2 ± 0.025 | 1.543 ± 0.026 | 1.497 ± 0.021 | 2.237 ± 0.035 | 3.505 ± 0.041 | 1.908 ± 0.025 | 1.969 ± 0.029 | 0.397 ± 0.012 | 1.438 ± 0.022 | 0.0 ± 0.0
Ser | 3.995 ± 0.06 | 0.688 ± 0.018 | 4.602 ± 0.045 | 4.715 ± 0.047 | 4.0 ± 0.038 | 4.331 ± 0.05 | 1.63 ± 0.026 | 7.2 ± 0.055 | 7.224 ± 0.06 | 8.185 ± 0.063 | 1.291 ± 0.019 | 6.837 ± 0.071 | 3.801 ± 0.052 | 3.747 ± 0.041 | 3.358 ± 0.037 | 11.057 ± 0.228 | 6.158 ± 0.058 | 4.045 ± 0.045 | 0.809 ± 0.019 | 2.745 ± 0.034 | 0.0 ± 0.0
Thr | 2.797 ± 0.033 | 0.492 ± 0.012 | 2.73 ± 0.033 | 3.033 ± 0.038 | 2.445 ± 0.033 | 3.139 ± 0.041 | 1.055 ± 0.017 | 4.182 ± 0.033 | 4.371 ± 0.045 | 5.155 ± 0.044 | 0.79 ± 0.015 | 3.701 ± 0.038 | 3.212 ± 0.043 | 2.176 ± 0.03 | 2.101 ± 0.027 | 5.572 ± 0.053 | 4.085 ± 0.056 | 2.787 ± 0.037 | 0.516 ± 0.016 | 1.705 ± 0.023 | 0.0 ± 0.0
Val | 2.748 ± 0.038 | 0.509 ± 0.015 | 3.275 ± 0.035 | 3.717 ± 0.043 | 2.523 ± 0.033 | 2.76 ± 0.034 | 1.02 ± 0.018 | 3.575 ± 0.034 | 3.892 ± 0.037 | 5.11 ± 0.046 | 0.846 ± 0.017 | 2.877 ± 0.027 | 2.41 ± 0.037 | 2.034 ± 0.027 | 1.798 ± 0.025 | 4.48 ± 0.047 | 2.693 ± 0.035 | 3.085 ± 0.038 | 0.527 ± 0.014 | 1.743 ± 0.023 | 0.0 ± 0.0
Trp | 0.48 ± 0.012 | 0.159 ± 0.007 | 0.646 ± 0.017 | 0.604 ± 0.015 | 0.56 ± 0.013 | 0.549 ± 0.014 | 0.157 ± 0.006 | 0.773 ± 0.014 | 0.803 ± 0.016 | 0.929 ± 0.02 | 0.179 ± 0.008 | 0.672 ± 0.015 | 0.276 ± 0.01 | 0.258 ± 0.01 | 0.481 ± 0.012 | 0.815 ± 0.017 | 0.512 ± 0.015 | 0.53 ± 0.013 | 0.161 ± 0.008 | 0.372 ± 0.012 | 0.0 ± 0.0
Tyr | 1.575 ± 0.024 | 0.442 ± 0.012 | 2.267 ± 0.028 | 2.304 ± 0.028 | 1.642 ± 0.024 | 1.985 ± 0.031 | 0.826 ± 0.017 | 2.4 ± 0.031 | 2.569 ± 0.028 | 3.403 ± 0.032 | 0.518 ± 0.014 | 2.25 ± 0.031 | 1.391 ± 0.021 | 1.673 ± 0.028 | 1.263 ± 0.019 | 2.675 ± 0.032 | 1.762 ± 0.024 | 1.678 ± 0.023 | 0.393 ± 0.011 | 1.433 ± 0.025 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 6698 proteins (3252096 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
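A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. Using the sample standard deviation as the error, the fixed seed, and the names `pair_counts` and `bootstrap_error` are all assumptions; the source only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Std. dev. of one dipeptide's per-mille frequency over bootstrap resamples.

    Each resample draws len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy sequences for illustration only.
toy = ["MSSLKSS", "MALLK", "MSSSS", "MKKLA"]
err = bootstrap_error(toy, "SS")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every resample.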

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski