Amino acid dipeptide frequency for Candida glabrata (strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65) (Yeast) (Torulopsis glabrata)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
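The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function names (`dipeptide_per_mille`, `classify`) are hypothetical, sequences are given in one-letter codes rather than the three-letter codes of the table, every non-standard letter is mapped to `X` (Xaa), and the "expected" value is taken to be the uniform 2.5‰ mentioned in the text.

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else is treated as Xaa ("X").
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation from the text: 1 / 400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping pairs are counted within each protein, never across proteins.
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(per_mille, expected=UNIFORM_PER_MILLE):
    """Label a value relative to the (assumed) uniform expectation."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy_proteome = ["MKLLSS", "MSSLK"]  # made-up sequences, for illustration only
    for pair, value in sorted(dipeptide_per_mille(toy_proteome).items()):
        print(pair, round(value, 1), classify(value))
    # Converting a table value from per mille to a fraction (AlaAla = 3.981):
    print(3.981 * 1e-3)
```

With this convention, a table value such as AlaAla = 3.981 lies above 2.5‰ and would be labelled "over", whereas AlaCys = 0.62 would be labelled "under".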

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.981 ± 0.063
AlaCys: 0.62 ± 0.016
AlaAsp: 2.942 ± 0.035
AlaGlu: 3.299 ± 0.044
AlaPhe: 2.194 ± 0.028
AlaGly: 3.227 ± 0.053
AlaHis: 1.122 ± 0.021
AlaIle: 3.685 ± 0.038
AlaLys: 3.956 ± 0.05
AlaLeu: 5.285 ± 0.054
AlaMet: 1.359 ± 0.025
AlaAsn: 3.145 ± 0.04
AlaPro: 2.346 ± 0.063
AlaGln: 2.189 ± 0.04
AlaArg: 2.372 ± 0.036
AlaSer: 4.556 ± 0.044
AlaThr: 3.533 ± 0.049
AlaVal: 3.315 ± 0.042
AlaTrp: 0.483 ± 0.016
AlaTyr: 1.709 ± 0.025
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.64 ± 0.018
CysCys: 0.247 ± 0.011
CysAsp: 0.676 ± 0.021
CysGlu: 0.573 ± 0.017
CysPhe: 0.577 ± 0.017
CysGly: 0.798 ± 0.021
CysHis: 0.305 ± 0.011
CysIle: 0.841 ± 0.02
CysLys: 0.723 ± 0.02
CysLeu: 1.15 ± 0.021
CysMet: 0.252 ± 0.01
CysAsn: 0.628 ± 0.016
CysPro: 0.47 ± 0.017
CysGln: 0.379 ± 0.01
CysArg: 0.494 ± 0.016
CysSer: 0.921 ± 0.022
CysThr: 0.576 ± 0.015
CysVal: 0.745 ± 0.016
CysTrp: 0.144 ± 0.007
CysTyr: 0.475 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.164 ± 0.039
AspCys: 0.624 ± 0.017
AspAsp: 4.846 ± 0.068
AspGlu: 5.206 ± 0.07
AspPhe: 2.628 ± 0.033
AspGly: 3.202 ± 0.053
AspHis: 1.15 ± 0.02
AspIle: 4.659 ± 0.048
AspLys: 4.122 ± 0.044
AspLeu: 5.557 ± 0.048
AspMet: 1.403 ± 0.024
AspAsn: 3.513 ± 0.043
AspPro: 2.502 ± 0.034
AspGln: 1.967 ± 0.028
AspArg: 2.209 ± 0.032
AspSer: 4.976 ± 0.055
AspThr: 3.248 ± 0.036
AspVal: 3.635 ± 0.037
AspTrp: 0.637 ± 0.021
AspTyr: 2.382 ± 0.04
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.523 ± 0.047
GluCys: 0.638 ± 0.019
GluAsp: 4.736 ± 0.066
GluGlu: 6.137 ± 0.092
GluPhe: 2.805 ± 0.037
GluGly: 2.877 ± 0.066
GluHis: 1.367 ± 0.024
GluIle: 4.396 ± 0.043
GluLys: 5.346 ± 0.068
GluLeu: 6.642 ± 0.059
GluMet: 1.471 ± 0.023
GluAsn: 4.275 ± 0.042
GluPro: 2.087 ± 0.029
GluGln: 2.807 ± 0.039
GluArg: 3.013 ± 0.044
GluSer: 4.945 ± 0.053
GluThr: 3.679 ± 0.048
GluVal: 3.773 ± 0.042
GluTrp: 0.64 ± 0.016
GluTyr: 2.444 ± 0.032
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.282 ± 0.034
PheCys: 0.503 ± 0.016
PheAsp: 2.762 ± 0.03
PheGlu: 2.788 ± 0.035
PhePhe: 1.91 ± 0.037
PheGly: 2.598 ± 0.055
PheHis: 0.923 ± 0.023
PheIle: 2.784 ± 0.039
PheLys: 2.952 ± 0.039
PheLeu: 3.869 ± 0.043
PheMet: 0.914 ± 0.019
PheAsn: 2.488 ± 0.035
PhePro: 1.643 ± 0.023
PheGln: 1.673 ± 0.029
PheArg: 1.564 ± 0.029
PheSer: 3.363 ± 0.045
PheThr: 2.342 ± 0.031
PheVal: 2.471 ± 0.031
PheTrp: 0.489 ± 0.018
PheTyr: 1.513 ± 0.028
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.965 ± 0.051
GlyCys: 0.667 ± 0.016
GlyAsp: 2.867 ± 0.045
GlyGlu: 3.023 ± 0.06
GlyPhe: 2.291 ± 0.033
GlyGly: 3.481 ± 0.116
GlyHis: 1.182 ± 0.024
GlyIle: 3.5 ± 0.036
GlyLys: 3.768 ± 0.041
GlyLeu: 4.514 ± 0.054
GlyMet: 1.11 ± 0.028
GlyAsn: 3.217 ± 0.052
GlyPro: 1.626 ± 0.027
GlyGln: 1.817 ± 0.038
GlyArg: 2.186 ± 0.03
GlySer: 5.434 ± 0.458
GlyThr: 3.138 ± 0.052
GlyVal: 3.22 ± 0.044
GlyTrp: 0.559 ± 0.015
GlyTyr: 1.91 ± 0.031
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.01 ± 0.02
HisCys: 0.296 ± 0.01
HisAsp: 1.209 ± 0.025
HisGlu: 1.296 ± 0.025
HisPhe: 0.917 ± 0.019
HisGly: 1.194 ± 0.024
HisHis: 0.637 ± 0.018
HisIle: 1.509 ± 0.03
HisLys: 1.337 ± 0.026
HisLeu: 2.037 ± 0.029
HisMet: 0.475 ± 0.013
HisAsn: 1.195 ± 0.024
HisPro: 0.972 ± 0.023
HisGln: 0.796 ± 0.019
HisArg: 1.016 ± 0.021
HisSer: 1.82 ± 0.029
HisThr: 1.194 ± 0.029
HisVal: 1.194 ± 0.024
HisTrp: 0.222 ± 0.009
HisTyr: 0.774 ± 0.018
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.704 ± 0.041
IleCys: 0.879 ± 0.017
IleAsp: 4.475 ± 0.051
IleGlu: 4.377 ± 0.055
IlePhe: 2.696 ± 0.036
IleGly: 3.143 ± 0.042
IleHis: 1.346 ± 0.023
IleIle: 4.114 ± 0.056
IleLys: 4.514 ± 0.045
IleLeu: 6.02 ± 0.066
IleMet: 1.353 ± 0.021
IleAsn: 3.805 ± 0.045
IlePro: 3.305 ± 0.039
IleGln: 2.408 ± 0.031
IleArg: 2.827 ± 0.037
IleSer: 5.745 ± 0.052
IleThr: 3.913 ± 0.066
IleVal: 3.811 ± 0.044
IleTrp: 0.663 ± 0.014
IleTyr: 2.13 ± 0.036
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.682 ± 0.045
LysCys: 0.775 ± 0.021
LysAsp: 4.341 ± 0.048
LysGlu: 5.435 ± 0.063
LysPhe: 3.014 ± 0.033
LysGly: 2.981 ± 0.039
LysHis: 1.525 ± 0.026
LysIle: 4.509 ± 0.041
LysLys: 6.634 ± 0.078
LysLeu: 7.243 ± 0.063
LysMet: 1.553 ± 0.025
LysAsn: 4.357 ± 0.049
LysPro: 2.967 ± 0.036
LysGln: 2.944 ± 0.037
LysArg: 3.876 ± 0.05
LysSer: 5.576 ± 0.053
LysThr: 3.979 ± 0.044
LysVal: 4.016 ± 0.042
LysTrp: 0.764 ± 0.018
LysTyr: 2.841 ± 0.04
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.279 ± 0.058
LeuCys: 1.22 ± 0.025
LeuAsp: 5.592 ± 0.054
LeuGlu: 6.256 ± 0.059
LeuPhe: 4.021 ± 0.056
LeuGly: 4.531 ± 0.048
LeuHis: 1.998 ± 0.026
LeuIle: 5.535 ± 0.055
LeuLys: 7.271 ± 0.064
LeuLeu: 9.17 ± 0.079
LeuMet: 2.009 ± 0.026
LeuAsn: 5.33 ± 0.054
LeuPro: 4.371 ± 0.043
LeuGln: 4.085 ± 0.045
LeuArg: 4.415 ± 0.043
LeuSer: 7.8 ± 0.07
LeuThr: 5.034 ± 0.048
LeuVal: 5.266 ± 0.055
LeuTrp: 0.897 ± 0.018
LeuTyr: 3.067 ± 0.039
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.449 ± 0.026
MetCys: 0.236 ± 0.008
MetAsp: 1.422 ± 0.025
MetGlu: 1.408 ± 0.022
MetPhe: 0.892 ± 0.021
MetGly: 1.216 ± 0.024
MetHis: 0.416 ± 0.014
MetIle: 1.369 ± 0.025
MetLys: 1.695 ± 0.029
MetLeu: 1.972 ± 0.03
MetMet: 0.582 ± 0.015
MetAsn: 1.371 ± 0.035
MetPro: 0.893 ± 0.022
MetGln: 0.798 ± 0.019
MetArg: 0.928 ± 0.018
MetSer: 2.098 ± 0.03
MetThr: 1.253 ± 0.023
MetVal: 1.287 ± 0.02
MetTrp: 0.179 ± 0.007
MetTyr: 0.642 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.231 ± 0.04
AsnCys: 0.667 ± 0.016
AsnAsp: 3.967 ± 0.039
AsnGlu: 4.184 ± 0.043
AsnPhe: 2.378 ± 0.032
AsnGly: 3.518 ± 0.052
AsnHis: 1.141 ± 0.021
AsnIle: 4.198 ± 0.04
AsnLys: 4.157 ± 0.049
AsnLeu: 4.945 ± 0.052
AsnMet: 1.406 ± 0.025
AsnAsn: 4.364 ± 0.081
AsnPro: 2.577 ± 0.069
AsnGln: 2.079 ± 0.037
AsnArg: 2.304 ± 0.03
AsnSer: 5.587 ± 0.065
AsnThr: 3.532 ± 0.051
AsnVal: 3.338 ± 0.034
AsnTrp: 0.6 ± 0.018
AsnTyr: 2.182 ± 0.032
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.364 ± 0.045
ProCys: 0.332 ± 0.013
ProAsp: 2.23 ± 0.031
ProGlu: 3.041 ± 0.043
ProPhe: 1.741 ± 0.029
ProGly: 2.054 ± 0.067
ProHis: 0.92 ± 0.025
ProIle: 2.715 ± 0.034
ProLys: 2.779 ± 0.041
ProLeu: 3.79 ± 0.037
ProMet: 0.919 ± 0.021
ProAsn: 2.389 ± 0.035
ProPro: 2.338 ± 0.075
ProGln: 1.899 ± 0.038
ProArg: 1.696 ± 0.026
ProSer: 3.993 ± 0.091
ProThr: 2.732 ± 0.036
ProVal: 2.777 ± 0.039
ProTrp: 0.39 ± 0.012
ProTyr: 1.401 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.078 ± 0.042
GlnCys: 0.426 ± 0.015
GlnAsp: 2.102 ± 0.032
GlnGlu: 2.696 ± 0.041
GlnPhe: 1.666 ± 0.028
GlnGly: 1.762 ± 0.033
GlnHis: 0.831 ± 0.021
GlnIle: 2.447 ± 0.033
GlnLys: 2.799 ± 0.038
GlnLeu: 4.123 ± 0.053
GlnMet: 0.914 ± 0.021
GlnAsn: 2.403 ± 0.036
GlnPro: 1.584 ± 0.036
GlnGln: 2.622 ± 0.097
GlnArg: 1.953 ± 0.031
GlnSer: 2.935 ± 0.04
GlnThr: 1.943 ± 0.03
GlnVal: 2.091 ± 0.027
GlnTrp: 0.404 ± 0.013
GlnTyr: 1.496 ± 0.025
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.402 ± 0.033
ArgCys: 0.53 ± 0.015
ArgAsp: 2.614 ± 0.037
ArgGlu: 2.843 ± 0.037
ArgPhe: 1.809 ± 0.025
ArgGly: 2.145 ± 0.041
ArgHis: 1.013 ± 0.02
ArgIle: 2.835 ± 0.032
ArgLys: 3.592 ± 0.045
ArgLeu: 4.112 ± 0.044
ArgMet: 0.976 ± 0.021
ArgAsn: 2.684 ± 0.035
ArgPro: 1.655 ± 0.027
ArgGln: 1.729 ± 0.029
ArgArg: 2.834 ± 0.043
ArgSer: 3.543 ± 0.048
ArgThr: 2.442 ± 0.034
ArgVal: 2.455 ± 0.035
ArgTrp: 0.452 ± 0.013
ArgTyr: 1.683 ± 0.026
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.537 ± 0.049
SerCys: 0.838 ± 0.022
SerAsp: 4.885 ± 0.058
SerGlu: 4.959 ± 0.06
SerPhe: 3.424 ± 0.052
SerGly: 5.005 ± 0.344
SerHis: 1.829 ± 0.034
SerIle: 5.622 ± 0.056
SerLys: 6.191 ± 0.061
SerLeu: 7.608 ± 0.067
SerMet: 1.959 ± 0.036
SerAsn: 5.7 ± 0.08
SerPro: 3.679 ± 0.06
SerGln: 3.302 ± 0.044
SerArg: 3.714 ± 0.046
SerSer: 9.709 ± 0.151
SerThr: 5.658 ± 0.063
SerVal: 4.818 ± 0.059
SerTrp: 0.728 ± 0.017
SerTyr: 2.654 ± 0.044
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.349 ± 0.038
ThrCys: 0.607 ± 0.018
ThrAsp: 3.272 ± 0.045
ThrGlu: 3.496 ± 0.043
ThrPhe: 2.249 ± 0.032
ThrGly: 3.245 ± 0.059
ThrHis: 1.186 ± 0.024
ThrIle: 3.959 ± 0.058
ThrLys: 4.026 ± 0.048
ThrLeu: 5.289 ± 0.049
ThrMet: 1.214 ± 0.022
ThrAsn: 3.542 ± 0.044
ThrPro: 3.032 ± 0.049
ThrGln: 1.945 ± 0.029
ThrArg: 2.384 ± 0.037
ThrSer: 5.185 ± 0.067
ThrThr: 4.5 ± 0.134
ThrVal: 3.71 ± 0.061
ThrTrp: 0.527 ± 0.015
ThrTyr: 1.8 ± 0.03
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.443 ± 0.042
ValCys: 0.752 ± 0.018
ValAsp: 3.767 ± 0.042
ValGlu: 3.814 ± 0.051
ValPhe: 2.494 ± 0.036
ValGly: 3.02 ± 0.034
ValHis: 1.167 ± 0.021
ValIle: 3.742 ± 0.04
ValLys: 3.966 ± 0.041
ValLeu: 5.445 ± 0.056
ValMet: 1.197 ± 0.021
ValAsn: 3.236 ± 0.046
ValPro: 2.827 ± 0.034
ValGln: 2.074 ± 0.036
ValArg: 2.523 ± 0.033
ValSer: 5.022 ± 0.056
ValThr: 3.484 ± 0.048
ValVal: 3.902 ± 0.054
ValTrp: 0.573 ± 0.016
ValTyr: 1.882 ± 0.024
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.504 ± 0.014
TrpCys: 0.215 ± 0.009
TrpAsp: 0.678 ± 0.017
TrpGlu: 0.596 ± 0.016
TrpPhe: 0.468 ± 0.015
TrpGly: 0.506 ± 0.015
TrpHis: 0.214 ± 0.009
TrpIle: 0.629 ± 0.02
TrpLys: 0.814 ± 0.018
TrpLeu: 0.907 ± 0.02
TrpMet: 0.213 ± 0.009
TrpAsn: 0.608 ± 0.015
TrpPro: 0.28 ± 0.011
TrpGln: 0.358 ± 0.013
TrpArg: 0.495 ± 0.016
TrpSer: 0.788 ± 0.016
TrpThr: 0.533 ± 0.019
TrpVal: 0.559 ± 0.015
TrpTrp: 0.138 ± 0.009
TrpTyr: 0.404 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.732 ± 0.031
TyrCys: 0.541 ± 0.013
TyrAsp: 2.248 ± 0.033
TyrGlu: 2.174 ± 0.031
TyrPhe: 1.66 ± 0.027
TyrGly: 1.97 ± 0.047
TyrHis: 0.826 ± 0.019
TyrIle: 2.181 ± 0.028
TyrLys: 2.37 ± 0.03
TyrLeu: 3.47 ± 0.045
TyrMet: 0.828 ± 0.017
TyrAsn: 2.103 ± 0.032
TyrPro: 1.344 ± 0.022
TyrGln: 1.406 ± 0.026
TyrArg: 1.545 ± 0.025
TyrSer: 2.822 ± 0.034
TyrThr: 1.801 ± 0.033
TyrVal: 1.949 ± 0.03
TyrTrp: 0.42 ± 0.015
TyrTyr: 1.493 ± 0.032
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5201 proteins (2635782 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
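As a rough illustration of what protein-level bootstrapping means, the sketch below resamples whole proteins with replacement 100 times and reports the spread of the resulting estimates for one dipeptide. It rests on assumptions that the page does not confirm: the helper names (`pair_per_mille`, `bootstrap_error`) are hypothetical, sequences use one-letter codes, and the "±" value is taken to be the sample standard deviation across resamples.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the estimate when whole proteins are resampled with replacement."""
    if not sequences:
        raise ValueError("need at least one sequence")
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    # Assumption: the reported error is the standard deviation over resamples.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AAAA", "ACAC", "CCCC", "AACC"]  # made-up sequences
    print(pair_per_mille(toy_proteome, "AA"))
    print(bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a larger error estimate.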

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski