Amino acid dipeptide frequency for Malassezia sympodialis (strain ATCC 42132) (Atopic eczema-associated yeast)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
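As an illustration of the calculation described above, here is a minimal Python sketch that counts dipeptides and reports them in per mille. It is not the database's own code: the function names (`dipeptide_per_mille`, `classify`), the use of overlapping residue pairs, the pooling of non-standard letters into `X`, and the 2.5‰ threshold for over- and underrepresentation are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Expected value if all 400 dipeptides were equally likely: 1000/400 = 2.5 per mille.
UNIFORM_EXPECTED = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein,
    and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            # Assumption: any non-standard or unknown letter is pooled as X (Xaa).
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_EXPECTED:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTED:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["MAALK", "AAC"]  # toy sequences, not real proteome data
    for pair, freq in sorted(dipeptide_per_mille(toy).items()):
        print(pair, round(freq, 3), classify(freq))
```

With this convention, the AlaAla value of 13.105‰ from the table would be labelled "over" and the CysCys value of 0.194‰ "under".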

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.105 ± 0.182
AlaCys: 1.44 ± 0.035
AlaAsp: 5.302 ± 0.061
AlaGlu: 6.15 ± 0.079
AlaPhe: 3.136 ± 0.044
AlaGly: 5.842 ± 0.066
AlaHis: 3.334 ± 0.054
AlaIle: 3.385 ± 0.045
AlaLys: 3.693 ± 0.063
AlaLeu: 11.761 ± 0.159
AlaMet: 2.719 ± 0.037
AlaAsn: 2.317 ± 0.037
AlaPro: 9.067 ± 0.139
AlaGln: 5.296 ± 0.071
AlaArg: 8.041 ± 0.096
AlaSer: 8.607 ± 0.08
AlaThr: 5.716 ± 0.06
AlaVal: 6.228 ± 0.066
AlaTrp: 1.901 ± 0.046
AlaTyr: 2.517 ± 0.04
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.382 ± 0.035
CysCys: 0.194 ± 0.01
CysAsp: 0.696 ± 0.019
CysGlu: 0.679 ± 0.016
CysPhe: 0.461 ± 0.016
CysGly: 0.874 ± 0.022
CysHis: 0.355 ± 0.015
CysIle: 0.602 ± 0.016
CysLys: 0.397 ± 0.015
CysLeu: 1.258 ± 0.027
CysMet: 0.336 ± 0.012
CysAsn: 0.28 ± 0.012
CysPro: 0.598 ± 0.021
CysGln: 0.409 ± 0.013
CysArg: 0.742 ± 0.018
CysSer: 0.735 ± 0.018
CysThr: 0.756 ± 0.02
CysVal: 1.047 ± 0.019
CysTrp: 0.175 ± 0.009
CysTyr: 0.285 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.313 ± 0.091
AspCys: 0.464 ± 0.013
AspAsp: 4.378 ± 0.074
AspGlu: 5.114 ± 0.063
AspPhe: 1.602 ± 0.027
AspGly: 3.651 ± 0.049
AspHis: 1.186 ± 0.025
AspIle: 2.107 ± 0.036
AspLys: 1.81 ± 0.033
AspLeu: 4.977 ± 0.05
AspMet: 1.685 ± 0.031
AspAsn: 1.249 ± 0.03
AspPro: 3.247 ± 0.037
AspGln: 1.684 ± 0.03
AspArg: 2.935 ± 0.036
AspSer: 3.257 ± 0.051
AspThr: 3.349 ± 0.041
AspVal: 4.555 ± 0.053
AspTrp: 0.789 ± 0.023
AspTyr: 1.322 ± 0.03
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.278 ± 0.094
GluCys: 0.648 ± 0.016
GluAsp: 3.466 ± 0.061
GluGlu: 4.379 ± 0.088
GluPhe: 1.57 ± 0.03
GluGly: 3.117 ± 0.041
GluHis: 1.897 ± 0.032
GluIle: 2.072 ± 0.038
GluLys: 2.586 ± 0.052
GluLeu: 5.865 ± 0.06
GluMet: 1.441 ± 0.026
GluAsn: 1.686 ± 0.028
GluPro: 3.553 ± 0.055
GluGln: 2.958 ± 0.051
GluArg: 5.224 ± 0.065
GluSer: 4.018 ± 0.052
GluThr: 3.051 ± 0.043
GluVal: 3.156 ± 0.045
GluTrp: 0.844 ± 0.019
GluTyr: 1.496 ± 0.032
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.86 ± 0.032
PheCys: 0.476 ± 0.017
PheAsp: 1.969 ± 0.032
PheGlu: 1.813 ± 0.03
PhePhe: 1.336 ± 0.031
PheGly: 2.35 ± 0.05
PheHis: 0.967 ± 0.022
PheIle: 1.057 ± 0.023
PheLys: 0.972 ± 0.024
PheLeu: 3.29 ± 0.05
PheMet: 0.716 ± 0.021
PheAsn: 0.846 ± 0.021
PhePro: 1.592 ± 0.033
PheGln: 1.328 ± 0.026
PheArg: 1.944 ± 0.027
PheSer: 2.42 ± 0.043
PheThr: 1.557 ± 0.033
PheVal: 2.584 ± 0.04
PheTrp: 0.49 ± 0.017
PheTyr: 0.929 ± 0.023
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.772 ± 0.084
GlyCys: 0.707 ± 0.019
GlyAsp: 3.22 ± 0.045
GlyGlu: 3.214 ± 0.043
GlyPhe: 2.087 ± 0.038
GlyGly: 4.436 ± 0.089
GlyHis: 1.854 ± 0.034
GlyIle: 2.527 ± 0.038
GlyLys: 2.33 ± 0.049
GlyLeu: 5.831 ± 0.058
GlyMet: 1.568 ± 0.03
GlyAsn: 1.503 ± 0.033
GlyPro: 3.374 ± 0.05
GlyGln: 2.281 ± 0.039
GlyArg: 4.102 ± 0.05
GlySer: 4.635 ± 0.07
GlyThr: 3.934 ± 0.047
GlyVal: 4.149 ± 0.053
GlyTrp: 0.955 ± 0.025
GlyTyr: 1.592 ± 0.034
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.771 ± 0.063
HisCys: 0.336 ± 0.013
HisAsp: 1.575 ± 0.024
HisGlu: 1.738 ± 0.027
HisPhe: 0.941 ± 0.021
HisGly: 2.094 ± 0.03
HisHis: 0.836 ± 0.026
HisIle: 1.264 ± 0.025
HisLys: 0.866 ± 0.021
HisLeu: 2.728 ± 0.043
HisMet: 0.857 ± 0.023
HisAsn: 0.671 ± 0.021
HisPro: 1.745 ± 0.031
HisGln: 0.941 ± 0.022
HisArg: 1.86 ± 0.033
HisSer: 1.802 ± 0.031
HisThr: 1.823 ± 0.033
HisVal: 2.366 ± 0.039
HisTrp: 0.426 ± 0.016
HisTyr: 0.717 ± 0.019
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.353 ± 0.044
IleCys: 0.487 ± 0.015
IleAsp: 2.316 ± 0.034
IleGlu: 2.263 ± 0.033
IlePhe: 1.27 ± 0.028
IleGly: 2.283 ± 0.045
IleHis: 1.089 ± 0.024
IleIle: 1.385 ± 0.039
IleLys: 1.425 ± 0.033
IleLeu: 3.578 ± 0.05
IleMet: 0.847 ± 0.022
IleAsn: 1.076 ± 0.024
IlePro: 2.168 ± 0.033
IleGln: 1.725 ± 0.034
IleArg: 2.3 ± 0.04
IleSer: 2.495 ± 0.037
IleThr: 1.86 ± 0.033
IleVal: 2.873 ± 0.042
IleTrp: 0.486 ± 0.017
IleTyr: 0.927 ± 0.021
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.378 ± 0.056
LysCys: 0.355 ± 0.014
LysAsp: 2.008 ± 0.039
LysGlu: 2.406 ± 0.056
LysPhe: 0.958 ± 0.023
LysGly: 1.962 ± 0.04
LysHis: 0.987 ± 0.022
LysIle: 1.453 ± 0.034
LysLys: 2.152 ± 0.058
LysLeu: 3.206 ± 0.048
LysMet: 0.869 ± 0.02
LysAsn: 1.243 ± 0.026
LysPro: 1.985 ± 0.046
LysGln: 1.573 ± 0.027
LysArg: 2.876 ± 0.048
LysSer: 2.461 ± 0.042
LysThr: 1.933 ± 0.032
LysVal: 2.136 ± 0.037
LysTrp: 0.407 ± 0.014
LysTyr: 1.015 ± 0.026
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.76 ± 0.128
LeuCys: 1.589 ± 0.031
LeuAsp: 5.934 ± 0.052
LeuGlu: 5.859 ± 0.066
LeuPhe: 3.438 ± 0.045
LeuGly: 6.275 ± 0.06
LeuHis: 3.076 ± 0.045
LeuIle: 3.053 ± 0.049
LeuLys: 3.001 ± 0.045
LeuLeu: 10.4 ± 0.116
LeuMet: 2.014 ± 0.029
LeuAsn: 2.522 ± 0.044
LeuPro: 6.285 ± 0.065
LeuGln: 4.753 ± 0.063
LeuArg: 7.578 ± 0.072
LeuSer: 7.478 ± 0.077
LeuThr: 4.769 ± 0.05
LeuVal: 7.041 ± 0.073
LeuTrp: 1.389 ± 0.029
LeuTyr: 2.645 ± 0.041
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.702 ± 0.035
MetCys: 0.275 ± 0.011
MetAsp: 1.646 ± 0.025
MetGlu: 1.481 ± 0.026
MetPhe: 0.696 ± 0.018
MetGly: 1.545 ± 0.031
MetHis: 0.771 ± 0.019
MetIle: 0.881 ± 0.026
MetLys: 0.737 ± 0.02
MetLeu: 2.436 ± 0.036
MetMet: 0.592 ± 0.018
MetAsn: 0.754 ± 0.019
MetPro: 1.801 ± 0.035
MetGln: 1.147 ± 0.024
MetArg: 1.798 ± 0.029
MetSer: 2.076 ± 0.039
MetThr: 1.255 ± 0.025
MetVal: 1.493 ± 0.027
MetTrp: 0.282 ± 0.014
MetTyr: 0.681 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.669 ± 0.04
AsnCys: 0.26 ± 0.011
AsnAsp: 1.554 ± 0.032
AsnGlu: 1.834 ± 0.034
AsnPhe: 0.824 ± 0.022
AsnGly: 1.634 ± 0.033
AsnHis: 0.579 ± 0.017
AsnIle: 1.173 ± 0.027
AsnLys: 1.1 ± 0.025
AsnLeu: 2.417 ± 0.045
AsnMet: 0.775 ± 0.019
AsnAsn: 0.806 ± 0.026
AsnPro: 1.57 ± 0.03
AsnGln: 0.963 ± 0.022
AsnArg: 1.371 ± 0.025
AsnSer: 1.693 ± 0.052
AsnThr: 1.507 ± 0.033
AsnVal: 2.03 ± 0.034
AsnTrp: 0.342 ± 0.013
AsnTyr: 0.746 ± 0.023
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.783 ± 0.097
ProCys: 0.612 ± 0.023
ProAsp: 3.247 ± 0.041
ProGlu: 3.778 ± 0.05
ProPhe: 1.894 ± 0.028
ProGly: 3.933 ± 0.05
ProHis: 1.708 ± 0.028
ProIle: 2.064 ± 0.035
ProLys: 2.137 ± 0.037
ProLeu: 5.853 ± 0.068
ProMet: 1.651 ± 0.033
ProAsn: 1.635 ± 0.032
ProPro: 6.114 ± 0.111
ProGln: 2.152 ± 0.041
ProArg: 4.329 ± 0.052
ProSer: 6.181 ± 0.085
ProThr: 4.075 ± 0.059
ProVal: 4.423 ± 0.065
ProTrp: 0.919 ± 0.02
ProTyr: 1.518 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.998 ± 0.073
GlnCys: 0.546 ± 0.017
GlnAsp: 2.025 ± 0.029
GlnGlu: 2.262 ± 0.035
GlnPhe: 1.168 ± 0.025
GlnGly: 2.623 ± 0.038
GlnHis: 1.311 ± 0.023
GlnIle: 1.529 ± 0.027
GlnLys: 1.432 ± 0.025
GlnLeu: 4.35 ± 0.057
GlnMet: 1.021 ± 0.027
GlnAsn: 1.146 ± 0.026
GlnPro: 2.381 ± 0.048
GlnGln: 2.156 ± 0.042
GlnArg: 3.72 ± 0.046
GlnSer: 2.78 ± 0.035
GlnThr: 2.082 ± 0.03
GlnVal: 2.894 ± 0.039
GlnTrp: 0.641 ± 0.016
GlnTyr: 1.072 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.483 ± 0.093
ArgCys: 0.811 ± 0.02
ArgAsp: 3.681 ± 0.045
ArgGlu: 4.323 ± 0.051
ArgPhe: 2.177 ± 0.031
ArgGly: 4.037 ± 0.052
ArgHis: 2.158 ± 0.035
ArgIle: 2.84 ± 0.039
ArgLys: 2.575 ± 0.042
ArgLeu: 6.992 ± 0.067
ArgMet: 1.8 ± 0.032
ArgAsn: 1.598 ± 0.028
ArgPro: 4.221 ± 0.057
ArgGln: 2.805 ± 0.047
ArgArg: 6.127 ± 0.071
ArgSer: 4.846 ± 0.054
ArgThr: 4.204 ± 0.05
ArgVal: 4.769 ± 0.062
ArgTrp: 1.038 ± 0.025
ArgTyr: 1.66 ± 0.03
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.404 ± 0.077
SerCys: 0.736 ± 0.023
SerAsp: 4.229 ± 0.054
SerGlu: 4.12 ± 0.057
SerPhe: 2.513 ± 0.045
SerGly: 4.522 ± 0.065
SerHis: 2.058 ± 0.037
SerIle: 2.805 ± 0.041
SerLys: 2.793 ± 0.048
SerLeu: 7.43 ± 0.068
SerMet: 2.235 ± 0.037
SerAsn: 2.104 ± 0.056
SerPro: 4.721 ± 0.086
SerGln: 2.926 ± 0.042
SerArg: 4.581 ± 0.059
SerSer: 7.252 ± 0.134
SerThr: 4.708 ± 0.054
SerVal: 4.97 ± 0.047
SerTrp: 0.957 ± 0.023
SerTyr: 1.733 ± 0.031
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.046 ± 0.058
ThrCys: 0.685 ± 0.018
ThrAsp: 2.836 ± 0.038
ThrGlu: 3.031 ± 0.048
ThrPhe: 1.791 ± 0.026
ThrGly: 3.539 ± 0.045
ThrHis: 1.714 ± 0.031
ThrIle: 2.115 ± 0.033
ThrLys: 1.919 ± 0.033
ThrLeu: 5.93 ± 0.058
ThrMet: 1.435 ± 0.025
ThrAsn: 1.495 ± 0.033
ThrPro: 4.343 ± 0.06
ThrGln: 2.3 ± 0.032
ThrArg: 3.57 ± 0.048
ThrSer: 4.562 ± 0.058
ThrThr: 3.072 ± 0.046
ThrVal: 3.509 ± 0.041
ThrTrp: 0.863 ± 0.019
ThrTyr: 1.474 ± 0.029
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.512 ± 0.08
ValCys: 1.055 ± 0.023
ValAsp: 3.969 ± 0.042
ValGlu: 3.489 ± 0.043
ValPhe: 2.218 ± 0.038
ValGly: 3.751 ± 0.043
ValHis: 2.275 ± 0.035
ValIle: 2.277 ± 0.036
ValLys: 2.061 ± 0.035
ValLeu: 7.43 ± 0.077
ValMet: 1.4 ± 0.028
ValAsn: 1.664 ± 0.032
ValPro: 5.314 ± 0.065
ValGln: 3.244 ± 0.041
ValArg: 5.319 ± 0.055
ValSer: 4.896 ± 0.047
ValThr: 3.205 ± 0.044
ValVal: 4.588 ± 0.052
ValTrp: 1.083 ± 0.026
ValTyr: 1.912 ± 0.031
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.341 ± 0.028
TrpCys: 0.224 ± 0.011
TrpAsp: 1.051 ± 0.024
TrpGlu: 0.754 ± 0.018
TrpPhe: 0.456 ± 0.016
TrpGly: 0.787 ± 0.021
TrpHis: 0.524 ± 0.016
TrpIle: 0.63 ± 0.018
TrpLys: 0.517 ± 0.015
TrpLeu: 1.54 ± 0.03
TrpMet: 0.37 ± 0.013
TrpAsn: 0.516 ± 0.018
TrpPro: 0.703 ± 0.02
TrpGln: 0.589 ± 0.017
TrpArg: 1.166 ± 0.026
TrpSer: 1.035 ± 0.024
TrpThr: 0.91 ± 0.022
TrpVal: 0.844 ± 0.02
TrpTrp: 0.246 ± 0.013
TrpTyr: 0.382 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.647 ± 0.047
TyrCys: 0.358 ± 0.014
TyrAsp: 1.573 ± 0.025
TyrGlu: 1.616 ± 0.03
TyrPhe: 0.958 ± 0.027
TyrGly: 1.819 ± 0.04
TyrHis: 0.68 ± 0.017
TyrIle: 0.986 ± 0.019
TyrLys: 0.86 ± 0.022
TyrLeu: 2.588 ± 0.042
TyrMet: 0.718 ± 0.018
TyrAsn: 0.736 ± 0.021
TyrPro: 1.223 ± 0.033
TyrGln: 0.909 ± 0.019
TyrArg: 1.56 ± 0.029
TyrSer: 1.566 ± 0.028
TyrThr: 1.465 ± 0.026
TyrVal: 2.009 ± 0.03
TyrTrp: 0.354 ± 0.013
TyrTyr: 0.758 ± 0.021
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4501 proteins (2239772 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
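To make the note above concrete, the following Python sketch shows one way a protein-level bootstrap could be carried out: whole proteins are resampled with replacement 100 times and the dipeptide frequency is recomputed for each resample. This is an illustration under assumptions, not the database's actual procedure. The function names, the fixed random seed, and the choice of the standard deviation across resamples as the reported error are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of one dipeptide's per mille
    frequency over n_boot resamples of whole proteins.

    Assumption: the error is the standard deviation across resamples;
    the source only states that bootstrapping (x100) was used.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins (not residues) with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MAALK", "AAC", "GAAG", "LLKAA"]  # toy sequences, not real data
    mean, err = bootstrap_error(toy, "AA")
    print(f"AA: {mean:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps the dipeptides of each protein together, so the spread between resamples reflects variation from protein to protein.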

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski