Amino acid dipeptide frequency for Candida maltosa (strain Xu316) (Yeast)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
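
The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: it assumes one-letter sequences, counts overlapping residue pairs inside each protein (never across protein boundaries), and compares each frequency with the uniform expectation of 2.5‰. The function names and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids as one-letter codes (Xaa is left out here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 equally likely dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille}.

    Assumption: pairs are counted with a sliding window of width 2
    inside each protein, so a protein of length n contributes n - 1 pairs.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked red in the table
    if freq_per_mille < expected:
        return "under"  # would be marked blue in the table
    return "equal"


# Made-up toy sequences, not taken from the proteome.
toy_proteins = ["MKLLSS", "MSSA"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 1), classify(value))
```

Applied to values from the table, `classify(10.613)` (SerSer) gives "over" and `classify(0.172)` (TrpTrp) gives "under".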

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
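
As a quick check of the unit conversion, the following minimal sketch converts a per mille value into a fraction and into a percentage. The helper names are hypothetical; the example value is the SerSer entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value_per_mille / 10.0


serser = 10.613  # SerSer frequency from the table, in per mille
print(f"{per_mille_to_fraction(serser):.6f}")  # fraction of all dipeptides
print(f"{per_mille_to_percent(serser):.4f}")   # the same value in percent
```

By the same conversion, the uniform expectation of 2.5‰ corresponds to 0.25%.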

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error). Rows: first residue; columns: second residue.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.925 ± 0.075 | 0.592 ± 0.015 | 2.471 ± 0.033 | 2.936 ± 0.045 | 2.144 ± 0.029 | 2.926 ± 0.048 | 0.936 ± 0.019 | 3.743 ± 0.039 | 3.855 ± 0.053 | 4.7 ± 0.05 | 1.049 ± 0.021 | 3.042 ± 0.038 | 2.331 ± 0.037 | 1.85 ± 0.031 | 2.081 ± 0.03 | 4.441 ± 0.045 | 3.585 ± 0.044 | 3.084 ± 0.04 | 0.459 ± 0.015 | 1.632 ± 0.025 | 0.0 ± 0.0
Cys | 0.516 ± 0.016 | 0.233 ± 0.01 | 0.61 ± 0.016 | 0.543 ± 0.014 | 0.591 ± 0.014 | 0.772 ± 0.02 | 0.271 ± 0.011 | 0.77 ± 0.019 | 0.645 ± 0.016 | 1.112 ± 0.021 | 0.197 ± 0.008 | 0.52 ± 0.016 | 0.477 ± 0.013 | 0.394 ± 0.013 | 0.406 ± 0.012 | 0.805 ± 0.02 | 0.531 ± 0.015 | 0.635 ± 0.016 | 0.141 ± 0.006 | 0.402 ± 0.012 | 0.0 ± 0.0
Asp | 2.89 ± 0.039 | 0.577 ± 0.015 | 5.994 ± 0.088 | 5.81 ± 0.068 | 2.978 ± 0.037 | 2.966 ± 0.042 | 1.162 ± 0.019 | 4.21 ± 0.038 | 3.979 ± 0.042 | 5.805 ± 0.052 | 1.057 ± 0.021 | 3.408 ± 0.039 | 2.541 ± 0.03 | 2.035 ± 0.025 | 1.84 ± 0.024 | 4.644 ± 0.055 | 2.971 ± 0.04 | 3.543 ± 0.04 | 0.62 ± 0.014 | 2.439 ± 0.034 | 0.0 ± 0.0
Glu | 3.114 ± 0.04 | 0.588 ± 0.015 | 4.794 ± 0.062 | 7.176 ± 0.108 | 2.95 ± 0.035 | 2.609 ± 0.037 | 1.161 ± 0.021 | 4.691 ± 0.048 | 5.104 ± 0.063 | 6.571 ± 0.061 | 1.268 ± 0.021 | 3.988 ± 0.039 | 2.328 ± 0.047 | 2.618 ± 0.03 | 2.42 ± 0.036 | 5.38 ± 0.068 | 3.662 ± 0.054 | 3.808 ± 0.046 | 0.584 ± 0.015 | 2.402 ± 0.034 | 0.0 ± 0.0
Phe | 2.415 ± 0.034 | 0.481 ± 0.015 | 2.983 ± 0.038 | 2.885 ± 0.035 | 2.075 ± 0.037 | 2.849 ± 0.045 | 0.97 ± 0.02 | 3.233 ± 0.043 | 3.24 ± 0.035 | 3.898 ± 0.045 | 0.88 ± 0.021 | 2.932 ± 0.035 | 1.929 ± 0.028 | 1.796 ± 0.025 | 1.537 ± 0.024 | 3.374 ± 0.038 | 2.74 ± 0.031 | 2.681 ± 0.033 | 0.518 ± 0.015 | 1.624 ± 0.024 | 0.0 ± 0.0
Gly | 2.879 ± 0.041 | 0.654 ± 0.017 | 3.012 ± 0.041 | 2.98 ± 0.036 | 2.602 ± 0.038 | 3.735 ± 0.063 | 1.022 ± 0.022 | 3.516 ± 0.042 | 3.649 ± 0.047 | 4.622 ± 0.041 | 0.935 ± 0.02 | 2.984 ± 0.047 | 1.622 ± 0.026 | 1.604 ± 0.027 | 1.917 ± 0.032 | 4.552 ± 0.058 | 2.939 ± 0.041 | 3.324 ± 0.043 | 0.617 ± 0.019 | 2.086 ± 0.036 | 0.0 ± 0.0
His | 0.961 ± 0.021 | 0.248 ± 0.009 | 1.298 ± 0.02 | 1.355 ± 0.022 | 0.911 ± 0.019 | 1.102 ± 0.019 | 0.96 ± 0.032 | 1.318 ± 0.021 | 1.357 ± 0.023 | 1.924 ± 0.027 | 0.324 ± 0.011 | 1.229 ± 0.023 | 1.125 ± 0.024 | 1.147 ± 0.023 | 0.875 ± 0.02 | 1.599 ± 0.024 | 1.013 ± 0.019 | 1.131 ± 0.023 | 0.206 ± 0.009 | 0.809 ± 0.018 | 0.0 ± 0.0
Ile | 3.684 ± 0.04 | 0.77 ± 0.016 | 4.727 ± 0.041 | 4.392 ± 0.042 | 2.993 ± 0.045 | 3.527 ± 0.038 | 1.491 ± 0.022 | 4.811 ± 0.049 | 4.904 ± 0.046 | 6.143 ± 0.049 | 1.274 ± 0.023 | 4.442 ± 0.048 | 3.656 ± 0.041 | 2.542 ± 0.03 | 2.58 ± 0.033 | 5.689 ± 0.055 | 4.143 ± 0.042 | 4.07 ± 0.042 | 0.711 ± 0.016 | 2.309 ± 0.031 | 0.0 ± 0.0
Lys | 3.29 ± 0.047 | 0.709 ± 0.017 | 4.001 ± 0.047 | 5.066 ± 0.06 | 3.432 ± 0.041 | 2.693 ± 0.036 | 1.515 ± 0.023 | 5.102 ± 0.046 | 6.676 ± 0.079 | 7.612 ± 0.063 | 1.268 ± 0.024 | 4.286 ± 0.042 | 3.245 ± 0.042 | 3.098 ± 0.035 | 3.259 ± 0.041 | 5.984 ± 0.058 | 3.738 ± 0.037 | 4.09 ± 0.041 | 0.713 ± 0.018 | 2.918 ± 0.035 | 0.0 ± 0.0
Leu | 5.085 ± 0.053 | 0.989 ± 0.021 | 5.328 ± 0.046 | 5.814 ± 0.057 | 4.01 ± 0.044 | 4.572 ± 0.052 | 1.908 ± 0.026 | 6.433 ± 0.063 | 6.98 ± 0.057 | 8.386 ± 0.076 | 1.761 ± 0.022 | 5.791 ± 0.058 | 4.588 ± 0.041 | 3.898 ± 0.044 | 3.672 ± 0.041 | 7.966 ± 0.06 | 5.511 ± 0.049 | 5.388 ± 0.05 | 0.773 ± 0.019 | 2.964 ± 0.035 | 0.0 ± 0.0
Met | 1.109 ± 0.022 | 0.207 ± 0.009 | 1.064 ± 0.022 | 1.078 ± 0.018 | 0.816 ± 0.017 | 0.968 ± 0.018 | 0.282 ± 0.011 | 1.283 ± 0.022 | 1.424 ± 0.023 | 1.529 ± 0.025 | 0.488 ± 0.016 | 1.193 ± 0.022 | 0.662 ± 0.016 | 0.589 ± 0.016 | 0.637 ± 0.014 | 1.885 ± 0.022 | 1.134 ± 0.019 | 1.092 ± 0.021 | 0.159 ± 0.008 | 0.63 ± 0.015 | 0.0 ± 0.0
Asn | 2.784 ± 0.032 | 0.62 ± 0.015 | 4.047 ± 0.04 | 3.924 ± 0.04 | 2.713 ± 0.036 | 3.591 ± 0.051 | 1.456 ± 0.023 | 3.864 ± 0.039 | 4.182 ± 0.04 | 5.755 ± 0.065 | 1.005 ± 0.017 | 5.258 ± 0.087 | 2.776 ± 0.033 | 2.666 ± 0.036 | 2.048 ± 0.029 | 5.25 ± 0.062 | 3.422 ± 0.041 | 3.245 ± 0.031 | 0.689 ± 0.018 | 2.482 ± 0.035 | 0.0 ± 0.0
Pro | 2.352 ± 0.043 | 0.323 ± 0.012 | 2.399 ± 0.031 | 3.283 ± 0.034 | 1.9 ± 0.028 | 2.048 ± 0.031 | 0.975 ± 0.022 | 3.214 ± 0.034 | 3.299 ± 0.044 | 3.642 ± 0.042 | 0.784 ± 0.02 | 2.672 ± 0.035 | 3.578 ± 0.081 | 2.463 ± 0.052 | 1.633 ± 0.027 | 4.319 ± 0.048 | 3.305 ± 0.046 | 2.951 ± 0.037 | 0.376 ± 0.011 | 1.453 ± 0.023 | 0.0 ± 0.0
Gln | 2.057 ± 0.033 | 0.382 ± 0.013 | 2.115 ± 0.027 | 2.855 ± 0.039 | 1.865 ± 0.028 | 1.682 ± 0.027 | 1.053 ± 0.023 | 2.597 ± 0.033 | 2.575 ± 0.034 | 4.037 ± 0.039 | 0.745 ± 0.018 | 2.138 ± 0.032 | 2.41 ± 0.051 | 5.073 ± 0.151 | 1.588 ± 0.025 | 3.289 ± 0.042 | 2.088 ± 0.027 | 2.192 ± 0.029 | 0.356 ± 0.012 | 1.479 ± 0.024 | 0.0 ± 0.0
Arg | 1.863 ± 0.028 | 0.424 ± 0.013 | 2.118 ± 0.027 | 2.333 ± 0.033 | 1.803 ± 0.027 | 1.854 ± 0.035 | 0.828 ± 0.019 | 2.593 ± 0.027 | 3.246 ± 0.041 | 3.584 ± 0.038 | 0.743 ± 0.015 | 2.267 ± 0.031 | 1.502 ± 0.027 | 1.608 ± 0.026 | 2.243 ± 0.033 | 3.095 ± 0.036 | 1.96 ± 0.029 | 2.051 ± 0.029 | 0.396 ± 0.013 | 1.407 ± 0.021 | 0.0 ± 0.0
Ser | 4.153 ± 0.051 | 0.786 ± 0.019 | 4.66 ± 0.05 | 4.685 ± 0.057 | 3.827 ± 0.037 | 4.428 ± 0.059 | 1.697 ± 0.03 | 6.236 ± 0.053 | 6.127 ± 0.053 | 7.664 ± 0.054 | 1.546 ± 0.024 | 5.489 ± 0.055 | 4.037 ± 0.052 | 3.347 ± 0.039 | 3.236 ± 0.033 | 10.613 ± 0.155 | 6.425 ± 0.07 | 4.631 ± 0.045 | 0.793 ± 0.018 | 2.634 ± 0.034 | 0.0 ± 0.0
Thr | 3.171 ± 0.033 | 0.612 ± 0.017 | 3.193 ± 0.04 | 3.388 ± 0.05 | 2.526 ± 0.031 | 3.288 ± 0.041 | 1.084 ± 0.02 | 4.318 ± 0.044 | 4.244 ± 0.044 | 4.937 ± 0.053 | 0.969 ± 0.022 | 4.006 ± 0.049 | 3.529 ± 0.051 | 1.954 ± 0.028 | 2.146 ± 0.031 | 5.85 ± 0.064 | 5.817 ± 0.104 | 3.374 ± 0.043 | 0.597 ± 0.015 | 1.818 ± 0.027 | 0.0 ± 0.0
Val | 3.362 ± 0.04 | 0.669 ± 0.016 | 3.758 ± 0.044 | 3.969 ± 0.055 | 2.582 ± 0.035 | 3.075 ± 0.043 | 1.073 ± 0.021 | 3.939 ± 0.039 | 4.054 ± 0.045 | 5.288 ± 0.051 | 1.088 ± 0.021 | 3.259 ± 0.032 | 2.853 ± 0.034 | 1.984 ± 0.028 | 2.002 ± 0.031 | 4.794 ± 0.043 | 3.449 ± 0.042 | 3.91 ± 0.047 | 0.589 ± 0.017 | 2.022 ± 0.025 | 0.0 ± 0.0
Trp | 0.459 ± 0.015 | 0.193 ± 0.008 | 0.599 ± 0.015 | 0.586 ± 0.013 | 0.539 ± 0.015 | 0.554 ± 0.015 | 0.173 ± 0.009 | 0.713 ± 0.017 | 0.792 ± 0.017 | 0.956 ± 0.02 | 0.212 ± 0.008 | 0.624 ± 0.016 | 0.289 ± 0.012 | 0.309 ± 0.011 | 0.425 ± 0.014 | 0.78 ± 0.018 | 0.569 ± 0.016 | 0.563 ± 0.017 | 0.172 ± 0.008 | 0.389 ± 0.015 | 0.0 ± 0.0
Tyr | 1.713 ± 0.026 | 0.516 ± 0.015 | 2.296 ± 0.03 | 2.158 ± 0.032 | 1.785 ± 0.023 | 2.012 ± 0.035 | 0.874 ± 0.02 | 2.277 ± 0.031 | 2.361 ± 0.029 | 3.654 ± 0.048 | 0.635 ± 0.015 | 2.248 ± 0.032 | 1.492 ± 0.024 | 1.602 ± 0.028 | 1.376 ± 0.022 | 2.702 ± 0.037 | 1.818 ± 0.026 | 1.954 ± 0.029 | 0.427 ± 0.013 | 1.588 ± 0.028 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 5976 proteins (2856021 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
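
The page does not describe the bootstrap procedure in detail, so the following is only one plausible reading of "bootstrapping (×100) at the protein level": whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_stats(seq, pair):
    """Return (occurrences of `pair`, total overlapping pairs) for one protein."""
    total = max(len(seq) - 1, 0)
    hits = sum(1 for i in range(total) if seq[i:i + 2] == pair)
    return hits, total


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate mean and standard deviation (in per mille) of a dipeptide frequency.

    Assumption: each round draws len(proteins) proteins with replacement,
    so the resampling unit is the whole protein, not the single dipeptide.
    """
    rng = random.Random(seed)
    stats = [pair_stats(seq, pair) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(stats, k=len(stats))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Made-up toy sequences, not taken from the proteome.
toy_proteins = ["MKLLSS", "MSSA", "MSSSSK", "MAAKL"]
mean_ss, err_ss = bootstrap_error(toy_proteins, "SS")
print(f"SS: {mean_ss:.3f} ± {err_ss:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same sequence together, which matters when a few proteins contain long runs of one residue.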

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski