Amino acid dipepetide frequency for [Candida] intermedia

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.192AlaAla: 5.192 ± 0.076
0.726AlaCys: 0.726 ± 0.016
3.319AlaAsp: 3.319 ± 0.041
4.056AlaGlu: 4.056 ± 0.076
2.586AlaPhe: 2.586 ± 0.036
3.675AlaGly: 3.675 ± 0.057
1.268AlaHis: 1.268 ± 0.023
3.852AlaIle: 3.852 ± 0.04
4.291AlaLys: 4.291 ± 0.05
6.407AlaLeu: 6.407 ± 0.052
1.379AlaMet: 1.379 ± 0.022
3.244AlaAsn: 3.244 ± 0.04
3.039AlaPro: 3.039 ± 0.055
2.402AlaGln: 2.402 ± 0.035
2.903AlaArg: 2.903 ± 0.034
6.093AlaSer: 6.093 ± 0.256
4.054AlaThr: 4.054 ± 0.056
4.145AlaVal: 4.145 ± 0.05
0.599AlaTrp: 0.599 ± 0.016
1.92AlaTyr: 1.92 ± 0.028
0.0AlaXaa: 0.0 ± 0.0
Cys
0.639CysAla: 0.639 ± 0.015
0.226CysCys: 0.226 ± 0.01
0.661CysAsp: 0.661 ± 0.018
0.593CysGlu: 0.593 ± 0.018
0.599CysPhe: 0.599 ± 0.013
0.843CysGly: 0.843 ± 0.021
0.282CysHis: 0.282 ± 0.01
0.705CysIle: 0.705 ± 0.015
0.596CysLys: 0.596 ± 0.015
1.25CysLeu: 1.25 ± 0.026
0.239CysMet: 0.239 ± 0.011
0.515CysAsn: 0.515 ± 0.014
0.493CysPro: 0.493 ± 0.016
0.374CysGln: 0.374 ± 0.012
0.481CysArg: 0.481 ± 0.013
0.817CysSer: 0.817 ± 0.018
0.577CysThr: 0.577 ± 0.016
0.751CysVal: 0.751 ± 0.019
0.164CysTrp: 0.164 ± 0.007
0.41CysTyr: 0.41 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
3.678AspAla: 3.678 ± 0.113
0.562AspCys: 0.562 ± 0.016
4.477AspAsp: 4.477 ± 0.059
5.014AspGlu: 5.014 ± 0.079
2.833AspPhe: 2.833 ± 0.038
3.148AspGly: 3.148 ± 0.053
1.219AspHis: 1.219 ± 0.022
3.667AspIle: 3.667 ± 0.041
3.323AspLys: 3.323 ± 0.039
6.131AspLeu: 6.131 ± 0.056
1.204AspMet: 1.204 ± 0.021
2.604AspAsn: 2.604 ± 0.036
2.561AspPro: 2.561 ± 0.033
1.834AspGln: 1.834 ± 0.028
2.147AspArg: 2.147 ± 0.029
5.023AspSer: 5.023 ± 0.073
2.945AspThr: 2.945 ± 0.037
3.892AspVal: 3.892 ± 0.052
0.639AspTrp: 0.639 ± 0.016
2.217AspTyr: 2.217 ± 0.026
0.0AspXaa: 0.0 ± 0.0
Glu
4.262GluAla: 4.262 ± 0.182
0.63GluCys: 0.63 ± 0.018
4.427GluAsp: 4.427 ± 0.056
5.973GluGlu: 5.973 ± 0.085
2.807GluPhe: 2.807 ± 0.041
2.939GluGly: 2.939 ± 0.04
1.314GluHis: 1.314 ± 0.026
4.0GluIle: 4.0 ± 0.045
4.962GluLys: 4.962 ± 0.065
6.801GluLeu: 6.801 ± 0.076
1.382GluMet: 1.382 ± 0.02
3.469GluAsn: 3.469 ± 0.038
2.222GluPro: 2.222 ± 0.041
2.403GluGln: 2.403 ± 0.037
2.822GluArg: 2.822 ± 0.035
5.236GluSer: 5.236 ± 0.092
3.778GluThr: 3.778 ± 0.094
4.254GluVal: 4.254 ± 0.063
0.664GluTrp: 0.664 ± 0.016
2.191GluTyr: 2.191 ± 0.03
0.0GluXaa: 0.0 ± 0.0
Phe
2.838PheAla: 2.838 ± 0.037
0.55PheCys: 0.55 ± 0.014
2.878PheAsp: 2.878 ± 0.035
2.733PheGlu: 2.733 ± 0.037
2.113PhePhe: 2.113 ± 0.042
2.993PheGly: 2.993 ± 0.051
1.032PheHis: 1.032 ± 0.019
2.395PheIle: 2.395 ± 0.036
2.611PheLys: 2.611 ± 0.035
4.291PheLeu: 4.291 ± 0.048
0.954PheMet: 0.954 ± 0.02
2.34PheAsn: 2.34 ± 0.03
1.936PhePro: 1.936 ± 0.037
1.64PheGln: 1.64 ± 0.024
1.805PheArg: 1.805 ± 0.026
3.546PheSer: 3.546 ± 0.04
2.52PheThr: 2.52 ± 0.045
2.989PheVal: 2.989 ± 0.041
0.53PheTrp: 0.53 ± 0.016
1.499PheTyr: 1.499 ± 0.028
0.0PheXaa: 0.0 ± 0.0
Gly
3.74GlyAla: 3.74 ± 0.063
0.706GlyCys: 0.706 ± 0.015
3.121GlyAsp: 3.121 ± 0.041
3.228GlyGlu: 3.228 ± 0.046
2.606GlyPhe: 2.606 ± 0.033
3.79GlyGly: 3.79 ± 0.061
1.245GlyHis: 1.245 ± 0.025
3.336GlyIle: 3.336 ± 0.04
3.41GlyLys: 3.41 ± 0.04
5.282GlyLeu: 5.282 ± 0.053
1.139GlyMet: 1.139 ± 0.023
2.754GlyAsn: 2.754 ± 0.039
2.096GlyPro: 2.096 ± 0.043
1.913GlyGln: 1.913 ± 0.035
2.33GlyArg: 2.33 ± 0.031
5.305GlySer: 5.305 ± 0.11
3.318GlyThr: 3.318 ± 0.086
3.777GlyVal: 3.777 ± 0.06
0.669GlyTrp: 0.669 ± 0.016
2.059GlyTyr: 2.059 ± 0.029
0.0GlyXaa: 0.0 ± 0.0
His
1.148HisAla: 1.148 ± 0.021
0.26HisCys: 0.26 ± 0.01
1.225HisAsp: 1.225 ± 0.025
1.335HisGlu: 1.335 ± 0.022
1.034HisPhe: 1.034 ± 0.019
1.26HisGly: 1.26 ± 0.025
0.756HisHis: 0.756 ± 0.022
1.291HisIle: 1.291 ± 0.019
1.251HisLys: 1.251 ± 0.024
2.389HisLeu: 2.389 ± 0.032
0.427HisMet: 0.427 ± 0.011
1.054HisAsn: 1.054 ± 0.02
1.135HisPro: 1.135 ± 0.021
0.957HisGln: 0.957 ± 0.024
1.001HisArg: 1.001 ± 0.018
1.792HisSer: 1.792 ± 0.033
1.078HisThr: 1.078 ± 0.017
1.272HisVal: 1.272 ± 0.021
0.223HisTrp: 0.223 ± 0.009
0.829HisTyr: 0.829 ± 0.016
0.0HisXaa: 0.0 ± 0.0
Ile
3.786IleAla: 3.786 ± 0.047
0.738IleCys: 0.738 ± 0.016
3.661IleAsp: 3.661 ± 0.038
3.664IleGlu: 3.664 ± 0.042
2.489IlePhe: 2.489 ± 0.037
3.105IleGly: 3.105 ± 0.042
1.325IleHis: 1.325 ± 0.022
3.163IleIle: 3.163 ± 0.041
3.418IleLys: 3.418 ± 0.04
5.669IleLeu: 5.669 ± 0.057
1.175IleMet: 1.175 ± 0.022
2.896IleAsn: 2.896 ± 0.038
2.942IlePro: 2.942 ± 0.035
2.06IleGln: 2.06 ± 0.028
2.529IleArg: 2.529 ± 0.03
4.914IleSer: 4.914 ± 0.065
3.261IleThr: 3.261 ± 0.04
3.959IleVal: 3.959 ± 0.055
0.595IleTrp: 0.595 ± 0.015
1.882IleTyr: 1.882 ± 0.023
0.0IleXaa: 0.0 ± 0.0
Lys
3.866LysAla: 3.866 ± 0.045
0.676LysCys: 0.676 ± 0.016
3.645LysAsp: 3.645 ± 0.041
4.546LysGlu: 4.546 ± 0.061
2.869LysPhe: 2.869 ± 0.037
2.826LysGly: 2.826 ± 0.04
1.41LysHis: 1.41 ± 0.023
3.672LysIle: 3.672 ± 0.04
5.39LysLys: 5.39 ± 0.073
7.098LysLeu: 7.098 ± 0.061
1.25LysMet: 1.25 ± 0.021
3.184LysAsn: 3.184 ± 0.04
2.785LysPro: 2.785 ± 0.038
2.471LysGln: 2.471 ± 0.034
3.28LysArg: 3.28 ± 0.041
4.95LysSer: 4.95 ± 0.054
3.394LysThr: 3.394 ± 0.038
4.223LysVal: 4.223 ± 0.037
0.721LysTrp: 0.721 ± 0.017
2.438LysTyr: 2.438 ± 0.033
0.0LysXaa: 0.0 ± 0.0
Leu
7.211LeuAla: 7.211 ± 0.055
1.224LeuCys: 1.224 ± 0.025
5.86LeuAsp: 5.86 ± 0.049
6.44LeuGlu: 6.44 ± 0.067
4.191LeuPhe: 4.191 ± 0.052
5.403LeuGly: 5.403 ± 0.044
2.179LeuHis: 2.179 ± 0.029
5.424LeuIle: 5.424 ± 0.06
6.73LeuLys: 6.73 ± 0.06
10.245LeuLeu: 10.245 ± 0.093
2.1LeuMet: 2.1 ± 0.027
5.098LeuAsn: 5.098 ± 0.048
4.889LeuPro: 4.889 ± 0.049
4.067LeuGln: 4.067 ± 0.048
4.935LeuArg: 4.935 ± 0.05
8.6LeuSer: 8.6 ± 0.089
5.74LeuThr: 5.74 ± 0.049
6.558LeuVal: 6.558 ± 0.056
0.982LeuTrp: 0.982 ± 0.02
3.032LeuTyr: 3.032 ± 0.031
0.0LeuXaa: 0.0 ± 0.0
Met
1.618MetAla: 1.618 ± 0.026
0.257MetCys: 0.257 ± 0.01
1.29MetAsp: 1.29 ± 0.023
1.264MetGlu: 1.264 ± 0.023
0.913MetPhe: 0.913 ± 0.021
1.193MetGly: 1.193 ± 0.023
0.354MetHis: 0.354 ± 0.012
1.077MetIle: 1.077 ± 0.022
1.329MetLys: 1.329 ± 0.022
1.955MetLeu: 1.955 ± 0.025
0.502MetMet: 0.502 ± 0.015
1.082MetAsn: 1.082 ± 0.026
0.908MetPro: 0.908 ± 0.021
0.62MetGln: 0.62 ± 0.016
0.896MetArg: 0.896 ± 0.02
2.012MetSer: 2.012 ± 0.029
1.121MetThr: 1.121 ± 0.021
1.325MetVal: 1.325 ± 0.021
0.197MetTrp: 0.197 ± 0.009
0.636MetTyr: 0.636 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.106AsnAla: 3.106 ± 0.04
0.543AsnCys: 0.543 ± 0.014
2.982AsnAsp: 2.982 ± 0.035
3.256AsnGlu: 3.256 ± 0.038
2.356AsnPhe: 2.356 ± 0.028
3.14AsnGly: 3.14 ± 0.045
1.123AsnHis: 1.123 ± 0.023
2.894AsnIle: 2.894 ± 0.034
2.825AsnLys: 2.825 ± 0.031
5.174AsnLeu: 5.174 ± 0.052
1.097AsnMet: 1.097 ± 0.025
2.526AsnAsn: 2.526 ± 0.044
2.296AsnPro: 2.296 ± 0.033
1.849AsnGln: 1.849 ± 0.034
1.943AsnArg: 1.943 ± 0.032
4.408AsnSer: 4.408 ± 0.073
2.747AsnThr: 2.747 ± 0.065
3.184AsnVal: 3.184 ± 0.033
0.574AsnTrp: 0.574 ± 0.015
1.928AsnTyr: 1.928 ± 0.031
0.0AsnXaa: 0.0 ± 0.0
Pro
2.823ProAla: 2.823 ± 0.041
0.345ProCys: 0.345 ± 0.012
2.372ProAsp: 2.372 ± 0.032
3.352ProGlu: 3.352 ± 0.054
1.874ProPhe: 1.874 ± 0.027
2.372ProGly: 2.372 ± 0.047
1.017ProHis: 1.017 ± 0.02
2.513ProIle: 2.513 ± 0.032
2.872ProLys: 2.872 ± 0.038
4.404ProLeu: 4.404 ± 0.048
0.833ProMet: 0.833 ± 0.019
2.189ProAsn: 2.189 ± 0.033
2.711ProPro: 2.711 ± 0.066
2.09ProGln: 2.09 ± 0.037
1.871ProArg: 1.871 ± 0.03
3.942ProSer: 3.942 ± 0.05
2.85ProThr: 2.85 ± 0.037
3.173ProVal: 3.173 ± 0.042
0.449ProTrp: 0.449 ± 0.019
1.413ProTyr: 1.413 ± 0.026
0.0ProXaa: 0.0 ± 0.0
Gln
2.287GlnAla: 2.287 ± 0.039
0.382GlnCys: 0.382 ± 0.012
1.841GlnAsp: 1.841 ± 0.026
2.365GlnGlu: 2.365 ± 0.037
1.731GlnPhe: 1.731 ± 0.029
1.793GlnGly: 1.793 ± 0.036
0.872GlnHis: 0.872 ± 0.021
2.195GlnIle: 2.195 ± 0.028
2.517GlnLys: 2.517 ± 0.035
4.202GlnLeu: 4.202 ± 0.044
0.907GlnMet: 0.907 ± 0.021
1.975GlnAsn: 1.975 ± 0.031
1.693GlnPro: 1.693 ± 0.031
2.391GlnGln: 2.391 ± 0.087
1.763GlnArg: 1.763 ± 0.027
2.8GlnSer: 2.8 ± 0.046
1.996GlnThr: 1.996 ± 0.029
2.35GlnVal: 2.35 ± 0.034
0.409GlnTrp: 0.409 ± 0.011
1.37GlnTyr: 1.37 ± 0.03
0.0GlnXaa: 0.0 ± 0.0
Arg
2.853ArgAla: 2.853 ± 0.03
0.486ArgCys: 0.486 ± 0.014
2.399ArgAsp: 2.399 ± 0.034
2.764ArgGlu: 2.764 ± 0.039
1.948ArgPhe: 1.948 ± 0.027
2.195ArgGly: 2.195 ± 0.034
1.007ArgHis: 1.007 ± 0.018
2.587ArgIle: 2.587 ± 0.031
3.437ArgLys: 3.437 ± 0.045
4.547ArgLeu: 4.547 ± 0.049
0.995ArgMet: 0.995 ± 0.017
2.272ArgAsn: 2.272 ± 0.027
1.817ArgPro: 1.817 ± 0.03
1.763ArgGln: 1.763 ± 0.027
2.728ArgArg: 2.728 ± 0.037
3.456ArgSer: 3.456 ± 0.052
2.295ArgThr: 2.295 ± 0.032
2.607ArgVal: 2.607 ± 0.035
0.472ArgTrp: 0.472 ± 0.014
1.513ArgTyr: 1.513 ± 0.025
0.0ArgXaa: 0.0 ± 0.0
Ser
5.459SerAla: 5.459 ± 0.095
0.777SerCys: 0.777 ± 0.02
5.047SerAsp: 5.047 ± 0.127
5.451SerGlu: 5.451 ± 0.156
3.593SerPhe: 3.593 ± 0.052
5.4SerGly: 5.4 ± 0.127
1.851SerHis: 1.851 ± 0.028
4.93SerIle: 4.93 ± 0.062
5.43SerLys: 5.43 ± 0.054
8.442SerLeu: 8.442 ± 0.083
1.693SerMet: 1.693 ± 0.027
4.219SerAsn: 4.219 ± 0.052
3.931SerPro: 3.931 ± 0.054
3.265SerGln: 3.265 ± 0.047
3.768SerArg: 3.768 ± 0.042
9.354SerSer: 9.354 ± 0.223
5.828SerThr: 5.828 ± 0.116
5.302SerVal: 5.302 ± 0.058
0.822SerTrp: 0.822 ± 0.019
2.513SerTyr: 2.513 ± 0.038
0.0SerXaa: 0.0 ± 0.0
Thr
3.575ThrAla: 3.575 ± 0.055
0.639ThrCys: 0.639 ± 0.017
3.051ThrAsp: 3.051 ± 0.071
3.386ThrGlu: 3.386 ± 0.073
2.567ThrPhe: 2.567 ± 0.034
3.454ThrGly: 3.454 ± 0.065
1.142ThrHis: 1.142 ± 0.025
3.387ThrIle: 3.387 ± 0.037
3.576ThrLys: 3.576 ± 0.034
5.502ThrLeu: 5.502 ± 0.047
1.077ThrMet: 1.077 ± 0.019
3.056ThrAsn: 3.056 ± 0.069
3.137ThrPro: 3.137 ± 0.057
1.946ThrGln: 1.946 ± 0.032
2.314ThrArg: 2.314 ± 0.026
5.906ThrSer: 5.906 ± 0.147
4.873ThrThr: 4.873 ± 0.328
3.806ThrVal: 3.806 ± 0.076
0.682ThrTrp: 0.682 ± 0.043
1.861ThrTyr: 1.861 ± 0.04
0.0ThrXaa: 0.0 ± 0.0
Val
4.568ValAla: 4.568 ± 0.044
0.82ValCys: 0.82 ± 0.019
4.143ValAsp: 4.143 ± 0.06
4.496ValGlu: 4.496 ± 0.06
2.887ValPhe: 2.887 ± 0.034
3.604ValGly: 3.604 ± 0.046
1.329ValHis: 1.329 ± 0.021
3.655ValIle: 3.655 ± 0.049
3.972ValLys: 3.972 ± 0.039
6.387ValLeu: 6.387 ± 0.056
1.248ValMet: 1.248 ± 0.021
3.078ValAsn: 3.078 ± 0.037
3.161ValPro: 3.161 ± 0.035
2.138ValGln: 2.138 ± 0.028
2.705ValArg: 2.705 ± 0.04
5.515ValSer: 5.515 ± 0.067
3.93ValThr: 3.93 ± 0.087
4.875ValVal: 4.875 ± 0.056
0.666ValTrp: 0.666 ± 0.014
2.068ValTyr: 2.068 ± 0.029
0.0ValXaa: 0.0 ± 0.0
Trp
0.616TrpAla: 0.616 ± 0.016
0.202TrpCys: 0.202 ± 0.008
0.646TrpAsp: 0.646 ± 0.018
0.596TrpGlu: 0.596 ± 0.015
0.508TrpPhe: 0.508 ± 0.015
0.604TrpGly: 0.604 ± 0.018
0.216TrpHis: 0.216 ± 0.008
0.636TrpIle: 0.636 ± 0.016
0.776TrpLys: 0.776 ± 0.016
1.073TrpLeu: 1.073 ± 0.022
0.236TrpMet: 0.236 ± 0.009
0.609TrpAsn: 0.609 ± 0.016
0.329TrpPro: 0.329 ± 0.012
0.349TrpGln: 0.349 ± 0.011
0.539TrpArg: 0.539 ± 0.016
0.774TrpSer: 0.774 ± 0.019
0.717TrpThr: 0.717 ± 0.041
0.636TrpVal: 0.636 ± 0.016
0.176TrpTrp: 0.176 ± 0.009
0.404TrpTyr: 0.404 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.882TyrAla: 1.882 ± 0.027
0.467TyrCys: 0.467 ± 0.011
2.075TyrAsp: 2.075 ± 0.029
2.023TyrGlu: 2.023 ± 0.026
1.689TyrPhe: 1.689 ± 0.028
2.088TyrGly: 2.088 ± 0.03
0.778TyrHis: 0.778 ± 0.018
1.853TyrIle: 1.853 ± 0.03
1.976TyrLys: 1.976 ± 0.027
3.659TyrLeu: 3.659 ± 0.039
0.71TyrMet: 0.71 ± 0.017
1.786TyrAsn: 1.786 ± 0.03
1.384TyrPro: 1.384 ± 0.025
1.311TyrGln: 1.311 ± 0.025
1.383TyrArg: 1.383 ± 0.021
2.633TyrSer: 2.633 ± 0.036
1.912TyrThr: 1.912 ± 0.051
2.168TyrVal: 2.168 ± 0.042
0.408TyrTrp: 0.408 ± 0.014
1.382TyrTyr: 1.382 ± 0.022
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5921 proteins (2935372 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski