Amino acid dipeptide frequency for [Candida] arabinofermentans NRRL YB-2248

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
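As a minimal sketch of how such a frequency could be computed (not necessarily how Proteome-pI computes it), the Python snippet below counts overlapping residue pairs across a list of protein sequences and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical; the sketch assumes one-letter amino acid codes and that pairs never span two proteins.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never cross protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MKLLS", "LLA"])
```

Here "LL" occurs twice among six pairs in total, so its frequency is one third of all dipeptides.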

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
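The colouring rule can be sketched as a comparison against this uniform expectation. The snippet below uses the 400-dipeptide baseline from the text (0.25%, which is 2.5 in the per mille units used by the table); the function name `classify` and the plain greater-than/less-than comparison with no tolerance are assumptions, since the exact threshold used for the table is not stated.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"      # would be marked red
    if freq_per_mille < expected:
        return "under"     # would be marked blue
    return "expected"


# Three values copied from the table (per mille).
examples = {"SerSer": 11.65, "LeuLeu": 9.261, "TrpTrp": 0.143}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this rule SerSer and LeuLeu come out as overrepresented and TrpTrp as underrepresented.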

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.911 ± 0.063
AlaCys: 0.655 ± 0.017
AlaAsp: 2.661 ± 0.036
AlaGlu: 3.076 ± 0.047
AlaPhe: 2.082 ± 0.033
AlaGly: 2.887 ± 0.046
AlaHis: 0.895 ± 0.021
AlaIle: 3.632 ± 0.043
AlaLys: 3.744 ± 0.047
AlaLeu: 4.841 ± 0.052
AlaMet: 1.141 ± 0.022
AlaAsn: 2.722 ± 0.034
AlaPro: 2.209 ± 0.044
AlaGln: 1.844 ± 0.035
AlaArg: 2.031 ± 0.03
AlaSer: 4.846 ± 0.057
AlaThr: 3.623 ± 0.044
AlaVal: 2.978 ± 0.039
AlaTrp: 0.448 ± 0.013
AlaTyr: 1.621 ± 0.025
AlaXaa: 0.003 ± 0.001
Cys
CysAla: 0.572 ± 0.014
CysCys: 0.311 ± 0.012
CysAsp: 0.676 ± 0.018
CysGlu: 0.611 ± 0.017
CysPhe: 0.704 ± 0.016
CysGly: 0.806 ± 0.02
CysHis: 0.279 ± 0.011
CysIle: 0.925 ± 0.023
CysLys: 0.71 ± 0.015
CysLeu: 1.27 ± 0.024
CysMet: 0.259 ± 0.011
CysAsn: 0.604 ± 0.016
CysPro: 0.455 ± 0.015
CysGln: 0.372 ± 0.013
CysArg: 0.409 ± 0.014
CysSer: 1.044 ± 0.022
CysThr: 0.579 ± 0.016
CysVal: 0.676 ± 0.016
CysTrp: 0.153 ± 0.007
CysTyr: 0.515 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.863 ± 0.039
AspCys: 0.685 ± 0.016
AspAsp: 6.304 ± 0.1
AspGlu: 5.546 ± 0.062
AspPhe: 2.802 ± 0.03
AspGly: 3.051 ± 0.042
AspHis: 1.118 ± 0.019
AspIle: 4.062 ± 0.039
AspLys: 3.654 ± 0.039
AspLeu: 6.207 ± 0.056
AspMet: 1.161 ± 0.02
AspAsn: 3.199 ± 0.041
AspPro: 2.479 ± 0.033
AspGln: 2.089 ± 0.03
AspArg: 1.936 ± 0.031
AspSer: 5.444 ± 0.063
AspThr: 2.934 ± 0.033
AspVal: 3.433 ± 0.039
AspTrp: 0.613 ± 0.014
AspTyr: 2.431 ± 0.033
AspXaa: 0.001 ± 0.001
Glu
GluAla: 3.242 ± 0.044
GluCys: 0.628 ± 0.017
GluAsp: 4.356 ± 0.056
GluGlu: 5.848 ± 0.075
GluPhe: 3.02 ± 0.038
GluGly: 2.599 ± 0.037
GluHis: 1.071 ± 0.022
GluIle: 4.619 ± 0.045
GluLys: 4.963 ± 0.054
GluLeu: 7.096 ± 0.076
GluMet: 1.458 ± 0.022
GluAsn: 3.83 ± 0.049
GluPro: 1.963 ± 0.038
GluGln: 2.491 ± 0.035
GluArg: 2.581 ± 0.035
GluSer: 5.555 ± 0.055
GluThr: 3.595 ± 0.036
GluVal: 3.406 ± 0.039
GluTrp: 0.617 ± 0.016
GluTyr: 2.257 ± 0.031
GluXaa: 0.001 ± 0.001
Phe
PheAla: 2.231 ± 0.032
PheCys: 0.518 ± 0.015
PheAsp: 2.98 ± 0.034
PheGlu: 3.152 ± 0.04
PhePhe: 1.93 ± 0.032
PheGly: 2.786 ± 0.048
PheHis: 0.846 ± 0.018
PheIle: 3.092 ± 0.046
PheLys: 3.594 ± 0.04
PheLeu: 3.816 ± 0.044
PheMet: 0.952 ± 0.02
PheAsn: 2.909 ± 0.036
PhePro: 1.59 ± 0.023
PheGln: 1.761 ± 0.027
PheArg: 1.458 ± 0.029
PheSer: 3.453 ± 0.033
PheThr: 2.503 ± 0.033
PheVal: 2.489 ± 0.034
PheTrp: 0.492 ± 0.016
PheTyr: 1.555 ± 0.027
PheXaa: 0.001 ± 0.0
Gly
GlyAla: 2.806 ± 0.045
GlyCys: 0.783 ± 0.018
GlyAsp: 3.095 ± 0.041
GlyGlu: 2.888 ± 0.038
GlyPhe: 2.617 ± 0.037
GlyGly: 3.642 ± 0.076
GlyHis: 0.991 ± 0.021
GlyIle: 3.405 ± 0.041
GlyLys: 3.361 ± 0.04
GlyLeu: 5.096 ± 0.057
GlyMet: 1.08 ± 0.023
GlyAsn: 2.644 ± 0.035
GlyPro: 1.475 ± 0.025
GlyGln: 1.494 ± 0.028
GlyArg: 1.923 ± 0.03
GlySer: 4.955 ± 0.055
GlyThr: 2.901 ± 0.042
GlyVal: 3.101 ± 0.043
GlyTrp: 0.616 ± 0.017
GlyTyr: 2.067 ± 0.03
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.859 ± 0.017
HisCys: 0.27 ± 0.011
HisAsp: 1.124 ± 0.023
HisGlu: 1.145 ± 0.024
HisPhe: 0.866 ± 0.016
HisGly: 0.973 ± 0.021
HisHis: 0.768 ± 0.025
HisIle: 1.236 ± 0.022
HisLys: 1.169 ± 0.023
HisLeu: 1.975 ± 0.031
HisMet: 0.376 ± 0.011
HisAsn: 1.001 ± 0.017
HisPro: 1.023 ± 0.021
HisGln: 1.088 ± 0.025
HisArg: 0.841 ± 0.019
HisSer: 1.736 ± 0.029
HisThr: 0.971 ± 0.019
HisVal: 0.956 ± 0.02
HisTrp: 0.197 ± 0.009
HisTyr: 0.758 ± 0.015
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.584 ± 0.041
IleCys: 0.895 ± 0.02
IleAsp: 4.652 ± 0.047
IleGlu: 4.718 ± 0.049
IlePhe: 2.823 ± 0.04
IleGly: 3.548 ± 0.044
IleHis: 1.343 ± 0.023
IleIle: 4.611 ± 0.05
IleLys: 4.907 ± 0.055
IleLeu: 6.406 ± 0.056
IleMet: 1.384 ± 0.022
IleAsn: 4.189 ± 0.047
IlePro: 3.477 ± 0.037
IleGln: 2.645 ± 0.032
IleArg: 2.572 ± 0.033
IleSer: 6.199 ± 0.051
IleThr: 4.07 ± 0.042
IleVal: 3.965 ± 0.044
IleTrp: 0.704 ± 0.018
IleTyr: 2.389 ± 0.035
IleXaa: 0.001 ± 0.001
Lys
LysAla: 3.573 ± 0.043
LysCys: 0.727 ± 0.017
LysAsp: 4.054 ± 0.04
LysGlu: 4.816 ± 0.053
LysPhe: 3.273 ± 0.035
LysGly: 2.94 ± 0.038
LysHis: 1.339 ± 0.021
LysIle: 5.059 ± 0.05
LysLys: 6.382 ± 0.08
LysLeu: 8.205 ± 0.066
LysMet: 1.471 ± 0.025
LysAsn: 4.104 ± 0.043
LysPro: 3.019 ± 0.032
LysGln: 2.938 ± 0.036
LysArg: 3.489 ± 0.04
LysSer: 6.263 ± 0.059
LysThr: 3.913 ± 0.037
LysVal: 4.003 ± 0.043
LysTrp: 0.692 ± 0.015
LysTyr: 2.634 ± 0.034
LysXaa: 0.001 ± 0.001
Leu
LeuAla: 5.15 ± 0.055
LeuCys: 1.162 ± 0.023
LeuAsp: 5.587 ± 0.047
LeuGlu: 6.069 ± 0.061
LeuPhe: 4.131 ± 0.046
LeuGly: 4.548 ± 0.04
LeuHis: 1.779 ± 0.025
LeuIle: 7.302 ± 0.069
LeuLys: 8.377 ± 0.071
LeuLeu: 9.261 ± 0.076
LeuMet: 2.153 ± 0.028
LeuAsn: 6.75 ± 0.066
LeuPro: 4.411 ± 0.051
LeuGln: 4.035 ± 0.045
LeuArg: 3.94 ± 0.047
LeuSer: 8.878 ± 0.061
LeuThr: 5.629 ± 0.047
LeuVal: 5.05 ± 0.048
LeuTrp: 0.794 ± 0.018
LeuTyr: 3.069 ± 0.036
LeuXaa: 0.003 ± 0.001
Met
MetAla: 1.204 ± 0.024
MetCys: 0.257 ± 0.01
MetAsp: 1.242 ± 0.021
MetGlu: 1.179 ± 0.021
MetPhe: 0.933 ± 0.019
MetGly: 1.058 ± 0.024
MetHis: 0.315 ± 0.011
MetIle: 1.56 ± 0.027
MetLys: 1.582 ± 0.028
MetLeu: 1.872 ± 0.025
MetMet: 0.59 ± 0.017
MetAsn: 1.394 ± 0.022
MetPro: 0.748 ± 0.019
MetGln: 0.639 ± 0.016
MetArg: 0.8 ± 0.016
MetSer: 2.151 ± 0.034
MetThr: 1.197 ± 0.021
MetVal: 1.138 ± 0.023
MetTrp: 0.17 ± 0.008
MetTyr: 0.624 ± 0.015
MetXaa: 0.001 ± 0.0
Asn
AsnAla: 2.618 ± 0.033
AsnCys: 0.688 ± 0.015
AsnAsp: 4.245 ± 0.052
AsnGlu: 4.101 ± 0.049
AsnPhe: 2.54 ± 0.038
AsnGly: 3.443 ± 0.04
AsnHis: 1.233 ± 0.024
AsnIle: 3.705 ± 0.04
AsnLys: 4.074 ± 0.042
AsnLeu: 5.852 ± 0.056
AsnMet: 1.082 ± 0.021
AsnAsn: 4.549 ± 0.078
AsnPro: 2.42 ± 0.033
AsnGln: 2.313 ± 0.034
AsnArg: 1.959 ± 0.027
AsnSer: 5.767 ± 0.066
AsnThr: 3.168 ± 0.037
AsnVal: 3.015 ± 0.033
AsnTrp: 0.619 ± 0.015
AsnTyr: 2.316 ± 0.033
AsnXaa: 0.001 ± 0.001
Pro
ProAla: 2.203 ± 0.037
ProCys: 0.324 ± 0.011
ProAsp: 2.177 ± 0.031
ProGlu: 2.753 ± 0.038
ProPhe: 1.823 ± 0.029
ProGly: 1.824 ± 0.036
ProHis: 0.775 ± 0.018
ProIle: 3.096 ± 0.032
ProLys: 2.798 ± 0.036
ProLeu: 3.857 ± 0.038
ProMet: 0.787 ± 0.019
ProAsn: 2.355 ± 0.032
ProPro: 2.477 ± 0.069
ProGln: 1.938 ± 0.045
ProArg: 1.455 ± 0.023
ProSer: 4.216 ± 0.051
ProThr: 2.983 ± 0.04
ProVal: 2.507 ± 0.038
ProTrp: 0.365 ± 0.013
ProTyr: 1.376 ± 0.025
ProXaa: 0.001 ± 0.001
Gln
GlnAla: 1.941 ± 0.032
GlnCys: 0.395 ± 0.015
GlnAsp: 1.974 ± 0.026
GlnGlu: 2.264 ± 0.031
GlnPhe: 1.874 ± 0.027
GlnGly: 1.594 ± 0.03
GlnHis: 1.049 ± 0.029
GlnIle: 2.573 ± 0.033
GlnLys: 2.464 ± 0.03
GlnLeu: 4.497 ± 0.042
GlnMet: 0.851 ± 0.02
GlnAsn: 1.999 ± 0.033
GlnPro: 1.909 ± 0.044
GlnGln: 5.09 ± 0.194
GlnArg: 1.666 ± 0.029
GlnSer: 3.443 ± 0.048
GlnThr: 2.109 ± 0.031
GlnVal: 1.991 ± 0.026
GlnTrp: 0.322 ± 0.012
GlnTyr: 1.379 ± 0.026
GlnXaa: 0.001 ± 0.001
Arg
ArgAla: 1.998 ± 0.031
ArgCys: 0.502 ± 0.014
ArgAsp: 2.091 ± 0.029
ArgGlu: 2.299 ± 0.031
ArgPhe: 1.899 ± 0.029
ArgGly: 1.922 ± 0.029
ArgHis: 0.807 ± 0.018
ArgIle: 2.511 ± 0.029
ArgLys: 3.053 ± 0.036
ArgLeu: 4.111 ± 0.042
ArgMet: 0.859 ± 0.021
ArgAsn: 2.06 ± 0.028
ArgPro: 1.424 ± 0.026
ArgGln: 1.522 ± 0.027
ArgArg: 2.234 ± 0.038
ArgSer: 3.512 ± 0.039
ArgThr: 2.026 ± 0.023
ArgVal: 2.053 ± 0.029
ArgTrp: 0.407 ± 0.013
ArgTyr: 1.435 ± 0.024
ArgXaa: 0.001 ± 0.001
Ser
SerAla: 4.48 ± 0.047
SerCys: 0.989 ± 0.021
SerAsp: 5.085 ± 0.051
SerGlu: 4.885 ± 0.054
SerPhe: 3.941 ± 0.037
SerGly: 4.53 ± 0.059
SerHis: 1.69 ± 0.026
SerIle: 6.905 ± 0.066
SerLys: 6.884 ± 0.062
SerLeu: 8.822 ± 0.067
SerMet: 1.855 ± 0.026
SerAsn: 6.215 ± 0.073
SerPro: 3.802 ± 0.049
SerGln: 3.477 ± 0.039
SerArg: 3.493 ± 0.04
SerSer: 11.65 ± 0.162
SerThr: 6.704 ± 0.073
SerVal: 4.604 ± 0.05
SerTrp: 0.82 ± 0.019
SerTyr: 2.906 ± 0.038
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 3.32 ± 0.044
ThrCys: 0.684 ± 0.017
ThrAsp: 3.07 ± 0.035
ThrGlu: 3.282 ± 0.037
ThrPhe: 2.371 ± 0.029
ThrGly: 3.277 ± 0.041
ThrHis: 1.063 ± 0.02
ThrIle: 4.17 ± 0.04
ThrLys: 4.105 ± 0.047
ThrLeu: 5.286 ± 0.042
ThrMet: 1.096 ± 0.02
ThrAsn: 3.515 ± 0.04
ThrPro: 3.171 ± 0.039
ThrGln: 2.001 ± 0.028
ThrArg: 2.181 ± 0.03
ThrSer: 6.102 ± 0.069
ThrThr: 5.044 ± 0.085
ThrVal: 3.284 ± 0.042
ThrTrp: 0.506 ± 0.015
ThrTyr: 1.855 ± 0.027
ThrXaa: 0.002 ± 0.001
Val
ValAla: 3.14 ± 0.041
ValCys: 0.683 ± 0.016
ValAsp: 3.724 ± 0.042
ValGlu: 3.826 ± 0.052
ValPhe: 2.299 ± 0.033
ValGly: 3.023 ± 0.04
ValHis: 0.984 ± 0.019
ValIle: 3.666 ± 0.039
ValLys: 3.875 ± 0.039
ValLeu: 5.087 ± 0.051
ValMet: 1.145 ± 0.021
ValAsn: 2.926 ± 0.035
ValPro: 2.395 ± 0.034
ValGln: 1.842 ± 0.026
ValArg: 1.996 ± 0.029
ValSer: 4.802 ± 0.045
ValThr: 3.107 ± 0.042
ValVal: 3.418 ± 0.04
ValTrp: 0.513 ± 0.015
ValTyr: 1.845 ± 0.027
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 0.464 ± 0.012
TrpCys: 0.245 ± 0.009
TrpAsp: 0.606 ± 0.016
TrpGlu: 0.559 ± 0.015
TrpPhe: 0.534 ± 0.015
TrpGly: 0.53 ± 0.015
TrpHis: 0.169 ± 0.008
TrpIle: 0.684 ± 0.018
TrpLys: 0.78 ± 0.017
TrpLeu: 0.946 ± 0.019
TrpMet: 0.237 ± 0.012
TrpAsn: 0.583 ± 0.015
TrpPro: 0.262 ± 0.009
TrpGln: 0.286 ± 0.01
TrpArg: 0.441 ± 0.013
TrpSer: 0.782 ± 0.019
TrpThr: 0.502 ± 0.015
TrpVal: 0.498 ± 0.014
TrpTrp: 0.143 ± 0.007
TrpTyr: 0.352 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.69 ± 0.031
TyrCys: 0.529 ± 0.014
TyrAsp: 2.305 ± 0.031
TyrGlu: 2.178 ± 0.031
TyrPhe: 1.646 ± 0.03
TyrGly: 1.96 ± 0.03
TyrHis: 0.819 ± 0.017
TyrIle: 2.264 ± 0.029
TyrLys: 2.416 ± 0.035
TyrLeu: 3.568 ± 0.044
TyrMet: 0.714 ± 0.016
TyrAsn: 2.162 ± 0.027
TyrPro: 1.383 ± 0.026
TyrGln: 1.519 ± 0.026
TyrArg: 1.321 ± 0.025
TyrSer: 2.938 ± 0.029
TyrThr: 1.825 ± 0.028
TyrVal: 1.733 ± 0.027
TyrTrp: 0.409 ± 0.014
TyrTyr: 1.52 ± 0.027
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.002 ± 0.001
XaaGlu: 0.002 ± 0.001
XaaPhe: 0.001 ± 0.001
XaaGly: 0.001 ± 0.001
XaaHis: 0.001 ± 0.001
XaaIle: 0.001 ± 0.001
XaaLys: 0.001 ± 0.001
XaaLeu: 0.003 ± 0.001
XaaMet: 0.001 ± 0.001
XaaAsn: 0.001 ± 0.001
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.001
XaaSer: 0.002 ± 0.001
XaaThr: 0.001 ± 0.001
XaaVal: 0.001 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.064 ± 0.02
Statistics based on 5827 proteins (2735454 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
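A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is taken as the error. Using the standard deviation as the error measure, the fixed random seed, the one-letter sequences and the function names `pair_frequency` and `bootstrap_error` are all assumptions made for illustration; the source only states that 100 bootstrap rounds at the protein level were used.

```python
import random
import statistics
from collections import Counter


def pair_frequency(sequences, pair):
    """Frequency of one dipeptide, in per mille, over all sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not taken from this proteome).
proteins = ["MKLLSA", "LLAKK", "MSSSLL", "AKLLM"]
error = bootstrap_error(proteins, "LL")
```

Resampling at the protein level rather than the residue level keeps the dipeptides of one protein together, so proteins with unusual composition contribute to the estimated error as a unit.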

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski