Amino acid dipeptide frequency for Golovinomyces cichoracearum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, or 2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
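The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the pipeline used to build this page: it assumes one-letter sequences, counts overlapping adjacent pairs within each protein, maps any non-standard letter to X (Xaa), and compares each frequency with the uniform expectation of 2.5‰. The function names and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids
UNIFORM_PER_MILLE = 1000.0 / 400        # 2.5 per mille = 0.25%


def dipeptide_counts(sequences):
    """Count overlapping adjacent residue pairs; pairs never span two proteins."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters is treated as Xaa.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    return counts


def per_mille(counts):
    """Convert raw pair counts to frequencies in per mille."""
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def label(freq):
    """Classify a per-mille frequency against the uniform expectation."""
    if freq > UNIFORM_PER_MILLE:
        return "over"
    if freq < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Toy input (not real proteome data): 8 dipeptides in total, 2 of them "SS".
toy = ["MKLLSS", "SSLA"]
freqs = per_mille(dipeptide_counts(toy))
print(freqs["SS"], label(freqs["SS"]))
```

With values from the table, `label(5.102)` (AlaAla) gives "over" and `label(0.915)` (AlaCys) gives "under", matching the red and blue marking described above.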

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
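As a quick check of the units, the following sketch converts a per-mille value to a plain fraction and to a percentage. The helper names are made up for illustration; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Per mille -> fraction: multiply by 10^-3."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Per mille -> percent: divide by 10."""
    return value / 10.0


ala_ala = 5.102  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))  # about 0.005102
print(per_mille_to_percent(ala_ala))   # about 0.51 percent

# The uniform expectation of 0.25% corresponds to 2.5 per mille.
print(per_mille_to_percent(2.5))
```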

For more information, see the sequence space article on Wikipedia.

Rows: first residue. Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.102 ± 0.056
AlaCys: 0.915 ± 0.016
AlaAsp: 2.976 ± 0.032
AlaGlu: 4.039 ± 0.047
AlaPhe: 2.457 ± 0.035
AlaGly: 3.722 ± 0.042
AlaHis: 1.404 ± 0.021
AlaIle: 4.126 ± 0.043
AlaLys: 3.858 ± 0.037
AlaLeu: 6.223 ± 0.048
AlaMet: 1.362 ± 0.022
AlaAsn: 2.664 ± 0.032
AlaPro: 2.914 ± 0.035
AlaGln: 2.706 ± 0.033
AlaArg: 3.732 ± 0.037
AlaSer: 5.723 ± 0.052
AlaThr: 3.861 ± 0.033
AlaVal: 3.711 ± 0.038
AlaTrp: 0.752 ± 0.016
AlaTyr: 1.709 ± 0.024
AlaXaa: 0.001 ± 0.0
Cys
CysAla: 0.788 ± 0.018
CysCys: 0.232 ± 0.01
CysAsp: 0.715 ± 0.016
CysGlu: 0.756 ± 0.014
CysPhe: 0.577 ± 0.014
CysGly: 0.882 ± 0.017
CysHis: 0.381 ± 0.012
CysIle: 0.834 ± 0.017
CysLys: 0.72 ± 0.017
CysLeu: 1.35 ± 0.024
CysMet: 0.266 ± 0.01
CysAsn: 0.585 ± 0.014
CysPro: 0.626 ± 0.013
CysGln: 0.549 ± 0.013
CysArg: 0.747 ± 0.016
CysSer: 1.188 ± 0.022
CysThr: 0.711 ± 0.015
CysVal: 0.711 ± 0.017
CysTrp: 0.173 ± 0.008
CysTyr: 0.373 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.18 ± 0.036
AspCys: 0.703 ± 0.018
AspAsp: 3.702 ± 0.055
AspGlu: 4.624 ± 0.05
AspPhe: 2.429 ± 0.027
AspGly: 3.108 ± 0.036
AspHis: 1.204 ± 0.02
AspIle: 3.707 ± 0.04
AspLys: 2.872 ± 0.035
AspLeu: 5.274 ± 0.04
AspMet: 1.111 ± 0.019
AspAsn: 2.368 ± 0.028
AspPro: 2.702 ± 0.029
AspGln: 2.107 ± 0.028
AspArg: 2.755 ± 0.032
AspSer: 4.761 ± 0.05
AspThr: 2.749 ± 0.034
AspVal: 3.132 ± 0.034
AspTrp: 0.693 ± 0.017
AspTyr: 1.613 ± 0.023
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.382 ± 0.044
GluCys: 0.714 ± 0.015
GluAsp: 4.074 ± 0.05
GluGlu: 5.58 ± 0.061
GluPhe: 2.32 ± 0.025
GluGly: 3.02 ± 0.03
GluHis: 1.235 ± 0.02
GluIle: 4.908 ± 0.053
GluLys: 5.276 ± 0.051
GluLeu: 5.532 ± 0.05
GluMet: 1.465 ± 0.023
GluAsn: 4.128 ± 0.041
GluPro: 2.252 ± 0.032
GluGln: 2.349 ± 0.033
GluArg: 3.797 ± 0.047
GluSer: 5.342 ± 0.051
GluThr: 3.687 ± 0.04
GluVal: 3.392 ± 0.035
GluTrp: 0.826 ± 0.018
GluTyr: 1.839 ± 0.029
GluXaa: 0.001 ± 0.001
Phe
PheAla: 2.457 ± 0.033
PheCys: 0.622 ± 0.014
PheAsp: 2.295 ± 0.028
PheGlu: 2.519 ± 0.033
PhePhe: 1.685 ± 0.029
PheGly: 2.497 ± 0.037
PheHis: 0.912 ± 0.017
PheIle: 2.279 ± 0.036
PheLys: 2.09 ± 0.028
PheLeu: 3.74 ± 0.042
PheMet: 0.785 ± 0.016
PheAsn: 1.847 ± 0.026
PhePro: 1.879 ± 0.029
PheGln: 1.547 ± 0.024
PheArg: 2.015 ± 0.027
PheSer: 3.781 ± 0.035
PheThr: 2.183 ± 0.027
PheVal: 2.131 ± 0.03
PheTrp: 0.539 ± 0.016
PheTyr: 1.173 ± 0.023
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.321 ± 0.036
GlyCys: 0.753 ± 0.016
GlyAsp: 2.811 ± 0.035
GlyGlu: 3.066 ± 0.038
GlyPhe: 2.426 ± 0.032
GlyGly: 3.78 ± 0.055
GlyHis: 1.288 ± 0.023
GlyIle: 3.793 ± 0.035
GlyLys: 3.586 ± 0.034
GlyLeu: 5.074 ± 0.044
GlyMet: 1.201 ± 0.023
GlyAsn: 2.716 ± 0.031
GlyPro: 2.282 ± 0.029
GlyGln: 1.987 ± 0.03
GlyArg: 3.21 ± 0.036
GlySer: 4.83 ± 0.037
GlyThr: 3.204 ± 0.034
GlyVal: 3.235 ± 0.033
GlyTrp: 0.822 ± 0.017
GlyTyr: 1.783 ± 0.031
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.298 ± 0.022
HisCys: 0.317 ± 0.012
HisAsp: 1.233 ± 0.021
HisGlu: 1.454 ± 0.023
HisPhe: 0.913 ± 0.017
HisGly: 1.35 ± 0.021
HisHis: 0.706 ± 0.021
HisIle: 1.414 ± 0.022
HisLys: 1.177 ± 0.02
HisLeu: 2.224 ± 0.029
HisMet: 0.424 ± 0.013
HisAsn: 1.076 ± 0.019
HisPro: 1.361 ± 0.023
HisGln: 1.107 ± 0.021
HisArg: 1.368 ± 0.025
HisSer: 2.024 ± 0.028
HisThr: 1.201 ± 0.019
HisVal: 1.214 ± 0.019
HisTrp: 0.258 ± 0.009
HisTyr: 0.674 ± 0.015
HisXaa: 0.001 ± 0.0
Ile
IleAla: 4.046 ± 0.04
IleCys: 1.014 ± 0.019
IleAsp: 3.755 ± 0.038
IleGlu: 4.181 ± 0.039
IlePhe: 2.775 ± 0.034
IleGly: 3.408 ± 0.041
IleHis: 1.453 ± 0.022
IleIle: 3.943 ± 0.037
IleLys: 3.848 ± 0.036
IleLeu: 6.21 ± 0.049
IleMet: 1.227 ± 0.018
IleAsn: 3.102 ± 0.035
IlePro: 3.466 ± 0.034
IleGln: 2.515 ± 0.029
IleArg: 3.406 ± 0.037
IleSer: 6.487 ± 0.052
IleThr: 3.514 ± 0.036
IleVal: 3.35 ± 0.034
IleTrp: 0.789 ± 0.017
IleTyr: 1.878 ± 0.025
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.978 ± 0.042
LysCys: 0.724 ± 0.018
LysAsp: 3.477 ± 0.035
LysGlu: 4.363 ± 0.048
LysPhe: 2.31 ± 0.027
LysGly: 2.939 ± 0.036
LysHis: 1.228 ± 0.02
LysIle: 4.566 ± 0.04
LysLys: 5.269 ± 0.062
LysLeu: 5.456 ± 0.044
LysMet: 1.334 ± 0.02
LysAsn: 3.8 ± 0.047
LysPro: 2.79 ± 0.035
LysGln: 2.182 ± 0.023
LysArg: 3.945 ± 0.042
LysSer: 5.88 ± 0.062
LysThr: 3.733 ± 0.041
LysVal: 3.184 ± 0.028
LysTrp: 0.728 ± 0.016
LysTyr: 1.909 ± 0.028
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.342 ± 0.054
LeuCys: 1.244 ± 0.021
LeuAsp: 5.158 ± 0.048
LeuGlu: 6.13 ± 0.052
LeuPhe: 3.377 ± 0.039
LeuGly: 4.961 ± 0.042
LeuHis: 2.149 ± 0.023
LeuIle: 5.342 ± 0.048
LeuLys: 5.922 ± 0.048
LeuLeu: 8.301 ± 0.087
LeuMet: 1.797 ± 0.028
LeuAsn: 4.514 ± 0.04
LeuPro: 5.023 ± 0.041
LeuGln: 3.927 ± 0.043
LeuArg: 5.626 ± 0.05
LeuSer: 8.519 ± 0.063
LeuThr: 5.041 ± 0.047
LeuVal: 4.847 ± 0.048
LeuTrp: 1.035 ± 0.023
LeuTyr: 2.379 ± 0.034
LeuXaa: 0.002 ± 0.001
Met
MetAla: 1.701 ± 0.022
MetCys: 0.238 ± 0.01
MetAsp: 1.083 ± 0.019
MetGlu: 1.289 ± 0.022
MetPhe: 0.659 ± 0.016
MetGly: 1.197 ± 0.023
MetHis: 0.384 ± 0.012
MetIle: 1.247 ± 0.021
MetLys: 1.348 ± 0.022
MetLeu: 1.682 ± 0.027
MetMet: 0.542 ± 0.014
MetAsn: 1.058 ± 0.02
MetPro: 0.976 ± 0.017
MetGln: 0.747 ± 0.018
MetArg: 1.152 ± 0.02
MetSer: 1.915 ± 0.024
MetThr: 1.295 ± 0.02
MetVal: 1.071 ± 0.021
MetTrp: 0.211 ± 0.009
MetTyr: 0.486 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.736 ± 0.032
AsnCys: 0.696 ± 0.013
AsnAsp: 2.655 ± 0.027
AsnGlu: 3.263 ± 0.035
AsnPhe: 2.296 ± 0.033
AsnGly: 2.798 ± 0.033
AsnHis: 1.229 ± 0.021
AsnIle: 3.407 ± 0.035
AsnLys: 2.82 ± 0.031
AsnLeu: 4.987 ± 0.044
AsnMet: 1.019 ± 0.018
AsnAsn: 2.642 ± 0.036
AsnPro: 2.749 ± 0.035
AsnGln: 2.133 ± 0.032
AsnArg: 2.537 ± 0.035
AsnSer: 5.24 ± 0.051
AsnThr: 2.866 ± 0.034
AsnVal: 2.608 ± 0.03
AsnTrp: 0.591 ± 0.014
AsnTyr: 1.557 ± 0.026
AsnXaa: 0.001 ± 0.0
Pro
ProAla: 2.978 ± 0.031
ProCys: 0.532 ± 0.013
ProAsp: 2.532 ± 0.03
ProGlu: 3.358 ± 0.04
ProPhe: 1.796 ± 0.029
ProGly: 2.662 ± 0.036
ProHis: 1.154 ± 0.019
ProIle: 3.05 ± 0.032
ProLys: 3.069 ± 0.036
ProLeu: 4.33 ± 0.043
ProMet: 0.845 ± 0.018
ProAsn: 2.514 ± 0.032
ProPro: 3.446 ± 0.053
ProGln: 2.19 ± 0.028
ProArg: 2.627 ± 0.032
ProSer: 5.282 ± 0.052
ProThr: 3.257 ± 0.035
ProVal: 2.754 ± 0.029
ProTrp: 0.516 ± 0.014
ProTyr: 1.337 ± 0.022
ProXaa: 0.001 ± 0.001
Gln
GlnAla: 2.693 ± 0.029
GlnCys: 0.441 ± 0.012
GlnAsp: 2.042 ± 0.027
GlnGlu: 2.619 ± 0.033
GlnPhe: 1.346 ± 0.024
GlnGly: 1.902 ± 0.027
GlnHis: 0.896 ± 0.015
GlnIle: 2.779 ± 0.031
GlnLys: 2.935 ± 0.032
GlnLeu: 3.463 ± 0.043
GlnMet: 0.847 ± 0.016
GlnAsn: 2.518 ± 0.032
GlnPro: 1.906 ± 0.031
GlnGln: 2.105 ± 0.051
GlnArg: 2.475 ± 0.032
GlnSer: 3.58 ± 0.034
GlnThr: 2.381 ± 0.031
GlnVal: 2.134 ± 0.029
GlnTrp: 0.445 ± 0.012
GlnTyr: 1.159 ± 0.018
GlnXaa: 0.001 ± 0.0
Arg
ArgAla: 3.611 ± 0.042
ArgCys: 0.736 ± 0.018
ArgAsp: 3.132 ± 0.033
ArgGlu: 3.87 ± 0.045
ArgPhe: 2.088 ± 0.025
ArgGly: 3.078 ± 0.039
ArgHis: 1.362 ± 0.021
ArgIle: 3.586 ± 0.036
ArgLys: 4.227 ± 0.048
ArgLeu: 5.182 ± 0.047
ArgMet: 1.173 ± 0.021
ArgAsn: 3.047 ± 0.037
ArgPro: 2.649 ± 0.029
ArgGln: 2.391 ± 0.031
ArgArg: 4.234 ± 0.053
ArgSer: 4.977 ± 0.059
ArgThr: 2.967 ± 0.029
ArgVal: 2.872 ± 0.032
ArgTrp: 0.695 ± 0.016
ArgTyr: 1.634 ± 0.026
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.334 ± 0.049
SerCys: 1.178 ± 0.02
SerAsp: 4.819 ± 0.047
SerGlu: 5.417 ± 0.053
SerPhe: 3.528 ± 0.039
SerGly: 5.064 ± 0.047
SerHis: 2.331 ± 0.033
SerIle: 5.863 ± 0.052
SerLys: 5.813 ± 0.052
SerLeu: 8.665 ± 0.059
SerMet: 1.812 ± 0.023
SerAsn: 4.923 ± 0.054
SerPro: 5.216 ± 0.067
SerGln: 4.021 ± 0.04
SerArg: 5.527 ± 0.05
SerSer: 10.848 ± 0.099
SerThr: 5.987 ± 0.045
SerVal: 4.554 ± 0.044
SerTrp: 1.025 ± 0.02
SerTyr: 2.413 ± 0.031
SerXaa: 0.002 ± 0.001
Thr
ThrAla: 3.806 ± 0.035
ThrCys: 0.776 ± 0.016
ThrAsp: 2.806 ± 0.032
ThrGlu: 3.406 ± 0.036
ThrPhe: 2.145 ± 0.027
ThrGly: 3.402 ± 0.033
ThrHis: 1.298 ± 0.024
ThrIle: 3.665 ± 0.036
ThrLys: 3.437 ± 0.036
ThrLeu: 5.17 ± 0.046
ThrMet: 1.082 ± 0.019
ThrAsn: 2.831 ± 0.037
ThrPro: 3.432 ± 0.039
ThrGln: 2.285 ± 0.033
ThrArg: 3.074 ± 0.033
ThrSer: 6.006 ± 0.049
ThrThr: 3.798 ± 0.044
ThrVal: 3.193 ± 0.034
ThrTrp: 0.618 ± 0.016
ThrTyr: 1.6 ± 0.023
ThrXaa: 0.001 ± 0.001
Val
ValAla: 3.705 ± 0.038
ValCys: 0.721 ± 0.016
ValAsp: 3.127 ± 0.03
ValGlu: 3.679 ± 0.039
ValPhe: 2.12 ± 0.031
ValGly: 2.984 ± 0.032
ValHis: 1.122 ± 0.019
ValIle: 3.336 ± 0.031
ValLys: 3.307 ± 0.035
ValLeu: 4.843 ± 0.046
ValMet: 1.14 ± 0.019
ValAsn: 2.5 ± 0.03
ValPro: 2.717 ± 0.033
ValGln: 2.102 ± 0.027
ValArg: 2.929 ± 0.035
ValSer: 4.583 ± 0.042
ValThr: 3.165 ± 0.031
ValVal: 3.14 ± 0.037
ValTrp: 0.654 ± 0.017
ValTyr: 1.464 ± 0.024
ValXaa: 0.002 ± 0.001
Trp
TrpAla: 0.78 ± 0.016
TrpCys: 0.175 ± 0.008
TrpAsp: 0.72 ± 0.016
TrpGlu: 0.75 ± 0.016
TrpPhe: 0.434 ± 0.013
TrpGly: 0.663 ± 0.017
TrpHis: 0.279 ± 0.01
TrpIle: 0.806 ± 0.017
TrpLys: 0.814 ± 0.016
TrpLeu: 1.088 ± 0.022
TrpMet: 0.279 ± 0.009
TrpAsn: 0.629 ± 0.015
TrpPro: 0.473 ± 0.013
TrpGln: 0.447 ± 0.013
TrpArg: 0.744 ± 0.015
TrpSer: 0.916 ± 0.018
TrpThr: 0.725 ± 0.016
TrpVal: 0.639 ± 0.015
TrpTrp: 0.211 ± 0.009
TrpTyr: 0.348 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.719 ± 0.027
TyrCys: 0.434 ± 0.012
TyrAsp: 1.684 ± 0.025
TyrGlu: 1.757 ± 0.025
TyrPhe: 1.294 ± 0.022
TyrGly: 1.75 ± 0.026
TyrHis: 0.778 ± 0.017
TyrIle: 1.718 ± 0.027
TyrLys: 1.405 ± 0.021
TyrLeu: 2.842 ± 0.036
TyrMet: 0.569 ± 0.014
TyrAsn: 1.363 ± 0.023
TyrPro: 1.37 ± 0.021
TyrGln: 1.268 ± 0.024
TyrArg: 1.609 ± 0.027
TyrSer: 2.455 ± 0.029
TyrThr: 1.503 ± 0.025
TyrVal: 1.471 ± 0.024
TyrTrp: 0.34 ± 0.009
TyrTyr: 0.917 ± 0.02
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.001
XaaGlu: 0.001 ± 0.001
XaaPhe: 0.0 ± 0.0
XaaGly: 0.002 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.002 ± 0.001
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.001 ± 0.001
XaaPro: 0.0 ± 0.0
XaaGln: 0.001 ± 0.001
XaaArg: 0.0 ± 0.0
XaaSer: 0.002 ± 0.001
XaaThr: 0.0 ± 0.0
XaaVal: 0.001 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.001 ± 0.0
XaaXaa: 0.015 ± 0.006
Statistics based on 6757 proteins (3060206 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
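A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the code behind the values above: it assumes that whole proteins are resampled with replacement, that 100 replicates are drawn, and that the reported error is the standard deviation across replicates (the page does not say which spread measure was used). The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Overlapping dipeptide counts for one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille frequency.

    Each replicate resamples whole proteins with replacement, so residues
    from the same protein always stay together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (not real proteome data).
toy = ["MKLLSS", "SSLA", "MSSSSK", "MALLK"]
mean, err = bootstrap_error(toy, "SS")
print(f"SerSer: {mean:.3f} ± {err:.3f} (per mille)")
```

Resampling proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein, which is presumably why the error is estimated at the protein level.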

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski