Amino acid dipepetide frequency for Chaetomium thermophilum (strain DSM 1495 / CBS 144.50 / IMI 039719)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.139AlaAla: 11.139 ± 0.102
0.961AlaCys: 0.961 ± 0.019
4.17AlaAsp: 4.17 ± 0.039
5.77AlaGlu: 5.77 ± 0.092
2.888AlaPhe: 2.888 ± 0.031
6.07AlaGly: 6.07 ± 0.05
1.752AlaHis: 1.752 ± 0.023
3.993AlaIle: 3.993 ± 0.037
4.383AlaLys: 4.383 ± 0.052
7.841AlaLeu: 7.841 ± 0.057
1.868AlaMet: 1.868 ± 0.025
2.964AlaAsn: 2.964 ± 0.028
5.781AlaPro: 5.781 ± 0.088
3.845AlaGln: 3.845 ± 0.042
5.582AlaArg: 5.582 ± 0.047
7.147AlaSer: 7.147 ± 0.058
5.446AlaThr: 5.446 ± 0.048
5.62AlaVal: 5.62 ± 0.044
1.118AlaTrp: 1.118 ± 0.017
2.047AlaTyr: 2.047 ± 0.026
0.0AlaXaa: 0.0 ± 0.0
Cys
0.861CysAla: 0.861 ± 0.015
0.206CysCys: 0.206 ± 0.008
0.569CysAsp: 0.569 ± 0.013
0.583CysGlu: 0.583 ± 0.013
0.455CysPhe: 0.455 ± 0.013
0.829CysGly: 0.829 ± 0.02
0.277CysHis: 0.277 ± 0.009
0.588CysIle: 0.588 ± 0.012
0.474CysLys: 0.474 ± 0.011
1.091CysLeu: 1.091 ± 0.02
0.232CysMet: 0.232 ± 0.009
0.37CysAsn: 0.37 ± 0.01
0.659CysPro: 0.659 ± 0.014
0.393CysGln: 0.393 ± 0.01
0.714CysArg: 0.714 ± 0.014
0.777CysSer: 0.777 ± 0.017
0.581CysThr: 0.581 ± 0.013
0.702CysVal: 0.702 ± 0.014
0.186CysTrp: 0.186 ± 0.007
0.308CysTyr: 0.308 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
4.466AspAla: 4.466 ± 0.039
0.556AspCys: 0.556 ± 0.012
4.494AspAsp: 4.494 ± 0.054
4.726AspGlu: 4.726 ± 0.052
1.983AspPhe: 1.983 ± 0.023
3.977AspGly: 3.977 ± 0.035
1.125AspHis: 1.125 ± 0.021
2.704AspIle: 2.704 ± 0.033
2.457AspLys: 2.457 ± 0.032
4.715AspLeu: 4.715 ± 0.039
1.182AspMet: 1.182 ± 0.019
1.822AspAsn: 1.822 ± 0.024
3.667AspPro: 3.667 ± 0.034
1.707AspGln: 1.707 ± 0.023
3.143AspArg: 3.143 ± 0.046
3.759AspSer: 3.759 ± 0.037
2.571AspThr: 2.571 ± 0.026
3.54AspVal: 3.54 ± 0.034
0.825AspTrp: 0.825 ± 0.016
1.468AspTyr: 1.468 ± 0.02
0.0AspXaa: 0.0 ± 0.0
Glu
5.764GluAla: 5.764 ± 0.08
0.593GluCys: 0.593 ± 0.015
4.321GluAsp: 4.321 ± 0.049
7.356GluGlu: 7.356 ± 0.13
1.93GluPhe: 1.93 ± 0.024
4.383GluGly: 4.383 ± 0.047
1.362GluHis: 1.362 ± 0.018
2.965GluIle: 2.965 ± 0.033
4.325GluLys: 4.325 ± 0.052
5.5GluLeu: 5.5 ± 0.05
1.529GluMet: 1.529 ± 0.021
2.173GluAsn: 2.173 ± 0.024
3.309GluPro: 3.309 ± 0.089
2.864GluGln: 2.864 ± 0.036
4.67GluArg: 4.67 ± 0.058
4.055GluSer: 4.055 ± 0.043
3.394GluThr: 3.394 ± 0.048
3.894GluVal: 3.894 ± 0.048
0.937GluTrp: 0.937 ± 0.017
1.634GluTyr: 1.634 ± 0.022
0.001GluXaa: 0.001 ± 0.0
Phe
2.836PheAla: 2.836 ± 0.028
0.467PheCys: 0.467 ± 0.011
2.094PheAsp: 2.094 ± 0.022
2.026PheGlu: 2.026 ± 0.025
1.406PhePhe: 1.406 ± 0.02
2.609PheGly: 2.609 ± 0.037
0.794PheHis: 0.794 ± 0.014
1.564PheIle: 1.564 ± 0.02
1.387PheLys: 1.387 ± 0.02
3.162PheLeu: 3.162 ± 0.034
0.672PheMet: 0.672 ± 0.013
1.252PheAsn: 1.252 ± 0.02
1.909PhePro: 1.909 ± 0.021
1.208PheGln: 1.208 ± 0.022
1.95PheArg: 1.95 ± 0.026
2.624PheSer: 2.624 ± 0.031
1.947PheThr: 1.947 ± 0.021
2.195PheVal: 2.195 ± 0.024
0.56PheTrp: 0.56 ± 0.013
0.985PheTyr: 0.985 ± 0.015
0.0PheXaa: 0.0 ± 0.0
Gly
5.366GlyAla: 5.366 ± 0.039
0.798GlyCys: 0.798 ± 0.017
3.489GlyAsp: 3.489 ± 0.035
4.166GlyGlu: 4.166 ± 0.042
2.538GlyPhe: 2.538 ± 0.029
6.032GlyGly: 6.032 ± 0.063
1.594GlyHis: 1.594 ± 0.024
3.214GlyIle: 3.214 ± 0.035
3.587GlyLys: 3.587 ± 0.033
5.746GlyLeu: 5.746 ± 0.051
1.535GlyMet: 1.535 ± 0.025
2.496GlyAsn: 2.496 ± 0.033
3.678GlyPro: 3.678 ± 0.041
2.599GlyGln: 2.599 ± 0.028
4.403GlyArg: 4.403 ± 0.044
5.728GlySer: 5.728 ± 0.056
3.95GlyThr: 3.95 ± 0.039
4.464GlyVal: 4.464 ± 0.037
1.135GlyTrp: 1.135 ± 0.02
2.012GlyTyr: 2.012 ± 0.03
0.001GlyXaa: 0.001 ± 0.0
His
1.807HisAla: 1.807 ± 0.02
0.271HisCys: 0.271 ± 0.009
1.222HisAsp: 1.222 ± 0.018
1.24HisGlu: 1.24 ± 0.018
0.821HisPhe: 0.821 ± 0.016
1.612HisGly: 1.612 ± 0.024
1.081HisHis: 1.081 ± 0.028
1.113HisIle: 1.113 ± 0.016
0.906HisLys: 0.906 ± 0.017
2.103HisLeu: 2.103 ± 0.024
0.441HisMet: 0.441 ± 0.01
0.817HisAsn: 0.817 ± 0.017
1.804HisPro: 1.804 ± 0.027
1.033HisGln: 1.033 ± 0.019
1.527HisArg: 1.527 ± 0.023
1.696HisSer: 1.696 ± 0.024
1.187HisThr: 1.187 ± 0.019
1.315HisVal: 1.315 ± 0.019
0.315HisTrp: 0.315 ± 0.009
0.687HisTyr: 0.687 ± 0.015
0.0HisXaa: 0.0 ± 0.0
Ile
3.887IleAla: 3.887 ± 0.036
0.632IleCys: 0.632 ± 0.013
2.623IleAsp: 2.623 ± 0.028
2.861IleGlu: 2.861 ± 0.033
1.662IlePhe: 1.662 ± 0.025
2.772IleGly: 2.772 ± 0.029
1.045IleHis: 1.045 ± 0.018
2.273IleIle: 2.273 ± 0.03
2.198IleLys: 2.198 ± 0.028
4.146IleLeu: 4.146 ± 0.043
0.909IleMet: 0.909 ± 0.015
1.652IleAsn: 1.652 ± 0.022
3.142IlePro: 3.142 ± 0.03
1.665IleGln: 1.665 ± 0.022
2.816IleArg: 2.816 ± 0.03
3.257IleSer: 3.257 ± 0.028
2.731IleThr: 2.731 ± 0.028
2.98IleVal: 2.98 ± 0.032
0.63IleTrp: 0.63 ± 0.012
1.242IleTyr: 1.242 ± 0.019
0.0IleXaa: 0.0 ± 0.0
Lys
4.51LysAla: 4.51 ± 0.051
0.437LysCys: 0.437 ± 0.01
2.71LysAsp: 2.71 ± 0.031
4.007LysGlu: 4.007 ± 0.054
1.39LysPhe: 1.39 ± 0.018
3.137LysGly: 3.137 ± 0.034
1.147LysHis: 1.147 ± 0.021
2.184LysIle: 2.184 ± 0.026
4.024LysLys: 4.024 ± 0.057
4.187LysLeu: 4.187 ± 0.036
0.986LysMet: 0.986 ± 0.018
1.671LysAsn: 1.671 ± 0.022
3.166LysPro: 3.166 ± 0.041
2.012LysGln: 2.012 ± 0.025
3.693LysArg: 3.693 ± 0.04
3.131LysSer: 3.131 ± 0.033
2.81LysThr: 2.81 ± 0.034
2.915LysVal: 2.915 ± 0.037
0.647LysTrp: 0.647 ± 0.012
1.304LysTyr: 1.304 ± 0.019
0.0LysXaa: 0.0 ± 0.0
Leu
8.117LeuAla: 8.117 ± 0.061
1.072LeuCys: 1.072 ± 0.016
4.865LeuAsp: 4.865 ± 0.043
5.593LeuGlu: 5.593 ± 0.05
3.052LeuPhe: 3.052 ± 0.036
5.583LeuGly: 5.583 ± 0.051
2.057LeuHis: 2.057 ± 0.026
3.649LeuIle: 3.649 ± 0.037
4.211LeuLys: 4.211 ± 0.041
8.128LeuLeu: 8.128 ± 0.071
1.636LeuMet: 1.636 ± 0.023
2.951LeuAsn: 2.951 ± 0.035
5.899LeuPro: 5.899 ± 0.048
3.538LeuGln: 3.538 ± 0.034
6.176LeuArg: 6.176 ± 0.046
6.853LeuSer: 6.853 ± 0.048
4.827LeuThr: 4.827 ± 0.041
5.199LeuVal: 5.199 ± 0.044
1.121LeuTrp: 1.121 ± 0.021
2.117LeuTyr: 2.117 ± 0.025
0.0LeuXaa: 0.0 ± 0.0
Met
2.151MetAla: 2.151 ± 0.025
0.209MetCys: 0.209 ± 0.008
1.129MetAsp: 1.129 ± 0.017
1.249MetGlu: 1.249 ± 0.018
0.64MetPhe: 0.64 ± 0.014
1.395MetGly: 1.395 ± 0.019
0.405MetHis: 0.405 ± 0.011
0.875MetIle: 0.875 ± 0.018
0.926MetLys: 0.926 ± 0.015
1.767MetLeu: 1.767 ± 0.021
0.541MetMet: 0.541 ± 0.014
0.635MetAsn: 0.635 ± 0.015
1.291MetPro: 1.291 ± 0.022
0.776MetGln: 0.776 ± 0.014
1.321MetArg: 1.321 ± 0.019
1.676MetSer: 1.676 ± 0.022
1.225MetThr: 1.225 ± 0.019
1.258MetVal: 1.258 ± 0.02
0.243MetTrp: 0.243 ± 0.008
0.481MetTyr: 0.481 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
2.958AsnAla: 2.958 ± 0.027
0.379AsnCys: 0.379 ± 0.012
1.838AsnAsp: 1.838 ± 0.025
1.919AsnGlu: 1.919 ± 0.024
1.196AsnPhe: 1.196 ± 0.019
3.03AsnGly: 3.03 ± 0.034
0.813AsnHis: 0.813 ± 0.015
1.799AsnIle: 1.799 ± 0.024
1.604AsnLys: 1.604 ± 0.021
2.998AsnLeu: 2.998 ± 0.032
0.737AsnMet: 0.737 ± 0.014
1.797AsnAsn: 1.797 ± 0.046
2.662AsnPro: 2.662 ± 0.028
1.252AsnGln: 1.252 ± 0.021
1.918AsnArg: 1.918 ± 0.028
2.493AsnSer: 2.493 ± 0.028
2.015AsnThr: 2.015 ± 0.022
2.078AsnVal: 2.078 ± 0.022
0.51AsnTrp: 0.51 ± 0.012
0.944AsnTyr: 0.944 ± 0.018
0.0AsnXaa: 0.0 ± 0.0
Pro
6.548ProAla: 6.548 ± 0.108
0.508ProCys: 0.508 ± 0.013
3.457ProAsp: 3.457 ± 0.032
4.37ProGlu: 4.37 ± 0.069
2.017ProPhe: 2.017 ± 0.023
4.444ProGly: 4.444 ± 0.049
1.434ProHis: 1.434 ± 0.022
2.671ProIle: 2.671 ± 0.028
2.999ProLys: 2.999 ± 0.034
5.071ProLeu: 5.071 ± 0.039
1.049ProMet: 1.049 ± 0.018
2.411ProAsn: 2.411 ± 0.028
7.155ProPro: 7.155 ± 0.106
2.956ProGln: 2.956 ± 0.046
3.801ProArg: 3.801 ± 0.037
6.824ProSer: 6.824 ± 0.072
4.568ProThr: 4.568 ± 0.052
4.077ProVal: 4.077 ± 0.06
0.723ProTrp: 0.723 ± 0.014
1.582ProTyr: 1.582 ± 0.02
0.001ProXaa: 0.001 ± 0.0
Gln
3.915GlnAla: 3.915 ± 0.043
0.386GlnCys: 0.386 ± 0.01
1.848GlnAsp: 1.848 ± 0.023
2.586GlnGlu: 2.586 ± 0.028
1.158GlnPhe: 1.158 ± 0.02
2.346GlnGly: 2.346 ± 0.027
1.175GlnHis: 1.175 ± 0.019
1.724GlnIle: 1.724 ± 0.024
1.997GlnLys: 1.997 ± 0.026
3.472GlnLeu: 3.472 ± 0.034
0.828GlnMet: 0.828 ± 0.017
1.374GlnAsn: 1.374 ± 0.022
3.335GlnPro: 3.335 ± 0.053
4.103GlnGln: 4.103 ± 0.109
2.789GlnArg: 2.789 ± 0.028
2.876GlnSer: 2.876 ± 0.031
2.317GlnThr: 2.317 ± 0.029
2.147GlnVal: 2.147 ± 0.026
0.504GlnTrp: 0.504 ± 0.011
1.054GlnTyr: 1.054 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
5.29ArgAla: 5.29 ± 0.043
0.686ArgCys: 0.686 ± 0.014
3.596ArgAsp: 3.596 ± 0.05
4.783ArgGlu: 4.783 ± 0.059
2.106ArgPhe: 2.106 ± 0.024
4.029ArgGly: 4.029 ± 0.044
1.616ArgHis: 1.616 ± 0.021
2.894ArgIle: 2.894 ± 0.033
3.707ArgLys: 3.707 ± 0.037
5.72ArgLeu: 5.72 ± 0.045
1.376ArgMet: 1.376 ± 0.021
2.163ArgAsn: 2.163 ± 0.024
4.056ArgPro: 4.056 ± 0.042
2.758ArgGln: 2.758 ± 0.031
5.929ArgArg: 5.929 ± 0.057
4.864ArgSer: 4.864 ± 0.046
3.272ArgThr: 3.272 ± 0.031
3.571ArgVal: 3.571 ± 0.034
0.968ArgTrp: 0.968 ± 0.018
1.663ArgTyr: 1.663 ± 0.02
0.0ArgXaa: 0.0 ± 0.0
Ser
6.793SerAla: 6.793 ± 0.057
0.739SerCys: 0.739 ± 0.015
3.917SerAsp: 3.917 ± 0.035
4.04SerGlu: 4.04 ± 0.039
2.579SerPhe: 2.579 ± 0.029
5.447SerGly: 5.447 ± 0.051
1.81SerHis: 1.81 ± 0.024
3.351SerIle: 3.351 ± 0.032
3.492SerLys: 3.492 ± 0.033
6.731SerLeu: 6.731 ± 0.045
1.483SerMet: 1.483 ± 0.019
2.724SerAsn: 2.724 ± 0.032
6.083SerPro: 6.083 ± 0.072
3.323SerGln: 3.323 ± 0.041
5.106SerArg: 5.106 ± 0.05
9.788SerSer: 9.788 ± 0.121
5.452SerThr: 5.452 ± 0.057
4.363SerVal: 4.363 ± 0.041
0.954SerTrp: 0.954 ± 0.016
1.756SerTyr: 1.756 ± 0.022
0.0SerXaa: 0.0 ± 0.0
Thr
5.549ThrAla: 5.549 ± 0.042
0.641ThrCys: 0.641 ± 0.011
2.614ThrAsp: 2.614 ± 0.028
3.073ThrGlu: 3.073 ± 0.044
2.027ThrPhe: 2.027 ± 0.024
4.102ThrGly: 4.102 ± 0.04
1.169ThrHis: 1.169 ± 0.016
2.883ThrIle: 2.883 ± 0.033
2.571ThrLys: 2.571 ± 0.028
5.043ThrLeu: 5.043 ± 0.04
1.064ThrMet: 1.064 ± 0.016
2.016ThrAsn: 2.016 ± 0.022
4.94ThrPro: 4.94 ± 0.049
2.104ThrGln: 2.104 ± 0.025
3.185ThrArg: 3.185 ± 0.029
5.345ThrSer: 5.345 ± 0.055
4.845ThrThr: 4.845 ± 0.058
3.681ThrVal: 3.681 ± 0.041
0.74ThrTrp: 0.74 ± 0.015
1.421ThrTyr: 1.421 ± 0.024
0.0ThrXaa: 0.0 ± 0.0
Val
5.333ValAla: 5.333 ± 0.047
0.762ValCys: 0.762 ± 0.012
3.524ValAsp: 3.524 ± 0.036
4.156ValGlu: 4.156 ± 0.074
2.235ValPhe: 2.235 ± 0.029
3.89ValGly: 3.89 ± 0.038
1.294ValHis: 1.294 ± 0.019
2.793ValIle: 2.793 ± 0.031
3.015ValLys: 3.015 ± 0.034
5.447ValLeu: 5.447 ± 0.046
1.252ValMet: 1.252 ± 0.02
2.107ValAsn: 2.107 ± 0.029
4.039ValPro: 4.039 ± 0.041
2.287ValGln: 2.287 ± 0.026
3.724ValArg: 3.724 ± 0.037
4.391ValSer: 4.391 ± 0.037
3.584ValThr: 3.584 ± 0.034
4.441ValVal: 4.441 ± 0.043
0.893ValTrp: 0.893 ± 0.016
1.648ValTyr: 1.648 ± 0.022
0.0ValXaa: 0.0 ± 0.0
Trp
1.092TrpAla: 1.092 ± 0.016
0.198TrpCys: 0.198 ± 0.007
0.848TrpAsp: 0.848 ± 0.017
0.933TrpGlu: 0.933 ± 0.017
0.503TrpPhe: 0.503 ± 0.012
0.91TrpGly: 0.91 ± 0.017
0.328TrpHis: 0.328 ± 0.01
0.641TrpIle: 0.641 ± 0.014
0.745TrpLys: 0.745 ± 0.015
1.304TrpLeu: 1.304 ± 0.017
0.331TrpMet: 0.331 ± 0.009
0.517TrpAsn: 0.517 ± 0.012
0.597TrpPro: 0.597 ± 0.014
0.519TrpGln: 0.519 ± 0.012
0.978TrpArg: 0.978 ± 0.016
0.895TrpSer: 0.895 ± 0.018
0.832TrpThr: 0.832 ± 0.014
0.851TrpVal: 0.851 ± 0.017
0.269TrpTrp: 0.269 ± 0.009
0.388TrpTyr: 0.388 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.004TyrAla: 2.004 ± 0.023
0.354TyrCys: 0.354 ± 0.01
1.559TyrAsp: 1.559 ± 0.023
1.523TyrGlu: 1.523 ± 0.024
1.061TyrPhe: 1.061 ± 0.015
1.931TyrGly: 1.931 ± 0.029
0.72TyrHis: 0.72 ± 0.014
1.245TyrIle: 1.245 ± 0.021
1.06TyrLys: 1.06 ± 0.02
2.471TyrLeu: 2.471 ± 0.029
0.54TyrMet: 0.54 ± 0.012
1.026TyrAsn: 1.026 ± 0.022
1.495TyrPro: 1.495 ± 0.023
0.993TyrGln: 0.993 ± 0.015
1.624TyrArg: 1.624 ± 0.019
1.751TyrSer: 1.751 ± 0.02
1.457TyrThr: 1.457 ± 0.021
1.527TyrVal: 1.527 ± 0.021
0.4TyrTrp: 0.4 ± 0.009
0.891TyrTyr: 0.891 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.046XaaXaa: 0.046 ± 0.021
Statistics based on 7185 proteins (3924665 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski