Amino acid dipepetide frequency for Hypocrea atroviridis (strain ATCC 20476 / IMI 206040) (Trichoderma atroviride)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.927AlaAla: 9.927 ± 0.068
1.125AlaCys: 1.125 ± 0.016
4.427AlaAsp: 4.427 ± 0.034
5.151AlaGlu: 5.151 ± 0.037
3.251AlaPhe: 3.251 ± 0.027
5.75AlaGly: 5.75 ± 0.038
1.785AlaHis: 1.785 ± 0.02
4.679AlaIle: 4.679 ± 0.035
4.333AlaLys: 4.333 ± 0.035
7.897AlaLeu: 7.897 ± 0.041
2.079AlaMet: 2.079 ± 0.023
3.189AlaAsn: 3.189 ± 0.026
4.584AlaPro: 4.584 ± 0.036
3.511AlaGln: 3.511 ± 0.03
4.746AlaArg: 4.746 ± 0.032
7.615AlaSer: 7.615 ± 0.049
5.346AlaThr: 5.346 ± 0.038
5.48AlaVal: 5.48 ± 0.033
1.222AlaTrp: 1.222 ± 0.022
2.268AlaTyr: 2.268 ± 0.025
0.0AlaXaa: 0.0 ± 0.0
Cys
0.947CysAla: 0.947 ± 0.015
0.281CysCys: 0.281 ± 0.009
0.692CysAsp: 0.692 ± 0.011
0.613CysGlu: 0.613 ± 0.012
0.579CysPhe: 0.579 ± 0.012
0.983CysGly: 0.983 ± 0.017
0.345CysHis: 0.345 ± 0.009
0.763CysIle: 0.763 ± 0.014
0.533CysLys: 0.533 ± 0.011
1.299CysLeu: 1.299 ± 0.016
0.269CysMet: 0.269 ± 0.006
0.474CysAsn: 0.474 ± 0.011
0.65CysPro: 0.65 ± 0.015
0.501CysGln: 0.501 ± 0.009
0.756CysArg: 0.756 ± 0.014
0.995CysSer: 0.995 ± 0.015
0.686CysThr: 0.686 ± 0.012
0.801CysVal: 0.801 ± 0.013
0.224CysTrp: 0.224 ± 0.007
0.383CysTyr: 0.383 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
4.959AspAla: 4.959 ± 0.033
0.671AspCys: 0.671 ± 0.011
4.427AspAsp: 4.427 ± 0.049
4.614AspGlu: 4.614 ± 0.04
2.286AspPhe: 2.286 ± 0.022
4.245AspGly: 4.245 ± 0.032
1.185AspHis: 1.185 ± 0.017
3.284AspIle: 3.284 ± 0.023
2.603AspLys: 2.603 ± 0.026
4.975AspLeu: 4.975 ± 0.035
1.356AspMet: 1.356 ± 0.017
1.972AspAsn: 1.972 ± 0.019
3.034AspPro: 3.034 ± 0.028
1.845AspGln: 1.845 ± 0.019
2.877AspArg: 2.877 ± 0.03
4.143AspSer: 4.143 ± 0.026
2.853AspThr: 2.853 ± 0.021
3.763AspVal: 3.763 ± 0.034
0.897AspTrp: 0.897 ± 0.013
1.642AspTyr: 1.642 ± 0.018
0.0AspXaa: 0.0 ± 0.0
Glu
5.672GluAla: 5.672 ± 0.036
0.636GluCys: 0.636 ± 0.012
4.17GluAsp: 4.17 ± 0.037
5.305GluGlu: 5.305 ± 0.058
2.017GluPhe: 2.017 ± 0.02
3.504GluGly: 3.504 ± 0.025
1.354GluHis: 1.354 ± 0.017
3.169GluIle: 3.169 ± 0.029
3.635GluLys: 3.635 ± 0.037
5.255GluLeu: 5.255 ± 0.037
1.519GluMet: 1.519 ± 0.017
2.252GluAsn: 2.252 ± 0.021
2.663GluPro: 2.663 ± 0.028
2.45GluGln: 2.45 ± 0.023
3.688GluArg: 3.688 ± 0.032
4.237GluSer: 4.237 ± 0.031
3.419GluThr: 3.419 ± 0.026
3.316GluVal: 3.316 ± 0.03
0.908GluTrp: 0.908 ± 0.012
1.665GluTyr: 1.665 ± 0.02
0.0GluXaa: 0.0 ± 0.0
Phe
3.14PheAla: 3.14 ± 0.021
0.59PheCys: 0.59 ± 0.011
2.377PheAsp: 2.377 ± 0.021
2.159PheGlu: 2.159 ± 0.02
1.713PhePhe: 1.713 ± 0.022
2.928PheGly: 2.928 ± 0.027
0.93PheHis: 0.93 ± 0.014
2.01PheIle: 2.01 ± 0.023
1.605PheLys: 1.605 ± 0.018
3.495PheLeu: 3.495 ± 0.031
0.809PheMet: 0.809 ± 0.014
1.519PheAsn: 1.519 ± 0.016
1.94PhePro: 1.94 ± 0.019
1.488PheGln: 1.488 ± 0.017
1.905PheArg: 1.905 ± 0.019
3.132PheSer: 3.132 ± 0.027
2.161PheThr: 2.161 ± 0.02
2.423PheVal: 2.423 ± 0.023
0.668PheTrp: 0.668 ± 0.012
1.163PheTyr: 1.163 ± 0.014
0.0PheXaa: 0.0 ± 0.0
Gly
5.412GlyAla: 5.412 ± 0.039
0.875GlyCys: 0.875 ± 0.017
3.67GlyAsp: 3.67 ± 0.034
3.613GlyGlu: 3.613 ± 0.03
2.844GlyPhe: 2.844 ± 0.025
5.613GlyGly: 5.613 ± 0.066
1.72GlyHis: 1.72 ± 0.02
3.721GlyIle: 3.721 ± 0.032
3.466GlyLys: 3.466 ± 0.033
6.1GlyLeu: 6.1 ± 0.041
1.519GlyMet: 1.519 ± 0.02
2.62GlyAsn: 2.62 ± 0.025
3.156GlyPro: 3.156 ± 0.034
2.562GlyGln: 2.562 ± 0.024
3.997GlyArg: 3.997 ± 0.032
5.779GlySer: 5.779 ± 0.041
3.776GlyThr: 3.776 ± 0.03
4.272GlyVal: 4.272 ± 0.031
1.189GlyTrp: 1.189 ± 0.016
2.141GlyTyr: 2.141 ± 0.026
0.0GlyXaa: 0.0 ± 0.0
His
1.774HisAla: 1.774 ± 0.02
0.343HisCys: 0.343 ± 0.008
1.344HisAsp: 1.344 ± 0.016
1.283HisGlu: 1.283 ± 0.015
0.943HisPhe: 0.943 ± 0.014
1.714HisGly: 1.714 ± 0.02
0.853HisHis: 0.853 ± 0.018
1.257HisIle: 1.257 ± 0.017
0.939HisLys: 0.939 ± 0.015
2.277HisLeu: 2.277 ± 0.022
0.516HisMet: 0.516 ± 0.01
0.87HisAsn: 0.87 ± 0.012
1.512HisPro: 1.512 ± 0.018
1.04HisGln: 1.04 ± 0.017
1.471HisArg: 1.471 ± 0.017
1.773HisSer: 1.773 ± 0.022
1.182HisThr: 1.182 ± 0.015
1.485HisVal: 1.485 ± 0.021
0.359HisTrp: 0.359 ± 0.008
0.711HisTyr: 0.711 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
4.452IleAla: 4.452 ± 0.035
0.813IleCys: 0.813 ± 0.015
3.042IleAsp: 3.042 ± 0.024
2.997IleGlu: 2.997 ± 0.032
2.152IlePhe: 2.152 ± 0.023
3.369IleGly: 3.369 ± 0.03
1.293IleHis: 1.293 ± 0.016
2.799IleIle: 2.799 ± 0.026
2.41IleLys: 2.41 ± 0.022
4.743IleLeu: 4.743 ± 0.036
1.087IleMet: 1.087 ± 0.013
1.975IleAsn: 1.975 ± 0.019
2.969IlePro: 2.969 ± 0.024
2.102IleGln: 2.102 ± 0.027
2.934IleArg: 2.934 ± 0.024
4.096IleSer: 4.096 ± 0.034
2.922IleThr: 2.922 ± 0.027
3.272IleVal: 3.272 ± 0.029
0.794IleTrp: 0.794 ± 0.014
1.487IleTyr: 1.487 ± 0.017
0.0IleXaa: 0.0 ± 0.0
Lys
4.523LysAla: 4.523 ± 0.033
0.519LysCys: 0.519 ± 0.012
2.953LysAsp: 2.953 ± 0.033
3.328LysGlu: 3.328 ± 0.033
1.558LysPhe: 1.558 ± 0.017
2.969LysGly: 2.969 ± 0.03
1.125LysHis: 1.125 ± 0.017
2.409LysIle: 2.409 ± 0.021
3.406LysLys: 3.406 ± 0.042
4.341LysLeu: 4.341 ± 0.031
1.077LysMet: 1.077 ± 0.015
1.814LysAsn: 1.814 ± 0.018
2.643LysPro: 2.643 ± 0.024
1.906LysGln: 1.906 ± 0.022
3.36LysArg: 3.36 ± 0.031
3.57LysSer: 3.57 ± 0.03
2.911LysThr: 2.911 ± 0.021
2.779LysVal: 2.779 ± 0.026
0.698LysTrp: 0.698 ± 0.01
1.426LysTyr: 1.426 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
8.05LeuAla: 8.05 ± 0.043
1.25LeuCys: 1.25 ± 0.018
5.315LeuAsp: 5.315 ± 0.037
5.542LeuGlu: 5.542 ± 0.04
3.452LeuPhe: 3.452 ± 0.033
6.032LeuGly: 6.032 ± 0.04
2.214LeuHis: 2.214 ± 0.022
4.194LeuIle: 4.194 ± 0.033
4.338LeuLys: 4.338 ± 0.032
8.605LeuLeu: 8.605 ± 0.067
1.857LeuMet: 1.857 ± 0.019
3.205LeuAsn: 3.205 ± 0.027
5.331LeuPro: 5.331 ± 0.033
3.97LeuGln: 3.97 ± 0.034
5.559LeuArg: 5.559 ± 0.04
7.429LeuSer: 7.429 ± 0.044
4.69LeuThr: 4.69 ± 0.032
5.501LeuVal: 5.501 ± 0.045
1.29LeuTrp: 1.29 ± 0.016
2.452LeuTyr: 2.452 ± 0.025
0.0LeuXaa: 0.0 ± 0.0
Met
2.419MetAla: 2.419 ± 0.02
0.244MetCys: 0.244 ± 0.006
1.286MetAsp: 1.286 ± 0.018
1.323MetGlu: 1.323 ± 0.016
0.743MetPhe: 0.743 ± 0.013
1.49MetGly: 1.49 ± 0.018
0.464MetHis: 0.464 ± 0.009
1.017MetIle: 1.017 ± 0.015
1.062MetLys: 1.062 ± 0.016
1.942MetLeu: 1.942 ± 0.02
0.619MetMet: 0.619 ± 0.011
0.84MetAsn: 0.84 ± 0.013
1.343MetPro: 1.343 ± 0.019
0.881MetGln: 0.881 ± 0.015
1.283MetArg: 1.283 ± 0.017
1.858MetSer: 1.858 ± 0.019
1.272MetThr: 1.272 ± 0.017
1.285MetVal: 1.285 ± 0.015
0.277MetTrp: 0.277 ± 0.008
0.535MetTyr: 0.535 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
3.239AsnAla: 3.239 ± 0.027
0.474AsnCys: 0.474 ± 0.009
2.026AsnAsp: 2.026 ± 0.02
2.051AsnGlu: 2.051 ± 0.022
1.433AsnPhe: 1.433 ± 0.016
3.056AsnGly: 3.056 ± 0.031
0.873AsnHis: 0.873 ± 0.012
2.16AsnIle: 2.16 ± 0.021
1.692AsnLys: 1.692 ± 0.022
3.319AsnLeu: 3.319 ± 0.026
0.908AsnMet: 0.908 ± 0.013
1.631AsnAsn: 1.631 ± 0.023
2.34AsnPro: 2.34 ± 0.022
1.376AsnGln: 1.376 ± 0.019
1.959AsnArg: 1.959 ± 0.024
2.856AsnSer: 2.856 ± 0.026
2.133AsnThr: 2.133 ± 0.021
2.365AsnVal: 2.365 ± 0.024
0.6AsnTrp: 0.6 ± 0.01
1.113AsnTyr: 1.113 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
4.953ProAla: 4.953 ± 0.037
0.53ProCys: 0.53 ± 0.011
3.065ProAsp: 3.065 ± 0.028
3.591ProGlu: 3.591 ± 0.03
2.05ProPhe: 2.05 ± 0.022
3.705ProGly: 3.705 ± 0.033
1.227ProHis: 1.227 ± 0.016
2.565ProIle: 2.565 ± 0.027
2.623ProLys: 2.623 ± 0.028
4.728ProLeu: 4.728 ± 0.034
1.076ProMet: 1.076 ± 0.015
2.195ProAsn: 2.195 ± 0.022
4.682ProPro: 4.682 ± 0.074
2.359ProGln: 2.359 ± 0.025
3.168ProArg: 3.168 ± 0.032
5.581ProSer: 5.581 ± 0.044
3.561ProThr: 3.561 ± 0.033
3.349ProVal: 3.349 ± 0.026
0.762ProTrp: 0.762 ± 0.013
1.467ProTyr: 1.467 ± 0.017
0.0ProXaa: 0.0 ± 0.0
Gln
3.572GlnAla: 3.572 ± 0.035
0.463GlnCys: 0.463 ± 0.01
2.124GlnAsp: 2.124 ± 0.02
2.327GlnGlu: 2.327 ± 0.021
1.374GlnPhe: 1.374 ± 0.018
2.466GlnGly: 2.466 ± 0.025
1.066GlnHis: 1.066 ± 0.017
1.943GlnIle: 1.943 ± 0.017
2.012GlnLys: 2.012 ± 0.022
3.633GlnLeu: 3.633 ± 0.03
0.929GlnMet: 0.929 ± 0.015
1.568GlnAsn: 1.568 ± 0.017
2.471GlnPro: 2.471 ± 0.029
2.553GlnGln: 2.553 ± 0.053
2.633GlnArg: 2.633 ± 0.027
3.117GlnSer: 3.117 ± 0.024
2.294GlnThr: 2.294 ± 0.022
2.203GlnVal: 2.203 ± 0.02
0.615GlnTrp: 0.615 ± 0.011
1.191GlnTyr: 1.191 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
4.504ArgAla: 4.504 ± 0.031
0.731ArgCys: 0.731 ± 0.014
3.36ArgAsp: 3.36 ± 0.037
3.608ArgGlu: 3.608 ± 0.034
2.137ArgPhe: 2.137 ± 0.02
3.641ArgGly: 3.641 ± 0.034
1.56ArgHis: 1.56 ± 0.018
2.976ArgIle: 2.976 ± 0.024
3.234ArgLys: 3.234 ± 0.03
5.525ArgLeu: 5.525 ± 0.042
1.27ArgMet: 1.27 ± 0.015
2.201ArgAsn: 2.201 ± 0.023
3.333ArgPro: 3.333 ± 0.031
2.6ArgGln: 2.6 ± 0.024
4.812ArgArg: 4.812 ± 0.045
4.5ArgSer: 4.5 ± 0.038
2.976ArgThr: 2.976 ± 0.023
3.239ArgVal: 3.239 ± 0.024
0.904ArgTrp: 0.904 ± 0.014
1.653ArgTyr: 1.653 ± 0.019
0.0ArgXaa: 0.0 ± 0.0
Ser
6.711SerAla: 6.711 ± 0.042
0.953SerCys: 0.953 ± 0.015
4.24SerAsp: 4.24 ± 0.03
4.107SerGlu: 4.107 ± 0.029
3.121SerPhe: 3.121 ± 0.028
5.595SerGly: 5.595 ± 0.047
1.955SerHis: 1.955 ± 0.02
4.24SerIle: 4.24 ± 0.029
3.834SerLys: 3.834 ± 0.036
7.381SerLeu: 7.381 ± 0.041
1.757SerMet: 1.757 ± 0.02
3.082SerAsn: 3.082 ± 0.028
5.126SerPro: 5.126 ± 0.047
3.356SerGln: 3.356 ± 0.031
4.866SerArg: 4.866 ± 0.04
8.622SerSer: 8.622 ± 0.078
5.201SerThr: 5.201 ± 0.044
4.52SerVal: 4.52 ± 0.029
1.188SerTrp: 1.188 ± 0.018
2.133SerTyr: 2.133 ± 0.022
0.0SerXaa: 0.0 ± 0.0
Thr
5.279ThrAla: 5.279 ± 0.038
0.783ThrCys: 0.783 ± 0.014
2.762ThrAsp: 2.762 ± 0.023
3.012ThrGlu: 3.012 ± 0.027
2.239ThrPhe: 2.239 ± 0.023
4.0ThrGly: 4.0 ± 0.031
1.217ThrHis: 1.217 ± 0.016
3.133ThrIle: 3.133 ± 0.032
2.64ThrLys: 2.64 ± 0.022
5.12ThrLeu: 5.12 ± 0.037
1.197ThrMet: 1.197 ± 0.017
2.098ThrAsn: 2.098 ± 0.02
3.922ThrPro: 3.922 ± 0.031
1.995ThrGln: 1.995 ± 0.018
2.966ThrArg: 2.966 ± 0.022
4.96ThrSer: 4.96 ± 0.045
3.898ThrThr: 3.898 ± 0.036
3.597ThrVal: 3.597 ± 0.03
0.862ThrTrp: 0.862 ± 0.013
1.605ThrTyr: 1.605 ± 0.017
0.0ThrXaa: 0.0 ± 0.0
Val
5.393ValAla: 5.393 ± 0.036
0.835ValCys: 0.835 ± 0.014
3.696ValAsp: 3.696 ± 0.026
3.733ValGlu: 3.733 ± 0.029
2.477ValPhe: 2.477 ± 0.024
3.887ValGly: 3.887 ± 0.03
1.344ValHis: 1.344 ± 0.016
3.079ValIle: 3.079 ± 0.027
2.941ValLys: 2.941 ± 0.028
5.555ValLeu: 5.555 ± 0.039
1.305ValMet: 1.305 ± 0.017
2.243ValAsn: 2.243 ± 0.021
3.42ValPro: 3.42 ± 0.029
2.355ValGln: 2.355 ± 0.023
3.215ValArg: 3.215 ± 0.025
4.557ValSer: 4.557 ± 0.033
3.466ValThr: 3.466 ± 0.028
4.251ValVal: 4.251 ± 0.037
0.88ValTrp: 0.88 ± 0.016
1.748ValTyr: 1.748 ± 0.02
0.0ValXaa: 0.0 ± 0.0
Trp
1.226TrpAla: 1.226 ± 0.017
0.204TrpCys: 0.204 ± 0.007
0.962TrpAsp: 0.962 ± 0.018
0.866TrpGlu: 0.866 ± 0.014
0.558TrpPhe: 0.558 ± 0.012
0.944TrpGly: 0.944 ± 0.015
0.376TrpHis: 0.376 ± 0.009
0.833TrpIle: 0.833 ± 0.014
0.829TrpLys: 0.829 ± 0.013
1.416TrpLeu: 1.416 ± 0.02
0.379TrpMet: 0.379 ± 0.01
0.689TrpAsn: 0.689 ± 0.012
0.652TrpPro: 0.652 ± 0.012
0.591TrpGln: 0.591 ± 0.01
0.965TrpArg: 0.965 ± 0.014
1.08TrpSer: 1.08 ± 0.016
0.916TrpThr: 0.916 ± 0.015
0.876TrpVal: 0.876 ± 0.013
0.299TrpTrp: 0.299 ± 0.008
0.429TrpTyr: 0.429 ± 0.009
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.212TyrAla: 2.212 ± 0.021
0.455TyrCys: 0.455 ± 0.01
1.692TyrAsp: 1.692 ± 0.022
1.61TyrGlu: 1.61 ± 0.02
1.226TyrPhe: 1.226 ± 0.018
2.153TyrGly: 2.153 ± 0.023
0.761TyrHis: 0.761 ± 0.013
1.482TyrIle: 1.482 ± 0.017
1.181TyrLys: 1.181 ± 0.017
2.668TyrLeu: 2.668 ± 0.022
0.652TyrMet: 0.652 ± 0.012
1.167TyrAsn: 1.167 ± 0.015
1.464TyrPro: 1.464 ± 0.015
1.111TyrGln: 1.111 ± 0.016
1.606TyrArg: 1.606 ± 0.018
2.089TyrSer: 2.089 ± 0.022
1.624TyrThr: 1.624 ± 0.02
1.601TyrVal: 1.601 ± 0.017
0.457TyrTrp: 0.457 ± 0.009
0.953TyrTyr: 0.953 ± 0.017
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11815 proteins (5388752 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski