Amino acid dipepetide frequency for Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) (Aspergillus fumigatus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.979AlaAla: 8.979 ± 0.063
1.087AlaCys: 1.087 ± 0.017
4.347AlaAsp: 4.347 ± 0.032
5.282AlaGlu: 5.282 ± 0.053
3.179AlaPhe: 3.179 ± 0.03
5.779AlaGly: 5.779 ± 0.036
1.796AlaHis: 1.796 ± 0.018
4.229AlaIle: 4.229 ± 0.032
3.93AlaLys: 3.93 ± 0.034
7.872AlaLeu: 7.872 ± 0.048
1.939AlaMet: 1.939 ± 0.02
2.94AlaAsn: 2.94 ± 0.025
4.656AlaPro: 4.656 ± 0.043
3.436AlaGln: 3.436 ± 0.029
5.111AlaArg: 5.111 ± 0.032
7.363AlaSer: 7.363 ± 0.048
5.244AlaThr: 5.244 ± 0.034
5.56AlaVal: 5.56 ± 0.038
1.155AlaTrp: 1.155 ± 0.018
2.175AlaTyr: 2.175 ± 0.022
0.0AlaXaa: 0.0 ± 0.0
Cys
0.967CysAla: 0.967 ± 0.014
0.263CysCys: 0.263 ± 0.008
0.684CysAsp: 0.684 ± 0.011
0.613CysGlu: 0.613 ± 0.012
0.551CysPhe: 0.551 ± 0.011
0.951CysGly: 0.951 ± 0.016
0.346CysHis: 0.346 ± 0.009
0.696CysIle: 0.696 ± 0.013
0.508CysLys: 0.508 ± 0.011
1.387CysLeu: 1.387 ± 0.018
0.277CysMet: 0.277 ± 0.008
0.446CysAsn: 0.446 ± 0.01
0.701CysPro: 0.701 ± 0.014
0.471CysGln: 0.471 ± 0.011
0.853CysArg: 0.853 ± 0.015
0.999CysSer: 0.999 ± 0.015
0.697CysThr: 0.697 ± 0.013
0.831CysVal: 0.831 ± 0.015
0.207CysTrp: 0.207 ± 0.007
0.36CysTyr: 0.36 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
4.641AspAla: 4.641 ± 0.038
0.644AspCys: 0.644 ± 0.013
3.821AspAsp: 3.821 ± 0.044
4.301AspGlu: 4.301 ± 0.04
2.144AspPhe: 2.144 ± 0.021
3.905AspGly: 3.905 ± 0.031
1.267AspHis: 1.267 ± 0.017
3.002AspIle: 3.002 ± 0.025
2.209AspLys: 2.209 ± 0.024
5.105AspLeu: 5.105 ± 0.036
1.204AspMet: 1.204 ± 0.018
1.806AspAsn: 1.806 ± 0.021
3.371AspPro: 3.371 ± 0.026
1.91AspGln: 1.91 ± 0.021
3.186AspArg: 3.186 ± 0.029
4.119AspSer: 4.119 ± 0.032
2.862AspThr: 2.862 ± 0.025
3.618AspVal: 3.618 ± 0.025
0.849AspTrp: 0.849 ± 0.014
1.614AspTyr: 1.614 ± 0.017
0.0AspXaa: 0.0 ± 0.0
Glu
5.27GluAla: 5.27 ± 0.043
0.626GluCys: 0.626 ± 0.012
4.049GluAsp: 4.049 ± 0.04
5.444GluGlu: 5.444 ± 0.06
1.928GluPhe: 1.928 ± 0.022
3.657GluGly: 3.657 ± 0.028
1.385GluHis: 1.385 ± 0.018
3.113GluIle: 3.113 ± 0.029
3.596GluLys: 3.596 ± 0.034
5.245GluLeu: 5.245 ± 0.045
1.434GluMet: 1.434 ± 0.02
2.292GluAsn: 2.292 ± 0.024
2.831GluPro: 2.831 ± 0.058
2.555GluGln: 2.555 ± 0.028
4.071GluArg: 4.071 ± 0.04
4.388GluSer: 4.388 ± 0.037
3.517GluThr: 3.517 ± 0.032
3.531GluVal: 3.531 ± 0.027
0.864GluTrp: 0.864 ± 0.015
1.695GluTyr: 1.695 ± 0.019
0.0GluXaa: 0.0 ± 0.0
Phe
3.075PheAla: 3.075 ± 0.028
0.588PheCys: 0.588 ± 0.012
2.227PheAsp: 2.227 ± 0.024
2.09PheGlu: 2.09 ± 0.024
1.679PhePhe: 1.679 ± 0.024
2.746PheGly: 2.746 ± 0.035
0.936PheHis: 0.936 ± 0.014
1.804PheIle: 1.804 ± 0.021
1.417PheLys: 1.417 ± 0.018
3.644PheLeu: 3.644 ± 0.028
0.772PheMet: 0.772 ± 0.013
1.384PheAsn: 1.384 ± 0.016
2.019PhePro: 2.019 ± 0.024
1.407PheGln: 1.407 ± 0.022
2.065PheArg: 2.065 ± 0.024
3.073PheSer: 3.073 ± 0.026
2.075PheThr: 2.075 ± 0.022
2.412PheVal: 2.412 ± 0.025
0.646PheTrp: 0.646 ± 0.013
1.136PheTyr: 1.136 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
5.241GlyAla: 5.241 ± 0.04
0.906GlyCys: 0.906 ± 0.015
3.504GlyAsp: 3.504 ± 0.032
3.567GlyGlu: 3.567 ± 0.029
2.733GlyPhe: 2.733 ± 0.028
5.258GlyGly: 5.258 ± 0.051
1.67GlyHis: 1.67 ± 0.02
3.474GlyIle: 3.474 ± 0.033
3.307GlyLys: 3.307 ± 0.03
6.138GlyLeu: 6.138 ± 0.042
1.533GlyMet: 1.533 ± 0.018
2.4GlyAsn: 2.4 ± 0.025
3.341GlyPro: 3.341 ± 0.029
2.606GlyGln: 2.606 ± 0.027
4.151GlyArg: 4.151 ± 0.031
5.621GlySer: 5.621 ± 0.041
3.887GlyThr: 3.887 ± 0.03
4.365GlyVal: 4.365 ± 0.035
1.161GlyTrp: 1.161 ± 0.018
2.127GlyTyr: 2.127 ± 0.025
0.0GlyXaa: 0.0 ± 0.0
His
1.872HisAla: 1.872 ± 0.02
0.342HisCys: 0.342 ± 0.008
1.308HisAsp: 1.308 ± 0.015
1.294HisGlu: 1.294 ± 0.018
0.931HisPhe: 0.931 ± 0.014
1.74HisGly: 1.74 ± 0.022
0.867HisHis: 0.867 ± 0.018
1.21HisIle: 1.21 ± 0.017
0.855HisLys: 0.855 ± 0.013
2.347HisLeu: 2.347 ± 0.025
0.482HisMet: 0.482 ± 0.01
0.836HisAsn: 0.836 ± 0.015
1.738HisPro: 1.738 ± 0.018
1.01HisGln: 1.01 ± 0.016
1.66HisArg: 1.66 ± 0.021
1.936HisSer: 1.936 ± 0.023
1.28HisThr: 1.28 ± 0.018
1.403HisVal: 1.403 ± 0.016
0.349HisTrp: 0.349 ± 0.009
0.716HisTyr: 0.716 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
4.228IleAla: 4.228 ± 0.032
0.787IleCys: 0.787 ± 0.013
2.825IleAsp: 2.825 ± 0.028
2.836IleGlu: 2.836 ± 0.028
1.995IlePhe: 1.995 ± 0.021
3.127IleGly: 3.127 ± 0.028
1.229IleHis: 1.229 ± 0.014
2.54IleIle: 2.54 ± 0.03
2.063IleLys: 2.063 ± 0.025
4.766IleLeu: 4.766 ± 0.034
0.967IleMet: 0.967 ± 0.015
1.765IleAsn: 1.765 ± 0.02
3.157IlePro: 3.157 ± 0.026
1.908IleGln: 1.908 ± 0.022
2.945IleArg: 2.945 ± 0.024
3.893IleSer: 3.893 ± 0.03
2.759IleThr: 2.759 ± 0.025
3.219IleVal: 3.219 ± 0.03
0.719IleTrp: 0.719 ± 0.013
1.477IleTyr: 1.477 ± 0.019
0.0IleXaa: 0.0 ± 0.0
Lys
4.055LysAla: 4.055 ± 0.037
0.498LysCys: 0.498 ± 0.011
2.645LysAsp: 2.645 ± 0.027
3.303LysGlu: 3.303 ± 0.034
1.38LysPhe: 1.38 ± 0.019
2.851LysGly: 2.851 ± 0.027
1.101LysHis: 1.101 ± 0.016
2.19LysIle: 2.19 ± 0.024
3.1LysLys: 3.1 ± 0.04
3.969LysLeu: 3.969 ± 0.032
0.934LysMet: 0.934 ± 0.014
1.679LysAsn: 1.679 ± 0.019
2.615LysPro: 2.615 ± 0.031
1.837LysGln: 1.837 ± 0.021
3.395LysArg: 3.395 ± 0.031
3.339LysSer: 3.339 ± 0.032
2.633LysThr: 2.633 ± 0.026
2.746LysVal: 2.746 ± 0.027
0.627LysTrp: 0.627 ± 0.012
1.383LysTyr: 1.383 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
8.052LeuAla: 8.052 ± 0.045
1.293LeuCys: 1.293 ± 0.021
5.18LeuAsp: 5.18 ± 0.034
5.556LeuGlu: 5.556 ± 0.049
3.399LeuPhe: 3.399 ± 0.032
5.958LeuGly: 5.958 ± 0.037
2.327LeuHis: 2.327 ± 0.024
4.093LeuIle: 4.093 ± 0.037
4.116LeuLys: 4.116 ± 0.038
8.786LeuLeu: 8.786 ± 0.07
1.814LeuMet: 1.814 ± 0.019
3.213LeuAsn: 3.213 ± 0.025
5.586LeuPro: 5.586 ± 0.041
3.956LeuGln: 3.956 ± 0.038
6.12LeuArg: 6.12 ± 0.042
7.771LeuSer: 7.771 ± 0.046
4.944LeuThr: 4.944 ± 0.04
5.574LeuVal: 5.574 ± 0.037
1.228LeuTrp: 1.228 ± 0.017
2.455LeuTyr: 2.455 ± 0.027
0.0LeuXaa: 0.0 ± 0.0
Met
2.137MetAla: 2.137 ± 0.02
0.244MetCys: 0.244 ± 0.007
1.205MetAsp: 1.205 ± 0.016
1.284MetGlu: 1.284 ± 0.018
0.719MetPhe: 0.719 ± 0.014
1.385MetGly: 1.385 ± 0.018
0.462MetHis: 0.462 ± 0.01
1.022MetIle: 1.022 ± 0.016
0.98MetLys: 0.98 ± 0.014
1.863MetLeu: 1.863 ± 0.024
0.562MetMet: 0.562 ± 0.012
0.787MetAsn: 0.787 ± 0.015
1.174MetPro: 1.174 ± 0.019
0.864MetGln: 0.864 ± 0.014
1.285MetArg: 1.285 ± 0.017
1.806MetSer: 1.806 ± 0.021
1.3MetThr: 1.3 ± 0.017
1.328MetVal: 1.328 ± 0.016
0.254MetTrp: 0.254 ± 0.007
0.526MetTyr: 0.526 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
3.154AsnAla: 3.154 ± 0.032
0.467AsnCys: 0.467 ± 0.011
1.893AsnAsp: 1.893 ± 0.019
2.0AsnGlu: 2.0 ± 0.019
1.334AsnPhe: 1.334 ± 0.018
2.91AsnGly: 2.91 ± 0.023
0.867AsnHis: 0.867 ± 0.014
1.971AsnIle: 1.971 ± 0.024
1.452AsnLys: 1.452 ± 0.021
3.236AsnLeu: 3.236 ± 0.027
0.794AsnMet: 0.794 ± 0.013
1.395AsnAsn: 1.395 ± 0.019
2.439AsnPro: 2.439 ± 0.023
1.341AsnGln: 1.341 ± 0.018
2.009AsnArg: 2.009 ± 0.02
2.654AsnSer: 2.654 ± 0.027
2.055AsnThr: 2.055 ± 0.024
2.3AsnVal: 2.3 ± 0.026
0.546AsnTrp: 0.546 ± 0.01
1.046AsnTyr: 1.046 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
5.244ProAla: 5.244 ± 0.045
0.59ProCys: 0.59 ± 0.012
3.206ProAsp: 3.206 ± 0.028
3.842ProGlu: 3.842 ± 0.037
2.109ProPhe: 2.109 ± 0.025
3.889ProGly: 3.889 ± 0.035
1.371ProHis: 1.371 ± 0.02
2.487ProIle: 2.487 ± 0.026
2.484ProLys: 2.484 ± 0.026
4.848ProLeu: 4.848 ± 0.036
1.016ProMet: 1.016 ± 0.019
2.119ProAsn: 2.119 ± 0.026
4.902ProPro: 4.902 ± 0.063
2.43ProGln: 2.43 ± 0.03
3.516ProArg: 3.516 ± 0.032
6.34ProSer: 6.34 ± 0.056
3.858ProThr: 3.858 ± 0.034
3.717ProVal: 3.717 ± 0.038
0.771ProTrp: 0.771 ± 0.012
1.54ProTyr: 1.54 ± 0.019
0.0ProXaa: 0.0 ± 0.0
Gln
3.471GlnAla: 3.471 ± 0.034
0.479GlnCys: 0.479 ± 0.01
1.968GlnAsp: 1.968 ± 0.021
2.455GlnGlu: 2.455 ± 0.024
1.322GlnPhe: 1.322 ± 0.019
2.423GlnGly: 2.423 ± 0.028
1.057GlnHis: 1.057 ± 0.018
1.937GlnIle: 1.937 ± 0.021
1.995GlnLys: 1.995 ± 0.026
3.582GlnLeu: 3.582 ± 0.029
0.903GlnMet: 0.903 ± 0.014
1.546GlnAsn: 1.546 ± 0.019
2.556GlnPro: 2.556 ± 0.031
2.329GlnGln: 2.329 ± 0.042
2.79GlnArg: 2.79 ± 0.026
3.36GlnSer: 3.36 ± 0.032
2.373GlnThr: 2.373 ± 0.023
2.24GlnVal: 2.24 ± 0.023
0.588GlnTrp: 0.588 ± 0.011
1.139GlnTyr: 1.139 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
4.866ArgAla: 4.866 ± 0.034
0.771ArgCys: 0.771 ± 0.016
3.379ArgAsp: 3.379 ± 0.033
4.022ArgGlu: 4.022 ± 0.043
2.279ArgPhe: 2.279 ± 0.023
3.688ArgGly: 3.688 ± 0.039
1.631ArgHis: 1.631 ± 0.022
3.038ArgIle: 3.038 ± 0.027
3.514ArgLys: 3.514 ± 0.032
5.877ArgLeu: 5.877 ± 0.041
1.365ArgMet: 1.365 ± 0.017
2.297ArgAsn: 2.297 ± 0.023
3.604ArgPro: 3.604 ± 0.037
2.768ArgGln: 2.768 ± 0.027
5.384ArgArg: 5.384 ± 0.049
5.117ArgSer: 5.117 ± 0.038
3.451ArgThr: 3.451 ± 0.027
3.564ArgVal: 3.564 ± 0.029
0.969ArgTrp: 0.969 ± 0.013
1.741ArgTyr: 1.741 ± 0.018
0.0ArgXaa: 0.0 ± 0.0
Ser
6.929SerAla: 6.929 ± 0.051
0.948SerCys: 0.948 ± 0.015
4.252SerAsp: 4.252 ± 0.035
4.237SerGlu: 4.237 ± 0.036
3.082SerPhe: 3.082 ± 0.027
5.576SerGly: 5.576 ± 0.035
2.034SerHis: 2.034 ± 0.024
4.064SerIle: 4.064 ± 0.035
3.643SerLys: 3.643 ± 0.03
7.623SerLeu: 7.623 ± 0.048
1.698SerMet: 1.698 ± 0.021
2.983SerAsn: 2.983 ± 0.031
5.578SerPro: 5.578 ± 0.052
3.493SerGln: 3.493 ± 0.032
5.316SerArg: 5.316 ± 0.039
9.086SerSer: 9.086 ± 0.08
5.636SerThr: 5.636 ± 0.048
4.845SerVal: 4.845 ± 0.032
1.127SerTrp: 1.127 ± 0.017
2.078SerTyr: 2.078 ± 0.022
0.0SerXaa: 0.0 ± 0.0
Thr
5.257ThrAla: 5.257 ± 0.038
0.768ThrCys: 0.768 ± 0.013
2.886ThrAsp: 2.886 ± 0.026
3.129ThrGlu: 3.129 ± 0.029
2.2ThrPhe: 2.2 ± 0.023
4.196ThrGly: 4.196 ± 0.036
1.276ThrHis: 1.276 ± 0.016
3.042ThrIle: 3.042 ± 0.028
2.455ThrLys: 2.455 ± 0.022
5.241ThrLeu: 5.241 ± 0.038
1.189ThrMet: 1.189 ± 0.015
2.025ThrAsn: 2.025 ± 0.025
4.195ThrPro: 4.195 ± 0.038
2.065ThrGln: 2.065 ± 0.02
3.124ThrArg: 3.124 ± 0.027
5.294ThrSer: 5.294 ± 0.042
4.114ThrThr: 4.114 ± 0.056
3.96ThrVal: 3.96 ± 0.03
0.842ThrTrp: 0.842 ± 0.015
1.606ThrTyr: 1.606 ± 0.02
0.0ThrXaa: 0.0 ± 0.0
Val
5.296ValAla: 5.296 ± 0.038
0.895ValCys: 0.895 ± 0.015
3.727ValAsp: 3.727 ± 0.03
3.855ValGlu: 3.855 ± 0.032
2.496ValPhe: 2.496 ± 0.024
3.979ValGly: 3.979 ± 0.032
1.442ValHis: 1.442 ± 0.017
3.068ValIle: 3.068 ± 0.028
2.787ValLys: 2.787 ± 0.026
5.74ValLeu: 5.74 ± 0.04
1.32ValMet: 1.32 ± 0.016
2.231ValAsn: 2.231 ± 0.021
3.663ValPro: 3.663 ± 0.031
2.451ValGln: 2.451 ± 0.021
3.636ValArg: 3.636 ± 0.029
4.929ValSer: 4.929 ± 0.034
3.622ValThr: 3.622 ± 0.037
4.257ValVal: 4.257 ± 0.036
0.863ValTrp: 0.863 ± 0.014
1.796ValTyr: 1.796 ± 0.019
0.0ValXaa: 0.0 ± 0.0
Trp
1.115TrpAla: 1.115 ± 0.016
0.187TrpCys: 0.187 ± 0.006
0.881TrpAsp: 0.881 ± 0.016
0.822TrpGlu: 0.822 ± 0.013
0.537TrpPhe: 0.537 ± 0.012
0.916TrpGly: 0.916 ± 0.016
0.35TrpHis: 0.35 ± 0.009
0.758TrpIle: 0.758 ± 0.012
0.791TrpLys: 0.791 ± 0.014
1.38TrpLeu: 1.38 ± 0.018
0.373TrpMet: 0.373 ± 0.008
0.621TrpAsn: 0.621 ± 0.012
0.593TrpPro: 0.593 ± 0.011
0.569TrpGln: 0.569 ± 0.012
0.996TrpArg: 0.996 ± 0.017
1.057TrpSer: 1.057 ± 0.015
0.945TrpThr: 0.945 ± 0.015
0.881TrpVal: 0.881 ± 0.014
0.272TrpTrp: 0.272 ± 0.009
0.433TrpTyr: 0.433 ± 0.009
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.21TyrAla: 2.21 ± 0.021
0.427TyrCys: 0.427 ± 0.011
1.591TyrAsp: 1.591 ± 0.019
1.561TyrGlu: 1.561 ± 0.018
1.195TyrPhe: 1.195 ± 0.017
2.056TyrGly: 2.056 ± 0.025
0.762TyrHis: 0.762 ± 0.013
1.467TyrIle: 1.467 ± 0.017
1.077TyrLys: 1.077 ± 0.016
2.773TyrLeu: 2.773 ± 0.028
0.609TyrMet: 0.609 ± 0.013
1.098TyrAsn: 1.098 ± 0.016
1.559TyrPro: 1.559 ± 0.021
1.107TyrGln: 1.107 ± 0.014
1.712TyrArg: 1.712 ± 0.021
2.085TyrSer: 2.085 ± 0.019
1.613TyrThr: 1.613 ± 0.021
1.701TyrVal: 1.701 ± 0.021
0.441TyrTrp: 0.441 ± 0.008
0.961TyrTyr: 0.961 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9647 proteins (4742919 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski