Amino acid dipeptide frequency for Aspergillus welwitschiae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
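The arithmetic behind the 0.25% figure can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and using the uniform expectation as the red/blue threshold is an assumption that the text implies but does not state outright.

```python
def expected_permille(n_residues: int = 20) -> float:
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residues ** 2)


def classify(freq_permille: float, n_residues: int = 20) -> str:
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    expected = expected_permille(n_residues)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


print(expected_permille())   # 1000 / 400 = 2.5 per mille, i.e. 0.25%
print(classify(8.583))       # AlaAla value from the table
print(classify(0.301))       # CysCys value from the table
```

With 21 residue types (including Xaa) the same function gives 1000 / 441, roughly 2.27 per mille.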

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
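As a rough sketch of how such per mille values could be obtained from a set of protein sequences, the Python snippet below counts overlapping adjacent residue pairs. It assumes one-letter amino acid codes (the table uses three-letter codes), pairs that never span two proteins, and a mapping of every non-standard letter to X (Xaa); the function name is hypothetical and none of this is confirmed as the method used for this page.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


freqs = dipeptide_permille(["MAAL", "AAK"])
print(freqs["AA"])          # 2 of 5 dipeptides, in per mille
print(freqs["AA"] * 1e-3)   # the same value as a fraction
```

The frequencies of all observed dipeptides sum to 1000‰ by construction.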

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is the frequency in ‰ ± the estimated error.

| 1st / 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 8.583 ± 0.058 | 1.196 ± 0.016 | 4.121 ± 0.031 | 4.87 ± 0.037 | 3.134 ± 0.025 | 5.674 ± 0.034 | 1.821 ± 0.017 | 4.317 ± 0.029 | 3.59 ± 0.032 | 7.782 ± 0.036 | 1.995 ± 0.021 | 2.83 ± 0.024 | 4.475 ± 0.041 | 3.262 ± 0.026 | 4.833 ± 0.031 | 7.159 ± 0.042 | 5.219 ± 0.033 | 5.434 ± 0.034 | 1.196 ± 0.016 | 2.253 ± 0.022 | 0.0 ± 0.0 |
| Cys | 1.013 ± 0.015 | 0.301 ± 0.009 | 0.74 ± 0.012 | 0.647 ± 0.011 | 0.624 ± 0.01 | 0.976 ± 0.015 | 0.404 ± 0.009 | 0.795 ± 0.012 | 0.493 ± 0.01 | 1.503 ± 0.018 | 0.314 ± 0.008 | 0.457 ± 0.01 | 0.723 ± 0.012 | 0.522 ± 0.01 | 0.908 ± 0.015 | 1.089 ± 0.013 | 0.78 ± 0.013 | 0.897 ± 0.015 | 0.241 ± 0.006 | 0.405 ± 0.008 | 0.0 ± 0.0 |
| Asp | 4.479 ± 0.033 | 0.665 ± 0.012 | 3.84 ± 0.036 | 4.117 ± 0.035 | 2.093 ± 0.023 | 3.857 ± 0.026 | 1.29 ± 0.016 | 3.106 ± 0.026 | 2.154 ± 0.021 | 5.112 ± 0.034 | 1.26 ± 0.015 | 1.875 ± 0.021 | 3.31 ± 0.026 | 1.872 ± 0.02 | 3.124 ± 0.028 | 3.99 ± 0.03 | 3.004 ± 0.025 | 3.611 ± 0.027 | 0.903 ± 0.012 | 1.683 ± 0.019 | 0.0 ± 0.0 |
| Glu | 5.123 ± 0.038 | 0.684 ± 0.012 | 3.9 ± 0.034 | 5.259 ± 0.051 | 1.885 ± 0.017 | 3.736 ± 0.03 | 1.393 ± 0.016 | 2.959 ± 0.024 | 3.346 ± 0.032 | 5.065 ± 0.035 | 1.449 ± 0.015 | 2.188 ± 0.018 | 2.777 ± 0.04 | 2.496 ± 0.022 | 3.869 ± 0.032 | 4.268 ± 0.032 | 3.476 ± 0.027 | 3.559 ± 0.025 | 0.9 ± 0.013 | 1.738 ± 0.02 | 0.0 ± 0.0 |
| Phe | 2.964 ± 0.024 | 0.635 ± 0.011 | 2.216 ± 0.019 | 2.042 ± 0.019 | 1.732 ± 0.023 | 2.775 ± 0.025 | 0.989 ± 0.012 | 1.889 ± 0.021 | 1.32 ± 0.015 | 3.75 ± 0.029 | 0.789 ± 0.011 | 1.382 ± 0.015 | 2.088 ± 0.021 | 1.416 ± 0.014 | 2.089 ± 0.022 | 3.092 ± 0.023 | 2.196 ± 0.02 | 2.403 ± 0.024 | 0.673 ± 0.011 | 1.189 ± 0.017 | 0.0 ± 0.0 |
| Gly | 5.134 ± 0.037 | 0.98 ± 0.014 | 3.555 ± 0.026 | 3.577 ± 0.027 | 2.773 ± 0.024 | 5.355 ± 0.044 | 1.709 ± 0.018 | 3.57 ± 0.028 | 3.176 ± 0.025 | 6.227 ± 0.034 | 1.611 ± 0.019 | 2.416 ± 0.022 | 3.326 ± 0.028 | 2.509 ± 0.023 | 4.093 ± 0.026 | 5.624 ± 0.044 | 3.944 ± 0.029 | 4.526 ± 0.032 | 1.214 ± 0.015 | 2.247 ± 0.02 | 0.0 ± 0.0 |
| His | 1.916 ± 0.018 | 0.387 ± 0.009 | 1.33 ± 0.016 | 1.333 ± 0.016 | 0.947 ± 0.014 | 1.751 ± 0.02 | 0.949 ± 0.017 | 1.303 ± 0.015 | 0.842 ± 0.011 | 2.438 ± 0.021 | 0.531 ± 0.011 | 0.873 ± 0.013 | 1.837 ± 0.018 | 1.012 ± 0.013 | 1.665 ± 0.018 | 1.915 ± 0.021 | 1.383 ± 0.017 | 1.474 ± 0.017 | 0.376 ± 0.008 | 0.773 ± 0.012 | 0.0 ± 0.0 |
| Ile | 4.187 ± 0.033 | 0.866 ± 0.014 | 2.814 ± 0.022 | 2.756 ± 0.025 | 2.093 ± 0.019 | 3.229 ± 0.026 | 1.326 ± 0.016 | 2.643 ± 0.024 | 1.939 ± 0.02 | 4.875 ± 0.034 | 1.061 ± 0.015 | 1.789 ± 0.018 | 3.218 ± 0.022 | 1.955 ± 0.019 | 2.91 ± 0.024 | 3.994 ± 0.028 | 2.915 ± 0.025 | 3.247 ± 0.03 | 0.762 ± 0.013 | 1.555 ± 0.018 | 0.0 ± 0.0 |
| Lys | 3.745 ± 0.029 | 0.511 ± 0.01 | 2.472 ± 0.026 | 3.089 ± 0.03 | 1.288 ± 0.014 | 2.728 ± 0.022 | 1.054 ± 0.013 | 2.091 ± 0.02 | 2.741 ± 0.033 | 3.774 ± 0.028 | 0.921 ± 0.012 | 1.565 ± 0.015 | 2.483 ± 0.024 | 1.728 ± 0.017 | 3.128 ± 0.025 | 3.146 ± 0.028 | 2.501 ± 0.022 | 2.641 ± 0.022 | 0.631 ± 0.01 | 1.339 ± 0.016 | 0.0 ± 0.0 |
| Leu | 7.913 ± 0.04 | 1.397 ± 0.015 | 5.198 ± 0.03 | 5.488 ± 0.039 | 3.567 ± 0.028 | 6.137 ± 0.033 | 2.434 ± 0.021 | 4.155 ± 0.031 | 3.848 ± 0.027 | 9.07 ± 0.057 | 1.89 ± 0.018 | 3.172 ± 0.025 | 5.72 ± 0.035 | 3.993 ± 0.031 | 5.954 ± 0.036 | 7.811 ± 0.046 | 5.034 ± 0.034 | 5.687 ± 0.036 | 1.29 ± 0.017 | 2.62 ± 0.027 | 0.0 ± 0.0 |
| Met | 2.177 ± 0.023 | 0.267 ± 0.006 | 1.238 ± 0.015 | 1.312 ± 0.015 | 0.765 ± 0.012 | 1.507 ± 0.019 | 0.516 ± 0.01 | 1.079 ± 0.014 | 0.971 ± 0.013 | 1.925 ± 0.018 | 0.597 ± 0.01 | 0.803 ± 0.013 | 1.271 ± 0.015 | 0.893 ± 0.014 | 1.32 ± 0.016 | 1.914 ± 0.018 | 1.333 ± 0.015 | 1.385 ± 0.016 | 0.274 ± 0.006 | 0.569 ± 0.01 | 0.0 ± 0.0 |
| Asn | 2.981 ± 0.019 | 0.488 ± 0.009 | 1.857 ± 0.02 | 1.941 ± 0.02 | 1.319 ± 0.015 | 2.836 ± 0.025 | 0.871 ± 0.013 | 2.047 ± 0.021 | 1.393 ± 0.018 | 3.214 ± 0.025 | 0.848 ± 0.013 | 1.473 ± 0.02 | 2.437 ± 0.021 | 1.32 ± 0.016 | 1.974 ± 0.02 | 2.649 ± 0.028 | 2.165 ± 0.02 | 2.283 ± 0.02 | 0.574 ± 0.011 | 1.11 ± 0.016 | 0.0 ± 0.0 |
| Pro | 5.076 ± 0.048 | 0.637 ± 0.01 | 3.219 ± 0.024 | 3.835 ± 0.033 | 2.157 ± 0.021 | 3.859 ± 0.032 | 1.417 ± 0.017 | 2.592 ± 0.024 | 2.398 ± 0.024 | 4.877 ± 0.027 | 1.094 ± 0.014 | 2.168 ± 0.019 | 4.733 ± 0.064 | 2.427 ± 0.027 | 3.401 ± 0.03 | 6.147 ± 0.045 | 3.988 ± 0.031 | 3.716 ± 0.026 | 0.817 ± 0.012 | 1.596 ± 0.016 | 0.0 ± 0.0 |
| Gln | 3.359 ± 0.03 | 0.504 ± 0.01 | 2.01 ± 0.02 | 2.422 ± 0.021 | 1.319 ± 0.016 | 2.412 ± 0.02 | 1.072 ± 0.012 | 1.913 ± 0.017 | 1.89 ± 0.02 | 3.614 ± 0.029 | 0.89 ± 0.012 | 1.486 ± 0.016 | 2.577 ± 0.028 | 2.367 ± 0.041 | 2.641 ± 0.022 | 3.217 ± 0.023 | 2.337 ± 0.021 | 2.227 ± 0.02 | 0.62 ± 0.01 | 1.213 ± 0.014 | 0.0 ± 0.0 |
| Arg | 4.648 ± 0.034 | 0.847 ± 0.013 | 3.314 ± 0.026 | 3.802 ± 0.028 | 2.204 ± 0.021 | 3.707 ± 0.029 | 1.626 ± 0.018 | 2.956 ± 0.023 | 3.294 ± 0.029 | 5.81 ± 0.035 | 1.359 ± 0.014 | 2.199 ± 0.02 | 3.522 ± 0.029 | 2.679 ± 0.023 | 5.103 ± 0.038 | 4.943 ± 0.034 | 3.302 ± 0.027 | 3.558 ± 0.027 | 0.974 ± 0.012 | 1.797 ± 0.017 | 0.0 ± 0.0 |
| Ser | 6.631 ± 0.046 | 1.006 ± 0.014 | 4.265 ± 0.032 | 4.166 ± 0.03 | 3.142 ± 0.024 | 5.529 ± 0.038 | 2.077 ± 0.02 | 4.094 ± 0.03 | 3.322 ± 0.029 | 7.679 ± 0.042 | 1.775 ± 0.019 | 2.95 ± 0.023 | 5.455 ± 0.049 | 3.319 ± 0.027 | 5.103 ± 0.04 | 8.996 ± 0.078 | 5.734 ± 0.038 | 4.808 ± 0.028 | 1.212 ± 0.012 | 2.202 ± 0.022 | 0.0 ± 0.0 |
| Thr | 5.167 ± 0.033 | 0.815 ± 0.013 | 2.951 ± 0.021 | 3.165 ± 0.027 | 2.244 ± 0.024 | 4.258 ± 0.027 | 1.376 ± 0.015 | 3.154 ± 0.024 | 2.358 ± 0.02 | 5.446 ± 0.039 | 1.256 ± 0.015 | 2.094 ± 0.022 | 4.199 ± 0.034 | 2.098 ± 0.018 | 3.148 ± 0.025 | 5.351 ± 0.036 | 4.528 ± 0.046 | 3.934 ± 0.024 | 0.906 ± 0.014 | 1.752 ± 0.019 | 0.0 ± 0.0 |
| Val | 5.238 ± 0.033 | 0.951 ± 0.014 | 3.72 ± 0.028 | 3.793 ± 0.028 | 2.529 ± 0.025 | 4.083 ± 0.028 | 1.493 ± 0.017 | 3.11 ± 0.025 | 2.672 ± 0.022 | 5.821 ± 0.037 | 1.4 ± 0.015 | 2.249 ± 0.02 | 3.681 ± 0.022 | 2.464 ± 0.023 | 3.617 ± 0.024 | 4.862 ± 0.032 | 3.629 ± 0.025 | 4.442 ± 0.032 | 0.901 ± 0.012 | 1.879 ± 0.019 | 0.0 ± 0.0 |
| Trp | 1.152 ± 0.015 | 0.222 ± 0.007 | 0.909 ± 0.014 | 0.87 ± 0.01 | 0.569 ± 0.011 | 0.978 ± 0.013 | 0.371 ± 0.007 | 0.814 ± 0.011 | 0.804 ± 0.011 | 1.453 ± 0.02 | 0.4 ± 0.009 | 0.647 ± 0.011 | 0.659 ± 0.011 | 0.584 ± 0.011 | 1.006 ± 0.015 | 1.124 ± 0.016 | 0.984 ± 0.015 | 0.96 ± 0.014 | 0.298 ± 0.008 | 0.46 ± 0.009 | 0.0 ± 0.0 |
| Tyr | 2.259 ± 0.021 | 0.473 ± 0.008 | 1.679 ± 0.017 | 1.584 ± 0.016 | 1.245 ± 0.016 | 2.182 ± 0.022 | 0.847 ± 0.011 | 1.542 ± 0.018 | 1.026 ± 0.013 | 2.946 ± 0.023 | 0.678 ± 0.011 | 1.161 ± 0.018 | 1.664 ± 0.019 | 1.173 ± 0.014 | 1.757 ± 0.016 | 2.174 ± 0.024 | 1.749 ± 0.018 | 1.744 ± 0.019 | 0.499 ± 0.009 | 1.066 ± 0.016 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.001 |
Statistics based on 13680 proteins (6103230 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
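One way to read "bootstrapping (×100) at the protein level" is sketched below in Python: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed seed and the use of the sample standard deviation as the error are all assumptions made for illustration; the source does not specify these details.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (e.g. "AA") in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MAAL", "AAK", "MKLV", "GAAS"]  # toy sequences, not real data
print(pair_permille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster within particular proteins contribute to the estimated error.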

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski