Amino acid dipeptide frequency for Aspergillus coremiiformis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
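As an illustration, a dipeptide frequency can be computed by counting adjacent residue pairs. The sketch below is a minimal example, not the database's own pipeline. It assumes that pairs overlap (each residue pairs with its successor), that pairs never span two proteins, and that any residue outside the 20 standard one-letter codes is mapped to Xaa. The function name dipeptide_permille and the toy sequences are hypothetical.

```python
from collections import Counter

# One-letter to three-letter codes, in the same order as the table columns.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything that is not a standard residue becomes Xaa.
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration only (not taken from the proteome).
toy = ["MAAL", "AALS"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Because every frequency is divided by the same total, the values over all observed dipeptides sum to 1000‰.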

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
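The expected value and the over/under-representation test can be written out as a short sketch. It assumes the comparison is made against the uniform expectation of 1/400 described above. The exact rule the site uses for colouring cells is not stated beyond that, so the classify helper is hypothetical. The three example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / (20 * 20) = 0.25% = 2.5 per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values in per mille, copied from the table.
examples = {"AlaAla": 7.804, "CysCys": 0.25, "SerSer": 9.108}
for pair, value in examples.items():
    ratio = value / expected_permille
    print(f"{pair}: {value} permille, {ratio:.2f}x expected, {classify(value)}")
```

The ratio of observed to expected gives a quick sense of how strongly a dipeptide deviates, which a two-colour marking alone does not show.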

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
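A small sketch of the unit conversion, using the AlaAla value from the table. The helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 7.804  # AlaAla, in per mille, from the table
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)
print(f"{ala_ala} permille = {fraction:.6f} = {percent:.4f}%")
```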

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.804 ± 0.065
AlaCys: 1.019 ± 0.017
AlaAsp: 3.983 ± 0.031
AlaGlu: 4.865 ± 0.047
AlaPhe: 3.049 ± 0.027
AlaGly: 5.3 ± 0.041
AlaHis: 1.758 ± 0.021
AlaIle: 4.154 ± 0.036
AlaLys: 3.63 ± 0.036
AlaLeu: 7.526 ± 0.046
AlaMet: 1.878 ± 0.022
AlaAsn: 2.789 ± 0.027
AlaPro: 4.409 ± 0.044
AlaGln: 3.255 ± 0.029
AlaArg: 4.811 ± 0.036
AlaSer: 6.913 ± 0.05
AlaThr: 5.057 ± 0.039
AlaVal: 5.251 ± 0.04
AlaTrp: 1.072 ± 0.018
AlaTyr: 2.086 ± 0.024
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.926 ± 0.016
CysCys: 0.25 ± 0.009
CysAsp: 0.679 ± 0.013
CysGlu: 0.612 ± 0.012
CysPhe: 0.551 ± 0.011
CysGly: 0.925 ± 0.019
CysHis: 0.36 ± 0.01
CysIle: 0.736 ± 0.014
CysLys: 0.505 ± 0.012
CysLeu: 1.329 ± 0.019
CysMet: 0.283 ± 0.008
CysAsn: 0.448 ± 0.011
CysPro: 0.675 ± 0.016
CysGln: 0.481 ± 0.011
CysArg: 0.82 ± 0.015
CysSer: 0.974 ± 0.017
CysThr: 0.696 ± 0.013
CysVal: 0.799 ± 0.013
CysTrp: 0.208 ± 0.007
CysTyr: 0.37 ± 0.01
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.357 ± 0.038
AspCys: 0.614 ± 0.011
AspAsp: 3.937 ± 0.041
AspGlu: 4.261 ± 0.039
AspPhe: 2.179 ± 0.022
AspGly: 3.856 ± 0.031
AspHis: 1.322 ± 0.017
AspIle: 3.185 ± 0.027
AspLys: 2.215 ± 0.025
AspLeu: 5.179 ± 0.036
AspMet: 1.24 ± 0.016
AspAsn: 1.888 ± 0.022
AspPro: 3.387 ± 0.028
AspGln: 1.98 ± 0.02
AspArg: 3.231 ± 0.035
AspSer: 4.213 ± 0.035
AspThr: 2.96 ± 0.029
AspVal: 3.597 ± 0.03
AspTrp: 0.838 ± 0.014
AspTyr: 1.656 ± 0.022
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.893 ± 0.042
GluCys: 0.629 ± 0.013
GluAsp: 4.1 ± 0.044
GluGlu: 5.38 ± 0.059
GluPhe: 1.98 ± 0.023
GluGly: 3.613 ± 0.031
GluHis: 1.406 ± 0.018
GluIle: 3.12 ± 0.029
GluLys: 3.666 ± 0.04
GluLeu: 5.16 ± 0.043
GluMet: 1.438 ± 0.021
GluAsn: 2.45 ± 0.026
GluPro: 2.805 ± 0.044
GluGln: 2.534 ± 0.026
GluArg: 4.011 ± 0.039
GluSer: 4.481 ± 0.04
GluThr: 3.558 ± 0.031
GluVal: 3.507 ± 0.033
GluTrp: 0.877 ± 0.014
GluTyr: 1.708 ± 0.019
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.968 ± 0.031
PheCys: 0.575 ± 0.011
PheAsp: 2.25 ± 0.022
PheGlu: 2.079 ± 0.025
PhePhe: 1.725 ± 0.025
PheGly: 2.796 ± 0.031
PheHis: 1.022 ± 0.016
PheIle: 1.852 ± 0.02
PheLys: 1.434 ± 0.018
PheLeu: 3.784 ± 0.037
PheMet: 0.782 ± 0.014
PheAsn: 1.435 ± 0.019
PhePro: 2.09 ± 0.019
PheGln: 1.459 ± 0.021
PheArg: 2.098 ± 0.023
PheSer: 3.147 ± 0.032
PheThr: 2.22 ± 0.025
PheVal: 2.366 ± 0.023
PheTrp: 0.637 ± 0.014
PheTyr: 1.162 ± 0.018
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.866 ± 0.048
GlyCys: 0.891 ± 0.019
GlyAsp: 3.539 ± 0.028
GlyGlu: 3.533 ± 0.03
GlyPhe: 2.788 ± 0.027
GlyGly: 5.204 ± 0.049
GlyHis: 1.67 ± 0.02
GlyIle: 3.529 ± 0.029
GlyLys: 3.272 ± 0.029
GlyLeu: 6.06 ± 0.039
GlyMet: 1.506 ± 0.018
GlyAsn: 2.526 ± 0.026
GlyPro: 3.34 ± 0.032
GlyGln: 2.535 ± 0.025
GlyArg: 4.077 ± 0.035
GlySer: 5.692 ± 0.043
GlyThr: 3.856 ± 0.032
GlyVal: 4.338 ± 0.036
GlyTrp: 1.131 ± 0.019
GlyTyr: 2.126 ± 0.023
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.842 ± 0.022
HisCys: 0.356 ± 0.009
HisAsp: 1.339 ± 0.017
HisGlu: 1.329 ± 0.018
HisPhe: 0.96 ± 0.017
HisGly: 1.744 ± 0.021
HisHis: 0.88 ± 0.018
HisIle: 1.296 ± 0.017
HisLys: 0.868 ± 0.014
HisLeu: 2.367 ± 0.026
HisMet: 0.509 ± 0.011
HisAsn: 0.859 ± 0.015
HisPro: 1.811 ± 0.021
HisGln: 1.042 ± 0.016
HisArg: 1.72 ± 0.023
HisSer: 2.021 ± 0.022
HisThr: 1.347 ± 0.019
HisVal: 1.456 ± 0.019
HisTrp: 0.36 ± 0.008
HisTyr: 0.709 ± 0.012
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.117 ± 0.031
IleCys: 0.782 ± 0.016
IleAsp: 2.846 ± 0.026
IleGlu: 2.821 ± 0.027
IlePhe: 2.061 ± 0.024
IleGly: 3.148 ± 0.031
IleHis: 1.314 ± 0.017
IleIle: 2.561 ± 0.028
IleLys: 2.068 ± 0.025
IleLeu: 4.838 ± 0.035
IleMet: 1.029 ± 0.015
IleAsn: 1.786 ± 0.024
IlePro: 3.342 ± 0.029
IleGln: 2.019 ± 0.021
IleArg: 2.972 ± 0.028
IleSer: 4.031 ± 0.03
IleThr: 2.867 ± 0.029
IleVal: 3.24 ± 0.032
IleTrp: 0.714 ± 0.014
IleTyr: 1.502 ± 0.021
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.876 ± 0.035
LysCys: 0.503 ± 0.011
LysAsp: 2.702 ± 0.024
LysGlu: 3.303 ± 0.032
LysPhe: 1.402 ± 0.019
LysGly: 2.861 ± 0.03
LysHis: 1.081 ± 0.016
LysIle: 2.25 ± 0.024
LysLys: 3.157 ± 0.047
LysLeu: 3.983 ± 0.036
LysMet: 0.97 ± 0.015
LysAsn: 1.771 ± 0.02
LysPro: 2.63 ± 0.029
LysGln: 1.87 ± 0.025
LysArg: 3.429 ± 0.036
LysSer: 3.47 ± 0.034
LysThr: 2.71 ± 0.029
LysVal: 2.707 ± 0.027
LysTrp: 0.617 ± 0.011
LysTyr: 1.383 ± 0.018
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.541 ± 0.045
LeuCys: 1.263 ± 0.017
LeuAsp: 5.193 ± 0.034
LeuGlu: 5.596 ± 0.044
LeuPhe: 3.538 ± 0.033
LeuGly: 5.921 ± 0.039
LeuHis: 2.365 ± 0.023
LeuIle: 4.11 ± 0.038
LeuLys: 4.186 ± 0.034
LeuLeu: 8.665 ± 0.056
LeuMet: 1.805 ± 0.02
LeuAsn: 3.296 ± 0.029
LeuPro: 5.585 ± 0.037
LeuGln: 3.998 ± 0.036
LeuArg: 6.029 ± 0.047
LeuSer: 7.913 ± 0.05
LeuThr: 4.951 ± 0.037
LeuVal: 5.564 ± 0.044
LeuTrp: 1.208 ± 0.023
LeuTyr: 2.505 ± 0.024
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.081 ± 0.02
MetCys: 0.251 ± 0.007
MetAsp: 1.262 ± 0.019
MetGlu: 1.347 ± 0.02
MetPhe: 0.731 ± 0.013
MetGly: 1.446 ± 0.021
MetHis: 0.474 ± 0.012
MetIle: 1.029 ± 0.017
MetLys: 1.014 ± 0.016
MetLeu: 1.834 ± 0.023
MetMet: 0.541 ± 0.013
MetAsn: 0.846 ± 0.014
MetPro: 1.189 ± 0.017
MetGln: 0.855 ± 0.015
MetArg: 1.241 ± 0.02
MetSer: 1.88 ± 0.022
MetThr: 1.297 ± 0.019
MetVal: 1.357 ± 0.02
MetTrp: 0.247 ± 0.008
MetTyr: 0.529 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.043 ± 0.025
AsnCys: 0.469 ± 0.012
AsnAsp: 1.975 ± 0.021
AsnGlu: 2.1 ± 0.024
AsnPhe: 1.354 ± 0.019
AsnGly: 2.875 ± 0.034
AsnHis: 0.916 ± 0.016
AsnIle: 2.069 ± 0.023
AsnLys: 1.518 ± 0.019
AsnLeu: 3.336 ± 0.028
AsnMet: 0.826 ± 0.015
AsnAsn: 1.458 ± 0.023
AsnPro: 2.616 ± 0.029
AsnGln: 1.44 ± 0.019
AsnArg: 2.104 ± 0.022
AsnSer: 2.79 ± 0.027
AsnThr: 2.187 ± 0.021
AsnVal: 2.374 ± 0.023
AsnTrp: 0.552 ± 0.01
AsnTyr: 1.057 ± 0.018
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.779 ± 0.041
ProCys: 0.577 ± 0.013
ProAsp: 3.225 ± 0.032
ProGlu: 3.886 ± 0.044
ProPhe: 2.196 ± 0.022
ProGly: 3.95 ± 0.035
ProHis: 1.383 ± 0.018
ProIle: 2.601 ± 0.026
ProLys: 2.604 ± 0.029
ProLeu: 4.849 ± 0.036
ProMet: 1.122 ± 0.017
ProAsn: 2.225 ± 0.023
ProPro: 5.07 ± 0.067
ProGln: 2.505 ± 0.034
ProArg: 3.591 ± 0.036
ProSer: 6.368 ± 0.059
ProThr: 4.059 ± 0.035
ProVal: 3.76 ± 0.033
ProTrp: 0.768 ± 0.012
ProTyr: 1.563 ± 0.018
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.242 ± 0.03
GlnCys: 0.488 ± 0.012
GlnAsp: 2.085 ± 0.024
GlnGlu: 2.504 ± 0.026
GlnPhe: 1.341 ± 0.017
GlnGly: 2.42 ± 0.023
GlnHis: 1.072 ± 0.016
GlnIle: 1.987 ± 0.026
GlnLys: 2.038 ± 0.023
GlnLeu: 3.61 ± 0.028
GlnMet: 0.891 ± 0.014
GlnAsn: 1.639 ± 0.021
GlnPro: 2.659 ± 0.037
GlnGln: 2.335 ± 0.051
GlnArg: 2.766 ± 0.024
GlnSer: 3.362 ± 0.036
GlnThr: 2.437 ± 0.029
GlnVal: 2.247 ± 0.026
GlnTrp: 0.588 ± 0.011
GlnTyr: 1.186 ± 0.017
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.572 ± 0.032
ArgCys: 0.791 ± 0.015
ArgAsp: 3.408 ± 0.036
ArgGlu: 3.943 ± 0.037
ArgPhe: 2.277 ± 0.023
ArgGly: 3.758 ± 0.035
ArgHis: 1.639 ± 0.02
ArgIle: 3.012 ± 0.026
ArgLys: 3.544 ± 0.031
ArgLeu: 5.741 ± 0.039
ArgMet: 1.354 ± 0.02
ArgAsn: 2.373 ± 0.021
ArgPro: 3.653 ± 0.043
ArgGln: 2.735 ± 0.025
ArgArg: 5.34 ± 0.044
ArgSer: 5.204 ± 0.045
ArgThr: 3.416 ± 0.033
ArgVal: 3.561 ± 0.028
ArgTrp: 0.951 ± 0.014
ArgTyr: 1.779 ± 0.022
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.493 ± 0.044
SerCys: 0.938 ± 0.017
SerAsp: 4.349 ± 0.039
SerGlu: 4.274 ± 0.035
SerPhe: 3.22 ± 0.032
SerGly: 5.58 ± 0.038
SerHis: 2.167 ± 0.023
SerIle: 4.142 ± 0.033
SerLys: 3.707 ± 0.03
SerLeu: 7.714 ± 0.045
SerMet: 1.749 ± 0.018
SerAsn: 3.12 ± 0.037
SerPro: 5.684 ± 0.057
SerGln: 3.513 ± 0.036
SerArg: 5.345 ± 0.047
SerSer: 9.108 ± 0.077
SerThr: 5.712 ± 0.042
SerVal: 4.826 ± 0.039
SerTrp: 1.143 ± 0.016
SerTyr: 2.144 ± 0.025
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.024 ± 0.039
ThrCys: 0.767 ± 0.016
ThrAsp: 2.927 ± 0.024
ThrGlu: 3.222 ± 0.032
ThrPhe: 2.26 ± 0.022
ThrGly: 4.194 ± 0.031
ThrHis: 1.349 ± 0.016
ThrIle: 3.083 ± 0.031
ThrLys: 2.54 ± 0.025
ThrLeu: 5.378 ± 0.039
ThrMet: 1.219 ± 0.016
ThrAsn: 2.092 ± 0.024
ThrPro: 4.33 ± 0.045
ThrGln: 2.149 ± 0.025
ThrArg: 3.191 ± 0.029
ThrSer: 5.286 ± 0.045
ThrThr: 4.073 ± 0.04
ThrVal: 3.95 ± 0.038
ThrTrp: 0.836 ± 0.016
ThrTyr: 1.614 ± 0.021
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.006 ± 0.035
ValCys: 0.86 ± 0.014
ValAsp: 3.78 ± 0.034
ValGlu: 3.853 ± 0.041
ValPhe: 2.52 ± 0.026
ValGly: 3.943 ± 0.035
ValHis: 1.476 ± 0.019
ValIle: 3.05 ± 0.028
ValLys: 2.826 ± 0.026
ValLeu: 5.732 ± 0.044
ValMet: 1.312 ± 0.019
ValAsn: 2.29 ± 0.024
ValPro: 3.643 ± 0.033
ValGln: 2.468 ± 0.024
ValArg: 3.591 ± 0.03
ValSer: 4.901 ± 0.041
ValThr: 3.538 ± 0.031
ValVal: 4.218 ± 0.041
ValTrp: 0.858 ± 0.015
ValTyr: 1.809 ± 0.02
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.07 ± 0.016
TrpCys: 0.186 ± 0.007
TrpAsp: 0.903 ± 0.015
TrpGlu: 0.851 ± 0.014
TrpPhe: 0.53 ± 0.011
TrpGly: 0.89 ± 0.016
TrpHis: 0.353 ± 0.008
TrpIle: 0.775 ± 0.014
TrpLys: 0.804 ± 0.015
TrpLeu: 1.35 ± 0.02
TrpMet: 0.378 ± 0.009
TrpAsn: 0.644 ± 0.014
TrpPro: 0.573 ± 0.012
TrpGln: 0.555 ± 0.01
TrpArg: 0.947 ± 0.017
TrpSer: 1.029 ± 0.017
TrpThr: 0.901 ± 0.017
TrpVal: 0.874 ± 0.014
TrpTrp: 0.28 ± 0.009
TrpTyr: 0.419 ± 0.01
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.112 ± 0.025
TyrCys: 0.418 ± 0.01
TyrAsp: 1.612 ± 0.024
TyrGlu: 1.557 ± 0.02
TyrPhe: 1.22 ± 0.018
TyrGly: 2.053 ± 0.025
TyrHis: 0.807 ± 0.017
TyrIle: 1.516 ± 0.019
TyrLys: 1.08 ± 0.018
TyrLeu: 2.798 ± 0.027
TyrMet: 0.621 ± 0.012
TyrAsn: 1.127 ± 0.017
TyrPro: 1.59 ± 0.021
TyrGln: 1.166 ± 0.017
TyrArg: 1.735 ± 0.021
TyrSer: 2.143 ± 0.026
TyrThr: 1.641 ± 0.021
TyrVal: 1.683 ± 0.018
TyrTrp: 0.428 ± 0.009
TyrTyr: 0.958 ± 0.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9078 proteins (4351296 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
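A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicate estimates is reported. This is a minimal sketch under assumptions the source does not confirm, namely that the reported ± value is the standard deviation over replicates and that pairs are counted as overlapping neighbours within each protein. The function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences in one-letter code, for illustration only.
toy = ["MAALSAA", "GAAKL", "MSSSP", "AAGLL"]
mean, err = bootstrap_permille(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f} permille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.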

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski