Amino acid dipeptide frequency for Aspergillus avenaceus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
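The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not necessarily how Proteome-pI computes its statistics: the function name `dipeptide_frequencies`, the use of one-letter codes, the mapping of every non-standard letter to `X` (Xaa), the counting of overlapping pairs within each protein, and the toy sequences are all hypothetical choices made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_frequencies(sequences):
    """Return {dipeptide: fraction} over all overlapping pairs in the sequences.

    Any non-standard letter is mapped to 'X' (Xaa), giving 21 * 21 = 441 keys.
    """
    alphabet = STANDARD + "X"
    counts = Counter({a + b: 0 for a, b in product(alphabet, repeat=2)})
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    return {pair: (n / total if total else 0.0) for pair, n in counts.items()}


toy_proteome = ["MAAL", "ALLS"]  # hypothetical sequences, not real data
freqs = dipeptide_frequencies(toy_proteome)
uniform = 1 / len(STANDARD) ** 2  # 1/400 = 0.0025, i.e. 0.25% or 2.5 per mille
```

In the toy input the six dipeptides are MA, AA, AL, AL, LL and LS, so `freqs["AL"]` is 2/6, far above the uniform expectation held in `uniform`.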

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
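As a small worked example of the unit conversion, the sketch below turns a table value in per mille into a plain fraction and compares it with the uniform expectation of 2.5‰ (1/400). The helper name `describe` and the simple above/below comparison are assumptions made for illustration; the page does not state the exact rule used for its colouring.

```python
UNIFORM_PER_MILLE = 1000 / 400  # 2.5 per mille if all 400 dipeptides were equally likely


def describe(per_mille):
    """Convert a table value to a fraction and compare it with the uniform expectation."""
    fraction = per_mille * 1e-3
    ratio = per_mille / UNIFORM_PER_MILLE
    if ratio > 1:
        label = "over-represented"
    elif ratio < 1:
        label = "under-represented"
    else:
        label = "as expected"
    return fraction, ratio, label


# Two values from the table: AlaAla = 7.808 and CysCys = 0.269 (both in per mille).
print(describe(7.808))
print(describe(0.269))
```

Under this comparison AlaAla comes out roughly three times more frequent than the uniform expectation, while CysCys is about ten times rarer.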

For more information, see the sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.808AlaAla: 7.808 ± 0.049
1.133AlaCys: 1.133 ± 0.014
4.048AlaAsp: 4.048 ± 0.031
4.738AlaGlu: 4.738 ± 0.035
3.182AlaPhe: 3.182 ± 0.025
5.407AlaGly: 5.407 ± 0.038
1.748AlaHis: 1.748 ± 0.019
4.273AlaIle: 4.273 ± 0.034
3.642AlaLys: 3.642 ± 0.027
7.695AlaLeu: 7.695 ± 0.043
1.968AlaMet: 1.968 ± 0.019
2.841AlaAsn: 2.841 ± 0.022
4.39AlaPro: 4.39 ± 0.037
3.274AlaGln: 3.274 ± 0.032
4.678AlaArg: 4.678 ± 0.03
6.85AlaSer: 6.85 ± 0.046
5.072AlaThr: 5.072 ± 0.032
5.396AlaVal: 5.396 ± 0.035
1.156AlaTrp: 1.156 ± 0.015
2.228AlaTyr: 2.228 ± 0.022
0.0AlaXaa: 0.0 ± 0.0
Cys
1.001CysAla: 1.001 ± 0.015
0.269CysCys: 0.269 ± 0.008
0.729CysAsp: 0.729 ± 0.011
0.651CysGlu: 0.651 ± 0.011
0.601CysPhe: 0.601 ± 0.012
0.97CysGly: 0.97 ± 0.016
0.382CysHis: 0.382 ± 0.008
0.78CysIle: 0.78 ± 0.012
0.512CysLys: 0.512 ± 0.011
1.43CysLeu: 1.43 ± 0.019
0.313CysMet: 0.313 ± 0.008
0.463CysAsn: 0.463 ± 0.01
0.695CysPro: 0.695 ± 0.012
0.506CysGln: 0.506 ± 0.01
0.849CysArg: 0.849 ± 0.013
1.02CysSer: 1.02 ± 0.015
0.76CysThr: 0.76 ± 0.013
0.87CysVal: 0.87 ± 0.013
0.215CysTrp: 0.215 ± 0.007
0.4CysTyr: 0.4 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
4.486AspAla: 4.486 ± 0.031
0.663AspCys: 0.663 ± 0.012
3.944AspAsp: 3.944 ± 0.041
4.158AspGlu: 4.158 ± 0.037
2.162AspPhe: 2.162 ± 0.021
3.947AspGly: 3.947 ± 0.029
1.297AspHis: 1.297 ± 0.015
3.189AspIle: 3.189 ± 0.026
2.295AspLys: 2.295 ± 0.027
5.204AspLeu: 5.204 ± 0.038
1.304AspMet: 1.304 ± 0.017
1.948AspAsn: 1.948 ± 0.019
3.386AspPro: 3.386 ± 0.026
1.967AspGln: 1.967 ± 0.02
3.173AspArg: 3.173 ± 0.027
4.191AspSer: 4.191 ± 0.039
3.072AspThr: 3.072 ± 0.023
3.687AspVal: 3.687 ± 0.028
0.883AspTrp: 0.883 ± 0.013
1.677AspTyr: 1.677 ± 0.017
0.0AspXaa: 0.0 ± 0.0
Glu
4.956GluAla: 4.956 ± 0.034
0.667GluCys: 0.667 ± 0.01
4.002GluAsp: 4.002 ± 0.037
5.07GluGlu: 5.07 ± 0.055
1.97GluPhe: 1.97 ± 0.019
3.646GluGly: 3.646 ± 0.027
1.421GluHis: 1.421 ± 0.016
3.079GluIle: 3.079 ± 0.025
3.498GluLys: 3.498 ± 0.033
5.08GluLeu: 5.08 ± 0.037
1.415GluMet: 1.415 ± 0.016
2.337GluAsn: 2.337 ± 0.023
2.763GluPro: 2.763 ± 0.035
2.49GluGln: 2.49 ± 0.025
3.853GluArg: 3.853 ± 0.03
4.439GluSer: 4.439 ± 0.036
3.508GluThr: 3.508 ± 0.028
3.493GluVal: 3.493 ± 0.028
0.903GluTrp: 0.903 ± 0.014
1.766GluTyr: 1.766 ± 0.018
0.0GluXaa: 0.0 ± 0.0
Phe
3.032PheAla: 3.032 ± 0.03
0.618PheCys: 0.618 ± 0.011
2.28PheAsp: 2.28 ± 0.021
2.117PheGlu: 2.117 ± 0.022
1.741PhePhe: 1.741 ± 0.02
2.888PheGly: 2.888 ± 0.028
1.009PheHis: 1.009 ± 0.014
1.938PheIle: 1.938 ± 0.02
1.468PheLys: 1.468 ± 0.016
3.747PheLeu: 3.747 ± 0.028
0.839PheMet: 0.839 ± 0.012
1.468PheAsn: 1.468 ± 0.018
2.043PhePro: 2.043 ± 0.019
1.463PheGln: 1.463 ± 0.015
2.067PheArg: 2.067 ± 0.021
3.088PheSer: 3.088 ± 0.026
2.261PheThr: 2.261 ± 0.02
2.418PheVal: 2.418 ± 0.025
0.67PheTrp: 0.67 ± 0.013
1.223PheTyr: 1.223 ± 0.018
0.0PheXaa: 0.0 ± 0.0
Gly
5.052GlyAla: 5.052 ± 0.039
0.947GlyCys: 0.947 ± 0.015
3.626GlyAsp: 3.626 ± 0.026
3.525GlyGlu: 3.525 ± 0.026
2.855GlyPhe: 2.855 ± 0.027
5.221GlyGly: 5.221 ± 0.052
1.711GlyHis: 1.711 ± 0.02
3.659GlyIle: 3.659 ± 0.027
3.256GlyLys: 3.256 ± 0.028
6.242GlyLeu: 6.242 ± 0.036
1.532GlyMet: 1.532 ± 0.018
2.524GlyAsn: 2.524 ± 0.026
3.293GlyPro: 3.293 ± 0.028
2.531GlyGln: 2.531 ± 0.021
4.032GlyArg: 4.032 ± 0.034
5.707GlySer: 5.707 ± 0.039
3.936GlyThr: 3.936 ± 0.03
4.568GlyVal: 4.568 ± 0.035
1.174GlyTrp: 1.174 ± 0.016
2.274GlyTyr: 2.274 ± 0.024
0.0GlyXaa: 0.0 ± 0.0
His
1.851HisAla: 1.851 ± 0.021
0.371HisCys: 0.371 ± 0.009
1.359HisAsp: 1.359 ± 0.016
1.346HisGlu: 1.346 ± 0.018
0.972HisPhe: 0.972 ± 0.014
1.745HisGly: 1.745 ± 0.021
0.859HisHis: 0.859 ± 0.016
1.302HisIle: 1.302 ± 0.015
0.892HisLys: 0.892 ± 0.013
2.353HisLeu: 2.353 ± 0.02
0.532HisMet: 0.532 ± 0.011
0.91HisAsn: 0.91 ± 0.014
1.75HisPro: 1.75 ± 0.019
0.998HisGln: 0.998 ± 0.014
1.609HisArg: 1.609 ± 0.018
1.947HisSer: 1.947 ± 0.021
1.36HisThr: 1.36 ± 0.015
1.473HisVal: 1.473 ± 0.019
0.368HisTrp: 0.368 ± 0.008
0.748HisTyr: 0.748 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
4.244IleAla: 4.244 ± 0.031
0.85IleCys: 0.85 ± 0.013
2.889IleAsp: 2.889 ± 0.025
2.823IleGlu: 2.823 ± 0.023
2.101IlePhe: 2.101 ± 0.024
3.316IleGly: 3.316 ± 0.03
1.319IleHis: 1.319 ± 0.016
2.647IleIle: 2.647 ± 0.026
2.06IleLys: 2.06 ± 0.022
4.87IleLeu: 4.87 ± 0.037
1.063IleMet: 1.063 ± 0.014
1.836IleAsn: 1.836 ± 0.018
3.253IlePro: 3.253 ± 0.024
2.009IleGln: 2.009 ± 0.02
2.851IleArg: 2.851 ± 0.025
3.995IleSer: 3.995 ± 0.03
2.91IleThr: 2.91 ± 0.024
3.264IleVal: 3.264 ± 0.026
0.742IleTrp: 0.742 ± 0.013
1.53IleTyr: 1.53 ± 0.021
0.0IleXaa: 0.0 ± 0.0
Lys
3.839LysAla: 3.839 ± 0.032
0.542LysCys: 0.542 ± 0.01
2.69LysAsp: 2.69 ± 0.024
3.204LysGlu: 3.204 ± 0.031
1.395LysPhe: 1.395 ± 0.015
2.91LysGly: 2.91 ± 0.026
1.11LysHis: 1.11 ± 0.014
2.136LysIle: 2.136 ± 0.021
2.913LysLys: 2.913 ± 0.039
3.886LysLeu: 3.886 ± 0.034
0.942LysMet: 0.942 ± 0.015
1.68LysAsn: 1.68 ± 0.018
2.634LysPro: 2.634 ± 0.03
1.839LysGln: 1.839 ± 0.022
3.253LysArg: 3.253 ± 0.028
3.373LysSer: 3.373 ± 0.028
2.648LysThr: 2.648 ± 0.024
2.762LysVal: 2.762 ± 0.024
0.658LysTrp: 0.658 ± 0.011
1.354LysTyr: 1.354 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
7.655LeuAla: 7.655 ± 0.045
1.349LeuCys: 1.349 ± 0.016
5.259LeuAsp: 5.259 ± 0.033
5.531LeuGlu: 5.531 ± 0.042
3.568LeuPhe: 3.568 ± 0.029
6.087LeuGly: 6.087 ± 0.038
2.353LeuHis: 2.353 ± 0.025
4.122LeuIle: 4.122 ± 0.035
4.059LeuLys: 4.059 ± 0.034
8.652LeuLeu: 8.652 ± 0.055
1.879LeuMet: 1.879 ± 0.019
3.336LeuAsn: 3.336 ± 0.025
5.514LeuPro: 5.514 ± 0.038
3.926LeuGln: 3.926 ± 0.029
5.869LeuArg: 5.869 ± 0.042
7.732LeuSer: 7.732 ± 0.045
4.972LeuThr: 4.972 ± 0.032
5.653LeuVal: 5.653 ± 0.042
1.27LeuTrp: 1.27 ± 0.015
2.596LeuTyr: 2.596 ± 0.024
0.0LeuXaa: 0.0 ± 0.0
Met
2.123MetAla: 2.123 ± 0.018
0.267MetCys: 0.267 ± 0.007
1.257MetAsp: 1.257 ± 0.019
1.297MetGlu: 1.297 ± 0.015
0.798MetPhe: 0.798 ± 0.012
1.508MetGly: 1.508 ± 0.019
0.503MetHis: 0.503 ± 0.011
1.058MetIle: 1.058 ± 0.014
1.027MetLys: 1.027 ± 0.014
1.92MetLeu: 1.92 ± 0.02
0.569MetMet: 0.569 ± 0.011
0.836MetAsn: 0.836 ± 0.013
1.233MetPro: 1.233 ± 0.015
0.862MetGln: 0.862 ± 0.013
1.266MetArg: 1.266 ± 0.017
1.916MetSer: 1.916 ± 0.016
1.341MetThr: 1.341 ± 0.015
1.381MetVal: 1.381 ± 0.015
0.273MetTrp: 0.273 ± 0.007
0.555MetTyr: 0.555 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
3.004AsnAla: 3.004 ± 0.023
0.509AsnCys: 0.509 ± 0.011
2.009AsnAsp: 2.009 ± 0.02
2.051AsnGlu: 2.051 ± 0.02
1.358AsnPhe: 1.358 ± 0.016
2.919AsnGly: 2.919 ± 0.027
0.897AsnHis: 0.897 ± 0.013
2.13AsnIle: 2.13 ± 0.019
1.527AsnLys: 1.527 ± 0.019
3.306AsnLeu: 3.306 ± 0.029
0.891AsnMet: 0.891 ± 0.012
1.489AsnAsn: 1.489 ± 0.021
2.514AsnPro: 2.514 ± 0.024
1.398AsnGln: 1.398 ± 0.017
2.025AsnArg: 2.025 ± 0.018
2.752AsnSer: 2.752 ± 0.03
2.257AsnThr: 2.257 ± 0.024
2.421AsnVal: 2.421 ± 0.02
0.591AsnTrp: 0.591 ± 0.012
1.11AsnTyr: 1.11 ± 0.015
0.0AsnXaa: 0.0 ± 0.0
Pro
4.693ProAla: 4.693 ± 0.041
0.584ProCys: 0.584 ± 0.01
3.247ProAsp: 3.247 ± 0.025
3.917ProGlu: 3.917 ± 0.035
2.166ProPhe: 2.166 ± 0.02
3.933ProGly: 3.933 ± 0.039
1.341ProHis: 1.341 ± 0.019
2.588ProIle: 2.588 ± 0.024
2.545ProLys: 2.545 ± 0.026
4.745ProLeu: 4.745 ± 0.034
1.104ProMet: 1.104 ± 0.015
2.238ProAsn: 2.238 ± 0.024
4.629ProPro: 4.629 ± 0.065
2.398ProGln: 2.398 ± 0.032
3.396ProArg: 3.396 ± 0.032
5.939ProSer: 5.939 ± 0.047
3.944ProThr: 3.944 ± 0.033
3.744ProVal: 3.744 ± 0.029
0.791ProTrp: 0.791 ± 0.013
1.579ProTyr: 1.579 ± 0.02
0.0ProXaa: 0.0 ± 0.0
Gln
3.269GlnAla: 3.269 ± 0.028
0.508GlnCys: 0.508 ± 0.011
2.096GlnAsp: 2.096 ± 0.023
2.434GlnGlu: 2.434 ± 0.025
1.337GlnPhe: 1.337 ± 0.015
2.458GlnGly: 2.458 ± 0.022
1.065GlnHis: 1.065 ± 0.016
1.945GlnIle: 1.945 ± 0.019
1.991GlnLys: 1.991 ± 0.022
3.547GlnLeu: 3.547 ± 0.029
0.91GlnMet: 0.91 ± 0.014
1.596GlnAsn: 1.596 ± 0.02
2.49GlnPro: 2.49 ± 0.031
2.187GlnGln: 2.187 ± 0.036
2.654GlnArg: 2.654 ± 0.025
3.26GlnSer: 3.26 ± 0.028
2.354GlnThr: 2.354 ± 0.023
2.245GlnVal: 2.245 ± 0.019
0.608GlnTrp: 0.608 ± 0.012
1.229GlnTyr: 1.229 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
4.514ArgAla: 4.514 ± 0.03
0.798ArgCys: 0.798 ± 0.013
3.325ArgAsp: 3.325 ± 0.031
3.76ArgGlu: 3.76 ± 0.029
2.277ArgPhe: 2.277 ± 0.022
3.647ArgGly: 3.647 ± 0.03
1.596ArgHis: 1.596 ± 0.019
2.941ArgIle: 2.941 ± 0.022
3.317ArgLys: 3.317 ± 0.031
5.654ArgLeu: 5.654 ± 0.047
1.328ArgMet: 1.328 ± 0.016
2.299ArgAsn: 2.299 ± 0.021
3.456ArgPro: 3.456 ± 0.033
2.605ArgGln: 2.605 ± 0.022
4.934ArgArg: 4.934 ± 0.043
4.868ArgSer: 4.868 ± 0.04
3.308ArgThr: 3.308 ± 0.025
3.539ArgVal: 3.539 ± 0.027
0.958ArgTrp: 0.958 ± 0.014
1.808ArgTyr: 1.808 ± 0.02
0.0ArgXaa: 0.0 ± 0.0
Ser
6.432SerAla: 6.432 ± 0.041
0.97SerCys: 0.97 ± 0.014
4.414SerAsp: 4.414 ± 0.036
4.27SerGlu: 4.27 ± 0.032
3.174SerPhe: 3.174 ± 0.029
5.511SerGly: 5.511 ± 0.039
2.06SerHis: 2.06 ± 0.021
4.139SerIle: 4.139 ± 0.033
3.588SerLys: 3.588 ± 0.028
7.565SerLeu: 7.565 ± 0.038
1.729SerMet: 1.729 ± 0.02
3.037SerAsn: 3.037 ± 0.025
5.364SerPro: 5.364 ± 0.049
3.378SerGln: 3.378 ± 0.032
5.03SerArg: 5.03 ± 0.04
8.484SerSer: 8.484 ± 0.074
5.541SerThr: 5.541 ± 0.038
4.832SerVal: 4.832 ± 0.037
1.18SerTrp: 1.18 ± 0.015
2.206SerTyr: 2.206 ± 0.019
0.0SerXaa: 0.0 ± 0.0
Thr
5.067ThrAla: 5.067 ± 0.031
0.81ThrCys: 0.81 ± 0.013
2.989ThrAsp: 2.989 ± 0.025
3.191ThrGlu: 3.191 ± 0.024
2.285ThrPhe: 2.285 ± 0.023
4.308ThrGly: 4.308 ± 0.033
1.37ThrHis: 1.37 ± 0.016
3.101ThrIle: 3.101 ± 0.026
2.488ThrLys: 2.488 ± 0.022
5.383ThrLeu: 5.383 ± 0.033
1.227ThrMet: 1.227 ± 0.014
2.119ThrAsn: 2.119 ± 0.021
4.212ThrPro: 4.212 ± 0.037
2.117ThrGln: 2.117 ± 0.02
3.131ThrArg: 3.131 ± 0.028
5.161ThrSer: 5.161 ± 0.035
4.059ThrThr: 4.059 ± 0.038
4.014ThrVal: 4.014 ± 0.033
0.911ThrTrp: 0.911 ± 0.015
1.715ThrTyr: 1.715 ± 0.018
0.0ThrXaa: 0.0 ± 0.0
Val
5.129ValAla: 5.129 ± 0.038
0.896ValCys: 0.896 ± 0.013
3.855ValAsp: 3.855 ± 0.027
3.808ValGlu: 3.808 ± 0.03
2.597ValPhe: 2.597 ± 0.023
4.095ValGly: 4.095 ± 0.037
1.51ValHis: 1.51 ± 0.017
3.152ValIle: 3.152 ± 0.027
2.766ValLys: 2.766 ± 0.024
5.819ValLeu: 5.819 ± 0.039
1.391ValMet: 1.391 ± 0.017
2.355ValAsn: 2.355 ± 0.023
3.676ValPro: 3.676 ± 0.028
2.486ValGln: 2.486 ± 0.022
3.536ValArg: 3.536 ± 0.028
4.913ValSer: 4.913 ± 0.034
3.654ValThr: 3.654 ± 0.033
4.36ValVal: 4.36 ± 0.037
0.918ValTrp: 0.918 ± 0.014
1.896ValTyr: 1.896 ± 0.019
0.0ValXaa: 0.0 ± 0.0
Trp
1.121TrpAla: 1.121 ± 0.015
0.21TrpCys: 0.21 ± 0.006
0.914TrpAsp: 0.914 ± 0.013
0.864TrpGlu: 0.864 ± 0.013
0.566TrpPhe: 0.566 ± 0.009
0.96TrpGly: 0.96 ± 0.014
0.374TrpHis: 0.374 ± 0.008
0.804TrpIle: 0.804 ± 0.014
0.814TrpLys: 0.814 ± 0.012
1.433TrpLeu: 1.433 ± 0.019
0.391TrpMet: 0.391 ± 0.008
0.659TrpAsn: 0.659 ± 0.013
0.613TrpPro: 0.613 ± 0.011
0.576TrpGln: 0.576 ± 0.011
0.981TrpArg: 0.981 ± 0.013
1.116TrpSer: 1.116 ± 0.016
0.936TrpThr: 0.936 ± 0.015
0.935TrpVal: 0.935 ± 0.014
0.286TrpTrp: 0.286 ± 0.008
0.467TrpTyr: 0.467 ± 0.009
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.251TyrAla: 2.251 ± 0.023
0.455TyrCys: 0.455 ± 0.01
1.699TyrAsp: 1.699 ± 0.017
1.599TyrGlu: 1.599 ± 0.019
1.274TyrPhe: 1.274 ± 0.015
2.19TyrGly: 2.19 ± 0.023
0.821TyrHis: 0.821 ± 0.013
1.586TyrIle: 1.586 ± 0.019
1.111TyrLys: 1.111 ± 0.017
2.852TyrLeu: 2.852 ± 0.021
0.666TyrMet: 0.666 ± 0.012
1.189TyrAsn: 1.189 ± 0.016
1.613TyrPro: 1.613 ± 0.017
1.17TyrGln: 1.17 ± 0.015
1.742TyrArg: 1.742 ± 0.017
2.152TyrSer: 2.152 ± 0.023
1.768TyrThr: 1.768 ± 0.017
1.756TyrVal: 1.756 ± 0.02
0.465TyrTrp: 0.465 ± 0.009
1.019TyrTyr: 1.019 ± 0.014
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11290 proteins (5386922 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
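To illustrate what bootstrapping at the protein level can mean, the sketch below resamples whole proteins with replacement, recomputes one dipeptide frequency for each resample, and reports the standard deviation of those estimates. The exact procedure used by the database is not described here, so every detail (the helper names `count_pair` and `bootstrap_error`, the use of the standard deviation as the error, the fixed seed, and the toy sequences) is an assumption.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Std. dev. of the per-mille frequency of `pair` over resamples of whole proteins."""
    rng = random.Random(seed)
    # Pre-compute, per protein: (pair count, number of dipeptide positions).
    per_protein = [(count_pair(s, pair), max(len(s) - 1, 0)) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


toy = ["MAAL", "ALLS", "AAAA", "MKLV"]  # hypothetical sequences, not real data
print(bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.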

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski