Amino acid dipeptide frequency for Blastomyces percursus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
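As an illustration of what such a calculation might look like, here is a minimal Python sketch. It is not the code used to produce this page: the function names, the toy sequences, and the choice to count every overlapping adjacent pair within each protein (never across two proteins) are assumptions made for the example. Any letter outside the 20 standard one-letter codes is reported as Xaa.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def to_three_letter(residue):
    # Assumption: anything outside the 20 standard letters becomes Xaa.
    return THREE_LETTER.get(residue.upper(), "Xaa")


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Each protein is scanned separately, so no pair spans two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[to_three_letter(first) + to_three_letter(second)] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome, only to show the call.
toy_proteome = ["MAAS", "MSSL"]
freqs = dipeptide_per_mille(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

By construction the returned values sum to 1000‰, which gives a quick sanity check when the function is run on a real FASTA-derived list of sequences.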

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 standard dipeptides were equally likely in proteins, each would occur at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
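The uniform expectation and the over/under classification can be sketched in a few lines of Python. The observed values below are copied from the table on this page; the function name `classify` and the ratio column are illustrative additions, and the exact rule used to colour the original table is assumed to be a simple comparison against 2.5‰.

```python
N_STANDARD = 20

# 1 / 400 expressed in per mille: 2.5 per mille, i.e. 0.25%.
uniform_expectation = 1000.0 / (N_STANDARD ** 2)


def classify(per_mille, expected=uniform_expectation):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if per_mille > expected:
        return "over"    # would be marked red
    if per_mille < expected:
        return "under"   # would be marked blue
    return "as expected"


# A few values taken from the table on this page (per mille).
observed = {
    "SerSer": 9.899,
    "LeuLeu": 8.256,
    "AlaAla": 8.118,
    "CysTrp": 0.194,
    "TrpCys": 0.175,
}

for pair, value in observed.items():
    ratio = value / uniform_expectation
    print(f"{pair}: {value} per mille, {classify(value)}, {ratio:.2f}x expected")
```

Note that this baseline ignores the unequal abundance of single amino acids; a dipeptide made of two common residues (such as SerSer) will exceed 2.5‰ even without any pairing preference.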

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
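A short worked conversion may help. The sketch below turns the AlaAla value from the table into a fraction and a percentage, and then derives a rough count. The count rests on an assumption not stated on this page: that a protein of length L contributes L - 1 adjacent pairs, so the total number of dipeptides equals the number of amino acids minus the number of proteins (both taken from the statistics line below the table).

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala_per_mille = 8.118            # AlaAla value from the table
fraction = per_mille_to_fraction(ala_ala_per_mille)
percent = fraction * 100.0

# Totals reported for this proteome.
n_proteins = 10283
n_residues = 4463529

# Assumption: every adjacent pair inside a protein is counted once,
# so each protein of length L contributes L - 1 dipeptides.
n_pairs = n_residues - n_proteins

approx_count = fraction * n_pairs    # rough estimate, not a reported value

print(f"fraction = {fraction:.6f}")
print(f"percent  = {percent:.4f}%")
print(f"approximate AlaAla occurrences = {approx_count:.0f}")
```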

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 8.118 ± 0.058
AlaCys: 0.944 ± 0.014
AlaAsp: 3.995 ± 0.033
AlaGlu: 5.004 ± 0.044
AlaPhe: 2.837 ± 0.028
AlaGly: 5.372 ± 0.043
AlaHis: 1.721 ± 0.021
AlaIle: 4.079 ± 0.029
AlaLys: 4.022 ± 0.035
AlaLeu: 7.186 ± 0.046
AlaMet: 1.715 ± 0.019
AlaAsn: 2.897 ± 0.025
AlaPro: 4.371 ± 0.038
AlaGln: 3.136 ± 0.033
AlaArg: 4.935 ± 0.034
AlaSer: 6.961 ± 0.044
AlaThr: 5.017 ± 0.036
AlaVal: 4.82 ± 0.039
AlaTrp: 0.938 ± 0.015
AlaTyr: 1.9 ± 0.022
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.858 ± 0.015
CysCys: 0.232 ± 0.008
CysAsp: 0.635 ± 0.012
CysGlu: 0.662 ± 0.012
CysPhe: 0.502 ± 0.011
CysGly: 0.915 ± 0.016
CysHis: 0.35 ± 0.01
CysIle: 0.703 ± 0.011
CysLys: 0.532 ± 0.011
CysLeu: 1.257 ± 0.018
CysMet: 0.262 ± 0.007
CysAsn: 0.435 ± 0.009
CysPro: 0.718 ± 0.016
CysGln: 0.496 ± 0.011
CysArg: 0.829 ± 0.015
CysSer: 0.956 ± 0.016
CysThr: 0.652 ± 0.011
CysVal: 0.744 ± 0.013
CysTrp: 0.194 ± 0.007
CysTyr: 0.354 ± 0.009
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.265 ± 0.033
AspCys: 0.633 ± 0.012
AspAsp: 4.37 ± 0.048
AspGlu: 4.464 ± 0.038
AspPhe: 2.095 ± 0.025
AspGly: 4.099 ± 0.032
AspHis: 1.264 ± 0.019
AspIle: 3.414 ± 0.028
AspLys: 2.446 ± 0.028
AspLeu: 4.836 ± 0.039
AspMet: 1.251 ± 0.017
AspAsn: 2.051 ± 0.021
AspPro: 3.341 ± 0.031
AspGln: 1.825 ± 0.02
AspArg: 3.249 ± 0.034
AspSer: 4.238 ± 0.036
AspThr: 2.948 ± 0.024
AspVal: 3.531 ± 0.033
AspTrp: 0.803 ± 0.014
AspTyr: 1.538 ± 0.02
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.06 ± 0.04
GluCys: 0.665 ± 0.013
GluAsp: 4.156 ± 0.037
GluGlu: 5.517 ± 0.052
GluPhe: 2.016 ± 0.02
GluGly: 3.758 ± 0.035
GluHis: 1.427 ± 0.019
GluIle: 3.297 ± 0.033
GluLys: 3.882 ± 0.043
GluLeu: 5.345 ± 0.042
GluMet: 1.462 ± 0.017
GluAsn: 2.625 ± 0.028
GluPro: 2.932 ± 0.038
GluGln: 2.562 ± 0.027
GluArg: 4.369 ± 0.036
GluSer: 4.464 ± 0.039
GluThr: 3.413 ± 0.029
GluVal: 3.395 ± 0.03
GluTrp: 0.862 ± 0.013
GluTyr: 1.752 ± 0.021
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.721 ± 0.028
PheCys: 0.556 ± 0.013
PheAsp: 2.207 ± 0.024
PheGlu: 2.149 ± 0.022
PhePhe: 1.435 ± 0.02
PheGly: 2.56 ± 0.031
PheHis: 0.98 ± 0.016
PheIle: 1.75 ± 0.023
PheLys: 1.538 ± 0.019
PheLeu: 3.358 ± 0.032
PheMet: 0.726 ± 0.015
PheAsn: 1.423 ± 0.019
PhePro: 2.04 ± 0.022
PheGln: 1.363 ± 0.016
PheArg: 2.117 ± 0.025
PheSer: 3.051 ± 0.026
PheThr: 2.046 ± 0.022
PheVal: 2.177 ± 0.024
PheTrp: 0.559 ± 0.012
PheTyr: 1.016 ± 0.018
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.754 ± 0.036
GlyCys: 0.837 ± 0.016
GlyAsp: 3.701 ± 0.031
GlyGlu: 3.793 ± 0.033
GlyPhe: 2.516 ± 0.027
GlyGly: 5.662 ± 0.057
GlyHis: 1.65 ± 0.02
GlyIle: 3.426 ± 0.027
GlyLys: 3.598 ± 0.03
GlyLeu: 5.496 ± 0.038
GlyMet: 1.549 ± 0.022
GlyAsn: 2.647 ± 0.028
GlyPro: 3.221 ± 0.029
GlyGln: 2.401 ± 0.03
GlyArg: 4.293 ± 0.038
GlySer: 5.641 ± 0.041
GlyThr: 3.733 ± 0.035
GlyVal: 4.009 ± 0.033
GlyTrp: 1.038 ± 0.015
GlyTyr: 1.972 ± 0.022
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.808 ± 0.02
HisCys: 0.346 ± 0.009
HisAsp: 1.346 ± 0.019
HisGlu: 1.335 ± 0.018
HisPhe: 0.939 ± 0.014
HisGly: 1.714 ± 0.022
HisHis: 0.973 ± 0.023
HisIle: 1.352 ± 0.017
HisLys: 0.951 ± 0.014
HisLeu: 2.345 ± 0.024
HisMet: 0.483 ± 0.01
HisAsn: 0.933 ± 0.014
HisPro: 1.896 ± 0.024
HisGln: 1.136 ± 0.02
HisArg: 1.712 ± 0.023
HisSer: 2.063 ± 0.024
HisThr: 1.337 ± 0.017
HisVal: 1.388 ± 0.016
HisTrp: 0.314 ± 0.008
HisTyr: 0.71 ± 0.013
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.926 ± 0.035
IleCys: 0.769 ± 0.013
IleAsp: 3.027 ± 0.026
IleGlu: 3.029 ± 0.03
IlePhe: 2.036 ± 0.023
IleGly: 2.99 ± 0.032
IleHis: 1.307 ± 0.018
IleIle: 2.635 ± 0.029
IleLys: 2.273 ± 0.025
IleLeu: 4.759 ± 0.038
IleMet: 0.969 ± 0.013
IleAsn: 1.883 ± 0.022
IlePro: 3.38 ± 0.029
IleGln: 1.965 ± 0.021
IleArg: 3.075 ± 0.026
IleSer: 4.413 ± 0.032
IleThr: 2.877 ± 0.028
IleVal: 3.0 ± 0.031
IleTrp: 0.683 ± 0.014
IleTyr: 1.437 ± 0.017
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.094 ± 0.036
LysCys: 0.548 ± 0.012
LysAsp: 2.754 ± 0.029
LysGlu: 3.571 ± 0.032
LysPhe: 1.576 ± 0.02
LysGly: 3.038 ± 0.029
LysHis: 1.202 ± 0.017
LysIle: 2.496 ± 0.027
LysLys: 3.397 ± 0.052
LysLeu: 4.418 ± 0.037
LysMet: 1.036 ± 0.015
LysAsn: 1.883 ± 0.02
LysPro: 2.98 ± 0.031
LysGln: 2.055 ± 0.024
LysArg: 3.937 ± 0.039
LysSer: 3.799 ± 0.033
LysThr: 2.751 ± 0.027
LysVal: 2.757 ± 0.025
LysTrp: 0.661 ± 0.015
LysTyr: 1.459 ± 0.02
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.248 ± 0.039
LeuCys: 1.204 ± 0.019
LeuAsp: 5.117 ± 0.041
LeuGlu: 5.675 ± 0.041
LeuPhe: 3.235 ± 0.032
LeuGly: 5.493 ± 0.041
LeuHis: 2.307 ± 0.026
LeuIle: 3.929 ± 0.031
LeuLys: 4.517 ± 0.039
LeuLeu: 8.256 ± 0.062
LeuMet: 1.689 ± 0.021
LeuAsn: 3.26 ± 0.028
LeuPro: 5.59 ± 0.04
LeuGln: 3.938 ± 0.034
LeuArg: 6.122 ± 0.04
LeuSer: 7.543 ± 0.041
LeuThr: 4.684 ± 0.032
LeuVal: 4.973 ± 0.038
LeuTrp: 1.067 ± 0.018
LeuTyr: 2.282 ± 0.024
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.051 ± 0.022
MetCys: 0.223 ± 0.008
MetAsp: 1.247 ± 0.018
MetGlu: 1.411 ± 0.018
MetPhe: 0.688 ± 0.012
MetGly: 1.371 ± 0.019
MetHis: 0.454 ± 0.01
MetIle: 0.968 ± 0.015
MetLys: 1.082 ± 0.016
MetLeu: 1.74 ± 0.02
MetMet: 0.525 ± 0.01
MetAsn: 0.79 ± 0.015
MetPro: 1.185 ± 0.018
MetGln: 0.798 ± 0.015
MetArg: 1.25 ± 0.017
MetSer: 1.792 ± 0.021
MetThr: 1.243 ± 0.017
MetVal: 1.24 ± 0.018
MetTrp: 0.243 ± 0.006
MetTyr: 0.474 ± 0.011
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.109 ± 0.025
AsnCys: 0.482 ± 0.011
AsnAsp: 2.158 ± 0.023
AsnGlu: 2.23 ± 0.022
AsnPhe: 1.389 ± 0.017
AsnGly: 2.981 ± 0.031
AsnHis: 0.969 ± 0.017
AsnIle: 2.273 ± 0.024
AsnLys: 1.711 ± 0.021
AsnLeu: 3.387 ± 0.026
AsnMet: 0.836 ± 0.015
AsnAsn: 1.934 ± 0.032
AsnPro: 2.727 ± 0.027
AsnGln: 1.448 ± 0.022
AsnArg: 2.296 ± 0.026
AsnSer: 3.126 ± 0.029
AsnThr: 2.258 ± 0.024
AsnVal: 2.276 ± 0.023
AsnTrp: 0.497 ± 0.011
AsnTyr: 1.047 ± 0.016
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 5.013 ± 0.048
ProCys: 0.562 ± 0.012
ProAsp: 3.235 ± 0.03
ProGlu: 3.81 ± 0.038
ProPhe: 2.093 ± 0.024
ProGly: 3.824 ± 0.033
ProHis: 1.546 ± 0.02
ProIle: 2.785 ± 0.023
ProLys: 2.808 ± 0.026
ProLeu: 5.005 ± 0.038
ProMet: 1.059 ± 0.016
ProAsn: 2.442 ± 0.028
ProPro: 5.811 ± 0.071
ProGln: 2.68 ± 0.034
ProArg: 3.717 ± 0.031
ProSer: 6.688 ± 0.055
ProThr: 4.387 ± 0.044
ProVal: 3.444 ± 0.033
ProTrp: 0.677 ± 0.013
ProTyr: 1.528 ± 0.02
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.225 ± 0.035
GlnCys: 0.464 ± 0.01
GlnAsp: 1.966 ± 0.021
GlnGlu: 2.423 ± 0.031
GlnPhe: 1.3 ± 0.018
GlnGly: 2.316 ± 0.025
GlnHis: 1.243 ± 0.02
GlnIle: 1.948 ± 0.024
GlnLys: 2.139 ± 0.023
GlnLeu: 3.633 ± 0.03
GlnMet: 0.843 ± 0.016
GlnAsn: 1.726 ± 0.023
GlnPro: 2.846 ± 0.037
GlnGln: 2.887 ± 0.06
GlnArg: 2.902 ± 0.028
GlnSer: 3.288 ± 0.029
GlnThr: 2.264 ± 0.025
GlnVal: 2.056 ± 0.022
GlnTrp: 0.493 ± 0.011
GlnTyr: 1.109 ± 0.016
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.836 ± 0.037
ArgCys: 0.824 ± 0.015
ArgAsp: 3.588 ± 0.036
ArgGlu: 4.24 ± 0.039
ArgPhe: 2.183 ± 0.022
ArgGly: 4.043 ± 0.042
ArgHis: 1.712 ± 0.019
ArgIle: 3.129 ± 0.027
ArgLys: 3.925 ± 0.031
ArgLeu: 5.69 ± 0.037
ArgMet: 1.388 ± 0.016
ArgAsn: 2.666 ± 0.025
ArgPro: 3.812 ± 0.035
ArgGln: 2.891 ± 0.026
ArgArg: 5.818 ± 0.053
ArgSer: 5.311 ± 0.044
ArgThr: 3.523 ± 0.03
ArgVal: 3.506 ± 0.03
ArgTrp: 0.894 ± 0.015
ArgTyr: 1.778 ± 0.022
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.433 ± 0.048
SerCys: 0.912 ± 0.015
SerAsp: 4.276 ± 0.034
SerGlu: 4.253 ± 0.039
SerPhe: 3.023 ± 0.027
SerGly: 5.56 ± 0.039
SerHis: 2.206 ± 0.025
SerIle: 4.185 ± 0.033
SerLys: 4.063 ± 0.037
SerLeu: 7.442 ± 0.044
SerMet: 1.695 ± 0.02
SerAsn: 3.431 ± 0.033
SerPro: 6.221 ± 0.055
SerGln: 3.585 ± 0.036
SerArg: 5.713 ± 0.041
SerSer: 9.899 ± 0.081
SerThr: 5.898 ± 0.045
SerVal: 4.446 ± 0.035
SerTrp: 1.06 ± 0.018
SerTyr: 2.053 ± 0.026
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.976 ± 0.042
ThrCys: 0.718 ± 0.014
ThrAsp: 2.823 ± 0.024
ThrGlu: 3.148 ± 0.029
ThrPhe: 2.126 ± 0.022
ThrGly: 3.884 ± 0.032
ThrHis: 1.325 ± 0.018
ThrIle: 3.045 ± 0.031
ThrLys: 2.699 ± 0.029
ThrLeu: 5.07 ± 0.038
ThrMet: 1.129 ± 0.015
ThrAsn: 2.236 ± 0.026
ThrPro: 4.57 ± 0.05
ThrGln: 2.122 ± 0.027
ThrArg: 3.351 ± 0.031
ThrSer: 5.6 ± 0.04
ThrThr: 4.493 ± 0.05
ThrVal: 3.47 ± 0.031
ThrTrp: 0.697 ± 0.014
ThrTyr: 1.531 ± 0.02
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.59 ± 0.029
ValCys: 0.764 ± 0.015
ValAsp: 3.629 ± 0.031
ValGlu: 3.88 ± 0.036
ValPhe: 2.171 ± 0.025
ValGly: 3.737 ± 0.034
ValHis: 1.37 ± 0.017
ValIle: 2.883 ± 0.028
ValLys: 2.9 ± 0.025
ValLeu: 5.148 ± 0.04
ValMet: 1.214 ± 0.02
ValAsn: 2.213 ± 0.024
ValPro: 3.38 ± 0.032
ValGln: 2.205 ± 0.024
ValArg: 3.43 ± 0.029
ValSer: 4.505 ± 0.035
ValThr: 3.246 ± 0.034
ValVal: 3.85 ± 0.036
ValTrp: 0.748 ± 0.012
ValTyr: 1.527 ± 0.02
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.967 ± 0.013
TrpCys: 0.175 ± 0.006
TrpAsp: 0.815 ± 0.016
TrpGlu: 0.848 ± 0.016
TrpPhe: 0.457 ± 0.01
TrpGly: 0.772 ± 0.013
TrpHis: 0.32 ± 0.009
TrpIle: 0.678 ± 0.012
TrpLys: 0.808 ± 0.013
TrpLeu: 1.206 ± 0.015
TrpMet: 0.333 ± 0.008
TrpAsn: 0.587 ± 0.013
TrpPro: 0.534 ± 0.012
TrpGln: 0.503 ± 0.012
TrpArg: 0.955 ± 0.014
TrpSer: 0.932 ± 0.013
TrpThr: 0.759 ± 0.012
TrpVal: 0.787 ± 0.015
TrpTrp: 0.235 ± 0.009
TrpTyr: 0.368 ± 0.009
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.916 ± 0.023
TyrCys: 0.43 ± 0.011
TyrAsp: 1.615 ± 0.019
TyrGlu: 1.519 ± 0.02
TyrPhe: 1.153 ± 0.016
TyrGly: 1.848 ± 0.023
TyrHis: 0.755 ± 0.013
TyrIle: 1.455 ± 0.02
TyrLys: 1.119 ± 0.016
TyrLeu: 2.553 ± 0.026
TyrMet: 0.61 ± 0.011
TyrAsn: 1.073 ± 0.014
TyrPro: 1.56 ± 0.025
TyrGln: 1.076 ± 0.014
TyrArg: 1.686 ± 0.019
TyrSer: 2.083 ± 0.024
TyrThr: 1.487 ± 0.018
TyrVal: 1.521 ± 0.02
TyrTrp: 0.374 ± 0.009
TyrTyr: 0.888 ± 0.016
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 10283 proteins (4463529 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
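To make the note above concrete, here is a minimal Python sketch of a protein-level bootstrap. It is an assumption-laden illustration, not the pipeline behind this page: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the reported error is taken to be the standard deviation across replicates (the page does not say which spread statistic was used). The helper names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide (one-letter codes, e.g. 'AA')."""
    hits = 0
    total = 0
    for seq in sequences:
        # Slide a window of two residues along each protein.
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome; a real run would use all proteins of the proteome.
toy_proteome = ["MAASLL", "MSSLAA", "MKKAAS", "MLLSSA"]
point = pair_per_mille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {point:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps the dependence between neighbouring positions within a protein intact, which is presumably why the estimate is described as being made at the protein level.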

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski