Amino acid dipepetide frequency for Pseudogymnoascus verrucosus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.052AlaAla: 10.052 ± 0.072
1.06AlaCys: 1.06 ± 0.015
4.331AlaAsp: 4.331 ± 0.035
5.412AlaGlu: 5.412 ± 0.052
3.158AlaPhe: 3.158 ± 0.028
6.246AlaGly: 6.246 ± 0.039
1.727AlaHis: 1.727 ± 0.02
4.548AlaIle: 4.548 ± 0.032
4.365AlaLys: 4.365 ± 0.038
7.863AlaLeu: 7.863 ± 0.047
2.024AlaMet: 2.024 ± 0.022
3.114AlaAsn: 3.114 ± 0.029
5.24AlaPro: 5.24 ± 0.05
3.403AlaGln: 3.403 ± 0.032
4.829AlaArg: 4.829 ± 0.037
7.224AlaSer: 7.224 ± 0.038
5.625AlaThr: 5.625 ± 0.038
5.601AlaVal: 5.601 ± 0.037
1.145AlaTrp: 1.145 ± 0.017
2.231AlaTyr: 2.231 ± 0.021
0.0AlaXaa: 0.0 ± 0.0
Cys
0.963CysAla: 0.963 ± 0.013
0.239CysCys: 0.239 ± 0.008
0.66CysAsp: 0.66 ± 0.012
0.578CysGlu: 0.578 ± 0.011
0.525CysPhe: 0.525 ± 0.011
0.984CysGly: 0.984 ± 0.017
0.3CysHis: 0.3 ± 0.009
0.703CysIle: 0.703 ± 0.013
0.512CysLys: 0.512 ± 0.01
1.181CysLeu: 1.181 ± 0.017
0.272CysMet: 0.272 ± 0.007
0.432CysAsn: 0.432 ± 0.011
0.609CysPro: 0.609 ± 0.013
0.408CysGln: 0.408 ± 0.009
0.675CysArg: 0.675 ± 0.012
0.875CysSer: 0.875 ± 0.016
0.689CysThr: 0.689 ± 0.013
0.773CysVal: 0.773 ± 0.013
0.2CysTrp: 0.2 ± 0.006
0.359CysTyr: 0.359 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
4.816AspAla: 4.816 ± 0.031
0.583AspCys: 0.583 ± 0.011
4.069AspAsp: 4.069 ± 0.04
4.441AspGlu: 4.441 ± 0.046
2.199AspPhe: 2.199 ± 0.024
4.231AspGly: 4.231 ± 0.035
1.072AspHis: 1.072 ± 0.015
3.137AspIle: 3.137 ± 0.024
2.407AspLys: 2.407 ± 0.024
4.847AspLeu: 4.847 ± 0.033
1.254AspMet: 1.254 ± 0.015
1.923AspAsn: 1.923 ± 0.02
3.03AspPro: 3.03 ± 0.023
1.672AspGln: 1.672 ± 0.019
2.806AspArg: 2.806 ± 0.027
4.021AspSer: 4.021 ± 0.031
2.967AspThr: 2.967 ± 0.026
3.689AspVal: 3.689 ± 0.029
0.849AspTrp: 0.849 ± 0.014
1.598AspTyr: 1.598 ± 0.019
0.0AspXaa: 0.0 ± 0.0
Glu
5.703GluAla: 5.703 ± 0.052
0.61GluCys: 0.61 ± 0.012
4.126GluAsp: 4.126 ± 0.039
5.842GluGlu: 5.842 ± 0.066
1.972GluPhe: 1.972 ± 0.022
4.395GluGly: 4.395 ± 0.036
1.242GluHis: 1.242 ± 0.016
3.116GluIle: 3.116 ± 0.03
3.79GluLys: 3.79 ± 0.037
5.078GluLeu: 5.078 ± 0.037
1.571GluMet: 1.571 ± 0.019
2.197GluAsn: 2.197 ± 0.02
2.634GluPro: 2.634 ± 0.034
2.255GluGln: 2.255 ± 0.027
3.974GluArg: 3.974 ± 0.038
4.226GluSer: 4.226 ± 0.029
3.447GluThr: 3.447 ± 0.029
3.946GluVal: 3.946 ± 0.034
0.896GluTrp: 0.896 ± 0.014
1.677GluTyr: 1.677 ± 0.017
0.0GluXaa: 0.0 ± 0.0
Phe
3.13PheAla: 3.13 ± 0.032
0.541PheCys: 0.541 ± 0.012
2.226PheAsp: 2.226 ± 0.02
2.189PheGlu: 2.189 ± 0.023
1.589PhePhe: 1.589 ± 0.022
3.008PheGly: 3.008 ± 0.033
0.843PheHis: 0.843 ± 0.014
1.889PheIle: 1.889 ± 0.022
1.577PheLys: 1.577 ± 0.018
3.34PheLeu: 3.34 ± 0.028
0.823PheMet: 0.823 ± 0.014
1.433PheAsn: 1.433 ± 0.018
1.894PhePro: 1.894 ± 0.021
1.351PheGln: 1.351 ± 0.02
1.807PheArg: 1.807 ± 0.019
2.907PheSer: 2.907 ± 0.024
2.154PheThr: 2.154 ± 0.021
2.377PheVal: 2.377 ± 0.022
0.623PheTrp: 0.623 ± 0.012
1.111PheTyr: 1.111 ± 0.016
0.0PheXaa: 0.0 ± 0.0
Gly
5.939GlyAla: 5.939 ± 0.041
0.913GlyCys: 0.913 ± 0.017
3.82GlyAsp: 3.82 ± 0.029
4.253GlyGlu: 4.253 ± 0.035
2.924GlyPhe: 2.924 ± 0.025
7.208GlyGly: 7.208 ± 0.081
1.625GlyHis: 1.625 ± 0.021
3.732GlyIle: 3.732 ± 0.036
3.852GlyLys: 3.852 ± 0.032
6.153GlyLeu: 6.153 ± 0.036
1.774GlyMet: 1.774 ± 0.021
2.729GlyAsn: 2.729 ± 0.023
3.354GlyPro: 3.354 ± 0.033
2.587GlyGln: 2.587 ± 0.026
4.261GlyArg: 4.261 ± 0.031
5.863GlySer: 5.863 ± 0.039
4.147GlyThr: 4.147 ± 0.035
4.81GlyVal: 4.81 ± 0.039
1.259GlyTrp: 1.259 ± 0.017
2.216GlyTyr: 2.216 ± 0.024
0.0GlyXaa: 0.0 ± 0.0
His
1.736HisAla: 1.736 ± 0.019
0.295HisCys: 0.295 ± 0.008
1.179HisAsp: 1.179 ± 0.014
1.202HisGlu: 1.202 ± 0.016
0.868HisPhe: 0.868 ± 0.014
1.625HisGly: 1.625 ± 0.019
0.729HisHis: 0.729 ± 0.015
1.217HisIle: 1.217 ± 0.014
0.912HisLys: 0.912 ± 0.015
2.067HisLeu: 2.067 ± 0.025
0.452HisMet: 0.452 ± 0.009
0.845HisAsn: 0.845 ± 0.014
1.543HisPro: 1.543 ± 0.019
0.864HisGln: 0.864 ± 0.014
1.341HisArg: 1.341 ± 0.015
1.65HisSer: 1.65 ± 0.021
1.257HisThr: 1.257 ± 0.017
1.34HisVal: 1.34 ± 0.017
0.31HisTrp: 0.31 ± 0.008
0.675HisTyr: 0.675 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
4.466IleAla: 4.466 ± 0.034
0.759IleCys: 0.759 ± 0.013
2.921IleAsp: 2.921 ± 0.024
2.953IleGlu: 2.953 ± 0.028
2.066IlePhe: 2.066 ± 0.023
3.41IleGly: 3.41 ± 0.031
1.139IleHis: 1.139 ± 0.016
2.737IleIle: 2.737 ± 0.03
2.295IleLys: 2.295 ± 0.024
4.644IleLeu: 4.644 ± 0.037
1.073IleMet: 1.073 ± 0.017
1.917IleAsn: 1.917 ± 0.023
3.091IlePro: 3.091 ± 0.025
1.85IleGln: 1.85 ± 0.02
2.689IleArg: 2.689 ± 0.024
4.04IleSer: 4.04 ± 0.03
3.089IleThr: 3.089 ± 0.026
3.263IleVal: 3.263 ± 0.032
0.708IleTrp: 0.708 ± 0.014
1.501IleTyr: 1.501 ± 0.019
0.0IleXaa: 0.0 ± 0.0
Lys
4.579LysAla: 4.579 ± 0.038
0.509LysCys: 0.509 ± 0.01
2.814LysAsp: 2.814 ± 0.024
3.552LysGlu: 3.552 ± 0.034
1.533LysPhe: 1.533 ± 0.019
3.3LysGly: 3.3 ± 0.029
1.067LysHis: 1.067 ± 0.014
2.391LysIle: 2.391 ± 0.025
3.427LysLys: 3.427 ± 0.05
4.175LysLeu: 4.175 ± 0.031
1.064LysMet: 1.064 ± 0.014
1.798LysAsn: 1.798 ± 0.02
2.697LysPro: 2.697 ± 0.025
1.707LysGln: 1.707 ± 0.019
3.305LysArg: 3.305 ± 0.033
3.628LysSer: 3.628 ± 0.029
2.911LysThr: 2.911 ± 0.026
2.993LysVal: 2.993 ± 0.031
0.695LysTrp: 0.695 ± 0.012
1.419LysTyr: 1.419 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
7.881LeuAla: 7.881 ± 0.049
1.127LeuCys: 1.127 ± 0.015
4.974LeuAsp: 4.974 ± 0.033
5.396LeuGlu: 5.396 ± 0.04
3.235LeuPhe: 3.235 ± 0.032
6.053LeuGly: 6.053 ± 0.034
2.076LeuHis: 2.076 ± 0.022
4.023LeuIle: 4.023 ± 0.032
4.291LeuLys: 4.291 ± 0.033
8.298LeuLeu: 8.298 ± 0.065
1.784LeuMet: 1.784 ± 0.019
3.099LeuAsn: 3.099 ± 0.023
5.424LeuPro: 5.424 ± 0.035
3.556LeuGln: 3.556 ± 0.031
5.359LeuArg: 5.359 ± 0.038
7.179LeuSer: 7.179 ± 0.046
4.863LeuThr: 4.863 ± 0.031
5.278LeuVal: 5.278 ± 0.042
1.183LeuTrp: 1.183 ± 0.017
2.323LeuTyr: 2.323 ± 0.021
0.0LeuXaa: 0.0 ± 0.0
Met
2.299MetAla: 2.299 ± 0.021
0.233MetCys: 0.233 ± 0.006
1.263MetAsp: 1.263 ± 0.015
1.362MetGlu: 1.362 ± 0.016
0.771MetPhe: 0.771 ± 0.015
1.663MetGly: 1.663 ± 0.021
0.489MetHis: 0.489 ± 0.009
0.996MetIle: 0.996 ± 0.014
1.073MetLys: 1.073 ± 0.015
1.847MetLeu: 1.847 ± 0.021
0.608MetMet: 0.608 ± 0.012
0.809MetAsn: 0.809 ± 0.013
1.246MetPro: 1.246 ± 0.016
0.826MetGln: 0.826 ± 0.015
1.269MetArg: 1.269 ± 0.016
1.901MetSer: 1.901 ± 0.018
1.242MetThr: 1.242 ± 0.017
1.355MetVal: 1.355 ± 0.02
0.273MetTrp: 0.273 ± 0.007
0.526MetTyr: 0.526 ± 0.009
0.0MetXaa: 0.0 ± 0.0
Asn
3.198AsnAla: 3.198 ± 0.031
0.471AsnCys: 0.471 ± 0.01
1.941AsnAsp: 1.941 ± 0.022
2.006AsnGlu: 2.006 ± 0.023
1.421AsnPhe: 1.421 ± 0.018
3.221AsnGly: 3.221 ± 0.031
0.801AsnHis: 0.801 ± 0.013
2.169AsnIle: 2.169 ± 0.022
1.581AsnLys: 1.581 ± 0.019
3.219AsnLeu: 3.219 ± 0.026
0.867AsnMet: 0.867 ± 0.014
1.601AsnAsn: 1.601 ± 0.028
2.446AsnPro: 2.446 ± 0.024
1.295AsnGln: 1.295 ± 0.02
1.889AsnArg: 1.889 ± 0.022
2.855AsnSer: 2.855 ± 0.023
2.237AsnThr: 2.237 ± 0.025
2.283AsnVal: 2.283 ± 0.023
0.554AsnTrp: 0.554 ± 0.011
1.112AsnTyr: 1.112 ± 0.014
0.0AsnXaa: 0.0 ± 0.0
Pro
5.332ProAla: 5.332 ± 0.04
0.479ProCys: 0.479 ± 0.011
2.897ProAsp: 2.897 ± 0.023
3.606ProGlu: 3.606 ± 0.028
1.959ProPhe: 1.959 ± 0.018
3.922ProGly: 3.922 ± 0.035
1.26ProHis: 1.26 ± 0.019
2.646ProIle: 2.646 ± 0.026
2.751ProLys: 2.751 ± 0.024
4.78ProLeu: 4.78 ± 0.035
1.06ProMet: 1.06 ± 0.016
2.211ProAsn: 2.211 ± 0.024
5.19ProPro: 5.19 ± 0.064
2.394ProGln: 2.394 ± 0.028
3.273ProArg: 3.273 ± 0.035
5.796ProSer: 5.796 ± 0.046
4.29ProThr: 4.29 ± 0.042
3.523ProVal: 3.523 ± 0.033
0.709ProTrp: 0.709 ± 0.012
1.493ProTyr: 1.493 ± 0.021
0.0ProXaa: 0.0 ± 0.0
Gln
3.307GlnAla: 3.307 ± 0.033
0.417GlnCys: 0.417 ± 0.01
1.858GlnAsp: 1.858 ± 0.02
2.232GlnGlu: 2.232 ± 0.025
1.252GlnPhe: 1.252 ± 0.016
2.486GlnGly: 2.486 ± 0.028
0.949GlnHis: 0.949 ± 0.014
1.844GlnIle: 1.844 ± 0.018
1.889GlnLys: 1.889 ± 0.024
3.264GlnLeu: 3.264 ± 0.026
0.856GlnMet: 0.856 ± 0.014
1.524GlnAsn: 1.524 ± 0.019
2.357GlnPro: 2.357 ± 0.026
2.47GlnGln: 2.47 ± 0.059
2.415GlnArg: 2.415 ± 0.022
2.884GlnSer: 2.884 ± 0.027
2.206GlnThr: 2.206 ± 0.022
2.034GlnVal: 2.034 ± 0.024
0.553GlnTrp: 0.553 ± 0.01
1.167GlnTyr: 1.167 ± 0.018
0.0GlnXaa: 0.0 ± 0.0
Arg
4.651ArgAla: 4.651 ± 0.034
0.669ArgCys: 0.669 ± 0.012
3.231ArgAsp: 3.231 ± 0.029
3.876ArgGlu: 3.876 ± 0.04
1.975ArgPhe: 1.975 ± 0.018
3.981ArgGly: 3.981 ± 0.035
1.43ArgHis: 1.43 ± 0.017
2.808ArgIle: 2.808 ± 0.022
3.311ArgLys: 3.311 ± 0.032
5.027ArgLeu: 5.027 ± 0.037
1.281ArgMet: 1.281 ± 0.016
2.166ArgAsn: 2.166 ± 0.025
3.23ArgPro: 3.23 ± 0.034
2.386ArgGln: 2.386 ± 0.025
4.813ArgArg: 4.813 ± 0.051
4.418ArgSer: 4.418 ± 0.038
3.169ArgThr: 3.169 ± 0.027
3.289ArgVal: 3.289 ± 0.026
0.842ArgTrp: 0.842 ± 0.015
1.543ArgTyr: 1.543 ± 0.018
0.0ArgXaa: 0.0 ± 0.0
Ser
6.655SerAla: 6.655 ± 0.044
0.881SerCys: 0.881 ± 0.014
4.129SerAsp: 4.129 ± 0.034
4.009SerGlu: 4.009 ± 0.029
2.94SerPhe: 2.94 ± 0.028
5.715SerGly: 5.715 ± 0.038
1.818SerHis: 1.818 ± 0.02
4.145SerIle: 4.145 ± 0.032
3.796SerLys: 3.796 ± 0.033
7.067SerLeu: 7.067 ± 0.046
1.7SerMet: 1.7 ± 0.019
3.064SerAsn: 3.064 ± 0.023
5.485SerPro: 5.485 ± 0.054
3.101SerGln: 3.101 ± 0.034
4.676SerArg: 4.676 ± 0.038
8.264SerSer: 8.264 ± 0.071
5.652SerThr: 5.652 ± 0.041
4.517SerVal: 4.517 ± 0.03
1.096SerTrp: 1.096 ± 0.016
2.093SerTyr: 2.093 ± 0.025
0.0SerXaa: 0.0 ± 0.0
Thr
5.424ThrAla: 5.424 ± 0.036
0.77ThrCys: 0.77 ± 0.011
2.882ThrAsp: 2.882 ± 0.025
3.182ThrGlu: 3.182 ± 0.027
2.278ThrPhe: 2.278 ± 0.024
4.358ThrGly: 4.358 ± 0.036
1.27ThrHis: 1.27 ± 0.017
3.263ThrIle: 3.263 ± 0.025
2.734ThrLys: 2.734 ± 0.026
5.24ThrLeu: 5.24 ± 0.039
1.206ThrMet: 1.206 ± 0.016
2.22ThrAsn: 2.22 ± 0.022
4.56ThrPro: 4.56 ± 0.039
2.037ThrGln: 2.037 ± 0.018
3.008ThrArg: 3.008 ± 0.026
5.405ThrSer: 5.405 ± 0.041
4.717ThrThr: 4.717 ± 0.044
3.719ThrVal: 3.719 ± 0.03
0.854ThrTrp: 0.854 ± 0.013
1.681ThrTyr: 1.681 ± 0.02
0.0ThrXaa: 0.0 ± 0.0
Val
5.674ValAla: 5.674 ± 0.037
0.782ValCys: 0.782 ± 0.014
3.75ValAsp: 3.75 ± 0.028
4.187ValGlu: 4.187 ± 0.039
2.417ValPhe: 2.417 ± 0.025
4.412ValGly: 4.412 ± 0.034
1.258ValHis: 1.258 ± 0.015
3.049ValIle: 3.049 ± 0.026
3.054ValLys: 3.054 ± 0.026
5.464ValLeu: 5.464 ± 0.043
1.376ValMet: 1.376 ± 0.02
2.251ValAsn: 2.251 ± 0.023
3.485ValPro: 3.485 ± 0.03
2.21ValGln: 2.21 ± 0.021
3.315ValArg: 3.315 ± 0.025
4.524ValSer: 4.524 ± 0.032
3.56ValThr: 3.56 ± 0.032
4.706ValVal: 4.706 ± 0.035
0.874ValTrp: 0.874 ± 0.013
1.705ValTyr: 1.705 ± 0.017
0.0ValXaa: 0.0 ± 0.0
Trp
1.182TrpAla: 1.182 ± 0.017
0.185TrpCys: 0.185 ± 0.006
0.877TrpAsp: 0.877 ± 0.014
0.876TrpGlu: 0.876 ± 0.013
0.526TrpPhe: 0.526 ± 0.012
1.035TrpGly: 1.035 ± 0.016
0.317TrpHis: 0.317 ± 0.007
0.725TrpIle: 0.725 ± 0.013
0.778TrpLys: 0.778 ± 0.014
1.263TrpLeu: 1.263 ± 0.017
0.375TrpMet: 0.375 ± 0.008
0.622TrpAsn: 0.622 ± 0.013
0.584TrpPro: 0.584 ± 0.012
0.516TrpGln: 0.516 ± 0.011
0.902TrpArg: 0.902 ± 0.015
1.027TrpSer: 1.027 ± 0.015
0.894TrpThr: 0.894 ± 0.014
0.932TrpVal: 0.932 ± 0.014
0.279TrpTrp: 0.279 ± 0.009
0.43TrpTyr: 0.43 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.213TyrAla: 2.213 ± 0.023
0.414TyrCys: 0.414 ± 0.01
1.665TyrAsp: 1.665 ± 0.022
1.539TyrGlu: 1.539 ± 0.018
1.205TyrPhe: 1.205 ± 0.016
2.167TyrGly: 2.167 ± 0.024
0.695TyrHis: 0.695 ± 0.013
1.471TyrIle: 1.471 ± 0.019
1.171TyrLys: 1.171 ± 0.018
2.585TyrLeu: 2.585 ± 0.026
0.631TyrMet: 0.631 ± 0.011
1.209TyrAsn: 1.209 ± 0.017
1.474TyrPro: 1.474 ± 0.021
1.072TyrGln: 1.072 ± 0.013
1.493TyrArg: 1.493 ± 0.019
2.116TyrSer: 2.116 ± 0.021
1.691TyrThr: 1.691 ± 0.02
1.624TyrVal: 1.624 ± 0.019
0.424TyrTrp: 0.424 ± 0.009
0.94TyrTyr: 0.94 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10683 proteins (5291759 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski