Amino acid dipeptide frequency for Hirsutella minnesotensis 3608

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
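A minimal sketch of how such a frequency could be computed is given here. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the one-letter codes, the toy sequences, and the choice to normalise by the total number of overlapping pairs are all assumptions made for illustration.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: counts overlapping residue pairs inside each
    protein (pairs never span two proteins) and divides by the total
    number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter amino acid codes).
freqs = dipeptide_frequencies(["MAAL", "AAK"])
print(freqs["AA"])  # AA occurs in 2 of the 5 pairs
```

The table on this page uses three-letter codes (for example AlaAla for the pair written AA here).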

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
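The expected value and the over/under classification can be sketched as follows. The names `expected_400`, `expected_441` and `classify` are hypothetical, and using the uniform expectation as the colouring threshold is an assumption; the page does not state exactly how the red and blue marking is decided.

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # standard amino acids plus Xaa

# Uniform expectation in per mille: every ordered pair equally likely.
expected_400 = 1000.0 / N_STANDARD ** 2   # 2.5 per mille, i.e. 0.25%
expected_441 = 1000.0 / N_WITH_XAA ** 2   # about 2.27 per mille


def classify(freq_permille, expected=expected_400):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Two values taken from the table on this page.
print(classify(10.242))  # AlaAla -> "over"
print(classify(0.273))   # CysCys -> "under"
```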

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
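As a small illustration of the unit conversion (the helper names are hypothetical), a per mille value can be turned into a fraction or a percentage like this:

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla from the table: 10.242 per mille.
fraction = permille_to_fraction(10.242)
percent = permille_to_percent(10.242)
print(f"{fraction:.6f} {percent:.4f}%")
```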

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.242 ± 0.069
AlaCys: 1.256 ± 0.017
AlaAsp: 4.871 ± 0.038
AlaGlu: 5.475 ± 0.064
AlaPhe: 3.375 ± 0.031
AlaGly: 5.88 ± 0.045
AlaHis: 1.975 ± 0.023
AlaIle: 4.248 ± 0.03
AlaLys: 4.373 ± 0.04
AlaLeu: 8.188 ± 0.059
AlaMet: 2.169 ± 0.024
AlaAsn: 2.903 ± 0.028
AlaPro: 4.891 ± 0.046
AlaGln: 3.665 ± 0.037
AlaArg: 6.011 ± 0.044
AlaSer: 7.582 ± 0.054
AlaThr: 5.387 ± 0.043
AlaVal: 5.845 ± 0.043
AlaTrp: 1.348 ± 0.023
AlaTyr: 2.131 ± 0.025
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.128 ± 0.017
CysCys: 0.273 ± 0.009
CysAsp: 0.795 ± 0.013
CysGlu: 0.691 ± 0.013
CysPhe: 0.546 ± 0.011
CysGly: 1.028 ± 0.014
CysHis: 0.398 ± 0.012
CysIle: 0.694 ± 0.013
CysLys: 0.576 ± 0.013
CysLeu: 1.464 ± 0.021
CysMet: 0.273 ± 0.008
CysAsn: 0.465 ± 0.012
CysPro: 0.79 ± 0.016
CysGln: 0.602 ± 0.013
CysArg: 1.079 ± 0.018
CysSer: 1.002 ± 0.017
CysThr: 0.738 ± 0.016
CysVal: 0.861 ± 0.016
CysTrp: 0.223 ± 0.009
CysTyr: 0.373 ± 0.01
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.246 ± 0.037
AspCys: 0.749 ± 0.014
AspAsp: 4.619 ± 0.052
AspGlu: 4.602 ± 0.046
AspPhe: 2.344 ± 0.025
AspGly: 4.491 ± 0.037
AspHis: 1.353 ± 0.02
AspIle: 2.86 ± 0.028
AspLys: 2.578 ± 0.028
AspLeu: 5.18 ± 0.04
AspMet: 1.289 ± 0.019
AspAsn: 1.889 ± 0.024
AspPro: 3.178 ± 0.028
AspGln: 2.079 ± 0.023
AspArg: 3.558 ± 0.056
AspSer: 4.17 ± 0.042
AspThr: 2.737 ± 0.027
AspVal: 3.792 ± 0.031
AspTrp: 0.999 ± 0.015
AspTyr: 1.51 ± 0.018
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.109 ± 0.058
GluCys: 0.74 ± 0.015
GluAsp: 3.881 ± 0.057
GluGlu: 4.736 ± 0.067
GluPhe: 1.942 ± 0.023
GluGly: 3.449 ± 0.03
GluHis: 1.478 ± 0.019
GluIle: 2.592 ± 0.026
GluLys: 3.191 ± 0.038
GluLeu: 5.267 ± 0.043
GluMet: 1.421 ± 0.021
GluAsn: 1.896 ± 0.026
GluPro: 3.012 ± 0.045
GluGln: 2.597 ± 0.028
GluArg: 4.387 ± 0.036
GluSer: 4.137 ± 0.04
GluThr: 3.344 ± 0.033
GluVal: 3.411 ± 0.031
GluTrp: 0.93 ± 0.017
GluTyr: 1.531 ± 0.019
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.078 ± 0.032
PheCys: 0.628 ± 0.012
PheAsp: 2.402 ± 0.026
PheGlu: 2.166 ± 0.027
PhePhe: 1.539 ± 0.019
PheGly: 2.643 ± 0.033
PheHis: 0.901 ± 0.016
PheIle: 1.628 ± 0.023
PheLys: 1.395 ± 0.018
PheLeu: 3.437 ± 0.035
PheMet: 0.732 ± 0.012
PheAsn: 1.307 ± 0.02
PhePro: 1.902 ± 0.025
PheGln: 1.403 ± 0.018
PheArg: 2.221 ± 0.025
PheSer: 2.718 ± 0.03
PheThr: 1.917 ± 0.021
PheVal: 2.397 ± 0.028
PheTrp: 0.64 ± 0.013
PheTyr: 0.943 ± 0.015
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.518 ± 0.046
GlyCys: 0.993 ± 0.016
GlyAsp: 3.731 ± 0.034
GlyGlu: 3.453 ± 0.034
GlyPhe: 2.586 ± 0.03
GlyGly: 5.566 ± 0.07
GlyHis: 1.915 ± 0.031
GlyIle: 3.233 ± 0.031
GlyLys: 3.442 ± 0.035
GlyLeu: 5.942 ± 0.045
GlyMet: 1.523 ± 0.018
GlyAsn: 2.406 ± 0.032
GlyPro: 3.584 ± 0.037
GlyGln: 2.816 ± 0.034
GlyArg: 4.876 ± 0.068
GlySer: 5.706 ± 0.049
GlyThr: 3.715 ± 0.037
GlyVal: 4.01 ± 0.035
GlyTrp: 1.077 ± 0.018
GlyTyr: 2.025 ± 0.029
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.001 ± 0.022
HisCys: 0.386 ± 0.01
HisAsp: 1.521 ± 0.023
HisGlu: 1.361 ± 0.018
HisPhe: 1.023 ± 0.016
HisGly: 1.975 ± 0.028
HisHis: 0.938 ± 0.02
HisIle: 1.159 ± 0.016
HisLys: 0.976 ± 0.016
HisLeu: 2.303 ± 0.026
HisMet: 0.542 ± 0.011
HisAsn: 0.797 ± 0.013
HisPro: 1.717 ± 0.023
HisGln: 1.073 ± 0.019
HisArg: 1.8 ± 0.022
HisSer: 1.742 ± 0.022
HisThr: 1.156 ± 0.017
HisVal: 1.648 ± 0.02
HisTrp: 0.413 ± 0.009
HisTyr: 0.667 ± 0.012
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.882 ± 0.035
IleCys: 0.726 ± 0.015
IleAsp: 2.756 ± 0.024
IleGlu: 2.627 ± 0.029
IlePhe: 1.707 ± 0.024
IleGly: 2.844 ± 0.029
IleHis: 1.174 ± 0.02
IleIle: 2.088 ± 0.028
IleLys: 1.983 ± 0.023
IleLeu: 4.082 ± 0.038
IleMet: 0.936 ± 0.018
IleAsn: 1.636 ± 0.021
IlePro: 2.664 ± 0.028
IleGln: 1.838 ± 0.021
IleArg: 3.052 ± 0.03
IleSer: 3.284 ± 0.028
IleThr: 2.478 ± 0.027
IleVal: 3.017 ± 0.034
IleTrp: 0.73 ± 0.014
IleTyr: 1.161 ± 0.015
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.529 ± 0.045
LysCys: 0.549 ± 0.012
LysAsp: 2.595 ± 0.033
LysGlu: 3.068 ± 0.035
LysPhe: 1.259 ± 0.018
LysGly: 2.859 ± 0.031
LysHis: 1.154 ± 0.017
LysIle: 2.001 ± 0.024
LysLys: 2.884 ± 0.045
LysLeu: 4.059 ± 0.037
LysMet: 0.992 ± 0.014
LysAsn: 1.455 ± 0.019
LysPro: 2.653 ± 0.03
LysGln: 1.905 ± 0.025
LysArg: 3.734 ± 0.036
LysSer: 3.236 ± 0.032
LysThr: 2.844 ± 0.054
LysVal: 2.66 ± 0.026
LysTrp: 0.704 ± 0.014
LysTyr: 1.264 ± 0.018
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.37 ± 0.056
LeuCys: 1.37 ± 0.02
LeuAsp: 5.563 ± 0.042
LeuGlu: 5.529 ± 0.045
LeuPhe: 3.18 ± 0.033
LeuGly: 6.053 ± 0.046
LeuHis: 2.257 ± 0.026
LeuIle: 3.493 ± 0.034
LeuLys: 3.906 ± 0.039
LeuLeu: 8.503 ± 0.075
LeuMet: 1.724 ± 0.019
LeuAsn: 2.857 ± 0.025
LeuPro: 5.55 ± 0.046
LeuGln: 4.067 ± 0.033
LeuArg: 6.807 ± 0.044
LeuSer: 7.131 ± 0.051
LeuThr: 4.634 ± 0.036
LeuVal: 5.542 ± 0.05
LeuTrp: 1.277 ± 0.02
LeuTyr: 2.192 ± 0.025
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.432 ± 0.025
MetCys: 0.247 ± 0.007
MetAsp: 1.381 ± 0.022
MetGlu: 1.254 ± 0.018
MetPhe: 0.632 ± 0.012
MetGly: 1.39 ± 0.02
MetHis: 0.486 ± 0.01
MetIle: 0.869 ± 0.015
MetLys: 0.935 ± 0.017
MetLeu: 1.879 ± 0.023
MetMet: 0.579 ± 0.012
MetAsn: 0.739 ± 0.015
MetPro: 1.354 ± 0.018
MetGln: 0.849 ± 0.016
MetArg: 1.414 ± 0.017
MetSer: 1.828 ± 0.021
MetThr: 1.329 ± 0.021
MetVal: 1.279 ± 0.019
MetTrp: 0.245 ± 0.008
MetTyr: 0.429 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.023 ± 0.031
AsnCys: 0.476 ± 0.011
AsnAsp: 1.854 ± 0.02
AsnGlu: 1.795 ± 0.022
AsnPhe: 1.172 ± 0.017
AsnGly: 2.647 ± 0.035
AsnHis: 0.868 ± 0.017
AsnIle: 1.69 ± 0.023
AsnLys: 1.467 ± 0.023
AsnLeu: 2.997 ± 0.03
AsnMet: 0.81 ± 0.016
AsnAsn: 1.277 ± 0.025
AsnPro: 2.165 ± 0.024
AsnGln: 1.325 ± 0.021
AsnArg: 2.137 ± 0.023
AsnSer: 2.366 ± 0.026
AsnThr: 1.776 ± 0.021
AsnVal: 2.118 ± 0.021
AsnTrp: 0.506 ± 0.012
AsnTyr: 0.861 ± 0.015
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.54 ± 0.05
ProCys: 0.673 ± 0.013
ProAsp: 3.414 ± 0.027
ProGlu: 3.719 ± 0.035
ProPhe: 2.008 ± 0.023
ProGly: 4.126 ± 0.045
ProHis: 1.334 ± 0.02
ProIle: 2.414 ± 0.028
ProLys: 2.474 ± 0.031
ProLeu: 4.809 ± 0.04
ProMet: 1.061 ± 0.018
ProAsn: 1.904 ± 0.025
ProPro: 4.951 ± 0.069
ProGln: 2.458 ± 0.035
ProArg: 4.004 ± 0.039
ProSer: 5.679 ± 0.068
ProThr: 3.617 ± 0.04
ProVal: 3.622 ± 0.039
ProTrp: 0.806 ± 0.014
ProTyr: 1.311 ± 0.026
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.006 ± 0.039
GlnCys: 0.514 ± 0.012
GlnAsp: 2.34 ± 0.027
GlnGlu: 2.613 ± 0.052
GlnPhe: 1.268 ± 0.017
GlnGly: 2.89 ± 0.038
GlnHis: 1.157 ± 0.019
GlnIle: 1.708 ± 0.022
GlnLys: 1.865 ± 0.025
GlnLeu: 3.805 ± 0.035
GlnMet: 0.916 ± 0.017
GlnAsn: 1.363 ± 0.02
GlnPro: 2.631 ± 0.037
GlnGln: 2.644 ± 0.055
GlnArg: 3.278 ± 0.028
GlnSer: 3.081 ± 0.041
GlnThr: 2.209 ± 0.028
GlnVal: 2.341 ± 0.026
GlnTrp: 0.653 ± 0.012
GlnTyr: 1.063 ± 0.016
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.72 ± 0.044
ArgCys: 1.024 ± 0.016
ArgAsp: 4.198 ± 0.04
ArgGlu: 4.191 ± 0.037
ArgPhe: 2.485 ± 0.027
ArgGly: 4.379 ± 0.037
ArgHis: 2.018 ± 0.026
ArgIle: 3.053 ± 0.029
ArgLys: 3.708 ± 0.055
ArgLeu: 6.664 ± 0.05
ArgMet: 1.401 ± 0.017
ArgAsn: 2.365 ± 0.025
ArgPro: 4.176 ± 0.043
ArgGln: 3.6 ± 0.053
ArgArg: 6.578 ± 0.054
ArgSer: 5.152 ± 0.039
ArgThr: 3.542 ± 0.034
ArgVal: 3.892 ± 0.032
ArgTrp: 1.17 ± 0.019
ArgTyr: 1.742 ± 0.022
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.751 ± 0.049
SerCys: 1.035 ± 0.015
SerAsp: 4.159 ± 0.035
SerGlu: 3.882 ± 0.034
SerPhe: 2.925 ± 0.027
SerGly: 5.25 ± 0.042
SerHis: 1.965 ± 0.024
SerIle: 3.582 ± 0.035
SerLys: 3.412 ± 0.034
SerLeu: 7.009 ± 0.05
SerMet: 1.726 ± 0.024
SerAsn: 2.552 ± 0.03
SerPro: 5.324 ± 0.06
SerGln: 3.322 ± 0.038
SerArg: 5.649 ± 0.049
SerSer: 7.941 ± 0.087
SerThr: 4.798 ± 0.045
SerVal: 4.326 ± 0.04
SerTrp: 1.199 ± 0.019
SerTyr: 1.886 ± 0.029
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.263 ± 0.042
ThrCys: 0.751 ± 0.016
ThrAsp: 2.84 ± 0.027
ThrGlu: 2.811 ± 0.035
ThrPhe: 2.013 ± 0.024
ThrGly: 3.978 ± 0.063
ThrHis: 1.226 ± 0.019
ThrIle: 2.703 ± 0.027
ThrLys: 2.539 ± 0.027
ThrLeu: 4.931 ± 0.038
ThrMet: 1.184 ± 0.016
ThrAsn: 1.75 ± 0.02
ThrPro: 3.878 ± 0.043
ThrGln: 1.969 ± 0.025
ThrArg: 3.568 ± 0.03
ThrSer: 4.711 ± 0.046
ThrThr: 3.604 ± 0.051
ThrVal: 3.587 ± 0.038
ThrTrp: 0.861 ± 0.015
ThrTyr: 1.429 ± 0.02
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.692 ± 0.044
ValCys: 0.921 ± 0.016
ValAsp: 3.844 ± 0.03
ValGlu: 3.847 ± 0.037
ValPhe: 2.401 ± 0.025
ValGly: 3.879 ± 0.038
ValHis: 1.424 ± 0.019
ValIle: 2.703 ± 0.028
ValLys: 2.774 ± 0.027
ValLeu: 5.565 ± 0.042
ValMet: 1.341 ± 0.018
ValAsn: 2.169 ± 0.026
ValPro: 3.526 ± 0.034
ValGln: 2.484 ± 0.023
ValArg: 3.953 ± 0.04
ValSer: 4.433 ± 0.036
ValThr: 3.399 ± 0.035
ValVal: 4.43 ± 0.044
ValTrp: 0.881 ± 0.015
ValTyr: 1.576 ± 0.022
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.241 ± 0.021
TrpCys: 0.216 ± 0.007
TrpAsp: 0.875 ± 0.017
TrpGlu: 0.839 ± 0.013
TrpPhe: 0.532 ± 0.012
TrpGly: 0.823 ± 0.016
TrpHis: 0.451 ± 0.01
TrpIle: 0.802 ± 0.013
TrpLys: 0.875 ± 0.015
TrpLeu: 1.516 ± 0.023
TrpMet: 0.376 ± 0.01
TrpAsn: 0.647 ± 0.014
TrpPro: 0.691 ± 0.012
TrpGln: 0.639 ± 0.011
TrpArg: 1.163 ± 0.017
TrpSer: 1.031 ± 0.014
TrpThr: 1.025 ± 0.017
TrpVal: 0.918 ± 0.016
TrpTrp: 0.298 ± 0.008
TrpTyr: 0.446 ± 0.01
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.046 ± 0.026
TyrCys: 0.469 ± 0.012
TyrAsp: 1.585 ± 0.019
TyrGlu: 1.391 ± 0.019
TyrPhe: 1.06 ± 0.019
TyrGly: 1.966 ± 0.032
TyrHis: 0.727 ± 0.012
TyrIle: 1.144 ± 0.016
TyrLys: 1.053 ± 0.018
TyrLeu: 2.406 ± 0.035
TyrMet: 0.557 ± 0.011
TyrAsn: 0.953 ± 0.017
TyrPro: 1.288 ± 0.021
TyrGln: 1.008 ± 0.015
TyrArg: 1.787 ± 0.022
TyrSer: 1.763 ± 0.022
TyrThr: 1.347 ± 0.019
TyrVal: 1.546 ± 0.018
TyrTrp: 0.445 ± 0.011
TyrTyr: 0.844 ± 0.044
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9072 proteins (4418322 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
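A protein-level bootstrap of this kind might look like the sketch below. This is an illustration under stated assumptions, not the database's code: the function names, the one-letter toy sequences, the fixed seed, and the use of the standard deviation across resamples as the reported error are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide frequency.

    Whole proteins are resampled with replacement, so residues from the
    same protein always stay together (resampling at the protein level).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
mean_aa, err_aa = bootstrap_error(["MAAL", "AAK", "MKAA", "MLLK"], "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f}")
```

Resampling proteins rather than individual residue pairs keeps the correlation between pairs from the same protein, which is presumably why the note specifies the protein level.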

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski