Amino acid dipepetide frequency for Coccidioides immitis H538.4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.922AlaAla: 7.922 ± 0.055
1.056AlaCys: 1.056 ± 0.021
3.784AlaAsp: 3.784 ± 0.035
4.957AlaGlu: 4.957 ± 0.043
2.963AlaPhe: 2.963 ± 0.03
5.369AlaGly: 5.369 ± 0.04
1.725AlaHis: 1.725 ± 0.022
4.147AlaIle: 4.147 ± 0.036
4.076AlaLys: 4.076 ± 0.035
7.335AlaLeu: 7.335 ± 0.052
1.821AlaMet: 1.821 ± 0.022
2.843AlaAsn: 2.843 ± 0.026
4.455AlaPro: 4.455 ± 0.046
3.133AlaGln: 3.133 ± 0.028
5.065AlaArg: 5.065 ± 0.04
7.038AlaSer: 7.038 ± 0.049
4.77AlaThr: 4.77 ± 0.037
4.942AlaVal: 4.942 ± 0.036
1.005AlaTrp: 1.005 ± 0.018
1.899AlaTyr: 1.899 ± 0.024
0.0AlaXaa: 0.0 ± 0.0
Cys
0.955CysAla: 0.955 ± 0.016
0.319CysCys: 0.319 ± 0.011
0.665CysAsp: 0.665 ± 0.014
0.666CysGlu: 0.666 ± 0.013
0.578CysPhe: 0.578 ± 0.013
0.997CysGly: 0.997 ± 0.017
0.392CysHis: 0.392 ± 0.011
0.749CysIle: 0.749 ± 0.015
0.552CysLys: 0.552 ± 0.012
1.375CysLeu: 1.375 ± 0.019
0.317CysMet: 0.317 ± 0.009
0.458CysAsn: 0.458 ± 0.01
0.776CysPro: 0.776 ± 0.017
0.505CysGln: 0.505 ± 0.012
0.949CysArg: 0.949 ± 0.018
1.099CysSer: 1.099 ± 0.018
0.682CysThr: 0.682 ± 0.013
0.807CysVal: 0.807 ± 0.015
0.206CysTrp: 0.206 ± 0.008
0.368CysTyr: 0.368 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
4.152AspAla: 4.152 ± 0.039
0.669AspCys: 0.669 ± 0.015
3.977AspAsp: 3.977 ± 0.047
4.246AspGlu: 4.246 ± 0.041
2.102AspPhe: 2.102 ± 0.026
3.861AspGly: 3.861 ± 0.033
1.237AspHis: 1.237 ± 0.019
3.268AspIle: 3.268 ± 0.032
2.341AspLys: 2.341 ± 0.026
4.883AspLeu: 4.883 ± 0.041
1.209AspMet: 1.209 ± 0.018
1.869AspAsn: 1.869 ± 0.022
3.334AspPro: 3.334 ± 0.032
1.813AspGln: 1.813 ± 0.021
3.168AspArg: 3.168 ± 0.035
4.18AspSer: 4.18 ± 0.036
2.804AspThr: 2.804 ± 0.027
3.481AspVal: 3.481 ± 0.03
0.791AspTrp: 0.791 ± 0.015
1.483AspTyr: 1.483 ± 0.019
0.0AspXaa: 0.0 ± 0.0
Glu
5.101GluAla: 5.101 ± 0.042
0.696GluCys: 0.696 ± 0.015
4.056GluAsp: 4.056 ± 0.042
5.512GluGlu: 5.512 ± 0.063
2.043GluPhe: 2.043 ± 0.024
3.633GluGly: 3.633 ± 0.032
1.367GluHis: 1.367 ± 0.02
3.287GluIle: 3.287 ± 0.028
3.865GluLys: 3.865 ± 0.038
5.295GluLeu: 5.295 ± 0.042
1.476GluMet: 1.476 ± 0.022
2.541GluAsn: 2.541 ± 0.032
2.945GluPro: 2.945 ± 0.031
2.463GluGln: 2.463 ± 0.028
4.271GluArg: 4.271 ± 0.043
4.581GluSer: 4.581 ± 0.04
3.38GluThr: 3.38 ± 0.032
3.394GluVal: 3.394 ± 0.033
0.867GluTrp: 0.867 ± 0.015
1.666GluTyr: 1.666 ± 0.021
0.0GluXaa: 0.0 ± 0.0
Phe
2.838PheAla: 2.838 ± 0.03
0.628PheCys: 0.628 ± 0.014
2.183PheAsp: 2.183 ± 0.026
2.165PheGlu: 2.165 ± 0.026
1.539PhePhe: 1.539 ± 0.024
2.718PheGly: 2.718 ± 0.032
0.94PheHis: 0.94 ± 0.014
1.804PheIle: 1.804 ± 0.023
1.579PheLys: 1.579 ± 0.02
3.491PheLeu: 3.491 ± 0.032
0.77PheMet: 0.77 ± 0.013
1.425PheAsn: 1.425 ± 0.021
2.122PhePro: 2.122 ± 0.026
1.431PheGln: 1.431 ± 0.021
2.179PheArg: 2.179 ± 0.027
3.248PheSer: 3.248 ± 0.03
2.069PheThr: 2.069 ± 0.026
2.245PheVal: 2.245 ± 0.023
0.597PheTrp: 0.597 ± 0.014
1.054PheTyr: 1.054 ± 0.018
0.0PheXaa: 0.0 ± 0.0
Gly
4.829GlyAla: 4.829 ± 0.041
0.914GlyCys: 0.914 ± 0.017
3.522GlyAsp: 3.522 ± 0.033
3.719GlyGlu: 3.719 ± 0.036
2.655GlyPhe: 2.655 ± 0.034
5.426GlyGly: 5.426 ± 0.059
1.613GlyHis: 1.613 ± 0.021
3.541GlyIle: 3.541 ± 0.033
3.589GlyLys: 3.589 ± 0.039
5.841GlyLeu: 5.841 ± 0.044
1.582GlyMet: 1.582 ± 0.019
2.548GlyAsn: 2.548 ± 0.026
3.273GlyPro: 3.273 ± 0.035
2.45GlyGln: 2.45 ± 0.027
4.417GlyArg: 4.417 ± 0.036
5.659GlySer: 5.659 ± 0.045
3.689GlyThr: 3.689 ± 0.034
4.115GlyVal: 4.115 ± 0.04
1.127GlyTrp: 1.127 ± 0.016
2.017GlyTyr: 2.017 ± 0.027
0.0GlyXaa: 0.0 ± 0.0
His
1.837HisAla: 1.837 ± 0.022
0.383HisCys: 0.383 ± 0.012
1.269HisAsp: 1.269 ± 0.018
1.305HisGlu: 1.305 ± 0.021
0.968HisPhe: 0.968 ± 0.016
1.759HisGly: 1.759 ± 0.023
0.858HisHis: 0.858 ± 0.018
1.325HisIle: 1.325 ± 0.018
0.964HisLys: 0.964 ± 0.016
2.295HisLeu: 2.295 ± 0.024
0.512HisMet: 0.512 ± 0.013
0.879HisAsn: 0.879 ± 0.016
1.81HisPro: 1.81 ± 0.02
1.019HisGln: 1.019 ± 0.018
1.767HisArg: 1.767 ± 0.02
2.033HisSer: 2.033 ± 0.023
1.278HisThr: 1.278 ± 0.017
1.378HisVal: 1.378 ± 0.018
0.335HisTrp: 0.335 ± 0.009
0.654HisTyr: 0.654 ± 0.014
0.0HisXaa: 0.0 ± 0.0
Ile
3.997IleAla: 3.997 ± 0.038
0.823IleCys: 0.823 ± 0.016
2.872IleAsp: 2.872 ± 0.029
2.95IleGlu: 2.95 ± 0.03
2.096IlePhe: 2.096 ± 0.024
3.059IleGly: 3.059 ± 0.03
1.325IleHis: 1.325 ± 0.018
2.613IleIle: 2.613 ± 0.031
2.328IleLys: 2.328 ± 0.026
4.829IleLeu: 4.829 ± 0.04
1.056IleMet: 1.056 ± 0.018
1.836IleAsn: 1.836 ± 0.024
3.412IlePro: 3.412 ± 0.028
1.988IleGln: 1.988 ± 0.023
3.134IleArg: 3.134 ± 0.034
4.366IleSer: 4.366 ± 0.036
2.775IleThr: 2.775 ± 0.027
3.046IleVal: 3.046 ± 0.03
0.7IleTrp: 0.7 ± 0.013
1.42IleTyr: 1.42 ± 0.019
0.0IleXaa: 0.0 ± 0.0
Lys
4.213LysAla: 4.213 ± 0.037
0.583LysCys: 0.583 ± 0.013
2.821LysAsp: 2.821 ± 0.029
3.584LysGlu: 3.584 ± 0.039
1.592LysPhe: 1.592 ± 0.02
3.081LysGly: 3.081 ± 0.034
1.178LysHis: 1.178 ± 0.019
2.429LysIle: 2.429 ± 0.025
3.508LysLys: 3.508 ± 0.049
4.349LysLeu: 4.349 ± 0.037
1.089LysMet: 1.089 ± 0.016
1.833LysAsn: 1.833 ± 0.02
2.894LysPro: 2.894 ± 0.032
2.05LysGln: 2.05 ± 0.022
3.889LysArg: 3.889 ± 0.034
3.814LysSer: 3.814 ± 0.039
2.743LysThr: 2.743 ± 0.026
2.818LysVal: 2.818 ± 0.027
0.664LysTrp: 0.664 ± 0.014
1.446LysTyr: 1.446 ± 0.02
0.0LysXaa: 0.0 ± 0.0
Leu
7.309LeuAla: 7.309 ± 0.051
1.315LeuCys: 1.315 ± 0.022
5.082LeuAsp: 5.082 ± 0.038
5.646LeuGlu: 5.646 ± 0.048
3.357LeuPhe: 3.357 ± 0.034
5.79LeuGly: 5.79 ± 0.039
2.359LeuHis: 2.359 ± 0.025
4.032LeuIle: 4.032 ± 0.032
4.485LeuLys: 4.485 ± 0.037
8.464LeuLeu: 8.464 ± 0.07
1.78LeuMet: 1.78 ± 0.022
3.294LeuAsn: 3.294 ± 0.03
5.589LeuPro: 5.589 ± 0.042
3.863LeuGln: 3.863 ± 0.038
6.254LeuArg: 6.254 ± 0.048
7.868LeuSer: 7.868 ± 0.054
4.601LeuThr: 4.601 ± 0.039
5.201LeuVal: 5.201 ± 0.041
1.157LeuTrp: 1.157 ± 0.018
2.291LeuTyr: 2.291 ± 0.024
0.0LeuXaa: 0.0 ± 0.0
Met
2.214MetAla: 2.214 ± 0.022
0.252MetCys: 0.252 ± 0.009
1.241MetAsp: 1.241 ± 0.017
1.394MetGlu: 1.394 ± 0.018
0.731MetPhe: 0.731 ± 0.014
1.453MetGly: 1.453 ± 0.02
0.49MetHis: 0.49 ± 0.011
1.042MetIle: 1.042 ± 0.017
1.122MetLys: 1.122 ± 0.017
1.835MetLeu: 1.835 ± 0.024
0.547MetMet: 0.547 ± 0.011
0.814MetAsn: 0.814 ± 0.014
1.249MetPro: 1.249 ± 0.018
0.854MetGln: 0.854 ± 0.015
1.323MetArg: 1.323 ± 0.019
1.833MetSer: 1.833 ± 0.025
1.23MetThr: 1.23 ± 0.017
1.27MetVal: 1.27 ± 0.02
0.262MetTrp: 0.262 ± 0.008
0.501MetTyr: 0.501 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
2.99AsnAla: 2.99 ± 0.032
0.497AsnCys: 0.497 ± 0.011
1.957AsnAsp: 1.957 ± 0.022
2.106AsnGlu: 2.106 ± 0.025
1.396AsnPhe: 1.396 ± 0.02
2.89AsnGly: 2.89 ± 0.03
0.916AsnHis: 0.916 ± 0.015
2.184AsnIle: 2.184 ± 0.024
1.649AsnLys: 1.649 ± 0.021
3.345AsnLeu: 3.345 ± 0.032
0.828AsnMet: 0.828 ± 0.017
1.531AsnAsn: 1.531 ± 0.023
2.666AsnPro: 2.666 ± 0.026
1.376AsnGln: 1.376 ± 0.022
2.216AsnArg: 2.216 ± 0.023
2.898AsnSer: 2.898 ± 0.028
2.109AsnThr: 2.109 ± 0.027
2.269AsnVal: 2.269 ± 0.022
0.53AsnTrp: 0.53 ± 0.012
0.99AsnTyr: 0.99 ± 0.018
0.0AsnXaa: 0.0 ± 0.0
Pro
4.99ProAla: 4.99 ± 0.045
0.677ProCys: 0.677 ± 0.014
3.148ProAsp: 3.148 ± 0.029
3.941ProGlu: 3.941 ± 0.033
2.173ProPhe: 2.173 ± 0.026
4.023ProGly: 4.023 ± 0.037
1.459ProHis: 1.459 ± 0.021
2.688ProIle: 2.688 ± 0.029
2.869ProLys: 2.869 ± 0.032
4.932ProLeu: 4.932 ± 0.041
1.103ProMet: 1.103 ± 0.018
2.356ProAsn: 2.356 ± 0.031
5.626ProPro: 5.626 ± 0.075
2.533ProGln: 2.533 ± 0.032
3.796ProArg: 3.796 ± 0.04
6.477ProSer: 6.477 ± 0.053
3.975ProThr: 3.975 ± 0.032
3.607ProVal: 3.607 ± 0.031
0.741ProTrp: 0.741 ± 0.015
1.519ProTyr: 1.519 ± 0.023
0.0ProXaa: 0.0 ± 0.0
Gln
3.243GlnAla: 3.243 ± 0.032
0.494GlnCys: 0.494 ± 0.012
1.95GlnAsp: 1.95 ± 0.022
2.405GlnGlu: 2.405 ± 0.027
1.312GlnPhe: 1.312 ± 0.022
2.431GlnGly: 2.431 ± 0.026
1.084GlnHis: 1.084 ± 0.018
1.917GlnIle: 1.917 ± 0.024
2.14GlnLys: 2.14 ± 0.024
3.478GlnLeu: 3.478 ± 0.034
0.885GlnMet: 0.885 ± 0.015
1.628GlnAsn: 1.628 ± 0.022
2.716GlnPro: 2.716 ± 0.039
2.26GlnGln: 2.26 ± 0.041
2.851GlnArg: 2.851 ± 0.029
3.285GlnSer: 3.285 ± 0.035
2.203GlnThr: 2.203 ± 0.022
2.036GlnVal: 2.036 ± 0.02
0.503GlnTrp: 0.503 ± 0.012
1.054GlnTyr: 1.054 ± 0.019
0.0GlnXaa: 0.0 ± 0.0
Arg
4.898ArgAla: 4.898 ± 0.036
0.898ArgCys: 0.898 ± 0.016
3.536ArgAsp: 3.536 ± 0.039
4.233ArgGlu: 4.233 ± 0.04
2.307ArgPhe: 2.307 ± 0.028
4.022ArgGly: 4.022 ± 0.044
1.744ArgHis: 1.744 ± 0.023
3.274ArgIle: 3.274 ± 0.03
3.833ArgLys: 3.833 ± 0.039
5.869ArgLeu: 5.869 ± 0.044
1.462ArgMet: 1.462 ± 0.021
2.551ArgAsn: 2.551 ± 0.026
3.92ArgPro: 3.92 ± 0.036
2.793ArgGln: 2.793 ± 0.031
6.036ArgArg: 6.036 ± 0.051
5.578ArgSer: 5.578 ± 0.056
3.544ArgThr: 3.544 ± 0.029
3.577ArgVal: 3.577 ± 0.034
0.961ArgTrp: 0.961 ± 0.016
1.738ArgTyr: 1.738 ± 0.019
0.0ArgXaa: 0.0 ± 0.0
Ser
6.566SerAla: 6.566 ± 0.042
0.976SerCys: 0.976 ± 0.017
4.198SerAsp: 4.198 ± 0.036
4.438SerGlu: 4.438 ± 0.039
3.126SerPhe: 3.126 ± 0.032
5.614SerGly: 5.614 ± 0.042
2.184SerHis: 2.184 ± 0.026
4.226SerIle: 4.226 ± 0.037
4.139SerLys: 4.139 ± 0.041
7.805SerLeu: 7.805 ± 0.047
1.826SerMet: 1.826 ± 0.022
3.161SerAsn: 3.161 ± 0.028
5.99SerPro: 5.99 ± 0.054
3.513SerGln: 3.513 ± 0.033
5.909SerArg: 5.909 ± 0.051
9.504SerSer: 9.504 ± 0.067
5.627SerThr: 5.627 ± 0.042
4.554SerVal: 4.554 ± 0.04
1.076SerTrp: 1.076 ± 0.018
2.005SerTyr: 2.005 ± 0.025
0.0SerXaa: 0.0 ± 0.0
Thr
4.677ThrAla: 4.677 ± 0.038
0.755ThrCys: 0.755 ± 0.015
2.667ThrAsp: 2.667 ± 0.027
3.058ThrGlu: 3.058 ± 0.027
2.129ThrPhe: 2.129 ± 0.023
3.965ThrGly: 3.965 ± 0.029
1.309ThrHis: 1.309 ± 0.021
2.975ThrIle: 2.975 ± 0.027
2.684ThrLys: 2.684 ± 0.028
5.064ThrLeu: 5.064 ± 0.038
1.181ThrMet: 1.181 ± 0.02
1.973ThrAsn: 1.973 ± 0.026
4.208ThrPro: 4.208 ± 0.042
2.012ThrGln: 2.012 ± 0.02
3.348ThrArg: 3.348 ± 0.033
5.267ThrSer: 5.267 ± 0.041
3.76ThrThr: 3.76 ± 0.043
3.432ThrVal: 3.432 ± 0.034
0.776ThrTrp: 0.776 ± 0.015
1.43ThrTyr: 1.43 ± 0.019
0.0ThrXaa: 0.0 ± 0.0
Val
4.68ValAla: 4.68 ± 0.04
0.842ValCys: 0.842 ± 0.017
3.554ValAsp: 3.554 ± 0.034
3.824ValGlu: 3.824 ± 0.035
2.288ValPhe: 2.288 ± 0.032
3.687ValGly: 3.687 ± 0.038
1.369ValHis: 1.369 ± 0.018
2.899ValIle: 2.899 ± 0.028
2.952ValLys: 2.952 ± 0.027
5.356ValLeu: 5.356 ± 0.047
1.242ValMet: 1.242 ± 0.02
2.189ValAsn: 2.189 ± 0.024
3.545ValPro: 3.545 ± 0.037
2.244ValGln: 2.244 ± 0.024
3.518ValArg: 3.518 ± 0.034
4.675ValSer: 4.675 ± 0.041
3.216ValThr: 3.216 ± 0.031
3.912ValVal: 3.912 ± 0.038
0.809ValTrp: 0.809 ± 0.015
1.579ValTyr: 1.579 ± 0.019
0.0ValXaa: 0.0 ± 0.0
Trp
1.034TrpAla: 1.034 ± 0.016
0.204TrpCys: 0.204 ± 0.007
0.837TrpAsp: 0.837 ± 0.017
0.846TrpGlu: 0.846 ± 0.017
0.484TrpPhe: 0.484 ± 0.011
0.839TrpGly: 0.839 ± 0.014
0.332TrpHis: 0.332 ± 0.01
0.74TrpIle: 0.74 ± 0.015
0.803TrpLys: 0.803 ± 0.016
1.301TrpLeu: 1.301 ± 0.02
0.374TrpMet: 0.374 ± 0.01
0.617TrpAsn: 0.617 ± 0.013
0.609TrpPro: 0.609 ± 0.012
0.524TrpGln: 0.524 ± 0.012
1.001TrpArg: 1.001 ± 0.015
0.998TrpSer: 0.998 ± 0.015
0.802TrpThr: 0.802 ± 0.014
0.792TrpVal: 0.792 ± 0.015
0.257TrpTrp: 0.257 ± 0.009
0.362TrpTyr: 0.362 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.862TyrAla: 1.862 ± 0.024
0.432TyrCys: 0.432 ± 0.011
1.549TyrAsp: 1.549 ± 0.023
1.441TyrGlu: 1.441 ± 0.019
1.183TyrPhe: 1.183 ± 0.019
1.912TyrGly: 1.912 ± 0.026
0.746TyrHis: 0.746 ± 0.014
1.484TyrIle: 1.484 ± 0.021
1.099TyrLys: 1.099 ± 0.019
2.597TyrLeu: 2.597 ± 0.027
0.597TyrMet: 0.597 ± 0.013
0.997TyrAsn: 0.997 ± 0.016
1.494TyrPro: 1.494 ± 0.021
1.051TyrGln: 1.051 ± 0.017
1.687TyrArg: 1.687 ± 0.022
2.038TyrSer: 2.038 ± 0.026
1.415TyrThr: 1.415 ± 0.02
1.502TyrVal: 1.502 ± 0.02
0.393TyrTrp: 0.393 ± 0.011
0.849TyrTyr: 0.849 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10588 proteins (3769138 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski