Amino acid dipeptide frequency for Capronia epimyces CBS 606.96

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
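The counting described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the database's actual pipeline: the function names `dipeptide_frequencies` and `classify`, the toy sequences, the use of one-letter codes, and the choice to count overlapping adjacent pairs within each protein are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Every ordered pair of standard residues: 20 * 20 = 400 possible dipeptides.
ALL_DIPEPTIDES = ["".join(p) for p in product(STANDARD, repeat=2)]


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all adjacent residue pairs in `proteins`."""
    counts = Counter()
    for seq in proteins:
        # Pairs are taken within one protein only; they never span two proteins.
        for first, second in zip(seq, seq[1:]):
            # Assumption: any letter outside the 20 standard ones is treated as X (Xaa).
            a = first if first in STANDARD else "X"
            b = second if second in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}


def classify(freq, expected=1 / len(ALL_DIPEPTIDES)):
    """Compare a frequency with the uniform expectation (1/400 = 0.25%)."""
    if freq > expected:
        return "over"   # would be marked red in the table
    if freq < expected:
        return "under"  # would be marked blue in the table
    return "as expected"


# Hypothetical toy input, not taken from the proteome.
toy_proteins = ["MAAALK", "MKAAL"]
freqs = dipeptide_frequencies(toy_proteins)
labels = {pair: classify(f) for pair, f in freqs.items()}
```

On a real proteome the same function would be fed all protein sequences, and dipeptides that never occur would simply be absent from the returned dictionary (frequency 0).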

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
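As a small worked example of the unit conversion, the sketch below converts a per mille value to a fraction and to a percentage. The helper names are hypothetical; the value 9.34 is the AlaAla entry from the table, and 2.5‰ is the 0.25% uniform expectation expressed in the same unit.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1% = 10 per mille)."""
    return value / 10


ala_ala = 9.34           # AlaAla entry from the table, in per mille
uniform_expectation = 2.5  # 0.25% expressed in per mille

ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_percent = per_mille_to_percent(ala_ala)
# Ratio of observed to uniformly expected frequency for AlaAla.
enrichment = ala_ala / uniform_expectation
```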

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.34AlaAla: 9.34 ± 0.067
1.063AlaCys: 1.063 ± 0.016
4.561AlaAsp: 4.561 ± 0.03
5.385AlaGlu: 5.385 ± 0.042
3.182AlaPhe: 3.182 ± 0.028
6.246AlaGly: 6.246 ± 0.04
1.894AlaHis: 1.894 ± 0.023
4.273AlaIle: 4.273 ± 0.032
3.98AlaLys: 3.98 ± 0.037
7.953AlaLeu: 7.953 ± 0.047
1.896AlaMet: 1.896 ± 0.02
2.945AlaAsn: 2.945 ± 0.023
4.662AlaPro: 4.662 ± 0.035
3.666AlaGln: 3.666 ± 0.03
5.333AlaArg: 5.333 ± 0.033
7.453AlaSer: 7.453 ± 0.047
5.611AlaThr: 5.611 ± 0.041
5.773AlaVal: 5.773 ± 0.041
1.197AlaTrp: 1.197 ± 0.017
2.229AlaTyr: 2.229 ± 0.02
0.0AlaXaa: 0.0 ± 0.0
Cys
0.924CysAla: 0.924 ± 0.016
0.208CysCys: 0.208 ± 0.007
0.602CysAsp: 0.602 ± 0.011
0.541CysGlu: 0.541 ± 0.011
0.511CysPhe: 0.511 ± 0.01
0.856CysGly: 0.856 ± 0.016
0.332CysHis: 0.332 ± 0.008
0.665CysIle: 0.665 ± 0.012
0.47CysLys: 0.47 ± 0.011
1.283CysLeu: 1.283 ± 0.017
0.242CysMet: 0.242 ± 0.007
0.386CysAsn: 0.386 ± 0.009
0.641CysPro: 0.641 ± 0.013
0.432CysGln: 0.432 ± 0.008
0.735CysArg: 0.735 ± 0.014
0.823CysSer: 0.823 ± 0.014
0.641CysThr: 0.641 ± 0.012
0.768CysVal: 0.768 ± 0.013
0.191CysTrp: 0.191 ± 0.006
0.347CysTyr: 0.347 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
4.788AspAla: 4.788 ± 0.033
0.603AspCys: 0.603 ± 0.013
4.323AspAsp: 4.323 ± 0.045
4.514AspGlu: 4.514 ± 0.041
2.224AspPhe: 2.224 ± 0.023
4.16AspGly: 4.16 ± 0.034
1.429AspHis: 1.429 ± 0.019
2.886AspIle: 2.886 ± 0.03
2.382AspLys: 2.382 ± 0.023
5.393AspLeu: 5.393 ± 0.038
1.187AspMet: 1.187 ± 0.017
1.784AspAsn: 1.784 ± 0.019
3.511AspPro: 3.511 ± 0.028
2.15AspGln: 2.15 ± 0.021
3.265AspArg: 3.265 ± 0.029
4.046AspSer: 4.046 ± 0.032
3.017AspThr: 3.017 ± 0.027
3.773AspVal: 3.773 ± 0.031
0.887AspTrp: 0.887 ± 0.015
1.569AspTyr: 1.569 ± 0.017
0.0AspXaa: 0.0 ± 0.0
Glu
5.57GluAla: 5.57 ± 0.042
0.566GluCys: 0.566 ± 0.012
4.153GluAsp: 4.153 ± 0.042
5.116GluGlu: 5.116 ± 0.046
1.844GluPhe: 1.844 ± 0.019
3.605GluGly: 3.605 ± 0.028
1.471GluHis: 1.471 ± 0.02
2.983GluIle: 2.983 ± 0.027
3.444GluLys: 3.444 ± 0.033
5.158GluLeu: 5.158 ± 0.037
1.364GluMet: 1.364 ± 0.016
2.055GluAsn: 2.055 ± 0.022
2.771GluPro: 2.771 ± 0.028
2.614GluGln: 2.614 ± 0.027
3.932GluArg: 3.932 ± 0.032
4.137GluSer: 4.137 ± 0.033
3.47GluThr: 3.47 ± 0.029
3.699GluVal: 3.699 ± 0.031
0.848GluTrp: 0.848 ± 0.013
1.599GluTyr: 1.599 ± 0.019
0.0GluXaa: 0.0 ± 0.0
Phe
3.103PheAla: 3.103 ± 0.029
0.538PheCys: 0.538 ± 0.011
2.29PheAsp: 2.29 ± 0.023
2.111PheGlu: 2.111 ± 0.022
1.578PhePhe: 1.578 ± 0.022
2.862PheGly: 2.862 ± 0.03
0.911PheHis: 0.911 ± 0.014
1.703PheIle: 1.703 ± 0.021
1.388PheLys: 1.388 ± 0.016
3.483PheLeu: 3.483 ± 0.036
0.739PheMet: 0.739 ± 0.013
1.329PheAsn: 1.329 ± 0.016
1.933PhePro: 1.933 ± 0.018
1.415PheGln: 1.415 ± 0.017
1.958PheArg: 1.958 ± 0.02
2.851PheSer: 2.851 ± 0.028
2.018PheThr: 2.018 ± 0.018
2.396PheVal: 2.396 ± 0.024
0.655PheTrp: 0.655 ± 0.013
1.075PheTyr: 1.075 ± 0.015
0.0PheXaa: 0.0 ± 0.0
Gly
5.436GlyAla: 5.436 ± 0.039
0.811GlyCys: 0.811 ± 0.014
3.643GlyAsp: 3.643 ± 0.034
3.523GlyGlu: 3.523 ± 0.028
2.824GlyPhe: 2.824 ± 0.027
5.751GlyGly: 5.751 ± 0.051
1.838GlyHis: 1.838 ± 0.022
3.449GlyIle: 3.449 ± 0.03
3.323GlyLys: 3.323 ± 0.031
6.41GlyLeu: 6.41 ± 0.039
1.551GlyMet: 1.551 ± 0.019
2.346GlyAsn: 2.346 ± 0.021
3.582GlyPro: 3.582 ± 0.027
2.82GlyGln: 2.82 ± 0.025
4.335GlyArg: 4.335 ± 0.035
5.72GlySer: 5.72 ± 0.045
4.101GlyThr: 4.101 ± 0.036
4.437GlyVal: 4.437 ± 0.036
1.176GlyTrp: 1.176 ± 0.018
2.097GlyTyr: 2.097 ± 0.022
0.0GlyXaa: 0.0 ± 0.0
His
2.011HisAla: 2.011 ± 0.022
0.31HisCys: 0.31 ± 0.008
1.5HisAsp: 1.5 ± 0.019
1.45HisGlu: 1.45 ± 0.018
0.922HisPhe: 0.922 ± 0.014
1.89HisGly: 1.89 ± 0.02
0.911HisHis: 0.911 ± 0.02
1.194HisIle: 1.194 ± 0.017
0.926HisLys: 0.926 ± 0.013
2.372HisLeu: 2.372 ± 0.023
0.468HisMet: 0.468 ± 0.011
0.836HisAsn: 0.836 ± 0.015
1.763HisPro: 1.763 ± 0.02
1.07HisGln: 1.07 ± 0.015
1.608HisArg: 1.608 ± 0.016
1.888HisSer: 1.888 ± 0.02
1.307HisThr: 1.307 ± 0.015
1.586HisVal: 1.586 ± 0.018
0.351HisTrp: 0.351 ± 0.009
0.725HisTyr: 0.725 ± 0.014
0.0HisXaa: 0.0 ± 0.0
Ile
4.113IleAla: 4.113 ± 0.034
0.726IleCys: 0.726 ± 0.013
2.833IleAsp: 2.833 ± 0.026
2.72IleGlu: 2.72 ± 0.024
1.875IlePhe: 1.875 ± 0.021
3.151IleGly: 3.151 ± 0.033
1.176IleHis: 1.176 ± 0.017
2.273IleIle: 2.273 ± 0.027
2.06IleLys: 2.06 ± 0.02
4.527IleLeu: 4.527 ± 0.034
0.925IleMet: 0.925 ± 0.013
1.682IleAsn: 1.682 ± 0.018
3.013IlePro: 3.013 ± 0.024
1.881IleGln: 1.881 ± 0.02
2.72IleArg: 2.72 ± 0.02
3.65IleSer: 3.65 ± 0.025
2.663IleThr: 2.663 ± 0.025
3.15IleVal: 3.15 ± 0.029
0.701IleTrp: 0.701 ± 0.013
1.37IleTyr: 1.37 ± 0.018
0.0IleXaa: 0.0 ± 0.0
Lys
4.278LysAla: 4.278 ± 0.037
0.451LysCys: 0.451 ± 0.011
2.688LysAsp: 2.688 ± 0.024
3.139LysGlu: 3.139 ± 0.028
1.383LysPhe: 1.383 ± 0.02
2.791LysGly: 2.791 ± 0.027
1.172LysHis: 1.172 ± 0.016
2.177LysIle: 2.177 ± 0.024
2.972LysLys: 2.972 ± 0.044
3.944LysLeu: 3.944 ± 0.033
0.934LysMet: 0.934 ± 0.016
1.546LysAsn: 1.546 ± 0.02
2.624LysPro: 2.624 ± 0.025
1.885LysGln: 1.885 ± 0.02
3.307LysArg: 3.307 ± 0.028
3.361LysSer: 3.361 ± 0.028
2.735LysThr: 2.735 ± 0.024
2.804LysVal: 2.804 ± 0.026
0.64LysTrp: 0.64 ± 0.011
1.368LysTyr: 1.368 ± 0.015
0.0LysXaa: 0.0 ± 0.0
Leu
8.292LeuAla: 8.292 ± 0.048
1.217LeuCys: 1.217 ± 0.018
5.536LeuAsp: 5.536 ± 0.038
5.573LeuGlu: 5.573 ± 0.035
3.304LeuPhe: 3.304 ± 0.033
6.119LeuGly: 6.119 ± 0.036
2.312LeuHis: 2.312 ± 0.023
3.827LeuIle: 3.827 ± 0.028
4.18LeuLys: 4.18 ± 0.032
8.55LeuLeu: 8.55 ± 0.062
1.736LeuMet: 1.736 ± 0.019
3.067LeuAsn: 3.067 ± 0.026
5.608LeuPro: 5.608 ± 0.039
4.026LeuGln: 4.026 ± 0.027
5.847LeuArg: 5.847 ± 0.041
7.281LeuSer: 7.281 ± 0.045
4.982LeuThr: 4.982 ± 0.035
5.714LeuVal: 5.714 ± 0.032
1.233LeuTrp: 1.233 ± 0.019
2.379LeuTyr: 2.379 ± 0.023
0.0LeuXaa: 0.0 ± 0.0
Met
2.26MetAla: 2.26 ± 0.02
0.224MetCys: 0.224 ± 0.007
1.155MetAsp: 1.155 ± 0.015
1.095MetGlu: 1.095 ± 0.016
0.714MetPhe: 0.714 ± 0.011
1.346MetGly: 1.346 ± 0.019
0.482MetHis: 0.482 ± 0.01
0.927MetIle: 0.927 ± 0.013
0.866MetLys: 0.866 ± 0.014
1.797MetLeu: 1.797 ± 0.022
0.503MetMet: 0.503 ± 0.01
0.718MetAsn: 0.718 ± 0.011
1.229MetPro: 1.229 ± 0.016
0.831MetGln: 0.831 ± 0.013
1.149MetArg: 1.149 ± 0.014
1.775MetSer: 1.775 ± 0.019
1.262MetThr: 1.262 ± 0.017
1.272MetVal: 1.272 ± 0.017
0.234MetTrp: 0.234 ± 0.007
0.516MetTyr: 0.516 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
3.025AsnAla: 3.025 ± 0.027
0.399AsnCys: 0.399 ± 0.01
1.833AsnAsp: 1.833 ± 0.023
1.854AsnGlu: 1.854 ± 0.016
1.246AsnPhe: 1.246 ± 0.015
2.824AsnGly: 2.824 ± 0.028
0.865AsnHis: 0.865 ± 0.012
1.862AsnIle: 1.862 ± 0.019
1.409AsnLys: 1.409 ± 0.015
3.176AsnLeu: 3.176 ± 0.025
0.751AsnMet: 0.751 ± 0.012
1.304AsnAsn: 1.304 ± 0.02
2.369AsnPro: 2.369 ± 0.022
1.303AsnGln: 1.303 ± 0.016
1.857AsnArg: 1.857 ± 0.018
2.5AsnSer: 2.5 ± 0.021
2.096AsnThr: 2.096 ± 0.024
2.303AsnVal: 2.303 ± 0.021
0.511AsnTrp: 0.511 ± 0.011
0.969AsnTyr: 0.969 ± 0.015
0.0AsnXaa: 0.0 ± 0.0
Pro
5.444ProAla: 5.444 ± 0.045
0.479ProCys: 0.479 ± 0.009
3.459ProAsp: 3.459 ± 0.03
3.734ProGlu: 3.734 ± 0.029
2.052ProPhe: 2.052 ± 0.023
4.178ProGly: 4.178 ± 0.03
1.38ProHis: 1.38 ± 0.019
2.408ProIle: 2.408 ± 0.022
2.542ProLys: 2.542 ± 0.025
4.851ProLeu: 4.851 ± 0.031
1.0ProMet: 1.0 ± 0.014
2.12ProAsn: 2.12 ± 0.021
5.17ProPro: 5.17 ± 0.062
2.6ProGln: 2.6 ± 0.032
3.54ProArg: 3.54 ± 0.031
6.036ProSer: 6.036 ± 0.043
4.137ProThr: 4.137 ± 0.033
3.758ProVal: 3.758 ± 0.032
0.751ProTrp: 0.751 ± 0.011
1.53ProTyr: 1.53 ± 0.02
0.0ProXaa: 0.0 ± 0.0
Gln
3.841GlnAla: 3.841 ± 0.028
0.432GlnCys: 0.432 ± 0.011
2.265GlnAsp: 2.265 ± 0.022
2.537GlnGlu: 2.537 ± 0.021
1.284GlnPhe: 1.284 ± 0.017
2.583GlnGly: 2.583 ± 0.026
1.204GlnHis: 1.204 ± 0.015
1.973GlnIle: 1.973 ± 0.021
1.961GlnLys: 1.961 ± 0.022
3.595GlnLeu: 3.595 ± 0.03
0.854GlnMet: 0.854 ± 0.015
1.524GlnAsn: 1.524 ± 0.018
2.729GlnPro: 2.729 ± 0.032
2.43GlnGln: 2.43 ± 0.037
2.708GlnArg: 2.708 ± 0.027
3.254GlnSer: 3.254 ± 0.034
2.515GlnThr: 2.515 ± 0.026
2.342GlnVal: 2.342 ± 0.02
0.572GlnTrp: 0.572 ± 0.01
1.213GlnTyr: 1.213 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
4.951ArgAla: 4.951 ± 0.033
0.673ArgCys: 0.673 ± 0.012
3.507ArgAsp: 3.507 ± 0.032
3.794ArgGlu: 3.794 ± 0.034
2.108ArgPhe: 2.108 ± 0.019
3.721ArgGly: 3.721 ± 0.036
1.729ArgHis: 1.729 ± 0.02
2.809ArgIle: 2.809 ± 0.023
3.478ArgLys: 3.478 ± 0.03
5.773ArgLeu: 5.773 ± 0.036
1.237ArgMet: 1.237 ± 0.015
2.147ArgAsn: 2.147 ± 0.019
3.851ArgPro: 3.851 ± 0.029
2.828ArgGln: 2.828 ± 0.028
5.374ArgArg: 5.374 ± 0.043
4.85ArgSer: 4.85 ± 0.04
3.409ArgThr: 3.409 ± 0.025
3.378ArgVal: 3.378 ± 0.028
0.954ArgTrp: 0.954 ± 0.013
1.703ArgTyr: 1.703 ± 0.021
0.0ArgXaa: 0.0 ± 0.0
Ser
6.782SerAla: 6.782 ± 0.044
0.798SerCys: 0.798 ± 0.011
4.169SerAsp: 4.169 ± 0.036
4.032SerGlu: 4.032 ± 0.032
2.882SerPhe: 2.882 ± 0.026
5.522SerGly: 5.522 ± 0.039
2.021SerHis: 2.021 ± 0.023
3.837SerIle: 3.837 ± 0.031
3.555SerLys: 3.555 ± 0.03
7.26SerLeu: 7.26 ± 0.044
1.619SerMet: 1.619 ± 0.018
2.828SerAsn: 2.828 ± 0.026
5.581SerPro: 5.581 ± 0.051
3.391SerGln: 3.391 ± 0.031
5.106SerArg: 5.106 ± 0.041
8.611SerSer: 8.611 ± 0.074
5.837SerThr: 5.837 ± 0.052
4.623SerVal: 4.623 ± 0.032
1.09SerTrp: 1.09 ± 0.013
1.968SerTyr: 1.968 ± 0.022
0.0SerXaa: 0.0 ± 0.0
Thr
5.567ThrAla: 5.567 ± 0.037
0.695ThrCys: 0.695 ± 0.013
2.912ThrAsp: 2.912 ± 0.027
3.086ThrGlu: 3.086 ± 0.024
2.198ThrPhe: 2.198 ± 0.019
4.394ThrGly: 4.394 ± 0.038
1.319ThrHis: 1.319 ± 0.016
3.016ThrIle: 3.016 ± 0.028
2.566ThrLys: 2.566 ± 0.022
5.282ThrLeu: 5.282 ± 0.036
1.15ThrMet: 1.15 ± 0.013
2.1ThrAsn: 2.1 ± 0.022
4.327ThrPro: 4.327 ± 0.037
2.173ThrGln: 2.173 ± 0.021
3.277ThrArg: 3.277 ± 0.025
5.586ThrSer: 5.586 ± 0.048
4.608ThrThr: 4.608 ± 0.074
3.913ThrVal: 3.913 ± 0.03
0.848ThrTrp: 0.848 ± 0.013
1.575ThrTyr: 1.575 ± 0.019
0.0ThrXaa: 0.0 ± 0.0
Val
5.559ValAla: 5.559 ± 0.039
0.821ValCys: 0.821 ± 0.014
3.956ValAsp: 3.956 ± 0.034
3.9ValGlu: 3.9 ± 0.03
2.471ValPhe: 2.471 ± 0.024
4.228ValGly: 4.228 ± 0.032
1.497ValHis: 1.497 ± 0.017
2.853ValIle: 2.853 ± 0.023
2.871ValLys: 2.871 ± 0.024
5.829ValLeu: 5.829 ± 0.043
1.248ValMet: 1.248 ± 0.016
2.155ValAsn: 2.155 ± 0.024
3.695ValPro: 3.695 ± 0.029
2.565ValGln: 2.565 ± 0.024
3.628ValArg: 3.628 ± 0.024
4.679ValSer: 4.679 ± 0.029
3.619ValThr: 3.619 ± 0.032
4.581ValVal: 4.581 ± 0.033
0.913ValTrp: 0.913 ± 0.014
1.71ValTyr: 1.71 ± 0.017
0.0ValXaa: 0.0 ± 0.0
Trp
1.146TrpAla: 1.146 ± 0.016
0.196TrpCys: 0.196 ± 0.007
0.858TrpAsp: 0.858 ± 0.015
0.809TrpGlu: 0.809 ± 0.013
0.538TrpPhe: 0.538 ± 0.011
0.858TrpGly: 0.858 ± 0.015
0.369TrpHis: 0.369 ± 0.007
0.768TrpIle: 0.768 ± 0.013
0.776TrpLys: 0.776 ± 0.012
1.408TrpLeu: 1.408 ± 0.018
0.361TrpMet: 0.361 ± 0.009
0.578TrpAsn: 0.578 ± 0.01
0.628TrpPro: 0.628 ± 0.011
0.588TrpGln: 0.588 ± 0.011
0.98TrpArg: 0.98 ± 0.012
1.054TrpSer: 1.054 ± 0.015
0.984TrpThr: 0.984 ± 0.014
0.841TrpVal: 0.841 ± 0.012
0.266TrpTrp: 0.266 ± 0.006
0.45TrpTyr: 0.45 ± 0.009
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.211TyrAla: 2.211 ± 0.022
0.388TyrCys: 0.388 ± 0.008
1.646TyrAsp: 1.646 ± 0.019
1.485TyrGlu: 1.485 ± 0.018
1.197TyrPhe: 1.197 ± 0.017
2.086TyrGly: 2.086 ± 0.02
0.779TyrHis: 0.779 ± 0.014
1.325TyrIle: 1.325 ± 0.016
1.052TyrLys: 1.052 ± 0.014
2.737TyrLeu: 2.737 ± 0.024
0.592TyrMet: 0.592 ± 0.011
1.005TyrAsn: 1.005 ± 0.015
1.485TyrPro: 1.485 ± 0.02
1.148TyrGln: 1.148 ± 0.015
1.616TyrArg: 1.616 ± 0.019
1.958TyrSer: 1.958 ± 0.021
1.58TyrThr: 1.58 ± 0.018
1.667TyrVal: 1.667 ± 0.019
0.436TyrTrp: 0.436 ± 0.009
0.888TyrTyr: 0.888 ± 0.017
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10469 proteins (5150417 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
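A protein-level bootstrap of this kind might look like the sketch below. It is an assumption-laden illustration rather than the database's code: the function names, the fixed random seed, the toy sequences, and the use of the standard deviation of the replicates as the reported error are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of `pair` among all adjacent residue pairs, in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Resample whole proteins with replacement `rounds` times.

    Returns the standard deviation of the per mille frequency across
    replicates (assumption: this is the spread measure used as the error).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Protein-level resampling: each draw is an entire sequence,
        # so residue pairs within a protein stay together.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the real data.
toy_proteins = ["MAAALK", "MKAAL", "MLLSA", "MAASSL", "MKKAL"]
point_estimate = dipeptide_per_mille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins, rather than individual dipeptides, keeps the correlation between pairs from the same sequence, which is presumably why the note specifies the protein level.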

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski