Amino acid dipeptide frequency for Penicillium italicum (Blue mold)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
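A minimal sketch of how such a frequency could be computed is given here. It assumes that overlapping pairs are counted within each protein (never across protein boundaries) and that any letter outside the 20 standard one-letter codes is mapped to X (Xaa). The function names are hypothetical, and the exact counting rules used for the table on this page are not stated.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    # Assumption: every non-standard or unknown letter is treated as X (Xaa).
    cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
    return Counter(a + b for a, b in zip(cleaned, cleaned[1:]))

def dipeptide_per_mille(sequences):
    """Pool pair counts over all proteins and return frequencies in per mille."""
    total = Counter()
    for seq in sequences:
        total.update(dipeptide_counts(seq))
    n_pairs = sum(total.values())
    if n_pairs == 0:
        return {}
    return {pair: 1000.0 * count / n_pairs for pair, count in total.items()}
```

For example, `dipeptide_per_mille(["AAC", "AC"])` pools three pairs (AA, AC, AC), so AC accounts for two thirds of them.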

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and the underrepresented ones in blue, in the table.
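The uniform baseline can be checked with a few lines of arithmetic. The sketch below expresses it in per mille, the unit used in the table, and labels a value as over- or underrepresented. The `classify` helper is hypothetical, and it is an assumption that the colouring compares each value with the 400-dipeptide baseline and ignores the error estimates.

```python
# Uniform expectation in per mille for 400 and 441 possible dipeptides.
UNIFORM_400 = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%
UNIFORM_441 = 1000.0 / 441  # about 2.27 per mille when Xaa is included

def classify(value_per_mille, expected=UNIFORM_400):
    """Return the colour a value would get relative to the uniform baseline."""
    if value_per_mille > expected:
        return "red"   # overrepresented
    if value_per_mille < expected:
        return "blue"  # underrepresented
    return "neutral"
```

With this baseline, AlaAla (8.385‰) would be classified as overrepresented and CysCys (0.214‰) as underrepresented.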

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
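As a short illustration of the unit conversion, the helpers below (hypothetical names) turn a per mille value into a plain fraction or a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0
```

For instance, the AlaAla value of 8.385‰ corresponds to a fraction of 0.008385, or 0.8385%.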

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.385AlaAla: 8.385 ± 0.066
0.995AlaCys: 0.995 ± 0.015
4.25AlaAsp: 4.25 ± 0.039
5.105AlaGlu: 5.105 ± 0.047
3.104AlaPhe: 3.104 ± 0.028
5.593AlaGly: 5.593 ± 0.041
1.773AlaHis: 1.773 ± 0.018
4.317AlaIle: 4.317 ± 0.036
3.927AlaLys: 3.927 ± 0.032
7.665AlaLeu: 7.665 ± 0.052
1.963AlaMet: 1.963 ± 0.022
2.952AlaAsn: 2.952 ± 0.027
4.834AlaPro: 4.834 ± 0.052
3.417AlaGln: 3.417 ± 0.031
4.767AlaArg: 4.767 ± 0.038
7.153AlaSer: 7.153 ± 0.051
5.194AlaThr: 5.194 ± 0.043
5.263AlaVal: 5.263 ± 0.039
1.107AlaTrp: 1.107 ± 0.017
2.085AlaTyr: 2.085 ± 0.022
0.001AlaXaa: 0.001 ± 0.001
Cys
0.897CysAla: 0.897 ± 0.016
0.214CysCys: 0.214 ± 0.009
0.645CysAsp: 0.645 ± 0.012
0.591CysGlu: 0.591 ± 0.011
0.545CysPhe: 0.545 ± 0.012
0.895CysGly: 0.895 ± 0.014
0.325CysHis: 0.325 ± 0.009
0.669CysIle: 0.669 ± 0.014
0.476CysLys: 0.476 ± 0.01
1.238CysLeu: 1.238 ± 0.019
0.258CysMet: 0.258 ± 0.006
0.409CysAsn: 0.409 ± 0.009
0.623CysPro: 0.623 ± 0.015
0.444CysGln: 0.444 ± 0.011
0.698CysArg: 0.698 ± 0.015
0.839CysSer: 0.839 ± 0.015
0.64CysThr: 0.64 ± 0.013
0.781CysVal: 0.781 ± 0.014
0.187CysTrp: 0.187 ± 0.007
0.335CysTyr: 0.335 ± 0.007
0.001CysXaa: 0.001 ± 0.0
Asp
4.494AspAla: 4.494 ± 0.039
0.604AspCys: 0.604 ± 0.013
3.955AspAsp: 3.955 ± 0.04
4.445AspGlu: 4.445 ± 0.042
2.231AspPhe: 2.231 ± 0.024
3.817AspGly: 3.817 ± 0.032
1.345AspHis: 1.345 ± 0.019
3.113AspIle: 3.113 ± 0.029
2.286AspLys: 2.286 ± 0.025
5.311AspLeu: 5.311 ± 0.042
1.285AspMet: 1.285 ± 0.02
1.906AspAsn: 1.906 ± 0.02
3.417AspPro: 3.417 ± 0.034
2.072AspGln: 2.072 ± 0.022
3.084AspArg: 3.084 ± 0.032
4.348AspSer: 4.348 ± 0.037
3.053AspThr: 3.053 ± 0.029
3.6AspVal: 3.6 ± 0.026
0.888AspTrp: 0.888 ± 0.016
1.605AspTyr: 1.605 ± 0.02
0.001AspXaa: 0.001 ± 0.0
Glu
5.222GluAla: 5.222 ± 0.051
0.618GluCys: 0.618 ± 0.013
4.24GluAsp: 4.24 ± 0.043
5.435GluGlu: 5.435 ± 0.062
2.074GluPhe: 2.074 ± 0.021
3.639GluGly: 3.639 ± 0.034
1.442GluHis: 1.442 ± 0.018
3.307GluIle: 3.307 ± 0.028
3.639GluLys: 3.639 ± 0.035
5.279GluLeu: 5.279 ± 0.052
1.548GluMet: 1.548 ± 0.018
2.447GluAsn: 2.447 ± 0.028
2.921GluPro: 2.921 ± 0.06
2.536GluGln: 2.536 ± 0.03
3.8GluArg: 3.8 ± 0.037
4.616GluSer: 4.616 ± 0.045
3.726GluThr: 3.726 ± 0.031
3.526GluVal: 3.526 ± 0.029
0.877GluTrp: 0.877 ± 0.013
1.665GluTyr: 1.665 ± 0.021
0.001GluXaa: 0.001 ± 0.0
Phe
3.037PheAla: 3.037 ± 0.031
0.555PheCys: 0.555 ± 0.011
2.36PheAsp: 2.36 ± 0.023
2.241PheGlu: 2.241 ± 0.024
1.739PhePhe: 1.739 ± 0.064
2.847PheGly: 2.847 ± 0.028
0.953PheHis: 0.953 ± 0.015
1.883PheIle: 1.883 ± 0.023
1.519PheLys: 1.519 ± 0.019
3.559PheLeu: 3.559 ± 0.033
0.839PheMet: 0.839 ± 0.013
1.519PheAsn: 1.519 ± 0.019
2.023PhePro: 2.023 ± 0.022
1.455PheGln: 1.455 ± 0.015
2.002PheArg: 2.002 ± 0.022
3.066PheSer: 3.066 ± 0.027
2.176PheThr: 2.176 ± 0.032
2.427PheVal: 2.427 ± 0.029
0.653PheTrp: 0.653 ± 0.013
1.135PheTyr: 1.135 ± 0.016
0.001PheXaa: 0.001 ± 0.001
Gly
5.13GlyAla: 5.13 ± 0.037
0.849GlyCys: 0.849 ± 0.016
3.528GlyAsp: 3.528 ± 0.031
3.534GlyGlu: 3.534 ± 0.027
2.852GlyPhe: 2.852 ± 0.027
5.344GlyGly: 5.344 ± 0.057
1.67GlyHis: 1.67 ± 0.021
3.587GlyIle: 3.587 ± 0.032
3.29GlyLys: 3.29 ± 0.036
6.061GlyLeu: 6.061 ± 0.042
1.601GlyMet: 1.601 ± 0.021
2.54GlyAsn: 2.54 ± 0.025
3.358GlyPro: 3.358 ± 0.033
2.55GlyGln: 2.55 ± 0.034
3.89GlyArg: 3.89 ± 0.034
5.766GlySer: 5.766 ± 0.053
3.877GlyThr: 3.877 ± 0.031
4.289GlyVal: 4.289 ± 0.034
1.133GlyTrp: 1.133 ± 0.019
2.061GlyTyr: 2.061 ± 0.024
0.001GlyXaa: 0.001 ± 0.0
His
1.828HisAla: 1.828 ± 0.023
0.316HisCys: 0.316 ± 0.008
1.334HisAsp: 1.334 ± 0.018
1.372HisGlu: 1.372 ± 0.019
0.952HisPhe: 0.952 ± 0.014
1.732HisGly: 1.732 ± 0.022
0.812HisHis: 0.812 ± 0.014
1.259HisIle: 1.259 ± 0.015
0.895HisLys: 0.895 ± 0.015
2.301HisLeu: 2.301 ± 0.027
0.508HisMet: 0.508 ± 0.009
0.894HisAsn: 0.894 ± 0.013
1.691HisPro: 1.691 ± 0.02
1.01HisGln: 1.01 ± 0.016
1.598HisArg: 1.598 ± 0.018
1.944HisSer: 1.944 ± 0.021
1.321HisThr: 1.321 ± 0.017
1.453HisVal: 1.453 ± 0.019
0.348HisTrp: 0.348 ± 0.009
0.695HisTyr: 0.695 ± 0.013
0.001HisXaa: 0.001 ± 0.0
Ile
4.256IleAla: 4.256 ± 0.038
0.749IleCys: 0.749 ± 0.013
2.94IleAsp: 2.94 ± 0.025
3.008IleGlu: 3.008 ± 0.028
2.11IlePhe: 2.11 ± 0.034
3.265IleGly: 3.265 ± 0.031
1.255IleHis: 1.255 ± 0.016
2.599IleIle: 2.599 ± 0.032
2.214IleLys: 2.214 ± 0.025
4.668IleLeu: 4.668 ± 0.042
1.088IleMet: 1.088 ± 0.013
1.854IleAsn: 1.854 ± 0.021
3.289IlePro: 3.289 ± 0.028
2.005IleGln: 2.005 ± 0.023
2.788IleArg: 2.788 ± 0.025
4.132IleSer: 4.132 ± 0.031
2.897IleThr: 2.897 ± 0.027
3.204IleVal: 3.204 ± 0.029
0.736IleTrp: 0.736 ± 0.014
1.459IleTyr: 1.459 ± 0.021
0.0IleXaa: 0.0 ± 0.0
Lys
4.047LysAla: 4.047 ± 0.04
0.477LysCys: 0.477 ± 0.011
2.686LysAsp: 2.686 ± 0.028
3.307LysGlu: 3.307 ± 0.032
1.511LysPhe: 1.511 ± 0.019
2.857LysGly: 2.857 ± 0.029
1.088LysHis: 1.088 ± 0.015
2.327LysIle: 2.327 ± 0.024
3.212LysLys: 3.212 ± 0.044
4.034LysLeu: 4.034 ± 0.033
1.018LysMet: 1.018 ± 0.017
1.792LysAsn: 1.792 ± 0.023
2.719LysPro: 2.719 ± 0.03
1.873LysGln: 1.873 ± 0.025
3.293LysArg: 3.293 ± 0.033
3.637LysSer: 3.637 ± 0.03
2.779LysThr: 2.779 ± 0.029
2.755LysVal: 2.755 ± 0.026
0.672LysTrp: 0.672 ± 0.012
1.336LysTyr: 1.336 ± 0.02
0.001LysXaa: 0.001 ± 0.0
Leu
7.759LeuAla: 7.759 ± 0.047
1.187LeuCys: 1.187 ± 0.019
5.292LeuAsp: 5.292 ± 0.042
5.619LeuGlu: 5.619 ± 0.052
3.412LeuPhe: 3.412 ± 0.032
6.012LeuGly: 6.012 ± 0.044
2.255LeuHis: 2.255 ± 0.023
4.078LeuIle: 4.078 ± 0.037
4.178LeuLys: 4.178 ± 0.034
8.344LeuLeu: 8.344 ± 0.061
1.839LeuMet: 1.839 ± 0.022
3.346LeuAsn: 3.346 ± 0.039
5.487LeuPro: 5.487 ± 0.038
3.878LeuGln: 3.878 ± 0.033
5.706LeuArg: 5.706 ± 0.038
7.521LeuSer: 7.521 ± 0.055
4.814LeuThr: 4.814 ± 0.033
5.515LeuVal: 5.515 ± 0.041
1.189LeuTrp: 1.189 ± 0.018
2.359LeuTyr: 2.359 ± 0.02
0.001LeuXaa: 0.001 ± 0.001
Met
2.196MetAla: 2.196 ± 0.021
0.248MetCys: 0.248 ± 0.007
1.305MetAsp: 1.305 ± 0.014
1.334MetGlu: 1.334 ± 0.018
0.791MetPhe: 0.791 ± 0.013
1.529MetGly: 1.529 ± 0.021
0.497MetHis: 0.497 ± 0.01
1.077MetIle: 1.077 ± 0.016
1.028MetLys: 1.028 ± 0.018
1.872MetLeu: 1.872 ± 0.021
0.595MetMet: 0.595 ± 0.014
0.875MetAsn: 0.875 ± 0.017
1.258MetPro: 1.258 ± 0.017
0.914MetGln: 0.914 ± 0.015
1.254MetArg: 1.254 ± 0.018
1.937MetSer: 1.937 ± 0.021
1.34MetThr: 1.34 ± 0.018
1.333MetVal: 1.333 ± 0.016
0.263MetTrp: 0.263 ± 0.008
0.515MetTyr: 0.515 ± 0.012
0.001MetXaa: 0.001 ± 0.0
Asn
3.143AsnAla: 3.143 ± 0.029
0.442AsnCys: 0.442 ± 0.01
1.996AsnAsp: 1.996 ± 0.022
2.152AsnGlu: 2.152 ± 0.022
1.431AsnPhe: 1.431 ± 0.017
2.845AsnGly: 2.845 ± 0.03
0.899AsnHis: 0.899 ± 0.016
2.081AsnIle: 2.081 ± 0.02
1.558AsnLys: 1.558 ± 0.021
3.421AsnLeu: 3.421 ± 0.037
0.89AsnMet: 0.89 ± 0.015
1.452AsnAsn: 1.452 ± 0.025
2.644AsnPro: 2.644 ± 0.023
1.445AsnGln: 1.445 ± 0.022
2.008AsnArg: 2.008 ± 0.022
2.866AsnSer: 2.866 ± 0.026
2.244AsnThr: 2.244 ± 0.021
2.344AsnVal: 2.344 ± 0.025
0.569AsnTrp: 0.569 ± 0.012
1.074AsnTyr: 1.074 ± 0.016
0.001AsnXaa: 0.001 ± 0.0
Pro
5.238ProAla: 5.238 ± 0.053
0.509ProCys: 0.509 ± 0.013
3.194ProAsp: 3.194 ± 0.033
4.051ProGlu: 4.051 ± 0.053
2.13ProPhe: 2.13 ± 0.025
3.855ProGly: 3.855 ± 0.037
1.377ProHis: 1.377 ± 0.02
2.711ProIle: 2.711 ± 0.027
2.719ProLys: 2.719 ± 0.034
4.754ProLeu: 4.754 ± 0.038
1.152ProMet: 1.152 ± 0.019
2.266ProAsn: 2.266 ± 0.025
4.719ProPro: 4.719 ± 0.065
2.493ProGln: 2.493 ± 0.032
3.485ProArg: 3.485 ± 0.036
6.18ProSer: 6.18 ± 0.051
4.166ProThr: 4.166 ± 0.042
3.724ProVal: 3.724 ± 0.032
0.746ProTrp: 0.746 ± 0.013
1.474ProTyr: 1.474 ± 0.021
0.002ProXaa: 0.002 ± 0.001
Gln
3.388GlnAla: 3.388 ± 0.032
0.433GlnCys: 0.433 ± 0.011
2.086GlnAsp: 2.086 ± 0.02
2.433GlnGlu: 2.433 ± 0.022
1.397GlnPhe: 1.397 ± 0.02
2.383GlnGly: 2.383 ± 0.023
1.056GlnHis: 1.056 ± 0.018
2.038GlnIle: 2.038 ± 0.02
2.061GlnLys: 2.061 ± 0.028
3.506GlnLeu: 3.506 ± 0.031
0.951GlnMet: 0.951 ± 0.016
1.668GlnAsn: 1.668 ± 0.021
2.616GlnPro: 2.616 ± 0.037
2.191GlnGln: 2.191 ± 0.044
2.602GlnArg: 2.602 ± 0.025
3.383GlnSer: 3.383 ± 0.035
2.451GlnThr: 2.451 ± 0.022
2.235GlnVal: 2.235 ± 0.023
0.586GlnTrp: 0.586 ± 0.011
1.178GlnTyr: 1.178 ± 0.016
0.001GlnXaa: 0.001 ± 0.0
Arg
4.597ArgAla: 4.597 ± 0.034
0.653ArgCys: 0.653 ± 0.014
3.346ArgAsp: 3.346 ± 0.034
3.776ArgGlu: 3.776 ± 0.043
2.178ArgPhe: 2.178 ± 0.02
3.666ArgGly: 3.666 ± 0.037
1.525ArgHis: 1.525 ± 0.019
2.895ArgIle: 2.895 ± 0.028
3.335ArgLys: 3.335 ± 0.031
5.424ArgLeu: 5.424 ± 0.041
1.316ArgMet: 1.316 ± 0.017
2.248ArgAsn: 2.248 ± 0.024
3.443ArgPro: 3.443 ± 0.035
2.546ArgGln: 2.546 ± 0.024
4.812ArgArg: 4.812 ± 0.051
4.813ArgSer: 4.813 ± 0.043
3.246ArgThr: 3.246 ± 0.029
3.408ArgVal: 3.408 ± 0.026
0.912ArgTrp: 0.912 ± 0.015
1.605ArgTyr: 1.605 ± 0.018
0.003ArgXaa: 0.003 ± 0.001
Ser
6.734SerAla: 6.734 ± 0.046
0.828SerCys: 0.828 ± 0.012
4.457SerAsp: 4.457 ± 0.04
4.468SerGlu: 4.468 ± 0.041
3.128SerPhe: 3.128 ± 0.026
5.591SerGly: 5.591 ± 0.05
2.092SerHis: 2.092 ± 0.025
4.152SerIle: 4.152 ± 0.032
3.808SerLys: 3.808 ± 0.035
7.47SerLeu: 7.47 ± 0.054
1.781SerMet: 1.781 ± 0.021
3.143SerAsn: 3.143 ± 0.027
5.704SerPro: 5.704 ± 0.062
3.478SerGln: 3.478 ± 0.038
4.998SerArg: 4.998 ± 0.045
8.817SerSer: 8.817 ± 0.09
5.763SerThr: 5.763 ± 0.054
4.786SerVal: 4.786 ± 0.033
1.133SerTrp: 1.133 ± 0.017
2.101SerTyr: 2.101 ± 0.024
0.002SerXaa: 0.002 ± 0.001
Thr
5.117ThrAla: 5.117 ± 0.034
0.686ThrCys: 0.686 ± 0.014
2.954ThrAsp: 2.954 ± 0.031
3.427ThrGlu: 3.427 ± 0.033
2.203ThrPhe: 2.203 ± 0.024
4.166ThrGly: 4.166 ± 0.031
1.331ThrHis: 1.331 ± 0.018
3.143ThrIle: 3.143 ± 0.034
2.65ThrLys: 2.65 ± 0.026
5.292ThrLeu: 5.292 ± 0.038
1.193ThrMet: 1.193 ± 0.017
2.159ThrAsn: 2.159 ± 0.023
4.475ThrPro: 4.475 ± 0.046
2.235ThrGln: 2.235 ± 0.024
3.125ThrArg: 3.125 ± 0.025
5.383ThrSer: 5.383 ± 0.044
4.204ThrThr: 4.204 ± 0.057
3.794ThrVal: 3.794 ± 0.037
0.833ThrTrp: 0.833 ± 0.014
1.564ThrTyr: 1.564 ± 0.021
0.001ThrXaa: 0.001 ± 0.001
Val
5.182ValAla: 5.182 ± 0.038
0.793ValCys: 0.793 ± 0.015
3.789ValAsp: 3.789 ± 0.03
3.865ValGlu: 3.865 ± 0.035
2.52ValPhe: 2.52 ± 0.028
3.94ValGly: 3.94 ± 0.037
1.454ValHis: 1.454 ± 0.019
3.084ValIle: 3.084 ± 0.03
2.758ValLys: 2.758 ± 0.029
5.555ValLeu: 5.555 ± 0.045
1.353ValMet: 1.353 ± 0.018
2.264ValAsn: 2.264 ± 0.022
3.628ValPro: 3.628 ± 0.032
2.408ValGln: 2.408 ± 0.028
3.336ValArg: 3.336 ± 0.027
4.886ValSer: 4.886 ± 0.036
3.554ValThr: 3.554 ± 0.044
4.21ValVal: 4.21 ± 0.037
0.834ValTrp: 0.834 ± 0.015
1.692ValTyr: 1.692 ± 0.024
0.0ValXaa: 0.0 ± 0.0
Trp
1.111TrpAla: 1.111 ± 0.016
0.181TrpCys: 0.181 ± 0.006
0.89TrpAsp: 0.89 ± 0.015
0.854TrpGlu: 0.854 ± 0.014
0.504TrpPhe: 0.504 ± 0.011
0.919TrpGly: 0.919 ± 0.016
0.351TrpHis: 0.351 ± 0.008
0.787TrpIle: 0.787 ± 0.014
0.801TrpLys: 0.801 ± 0.013
1.351TrpLeu: 1.351 ± 0.018
0.375TrpMet: 0.375 ± 0.01
0.644TrpAsn: 0.644 ± 0.012
0.577TrpPro: 0.577 ± 0.011
0.552TrpGln: 0.552 ± 0.011
0.912TrpArg: 0.912 ± 0.015
1.076TrpSer: 1.076 ± 0.014
0.902TrpThr: 0.902 ± 0.016
0.881TrpVal: 0.881 ± 0.014
0.286TrpTrp: 0.286 ± 0.009
0.42TrpTyr: 0.42 ± 0.012
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.088TyrAla: 2.088 ± 0.024
0.372TyrCys: 0.372 ± 0.009
1.61TyrAsp: 1.61 ± 0.02
1.541TyrGlu: 1.541 ± 0.018
1.175TyrPhe: 1.175 ± 0.018
2.016TyrGly: 2.016 ± 0.025
0.762TyrHis: 0.762 ± 0.014
1.407TyrIle: 1.407 ± 0.019
1.066TyrLys: 1.066 ± 0.014
2.685TyrLeu: 2.685 ± 0.027
0.61TyrMet: 0.61 ± 0.012
1.129TyrAsn: 1.129 ± 0.016
1.517TyrPro: 1.517 ± 0.021
1.14TyrGln: 1.14 ± 0.015
1.587TyrArg: 1.587 ± 0.02
2.07TyrSer: 2.07 ± 0.023
1.587TyrThr: 1.587 ± 0.019
1.576TyrVal: 1.576 ± 0.022
0.423TyrTrp: 0.423 ± 0.009
0.898TyrTyr: 0.898 ± 0.015
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.003XaaAla: 0.003 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.002XaaGlu: 0.002 ± 0.001
0.001XaaPhe: 0.001 ± 0.001
0.001XaaGly: 0.001 ± 0.001
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.003XaaPro: 0.003 ± 0.001
0.001XaaGln: 0.001 ± 0.0
0.003XaaArg: 0.003 ± 0.001
0.002XaaSer: 0.002 ± 0.001
0.002XaaThr: 0.002 ± 0.001
0.001XaaVal: 0.001 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.001XaaXaa: 0.001 ± 0.001
Statistics based on 9993 proteins (4757581 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
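A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the spread of the resulting estimates serves as the error. The function names are hypothetical. Reporting the standard deviation of the bootstrap estimates is an assumption, since the page does not say which statistic the ± value represents.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_per_mille(sequences, pair, n_rounds=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide's frequency."""
    rng = random.Random(seed)
    # Count once per protein; resampling then only reshuffles these counters.
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    if len(estimates) < 2:
        raise ValueError("not enough usable bootstrap rounds")
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling proteins rather than individual dipeptides keeps the pairs within a protein together, so correlations inside a sequence are reflected in the error estimate.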

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski