Amino acid dipepetide frequency for Penicilliopsis zonata CBS 506.65

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.028AlaAla: 10.028 ± 0.076
1.153AlaCys: 1.153 ± 0.017
4.431AlaAsp: 4.431 ± 0.035
5.211AlaGlu: 5.211 ± 0.046
3.178AlaPhe: 3.178 ± 0.028
5.948AlaGly: 5.948 ± 0.045
1.792AlaHis: 1.792 ± 0.019
4.216AlaIle: 4.216 ± 0.032
3.515AlaLys: 3.515 ± 0.032
8.193AlaLeu: 8.193 ± 0.048
1.996AlaMet: 1.996 ± 0.022
2.721AlaAsn: 2.721 ± 0.022
4.478AlaPro: 4.478 ± 0.045
3.388AlaGln: 3.388 ± 0.036
5.191AlaArg: 5.191 ± 0.04
7.539AlaSer: 7.539 ± 0.046
5.391AlaThr: 5.391 ± 0.038
5.94AlaVal: 5.94 ± 0.045
1.229AlaTrp: 1.229 ± 0.018
2.196AlaTyr: 2.196 ± 0.021
0.0AlaXaa: 0.0 ± 0.0
Cys
0.962CysAla: 0.962 ± 0.017
0.249CysCys: 0.249 ± 0.009
0.662CysAsp: 0.662 ± 0.013
0.616CysGlu: 0.616 ± 0.014
0.573CysPhe: 0.573 ± 0.011
0.912CysGly: 0.912 ± 0.017
0.336CysHis: 0.336 ± 0.009
0.685CysIle: 0.685 ± 0.015
0.448CysLys: 0.448 ± 0.01
1.432CysLeu: 1.432 ± 0.021
0.282CysMet: 0.282 ± 0.007
0.403CysAsn: 0.403 ± 0.009
0.682CysPro: 0.682 ± 0.016
0.477CysGln: 0.477 ± 0.011
0.846CysArg: 0.846 ± 0.013
0.984CysSer: 0.984 ± 0.017
0.707CysThr: 0.707 ± 0.013
0.858CysVal: 0.858 ± 0.014
0.204CysTrp: 0.204 ± 0.006
0.363CysTyr: 0.363 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
4.607AspAla: 4.607 ± 0.038
0.656AspCys: 0.656 ± 0.012
4.184AspAsp: 4.184 ± 0.05
4.472AspGlu: 4.472 ± 0.041
2.137AspPhe: 2.137 ± 0.023
3.995AspGly: 3.995 ± 0.031
1.314AspHis: 1.314 ± 0.018
2.877AspIle: 2.877 ± 0.029
2.098AspLys: 2.098 ± 0.024
5.332AspLeu: 5.332 ± 0.038
1.185AspMet: 1.185 ± 0.015
1.722AspAsn: 1.722 ± 0.019
3.337AspPro: 3.337 ± 0.029
1.924AspGln: 1.924 ± 0.023
3.291AspArg: 3.291 ± 0.031
4.196AspSer: 4.196 ± 0.037
2.933AspThr: 2.933 ± 0.028
3.513AspVal: 3.513 ± 0.029
0.873AspTrp: 0.873 ± 0.012
1.6AspTyr: 1.6 ± 0.019
0.0AspXaa: 0.0 ± 0.0
Glu
5.358GluAla: 5.358 ± 0.038
0.635GluCys: 0.635 ± 0.012
4.052GluAsp: 4.052 ± 0.039
5.948GluGlu: 5.948 ± 0.071
1.888GluPhe: 1.888 ± 0.022
3.646GluGly: 3.646 ± 0.031
1.316GluHis: 1.316 ± 0.016
3.197GluIle: 3.197 ± 0.029
3.428GluLys: 3.428 ± 0.037
5.155GluLeu: 5.155 ± 0.038
1.572GluMet: 1.572 ± 0.019
2.209GluAsn: 2.209 ± 0.024
2.699GluPro: 2.699 ± 0.038
2.575GluGln: 2.575 ± 0.025
3.934GluArg: 3.934 ± 0.035
4.337GluSer: 4.337 ± 0.038
3.781GluThr: 3.781 ± 0.031
3.459GluVal: 3.459 ± 0.027
0.896GluTrp: 0.896 ± 0.017
1.752GluTyr: 1.752 ± 0.019
0.0GluXaa: 0.0 ± 0.0
Phe
3.104PheAla: 3.104 ± 0.028
0.591PheCys: 0.591 ± 0.012
2.272PheAsp: 2.272 ± 0.024
2.067PheGlu: 2.067 ± 0.02
1.746PhePhe: 1.746 ± 0.025
2.739PheGly: 2.739 ± 0.029
1.002PheHis: 1.002 ± 0.014
1.832PheIle: 1.832 ± 0.023
1.326PheLys: 1.326 ± 0.017
3.798PheLeu: 3.798 ± 0.03
0.777PheMet: 0.777 ± 0.013
1.333PheAsn: 1.333 ± 0.018
2.055PhePro: 2.055 ± 0.021
1.422PheGln: 1.422 ± 0.018
2.037PheArg: 2.037 ± 0.022
3.082PheSer: 3.082 ± 0.03
2.15PheThr: 2.15 ± 0.022
2.371PheVal: 2.371 ± 0.025
0.675PheTrp: 0.675 ± 0.012
1.124PheTyr: 1.124 ± 0.016
0.0PheXaa: 0.0 ± 0.0
Gly
5.152GlyAla: 5.152 ± 0.042
0.884GlyCys: 0.884 ± 0.015
3.583GlyAsp: 3.583 ± 0.03
3.707GlyGlu: 3.707 ± 0.032
2.799GlyPhe: 2.799 ± 0.027
5.783GlyGly: 5.783 ± 0.058
1.62GlyHis: 1.62 ± 0.02
3.449GlyIle: 3.449 ± 0.029
3.085GlyLys: 3.085 ± 0.03
6.153GlyLeu: 6.153 ± 0.035
1.533GlyMet: 1.533 ± 0.021
2.297GlyAsn: 2.297 ± 0.022
3.207GlyPro: 3.207 ± 0.03
2.477GlyGln: 2.477 ± 0.027
4.107GlyArg: 4.107 ± 0.036
5.771GlySer: 5.771 ± 0.056
3.817GlyThr: 3.817 ± 0.027
4.426GlyVal: 4.426 ± 0.031
1.153GlyTrp: 1.153 ± 0.018
2.158GlyTyr: 2.158 ± 0.027
0.0GlyXaa: 0.0 ± 0.0
His
1.915HisAla: 1.915 ± 0.023
0.344HisCys: 0.344 ± 0.008
1.309HisAsp: 1.309 ± 0.019
1.338HisGlu: 1.338 ± 0.017
0.901HisPhe: 0.901 ± 0.013
1.689HisGly: 1.689 ± 0.022
1.078HisHis: 1.078 ± 0.027
1.181HisIle: 1.181 ± 0.013
0.804HisLys: 0.804 ± 0.015
2.404HisLeu: 2.404 ± 0.023
0.467HisMet: 0.467 ± 0.011
0.769HisAsn: 0.769 ± 0.015
1.728HisPro: 1.728 ± 0.022
1.074HisGln: 1.074 ± 0.015
1.669HisArg: 1.669 ± 0.02
1.866HisSer: 1.866 ± 0.023
1.294HisThr: 1.294 ± 0.017
1.434HisVal: 1.434 ± 0.017
0.352HisTrp: 0.352 ± 0.011
0.671HisTyr: 0.671 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
4.102IleAla: 4.102 ± 0.03
0.753IleCys: 0.753 ± 0.013
2.797IleAsp: 2.797 ± 0.026
2.801IleGlu: 2.801 ± 0.024
2.001IlePhe: 2.001 ± 0.019
3.038IleGly: 3.038 ± 0.031
1.235IleHis: 1.235 ± 0.017
2.43IleIle: 2.43 ± 0.027
1.957IleLys: 1.957 ± 0.021
4.755IleLeu: 4.755 ± 0.035
0.997IleMet: 0.997 ± 0.015
1.709IleAsn: 1.709 ± 0.021
3.06IlePro: 3.06 ± 0.025
1.928IleGln: 1.928 ± 0.02
2.826IleArg: 2.826 ± 0.026
3.833IleSer: 3.833 ± 0.028
2.738IleThr: 2.738 ± 0.025
3.152IleVal: 3.152 ± 0.03
0.688IleTrp: 0.688 ± 0.015
1.447IleTyr: 1.447 ± 0.018
0.0IleXaa: 0.0 ± 0.0
Lys
3.772LysAla: 3.772 ± 0.034
0.459LysCys: 0.459 ± 0.011
2.38LysAsp: 2.38 ± 0.022
3.106LysGlu: 3.106 ± 0.034
1.253LysPhe: 1.253 ± 0.018
2.556LysGly: 2.556 ± 0.027
0.986LysHis: 0.986 ± 0.015
2.109LysIle: 2.109 ± 0.024
2.969LysLys: 2.969 ± 0.04
3.663LysLeu: 3.663 ± 0.03
0.946LysMet: 0.946 ± 0.014
1.551LysAsn: 1.551 ± 0.02
2.378LysPro: 2.378 ± 0.025
1.754LysGln: 1.754 ± 0.023
3.206LysArg: 3.206 ± 0.032
3.025LysSer: 3.025 ± 0.026
2.661LysThr: 2.661 ± 0.025
2.47LysVal: 2.47 ± 0.026
0.592LysTrp: 0.592 ± 0.009
1.217LysTyr: 1.217 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
8.571LeuAla: 8.571 ± 0.049
1.323LeuCys: 1.323 ± 0.018
5.489LeuAsp: 5.489 ± 0.034
5.613LeuGlu: 5.613 ± 0.04
3.638LeuPhe: 3.638 ± 0.032
6.044LeuGly: 6.044 ± 0.042
2.352LeuHis: 2.352 ± 0.024
3.974LeuIle: 3.974 ± 0.034
3.891LeuLys: 3.891 ± 0.035
9.204LeuLeu: 9.204 ± 0.061
1.78LeuMet: 1.78 ± 0.021
3.08LeuAsn: 3.08 ± 0.028
5.811LeuPro: 5.811 ± 0.04
4.105LeuGln: 4.105 ± 0.035
6.068LeuArg: 6.068 ± 0.042
7.727LeuSer: 7.727 ± 0.045
4.956LeuThr: 4.956 ± 0.034
5.967LeuVal: 5.967 ± 0.045
1.25LeuTrp: 1.25 ± 0.018
2.547LeuTyr: 2.547 ± 0.025
0.0LeuXaa: 0.0 ± 0.0
Met
2.204MetAla: 2.204 ± 0.025
0.23MetCys: 0.23 ± 0.007
1.275MetAsp: 1.275 ± 0.018
1.294MetGlu: 1.294 ± 0.017
0.73MetPhe: 0.73 ± 0.014
1.477MetGly: 1.477 ± 0.017
0.473MetHis: 0.473 ± 0.012
1.008MetIle: 1.008 ± 0.016
0.959MetLys: 0.959 ± 0.014
1.893MetLeu: 1.893 ± 0.02
0.586MetMet: 0.586 ± 0.012
0.778MetAsn: 0.778 ± 0.013
1.172MetPro: 1.172 ± 0.017
0.873MetGln: 0.873 ± 0.016
1.225MetArg: 1.225 ± 0.017
1.773MetSer: 1.773 ± 0.018
1.283MetThr: 1.283 ± 0.018
1.351MetVal: 1.351 ± 0.018
0.249MetTrp: 0.249 ± 0.008
0.523MetTyr: 0.523 ± 0.009
0.0MetXaa: 0.0 ± 0.0
Asn
2.856AsnAla: 2.856 ± 0.026
0.421AsnCys: 0.421 ± 0.012
1.763AsnAsp: 1.763 ± 0.02
1.929AsnGlu: 1.929 ± 0.025
1.211AsnPhe: 1.211 ± 0.018
2.666AsnGly: 2.666 ± 0.029
0.835AsnHis: 0.835 ± 0.013
1.874AsnIle: 1.874 ± 0.022
1.361AsnLys: 1.361 ± 0.019
3.134AsnLeu: 3.134 ± 0.03
0.766AsnMet: 0.766 ± 0.014
1.569AsnAsn: 1.569 ± 0.027
2.341AsnPro: 2.341 ± 0.023
1.256AsnGln: 1.256 ± 0.019
1.975AsnArg: 1.975 ± 0.023
2.459AsnSer: 2.459 ± 0.02
2.095AsnThr: 2.095 ± 0.022
2.083AsnVal: 2.083 ± 0.023
0.512AsnTrp: 0.512 ± 0.01
0.984AsnTyr: 0.984 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
5.409ProAla: 5.409 ± 0.051
0.561ProCys: 0.561 ± 0.013
3.192ProAsp: 3.192 ± 0.025
3.692ProGlu: 3.692 ± 0.032
2.139ProPhe: 2.139 ± 0.022
3.895ProGly: 3.895 ± 0.033
1.318ProHis: 1.318 ± 0.018
2.392ProIle: 2.392 ± 0.024
2.205ProLys: 2.205 ± 0.022
5.03ProLeu: 5.03 ± 0.036
1.017ProMet: 1.017 ± 0.015
1.871ProAsn: 1.871 ± 0.018
5.219ProPro: 5.219 ± 0.09
2.307ProGln: 2.307 ± 0.031
3.489ProArg: 3.489 ± 0.028
6.109ProSer: 6.109 ± 0.057
3.745ProThr: 3.745 ± 0.035
3.866ProVal: 3.866 ± 0.036
0.777ProTrp: 0.777 ± 0.012
1.49ProTyr: 1.49 ± 0.019
0.0ProXaa: 0.0 ± 0.0
Gln
3.562GlnAla: 3.562 ± 0.033
0.449GlnCys: 0.449 ± 0.01
2.046GlnAsp: 2.046 ± 0.024
2.455GlnGlu: 2.455 ± 0.025
1.285GlnPhe: 1.285 ± 0.017
2.461GlnGly: 2.461 ± 0.024
1.091GlnHis: 1.091 ± 0.018
1.902GlnIle: 1.902 ± 0.02
1.848GlnLys: 1.848 ± 0.021
3.597GlnLeu: 3.597 ± 0.03
0.913GlnMet: 0.913 ± 0.017
1.467GlnAsn: 1.467 ± 0.018
2.565GlnPro: 2.565 ± 0.032
2.959GlnGln: 2.959 ± 0.066
2.721GlnArg: 2.721 ± 0.026
3.122GlnSer: 3.122 ± 0.03
2.418GlnThr: 2.418 ± 0.023
2.246GlnVal: 2.246 ± 0.022
0.57GlnTrp: 0.57 ± 0.01
1.132GlnTyr: 1.132 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
4.825ArgAla: 4.825 ± 0.034
0.778ArgCys: 0.778 ± 0.015
3.375ArgAsp: 3.375 ± 0.037
3.995ArgGlu: 3.995 ± 0.039
2.351ArgPhe: 2.351 ± 0.023
3.793ArgGly: 3.793 ± 0.037
1.64ArgHis: 1.64 ± 0.019
2.933ArgIle: 2.933 ± 0.024
3.247ArgLys: 3.247 ± 0.031
6.014ArgLeu: 6.014 ± 0.041
1.309ArgMet: 1.309 ± 0.015
2.148ArgAsn: 2.148 ± 0.021
3.535ArgPro: 3.535 ± 0.031
2.765ArgGln: 2.765 ± 0.026
5.432ArgArg: 5.432 ± 0.054
4.875ArgSer: 4.875 ± 0.043
3.259ArgThr: 3.259 ± 0.032
3.585ArgVal: 3.585 ± 0.028
0.986ArgTrp: 0.986 ± 0.014
1.744ArgTyr: 1.744 ± 0.023
0.0ArgXaa: 0.0 ± 0.0
Ser
6.784SerAla: 6.784 ± 0.043
0.936SerCys: 0.936 ± 0.015
4.132SerAsp: 4.132 ± 0.036
4.117SerGlu: 4.117 ± 0.034
3.136SerPhe: 3.136 ± 0.026
5.403SerGly: 5.403 ± 0.062
2.039SerHis: 2.039 ± 0.023
4.02SerIle: 4.02 ± 0.032
3.336SerLys: 3.336 ± 0.032
7.952SerLeu: 7.952 ± 0.044
1.687SerMet: 1.687 ± 0.019
2.775SerAsn: 2.775 ± 0.024
5.617SerPro: 5.617 ± 0.051
3.258SerGln: 3.258 ± 0.028
5.139SerArg: 5.139 ± 0.045
9.985SerSer: 9.985 ± 0.106
5.75SerThr: 5.75 ± 0.047
4.855SerVal: 4.855 ± 0.034
1.129SerTrp: 1.129 ± 0.016
2.065SerTyr: 2.065 ± 0.022
0.0SerXaa: 0.0 ± 0.0
Thr
5.74ThrAla: 5.74 ± 0.037
0.753ThrCys: 0.753 ± 0.014
3.005ThrAsp: 3.005 ± 0.027
3.15ThrGlu: 3.15 ± 0.029
2.185ThrPhe: 2.185 ± 0.021
4.262ThrGly: 4.262 ± 0.035
1.283ThrHis: 1.283 ± 0.018
2.99ThrIle: 2.99 ± 0.026
2.253ThrLys: 2.253 ± 0.021
5.475ThrLeu: 5.475 ± 0.031
1.194ThrMet: 1.194 ± 0.018
1.923ThrAsn: 1.923 ± 0.023
4.119ThrPro: 4.119 ± 0.039
2.048ThrGln: 2.048 ± 0.024
3.172ThrArg: 3.172 ± 0.025
5.185ThrSer: 5.185 ± 0.041
4.909ThrThr: 4.909 ± 0.065
4.095ThrVal: 4.095 ± 0.032
0.833ThrTrp: 0.833 ± 0.016
1.585ThrTyr: 1.585 ± 0.02
0.0ThrXaa: 0.0 ± 0.0
Val
5.42ValAla: 5.42 ± 0.041
0.903ValCys: 0.903 ± 0.017
3.849ValAsp: 3.849 ± 0.036
3.87ValGlu: 3.87 ± 0.032
2.632ValPhe: 2.632 ± 0.026
3.929ValGly: 3.929 ± 0.029
1.473ValHis: 1.473 ± 0.018
2.989ValIle: 2.989 ± 0.029
2.613ValLys: 2.613 ± 0.026
5.959ValLeu: 5.959 ± 0.045
1.336ValMet: 1.336 ± 0.019
2.105ValAsn: 2.105 ± 0.02
3.596ValPro: 3.596 ± 0.031
2.509ValGln: 2.509 ± 0.024
3.571ValArg: 3.571 ± 0.025
5.065ValSer: 5.065 ± 0.037
3.613ValThr: 3.613 ± 0.031
4.416ValVal: 4.416 ± 0.038
0.932ValTrp: 0.932 ± 0.016
1.805ValTyr: 1.805 ± 0.019
0.0ValXaa: 0.0 ± 0.0
Trp
1.161TrpAla: 1.161 ± 0.017
0.187TrpCys: 0.187 ± 0.006
0.86TrpAsp: 0.86 ± 0.016
0.892TrpGlu: 0.892 ± 0.015
0.529TrpPhe: 0.529 ± 0.01
0.89TrpGly: 0.89 ± 0.016
0.352TrpHis: 0.352 ± 0.009
0.749TrpIle: 0.749 ± 0.012
0.754TrpLys: 0.754 ± 0.014
1.413TrpLeu: 1.413 ± 0.02
0.392TrpMet: 0.392 ± 0.009
0.609TrpAsn: 0.609 ± 0.011
0.615TrpPro: 0.615 ± 0.013
0.593TrpGln: 0.593 ± 0.011
0.998TrpArg: 0.998 ± 0.016
1.074TrpSer: 1.074 ± 0.016
0.953TrpThr: 0.953 ± 0.014
0.897TrpVal: 0.897 ± 0.015
0.275TrpTrp: 0.275 ± 0.008
0.429TrpTyr: 0.429 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.202TyrAla: 2.202 ± 0.023
0.418TyrCys: 0.418 ± 0.011
1.591TyrAsp: 1.591 ± 0.02
1.561TyrGlu: 1.561 ± 0.02
1.19TyrPhe: 1.19 ± 0.016
2.034TyrGly: 2.034 ± 0.023
0.754TyrHis: 0.754 ± 0.013
1.442TyrIle: 1.442 ± 0.019
0.958TyrLys: 0.958 ± 0.015
2.837TyrLeu: 2.837 ± 0.024
0.621TyrMet: 0.621 ± 0.012
1.049TyrAsn: 1.049 ± 0.014
1.511TyrPro: 1.511 ± 0.02
1.115TyrGln: 1.115 ± 0.018
1.694TyrArg: 1.694 ± 0.02
2.105TyrSer: 2.105 ± 0.023
1.704TyrThr: 1.704 ± 0.022
1.6TyrVal: 1.6 ± 0.019
0.445TyrTrp: 0.445 ± 0.01
0.99TyrTyr: 0.99 ± 0.017
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9870 proteins (4774852 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski