Amino acid dipepetide frequency for Epicoccum nigrum (Soil fungus) (Epicoccum purpurascens)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.035AlaAla: 10.035 ± 0.073
1.091AlaCys: 1.091 ± 0.016
4.487AlaAsp: 4.487 ± 0.027
5.361AlaGlu: 5.361 ± 0.044
3.241AlaPhe: 3.241 ± 0.028
6.117AlaGly: 6.117 ± 0.046
2.01AlaHis: 2.01 ± 0.024
4.165AlaIle: 4.165 ± 0.034
4.391AlaLys: 4.391 ± 0.035
8.143AlaLeu: 8.143 ± 0.047
2.052AlaMet: 2.052 ± 0.019
3.148AlaAsn: 3.148 ± 0.024
5.565AlaPro: 5.565 ± 0.055
3.832AlaGln: 3.832 ± 0.032
5.2AlaArg: 5.2 ± 0.033
7.677AlaSer: 7.677 ± 0.047
5.654AlaThr: 5.654 ± 0.035
5.597AlaVal: 5.597 ± 0.039
1.225AlaTrp: 1.225 ± 0.017
2.327AlaTyr: 2.327 ± 0.022
0.0AlaXaa: 0.0 ± 0.0
Cys
1.003CysAla: 1.003 ± 0.015
0.256CysCys: 0.256 ± 0.008
0.647CysAsp: 0.647 ± 0.012
0.583CysGlu: 0.583 ± 0.011
0.502CysPhe: 0.502 ± 0.011
0.956CysGly: 0.956 ± 0.015
0.3CysHis: 0.3 ± 0.008
0.701CysIle: 0.701 ± 0.012
0.528CysLys: 0.528 ± 0.01
1.143CysLeu: 1.143 ± 0.016
0.279CysMet: 0.279 ± 0.007
0.466CysAsn: 0.466 ± 0.01
0.611CysPro: 0.611 ± 0.012
0.419CysGln: 0.419 ± 0.009
0.705CysArg: 0.705 ± 0.014
0.888CysSer: 0.888 ± 0.013
0.748CysThr: 0.748 ± 0.012
0.818CysVal: 0.818 ± 0.013
0.196CysTrp: 0.196 ± 0.006
0.366CysTyr: 0.366 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
5.262AspAla: 5.262 ± 0.032
0.617AspCys: 0.617 ± 0.012
4.247AspAsp: 4.247 ± 0.037
4.414AspGlu: 4.414 ± 0.033
2.214AspPhe: 2.214 ± 0.024
3.92AspGly: 3.92 ± 0.032
1.237AspHis: 1.237 ± 0.016
2.901AspIle: 2.901 ± 0.024
2.475AspLys: 2.475 ± 0.028
4.941AspLeu: 4.941 ± 0.035
1.309AspMet: 1.309 ± 0.014
1.858AspAsn: 1.858 ± 0.019
3.101AspPro: 3.101 ± 0.029
1.827AspGln: 1.827 ± 0.018
3.017AspArg: 3.017 ± 0.033
3.962AspSer: 3.962 ± 0.027
3.098AspThr: 3.098 ± 0.025
3.856AspVal: 3.856 ± 0.027
0.91AspTrp: 0.91 ± 0.014
1.589AspTyr: 1.589 ± 0.018
0.0AspXaa: 0.0 ± 0.0
Glu
5.618GluAla: 5.618 ± 0.046
0.597GluCys: 0.597 ± 0.011
4.212GluAsp: 4.212 ± 0.03
5.434GluGlu: 5.434 ± 0.054
1.832GluPhe: 1.832 ± 0.018
3.955GluGly: 3.955 ± 0.031
1.517GluHis: 1.517 ± 0.019
2.778GluIle: 2.778 ± 0.024
3.71GluLys: 3.71 ± 0.032
5.166GluLeu: 5.166 ± 0.034
1.425GluMet: 1.425 ± 0.018
2.036GluAsn: 2.036 ± 0.019
2.766GluPro: 2.766 ± 0.04
2.669GluGln: 2.669 ± 0.026
4.044GluArg: 4.044 ± 0.035
3.954GluSer: 3.954 ± 0.03
3.352GluThr: 3.352 ± 0.029
3.589GluVal: 3.589 ± 0.026
0.888GluTrp: 0.888 ± 0.014
1.643GluTyr: 1.643 ± 0.018
0.0GluXaa: 0.0 ± 0.0
Phe
3.249PheAla: 3.249 ± 0.025
0.532PheCys: 0.532 ± 0.009
2.307PheAsp: 2.307 ± 0.02
2.172PheGlu: 2.172 ± 0.023
1.584PhePhe: 1.584 ± 0.021
2.83PheGly: 2.83 ± 0.029
0.848PheHis: 0.848 ± 0.014
1.645PheIle: 1.645 ± 0.017
1.559PheLys: 1.559 ± 0.019
3.275PheLeu: 3.275 ± 0.029
0.763PheMet: 0.763 ± 0.011
1.41PheAsn: 1.41 ± 0.017
1.837PhePro: 1.837 ± 0.02
1.345PheGln: 1.345 ± 0.018
1.83PheArg: 1.83 ± 0.019
2.844PheSer: 2.844 ± 0.026
2.218PheThr: 2.218 ± 0.025
2.41PheVal: 2.41 ± 0.022
0.639PheTrp: 0.639 ± 0.011
1.049PheTyr: 1.049 ± 0.014
0.0PheXaa: 0.0 ± 0.0
Gly
5.768GlyAla: 5.768 ± 0.047
0.886GlyCys: 0.886 ± 0.014
3.599GlyAsp: 3.599 ± 0.034
3.745GlyGlu: 3.745 ± 0.03
2.749GlyPhe: 2.749 ± 0.028
5.927GlyGly: 5.927 ± 0.063
1.653GlyHis: 1.653 ± 0.021
3.329GlyIle: 3.329 ± 0.028
3.512GlyLys: 3.512 ± 0.03
5.916GlyLeu: 5.916 ± 0.04
1.62GlyMet: 1.62 ± 0.022
2.498GlyAsn: 2.498 ± 0.028
3.128GlyPro: 3.128 ± 0.029
2.536GlyGln: 2.536 ± 0.025
4.13GlyArg: 4.13 ± 0.035
5.522GlySer: 5.522 ± 0.043
3.966GlyThr: 3.966 ± 0.031
4.412GlyVal: 4.412 ± 0.03
1.17GlyTrp: 1.17 ± 0.016
2.149GlyTyr: 2.149 ± 0.026
0.0GlyXaa: 0.0 ± 0.0
His
2.028HisAla: 2.028 ± 0.02
0.333HisCys: 0.333 ± 0.008
1.383HisAsp: 1.383 ± 0.017
1.373HisGlu: 1.373 ± 0.017
0.913HisPhe: 0.913 ± 0.015
1.689HisGly: 1.689 ± 0.019
0.863HisHis: 0.863 ± 0.017
1.215HisIle: 1.215 ± 0.016
1.014HisLys: 1.014 ± 0.013
2.236HisLeu: 2.236 ± 0.021
0.521HisMet: 0.521 ± 0.01
0.889HisAsn: 0.889 ± 0.013
1.643HisPro: 1.643 ± 0.022
1.011HisGln: 1.011 ± 0.017
1.507HisArg: 1.507 ± 0.019
1.867HisSer: 1.867 ± 0.022
1.411HisThr: 1.411 ± 0.013
1.517HisVal: 1.517 ± 0.017
0.344HisTrp: 0.344 ± 0.008
0.688HisTyr: 0.688 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
4.363IleAla: 4.363 ± 0.035
0.693IleCys: 0.693 ± 0.013
2.831IleAsp: 2.831 ± 0.023
2.827IleGlu: 2.827 ± 0.024
1.848IlePhe: 1.848 ± 0.022
3.094IleGly: 3.094 ± 0.03
1.096IleHis: 1.096 ± 0.015
2.271IleIle: 2.271 ± 0.026
2.151IleLys: 2.151 ± 0.021
4.135IleLeu: 4.135 ± 0.031
0.978IleMet: 0.978 ± 0.016
1.734IleAsn: 1.734 ± 0.018
2.823IlePro: 2.823 ± 0.024
1.757IleGln: 1.757 ± 0.021
2.605IleArg: 2.605 ± 0.025
3.539IleSer: 3.539 ± 0.027
2.855IleThr: 2.855 ± 0.026
3.137IleVal: 3.137 ± 0.027
0.701IleTrp: 0.701 ± 0.012
1.339IleTyr: 1.339 ± 0.016
0.0IleXaa: 0.0 ± 0.0
Lys
4.542LysAla: 4.542 ± 0.035
0.492LysCys: 0.492 ± 0.01
2.969LysAsp: 2.969 ± 0.026
3.525LysGlu: 3.525 ± 0.035
1.424LysPhe: 1.424 ± 0.02
3.106LysGly: 3.106 ± 0.026
1.22LysHis: 1.22 ± 0.017
2.242LysIle: 2.242 ± 0.02
3.613LysLys: 3.613 ± 0.046
4.077LysLeu: 4.077 ± 0.034
1.077LysMet: 1.077 ± 0.015
1.779LysAsn: 1.779 ± 0.019
2.768LysPro: 2.768 ± 0.028
2.005LysGln: 2.005 ± 0.021
3.429LysArg: 3.429 ± 0.029
3.475LysSer: 3.475 ± 0.034
2.974LysThr: 2.974 ± 0.029
2.826LysVal: 2.826 ± 0.025
0.703LysTrp: 0.703 ± 0.012
1.372LysTyr: 1.372 ± 0.019
0.0LysXaa: 0.0 ± 0.0
Leu
7.868LeuAla: 7.868 ± 0.038
1.185LeuCys: 1.185 ± 0.018
5.107LeuAsp: 5.107 ± 0.038
5.33LeuGlu: 5.33 ± 0.044
3.196LeuPhe: 3.196 ± 0.029
5.744LeuGly: 5.744 ± 0.033
2.258LeuHis: 2.258 ± 0.021
3.744LeuIle: 3.744 ± 0.034
4.222LeuLys: 4.222 ± 0.03
8.106LeuLeu: 8.106 ± 0.058
1.722LeuMet: 1.722 ± 0.017
3.151LeuAsn: 3.151 ± 0.027
5.53LeuPro: 5.53 ± 0.036
3.901LeuGln: 3.901 ± 0.034
5.651LeuArg: 5.651 ± 0.038
7.001LeuSer: 7.001 ± 0.038
4.852LeuThr: 4.852 ± 0.035
5.191LeuVal: 5.191 ± 0.037
1.195LeuTrp: 1.195 ± 0.015
2.332LeuTyr: 2.332 ± 0.022
0.0LeuXaa: 0.0 ± 0.0
Met
2.206MetAla: 2.206 ± 0.021
0.257MetCys: 0.257 ± 0.007
1.223MetAsp: 1.223 ± 0.017
1.193MetGlu: 1.193 ± 0.018
0.773MetPhe: 0.773 ± 0.013
1.441MetGly: 1.441 ± 0.018
0.541MetHis: 0.541 ± 0.01
0.936MetIle: 0.936 ± 0.012
1.021MetLys: 1.021 ± 0.015
1.918MetLeu: 1.918 ± 0.019
0.59MetMet: 0.59 ± 0.012
0.808MetAsn: 0.808 ± 0.014
1.327MetPro: 1.327 ± 0.015
0.975MetGln: 0.975 ± 0.014
1.336MetArg: 1.336 ± 0.015
1.91MetSer: 1.91 ± 0.019
1.289MetThr: 1.289 ± 0.015
1.274MetVal: 1.274 ± 0.015
0.299MetTrp: 0.299 ± 0.007
0.583MetTyr: 0.583 ± 0.009
0.0MetXaa: 0.0 ± 0.0
Asn
3.389AsnAla: 3.389 ± 0.03
0.429AsnCys: 0.429 ± 0.01
2.014AsnAsp: 2.014 ± 0.019
1.991AsnGlu: 1.991 ± 0.019
1.38AsnPhe: 1.38 ± 0.018
2.924AsnGly: 2.924 ± 0.028
0.82AsnHis: 0.82 ± 0.013
1.969AsnIle: 1.969 ± 0.019
1.638AsnLys: 1.638 ± 0.02
3.113AsnLeu: 3.113 ± 0.03
0.881AsnMet: 0.881 ± 0.014
1.537AsnAsn: 1.537 ± 0.024
2.293AsnPro: 2.293 ± 0.023
1.292AsnGln: 1.292 ± 0.018
1.827AsnArg: 1.827 ± 0.02
2.623AsnSer: 2.623 ± 0.022
2.323AsnThr: 2.323 ± 0.024
2.356AsnVal: 2.356 ± 0.02
0.554AsnTrp: 0.554 ± 0.01
1.063AsnTyr: 1.063 ± 0.015
0.0AsnXaa: 0.0 ± 0.0
Pro
5.832ProAla: 5.832 ± 0.055
0.497ProCys: 0.497 ± 0.01
3.0ProAsp: 3.0 ± 0.03
3.584ProGlu: 3.584 ± 0.034
2.028ProPhe: 2.028 ± 0.018
3.714ProGly: 3.714 ± 0.032
1.423ProHis: 1.423 ± 0.02
2.483ProIle: 2.483 ± 0.022
2.749ProLys: 2.749 ± 0.029
4.684ProLeu: 4.684 ± 0.034
1.105ProMet: 1.105 ± 0.016
2.158ProAsn: 2.158 ± 0.023
5.115ProPro: 5.115 ± 0.072
2.622ProGln: 2.622 ± 0.034
3.396ProArg: 3.396 ± 0.027
5.85ProSer: 5.85 ± 0.042
4.232ProThr: 4.232 ± 0.039
3.56ProVal: 3.56 ± 0.034
0.719ProTrp: 0.719 ± 0.013
1.527ProTyr: 1.527 ± 0.021
0.0ProXaa: 0.0 ± 0.0
Gln
3.632GlnAla: 3.632 ± 0.029
0.462GlnCys: 0.462 ± 0.01
2.132GlnAsp: 2.132 ± 0.023
2.391GlnGlu: 2.391 ± 0.021
1.277GlnPhe: 1.277 ± 0.017
2.451GlnGly: 2.451 ± 0.027
1.256GlnHis: 1.256 ± 0.018
1.869GlnIle: 1.869 ± 0.02
2.036GlnLys: 2.036 ± 0.021
3.539GlnLeu: 3.539 ± 0.031
0.918GlnMet: 0.918 ± 0.016
1.53GlnAsn: 1.53 ± 0.02
2.618GlnPro: 2.618 ± 0.034
2.877GlnGln: 2.877 ± 0.06
2.794GlnArg: 2.794 ± 0.026
3.137GlnSer: 3.137 ± 0.032
2.443GlnThr: 2.443 ± 0.023
2.179GlnVal: 2.179 ± 0.022
0.594GlnTrp: 0.594 ± 0.011
1.274GlnTyr: 1.274 ± 0.016
0.0GlnXaa: 0.0 ± 0.0
Arg
4.94ArgAla: 4.94 ± 0.031
0.719ArgCys: 0.719 ± 0.013
3.289ArgAsp: 3.289 ± 0.032
3.755ArgGlu: 3.755 ± 0.035
2.048ArgPhe: 2.048 ± 0.022
3.722ArgGly: 3.722 ± 0.033
1.566ArgHis: 1.566 ± 0.019
2.8ArgIle: 2.8 ± 0.024
3.505ArgLys: 3.505 ± 0.031
5.279ArgLeu: 5.279 ± 0.038
1.33ArgMet: 1.33 ± 0.013
2.262ArgAsn: 2.262 ± 0.021
3.453ArgPro: 3.453 ± 0.033
2.611ArgGln: 2.611 ± 0.024
4.954ArgArg: 4.954 ± 0.044
4.804ArgSer: 4.804 ± 0.038
3.427ArgThr: 3.427 ± 0.028
3.35ArgVal: 3.35 ± 0.027
0.923ArgTrp: 0.923 ± 0.015
1.634ArgTyr: 1.634 ± 0.018
0.0ArgXaa: 0.0 ± 0.0
Ser
7.013SerAla: 7.013 ± 0.048
0.857SerCys: 0.857 ± 0.015
4.103SerAsp: 4.103 ± 0.032
3.98SerGlu: 3.98 ± 0.031
2.922SerPhe: 2.922 ± 0.025
5.325SerGly: 5.325 ± 0.036
1.921SerHis: 1.921 ± 0.019
3.831SerIle: 3.831 ± 0.031
3.778SerLys: 3.778 ± 0.03
6.784SerLeu: 6.784 ± 0.037
1.728SerMet: 1.728 ± 0.02
2.991SerAsn: 2.991 ± 0.027
5.415SerPro: 5.415 ± 0.046
3.186SerGln: 3.186 ± 0.03
4.727SerArg: 4.727 ± 0.04
8.347SerSer: 8.347 ± 0.072
5.745SerThr: 5.745 ± 0.043
4.569SerVal: 4.569 ± 0.033
1.095SerTrp: 1.095 ± 0.015
2.139SerTyr: 2.139 ± 0.02
0.0SerXaa: 0.0 ± 0.0
Thr
5.63ThrAla: 5.63 ± 0.034
0.78ThrCys: 0.78 ± 0.014
2.959ThrAsp: 2.959 ± 0.027
3.137ThrGlu: 3.137 ± 0.026
2.297ThrPhe: 2.297 ± 0.024
4.155ThrGly: 4.155 ± 0.034
1.384ThrHis: 1.384 ± 0.017
3.041ThrIle: 3.041 ± 0.026
2.734ThrLys: 2.734 ± 0.024
5.372ThrLeu: 5.372 ± 0.034
1.236ThrMet: 1.236 ± 0.016
2.183ThrAsn: 2.183 ± 0.02
4.613ThrPro: 4.613 ± 0.039
2.307ThrGln: 2.307 ± 0.025
3.172ThrArg: 3.172 ± 0.024
5.443ThrSer: 5.443 ± 0.043
4.447ThrThr: 4.447 ± 0.04
3.759ThrVal: 3.759 ± 0.03
0.873ThrTrp: 0.873 ± 0.013
1.72ThrTyr: 1.72 ± 0.019
0.0ThrXaa: 0.0 ± 0.0
Val
5.427ValAla: 5.427 ± 0.036
0.819ValCys: 0.819 ± 0.013
3.66ValAsp: 3.66 ± 0.027
3.984ValGlu: 3.984 ± 0.033
2.396ValPhe: 2.396 ± 0.024
4.085ValGly: 4.085 ± 0.032
1.463ValHis: 1.463 ± 0.02
2.736ValIle: 2.736 ± 0.025
2.987ValLys: 2.987 ± 0.03
5.596ValLeu: 5.596 ± 0.034
1.332ValMet: 1.332 ± 0.016
2.172ValAsn: 2.172 ± 0.025
3.583ValPro: 3.583 ± 0.029
2.534ValGln: 2.534 ± 0.022
3.51ValArg: 3.51 ± 0.028
4.501ValSer: 4.501 ± 0.036
3.519ValThr: 3.519 ± 0.028
4.382ValVal: 4.382 ± 0.037
0.885ValTrp: 0.885 ± 0.014
1.731ValTyr: 1.731 ± 0.017
0.0ValXaa: 0.0 ± 0.0
Trp
1.189TrpAla: 1.189 ± 0.015
0.207TrpCys: 0.207 ± 0.006
0.891TrpAsp: 0.891 ± 0.014
0.823TrpGlu: 0.823 ± 0.014
0.541TrpPhe: 0.541 ± 0.01
0.94TrpGly: 0.94 ± 0.015
0.353TrpHis: 0.353 ± 0.008
0.751TrpIle: 0.751 ± 0.012
0.819TrpLys: 0.819 ± 0.013
1.331TrpLeu: 1.331 ± 0.018
0.381TrpMet: 0.381 ± 0.009
0.642TrpAsn: 0.642 ± 0.011
0.615TrpPro: 0.615 ± 0.012
0.59TrpGln: 0.59 ± 0.01
0.96TrpArg: 0.96 ± 0.013
1.067TrpSer: 1.067 ± 0.014
0.95TrpThr: 0.95 ± 0.015
0.868TrpVal: 0.868 ± 0.012
0.283TrpTrp: 0.283 ± 0.008
0.455TrpTyr: 0.455 ± 0.009
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.325TyrAla: 2.325 ± 0.024
0.405TyrCys: 0.405 ± 0.008
1.694TyrAsp: 1.694 ± 0.019
1.582TyrGlu: 1.582 ± 0.022
1.181TyrPhe: 1.181 ± 0.015
2.12TyrGly: 2.12 ± 0.024
0.718TyrHis: 0.718 ± 0.012
1.37TyrIle: 1.37 ± 0.019
1.179TyrLys: 1.179 ± 0.016
2.532TyrLeu: 2.532 ± 0.025
0.654TyrMet: 0.654 ± 0.011
1.165TyrAsn: 1.165 ± 0.014
1.473TyrPro: 1.473 ± 0.018
1.114TyrGln: 1.114 ± 0.014
1.578TyrArg: 1.578 ± 0.017
2.044TyrSer: 2.044 ± 0.021
1.738TyrThr: 1.738 ± 0.021
1.65TyrVal: 1.65 ± 0.018
0.457TyrTrp: 0.457 ± 0.011
0.925TyrTyr: 0.925 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12021 proteins (5572407 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski