Amino acid dipeptide frequency for Edhazardia aedis (strain USNM 41457) (Microsporidian parasite)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
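As a minimal sketch of how such frequencies can be counted, the snippet below slides a window of two residues over each sequence and reports the result in per mille. It assumes one-letter amino acid codes (the table on this page uses three-letter codes), and the function name and toy sequences are hypothetical. It is not necessarily the procedure used to build this page.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for an iterable of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome)
toy = ["MKKNI", "MKN"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 1))
```

Counting per protein rather than over one concatenated string avoids creating artificial pairs at protein boundaries.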

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
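The comparison against the uniform expectation can be illustrated with a short sketch. The observed values are copied from the table on this page. The `classify` helper and the plain greater-than/less-than rule are assumptions, since the page does not state the exact threshold used for the colouring.

```python
N_STANDARD = 20
# Uniform expectation over 20 x 20 dipeptides, expressed in per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)  # 2.5

# A few observed values (per mille) copied from the table.
observed = {"LysAsn": 13.129, "LysLys": 12.826, "AlaAla": 1.491, "TrpTrp": 0.023}

def classify(value, expected=expected_permille):
    """Hypothetical rule: compare a value with the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

for pair, value in observed.items():
    ratio = value / expected_permille
    print(f"{pair}: {value} per mille, {ratio:.2f}x expected, {classify(value)}")
```

A uniform expectation ignores the fact that single amino acids are themselves unevenly frequent, so a high ratio for a pair such as LysAsn partly reflects how common Lys and Asn are in this proteome.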

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
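A small worked conversion may help here. The GluLys value and the protein and residue totals are taken from this page. The assumption that each protein of length n contributes n - 1 dipeptides (so that the total is residues minus proteins) is mine and is not stated by the source.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3

glu_lys = 9.213                      # GluLys frequency from the table, in per mille
fraction = permille_to_fraction(glu_lys)
percent = fraction * 100

n_proteins = 4189                    # from the statistics line on this page
n_residues = 1478472                 # from the statistics line on this page
# Assumption: a protein of length n yields n - 1 overlapping dipeptides.
n_windows = n_residues - n_proteins
approx_count = fraction * n_windows  # rough number of GluLys occurrences

print(fraction, percent, n_windows, round(approx_count))
```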

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue. Second residues follow the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.491 ± 0.074
AlaCys: 0.594 ± 0.022
AlaAsp: 1.815 ± 0.037
AlaGlu: 2.319 ± 0.054
AlaPhe: 1.844 ± 0.038
AlaGly: 1.193 ± 0.044
AlaHis: 0.639 ± 0.022
AlaIle: 2.943 ± 0.048
AlaLys: 3.147 ± 0.058
AlaLeu: 3.0 ± 0.061
AlaMet: 0.624 ± 0.021
AlaAsn: 2.591 ± 0.052
AlaPro: 0.783 ± 0.03
AlaGln: 1.163 ± 0.03
AlaArg: 1.159 ± 0.03
AlaSer: 2.448 ± 0.051
AlaThr: 1.552 ± 0.034
AlaVal: 1.669 ± 0.044
AlaTrp: 0.149 ± 0.011
AlaTyr: 1.259 ± 0.029
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.733 ± 0.026
CysCys: 0.496 ± 0.017
CysAsp: 1.351 ± 0.029
CysGlu: 1.452 ± 0.034
CysPhe: 1.332 ± 0.03
CysGly: 0.905 ± 0.03
CysHis: 0.282 ± 0.013
CysIle: 1.702 ± 0.036
CysLys: 1.862 ± 0.035
CysLeu: 1.883 ± 0.037
CysMet: 0.344 ± 0.014
CysAsn: 1.526 ± 0.036
CysPro: 0.433 ± 0.023
CysGln: 0.469 ± 0.02
CysArg: 0.666 ± 0.022
CysSer: 1.479 ± 0.033
CysThr: 0.843 ± 0.027
CysVal: 1.111 ± 0.025
CysTrp: 0.091 ± 0.008
CysTyr: 0.802 ± 0.027
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.84 ± 0.034
AspCys: 1.054 ± 0.028
AspAsp: 3.727 ± 0.061
AspGlu: 4.555 ± 0.069
AspPhe: 3.904 ± 0.056
AspGly: 1.757 ± 0.043
AspHis: 0.971 ± 0.025
AspIle: 5.616 ± 0.08
AspLys: 5.611 ± 0.079
AspLeu: 5.2 ± 0.063
AspMet: 1.05 ± 0.028
AspAsn: 5.159 ± 0.1
AspPro: 1.286 ± 0.034
AspGln: 1.797 ± 0.046
AspArg: 1.668 ± 0.044
AspSer: 4.667 ± 0.064
AspThr: 2.56 ± 0.042
AspVal: 2.86 ± 0.044
AspTrp: 0.237 ± 0.012
AspTyr: 2.305 ± 0.042
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.098 ± 0.055
GluCys: 1.235 ± 0.032
GluAsp: 3.72 ± 0.061
GluGlu: 5.272 ± 0.118
GluPhe: 3.217 ± 0.056
GluGly: 1.552 ± 0.046
GluHis: 1.102 ± 0.027
GluIle: 7.012 ± 0.081
GluLys: 9.213 ± 0.108
GluLeu: 4.653 ± 0.068
GluMet: 1.55 ± 0.033
GluAsn: 8.758 ± 0.113
GluPro: 1.211 ± 0.028
GluGln: 1.872 ± 0.04
GluArg: 2.207 ± 0.04
GluSer: 4.95 ± 0.072
GluThr: 3.348 ± 0.046
GluVal: 2.55 ± 0.042
GluTrp: 0.26 ± 0.013
GluTyr: 2.739 ± 0.048
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.849 ± 0.041
PheCys: 1.489 ± 0.029
PheAsp: 3.89 ± 0.051
PheGlu: 3.67 ± 0.056
PhePhe: 4.274 ± 0.068
PheGly: 1.98 ± 0.043
PheHis: 1.048 ± 0.025
PheIle: 5.034 ± 0.073
PheLys: 4.856 ± 0.065
PheLeu: 6.319 ± 0.087
PheMet: 1.229 ± 0.033
PheAsn: 4.411 ± 0.061
PhePro: 1.279 ± 0.028
PheGln: 1.658 ± 0.034
PheArg: 1.814 ± 0.041
PheSer: 4.792 ± 0.065
PheThr: 2.58 ± 0.046
PheVal: 3.167 ± 0.055
PheTrp: 0.35 ± 0.016
PheTyr: 3.22 ± 0.048
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.107 ± 0.043
GlyCys: 0.682 ± 0.024
GlyAsp: 1.642 ± 0.038
GlyGlu: 1.826 ± 0.046
GlyPhe: 1.911 ± 0.042
GlyGly: 1.209 ± 0.052
GlyHis: 0.593 ± 0.019
GlyIle: 2.785 ± 0.05
GlyLys: 3.103 ± 0.049
GlyLeu: 2.609 ± 0.046
GlyMet: 0.68 ± 0.026
GlyAsn: 2.632 ± 0.05
GlyPro: 0.591 ± 0.023
GlyGln: 0.747 ± 0.022
GlyArg: 1.092 ± 0.033
GlySer: 2.271 ± 0.056
GlyThr: 1.57 ± 0.039
GlyVal: 1.533 ± 0.035
GlyTrp: 0.156 ± 0.011
GlyTyr: 1.265 ± 0.031
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.595 ± 0.022
HisCys: 0.4 ± 0.017
HisAsp: 1.027 ± 0.028
HisGlu: 1.362 ± 0.033
HisPhe: 1.181 ± 0.028
HisGly: 0.658 ± 0.023
HisHis: 0.438 ± 0.017
HisIle: 1.786 ± 0.037
HisLys: 1.959 ± 0.035
HisLeu: 1.75 ± 0.033
HisMet: 0.357 ± 0.015
HisAsn: 1.696 ± 0.038
HisPro: 0.555 ± 0.02
HisGln: 0.637 ± 0.025
HisArg: 0.708 ± 0.022
HisSer: 1.487 ± 0.032
HisThr: 1.025 ± 0.028
HisVal: 0.898 ± 0.025
HisTrp: 0.08 ± 0.007
HisTyr: 0.735 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.917 ± 0.057
IleCys: 2.027 ± 0.041
IleAsp: 6.027 ± 0.078
IleGlu: 6.307 ± 0.075
IlePhe: 6.083 ± 0.079
IleGly: 2.739 ± 0.047
IleHis: 1.817 ± 0.039
IleIle: 7.525 ± 0.091
IleLys: 9.164 ± 0.095
IleLeu: 8.919 ± 0.095
IleMet: 1.594 ± 0.035
IleAsn: 8.492 ± 0.132
IlePro: 2.377 ± 0.043
IleGln: 3.04 ± 0.051
IleArg: 2.791 ± 0.057
IleSer: 7.644 ± 0.09
IleThr: 4.163 ± 0.063
IleVal: 4.26 ± 0.064
IleTrp: 0.381 ± 0.016
IleTyr: 4.286 ± 0.058
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.798 ± 0.054
LysCys: 1.805 ± 0.034
LysAsp: 5.494 ± 0.079
LysGlu: 7.101 ± 0.103
LysPhe: 4.803 ± 0.064
LysGly: 2.537 ± 0.046
LysHis: 2.147 ± 0.037
LysIle: 11.187 ± 0.104
LysLys: 12.826 ± 0.134
LysLeu: 7.618 ± 0.077
LysMet: 2.53 ± 0.043
LysAsn: 13.129 ± 0.142
LysPro: 2.417 ± 0.052
LysGln: 3.182 ± 0.044
LysArg: 3.636 ± 0.051
LysSer: 7.682 ± 0.09
LysThr: 5.855 ± 0.065
LysVal: 3.604 ± 0.056
LysTrp: 0.386 ± 0.017
LysTyr: 4.718 ± 0.059
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.795 ± 0.055
LeuCys: 1.93 ± 0.039
LeuAsp: 4.743 ± 0.062
LeuGlu: 5.803 ± 0.08
LeuPhe: 5.36 ± 0.067
LeuGly: 2.488 ± 0.05
LeuHis: 1.85 ± 0.039
LeuIle: 7.311 ± 0.08
LeuLys: 9.577 ± 0.102
LeuLeu: 7.847 ± 0.096
LeuMet: 1.731 ± 0.04
LeuAsn: 7.851 ± 0.096
LeuPro: 2.204 ± 0.041
LeuGln: 3.157 ± 0.056
LeuArg: 3.163 ± 0.055
LeuSer: 6.71 ± 0.078
LeuThr: 3.672 ± 0.065
LeuVal: 3.772 ± 0.061
LeuTrp: 0.434 ± 0.016
LeuTyr: 3.932 ± 0.065
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.712 ± 0.023
MetCys: 0.477 ± 0.015
MetAsp: 0.885 ± 0.024
MetGlu: 1.118 ± 0.029
MetPhe: 1.225 ± 0.032
MetGly: 0.578 ± 0.02
MetHis: 0.542 ± 0.019
MetIle: 1.724 ± 0.032
MetLys: 2.193 ± 0.041
MetLeu: 1.949 ± 0.037
MetMet: 0.458 ± 0.019
MetAsn: 1.74 ± 0.041
MetPro: 0.572 ± 0.024
MetGln: 0.833 ± 0.027
MetArg: 0.795 ± 0.023
MetSer: 1.39 ± 0.028
MetThr: 0.785 ± 0.022
MetVal: 0.929 ± 0.025
MetTrp: 0.123 ± 0.009
MetTyr: 0.816 ± 0.023
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.046 ± 0.065
AsnCys: 1.615 ± 0.037
AsnAsp: 5.725 ± 0.081
AsnGlu: 6.477 ± 0.082
AsnPhe: 5.374 ± 0.074
AsnGly: 2.661 ± 0.051
AsnHis: 1.798 ± 0.04
AsnIle: 10.641 ± 0.167
AsnLys: 9.253 ± 0.114
AsnLeu: 8.489 ± 0.084
AsnMet: 1.97 ± 0.037
AsnAsn: 11.613 ± 0.332
AsnPro: 2.313 ± 0.069
AsnGln: 3.312 ± 0.067
AsnArg: 2.804 ± 0.047
AsnSer: 8.378 ± 0.152
AsnThr: 6.112 ± 0.137
AsnVal: 4.402 ± 0.068
AsnTrp: 0.325 ± 0.014
AsnTyr: 3.682 ± 0.058
AsnXaa: 0.001 ± 0.001
Pro
ProAla: 0.869 ± 0.032
ProCys: 0.429 ± 0.021
ProAsp: 1.219 ± 0.032
ProGlu: 1.775 ± 0.048
ProPhe: 1.472 ± 0.037
ProGly: 0.812 ± 0.039
ProHis: 0.49 ± 0.019
ProIle: 2.091 ± 0.042
ProLys: 2.287 ± 0.043
ProLeu: 2.033 ± 0.047
ProMet: 0.418 ± 0.016
ProAsn: 2.047 ± 0.057
ProPro: 0.777 ± 0.08
ProGln: 0.973 ± 0.033
ProArg: 0.776 ± 0.023
ProSer: 1.869 ± 0.058
ProThr: 1.27 ± 0.034
ProVal: 1.292 ± 0.037
ProTrp: 0.091 ± 0.008
ProTyr: 0.881 ± 0.026
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.961 ± 0.034
GlnCys: 0.519 ± 0.017
GlnAsp: 1.416 ± 0.031
GlnGlu: 2.239 ± 0.04
GlnPhe: 1.481 ± 0.036
GlnGly: 0.736 ± 0.026
GlnHis: 0.616 ± 0.022
GlnIle: 3.176 ± 0.047
GlnLys: 4.263 ± 0.063
GlnLeu: 2.218 ± 0.042
GlnMet: 0.655 ± 0.022
GlnAsn: 4.105 ± 0.073
GlnPro: 0.827 ± 0.032
GlnGln: 1.249 ± 0.035
GlnArg: 1.134 ± 0.03
GlnSer: 2.449 ± 0.046
GlnThr: 1.67 ± 0.034
GlnVal: 1.134 ± 0.032
GlnTrp: 0.118 ± 0.009
GlnTyr: 1.271 ± 0.033
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.96 ± 0.027
ArgCys: 0.627 ± 0.024
ArgAsp: 1.63 ± 0.042
ArgGlu: 2.106 ± 0.045
ArgPhe: 1.794 ± 0.035
ArgGly: 1.013 ± 0.034
ArgHis: 0.62 ± 0.02
ArgIle: 3.096 ± 0.049
ArgLys: 4.249 ± 0.056
ArgLeu: 2.606 ± 0.047
ArgMet: 0.79 ± 0.024
ArgAsn: 3.346 ± 0.049
ArgPro: 0.747 ± 0.026
ArgGln: 0.95 ± 0.027
ArgArg: 1.625 ± 0.042
ArgSer: 2.333 ± 0.044
ArgThr: 1.47 ± 0.035
ArgVal: 1.412 ± 0.039
ArgTrp: 0.164 ± 0.01
ArgTyr: 1.391 ± 0.03
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.767 ± 0.052
SerCys: 1.395 ± 0.029
SerAsp: 4.909 ± 0.056
SerGlu: 5.503 ± 0.06
SerPhe: 4.33 ± 0.06
SerGly: 2.49 ± 0.045
SerHis: 1.518 ± 0.033
SerIle: 6.669 ± 0.08
SerLys: 7.956 ± 0.099
SerLeu: 6.496 ± 0.068
SerMet: 1.378 ± 0.03
SerAsn: 8.266 ± 0.162
SerPro: 1.77 ± 0.053
SerGln: 2.649 ± 0.049
SerArg: 2.393 ± 0.041
SerSer: 7.396 ± 0.133
SerThr: 4.129 ± 0.07
SerVal: 3.566 ± 0.055
SerTrp: 0.321 ± 0.016
SerTyr: 3.008 ± 0.046
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.717 ± 0.037
ThrCys: 0.877 ± 0.027
ThrAsp: 2.906 ± 0.05
ThrGlu: 3.561 ± 0.058
ThrPhe: 2.722 ± 0.043
ThrGly: 1.615 ± 0.032
ThrHis: 1.036 ± 0.029
ThrIle: 4.394 ± 0.057
ThrLys: 5.032 ± 0.061
ThrLeu: 3.998 ± 0.053
ThrMet: 0.812 ± 0.025
ThrAsn: 4.911 ± 0.097
ThrPro: 1.425 ± 0.049
ThrGln: 1.734 ± 0.037
ThrArg: 1.531 ± 0.032
ThrSer: 4.037 ± 0.072
ThrThr: 2.793 ± 0.054
ThrVal: 2.365 ± 0.048
ThrTrp: 0.193 ± 0.012
ThrTyr: 1.838 ± 0.038
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.587 ± 0.04
ValCys: 1.102 ± 0.033
ValAsp: 2.976 ± 0.046
ValGlu: 3.073 ± 0.051
ValPhe: 3.339 ± 0.046
ValGly: 1.52 ± 0.039
ValHis: 0.859 ± 0.022
ValIle: 3.581 ± 0.051
ValLys: 4.258 ± 0.062
ValLeu: 4.329 ± 0.056
ValMet: 0.738 ± 0.02
ValAsn: 3.562 ± 0.053
ValPro: 1.282 ± 0.033
ValGln: 1.395 ± 0.034
ValArg: 1.381 ± 0.032
ValSer: 3.466 ± 0.056
ValThr: 1.891 ± 0.043
ValVal: 2.451 ± 0.05
ValTrp: 0.223 ± 0.011
ValTyr: 2.084 ± 0.042
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.164 ± 0.012
TrpCys: 0.103 ± 0.009
TrpAsp: 0.248 ± 0.015
TrpGlu: 0.231 ± 0.013
TrpPhe: 0.322 ± 0.015
TrpGly: 0.171 ± 0.014
TrpHis: 0.098 ± 0.007
TrpIle: 0.369 ± 0.018
TrpLys: 0.39 ± 0.018
TrpLeu: 0.421 ± 0.016
TrpMet: 0.115 ± 0.009
TrpAsn: 0.31 ± 0.014
TrpPro: 0.111 ± 0.009
TrpGln: 0.131 ± 0.009
TrpArg: 0.171 ± 0.011
TrpSer: 0.32 ± 0.015
TrpThr: 0.182 ± 0.012
TrpVal: 0.229 ± 0.012
TrpTrp: 0.023 ± 0.004
TrpTyr: 0.205 ± 0.011
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.378 ± 0.031
TyrCys: 0.906 ± 0.027
TyrAsp: 2.476 ± 0.044
TyrGlu: 2.766 ± 0.046
TyrPhe: 2.937 ± 0.057
TyrGly: 1.392 ± 0.032
TyrHis: 0.87 ± 0.025
TyrIle: 3.913 ± 0.063
TyrLys: 4.254 ± 0.066
TyrLeu: 3.992 ± 0.058
TyrMet: 0.821 ± 0.024
TyrAsn: 3.845 ± 0.076
TyrPro: 0.911 ± 0.027
TyrGln: 1.29 ± 0.033
TyrArg: 1.412 ± 0.036
TyrSer: 3.14 ± 0.051
TyrThr: 2.026 ± 0.035
TyrVal: 1.897 ± 0.034
TyrTrp: 0.21 ± 0.01
TyrTyr: 1.798 ± 0.041
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.001 ± 0.001
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4189 proteins (1478472 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
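A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function name, the toy sequences, and the use of the sample standard deviation as the "±" value are assumptions, because the page does not describe its exact procedure.

```python
import random
import statistics

def bootstrap_dipeptide_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille,
    estimated by resampling whole proteins with replacement."""
    rng = random.Random(seed)
    # Per protein: (occurrences of the pair, number of dipeptide windows).
    per_protein = [
        (sum(1 for i in range(len(s) - 1) if s[i:i + 2] == pair), max(len(s) - 1, 0))
        for s in sequences
    ]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input (hypothetical one-letter sequences, not taken from the proteome)
toy = ["MKKNI", "MKN", "KKKK", "MAAL"]
mean_kk, sd_kk = bootstrap_dipeptide_error(toy, "KK")
print(f"KK: {mean_kk:.1f} ± {sd_kk:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs within a protein together, so the error reflects variation between proteins.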

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski