Amino acid dipepetide frequency for Microcystis aeruginosa (strain NIES-843)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.335AlaAla: 6.335 ± 0.097
0.705AlaCys: 0.705 ± 0.023
3.594AlaAsp: 3.594 ± 0.053
4.61AlaGlu: 4.61 ± 0.06
2.794AlaPhe: 2.794 ± 0.046
4.888AlaGly: 4.888 ± 0.078
1.116AlaHis: 1.116 ± 0.024
6.625AlaIle: 6.625 ± 0.079
4.335AlaLys: 4.335 ± 0.065
7.988AlaLeu: 7.988 ± 0.099
1.447AlaMet: 1.447 ± 0.029
3.209AlaAsn: 3.209 ± 0.061
2.487AlaPro: 2.487 ± 0.053
3.297AlaGln: 3.297 ± 0.047
3.528AlaArg: 3.528 ± 0.057
4.065AlaSer: 4.065 ± 0.054
4.158AlaThr: 4.158 ± 0.079
4.947AlaVal: 4.947 ± 0.066
1.022AlaTrp: 1.022 ± 0.031
2.314AlaTyr: 2.314 ± 0.046
0.0AlaXaa: 0.0 ± 0.0
Cys
0.606CysAla: 0.606 ± 0.023
0.167CysCys: 0.167 ± 0.011
0.508CysAsp: 0.508 ± 0.019
0.521CysGlu: 0.521 ± 0.018
0.507CysPhe: 0.507 ± 0.017
0.892CysGly: 0.892 ± 0.027
0.323CysHis: 0.323 ± 0.02
0.492CysIle: 0.492 ± 0.02
0.328CysLys: 0.328 ± 0.017
1.421CysLeu: 1.421 ± 0.036
0.168CysMet: 0.168 ± 0.01
0.315CysAsn: 0.315 ± 0.013
0.634CysPro: 0.634 ± 0.02
0.87CysGln: 0.87 ± 0.025
0.676CysArg: 0.676 ± 0.021
0.704CysSer: 0.704 ± 0.023
0.404CysThr: 0.404 ± 0.015
0.611CysVal: 0.611 ± 0.023
0.129CysTrp: 0.129 ± 0.008
0.42CysTyr: 0.42 ± 0.024
0.0CysXaa: 0.0 ± 0.0
Asp
2.855AspAla: 2.855 ± 0.045
0.592AspCys: 0.592 ± 0.021
2.117AspAsp: 2.117 ± 0.047
2.778AspGlu: 2.778 ± 0.047
2.551AspPhe: 2.551 ± 0.04
3.063AspGly: 3.063 ± 0.053
0.923AspHis: 0.923 ± 0.027
3.328AspIle: 3.328 ± 0.053
2.684AspLys: 2.684 ± 0.044
5.884AspLeu: 5.884 ± 0.065
0.658AspMet: 0.658 ± 0.021
2.518AspAsn: 2.518 ± 0.045
2.184AspPro: 2.184 ± 0.04
1.786AspGln: 1.786 ± 0.031
3.395AspArg: 3.395 ± 0.055
3.071AspSer: 3.071 ± 0.049
2.254AspThr: 2.254 ± 0.058
2.479AspVal: 2.479 ± 0.043
0.993AspTrp: 0.993 ± 0.028
2.177AspTyr: 2.177 ± 0.041
0.0AspXaa: 0.0 ± 0.0
Glu
4.853GluAla: 4.853 ± 0.075
0.454GluCys: 0.454 ± 0.018
2.921GluAsp: 2.921 ± 0.046
4.706GluGlu: 4.706 ± 0.075
2.357GluPhe: 2.357 ± 0.045
3.396GluGly: 3.396 ± 0.055
0.886GluHis: 0.886 ± 0.026
5.862GluIle: 5.862 ± 0.076
5.045GluLys: 5.045 ± 0.066
6.722GluLeu: 6.722 ± 0.078
1.279GluMet: 1.279 ± 0.029
3.303GluAsn: 3.303 ± 0.048
2.208GluPro: 2.208 ± 0.045
3.361GluGln: 3.361 ± 0.049
3.819GluArg: 3.819 ± 0.058
3.718GluSer: 3.718 ± 0.054
3.84GluThr: 3.84 ± 0.059
3.925GluVal: 3.925 ± 0.054
0.751GluTrp: 0.751 ± 0.026
1.911GluTyr: 1.911 ± 0.036
0.0GluXaa: 0.0 ± 0.0
Phe
2.888PheAla: 2.888 ± 0.046
0.641PheCys: 0.641 ± 0.02
2.187PheAsp: 2.187 ± 0.039
2.192PheGlu: 2.192 ± 0.046
1.997PhePhe: 1.997 ± 0.041
2.686PheGly: 2.686 ± 0.044
0.76PheHis: 0.76 ± 0.023
2.503PheIle: 2.503 ± 0.042
1.77PheLys: 1.77 ± 0.037
4.824PheLeu: 4.824 ± 0.061
0.704PheMet: 0.704 ± 0.024
1.888PheAsn: 1.888 ± 0.037
2.113PhePro: 2.113 ± 0.042
1.993PheGln: 1.993 ± 0.04
2.035PheArg: 2.035 ± 0.038
3.255PheSer: 3.255 ± 0.054
2.191PheThr: 2.191 ± 0.039
2.311PheVal: 2.311 ± 0.047
0.776PheTrp: 0.776 ± 0.024
1.596PheTyr: 1.596 ± 0.037
0.0PheXaa: 0.0 ± 0.0
Gly
4.078GlyAla: 4.078 ± 0.065
0.846GlyCys: 0.846 ± 0.023
3.359GlyAsp: 3.359 ± 0.054
4.313GlyGlu: 4.313 ± 0.054
3.007GlyPhe: 3.007 ± 0.051
4.667GlyGly: 4.667 ± 0.078
1.164GlyHis: 1.164 ± 0.034
5.278GlyIle: 5.278 ± 0.071
4.32GlyLys: 4.32 ± 0.065
6.789GlyLeu: 6.789 ± 0.081
1.354GlyMet: 1.354 ± 0.033
2.968GlyAsn: 2.968 ± 0.065
1.172GlyPro: 1.172 ± 0.031
3.107GlyGln: 3.107 ± 0.047
3.243GlyArg: 3.243 ± 0.051
3.892GlySer: 3.892 ± 0.066
3.575GlyThr: 3.575 ± 0.087
4.379GlyVal: 4.379 ± 0.06
1.027GlyTrp: 1.027 ± 0.028
2.399GlyTyr: 2.399 ± 0.043
0.0GlyXaa: 0.0 ± 0.0
His
0.921HisAla: 0.921 ± 0.029
0.303HisCys: 0.303 ± 0.015
0.749HisAsp: 0.749 ± 0.021
0.762HisGlu: 0.762 ± 0.023
0.882HisPhe: 0.882 ± 0.023
1.015HisGly: 1.015 ± 0.028
0.559HisHis: 0.559 ± 0.025
1.049HisIle: 1.049 ± 0.025
0.684HisLys: 0.684 ± 0.023
2.559HisLeu: 2.559 ± 0.049
0.135HisMet: 0.135 ± 0.009
0.654HisAsn: 0.654 ± 0.021
1.428HisPro: 1.428 ± 0.034
1.145HisGln: 1.145 ± 0.03
1.136HisArg: 1.136 ± 0.026
1.135HisSer: 1.135 ± 0.027
0.801HisThr: 0.801 ± 0.026
0.606HisVal: 0.606 ± 0.02
0.345HisTrp: 0.345 ± 0.017
0.662HisTyr: 0.662 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
6.719IleAla: 6.719 ± 0.066
0.819IleCys: 0.819 ± 0.029
4.14IleAsp: 4.14 ± 0.058
5.119IleGlu: 5.119 ± 0.067
2.961IlePhe: 2.961 ± 0.052
4.585IleGly: 4.585 ± 0.063
1.163IleHis: 1.163 ± 0.028
5.116IleIle: 5.116 ± 0.069
3.667IleLys: 3.667 ± 0.058
7.53IleLeu: 7.53 ± 0.071
0.966IleMet: 0.966 ± 0.025
3.456IleAsn: 3.456 ± 0.059
3.835IlePro: 3.835 ± 0.056
2.536IleGln: 2.536 ± 0.045
3.298IleArg: 3.298 ± 0.051
4.877IleSer: 4.877 ± 0.056
4.062IleThr: 4.062 ± 0.056
4.205IleVal: 4.205 ± 0.063
0.856IleTrp: 0.856 ± 0.026
2.34IleTyr: 2.34 ± 0.041
0.0IleXaa: 0.0 ± 0.0
Lys
4.154LysAla: 4.154 ± 0.061
0.479LysCys: 0.479 ± 0.021
2.427LysAsp: 2.427 ± 0.048
3.618LysGlu: 3.618 ± 0.066
1.823LysPhe: 1.823 ± 0.034
3.048LysGly: 3.048 ± 0.051
0.81LysHis: 0.81 ± 0.024
4.614LysIle: 4.614 ± 0.063
3.625LysLys: 3.625 ± 0.078
5.894LysLeu: 5.894 ± 0.073
1.112LysMet: 1.112 ± 0.025
3.017LysAsn: 3.017 ± 0.055
2.42LysPro: 2.42 ± 0.044
2.742LysGln: 2.742 ± 0.056
2.845LysArg: 2.845 ± 0.055
3.585LysSer: 3.585 ± 0.05
3.378LysThr: 3.378 ± 0.056
3.323LysVal: 3.323 ± 0.048
0.534LysTrp: 0.534 ± 0.019
1.674LysTyr: 1.674 ± 0.036
0.0LysXaa: 0.0 ± 0.0
Leu
9.612LeuAla: 9.612 ± 0.11
1.016LeuCys: 1.016 ± 0.03
5.587LeuAsp: 5.587 ± 0.068
8.331LeuGlu: 8.331 ± 0.096
3.958LeuPhe: 3.958 ± 0.058
8.039LeuGly: 8.039 ± 0.101
1.716LeuHis: 1.716 ± 0.04
7.726LeuIle: 7.726 ± 0.072
6.687LeuLys: 6.687 ± 0.087
11.173LeuLeu: 11.173 ± 0.114
1.968LeuMet: 1.968 ± 0.038
4.89LeuAsn: 4.89 ± 0.069
5.735LeuPro: 5.735 ± 0.081
5.044LeuGln: 5.044 ± 0.07
5.344LeuArg: 5.344 ± 0.07
7.947LeuSer: 7.947 ± 0.089
6.858LeuThr: 6.858 ± 0.083
6.751LeuVal: 6.751 ± 0.067
1.518LeuTrp: 1.518 ± 0.04
2.911LeuTyr: 2.911 ± 0.049
0.0LeuXaa: 0.0 ± 0.0
Met
1.698MetAla: 1.698 ± 0.031
0.118MetCys: 0.118 ± 0.008
0.779MetAsp: 0.779 ± 0.023
0.969MetGlu: 0.969 ± 0.024
0.524MetPhe: 0.524 ± 0.018
1.348MetGly: 1.348 ± 0.033
0.169MetHis: 0.169 ± 0.011
1.271MetIle: 1.271 ± 0.027
0.993MetLys: 0.993 ± 0.025
1.733MetLeu: 1.733 ± 0.04
0.348MetMet: 0.348 ± 0.018
0.805MetAsn: 0.805 ± 0.025
0.807MetPro: 0.807 ± 0.025
0.666MetGln: 0.666 ± 0.022
0.929MetArg: 0.929 ± 0.025
1.223MetSer: 1.223 ± 0.028
1.355MetThr: 1.355 ± 0.032
1.255MetVal: 1.255 ± 0.028
0.158MetTrp: 0.158 ± 0.01
0.329MetTyr: 0.329 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
2.718AsnAla: 2.718 ± 0.045
0.645AsnCys: 0.645 ± 0.023
1.78AsnAsp: 1.78 ± 0.05
1.866AsnGlu: 1.866 ± 0.039
2.284AsnPhe: 2.284 ± 0.042
2.419AsnGly: 2.419 ± 0.059
0.953AsnHis: 0.953 ± 0.029
2.79AsnIle: 2.79 ± 0.049
1.825AsnLys: 1.825 ± 0.037
6.184AsnLeu: 6.184 ± 0.08
0.688AsnMet: 0.688 ± 0.025
2.312AsnAsn: 2.312 ± 0.055
3.314AsnPro: 3.314 ± 0.054
2.981AsnGln: 2.981 ± 0.056
2.647AsnArg: 2.647 ± 0.05
3.09AsnSer: 3.09 ± 0.063
2.039AsnThr: 2.039 ± 0.043
1.944AsnVal: 1.944 ± 0.036
0.85AsnTrp: 0.85 ± 0.026
1.937AsnTyr: 1.937 ± 0.039
0.0AsnXaa: 0.0 ± 0.0
Pro
2.914ProAla: 2.914 ± 0.046
0.424ProCys: 0.424 ± 0.017
2.973ProAsp: 2.973 ± 0.047
3.744ProGlu: 3.744 ± 0.061
1.778ProPhe: 1.778 ± 0.039
2.297ProGly: 2.297 ± 0.051
1.108ProHis: 1.108 ± 0.039
3.222ProIle: 3.222 ± 0.053
2.336ProLys: 2.336 ± 0.041
5.158ProLeu: 5.158 ± 0.066
0.807ProMet: 0.807 ± 0.027
2.201ProAsn: 2.201 ± 0.039
2.453ProPro: 2.453 ± 0.053
2.542ProGln: 2.542 ± 0.042
2.009ProArg: 2.009 ± 0.042
3.16ProSer: 3.16 ± 0.052
2.682ProThr: 2.682 ± 0.06
2.931ProVal: 2.931 ± 0.05
0.572ProTrp: 0.572 ± 0.02
1.412ProTyr: 1.412 ± 0.032
0.0ProXaa: 0.0 ± 0.0
Gln
4.026GlnAla: 4.026 ± 0.055
0.332GlnCys: 0.332 ± 0.015
2.01GlnAsp: 2.01 ± 0.037
4.157GlnGlu: 4.157 ± 0.062
1.82GlnPhe: 1.82 ± 0.038
3.432GlnGly: 3.432 ± 0.051
0.606GlnHis: 0.606 ± 0.023
3.581GlnIle: 3.581 ± 0.049
3.498GlnLys: 3.498 ± 0.069
5.707GlnLeu: 5.707 ± 0.072
0.923GlnMet: 0.923 ± 0.022
1.899GlnAsn: 1.899 ± 0.034
2.107GlnPro: 2.107 ± 0.042
3.042GlnGln: 3.042 ± 0.058
2.783GlnArg: 2.783 ± 0.05
2.849GlnSer: 2.849 ± 0.046
2.514GlnThr: 2.514 ± 0.048
3.303GlnVal: 3.303 ± 0.051
0.747GlnTrp: 0.747 ± 0.026
1.264GlnTyr: 1.264 ± 0.029
0.0GlnXaa: 0.0 ± 0.0
Arg
3.037ArgAla: 3.037 ± 0.059
0.58ArgCys: 0.58 ± 0.021
2.451ArgAsp: 2.451 ± 0.044
4.009ArgGlu: 4.009 ± 0.061
2.475ArgPhe: 2.475 ± 0.038
3.466ArgGly: 3.466 ± 0.055
1.032ArgHis: 1.032 ± 0.027
3.441ArgIle: 3.441 ± 0.055
2.608ArgLys: 2.608 ± 0.05
6.334ArgLeu: 6.334 ± 0.071
0.905ArgMet: 0.905 ± 0.027
2.078ArgAsn: 2.078 ± 0.041
2.163ArgPro: 2.163 ± 0.045
3.653ArgGln: 3.653 ± 0.052
3.112ArgArg: 3.112 ± 0.065
3.034ArgSer: 3.034 ± 0.045
2.294ArgThr: 2.294 ± 0.039
3.304ArgVal: 3.304 ± 0.049
1.001ArgTrp: 1.001 ± 0.031
1.871ArgTyr: 1.871 ± 0.033
0.0ArgXaa: 0.0 ± 0.0
Ser
4.043SerAla: 4.043 ± 0.06
0.799SerCys: 0.799 ± 0.025
3.175SerAsp: 3.175 ± 0.052
3.663SerGlu: 3.663 ± 0.054
2.622SerPhe: 2.622 ± 0.041
4.343SerGly: 4.343 ± 0.07
1.297SerHis: 1.297 ± 0.03
3.968SerIle: 3.968 ± 0.056
2.883SerLys: 2.883 ± 0.047
8.632SerLeu: 8.632 ± 0.092
1.216SerMet: 1.216 ± 0.025
2.615SerAsn: 2.615 ± 0.051
3.923SerPro: 3.923 ± 0.063
3.761SerGln: 3.761 ± 0.055
3.384SerArg: 3.384 ± 0.049
4.429SerSer: 4.429 ± 0.078
2.896SerThr: 2.896 ± 0.054
3.67SerVal: 3.67 ± 0.056
1.01SerTrp: 1.01 ± 0.028
2.06SerTyr: 2.06 ± 0.045
0.0SerXaa: 0.0 ± 0.0
Thr
4.623ThrAla: 4.623 ± 0.083
0.473ThrCys: 0.473 ± 0.021
2.594ThrAsp: 2.594 ± 0.046
3.361ThrGlu: 3.361 ± 0.052
2.082ThrPhe: 2.082 ± 0.04
4.263ThrGly: 4.263 ± 0.08
0.913ThrHis: 0.913 ± 0.025
4.156ThrIle: 4.156 ± 0.075
2.223ThrLys: 2.223 ± 0.037
6.158ThrLeu: 6.158 ± 0.088
0.82ThrMet: 0.82 ± 0.025
2.116ThrAsn: 2.116 ± 0.041
3.178ThrPro: 3.178 ± 0.057
2.148ThrGln: 2.148 ± 0.043
2.457ThrArg: 2.457 ± 0.042
3.212ThrSer: 3.212 ± 0.056
3.011ThrThr: 3.011 ± 0.063
3.909ThrVal: 3.909 ± 0.059
0.675ThrTrp: 0.675 ± 0.018
1.647ThrTyr: 1.647 ± 0.032
0.0ThrXaa: 0.0 ± 0.0
Val
4.769ValAla: 4.769 ± 0.062
0.661ValCys: 0.661 ± 0.022
2.811ValAsp: 2.811 ± 0.047
4.007ValGlu: 4.007 ± 0.058
2.673ValPhe: 2.673 ± 0.044
3.97ValGly: 3.97 ± 0.052
0.884ValHis: 0.884 ± 0.028
4.618ValIle: 4.618 ± 0.056
3.537ValLys: 3.537 ± 0.048
5.724ValLeu: 5.724 ± 0.066
1.301ValMet: 1.301 ± 0.032
2.972ValAsn: 2.972 ± 0.057
2.42ValPro: 2.42 ± 0.047
2.165ValGln: 2.165 ± 0.039
3.068ValArg: 3.068 ± 0.051
4.101ValSer: 4.101 ± 0.061
3.545ValThr: 3.545 ± 0.056
3.792ValVal: 3.792 ± 0.059
0.866ValTrp: 0.866 ± 0.026
1.877ValTyr: 1.877 ± 0.033
0.0ValXaa: 0.0 ± 0.0
Trp
0.766TrpAla: 0.766 ± 0.024
0.175TrpCys: 0.175 ± 0.01
0.623TrpAsp: 0.623 ± 0.025
0.86TrpGlu: 0.86 ± 0.026
0.643TrpPhe: 0.643 ± 0.021
1.092TrpGly: 1.092 ± 0.028
0.351TrpHis: 0.351 ± 0.015
0.845TrpIle: 0.845 ± 0.023
0.696TrpLys: 0.696 ± 0.024
2.171TrpLeu: 2.171 ± 0.043
0.282TrpMet: 0.282 ± 0.014
0.565TrpAsn: 0.565 ± 0.022
0.318TrpPro: 0.318 ± 0.016
1.327TrpGln: 1.327 ± 0.034
0.876TrpArg: 0.876 ± 0.021
0.899TrpSer: 0.899 ± 0.027
0.643TrpThr: 0.643 ± 0.021
0.756TrpVal: 0.756 ± 0.023
0.212TrpTrp: 0.212 ± 0.012
0.519TrpTyr: 0.519 ± 0.023
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.85TyrAla: 1.85 ± 0.038
0.469TyrCys: 0.469 ± 0.018
1.503TyrAsp: 1.503 ± 0.034
1.729TyrGlu: 1.729 ± 0.038
1.572TyrPhe: 1.572 ± 0.033
2.031TyrGly: 2.031 ± 0.034
0.798TyrHis: 0.798 ± 0.027
1.766TyrIle: 1.766 ± 0.036
1.379TyrLys: 1.379 ± 0.029
4.226TyrLeu: 4.226 ± 0.062
0.396TyrMet: 0.396 ± 0.014
1.484TyrAsn: 1.484 ± 0.039
1.822TyrPro: 1.822 ± 0.037
2.434TyrGln: 2.434 ± 0.047
2.265TyrArg: 2.265 ± 0.036
2.14TyrSer: 2.14 ± 0.041
1.518TyrThr: 1.518 ± 0.032
1.36TyrVal: 1.36 ± 0.032
0.58TyrTrp: 0.58 ± 0.02
1.253TyrTyr: 1.253 ± 0.032
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5981 proteins (1510946 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski