Amino acid dipeptide frequency for Micromonospora nigra

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
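As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs within each protein and pools them over all proteins. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter sequence encoding, and the toy sequences are assumptions made only for this example.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: dipeptides are overlapping windows of two residues and
    never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences in one-letter code (not taken from the proteome).
toy = ["MAAL", "AAG"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

A sequence of length n contributes n - 1 dipeptides under this convention, so very short sequences contribute little or nothing to the pooled counts.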

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If all 400 dipeptides occurred in proteins at random with equal probability, each would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
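The comparison against the uniform expectation can be sketched as follows. The page does not state the exact rule behind the red and blue marking, so the simple threshold at 1/400 used here is an assumption, and `classify` is a hypothetical helper name. The two example values are taken from the table.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"

# Two values copied from the table (per mille).
examples = {"AlaAla": 23.667, "CysCys": 0.093}
labels = {name: classify(value) for name, value in examples.items()}

for name, value in examples.items():
    ratio = value / EXPECTED_PER_MILLE
    print(f"{name}: {labels[name]}-represented, {ratio:.2f}x the uniform expectation")
```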

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
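For readers who want to work with fractions or percentages, the conversion is a one-liner. The helper names below are hypothetical. The count estimate at the end additionally assumes that every protein of length n contributes n - 1 dipeptides, which the page does not state explicitly.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 23.667  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))

# Rough count estimate under the stated assumption:
# total dipeptides = amino acids - proteins (figures from the page footer).
total_dipeptides = 1856018 - 5321
approx_count = per_mille_to_fraction(ala_ala) * total_dipeptides
print(round(approx_count))
```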

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, frequency ± error), grouped by first residue; second residues in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 23.667 ± 0.18
AlaCys: 1.033 ± 0.028
AlaAsp: 9.489 ± 0.098
AlaGlu: 8.417 ± 0.073
AlaPhe: 3.186 ± 0.05
AlaGly: 15.142 ± 0.119
AlaHis: 2.739 ± 0.043
AlaIle: 3.412 ± 0.052
AlaLys: 1.99 ± 0.048
AlaLeu: 14.588 ± 0.126
AlaMet: 2.323 ± 0.037
AlaAsn: 2.024 ± 0.042
AlaPro: 7.336 ± 0.082
AlaGln: 3.739 ± 0.05
AlaArg: 11.022 ± 0.096
AlaSer: 5.333 ± 0.061
AlaThr: 8.54 ± 0.08
AlaVal: 13.617 ± 0.117
AlaTrp: 1.943 ± 0.034
AlaTyr: 2.819 ± 0.041
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.011 ± 0.026
CysCys: 0.093 ± 0.007
CysAsp: 0.542 ± 0.019
CysGlu: 0.337 ± 0.014
CysPhe: 0.205 ± 0.011
CysGly: 0.92 ± 0.024
CysHis: 0.206 ± 0.012
CysIle: 0.127 ± 0.009
CysLys: 0.093 ± 0.007
CysLeu: 0.731 ± 0.02
CysMet: 0.093 ± 0.007
CysAsn: 0.143 ± 0.01
CysPro: 0.518 ± 0.019
CysGln: 0.19 ± 0.011
CysArg: 0.716 ± 0.02
CysSer: 0.388 ± 0.015
CysThr: 0.455 ± 0.016
CysVal: 0.634 ± 0.017
CysTrp: 0.136 ± 0.01
CysTyr: 0.156 ± 0.01
CysXaa: 0.0 ± 0.0

Asp
AspAla: 8.094 ± 0.08
AspCys: 0.43 ± 0.017
AspAsp: 4.058 ± 0.055
AspGlu: 3.79 ± 0.054
AspPhe: 1.405 ± 0.029
AspGly: 6.495 ± 0.077
AspHis: 1.423 ± 0.029
AspIle: 1.508 ± 0.032
AspLys: 0.793 ± 0.025
AspLeu: 7.077 ± 0.07
AspMet: 0.713 ± 0.02
AspAsn: 0.901 ± 0.028
AspPro: 5.283 ± 0.06
AspGln: 1.681 ± 0.029
AspArg: 6.017 ± 0.067
AspSer: 2.222 ± 0.038
AspThr: 3.167 ± 0.048
AspVal: 5.267 ± 0.063
AspTrp: 1.06 ± 0.025
AspTyr: 1.05 ± 0.025
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.522 ± 0.074
GluCys: 0.357 ± 0.014
GluAsp: 2.047 ± 0.038
GluGlu: 2.392 ± 0.049
GluPhe: 1.352 ± 0.029
GluGly: 3.218 ± 0.05
GluHis: 1.317 ± 0.031
GluIle: 2.019 ± 0.037
GluLys: 0.916 ± 0.028
GluLeu: 6.306 ± 0.067
GluMet: 0.806 ± 0.022
GluAsn: 0.776 ± 0.025
GluPro: 3.481 ± 0.049
GluGln: 2.2 ± 0.042
GluArg: 4.925 ± 0.06
GluSer: 2.074 ± 0.035
GluThr: 2.416 ± 0.037
GluVal: 4.805 ± 0.064
GluTrp: 0.693 ± 0.019
GluTyr: 0.988 ± 0.027
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.649 ± 0.045
PheCys: 0.239 ± 0.012
PheAsp: 1.983 ± 0.036
PheGlu: 1.098 ± 0.024
PhePhe: 0.779 ± 0.024
PheGly: 2.823 ± 0.042
PheHis: 0.58 ± 0.019
PheIle: 0.57 ± 0.018
PheLys: 0.344 ± 0.015
PheLeu: 2.378 ± 0.039
PheMet: 0.289 ± 0.012
PheAsn: 0.486 ± 0.018
PhePro: 1.288 ± 0.029
PheGln: 0.536 ± 0.018
PheArg: 1.803 ± 0.031
PheSer: 1.244 ± 0.026
PheThr: 1.843 ± 0.035
PheVal: 2.405 ± 0.036
PheTrp: 0.429 ± 0.015
PheTyr: 0.55 ± 0.02
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 10.908 ± 0.088
GlyCys: 0.837 ± 0.024
GlyAsp: 5.546 ± 0.061
GlyGlu: 4.673 ± 0.05
GlyPhe: 2.678 ± 0.04
GlyGly: 9.108 ± 0.103
GlyHis: 2.28 ± 0.044
GlyIle: 2.946 ± 0.051
GlyLys: 1.601 ± 0.034
GlyLeu: 9.399 ± 0.091
GlyMet: 1.846 ± 0.033
GlyAsn: 1.636 ± 0.041
GlyPro: 5.587 ± 0.076
GlyGln: 2.883 ± 0.045
GlyArg: 8.688 ± 0.079
GlySer: 4.757 ± 0.065
GlyThr: 6.043 ± 0.079
GlyVal: 8.57 ± 0.096
GlyTrp: 2.03 ± 0.035
GlyTyr: 2.332 ± 0.039
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.631 ± 0.039
HisCys: 0.188 ± 0.01
HisAsp: 1.413 ± 0.028
HisGlu: 1.023 ± 0.029
HisPhe: 0.506 ± 0.017
HisGly: 2.2 ± 0.041
HisHis: 0.689 ± 0.024
HisIle: 0.495 ± 0.015
HisLys: 0.239 ± 0.012
HisLeu: 2.673 ± 0.042
HisMet: 0.256 ± 0.012
HisAsn: 0.341 ± 0.014
HisPro: 1.874 ± 0.04
HisGln: 0.647 ± 0.021
HisArg: 2.339 ± 0.039
HisSer: 0.937 ± 0.022
HisThr: 1.22 ± 0.029
HisVal: 1.803 ± 0.033
HisTrp: 0.341 ± 0.016
HisTyr: 0.412 ± 0.015
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.114 ± 0.052
IleCys: 0.274 ± 0.012
IleAsp: 2.114 ± 0.038
IleGlu: 1.659 ± 0.037
IlePhe: 0.724 ± 0.021
IleGly: 3.085 ± 0.036
IleHis: 0.485 ± 0.017
IleIle: 0.798 ± 0.024
IleLys: 0.533 ± 0.018
IleLeu: 2.239 ± 0.044
IleMet: 0.384 ± 0.016
IleAsn: 0.647 ± 0.019
IlePro: 1.491 ± 0.028
IleGln: 0.624 ± 0.02
IleArg: 2.239 ± 0.036
IleSer: 1.564 ± 0.03
IleThr: 1.925 ± 0.033
IleVal: 2.536 ± 0.04
IleTrp: 0.356 ± 0.015
IleTyr: 0.501 ± 0.018
IleXaa: 0.0 ± 0.0

Lys
LysAla: 1.905 ± 0.047
LysCys: 0.093 ± 0.007
LysAsp: 0.696 ± 0.024
LysGlu: 0.669 ± 0.025
LysPhe: 0.314 ± 0.012
LysGly: 1.119 ± 0.029
LysHis: 0.275 ± 0.014
LysIle: 0.659 ± 0.022
LysLys: 0.422 ± 0.021
LysLeu: 1.554 ± 0.033
LysMet: 0.272 ± 0.015
LysAsn: 0.286 ± 0.013
LysPro: 0.978 ± 0.029
LysGln: 0.494 ± 0.018
LysArg: 1.135 ± 0.031
LysSer: 0.77 ± 0.022
LysThr: 0.895 ± 0.025
LysVal: 1.413 ± 0.03
LysTrp: 0.193 ± 0.012
LysTyr: 0.295 ± 0.013
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 16.994 ± 0.142
LeuCys: 0.748 ± 0.021
LeuAsp: 7.122 ± 0.076
LeuGlu: 3.676 ± 0.047
LeuPhe: 2.526 ± 0.04
LeuGly: 9.156 ± 0.085
LeuHis: 2.396 ± 0.044
LeuIle: 2.773 ± 0.044
LeuLys: 1.24 ± 0.029
LeuLeu: 11.7 ± 0.126
LeuMet: 1.305 ± 0.031
LeuAsn: 1.508 ± 0.033
LeuPro: 7.019 ± 0.07
LeuGln: 1.708 ± 0.034
LeuArg: 9.597 ± 0.085
LeuSer: 4.556 ± 0.056
LeuThr: 7.295 ± 0.07
LeuVal: 10.372 ± 0.104
LeuTrp: 1.316 ± 0.029
LeuTyr: 1.582 ± 0.032
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.041 ± 0.039
MetCys: 0.105 ± 0.008
MetAsp: 0.688 ± 0.017
MetGlu: 0.552 ± 0.018
MetPhe: 0.419 ± 0.015
MetGly: 1.058 ± 0.024
MetHis: 0.273 ± 0.01
MetIle: 0.585 ± 0.019
MetLys: 0.307 ± 0.014
MetLeu: 1.629 ± 0.028
MetMet: 0.241 ± 0.012
MetAsn: 0.317 ± 0.014
MetPro: 1.007 ± 0.024
MetGln: 0.4 ± 0.014
MetArg: 1.359 ± 0.026
MetSer: 1.146 ± 0.025
MetThr: 1.53 ± 0.034
MetVal: 1.28 ± 0.027
MetTrp: 0.193 ± 0.011
MetTyr: 0.261 ± 0.012
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.017 ± 0.036
AsnCys: 0.154 ± 0.01
AsnAsp: 0.902 ± 0.026
AsnGlu: 0.681 ± 0.022
AsnPhe: 0.469 ± 0.017
AsnGly: 1.726 ± 0.039
AsnHis: 0.371 ± 0.015
AsnIle: 0.529 ± 0.016
AsnLys: 0.257 ± 0.015
AsnLeu: 1.772 ± 0.032
AsnMet: 0.244 ± 0.01
AsnAsn: 0.413 ± 0.019
AsnPro: 1.424 ± 0.038
AsnGln: 0.536 ± 0.02
AsnArg: 1.406 ± 0.032
AsnSer: 0.853 ± 0.024
AsnThr: 0.978 ± 0.028
AsnVal: 1.331 ± 0.032
AsnTrp: 0.299 ± 0.012
AsnTyr: 0.39 ± 0.015
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 10.655 ± 0.105
ProCys: 0.341 ± 0.014
ProAsp: 4.896 ± 0.056
ProGlu: 3.732 ± 0.05
ProPhe: 1.342 ± 0.027
ProGly: 6.822 ± 0.074
ProHis: 1.36 ± 0.026
ProIle: 1.403 ± 0.031
ProLys: 0.894 ± 0.028
ProLeu: 5.328 ± 0.063
ProMet: 0.941 ± 0.026
ProAsn: 0.994 ± 0.023
ProPro: 4.64 ± 0.084
ProGln: 1.727 ± 0.035
ProArg: 4.275 ± 0.053
ProSer: 2.952 ± 0.046
ProThr: 4.427 ± 0.069
ProVal: 6.375 ± 0.071
ProTrp: 0.995 ± 0.024
ProTyr: 1.206 ± 0.025
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.735 ± 0.058
GlnCys: 0.152 ± 0.01
GlnAsp: 1.093 ± 0.026
GlnGlu: 1.137 ± 0.026
GlnPhe: 0.688 ± 0.023
GlnGly: 1.948 ± 0.033
GlnHis: 0.599 ± 0.016
GlnIle: 0.959 ± 0.026
GlnLys: 0.373 ± 0.015
GlnLeu: 3.138 ± 0.042
GlnMet: 0.437 ± 0.016
GlnAsn: 0.433 ± 0.017
GlnPro: 1.953 ± 0.036
GlnGln: 1.173 ± 0.029
GlnArg: 2.862 ± 0.043
GlnSer: 1.077 ± 0.024
GlnThr: 1.297 ± 0.031
GlnVal: 2.94 ± 0.045
GlnTrp: 0.505 ± 0.015
GlnTyr: 0.482 ± 0.019
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 10.963 ± 0.101
ArgCys: 0.703 ± 0.024
ArgAsp: 4.83 ± 0.049
ArgGlu: 4.343 ± 0.051
ArgPhe: 2.385 ± 0.04
ArgGly: 6.157 ± 0.068
ArgHis: 2.464 ± 0.04
ArgIle: 3.01 ± 0.04
ArgLys: 1.189 ± 0.033
ArgLeu: 9.874 ± 0.086
ArgMet: 1.826 ± 0.032
ArgAsn: 1.448 ± 0.029
ArgPro: 5.919 ± 0.08
ArgGln: 2.858 ± 0.04
ArgArg: 9.367 ± 0.099
ArgSer: 4.089 ± 0.043
ArgThr: 5.032 ± 0.054
ArgVal: 7.0 ± 0.061
ArgTrp: 1.787 ± 0.041
ArgTyr: 1.944 ± 0.036
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.135 ± 0.066
SerCys: 0.385 ± 0.016
SerAsp: 2.331 ± 0.035
SerGlu: 1.825 ± 0.032
SerPhe: 1.36 ± 0.03
SerGly: 5.318 ± 0.065
SerHis: 0.907 ± 0.017
SerIle: 1.337 ± 0.026
SerLys: 0.646 ± 0.021
SerLeu: 4.168 ± 0.056
SerMet: 0.905 ± 0.024
SerAsn: 0.844 ± 0.027
SerPro: 3.09 ± 0.045
SerGln: 1.069 ± 0.028
SerArg: 3.609 ± 0.049
SerSer: 2.366 ± 0.05
SerThr: 3.07 ± 0.045
SerVal: 3.998 ± 0.054
SerTrp: 0.867 ± 0.023
SerTyr: 1.151 ± 0.026
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 9.069 ± 0.084
ThrCys: 0.467 ± 0.016
ThrAsp: 3.942 ± 0.05
ThrGlu: 2.891 ± 0.042
ThrPhe: 1.547 ± 0.029
ThrGly: 7.097 ± 0.064
ThrHis: 1.109 ± 0.025
ThrIle: 1.783 ± 0.036
ThrLys: 0.827 ± 0.025
ThrLeu: 5.888 ± 0.055
ThrMet: 0.928 ± 0.025
ThrAsn: 1.082 ± 0.028
ThrPro: 4.464 ± 0.066
ThrGln: 1.247 ± 0.027
ThrArg: 4.424 ± 0.053
ThrSer: 2.957 ± 0.047
ThrThr: 4.065 ± 0.073
ThrVal: 6.65 ± 0.061
ThrTrp: 0.961 ± 0.023
ThrTyr: 1.283 ± 0.034
ThrXaa: 0.0 ± 0.0

Val
ValAla: 13.575 ± 0.127
ValCys: 0.758 ± 0.024
ValAsp: 6.661 ± 0.069
ValGlu: 5.104 ± 0.063
ValPhe: 2.387 ± 0.046
ValGly: 7.929 ± 0.075
ValHis: 1.898 ± 0.032
ValIle: 2.539 ± 0.039
ValLys: 1.274 ± 0.033
ValLeu: 9.969 ± 0.1
ValMet: 1.162 ± 0.024
ValAsn: 1.696 ± 0.033
ValPro: 5.787 ± 0.055
ValGln: 1.969 ± 0.031
ValArg: 7.781 ± 0.071
ValSer: 4.237 ± 0.056
ValThr: 6.391 ± 0.065
ValVal: 9.693 ± 0.114
ValTrp: 1.155 ± 0.027
ValTyr: 1.433 ± 0.027
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.763 ± 0.035
TrpCys: 0.172 ± 0.01
TrpAsp: 0.74 ± 0.021
TrpGlu: 0.642 ± 0.019
TrpPhe: 0.499 ± 0.017
TrpGly: 1.02 ± 0.028
TrpHis: 0.427 ± 0.018
TrpIle: 0.491 ± 0.017
TrpLys: 0.243 ± 0.013
TrpLeu: 1.99 ± 0.033
TrpMet: 0.248 ± 0.011
TrpAsn: 0.408 ± 0.017
TrpPro: 0.991 ± 0.024
TrpGln: 0.691 ± 0.021
TrpArg: 1.723 ± 0.034
TrpSer: 1.037 ± 0.025
TrpThr: 1.026 ± 0.021
TrpVal: 1.101 ± 0.023
TrpTrp: 0.41 ± 0.016
TrpTyr: 0.385 ± 0.014
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.911 ± 0.043
TyrCys: 0.162 ± 0.011
TyrAsp: 1.342 ± 0.029
TyrGlu: 0.972 ± 0.026
TyrPhe: 0.541 ± 0.018
TyrGly: 2.005 ± 0.039
TyrHis: 0.428 ± 0.015
TyrIle: 0.348 ± 0.016
TyrLys: 0.257 ± 0.014
TyrLeu: 2.188 ± 0.037
TyrMet: 0.172 ± 0.01
TyrAsn: 0.369 ± 0.017
TyrPro: 1.176 ± 0.028
TyrGln: 0.61 ± 0.017
TyrArg: 1.902 ± 0.04
TyrSer: 0.821 ± 0.024
TyrThr: 1.067 ± 0.028
TyrVal: 1.609 ± 0.028
TyrTrp: 0.34 ± 0.013
TyrTyr: 0.388 ± 0.016
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 5321 proteins (1856018 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
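A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The page does not say which spread statistic is used, so taking the standard deviation is an assumption here, as are the helper names `pair_counts` and `bootstrap_error`, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over
    n_boot resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)  # fixed seed only for reproducibility
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

# Hypothetical toy sequences in one-letter code (not taken from the proteome).
toy = ["MAAL", "AAG", "GAAV", "MKL"]
print(bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.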

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski