Amino acid dipepetide frequency for Micromonospora olivasterospora

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
23.957AlaAla: 23.957 ± 0.19
0.985AlaCys: 0.985 ± 0.026
8.891AlaAsp: 8.891 ± 0.071
8.58AlaGlu: 8.58 ± 0.095
3.174AlaPhe: 3.174 ± 0.048
15.294AlaGly: 15.294 ± 0.119
2.583AlaHis: 2.583 ± 0.04
3.597AlaIle: 3.597 ± 0.053
2.211AlaLys: 2.211 ± 0.047
14.052AlaLeu: 14.052 ± 0.151
2.356AlaMet: 2.356 ± 0.04
2.039AlaAsn: 2.039 ± 0.038
7.853AlaPro: 7.853 ± 0.09
3.582AlaGln: 3.582 ± 0.039
11.308AlaArg: 11.308 ± 0.123
5.971AlaSer: 5.971 ± 0.108
7.783AlaThr: 7.783 ± 0.077
12.724AlaVal: 12.724 ± 0.105
2.026AlaTrp: 2.026 ± 0.041
2.74AlaTyr: 2.74 ± 0.042
0.0AlaXaa: 0.0 ± 0.0
Cys
1.002CysAla: 1.002 ± 0.023
0.134CysCys: 0.134 ± 0.01
0.507CysAsp: 0.507 ± 0.02
0.327CysGlu: 0.327 ± 0.011
0.19CysPhe: 0.19 ± 0.01
0.96CysGly: 0.96 ± 0.028
0.186CysHis: 0.186 ± 0.011
0.139CysIle: 0.139 ± 0.008
0.09CysLys: 0.09 ± 0.008
0.646CysLeu: 0.646 ± 0.02
0.118CysMet: 0.118 ± 0.006
0.124CysAsn: 0.124 ± 0.008
0.575CysPro: 0.575 ± 0.016
0.187CysGln: 0.187 ± 0.011
0.739CysArg: 0.739 ± 0.023
0.409CysSer: 0.409 ± 0.016
0.448CysThr: 0.448 ± 0.014
0.627CysVal: 0.627 ± 0.019
0.165CysTrp: 0.165 ± 0.01
0.165CysTyr: 0.165 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
7.773AspAla: 7.773 ± 0.081
0.381AspCys: 0.381 ± 0.015
3.731AspAsp: 3.731 ± 0.056
3.708AspGlu: 3.708 ± 0.055
1.436AspPhe: 1.436 ± 0.028
6.381AspGly: 6.381 ± 0.083
1.314AspHis: 1.314 ± 0.028
1.643AspIle: 1.643 ± 0.032
0.94AspLys: 0.94 ± 0.03
6.644AspLeu: 6.644 ± 0.071
0.727AspMet: 0.727 ± 0.023
0.964AspAsn: 0.964 ± 0.028
5.121AspPro: 5.121 ± 0.078
1.813AspGln: 1.813 ± 0.05
5.755AspArg: 5.755 ± 0.055
2.203AspSer: 2.203 ± 0.041
2.873AspThr: 2.873 ± 0.041
4.883AspVal: 4.883 ± 0.058
1.071AspTrp: 1.071 ± 0.03
0.994AspTyr: 0.994 ± 0.023
0.0AspXaa: 0.0 ± 0.0
Glu
6.674GluAla: 6.674 ± 0.084
0.335GluCys: 0.335 ± 0.013
2.096GluAsp: 2.096 ± 0.038
2.565GluGlu: 2.565 ± 0.048
1.433GluPhe: 1.433 ± 0.028
3.457GluGly: 3.457 ± 0.055
1.343GluHis: 1.343 ± 0.033
2.033GluIle: 2.033 ± 0.04
1.058GluLys: 1.058 ± 0.027
6.312GluLeu: 6.312 ± 0.077
0.768GluMet: 0.768 ± 0.021
0.797GluAsn: 0.797 ± 0.021
3.511GluPro: 3.511 ± 0.05
2.161GluGln: 2.161 ± 0.032
5.167GluArg: 5.167 ± 0.07
2.231GluSer: 2.231 ± 0.042
2.497GluThr: 2.497 ± 0.041
4.539GluVal: 4.539 ± 0.054
0.793GluTrp: 0.793 ± 0.023
1.038GluTyr: 1.038 ± 0.025
0.0GluXaa: 0.0 ± 0.0
Phe
3.568PheAla: 3.568 ± 0.049
0.239PheCys: 0.239 ± 0.012
1.978PheAsp: 1.978 ± 0.036
1.104PheGlu: 1.104 ± 0.027
0.773PhePhe: 0.773 ± 0.024
2.881PheGly: 2.881 ± 0.038
0.544PheHis: 0.544 ± 0.017
0.59PheIle: 0.59 ± 0.017
0.367PheLys: 0.367 ± 0.016
2.423PheLeu: 2.423 ± 0.035
0.321PheMet: 0.321 ± 0.013
0.536PheAsn: 0.536 ± 0.019
1.283PhePro: 1.283 ± 0.026
0.608PheGln: 0.608 ± 0.017
1.755PheArg: 1.755 ± 0.033
1.254PheSer: 1.254 ± 0.026
1.823PheThr: 1.823 ± 0.032
2.307PheVal: 2.307 ± 0.037
0.425PheTrp: 0.425 ± 0.017
0.536PheTyr: 0.536 ± 0.02
0.0PheXaa: 0.0 ± 0.0
Gly
10.994GlyAla: 10.994 ± 0.089
0.853GlyCys: 0.853 ± 0.023
5.251GlyAsp: 5.251 ± 0.052
4.847GlyGlu: 4.847 ± 0.058
2.601GlyPhe: 2.601 ± 0.041
9.12GlyGly: 9.12 ± 0.087
2.327GlyHis: 2.327 ± 0.061
3.012GlyIle: 3.012 ± 0.043
2.027GlyLys: 2.027 ± 0.045
9.383GlyLeu: 9.383 ± 0.092
1.927GlyMet: 1.927 ± 0.035
1.758GlyAsn: 1.758 ± 0.038
5.951GlyPro: 5.951 ± 0.086
3.035GlyGln: 3.035 ± 0.053
9.083GlyArg: 9.083 ± 0.133
4.849GlySer: 4.849 ± 0.057
5.611GlyThr: 5.611 ± 0.064
8.021GlyVal: 8.021 ± 0.073
1.972GlyTrp: 1.972 ± 0.033
2.279GlyTyr: 2.279 ± 0.043
0.0GlyXaa: 0.0 ± 0.0
His
2.568HisAla: 2.568 ± 0.044
0.172HisCys: 0.172 ± 0.009
1.294HisAsp: 1.294 ± 0.025
1.009HisGlu: 1.009 ± 0.027
0.536HisPhe: 0.536 ± 0.018
2.184HisGly: 2.184 ± 0.039
0.619HisHis: 0.619 ± 0.017
0.59HisIle: 0.59 ± 0.023
0.234HisLys: 0.234 ± 0.01
2.423HisLeu: 2.423 ± 0.042
0.294HisMet: 0.294 ± 0.015
0.374HisAsn: 0.374 ± 0.017
1.799HisPro: 1.799 ± 0.031
0.61HisGln: 0.61 ± 0.017
2.18HisArg: 2.18 ± 0.041
0.821HisSer: 0.821 ± 0.019
1.226HisThr: 1.226 ± 0.055
1.734HisVal: 1.734 ± 0.03
0.326HisTrp: 0.326 ± 0.013
0.424HisTyr: 0.424 ± 0.014
0.0HisXaa: 0.0 ± 0.0
Ile
4.437IleAla: 4.437 ± 0.062
0.272IleCys: 0.272 ± 0.012
2.222IleAsp: 2.222 ± 0.042
1.748IleGlu: 1.748 ± 0.042
0.735IlePhe: 0.735 ± 0.019
3.139IleGly: 3.139 ± 0.045
0.558IleHis: 0.558 ± 0.016
0.899IleIle: 0.899 ± 0.023
0.639IleLys: 0.639 ± 0.022
2.221IleLeu: 2.221 ± 0.035
0.409IleMet: 0.409 ± 0.015
0.692IleAsn: 0.692 ± 0.023
1.622IlePro: 1.622 ± 0.033
0.688IleGln: 0.688 ± 0.02
2.306IleArg: 2.306 ± 0.038
1.539IleSer: 1.539 ± 0.027
2.013IleThr: 2.013 ± 0.039
2.634IleVal: 2.634 ± 0.049
0.504IleTrp: 0.504 ± 0.049
0.512IleTyr: 0.512 ± 0.017
0.0IleXaa: 0.0 ± 0.0
Lys
2.189LysAla: 2.189 ± 0.047
0.102LysCys: 0.102 ± 0.007
0.834LysAsp: 0.834 ± 0.028
0.779LysGlu: 0.779 ± 0.025
0.372LysPhe: 0.372 ± 0.014
1.32LysGly: 1.32 ± 0.038
0.323LysHis: 0.323 ± 0.015
0.744LysIle: 0.744 ± 0.023
0.557LysLys: 0.557 ± 0.024
1.824LysLeu: 1.824 ± 0.041
0.317LysMet: 0.317 ± 0.012
0.373LysAsn: 0.373 ± 0.014
1.116LysPro: 1.116 ± 0.026
0.637LysGln: 0.637 ± 0.019
1.328LysArg: 1.328 ± 0.03
0.868LysSer: 0.868 ± 0.021
1.082LysThr: 1.082 ± 0.03
1.474LysVal: 1.474 ± 0.032
0.219LysTrp: 0.219 ± 0.01
0.342LysTyr: 0.342 ± 0.014
0.0LysXaa: 0.0 ± 0.0
Leu
16.468LeuAla: 16.468 ± 0.149
0.721LeuCys: 0.721 ± 0.02
6.818LeuAsp: 6.818 ± 0.075
4.02LeuGlu: 4.02 ± 0.059
2.511LeuPhe: 2.511 ± 0.046
8.969LeuGly: 8.969 ± 0.086
2.141LeuHis: 2.141 ± 0.034
3.03LeuIle: 3.03 ± 0.048
1.452LeuLys: 1.452 ± 0.039
11.282LeuLeu: 11.282 ± 0.128
1.337LeuMet: 1.337 ± 0.03
1.616LeuAsn: 1.616 ± 0.029
7.027LeuPro: 7.027 ± 0.079
2.012LeuGln: 2.012 ± 0.038
9.341LeuArg: 9.341 ± 0.091
4.827LeuSer: 4.827 ± 0.055
6.43LeuThr: 6.43 ± 0.069
9.421LeuVal: 9.421 ± 0.099
1.312LeuTrp: 1.312 ± 0.031
1.6LeuTyr: 1.6 ± 0.032
0.0LeuXaa: 0.0 ± 0.0
Met
2.068MetAla: 2.068 ± 0.032
0.105MetCys: 0.105 ± 0.008
0.841MetAsp: 0.841 ± 0.031
0.586MetGlu: 0.586 ± 0.015
0.422MetPhe: 0.422 ± 0.015
1.124MetGly: 1.124 ± 0.026
0.286MetHis: 0.286 ± 0.013
0.632MetIle: 0.632 ± 0.018
0.318MetLys: 0.318 ± 0.013
1.699MetLeu: 1.699 ± 0.035
0.256MetMet: 0.256 ± 0.012
0.33MetAsn: 0.33 ± 0.013
1.127MetPro: 1.127 ± 0.028
0.437MetGln: 0.437 ± 0.016
1.391MetArg: 1.391 ± 0.031
1.19MetSer: 1.19 ± 0.023
1.454MetThr: 1.454 ± 0.03
1.305MetVal: 1.305 ± 0.027
0.193MetTrp: 0.193 ± 0.01
0.244MetTyr: 0.244 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
2.145AsnAla: 2.145 ± 0.034
0.153AsnCys: 0.153 ± 0.009
0.938AsnAsp: 0.938 ± 0.029
0.745AsnGlu: 0.745 ± 0.023
0.487AsnPhe: 0.487 ± 0.017
1.689AsnGly: 1.689 ± 0.041
0.414AsnHis: 0.414 ± 0.017
0.551AsnIle: 0.551 ± 0.017
0.34AsnLys: 0.34 ± 0.015
1.864AsnLeu: 1.864 ± 0.035
0.26AsnMet: 0.26 ± 0.013
0.437AsnAsn: 0.437 ± 0.018
1.425AsnPro: 1.425 ± 0.029
0.523AsnGln: 0.523 ± 0.019
1.424AsnArg: 1.424 ± 0.033
0.809AsnSer: 0.809 ± 0.022
0.947AsnThr: 0.947 ± 0.024
1.361AsnVal: 1.361 ± 0.028
0.335AsnTrp: 0.335 ± 0.014
0.376AsnTyr: 0.376 ± 0.015
0.0AsnXaa: 0.0 ± 0.0
Pro
11.102ProAla: 11.102 ± 0.118
0.377ProCys: 0.377 ± 0.015
4.966ProAsp: 4.966 ± 0.08
3.874ProGlu: 3.874 ± 0.057
1.387ProPhe: 1.387 ± 0.034
7.557ProGly: 7.557 ± 0.112
1.318ProHis: 1.318 ± 0.03
1.515ProIle: 1.515 ± 0.026
0.998ProLys: 0.998 ± 0.025
5.363ProLeu: 5.363 ± 0.056
1.04ProMet: 1.04 ± 0.027
0.985ProAsn: 0.985 ± 0.026
4.992ProPro: 4.992 ± 0.094
1.889ProGln: 1.889 ± 0.053
4.589ProArg: 4.589 ± 0.066
3.238ProSer: 3.238 ± 0.046
4.003ProThr: 4.003 ± 0.053
6.033ProVal: 6.033 ± 0.065
1.008ProTrp: 1.008 ± 0.022
1.309ProTyr: 1.309 ± 0.032
0.0ProXaa: 0.0 ± 0.0
Gln
3.904GlnAla: 3.904 ± 0.055
0.145GlnCys: 0.145 ± 0.009
1.082GlnAsp: 1.082 ± 0.027
1.325GlnGlu: 1.325 ± 0.045
0.741GlnPhe: 0.741 ± 0.018
1.981GlnGly: 1.981 ± 0.038
0.702GlnHis: 0.702 ± 0.025
0.999GlnIle: 0.999 ± 0.026
0.479GlnLys: 0.479 ± 0.02
3.281GlnLeu: 3.281 ± 0.051
0.427GlnMet: 0.427 ± 0.015
0.446GlnAsn: 0.446 ± 0.015
2.263GlnPro: 2.263 ± 0.068
1.25GlnGln: 1.25 ± 0.036
2.873GlnArg: 2.873 ± 0.044
1.097GlnSer: 1.097 ± 0.028
1.393GlnThr: 1.393 ± 0.04
2.781GlnVal: 2.781 ± 0.04
0.507GlnTrp: 0.507 ± 0.019
0.5GlnTyr: 0.5 ± 0.016
0.0GlnXaa: 0.0 ± 0.0
Arg
10.86ArgAla: 10.86 ± 0.112
0.789ArgCys: 0.789 ± 0.021
4.794ArgAsp: 4.794 ± 0.048
4.435ArgGlu: 4.435 ± 0.06
2.381ArgPhe: 2.381 ± 0.037
6.313ArgGly: 6.313 ± 0.06
2.244ArgHis: 2.244 ± 0.04
3.217ArgIle: 3.217 ± 0.063
1.362ArgLys: 1.362 ± 0.029
9.707ArgLeu: 9.707 ± 0.116
1.974ArgMet: 1.974 ± 0.037
1.475ArgAsn: 1.475 ± 0.031
6.265ArgPro: 6.265 ± 0.084
2.901ArgGln: 2.901 ± 0.05
9.781ArgArg: 9.781 ± 0.118
4.193ArgSer: 4.193 ± 0.055
4.919ArgThr: 4.919 ± 0.06
6.836ArgVal: 6.836 ± 0.068
1.826ArgTrp: 1.826 ± 0.034
2.172ArgTyr: 2.172 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
6.417SerAla: 6.417 ± 0.079
0.419SerCys: 0.419 ± 0.017
2.508SerAsp: 2.508 ± 0.041
1.934SerGlu: 1.934 ± 0.035
1.358SerPhe: 1.358 ± 0.027
5.607SerGly: 5.607 ± 0.088
0.86SerHis: 0.86 ± 0.022
1.402SerIle: 1.402 ± 0.029
0.782SerLys: 0.782 ± 0.023
4.14SerLeu: 4.14 ± 0.052
0.99SerMet: 0.99 ± 0.021
0.829SerAsn: 0.829 ± 0.024
3.316SerPro: 3.316 ± 0.038
1.189SerGln: 1.189 ± 0.028
3.871SerArg: 3.871 ± 0.05
2.522SerSer: 2.522 ± 0.041
3.14SerThr: 3.14 ± 0.052
3.983SerVal: 3.983 ± 0.044
1.127SerTrp: 1.127 ± 0.067
1.065SerTyr: 1.065 ± 0.032
0.0SerXaa: 0.0 ± 0.0
Thr
8.232ThrAla: 8.232 ± 0.085
0.458ThrCys: 0.458 ± 0.017
3.575ThrAsp: 3.575 ± 0.048
2.778ThrGlu: 2.778 ± 0.037
1.472ThrPhe: 1.472 ± 0.029
6.333ThrGly: 6.333 ± 0.062
1.033ThrHis: 1.033 ± 0.029
1.863ThrIle: 1.863 ± 0.033
0.942ThrLys: 0.942 ± 0.03
5.514ThrLeu: 5.514 ± 0.067
0.925ThrMet: 0.925 ± 0.022
1.018ThrAsn: 1.018 ± 0.025
4.295ThrPro: 4.295 ± 0.062
1.237ThrGln: 1.237 ± 0.026
4.223ThrArg: 4.223 ± 0.046
3.245ThrSer: 3.245 ± 0.086
3.822ThrThr: 3.822 ± 0.079
6.158ThrVal: 6.158 ± 0.082
0.902ThrTrp: 0.902 ± 0.025
1.191ThrTyr: 1.191 ± 0.032
0.0ThrXaa: 0.0 ± 0.0
Val
12.707ValAla: 12.707 ± 0.11
0.712ValCys: 0.712 ± 0.02
5.759ValAsp: 5.759 ± 0.065
4.826ValGlu: 4.826 ± 0.057
2.216ValPhe: 2.216 ± 0.038
7.626ValGly: 7.626 ± 0.083
1.77ValHis: 1.77 ± 0.035
2.46ValIle: 2.46 ± 0.044
1.398ValLys: 1.398 ± 0.032
9.38ValLeu: 9.38 ± 0.104
1.111ValMet: 1.111 ± 0.028
1.636ValAsn: 1.636 ± 0.033
5.722ValPro: 5.722 ± 0.066
1.991ValGln: 1.991 ± 0.035
7.703ValArg: 7.703 ± 0.079
4.358ValSer: 4.358 ± 0.054
5.766ValThr: 5.766 ± 0.073
8.701ValVal: 8.701 ± 0.083
1.151ValTrp: 1.151 ± 0.031
1.452ValTyr: 1.452 ± 0.027
0.0ValXaa: 0.0 ± 0.0
Trp
1.917TrpAla: 1.917 ± 0.039
0.189TrpCys: 0.189 ± 0.01
0.883TrpAsp: 0.883 ± 0.052
0.611TrpGlu: 0.611 ± 0.021
0.522TrpPhe: 0.522 ± 0.017
0.912TrpGly: 0.912 ± 0.019
0.392TrpHis: 0.392 ± 0.017
0.487TrpIle: 0.487 ± 0.018
0.31TrpLys: 0.31 ± 0.013
1.988TrpLeu: 1.988 ± 0.037
0.268TrpMet: 0.268 ± 0.012
0.447TrpAsn: 0.447 ± 0.022
1.112TrpPro: 1.112 ± 0.032
0.788TrpGln: 0.788 ± 0.05
1.752TrpArg: 1.752 ± 0.037
1.008TrpSer: 1.008 ± 0.026
0.988TrpThr: 0.988 ± 0.022
1.268TrpVal: 1.268 ± 0.031
0.418TrpTrp: 0.418 ± 0.018
0.368TrpTyr: 0.368 ± 0.013
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.727TyrAla: 2.727 ± 0.042
0.196TyrCys: 0.196 ± 0.011
1.385TyrAsp: 1.385 ± 0.035
1.005TyrGlu: 1.005 ± 0.028
0.57TyrPhe: 0.57 ± 0.018
2.058TyrGly: 2.058 ± 0.038
0.46TyrHis: 0.46 ± 0.015
0.384TyrIle: 0.384 ± 0.016
0.295TyrLys: 0.295 ± 0.012
2.19TyrLeu: 2.19 ± 0.036
0.184TyrMet: 0.184 ± 0.01
0.342TyrAsn: 0.342 ± 0.018
1.17TyrPro: 1.17 ± 0.024
0.637TyrGln: 0.637 ± 0.019
1.873TyrArg: 1.873 ± 0.033
0.828TyrSer: 0.828 ± 0.024
0.999TyrThr: 0.999 ± 0.028
1.654TyrVal: 1.654 ± 0.032
0.349TyrTrp: 0.349 ± 0.013
0.386TyrTyr: 0.386 ± 0.014
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6202 proteins (2002684 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski