Amino acid dipeptide frequency for Muribaculaceae bacterium Isolate-037 (Harlan)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
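As an illustration, the counting can be sketched in Python as below. This is only a minimal sketch under stated assumptions, not the procedure used by the database: it assumes overlapping adjacent pairs are counted within each protein (never across protein boundaries), uses one-letter codes instead of the three-letter codes of the table, and maps every non-standard residue to X (Xaa). The function names and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue becomes 'X' (Xaa).
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # zip pairs residue i with residue i+1; pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all counted pairs."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome; 'U' is non-standard and is counted as 'X'.
toy_proteome = ["MKLLA", "MAAU"]
freqs = dipeptide_per_mille(toy_proteome)
```

By construction the per mille values of all observed dipeptides sum to 1000, which is a quick sanity check when applying the same idea to a real proteome.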

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
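The arithmetic behind the uniform baseline and the unit conversion can be sketched as follows. The helper names are hypothetical, and the page does not state the exact rule used for the red/blue marking, so the simple comparison against the uniform expectation is an assumption. The two example values (LeuLeu 7.504‰ and CysCys 0.194‰) are taken from the table.

```python
def expected_uniform_per_mille(n_residues=20):
    """Expected per mille frequency if all n*n dipeptides were equally likely."""
    return 1000.0 / (n_residues ** 2)


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(observed_per_mille, expected_per_mille):
    """Assumed rule: compare the observed value with the uniform expectation."""
    if observed_per_mille > expected_per_mille:
        return "over"   # would be marked red
    if observed_per_mille < expected_per_mille:
        return "under"  # would be marked blue
    return "as expected"


baseline = expected_uniform_per_mille()        # 20 * 20 = 400 dipeptides
baseline_xaa = expected_uniform_per_mille(21)  # 21 * 21 = 441 dipeptides

leu_leu = classify(7.504, baseline)  # LeuLeu value from the table
cys_cys = classify(0.194, baseline)  # CysCys value from the table
```

With 20 residues the baseline is 2.5‰ (0.25%); counting Xaa as well lowers it to roughly 2.27‰.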

For more information, see the sequence space article on Wikipedia.

Residue order (rows give the first residue, columns the second): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry is listed as dipeptide: frequency ± error (‰), grouped by first residue.
Ala
AlaAla: 6.32 ± 0.108
AlaCys: 0.851 ± 0.029
AlaAsp: 5.102 ± 0.069
AlaGlu: 4.755 ± 0.083
AlaPhe: 2.927 ± 0.046
AlaGly: 5.07 ± 0.068
AlaHis: 1.173 ± 0.03
AlaIle: 5.079 ± 0.073
AlaLys: 4.133 ± 0.057
AlaLeu: 6.435 ± 0.072
AlaMet: 2.074 ± 0.045
AlaAsn: 3.297 ± 0.061
AlaPro: 2.483 ± 0.047
AlaGln: 2.256 ± 0.05
AlaArg: 3.358 ± 0.061
AlaSer: 4.804 ± 0.067
AlaThr: 4.227 ± 0.056
AlaVal: 4.83 ± 0.069
AlaTrp: 0.868 ± 0.026
AlaTyr: 2.797 ± 0.046
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.839 ± 0.024
CysCys: 0.194 ± 0.012
CysAsp: 0.763 ± 0.024
CysGlu: 0.637 ± 0.026
CysPhe: 0.488 ± 0.019
CysGly: 1.163 ± 0.031
CysHis: 0.306 ± 0.014
CysIle: 0.785 ± 0.024
CysLys: 0.583 ± 0.02
CysLeu: 0.958 ± 0.025
CysMet: 0.319 ± 0.014
CysAsn: 0.564 ± 0.021
CysPro: 0.488 ± 0.022
CysGln: 0.29 ± 0.015
CysArg: 0.815 ± 0.024
CysSer: 0.83 ± 0.025
CysThr: 0.53 ± 0.019
CysVal: 0.798 ± 0.025
CysTrp: 0.161 ± 0.011
CysTyr: 0.508 ± 0.021
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.59 ± 0.07
AspCys: 0.74 ± 0.027
AspAsp: 3.51 ± 0.065
AspGlu: 4.04 ± 0.061
AspPhe: 3.18 ± 0.052
AspGly: 5.114 ± 0.075
AspHis: 0.906 ± 0.029
AspIle: 4.486 ± 0.059
AspLys: 4.098 ± 0.062
AspLeu: 4.781 ± 0.057
AspMet: 1.917 ± 0.039
AspAsn: 3.594 ± 0.066
AspPro: 2.313 ± 0.04
AspGln: 1.087 ± 0.031
AspArg: 3.115 ± 0.047
AspSer: 4.233 ± 0.063
AspThr: 2.924 ± 0.05
AspVal: 3.934 ± 0.061
AspTrp: 0.947 ± 0.028
AspTyr: 3.031 ± 0.047
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.619 ± 0.08
GluCys: 0.712 ± 0.026
GluAsp: 3.391 ± 0.052
GluGlu: 4.647 ± 0.088
GluPhe: 2.488 ± 0.05
GluGly: 3.979 ± 0.067
GluHis: 1.011 ± 0.027
GluIle: 5.061 ± 0.065
GluLys: 4.549 ± 0.067
GluLeu: 5.478 ± 0.07
GluMet: 2.002 ± 0.043
GluAsn: 3.59 ± 0.059
GluPro: 1.927 ± 0.041
GluGln: 1.955 ± 0.042
GluArg: 3.478 ± 0.058
GluSer: 3.882 ± 0.057
GluThr: 3.244 ± 0.051
GluVal: 3.867 ± 0.059
GluTrp: 0.969 ± 0.026
GluTyr: 2.842 ± 0.05
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.024 ± 0.049
PheCys: 0.616 ± 0.02
PheAsp: 2.917 ± 0.052
PheGlu: 2.313 ± 0.044
PhePhe: 1.926 ± 0.04
PheGly: 3.312 ± 0.062
PheHis: 0.85 ± 0.027
PheIle: 3.01 ± 0.058
PheLys: 2.323 ± 0.045
PheLeu: 3.483 ± 0.058
PheMet: 1.257 ± 0.036
PheAsn: 2.468 ± 0.047
PhePro: 1.684 ± 0.037
PheGln: 1.037 ± 0.03
PheArg: 2.262 ± 0.038
PheSer: 3.569 ± 0.054
PheThr: 2.574 ± 0.04
PheVal: 2.587 ± 0.049
PheTrp: 0.534 ± 0.023
PheTyr: 1.698 ± 0.036
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.635 ± 0.068
GlyCys: 0.992 ± 0.032
GlyAsp: 4.151 ± 0.061
GlyGlu: 4.461 ± 0.066
GlyPhe: 3.125 ± 0.044
GlyGly: 4.747 ± 0.086
GlyHis: 1.247 ± 0.036
GlyIle: 5.344 ± 0.068
GlyLys: 5.222 ± 0.064
GlyLeu: 5.574 ± 0.068
GlyMet: 2.177 ± 0.042
GlyAsn: 4.074 ± 0.068
GlyPro: 1.312 ± 0.036
GlyGln: 1.829 ± 0.039
GlyArg: 3.263 ± 0.055
GlySer: 4.328 ± 0.071
GlyThr: 3.788 ± 0.069
GlyVal: 4.915 ± 0.073
GlyTrp: 1.108 ± 0.029
GlyTyr: 3.312 ± 0.053
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.098 ± 0.029
HisCys: 0.265 ± 0.013
HisAsp: 1.185 ± 0.032
HisGlu: 1.004 ± 0.024
HisPhe: 0.873 ± 0.025
HisGly: 1.247 ± 0.032
HisHis: 0.452 ± 0.023
HisIle: 1.347 ± 0.034
HisLys: 0.971 ± 0.03
HisLeu: 1.579 ± 0.035
HisMet: 0.289 ± 0.015
HisAsn: 0.965 ± 0.025
HisPro: 0.978 ± 0.028
HisGln: 0.524 ± 0.02
HisArg: 0.956 ± 0.027
HisSer: 1.166 ± 0.032
HisThr: 1.001 ± 0.026
HisVal: 0.996 ± 0.026
HisTrp: 0.249 ± 0.014
HisTyr: 0.799 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.33 ± 0.075
IleCys: 0.965 ± 0.025
IleAsp: 4.896 ± 0.064
IleGlu: 4.499 ± 0.065
IlePhe: 2.891 ± 0.052
IleGly: 4.751 ± 0.069
IleHis: 1.147 ± 0.033
IleIle: 4.607 ± 0.077
IleLys: 3.976 ± 0.066
IleLeu: 5.699 ± 0.075
IleMet: 1.673 ± 0.039
IleAsn: 3.559 ± 0.051
IlePro: 3.257 ± 0.051
IleGln: 1.738 ± 0.041
IleArg: 3.428 ± 0.057
IleSer: 5.413 ± 0.066
IleThr: 3.872 ± 0.055
IleVal: 4.468 ± 0.06
IleTrp: 0.762 ± 0.025
IleTyr: 2.691 ± 0.05
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.547 ± 0.069
LysCys: 0.557 ± 0.023
LysAsp: 3.898 ± 0.066
LysGlu: 4.741 ± 0.076
LysPhe: 2.321 ± 0.041
LysGly: 4.281 ± 0.064
LysHis: 0.96 ± 0.027
LysIle: 4.308 ± 0.062
LysLys: 4.086 ± 0.071
LysLeu: 4.727 ± 0.062
LysMet: 1.957 ± 0.039
LysAsn: 3.146 ± 0.055
LysPro: 2.139 ± 0.039
LysGln: 1.768 ± 0.037
LysArg: 3.279 ± 0.058
LysSer: 3.716 ± 0.051
LysThr: 3.33 ± 0.049
LysVal: 3.97 ± 0.057
LysTrp: 0.841 ± 0.026
LysTyr: 2.636 ± 0.038
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.093 ± 0.07
LeuCys: 1.209 ± 0.031
LeuAsp: 5.048 ± 0.058
LeuGlu: 4.668 ± 0.067
LeuPhe: 3.683 ± 0.063
LeuGly: 5.266 ± 0.071
LeuHis: 1.683 ± 0.038
LeuIle: 5.432 ± 0.067
LeuLys: 5.451 ± 0.077
LeuLeu: 7.504 ± 0.089
LeuMet: 2.406 ± 0.044
LeuAsn: 4.488 ± 0.059
LeuPro: 4.221 ± 0.06
LeuGln: 2.717 ± 0.049
LeuArg: 4.762 ± 0.073
LeuSer: 7.11 ± 0.084
LeuThr: 5.25 ± 0.058
LeuVal: 4.623 ± 0.068
LeuTrp: 1.054 ± 0.028
LeuTyr: 3.324 ± 0.053
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.328 ± 0.041
MetCys: 0.261 ± 0.013
MetAsp: 1.388 ± 0.032
MetGlu: 1.796 ± 0.037
MetPhe: 1.018 ± 0.029
MetGly: 1.664 ± 0.034
MetHis: 0.43 ± 0.017
MetIle: 1.76 ± 0.034
MetLys: 2.194 ± 0.044
MetLeu: 2.625 ± 0.044
MetMet: 0.91 ± 0.027
MetAsn: 1.42 ± 0.028
MetPro: 1.355 ± 0.028
MetGln: 0.974 ± 0.025
MetArg: 1.61 ± 0.037
MetSer: 1.84 ± 0.036
MetThr: 1.776 ± 0.031
MetVal: 1.71 ± 0.037
MetTrp: 0.29 ± 0.016
MetTyr: 0.777 ± 0.025
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.752 ± 0.056
AsnCys: 0.541 ± 0.02
AsnAsp: 3.04 ± 0.049
AsnGlu: 2.886 ± 0.051
AsnPhe: 2.263 ± 0.047
AsnGly: 4.244 ± 0.061
AsnHis: 1.038 ± 0.026
AsnIle: 3.804 ± 0.066
AsnLys: 2.733 ± 0.05
AsnLeu: 4.61 ± 0.063
AsnMet: 1.314 ± 0.031
AsnAsn: 2.705 ± 0.056
AsnPro: 2.899 ± 0.05
AsnGln: 1.443 ± 0.037
AsnArg: 2.717 ± 0.047
AsnSer: 3.091 ± 0.05
AsnThr: 2.572 ± 0.052
AsnVal: 3.336 ± 0.057
AsnTrp: 0.695 ± 0.024
AsnTyr: 2.313 ± 0.049
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.887 ± 0.051
ProCys: 0.369 ± 0.015
ProAsp: 3.125 ± 0.049
ProGlu: 3.583 ± 0.048
ProPhe: 1.808 ± 0.039
ProGly: 2.643 ± 0.044
ProHis: 0.715 ± 0.024
ProIle: 2.407 ± 0.041
ProLys: 2.049 ± 0.044
ProLeu: 3.222 ± 0.057
ProMet: 1.027 ± 0.028
ProAsn: 1.676 ± 0.032
ProPro: 0.908 ± 0.031
ProGln: 1.371 ± 0.03
ProArg: 1.51 ± 0.034
ProSer: 2.693 ± 0.043
ProThr: 2.282 ± 0.047
ProVal: 2.93 ± 0.049
ProTrp: 0.498 ± 0.018
ProTyr: 1.636 ± 0.039
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.949 ± 0.042
GlnCys: 0.28 ± 0.016
GlnAsp: 1.458 ± 0.035
GlnGlu: 1.798 ± 0.035
GlnPhe: 1.231 ± 0.029
GlnGly: 1.75 ± 0.034
GlnHis: 0.489 ± 0.019
GlnIle: 2.038 ± 0.039
GlnLys: 1.825 ± 0.04
GlnLeu: 2.803 ± 0.05
GlnMet: 0.887 ± 0.026
GlnAsn: 1.517 ± 0.04
GlnPro: 1.111 ± 0.03
GlnGln: 1.179 ± 0.038
GlnArg: 1.662 ± 0.036
GlnSer: 1.809 ± 0.038
GlnThr: 1.756 ± 0.037
GlnVal: 1.55 ± 0.034
GlnTrp: 0.449 ± 0.019
GlnTyr: 1.287 ± 0.031
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.918 ± 0.05
ArgCys: 0.548 ± 0.021
ArgAsp: 2.779 ± 0.047
ArgGlu: 3.339 ± 0.061
ArgPhe: 2.485 ± 0.048
ArgGly: 2.815 ± 0.043
ArgHis: 1.225 ± 0.032
ArgIle: 3.973 ± 0.054
ArgLys: 3.56 ± 0.053
ArgLeu: 5.14 ± 0.075
ArgMet: 1.598 ± 0.036
ArgAsn: 2.865 ± 0.048
ArgPro: 1.782 ± 0.034
ArgGln: 1.953 ± 0.045
ArgArg: 3.096 ± 0.061
ArgSer: 2.634 ± 0.048
ArgThr: 2.386 ± 0.042
ArgVal: 2.862 ± 0.046
ArgTrp: 0.703 ± 0.023
ArgTyr: 2.397 ± 0.041
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.749 ± 0.065
SerCys: 0.829 ± 0.022
SerAsp: 4.499 ± 0.062
SerGlu: 3.998 ± 0.06
SerPhe: 3.244 ± 0.053
SerGly: 5.196 ± 0.073
SerHis: 1.299 ± 0.03
SerIle: 4.537 ± 0.057
SerLys: 3.602 ± 0.06
SerLeu: 6.423 ± 0.081
SerMet: 1.76 ± 0.04
SerAsn: 3.079 ± 0.059
SerPro: 2.815 ± 0.056
SerGln: 1.963 ± 0.035
SerArg: 3.235 ± 0.058
SerSer: 4.369 ± 0.073
SerThr: 3.474 ± 0.058
SerVal: 4.77 ± 0.074
SerTrp: 0.886 ± 0.026
SerTyr: 2.762 ± 0.05
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.137 ± 0.062
ThrCys: 0.493 ± 0.018
ThrAsp: 3.838 ± 0.059
ThrGlu: 3.178 ± 0.056
ThrPhe: 2.471 ± 0.045
ThrGly: 4.213 ± 0.061
ThrHis: 0.969 ± 0.029
ThrIle: 3.747 ± 0.056
ThrLys: 2.658 ± 0.05
ThrLeu: 5.203 ± 0.062
ThrMet: 1.221 ± 0.03
ThrAsn: 2.323 ± 0.046
ThrPro: 3.008 ± 0.053
ThrGln: 1.518 ± 0.033
ThrArg: 2.347 ± 0.041
ThrSer: 3.496 ± 0.056
ThrThr: 3.201 ± 0.063
ThrVal: 4.056 ± 0.058
ThrTrp: 0.628 ± 0.023
ThrTyr: 2.282 ± 0.047
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.294 ± 0.077
ValCys: 0.808 ± 0.025
ValAsp: 3.939 ± 0.064
ValGlu: 4.245 ± 0.064
ValPhe: 2.55 ± 0.046
ValGly: 4.125 ± 0.067
ValHis: 0.873 ± 0.023
ValIle: 4.363 ± 0.06
ValLys: 4.163 ± 0.056
ValLeu: 4.849 ± 0.061
ValMet: 1.894 ± 0.042
ValAsn: 3.419 ± 0.055
ValPro: 2.528 ± 0.049
ValGln: 1.493 ± 0.03
ValArg: 3.112 ± 0.045
ValSer: 4.703 ± 0.064
ValThr: 3.822 ± 0.059
ValVal: 4.633 ± 0.071
ValTrp: 0.727 ± 0.023
ValTyr: 2.478 ± 0.045
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.784 ± 0.024
TrpCys: 0.192 ± 0.013
TrpAsp: 0.774 ± 0.027
TrpGlu: 0.824 ± 0.025
TrpPhe: 0.552 ± 0.022
TrpGly: 0.983 ± 0.03
TrpHis: 0.292 ± 0.015
TrpIle: 0.896 ± 0.028
TrpLys: 0.816 ± 0.028
TrpLeu: 1.265 ± 0.038
TrpMet: 0.404 ± 0.018
TrpAsn: 0.853 ± 0.029
TrpPro: 0.246 ± 0.014
TrpGln: 0.521 ± 0.022
TrpArg: 0.727 ± 0.023
TrpSer: 0.821 ± 0.027
TrpThr: 0.704 ± 0.026
TrpVal: 0.75 ± 0.024
TrpTrp: 0.264 ± 0.016
TrpTyr: 0.518 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.944 ± 0.048
TyrCys: 0.595 ± 0.019
TyrAsp: 2.836 ± 0.058
TyrGlu: 2.28 ± 0.04
TyrPhe: 1.922 ± 0.035
TyrGly: 3.04 ± 0.054
TyrHis: 0.884 ± 0.027
TyrIle: 2.639 ± 0.058
TyrLys: 2.215 ± 0.044
TyrLeu: 3.635 ± 0.05
TyrMet: 1.039 ± 0.03
TyrAsn: 2.393 ± 0.052
TyrPro: 1.886 ± 0.04
TyrGln: 1.217 ± 0.033
TyrArg: 2.325 ± 0.038
TyrSer: 2.984 ± 0.054
TyrThr: 2.273 ± 0.053
TyrVal: 2.43 ± 0.036
TyrTrp: 0.55 ± 0.021
TyrTyr: 1.985 ± 0.044
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3984 proteins (1419065 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
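A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is an illustration under assumptions rather than the database's actual code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error (the page does not say which spread measure is used). The function names, the seed, and the one-letter sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Overlapping adjacent residue pairs of a single protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of
    `pair` over n_boot resamples of whole proteins."""
    rng = random.Random(seed)
    # Count each protein once; replicates only re-weight these counts.
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins (not residues) with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
mean_ll, err_ll = bootstrap_error(["MKLLA", "MAAL", "LLLK"], "LL")
```

Resampling whole proteins rather than individual dipeptides keeps residues from the same protein together, so the error reflects variation between proteins.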

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski