Amino acid dipeptide frequency for Anaerobium acetethylicum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs in the proteome.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each one would occur with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
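The counting described above can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of one-letter codes (the table uses three-letter codes), the mapping of every non-standard letter to `X`, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard residues in one-letter code (assumption: input uses one-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Collapse anything outside the 20 standard residues into X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Expectation if all 400 standard dipeptides were equally likely: 2.5 per mille (0.25 %).
UNIFORM_PERMILLE = 1000.0 / 400

toy = ["MKKLA", "MKUA"]  # hypothetical sequences; U is treated as Xaa
freqs = dipeptide_permille(toy)
overrepresented = sorted(dp for dp, f in freqs.items() if f > UNIFORM_PERMILLE)
print(freqs["MK"], overrepresented)
```

On a real proteome the same comparison against `UNIFORM_PERMILLE` would separate dipeptides above the uniform expectation from those below it; whether the page uses exactly this threshold for its colouring is not stated in the source.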

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain plain fractions.
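As a worked example of this unit conversion, the sketch below turns one table value into a fraction and a percentage and compares it with the 0.25% uniform expectation. The helper name `permille_to_fraction` is illustrative only; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala_permille = 7.726                 # AlaAla entry from the table
fraction = permille_to_fraction(ala_ala_permille)
percent = fraction * 100                 # same value expressed in percent
uniform_fraction = 1 / 400               # 0.25 %: all 400 dipeptides equally likely
enrichment = fraction / uniform_fraction # ratio to the uniform expectation

print(f"{fraction:.6f} {percent:.4f}% {enrichment:.2f}x")
# prints: 0.007726 0.7726% 3.09x
```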

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency ± error in ‰.

     Ala            Cys            Asp            Glu            Phe            Gly            His            Ile            Lys            Leu            Met            Asn            Pro            Gln            Arg            Ser            Thr            Val            Trp            Tyr            Xaa
Ala  7.726 ± 0.119  1.077 ± 0.028  4.743 ± 0.078  5.84 ± 0.086   3.172 ± 0.055  6.679 ± 0.084  0.981 ± 0.029  5.509 ± 0.075  4.682 ± 0.069  6.609 ± 0.082  2.343 ± 0.037  2.697 ± 0.049  1.93 ± 0.046   1.883 ± 0.037  2.733 ± 0.045  4.298 ± 0.072  3.402 ± 0.06   6.378 ± 0.083  0.577 ± 0.02   2.9 ± 0.052    0.0 ± 0.0
Cys  0.837 ± 0.027  0.24 ± 0.015   0.795 ± 0.022  0.834 ± 0.024  0.567 ± 0.022  1.39 ± 0.038   0.279 ± 0.016  1.073 ± 0.029  0.688 ± 0.02   0.994 ± 0.029  0.427 ± 0.016  0.6 ± 0.022    0.599 ± 0.023  0.336 ± 0.015  0.742 ± 0.026  0.893 ± 0.027  0.705 ± 0.021  0.818 ± 0.024  0.112 ± 0.009  0.496 ± 0.018  0.0 ± 0.0
Asp  4.212 ± 0.067  0.728 ± 0.026  2.757 ± 0.044  4.583 ± 0.061  2.707 ± 0.043  4.351 ± 0.075  0.809 ± 0.028  4.873 ± 0.069  3.506 ± 0.046  4.492 ± 0.064  1.819 ± 0.038  2.27 ± 0.049   1.709 ± 0.044  1.382 ± 0.034  2.573 ± 0.045  3.441 ± 0.058  2.817 ± 0.053  3.654 ± 0.056  0.6 ± 0.024    2.812 ± 0.052  0.0 ± 0.0
Glu  5.942 ± 0.086  0.766 ± 0.027  4.203 ± 0.067  6.997 ± 0.095  3.083 ± 0.054  4.585 ± 0.058  1.256 ± 0.031  6.253 ± 0.082  6.743 ± 0.087  6.86 ± 0.074   2.459 ± 0.045  4.231 ± 0.057  1.828 ± 0.045  2.434 ± 0.048  3.019 ± 0.051  3.795 ± 0.054  3.932 ± 0.062  4.415 ± 0.058  0.716 ± 0.029  3.246 ± 0.06   0.0 ± 0.0
Phe  3.023 ± 0.057  0.674 ± 0.021  2.575 ± 0.045  3.1 ± 0.056    1.844 ± 0.048  3.324 ± 0.053  0.763 ± 0.027  3.08 ± 0.055   2.33 ± 0.047   3.759 ± 0.063  1.364 ± 0.032  1.762 ± 0.039  1.299 ± 0.031  1.244 ± 0.028  1.846 ± 0.042  3.007 ± 0.054  2.287 ± 0.049  2.929 ± 0.047  0.439 ± 0.019  1.742 ± 0.041  0.0 ± 0.0
Gly  5.027 ± 0.08   1.206 ± 0.035  3.486 ± 0.043  4.684 ± 0.067  3.432 ± 0.054  5.005 ± 0.073  1.255 ± 0.032  6.952 ± 0.077  5.335 ± 0.076  5.985 ± 0.07   2.524 ± 0.045  3.453 ± 0.061  1.36 ± 0.033   1.948 ± 0.052  3.239 ± 0.054  4.456 ± 0.07   4.451 ± 0.075  4.738 ± 0.07   0.664 ± 0.024  3.347 ± 0.052  0.0 ± 0.0
His  1.145 ± 0.029  0.263 ± 0.014  0.836 ± 0.025  1.108 ± 0.029  0.828 ± 0.026  1.23 ± 0.03    0.344 ± 0.017  1.307 ± 0.035  0.949 ± 0.027  1.374 ± 0.036  0.514 ± 0.02   0.703 ± 0.024  0.788 ± 0.028  0.476 ± 0.02   0.663 ± 0.023  0.935 ± 0.028  0.829 ± 0.023  1.003 ± 0.028  0.153 ± 0.012  0.782 ± 0.029  0.0 ± 0.0
Ile  6.135 ± 0.08   1.218 ± 0.031  4.394 ± 0.058  5.601 ± 0.074  3.143 ± 0.065  5.533 ± 0.077  1.285 ± 0.034  5.999 ± 0.074  4.747 ± 0.062  7.143 ± 0.083  2.311 ± 0.042  3.625 ± 0.061  3.184 ± 0.057  2.31 ± 0.046   3.794 ± 0.058  5.507 ± 0.068  4.354 ± 0.061  5.085 ± 0.068  0.61 ± 0.021   2.905 ± 0.046  0.001 ± 0.001
Lys  5.311 ± 0.063  0.705 ± 0.026  3.87 ± 0.057   6.375 ± 0.085  2.01 ± 0.041   4.343 ± 0.063  1.062 ± 0.03   5.254 ± 0.062  5.911 ± 0.075  5.323 ± 0.08   2.251 ± 0.042  3.756 ± 0.062  1.939 ± 0.042  2.069 ± 0.042  2.84 ± 0.051   3.666 ± 0.054  3.834 ± 0.055  4.316 ± 0.062  0.593 ± 0.022  2.906 ± 0.053  0.0 ± 0.0
Leu  6.685 ± 0.073  1.245 ± 0.036  4.73 ± 0.065   6.366 ± 0.075  3.73 ± 0.069   5.923 ± 0.076  1.402 ± 0.036  6.289 ± 0.083  6.144 ± 0.066  7.728 ± 0.089  2.579 ± 0.049  4.067 ± 0.063  3.216 ± 0.052  2.625 ± 0.045  3.389 ± 0.061  6.135 ± 0.077  4.634 ± 0.062  5.475 ± 0.067  0.697 ± 0.024  3.311 ± 0.051  0.0 ± 0.0
Met  2.458 ± 0.048  0.299 ± 0.015  1.945 ± 0.036  2.532 ± 0.05   1.14 ± 0.029   2.189 ± 0.043  0.487 ± 0.019  2.371 ± 0.048  2.735 ± 0.044  2.752 ± 0.056  0.995 ± 0.025  1.719 ± 0.032  1.224 ± 0.035  0.984 ± 0.028  1.111 ± 0.028  1.827 ± 0.038  1.703 ± 0.04   2.007 ± 0.04   0.195 ± 0.013  0.902 ± 0.027  0.0 ± 0.0
Asn  3.298 ± 0.06   0.64 ± 0.023   2.229 ± 0.042  3.147 ± 0.053  1.806 ± 0.037  3.739 ± 0.059  0.786 ± 0.023  3.777 ± 0.059  2.823 ± 0.042  3.899 ± 0.063  1.531 ± 0.034  2.088 ± 0.043  2.185 ± 0.04   1.519 ± 0.035  2.283 ± 0.046  2.74 ± 0.044   2.488 ± 0.05   2.965 ± 0.045  0.472 ± 0.02   2.035 ± 0.042  0.0 ± 0.0
Pro  2.543 ± 0.051  0.432 ± 0.019  2.256 ± 0.042  3.16 ± 0.055   1.593 ± 0.041  2.334 ± 0.039  0.582 ± 0.02   2.036 ± 0.04   1.797 ± 0.032  2.551 ± 0.046  0.887 ± 0.025  1.293 ± 0.03   0.736 ± 0.025  0.881 ± 0.027  0.963 ± 0.032  1.706 ± 0.037  1.492 ± 0.037  3.024 ± 0.055  0.326 ± 0.016  1.415 ± 0.032  0.0 ± 0.0
Gln  2.247 ± 0.039  0.287 ± 0.016  1.519 ± 0.035  2.265 ± 0.04   1.115 ± 0.03   1.832 ± 0.045  0.418 ± 0.017  2.422 ± 0.045  2.353 ± 0.039  2.53 ± 0.048   1.045 ± 0.03   1.58 ± 0.037   0.794 ± 0.025  0.981 ± 0.03   1.092 ± 0.035  1.551 ± 0.035  1.518 ± 0.038  1.814 ± 0.033  0.278 ± 0.014  1.304 ± 0.034  0.0 ± 0.0
Arg  2.419 ± 0.04   0.522 ± 0.02   2.173 ± 0.036  3.711 ± 0.062  1.855 ± 0.037  2.394 ± 0.042  0.739 ± 0.025  3.643 ± 0.052  3.459 ± 0.061  3.686 ± 0.056  1.527 ± 0.036  2.342 ± 0.042  1.264 ± 0.037  1.411 ± 0.035  1.877 ± 0.041  2.112 ± 0.041  2.293 ± 0.041  2.406 ± 0.043  0.349 ± 0.016  1.915 ± 0.035  0.0 ± 0.0
Ser  4.401 ± 0.064  0.793 ± 0.026  3.495 ± 0.062  4.255 ± 0.058  2.818 ± 0.048  5.209 ± 0.082  0.998 ± 0.029  4.794 ± 0.067  3.506 ± 0.06   5.336 ± 0.07   1.922 ± 0.036  2.554 ± 0.051  1.837 ± 0.039  1.7 ± 0.036    2.84 ± 0.046   3.813 ± 0.073  2.951 ± 0.053  4.231 ± 0.061  0.582 ± 0.021  2.655 ± 0.053  0.0 ± 0.0
Thr  4.556 ± 0.093  0.573 ± 0.022  3.289 ± 0.065  3.92 ± 0.057   2.108 ± 0.045  4.786 ± 0.074  0.776 ± 0.025  4.123 ± 0.056  2.977 ± 0.053  4.387 ± 0.059  1.472 ± 0.036  2.164 ± 0.039  2.073 ± 0.045  1.314 ± 0.031  1.903 ± 0.037  3.0 ± 0.057    2.864 ± 0.059  4.253 ± 0.068  0.462 ± 0.02   2.109 ± 0.052  0.0 ± 0.0
Val  4.833 ± 0.062  1.021 ± 0.031  3.693 ± 0.061  4.611 ± 0.062  3.113 ± 0.058  4.077 ± 0.061  1.029 ± 0.03   5.365 ± 0.067  4.562 ± 0.057  6.494 ± 0.079  2.115 ± 0.043  3.015 ± 0.047  2.413 ± 0.05   1.766 ± 0.035  2.786 ± 0.049  4.702 ± 0.075  3.912 ± 0.075  4.574 ± 0.072  0.574 ± 0.019  2.615 ± 0.045  0.0 ± 0.0
Trp  0.548 ± 0.021  0.146 ± 0.01   0.539 ± 0.02   0.653 ± 0.027  0.374 ± 0.017  0.655 ± 0.024  0.177 ± 0.013  0.694 ± 0.022  0.65 ± 0.023   0.771 ± 0.024  0.306 ± 0.015  0.564 ± 0.019  0.226 ± 0.014  0.319 ± 0.015  0.32 ± 0.016   0.522 ± 0.023  0.471 ± 0.024  0.496 ± 0.02   0.133 ± 0.013  0.363 ± 0.014  0.0 ± 0.0
Tyr  2.814 ± 0.054  0.586 ± 0.022  2.567 ± 0.056  3.024 ± 0.053  1.954 ± 0.035  2.968 ± 0.045  0.803 ± 0.027  3.067 ± 0.053  2.435 ± 0.044  3.696 ± 0.058  1.184 ± 0.028  1.969 ± 0.043  1.407 ± 0.029  1.362 ± 0.034  2.081 ± 0.044  2.585 ± 0.051  2.173 ± 0.045  2.691 ± 0.043  0.395 ± 0.017  1.966 ± 0.042  0.0 ± 0.0
Xaa  0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.001 ± 0.001  0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0      0.0 ± 0.0

Statistics based on 4003 proteins (1325591 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
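A protein-level bootstrap of this kind could look roughly like the sketch below. The source only states that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement and reporting the standard deviation of the replicate frequencies are assumptions, as are the names `pair_counts` and `bootstrap_error` and the toy input.

```python
import random
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency
    over n_boot resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    totals = [sum(c.values()) for c in per_protein]
    indices = range(len(per_protein))
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not individual dipeptides.
        sample = rng.choices(indices, k=len(per_protein))
        total = sum(totals[i] for i in sample)
        hits = sum(per_protein[i][dipeptide] for i in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    mean = sum(estimates) / n_boot
    variance = sum((e - mean) ** 2 for e in estimates) / (n_boot - 1)
    return mean, variance ** 0.5


toy = ["MKKLA", "MKAA", "AAKK", "LLMK"]  # hypothetical one-letter sequences
mean_mk, err_mk = bootstrap_error(toy, "MK")
print(round(mean_mk, 1), round(err_mk, 1))
```

Resampling at the protein level keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than treating every dipeptide as an independent observation.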

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski