Amino acid dipeptide frequency for Clostridium autoethanogenum DSM 10061

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
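As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is only a minimal example under assumptions: the function name, the one-letter toy sequences, and the choice of overlapping pairs that never span two proteins are not taken from this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs (0,1), (1,2), ...; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input in one-letter code, not real proteome data.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The frequencies of all observed pairs add up to 1000 per mille, which is a quick sanity check for any implementation.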

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
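The uniform expectation and the over/under-representation labels can be sketched as follows. This is an assumption-laden example: the page does not state the exact rule used for the red and blue marking, so a plain comparison against the uniform expectation is used here, and the function names are hypothetical. The two sample values are taken from the table on this page.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa


def uniform_expectation_permille(n_residues=N_STANDARD):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(observed_permille, expected_permille):
    """Label a value relative to the expectation (assumed rule, no tolerance band)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


expected = uniform_expectation_permille()  # 1000 / 400 = 2.5 per mille = 0.25%
sample = {"IleIle": 9.064, "TrpTrp": 0.076}  # values from the table
labels = {name: classify(value, expected) for name, value in sample.items()}
print(expected, labels)
```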

All values are presented in per mille (‰); multiply them by 10^-3 to obtain fractions.
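For example, the unit conversion can be written as a short sketch (the helper names are hypothetical; the AlaAla value comes from the table):

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 4.863  # AlaAla frequency in per mille, from the table
fraction = permille_to_fraction(ala_ala)  # about 0.004863
percent = permille_to_percent(ala_ala)    # about 0.4863 %
print(fraction, percent)
```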

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.863 ± 0.092
AlaCys: 0.806 ± 0.029
AlaAsp: 2.813 ± 0.043
AlaGlu: 3.37 ± 0.064
AlaPhe: 2.554 ± 0.051
AlaGly: 4.083 ± 0.071
AlaHis: 0.807 ± 0.025
AlaIle: 5.269 ± 0.062
AlaLys: 4.655 ± 0.063
AlaLeu: 5.825 ± 0.084
AlaMet: 1.524 ± 0.034
AlaAsn: 2.638 ± 0.052
AlaPro: 1.517 ± 0.04
AlaGln: 1.565 ± 0.039
AlaArg: 1.848 ± 0.043
AlaSer: 3.834 ± 0.072
AlaThr: 2.401 ± 0.068
AlaVal: 4.851 ± 0.083
AlaTrp: 0.353 ± 0.019
AlaTyr: 2.132 ± 0.042
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.718 ± 0.026
CysCys: 0.253 ± 0.016
CysAsp: 0.759 ± 0.022
CysGlu: 0.824 ± 0.023
CysPhe: 0.608 ± 0.022
CysGly: 1.28 ± 0.037
CysHis: 0.26 ± 0.014
CysIle: 1.41 ± 0.039
CysLys: 1.22 ± 0.038
CysLeu: 1.034 ± 0.033
CysMet: 0.382 ± 0.019
CysAsn: 0.802 ± 0.027
CysPro: 0.572 ± 0.027
CysGln: 0.313 ± 0.016
CysArg: 0.445 ± 0.021
CysSer: 1.001 ± 0.026
CysThr: 0.709 ± 0.028
CysVal: 0.79 ± 0.026
CysTrp: 0.085 ± 0.008
CysTyr: 0.492 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.85 ± 0.058
AspCys: 0.676 ± 0.022
AspAsp: 2.601 ± 0.049
AspGlu: 4.283 ± 0.065
AspPhe: 2.619 ± 0.048
AspGly: 3.229 ± 0.059
AspHis: 0.542 ± 0.02
AspIle: 6.04 ± 0.076
AspLys: 5.595 ± 0.074
AspLeu: 4.564 ± 0.061
AspMet: 1.681 ± 0.036
AspAsn: 3.188 ± 0.06
AspPro: 1.498 ± 0.036
AspGln: 0.89 ± 0.029
AspArg: 1.81 ± 0.037
AspSer: 3.289 ± 0.057
AspThr: 2.742 ± 0.049
AspVal: 3.702 ± 0.063
AspTrp: 0.373 ± 0.021
AspTyr: 2.437 ± 0.047
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.694 ± 0.06
GluCys: 0.779 ± 0.025
GluAsp: 4.167 ± 0.062
GluGlu: 5.428 ± 0.087
GluPhe: 2.53 ± 0.048
GluGly: 3.688 ± 0.051
GluHis: 0.85 ± 0.028
GluIle: 6.065 ± 0.086
GluLys: 7.385 ± 0.095
GluLeu: 5.981 ± 0.08
GluMet: 1.722 ± 0.038
GluAsn: 4.954 ± 0.066
GluPro: 1.304 ± 0.033
GluGln: 1.715 ± 0.043
GluArg: 2.215 ± 0.049
GluSer: 3.488 ± 0.057
GluThr: 2.722 ± 0.044
GluVal: 4.32 ± 0.073
GluTrp: 0.362 ± 0.016
GluTyr: 2.597 ± 0.053
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.313 ± 0.051
PheCys: 0.625 ± 0.019
PheAsp: 2.361 ± 0.045
PheGlu: 2.429 ± 0.047
PhePhe: 1.928 ± 0.055
PheGly: 2.759 ± 0.048
PheHis: 0.611 ± 0.025
PheIle: 4.522 ± 0.075
PheLys: 3.975 ± 0.059
PheLeu: 3.808 ± 0.063
PheMet: 1.23 ± 0.033
PheAsn: 2.803 ± 0.053
PhePro: 1.335 ± 0.03
PheGln: 1.099 ± 0.029
PheArg: 1.234 ± 0.029
PheSer: 3.307 ± 0.059
PheThr: 2.479 ± 0.048
PheVal: 2.669 ± 0.051
PheTrp: 0.311 ± 0.017
PheTyr: 1.772 ± 0.044
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.108 ± 0.074
GlyCys: 1.059 ± 0.034
GlyAsp: 3.208 ± 0.06
GlyGlu: 3.878 ± 0.057
GlyPhe: 2.998 ± 0.051
GlyGly: 4.493 ± 0.075
GlyHis: 0.962 ± 0.03
GlyIle: 6.893 ± 0.079
GlyLys: 6.027 ± 0.075
GlyLeu: 5.099 ± 0.073
GlyMet: 1.948 ± 0.041
GlyAsn: 3.488 ± 0.061
GlyPro: 1.328 ± 0.057
GlyGln: 1.582 ± 0.039
GlyArg: 2.224 ± 0.043
GlySer: 3.988 ± 0.074
GlyThr: 3.73 ± 0.081
GlyVal: 4.48 ± 0.082
GlyTrp: 0.522 ± 0.022
GlyTyr: 2.982 ± 0.046
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.677 ± 0.021
HisCys: 0.246 ± 0.013
HisAsp: 0.641 ± 0.024
HisGlu: 0.839 ± 0.029
HisPhe: 0.581 ± 0.02
HisGly: 0.932 ± 0.025
HisHis: 0.295 ± 0.017
HisIle: 1.335 ± 0.034
HisLys: 1.09 ± 0.029
HisLeu: 1.139 ± 0.029
HisMet: 0.419 ± 0.018
HisAsn: 0.784 ± 0.026
HisPro: 0.63 ± 0.023
HisGln: 0.321 ± 0.018
HisArg: 0.495 ± 0.018
HisSer: 0.908 ± 0.031
HisThr: 0.706 ± 0.025
HisVal: 0.803 ± 0.028
HisTrp: 0.1 ± 0.008
HisTyr: 0.549 ± 0.02
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.746 ± 0.076
IleCys: 1.491 ± 0.037
IleAsp: 5.446 ± 0.077
IleGlu: 6.228 ± 0.091
IlePhe: 4.298 ± 0.072
IleGly: 6.363 ± 0.082
IleHis: 1.23 ± 0.031
IleIle: 9.064 ± 0.119
IleLys: 8.635 ± 0.1
IleLeu: 8.951 ± 0.111
IleMet: 2.492 ± 0.048
IleAsn: 5.861 ± 0.067
IlePro: 3.429 ± 0.045
IleGln: 2.224 ± 0.043
IleArg: 3.027 ± 0.052
IleSer: 7.257 ± 0.09
IleThr: 4.941 ± 0.071
IleVal: 6.34 ± 0.083
IleTrp: 0.572 ± 0.023
IleTyr: 3.514 ± 0.058
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.836 ± 0.069
LysCys: 1.118 ± 0.039
LysAsp: 5.91 ± 0.079
LysGlu: 7.171 ± 0.093
LysPhe: 3.583 ± 0.053
LysGly: 5.074 ± 0.066
LysHis: 1.113 ± 0.03
LysIle: 8.618 ± 0.091
LysLys: 8.785 ± 0.113
LysLeu: 8.126 ± 0.071
LysMet: 2.657 ± 0.046
LysAsn: 7.12 ± 0.092
LysPro: 2.165 ± 0.045
LysGln: 2.383 ± 0.047
LysArg: 2.992 ± 0.054
LysSer: 5.99 ± 0.073
LysThr: 4.033 ± 0.053
LysVal: 6.4 ± 0.074
LysTrp: 0.678 ± 0.027
LysTyr: 4.345 ± 0.061
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.045 ± 0.066
LeuCys: 1.302 ± 0.039
LeuAsp: 4.751 ± 0.067
LeuGlu: 5.283 ± 0.081
LeuPhe: 3.641 ± 0.063
LeuGly: 5.942 ± 0.075
LeuHis: 1.151 ± 0.029
LeuIle: 7.836 ± 0.1
LeuLys: 9.144 ± 0.113
LeuLeu: 7.44 ± 0.098
LeuMet: 2.382 ± 0.048
LeuAsn: 6.0 ± 0.074
LeuPro: 2.873 ± 0.052
LeuGln: 2.254 ± 0.046
LeuArg: 2.901 ± 0.056
LeuSer: 6.6 ± 0.08
LeuThr: 4.35 ± 0.072
LeuVal: 5.18 ± 0.063
LeuTrp: 0.642 ± 0.023
LeuTyr: 3.21 ± 0.044
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.815 ± 0.047
MetCys: 0.351 ± 0.018
MetAsp: 1.728 ± 0.036
MetGlu: 2.008 ± 0.037
MetPhe: 1.039 ± 0.025
MetGly: 1.903 ± 0.041
MetHis: 0.37 ± 0.018
MetIle: 2.264 ± 0.047
MetLys: 2.671 ± 0.053
MetLeu: 2.443 ± 0.049
MetMet: 0.699 ± 0.025
MetAsn: 1.787 ± 0.039
MetPro: 0.961 ± 0.028
MetGln: 0.697 ± 0.025
MetArg: 0.857 ± 0.026
MetSer: 1.851 ± 0.043
MetThr: 1.282 ± 0.031
MetVal: 1.664 ± 0.04
MetTrp: 0.174 ± 0.013
MetTyr: 0.966 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.061 ± 0.06
AsnCys: 0.878 ± 0.03
AsnAsp: 2.807 ± 0.057
AsnGlu: 3.79 ± 0.057
AsnPhe: 2.783 ± 0.049
AsnGly: 3.574 ± 0.058
AsnHis: 0.736 ± 0.028
AsnIle: 7.019 ± 0.09
AsnLys: 5.974 ± 0.073
AsnLeu: 5.607 ± 0.066
AsnMet: 1.9 ± 0.037
AsnAsn: 3.836 ± 0.065
AsnPro: 2.1 ± 0.043
AsnGln: 1.426 ± 0.038
AsnArg: 2.012 ± 0.042
AsnSer: 4.441 ± 0.072
AsnThr: 3.043 ± 0.058
AsnVal: 4.065 ± 0.059
AsnTrp: 0.452 ± 0.02
AsnTyr: 2.681 ± 0.05
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.582 ± 0.042
ProCys: 0.402 ± 0.019
ProAsp: 1.692 ± 0.034
ProGlu: 2.262 ± 0.042
ProPhe: 1.419 ± 0.039
ProGly: 1.874 ± 0.044
ProHis: 0.484 ± 0.021
ProIle: 2.838 ± 0.052
ProLys: 2.321 ± 0.04
ProLeu: 2.485 ± 0.041
ProMet: 0.737 ± 0.025
ProAsn: 1.488 ± 0.043
ProPro: 0.664 ± 0.024
ProGln: 0.841 ± 0.026
ProArg: 0.84 ± 0.025
ProSer: 1.83 ± 0.041
ProThr: 1.52 ± 0.066
ProVal: 2.288 ± 0.044
ProTrp: 0.263 ± 0.016
ProTyr: 1.32 ± 0.035
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.418 ± 0.043
GlnCys: 0.334 ± 0.018
GlnAsp: 1.336 ± 0.033
GlnGlu: 1.541 ± 0.038
GlnPhe: 1.022 ± 0.029
GlnGly: 1.536 ± 0.038
GlnHis: 0.357 ± 0.016
GlnIle: 2.312 ± 0.046
GlnLys: 2.352 ± 0.045
GlnLeu: 2.176 ± 0.049
GlnMet: 0.741 ± 0.024
GlnAsn: 1.738 ± 0.039
GlnPro: 0.622 ± 0.023
GlnGln: 0.794 ± 0.031
GlnArg: 0.86 ± 0.027
GlnSer: 1.498 ± 0.037
GlnThr: 1.142 ± 0.031
GlnVal: 1.633 ± 0.035
GlnTrp: 0.2 ± 0.012
GlnTyr: 1.077 ± 0.028
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.849 ± 0.037
ArgCys: 0.442 ± 0.019
ArgAsp: 1.79 ± 0.038
ArgGlu: 2.625 ± 0.054
ArgPhe: 1.363 ± 0.03
ArgGly: 1.959 ± 0.042
ArgHis: 0.5 ± 0.02
ArgIle: 2.997 ± 0.051
ArgLys: 3.183 ± 0.052
ArgLeu: 2.654 ± 0.048
ArgMet: 0.896 ± 0.031
ArgAsn: 2.083 ± 0.042
ArgPro: 0.821 ± 0.024
ArgGln: 0.953 ± 0.028
ArgArg: 1.353 ± 0.042
ArgSer: 1.684 ± 0.041
ArgThr: 1.564 ± 0.036
ArgVal: 2.036 ± 0.044
ArgTrp: 0.246 ± 0.015
ArgTyr: 1.352 ± 0.04
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.577 ± 0.06
SerCys: 0.855 ± 0.027
SerAsp: 3.48 ± 0.063
SerGlu: 4.114 ± 0.058
SerPhe: 3.116 ± 0.05
SerGly: 4.86 ± 0.089
SerHis: 0.95 ± 0.031
SerIle: 6.808 ± 0.084
SerLys: 6.321 ± 0.076
SerLeu: 5.905 ± 0.075
SerMet: 1.862 ± 0.039
SerAsn: 4.042 ± 0.064
SerPro: 1.878 ± 0.037
SerGln: 1.9 ± 0.044
SerArg: 2.121 ± 0.049
SerSer: 5.091 ± 0.088
SerThr: 3.519 ± 0.065
SerVal: 4.206 ± 0.064
SerTrp: 0.476 ± 0.02
SerTyr: 2.668 ± 0.051
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.397 ± 0.072
ThrCys: 0.633 ± 0.025
ThrAsp: 2.48 ± 0.048
ThrGlu: 2.7 ± 0.05
ThrPhe: 2.197 ± 0.038
ThrGly: 4.007 ± 0.133
ThrHis: 0.708 ± 0.024
ThrIle: 4.686 ± 0.077
ThrLys: 3.655 ± 0.057
ThrLeu: 4.606 ± 0.061
ThrMet: 1.191 ± 0.031
ThrAsn: 2.61 ± 0.048
ThrPro: 1.807 ± 0.039
ThrGln: 1.163 ± 0.031
ThrArg: 1.426 ± 0.035
ThrSer: 3.548 ± 0.068
ThrThr: 2.706 ± 0.056
ThrVal: 3.814 ± 0.095
ThrTrp: 0.368 ± 0.019
ThrTyr: 1.841 ± 0.043
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.759 ± 0.064
ValCys: 1.0 ± 0.029
ValAsp: 3.896 ± 0.058
ValGlu: 4.271 ± 0.065
ValPhe: 2.952 ± 0.053
ValGly: 4.302 ± 0.066
ValHis: 0.873 ± 0.029
ValIle: 6.419 ± 0.078
ValLys: 5.879 ± 0.065
ValLeu: 6.009 ± 0.076
ValMet: 1.712 ± 0.04
ValAsn: 3.83 ± 0.061
ValPro: 2.232 ± 0.046
ValGln: 1.605 ± 0.036
ValArg: 1.99 ± 0.039
ValSer: 4.879 ± 0.076
ValThr: 3.608 ± 0.087
ValVal: 4.727 ± 0.076
ValTrp: 0.4 ± 0.02
ValTyr: 2.497 ± 0.041
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.365 ± 0.019
TrpCys: 0.103 ± 0.011
TrpAsp: 0.374 ± 0.019
TrpGlu: 0.371 ± 0.02
TrpPhe: 0.32 ± 0.017
TrpGly: 0.488 ± 0.023
TrpHis: 0.128 ± 0.01
TrpIle: 0.749 ± 0.032
TrpLys: 0.604 ± 0.023
TrpLeu: 0.555 ± 0.022
TrpMet: 0.215 ± 0.013
TrpAsn: 0.502 ± 0.018
TrpPro: 0.182 ± 0.013
TrpGln: 0.213 ± 0.012
TrpArg: 0.254 ± 0.017
TrpSer: 0.458 ± 0.021
TrpThr: 0.323 ± 0.021
TrpVal: 0.416 ± 0.018
TrpTrp: 0.076 ± 0.008
TrpTyr: 0.251 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.033 ± 0.041
TyrCys: 0.605 ± 0.022
TyrAsp: 2.37 ± 0.052
TyrGlu: 2.552 ± 0.043
TyrPhe: 2.017 ± 0.044
TyrGly: 2.652 ± 0.045
TyrHis: 0.563 ± 0.022
TyrIle: 3.967 ± 0.064
TyrLys: 3.626 ± 0.066
TyrLeu: 3.519 ± 0.058
TyrMet: 1.139 ± 0.029
TyrAsn: 2.628 ± 0.048
TyrPro: 1.233 ± 0.032
TyrGln: 0.752 ± 0.026
TyrArg: 1.446 ± 0.038
TyrSer: 2.805 ± 0.048
TyrThr: 2.025 ± 0.052
TyrVal: 2.46 ± 0.045
TyrTrp: 0.295 ± 0.016
TyrTyr: 1.897 ± 0.044
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4013 proteins (1212424 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
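A protein-level bootstrap of this kind could look roughly like the sketch below. It rests on assumptions not confirmed by this page: proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the reported error is the standard deviation across replicates. The function names and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent pairs in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation (per mille) of one dipeptide frequency across bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy = ["MKKLA", "AKK", "GGKKA", "MLLAG"]
err = bootstrap_error(toy, "KK")
print(f"KK error: {err:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein together, so the estimate reflects variation between proteins.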

You can download the above dipeptide statistics (among other stats for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski