Amino acid dipepetide frequency for Methanothrix soehngenii (strain ATCC 5969 / DSM 3671 / JCM 10134 / NBRC 103675 / OCM 69 / GP-6) (Methanosaeta concilii)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.174AlaAla: 8.174 ± 0.132
1.058AlaCys: 1.058 ± 0.044
4.6AlaAsp: 4.6 ± 0.08
6.076AlaGlu: 6.076 ± 0.095
3.229AlaPhe: 3.229 ± 0.064
6.844AlaGly: 6.844 ± 0.103
1.322AlaHis: 1.322 ± 0.045
6.279AlaIle: 6.279 ± 0.099
4.112AlaLys: 4.112 ± 0.076
9.026AlaLeu: 9.026 ± 0.137
2.52AlaMet: 2.52 ± 0.063
2.611AlaAsn: 2.611 ± 0.072
2.639AlaPro: 2.639 ± 0.072
2.468AlaGln: 2.468 ± 0.062
5.405AlaArg: 5.405 ± 0.097
5.4AlaSer: 5.4 ± 0.093
3.735AlaThr: 3.735 ± 0.073
5.843AlaVal: 5.843 ± 0.09
1.041AlaTrp: 1.041 ± 0.046
2.331AlaTyr: 2.331 ± 0.062
0.0AlaXaa: 0.0 ± 0.0
Cys
0.865CysAla: 0.865 ± 0.034
0.254CysCys: 0.254 ± 0.021
0.681CysAsp: 0.681 ± 0.04
0.674CysGlu: 0.674 ± 0.031
0.461CysPhe: 0.461 ± 0.023
1.367CysGly: 1.367 ± 0.044
0.292CysHis: 0.292 ± 0.018
0.891CysIle: 0.891 ± 0.033
0.552CysLys: 0.552 ± 0.025
1.137CysLeu: 1.137 ± 0.042
0.328CysMet: 0.328 ± 0.025
0.46CysAsn: 0.46 ± 0.029
0.875CysPro: 0.875 ± 0.037
0.528CysGln: 0.528 ± 0.03
0.839CysArg: 0.839 ± 0.036
0.946CysSer: 0.946 ± 0.037
0.63CysThr: 0.63 ± 0.033
0.678CysVal: 0.678 ± 0.036
0.124CysTrp: 0.124 ± 0.011
0.428CysTyr: 0.428 ± 0.024
0.0CysXaa: 0.0 ± 0.0
Asp
4.286AspAla: 4.286 ± 0.079
0.749AspCys: 0.749 ± 0.028
2.948AspAsp: 2.948 ± 0.073
4.312AspGlu: 4.312 ± 0.077
2.346AspPhe: 2.346 ± 0.055
4.271AspGly: 4.271 ± 0.081
1.03AspHis: 1.03 ± 0.041
4.294AspIle: 4.294 ± 0.077
2.651AspLys: 2.651 ± 0.062
6.996AspLeu: 6.996 ± 0.109
1.633AspMet: 1.633 ± 0.048
1.64AspAsn: 1.64 ± 0.055
3.124AspPro: 3.124 ± 0.075
1.933AspGln: 1.933 ± 0.055
3.694AspArg: 3.694 ± 0.075
3.537AspSer: 3.537 ± 0.069
2.118AspThr: 2.118 ± 0.055
3.9AspVal: 3.9 ± 0.074
0.828AspTrp: 0.828 ± 0.036
2.002AspTyr: 2.002 ± 0.049
0.0AspXaa: 0.0 ± 0.0
Glu
6.466GluAla: 6.466 ± 0.122
0.72GluCys: 0.72 ± 0.032
4.054GluAsp: 4.054 ± 0.072
6.242GluGlu: 6.242 ± 0.106
2.211GluPhe: 2.211 ± 0.067
4.99GluGly: 4.99 ± 0.086
1.246GluHis: 1.246 ± 0.043
5.512GluIle: 5.512 ± 0.094
4.745GluLys: 4.745 ± 0.09
6.673GluLeu: 6.673 ± 0.103
2.444GluMet: 2.444 ± 0.057
2.735GluAsn: 2.735 ± 0.075
2.409GluPro: 2.409 ± 0.056
1.826GluGln: 1.826 ± 0.045
4.754GluArg: 4.754 ± 0.087
4.27GluSer: 4.27 ± 0.083
2.931GluThr: 2.931 ± 0.071
4.792GluVal: 4.792 ± 0.083
0.741GluTrp: 0.741 ± 0.031
2.015GluTyr: 2.015 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
2.872PheAla: 2.872 ± 0.059
0.539PheCys: 0.539 ± 0.027
2.331PheAsp: 2.331 ± 0.052
2.332PheGlu: 2.332 ± 0.053
1.494PhePhe: 1.494 ± 0.042
2.937PheGly: 2.937 ± 0.065
0.647PheHis: 0.647 ± 0.03
2.462PheIle: 2.462 ± 0.065
1.546PheLys: 1.546 ± 0.046
3.681PheLeu: 3.681 ± 0.082
0.932PheMet: 0.932 ± 0.039
1.332PheAsn: 1.332 ± 0.046
1.405PhePro: 1.405 ± 0.037
1.161PheGln: 1.161 ± 0.04
2.03PheArg: 2.03 ± 0.051
2.821PheSer: 2.821 ± 0.075
1.686PheThr: 1.686 ± 0.045
2.445PheVal: 2.445 ± 0.058
0.44PheTrp: 0.44 ± 0.021
1.247PheTyr: 1.247 ± 0.04
0.0PheXaa: 0.0 ± 0.0
Gly
5.697GlyAla: 5.697 ± 0.102
1.151GlyCys: 1.151 ± 0.045
4.033GlyAsp: 4.033 ± 0.083
5.213GlyGlu: 5.213 ± 0.095
3.063GlyPhe: 3.063 ± 0.063
5.513GlyGly: 5.513 ± 0.123
1.353GlyHis: 1.353 ± 0.04
5.787GlyIle: 5.787 ± 0.096
4.594GlyLys: 4.594 ± 0.075
7.326GlyLeu: 7.326 ± 0.113
2.51GlyMet: 2.51 ± 0.063
2.7GlyAsn: 2.7 ± 0.081
2.539GlyPro: 2.539 ± 0.064
2.237GlyGln: 2.237 ± 0.052
4.807GlyArg: 4.807 ± 0.089
5.621GlySer: 5.621 ± 0.105
3.846GlyThr: 3.846 ± 0.076
4.804GlyVal: 4.804 ± 0.081
1.073GlyTrp: 1.073 ± 0.045
2.968GlyTyr: 2.968 ± 0.073
0.0GlyXaa: 0.0 ± 0.0
His
1.135HisAla: 1.135 ± 0.043
0.299HisCys: 0.299 ± 0.019
0.927HisAsp: 0.927 ± 0.038
1.027HisGlu: 1.027 ± 0.039
0.658HisPhe: 0.658 ± 0.028
1.313HisGly: 1.313 ± 0.042
0.429HisHis: 0.429 ± 0.025
1.218HisIle: 1.218 ± 0.046
0.824HisLys: 0.824 ± 0.032
1.926HisLeu: 1.926 ± 0.053
0.439HisMet: 0.439 ± 0.024
0.689HisAsn: 0.689 ± 0.03
1.161HisPro: 1.161 ± 0.046
0.566HisGln: 0.566 ± 0.025
1.052HisArg: 1.052 ± 0.039
1.09HisSer: 1.09 ± 0.039
0.708HisThr: 0.708 ± 0.03
1.01HisVal: 1.01 ± 0.033
0.203HisTrp: 0.203 ± 0.018
0.554HisTyr: 0.554 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
6.217IleAla: 6.217 ± 0.108
1.067IleCys: 1.067 ± 0.039
4.639IleAsp: 4.639 ± 0.081
5.277IleGlu: 5.277 ± 0.093
2.854IlePhe: 2.854 ± 0.063
5.011IleGly: 5.011 ± 0.075
1.124IleHis: 1.124 ± 0.038
4.79IleIle: 4.79 ± 0.09
4.009IleLys: 4.009 ± 0.081
7.009IleLeu: 7.009 ± 0.101
1.867IleMet: 1.867 ± 0.048
2.628IleAsn: 2.628 ± 0.063
3.196IlePro: 3.196 ± 0.071
2.007IleGln: 2.007 ± 0.053
3.897IleArg: 3.897 ± 0.077
5.614IleSer: 5.614 ± 0.102
3.387IleThr: 3.387 ± 0.068
4.501IleVal: 4.501 ± 0.086
0.763IleTrp: 0.763 ± 0.037
2.367IleTyr: 2.367 ± 0.054
0.0IleXaa: 0.0 ± 0.0
Lys
4.634LysAla: 4.634 ± 0.091
0.534LysCys: 0.534 ± 0.027
3.38LysAsp: 3.38 ± 0.066
4.442LysGlu: 4.442 ± 0.079
1.325LysPhe: 1.325 ± 0.041
4.219LysGly: 4.219 ± 0.075
0.732LysHis: 0.732 ± 0.03
3.996LysIle: 3.996 ± 0.076
3.565LysLys: 3.565 ± 0.074
4.168LysLeu: 4.168 ± 0.085
1.715LysMet: 1.715 ± 0.055
2.224LysAsn: 2.224 ± 0.057
2.035LysPro: 2.035 ± 0.05
1.161LysGln: 1.161 ± 0.04
3.27LysArg: 3.27 ± 0.069
3.572LysSer: 3.572 ± 0.065
2.752LysThr: 2.752 ± 0.079
3.303LysVal: 3.303 ± 0.073
0.513LysTrp: 0.513 ± 0.026
1.517LysTyr: 1.517 ± 0.054
0.0LysXaa: 0.0 ± 0.0
Leu
9.686LeuAla: 9.686 ± 0.136
1.341LeuCys: 1.341 ± 0.041
6.012LeuAsp: 6.012 ± 0.101
7.172LeuGlu: 7.172 ± 0.096
3.561LeuPhe: 3.561 ± 0.092
7.504LeuGly: 7.504 ± 0.114
1.561LeuHis: 1.561 ± 0.046
6.689LeuIle: 6.689 ± 0.123
5.718LeuLys: 5.718 ± 0.096
9.527LeuLeu: 9.527 ± 0.169
2.701LeuMet: 2.701 ± 0.066
3.65LeuAsn: 3.65 ± 0.092
4.284LeuPro: 4.284 ± 0.077
2.987LeuGln: 2.987 ± 0.061
5.196LeuArg: 5.196 ± 0.079
6.886LeuSer: 6.886 ± 0.105
4.065LeuThr: 4.065 ± 0.07
6.628LeuVal: 6.628 ± 0.097
1.017LeuTrp: 1.017 ± 0.05
2.591LeuTyr: 2.591 ± 0.061
0.0LeuXaa: 0.0 ± 0.0
Met
2.802MetAla: 2.802 ± 0.064
0.236MetCys: 0.236 ± 0.017
1.895MetAsp: 1.895 ± 0.049
1.973MetGlu: 1.973 ± 0.049
0.664MetPhe: 0.664 ± 0.032
2.276MetGly: 2.276 ± 0.062
0.482MetHis: 0.482 ± 0.026
2.179MetIle: 2.179 ± 0.06
1.891MetLys: 1.891 ± 0.049
2.661MetLeu: 2.661 ± 0.061
0.809MetMet: 0.809 ± 0.036
1.225MetAsn: 1.225 ± 0.042
1.328MetPro: 1.328 ± 0.044
0.838MetGln: 0.838 ± 0.032
1.638MetArg: 1.638 ± 0.039
1.583MetSer: 1.583 ± 0.048
1.386MetThr: 1.386 ± 0.043
1.972MetVal: 1.972 ± 0.045
0.209MetTrp: 0.209 ± 0.017
0.545MetTyr: 0.545 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
2.912AsnAla: 2.912 ± 0.061
0.539AsnCys: 0.539 ± 0.026
1.721AsnAsp: 1.721 ± 0.056
2.277AsnGlu: 2.277 ± 0.058
1.214AsnPhe: 1.214 ± 0.047
2.509AsnGly: 2.509 ± 0.059
0.658AsnHis: 0.658 ± 0.039
2.68AsnIle: 2.68 ± 0.071
1.715AsnLys: 1.715 ± 0.051
3.749AsnLeu: 3.749 ± 0.074
0.923AsnMet: 0.923 ± 0.036
1.331AsnAsn: 1.331 ± 0.063
2.159AsnPro: 2.159 ± 0.069
1.224AsnGln: 1.224 ± 0.044
2.12AsnArg: 2.12 ± 0.049
2.326AsnSer: 2.326 ± 0.066
1.485AsnThr: 1.485 ± 0.047
2.253AsnVal: 2.253 ± 0.058
0.495AsnTrp: 0.495 ± 0.026
1.236AsnTyr: 1.236 ± 0.045
0.0AsnXaa: 0.0 ± 0.0
Pro
3.374ProAla: 3.374 ± 0.084
0.529ProCys: 0.529 ± 0.026
2.927ProAsp: 2.927 ± 0.066
3.848ProGlu: 3.848 ± 0.085
1.646ProPhe: 1.646 ± 0.047
3.751ProGly: 3.751 ± 0.076
0.783ProHis: 0.783 ± 0.029
2.498ProIle: 2.498 ± 0.058
1.96ProLys: 1.96 ± 0.049
4.145ProLeu: 4.145 ± 0.079
0.988ProMet: 0.988 ± 0.03
1.285ProAsn: 1.285 ± 0.046
1.708ProPro: 1.708 ± 0.05
1.466ProGln: 1.466 ± 0.054
2.062ProArg: 2.062 ± 0.051
2.81ProSer: 2.81 ± 0.074
1.783ProThr: 1.783 ± 0.054
3.181ProVal: 3.181 ± 0.072
0.495ProTrp: 0.495 ± 0.026
1.308ProTyr: 1.308 ± 0.043
0.0ProXaa: 0.0 ± 0.0
Gln
2.745GlnAla: 2.745 ± 0.063
0.349GlnCys: 0.349 ± 0.026
1.667GlnAsp: 1.667 ± 0.051
2.434GlnGlu: 2.434 ± 0.053
0.964GlnPhe: 0.964 ± 0.041
2.225GlnGly: 2.225 ± 0.055
0.41GlnHis: 0.41 ± 0.019
2.482GlnIle: 2.482 ± 0.057
1.878GlnLys: 1.878 ± 0.053
2.399GlnLeu: 2.399 ± 0.059
1.038GlnMet: 1.038 ± 0.039
1.141GlnAsn: 1.141 ± 0.04
1.136GlnPro: 1.136 ± 0.039
0.886GlnGln: 0.886 ± 0.046
1.837GlnArg: 1.837 ± 0.044
2.018GlnSer: 2.018 ± 0.06
1.524GlnThr: 1.524 ± 0.052
1.999GlnVal: 1.999 ± 0.049
0.35GlnTrp: 0.35 ± 0.036
0.76GlnTyr: 0.76 ± 0.032
0.0GlnXaa: 0.0 ± 0.0
Arg
4.813ArgAla: 4.813 ± 0.083
0.76ArgCys: 0.76 ± 0.03
3.438ArgAsp: 3.438 ± 0.074
4.526ArgGlu: 4.526 ± 0.086
2.265ArgPhe: 2.265 ± 0.05
4.071ArgGly: 4.071 ± 0.069
0.993ArgHis: 0.993 ± 0.038
4.645ArgIle: 4.645 ± 0.071
3.055ArgLys: 3.055 ± 0.077
5.904ArgLeu: 5.904 ± 0.09
1.88ArgMet: 1.88 ± 0.041
2.007ArgAsn: 2.007 ± 0.055
2.336ArgPro: 2.336 ± 0.052
1.737ArgGln: 1.737 ± 0.049
3.586ArgArg: 3.586 ± 0.076
4.016ArgSer: 4.016 ± 0.082
2.545ArgThr: 2.545 ± 0.054
3.72ArgVal: 3.72 ± 0.085
0.65ArgTrp: 0.65 ± 0.028
2.083ArgTyr: 2.083 ± 0.053
0.0ArgXaa: 0.0 ± 0.0
Ser
5.152SerAla: 5.152 ± 0.102
0.912SerCys: 0.912 ± 0.037
3.782SerAsp: 3.782 ± 0.078
4.101SerGlu: 4.101 ± 0.073
2.76SerPhe: 2.76 ± 0.062
5.992SerGly: 5.992 ± 0.098
1.175SerHis: 1.175 ± 0.041
4.967SerIle: 4.967 ± 0.082
3.096SerLys: 3.096 ± 0.069
7.05SerLeu: 7.05 ± 0.118
1.906SerMet: 1.906 ± 0.053
2.305SerAsn: 2.305 ± 0.074
2.869SerPro: 2.869 ± 0.069
2.277SerGln: 2.277 ± 0.061
4.246SerArg: 4.246 ± 0.071
5.191SerSer: 5.191 ± 0.127
3.13SerThr: 3.13 ± 0.073
4.063SerVal: 4.063 ± 0.081
0.875SerTrp: 0.875 ± 0.048
2.348SerTyr: 2.348 ± 0.065
0.0SerXaa: 0.0 ± 0.0
Thr
3.906ThrAla: 3.906 ± 0.094
0.557ThrCys: 0.557 ± 0.027
2.533ThrAsp: 2.533 ± 0.07
2.807ThrGlu: 2.807 ± 0.055
1.607ThrPhe: 1.607 ± 0.053
4.406ThrGly: 4.406 ± 0.085
0.764ThrHis: 0.764 ± 0.033
3.395ThrIle: 3.395 ± 0.078
1.927ThrLys: 1.927 ± 0.056
4.363ThrLeu: 4.363 ± 0.074
1.13ThrMet: 1.13 ± 0.036
1.534ThrAsn: 1.534 ± 0.052
2.218ThrPro: 2.218 ± 0.059
1.094ThrGln: 1.094 ± 0.041
2.395ThrArg: 2.395 ± 0.059
3.073ThrSer: 3.073 ± 0.068
2.349ThrThr: 2.349 ± 0.067
3.209ThrVal: 3.209 ± 0.076
0.506ThrTrp: 0.506 ± 0.039
1.365ThrTyr: 1.365 ± 0.04
0.0ThrXaa: 0.0 ± 0.0
Val
5.86ValAla: 5.86 ± 0.099
0.855ValCys: 0.855 ± 0.036
4.058ValAsp: 4.058 ± 0.07
4.516ValGlu: 4.516 ± 0.091
2.393ValPhe: 2.393 ± 0.055
4.518ValGly: 4.518 ± 0.086
1.277ValHis: 1.277 ± 0.042
4.518ValIle: 4.518 ± 0.08
3.192ValLys: 3.192 ± 0.072
6.577ValLeu: 6.577 ± 0.11
1.783ValMet: 1.783 ± 0.049
2.211ValAsn: 2.211 ± 0.055
2.995ValPro: 2.995 ± 0.06
2.085ValGln: 2.085 ± 0.061
3.69ValArg: 3.69 ± 0.079
4.363ValSer: 4.363 ± 0.083
3.075ValThr: 3.075 ± 0.072
4.996ValVal: 4.996 ± 0.098
0.622ValTrp: 0.622 ± 0.034
2.058ValTyr: 2.058 ± 0.055
0.0ValXaa: 0.0 ± 0.0
Trp
0.755TrpAla: 0.755 ± 0.033
0.146TrpCys: 0.146 ± 0.014
0.692TrpAsp: 0.692 ± 0.035
0.707TrpGlu: 0.707 ± 0.032
0.427TrpPhe: 0.427 ± 0.026
0.738TrpGly: 0.738 ± 0.034
0.247TrpHis: 0.247 ± 0.02
0.835TrpIle: 0.835 ± 0.036
0.682TrpLys: 0.682 ± 0.028
1.263TrpLeu: 1.263 ± 0.057
0.38TrpMet: 0.38 ± 0.021
0.555TrpAsn: 0.555 ± 0.031
0.417TrpPro: 0.417 ± 0.022
0.495TrpGln: 0.495 ± 0.03
0.68TrpArg: 0.68 ± 0.031
0.798TrpSer: 0.798 ± 0.035
0.613TrpThr: 0.613 ± 0.065
0.578TrpVal: 0.578 ± 0.027
0.202TrpTrp: 0.202 ± 0.02
0.347TrpTyr: 0.347 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.361TyrAla: 2.361 ± 0.057
0.415TyrCys: 0.415 ± 0.029
1.974TyrAsp: 1.974 ± 0.07
1.82TyrGlu: 1.82 ± 0.045
1.201TyrPhe: 1.201 ± 0.035
2.378TyrGly: 2.378 ± 0.064
0.7TyrHis: 0.7 ± 0.028
2.107TyrIle: 2.107 ± 0.054
1.346TyrLys: 1.346 ± 0.044
3.193TyrLeu: 3.193 ± 0.069
0.664TyrMet: 0.664 ± 0.029
1.269TyrAsn: 1.269 ± 0.049
1.714TyrPro: 1.714 ± 0.047
1.266TyrGln: 1.266 ± 0.043
1.83TyrArg: 1.83 ± 0.043
2.265TyrSer: 2.265 ± 0.064
1.391TyrThr: 1.391 ± 0.047
1.768TyrVal: 1.768 ± 0.043
0.409TyrTrp: 0.409 ± 0.023
1.195TyrTyr: 1.195 ± 0.046
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2791 proteins (821985 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski