Amino acid dipeptide frequency for Methanomethylophilus alvus (strain Mx1201)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred uniformly at random in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
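As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides within each protein and reports them in per mille. It is not the database's own code: the function names, the use of one-letter codes (with `X` standing in for Xaa), and the assumption that over- and underrepresentation is judged against the uniform 2.5‰ baseline are all choices made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumption: input uses one-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Pairs are counted within each protein only, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille baseline."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy = ["AAAC", "ACX"]  # hypothetical toy sequences, not real proteome data
    for dp, f in sorted(dipeptide_per_mille(toy).items()):
        print(dp, round(f, 3), classify(f))
```

With the values from the table, `classify(8.155)` (AlaAla) gives "over" and `classify(0.359)` (CysCys) gives "under".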

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
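The unit conversion can be written out as a short Python sketch. The helper names are hypothetical and exist only for this example; the sample values are taken from this page.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    ala_ala = 8.155  # AlaAla frequency from the table, in per mille
    print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
    print(per_mille_to_percent(2.5))       # uniform expectation in percent
```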

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.155 ± 0.188
AlaCys: 1.214 ± 0.05
AlaAsp: 5.852 ± 0.113
AlaGlu: 6.064 ± 0.131
AlaPhe: 3.097 ± 0.079
AlaGly: 6.523 ± 0.125
AlaHis: 1.206 ± 0.048
AlaIle: 5.408 ± 0.121
AlaLys: 4.756 ± 0.123
AlaLeu: 7.028 ± 0.144
AlaMet: 2.89 ± 0.073
AlaAsn: 2.158 ± 0.07
AlaPro: 2.443 ± 0.074
AlaGln: 1.858 ± 0.073
AlaArg: 3.709 ± 0.098
AlaSer: 4.52 ± 0.108
AlaThr: 3.364 ± 0.082
AlaVal: 7.523 ± 0.14
AlaTrp: 0.575 ± 0.038
AlaTyr: 2.716 ± 0.074
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.247 ± 0.056
CysCys: 0.359 ± 0.034
CysAsp: 1.2 ± 0.05
CysGlu: 0.927 ± 0.039
CysPhe: 0.562 ± 0.032
CysGly: 2.026 ± 0.076
CysHis: 0.342 ± 0.035
CysIle: 1.049 ± 0.051
CysLys: 0.817 ± 0.039
CysLeu: 1.172 ± 0.059
CysMet: 0.518 ± 0.035
CysAsn: 0.511 ± 0.037
CysPro: 1.098 ± 0.057
CysGln: 0.41 ± 0.031
CysArg: 1.09 ± 0.052
CysSer: 1.054 ± 0.046
CysThr: 0.831 ± 0.04
CysVal: 1.043 ± 0.043
CysTrp: 0.147 ± 0.017
CysTyr: 0.52 ± 0.036
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.469 ± 0.118
AspCys: 0.964 ± 0.05
AspAsp: 4.312 ± 0.105
AspGlu: 4.791 ± 0.101
AspPhe: 2.541 ± 0.073
AspGly: 6.286 ± 0.147
AspHis: 1.135 ± 0.045
AspIle: 5.418 ± 0.095
AspLys: 3.167 ± 0.083
AspLeu: 6.24 ± 0.136
AspMet: 2.72 ± 0.084
AspAsn: 2.093 ± 0.064
AspPro: 3.509 ± 0.081
AspGln: 1.219 ± 0.052
AspArg: 4.402 ± 0.106
AspSer: 3.914 ± 0.089
AspThr: 3.199 ± 0.086
AspVal: 5.636 ± 0.119
AspTrp: 0.683 ± 0.042
AspTyr: 2.435 ± 0.077
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.453 ± 0.132
GluCys: 1.084 ± 0.053
GluAsp: 4.776 ± 0.111
GluGlu: 5.571 ± 0.131
GluPhe: 2.307 ± 0.072
GluGly: 5.074 ± 0.112
GluHis: 1.166 ± 0.05
GluIle: 4.719 ± 0.109
GluLys: 5.131 ± 0.111
GluLeu: 5.013 ± 0.105
GluMet: 2.441 ± 0.07
GluAsn: 2.661 ± 0.078
GluPro: 2.019 ± 0.058
GluGln: 1.53 ± 0.07
GluArg: 3.798 ± 0.095
GluSer: 3.912 ± 0.096
GluThr: 3.454 ± 0.083
GluVal: 4.75 ± 0.115
GluTrp: 0.732 ± 0.039
GluTyr: 2.757 ± 0.08
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.523 ± 0.081
PheCys: 0.701 ± 0.04
PheAsp: 2.887 ± 0.085
PheGlu: 2.527 ± 0.082
PhePhe: 1.406 ± 0.063
PheGly: 3.315 ± 0.078
PheHis: 0.617 ± 0.035
PheIle: 2.329 ± 0.072
PheLys: 1.752 ± 0.062
PheLeu: 3.022 ± 0.082
PheMet: 1.3 ± 0.051
PheAsn: 1.257 ± 0.052
PhePro: 1.486 ± 0.054
PheGln: 0.77 ± 0.038
PheArg: 2.07 ± 0.074
PheSer: 2.696 ± 0.08
PheThr: 1.96 ± 0.072
PheVal: 3.163 ± 0.087
PheTrp: 0.314 ± 0.027
PheTyr: 1.272 ± 0.052
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.793 ± 0.108
GlyCys: 1.697 ± 0.07
GlyAsp: 4.982 ± 0.105
GlyGlu: 4.673 ± 0.1
GlyPhe: 3.167 ± 0.067
GlyGly: 5.832 ± 0.143
GlyHis: 1.589 ± 0.056
GlyIle: 6.142 ± 0.12
GlyLys: 5.848 ± 0.107
GlyLeu: 6.238 ± 0.135
GlyMet: 3.169 ± 0.08
GlyAsn: 3.077 ± 0.078
GlyPro: 2.482 ± 0.08
GlyGln: 1.665 ± 0.061
GlyArg: 4.764 ± 0.118
GlySer: 5.151 ± 0.132
GlyThr: 5.003 ± 0.124
GlyVal: 5.844 ± 0.113
GlyTrp: 0.862 ± 0.057
GlyTyr: 3.189 ± 0.084
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.119 ± 0.049
HisCys: 0.34 ± 0.031
HisAsp: 1.025 ± 0.052
HisGlu: 0.905 ± 0.044
HisPhe: 0.674 ± 0.039
HisGly: 1.455 ± 0.058
HisHis: 0.385 ± 0.053
HisIle: 1.164 ± 0.057
HisLys: 0.701 ± 0.035
HisLeu: 1.357 ± 0.054
HisMet: 0.607 ± 0.032
HisAsn: 0.577 ± 0.035
HisPro: 0.968 ± 0.045
HisGln: 0.342 ± 0.026
HisArg: 0.982 ± 0.046
HisSer: 0.972 ± 0.045
HisThr: 0.825 ± 0.041
HisVal: 1.357 ± 0.053
HisTrp: 0.163 ± 0.02
HisTyr: 0.581 ± 0.04
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.579 ± 0.119
IleCys: 1.223 ± 0.053
IleAsp: 4.62 ± 0.093
IleGlu: 4.322 ± 0.103
IlePhe: 2.27 ± 0.085
IleGly: 5.716 ± 0.11
IleHis: 1.178 ± 0.054
IleIle: 3.957 ± 0.109
IleLys: 3.093 ± 0.09
IleLeu: 5.587 ± 0.142
IleMet: 2.021 ± 0.07
IleAsn: 2.076 ± 0.064
IlePro: 3.338 ± 0.075
IleGln: 1.357 ± 0.058
IleArg: 3.827 ± 0.099
IleSer: 4.799 ± 0.097
IleThr: 3.468 ± 0.086
IleVal: 5.467 ± 0.114
IleTrp: 0.524 ± 0.033
IleTyr: 1.948 ± 0.069
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.424 ± 0.133
LysCys: 0.817 ± 0.046
LysAsp: 4.834 ± 0.099
LysGlu: 4.709 ± 0.112
LysPhe: 1.783 ± 0.063
LysGly: 4.475 ± 0.116
LysHis: 0.825 ± 0.046
LysIle: 3.745 ± 0.109
LysLys: 4.454 ± 0.131
LysLeu: 3.747 ± 0.105
LysMet: 1.879 ± 0.058
LysAsn: 2.274 ± 0.081
LysPro: 1.594 ± 0.061
LysGln: 1.353 ± 0.058
LysArg: 2.92 ± 0.095
LysSer: 2.91 ± 0.097
LysThr: 3.593 ± 0.088
LysVal: 4.53 ± 0.091
LysTrp: 0.579 ± 0.038
LysTyr: 2.062 ± 0.077
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.845 ± 0.144
LeuCys: 1.453 ± 0.057
LeuAsp: 5.757 ± 0.119
LeuGlu: 5.249 ± 0.115
LeuPhe: 3.13 ± 0.1
LeuGly: 6.492 ± 0.111
LeuHis: 1.121 ± 0.05
LeuIle: 4.807 ± 0.132
LeuLys: 4.634 ± 0.1
LeuLeu: 5.922 ± 0.152
LeuMet: 3.008 ± 0.072
LeuAsn: 2.733 ± 0.082
LeuPro: 3.065 ± 0.087
LeuGln: 1.628 ± 0.061
LeuArg: 4.615 ± 0.116
LeuSer: 5.905 ± 0.112
LeuThr: 4.124 ± 0.101
LeuVal: 5.73 ± 0.132
LeuTrp: 0.652 ± 0.041
LeuTyr: 2.557 ± 0.09
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.13 ± 0.079
MetCys: 0.609 ± 0.04
MetAsp: 2.659 ± 0.07
MetGlu: 2.382 ± 0.069
MetPhe: 1.359 ± 0.062
MetGly: 2.686 ± 0.081
MetHis: 0.556 ± 0.036
MetIle: 2.15 ± 0.067
MetLys: 2.15 ± 0.062
MetLeu: 2.598 ± 0.074
MetMet: 1.43 ± 0.059
MetAsn: 1.278 ± 0.048
MetPro: 1.306 ± 0.052
MetGln: 0.811 ± 0.043
MetArg: 1.81 ± 0.064
MetSer: 2.539 ± 0.074
MetThr: 2.16 ± 0.065
MetVal: 2.614 ± 0.07
MetTrp: 0.271 ± 0.025
MetTyr: 1.141 ± 0.059
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.773 ± 0.081
AsnCys: 0.52 ± 0.033
AsnAsp: 1.952 ± 0.075
AsnGlu: 1.942 ± 0.068
AsnPhe: 1.037 ± 0.05
AsnGly: 3.167 ± 0.093
AsnHis: 0.518 ± 0.038
AsnIle: 2.659 ± 0.078
AsnLys: 1.665 ± 0.075
AsnLeu: 2.722 ± 0.084
AsnMet: 1.235 ± 0.058
AsnAsn: 1.184 ± 0.065
AsnPro: 1.763 ± 0.062
AsnGln: 0.652 ± 0.042
AsnArg: 1.893 ± 0.07
AsnSer: 2.034 ± 0.069
AsnThr: 1.75 ± 0.071
AsnVal: 2.753 ± 0.082
AsnTrp: 0.302 ± 0.03
AsnTyr: 1.204 ± 0.051
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.973 ± 0.1
ProCys: 0.581 ± 0.035
ProAsp: 3.11 ± 0.085
ProGlu: 3.762 ± 0.101
ProPhe: 1.528 ± 0.053
ProGly: 3.014 ± 0.088
ProHis: 0.695 ± 0.037
ProIle: 2.18 ± 0.067
ProLys: 2.233 ± 0.076
ProLeu: 2.979 ± 0.081
ProMet: 1.162 ± 0.049
ProAsn: 1.149 ± 0.055
ProPro: 1.2 ± 0.053
ProGln: 0.913 ± 0.042
ProArg: 1.761 ± 0.062
ProSer: 2.588 ± 0.072
ProThr: 1.761 ± 0.061
ProVal: 3.417 ± 0.089
ProTrp: 0.346 ± 0.026
ProTyr: 1.492 ± 0.053
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.805 ± 0.056
GlnCys: 0.346 ± 0.027
GlnAsp: 1.214 ± 0.055
GlnGlu: 1.357 ± 0.055
GlnPhe: 0.862 ± 0.044
GlnGly: 1.402 ± 0.06
GlnHis: 0.357 ± 0.024
GlnIle: 1.594 ± 0.059
GlnLys: 1.555 ± 0.058
GlnLeu: 1.683 ± 0.062
GlnMet: 1.086 ± 0.045
GlnAsn: 0.893 ± 0.048
GlnPro: 0.66 ± 0.037
GlnGln: 0.624 ± 0.041
GlnArg: 1.198 ± 0.052
GlnSer: 1.188 ± 0.05
GlnThr: 1.202 ± 0.049
GlnVal: 1.386 ± 0.051
GlnTrp: 0.21 ± 0.023
GlnTyr: 0.958 ± 0.046
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.709 ± 0.094
ArgCys: 0.935 ± 0.053
ArgAsp: 3.641 ± 0.105
ArgGlu: 3.802 ± 0.096
ArgPhe: 2.244 ± 0.062
ArgGly: 3.69 ± 0.11
ArgHis: 1.037 ± 0.047
ArgIle: 4.057 ± 0.109
ArgLys: 3.684 ± 0.089
ArgLeu: 4.344 ± 0.088
ArgMet: 2.35 ± 0.074
ArgAsn: 2.227 ± 0.067
ArgPro: 2.038 ± 0.073
ArgGln: 1.337 ± 0.06
ArgArg: 3.462 ± 0.12
ArgSer: 3.489 ± 0.089
ArgThr: 2.865 ± 0.091
ArgVal: 3.228 ± 0.083
ArgTrp: 0.499 ± 0.032
ArgTyr: 2.321 ± 0.071
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.076 ± 0.113
SerCys: 0.911 ± 0.053
SerAsp: 4.791 ± 0.098
SerGlu: 4.583 ± 0.1
SerPhe: 2.396 ± 0.069
SerGly: 5.742 ± 0.138
SerHis: 0.921 ± 0.047
SerIle: 4.043 ± 0.09
SerLys: 3.917 ± 0.107
SerLeu: 5.033 ± 0.122
SerMet: 2.254 ± 0.067
SerAsn: 1.903 ± 0.075
SerPro: 2.262 ± 0.074
SerGln: 1.378 ± 0.054
SerArg: 3.24 ± 0.082
SerSer: 4.481 ± 0.124
SerThr: 3.002 ± 0.076
SerVal: 5.461 ± 0.141
SerTrp: 0.548 ± 0.037
SerTyr: 2.18 ± 0.076
SerXaa: 0.002 ± 0.002
Thr
ThrAla: 4.338 ± 0.098
ThrCys: 0.768 ± 0.037
ThrAsp: 3.772 ± 0.078
ThrGlu: 3.352 ± 0.082
ThrPhe: 2.001 ± 0.074
ThrGly: 4.734 ± 0.095
ThrHis: 0.827 ± 0.04
ThrIle: 3.252 ± 0.096
ThrLys: 2.378 ± 0.072
ThrLeu: 4.092 ± 0.103
ThrMet: 1.543 ± 0.056
ThrAsn: 1.557 ± 0.064
ThrPro: 2.193 ± 0.075
ThrGln: 1.121 ± 0.05
ThrArg: 2.129 ± 0.071
ThrSer: 3.326 ± 0.089
ThrThr: 2.458 ± 0.084
ThrVal: 5.537 ± 0.15
ThrTrp: 0.465 ± 0.029
ThrTyr: 2.032 ± 0.078
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.026 ± 0.122
ValCys: 1.51 ± 0.06
ValAsp: 5.339 ± 0.11
ValGlu: 4.884 ± 0.118
ValPhe: 3.23 ± 0.099
ValGly: 5.608 ± 0.13
ValHis: 1.229 ± 0.05
ValIle: 5.123 ± 0.103
ValLys: 4.501 ± 0.097
ValLeu: 6.847 ± 0.149
ValMet: 2.598 ± 0.071
ValAsn: 2.431 ± 0.074
ValPro: 3.778 ± 0.104
ValGln: 1.634 ± 0.063
ValArg: 4.585 ± 0.115
ValSer: 5.673 ± 0.134
ValThr: 4.397 ± 0.116
ValVal: 5.954 ± 0.135
ValTrp: 0.689 ± 0.048
ValTyr: 2.58 ± 0.081
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.732 ± 0.043
TrpCys: 0.177 ± 0.017
TrpAsp: 0.644 ± 0.039
TrpGlu: 0.569 ± 0.03
TrpPhe: 0.438 ± 0.036
TrpGly: 0.605 ± 0.037
TrpHis: 0.171 ± 0.018
TrpIle: 0.569 ± 0.038
TrpLys: 0.624 ± 0.04
TrpLeu: 0.666 ± 0.037
TrpMet: 0.367 ± 0.029
TrpAsn: 0.469 ± 0.042
TrpPro: 0.273 ± 0.02
TrpGln: 0.236 ± 0.022
TrpArg: 0.434 ± 0.03
TrpSer: 0.507 ± 0.032
TrpThr: 0.487 ± 0.03
TrpVal: 0.564 ± 0.037
TrpTrp: 0.112 ± 0.022
TrpTyr: 0.389 ± 0.035
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.885 ± 0.089
TyrCys: 0.713 ± 0.038
TyrAsp: 2.769 ± 0.073
TyrGlu: 1.981 ± 0.071
TyrPhe: 1.335 ± 0.047
TyrGly: 3.319 ± 0.109
TyrHis: 0.624 ± 0.031
TyrIle: 2.007 ± 0.07
TyrLys: 1.447 ± 0.061
TyrLeu: 3.132 ± 0.089
TyrMet: 1.062 ± 0.048
TyrAsn: 1.217 ± 0.051
TyrPro: 1.371 ± 0.057
TyrGln: 0.742 ± 0.038
TyrArg: 2.274 ± 0.078
TyrSer: 2.466 ± 0.097
TyrThr: 1.966 ± 0.089
TyrVal: 2.667 ± 0.084
TyrTrp: 0.361 ± 0.029
TyrTyr: 1.38 ± 0.061
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.002 ± 0.002
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1643 proteins (490742 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
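To make the note above concrete, here is a minimal Python sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, and the choice of the sample standard deviation as the error measure are assumptions for this example; the page does not state which statistic is behind the ± values.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the per mille error of one dipeptide by resampling proteins.

    Assumption: the error is the standard deviation across replicates.
    """
    rng = random.Random(seed)
    # Precompute (count of the target dipeptide, total dipeptides) per protein.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[dipeptide], sum(counts.values())))

    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


if __name__ == "__main__":
    toy = ["AAC", "CCA", "ACA", "AAAA"]  # hypothetical toy sequences
    print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.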

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski