Amino acid dipeptide frequency for Methanobrevibacter curvatus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
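
As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs in a list of sequences. It is not the database's own code: the function name `dipeptide_frequencies`, the toy sequences, the use of one-letter codes, and the choice to count pairs only within a protein (never across two proteins) are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping residue pairs.

    Assumption: pairs are counted inside each protein only, with
    overlapping windows, so a protein of length n contributes n - 1 pairs.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Made-up toy sequences (one-letter codes), not taken from the proteome.
toy_proteome = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy_proteome)
```

Here "MKKLA" yields the pairs MK, KK, KL, LA and "MKA" yields MK, KA, so MK accounts for 2 of the 6 pairs.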

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
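
The arithmetic above can be sketched in a few lines. The helper names `expected_per_mille` and `classify` are hypothetical, and using the uniform expectation as the cut-off between "more than expected" and "underrepresented" is an assumption about how the colouring is decided. The three observed values are copied from the table.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, expected):
    """Label a value relative to the expectation (assumed cut-off)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


expected = expected_per_mille()  # 400 dipeptides -> 2.5 per mille

# Values copied from the table (per mille).
observed = {"LysIle": 10.811, "AlaAla": 2.544, "TrpTrp": 0.064}

labels = {pair: classify(value, expected) for pair, value in observed.items()}
# Per mille -> fraction: multiply by 10^-3.
fractions = {pair: value * 1e-3 for pair, value in observed.items()}
```

With 21 residue types (including Xaa) the same function gives 1000/441, roughly 2.27‰, so the choice of alphabet slightly shifts the cut-off.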

For more information see sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is the frequency ± error in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.544 ± 0.098 | 0.508 ± 0.033 | 2.639 ± 0.073 | 2.814 ± 0.073 | 1.919 ± 0.068 | 3.193 ± 0.1 | 0.84 ± 0.04 | 6.627 ± 0.154 | 4.443 ± 0.101 | 4.592 ± 0.104 | 1.057 ± 0.045 | 3.096 ± 0.127 | 1.384 ± 0.05 | 1.14 ± 0.047 | 1.423 ± 0.056 | 3.11 ± 0.081 | 2.731 ± 0.102 | 3.294 ± 0.094 | 0.282 ± 0.024 | 1.739 ± 0.054 | 0.0 ± 0.0
Cys | 0.445 ± 0.03 | 0.138 ± 0.016 | 0.59 ± 0.033 | 0.653 ± 0.038 | 0.383 ± 0.026 | 1.098 ± 0.051 | 0.222 ± 0.019 | 0.658 ± 0.04 | 0.748 ± 0.039 | 0.671 ± 0.037 | 0.182 ± 0.018 | 0.491 ± 0.03 | 0.632 ± 0.05 | 0.252 ± 0.021 | 0.272 ± 0.022 | 0.604 ± 0.035 | 0.395 ± 0.028 | 0.577 ± 0.039 | 0.062 ± 0.011 | 0.247 ± 0.022 | 0.0 ± 0.0
Asp | 2.489 ± 0.074 | 0.524 ± 0.026 | 3.167 ± 0.081 | 4.989 ± 0.112 | 3.223 ± 0.093 | 3.28 ± 0.089 | 0.734 ± 0.036 | 6.147 ± 0.121 | 5.239 ± 0.124 | 6.046 ± 0.121 | 1.179 ± 0.055 | 3.935 ± 0.113 | 1.55 ± 0.055 | 0.826 ± 0.04 | 1.375 ± 0.053 | 3.755 ± 0.089 | 2.358 ± 0.065 | 3.425 ± 0.089 | 0.42 ± 0.029 | 2.475 ± 0.083 | 0.0 ± 0.0
Glu | 3.137 ± 0.078 | 0.517 ± 0.032 | 4.263 ± 0.115 | 5.986 ± 0.141 | 3.349 ± 0.09 | 3.283 ± 0.092 | 0.93 ± 0.04 | 8.074 ± 0.155 | 7.916 ± 0.182 | 6.523 ± 0.138 | 1.516 ± 0.059 | 6.286 ± 0.139 | 1.227 ± 0.058 | 1.057 ± 0.048 | 1.908 ± 0.069 | 3.991 ± 0.088 | 2.98 ± 0.069 | 3.785 ± 0.082 | 0.417 ± 0.027 | 2.717 ± 0.087 | 0.0 ± 0.0
Phe | 2.014 ± 0.062 | 0.372 ± 0.028 | 2.8 ± 0.077 | 2.869 ± 0.085 | 2.004 ± 0.072 | 2.734 ± 0.08 | 0.667 ± 0.036 | 4.436 ± 0.115 | 4.8 ± 0.117 | 4.577 ± 0.104 | 0.81 ± 0.039 | 3.794 ± 0.101 | 1.347 ± 0.049 | 1.042 ± 0.047 | 1.264 ± 0.051 | 3.884 ± 0.112 | 2.143 ± 0.075 | 2.448 ± 0.076 | 0.245 ± 0.021 | 1.959 ± 0.066 | 0.0 ± 0.0
Gly | 3.799 ± 0.163 | 0.669 ± 0.039 | 3.347 ± 0.103 | 3.831 ± 0.098 | 3.013 ± 0.076 | 4.408 ± 0.198 | 1.075 ± 0.04 | 6.348 ± 0.141 | 5.201 ± 0.105 | 5.292 ± 0.108 | 1.273 ± 0.057 | 3.638 ± 0.146 | 1.393 ± 0.069 | 1.206 ± 0.051 | 1.981 ± 0.066 | 4.182 ± 0.127 | 3.338 ± 0.111 | 4.095 ± 0.098 | 0.417 ± 0.024 | 2.454 ± 0.065 | 0.0 ± 0.0
His | 0.653 ± 0.039 | 0.237 ± 0.02 | 0.899 ± 0.042 | 1.042 ± 0.049 | 0.703 ± 0.039 | 1.185 ± 0.045 | 0.36 ± 0.03 | 1.352 ± 0.049 | 1.144 ± 0.051 | 1.522 ± 0.051 | 0.312 ± 0.025 | 0.987 ± 0.04 | 0.754 ± 0.037 | 0.406 ± 0.026 | 0.567 ± 0.032 | 1.08 ± 0.051 | 0.731 ± 0.037 | 0.883 ± 0.044 | 0.113 ± 0.015 | 0.683 ± 0.034 | 0.0 ± 0.0
Ile | 6.141 ± 0.108 | 1.019 ± 0.046 | 5.995 ± 0.115 | 7.126 ± 0.161 | 4.839 ± 0.129 | 5.764 ± 0.111 | 1.709 ± 0.049 | 9.624 ± 0.166 | 9.338 ± 0.156 | 9.827 ± 0.162 | 1.79 ± 0.063 | 7.441 ± 0.155 | 3.52 ± 0.084 | 2.132 ± 0.064 | 2.87 ± 0.08 | 7.988 ± 0.149 | 4.855 ± 0.121 | 5.935 ± 0.122 | 0.508 ± 0.028 | 3.914 ± 0.124 | 0.0 ± 0.0
Lys | 4.15 ± 0.096 | 0.757 ± 0.043 | 5.425 ± 0.11 | 8.004 ± 0.164 | 4.035 ± 0.1 | 4.623 ± 0.095 | 1.423 ± 0.053 | 10.811 ± 0.195 | 9.974 ± 0.194 | 7.997 ± 0.142 | 2.132 ± 0.071 | 9.052 ± 0.215 | 2.325 ± 0.069 | 1.838 ± 0.058 | 2.687 ± 0.068 | 6.013 ± 0.121 | 5.419 ± 0.112 | 4.667 ± 0.089 | 0.708 ± 0.038 | 4.171 ± 0.101 | 0.0 ± 0.0
Leu | 4.872 ± 0.109 | 0.69 ± 0.035 | 5.705 ± 0.126 | 6.553 ± 0.134 | 4.127 ± 0.105 | 5.135 ± 0.118 | 1.172 ± 0.053 | 9.089 ± 0.147 | 9.946 ± 0.154 | 7.407 ± 0.16 | 2.012 ± 0.062 | 8.18 ± 0.168 | 2.591 ± 0.072 | 1.628 ± 0.059 | 2.666 ± 0.074 | 6.874 ± 0.141 | 4.447 ± 0.093 | 4.975 ± 0.118 | 0.5 ± 0.029 | 2.934 ± 0.076 | 0.0 ± 0.0
Met | 1.478 ± 0.051 | 0.162 ± 0.018 | 1.384 ± 0.056 | 1.592 ± 0.055 | 0.782 ± 0.04 | 1.289 ± 0.054 | 0.275 ± 0.021 | 1.949 ± 0.06 | 1.954 ± 0.065 | 1.425 ± 0.055 | 0.394 ± 0.032 | 1.234 ± 0.052 | 0.605 ± 0.034 | 0.378 ± 0.028 | 0.561 ± 0.03 | 1.285 ± 0.047 | 0.93 ± 0.043 | 1.358 ± 0.053 | 0.086 ± 0.014 | 0.568 ± 0.033 | 0.0 ± 0.0
Asn | 2.851 ± 0.102 | 0.696 ± 0.04 | 3.815 ± 0.087 | 5.296 ± 0.129 | 3.437 ± 0.101 | 4.867 ± 0.264 | 1.202 ± 0.052 | 7.972 ± 0.172 | 7.921 ± 0.189 | 7.372 ± 0.152 | 1.312 ± 0.046 | 8.315 ± 0.427 | 2.724 ± 0.078 | 1.94 ± 0.065 | 2.032 ± 0.063 | 5.965 ± 0.236 | 3.741 ± 0.13 | 3.914 ± 0.176 | 0.51 ± 0.032 | 3.095 ± 0.084 | 0.0 ± 0.0
Pro | 1.28 ± 0.051 | 0.307 ± 0.025 | 1.682 ± 0.083 | 2.235 ± 0.073 | 1.55 ± 0.058 | 1.771 ± 0.063 | 0.533 ± 0.03 | 2.884 ± 0.084 | 2.685 ± 0.08 | 2.816 ± 0.077 | 0.59 ± 0.032 | 1.993 ± 0.055 | 0.918 ± 0.054 | 0.778 ± 0.037 | 0.907 ± 0.055 | 2.058 ± 0.056 | 1.486 ± 0.055 | 2.182 ± 0.071 | 0.215 ± 0.022 | 1.188 ± 0.045 | 0.0 ± 0.0
Gln | 1.075 ± 0.044 | 0.207 ± 0.02 | 1.001 ± 0.043 | 1.446 ± 0.053 | 0.796 ± 0.035 | 1.2 ± 0.051 | 0.367 ± 0.028 | 2.253 ± 0.072 | 2.057 ± 0.07 | 1.914 ± 0.058 | 0.505 ± 0.028 | 1.559 ± 0.059 | 0.47 ± 0.033 | 0.463 ± 0.031 | 0.77 ± 0.035 | 1.412 ± 0.053 | 1.13 ± 0.046 | 1.033 ± 0.044 | 0.217 ± 0.02 | 0.916 ± 0.049 | 0.0 ± 0.0
Arg | 1.476 ± 0.057 | 0.3 ± 0.025 | 1.557 ± 0.062 | 2.267 ± 0.073 | 1.389 ± 0.045 | 1.868 ± 0.064 | 0.464 ± 0.033 | 2.839 ± 0.072 | 2.636 ± 0.081 | 2.544 ± 0.102 | 0.623 ± 0.035 | 1.825 ± 0.057 | 0.913 ± 0.047 | 0.65 ± 0.036 | 1.23 ± 0.057 | 1.665 ± 0.06 | 1.474 ± 0.06 | 1.658 ± 0.053 | 0.184 ± 0.019 | 1.08 ± 0.046 | 0.0 ± 0.0
Ser | 3.208 ± 0.102 | 0.59 ± 0.029 | 3.541 ± 0.076 | 3.87 ± 0.096 | 3.495 ± 0.092 | 4.669 ± 0.149 | 1.204 ± 0.047 | 6.652 ± 0.122 | 7.139 ± 0.136 | 6.336 ± 0.115 | 1.246 ± 0.052 | 5.958 ± 0.248 | 2.177 ± 0.068 | 1.912 ± 0.053 | 1.972 ± 0.064 | 5.642 ± 0.196 | 3.783 ± 0.128 | 3.7 ± 0.12 | 0.515 ± 0.035 | 2.886 ± 0.069 | 0.0 ± 0.0
Thr | 2.738 ± 0.077 | 0.482 ± 0.033 | 2.602 ± 0.069 | 2.514 ± 0.071 | 2.297 ± 0.068 | 3.845 ± 0.174 | 0.817 ± 0.037 | 5.502 ± 0.123 | 3.977 ± 0.084 | 4.593 ± 0.098 | 0.853 ± 0.041 | 3.637 ± 0.159 | 1.968 ± 0.082 | 1.08 ± 0.042 | 1.356 ± 0.054 | 3.259 ± 0.098 | 3.024 ± 0.113 | 3.245 ± 0.103 | 0.3 ± 0.028 | 1.82 ± 0.06 | 0.0 ± 0.0
Val | 3.234 ± 0.09 | 0.658 ± 0.044 | 3.817 ± 0.08 | 4.0 ± 0.111 | 2.588 ± 0.077 | 3.688 ± 0.089 | 0.872 ± 0.043 | 5.059 ± 0.098 | 5.181 ± 0.097 | 5.668 ± 0.122 | 1.034 ± 0.048 | 3.818 ± 0.148 | 1.982 ± 0.078 | 1.013 ± 0.042 | 1.513 ± 0.064 | 4.263 ± 0.107 | 2.653 ± 0.106 | 3.96 ± 0.126 | 0.344 ± 0.038 | 2.164 ± 0.066 | 0.0 ± 0.0
Trp | 0.295 ± 0.023 | 0.067 ± 0.011 | 0.537 ± 0.036 | 0.408 ± 0.026 | 0.298 ± 0.026 | 0.413 ± 0.026 | 0.097 ± 0.013 | 0.563 ± 0.033 | 0.505 ± 0.034 | 0.561 ± 0.035 | 0.164 ± 0.018 | 0.579 ± 0.035 | 0.125 ± 0.015 | 0.177 ± 0.019 | 0.214 ± 0.02 | 0.346 ± 0.026 | 0.314 ± 0.025 | 0.399 ± 0.03 | 0.064 ± 0.011 | 0.312 ± 0.026 | 0.0 ± 0.0
Tyr | 1.499 ± 0.052 | 0.42 ± 0.029 | 2.373 ± 0.072 | 2.371 ± 0.057 | 1.982 ± 0.062 | 2.646 ± 0.079 | 0.644 ± 0.033 | 3.495 ± 0.1 | 3.409 ± 0.095 | 3.822 ± 0.101 | 0.703 ± 0.039 | 3.156 ± 0.106 | 1.455 ± 0.057 | 0.872 ± 0.041 | 1.073 ± 0.051 | 3.118 ± 0.094 | 1.975 ± 0.078 | 1.981 ± 0.066 | 0.33 ± 0.026 | 1.643 ± 0.056 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 1968 proteins (566479 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
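
A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the statistic is recomputed for each resample, and the spread of the resulting estimates is reported. This is only an illustration under assumptions. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are not taken from the source.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of the estimate when whole proteins are resampled.

    Assumption: the error is the sample standard deviation across
    replicates; the source only states "bootstrapping (x100)".
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample proteins (not residues) with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences (one-letter codes), not taken from the proteome.
toy = ["MKKLA", "MKA", "AKKKL", "MLLKA"]
err = bootstrap_error(toy, "KK")
```

Resampling at the protein level rather than the residue level keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every replicate.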

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski