Amino acid dipepetide frequency for Methanomassiliicoccus intestinalis (strain Issoire-Mx1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.123AlaAla: 7.123 ± 0.173
1.033AlaCys: 1.033 ± 0.048
4.823AlaAsp: 4.823 ± 0.1
5.268AlaGlu: 5.268 ± 0.102
2.826AlaPhe: 2.826 ± 0.094
5.65AlaGly: 5.65 ± 0.102
1.072AlaHis: 1.072 ± 0.048
5.39AlaIle: 5.39 ± 0.114
4.409AlaLys: 4.409 ± 0.108
6.881AlaLeu: 6.881 ± 0.145
2.195AlaMet: 2.195 ± 0.069
2.483AlaAsn: 2.483 ± 0.083
2.301AlaPro: 2.301 ± 0.068
1.781AlaGln: 1.781 ± 0.058
2.621AlaArg: 2.621 ± 0.091
5.005AlaSer: 5.005 ± 0.124
3.252AlaThr: 3.252 ± 0.082
6.677AlaVal: 6.677 ± 0.121
0.599AlaTrp: 0.599 ± 0.033
2.575AlaTyr: 2.575 ± 0.074
0.0AlaXaa: 0.0 ± 0.0
Cys
1.024CysAla: 1.024 ± 0.048
0.279CysCys: 0.279 ± 0.024
0.787CysAsp: 0.787 ± 0.037
0.855CysGlu: 0.855 ± 0.046
0.537CysPhe: 0.537 ± 0.032
1.695CysGly: 1.695 ± 0.074
0.268CysHis: 0.268 ± 0.02
1.264CysIle: 1.264 ± 0.053
0.86CysLys: 0.86 ± 0.043
1.054CysLeu: 1.054 ± 0.051
0.4CysMet: 0.4 ± 0.028
0.634CysAsn: 0.634 ± 0.034
0.972CysPro: 0.972 ± 0.057
0.3CysGln: 0.3 ± 0.028
0.782CysArg: 0.782 ± 0.039
1.21CysSer: 1.21 ± 0.051
0.851CysThr: 0.851 ± 0.042
0.956CysVal: 0.956 ± 0.047
0.126CysTrp: 0.126 ± 0.013
0.51CysTyr: 0.51 ± 0.033
0.0CysXaa: 0.0 ± 0.0
Asp
4.69AspAla: 4.69 ± 0.109
0.896AspCys: 0.896 ± 0.04
3.656AspAsp: 3.656 ± 0.085
4.814AspGlu: 4.814 ± 0.094
2.296AspPhe: 2.296 ± 0.065
4.466AspGly: 4.466 ± 0.132
0.961AspHis: 0.961 ± 0.041
5.296AspIle: 5.296 ± 0.104
3.668AspLys: 3.668 ± 0.094
5.995AspLeu: 5.995 ± 0.112
1.909AspMet: 1.909 ± 0.063
2.669AspAsn: 2.669 ± 0.088
2.538AspPro: 2.538 ± 0.073
1.258AspGln: 1.258 ± 0.046
2.374AspArg: 2.374 ± 0.072
4.246AspSer: 4.246 ± 0.125
2.801AspThr: 2.801 ± 0.084
4.674AspVal: 4.674 ± 0.105
0.629AspTrp: 0.629 ± 0.038
2.33AspTyr: 2.33 ± 0.072
0.0AspXaa: 0.0 ± 0.0
Glu
4.58GluAla: 4.58 ± 0.116
1.022GluCys: 1.022 ± 0.046
4.082GluAsp: 4.082 ± 0.09
5.479GluGlu: 5.479 ± 0.123
2.563GluPhe: 2.563 ± 0.081
4.137GluGly: 4.137 ± 0.094
1.098GluHis: 1.098 ± 0.047
5.923GluIle: 5.923 ± 0.115
5.271GluLys: 5.271 ± 0.114
6.138GluLeu: 6.138 ± 0.138
2.207GluMet: 2.207 ± 0.059
3.668GluAsn: 3.668 ± 0.093
1.847GluPro: 1.847 ± 0.07
1.537GluGln: 1.537 ± 0.055
3.096GluArg: 3.096 ± 0.099
4.427GluSer: 4.427 ± 0.098
3.414GluThr: 3.414 ± 0.077
4.457GluVal: 4.457 ± 0.1
0.679GluTrp: 0.679 ± 0.032
2.845GluTyr: 2.845 ± 0.069
0.0GluXaa: 0.0 ± 0.0
Phe
2.595PheAla: 2.595 ± 0.077
0.618PheCys: 0.618 ± 0.039
2.504PheAsp: 2.504 ± 0.067
2.572PheGlu: 2.572 ± 0.07
1.42PhePhe: 1.42 ± 0.067
2.888PheGly: 2.888 ± 0.086
0.579PheHis: 0.579 ± 0.033
2.886PheIle: 2.886 ± 0.085
1.996PheLys: 1.996 ± 0.055
3.028PheLeu: 3.028 ± 0.102
1.029PheMet: 1.029 ± 0.049
1.823PheAsn: 1.823 ± 0.074
1.495PhePro: 1.495 ± 0.062
0.906PheGln: 0.906 ± 0.042
1.388PheArg: 1.388 ± 0.045
3.14PheSer: 3.14 ± 0.08
2.342PheThr: 2.342 ± 0.097
2.458PheVal: 2.458 ± 0.074
0.304PheTrp: 0.304 ± 0.022
1.299PheTyr: 1.299 ± 0.048
0.0PheXaa: 0.0 ± 0.0
Gly
5.133GlyAla: 5.133 ± 0.115
1.246GlyCys: 1.246 ± 0.063
4.104GlyAsp: 4.104 ± 0.109
4.153GlyGlu: 4.153 ± 0.096
2.673GlyPhe: 2.673 ± 0.069
5.57GlyGly: 5.57 ± 0.27
1.169GlyHis: 1.169 ± 0.042
6.27GlyIle: 6.27 ± 0.118
5.301GlyLys: 5.301 ± 0.102
5.33GlyLeu: 5.33 ± 0.098
2.321GlyMet: 2.321 ± 0.069
3.329GlyAsn: 3.329 ± 0.144
1.925GlyPro: 1.925 ± 0.066
1.475GlyGln: 1.475 ± 0.054
3.124GlyArg: 3.124 ± 0.093
5.399GlySer: 5.399 ± 0.178
4.555GlyThr: 4.555 ± 0.159
5.042GlyVal: 5.042 ± 0.103
1.024GlyTrp: 1.024 ± 0.075
3.183GlyTyr: 3.183 ± 0.106
0.0GlyXaa: 0.0 ± 0.0
His
1.104HisAla: 1.104 ± 0.048
0.3HisCys: 0.3 ± 0.026
0.954HisAsp: 0.954 ± 0.043
0.972HisGlu: 0.972 ± 0.044
0.627HisPhe: 0.627 ± 0.034
1.148HisGly: 1.148 ± 0.05
0.343HisHis: 0.343 ± 0.027
1.381HisIle: 1.381 ± 0.053
0.834HisLys: 0.834 ± 0.036
1.432HisLeu: 1.432 ± 0.054
0.498HisMet: 0.498 ± 0.034
0.672HisAsn: 0.672 ± 0.038
0.922HisPro: 0.922 ± 0.049
0.469HisGln: 0.469 ± 0.033
0.722HisArg: 0.722 ± 0.037
1.047HisSer: 1.047 ± 0.042
1.008HisThr: 1.008 ± 0.041
1.011HisVal: 1.011 ± 0.043
0.203HisTrp: 0.203 ± 0.018
0.611HisTyr: 0.611 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
6.21IleAla: 6.21 ± 0.139
1.274IleCys: 1.274 ± 0.054
5.188IleAsp: 5.188 ± 0.105
5.269IleGlu: 5.269 ± 0.116
3.014IlePhe: 3.014 ± 0.088
5.854IleGly: 5.854 ± 0.134
1.308IleHis: 1.308 ± 0.051
6.638IleIle: 6.638 ± 0.141
4.496IleLys: 4.496 ± 0.088
7.237IleLeu: 7.237 ± 0.145
1.926IleMet: 1.926 ± 0.063
3.469IleAsn: 3.469 ± 0.095
3.641IlePro: 3.641 ± 0.091
1.852IleGln: 1.852 ± 0.052
3.055IleArg: 3.055 ± 0.084
6.805IleSer: 6.805 ± 0.124
4.893IleThr: 4.893 ± 0.157
5.644IleVal: 5.644 ± 0.118
0.663IleTrp: 0.663 ± 0.036
2.46IleTyr: 2.46 ± 0.083
0.0IleXaa: 0.0 ± 0.0
Lys
4.235LysAla: 4.235 ± 0.108
1.029LysCys: 1.029 ± 0.052
4.214LysAsp: 4.214 ± 0.098
5.047LysGlu: 5.047 ± 0.111
2.063LysPhe: 2.063 ± 0.054
3.928LysGly: 3.928 ± 0.096
1.002LysHis: 1.002 ± 0.044
5.348LysIle: 5.348 ± 0.104
4.926LysLys: 4.926 ± 0.119
5.339LysLeu: 5.339 ± 0.114
2.044LysMet: 2.044 ± 0.063
3.434LysAsn: 3.434 ± 0.073
1.987LysPro: 1.987 ± 0.068
1.608LysGln: 1.608 ± 0.062
2.664LysArg: 2.664 ± 0.07
4.336LysSer: 4.336 ± 0.085
3.944LysThr: 3.944 ± 0.101
4.239LysVal: 4.239 ± 0.089
0.65LysTrp: 0.65 ± 0.044
2.511LysTyr: 2.511 ± 0.068
0.0LysXaa: 0.0 ± 0.0
Leu
6.407LeuAla: 6.407 ± 0.15
1.315LeuCys: 1.315 ± 0.049
5.643LeuAsp: 5.643 ± 0.112
5.872LeuGlu: 5.872 ± 0.126
3.13LeuPhe: 3.13 ± 0.095
6.174LeuGly: 6.174 ± 0.121
1.36LeuHis: 1.36 ± 0.042
6.741LeuIle: 6.741 ± 0.152
5.931LeuLys: 5.931 ± 0.122
6.871LeuLeu: 6.871 ± 0.178
2.644LeuMet: 2.644 ± 0.079
4.223LeuAsn: 4.223 ± 0.091
3.412LeuPro: 3.412 ± 0.071
2.015LeuGln: 2.015 ± 0.066
3.773LeuArg: 3.773 ± 0.104
6.887LeuSer: 6.887 ± 0.14
4.683LeuThr: 4.683 ± 0.111
5.424LeuVal: 5.424 ± 0.119
0.618LeuTrp: 0.618 ± 0.036
2.723LeuTyr: 2.723 ± 0.077
0.0LeuXaa: 0.0 ± 0.0
Met
2.076MetAla: 2.076 ± 0.065
0.361MetCys: 0.361 ± 0.028
1.777MetAsp: 1.777 ± 0.055
1.799MetGlu: 1.799 ± 0.059
0.97MetPhe: 0.97 ± 0.046
2.008MetGly: 2.008 ± 0.061
0.551MetHis: 0.551 ± 0.035
2.447MetIle: 2.447 ± 0.065
2.266MetLys: 2.266 ± 0.078
2.428MetLeu: 2.428 ± 0.07
0.995MetMet: 0.995 ± 0.05
1.557MetAsn: 1.557 ± 0.056
1.178MetPro: 1.178 ± 0.042
0.746MetGln: 0.746 ± 0.038
1.312MetArg: 1.312 ± 0.048
2.234MetSer: 2.234 ± 0.063
1.589MetThr: 1.589 ± 0.063
1.756MetVal: 1.756 ± 0.063
0.245MetTrp: 0.245 ± 0.02
0.828MetTyr: 0.828 ± 0.037
0.0MetXaa: 0.0 ± 0.0
Asn
3.405AsnAla: 3.405 ± 0.095
0.777AsnCys: 0.777 ± 0.046
2.684AsnAsp: 2.684 ± 0.083
2.916AsnGlu: 2.916 ± 0.081
1.575AsnPhe: 1.575 ± 0.061
3.897AsnGly: 3.897 ± 0.165
0.716AsnHis: 0.716 ± 0.036
3.931AsnIle: 3.931 ± 0.109
2.591AsnLys: 2.591 ± 0.062
3.979AsnLeu: 3.979 ± 0.1
1.292AsnMet: 1.292 ± 0.051
2.204AsnAsn: 2.204 ± 0.088
2.015AsnPro: 2.015 ± 0.061
0.958AsnGln: 0.958 ± 0.043
1.587AsnArg: 1.587 ± 0.058
3.405AsnSer: 3.405 ± 0.106
2.481AsnThr: 2.481 ± 0.098
3.457AsnVal: 3.457 ± 0.111
0.466AsnTrp: 0.466 ± 0.03
1.591AsnTyr: 1.591 ± 0.069
0.0AsnXaa: 0.0 ± 0.0
Pro
2.687ProAla: 2.687 ± 0.09
0.549ProCys: 0.549 ± 0.035
2.78ProAsp: 2.78 ± 0.075
3.249ProGlu: 3.249 ± 0.082
1.523ProPhe: 1.523 ± 0.053
2.463ProGly: 2.463 ± 0.075
0.764ProHis: 0.764 ± 0.042
2.529ProIle: 2.529 ± 0.074
2.143ProLys: 2.143 ± 0.059
3.135ProLeu: 3.135 ± 0.085
0.865ProMet: 0.865 ± 0.042
1.507ProAsn: 1.507 ± 0.055
1.233ProPro: 1.233 ± 0.052
1.061ProGln: 1.061 ± 0.044
1.267ProArg: 1.267 ± 0.054
2.669ProSer: 2.669 ± 0.074
1.966ProThr: 1.966 ± 0.071
3.115ProVal: 3.115 ± 0.078
0.331ProTrp: 0.331 ± 0.026
1.374ProTyr: 1.374 ± 0.049
0.0ProXaa: 0.0 ± 0.0
Gln
1.594GlnAla: 1.594 ± 0.057
0.341GlnCys: 0.341 ± 0.029
1.258GlnAsp: 1.258 ± 0.048
1.653GlnGlu: 1.653 ± 0.056
0.803GlnPhe: 0.803 ± 0.041
1.383GlnGly: 1.383 ± 0.051
0.377GlnHis: 0.377 ± 0.027
2.11GlnIle: 2.11 ± 0.072
1.759GlnLys: 1.759 ± 0.06
1.738GlnLeu: 1.738 ± 0.054
0.873GlnMet: 0.873 ± 0.042
1.304GlnAsn: 1.304 ± 0.047
0.775GlnPro: 0.775 ± 0.04
0.684GlnGln: 0.684 ± 0.043
1.109GlnArg: 1.109 ± 0.051
1.633GlnSer: 1.633 ± 0.054
1.53GlnThr: 1.53 ± 0.054
1.384GlnVal: 1.384 ± 0.052
0.275GlnTrp: 0.275 ± 0.024
1.086GlnTyr: 1.086 ± 0.049
0.0GlnXaa: 0.0 ± 0.0
Arg
2.63ArgAla: 2.63 ± 0.084
0.668ArgCys: 0.668 ± 0.038
2.373ArgAsp: 2.373 ± 0.084
2.916ArgGlu: 2.916 ± 0.099
1.521ArgPhe: 1.521 ± 0.057
2.771ArgGly: 2.771 ± 0.083
0.711ArgHis: 0.711 ± 0.038
3.274ArgIle: 3.274 ± 0.095
2.856ArgLys: 2.856 ± 0.091
3.524ArgLeu: 3.524 ± 0.101
1.331ArgMet: 1.331 ± 0.057
1.823ArgAsn: 1.823 ± 0.071
1.399ArgPro: 1.399 ± 0.062
1.04ArgGln: 1.04 ± 0.05
1.932ArgArg: 1.932 ± 0.073
2.726ArgSer: 2.726 ± 0.071
1.996ArgThr: 1.996 ± 0.064
2.607ArgVal: 2.607 ± 0.078
0.373ArgTrp: 0.373 ± 0.028
1.631ArgTyr: 1.631 ± 0.061
0.0ArgXaa: 0.0 ± 0.0
Ser
5.513SerAla: 5.513 ± 0.112
0.94SerCys: 0.94 ± 0.048
4.543SerAsp: 4.543 ± 0.109
5.234SerGlu: 5.234 ± 0.117
2.788SerPhe: 2.788 ± 0.071
6.007SerGly: 6.007 ± 0.158
1.164SerHis: 1.164 ± 0.043
5.996SerIle: 5.996 ± 0.127
5.093SerLys: 5.093 ± 0.106
6.316SerLeu: 6.316 ± 0.143
2.069SerMet: 2.069 ± 0.063
3.346SerAsn: 3.346 ± 0.132
2.552SerPro: 2.552 ± 0.078
1.919SerGln: 1.919 ± 0.056
2.954SerArg: 2.954 ± 0.086
6.142SerSer: 6.142 ± 0.155
3.579SerThr: 3.579 ± 0.113
5.147SerVal: 5.147 ± 0.119
0.713SerTrp: 0.713 ± 0.038
2.726SerTyr: 2.726 ± 0.083
0.0SerXaa: 0.0 ± 0.0
Thr
4.411ThrAla: 4.411 ± 0.105
0.656ThrCys: 0.656 ± 0.036
3.332ThrAsp: 3.332 ± 0.106
3.325ThrGlu: 3.325 ± 0.087
2.421ThrPhe: 2.421 ± 0.114
4.358ThrGly: 4.358 ± 0.114
0.862ThrHis: 0.862 ± 0.042
4.583ThrIle: 4.583 ± 0.151
2.955ThrLys: 2.955 ± 0.093
5.076ThrLeu: 5.076 ± 0.113
1.354ThrMet: 1.354 ± 0.057
2.234ThrAsn: 2.234 ± 0.108
2.437ThrPro: 2.437 ± 0.072
1.288ThrGln: 1.288 ± 0.054
1.66ThrArg: 1.66 ± 0.056
4.025ThrSer: 4.025 ± 0.137
3.009ThrThr: 3.009 ± 0.119
4.535ThrVal: 4.535 ± 0.145
0.363ThrTrp: 0.363 ± 0.024
2.12ThrTyr: 2.12 ± 0.094
0.0ThrXaa: 0.0 ± 0.0
Val
5.022ValAla: 5.022 ± 0.106
1.303ValCys: 1.303 ± 0.06
4.324ValAsp: 4.324 ± 0.094
4.237ValGlu: 4.237 ± 0.108
2.799ValPhe: 2.799 ± 0.077
4.603ValGly: 4.603 ± 0.097
1.125ValHis: 1.125 ± 0.051
5.582ValIle: 5.582 ± 0.116
4.594ValLys: 4.594 ± 0.109
6.238ValLeu: 6.238 ± 0.128
1.937ValMet: 1.937 ± 0.062
3.069ValAsn: 3.069 ± 0.1
2.977ValPro: 2.977 ± 0.081
1.672ValGln: 1.672 ± 0.048
2.863ValArg: 2.863 ± 0.077
5.813ValSer: 5.813 ± 0.122
4.176ValThr: 4.176 ± 0.133
5.051ValVal: 5.051 ± 0.119
0.633ValTrp: 0.633 ± 0.036
2.628ValTyr: 2.628 ± 0.099
0.0ValXaa: 0.0 ± 0.0
Trp
0.622TrpAla: 0.622 ± 0.034
0.126TrpCys: 0.126 ± 0.015
0.661TrpAsp: 0.661 ± 0.034
0.64TrpGlu: 0.64 ± 0.032
0.327TrpPhe: 0.327 ± 0.026
0.558TrpGly: 0.558 ± 0.035
0.171TrpHis: 0.171 ± 0.02
0.757TrpIle: 0.757 ± 0.041
0.666TrpLys: 0.666 ± 0.036
0.714TrpLeu: 0.714 ± 0.038
0.32TrpMet: 0.32 ± 0.026
0.592TrpAsn: 0.592 ± 0.033
0.235TrpPro: 0.235 ± 0.019
0.247TrpGln: 0.247 ± 0.021
0.386TrpArg: 0.386 ± 0.026
0.606TrpSer: 0.606 ± 0.027
0.54TrpThr: 0.54 ± 0.036
0.556TrpVal: 0.556 ± 0.031
0.116TrpTrp: 0.116 ± 0.017
0.565TrpTyr: 0.565 ± 0.053
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.902TyrAla: 2.902 ± 0.104
0.633TyrCys: 0.633 ± 0.035
2.481TyrAsp: 2.481 ± 0.085
2.2TyrGlu: 2.2 ± 0.061
1.395TyrPhe: 1.395 ± 0.051
2.764TyrGly: 2.764 ± 0.127
0.656TyrHis: 0.656 ± 0.032
2.557TyrIle: 2.557 ± 0.072
1.93TyrLys: 1.93 ± 0.071
3.391TyrLeu: 3.391 ± 0.085
0.912TyrMet: 0.912 ± 0.048
1.886TyrAsn: 1.886 ± 0.089
1.37TyrPro: 1.37 ± 0.05
0.851TyrGln: 0.851 ± 0.038
1.434TyrArg: 1.434 ± 0.051
2.987TyrSer: 2.987 ± 0.095
2.396TyrThr: 2.396 ± 0.127
2.454TyrVal: 2.454 ± 0.068
0.396TyrTrp: 0.396 ± 0.028
1.443TyrTyr: 1.443 ± 0.062
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1826 proteins (562683 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski