Amino acid dipepetide frequency for Smithella sp. ME-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.975AlaAla: 6.975 ± 0.236
1.052AlaCys: 1.052 ± 0.076
4.152AlaAsp: 4.152 ± 0.156
4.702AlaGlu: 4.702 ± 0.164
3.231AlaPhe: 3.231 ± 0.154
5.972AlaGly: 5.972 ± 0.191
1.324AlaHis: 1.324 ± 0.086
6.163AlaIle: 6.163 ± 0.181
4.37AlaLys: 4.37 ± 0.176
7.438AlaLeu: 7.438 ± 0.22
2.267AlaMet: 2.267 ± 0.116
2.752AlaAsn: 2.752 ± 0.111
2.327AlaPro: 2.327 ± 0.11
2.479AlaGln: 2.479 ± 0.119
3.901AlaArg: 3.901 ± 0.174
4.185AlaSer: 4.185 ± 0.165
3.727AlaThr: 3.727 ± 0.135
5.809AlaVal: 5.809 ± 0.177
0.708AlaTrp: 0.708 ± 0.065
2.37AlaTyr: 2.37 ± 0.121
0.0AlaXaa: 0.0 ± 0.0
Cys
0.91CysAla: 0.91 ± 0.073
0.251CysCys: 0.251 ± 0.045
0.67CysAsp: 0.67 ± 0.079
0.714CysGlu: 0.714 ± 0.067
0.594CysPhe: 0.594 ± 0.051
1.204CysGly: 1.204 ± 0.092
0.229CysHis: 0.229 ± 0.039
0.687CysIle: 0.687 ± 0.058
0.659CysLys: 0.659 ± 0.054
1.204CysLeu: 1.204 ± 0.069
0.316CysMet: 0.316 ± 0.039
0.501CysAsn: 0.501 ± 0.053
0.855CysPro: 0.855 ± 0.089
0.392CysGln: 0.392 ± 0.054
0.654CysArg: 0.654 ± 0.063
0.747CysSer: 0.747 ± 0.06
0.654CysThr: 0.654 ± 0.058
0.67CysVal: 0.67 ± 0.06
0.153CysTrp: 0.153 ± 0.03
0.414CysTyr: 0.414 ± 0.053
0.0CysXaa: 0.0 ± 0.0
Asp
3.88AspAla: 3.88 ± 0.138
0.583AspCys: 0.583 ± 0.07
2.637AspAsp: 2.637 ± 0.134
3.569AspGlu: 3.569 ± 0.138
2.632AspPhe: 2.632 ± 0.11
3.209AspGly: 3.209 ± 0.119
0.981AspHis: 0.981 ± 0.079
4.893AspIle: 4.893 ± 0.16
3.94AspLys: 3.94 ± 0.152
5.024AspLeu: 5.024 ± 0.19
1.504AspMet: 1.504 ± 0.079
2.321AspAsn: 2.321 ± 0.123
2.016AspPro: 2.016 ± 0.112
1.498AspGln: 1.498 ± 0.104
2.141AspArg: 2.141 ± 0.122
2.616AspSer: 2.616 ± 0.14
2.343AspThr: 2.343 ± 0.108
3.891AspVal: 3.891 ± 0.143
0.621AspTrp: 0.621 ± 0.062
2.245AspTyr: 2.245 ± 0.117
0.0AspXaa: 0.0 ± 0.0
Glu
4.871GluAla: 4.871 ± 0.157
0.496GluCys: 0.496 ± 0.046
3.019GluAsp: 3.019 ± 0.14
4.779GluGlu: 4.779 ± 0.174
2.599GluPhe: 2.599 ± 0.125
3.471GluGly: 3.471 ± 0.124
1.09GluHis: 1.09 ± 0.072
5.901GluIle: 5.901 ± 0.193
6.631GluLys: 6.631 ± 0.214
5.596GluLeu: 5.596 ± 0.192
1.793GluMet: 1.793 ± 0.101
3.417GluAsn: 3.417 ± 0.145
1.847GluPro: 1.847 ± 0.086
2.163GluGln: 2.163 ± 0.115
3.22GluArg: 3.22 ± 0.16
3.417GluSer: 3.417 ± 0.128
3.128GluThr: 3.128 ± 0.123
3.836GluVal: 3.836 ± 0.144
0.588GluTrp: 0.588 ± 0.063
1.929GluTyr: 1.929 ± 0.101
0.0GluXaa: 0.0 ± 0.0
Phe
3.231PheAla: 3.231 ± 0.135
0.752PheCys: 0.752 ± 0.066
2.556PheAsp: 2.556 ± 0.128
2.027PheGlu: 2.027 ± 0.081
2.468PhePhe: 2.468 ± 0.138
3.324PheGly: 3.324 ± 0.137
0.823PheHis: 0.823 ± 0.067
3.705PheIle: 3.705 ± 0.178
2.741PheLys: 2.741 ± 0.137
4.615PheLeu: 4.615 ± 0.197
1.095PheMet: 1.095 ± 0.085
2.092PheAsn: 2.092 ± 0.106
1.815PhePro: 1.815 ± 0.102
1.33PheGln: 1.33 ± 0.09
1.94PheArg: 1.94 ± 0.099
3.357PheSer: 3.357 ± 0.144
2.67PheThr: 2.67 ± 0.115
3.144PheVal: 3.144 ± 0.15
0.594PheTrp: 0.594 ± 0.072
1.711PheTyr: 1.711 ± 0.104
0.0PheXaa: 0.0 ± 0.0
Gly
4.779GlyAla: 4.779 ± 0.187
1.117GlyCys: 1.117 ± 0.087
3.226GlyAsp: 3.226 ± 0.129
3.896GlyGlu: 3.896 ± 0.133
3.291GlyPhe: 3.291 ± 0.137
4.779GlyGly: 4.779 ± 0.232
1.379GlyHis: 1.379 ± 0.086
6.13GlyIle: 6.13 ± 0.209
5.444GlyLys: 5.444 ± 0.186
6.272GlyLeu: 6.272 ± 0.176
2.054GlyMet: 2.054 ± 0.107
2.953GlyAsn: 2.953 ± 0.13
1.678GlyPro: 1.678 ± 0.099
1.858GlyGln: 1.858 ± 0.094
3.199GlyArg: 3.199 ± 0.14
3.907GlySer: 3.907 ± 0.155
3.46GlyThr: 3.46 ± 0.134
4.354GlyVal: 4.354 ± 0.167
0.828GlyTrp: 0.828 ± 0.076
2.686GlyTyr: 2.686 ± 0.134
0.0GlyXaa: 0.0 ± 0.0
His
1.253HisAla: 1.253 ± 0.079
0.316HisCys: 0.316 ± 0.044
0.861HisAsp: 0.861 ± 0.075
1.084HisGlu: 1.084 ± 0.08
0.861HisPhe: 0.861 ± 0.077
1.373HisGly: 1.373 ± 0.09
0.49HisHis: 0.49 ± 0.063
1.291HisIle: 1.291 ± 0.084
1.052HisLys: 1.052 ± 0.074
1.923HisLeu: 1.923 ± 0.129
0.414HisMet: 0.414 ± 0.048
0.79HisAsn: 0.79 ± 0.069
1.15HisPro: 1.15 ± 0.083
0.665HisGln: 0.665 ± 0.061
0.943HisArg: 0.943 ± 0.077
1.019HisSer: 1.019 ± 0.077
0.866HisThr: 0.866 ± 0.071
1.052HisVal: 1.052 ± 0.077
0.223HisTrp: 0.223 ± 0.037
0.627HisTyr: 0.627 ± 0.058
0.0HisXaa: 0.0 ± 0.0
Ile
6.441IleAla: 6.441 ± 0.218
1.073IleCys: 1.073 ± 0.087
4.446IleAsp: 4.446 ± 0.156
5.155IleGlu: 5.155 ± 0.193
3.961IlePhe: 3.961 ± 0.153
5.078IleGly: 5.078 ± 0.168
1.4IleHis: 1.4 ± 0.092
7.269IleIle: 7.269 ± 0.26
6.283IleLys: 6.283 ± 0.236
7.547IleLeu: 7.547 ± 0.212
1.923IleMet: 1.923 ± 0.105
4.316IleAsn: 4.316 ± 0.182
3.722IlePro: 3.722 ± 0.139
2.196IleGln: 2.196 ± 0.116
3.803IleArg: 3.803 ± 0.138
5.324IleSer: 5.324 ± 0.175
4.348IleThr: 4.348 ± 0.157
5.411IleVal: 5.411 ± 0.164
0.834IleTrp: 0.834 ± 0.074
2.479IleTyr: 2.479 ± 0.115
0.0IleXaa: 0.0 ± 0.0
Lys
4.937LysAla: 4.937 ± 0.186
0.757LysCys: 0.757 ± 0.07
4.158LysAsp: 4.158 ± 0.159
5.591LysGlu: 5.591 ± 0.19
2.496LysPhe: 2.496 ± 0.126
4.288LysGly: 4.288 ± 0.175
0.975LysHis: 0.975 ± 0.076
6.615LysIle: 6.615 ± 0.208
7.171LysLys: 7.171 ± 0.229
5.918LysLeu: 5.918 ± 0.179
2.403LysMet: 2.403 ± 0.12
4.185LysAsn: 4.185 ± 0.146
2.605LysPro: 2.605 ± 0.13
2.528LysGln: 2.528 ± 0.136
3.482LysArg: 3.482 ± 0.153
4.021LysSer: 4.021 ± 0.17
3.891LysThr: 3.891 ± 0.171
4.43LysVal: 4.43 ± 0.159
0.861LysTrp: 0.861 ± 0.075
2.605LysTyr: 2.605 ± 0.131
0.0LysXaa: 0.0 ± 0.0
Leu
7.296LeuAla: 7.296 ± 0.215
1.21LeuCys: 1.21 ± 0.078
4.915LeuAsp: 4.915 ± 0.134
5.569LeuGlu: 5.569 ± 0.157
4.479LeuPhe: 4.479 ± 0.161
5.961LeuGly: 5.961 ± 0.208
1.727LeuHis: 1.727 ± 0.109
7.574LeuIle: 7.574 ± 0.245
7.563LeuLys: 7.563 ± 0.229
9.04LeuLeu: 9.04 ± 0.248
2.517LeuMet: 2.517 ± 0.132
4.343LeuAsn: 4.343 ± 0.156
4.425LeuPro: 4.425 ± 0.188
2.877LeuGln: 2.877 ± 0.145
4.599LeuArg: 4.599 ± 0.197
6.283LeuSer: 6.283 ± 0.185
5.04LeuThr: 5.04 ± 0.153
5.498LeuVal: 5.498 ± 0.186
0.823LeuTrp: 0.823 ± 0.067
2.654LeuTyr: 2.654 ± 0.133
0.0LeuXaa: 0.0 ± 0.0
Met
2.523MetAla: 2.523 ± 0.141
0.163MetCys: 0.163 ± 0.034
1.444MetAsp: 1.444 ± 0.084
2.12MetGlu: 2.12 ± 0.111
1.014MetPhe: 1.014 ± 0.072
1.962MetGly: 1.962 ± 0.103
0.485MetHis: 0.485 ± 0.046
1.945MetIle: 1.945 ± 0.107
2.349MetLys: 2.349 ± 0.119
2.24MetLeu: 2.24 ± 0.122
0.697MetMet: 0.697 ± 0.07
1.357MetAsn: 1.357 ± 0.098
1.281MetPro: 1.281 ± 0.075
0.861MetGln: 0.861 ± 0.071
1.34MetArg: 1.34 ± 0.091
1.607MetSer: 1.607 ± 0.093
1.33MetThr: 1.33 ± 0.079
1.629MetVal: 1.629 ± 0.087
0.267MetTrp: 0.267 ± 0.041
0.529MetTyr: 0.529 ± 0.058
0.0MetXaa: 0.0 ± 0.0
Asn
3.226AsnAla: 3.226 ± 0.144
0.561AsnCys: 0.561 ± 0.057
2.185AsnAsp: 2.185 ± 0.115
2.49AsnGlu: 2.49 ± 0.124
2.082AsnPhe: 2.082 ± 0.096
2.741AsnGly: 2.741 ± 0.114
0.899AsnHis: 0.899 ± 0.085
4.659AsnIle: 4.659 ± 0.197
3.15AsnLys: 3.15 ± 0.138
4.762AsnLeu: 4.762 ± 0.167
1.122AsnMet: 1.122 ± 0.076
2.158AsnAsn: 2.158 ± 0.137
2.283AsnPro: 2.283 ± 0.111
1.537AsnGln: 1.537 ± 0.092
2.245AsnArg: 2.245 ± 0.124
2.566AsnSer: 2.566 ± 0.128
1.918AsnThr: 1.918 ± 0.095
3.084AsnVal: 3.084 ± 0.128
0.512AsnTrp: 0.512 ± 0.061
1.744AsnTyr: 1.744 ± 0.094
0.0AsnXaa: 0.0 ± 0.0
Pro
3.28ProAla: 3.28 ± 0.145
0.452ProCys: 0.452 ± 0.052
2.932ProAsp: 2.932 ± 0.137
3.302ProGlu: 3.302 ± 0.154
2.082ProPhe: 2.082 ± 0.117
2.866ProGly: 2.866 ± 0.127
0.79ProHis: 0.79 ± 0.064
2.419ProIle: 2.419 ± 0.13
2.12ProLys: 2.12 ± 0.115
3.64ProLeu: 3.64 ± 0.147
0.921ProMet: 0.921 ± 0.077
1.384ProAsn: 1.384 ± 0.075
1.591ProPro: 1.591 ± 0.126
1.548ProGln: 1.548 ± 0.105
1.466ProArg: 1.466 ± 0.109
2.234ProSer: 2.234 ± 0.119
1.825ProThr: 1.825 ± 0.146
3.329ProVal: 3.329 ± 0.143
0.518ProTrp: 0.518 ± 0.064
1.275ProTyr: 1.275 ± 0.078
0.0ProXaa: 0.0 ± 0.0
Gln
2.517GlnAla: 2.517 ± 0.112
0.305GlnCys: 0.305 ± 0.044
1.291GlnAsp: 1.291 ± 0.081
2.19GlnGlu: 2.19 ± 0.123
1.117GlnPhe: 1.117 ± 0.077
1.983GlnGly: 1.983 ± 0.104
0.518GlnHis: 0.518 ± 0.066
2.844GlnIle: 2.844 ± 0.114
2.801GlnLys: 2.801 ± 0.135
2.823GlnLeu: 2.823 ± 0.128
0.97GlnMet: 0.97 ± 0.069
1.613GlnAsn: 1.613 ± 0.119
1.215GlnPro: 1.215 ± 0.082
1.128GlnGln: 1.128 ± 0.097
1.591GlnArg: 1.591 ± 0.088
2.032GlnSer: 2.032 ± 0.11
1.689GlnThr: 1.689 ± 0.1
1.809GlnVal: 1.809 ± 0.094
0.392GlnTrp: 0.392 ± 0.049
1.144GlnTyr: 1.144 ± 0.08
0.0GlnXaa: 0.0 ± 0.0
Arg
3.291ArgAla: 3.291 ± 0.14
0.539ArgCys: 0.539 ± 0.051
2.332ArgAsp: 2.332 ± 0.121
3.569ArgGlu: 3.569 ± 0.151
2.147ArgPhe: 2.147 ± 0.107
2.899ArgGly: 2.899 ± 0.13
0.937ArgHis: 0.937 ± 0.078
3.912ArgIle: 3.912 ± 0.155
3.787ArgLys: 3.787 ± 0.149
4.621ArgLeu: 4.621 ± 0.183
1.362ArgMet: 1.362 ± 0.086
2.245ArgAsn: 2.245 ± 0.108
1.76ArgPro: 1.76 ± 0.1
1.913ArgGln: 1.913 ± 0.128
2.572ArgArg: 2.572 ± 0.174
2.626ArgSer: 2.626 ± 0.139
1.973ArgThr: 1.973 ± 0.116
2.888ArgVal: 2.888 ± 0.129
0.572ArgTrp: 0.572 ± 0.06
1.755ArgTyr: 1.755 ± 0.109
0.0ArgXaa: 0.0 ± 0.0
Ser
4.261SerAla: 4.261 ± 0.159
0.774SerCys: 0.774 ± 0.064
3.03SerAsp: 3.03 ± 0.128
3.449SerGlu: 3.449 ± 0.163
3.193SerPhe: 3.193 ± 0.142
4.708SerGly: 4.708 ± 0.171
1.166SerHis: 1.166 ± 0.097
4.594SerIle: 4.594 ± 0.159
3.466SerLys: 3.466 ± 0.142
6.239SerLeu: 6.239 ± 0.188
1.591SerMet: 1.591 ± 0.097
2.267SerAsn: 2.267 ± 0.125
2.408SerPro: 2.408 ± 0.118
1.934SerGln: 1.934 ± 0.11
3.084SerArg: 3.084 ± 0.141
3.989SerSer: 3.989 ± 0.148
2.73SerThr: 2.73 ± 0.12
4.087SerVal: 4.087 ± 0.127
0.719SerTrp: 0.719 ± 0.07
2.136SerTyr: 2.136 ± 0.112
0.0SerXaa: 0.0 ± 0.0
Thr
4.327ThrAla: 4.327 ± 0.172
0.583ThrCys: 0.583 ± 0.056
2.632ThrAsp: 2.632 ± 0.121
3.013ThrGlu: 3.013 ± 0.14
2.038ThrPhe: 2.038 ± 0.101
4.457ThrGly: 4.457 ± 0.169
0.905ThrHis: 0.905 ± 0.073
3.891ThrIle: 3.891 ± 0.139
3.193ThrLys: 3.193 ± 0.118
4.561ThrLeu: 4.561 ± 0.173
1.384ThrMet: 1.384 ± 0.092
1.951ThrAsn: 1.951 ± 0.078
2.294ThrPro: 2.294 ± 0.16
1.449ThrGln: 1.449 ± 0.089
2.114ThrArg: 2.114 ± 0.099
3.002ThrSer: 3.002 ± 0.136
2.49ThrThr: 2.49 ± 0.131
3.318ThrVal: 3.318 ± 0.149
0.55ThrTrp: 0.55 ± 0.061
1.58ThrTyr: 1.58 ± 0.096
0.0ThrXaa: 0.0 ± 0.0
Val
4.844ValAla: 4.844 ± 0.162
0.883ValCys: 0.883 ± 0.069
3.678ValAsp: 3.678 ± 0.148
3.951ValGlu: 3.951 ± 0.172
3.166ValPhe: 3.166 ± 0.157
3.994ValGly: 3.994 ± 0.158
1.117ValHis: 1.117 ± 0.075
5.498ValIle: 5.498 ± 0.187
4.256ValLys: 4.256 ± 0.168
6.239ValLeu: 6.239 ± 0.19
1.706ValMet: 1.706 ± 0.101
3.16ValAsn: 3.16 ± 0.127
2.861ValPro: 2.861 ± 0.117
1.82ValGln: 1.82 ± 0.101
2.975ValArg: 2.975 ± 0.13
4.294ValSer: 4.294 ± 0.167
3.689ValThr: 3.689 ± 0.146
4.768ValVal: 4.768 ± 0.171
0.627ValTrp: 0.627 ± 0.059
1.923ValTyr: 1.923 ± 0.101
0.0ValXaa: 0.0 ± 0.0
Trp
0.708TrpAla: 0.708 ± 0.068
0.136TrpCys: 0.136 ± 0.024
0.496TrpAsp: 0.496 ± 0.055
0.757TrpGlu: 0.757 ± 0.072
0.518TrpPhe: 0.518 ± 0.053
0.643TrpGly: 0.643 ± 0.064
0.245TrpHis: 0.245 ± 0.04
0.888TrpIle: 0.888 ± 0.067
0.806TrpLys: 0.806 ± 0.068
1.182TrpLeu: 1.182 ± 0.086
0.387TrpMet: 0.387 ± 0.043
0.523TrpAsn: 0.523 ± 0.051
0.376TrpPro: 0.376 ± 0.051
0.518TrpGln: 0.518 ± 0.055
0.659TrpArg: 0.659 ± 0.064
0.561TrpSer: 0.561 ± 0.056
0.463TrpThr: 0.463 ± 0.057
0.55TrpVal: 0.55 ± 0.069
0.191TrpTrp: 0.191 ± 0.035
0.398TrpTyr: 0.398 ± 0.052
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.354TyrAla: 2.354 ± 0.121
0.474TyrCys: 0.474 ± 0.046
1.891TyrAsp: 1.891 ± 0.107
1.864TyrGlu: 1.864 ± 0.096
2.022TyrPhe: 2.022 ± 0.108
2.376TyrGly: 2.376 ± 0.115
0.774TyrHis: 0.774 ± 0.068
2.131TyrIle: 2.131 ± 0.111
2.136TyrLys: 2.136 ± 0.127
3.509TyrLeu: 3.509 ± 0.149
0.708TyrMet: 0.708 ± 0.058
1.597TyrAsn: 1.597 ± 0.097
1.466TyrPro: 1.466 ± 0.097
1.281TyrGln: 1.281 ± 0.087
1.771TyrArg: 1.771 ± 0.09
2.038TyrSer: 2.038 ± 0.105
1.498TyrThr: 1.498 ± 0.09
1.891TyrVal: 1.891 ± 0.09
0.425TyrTrp: 0.425 ± 0.057
1.498TyrTyr: 1.498 ± 0.104
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 705 proteins (183521 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski