Amino acid dipepetide frequency for TM7 phylum sp. oral taxon 352

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.628AlaAla: 7.628 ± 0.239
0.443AlaCys: 0.443 ± 0.05
4.748AlaAsp: 4.748 ± 0.167
5.668AlaGlu: 5.668 ± 0.199
2.572AlaPhe: 2.572 ± 0.116
6.111AlaGly: 6.111 ± 0.215
1.522AlaHis: 1.522 ± 0.097
6.395AlaIle: 6.395 ± 0.2
5.894AlaLys: 5.894 ± 0.184
7.233AlaLeu: 7.233 ± 0.197
2.018AlaMet: 2.018 ± 0.106
3.328AlaAsn: 3.328 ± 0.128
2.726AlaPro: 2.726 ± 0.136
2.769AlaGln: 2.769 ± 0.113
4.454AlaArg: 4.454 ± 0.153
4.979AlaSer: 4.979 ± 0.162
4.397AlaThr: 4.397 ± 0.157
5.668AlaVal: 5.668 ± 0.177
0.775AlaTrp: 0.775 ± 0.064
2.311AlaTyr: 2.311 ± 0.147
0.0AlaXaa: 0.0 ± 0.0
Cys
0.501CysAla: 0.501 ± 0.049
0.043CysCys: 0.043 ± 0.016
0.429CysAsp: 0.429 ± 0.044
0.356CysGlu: 0.356 ± 0.045
0.299CysPhe: 0.299 ± 0.039
0.602CysGly: 0.602 ± 0.06
0.169CysHis: 0.169 ± 0.025
0.385CysIle: 0.385 ± 0.05
0.4CysLys: 0.4 ± 0.05
0.544CysLeu: 0.544 ± 0.053
0.144CysMet: 0.144 ± 0.026
0.197CysAsn: 0.197 ± 0.032
0.356CysPro: 0.356 ± 0.047
0.27CysGln: 0.27 ± 0.039
0.299CysArg: 0.299 ± 0.039
0.419CysSer: 0.419 ± 0.045
0.217CysThr: 0.217 ± 0.032
0.472CysVal: 0.472 ± 0.05
0.13CysTrp: 0.13 ± 0.042
0.222CysTyr: 0.222 ± 0.034
0.0CysXaa: 0.0 ± 0.0
Asp
4.214AspAla: 4.214 ± 0.155
0.414AspCys: 0.414 ± 0.046
4.175AspAsp: 4.175 ± 0.153
4.955AspGlu: 4.955 ± 0.172
2.634AspPhe: 2.634 ± 0.118
4.719AspGly: 4.719 ± 0.187
0.881AspHis: 0.881 ± 0.065
4.57AspIle: 4.57 ± 0.146
4.382AspLys: 4.382 ± 0.159
4.854AspLeu: 4.854 ± 0.151
1.397AspMet: 1.397 ± 0.08
2.966AspAsn: 2.966 ± 0.141
1.96AspPro: 1.96 ± 0.1
1.859AspGln: 1.859 ± 0.089
2.35AspArg: 2.35 ± 0.103
3.568AspSer: 3.568 ± 0.143
2.774AspThr: 2.774 ± 0.114
4.088AspVal: 4.088 ± 0.149
0.612AspTrp: 0.612 ± 0.061
2.21AspTyr: 2.21 ± 0.12
0.0AspXaa: 0.0 ± 0.0
Glu
4.917GluAla: 4.917 ± 0.169
0.327GluCys: 0.327 ± 0.038
3.231GluAsp: 3.231 ± 0.147
4.609GluGlu: 4.609 ± 0.189
2.726GluPhe: 2.726 ± 0.114
3.159GluGly: 3.159 ± 0.118
0.944GluHis: 0.944 ± 0.072
5.326GluIle: 5.326 ± 0.152
5.923GluLys: 5.923 ± 0.187
7.108GluLeu: 7.108 ± 0.21
1.748GluMet: 1.748 ± 0.11
3.178GluAsn: 3.178 ± 0.135
1.729GluPro: 1.729 ± 0.098
2.461GluGln: 2.461 ± 0.119
3.511GluArg: 3.511 ± 0.135
3.539GluSer: 3.539 ± 0.132
3.019GluThr: 3.019 ± 0.129
4.71GluVal: 4.71 ± 0.167
0.655GluTrp: 0.655 ± 0.062
2.206GluTyr: 2.206 ± 0.103
0.0GluXaa: 0.0 ± 0.0
Phe
3.284PheAla: 3.284 ± 0.131
0.337PheCys: 0.337 ± 0.04
2.49PheAsp: 2.49 ± 0.124
1.926PheGlu: 1.926 ± 0.107
1.531PhePhe: 1.531 ± 0.104
3.096PheGly: 3.096 ± 0.134
0.602PheHis: 0.602 ± 0.064
2.697PheIle: 2.697 ± 0.126
1.926PheLys: 1.926 ± 0.092
3.462PheLeu: 3.462 ± 0.147
0.968PheMet: 0.968 ± 0.066
1.719PheAsn: 1.719 ± 0.091
1.252PhePro: 1.252 ± 0.085
0.891PheGln: 0.891 ± 0.064
1.551PheArg: 1.551 ± 0.086
3.135PheSer: 3.135 ± 0.137
2.364PheThr: 2.364 ± 0.106
2.856PheVal: 2.856 ± 0.123
0.549PheTrp: 0.549 ± 0.058
1.339PheTyr: 1.339 ± 0.077
0.0PheXaa: 0.0 ± 0.0
Gly
5.124GlyAla: 5.124 ± 0.191
0.51GlyCys: 0.51 ± 0.054
4.103GlyAsp: 4.103 ± 0.167
4.474GlyGlu: 4.474 ± 0.133
2.706GlyPhe: 2.706 ± 0.102
5.273GlyGly: 5.273 ± 0.217
1.045GlyHis: 1.045 ± 0.076
4.45GlyIle: 4.45 ± 0.156
5.124GlyLys: 5.124 ± 0.156
6.39GlyLeu: 6.39 ± 0.189
1.502GlyMet: 1.502 ± 0.073
2.841GlyAsn: 2.841 ± 0.138
1.425GlyPro: 1.425 ± 0.081
2.355GlyGln: 2.355 ± 0.121
3.472GlyArg: 3.472 ± 0.15
4.45GlySer: 4.45 ± 0.188
3.265GlyThr: 3.265 ± 0.145
5.909GlyVal: 5.909 ± 0.211
1.011GlyTrp: 1.011 ± 0.065
2.461GlyTyr: 2.461 ± 0.108
0.0GlyXaa: 0.0 ± 0.0
His
1.329HisAla: 1.329 ± 0.084
0.169HisCys: 0.169 ± 0.025
1.064HisAsp: 1.064 ± 0.077
1.108HisGlu: 1.108 ± 0.068
0.636HisPhe: 0.636 ± 0.06
1.103HisGly: 1.103 ± 0.071
0.443HisHis: 0.443 ± 0.052
1.199HisIle: 1.199 ± 0.079
1.079HisLys: 1.079 ± 0.068
1.555HisLeu: 1.555 ± 0.097
0.332HisMet: 0.332 ± 0.037
0.886HisAsn: 0.886 ± 0.068
0.982HisPro: 0.982 ± 0.079
0.645HisGln: 0.645 ± 0.064
0.727HisArg: 0.727 ± 0.057
1.084HisSer: 1.084 ± 0.087
0.929HisThr: 0.929 ± 0.067
0.973HisVal: 0.973 ± 0.065
0.212HisTrp: 0.212 ± 0.038
0.544HisTyr: 0.544 ± 0.058
0.0HisXaa: 0.0 ± 0.0
Ile
6.689IleAla: 6.689 ± 0.206
0.51IleCys: 0.51 ± 0.051
5.471IleAsp: 5.471 ± 0.158
4.946IleGlu: 4.946 ± 0.143
2.774IlePhe: 2.774 ± 0.129
4.859IleGly: 4.859 ± 0.167
1.204IleHis: 1.204 ± 0.082
5.981IleIle: 5.981 ± 0.216
4.373IleLys: 4.373 ± 0.148
6.097IleLeu: 6.097 ± 0.23
1.618IleMet: 1.618 ± 0.098
3.366IleAsn: 3.366 ± 0.124
2.702IlePro: 2.702 ± 0.125
2.128IleGln: 2.128 ± 0.112
3.154IleArg: 3.154 ± 0.133
5.355IleSer: 5.355 ± 0.176
4.161IleThr: 4.161 ± 0.146
5.548IleVal: 5.548 ± 0.148
0.713IleTrp: 0.713 ± 0.055
2.037IleTyr: 2.037 ± 0.105
0.0IleXaa: 0.0 ± 0.0
Lys
4.758LysAla: 4.758 ± 0.182
0.289LysCys: 0.289 ± 0.038
3.843LysAsp: 3.843 ± 0.157
4.32LysGlu: 4.32 ± 0.171
2.336LysPhe: 2.336 ± 0.102
3.352LysGly: 3.352 ± 0.145
1.141LysHis: 1.141 ± 0.081
5.938LysIle: 5.938 ± 0.179
6.535LysLys: 6.535 ± 0.209
6.655LysLeu: 6.655 ± 0.192
2.042LysMet: 2.042 ± 0.107
4.18LysAsn: 4.18 ± 0.161
2.528LysPro: 2.528 ± 0.108
2.543LysGln: 2.543 ± 0.111
3.559LysArg: 3.559 ± 0.146
4.758LysSer: 4.758 ± 0.178
4.334LysThr: 4.334 ± 0.153
4.084LysVal: 4.084 ± 0.148
0.568LysTrp: 0.568 ± 0.055
2.292LysTyr: 2.292 ± 0.096
0.0LysXaa: 0.0 ± 0.0
Leu
8.822LeuAla: 8.822 ± 0.221
0.549LeuCys: 0.549 ± 0.063
5.09LeuAsp: 5.09 ± 0.179
5.48LeuGlu: 5.48 ± 0.172
3.313LeuPhe: 3.313 ± 0.157
5.793LeuGly: 5.793 ± 0.177
1.666LeuHis: 1.666 ± 0.106
6.275LeuIle: 6.275 ± 0.215
5.687LeuLys: 5.687 ± 0.147
8.374LeuLeu: 8.374 ± 0.296
2.23LeuMet: 2.23 ± 0.11
3.814LeuAsn: 3.814 ± 0.156
4.58LeuPro: 4.58 ± 0.148
3.294LeuGln: 3.294 ± 0.149
5.085LeuArg: 5.085 ± 0.163
6.944LeuSer: 6.944 ± 0.188
5.105LeuThr: 5.105 ± 0.14
6.174LeuVal: 6.174 ± 0.2
0.876LeuTrp: 0.876 ± 0.07
2.379LeuTyr: 2.379 ± 0.116
0.0LeuXaa: 0.0 ± 0.0
Met
2.167MetAla: 2.167 ± 0.097
0.193MetCys: 0.193 ± 0.028
1.242MetAsp: 1.242 ± 0.082
1.175MetGlu: 1.175 ± 0.084
0.819MetPhe: 0.819 ± 0.052
1.498MetGly: 1.498 ± 0.084
0.303MetHis: 0.303 ± 0.035
1.825MetIle: 1.825 ± 0.104
1.931MetLys: 1.931 ± 0.104
1.912MetLeu: 1.912 ± 0.081
0.766MetMet: 0.766 ± 0.072
1.295MetAsn: 1.295 ± 0.083
1.002MetPro: 1.002 ± 0.075
0.823MetGln: 0.823 ± 0.071
1.286MetArg: 1.286 ± 0.077
1.979MetSer: 1.979 ± 0.12
1.58MetThr: 1.58 ± 0.085
1.478MetVal: 1.478 ± 0.081
0.178MetTrp: 0.178 ± 0.031
0.689MetTyr: 0.689 ± 0.06
0.0MetXaa: 0.0 ± 0.0
Asn
3.125AsnAla: 3.125 ± 0.128
0.327AsnCys: 0.327 ± 0.042
2.552AsnAsp: 2.552 ± 0.135
2.75AsnGlu: 2.75 ± 0.112
1.676AsnPhe: 1.676 ± 0.085
3.4AsnGly: 3.4 ± 0.138
0.722AsnHis: 0.722 ± 0.056
3.361AsnIle: 3.361 ± 0.143
3.058AsnLys: 3.058 ± 0.12
4.296AsnLeu: 4.296 ± 0.178
1.055AsnMet: 1.055 ± 0.075
2.393AsnAsn: 2.393 ± 0.125
2.523AsnPro: 2.523 ± 0.11
1.743AsnGln: 1.743 ± 0.098
2.297AsnArg: 2.297 ± 0.111
3.337AsnSer: 3.337 ± 0.148
2.061AsnThr: 2.061 ± 0.101
3.034AsnVal: 3.034 ± 0.117
0.549AsnTrp: 0.549 ± 0.053
1.647AsnTyr: 1.647 ± 0.093
0.0AsnXaa: 0.0 ± 0.0
Pro
3.135ProAla: 3.135 ± 0.144
0.178ProCys: 0.178 ± 0.03
2.461ProAsp: 2.461 ± 0.122
3.14ProGlu: 3.14 ± 0.118
1.493ProPhe: 1.493 ± 0.07
2.331ProGly: 2.331 ± 0.097
0.785ProHis: 0.785 ± 0.064
2.6ProIle: 2.6 ± 0.13
2.673ProLys: 2.673 ± 0.102
2.981ProLeu: 2.981 ± 0.123
0.713ProMet: 0.713 ± 0.059
1.763ProAsn: 1.763 ± 0.086
1.026ProPro: 1.026 ± 0.085
1.666ProGln: 1.666 ± 0.142
1.758ProArg: 1.758 ± 0.099
2.504ProSer: 2.504 ± 0.129
2.273ProThr: 2.273 ± 0.118
2.971ProVal: 2.971 ± 0.117
0.323ProTrp: 0.323 ± 0.037
1.04ProTyr: 1.04 ± 0.08
0.0ProXaa: 0.0 ± 0.0
Gln
3.072GlnAla: 3.072 ± 0.121
0.149GlnCys: 0.149 ± 0.029
1.469GlnAsp: 1.469 ± 0.091
1.941GlnGlu: 1.941 ± 0.099
1.44GlnPhe: 1.44 ± 0.078
1.652GlnGly: 1.652 ± 0.094
0.588GlnHis: 0.588 ± 0.053
2.73GlnIle: 2.73 ± 0.128
2.827GlnLys: 2.827 ± 0.113
3.732GlnLeu: 3.732 ± 0.156
0.905GlnMet: 0.905 ± 0.063
1.632GlnAsn: 1.632 ± 0.103
1.825GlnPro: 1.825 ± 0.154
1.753GlnGln: 1.753 ± 0.108
2.027GlnArg: 2.027 ± 0.092
2.263GlnSer: 2.263 ± 0.091
2.287GlnThr: 2.287 ± 0.112
2.153GlnVal: 2.153 ± 0.103
0.366GlnTrp: 0.366 ± 0.048
1.218GlnTyr: 1.218 ± 0.084
0.0GlnXaa: 0.0 ± 0.0
Arg
3.636ArgAla: 3.636 ± 0.159
0.342ArgCys: 0.342 ± 0.048
3.063ArgAsp: 3.063 ± 0.124
3.785ArgGlu: 3.785 ± 0.142
1.917ArgPhe: 1.917 ± 0.099
3.183ArgGly: 3.183 ± 0.134
1.069ArgHis: 1.069 ± 0.077
3.024ArgIle: 3.024 ± 0.133
3.222ArgLys: 3.222 ± 0.129
4.893ArgLeu: 4.893 ± 0.161
1.334ArgMet: 1.334 ± 0.074
2.104ArgAsn: 2.104 ± 0.093
1.772ArgPro: 1.772 ± 0.103
2.658ArgGln: 2.658 ± 0.101
3.592ArgArg: 3.592 ± 0.166
3.082ArgSer: 3.082 ± 0.103
2.62ArgThr: 2.62 ± 0.125
3.409ArgVal: 3.409 ± 0.125
0.525ArgTrp: 0.525 ± 0.057
1.844ArgTyr: 1.844 ± 0.097
0.0ArgXaa: 0.0 ± 0.0
Ser
5.312SerAla: 5.312 ± 0.194
0.39SerCys: 0.39 ± 0.051
4.045SerAsp: 4.045 ± 0.15
4.348SerGlu: 4.348 ± 0.147
2.759SerPhe: 2.759 ± 0.121
5.384SerGly: 5.384 ± 0.16
1.156SerHis: 1.156 ± 0.075
4.353SerIle: 4.353 ± 0.173
4.103SerLys: 4.103 ± 0.145
6.145SerLeu: 6.145 ± 0.179
1.541SerMet: 1.541 ± 0.086
2.682SerAsn: 2.682 ± 0.12
2.615SerPro: 2.615 ± 0.136
2.711SerGln: 2.711 ± 0.118
3.472SerArg: 3.472 ± 0.127
4.618SerSer: 4.618 ± 0.193
3.554SerThr: 3.554 ± 0.142
5.032SerVal: 5.032 ± 0.156
0.732SerTrp: 0.732 ± 0.06
1.753SerTyr: 1.753 ± 0.109
0.0SerXaa: 0.0 ± 0.0
Thr
4.401ThrAla: 4.401 ± 0.143
0.337ThrCys: 0.337 ± 0.05
3.087ThrAsp: 3.087 ± 0.129
3.164ThrGlu: 3.164 ± 0.128
2.047ThrPhe: 2.047 ± 0.101
4.358ThrGly: 4.358 ± 0.158
1.021ThrHis: 1.021 ± 0.066
4.348ThrIle: 4.348 ± 0.156
3.352ThrLys: 3.352 ± 0.145
4.931ThrLeu: 4.931 ± 0.153
1.189ThrMet: 1.189 ± 0.07
2.316ThrAsn: 2.316 ± 0.124
2.976ThrPro: 2.976 ± 0.117
1.527ThrGln: 1.527 ± 0.098
2.307ThrArg: 2.307 ± 0.106
3.246ThrSer: 3.246 ± 0.142
3.39ThrThr: 3.39 ± 0.13
4.103ThrVal: 4.103 ± 0.163
0.539ThrTrp: 0.539 ± 0.069
1.714ThrTyr: 1.714 ± 0.091
0.0ThrXaa: 0.0 ± 0.0
Val
6.361ValAla: 6.361 ± 0.182
0.467ValCys: 0.467 ± 0.05
4.401ValAsp: 4.401 ± 0.149
4.527ValGlu: 4.527 ± 0.162
2.48ValPhe: 2.48 ± 0.133
4.907ValGly: 4.907 ± 0.189
0.992ValHis: 0.992 ± 0.07
5.49ValIle: 5.49 ± 0.19
4.931ValLys: 4.931 ± 0.159
6.308ValLeu: 6.308 ± 0.212
1.777ValMet: 1.777 ± 0.094
3.13ValAsn: 3.13 ± 0.125
2.384ValPro: 2.384 ± 0.122
2.008ValGln: 2.008 ± 0.092
3.636ValArg: 3.636 ± 0.128
4.849ValSer: 4.849 ± 0.15
3.852ValThr: 3.852 ± 0.155
5.769ValVal: 5.769 ± 0.197
0.665ValTrp: 0.665 ± 0.064
2.186ValTyr: 2.186 ± 0.11
0.0ValXaa: 0.0 ± 0.0
Trp
0.742TrpAla: 0.742 ± 0.069
0.14TrpCys: 0.14 ± 0.023
0.506TrpAsp: 0.506 ± 0.057
0.385TrpGlu: 0.385 ± 0.043
0.462TrpPhe: 0.462 ± 0.05
0.66TrpGly: 0.66 ± 0.063
0.231TrpHis: 0.231 ± 0.032
0.597TrpIle: 0.597 ± 0.057
0.679TrpLys: 0.679 ± 0.059
1.281TrpLeu: 1.281 ± 0.107
0.289TrpMet: 0.289 ± 0.038
0.51TrpAsn: 0.51 ± 0.06
0.303TrpPro: 0.303 ± 0.045
0.655TrpGln: 0.655 ± 0.056
0.785TrpArg: 0.785 ± 0.06
0.631TrpSer: 0.631 ± 0.064
0.535TrpThr: 0.535 ± 0.06
0.592TrpVal: 0.592 ± 0.053
0.197TrpTrp: 0.197 ± 0.032
0.299TrpTyr: 0.299 ± 0.045
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.422TyrAla: 2.422 ± 0.112
0.337TyrCys: 0.337 ± 0.051
2.114TyrAsp: 2.114 ± 0.111
1.97TyrGlu: 1.97 ± 0.102
1.358TyrPhe: 1.358 ± 0.096
2.456TyrGly: 2.456 ± 0.112
0.515TyrHis: 0.515 ± 0.052
2.138TyrIle: 2.138 ± 0.111
1.955TyrLys: 1.955 ± 0.104
2.779TyrLeu: 2.779 ± 0.119
0.568TyrMet: 0.568 ± 0.05
1.604TyrAsn: 1.604 ± 0.1
1.146TyrPro: 1.146 ± 0.089
1.271TyrGln: 1.271 ± 0.073
1.753TyrArg: 1.753 ± 0.087
1.965TyrSer: 1.965 ± 0.094
1.637TyrThr: 1.637 ± 0.1
2.1TyrVal: 2.1 ± 0.102
0.303TyrTrp: 0.303 ± 0.039
1.156TyrTyr: 1.156 ± 0.08
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 703 proteins (207660 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski