Amino acid dipepetide frequency for TM7 phylum sp. oral taxon 351

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.436AlaAla: 6.436 ± 0.206
0.399AlaCys: 0.399 ± 0.036
4.418AlaAsp: 4.418 ± 0.14
5.538AlaGlu: 5.538 ± 0.207
2.434AlaPhe: 2.434 ± 0.099
5.559AlaGly: 5.559 ± 0.179
1.403AlaHis: 1.403 ± 0.082
5.822AlaIle: 5.822 ± 0.18
6.127AlaLys: 6.127 ± 0.173
6.725AlaLeu: 6.725 ± 0.198
1.84AlaMet: 1.84 ± 0.095
3.159AlaAsn: 3.159 ± 0.128
2.612AlaPro: 2.612 ± 0.102
2.358AlaGln: 2.358 ± 0.104
3.88AlaArg: 3.88 ± 0.133
4.944AlaSer: 4.944 ± 0.163
4.041AlaThr: 4.041 ± 0.139
4.851AlaVal: 4.851 ± 0.17
0.615AlaTrp: 0.615 ± 0.056
2.29AlaTyr: 2.29 ± 0.099
0.0AlaXaa: 0.0 ± 0.0
Cys
0.403CysAla: 0.403 ± 0.043
0.034CysCys: 0.034 ± 0.012
0.382CysAsp: 0.382 ± 0.047
0.386CysGlu: 0.386 ± 0.044
0.276CysPhe: 0.276 ± 0.033
0.589CysGly: 0.589 ± 0.057
0.191CysHis: 0.191 ± 0.034
0.386CysIle: 0.386 ± 0.04
0.335CysLys: 0.335 ± 0.042
0.657CysLeu: 0.657 ± 0.061
0.127CysMet: 0.127 ± 0.022
0.297CysAsn: 0.297 ± 0.04
0.386CysPro: 0.386 ± 0.046
0.233CysGln: 0.233 ± 0.032
0.259CysArg: 0.259 ± 0.036
0.445CysSer: 0.445 ± 0.054
0.331CysThr: 0.331 ± 0.038
0.36CysVal: 0.36 ± 0.043
0.072CysTrp: 0.072 ± 0.018
0.246CysTyr: 0.246 ± 0.031
0.0CysXaa: 0.0 ± 0.0
Asp
3.918AspAla: 3.918 ± 0.131
0.386AspCys: 0.386 ± 0.044
3.519AspAsp: 3.519 ± 0.142
4.571AspGlu: 4.571 ± 0.164
3.108AspPhe: 3.108 ± 0.102
4.032AspGly: 4.032 ± 0.151
1.056AspHis: 1.056 ± 0.084
4.287AspIle: 4.287 ± 0.151
4.007AspLys: 4.007 ± 0.145
5.614AspLeu: 5.614 ± 0.166
1.268AspMet: 1.268 ± 0.079
2.794AspAsn: 2.794 ± 0.118
2.264AspPro: 2.264 ± 0.101
2.268AspGln: 2.268 ± 0.114
2.417AspArg: 2.417 ± 0.112
3.384AspSer: 3.384 ± 0.142
2.879AspThr: 2.879 ± 0.129
3.511AspVal: 3.511 ± 0.131
0.522AspTrp: 0.522 ± 0.05
2.565AspTyr: 2.565 ± 0.12
0.0AspXaa: 0.0 ± 0.0
Glu
5.101GluAla: 5.101 ± 0.181
0.335GluCys: 0.335 ± 0.037
3.871GluAsp: 3.871 ± 0.147
5.147GluGlu: 5.147 ± 0.186
2.582GluPhe: 2.582 ± 0.106
3.388GluGly: 3.388 ± 0.141
1.119GluHis: 1.119 ± 0.063
5.8GluIle: 5.8 ± 0.164
6.61GluLys: 6.61 ± 0.207
6.801GluLeu: 6.801 ± 0.18
1.815GluMet: 1.815 ± 0.098
3.443GluAsn: 3.443 ± 0.139
1.675GluPro: 1.675 ± 0.079
2.798GluGln: 2.798 ± 0.114
3.46GluArg: 3.46 ± 0.139
3.74GluSer: 3.74 ± 0.111
3.451GluThr: 3.451 ± 0.114
4.707GluVal: 4.707 ± 0.142
0.572GluTrp: 0.572 ± 0.047
2.417GluTyr: 2.417 ± 0.105
0.0GluXaa: 0.0 ± 0.0
Phe
3.248PheAla: 3.248 ± 0.119
0.36PheCys: 0.36 ± 0.04
2.688PheAsp: 2.688 ± 0.124
2.247PheGlu: 2.247 ± 0.107
1.751PhePhe: 1.751 ± 0.115
2.866PheGly: 2.866 ± 0.118
0.636PheHis: 0.636 ± 0.052
2.93PheIle: 2.93 ± 0.149
2.374PheLys: 2.374 ± 0.106
3.812PheLeu: 3.812 ± 0.165
1.043PheMet: 1.043 ± 0.068
2.086PheAsn: 2.086 ± 0.105
1.293PhePro: 1.293 ± 0.084
1.115PheGln: 1.115 ± 0.064
1.632PheArg: 1.632 ± 0.076
3.252PheSer: 3.252 ± 0.13
2.387PheThr: 2.387 ± 0.102
2.65PheVal: 2.65 ± 0.105
0.53PheTrp: 0.53 ± 0.047
1.556PheTyr: 1.556 ± 0.08
0.0PheXaa: 0.0 ± 0.0
Gly
4.584GlyAla: 4.584 ± 0.145
0.424GlyCys: 0.424 ± 0.051
3.676GlyAsp: 3.676 ± 0.141
4.461GlyGlu: 4.461 ± 0.131
2.875GlyPhe: 2.875 ± 0.11
4.842GlyGly: 4.842 ± 0.207
1.064GlyHis: 1.064 ± 0.072
4.567GlyIle: 4.567 ± 0.136
4.948GlyLys: 4.948 ± 0.159
6.118GlyLeu: 6.118 ± 0.191
1.509GlyMet: 1.509 ± 0.088
2.739GlyAsn: 2.739 ± 0.124
1.395GlyPro: 1.395 ± 0.08
2.4GlyGln: 2.4 ± 0.104
3.172GlyArg: 3.172 ± 0.127
3.905GlySer: 3.905 ± 0.141
3.265GlyThr: 3.265 ± 0.118
5.245GlyVal: 5.245 ± 0.181
0.772GlyTrp: 0.772 ± 0.056
2.218GlyTyr: 2.218 ± 0.092
0.0GlyXaa: 0.0 ± 0.0
His
1.264HisAla: 1.264 ± 0.07
0.178HisCys: 0.178 ± 0.026
0.916HisAsp: 0.916 ± 0.079
1.128HisGlu: 1.128 ± 0.07
0.691HisPhe: 0.691 ± 0.061
1.225HisGly: 1.225 ± 0.083
0.424HisHis: 0.424 ± 0.043
1.238HisIle: 1.238 ± 0.072
1.035HisLys: 1.035 ± 0.068
1.789HisLeu: 1.789 ± 0.098
0.314HisMet: 0.314 ± 0.039
0.878HisAsn: 0.878 ± 0.065
0.886HisPro: 0.886 ± 0.065
0.81HisGln: 0.81 ± 0.072
0.784HisArg: 0.784 ± 0.055
1.085HisSer: 1.085 ± 0.063
1.035HisThr: 1.035 ± 0.065
1.035HisVal: 1.035 ± 0.063
0.161HisTrp: 0.161 ± 0.027
0.619HisTyr: 0.619 ± 0.05
0.0HisXaa: 0.0 ± 0.0
Ile
6.216IleAla: 6.216 ± 0.163
0.589IleCys: 0.589 ± 0.054
4.974IleAsp: 4.974 ± 0.153
5.296IleGlu: 5.296 ± 0.162
3.091IlePhe: 3.091 ± 0.173
4.393IleGly: 4.393 ± 0.154
1.213IleHis: 1.213 ± 0.07
6.118IleIle: 6.118 ± 0.222
5.114IleLys: 5.114 ± 0.16
7.06IleLeu: 7.06 ± 0.198
1.637IleMet: 1.637 ± 0.078
3.655IleAsn: 3.655 ± 0.138
2.603IlePro: 2.603 ± 0.106
2.268IleGln: 2.268 ± 0.11
3.35IleArg: 3.35 ± 0.117
5.724IleSer: 5.724 ± 0.176
4.312IleThr: 4.312 ± 0.145
4.872IleVal: 4.872 ± 0.158
0.564IleTrp: 0.564 ± 0.056
2.379IleTyr: 2.379 ± 0.1
0.0IleXaa: 0.0 ± 0.0
Lys
4.842LysAla: 4.842 ± 0.16
0.28LysCys: 0.28 ± 0.031
3.948LysAsp: 3.948 ± 0.136
5.109LysGlu: 5.109 ± 0.178
2.417LysPhe: 2.417 ± 0.107
3.396LysGly: 3.396 ± 0.131
1.242LysHis: 1.242 ± 0.084
6.555LysIle: 6.555 ± 0.155
7.073LysLys: 7.073 ± 0.204
6.682LysLeu: 6.682 ± 0.193
2.095LysMet: 2.095 ± 0.111
4.74LysAsn: 4.74 ± 0.142
2.514LysPro: 2.514 ± 0.102
2.964LysGln: 2.964 ± 0.117
3.405LysArg: 3.405 ± 0.136
4.456LysSer: 4.456 ± 0.148
4.982LysThr: 4.982 ± 0.147
4.367LysVal: 4.367 ± 0.153
0.598LysTrp: 0.598 ± 0.05
2.582LysTyr: 2.582 ± 0.1
0.0LysXaa: 0.0 ± 0.0
Leu
7.857LeuAla: 7.857 ± 0.207
0.568LeuCys: 0.568 ± 0.046
5.538LeuAsp: 5.538 ± 0.169
5.813LeuGlu: 5.813 ± 0.185
3.384LeuPhe: 3.384 ± 0.152
5.792LeuGly: 5.792 ± 0.167
1.582LeuHis: 1.582 ± 0.092
7.064LeuIle: 7.064 ± 0.2
6.576LeuLys: 6.576 ± 0.173
7.853LeuLeu: 7.853 ± 0.289
2.137LeuMet: 2.137 ± 0.093
4.562LeuAsn: 4.562 ± 0.153
4.202LeuPro: 4.202 ± 0.14
2.994LeuGln: 2.994 ± 0.113
4.592LeuArg: 4.592 ± 0.147
7.221LeuSer: 7.221 ± 0.189
5.457LeuThr: 5.457 ± 0.151
6.038LeuVal: 6.038 ± 0.196
0.78LeuTrp: 0.78 ± 0.059
2.455LeuTyr: 2.455 ± 0.116
0.0LeuXaa: 0.0 ± 0.0
Met
1.878MetAla: 1.878 ± 0.086
0.14MetCys: 0.14 ± 0.024
1.272MetAsp: 1.272 ± 0.085
1.387MetGlu: 1.387 ± 0.081
0.975MetPhe: 0.975 ± 0.102
1.497MetGly: 1.497 ± 0.092
0.343MetHis: 0.343 ± 0.038
1.688MetIle: 1.688 ± 0.081
1.959MetLys: 1.959 ± 0.083
1.611MetLeu: 1.611 ± 0.078
0.789MetMet: 0.789 ± 0.07
1.353MetAsn: 1.353 ± 0.072
0.967MetPro: 0.967 ± 0.064
0.95MetGln: 0.95 ± 0.069
1.187MetArg: 1.187 ± 0.073
2.035MetSer: 2.035 ± 0.109
1.48MetThr: 1.48 ± 0.078
1.412MetVal: 1.412 ± 0.091
0.127MetTrp: 0.127 ± 0.021
0.64MetTyr: 0.64 ± 0.057
0.0MetXaa: 0.0 ± 0.0
Asn
2.985AsnAla: 2.985 ± 0.116
0.432AsnCys: 0.432 ± 0.046
2.358AsnAsp: 2.358 ± 0.104
2.921AsnGlu: 2.921 ± 0.135
2.239AsnPhe: 2.239 ± 0.093
3.299AsnGly: 3.299 ± 0.122
0.865AsnHis: 0.865 ± 0.059
3.706AsnIle: 3.706 ± 0.142
3.29AsnLys: 3.29 ± 0.119
5.287AsnLeu: 5.287 ± 0.175
1.251AsnMet: 1.251 ± 0.079
2.671AsnAsn: 2.671 ± 0.124
2.586AsnPro: 2.586 ± 0.101
2.209AsnGln: 2.209 ± 0.106
2.256AsnArg: 2.256 ± 0.096
3.307AsnSer: 3.307 ± 0.144
2.858AsnThr: 2.858 ± 0.117
3.002AsnVal: 3.002 ± 0.137
0.522AsnTrp: 0.522 ± 0.045
1.827AsnTyr: 1.827 ± 0.102
0.0AsnXaa: 0.0 ± 0.0
Pro
2.743ProAla: 2.743 ± 0.125
0.148ProCys: 0.148 ± 0.026
2.425ProAsp: 2.425 ± 0.087
3.312ProGlu: 3.312 ± 0.142
1.446ProPhe: 1.446 ± 0.068
2.251ProGly: 2.251 ± 0.089
0.746ProHis: 0.746 ± 0.06
2.493ProIle: 2.493 ± 0.115
2.595ProLys: 2.595 ± 0.111
2.938ProLeu: 2.938 ± 0.114
0.666ProMet: 0.666 ± 0.06
1.836ProAsn: 1.836 ± 0.089
0.984ProPro: 0.984 ± 0.059
1.285ProGln: 1.285 ± 0.098
1.467ProArg: 1.467 ± 0.075
2.413ProSer: 2.413 ± 0.119
2.277ProThr: 2.277 ± 0.109
2.998ProVal: 2.998 ± 0.122
0.343ProTrp: 0.343 ± 0.038
1.039ProTyr: 1.039 ± 0.062
0.0ProXaa: 0.0 ± 0.0
Gln
2.921GlnAla: 2.921 ± 0.124
0.161GlnCys: 0.161 ± 0.027
1.705GlnAsp: 1.705 ± 0.091
2.366GlnGlu: 2.366 ± 0.129
1.353GlnPhe: 1.353 ± 0.073
1.717GlnGly: 1.717 ± 0.087
0.619GlnHis: 0.619 ± 0.05
3.07GlnIle: 3.07 ± 0.126
3.29GlnLys: 3.29 ± 0.125
3.536GlnLeu: 3.536 ± 0.131
0.835GlnMet: 0.835 ± 0.067
2.226GlnAsn: 2.226 ± 0.092
1.497GlnPro: 1.497 ± 0.088
1.59GlnGln: 1.59 ± 0.103
1.781GlnArg: 1.781 ± 0.09
2.065GlnSer: 2.065 ± 0.096
2.421GlnThr: 2.421 ± 0.111
2.095GlnVal: 2.095 ± 0.091
0.322GlnTrp: 0.322 ± 0.038
1.281GlnTyr: 1.281 ± 0.075
0.0GlnXaa: 0.0 ± 0.0
Arg
3.282ArgAla: 3.282 ± 0.12
0.288ArgCys: 0.288 ± 0.043
2.642ArgAsp: 2.642 ± 0.104
3.71ArgGlu: 3.71 ± 0.141
2.035ArgPhe: 2.035 ± 0.096
3.015ArgGly: 3.015 ± 0.115
0.937ArgHis: 0.937 ± 0.064
3.256ArgIle: 3.256 ± 0.122
3.328ArgLys: 3.328 ± 0.124
4.609ArgLeu: 4.609 ± 0.149
1.221ArgMet: 1.221 ± 0.073
2.294ArgAsn: 2.294 ± 0.111
1.7ArgPro: 1.7 ± 0.083
2.107ArgGln: 2.107 ± 0.101
2.964ArgArg: 2.964 ± 0.139
2.663ArgSer: 2.663 ± 0.112
2.51ArgThr: 2.51 ± 0.106
3.006ArgVal: 3.006 ± 0.131
0.445ArgTrp: 0.445 ± 0.044
1.641ArgTyr: 1.641 ± 0.076
0.0ArgXaa: 0.0 ± 0.0
Ser
4.719SerAla: 4.719 ± 0.169
0.454SerCys: 0.454 ± 0.053
3.888SerAsp: 3.888 ± 0.145
4.681SerGlu: 4.681 ± 0.151
2.985SerPhe: 2.985 ± 0.119
5.067SerGly: 5.067 ± 0.165
1.124SerHis: 1.124 ± 0.068
4.355SerIle: 4.355 ± 0.15
4.499SerLys: 4.499 ± 0.167
6.254SerLeu: 6.254 ± 0.163
1.531SerMet: 1.531 ± 0.084
3.116SerAsn: 3.116 ± 0.15
2.374SerPro: 2.374 ± 0.115
2.65SerGln: 2.65 ± 0.105
3.29SerArg: 3.29 ± 0.114
4.876SerSer: 4.876 ± 0.217
3.583SerThr: 3.583 ± 0.144
4.55SerVal: 4.55 ± 0.135
0.636SerTrp: 0.636 ± 0.055
1.997SerTyr: 1.997 ± 0.091
0.0SerXaa: 0.0 ± 0.0
Thr
4.346ThrAla: 4.346 ± 0.146
0.403ThrCys: 0.403 ± 0.05
3.405ThrAsp: 3.405 ± 0.112
3.714ThrGlu: 3.714 ± 0.127
2.137ThrPhe: 2.137 ± 0.113
4.037ThrGly: 4.037 ± 0.147
1.043ThrHis: 1.043 ± 0.06
4.35ThrIle: 4.35 ± 0.166
3.833ThrLys: 3.833 ± 0.122
5.067ThrLeu: 5.067 ± 0.161
1.132ThrMet: 1.132 ± 0.071
2.731ThrAsn: 2.731 ± 0.115
2.871ThrPro: 2.871 ± 0.129
1.683ThrGln: 1.683 ± 0.088
2.273ThrArg: 2.273 ± 0.095
3.808ThrSer: 3.808 ± 0.149
3.774ThrThr: 3.774 ± 0.169
4.045ThrVal: 4.045 ± 0.147
0.432ThrTrp: 0.432 ± 0.042
1.832ThrTyr: 1.832 ± 0.084
0.0ThrXaa: 0.0 ± 0.0
Val
5.605ValAla: 5.605 ± 0.187
0.454ValCys: 0.454 ± 0.044
4.143ValAsp: 4.143 ± 0.132
4.55ValGlu: 4.55 ± 0.147
2.599ValPhe: 2.599 ± 0.111
4.27ValGly: 4.27 ± 0.162
1.03ValHis: 1.03 ± 0.066
5.279ValIle: 5.279 ± 0.167
4.66ValLys: 4.66 ± 0.152
5.817ValLeu: 5.817 ± 0.168
1.611ValMet: 1.611 ± 0.091
3.032ValAsn: 3.032 ± 0.115
2.235ValPro: 2.235 ± 0.114
1.832ValGln: 1.832 ± 0.086
3.138ValArg: 3.138 ± 0.126
4.634ValSer: 4.634 ± 0.151
3.765ValThr: 3.765 ± 0.141
5.16ValVal: 5.16 ± 0.196
0.53ValTrp: 0.53 ± 0.049
2.086ValTyr: 2.086 ± 0.089
0.0ValXaa: 0.0 ± 0.0
Trp
0.594TrpAla: 0.594 ± 0.05
0.11TrpCys: 0.11 ± 0.022
0.538TrpAsp: 0.538 ± 0.049
0.39TrpGlu: 0.39 ± 0.041
0.445TrpPhe: 0.445 ± 0.043
0.611TrpGly: 0.611 ± 0.058
0.182TrpHis: 0.182 ± 0.031
0.496TrpIle: 0.496 ± 0.049
0.505TrpLys: 0.505 ± 0.05
0.899TrpLeu: 0.899 ± 0.06
0.225TrpMet: 0.225 ± 0.032
0.462TrpAsn: 0.462 ± 0.049
0.293TrpPro: 0.293 ± 0.034
0.674TrpGln: 0.674 ± 0.057
0.594TrpArg: 0.594 ± 0.053
0.585TrpSer: 0.585 ± 0.062
0.432TrpThr: 0.432 ± 0.042
0.522TrpVal: 0.522 ± 0.044
0.14TrpTrp: 0.14 ± 0.024
0.267TrpTyr: 0.267 ± 0.033
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.506TyrAla: 2.506 ± 0.107
0.25TyrCys: 0.25 ± 0.034
2.065TyrAsp: 2.065 ± 0.095
2.112TyrGlu: 2.112 ± 0.094
1.637TyrPhe: 1.637 ± 0.092
2.332TyrGly: 2.332 ± 0.114
0.674TyrHis: 0.674 ± 0.052
1.967TyrIle: 1.967 ± 0.091
2.09TyrLys: 2.09 ± 0.101
3.333TyrLeu: 3.333 ± 0.114
0.64TyrMet: 0.64 ± 0.053
1.798TyrAsn: 1.798 ± 0.102
1.069TyrPro: 1.069 ± 0.059
1.662TyrGln: 1.662 ± 0.086
1.755TyrArg: 1.755 ± 0.084
2.086TyrSer: 2.086 ± 0.099
1.675TyrThr: 1.675 ± 0.085
2.006TyrVal: 2.006 ± 0.101
0.28TyrTrp: 0.28 ± 0.031
1.336TyrTyr: 1.336 ± 0.074
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 797 proteins (235844 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski