Amino acid dipeptide frequency for Tropheryma whipplei (strain Twist) (Whipple's bacillus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
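The calculation described above can be sketched as follows. This is a minimal illustration, not the code used by Proteome-pI: the function names `dipeptide_per_mille` and `classify`, the toy sequences, and the choice to count overlapping pairs within each protein and to normalise by the total number of pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: dipeptides are overlapping pairs counted within each
    protein, so a pair never spans two proteins. Any residue outside the
    20 standard letters is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


toy = ["MALLKA", "MLLAX"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_per_mille(toy)
print(len(freqs))                                     # 441
print(round(freqs["LL"], 1))                          # 222.2
print(classify(freqs["LL"]), classify(freqs["WW"]))   # over under
```

The uniform baseline of 2.5‰ is taken from the text above; a baseline built from the observed single amino acid frequencies would be a different, stricter comparison.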

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
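As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the AlaAla value is the one listed in the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 7.796          # AlaAla frequency from the table, in per mille
uniform = 1000.0 / 400   # 0.25 % expressed in per mille, i.e. 2.5

print(f"{per_mille_to_fraction(ala_ala):.6f}")   # 0.007796
print(f"{per_mille_to_percent(ala_ala):.4f}")    # 0.7796
print(f"{ala_ala / uniform:.2f}")                # 3.12
```

Under this uniform baseline, AlaAla occurs roughly three times more often than expected.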

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.796 ± 0.304
AlaCys: 1.248 ± 0.07
AlaAsp: 3.76 ± 0.143
AlaGlu: 3.836 ± 0.136
AlaPhe: 3.388 ± 0.12
AlaGly: 6.4 ± 0.174
AlaHis: 1.559 ± 0.071
AlaIle: 5.755 ± 0.157
AlaLys: 4.204 ± 0.187
AlaLeu: 9.329 ± 0.207
AlaMet: 1.666 ± 0.09
AlaAsn: 2.565 ± 0.1
AlaPro: 2.997 ± 0.138
AlaGln: 2.762 ± 0.117
AlaArg: 5.744 ± 0.158
AlaSer: 5.998 ± 0.162
AlaThr: 4.003 ± 0.153
AlaVal: 7.034 ± 0.167
AlaTrp: 0.88 ± 0.055
AlaTyr: 2.402 ± 0.1
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.191 ± 0.066
CysCys: 0.22 ± 0.029
CysAsp: 0.838 ± 0.063
CysGlu: 0.649 ± 0.054
CysPhe: 0.634 ± 0.048
CysGly: 1.032 ± 0.07
CysHis: 0.266 ± 0.035
CysIle: 0.831 ± 0.057
CysLys: 0.603 ± 0.055
CysLeu: 1.426 ± 0.068
CysMet: 0.254 ± 0.034
CysAsn: 0.565 ± 0.053
CysPro: 0.501 ± 0.051
CysGln: 0.406 ± 0.034
CysArg: 0.713 ± 0.05
CysSer: 0.994 ± 0.061
CysThr: 0.694 ± 0.059
CysVal: 1.123 ± 0.071
CysTrp: 0.091 ± 0.018
CysTyr: 0.326 ± 0.031
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.101 ± 0.136
AspCys: 0.85 ± 0.061
AspAsp: 2.496 ± 0.098
AspGlu: 2.948 ± 0.118
AspPhe: 2.299 ± 0.095
AspGly: 3.464 ± 0.117
AspHis: 0.797 ± 0.058
AspIle: 4.113 ± 0.141
AspLys: 2.519 ± 0.107
AspLeu: 5.797 ± 0.152
AspMet: 1.066 ± 0.065
AspAsn: 1.878 ± 0.1
AspPro: 2.652 ± 0.107
AspGln: 1.491 ± 0.089
AspArg: 3.263 ± 0.133
AspSer: 4.018 ± 0.124
AspThr: 2.902 ± 0.125
AspVal: 4.029 ± 0.136
AspTrp: 0.596 ± 0.043
AspTyr: 1.411 ± 0.07
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.995 ± 0.16
GluCys: 0.675 ± 0.045
GluAsp: 2.428 ± 0.115
GluGlu: 2.644 ± 0.13
GluPhe: 1.734 ± 0.088
GluGly: 3.198 ± 0.131
GluHis: 1.184 ± 0.067
GluIle: 4.321 ± 0.149
GluLys: 3.065 ± 0.131
GluLeu: 4.936 ± 0.145
GluMet: 1.138 ± 0.071
GluAsn: 2.09 ± 0.098
GluPro: 1.719 ± 0.089
GluGln: 1.893 ± 0.102
GluArg: 3.179 ± 0.109
GluSer: 3.543 ± 0.132
GluThr: 2.489 ± 0.102
GluVal: 3.801 ± 0.137
GluTrp: 0.527 ± 0.054
GluTyr: 1.65 ± 0.073
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.703 ± 0.131
PheCys: 0.721 ± 0.054
PheAsp: 2.697 ± 0.105
PheGlu: 1.923 ± 0.094
PhePhe: 2.197 ± 0.11
PheGly: 3.543 ± 0.116
PheHis: 0.694 ± 0.056
PheIle: 2.458 ± 0.104
PheLys: 1.294 ± 0.062
PheLeu: 4.564 ± 0.162
PheMet: 0.861 ± 0.057
PheAsn: 1.074 ± 0.062
PhePro: 1.726 ± 0.084
PheGln: 0.899 ± 0.059
PheArg: 2.208 ± 0.088
PheSer: 3.953 ± 0.135
PheThr: 2.318 ± 0.098
PheVal: 3.43 ± 0.131
PheTrp: 0.512 ± 0.048
PheTyr: 1.297 ± 0.081
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.615 ± 0.149
GlyCys: 0.964 ± 0.07
GlyAsp: 3.593 ± 0.122
GlyGlu: 3.282 ± 0.118
GlyPhe: 3.813 ± 0.125
GlyGly: 4.997 ± 0.169
GlyHis: 1.696 ± 0.091
GlyIle: 5.566 ± 0.162
GlyLys: 3.995 ± 0.127
GlyLeu: 7.459 ± 0.178
GlyMet: 1.707 ± 0.08
GlyAsn: 2.747 ± 0.101
GlyPro: 2.705 ± 0.113
GlyGln: 2.466 ± 0.127
GlyArg: 4.242 ± 0.157
GlySer: 5.338 ± 0.158
GlyThr: 3.843 ± 0.196
GlyVal: 6.514 ± 0.179
GlyTrp: 0.785 ± 0.062
GlyTyr: 2.348 ± 0.089
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.628 ± 0.086
HisCys: 0.186 ± 0.026
HisAsp: 1.134 ± 0.073
HisGlu: 1.055 ± 0.063
HisPhe: 0.755 ± 0.061
HisGly: 1.521 ± 0.078
HisHis: 0.436 ± 0.044
HisIle: 1.62 ± 0.081
HisLys: 0.952 ± 0.053
HisLeu: 1.893 ± 0.089
HisMet: 0.508 ± 0.045
HisAsn: 0.892 ± 0.059
HisPro: 1.305 ± 0.073
HisGln: 0.467 ± 0.043
HisArg: 1.286 ± 0.074
HisSer: 1.692 ± 0.088
HisThr: 1.351 ± 0.063
HisVal: 1.37 ± 0.063
HisTrp: 0.235 ± 0.034
HisTyr: 0.52 ± 0.045
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.764 ± 0.19
IleCys: 0.96 ± 0.067
IleAsp: 4.465 ± 0.137
IleGlu: 3.646 ± 0.144
IlePhe: 2.762 ± 0.119
IleGly: 4.799 ± 0.133
IleHis: 1.153 ± 0.067
IleIle: 3.441 ± 0.119
IleLys: 2.834 ± 0.113
IleLeu: 6.465 ± 0.224
IleMet: 1.074 ± 0.069
IleAsn: 2.291 ± 0.089
IlePro: 3.452 ± 0.103
IleGln: 1.772 ± 0.09
IleArg: 4.094 ± 0.155
IleSer: 5.543 ± 0.134
IleThr: 4.412 ± 0.165
IleVal: 5.228 ± 0.137
IleTrp: 0.592 ± 0.053
IleTyr: 1.666 ± 0.097
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.418 ± 0.112
LysCys: 0.508 ± 0.043
LysAsp: 2.36 ± 0.1
LysGlu: 2.269 ± 0.116
LysPhe: 1.445 ± 0.077
LysGly: 2.743 ± 0.112
LysHis: 1.218 ± 0.072
LysIle: 3.471 ± 0.117
LysLys: 2.902 ± 0.123
LysLeu: 4.283 ± 0.149
LysMet: 1.005 ± 0.067
LysAsn: 2.489 ± 0.111
LysPro: 2.318 ± 0.164
LysGln: 1.844 ± 0.08
LysArg: 3.445 ± 0.112
LysSer: 3.509 ± 0.123
LysThr: 3.096 ± 0.117
LysVal: 3.437 ± 0.128
LysTrp: 0.448 ± 0.041
LysTyr: 1.29 ± 0.069
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.957 ± 0.198
LeuCys: 1.354 ± 0.076
LeuAsp: 5.736 ± 0.18
LeuGlu: 5.725 ± 0.171
LeuPhe: 4.333 ± 0.152
LeuGly: 7.163 ± 0.183
LeuHis: 2.162 ± 0.088
LeuIle: 6.328 ± 0.179
LeuLys: 4.215 ± 0.144
LeuLeu: 11.078 ± 0.306
LeuMet: 1.741 ± 0.07
LeuAsn: 3.009 ± 0.102
LeuPro: 5.368 ± 0.16
LeuGln: 3.058 ± 0.11
LeuArg: 6.787 ± 0.184
LeuSer: 9.697 ± 0.203
LeuThr: 5.042 ± 0.174
LeuVal: 7.588 ± 0.175
LeuTrp: 0.929 ± 0.063
LeuTyr: 2.364 ± 0.102
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.445 ± 0.073
MetCys: 0.243 ± 0.032
MetAsp: 0.77 ± 0.053
MetGlu: 0.709 ± 0.05
MetPhe: 0.782 ± 0.063
MetGly: 1.404 ± 0.077
MetHis: 0.687 ± 0.058
MetIle: 1.017 ± 0.055
MetLys: 0.884 ± 0.065
MetLeu: 2.193 ± 0.09
MetMet: 0.307 ± 0.031
MetAsn: 0.763 ± 0.054
MetPro: 1.191 ± 0.077
MetGln: 0.755 ± 0.045
MetArg: 1.609 ± 0.078
MetSer: 1.851 ± 0.088
MetThr: 1.15 ± 0.072
MetVal: 1.316 ± 0.078
MetTrp: 0.209 ± 0.028
MetTyr: 0.535 ± 0.049
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.781 ± 0.11
AsnCys: 0.436 ± 0.04
AsnAsp: 1.461 ± 0.07
AsnGlu: 1.51 ± 0.077
AsnPhe: 1.305 ± 0.074
AsnGly: 2.28 ± 0.094
AsnHis: 0.607 ± 0.046
AsnIle: 2.678 ± 0.086
AsnLys: 1.832 ± 0.074
AsnLeu: 3.817 ± 0.117
AsnMet: 0.755 ± 0.054
AsnAsn: 1.521 ± 0.08
AsnPro: 2.333 ± 0.092
AsnGln: 1.058 ± 0.068
AsnArg: 2.682 ± 0.109
AsnSer: 2.77 ± 0.116
AsnThr: 2.42 ± 0.125
AsnVal: 2.136 ± 0.086
AsnTrp: 0.425 ± 0.042
AsnTyr: 0.975 ± 0.059
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.414 ± 0.243
ProCys: 0.493 ± 0.047
ProAsp: 2.838 ± 0.123
ProGlu: 2.929 ± 0.106
ProPhe: 1.855 ± 0.076
ProGly: 4.105 ± 0.192
ProHis: 1.051 ± 0.069
ProIle: 2.826 ± 0.097
ProLys: 2.128 ± 0.089
ProLeu: 3.885 ± 0.109
ProMet: 0.653 ± 0.052
ProAsn: 1.703 ± 0.082
ProPro: 1.844 ± 0.094
ProGln: 1.396 ± 0.07
ProArg: 2.489 ± 0.106
ProSer: 3.684 ± 0.157
ProThr: 2.386 ± 0.132
ProVal: 4.132 ± 0.155
ProTrp: 0.52 ± 0.045
ProTyr: 1.282 ± 0.082
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.777 ± 0.122
GlnCys: 0.277 ± 0.041
GlnAsp: 1.571 ± 0.08
GlnGlu: 1.544 ± 0.08
GlnPhe: 1.089 ± 0.061
GlnGly: 1.996 ± 0.107
GlnHis: 0.744 ± 0.065
GlnIle: 2.512 ± 0.095
GlnLys: 1.867 ± 0.091
GlnLeu: 3.039 ± 0.12
GlnMet: 0.66 ± 0.05
GlnAsn: 1.354 ± 0.074
GlnPro: 1.389 ± 0.093
GlnGln: 1.104 ± 0.062
GlnArg: 1.87 ± 0.089
GlnSer: 2.136 ± 0.095
GlnThr: 1.552 ± 0.088
GlnVal: 2.28 ± 0.093
GlnTrp: 0.25 ± 0.033
GlnTyr: 0.899 ± 0.058
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.065 ± 0.135
ArgCys: 0.736 ± 0.049
ArgAsp: 3.358 ± 0.1
ArgGlu: 3.418 ± 0.132
ArgPhe: 2.648 ± 0.108
ArgGly: 4.367 ± 0.14
ArgHis: 1.449 ± 0.072
ArgIle: 4.272 ± 0.15
ArgLys: 3.251 ± 0.116
ArgLeu: 6.248 ± 0.161
ArgMet: 1.483 ± 0.079
ArgAsn: 2.375 ± 0.084
ArgPro: 2.322 ± 0.091
ArgGln: 2.216 ± 0.105
ArgArg: 4.378 ± 0.177
ArgSer: 4.371 ± 0.126
ArgThr: 2.929 ± 0.114
ArgVal: 5.478 ± 0.155
ArgTrp: 0.618 ± 0.052
ArgTyr: 1.931 ± 0.091
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.685 ± 0.164
SerCys: 1.248 ± 0.065
SerAsp: 4.484 ± 0.141
SerGlu: 4.363 ± 0.136
SerPhe: 3.304 ± 0.121
SerGly: 7.447 ± 0.202
SerHis: 1.563 ± 0.072
SerIle: 4.997 ± 0.132
SerLys: 3.251 ± 0.106
SerLeu: 7.865 ± 0.214
SerMet: 1.571 ± 0.08
SerAsn: 2.523 ± 0.097
SerPro: 3.285 ± 0.106
SerGln: 2.39 ± 0.089
SerArg: 4.769 ± 0.144
SerSer: 6.093 ± 0.208
SerThr: 3.517 ± 0.158
SerVal: 6.988 ± 0.184
SerTrp: 0.804 ± 0.051
SerTyr: 2.007 ± 0.095
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.378 ± 0.133
ThrCys: 0.74 ± 0.058
ThrAsp: 2.955 ± 0.123
ThrGlu: 2.512 ± 0.094
ThrPhe: 2.083 ± 0.096
ThrGly: 5.129 ± 0.219
ThrHis: 1.316 ± 0.084
ThrIle: 3.16 ± 0.139
ThrLys: 2.288 ± 0.115
ThrLeu: 5.846 ± 0.187
ThrMet: 0.873 ± 0.066
ThrAsn: 1.882 ± 0.11
ThrPro: 3.202 ± 0.133
ThrGln: 1.84 ± 0.109
ThrArg: 3.141 ± 0.101
ThrSer: 3.817 ± 0.149
ThrThr: 3.236 ± 0.224
ThrVal: 4.382 ± 0.209
ThrTrp: 0.47 ± 0.04
ThrTyr: 1.483 ± 0.166
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.34 ± 0.157
ValCys: 1.104 ± 0.074
ValAsp: 4.021 ± 0.121
ValGlu: 3.452 ± 0.116
ValPhe: 3.942 ± 0.152
ValGly: 5.133 ± 0.134
ValHis: 1.574 ± 0.083
ValIle: 5.63 ± 0.155
ValLys: 3.506 ± 0.117
ValLeu: 8.51 ± 0.198
ValMet: 1.779 ± 0.081
ValAsn: 2.899 ± 0.124
ValPro: 3.475 ± 0.125
ValGln: 1.988 ± 0.096
ValArg: 4.397 ± 0.133
ValSer: 7.026 ± 0.17
ValThr: 4.932 ± 0.217
ValVal: 6.48 ± 0.237
ValTrp: 0.785 ± 0.054
ValTyr: 2.477 ± 0.142
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.873 ± 0.061
TrpCys: 0.095 ± 0.018
TrpAsp: 0.402 ± 0.035
TrpGlu: 0.463 ± 0.04
TrpPhe: 0.508 ± 0.054
TrpGly: 0.778 ± 0.052
TrpHis: 0.285 ± 0.03
TrpIle: 0.618 ± 0.054
TrpLys: 0.474 ± 0.042
TrpLeu: 1.184 ± 0.071
TrpMet: 0.152 ± 0.025
TrpAsn: 0.341 ± 0.04
TrpPro: 0.508 ± 0.05
TrpGln: 0.395 ± 0.038
TrpArg: 0.626 ± 0.052
TrpSer: 0.751 ± 0.053
TrpThr: 0.497 ± 0.05
TrpVal: 0.774 ± 0.058
TrpTrp: 0.19 ± 0.032
TrpTyr: 0.247 ± 0.036
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.398 ± 0.105
TyrCys: 0.3 ± 0.027
TyrAsp: 1.32 ± 0.072
TyrGlu: 1.339 ± 0.083
TyrPhe: 1.199 ± 0.069
TyrGly: 2.174 ± 0.1
TyrHis: 0.364 ± 0.038
TyrIle: 1.798 ± 0.103
TyrLys: 1.434 ± 0.087
TyrLeu: 2.819 ± 0.111
TyrMet: 0.561 ± 0.051
TyrAsn: 0.823 ± 0.059
TyrPro: 1.32 ± 0.069
TyrGln: 0.709 ± 0.052
TyrArg: 1.992 ± 0.077
TyrSer: 2.371 ± 0.105
TyrThr: 1.965 ± 0.182
TyrVal: 1.931 ± 0.106
TyrTrp: 0.292 ± 0.037
TyrTyr: 0.668 ± 0.052
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 805 proteins (263585 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
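A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values serves as the error. Reporting the standard deviation, the function names, the fixed seed and the toy sequences are all assumptions; the page does not state how its ± values are derived beyond the note above.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (e.g. 'LL') in per mille over the sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std dev)."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_per_mille(sample, pair))
    return statistics.mean(values), statistics.stdev(values)


# Hypothetical toy sequences, not taken from this proteome.
toy = ["MALLKALLG", "MKVLAAGLL", "MSTLLPA", "MAAAKG"]
mean, error = bootstrap_error(toy, "LL")
print(f"LL: {mean:.1f} ± {error:.1f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster within a few proteins contribute a correspondingly larger error.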

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski