Amino acid dipeptide frequency for Thermosipho atlanticus DSM 15807

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red and those that are underrepresented are marked in blue in the table.
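
The uniform expectation above can be checked with a short calculation. The following is a minimal sketch, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping dipeptides within each protein (never across protein boundaries) are assumptions made for illustration.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Overlapping dipeptides are counted within each protein; residues outside
    the 20 standard letters are mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2

# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["MKKLLEEK", "MEKIL"])
for pair, value in sorted(freqs.items()):
    label = "over" if value > EXPECTED_PER_MILLE else "under"
    print(f"{pair}: {value:.1f} ({label}-represented)")
```

With only a handful of residues every observed dipeptide is far above 2.5‰; the comparison becomes meaningful only for a full proteome, where most of the 400 dipeptides are actually observed.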

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
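
As a quick illustration of the unit conversion, the sketch below turns one per mille value from the table into a fraction and a percentage. The helper name `per_mille_to_fraction` is an assumption introduced for this example only.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


glu_lys = 9.657  # GluLys value from the table, in per mille
fraction = per_mille_to_fraction(glu_lys)
print(f"GluLys: {fraction:.6f} as a fraction, {fraction * 100:.4f}%")
```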

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Second residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.466 ± 0.115
AlaCys: 0.317 ± 0.03
AlaAsp: 2.193 ± 0.078
AlaGlu: 3.087 ± 0.092
AlaPhe: 2.788 ± 0.076
AlaGly: 3.754 ± 0.095
AlaHis: 0.832 ± 0.045
AlaIle: 5.53 ± 0.12
AlaLys: 4.638 ± 0.109
AlaLeu: 5.572 ± 0.102
AlaMet: 1.291 ± 0.051
AlaAsn: 2.129 ± 0.064
AlaPro: 1.457 ± 0.055
AlaGln: 1.299 ± 0.053
AlaArg: 2.281 ± 0.071
AlaSer: 2.75 ± 0.069
AlaThr: 2.602 ± 0.075
AlaVal: 3.744 ± 0.11
AlaTrp: 0.375 ± 0.028
AlaTyr: 2.197 ± 0.075
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.311 ± 0.026
CysCys: 0.06 ± 0.011
CysAsp: 0.369 ± 0.026
CysGlu: 0.449 ± 0.027
CysPhe: 0.245 ± 0.021
CysGly: 0.665 ± 0.043
CysHis: 0.118 ± 0.017
CysIle: 0.415 ± 0.031
CysLys: 0.505 ± 0.033
CysLeu: 0.375 ± 0.029
CysMet: 0.102 ± 0.012
CysAsn: 0.329 ± 0.03
CysPro: 0.433 ± 0.035
CysGln: 0.128 ± 0.017
CysArg: 0.208 ± 0.024
CysSer: 0.331 ± 0.025
CysThr: 0.309 ± 0.026
CysVal: 0.373 ± 0.032
CysTrp: 0.052 ± 0.008
CysTyr: 0.224 ± 0.021
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.327 ± 0.073
AspCys: 0.287 ± 0.027
AspAsp: 2.129 ± 0.076
AspGlu: 4.259 ± 0.106
AspPhe: 3.253 ± 0.084
AspGly: 3.091 ± 0.083
AspHis: 0.555 ± 0.032
AspIle: 5.141 ± 0.109
AspLys: 3.868 ± 0.1
AspLeu: 4.564 ± 0.077
AspMet: 1.014 ± 0.05
AspAsn: 2.253 ± 0.069
AspPro: 1.922 ± 0.069
AspGln: 0.748 ± 0.036
AspArg: 1.511 ± 0.062
AspSer: 2.427 ± 0.061
AspThr: 2.022 ± 0.066
AspVal: 3.642 ± 0.09
AspTrp: 0.479 ± 0.03
AspTyr: 2.351 ± 0.077
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.714 ± 0.095
GluCys: 0.291 ± 0.026
GluAsp: 3.452 ± 0.078
GluGlu: 6.586 ± 0.163
GluPhe: 3.67 ± 0.097
GluGly: 3.796 ± 0.094
GluHis: 1.09 ± 0.055
GluIle: 8.565 ± 0.144
GluLys: 9.657 ± 0.17
GluLeu: 7.18 ± 0.149
GluMet: 1.852 ± 0.061
GluAsn: 5.272 ± 0.109
GluPro: 1.768 ± 0.056
GluGln: 1.523 ± 0.058
GluArg: 3.035 ± 0.094
GluSer: 2.914 ± 0.091
GluThr: 3.233 ± 0.09
GluVal: 4.891 ± 0.096
GluTrp: 0.587 ± 0.031
GluTyr: 3.173 ± 0.082
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.734 ± 0.086
PheCys: 0.345 ± 0.023
PheAsp: 3.187 ± 0.09
PheGlu: 4.424 ± 0.106
PhePhe: 3.245 ± 0.117
PheGly: 4.017 ± 0.117
PheHis: 0.74 ± 0.04
PheIle: 4.772 ± 0.14
PheLys: 4.283 ± 0.095
PheLeu: 6.157 ± 0.135
PheMet: 1.026 ± 0.052
PheAsn: 2.808 ± 0.075
PhePro: 1.91 ± 0.064
PheGln: 1.09 ± 0.049
PheArg: 1.668 ± 0.065
PheSer: 4.604 ± 0.117
PheThr: 2.453 ± 0.082
PheVal: 4.323 ± 0.103
PheTrp: 0.521 ± 0.033
PheTyr: 2.524 ± 0.077
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.834 ± 0.111
GlyCys: 0.493 ± 0.035
GlyAsp: 2.846 ± 0.082
GlyGlu: 4.103 ± 0.095
GlyPhe: 3.578 ± 0.079
GlyGly: 4.223 ± 0.107
GlyHis: 1.054 ± 0.046
GlyIle: 6.963 ± 0.13
GlyLys: 6.552 ± 0.126
GlyLeu: 5.55 ± 0.142
GlyMet: 1.63 ± 0.054
GlyAsn: 3.057 ± 0.089
GlyPro: 1.638 ± 0.06
GlyGln: 1.305 ± 0.052
GlyArg: 2.303 ± 0.063
GlySer: 3.123 ± 0.074
GlyThr: 3.536 ± 0.081
GlyVal: 4.632 ± 0.102
GlyTrp: 0.625 ± 0.04
GlyTyr: 3.005 ± 0.083
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.826 ± 0.041
HisCys: 0.128 ± 0.014
HisAsp: 0.613 ± 0.037
HisGlu: 0.956 ± 0.054
HisPhe: 0.874 ± 0.042
HisGly: 1.163 ± 0.048
HisHis: 0.365 ± 0.037
HisIle: 1.293 ± 0.059
HisLys: 0.96 ± 0.044
HisLeu: 1.369 ± 0.048
HisMet: 0.295 ± 0.026
HisAsn: 0.653 ± 0.041
HisPro: 0.846 ± 0.045
HisGln: 0.261 ± 0.022
HisArg: 0.583 ± 0.033
HisSer: 0.876 ± 0.045
HisThr: 0.661 ± 0.038
HisVal: 1.034 ± 0.041
HisTrp: 0.138 ± 0.018
HisTyr: 0.591 ± 0.036
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.692 ± 0.124
IleCys: 0.603 ± 0.037
IleAsp: 5.468 ± 0.106
IleGlu: 7.637 ± 0.144
IlePhe: 5.921 ± 0.139
IleGly: 5.959 ± 0.123
IleHis: 1.263 ± 0.055
IleIle: 8.914 ± 0.179
IleLys: 8.436 ± 0.149
IleLeu: 10.16 ± 0.153
IleMet: 1.83 ± 0.064
IleAsn: 4.993 ± 0.093
IlePro: 4.123 ± 0.103
IleGln: 1.898 ± 0.068
IleArg: 3.347 ± 0.073
IleSer: 6.919 ± 0.141
IleThr: 4.692 ± 0.106
IleVal: 7.11 ± 0.118
IleTrp: 0.742 ± 0.041
IleTyr: 3.85 ± 0.092
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.285 ± 0.098
LysCys: 0.527 ± 0.037
LysAsp: 4.586 ± 0.114
LysGlu: 8.481 ± 0.169
LysPhe: 4.37 ± 0.086
LysGly: 4.877 ± 0.106
LysHis: 1.305 ± 0.055
LysIle: 10.495 ± 0.152
LysLys: 9.487 ± 0.148
LysLeu: 8.543 ± 0.152
LysMet: 2.135 ± 0.076
LysAsn: 6.143 ± 0.134
LysPro: 2.57 ± 0.072
LysGln: 1.882 ± 0.064
LysArg: 3.602 ± 0.096
LysSer: 4.438 ± 0.095
LysThr: 4.125 ± 0.091
LysVal: 6.727 ± 0.129
LysTrp: 0.79 ± 0.04
LysTyr: 4.586 ± 0.108
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.107 ± 0.103
LeuCys: 0.501 ± 0.03
LeuAsp: 4.544 ± 0.097
LeuGlu: 7.49 ± 0.136
LeuPhe: 5.264 ± 0.13
LeuGly: 6.228 ± 0.12
LeuHis: 1.273 ± 0.052
LeuIle: 8.903 ± 0.158
LeuLys: 10.244 ± 0.153
LeuLeu: 8.775 ± 0.155
LeuMet: 2.081 ± 0.067
LeuAsn: 5.44 ± 0.118
LeuPro: 3.442 ± 0.083
LeuGln: 1.88 ± 0.058
LeuArg: 3.752 ± 0.085
LeuSer: 6.849 ± 0.136
LeuThr: 4.7 ± 0.111
LeuVal: 6.029 ± 0.115
LeuTrp: 0.772 ± 0.04
LeuTyr: 3.606 ± 0.074
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.261 ± 0.057
MetCys: 0.134 ± 0.017
MetAsp: 0.932 ± 0.049
MetGlu: 1.489 ± 0.054
MetPhe: 1.06 ± 0.049
MetGly: 1.421 ± 0.06
MetHis: 0.303 ± 0.028
MetIle: 2.016 ± 0.067
MetLys: 2.586 ± 0.076
MetLeu: 1.986 ± 0.062
MetMet: 0.463 ± 0.036
MetAsn: 1.169 ± 0.054
MetPro: 0.762 ± 0.043
MetGln: 0.415 ± 0.028
MetArg: 1.028 ± 0.039
MetSer: 1.09 ± 0.046
MetThr: 0.928 ± 0.043
MetVal: 1.409 ± 0.052
MetTrp: 0.184 ± 0.017
MetTyr: 0.89 ± 0.038
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.58 ± 0.076
AsnCys: 0.335 ± 0.025
AsnAsp: 2.403 ± 0.076
AsnGlu: 3.961 ± 0.087
AsnPhe: 3.397 ± 0.092
AsnGly: 3.49 ± 0.1
AsnHis: 0.635 ± 0.038
AsnIle: 5.678 ± 0.108
AsnLys: 4.333 ± 0.09
AsnLeu: 5.795 ± 0.119
AsnMet: 1.126 ± 0.046
AsnAsn: 2.932 ± 0.1
AsnPro: 2.303 ± 0.074
AsnGln: 0.976 ± 0.048
AsnArg: 1.598 ± 0.064
AsnSer: 3.033 ± 0.088
AsnThr: 2.277 ± 0.065
AsnVal: 3.981 ± 0.083
AsnTrp: 0.563 ± 0.036
AsnTyr: 2.554 ± 0.071
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.521 ± 0.06
ProCys: 0.2 ± 0.019
ProAsp: 1.954 ± 0.071
ProGlu: 3.201 ± 0.085
ProPhe: 2.113 ± 0.064
ProGly: 2.349 ± 0.075
ProHis: 0.603 ± 0.034
ProIle: 3.159 ± 0.073
ProLys: 2.696 ± 0.071
ProLeu: 3.027 ± 0.081
ProMet: 0.716 ± 0.042
ProAsn: 1.76 ± 0.062
ProPro: 1.002 ± 0.055
ProGln: 0.806 ± 0.045
ProArg: 1.092 ± 0.045
ProSer: 1.83 ± 0.059
ProThr: 1.922 ± 0.059
ProVal: 2.594 ± 0.071
ProTrp: 0.311 ± 0.026
ProTyr: 1.561 ± 0.064
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.0 ± 0.047
GlnCys: 0.092 ± 0.015
GlnAsp: 0.764 ± 0.042
GlnGlu: 1.531 ± 0.061
GlnPhe: 1.032 ± 0.054
GlnGly: 1.056 ± 0.041
GlnHis: 0.343 ± 0.029
GlnIle: 2.135 ± 0.068
GlnLys: 2.151 ± 0.062
GlnLeu: 2.099 ± 0.064
GlnMet: 0.547 ± 0.032
GlnAsn: 1.287 ± 0.053
GlnPro: 0.617 ± 0.033
GlnGln: 0.507 ± 0.035
GlnArg: 0.902 ± 0.046
GlnSer: 0.902 ± 0.05
GlnThr: 0.96 ± 0.04
GlnVal: 1.283 ± 0.054
GlnTrp: 0.186 ± 0.018
GlnTyr: 0.8 ± 0.043
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.768 ± 0.07
ArgCys: 0.259 ± 0.024
ArgAsp: 1.469 ± 0.053
ArgGlu: 2.754 ± 0.092
ArgPhe: 2.032 ± 0.058
ArgGly: 2.251 ± 0.079
ArgHis: 0.545 ± 0.035
ArgIle: 3.652 ± 0.104
ArgLys: 4.087 ± 0.109
ArgLeu: 3.231 ± 0.081
ArgMet: 0.932 ± 0.043
ArgAsn: 2.119 ± 0.059
ArgPro: 1.068 ± 0.045
ArgGln: 0.804 ± 0.039
ArgArg: 1.76 ± 0.07
ArgSer: 1.499 ± 0.056
ArgThr: 1.672 ± 0.058
ArgVal: 2.399 ± 0.075
ArgTrp: 0.325 ± 0.03
ArgTyr: 1.692 ± 0.063
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.7 ± 0.061
SerCys: 0.351 ± 0.029
SerAsp: 2.451 ± 0.076
SerGlu: 3.907 ± 0.087
SerPhe: 3.893 ± 0.12
SerGly: 4.099 ± 0.096
SerHis: 0.862 ± 0.033
SerIle: 5.6 ± 0.121
SerLys: 5.358 ± 0.121
SerLeu: 5.917 ± 0.133
SerMet: 1.215 ± 0.051
SerAsn: 2.942 ± 0.081
SerPro: 1.992 ± 0.069
SerGln: 1.225 ± 0.056
SerArg: 1.96 ± 0.063
SerSer: 3.518 ± 0.087
SerThr: 2.748 ± 0.085
SerVal: 3.572 ± 0.09
SerTrp: 0.549 ± 0.032
SerTyr: 2.391 ± 0.075
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.61 ± 0.075
ThrCys: 0.293 ± 0.026
ThrAsp: 2.067 ± 0.074
ThrGlu: 2.85 ± 0.069
ThrPhe: 2.772 ± 0.07
ThrGly: 3.586 ± 0.105
ThrHis: 0.854 ± 0.044
ThrIle: 4.642 ± 0.088
ThrLys: 3.526 ± 0.091
ThrLeu: 4.819 ± 0.107
ThrMet: 0.922 ± 0.054
ThrAsn: 2.419 ± 0.073
ThrPro: 2.325 ± 0.073
ThrGln: 0.882 ± 0.047
ThrArg: 1.541 ± 0.053
ThrSer: 2.656 ± 0.072
ThrThr: 2.495 ± 0.085
ThrVal: 3.235 ± 0.081
ThrTrp: 0.395 ± 0.03
ThrTyr: 1.844 ± 0.059
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.975 ± 0.096
ValCys: 0.437 ± 0.032
ValAsp: 3.704 ± 0.088
ValGlu: 5.36 ± 0.129
ValPhe: 3.989 ± 0.098
ValGly: 4.514 ± 0.094
ValHis: 0.93 ± 0.047
ValIle: 6.919 ± 0.141
ValLys: 6.093 ± 0.134
ValLeu: 6.604 ± 0.119
ValMet: 1.387 ± 0.054
ValAsn: 3.299 ± 0.096
ValPro: 2.546 ± 0.079
ValGln: 1.421 ± 0.05
ValArg: 2.319 ± 0.071
ValSer: 4.305 ± 0.089
ValThr: 3.153 ± 0.08
ValVal: 5.097 ± 0.119
ValTrp: 0.597 ± 0.033
ValTyr: 2.866 ± 0.071
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.427 ± 0.032
TrpCys: 0.066 ± 0.01
TrpAsp: 0.491 ± 0.037
TrpGlu: 0.599 ± 0.035
TrpPhe: 0.445 ± 0.03
TrpGly: 0.639 ± 0.038
TrpHis: 0.168 ± 0.019
TrpIle: 0.834 ± 0.043
TrpLys: 0.852 ± 0.041
TrpLeu: 0.748 ± 0.042
TrpMet: 0.235 ± 0.021
TrpAsn: 0.577 ± 0.036
TrpPro: 0.245 ± 0.024
TrpGln: 0.21 ± 0.023
TrpArg: 0.391 ± 0.026
TrpSer: 0.405 ± 0.029
TrpThr: 0.347 ± 0.026
TrpVal: 0.489 ± 0.032
TrpTrp: 0.156 ± 0.019
TrpTyr: 0.447 ± 0.031
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.165 ± 0.071
TyrCys: 0.281 ± 0.022
TyrAsp: 2.219 ± 0.076
TyrGlu: 3.127 ± 0.083
TyrPhe: 2.888 ± 0.078
TyrGly: 2.872 ± 0.089
TyrHis: 0.647 ± 0.035
TyrIle: 3.931 ± 0.107
TyrLys: 3.646 ± 0.09
TyrLeu: 4.408 ± 0.092
TyrMet: 0.722 ± 0.035
TyrAsn: 2.369 ± 0.068
TyrPro: 1.447 ± 0.056
TyrGln: 0.934 ± 0.047
TyrArg: 1.439 ± 0.053
TyrSer: 2.782 ± 0.065
TyrThr: 1.9 ± 0.064
TyrVal: 2.95 ± 0.076
TyrTrp: 0.425 ± 0.028
TyrTyr: 2.026 ± 0.066
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1560 proteins (501096 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
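
Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread across resamples serves as the error. This is an illustration under stated assumptions, not the database's actual code. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the reported error are all hypothetical choices; only the 100 repetitions and the protein-level resampling come from the note above.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide, counting overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (same number of proteins).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MKKLLEEK", "MEKIL", "MKEKLIEK", "MLLKKE", "MEEKK"]
value = pair_per_mille(toy, "EK")
error = bootstrap_error(toy, "EK")
print(f"EK: {value:.1f} ± {error:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.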

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski