Amino acid dipepetide frequency for Thermodesulfobacterium commune DSM 2178

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.192AlaAla: 3.192 ± 0.119
0.948AlaCys: 0.948 ± 0.058
2.226AlaAsp: 2.226 ± 0.075
4.658AlaGlu: 4.658 ± 0.096
2.986AlaPhe: 2.986 ± 0.082
4.148AlaGly: 4.148 ± 0.102
1.008AlaHis: 1.008 ± 0.043
4.844AlaIle: 4.844 ± 0.116
5.885AlaLys: 5.885 ± 0.116
7.747AlaLeu: 7.747 ± 0.145
1.265AlaMet: 1.265 ± 0.055
1.942AlaAsn: 1.942 ± 0.074
1.714AlaPro: 1.714 ± 0.068
1.947AlaGln: 1.947 ± 0.073
2.354AlaArg: 2.354 ± 0.081
2.988AlaSer: 2.988 ± 0.079
2.733AlaThr: 2.733 ± 0.084
4.058AlaVal: 4.058 ± 0.1
0.56AlaTrp: 0.56 ± 0.039
2.452AlaTyr: 2.452 ± 0.077
0.0AlaXaa: 0.0 ± 0.0
Cys
0.777CysAla: 0.777 ± 0.042
0.162CysCys: 0.162 ± 0.02
0.423CysAsp: 0.423 ± 0.03
0.746CysGlu: 0.746 ± 0.041
0.622CysPhe: 0.622 ± 0.037
1.134CysGly: 1.134 ± 0.06
0.368CysHis: 0.368 ± 0.068
0.627CysIle: 0.627 ± 0.044
0.813CysLys: 0.813 ± 0.048
1.123CysLeu: 1.123 ± 0.054
0.195CysMet: 0.195 ± 0.023
0.323CysAsn: 0.323 ± 0.027
0.835CysPro: 0.835 ± 0.052
0.31CysGln: 0.31 ± 0.027
0.399CysArg: 0.399 ± 0.03
0.587CysSer: 0.587 ± 0.039
0.379CysThr: 0.379 ± 0.03
0.711CysVal: 0.711 ± 0.048
0.109CysTrp: 0.109 ± 0.016
0.416CysTyr: 0.416 ± 0.03
0.0CysXaa: 0.0 ± 0.0
Asp
1.905AspAla: 1.905 ± 0.061
0.454AspCys: 0.454 ± 0.032
1.236AspAsp: 1.236 ± 0.055
3.256AspGlu: 3.256 ± 0.096
3.156AspPhe: 3.156 ± 0.078
2.425AspGly: 2.425 ± 0.075
0.645AspHis: 0.645 ± 0.035
3.227AspIle: 3.227 ± 0.081
3.249AspLys: 3.249 ± 0.082
6.607AspLeu: 6.607 ± 0.125
0.671AspMet: 0.671 ± 0.039
1.092AspAsn: 1.092 ± 0.059
2.576AspPro: 2.576 ± 0.092
1.227AspGln: 1.227 ± 0.046
1.801AspArg: 1.801 ± 0.066
1.426AspSer: 1.426 ± 0.056
1.59AspThr: 1.59 ± 0.057
2.644AspVal: 2.644 ± 0.072
0.6AspTrp: 0.6 ± 0.035
1.883AspTyr: 1.883 ± 0.069
0.0AspXaa: 0.0 ± 0.0
Glu
5.435GluAla: 5.435 ± 0.121
0.523GluCys: 0.523 ± 0.034
4.009GluAsp: 4.009 ± 0.108
9.413GluGlu: 9.413 ± 0.215
3.504GluPhe: 3.504 ± 0.097
5.331GluGly: 5.331 ± 0.114
1.008GluHis: 1.008 ± 0.045
8.151GluIle: 8.151 ± 0.15
10.332GluLys: 10.332 ± 0.191
8.033GluLeu: 8.033 ± 0.143
1.577GluMet: 1.577 ± 0.065
3.661GluAsn: 3.661 ± 0.096
2.23GluPro: 2.23 ± 0.072
1.641GluGln: 1.641 ± 0.061
4.135GluArg: 4.135 ± 0.107
2.472GluSer: 2.472 ± 0.076
3.821GluThr: 3.821 ± 0.1
7.646GluVal: 7.646 ± 0.151
0.695GluTrp: 0.695 ± 0.038
2.357GluTyr: 2.357 ± 0.071
0.0GluXaa: 0.0 ± 0.0
Phe
2.465PheAla: 2.465 ± 0.076
0.8PheCys: 0.8 ± 0.041
2.279PheAsp: 2.279 ± 0.07
3.688PheGlu: 3.688 ± 0.099
3.477PhePhe: 3.477 ± 0.128
3.256PheGly: 3.256 ± 0.083
0.746PheHis: 0.746 ± 0.036
4.297PheIle: 4.297 ± 0.102
4.372PheLys: 4.372 ± 0.1
7.761PheLeu: 7.761 ± 0.201
0.959PheMet: 0.959 ± 0.048
1.973PheAsn: 1.973 ± 0.077
2.155PhePro: 2.155 ± 0.071
1.539PheGln: 1.539 ± 0.058
1.962PheArg: 1.962 ± 0.077
3.805PheSer: 3.805 ± 0.125
2.403PheThr: 2.403 ± 0.072
3.134PheVal: 3.134 ± 0.098
0.839PheTrp: 0.839 ± 0.055
2.718PheTyr: 2.718 ± 0.094
0.0PheXaa: 0.0 ± 0.0
Gly
4.151GlyAla: 4.151 ± 0.126
0.946GlyCys: 0.946 ± 0.051
2.636GlyAsp: 2.636 ± 0.071
4.817GlyGlu: 4.817 ± 0.115
4.073GlyPhe: 4.073 ± 0.093
4.372GlyGly: 4.372 ± 0.132
1.083GlyHis: 1.083 ± 0.049
6.018GlyIle: 6.018 ± 0.127
5.825GlyLys: 5.825 ± 0.124
7.431GlyLeu: 7.431 ± 0.145
1.486GlyMet: 1.486 ± 0.066
1.801GlyAsn: 1.801 ± 0.068
1.969GlyPro: 1.969 ± 0.062
1.508GlyGln: 1.508 ± 0.057
2.753GlyArg: 2.753 ± 0.081
2.873GlySer: 2.873 ± 0.081
3.054GlyThr: 3.054 ± 0.087
4.983GlyVal: 4.983 ± 0.118
0.751GlyTrp: 0.751 ± 0.044
2.873GlyTyr: 2.873 ± 0.078
0.0GlyXaa: 0.0 ± 0.0
His
0.864HisAla: 0.864 ± 0.045
0.182HisCys: 0.182 ± 0.02
0.485HisAsp: 0.485 ± 0.035
0.791HisGlu: 0.791 ± 0.047
0.868HisPhe: 0.868 ± 0.042
1.063HisGly: 1.063 ± 0.053
0.414HisHis: 0.414 ± 0.036
1.134HisIle: 1.134 ± 0.048
1.116HisLys: 1.116 ± 0.062
2.055HisLeu: 2.055 ± 0.069
0.277HisMet: 0.277 ± 0.023
0.485HisAsn: 0.485 ± 0.035
1.26HisPro: 1.26 ± 0.06
0.596HisGln: 0.596 ± 0.038
0.729HisArg: 0.729 ± 0.038
0.662HisSer: 0.662 ± 0.036
0.788HisThr: 0.788 ± 0.04
0.862HisVal: 0.862 ± 0.051
0.166HisTrp: 0.166 ± 0.017
0.622HisTyr: 0.622 ± 0.038
0.0HisXaa: 0.0 ± 0.0
Ile
5.267IleAla: 5.267 ± 0.124
0.764IleCys: 0.764 ± 0.045
3.798IleAsp: 3.798 ± 0.099
6.675IleGlu: 6.675 ± 0.13
4.055IlePhe: 4.055 ± 0.108
5.048IleGly: 5.048 ± 0.114
1.099IleHis: 1.099 ± 0.055
5.101IleIle: 5.101 ± 0.114
7.688IleLys: 7.688 ± 0.127
8.319IleLeu: 8.319 ± 0.138
1.121IleMet: 1.121 ± 0.058
3.09IleAsn: 3.09 ± 0.083
3.847IlePro: 3.847 ± 0.093
1.878IleGln: 1.878 ± 0.071
3.152IleArg: 3.152 ± 0.076
4.567IleSer: 4.567 ± 0.125
3.816IleThr: 3.816 ± 0.084
4.611IleVal: 4.611 ± 0.115
0.642IleTrp: 0.642 ± 0.037
2.631IleTyr: 2.631 ± 0.078
0.0IleXaa: 0.0 ± 0.0
Lys
6.104LysAla: 6.104 ± 0.116
0.622LysCys: 0.622 ± 0.046
4.644LysAsp: 4.644 ± 0.118
10.848LysGlu: 10.848 ± 0.206
3.3LysPhe: 3.3 ± 0.081
5.949LysGly: 5.949 ± 0.138
1.344LysHis: 1.344 ± 0.056
7.823LysIle: 7.823 ± 0.14
8.908LysLys: 8.908 ± 0.172
7.982LysLeu: 7.982 ± 0.149
1.504LysMet: 1.504 ± 0.065
4.277LysAsn: 4.277 ± 0.107
3.586LysPro: 3.586 ± 0.091
2.345LysGln: 2.345 ± 0.079
4.049LysArg: 4.049 ± 0.102
3.378LysSer: 3.378 ± 0.079
4.656LysThr: 4.656 ± 0.102
7.353LysVal: 7.353 ± 0.155
0.631LysTrp: 0.631 ± 0.043
2.52LysTyr: 2.52 ± 0.073
0.0LysXaa: 0.0 ± 0.0
Leu
7.455LeuAla: 7.455 ± 0.135
1.231LeuCys: 1.231 ± 0.051
4.558LeuAsp: 4.558 ± 0.112
10.044LeuGlu: 10.044 ± 0.179
5.705LeuPhe: 5.705 ± 0.14
7.931LeuGly: 7.931 ± 0.161
1.369LeuHis: 1.369 ± 0.065
8.516LeuIle: 8.516 ± 0.158
12.208LeuLys: 12.208 ± 0.209
11.138LeuLeu: 11.138 ± 0.204
1.962LeuMet: 1.962 ± 0.067
4.578LeuAsn: 4.578 ± 0.127
5.254LeuPro: 5.254 ± 0.132
2.972LeuGln: 2.972 ± 0.093
4.866LeuArg: 4.866 ± 0.099
7.661LeuSer: 7.661 ± 0.159
5.592LeuThr: 5.592 ± 0.116
7.479LeuVal: 7.479 ± 0.138
1.2LeuTrp: 1.2 ± 0.062
3.741LeuTyr: 3.741 ± 0.095
0.0LeuXaa: 0.0 ± 0.0
Met
1.435MetAla: 1.435 ± 0.066
0.173MetCys: 0.173 ± 0.02
0.738MetAsp: 0.738 ± 0.039
1.382MetGlu: 1.382 ± 0.054
0.813MetPhe: 0.813 ± 0.044
1.265MetGly: 1.265 ± 0.056
0.217MetHis: 0.217 ± 0.023
1.216MetIle: 1.216 ± 0.059
1.723MetLys: 1.723 ± 0.058
1.674MetLeu: 1.674 ± 0.063
0.315MetMet: 0.315 ± 0.028
0.569MetAsn: 0.569 ± 0.04
0.813MetPro: 0.813 ± 0.045
0.441MetGln: 0.441 ± 0.032
0.771MetArg: 0.771 ± 0.042
0.999MetSer: 0.999 ± 0.045
0.678MetThr: 0.678 ± 0.044
1.705MetVal: 1.705 ± 0.059
0.126MetTrp: 0.126 ± 0.019
0.434MetTyr: 0.434 ± 0.029
0.0MetXaa: 0.0 ± 0.0
Asn
1.705AsnAla: 1.705 ± 0.063
0.423AsnCys: 0.423 ± 0.039
0.946AsnAsp: 0.946 ± 0.051
1.9AsnGlu: 1.9 ± 0.075
2.277AsnPhe: 2.277 ± 0.079
1.55AsnGly: 1.55 ± 0.048
0.627AsnHis: 0.627 ± 0.043
2.782AsnIle: 2.782 ± 0.088
2.594AsnLys: 2.594 ± 0.075
6.343AsnLeu: 6.343 ± 0.145
0.574AsnMet: 0.574 ± 0.031
1.03AsnAsn: 1.03 ± 0.057
2.795AsnPro: 2.795 ± 0.078
1.502AsnGln: 1.502 ± 0.071
1.561AsnArg: 1.561 ± 0.058
1.756AsnSer: 1.756 ± 0.065
1.524AsnThr: 1.524 ± 0.05
2.175AsnVal: 2.175 ± 0.07
0.498AsnTrp: 0.498 ± 0.032
1.561AsnTyr: 1.561 ± 0.065
0.0AsnXaa: 0.0 ± 0.0
Pro
2.049ProAla: 2.049 ± 0.082
0.414ProCys: 0.414 ± 0.033
1.971ProAsp: 1.971 ± 0.063
4.974ProGlu: 4.974 ± 0.11
2.682ProPhe: 2.682 ± 0.07
2.773ProGly: 2.773 ± 0.082
0.78ProHis: 0.78 ± 0.044
2.744ProIle: 2.744 ± 0.077
3.621ProLys: 3.621 ± 0.091
4.501ProLeu: 4.501 ± 0.101
0.693ProMet: 0.693 ± 0.04
1.517ProAsn: 1.517 ± 0.064
1.854ProPro: 1.854 ± 0.081
1.612ProGln: 1.612 ± 0.064
1.318ProArg: 1.318 ± 0.051
2.361ProSer: 2.361 ± 0.086
1.838ProThr: 1.838 ± 0.074
3.413ProVal: 3.413 ± 0.09
0.54ProTrp: 0.54 ± 0.037
2.135ProTyr: 2.135 ± 0.069
0.0ProXaa: 0.0 ± 0.0
Gln
2.08GlnAla: 2.08 ± 0.068
0.177GlnCys: 0.177 ± 0.022
1.081GlnAsp: 1.081 ± 0.052
2.884GlnGlu: 2.884 ± 0.077
1.101GlnPhe: 1.101 ± 0.053
2.027GlnGly: 2.027 ± 0.076
0.414GlnHis: 0.414 ± 0.032
2.261GlnIle: 2.261 ± 0.082
3.3GlnLys: 3.3 ± 0.102
2.423GlnLeu: 2.423 ± 0.083
0.494GlnMet: 0.494 ± 0.037
1.234GlnAsn: 1.234 ± 0.06
1.165GlnPro: 1.165 ± 0.062
0.901GlnGln: 0.901 ± 0.047
1.417GlnArg: 1.417 ± 0.055
1.123GlnSer: 1.123 ± 0.049
1.586GlnThr: 1.586 ± 0.058
2.292GlnVal: 2.292 ± 0.076
0.215GlnTrp: 0.215 ± 0.024
0.682GlnTyr: 0.682 ± 0.04
0.0GlnXaa: 0.0 ± 0.0
Arg
2.571ArgAla: 2.571 ± 0.079
0.598ArgCys: 0.598 ± 0.036
1.783ArgAsp: 1.783 ± 0.062
3.747ArgGlu: 3.747 ± 0.101
2.558ArgPhe: 2.558 ± 0.081
2.569ArgGly: 2.569 ± 0.079
0.591ArgHis: 0.591 ± 0.036
3.32ArgIle: 3.32 ± 0.091
3.274ArgLys: 3.274 ± 0.092
4.955ArgLeu: 4.955 ± 0.119
0.819ArgMet: 0.819 ± 0.044
1.426ArgAsn: 1.426 ± 0.06
1.542ArgPro: 1.542 ± 0.065
0.997ArgGln: 0.997 ± 0.05
1.887ArgArg: 1.887 ± 0.07
1.674ArgSer: 1.674 ± 0.073
1.615ArgThr: 1.615 ± 0.065
3.141ArgVal: 3.141 ± 0.091
0.478ArgTrp: 0.478 ± 0.031
1.635ArgTyr: 1.635 ± 0.053
0.0ArgXaa: 0.0 ± 0.0
Ser
2.733SerAla: 2.733 ± 0.076
0.611SerCys: 0.611 ± 0.04
1.801SerAsp: 1.801 ± 0.067
3.539SerGlu: 3.539 ± 0.095
3.592SerPhe: 3.592 ± 0.111
3.417SerGly: 3.417 ± 0.091
0.915SerHis: 0.915 ± 0.046
3.307SerIle: 3.307 ± 0.09
3.781SerLys: 3.781 ± 0.096
7.114SerLeu: 7.114 ± 0.145
0.915SerMet: 0.915 ± 0.047
1.55SerAsn: 1.55 ± 0.066
2.591SerPro: 2.591 ± 0.085
1.976SerGln: 1.976 ± 0.071
1.688SerArg: 1.688 ± 0.062
3.203SerSer: 3.203 ± 0.108
2.093SerThr: 2.093 ± 0.062
2.935SerVal: 2.935 ± 0.083
0.596SerTrp: 0.596 ± 0.038
2.122SerTyr: 2.122 ± 0.066
0.0SerXaa: 0.0 ± 0.0
Thr
2.536ThrAla: 2.536 ± 0.081
0.538ThrCys: 0.538 ± 0.039
1.787ThrAsp: 1.787 ± 0.055
3.107ThrGlu: 3.107 ± 0.082
2.618ThrPhe: 2.618 ± 0.079
3.528ThrGly: 3.528 ± 0.093
0.881ThrHis: 0.881 ± 0.046
3.302ThrIle: 3.302 ± 0.078
3.329ThrLys: 3.329 ± 0.087
5.61ThrLeu: 5.61 ± 0.131
0.682ThrMet: 0.682 ± 0.036
1.453ThrAsn: 1.453 ± 0.057
2.565ThrPro: 2.565 ± 0.081
1.597ThrGln: 1.597 ± 0.064
1.524ThrArg: 1.524 ± 0.063
2.602ThrSer: 2.602 ± 0.08
2.111ThrThr: 2.111 ± 0.085
2.85ThrVal: 2.85 ± 0.076
0.412ThrTrp: 0.412 ± 0.031
1.872ThrTyr: 1.872 ± 0.068
0.0ThrXaa: 0.0 ± 0.0
Val
4.306ValAla: 4.306 ± 0.13
1.037ValCys: 1.037 ± 0.048
3.26ValAsp: 3.26 ± 0.085
5.818ValGlu: 5.818 ± 0.124
4.272ValPhe: 4.272 ± 0.097
4.323ValGly: 4.323 ± 0.106
0.906ValHis: 0.906 ± 0.045
5.495ValIle: 5.495 ± 0.103
6.361ValLys: 6.361 ± 0.126
8.361ValLeu: 8.361 ± 0.145
1.324ValMet: 1.324 ± 0.052
2.465ValAsn: 2.465 ± 0.075
2.684ValPro: 2.684 ± 0.066
1.393ValGln: 1.393 ± 0.057
2.713ValArg: 2.713 ± 0.075
4.12ValSer: 4.12 ± 0.099
2.567ValThr: 2.567 ± 0.075
5.858ValVal: 5.858 ± 0.129
0.764ValTrp: 0.764 ± 0.042
2.975ValTyr: 2.975 ± 0.088
0.0ValXaa: 0.0 ± 0.0
Trp
0.676TrpAla: 0.676 ± 0.043
0.097TrpCys: 0.097 ± 0.016
0.554TrpAsp: 0.554 ± 0.037
1.045TrpGlu: 1.045 ± 0.046
0.598TrpPhe: 0.598 ± 0.037
0.795TrpGly: 0.795 ± 0.039
0.162TrpHis: 0.162 ± 0.019
0.766TrpIle: 0.766 ± 0.042
0.738TrpLys: 0.738 ± 0.043
1.172TrpLeu: 1.172 ± 0.057
0.206TrpMet: 0.206 ± 0.022
0.343TrpAsn: 0.343 ± 0.028
0.261TrpPro: 0.261 ± 0.032
0.354TrpGln: 0.354 ± 0.032
0.385TrpArg: 0.385 ± 0.026
0.403TrpSer: 0.403 ± 0.032
0.323TrpThr: 0.323 ± 0.028
0.864TrpVal: 0.864 ± 0.047
0.157TrpTrp: 0.157 ± 0.02
0.401TrpTyr: 0.401 ± 0.031
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.949TyrAla: 1.949 ± 0.069
0.359TyrCys: 0.359 ± 0.028
1.455TyrAsp: 1.455 ± 0.069
2.638TyrGlu: 2.638 ± 0.075
2.37TyrPhe: 2.37 ± 0.08
2.496TyrGly: 2.496 ± 0.077
0.844TyrHis: 0.844 ± 0.043
2.239TyrIle: 2.239 ± 0.073
2.709TyrLys: 2.709 ± 0.07
5.114TyrLeu: 5.114 ± 0.123
0.445TyrMet: 0.445 ± 0.032
1.313TyrAsn: 1.313 ± 0.054
2.095TyrPro: 2.095 ± 0.073
2.122TyrGln: 2.122 ± 0.069
1.716TyrArg: 1.716 ± 0.053
1.818TyrSer: 1.818 ± 0.063
1.739TyrThr: 1.739 ± 0.055
2.288TyrVal: 2.288 ± 0.071
0.319TyrTrp: 0.319 ± 0.028
1.586TyrTyr: 1.586 ± 0.074
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1430 proteins (451505 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski