Amino acid dipepetide frequency for Thermoplasma acidophilum (strain ATCC 25905 / DSM 1728 / JCM 9062 / NBRC 15155 / AMRC-C165)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.262AlaAla: 5.262 ± 0.141
0.413AlaCys: 0.413 ± 0.032
3.731AlaAsp: 3.731 ± 0.087
3.905AlaGlu: 3.905 ± 0.101
3.539AlaPhe: 3.539 ± 0.1
5.512AlaGly: 5.512 ± 0.126
0.964AlaHis: 0.964 ± 0.047
6.749AlaIle: 6.749 ± 0.135
3.563AlaLys: 3.563 ± 0.106
6.427AlaLeu: 6.427 ± 0.139
2.575AlaMet: 2.575 ± 0.075
2.332AlaAsn: 2.332 ± 0.086
1.926AlaPro: 1.926 ± 0.074
1.436AlaGln: 1.436 ± 0.064
3.817AlaArg: 3.817 ± 0.093
5.298AlaSer: 5.298 ± 0.113
2.923AlaThr: 2.923 ± 0.085
5.745AlaVal: 5.745 ± 0.137
0.596AlaTrp: 0.596 ± 0.04
2.89AlaTyr: 2.89 ± 0.078
0.0AlaXaa: 0.0 ± 0.0
Cys
0.32CysAla: 0.32 ± 0.026
0.053CysCys: 0.053 ± 0.011
0.45CysAsp: 0.45 ± 0.034
0.293CysGlu: 0.293 ± 0.025
0.21CysPhe: 0.21 ± 0.022
0.702CysGly: 0.702 ± 0.046
0.163CysHis: 0.163 ± 0.021
0.386CysIle: 0.386 ± 0.031
0.267CysLys: 0.267 ± 0.026
0.342CysLeu: 0.342 ± 0.029
0.159CysMet: 0.159 ± 0.018
0.265CysAsn: 0.265 ± 0.03
0.472CysPro: 0.472 ± 0.038
0.135CysGln: 0.135 ± 0.022
0.366CysArg: 0.366 ± 0.026
0.483CysSer: 0.483 ± 0.037
0.302CysThr: 0.302 ± 0.024
0.364CysVal: 0.364 ± 0.029
0.051CysTrp: 0.051 ± 0.011
0.223CysTyr: 0.223 ± 0.025
0.0CysXaa: 0.0 ± 0.0
Asp
3.773AspAla: 3.773 ± 0.101
0.252AspCys: 0.252 ± 0.024
3.071AspAsp: 3.071 ± 0.099
3.91AspGlu: 3.91 ± 0.104
2.641AspPhe: 2.641 ± 0.072
3.799AspGly: 3.799 ± 0.092
1.255AspHis: 1.255 ± 0.065
4.986AspIle: 4.986 ± 0.116
2.151AspLys: 2.151 ± 0.084
6.699AspLeu: 6.699 ± 0.137
1.75AspMet: 1.75 ± 0.061
1.664AspAsn: 1.664 ± 0.06
3.124AspPro: 3.124 ± 0.08
1.624AspGln: 1.624 ± 0.064
3.691AspArg: 3.691 ± 0.111
3.281AspSer: 3.281 ± 0.096
2.321AspThr: 2.321 ± 0.072
4.382AspVal: 4.382 ± 0.117
0.439AspTrp: 0.439 ± 0.03
2.698AspTyr: 2.698 ± 0.076
0.0AspXaa: 0.0 ± 0.0
Glu
4.4GluAla: 4.4 ± 0.119
0.338GluCys: 0.338 ± 0.027
3.914GluAsp: 3.914 ± 0.104
4.333GluGlu: 4.333 ± 0.123
2.575GluPhe: 2.575 ± 0.082
3.51GluGly: 3.51 ± 0.096
0.898GluHis: 0.898 ± 0.046
6.039GluIle: 6.039 ± 0.135
4.938GluLys: 4.938 ± 0.138
3.711GluLeu: 3.711 ± 0.095
2.129GluMet: 2.129 ± 0.07
3.44GluAsn: 3.44 ± 0.09
1.606GluPro: 1.606 ± 0.062
0.993GluGln: 0.993 ± 0.052
3.497GluArg: 3.497 ± 0.097
3.526GluSer: 3.526 ± 0.087
2.906GluThr: 2.906 ± 0.086
3.788GluVal: 3.788 ± 0.101
0.536GluTrp: 0.536 ± 0.037
2.762GluTyr: 2.762 ± 0.086
0.0GluXaa: 0.0 ± 0.0
Phe
3.248PheAla: 3.248 ± 0.093
0.267PheCys: 0.267 ± 0.022
2.52PheAsp: 2.52 ± 0.065
2.33PheGlu: 2.33 ± 0.074
2.456PhePhe: 2.456 ± 0.087
3.464PheGly: 3.464 ± 0.107
0.852PheHis: 0.852 ± 0.048
4.002PheIle: 4.002 ± 0.122
1.811PheLys: 1.811 ± 0.072
4.691PheLeu: 4.691 ± 0.154
1.251PheMet: 1.251 ± 0.052
2.041PheAsn: 2.041 ± 0.071
1.939PhePro: 1.939 ± 0.062
1.041PheGln: 1.041 ± 0.047
2.566PheArg: 2.566 ± 0.082
4.126PheSer: 4.126 ± 0.114
2.438PheThr: 2.438 ± 0.086
3.321PheVal: 3.321 ± 0.084
0.446PheTrp: 0.446 ± 0.035
2.202PheTyr: 2.202 ± 0.083
0.0PheXaa: 0.0 ± 0.0
Gly
3.799GlyAla: 3.799 ± 0.103
0.494GlyCys: 0.494 ± 0.037
3.528GlyAsp: 3.528 ± 0.101
3.923GlyGlu: 3.923 ± 0.092
3.705GlyPhe: 3.705 ± 0.106
4.927GlyGly: 4.927 ± 0.112
1.277GlyHis: 1.277 ± 0.051
7.519GlyIle: 7.519 ± 0.119
4.825GlyLys: 4.825 ± 0.12
6.211GlyLeu: 6.211 ± 0.142
2.211GlyMet: 2.211 ± 0.064
3.177GlyAsn: 3.177 ± 0.102
2.147GlyPro: 2.147 ± 0.069
1.584GlyGln: 1.584 ± 0.068
3.93GlyArg: 3.93 ± 0.092
6.008GlySer: 6.008 ± 0.117
3.956GlyThr: 3.956 ± 0.098
4.783GlyVal: 4.783 ± 0.11
0.746GlyTrp: 0.746 ± 0.051
3.76GlyTyr: 3.76 ± 0.112
0.0GlyXaa: 0.0 ± 0.0
His
1.103HisAla: 1.103 ± 0.051
0.13HisCys: 0.13 ± 0.016
0.971HisAsp: 0.971 ± 0.043
0.874HisGlu: 0.874 ± 0.044
0.735HisPhe: 0.735 ± 0.044
1.525HisGly: 1.525 ± 0.055
0.329HisHis: 0.329 ± 0.034
1.388HisIle: 1.388 ± 0.054
0.554HisLys: 0.554 ± 0.039
1.454HisLeu: 1.454 ± 0.066
0.532HisMet: 0.532 ± 0.037
0.613HisAsn: 0.613 ± 0.038
0.905HisPro: 0.905 ± 0.049
0.364HisGln: 0.364 ± 0.027
0.964HisArg: 0.964 ± 0.052
1.114HisSer: 1.114 ± 0.053
0.766HisThr: 0.766 ± 0.037
1.269HisVal: 1.269 ± 0.059
0.17HisTrp: 0.17 ± 0.022
0.697HisTyr: 0.697 ± 0.042
0.0HisXaa: 0.0 ± 0.0
Ile
7.034IleAla: 7.034 ± 0.139
0.439IleCys: 0.439 ± 0.033
5.536IleAsp: 5.536 ± 0.136
5.403IleGlu: 5.403 ± 0.133
4.208IlePhe: 4.208 ± 0.108
6.632IleGly: 6.632 ± 0.135
1.209IleHis: 1.209 ± 0.056
7.175IleIle: 7.175 ± 0.17
4.594IleLys: 4.594 ± 0.12
7.515IleLeu: 7.515 ± 0.145
2.701IleMet: 2.701 ± 0.087
3.693IleAsn: 3.693 ± 0.099
4.027IlePro: 4.027 ± 0.113
1.567IleGln: 1.567 ± 0.064
5.022IleArg: 5.022 ± 0.117
8.029IleSer: 8.029 ± 0.156
4.748IleThr: 4.748 ± 0.103
6.394IleVal: 6.394 ± 0.114
0.633IleTrp: 0.633 ± 0.041
3.656IleTyr: 3.656 ± 0.085
0.0IleXaa: 0.0 ± 0.0
Lys
3.804LysAla: 3.804 ± 0.099
0.415LysCys: 0.415 ± 0.031
3.371LysAsp: 3.371 ± 0.108
3.863LysGlu: 3.863 ± 0.122
2.299LysPhe: 2.299 ± 0.078
3.166LysGly: 3.166 ± 0.101
0.938LysHis: 0.938 ± 0.044
5.353LysIle: 5.353 ± 0.118
4.468LysLys: 4.468 ± 0.12
4.042LysLeu: 4.042 ± 0.104
1.935LysMet: 1.935 ± 0.067
3.21LysAsn: 3.21 ± 0.087
2.187LysPro: 2.187 ± 0.071
1.214LysGln: 1.214 ± 0.058
3.257LysArg: 3.257 ± 0.092
3.005LysSer: 3.005 ± 0.09
2.668LysThr: 2.668 ± 0.094
3.766LysVal: 3.766 ± 0.102
0.468LysTrp: 0.468 ± 0.032
2.981LysTyr: 2.981 ± 0.08
0.0LysXaa: 0.0 ± 0.0
Leu
6.341LeuAla: 6.341 ± 0.14
0.472LeuCys: 0.472 ± 0.036
4.483LeuAsp: 4.483 ± 0.109
4.678LeuGlu: 4.678 ± 0.122
3.916LeuPhe: 3.916 ± 0.141
5.816LeuGly: 5.816 ± 0.13
1.366LeuHis: 1.366 ± 0.057
7.475LeuIle: 7.475 ± 0.179
5.434LeuLys: 5.434 ± 0.126
6.615LeuLeu: 6.615 ± 0.165
2.683LeuMet: 2.683 ± 0.077
3.731LeuAsn: 3.731 ± 0.085
3.391LeuPro: 3.391 ± 0.101
1.825LeuGln: 1.825 ± 0.06
4.486LeuArg: 4.486 ± 0.101
7.186LeuSer: 7.186 ± 0.131
4.168LeuThr: 4.168 ± 0.121
5.441LeuVal: 5.441 ± 0.131
0.649LeuTrp: 0.649 ± 0.043
3.682LeuTyr: 3.682 ± 0.093
0.0LeuXaa: 0.0 ± 0.0
Met
2.612MetAla: 2.612 ± 0.085
0.154MetCys: 0.154 ± 0.018
2.061MetAsp: 2.061 ± 0.067
2.072MetGlu: 2.072 ± 0.08
1.258MetPhe: 1.258 ± 0.057
2.125MetGly: 2.125 ± 0.063
0.638MetHis: 0.638 ± 0.037
3.003MetIle: 3.003 ± 0.082
2.268MetLys: 2.268 ± 0.079
2.237MetLeu: 2.237 ± 0.068
1.037MetMet: 1.037 ± 0.049
1.617MetAsn: 1.617 ± 0.057
1.425MetPro: 1.425 ± 0.056
0.865MetGln: 0.865 ± 0.05
1.699MetArg: 1.699 ± 0.062
2.081MetSer: 2.081 ± 0.061
1.432MetThr: 1.432 ± 0.052
2.16MetVal: 2.16 ± 0.081
0.203MetTrp: 0.203 ± 0.024
1.008MetTyr: 1.008 ± 0.05
0.0MetXaa: 0.0 ± 0.0
Asn
3.212AsnAla: 3.212 ± 0.099
0.225AsnCys: 0.225 ± 0.022
2.112AsnAsp: 2.112 ± 0.079
2.35AsnGlu: 2.35 ± 0.083
1.924AsnPhe: 1.924 ± 0.063
3.766AsnGly: 3.766 ± 0.098
0.552AsnHis: 0.552 ± 0.034
3.885AsnIle: 3.885 ± 0.098
1.664AsnLys: 1.664 ± 0.062
3.634AsnLeu: 3.634 ± 0.083
1.306AsnMet: 1.306 ± 0.056
1.542AsnAsn: 1.542 ± 0.072
2.153AsnPro: 2.153 ± 0.079
0.878AsnGln: 0.878 ± 0.052
2.29AsnArg: 2.29 ± 0.077
2.974AsnSer: 2.974 ± 0.093
2.162AsnThr: 2.162 ± 0.09
3.484AsnVal: 3.484 ± 0.091
0.324AsnTrp: 0.324 ± 0.027
2.076AsnTyr: 2.076 ± 0.077
0.0AsnXaa: 0.0 ± 0.0
Pro
2.727ProAla: 2.727 ± 0.085
0.214ProCys: 0.214 ± 0.023
2.848ProAsp: 2.848 ± 0.085
3.243ProGlu: 3.243 ± 0.09
1.973ProPhe: 1.973 ± 0.072
2.97ProGly: 2.97 ± 0.085
0.664ProHis: 0.664 ± 0.041
2.811ProIle: 2.811 ± 0.087
1.944ProLys: 1.944 ± 0.066
3.206ProLeu: 3.206 ± 0.085
1.112ProMet: 1.112 ± 0.05
1.289ProAsn: 1.289 ± 0.06
1.364ProPro: 1.364 ± 0.071
0.988ProGln: 0.988 ± 0.043
1.498ProArg: 1.498 ± 0.053
2.952ProSer: 2.952 ± 0.086
1.774ProThr: 1.774 ± 0.069
3.581ProVal: 3.581 ± 0.098
0.426ProTrp: 0.426 ± 0.033
2.05ProTyr: 2.05 ± 0.079
0.0ProXaa: 0.0 ± 0.0
Gln
1.633GlnAla: 1.633 ± 0.064
0.132GlnCys: 0.132 ± 0.018
1.24GlnAsp: 1.24 ± 0.056
1.333GlnGlu: 1.333 ± 0.058
0.953GlnPhe: 0.953 ± 0.044
1.372GlnGly: 1.372 ± 0.049
0.342GlnHis: 0.342 ± 0.028
2.134GlnIle: 2.134 ± 0.083
1.564GlnLys: 1.564 ± 0.06
1.522GlnLeu: 1.522 ± 0.064
0.845GlnMet: 0.845 ± 0.047
1.063GlnAsn: 1.063 ± 0.053
0.726GlnPro: 0.726 ± 0.039
0.549GlnGln: 0.549 ± 0.044
1.304GlnArg: 1.304 ± 0.059
1.255GlnSer: 1.255 ± 0.059
0.891GlnThr: 0.891 ± 0.047
1.399GlnVal: 1.399 ± 0.06
0.227GlnTrp: 0.227 ± 0.024
1.086GlnTyr: 1.086 ± 0.053
0.0GlnXaa: 0.0 ± 0.0
Arg
2.829ArgAla: 2.829 ± 0.092
0.446ArgCys: 0.446 ± 0.037
3.283ArgAsp: 3.283 ± 0.091
3.495ArgGlu: 3.495 ± 0.105
2.31ArgPhe: 2.31 ± 0.085
3.109ArgGly: 3.109 ± 0.092
0.907ArgHis: 0.907 ± 0.045
5.856ArgIle: 5.856 ± 0.12
4.172ArgLys: 4.172 ± 0.111
3.727ArgLeu: 3.727 ± 0.096
2.059ArgMet: 2.059 ± 0.072
2.795ArgAsn: 2.795 ± 0.078
1.849ArgPro: 1.849 ± 0.064
1.103ArgGln: 1.103 ± 0.055
3.466ArgArg: 3.466 ± 0.1
4.622ArgSer: 4.622 ± 0.107
2.453ArgThr: 2.453 ± 0.067
3.204ArgVal: 3.204 ± 0.081
0.439ArgTrp: 0.439 ± 0.029
2.866ArgTyr: 2.866 ± 0.087
0.0ArgXaa: 0.0 ± 0.0
Ser
5.167SerAla: 5.167 ± 0.119
0.437SerCys: 0.437 ± 0.031
4.313SerAsp: 4.313 ± 0.113
4.32SerGlu: 4.32 ± 0.102
3.735SerPhe: 3.735 ± 0.114
6.502SerGly: 6.502 ± 0.137
1.167SerHis: 1.167 ± 0.053
6.804SerIle: 6.804 ± 0.15
3.857SerLys: 3.857 ± 0.096
6.663SerLeu: 6.663 ± 0.143
2.575SerMet: 2.575 ± 0.08
2.802SerAsn: 2.802 ± 0.105
2.853SerPro: 2.853 ± 0.091
1.617SerGln: 1.617 ± 0.064
3.848SerArg: 3.848 ± 0.1
5.801SerSer: 5.801 ± 0.153
3.543SerThr: 3.543 ± 0.092
5.551SerVal: 5.551 ± 0.113
0.666SerTrp: 0.666 ± 0.04
3.455SerTyr: 3.455 ± 0.084
0.0SerXaa: 0.0 ± 0.0
Thr
3.652ThrAla: 3.652 ± 0.095
0.287ThrCys: 0.287 ± 0.027
2.716ThrAsp: 2.716 ± 0.072
2.577ThrGlu: 2.577 ± 0.072
2.348ThrPhe: 2.348 ± 0.078
4.638ThrGly: 4.638 ± 0.099
0.785ThrHis: 0.785 ± 0.043
3.93ThrIle: 3.93 ± 0.11
2.041ThrLys: 2.041 ± 0.067
4.064ThrLeu: 4.064 ± 0.104
1.463ThrMet: 1.463 ± 0.055
1.701ThrAsn: 1.701 ± 0.071
2.12ThrPro: 2.12 ± 0.075
0.982ThrGln: 0.982 ± 0.053
2.061ThrArg: 2.061 ± 0.071
3.285ThrSer: 3.285 ± 0.103
2.317ThrThr: 2.317 ± 0.086
4.307ThrVal: 4.307 ± 0.11
0.408ThrTrp: 0.408 ± 0.031
2.114ThrTyr: 2.114 ± 0.082
0.0ThrXaa: 0.0 ± 0.0
Val
4.733ValAla: 4.733 ± 0.095
0.492ValCys: 0.492 ± 0.042
4.108ValAsp: 4.108 ± 0.11
4.024ValGlu: 4.024 ± 0.106
3.504ValPhe: 3.504 ± 0.093
4.53ValGly: 4.53 ± 0.1
1.13ValHis: 1.13 ± 0.048
6.401ValIle: 6.401 ± 0.132
4.216ValLys: 4.216 ± 0.114
6.043ValLeu: 6.043 ± 0.123
2.156ValMet: 2.156 ± 0.073
3.206ValAsn: 3.206 ± 0.091
3.113ValPro: 3.113 ± 0.086
1.547ValGln: 1.547 ± 0.063
3.817ValArg: 3.817 ± 0.103
6.383ValSer: 6.383 ± 0.132
3.351ValThr: 3.351 ± 0.098
4.823ValVal: 4.823 ± 0.121
0.58ValTrp: 0.58 ± 0.035
3.415ValTyr: 3.415 ± 0.105
0.0ValXaa: 0.0 ± 0.0
Trp
0.607TrpAla: 0.607 ± 0.043
0.057TrpCys: 0.057 ± 0.01
0.421TrpAsp: 0.421 ± 0.03
0.408TrpGlu: 0.408 ± 0.028
0.43TrpPhe: 0.43 ± 0.031
0.578TrpGly: 0.578 ± 0.04
0.19TrpHis: 0.19 ± 0.022
0.878TrpIle: 0.878 ± 0.051
0.6TrpLys: 0.6 ± 0.04
0.609TrpLeu: 0.609 ± 0.033
0.282TrpMet: 0.282 ± 0.024
0.536TrpAsn: 0.536 ± 0.041
0.34TrpPro: 0.34 ± 0.027
0.263TrpGln: 0.263 ± 0.025
0.382TrpArg: 0.382 ± 0.028
0.585TrpSer: 0.585 ± 0.04
0.382TrpThr: 0.382 ± 0.029
0.448TrpVal: 0.448 ± 0.03
0.113TrpTrp: 0.113 ± 0.016
0.43TrpTyr: 0.43 ± 0.035
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.349TyrAla: 3.349 ± 0.104
0.289TyrCys: 0.289 ± 0.024
2.833TyrAsp: 2.833 ± 0.081
2.504TyrGlu: 2.504 ± 0.082
2.295TyrPhe: 2.295 ± 0.085
3.868TyrGly: 3.868 ± 0.089
0.83TyrHis: 0.83 ± 0.046
3.44TyrIle: 3.44 ± 0.102
1.476TyrLys: 1.476 ± 0.059
4.426TyrLeu: 4.426 ± 0.105
1.194TyrMet: 1.194 ± 0.051
1.796TyrAsn: 1.796 ± 0.072
1.966TyrPro: 1.966 ± 0.071
0.999TyrGln: 0.999 ± 0.044
3.029TyrArg: 3.029 ± 0.091
3.685TyrSer: 3.685 ± 0.097
2.292TyrThr: 2.292 ± 0.077
3.363TyrVal: 3.363 ± 0.094
0.419TyrTrp: 0.419 ± 0.029
2.299TyrTyr: 2.299 ± 0.086
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1482 proteins (453232 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski