Amino acid dipepetide frequency for Candidatus Parvarchaeum acidophilus ARMAN-5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.175AlaAla: 4.175 ± 0.182
0.487AlaCys: 0.487 ± 0.046
2.64AlaAsp: 2.64 ± 0.107
3.161AlaGlu: 3.161 ± 0.12
3.183AlaPhe: 3.183 ± 0.139
4.172AlaGly: 4.172 ± 0.181
0.63AlaHis: 0.63 ± 0.052
5.48AlaIle: 5.48 ± 0.164
4.187AlaLys: 4.187 ± 0.16
5.545AlaLeu: 5.545 ± 0.164
1.365AlaMet: 1.365 ± 0.089
2.848AlaAsn: 2.848 ± 0.128
1.501AlaPro: 1.501 ± 0.094
1.38AlaGln: 1.38 ± 0.077
1.882AlaArg: 1.882 ± 0.097
4.447AlaSer: 4.447 ± 0.162
2.893AlaThr: 2.893 ± 0.141
4.643AlaVal: 4.643 ± 0.13
0.347AlaTrp: 0.347 ± 0.034
2.493AlaTyr: 2.493 ± 0.1
0.0AlaXaa: 0.0 ± 0.0
Cys
0.4CysAla: 0.4 ± 0.038
0.049CysCys: 0.049 ± 0.017
0.392CysAsp: 0.392 ± 0.036
0.339CysGlu: 0.339 ± 0.039
0.26CysPhe: 0.26 ± 0.04
0.811CysGly: 0.811 ± 0.06
0.109CysHis: 0.109 ± 0.018
0.566CysIle: 0.566 ± 0.058
0.585CysLys: 0.585 ± 0.052
0.479CysLeu: 0.479 ± 0.046
0.136CysMet: 0.136 ± 0.02
0.468CysAsn: 0.468 ± 0.05
0.415CysPro: 0.415 ± 0.046
0.181CysGln: 0.181 ± 0.026
0.302CysArg: 0.302 ± 0.033
0.724CysSer: 0.724 ± 0.068
0.585CysThr: 0.585 ± 0.081
0.475CysVal: 0.475 ± 0.047
0.053CysTrp: 0.053 ± 0.015
0.355CysTyr: 0.355 ± 0.053
0.0CysXaa: 0.0 ± 0.0
Asp
2.731AspAla: 2.731 ± 0.13
0.37AspCys: 0.37 ± 0.047
2.218AspAsp: 2.218 ± 0.101
3.255AspGlu: 3.255 ± 0.131
2.708AspPhe: 2.708 ± 0.115
2.629AspGly: 2.629 ± 0.116
0.449AspHis: 0.449 ± 0.04
5.767AspIle: 5.767 ± 0.201
4.832AspLys: 4.832 ± 0.193
4.432AspLeu: 4.432 ± 0.163
1.35AspMet: 1.35 ± 0.08
2.738AspAsn: 2.738 ± 0.113
1.531AspPro: 1.531 ± 0.075
0.758AspGln: 0.758 ± 0.061
1.973AspArg: 1.973 ± 0.108
4.002AspSer: 4.002 ± 0.145
2.489AspThr: 2.489 ± 0.093
3.123AspVal: 3.123 ± 0.112
0.396AspTrp: 0.396 ± 0.038
2.667AspTyr: 2.667 ± 0.101
0.0AspXaa: 0.0 ± 0.0
Glu
3.436GluAla: 3.436 ± 0.131
0.313GluCys: 0.313 ± 0.037
3.323GluAsp: 3.323 ± 0.128
4.703GluGlu: 4.703 ± 0.178
2.686GluPhe: 2.686 ± 0.104
3.266GluGly: 3.266 ± 0.129
0.818GluHis: 0.818 ± 0.061
5.888GluIle: 5.888 ± 0.181
6.85GluLys: 6.85 ± 0.214
5.639GluLeu: 5.639 ± 0.201
1.46GluMet: 1.46 ± 0.085
4.27GluAsn: 4.27 ± 0.138
1.32GluPro: 1.32 ± 0.075
1.305GluGln: 1.305 ± 0.08
2.489GluArg: 2.489 ± 0.11
3.753GluSer: 3.753 ± 0.132
2.784GluThr: 2.784 ± 0.104
3.53GluVal: 3.53 ± 0.141
0.4GluTrp: 0.4 ± 0.037
2.572GluTyr: 2.572 ± 0.099
0.0GluXaa: 0.0 ± 0.0
Phe
2.399PheAla: 2.399 ± 0.103
0.396PheCys: 0.396 ± 0.04
2.663PheAsp: 2.663 ± 0.106
2.425PheGlu: 2.425 ± 0.116
2.429PhePhe: 2.429 ± 0.121
3.338PheGly: 3.338 ± 0.127
0.543PheHis: 0.543 ± 0.045
4.809PheIle: 4.809 ± 0.138
3.297PheLys: 3.297 ± 0.122
4.496PheLeu: 4.496 ± 0.146
1.169PheMet: 1.169 ± 0.074
3.134PheAsn: 3.134 ± 0.125
1.418PhePro: 1.418 ± 0.071
0.86PheGln: 0.86 ± 0.047
1.58PheArg: 1.58 ± 0.073
4.975PheSer: 4.975 ± 0.169
2.546PheThr: 2.546 ± 0.109
3.146PheVal: 3.146 ± 0.116
0.347PheTrp: 0.347 ± 0.038
2.278PheTyr: 2.278 ± 0.099
0.0PheXaa: 0.0 ± 0.0
Gly
3.493GlyAla: 3.493 ± 0.149
0.566GlyCys: 0.566 ± 0.064
2.75GlyAsp: 2.75 ± 0.127
3.444GlyGlu: 3.444 ± 0.136
3.221GlyPhe: 3.221 ± 0.111
3.942GlyGly: 3.942 ± 0.163
0.905GlyHis: 0.905 ± 0.062
6.284GlyIle: 6.284 ± 0.161
5.978GlyLys: 5.978 ± 0.171
5.854GlyLeu: 5.854 ± 0.169
1.584GlyMet: 1.584 ± 0.072
3.776GlyAsn: 3.776 ± 0.159
1.452GlyPro: 1.452 ± 0.08
1.373GlyGln: 1.373 ± 0.078
2.105GlyArg: 2.105 ± 0.099
5.016GlySer: 5.016 ± 0.251
3.579GlyThr: 3.579 ± 0.191
4.405GlyVal: 4.405 ± 0.135
0.577GlyTrp: 0.577 ± 0.053
3.413GlyTyr: 3.413 ± 0.123
0.0GlyXaa: 0.0 ± 0.0
His
0.705HisAla: 0.705 ± 0.053
0.094HisCys: 0.094 ± 0.02
0.524HisAsp: 0.524 ± 0.043
0.619HisGlu: 0.619 ± 0.051
0.735HisPhe: 0.735 ± 0.054
0.928HisGly: 0.928 ± 0.063
0.215HisHis: 0.215 ± 0.027
1.086HisIle: 1.086 ± 0.067
0.954HisLys: 0.954 ± 0.053
1.128HisLeu: 1.128 ± 0.074
0.351HisMet: 0.351 ± 0.039
0.664HisAsn: 0.664 ± 0.057
0.539HisPro: 0.539 ± 0.045
0.226HisGln: 0.226 ± 0.029
0.502HisArg: 0.502 ± 0.045
1.026HisSer: 1.026 ± 0.062
0.671HisThr: 0.671 ± 0.053
0.818HisVal: 0.818 ± 0.06
0.124HisTrp: 0.124 ± 0.023
0.592HisTyr: 0.592 ± 0.052
0.0HisXaa: 0.0 ± 0.0
Ile
6.107IleAla: 6.107 ± 0.172
0.6IleCys: 0.6 ± 0.053
5.065IleAsp: 5.065 ± 0.177
5.541IleGlu: 5.541 ± 0.19
4.134IlePhe: 4.134 ± 0.156
5.582IleGly: 5.582 ± 0.177
1.116IleHis: 1.116 ± 0.056
8.709IleIle: 8.709 ± 0.239
9.203IleLys: 9.203 ± 0.278
7.74IleLeu: 7.74 ± 0.224
2.018IleMet: 2.018 ± 0.094
6.454IleAsn: 6.454 ± 0.184
3.73IlePro: 3.73 ± 0.128
1.709IleGln: 1.709 ± 0.082
2.972IleArg: 2.972 ± 0.114
8.189IleSer: 8.189 ± 0.198
5.069IleThr: 5.069 ± 0.165
5.258IleVal: 5.258 ± 0.152
0.517IleTrp: 0.517 ± 0.047
3.334IleTyr: 3.334 ± 0.113
0.0IleXaa: 0.0 ± 0.0
Lys
4.394LysAla: 4.394 ± 0.163
0.622LysCys: 0.622 ± 0.056
5.318LysAsp: 5.318 ± 0.193
7.593LysGlu: 7.593 ± 0.246
3.319LysPhe: 3.319 ± 0.12
4.918LysGly: 4.918 ± 0.156
1.113LysHis: 1.113 ± 0.066
7.842LysIle: 7.842 ± 0.253
8.852LysLys: 8.852 ± 0.285
7.34LysLeu: 7.34 ± 0.184
2.063LysMet: 2.063 ± 0.1
6.08LysAsn: 6.08 ± 0.18
2.493LysPro: 2.493 ± 0.104
2.018LysGln: 2.018 ± 0.092
3.644LysArg: 3.644 ± 0.139
5.446LysSer: 5.446 ± 0.164
4.334LysThr: 4.334 ± 0.118
5.231LysVal: 5.231 ± 0.16
0.554LysTrp: 0.554 ± 0.05
3.576LysTyr: 3.576 ± 0.128
0.0LysXaa: 0.0 ± 0.0
Leu
5.292LeuAla: 5.292 ± 0.178
0.615LeuCys: 0.615 ± 0.056
4.798LeuAsp: 4.798 ± 0.153
5.363LeuGlu: 5.363 ± 0.148
4.209LeuPhe: 4.209 ± 0.159
5.533LeuGly: 5.533 ± 0.173
1.162LeuHis: 1.162 ± 0.071
7.691LeuIle: 7.691 ± 0.216
7.574LeuLys: 7.574 ± 0.202
8.147LeuLeu: 8.147 ± 0.273
2.071LeuMet: 2.071 ± 0.098
5.522LeuAsn: 5.522 ± 0.168
3.991LeuPro: 3.991 ± 0.148
1.965LeuGln: 1.965 ± 0.09
3.097LeuArg: 3.097 ± 0.131
8.894LeuSer: 8.894 ± 0.206
4.371LeuThr: 4.371 ± 0.142
5.556LeuVal: 5.556 ± 0.15
0.532LeuTrp: 0.532 ± 0.054
3.783LeuTyr: 3.783 ± 0.136
0.0LeuXaa: 0.0 ± 0.0
Met
1.426MetAla: 1.426 ± 0.074
0.181MetCys: 0.181 ± 0.026
1.467MetAsp: 1.467 ± 0.072
1.859MetGlu: 1.859 ± 0.093
0.962MetPhe: 0.962 ± 0.058
1.354MetGly: 1.354 ± 0.081
0.445MetHis: 0.445 ± 0.045
1.633MetIle: 1.633 ± 0.081
1.954MetLys: 1.954 ± 0.103
2.361MetLeu: 2.361 ± 0.109
0.453MetMet: 0.453 ± 0.043
1.38MetAsn: 1.38 ± 0.079
1.033MetPro: 1.033 ± 0.063
0.864MetGln: 0.864 ± 0.059
0.856MetArg: 0.856 ± 0.068
1.595MetSer: 1.595 ± 0.083
1.049MetThr: 1.049 ± 0.059
1.471MetVal: 1.471 ± 0.082
0.121MetTrp: 0.121 ± 0.022
0.837MetTyr: 0.837 ± 0.056
0.0MetXaa: 0.0 ± 0.0
Asn
3.383AsnAla: 3.383 ± 0.145
0.588AsnCys: 0.588 ± 0.072
2.821AsnAsp: 2.821 ± 0.128
3.862AsnGlu: 3.862 ± 0.129
2.908AsnPhe: 2.908 ± 0.102
5.099AsnGly: 5.099 ± 0.22
0.668AsnHis: 0.668 ± 0.052
6.091AsnIle: 6.091 ± 0.193
5.39AsnLys: 5.39 ± 0.163
5.582AsnLeu: 5.582 ± 0.191
1.448AsnMet: 1.448 ± 0.079
3.919AsnAsn: 3.919 ± 0.182
2.293AsnPro: 2.293 ± 0.092
1.554AsnGln: 1.554 ± 0.098
1.859AsnArg: 1.859 ± 0.094
5.424AsnSer: 5.424 ± 0.243
3.221AsnThr: 3.221 ± 0.127
4.017AsnVal: 4.017 ± 0.15
0.487AsnTrp: 0.487 ± 0.047
3.217AsnTyr: 3.217 ± 0.162
0.0AsnXaa: 0.0 ± 0.0
Pro
1.81ProAla: 1.81 ± 0.092
0.166ProCys: 0.166 ± 0.026
1.426ProAsp: 1.426 ± 0.068
2.225ProGlu: 2.225 ± 0.094
1.931ProPhe: 1.931 ± 0.082
1.939ProGly: 1.939 ± 0.097
0.453ProHis: 0.453 ± 0.041
2.833ProIle: 2.833 ± 0.108
2.308ProLys: 2.308 ± 0.094
3.214ProLeu: 3.214 ± 0.106
0.634ProMet: 0.634 ± 0.056
2.044ProAsn: 2.044 ± 0.123
1.018ProPro: 1.018 ± 0.068
0.909ProGln: 0.909 ± 0.061
0.917ProArg: 0.917 ± 0.059
3.545ProSer: 3.545 ± 0.243
1.705ProThr: 1.705 ± 0.086
2.289ProVal: 2.289 ± 0.087
0.272ProTrp: 0.272 ± 0.032
1.528ProTyr: 1.528 ± 0.083
0.0ProXaa: 0.0 ± 0.0
Gln
1.271GlnAla: 1.271 ± 0.074
0.204GlnCys: 0.204 ± 0.026
1.056GlnAsp: 1.056 ± 0.068
1.373GlnGlu: 1.373 ± 0.071
0.95GlnPhe: 0.95 ± 0.066
1.177GlnGly: 1.177 ± 0.08
0.256GlnHis: 0.256 ± 0.028
1.946GlnIle: 1.946 ± 0.078
2.003GlnLys: 2.003 ± 0.092
2.093GlnLeu: 2.093 ± 0.089
0.592GlnMet: 0.592 ± 0.044
1.709GlnAsn: 1.709 ± 0.092
0.739GlnPro: 0.739 ± 0.058
0.909GlnGln: 0.909 ± 0.069
0.901GlnArg: 0.901 ± 0.059
1.743GlnSer: 1.743 ± 0.105
1.158GlnThr: 1.158 ± 0.077
1.301GlnVal: 1.301 ± 0.062
0.177GlnTrp: 0.177 ± 0.028
1.03GlnTyr: 1.03 ± 0.073
0.0GlnXaa: 0.0 ± 0.0
Arg
1.871ArgAla: 1.871 ± 0.089
0.256ArgCys: 0.256 ± 0.028
1.859ArgAsp: 1.859 ± 0.105
2.38ArgGlu: 2.38 ± 0.111
1.686ArgPhe: 1.686 ± 0.077
1.98ArgGly: 1.98 ± 0.088
0.509ArgHis: 0.509 ± 0.047
3.395ArgIle: 3.395 ± 0.142
3.357ArgLys: 3.357 ± 0.141
3.289ArgLeu: 3.289 ± 0.115
0.924ArgMet: 0.924 ± 0.063
2.108ArgAsn: 2.108 ± 0.088
0.917ArgPro: 0.917 ± 0.064
0.879ArgGln: 0.879 ± 0.071
1.784ArgArg: 1.784 ± 0.097
2.037ArgSer: 2.037 ± 0.088
1.46ArgThr: 1.46 ± 0.079
1.995ArgVal: 1.995 ± 0.09
0.313ArgTrp: 0.313 ± 0.033
1.66ArgTyr: 1.66 ± 0.089
0.0ArgXaa: 0.0 ± 0.0
Ser
4.843SerAla: 4.843 ± 0.169
0.702SerCys: 0.702 ± 0.091
3.742SerAsp: 3.742 ± 0.137
4.247SerGlu: 4.247 ± 0.14
4.605SerPhe: 4.605 ± 0.17
6.122SerGly: 6.122 ± 0.324
0.973SerHis: 0.973 ± 0.068
7.936SerIle: 7.936 ± 0.222
6.642SerLys: 6.642 ± 0.186
7.419SerLeu: 7.419 ± 0.18
1.856SerMet: 1.856 ± 0.084
5.439SerAsn: 5.439 ± 0.284
2.659SerPro: 2.659 ± 0.145
2.067SerGln: 2.067 ± 0.101
2.595SerArg: 2.595 ± 0.116
8.385SerSer: 8.385 ± 0.516
4.617SerThr: 4.617 ± 0.245
5.488SerVal: 5.488 ± 0.196
0.686SerTrp: 0.686 ± 0.07
3.636SerTyr: 3.636 ± 0.18
0.0SerXaa: 0.0 ± 0.0
Thr
3.553ThrAla: 3.553 ± 0.137
0.468ThrCys: 0.468 ± 0.049
2.369ThrAsp: 2.369 ± 0.096
2.746ThrGlu: 2.746 ± 0.117
2.618ThrPhe: 2.618 ± 0.122
3.889ThrGly: 3.889 ± 0.189
0.664ThrHis: 0.664 ± 0.058
4.466ThrIle: 4.466 ± 0.135
3.685ThrLys: 3.685 ± 0.111
4.884ThrLeu: 4.884 ± 0.145
1.079ThrMet: 1.079 ± 0.069
3.24ThrAsn: 3.24 ± 0.171
2.082ThrPro: 2.082 ± 0.095
1.339ThrGln: 1.339 ± 0.069
1.471ThrArg: 1.471 ± 0.075
4.33ThrSer: 4.33 ± 0.284
2.885ThrThr: 2.885 ± 0.186
3.681ThrVal: 3.681 ± 0.147
0.343ThrTrp: 0.343 ± 0.041
2.316ThrTyr: 2.316 ± 0.151
0.0ThrXaa: 0.0 ± 0.0
Val
3.462ValAla: 3.462 ± 0.121
0.547ValCys: 0.547 ± 0.044
3.176ValAsp: 3.176 ± 0.127
3.255ValGlu: 3.255 ± 0.132
3.274ValPhe: 3.274 ± 0.115
3.628ValGly: 3.628 ± 0.13
0.785ValHis: 0.785 ± 0.054
6.454ValIle: 6.454 ± 0.18
5.111ValLys: 5.111 ± 0.17
5.658ValLeu: 5.658 ± 0.176
1.497ValMet: 1.497 ± 0.073
4.107ValAsn: 4.107 ± 0.148
2.116ValPro: 2.116 ± 0.084
1.245ValGln: 1.245 ± 0.069
1.875ValArg: 1.875 ± 0.092
5.937ValSer: 5.937 ± 0.219
3.402ValThr: 3.402 ± 0.157
3.757ValVal: 3.757 ± 0.138
0.441ValTrp: 0.441 ± 0.041
3.146ValTyr: 3.146 ± 0.119
0.0ValXaa: 0.0 ± 0.0
Trp
0.422TrpAla: 0.422 ± 0.037
0.068TrpCys: 0.068 ± 0.014
0.396TrpAsp: 0.396 ± 0.038
0.4TrpGlu: 0.4 ± 0.04
0.279TrpPhe: 0.279 ± 0.032
0.415TrpGly: 0.415 ± 0.043
0.128TrpHis: 0.128 ± 0.022
0.528TrpIle: 0.528 ± 0.05
0.543TrpLys: 0.543 ± 0.054
0.781TrpLeu: 0.781 ± 0.053
0.207TrpMet: 0.207 ± 0.032
0.468TrpAsn: 0.468 ± 0.047
0.26TrpPro: 0.26 ± 0.034
0.17TrpGln: 0.17 ± 0.025
0.339TrpArg: 0.339 ± 0.031
0.513TrpSer: 0.513 ± 0.067
0.392TrpThr: 0.392 ± 0.056
0.468TrpVal: 0.468 ± 0.038
0.106TrpTrp: 0.106 ± 0.016
0.283TrpTyr: 0.283 ± 0.031
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.286TyrAla: 2.286 ± 0.093
0.392TyrCys: 0.392 ± 0.039
2.414TyrAsp: 2.414 ± 0.098
2.014TyrGlu: 2.014 ± 0.086
2.15TyrPhe: 2.15 ± 0.109
2.995TyrGly: 2.995 ± 0.108
0.562TyrHis: 0.562 ± 0.041
3.644TyrIle: 3.644 ± 0.12
3.451TyrLys: 3.451 ± 0.124
4.085TyrLeu: 4.085 ± 0.119
1.041TyrMet: 1.041 ± 0.068
3.466TyrAsn: 3.466 ± 0.187
1.656TyrPro: 1.656 ± 0.087
0.879TyrGln: 0.879 ± 0.073
1.516TyrArg: 1.516 ± 0.088
4.677TyrSer: 4.677 ± 0.196
2.897TyrThr: 2.897 ± 0.148
2.218TyrVal: 2.218 ± 0.098
0.373TyrTrp: 0.373 ± 0.04
2.139TyrTyr: 2.139 ± 0.114
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1002 proteins (265128 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski