Amino acid dipeptide frequency for candidate division MSBL1 archaeon SCGC-AAA259O05

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
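The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to be given in one-letter code, overlapping adjacent pairs are counted within each protein, and the counts are normalized by the total number of dipeptides (the database may normalize differently). The uniform expectation of 1/400 corresponds to 2.5‰.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"           # X stands for Xaa (non-standard or unknown)
UNIFORM_EXPECTATION = 1000.0 / 400     # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 residue pairs.

    Assumption: overlapping adjacent pairs are counted inside each protein
    and normalized by the total number of pairs observed.
    """
    counts = Counter()
    for seq in sequences:
        # Map every letter outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq, expected=UNIFORM_EXPECTATION):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Toy input, not data from this proteome.
    freqs = dipeptide_per_mille(["MKEEL", "EEK"])
    print(round(freqs["EE"], 3), classify(freqs["EE"]))
```

In the toy input, EE occurs in 2 of the 6 adjacent pairs, so it is reported as overrepresented, while any dipeptide that never occurs is reported as underrepresented.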

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.173 ± 0.157
AlaCys: 0.791 ± 0.064
AlaAsp: 3.416 ± 0.135
AlaGlu: 6.404 ± 0.17
AlaPhe: 2.373 ± 0.101
AlaGly: 5.165 ± 0.172
AlaHis: 1.076 ± 0.064
AlaIle: 3.571 ± 0.132
AlaLys: 3.922 ± 0.15
AlaLeu: 5.689 ± 0.186
AlaMet: 1.39 ± 0.069
AlaAsn: 1.716 ± 0.089
AlaPro: 2.152 ± 0.1
AlaGln: 1.235 ± 0.077
AlaArg: 3.663 ± 0.145
AlaSer: 3.738 ± 0.141
AlaThr: 2.738 ± 0.101
AlaVal: 4.655 ± 0.16
AlaTrp: 0.712 ± 0.068
AlaTyr: 1.733 ± 0.092
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.481 ± 0.048
CysCys: 0.13 ± 0.029
CysAsp: 0.653 ± 0.055
CysGlu: 1.021 ± 0.069
CysPhe: 0.381 ± 0.044
CysGly: 1.285 ± 0.078
CysHis: 0.276 ± 0.031
CysIle: 0.41 ± 0.049
CysLys: 0.456 ± 0.045
CysLeu: 0.925 ± 0.061
CysMet: 0.255 ± 0.037
CysAsn: 0.289 ± 0.043
CysPro: 0.854 ± 0.063
CysGln: 0.26 ± 0.036
CysArg: 0.712 ± 0.06
CysSer: 0.804 ± 0.061
CysThr: 0.456 ± 0.049
CysVal: 0.54 ± 0.048
CysTrp: 0.142 ± 0.025
CysTyr: 0.314 ± 0.05
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.169 ± 0.12
AspCys: 0.636 ± 0.051
AspAsp: 2.88 ± 0.126
AspGlu: 5.818 ± 0.153
AspPhe: 2.738 ± 0.105
AspGly: 3.7 ± 0.137
AspHis: 1.126 ± 0.091
AspIle: 3.545 ± 0.142
AspLys: 3.211 ± 0.149
AspLeu: 6.735 ± 0.183
AspMet: 1.365 ± 0.075
AspAsn: 1.674 ± 0.085
AspPro: 3.106 ± 0.107
AspGln: 1.231 ± 0.064
AspArg: 3.822 ± 0.128
AspSer: 3.29 ± 0.125
AspThr: 2.047 ± 0.179
AspVal: 4.186 ± 0.135
AspTrp: 1.08 ± 0.081
AspTyr: 2.39 ± 0.153
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.906 ± 0.165
GluCys: 0.85 ± 0.061
GluAsp: 6.191 ± 0.18
GluGlu: 13.378 ± 0.323
GluPhe: 3.441 ± 0.119
GluGly: 7.702 ± 0.184
GluHis: 1.398 ± 0.068
GluIle: 7.283 ± 0.195
GluLys: 10.712 ± 0.25
GluLeu: 8.363 ± 0.187
GluMet: 2.52 ± 0.111
GluAsn: 5.408 ± 0.221
GluPro: 3.232 ± 0.116
GluGln: 1.666 ± 0.079
GluArg: 7.032 ± 0.205
GluSer: 5.312 ± 0.15
GluThr: 4.546 ± 0.138
GluVal: 7.304 ± 0.202
GluTrp: 1.373 ± 0.095
GluTyr: 2.679 ± 0.11
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.361 ± 0.111
PheCys: 0.498 ± 0.051
PheAsp: 2.675 ± 0.114
PheGlu: 3.972 ± 0.139
PhePhe: 1.825 ± 0.116
PheGly: 3.282 ± 0.127
PheHis: 0.908 ± 0.066
PheIle: 1.821 ± 0.095
PheLys: 2.072 ± 0.109
PheLeu: 4.424 ± 0.158
PheMet: 0.691 ± 0.045
PheAsn: 1.105 ± 0.072
PhePro: 1.62 ± 0.09
PheGln: 0.984 ± 0.071
PheArg: 2.147 ± 0.098
PheSer: 3.328 ± 0.11
PheThr: 1.586 ± 0.082
PheVal: 2.553 ± 0.102
PheTrp: 0.569 ± 0.049
PheTyr: 1.235 ± 0.074
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.588 ± 0.163
GlyCys: 0.85 ± 0.057
GlyAsp: 4.207 ± 0.15
GlyGlu: 7.924 ± 0.176
GlyPhe: 3.185 ± 0.123
GlyGly: 6.3 ± 0.206
GlyHis: 1.519 ± 0.09
GlyIle: 5.128 ± 0.164
GlyLys: 5.919 ± 0.18
GlyLeu: 6.229 ± 0.227
GlyMet: 1.905 ± 0.097
GlyAsn: 2.532 ± 0.094
GlyPro: 2.403 ± 0.105
GlyGln: 1.365 ± 0.081
GlyArg: 4.793 ± 0.142
GlySer: 5.14 ± 0.138
GlyThr: 3.617 ± 0.117
GlyVal: 5.58 ± 0.15
GlyTrp: 1.113 ± 0.079
GlyTyr: 2.666 ± 0.107
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.118 ± 0.069
HisCys: 0.26 ± 0.031
HisAsp: 1.009 ± 0.063
HisGlu: 1.419 ± 0.076
HisPhe: 0.808 ± 0.053
HisGly: 1.553 ± 0.072
HisHis: 0.435 ± 0.043
HisIle: 1.03 ± 0.067
HisLys: 0.866 ± 0.05
HisLeu: 1.867 ± 0.1
HisMet: 0.293 ± 0.032
HisAsn: 0.527 ± 0.053
HisPro: 1.118 ± 0.071
HisGln: 0.506 ± 0.049
HisArg: 1.134 ± 0.067
HisSer: 1.189 ± 0.114
HisThr: 0.666 ± 0.054
HisVal: 1.285 ± 0.063
HisTrp: 0.31 ± 0.033
HisTyr: 0.569 ± 0.047
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.964 ± 0.157
IleCys: 0.72 ± 0.053
IleAsp: 3.562 ± 0.12
IleGlu: 6.363 ± 0.156
IlePhe: 2.499 ± 0.119
IleGly: 4.646 ± 0.156
IleHis: 1.21 ± 0.084
IleIle: 3.449 ± 0.132
IleLys: 3.324 ± 0.123
IleLeu: 5.517 ± 0.177
IleMet: 1.134 ± 0.069
IleAsn: 1.854 ± 0.096
IlePro: 3.227 ± 0.11
IleGln: 1.624 ± 0.089
IleArg: 3.625 ± 0.129
IleSer: 4.563 ± 0.151
IleThr: 2.825 ± 0.128
IleVal: 4.265 ± 0.131
IleTrp: 0.611 ± 0.057
IleTyr: 1.712 ± 0.093
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.429 ± 0.147
LysCys: 0.653 ± 0.059
LysAsp: 3.818 ± 0.173
LysGlu: 7.777 ± 0.196
LysPhe: 2.562 ± 0.099
LysGly: 4.776 ± 0.132
LysHis: 1.176 ± 0.072
LysIle: 5.584 ± 0.183
LysLys: 6.229 ± 0.195
LysLeu: 6.111 ± 0.153
LysMet: 1.62 ± 0.081
LysAsn: 3.114 ± 0.125
LysPro: 2.553 ± 0.096
LysGln: 1.394 ± 0.07
LysArg: 4.445 ± 0.161
LysSer: 4.383 ± 0.136
LysThr: 3.391 ± 0.147
LysVal: 4.563 ± 0.12
LysTrp: 0.837 ± 0.074
LysTyr: 1.997 ± 0.099
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.923 ± 0.189
LeuCys: 0.858 ± 0.058
LeuAsp: 5.802 ± 0.172
LeuGlu: 10.565 ± 0.276
LeuPhe: 3.29 ± 0.124
LeuGly: 6.672 ± 0.225
LeuHis: 1.427 ± 0.069
LeuIle: 4.864 ± 0.175
LeuLys: 6.605 ± 0.172
LeuLeu: 7.932 ± 0.263
LeuMet: 1.821 ± 0.085
LeuAsn: 3.089 ± 0.111
LeuPro: 4.09 ± 0.157
LeuGln: 2.085 ± 0.091
LeuArg: 5.777 ± 0.16
LeuSer: 6.61 ± 0.2
LeuThr: 4.27 ± 0.135
LeuVal: 5.693 ± 0.152
LeuTrp: 1.084 ± 0.074
LeuTyr: 2.294 ± 0.101
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.503 ± 0.069
MetCys: 0.159 ± 0.025
MetAsp: 1.49 ± 0.076
MetGlu: 2.332 ± 0.103
MetPhe: 0.573 ± 0.053
MetGly: 1.741 ± 0.088
MetHis: 0.255 ± 0.032
MetIle: 1.444 ± 0.073
MetLys: 1.913 ± 0.088
MetLeu: 1.578 ± 0.088
MetMet: 0.469 ± 0.044
MetAsn: 0.913 ± 0.065
MetPro: 1.026 ± 0.064
MetGln: 0.318 ± 0.033
MetArg: 1.339 ± 0.071
MetSer: 1.528 ± 0.079
MetThr: 1.193 ± 0.064
MetVal: 1.339 ± 0.086
MetTrp: 0.201 ± 0.029
MetTyr: 0.456 ± 0.045
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.039 ± 0.091
AsnCys: 0.456 ± 0.047
AsnAsp: 1.582 ± 0.084
AsnGlu: 2.645 ± 0.095
AsnPhe: 1.892 ± 0.09
AsnGly: 2.365 ± 0.102
AsnHis: 0.72 ± 0.054
AsnIle: 2.206 ± 0.094
AsnLys: 1.708 ± 0.083
AsnLeu: 4.085 ± 0.149
AsnMet: 0.707 ± 0.056
AsnAsn: 0.9 ± 0.057
AsnPro: 2.294 ± 0.107
AsnGln: 1.067 ± 0.058
AsnArg: 2.239 ± 0.101
AsnSer: 2.156 ± 0.098
AsnThr: 1.637 ± 0.091
AsnVal: 2.499 ± 0.103
AsnTrp: 0.62 ± 0.05
AsnTyr: 1.26 ± 0.072
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.168 ± 0.103
ProCys: 0.419 ± 0.045
ProAsp: 2.997 ± 0.108
ProGlu: 5.37 ± 0.159
ProPhe: 1.637 ± 0.085
ProGly: 3.428 ± 0.135
ProHis: 0.992 ± 0.072
ProIle: 2.219 ± 0.099
ProLys: 2.608 ± 0.089
ProLeu: 3.767 ± 0.14
ProMet: 0.913 ± 0.066
ProAsn: 1.436 ± 0.075
ProPro: 2.147 ± 0.096
ProGln: 1.009 ± 0.065
ProArg: 2.394 ± 0.097
ProSer: 3.139 ± 0.123
ProThr: 1.884 ± 0.081
ProVal: 2.922 ± 0.124
ProTrp: 0.544 ± 0.048
ProTyr: 1.423 ± 0.081
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.578 ± 0.093
GlnCys: 0.167 ± 0.025
GlnAsp: 1.143 ± 0.072
GlnGlu: 2.252 ± 0.094
GlnPhe: 0.724 ± 0.06
GlnGly: 1.561 ± 0.077
GlnHis: 0.389 ± 0.037
GlnIle: 1.465 ± 0.085
GlnLys: 1.871 ± 0.086
GlnLeu: 1.951 ± 0.088
GlnMet: 0.506 ± 0.045
GlnAsn: 0.925 ± 0.066
GlnPro: 0.791 ± 0.055
GlnGln: 0.544 ± 0.054
GlnArg: 1.373 ± 0.082
GlnSer: 1.193 ± 0.075
GlnThr: 1.105 ± 0.057
GlnVal: 1.507 ± 0.081
GlnTrp: 0.251 ± 0.037
GlnTyr: 0.628 ± 0.058
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.763 ± 0.13
ArgCys: 0.59 ± 0.054
ArgAsp: 3.345 ± 0.124
ArgGlu: 7.149 ± 0.205
ArgPhe: 2.537 ± 0.116
ArgGly: 4.944 ± 0.144
ArgHis: 0.854 ± 0.062
ArgIle: 4.173 ± 0.124
ArgLys: 5.785 ± 0.184
ArgLeu: 4.789 ± 0.141
ArgMet: 1.574 ± 0.073
ArgAsn: 2.482 ± 0.103
ArgPro: 2.315 ± 0.117
ArgGln: 1.256 ± 0.078
ArgArg: 4.32 ± 0.159
ArgSer: 3.725 ± 0.118
ArgThr: 2.867 ± 0.106
ArgVal: 3.956 ± 0.134
ArgTrp: 0.77 ± 0.071
ArgTyr: 1.725 ± 0.096
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.843 ± 0.14
SerCys: 0.728 ± 0.063
SerAsp: 3.713 ± 0.127
SerGlu: 6.555 ± 0.185
SerPhe: 2.955 ± 0.117
SerGly: 5.437 ± 0.158
SerHis: 1.206 ± 0.069
SerIle: 3.805 ± 0.129
SerLys: 4.257 ± 0.129
SerLeu: 6.107 ± 0.171
SerMet: 1.39 ± 0.081
SerAsn: 2.009 ± 0.089
SerPro: 3.123 ± 0.106
SerGln: 1.699 ± 0.078
SerArg: 3.968 ± 0.137
SerSer: 4.646 ± 0.187
SerThr: 2.989 ± 0.115
SerVal: 4.307 ± 0.144
SerTrp: 0.841 ± 0.052
SerTyr: 2.059 ± 0.087
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.951 ± 0.113
ThrCys: 0.557 ± 0.077
ThrAsp: 2.419 ± 0.124
ThrGlu: 4.257 ± 0.151
ThrPhe: 1.733 ± 0.081
ThrGly: 3.843 ± 0.131
ThrHis: 0.737 ± 0.064
ThrIle: 2.683 ± 0.119
ThrLys: 2.457 ± 0.109
ThrLeu: 4.324 ± 0.14
ThrMet: 0.833 ± 0.058
ThrAsn: 1.281 ± 0.058
ThrPro: 2.281 ± 0.092
ThrGln: 1.009 ± 0.066
ThrArg: 2.57 ± 0.104
ThrSer: 2.989 ± 0.116
ThrThr: 2.256 ± 0.125
ThrVal: 3.763 ± 0.143
ThrTrp: 0.603 ± 0.053
ThrTyr: 1.494 ± 0.078
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.127 ± 0.156
ValCys: 0.72 ± 0.054
ValAsp: 4.362 ± 0.145
ValGlu: 7.376 ± 0.177
ValPhe: 2.662 ± 0.11
ValGly: 5.245 ± 0.158
ValHis: 1.352 ± 0.07
ValIle: 3.675 ± 0.14
ValLys: 4.776 ± 0.161
ValLeu: 6.007 ± 0.185
ValMet: 1.377 ± 0.081
ValAsn: 2.206 ± 0.097
ValPro: 3.114 ± 0.136
ValGln: 1.478 ± 0.086
ValArg: 4.353 ± 0.153
ValSer: 4.969 ± 0.155
ValThr: 3.068 ± 0.112
ValVal: 4.68 ± 0.17
ValTrp: 0.812 ± 0.061
ValTyr: 1.988 ± 0.102
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.653 ± 0.051
TrpCys: 0.163 ± 0.026
TrpAsp: 0.695 ± 0.067
TrpGlu: 1.109 ± 0.078
TrpPhe: 0.611 ± 0.076
TrpGly: 0.862 ± 0.063
TrpHis: 0.243 ± 0.031
TrpIle: 1.021 ± 0.066
TrpLys: 1.113 ± 0.077
TrpLeu: 1.055 ± 0.065
TrpMet: 0.427 ± 0.046
TrpAsn: 0.666 ± 0.051
TrpPro: 0.389 ± 0.039
TrpGln: 0.301 ± 0.042
TrpArg: 1.038 ± 0.065
TrpSer: 0.879 ± 0.062
TrpThr: 0.64 ± 0.065
TrpVal: 0.808 ± 0.065
TrpTrp: 0.251 ± 0.037
TrpTyr: 0.364 ± 0.039
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.574 ± 0.08
TyrCys: 0.44 ± 0.044
TyrAsp: 1.792 ± 0.124
TyrGlu: 2.608 ± 0.11
TyrPhe: 1.231 ± 0.072
TyrGly: 2.453 ± 0.107
TyrHis: 0.661 ± 0.053
TyrIle: 1.319 ± 0.074
TyrLys: 1.607 ± 0.121
TyrLeu: 3.294 ± 0.113
TyrMet: 0.578 ± 0.046
TyrAsn: 0.95 ± 0.069
TyrPro: 1.582 ± 0.081
TyrGln: 0.896 ± 0.055
TyrArg: 2.172 ± 0.092
TyrSer: 2.064 ± 0.088
TyrThr: 1.26 ± 0.077
TyrVal: 1.942 ± 0.091
TyrTrp: 0.565 ± 0.047
TyrTyr: 1.151 ± 0.075
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1138 proteins (238899 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
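Bootstrapping at the protein level can be sketched as follows. This is a minimal illustration under assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 replicates, and the error is taken to be the standard deviation across replicates (the page does not say which spread measure is reported). The function names, the `seed` parameter and the toy sequences are hypothetical.

```python
import random
from statistics import mean, stdev


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille of all adjacent residue pairs."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of the frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return mean(estimates), stdev(estimates)


if __name__ == "__main__":
    # Toy sequences in one-letter code, not data from this proteome.
    toy = ["MKEEL", "EEK", "AAEE", "GGGG"]
    avg, err = bootstrap_error(toy, "EE")
    print(f"EE: {avg:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than single residues keeps adjacent residues together, so each replicate still contains real dipeptides.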

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski