Amino acid dipeptide frequency for Candidatus Altiarchaeales archaeon WOR_SM1_SCG

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
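As a minimal sketch of the calculation described above, the following Python snippet counts overlapping dipeptides within each protein and compares each frequency with the uniform expectation of 1/400 (2.5‰). The function names, the toy sequences, and the choice to map every non-standard letter to X (Xaa) are assumptions made for illustration; this is not the database's own code.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
EXPECTED_PER_MILLE = 1000 / 400    # uniform expectation: 2.5 per mille per dipeptide


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille over all proteins.

    Dipeptides are counted as overlapping pairs of adjacent residues and
    never span two different proteins. Letters outside the 20 standard
    amino acids are mapped to 'X' (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be marked red in the table
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be marked blue in the table
    return "expected"


# Toy input, not real proteome data.
toy = ["MKKIEL", "MKIU"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 1), classify(value))
```

Counting pairs per protein, rather than over one concatenated sequence, avoids creating artificial dipeptides at protein boundaries.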

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
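The unit conversion can be illustrated with a short Python sketch. The helper name is hypothetical; the LysIle value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


lys_ile = 10.255  # LysIle value from the table, in per mille
fraction = per_mille_to_fraction(lys_ile)
percent = fraction * 100
print(f"LysIle: {lys_ile} per mille = {fraction:.6f} = {percent:.4f} %")
```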

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.461 ± 0.107
AlaCys: 1.003 ± 0.051
AlaAsp: 3.083 ± 0.087
AlaGlu: 4.296 ± 0.103
AlaPhe: 2.08 ± 0.057
AlaGly: 5.233 ± 0.11
AlaHis: 0.985 ± 0.045
AlaIle: 4.607 ± 0.111
AlaLys: 4.634 ± 0.102
AlaLeu: 5.263 ± 0.137
AlaMet: 1.328 ± 0.044
AlaAsn: 2.574 ± 0.086
AlaPro: 1.481 ± 0.054
AlaGln: 1.251 ± 0.051
AlaArg: 2.453 ± 0.074
AlaSer: 3.313 ± 0.081
AlaThr: 2.489 ± 0.083
AlaVal: 4.657 ± 0.104
AlaTrp: 0.513 ± 0.032
AlaTyr: 1.85 ± 0.062
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.923 ± 0.048
CysCys: 0.323 ± 0.032
CysAsp: 1.201 ± 0.079
CysGlu: 1.254 ± 0.051
CysPhe: 0.623 ± 0.037
CysGly: 1.629 ± 0.075
CysHis: 0.256 ± 0.02
CysIle: 1.177 ± 0.056
CysLys: 1.085 ± 0.047
CysLeu: 1.025 ± 0.044
CysMet: 0.353 ± 0.023
CysAsn: 0.938 ± 0.085
CysPro: 0.932 ± 0.049
CysGln: 0.297 ± 0.025
CysArg: 0.594 ± 0.031
CysSer: 1.111 ± 0.089
CysThr: 0.913 ± 0.063
CysVal: 0.959 ± 0.045
CysTrp: 0.149 ± 0.016
CysTyr: 0.572 ± 0.039
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.063 ± 0.117
AspCys: 1.038 ± 0.072
AspAsp: 3.301 ± 0.1
AspGlu: 5.003 ± 0.103
AspPhe: 2.777 ± 0.076
AspGly: 3.579 ± 0.156
AspHis: 0.642 ± 0.033
AspIle: 5.443 ± 0.104
AspLys: 4.828 ± 0.106
AspLeu: 4.672 ± 0.101
AspMet: 1.449 ± 0.054
AspAsn: 3.176 ± 0.118
AspPro: 1.88 ± 0.059
AspGln: 0.603 ± 0.036
AspArg: 1.968 ± 0.054
AspSer: 3.337 ± 0.09
AspThr: 2.718 ± 0.097
AspVal: 4.167 ± 0.094
AspTrp: 0.677 ± 0.036
AspTyr: 2.663 ± 0.07
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.357 ± 0.093
GluCys: 1.075 ± 0.052
GluAsp: 3.914 ± 0.088
GluGlu: 6.425 ± 0.19
GluPhe: 3.489 ± 0.085
GluGly: 4.188 ± 0.095
GluHis: 1.347 ± 0.054
GluIle: 8.751 ± 0.161
GluLys: 9.177 ± 0.189
GluLeu: 7.059 ± 0.149
GluMet: 1.857 ± 0.054
GluAsn: 5.456 ± 0.101
GluPro: 1.987 ± 0.066
GluGln: 1.661 ± 0.062
GluArg: 3.228 ± 0.08
GluSer: 3.838 ± 0.086
GluThr: 3.835 ± 0.1
GluVal: 4.659 ± 0.093
GluTrp: 0.72 ± 0.033
GluTyr: 2.89 ± 0.083
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.462 ± 0.075
PheCys: 0.652 ± 0.039
PheAsp: 2.535 ± 0.065
PheGlu: 3.144 ± 0.081
PhePhe: 1.943 ± 0.066
PheGly: 2.908 ± 0.079
PheHis: 0.722 ± 0.036
PheIle: 3.794 ± 0.084
PheLys: 3.444 ± 0.096
PheLeu: 3.972 ± 0.121
PheMet: 1.089 ± 0.052
PheAsn: 2.664 ± 0.076
PhePro: 1.518 ± 0.053
PheGln: 0.813 ± 0.04
PheArg: 1.672 ± 0.058
PheSer: 2.882 ± 0.075
PheThr: 2.089 ± 0.068
PheVal: 2.731 ± 0.076
PheTrp: 0.449 ± 0.024
PheTyr: 1.62 ± 0.056
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.139 ± 0.095
GlyCys: 1.306 ± 0.074
GlyAsp: 4.385 ± 0.114
GlyGlu: 5.096 ± 0.105
GlyPhe: 3.077 ± 0.082
GlyGly: 4.764 ± 0.123
GlyHis: 1.066 ± 0.047
GlyIle: 7.393 ± 0.15
GlyLys: 6.253 ± 0.125
GlyLeu: 4.817 ± 0.095
GlyMet: 1.836 ± 0.062
GlyAsn: 3.914 ± 0.124
GlyPro: 1.242 ± 0.051
GlyGln: 1.119 ± 0.038
GlyArg: 2.637 ± 0.066
GlySer: 3.948 ± 0.107
GlyThr: 3.958 ± 0.115
GlyVal: 4.677 ± 0.091
GlyTrp: 0.764 ± 0.041
GlyTyr: 2.651 ± 0.088
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.064 ± 0.045
HisCys: 0.292 ± 0.026
HisAsp: 0.93 ± 0.036
HisGlu: 1.299 ± 0.046
HisPhe: 0.685 ± 0.034
HisGly: 1.154 ± 0.048
HisHis: 0.368 ± 0.026
HisIle: 1.328 ± 0.049
HisLys: 1.232 ± 0.046
HisLeu: 1.457 ± 0.055
HisMet: 0.178 ± 0.015
HisAsn: 0.877 ± 0.035
HisPro: 0.863 ± 0.041
HisGln: 0.311 ± 0.023
HisArg: 0.833 ± 0.04
HisSer: 0.924 ± 0.037
HisThr: 0.875 ± 0.036
HisVal: 0.789 ± 0.037
HisTrp: 0.148 ± 0.016
HisTyr: 0.688 ± 0.034
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.426 ± 0.121
IleCys: 1.261 ± 0.05
IleAsp: 5.284 ± 0.114
IleGlu: 7.018 ± 0.126
IlePhe: 3.949 ± 0.094
IleGly: 5.997 ± 0.13
IleHis: 1.504 ± 0.049
IleIle: 8.323 ± 0.162
IleLys: 8.25 ± 0.139
IleLeu: 8.268 ± 0.145
IleMet: 1.806 ± 0.059
IleAsn: 5.412 ± 0.119
IlePro: 4.022 ± 0.099
IleGln: 1.67 ± 0.053
IleArg: 3.371 ± 0.083
IleSer: 6.068 ± 0.094
IleThr: 5.121 ± 0.125
IleVal: 5.006 ± 0.086
IleTrp: 0.769 ± 0.037
IleTyr: 3.517 ± 0.115
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.771 ± 0.101
LysCys: 1.11 ± 0.045
LysAsp: 4.966 ± 0.131
LysGlu: 8.113 ± 0.176
LysPhe: 3.853 ± 0.102
LysGly: 4.509 ± 0.112
LysHis: 1.504 ± 0.05
LysIle: 10.255 ± 0.172
LysLys: 9.307 ± 0.196
LysLeu: 7.687 ± 0.156
LysMet: 1.935 ± 0.063
LysAsn: 6.304 ± 0.103
LysPro: 2.742 ± 0.078
LysGln: 2.137 ± 0.078
LysArg: 3.518 ± 0.086
LysSer: 4.628 ± 0.105
LysThr: 4.709 ± 0.098
LysVal: 4.473 ± 0.094
LysTrp: 0.766 ± 0.038
LysTyr: 3.36 ± 0.079
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.736 ± 0.109
LeuCys: 1.352 ± 0.044
LeuAsp: 4.506 ± 0.089
LeuGlu: 5.951 ± 0.129
LeuPhe: 3.696 ± 0.097
LeuGly: 5.467 ± 0.141
LeuHis: 1.264 ± 0.045
LeuIle: 8.063 ± 0.15
LeuLys: 8.952 ± 0.186
LeuLeu: 7.19 ± 0.167
LeuMet: 2.092 ± 0.057
LeuAsn: 5.505 ± 0.119
LeuPro: 2.888 ± 0.073
LeuGln: 1.644 ± 0.059
LeuArg: 3.296 ± 0.083
LeuSer: 5.613 ± 0.104
LeuThr: 4.57 ± 0.104
LeuVal: 4.528 ± 0.102
LeuTrp: 0.725 ± 0.036
LeuTyr: 2.55 ± 0.073
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.343 ± 0.053
MetCys: 0.297 ± 0.023
MetAsp: 1.422 ± 0.052
MetGlu: 1.78 ± 0.053
MetPhe: 0.796 ± 0.037
MetGly: 1.472 ± 0.055
MetHis: 0.448 ± 0.027
MetIle: 1.614 ± 0.054
MetLys: 2.413 ± 0.065
MetLeu: 1.926 ± 0.053
MetMet: 0.548 ± 0.04
MetAsn: 1.449 ± 0.043
MetPro: 0.926 ± 0.049
MetGln: 0.732 ± 0.033
MetArg: 1.043 ± 0.047
MetSer: 1.262 ± 0.054
MetThr: 1.166 ± 0.041
MetVal: 1.314 ± 0.05
MetTrp: 0.177 ± 0.019
MetTyr: 0.615 ± 0.037
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.272 ± 0.082
AsnCys: 1.187 ± 0.089
AsnAsp: 2.984 ± 0.114
AsnGlu: 4.455 ± 0.096
AsnPhe: 2.867 ± 0.079
AsnGly: 2.925 ± 0.117
AsnHis: 0.74 ± 0.037
AsnIle: 5.581 ± 0.136
AsnLys: 5.124 ± 0.096
AsnLeu: 5.872 ± 0.112
AsnMet: 1.102 ± 0.038
AsnAsn: 4.453 ± 0.334
AsnPro: 2.483 ± 0.059
AsnGln: 1.102 ± 0.05
AsnArg: 1.99 ± 0.054
AsnSer: 3.532 ± 0.141
AsnThr: 3.122 ± 0.206
AsnVal: 3.238 ± 0.111
AsnTrp: 0.668 ± 0.044
AsnTyr: 2.676 ± 0.099
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.136 ± 0.065
ProCys: 0.458 ± 0.027
ProAsp: 2.51 ± 0.078
ProGlu: 3.706 ± 0.084
ProPhe: 1.427 ± 0.059
ProGly: 3.256 ± 0.094
ProHis: 0.646 ± 0.028
ProIle: 2.077 ± 0.068
ProLys: 2.396 ± 0.067
ProLeu: 2.692 ± 0.07
ProMet: 0.665 ± 0.036
ProAsn: 1.191 ± 0.046
ProPro: 1.37 ± 0.057
ProGln: 0.834 ± 0.041
ProArg: 1.165 ± 0.054
ProSer: 1.687 ± 0.061
ProThr: 1.501 ± 0.072
ProVal: 2.67 ± 0.077
ProTrp: 0.327 ± 0.023
ProTyr: 1.299 ± 0.047
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.096 ± 0.045
GlnCys: 0.314 ± 0.024
GlnAsp: 1.069 ± 0.047
GlnGlu: 1.697 ± 0.066
GlnPhe: 0.798 ± 0.034
GlnGly: 1.34 ± 0.05
GlnHis: 0.332 ± 0.027
GlnIle: 1.734 ± 0.05
GlnLys: 2.211 ± 0.077
GlnLeu: 1.547 ± 0.051
GlnMet: 0.553 ± 0.032
GlnAsn: 1.219 ± 0.045
GlnPro: 0.56 ± 0.031
GlnGln: 0.6 ± 0.035
GlnArg: 0.842 ± 0.034
GlnSer: 1.017 ± 0.037
GlnThr: 1.05 ± 0.047
GlnVal: 1.376 ± 0.048
GlnTrp: 0.181 ± 0.016
GlnTyr: 0.706 ± 0.034
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.092 ± 0.065
ArgCys: 0.594 ± 0.031
ArgAsp: 2.486 ± 0.072
ArgGlu: 3.815 ± 0.099
ArgPhe: 1.627 ± 0.051
ArgGly: 2.722 ± 0.071
ArgHis: 0.685 ± 0.034
ArgIle: 3.534 ± 0.079
ArgLys: 3.762 ± 0.084
ArgLeu: 2.839 ± 0.079
ArgMet: 1.075 ± 0.041
ArgAsn: 2.115 ± 0.059
ArgPro: 0.985 ± 0.045
ArgGln: 0.964 ± 0.04
ArgArg: 1.851 ± 0.074
ArgSer: 1.714 ± 0.053
ArgThr: 1.825 ± 0.064
ArgVal: 2.495 ± 0.061
ArgTrp: 0.431 ± 0.027
ArgTyr: 1.53 ± 0.061
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.663 ± 0.082
SerCys: 1.009 ± 0.067
SerAsp: 3.873 ± 0.11
SerGlu: 4.5 ± 0.101
SerPhe: 2.445 ± 0.072
SerGly: 5.749 ± 0.15
SerHis: 0.959 ± 0.035
SerIle: 4.552 ± 0.1
SerLys: 4.273 ± 0.093
SerLeu: 4.38 ± 0.113
SerMet: 1.32 ± 0.043
SerAsn: 3.32 ± 0.197
SerPro: 2.07 ± 0.068
SerGln: 1.245 ± 0.044
SerArg: 2.376 ± 0.073
SerSer: 3.923 ± 0.164
SerThr: 2.616 ± 0.084
SerVal: 3.82 ± 0.081
SerTrp: 0.563 ± 0.038
SerTyr: 2.028 ± 0.066
SerXaa: 0.002 ± 0.001
Thr
ThrAla: 3.352 ± 0.111
ThrCys: 0.897 ± 0.074
ThrAsp: 2.973 ± 0.102
ThrGlu: 3.392 ± 0.077
ThrPhe: 1.886 ± 0.068
ThrGly: 5.045 ± 0.144
ThrHis: 0.933 ± 0.035
ThrIle: 4.491 ± 0.106
ThrLys: 3.602 ± 0.08
ThrLeu: 4.345 ± 0.111
ThrMet: 0.948 ± 0.039
ThrAsn: 2.507 ± 0.13
ThrPro: 2.276 ± 0.071
ThrGln: 1.187 ± 0.052
ThrArg: 1.9 ± 0.057
ThrSer: 2.676 ± 0.071
ThrThr: 2.85 ± 0.106
ThrVal: 3.328 ± 0.107
ThrTrp: 0.434 ± 0.029
ThrTyr: 1.821 ± 0.077
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.514 ± 0.094
ValCys: 1.178 ± 0.059
ValAsp: 3.626 ± 0.079
ValGlu: 4.646 ± 0.093
ValPhe: 2.883 ± 0.075
ValGly: 3.715 ± 0.085
ValHis: 1.025 ± 0.041
ValIle: 5.634 ± 0.113
ValLys: 5.169 ± 0.101
ValLeu: 5.362 ± 0.099
ValMet: 1.632 ± 0.062
ValAsn: 3.281 ± 0.101
ValPro: 2.198 ± 0.064
ValGln: 1.114 ± 0.039
ValArg: 2.562 ± 0.1
ValSer: 4.319 ± 0.111
ValThr: 3.11 ± 0.123
ValVal: 4.732 ± 0.101
ValTrp: 0.539 ± 0.028
ValTyr: 2.281 ± 0.083
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.515 ± 0.034
TrpCys: 0.163 ± 0.016
TrpAsp: 0.617 ± 0.036
TrpGlu: 0.624 ± 0.031
TrpPhe: 0.472 ± 0.03
TrpGly: 0.614 ± 0.034
TrpHis: 0.192 ± 0.018
TrpIle: 0.758 ± 0.035
TrpLys: 0.85 ± 0.047
TrpLeu: 0.743 ± 0.033
TrpMet: 0.279 ± 0.022
TrpAsn: 0.764 ± 0.049
TrpPro: 0.145 ± 0.014
TrpGln: 0.3 ± 0.02
TrpArg: 0.355 ± 0.027
TrpSer: 0.533 ± 0.035
TrpThr: 0.477 ± 0.028
TrpVal: 0.614 ± 0.031
TrpTrp: 0.178 ± 0.018
TrpTyr: 0.387 ± 0.031
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.169 ± 0.06
TyrCys: 0.805 ± 0.045
TyrAsp: 2.319 ± 0.068
TyrGlu: 2.699 ± 0.065
TyrPhe: 1.726 ± 0.058
TyrGly: 2.775 ± 0.109
TyrHis: 0.677 ± 0.035
TyrIle: 2.935 ± 0.079
TyrLys: 2.687 ± 0.072
TyrLeu: 3.474 ± 0.095
TyrMet: 0.696 ± 0.03
TyrAsn: 2.253 ± 0.09
TyrPro: 1.443 ± 0.053
TyrGln: 0.656 ± 0.036
TyrArg: 1.487 ± 0.055
TyrSer: 2.361 ± 0.086
TyrThr: 1.85 ± 0.068
TyrVal: 2.29 ± 0.079
TyrTrp: 0.397 ± 0.031
TyrTyr: 1.673 ± 0.066
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.002 ± 0.001
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.002 ± 0.001
Statistics based on 2164 proteins (656850 amino acids)
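
For readers who want to work with the table entries programmatically, the following Python sketch parses entries of the form "LysIle: 10.255 ± 0.172" into a dictionary keyed by (first residue, second residue). The regular expression and function name are assumptions made for illustration; the three sample lines are copied from the table.

```python
import re

# Matches e.g. "AlaAla: 4.461 ± 0.107" anywhere in a line.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)


def parse_entries(lines):
    """Map (first, second) residue codes to (frequency, error) in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.search(line)
        if match:
            first, second, freq, err = match.groups()
            table[(first, second)] = (float(freq), float(err))
    return table


sample = [
    "GluLys: 9.177 ± 0.189",
    "LysIle: 10.255 ± 0.172",
    "CysTrp: 0.149 ± 0.016",
]
table = parse_entries(sample)
most_common = max(table, key=lambda pair: table[pair][0])
print(most_common, table[most_common])
```

Lines that do not contain a dipeptide entry, such as the row labels, are skipped.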

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
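
A protein-level bootstrap of this kind can be sketched in Python as follows. The source states only that 100 bootstrap replicates were drawn at the protein level; reporting the standard deviation across replicates as the error, the function names, the fixed seed, and the toy sequences are all assumptions of this sketch.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates.

    Each replicate resamples whole proteins with replacement, so the
    resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences (one-letter codes), not real proteome data.
toy = ["MKKIEL", "MKILKK", "MAGIKE", "MKEELL", "MLIKKA"]
print(dipeptide_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling whole proteins keeps residues from the same protein together, so the estimated error reflects variation between proteins and not just between individual positions.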

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski