Amino acid dipeptide frequency for Thermoplasmatales archaeon B_DKE

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
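As a minimal sketch of how such a frequency could be computed (an illustration, not the pipeline used by Proteome-pI), the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name dipeptide_frequencies and the toy sequences are hypothetical, the code uses one-letter residue codes rather than the three-letter codes of the table, and whether the database counts pairs in exactly this way is an assumption.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences.

    Pairs are counted with a sliding window of width 2 inside each protein,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAK", "MAL"]
freqs = dipeptide_frequencies(toy)
```

With this toy input there are five pairs in total (MA, AA, AK, MA, AL), so freqs["MA"] is 400‰ and all frequencies sum to 1000‰.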

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
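The colour coding amounts to comparing each observed value with the uniform expectation. The following is a minimal sketch, assuming the 400-dipeptide baseline (1/400 = 0.25% = 2.5‰) and ignoring the ± error when classifying. The function name classify is hypothetical; the sample values are copied from the table on this page.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide as over- or under-represented versus the uniform baseline."""
    if observed_per_mille > expected:
        return "over (red)"
    if observed_per_mille < expected:
        return "under (blue)"
    return "as expected"


# Values in per mille, taken from the table on this page.
sample = {"LeuSer": 8.281, "AlaAla": 4.792, "CysTrp": 0.032, "AspLys": 2.504}
labels = {name: classify(value) for name, value in sample.items()}
```

A stricter comparison could also take the bootstrap error into account, for instance by treating values within one error of 2.5‰ as neither over- nor underrepresented; that refinement is not part of this sketch.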

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
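For example, the unit conversion can be sketched as follows. The helper names are hypothetical, and 4.792‰ is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 4.792  # AlaAla frequency in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.004792
percent = per_mille_to_percent(ala_ala)    # about 0.4792 %
```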

For more information, see the sequence space article on Wikipedia.

Amino acid order (rows = first residue, columns = second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.792 ± 0.118
AlaCys: 0.451 ± 0.031
AlaAsp: 3.065 ± 0.079
AlaGlu: 3.799 ± 0.102
AlaPhe: 3.283 ± 0.097
AlaGly: 5.416 ± 0.095
AlaHis: 0.961 ± 0.051
AlaIle: 5.917 ± 0.124
AlaLys: 3.326 ± 0.082
AlaLeu: 6.368 ± 0.125
AlaMet: 2.284 ± 0.07
AlaAsn: 2.443 ± 0.078
AlaPro: 1.85 ± 0.051
AlaGln: 1.401 ± 0.055
AlaArg: 3.316 ± 0.089
AlaSer: 5.023 ± 0.117
AlaThr: 3.272 ± 0.09
AlaVal: 5.226 ± 0.105
AlaTrp: 0.537 ± 0.033
AlaTyr: 2.331 ± 0.066
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.389 ± 0.031
CysCys: 0.056 ± 0.011
CysAsp: 0.378 ± 0.026
CysGlu: 0.35 ± 0.029
CysPhe: 0.259 ± 0.02
CysGly: 0.784 ± 0.047
CysHis: 0.166 ± 0.016
CysIle: 0.419 ± 0.028
CysLys: 0.322 ± 0.024
CysLeu: 0.399 ± 0.029
CysMet: 0.155 ± 0.016
CysAsn: 0.348 ± 0.029
CysPro: 0.425 ± 0.037
CysGln: 0.171 ± 0.017
CysArg: 0.436 ± 0.029
CysSer: 0.535 ± 0.038
CysThr: 0.402 ± 0.03
CysVal: 0.408 ± 0.028
CysTrp: 0.032 ± 0.009
CysTyr: 0.166 ± 0.019
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.283 ± 0.101
AspCys: 0.326 ± 0.025
AspAsp: 2.392 ± 0.069
AspGlu: 3.538 ± 0.083
AspPhe: 2.674 ± 0.071
AspGly: 3.531 ± 0.084
AspHis: 1.027 ± 0.046
AspIle: 4.516 ± 0.095
AspLys: 2.504 ± 0.071
AspLeu: 5.168 ± 0.125
AspMet: 1.468 ± 0.053
AspAsn: 1.831 ± 0.055
AspPro: 2.465 ± 0.085
AspGln: 1.312 ± 0.048
AspArg: 3.011 ± 0.09
AspSer: 3.899 ± 0.091
AspThr: 2.489 ± 0.07
AspVal: 3.404 ± 0.084
AspTrp: 0.419 ± 0.025
AspTyr: 2.116 ± 0.066
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.965 ± 0.097
GluCys: 0.384 ± 0.028
GluAsp: 3.139 ± 0.086
GluGlu: 4.194 ± 0.115
GluPhe: 2.864 ± 0.077
GluGly: 3.449 ± 0.088
GluHis: 0.959 ± 0.041
GluIle: 6.228 ± 0.123
GluLys: 5.379 ± 0.12
GluLeu: 5.306 ± 0.118
GluMet: 1.919 ± 0.069
GluAsn: 3.523 ± 0.084
GluPro: 1.839 ± 0.061
GluGln: 1.373 ± 0.044
GluArg: 3.372 ± 0.081
GluSer: 4.365 ± 0.105
GluThr: 3.236 ± 0.082
GluVal: 3.834 ± 0.1
GluTrp: 0.568 ± 0.034
GluTyr: 2.338 ± 0.07
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.806 ± 0.084
PheCys: 0.298 ± 0.026
PheAsp: 2.266 ± 0.072
PheGlu: 2.254 ± 0.061
PhePhe: 2.422 ± 0.089
PheGly: 3.573 ± 0.093
PheHis: 0.859 ± 0.04
PheIle: 3.775 ± 0.107
PheLys: 1.93 ± 0.057
PheLeu: 4.723 ± 0.128
PheMet: 1.319 ± 0.05
PheAsn: 2.098 ± 0.07
PhePro: 2.018 ± 0.054
PheGln: 1.233 ± 0.051
PheArg: 2.552 ± 0.069
PheSer: 4.674 ± 0.103
PheThr: 2.804 ± 0.078
PheVal: 3.314 ± 0.08
PheTrp: 0.434 ± 0.029
PheTyr: 2.003 ± 0.068
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.473 ± 0.105
GlyCys: 0.49 ± 0.032
GlyAsp: 3.391 ± 0.082
GlyGlu: 4.054 ± 0.092
GlyPhe: 3.7 ± 0.089
GlyGly: 5.097 ± 0.119
GlyHis: 1.25 ± 0.052
GlyIle: 7.436 ± 0.121
GlyLys: 6.107 ± 0.122
GlyLeu: 6.271 ± 0.106
GlyMet: 2.362 ± 0.063
GlyAsn: 3.801 ± 0.101
GlyPro: 2.018 ± 0.063
GlyGln: 1.705 ± 0.056
GlyArg: 3.581 ± 0.092
GlySer: 6.126 ± 0.107
GlyThr: 4.615 ± 0.117
GlyVal: 4.85 ± 0.102
GlyTrp: 0.633 ± 0.037
GlyTyr: 3.24 ± 0.075
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.987 ± 0.047
HisCys: 0.149 ± 0.017
HisAsp: 0.892 ± 0.041
HisGlu: 0.935 ± 0.047
HisPhe: 0.85 ± 0.043
HisGly: 1.436 ± 0.056
HisHis: 0.348 ± 0.029
HisIle: 1.232 ± 0.049
HisLys: 0.715 ± 0.038
HisLeu: 1.468 ± 0.053
HisMet: 0.442 ± 0.027
HisAsn: 0.745 ± 0.037
HisPro: 0.838 ± 0.038
HisGln: 0.361 ± 0.026
HisArg: 0.81 ± 0.039
HisSer: 1.241 ± 0.052
HisThr: 0.822 ± 0.043
HisVal: 1.198 ± 0.044
HisTrp: 0.149 ± 0.016
HisTyr: 0.723 ± 0.033
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.023 ± 0.115
IleCys: 0.516 ± 0.029
IleAsp: 4.438 ± 0.093
IleGlu: 4.924 ± 0.108
IlePhe: 4.134 ± 0.11
IleGly: 6.258 ± 0.133
IleHis: 1.235 ± 0.048
IleIle: 6.614 ± 0.132
IleLys: 4.201 ± 0.1
IleLeu: 7.859 ± 0.157
IleMet: 2.292 ± 0.062
IleAsn: 3.709 ± 0.092
IlePro: 3.866 ± 0.093
IleGln: 1.764 ± 0.065
IleArg: 4.497 ± 0.112
IleSer: 7.914 ± 0.148
IleThr: 4.499 ± 0.091
IleVal: 5.919 ± 0.111
IleTrp: 0.557 ± 0.033
IleTyr: 2.88 ± 0.067
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.59 ± 0.085
LysCys: 0.52 ± 0.035
LysAsp: 3.173 ± 0.076
LysGlu: 4.507 ± 0.105
LysPhe: 2.683 ± 0.083
LysGly: 3.899 ± 0.096
LysHis: 0.956 ± 0.039
LysIle: 5.31 ± 0.118
LysLys: 4.788 ± 0.1
LysLeu: 5.287 ± 0.106
LysMet: 1.882 ± 0.054
LysAsn: 3.18 ± 0.082
LysPro: 2.148 ± 0.069
LysGln: 1.436 ± 0.049
LysArg: 3.244 ± 0.091
LysSer: 4.041 ± 0.093
LysThr: 3.162 ± 0.084
LysVal: 3.823 ± 0.09
LysTrp: 0.572 ± 0.033
LysTyr: 2.579 ± 0.068
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.137 ± 0.106
LeuCys: 0.561 ± 0.03
LeuAsp: 4.738 ± 0.09
LeuGlu: 5.306 ± 0.091
LeuPhe: 4.119 ± 0.102
LeuGly: 6.636 ± 0.129
LeuHis: 1.358 ± 0.052
LeuIle: 7.035 ± 0.128
LeuLys: 5.858 ± 0.11
LeuLeu: 7.959 ± 0.176
LeuMet: 2.459 ± 0.075
LeuAsn: 4.287 ± 0.101
LeuPro: 3.564 ± 0.082
LeuGln: 2.344 ± 0.07
LeuArg: 4.348 ± 0.106
LeuSer: 8.281 ± 0.13
LeuThr: 4.589 ± 0.111
LeuVal: 6.251 ± 0.131
LeuTrp: 0.725 ± 0.038
LeuTyr: 3.229 ± 0.079
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.109 ± 0.064
MetCys: 0.145 ± 0.016
MetAsp: 1.813 ± 0.06
MetGlu: 2.13 ± 0.069
MetPhe: 1.084 ± 0.048
MetGly: 2.266 ± 0.069
MetHis: 0.496 ± 0.033
MetIle: 2.333 ± 0.073
MetLys: 2.53 ± 0.072
MetLeu: 2.394 ± 0.073
MetMet: 0.728 ± 0.037
MetAsn: 1.504 ± 0.06
MetPro: 1.222 ± 0.047
MetGln: 0.751 ± 0.034
MetArg: 1.353 ± 0.055
MetSer: 1.995 ± 0.065
MetThr: 1.494 ± 0.061
MetVal: 1.96 ± 0.061
MetTrp: 0.218 ± 0.019
MetTyr: 0.805 ± 0.037
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.124 ± 0.07
AsnCys: 0.324 ± 0.027
AsnAsp: 1.954 ± 0.063
AsnGlu: 2.599 ± 0.086
AsnPhe: 2.264 ± 0.07
AsnGly: 3.862 ± 0.11
AsnHis: 0.736 ± 0.037
AsnIle: 3.784 ± 0.096
AsnLys: 1.915 ± 0.064
AsnLeu: 4.382 ± 0.089
AsnMet: 1.313 ± 0.052
AsnAsn: 2.046 ± 0.075
AsnPro: 2.353 ± 0.066
AsnGln: 1.043 ± 0.045
AsnArg: 2.342 ± 0.068
AsnSer: 3.784 ± 0.106
AsnThr: 2.344 ± 0.075
AsnVal: 3.324 ± 0.078
AsnTrp: 0.462 ± 0.028
AsnTyr: 2.051 ± 0.076
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.564 ± 0.075
ProCys: 0.216 ± 0.02
ProAsp: 2.502 ± 0.071
ProGlu: 3.335 ± 0.088
ProPhe: 1.861 ± 0.065
ProGly: 3.132 ± 0.088
ProHis: 0.702 ± 0.039
ProIle: 2.515 ± 0.061
ProLys: 1.69 ± 0.064
ProLeu: 3.406 ± 0.092
ProMet: 1.004 ± 0.045
ProAsn: 1.446 ± 0.058
ProPro: 1.353 ± 0.057
ProGln: 0.939 ± 0.046
ProArg: 1.451 ± 0.05
ProSer: 2.892 ± 0.082
ProThr: 1.744 ± 0.064
ProVal: 3.424 ± 0.077
ProTrp: 0.402 ± 0.029
ProTyr: 1.729 ± 0.062
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.571 ± 0.052
GlnCys: 0.143 ± 0.015
GlnAsp: 1.177 ± 0.046
GlnGlu: 1.6 ± 0.054
GlnPhe: 1.09 ± 0.043
GlnGly: 1.518 ± 0.05
GlnHis: 0.341 ± 0.029
GlnIle: 2.018 ± 0.064
GlnLys: 1.749 ± 0.048
GlnLeu: 1.9 ± 0.063
GlnMet: 0.846 ± 0.045
GlnAsn: 1.261 ± 0.054
GlnPro: 0.768 ± 0.036
GlnGln: 0.663 ± 0.042
GlnArg: 1.224 ± 0.043
GlnSer: 1.679 ± 0.049
GlnThr: 1.17 ± 0.054
GlnVal: 1.587 ± 0.051
GlnTrp: 0.298 ± 0.024
GlnTyr: 0.978 ± 0.044
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.631 ± 0.077
ArgCys: 0.35 ± 0.03
ArgAsp: 2.88 ± 0.084
ArgGlu: 4.021 ± 0.11
ArgPhe: 2.118 ± 0.065
ArgGly: 3.234 ± 0.076
ArgHis: 0.855 ± 0.043
ArgIle: 4.717 ± 0.109
ArgLys: 4.628 ± 0.104
ArgLeu: 3.983 ± 0.087
ArgMet: 1.772 ± 0.06
ArgAsn: 2.826 ± 0.084
ArgPro: 1.422 ± 0.045
ArgGln: 1.118 ± 0.047
ArgArg: 2.793 ± 0.079
ArgSer: 3.59 ± 0.085
ArgThr: 2.552 ± 0.064
ArgVal: 3.355 ± 0.089
ArgTrp: 0.466 ± 0.032
ArgTyr: 2.046 ± 0.066
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.422 ± 0.111
SerCys: 0.455 ± 0.032
SerAsp: 4.084 ± 0.084
SerGlu: 4.803 ± 0.11
SerPhe: 3.745 ± 0.09
SerGly: 7.508 ± 0.131
SerHis: 1.233 ± 0.045
SerIle: 6.847 ± 0.127
SerLys: 4.17 ± 0.095
SerLeu: 7.324 ± 0.145
SerMet: 2.502 ± 0.065
SerAsn: 3.104 ± 0.088
SerPro: 2.899 ± 0.067
SerGln: 2.01 ± 0.064
SerArg: 4.471 ± 0.097
SerSer: 6.562 ± 0.137
SerThr: 4.3 ± 0.105
SerVal: 5.872 ± 0.105
SerTrp: 0.768 ± 0.04
SerTyr: 3.143 ± 0.078
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.661 ± 0.083
ThrCys: 0.287 ± 0.027
ThrAsp: 2.729 ± 0.069
ThrGlu: 3.018 ± 0.084
ThrPhe: 2.433 ± 0.07
ThrGly: 5.308 ± 0.095
ThrHis: 0.846 ± 0.04
ThrIle: 4.168 ± 0.085
ThrLys: 2.435 ± 0.069
ThrLeu: 4.881 ± 0.103
ThrMet: 1.485 ± 0.047
ThrAsn: 2.081 ± 0.08
ThrPro: 2.333 ± 0.071
ThrGln: 1.164 ± 0.047
ThrArg: 2.459 ± 0.063
ThrSer: 3.877 ± 0.09
ThrThr: 2.8 ± 0.078
ThrVal: 4.367 ± 0.1
ThrTrp: 0.442 ± 0.03
ThrTyr: 2.118 ± 0.08
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.587 ± 0.096
ValCys: 0.501 ± 0.032
ValAsp: 3.732 ± 0.096
ValGlu: 4.112 ± 0.09
ValPhe: 3.38 ± 0.092
ValGly: 4.544 ± 0.112
ValHis: 1.122 ± 0.044
ValIle: 5.744 ± 0.124
ValLys: 4.263 ± 0.097
ValLeu: 6.038 ± 0.115
ValMet: 1.953 ± 0.063
ValAsn: 3.296 ± 0.085
ValPro: 2.955 ± 0.072
ValGln: 1.595 ± 0.054
ValArg: 3.501 ± 0.091
ValSer: 6.536 ± 0.095
ValThr: 4.058 ± 0.103
ValVal: 5.045 ± 0.111
ValTrp: 0.548 ± 0.035
ValTyr: 2.843 ± 0.079
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.49 ± 0.028
TrpCys: 0.089 ± 0.014
TrpAsp: 0.458 ± 0.033
TrpGlu: 0.503 ± 0.031
TrpPhe: 0.374 ± 0.032
TrpGly: 0.576 ± 0.034
TrpHis: 0.155 ± 0.018
TrpIle: 0.827 ± 0.046
TrpLys: 0.702 ± 0.036
TrpLeu: 0.784 ± 0.038
TrpMet: 0.237 ± 0.021
TrpAsn: 0.507 ± 0.033
TrpPro: 0.279 ± 0.026
TrpGln: 0.27 ± 0.021
TrpArg: 0.436 ± 0.029
TrpSer: 0.497 ± 0.029
TrpThr: 0.415 ± 0.027
TrpVal: 0.592 ± 0.038
TrpTrp: 0.091 ± 0.014
TrpTyr: 0.395 ± 0.026
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.456 ± 0.073
TyrCys: 0.341 ± 0.029
TyrAsp: 2.17 ± 0.065
TyrGlu: 2.252 ± 0.066
TyrPhe: 2.02 ± 0.064
TyrGly: 3.171 ± 0.075
TyrHis: 0.738 ± 0.036
TyrIle: 2.668 ± 0.076
TyrLys: 1.567 ± 0.051
TyrLeu: 3.883 ± 0.093
TyrMet: 0.905 ± 0.037
TyrAsn: 1.971 ± 0.081
TyrPro: 1.694 ± 0.063
TyrGln: 0.928 ± 0.041
TyrArg: 2.159 ± 0.07
TyrSer: 3.709 ± 0.097
TyrThr: 2.126 ± 0.067
TyrVal: 2.595 ± 0.076
TyrTrp: 0.36 ± 0.027
TyrTyr: 1.727 ± 0.057
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1928 proteins (536744 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
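Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values serves as the error. This is only an illustration under stated assumptions. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the reported ± value are all hypothetical and not confirmed by the source.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide frequency.

    Whole proteins are resampled with replacement, so residues from the
    same protein always stay together within a replicate.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of replicates.
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAKL", "MALLS", "MKAAG", "MSLAA"]
mean, err = bootstrap_error(toy, "AA")
```

Resampling at the protein level rather than the residue level keeps the dependence between neighbouring residues of the same protein intact, which is why the unit drawn in each replicate is a whole sequence.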

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski