Amino acid dipeptide frequency for Halobaculum gomorrense

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
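As a rough illustration of how such a frequency could be computed, here is a minimal Python sketch. It assumes one-letter sequences, overlapping pairs counted within each protein, and every non-standard letter mapped to X (Xaa). The function name dipeptide_permille and the toy sequences are hypothetical; the page does not state how the database itself does the counting.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real Halobaculum gomorrense sequences).
toy = ["MAAGV", "MAAU"]
freqs = dipeptide_permille(toy)
```

Because the pairs are ordered, AlaGly and GlyAla are counted separately, which matches the table reporting different values for the two.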

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
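The arithmetic behind the uniform expectation and the per mille unit can be sketched in a few lines of Python. The three observed values are copied from the table on this page; the classify helper and its labels are hypothetical, and treating the uniform 2.5‰ value as the colouring threshold is an assumption based on the description above.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Values taken from the table (per mille).
observed = {"AlaAla": 17.842, "ProPro": 2.278, "CysCys": 0.079}
labels = {pair: classify(value) for pair, value in observed.items()}

# Converting per mille to a plain fraction: multiply by 10^-3.
ala_ala_fraction = observed["AlaAla"] * 1e-3
```

Under this assumption AlaAla would be flagged as overrepresented, while ProPro and CysCys would fall below the expectation.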

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.842 ± 0.258
AlaCys: 0.815 ± 0.034
AlaAsp: 11.966 ± 0.147
AlaGlu: 9.569 ± 0.139
AlaPhe: 4.317 ± 0.068
AlaGly: 12.182 ± 0.159
AlaHis: 1.897 ± 0.048
AlaIle: 4.051 ± 0.074
AlaLys: 1.739 ± 0.051
AlaLeu: 11.454 ± 0.159
AlaMet: 2.162 ± 0.048
AlaAsn: 2.469 ± 0.052
AlaPro: 4.732 ± 0.09
AlaGln: 1.964 ± 0.044
AlaArg: 7.229 ± 0.099
AlaSer: 5.848 ± 0.085
AlaThr: 7.561 ± 0.104
AlaVal: 13.058 ± 0.188
AlaTrp: 1.204 ± 0.04
AlaTyr: 2.865 ± 0.059
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.663 ± 0.029
CysCys: 0.079 ± 0.009
CysAsp: 0.604 ± 0.026
CysGlu: 0.555 ± 0.025
CysPhe: 0.191 ± 0.014
CysGly: 0.928 ± 0.035
CysHis: 0.172 ± 0.014
CysIle: 0.229 ± 0.015
CysLys: 0.095 ± 0.011
CysLeu: 0.517 ± 0.023
CysMet: 0.107 ± 0.012
CysAsn: 0.165 ± 0.014
CysPro: 0.549 ± 0.026
CysGln: 0.132 ± 0.013
CysArg: 0.462 ± 0.024
CysSer: 0.411 ± 0.02
CysThr: 0.368 ± 0.019
CysVal: 0.534 ± 0.026
CysTrp: 0.07 ± 0.008
CysTyr: 0.146 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 12.917 ± 0.184
AspCys: 0.605 ± 0.025
AspAsp: 8.579 ± 0.135
AspGlu: 7.631 ± 0.118
AspPhe: 1.891 ± 0.046
AspGly: 9.554 ± 0.139
AspHis: 1.858 ± 0.045
AspIle: 2.919 ± 0.062
AspLys: 0.884 ± 0.034
AspLeu: 7.638 ± 0.098
AspMet: 1.22 ± 0.036
AspAsn: 1.132 ± 0.041
AspPro: 5.242 ± 0.088
AspGln: 1.414 ± 0.041
AspArg: 7.164 ± 0.095
AspSer: 3.696 ± 0.069
AspThr: 4.177 ± 0.066
AspVal: 6.74 ± 0.094
AspTrp: 1.031 ± 0.034
AspTyr: 1.722 ± 0.047
AspXaa: 0.0 ± 0.0
Glu
GluAla: 9.486 ± 0.122
GluCys: 0.548 ± 0.027
GluAsp: 4.905 ± 0.092
GluGlu: 6.472 ± 0.13
GluPhe: 2.93 ± 0.056
GluGly: 5.701 ± 0.093
GluHis: 2.046 ± 0.054
GluIle: 3.023 ± 0.074
GluLys: 1.508 ± 0.046
GluLeu: 6.788 ± 0.104
GluMet: 1.668 ± 0.048
GluAsn: 1.778 ± 0.049
GluPro: 3.785 ± 0.077
GluGln: 2.195 ± 0.049
GluArg: 7.837 ± 0.106
GluSer: 4.72 ± 0.085
GluThr: 5.725 ± 0.087
GluVal: 6.173 ± 0.094
GluTrp: 1.084 ± 0.041
GluTyr: 2.456 ± 0.053
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.988 ± 0.067
PheCys: 0.272 ± 0.017
PheAsp: 3.252 ± 0.066
PheGlu: 2.865 ± 0.062
PhePhe: 0.948 ± 0.041
PheGly: 3.205 ± 0.061
PheHis: 0.639 ± 0.025
PheIle: 0.869 ± 0.035
PheLys: 0.405 ± 0.023
PheLeu: 2.712 ± 0.066
PheMet: 0.467 ± 0.021
PheAsn: 0.597 ± 0.028
PhePro: 1.311 ± 0.035
PheGln: 0.616 ± 0.027
PheArg: 1.868 ± 0.049
PheSer: 1.659 ± 0.05
PheThr: 1.854 ± 0.044
PheVal: 3.098 ± 0.07
PheTrp: 0.346 ± 0.022
PheTyr: 0.78 ± 0.03
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.281 ± 0.148
GlyCys: 0.703 ± 0.029
GlyAsp: 8.72 ± 0.123
GlyGlu: 7.342 ± 0.097
GlyPhe: 3.189 ± 0.067
GlyGly: 9.13 ± 0.134
GlyHis: 1.737 ± 0.051
GlyIle: 3.598 ± 0.077
GlyLys: 1.663 ± 0.044
GlyLeu: 7.353 ± 0.106
GlyMet: 1.673 ± 0.047
GlyAsn: 1.825 ± 0.05
GlyPro: 3.677 ± 0.07
GlyGln: 1.808 ± 0.05
GlyArg: 5.529 ± 0.092
GlySer: 5.245 ± 0.081
GlyThr: 6.008 ± 0.084
GlyVal: 9.102 ± 0.119
GlyTrp: 1.162 ± 0.043
GlyTyr: 2.641 ± 0.058
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.178 ± 0.052
HisCys: 0.183 ± 0.015
HisAsp: 1.804 ± 0.052
HisGlu: 1.684 ± 0.044
HisPhe: 0.517 ± 0.026
HisGly: 2.016 ± 0.049
HisHis: 0.503 ± 0.028
HisIle: 0.556 ± 0.025
HisLys: 0.283 ± 0.018
HisLeu: 1.653 ± 0.051
HisMet: 0.251 ± 0.017
HisAsn: 0.429 ± 0.025
HisPro: 1.226 ± 0.03
HisGln: 0.347 ± 0.02
HisArg: 1.27 ± 0.038
HisSer: 0.84 ± 0.033
HisThr: 1.04 ± 0.04
HisVal: 1.867 ± 0.049
HisTrp: 0.229 ± 0.019
HisTyr: 0.5 ± 0.023
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.499 ± 0.071
IleCys: 0.25 ± 0.015
IleAsp: 3.584 ± 0.07
IleGlu: 3.404 ± 0.068
IlePhe: 0.671 ± 0.03
IleGly: 3.414 ± 0.061
IleHis: 0.704 ± 0.029
IleIle: 0.863 ± 0.037
IleLys: 0.592 ± 0.028
IleLeu: 2.191 ± 0.063
IleMet: 0.402 ± 0.02
IleAsn: 0.801 ± 0.032
IlePro: 1.81 ± 0.048
IleGln: 0.779 ± 0.031
IleArg: 2.339 ± 0.055
IleSer: 1.641 ± 0.043
IleThr: 1.982 ± 0.051
IleVal: 3.171 ± 0.068
IleTrp: 0.233 ± 0.014
IleTyr: 0.678 ± 0.027
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.684 ± 0.047
LysCys: 0.101 ± 0.01
LysAsp: 0.892 ± 0.038
LysGlu: 1.132 ± 0.041
LysPhe: 0.483 ± 0.023
LysGly: 1.248 ± 0.039
LysHis: 0.41 ± 0.02
LysIle: 0.571 ± 0.025
LysLys: 0.451 ± 0.026
LysLeu: 1.551 ± 0.037
LysMet: 0.316 ± 0.018
LysAsn: 0.443 ± 0.022
LysPro: 0.917 ± 0.035
LysGln: 0.673 ± 0.028
LysArg: 1.625 ± 0.046
LysSer: 0.93 ± 0.035
LysThr: 1.187 ± 0.039
LysVal: 1.066 ± 0.04
LysTrp: 0.173 ± 0.015
LysTyr: 0.514 ± 0.025
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.214 ± 0.15
LeuCys: 0.562 ± 0.025
LeuAsp: 7.716 ± 0.112
LeuGlu: 5.444 ± 0.09
LeuPhe: 2.926 ± 0.057
LeuGly: 7.84 ± 0.118
LeuHis: 1.462 ± 0.044
LeuIle: 2.615 ± 0.058
LeuLys: 1.441 ± 0.045
LeuLeu: 8.092 ± 0.125
LeuMet: 1.189 ± 0.041
LeuAsn: 1.701 ± 0.046
LeuPro: 4.239 ± 0.084
LeuGln: 1.721 ± 0.048
LeuArg: 6.002 ± 0.08
LeuSer: 5.398 ± 0.087
LeuThr: 5.003 ± 0.066
LeuVal: 8.368 ± 0.124
LeuTrp: 0.82 ± 0.034
LeuTyr: 1.963 ± 0.045
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.773 ± 0.049
MetCys: 0.105 ± 0.011
MetAsp: 1.221 ± 0.038
MetGlu: 1.047 ± 0.032
MetPhe: 0.458 ± 0.025
MetGly: 1.353 ± 0.037
MetHis: 0.332 ± 0.021
MetIle: 0.536 ± 0.029
MetLys: 0.399 ± 0.023
MetLeu: 1.455 ± 0.039
MetMet: 0.274 ± 0.018
MetAsn: 0.579 ± 0.024
MetPro: 0.79 ± 0.03
MetGln: 0.431 ± 0.026
MetArg: 1.173 ± 0.034
MetSer: 1.551 ± 0.04
MetThr: 1.476 ± 0.042
MetVal: 1.105 ± 0.037
MetTrp: 0.148 ± 0.013
MetTyr: 0.377 ± 0.021
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.582 ± 0.053
AsnCys: 0.199 ± 0.015
AsnAsp: 1.59 ± 0.042
AsnGlu: 1.473 ± 0.049
AsnPhe: 0.587 ± 0.025
AsnGly: 1.952 ± 0.049
AsnHis: 0.433 ± 0.022
AsnIle: 0.707 ± 0.034
AsnLys: 0.399 ± 0.023
AsnLeu: 1.693 ± 0.049
AsnMet: 0.365 ± 0.02
AsnAsn: 0.49 ± 0.027
AsnPro: 1.438 ± 0.04
AsnGln: 0.488 ± 0.02
AsnArg: 1.473 ± 0.037
AsnSer: 0.891 ± 0.032
AsnThr: 1.202 ± 0.04
AsnVal: 2.061 ± 0.049
AsnTrp: 0.278 ± 0.02
AsnTyr: 0.592 ± 0.028
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.639 ± 0.085
ProCys: 0.252 ± 0.014
ProAsp: 4.92 ± 0.081
ProGlu: 4.676 ± 0.079
ProPhe: 1.588 ± 0.038
ProGly: 4.205 ± 0.079
ProHis: 0.912 ± 0.034
ProIle: 1.571 ± 0.044
ProLys: 0.792 ± 0.029
ProLeu: 3.748 ± 0.066
ProMet: 0.854 ± 0.031
ProAsn: 1.074 ± 0.038
ProPro: 2.278 ± 0.057
ProGln: 0.883 ± 0.031
ProArg: 2.426 ± 0.053
ProSer: 2.625 ± 0.05
ProThr: 3.247 ± 0.075
ProVal: 4.46 ± 0.075
ProTrp: 0.497 ± 0.023
ProTyr: 1.127 ± 0.035
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.223 ± 0.045
GlnCys: 0.134 ± 0.012
GlnAsp: 0.997 ± 0.036
GlnGlu: 1.373 ± 0.047
GlnPhe: 0.884 ± 0.03
GlnGly: 1.434 ± 0.04
GlnHis: 0.438 ± 0.021
GlnIle: 0.966 ± 0.038
GlnLys: 0.462 ± 0.026
GlnLeu: 1.883 ± 0.05
GlnMet: 0.392 ± 0.023
GlnAsn: 0.585 ± 0.023
GlnPro: 0.921 ± 0.029
GlnGln: 0.802 ± 0.036
GlnArg: 1.78 ± 0.051
GlnSer: 1.241 ± 0.032
GlnThr: 1.364 ± 0.044
GlnVal: 1.623 ± 0.041
GlnTrp: 0.314 ± 0.019
GlnTyr: 0.673 ± 0.029
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.666 ± 0.109
ArgCys: 0.488 ± 0.022
ArgAsp: 5.603 ± 0.08
ArgGlu: 7.19 ± 0.101
ArgPhe: 2.393 ± 0.055
ArgGly: 5.135 ± 0.085
ArgHis: 1.231 ± 0.036
ArgIle: 2.758 ± 0.053
ArgLys: 1.244 ± 0.039
ArgLeu: 6.275 ± 0.093
ArgMet: 1.429 ± 0.043
ArgAsn: 1.439 ± 0.043
ArgPro: 2.828 ± 0.058
ArgGln: 1.473 ± 0.049
ArgArg: 5.457 ± 0.094
ArgSer: 3.579 ± 0.062
ArgThr: 4.107 ± 0.061
ArgVal: 6.386 ± 0.101
ArgTrp: 0.783 ± 0.034
ArgTyr: 1.946 ± 0.047
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.941 ± 0.075
SerCys: 0.289 ± 0.02
SerAsp: 4.479 ± 0.075
SerGlu: 3.861 ± 0.072
SerPhe: 1.796 ± 0.044
SerGly: 5.436 ± 0.086
SerHis: 0.935 ± 0.034
SerIle: 1.908 ± 0.04
SerLys: 1.042 ± 0.035
SerLeu: 4.664 ± 0.071
SerMet: 0.999 ± 0.036
SerAsn: 1.141 ± 0.037
SerPro: 2.582 ± 0.057
SerGln: 1.103 ± 0.036
SerArg: 3.147 ± 0.061
SerSer: 2.678 ± 0.07
SerThr: 3.458 ± 0.066
SerVal: 5.014 ± 0.081
SerTrp: 0.573 ± 0.026
SerTyr: 1.265 ± 0.035
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.434 ± 0.095
ThrCys: 0.344 ± 0.022
ThrAsp: 5.743 ± 0.074
ThrGlu: 4.391 ± 0.072
ThrPhe: 1.954 ± 0.044
ThrGly: 5.985 ± 0.085
ThrHis: 1.142 ± 0.036
ThrIle: 2.158 ± 0.05
ThrLys: 0.921 ± 0.034
ThrLeu: 5.808 ± 0.09
ThrMet: 0.96 ± 0.028
ThrAsn: 1.358 ± 0.044
ThrPro: 3.324 ± 0.073
ThrGln: 1.112 ± 0.04
ThrArg: 3.475 ± 0.059
ThrSer: 2.561 ± 0.06
ThrThr: 3.839 ± 0.073
ThrVal: 7.042 ± 0.091
ThrTrp: 0.614 ± 0.026
ThrTyr: 1.602 ± 0.044
ThrXaa: 0.0 ± 0.0
Val
ValAla: 12.031 ± 0.181
ValCys: 0.74 ± 0.027
ValAsp: 8.255 ± 0.108
ValGlu: 7.707 ± 0.097
ValPhe: 2.91 ± 0.053
ValGly: 8.499 ± 0.118
ValHis: 1.708 ± 0.044
ValIle: 3.125 ± 0.068
ValLys: 1.419 ± 0.04
ValLeu: 7.777 ± 0.108
ValMet: 1.284 ± 0.038
ValAsn: 2.009 ± 0.047
ValPro: 4.497 ± 0.071
ValGln: 1.627 ± 0.044
ValArg: 6.307 ± 0.106
ValSer: 5.124 ± 0.081
ValThr: 6.101 ± 0.099
ValVal: 9.456 ± 0.131
ValTrp: 0.816 ± 0.032
ValTyr: 2.107 ± 0.044
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.048 ± 0.034
TrpCys: 0.088 ± 0.01
TrpAsp: 0.73 ± 0.029
TrpGlu: 0.805 ± 0.034
TrpPhe: 0.378 ± 0.023
TrpGly: 0.877 ± 0.027
TrpHis: 0.226 ± 0.016
TrpIle: 0.403 ± 0.024
TrpLys: 0.218 ± 0.015
TrpLeu: 1.123 ± 0.044
TrpMet: 0.209 ± 0.016
TrpAsn: 0.337 ± 0.022
TrpPro: 0.431 ± 0.023
TrpGln: 0.286 ± 0.017
TrpArg: 0.936 ± 0.031
TrpSer: 0.577 ± 0.028
TrpThr: 0.753 ± 0.029
TrpVal: 0.886 ± 0.035
TrpTrp: 0.181 ± 0.014
TrpTyr: 0.367 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.836 ± 0.057
TyrCys: 0.219 ± 0.015
TyrAsp: 2.457 ± 0.054
TyrGlu: 2.208 ± 0.054
TyrPhe: 0.742 ± 0.032
TyrGly: 2.291 ± 0.047
TyrHis: 0.587 ± 0.023
TyrIle: 0.581 ± 0.027
TyrLys: 0.41 ± 0.022
TyrLeu: 2.341 ± 0.056
TyrMet: 0.36 ± 0.018
TyrAsn: 0.54 ± 0.023
TyrPro: 1.197 ± 0.035
TyrGln: 0.637 ± 0.026
TyrArg: 1.912 ± 0.048
TyrSer: 1.099 ± 0.034
TyrThr: 1.313 ± 0.04
TyrVal: 2.288 ± 0.044
TyrTrp: 0.302 ± 0.018
TyrTyr: 0.742 ± 0.029
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3149 proteins (922802 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
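To make the note concrete, the following Python sketch shows one way a protein-level bootstrap could work: whole proteins are resampled with replacement 100 times, the per mille frequency of one dipeptide is recomputed for each replicate, and the spread of those values is reported. Treating the reported error as the standard deviation across replicates is an assumption, as are the function names, the fixed seed and the toy one-letter sequences; the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille
    frequency over n_boot protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter code).
toy = ["MAAGV", "MAAVL", "MGVLA", "MAAAA"]
mean, err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.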

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski