Amino acid dipeptide frequency for Haloarculaceae archaeon HArcel1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
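As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. The function name dipeptide_per_mille, the toy sequences, and the choice of one-letter codes are assumptions made for this example; the page does not describe how the database itself performs the count.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies (per mille) for a list of sequences.

    Assumption: pairs are counted in overlapping windows of length 2
    and never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (one-letter codes), not data from this page.
toy_proteome = ["MAAL", "MAV"]
freqs = dipeptide_per_mille(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(pair, value)
```

The toy proteome contains five pairs in total (MA, AA, AL from the first sequence; MA, AV from the second), so MA accounts for two fifths of them.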

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
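The comparison against the uniform expectation can be sketched as follows. The three observed values are copied from the table on this page; the function name classify and the strict greater-than/less-than rule are assumptions for illustration, since the page does not state the exact threshold used for the red and blue marking.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa

# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)

# Observed frequencies in per mille, copied from the table.
observed = {"AlaAla": 14.695, "CysCys": 0.065, "ProPro": 2.492}


def classify(value, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


labels = {pair: classify(value) for pair, value in observed.items()}

# Per mille values become plain fractions after multiplying by 1e-3.
fractions = {pair: value * 1e-3 for pair, value in observed.items()}

for pair in observed:
    print(pair, labels[pair], fractions[pair])
```

With 21 symbols (Xaa included) the same calculation gives an expectation of 1000 / 441, roughly 2.27‰.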

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.695 ± 0.222
AlaCys: 0.668 ± 0.03
AlaAsp: 11.162 ± 0.15
AlaGlu: 7.324 ± 0.125
AlaPhe: 3.641 ± 0.081
AlaGly: 10.186 ± 0.131
AlaHis: 2.143 ± 0.058
AlaIle: 7.006 ± 0.11
AlaLys: 1.502 ± 0.056
AlaLeu: 11.032 ± 0.166
AlaMet: 2.308 ± 0.057
AlaAsn: 2.387 ± 0.075
AlaPro: 4.508 ± 0.078
AlaGln: 2.255 ± 0.063
AlaArg: 6.804 ± 0.11
AlaSer: 6.137 ± 0.112
AlaThr: 8.546 ± 0.146
AlaVal: 11.205 ± 0.174
AlaTrp: 1.169 ± 0.045
AlaTyr: 2.601 ± 0.054
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.615 ± 0.031
CysCys: 0.065 ± 0.009
CysAsp: 0.577 ± 0.029
CysGlu: 0.577 ± 0.03
CysPhe: 0.163 ± 0.015
CysGly: 0.8 ± 0.04
CysHis: 0.17 ± 0.015
CysIle: 0.229 ± 0.016
CysLys: 0.094 ± 0.011
CysLeu: 0.487 ± 0.024
CysMet: 0.097 ± 0.012
CysAsn: 0.161 ± 0.016
CysPro: 0.524 ± 0.03
CysGln: 0.175 ± 0.016
CysArg: 0.576 ± 0.028
CysSer: 0.442 ± 0.028
CysThr: 0.399 ± 0.027
CysVal: 0.487 ± 0.027
CysTrp: 0.101 ± 0.012
CysTyr: 0.165 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 11.715 ± 0.166
AspCys: 0.816 ± 0.037
AspAsp: 8.325 ± 0.137
AspGlu: 6.919 ± 0.14
AspPhe: 1.869 ± 0.059
AspGly: 8.812 ± 0.162
AspHis: 2.278 ± 0.059
AspIle: 2.866 ± 0.082
AspLys: 0.707 ± 0.033
AspLeu: 8.168 ± 0.134
AspMet: 1.063 ± 0.042
AspAsn: 1.126 ± 0.05
AspPro: 6.449 ± 0.107
AspGln: 2.16 ± 0.054
AspArg: 10.113 ± 0.146
AspSer: 4.151 ± 0.087
AspThr: 4.202 ± 0.074
AspVal: 7.388 ± 0.111
AspTrp: 1.339 ± 0.052
AspTyr: 1.687 ± 0.048
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.099 ± 0.133
GluCys: 0.521 ± 0.026
GluAsp: 5.223 ± 0.11
GluGlu: 4.771 ± 0.109
GluPhe: 2.546 ± 0.057
GluGly: 5.093 ± 0.088
GluHis: 1.758 ± 0.047
GluIle: 3.804 ± 0.082
GluLys: 1.233 ± 0.047
GluLeu: 5.657 ± 0.108
GluMet: 1.7 ± 0.051
GluAsn: 1.888 ± 0.05
GluPro: 3.16 ± 0.085
GluGln: 2.412 ± 0.069
GluArg: 7.563 ± 0.104
GluSer: 5.272 ± 0.089
GluThr: 6.705 ± 0.114
GluVal: 4.731 ± 0.094
GluTrp: 1.098 ± 0.042
GluTyr: 2.407 ± 0.067
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.208 ± 0.068
PheCys: 0.215 ± 0.015
PheAsp: 3.326 ± 0.077
PheGlu: 3.194 ± 0.075
PhePhe: 0.857 ± 0.035
PheGly: 2.868 ± 0.07
PheHis: 0.566 ± 0.026
PheIle: 0.763 ± 0.034
PheLys: 0.409 ± 0.029
PheLeu: 2.158 ± 0.063
PheMet: 0.41 ± 0.025
PheAsn: 0.542 ± 0.028
PhePro: 1.2 ± 0.038
PheGln: 0.766 ± 0.03
PheArg: 1.522 ± 0.044
PheSer: 1.366 ± 0.042
PheThr: 1.59 ± 0.048
PheVal: 3.123 ± 0.067
PheTrp: 0.371 ± 0.02
PheTyr: 0.71 ± 0.035
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.835 ± 0.144
GlyCys: 0.586 ± 0.029
GlyAsp: 7.039 ± 0.114
GlyGlu: 6.171 ± 0.107
GlyPhe: 2.64 ± 0.06
GlyGly: 7.106 ± 0.136
GlyHis: 1.81 ± 0.051
GlyIle: 3.752 ± 0.076
GlyLys: 1.359 ± 0.041
GlyLeu: 7.62 ± 0.138
GlyMet: 1.458 ± 0.051
GlyAsn: 1.633 ± 0.054
GlyPro: 4.318 ± 0.087
GlyGln: 2.216 ± 0.06
GlyArg: 5.296 ± 0.093
GlySer: 5.401 ± 0.103
GlyThr: 6.252 ± 0.158
GlyVal: 7.744 ± 0.119
GlyTrp: 1.174 ± 0.045
GlyTyr: 2.272 ± 0.059
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.121 ± 0.06
HisCys: 0.218 ± 0.016
HisAsp: 1.922 ± 0.053
HisGlu: 1.482 ± 0.04
HisPhe: 0.499 ± 0.026
HisGly: 1.792 ± 0.06
HisHis: 0.557 ± 0.027
HisIle: 0.573 ± 0.031
HisLys: 0.264 ± 0.019
HisLeu: 1.832 ± 0.049
HisMet: 0.254 ± 0.019
HisAsn: 0.386 ± 0.021
HisPro: 1.348 ± 0.041
HisGln: 0.501 ± 0.024
HisArg: 1.564 ± 0.042
HisSer: 0.928 ± 0.037
HisThr: 1.111 ± 0.038
HisVal: 1.979 ± 0.053
HisTrp: 0.267 ± 0.019
HisTyr: 0.487 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.97 ± 0.101
IleCys: 0.282 ± 0.016
IleAsp: 5.782 ± 0.096
IleGlu: 4.906 ± 0.091
IlePhe: 0.874 ± 0.043
IleGly: 4.259 ± 0.085
IleHis: 0.853 ± 0.033
IleIle: 0.89 ± 0.038
IleLys: 0.625 ± 0.031
IleLeu: 2.649 ± 0.073
IleMet: 0.444 ± 0.023
IleAsn: 0.852 ± 0.032
IlePro: 2.007 ± 0.054
IleGln: 1.157 ± 0.038
IleArg: 2.596 ± 0.056
IleSer: 1.958 ± 0.057
IleThr: 2.54 ± 0.064
IleVal: 4.628 ± 0.088
IleTrp: 0.354 ± 0.025
IleTyr: 0.857 ± 0.035
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.353 ± 0.048
LysCys: 0.094 ± 0.011
LysAsp: 0.837 ± 0.04
LysGlu: 0.856 ± 0.038
LysPhe: 0.445 ± 0.024
LysGly: 0.976 ± 0.044
LysHis: 0.32 ± 0.019
LysIle: 0.707 ± 0.032
LysLys: 0.31 ± 0.028
LysLeu: 1.262 ± 0.048
LysMet: 0.277 ± 0.019
LysAsn: 0.402 ± 0.026
LysPro: 0.714 ± 0.03
LysGln: 0.521 ± 0.028
LysArg: 1.245 ± 0.043
LysSer: 0.942 ± 0.036
LysThr: 1.08 ± 0.041
LysVal: 0.876 ± 0.035
LysTrp: 0.191 ± 0.017
LysTyr: 0.445 ± 0.022
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.751 ± 0.177
LeuCys: 0.511 ± 0.025
LeuAsp: 9.284 ± 0.138
LeuGlu: 5.791 ± 0.115
LeuPhe: 2.759 ± 0.073
LeuGly: 7.119 ± 0.122
LeuHis: 1.317 ± 0.042
LeuIle: 3.268 ± 0.076
LeuLys: 1.243 ± 0.046
LeuLeu: 7.057 ± 0.141
LeuMet: 1.183 ± 0.039
LeuAsn: 1.536 ± 0.046
LeuPro: 3.907 ± 0.07
LeuGln: 2.064 ± 0.056
LeuArg: 5.02 ± 0.083
LeuSer: 5.041 ± 0.099
LeuThr: 5.081 ± 0.087
LeuVal: 8.067 ± 0.147
LeuTrp: 0.821 ± 0.033
LeuTyr: 2.031 ± 0.056
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.016 ± 0.052
MetCys: 0.111 ± 0.012
MetAsp: 1.321 ± 0.043
MetGlu: 0.953 ± 0.04
MetPhe: 0.429 ± 0.024
MetGly: 1.359 ± 0.041
MetHis: 0.357 ± 0.021
MetIle: 0.712 ± 0.03
MetLys: 0.309 ± 0.023
MetLeu: 1.26 ± 0.041
MetMet: 0.296 ± 0.019
MetAsn: 0.476 ± 0.021
MetPro: 0.799 ± 0.031
MetGln: 0.443 ± 0.024
MetArg: 0.929 ± 0.029
MetSer: 1.279 ± 0.042
MetThr: 1.433 ± 0.045
MetVal: 1.247 ± 0.038
MetTrp: 0.148 ± 0.014
MetTyr: 0.328 ± 0.02
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.356 ± 0.065
AsnCys: 0.166 ± 0.014
AsnAsp: 1.596 ± 0.052
AsnGlu: 1.388 ± 0.057
AsnPhe: 0.544 ± 0.03
AsnGly: 1.77 ± 0.075
AsnHis: 0.44 ± 0.025
AsnIle: 0.654 ± 0.031
AsnLys: 0.329 ± 0.022
AsnLeu: 1.805 ± 0.053
AsnMet: 0.376 ± 0.024
AsnAsn: 0.458 ± 0.032
AsnPro: 1.179 ± 0.042
AsnGln: 0.54 ± 0.032
AsnArg: 1.453 ± 0.046
AsnSer: 0.916 ± 0.041
AsnThr: 1.23 ± 0.055
AsnVal: 2.03 ± 0.053
AsnTrp: 0.306 ± 0.021
AsnTyr: 0.581 ± 0.03
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.59 ± 0.112
ProCys: 0.275 ± 0.021
ProAsp: 5.773 ± 0.097
ProGlu: 4.303 ± 0.083
ProPhe: 1.411 ± 0.044
ProGly: 4.155 ± 0.09
ProHis: 0.859 ± 0.035
ProIle: 2.669 ± 0.06
ProLys: 0.659 ± 0.027
ProLeu: 3.57 ± 0.066
ProMet: 0.795 ± 0.028
ProAsn: 1.01 ± 0.035
ProPro: 2.492 ± 0.067
ProGln: 0.971 ± 0.036
ProArg: 2.422 ± 0.059
ProSer: 2.708 ± 0.067
ProThr: 3.831 ± 0.076
ProVal: 4.385 ± 0.091
ProTrp: 0.599 ± 0.029
ProTyr: 1.148 ± 0.046
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.506 ± 0.059
GlnCys: 0.199 ± 0.016
GlnAsp: 1.601 ± 0.049
GlnGlu: 1.41 ± 0.051
GlnPhe: 1.185 ± 0.036
GlnGly: 1.533 ± 0.054
GlnHis: 0.52 ± 0.026
GlnIle: 1.42 ± 0.044
GlnLys: 0.491 ± 0.029
GlnLeu: 1.925 ± 0.058
GlnMet: 0.472 ± 0.026
GlnAsn: 0.669 ± 0.032
GlnPro: 1.125 ± 0.041
GlnGln: 0.857 ± 0.042
GlnArg: 2.041 ± 0.053
GlnSer: 1.792 ± 0.053
GlnThr: 1.779 ± 0.05
GlnVal: 1.905 ± 0.057
GlnTrp: 0.353 ± 0.024
GlnTyr: 0.772 ± 0.035
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.867 ± 0.125
ArgCys: 0.462 ± 0.027
ArgAsp: 5.542 ± 0.103
ArgGlu: 6.385 ± 0.106
ArgPhe: 2.151 ± 0.049
ArgGly: 4.328 ± 0.073
ArgHis: 1.272 ± 0.04
ArgIle: 3.536 ± 0.065
ArgLys: 1.029 ± 0.044
ArgLeu: 6.44 ± 0.091
ArgMet: 1.239 ± 0.042
ArgAsn: 1.288 ± 0.041
ArgPro: 3.355 ± 0.074
ArgGln: 1.87 ± 0.047
ArgArg: 5.325 ± 0.111
ArgSer: 4.528 ± 0.079
ArgThr: 4.307 ± 0.081
ArgVal: 6.013 ± 0.1
ArgTrp: 0.873 ± 0.039
ArgTyr: 1.805 ± 0.05
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.989 ± 0.096
SerCys: 0.309 ± 0.025
SerAsp: 4.839 ± 0.088
SerGlu: 4.337 ± 0.087
SerPhe: 1.591 ± 0.047
SerGly: 5.563 ± 0.11
SerHis: 0.952 ± 0.036
SerIle: 3.13 ± 0.068
SerLys: 0.928 ± 0.042
SerLeu: 4.381 ± 0.087
SerMet: 1.042 ± 0.037
SerAsn: 1.281 ± 0.053
SerPro: 2.583 ± 0.063
SerGln: 1.247 ± 0.041
SerArg: 3.044 ± 0.059
SerSer: 2.893 ± 0.076
SerThr: 3.813 ± 0.083
SerVal: 5.708 ± 0.096
SerTrp: 0.586 ± 0.028
SerTyr: 1.234 ± 0.045
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.697 ± 0.139
ThrCys: 0.366 ± 0.023
ThrAsp: 6.599 ± 0.129
ThrGlu: 4.178 ± 0.086
ThrPhe: 1.921 ± 0.057
ThrGly: 5.948 ± 0.11
ThrHis: 1.324 ± 0.043
ThrIle: 3.705 ± 0.063
ThrLys: 0.777 ± 0.034
ThrLeu: 6.142 ± 0.083
ThrMet: 0.976 ± 0.036
ThrAsn: 1.359 ± 0.047
ThrPro: 3.722 ± 0.075
ThrGln: 1.401 ± 0.044
ThrArg: 3.446 ± 0.066
ThrSer: 2.937 ± 0.075
ThrThr: 4.989 ± 0.131
ThrVal: 7.886 ± 0.141
ThrTrp: 0.707 ± 0.032
ThrTyr: 1.681 ± 0.055
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.056 ± 0.183
ValCys: 0.664 ± 0.029
ValAsp: 8.101 ± 0.116
ValGlu: 7.901 ± 0.121
ValPhe: 2.577 ± 0.067
ValGly: 8.146 ± 0.152
ValHis: 1.754 ± 0.046
ValIle: 3.557 ± 0.08
ValLys: 1.059 ± 0.04
ValLeu: 7.943 ± 0.128
ValMet: 1.311 ± 0.044
ValAsn: 1.811 ± 0.058
ValPro: 4.495 ± 0.086
ValGln: 1.927 ± 0.052
ValArg: 5.756 ± 0.096
ValSer: 4.948 ± 0.085
ValThr: 6.509 ± 0.118
ValVal: 9.016 ± 0.16
ValTrp: 0.885 ± 0.039
ValTyr: 1.982 ± 0.058
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.163 ± 0.043
TrpCys: 0.114 ± 0.012
TrpAsp: 0.795 ± 0.038
TrpGlu: 0.721 ± 0.036
TrpPhe: 0.376 ± 0.023
TrpGly: 0.853 ± 0.034
TrpHis: 0.225 ± 0.02
TrpIle: 0.611 ± 0.031
TrpLys: 0.208 ± 0.016
TrpLeu: 1.221 ± 0.047
TrpMet: 0.191 ± 0.014
TrpAsn: 0.316 ± 0.023
TrpPro: 0.619 ± 0.03
TrpGln: 0.421 ± 0.029
TrpArg: 0.887 ± 0.039
TrpSer: 0.661 ± 0.033
TrpThr: 1.025 ± 0.048
TrpVal: 0.9 ± 0.042
TrpTrp: 0.229 ± 0.018
TrpTyr: 0.343 ± 0.024
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.678 ± 0.069
TyrCys: 0.261 ± 0.018
TyrAsp: 2.511 ± 0.063
TyrGlu: 2.075 ± 0.058
TyrPhe: 0.685 ± 0.029
TyrGly: 2.016 ± 0.055
TyrHis: 0.613 ± 0.032
TyrIle: 0.628 ± 0.031
TyrLys: 0.318 ± 0.021
TyrLeu: 2.243 ± 0.062
TyrMet: 0.311 ± 0.018
TyrAsn: 0.538 ± 0.029
TyrPro: 1.121 ± 0.038
TyrGln: 0.649 ± 0.031
TyrArg: 2.012 ± 0.053
TyrSer: 1.148 ± 0.041
TyrThr: 1.34 ± 0.048
TyrVal: 2.082 ± 0.055
TyrTrp: 0.306 ± 0.022
TyrTyr: 0.775 ± 0.032
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2532 proteins (790179 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
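A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled proteome, and the spread of the 100 estimates is taken as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are all assumptions; the page states only that bootstrapping was run 100 times at the protein level.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes.

    Each round draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not data from this page.
toy = ["MAAL", "MAV", "MKAAG", "MLLA"]
err = bootstrap_error(toy, "AA")
print(f"AA: {pair_per_mille(toy, 'AA'):.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.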

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski