Amino acid dipeptide frequency for archaeon D22

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
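As an illustration of what such a calculation could look like, here is a minimal sketch in Python. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of one-letter codes (the table uses three-letter codes), the choice to count overlapping pairs only within a protein, and the choice of the total number of pairs as the denominator are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X.
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(clean, clean[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Two toy sequences (hypothetical, not from the D22 proteome).
freqs = dipeptide_per_mille(["MKKLE", "MKUA"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

With real data, the sequences would come from the proteome FASTA file, and the resulting values should sum to 1000‰ by construction.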

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if the 400 standard dipeptides occurred randomly and with equal probability in proteins, each of them would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and the underrepresented ones are marked in blue.
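The expected value and the over/under classification can be sketched as follows. The helper `classify` is hypothetical, and the exact rule the page uses for colouring (for example how ties are handled) is not stated in the text, so the strict comparison below is an assumption. The three example values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation: 1 / (20 * 20) = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD**2

def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"      # would be marked red
    if value_per_mille < expected:
        return "under"     # would be marked blue
    return "expected"

# Values in per mille, copied from the table.
examples = {"LysLys": 9.326, "TrpTrp": 0.061, "AsnPro": 2.5}
labels = {name: classify(value) for name, value in examples.items()}
print(EXPECTED_PER_MILLE, labels)
```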

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
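A small sketch of the unit conversion, using hypothetical helper names; the LysLys value is copied from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1

lyslys = 9.326  # LysLys frequency from the table, in per mille
fraction = per_mille_to_fraction(lyslys)
percent = per_mille_to_percent(lyslys)
print(f"fraction={fraction:.6f} percent={percent:.4f}")
```

So a table entry of 9.326‰ corresponds to a fraction of about 0.0093, or roughly 0.93%.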

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.683 ± 0.237 | 0.581 ± 0.055 | 3.129 ± 0.154 | 3.71 ± 0.165 | 2.676 ± 0.142 | 4.007 ± 0.221 | 1.02 ± 0.088 | 4.298 ± 0.195 | 4.791 ± 0.224 | 5.089 ± 0.275 | 1.196 ± 0.104 | 2.818 ± 0.162 | 1.466 ± 0.118 | 1.608 ± 0.099 | 2.129 ± 0.137 | 3.832 ± 0.177 | 2.926 ± 0.178 | 3.46 ± 0.175 | 0.5 ± 0.063 | 2.298 ± 0.136 | 0.0 ± 0.0
Cys | 0.601 ± 0.063 | 0.074 ± 0.023 | 0.838 ± 0.176 | 0.784 ± 0.098 | 0.439 ± 0.051 | 0.737 ± 0.082 | 0.223 ± 0.051 | 0.649 ± 0.067 | 0.73 ± 0.08 | 0.757 ± 0.074 | 0.243 ± 0.044 | 0.581 ± 0.062 | 0.412 ± 0.055 | 0.291 ± 0.04 | 0.284 ± 0.039 | 0.75 ± 0.086 | 0.514 ± 0.07 | 0.683 ± 0.067 | 0.101 ± 0.032 | 0.385 ± 0.068 | 0.0 ± 0.0
Asp | 3.507 ± 0.16 | 0.912 ± 0.177 | 3.94 ± 0.29 | 5.19 ± 0.245 | 3.352 ± 0.166 | 4.041 ± 0.302 | 0.845 ± 0.082 | 5.176 ± 0.213 | 4.899 ± 0.206 | 5.636 ± 0.181 | 1.541 ± 0.105 | 3.757 ± 0.21 | 2.081 ± 0.149 | 1.203 ± 0.089 | 1.696 ± 0.114 | 4.386 ± 0.198 | 2.967 ± 0.166 | 3.974 ± 0.174 | 0.473 ± 0.059 | 3.23 ± 0.185 | 0.0 ± 0.0
Glu | 4.095 ± 0.178 | 0.696 ± 0.082 | 4.839 ± 0.238 | 6.724 ± 0.343 | 3.419 ± 0.167 | 4.19 ± 0.159 | 1.21 ± 0.097 | 6.819 ± 0.231 | 7.812 ± 0.36 | 7.217 ± 0.248 | 1.716 ± 0.102 | 4.568 ± 0.175 | 1.656 ± 0.122 | 2.176 ± 0.16 | 2.406 ± 0.166 | 4.217 ± 0.175 | 3.602 ± 0.169 | 5.224 ± 0.192 | 0.574 ± 0.059 | 3.075 ± 0.147 | 0.0 ± 0.0
Phe | 2.798 ± 0.141 | 0.507 ± 0.067 | 3.669 ± 0.154 | 3.859 ± 0.173 | 2.663 ± 0.207 | 3.041 ± 0.138 | 0.818 ± 0.075 | 3.744 ± 0.202 | 3.23 ± 0.178 | 4.616 ± 0.251 | 1.095 ± 0.09 | 2.744 ± 0.181 | 1.284 ± 0.094 | 1.02 ± 0.078 | 1.622 ± 0.107 | 3.94 ± 0.188 | 2.622 ± 0.179 | 3.311 ± 0.182 | 0.439 ± 0.066 | 1.831 ± 0.113 | 0.0 ± 0.0
Gly | 3.764 ± 0.198 | 0.689 ± 0.083 | 3.669 ± 0.263 | 4.109 ± 0.179 | 3.44 ± 0.148 | 4.426 ± 0.284 | 1.23 ± 0.103 | 5.305 ± 0.219 | 4.663 ± 0.204 | 5.427 ± 0.241 | 1.5 ± 0.113 | 3.183 ± 0.271 | 1.453 ± 0.1 | 1.683 ± 0.112 | 2.284 ± 0.13 | 3.987 ± 0.26 | 3.575 ± 0.236 | 4.676 ± 0.208 | 0.487 ± 0.054 | 2.967 ± 0.176 | 0.0 ± 0.0
His | 1.0 ± 0.071 | 0.189 ± 0.037 | 1.054 ± 0.087 | 0.987 ± 0.084 | 1.088 ± 0.093 | 1.102 ± 0.09 | 0.311 ± 0.053 | 1.46 ± 0.102 | 1.412 ± 0.107 | 1.358 ± 0.117 | 0.311 ± 0.049 | 1.007 ± 0.089 | 0.703 ± 0.075 | 0.514 ± 0.064 | 0.574 ± 0.064 | 1.297 ± 0.098 | 0.865 ± 0.073 | 0.953 ± 0.087 | 0.101 ± 0.026 | 0.642 ± 0.063 | 0.0 ± 0.0
Ile | 4.426 ± 0.212 | 0.716 ± 0.072 | 5.345 ± 0.229 | 6.947 ± 0.245 | 4.068 ± 0.236 | 5.379 ± 0.228 | 1.331 ± 0.107 | 7.427 ± 0.33 | 6.9 ± 0.269 | 7.9 ± 0.34 | 1.554 ± 0.108 | 4.92 ± 0.274 | 3.115 ± 0.149 | 2.156 ± 0.131 | 3.088 ± 0.157 | 6.515 ± 0.287 | 4.44 ± 0.231 | 5.427 ± 0.177 | 0.46 ± 0.057 | 2.811 ± 0.126 | 0.0 ± 0.0
Lys | 4.17 ± 0.202 | 0.514 ± 0.065 | 5.068 ± 0.21 | 7.954 ± 0.404 | 3.264 ± 0.167 | 4.271 ± 0.221 | 1.379 ± 0.106 | 8.366 ± 0.343 | 9.326 ± 0.487 | 6.812 ± 0.294 | 2.149 ± 0.13 | 5.474 ± 0.209 | 2.291 ± 0.142 | 2.237 ± 0.137 | 3.196 ± 0.171 | 4.812 ± 0.218 | 4.785 ± 0.193 | 5.122 ± 0.226 | 0.581 ± 0.056 | 3.271 ± 0.188 | 0.0 ± 0.0
Leu | 5.224 ± 0.228 | 0.797 ± 0.075 | 5.494 ± 0.203 | 6.535 ± 0.244 | 4.318 ± 0.259 | 5.616 ± 0.22 | 1.243 ± 0.088 | 7.265 ± 0.33 | 8.238 ± 0.367 | 7.657 ± 0.341 | 1.96 ± 0.156 | 5.339 ± 0.221 | 2.879 ± 0.152 | 2.061 ± 0.129 | 3.419 ± 0.162 | 6.92 ± 0.222 | 5.163 ± 0.223 | 5.555 ± 0.216 | 0.534 ± 0.074 | 2.859 ± 0.151 | 0.0 ± 0.0
Met | 1.331 ± 0.109 | 0.216 ± 0.041 | 1.216 ± 0.11 | 1.588 ± 0.111 | 0.98 ± 0.081 | 1.385 ± 0.102 | 0.446 ± 0.054 | 2.156 ± 0.135 | 1.96 ± 0.139 | 1.919 ± 0.115 | 0.487 ± 0.059 | 1.237 ± 0.088 | 0.73 ± 0.077 | 0.608 ± 0.072 | 0.885 ± 0.074 | 1.541 ± 0.122 | 1.264 ± 0.095 | 1.419 ± 0.094 | 0.203 ± 0.043 | 0.669 ± 0.084 | 0.0 ± 0.0
Asn | 2.798 ± 0.15 | 0.574 ± 0.075 | 3.575 ± 0.225 | 4.061 ± 0.197 | 2.75 ± 0.176 | 3.528 ± 0.237 | 0.892 ± 0.074 | 5.271 ± 0.367 | 4.217 ± 0.186 | 5.616 ± 0.242 | 1.358 ± 0.103 | 3.71 ± 0.234 | 2.5 ± 0.116 | 1.838 ± 0.127 | 1.649 ± 0.099 | 4.359 ± 0.241 | 2.561 ± 0.193 | 3.98 ± 0.311 | 0.426 ± 0.055 | 2.656 ± 0.182 | 0.0 ± 0.0
Pro | 1.629 ± 0.113 | 0.25 ± 0.037 | 1.919 ± 0.138 | 2.548 ± 0.151 | 1.642 ± 0.106 | 2.156 ± 0.15 | 0.615 ± 0.066 | 2.487 ± 0.14 | 2.446 ± 0.149 | 2.784 ± 0.141 | 0.656 ± 0.073 | 1.608 ± 0.114 | 0.818 ± 0.074 | 0.784 ± 0.069 | 1.074 ± 0.101 | 2.372 ± 0.136 | 1.608 ± 0.112 | 2.223 ± 0.139 | 0.297 ± 0.053 | 1.162 ± 0.11 | 0.0 ± 0.0
Gln | 1.5 ± 0.116 | 0.122 ± 0.027 | 1.507 ± 0.094 | 1.912 ± 0.138 | 1.142 ± 0.104 | 1.487 ± 0.106 | 0.405 ± 0.054 | 2.696 ± 0.153 | 2.642 ± 0.152 | 1.994 ± 0.107 | 0.723 ± 0.076 | 1.771 ± 0.123 | 0.662 ± 0.068 | 0.737 ± 0.091 | 1.054 ± 0.082 | 1.669 ± 0.11 | 1.304 ± 0.094 | 1.906 ± 0.129 | 0.142 ± 0.03 | 0.906 ± 0.073 | 0.0 ± 0.0
Arg | 2.041 ± 0.133 | 0.372 ± 0.057 | 2.122 ± 0.139 | 2.494 ± 0.149 | 1.595 ± 0.133 | 1.879 ± 0.127 | 0.845 ± 0.092 | 3.244 ± 0.164 | 2.838 ± 0.185 | 3.311 ± 0.177 | 0.831 ± 0.092 | 1.946 ± 0.121 | 1.054 ± 0.097 | 0.926 ± 0.082 | 1.325 ± 0.139 | 2.068 ± 0.152 | 1.723 ± 0.117 | 2.149 ± 0.131 | 0.257 ± 0.047 | 1.406 ± 0.108 | 0.0 ± 0.0
Ser | 3.825 ± 0.169 | 0.797 ± 0.098 | 4.507 ± 0.229 | 5.014 ± 0.194 | 3.832 ± 0.174 | 4.676 ± 0.299 | 1.23 ± 0.079 | 5.764 ± 0.259 | 5.879 ± 0.244 | 6.041 ± 0.264 | 1.48 ± 0.11 | 4.244 ± 0.251 | 2.149 ± 0.117 | 1.926 ± 0.119 | 2.298 ± 0.132 | 5.609 ± 0.342 | 3.365 ± 0.198 | 4.42 ± 0.273 | 0.628 ± 0.085 | 3.048 ± 0.149 | 0.0 ± 0.0
Thr | 2.832 ± 0.171 | 0.649 ± 0.088 | 3.156 ± 0.174 | 3.244 ± 0.171 | 2.467 ± 0.152 | 3.588 ± 0.225 | 0.845 ± 0.065 | 4.785 ± 0.245 | 4.055 ± 0.189 | 4.933 ± 0.25 | 0.892 ± 0.09 | 3.149 ± 0.252 | 1.933 ± 0.114 | 1.46 ± 0.087 | 1.46 ± 0.098 | 3.663 ± 0.249 | 2.994 ± 0.24 | 3.663 ± 0.212 | 0.52 ± 0.07 | 2.115 ± 0.15 | 0.0 ± 0.0
Val | 3.609 ± 0.171 | 0.791 ± 0.083 | 4.426 ± 0.233 | 5.089 ± 0.196 | 2.865 ± 0.139 | 4.068 ± 0.224 | 1.095 ± 0.082 | 4.981 ± 0.208 | 5.481 ± 0.231 | 5.69 ± 0.223 | 1.493 ± 0.108 | 3.325 ± 0.243 | 2.399 ± 0.129 | 1.649 ± 0.093 | 2.311 ± 0.142 | 5.427 ± 0.208 | 3.737 ± 0.197 | 4.447 ± 0.206 | 0.446 ± 0.056 | 2.122 ± 0.131 | 0.0 ± 0.0
Trp | 0.419 ± 0.06 | 0.108 ± 0.028 | 0.487 ± 0.059 | 0.514 ± 0.067 | 0.324 ± 0.042 | 0.527 ± 0.054 | 0.149 ± 0.032 | 0.656 ± 0.072 | 0.52 ± 0.06 | 0.649 ± 0.073 | 0.209 ± 0.04 | 0.399 ± 0.062 | 0.203 ± 0.04 | 0.223 ± 0.039 | 0.311 ± 0.042 | 0.581 ± 0.069 | 0.507 ± 0.072 | 0.473 ± 0.054 | 0.061 ± 0.023 | 0.304 ± 0.055 | 0.0 ± 0.0
Tyr | 1.967 ± 0.125 | 0.52 ± 0.082 | 2.845 ± 0.157 | 2.973 ± 0.205 | 2.527 ± 0.139 | 2.413 ± 0.128 | 0.797 ± 0.073 | 2.379 ± 0.146 | 3.0 ± 0.152 | 3.669 ± 0.162 | 0.838 ± 0.077 | 2.541 ± 0.188 | 1.291 ± 0.098 | 1.183 ± 0.094 | 1.358 ± 0.103 | 2.791 ± 0.162 | 1.879 ± 0.17 | 2.392 ± 0.129 | 0.392 ± 0.051 | 2.021 ± 0.164 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics are based on 474 proteins (147978 amino acids).

Note: The error has been estimated by bootstrapping (100 resamplings) at the protein level.
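Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. This is only an illustration under assumptions. The page does not describe the exact procedure, so the function names, the fixed random seed, and the use of the standard deviation as the reported error are choices made for this example, and the sequences are toy data.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std) of estimates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy proteins (hypothetical, not from the D22 proteome).
proteins = ["MKKLEKKA", "MAGLLV", "MKKI", "MSTTPEEL"]
mean, err = bootstrap_error(proteins, "KK")
print(f"KK: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps the residues of one protein together, so the error reflects variation between proteins.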

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski