Amino acid dipepetide frequency for candidate division MSBL1 archaeon SCGC-AAA382C18

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.44AlaAla: 3.44 ± 0.205
0.621AlaCys: 0.621 ± 0.068
3.122AlaAsp: 3.122 ± 0.154
5.236AlaGlu: 5.236 ± 0.207
2.251AlaPhe: 2.251 ± 0.142
4.403AlaGly: 4.403 ± 0.234
0.818AlaHis: 0.818 ± 0.069
4.365AlaIle: 4.365 ± 0.208
3.903AlaLys: 3.903 ± 0.176
5.964AlaLeu: 5.964 ± 0.24
1.508AlaMet: 1.508 ± 0.124
1.766AlaAsn: 1.766 ± 0.117
1.766AlaPro: 1.766 ± 0.138
1.031AlaGln: 1.031 ± 0.089
3.001AlaArg: 3.001 ± 0.181
3.6AlaSer: 3.6 ± 0.165
2.425AlaThr: 2.425 ± 0.123
3.978AlaVal: 3.978 ± 0.189
0.629AlaTrp: 0.629 ± 0.073
1.319AlaTyr: 1.319 ± 0.097
0.0AlaXaa: 0.0 ± 0.0
Cys
0.394CysAla: 0.394 ± 0.054
0.114CysCys: 0.114 ± 0.03
0.515CysAsp: 0.515 ± 0.061
0.818CysGlu: 0.818 ± 0.071
0.409CysPhe: 0.409 ± 0.057
1.205CysGly: 1.205 ± 0.106
0.182CysHis: 0.182 ± 0.036
0.402CysIle: 0.402 ± 0.052
0.591CysLys: 0.591 ± 0.072
0.652CysLeu: 0.652 ± 0.07
0.205CysMet: 0.205 ± 0.039
0.28CysAsn: 0.28 ± 0.048
0.682CysPro: 0.682 ± 0.072
0.242CysGln: 0.242 ± 0.046
0.447CysArg: 0.447 ± 0.057
0.697CysSer: 0.697 ± 0.073
0.462CysThr: 0.462 ± 0.064
0.432CysVal: 0.432 ± 0.059
0.136CysTrp: 0.136 ± 0.029
0.205CysTyr: 0.205 ± 0.046
0.0CysXaa: 0.0 ± 0.0
Asp
2.88AspAla: 2.88 ± 0.172
0.576AspCys: 0.576 ± 0.068
3.16AspAsp: 3.16 ± 0.214
6.002AspGlu: 6.002 ± 0.278
3.001AspPhe: 3.001 ± 0.15
3.698AspGly: 3.698 ± 0.185
1.068AspHis: 1.068 ± 0.099
5.289AspIle: 5.289 ± 0.197
4.35AspLys: 4.35 ± 0.192
6.381AspLeu: 6.381 ± 0.279
1.508AspMet: 1.508 ± 0.099
2.016AspAsn: 2.016 ± 0.157
2.463AspPro: 2.463 ± 0.145
1.22AspGln: 1.22 ± 0.099
3.334AspArg: 3.334 ± 0.164
3.365AspSer: 3.365 ± 0.18
2.379AspThr: 2.379 ± 0.143
4.638AspVal: 4.638 ± 0.224
0.902AspTrp: 0.902 ± 0.08
2.577AspTyr: 2.577 ± 0.146
0.0AspXaa: 0.0 ± 0.0
Glu
5.615GluAla: 5.615 ± 0.232
0.697GluCys: 0.697 ± 0.072
6.381GluAsp: 6.381 ± 0.308
11.897GluGlu: 11.897 ± 0.517
3.221GluPhe: 3.221 ± 0.15
5.82GluGly: 5.82 ± 0.197
1.228GluHis: 1.228 ± 0.107
8.412GluIle: 8.412 ± 0.265
11.731GluLys: 11.731 ± 0.402
8.108GluLeu: 8.108 ± 0.286
2.486GluMet: 2.486 ± 0.151
5.585GluAsn: 5.585 ± 0.244
2.69GluPro: 2.69 ± 0.164
1.705GluGln: 1.705 ± 0.119
5.403GluArg: 5.403 ± 0.229
5.001GluSer: 5.001 ± 0.176
4.418GluThr: 4.418 ± 0.184
6.373GluVal: 6.373 ± 0.256
1.114GluTrp: 1.114 ± 0.118
2.66GluTyr: 2.66 ± 0.137
0.0GluXaa: 0.0 ± 0.0
Phe
2.311PheAla: 2.311 ± 0.139
0.424PheCys: 0.424 ± 0.054
2.652PheAsp: 2.652 ± 0.15
3.645PheGlu: 3.645 ± 0.168
1.766PhePhe: 1.766 ± 0.138
2.864PheGly: 2.864 ± 0.149
0.75PheHis: 0.75 ± 0.075
2.326PheIle: 2.326 ± 0.15
2.273PheLys: 2.273 ± 0.164
4.774PheLeu: 4.774 ± 0.244
0.841PheMet: 0.841 ± 0.073
1.266PheAsn: 1.266 ± 0.103
1.766PhePro: 1.766 ± 0.122
1.122PheGln: 1.122 ± 0.095
1.788PheArg: 1.788 ± 0.112
3.547PheSer: 3.547 ± 0.207
1.872PheThr: 1.872 ± 0.129
2.637PheVal: 2.637 ± 0.163
0.568PheTrp: 0.568 ± 0.064
1.561PheTyr: 1.561 ± 0.125
0.0PheXaa: 0.0 ± 0.0
Gly
3.857GlyAla: 3.857 ± 0.207
0.697GlyCys: 0.697 ± 0.083
3.819GlyAsp: 3.819 ± 0.161
6.563GlyGlu: 6.563 ± 0.232
3.175GlyPhe: 3.175 ± 0.165
5.373GlyGly: 5.373 ± 0.288
1.235GlyHis: 1.235 ± 0.095
5.865GlyIle: 5.865 ± 0.264
6.009GlyLys: 6.009 ± 0.199
6.282GlyLeu: 6.282 ± 0.264
1.826GlyMet: 1.826 ± 0.124
2.387GlyAsn: 2.387 ± 0.144
2.046GlyPro: 2.046 ± 0.143
1.402GlyGln: 1.402 ± 0.097
3.569GlyArg: 3.569 ± 0.162
4.501GlySer: 4.501 ± 0.223
3.706GlyThr: 3.706 ± 0.175
5.252GlyVal: 5.252 ± 0.249
0.871GlyTrp: 0.871 ± 0.073
2.433GlyTyr: 2.433 ± 0.167
0.0GlyXaa: 0.0 ± 0.0
His
0.932HisAla: 0.932 ± 0.077
0.152HisCys: 0.152 ± 0.038
0.94HisAsp: 0.94 ± 0.088
1.432HisGlu: 1.432 ± 0.122
0.72HisPhe: 0.72 ± 0.084
1.258HisGly: 1.258 ± 0.094
0.447HisHis: 0.447 ± 0.059
1.015HisIle: 1.015 ± 0.086
0.841HisLys: 0.841 ± 0.074
1.72HisLeu: 1.72 ± 0.125
0.288HisMet: 0.288 ± 0.045
0.697HisAsn: 0.697 ± 0.071
1.144HisPro: 1.144 ± 0.103
0.538HisGln: 0.538 ± 0.072
0.962HisArg: 0.962 ± 0.083
1.0HisSer: 1.0 ± 0.083
0.849HisThr: 0.849 ± 0.086
1.0HisVal: 1.0 ± 0.072
0.235HisTrp: 0.235 ± 0.042
0.546HisTyr: 0.546 ± 0.066
0.0HisXaa: 0.0 ± 0.0
Ile
4.6IleAla: 4.6 ± 0.21
0.909IleCys: 0.909 ± 0.087
4.888IleAsp: 4.888 ± 0.191
7.426IleGlu: 7.426 ± 0.284
3.13IlePhe: 3.13 ± 0.169
5.729IleGly: 5.729 ± 0.23
1.303IleHis: 1.303 ± 0.098
5.32IleIle: 5.32 ± 0.239
4.615IleLys: 4.615 ± 0.179
7.373IleLeu: 7.373 ± 0.329
1.508IleMet: 1.508 ± 0.116
2.713IleAsn: 2.713 ± 0.142
3.493IlePro: 3.493 ± 0.166
1.864IleGln: 1.864 ± 0.12
3.471IleArg: 3.471 ± 0.153
5.676IleSer: 5.676 ± 0.23
3.539IleThr: 3.539 ± 0.199
4.653IleVal: 4.653 ± 0.218
0.659IleTrp: 0.659 ± 0.077
2.425IleTyr: 2.425 ± 0.15
0.0IleXaa: 0.0 ± 0.0
Lys
4.259LysAla: 4.259 ± 0.188
0.758LysCys: 0.758 ± 0.087
4.229LysAsp: 4.229 ± 0.17
8.737LysGlu: 8.737 ± 0.356
2.721LysPhe: 2.721 ± 0.145
4.41LysGly: 4.41 ± 0.188
1.281LysHis: 1.281 ± 0.101
6.904LysIle: 6.904 ± 0.253
7.684LysLys: 7.684 ± 0.356
6.35LysLeu: 6.35 ± 0.228
1.728LysMet: 1.728 ± 0.113
4.554LysAsn: 4.554 ± 0.195
2.395LysPro: 2.395 ± 0.127
1.697LysGln: 1.697 ± 0.134
3.994LysArg: 3.994 ± 0.186
5.092LysSer: 5.092 ± 0.22
4.031LysThr: 4.031 ± 0.195
4.873LysVal: 4.873 ± 0.217
0.773LysTrp: 0.773 ± 0.089
2.63LysTyr: 2.63 ± 0.165
0.0LysXaa: 0.0 ± 0.0
Leu
6.047LeuAla: 6.047 ± 0.25
0.644LeuCys: 0.644 ± 0.062
6.426LeuAsp: 6.426 ± 0.233
9.7LeuGlu: 9.7 ± 0.411
3.819LeuPhe: 3.819 ± 0.209
6.911LeuGly: 6.911 ± 0.257
1.41LeuHis: 1.41 ± 0.095
6.04LeuIle: 6.04 ± 0.306
7.199LeuLys: 7.199 ± 0.261
7.76LeuLeu: 7.76 ± 0.282
1.773LeuMet: 1.773 ± 0.116
3.6LeuAsn: 3.6 ± 0.161
3.562LeuPro: 3.562 ± 0.162
2.281LeuGln: 2.281 ± 0.121
4.744LeuArg: 4.744 ± 0.203
7.237LeuSer: 7.237 ± 0.232
4.471LeuThr: 4.471 ± 0.18
5.896LeuVal: 5.896 ± 0.261
0.864LeuTrp: 0.864 ± 0.073
2.379LeuTyr: 2.379 ± 0.147
0.0LeuXaa: 0.0 ± 0.0
Met
1.334MetAla: 1.334 ± 0.094
0.144MetCys: 0.144 ± 0.037
1.516MetAsp: 1.516 ± 0.124
2.182MetGlu: 2.182 ± 0.13
0.765MetPhe: 0.765 ± 0.077
1.91MetGly: 1.91 ± 0.121
0.242MetHis: 0.242 ± 0.046
1.417MetIle: 1.417 ± 0.107
1.97MetLys: 1.97 ± 0.112
1.667MetLeu: 1.667 ± 0.113
0.508MetMet: 0.508 ± 0.067
1.137MetAsn: 1.137 ± 0.098
0.72MetPro: 0.72 ± 0.075
0.462MetGln: 0.462 ± 0.061
1.137MetArg: 1.137 ± 0.078
1.66MetSer: 1.66 ± 0.104
1.273MetThr: 1.273 ± 0.107
1.622MetVal: 1.622 ± 0.115
0.205MetTrp: 0.205 ± 0.047
0.47MetTyr: 0.47 ± 0.065
0.0MetXaa: 0.0 ± 0.0
Asn
1.864AsnAla: 1.864 ± 0.121
0.485AsnCys: 0.485 ± 0.068
1.728AsnAsp: 1.728 ± 0.101
3.024AsnGlu: 3.024 ± 0.17
2.031AsnPhe: 2.031 ± 0.145
2.811AsnGly: 2.811 ± 0.147
0.902AsnHis: 0.902 ± 0.08
3.38AsnIle: 3.38 ± 0.182
2.448AsnLys: 2.448 ± 0.167
4.85AsnLeu: 4.85 ± 0.205
0.978AsnMet: 0.978 ± 0.079
1.516AsnAsn: 1.516 ± 0.107
2.743AsnPro: 2.743 ± 0.125
1.296AsnGln: 1.296 ± 0.102
2.546AsnArg: 2.546 ± 0.154
2.592AsnSer: 2.592 ± 0.126
1.667AsnThr: 1.667 ± 0.095
2.395AsnVal: 2.395 ± 0.139
0.546AsnTrp: 0.546 ± 0.073
1.553AsnTyr: 1.553 ± 0.112
0.0AsnXaa: 0.0 ± 0.0
Pro
1.932ProAla: 1.932 ± 0.165
0.265ProCys: 0.265 ± 0.045
2.811ProAsp: 2.811 ± 0.158
4.441ProGlu: 4.441 ± 0.167
1.644ProPhe: 1.644 ± 0.111
2.607ProGly: 2.607 ± 0.171
0.803ProHis: 0.803 ± 0.071
2.872ProIle: 2.872 ± 0.167
2.645ProLys: 2.645 ± 0.146
3.205ProLeu: 3.205 ± 0.164
0.818ProMet: 0.818 ± 0.075
1.637ProAsn: 1.637 ± 0.114
1.811ProPro: 1.811 ± 0.14
0.925ProGln: 0.925 ± 0.077
1.667ProArg: 1.667 ± 0.123
2.774ProSer: 2.774 ± 0.144
2.016ProThr: 2.016 ± 0.145
2.592ProVal: 2.592 ± 0.141
0.364ProTrp: 0.364 ± 0.053
1.273ProTyr: 1.273 ± 0.11
0.0ProXaa: 0.0 ± 0.0
Gln
1.387GlnAla: 1.387 ± 0.103
0.121GlnCys: 0.121 ± 0.037
1.432GlnAsp: 1.432 ± 0.11
2.425GlnGlu: 2.425 ± 0.147
0.871GlnPhe: 0.871 ± 0.084
1.341GlnGly: 1.341 ± 0.101
0.326GlnHis: 0.326 ± 0.05
1.804GlnIle: 1.804 ± 0.113
2.342GlnLys: 2.342 ± 0.148
1.857GlnLeu: 1.857 ± 0.139
0.637GlnMet: 0.637 ± 0.068
1.228GlnAsn: 1.228 ± 0.088
0.75GlnPro: 0.75 ± 0.072
0.606GlnGln: 0.606 ± 0.071
1.326GlnArg: 1.326 ± 0.124
1.175GlnSer: 1.175 ± 0.108
1.053GlnThr: 1.053 ± 0.095
1.417GlnVal: 1.417 ± 0.099
0.159GlnTrp: 0.159 ± 0.034
0.765GlnTyr: 0.765 ± 0.085
0.0GlnXaa: 0.0 ± 0.0
Arg
2.584ArgAla: 2.584 ± 0.158
0.386ArgCys: 0.386 ± 0.05
3.145ArgAsp: 3.145 ± 0.151
6.381ArgGlu: 6.381 ± 0.257
1.97ArgPhe: 1.97 ± 0.132
3.493ArgGly: 3.493 ± 0.162
0.735ArgHis: 0.735 ± 0.068
3.888ArgIle: 3.888 ± 0.157
5.282ArgLys: 5.282 ± 0.227
4.463ArgLeu: 4.463 ± 0.182
1.22ArgMet: 1.22 ± 0.092
2.577ArgAsn: 2.577 ± 0.141
1.599ArgPro: 1.599 ± 0.126
1.008ArgGln: 1.008 ± 0.102
2.918ArgArg: 2.918 ± 0.183
2.622ArgSer: 2.622 ± 0.145
2.236ArgThr: 2.236 ± 0.128
3.046ArgVal: 3.046 ± 0.174
0.546ArgTrp: 0.546 ± 0.066
1.629ArgTyr: 1.629 ± 0.104
0.0ArgXaa: 0.0 ± 0.0
Ser
3.13SerAla: 3.13 ± 0.174
0.546SerCys: 0.546 ± 0.059
4.312SerAsp: 4.312 ± 0.191
6.229SerGlu: 6.229 ± 0.211
2.88SerPhe: 2.88 ± 0.176
4.926SerGly: 4.926 ± 0.198
1.175SerHis: 1.175 ± 0.095
4.911SerIle: 4.911 ± 0.228
4.835SerLys: 4.835 ± 0.193
6.123SerLeu: 6.123 ± 0.26
1.516SerMet: 1.516 ± 0.114
2.561SerAsn: 2.561 ± 0.149
2.971SerPro: 2.971 ± 0.142
1.819SerGln: 1.819 ± 0.115
3.243SerArg: 3.243 ± 0.189
4.895SerSer: 4.895 ± 0.221
3.046SerThr: 3.046 ± 0.142
4.335SerVal: 4.335 ± 0.188
0.682SerTrp: 0.682 ± 0.073
2.031SerTyr: 2.031 ± 0.147
0.0SerXaa: 0.0 ± 0.0
Thr
2.834ThrAla: 2.834 ± 0.15
0.432ThrCys: 0.432 ± 0.067
2.925ThrAsp: 2.925 ± 0.137
4.107ThrGlu: 4.107 ± 0.195
1.932ThrPhe: 1.932 ± 0.113
3.63ThrGly: 3.63 ± 0.165
0.955ThrHis: 0.955 ± 0.079
3.456ThrIle: 3.456 ± 0.172
2.857ThrLys: 2.857 ± 0.168
4.388ThrLeu: 4.388 ± 0.188
0.993ThrMet: 0.993 ± 0.083
1.675ThrAsn: 1.675 ± 0.119
2.008ThrPro: 2.008 ± 0.117
1.266ThrGln: 1.266 ± 0.092
2.296ThrArg: 2.296 ± 0.133
3.137ThrSer: 3.137 ± 0.151
2.622ThrThr: 2.622 ± 0.158
3.592ThrVal: 3.592 ± 0.157
0.477ThrTrp: 0.477 ± 0.053
1.629ThrTyr: 1.629 ± 0.128
0.0ThrXaa: 0.0 ± 0.0
Val
3.774ValAla: 3.774 ± 0.192
0.637ValCys: 0.637 ± 0.073
4.289ValAsp: 4.289 ± 0.216
6.525ValGlu: 6.525 ± 0.249
2.713ValPhe: 2.713 ± 0.165
5.145ValGly: 5.145 ± 0.228
1.061ValHis: 1.061 ± 0.102
4.6ValIle: 4.6 ± 0.211
4.638ValLys: 4.638 ± 0.191
6.244ValLeu: 6.244 ± 0.224
1.159ValMet: 1.159 ± 0.098
2.486ValAsn: 2.486 ± 0.128
2.834ValPro: 2.834 ± 0.144
1.41ValGln: 1.41 ± 0.105
3.304ValArg: 3.304 ± 0.17
4.714ValSer: 4.714 ± 0.187
3.152ValThr: 3.152 ± 0.151
4.418ValVal: 4.418 ± 0.19
0.546ValTrp: 0.546 ± 0.059
1.955ValTyr: 1.955 ± 0.138
0.0ValXaa: 0.0 ± 0.0
Trp
0.485TrpAla: 0.485 ± 0.055
0.068TrpCys: 0.068 ± 0.021
0.652TrpAsp: 0.652 ± 0.079
0.925TrpGlu: 0.925 ± 0.081
0.371TrpPhe: 0.371 ± 0.05
0.697TrpGly: 0.697 ± 0.082
0.227TrpHis: 0.227 ± 0.038
1.023TrpIle: 1.023 ± 0.09
1.068TrpLys: 1.068 ± 0.084
1.061TrpLeu: 1.061 ± 0.089
0.121TrpMet: 0.121 ± 0.031
0.523TrpAsn: 0.523 ± 0.069
0.296TrpPro: 0.296 ± 0.05
0.258TrpGln: 0.258 ± 0.047
0.75TrpArg: 0.75 ± 0.082
0.69TrpSer: 0.69 ± 0.08
0.5TrpThr: 0.5 ± 0.064
0.652TrpVal: 0.652 ± 0.074
0.129TrpTrp: 0.129 ± 0.033
0.402TrpTyr: 0.402 ± 0.068
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.485TyrAla: 1.485 ± 0.102
0.394TyrCys: 0.394 ± 0.054
1.864TyrAsp: 1.864 ± 0.139
2.849TyrGlu: 2.849 ± 0.176
1.364TyrPhe: 1.364 ± 0.1
2.645TyrGly: 2.645 ± 0.162
0.568TyrHis: 0.568 ± 0.077
2.008TyrIle: 2.008 ± 0.14
1.819TyrLys: 1.819 ± 0.115
3.539TyrLeu: 3.539 ± 0.2
0.712TyrMet: 0.712 ± 0.072
1.152TyrAsn: 1.152 ± 0.091
1.364TyrPro: 1.364 ± 0.103
0.871TyrGln: 0.871 ± 0.086
1.826TyrArg: 1.826 ± 0.128
2.19TyrSer: 2.19 ± 0.147
1.493TyrThr: 1.493 ± 0.115
1.804TyrVal: 1.804 ± 0.121
0.493TyrTrp: 0.493 ± 0.063
1.076TyrTyr: 1.076 ± 0.098
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 551 proteins (131962 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski