Amino acid dipepetide frequency for Candidatus Marinamargulisbacteria bacterium SCGC AG-333-B06

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.558AlaAla: 3.558 ± 0.177
0.909AlaCys: 0.909 ± 0.076
3.14AlaAsp: 3.14 ± 0.16
2.53AlaGlu: 2.53 ± 0.159
2.649AlaPhe: 2.649 ± 0.139
3.649AlaGly: 3.649 ± 0.195
1.203AlaHis: 1.203 ± 0.099
5.495AlaIle: 5.495 ± 0.188
4.162AlaLys: 4.162 ± 0.165
5.789AlaLeu: 5.789 ± 0.21
1.355AlaMet: 1.355 ± 0.091
2.751AlaAsn: 2.751 ± 0.133
1.44AlaPro: 1.44 ± 0.093
1.706AlaGln: 1.706 ± 0.107
1.756AlaArg: 1.756 ± 0.107
3.999AlaSer: 3.999 ± 0.159
3.377AlaThr: 3.377 ± 0.162
3.276AlaVal: 3.276 ± 0.151
0.457AlaTrp: 0.457 ± 0.056
2.31AlaTyr: 2.31 ± 0.1
0.0AlaXaa: 0.0 ± 0.0
Cys
0.689CysAla: 0.689 ± 0.063
0.288CysCys: 0.288 ± 0.04
0.808CysAsp: 0.808 ± 0.075
0.503CysGlu: 0.503 ± 0.052
0.926CysPhe: 0.926 ± 0.086
0.954CysGly: 0.954 ± 0.078
0.395CysHis: 0.395 ± 0.052
1.175CysIle: 1.175 ± 0.084
0.678CysLys: 0.678 ± 0.057
1.548CysLeu: 1.548 ± 0.118
0.282CysMet: 0.282 ± 0.037
0.627CysAsn: 0.627 ± 0.065
0.401CysPro: 0.401 ± 0.044
0.531CysGln: 0.531 ± 0.052
0.39CysArg: 0.39 ± 0.047
0.921CysSer: 0.921 ± 0.083
0.491CysThr: 0.491 ± 0.05
0.757CysVal: 0.757 ± 0.069
0.09CysTrp: 0.09 ± 0.023
0.508CysTyr: 0.508 ± 0.051
0.0CysXaa: 0.0 ± 0.0
Asp
2.897AspAla: 2.897 ± 0.157
0.779AspCys: 0.779 ± 0.063
3.264AspAsp: 3.264 ± 0.146
2.897AspGlu: 2.897 ± 0.142
2.632AspPhe: 2.632 ± 0.13
2.496AspGly: 2.496 ± 0.148
1.485AspHis: 1.485 ± 0.09
5.394AspIle: 5.394 ± 0.175
3.767AspLys: 3.767 ± 0.151
5.58AspLeu: 5.58 ± 0.241
1.186AspMet: 1.186 ± 0.085
3.027AspAsn: 3.027 ± 0.151
2.016AspPro: 2.016 ± 0.115
2.186AspGln: 2.186 ± 0.129
1.858AspArg: 1.858 ± 0.116
4.191AspSer: 4.191 ± 0.174
3.298AspThr: 3.298 ± 0.152
3.434AspVal: 3.434 ± 0.144
0.531AspTrp: 0.531 ± 0.059
3.089AspTyr: 3.089 ± 0.157
0.0AspXaa: 0.0 ± 0.0
Glu
3.191GluAla: 3.191 ± 0.119
0.65GluCys: 0.65 ± 0.064
2.762GluAsp: 2.762 ± 0.148
3.191GluGlu: 3.191 ± 0.164
2.412GluPhe: 2.412 ± 0.114
2.7GluGly: 2.7 ± 0.139
1.124GluHis: 1.124 ± 0.08
4.355GluIle: 4.355 ± 0.149
5.275GluLys: 5.275 ± 0.237
5.529GluLeu: 5.529 ± 0.165
1.051GluMet: 1.051 ± 0.081
2.959GluAsn: 2.959 ± 0.138
1.468GluPro: 1.468 ± 0.09
1.886GluGln: 1.886 ± 0.12
1.853GluArg: 1.853 ± 0.115
4.292GluSer: 4.292 ± 0.159
3.479GluThr: 3.479 ± 0.154
2.739GluVal: 2.739 ± 0.165
0.361GluTrp: 0.361 ± 0.042
1.898GluTyr: 1.898 ± 0.12
0.0GluXaa: 0.0 ± 0.0
Phe
2.118PheAla: 2.118 ± 0.125
0.768PheCys: 0.768 ± 0.075
2.92PheAsp: 2.92 ± 0.158
2.587PheGlu: 2.587 ± 0.118
3.146PhePhe: 3.146 ± 0.186
2.897PheGly: 2.897 ± 0.132
1.147PheHis: 1.147 ± 0.082
4.479PheIle: 4.479 ± 0.211
3.852PheLys: 3.852 ± 0.173
4.987PheLeu: 4.987 ± 0.239
1.039PheMet: 1.039 ± 0.079
3.31PheAsn: 3.31 ± 0.162
1.672PhePro: 1.672 ± 0.102
1.909PheGln: 1.909 ± 0.115
1.508PheArg: 1.508 ± 0.086
4.84PheSer: 4.84 ± 0.207
2.225PheThr: 2.225 ± 0.124
2.813PheVal: 2.813 ± 0.14
0.525PheTrp: 0.525 ± 0.067
2.112PheTyr: 2.112 ± 0.12
0.0PheXaa: 0.0 ± 0.0
Gly
3.445GlyAla: 3.445 ± 0.171
0.74GlyCys: 0.74 ± 0.074
2.863GlyAsp: 2.863 ± 0.168
2.169GlyGlu: 2.169 ± 0.124
3.072GlyPhe: 3.072 ± 0.142
3.626GlyGly: 3.626 ± 0.205
1.355GlyHis: 1.355 ± 0.074
5.371GlyIle: 5.371 ± 0.18
3.694GlyLys: 3.694 ± 0.178
5.812GlyLeu: 5.812 ± 0.21
1.372GlyMet: 1.372 ± 0.089
2.739GlyAsn: 2.739 ± 0.136
1.485GlyPro: 1.485 ± 0.092
1.542GlyGln: 1.542 ± 0.095
1.886GlyArg: 1.886 ± 0.143
3.987GlySer: 3.987 ± 0.16
2.937GlyThr: 2.937 ± 0.134
3.716GlyVal: 3.716 ± 0.171
0.582GlyTrp: 0.582 ± 0.062
2.553GlyTyr: 2.553 ± 0.127
0.0GlyXaa: 0.0 ± 0.0
His
1.305HisAla: 1.305 ± 0.097
0.361HisCys: 0.361 ± 0.047
1.621HisAsp: 1.621 ± 0.103
1.13HisGlu: 1.13 ± 0.077
1.31HisPhe: 1.31 ± 0.102
1.355HisGly: 1.355 ± 0.081
0.96HisHis: 0.96 ± 0.087
2.35HisIle: 2.35 ± 0.153
1.344HisLys: 1.344 ± 0.088
2.35HisLeu: 2.35 ± 0.112
0.537HisMet: 0.537 ± 0.062
1.355HisAsn: 1.355 ± 0.085
1.214HisPro: 1.214 ± 0.098
0.977HisGln: 0.977 ± 0.066
0.836HisArg: 0.836 ± 0.072
1.728HisSer: 1.728 ± 0.11
1.243HisThr: 1.243 ± 0.08
1.485HisVal: 1.485 ± 0.097
0.192HisTrp: 0.192 ± 0.041
1.389HisTyr: 1.389 ± 0.084
0.0HisXaa: 0.0 ± 0.0
Ile
5.337IleAla: 5.337 ± 0.183
1.147IleCys: 1.147 ± 0.077
5.224IleAsp: 5.224 ± 0.189
5.552IleGlu: 5.552 ± 0.204
4.298IlePhe: 4.298 ± 0.187
5.332IleGly: 5.332 ± 0.219
2.231IleHis: 2.231 ± 0.118
8.647IleIle: 8.647 ± 0.326
7.348IleLys: 7.348 ± 0.219
8.127IleLeu: 8.127 ± 0.283
1.824IleMet: 1.824 ± 0.107
5.371IleAsn: 5.371 ± 0.191
3.846IlePro: 3.846 ± 0.152
3.62IleGln: 3.62 ± 0.147
2.943IleArg: 2.943 ± 0.144
6.907IleSer: 6.907 ± 0.23
5.157IleThr: 5.157 ± 0.16
4.665IleVal: 4.665 ± 0.152
0.52IleTrp: 0.52 ± 0.057
3.061IleTyr: 3.061 ± 0.147
0.0IleXaa: 0.0 ± 0.0
Lys
4.219LysAla: 4.219 ± 0.179
0.542LysCys: 0.542 ± 0.062
4.055LysAsp: 4.055 ± 0.163
5.93LysGlu: 5.93 ± 0.237
2.366LysPhe: 2.366 ± 0.123
3.536LysGly: 3.536 ± 0.165
2.22LysHis: 2.22 ± 0.127
6.032LysIle: 6.032 ± 0.242
8.105LysLys: 8.105 ± 0.268
6.642LysLeu: 6.642 ± 0.221
1.497LysMet: 1.497 ± 0.095
4.383LysAsn: 4.383 ± 0.216
2.767LysPro: 2.767 ± 0.136
4.422LysGln: 4.422 ± 0.191
3.411LysArg: 3.411 ± 0.145
5.077LysSer: 5.077 ± 0.156
5.117LysThr: 5.117 ± 0.182
3.44LysVal: 3.44 ± 0.163
0.542LysTrp: 0.542 ± 0.058
2.423LysTyr: 2.423 ± 0.129
0.0LysXaa: 0.0 ± 0.0
Leu
6.602LeuAla: 6.602 ± 0.204
1.265LeuCys: 1.265 ± 0.085
5.727LeuAsp: 5.727 ± 0.174
5.546LeuGlu: 5.546 ± 0.174
5.58LeuPhe: 5.58 ± 0.242
5.778LeuGly: 5.778 ± 0.177
1.988LeuHis: 1.988 ± 0.105
8.895LeuIle: 8.895 ± 0.287
7.512LeuLys: 7.512 ± 0.236
10.403LeuLeu: 10.403 ± 0.37
2.366LeuMet: 2.366 ± 0.117
6.314LeuAsn: 6.314 ± 0.211
3.649LeuPro: 3.649 ± 0.128
2.773LeuGln: 2.773 ± 0.142
3.101LeuArg: 3.101 ± 0.13
8.63LeuSer: 8.63 ± 0.268
6.015LeuThr: 6.015 ± 0.212
5.332LeuVal: 5.332 ± 0.181
0.757LeuTrp: 0.757 ± 0.067
3.53LeuTyr: 3.53 ± 0.142
0.0LeuXaa: 0.0 ± 0.0
Met
1.361MetAla: 1.361 ± 0.096
0.232MetCys: 0.232 ± 0.032
1.186MetAsp: 1.186 ± 0.087
0.87MetGlu: 0.87 ± 0.067
0.904MetPhe: 0.904 ± 0.075
1.531MetGly: 1.531 ± 0.107
0.401MetHis: 0.401 ± 0.048
2.22MetIle: 2.22 ± 0.11
1.785MetLys: 1.785 ± 0.095
1.982MetLeu: 1.982 ± 0.114
0.695MetMet: 0.695 ± 0.064
1.18MetAsn: 1.18 ± 0.088
0.836MetPro: 0.836 ± 0.071
0.599MetGln: 0.599 ± 0.055
0.65MetArg: 0.65 ± 0.054
1.734MetSer: 1.734 ± 0.092
1.293MetThr: 1.293 ± 0.074
1.423MetVal: 1.423 ± 0.085
0.164MetTrp: 0.164 ± 0.033
0.712MetTyr: 0.712 ± 0.064
0.0MetXaa: 0.0 ± 0.0
Asn
2.717AsnAla: 2.717 ± 0.129
0.672AsnCys: 0.672 ± 0.056
3.01AsnAsp: 3.01 ± 0.149
2.722AsnGlu: 2.722 ± 0.115
2.457AsnPhe: 2.457 ± 0.122
2.536AsnGly: 2.536 ± 0.132
1.728AsnHis: 1.728 ± 0.094
5.399AsnIle: 5.399 ± 0.206
4.784AsnLys: 4.784 ± 0.224
5.151AsnLeu: 5.151 ± 0.215
1.045AsnMet: 1.045 ± 0.09
3.598AsnAsn: 3.598 ± 0.201
2.773AsnPro: 2.773 ± 0.123
2.959AsnGln: 2.959 ± 0.139
2.028AsnArg: 2.028 ± 0.113
3.462AsnSer: 3.462 ± 0.156
3.298AsnThr: 3.298 ± 0.152
3.072AsnVal: 3.072 ± 0.123
0.48AsnTrp: 0.48 ± 0.055
2.293AsnTyr: 2.293 ± 0.119
0.0AsnXaa: 0.0 ± 0.0
Pro
1.711ProAla: 1.711 ± 0.093
0.294ProCys: 0.294 ± 0.038
2.056ProAsp: 2.056 ± 0.116
2.095ProGlu: 2.095 ± 0.126
2.208ProPhe: 2.208 ± 0.126
1.785ProGly: 1.785 ± 0.11
0.994ProHis: 0.994 ± 0.079
3.807ProIle: 3.807 ± 0.156
2.604ProLys: 2.604 ± 0.13
3.711ProLeu: 3.711 ± 0.139
0.712ProMet: 0.712 ± 0.055
2.22ProAsn: 2.22 ± 0.118
1.147ProPro: 1.147 ± 0.107
1.079ProGln: 1.079 ± 0.078
0.926ProArg: 0.926 ± 0.064
2.53ProSer: 2.53 ± 0.128
1.932ProThr: 1.932 ± 0.105
2.287ProVal: 2.287 ± 0.117
0.328ProTrp: 0.328 ± 0.048
1.355ProTyr: 1.355 ± 0.082
0.0ProXaa: 0.0 ± 0.0
Gln
2.536GlnAla: 2.536 ± 0.129
0.57GlnCys: 0.57 ± 0.061
2.073GlnAsp: 2.073 ± 0.118
2.35GlnGlu: 2.35 ± 0.131
2.078GlnPhe: 2.078 ± 0.116
1.847GlnGly: 1.847 ± 0.099
1.22GlnHis: 1.22 ± 0.097
2.598GlnIle: 2.598 ± 0.122
3.01GlnLys: 3.01 ± 0.154
4.417GlnLeu: 4.417 ± 0.159
0.678GlnMet: 0.678 ± 0.062
1.881GlnAsn: 1.881 ± 0.118
1.135GlnPro: 1.135 ± 0.084
1.559GlnGln: 1.559 ± 0.101
1.389GlnArg: 1.389 ± 0.108
3.039GlnSer: 3.039 ± 0.138
2.304GlnThr: 2.304 ± 0.109
1.836GlnVal: 1.836 ± 0.106
0.401GlnTrp: 0.401 ± 0.049
1.598GlnTyr: 1.598 ± 0.083
0.0GlnXaa: 0.0 ± 0.0
Arg
1.463ArgAla: 1.463 ± 0.098
0.463ArgCys: 0.463 ± 0.052
1.915ArgAsp: 1.915 ± 0.101
1.745ArgGlu: 1.745 ± 0.104
1.824ArgPhe: 1.824 ± 0.119
1.598ArgGly: 1.598 ± 0.106
0.796ArgHis: 0.796 ± 0.062
2.824ArgIle: 2.824 ± 0.13
2.389ArgLys: 2.389 ± 0.127
3.778ArgLeu: 3.778 ± 0.148
0.768ArgMet: 0.768 ± 0.07
1.706ArgAsn: 1.706 ± 0.109
1.101ArgPro: 1.101 ± 0.08
1.361ArgGln: 1.361 ± 0.088
1.271ArgArg: 1.271 ± 0.085
2.553ArgSer: 2.553 ± 0.121
1.491ArgThr: 1.491 ± 0.081
1.926ArgVal: 1.926 ± 0.118
0.424ArgTrp: 0.424 ± 0.048
1.841ArgTyr: 1.841 ± 0.102
0.0ArgXaa: 0.0 ± 0.0
Ser
3.62SerAla: 3.62 ± 0.178
0.909SerCys: 0.909 ± 0.073
4.225SerAsp: 4.225 ± 0.167
3.569SerGlu: 3.569 ± 0.134
4.49SerPhe: 4.49 ± 0.203
3.954SerGly: 3.954 ± 0.171
1.768SerHis: 1.768 ± 0.116
7.359SerIle: 7.359 ± 0.26
5.569SerLys: 5.569 ± 0.198
9.025SerLeu: 9.025 ± 0.319
1.711SerMet: 1.711 ± 0.107
4.134SerAsn: 4.134 ± 0.188
2.638SerPro: 2.638 ± 0.127
3.106SerGln: 3.106 ± 0.144
2.237SerArg: 2.237 ± 0.117
6.083SerSer: 6.083 ± 0.237
3.643SerThr: 3.643 ± 0.133
3.914SerVal: 3.914 ± 0.15
0.683SerTrp: 0.683 ± 0.061
3.231SerTyr: 3.231 ± 0.15
0.0SerXaa: 0.0 ± 0.0
Thr
3.072ThrAla: 3.072 ± 0.131
0.796ThrCys: 0.796 ± 0.068
3.067ThrAsp: 3.067 ± 0.14
2.57ThrGlu: 2.57 ± 0.135
2.784ThrPhe: 2.784 ± 0.145
3.016ThrGly: 3.016 ± 0.147
1.468ThrHis: 1.468 ± 0.076
5.614ThrIle: 5.614 ± 0.212
4.1ThrLys: 4.1 ± 0.145
6.393ThrLeu: 6.393 ± 0.207
1.248ThrMet: 1.248 ± 0.078
2.999ThrAsn: 2.999 ± 0.149
2.27ThrPro: 2.27 ± 0.126
2.231ThrGln: 2.231 ± 0.114
1.418ThrArg: 1.418 ± 0.093
3.824ThrSer: 3.824 ± 0.155
2.959ThrThr: 2.959 ± 0.119
3.225ThrVal: 3.225 ± 0.133
0.474ThrTrp: 0.474 ± 0.05
2.496ThrTyr: 2.496 ± 0.138
0.0ThrXaa: 0.0 ± 0.0
Val
3.242ValAla: 3.242 ± 0.149
0.858ValCys: 0.858 ± 0.077
3.191ValAsp: 3.191 ± 0.141
2.717ValGlu: 2.717 ± 0.138
2.926ValPhe: 2.926 ± 0.134
3.327ValGly: 3.327 ± 0.16
1.135ValHis: 1.135 ± 0.079
5.253ValIle: 5.253 ± 0.177
3.581ValLys: 3.581 ± 0.149
5.541ValLeu: 5.541 ± 0.198
1.406ValMet: 1.406 ± 0.097
2.852ValAsn: 2.852 ± 0.135
1.982ValPro: 1.982 ± 0.106
1.564ValGln: 1.564 ± 0.105
1.937ValArg: 1.937 ± 0.113
4.665ValSer: 4.665 ± 0.168
3.355ValThr: 3.355 ± 0.142
3.524ValVal: 3.524 ± 0.178
0.384ValTrp: 0.384 ± 0.046
2.095ValTyr: 2.095 ± 0.115
0.0ValXaa: 0.0 ± 0.0
Trp
0.463TrpAla: 0.463 ± 0.055
0.124TrpCys: 0.124 ± 0.024
0.514TrpAsp: 0.514 ± 0.054
0.559TrpGlu: 0.559 ± 0.052
0.537TrpPhe: 0.537 ± 0.068
0.604TrpGly: 0.604 ± 0.064
0.22TrpHis: 0.22 ± 0.036
0.666TrpIle: 0.666 ± 0.067
0.503TrpLys: 0.503 ± 0.051
0.909TrpLeu: 0.909 ± 0.073
0.186TrpMet: 0.186 ± 0.032
0.474TrpAsn: 0.474 ± 0.05
0.277TrpPro: 0.277 ± 0.042
0.299TrpGln: 0.299 ± 0.044
0.265TrpArg: 0.265 ± 0.041
0.486TrpSer: 0.486 ± 0.048
0.356TrpThr: 0.356 ± 0.044
0.553TrpVal: 0.553 ± 0.059
0.119TrpTrp: 0.119 ± 0.024
0.305TrpTyr: 0.305 ± 0.046
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.965TyrAla: 1.965 ± 0.1
0.593TyrCys: 0.593 ± 0.066
2.389TyrAsp: 2.389 ± 0.107
1.943TyrGlu: 1.943 ± 0.132
2.253TyrPhe: 2.253 ± 0.147
2.429TyrGly: 2.429 ± 0.116
1.203TyrHis: 1.203 ± 0.089
3.225TyrIle: 3.225 ± 0.148
2.666TyrLys: 2.666 ± 0.132
4.089TyrLeu: 4.089 ± 0.158
0.791TyrMet: 0.791 ± 0.06
2.304TyrAsn: 2.304 ± 0.118
1.66TyrPro: 1.66 ± 0.103
2.208TyrGln: 2.208 ± 0.116
1.576TyrArg: 1.576 ± 0.083
2.92TyrSer: 2.92 ± 0.135
2.056TyrThr: 2.056 ± 0.1
2.118TyrVal: 2.118 ± 0.123
0.407TyrTrp: 0.407 ± 0.049
1.728TyrTyr: 1.728 ± 0.114
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 544 proteins (177059 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski