Amino acid dipepetide frequency for Candidatus Marinamargulisbacteria bacterium SCGC AG-439-L15

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.694AlaAla: 4.694 ± 0.247
0.863AlaCys: 0.863 ± 0.057
2.897AlaAsp: 2.897 ± 0.118
3.43AlaGlu: 3.43 ± 0.142
3.003AlaPhe: 3.003 ± 0.116
3.81AlaGly: 3.81 ± 0.121
1.52AlaHis: 1.52 ± 0.069
5.277AlaIle: 5.277 ± 0.145
4.61AlaLys: 4.61 ± 0.138
7.691AlaLeu: 7.691 ± 0.181
1.601AlaMet: 1.601 ± 0.074
2.43AlaAsn: 2.43 ± 0.088
2.308AlaPro: 2.308 ± 0.086
2.592AlaGln: 2.592 ± 0.106
2.196AlaArg: 2.196 ± 0.087
4.744AlaSer: 4.744 ± 0.165
3.402AlaThr: 3.402 ± 0.121
4.074AlaVal: 4.074 ± 0.147
0.502AlaTrp: 0.502 ± 0.039
2.286AlaTyr: 2.286 ± 0.095
0.0AlaXaa: 0.0 ± 0.0
Cys
0.554CysAla: 0.554 ± 0.045
0.168CysCys: 0.168 ± 0.025
0.692CysAsp: 0.692 ± 0.045
0.713CysGlu: 0.713 ± 0.05
0.667CysPhe: 0.667 ± 0.049
0.757CysGly: 0.757 ± 0.055
0.352CysHis: 0.352 ± 0.035
0.838CysIle: 0.838 ± 0.056
0.573CysLys: 0.573 ± 0.044
1.399CysLeu: 1.399 ± 0.07
0.24CysMet: 0.24 ± 0.027
0.38CysAsn: 0.38 ± 0.036
0.57CysPro: 0.57 ± 0.046
0.517CysGln: 0.517 ± 0.047
0.511CysArg: 0.511 ± 0.045
0.695CysSer: 0.695 ± 0.045
0.554CysThr: 0.554 ± 0.043
0.645CysVal: 0.645 ± 0.048
0.059CysTrp: 0.059 ± 0.014
0.464CysTyr: 0.464 ± 0.043
0.0CysXaa: 0.0 ± 0.0
Asp
3.912AspAla: 3.912 ± 0.133
0.667AspCys: 0.667 ± 0.046
2.769AspAsp: 2.769 ± 0.114
3.511AspGlu: 3.511 ± 0.119
2.439AspPhe: 2.439 ± 0.085
3.099AspGly: 3.099 ± 0.104
1.237AspHis: 1.237 ± 0.068
4.321AspIle: 4.321 ± 0.12
3.813AspLys: 3.813 ± 0.127
5.816AspLeu: 5.816 ± 0.161
1.218AspMet: 1.218 ± 0.057
2.09AspAsn: 2.09 ± 0.08
2.707AspPro: 2.707 ± 0.12
2.299AspGln: 2.299 ± 0.099
2.33AspArg: 2.33 ± 0.097
3.766AspSer: 3.766 ± 0.125
3.243AspThr: 3.243 ± 0.119
3.402AspVal: 3.402 ± 0.124
0.561AspTrp: 0.561 ± 0.046
2.177AspTyr: 2.177 ± 0.094
0.0AspXaa: 0.0 ± 0.0
Glu
4.476GluAla: 4.476 ± 0.159
0.505GluCys: 0.505 ± 0.043
3.701GluAsp: 3.701 ± 0.116
3.978GluGlu: 3.978 ± 0.135
2.156GluPhe: 2.156 ± 0.084
4.012GluGly: 4.012 ± 0.128
1.193GluHis: 1.193 ± 0.065
4.339GluIle: 4.339 ± 0.123
5.928GluLys: 5.928 ± 0.162
6.495GluLeu: 6.495 ± 0.163
1.579GluMet: 1.579 ± 0.068
3.087GluAsn: 3.087 ± 0.109
1.576GluPro: 1.576 ± 0.074
2.062GluGln: 2.062 ± 0.085
2.383GluArg: 2.383 ± 0.097
4.523GluSer: 4.523 ± 0.12
4.766GluThr: 4.766 ± 0.132
3.726GluVal: 3.726 ± 0.126
0.48GluTrp: 0.48 ± 0.038
1.741GluTyr: 1.741 ± 0.078
0.0GluXaa: 0.0 ± 0.0
Phe
2.227PheAla: 2.227 ± 0.098
0.579PheCys: 0.579 ± 0.042
2.529PheAsp: 2.529 ± 0.082
2.694PheGlu: 2.694 ± 0.099
2.632PhePhe: 2.632 ± 0.13
2.779PheGly: 2.779 ± 0.112
0.897PheHis: 0.897 ± 0.045
3.233PheIle: 3.233 ± 0.131
3.246PheLys: 3.246 ± 0.106
5.087PheLeu: 5.087 ± 0.166
1.068PheMet: 1.068 ± 0.06
2.05PheAsn: 2.05 ± 0.088
1.822PhePro: 1.822 ± 0.071
1.776PheGln: 1.776 ± 0.073
1.523PheArg: 1.523 ± 0.07
4.155PheSer: 4.155 ± 0.141
2.299PheThr: 2.299 ± 0.083
2.442PheVal: 2.442 ± 0.102
0.505PheTrp: 0.505 ± 0.038
1.629PheTyr: 1.629 ± 0.081
0.0PheXaa: 0.0 ± 0.0
Gly
4.296GlyAla: 4.296 ± 0.132
0.841GlyCys: 0.841 ± 0.057
3.261GlyAsp: 3.261 ± 0.107
3.149GlyGlu: 3.149 ± 0.101
3.106GlyPhe: 3.106 ± 0.106
4.442GlyGly: 4.442 ± 0.179
1.445GlyHis: 1.445 ± 0.07
5.018GlyIle: 5.018 ± 0.156
3.897GlyLys: 3.897 ± 0.118
7.046GlyLeu: 7.046 ± 0.175
1.558GlyMet: 1.558 ± 0.069
2.336GlyAsn: 2.336 ± 0.102
1.953GlyPro: 1.953 ± 0.082
2.205GlyGln: 2.205 ± 0.081
2.489GlyArg: 2.489 ± 0.09
4.439GlySer: 4.439 ± 0.155
3.794GlyThr: 3.794 ± 0.14
4.975GlyVal: 4.975 ± 0.156
0.639GlyTrp: 0.639 ± 0.046
2.458GlyTyr: 2.458 ± 0.106
0.0GlyXaa: 0.0 ± 0.0
His
1.352HisAla: 1.352 ± 0.066
0.333HisCys: 0.333 ± 0.034
1.112HisAsp: 1.112 ± 0.057
1.112HisGlu: 1.112 ± 0.058
1.193HisPhe: 1.193 ± 0.07
1.187HisGly: 1.187 ± 0.066
0.754HisHis: 0.754 ± 0.056
1.623HisIle: 1.623 ± 0.092
1.352HisLys: 1.352 ± 0.068
2.311HisLeu: 2.311 ± 0.092
0.433HisMet: 0.433 ± 0.036
0.891HisAsn: 0.891 ± 0.057
1.184HisPro: 1.184 ± 0.07
1.156HisGln: 1.156 ± 0.069
0.931HisArg: 0.931 ± 0.059
1.389HisSer: 1.389 ± 0.066
1.181HisThr: 1.181 ± 0.068
1.137HisVal: 1.137 ± 0.06
0.283HisTrp: 0.283 ± 0.033
1.056HisTyr: 1.056 ± 0.063
0.0HisXaa: 0.0 ± 0.0
Ile
4.722IleAla: 4.722 ± 0.137
0.785IleCys: 0.785 ± 0.063
4.252IleAsp: 4.252 ± 0.113
4.604IleGlu: 4.604 ± 0.136
2.807IlePhe: 2.807 ± 0.123
4.769IleGly: 4.769 ± 0.146
1.623IleHis: 1.623 ± 0.075
4.744IleIle: 4.744 ± 0.191
4.987IleLys: 4.987 ± 0.137
6.897IleLeu: 6.897 ± 0.207
1.343IleMet: 1.343 ± 0.064
2.95IleAsn: 2.95 ± 0.096
3.314IlePro: 3.314 ± 0.129
3.582IleGln: 3.582 ± 0.12
2.772IleArg: 2.772 ± 0.098
5.554IleSer: 5.554 ± 0.138
3.972IleThr: 3.972 ± 0.139
4.084IleVal: 4.084 ± 0.145
0.486IleTrp: 0.486 ± 0.041
2.165IleTyr: 2.165 ± 0.107
0.0IleXaa: 0.0 ± 0.0
Lys
4.853LysAla: 4.853 ± 0.135
0.517LysCys: 0.517 ± 0.044
4.539LysAsp: 4.539 ± 0.129
6.482LysGlu: 6.482 ± 0.166
1.657LysPhe: 1.657 ± 0.081
4.623LysGly: 4.623 ± 0.114
1.374LysHis: 1.374 ± 0.072
4.514LysIle: 4.514 ± 0.136
7.616LysLys: 7.616 ± 0.228
6.417LysLeu: 6.417 ± 0.156
1.564LysMet: 1.564 ± 0.069
3.532LysAsn: 3.532 ± 0.126
2.43LysPro: 2.43 ± 0.098
3.349LysGln: 3.349 ± 0.115
3.679LysArg: 3.679 ± 0.107
4.738LysSer: 4.738 ± 0.116
5.395LysThr: 5.395 ± 0.152
4.445LysVal: 4.445 ± 0.123
0.57LysTrp: 0.57 ± 0.042
1.816LysTyr: 1.816 ± 0.076
0.0LysXaa: 0.0 ± 0.0
Leu
6.507LeuAla: 6.507 ± 0.156
1.28LeuCys: 1.28 ± 0.077
6.012LeuAsp: 6.012 ± 0.159
6.616LeuGlu: 6.616 ± 0.153
5.04LeuPhe: 5.04 ± 0.172
7.292LeuGly: 7.292 ± 0.181
1.966LeuHis: 1.966 ± 0.087
7.015LeuIle: 7.015 ± 0.2
8.678LeuLys: 8.678 ± 0.196
10.6LeuLeu: 10.6 ± 0.27
2.258LeuMet: 2.258 ± 0.093
4.657LeuAsn: 4.657 ± 0.12
4.031LeuPro: 4.031 ± 0.121
3.788LeuGln: 3.788 ± 0.114
3.825LeuArg: 3.825 ± 0.099
10.435LeuSer: 10.435 ± 0.179
5.962LeuThr: 5.962 ± 0.129
5.76LeuVal: 5.76 ± 0.139
0.801LeuTrp: 0.801 ± 0.053
3.461LeuTyr: 3.461 ± 0.128
0.0LeuXaa: 0.0 ± 0.0
Met
1.853MetAla: 1.853 ± 0.068
0.178MetCys: 0.178 ± 0.023
1.255MetAsp: 1.255 ± 0.07
1.006MetGlu: 1.006 ± 0.059
0.576MetPhe: 0.576 ± 0.04
1.726MetGly: 1.726 ± 0.082
0.312MetHis: 0.312 ± 0.034
1.782MetIle: 1.782 ± 0.074
1.573MetLys: 1.573 ± 0.068
1.707MetLeu: 1.707 ± 0.074
0.716MetMet: 0.716 ± 0.048
0.854MetAsn: 0.854 ± 0.058
0.91MetPro: 0.91 ± 0.05
0.67MetGln: 0.67 ± 0.047
0.916MetArg: 0.916 ± 0.056
2.012MetSer: 2.012 ± 0.072
1.638MetThr: 1.638 ± 0.064
1.554MetVal: 1.554 ± 0.071
0.112MetTrp: 0.112 ± 0.017
0.449MetTyr: 0.449 ± 0.037
0.0MetXaa: 0.0 ± 0.0
Asn
2.822AsnAla: 2.822 ± 0.094
0.477AsnCys: 0.477 ± 0.036
2.075AsnAsp: 2.075 ± 0.096
2.514AsnGlu: 2.514 ± 0.086
1.676AsnPhe: 1.676 ± 0.076
2.67AsnGly: 2.67 ± 0.114
0.913AsnHis: 0.913 ± 0.06
2.897AsnIle: 2.897 ± 0.111
2.95AsnLys: 2.95 ± 0.126
3.875AsnLeu: 3.875 ± 0.101
0.844AsnMet: 0.844 ± 0.046
1.825AsnAsn: 1.825 ± 0.097
2.52AsnPro: 2.52 ± 0.092
1.772AsnGln: 1.772 ± 0.077
2.015AsnArg: 2.015 ± 0.083
2.906AsnSer: 2.906 ± 0.12
2.819AsnThr: 2.819 ± 0.124
2.112AsnVal: 2.112 ± 0.084
0.477AsnTrp: 0.477 ± 0.037
1.483AsnTyr: 1.483 ± 0.079
0.0AsnXaa: 0.0 ± 0.0
Pro
1.941ProAla: 1.941 ± 0.093
0.361ProCys: 0.361 ± 0.032
2.364ProAsp: 2.364 ± 0.083
3.149ProGlu: 3.149 ± 0.102
2.05ProPhe: 2.05 ± 0.093
2.218ProGly: 2.218 ± 0.081
0.947ProHis: 0.947 ± 0.059
3.053ProIle: 3.053 ± 0.116
3.174ProLys: 3.174 ± 0.112
4.087ProLeu: 4.087 ± 0.121
0.738ProMet: 0.738 ± 0.051
2.04ProAsn: 2.04 ± 0.084
1.492ProPro: 1.492 ± 0.085
1.433ProGln: 1.433 ± 0.06
1.215ProArg: 1.215 ± 0.073
3.321ProSer: 3.321 ± 0.116
2.224ProThr: 2.224 ± 0.091
2.483ProVal: 2.483 ± 0.086
0.315ProTrp: 0.315 ± 0.028
1.439ProTyr: 1.439 ± 0.066
0.0ProXaa: 0.0 ± 0.0
Gln
2.853GlnAla: 2.853 ± 0.118
0.374GlnCys: 0.374 ± 0.029
2.271GlnAsp: 2.271 ± 0.089
2.81GlnGlu: 2.81 ± 0.122
1.95GlnPhe: 1.95 ± 0.082
2.09GlnGly: 2.09 ± 0.076
0.825GlnHis: 0.825 ± 0.054
2.112GlnIle: 2.112 ± 0.083
3.931GlnLys: 3.931 ± 0.122
4.909GlnLeu: 4.909 ± 0.148
0.723GlnMet: 0.723 ± 0.045
1.869GlnAsn: 1.869 ± 0.089
1.034GlnPro: 1.034 ± 0.064
1.713GlnGln: 1.713 ± 0.093
1.657GlnArg: 1.657 ± 0.07
3.146GlnSer: 3.146 ± 0.118
2.651GlnThr: 2.651 ± 0.101
2.657GlnVal: 2.657 ± 0.096
0.321GlnTrp: 0.321 ± 0.03
1.268GlnTyr: 1.268 ± 0.061
0.0GlnXaa: 0.0 ± 0.0
Arg
2.495ArgAla: 2.495 ± 0.105
0.477ArgCys: 0.477 ± 0.042
2.137ArgAsp: 2.137 ± 0.088
2.564ArgGlu: 2.564 ± 0.095
2.134ArgPhe: 2.134 ± 0.081
2.196ArgGly: 2.196 ± 0.086
1.059ArgHis: 1.059 ± 0.074
2.467ArgIle: 2.467 ± 0.095
2.349ArgLys: 2.349 ± 0.093
4.847ArgLeu: 4.847 ± 0.121
0.769ArgMet: 0.769 ± 0.052
1.483ArgAsn: 1.483 ± 0.081
1.573ArgPro: 1.573 ± 0.07
1.972ArgGln: 1.972 ± 0.089
1.841ArgArg: 1.841 ± 0.097
2.576ArgSer: 2.576 ± 0.096
1.86ArgThr: 1.86 ± 0.09
2.751ArgVal: 2.751 ± 0.103
0.308ArgTrp: 0.308 ± 0.028
1.804ArgTyr: 1.804 ± 0.081
0.0ArgXaa: 0.0 ± 0.0
Ser
4.345SerAla: 4.345 ± 0.138
0.825SerCys: 0.825 ± 0.062
4.539SerAsp: 4.539 ± 0.142
5.006SerGlu: 5.006 ± 0.138
4.131SerPhe: 4.131 ± 0.125
4.965SerGly: 4.965 ± 0.146
1.645SerHis: 1.645 ± 0.071
5.333SerIle: 5.333 ± 0.152
5.152SerLys: 5.152 ± 0.143
8.504SerLeu: 8.504 ± 0.177
1.548SerMet: 1.548 ± 0.066
2.913SerAsn: 2.913 ± 0.117
3.031SerPro: 3.031 ± 0.1
3.255SerGln: 3.255 ± 0.121
2.934SerArg: 2.934 ± 0.126
6.059SerSer: 6.059 ± 0.226
4.358SerThr: 4.358 ± 0.14
5.106SerVal: 5.106 ± 0.157
0.66SerTrp: 0.66 ± 0.045
2.561SerTyr: 2.561 ± 0.095
0.0SerXaa: 0.0 ± 0.0
Thr
3.757ThrAla: 3.757 ± 0.121
0.667ThrCys: 0.667 ± 0.043
3.022ThrAsp: 3.022 ± 0.1
3.398ThrGlu: 3.398 ± 0.115
2.676ThrPhe: 2.676 ± 0.089
3.763ThrGly: 3.763 ± 0.099
1.551ThrHis: 1.551 ± 0.082
4.75ThrIle: 4.75 ± 0.14
3.598ThrLys: 3.598 ± 0.115
7.052ThrLeu: 7.052 ± 0.169
1.199ThrMet: 1.199 ± 0.057
2.293ThrAsn: 2.293 ± 0.101
3.28ThrPro: 3.28 ± 0.117
2.629ThrGln: 2.629 ± 0.096
2.171ThrArg: 2.171 ± 0.073
3.747ThrSer: 3.747 ± 0.117
3.467ThrThr: 3.467 ± 0.131
4.074ThrVal: 4.074 ± 0.118
0.439ThrTrp: 0.439 ± 0.042
1.99ThrTyr: 1.99 ± 0.092
0.0ThrXaa: 0.0 ± 0.0
Val
3.807ValAla: 3.807 ± 0.13
0.807ValCys: 0.807 ± 0.056
3.427ValAsp: 3.427 ± 0.115
3.374ValGlu: 3.374 ± 0.101
3.099ValPhe: 3.099 ± 0.116
4.109ValGly: 4.109 ± 0.148
1.227ValHis: 1.227 ± 0.066
4.402ValIle: 4.402 ± 0.127
3.822ValLys: 3.822 ± 0.107
6.75ValLeu: 6.75 ± 0.163
1.614ValMet: 1.614 ± 0.082
2.246ValAsn: 2.246 ± 0.09
2.545ValPro: 2.545 ± 0.106
2.371ValGln: 2.371 ± 0.086
2.283ValArg: 2.283 ± 0.089
5.813ValSer: 5.813 ± 0.145
3.526ValThr: 3.526 ± 0.121
4.473ValVal: 4.473 ± 0.158
0.439ValTrp: 0.439 ± 0.039
2.199ValTyr: 2.199 ± 0.08
0.0ValXaa: 0.0 ± 0.0
Trp
0.477TrpAla: 0.477 ± 0.036
0.121TrpCys: 0.121 ± 0.02
0.511TrpAsp: 0.511 ± 0.046
0.598TrpGlu: 0.598 ± 0.045
0.396TrpPhe: 0.396 ± 0.042
0.673TrpGly: 0.673 ± 0.04
0.221TrpHis: 0.221 ± 0.028
0.645TrpIle: 0.645 ± 0.046
0.48TrpLys: 0.48 ± 0.039
0.779TrpLeu: 0.779 ± 0.049
0.202TrpMet: 0.202 ± 0.027
0.324TrpAsn: 0.324 ± 0.03
0.305TrpPro: 0.305 ± 0.025
0.361TrpGln: 0.361 ± 0.038
0.349TrpArg: 0.349 ± 0.032
0.53TrpSer: 0.53 ± 0.047
0.414TrpThr: 0.414 ± 0.035
0.623TrpVal: 0.623 ± 0.048
0.112TrpTrp: 0.112 ± 0.02
0.29TrpTyr: 0.29 ± 0.032
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.984TyrAla: 1.984 ± 0.085
0.526TyrCys: 0.526 ± 0.05
2.012TyrAsp: 2.012 ± 0.085
1.997TyrGlu: 1.997 ± 0.083
1.788TyrPhe: 1.788 ± 0.081
2.168TyrGly: 2.168 ± 0.095
0.91TyrHis: 0.91 ± 0.063
2.05TyrIle: 2.05 ± 0.085
2.38TyrLys: 2.38 ± 0.098
3.716TyrLeu: 3.716 ± 0.128
0.583TyrMet: 0.583 ± 0.047
1.389TyrAsn: 1.389 ± 0.07
1.458TyrPro: 1.458 ± 0.073
1.663TyrGln: 1.663 ± 0.062
1.638TyrArg: 1.638 ± 0.071
2.321TyrSer: 2.321 ± 0.093
2.0TyrThr: 2.0 ± 0.082
1.81TyrVal: 1.81 ± 0.093
0.343TyrTrp: 0.343 ± 0.032
1.464TyrTyr: 1.464 ± 0.074
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 955 proteins (321027 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski