Amino acid dipeptide frequency for Candidatus Bipolaricaulis anaerobius

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
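As a rough illustration of the calculation described above, the following sketch counts overlapping residue pairs in a set of sequences and reports them in per mille, next to the uniform expectation of 1000/400 = 2.5‰. It is only a minimal sketch under stated assumptions: the function names, the one-letter alphabet, the toy sequences, and the choice to count overlapping pairs within each protein are illustrative and not taken from Proteome-pI.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences, alphabet=STANDARD, unknown="X"):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs are counted within each sequence only, and any
    residue outside `alphabet` is mapped to the catch-all symbol `unknown`.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = [aa if aa in alphabet else unknown for aa in seq.upper()]
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    symbols = alphabet + unknown
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(symbols, repeat=2)
    }


def uniform_expectation_per_mille(n_symbols=20):
    """Per mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / n_symbols ** 2


# Hypothetical toy input, not real proteome data.
toy = ["MAALAG", "AAGL"]
freqs = dipeptide_per_mille(toy)
expected = uniform_expectation_per_mille()
over = sorted(d for d, f in freqs.items() if f > expected)
under = sorted(d for d, f in freqs.items() if f < expected)
print(f"expected: {expected} per mille; over-represented: {over}")
```

With a real proteome, `over` and `under` would correspond to the cells marked in red and blue, assuming the page uses the same 2.5‰ threshold.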

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
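To make the unit concrete, the short sketch below converts two values from the table (AlaAla and CysTrp) from per mille to plain fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical and chosen only for this illustration.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value):
    """How many times more (or less) frequent than the uniform expectation."""
    return value / UNIFORM_PER_MILLE


ala_ala = 14.546  # AlaAla, taken from the table
cys_trp = 0.108   # CysTrp, taken from the table

for name, value in (("AlaAla", ala_ala), ("CysTrp", cys_trp)):
    print(name, per_mille_to_fraction(value), round(fold_vs_uniform(value), 3))
```

A ratio above 1 corresponds to an over-represented dipeptide and a ratio below 1 to an under-represented one.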

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 14.546 ± 0.381 | 1.204 ± 0.078 | 5.44 ± 0.122 | 8.111 ± 0.189 | 4.255 ± 0.099 | 10.948 ± 0.192 | 2.565 ± 0.087 | 5.25 ± 0.133 | 2.847 ± 0.099 | 14.418 ± 0.228 | 2.22 ± 0.075 | 1.78 ± 0.074 | 5.142 ± 0.119 | 3.88 ± 0.163 | 9.712 ± 0.18 | 4.961 ± 0.149 | 5.732 ± 0.146 | 10.353 ± 0.176 | 1.691 ± 0.065 | 2.847 ± 0.087 | 0.0 ± 0.0 |
| Cys | 1.062 ± 0.07 | 0.2 ± 0.039 | 0.542 ± 0.041 | 0.458 ± 0.033 | 0.222 ± 0.022 | 1.211 ± 0.068 | 0.197 ± 0.023 | 0.275 ± 0.029 | 0.137 ± 0.018 | 0.867 ± 0.044 | 0.125 ± 0.018 | 0.171 ± 0.022 | 1.021 ± 0.066 | 0.275 ± 0.025 | 0.696 ± 0.045 | 0.462 ± 0.044 | 0.486 ± 0.042 | 0.626 ± 0.041 | 0.108 ± 0.015 | 0.21 ± 0.026 | 0.0 ± 0.0 |
| Asp | 4.877 ± 0.131 | 0.383 ± 0.032 | 2.189 ± 0.133 | 3.781 ± 0.103 | 1.529 ± 0.061 | 4.749 ± 0.194 | 1.163 ± 0.054 | 1.854 ± 0.071 | 0.877 ± 0.05 | 7.078 ± 0.148 | 0.621 ± 0.043 | 0.655 ± 0.048 | 4.171 ± 0.112 | 1.219 ± 0.06 | 4.354 ± 0.111 | 1.474 ± 0.077 | 1.828 ± 0.108 | 3.93 ± 0.093 | 0.792 ± 0.054 | 1.223 ± 0.063 | 0.0 ± 0.0 |
| Glu | 8.578 ± 0.189 | 0.332 ± 0.032 | 2.748 ± 0.1 | 5.948 ± 0.167 | 2.09 ± 0.084 | 6.261 ± 0.135 | 1.226 ± 0.063 | 3.909 ± 0.113 | 2.114 ± 0.082 | 7.593 ± 0.165 | 1.257 ± 0.062 | 1.096 ± 0.047 | 3.078 ± 0.103 | 1.594 ± 0.069 | 6.5 ± 0.148 | 1.859 ± 0.068 | 2.598 ± 0.092 | 6.941 ± 0.157 | 0.783 ± 0.04 | 1.192 ± 0.063 | 0.0 ± 0.0 |
| Phe | 3.718 ± 0.097 | 0.373 ± 0.03 | 1.768 ± 0.074 | 1.58 ± 0.067 | 1.17 ± 0.052 | 3.061 ± 0.087 | 0.694 ± 0.039 | 1.134 ± 0.059 | 0.564 ± 0.037 | 4.234 ± 0.11 | 0.499 ± 0.038 | 0.636 ± 0.034 | 2.237 ± 0.076 | 0.995 ± 0.055 | 2.637 ± 0.084 | 1.987 ± 0.08 | 2.057 ± 0.086 | 2.762 ± 0.084 | 0.515 ± 0.035 | 0.905 ± 0.048 | 0.0 ± 0.0 |
| Gly | 9.551 ± 0.177 | 1.036 ± 0.062 | 4.424 ± 0.126 | 7.244 ± 0.144 | 3.239 ± 0.088 | 8.547 ± 0.195 | 1.982 ± 0.074 | 5.351 ± 0.128 | 3.437 ± 0.096 | 9.847 ± 0.189 | 2.37 ± 0.079 | 1.744 ± 0.061 | 4.482 ± 0.119 | 2.618 ± 0.078 | 7.514 ± 0.163 | 4.311 ± 0.145 | 5.479 ± 0.164 | 7.485 ± 0.143 | 1.717 ± 0.083 | 2.57 ± 0.09 | 0.0 ± 0.0 |
| His | 2.042 ± 0.068 | 0.19 ± 0.025 | 1.024 ± 0.053 | 1.166 ± 0.051 | 0.65 ± 0.039 | 2.04 ± 0.069 | 0.515 ± 0.038 | 0.831 ± 0.047 | 0.402 ± 0.032 | 2.389 ± 0.087 | 0.296 ± 0.026 | 0.419 ± 0.029 | 1.741 ± 0.071 | 0.537 ± 0.037 | 1.799 ± 0.073 | 0.72 ± 0.044 | 0.903 ± 0.058 | 1.635 ± 0.076 | 0.299 ± 0.029 | 0.629 ± 0.045 | 0.0 ± 0.0 |
| Ile | 5.931 ± 0.114 | 0.349 ± 0.03 | 1.353 ± 0.065 | 3.299 ± 0.102 | 1.076 ± 0.056 | 4.308 ± 0.113 | 0.891 ± 0.053 | 1.452 ± 0.07 | 0.78 ± 0.049 | 5.161 ± 0.123 | 0.566 ± 0.038 | 0.785 ± 0.05 | 3.215 ± 0.095 | 1.233 ± 0.06 | 3.485 ± 0.113 | 2.037 ± 0.079 | 2.396 ± 0.099 | 4.275 ± 0.095 | 0.508 ± 0.033 | 0.963 ± 0.051 | 0.0 ± 0.0 |
| Lys | 2.868 ± 0.085 | 0.183 ± 0.019 | 1.163 ± 0.065 | 1.886 ± 0.074 | 0.585 ± 0.04 | 2.213 ± 0.085 | 0.405 ± 0.032 | 1.223 ± 0.065 | 0.997 ± 0.065 | 2.772 ± 0.097 | 0.46 ± 0.034 | 0.482 ± 0.036 | 1.293 ± 0.058 | 0.448 ± 0.035 | 2.136 ± 0.071 | 0.872 ± 0.05 | 1.457 ± 0.063 | 2.42 ± 0.087 | 0.299 ± 0.025 | 0.518 ± 0.039 | 0.0 ± 0.0 |
| Leu | 16.044 ± 0.237 | 0.98 ± 0.052 | 6.057 ± 0.125 | 5.522 ± 0.133 | 3.74 ± 0.105 | 10.707 ± 0.194 | 2.073 ± 0.076 | 4.263 ± 0.124 | 2.346 ± 0.091 | 12.176 ± 0.278 | 1.568 ± 0.063 | 1.712 ± 0.073 | 6.827 ± 0.151 | 2.319 ± 0.082 | 9.231 ± 0.182 | 6.553 ± 0.116 | 5.963 ± 0.132 | 10.001 ± 0.18 | 1.736 ± 0.069 | 2.432 ± 0.08 | 0.0 ± 0.0 |
| Met | 2.379 ± 0.078 | 0.125 ± 0.017 | 0.927 ± 0.049 | 1.091 ± 0.056 | 0.511 ± 0.039 | 1.7 ± 0.068 | 0.222 ± 0.024 | 0.949 ± 0.051 | 0.696 ± 0.041 | 1.445 ± 0.06 | 0.395 ± 0.036 | 0.506 ± 0.034 | 0.92 ± 0.059 | 0.347 ± 0.032 | 1.558 ± 0.059 | 0.865 ± 0.041 | 1.055 ± 0.056 | 1.536 ± 0.061 | 0.214 ± 0.02 | 0.328 ± 0.028 | 0.0 ± 0.0 |
| Asn | 1.818 ± 0.064 | 0.171 ± 0.019 | 0.766 ± 0.064 | 0.925 ± 0.058 | 0.588 ± 0.04 | 1.467 ± 0.07 | 0.311 ± 0.028 | 0.727 ± 0.045 | 0.397 ± 0.032 | 2.223 ± 0.065 | 0.371 ± 0.032 | 0.328 ± 0.031 | 1.647 ± 0.086 | 0.486 ± 0.04 | 1.32 ± 0.058 | 0.578 ± 0.039 | 0.79 ± 0.042 | 1.7 ± 0.078 | 0.275 ± 0.027 | 0.45 ± 0.031 | 0.0 ± 0.0 |
| Pro | 6.175 ± 0.144 | 0.648 ± 0.046 | 3.644 ± 0.092 | 4.549 ± 0.129 | 2.206 ± 0.077 | 5.785 ± 0.128 | 1.325 ± 0.061 | 2.211 ± 0.083 | 1.252 ± 0.055 | 6.119 ± 0.122 | 1.103 ± 0.053 | 1.031 ± 0.056 | 3.99 ± 0.122 | 1.681 ± 0.06 | 4.125 ± 0.097 | 2.921 ± 0.093 | 3.427 ± 0.09 | 4.747 ± 0.126 | 1.019 ± 0.061 | 1.573 ± 0.069 | 0.0 ± 0.0 |
| Gln | 3.853 ± 0.168 | 0.226 ± 0.026 | 1.202 ± 0.049 | 1.833 ± 0.073 | 0.927 ± 0.05 | 2.724 ± 0.076 | 0.414 ± 0.033 | 1.315 ± 0.056 | 0.73 ± 0.047 | 2.454 ± 0.093 | 0.45 ± 0.035 | 0.45 ± 0.032 | 1.382 ± 0.045 | 0.583 ± 0.042 | 2.148 ± 0.07 | 1.038 ± 0.048 | 1.233 ± 0.055 | 2.497 ± 0.082 | 0.412 ± 0.032 | 0.617 ± 0.046 | 0.0 ± 0.0 |
| Arg | 9.433 ± 0.19 | 0.843 ± 0.055 | 4.113 ± 0.104 | 6.444 ± 0.16 | 2.967 ± 0.095 | 7.225 ± 0.177 | 1.508 ± 0.062 | 4.289 ± 0.112 | 2.061 ± 0.078 | 8.528 ± 0.213 | 1.898 ± 0.073 | 1.337 ± 0.062 | 4.229 ± 0.122 | 2.129 ± 0.082 | 7.068 ± 0.146 | 3.636 ± 0.096 | 4.024 ± 0.091 | 6.623 ± 0.139 | 1.397 ± 0.057 | 2.216 ± 0.073 | 0.0 ± 0.0 |
| Ser | 4.597 ± 0.132 | 0.499 ± 0.035 | 2.117 ± 0.1 | 2.252 ± 0.071 | 1.804 ± 0.074 | 4.807 ± 0.124 | 0.978 ± 0.054 | 1.51 ± 0.063 | 0.809 ± 0.044 | 5.606 ± 0.137 | 0.684 ± 0.041 | 0.571 ± 0.041 | 3.545 ± 0.112 | 1.392 ± 0.052 | 3.393 ± 0.099 | 2.468 ± 0.095 | 2.165 ± 0.084 | 3.35 ± 0.087 | 0.826 ± 0.05 | 1.125 ± 0.055 | 0.0 ± 0.0 |
| Thr | 5.368 ± 0.143 | 0.518 ± 0.041 | 2.521 ± 0.114 | 2.709 ± 0.087 | 1.968 ± 0.081 | 5.281 ± 0.131 | 0.968 ± 0.055 | 2.418 ± 0.084 | 1.279 ± 0.062 | 5.729 ± 0.124 | 0.966 ± 0.048 | 1.045 ± 0.065 | 3.227 ± 0.087 | 1.192 ± 0.058 | 3.054 ± 0.099 | 2.406 ± 0.083 | 2.89 ± 0.111 | 5.529 ± 0.174 | 0.865 ± 0.045 | 1.479 ± 0.069 | 0.0 ± 0.0 |
| Val | 10.77 ± 0.193 | 0.759 ± 0.038 | 4.54 ± 0.108 | 6.382 ± 0.146 | 2.692 ± 0.079 | 8.234 ± 0.153 | 1.866 ± 0.074 | 3.475 ± 0.096 | 2.042 ± 0.081 | 9.13 ± 0.158 | 1.317 ± 0.066 | 1.734 ± 0.066 | 5.089 ± 0.112 | 2.343 ± 0.072 | 7.795 ± 0.155 | 3.781 ± 0.104 | 4.58 ± 0.155 | 8.621 ± 0.186 | 1.142 ± 0.061 | 1.883 ± 0.069 | 0.0 ± 0.0 |
| Trp | 1.669 ± 0.068 | 0.111 ± 0.016 | 0.934 ± 0.042 | 1.072 ± 0.047 | 0.446 ± 0.033 | 1.317 ± 0.062 | 0.34 ± 0.031 | 0.763 ± 0.038 | 0.402 ± 0.034 | 1.618 ± 0.071 | 0.289 ± 0.027 | 0.388 ± 0.029 | 0.809 ± 0.042 | 0.573 ± 0.038 | 1.267 ± 0.057 | 0.612 ± 0.035 | 0.84 ± 0.051 | 1.245 ± 0.053 | 0.275 ± 0.029 | 0.318 ± 0.031 | 0.0 ± 0.0 |
| Tyr | 2.62 ± 0.083 | 0.222 ± 0.029 | 1.276 ± 0.063 | 1.447 ± 0.069 | 0.86 ± 0.052 | 2.36 ± 0.07 | 0.585 ± 0.044 | 0.867 ± 0.047 | 0.511 ± 0.039 | 2.863 ± 0.087 | 0.311 ± 0.028 | 0.47 ± 0.043 | 1.486 ± 0.068 | 0.645 ± 0.047 | 2.175 ± 0.096 | 0.956 ± 0.052 | 1.508 ± 0.067 | 1.898 ± 0.076 | 0.417 ± 0.039 | 0.674 ± 0.052 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 1319 proteins (415242 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
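One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread across resamples is reported as the error. This is a sketch under assumptions, not the Proteome-pI implementation. The function names are hypothetical, and taking the standard deviation over 100 replicates as the "±" value is an assumption.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_dipeptide_error(sequences, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: proteins (not residues) are the resampling unit, and the
    error is the standard deviation across `n_boot` bootstrap replicates.
    """
    rng = random.Random(seed)
    # Pre-compute per-protein counts so each replicate only sums them.
    per_protein = [pair_counts(s) for s in sequences]
    totals = [sum(c.values()) for c in per_protein]
    indices = range(len(per_protein))
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(indices, k=len(per_protein))
        total = sum(totals[i] for i in sample)
        hits = sum(per_protein[i][dipeptide] for i in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy input, not real proteome data.
mean_aa, err_aa = bootstrap_dipeptide_error(["MAALAG", "AAGL", "GGGG"], "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.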

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski