Amino acid dipepetide frequency for Candidatus Gullanella endobia

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.061AlaAla: 5.061 ± 0.218
0.961AlaCys: 0.961 ± 0.092
3.111AlaAsp: 3.111 ± 0.146
3.879AlaGlu: 3.879 ± 0.162
2.403AlaPhe: 2.403 ± 0.117
4.413AlaGly: 4.413 ± 0.158
1.482AlaHis: 1.482 ± 0.108
6.456AlaIle: 6.456 ± 0.22
4.567AlaLys: 4.567 ± 0.172
8.052AlaLeu: 8.052 ± 0.323
1.936AlaMet: 1.936 ± 0.108
3.111AlaAsn: 3.111 ± 0.159
1.709AlaPro: 1.709 ± 0.131
2.25AlaGln: 2.25 ± 0.125
4.106AlaArg: 4.106 ± 0.193
3.872AlaSer: 3.872 ± 0.183
3.171AlaThr: 3.171 ± 0.137
4.366AlaVal: 4.366 ± 0.194
0.701AlaTrp: 0.701 ± 0.07
2.01AlaTyr: 2.01 ± 0.109
0.0AlaXaa: 0.0 ± 0.0
Cys
0.788CysAla: 0.788 ± 0.065
0.154CysCys: 0.154 ± 0.032
0.694CysAsp: 0.694 ± 0.071
0.547CysGlu: 0.547 ± 0.061
0.567CysPhe: 0.567 ± 0.063
0.941CysGly: 0.941 ± 0.07
0.374CysHis: 0.374 ± 0.053
1.041CysIle: 1.041 ± 0.096
0.688CysLys: 0.688 ± 0.068
1.202CysLeu: 1.202 ± 0.081
0.247CysMet: 0.247 ± 0.04
0.614CysAsn: 0.614 ± 0.065
0.401CysPro: 0.401 ± 0.053
0.527CysGln: 0.527 ± 0.053
0.728CysArg: 0.728 ± 0.067
0.895CysSer: 0.895 ± 0.084
0.621CysThr: 0.621 ± 0.067
0.621CysVal: 0.621 ± 0.064
0.194CysTrp: 0.194 ± 0.039
0.467CysTyr: 0.467 ± 0.062
0.0CysXaa: 0.0 ± 0.0
Asp
2.991AspAla: 2.991 ± 0.137
0.588AspCys: 0.588 ± 0.053
2.063AspAsp: 2.063 ± 0.126
2.877AspGlu: 2.877 ± 0.139
2.337AspPhe: 2.337 ± 0.132
2.984AspGly: 2.984 ± 0.151
1.095AspHis: 1.095 ± 0.085
5.027AspIle: 5.027 ± 0.175
3.071AspLys: 3.071 ± 0.165
4.687AspLeu: 4.687 ± 0.18
1.349AspMet: 1.349 ± 0.099
2.844AspAsn: 2.844 ± 0.136
1.883AspPro: 1.883 ± 0.123
1.569AspGln: 1.569 ± 0.11
2.771AspArg: 2.771 ± 0.155
2.817AspSer: 2.817 ± 0.146
2.53AspThr: 2.53 ± 0.137
2.978AspVal: 2.978 ± 0.143
0.601AspTrp: 0.601 ± 0.065
2.043AspTyr: 2.043 ± 0.114
0.0AspXaa: 0.0 ± 0.0
Glu
4.099GluAla: 4.099 ± 0.18
0.574GluCys: 0.574 ± 0.067
2.37GluAsp: 2.37 ± 0.125
3.492GluGlu: 3.492 ± 0.155
2.05GluPhe: 2.05 ± 0.123
3.118GluGly: 3.118 ± 0.135
1.275GluHis: 1.275 ± 0.086
5.568GluIle: 5.568 ± 0.191
4.513GluLys: 4.513 ± 0.187
5.868GluLeu: 5.868 ± 0.19
1.556GluMet: 1.556 ± 0.115
3.038GluAsn: 3.038 ± 0.123
1.489GluPro: 1.489 ± 0.099
2.517GluGln: 2.517 ± 0.136
3.492GluArg: 3.492 ± 0.174
2.744GluSer: 2.744 ± 0.124
2.777GluThr: 2.777 ± 0.134
3.652GluVal: 3.652 ± 0.197
0.574GluTrp: 0.574 ± 0.074
1.582GluTyr: 1.582 ± 0.097
0.0GluXaa: 0.0 ± 0.0
Phe
2.183PheAla: 2.183 ± 0.138
0.654PheCys: 0.654 ± 0.072
2.143PheAsp: 2.143 ± 0.131
1.763PheGlu: 1.763 ± 0.106
1.883PhePhe: 1.883 ± 0.137
2.724PheGly: 2.724 ± 0.143
0.968PheHis: 0.968 ± 0.08
3.959PheIle: 3.959 ± 0.176
1.876PheLys: 1.876 ± 0.107
3.512PheLeu: 3.512 ± 0.179
0.895PheMet: 0.895 ± 0.084
2.403PheAsn: 2.403 ± 0.12
1.449PhePro: 1.449 ± 0.095
1.315PheGln: 1.315 ± 0.091
2.116PheArg: 2.116 ± 0.123
3.866PheSer: 3.866 ± 0.174
2.13PheThr: 2.13 ± 0.113
1.976PheVal: 1.976 ± 0.12
0.354PheTrp: 0.354 ± 0.053
1.516PheTyr: 1.516 ± 0.118
0.0PheXaa: 0.0 ± 0.0
Gly
3.699GlyAla: 3.699 ± 0.154
0.968GlyCys: 0.968 ± 0.077
2.958GlyAsp: 2.958 ± 0.153
3.632GlyGlu: 3.632 ± 0.169
2.857GlyPhe: 2.857 ± 0.149
4.213GlyGly: 4.213 ± 0.199
1.562GlyHis: 1.562 ± 0.104
6.389GlyIle: 6.389 ± 0.217
4.847GlyLys: 4.847 ± 0.168
5.962GlyLeu: 5.962 ± 0.209
1.609GlyMet: 1.609 ± 0.124
2.784GlyAsn: 2.784 ± 0.137
1.756GlyPro: 1.756 ± 0.107
2.41GlyGln: 2.41 ± 0.138
3.652GlyArg: 3.652 ± 0.162
3.652GlySer: 3.652 ± 0.16
3.145GlyThr: 3.145 ± 0.128
4.133GlyVal: 4.133 ± 0.164
0.674GlyTrp: 0.674 ± 0.074
2.357GlyTyr: 2.357 ± 0.134
0.0GlyXaa: 0.0 ± 0.0
His
1.415HisAla: 1.415 ± 0.094
0.374HisCys: 0.374 ± 0.053
1.001HisAsp: 1.001 ± 0.077
0.961HisGlu: 0.961 ± 0.086
1.128HisPhe: 1.128 ± 0.103
1.642HisGly: 1.642 ± 0.113
0.808HisHis: 0.808 ± 0.082
2.217HisIle: 2.217 ± 0.13
1.215HisLys: 1.215 ± 0.093
2.136HisLeu: 2.136 ± 0.129
0.641HisMet: 0.641 ± 0.06
1.289HisAsn: 1.289 ± 0.092
1.068HisPro: 1.068 ± 0.092
1.202HisGln: 1.202 ± 0.095
1.182HisArg: 1.182 ± 0.08
1.322HisSer: 1.322 ± 0.091
1.195HisThr: 1.195 ± 0.094
1.235HisVal: 1.235 ± 0.084
0.3HisTrp: 0.3 ± 0.049
1.041HisTyr: 1.041 ± 0.096
0.0HisXaa: 0.0 ± 0.0
Ile
6.877IleAla: 6.877 ± 0.213
1.195IleCys: 1.195 ± 0.096
5.361IleAsp: 5.361 ± 0.171
5.495IleGlu: 5.495 ± 0.196
3.251IlePhe: 3.251 ± 0.176
6.376IleGly: 6.376 ± 0.239
1.956IleHis: 1.956 ± 0.121
7.918IleIle: 7.918 ± 0.249
5.762IleLys: 5.762 ± 0.236
8.512IleLeu: 8.512 ± 0.286
1.929IleMet: 1.929 ± 0.152
5.521IleAsn: 5.521 ± 0.192
3.485IlePro: 3.485 ± 0.157
2.938IleGln: 2.938 ± 0.148
4.573IleArg: 4.573 ± 0.158
6.723IleSer: 6.723 ± 0.196
5.581IleThr: 5.581 ± 0.185
5.127IleVal: 5.127 ± 0.158
0.754IleTrp: 0.754 ± 0.077
2.497IleTyr: 2.497 ± 0.145
0.0IleXaa: 0.0 ± 0.0
Lys
4.253LysAla: 4.253 ± 0.181
0.594LysCys: 0.594 ± 0.064
2.791LysAsp: 2.791 ± 0.155
3.719LysGlu: 3.719 ± 0.17
1.836LysPhe: 1.836 ± 0.107
3.432LysGly: 3.432 ± 0.163
1.322LysHis: 1.322 ± 0.096
6.089LysIle: 6.089 ± 0.214
5.081LysLys: 5.081 ± 0.226
6.342LysLeu: 6.342 ± 0.219
1.636LysMet: 1.636 ± 0.109
4.193LysAsn: 4.193 ± 0.189
2.444LysPro: 2.444 ± 0.142
2.57LysGln: 2.57 ± 0.132
3.819LysArg: 3.819 ± 0.162
3.585LysSer: 3.585 ± 0.146
3.745LysThr: 3.745 ± 0.155
3.926LysVal: 3.926 ± 0.186
0.474LysTrp: 0.474 ± 0.063
1.943LysTyr: 1.943 ± 0.103
0.0LysXaa: 0.0 ± 0.0
Leu
7.571LeuAla: 7.571 ± 0.226
1.289LeuCys: 1.289 ± 0.08
5.541LeuAsp: 5.541 ± 0.196
6.176LeuGlu: 6.176 ± 0.224
3.926LeuPhe: 3.926 ± 0.19
5.975LeuGly: 5.975 ± 0.209
2.644LeuHis: 2.644 ± 0.134
8.459LeuIle: 8.459 ± 0.281
6.389LeuLys: 6.389 ± 0.201
11.076LeuLeu: 11.076 ± 0.41
2.537LeuMet: 2.537 ± 0.134
5.321LeuAsn: 5.321 ± 0.195
5.007LeuPro: 5.007 ± 0.179
3.525LeuGln: 3.525 ± 0.174
5.922LeuArg: 5.922 ± 0.196
7.284LeuSer: 7.284 ± 0.231
5.815LeuThr: 5.815 ± 0.197
6.536LeuVal: 6.536 ± 0.224
1.028LeuTrp: 1.028 ± 0.101
2.944LeuTyr: 2.944 ± 0.149
0.0LeuXaa: 0.0 ± 0.0
Met
1.863MetAla: 1.863 ± 0.105
0.194MetCys: 0.194 ± 0.035
0.968MetAsp: 0.968 ± 0.077
1.329MetGlu: 1.329 ± 0.103
0.881MetPhe: 0.881 ± 0.072
1.462MetGly: 1.462 ± 0.09
0.527MetHis: 0.527 ± 0.053
2.11MetIle: 2.11 ± 0.128
1.662MetLys: 1.662 ± 0.098
2.951MetLeu: 2.951 ± 0.151
0.714MetMet: 0.714 ± 0.063
1.248MetAsn: 1.248 ± 0.099
1.015MetPro: 1.015 ± 0.091
1.001MetGln: 1.001 ± 0.091
1.642MetArg: 1.642 ± 0.114
1.529MetSer: 1.529 ± 0.112
1.442MetThr: 1.442 ± 0.089
1.756MetVal: 1.756 ± 0.122
0.167MetTrp: 0.167 ± 0.039
0.547MetTyr: 0.547 ± 0.056
0.0MetXaa: 0.0 ± 0.0
Asn
3.124AsnAla: 3.124 ± 0.151
0.554AsnCys: 0.554 ± 0.056
2.477AsnAsp: 2.477 ± 0.117
2.784AsnGlu: 2.784 ± 0.136
2.043AsnPhe: 2.043 ± 0.123
3.124AsnGly: 3.124 ± 0.172
1.202AsnHis: 1.202 ± 0.098
5.314AsnIle: 5.314 ± 0.206
3.625AsnLys: 3.625 ± 0.186
5.007AsnLeu: 5.007 ± 0.165
1.409AsnMet: 1.409 ± 0.09
2.964AsnAsn: 2.964 ± 0.192
2.156AsnPro: 2.156 ± 0.127
2.403AsnGln: 2.403 ± 0.14
3.018AsnArg: 3.018 ± 0.135
3.078AsnSer: 3.078 ± 0.174
2.991AsnThr: 2.991 ± 0.135
2.557AsnVal: 2.557 ± 0.14
0.734AsnTrp: 0.734 ± 0.057
1.843AsnTyr: 1.843 ± 0.122
0.0AsnXaa: 0.0 ± 0.0
Pro
2.143ProAla: 2.143 ± 0.123
0.327ProCys: 0.327 ± 0.048
1.916ProAsp: 1.916 ± 0.117
2.517ProGlu: 2.517 ± 0.122
1.589ProPhe: 1.589 ± 0.093
2.457ProGly: 2.457 ± 0.145
0.681ProHis: 0.681 ± 0.072
3.378ProIle: 3.378 ± 0.158
2.377ProLys: 2.377 ± 0.131
3.906ProLeu: 3.906 ± 0.17
1.001ProMet: 1.001 ± 0.085
1.669ProAsn: 1.669 ± 0.12
1.055ProPro: 1.055 ± 0.089
1.168ProGln: 1.168 ± 0.099
1.549ProArg: 1.549 ± 0.112
2.17ProSer: 2.17 ± 0.124
1.97ProThr: 1.97 ± 0.111
2.624ProVal: 2.624 ± 0.135
0.494ProTrp: 0.494 ± 0.066
1.415ProTyr: 1.415 ± 0.098
0.0ProXaa: 0.0 ± 0.0
Gln
3.084GlnAla: 3.084 ± 0.156
0.401GlnCys: 0.401 ± 0.054
1.502GlnAsp: 1.502 ± 0.099
2.023GlnGlu: 2.023 ± 0.126
1.596GlnPhe: 1.596 ± 0.1
2.21GlnGly: 2.21 ± 0.131
0.948GlnHis: 0.948 ± 0.088
3.538GlnIle: 3.538 ± 0.136
2.423GlnLys: 2.423 ± 0.14
4.553GlnLeu: 4.553 ± 0.198
0.908GlnMet: 0.908 ± 0.072
1.542GlnAsn: 1.542 ± 0.089
1.429GlnPro: 1.429 ± 0.09
1.722GlnGln: 1.722 ± 0.117
2.303GlnArg: 2.303 ± 0.142
1.816GlnSer: 1.816 ± 0.12
1.636GlnThr: 1.636 ± 0.106
2.744GlnVal: 2.744 ± 0.146
0.421GlnTrp: 0.421 ± 0.048
1.462GlnTyr: 1.462 ± 0.105
0.0GlnXaa: 0.0 ± 0.0
Arg
3.832ArgAla: 3.832 ± 0.166
0.668ArgCys: 0.668 ± 0.071
2.804ArgAsp: 2.804 ± 0.16
3.318ArgGlu: 3.318 ± 0.158
2.617ArgPhe: 2.617 ± 0.139
3.365ArgGly: 3.365 ± 0.145
1.228ArgHis: 1.228 ± 0.09
5.228ArgIle: 5.228 ± 0.193
3.285ArgLys: 3.285 ± 0.165
6.189ArgLeu: 6.189 ± 0.259
1.389ArgMet: 1.389 ± 0.101
2.877ArgAsn: 2.877 ± 0.151
1.716ArgPro: 1.716 ± 0.102
2.837ArgGln: 2.837 ± 0.155
3.225ArgArg: 3.225 ± 0.188
3.084ArgSer: 3.084 ± 0.135
2.55ArgThr: 2.55 ± 0.125
3.291ArgVal: 3.291 ± 0.156
0.681ArgTrp: 0.681 ± 0.063
2.243ArgTyr: 2.243 ± 0.12
0.0ArgXaa: 0.0 ± 0.0
Ser
4.279SerAla: 4.279 ± 0.171
0.714SerCys: 0.714 ± 0.067
3.412SerAsp: 3.412 ± 0.145
3.585SerGlu: 3.585 ± 0.158
2.444SerPhe: 2.444 ± 0.123
4.673SerGly: 4.673 ± 0.157
1.402SerHis: 1.402 ± 0.097
5.294SerIle: 5.294 ± 0.181
3.578SerLys: 3.578 ± 0.161
6.616SerLeu: 6.616 ± 0.254
1.489SerMet: 1.489 ± 0.103
3.392SerAsn: 3.392 ± 0.152
2.016SerPro: 2.016 ± 0.108
2.176SerGln: 2.176 ± 0.114
3.385SerArg: 3.385 ± 0.161
3.939SerSer: 3.939 ± 0.188
3.325SerThr: 3.325 ± 0.14
3.625SerVal: 3.625 ± 0.171
0.634SerTrp: 0.634 ± 0.065
2.257SerTyr: 2.257 ± 0.099
0.0SerXaa: 0.0 ± 0.0
Thr
3.672ThrAla: 3.672 ± 0.143
0.641ThrCys: 0.641 ± 0.071
2.744ThrAsp: 2.744 ± 0.142
2.804ThrGlu: 2.804 ± 0.122
2.01ThrPhe: 2.01 ± 0.117
3.772ThrGly: 3.772 ± 0.162
1.302ThrHis: 1.302 ± 0.093
4.413ThrIle: 4.413 ± 0.167
2.904ThrLys: 2.904 ± 0.159
6.683ThrLeu: 6.683 ± 0.223
1.035ThrMet: 1.035 ± 0.087
2.37ThrAsn: 2.37 ± 0.111
2.37ThrPro: 2.37 ± 0.136
1.816ThrGln: 1.816 ± 0.11
2.764ThrArg: 2.764 ± 0.144
3.211ThrSer: 3.211 ± 0.148
2.797ThrThr: 2.797 ± 0.165
3.392ThrVal: 3.392 ± 0.166
0.454ThrTrp: 0.454 ± 0.054
1.596ThrTyr: 1.596 ± 0.115
0.0ThrXaa: 0.0 ± 0.0
Val
4.166ValAla: 4.166 ± 0.182
0.674ValCys: 0.674 ± 0.07
3.218ValAsp: 3.218 ± 0.156
3.619ValGlu: 3.619 ± 0.178
2.156ValPhe: 2.156 ± 0.115
3.679ValGly: 3.679 ± 0.19
1.315ValHis: 1.315 ± 0.097
5.802ValIle: 5.802 ± 0.219
3.759ValLys: 3.759 ± 0.184
6.443ValLeu: 6.443 ± 0.208
1.776ValMet: 1.776 ± 0.116
3.044ValAsn: 3.044 ± 0.137
2.303ValPro: 2.303 ± 0.132
1.729ValGln: 1.729 ± 0.107
3.412ValArg: 3.412 ± 0.156
4.106ValSer: 4.106 ± 0.187
3.365ValThr: 3.365 ± 0.171
4.233ValVal: 4.233 ± 0.215
0.507ValTrp: 0.507 ± 0.069
1.582ValTyr: 1.582 ± 0.107
0.0ValXaa: 0.0 ± 0.0
Trp
0.454TrpAla: 0.454 ± 0.057
0.16TrpCys: 0.16 ± 0.035
0.407TrpAsp: 0.407 ± 0.053
0.381TrpGlu: 0.381 ± 0.045
0.527TrpPhe: 0.527 ± 0.056
0.474TrpGly: 0.474 ± 0.057
0.267TrpHis: 0.267 ± 0.04
0.961TrpIle: 0.961 ± 0.077
0.648TrpLys: 0.648 ± 0.066
1.482TrpLeu: 1.482 ± 0.126
0.294TrpMet: 0.294 ± 0.04
0.461TrpAsn: 0.461 ± 0.05
0.407TrpPro: 0.407 ± 0.058
0.688TrpGln: 0.688 ± 0.072
0.721TrpArg: 0.721 ± 0.073
0.514TrpSer: 0.514 ± 0.058
0.381TrpThr: 0.381 ± 0.048
0.527TrpVal: 0.527 ± 0.053
0.12TrpTrp: 0.12 ± 0.033
0.407TrpTyr: 0.407 ± 0.059
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.063TyrAla: 2.063 ± 0.115
0.628TyrCys: 0.628 ± 0.064
1.622TyrAsp: 1.622 ± 0.102
1.449TyrGlu: 1.449 ± 0.104
1.582TyrPhe: 1.582 ± 0.105
2.223TyrGly: 2.223 ± 0.117
1.015TyrHis: 1.015 ± 0.08
2.671TyrIle: 2.671 ± 0.141
1.529TyrLys: 1.529 ± 0.104
3.785TyrLeu: 3.785 ± 0.166
0.654TyrMet: 0.654 ± 0.059
1.722TyrAsn: 1.722 ± 0.109
1.248TyrPro: 1.248 ± 0.1
1.896TyrGln: 1.896 ± 0.121
2.05TyrArg: 2.05 ± 0.118
2.03TyrSer: 2.03 ± 0.13
1.582TyrThr: 1.582 ± 0.121
1.589TyrVal: 1.589 ± 0.103
0.414TyrTrp: 0.414 ± 0.055
1.295TyrTyr: 1.295 ± 0.113
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 460 proteins (149785 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski