Amino acid dipepetide frequency for Neorickettsia sennetsu (strain ATCC VR-367 / Miyayama) (Ehrlichia sennetsu)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.615AlaAla: 5.615 ± 0.191
1.195AlaCys: 1.195 ± 0.071
3.014AlaAsp: 3.014 ± 0.118
4.169AlaGlu: 4.169 ± 0.145
3.461AlaPhe: 3.461 ± 0.134
4.776AlaGly: 4.776 ± 0.17
1.331AlaHis: 1.331 ± 0.082
5.144AlaIle: 5.144 ± 0.16
4.293AlaLys: 4.293 ± 0.136
7.85AlaLeu: 7.85 ± 0.21
1.639AlaMet: 1.639 ± 0.079
2.566AlaAsn: 2.566 ± 0.118
1.95AlaPro: 1.95 ± 0.099
2.034AlaGln: 2.034 ± 0.102
3.653AlaArg: 3.653 ± 0.128
5.54AlaSer: 5.54 ± 0.173
3.241AlaThr: 3.241 ± 0.123
5.615AlaVal: 5.615 ± 0.179
0.392AlaTrp: 0.392 ± 0.037
1.926AlaTyr: 1.926 ± 0.091
0.0AlaXaa: 0.0 ± 0.0
Cys
1.171CysAla: 1.171 ± 0.081
0.388CysCys: 0.388 ± 0.039
0.903CysAsp: 0.903 ± 0.065
0.919CysGlu: 0.919 ± 0.061
0.955CysPhe: 0.955 ± 0.064
1.327CysGly: 1.327 ± 0.073
0.348CysHis: 0.348 ± 0.037
1.279CysIle: 1.279 ± 0.071
1.055CysLys: 1.055 ± 0.069
1.495CysLeu: 1.495 ± 0.101
0.42CysMet: 0.42 ± 0.038
0.667CysAsn: 0.667 ± 0.044
0.508CysPro: 0.508 ± 0.043
0.424CysGln: 0.424 ± 0.038
0.847CysArg: 0.847 ± 0.05
1.467CysSer: 1.467 ± 0.084
0.891CysThr: 0.891 ± 0.058
1.363CysVal: 1.363 ± 0.074
0.112CysTrp: 0.112 ± 0.022
0.639CysTyr: 0.639 ± 0.052
0.0CysXaa: 0.0 ± 0.0
Asp
3.517AspAla: 3.517 ± 0.146
0.795AspCys: 0.795 ± 0.068
2.19AspAsp: 2.19 ± 0.111
3.253AspGlu: 3.253 ± 0.132
2.702AspPhe: 2.702 ± 0.098
3.137AspGly: 3.137 ± 0.121
0.807AspHis: 0.807 ± 0.056
3.425AspIle: 3.425 ± 0.126
2.95AspLys: 2.95 ± 0.107
4.932AspLeu: 4.932 ± 0.16
0.943AspMet: 0.943 ± 0.067
1.815AspAsn: 1.815 ± 0.073
2.058AspPro: 2.058 ± 0.088
1.007AspGln: 1.007 ± 0.055
2.106AspArg: 2.106 ± 0.092
3.429AspSer: 3.429 ± 0.127
2.242AspThr: 2.242 ± 0.099
3.969AspVal: 3.969 ± 0.129
0.388AspTrp: 0.388 ± 0.04
1.603AspTyr: 1.603 ± 0.081
0.0AspXaa: 0.0 ± 0.0
Glu
4.305GluAla: 4.305 ± 0.162
0.867GluCys: 0.867 ± 0.06
2.75GluAsp: 2.75 ± 0.113
4.968GluGlu: 4.968 ± 0.175
2.518GluPhe: 2.518 ± 0.103
3.669GluGly: 3.669 ± 0.133
1.251GluHis: 1.251 ± 0.077
5.703GluIle: 5.703 ± 0.172
6.027GluLys: 6.027 ± 0.173
5.991GluLeu: 5.991 ± 0.166
1.603GluMet: 1.603 ± 0.078
3.241GluAsn: 3.241 ± 0.126
1.523GluPro: 1.523 ± 0.096
1.918GluGln: 1.918 ± 0.104
3.685GluArg: 3.685 ± 0.105
4.269GluSer: 4.269 ± 0.13
2.778GluThr: 2.778 ± 0.114
4.884GluVal: 4.884 ± 0.145
0.42GluTrp: 0.42 ± 0.04
2.19GluTyr: 2.19 ± 0.089
0.0GluXaa: 0.0 ± 0.0
Phe
3.529PheAla: 3.529 ± 0.126
1.095PheCys: 1.095 ± 0.071
2.182PheAsp: 2.182 ± 0.084
2.582PheGlu: 2.582 ± 0.095
3.345PhePhe: 3.345 ± 0.16
3.365PheGly: 3.365 ± 0.126
1.091PheHis: 1.091 ± 0.075
3.273PheIle: 3.273 ± 0.116
2.266PheLys: 2.266 ± 0.104
5.995PheLeu: 5.995 ± 0.208
1.119PheMet: 1.119 ± 0.063
1.779PheAsn: 1.779 ± 0.086
1.771PhePro: 1.771 ± 0.089
1.171PheGln: 1.171 ± 0.068
2.042PheArg: 2.042 ± 0.091
5.312PheSer: 5.312 ± 0.156
2.446PheThr: 2.446 ± 0.107
3.685PheVal: 3.685 ± 0.135
0.464PheTrp: 0.464 ± 0.046
1.675PheTyr: 1.675 ± 0.09
0.0PheXaa: 0.0 ± 0.0
Gly
4.644GlyAla: 4.644 ± 0.157
1.259GlyCys: 1.259 ± 0.081
3.05GlyAsp: 3.05 ± 0.121
4.125GlyGlu: 4.125 ± 0.128
3.301GlyPhe: 3.301 ± 0.123
4.832GlyGly: 4.832 ± 0.176
1.239GlyHis: 1.239 ± 0.066
5.683GlyIle: 5.683 ± 0.153
5.276GlyLys: 5.276 ± 0.171
6.087GlyLeu: 6.087 ± 0.172
1.847GlyMet: 1.847 ± 0.094
2.514GlyAsn: 2.514 ± 0.115
1.571GlyPro: 1.571 ± 0.088
1.647GlyGln: 1.647 ± 0.086
3.109GlyArg: 3.109 ± 0.116
4.672GlySer: 4.672 ± 0.149
3.329GlyThr: 3.329 ± 0.119
5.32GlyVal: 5.32 ± 0.154
0.612GlyTrp: 0.612 ± 0.058
2.206GlyTyr: 2.206 ± 0.077
0.0GlyXaa: 0.0 ± 0.0
His
1.391HisAla: 1.391 ± 0.081
0.404HisCys: 0.404 ± 0.041
1.003HisAsp: 1.003 ± 0.067
1.031HisGlu: 1.031 ± 0.064
0.951HisPhe: 0.951 ± 0.063
1.371HisGly: 1.371 ± 0.074
0.472HisHis: 0.472 ± 0.047
1.495HisIle: 1.495 ± 0.079
1.111HisLys: 1.111 ± 0.068
2.154HisLeu: 2.154 ± 0.091
0.524HisMet: 0.524 ± 0.042
0.843HisAsn: 0.843 ± 0.059
0.927HisPro: 0.927 ± 0.068
0.556HisGln: 0.556 ± 0.045
1.075HisArg: 1.075 ± 0.07
1.579HisSer: 1.579 ± 0.071
0.935HisThr: 0.935 ± 0.062
1.463HisVal: 1.463 ± 0.08
0.14HisTrp: 0.14 ± 0.023
0.823HisTyr: 0.823 ± 0.05
0.0HisXaa: 0.0 ± 0.0
Ile
5.883IleAla: 5.883 ± 0.18
1.211IleCys: 1.211 ± 0.083
3.637IleAsp: 3.637 ± 0.127
4.648IleGlu: 4.648 ± 0.157
3.685IlePhe: 3.685 ± 0.132
4.996IleGly: 4.996 ± 0.156
1.295IleHis: 1.295 ± 0.072
4.888IleIle: 4.888 ± 0.147
4.572IleLys: 4.572 ± 0.133
7.834IleLeu: 7.834 ± 0.201
1.419IleMet: 1.419 ± 0.071
2.87IleAsn: 2.87 ± 0.106
3.014IlePro: 3.014 ± 0.114
1.954IleGln: 1.954 ± 0.093
3.649IleArg: 3.649 ± 0.128
6.167IleSer: 6.167 ± 0.164
3.961IleThr: 3.961 ± 0.136
5.611IleVal: 5.611 ± 0.149
0.356IleTrp: 0.356 ± 0.043
1.942IleTyr: 1.942 ± 0.105
0.0IleXaa: 0.0 ± 0.0
Lys
4.261LysAla: 4.261 ± 0.153
0.919LysCys: 0.919 ± 0.065
3.03LysAsp: 3.03 ± 0.1
4.948LysGlu: 4.948 ± 0.157
2.774LysPhe: 2.774 ± 0.104
3.473LysGly: 3.473 ± 0.13
1.335LysHis: 1.335 ± 0.069
5.651LysIle: 5.651 ± 0.167
5.627LysLys: 5.627 ± 0.159
6.459LysLeu: 6.459 ± 0.171
1.723LysMet: 1.723 ± 0.085
3.597LysAsn: 3.597 ± 0.124
1.898LysPro: 1.898 ± 0.101
1.807LysGln: 1.807 ± 0.088
3.625LysArg: 3.625 ± 0.141
4.636LysSer: 4.636 ± 0.142
3.153LysThr: 3.153 ± 0.11
5.164LysVal: 5.164 ± 0.152
0.404LysTrp: 0.404 ± 0.038
2.082LysTyr: 2.082 ± 0.083
0.0LysXaa: 0.0 ± 0.0
Leu
7.322LeuAla: 7.322 ± 0.167
2.062LeuCys: 2.062 ± 0.104
4.94LeuAsp: 4.94 ± 0.148
6.962LeuGlu: 6.962 ± 0.193
5.388LeuPhe: 5.388 ± 0.19
6.663LeuGly: 6.663 ± 0.176
2.446LeuHis: 2.446 ± 0.094
6.571LeuIle: 6.571 ± 0.169
6.827LeuLys: 6.827 ± 0.181
11.866LeuLeu: 11.866 ± 0.283
2.122LeuMet: 2.122 ± 0.103
4.173LeuAsn: 4.173 ± 0.121
4.44LeuPro: 4.44 ± 0.124
3.022LeuGln: 3.022 ± 0.113
5.416LeuArg: 5.416 ± 0.143
9.289LeuSer: 9.289 ± 0.204
4.576LeuThr: 4.576 ± 0.147
7.126LeuVal: 7.126 ± 0.188
0.815LeuTrp: 0.815 ± 0.064
3.157LeuTyr: 3.157 ± 0.106
0.0LeuXaa: 0.0 ± 0.0
Met
1.375MetAla: 1.375 ± 0.072
0.384MetCys: 0.384 ± 0.038
1.119MetAsp: 1.119 ± 0.068
1.343MetGlu: 1.343 ± 0.07
1.067MetPhe: 1.067 ± 0.068
1.535MetGly: 1.535 ± 0.071
0.56MetHis: 0.56 ± 0.052
1.655MetIle: 1.655 ± 0.08
1.543MetLys: 1.543 ± 0.086
2.798MetLeu: 2.798 ± 0.136
0.54MetMet: 0.54 ± 0.045
1.047MetAsn: 1.047 ± 0.07
0.875MetPro: 0.875 ± 0.07
0.975MetGln: 0.975 ± 0.061
1.543MetArg: 1.543 ± 0.084
1.866MetSer: 1.866 ± 0.086
0.879MetThr: 0.879 ± 0.061
1.423MetVal: 1.423 ± 0.075
0.136MetTrp: 0.136 ± 0.024
0.631MetTyr: 0.631 ± 0.054
0.0MetXaa: 0.0 ± 0.0
Asn
2.926AsnAla: 2.926 ± 0.107
0.683AsnCys: 0.683 ± 0.049
1.743AsnAsp: 1.743 ± 0.081
2.51AsnGlu: 2.51 ± 0.094
2.242AsnPhe: 2.242 ± 0.094
2.742AsnGly: 2.742 ± 0.109
0.719AsnHis: 0.719 ± 0.054
3.117AsnIle: 3.117 ± 0.108
2.874AsnLys: 2.874 ± 0.12
4.229AsnLeu: 4.229 ± 0.141
0.991AsnMet: 0.991 ± 0.06
1.811AsnAsn: 1.811 ± 0.088
2.014AsnPro: 2.014 ± 0.093
0.979AsnGln: 0.979 ± 0.061
2.018AsnArg: 2.018 ± 0.107
3.094AsnSer: 3.094 ± 0.131
2.138AsnThr: 2.138 ± 0.098
3.014AsnVal: 3.014 ± 0.11
0.312AsnTrp: 0.312 ± 0.038
1.339AsnTyr: 1.339 ± 0.08
0.0AsnXaa: 0.0 ± 0.0
Pro
2.018ProAla: 2.018 ± 0.102
0.544ProCys: 0.544 ± 0.048
1.791ProAsp: 1.791 ± 0.083
2.87ProGlu: 2.87 ± 0.109
1.739ProPhe: 1.739 ± 0.076
2.434ProGly: 2.434 ± 0.126
0.743ProHis: 0.743 ± 0.061
2.414ProIle: 2.414 ± 0.112
2.002ProLys: 2.002 ± 0.089
3.405ProLeu: 3.405 ± 0.14
0.671ProMet: 0.671 ± 0.052
1.427ProAsn: 1.427 ± 0.083
1.023ProPro: 1.023 ± 0.068
1.151ProGln: 1.151 ± 0.072
1.431ProArg: 1.431 ± 0.074
2.75ProSer: 2.75 ± 0.112
1.827ProThr: 1.827 ± 0.082
2.878ProVal: 2.878 ± 0.117
0.272ProTrp: 0.272 ± 0.037
1.179ProTyr: 1.179 ± 0.064
0.0ProXaa: 0.0 ± 0.0
Gln
1.763GlnAla: 1.763 ± 0.097
0.42GlnCys: 0.42 ± 0.045
1.299GlnAsp: 1.299 ± 0.082
1.998GlnGlu: 1.998 ± 0.087
1.267GlnPhe: 1.267 ± 0.075
1.603GlnGly: 1.603 ± 0.086
0.623GlnHis: 0.623 ± 0.047
2.026GlnIle: 2.026 ± 0.1
2.058GlnLys: 2.058 ± 0.102
2.762GlnLeu: 2.762 ± 0.108
0.743GlnMet: 0.743 ± 0.059
1.403GlnAsn: 1.403 ± 0.081
0.883GlnPro: 0.883 ± 0.064
0.975GlnGln: 0.975 ± 0.075
1.439GlnArg: 1.439 ± 0.078
2.03GlnSer: 2.03 ± 0.105
1.243GlnThr: 1.243 ± 0.079
2.078GlnVal: 2.078 ± 0.094
0.16GlnTrp: 0.16 ± 0.028
0.871GlnTyr: 0.871 ± 0.065
0.0GlnXaa: 0.0 ± 0.0
Arg
3.225ArgAla: 3.225 ± 0.114
0.843ArgCys: 0.843 ± 0.06
2.59ArgAsp: 2.59 ± 0.094
3.641ArgGlu: 3.641 ± 0.114
2.554ArgPhe: 2.554 ± 0.108
3.09ArgGly: 3.09 ± 0.113
0.891ArgHis: 0.891 ± 0.058
3.861ArgIle: 3.861 ± 0.116
3.689ArgLys: 3.689 ± 0.134
4.732ArgLeu: 4.732 ± 0.137
1.419ArgMet: 1.419 ± 0.07
2.338ArgAsn: 2.338 ± 0.094
1.391ArgPro: 1.391 ± 0.07
1.271ArgGln: 1.271 ± 0.077
2.91ArgArg: 2.91 ± 0.125
3.809ArgSer: 3.809 ± 0.13
2.11ArgThr: 2.11 ± 0.092
3.861ArgVal: 3.861 ± 0.136
0.38ArgTrp: 0.38 ± 0.035
1.691ArgTyr: 1.691 ± 0.078
0.0ArgXaa: 0.0 ± 0.0
Ser
5.296SerAla: 5.296 ± 0.161
1.383SerCys: 1.383 ± 0.087
4.025SerAsp: 4.025 ± 0.15
4.564SerGlu: 4.564 ± 0.142
4.273SerPhe: 4.273 ± 0.159
6.411SerGly: 6.411 ± 0.166
1.591SerHis: 1.591 ± 0.093
5.919SerIle: 5.919 ± 0.179
4.636SerLys: 4.636 ± 0.133
8.313SerLeu: 8.313 ± 0.184
1.795SerMet: 1.795 ± 0.095
2.926SerAsn: 2.926 ± 0.105
2.682SerPro: 2.682 ± 0.088
2.086SerGln: 2.086 ± 0.086
3.645SerArg: 3.645 ± 0.124
6.839SerSer: 6.839 ± 0.205
4.237SerThr: 4.237 ± 0.151
6.075SerVal: 6.075 ± 0.152
0.592SerTrp: 0.592 ± 0.041
2.414SerTyr: 2.414 ± 0.101
0.0SerXaa: 0.0 ± 0.0
Thr
3.269ThrAla: 3.269 ± 0.127
0.715ThrCys: 0.715 ± 0.06
2.346ThrAsp: 2.346 ± 0.093
3.098ThrGlu: 3.098 ± 0.116
2.386ThrPhe: 2.386 ± 0.108
3.437ThrGly: 3.437 ± 0.129
1.143ThrHis: 1.143 ± 0.061
3.157ThrIle: 3.157 ± 0.124
2.874ThrLys: 2.874 ± 0.116
5.196ThrLeu: 5.196 ± 0.165
1.043ThrMet: 1.043 ± 0.062
1.902ThrAsn: 1.902 ± 0.097
2.046ThrPro: 2.046 ± 0.093
1.563ThrGln: 1.563 ± 0.083
2.158ThrArg: 2.158 ± 0.092
3.761ThrSer: 3.761 ± 0.125
2.462ThrThr: 2.462 ± 0.115
3.593ThrVal: 3.593 ± 0.121
0.34ThrTrp: 0.34 ± 0.036
1.351ThrTyr: 1.351 ± 0.078
0.0ThrXaa: 0.0 ± 0.0
Val
5.496ValAla: 5.496 ± 0.152
1.379ValCys: 1.379 ± 0.07
3.745ValAsp: 3.745 ± 0.138
4.76ValGlu: 4.76 ± 0.174
3.669ValPhe: 3.669 ± 0.147
5.068ValGly: 5.068 ± 0.155
1.435ValHis: 1.435 ± 0.07
5.56ValIle: 5.56 ± 0.147
4.528ValLys: 4.528 ± 0.123
8.697ValLeu: 8.697 ± 0.229
1.799ValMet: 1.799 ± 0.088
2.806ValAsn: 2.806 ± 0.108
2.698ValPro: 2.698 ± 0.115
1.978ValGln: 1.978 ± 0.09
3.949ValArg: 3.949 ± 0.139
5.971ValSer: 5.971 ± 0.173
3.669ValThr: 3.669 ± 0.124
6.655ValVal: 6.655 ± 0.21
0.392ValTrp: 0.392 ± 0.042
2.162ValTyr: 2.162 ± 0.09
0.0ValXaa: 0.0 ± 0.0
Trp
0.412TrpAla: 0.412 ± 0.044
0.116TrpCys: 0.116 ± 0.026
0.28TrpAsp: 0.28 ± 0.033
0.392TrpGlu: 0.392 ± 0.031
0.324TrpPhe: 0.324 ± 0.042
0.436TrpGly: 0.436 ± 0.041
0.212TrpHis: 0.212 ± 0.026
0.572TrpIle: 0.572 ± 0.048
0.488TrpLys: 0.488 ± 0.047
0.971TrpLeu: 0.971 ± 0.065
0.144TrpMet: 0.144 ± 0.02
0.332TrpAsn: 0.332 ± 0.044
0.232TrpPro: 0.232 ± 0.033
0.268TrpGln: 0.268 ± 0.027
0.368TrpArg: 0.368 ± 0.035
0.424TrpSer: 0.424 ± 0.047
0.252TrpThr: 0.252 ± 0.032
0.432TrpVal: 0.432 ± 0.045
0.096TrpTrp: 0.096 ± 0.019
0.264TrpTyr: 0.264 ± 0.033
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.986TyrAla: 1.986 ± 0.085
0.516TyrCys: 0.516 ± 0.045
1.635TyrAsp: 1.635 ± 0.078
1.783TyrGlu: 1.783 ± 0.088
1.587TyrPhe: 1.587 ± 0.085
1.958TyrGly: 1.958 ± 0.087
0.715TyrHis: 0.715 ± 0.059
2.198TyrIle: 2.198 ± 0.084
1.87TyrLys: 1.87 ± 0.093
3.417TyrLeu: 3.417 ± 0.136
0.787TyrMet: 0.787 ± 0.063
1.439TyrAsn: 1.439 ± 0.079
1.063TyrPro: 1.063 ± 0.07
0.859TyrGln: 0.859 ± 0.058
1.627TyrArg: 1.627 ± 0.09
2.798TyrSer: 2.798 ± 0.117
1.471TyrThr: 1.471 ± 0.068
2.206TyrVal: 2.206 ± 0.092
0.232TyrTrp: 0.232 ± 0.031
1.075TyrTyr: 1.075 ± 0.074
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 932 proteins (250202 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski