Amino acid dipepetide frequency for Candidatus Phytoplasma phoenicium

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.152AlaAla: 2.152 ± 0.189
0.369AlaCys: 0.369 ± 0.058
1.603AlaAsp: 1.603 ± 0.127
1.951AlaGlu: 1.951 ± 0.16
2.342AlaPhe: 2.342 ± 0.186
1.951AlaGly: 1.951 ± 0.166
0.992AlaHis: 0.992 ± 0.093
4.399AlaIle: 4.399 ± 0.246
3.798AlaLys: 3.798 ± 0.207
5.0AlaLeu: 5.0 ± 0.225
0.802AlaMet: 0.802 ± 0.096
1.973AlaAsn: 1.973 ± 0.133
1.087AlaPro: 1.087 ± 0.111
2.395AlaGln: 2.395 ± 0.145
1.466AlaArg: 1.466 ± 0.12
2.838AlaSer: 2.838 ± 0.207
2.194AlaThr: 2.194 ± 0.132
2.078AlaVal: 2.078 ± 0.178
0.19AlaTrp: 0.19 ± 0.054
1.635AlaTyr: 1.635 ± 0.133
0.0AlaXaa: 0.0 ± 0.0
Cys
0.338CysAla: 0.338 ± 0.06
0.19CysCys: 0.19 ± 0.044
0.411CysAsp: 0.411 ± 0.08
0.58CysGlu: 0.58 ± 0.075
1.002CysPhe: 1.002 ± 0.128
0.475CysGly: 0.475 ± 0.079
0.327CysHis: 0.327 ± 0.055
0.686CysIle: 0.686 ± 0.094
0.443CysLys: 0.443 ± 0.075
1.297CysLeu: 1.297 ± 0.12
0.148CysMet: 0.148 ± 0.036
0.496CysAsn: 0.496 ± 0.068
0.411CysPro: 0.411 ± 0.057
0.57CysGln: 0.57 ± 0.073
0.327CysArg: 0.327 ± 0.063
0.612CysSer: 0.612 ± 0.074
0.411CysThr: 0.411 ± 0.07
0.485CysVal: 0.485 ± 0.064
0.105CysTrp: 0.105 ± 0.039
0.443CysTyr: 0.443 ± 0.074
0.0CysXaa: 0.0 ± 0.0
Asp
2.162AspAla: 2.162 ± 0.176
0.443AspCys: 0.443 ± 0.064
1.92AspAsp: 1.92 ± 0.143
2.257AspGlu: 2.257 ± 0.161
3.049AspPhe: 3.049 ± 0.199
1.614AspGly: 1.614 ± 0.138
0.738AspHis: 0.738 ± 0.1
5.338AspIle: 5.338 ± 0.247
3.903AspLys: 3.903 ± 0.22
5.39AspLeu: 5.39 ± 0.204
0.876AspMet: 0.876 ± 0.105
2.574AspAsn: 2.574 ± 0.173
1.319AspPro: 1.319 ± 0.125
1.508AspGln: 1.508 ± 0.117
0.949AspArg: 0.949 ± 0.118
2.12AspSer: 2.12 ± 0.174
1.772AspThr: 1.772 ± 0.138
2.648AspVal: 2.648 ± 0.176
0.274AspTrp: 0.274 ± 0.055
1.751AspTyr: 1.751 ± 0.153
0.0AspXaa: 0.0 ± 0.0
Glu
2.658GluAla: 2.658 ± 0.175
0.295GluCys: 0.295 ± 0.05
2.373GluAsp: 2.373 ± 0.162
4.557GluGlu: 4.557 ± 0.316
2.426GluPhe: 2.426 ± 0.164
2.595GluGly: 2.595 ± 0.162
1.234GluHis: 1.234 ± 0.117
7.184GluIle: 7.184 ± 0.263
7.701GluLys: 7.701 ± 0.358
5.211GluLeu: 5.211 ± 0.238
1.403GluMet: 1.403 ± 0.114
4.662GluAsn: 4.662 ± 0.228
1.34GluPro: 1.34 ± 0.122
3.439GluGln: 3.439 ± 0.193
1.646GluArg: 1.646 ± 0.139
2.215GluSer: 2.215 ± 0.155
3.165GluThr: 3.165 ± 0.186
3.112GluVal: 3.112 ± 0.184
0.327GluTrp: 0.327 ± 0.058
2.068GluTyr: 2.068 ± 0.153
0.0GluXaa: 0.0 ± 0.0
Phe
1.909PheAla: 1.909 ± 0.127
0.949PheCys: 0.949 ± 0.107
3.08PheAsp: 3.08 ± 0.166
2.785PheGlu: 2.785 ± 0.174
4.504PhePhe: 4.504 ± 0.311
2.194PheGly: 2.194 ± 0.196
1.329PheHis: 1.329 ± 0.123
5.612PheIle: 5.612 ± 0.312
4.409PheLys: 4.409 ± 0.227
8.059PheLeu: 8.059 ± 0.373
0.981PheMet: 0.981 ± 0.104
3.428PheAsn: 3.428 ± 0.181
1.92PhePro: 1.92 ± 0.166
3.449PheGln: 3.449 ± 0.208
1.551PheArg: 1.551 ± 0.137
3.871PheSer: 3.871 ± 0.214
2.437PheThr: 2.437 ± 0.166
2.89PheVal: 2.89 ± 0.203
0.464PheTrp: 0.464 ± 0.06
3.133PheTyr: 3.133 ± 0.195
0.0PheXaa: 0.0 ± 0.0
Gly
2.458GlyAla: 2.458 ± 0.169
0.411GlyCys: 0.411 ± 0.065
2.11GlyAsp: 2.11 ± 0.139
2.479GlyGlu: 2.479 ± 0.179
2.764GlyPhe: 2.764 ± 0.162
2.479GlyGly: 2.479 ± 0.208
1.297GlyHis: 1.297 ± 0.129
4.789GlyIle: 4.789 ± 0.216
4.241GlyLys: 4.241 ± 0.218
4.346GlyLeu: 4.346 ± 0.192
0.949GlyMet: 0.949 ± 0.099
2.11GlyAsn: 2.11 ± 0.142
1.16GlyPro: 1.16 ± 0.106
1.762GlyGln: 1.762 ± 0.147
1.551GlyArg: 1.551 ± 0.153
2.331GlySer: 2.331 ± 0.162
2.226GlyThr: 2.226 ± 0.161
2.996GlyVal: 2.996 ± 0.197
0.285GlyTrp: 0.285 ± 0.05
1.835GlyTyr: 1.835 ± 0.168
0.0GlyXaa: 0.0 ± 0.0
His
0.791HisAla: 0.791 ± 0.099
0.169HisCys: 0.169 ± 0.038
1.034HisAsp: 1.034 ± 0.093
0.939HisGlu: 0.939 ± 0.079
1.414HisPhe: 1.414 ± 0.116
0.939HisGly: 0.939 ± 0.092
0.854HisHis: 0.854 ± 0.111
1.941HisIle: 1.941 ± 0.112
1.941HisLys: 1.941 ± 0.151
2.901HisLeu: 2.901 ± 0.17
0.264HisMet: 0.264 ± 0.048
1.382HisAsn: 1.382 ± 0.114
0.928HisPro: 0.928 ± 0.123
1.719HisGln: 1.719 ± 0.143
0.791HisArg: 0.791 ± 0.09
1.297HisSer: 1.297 ± 0.133
0.886HisThr: 0.886 ± 0.097
0.918HisVal: 0.918 ± 0.103
0.158HisTrp: 0.158 ± 0.031
0.96HisTyr: 0.96 ± 0.107
0.0HisXaa: 0.0 ± 0.0
Ile
4.895IleAla: 4.895 ± 0.214
1.118IleCys: 1.118 ± 0.095
5.496IleAsp: 5.496 ± 0.243
6.087IleGlu: 6.087 ± 0.303
5.844IlePhe: 5.844 ± 0.31
4.715IleGly: 4.715 ± 0.24
1.962IleHis: 1.962 ± 0.139
11.371IleIle: 11.371 ± 0.438
10.57IleLys: 10.57 ± 0.361
12.131IleLeu: 12.131 ± 0.357
2.046IleMet: 2.046 ± 0.156
7.099IleAsn: 7.099 ± 0.299
3.819IlePro: 3.819 ± 0.19
4.599IleGln: 4.599 ± 0.212
2.954IleArg: 2.954 ± 0.205
6.994IleSer: 6.994 ± 0.284
5.2IleThr: 5.2 ± 0.204
5.464IleVal: 5.464 ± 0.226
0.549IleTrp: 0.549 ± 0.073
3.365IleTyr: 3.365 ± 0.186
0.0IleXaa: 0.0 ± 0.0
Lys
3.755LysAla: 3.755 ± 0.221
0.633LysCys: 0.633 ± 0.085
4.504LysAsp: 4.504 ± 0.252
8.154LysGlu: 8.154 ± 0.302
3.766LysPhe: 3.766 ± 0.215
4.452LysGly: 4.452 ± 0.277
1.814LysHis: 1.814 ± 0.127
13.471LysIle: 13.471 ± 0.434
14.125LysLys: 14.125 ± 0.497
8.112LysLeu: 8.112 ± 0.279
2.511LysMet: 2.511 ± 0.153
9.62LysAsn: 9.62 ± 0.302
2.553LysPro: 2.553 ± 0.155
5.306LysGln: 5.306 ± 0.24
2.785LysArg: 2.785 ± 0.171
4.23LysSer: 4.23 ± 0.229
5.96LysThr: 5.96 ± 0.244
4.283LysVal: 4.283 ± 0.226
0.58LysTrp: 0.58 ± 0.08
4.198LysTyr: 4.198 ± 0.188
0.0LysXaa: 0.0 ± 0.0
Leu
4.768LeuAla: 4.768 ± 0.181
1.371LeuCys: 1.371 ± 0.114
4.409LeuAsp: 4.409 ± 0.21
6.994LeuGlu: 6.994 ± 0.25
5.939LeuPhe: 5.939 ± 0.362
5.148LeuGly: 5.148 ± 0.244
2.025LeuHis: 2.025 ± 0.158
10.137LeuIle: 10.137 ± 0.333
12.648LeuLys: 12.648 ± 0.335
10.749LeuLeu: 10.749 ± 0.488
2.173LeuMet: 2.173 ± 0.138
7.627LeuAsn: 7.627 ± 0.288
3.207LeuPro: 3.207 ± 0.199
5.127LeuGln: 5.127 ± 0.258
3.07LeuArg: 3.07 ± 0.193
7.363LeuSer: 7.363 ± 0.317
6.15LeuThr: 6.15 ± 0.248
5.411LeuVal: 5.411 ± 0.26
0.654LeuTrp: 0.654 ± 0.091
3.555LeuTyr: 3.555 ± 0.184
0.0LeuXaa: 0.0 ± 0.0
Met
0.802MetAla: 0.802 ± 0.093
0.158MetCys: 0.158 ± 0.04
0.833MetAsp: 0.833 ± 0.097
1.276MetGlu: 1.276 ± 0.109
1.108MetPhe: 1.108 ± 0.117
0.96MetGly: 0.96 ± 0.091
0.401MetHis: 0.401 ± 0.064
2.352MetIle: 2.352 ± 0.143
2.205MetLys: 2.205 ± 0.15
1.92MetLeu: 1.92 ± 0.183
0.443MetMet: 0.443 ± 0.059
1.635MetAsn: 1.635 ± 0.111
0.665MetPro: 0.665 ± 0.093
0.939MetGln: 0.939 ± 0.083
0.443MetArg: 0.443 ± 0.068
1.108MetSer: 1.108 ± 0.114
1.245MetThr: 1.245 ± 0.115
0.928MetVal: 0.928 ± 0.105
0.116MetTrp: 0.116 ± 0.035
0.454MetTyr: 0.454 ± 0.073
0.0MetXaa: 0.0 ± 0.0
Asn
2.606AsnAla: 2.606 ± 0.165
0.538AsnCys: 0.538 ± 0.095
2.732AsnAsp: 2.732 ± 0.196
3.428AsnGlu: 3.428 ± 0.184
4.631AsnPhe: 4.631 ± 0.235
2.226AsnGly: 2.226 ± 0.184
1.857AsnHis: 1.857 ± 0.149
7.943AsnIle: 7.943 ± 0.266
7.163AsnLys: 7.163 ± 0.287
7.795AsnLeu: 7.795 ± 0.346
1.319AsnMet: 1.319 ± 0.125
5.559AsnAsn: 5.559 ± 0.256
2.426AsnPro: 2.426 ± 0.155
4.23AsnGln: 4.23 ± 0.223
1.508AsnArg: 1.508 ± 0.131
3.07AsnSer: 3.07 ± 0.184
3.186AsnThr: 3.186 ± 0.155
2.827AsnVal: 2.827 ± 0.163
0.39AsnTrp: 0.39 ± 0.068
3.175AsnTyr: 3.175 ± 0.186
0.0AsnXaa: 0.0 ± 0.0
Pro
0.802ProAla: 0.802 ± 0.096
0.264ProCys: 0.264 ± 0.061
0.928ProAsp: 0.928 ± 0.089
1.825ProGlu: 1.825 ± 0.147
2.205ProPhe: 2.205 ± 0.165
1.414ProGly: 1.414 ± 0.129
0.918ProHis: 0.918 ± 0.114
2.69ProIle: 2.69 ± 0.203
2.922ProLys: 2.922 ± 0.185
3.26ProLeu: 3.26 ± 0.164
0.464ProMet: 0.464 ± 0.072
1.793ProAsn: 1.793 ± 0.148
0.781ProPro: 0.781 ± 0.098
2.068ProGln: 2.068 ± 0.129
0.749ProArg: 0.749 ± 0.084
2.004ProSer: 2.004 ± 0.162
1.582ProThr: 1.582 ± 0.142
1.382ProVal: 1.382 ± 0.138
0.222ProTrp: 0.222 ± 0.047
1.635ProTyr: 1.635 ± 0.148
0.0ProXaa: 0.0 ± 0.0
Gln
2.015GlnAla: 2.015 ± 0.148
0.348GlnCys: 0.348 ± 0.063
1.667GlnAsp: 1.667 ± 0.125
3.713GlnGlu: 3.713 ± 0.209
2.184GlnPhe: 2.184 ± 0.159
1.93GlnGly: 1.93 ± 0.14
0.939GlnHis: 0.939 ± 0.101
6.466GlnIle: 6.466 ± 0.262
8.302GlnLys: 8.302 ± 0.315
4.61GlnLeu: 4.61 ± 0.244
1.308GlnMet: 1.308 ± 0.11
4.135GlnAsn: 4.135 ± 0.216
1.16GlnPro: 1.16 ± 0.115
3.85GlnGln: 3.85 ± 0.256
1.751GlnArg: 1.751 ± 0.146
2.141GlnSer: 2.141 ± 0.169
3.154GlnThr: 3.154 ± 0.191
1.888GlnVal: 1.888 ± 0.151
0.454GlnTrp: 0.454 ± 0.081
1.846GlnTyr: 1.846 ± 0.144
0.0GlnXaa: 0.0 ± 0.0
Arg
1.118ArgAla: 1.118 ± 0.125
0.232ArgCys: 0.232 ± 0.047
1.234ArgAsp: 1.234 ± 0.133
1.54ArgGlu: 1.54 ± 0.142
1.814ArgPhe: 1.814 ± 0.148
1.445ArgGly: 1.445 ± 0.133
0.633ArgHis: 0.633 ± 0.081
3.122ArgIle: 3.122 ± 0.176
2.985ArgLys: 2.985 ± 0.173
3.101ArgLeu: 3.101 ± 0.194
0.76ArgMet: 0.76 ± 0.083
1.994ArgAsn: 1.994 ± 0.149
0.802ArgPro: 0.802 ± 0.094
1.677ArgGln: 1.677 ± 0.166
1.16ArgArg: 1.16 ± 0.121
1.382ArgSer: 1.382 ± 0.128
1.53ArgThr: 1.53 ± 0.122
1.245ArgVal: 1.245 ± 0.11
0.179ArgTrp: 0.179 ± 0.049
1.213ArgTyr: 1.213 ± 0.122
0.0ArgXaa: 0.0 ± 0.0
Ser
2.036SerAla: 2.036 ± 0.131
0.876SerCys: 0.876 ± 0.1
1.941SerAsp: 1.941 ± 0.157
2.996SerGlu: 2.996 ± 0.182
4.452SerPhe: 4.452 ± 0.265
2.89SerGly: 2.89 ± 0.147
1.297SerHis: 1.297 ± 0.122
4.916SerIle: 4.916 ± 0.238
4.747SerLys: 4.747 ± 0.239
7.226SerLeu: 7.226 ± 0.344
1.002SerMet: 1.002 ± 0.109
3.196SerAsn: 3.196 ± 0.193
1.572SerPro: 1.572 ± 0.131
3.112SerGln: 3.112 ± 0.197
1.867SerArg: 1.867 ± 0.145
3.798SerSer: 3.798 ± 0.206
2.416SerThr: 2.416 ± 0.142
2.553SerVal: 2.553 ± 0.148
0.327SerTrp: 0.327 ± 0.062
2.352SerTyr: 2.352 ± 0.157
0.0SerXaa: 0.0 ± 0.0
Thr
1.846ThrAla: 1.846 ± 0.139
0.38ThrCys: 0.38 ± 0.063
1.92ThrAsp: 1.92 ± 0.136
2.395ThrGlu: 2.395 ± 0.199
2.732ThrPhe: 2.732 ± 0.191
2.479ThrGly: 2.479 ± 0.184
1.382ThrHis: 1.382 ± 0.138
4.905ThrIle: 4.905 ± 0.222
5.485ThrLys: 5.485 ± 0.211
5.559ThrLeu: 5.559 ± 0.257
0.823ThrMet: 0.823 ± 0.09
3.449ThrAsn: 3.449 ± 0.193
2.015ThrPro: 2.015 ± 0.136
3.027ThrGln: 3.027 ± 0.155
1.551ThrArg: 1.551 ± 0.125
2.975ThrSer: 2.975 ± 0.206
3.122ThrThr: 3.122 ± 0.191
2.226ThrVal: 2.226 ± 0.152
0.306ThrTrp: 0.306 ± 0.052
1.941ThrTyr: 1.941 ± 0.146
0.0ThrXaa: 0.0 ± 0.0
Val
2.3ValAla: 2.3 ± 0.188
0.496ValCys: 0.496 ± 0.066
2.321ValAsp: 2.321 ± 0.19
3.017ValGlu: 3.017 ± 0.197
3.502ValPhe: 3.502 ± 0.158
2.574ValGly: 2.574 ± 0.193
0.876ValHis: 0.876 ± 0.093
4.747ValIle: 4.747 ± 0.233
3.829ValLys: 3.829 ± 0.217
5.833ValLeu: 5.833 ± 0.237
0.907ValMet: 0.907 ± 0.091
2.964ValAsn: 2.964 ± 0.176
1.498ValPro: 1.498 ± 0.112
1.941ValGln: 1.941 ± 0.143
1.424ValArg: 1.424 ± 0.141
2.933ValSer: 2.933 ± 0.186
2.12ValThr: 2.12 ± 0.153
2.859ValVal: 2.859 ± 0.193
0.211ValTrp: 0.211 ± 0.046
1.867ValTyr: 1.867 ± 0.145
0.0ValXaa: 0.0 ± 0.0
Trp
0.222TrpAla: 0.222 ± 0.052
0.084TrpCys: 0.084 ± 0.028
0.243TrpAsp: 0.243 ± 0.056
0.306TrpGlu: 0.306 ± 0.056
0.549TrpPhe: 0.549 ± 0.091
0.295TrpGly: 0.295 ± 0.05
0.137TrpHis: 0.137 ± 0.043
0.496TrpIle: 0.496 ± 0.077
0.549TrpLys: 0.549 ± 0.072
0.812TrpLeu: 0.812 ± 0.105
0.148TrpMet: 0.148 ± 0.041
0.432TrpAsn: 0.432 ± 0.081
0.232TrpPro: 0.232 ± 0.06
0.306TrpGln: 0.306 ± 0.058
0.179TrpArg: 0.179 ± 0.042
0.316TrpSer: 0.316 ± 0.054
0.169TrpThr: 0.169 ± 0.043
0.285TrpVal: 0.285 ± 0.049
0.095TrpTrp: 0.095 ± 0.033
0.316TrpTyr: 0.316 ± 0.058
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.582TyrAla: 1.582 ± 0.131
0.432TyrCys: 0.432 ± 0.068
1.846TyrAsp: 1.846 ± 0.147
2.331TyrGlu: 2.331 ± 0.183
2.732TyrPhe: 2.732 ± 0.185
1.804TyrGly: 1.804 ± 0.154
1.213TyrHis: 1.213 ± 0.103
3.713TyrIle: 3.713 ± 0.203
2.848TyrLys: 2.848 ± 0.172
5.211TyrLeu: 5.211 ± 0.248
0.654TyrMet: 0.654 ± 0.085
2.489TyrAsn: 2.489 ± 0.165
1.013TyrPro: 1.013 ± 0.107
2.838TyrGln: 2.838 ± 0.179
1.445TyrArg: 1.445 ± 0.142
1.973TyrSer: 1.973 ± 0.135
1.593TyrThr: 1.593 ± 0.124
1.741TyrVal: 1.741 ± 0.146
0.285TyrTrp: 0.285 ± 0.058
1.688TyrTyr: 1.688 ± 0.15
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 327 proteins (94800 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski