Amino acid dipepetide frequency for Candidatus Phytoplasma pruni

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.418AlaAla: 2.418 ± 0.151
0.286AlaCys: 0.286 ± 0.039
2.125AlaAsp: 2.125 ± 0.125
2.656AlaGlu: 2.656 ± 0.14
2.622AlaPhe: 2.622 ± 0.143
2.098AlaGly: 2.098 ± 0.171
0.776AlaHis: 0.776 ± 0.085
3.95AlaIle: 3.95 ± 0.166
4.433AlaLys: 4.433 ± 0.182
5.305AlaLeu: 5.305 ± 0.205
0.967AlaMet: 0.967 ± 0.066
2.567AlaAsn: 2.567 ± 0.131
1.022AlaPro: 1.022 ± 0.095
2.016AlaGln: 2.016 ± 0.147
1.505AlaArg: 1.505 ± 0.088
2.758AlaSer: 2.758 ± 0.167
2.152AlaThr: 2.152 ± 0.129
2.636AlaVal: 2.636 ± 0.17
0.252AlaTrp: 0.252 ± 0.036
1.505AlaTyr: 1.505 ± 0.095
0.0AlaXaa: 0.0 ± 0.0
Cys
0.3CysAla: 0.3 ± 0.047
0.089CysCys: 0.089 ± 0.025
0.388CysAsp: 0.388 ± 0.05
0.368CysGlu: 0.368 ± 0.049
0.49CysPhe: 0.49 ± 0.053
0.395CysGly: 0.395 ± 0.058
0.157CysHis: 0.157 ± 0.034
0.32CysIle: 0.32 ± 0.047
0.334CysLys: 0.334 ± 0.045
0.804CysLeu: 0.804 ± 0.069
0.17CysMet: 0.17 ± 0.033
0.259CysAsn: 0.259 ± 0.039
0.204CysPro: 0.204 ± 0.032
0.415CysGln: 0.415 ± 0.054
0.177CysArg: 0.177 ± 0.028
0.347CysSer: 0.347 ± 0.048
0.238CysThr: 0.238 ± 0.039
0.361CysVal: 0.361 ± 0.049
0.068CysTrp: 0.068 ± 0.026
0.266CysTyr: 0.266 ± 0.047
0.0CysXaa: 0.0 ± 0.0
Asp
2.145AspAla: 2.145 ± 0.128
0.266AspCys: 0.266 ± 0.043
2.329AspAsp: 2.329 ± 0.142
3.357AspGlu: 3.357 ± 0.147
3.31AspPhe: 3.31 ± 0.158
2.098AspGly: 2.098 ± 0.129
0.742AspHis: 0.742 ± 0.069
4.624AspIle: 4.624 ± 0.164
5.734AspLys: 5.734 ± 0.236
6.156AspLeu: 6.156 ± 0.193
0.749AspMet: 0.749 ± 0.064
3.221AspAsn: 3.221 ± 0.165
1.655AspPro: 1.655 ± 0.112
1.989AspGln: 1.989 ± 0.119
1.26AspArg: 1.26 ± 0.093
2.54AspSer: 2.54 ± 0.134
1.784AspThr: 1.784 ± 0.122
3.126AspVal: 3.126 ± 0.137
0.341AspTrp: 0.341 ± 0.046
2.213AspTyr: 2.213 ± 0.124
0.0AspXaa: 0.0 ± 0.0
Glu
3.323GluAla: 3.323 ± 0.177
0.279GluCys: 0.279 ± 0.043
2.853GluAsp: 2.853 ± 0.149
6.388GluGlu: 6.388 ± 0.258
2.397GluPhe: 2.397 ± 0.129
2.676GluGly: 2.676 ± 0.144
1.13GluHis: 1.13 ± 0.083
7.171GluIle: 7.171 ± 0.291
9.2GluLys: 9.2 ± 0.312
6.061GluLeu: 6.061 ± 0.227
1.886GluMet: 1.886 ± 0.106
4.917GluAsn: 4.917 ± 0.179
1.594GluPro: 1.594 ± 0.132
3.63GluGln: 3.63 ± 0.162
1.995GluArg: 1.995 ± 0.112
2.956GluSer: 2.956 ± 0.148
3.841GluThr: 3.841 ± 0.172
3.732GluVal: 3.732 ± 0.161
0.402GluTrp: 0.402 ± 0.053
2.356GluTyr: 2.356 ± 0.154
0.0GluXaa: 0.0 ± 0.0
Phe
2.159PheAla: 2.159 ± 0.131
0.449PheCys: 0.449 ± 0.05
3.024PheAsp: 3.024 ± 0.134
2.697PheGlu: 2.697 ± 0.13
4.236PhePhe: 4.236 ± 0.273
1.989PheGly: 1.989 ± 0.107
1.26PheHis: 1.26 ± 0.094
4.883PheIle: 4.883 ± 0.214
5.128PheLys: 5.128 ± 0.19
7.58PheLeu: 7.58 ± 0.288
1.137PheMet: 1.137 ± 0.097
3.596PheAsn: 3.596 ± 0.161
1.634PhePro: 1.634 ± 0.105
2.649PheGln: 2.649 ± 0.156
1.464PheArg: 1.464 ± 0.098
4.141PheSer: 4.141 ± 0.191
2.247PheThr: 2.247 ± 0.128
3.269PheVal: 3.269 ± 0.181
0.449PheTrp: 0.449 ± 0.054
2.853PheTyr: 2.853 ± 0.151
0.0PheXaa: 0.0 ± 0.0
Gly
2.309GlyAla: 2.309 ± 0.136
0.361GlyCys: 0.361 ± 0.049
2.111GlyAsp: 2.111 ± 0.118
2.356GlyGlu: 2.356 ± 0.134
2.636GlyPhe: 2.636 ± 0.173
2.71GlyGly: 2.71 ± 0.19
1.185GlyHis: 1.185 ± 0.088
4.072GlyIle: 4.072 ± 0.185
4.236GlyLys: 4.236 ± 0.175
4.27GlyLeu: 4.27 ± 0.174
1.049GlyMet: 1.049 ± 0.082
2.275GlyAsn: 2.275 ± 0.127
1.049GlyPro: 1.049 ± 0.098
1.866GlyGln: 1.866 ± 0.123
1.539GlyArg: 1.539 ± 0.136
2.894GlySer: 2.894 ± 0.151
2.288GlyThr: 2.288 ± 0.141
3.235GlyVal: 3.235 ± 0.166
0.327GlyTrp: 0.327 ± 0.048
1.934GlyTyr: 1.934 ± 0.143
0.0GlyXaa: 0.0 ± 0.0
His
0.824HisAla: 0.824 ± 0.067
0.109HisCys: 0.109 ± 0.028
0.817HisAsp: 0.817 ± 0.084
1.158HisGlu: 1.158 ± 0.087
1.28HisPhe: 1.28 ± 0.097
0.81HisGly: 0.81 ± 0.083
0.552HisHis: 0.552 ± 0.073
1.594HisIle: 1.594 ± 0.119
1.771HisLys: 1.771 ± 0.112
2.479HisLeu: 2.479 ± 0.126
0.313HisMet: 0.313 ± 0.046
1.199HisAsn: 1.199 ± 0.1
0.851HisPro: 0.851 ± 0.077
1.103HisGln: 1.103 ± 0.084
0.695HisArg: 0.695 ± 0.071
1.076HisSer: 1.076 ± 0.088
0.817HisThr: 0.817 ± 0.075
0.947HisVal: 0.947 ± 0.083
0.116HisTrp: 0.116 ± 0.029
0.947HisTyr: 0.947 ± 0.076
0.0HisXaa: 0.0 ± 0.0
Ile
4.794IleAla: 4.794 ± 0.216
0.579IleCys: 0.579 ± 0.067
5.285IleAsp: 5.285 ± 0.202
5.966IleGlu: 5.966 ± 0.232
4.849IlePhe: 4.849 ± 0.213
4.243IleGly: 4.243 ± 0.198
1.498IleHis: 1.498 ± 0.11
8.901IleIle: 8.901 ± 0.336
9.623IleLys: 9.623 ± 0.313
9.507IleLeu: 9.507 ± 0.289
1.784IleMet: 1.784 ± 0.103
6.381IleAsn: 6.381 ± 0.225
2.99IlePro: 2.99 ± 0.165
3.466IleGln: 3.466 ± 0.127
2.377IleArg: 2.377 ± 0.129
5.877IleSer: 5.877 ± 0.182
4.658IleThr: 4.658 ± 0.165
5.326IleVal: 5.326 ± 0.196
0.388IleTrp: 0.388 ± 0.059
3.228IleTyr: 3.228 ± 0.163
0.0IleXaa: 0.0 ± 0.0
Lys
4.365LysAla: 4.365 ± 0.165
0.436LysCys: 0.436 ± 0.054
5.748LysAsp: 5.748 ± 0.214
9.48LysGlu: 9.48 ± 0.288
3.637LysPhe: 3.637 ± 0.159
4.372LysGly: 4.372 ± 0.185
2.016LysHis: 2.016 ± 0.116
11.291LysIle: 11.291 ± 0.329
14.39LysLys: 14.39 ± 0.378
8.104LysLeu: 8.104 ± 0.243
2.969LysMet: 2.969 ± 0.122
9.418LysAsn: 9.418 ± 0.306
2.772LysPro: 2.772 ± 0.14
5.298LysGln: 5.298 ± 0.234
3.371LysArg: 3.371 ± 0.139
4.263LysSer: 4.263 ± 0.151
7.518LysThr: 7.518 ± 0.201
5.046LysVal: 5.046 ± 0.198
0.504LysTrp: 0.504 ± 0.067
3.943LysTyr: 3.943 ± 0.167
0.0LysXaa: 0.0 ± 0.0
Leu
4.617LeuAla: 4.617 ± 0.205
0.606LeuCys: 0.606 ± 0.056
5.169LeuAsp: 5.169 ± 0.207
8.063LeuGlu: 8.063 ± 0.255
6.47LeuPhe: 6.47 ± 0.281
5.005LeuGly: 5.005 ± 0.213
1.641LeuHis: 1.641 ± 0.106
8.955LeuIle: 8.955 ± 0.234
12.326LeuLys: 12.326 ± 0.296
11.101LeuLeu: 11.101 ± 0.348
2.268LeuMet: 2.268 ± 0.12
7.076LeuAsn: 7.076 ± 0.254
3.167LeuPro: 3.167 ± 0.178
4.324LeuGln: 4.324 ± 0.191
2.792LeuArg: 2.792 ± 0.159
7.3LeuSer: 7.3 ± 0.231
5.7LeuThr: 5.7 ± 0.185
5.782LeuVal: 5.782 ± 0.209
0.572LeuTrp: 0.572 ± 0.064
3.46LeuTyr: 3.46 ± 0.165
0.0LeuXaa: 0.0 ± 0.0
Met
1.09MetAla: 1.09 ± 0.081
0.116MetCys: 0.116 ± 0.029
0.926MetAsp: 0.926 ± 0.089
1.246MetGlu: 1.246 ± 0.087
1.185MetPhe: 1.185 ± 0.086
1.205MetGly: 1.205 ± 0.093
0.306MetHis: 0.306 ± 0.039
2.104MetIle: 2.104 ± 0.143
2.268MetLys: 2.268 ± 0.132
1.961MetLeu: 1.961 ± 0.101
0.558MetMet: 0.558 ± 0.062
1.784MetAsn: 1.784 ± 0.11
0.64MetPro: 0.64 ± 0.071
0.919MetGln: 0.919 ± 0.082
0.558MetArg: 0.558 ± 0.059
1.444MetSer: 1.444 ± 0.085
0.953MetThr: 0.953 ± 0.082
1.328MetVal: 1.328 ± 0.102
0.075MetTrp: 0.075 ± 0.024
0.504MetTyr: 0.504 ± 0.054
0.0MetXaa: 0.0 ± 0.0
Asn
2.758AsnAla: 2.758 ± 0.147
0.422AsnCys: 0.422 ± 0.056
3.391AsnAsp: 3.391 ± 0.16
4.195AsnGlu: 4.195 ± 0.197
3.95AsnPhe: 3.95 ± 0.186
2.384AsnGly: 2.384 ± 0.151
1.546AsnHis: 1.546 ± 0.109
6.694AsnIle: 6.694 ± 0.287
8.227AsnLys: 8.227 ± 0.284
7.164AsnLeu: 7.164 ± 0.244
1.321AsnMet: 1.321 ± 0.096
6.422AsnAsn: 6.422 ± 0.312
2.472AsnPro: 2.472 ± 0.143
3.5AsnGln: 3.5 ± 0.172
1.587AsnArg: 1.587 ± 0.112
3.534AsnSer: 3.534 ± 0.159
2.84AsnThr: 2.84 ± 0.117
3.521AsnVal: 3.521 ± 0.161
0.456AsnTrp: 0.456 ± 0.064
2.976AsnTyr: 2.976 ± 0.152
0.0AsnXaa: 0.0 ± 0.0
Pro
0.94ProAla: 0.94 ± 0.082
0.266ProCys: 0.266 ± 0.042
1.294ProAsp: 1.294 ± 0.1
2.172ProGlu: 2.172 ± 0.114
1.968ProPhe: 1.968 ± 0.13
1.376ProGly: 1.376 ± 0.108
0.865ProHis: 0.865 ± 0.072
2.315ProIle: 2.315 ± 0.121
2.785ProLys: 2.785 ± 0.127
3.255ProLeu: 3.255 ± 0.151
0.395ProMet: 0.395 ± 0.044
1.764ProAsn: 1.764 ± 0.11
0.824ProPro: 0.824 ± 0.079
1.811ProGln: 1.811 ± 0.129
0.722ProArg: 0.722 ± 0.067
2.057ProSer: 2.057 ± 0.126
1.573ProThr: 1.573 ± 0.109
1.58ProVal: 1.58 ± 0.114
0.177ProTrp: 0.177 ± 0.036
1.294ProTyr: 1.294 ± 0.097
0.0ProXaa: 0.0 ± 0.0
Gln
1.832GlnAla: 1.832 ± 0.102
0.143GlnCys: 0.143 ± 0.03
1.832GlnAsp: 1.832 ± 0.122
4.474GlnGlu: 4.474 ± 0.215
1.737GlnPhe: 1.737 ± 0.118
1.764GlnGly: 1.764 ± 0.098
0.715GlnHis: 0.715 ± 0.076
4.931GlnIle: 4.931 ± 0.173
6.797GlnLys: 6.797 ± 0.259
4.29GlnLeu: 4.29 ± 0.202
1.124GlnMet: 1.124 ± 0.074
3.834GlnAsn: 3.834 ± 0.178
1.151GlnPro: 1.151 ± 0.091
2.629GlnGln: 2.629 ± 0.195
1.525GlnArg: 1.525 ± 0.116
1.893GlnSer: 1.893 ± 0.111
2.881GlnThr: 2.881 ± 0.162
1.893GlnVal: 1.893 ± 0.128
0.286GlnTrp: 0.286 ± 0.06
1.287GlnTyr: 1.287 ± 0.086
0.0GlnXaa: 0.0 ± 0.0
Arg
1.165ArgAla: 1.165 ± 0.081
0.204ArgCys: 0.204 ± 0.036
1.423ArgAsp: 1.423 ± 0.108
1.682ArgGlu: 1.682 ± 0.115
1.634ArgPhe: 1.634 ± 0.11
1.41ArgGly: 1.41 ± 0.108
0.558ArgHis: 0.558 ± 0.057
2.942ArgIle: 2.942 ± 0.152
3.33ArgLys: 3.33 ± 0.165
2.853ArgLeu: 2.853 ± 0.149
0.756ArgMet: 0.756 ± 0.067
2.009ArgAsn: 2.009 ± 0.136
0.858ArgPro: 0.858 ± 0.07
1.321ArgGln: 1.321 ± 0.102
1.076ArgArg: 1.076 ± 0.088
1.471ArgSer: 1.471 ± 0.105
1.498ArgThr: 1.498 ± 0.12
1.362ArgVal: 1.362 ± 0.107
0.15ArgTrp: 0.15 ± 0.031
1.192ArgTyr: 1.192 ± 0.088
0.0ArgXaa: 0.0 ± 0.0
Ser
2.302SerAla: 2.302 ± 0.149
0.443SerCys: 0.443 ± 0.056
3.078SerAsp: 3.078 ± 0.163
3.507SerGlu: 3.507 ± 0.15
4.801SerPhe: 4.801 ± 0.197
3.003SerGly: 3.003 ± 0.158
1.212SerHis: 1.212 ± 0.084
4.386SerIle: 4.386 ± 0.182
5.074SerLys: 5.074 ± 0.193
7.205SerLeu: 7.205 ± 0.228
1.11SerMet: 1.11 ± 0.088
3.16SerAsn: 3.16 ± 0.135
1.525SerPro: 1.525 ± 0.087
2.976SerGln: 2.976 ± 0.19
1.784SerArg: 1.784 ± 0.132
4.188SerSer: 4.188 ± 0.17
2.084SerThr: 2.084 ± 0.122
3.228SerVal: 3.228 ± 0.166
0.422SerTrp: 0.422 ± 0.06
2.336SerTyr: 2.336 ± 0.114
0.0SerXaa: 0.0 ± 0.0
Thr
2.118ThrAla: 2.118 ± 0.108
0.368ThrCys: 0.368 ± 0.046
2.431ThrAsp: 2.431 ± 0.12
2.663ThrGlu: 2.663 ± 0.139
2.779ThrPhe: 2.779 ± 0.158
2.343ThrGly: 2.343 ± 0.138
1.212ThrHis: 1.212 ± 0.095
4.467ThrIle: 4.467 ± 0.163
4.774ThrLys: 4.774 ± 0.169
6.0ThrLeu: 6.0 ± 0.207
0.831ThrMet: 0.831 ± 0.08
3.208ThrAsn: 3.208 ± 0.156
1.88ThrPro: 1.88 ± 0.113
2.254ThrGln: 2.254 ± 0.134
1.464ThrArg: 1.464 ± 0.101
3.031ThrSer: 3.031 ± 0.129
2.813ThrThr: 2.813 ± 0.154
3.01ThrVal: 3.01 ± 0.152
0.279ThrTrp: 0.279 ± 0.049
2.05ThrTyr: 2.05 ± 0.11
0.0ThrXaa: 0.0 ± 0.0
Val
2.642ValAla: 2.642 ± 0.158
0.415ValCys: 0.415 ± 0.057
2.853ValAsp: 2.853 ± 0.16
3.351ValGlu: 3.351 ± 0.168
3.718ValPhe: 3.718 ± 0.187
2.887ValGly: 2.887 ± 0.172
0.981ValHis: 0.981 ± 0.071
4.849ValIle: 4.849 ± 0.205
4.876ValLys: 4.876 ± 0.164
6.654ValLeu: 6.654 ± 0.269
1.096ValMet: 1.096 ± 0.095
3.289ValAsn: 3.289 ± 0.145
1.757ValPro: 1.757 ± 0.136
1.982ValGln: 1.982 ± 0.095
1.478ValArg: 1.478 ± 0.109
3.725ValSer: 3.725 ± 0.188
2.424ValThr: 2.424 ± 0.151
4.045ValVal: 4.045 ± 0.234
0.443ValTrp: 0.443 ± 0.059
1.948ValTyr: 1.948 ± 0.115
0.0ValXaa: 0.0 ± 0.0
Trp
0.313TrpAla: 0.313 ± 0.044
0.054TrpCys: 0.054 ± 0.02
0.266TrpAsp: 0.266 ± 0.042
0.375TrpGlu: 0.375 ± 0.053
0.449TrpPhe: 0.449 ± 0.054
0.306TrpGly: 0.306 ± 0.051
0.197TrpHis: 0.197 ± 0.039
0.449TrpIle: 0.449 ± 0.061
0.599TrpLys: 0.599 ± 0.058
0.715TrpLeu: 0.715 ± 0.066
0.109TrpMet: 0.109 ± 0.026
0.443TrpAsn: 0.443 ± 0.061
0.163TrpPro: 0.163 ± 0.037
0.279TrpGln: 0.279 ± 0.047
0.252TrpArg: 0.252 ± 0.039
0.252TrpSer: 0.252 ± 0.038
0.232TrpThr: 0.232 ± 0.042
0.238TrpVal: 0.238 ± 0.042
0.061TrpTrp: 0.061 ± 0.02
0.286TrpTyr: 0.286 ± 0.046
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.634TyrAla: 1.634 ± 0.097
0.259TyrCys: 0.259 ± 0.046
2.295TyrAsp: 2.295 ± 0.141
2.636TyrGlu: 2.636 ± 0.131
2.697TyrPhe: 2.697 ± 0.187
1.628TyrGly: 1.628 ± 0.104
0.906TyrHis: 0.906 ± 0.075
2.819TyrIle: 2.819 ± 0.14
3.037TyrLys: 3.037 ± 0.149
4.931TyrLeu: 4.931 ± 0.164
0.572TyrMet: 0.572 ± 0.063
2.54TyrAsn: 2.54 ± 0.127
1.226TyrPro: 1.226 ± 0.084
2.588TyrGln: 2.588 ± 0.136
1.273TyrArg: 1.273 ± 0.082
2.084TyrSer: 2.084 ± 0.124
1.43TyrThr: 1.43 ± 0.113
1.75TyrVal: 1.75 ± 0.097
0.272TyrTrp: 0.272 ± 0.043
1.607TyrTyr: 1.607 ± 0.108
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 549 proteins (146841 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski