Amino acid dipepetide frequency for Sinorhizobium fredii GR64

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.77AlaAla: 14.77 ± 0.681
0.813AlaCys: 0.813 ± 0.132
6.193AlaAsp: 6.193 ± 0.346
6.599AlaGlu: 6.599 ± 0.423
4.307AlaPhe: 4.307 ± 0.326
9.779AlaGly: 9.779 ± 0.505
2.015AlaHis: 2.015 ± 0.18
7.154AlaIle: 7.154 ± 0.325
4.64AlaLys: 4.64 ± 0.35
10.832AlaLeu: 10.832 ± 0.401
3.198AlaMet: 3.198 ± 0.292
3.087AlaAsn: 3.087 ± 0.21
3.716AlaPro: 3.716 ± 0.293
4.085AlaGln: 4.085 ± 0.333
7.801AlaArg: 7.801 ± 0.468
6.877AlaSer: 6.877 ± 0.343
5.86AlaThr: 5.86 ± 0.355
8.854AlaVal: 8.854 ± 0.48
1.368AlaTrp: 1.368 ± 0.188
1.996AlaTyr: 1.996 ± 0.197
0.0AlaXaa: 0.0 ± 0.0
Cys
0.869CysAla: 0.869 ± 0.108
0.055CysCys: 0.055 ± 0.038
0.684CysAsp: 0.684 ± 0.114
0.536CysGlu: 0.536 ± 0.102
0.37CysPhe: 0.37 ± 0.084
0.665CysGly: 0.665 ± 0.159
0.259CysHis: 0.259 ± 0.061
0.37CysIle: 0.37 ± 0.074
0.148CysLys: 0.148 ± 0.059
0.906CysLeu: 0.906 ± 0.133
0.111CysMet: 0.111 ± 0.049
0.129CysAsn: 0.129 ± 0.054
0.462CysPro: 0.462 ± 0.105
0.185CysGln: 0.185 ± 0.056
0.592CysArg: 0.592 ± 0.107
0.592CysSer: 0.592 ± 0.095
0.24CysThr: 0.24 ± 0.076
0.739CysVal: 0.739 ± 0.118
0.166CysTrp: 0.166 ± 0.062
0.203CysTyr: 0.203 ± 0.053
0.0CysXaa: 0.0 ± 0.0
Asp
6.747AspAla: 6.747 ± 0.391
0.444AspCys: 0.444 ± 0.095
2.958AspAsp: 2.958 ± 0.321
4.178AspGlu: 4.178 ± 0.279
2.107AspPhe: 2.107 ± 0.196
4.806AspGly: 4.806 ± 0.285
1.294AspHis: 1.294 ± 0.139
2.68AspIle: 2.68 ± 0.258
1.571AspLys: 1.571 ± 0.177
6.451AspLeu: 6.451 ± 0.458
1.479AspMet: 1.479 ± 0.171
1.534AspAsn: 1.534 ± 0.202
2.902AspPro: 2.902 ± 0.249
1.682AspGln: 1.682 ± 0.19
4.141AspArg: 4.141 ± 0.314
2.107AspSer: 2.107 ± 0.225
2.662AspThr: 2.662 ± 0.228
4.141AspVal: 4.141 ± 0.288
0.98AspTrp: 0.98 ± 0.125
1.405AspTyr: 1.405 ± 0.165
0.0AspXaa: 0.0 ± 0.0
Glu
6.895GluAla: 6.895 ± 0.433
0.296GluCys: 0.296 ± 0.092
2.754GluAsp: 2.754 ± 0.239
3.66GluGlu: 3.66 ± 0.274
2.126GluPhe: 2.126 ± 0.204
4.547GluGly: 4.547 ± 0.326
1.109GluHis: 1.109 ± 0.126
4.658GluIle: 4.658 ± 0.305
2.403GluLys: 2.403 ± 0.227
6.193GluLeu: 6.193 ± 0.402
1.682GluMet: 1.682 ± 0.162
1.904GluAsn: 1.904 ± 0.188
2.2GluPro: 2.2 ± 0.196
2.44GluGln: 2.44 ± 0.261
5.379GluArg: 5.379 ± 0.438
2.995GluSer: 2.995 ± 0.286
4.011GluThr: 4.011 ± 0.284
3.993GluVal: 3.993 ± 0.316
0.795GluTrp: 0.795 ± 0.113
1.128GluTyr: 1.128 ± 0.154
0.0GluXaa: 0.0 ± 0.0
Phe
4.418PheAla: 4.418 ± 0.293
0.37PheCys: 0.37 ± 0.078
2.588PheAsp: 2.588 ± 0.238
2.496PheGlu: 2.496 ± 0.213
1.146PhePhe: 1.146 ± 0.141
3.919PheGly: 3.919 ± 0.272
0.795PheHis: 0.795 ± 0.145
1.553PheIle: 1.553 ± 0.151
1.257PheLys: 1.257 ± 0.154
3.179PheLeu: 3.179 ± 0.249
0.481PheMet: 0.481 ± 0.095
1.368PheAsn: 1.368 ± 0.193
1.793PhePro: 1.793 ± 0.217
1.109PheGln: 1.109 ± 0.138
2.514PheArg: 2.514 ± 0.227
1.996PheSer: 1.996 ± 0.182
1.479PheThr: 1.479 ± 0.195
2.847PheVal: 2.847 ± 0.248
0.481PheTrp: 0.481 ± 0.111
0.887PheTyr: 0.887 ± 0.135
0.0PheXaa: 0.0 ± 0.0
Gly
7.708GlyAla: 7.708 ± 0.428
0.721GlyCys: 0.721 ± 0.137
4.381GlyAsp: 4.381 ± 0.332
4.825GlyGlu: 4.825 ± 0.389
3.568GlyPhe: 3.568 ± 0.267
6.655GlyGly: 6.655 ± 0.471
2.033GlyHis: 2.033 ± 0.182
5.342GlyIle: 5.342 ± 0.332
4.03GlyLys: 4.03 ± 0.26
8.06GlyLeu: 8.06 ± 0.42
1.922GlyMet: 1.922 ± 0.231
2.366GlyAsn: 2.366 ± 0.195
2.921GlyPro: 2.921 ± 0.235
2.551GlyGln: 2.551 ± 0.201
5.934GlyArg: 5.934 ± 0.296
5.638GlySer: 5.638 ± 0.387
4.159GlyThr: 4.159 ± 0.316
6.877GlyVal: 6.877 ± 0.391
1.22GlyTrp: 1.22 ± 0.172
1.978GlyTyr: 1.978 ± 0.215
0.0GlyXaa: 0.0 ± 0.0
His
1.978HisAla: 1.978 ± 0.188
0.185HisCys: 0.185 ± 0.06
1.22HisAsp: 1.22 ± 0.17
1.091HisGlu: 1.091 ± 0.119
1.017HisPhe: 1.017 ± 0.144
1.682HisGly: 1.682 ± 0.234
0.518HisHis: 0.518 ± 0.111
0.98HisIle: 0.98 ± 0.128
0.407HisLys: 0.407 ± 0.078
2.144HisLeu: 2.144 ± 0.211
0.481HisMet: 0.481 ± 0.108
0.665HisAsn: 0.665 ± 0.104
1.349HisPro: 1.349 ± 0.17
0.536HisGln: 0.536 ± 0.109
1.738HisArg: 1.738 ± 0.185
1.275HisSer: 1.275 ± 0.184
0.813HisThr: 0.813 ± 0.12
1.239HisVal: 1.239 ± 0.147
0.24HisTrp: 0.24 ± 0.073
0.665HisTyr: 0.665 ± 0.108
0.0HisXaa: 0.0 ± 0.0
Ile
7.357IleAla: 7.357 ± 0.372
0.536IleCys: 0.536 ± 0.1
4.436IleAsp: 4.436 ± 0.217
3.716IleGlu: 3.716 ± 0.286
1.645IlePhe: 1.645 ± 0.172
5.268IleGly: 5.268 ± 0.368
1.054IleHis: 1.054 ± 0.126
3.106IleIle: 3.106 ± 0.284
1.775IleLys: 1.775 ± 0.188
4.788IleLeu: 4.788 ± 0.343
1.202IleMet: 1.202 ± 0.14
1.812IleAsn: 1.812 ± 0.209
2.459IlePro: 2.459 ± 0.217
1.386IleGln: 1.386 ± 0.15
3.549IleArg: 3.549 ± 0.249
3.66IleSer: 3.66 ± 0.21
2.459IleThr: 2.459 ± 0.204
4.991IleVal: 4.991 ± 0.361
0.61IleTrp: 0.61 ± 0.114
1.405IleTyr: 1.405 ± 0.178
0.0IleXaa: 0.0 ± 0.0
Lys
4.51LysAla: 4.51 ± 0.334
0.388LysCys: 0.388 ± 0.091
1.886LysAsp: 1.886 ± 0.178
2.033LysGlu: 2.033 ± 0.21
1.035LysPhe: 1.035 ± 0.151
2.403LysGly: 2.403 ± 0.191
0.647LysHis: 0.647 ± 0.114
2.532LysIle: 2.532 ± 0.245
1.571LysLys: 1.571 ± 0.178
3.568LysLeu: 3.568 ± 0.261
0.832LysMet: 0.832 ± 0.133
0.887LysAsn: 0.887 ± 0.128
2.292LysPro: 2.292 ± 0.198
1.312LysGln: 1.312 ± 0.156
2.366LysArg: 2.366 ± 0.244
2.551LysSer: 2.551 ± 0.235
2.181LysThr: 2.181 ± 0.199
3.198LysVal: 3.198 ± 0.291
0.573LysTrp: 0.573 ± 0.116
0.776LysTyr: 0.776 ± 0.134
0.0LysXaa: 0.0 ± 0.0
Leu
11.387LeuAla: 11.387 ± 0.484
0.869LeuCys: 0.869 ± 0.12
5.305LeuAsp: 5.305 ± 0.329
5.767LeuGlu: 5.767 ± 0.327
3.106LeuPhe: 3.106 ± 0.294
8.485LeuGly: 8.485 ± 0.468
1.756LeuHis: 1.756 ± 0.184
5.139LeuIle: 5.139 ± 0.383
3.882LeuLys: 3.882 ± 0.287
8.244LeuLeu: 8.244 ± 0.499
2.348LeuMet: 2.348 ± 0.206
3.124LeuAsn: 3.124 ± 0.22
4.621LeuPro: 4.621 ± 0.328
3.309LeuGln: 3.309 ± 0.225
7.172LeuArg: 7.172 ± 0.464
7.671LeuSer: 7.671 ± 0.412
4.917LeuThr: 4.917 ± 0.265
7.764LeuVal: 7.764 ± 0.43
0.665LeuTrp: 0.665 ± 0.128
1.627LeuTyr: 1.627 ± 0.172
0.0LeuXaa: 0.0 ± 0.0
Met
3.069MetAla: 3.069 ± 0.289
0.166MetCys: 0.166 ± 0.06
0.813MetAsp: 0.813 ± 0.127
1.202MetGlu: 1.202 ± 0.162
0.906MetPhe: 0.906 ± 0.151
1.553MetGly: 1.553 ± 0.164
0.37MetHis: 0.37 ± 0.094
1.128MetIle: 1.128 ± 0.137
1.22MetLys: 1.22 ± 0.164
2.422MetLeu: 2.422 ± 0.237
0.592MetMet: 0.592 ± 0.122
0.647MetAsn: 0.647 ± 0.139
0.943MetPro: 0.943 ± 0.126
0.869MetGln: 0.869 ± 0.142
1.664MetArg: 1.664 ± 0.182
2.181MetSer: 2.181 ± 0.185
1.83MetThr: 1.83 ± 0.188
1.886MetVal: 1.886 ± 0.187
0.333MetTrp: 0.333 ± 0.082
0.277MetTyr: 0.277 ± 0.065
0.0MetXaa: 0.0 ± 0.0
Asn
3.272AsnAla: 3.272 ± 0.264
0.351AsnCys: 0.351 ± 0.091
1.59AsnAsp: 1.59 ± 0.16
1.867AsnGlu: 1.867 ± 0.186
1.128AsnPhe: 1.128 ± 0.144
2.939AsnGly: 2.939 ± 0.237
0.629AsnHis: 0.629 ± 0.108
1.59AsnIle: 1.59 ± 0.155
0.536AsnLys: 0.536 ± 0.101
3.124AsnLeu: 3.124 ± 0.277
0.869AsnMet: 0.869 ± 0.113
1.017AsnAsn: 1.017 ± 0.146
1.553AsnPro: 1.553 ± 0.18
1.054AsnGln: 1.054 ± 0.105
1.775AsnArg: 1.775 ± 0.167
1.516AsnSer: 1.516 ± 0.181
1.423AsnThr: 1.423 ± 0.177
2.532AsnVal: 2.532 ± 0.243
0.277AsnTrp: 0.277 ± 0.072
0.481AsnTyr: 0.481 ± 0.097
0.0AsnXaa: 0.0 ± 0.0
Pro
4.788ProAla: 4.788 ± 0.339
0.24ProCys: 0.24 ± 0.065
2.736ProAsp: 2.736 ± 0.282
3.106ProGlu: 3.106 ± 0.262
1.959ProPhe: 1.959 ± 0.189
3.549ProGly: 3.549 ± 0.238
0.887ProHis: 0.887 ± 0.146
2.329ProIle: 2.329 ± 0.226
1.664ProLys: 1.664 ± 0.205
4.196ProLeu: 4.196 ± 0.35
0.832ProMet: 0.832 ± 0.13
1.22ProAsn: 1.22 ± 0.155
2.237ProPro: 2.237 ± 0.251
1.479ProGln: 1.479 ± 0.196
2.163ProArg: 2.163 ± 0.254
3.179ProSer: 3.179 ± 0.268
2.366ProThr: 2.366 ± 0.207
3.753ProVal: 3.753 ± 0.277
0.795ProTrp: 0.795 ± 0.105
0.887ProTyr: 0.887 ± 0.143
0.0ProXaa: 0.0 ± 0.0
Gln
3.623GlnAla: 3.623 ± 0.279
0.148GlnCys: 0.148 ± 0.05
1.368GlnAsp: 1.368 ± 0.165
1.775GlnGlu: 1.775 ± 0.196
1.22GlnPhe: 1.22 ± 0.164
2.237GlnGly: 2.237 ± 0.225
0.758GlnHis: 0.758 ± 0.101
2.274GlnIle: 2.274 ± 0.221
1.497GlnLys: 1.497 ± 0.142
3.327GlnLeu: 3.327 ± 0.25
1.128GlnMet: 1.128 ± 0.161
1.017GlnAsn: 1.017 ± 0.163
1.405GlnPro: 1.405 ± 0.143
1.775GlnGln: 1.775 ± 0.206
2.255GlnArg: 2.255 ± 0.238
2.459GlnSer: 2.459 ± 0.257
2.089GlnThr: 2.089 ± 0.236
2.329GlnVal: 2.329 ± 0.2
0.296GlnTrp: 0.296 ± 0.071
0.573GlnTyr: 0.573 ± 0.097
0.0GlnXaa: 0.0 ± 0.0
Arg
7.135ArgAla: 7.135 ± 0.331
0.592ArgCys: 0.592 ± 0.107
3.993ArgAsp: 3.993 ± 0.334
4.788ArgGlu: 4.788 ± 0.365
2.496ArgPhe: 2.496 ± 0.242
4.436ArgGly: 4.436 ± 0.31
1.608ArgHis: 1.608 ± 0.183
3.531ArgIle: 3.531 ± 0.257
2.921ArgLys: 2.921 ± 0.289
7.135ArgLeu: 7.135 ± 0.491
1.959ArgMet: 1.959 ± 0.169
2.385ArgAsn: 2.385 ± 0.196
3.42ArgPro: 3.42 ± 0.246
2.921ArgGln: 2.921 ± 0.261
5.472ArgArg: 5.472 ± 0.424
4.825ArgSer: 4.825 ± 0.315
3.438ArgThr: 3.438 ± 0.272
4.104ArgVal: 4.104 ± 0.282
0.924ArgTrp: 0.924 ± 0.132
1.719ArgTyr: 1.719 ± 0.177
0.0ArgXaa: 0.0 ± 0.0
Ser
6.895SerAla: 6.895 ± 0.361
0.499SerCys: 0.499 ± 0.101
3.605SerAsp: 3.605 ± 0.305
4.067SerGlu: 4.067 ± 0.324
2.237SerPhe: 2.237 ± 0.186
6.433SerGly: 6.433 ± 0.353
1.46SerHis: 1.46 ± 0.15
3.734SerIle: 3.734 ± 0.321
2.366SerLys: 2.366 ± 0.249
5.804SerLeu: 5.804 ± 0.326
1.571SerMet: 1.571 ± 0.166
1.719SerAsn: 1.719 ± 0.215
2.662SerPro: 2.662 ± 0.283
1.886SerGln: 1.886 ± 0.178
4.547SerArg: 4.547 ± 0.292
5.12SerSer: 5.12 ± 0.353
3.438SerThr: 3.438 ± 0.233
4.751SerVal: 4.751 ± 0.344
0.869SerTrp: 0.869 ± 0.116
1.664SerTyr: 1.664 ± 0.155
0.0SerXaa: 0.0 ± 0.0
Thr
6.377ThrAla: 6.377 ± 0.315
0.388ThrCys: 0.388 ± 0.093
2.717ThrAsp: 2.717 ± 0.245
2.902ThrGlu: 2.902 ± 0.244
2.089ThrPhe: 2.089 ± 0.224
4.825ThrGly: 4.825 ± 0.305
1.054ThrHis: 1.054 ± 0.151
3.531ThrIle: 3.531 ± 0.263
1.719ThrLys: 1.719 ± 0.201
5.083ThrLeu: 5.083 ± 0.337
1.091ThrMet: 1.091 ± 0.157
1.312ThrAsn: 1.312 ± 0.145
2.496ThrPro: 2.496 ± 0.232
1.627ThrGln: 1.627 ± 0.222
2.736ThrArg: 2.736 ± 0.213
3.457ThrSer: 3.457 ± 0.282
2.976ThrThr: 2.976 ± 0.275
3.697ThrVal: 3.697 ± 0.27
0.647ThrTrp: 0.647 ± 0.11
1.275ThrTyr: 1.275 ± 0.185
0.0ThrXaa: 0.0 ± 0.0
Val
8.466ValAla: 8.466 ± 0.427
0.795ValCys: 0.795 ± 0.148
4.769ValAsp: 4.769 ± 0.334
4.769ValGlu: 4.769 ± 0.243
2.976ValPhe: 2.976 ± 0.26
6.267ValGly: 6.267 ± 0.447
1.349ValHis: 1.349 ± 0.17
4.141ValIle: 4.141 ± 0.321
2.717ValLys: 2.717 ± 0.221
8.134ValLeu: 8.134 ± 0.506
1.497ValMet: 1.497 ± 0.19
2.329ValAsn: 2.329 ± 0.206
3.346ValPro: 3.346 ± 0.223
2.052ValGln: 2.052 ± 0.206
5.324ValArg: 5.324 ± 0.395
5.268ValSer: 5.268 ± 0.358
3.753ValThr: 3.753 ± 0.235
6.063ValVal: 6.063 ± 0.375
0.85ValTrp: 0.85 ± 0.105
1.46ValTyr: 1.46 ± 0.19
0.0ValXaa: 0.0 ± 0.0
Trp
1.072TrpAla: 1.072 ± 0.15
0.166TrpCys: 0.166 ± 0.062
0.832TrpAsp: 0.832 ± 0.137
0.555TrpGlu: 0.555 ± 0.107
0.499TrpPhe: 0.499 ± 0.096
0.776TrpGly: 0.776 ± 0.151
0.314TrpHis: 0.314 ± 0.082
0.684TrpIle: 0.684 ± 0.124
0.536TrpLys: 0.536 ± 0.097
1.405TrpLeu: 1.405 ± 0.18
0.425TrpMet: 0.425 ± 0.084
0.536TrpAsn: 0.536 ± 0.109
0.536TrpPro: 0.536 ± 0.098
0.518TrpGln: 0.518 ± 0.093
1.091TrpArg: 1.091 ± 0.147
0.721TrpSer: 0.721 ± 0.121
0.906TrpThr: 0.906 ± 0.116
0.555TrpVal: 0.555 ± 0.098
0.166TrpTrp: 0.166 ± 0.061
0.148TrpTyr: 0.148 ± 0.055
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.625TyrAla: 2.625 ± 0.257
0.259TyrCys: 0.259 ± 0.075
1.442TyrAsp: 1.442 ± 0.19
1.239TyrGlu: 1.239 ± 0.152
0.739TyrPhe: 0.739 ± 0.113
1.664TyrGly: 1.664 ± 0.207
0.314TyrHis: 0.314 ± 0.074
0.758TyrIle: 0.758 ± 0.127
0.684TyrLys: 0.684 ± 0.139
2.163TyrLeu: 2.163 ± 0.2
0.351TyrMet: 0.351 ± 0.098
0.462TyrAsn: 0.462 ± 0.112
1.054TyrPro: 1.054 ± 0.131
0.776TyrGln: 0.776 ± 0.108
1.59TyrArg: 1.59 ± 0.166
1.239TyrSer: 1.239 ± 0.182
0.998TyrThr: 0.998 ± 0.179
2.015TyrVal: 2.015 ± 0.202
0.185TyrTrp: 0.185 ± 0.06
0.684TyrTyr: 0.684 ± 0.15
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 166 proteins (54098 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski