Amino acid dipepetide frequency for Ralstonia phage RSP15

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.314AlaAla: 6.314 ± 0.695
0.545AlaCys: 0.545 ± 0.122
4.438AlaAsp: 4.438 ± 0.355
4.76AlaGlu: 4.76 ± 0.425
3.268AlaPhe: 3.268 ± 0.251
4.881AlaGly: 4.881 ± 0.308
0.948AlaHis: 0.948 ± 0.138
4.518AlaIle: 4.518 ± 0.295
5.769AlaLys: 5.769 ± 0.375
6.011AlaLeu: 6.011 ± 0.364
2.178AlaMet: 2.178 ± 0.212
3.772AlaAsn: 3.772 ± 0.376
2.017AlaPro: 2.017 ± 0.25
3.147AlaGln: 3.147 ± 0.273
3.248AlaArg: 3.248 ± 0.269
5.043AlaSer: 5.043 ± 0.434
4.317AlaThr: 4.317 ± 0.389
4.498AlaVal: 4.498 ± 0.334
0.888AlaTrp: 0.888 ± 0.123
2.844AlaTyr: 2.844 ± 0.229
0.0AlaXaa: 0.0 ± 0.0
Cys
0.766CysAla: 0.766 ± 0.134
0.222CysCys: 0.222 ± 0.062
0.625CysAsp: 0.625 ± 0.094
0.545CysGlu: 0.545 ± 0.092
0.424CysPhe: 0.424 ± 0.104
0.928CysGly: 0.928 ± 0.155
0.262CysHis: 0.262 ± 0.082
0.323CysIle: 0.323 ± 0.089
0.383CysLys: 0.383 ± 0.086
0.827CysLeu: 0.827 ± 0.153
0.383CysMet: 0.383 ± 0.084
0.565CysAsn: 0.565 ± 0.115
0.565CysPro: 0.565 ± 0.105
0.242CysGln: 0.242 ± 0.065
0.444CysArg: 0.444 ± 0.098
0.605CysSer: 0.605 ± 0.118
0.464CysThr: 0.464 ± 0.107
0.605CysVal: 0.605 ± 0.098
0.081CysTrp: 0.081 ± 0.038
0.383CysTyr: 0.383 ± 0.078
0.0CysXaa: 0.0 ± 0.0
Asp
5.063AspAla: 5.063 ± 0.297
0.484AspCys: 0.484 ± 0.106
3.974AspAsp: 3.974 ± 0.331
4.075AspGlu: 4.075 ± 0.375
3.611AspPhe: 3.611 ± 0.293
5.587AspGly: 5.587 ± 0.356
0.746AspHis: 0.746 ± 0.119
4.155AspIle: 4.155 ± 0.265
3.893AspLys: 3.893 ± 0.311
4.881AspLeu: 4.881 ± 0.304
1.856AspMet: 1.856 ± 0.185
2.905AspAsn: 2.905 ± 0.211
2.784AspPro: 2.784 ± 0.256
1.694AspGln: 1.694 ± 0.189
2.602AspArg: 2.602 ± 0.217
4.014AspSer: 4.014 ± 0.299
3.691AspThr: 3.691 ± 0.283
4.095AspVal: 4.095 ± 0.301
1.089AspTrp: 1.089 ± 0.161
3.409AspTyr: 3.409 ± 0.249
0.0AspXaa: 0.0 ± 0.0
Glu
5.446GluAla: 5.446 ± 0.388
0.726GluCys: 0.726 ± 0.126
3.974GluAsp: 3.974 ± 0.341
5.164GluGlu: 5.164 ± 0.419
3.369GluPhe: 3.369 ± 0.229
4.276GluGly: 4.276 ± 0.272
1.23GluHis: 1.23 ± 0.193
4.7GluIle: 4.7 ± 0.34
4.942GluLys: 4.942 ± 0.38
4.781GluLeu: 4.781 ± 0.295
2.32GluMet: 2.32 ± 0.232
3.066GluAsn: 3.066 ± 0.269
1.553GluPro: 1.553 ± 0.195
2.421GluGln: 2.421 ± 0.226
3.026GluArg: 3.026 ± 0.272
3.106GluSer: 3.106 ± 0.278
3.611GluThr: 3.611 ± 0.315
4.781GluVal: 4.781 ± 0.309
1.13GluTrp: 1.13 ± 0.169
2.985GluTyr: 2.985 ± 0.242
0.0GluXaa: 0.0 ± 0.0
Phe
2.945PheAla: 2.945 ± 0.178
0.403PheCys: 0.403 ± 0.098
3.792PheAsp: 3.792 ± 0.312
3.086PheGlu: 3.086 ± 0.275
1.876PhePhe: 1.876 ± 0.198
3.59PheGly: 3.59 ± 0.278
0.585PheHis: 0.585 ± 0.105
2.622PheIle: 2.622 ± 0.231
2.965PheLys: 2.965 ± 0.23
3.631PheLeu: 3.631 ± 0.254
1.513PheMet: 1.513 ± 0.183
3.086PheAsn: 3.086 ± 0.265
1.715PhePro: 1.715 ± 0.162
1.493PheGln: 1.493 ± 0.191
2.017PheArg: 2.017 ± 0.213
2.521PheSer: 2.521 ± 0.181
2.844PheThr: 2.844 ± 0.258
3.772PheVal: 3.772 ± 0.307
0.746PheTrp: 0.746 ± 0.132
1.896PheTyr: 1.896 ± 0.242
0.0PheXaa: 0.0 ± 0.0
Gly
4.054GlyAla: 4.054 ± 0.355
0.585GlyCys: 0.585 ± 0.122
4.458GlyAsp: 4.458 ± 0.28
3.832GlyGlu: 3.832 ± 0.242
3.389GlyPhe: 3.389 ± 0.243
5.527GlyGly: 5.527 ± 0.518
1.21GlyHis: 1.21 ± 0.172
4.196GlyIle: 4.196 ± 0.283
5.628GlyLys: 5.628 ± 0.323
4.881GlyLeu: 4.881 ± 0.365
1.573GlyMet: 1.573 ± 0.158
4.317GlyAsn: 4.317 ± 0.371
2.32GlyPro: 2.32 ± 0.218
2.824GlyGln: 2.824 ± 0.306
2.521GlyArg: 2.521 ± 0.215
4.781GlySer: 4.781 ± 0.347
4.861GlyThr: 4.861 ± 0.529
5.244GlyVal: 5.244 ± 0.275
1.13GlyTrp: 1.13 ± 0.169
3.227GlyTyr: 3.227 ± 0.241
0.0GlyXaa: 0.0 ± 0.0
His
1.009HisAla: 1.009 ± 0.153
0.161HisCys: 0.161 ± 0.066
1.17HisAsp: 1.17 ± 0.145
0.766HisGlu: 0.766 ± 0.137
0.787HisPhe: 0.787 ± 0.136
1.271HisGly: 1.271 ± 0.194
0.545HisHis: 0.545 ± 0.106
1.412HisIle: 1.412 ± 0.22
0.888HisLys: 0.888 ± 0.137
1.573HisLeu: 1.573 ± 0.198
0.403HisMet: 0.403 ± 0.085
0.746HisAsn: 0.746 ± 0.133
1.029HisPro: 1.029 ± 0.15
0.504HisGln: 0.504 ± 0.103
0.645HisArg: 0.645 ± 0.111
0.988HisSer: 0.988 ± 0.125
0.908HisThr: 0.908 ± 0.123
1.311HisVal: 1.311 ± 0.166
0.323HisTrp: 0.323 ± 0.079
0.706HisTyr: 0.706 ± 0.13
0.0HisXaa: 0.0 ± 0.0
Ile
4.639IleAla: 4.639 ± 0.322
0.565IleCys: 0.565 ± 0.104
4.902IleAsp: 4.902 ± 0.327
4.861IleGlu: 4.861 ± 0.311
2.421IlePhe: 2.421 ± 0.237
3.752IleGly: 3.752 ± 0.254
1.251IleHis: 1.251 ± 0.185
3.248IleIle: 3.248 ± 0.279
4.982IleLys: 4.982 ± 0.323
5.023IleLeu: 5.023 ± 0.334
1.533IleMet: 1.533 ± 0.189
3.248IleAsn: 3.248 ± 0.287
2.582IlePro: 2.582 ± 0.196
2.441IleGln: 2.441 ± 0.234
3.106IleArg: 3.106 ± 0.27
3.792IleSer: 3.792 ± 0.267
3.893IleThr: 3.893 ± 0.271
3.913IleVal: 3.913 ± 0.287
0.746IleTrp: 0.746 ± 0.135
1.977IleTyr: 1.977 ± 0.219
0.0IleXaa: 0.0 ± 0.0
Lys
5.648LysAla: 5.648 ± 0.468
0.645LysCys: 0.645 ± 0.118
4.74LysAsp: 4.74 ± 0.312
5.527LysGlu: 5.527 ± 0.384
3.49LysPhe: 3.49 ± 0.269
4.559LysGly: 4.559 ± 0.33
1.351LysHis: 1.351 ± 0.149
4.518LysIle: 4.518 ± 0.3
5.547LysLys: 5.547 ± 0.475
4.801LysLeu: 4.801 ± 0.326
2.542LysMet: 2.542 ± 0.218
4.034LysAsn: 4.034 ± 0.273
2.279LysPro: 2.279 ± 0.215
2.461LysGln: 2.461 ± 0.235
2.683LysArg: 2.683 ± 0.249
3.51LysSer: 3.51 ± 0.279
3.49LysThr: 3.49 ± 0.237
4.781LysVal: 4.781 ± 0.289
0.968LysTrp: 0.968 ± 0.109
3.308LysTyr: 3.308 ± 0.302
0.0LysXaa: 0.0 ± 0.0
Leu
5.527LeuAla: 5.527 ± 0.356
0.645LeuCys: 0.645 ± 0.107
4.902LeuAsp: 4.902 ± 0.331
4.982LeuGlu: 4.982 ± 0.362
3.51LeuPhe: 3.51 ± 0.259
4.478LeuGly: 4.478 ± 0.323
1.331LeuHis: 1.331 ± 0.165
4.438LeuIle: 4.438 ± 0.345
6.293LeuLys: 6.293 ± 0.345
4.922LeuLeu: 4.922 ± 0.336
1.755LeuMet: 1.755 ± 0.186
4.68LeuAsn: 4.68 ± 0.293
3.227LeuPro: 3.227 ± 0.256
2.804LeuGln: 2.804 ± 0.248
3.53LeuArg: 3.53 ± 0.303
4.538LeuSer: 4.538 ± 0.297
4.478LeuThr: 4.478 ± 0.312
4.337LeuVal: 4.337 ± 0.309
0.787LeuTrp: 0.787 ± 0.125
2.723LeuTyr: 2.723 ± 0.23
0.0LeuXaa: 0.0 ± 0.0
Met
1.896MetAla: 1.896 ± 0.217
0.222MetCys: 0.222 ± 0.07
1.654MetAsp: 1.654 ± 0.174
1.573MetGlu: 1.573 ± 0.218
1.21MetPhe: 1.21 ± 0.145
1.815MetGly: 1.815 ± 0.214
0.484MetHis: 0.484 ± 0.093
1.836MetIle: 1.836 ± 0.187
2.118MetLys: 2.118 ± 0.211
2.078MetLeu: 2.078 ± 0.226
0.827MetMet: 0.827 ± 0.145
1.594MetAsn: 1.594 ± 0.169
1.109MetPro: 1.109 ± 0.163
0.888MetGln: 0.888 ± 0.156
1.21MetArg: 1.21 ± 0.14
2.521MetSer: 2.521 ± 0.222
1.634MetThr: 1.634 ± 0.213
1.614MetVal: 1.614 ± 0.151
0.343MetTrp: 0.343 ± 0.081
0.807MetTyr: 0.807 ± 0.142
0.0MetXaa: 0.0 ± 0.0
Asn
4.357AsnAla: 4.357 ± 0.383
0.645AsnCys: 0.645 ± 0.116
2.703AsnAsp: 2.703 ± 0.207
2.844AsnGlu: 2.844 ± 0.183
2.299AsnPhe: 2.299 ± 0.19
5.244AsnGly: 5.244 ± 0.467
0.948AsnHis: 0.948 ± 0.14
3.389AsnIle: 3.389 ± 0.226
3.167AsnLys: 3.167 ± 0.239
4.538AsnLeu: 4.538 ± 0.275
1.392AsnMet: 1.392 ± 0.209
2.864AsnAsn: 2.864 ± 0.265
2.985AsnPro: 2.985 ± 0.266
2.219AsnGln: 2.219 ± 0.216
2.763AsnArg: 2.763 ± 0.251
3.409AsnSer: 3.409 ± 0.28
3.348AsnThr: 3.348 ± 0.313
3.449AsnVal: 3.449 ± 0.251
0.645AsnTrp: 0.645 ± 0.124
2.098AsnTyr: 2.098 ± 0.212
0.0AsnXaa: 0.0 ± 0.0
Pro
2.38ProAla: 2.38 ± 0.232
0.202ProCys: 0.202 ± 0.064
2.864ProAsp: 2.864 ± 0.243
3.369ProGlu: 3.369 ± 0.258
1.775ProPhe: 1.775 ± 0.19
2.36ProGly: 2.36 ± 0.211
0.948ProHis: 0.948 ± 0.171
2.239ProIle: 2.239 ± 0.218
2.784ProLys: 2.784 ± 0.25
2.34ProLeu: 2.34 ± 0.221
0.807ProMet: 0.807 ± 0.116
2.259ProAsn: 2.259 ± 0.191
0.888ProPro: 0.888 ± 0.156
1.23ProGln: 1.23 ± 0.137
1.372ProArg: 1.372 ± 0.156
2.461ProSer: 2.461 ± 0.258
2.118ProThr: 2.118 ± 0.194
2.884ProVal: 2.884 ± 0.225
0.565ProTrp: 0.565 ± 0.131
1.674ProTyr: 1.674 ± 0.175
0.0ProXaa: 0.0 ± 0.0
Gln
2.4GlnAla: 2.4 ± 0.233
0.323GlnCys: 0.323 ± 0.079
2.118GlnAsp: 2.118 ± 0.187
2.642GlnGlu: 2.642 ± 0.23
1.291GlnPhe: 1.291 ± 0.177
2.259GlnGly: 2.259 ± 0.289
0.605GlnHis: 0.605 ± 0.122
3.167GlnIle: 3.167 ± 0.258
2.36GlnLys: 2.36 ± 0.215
2.38GlnLeu: 2.38 ± 0.234
1.069GlnMet: 1.069 ± 0.139
2.098GlnAsn: 2.098 ± 0.227
1.15GlnPro: 1.15 ± 0.149
1.432GlnGln: 1.432 ± 0.171
1.553GlnArg: 1.553 ± 0.227
2.098GlnSer: 2.098 ± 0.231
2.219GlnThr: 2.219 ± 0.215
2.4GlnVal: 2.4 ± 0.229
0.565GlnTrp: 0.565 ± 0.114
1.775GlnTyr: 1.775 ± 0.163
0.0GlnXaa: 0.0 ± 0.0
Arg
3.066ArgAla: 3.066 ± 0.253
0.464ArgCys: 0.464 ± 0.109
2.703ArgAsp: 2.703 ± 0.199
3.147ArgGlu: 3.147 ± 0.277
2.239ArgPhe: 2.239 ± 0.215
2.723ArgGly: 2.723 ± 0.228
0.666ArgHis: 0.666 ± 0.135
3.207ArgIle: 3.207 ± 0.257
3.147ArgLys: 3.147 ± 0.243
3.631ArgLeu: 3.631 ± 0.277
1.13ArgMet: 1.13 ± 0.133
2.299ArgAsn: 2.299 ± 0.239
1.291ArgPro: 1.291 ± 0.154
1.432ArgGln: 1.432 ± 0.15
1.856ArgArg: 1.856 ± 0.194
2.199ArgSer: 2.199 ± 0.238
2.4ArgThr: 2.4 ± 0.258
3.409ArgVal: 3.409 ± 0.271
0.847ArgTrp: 0.847 ± 0.126
1.896ArgTyr: 1.896 ± 0.22
0.0ArgXaa: 0.0 ± 0.0
Ser
4.861SerAla: 4.861 ± 0.449
0.645SerCys: 0.645 ± 0.119
3.772SerAsp: 3.772 ± 0.306
3.127SerGlu: 3.127 ± 0.254
2.884SerPhe: 2.884 ± 0.259
5.204SerGly: 5.204 ± 0.449
0.988SerHis: 0.988 ± 0.148
3.631SerIle: 3.631 ± 0.26
4.095SerLys: 4.095 ± 0.239
4.74SerLeu: 4.74 ± 0.275
1.735SerMet: 1.735 ± 0.188
3.268SerAsn: 3.268 ± 0.259
2.078SerPro: 2.078 ± 0.19
2.017SerGln: 2.017 ± 0.21
2.219SerArg: 2.219 ± 0.215
3.207SerSer: 3.207 ± 0.306
3.55SerThr: 3.55 ± 0.312
4.337SerVal: 4.337 ± 0.338
0.645SerTrp: 0.645 ± 0.116
2.481SerTyr: 2.481 ± 0.249
0.0SerXaa: 0.0 ± 0.0
Thr
3.974ThrAla: 3.974 ± 0.376
0.585ThrCys: 0.585 ± 0.109
3.711ThrAsp: 3.711 ± 0.301
3.429ThrGlu: 3.429 ± 0.276
3.308ThrPhe: 3.308 ± 0.276
3.954ThrGly: 3.954 ± 0.319
0.827ThrHis: 0.827 ± 0.131
4.095ThrIle: 4.095 ± 0.273
3.691ThrLys: 3.691 ± 0.281
4.599ThrLeu: 4.599 ± 0.28
1.21ThrMet: 1.21 ± 0.151
3.711ThrAsn: 3.711 ± 0.294
2.905ThrPro: 2.905 ± 0.209
1.936ThrGln: 1.936 ± 0.218
2.421ThrArg: 2.421 ± 0.259
3.227ThrSer: 3.227 ± 0.351
3.974ThrThr: 3.974 ± 0.403
4.216ThrVal: 4.216 ± 0.351
0.766ThrTrp: 0.766 ± 0.109
2.501ThrTyr: 2.501 ± 0.249
0.0ThrXaa: 0.0 ± 0.0
Val
4.922ValAla: 4.922 ± 0.47
0.787ValCys: 0.787 ± 0.127
4.276ValAsp: 4.276 ± 0.302
4.74ValGlu: 4.74 ± 0.339
3.187ValPhe: 3.187 ± 0.278
4.236ValGly: 4.236 ± 0.284
1.089ValHis: 1.089 ± 0.144
4.296ValIle: 4.296 ± 0.234
4.74ValLys: 4.74 ± 0.312
4.599ValLeu: 4.599 ± 0.327
1.654ValMet: 1.654 ± 0.189
3.873ValAsn: 3.873 ± 0.323
2.824ValPro: 2.824 ± 0.255
2.4ValGln: 2.4 ± 0.276
3.469ValArg: 3.469 ± 0.247
4.155ValSer: 4.155 ± 0.28
4.377ValThr: 4.377 ± 0.308
4.942ValVal: 4.942 ± 0.4
0.968ValTrp: 0.968 ± 0.133
3.268ValTyr: 3.268 ± 0.273
0.0ValXaa: 0.0 ± 0.0
Trp
1.15TrpAla: 1.15 ± 0.139
0.242TrpCys: 0.242 ± 0.07
0.807TrpAsp: 0.807 ± 0.145
1.069TrpGlu: 1.069 ± 0.16
0.625TrpPhe: 0.625 ± 0.126
0.968TrpGly: 0.968 ± 0.134
0.363TrpHis: 0.363 ± 0.086
0.706TrpIle: 0.706 ± 0.132
0.888TrpLys: 0.888 ± 0.125
0.867TrpLeu: 0.867 ± 0.116
0.504TrpMet: 0.504 ± 0.089
0.928TrpAsn: 0.928 ± 0.126
0.363TrpPro: 0.363 ± 0.102
0.605TrpGln: 0.605 ± 0.115
0.766TrpArg: 0.766 ± 0.118
0.928TrpSer: 0.928 ± 0.121
0.625TrpThr: 0.625 ± 0.112
0.888TrpVal: 0.888 ± 0.124
0.202TrpTrp: 0.202 ± 0.072
0.363TrpTyr: 0.363 ± 0.098
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.985TyrAla: 2.985 ± 0.239
0.545TyrCys: 0.545 ± 0.108
3.026TyrAsp: 3.026 ± 0.181
3.066TyrGlu: 3.066 ± 0.279
2.138TyrPhe: 2.138 ± 0.19
2.945TyrGly: 2.945 ± 0.272
0.766TyrHis: 0.766 ± 0.119
2.178TyrIle: 2.178 ± 0.232
2.602TyrLys: 2.602 ± 0.206
2.784TyrLeu: 2.784 ± 0.203
1.029TyrMet: 1.029 ± 0.117
1.997TyrAsn: 1.997 ± 0.201
1.856TyrPro: 1.856 ± 0.219
1.715TyrGln: 1.715 ± 0.205
2.32TyrArg: 2.32 ± 0.223
2.38TyrSer: 2.38 ± 0.192
2.239TyrThr: 2.239 ± 0.196
3.328TyrVal: 3.328 ± 0.218
0.424TyrTrp: 0.424 ± 0.101
1.876TyrTyr: 1.876 ± 0.197
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 244 proteins (49577 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski