Amino acid dipepetide frequency for Raoultella phage Ro1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.044AlaAla: 6.044 ± 0.465
0.934AlaCys: 0.934 ± 0.156
4.784AlaAsp: 4.784 ± 0.324
4.784AlaGlu: 4.784 ± 0.382
2.987AlaPhe: 2.987 ± 0.257
5.414AlaGly: 5.414 ± 0.361
1.167AlaHis: 1.167 ± 0.159
4.131AlaIle: 4.131 ± 0.342
4.971AlaLys: 4.971 ± 0.347
5.764AlaLeu: 5.764 ± 0.472
2.334AlaMet: 2.334 ± 0.209
3.127AlaAsn: 3.127 ± 0.301
2.217AlaPro: 2.217 ± 0.276
2.264AlaGln: 2.264 ± 0.236
3.641AlaArg: 3.641 ± 0.316
3.897AlaSer: 3.897 ± 0.343
4.061AlaThr: 4.061 ± 0.338
4.224AlaVal: 4.224 ± 0.326
1.12AlaTrp: 1.12 ± 0.17
2.801AlaTyr: 2.801 ± 0.278
0.0AlaXaa: 0.0 ± 0.0
Cys
0.583CysAla: 0.583 ± 0.11
0.233CysCys: 0.233 ± 0.056
0.887CysAsp: 0.887 ± 0.144
0.793CysGlu: 0.793 ± 0.149
0.583CysPhe: 0.583 ± 0.114
1.074CysGly: 1.074 ± 0.192
0.467CysHis: 0.467 ± 0.105
0.77CysIle: 0.77 ± 0.135
0.98CysLys: 0.98 ± 0.148
1.074CysLeu: 1.074 ± 0.169
0.257CysMet: 0.257 ± 0.08
0.537CysAsn: 0.537 ± 0.103
0.467CysPro: 0.467 ± 0.11
0.397CysGln: 0.397 ± 0.097
0.513CysArg: 0.513 ± 0.129
0.98CysSer: 0.98 ± 0.159
0.63CysThr: 0.63 ± 0.114
0.863CysVal: 0.863 ± 0.125
0.233CysTrp: 0.233 ± 0.064
0.607CysTyr: 0.607 ± 0.107
0.0CysXaa: 0.0 ± 0.0
Asp
4.084AspAla: 4.084 ± 0.34
0.583AspCys: 0.583 ± 0.124
3.711AspAsp: 3.711 ± 0.265
4.761AspGlu: 4.761 ± 0.342
2.754AspPhe: 2.754 ± 0.251
5.018AspGly: 5.018 ± 0.397
1.47AspHis: 1.47 ± 0.191
3.827AspIle: 3.827 ± 0.353
4.598AspLys: 4.598 ± 0.334
6.208AspLeu: 6.208 ± 0.386
1.96AspMet: 1.96 ± 0.209
3.291AspAsn: 3.291 ± 0.336
2.731AspPro: 2.731 ± 0.269
1.937AspGln: 1.937 ± 0.233
3.151AspArg: 3.151 ± 0.269
3.361AspSer: 3.361 ± 0.308
3.104AspThr: 3.104 ± 0.253
4.714AspVal: 4.714 ± 0.33
1.424AspTrp: 1.424 ± 0.166
2.567AspTyr: 2.567 ± 0.222
0.0AspXaa: 0.0 ± 0.0
Glu
5.088GluAla: 5.088 ± 0.38
0.747GluCys: 0.747 ± 0.148
3.921GluAsp: 3.921 ± 0.274
4.994GluGlu: 4.994 ± 0.387
2.777GluPhe: 2.777 ± 0.261
3.921GluGly: 3.921 ± 0.299
1.26GluHis: 1.26 ± 0.16
4.831GluIle: 4.831 ± 0.388
5.274GluLys: 5.274 ± 0.405
5.484GluLeu: 5.484 ± 0.374
2.1GluMet: 2.1 ± 0.216
3.617GluAsn: 3.617 ± 0.246
1.447GluPro: 1.447 ± 0.175
1.96GluGln: 1.96 ± 0.25
3.804GluArg: 3.804 ± 0.279
3.804GluSer: 3.804 ± 0.328
3.104GluThr: 3.104 ± 0.238
4.341GluVal: 4.341 ± 0.294
1.284GluTrp: 1.284 ± 0.157
2.731GluTyr: 2.731 ± 0.266
0.0GluXaa: 0.0 ± 0.0
Phe
2.637PheAla: 2.637 ± 0.267
0.63PheCys: 0.63 ± 0.125
3.244PheAsp: 3.244 ± 0.351
2.684PheGlu: 2.684 ± 0.26
1.61PhePhe: 1.61 ± 0.214
3.337PheGly: 3.337 ± 0.328
0.747PheHis: 0.747 ± 0.13
2.474PheIle: 2.474 ± 0.24
2.287PheLys: 2.287 ± 0.235
3.011PheLeu: 3.011 ± 0.341
1.167PheMet: 1.167 ± 0.162
2.357PheAsn: 2.357 ± 0.238
1.564PhePro: 1.564 ± 0.177
1.237PheGln: 1.237 ± 0.178
1.937PheArg: 1.937 ± 0.231
3.057PheSer: 3.057 ± 0.24
2.404PheThr: 2.404 ± 0.214
3.081PheVal: 3.081 ± 0.26
0.723PheTrp: 0.723 ± 0.14
1.517PheTyr: 1.517 ± 0.177
0.0PheXaa: 0.0 ± 0.0
Gly
4.411GlyAla: 4.411 ± 0.347
0.84GlyCys: 0.84 ± 0.132
4.271GlyAsp: 4.271 ± 0.351
4.528GlyGlu: 4.528 ± 0.32
3.267GlyPhe: 3.267 ± 0.287
4.621GlyGly: 4.621 ± 0.375
1.447GlyHis: 1.447 ± 0.21
3.827GlyIle: 3.827 ± 0.323
6.068GlyLys: 6.068 ± 0.367
4.831GlyLeu: 4.831 ± 0.31
2.287GlyMet: 2.287 ± 0.218
3.641GlyAsn: 3.641 ± 0.337
1.704GlyPro: 1.704 ± 0.195
2.124GlyGln: 2.124 ± 0.252
3.174GlyArg: 3.174 ± 0.258
3.991GlySer: 3.991 ± 0.364
3.711GlyThr: 3.711 ± 0.321
5.508GlyVal: 5.508 ± 0.34
1.424GlyTrp: 1.424 ± 0.192
3.197GlyTyr: 3.197 ± 0.28
0.0GlyXaa: 0.0 ± 0.0
His
1.027HisAla: 1.027 ± 0.149
0.373HisCys: 0.373 ± 0.089
1.144HisAsp: 1.144 ± 0.166
1.19HisGlu: 1.19 ± 0.175
1.097HisPhe: 1.097 ± 0.163
1.634HisGly: 1.634 ± 0.175
0.467HisHis: 0.467 ± 0.098
1.237HisIle: 1.237 ± 0.169
1.424HisLys: 1.424 ± 0.172
1.54HisLeu: 1.54 ± 0.209
0.49HisMet: 0.49 ± 0.104
0.98HisAsn: 0.98 ± 0.137
0.793HisPro: 0.793 ± 0.118
0.56HisGln: 0.56 ± 0.133
1.144HisArg: 1.144 ± 0.169
1.004HisSer: 1.004 ± 0.166
1.05HisThr: 1.05 ± 0.136
1.144HisVal: 1.144 ± 0.162
0.257HisTrp: 0.257 ± 0.074
0.934HisTyr: 0.934 ± 0.144
0.0HisXaa: 0.0 ± 0.0
Ile
4.481IleAla: 4.481 ± 0.304
0.934IleCys: 0.934 ± 0.152
4.224IleAsp: 4.224 ± 0.289
4.084IleGlu: 4.084 ± 0.326
2.38IlePhe: 2.38 ± 0.234
3.524IleGly: 3.524 ± 0.314
1.494IleHis: 1.494 ± 0.169
3.641IleIle: 3.641 ± 0.314
3.757IleLys: 3.757 ± 0.279
4.317IleLeu: 4.317 ± 0.323
1.75IleMet: 1.75 ± 0.175
2.637IleAsn: 2.637 ± 0.218
2.474IlePro: 2.474 ± 0.225
1.47IleGln: 1.47 ± 0.192
3.221IleArg: 3.221 ± 0.291
4.341IleSer: 4.341 ± 0.318
3.011IleThr: 3.011 ± 0.208
4.434IleVal: 4.434 ± 0.266
0.723IleTrp: 0.723 ± 0.123
2.497IleTyr: 2.497 ± 0.225
0.0IleXaa: 0.0 ± 0.0
Lys
5.764LysAla: 5.764 ± 0.522
0.793LysCys: 0.793 ± 0.116
5.111LysAsp: 5.111 ± 0.362
5.298LysGlu: 5.298 ± 0.398
2.941LysPhe: 2.941 ± 0.282
4.504LysGly: 4.504 ± 0.306
1.354LysHis: 1.354 ± 0.225
4.388LysIle: 4.388 ± 0.322
5.391LysLys: 5.391 ± 0.462
5.298LysLeu: 5.298 ± 0.329
2.334LysMet: 2.334 ± 0.213
3.221LysAsn: 3.221 ± 0.254
2.217LysPro: 2.217 ± 0.237
2.52LysGln: 2.52 ± 0.249
3.711LysArg: 3.711 ± 0.361
3.617LysSer: 3.617 ± 0.274
4.411LysThr: 4.411 ± 0.325
5.484LysVal: 5.484 ± 0.351
1.167LysTrp: 1.167 ± 0.169
2.871LysTyr: 2.871 ± 0.235
0.0LysXaa: 0.0 ± 0.0
Leu
6.114LeuAla: 6.114 ± 0.409
1.12LeuCys: 1.12 ± 0.195
5.764LeuAsp: 5.764 ± 0.357
5.951LeuGlu: 5.951 ± 0.414
2.661LeuPhe: 2.661 ± 0.29
4.434LeuGly: 4.434 ± 0.353
1.634LeuHis: 1.634 ± 0.194
4.201LeuIle: 4.201 ± 0.304
5.788LeuLys: 5.788 ± 0.385
5.858LeuLeu: 5.858 ± 0.441
2.24LeuMet: 2.24 ± 0.22
3.991LeuAsn: 3.991 ± 0.309
3.431LeuPro: 3.431 ± 0.281
2.357LeuGln: 2.357 ± 0.254
4.481LeuArg: 4.481 ± 0.345
5.764LeuSer: 5.764 ± 0.359
4.458LeuThr: 4.458 ± 0.331
5.438LeuVal: 5.438 ± 0.381
1.4LeuTrp: 1.4 ± 0.178
3.127LeuTyr: 3.127 ± 0.252
0.0LeuXaa: 0.0 ± 0.0
Met
2.217MetAla: 2.217 ± 0.238
0.28MetCys: 0.28 ± 0.078
1.214MetAsp: 1.214 ± 0.178
1.704MetGlu: 1.704 ± 0.221
1.354MetPhe: 1.354 ± 0.181
1.61MetGly: 1.61 ± 0.228
0.327MetHis: 0.327 ± 0.089
1.89MetIle: 1.89 ± 0.217
3.011MetLys: 3.011 ± 0.277
2.427MetLeu: 2.427 ± 0.24
1.097MetMet: 1.097 ± 0.173
1.494MetAsn: 1.494 ± 0.176
1.05MetPro: 1.05 ± 0.175
0.84MetGln: 0.84 ± 0.133
1.26MetArg: 1.26 ± 0.167
1.937MetSer: 1.937 ± 0.166
1.984MetThr: 1.984 ± 0.207
2.03MetVal: 2.03 ± 0.207
0.35MetTrp: 0.35 ± 0.097
0.91MetTyr: 0.91 ± 0.132
0.0MetXaa: 0.0 ± 0.0
Asn
3.641AsnAla: 3.641 ± 0.353
0.7AsnCys: 0.7 ± 0.148
2.777AsnAsp: 2.777 ± 0.247
2.194AsnGlu: 2.194 ± 0.23
2.194AsnPhe: 2.194 ± 0.216
4.131AsnGly: 4.131 ± 0.359
0.98AsnHis: 0.98 ± 0.157
2.684AsnIle: 2.684 ± 0.238
3.454AsnLys: 3.454 ± 0.286
4.177AsnLeu: 4.177 ± 0.323
1.4AsnMet: 1.4 ± 0.17
2.777AsnAsn: 2.777 ± 0.303
2.544AsnPro: 2.544 ± 0.239
1.61AsnGln: 1.61 ± 0.203
2.731AsnArg: 2.731 ± 0.241
2.731AsnSer: 2.731 ± 0.261
3.034AsnThr: 3.034 ± 0.322
3.057AsnVal: 3.057 ± 0.31
0.63AsnTrp: 0.63 ± 0.131
1.727AsnTyr: 1.727 ± 0.212
0.0AsnXaa: 0.0 ± 0.0
Pro
2.357ProAla: 2.357 ± 0.285
0.397ProCys: 0.397 ± 0.103
2.754ProAsp: 2.754 ± 0.291
3.011ProGlu: 3.011 ± 0.307
1.727ProPhe: 1.727 ± 0.202
2.357ProGly: 2.357 ± 0.242
0.607ProHis: 0.607 ± 0.11
1.494ProIle: 1.494 ± 0.17
2.054ProLys: 2.054 ± 0.237
2.824ProLeu: 2.824 ± 0.294
0.863ProMet: 0.863 ± 0.136
1.727ProAsn: 1.727 ± 0.202
1.214ProPro: 1.214 ± 0.212
1.727ProGln: 1.727 ± 0.176
1.54ProArg: 1.54 ± 0.184
2.194ProSer: 2.194 ± 0.245
1.96ProThr: 1.96 ± 0.217
2.987ProVal: 2.987 ± 0.221
0.443ProTrp: 0.443 ± 0.097
1.68ProTyr: 1.68 ± 0.231
0.0ProXaa: 0.0 ± 0.0
Gln
2.287GlnAla: 2.287 ± 0.224
0.42GlnCys: 0.42 ± 0.088
2.147GlnAsp: 2.147 ± 0.27
2.194GlnGlu: 2.194 ± 0.254
1.167GlnPhe: 1.167 ± 0.157
2.007GlnGly: 2.007 ± 0.203
0.49GlnHis: 0.49 ± 0.111
2.264GlnIle: 2.264 ± 0.246
2.124GlnLys: 2.124 ± 0.224
2.59GlnLeu: 2.59 ± 0.274
1.074GlnMet: 1.074 ± 0.158
1.634GlnAsn: 1.634 ± 0.183
0.957GlnPro: 0.957 ± 0.156
1.564GlnGln: 1.564 ± 0.211
1.61GlnArg: 1.61 ± 0.162
1.797GlnSer: 1.797 ± 0.206
1.89GlnThr: 1.89 ± 0.236
2.264GlnVal: 2.264 ± 0.224
0.49GlnTrp: 0.49 ± 0.113
1.214GlnTyr: 1.214 ± 0.191
0.0GlnXaa: 0.0 ± 0.0
Arg
3.081ArgAla: 3.081 ± 0.282
0.863ArgCys: 0.863 ± 0.143
3.594ArgAsp: 3.594 ± 0.257
3.151ArgGlu: 3.151 ± 0.252
1.727ArgPhe: 1.727 ± 0.208
3.734ArgGly: 3.734 ± 0.281
0.98ArgHis: 0.98 ± 0.176
2.941ArgIle: 2.941 ± 0.23
3.874ArgLys: 3.874 ± 0.345
4.341ArgLeu: 4.341 ± 0.316
1.214ArgMet: 1.214 ± 0.167
2.707ArgAsn: 2.707 ± 0.218
1.774ArgPro: 1.774 ± 0.187
2.007ArgGln: 2.007 ± 0.203
2.894ArgArg: 2.894 ± 0.293
3.104ArgSer: 3.104 ± 0.287
2.544ArgThr: 2.544 ± 0.253
3.384ArgVal: 3.384 ± 0.275
0.7ArgTrp: 0.7 ± 0.113
1.844ArgTyr: 1.844 ± 0.169
0.0ArgXaa: 0.0 ± 0.0
Ser
4.621SerAla: 4.621 ± 0.403
0.723SerCys: 0.723 ± 0.128
3.664SerAsp: 3.664 ± 0.28
3.501SerGlu: 3.501 ± 0.281
2.777SerPhe: 2.777 ± 0.268
4.784SerGly: 4.784 ± 0.372
1.004SerHis: 1.004 ± 0.167
3.571SerIle: 3.571 ± 0.304
4.271SerLys: 4.271 ± 0.284
4.948SerLeu: 4.948 ± 0.365
1.494SerMet: 1.494 ± 0.173
2.987SerAsn: 2.987 ± 0.279
2.217SerPro: 2.217 ± 0.227
1.797SerGln: 1.797 ± 0.199
2.917SerArg: 2.917 ± 0.245
3.687SerSer: 3.687 ± 0.302
3.641SerThr: 3.641 ± 0.302
4.411SerVal: 4.411 ± 0.359
1.027SerTrp: 1.027 ± 0.192
2.357SerTyr: 2.357 ± 0.271
0.0SerXaa: 0.0 ± 0.0
Thr
3.734ThrAla: 3.734 ± 0.356
0.583ThrCys: 0.583 ± 0.127
3.407ThrAsp: 3.407 ± 0.266
3.594ThrGlu: 3.594 ± 0.299
2.614ThrPhe: 2.614 ± 0.25
4.458ThrGly: 4.458 ± 0.3
1.19ThrHis: 1.19 ± 0.19
3.851ThrIle: 3.851 ± 0.35
3.921ThrLys: 3.921 ± 0.267
4.714ThrLeu: 4.714 ± 0.362
1.19ThrMet: 1.19 ± 0.177
2.264ThrAsn: 2.264 ± 0.269
2.987ThrPro: 2.987 ± 0.299
1.727ThrGln: 1.727 ± 0.184
2.31ThrArg: 2.31 ± 0.249
3.221ThrSer: 3.221 ± 0.254
3.477ThrThr: 3.477 ± 0.352
4.504ThrVal: 4.504 ± 0.356
1.027ThrTrp: 1.027 ± 0.191
2.31ThrTyr: 2.31 ± 0.226
0.0ThrXaa: 0.0 ± 0.0
Val
4.738ValAla: 4.738 ± 0.333
1.027ValCys: 1.027 ± 0.175
4.551ValAsp: 4.551 ± 0.343
4.668ValGlu: 4.668 ± 0.306
2.917ValPhe: 2.917 ± 0.271
4.831ValGly: 4.831 ± 0.314
1.144ValHis: 1.144 ± 0.158
4.271ValIle: 4.271 ± 0.321
5.181ValLys: 5.181 ± 0.338
5.601ValLeu: 5.601 ± 0.29
1.867ValMet: 1.867 ± 0.213
3.454ValAsn: 3.454 ± 0.27
2.17ValPro: 2.17 ± 0.229
2.147ValGln: 2.147 ± 0.247
3.687ValArg: 3.687 ± 0.282
4.458ValSer: 4.458 ± 0.333
4.854ValThr: 4.854 ± 0.382
6.068ValVal: 6.068 ± 0.492
1.12ValTrp: 1.12 ± 0.167
2.684ValTyr: 2.684 ± 0.286
0.0ValXaa: 0.0 ± 0.0
Trp
0.98TrpAla: 0.98 ± 0.167
0.187TrpCys: 0.187 ± 0.076
1.144TrpAsp: 1.144 ± 0.188
1.284TrpGlu: 1.284 ± 0.163
0.63TrpPhe: 0.63 ± 0.128
0.863TrpGly: 0.863 ± 0.141
0.303TrpHis: 0.303 ± 0.075
0.817TrpIle: 0.817 ± 0.139
1.19TrpLys: 1.19 ± 0.157
1.914TrpLeu: 1.914 ± 0.206
0.677TrpMet: 0.677 ± 0.129
0.934TrpAsn: 0.934 ± 0.162
0.327TrpPro: 0.327 ± 0.081
0.583TrpGln: 0.583 ± 0.098
0.63TrpArg: 0.63 ± 0.118
1.027TrpSer: 1.027 ± 0.14
0.863TrpThr: 0.863 ± 0.149
0.934TrpVal: 0.934 ± 0.142
0.513TrpTrp: 0.513 ± 0.135
0.84TrpTyr: 0.84 ± 0.141
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.777TyrAla: 2.777 ± 0.271
0.56TyrCys: 0.56 ± 0.121
2.941TyrAsp: 2.941 ± 0.274
2.124TyrGlu: 2.124 ± 0.229
1.54TyrPhe: 1.54 ± 0.185
2.614TyrGly: 2.614 ± 0.252
0.957TyrHis: 0.957 ± 0.141
2.287TyrIle: 2.287 ± 0.258
2.801TyrLys: 2.801 ± 0.299
3.291TyrLeu: 3.291 ± 0.275
1.167TyrMet: 1.167 ± 0.181
1.797TyrAsn: 1.797 ± 0.208
1.634TyrPro: 1.634 ± 0.184
1.354TyrGln: 1.354 ± 0.172
2.1TyrArg: 2.1 ± 0.212
2.427TyrSer: 2.427 ± 0.236
2.894TyrThr: 2.894 ± 0.265
2.567TyrVal: 2.567 ± 0.226
0.583TyrTrp: 0.583 ± 0.12
1.914TyrTyr: 1.914 ± 0.212
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 245 proteins (42850 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski