Amino acid dipepetide frequency for Dickeya phage RC-2014

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.228AlaAla: 6.228 ± 0.46
0.646AlaCys: 0.646 ± 0.122
4.569AlaAsp: 4.569 ± 0.347
4.482AlaGlu: 4.482 ± 0.355
2.608AlaPhe: 2.608 ± 0.233
5.0AlaGly: 5.0 ± 0.369
1.465AlaHis: 1.465 ± 0.196
4.504AlaIle: 4.504 ± 0.283
4.461AlaLys: 4.461 ± 0.386
5.948AlaLeu: 5.948 ± 0.438
1.746AlaMet: 1.746 ± 0.163
2.952AlaAsn: 2.952 ± 0.254
2.737AlaPro: 2.737 ± 0.262
2.737AlaGln: 2.737 ± 0.263
3.211AlaArg: 3.211 ± 0.221
4.181AlaSer: 4.181 ± 0.364
4.224AlaThr: 4.224 ± 0.372
5.387AlaVal: 5.387 ± 0.382
1.142AlaTrp: 1.142 ± 0.151
2.457AlaTyr: 2.457 ± 0.241
0.0AlaXaa: 0.0 ± 0.0
Cys
0.711CysAla: 0.711 ± 0.118
0.172CysCys: 0.172 ± 0.052
0.69CysAsp: 0.69 ± 0.12
0.646CysGlu: 0.646 ± 0.113
0.323CysPhe: 0.323 ± 0.08
0.646CysGly: 0.646 ± 0.124
0.366CysHis: 0.366 ± 0.108
0.754CysIle: 0.754 ± 0.137
0.603CysLys: 0.603 ± 0.108
0.797CysLeu: 0.797 ± 0.136
0.28CysMet: 0.28 ± 0.082
0.625CysAsn: 0.625 ± 0.116
0.474CysPro: 0.474 ± 0.082
0.409CysGln: 0.409 ± 0.085
0.668CysArg: 0.668 ± 0.115
0.862CysSer: 0.862 ± 0.154
0.625CysThr: 0.625 ± 0.113
1.034CysVal: 1.034 ± 0.162
0.108CysTrp: 0.108 ± 0.043
0.302CysTyr: 0.302 ± 0.078
0.0CysXaa: 0.0 ± 0.0
Asp
4.418AspAla: 4.418 ± 0.377
0.409AspCys: 0.409 ± 0.101
3.944AspAsp: 3.944 ± 0.391
4.569AspGlu: 4.569 ± 0.338
3.34AspPhe: 3.34 ± 0.268
4.827AspGly: 4.827 ± 0.285
1.121AspHis: 1.121 ± 0.168
4.353AspIle: 4.353 ± 0.298
3.707AspLys: 3.707 ± 0.283
5.366AspLeu: 5.366 ± 0.367
2.047AspMet: 2.047 ± 0.217
2.801AspAsn: 2.801 ± 0.281
3.103AspPro: 3.103 ± 0.279
2.09AspGln: 2.09 ± 0.173
2.629AspArg: 2.629 ± 0.243
3.103AspSer: 3.103 ± 0.25
3.814AspThr: 3.814 ± 0.291
4.375AspVal: 4.375 ± 0.287
1.056AspTrp: 1.056 ± 0.166
3.168AspTyr: 3.168 ± 0.29
0.0AspXaa: 0.0 ± 0.0
Glu
4.461GluAla: 4.461 ± 0.35
0.754GluCys: 0.754 ± 0.147
4.094GluAsp: 4.094 ± 0.302
4.806GluGlu: 4.806 ± 0.419
3.211GluPhe: 3.211 ± 0.267
4.138GluGly: 4.138 ± 0.312
1.508GluHis: 1.508 ± 0.203
3.685GluIle: 3.685 ± 0.285
3.707GluLys: 3.707 ± 0.306
5.883GluLeu: 5.883 ± 0.339
2.414GluMet: 2.414 ± 0.238
2.845GluAsn: 2.845 ± 0.265
2.047GluPro: 2.047 ± 0.213
2.521GluGln: 2.521 ± 0.208
3.879GluArg: 3.879 ± 0.37
3.491GluSer: 3.491 ± 0.264
3.707GluThr: 3.707 ± 0.246
4.827GluVal: 4.827 ± 0.299
1.142GluTrp: 1.142 ± 0.15
2.758GluTyr: 2.758 ± 0.209
0.0GluXaa: 0.0 ± 0.0
Phe
2.586PheAla: 2.586 ± 0.247
0.345PheCys: 0.345 ± 0.091
2.629PheAsp: 2.629 ± 0.26
3.211PheGlu: 3.211 ± 0.293
1.53PhePhe: 1.53 ± 0.204
3.146PheGly: 3.146 ± 0.227
0.905PheHis: 0.905 ± 0.135
2.672PheIle: 2.672 ± 0.228
2.629PheLys: 2.629 ± 0.245
3.039PheLeu: 3.039 ± 0.273
1.336PheMet: 1.336 ± 0.166
2.608PheAsn: 2.608 ± 0.253
1.616PhePro: 1.616 ± 0.202
1.659PheGln: 1.659 ± 0.159
2.177PheArg: 2.177 ± 0.245
2.888PheSer: 2.888 ± 0.253
3.06PheThr: 3.06 ± 0.267
2.629PheVal: 2.629 ± 0.244
0.646PheTrp: 0.646 ± 0.121
1.487PheTyr: 1.487 ± 0.181
0.0PheXaa: 0.0 ± 0.0
Gly
4.353GlyAla: 4.353 ± 0.345
0.97GlyCys: 0.97 ± 0.175
3.836GlyAsp: 3.836 ± 0.284
4.332GlyGlu: 4.332 ± 0.313
2.608GlyPhe: 2.608 ± 0.224
5.064GlyGly: 5.064 ± 0.572
1.293GlyHis: 1.293 ± 0.169
4.763GlyIle: 4.763 ± 0.333
5.021GlyLys: 5.021 ± 0.316
5.172GlyLeu: 5.172 ± 0.294
1.983GlyMet: 1.983 ± 0.227
3.685GlyAsn: 3.685 ± 0.326
1.271GlyPro: 1.271 ± 0.189
2.349GlyGln: 2.349 ± 0.227
2.888GlyArg: 2.888 ± 0.262
4.224GlySer: 4.224 ± 0.4
4.116GlyThr: 4.116 ± 0.349
5.581GlyVal: 5.581 ± 0.397
1.315GlyTrp: 1.315 ± 0.176
2.37GlyTyr: 2.37 ± 0.206
0.0GlyXaa: 0.0 ± 0.0
His
0.97HisAla: 0.97 ± 0.152
0.302HisCys: 0.302 ± 0.096
1.164HisAsp: 1.164 ± 0.186
0.776HisGlu: 0.776 ± 0.129
0.991HisPhe: 0.991 ± 0.156
1.271HisGly: 1.271 ± 0.189
0.517HisHis: 0.517 ± 0.121
1.315HisIle: 1.315 ± 0.186
1.379HisLys: 1.379 ± 0.181
1.552HisLeu: 1.552 ± 0.164
0.496HisMet: 0.496 ± 0.093
0.797HisAsn: 0.797 ± 0.136
1.164HisPro: 1.164 ± 0.145
0.776HisGln: 0.776 ± 0.136
1.013HisArg: 1.013 ± 0.131
1.013HisSer: 1.013 ± 0.174
1.099HisThr: 1.099 ± 0.173
1.271HisVal: 1.271 ± 0.171
0.215HisTrp: 0.215 ± 0.081
0.797HisTyr: 0.797 ± 0.137
0.0HisXaa: 0.0 ± 0.0
Ile
3.707IleAla: 3.707 ± 0.27
0.776IleCys: 0.776 ± 0.125
4.633IleAsp: 4.633 ± 0.379
4.827IleGlu: 4.827 ± 0.282
1.422IlePhe: 1.422 ± 0.167
3.599IleGly: 3.599 ± 0.253
1.164IleHis: 1.164 ± 0.177
3.276IleIle: 3.276 ± 0.294
3.857IleLys: 3.857 ± 0.342
4.267IleLeu: 4.267 ± 0.288
1.444IleMet: 1.444 ± 0.196
3.276IleAsn: 3.276 ± 0.275
3.211IlePro: 3.211 ± 0.273
2.866IleGln: 2.866 ± 0.251
3.103IleArg: 3.103 ± 0.249
3.211IleSer: 3.211 ± 0.259
4.439IleThr: 4.439 ± 0.339
4.073IleVal: 4.073 ± 0.304
0.582IleTrp: 0.582 ± 0.119
1.853IleTyr: 1.853 ± 0.228
0.0IleXaa: 0.0 ± 0.0
Lys
4.245LysAla: 4.245 ± 0.296
0.56LysCys: 0.56 ± 0.134
4.547LysAsp: 4.547 ± 0.296
4.569LysGlu: 4.569 ± 0.413
3.34LysPhe: 3.34 ± 0.284
4.138LysGly: 4.138 ± 0.352
1.013LysHis: 1.013 ± 0.141
3.556LysIle: 3.556 ± 0.305
4.288LysLys: 4.288 ± 0.371
5.517LysLeu: 5.517 ± 0.353
2.672LysMet: 2.672 ± 0.275
2.672LysAsn: 2.672 ± 0.216
2.651LysPro: 2.651 ± 0.295
2.478LysGln: 2.478 ± 0.226
3.362LysArg: 3.362 ± 0.299
3.879LysSer: 3.879 ± 0.263
3.663LysThr: 3.663 ± 0.278
4.569LysVal: 4.569 ± 0.33
0.905LysTrp: 0.905 ± 0.148
2.478LysTyr: 2.478 ± 0.282
0.0LysXaa: 0.0 ± 0.0
Leu
6.271LeuAla: 6.271 ± 0.37
0.84LeuCys: 0.84 ± 0.141
5.387LeuAsp: 5.387 ± 0.419
4.827LeuGlu: 4.827 ± 0.307
3.168LeuPhe: 3.168 ± 0.294
5.0LeuGly: 5.0 ± 0.306
1.315LeuHis: 1.315 ± 0.181
3.987LeuIle: 3.987 ± 0.257
6.357LeuLys: 6.357 ± 0.344
6.637LeuLeu: 6.637 ± 0.426
2.069LeuMet: 2.069 ± 0.201
4.547LeuAsn: 4.547 ± 0.287
3.599LeuPro: 3.599 ± 0.266
2.888LeuGln: 2.888 ± 0.308
3.728LeuArg: 3.728 ± 0.271
5.387LeuSer: 5.387 ± 0.305
4.978LeuThr: 4.978 ± 0.364
5.732LeuVal: 5.732 ± 0.334
0.862LeuTrp: 0.862 ± 0.155
2.909LeuTyr: 2.909 ± 0.268
0.0LeuXaa: 0.0 ± 0.0
Met
2.457MetAla: 2.457 ± 0.265
0.302MetCys: 0.302 ± 0.083
1.616MetAsp: 1.616 ± 0.187
1.487MetGlu: 1.487 ± 0.213
1.638MetPhe: 1.638 ± 0.189
1.638MetGly: 1.638 ± 0.185
0.453MetHis: 0.453 ± 0.114
1.465MetIle: 1.465 ± 0.15
2.263MetLys: 2.263 ± 0.227
2.414MetLeu: 2.414 ± 0.248
1.034MetMet: 1.034 ± 0.163
1.465MetAsn: 1.465 ± 0.174
1.121MetPro: 1.121 ± 0.161
1.099MetGln: 1.099 ± 0.166
1.659MetArg: 1.659 ± 0.202
2.177MetSer: 2.177 ± 0.21
1.681MetThr: 1.681 ± 0.186
2.004MetVal: 2.004 ± 0.205
0.431MetTrp: 0.431 ± 0.09
0.862MetTyr: 0.862 ± 0.124
0.0MetXaa: 0.0 ± 0.0
Asn
3.922AsnAla: 3.922 ± 0.315
0.646AsnCys: 0.646 ± 0.101
2.478AsnAsp: 2.478 ± 0.246
2.457AsnGlu: 2.457 ± 0.236
2.241AsnPhe: 2.241 ± 0.21
4.116AsnGly: 4.116 ± 0.284
1.034AsnHis: 1.034 ± 0.153
3.039AsnIle: 3.039 ± 0.257
2.823AsnLys: 2.823 ± 0.225
3.836AsnLeu: 3.836 ± 0.27
1.379AsnMet: 1.379 ± 0.152
2.758AsnAsn: 2.758 ± 0.266
2.866AsnPro: 2.866 ± 0.281
1.832AsnGln: 1.832 ± 0.21
2.888AsnArg: 2.888 ± 0.28
2.715AsnSer: 2.715 ± 0.235
3.125AsnThr: 3.125 ± 0.346
3.663AsnVal: 3.663 ± 0.279
0.668AsnTrp: 0.668 ± 0.14
1.767AsnTyr: 1.767 ± 0.21
0.0AsnXaa: 0.0 ± 0.0
Pro
2.974ProAla: 2.974 ± 0.251
0.366ProCys: 0.366 ± 0.081
2.974ProAsp: 2.974 ± 0.218
3.534ProGlu: 3.534 ± 0.346
1.789ProPhe: 1.789 ± 0.209
2.392ProGly: 2.392 ± 0.22
0.625ProHis: 0.625 ± 0.115
1.81ProIle: 1.81 ± 0.214
2.414ProLys: 2.414 ± 0.216
3.362ProLeu: 3.362 ± 0.229
1.185ProMet: 1.185 ± 0.144
2.004ProAsn: 2.004 ± 0.203
1.056ProPro: 1.056 ± 0.161
1.465ProGln: 1.465 ± 0.165
1.681ProArg: 1.681 ± 0.193
2.995ProSer: 2.995 ± 0.26
2.586ProThr: 2.586 ± 0.31
3.017ProVal: 3.017 ± 0.267
0.625ProTrp: 0.625 ± 0.106
1.315ProTyr: 1.315 ± 0.179
0.0ProXaa: 0.0 ± 0.0
Gln
2.823GlnAla: 2.823 ± 0.262
0.345GlnCys: 0.345 ± 0.082
2.457GlnAsp: 2.457 ± 0.229
2.22GlnGlu: 2.22 ± 0.282
1.939GlnPhe: 1.939 ± 0.232
2.306GlnGly: 2.306 ± 0.188
0.711GlnHis: 0.711 ± 0.118
2.263GlnIle: 2.263 ± 0.197
2.543GlnLys: 2.543 ± 0.235
2.931GlnLeu: 2.931 ± 0.235
1.142GlnMet: 1.142 ± 0.131
1.681GlnAsn: 1.681 ± 0.191
1.379GlnPro: 1.379 ± 0.137
1.961GlnGln: 1.961 ± 0.244
1.853GlnArg: 1.853 ± 0.173
2.392GlnSer: 2.392 ± 0.254
2.478GlnThr: 2.478 ± 0.207
2.327GlnVal: 2.327 ± 0.219
0.431GlnTrp: 0.431 ± 0.095
1.53GlnTyr: 1.53 ± 0.174
0.0GlnXaa: 0.0 ± 0.0
Arg
3.125ArgAla: 3.125 ± 0.237
0.754ArgCys: 0.754 ± 0.15
3.189ArgAsp: 3.189 ± 0.298
3.125ArgGlu: 3.125 ± 0.234
2.306ArgPhe: 2.306 ± 0.244
3.017ArgGly: 3.017 ± 0.249
1.013ArgHis: 1.013 ± 0.157
3.383ArgIle: 3.383 ± 0.289
3.189ArgLys: 3.189 ± 0.326
4.482ArgLeu: 4.482 ± 0.375
1.875ArgMet: 1.875 ± 0.222
2.414ArgAsn: 2.414 ± 0.22
1.616ArgPro: 1.616 ± 0.189
1.81ArgGln: 1.81 ± 0.176
2.823ArgArg: 2.823 ± 0.274
2.888ArgSer: 2.888 ± 0.234
2.758ArgThr: 2.758 ± 0.218
3.297ArgVal: 3.297 ± 0.323
0.668ArgTrp: 0.668 ± 0.12
2.198ArgTyr: 2.198 ± 0.225
0.0ArgXaa: 0.0 ± 0.0
Ser
4.03SerAla: 4.03 ± 0.295
0.517SerCys: 0.517 ± 0.138
3.491SerAsp: 3.491 ± 0.283
3.814SerGlu: 3.814 ± 0.297
2.758SerPhe: 2.758 ± 0.241
4.698SerGly: 4.698 ± 0.459
1.142SerHis: 1.142 ± 0.21
4.008SerIle: 4.008 ± 0.3
4.094SerLys: 4.094 ± 0.334
4.892SerLeu: 4.892 ± 0.327
1.508SerMet: 1.508 ± 0.164
3.663SerAsn: 3.663 ± 0.324
2.349SerPro: 2.349 ± 0.239
2.177SerGln: 2.177 ± 0.22
2.888SerArg: 2.888 ± 0.244
4.482SerSer: 4.482 ± 0.412
3.319SerThr: 3.319 ± 0.313
4.138SerVal: 4.138 ± 0.319
0.711SerTrp: 0.711 ± 0.12
2.435SerTyr: 2.435 ± 0.28
0.0SerXaa: 0.0 ± 0.0
Thr
4.504ThrAla: 4.504 ± 0.422
0.646ThrCys: 0.646 ± 0.118
3.663ThrAsp: 3.663 ± 0.331
4.094ThrGlu: 4.094 ± 0.282
2.608ThrPhe: 2.608 ± 0.263
4.633ThrGly: 4.633 ± 0.348
0.84ThrHis: 0.84 ± 0.139
3.944ThrIle: 3.944 ± 0.357
3.922ThrLys: 3.922 ± 0.237
4.892ThrLeu: 4.892 ± 0.388
1.228ThrMet: 1.228 ± 0.149
3.082ThrAsn: 3.082 ± 0.272
3.534ThrPro: 3.534 ± 0.318
2.004ThrGln: 2.004 ± 0.242
3.254ThrArg: 3.254 ± 0.285
3.491ThrSer: 3.491 ± 0.265
3.836ThrThr: 3.836 ± 0.398
4.612ThrVal: 4.612 ± 0.405
0.776ThrTrp: 0.776 ± 0.136
1.702ThrTyr: 1.702 ± 0.229
0.0ThrXaa: 0.0 ± 0.0
Val
5.043ValAla: 5.043 ± 0.356
0.97ValCys: 0.97 ± 0.198
5.172ValAsp: 5.172 ± 0.344
4.849ValGlu: 4.849 ± 0.374
2.823ValPhe: 2.823 ± 0.268
4.31ValGly: 4.31 ± 0.358
1.228ValHis: 1.228 ± 0.166
3.965ValIle: 3.965 ± 0.284
5.043ValLys: 5.043 ± 0.38
5.194ValLeu: 5.194 ± 0.317
2.09ValMet: 2.09 ± 0.224
3.513ValAsn: 3.513 ± 0.306
2.543ValPro: 2.543 ± 0.262
2.758ValGln: 2.758 ± 0.215
3.189ValArg: 3.189 ± 0.279
4.827ValSer: 4.827 ± 0.364
5.172ValThr: 5.172 ± 0.41
6.012ValVal: 6.012 ± 0.486
1.164ValTrp: 1.164 ± 0.161
2.974ValTyr: 2.974 ± 0.277
0.0ValXaa: 0.0 ± 0.0
Trp
0.991TrpAla: 0.991 ± 0.167
0.194TrpCys: 0.194 ± 0.066
0.884TrpAsp: 0.884 ± 0.144
1.207TrpGlu: 1.207 ± 0.128
0.582TrpPhe: 0.582 ± 0.105
0.776TrpGly: 0.776 ± 0.11
0.194TrpHis: 0.194 ± 0.063
0.754TrpIle: 0.754 ± 0.127
0.948TrpLys: 0.948 ± 0.202
1.444TrpLeu: 1.444 ± 0.164
0.366TrpMet: 0.366 ± 0.099
0.862TrpAsn: 0.862 ± 0.13
0.388TrpPro: 0.388 ± 0.093
0.345TrpGln: 0.345 ± 0.081
0.927TrpArg: 0.927 ± 0.146
0.733TrpSer: 0.733 ± 0.122
0.711TrpThr: 0.711 ± 0.116
1.164TrpVal: 1.164 ± 0.159
0.172TrpTrp: 0.172 ± 0.049
0.56TrpTyr: 0.56 ± 0.108
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.866TyrAla: 2.866 ± 0.241
0.582TyrCys: 0.582 ± 0.131
2.823TyrAsp: 2.823 ± 0.271
2.047TyrGlu: 2.047 ± 0.201
1.573TyrPhe: 1.573 ± 0.178
2.478TyrGly: 2.478 ± 0.293
0.97TyrHis: 0.97 ± 0.163
2.09TyrIle: 2.09 ± 0.19
1.853TyrLys: 1.853 ± 0.194
2.866TyrLeu: 2.866 ± 0.228
0.948TyrMet: 0.948 ± 0.13
2.004TyrAsn: 2.004 ± 0.173
1.444TyrPro: 1.444 ± 0.183
1.487TyrGln: 1.487 ± 0.184
2.112TyrArg: 2.112 ± 0.223
2.198TyrSer: 2.198 ± 0.211
1.875TyrThr: 1.875 ± 0.269
3.125TyrVal: 3.125 ± 0.244
0.582TyrTrp: 0.582 ± 0.11
1.379TyrTyr: 1.379 ± 0.16
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 196 proteins (46405 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski