Amino acid dipepetide frequency for Salmonella phage NR01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.823AlaAla: 7.823 ± 1.174
0.585AlaCys: 0.585 ± 0.136
3.727AlaAsp: 3.727 ± 0.398
5.113AlaGlu: 5.113 ± 0.517
3.357AlaPhe: 3.357 ± 0.276
5.544AlaGly: 5.544 ± 0.506
1.571AlaHis: 1.571 ± 0.236
4.928AlaIle: 4.928 ± 0.358
6.376AlaLys: 6.376 ± 0.523
6.468AlaLeu: 6.468 ± 0.481
1.848AlaMet: 1.848 ± 0.262
3.573AlaAsn: 3.573 ± 0.447
2.372AlaPro: 2.372 ± 0.315
3.511AlaGln: 3.511 ± 0.368
3.049AlaArg: 3.049 ± 0.318
4.281AlaSer: 4.281 ± 0.474
3.604AlaThr: 3.604 ± 0.421
4.281AlaVal: 4.281 ± 0.432
0.832AlaTrp: 0.832 ± 0.142
2.618AlaTyr: 2.618 ± 0.32
0.0AlaXaa: 0.0 ± 0.0
Cys
0.708CysAla: 0.708 ± 0.145
0.308CysCys: 0.308 ± 0.11
0.524CysAsp: 0.524 ± 0.13
0.678CysGlu: 0.678 ± 0.148
0.524CysPhe: 0.524 ± 0.133
0.832CysGly: 0.832 ± 0.2
0.246CysHis: 0.246 ± 0.081
0.616CysIle: 0.616 ± 0.128
0.739CysLys: 0.739 ± 0.158
0.893CysLeu: 0.893 ± 0.173
0.308CysMet: 0.308 ± 0.09
0.616CysAsn: 0.616 ± 0.121
0.616CysPro: 0.616 ± 0.169
0.462CysGln: 0.462 ± 0.128
0.524CysArg: 0.524 ± 0.129
1.14CysSer: 1.14 ± 0.238
0.585CysThr: 0.585 ± 0.149
0.524CysVal: 0.524 ± 0.135
0.185CysTrp: 0.185 ± 0.073
0.37CysTyr: 0.37 ± 0.101
0.0CysXaa: 0.0 ± 0.0
Asp
4.404AspAla: 4.404 ± 0.383
0.585AspCys: 0.585 ± 0.144
2.772AspAsp: 2.772 ± 0.403
4.62AspGlu: 4.62 ± 0.358
2.618AspPhe: 2.618 ± 0.312
3.727AspGly: 3.727 ± 0.328
1.232AspHis: 1.232 ± 0.199
4.343AspIle: 4.343 ± 0.377
4.682AspLys: 4.682 ± 0.338
5.606AspLeu: 5.606 ± 0.536
1.694AspMet: 1.694 ± 0.209
2.587AspAsn: 2.587 ± 0.281
2.649AspPro: 2.649 ± 0.244
1.324AspGln: 1.324 ± 0.211
2.587AspArg: 2.587 ± 0.306
3.727AspSer: 3.727 ± 0.371
3.542AspThr: 3.542 ± 0.322
3.234AspVal: 3.234 ± 0.312
0.708AspTrp: 0.708 ± 0.159
2.618AspTyr: 2.618 ± 0.312
0.0AspXaa: 0.0 ± 0.0
Glu
5.76GluAla: 5.76 ± 0.364
0.924GluCys: 0.924 ± 0.176
3.604GluAsp: 3.604 ± 0.32
5.02GluGlu: 5.02 ± 0.386
2.464GluPhe: 2.464 ± 0.246
3.45GluGly: 3.45 ± 0.311
1.232GluHis: 1.232 ± 0.161
4.866GluIle: 4.866 ± 0.363
4.805GluLys: 4.805 ± 0.41
7.146GluLeu: 7.146 ± 0.409
2.341GluMet: 2.341 ± 0.297
3.172GluAsn: 3.172 ± 0.307
1.509GluPro: 1.509 ± 0.26
2.834GluGln: 2.834 ± 0.344
3.049GluArg: 3.049 ± 0.342
3.788GluSer: 3.788 ± 0.31
3.634GluThr: 3.634 ± 0.321
4.25GluVal: 4.25 ± 0.356
1.14GluTrp: 1.14 ± 0.194
3.234GluTyr: 3.234 ± 0.33
0.0GluXaa: 0.0 ± 0.0
Phe
2.618PheAla: 2.618 ± 0.279
0.4PheCys: 0.4 ± 0.121
2.772PheAsp: 2.772 ± 0.293
2.649PheGlu: 2.649 ± 0.282
1.478PhePhe: 1.478 ± 0.208
2.71PheGly: 2.71 ± 0.355
1.232PheHis: 1.232 ± 0.197
2.526PheIle: 2.526 ± 0.278
3.234PheLys: 3.234 ± 0.334
3.388PheLeu: 3.388 ± 0.342
0.678PheMet: 0.678 ± 0.126
2.741PheAsn: 2.741 ± 0.301
1.324PhePro: 1.324 ± 0.218
0.862PheGln: 0.862 ± 0.141
1.786PheArg: 1.786 ± 0.314
3.08PheSer: 3.08 ± 0.299
2.279PheThr: 2.279 ± 0.226
2.71PheVal: 2.71 ± 0.275
0.37PheTrp: 0.37 ± 0.097
1.54PheTyr: 1.54 ± 0.183
0.0PheXaa: 0.0 ± 0.0
Gly
4.805GlyAla: 4.805 ± 0.657
0.986GlyCys: 0.986 ± 0.196
3.573GlyAsp: 3.573 ± 0.384
4.528GlyGlu: 4.528 ± 0.35
3.203GlyPhe: 3.203 ± 0.314
4.035GlyGly: 4.035 ± 0.568
1.17GlyHis: 1.17 ± 0.214
4.682GlyIle: 4.682 ± 0.37
4.836GlyLys: 4.836 ± 0.347
4.805GlyLeu: 4.805 ± 0.342
1.509GlyMet: 1.509 ± 0.249
3.573GlyAsn: 3.573 ± 0.472
0.986GlyPro: 0.986 ± 0.173
2.31GlyGln: 2.31 ± 0.253
2.895GlyArg: 2.895 ± 0.309
4.528GlySer: 4.528 ± 0.44
4.312GlyThr: 4.312 ± 0.498
4.62GlyVal: 4.62 ± 0.376
1.016GlyTrp: 1.016 ± 0.215
3.542GlyTyr: 3.542 ± 0.411
0.0GlyXaa: 0.0 ± 0.0
His
1.294HisAla: 1.294 ± 0.176
0.4HisCys: 0.4 ± 0.106
1.14HisAsp: 1.14 ± 0.217
1.016HisGlu: 1.016 ± 0.165
0.493HisPhe: 0.493 ± 0.146
1.478HisGly: 1.478 ± 0.205
0.678HisHis: 0.678 ± 0.16
1.571HisIle: 1.571 ± 0.242
1.324HisLys: 1.324 ± 0.231
1.848HisLeu: 1.848 ± 0.252
0.339HisMet: 0.339 ± 0.111
1.047HisAsn: 1.047 ± 0.216
0.801HisPro: 0.801 ± 0.135
0.493HisGln: 0.493 ± 0.127
1.324HisArg: 1.324 ± 0.189
1.14HisSer: 1.14 ± 0.202
0.893HisThr: 0.893 ± 0.158
0.739HisVal: 0.739 ± 0.162
0.092HisTrp: 0.092 ± 0.05
0.554HisTyr: 0.554 ± 0.137
0.0HisXaa: 0.0 ± 0.0
Ile
4.712IleAla: 4.712 ± 0.362
0.893IleCys: 0.893 ± 0.189
4.774IleAsp: 4.774 ± 0.401
4.404IleGlu: 4.404 ± 0.355
2.31IlePhe: 2.31 ± 0.279
3.973IleGly: 3.973 ± 0.374
1.263IleHis: 1.263 ± 0.177
4.035IleIle: 4.035 ± 0.36
4.682IleLys: 4.682 ± 0.372
5.359IleLeu: 5.359 ± 0.423
1.786IleMet: 1.786 ± 0.227
4.25IleAsn: 4.25 ± 0.405
2.895IlePro: 2.895 ± 0.326
2.187IleGln: 2.187 ± 0.229
3.111IleArg: 3.111 ± 0.352
4.312IleSer: 4.312 ± 0.28
4.836IleThr: 4.836 ± 0.386
3.45IleVal: 3.45 ± 0.283
0.739IleTrp: 0.739 ± 0.176
2.187IleTyr: 2.187 ± 0.257
0.0IleXaa: 0.0 ± 0.0
Lys
6.468LysAla: 6.468 ± 0.418
0.678LysCys: 0.678 ± 0.169
4.743LysAsp: 4.743 ± 0.416
4.928LysGlu: 4.928 ± 0.452
3.172LysPhe: 3.172 ± 0.36
3.634LysGly: 3.634 ± 0.335
1.201LysHis: 1.201 ± 0.224
4.004LysIle: 4.004 ± 0.334
4.928LysLys: 4.928 ± 0.5
6.776LysLeu: 6.776 ± 0.431
2.125LysMet: 2.125 ± 0.285
3.85LysAsn: 3.85 ± 0.38
2.341LysPro: 2.341 ± 0.318
3.049LysGln: 3.049 ± 0.392
3.326LysArg: 3.326 ± 0.293
4.528LysSer: 4.528 ± 0.412
4.066LysThr: 4.066 ± 0.419
4.682LysVal: 4.682 ± 0.427
0.893LysTrp: 0.893 ± 0.173
3.573LysTyr: 3.573 ± 0.279
0.0LysXaa: 0.0 ± 0.0
Leu
7.207LeuAla: 7.207 ± 0.594
0.708LeuCys: 0.708 ± 0.158
6.283LeuAsp: 6.283 ± 0.388
7.423LeuGlu: 7.423 ± 0.508
2.864LeuPhe: 2.864 ± 0.313
5.328LeuGly: 5.328 ± 0.475
1.786LeuHis: 1.786 ± 0.241
5.236LeuIle: 5.236 ± 0.384
5.883LeuLys: 5.883 ± 0.374
6.376LeuLeu: 6.376 ± 0.54
1.94LeuMet: 1.94 ± 0.253
5.267LeuAsn: 5.267 ± 0.438
3.573LeuPro: 3.573 ± 0.372
3.172LeuGln: 3.172 ± 0.345
4.22LeuArg: 4.22 ± 0.321
4.897LeuSer: 4.897 ± 0.47
4.651LeuThr: 4.651 ± 0.389
5.421LeuVal: 5.421 ± 0.447
0.678LeuTrp: 0.678 ± 0.126
3.018LeuTyr: 3.018 ± 0.338
0.0LeuXaa: 0.0 ± 0.0
Met
1.448MetAla: 1.448 ± 0.217
0.339MetCys: 0.339 ± 0.1
1.201MetAsp: 1.201 ± 0.213
1.94MetGlu: 1.94 ± 0.251
0.708MetPhe: 0.708 ± 0.151
1.386MetGly: 1.386 ± 0.201
0.37MetHis: 0.37 ± 0.115
1.848MetIle: 1.848 ± 0.29
2.094MetLys: 2.094 ± 0.26
2.187MetLeu: 2.187 ± 0.283
0.524MetMet: 0.524 ± 0.147
1.109MetAsn: 1.109 ± 0.158
0.862MetPro: 0.862 ± 0.144
1.078MetGln: 1.078 ± 0.189
1.016MetArg: 1.016 ± 0.174
2.094MetSer: 2.094 ± 0.271
1.632MetThr: 1.632 ± 0.213
1.324MetVal: 1.324 ± 0.229
0.154MetTrp: 0.154 ± 0.062
0.924MetTyr: 0.924 ± 0.164
0.0MetXaa: 0.0 ± 0.0
Asn
3.234AsnAla: 3.234 ± 0.412
0.647AsnCys: 0.647 ± 0.178
2.402AsnAsp: 2.402 ± 0.286
2.926AsnGlu: 2.926 ± 0.335
2.002AsnPhe: 2.002 ± 0.259
4.99AsnGly: 4.99 ± 0.493
0.708AsnHis: 0.708 ± 0.177
4.22AsnIle: 4.22 ± 0.384
4.25AsnLys: 4.25 ± 0.39
4.866AsnLeu: 4.866 ± 0.472
1.047AsnMet: 1.047 ± 0.162
3.881AsnAsn: 3.881 ± 0.386
2.649AsnPro: 2.649 ± 0.3
1.848AsnGln: 1.848 ± 0.364
2.895AsnArg: 2.895 ± 0.338
4.066AsnSer: 4.066 ± 0.328
3.357AsnThr: 3.357 ± 0.316
3.634AsnVal: 3.634 ± 0.381
0.647AsnTrp: 0.647 ± 0.156
2.064AsnTyr: 2.064 ± 0.283
0.0AsnXaa: 0.0 ± 0.0
Pro
2.649ProAla: 2.649 ± 0.315
0.431ProCys: 0.431 ± 0.105
2.68ProAsp: 2.68 ± 0.318
3.018ProGlu: 3.018 ± 0.314
1.448ProPhe: 1.448 ± 0.229
1.971ProGly: 1.971 ± 0.222
0.616ProHis: 0.616 ± 0.132
2.248ProIle: 2.248 ± 0.32
2.156ProLys: 2.156 ± 0.268
2.218ProLeu: 2.218 ± 0.245
0.4ProMet: 0.4 ± 0.118
2.094ProAsn: 2.094 ± 0.29
1.386ProPro: 1.386 ± 0.244
0.924ProGln: 0.924 ± 0.173
1.386ProArg: 1.386 ± 0.218
2.248ProSer: 2.248 ± 0.271
2.125ProThr: 2.125 ± 0.267
2.433ProVal: 2.433 ± 0.261
0.431ProTrp: 0.431 ± 0.119
1.694ProTyr: 1.694 ± 0.252
0.0ProXaa: 0.0 ± 0.0
Gln
2.834GlnAla: 2.834 ± 0.413
0.431GlnCys: 0.431 ± 0.107
2.033GlnAsp: 2.033 ± 0.342
3.018GlnGlu: 3.018 ± 0.354
1.694GlnPhe: 1.694 ± 0.213
1.91GlnGly: 1.91 ± 0.189
0.431GlnHis: 0.431 ± 0.124
2.587GlnIle: 2.587 ± 0.358
2.464GlnLys: 2.464 ± 0.38
3.819GlnLeu: 3.819 ± 0.421
1.109GlnMet: 1.109 ± 0.208
1.817GlnAsn: 1.817 ± 0.222
0.462GlnPro: 0.462 ± 0.124
2.433GlnGln: 2.433 ± 0.472
1.756GlnArg: 1.756 ± 0.242
2.064GlnSer: 2.064 ± 0.263
1.879GlnThr: 1.879 ± 0.281
2.71GlnVal: 2.71 ± 0.281
0.554GlnTrp: 0.554 ± 0.163
1.448GlnTyr: 1.448 ± 0.193
0.0GlnXaa: 0.0 ± 0.0
Arg
3.172ArgAla: 3.172 ± 0.31
0.4ArgCys: 0.4 ± 0.121
3.049ArgAsp: 3.049 ± 0.322
3.142ArgGlu: 3.142 ± 0.35
1.602ArgPhe: 1.602 ± 0.225
3.634ArgGly: 3.634 ± 0.254
0.647ArgHis: 0.647 ± 0.157
2.71ArgIle: 2.71 ± 0.337
3.419ArgLys: 3.419 ± 0.466
4.312ArgLeu: 4.312 ± 0.303
1.263ArgMet: 1.263 ± 0.233
2.957ArgAsn: 2.957 ± 0.267
1.478ArgPro: 1.478 ± 0.21
1.91ArgGln: 1.91 ± 0.232
2.464ArgArg: 2.464 ± 0.303
2.495ArgSer: 2.495 ± 0.269
2.526ArgThr: 2.526 ± 0.243
2.649ArgVal: 2.649 ± 0.278
0.708ArgTrp: 0.708 ± 0.167
1.817ArgTyr: 1.817 ± 0.294
0.0ArgXaa: 0.0 ± 0.0
Ser
4.343SerAla: 4.343 ± 0.522
0.739SerCys: 0.739 ± 0.163
3.111SerAsp: 3.111 ± 0.344
3.234SerGlu: 3.234 ± 0.348
3.049SerPhe: 3.049 ± 0.269
5.39SerGly: 5.39 ± 0.438
0.832SerHis: 0.832 ± 0.192
4.528SerIle: 4.528 ± 0.427
4.774SerLys: 4.774 ± 0.411
6.16SerLeu: 6.16 ± 0.46
1.632SerMet: 1.632 ± 0.208
3.912SerAsn: 3.912 ± 0.346
2.156SerPro: 2.156 ± 0.268
1.848SerGln: 1.848 ± 0.245
3.172SerArg: 3.172 ± 0.345
4.066SerSer: 4.066 ± 0.336
3.542SerThr: 3.542 ± 0.31
4.035SerVal: 4.035 ± 0.35
1.078SerTrp: 1.078 ± 0.165
2.587SerTyr: 2.587 ± 0.213
0.0SerXaa: 0.0 ± 0.0
Thr
4.281ThrAla: 4.281 ± 0.512
0.524ThrCys: 0.524 ± 0.129
2.834ThrAsp: 2.834 ± 0.299
3.604ThrGlu: 3.604 ± 0.315
2.495ThrPhe: 2.495 ± 0.303
5.051ThrGly: 5.051 ± 0.514
0.862ThrHis: 0.862 ± 0.178
3.85ThrIle: 3.85 ± 0.393
3.758ThrLys: 3.758 ± 0.384
4.497ThrLeu: 4.497 ± 0.383
1.201ThrMet: 1.201 ± 0.204
2.926ThrAsn: 2.926 ± 0.295
2.464ThrPro: 2.464 ± 0.247
2.279ThrGln: 2.279 ± 0.216
2.372ThrArg: 2.372 ± 0.287
3.85ThrSer: 3.85 ± 0.387
3.357ThrThr: 3.357 ± 0.355
4.127ThrVal: 4.127 ± 0.382
0.862ThrTrp: 0.862 ± 0.21
2.064ThrTyr: 2.064 ± 0.276
0.0ThrXaa: 0.0 ± 0.0
Val
4.897ValAla: 4.897 ± 0.376
0.647ValCys: 0.647 ± 0.16
4.374ValAsp: 4.374 ± 0.334
3.85ValGlu: 3.85 ± 0.372
2.402ValPhe: 2.402 ± 0.257
3.604ValGly: 3.604 ± 0.354
1.263ValHis: 1.263 ± 0.208
3.881ValIle: 3.881 ± 0.362
4.497ValLys: 4.497 ± 0.35
4.62ValLeu: 4.62 ± 0.323
1.232ValMet: 1.232 ± 0.17
3.45ValAsn: 3.45 ± 0.402
2.433ValPro: 2.433 ± 0.262
2.618ValGln: 2.618 ± 0.309
2.926ValArg: 2.926 ± 0.293
4.312ValSer: 4.312 ± 0.377
3.573ValThr: 3.573 ± 0.367
4.189ValVal: 4.189 ± 0.406
0.801ValTrp: 0.801 ± 0.172
2.526ValTyr: 2.526 ± 0.276
0.0ValXaa: 0.0 ± 0.0
Trp
0.462TrpAla: 0.462 ± 0.131
0.246TrpCys: 0.246 ± 0.084
0.832TrpAsp: 0.832 ± 0.159
1.109TrpGlu: 1.109 ± 0.174
0.616TrpPhe: 0.616 ± 0.167
0.708TrpGly: 0.708 ± 0.168
0.216TrpHis: 0.216 ± 0.08
0.708TrpIle: 0.708 ± 0.149
1.047TrpLys: 1.047 ± 0.133
1.324TrpLeu: 1.324 ± 0.228
0.37TrpMet: 0.37 ± 0.105
0.77TrpAsn: 0.77 ± 0.156
0.277TrpPro: 0.277 ± 0.084
0.739TrpGln: 0.739 ± 0.16
0.616TrpArg: 0.616 ± 0.137
0.77TrpSer: 0.77 ± 0.188
0.678TrpThr: 0.678 ± 0.164
0.678TrpVal: 0.678 ± 0.146
0.216TrpTrp: 0.216 ± 0.087
0.339TrpTyr: 0.339 ± 0.102
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.341TyrAla: 2.341 ± 0.256
0.524TyrCys: 0.524 ± 0.116
2.834TyrAsp: 2.834 ± 0.338
2.002TyrGlu: 2.002 ± 0.261
1.879TyrPhe: 1.879 ± 0.281
2.556TyrGly: 2.556 ± 0.273
1.109TyrHis: 1.109 ± 0.167
2.772TyrIle: 2.772 ± 0.285
3.018TyrLys: 3.018 ± 0.266
3.265TyrLeu: 3.265 ± 0.413
0.893TyrMet: 0.893 ± 0.162
2.741TyrAsn: 2.741 ± 0.304
1.324TyrPro: 1.324 ± 0.191
1.571TyrGln: 1.571 ± 0.282
1.879TyrArg: 1.879 ± 0.236
2.803TyrSer: 2.803 ± 0.298
2.187TyrThr: 2.187 ± 0.242
2.402TyrVal: 2.402 ± 0.313
0.616TyrTrp: 0.616 ± 0.134
1.725TyrTyr: 1.725 ± 0.291
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 148 proteins (32468 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski