Amino acid dipepetide frequency for Streptomyces phage TP1604

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
22.602AlaAla: 22.602 ± 1.642
1.09AlaCys: 1.09 ± 0.279
8.261AlaAsp: 8.261 ± 0.906
10.67AlaGlu: 10.67 ± 0.824
4.016AlaPhe: 4.016 ± 0.399
11.531AlaGly: 11.531 ± 0.799
2.581AlaHis: 2.581 ± 0.32
3.901AlaIle: 3.901 ± 0.605
3.671AlaLys: 3.671 ± 0.531
10.441AlaLeu: 10.441 ± 1.06
3.499AlaMet: 3.499 ± 0.403
3.385AlaAsn: 3.385 ± 0.581
6.31AlaPro: 6.31 ± 0.859
4.188AlaGln: 4.188 ± 0.41
8.547AlaArg: 8.547 ± 0.882
7.802AlaSer: 7.802 ± 0.689
9.523AlaThr: 9.523 ± 0.811
10.383AlaVal: 10.383 ± 1.013
1.492AlaTrp: 1.492 ± 0.296
2.811AlaTyr: 2.811 ± 0.402
0.0AlaXaa: 0.0 ± 0.0
Cys
2.18CysAla: 2.18 ± 0.35
0.516CysCys: 0.516 ± 0.215
0.918CysAsp: 0.918 ± 0.275
0.287CysGlu: 0.287 ± 0.134
0.344CysPhe: 0.344 ± 0.131
2.295CysGly: 2.295 ± 0.541
0.516CysHis: 0.516 ± 0.188
0.459CysIle: 0.459 ± 0.185
0.287CysLys: 0.287 ± 0.121
1.09CysLeu: 1.09 ± 0.314
0.344CysMet: 0.344 ± 0.134
0.402CysAsn: 0.402 ± 0.124
1.262CysPro: 1.262 ± 0.323
0.688CysGln: 0.688 ± 0.177
0.918CysArg: 0.918 ± 0.229
0.631CysSer: 0.631 ± 0.203
1.033CysThr: 1.033 ± 0.286
1.319CysVal: 1.319 ± 0.278
0.344CysTrp: 0.344 ± 0.127
0.402CysTyr: 0.402 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
9.81AspAla: 9.81 ± 0.68
1.319AspCys: 1.319 ± 0.326
4.761AspAsp: 4.761 ± 0.889
3.844AspGlu: 3.844 ± 0.5
1.836AspPhe: 1.836 ± 0.295
5.622AspGly: 5.622 ± 0.667
1.319AspHis: 1.319 ± 0.335
2.18AspIle: 2.18 ± 0.384
0.975AspLys: 0.975 ± 0.294
5.737AspLeu: 5.737 ± 0.66
1.09AspMet: 1.09 ± 0.283
1.205AspAsn: 1.205 ± 0.204
3.729AspPro: 3.729 ± 0.576
2.237AspGln: 2.237 ± 0.315
4.933AspArg: 4.933 ± 0.634
2.868AspSer: 2.868 ± 0.379
3.327AspThr: 3.327 ± 0.319
5.966AspVal: 5.966 ± 0.579
0.918AspTrp: 0.918 ± 0.284
1.033AspTyr: 1.033 ± 0.252
0.0AspXaa: 0.0 ± 0.0
Glu
7.974GluAla: 7.974 ± 0.805
1.377GluCys: 1.377 ± 0.283
3.671GluAsp: 3.671 ± 0.513
1.319GluGlu: 1.319 ± 0.286
1.377GluPhe: 1.377 ± 0.28
4.016GluGly: 4.016 ± 0.383
0.975GluHis: 0.975 ± 0.234
1.147GluIle: 1.147 ± 0.316
0.975GluLys: 0.975 ± 0.181
5.737GluLeu: 5.737 ± 0.619
0.975GluMet: 0.975 ± 0.212
0.631GluAsn: 0.631 ± 0.181
3.385GluPro: 3.385 ± 0.444
1.664GluGln: 1.664 ± 0.306
4.647GluArg: 4.647 ± 0.561
2.524GluSer: 2.524 ± 0.326
3.499GluThr: 3.499 ± 0.447
3.901GluVal: 3.901 ± 0.461
1.033GluTrp: 1.033 ± 0.234
0.918GluTyr: 0.918 ± 0.264
0.0GluXaa: 0.0 ± 0.0
Phe
3.442PheAla: 3.442 ± 0.37
0.229PheCys: 0.229 ± 0.121
2.065PheAsp: 2.065 ± 0.452
1.95PheGlu: 1.95 ± 0.314
0.746PhePhe: 0.746 ± 0.235
3.04PheGly: 3.04 ± 0.363
0.402PheHis: 0.402 ± 0.14
0.688PheIle: 0.688 ± 0.19
0.229PheLys: 0.229 ± 0.13
2.811PheLeu: 2.811 ± 0.537
0.516PheMet: 0.516 ± 0.158
1.262PheAsn: 1.262 ± 0.288
1.319PhePro: 1.319 ± 0.287
1.147PheGln: 1.147 ± 0.218
1.664PheArg: 1.664 ± 0.282
1.262PheSer: 1.262 ± 0.295
2.008PheThr: 2.008 ± 0.314
1.95PheVal: 1.95 ± 0.302
0.172PheTrp: 0.172 ± 0.084
0.344PheTyr: 0.344 ± 0.131
0.0PheXaa: 0.0 ± 0.0
Gly
9.982GlyAla: 9.982 ± 1.276
2.18GlyCys: 2.18 ± 0.487
5.392GlyAsp: 5.392 ± 0.783
4.302GlyGlu: 4.302 ± 0.45
2.811GlyPhe: 2.811 ± 0.371
9.236GlyGly: 9.236 ± 1.962
1.893GlyHis: 1.893 ± 0.397
3.442GlyIle: 3.442 ± 0.404
3.27GlyLys: 3.27 ± 0.515
7.113GlyLeu: 7.113 ± 0.84
1.664GlyMet: 1.664 ± 0.406
2.524GlyAsn: 2.524 ± 0.344
4.819GlyPro: 4.819 ± 0.506
3.901GlyGln: 3.901 ± 0.526
6.654GlyArg: 6.654 ± 0.481
4.532GlySer: 4.532 ± 0.649
6.482GlyThr: 6.482 ± 0.62
6.138GlyVal: 6.138 ± 0.525
2.237GlyTrp: 2.237 ± 0.404
1.893GlyTyr: 1.893 ± 0.338
0.0GlyXaa: 0.0 ± 0.0
His
2.409HisAla: 2.409 ± 0.403
0.402HisCys: 0.402 ± 0.154
1.033HisAsp: 1.033 ± 0.241
0.803HisGlu: 0.803 ± 0.218
0.402HisPhe: 0.402 ± 0.149
1.549HisGly: 1.549 ± 0.345
0.574HisHis: 0.574 ± 0.163
0.516HisIle: 0.516 ± 0.161
0.287HisLys: 0.287 ± 0.134
1.147HisLeu: 1.147 ± 0.267
0.574HisMet: 0.574 ± 0.187
0.803HisAsn: 0.803 ± 0.207
0.918HisPro: 0.918 ± 0.241
0.86HisGln: 0.86 ± 0.225
1.893HisArg: 1.893 ± 0.304
0.688HisSer: 0.688 ± 0.186
1.377HisThr: 1.377 ± 0.258
2.123HisVal: 2.123 ± 0.398
0.918HisTrp: 0.918 ± 0.233
0.344HisTyr: 0.344 ± 0.153
0.0HisXaa: 0.0 ± 0.0
Ile
4.647IleAla: 4.647 ± 0.555
0.516IleCys: 0.516 ± 0.193
1.549IleAsp: 1.549 ± 0.239
3.04IleGlu: 3.04 ± 0.337
0.631IlePhe: 0.631 ± 0.162
2.926IleGly: 2.926 ± 0.422
0.746IleHis: 0.746 ± 0.199
1.492IleIle: 1.492 ± 0.353
1.09IleLys: 1.09 ± 0.241
2.409IleLeu: 2.409 ± 0.562
0.746IleMet: 0.746 ± 0.199
1.319IleAsn: 1.319 ± 0.416
2.524IlePro: 2.524 ± 0.362
1.377IleGln: 1.377 ± 0.257
2.295IleArg: 2.295 ± 0.376
2.467IleSer: 2.467 ± 0.436
2.467IleThr: 2.467 ± 0.5
2.123IleVal: 2.123 ± 0.398
0.344IleTrp: 0.344 ± 0.138
0.631IleTyr: 0.631 ± 0.161
0.0IleXaa: 0.0 ± 0.0
Lys
3.958LysAla: 3.958 ± 0.518
0.229LysCys: 0.229 ± 0.105
1.95LysAsp: 1.95 ± 0.351
0.86LysGlu: 0.86 ± 0.224
0.574LysPhe: 0.574 ± 0.186
2.754LysGly: 2.754 ± 0.367
0.918LysHis: 0.918 ± 0.219
1.033LysIle: 1.033 ± 0.211
1.492LysLys: 1.492 ± 0.295
2.983LysLeu: 2.983 ± 0.43
0.975LysMet: 0.975 ± 0.233
0.918LysAsn: 0.918 ± 0.246
2.065LysPro: 2.065 ± 0.361
1.205LysGln: 1.205 ± 0.234
1.836LysArg: 1.836 ± 0.32
1.778LysSer: 1.778 ± 0.318
1.893LysThr: 1.893 ± 0.329
1.893LysVal: 1.893 ± 0.373
0.229LysTrp: 0.229 ± 0.1
0.574LysTyr: 0.574 ± 0.174
0.0LysXaa: 0.0 ± 0.0
Leu
10.785LeuAla: 10.785 ± 1.111
1.09LeuCys: 1.09 ± 0.307
6.081LeuAsp: 6.081 ± 0.583
3.844LeuGlu: 3.844 ± 0.369
2.295LeuPhe: 2.295 ± 0.378
7.515LeuGly: 7.515 ± 0.896
0.975LeuHis: 0.975 ± 0.27
3.04LeuIle: 3.04 ± 0.616
3.499LeuLys: 3.499 ± 0.418
6.54LeuLeu: 6.54 ± 0.84
1.778LeuMet: 1.778 ± 0.321
2.467LeuAsn: 2.467 ± 0.344
4.532LeuPro: 4.532 ± 0.408
1.319LeuGln: 1.319 ± 0.246
5.909LeuArg: 5.909 ± 0.627
4.761LeuSer: 4.761 ± 0.563
8.146LeuThr: 8.146 ± 0.694
6.654LeuVal: 6.654 ± 0.485
0.688LeuTrp: 0.688 ± 0.242
1.434LeuTyr: 1.434 ± 0.266
0.0LeuXaa: 0.0 ± 0.0
Met
4.016MetAla: 4.016 ± 0.526
0.115MetCys: 0.115 ± 0.08
1.377MetAsp: 1.377 ± 0.279
0.574MetGlu: 0.574 ± 0.173
0.402MetPhe: 0.402 ± 0.161
1.778MetGly: 1.778 ± 0.309
0.229MetHis: 0.229 ± 0.119
1.205MetIle: 1.205 ± 0.257
1.262MetLys: 1.262 ± 0.284
1.664MetLeu: 1.664 ± 0.274
0.402MetMet: 0.402 ± 0.129
0.803MetAsn: 0.803 ± 0.207
0.975MetPro: 0.975 ± 0.16
0.688MetGln: 0.688 ± 0.18
1.606MetArg: 1.606 ± 0.299
1.377MetSer: 1.377 ± 0.274
1.721MetThr: 1.721 ± 0.304
1.377MetVal: 1.377 ± 0.258
0.402MetTrp: 0.402 ± 0.147
0.574MetTyr: 0.574 ± 0.169
0.0MetXaa: 0.0 ± 0.0
Asn
3.385AsnAla: 3.385 ± 0.432
0.459AsnCys: 0.459 ± 0.171
1.606AsnAsp: 1.606 ± 0.253
1.033AsnGlu: 1.033 ± 0.254
0.918AsnPhe: 0.918 ± 0.188
3.27AsnGly: 3.27 ± 0.429
0.459AsnHis: 0.459 ± 0.169
1.319AsnIle: 1.319 ± 0.308
0.402AsnLys: 0.402 ± 0.139
2.524AsnLeu: 2.524 ± 0.563
0.574AsnMet: 0.574 ± 0.194
0.516AsnAsn: 0.516 ± 0.159
1.778AsnPro: 1.778 ± 0.297
1.377AsnGln: 1.377 ± 0.266
2.123AsnArg: 2.123 ± 0.359
1.262AsnSer: 1.262 ± 0.262
1.319AsnThr: 1.319 ± 0.25
2.123AsnVal: 2.123 ± 0.453
0.229AsnTrp: 0.229 ± 0.087
0.516AsnTyr: 0.516 ± 0.14
0.0AsnXaa: 0.0 ± 0.0
Pro
7.744ProAla: 7.744 ± 0.753
1.205ProCys: 1.205 ± 0.341
4.016ProAsp: 4.016 ± 0.584
4.475ProGlu: 4.475 ± 0.616
1.09ProPhe: 1.09 ± 0.274
5.392ProGly: 5.392 ± 0.62
1.09ProHis: 1.09 ± 0.282
1.664ProIle: 1.664 ± 0.398
2.123ProLys: 2.123 ± 0.372
4.016ProLeu: 4.016 ± 0.578
0.918ProMet: 0.918 ± 0.277
1.262ProAsn: 1.262 ± 0.265
3.155ProPro: 3.155 ± 0.517
1.664ProGln: 1.664 ± 0.322
3.27ProArg: 3.27 ± 0.484
3.671ProSer: 3.671 ± 0.48
2.639ProThr: 2.639 ± 0.375
5.507ProVal: 5.507 ± 0.491
1.147ProTrp: 1.147 ± 0.256
1.033ProTyr: 1.033 ± 0.265
0.0ProXaa: 0.0 ± 0.0
Gln
4.589GlnAla: 4.589 ± 0.53
0.688GlnCys: 0.688 ± 0.236
1.778GlnAsp: 1.778 ± 0.29
0.631GlnGlu: 0.631 ± 0.213
1.147GlnPhe: 1.147 ± 0.251
2.123GlnGly: 2.123 ± 0.312
0.459GlnHis: 0.459 ± 0.171
0.975GlnIle: 0.975 ± 0.258
0.516GlnLys: 0.516 ± 0.162
4.245GlnLeu: 4.245 ± 0.448
0.975GlnMet: 0.975 ± 0.216
0.918GlnAsn: 0.918 ± 0.221
2.409GlnPro: 2.409 ± 0.38
1.549GlnGln: 1.549 ± 0.248
2.123GlnArg: 2.123 ± 0.387
0.975GlnSer: 0.975 ± 0.232
2.065GlnThr: 2.065 ± 0.392
2.295GlnVal: 2.295 ± 0.288
0.918GlnTrp: 0.918 ± 0.235
0.86GlnTyr: 0.86 ± 0.216
0.0GlnXaa: 0.0 ± 0.0
Arg
7.974ArgAla: 7.974 ± 0.826
1.147ArgCys: 1.147 ± 0.283
4.589ArgAsp: 4.589 ± 0.638
3.098ArgGlu: 3.098 ± 0.419
1.664ArgPhe: 1.664 ± 0.316
4.876ArgGly: 4.876 ± 0.473
1.721ArgHis: 1.721 ± 0.319
3.27ArgIle: 3.27 ± 0.433
2.983ArgLys: 2.983 ± 0.437
5.622ArgLeu: 5.622 ± 0.7
2.467ArgMet: 2.467 ± 0.303
2.008ArgAsn: 2.008 ± 0.384
3.04ArgPro: 3.04 ± 0.359
2.18ArgGln: 2.18 ± 0.327
5.679ArgArg: 5.679 ± 0.538
2.639ArgSer: 2.639 ± 0.345
5.22ArgThr: 5.22 ± 0.575
5.909ArgVal: 5.909 ± 0.607
1.205ArgTrp: 1.205 ± 0.239
1.95ArgTyr: 1.95 ± 0.396
0.0ArgXaa: 0.0 ± 0.0
Ser
6.196SerAla: 6.196 ± 0.698
0.975SerCys: 0.975 ± 0.314
2.639SerAsp: 2.639 ± 0.496
2.639SerGlu: 2.639 ± 0.397
1.95SerPhe: 1.95 ± 0.357
6.54SerGly: 6.54 ± 0.613
1.033SerHis: 1.033 ± 0.214
1.549SerIle: 1.549 ± 0.334
1.319SerLys: 1.319 ± 0.237
4.073SerLeu: 4.073 ± 0.471
1.836SerMet: 1.836 ± 0.359
1.147SerAsn: 1.147 ± 0.227
3.327SerPro: 3.327 ± 0.419
1.09SerGln: 1.09 ± 0.239
3.212SerArg: 3.212 ± 0.448
2.811SerSer: 2.811 ± 0.504
3.557SerThr: 3.557 ± 0.362
4.302SerVal: 4.302 ± 0.421
0.803SerTrp: 0.803 ± 0.211
1.319SerTyr: 1.319 ± 0.323
0.0SerXaa: 0.0 ± 0.0
Thr
9.408ThrAla: 9.408 ± 0.752
0.975ThrCys: 0.975 ± 0.377
3.958ThrAsp: 3.958 ± 0.426
4.532ThrGlu: 4.532 ± 0.537
1.721ThrPhe: 1.721 ± 0.209
8.031ThrGly: 8.031 ± 0.595
1.147ThrHis: 1.147 ± 0.261
2.754ThrIle: 2.754 ± 0.476
2.008ThrLys: 2.008 ± 0.306
5.45ThrLeu: 5.45 ± 0.477
1.492ThrMet: 1.492 ± 0.332
1.549ThrAsn: 1.549 ± 0.277
3.844ThrPro: 3.844 ± 0.503
1.778ThrGln: 1.778 ± 0.275
3.671ThrArg: 3.671 ± 0.403
3.385ThrSer: 3.385 ± 0.405
3.958ThrThr: 3.958 ± 0.53
6.31ThrVal: 6.31 ± 0.552
1.09ThrTrp: 1.09 ± 0.237
1.549ThrTyr: 1.549 ± 0.256
0.0ThrXaa: 0.0 ± 0.0
Val
11.875ValAla: 11.875 ± 1.049
1.033ValCys: 1.033 ± 0.234
6.597ValAsp: 6.597 ± 0.592
2.352ValGlu: 2.352 ± 0.331
2.409ValPhe: 2.409 ± 0.296
4.417ValGly: 4.417 ± 0.482
1.778ValHis: 1.778 ± 0.285
3.614ValIle: 3.614 ± 0.378
2.467ValLys: 2.467 ± 0.36
6.081ValLeu: 6.081 ± 0.751
1.205ValMet: 1.205 ± 0.238
2.581ValAsn: 2.581 ± 0.4
5.622ValPro: 5.622 ± 0.629
2.237ValGln: 2.237 ± 0.359
5.564ValArg: 5.564 ± 0.579
4.704ValSer: 4.704 ± 0.532
6.425ValThr: 6.425 ± 0.623
6.196ValVal: 6.196 ± 0.755
1.492ValTrp: 1.492 ± 0.291
0.746ValTyr: 0.746 ± 0.236
0.0ValXaa: 0.0 ± 0.0
Trp
1.778TrpAla: 1.778 ± 0.271
0.344TrpCys: 0.344 ± 0.114
1.033TrpAsp: 1.033 ± 0.236
0.229TrpGlu: 0.229 ± 0.098
0.574TrpPhe: 0.574 ± 0.184
0.918TrpGly: 0.918 ± 0.234
0.402TrpHis: 0.402 ± 0.234
0.344TrpIle: 0.344 ± 0.129
0.746TrpLys: 0.746 ± 0.178
1.778TrpLeu: 1.778 ± 0.322
0.172TrpMet: 0.172 ± 0.088
0.803TrpAsn: 0.803 ± 0.224
1.09TrpPro: 1.09 ± 0.287
0.459TrpGln: 0.459 ± 0.143
0.803TrpArg: 0.803 ± 0.27
1.434TrpSer: 1.434 ± 0.274
0.975TrpThr: 0.975 ± 0.243
1.377TrpVal: 1.377 ± 0.28
0.459TrpTrp: 0.459 ± 0.164
0.516TrpTyr: 0.516 ± 0.199
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.008TyrAla: 2.008 ± 0.329
0.344TyrCys: 0.344 ± 0.128
1.434TyrAsp: 1.434 ± 0.312
1.262TyrGlu: 1.262 ± 0.334
0.344TyrPhe: 0.344 ± 0.147
2.639TyrGly: 2.639 ± 0.459
0.402TyrHis: 0.402 ± 0.151
0.746TyrIle: 0.746 ± 0.186
0.516TyrLys: 0.516 ± 0.166
1.893TyrLeu: 1.893 ± 0.335
0.287TyrMet: 0.287 ± 0.136
0.688TyrAsn: 0.688 ± 0.188
0.975TyrPro: 0.975 ± 0.199
0.631TyrGln: 0.631 ± 0.171
1.893TyrArg: 1.893 ± 0.41
0.746TyrSer: 0.746 ± 0.222
0.975TyrThr: 0.975 ± 0.246
1.549TyrVal: 1.549 ± 0.339
0.115TyrTrp: 0.115 ± 0.089
0.172TyrTyr: 0.172 ± 0.11
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (17433 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski