Amino acid dipeptide frequency for Rhodococcus phage Bryce

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
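As an illustration of the calculation described above, the following is a minimal sketch rather than the procedure actually used to build this table. It assumes one-letter sequences, overlapping pairs counted within each protein, and normalisation by the total number of dipeptides; the function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (Xaa would be "X").
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs are counted inside each protein, never across protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation: 1000 per mille spread over 20 * 20 = 400 dipeptides.
EXPECTED = 1000.0 / len(list(product(STANDARD, repeat=2)))  # 2.5 per mille

def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"

toy = ["MAAAK", "MAKAA"]  # made-up sequences, not phage Bryce proteins
freqs = dipeptide_per_mille(toy)
```

With the table values, `classify(10.452)` (AlaAla) gives "over" and `classify(0.716)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.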

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
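The unit conversion can be checked with a short worked example. This is only a sketch: the helper name is hypothetical, and the AlaAla value is taken from the table on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

ala_ala = 10.452                           # AlaAla, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.010452
percent = fraction * 100                   # about 1.0452 %
# Ratio to the uniform expectation of 1/400 (0.25 %) for 400 dipeptides.
fold = fraction / (1 / 400)                # about 4.18
```

In other words, AlaAla occurs roughly four times more often than a uniform distribution over 400 dipeptides would predict.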

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.452 ± 1.337
AlaCys: 0.716 ± 0.232
AlaAsp: 5.369 ± 0.634
AlaGlu: 9.092 ± 0.895
AlaPhe: 3.078 ± 0.405
AlaGly: 7.446 ± 0.949
AlaHis: 1.36 ± 0.346
AlaIle: 4.296 ± 0.723
AlaLys: 3.651 ± 0.408
AlaLeu: 8.734 ± 0.959
AlaMet: 2.219 ± 0.387
AlaAsn: 3.078 ± 0.593
AlaPro: 4.224 ± 0.542
AlaGln: 3.293 ± 0.574
AlaArg: 6.157 ± 0.662
AlaSer: 5.226 ± 0.636
AlaThr: 4.725 ± 0.619
AlaVal: 7.302 ± 0.718
AlaTrp: 1.718 ± 0.392
AlaTyr: 2.792 ± 0.539
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.358 ± 0.142
CysCys: 0.0 ± 0.0
CysAsp: 0.43 ± 0.168
CysGlu: 0.788 ± 0.315
CysPhe: 0.143 ± 0.1
CysGly: 1.074 ± 0.272
CysHis: 0.143 ± 0.084
CysIle: 0.215 ± 0.114
CysLys: 0.573 ± 0.218
CysLeu: 0.286 ± 0.132
CysMet: 0.072 ± 0.071
CysAsn: 0.286 ± 0.126
CysPro: 0.716 ± 0.234
CysGln: 0.43 ± 0.18
CysArg: 0.43 ± 0.211
CysSer: 0.43 ± 0.195
CysThr: 0.358 ± 0.171
CysVal: 0.788 ± 0.273
CysTrp: 0.143 ± 0.085
CysTyr: 0.43 ± 0.171
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.656 ± 0.67
AspCys: 0.644 ± 0.272
AspAsp: 3.078 ± 0.423
AspGlu: 5.584 ± 0.766
AspPhe: 2.076 ± 0.459
AspGly: 5.942 ± 0.722
AspHis: 1.933 ± 0.425
AspIle: 1.145 ± 0.316
AspLys: 3.078 ± 0.445
AspLeu: 6.372 ± 0.779
AspMet: 2.148 ± 0.37
AspAsn: 2.148 ± 0.311
AspPro: 4.868 ± 0.72
AspGln: 2.076 ± 0.386
AspArg: 3.222 ± 0.465
AspSer: 3.58 ± 0.462
AspThr: 3.651 ± 0.426
AspVal: 5.298 ± 0.573
AspTrp: 1.289 ± 0.322
AspTyr: 2.721 ± 0.544
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.732 ± 0.702
GluCys: 0.573 ± 0.194
GluAsp: 5.226 ± 0.549
GluGlu: 5.083 ± 0.799
GluPhe: 2.577 ± 0.452
GluGly: 6.372 ± 0.674
GluHis: 1.36 ± 0.353
GluIle: 4.868 ± 0.816
GluLys: 2.506 ± 0.469
GluLeu: 7.302 ± 0.843
GluMet: 1.432 ± 0.304
GluAsn: 2.792 ± 0.515
GluPro: 2.506 ± 0.486
GluGln: 2.506 ± 0.432
GluArg: 4.152 ± 0.728
GluSer: 3.365 ± 0.58
GluThr: 4.582 ± 0.543
GluVal: 6.586 ± 0.722
GluTrp: 1.432 ± 0.378
GluTyr: 1.933 ± 0.346
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.365 ± 0.561
PheCys: 0.215 ± 0.137
PheAsp: 2.935 ± 0.511
PheGlu: 2.291 ± 0.395
PhePhe: 0.644 ± 0.174
PheGly: 2.721 ± 0.42
PheHis: 0.716 ± 0.198
PheIle: 1.503 ± 0.261
PheLys: 1.503 ± 0.366
PheLeu: 3.007 ± 0.527
PheMet: 0.644 ± 0.167
PheAsn: 1.145 ± 0.241
PhePro: 1.36 ± 0.254
PheGln: 1.145 ± 0.324
PheArg: 2.363 ± 0.451
PheSer: 2.577 ± 0.358
PheThr: 1.79 ± 0.356
PheVal: 2.721 ± 0.493
PheTrp: 0.501 ± 0.145
PheTyr: 1.074 ± 0.257
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.658 ± 0.985
GlyCys: 0.644 ± 0.255
GlyAsp: 6.658 ± 0.628
GlyGlu: 4.94 ± 0.641
GlyPhe: 4.582 ± 0.684
GlyGly: 6.229 ± 0.73
GlyHis: 1.933 ± 0.351
GlyIle: 4.081 ± 0.726
GlyLys: 4.081 ± 0.498
GlyLeu: 5.513 ± 0.739
GlyMet: 2.363 ± 0.746
GlyAsn: 3.007 ± 0.493
GlyPro: 3.58 ± 0.418
GlyGln: 3.15 ± 0.603
GlyArg: 4.224 ± 0.558
GlySer: 5.799 ± 1.062
GlyThr: 5.513 ± 0.7
GlyVal: 6.873 ± 0.845
GlyTrp: 2.219 ± 0.4
GlyTyr: 3.007 ± 0.552
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.575 ± 0.325
HisCys: 0.072 ± 0.069
HisAsp: 1.647 ± 0.403
HisGlu: 1.575 ± 0.45
HisPhe: 0.716 ± 0.216
HisGly: 1.217 ± 0.303
HisHis: 0.501 ± 0.194
HisIle: 1.432 ± 0.351
HisLys: 0.573 ± 0.204
HisLeu: 1.432 ± 0.362
HisMet: 0.573 ± 0.238
HisAsn: 0.716 ± 0.205
HisPro: 1.145 ± 0.276
HisGln: 0.716 ± 0.187
HisArg: 1.79 ± 0.407
HisSer: 0.788 ± 0.347
HisThr: 1.002 ± 0.303
HisVal: 1.289 ± 0.247
HisTrp: 0.43 ± 0.191
HisTyr: 0.788 ± 0.229
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.298 ± 0.721
IleCys: 0.358 ± 0.214
IleAsp: 3.58 ± 0.514
IleGlu: 3.794 ± 0.618
IlePhe: 0.859 ± 0.279
IleGly: 4.868 ± 0.696
IleHis: 1.002 ± 0.229
IleIle: 1.432 ± 0.28
IleLys: 1.145 ± 0.325
IleLeu: 3.222 ± 0.561
IleMet: 0.644 ± 0.203
IleAsn: 1.36 ± 0.29
IlePro: 2.649 ± 0.51
IleGln: 0.931 ± 0.232
IleArg: 3.508 ± 0.528
IleSer: 2.076 ± 0.393
IleThr: 3.794 ± 0.61
IleVal: 3.293 ± 0.644
IleTrp: 0.43 ± 0.199
IleTyr: 1.575 ± 0.368
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.009 ± 0.564
LysCys: 0.143 ± 0.097
LysAsp: 2.649 ± 0.304
LysGlu: 1.861 ± 0.306
LysPhe: 1.217 ± 0.311
LysGly: 3.436 ± 0.822
LysHis: 1.074 ± 0.273
LysIle: 1.575 ± 0.335
LysLys: 1.861 ± 0.449
LysLeu: 4.296 ± 0.674
LysMet: 0.716 ± 0.258
LysAsn: 1.217 ± 0.288
LysPro: 1.647 ± 0.4
LysGln: 0.644 ± 0.193
LysArg: 2.219 ± 0.463
LysSer: 2.219 ± 0.319
LysThr: 2.363 ± 0.566
LysVal: 3.58 ± 0.616
LysTrp: 0.788 ± 0.221
LysTyr: 1.002 ± 0.261
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.379 ± 0.922
LeuCys: 0.573 ± 0.241
LeuAsp: 5.871 ± 0.776
LeuGlu: 6.229 ± 0.808
LeuPhe: 2.363 ± 0.332
LeuGly: 6.73 ± 1.019
LeuHis: 1.79 ± 0.394
LeuIle: 3.078 ± 0.435
LeuLys: 3.508 ± 0.526
LeuLeu: 5.298 ± 0.715
LeuMet: 1.861 ± 0.47
LeuAsn: 2.363 ± 0.413
LeuPro: 4.51 ± 0.536
LeuGln: 2.864 ± 0.433
LeuArg: 5.441 ± 0.7
LeuSer: 5.298 ± 0.65
LeuThr: 6.085 ± 0.637
LeuVal: 6.443 ± 0.636
LeuTrp: 1.575 ± 0.395
LeuTyr: 2.577 ± 0.38
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.933 ± 0.34
MetCys: 0.0 ± 0.0
MetAsp: 1.217 ± 0.297
MetGlu: 1.432 ± 0.317
MetPhe: 0.931 ± 0.262
MetGly: 2.291 ± 0.353
MetHis: 0.358 ± 0.155
MetIle: 1.503 ± 0.352
MetLys: 1.002 ± 0.295
MetLeu: 1.647 ± 0.267
MetMet: 0.358 ± 0.159
MetAsn: 0.859 ± 0.23
MetPro: 0.788 ± 0.226
MetGln: 0.501 ± 0.226
MetArg: 1.933 ± 0.478
MetSer: 3.078 ± 0.461
MetThr: 1.933 ± 0.361
MetVal: 1.074 ± 0.244
MetTrp: 0.215 ± 0.13
MetTyr: 0.573 ± 0.219
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.222 ± 0.518
AsnCys: 0.644 ± 0.264
AsnAsp: 2.076 ± 0.394
AsnGlu: 2.005 ± 0.394
AsnPhe: 1.002 ± 0.304
AsnGly: 3.938 ± 0.571
AsnHis: 0.931 ± 0.236
AsnIle: 1.36 ± 0.291
AsnLys: 1.002 ± 0.255
AsnLeu: 3.15 ± 0.449
AsnMet: 1.002 ± 0.273
AsnAsn: 1.503 ± 0.31
AsnPro: 2.148 ± 0.341
AsnGln: 0.788 ± 0.238
AsnArg: 1.647 ± 0.419
AsnSer: 2.219 ± 0.416
AsnThr: 2.076 ± 0.454
AsnVal: 2.434 ± 0.355
AsnTrp: 0.859 ± 0.224
AsnTyr: 1.074 ± 0.307
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.367 ± 0.505
ProCys: 0.358 ± 0.184
ProAsp: 2.792 ± 0.337
ProGlu: 3.508 ± 0.537
ProPhe: 1.289 ± 0.231
ProGly: 4.224 ± 0.584
ProHis: 0.788 ± 0.224
ProIle: 2.792 ± 0.557
ProLys: 1.36 ± 0.286
ProLeu: 3.723 ± 0.462
ProMet: 1.289 ± 0.249
ProAsn: 2.076 ± 0.408
ProPro: 1.289 ± 0.289
ProGln: 1.503 ± 0.267
ProArg: 2.649 ± 0.581
ProSer: 2.291 ± 0.355
ProThr: 3.794 ± 0.62
ProVal: 4.224 ± 0.672
ProTrp: 1.145 ± 0.28
ProTyr: 1.432 ± 0.267
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.222 ± 0.44
GlnCys: 0.143 ± 0.133
GlnAsp: 2.005 ± 0.351
GlnGlu: 1.647 ± 0.39
GlnPhe: 1.074 ± 0.264
GlnGly: 2.363 ± 0.472
GlnHis: 0.573 ± 0.216
GlnIle: 1.861 ± 0.311
GlnLys: 0.788 ± 0.217
GlnLeu: 2.721 ± 0.499
GlnMet: 0.859 ± 0.255
GlnAsn: 0.716 ± 0.218
GlnPro: 1.718 ± 0.354
GlnGln: 1.289 ± 0.334
GlnArg: 3.007 ± 0.415
GlnSer: 1.861 ± 0.352
GlnThr: 1.718 ± 0.297
GlnVal: 3.365 ± 0.474
GlnTrp: 0.644 ± 0.219
GlnTyr: 0.788 ± 0.257
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.298 ± 0.582
ArgCys: 1.145 ± 0.319
ArgAsp: 4.152 ± 0.569
ArgGlu: 4.868 ± 0.801
ArgPhe: 2.363 ± 0.421
ArgGly: 3.794 ± 0.465
ArgHis: 1.217 ± 0.408
ArgIle: 2.649 ± 0.541
ArgLys: 2.792 ± 0.453
ArgLeu: 5.513 ± 0.616
ArgMet: 0.931 ± 0.331
ArgAsn: 2.291 ± 0.483
ArgPro: 2.721 ± 0.492
ArgGln: 2.363 ± 0.402
ArgArg: 3.866 ± 0.491
ArgSer: 3.436 ± 0.532
ArgThr: 4.081 ± 0.524
ArgVal: 4.296 ± 0.507
ArgTrp: 1.36 ± 0.306
ArgTyr: 1.79 ± 0.439
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.011 ± 0.624
SerCys: 0.501 ± 0.195
SerAsp: 3.938 ± 0.46
SerGlu: 4.152 ± 0.579
SerPhe: 2.792 ± 0.524
SerGly: 6.085 ± 0.811
SerHis: 1.145 ± 0.251
SerIle: 2.219 ± 0.415
SerLys: 2.076 ± 0.483
SerLeu: 5.226 ± 0.659
SerMet: 1.79 ± 0.38
SerAsn: 2.721 ± 0.439
SerPro: 2.291 ± 0.386
SerGln: 1.79 ± 0.347
SerArg: 3.508 ± 0.435
SerSer: 3.508 ± 0.535
SerThr: 3.365 ± 0.432
SerVal: 4.653 ± 0.68
SerTrp: 1.145 ± 0.283
SerTyr: 1.718 ± 0.35
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.94 ± 0.551
ThrCys: 0.358 ± 0.168
ThrAsp: 3.15 ± 0.444
ThrGlu: 5.083 ± 0.552
ThrPhe: 2.005 ± 0.386
ThrGly: 6.014 ± 0.692
ThrHis: 0.931 ± 0.267
ThrIle: 3.436 ± 0.546
ThrLys: 2.649 ± 0.387
ThrLeu: 5.369 ± 0.542
ThrMet: 1.289 ± 0.327
ThrAsn: 1.575 ± 0.392
ThrPro: 3.794 ± 0.573
ThrGln: 1.933 ± 0.4
ThrArg: 3.15 ± 0.441
ThrSer: 3.078 ± 0.433
ThrThr: 3.508 ± 0.603
ThrVal: 5.011 ± 0.494
ThrTrp: 1.145 ± 0.23
ThrTyr: 2.506 ± 0.42
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.66 ± 0.91
ValCys: 0.859 ± 0.251
ValAsp: 5.513 ± 0.547
ValGlu: 6.801 ± 0.668
ValPhe: 2.792 ± 0.435
ValGly: 6.014 ± 0.798
ValHis: 1.074 ± 0.284
ValIle: 3.723 ± 0.437
ValLys: 3.078 ± 0.405
ValLeu: 7.231 ± 0.678
ValMet: 1.647 ± 0.42
ValAsn: 3.15 ± 0.397
ValPro: 3.651 ± 0.481
ValGln: 2.363 ± 0.43
ValArg: 4.439 ± 0.59
ValSer: 5.226 ± 0.53
ValThr: 4.152 ± 0.638
ValVal: 6.372 ± 0.846
ValTrp: 1.217 ± 0.337
ValTyr: 1.933 ± 0.391
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.36 ± 0.322
TrpCys: 0.0 ± 0.0
TrpAsp: 1.432 ± 0.286
TrpGlu: 1.432 ± 0.305
TrpPhe: 0.788 ± 0.223
TrpGly: 1.718 ± 0.415
TrpHis: 0.644 ± 0.223
TrpIle: 0.931 ± 0.252
TrpLys: 0.859 ± 0.229
TrpLeu: 1.289 ± 0.354
TrpMet: 0.644 ± 0.184
TrpAsn: 0.931 ± 0.31
TrpPro: 0.644 ± 0.23
TrpGln: 0.788 ± 0.2
TrpArg: 0.501 ± 0.155
TrpSer: 1.432 ± 0.36
TrpThr: 1.145 ± 0.265
TrpVal: 1.289 ± 0.31
TrpTrp: 0.358 ± 0.202
TrpTyr: 0.644 ± 0.238
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.436 ± 0.524
TyrCys: 0.143 ± 0.092
TyrAsp: 2.792 ± 0.395
TyrGlu: 2.721 ± 0.45
TyrPhe: 0.931 ± 0.26
TyrGly: 2.577 ± 0.498
TyrHis: 0.501 ± 0.181
TyrIle: 1.718 ± 0.377
TyrLys: 0.501 ± 0.205
TyrLeu: 2.506 ± 0.512
TyrMet: 1.074 ± 0.259
TyrAsn: 1.145 ± 0.275
TyrPro: 0.644 ± 0.216
TyrGln: 1.289 ± 0.223
TyrArg: 2.721 ± 0.487
TyrSer: 2.148 ± 0.416
TyrThr: 1.289 ± 0.317
TyrVal: 2.005 ± 0.408
TyrTrp: 0.215 ± 0.119
TyrTyr: 0.644 ± 0.191
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (13969 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
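A protein-level bootstrap of this kind could look roughly as follows. This is a sketch under stated assumptions, not the database's actual code: it assumes the error is the standard deviation of the frequency across replicates, that each replicate resamples whole proteins with replacement, and that sequences are in one-letter code. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille of all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

toy = ["MAAAK", "MAKAA", "MKKKA"]  # made-up sequences, not phage Bryce proteins
err_aa = bootstrap_error(toy, "AA")
err_xx = bootstrap_error(toy, "XX")  # a pair that never occurs
```

A dipeptide that never occurs has the same estimate (zero) in every replicate, so its error is also zero, which is consistent with the 0.0 ± 0.0 entries in the Xaa row and column.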

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski