Amino acid dipeptide frequency for Bacillus phage phi4B1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
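As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs and reports them in per mille. It is not the Proteome-pI implementation: the function names, the one-letter input format, the normalisation by the total number of within-protein pairs, and the use of the uniform 2.5‰ value as the red/blue threshold are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation: 1 of 400 standard dipeptides = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted within each protein only, and the
    frequencies are normalised by the total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def representation(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be shown in red
    if freq_permille < expected:
        return "under"  # would be shown in blue
    return "expected"


if __name__ == "__main__":
    toy = ["MKELLKAG", "MKKLEEAG"]  # hypothetical toy sequences
    freqs = dipeptide_permille(toy)
    for pair in ("MK", "AG", "WW"):
        print(pair, round(freqs[pair], 1), representation(freqs[pair]))
```

The returned dictionary always has 441 entries, so dipeptides that never occur (including all Xaa pairs in a proteome without unknown residues) appear with a frequency of 0.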

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
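The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical, and the comparison against the uniform 2.5‰ value (1/400) is only an illustrative baseline; the example value 8.759‰ is the GluLys entry from the table.

```python
# Uniform expectation for 400 standard dipeptides, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (1 % = 10 per mille)."""
    return value_permille / 10.0


def fold_vs_uniform(value_permille):
    """Ratio of an observed frequency to the uniform 2.5 per mille baseline."""
    return value_permille / UNIFORM_PERMILLE


if __name__ == "__main__":
    glu_lys = 8.759  # GluLys frequency from the table, in per mille
    print(f"fraction: {permille_to_fraction(glu_lys):.6f}")
    print(f"percent:  {permille_to_percent(glu_lys):.4f}")
    print(f"fold over uniform: {fold_vs_uniform(glu_lys):.2f}")
```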

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is frequency ± error, in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| Ala | 4.927 ± 1.191 | 0.456 ± 0.237 | 3.011 ± 0.482 | 4.197 ± 0.566 | 4.197 ± 0.53 | 2.828 ± 0.835 | 0.73 ± 0.284 | 5.201 ± 0.911 | 6.022 ± 0.594 | 4.836 ± 0.72 | 1.734 ± 0.542 | 2.828 ± 0.536 | 1.46 ± 0.464 | 1.369 ± 0.279 | 3.558 ± 0.535 | 3.285 ± 0.534 | 3.285 ± 0.737 | 3.193 ± 0.437 | 1.004 ± 0.259 | 2.372 ± 0.428 | 0.0 ± 0.0 |
| Cys | 0.365 ± 0.207 | 0.182 ± 0.127 | 0.912 ± 0.34 | 0.639 ± 0.258 | 0.639 ± 0.204 | 0.091 ± 0.108 | 0.091 ± 0.097 | 0.182 ± 0.135 | 0.547 ± 0.2 | 0.639 ± 0.259 | 0.365 ± 0.183 | 0.365 ± 0.17 | 0.274 ± 0.161 | 0.182 ± 0.12 | 0.456 ± 0.269 | 0.73 ± 0.317 | 0.547 ± 0.179 | 0.639 ± 0.301 | 0.091 ± 0.087 | 0.456 ± 0.252 | 0.0 ± 0.0 |
| Asp | 4.106 ± 0.622 | 0.182 ± 0.126 | 3.376 ± 0.526 | 5.657 ± 0.812 | 2.555 ± 0.643 | 3.376 ± 0.614 | 0.73 ± 0.311 | 5.018 ± 0.713 | 5.018 ± 0.443 | 4.106 ± 0.632 | 2.099 ± 0.32 | 3.193 ± 0.481 | 1.277 ± 0.306 | 1.916 ± 0.36 | 2.19 ± 0.604 | 2.646 ± 0.609 | 3.467 ± 0.504 | 4.015 ± 0.442 | 1.46 ± 0.456 | 2.555 ± 0.524 | 0.0 ± 0.0 |
| Glu | 4.745 ± 0.674 | 0.821 ± 0.342 | 5.018 ± 0.632 | 7.026 ± 0.881 | 3.832 ± 0.675 | 4.106 ± 0.664 | 0.912 ± 0.287 | 5.931 ± 0.686 | 8.759 ± 1.045 | 8.577 ± 0.856 | 3.65 ± 0.445 | 4.471 ± 0.579 | 1.551 ± 0.342 | 3.832 ± 0.655 | 3.467 ± 0.59 | 3.467 ± 0.432 | 5.109 ± 0.54 | 5.839 ± 0.791 | 1.825 ± 0.63 | 2.19 ± 0.474 | 0.0 ± 0.0 |
| Phe | 2.099 ± 0.368 | 0.274 ± 0.158 | 3.011 ± 0.493 | 2.92 ± 0.419 | 2.372 ± 0.453 | 3.102 ± 0.373 | 0.73 ± 0.331 | 2.92 ± 0.572 | 5.201 ± 0.767 | 3.741 ± 0.598 | 1.186 ± 0.321 | 2.828 ± 0.486 | 0.821 ± 0.243 | 1.004 ± 0.333 | 2.19 ± 0.427 | 2.19 ± 0.414 | 3.193 ± 0.735 | 2.92 ± 0.433 | 0.73 ± 0.232 | 1.46 ± 0.353 | 0.0 ± 0.0 |
| Gly | 2.92 ± 0.772 | 0.547 ± 0.321 | 3.285 ± 0.548 | 4.927 ± 0.505 | 2.92 ± 0.481 | 3.558 ± 0.746 | 0.912 ± 0.283 | 4.745 ± 0.571 | 5.839 ± 0.884 | 4.288 ± 0.753 | 2.281 ± 0.546 | 2.555 ± 0.53 | 1.004 ± 0.273 | 2.737 ± 0.478 | 2.646 ± 0.524 | 3.193 ± 0.759 | 3.376 ± 0.584 | 3.193 ± 0.447 | 1.46 ± 0.478 | 3.102 ± 0.598 | 0.0 ± 0.0 |
| His | 1.46 ± 0.419 | 0.365 ± 0.179 | 0.912 ± 0.351 | 0.912 ± 0.275 | 1.095 ± 0.283 | 0.821 ± 0.218 | 0.456 ± 0.17 | 1.551 ± 0.371 | 1.186 ± 0.393 | 1.642 ± 0.427 | 0.182 ± 0.14 | 0.73 ± 0.234 | 0.547 ± 0.218 | 0.365 ± 0.142 | 0.639 ± 0.214 | 1.095 ± 0.367 | 0.73 ± 0.251 | 0.821 ± 0.293 | 0.365 ± 0.164 | 0.365 ± 0.205 | 0.0 ± 0.0 |
| Ile | 5.109 ± 0.657 | 1.004 ± 0.294 | 4.836 ± 0.663 | 7.117 ± 0.702 | 1.825 ± 0.538 | 3.832 ± 0.638 | 1.095 ± 0.302 | 3.832 ± 0.477 | 5.201 ± 0.674 | 5.748 ± 1.127 | 1.825 ± 0.451 | 3.923 ± 0.632 | 2.646 ± 0.456 | 2.737 ± 0.482 | 2.646 ± 0.625 | 3.65 ± 0.487 | 4.015 ± 0.639 | 3.741 ± 0.603 | 1.004 ± 0.244 | 1.916 ± 0.349 | 0.0 ± 0.0 |
| Lys | 5.748 ± 0.63 | 0.639 ± 0.273 | 5.657 ± 0.45 | 9.58 ± 0.836 | 2.828 ± 0.495 | 5.566 ± 0.734 | 1.004 ± 0.392 | 4.927 ± 0.572 | 9.033 ± 1.125 | 8.029 ± 0.888 | 3.193 ± 0.583 | 5.383 ± 0.535 | 2.464 ± 0.449 | 4.106 ± 0.678 | 4.836 ± 0.58 | 4.015 ± 0.523 | 6.296 ± 0.878 | 6.204 ± 0.566 | 1.551 ± 0.431 | 3.193 ± 0.734 | 0.0 ± 0.0 |
| Leu | 4.653 ± 0.648 | 0.821 ± 0.329 | 4.836 ± 0.586 | 6.843 ± 0.595 | 3.193 ± 0.659 | 6.204 ± 0.861 | 1.916 ± 0.587 | 4.38 ± 0.775 | 8.942 ± 0.826 | 6.204 ± 0.782 | 2.007 ± 0.448 | 4.471 ± 0.577 | 2.281 ± 0.374 | 2.737 ± 0.539 | 3.102 ± 0.429 | 5.292 ± 0.795 | 4.927 ± 0.527 | 4.653 ± 0.804 | 0.73 ± 0.222 | 2.737 ± 0.468 | 0.0 ± 0.0 |
| Met | 2.281 ± 0.564 | 0.182 ± 0.136 | 1.551 ± 0.31 | 1.825 ± 0.381 | 1.004 ± 0.297 | 1.277 ± 0.339 | 0.639 ± 0.242 | 1.369 ± 0.344 | 4.197 ± 0.449 | 1.825 ± 0.41 | 1.277 ± 0.335 | 2.281 ± 0.34 | 1.186 ± 0.43 | 0.73 ± 0.186 | 2.19 ± 0.364 | 2.464 ± 0.655 | 1.825 ± 0.378 | 0.821 ± 0.228 | 0.274 ± 0.159 | 1.095 ± 0.32 | 0.0 ± 0.0 |
| Asn | 3.832 ± 0.58 | 0.365 ± 0.197 | 3.102 ± 0.577 | 4.197 ± 0.679 | 2.281 ± 0.477 | 3.741 ± 0.794 | 1.004 ± 0.299 | 4.106 ± 0.573 | 4.836 ± 0.752 | 3.832 ± 0.55 | 1.734 ± 0.351 | 3.65 ± 0.549 | 2.281 ± 0.502 | 1.734 ± 0.352 | 2.737 ± 0.51 | 2.464 ± 0.439 | 3.467 ± 0.607 | 2.646 ± 0.54 | 1.186 ± 0.612 | 1.551 ± 0.337 | 0.0 ± 0.0 |
| Pro | 1.186 ± 0.351 | 0.182 ± 0.119 | 1.277 ± 0.282 | 2.555 ± 0.58 | 1.004 ± 0.378 | 1.277 ± 0.414 | 0.912 ± 0.276 | 2.555 ± 0.591 | 2.099 ± 0.396 | 1.916 ± 0.358 | 0.912 ± 0.387 | 1.642 ± 0.565 | 1.551 ± 0.338 | 0.821 ± 0.234 | 0.821 ± 0.24 | 1.277 ± 0.278 | 2.19 ± 0.37 | 1.46 ± 0.35 | 0.456 ± 0.202 | 1.186 ± 0.3 | 0.0 ± 0.0 |
| Gln | 3.011 ± 0.51 | 0.182 ± 0.139 | 2.007 ± 0.324 | 3.011 ± 0.508 | 1.186 ± 0.371 | 2.737 ± 0.54 | 0.912 ± 0.267 | 1.916 ± 0.434 | 3.65 ± 0.46 | 2.372 ± 0.445 | 1.095 ± 0.277 | 1.734 ± 0.326 | 1.277 ± 0.324 | 2.007 ± 0.403 | 1.734 ± 0.421 | 1.551 ± 0.304 | 2.19 ± 0.638 | 2.007 ± 0.359 | 0.639 ± 0.215 | 1.277 ± 0.342 | 0.0 ± 0.0 |
| Arg | 2.281 ± 0.598 | 0.182 ± 0.131 | 2.19 ± 0.415 | 3.467 ± 0.488 | 2.464 ± 0.546 | 3.65 ± 0.451 | 0.821 ± 0.26 | 3.741 ± 0.44 | 3.65 ± 0.566 | 5.018 ± 0.568 | 1.734 ± 0.408 | 2.828 ± 0.422 | 0.639 ± 0.265 | 1.277 ± 0.367 | 2.737 ± 0.627 | 1.916 ± 0.394 | 2.555 ± 0.537 | 3.467 ± 0.716 | 1.004 ± 0.304 | 1.825 ± 0.401 | 0.0 ± 0.0 |
| Ser | 2.555 ± 0.528 | 0.547 ± 0.259 | 2.737 ± 0.497 | 4.471 ± 0.508 | 2.646 ± 0.385 | 3.193 ± 0.46 | 0.456 ± 0.179 | 3.193 ± 0.533 | 4.562 ± 0.857 | 3.467 ± 0.698 | 1.916 ± 0.392 | 2.281 ± 0.368 | 0.821 ± 0.248 | 1.369 ± 0.343 | 2.92 ± 0.64 | 2.646 ± 0.388 | 3.011 ± 0.537 | 4.288 ± 0.799 | 0.73 ± 0.33 | 3.285 ± 0.583 | 0.0 ± 0.0 |
| Thr | 3.376 ± 0.861 | 0.365 ± 0.157 | 3.193 ± 0.531 | 4.38 ± 0.611 | 2.646 ± 0.537 | 4.562 ± 0.514 | 0.912 ± 0.293 | 4.106 ± 0.441 | 5.839 ± 0.77 | 6.113 ± 0.678 | 1.277 ± 0.283 | 2.92 ± 0.813 | 2.464 ± 0.492 | 2.646 ± 0.513 | 3.011 ± 0.417 | 2.19 ± 0.367 | 3.923 ± 0.648 | 4.288 ± 0.737 | 0.821 ± 0.345 | 1.642 ± 0.348 | 0.0 ± 0.0 |
| Val | 3.102 ± 0.5 | 0.639 ± 0.289 | 4.562 ± 0.812 | 6.113 ± 0.996 | 2.281 ± 0.396 | 3.558 ± 0.561 | 0.639 ± 0.23 | 4.653 ± 0.693 | 5.474 ± 0.748 | 3.558 ± 0.57 | 1.186 ± 0.325 | 4.288 ± 0.46 | 1.642 ± 0.426 | 2.828 ± 0.396 | 2.737 ± 0.463 | 4.106 ± 0.508 | 3.923 ± 0.735 | 3.376 ± 0.615 | 0.365 ± 0.205 | 1.916 ± 0.386 | 0.0 ± 0.0 |
| Trp | 0.639 ± 0.298 | 0.091 ± 0.09 | 1.277 ± 0.352 | 1.551 ± 0.386 | 1.642 ± 0.727 | 0.547 ± 0.195 | 0.456 ± 0.173 | 1.004 ± 0.361 | 1.369 ± 0.33 | 1.734 ± 0.471 | 0.182 ± 0.104 | 0.639 ± 0.294 | 0.365 ± 0.2 | 0.547 ± 0.17 | 0.639 ± 0.257 | 1.095 ± 0.397 | 0.912 ± 0.262 | 1.004 ± 0.268 | 1.095 ± 0.775 | 0.821 ± 0.332 | 0.0 ± 0.0 |
| Tyr | 2.099 ± 0.46 | 0.365 ± 0.193 | 2.19 ± 0.348 | 3.741 ± 0.62 | 2.281 ± 0.538 | 1.551 ± 0.373 | 0.912 ± 0.309 | 2.372 ± 0.621 | 2.828 ± 0.534 | 3.285 ± 0.613 | 0.365 ± 0.18 | 1.734 ± 0.387 | 0.73 ± 0.276 | 1.551 ± 0.326 | 2.19 ± 0.416 | 1.825 ± 0.353 | 1.734 ± 0.401 | 2.372 ± 0.539 | 0.821 ± 0.269 | 2.007 ± 0.584 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 56 proteins (10961 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
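A protein-level bootstrap of this kind might look like the following Python sketch. It rests on several assumptions that the page does not confirm: that "×100" means 100 resamples, that whole proteins are drawn with replacement, and that the reported error is the standard deviation of the resampled frequencies. The function name and parameters are hypothetical.

```python
import random
import statistics
from collections import Counter


def bootstrap_error_permille(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the spread of one dipeptide frequency (in per mille).

    Whole proteins are resampled with replacement, so residues within a
    protein stay together. The standard deviation of the resampled
    frequencies is returned (an assumed definition of the error).
    """
    rng = random.Random(seed)
    # Count overlapping residue pairs once per protein.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[dipeptide] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKELLKAG", "MKKLEEAG", "MAGKKEEL"]  # hypothetical toy sequences
    print(round(bootstrap_error_permille(toy, "KK"), 3))
```

Resampling whole proteins rather than individual residue pairs keeps pairs from the same protein together, which is one plausible reading of "at the protein level".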

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski