Amino acid dipepetide frequency for Bacillus phage PBC1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.601AlaAla: 6.601 ± 1.555
0.398AlaCys: 0.398 ± 0.179
4.295AlaAsp: 4.295 ± 0.565
3.976AlaGlu: 3.976 ± 0.602
2.784AlaPhe: 2.784 ± 0.568
5.01AlaGly: 5.01 ± 0.533
1.352AlaHis: 1.352 ± 0.419
6.362AlaIle: 6.362 ± 1.194
6.68AlaLys: 6.68 ± 0.818
6.84AlaLeu: 6.84 ± 0.85
3.181AlaMet: 3.181 ± 0.641
4.613AlaAsn: 4.613 ± 0.703
2.545AlaPro: 2.545 ± 0.42
3.181AlaGln: 3.181 ± 0.482
3.817AlaArg: 3.817 ± 0.564
5.408AlaSer: 5.408 ± 0.686
4.533AlaThr: 4.533 ± 0.572
5.169AlaVal: 5.169 ± 0.7
1.113AlaTrp: 1.113 ± 0.427
2.068AlaTyr: 2.068 ± 0.368
0.0AlaXaa: 0.0 ± 0.0
Cys
0.318CysAla: 0.318 ± 0.199
0.08CysCys: 0.08 ± 0.078
0.239CysAsp: 0.239 ± 0.117
0.239CysGlu: 0.239 ± 0.15
0.159CysPhe: 0.159 ± 0.095
0.477CysGly: 0.477 ± 0.275
0.239CysHis: 0.239 ± 0.136
0.159CysIle: 0.159 ± 0.124
0.557CysLys: 0.557 ± 0.226
0.636CysLeu: 0.636 ± 0.205
0.08CysMet: 0.08 ± 0.077
0.557CysAsn: 0.557 ± 0.252
0.557CysPro: 0.557 ± 0.268
0.398CysGln: 0.398 ± 0.204
0.398CysArg: 0.398 ± 0.195
0.398CysSer: 0.398 ± 0.167
0.636CysThr: 0.636 ± 0.308
0.318CysVal: 0.318 ± 0.163
0.08CysTrp: 0.08 ± 0.073
0.398CysTyr: 0.398 ± 0.195
0.0CysXaa: 0.0 ± 0.0
Asp
4.215AspAla: 4.215 ± 0.429
0.318AspCys: 0.318 ± 0.165
2.863AspAsp: 2.863 ± 0.581
5.408AspGlu: 5.408 ± 0.909
2.624AspPhe: 2.624 ± 0.453
4.295AspGly: 4.295 ± 0.552
1.193AspHis: 1.193 ± 0.285
4.692AspIle: 4.692 ± 0.675
5.488AspLys: 5.488 ± 0.517
5.09AspLeu: 5.09 ± 0.667
2.784AspMet: 2.784 ± 0.435
4.295AspAsn: 4.295 ± 0.9
2.227AspPro: 2.227 ± 0.484
1.432AspGln: 1.432 ± 0.305
2.704AspArg: 2.704 ± 0.551
2.465AspSer: 2.465 ± 0.362
3.579AspThr: 3.579 ± 0.616
3.817AspVal: 3.817 ± 0.485
1.75AspTrp: 1.75 ± 0.38
2.704AspTyr: 2.704 ± 0.538
0.0AspXaa: 0.0 ± 0.0
Glu
5.965GluAla: 5.965 ± 0.791
0.477GluCys: 0.477 ± 0.214
5.09GluAsp: 5.09 ± 0.933
6.84GluGlu: 6.84 ± 1.152
3.181GluPhe: 3.181 ± 0.446
4.613GluGly: 4.613 ± 0.493
0.954GluHis: 0.954 ± 0.283
4.136GluIle: 4.136 ± 0.644
4.374GluLys: 4.374 ± 0.529
6.521GluLeu: 6.521 ± 0.608
2.068GluMet: 2.068 ± 0.367
2.465GluAsn: 2.465 ± 0.374
2.704GluPro: 2.704 ± 0.561
3.261GluGln: 3.261 ± 0.52
3.738GluArg: 3.738 ± 0.575
2.784GluSer: 2.784 ± 0.398
2.784GluThr: 2.784 ± 0.466
5.09GluVal: 5.09 ± 0.62
0.875GluTrp: 0.875 ± 0.251
2.545GluTyr: 2.545 ± 0.561
0.0GluXaa: 0.0 ± 0.0
Phe
2.227PheAla: 2.227 ± 0.406
0.398PheCys: 0.398 ± 0.162
2.943PheAsp: 2.943 ± 0.596
2.784PheGlu: 2.784 ± 0.474
1.193PhePhe: 1.193 ± 0.325
2.068PheGly: 2.068 ± 0.349
0.954PheHis: 0.954 ± 0.252
2.545PheIle: 2.545 ± 0.421
2.465PheLys: 2.465 ± 0.464
2.624PheLeu: 2.624 ± 0.375
1.432PheMet: 1.432 ± 0.275
2.624PheAsn: 2.624 ± 0.383
0.795PhePro: 0.795 ± 0.231
1.034PheGln: 1.034 ± 0.362
1.272PheArg: 1.272 ± 0.339
2.624PheSer: 2.624 ± 0.388
3.181PheThr: 3.181 ± 0.585
1.829PheVal: 1.829 ± 0.329
0.08PheTrp: 0.08 ± 0.074
1.511PheTyr: 1.511 ± 0.255
0.0PheXaa: 0.0 ± 0.0
Gly
5.09GlyAla: 5.09 ± 0.554
0.477GlyCys: 0.477 ± 0.194
3.34GlyAsp: 3.34 ± 0.492
4.374GlyGlu: 4.374 ± 0.775
2.227GlyPhe: 2.227 ± 0.428
4.533GlyGly: 4.533 ± 0.724
1.193GlyHis: 1.193 ± 0.289
4.613GlyIle: 4.613 ± 0.695
6.68GlyLys: 6.68 ± 0.83
5.249GlyLeu: 5.249 ± 0.608
1.988GlyMet: 1.988 ± 0.46
2.465GlyAsn: 2.465 ± 0.393
1.432GlyPro: 1.432 ± 0.479
1.909GlyGln: 1.909 ± 0.35
2.943GlyArg: 2.943 ± 0.557
3.34GlySer: 3.34 ± 0.728
3.897GlyThr: 3.897 ± 0.719
4.772GlyVal: 4.772 ± 0.618
0.954GlyTrp: 0.954 ± 0.283
3.102GlyTyr: 3.102 ± 0.428
0.0GlyXaa: 0.0 ± 0.0
His
1.67HisAla: 1.67 ± 0.326
0.08HisCys: 0.08 ± 0.076
1.272HisAsp: 1.272 ± 0.404
1.034HisGlu: 1.034 ± 0.305
0.795HisPhe: 0.795 ± 0.262
1.272HisGly: 1.272 ± 0.293
0.477HisHis: 0.477 ± 0.19
0.954HisIle: 0.954 ± 0.255
0.636HisLys: 0.636 ± 0.22
1.511HisLeu: 1.511 ± 0.338
0.716HisMet: 0.716 ± 0.257
0.875HisAsn: 0.875 ± 0.29
0.636HisPro: 0.636 ± 0.219
0.557HisGln: 0.557 ± 0.188
0.875HisArg: 0.875 ± 0.232
1.193HisSer: 1.193 ± 0.35
1.511HisThr: 1.511 ± 0.463
0.875HisVal: 0.875 ± 0.273
0.477HisTrp: 0.477 ± 0.197
1.352HisTyr: 1.352 ± 0.322
0.0HisXaa: 0.0 ± 0.0
Ile
4.772IleAla: 4.772 ± 0.617
0.318IleCys: 0.318 ± 0.143
5.885IleAsp: 5.885 ± 0.604
4.931IleGlu: 4.931 ± 0.691
1.432IlePhe: 1.432 ± 0.398
3.34IleGly: 3.34 ± 0.432
1.432IleHis: 1.432 ± 0.29
2.943IleIle: 2.943 ± 0.633
5.249IleLys: 5.249 ± 1.241
3.976IleLeu: 3.976 ± 0.483
2.465IleMet: 2.465 ± 0.487
2.465IleAsn: 2.465 ± 0.499
3.261IlePro: 3.261 ± 0.795
2.545IleGln: 2.545 ± 0.351
2.545IleArg: 2.545 ± 0.508
3.34IleSer: 3.34 ± 0.57
4.454IleThr: 4.454 ± 0.501
4.772IleVal: 4.772 ± 0.68
0.636IleTrp: 0.636 ± 0.501
2.863IleTyr: 2.863 ± 0.457
0.0IleXaa: 0.0 ± 0.0
Lys
7.635LysAla: 7.635 ± 0.878
0.318LysCys: 0.318 ± 0.208
4.454LysAsp: 4.454 ± 0.587
6.203LysGlu: 6.203 ± 0.678
2.624LysPhe: 2.624 ± 0.52
4.851LysGly: 4.851 ± 0.514
1.67LysHis: 1.67 ± 0.39
4.136LysIle: 4.136 ± 0.593
6.362LysLys: 6.362 ± 0.889
6.601LysLeu: 6.601 ± 0.626
2.863LysMet: 2.863 ± 0.449
2.306LysAsn: 2.306 ± 0.489
3.499LysPro: 3.499 ± 0.571
3.34LysGln: 3.34 ± 0.526
3.102LysArg: 3.102 ± 0.501
4.374LysSer: 4.374 ± 0.807
4.374LysThr: 4.374 ± 0.68
5.328LysVal: 5.328 ± 0.861
1.034LysTrp: 1.034 ± 0.259
3.261LysTyr: 3.261 ± 0.441
0.0LysXaa: 0.0 ± 0.0
Leu
6.203LeuAla: 6.203 ± 0.907
0.636LeuCys: 0.636 ± 0.251
5.567LeuAsp: 5.567 ± 0.643
5.488LeuGlu: 5.488 ± 0.611
2.545LeuPhe: 2.545 ± 0.356
4.374LeuGly: 4.374 ± 0.516
1.511LeuHis: 1.511 ± 0.436
4.374LeuIle: 4.374 ± 0.661
5.806LeuLys: 5.806 ± 0.524
5.249LeuLeu: 5.249 ± 0.692
2.624LeuMet: 2.624 ± 0.542
4.295LeuAsn: 4.295 ± 0.579
3.181LeuPro: 3.181 ± 0.613
2.545LeuGln: 2.545 ± 0.449
3.42LeuArg: 3.42 ± 0.517
6.442LeuSer: 6.442 ± 0.625
5.328LeuThr: 5.328 ± 0.537
4.215LeuVal: 4.215 ± 0.643
1.034LeuTrp: 1.034 ± 0.274
1.988LeuTyr: 1.988 ± 0.409
0.0LeuXaa: 0.0 ± 0.0
Met
2.624MetAla: 2.624 ± 0.557
0.08MetCys: 0.08 ± 0.083
2.147MetAsp: 2.147 ± 0.352
2.147MetGlu: 2.147 ± 0.458
1.511MetPhe: 1.511 ± 0.309
2.784MetGly: 2.784 ± 0.413
0.716MetHis: 0.716 ± 0.184
2.147MetIle: 2.147 ± 0.46
2.784MetLys: 2.784 ± 0.472
2.545MetLeu: 2.545 ± 0.46
1.113MetMet: 1.113 ± 0.32
1.988MetAsn: 1.988 ± 0.486
1.193MetPro: 1.193 ± 0.269
1.352MetGln: 1.352 ± 0.273
1.113MetArg: 1.113 ± 0.32
2.386MetSer: 2.386 ± 0.515
1.75MetThr: 1.75 ± 0.332
1.113MetVal: 1.113 ± 0.301
0.08MetTrp: 0.08 ± 0.069
1.113MetTyr: 1.113 ± 0.304
0.0MetXaa: 0.0 ± 0.0
Asn
4.931AsnAla: 4.931 ± 1.105
0.875AsnCys: 0.875 ± 0.333
2.227AsnAsp: 2.227 ± 0.39
2.943AsnGlu: 2.943 ± 0.629
2.068AsnPhe: 2.068 ± 0.411
5.647AsnGly: 5.647 ± 1.206
1.034AsnHis: 1.034 ± 0.239
3.261AsnIle: 3.261 ± 0.456
2.784AsnLys: 2.784 ± 0.394
3.976AsnLeu: 3.976 ± 0.542
1.432AsnMet: 1.432 ± 0.486
2.704AsnAsn: 2.704 ± 0.494
1.272AsnPro: 1.272 ± 0.285
1.591AsnGln: 1.591 ± 0.319
1.988AsnArg: 1.988 ± 0.434
3.499AsnSer: 3.499 ± 0.691
1.988AsnThr: 1.988 ± 0.443
4.295AsnVal: 4.295 ± 0.605
0.636AsnTrp: 0.636 ± 0.204
1.591AsnTyr: 1.591 ± 0.414
0.0AsnXaa: 0.0 ± 0.0
Pro
3.579ProAla: 3.579 ± 0.464
0.239ProCys: 0.239 ± 0.128
2.863ProAsp: 2.863 ± 0.49
2.227ProGlu: 2.227 ± 0.391
0.795ProPhe: 0.795 ± 0.235
0.716ProGly: 0.716 ± 0.214
0.477ProHis: 0.477 ± 0.178
2.465ProIle: 2.465 ± 0.416
3.261ProLys: 3.261 ± 0.696
2.704ProLeu: 2.704 ± 0.391
1.272ProMet: 1.272 ± 0.239
2.465ProAsn: 2.465 ± 0.562
1.75ProPro: 1.75 ± 0.357
0.954ProGln: 0.954 ± 0.207
1.352ProArg: 1.352 ± 0.368
1.909ProSer: 1.909 ± 0.357
2.784ProThr: 2.784 ± 0.691
3.261ProVal: 3.261 ± 0.518
0.636ProTrp: 0.636 ± 0.193
1.829ProTyr: 1.829 ± 0.427
0.0ProXaa: 0.0 ± 0.0
Gln
4.374GlnAla: 4.374 ± 0.882
0.159GlnCys: 0.159 ± 0.124
1.829GlnAsp: 1.829 ± 0.383
2.704GlnGlu: 2.704 ± 0.398
1.113GlnPhe: 1.113 ± 0.311
2.465GlnGly: 2.465 ± 0.358
0.477GlnHis: 0.477 ± 0.183
2.624GlnIle: 2.624 ± 0.399
2.784GlnLys: 2.784 ± 0.422
3.022GlnLeu: 3.022 ± 0.362
0.716GlnMet: 0.716 ± 0.231
1.193GlnAsn: 1.193 ± 0.317
0.875GlnPro: 0.875 ± 0.25
1.988GlnGln: 1.988 ± 0.865
1.432GlnArg: 1.432 ± 0.364
1.829GlnSer: 1.829 ± 0.403
1.591GlnThr: 1.591 ± 0.383
2.545GlnVal: 2.545 ± 0.413
1.034GlnTrp: 1.034 ± 0.389
1.67GlnTyr: 1.67 ± 0.282
0.0GlnXaa: 0.0 ± 0.0
Arg
2.863ArgAla: 2.863 ± 0.424
0.318ArgCys: 0.318 ± 0.164
2.545ArgAsp: 2.545 ± 0.41
2.147ArgGlu: 2.147 ± 0.418
1.511ArgPhe: 1.511 ± 0.389
2.227ArgGly: 2.227 ± 0.33
0.795ArgHis: 0.795 ± 0.296
2.784ArgIle: 2.784 ± 0.454
3.897ArgLys: 3.897 ± 0.789
4.136ArgLeu: 4.136 ± 0.598
1.829ArgMet: 1.829 ± 0.381
1.432ArgAsn: 1.432 ± 0.359
2.068ArgPro: 2.068 ± 0.372
1.67ArgGln: 1.67 ± 0.316
1.988ArgArg: 1.988 ± 0.394
2.386ArgSer: 2.386 ± 0.483
2.704ArgThr: 2.704 ± 0.475
3.42ArgVal: 3.42 ± 0.541
0.477ArgTrp: 0.477 ± 0.192
1.511ArgTyr: 1.511 ± 0.447
0.0ArgXaa: 0.0 ± 0.0
Ser
5.169SerAla: 5.169 ± 0.927
0.398SerCys: 0.398 ± 0.203
4.851SerAsp: 4.851 ± 0.809
3.42SerGlu: 3.42 ± 0.5
1.829SerPhe: 1.829 ± 0.44
4.374SerGly: 4.374 ± 0.565
1.113SerHis: 1.113 ± 0.307
3.102SerIle: 3.102 ± 0.456
4.772SerLys: 4.772 ± 0.608
3.42SerLeu: 3.42 ± 0.435
1.67SerMet: 1.67 ± 0.349
3.022SerAsn: 3.022 ± 0.628
2.068SerPro: 2.068 ± 0.385
2.306SerGln: 2.306 ± 0.449
2.306SerArg: 2.306 ± 0.309
4.692SerSer: 4.692 ± 0.51
3.658SerThr: 3.658 ± 0.747
3.897SerVal: 3.897 ± 0.628
0.636SerTrp: 0.636 ± 0.241
2.068SerTyr: 2.068 ± 0.503
0.0SerXaa: 0.0 ± 0.0
Thr
4.215ThrAla: 4.215 ± 0.588
0.398ThrCys: 0.398 ± 0.196
3.579ThrAsp: 3.579 ± 0.586
5.01ThrGlu: 5.01 ± 0.969
3.34ThrPhe: 3.34 ± 0.435
5.169ThrGly: 5.169 ± 0.684
1.034ThrHis: 1.034 ± 0.305
3.579ThrIle: 3.579 ± 0.768
4.215ThrLys: 4.215 ± 0.719
3.579ThrLeu: 3.579 ± 0.445
1.113ThrMet: 1.113 ± 0.237
2.784ThrAsn: 2.784 ± 0.598
3.022ThrPro: 3.022 ± 0.55
1.988ThrGln: 1.988 ± 0.428
1.75ThrArg: 1.75 ± 0.325
2.863ThrSer: 2.863 ± 0.517
3.658ThrThr: 3.658 ± 0.717
4.692ThrVal: 4.692 ± 0.767
1.113ThrTrp: 1.113 ± 0.336
2.784ThrTyr: 2.784 ± 0.563
0.0ThrXaa: 0.0 ± 0.0
Val
4.136ValAla: 4.136 ± 0.658
0.239ValCys: 0.239 ± 0.139
4.295ValAsp: 4.295 ± 0.707
4.454ValGlu: 4.454 ± 0.52
2.784ValPhe: 2.784 ± 0.478
2.784ValGly: 2.784 ± 0.502
1.113ValHis: 1.113 ± 0.393
5.408ValIle: 5.408 ± 0.637
5.249ValLys: 5.249 ± 0.68
5.249ValLeu: 5.249 ± 0.677
1.909ValMet: 1.909 ± 0.34
3.579ValAsn: 3.579 ± 0.487
3.261ValPro: 3.261 ± 0.436
2.784ValGln: 2.784 ± 0.376
2.863ValArg: 2.863 ± 0.405
3.658ValSer: 3.658 ± 0.624
4.136ValThr: 4.136 ± 0.73
3.897ValVal: 3.897 ± 0.511
1.75ValTrp: 1.75 ± 0.84
3.261ValTyr: 3.261 ± 0.47
0.0ValXaa: 0.0 ± 0.0
Trp
0.875TrpAla: 0.875 ± 0.326
0.318TrpCys: 0.318 ± 0.15
1.272TrpAsp: 1.272 ± 0.249
0.875TrpGlu: 0.875 ± 0.305
0.318TrpPhe: 0.318 ± 0.14
0.795TrpGly: 0.795 ± 0.209
0.318TrpHis: 0.318 ± 0.146
1.272TrpIle: 1.272 ± 0.252
0.557TrpLys: 0.557 ± 0.19
1.113TrpLeu: 1.113 ± 0.293
0.318TrpMet: 0.318 ± 0.167
2.386TrpAsn: 2.386 ± 1.237
0.0TrpPro: 0.0 ± 0.0
0.398TrpGln: 0.398 ± 0.15
0.875TrpArg: 0.875 ± 0.235
0.954TrpSer: 0.954 ± 0.266
0.636TrpThr: 0.636 ± 0.179
0.795TrpVal: 0.795 ± 0.404
0.08TrpTrp: 0.08 ± 0.092
0.636TrpTyr: 0.636 ± 0.232
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.465TyrAla: 2.465 ± 0.546
0.398TyrCys: 0.398 ± 0.167
2.386TyrAsp: 2.386 ± 0.35
3.261TyrGlu: 3.261 ± 0.724
1.75TyrPhe: 1.75 ± 0.35
2.784TyrGly: 2.784 ± 0.525
0.716TyrHis: 0.716 ± 0.233
2.068TyrIle: 2.068 ± 0.472
3.738TyrLys: 3.738 ± 0.525
2.784TyrLeu: 2.784 ± 0.353
1.034TyrMet: 1.034 ± 0.327
2.227TyrAsn: 2.227 ± 0.461
1.193TyrPro: 1.193 ± 0.424
1.193TyrGln: 1.193 ± 0.307
2.227TyrArg: 2.227 ± 0.457
2.227TyrSer: 2.227 ± 0.461
2.784TyrThr: 2.784 ± 0.625
2.704TyrVal: 2.704 ± 0.467
0.318TyrTrp: 0.318 ± 0.16
1.432TyrTyr: 1.432 ± 0.397
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (12575 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski