Amino acid dipeptide frequency for Enterobacteria phage PRD1 (Bacteriophage PRD1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
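The counting described above can be sketched in Python. This is a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_permille`, the toy sequences, and the choice to normalise by the total number of counted pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes; "X" stands for Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
    "X": "Xaa",
}


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Overlapping pairs are counted within each protein only, so no pair
    spans two proteins. Letters outside the 20 standard residues are
    mapped to X (Xaa). Normalising by the total number of counted pairs
    is an assumption of this sketch.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in ONE_TO_THREE else "X" for c in seq.upper())
        for first, second in zip(cleaned, cleaned[1:]):
            counts[ONE_TO_THREE[first] + ONE_TO_THREE[second]] += 1
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400

# Hypothetical toy sequences, not PRD1 proteins.
toy = ["AAG", "GAA"]
freqs = dipeptide_permille(toy)
overrepresented = sorted(k for k, v in freqs.items() if v > UNIFORM_EXPECTATION)
```

For the toy input, four pairs are counted (AlaAla twice, AlaGly and GlyAla once each), so `overrepresented` contains exactly those three dipeptides. With real proteome data, the same comparison against 2.5‰ separates the entries that would be marked red from those marked blue.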

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error, grouped by first residue. Residue order: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 19.753 ± 2.649
AlaCys: 0.798 ± 0.334
AlaAsp: 2.993 ± 0.608
AlaGlu: 5.587 ± 1.59
AlaPhe: 3.192 ± 0.531
AlaGly: 12.969 ± 1.905
AlaHis: 0.798 ± 0.374
AlaIle: 6.784 ± 1.117
AlaLys: 6.584 ± 1.547
AlaLeu: 9.377 ± 1.515
AlaMet: 3.99 ± 1.028
AlaAsn: 5.786 ± 1.674
AlaPro: 3.99 ± 0.955
AlaGln: 4.988 ± 1.202
AlaArg: 3.192 ± 0.809
AlaSer: 6.584 ± 1.277
AlaThr: 3.591 ± 0.776
AlaVal: 9.577 ± 1.486
AlaTrp: 0.798 ± 0.352
AlaTyr: 3.99 ± 1.171
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.998 ± 0.474
CysCys: 0.0 ± 0.0
CysAsp: 0.599 ± 0.302
CysGlu: 0.599 ± 0.252
CysPhe: 0.2 ± 0.157
CysGly: 0.599 ± 0.318
CysHis: 0.0 ± 0.0
CysIle: 0.798 ± 0.453
CysLys: 0.0 ± 0.0
CysLeu: 0.399 ± 0.264
CysMet: 0.0 ± 0.0
CysAsn: 0.798 ± 0.579
CysPro: 0.798 ± 0.406
CysGln: 0.798 ± 0.353
CysArg: 0.2 ± 0.17
CysSer: 0.599 ± 0.459
CysThr: 0.0 ± 0.0
CysVal: 0.599 ± 0.354
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.99 ± 0.734
AspCys: 0.599 ± 0.351
AspAsp: 1.596 ± 0.625
AspGlu: 3.591 ± 0.917
AspPhe: 3.791 ± 0.927
AspGly: 1.995 ± 0.666
AspHis: 0.2 ± 0.17
AspIle: 2.594 ± 0.61
AspLys: 2.394 ± 0.504
AspLeu: 5.188 ± 0.782
AspMet: 0.798 ± 0.525
AspAsn: 2.993 ± 0.852
AspPro: 2.993 ± 0.927
AspGln: 1.397 ± 0.564
AspArg: 1.197 ± 0.46
AspSer: 3.392 ± 0.72
AspThr: 2.594 ± 0.58
AspVal: 2.993 ± 0.638
AspTrp: 0.2 ± 0.202
AspTyr: 2.195 ± 0.709
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.584 ± 1.379
GluCys: 0.399 ± 0.296
GluAsp: 0.998 ± 0.372
GluGlu: 2.594 ± 0.684
GluPhe: 2.195 ± 0.588
GluGly: 3.791 ± 0.97
GluHis: 1.197 ± 0.442
GluIle: 4.589 ± 0.817
GluLys: 3.591 ± 1.055
GluLeu: 2.793 ± 0.719
GluMet: 2.195 ± 0.734
GluAsn: 4.389 ± 0.845
GluPro: 2.594 ± 0.72
GluGln: 1.796 ± 0.707
GluArg: 0.998 ± 0.563
GluSer: 2.195 ± 0.636
GluThr: 4.789 ± 1.03
GluVal: 1.796 ± 0.444
GluTrp: 1.397 ± 0.584
GluTyr: 1.596 ± 0.584
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.591 ± 0.717
PheCys: 0.0 ± 0.0
PheAsp: 3.192 ± 0.674
PheGlu: 1.995 ± 0.713
PhePhe: 0.998 ± 0.322
PheGly: 4.589 ± 0.806
PheHis: 0.798 ± 0.507
PheIle: 3.791 ± 1.36
PheLys: 0.998 ± 0.456
PheLeu: 2.993 ± 0.586
PheMet: 1.397 ± 0.389
PheAsn: 2.394 ± 0.801
PhePro: 1.796 ± 0.553
PheGln: 1.796 ± 0.781
PheArg: 2.394 ± 0.625
PheSer: 2.793 ± 0.629
PheThr: 2.195 ± 0.678
PheVal: 2.793 ± 0.885
PheTrp: 0.0 ± 0.0
PheTyr: 1.197 ± 0.719
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 11.173 ± 1.8
GlyCys: 0.599 ± 0.348
GlyAsp: 4.988 ± 0.995
GlyGlu: 3.99 ± 1.144
GlyPhe: 4.589 ± 1.05
GlyGly: 10.176 ± 2.825
GlyHis: 0.0 ± 0.0
GlyIle: 4.988 ± 1.045
GlyLys: 6.185 ± 1.158
GlyLeu: 5.786 ± 0.924
GlyMet: 1.796 ± 0.655
GlyAsn: 3.99 ± 1.078
GlyPro: 3.791 ± 0.866
GlyGln: 2.594 ± 0.635
GlyArg: 3.192 ± 0.806
GlySer: 5.587 ± 1.444
GlyThr: 3.791 ± 1.274
GlyVal: 6.185 ± 1.232
GlyTrp: 0.798 ± 0.38
GlyTyr: 3.591 ± 0.681
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.998 ± 0.316
HisCys: 0.399 ± 0.259
HisAsp: 0.399 ± 0.275
HisGlu: 0.2 ± 0.17
HisPhe: 0.998 ± 0.419
HisGly: 0.399 ± 0.27
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.798 ± 0.507
HisLeu: 1.197 ± 0.424
HisMet: 0.0 ± 0.0
HisAsn: 0.798 ± 0.422
HisPro: 0.599 ± 0.354
HisGln: 0.0 ± 0.0
HisArg: 0.399 ± 0.249
HisSer: 0.599 ± 0.331
HisThr: 0.399 ± 0.23
HisVal: 0.798 ± 0.333
HisTrp: 0.2 ± 0.216
HisTyr: 0.599 ± 0.313
HisXaa: 0.0 ± 0.0

Ile
IleAla: 7.382 ± 0.839
IleCys: 0.2 ± 0.17
IleAsp: 4.19 ± 0.662
IleGlu: 3.392 ± 0.846
IlePhe: 1.995 ± 0.72
IleGly: 3.99 ± 0.906
IleHis: 0.599 ± 0.354
IleIle: 6.784 ± 1.351
IleLys: 4.389 ± 0.932
IleLeu: 6.185 ± 1.114
IleMet: 0.998 ± 0.437
IleAsn: 3.791 ± 0.944
IlePro: 3.591 ± 0.939
IleGln: 1.796 ± 0.566
IleArg: 1.796 ± 0.474
IleSer: 4.389 ± 0.825
IleThr: 3.791 ± 0.912
IleVal: 4.389 ± 0.912
IleTrp: 0.399 ± 0.262
IleTyr: 3.791 ± 0.677
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.784 ± 1.502
LysCys: 0.399 ± 0.289
LysAsp: 0.998 ± 0.477
LysGlu: 1.995 ± 0.685
LysPhe: 3.392 ± 0.821
LysGly: 3.591 ± 0.914
LysHis: 0.798 ± 0.333
LysIle: 3.591 ± 1.035
LysLys: 4.19 ± 1.385
LysLeu: 3.791 ± 0.755
LysMet: 1.197 ± 0.474
LysAsn: 2.195 ± 0.653
LysPro: 3.591 ± 1.0
LysGln: 1.796 ± 0.493
LysArg: 3.392 ± 1.0
LysSer: 1.995 ± 0.667
LysThr: 2.793 ± 0.602
LysVal: 3.591 ± 0.953
LysTrp: 0.399 ± 0.272
LysTyr: 2.594 ± 1.014
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.579 ± 1.745
LeuCys: 0.599 ± 0.368
LeuAsp: 3.99 ± 0.815
LeuGlu: 4.19 ± 0.927
LeuPhe: 3.392 ± 0.775
LeuGly: 5.786 ± 1.167
LeuHis: 0.798 ± 0.323
LeuIle: 5.188 ± 1.205
LeuLys: 4.389 ± 1.053
LeuLeu: 5.387 ± 1.22
LeuMet: 3.791 ± 0.857
LeuAsn: 3.192 ± 0.844
LeuPro: 3.99 ± 1.305
LeuGln: 3.791 ± 1.372
LeuArg: 2.594 ± 0.54
LeuSer: 4.19 ± 1.234
LeuThr: 5.786 ± 1.241
LeuVal: 4.389 ± 0.749
LeuTrp: 1.197 ± 0.423
LeuTyr: 2.793 ± 0.579
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.99 ± 1.125
MetCys: 0.0 ± 0.0
MetAsp: 0.998 ± 0.416
MetGlu: 1.995 ± 0.655
MetPhe: 0.998 ± 0.439
MetGly: 2.195 ± 0.697
MetHis: 0.0 ± 0.0
MetIle: 1.796 ± 0.67
MetLys: 1.995 ± 0.726
MetLeu: 0.998 ± 0.511
MetMet: 0.798 ± 0.528
MetAsn: 1.197 ± 0.371
MetPro: 2.195 ± 0.608
MetGln: 1.596 ± 0.557
MetArg: 0.998 ± 0.509
MetSer: 1.397 ± 0.711
MetThr: 2.195 ± 0.644
MetVal: 1.197 ± 0.413
MetTrp: 0.0 ± 0.0
MetTyr: 0.798 ± 0.336
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.392 ± 0.868
AsnCys: 0.798 ± 0.352
AsnAsp: 2.394 ± 0.792
AsnGlu: 1.197 ± 0.611
AsnPhe: 2.195 ± 0.49
AsnGly: 4.19 ± 1.25
AsnHis: 0.798 ± 0.458
AsnIle: 3.791 ± 1.123
AsnLys: 0.599 ± 0.327
AsnLeu: 3.791 ± 1.174
AsnMet: 0.599 ± 0.329
AsnAsn: 2.594 ± 0.662
AsnPro: 4.389 ± 1.652
AsnGln: 2.993 ± 0.589
AsnArg: 2.394 ± 0.666
AsnSer: 3.392 ± 0.679
AsnThr: 3.99 ± 0.979
AsnVal: 2.793 ± 0.958
AsnTrp: 1.397 ± 0.673
AsnTyr: 2.993 ± 0.537
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 7.183 ± 1.198
ProCys: 0.399 ± 0.315
ProAsp: 2.793 ± 0.672
ProGlu: 3.791 ± 1.16
ProPhe: 2.394 ± 0.756
ProGly: 3.192 ± 0.953
ProHis: 0.998 ± 0.451
ProIle: 2.993 ± 0.659
ProLys: 2.594 ± 0.713
ProLeu: 3.99 ± 1.18
ProMet: 1.796 ± 0.63
ProAsn: 1.995 ± 0.484
ProPro: 1.397 ± 0.694
ProGln: 1.197 ± 0.416
ProArg: 2.195 ± 0.857
ProSer: 2.195 ± 0.67
ProThr: 1.796 ± 0.626
ProVal: 6.784 ± 0.93
ProTrp: 0.399 ± 0.286
ProTyr: 1.197 ± 0.484
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.789 ± 1.304
GlnCys: 0.2 ± 0.225
GlnAsp: 1.197 ± 0.518
GlnGlu: 2.793 ± 1.04
GlnPhe: 1.596 ± 0.726
GlnGly: 2.394 ± 0.79
GlnHis: 0.2 ± 0.17
GlnIle: 2.594 ± 0.775
GlnLys: 1.197 ± 0.45
GlnLeu: 4.389 ± 1.382
GlnMet: 1.197 ± 0.552
GlnAsn: 1.397 ± 0.549
GlnPro: 1.397 ± 0.471
GlnGln: 2.793 ± 1.037
GlnArg: 3.192 ± 1.027
GlnSer: 2.993 ± 1.086
GlnThr: 2.993 ± 0.71
GlnVal: 2.594 ± 0.653
GlnTrp: 0.599 ± 0.313
GlnTyr: 1.197 ± 0.525
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.793 ± 0.607
ArgCys: 0.2 ± 0.17
ArgAsp: 2.394 ± 0.729
ArgGlu: 1.397 ± 0.524
ArgPhe: 1.796 ± 0.656
ArgGly: 3.99 ± 0.893
ArgHis: 0.599 ± 0.302
ArgIle: 3.192 ± 0.669
ArgLys: 2.195 ± 0.645
ArgLeu: 2.793 ± 0.751
ArgMet: 0.599 ± 0.433
ArgAsn: 1.596 ± 0.458
ArgPro: 2.394 ± 0.722
ArgGln: 1.596 ± 0.585
ArgArg: 2.195 ± 0.771
ArgSer: 1.596 ± 0.555
ArgThr: 2.195 ± 0.656
ArgVal: 2.793 ± 0.602
ArgTrp: 0.399 ± 0.315
ArgTyr: 0.798 ± 0.344
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.587 ± 1.323
SerCys: 0.399 ± 0.279
SerAsp: 3.192 ± 0.523
SerGlu: 1.596 ± 0.531
SerPhe: 1.596 ± 0.592
SerGly: 8.579 ± 2.321
SerHis: 0.2 ± 0.243
SerIle: 3.791 ± 0.788
SerLys: 3.192 ± 0.778
SerLeu: 3.99 ± 1.258
SerMet: 0.998 ± 0.396
SerAsn: 3.791 ± 0.896
SerPro: 2.793 ± 0.712
SerGln: 2.195 ± 0.591
SerArg: 1.995 ± 0.522
SerSer: 3.591 ± 0.802
SerThr: 2.594 ± 1.21
SerVal: 3.791 ± 0.842
SerTrp: 1.197 ± 0.455
SerTyr: 1.397 ± 0.686
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.784 ± 1.46
ThrCys: 0.599 ± 0.342
ThrAsp: 3.392 ± 0.947
ThrGlu: 4.589 ± 0.744
ThrPhe: 1.796 ± 0.471
ThrGly: 6.385 ± 1.102
ThrHis: 0.2 ± 0.157
ThrIle: 3.99 ± 0.881
ThrLys: 1.397 ± 0.544
ThrLeu: 5.188 ± 0.71
ThrMet: 1.995 ± 0.504
ThrAsn: 1.596 ± 0.495
ThrPro: 2.993 ± 0.854
ThrGln: 2.594 ± 0.809
ThrArg: 0.998 ± 0.415
ThrSer: 2.993 ± 0.713
ThrThr: 2.793 ± 0.69
ThrVal: 4.589 ± 1.65
ThrTrp: 0.399 ± 0.294
ThrTyr: 1.397 ± 0.557
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.183 ± 0.926
ValCys: 0.399 ± 0.317
ValAsp: 2.993 ± 0.972
ValGlu: 5.188 ± 0.866
ValPhe: 1.397 ± 0.551
ValGly: 3.791 ± 0.909
ValHis: 0.399 ± 0.292
ValIle: 4.19 ± 0.909
ValLys: 3.791 ± 0.926
ValLeu: 4.589 ± 1.165
ValMet: 1.796 ± 0.531
ValAsn: 4.19 ± 0.845
ValPro: 4.19 ± 0.939
ValGln: 2.993 ± 0.703
ValArg: 1.995 ± 0.786
ValSer: 3.99 ± 0.872
ValThr: 6.385 ± 1.416
ValVal: 3.591 ± 1.372
ValTrp: 0.798 ± 0.514
ValTyr: 3.392 ± 0.716
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.599 ± 0.347
TrpCys: 0.2 ± 0.17
TrpAsp: 0.798 ± 0.463
TrpGlu: 0.998 ± 0.432
TrpPhe: 0.599 ± 0.345
TrpGly: 1.596 ± 0.649
TrpHis: 0.399 ± 0.263
TrpIle: 0.399 ± 0.269
TrpLys: 0.2 ± 0.216
TrpLeu: 2.394 ± 0.58
TrpMet: 0.399 ± 0.289
TrpAsn: 0.2 ± 0.237
TrpPro: 0.399 ± 0.241
TrpGln: 0.2 ± 0.204
TrpArg: 0.399 ± 0.274
TrpSer: 0.2 ± 0.204
TrpThr: 0.0 ± 0.0
TrpVal: 1.197 ± 0.399
TrpTrp: 0.599 ± 0.384
TrpTyr: 0.599 ± 0.298
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.392 ± 0.757
TyrCys: 0.798 ± 0.352
TyrAsp: 2.594 ± 0.697
TyrGlu: 0.798 ± 0.313
TyrPhe: 2.594 ± 0.742
TyrGly: 3.591 ± 0.532
TyrHis: 0.798 ± 0.368
TyrIle: 1.995 ± 0.514
TyrLys: 2.394 ± 0.978
TyrLeu: 2.793 ± 0.735
TyrMet: 0.798 ± 0.408
TyrAsn: 1.796 ± 0.406
TyrPro: 1.397 ± 0.367
TyrGln: 2.793 ± 0.824
TyrArg: 1.796 ± 0.574
TyrSer: 1.796 ± 0.472
TyrThr: 1.995 ± 0.628
TyrVal: 0.798 ± 0.394
TyrTrp: 1.197 ± 0.4
TyrTyr: 1.995 ± 0.694
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 31 proteins (5013 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
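A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative sketch only: the function names, the use of the standard deviation across resamples as the reported error, and the fixed random seed are assumptions, not details confirmed by the source.

```python
import random
import statistics
from collections import Counter


def pair_frequency(sequences, pair):
    """Frequency (per mille) of a one-letter dipeptide such as 'AA'."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs within one protein; no pair spans two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Whole proteins are drawn with replacement, keeping the number of
    proteins fixed. Reporting the sample standard deviation is an
    assumption of this sketch.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not PRD1 proteins.
toy = ["AAAA", "GGGG", "AGAG"]
error_aa = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact. A dipeptide that never occurs in any protein has frequency 0 in every resample, which matches the 0.0 ± 0.0 entries in the table.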

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski