Amino acid dipeptide frequency for Staphylococcus phage BP39

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
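The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_per_mille` and `classify` are hypothetical, dipeptides are assumed to be overlapping windows of two residues counted within each protein, and the normalisation by the total dipeptide count is an assumption, since the page does not state the exact denominator.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (same order as the table).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumptions: dipeptides are overlapping windows of length 2 counted
    within each protein (never across protein boundaries), and frequencies
    are normalised by the total number of dipeptides. Any residue outside
    the 20 standard letters is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

Under the uniform expectation of 2.5‰, a value such as IleAsp (8.965‰) would be labelled "over" and AlaAla (0.549‰) "under".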

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
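As a small worked example of the unit conversion, the helpers below (hypothetical names, written only for illustration) turn a per mille value from the table into a fraction or a percentage, which makes it directly comparable with the 0.25% uniform expectation.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# Uniform expectation for the 400 standard dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

# IleAsp is listed as 8.965 per mille in the table.
ile_asp_fraction = per_mille_to_fraction(8.965)
ile_asp_percent = per_mille_to_percent(8.965)
```

Here `ile_asp_percent` is about 0.9%, compared with the uniform expectation of 0.25%.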

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.549AlaAla: 0.549 ± 0.435
0.183AlaCys: 0.183 ± 0.197
1.647AlaAsp: 1.647 ± 0.394
1.647AlaGlu: 1.647 ± 0.639
1.647AlaPhe: 1.647 ± 0.442
2.744AlaGly: 2.744 ± 0.765
0.366AlaHis: 0.366 ± 0.242
3.11AlaIle: 3.11 ± 1.019
4.757AlaLys: 4.757 ± 0.881
4.025AlaLeu: 4.025 ± 0.952
0.732AlaMet: 0.732 ± 0.373
2.012AlaAsn: 2.012 ± 0.61
0.732AlaPro: 0.732 ± 0.422
0.732AlaGln: 0.732 ± 0.338
2.012AlaArg: 2.012 ± 0.468
1.647AlaSer: 1.647 ± 0.451
3.293AlaThr: 3.293 ± 0.981
3.11AlaVal: 3.11 ± 0.645
0.549AlaTrp: 0.549 ± 0.27
4.025AlaTyr: 4.025 ± 0.787
0.0AlaXaa: 0.0 ± 0.0
Cys
0.366CysAla: 0.366 ± 0.287
0.0CysCys: 0.0 ± 0.0
0.183CysAsp: 0.183 ± 0.152
0.183CysGlu: 0.183 ± 0.152
0.549CysPhe: 0.549 ± 0.345
0.366CysGly: 0.366 ± 0.211
0.183CysHis: 0.183 ± 0.152
0.366CysIle: 0.366 ± 0.272
0.0CysLys: 0.0 ± 0.0
0.549CysLeu: 0.549 ± 0.402
0.549CysMet: 0.549 ± 0.25
0.366CysAsn: 0.366 ± 0.209
0.0CysPro: 0.0 ± 0.0
0.183CysGln: 0.183 ± 0.197
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.732CysThr: 0.732 ± 0.262
0.549CysVal: 0.549 ± 0.366
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.293AspAla: 3.293 ± 0.553
0.183AspCys: 0.183 ± 0.152
8.416AspAsp: 8.416 ± 1.102
5.854AspGlu: 5.854 ± 1.135
4.208AspPhe: 4.208 ± 0.701
3.476AspGly: 3.476 ± 1.23
0.915AspHis: 0.915 ± 0.353
6.586AspIle: 6.586 ± 1.559
5.671AspLys: 5.671 ± 1.113
6.037AspLeu: 6.037 ± 1.162
1.281AspMet: 1.281 ± 0.585
6.586AspAsn: 6.586 ± 1.464
0.732AspPro: 0.732 ± 0.387
1.098AspGln: 1.098 ± 0.478
2.195AspArg: 2.195 ± 0.655
3.293AspSer: 3.293 ± 1.04
3.842AspThr: 3.842 ± 0.838
4.94AspVal: 4.94 ± 0.709
0.366AspTrp: 0.366 ± 0.226
5.671AspTyr: 5.671 ± 0.897
0.0AspXaa: 0.0 ± 0.0
Glu
1.829GluAla: 1.829 ± 0.488
0.549GluCys: 0.549 ± 0.316
3.842GluAsp: 3.842 ± 0.945
4.208GluGlu: 4.208 ± 1.64
4.025GluPhe: 4.025 ± 0.958
1.098GluGly: 1.098 ± 0.504
2.012GluHis: 2.012 ± 0.608
3.842GluIle: 3.842 ± 0.714
4.757GluLys: 4.757 ± 0.702
6.037GluLeu: 6.037 ± 1.141
2.561GluMet: 2.561 ± 0.667
4.391GluAsn: 4.391 ± 0.878
1.829GluPro: 1.829 ± 0.4
2.927GluGln: 2.927 ± 0.702
2.195GluArg: 2.195 ± 0.623
5.123GluSer: 5.123 ± 1.288
4.025GluThr: 4.025 ± 1.029
3.11GluVal: 3.11 ± 0.546
0.549GluTrp: 0.549 ± 0.288
4.025GluTyr: 4.025 ± 1.103
0.0GluXaa: 0.0 ± 0.0
Phe
2.012PheAla: 2.012 ± 0.607
0.366PheCys: 0.366 ± 0.285
4.208PheAsp: 4.208 ± 1.261
3.293PheGlu: 3.293 ± 0.759
1.829PhePhe: 1.829 ± 0.471
2.195PheGly: 2.195 ± 0.619
0.732PheHis: 0.732 ± 0.401
3.842PheIle: 3.842 ± 0.682
5.123PheLys: 5.123 ± 1.087
4.391PheLeu: 4.391 ± 0.865
0.915PheMet: 0.915 ± 0.341
4.574PheAsn: 4.574 ± 1.301
1.647PhePro: 1.647 ± 0.507
2.744PheGln: 2.744 ± 0.616
1.098PheArg: 1.098 ± 0.442
3.476PheSer: 3.476 ± 0.94
4.208PheThr: 4.208 ± 0.885
3.476PheVal: 3.476 ± 0.606
0.366PheTrp: 0.366 ± 0.226
3.842PheTyr: 3.842 ± 0.85
0.0PheXaa: 0.0 ± 0.0
Gly
1.464GlyAla: 1.464 ± 0.427
0.0GlyCys: 0.0 ± 0.0
2.927GlyAsp: 2.927 ± 0.689
1.829GlyGlu: 1.829 ± 0.795
3.476GlyPhe: 3.476 ± 0.883
4.025GlyGly: 4.025 ± 1.321
0.549GlyHis: 0.549 ± 0.278
3.293GlyIle: 3.293 ± 0.691
4.208GlyLys: 4.208 ± 1.004
3.293GlyLeu: 3.293 ± 0.624
1.098GlyMet: 1.098 ± 0.502
4.574GlyAsn: 4.574 ± 1.125
0.0GlyPro: 0.0 ± 0.0
2.378GlyGln: 2.378 ± 0.723
1.464GlyArg: 1.464 ± 0.453
2.012GlySer: 2.012 ± 0.765
2.195GlyThr: 2.195 ± 0.642
3.842GlyVal: 3.842 ± 0.888
1.098GlyTrp: 1.098 ± 0.42
3.293GlyTyr: 3.293 ± 0.848
0.0GlyXaa: 0.0 ± 0.0
His
0.183HisAla: 0.183 ± 0.152
0.0HisCys: 0.0 ± 0.0
0.732HisAsp: 0.732 ± 0.351
1.647HisGlu: 1.647 ± 0.511
1.829HisPhe: 1.829 ± 0.538
0.732HisGly: 0.732 ± 0.323
0.183HisHis: 0.183 ± 0.215
1.829HisIle: 1.829 ± 0.578
1.464HisLys: 1.464 ± 0.442
0.915HisLeu: 0.915 ± 0.438
0.549HisMet: 0.549 ± 0.284
2.012HisAsn: 2.012 ± 0.664
0.183HisPro: 0.183 ± 0.152
0.183HisGln: 0.183 ± 0.178
0.549HisArg: 0.549 ± 0.313
0.915HisSer: 0.915 ± 0.396
1.281HisThr: 1.281 ± 0.472
0.549HisVal: 0.549 ± 0.295
0.183HisTrp: 0.183 ± 0.222
1.829HisTyr: 1.829 ± 0.472
0.0HisXaa: 0.0 ± 0.0
Ile
4.025IleAla: 4.025 ± 0.96
0.0IleCys: 0.0 ± 0.0
8.965IleAsp: 8.965 ± 0.938
4.208IleGlu: 4.208 ± 1.004
2.012IlePhe: 2.012 ± 0.612
3.293IleGly: 3.293 ± 1.057
1.647IleHis: 1.647 ± 0.405
4.574IleIle: 4.574 ± 1.35
6.586IleLys: 6.586 ± 1.504
5.488IleLeu: 5.488 ± 1.103
1.647IleMet: 1.647 ± 0.474
8.782IleAsn: 8.782 ± 1.415
2.378IlePro: 2.378 ± 0.588
1.829IleGln: 1.829 ± 0.779
2.378IleArg: 2.378 ± 0.7
2.378IleSer: 2.378 ± 0.525
4.208IleThr: 4.208 ± 0.876
3.476IleVal: 3.476 ± 0.782
0.549IleTrp: 0.549 ± 0.316
5.123IleTyr: 5.123 ± 1.391
0.0IleXaa: 0.0 ± 0.0
Lys
2.378LysAla: 2.378 ± 0.569
0.366LysCys: 0.366 ± 0.209
5.488LysAsp: 5.488 ± 0.904
6.403LysGlu: 6.403 ± 1.459
4.025LysPhe: 4.025 ± 0.741
4.208LysGly: 4.208 ± 1.214
1.464LysHis: 1.464 ± 0.489
7.135LysIle: 7.135 ± 1.096
7.501LysLys: 7.501 ± 0.887
7.318LysLeu: 7.318 ± 1.328
2.561LysMet: 2.561 ± 0.469
5.671LysAsn: 5.671 ± 1.172
2.744LysPro: 2.744 ± 0.689
3.842LysGln: 3.842 ± 0.867
4.208LysArg: 4.208 ± 1.196
7.135LysSer: 7.135 ± 1.132
4.757LysThr: 4.757 ± 0.83
3.11LysVal: 3.11 ± 0.707
0.915LysTrp: 0.915 ± 0.333
4.574LysTyr: 4.574 ± 0.876
0.0LysXaa: 0.0 ± 0.0
Leu
4.208LeuAla: 4.208 ± 0.808
0.183LeuCys: 0.183 ± 0.21
4.757LeuAsp: 4.757 ± 0.693
4.208LeuGlu: 4.208 ± 0.812
4.391LeuPhe: 4.391 ± 0.625
3.293LeuGly: 3.293 ± 0.726
0.915LeuHis: 0.915 ± 0.446
5.488LeuIle: 5.488 ± 1.011
6.769LeuLys: 6.769 ± 1.028
6.586LeuLeu: 6.586 ± 1.118
1.829LeuMet: 1.829 ± 0.716
7.318LeuAsn: 7.318 ± 1.138
1.829LeuPro: 1.829 ± 0.543
4.391LeuGln: 4.391 ± 0.805
3.11LeuArg: 3.11 ± 0.643
6.403LeuSer: 6.403 ± 1.533
4.94LeuThr: 4.94 ± 1.115
3.476LeuVal: 3.476 ± 0.493
0.183LeuTrp: 0.183 ± 0.198
4.94LeuTyr: 4.94 ± 1.091
0.0LeuXaa: 0.0 ± 0.0
Met
0.915MetAla: 0.915 ± 0.504
0.183MetCys: 0.183 ± 0.152
1.464MetAsp: 1.464 ± 0.501
1.464MetGlu: 1.464 ± 0.729
1.464MetPhe: 1.464 ± 0.631
0.732MetGly: 0.732 ± 0.343
0.183MetHis: 0.183 ± 0.197
1.829MetIle: 1.829 ± 0.71
3.476MetLys: 3.476 ± 0.617
2.561MetLeu: 2.561 ± 1.131
0.732MetMet: 0.732 ± 0.381
1.464MetAsn: 1.464 ± 0.557
0.0MetPro: 0.0 ± 0.0
2.012MetGln: 2.012 ± 0.588
1.281MetArg: 1.281 ± 0.341
1.281MetSer: 1.281 ± 0.469
2.561MetThr: 2.561 ± 0.709
0.732MetVal: 0.732 ± 0.486
0.183MetTrp: 0.183 ± 0.183
1.281MetTyr: 1.281 ± 0.378
0.0MetXaa: 0.0 ± 0.0
Asn
4.757AsnAla: 4.757 ± 1.313
0.732AsnCys: 0.732 ± 0.328
7.318AsnAsp: 7.318 ± 1.033
6.769AsnGlu: 6.769 ± 1.176
4.574AsnPhe: 4.574 ± 1.004
5.123AsnGly: 5.123 ± 0.854
1.829AsnHis: 1.829 ± 0.553
5.488AsnIle: 5.488 ± 0.885
6.952AsnLys: 6.952 ± 1.504
4.574AsnLeu: 4.574 ± 1.005
1.829AsnMet: 1.829 ± 0.693
6.403AsnAsn: 6.403 ± 1.318
2.378AsnPro: 2.378 ± 0.647
3.659AsnGln: 3.659 ± 0.761
2.195AsnArg: 2.195 ± 1.017
5.671AsnSer: 5.671 ± 1.042
6.403AsnThr: 6.403 ± 0.912
4.757AsnVal: 4.757 ± 0.645
0.915AsnTrp: 0.915 ± 0.433
4.208AsnTyr: 4.208 ± 0.827
0.0AsnXaa: 0.0 ± 0.0
Pro
0.732ProAla: 0.732 ± 0.27
0.183ProCys: 0.183 ± 0.152
1.464ProAsp: 1.464 ± 0.491
1.098ProGlu: 1.098 ± 0.43
1.464ProPhe: 1.464 ± 0.432
0.366ProGly: 0.366 ± 0.2
0.183ProHis: 0.183 ± 0.145
2.195ProIle: 2.195 ± 0.573
2.744ProLys: 2.744 ± 0.638
2.195ProLeu: 2.195 ± 0.459
0.915ProMet: 0.915 ± 0.568
1.829ProAsn: 1.829 ± 0.484
0.732ProPro: 0.732 ± 0.356
1.098ProGln: 1.098 ± 0.609
0.732ProArg: 0.732 ± 0.364
1.829ProSer: 1.829 ± 0.52
1.829ProThr: 1.829 ± 0.893
1.281ProVal: 1.281 ± 0.463
0.366ProTrp: 0.366 ± 0.246
2.195ProTyr: 2.195 ± 0.47
0.0ProXaa: 0.0 ± 0.0
Gln
2.195GlnAla: 2.195 ± 0.577
0.732GlnCys: 0.732 ± 0.448
2.378GlnAsp: 2.378 ± 0.927
1.647GlnGlu: 1.647 ± 0.549
1.647GlnPhe: 1.647 ± 0.638
1.829GlnGly: 1.829 ± 0.554
0.183GlnHis: 0.183 ± 0.177
3.476GlnIle: 3.476 ± 0.846
2.195GlnLys: 2.195 ± 0.539
4.025GlnLeu: 4.025 ± 0.615
1.464GlnMet: 1.464 ± 0.574
4.391GlnAsn: 4.391 ± 0.884
1.647GlnPro: 1.647 ± 0.611
1.829GlnGln: 1.829 ± 0.806
0.915GlnArg: 0.915 ± 0.531
2.195GlnSer: 2.195 ± 0.744
1.464GlnThr: 1.464 ± 0.494
1.829GlnVal: 1.829 ± 0.623
0.915GlnTrp: 0.915 ± 0.494
2.378GlnTyr: 2.378 ± 0.723
0.0GlnXaa: 0.0 ± 0.0
Arg
1.647ArgAla: 1.647 ± 0.715
0.183ArgCys: 0.183 ± 0.196
2.744ArgAsp: 2.744 ± 0.71
3.293ArgGlu: 3.293 ± 0.891
2.744ArgPhe: 2.744 ± 0.577
1.281ArgGly: 1.281 ± 0.433
1.098ArgHis: 1.098 ± 0.493
1.281ArgIle: 1.281 ± 0.607
2.195ArgLys: 2.195 ± 0.451
1.098ArgLeu: 1.098 ± 0.571
1.281ArgMet: 1.281 ± 0.362
3.11ArgAsn: 3.11 ± 0.8
1.098ArgPro: 1.098 ± 0.466
2.195ArgGln: 2.195 ± 0.471
1.098ArgArg: 1.098 ± 0.29
1.829ArgSer: 1.829 ± 0.536
0.915ArgThr: 0.915 ± 0.453
2.195ArgVal: 2.195 ± 0.588
0.0ArgTrp: 0.0 ± 0.0
2.012ArgTyr: 2.012 ± 0.501
0.0ArgXaa: 0.0 ± 0.0
Ser
3.659SerAla: 3.659 ± 0.722
0.0SerCys: 0.0 ± 0.0
4.574SerAsp: 4.574 ± 0.756
4.025SerGlu: 4.025 ± 1.017
3.659SerPhe: 3.659 ± 0.662
3.842SerGly: 3.842 ± 0.965
0.549SerHis: 0.549 ± 0.254
3.842SerIle: 3.842 ± 0.665
7.318SerLys: 7.318 ± 1.169
4.757SerLeu: 4.757 ± 0.766
1.281SerMet: 1.281 ± 0.483
5.671SerAsn: 5.671 ± 1.556
2.195SerPro: 2.195 ± 0.473
2.561SerGln: 2.561 ± 0.729
1.829SerArg: 1.829 ± 0.444
4.391SerSer: 4.391 ± 1.127
3.293SerThr: 3.293 ± 1.48
2.927SerVal: 2.927 ± 0.616
0.366SerTrp: 0.366 ± 0.247
2.378SerTyr: 2.378 ± 0.709
0.0SerXaa: 0.0 ± 0.0
Thr
1.098ThrAla: 1.098 ± 0.401
0.366ThrCys: 0.366 ± 0.365
4.574ThrAsp: 4.574 ± 0.928
4.574ThrGlu: 4.574 ± 1.554
4.391ThrPhe: 4.391 ± 0.774
2.927ThrGly: 2.927 ± 0.575
2.012ThrHis: 2.012 ± 0.532
5.854ThrIle: 5.854 ± 1.154
4.757ThrLys: 4.757 ± 1.167
4.94ThrLeu: 4.94 ± 0.855
1.464ThrMet: 1.464 ± 0.503
4.208ThrAsn: 4.208 ± 0.903
1.829ThrPro: 1.829 ± 0.8
1.829ThrGln: 1.829 ± 0.739
1.464ThrArg: 1.464 ± 0.421
5.123ThrSer: 5.123 ± 0.762
3.293ThrThr: 3.293 ± 1.066
2.561ThrVal: 2.561 ± 0.527
0.549ThrTrp: 0.549 ± 0.34
3.293ThrTyr: 3.293 ± 0.676
0.0ThrXaa: 0.0 ± 0.0
Val
2.195ValAla: 2.195 ± 0.733
0.732ValCys: 0.732 ± 0.283
3.293ValAsp: 3.293 ± 0.846
2.927ValGlu: 2.927 ± 0.747
2.744ValPhe: 2.744 ± 0.716
1.281ValGly: 1.281 ± 0.653
0.732ValHis: 0.732 ± 0.373
3.842ValIle: 3.842 ± 0.806
4.757ValLys: 4.757 ± 0.804
3.293ValLeu: 3.293 ± 0.707
1.464ValMet: 1.464 ± 0.427
5.488ValAsn: 5.488 ± 0.961
2.012ValPro: 2.012 ± 0.637
2.012ValGln: 2.012 ± 0.693
2.927ValArg: 2.927 ± 0.577
4.025ValSer: 4.025 ± 0.88
3.293ValThr: 3.293 ± 0.591
3.842ValVal: 3.842 ± 0.997
0.549ValTrp: 0.549 ± 0.351
2.561ValTyr: 2.561 ± 0.655
0.0ValXaa: 0.0 ± 0.0
Trp
0.366TrpAla: 0.366 ± 0.29
0.0TrpCys: 0.0 ± 0.0
1.098TrpAsp: 1.098 ± 0.372
0.366TrpGlu: 0.366 ± 0.242
0.549TrpPhe: 0.549 ± 0.249
0.549TrpGly: 0.549 ± 0.4
0.366TrpHis: 0.366 ± 0.355
1.098TrpIle: 1.098 ± 0.478
0.549TrpLys: 0.549 ± 0.444
1.647TrpLeu: 1.647 ± 0.655
0.183TrpMet: 0.183 ± 0.196
0.732TrpAsn: 0.732 ± 0.482
0.0TrpPro: 0.0 ± 0.0
0.366TrpGln: 0.366 ± 0.278
0.0TrpArg: 0.0 ± 0.0
0.549TrpSer: 0.549 ± 0.279
0.366TrpThr: 0.366 ± 0.247
0.183TrpVal: 0.183 ± 0.155
0.0TrpTrp: 0.0 ± 0.0
0.366TrpTyr: 0.366 ± 0.29
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.647TyrAla: 1.647 ± 0.555
0.183TyrCys: 0.183 ± 0.152
4.757TyrAsp: 4.757 ± 0.732
3.293TyrGlu: 3.293 ± 0.903
3.11TyrPhe: 3.11 ± 0.828
3.659TyrGly: 3.659 ± 0.93
1.647TyrHis: 1.647 ± 0.595
4.574TyrIle: 4.574 ± 1.125
4.025TyrLys: 4.025 ± 0.551
5.488TyrLeu: 5.488 ± 1.22
1.281TyrMet: 1.281 ± 0.676
6.952TyrAsn: 6.952 ± 1.217
1.464TyrPro: 1.464 ± 0.35
1.464TyrGln: 1.464 ± 0.472
1.647TyrArg: 1.647 ± 0.684
3.842TyrSer: 3.842 ± 0.852
4.208TyrThr: 4.208 ± 0.696
4.025TyrVal: 4.025 ± 0.581
0.732TyrTrp: 0.732 ± 0.357
4.208TyrTyr: 4.208 ± 1.058
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 20 proteins (5467 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
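A protein-level bootstrap of this kind might look like the sketch below. The page states only that the error comes from 100 bootstrap rounds at the protein level; resampling whole proteins with replacement, normalising by the total dipeptide count, and reporting the standard deviation across rounds are assumptions made here for illustration, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(proteins):
    """Count overlapping dipeptides within each protein sequence."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread of one dipeptide frequency (in per mille).

    Assumption: each round draws len(proteins) whole proteins with
    replacement, recomputes the frequency, and the error is the standard
    deviation of the frequency across all rounds.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        counts = dipeptide_counts(sample)
        total = sum(counts.values())
        estimates.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    return statistics.pstdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. A dipeptide that never occurs gets an error of 0.0, consistent with the `0.0 ± 0.0` entries in the table.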

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski