Amino acid dipepetide frequency for Bacillus phage BeachBum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.157AlaAla: 0.157 ± 0.193
0.313AlaCys: 0.313 ± 0.188
2.192AlaAsp: 2.192 ± 0.503
4.385AlaGlu: 4.385 ± 1.036
2.975AlaPhe: 2.975 ± 0.663
3.915AlaGly: 3.915 ± 1.025
0.783AlaHis: 0.783 ± 0.323
5.168AlaIle: 5.168 ± 0.625
5.951AlaLys: 5.951 ± 1.081
3.758AlaLeu: 3.758 ± 0.72
1.723AlaMet: 1.723 ± 0.695
2.505AlaAsn: 2.505 ± 0.705
2.036AlaPro: 2.036 ± 0.625
1.096AlaGln: 1.096 ± 0.34
2.192AlaArg: 2.192 ± 0.582
2.819AlaSer: 2.819 ± 0.634
3.758AlaThr: 3.758 ± 0.647
2.662AlaVal: 2.662 ± 0.738
0.313AlaTrp: 0.313 ± 0.219
2.505AlaTyr: 2.505 ± 0.741
0.0AlaXaa: 0.0 ± 0.0
Cys
0.626CysAla: 0.626 ± 0.345
0.0CysCys: 0.0 ± 0.0
1.096CysAsp: 1.096 ± 0.62
0.94CysGlu: 0.94 ± 0.318
0.313CysPhe: 0.313 ± 0.209
0.783CysGly: 0.783 ± 0.354
0.313CysHis: 0.313 ± 0.192
0.47CysIle: 0.47 ± 0.284
0.0CysLys: 0.0 ± 0.0
0.313CysLeu: 0.313 ± 0.243
0.313CysMet: 0.313 ± 0.232
0.626CysAsn: 0.626 ± 0.44
0.157CysPro: 0.157 ± 0.147
0.157CysGln: 0.157 ± 0.151
0.313CysArg: 0.313 ± 0.202
0.626CysSer: 0.626 ± 0.334
0.157CysThr: 0.157 ± 0.147
0.626CysVal: 0.626 ± 0.348
0.0CysTrp: 0.0 ± 0.0
0.47CysTyr: 0.47 ± 0.217
0.0CysXaa: 0.0 ± 0.0
Asp
3.758AspAla: 3.758 ± 0.707
0.94AspCys: 0.94 ± 0.396
2.975AspAsp: 2.975 ± 0.761
5.011AspGlu: 5.011 ± 0.813
2.975AspPhe: 2.975 ± 0.499
5.324AspGly: 5.324 ± 0.865
0.626AspHis: 0.626 ± 0.285
3.602AspIle: 3.602 ± 0.829
5.168AspLys: 5.168 ± 0.864
4.541AspLeu: 4.541 ± 0.78
1.879AspMet: 1.879 ± 0.56
3.602AspAsn: 3.602 ± 0.674
1.723AspPro: 1.723 ± 0.588
1.096AspGln: 1.096 ± 0.56
2.505AspArg: 2.505 ± 0.507
3.602AspSer: 3.602 ± 0.476
4.071AspThr: 4.071 ± 1.253
4.541AspVal: 4.541 ± 1.009
0.313AspTrp: 0.313 ± 0.186
3.288AspTyr: 3.288 ± 0.829
0.0AspXaa: 0.0 ± 0.0
Glu
3.288GluAla: 3.288 ± 0.868
0.47GluCys: 0.47 ± 0.256
5.951GluAsp: 5.951 ± 0.77
7.673GluGlu: 7.673 ± 1.729
3.758GluPhe: 3.758 ± 0.842
5.794GluGly: 5.794 ± 0.86
1.723GluHis: 1.723 ± 0.634
7.516GluIle: 7.516 ± 1.236
4.385GluLys: 4.385 ± 0.732
6.42GluLeu: 6.42 ± 1.057
2.349GluMet: 2.349 ± 0.698
4.541GluAsn: 4.541 ± 0.905
1.879GluPro: 1.879 ± 0.383
3.445GluGln: 3.445 ± 0.708
2.975GluArg: 2.975 ± 0.713
4.385GluSer: 4.385 ± 0.823
6.264GluThr: 6.264 ± 1.014
4.228GluVal: 4.228 ± 0.818
1.723GluTrp: 1.723 ± 0.532
3.445GluTyr: 3.445 ± 0.867
0.0GluXaa: 0.0 ± 0.0
Phe
2.505PheAla: 2.505 ± 0.784
0.157PheCys: 0.157 ± 0.16
3.758PheAsp: 3.758 ± 0.672
3.915PheGlu: 3.915 ± 0.755
1.409PhePhe: 1.409 ± 0.619
2.819PheGly: 2.819 ± 0.614
0.94PheHis: 0.94 ± 0.224
2.662PheIle: 2.662 ± 0.776
5.951PheLys: 5.951 ± 1.208
2.036PheLeu: 2.036 ± 0.608
0.94PheMet: 0.94 ± 0.31
2.819PheAsn: 2.819 ± 0.53
1.409PhePro: 1.409 ± 0.453
0.94PheGln: 0.94 ± 0.328
1.566PheArg: 1.566 ± 0.449
2.192PheSer: 2.192 ± 0.537
3.915PheThr: 3.915 ± 1.034
2.036PheVal: 2.036 ± 0.518
0.313PheTrp: 0.313 ± 0.186
2.349PheTyr: 2.349 ± 0.691
0.0PheXaa: 0.0 ± 0.0
Gly
3.132GlyAla: 3.132 ± 0.672
0.626GlyCys: 0.626 ± 0.279
3.288GlyAsp: 3.288 ± 0.742
5.951GlyGlu: 5.951 ± 0.767
2.819GlyPhe: 2.819 ± 0.631
4.071GlyGly: 4.071 ± 0.993
0.626GlyHis: 0.626 ± 0.326
4.541GlyIle: 4.541 ± 1.084
6.89GlyLys: 6.89 ± 0.976
3.602GlyLeu: 3.602 ± 0.493
0.783GlyMet: 0.783 ± 0.28
5.794GlyAsn: 5.794 ± 1.384
0.783GlyPro: 0.783 ± 0.353
1.253GlyGln: 1.253 ± 0.463
1.723GlyArg: 1.723 ± 0.404
4.228GlySer: 4.228 ± 1.069
4.541GlyThr: 4.541 ± 0.991
5.168GlyVal: 5.168 ± 0.947
0.47GlyTrp: 0.47 ± 0.277
3.758GlyTyr: 3.758 ± 1.025
0.0GlyXaa: 0.0 ± 0.0
His
1.096HisAla: 1.096 ± 0.44
0.313HisCys: 0.313 ± 0.252
1.096HisAsp: 1.096 ± 0.449
1.566HisGlu: 1.566 ± 0.478
0.94HisPhe: 0.94 ± 0.329
0.47HisGly: 0.47 ± 0.28
0.626HisHis: 0.626 ± 0.28
1.253HisIle: 1.253 ± 0.389
0.94HisLys: 0.94 ± 0.347
1.253HisLeu: 1.253 ± 0.424
1.096HisMet: 1.096 ± 0.485
1.253HisAsn: 1.253 ± 0.363
0.313HisPro: 0.313 ± 0.196
0.47HisGln: 0.47 ± 0.243
1.096HisArg: 1.096 ± 0.397
1.096HisSer: 1.096 ± 0.444
1.409HisThr: 1.409 ± 0.441
0.94HisVal: 0.94 ± 0.356
0.0HisTrp: 0.0 ± 0.0
1.566HisTyr: 1.566 ± 0.58
0.0HisXaa: 0.0 ± 0.0
Ile
3.445IleAla: 3.445 ± 0.571
0.313IleCys: 0.313 ± 0.242
5.481IleAsp: 5.481 ± 0.927
7.203IleGlu: 7.203 ± 1.398
2.349IlePhe: 2.349 ± 0.595
3.445IleGly: 3.445 ± 0.561
2.036IleHis: 2.036 ± 0.644
3.758IleIle: 3.758 ± 0.939
5.481IleLys: 5.481 ± 0.881
3.758IleLeu: 3.758 ± 0.896
2.819IleMet: 2.819 ± 0.584
5.324IleAsn: 5.324 ± 0.991
1.566IlePro: 1.566 ± 0.418
3.132IleGln: 3.132 ± 0.712
3.915IleArg: 3.915 ± 1.171
3.445IleSer: 3.445 ± 0.618
5.168IleThr: 5.168 ± 0.739
3.445IleVal: 3.445 ± 0.685
0.783IleTrp: 0.783 ± 0.465
2.192IleTyr: 2.192 ± 0.636
0.0IleXaa: 0.0 ± 0.0
Lys
4.541LysAla: 4.541 ± 1.02
0.94LysCys: 0.94 ± 0.376
4.854LysAsp: 4.854 ± 1.043
8.926LysGlu: 8.926 ± 1.415
2.975LysPhe: 2.975 ± 1.036
5.794LysGly: 5.794 ± 0.943
1.566LysHis: 1.566 ± 0.54
3.915LysIle: 3.915 ± 0.871
7.673LysLys: 7.673 ± 1.194
9.082LysLeu: 9.082 ± 0.995
3.288LysMet: 3.288 ± 0.937
4.071LysAsn: 4.071 ± 0.758
2.036LysPro: 2.036 ± 0.56
4.071LysGln: 4.071 ± 1.119
5.794LysArg: 5.794 ± 0.992
3.445LysSer: 3.445 ± 0.518
4.541LysThr: 4.541 ± 0.772
4.228LysVal: 4.228 ± 0.682
0.47LysTrp: 0.47 ± 0.301
3.758LysTyr: 3.758 ± 0.822
0.0LysXaa: 0.0 ± 0.0
Leu
3.445LeuAla: 3.445 ± 0.55
0.626LeuCys: 0.626 ± 0.302
3.915LeuAsp: 3.915 ± 0.465
5.637LeuGlu: 5.637 ± 1.02
2.662LeuPhe: 2.662 ± 0.655
3.132LeuGly: 3.132 ± 0.724
1.566LeuHis: 1.566 ± 0.701
3.445LeuIle: 3.445 ± 0.854
5.324LeuLys: 5.324 ± 0.918
4.541LeuLeu: 4.541 ± 0.785
2.349LeuMet: 2.349 ± 0.594
5.637LeuAsn: 5.637 ± 0.671
3.132LeuPro: 3.132 ± 0.765
2.505LeuGln: 2.505 ± 0.983
2.819LeuArg: 2.819 ± 0.646
3.915LeuSer: 3.915 ± 0.662
4.385LeuThr: 4.385 ± 0.856
3.602LeuVal: 3.602 ± 0.88
0.313LeuTrp: 0.313 ± 0.186
3.915LeuTyr: 3.915 ± 0.929
0.0LeuXaa: 0.0 ± 0.0
Met
1.409MetAla: 1.409 ± 0.441
0.47MetCys: 0.47 ± 0.305
1.253MetAsp: 1.253 ± 0.444
2.349MetGlu: 2.349 ± 0.685
1.096MetPhe: 1.096 ± 0.376
2.192MetGly: 2.192 ± 0.728
0.626MetHis: 0.626 ± 0.414
0.94MetIle: 0.94 ± 0.389
3.132MetLys: 3.132 ± 0.827
1.566MetLeu: 1.566 ± 0.485
1.409MetMet: 1.409 ± 0.482
2.819MetAsn: 2.819 ± 0.747
0.626MetPro: 0.626 ± 0.334
1.409MetGln: 1.409 ± 0.478
1.096MetArg: 1.096 ± 0.384
1.409MetSer: 1.409 ± 0.446
2.036MetThr: 2.036 ± 0.523
3.445MetVal: 3.445 ± 0.667
0.0MetTrp: 0.0 ± 0.0
2.349MetTyr: 2.349 ± 0.405
0.0MetXaa: 0.0 ± 0.0
Asn
3.915AsnAla: 3.915 ± 0.79
0.47AsnCys: 0.47 ± 0.251
3.758AsnAsp: 3.758 ± 0.72
6.264AsnGlu: 6.264 ± 0.911
2.505AsnPhe: 2.505 ± 0.63
5.324AsnGly: 5.324 ± 1.101
1.409AsnHis: 1.409 ± 0.645
5.168AsnIle: 5.168 ± 1.028
5.481AsnLys: 5.481 ± 1.126
4.071AsnLeu: 4.071 ± 0.879
1.409AsnMet: 1.409 ± 0.594
4.385AsnAsn: 4.385 ± 0.799
1.409AsnPro: 1.409 ± 0.39
2.975AsnGln: 2.975 ± 0.956
2.192AsnArg: 2.192 ± 0.798
4.071AsnSer: 4.071 ± 0.743
3.915AsnThr: 3.915 ± 0.852
4.071AsnVal: 4.071 ± 0.781
1.253AsnTrp: 1.253 ± 0.308
3.445AsnTyr: 3.445 ± 0.715
0.0AsnXaa: 0.0 ± 0.0
Pro
1.253ProAla: 1.253 ± 0.399
0.313ProCys: 0.313 ± 0.203
1.566ProAsp: 1.566 ± 0.566
2.192ProGlu: 2.192 ± 0.627
1.879ProPhe: 1.879 ± 0.601
0.313ProGly: 0.313 ± 0.215
0.157ProHis: 0.157 ± 0.156
1.409ProIle: 1.409 ± 0.442
1.879ProLys: 1.879 ± 0.434
1.879ProLeu: 1.879 ± 0.538
1.253ProMet: 1.253 ± 0.544
2.036ProAsn: 2.036 ± 0.499
1.096ProPro: 1.096 ± 0.482
1.253ProGln: 1.253 ± 0.487
1.096ProArg: 1.096 ± 0.306
2.349ProSer: 2.349 ± 0.51
2.349ProThr: 2.349 ± 0.624
2.662ProVal: 2.662 ± 0.621
0.313ProTrp: 0.313 ± 0.219
1.566ProTyr: 1.566 ± 0.412
0.0ProXaa: 0.0 ± 0.0
Gln
2.662GlnAla: 2.662 ± 0.801
0.157GlnCys: 0.157 ± 0.193
1.879GlnAsp: 1.879 ± 0.707
1.723GlnGlu: 1.723 ± 0.541
1.879GlnPhe: 1.879 ± 0.712
2.662GlnGly: 2.662 ± 0.566
0.94GlnHis: 0.94 ± 0.35
3.288GlnIle: 3.288 ± 0.671
3.132GlnLys: 3.132 ± 0.662
2.505GlnLeu: 2.505 ± 0.592
1.096GlnMet: 1.096 ± 0.836
2.349GlnAsn: 2.349 ± 0.711
1.096GlnPro: 1.096 ± 0.436
1.409GlnGln: 1.409 ± 0.516
1.409GlnArg: 1.409 ± 0.482
1.879GlnSer: 1.879 ± 0.703
2.662GlnThr: 2.662 ± 0.814
2.192GlnVal: 2.192 ± 0.639
0.47GlnTrp: 0.47 ± 0.22
1.566GlnTyr: 1.566 ± 0.391
0.0GlnXaa: 0.0 ± 0.0
Arg
1.879ArgAla: 1.879 ± 0.49
0.626ArgCys: 0.626 ± 0.276
2.505ArgAsp: 2.505 ± 0.711
2.349ArgGlu: 2.349 ± 0.56
3.288ArgPhe: 3.288 ± 0.828
2.349ArgGly: 2.349 ± 0.613
0.626ArgHis: 0.626 ± 0.263
2.662ArgIle: 2.662 ± 0.57
4.698ArgLys: 4.698 ± 0.878
3.132ArgLeu: 3.132 ± 0.902
1.879ArgMet: 1.879 ± 0.55
3.288ArgAsn: 3.288 ± 0.805
0.94ArgPro: 0.94 ± 0.364
1.723ArgGln: 1.723 ± 0.427
1.723ArgArg: 1.723 ± 0.623
1.723ArgSer: 1.723 ± 0.528
2.505ArgThr: 2.505 ± 0.54
2.192ArgVal: 2.192 ± 0.475
0.626ArgTrp: 0.626 ± 0.327
1.409ArgTyr: 1.409 ± 0.567
0.0ArgXaa: 0.0 ± 0.0
Ser
2.975SerAla: 2.975 ± 0.934
0.626SerCys: 0.626 ± 0.24
3.758SerAsp: 3.758 ± 0.741
4.228SerGlu: 4.228 ± 0.823
1.723SerPhe: 1.723 ± 0.438
2.192SerGly: 2.192 ± 0.445
0.94SerHis: 0.94 ± 0.484
3.445SerIle: 3.445 ± 0.755
5.637SerLys: 5.637 ± 0.882
4.698SerLeu: 4.698 ± 0.649
1.566SerMet: 1.566 ± 0.481
2.819SerAsn: 2.819 ± 0.612
1.723SerPro: 1.723 ± 0.433
2.505SerGln: 2.505 ± 0.639
1.253SerArg: 1.253 ± 0.432
3.445SerSer: 3.445 ± 0.831
2.975SerThr: 2.975 ± 0.781
4.385SerVal: 4.385 ± 0.745
0.47SerTrp: 0.47 ± 0.24
2.192SerTyr: 2.192 ± 0.655
0.0SerXaa: 0.0 ± 0.0
Thr
3.445ThrAla: 3.445 ± 0.677
0.47ThrCys: 0.47 ± 0.227
4.228ThrAsp: 4.228 ± 0.901
3.758ThrGlu: 3.758 ± 0.765
3.758ThrPhe: 3.758 ± 0.698
5.481ThrGly: 5.481 ± 1.314
1.409ThrHis: 1.409 ± 0.455
5.794ThrIle: 5.794 ± 0.976
6.264ThrLys: 6.264 ± 1.138
3.445ThrLeu: 3.445 ± 0.733
1.409ThrMet: 1.409 ± 0.373
4.071ThrAsn: 4.071 ± 0.961
2.819ThrPro: 2.819 ± 0.389
2.505ThrGln: 2.505 ± 0.66
3.132ThrArg: 3.132 ± 0.681
4.385ThrSer: 4.385 ± 0.928
5.168ThrThr: 5.168 ± 1.047
3.288ThrVal: 3.288 ± 0.985
1.096ThrTrp: 1.096 ± 0.356
1.723ThrTyr: 1.723 ± 0.482
0.0ThrXaa: 0.0 ± 0.0
Val
3.915ValAla: 3.915 ± 1.324
0.0ValCys: 0.0 ± 0.0
4.071ValAsp: 4.071 ± 0.719
4.385ValGlu: 4.385 ± 0.763
2.505ValPhe: 2.505 ± 0.61
2.819ValGly: 2.819 ± 0.635
0.94ValHis: 0.94 ± 0.349
5.794ValIle: 5.794 ± 1.02
4.228ValLys: 4.228 ± 0.73
2.349ValLeu: 2.349 ± 0.713
2.036ValMet: 2.036 ± 0.469
5.951ValAsn: 5.951 ± 0.799
2.662ValPro: 2.662 ± 0.468
1.879ValGln: 1.879 ± 0.654
2.505ValArg: 2.505 ± 0.642
2.349ValSer: 2.349 ± 0.514
5.168ValThr: 5.168 ± 0.962
3.758ValVal: 3.758 ± 0.905
1.096ValTrp: 1.096 ± 0.449
2.505ValTyr: 2.505 ± 0.442
0.0ValXaa: 0.0 ± 0.0
Trp
0.94TrpAla: 0.94 ± 0.571
0.0TrpCys: 0.0 ± 0.0
0.626TrpAsp: 0.626 ± 0.349
0.783TrpGlu: 0.783 ± 0.409
0.783TrpPhe: 0.783 ± 0.342
0.47TrpGly: 0.47 ± 0.261
0.313TrpHis: 0.313 ± 0.187
0.47TrpIle: 0.47 ± 0.244
1.096TrpLys: 1.096 ± 0.467
1.096TrpLeu: 1.096 ± 0.396
0.157TrpMet: 0.157 ± 0.153
0.783TrpAsn: 0.783 ± 0.386
0.0TrpPro: 0.0 ± 0.0
0.626TrpGln: 0.626 ± 0.292
0.313TrpArg: 0.313 ± 0.19
0.313TrpSer: 0.313 ± 0.213
0.783TrpThr: 0.783 ± 0.328
0.783TrpVal: 0.783 ± 0.351
0.157TrpTrp: 0.157 ± 0.165
0.94TrpTyr: 0.94 ± 0.37
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.819TyrAla: 2.819 ± 0.733
0.47TyrCys: 0.47 ± 0.31
3.445TyrAsp: 3.445 ± 0.835
2.975TyrGlu: 2.975 ± 0.615
2.036TyrPhe: 2.036 ± 0.502
4.698TyrGly: 4.698 ± 1.012
0.626TyrHis: 0.626 ± 0.32
3.758TyrIle: 3.758 ± 0.651
2.662TyrLys: 2.662 ± 0.545
2.819TyrLeu: 2.819 ± 0.662
1.723TyrMet: 1.723 ± 0.537
2.819TyrAsn: 2.819 ± 0.631
1.409TyrPro: 1.409 ± 0.405
2.505TyrGln: 2.505 ± 0.61
2.505TyrArg: 2.505 ± 0.586
1.879TyrSer: 1.879 ± 0.523
1.879TyrThr: 1.879 ± 0.48
2.662TyrVal: 2.662 ± 0.55
1.253TyrTrp: 1.253 ± 0.359
1.879TyrTyr: 1.879 ± 0.58
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 30 proteins (6387 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski