Amino acid dipepetide frequency for Bacillus phage Bam35c (Bacillus thuringiensis bacteriophage Bam35c)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.405AlaAla: 0.405 ± 0.289
0.811AlaCys: 0.811 ± 0.424
3.041AlaAsp: 3.041 ± 0.924
5.879AlaGlu: 5.879 ± 1.607
2.23AlaPhe: 2.23 ± 0.729
5.879AlaGly: 5.879 ± 1.172
1.622AlaHis: 1.622 ± 0.572
2.433AlaIle: 2.433 ± 0.791
6.69AlaLys: 6.69 ± 1.564
5.068AlaLeu: 5.068 ± 1.133
1.824AlaMet: 1.824 ± 0.515
2.23AlaAsn: 2.23 ± 0.715
2.027AlaPro: 2.027 ± 0.616
2.23AlaGln: 2.23 ± 0.747
3.041AlaArg: 3.041 ± 0.592
4.054AlaSer: 4.054 ± 0.791
4.865AlaThr: 4.865 ± 1.325
2.635AlaVal: 2.635 ± 0.779
0.203AlaTrp: 0.203 ± 0.22
2.635AlaTyr: 2.635 ± 0.647
0.0AlaXaa: 0.0 ± 0.0
Cys
0.405CysAla: 0.405 ± 0.231
0.0CysCys: 0.0 ± 0.0
0.811CysAsp: 0.811 ± 0.501
0.608CysGlu: 0.608 ± 0.317
0.811CysPhe: 0.811 ± 0.342
0.0CysGly: 0.0 ± 0.0
0.203CysHis: 0.203 ± 0.162
0.811CysIle: 0.811 ± 0.407
0.608CysLys: 0.608 ± 0.316
0.203CysLeu: 0.203 ± 0.252
0.203CysMet: 0.203 ± 0.162
0.405CysAsn: 0.405 ± 0.309
0.608CysPro: 0.608 ± 0.497
0.0CysGln: 0.0 ± 0.0
0.811CysArg: 0.811 ± 0.526
0.0CysSer: 0.0 ± 0.0
0.203CysThr: 0.203 ± 0.252
0.203CysVal: 0.203 ± 0.195
0.203CysTrp: 0.203 ± 0.25
0.203CysTyr: 0.203 ± 0.197
0.0CysXaa: 0.0 ± 0.0
Asp
3.243AspAla: 3.243 ± 1.019
0.608AspCys: 0.608 ± 0.373
2.23AspAsp: 2.23 ± 0.484
4.257AspGlu: 4.257 ± 1.339
4.257AspPhe: 4.257 ± 1.059
3.041AspGly: 3.041 ± 0.67
0.811AspHis: 0.811 ± 0.45
1.824AspIle: 1.824 ± 0.463
5.473AspLys: 5.473 ± 1.319
4.054AspLeu: 4.054 ± 0.831
3.446AspMet: 3.446 ± 0.791
1.824AspAsn: 1.824 ± 0.527
2.635AspPro: 2.635 ± 0.87
1.014AspGln: 1.014 ± 0.505
2.027AspArg: 2.027 ± 0.635
3.243AspSer: 3.243 ± 0.783
2.635AspThr: 2.635 ± 0.545
3.041AspVal: 3.041 ± 0.56
0.405AspTrp: 0.405 ± 0.251
2.838AspTyr: 2.838 ± 0.871
0.0AspXaa: 0.0 ± 0.0
Glu
3.852GluAla: 3.852 ± 1.041
0.405GluCys: 0.405 ± 0.275
3.649GluAsp: 3.649 ± 1.125
10.136GluGlu: 10.136 ± 5.79
4.054GluPhe: 4.054 ± 0.689
6.081GluGly: 6.081 ± 1.137
1.216GluHis: 1.216 ± 0.842
3.649GluIle: 3.649 ± 0.887
5.473GluLys: 5.473 ± 1.161
6.69GluLeu: 6.69 ± 0.998
2.027GluMet: 2.027 ± 0.671
3.446GluAsn: 3.446 ± 0.807
1.824GluPro: 1.824 ± 0.593
3.649GluGln: 3.649 ± 0.837
5.473GluArg: 5.473 ± 1.182
2.027GluSer: 2.027 ± 0.717
5.676GluThr: 5.676 ± 1.32
5.473GluVal: 5.473 ± 1.059
1.216GluTrp: 1.216 ± 0.57
2.838GluTyr: 2.838 ± 1.239
0.0GluXaa: 0.0 ± 0.0
Phe
2.838PheAla: 2.838 ± 0.737
0.608PheCys: 0.608 ± 0.479
3.649PheAsp: 3.649 ± 0.84
3.649PheGlu: 3.649 ± 0.835
2.027PhePhe: 2.027 ± 0.557
1.824PheGly: 1.824 ± 0.667
0.405PheHis: 0.405 ± 0.284
3.649PheIle: 3.649 ± 0.797
1.419PheLys: 1.419 ± 0.411
3.852PheLeu: 3.852 ± 1.096
1.419PheMet: 1.419 ± 0.536
2.027PheAsn: 2.027 ± 0.748
2.838PhePro: 2.838 ± 0.714
1.419PheGln: 1.419 ± 0.549
1.824PheArg: 1.824 ± 0.519
2.635PheSer: 2.635 ± 0.964
4.054PheThr: 4.054 ± 0.863
2.635PheVal: 2.635 ± 0.875
0.608PheTrp: 0.608 ± 0.327
0.608PheTyr: 0.608 ± 0.318
0.0PheXaa: 0.0 ± 0.0
Gly
3.649GlyAla: 3.649 ± 0.76
0.608GlyCys: 0.608 ± 0.325
4.46GlyAsp: 4.46 ± 1.649
2.838GlyGlu: 2.838 ± 0.634
3.041GlyPhe: 3.041 ± 0.895
8.311GlyGly: 8.311 ± 2.005
0.405GlyHis: 0.405 ± 0.236
3.649GlyIle: 3.649 ± 1.122
7.906GlyLys: 7.906 ± 1.534
5.068GlyLeu: 5.068 ± 1.242
2.433GlyMet: 2.433 ± 0.627
2.838GlyAsn: 2.838 ± 0.75
1.014GlyPro: 1.014 ± 0.398
2.433GlyGln: 2.433 ± 0.682
4.054GlyArg: 4.054 ± 1.011
5.068GlySer: 5.068 ± 1.068
4.46GlyThr: 4.46 ± 1.152
5.271GlyVal: 5.271 ± 1.019
1.216GlyTrp: 1.216 ± 0.51
4.662GlyTyr: 4.662 ± 0.95
0.0GlyXaa: 0.0 ± 0.0
His
1.216HisAla: 1.216 ± 0.533
0.203HisCys: 0.203 ± 0.197
0.811HisAsp: 0.811 ± 0.395
1.216HisGlu: 1.216 ± 0.447
0.811HisPhe: 0.811 ± 0.49
0.405HisGly: 0.405 ± 0.278
0.405HisHis: 0.405 ± 0.432
1.216HisIle: 1.216 ± 0.451
1.622HisLys: 1.622 ± 0.421
0.608HisLeu: 0.608 ± 0.408
0.0HisMet: 0.0 ± 0.0
0.608HisAsn: 0.608 ± 0.359
0.608HisPro: 0.608 ± 0.3
0.405HisGln: 0.405 ± 0.288
0.608HisArg: 0.608 ± 0.354
1.419HisSer: 1.419 ± 0.589
0.608HisThr: 0.608 ± 0.342
2.23HisVal: 2.23 ± 0.667
0.0HisTrp: 0.0 ± 0.0
1.014HisTyr: 1.014 ± 0.373
0.0HisXaa: 0.0 ± 0.0
Ile
3.649IleAla: 3.649 ± 0.755
0.0IleCys: 0.0 ± 0.0
2.433IleAsp: 2.433 ± 0.825
4.46IleGlu: 4.46 ± 0.933
1.419IlePhe: 1.419 ± 0.53
2.838IleGly: 2.838 ± 0.805
1.419IleHis: 1.419 ± 0.643
4.054IleIle: 4.054 ± 1.1
3.446IleLys: 3.446 ± 0.799
4.865IleLeu: 4.865 ± 0.998
1.824IleMet: 1.824 ± 0.487
3.446IleAsn: 3.446 ± 0.817
3.041IlePro: 3.041 ± 1.109
3.041IleGln: 3.041 ± 0.792
2.23IleArg: 2.23 ± 0.582
2.433IleSer: 2.433 ± 0.751
1.622IleThr: 1.622 ± 0.559
3.852IleVal: 3.852 ± 0.765
1.014IleTrp: 1.014 ± 0.478
3.243IleTyr: 3.243 ± 0.841
0.0IleXaa: 0.0 ± 0.0
Lys
6.69LysAla: 6.69 ± 1.231
0.0LysCys: 0.0 ± 0.0
4.662LysAsp: 4.662 ± 1.011
8.717LysGlu: 8.717 ± 1.541
2.23LysPhe: 2.23 ± 0.623
6.284LysGly: 6.284 ± 1.305
1.216LysHis: 1.216 ± 0.644
3.243LysIle: 3.243 ± 0.781
9.122LysLys: 9.122 ± 2.56
7.298LysLeu: 7.298 ± 1.462
2.23LysMet: 2.23 ± 0.81
4.865LysAsn: 4.865 ± 0.864
5.068LysPro: 5.068 ± 1.715
4.257LysGln: 4.257 ± 0.763
4.865LysArg: 4.865 ± 1.023
4.257LysSer: 4.257 ± 1.105
5.676LysThr: 5.676 ± 1.02
4.46LysVal: 4.46 ± 0.736
1.419LysTrp: 1.419 ± 0.58
2.433LysTyr: 2.433 ± 0.827
0.0LysXaa: 0.0 ± 0.0
Leu
3.852LeuAla: 3.852 ± 0.788
0.811LeuCys: 0.811 ± 0.492
4.257LeuAsp: 4.257 ± 1.021
7.501LeuGlu: 7.501 ± 1.367
4.865LeuPhe: 4.865 ± 0.903
2.838LeuGly: 2.838 ± 0.677
0.811LeuHis: 0.811 ± 0.301
3.852LeuIle: 3.852 ± 0.86
6.69LeuLys: 6.69 ± 1.163
7.703LeuLeu: 7.703 ± 1.522
3.446LeuMet: 3.446 ± 1.067
4.257LeuAsn: 4.257 ± 1.098
4.257LeuPro: 4.257 ± 1.307
3.649LeuGln: 3.649 ± 0.838
2.838LeuArg: 2.838 ± 0.678
4.662LeuSer: 4.662 ± 0.916
4.865LeuThr: 4.865 ± 0.853
4.46LeuVal: 4.46 ± 1.139
1.622LeuTrp: 1.622 ± 0.567
3.243LeuTyr: 3.243 ± 1.443
0.0LeuXaa: 0.0 ± 0.0
Met
2.23MetAla: 2.23 ± 0.512
0.405MetCys: 0.405 ± 0.274
1.824MetAsp: 1.824 ± 0.619
2.635MetGlu: 2.635 ± 0.609
0.608MetPhe: 0.608 ± 0.524
2.23MetGly: 2.23 ± 0.731
0.405MetHis: 0.405 ± 0.278
1.622MetIle: 1.622 ± 0.693
1.824MetLys: 1.824 ± 0.708
2.433MetLeu: 2.433 ± 0.864
1.014MetMet: 1.014 ± 0.458
2.027MetAsn: 2.027 ± 0.822
1.014MetPro: 1.014 ± 0.503
1.216MetGln: 1.216 ± 0.483
1.622MetArg: 1.622 ± 0.73
2.027MetSer: 2.027 ± 0.75
2.027MetThr: 2.027 ± 0.536
2.433MetVal: 2.433 ± 0.555
0.811MetTrp: 0.811 ± 0.449
1.622MetTyr: 1.622 ± 0.566
0.0MetXaa: 0.0 ± 0.0
Asn
4.257AsnAla: 4.257 ± 0.8
0.203AsnCys: 0.203 ± 0.162
2.433AsnAsp: 2.433 ± 0.702
3.243AsnGlu: 3.243 ± 0.769
2.027AsnPhe: 2.027 ± 0.597
4.054AsnGly: 4.054 ± 0.793
1.216AsnHis: 1.216 ± 0.515
2.433AsnIle: 2.433 ± 0.731
3.243AsnLys: 3.243 ± 0.88
3.243AsnLeu: 3.243 ± 0.916
1.419AsnMet: 1.419 ± 0.743
3.041AsnAsn: 3.041 ± 0.717
0.811AsnPro: 0.811 ± 0.413
0.811AsnGln: 0.811 ± 0.415
1.622AsnArg: 1.622 ± 0.633
3.852AsnSer: 3.852 ± 0.813
4.257AsnThr: 4.257 ± 0.778
3.041AsnVal: 3.041 ± 0.962
0.405AsnTrp: 0.405 ± 0.307
1.622AsnTyr: 1.622 ± 0.52
0.0AsnXaa: 0.0 ± 0.0
Pro
2.838ProAla: 2.838 ± 0.922
0.405ProCys: 0.405 ± 0.307
1.622ProAsp: 1.622 ± 0.627
2.23ProGlu: 2.23 ± 0.606
2.23ProPhe: 2.23 ± 0.636
2.23ProGly: 2.23 ± 0.662
0.405ProHis: 0.405 ± 0.252
3.041ProIle: 3.041 ± 0.855
4.257ProLys: 4.257 ± 1.695
2.635ProLeu: 2.635 ± 0.753
0.203ProMet: 0.203 ± 0.177
1.824ProAsn: 1.824 ± 0.575
1.216ProPro: 1.216 ± 0.692
1.216ProGln: 1.216 ± 0.496
1.824ProArg: 1.824 ± 0.58
3.852ProSer: 3.852 ± 0.886
2.23ProThr: 2.23 ± 0.628
4.054ProVal: 4.054 ± 0.814
0.405ProTrp: 0.405 ± 0.341
1.824ProTyr: 1.824 ± 0.56
0.0ProXaa: 0.0 ± 0.0
Gln
2.838GlnAla: 2.838 ± 0.961
0.405GlnCys: 0.405 ± 0.284
1.622GlnAsp: 1.622 ± 0.609
1.824GlnGlu: 1.824 ± 0.917
1.014GlnPhe: 1.014 ± 0.383
2.433GlnGly: 2.433 ± 0.776
0.405GlnHis: 0.405 ± 0.284
2.23GlnIle: 2.23 ± 0.504
3.041GlnLys: 3.041 ± 0.647
3.041GlnLeu: 3.041 ± 0.907
1.419GlnMet: 1.419 ± 0.561
1.622GlnAsn: 1.622 ± 0.56
1.216GlnPro: 1.216 ± 0.429
1.622GlnGln: 1.622 ± 0.804
1.824GlnArg: 1.824 ± 0.571
1.824GlnSer: 1.824 ± 0.672
2.027GlnThr: 2.027 ± 0.604
3.446GlnVal: 3.446 ± 0.828
0.608GlnTrp: 0.608 ± 0.41
2.027GlnTyr: 2.027 ± 0.644
0.0GlnXaa: 0.0 ± 0.0
Arg
3.243ArgAla: 3.243 ± 0.704
0.203ArgCys: 0.203 ± 0.197
2.838ArgAsp: 2.838 ± 0.729
4.662ArgGlu: 4.662 ± 0.932
1.622ArgPhe: 1.622 ± 0.572
2.838ArgGly: 2.838 ± 0.915
0.405ArgHis: 0.405 ± 0.316
3.243ArgIle: 3.243 ± 0.805
4.46ArgLys: 4.46 ± 0.864
4.865ArgLeu: 4.865 ± 1.058
2.027ArgMet: 2.027 ± 0.657
1.419ArgAsn: 1.419 ± 0.546
2.23ArgPro: 2.23 ± 0.795
2.027ArgGln: 2.027 ± 0.797
2.23ArgArg: 2.23 ± 0.671
2.027ArgSer: 2.027 ± 0.715
1.419ArgThr: 1.419 ± 0.484
4.054ArgVal: 4.054 ± 1.168
0.0ArgTrp: 0.0 ± 0.0
0.811ArgTyr: 0.811 ± 0.396
0.0ArgXaa: 0.0 ± 0.0
Ser
2.433SerAla: 2.433 ± 0.876
0.405SerCys: 0.405 ± 0.252
2.635SerAsp: 2.635 ± 0.838
3.446SerGlu: 3.446 ± 0.648
2.027SerPhe: 2.027 ± 0.583
5.879SerGly: 5.879 ± 1.214
1.216SerHis: 1.216 ± 0.406
5.271SerIle: 5.271 ± 1.257
6.284SerLys: 6.284 ± 1.146
2.838SerLeu: 2.838 ± 0.843
1.824SerMet: 1.824 ± 0.648
2.838SerAsn: 2.838 ± 0.804
2.635SerPro: 2.635 ± 0.658
2.027SerGln: 2.027 ± 0.6
2.838SerArg: 2.838 ± 0.954
3.852SerSer: 3.852 ± 0.868
2.635SerThr: 2.635 ± 0.744
3.649SerVal: 3.649 ± 0.812
1.014SerTrp: 1.014 ± 0.541
2.23SerTyr: 2.23 ± 0.634
0.0SerXaa: 0.0 ± 0.0
Thr
2.838ThrAla: 2.838 ± 0.659
0.0ThrCys: 0.0 ± 0.0
2.635ThrAsp: 2.635 ± 0.869
3.041ThrGlu: 3.041 ± 0.831
3.243ThrPhe: 3.243 ± 0.812
6.892ThrGly: 6.892 ± 1.369
0.405ThrHis: 0.405 ± 0.344
3.243ThrIle: 3.243 ± 0.707
6.892ThrLys: 6.892 ± 0.982
6.284ThrLeu: 6.284 ± 1.251
1.216ThrMet: 1.216 ± 0.54
3.649ThrAsn: 3.649 ± 0.981
2.23ThrPro: 2.23 ± 0.617
2.027ThrGln: 2.027 ± 0.896
2.635ThrArg: 2.635 ± 0.696
4.054ThrSer: 4.054 ± 1.095
4.054ThrThr: 4.054 ± 1.101
4.257ThrVal: 4.257 ± 0.737
1.014ThrTrp: 1.014 ± 0.401
0.608ThrTyr: 0.608 ± 0.313
0.0ThrXaa: 0.0 ± 0.0
Val
4.865ValAla: 4.865 ± 1.047
0.0ValCys: 0.0 ± 0.0
3.446ValAsp: 3.446 ± 0.923
4.662ValGlu: 4.662 ± 1.053
3.243ValPhe: 3.243 ± 0.786
4.662ValGly: 4.662 ± 0.626
1.419ValHis: 1.419 ± 0.493
2.838ValIle: 2.838 ± 0.665
4.46ValLys: 4.46 ± 0.885
6.081ValLeu: 6.081 ± 1.127
2.23ValMet: 2.23 ± 0.756
2.433ValAsn: 2.433 ± 0.929
3.649ValPro: 3.649 ± 0.912
1.824ValGln: 1.824 ± 0.519
2.433ValArg: 2.433 ± 0.761
4.257ValSer: 4.257 ± 0.998
5.676ValThr: 5.676 ± 1.441
5.473ValVal: 5.473 ± 1.301
1.419ValTrp: 1.419 ± 0.55
2.838ValTyr: 2.838 ± 0.724
0.0ValXaa: 0.0 ± 0.0
Trp
1.622TrpAla: 1.622 ± 0.543
0.608TrpCys: 0.608 ± 0.345
0.811TrpAsp: 0.811 ± 0.357
0.811TrpGlu: 0.811 ± 0.443
1.216TrpPhe: 1.216 ± 0.552
0.608TrpGly: 0.608 ± 0.378
0.405TrpHis: 0.405 ± 0.395
0.405TrpIle: 0.405 ± 0.314
1.824TrpLys: 1.824 ± 0.576
1.014TrpLeu: 1.014 ± 0.485
0.0TrpMet: 0.0 ± 0.0
0.203TrpAsn: 0.203 ± 0.182
0.0TrpPro: 0.0 ± 0.0
0.811TrpGln: 0.811 ± 0.462
0.608TrpArg: 0.608 ± 0.376
0.811TrpSer: 0.811 ± 0.374
0.405TrpThr: 0.405 ± 0.249
0.608TrpVal: 0.608 ± 0.365
0.203TrpTrp: 0.203 ± 0.22
0.811TrpTyr: 0.811 ± 0.54
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.838TyrAla: 2.838 ± 0.929
0.608TyrCys: 0.608 ± 0.31
3.041TyrAsp: 3.041 ± 0.864
3.041TyrGlu: 3.041 ± 0.807
1.014TyrPhe: 1.014 ± 0.395
4.054TyrGly: 4.054 ± 1.239
1.216TyrHis: 1.216 ± 0.54
1.622TyrIle: 1.622 ± 0.857
4.257TyrLys: 4.257 ± 0.977
2.635TyrLeu: 2.635 ± 0.663
1.622TyrMet: 1.622 ± 0.506
2.433TyrAsn: 2.433 ± 0.647
1.622TyrPro: 1.622 ± 0.472
0.811TyrGln: 0.811 ± 0.367
1.216TyrArg: 1.216 ± 0.436
1.622TyrSer: 1.622 ± 0.539
1.824TyrThr: 1.824 ± 0.69
2.635TyrVal: 2.635 ± 0.697
0.0TyrTrp: 0.0 ± 0.0
2.027TyrTyr: 2.027 ± 0.654
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 32 proteins (4934 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski