Amino acid dipeptide frequency for Bacillus phage vB_BsuP-Goe1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
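As an illustration of the idea, the following is a minimal sketch of how dipeptide frequencies could be counted from a list of protein sequences. The function name `dipeptide_permille`, the one-letter input alphabet, and the choice to normalise by the total number of overlapping pairs are assumptions made for this example; the page does not state how the database itself normalises its values.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are counted as overlapping windows of
    length 2 and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of observed pairs.
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not taken from the phage proteome.
    for dp, freq in sorted(dipeptide_permille(["AAC", "AC"]).items()):
        print(dp, round(freq, 3))
```

For the toy input, the pairs are AA, AC and AC, so AC accounts for two thirds of all pairs and AA for one third.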

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins at random with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
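The unit conversion and the comparison against the uniform expectation can be sketched as follows. The names `permille_to_fraction` and `colour` are hypothetical, and the rule of comparing each value directly with 1/400 is an assumption based on the description above, not a confirmed description of how the page assigns its colours.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def colour(value, expected=EXPECTED_PERMILLE):
    """Assumed marking rule: red above the expectation, blue below it."""
    if value > expected:
        return "red"
    if value < expected:
        return "blue"
    return "none"


if __name__ == "__main__":
    # Two values taken from the table: AlaAla and AlaCys.
    for name, value in [("AlaAla", 2.601), ("AlaCys", 0.694)]:
        print(name, permille_to_fraction(value), colour(value))
```

Under this assumed rule, AlaAla (2.601‰) lies just above the 2.5‰ expectation and AlaCys (0.694‰) lies well below it.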

For more information, see the sequence space article on Wikipedia.

Rows give the first residue and columns the second residue of each dipeptide; each cell is frequency ± error (‰).

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.601 ± 0.611 | 0.694 ± 0.337 | 2.948 ± 0.706 | 3.295 ± 0.863 | 2.601 ± 1.107 | 3.989 ± 0.874 | 0.347 ± 0.2 | 4.683 ± 1.508 | 4.683 ± 0.894 | 3.642 ± 1.032 | 1.387 ± 0.435 | 3.469 ± 0.704 | 1.387 ± 0.411 | 3.295 ± 0.718 | 2.255 ± 0.695 | 4.856 ± 1.345 | 5.029 ± 1.212 | 4.336 ± 1.121 | 0.867 ± 0.372 | 3.122 ± 0.642 | 0.0 ± 0.0
Cys | 0.347 ± 0.251 | 0.0 ± 0.0 | 0.52 ± 0.401 | 0.694 ± 0.363 | 0.52 ± 0.219 | 0.52 ± 0.323 | 0.173 ± 0.156 | 0.694 ± 0.348 | 0.0 ± 0.0 | 0.173 ± 0.183 | 0.173 ± 0.2 | 1.041 ± 0.537 | 0.0 ± 0.0 | 0.52 ± 0.314 | 0.173 ± 0.179 | 0.694 ± 0.377 | 0.347 ± 0.257 | 0.867 ± 0.371 | 0.0 ± 0.0 | 0.52 ± 0.279 | 0.0 ± 0.0
Asp | 2.081 ± 0.792 | 0.694 ± 0.252 | 4.683 ± 0.749 | 4.683 ± 0.716 | 2.775 ± 0.759 | 5.376 ± 1.306 | 0.347 ± 0.257 | 5.55 ± 1.055 | 3.122 ± 0.873 | 4.856 ± 1.056 | 0.694 ± 0.432 | 3.642 ± 0.823 | 1.908 ± 0.443 | 0.347 ± 0.261 | 2.428 ± 0.517 | 4.162 ± 0.655 | 4.162 ± 1.113 | 5.376 ± 1.247 | 0.867 ± 0.426 | 2.775 ± 0.849 | 0.0 ± 0.0
Glu | 1.908 ± 0.586 | 0.52 ± 0.304 | 3.122 ± 0.973 | 3.815 ± 0.696 | 2.775 ± 0.804 | 4.162 ± 0.824 | 2.428 ± 0.733 | 6.243 ± 1.19 | 6.59 ± 1.3 | 4.509 ± 0.991 | 1.908 ± 0.692 | 2.948 ± 0.711 | 0.867 ± 0.336 | 2.428 ± 0.628 | 3.642 ± 0.659 | 3.989 ± 0.85 | 4.509 ± 1.152 | 4.336 ± 1.306 | 1.214 ± 0.497 | 4.509 ± 0.79 | 0.0 ± 0.0
Phe | 1.908 ± 0.42 | 0.0 ± 0.0 | 3.469 ± 0.884 | 3.642 ± 0.673 | 1.561 ± 0.513 | 2.428 ± 0.674 | 1.214 ± 0.466 | 3.469 ± 1.177 | 4.509 ± 1.395 | 2.255 ± 0.794 | 1.041 ± 0.322 | 3.122 ± 0.802 | 0.867 ± 0.469 | 1.041 ± 0.393 | 1.561 ± 0.492 | 2.428 ± 0.693 | 2.255 ± 0.551 | 2.081 ± 0.563 | 0.173 ± 0.168 | 1.387 ± 0.505 | 0.0 ± 0.0
Gly | 3.989 ± 0.771 | 0.347 ± 0.225 | 3.989 ± 0.848 | 3.295 ± 0.686 | 2.775 ± 0.917 | 4.336 ± 1.625 | 1.041 ± 0.38 | 5.203 ± 1.45 | 4.336 ± 1.0 | 5.203 ± 0.612 | 1.734 ± 0.575 | 6.59 ± 0.979 | 0.0 ± 0.0 | 1.908 ± 0.659 | 1.908 ± 0.502 | 5.55 ± 1.228 | 3.815 ± 0.82 | 5.376 ± 1.488 | 0.694 ± 0.52 | 4.162 ± 0.71 | 0.0 ± 0.0
His | 1.734 ± 0.7 | 0.0 ± 0.0 | 0.347 ± 0.234 | 1.734 ± 0.673 | 0.173 ± 0.184 | 0.867 ± 0.359 | 0.52 ± 0.36 | 0.867 ± 0.435 | 0.867 ± 0.434 | 1.041 ± 0.412 | 0.347 ± 0.221 | 0.347 ± 0.237 | 0.52 ± 0.35 | 0.867 ± 0.357 | 0.52 ± 0.416 | 1.214 ± 0.506 | 0.694 ± 0.569 | 1.387 ± 0.377 | 0.347 ± 0.252 | 1.214 ± 0.437 | 0.0 ± 0.0
Ile | 4.336 ± 0.726 | 0.694 ± 0.259 | 5.376 ± 0.914 | 5.029 ± 1.257 | 2.428 ± 0.768 | 4.683 ± 1.254 | 1.561 ± 0.367 | 3.989 ± 0.852 | 5.029 ± 0.925 | 4.162 ± 0.773 | 1.214 ± 0.541 | 5.029 ± 0.766 | 1.908 ± 0.627 | 2.601 ± 1.031 | 2.948 ± 0.541 | 5.376 ± 1.47 | 4.683 ± 0.848 | 5.203 ± 0.661 | 0.173 ± 0.169 | 2.948 ± 0.608 | 0.0 ± 0.0
Lys | 5.029 ± 1.196 | 0.347 ± 0.227 | 3.642 ± 1.202 | 5.897 ± 1.215 | 3.122 ± 0.765 | 3.642 ± 0.659 | 1.214 ± 0.541 | 4.162 ± 0.832 | 5.203 ± 1.54 | 6.937 ± 1.607 | 3.642 ± 0.726 | 5.723 ± 1.067 | 2.428 ± 0.52 | 2.948 ± 0.716 | 3.122 ± 0.743 | 4.683 ± 1.016 | 6.243 ± 0.935 | 4.856 ± 0.707 | 1.561 ± 0.559 | 2.255 ± 0.842 | 0.0 ± 0.0
Leu | 3.989 ± 0.811 | 0.867 ± 0.43 | 4.509 ± 0.843 | 6.07 ± 1.229 | 2.601 ± 0.824 | 3.642 ± 0.712 | 0.867 ± 0.321 | 3.989 ± 0.846 | 7.111 ± 1.71 | 4.683 ± 1.274 | 2.428 ± 0.574 | 6.417 ± 0.816 | 2.775 ± 0.938 | 1.734 ± 0.578 | 3.642 ± 1.03 | 6.764 ± 0.911 | 4.162 ± 0.83 | 5.029 ± 0.969 | 0.52 ± 0.276 | 4.336 ± 1.107 | 0.0 ± 0.0
Met | 1.734 ± 0.639 | 0.173 ± 0.169 | 1.041 ± 0.392 | 1.561 ± 0.497 | 1.561 ± 0.606 | 1.908 ± 0.523 | 0.347 ± 0.218 | 1.908 ± 0.702 | 2.775 ± 0.755 | 1.561 ± 0.517 | 0.52 ± 0.313 | 1.387 ± 0.59 | 0.867 ± 0.333 | 1.041 ± 0.301 | 2.081 ± 0.762 | 1.214 ± 0.344 | 2.081 ± 0.637 | 2.081 ± 0.448 | 0.347 ± 0.215 | 1.041 ± 0.396 | 0.0 ± 0.0
Asn | 5.897 ± 1.277 | 1.041 ± 0.938 | 4.509 ± 0.907 | 4.509 ± 0.999 | 2.428 ± 0.581 | 5.203 ± 1.551 | 0.0 ± 0.0 | 3.815 ± 1.126 | 5.55 ± 1.048 | 5.55 ± 0.892 | 1.908 ± 0.621 | 5.203 ± 1.018 | 3.989 ± 0.896 | 2.428 ± 0.697 | 1.561 ± 0.582 | 3.469 ± 0.74 | 5.55 ± 1.024 | 5.376 ± 0.872 | 0.694 ± 0.271 | 2.601 ± 0.627 | 0.0 ± 0.0
Pro | 2.255 ± 0.821 | 0.347 ± 0.257 | 2.081 ± 0.628 | 1.908 ± 0.614 | 1.214 ± 0.613 | 0.173 ± 0.169 | 0.347 ± 0.246 | 0.694 ± 0.273 | 1.734 ± 0.511 | 3.295 ± 0.611 | 1.041 ± 0.478 | 1.908 ± 0.509 | 0.867 ± 0.454 | 0.867 ± 0.372 | 1.214 ± 0.417 | 2.948 ± 0.841 | 0.694 ± 0.261 | 2.428 ± 0.829 | 0.0 ± 0.0 | 2.081 ± 0.398 | 0.0 ± 0.0
Gln | 2.255 ± 0.605 | 0.0 ± 0.0 | 0.867 ± 0.317 | 2.428 ± 0.532 | 1.734 ± 0.434 | 2.081 ± 0.565 | 0.173 ± 0.179 | 2.601 ± 0.552 | 2.428 ± 0.531 | 3.642 ± 0.877 | 1.041 ± 0.464 | 2.081 ± 0.677 | 0.867 ± 0.462 | 1.041 ± 0.496 | 1.734 ± 0.636 | 1.214 ± 0.39 | 1.387 ± 0.607 | 2.428 ± 0.57 | 0.52 ± 0.389 | 1.734 ± 0.476 | 0.0 ± 0.0
Arg | 1.908 ± 0.556 | 0.173 ± 0.183 | 2.255 ± 0.577 | 2.775 ± 0.724 | 1.908 ± 0.651 | 3.295 ± 0.708 | 0.694 ± 0.339 | 3.815 ± 0.691 | 2.775 ± 0.761 | 2.428 ± 0.644 | 1.561 ± 0.546 | 3.469 ± 0.868 | 1.734 ± 0.456 | 1.041 ± 0.354 | 1.214 ± 0.53 | 1.561 ± 0.535 | 3.469 ± 0.942 | 1.561 ± 0.444 | 0.52 ± 0.248 | 1.041 ± 0.361 | 0.0 ± 0.0
Ser | 5.203 ± 1.448 | 0.173 ± 0.179 | 5.029 ± 0.703 | 4.162 ± 1.163 | 2.428 ± 0.993 | 5.203 ± 1.275 | 0.694 ± 0.35 | 3.815 ± 0.918 | 4.336 ± 0.929 | 6.07 ± 1.034 | 1.387 ± 0.483 | 5.029 ± 1.13 | 1.387 ± 0.41 | 1.561 ± 0.523 | 3.122 ± 0.79 | 4.336 ± 0.756 | 4.162 ± 0.905 | 2.948 ± 0.662 | 0.347 ± 0.249 | 4.509 ± 0.855 | 0.0 ± 0.0
Thr | 5.203 ± 0.924 | 0.52 ± 0.26 | 3.642 ± 0.729 | 3.295 ± 0.695 | 2.255 ± 0.554 | 5.723 ± 1.05 | 0.52 ± 0.228 | 6.07 ± 1.057 | 5.029 ± 1.089 | 6.243 ± 1.334 | 1.387 ± 0.509 | 4.509 ± 0.963 | 1.734 ± 0.43 | 2.081 ± 0.607 | 1.734 ± 0.613 | 3.642 ± 0.86 | 6.764 ± 1.838 | 4.683 ± 1.11 | 1.214 ± 0.321 | 2.775 ± 0.663 | 0.0 ± 0.0
Val | 3.642 ± 1.054 | 0.867 ± 0.323 | 4.856 ± 1.29 | 3.815 ± 0.763 | 2.775 ± 1.02 | 4.856 ± 0.832 | 1.561 ± 0.503 | 4.162 ± 0.732 | 6.59 ± 1.094 | 3.989 ± 1.134 | 1.734 ± 0.427 | 5.203 ± 0.913 | 1.908 ± 0.486 | 2.081 ± 0.383 | 2.428 ± 0.663 | 5.029 ± 1.176 | 6.243 ± 1.074 | 3.295 ± 1.125 | 1.041 ± 0.403 | 2.601 ± 0.642 | 0.0 ± 0.0
Trp | 1.041 ± 0.581 | 0.0 ± 0.0 | 0.173 ± 0.152 | 0.694 ± 0.305 | 1.041 ± 0.435 | 0.173 ± 0.173 | 0.347 ± 0.223 | 0.347 ± 0.189 | 0.694 ± 0.32 | 1.387 ± 0.532 | 0.347 ± 0.23 | 1.214 ± 0.527 | 0.0 ± 0.0 | 0.867 ± 0.298 | 0.173 ± 0.169 | 0.867 ± 0.273 | 0.867 ± 0.565 | 1.214 ± 0.633 | 0.0 ± 0.0 | 0.347 ± 0.2 | 0.0 ± 0.0
Tyr | 2.428 ± 0.706 | 0.52 ± 0.328 | 3.642 ± 0.865 | 3.122 ± 0.538 | 1.734 ± 0.576 | 3.989 ± 0.864 | 1.214 ± 0.468 | 3.295 ± 0.776 | 3.295 ± 0.943 | 4.509 ± 1.069 | 1.561 ± 0.478 | 2.948 ± 0.804 | 2.081 ± 0.495 | 1.561 ± 0.753 | 1.734 ± 0.632 | 1.908 ± 0.669 | 1.908 ± 0.621 | 3.815 ± 0.903 | 0.694 ± 0.302 | 1.734 ± 0.558 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 24 proteins (5767 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
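A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names, the fixed seed, the normalisation by total pair count, and the use of the sample standard deviation as the error are all assumptions for illustration; the note above only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Hypothetical helper: each round draws len(proteins) whole proteins
    with replacement, so residues within a protein stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the phage proteome.
    toy = ["MKLAAG", "MAAKLV", "MSTAAL", "MKKLLA"]
    print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins. A dipeptide that never occurs yields an error of zero, which matches the 0.0 ± 0.0 entries in the table.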

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski