Amino acid dipepetide frequency for Bacillus phage VMY22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.519AlaAla: 0.519 ± 0.31
0.346AlaCys: 0.346 ± 0.244
2.593AlaAsp: 2.593 ± 0.554
2.939AlaGlu: 2.939 ± 1.16
3.111AlaPhe: 3.111 ± 0.754
2.42AlaGly: 2.42 ± 0.867
0.519AlaHis: 0.519 ± 0.307
3.457AlaIle: 3.457 ± 0.588
3.111AlaLys: 3.111 ± 0.634
3.457AlaLeu: 3.457 ± 0.62
2.074AlaMet: 2.074 ± 0.575
2.42AlaAsn: 2.42 ± 0.638
1.383AlaPro: 1.383 ± 0.502
1.729AlaGln: 1.729 ± 0.435
1.901AlaArg: 1.901 ± 0.665
2.766AlaSer: 2.766 ± 0.603
5.013AlaThr: 5.013 ± 0.958
2.42AlaVal: 2.42 ± 0.606
0.864AlaTrp: 0.864 ± 0.476
2.247AlaTyr: 2.247 ± 0.683
0.0AlaXaa: 0.0 ± 0.0
Cys
0.346CysAla: 0.346 ± 0.287
0.173CysCys: 0.173 ± 0.202
0.864CysAsp: 0.864 ± 0.325
0.864CysGlu: 0.864 ± 0.459
0.0CysPhe: 0.0 ± 0.0
0.346CysGly: 0.346 ± 0.251
0.173CysHis: 0.173 ± 0.143
0.0CysIle: 0.0 ± 0.0
0.691CysLys: 0.691 ± 0.402
0.173CysLeu: 0.173 ± 0.174
0.346CysMet: 0.346 ± 0.239
0.691CysAsn: 0.691 ± 0.267
0.173CysPro: 0.173 ± 0.202
0.346CysGln: 0.346 ± 0.253
0.346CysArg: 0.346 ± 0.232
0.346CysSer: 0.346 ± 0.25
0.0CysThr: 0.0 ± 0.0
0.519CysVal: 0.519 ± 0.258
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.939AspAla: 2.939 ± 0.967
0.173AspCys: 0.173 ± 0.173
2.074AspAsp: 2.074 ± 0.699
4.494AspGlu: 4.494 ± 0.776
3.457AspPhe: 3.457 ± 0.462
6.05AspGly: 6.05 ± 0.776
1.21AspHis: 1.21 ± 0.279
3.457AspIle: 3.457 ± 0.82
3.284AspLys: 3.284 ± 0.832
4.84AspLeu: 4.84 ± 1.133
1.901AspMet: 1.901 ± 0.497
4.322AspAsn: 4.322 ± 0.942
2.074AspPro: 2.074 ± 0.498
0.173AspGln: 0.173 ± 0.185
3.63AspArg: 3.63 ± 0.823
5.013AspSer: 5.013 ± 0.764
3.111AspThr: 3.111 ± 0.908
5.532AspVal: 5.532 ± 0.855
0.519AspTrp: 0.519 ± 0.397
3.111AspTyr: 3.111 ± 0.857
0.0AspXaa: 0.0 ± 0.0
Glu
5.359GluAla: 5.359 ± 0.802
1.21GluCys: 1.21 ± 0.604
3.803GluAsp: 3.803 ± 0.963
5.704GluGlu: 5.704 ± 0.839
4.494GluPhe: 4.494 ± 1.116
6.914GluGly: 6.914 ± 1.085
1.901GluHis: 1.901 ± 0.536
5.704GluIle: 5.704 ± 0.922
2.593GluLys: 2.593 ± 0.56
4.667GluLeu: 4.667 ± 0.846
2.766GluMet: 2.766 ± 0.738
4.149GluAsn: 4.149 ± 0.772
1.037GluPro: 1.037 ± 0.416
3.457GluGln: 3.457 ± 0.844
2.939GluArg: 2.939 ± 0.825
4.322GluSer: 4.322 ± 0.944
5.186GluThr: 5.186 ± 1.28
4.667GluVal: 4.667 ± 1.222
1.21GluTrp: 1.21 ± 0.408
3.63GluTyr: 3.63 ± 0.58
0.0GluXaa: 0.0 ± 0.0
Phe
1.901PheAla: 1.901 ± 0.507
0.173PheCys: 0.173 ± 0.171
3.284PheAsp: 3.284 ± 0.833
2.593PheGlu: 2.593 ± 0.586
1.556PhePhe: 1.556 ± 0.695
1.901PheGly: 1.901 ± 0.601
1.037PheHis: 1.037 ± 0.389
4.149PheIle: 4.149 ± 0.921
4.84PheLys: 4.84 ± 1.067
2.247PheLeu: 2.247 ± 0.659
2.593PheMet: 2.593 ± 0.712
3.803PheAsn: 3.803 ± 0.668
1.21PhePro: 1.21 ± 0.494
2.074PheGln: 2.074 ± 0.456
1.729PheArg: 1.729 ± 0.587
1.729PheSer: 1.729 ± 0.472
3.976PheThr: 3.976 ± 0.984
3.803PheVal: 3.803 ± 0.74
0.173PheTrp: 0.173 ± 0.171
2.766PheTyr: 2.766 ± 0.71
0.0PheXaa: 0.0 ± 0.0
Gly
3.284GlyAla: 3.284 ± 0.834
0.0GlyCys: 0.0 ± 0.0
3.63GlyAsp: 3.63 ± 0.902
6.05GlyGlu: 6.05 ± 0.907
2.247GlyPhe: 2.247 ± 0.469
3.803GlyGly: 3.803 ± 0.793
0.691GlyHis: 0.691 ± 0.309
3.284GlyIle: 3.284 ± 0.472
7.087GlyLys: 7.087 ± 1.225
4.322GlyLeu: 4.322 ± 1.04
1.901GlyMet: 1.901 ± 0.696
3.976GlyAsn: 3.976 ± 0.865
0.0GlyPro: 0.0 ± 0.0
2.766GlyGln: 2.766 ± 0.575
2.074GlyArg: 2.074 ± 0.52
3.803GlySer: 3.803 ± 0.882
3.63GlyThr: 3.63 ± 0.892
5.359GlyVal: 5.359 ± 1.11
1.21GlyTrp: 1.21 ± 0.608
3.111GlyTyr: 3.111 ± 0.697
0.0GlyXaa: 0.0 ± 0.0
His
0.519HisAla: 0.519 ± 0.324
0.0HisCys: 0.0 ± 0.0
1.383HisAsp: 1.383 ± 0.524
1.21HisGlu: 1.21 ± 0.388
1.383HisPhe: 1.383 ± 0.455
0.691HisGly: 0.691 ± 0.299
0.864HisHis: 0.864 ± 0.322
1.556HisIle: 1.556 ± 0.512
1.383HisLys: 1.383 ± 0.544
1.21HisLeu: 1.21 ± 0.615
0.519HisMet: 0.519 ± 0.275
1.383HisAsn: 1.383 ± 0.401
1.037HisPro: 1.037 ± 0.446
0.864HisGln: 0.864 ± 0.34
1.556HisArg: 1.556 ± 0.454
0.691HisSer: 0.691 ± 0.302
2.939HisThr: 2.939 ± 0.748
1.901HisVal: 1.901 ± 0.855
0.0HisTrp: 0.0 ± 0.0
1.037HisTyr: 1.037 ± 0.389
0.0HisXaa: 0.0 ± 0.0
Ile
2.939IleAla: 2.939 ± 0.784
0.346IleCys: 0.346 ± 0.203
4.667IleAsp: 4.667 ± 0.479
6.569IleGlu: 6.569 ± 1.617
3.111IlePhe: 3.111 ± 0.644
3.457IleGly: 3.457 ± 0.842
2.42IleHis: 2.42 ± 0.634
3.457IleIle: 3.457 ± 0.974
6.742IleLys: 6.742 ± 1.219
3.63IleLeu: 3.63 ± 0.74
1.556IleMet: 1.556 ± 0.611
3.803IleAsn: 3.803 ± 0.614
2.247IlePro: 2.247 ± 0.553
3.111IleGln: 3.111 ± 0.675
3.63IleArg: 3.63 ± 0.786
3.803IleSer: 3.803 ± 0.683
3.803IleThr: 3.803 ± 0.939
4.322IleVal: 4.322 ± 0.673
0.864IleTrp: 0.864 ± 0.353
2.593IleTyr: 2.593 ± 0.629
0.0IleXaa: 0.0 ± 0.0
Lys
3.457LysAla: 3.457 ± 0.679
0.519LysCys: 0.519 ± 0.265
6.05LysAsp: 6.05 ± 1.002
7.779LysGlu: 7.779 ± 1.068
3.284LysPhe: 3.284 ± 0.954
5.532LysGly: 5.532 ± 0.987
2.074LysHis: 2.074 ± 0.471
3.111LysIle: 3.111 ± 0.748
5.359LysLys: 5.359 ± 1.036
6.396LysLeu: 6.396 ± 0.891
3.63LysMet: 3.63 ± 0.762
3.976LysAsn: 3.976 ± 0.846
3.457LysPro: 3.457 ± 0.749
3.976LysGln: 3.976 ± 0.631
3.284LysArg: 3.284 ± 0.858
4.667LysSer: 4.667 ± 0.732
6.05LysThr: 6.05 ± 1.077
3.63LysVal: 3.63 ± 0.744
1.556LysTrp: 1.556 ± 0.411
2.939LysTyr: 2.939 ± 0.744
0.0LysXaa: 0.0 ± 0.0
Leu
2.939LeuAla: 2.939 ± 0.702
0.346LeuCys: 0.346 ± 0.254
2.939LeuAsp: 2.939 ± 0.792
4.667LeuGlu: 4.667 ± 1.057
2.939LeuPhe: 2.939 ± 0.748
2.939LeuGly: 2.939 ± 0.555
1.556LeuHis: 1.556 ± 0.489
5.013LeuIle: 5.013 ± 1.117
6.05LeuLys: 6.05 ± 1.041
5.359LeuLeu: 5.359 ± 0.989
1.901LeuMet: 1.901 ± 0.747
4.494LeuAsn: 4.494 ± 0.707
3.111LeuPro: 3.111 ± 0.734
2.766LeuGln: 2.766 ± 0.795
2.247LeuArg: 2.247 ± 0.658
4.149LeuSer: 4.149 ± 0.992
5.186LeuThr: 5.186 ± 0.989
4.322LeuVal: 4.322 ± 0.56
1.037LeuTrp: 1.037 ± 0.483
3.111LeuTyr: 3.111 ± 0.945
0.0LeuXaa: 0.0 ± 0.0
Met
1.037MetAla: 1.037 ± 0.331
0.173MetCys: 0.173 ± 0.202
2.074MetAsp: 2.074 ± 0.556
2.939MetGlu: 2.939 ± 0.623
2.074MetPhe: 2.074 ± 0.834
0.519MetGly: 0.519 ± 0.313
1.037MetHis: 1.037 ± 0.29
3.111MetIle: 3.111 ± 0.914
3.976MetLys: 3.976 ± 0.795
2.074MetLeu: 2.074 ± 0.511
1.556MetMet: 1.556 ± 0.506
2.42MetAsn: 2.42 ± 0.524
1.037MetPro: 1.037 ± 0.441
0.519MetGln: 0.519 ± 0.555
0.864MetArg: 0.864 ± 0.332
1.556MetSer: 1.556 ± 0.432
3.284MetThr: 3.284 ± 0.795
2.247MetVal: 2.247 ± 0.491
0.173MetTrp: 0.173 ± 0.185
1.21MetTyr: 1.21 ± 0.507
0.0MetXaa: 0.0 ± 0.0
Asn
2.939AsnAla: 2.939 ± 0.846
0.519AsnCys: 0.519 ± 0.253
3.63AsnAsp: 3.63 ± 0.667
5.013AsnGlu: 5.013 ± 0.948
3.111AsnPhe: 3.111 ± 0.591
5.013AsnGly: 5.013 ± 1.216
1.556AsnHis: 1.556 ± 0.5
4.149AsnIle: 4.149 ± 0.757
4.84AsnLys: 4.84 ± 0.983
4.149AsnLeu: 4.149 ± 0.826
1.901AsnMet: 1.901 ± 0.565
2.247AsnAsn: 2.247 ± 0.78
2.939AsnPro: 2.939 ± 0.488
3.457AsnGln: 3.457 ± 0.808
3.803AsnArg: 3.803 ± 0.727
4.494AsnSer: 4.494 ± 1.018
4.149AsnThr: 4.149 ± 0.677
3.111AsnVal: 3.111 ± 0.877
1.383AsnTrp: 1.383 ± 0.363
3.111AsnTyr: 3.111 ± 0.656
0.0AsnXaa: 0.0 ± 0.0
Pro
1.901ProAla: 1.901 ± 0.495
0.173ProCys: 0.173 ± 0.185
1.729ProAsp: 1.729 ± 0.558
2.593ProGlu: 2.593 ± 0.546
2.42ProPhe: 2.42 ± 0.478
0.346ProGly: 0.346 ± 0.26
0.519ProHis: 0.519 ± 0.261
3.111ProIle: 3.111 ± 0.606
3.111ProLys: 3.111 ± 0.858
1.901ProLeu: 1.901 ± 0.578
0.864ProMet: 0.864 ± 0.528
2.939ProAsn: 2.939 ± 0.753
0.691ProPro: 0.691 ± 0.392
1.383ProGln: 1.383 ± 0.456
0.691ProArg: 0.691 ± 0.296
2.42ProSer: 2.42 ± 0.716
0.691ProThr: 0.691 ± 0.292
2.766ProVal: 2.766 ± 0.763
0.346ProTrp: 0.346 ± 0.205
2.247ProTyr: 2.247 ± 0.498
0.0ProXaa: 0.0 ± 0.0
Gln
1.901GlnAla: 1.901 ± 0.544
0.346GlnCys: 0.346 ± 0.294
2.593GlnAsp: 2.593 ± 0.667
2.42GlnGlu: 2.42 ± 0.658
1.729GlnPhe: 1.729 ± 0.619
2.939GlnGly: 2.939 ± 0.706
0.346GlnHis: 0.346 ± 0.344
2.593GlnIle: 2.593 ± 0.809
3.284GlnLys: 3.284 ± 0.639
3.63GlnLeu: 3.63 ± 0.688
0.519GlnMet: 0.519 ± 0.294
2.247GlnAsn: 2.247 ± 0.613
0.691GlnPro: 0.691 ± 0.369
1.729GlnGln: 1.729 ± 0.427
1.556GlnArg: 1.556 ± 0.598
2.939GlnSer: 2.939 ± 0.632
2.247GlnThr: 2.247 ± 0.702
2.247GlnVal: 2.247 ± 0.774
0.691GlnTrp: 0.691 ± 0.394
1.901GlnTyr: 1.901 ± 0.464
0.0GlnXaa: 0.0 ± 0.0
Arg
2.593ArgAla: 2.593 ± 0.53
0.173ArgCys: 0.173 ± 0.174
2.766ArgAsp: 2.766 ± 0.743
3.457ArgGlu: 3.457 ± 0.677
0.864ArgPhe: 0.864 ± 0.531
1.383ArgGly: 1.383 ± 0.549
1.21ArgHis: 1.21 ± 0.415
3.803ArgIle: 3.803 ± 0.871
2.593ArgLys: 2.593 ± 0.571
2.939ArgLeu: 2.939 ± 0.84
1.901ArgMet: 1.901 ± 0.565
4.322ArgAsn: 4.322 ± 1.112
1.21ArgPro: 1.21 ± 0.473
1.21ArgGln: 1.21 ± 0.439
1.556ArgArg: 1.556 ± 0.826
1.383ArgSer: 1.383 ± 0.529
2.766ArgThr: 2.766 ± 0.895
2.593ArgVal: 2.593 ± 0.982
0.346ArgTrp: 0.346 ± 0.243
2.939ArgTyr: 2.939 ± 0.982
0.0ArgXaa: 0.0 ± 0.0
Ser
1.556SerAla: 1.556 ± 0.51
0.864SerCys: 0.864 ± 0.285
3.803SerAsp: 3.803 ± 0.839
3.803SerGlu: 3.803 ± 1.04
2.766SerPhe: 2.766 ± 0.749
3.284SerGly: 3.284 ± 0.787
1.383SerHis: 1.383 ± 0.519
5.532SerIle: 5.532 ± 0.85
4.494SerLys: 4.494 ± 0.578
4.149SerLeu: 4.149 ± 0.744
2.939SerMet: 2.939 ± 0.553
4.494SerAsn: 4.494 ± 1.115
1.729SerPro: 1.729 ± 0.568
2.42SerGln: 2.42 ± 0.903
1.556SerArg: 1.556 ± 0.537
2.593SerSer: 2.593 ± 0.739
3.457SerThr: 3.457 ± 0.688
3.457SerVal: 3.457 ± 0.822
0.691SerTrp: 0.691 ± 0.273
2.766SerTyr: 2.766 ± 0.642
0.0SerXaa: 0.0 ± 0.0
Thr
2.766ThrAla: 2.766 ± 0.724
0.346ThrCys: 0.346 ± 0.242
3.63ThrAsp: 3.63 ± 0.794
4.84ThrGlu: 4.84 ± 0.947
3.111ThrPhe: 3.111 ± 0.841
6.396ThrGly: 6.396 ± 1.573
0.864ThrHis: 0.864 ± 0.364
4.322ThrIle: 4.322 ± 0.655
6.742ThrLys: 6.742 ± 1.092
4.322ThrLeu: 4.322 ± 0.842
1.037ThrMet: 1.037 ± 0.535
3.457ThrAsn: 3.457 ± 0.649
4.84ThrPro: 4.84 ± 0.711
2.247ThrGln: 2.247 ± 0.528
3.63ThrArg: 3.63 ± 0.699
5.013ThrSer: 5.013 ± 0.88
2.939ThrThr: 2.939 ± 1.082
4.667ThrVal: 4.667 ± 0.91
1.556ThrTrp: 1.556 ± 0.393
2.593ThrTyr: 2.593 ± 0.8
0.0ThrXaa: 0.0 ± 0.0
Val
2.939ValAla: 2.939 ± 0.87
0.346ValCys: 0.346 ± 0.213
4.84ValAsp: 4.84 ± 1.055
3.284ValGlu: 3.284 ± 0.933
2.939ValPhe: 2.939 ± 0.7
3.803ValGly: 3.803 ± 0.629
1.037ValHis: 1.037 ± 0.422
3.63ValIle: 3.63 ± 0.764
5.704ValLys: 5.704 ± 0.927
3.111ValLeu: 3.111 ± 0.61
1.901ValMet: 1.901 ± 0.466
4.84ValAsn: 4.84 ± 0.852
2.247ValPro: 2.247 ± 0.585
2.247ValGln: 2.247 ± 0.534
2.247ValArg: 2.247 ± 0.728
3.976ValSer: 3.976 ± 0.906
6.914ValThr: 6.914 ± 1.264
3.284ValVal: 3.284 ± 0.891
1.383ValTrp: 1.383 ± 0.47
2.593ValTyr: 2.593 ± 0.597
0.0ValXaa: 0.0 ± 0.0
Trp
1.21TrpAla: 1.21 ± 0.468
0.0TrpCys: 0.0 ± 0.0
1.037TrpAsp: 1.037 ± 0.367
1.21TrpGlu: 1.21 ± 0.503
0.691TrpPhe: 0.691 ± 0.247
1.21TrpGly: 1.21 ± 0.412
0.864TrpHis: 0.864 ± 0.385
1.037TrpIle: 1.037 ± 0.471
0.864TrpLys: 0.864 ± 0.344
0.519TrpLeu: 0.519 ± 0.244
0.691TrpMet: 0.691 ± 0.337
1.383TrpAsn: 1.383 ± 0.483
0.173TrpPro: 0.173 ± 0.185
0.691TrpGln: 0.691 ± 0.297
0.519TrpArg: 0.519 ± 0.435
0.0TrpSer: 0.0 ± 0.0
1.556TrpThr: 1.556 ± 0.573
0.519TrpVal: 0.519 ± 0.274
0.0TrpTrp: 0.0 ± 0.0
0.691TrpTyr: 0.691 ± 0.407
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.247TyrAla: 2.247 ± 0.828
0.173TyrCys: 0.173 ± 0.143
3.976TyrAsp: 3.976 ± 0.824
3.284TyrGlu: 3.284 ± 0.779
2.247TyrPhe: 2.247 ± 0.425
3.284TyrGly: 3.284 ± 0.82
0.864TyrHis: 0.864 ± 0.307
2.593TyrIle: 2.593 ± 0.66
3.976TyrLys: 3.976 ± 0.775
3.976TyrLeu: 3.976 ± 0.909
1.21TyrMet: 1.21 ± 0.341
4.322TyrAsn: 4.322 ± 0.766
1.556TyrPro: 1.556 ± 0.479
1.383TyrGln: 1.383 ± 0.608
2.247TyrArg: 2.247 ± 0.637
2.074TyrSer: 2.074 ± 0.606
2.42TyrThr: 2.42 ± 0.667
1.901TyrVal: 1.901 ± 0.592
0.864TyrTrp: 0.864 ± 0.339
1.901TyrTyr: 1.901 ± 0.621
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (5786 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski