Amino acid dipeptide frequency for Bacillus phage DK3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
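The counting described above can be sketched in Python. This is only an illustration under stated assumptions: the function names (`dipeptide_permille`, `classify`) are hypothetical, sequences are assumed to use one-letter codes, frequencies are normalised by the total number of observed adjacent pairs (which may differ from the normalisation used by the database), and the over/under rule simply compares against the uniform expectation of 2.5‰, since the exact colouring threshold is not stated on this page.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_PERMILLE = 1000.0 / 400       # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs.

    Pairs are counted with overlapping windows of length 2 and never span
    two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKKEL", "MKE"])  # toy sequences, not real data
    print(freqs["KE"], classify(freqs["KE"]))
```

With this rule, a table value such as LysGlu (10.767‰) would be labelled "over" and GlyPro (0.133‰) "under".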

All values are given in per mille (‰); to convert them to fractions, multiply by 10⁻³.
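As a small worked example of the unit conversion, the following sketch converts a per mille value to a fraction and to a percentage. The helper names are hypothetical; the input value is the LysGlu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (1 per mille = 0.1 %)."""
    return value_permille * 0.1


if __name__ == "__main__":
    lys_glu = 10.767  # LysGlu frequency in per mille, taken from the table
    print(f"{permille_to_fraction(lys_glu):.6f}")  # fraction of all dipeptides
    print(f"{permille_to_percent(lys_glu):.4f} %")
```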

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.399 ± 0.25
AlaCys: 0.665 ± 0.335
AlaAsp: 3.323 ± 0.794
AlaGlu: 4.519 ± 0.795
AlaPhe: 3.057 ± 0.574
AlaGly: 3.057 ± 0.704
AlaHis: 0.399 ± 0.274
AlaIle: 2.924 ± 0.84
AlaLys: 4.387 ± 0.772
AlaLeu: 2.393 ± 0.41
AlaMet: 1.329 ± 0.357
AlaAsn: 2.127 ± 0.612
AlaPro: 1.196 ± 0.556
AlaGln: 1.595 ± 0.518
AlaArg: 1.595 ± 0.429
AlaSer: 1.994 ± 0.583
AlaThr: 2.791 ± 0.739
AlaVal: 2.659 ± 0.761
AlaTrp: 0.399 ± 0.186
AlaTyr: 2.127 ± 0.504
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.665 ± 0.297
CysCys: 0.0 ± 0.0
CysAsp: 1.063 ± 0.411
CysGlu: 1.196 ± 0.458
CysPhe: 0.266 ± 0.209
CysGly: 1.196 ± 0.509
CysHis: 0.133 ± 0.136
CysIle: 0.266 ± 0.167
CysLys: 1.196 ± 0.382
CysLeu: 0.665 ± 0.339
CysMet: 0.665 ± 0.371
CysAsn: 0.399 ± 0.214
CysPro: 0.399 ± 0.184
CysGln: 0.266 ± 0.192
CysArg: 0.266 ± 0.18
CysSer: 0.133 ± 0.12
CysThr: 0.399 ± 0.212
CysVal: 0.93 ± 0.402
CysTrp: 0.133 ± 0.132
CysTyr: 0.532 ± 0.225
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.924 ± 0.577
AspCys: 0.532 ± 0.254
AspAsp: 4.652 ± 0.747
AspGlu: 5.45 ± 0.789
AspPhe: 4.121 ± 0.57
AspGly: 4.387 ± 1.254
AspHis: 1.063 ± 0.375
AspIle: 4.387 ± 0.752
AspLys: 6.912 ± 1.163
AspLeu: 4.121 ± 0.664
AspMet: 2.924 ± 0.548
AspAsn: 3.722 ± 0.798
AspPro: 2.924 ± 0.729
AspGln: 2.393 ± 0.412
AspArg: 1.994 ± 0.46
AspSer: 3.323 ± 0.708
AspThr: 3.456 ± 0.972
AspVal: 5.982 ± 0.72
AspTrp: 0.532 ± 0.36
AspTyr: 2.659 ± 0.638
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.456 ± 0.651
GluCys: 1.196 ± 0.426
GluAsp: 4.254 ± 0.811
GluGlu: 6.513 ± 1.976
GluPhe: 4.121 ± 0.906
GluGly: 4.918 ± 0.791
GluHis: 1.329 ± 0.411
GluIle: 7.045 ± 1.088
GluLys: 8.64 ± 2.157
GluLeu: 8.108 ± 1.635
GluMet: 2.526 ± 0.742
GluAsn: 5.716 ± 0.936
GluPro: 1.196 ± 0.469
GluGln: 2.659 ± 0.559
GluArg: 3.855 ± 0.877
GluSer: 4.121 ± 0.597
GluThr: 3.057 ± 0.66
GluVal: 5.184 ± 0.928
GluTrp: 1.329 ± 0.43
GluTyr: 3.722 ± 0.656
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.063 ± 0.341
PheCys: 0.798 ± 0.364
PheAsp: 3.988 ± 0.705
PheGlu: 3.19 ± 0.54
PhePhe: 1.063 ± 0.384
PheGly: 1.861 ± 0.49
PheHis: 1.462 ± 0.471
PheIle: 3.722 ± 0.667
PheLys: 4.387 ± 0.973
PheLeu: 2.791 ± 0.509
PheMet: 1.861 ± 0.473
PheAsn: 3.057 ± 0.676
PhePro: 1.063 ± 0.353
PheGln: 1.329 ± 0.343
PheArg: 0.665 ± 0.375
PheSer: 2.526 ± 0.582
PheThr: 2.924 ± 0.628
PheVal: 3.323 ± 0.645
PheTrp: 0.532 ± 0.251
PheTyr: 2.924 ± 0.593
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.323 ± 0.818
GlyCys: 0.532 ± 0.257
GlyAsp: 3.323 ± 0.944
GlyGlu: 4.785 ± 0.627
GlyPhe: 3.057 ± 0.518
GlyGly: 2.791 ± 0.666
GlyHis: 0.532 ± 0.25
GlyIle: 4.652 ± 0.66
GlyLys: 5.716 ± 1.318
GlyLeu: 4.652 ± 0.703
GlyMet: 1.861 ± 0.477
GlyAsn: 4.254 ± 0.863
GlyPro: 0.133 ± 0.113
GlyGln: 1.994 ± 0.585
GlyArg: 2.526 ± 0.632
GlySer: 4.519 ± 0.677
GlyThr: 4.121 ± 0.917
GlyVal: 3.19 ± 0.816
GlyTrp: 0.665 ± 0.286
GlyTyr: 4.254 ± 0.814
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.399 ± 0.219
HisCys: 0.399 ± 0.228
HisAsp: 0.93 ± 0.304
HisGlu: 1.063 ± 0.401
HisPhe: 0.665 ± 0.27
HisGly: 0.665 ± 0.292
HisHis: 0.399 ± 0.192
HisIle: 2.393 ± 0.5
HisLys: 1.063 ± 0.407
HisLeu: 2.26 ± 0.635
HisMet: 0.399 ± 0.226
HisAsn: 1.063 ± 0.424
HisPro: 0.133 ± 0.122
HisGln: 0.798 ± 0.25
HisArg: 0.798 ± 0.331
HisSer: 1.063 ± 0.437
HisThr: 1.329 ± 0.462
HisVal: 0.93 ± 0.276
HisTrp: 0.133 ± 0.136
HisTyr: 1.063 ± 0.301
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.323 ± 0.673
IleCys: 0.532 ± 0.314
IleAsp: 5.583 ± 0.759
IleGlu: 6.38 ± 1.052
IlePhe: 1.861 ± 0.41
IleGly: 4.918 ± 0.691
IleHis: 1.462 ± 0.395
IleIle: 3.988 ± 1.097
IleLys: 6.912 ± 1.234
IleLeu: 4.121 ± 0.771
IleMet: 2.393 ± 0.565
IleAsn: 4.519 ± 0.823
IlePro: 1.861 ± 0.485
IleGln: 2.26 ± 0.437
IleArg: 3.19 ± 0.682
IleSer: 3.19 ± 0.621
IleThr: 3.855 ± 0.616
IleVal: 4.785 ± 1.031
IleTrp: 0.798 ± 0.303
IleTyr: 3.456 ± 0.724
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.456 ± 0.941
LysCys: 0.93 ± 0.324
LysAsp: 6.513 ± 1.191
LysGlu: 10.767 ± 2.638
LysPhe: 3.855 ± 0.779
LysGly: 4.387 ± 0.567
LysHis: 1.728 ± 0.531
LysIle: 5.184 ± 0.882
LysLys: 9.172 ± 1.284
LysLeu: 6.912 ± 0.948
LysMet: 4.254 ± 0.981
LysAsn: 5.849 ± 1.036
LysPro: 1.728 ± 0.52
LysGln: 3.057 ± 0.74
LysArg: 3.988 ± 0.579
LysSer: 3.722 ± 0.626
LysThr: 5.716 ± 0.944
LysVal: 7.178 ± 0.82
LysTrp: 1.196 ± 0.366
LysTyr: 4.785 ± 1.292
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.057 ± 0.56
LeuCys: 0.532 ± 0.246
LeuAsp: 5.982 ± 0.842
LeuGlu: 4.652 ± 0.916
LeuPhe: 2.26 ± 0.587
LeuGly: 4.121 ± 0.632
LeuHis: 2.127 ± 0.535
LeuIle: 4.254 ± 0.773
LeuLys: 7.045 ± 1.082
LeuLeu: 3.722 ± 0.678
LeuMet: 3.057 ± 0.557
LeuAsn: 5.184 ± 0.793
LeuPro: 1.994 ± 0.407
LeuGln: 2.924 ± 0.737
LeuArg: 4.519 ± 0.583
LeuSer: 3.456 ± 0.632
LeuThr: 4.785 ± 0.888
LeuVal: 3.722 ± 0.686
LeuTrp: 0.93 ± 0.466
LeuTyr: 2.526 ± 0.604
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.329 ± 0.352
MetCys: 0.399 ± 0.254
MetAsp: 1.462 ± 0.438
MetGlu: 3.057 ± 0.816
MetPhe: 1.728 ± 0.529
MetGly: 2.393 ± 0.509
MetHis: 0.399 ± 0.214
MetIle: 2.127 ± 0.6
MetLys: 3.722 ± 0.775
MetLeu: 2.924 ± 0.606
MetMet: 1.728 ± 0.679
MetAsn: 3.589 ± 0.815
MetPro: 0.665 ± 0.279
MetGln: 1.063 ± 0.326
MetArg: 1.462 ± 0.471
MetSer: 1.329 ± 0.354
MetThr: 1.994 ± 0.586
MetVal: 1.329 ± 0.416
MetTrp: 0.93 ± 0.329
MetTyr: 1.462 ± 0.381
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.26 ± 0.616
AsnCys: 0.665 ± 0.312
AsnAsp: 4.254 ± 0.804
AsnGlu: 4.918 ± 0.744
AsnPhe: 2.924 ± 0.551
AsnGly: 4.121 ± 0.845
AsnHis: 0.665 ± 0.304
AsnIle: 3.855 ± 0.984
AsnLys: 6.646 ± 1.257
AsnLeu: 5.051 ± 0.891
AsnMet: 2.127 ± 0.489
AsnAsn: 3.988 ± 0.695
AsnPro: 2.393 ± 0.474
AsnGln: 2.26 ± 0.446
AsnArg: 3.589 ± 0.898
AsnSer: 4.254 ± 0.703
AsnThr: 2.526 ± 0.668
AsnVal: 4.387 ± 0.882
AsnTrp: 1.329 ± 0.389
AsnTyr: 3.456 ± 0.655
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.063 ± 0.371
ProCys: 0.133 ± 0.128
ProAsp: 1.196 ± 0.423
ProGlu: 2.393 ± 0.586
ProPhe: 1.595 ± 0.454
ProGly: 1.329 ± 0.374
ProHis: 0.399 ± 0.222
ProIle: 1.861 ± 0.397
ProLys: 1.861 ± 0.482
ProLeu: 1.329 ± 0.348
ProMet: 0.93 ± 0.317
ProAsn: 1.595 ± 0.489
ProPro: 1.196 ± 0.318
ProGln: 1.196 ± 0.366
ProArg: 0.93 ± 0.32
ProSer: 1.861 ± 0.574
ProThr: 1.595 ± 0.393
ProVal: 1.196 ± 0.325
ProTrp: 0.266 ± 0.154
ProTyr: 1.595 ± 0.422
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.595 ± 0.493
GlnCys: 0.266 ± 0.189
GlnAsp: 1.462 ± 0.417
GlnGlu: 2.26 ± 0.742
GlnPhe: 0.93 ± 0.36
GlnGly: 2.127 ± 0.818
GlnHis: 0.399 ± 0.23
GlnIle: 3.057 ± 0.58
GlnLys: 3.456 ± 0.609
GlnLeu: 3.19 ± 0.697
GlnMet: 0.665 ± 0.329
GlnAsn: 1.994 ± 0.462
GlnPro: 1.063 ± 0.352
GlnGln: 1.462 ± 0.446
GlnArg: 1.994 ± 0.63
GlnSer: 1.462 ± 0.446
GlnThr: 1.861 ± 0.373
GlnVal: 1.994 ± 0.58
GlnTrp: 0.532 ± 0.321
GlnTyr: 2.26 ± 0.489
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.659 ± 0.498
ArgCys: 0.532 ± 0.273
ArgAsp: 2.26 ± 0.559
ArgGlu: 4.254 ± 0.797
ArgPhe: 2.127 ± 0.431
ArgGly: 2.393 ± 0.839
ArgHis: 0.532 ± 0.311
ArgIle: 2.26 ± 0.499
ArgLys: 4.519 ± 0.714
ArgLeu: 2.659 ± 0.533
ArgMet: 1.595 ± 0.524
ArgAsn: 2.659 ± 0.662
ArgPro: 0.93 ± 0.396
ArgGln: 2.127 ± 0.535
ArgArg: 1.595 ± 0.43
ArgSer: 1.329 ± 0.325
ArgThr: 1.994 ± 0.563
ArgVal: 1.462 ± 0.427
ArgTrp: 0.532 ± 0.213
ArgTyr: 2.791 ± 0.559
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.791 ± 0.722
SerCys: 0.93 ± 0.3
SerAsp: 3.19 ± 0.839
SerGlu: 3.057 ± 0.713
SerPhe: 2.26 ± 0.666
SerGly: 3.057 ± 0.619
SerHis: 1.063 ± 0.419
SerIle: 4.519 ± 0.741
SerLys: 2.924 ± 0.763
SerLeu: 3.855 ± 0.729
SerMet: 1.595 ± 0.312
SerAsn: 3.988 ± 0.87
SerPro: 1.196 ± 0.328
SerGln: 1.728 ± 0.474
SerArg: 1.063 ± 0.346
SerSer: 1.728 ± 0.608
SerThr: 2.393 ± 0.769
SerVal: 3.722 ± 0.581
SerTrp: 0.532 ± 0.234
SerTyr: 3.057 ± 0.533
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.057 ± 0.912
ThrCys: 0.665 ± 0.319
ThrAsp: 4.387 ± 0.992
ThrGlu: 3.722 ± 0.625
ThrPhe: 2.526 ± 0.751
ThrGly: 3.988 ± 0.948
ThrHis: 0.93 ± 0.362
ThrIle: 4.785 ± 0.931
ThrLys: 4.387 ± 0.685
ThrLeu: 4.254 ± 0.892
ThrMet: 1.595 ± 0.448
ThrAsn: 2.924 ± 0.591
ThrPro: 1.063 ± 0.375
ThrGln: 1.462 ± 0.42
ThrArg: 2.526 ± 0.618
ThrSer: 2.659 ± 0.679
ThrThr: 4.387 ± 1.079
ThrVal: 4.918 ± 0.693
ThrTrp: 0.133 ± 0.122
ThrTyr: 2.393 ± 0.53
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.791 ± 0.617
ValCys: 0.532 ± 0.189
ValAsp: 5.583 ± 0.61
ValGlu: 5.716 ± 0.751
ValPhe: 2.659 ± 0.472
ValGly: 5.051 ± 0.614
ValHis: 1.462 ± 0.519
ValIle: 3.988 ± 0.716
ValLys: 5.716 ± 0.795
ValLeu: 3.589 ± 0.617
ValMet: 1.728 ± 0.459
ValAsn: 3.988 ± 0.696
ValPro: 2.526 ± 0.577
ValGln: 1.994 ± 0.535
ValArg: 2.127 ± 0.406
ValSer: 3.057 ± 0.539
ValThr: 4.652 ± 1.057
ValVal: 5.051 ± 0.852
ValTrp: 1.728 ± 0.514
ValTyr: 1.861 ± 0.477
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.399 ± 0.224
TrpCys: 0.0 ± 0.0
TrpAsp: 0.665 ± 0.304
TrpGlu: 1.196 ± 0.422
TrpPhe: 1.595 ± 0.484
TrpGly: 1.063 ± 0.399
TrpHis: 0.532 ± 0.208
TrpIle: 0.798 ± 0.273
TrpLys: 1.196 ± 0.447
TrpLeu: 1.063 ± 0.395
TrpMet: 0.798 ± 0.344
TrpAsn: 1.196 ± 0.403
TrpPro: 0.0 ± 0.0
TrpGln: 0.399 ± 0.211
TrpArg: 0.532 ± 0.265
TrpSer: 0.266 ± 0.166
TrpThr: 0.266 ± 0.189
TrpVal: 0.93 ± 0.305
TrpTrp: 0.266 ± 0.226
TrpTyr: 0.798 ± 0.309
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.924 ± 0.535
TyrCys: 0.532 ± 0.272
TyrAsp: 4.387 ± 0.737
TyrGlu: 3.855 ± 0.6
TyrPhe: 1.861 ± 0.348
TyrGly: 3.057 ± 0.557
TyrHis: 0.93 ± 0.3
TyrIle: 3.323 ± 0.596
TyrLys: 4.519 ± 0.974
TyrLeu: 3.19 ± 0.628
TyrMet: 1.063 ± 0.397
TyrAsn: 3.855 ± 0.588
TyrPro: 1.861 ± 0.43
TyrGln: 1.063 ± 0.47
TyrArg: 1.994 ± 0.569
TyrSer: 2.659 ± 0.703
TyrThr: 2.659 ± 0.459
TyrVal: 2.924 ± 0.747
TyrTrp: 1.063 ± 0.381
TyrTyr: 2.526 ± 0.564
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (7524 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
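A protein-level bootstrap of the kind mentioned in the note can be sketched as follows. This is a minimal illustration, not the database's actual implementation: the function name `bootstrap_error`, the use of the sample standard deviation as the error, and the fixed random seed are all assumptions. In each replicate, whole proteins are resampled with replacement and the dipeptide frequency is recomputed.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Proteins, not individual residues, are resampled with replacement;
    the error is taken as the standard deviation over the replicates
    (an assumption, the source does not state which statistic is used).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    return stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKKEL", "MKEKE", "MAAKE", "MLLKK"]  # toy data only
    print(bootstrap_error(toy_proteome, "KE"))
```

Resampling at the protein level keeps each protein's internal composition intact, so the resulting error reflects variation between proteins rather than between individual residue pairs.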

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski