Amino acid dipeptide frequency for Enterobacteria phage IME10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
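As an illustration of how such frequencies can be obtained, the following minimal Python sketch counts overlapping pairs of adjacent residues within each protein and normalises by the total number of pairs. It is not the code used by Proteome-pI: the function name `dipeptide_frequencies`, the one-letter alphabet with "X" standing in for Xaa, the choice of denominator, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; "X" stands in for Xaa (assumption).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Dipeptides are overlapping pairs of adjacent residues, counted inside
    each protein only (never across protein boundaries). Any residue that
    is not one of the 20 standard ones is mapped to "X".
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Made-up toy sequences, not taken from the phage proteome.
toy = ["MKAAL", "AAXK"]
freqs = dipeptide_frequencies(toy)
print(len(freqs))    # number of dipeptide classes (21 x 21)
print(freqs["AA"])   # per mille frequency of Ala-Ala in the toy data
```

With the 21-letter alphabet the result always has 441 entries, and a dipeptide that never occurs simply gets a frequency of 0.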

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
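The unit conversion and the comparison against the uniform expectation can be sketched in Python as follows. The helper names `permille_to_fraction` and `classify` are hypothetical, and comparing directly against 2.5‰ (1/400) is an assumption about how "more than expected" is decided; the page does not state the exact rule behind the colouring.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=UNIFORM_EXPECTED_PERMILLE):
    """Label a frequency relative to the expected value (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: AlaAla and CysCys.
print(permille_to_fraction(9.292), classify(9.292))
print(permille_to_fraction(0.135), classify(0.135))
```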

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.292 ± 1.918
AlaCys: 0.539 ± 0.224
AlaAsp: 6.733 ± 0.817
AlaGlu: 9.022 ± 2.448
AlaPhe: 1.885 ± 0.515
AlaGly: 5.656 ± 1.11
AlaHis: 1.481 ± 0.434
AlaIle: 5.386 ± 0.91
AlaLys: 7.002 ± 1.506
AlaLeu: 6.329 ± 1.104
AlaMet: 3.501 ± 0.812
AlaAsn: 6.194 ± 1.196
AlaPro: 2.559 ± 0.628
AlaGln: 3.771 ± 0.751
AlaArg: 4.982 ± 0.915
AlaSer: 4.982 ± 0.565
AlaThr: 6.06 ± 0.819
AlaVal: 4.982 ± 0.743
AlaTrp: 1.077 ± 0.35
AlaTyr: 3.097 ± 0.635
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.943 ± 0.361
CysCys: 0.135 ± 0.143
CysAsp: 0.269 ± 0.148
CysGlu: 0.539 ± 0.235
CysPhe: 0.269 ± 0.175
CysGly: 1.212 ± 0.522
CysHis: 0.404 ± 0.312
CysIle: 0.539 ± 0.263
CysLys: 0.943 ± 0.311
CysLeu: 0.404 ± 0.277
CysMet: 0.135 ± 0.127
CysAsn: 0.269 ± 0.17
CysPro: 0.539 ± 0.317
CysGln: 0.673 ± 0.274
CysArg: 0.943 ± 0.387
CysSer: 0.943 ± 0.433
CysThr: 0.404 ± 0.279
CysVal: 0.808 ± 0.348
CysTrp: 0.404 ± 0.226
CysTyr: 0.404 ± 0.275
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.002 ± 0.833
AspCys: 1.212 ± 0.374
AspAsp: 3.905 ± 0.753
AspGlu: 4.04 ± 0.691
AspPhe: 2.289 ± 0.468
AspGly: 5.252 ± 0.737
AspHis: 1.077 ± 0.442
AspIle: 4.175 ± 0.607
AspLys: 3.636 ± 0.553
AspLeu: 4.175 ± 0.729
AspMet: 1.077 ± 0.32
AspAsn: 2.424 ± 0.448
AspPro: 1.751 ± 0.47
AspGln: 1.751 ± 0.495
AspArg: 2.828 ± 0.652
AspSer: 3.771 ± 0.799
AspThr: 2.02 ± 0.511
AspVal: 5.252 ± 1.056
AspTrp: 0.808 ± 0.31
AspTyr: 4.04 ± 0.585
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.541 ± 2.109
GluCys: 0.808 ± 0.323
GluAsp: 3.232 ± 0.558
GluGlu: 5.252 ± 1.725
GluPhe: 2.155 ± 0.494
GluGly: 3.367 ± 0.496
GluHis: 1.885 ± 0.497
GluIle: 4.175 ± 0.924
GluLys: 5.252 ± 1.01
GluLeu: 6.598 ± 0.926
GluMet: 2.02 ± 0.541
GluAsn: 2.828 ± 0.645
GluPro: 2.963 ± 0.692
GluGln: 4.982 ± 0.845
GluArg: 4.444 ± 1.312
GluSer: 3.905 ± 0.873
GluThr: 2.693 ± 0.527
GluVal: 3.771 ± 0.929
GluTrp: 1.885 ± 0.468
GluTyr: 1.616 ± 0.495
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.828 ± 0.563
PheCys: 0.135 ± 0.145
PheAsp: 2.828 ± 0.476
PheGlu: 2.424 ± 0.637
PhePhe: 0.808 ± 0.342
PheGly: 1.077 ± 0.272
PheHis: 0.404 ± 0.217
PheIle: 2.424 ± 0.611
PheLys: 1.616 ± 0.473
PheLeu: 1.751 ± 0.476
PheMet: 0.943 ± 0.336
PheAsn: 1.212 ± 0.344
PhePro: 0.808 ± 0.32
PheGln: 1.616 ± 0.612
PheArg: 1.077 ± 0.321
PheSer: 2.693 ± 0.621
PheThr: 2.289 ± 0.506
PheVal: 0.943 ± 0.267
PheTrp: 0.943 ± 0.369
PheTyr: 0.943 ± 0.446
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.386 ± 0.813
GlyCys: 0.539 ± 0.224
GlyAsp: 3.501 ± 0.657
GlyGlu: 4.579 ± 0.655
GlyPhe: 2.828 ± 0.341
GlyGly: 4.309 ± 1.004
GlyHis: 1.077 ± 0.367
GlyIle: 4.579 ± 0.607
GlyLys: 4.444 ± 0.737
GlyLeu: 4.982 ± 0.995
GlyMet: 2.155 ± 0.601
GlyAsn: 3.232 ± 0.429
GlyPro: 0.808 ± 0.292
GlyGln: 4.175 ± 0.976
GlyArg: 5.117 ± 0.963
GlySer: 3.636 ± 0.686
GlyThr: 4.04 ± 0.745
GlyVal: 5.656 ± 0.925
GlyTrp: 1.481 ± 0.46
GlyTyr: 2.559 ± 0.583
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.943 ± 0.415
HisCys: 0.135 ± 0.125
HisAsp: 1.481 ± 0.407
HisGlu: 1.885 ± 0.402
HisPhe: 0.135 ± 0.133
HisGly: 2.155 ± 0.604
HisHis: 0.269 ± 0.176
HisIle: 0.673 ± 0.25
HisLys: 1.347 ± 0.418
HisLeu: 1.616 ± 0.492
HisMet: 0.135 ± 0.171
HisAsn: 0.539 ± 0.224
HisPro: 0.808 ± 0.273
HisGln: 0.539 ± 0.324
HisArg: 1.347 ± 0.355
HisSer: 0.673 ± 0.24
HisThr: 0.135 ± 0.133
HisVal: 0.673 ± 0.321
HisTrp: 0.269 ± 0.164
HisTyr: 0.404 ± 0.242
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.117 ± 0.578
IleCys: 0.808 ± 0.296
IleAsp: 3.905 ± 0.654
IleGlu: 5.656 ± 0.978
IlePhe: 1.347 ± 0.364
IleGly: 4.04 ± 0.81
IleHis: 1.212 ± 0.348
IleIle: 3.367 ± 0.811
IleLys: 2.693 ± 0.564
IleLeu: 4.579 ± 0.61
IleMet: 0.673 ± 0.282
IleAsn: 3.097 ± 0.739
IlePro: 3.232 ± 0.638
IleGln: 2.424 ± 0.517
IleArg: 3.097 ± 0.489
IleSer: 2.963 ± 0.565
IleThr: 4.04 ± 0.746
IleVal: 2.559 ± 0.488
IleTrp: 0.673 ± 0.315
IleTyr: 1.481 ± 0.532
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.194 ± 1.705
LysCys: 1.077 ± 0.402
LysAsp: 3.232 ± 0.767
LysGlu: 5.386 ± 0.823
LysPhe: 1.885 ± 0.466
LysGly: 4.848 ± 0.659
LysHis: 0.808 ± 0.307
LysIle: 2.559 ± 0.57
LysLys: 4.579 ± 1.037
LysLeu: 4.713 ± 0.771
LysMet: 1.347 ± 0.369
LysAsn: 2.02 ± 0.463
LysPro: 3.097 ± 0.685
LysGln: 3.367 ± 0.707
LysArg: 3.905 ± 0.646
LysSer: 3.501 ± 0.622
LysThr: 2.693 ± 0.583
LysVal: 3.501 ± 0.68
LysTrp: 0.539 ± 0.279
LysTyr: 2.155 ± 0.475
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.214 ± 1.856
LeuCys: 0.673 ± 0.26
LeuAsp: 5.117 ± 0.997
LeuGlu: 4.982 ± 0.823
LeuPhe: 2.02 ± 0.514
LeuGly: 3.367 ± 1.083
LeuHis: 0.404 ± 0.265
LeuIle: 2.963 ± 0.508
LeuLys: 4.982 ± 0.906
LeuLeu: 5.386 ± 0.846
LeuMet: 2.559 ± 0.408
LeuAsn: 4.309 ± 0.631
LeuPro: 3.636 ± 0.668
LeuGln: 3.367 ± 0.883
LeuArg: 4.848 ± 0.797
LeuSer: 5.252 ± 0.768
LeuThr: 5.252 ± 0.693
LeuVal: 4.579 ± 0.576
LeuTrp: 0.943 ± 0.391
LeuTyr: 2.559 ± 0.611
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.828 ± 0.566
MetCys: 0.135 ± 0.14
MetAsp: 1.751 ± 0.562
MetGlu: 1.751 ± 0.423
MetPhe: 0.808 ± 0.257
MetGly: 2.289 ± 0.595
MetHis: 0.135 ± 0.111
MetIle: 1.751 ± 0.457
MetLys: 1.751 ± 0.499
MetLeu: 2.693 ± 0.46
MetMet: 1.616 ± 0.583
MetAsn: 0.808 ± 0.296
MetPro: 1.077 ± 0.375
MetGln: 1.751 ± 0.514
MetArg: 2.424 ± 0.733
MetSer: 1.616 ± 0.399
MetThr: 1.751 ± 0.545
MetVal: 1.212 ± 0.344
MetTrp: 0.0 ± 0.0
MetTyr: 0.808 ± 0.376
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.848 ± 0.763
AsnCys: 0.673 ± 0.344
AsnAsp: 3.771 ± 1.03
AsnGlu: 1.751 ± 0.564
AsnPhe: 0.539 ± 0.25
AsnGly: 4.175 ± 0.675
AsnHis: 1.212 ± 0.465
AsnIle: 2.559 ± 0.592
AsnLys: 2.02 ± 0.408
AsnLeu: 2.963 ± 0.61
AsnMet: 1.347 ± 0.419
AsnAsn: 2.963 ± 0.848
AsnPro: 3.097 ± 0.666
AsnGln: 3.636 ± 0.847
AsnArg: 2.559 ± 0.733
AsnSer: 2.155 ± 0.613
AsnThr: 3.367 ± 0.748
AsnVal: 2.424 ± 0.675
AsnTrp: 0.269 ± 0.173
AsnTyr: 1.212 ± 0.4
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.828 ± 0.621
ProCys: 0.404 ± 0.22
ProAsp: 3.501 ± 0.522
ProGlu: 4.04 ± 0.842
ProPhe: 1.347 ± 0.473
ProGly: 2.155 ± 0.485
ProHis: 0.673 ± 0.284
ProIle: 2.424 ± 0.655
ProLys: 3.097 ± 0.618
ProLeu: 2.559 ± 0.532
ProMet: 1.077 ± 0.427
ProAsn: 1.481 ± 0.285
ProPro: 1.077 ± 0.332
ProGln: 1.751 ± 0.496
ProArg: 1.885 ± 0.446
ProSer: 3.232 ± 0.717
ProThr: 1.616 ± 0.387
ProVal: 2.559 ± 0.668
ProTrp: 0.404 ± 0.209
ProTyr: 1.212 ± 0.454
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.598 ± 1.366
GlnCys: 0.539 ± 0.277
GlnAsp: 1.885 ± 0.573
GlnGlu: 2.155 ± 0.486
GlnPhe: 1.212 ± 0.313
GlnGly: 2.963 ± 0.623
GlnHis: 0.673 ± 0.256
GlnIle: 3.771 ± 0.696
GlnLys: 2.693 ± 0.552
GlnLeu: 4.175 ± 0.624
GlnMet: 2.559 ± 0.587
GlnAsn: 2.02 ± 0.727
GlnPro: 1.751 ± 0.428
GlnGln: 4.713 ± 1.339
GlnArg: 4.04 ± 0.703
GlnSer: 4.309 ± 1.344
GlnThr: 2.424 ± 0.635
GlnVal: 2.559 ± 0.679
GlnTrp: 1.212 ± 0.38
GlnTyr: 2.02 ± 0.628
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.982 ± 0.789
ArgCys: 0.673 ± 0.343
ArgAsp: 3.636 ± 0.807
ArgGlu: 4.848 ± 1.375
ArgPhe: 1.347 ± 0.406
ArgGly: 4.309 ± 0.68
ArgHis: 1.077 ± 0.321
ArgIle: 4.444 ± 0.964
ArgLys: 5.252 ± 1.107
ArgLeu: 4.848 ± 0.792
ArgMet: 2.02 ± 0.551
ArgAsn: 3.097 ± 0.531
ArgPro: 1.481 ± 0.492
ArgGln: 3.097 ± 0.978
ArgArg: 4.444 ± 0.932
ArgSer: 3.232 ± 0.586
ArgThr: 1.885 ± 0.503
ArgVal: 2.828 ± 0.727
ArgTrp: 0.539 ± 0.261
ArgTyr: 2.963 ± 0.673
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.309 ± 0.96
SerCys: 0.269 ± 0.189
SerAsp: 3.367 ± 0.487
SerGlu: 3.771 ± 0.868
SerPhe: 2.559 ± 0.505
SerGly: 5.386 ± 0.805
SerHis: 0.808 ± 0.332
SerIle: 3.905 ± 0.655
SerLys: 1.885 ± 0.533
SerLeu: 5.117 ± 1.005
SerMet: 1.751 ± 0.46
SerAsn: 3.367 ± 0.805
SerPro: 3.501 ± 0.701
SerGln: 4.848 ± 1.082
SerArg: 3.367 ± 0.69
SerSer: 3.501 ± 0.883
SerThr: 2.289 ± 0.522
SerVal: 2.963 ± 0.701
SerTrp: 1.077 ± 0.351
SerTyr: 2.155 ± 0.541
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.252 ± 0.806
ThrCys: 0.808 ± 0.322
ThrAsp: 3.501 ± 0.623
ThrGlu: 2.424 ± 0.513
ThrPhe: 2.02 ± 0.635
ThrGly: 4.444 ± 0.849
ThrHis: 0.673 ± 0.297
ThrIle: 2.02 ± 0.628
ThrLys: 2.828 ± 0.558
ThrLeu: 3.501 ± 0.876
ThrMet: 1.212 ± 0.458
ThrAsn: 2.155 ± 0.516
ThrPro: 3.232 ± 0.627
ThrGln: 2.559 ± 0.474
ThrArg: 2.424 ± 0.521
ThrSer: 2.559 ± 0.778
ThrThr: 3.097 ± 0.465
ThrVal: 4.579 ± 0.923
ThrTrp: 0.943 ± 0.27
ThrTyr: 1.481 ± 0.367
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.982 ± 0.774
ValCys: 0.539 ± 0.325
ValAsp: 3.232 ± 0.649
ValGlu: 4.713 ± 0.727
ValPhe: 2.155 ± 0.55
ValGly: 5.252 ± 0.917
ValHis: 0.808 ± 0.279
ValIle: 3.501 ± 0.51
ValLys: 2.963 ± 0.495
ValLeu: 3.232 ± 0.658
ValMet: 1.347 ± 0.314
ValAsn: 3.905 ± 0.555
ValPro: 1.751 ± 0.536
ValGln: 2.424 ± 0.544
ValArg: 2.963 ± 0.549
ValSer: 4.175 ± 0.611
ValThr: 3.905 ± 0.807
ValVal: 4.444 ± 0.967
ValTrp: 1.347 ± 0.462
ValTyr: 1.616 ± 0.716
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.212 ± 0.4
TrpCys: 0.269 ± 0.187
TrpAsp: 0.943 ± 0.395
TrpGlu: 0.404 ± 0.327
TrpPhe: 0.539 ± 0.243
TrpGly: 1.077 ± 0.43
TrpHis: 0.404 ± 0.245
TrpIle: 0.269 ± 0.176
TrpLys: 0.943 ± 0.39
TrpLeu: 1.751 ± 0.496
TrpMet: 0.673 ± 0.278
TrpAsn: 0.539 ± 0.286
TrpPro: 0.808 ± 0.338
TrpGln: 1.347 ± 0.385
TrpArg: 1.212 ± 0.437
TrpSer: 0.943 ± 0.391
TrpThr: 0.404 ± 0.232
TrpVal: 0.943 ± 0.34
TrpTrp: 0.404 ± 0.216
TrpTyr: 0.404 ± 0.208
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.367 ± 0.701
TyrCys: 0.808 ± 0.313
TyrAsp: 2.693 ± 0.746
TyrGlu: 1.751 ± 0.668
TyrPhe: 1.616 ± 0.41
TyrGly: 1.751 ± 0.58
TyrHis: 0.673 ± 0.304
TyrIle: 1.481 ± 0.532
TyrLys: 1.481 ± 0.355
TyrLeu: 4.04 ± 0.743
TyrMet: 0.673 ± 0.387
TyrAsn: 1.077 ± 0.485
TyrPro: 1.347 ± 0.538
TyrGln: 1.481 ± 0.365
TyrArg: 3.097 ± 0.778
TyrSer: 2.155 ± 0.654
TyrThr: 1.347 ± 0.392
TyrVal: 2.02 ± 0.404
TyrTrp: 0.269 ± 0.174
TyrTyr: 0.808 ± 0.396
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 27 proteins (7427 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
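A protein-level bootstrap of this kind can be sketched in Python as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values serves as the error. This is only an illustration under stated assumptions. The function names, the use of the sample standard deviation as the reported error, the fixed random seed, and the toy sequences are not taken from Proteome-pI.

```python
import random
from statistics import stdev


def dipeptide_permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide (e.g. "AA") over all proteins."""
    count = total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        count += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(resample, dipeptide))
    return stdev(estimates)  # assumption: the error is the standard deviation


# Made-up toy sequences, not taken from the phage proteome.
toy = ["MKAAL", "AAXK", "GGAAK"]
print(dipeptide_permille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so neighbouring residues within a protein stay together in every resample.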

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski