Amino acid dipepetide frequency for Escherichia phage mEp460_ev081

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.32AlaAla: 13.32 ± 3.19
0.847AlaCys: 0.847 ± 0.287
4.158AlaAsp: 4.158 ± 0.684
6.16AlaGlu: 6.16 ± 0.931
3.773AlaPhe: 3.773 ± 0.553
8.084AlaGly: 8.084 ± 0.947
1.54AlaHis: 1.54 ± 0.378
5.775AlaIle: 5.775 ± 0.89
4.004AlaLys: 4.004 ± 0.685
7.853AlaLeu: 7.853 ± 0.916
3.003AlaMet: 3.003 ± 0.523
2.31AlaAsn: 2.31 ± 0.441
2.695AlaPro: 2.695 ± 0.527
4.158AlaGln: 4.158 ± 0.7
6.545AlaArg: 6.545 ± 0.798
9.008AlaSer: 9.008 ± 2.167
5.236AlaThr: 5.236 ± 1.346
6.929AlaVal: 6.929 ± 0.923
1.54AlaTrp: 1.54 ± 0.324
2.464AlaTyr: 2.464 ± 0.389
0.0AlaXaa: 0.0 ± 0.0
Cys
0.693CysAla: 0.693 ± 0.267
0.231CysCys: 0.231 ± 0.143
0.539CysAsp: 0.539 ± 0.196
0.847CysGlu: 0.847 ± 0.308
0.385CysPhe: 0.385 ± 0.18
1.155CysGly: 1.155 ± 0.323
0.077CysHis: 0.077 ± 0.072
0.231CysIle: 0.231 ± 0.137
0.462CysLys: 0.462 ± 0.2
0.77CysLeu: 0.77 ± 0.271
0.308CysMet: 0.308 ± 0.152
0.462CysAsn: 0.462 ± 0.188
0.616CysPro: 0.616 ± 0.255
0.231CysGln: 0.231 ± 0.145
1.309CysArg: 1.309 ± 0.334
0.77CysSer: 0.77 ± 0.266
1.078CysThr: 1.078 ± 0.285
0.616CysVal: 0.616 ± 0.222
0.385CysTrp: 0.385 ± 0.175
0.462CysTyr: 0.462 ± 0.172
0.0CysXaa: 0.0 ± 0.0
Asp
6.391AspAla: 6.391 ± 0.658
0.847AspCys: 0.847 ± 0.28
3.157AspAsp: 3.157 ± 0.572
3.927AspGlu: 3.927 ± 0.769
2.002AspPhe: 2.002 ± 0.421
6.16AspGly: 6.16 ± 0.841
0.693AspHis: 0.693 ± 0.217
3.311AspIle: 3.311 ± 0.467
3.003AspLys: 3.003 ± 0.525
4.389AspLeu: 4.389 ± 0.725
1.309AspMet: 1.309 ± 0.296
2.849AspAsn: 2.849 ± 0.421
2.849AspPro: 2.849 ± 0.591
2.079AspGln: 2.079 ± 0.383
2.618AspArg: 2.618 ± 0.56
2.541AspSer: 2.541 ± 0.49
2.926AspThr: 2.926 ± 0.534
3.927AspVal: 3.927 ± 0.623
1.309AspTrp: 1.309 ± 0.405
1.463AspTyr: 1.463 ± 0.316
0.0AspXaa: 0.0 ± 0.0
Glu
5.929GluAla: 5.929 ± 0.723
0.77GluCys: 0.77 ± 0.259
3.234GluAsp: 3.234 ± 0.403
4.004GluGlu: 4.004 ± 0.665
2.002GluPhe: 2.002 ± 0.421
3.619GluGly: 3.619 ± 0.624
1.001GluHis: 1.001 ± 0.248
3.773GluIle: 3.773 ± 0.529
3.696GluLys: 3.696 ± 0.642
5.698GluLeu: 5.698 ± 0.667
2.002GluMet: 2.002 ± 0.457
2.772GluAsn: 2.772 ± 0.539
2.464GluPro: 2.464 ± 0.514
4.543GluGln: 4.543 ± 0.737
4.62GluArg: 4.62 ± 0.822
3.696GluSer: 3.696 ± 0.576
4.774GluThr: 4.774 ± 0.694
3.773GluVal: 3.773 ± 0.583
0.847GluTrp: 0.847 ± 0.222
1.848GluTyr: 1.848 ± 0.404
0.0GluXaa: 0.0 ± 0.0
Phe
1.694PheAla: 1.694 ± 0.357
0.462PheCys: 0.462 ± 0.173
2.233PheAsp: 2.233 ± 0.553
1.925PheGlu: 1.925 ± 0.368
0.77PhePhe: 0.77 ± 0.225
1.771PheGly: 1.771 ± 0.429
0.385PheHis: 0.385 ± 0.151
2.079PheIle: 2.079 ± 0.364
2.002PheLys: 2.002 ± 0.517
2.002PheLeu: 2.002 ± 0.383
1.001PheMet: 1.001 ± 0.222
1.771PheAsn: 1.771 ± 0.359
1.694PhePro: 1.694 ± 0.385
1.001PheGln: 1.001 ± 0.295
2.156PheArg: 2.156 ± 0.485
3.003PheSer: 3.003 ± 0.611
1.925PheThr: 1.925 ± 0.361
2.618PheVal: 2.618 ± 0.443
0.847PheTrp: 0.847 ± 0.24
1.155PheTyr: 1.155 ± 0.227
0.0PheXaa: 0.0 ± 0.0
Gly
5.775GlyAla: 5.775 ± 0.767
0.539GlyCys: 0.539 ± 0.215
4.543GlyAsp: 4.543 ± 0.709
4.697GlyGlu: 4.697 ± 0.631
2.464GlyPhe: 2.464 ± 0.605
4.004GlyGly: 4.004 ± 0.468
0.924GlyHis: 0.924 ± 0.197
4.774GlyIle: 4.774 ± 0.704
5.159GlyLys: 5.159 ± 0.572
3.927GlyLeu: 3.927 ± 0.513
2.464GlyMet: 2.464 ± 0.414
3.542GlyAsn: 3.542 ± 0.636
1.54GlyPro: 1.54 ± 0.344
2.772GlyGln: 2.772 ± 0.526
4.774GlyArg: 4.774 ± 0.512
5.005GlySer: 5.005 ± 0.814
4.235GlyThr: 4.235 ± 0.912
6.314GlyVal: 6.314 ± 0.703
1.463GlyTrp: 1.463 ± 0.356
1.925GlyTyr: 1.925 ± 0.426
0.0GlyXaa: 0.0 ± 0.0
His
1.463HisAla: 1.463 ± 0.442
0.385HisCys: 0.385 ± 0.167
0.693HisAsp: 0.693 ± 0.203
0.693HisGlu: 0.693 ± 0.249
0.693HisPhe: 0.693 ± 0.263
0.924HisGly: 0.924 ± 0.283
0.616HisHis: 0.616 ± 0.298
1.001HisIle: 1.001 ± 0.354
1.078HisLys: 1.078 ± 0.327
1.386HisLeu: 1.386 ± 0.321
0.308HisMet: 0.308 ± 0.164
0.693HisAsn: 0.693 ± 0.252
0.77HisPro: 0.77 ± 0.253
1.001HisGln: 1.001 ± 0.252
1.463HisArg: 1.463 ± 0.277
1.309HisSer: 1.309 ± 0.434
1.386HisThr: 1.386 ± 0.409
0.77HisVal: 0.77 ± 0.251
0.154HisTrp: 0.154 ± 0.111
0.462HisTyr: 0.462 ± 0.169
0.0HisXaa: 0.0 ± 0.0
Ile
4.466IleAla: 4.466 ± 0.684
1.309IleCys: 1.309 ± 0.338
3.927IleAsp: 3.927 ± 0.603
2.849IleGlu: 2.849 ± 0.435
1.925IlePhe: 1.925 ± 0.376
3.157IleGly: 3.157 ± 0.635
1.155IleHis: 1.155 ± 0.317
2.31IleIle: 2.31 ± 0.426
3.157IleLys: 3.157 ± 0.542
2.849IleLeu: 2.849 ± 0.411
1.617IleMet: 1.617 ± 0.408
2.618IleAsn: 2.618 ± 0.448
2.772IlePro: 2.772 ± 0.613
1.694IleGln: 1.694 ± 0.351
4.235IleArg: 4.235 ± 0.683
4.235IleSer: 4.235 ± 0.597
3.696IleThr: 3.696 ± 0.741
2.618IleVal: 2.618 ± 0.494
0.616IleTrp: 0.616 ± 0.252
1.386IleTyr: 1.386 ± 0.377
0.0IleXaa: 0.0 ± 0.0
Lys
4.851LysAla: 4.851 ± 0.696
1.001LysCys: 1.001 ± 0.32
3.311LysAsp: 3.311 ± 0.501
2.772LysGlu: 2.772 ± 0.581
1.694LysPhe: 1.694 ± 0.458
3.696LysGly: 3.696 ± 0.575
0.924LysHis: 0.924 ± 0.229
3.234LysIle: 3.234 ± 0.474
3.542LysLys: 3.542 ± 0.785
3.388LysLeu: 3.388 ± 0.497
1.309LysMet: 1.309 ± 0.294
2.695LysAsn: 2.695 ± 0.538
1.771LysPro: 1.771 ± 0.306
2.156LysGln: 2.156 ± 0.447
3.465LysArg: 3.465 ± 0.534
3.696LysSer: 3.696 ± 0.659
5.39LysThr: 5.39 ± 0.617
3.311LysVal: 3.311 ± 0.681
1.309LysTrp: 1.309 ± 0.408
1.771LysTyr: 1.771 ± 0.343
0.0LysXaa: 0.0 ± 0.0
Leu
7.545LeuAla: 7.545 ± 0.798
0.539LeuCys: 0.539 ± 0.236
4.312LeuAsp: 4.312 ± 0.517
3.927LeuGlu: 3.927 ± 0.529
1.694LeuPhe: 1.694 ± 0.424
4.389LeuGly: 4.389 ± 0.728
1.463LeuHis: 1.463 ± 0.382
3.003LeuIle: 3.003 ± 0.421
4.928LeuLys: 4.928 ± 0.622
5.082LeuLeu: 5.082 ± 0.658
1.386LeuMet: 1.386 ± 0.355
4.389LeuAsn: 4.389 ± 0.817
3.311LeuPro: 3.311 ± 0.656
2.926LeuGln: 2.926 ± 0.524
4.697LeuArg: 4.697 ± 0.628
6.852LeuSer: 6.852 ± 0.863
5.775LeuThr: 5.775 ± 0.726
4.543LeuVal: 4.543 ± 0.633
1.155LeuTrp: 1.155 ± 0.28
1.771LeuTyr: 1.771 ± 0.38
0.0LeuXaa: 0.0 ± 0.0
Met
2.233MetAla: 2.233 ± 0.437
0.231MetCys: 0.231 ± 0.134
1.386MetAsp: 1.386 ± 0.328
1.617MetGlu: 1.617 ± 0.354
0.539MetPhe: 0.539 ± 0.181
1.848MetGly: 1.848 ± 0.344
0.308MetHis: 0.308 ± 0.165
0.847MetIle: 0.847 ± 0.272
1.386MetLys: 1.386 ± 0.376
2.156MetLeu: 2.156 ± 0.463
0.693MetMet: 0.693 ± 0.281
1.54MetAsn: 1.54 ± 0.297
1.078MetPro: 1.078 ± 0.291
1.386MetGln: 1.386 ± 0.344
1.925MetArg: 1.925 ± 0.395
1.771MetSer: 1.771 ± 0.39
2.464MetThr: 2.464 ± 0.513
1.694MetVal: 1.694 ± 0.351
0.231MetTrp: 0.231 ± 0.137
0.616MetTyr: 0.616 ± 0.287
0.0MetXaa: 0.0 ± 0.0
Asn
4.312AsnAla: 4.312 ± 0.741
0.539AsnCys: 0.539 ± 0.201
2.31AsnAsp: 2.31 ± 0.378
2.079AsnGlu: 2.079 ± 0.455
1.232AsnPhe: 1.232 ± 0.286
4.466AsnGly: 4.466 ± 0.491
0.847AsnHis: 0.847 ± 0.301
2.464AsnIle: 2.464 ± 0.397
2.079AsnLys: 2.079 ± 0.429
3.157AsnLeu: 3.157 ± 0.576
0.77AsnMet: 0.77 ± 0.233
1.925AsnAsn: 1.925 ± 0.374
2.387AsnPro: 2.387 ± 0.451
2.233AsnGln: 2.233 ± 0.504
2.464AsnArg: 2.464 ± 0.605
2.387AsnSer: 2.387 ± 0.425
3.157AsnThr: 3.157 ± 0.587
2.002AsnVal: 2.002 ± 0.382
0.539AsnTrp: 0.539 ± 0.174
1.309AsnTyr: 1.309 ± 0.364
0.0AsnXaa: 0.0 ± 0.0
Pro
4.312ProAla: 4.312 ± 0.69
0.539ProCys: 0.539 ± 0.196
3.311ProAsp: 3.311 ± 0.687
4.312ProGlu: 4.312 ± 0.924
1.001ProPhe: 1.001 ± 0.362
3.234ProGly: 3.234 ± 0.467
1.001ProHis: 1.001 ± 0.246
0.924ProIle: 0.924 ± 0.281
1.925ProLys: 1.925 ± 0.345
2.695ProLeu: 2.695 ± 0.569
0.924ProMet: 0.924 ± 0.369
1.771ProAsn: 1.771 ± 0.427
1.771ProPro: 1.771 ± 0.491
1.001ProGln: 1.001 ± 0.256
1.694ProArg: 1.694 ± 0.384
3.157ProSer: 3.157 ± 0.594
1.694ProThr: 1.694 ± 0.433
3.85ProVal: 3.85 ± 0.603
0.847ProTrp: 0.847 ± 0.243
0.693ProTyr: 0.693 ± 0.201
0.0ProXaa: 0.0 ± 0.0
Gln
4.466GlnAla: 4.466 ± 1.029
0.539GlnCys: 0.539 ± 0.215
1.771GlnAsp: 1.771 ± 0.317
2.849GlnGlu: 2.849 ± 0.525
1.617GlnPhe: 1.617 ± 0.305
2.002GlnGly: 2.002 ± 0.303
0.847GlnHis: 0.847 ± 0.28
1.925GlnIle: 1.925 ± 0.505
2.695GlnLys: 2.695 ± 0.565
3.927GlnLeu: 3.927 ± 0.932
1.54GlnMet: 1.54 ± 0.499
2.31GlnAsn: 2.31 ± 0.585
1.694GlnPro: 1.694 ± 0.438
3.311GlnGln: 3.311 ± 1.146
2.772GlnArg: 2.772 ± 0.725
2.849GlnSer: 2.849 ± 0.403
2.079GlnThr: 2.079 ± 0.463
3.311GlnVal: 3.311 ± 0.622
0.693GlnTrp: 0.693 ± 0.319
1.386GlnTyr: 1.386 ± 0.281
0.0GlnXaa: 0.0 ± 0.0
Arg
5.159ArgAla: 5.159 ± 0.542
0.924ArgCys: 0.924 ± 0.338
3.465ArgAsp: 3.465 ± 0.56
5.313ArgGlu: 5.313 ± 0.702
1.386ArgPhe: 1.386 ± 0.285
3.542ArgGly: 3.542 ± 0.524
1.617ArgHis: 1.617 ± 0.31
3.773ArgIle: 3.773 ± 0.546
3.157ArgLys: 3.157 ± 0.534
5.159ArgLeu: 5.159 ± 0.771
2.31ArgMet: 2.31 ± 0.489
2.464ArgAsn: 2.464 ± 0.446
2.618ArgPro: 2.618 ± 0.65
4.081ArgGln: 4.081 ± 0.931
4.774ArgArg: 4.774 ± 0.789
2.387ArgSer: 2.387 ± 0.364
3.234ArgThr: 3.234 ± 0.709
3.773ArgVal: 3.773 ± 0.756
1.463ArgTrp: 1.463 ± 0.332
2.541ArgTyr: 2.541 ± 0.446
0.0ArgXaa: 0.0 ± 0.0
Ser
9.701SerAla: 9.701 ± 2.92
0.385SerCys: 0.385 ± 0.158
4.774SerAsp: 4.774 ± 0.538
5.544SerGlu: 5.544 ± 0.895
1.463SerPhe: 1.463 ± 0.227
7.699SerGly: 7.699 ± 0.892
0.77SerHis: 0.77 ± 0.349
2.926SerIle: 2.926 ± 0.492
3.465SerLys: 3.465 ± 0.623
4.851SerLeu: 4.851 ± 0.681
1.463SerMet: 1.463 ± 0.3
1.54SerAsn: 1.54 ± 0.292
2.849SerPro: 2.849 ± 0.434
2.464SerGln: 2.464 ± 0.334
3.927SerArg: 3.927 ± 0.586
5.39SerSer: 5.39 ± 1.528
4.389SerThr: 4.389 ± 1.02
5.159SerVal: 5.159 ± 0.714
0.616SerTrp: 0.616 ± 0.216
1.386SerTyr: 1.386 ± 0.24
0.0SerXaa: 0.0 ± 0.0
Thr
7.545ThrAla: 7.545 ± 1.188
0.385ThrCys: 0.385 ± 0.151
4.543ThrAsp: 4.543 ± 0.648
5.159ThrGlu: 5.159 ± 0.737
2.387ThrPhe: 2.387 ± 0.4
4.466ThrGly: 4.466 ± 0.612
1.232ThrHis: 1.232 ± 0.337
3.311ThrIle: 3.311 ± 0.454
3.003ThrLys: 3.003 ± 0.556
4.235ThrLeu: 4.235 ± 0.741
1.232ThrMet: 1.232 ± 0.354
2.233ThrAsn: 2.233 ± 0.619
2.772ThrPro: 2.772 ± 0.864
2.31ThrGln: 2.31 ± 0.475
3.234ThrArg: 3.234 ± 0.591
4.851ThrSer: 4.851 ± 1.096
3.85ThrThr: 3.85 ± 0.862
4.928ThrVal: 4.928 ± 0.898
0.924ThrTrp: 0.924 ± 0.283
2.002ThrTyr: 2.002 ± 0.38
0.0ThrXaa: 0.0 ± 0.0
Val
6.006ValAla: 6.006 ± 0.897
0.616ValCys: 0.616 ± 0.211
3.08ValAsp: 3.08 ± 0.611
3.773ValGlu: 3.773 ± 0.651
3.619ValPhe: 3.619 ± 0.583
3.157ValGly: 3.157 ± 0.563
1.001ValHis: 1.001 ± 0.308
4.389ValIle: 4.389 ± 0.602
4.466ValLys: 4.466 ± 0.615
5.467ValLeu: 5.467 ± 0.889
1.309ValMet: 1.309 ± 0.384
2.695ValAsn: 2.695 ± 0.45
2.695ValPro: 2.695 ± 0.556
2.695ValGln: 2.695 ± 0.511
3.773ValArg: 3.773 ± 0.678
5.082ValSer: 5.082 ± 0.668
5.082ValThr: 5.082 ± 0.811
5.082ValVal: 5.082 ± 0.822
0.77ValTrp: 0.77 ± 0.274
2.31ValTyr: 2.31 ± 0.482
0.0ValXaa: 0.0 ± 0.0
Trp
1.232TrpAla: 1.232 ± 0.293
0.231TrpCys: 0.231 ± 0.123
1.001TrpAsp: 1.001 ± 0.342
0.693TrpGlu: 0.693 ± 0.258
0.847TrpPhe: 0.847 ± 0.27
1.078TrpGly: 1.078 ± 0.307
0.385TrpHis: 0.385 ± 0.166
0.924TrpIle: 0.924 ± 0.249
1.078TrpLys: 1.078 ± 0.328
2.233TrpLeu: 2.233 ± 0.573
0.539TrpMet: 0.539 ± 0.199
0.462TrpAsn: 0.462 ± 0.196
0.77TrpPro: 0.77 ± 0.331
0.77TrpGln: 0.77 ± 0.233
0.924TrpArg: 0.924 ± 0.311
1.078TrpSer: 1.078 ± 0.29
0.847TrpThr: 0.847 ± 0.201
1.001TrpVal: 1.001 ± 0.303
0.462TrpTrp: 0.462 ± 0.192
0.385TrpTyr: 0.385 ± 0.148
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.31TyrAla: 2.31 ± 0.575
0.308TyrCys: 0.308 ± 0.183
2.002TyrAsp: 2.002 ± 0.384
1.925TyrGlu: 1.925 ± 0.418
1.001TyrPhe: 1.001 ± 0.295
2.387TyrGly: 2.387 ± 0.39
0.462TyrHis: 0.462 ± 0.173
1.617TyrIle: 1.617 ± 0.403
1.001TyrLys: 1.001 ± 0.314
2.541TyrLeu: 2.541 ± 0.409
0.385TyrMet: 0.385 ± 0.158
1.617TyrAsn: 1.617 ± 0.265
1.54TyrPro: 1.54 ± 0.355
1.771TyrGln: 1.771 ± 0.347
1.771TyrArg: 1.771 ± 0.388
1.694TyrSer: 1.694 ± 0.376
1.386TyrThr: 1.386 ± 0.375
0.847TyrVal: 0.847 ± 0.256
0.77TyrTrp: 0.77 ± 0.203
0.385TyrTyr: 0.385 ± 0.174
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (12989 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski