Amino acid dipepetide frequency for Escherichia phage vB_EcoS Sa179lw

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.176AlaAla: 8.176 ± 1.479
0.839AlaCys: 0.839 ± 0.254
4.892AlaAsp: 4.892 ± 0.653
6.01AlaGlu: 6.01 ± 0.724
2.586AlaPhe: 2.586 ± 0.39
7.477AlaGly: 7.477 ± 0.74
1.258AlaHis: 1.258 ± 0.316
6.289AlaIle: 6.289 ± 0.598
5.381AlaLys: 5.381 ± 0.784
6.988AlaLeu: 6.988 ± 0.71
3.564AlaMet: 3.564 ± 0.568
3.774AlaAsn: 3.774 ± 0.702
2.096AlaPro: 2.096 ± 0.335
3.424AlaGln: 3.424 ± 0.792
4.193AlaArg: 4.193 ± 0.541
5.87AlaSer: 5.87 ± 0.963
5.521AlaThr: 5.521 ± 0.925
6.01AlaVal: 6.01 ± 0.652
1.118AlaTrp: 1.118 ± 0.308
2.306AlaTyr: 2.306 ± 0.389
0.0AlaXaa: 0.0 ± 0.0
Cys
1.048CysAla: 1.048 ± 0.265
0.349CysCys: 0.349 ± 0.172
1.118CysAsp: 1.118 ± 0.323
0.839CysGlu: 0.839 ± 0.221
0.419CysPhe: 0.419 ± 0.154
1.537CysGly: 1.537 ± 0.45
0.349CysHis: 0.349 ± 0.177
0.699CysIle: 0.699 ± 0.261
1.118CysLys: 1.118 ± 0.236
0.978CysLeu: 0.978 ± 0.246
0.419CysMet: 0.419 ± 0.18
0.629CysAsn: 0.629 ± 0.236
0.559CysPro: 0.559 ± 0.182
0.14CysGln: 0.14 ± 0.092
0.559CysArg: 0.559 ± 0.176
0.769CysSer: 0.769 ± 0.217
0.769CysThr: 0.769 ± 0.259
0.839CysVal: 0.839 ± 0.238
0.21CysTrp: 0.21 ± 0.11
0.489CysTyr: 0.489 ± 0.243
0.0CysXaa: 0.0 ± 0.0
Asp
5.8AspAla: 5.8 ± 0.738
0.489AspCys: 0.489 ± 0.235
5.66AspAsp: 5.66 ± 0.902
5.521AspGlu: 5.521 ± 0.51
2.516AspPhe: 2.516 ± 0.362
6.569AspGly: 6.569 ± 0.792
0.839AspHis: 0.839 ± 0.25
3.564AspIle: 3.564 ± 0.541
3.564AspLys: 3.564 ± 0.642
3.215AspLeu: 3.215 ± 0.489
2.516AspMet: 2.516 ± 0.537
2.027AspAsn: 2.027 ± 0.405
1.817AspPro: 1.817 ± 0.276
1.188AspGln: 1.188 ± 0.289
2.655AspArg: 2.655 ± 0.434
2.935AspSer: 2.935 ± 0.418
2.795AspThr: 2.795 ± 0.519
4.472AspVal: 4.472 ± 0.572
0.769AspTrp: 0.769 ± 0.214
3.284AspTyr: 3.284 ± 0.564
0.0AspXaa: 0.0 ± 0.0
Glu
5.521GluAla: 5.521 ± 0.631
1.398GluCys: 1.398 ± 0.331
3.913GluAsp: 3.913 ± 0.646
4.472GluGlu: 4.472 ± 0.563
3.005GluPhe: 3.005 ± 0.472
3.075GluGly: 3.075 ± 0.435
1.048GluHis: 1.048 ± 0.259
5.451GluIle: 5.451 ± 0.483
4.263GluLys: 4.263 ± 0.688
5.73GluLeu: 5.73 ± 0.591
3.354GluMet: 3.354 ± 0.472
3.075GluAsn: 3.075 ± 0.445
2.027GluPro: 2.027 ± 0.428
3.494GluGln: 3.494 ± 0.568
3.494GluArg: 3.494 ± 0.556
3.494GluSer: 3.494 ± 0.427
2.935GluThr: 2.935 ± 0.362
4.193GluVal: 4.193 ± 0.537
1.258GluTrp: 1.258 ± 0.314
2.725GluTyr: 2.725 ± 0.352
0.0GluXaa: 0.0 ± 0.0
Phe
2.166PheAla: 2.166 ± 0.412
1.048PheCys: 1.048 ± 0.308
3.005PheAsp: 3.005 ± 0.384
2.446PheGlu: 2.446 ± 0.384
1.118PhePhe: 1.118 ± 0.241
3.215PheGly: 3.215 ± 0.415
0.419PheHis: 0.419 ± 0.156
3.145PheIle: 3.145 ± 0.433
1.398PheLys: 1.398 ± 0.321
1.957PheLeu: 1.957 ± 0.464
1.118PheMet: 1.118 ± 0.284
2.516PheAsn: 2.516 ± 0.443
1.118PhePro: 1.118 ± 0.327
0.839PheGln: 0.839 ± 0.226
1.817PheArg: 1.817 ± 0.351
2.446PheSer: 2.446 ± 0.376
1.817PheThr: 1.817 ± 0.366
2.516PheVal: 2.516 ± 0.414
0.629PheTrp: 0.629 ± 0.212
1.328PheTyr: 1.328 ± 0.374
0.0PheXaa: 0.0 ± 0.0
Gly
6.15GlyAla: 6.15 ± 0.83
0.839GlyCys: 0.839 ± 0.211
3.843GlyAsp: 3.843 ± 0.456
4.752GlyGlu: 4.752 ± 0.701
2.655GlyPhe: 2.655 ± 0.462
4.752GlyGly: 4.752 ± 0.586
0.489GlyHis: 0.489 ± 0.191
4.333GlyIle: 4.333 ± 0.475
5.031GlyLys: 5.031 ± 0.608
3.354GlyLeu: 3.354 ± 0.452
1.468GlyMet: 1.468 ± 0.336
3.145GlyAsn: 3.145 ± 0.506
0.769GlyPro: 0.769 ± 0.276
2.516GlyGln: 2.516 ± 0.444
3.564GlyArg: 3.564 ± 0.49
4.682GlySer: 4.682 ± 0.547
5.031GlyThr: 5.031 ± 0.811
5.031GlyVal: 5.031 ± 0.616
1.188GlyTrp: 1.188 ± 0.345
4.053GlyTyr: 4.053 ± 0.533
0.0GlyXaa: 0.0 ± 0.0
His
1.188HisAla: 1.188 ± 0.276
0.419HisCys: 0.419 ± 0.14
1.048HisAsp: 1.048 ± 0.299
0.839HisGlu: 0.839 ± 0.233
0.699HisPhe: 0.699 ± 0.205
1.048HisGly: 1.048 ± 0.237
0.699HisHis: 0.699 ± 0.242
0.559HisIle: 0.559 ± 0.185
1.048HisLys: 1.048 ± 0.291
1.188HisLeu: 1.188 ± 0.291
0.559HisMet: 0.559 ± 0.213
0.699HisAsn: 0.699 ± 0.222
1.118HisPro: 1.118 ± 0.275
0.349HisGln: 0.349 ± 0.174
0.629HisArg: 0.629 ± 0.218
0.978HisSer: 0.978 ± 0.258
0.699HisThr: 0.699 ± 0.25
1.398HisVal: 1.398 ± 0.373
0.28HisTrp: 0.28 ± 0.145
0.629HisTyr: 0.629 ± 0.201
0.0HisXaa: 0.0 ± 0.0
Ile
5.241IleAla: 5.241 ± 0.613
0.769IleCys: 0.769 ± 0.27
5.521IleAsp: 5.521 ± 0.619
4.263IleGlu: 4.263 ± 0.693
2.236IlePhe: 2.236 ± 0.38
4.263IleGly: 4.263 ± 0.571
1.258IleHis: 1.258 ± 0.254
5.171IleIle: 5.171 ± 0.664
4.752IleLys: 4.752 ± 0.609
3.494IleLeu: 3.494 ± 0.498
2.096IleMet: 2.096 ± 0.454
3.424IleAsn: 3.424 ± 0.405
1.887IlePro: 1.887 ± 0.379
1.817IleGln: 1.817 ± 0.407
3.564IleArg: 3.564 ± 0.605
5.171IleSer: 5.171 ± 0.723
4.682IleThr: 4.682 ± 0.537
4.752IleVal: 4.752 ± 0.52
0.419IleTrp: 0.419 ± 0.151
1.957IleTyr: 1.957 ± 0.46
0.0IleXaa: 0.0 ± 0.0
Lys
5.87LysAla: 5.87 ± 0.789
1.048LysCys: 1.048 ± 0.253
2.725LysAsp: 2.725 ± 0.423
3.284LysGlu: 3.284 ± 0.655
2.446LysPhe: 2.446 ± 0.42
3.145LysGly: 3.145 ± 0.596
1.747LysHis: 1.747 ± 0.37
3.354LysIle: 3.354 ± 0.419
3.424LysLys: 3.424 ± 0.634
4.892LysLeu: 4.892 ± 0.777
2.516LysMet: 2.516 ± 0.514
3.354LysAsn: 3.354 ± 0.433
2.865LysPro: 2.865 ± 0.443
3.215LysGln: 3.215 ± 0.587
3.494LysArg: 3.494 ± 0.562
4.752LysSer: 4.752 ± 0.615
2.795LysThr: 2.795 ± 0.468
3.983LysVal: 3.983 ± 0.522
1.328LysTrp: 1.328 ± 0.279
2.446LysTyr: 2.446 ± 0.455
0.0LysXaa: 0.0 ± 0.0
Leu
6.639LeuAla: 6.639 ± 0.814
1.188LeuCys: 1.188 ± 0.303
3.494LeuAsp: 3.494 ± 0.472
3.145LeuGlu: 3.145 ± 0.44
2.306LeuPhe: 2.306 ± 0.361
3.774LeuGly: 3.774 ± 0.503
0.978LeuHis: 0.978 ± 0.23
4.123LeuIle: 4.123 ± 0.59
4.682LeuLys: 4.682 ± 0.639
4.542LeuLeu: 4.542 ± 0.582
2.027LeuMet: 2.027 ± 0.363
3.494LeuAsn: 3.494 ± 0.512
2.865LeuPro: 2.865 ± 0.455
3.145LeuGln: 3.145 ± 0.523
4.123LeuArg: 4.123 ± 0.408
5.521LeuSer: 5.521 ± 0.793
4.123LeuThr: 4.123 ± 0.484
4.263LeuVal: 4.263 ± 0.662
1.537LeuTrp: 1.537 ± 0.3
2.376LeuTyr: 2.376 ± 0.503
0.0LeuXaa: 0.0 ± 0.0
Met
3.564MetAla: 3.564 ± 0.495
0.419MetCys: 0.419 ± 0.177
2.096MetAsp: 2.096 ± 0.423
1.887MetGlu: 1.887 ± 0.321
1.537MetPhe: 1.537 ± 0.406
1.188MetGly: 1.188 ± 0.325
0.629MetHis: 0.629 ± 0.205
2.586MetIle: 2.586 ± 0.376
2.516MetLys: 2.516 ± 0.488
1.957MetLeu: 1.957 ± 0.337
1.537MetMet: 1.537 ± 0.273
1.677MetAsn: 1.677 ± 0.359
1.817MetPro: 1.817 ± 0.398
1.398MetGln: 1.398 ± 0.341
1.607MetArg: 1.607 ± 0.238
1.957MetSer: 1.957 ± 0.307
2.027MetThr: 2.027 ± 0.399
1.468MetVal: 1.468 ± 0.355
0.28MetTrp: 0.28 ± 0.141
1.328MetTyr: 1.328 ± 0.265
0.0MetXaa: 0.0 ± 0.0
Asn
4.403AsnAla: 4.403 ± 0.56
0.769AsnCys: 0.769 ± 0.215
3.284AsnAsp: 3.284 ± 0.451
2.795AsnGlu: 2.795 ± 0.373
0.839AsnPhe: 0.839 ± 0.297
4.542AsnGly: 4.542 ± 0.501
1.048AsnHis: 1.048 ± 0.333
2.795AsnIle: 2.795 ± 0.409
3.424AsnLys: 3.424 ± 0.533
3.215AsnLeu: 3.215 ± 0.477
1.118AsnMet: 1.118 ± 0.251
2.516AsnAsn: 2.516 ± 0.485
2.096AsnPro: 2.096 ± 0.328
1.677AsnGln: 1.677 ± 0.389
2.655AsnArg: 2.655 ± 0.36
2.586AsnSer: 2.586 ± 0.388
2.376AsnThr: 2.376 ± 0.64
3.354AsnVal: 3.354 ± 0.579
0.629AsnTrp: 0.629 ± 0.24
2.306AsnTyr: 2.306 ± 0.395
0.0AsnXaa: 0.0 ± 0.0
Pro
3.075ProAla: 3.075 ± 0.436
0.14ProCys: 0.14 ± 0.11
2.586ProAsp: 2.586 ± 0.51
2.935ProGlu: 2.935 ± 0.396
1.258ProPhe: 1.258 ± 0.253
1.677ProGly: 1.677 ± 0.378
0.699ProHis: 0.699 ± 0.219
2.027ProIle: 2.027 ± 0.308
1.677ProLys: 1.677 ± 0.381
2.516ProLeu: 2.516 ± 0.439
1.398ProMet: 1.398 ± 0.3
1.468ProAsn: 1.468 ± 0.214
1.258ProPro: 1.258 ± 0.3
1.118ProGln: 1.118 ± 0.332
1.188ProArg: 1.188 ± 0.333
2.096ProSer: 2.096 ± 0.405
1.887ProThr: 1.887 ± 0.405
3.284ProVal: 3.284 ± 0.456
0.28ProTrp: 0.28 ± 0.185
1.258ProTyr: 1.258 ± 0.299
0.0ProXaa: 0.0 ± 0.0
Gln
3.494GlnAla: 3.494 ± 0.725
0.349GlnCys: 0.349 ± 0.149
0.978GlnAsp: 0.978 ± 0.315
2.655GlnGlu: 2.655 ± 0.409
1.398GlnPhe: 1.398 ± 0.407
1.328GlnGly: 1.328 ± 0.423
0.769GlnHis: 0.769 ± 0.257
2.935GlnIle: 2.935 ± 0.392
2.376GlnLys: 2.376 ± 0.465
3.564GlnLeu: 3.564 ± 0.645
1.048GlnMet: 1.048 ± 0.27
1.188GlnAsn: 1.188 ± 0.235
1.118GlnPro: 1.118 ± 0.269
2.655GlnGln: 2.655 ± 1.059
2.376GlnArg: 2.376 ± 0.42
3.424GlnSer: 3.424 ± 0.568
2.236GlnThr: 2.236 ± 0.456
2.096GlnVal: 2.096 ± 0.491
0.699GlnTrp: 0.699 ± 0.184
1.258GlnTyr: 1.258 ± 0.306
0.0GlnXaa: 0.0 ± 0.0
Arg
4.053ArgAla: 4.053 ± 0.438
0.978ArgCys: 0.978 ± 0.266
2.935ArgAsp: 2.935 ± 0.423
4.403ArgGlu: 4.403 ± 0.668
1.817ArgPhe: 1.817 ± 0.448
2.446ArgGly: 2.446 ± 0.475
0.699ArgHis: 0.699 ± 0.191
3.284ArgIle: 3.284 ± 0.464
3.145ArgLys: 3.145 ± 0.487
4.403ArgLeu: 4.403 ± 0.671
1.537ArgMet: 1.537 ± 0.328
3.704ArgAsn: 3.704 ± 0.436
1.258ArgPro: 1.258 ± 0.318
2.027ArgGln: 2.027 ± 0.431
3.145ArgArg: 3.145 ± 0.451
2.166ArgSer: 2.166 ± 0.331
1.677ArgThr: 1.677 ± 0.498
3.354ArgVal: 3.354 ± 0.549
0.419ArgTrp: 0.419 ± 0.173
2.725ArgTyr: 2.725 ± 0.447
0.0ArgXaa: 0.0 ± 0.0
Ser
6.709SerAla: 6.709 ± 0.963
1.048SerCys: 1.048 ± 0.31
4.752SerAsp: 4.752 ± 0.561
4.542SerGlu: 4.542 ± 0.51
3.005SerPhe: 3.005 ± 0.415
5.031SerGly: 5.031 ± 0.56
0.839SerHis: 0.839 ± 0.214
4.193SerIle: 4.193 ± 0.761
3.913SerLys: 3.913 ± 0.51
5.031SerLeu: 5.031 ± 0.607
1.887SerMet: 1.887 ± 0.364
2.935SerAsn: 2.935 ± 0.534
2.096SerPro: 2.096 ± 0.37
2.725SerGln: 2.725 ± 0.491
2.516SerArg: 2.516 ± 0.419
4.542SerSer: 4.542 ± 0.724
3.913SerThr: 3.913 ± 0.718
4.403SerVal: 4.403 ± 0.711
0.978SerTrp: 0.978 ± 0.225
1.957SerTyr: 1.957 ± 0.329
0.0SerXaa: 0.0 ± 0.0
Thr
5.101ThrAla: 5.101 ± 0.709
0.419ThrCys: 0.419 ± 0.164
3.354ThrAsp: 3.354 ± 0.452
4.472ThrGlu: 4.472 ± 0.494
1.817ThrPhe: 1.817 ± 0.496
5.521ThrGly: 5.521 ± 0.649
0.489ThrHis: 0.489 ± 0.167
3.634ThrIle: 3.634 ± 0.704
3.284ThrLys: 3.284 ± 0.499
3.564ThrLeu: 3.564 ± 0.593
1.607ThrMet: 1.607 ± 0.326
2.655ThrAsn: 2.655 ± 0.578
3.354ThrPro: 3.354 ± 0.433
1.048ThrGln: 1.048 ± 0.293
2.306ThrArg: 2.306 ± 0.369
4.822ThrSer: 4.822 ± 0.709
4.333ThrThr: 4.333 ± 0.761
3.354ThrVal: 3.354 ± 0.494
0.419ThrTrp: 0.419 ± 0.263
2.096ThrTyr: 2.096 ± 0.355
0.0ThrXaa: 0.0 ± 0.0
Val
5.94ValAla: 5.94 ± 0.805
0.489ValCys: 0.489 ± 0.176
3.005ValAsp: 3.005 ± 0.483
4.892ValGlu: 4.892 ± 0.6
2.446ValPhe: 2.446 ± 0.475
4.193ValGly: 4.193 ± 0.507
0.769ValHis: 0.769 ± 0.232
5.031ValIle: 5.031 ± 0.583
4.612ValLys: 4.612 ± 0.593
3.634ValLeu: 3.634 ± 0.503
2.586ValMet: 2.586 ± 0.395
3.634ValAsn: 3.634 ± 0.577
1.747ValPro: 1.747 ± 0.368
2.516ValGln: 2.516 ± 0.399
3.494ValArg: 3.494 ± 0.604
5.451ValSer: 5.451 ± 0.858
4.682ValThr: 4.682 ± 0.663
5.521ValVal: 5.521 ± 0.8
1.188ValTrp: 1.188 ± 0.276
2.516ValTyr: 2.516 ± 0.358
0.0ValXaa: 0.0 ± 0.0
Trp
0.839TrpAla: 0.839 ± 0.228
0.07TrpCys: 0.07 ± 0.068
1.048TrpAsp: 1.048 ± 0.222
0.839TrpGlu: 0.839 ± 0.292
0.699TrpPhe: 0.699 ± 0.246
0.349TrpGly: 0.349 ± 0.156
0.629TrpHis: 0.629 ± 0.221
1.118TrpIle: 1.118 ± 0.314
0.908TrpLys: 0.908 ± 0.275
1.537TrpLeu: 1.537 ± 0.428
0.349TrpMet: 0.349 ± 0.225
0.629TrpAsn: 0.629 ± 0.222
0.419TrpPro: 0.419 ± 0.196
0.978TrpGln: 0.978 ± 0.274
1.188TrpArg: 1.188 ± 0.304
0.769TrpSer: 0.769 ± 0.214
0.699TrpThr: 0.699 ± 0.196
1.328TrpVal: 1.328 ± 0.284
0.14TrpTrp: 0.14 ± 0.107
0.21TrpTyr: 0.21 ± 0.121
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.795TyrAla: 2.795 ± 0.303
0.769TyrCys: 0.769 ± 0.2
2.935TyrAsp: 2.935 ± 0.428
2.935TyrGlu: 2.935 ± 0.433
1.258TyrPhe: 1.258 ± 0.371
2.516TyrGly: 2.516 ± 0.492
0.419TyrHis: 0.419 ± 0.169
2.446TyrIle: 2.446 ± 0.468
2.236TyrLys: 2.236 ± 0.411
2.586TyrLeu: 2.586 ± 0.501
0.699TyrMet: 0.699 ± 0.228
1.887TyrAsn: 1.887 ± 0.338
1.537TyrPro: 1.537 ± 0.384
1.677TyrGln: 1.677 ± 0.331
1.607TyrArg: 1.607 ± 0.247
2.586TyrSer: 2.586 ± 0.458
2.655TyrThr: 2.655 ± 0.441
2.725TyrVal: 2.725 ± 0.432
0.978TyrTrp: 0.978 ± 0.326
1.887TyrTyr: 1.887 ± 0.502
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (14311 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski