Amino acid dipeptide frequency for Escherichia phage vB_EcoS-12469I

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
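As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of protein sequences, reports each frequency in per mille, and labels it relative to the uniform expectation of 1/400 (2.5‰). The function names, the toy sequences, and the choice to normalise by the total number of dipeptides are assumptions made for illustration; the exact procedure used by the database is not described on this page.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Non-standard residues are mapped to 'X' (Xaa). Dipeptides never span
    two proteins. Normalising by the total dipeptide count is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def label(freq_per_mille, expected=1000.0 / 400):
    """Classify a frequency against the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"   # shown in red in the table
    if freq_per_mille < expected:
        return "under"  # shown in blue in the table
    return "expected"


# Toy input, not real phage proteins.
toy = ["MAAK", "MAAL"]
freqs = dipeptide_frequencies(toy)
```

With the values from the table, `label(8.647)` (AlaAla) gives "over" and `label(0.695)` (AlaCys) gives "under".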

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
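The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the AlaAla value is taken from the table below.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


ala_ala = 8.647  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)   # about 0.008647
percent = per_mille_to_percent(ala_ala)     # about 0.8647
uniform_per_mille = 1000.0 / 400            # 0.25% expressed in per mille
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, which is the value the table entries can be compared against.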

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.647 ± 1.326
AlaCys: 0.695 ± 0.254
AlaAsp: 4.246 ± 0.596
AlaGlu: 5.713 ± 0.742
AlaPhe: 3.629 ± 0.509
AlaGly: 5.867 ± 0.775
AlaHis: 1.004 ± 0.328
AlaIle: 6.639 ± 0.621
AlaLys: 6.253 ± 1.19
AlaLeu: 7.797 ± 1.159
AlaMet: 1.93 ± 0.481
AlaAsn: 4.401 ± 0.682
AlaPro: 2.239 ± 0.479
AlaGln: 3.474 ± 0.662
AlaArg: 5.018 ± 0.625
AlaSer: 5.945 ± 0.808
AlaThr: 4.401 ± 0.594
AlaVal: 4.864 ± 0.654
AlaTrp: 0.926 ± 0.28
AlaTyr: 2.084 ± 0.394
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.54 ± 0.264
CysCys: 0.154 ± 0.114
CysAsp: 0.849 ± 0.288
CysGlu: 0.926 ± 0.288
CysPhe: 0.309 ± 0.169
CysGly: 1.39 ± 0.358
CysHis: 0.232 ± 0.139
CysIle: 0.309 ± 0.18
CysLys: 1.235 ± 0.318
CysLeu: 1.004 ± 0.299
CysMet: 0.309 ± 0.144
CysAsn: 0.386 ± 0.208
CysPro: 0.386 ± 0.167
CysGln: 0.077 ± 0.074
CysArg: 0.772 ± 0.263
CysSer: 1.081 ± 0.333
CysThr: 1.004 ± 0.285
CysVal: 0.849 ± 0.295
CysTrp: 0.309 ± 0.169
CysTyr: 0.154 ± 0.11
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.864 ± 0.758
AspCys: 0.54 ± 0.196
AspAsp: 3.011 ± 0.56
AspGlu: 4.787 ± 0.737
AspPhe: 2.007 ± 0.346
AspGly: 6.794 ± 0.824
AspHis: 0.849 ± 0.388
AspIle: 4.169 ± 0.491
AspLys: 4.632 ± 0.449
AspLeu: 4.323 ± 0.57
AspMet: 1.544 ± 0.346
AspAsn: 2.779 ± 0.458
AspPro: 1.698 ± 0.288
AspGln: 1.312 ± 0.3
AspArg: 2.084 ± 0.468
AspSer: 2.779 ± 0.545
AspThr: 3.551 ± 0.537
AspVal: 3.783 ± 0.473
AspTrp: 1.235 ± 0.311
AspTyr: 3.242 ± 0.424
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.79 ± 0.624
GluCys: 0.849 ± 0.338
GluAsp: 2.934 ± 0.408
GluGlu: 4.092 ± 0.64
GluPhe: 3.242 ± 0.55
GluGly: 3.937 ± 0.563
GluHis: 0.386 ± 0.19
GluIle: 5.713 ± 0.571
GluLys: 3.474 ± 0.662
GluLeu: 6.099 ± 0.812
GluMet: 2.934 ± 0.619
GluAsn: 3.242 ± 0.46
GluPro: 1.93 ± 0.431
GluGln: 2.934 ± 0.72
GluArg: 2.934 ± 0.469
GluSer: 4.787 ± 0.613
GluThr: 3.397 ± 0.492
GluVal: 5.018 ± 0.689
GluTrp: 0.463 ± 0.189
GluTyr: 2.47 ± 0.397
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.239 ± 0.447
PheCys: 0.695 ± 0.185
PheAsp: 3.32 ± 0.469
PheGlu: 2.548 ± 0.538
PhePhe: 0.849 ± 0.26
PheGly: 3.474 ± 0.581
PheHis: 0.772 ± 0.272
PheIle: 2.393 ± 0.393
PheLys: 2.625 ± 0.509
PheLeu: 1.93 ± 0.452
PheMet: 0.926 ± 0.244
PheAsn: 2.316 ± 0.379
PhePro: 1.081 ± 0.26
PheGln: 2.007 ± 0.408
PheArg: 1.776 ± 0.487
PheSer: 2.548 ± 0.48
PheThr: 2.47 ± 0.348
PheVal: 2.47 ± 0.454
PheTrp: 0.386 ± 0.179
PheTyr: 0.695 ± 0.192
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.709 ± 0.649
GlyCys: 1.467 ± 0.35
GlyAsp: 4.015 ± 0.474
GlyGlu: 4.787 ± 0.719
GlyPhe: 2.47 ± 0.505
GlyGly: 5.636 ± 1.019
GlyHis: 0.849 ± 0.346
GlyIle: 6.099 ± 0.568
GlyLys: 5.713 ± 0.879
GlyLeu: 7.103 ± 0.738
GlyMet: 2.239 ± 0.451
GlyAsn: 3.165 ± 0.473
GlyPro: 0.772 ± 0.219
GlyGln: 2.316 ± 0.411
GlyArg: 2.934 ± 0.356
GlySer: 5.559 ± 0.796
GlyThr: 4.401 ± 0.81
GlyVal: 6.331 ± 0.63
GlyTrp: 1.004 ± 0.326
GlyTyr: 4.015 ± 0.699
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.849 ± 0.311
HisCys: 0.309 ± 0.149
HisAsp: 0.772 ± 0.269
HisGlu: 0.772 ± 0.236
HisPhe: 0.695 ± 0.232
HisGly: 0.772 ± 0.312
HisHis: 0.386 ± 0.202
HisIle: 1.312 ± 0.386
HisLys: 1.004 ± 0.301
HisLeu: 1.081 ± 0.292
HisMet: 0.309 ± 0.175
HisAsn: 0.386 ± 0.157
HisPro: 0.232 ± 0.199
HisGln: 0.386 ± 0.161
HisArg: 0.772 ± 0.255
HisSer: 0.54 ± 0.245
HisThr: 0.772 ± 0.237
HisVal: 0.695 ± 0.235
HisTrp: 0.077 ± 0.08
HisTyr: 0.463 ± 0.202
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.253 ± 1.037
IleCys: 0.463 ± 0.202
IleAsp: 5.404 ± 0.661
IleGlu: 3.706 ± 0.41
IlePhe: 2.007 ± 0.362
IleGly: 4.478 ± 0.664
IleHis: 1.081 ± 0.32
IleIle: 3.629 ± 0.547
IleLys: 5.095 ± 0.66
IleLeu: 3.474 ± 0.56
IleMet: 1.853 ± 0.484
IleAsn: 4.709 ± 0.645
IlePro: 3.011 ± 0.485
IleGln: 2.548 ± 0.43
IleArg: 3.937 ± 0.464
IleSer: 5.867 ± 0.677
IleThr: 5.25 ± 0.552
IleVal: 3.474 ± 0.53
IleTrp: 0.849 ± 0.22
IleTyr: 3.165 ± 0.711
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.717 ± 1.003
LysCys: 0.54 ± 0.214
LysAsp: 4.323 ± 0.584
LysGlu: 5.404 ± 0.793
LysPhe: 2.548 ± 0.41
LysGly: 2.934 ± 0.522
LysHis: 0.695 ± 0.238
LysIle: 3.783 ± 0.448
LysLys: 3.397 ± 0.519
LysLeu: 4.787 ± 0.758
LysMet: 2.47 ± 0.599
LysAsn: 2.934 ± 0.583
LysPro: 2.084 ± 0.4
LysGln: 2.856 ± 0.466
LysArg: 2.779 ± 0.537
LysSer: 4.555 ± 0.656
LysThr: 3.165 ± 0.463
LysVal: 4.787 ± 0.678
LysTrp: 0.772 ± 0.3
LysTyr: 3.629 ± 0.571
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.794 ± 0.871
LeuCys: 0.849 ± 0.222
LeuAsp: 4.787 ± 0.515
LeuGlu: 4.246 ± 0.732
LeuPhe: 1.621 ± 0.315
LeuGly: 5.018 ± 0.865
LeuHis: 1.081 ± 0.354
LeuIle: 4.864 ± 0.673
LeuLys: 3.937 ± 0.47
LeuLeu: 4.246 ± 0.498
LeuMet: 1.39 ± 0.309
LeuAsn: 3.474 ± 0.475
LeuPro: 2.702 ± 0.428
LeuGln: 2.779 ± 1.03
LeuArg: 3.86 ± 0.578
LeuSer: 6.099 ± 0.637
LeuThr: 4.787 ± 0.749
LeuVal: 5.867 ± 0.432
LeuTrp: 0.309 ± 0.172
LeuTyr: 2.162 ± 0.402
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.474 ± 0.561
MetCys: 0.232 ± 0.159
MetAsp: 1.312 ± 0.366
MetGlu: 0.926 ± 0.269
MetPhe: 1.158 ± 0.296
MetGly: 0.772 ± 0.235
MetHis: 0.232 ± 0.143
MetIle: 2.007 ± 0.331
MetLys: 1.544 ± 0.297
MetLeu: 2.007 ± 0.523
MetMet: 0.849 ± 0.339
MetAsn: 1.698 ± 0.442
MetPro: 0.618 ± 0.241
MetGln: 1.081 ± 0.309
MetArg: 1.467 ± 0.347
MetSer: 1.698 ± 0.341
MetThr: 2.084 ± 0.37
MetVal: 1.39 ± 0.305
MetTrp: 0.386 ± 0.205
MetTyr: 0.463 ± 0.168
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.246 ± 0.505
AsnCys: 0.54 ± 0.18
AsnAsp: 2.779 ± 0.423
AsnGlu: 3.551 ± 0.571
AsnPhe: 2.316 ± 0.386
AsnGly: 6.331 ± 0.962
AsnHis: 0.618 ± 0.21
AsnIle: 2.779 ± 0.395
AsnLys: 2.934 ± 0.444
AsnLeu: 3.86 ± 0.612
AsnMet: 1.158 ± 0.306
AsnAsn: 2.856 ± 0.426
AsnPro: 1.853 ± 0.342
AsnGln: 2.548 ± 0.648
AsnArg: 2.162 ± 0.323
AsnSer: 3.937 ± 0.464
AsnThr: 2.007 ± 0.444
AsnVal: 3.474 ± 0.43
AsnTrp: 0.54 ± 0.188
AsnTyr: 1.467 ± 0.335
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.856 ± 0.396
ProCys: 0.309 ± 0.197
ProAsp: 2.162 ± 0.423
ProGlu: 3.011 ± 0.442
ProPhe: 1.467 ± 0.292
ProGly: 1.853 ± 0.362
ProHis: 0.463 ± 0.149
ProIle: 2.007 ± 0.295
ProLys: 1.235 ± 0.351
ProLeu: 2.007 ± 0.381
ProMet: 0.54 ± 0.197
ProAsn: 1.235 ± 0.271
ProPro: 0.463 ± 0.22
ProGln: 1.467 ± 0.464
ProArg: 1.467 ± 0.344
ProSer: 1.235 ± 0.24
ProThr: 1.621 ± 0.367
ProVal: 3.165 ± 0.555
ProTrp: 0.463 ± 0.224
ProTyr: 1.467 ± 0.331
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.478 ± 0.985
GlnCys: 0.695 ± 0.303
GlnAsp: 1.776 ± 0.369
GlnGlu: 2.856 ± 0.573
GlnPhe: 1.621 ± 0.286
GlnGly: 2.625 ± 0.483
GlnHis: 0.309 ± 0.15
GlnIle: 3.397 ± 0.64
GlnLys: 2.084 ± 0.36
GlnLeu: 2.779 ± 0.726
GlnMet: 0.926 ± 0.234
GlnAsn: 1.544 ± 0.475
GlnPro: 1.158 ± 0.382
GlnGln: 2.393 ± 0.955
GlnArg: 1.93 ± 0.469
GlnSer: 2.779 ± 0.498
GlnThr: 1.621 ± 0.338
GlnVal: 2.47 ± 0.493
GlnTrp: 0.54 ± 0.2
GlnTyr: 1.235 ± 0.336
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.401 ± 0.59
ArgCys: 0.772 ± 0.347
ArgAsp: 2.162 ± 0.388
ArgGlu: 3.783 ± 0.557
ArgPhe: 2.548 ± 0.325
ArgGly: 2.934 ± 0.423
ArgHis: 0.463 ± 0.19
ArgIle: 4.015 ± 0.57
ArgLys: 4.864 ± 0.491
ArgLeu: 3.474 ± 0.458
ArgMet: 1.312 ± 0.33
ArgAsn: 1.93 ± 0.453
ArgPro: 1.467 ± 0.394
ArgGln: 1.698 ± 0.455
ArgArg: 2.856 ± 0.528
ArgSer: 2.702 ± 0.377
ArgThr: 1.853 ± 0.372
ArgVal: 3.86 ± 0.509
ArgTrp: 0.386 ± 0.163
ArgTyr: 2.162 ± 0.423
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.327 ± 0.826
SerCys: 0.695 ± 0.322
SerAsp: 5.173 ± 0.56
SerGlu: 5.327 ± 0.549
SerPhe: 2.47 ± 0.471
SerGly: 7.18 ± 0.887
SerHis: 1.158 ± 0.306
SerIle: 4.092 ± 0.512
SerLys: 3.937 ± 0.559
SerLeu: 4.709 ± 0.697
SerMet: 1.544 ± 0.357
SerAsn: 4.169 ± 0.813
SerPro: 2.393 ± 0.396
SerGln: 2.934 ± 0.511
SerArg: 3.32 ± 0.534
SerSer: 4.709 ± 0.694
SerThr: 3.629 ± 0.491
SerVal: 5.173 ± 0.683
SerTrp: 0.695 ± 0.264
SerTyr: 2.316 ± 0.426
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.173 ± 0.783
ThrCys: 0.54 ± 0.221
ThrAsp: 2.856 ± 0.445
ThrGlu: 2.548 ± 0.385
ThrPhe: 2.393 ± 0.46
ThrGly: 6.408 ± 0.823
ThrHis: 0.463 ± 0.174
ThrIle: 4.401 ± 0.547
ThrLys: 2.856 ± 0.535
ThrLeu: 3.242 ± 0.439
ThrMet: 0.849 ± 0.231
ThrAsn: 3.629 ± 0.489
ThrPro: 2.316 ± 0.354
ThrGln: 2.393 ± 0.501
ThrArg: 2.702 ± 0.382
ThrSer: 3.551 ± 0.441
ThrThr: 3.088 ± 0.573
ThrVal: 3.86 ± 0.553
ThrTrp: 0.386 ± 0.148
ThrTyr: 2.625 ± 0.46
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.404 ± 0.724
ValCys: 1.158 ± 0.377
ValAsp: 4.864 ± 0.539
ValGlu: 4.401 ± 0.797
ValPhe: 2.548 ± 0.367
ValGly: 3.629 ± 0.643
ValHis: 0.849 ± 0.218
ValIle: 4.941 ± 0.698
ValLys: 5.481 ± 0.656
ValLeu: 3.706 ± 0.463
ValMet: 1.312 ± 0.347
ValAsn: 4.092 ± 0.591
ValPro: 2.084 ± 0.437
ValGln: 2.239 ± 0.593
ValArg: 4.015 ± 0.526
ValSer: 6.176 ± 0.671
ValThr: 4.323 ± 0.531
ValVal: 5.713 ± 0.85
ValTrp: 0.849 ± 0.228
ValTyr: 2.47 ± 0.493
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.54 ± 0.176
TrpCys: 0.309 ± 0.149
TrpAsp: 0.695 ± 0.184
TrpGlu: 0.772 ± 0.261
TrpPhe: 0.618 ± 0.262
TrpGly: 0.849 ± 0.257
TrpHis: 0.232 ± 0.11
TrpIle: 1.004 ± 0.326
TrpLys: 0.926 ± 0.248
TrpLeu: 0.618 ± 0.178
TrpMet: 0.232 ± 0.128
TrpAsn: 0.54 ± 0.182
TrpPro: 0.463 ± 0.215
TrpGln: 0.463 ± 0.159
TrpArg: 0.695 ± 0.205
TrpSer: 0.772 ± 0.329
TrpThr: 0.463 ± 0.224
TrpVal: 0.54 ± 0.184
TrpTrp: 0.077 ± 0.08
TrpTyr: 0.309 ± 0.134
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.47 ± 0.443
TyrCys: 0.463 ± 0.189
TyrAsp: 2.548 ± 0.523
TyrGlu: 2.47 ± 0.469
TyrPhe: 1.004 ± 0.29
TyrGly: 2.702 ± 0.398
TyrHis: 0.463 ± 0.223
TyrIle: 2.702 ± 0.444
TyrLys: 1.93 ± 0.446
TyrLeu: 2.702 ± 0.411
TyrMet: 0.695 ± 0.232
TyrAsn: 2.47 ± 0.355
TyrPro: 1.544 ± 0.345
TyrGln: 1.312 ± 0.337
TyrArg: 2.162 ± 0.472
TyrSer: 3.629 ± 0.667
TyrThr: 2.393 ± 0.329
TyrVal: 2.47 ± 0.417
TyrTrp: 0.463 ± 0.183
TyrTyr: 1.467 ± 0.303
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (12954 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
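A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the sample standard deviation of those estimates is taken as the error. The function names, the fixed seed, and the use of the standard deviation are assumptions; the page does not say which spread measure the database reports.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (per mille) across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy = ["MAAK", "MAAL", "MKKL"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each dipeptide inside its original sequence context, which is presumably why the note specifies the protein level.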

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski