Amino acid dipepetide frequency for Escherichia phage vB_EcoS_fKuEco01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.784AlaAla: 10.784 ± 1.678
1.123AlaCys: 1.123 ± 0.279
6.74AlaAsp: 6.74 ± 0.82
7.115AlaGlu: 7.115 ± 0.879
4.493AlaPhe: 4.493 ± 0.524
7.489AlaGly: 7.489 ± 0.735
1.198AlaHis: 1.198 ± 0.335
5.916AlaIle: 5.916 ± 0.772
6.665AlaLys: 6.665 ± 0.758
8.313AlaLeu: 8.313 ± 0.838
2.097AlaMet: 2.097 ± 0.448
3.52AlaAsn: 3.52 ± 0.445
3.819AlaPro: 3.819 ± 0.545
3.07AlaGln: 3.07 ± 0.522
3.969AlaArg: 3.969 ± 0.509
5.467AlaSer: 5.467 ± 0.737
4.418AlaThr: 4.418 ± 0.639
7.189AlaVal: 7.189 ± 0.745
1.797AlaTrp: 1.797 ± 0.372
3.52AlaTyr: 3.52 ± 0.375
0.0AlaXaa: 0.0 ± 0.0
Cys
1.198CysAla: 1.198 ± 0.358
0.15CysCys: 0.15 ± 0.14
1.048CysAsp: 1.048 ± 0.238
0.974CysGlu: 0.974 ± 0.298
0.225CysPhe: 0.225 ± 0.143
1.123CysGly: 1.123 ± 0.318
0.15CysHis: 0.15 ± 0.113
0.449CysIle: 0.449 ± 0.192
0.674CysLys: 0.674 ± 0.291
0.599CysLeu: 0.599 ± 0.243
0.075CysMet: 0.075 ± 0.072
0.374CysAsn: 0.374 ± 0.202
0.374CysPro: 0.374 ± 0.151
0.3CysGln: 0.3 ± 0.146
0.824CysArg: 0.824 ± 0.278
0.599CysSer: 0.599 ± 0.288
0.974CysThr: 0.974 ± 0.346
0.749CysVal: 0.749 ± 0.231
0.15CysTrp: 0.15 ± 0.096
0.524CysTyr: 0.524 ± 0.194
0.0CysXaa: 0.0 ± 0.0
Asp
6.89AspAla: 6.89 ± 0.752
0.749AspCys: 0.749 ± 0.265
4.868AspAsp: 4.868 ± 0.563
4.643AspGlu: 4.643 ± 0.634
2.396AspPhe: 2.396 ± 0.333
6.59AspGly: 6.59 ± 0.882
0.824AspHis: 0.824 ± 0.245
4.194AspIle: 4.194 ± 0.472
3.295AspLys: 3.295 ± 0.396
4.418AspLeu: 4.418 ± 0.621
1.123AspMet: 1.123 ± 0.283
2.696AspAsn: 2.696 ± 0.587
1.273AspPro: 1.273 ± 0.365
1.198AspGln: 1.198 ± 0.34
2.471AspArg: 2.471 ± 0.427
2.621AspSer: 2.621 ± 0.456
3.445AspThr: 3.445 ± 0.411
4.044AspVal: 4.044 ± 0.48
0.899AspTrp: 0.899 ± 0.223
2.471AspTyr: 2.471 ± 0.337
0.0AspXaa: 0.0 ± 0.0
Glu
5.692GluAla: 5.692 ± 0.772
0.524GluCys: 0.524 ± 0.316
3.969GluAsp: 3.969 ± 0.61
5.392GluGlu: 5.392 ± 1.044
2.621GluPhe: 2.621 ± 0.446
4.269GluGly: 4.269 ± 0.59
0.899GluHis: 0.899 ± 0.279
3.595GluIle: 3.595 ± 0.526
4.493GluLys: 4.493 ± 0.797
5.991GluLeu: 5.991 ± 0.638
2.621GluMet: 2.621 ± 0.507
1.947GluAsn: 1.947 ± 0.389
2.172GluPro: 2.172 ± 0.501
3.67GluGln: 3.67 ± 0.772
4.493GluArg: 4.493 ± 0.626
3.22GluSer: 3.22 ± 0.534
3.37GluThr: 3.37 ± 0.639
4.643GluVal: 4.643 ± 0.5
1.423GluTrp: 1.423 ± 0.31
2.696GluTyr: 2.696 ± 0.528
0.0GluXaa: 0.0 ± 0.0
Phe
2.546PheAla: 2.546 ± 0.423
0.749PheCys: 0.749 ± 0.226
3.52PheAsp: 3.52 ± 0.589
2.696PheGlu: 2.696 ± 0.425
1.123PhePhe: 1.123 ± 0.333
3.445PheGly: 3.445 ± 0.504
0.524PheHis: 0.524 ± 0.175
2.771PheIle: 2.771 ± 0.506
2.546PheLys: 2.546 ± 0.397
2.097PheLeu: 2.097 ± 0.374
0.899PheMet: 0.899 ± 0.205
1.872PheAsn: 1.872 ± 0.31
1.423PhePro: 1.423 ± 0.31
0.899PheGln: 0.899 ± 0.228
1.947PheArg: 1.947 ± 0.28
3.295PheSer: 3.295 ± 0.43
2.621PheThr: 2.621 ± 0.409
2.621PheVal: 2.621 ± 0.445
0.674PheTrp: 0.674 ± 0.217
1.423PheTyr: 1.423 ± 0.278
0.0PheXaa: 0.0 ± 0.0
Gly
6.815GlyAla: 6.815 ± 0.851
1.348GlyCys: 1.348 ± 0.319
4.493GlyAsp: 4.493 ± 0.722
5.167GlyGlu: 5.167 ± 0.675
3.37GlyPhe: 3.37 ± 0.563
5.392GlyGly: 5.392 ± 0.708
1.722GlyHis: 1.722 ± 0.422
2.921GlyIle: 2.921 ± 0.454
5.392GlyLys: 5.392 ± 0.874
5.766GlyLeu: 5.766 ± 0.557
2.172GlyMet: 2.172 ± 0.536
3.22GlyAsn: 3.22 ± 0.464
1.722GlyPro: 1.722 ± 0.318
3.67GlyGln: 3.67 ± 0.66
3.445GlyArg: 3.445 ± 0.5
5.092GlySer: 5.092 ± 0.678
4.344GlyThr: 4.344 ± 0.598
5.692GlyVal: 5.692 ± 1.001
1.198GlyTrp: 1.198 ± 0.279
2.471GlyTyr: 2.471 ± 0.488
0.0GlyXaa: 0.0 ± 0.0
His
1.123HisAla: 1.123 ± 0.227
0.449HisCys: 0.449 ± 0.139
0.974HisAsp: 0.974 ± 0.24
1.048HisGlu: 1.048 ± 0.282
0.674HisPhe: 0.674 ± 0.2
1.348HisGly: 1.348 ± 0.37
0.674HisHis: 0.674 ± 0.256
1.123HisIle: 1.123 ± 0.261
1.123HisLys: 1.123 ± 0.349
1.797HisLeu: 1.797 ± 0.375
0.225HisMet: 0.225 ± 0.141
0.824HisAsn: 0.824 ± 0.235
0.974HisPro: 0.974 ± 0.25
0.599HisGln: 0.599 ± 0.177
1.273HisArg: 1.273 ± 0.318
0.599HisSer: 0.599 ± 0.226
0.749HisThr: 0.749 ± 0.368
1.348HisVal: 1.348 ± 0.469
0.15HisTrp: 0.15 ± 0.093
0.599HisTyr: 0.599 ± 0.193
0.0HisXaa: 0.0 ± 0.0
Ile
5.167IleAla: 5.167 ± 0.499
0.824IleCys: 0.824 ± 0.254
3.744IleAsp: 3.744 ± 0.484
3.894IleGlu: 3.894 ± 0.403
1.573IlePhe: 1.573 ± 0.379
2.546IleGly: 2.546 ± 0.431
0.449IleHis: 0.449 ± 0.211
2.921IleIle: 2.921 ± 0.368
3.445IleLys: 3.445 ± 0.653
3.07IleLeu: 3.07 ± 0.484
0.974IleMet: 0.974 ± 0.261
3.07IleAsn: 3.07 ± 0.504
2.846IlePro: 2.846 ± 0.428
1.947IleGln: 1.947 ± 0.406
2.172IleArg: 2.172 ± 0.348
3.744IleSer: 3.744 ± 0.493
4.643IleThr: 4.643 ± 0.647
4.269IleVal: 4.269 ± 0.563
0.899IleTrp: 0.899 ± 0.256
1.573IleTyr: 1.573 ± 0.367
0.0IleXaa: 0.0 ± 0.0
Lys
6.515LysAla: 6.515 ± 1.109
0.374LysCys: 0.374 ± 0.179
3.22LysAsp: 3.22 ± 0.643
4.868LysGlu: 4.868 ± 0.9
2.097LysPhe: 2.097 ± 0.32
3.595LysGly: 3.595 ± 0.472
1.423LysHis: 1.423 ± 0.322
2.322LysIle: 2.322 ± 0.528
3.595LysLys: 3.595 ± 0.543
4.793LysLeu: 4.793 ± 0.558
3.07LysMet: 3.07 ± 0.572
1.947LysAsn: 1.947 ± 0.475
2.621LysPro: 2.621 ± 0.451
2.172LysGln: 2.172 ± 0.476
4.269LysArg: 4.269 ± 0.585
2.546LysSer: 2.546 ± 0.436
3.969LysThr: 3.969 ± 0.524
3.52LysVal: 3.52 ± 0.544
0.599LysTrp: 0.599 ± 0.179
2.546LysTyr: 2.546 ± 0.422
0.0LysXaa: 0.0 ± 0.0
Leu
8.912LeuAla: 8.912 ± 0.844
0.824LeuCys: 0.824 ± 0.256
3.819LeuAsp: 3.819 ± 0.573
4.568LeuGlu: 4.568 ± 0.757
3.145LeuPhe: 3.145 ± 0.622
5.841LeuGly: 5.841 ± 0.615
1.573LeuHis: 1.573 ± 0.42
4.568LeuIle: 4.568 ± 0.553
3.894LeuLys: 3.894 ± 0.539
5.467LeuLeu: 5.467 ± 0.71
1.648LeuMet: 1.648 ± 0.289
4.344LeuAsn: 4.344 ± 0.675
4.119LeuPro: 4.119 ± 0.623
2.771LeuGln: 2.771 ± 0.466
4.418LeuArg: 4.418 ± 0.549
4.194LeuSer: 4.194 ± 0.545
5.392LeuThr: 5.392 ± 0.828
4.344LeuVal: 4.344 ± 0.606
1.048LeuTrp: 1.048 ± 0.299
2.696LeuTyr: 2.696 ± 0.494
0.0LeuXaa: 0.0 ± 0.0
Met
2.621MetAla: 2.621 ± 0.402
0.449MetCys: 0.449 ± 0.142
0.599MetAsp: 0.599 ± 0.255
1.273MetGlu: 1.273 ± 0.344
0.824MetPhe: 0.824 ± 0.272
2.247MetGly: 2.247 ± 0.367
0.374MetHis: 0.374 ± 0.194
1.648MetIle: 1.648 ± 0.398
1.872MetLys: 1.872 ± 0.368
1.722MetLeu: 1.722 ± 0.357
0.449MetMet: 0.449 ± 0.209
0.899MetAsn: 0.899 ± 0.26
1.423MetPro: 1.423 ± 0.25
0.899MetGln: 0.899 ± 0.234
1.423MetArg: 1.423 ± 0.265
2.097MetSer: 2.097 ± 0.325
1.648MetThr: 1.648 ± 0.318
1.722MetVal: 1.722 ± 0.385
0.3MetTrp: 0.3 ± 0.151
0.15MetTyr: 0.15 ± 0.102
0.0MetXaa: 0.0 ± 0.0
Asn
4.643AsnAla: 4.643 ± 0.492
0.449AsnCys: 0.449 ± 0.172
2.396AsnAsp: 2.396 ± 0.357
2.621AsnGlu: 2.621 ± 0.444
1.123AsnPhe: 1.123 ± 0.268
4.044AsnGly: 4.044 ± 0.588
0.749AsnHis: 0.749 ± 0.266
2.022AsnIle: 2.022 ± 0.401
2.097AsnLys: 2.097 ± 0.309
2.846AsnLeu: 2.846 ± 0.371
1.048AsnMet: 1.048 ± 0.317
2.396AsnAsn: 2.396 ± 0.421
1.722AsnPro: 1.722 ± 0.266
1.573AsnGln: 1.573 ± 0.364
2.771AsnArg: 2.771 ± 0.534
2.996AsnSer: 2.996 ± 0.367
2.471AsnThr: 2.471 ± 0.505
3.52AsnVal: 3.52 ± 0.472
0.824AsnTrp: 0.824 ± 0.22
1.123AsnTyr: 1.123 ± 0.278
0.0AsnXaa: 0.0 ± 0.0
Pro
3.295ProAla: 3.295 ± 0.404
0.524ProCys: 0.524 ± 0.16
2.996ProAsp: 2.996 ± 0.463
3.37ProGlu: 3.37 ± 0.408
1.947ProPhe: 1.947 ± 0.396
2.696ProGly: 2.696 ± 0.478
0.599ProHis: 0.599 ± 0.24
1.498ProIle: 1.498 ± 0.356
1.348ProLys: 1.348 ± 0.302
2.846ProLeu: 2.846 ± 0.487
0.899ProMet: 0.899 ± 0.282
1.797ProAsn: 1.797 ± 0.338
0.974ProPro: 0.974 ± 0.283
1.198ProGln: 1.198 ± 0.264
1.648ProArg: 1.648 ± 0.362
2.546ProSer: 2.546 ± 0.398
2.396ProThr: 2.396 ± 0.436
4.194ProVal: 4.194 ± 0.55
0.374ProTrp: 0.374 ± 0.183
1.048ProTyr: 1.048 ± 0.291
0.0ProXaa: 0.0 ± 0.0
Gln
3.67GlnAla: 3.67 ± 0.682
0.374GlnCys: 0.374 ± 0.212
1.797GlnAsp: 1.797 ± 0.337
2.247GlnGlu: 2.247 ± 0.449
1.423GlnPhe: 1.423 ± 0.279
1.722GlnGly: 1.722 ± 0.302
0.899GlnHis: 0.899 ± 0.248
1.947GlnIle: 1.947 ± 0.353
2.846GlnLys: 2.846 ± 0.496
3.595GlnLeu: 3.595 ± 0.468
1.198GlnMet: 1.198 ± 0.261
1.573GlnAsn: 1.573 ± 0.388
1.348GlnPro: 1.348 ± 0.241
2.322GlnGln: 2.322 ± 0.647
2.696GlnArg: 2.696 ± 0.439
1.722GlnSer: 1.722 ± 0.372
2.022GlnThr: 2.022 ± 0.447
2.846GlnVal: 2.846 ± 0.46
0.749GlnTrp: 0.749 ± 0.231
1.573GlnTyr: 1.573 ± 0.39
0.0GlnXaa: 0.0 ± 0.0
Arg
4.643ArgAla: 4.643 ± 0.491
0.225ArgCys: 0.225 ± 0.139
2.621ArgAsp: 2.621 ± 0.392
3.894ArgGlu: 3.894 ± 0.642
1.947ArgPhe: 1.947 ± 0.362
3.819ArgGly: 3.819 ± 0.504
1.498ArgHis: 1.498 ± 0.364
2.546ArgIle: 2.546 ± 0.411
3.819ArgLys: 3.819 ± 0.525
4.568ArgLeu: 4.568 ± 0.624
1.797ArgMet: 1.797 ± 0.329
2.546ArgAsn: 2.546 ± 0.339
1.423ArgPro: 1.423 ± 0.315
3.445ArgGln: 3.445 ± 0.515
4.718ArgArg: 4.718 ± 0.705
3.07ArgSer: 3.07 ± 0.355
2.546ArgThr: 2.546 ± 0.427
3.22ArgVal: 3.22 ± 0.406
0.674ArgTrp: 0.674 ± 0.238
1.498ArgTyr: 1.498 ± 0.344
0.0ArgXaa: 0.0 ± 0.0
Ser
5.167SerAla: 5.167 ± 0.592
0.3SerCys: 0.3 ± 0.144
3.37SerAsp: 3.37 ± 0.509
3.07SerGlu: 3.07 ± 0.511
2.471SerPhe: 2.471 ± 0.427
6.216SerGly: 6.216 ± 0.76
1.273SerHis: 1.273 ± 0.281
3.295SerIle: 3.295 ± 0.729
3.295SerLys: 3.295 ± 0.544
4.643SerLeu: 4.643 ± 0.492
1.273SerMet: 1.273 ± 0.275
2.546SerAsn: 2.546 ± 0.476
2.471SerPro: 2.471 ± 0.547
2.097SerGln: 2.097 ± 0.515
3.145SerArg: 3.145 ± 0.495
3.22SerSer: 3.22 ± 0.847
4.119SerThr: 4.119 ± 0.528
4.269SerVal: 4.269 ± 0.412
0.449SerTrp: 0.449 ± 0.223
2.097SerTyr: 2.097 ± 0.427
0.0SerXaa: 0.0 ± 0.0
Thr
6.965ThrAla: 6.965 ± 0.88
0.824ThrCys: 0.824 ± 0.25
3.744ThrAsp: 3.744 ± 0.444
3.52ThrGlu: 3.52 ± 0.519
3.145ThrPhe: 3.145 ± 0.535
5.092ThrGly: 5.092 ± 0.886
1.048ThrHis: 1.048 ± 0.342
3.37ThrIle: 3.37 ± 0.462
3.445ThrLys: 3.445 ± 0.461
4.868ThrLeu: 4.868 ± 0.532
0.749ThrMet: 0.749 ± 0.231
1.947ThrAsn: 1.947 ± 0.412
3.595ThrPro: 3.595 ± 0.608
1.872ThrGln: 1.872 ± 0.405
2.471ThrArg: 2.471 ± 0.368
3.295ThrSer: 3.295 ± 0.477
3.969ThrThr: 3.969 ± 0.66
3.52ThrVal: 3.52 ± 0.456
1.048ThrTrp: 1.048 ± 0.281
2.097ThrTyr: 2.097 ± 0.369
0.0ThrXaa: 0.0 ± 0.0
Val
7.714ValAla: 7.714 ± 0.575
0.749ValCys: 0.749 ± 0.218
3.744ValAsp: 3.744 ± 0.519
4.269ValGlu: 4.269 ± 0.769
2.172ValPhe: 2.172 ± 0.344
4.044ValGly: 4.044 ± 0.578
0.674ValHis: 0.674 ± 0.202
4.643ValIle: 4.643 ± 0.733
3.819ValLys: 3.819 ± 0.55
5.766ValLeu: 5.766 ± 0.841
1.198ValMet: 1.198 ± 0.326
3.595ValAsn: 3.595 ± 0.62
2.172ValPro: 2.172 ± 0.472
2.846ValGln: 2.846 ± 0.461
3.67ValArg: 3.67 ± 0.549
5.242ValSer: 5.242 ± 0.626
4.943ValThr: 4.943 ± 0.705
4.718ValVal: 4.718 ± 0.874
0.899ValTrp: 0.899 ± 0.272
2.471ValTyr: 2.471 ± 0.527
0.0ValXaa: 0.0 ± 0.0
Trp
1.048TrpAla: 1.048 ± 0.362
0.075TrpCys: 0.075 ± 0.077
0.974TrpAsp: 0.974 ± 0.277
0.524TrpGlu: 0.524 ± 0.238
0.974TrpPhe: 0.974 ± 0.323
0.749TrpGly: 0.749 ± 0.246
0.374TrpHis: 0.374 ± 0.197
0.374TrpIle: 0.374 ± 0.162
0.749TrpLys: 0.749 ± 0.217
2.022TrpLeu: 2.022 ± 0.356
0.674TrpMet: 0.674 ± 0.211
0.674TrpAsn: 0.674 ± 0.255
0.449TrpPro: 0.449 ± 0.208
0.599TrpGln: 0.599 ± 0.213
0.824TrpArg: 0.824 ± 0.259
1.123TrpSer: 1.123 ± 0.424
0.749TrpThr: 0.749 ± 0.2
0.899TrpVal: 0.899 ± 0.299
0.075TrpTrp: 0.075 ± 0.065
0.524TrpTyr: 0.524 ± 0.188
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.295TyrAla: 3.295 ± 0.52
0.449TyrCys: 0.449 ± 0.192
2.471TyrAsp: 2.471 ± 0.623
2.621TyrGlu: 2.621 ± 0.39
1.947TyrPhe: 1.947 ± 0.389
3.145TyrGly: 3.145 ± 0.493
0.749TyrHis: 0.749 ± 0.231
1.648TyrIle: 1.648 ± 0.285
1.648TyrLys: 1.648 ± 0.294
2.396TyrLeu: 2.396 ± 0.362
0.374TyrMet: 0.374 ± 0.153
1.648TyrAsn: 1.648 ± 0.359
1.273TyrPro: 1.273 ± 0.394
1.423TyrGln: 1.423 ± 0.262
1.872TyrArg: 1.872 ± 0.322
2.097TyrSer: 2.097 ± 0.387
1.797TyrThr: 1.797 ± 0.497
2.022TyrVal: 2.022 ± 0.353
0.225TyrTrp: 0.225 ± 0.126
1.123TyrTyr: 1.123 ± 0.216
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (13354 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski