Amino acid dipepetide frequency for Enterobacteria phage 2851

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.549AlaAla: 10.549 ± 1.515
0.869AlaCys: 0.869 ± 0.224
5.647AlaAsp: 5.647 ± 0.712
7.943AlaGlu: 7.943 ± 0.983
3.227AlaPhe: 3.227 ± 0.419
7.136AlaGly: 7.136 ± 1.046
1.365AlaHis: 1.365 ± 0.304
5.213AlaIle: 5.213 ± 0.525
3.971AlaLys: 3.971 ± 0.583
8.315AlaLeu: 8.315 ± 0.876
3.041AlaMet: 3.041 ± 0.382
3.475AlaAsn: 3.475 ± 0.419
2.296AlaPro: 2.296 ± 0.322
3.847AlaGln: 3.847 ± 0.521
6.392AlaArg: 6.392 ± 0.782
6.95AlaSer: 6.95 ± 0.788
4.406AlaThr: 4.406 ± 0.641
6.702AlaVal: 6.702 ± 1.131
1.986AlaTrp: 1.986 ± 0.361
2.358AlaTyr: 2.358 ± 0.426
0.0AlaXaa: 0.0 ± 0.0
Cys
0.869CysAla: 0.869 ± 0.254
0.434CysCys: 0.434 ± 0.19
0.745CysAsp: 0.745 ± 0.224
0.434CysGlu: 0.434 ± 0.161
0.31CysPhe: 0.31 ± 0.144
1.117CysGly: 1.117 ± 0.303
0.496CysHis: 0.496 ± 0.184
0.621CysIle: 0.621 ± 0.208
0.496CysLys: 0.496 ± 0.189
1.241CysLeu: 1.241 ± 0.246
0.31CysMet: 0.31 ± 0.15
0.558CysAsn: 0.558 ± 0.189
0.434CysPro: 0.434 ± 0.165
0.372CysGln: 0.372 ± 0.134
1.179CysArg: 1.179 ± 0.313
1.117CysSer: 1.117 ± 0.238
1.241CysThr: 1.241 ± 0.297
0.869CysVal: 0.869 ± 0.215
0.186CysTrp: 0.186 ± 0.101
0.496CysTyr: 0.496 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
5.275AspAla: 5.275 ± 0.825
0.558AspCys: 0.558 ± 0.2
3.599AspAsp: 3.599 ± 0.599
3.909AspGlu: 3.909 ± 0.469
2.048AspPhe: 2.048 ± 0.382
5.771AspGly: 5.771 ± 0.727
0.745AspHis: 0.745 ± 0.16
3.537AspIle: 3.537 ± 0.451
2.979AspLys: 2.979 ± 0.522
3.909AspLeu: 3.909 ± 0.617
1.862AspMet: 1.862 ± 0.351
2.42AspAsn: 2.42 ± 0.399
2.234AspPro: 2.234 ± 0.46
1.179AspGln: 1.179 ± 0.204
3.475AspArg: 3.475 ± 0.386
2.668AspSer: 2.668 ± 0.312
2.792AspThr: 2.792 ± 0.519
3.847AspVal: 3.847 ± 0.636
0.993AspTrp: 0.993 ± 0.3
2.048AspTyr: 2.048 ± 0.421
0.0AspXaa: 0.0 ± 0.0
Glu
6.764GluAla: 6.764 ± 0.891
1.055GluCys: 1.055 ± 0.257
2.792GluAsp: 2.792 ± 0.353
3.599GluGlu: 3.599 ± 0.597
3.041GluPhe: 3.041 ± 0.426
3.475GluGly: 3.475 ± 0.385
1.179GluHis: 1.179 ± 0.275
4.096GluIle: 4.096 ± 0.522
4.22GluLys: 4.22 ± 0.459
6.081GluLeu: 6.081 ± 0.665
2.172GluMet: 2.172 ± 0.319
2.73GluAsn: 2.73 ± 0.364
2.172GluPro: 2.172 ± 0.375
3.971GluGln: 3.971 ± 0.519
4.902GluArg: 4.902 ± 0.611
4.096GluSer: 4.096 ± 0.602
3.041GluThr: 3.041 ± 0.56
3.785GluVal: 3.785 ± 0.523
0.931GluTrp: 0.931 ± 0.235
1.986GluTyr: 1.986 ± 0.346
0.0GluXaa: 0.0 ± 0.0
Phe
2.172PheAla: 2.172 ± 0.423
0.621PheCys: 0.621 ± 0.188
1.986PheAsp: 1.986 ± 0.31
1.613PheGlu: 1.613 ± 0.261
0.745PhePhe: 0.745 ± 0.243
2.544PheGly: 2.544 ± 0.316
0.621PheHis: 0.621 ± 0.225
1.489PheIle: 1.489 ± 0.296
1.613PheLys: 1.613 ± 0.406
2.234PheLeu: 2.234 ± 0.446
1.055PheMet: 1.055 ± 0.325
1.179PheAsn: 1.179 ± 0.238
1.117PhePro: 1.117 ± 0.295
0.807PheGln: 0.807 ± 0.206
2.73PheArg: 2.73 ± 0.419
3.971PheSer: 3.971 ± 0.526
2.296PheThr: 2.296 ± 0.45
2.358PheVal: 2.358 ± 0.478
0.558PheTrp: 0.558 ± 0.183
0.745PheTyr: 0.745 ± 0.202
0.0PheXaa: 0.0 ± 0.0
Gly
5.709GlyAla: 5.709 ± 0.731
0.807GlyCys: 0.807 ± 0.227
4.592GlyAsp: 4.592 ± 0.542
4.902GlyGlu: 4.902 ± 0.744
2.73GlyPhe: 2.73 ± 0.417
5.275GlyGly: 5.275 ± 0.697
1.241GlyHis: 1.241 ± 0.239
4.158GlyIle: 4.158 ± 0.599
4.22GlyLys: 4.22 ± 0.466
5.15GlyLeu: 5.15 ± 0.578
2.979GlyMet: 2.979 ± 0.495
3.723GlyAsn: 3.723 ± 0.429
2.234GlyPro: 2.234 ± 1.41
2.979GlyGln: 2.979 ± 0.475
4.282GlyArg: 4.282 ± 0.562
4.84GlySer: 4.84 ± 0.659
3.041GlyThr: 3.041 ± 0.473
5.088GlyVal: 5.088 ± 0.479
1.862GlyTrp: 1.862 ± 0.285
1.986GlyTyr: 1.986 ± 0.341
0.0GlyXaa: 0.0 ± 0.0
His
1.303HisAla: 1.303 ± 0.312
0.434HisCys: 0.434 ± 0.169
1.427HisAsp: 1.427 ± 0.298
1.303HisGlu: 1.303 ± 0.31
0.745HisPhe: 0.745 ± 0.22
1.924HisGly: 1.924 ± 0.372
0.993HisHis: 0.993 ± 0.268
1.365HisIle: 1.365 ± 0.381
0.993HisLys: 0.993 ± 0.249
1.613HisLeu: 1.613 ± 0.366
0.062HisMet: 0.062 ± 0.061
0.558HisAsn: 0.558 ± 0.172
0.993HisPro: 0.993 ± 0.235
0.621HisGln: 0.621 ± 0.164
1.055HisArg: 1.055 ± 0.268
1.179HisSer: 1.179 ± 0.241
1.117HisThr: 1.117 ± 0.249
1.055HisVal: 1.055 ± 0.228
0.372HisTrp: 0.372 ± 0.161
0.558HisTyr: 0.558 ± 0.192
0.0HisXaa: 0.0 ± 0.0
Ile
4.84IleAla: 4.84 ± 0.599
0.869IleCys: 0.869 ± 0.247
3.661IleAsp: 3.661 ± 0.485
3.165IleGlu: 3.165 ± 0.441
0.745IlePhe: 0.745 ± 0.197
3.227IleGly: 3.227 ± 0.433
1.055IleHis: 1.055 ± 0.249
2.606IleIle: 2.606 ± 0.425
3.103IleLys: 3.103 ± 0.357
3.599IleLeu: 3.599 ± 0.49
1.303IleMet: 1.303 ± 0.278
2.73IleAsn: 2.73 ± 0.492
2.792IlePro: 2.792 ± 0.59
2.172IleGln: 2.172 ± 0.354
4.158IleArg: 4.158 ± 0.365
5.523IleSer: 5.523 ± 0.669
4.096IleThr: 4.096 ± 0.478
2.11IleVal: 2.11 ± 0.342
0.621IleTrp: 0.621 ± 0.209
1.179IleTyr: 1.179 ± 0.349
0.0IleXaa: 0.0 ± 0.0
Lys
5.585LysAla: 5.585 ± 0.536
0.496LysCys: 0.496 ± 0.174
2.482LysAsp: 2.482 ± 0.36
3.351LysGlu: 3.351 ± 0.501
1.117LysPhe: 1.117 ± 0.312
4.158LysGly: 4.158 ± 0.71
1.179LysHis: 1.179 ± 0.318
2.606LysIle: 2.606 ± 0.33
3.289LysLys: 3.289 ± 0.503
3.599LysLeu: 3.599 ± 0.527
1.986LysMet: 1.986 ± 0.398
2.296LysAsn: 2.296 ± 0.351
2.854LysPro: 2.854 ± 0.482
2.917LysGln: 2.917 ± 0.491
3.103LysArg: 3.103 ± 0.484
3.413LysSer: 3.413 ± 0.54
3.537LysThr: 3.537 ± 0.529
2.854LysVal: 2.854 ± 0.589
0.807LysTrp: 0.807 ± 0.256
1.241LysTyr: 1.241 ± 0.3
0.0LysXaa: 0.0 ± 0.0
Leu
9.432LeuAla: 9.432 ± 0.854
1.427LeuCys: 1.427 ± 0.352
3.661LeuAsp: 3.661 ± 0.499
4.592LeuGlu: 4.592 ± 0.613
2.42LeuPhe: 2.42 ± 0.347
4.158LeuGly: 4.158 ± 0.784
1.862LeuHis: 1.862 ± 0.346
4.406LeuIle: 4.406 ± 0.558
5.088LeuLys: 5.088 ± 0.559
5.895LeuLeu: 5.895 ± 0.605
2.11LeuMet: 2.11 ± 0.304
4.158LeuAsn: 4.158 ± 0.569
3.661LeuPro: 3.661 ± 0.538
3.599LeuGln: 3.599 ± 0.647
6.392LeuArg: 6.392 ± 0.578
5.957LeuSer: 5.957 ± 0.668
5.337LeuThr: 5.337 ± 0.531
3.785LeuVal: 3.785 ± 0.525
1.862LeuTrp: 1.862 ± 0.314
2.172LeuTyr: 2.172 ± 0.361
0.0LeuXaa: 0.0 ± 0.0
Met
3.103MetAla: 3.103 ± 0.421
0.124MetCys: 0.124 ± 0.087
1.489MetAsp: 1.489 ± 0.334
1.055MetGlu: 1.055 ± 0.27
0.683MetPhe: 0.683 ± 0.176
1.365MetGly: 1.365 ± 0.297
0.186MetHis: 0.186 ± 0.108
1.117MetIle: 1.117 ± 0.267
1.986MetLys: 1.986 ± 0.471
2.854MetLeu: 2.854 ± 0.46
0.621MetMet: 0.621 ± 0.224
1.241MetAsn: 1.241 ± 0.261
1.055MetPro: 1.055 ± 0.257
1.489MetGln: 1.489 ± 0.302
1.738MetArg: 1.738 ± 0.288
2.73MetSer: 2.73 ± 0.425
2.792MetThr: 2.792 ± 0.381
1.179MetVal: 1.179 ± 0.358
0.434MetTrp: 0.434 ± 0.162
0.31MetTyr: 0.31 ± 0.145
0.0MetXaa: 0.0 ± 0.0
Asn
4.592AsnAla: 4.592 ± 0.579
0.496AsnCys: 0.496 ± 0.184
2.482AsnAsp: 2.482 ± 0.366
2.792AsnGlu: 2.792 ± 0.464
0.869AsnPhe: 0.869 ± 0.26
3.847AsnGly: 3.847 ± 0.656
1.303AsnHis: 1.303 ± 0.325
2.73AsnIle: 2.73 ± 0.344
2.42AsnLys: 2.42 ± 0.348
3.227AsnLeu: 3.227 ± 0.484
0.558AsnMet: 0.558 ± 0.184
1.365AsnAsn: 1.365 ± 0.315
2.482AsnPro: 2.482 ± 0.345
1.427AsnGln: 1.427 ± 0.303
2.234AsnArg: 2.234 ± 0.402
2.979AsnSer: 2.979 ± 0.373
2.42AsnThr: 2.42 ± 0.449
2.42AsnVal: 2.42 ± 0.36
0.496AsnTrp: 0.496 ± 0.15
1.055AsnTyr: 1.055 ± 0.301
0.0AsnXaa: 0.0 ± 0.0
Pro
3.351ProAla: 3.351 ± 0.623
0.31ProCys: 0.31 ± 0.141
3.351ProAsp: 3.351 ± 0.532
4.406ProGlu: 4.406 ± 0.495
1.427ProPhe: 1.427 ± 0.279
3.227ProGly: 3.227 ± 0.424
0.807ProHis: 0.807 ± 0.214
1.489ProIle: 1.489 ± 0.33
1.986ProLys: 1.986 ± 0.586
2.482ProLeu: 2.482 ± 0.297
0.745ProMet: 0.745 ± 0.243
1.179ProAsn: 1.179 ± 0.259
2.048ProPro: 2.048 ± 0.449
1.862ProGln: 1.862 ± 0.45
1.8ProArg: 1.8 ± 0.439
3.165ProSer: 3.165 ± 0.43
1.862ProThr: 1.862 ± 0.305
4.406ProVal: 4.406 ± 0.562
0.496ProTrp: 0.496 ± 0.204
1.117ProTyr: 1.117 ± 0.235
0.0ProXaa: 0.0 ± 0.0
Gln
3.909GlnAla: 3.909 ± 0.606
0.931GlnCys: 0.931 ± 0.204
1.8GlnAsp: 1.8 ± 0.302
2.42GlnGlu: 2.42 ± 0.488
1.613GlnPhe: 1.613 ± 0.297
3.041GlnGly: 3.041 ± 0.454
1.055GlnHis: 1.055 ± 0.221
2.979GlnIle: 2.979 ± 0.353
2.917GlnLys: 2.917 ± 0.46
3.537GlnLeu: 3.537 ± 0.463
1.303GlnMet: 1.303 ± 0.301
1.675GlnAsn: 1.675 ± 0.514
1.738GlnPro: 1.738 ± 0.298
2.42GlnGln: 2.42 ± 0.534
2.73GlnArg: 2.73 ± 0.54
2.917GlnSer: 2.917 ± 0.497
1.862GlnThr: 1.862 ± 0.43
1.986GlnVal: 1.986 ± 0.401
0.869GlnTrp: 0.869 ± 0.232
1.675GlnTyr: 1.675 ± 0.303
0.0GlnXaa: 0.0 ± 0.0
Arg
4.778ArgAla: 4.778 ± 0.685
0.745ArgCys: 0.745 ± 0.265
3.909ArgAsp: 3.909 ± 0.832
5.585ArgGlu: 5.585 ± 0.582
2.296ArgPhe: 2.296 ± 0.464
3.785ArgGly: 3.785 ± 0.441
1.738ArgHis: 1.738 ± 0.33
3.351ArgIle: 3.351 ± 0.556
4.034ArgLys: 4.034 ± 0.512
6.702ArgLeu: 6.702 ± 0.749
1.862ArgMet: 1.862 ± 0.31
3.599ArgAsn: 3.599 ± 0.487
2.544ArgPro: 2.544 ± 0.535
3.475ArgGln: 3.475 ± 0.569
5.895ArgArg: 5.895 ± 0.722
2.979ArgSer: 2.979 ± 0.491
2.854ArgThr: 2.854 ± 0.459
4.406ArgVal: 4.406 ± 0.49
0.993ArgTrp: 0.993 ± 0.256
1.613ArgTyr: 1.613 ± 0.415
0.0ArgXaa: 0.0 ± 0.0
Ser
8.129SerAla: 8.129 ± 1.012
0.931SerCys: 0.931 ± 0.247
3.351SerAsp: 3.351 ± 0.459
4.344SerGlu: 4.344 ± 0.557
1.924SerPhe: 1.924 ± 0.324
5.833SerGly: 5.833 ± 0.796
0.807SerHis: 0.807 ± 0.229
3.475SerIle: 3.475 ± 0.446
2.296SerLys: 2.296 ± 0.382
6.516SerLeu: 6.516 ± 0.804
2.296SerMet: 2.296 ± 0.312
2.792SerAsn: 2.792 ± 0.355
2.979SerPro: 2.979 ± 0.437
4.282SerGln: 4.282 ± 0.533
4.22SerArg: 4.22 ± 0.441
4.22SerSer: 4.22 ± 0.519
3.351SerThr: 3.351 ± 0.47
5.213SerVal: 5.213 ± 0.642
0.621SerTrp: 0.621 ± 0.203
1.862SerTyr: 1.862 ± 0.453
0.0SerXaa: 0.0 ± 0.0
Thr
5.275ThrAla: 5.275 ± 0.617
0.993ThrCys: 0.993 ± 0.345
3.103ThrAsp: 3.103 ± 0.565
4.592ThrGlu: 4.592 ± 0.721
2.172ThrPhe: 2.172 ± 0.42
5.088ThrGly: 5.088 ± 0.715
1.117ThrHis: 1.117 ± 0.237
2.73ThrIle: 2.73 ± 0.465
2.42ThrLys: 2.42 ± 0.348
6.019ThrLeu: 6.019 ± 0.671
0.807ThrMet: 0.807 ± 0.238
1.551ThrAsn: 1.551 ± 0.342
3.103ThrPro: 3.103 ± 0.579
2.048ThrGln: 2.048 ± 0.291
2.358ThrArg: 2.358 ± 0.328
2.792ThrSer: 2.792 ± 0.483
3.661ThrThr: 3.661 ± 0.516
5.088ThrVal: 5.088 ± 0.634
0.496ThrTrp: 0.496 ± 0.172
1.427ThrTyr: 1.427 ± 0.32
0.0ThrXaa: 0.0 ± 0.0
Val
6.267ValAla: 6.267 ± 0.657
0.993ValCys: 0.993 ± 0.24
3.227ValAsp: 3.227 ± 0.354
3.971ValGlu: 3.971 ± 0.454
1.986ValPhe: 1.986 ± 0.493
4.034ValGly: 4.034 ± 0.594
0.621ValHis: 0.621 ± 0.196
3.785ValIle: 3.785 ± 0.523
3.103ValLys: 3.103 ± 0.45
5.337ValLeu: 5.337 ± 0.688
1.489ValMet: 1.489 ± 0.288
3.599ValAsn: 3.599 ± 0.404
2.979ValPro: 2.979 ± 0.431
1.862ValGln: 1.862 ± 0.448
4.654ValArg: 4.654 ± 0.561
5.026ValSer: 5.026 ± 0.682
4.406ValThr: 4.406 ± 0.692
4.716ValVal: 4.716 ± 0.554
1.179ValTrp: 1.179 ± 0.31
1.924ValTyr: 1.924 ± 0.333
0.0ValXaa: 0.0 ± 0.0
Trp
1.117TrpAla: 1.117 ± 0.273
0.31TrpCys: 0.31 ± 0.129
0.745TrpAsp: 0.745 ± 0.254
0.745TrpGlu: 0.745 ± 0.196
0.745TrpPhe: 0.745 ± 0.219
1.055TrpGly: 1.055 ± 0.209
0.621TrpHis: 0.621 ± 0.21
0.496TrpIle: 0.496 ± 0.175
0.745TrpLys: 0.745 ± 0.231
1.862TrpLeu: 1.862 ± 0.399
0.372TrpMet: 0.372 ± 0.136
0.496TrpAsn: 0.496 ± 0.16
0.621TrpPro: 0.621 ± 0.2
0.869TrpGln: 0.869 ± 0.203
1.489TrpArg: 1.489 ± 0.281
0.807TrpSer: 0.807 ± 0.221
1.055TrpThr: 1.055 ± 0.25
1.738TrpVal: 1.738 ± 0.288
0.248TrpTrp: 0.248 ± 0.153
0.372TrpTyr: 0.372 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.73TyrAla: 2.73 ± 0.413
0.124TyrCys: 0.124 ± 0.088
1.8TyrAsp: 1.8 ± 0.337
1.365TyrGlu: 1.365 ± 0.294
1.613TyrPhe: 1.613 ± 0.358
1.738TyrGly: 1.738 ± 0.399
0.683TyrHis: 0.683 ± 0.154
1.303TyrIle: 1.303 ± 0.355
0.993TyrLys: 0.993 ± 0.234
1.8TyrLeu: 1.8 ± 0.344
0.558TyrMet: 0.558 ± 0.225
0.869TyrAsn: 0.869 ± 0.274
1.179TyrPro: 1.179 ± 0.311
1.179TyrGln: 1.179 ± 0.37
2.358TyrArg: 2.358 ± 0.434
2.11TyrSer: 2.11 ± 0.338
1.675TyrThr: 1.675 ± 0.327
1.551TyrVal: 1.551 ± 0.33
0.496TyrTrp: 0.496 ± 0.227
0.683TyrTyr: 0.683 ± 0.223
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (16116 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski