Amino acid dipepetide frequency for Escherichia phage HK630

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.512AlaAla: 10.512 ± 1.61
0.515AlaCys: 0.515 ± 0.205
5.513AlaAsp: 5.513 ± 0.606
6.395AlaGlu: 6.395 ± 0.733
3.749AlaPhe: 3.749 ± 0.511
6.542AlaGly: 6.542 ± 0.986
1.544AlaHis: 1.544 ± 0.284
6.395AlaIle: 6.395 ± 0.732
4.557AlaLys: 4.557 ± 0.555
7.057AlaLeu: 7.057 ± 0.723
3.014AlaMet: 3.014 ± 0.375
2.867AlaAsn: 2.867 ± 0.542
2.352AlaPro: 2.352 ± 0.5
4.043AlaGln: 4.043 ± 0.881
6.395AlaArg: 6.395 ± 0.881
6.101AlaSer: 6.101 ± 0.618
4.704AlaThr: 4.704 ± 0.815
6.542AlaVal: 6.542 ± 0.757
2.205AlaTrp: 2.205 ± 0.481
3.308AlaTyr: 3.308 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
1.103CysAla: 1.103 ± 0.312
0.441CysCys: 0.441 ± 0.239
0.441CysAsp: 0.441 ± 0.167
0.809CysGlu: 0.809 ± 0.265
0.221CysPhe: 0.221 ± 0.116
1.029CysGly: 1.029 ± 0.246
0.294CysHis: 0.294 ± 0.172
0.809CysIle: 0.809 ± 0.232
0.735CysLys: 0.735 ± 0.244
0.588CysLeu: 0.588 ± 0.191
0.147CysMet: 0.147 ± 0.088
0.441CysAsn: 0.441 ± 0.164
0.515CysPro: 0.515 ± 0.23
0.221CysGln: 0.221 ± 0.106
1.029CysArg: 1.029 ± 0.358
1.103CysSer: 1.103 ± 0.305
0.809CysThr: 0.809 ± 0.254
0.809CysVal: 0.809 ± 0.22
0.221CysTrp: 0.221 ± 0.099
0.441CysTyr: 0.441 ± 0.18
0.0CysXaa: 0.0 ± 0.0
Asp
5.954AspAla: 5.954 ± 0.832
0.735AspCys: 0.735 ± 0.234
4.263AspAsp: 4.263 ± 0.494
4.41AspGlu: 4.41 ± 0.69
1.691AspPhe: 1.691 ± 0.299
5.072AspGly: 5.072 ± 0.755
0.735AspHis: 0.735 ± 0.217
3.896AspIle: 3.896 ± 0.54
3.234AspLys: 3.234 ± 0.539
4.043AspLeu: 4.043 ± 0.491
1.617AspMet: 1.617 ± 0.347
2.132AspAsn: 2.132 ± 0.41
2.573AspPro: 2.573 ± 0.613
1.103AspGln: 1.103 ± 0.284
2.867AspArg: 2.867 ± 0.458
2.94AspSer: 2.94 ± 0.463
3.381AspThr: 3.381 ± 0.551
3.749AspVal: 3.749 ± 0.509
1.25AspTrp: 1.25 ± 0.3
2.205AspTyr: 2.205 ± 0.441
0.0AspXaa: 0.0 ± 0.0
Glu
6.616GluAla: 6.616 ± 0.87
0.882GluCys: 0.882 ± 0.314
3.014GluAsp: 3.014 ± 0.44
4.263GluGlu: 4.263 ± 0.582
1.985GluPhe: 1.985 ± 0.376
3.381GluGly: 3.381 ± 0.49
1.323GluHis: 1.323 ± 0.407
3.381GluIle: 3.381 ± 0.437
4.41GluLys: 4.41 ± 0.508
6.542GluLeu: 6.542 ± 0.684
1.764GluMet: 1.764 ± 0.352
3.308GluAsn: 3.308 ± 0.577
1.691GluPro: 1.691 ± 0.301
5.146GluGln: 5.146 ± 0.597
3.822GluArg: 3.822 ± 0.649
3.969GluSer: 3.969 ± 0.469
3.969GluThr: 3.969 ± 0.425
3.381GluVal: 3.381 ± 0.545
0.882GluTrp: 0.882 ± 0.25
1.764GluTyr: 1.764 ± 0.379
0.0GluXaa: 0.0 ± 0.0
Phe
2.279PheAla: 2.279 ± 0.417
0.588PheCys: 0.588 ± 0.192
2.793PheAsp: 2.793 ± 0.478
1.617PheGlu: 1.617 ± 0.329
1.397PhePhe: 1.397 ± 0.312
2.279PheGly: 2.279 ± 0.369
1.323PheHis: 1.323 ± 0.3
2.499PheIle: 2.499 ± 0.579
1.103PheLys: 1.103 ± 0.277
2.426PheLeu: 2.426 ± 0.372
0.735PheMet: 0.735 ± 0.174
1.176PheAsn: 1.176 ± 0.24
1.103PhePro: 1.103 ± 0.288
0.662PheGln: 0.662 ± 0.186
2.94PheArg: 2.94 ± 0.411
2.793PheSer: 2.793 ± 0.611
3.087PheThr: 3.087 ± 0.409
2.72PheVal: 2.72 ± 0.408
0.441PheTrp: 0.441 ± 0.138
1.176PheTyr: 1.176 ± 0.304
0.0PheXaa: 0.0 ± 0.0
Gly
5.072GlyAla: 5.072 ± 0.807
0.809GlyCys: 0.809 ± 0.234
4.116GlyAsp: 4.116 ± 0.438
4.852GlyGlu: 4.852 ± 0.552
2.426GlyPhe: 2.426 ± 0.392
5.146GlyGly: 5.146 ± 0.83
1.176GlyHis: 1.176 ± 0.325
3.969GlyIle: 3.969 ± 0.436
3.602GlyLys: 3.602 ± 0.539
5.366GlyLeu: 5.366 ± 0.561
2.793GlyMet: 2.793 ± 0.555
3.161GlyAsn: 3.161 ± 0.485
0.956GlyPro: 0.956 ± 0.215
3.161GlyGln: 3.161 ± 0.501
4.484GlyArg: 4.484 ± 0.563
3.822GlySer: 3.822 ± 0.538
4.19GlyThr: 4.19 ± 0.684
4.631GlyVal: 4.631 ± 0.539
1.25GlyTrp: 1.25 ± 0.276
2.867GlyTyr: 2.867 ± 0.485
0.0GlyXaa: 0.0 ± 0.0
His
1.25HisAla: 1.25 ± 0.327
0.294HisCys: 0.294 ± 0.14
0.956HisAsp: 0.956 ± 0.226
1.176HisGlu: 1.176 ± 0.343
1.103HisPhe: 1.103 ± 0.357
1.397HisGly: 1.397 ± 0.352
0.515HisHis: 0.515 ± 0.193
1.25HisIle: 1.25 ± 0.291
1.617HisLys: 1.617 ± 0.342
1.838HisLeu: 1.838 ± 0.358
0.221HisMet: 0.221 ± 0.123
0.809HisAsn: 0.809 ± 0.225
1.103HisPro: 1.103 ± 0.284
0.809HisGln: 0.809 ± 0.256
1.397HisArg: 1.397 ± 0.286
1.103HisSer: 1.103 ± 0.331
0.956HisThr: 0.956 ± 0.235
0.882HisVal: 0.882 ± 0.208
0.147HisTrp: 0.147 ± 0.1
0.956HisTyr: 0.956 ± 0.207
0.0HisXaa: 0.0 ± 0.0
Ile
5.146IleAla: 5.146 ± 0.623
0.735IleCys: 0.735 ± 0.229
3.675IleAsp: 3.675 ± 0.55
3.969IleGlu: 3.969 ± 0.583
1.323IlePhe: 1.323 ± 0.399
3.896IleGly: 3.896 ± 0.747
1.029IleHis: 1.029 ± 0.261
2.499IleIle: 2.499 ± 0.523
3.381IleLys: 3.381 ± 0.592
3.749IleLeu: 3.749 ± 0.568
0.882IleMet: 0.882 ± 0.22
2.499IleAsn: 2.499 ± 0.392
2.279IlePro: 2.279 ± 0.402
2.499IleGln: 2.499 ± 0.412
3.381IleArg: 3.381 ± 0.549
4.19IleSer: 4.19 ± 0.543
5.219IleThr: 5.219 ± 0.659
3.014IleVal: 3.014 ± 0.434
0.882IleTrp: 0.882 ± 0.245
1.544IleTyr: 1.544 ± 0.34
0.0IleXaa: 0.0 ± 0.0
Lys
5.293LysAla: 5.293 ± 0.816
0.956LysCys: 0.956 ± 0.295
2.867LysAsp: 2.867 ± 0.428
4.263LysGlu: 4.263 ± 0.436
1.617LysPhe: 1.617 ± 0.329
3.087LysGly: 3.087 ± 0.6
1.617LysHis: 1.617 ± 0.345
2.499LysIle: 2.499 ± 0.466
4.263LysLys: 4.263 ± 0.643
4.263LysLeu: 4.263 ± 0.651
1.544LysMet: 1.544 ± 0.382
2.573LysAsn: 2.573 ± 0.485
2.646LysPro: 2.646 ± 0.506
2.499LysGln: 2.499 ± 0.441
3.675LysArg: 3.675 ± 0.555
3.749LysSer: 3.749 ± 0.459
3.308LysThr: 3.308 ± 0.518
3.308LysVal: 3.308 ± 0.495
1.029LysTrp: 1.029 ± 0.246
1.911LysTyr: 1.911 ± 0.311
0.0LysXaa: 0.0 ± 0.0
Leu
8.527LeuAla: 8.527 ± 1.046
0.882LeuCys: 0.882 ± 0.273
3.749LeuAsp: 3.749 ± 0.514
4.704LeuGlu: 4.704 ± 0.649
2.793LeuPhe: 2.793 ± 0.469
3.896LeuGly: 3.896 ± 0.564
1.691LeuHis: 1.691 ± 0.351
4.043LeuIle: 4.043 ± 0.68
5.66LeuLys: 5.66 ± 0.739
5.954LeuLeu: 5.954 ± 0.662
1.691LeuMet: 1.691 ± 0.327
3.381LeuAsn: 3.381 ± 0.47
4.116LeuPro: 4.116 ± 0.426
2.793LeuGln: 2.793 ± 0.453
5.219LeuArg: 5.219 ± 0.565
6.983LeuSer: 6.983 ± 0.709
5.66LeuThr: 5.66 ± 0.82
4.778LeuVal: 4.778 ± 0.555
1.544LeuTrp: 1.544 ± 0.29
1.838LeuTyr: 1.838 ± 0.438
0.0LeuXaa: 0.0 ± 0.0
Met
3.381MetAla: 3.381 ± 0.647
0.221MetCys: 0.221 ± 0.125
1.103MetAsp: 1.103 ± 0.306
1.029MetGlu: 1.029 ± 0.301
1.029MetPhe: 1.029 ± 0.233
1.323MetGly: 1.323 ± 0.234
0.735MetHis: 0.735 ± 0.282
1.103MetIle: 1.103 ± 0.276
1.838MetLys: 1.838 ± 0.417
3.014MetLeu: 3.014 ± 0.488
0.809MetMet: 0.809 ± 0.259
0.956MetAsn: 0.956 ± 0.237
1.544MetPro: 1.544 ± 0.312
1.838MetGln: 1.838 ± 0.397
1.691MetArg: 1.691 ± 0.266
1.838MetSer: 1.838 ± 0.314
2.94MetThr: 2.94 ± 0.59
1.544MetVal: 1.544 ± 0.287
0.221MetTrp: 0.221 ± 0.096
0.809MetTyr: 0.809 ± 0.256
0.0MetXaa: 0.0 ± 0.0
Asn
3.161AsnAla: 3.161 ± 0.421
0.882AsnCys: 0.882 ± 0.243
1.985AsnAsp: 1.985 ± 0.285
2.573AsnGlu: 2.573 ± 0.385
0.882AsnPhe: 0.882 ± 0.263
4.116AsnGly: 4.116 ± 0.503
1.029AsnHis: 1.029 ± 0.282
2.646AsnIle: 2.646 ± 0.431
2.058AsnLys: 2.058 ± 0.4
2.573AsnLeu: 2.573 ± 0.389
1.029AsnMet: 1.029 ± 0.278
1.911AsnAsn: 1.911 ± 0.445
1.691AsnPro: 1.691 ± 0.297
1.47AsnGln: 1.47 ± 0.36
2.426AsnArg: 2.426 ± 0.606
2.499AsnSer: 2.499 ± 0.347
2.646AsnThr: 2.646 ± 0.44
2.132AsnVal: 2.132 ± 0.345
0.441AsnTrp: 0.441 ± 0.166
1.323AsnTyr: 1.323 ± 0.31
0.0AsnXaa: 0.0 ± 0.0
Pro
4.19ProAla: 4.19 ± 0.614
0.221ProCys: 0.221 ± 0.139
3.308ProAsp: 3.308 ± 0.515
2.279ProGlu: 2.279 ± 0.399
1.397ProPhe: 1.397 ± 0.306
3.161ProGly: 3.161 ± 0.409
0.809ProHis: 0.809 ± 0.22
1.25ProIle: 1.25 ± 0.372
2.205ProLys: 2.205 ± 0.452
3.014ProLeu: 3.014 ± 0.397
0.809ProMet: 0.809 ± 0.219
1.176ProAsn: 1.176 ± 0.243
2.058ProPro: 2.058 ± 0.547
1.176ProGln: 1.176 ± 0.259
1.691ProArg: 1.691 ± 0.392
2.279ProSer: 2.279 ± 0.504
1.911ProThr: 1.911 ± 0.428
3.161ProVal: 3.161 ± 0.424
0.588ProTrp: 0.588 ± 0.248
0.956ProTyr: 0.956 ± 0.261
0.0ProXaa: 0.0 ± 0.0
Gln
4.631GlnAla: 4.631 ± 0.711
0.662GlnCys: 0.662 ± 0.197
1.47GlnAsp: 1.47 ± 0.32
2.646GlnGlu: 2.646 ± 0.493
1.103GlnPhe: 1.103 ± 0.33
1.985GlnGly: 1.985 ± 0.428
0.515GlnHis: 0.515 ± 0.198
3.014GlnIle: 3.014 ± 0.368
3.014GlnLys: 3.014 ± 0.514
3.381GlnLeu: 3.381 ± 0.465
1.617GlnMet: 1.617 ± 0.355
1.47GlnAsn: 1.47 ± 0.323
1.47GlnPro: 1.47 ± 0.365
3.087GlnGln: 3.087 ± 0.648
2.94GlnArg: 2.94 ± 0.378
3.014GlnSer: 3.014 ± 0.442
2.573GlnThr: 2.573 ± 0.479
3.087GlnVal: 3.087 ± 0.421
0.515GlnTrp: 0.515 ± 0.239
1.617GlnTyr: 1.617 ± 0.3
0.0GlnXaa: 0.0 ± 0.0
Arg
4.631ArgAla: 4.631 ± 0.552
0.662ArgCys: 0.662 ± 0.241
3.822ArgAsp: 3.822 ± 0.656
5.587ArgGlu: 5.587 ± 0.717
1.985ArgPhe: 1.985 ± 0.386
4.263ArgGly: 4.263 ± 0.552
1.47ArgHis: 1.47 ± 0.312
3.749ArgIle: 3.749 ± 0.595
3.749ArgLys: 3.749 ± 0.604
5.293ArgLeu: 5.293 ± 0.505
2.646ArgMet: 2.646 ± 0.339
3.161ArgAsn: 3.161 ± 0.415
1.47ArgPro: 1.47 ± 0.38
3.161ArgGln: 3.161 ± 0.493
5.734ArgArg: 5.734 ± 1.078
3.014ArgSer: 3.014 ± 0.515
2.793ArgThr: 2.793 ± 0.371
3.528ArgVal: 3.528 ± 0.622
1.103ArgTrp: 1.103 ± 0.233
2.646ArgTyr: 2.646 ± 0.35
0.0ArgXaa: 0.0 ± 0.0
Ser
5.954SerAla: 5.954 ± 0.674
0.588SerCys: 0.588 ± 0.188
4.19SerAsp: 4.19 ± 0.601
4.337SerGlu: 4.337 ± 0.551
2.499SerPhe: 2.499 ± 0.398
6.616SerGly: 6.616 ± 0.734
1.103SerHis: 1.103 ± 0.232
3.014SerIle: 3.014 ± 0.533
2.499SerLys: 2.499 ± 0.378
5.146SerLeu: 5.146 ± 0.742
2.352SerMet: 2.352 ± 0.426
2.279SerAsn: 2.279 ± 0.487
2.205SerPro: 2.205 ± 0.419
3.087SerGln: 3.087 ± 0.393
4.925SerArg: 4.925 ± 0.574
3.675SerSer: 3.675 ± 0.417
3.602SerThr: 3.602 ± 0.456
5.293SerVal: 5.293 ± 0.768
0.735SerTrp: 0.735 ± 0.26
1.323SerTyr: 1.323 ± 0.315
0.0SerXaa: 0.0 ± 0.0
Thr
6.836ThrAla: 6.836 ± 0.854
0.662ThrCys: 0.662 ± 0.219
3.528ThrAsp: 3.528 ± 0.503
4.116ThrGlu: 4.116 ± 0.593
3.455ThrPhe: 3.455 ± 0.572
4.263ThrGly: 4.263 ± 0.569
1.029ThrHis: 1.029 ± 0.272
3.014ThrIle: 3.014 ± 0.516
2.72ThrLys: 2.72 ± 0.419
5.807ThrLeu: 5.807 ± 0.714
1.397ThrMet: 1.397 ± 0.324
1.397ThrAsn: 1.397 ± 0.286
3.749ThrPro: 3.749 ± 0.679
2.352ThrGln: 2.352 ± 0.403
2.94ThrArg: 2.94 ± 0.416
3.308ThrSer: 3.308 ± 0.554
3.161ThrThr: 3.161 ± 0.533
4.925ThrVal: 4.925 ± 0.99
1.103ThrTrp: 1.103 ± 0.271
1.911ThrTyr: 1.911 ± 0.403
0.0ThrXaa: 0.0 ± 0.0
Val
5.881ValAla: 5.881 ± 0.807
0.662ValCys: 0.662 ± 0.227
3.749ValAsp: 3.749 ± 0.467
3.675ValGlu: 3.675 ± 0.43
2.279ValPhe: 2.279 ± 0.335
3.969ValGly: 3.969 ± 0.661
0.809ValHis: 0.809 ± 0.225
3.969ValIle: 3.969 ± 0.541
4.116ValLys: 4.116 ± 0.614
5.44ValLeu: 5.44 ± 0.777
2.646ValMet: 2.646 ± 0.406
3.161ValAsn: 3.161 ± 0.526
2.573ValPro: 2.573 ± 0.423
2.352ValGln: 2.352 ± 0.652
3.528ValArg: 3.528 ± 0.584
5.146ValSer: 5.146 ± 0.655
4.263ValThr: 4.263 ± 0.72
4.484ValVal: 4.484 ± 0.537
0.662ValTrp: 0.662 ± 0.21
1.838ValTyr: 1.838 ± 0.447
0.0ValXaa: 0.0 ± 0.0
Trp
1.25TrpAla: 1.25 ± 0.23
0.368TrpCys: 0.368 ± 0.138
1.103TrpAsp: 1.103 ± 0.288
0.515TrpGlu: 0.515 ± 0.177
0.662TrpPhe: 0.662 ± 0.201
0.735TrpGly: 0.735 ± 0.201
0.441TrpHis: 0.441 ± 0.194
0.735TrpIle: 0.735 ± 0.278
1.103TrpLys: 1.103 ± 0.332
1.47TrpLeu: 1.47 ± 0.358
0.588TrpMet: 0.588 ± 0.171
0.662TrpAsn: 0.662 ± 0.182
0.662TrpPro: 0.662 ± 0.221
0.515TrpGln: 0.515 ± 0.185
1.103TrpArg: 1.103 ± 0.34
0.956TrpSer: 0.956 ± 0.23
0.735TrpThr: 0.735 ± 0.218
1.544TrpVal: 1.544 ± 0.343
0.294TrpTrp: 0.294 ± 0.173
0.515TrpTyr: 0.515 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.646TyrAla: 2.646 ± 0.442
0.441TyrCys: 0.441 ± 0.156
2.205TyrAsp: 2.205 ± 0.317
2.279TyrGlu: 2.279 ± 0.427
1.544TyrPhe: 1.544 ± 0.31
1.691TyrGly: 1.691 ± 0.285
0.588TyrHis: 0.588 ± 0.205
2.058TyrIle: 2.058 ± 0.321
1.103TyrLys: 1.103 ± 0.351
2.94TyrLeu: 2.94 ± 0.451
0.588TyrMet: 0.588 ± 0.219
0.956TyrAsn: 0.956 ± 0.199
0.882TyrPro: 0.882 ± 0.267
1.838TyrGln: 1.838 ± 0.295
2.352TyrArg: 2.352 ± 0.4
2.94TyrSer: 2.94 ± 0.466
1.764TyrThr: 1.764 ± 0.306
1.691TyrVal: 1.691 ± 0.311
0.441TyrTrp: 0.441 ± 0.148
1.176TyrTyr: 1.176 ± 0.312
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (13605 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski