Amino acid dipeptide frequency for Escherichia phage HK629

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
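A minimal sketch of how such frequencies could be computed is given below. It assumes that overlapping adjacent pairs are counted within each protein, never across protein boundaries, and that counts are pooled over the whole proteome. The function names and the toy sequences are hypothetical and are not taken from Proteome-pI. The assumption agrees with the statistics on this page: 69 proteins with 14364 residues give 14364 - 69 = 14295 pairs, and a single occurrence then corresponds to about 0.07 per mille, which is the smallest non-zero value in the table.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs, pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown residue is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # zip(seq, seq[1:]) yields pairs inside one protein only,
        # so no pair spans two different proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all counted pairs."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration only (not HK629 proteins).
proteins = ["MAAK", "AAC"]
counts = dipeptide_counts(proteins)
freqs = dipeptide_per_mille(proteins)
```

With the toy input there are five pairs in total (MA, AA, AK, AA, AC), so AA accounts for two of the five.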

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
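The expectation and the over/under comparison can be sketched as follows. This is an illustration only: the page does not state the exact rule behind the red and blue marking, so a plain comparison against the uniform expectation is assumed here, and the function names are hypothetical. The three sample values are copied from the table.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency (per mille) if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()  # 1000 / 400 = 2.5 per mille

# Values taken from the table on this page, in per mille.
sample = {"AlaAla": 12.671, "AlaCys": 0.348, "AspPro": 2.506}
labels = {pair: classify(value, expected) for pair, value in sample.items()}
```

If Xaa is counted as a 21st residue, `uniform_expectation_per_mille(21)` gives roughly 2.27 per mille instead of 2.5.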

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 12.671‰ corresponds to a fraction of 0.012671.

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue
Ala
AlaAla: 12.671 ± 2.287
AlaCys: 0.348 ± 0.151
AlaAsp: 5.152 ± 0.552
AlaGlu: 6.614 ± 0.906
AlaPhe: 3.62 ± 0.484
AlaGly: 6.893 ± 1.051
AlaHis: 2.089 ± 0.697
AlaIle: 5.431 ± 0.676
AlaLys: 5.152 ± 0.687
AlaLeu: 6.684 ± 0.776
AlaMet: 2.506 ± 0.401
AlaAsn: 2.855 ± 0.522
AlaPro: 2.367 ± 0.389
AlaGln: 3.76 ± 0.839
AlaArg: 6.336 ± 0.926
AlaSer: 7.45 ± 1.382
AlaThr: 5.709 ± 0.955
AlaVal: 6.196 ± 0.8
AlaTrp: 2.089 ± 0.397
AlaTyr: 3.412 ± 0.422
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.184 ± 0.295
CysCys: 0.487 ± 0.229
CysAsp: 0.766 ± 0.209
CysGlu: 0.627 ± 0.22
CysPhe: 0.209 ± 0.114
CysGly: 0.905 ± 0.307
CysHis: 0.209 ± 0.113
CysIle: 0.766 ± 0.254
CysLys: 0.487 ± 0.183
CysLeu: 0.627 ± 0.19
CysMet: 0.139 ± 0.081
CysAsn: 0.418 ± 0.142
CysPro: 0.487 ± 0.229
CysGln: 0.348 ± 0.152
CysArg: 0.835 ± 0.292
CysSer: 1.184 ± 0.275
CysThr: 0.627 ± 0.238
CysVal: 0.696 ± 0.245
CysTrp: 0.278 ± 0.114
CysTyr: 0.278 ± 0.145
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.057 ± 0.856
AspCys: 0.905 ± 0.257
AspAsp: 4.456 ± 0.507
AspGlu: 3.62 ± 0.586
AspPhe: 1.81 ± 0.273
AspGly: 4.595 ± 0.672
AspHis: 0.766 ± 0.199
AspIle: 3.62 ± 0.47
AspLys: 3.063 ± 0.438
AspLeu: 3.829 ± 0.512
AspMet: 1.323 ± 0.334
AspAsn: 2.367 ± 0.391
AspPro: 2.506 ± 0.586
AspGln: 0.975 ± 0.275
AspArg: 2.855 ± 0.436
AspSer: 3.481 ± 0.488
AspThr: 3.203 ± 0.489
AspVal: 3.829 ± 0.529
AspTrp: 1.044 ± 0.295
AspTyr: 2.367 ± 0.395
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.823 ± 0.928
GluCys: 0.557 ± 0.212
GluAsp: 3.133 ± 0.508
GluGlu: 4.177 ± 0.635
GluPhe: 2.228 ± 0.438
GluGly: 3.412 ± 0.51
GluHis: 1.184 ± 0.313
GluIle: 3.133 ± 0.405
GluLys: 4.386 ± 0.516
GluLeu: 6.405 ± 0.756
GluMet: 1.88 ± 0.416
GluAsn: 3.551 ± 0.532
GluPro: 1.81 ± 0.393
GluGln: 4.317 ± 0.601
GluArg: 3.62 ± 0.574
GluSer: 3.62 ± 0.497
GluThr: 4.595 ± 0.544
GluVal: 3.412 ± 0.541
GluTrp: 0.905 ± 0.24
GluTyr: 1.81 ± 0.325
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.88 ± 0.379
PheCys: 0.418 ± 0.151
PheAsp: 2.715 ± 0.436
PheGlu: 1.462 ± 0.325
PhePhe: 1.114 ± 0.266
PheGly: 2.089 ± 0.327
PheHis: 0.975 ± 0.234
PheIle: 2.089 ± 0.48
PheLys: 1.253 ± 0.287
PheLeu: 2.506 ± 0.447
PheMet: 0.975 ± 0.219
PheAsn: 1.044 ± 0.248
PhePro: 0.975 ± 0.264
PheGln: 0.835 ± 0.209
PheArg: 2.437 ± 0.501
PheSer: 2.646 ± 0.566
PheThr: 3.342 ± 0.402
PheVal: 2.506 ± 0.387
PheTrp: 0.418 ± 0.148
PheTyr: 1.114 ± 0.303
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.5 ± 0.954
GlyCys: 1.044 ± 0.254
GlyAsp: 4.177 ± 0.413
GlyGlu: 5.013 ± 0.591
GlyPhe: 2.158 ± 0.383
GlyGly: 4.734 ± 0.903
GlyHis: 0.835 ± 0.286
GlyIle: 3.969 ± 0.504
GlyLys: 4.177 ± 0.548
GlyLeu: 4.734 ± 0.602
GlyMet: 2.576 ± 0.434
GlyAsn: 3.272 ± 0.585
GlyPro: 0.975 ± 0.182
GlyGln: 3.342 ± 0.471
GlyArg: 4.317 ± 0.465
GlySer: 4.247 ± 0.587
GlyThr: 4.804 ± 0.795
GlyVal: 4.456 ± 0.567
GlyTrp: 1.253 ± 0.275
GlyTyr: 3.342 ± 0.554
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.671 ± 0.358
HisCys: 0.278 ± 0.138
HisAsp: 0.905 ± 0.236
HisGlu: 0.835 ± 0.256
HisPhe: 0.975 ± 0.344
HisGly: 1.184 ± 0.25
HisHis: 0.418 ± 0.191
HisIle: 1.253 ± 0.294
HisLys: 0.975 ± 0.276
HisLeu: 1.671 ± 0.3
HisMet: 0.209 ± 0.121
HisAsn: 0.696 ± 0.202
HisPro: 1.044 ± 0.245
HisGln: 0.627 ± 0.212
HisArg: 1.462 ± 0.321
HisSer: 1.392 ± 0.435
HisThr: 1.114 ± 0.388
HisVal: 0.975 ± 0.244
HisTrp: 0.209 ± 0.118
HisTyr: 0.696 ± 0.223
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.526 ± 0.555
IleCys: 0.905 ± 0.225
IleAsp: 3.481 ± 0.516
IleGlu: 3.551 ± 0.588
IlePhe: 1.253 ± 0.346
IleGly: 4.247 ± 0.677
IleHis: 0.975 ± 0.276
IleIle: 2.158 ± 0.556
IleLys: 3.272 ± 0.635
IleLeu: 3.551 ± 0.481
IleMet: 0.905 ± 0.251
IleAsn: 2.924 ± 0.569
IlePro: 2.298 ± 0.315
IleGln: 2.367 ± 0.394
IleArg: 3.551 ± 0.472
IleSer: 4.247 ± 0.512
IleThr: 4.386 ± 0.625
IleVal: 3.063 ± 0.466
IleTrp: 0.627 ± 0.252
IleTyr: 1.323 ± 0.316
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.013 ± 0.777
LysCys: 1.044 ± 0.336
LysAsp: 2.646 ± 0.387
LysGlu: 4.526 ± 0.517
LysPhe: 1.88 ± 0.348
LysGly: 3.412 ± 0.524
LysHis: 1.114 ± 0.274
LysIle: 3.133 ± 0.534
LysLys: 4.177 ± 0.652
LysLeu: 3.829 ± 0.598
LysMet: 1.88 ± 0.375
LysAsn: 2.924 ± 0.571
LysPro: 2.715 ± 0.511
LysGln: 2.298 ± 0.392
LysArg: 3.62 ± 0.488
LysSer: 4.108 ± 0.539
LysThr: 3.412 ± 0.469
LysVal: 3.203 ± 0.495
LysTrp: 1.184 ± 0.286
LysTyr: 2.019 ± 0.339
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.355 ± 0.876
LeuCys: 0.696 ± 0.198
LeuAsp: 4.038 ± 0.562
LeuGlu: 4.665 ± 0.704
LeuPhe: 2.437 ± 0.425
LeuGly: 3.829 ± 0.577
LeuHis: 1.392 ± 0.339
LeuIle: 3.969 ± 0.618
LeuLys: 5.709 ± 0.591
LeuLeu: 5.361 ± 0.548
LeuMet: 1.88 ± 0.337
LeuAsn: 3.133 ± 0.472
LeuPro: 4.456 ± 0.541
LeuGln: 2.646 ± 0.464
LeuArg: 5.083 ± 0.57
LeuSer: 7.102 ± 0.562
LeuThr: 5.57 ± 0.848
LeuVal: 4.456 ± 0.548
LeuTrp: 1.532 ± 0.257
LeuTyr: 1.81 ± 0.379
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.063 ± 0.595
MetCys: 0.278 ± 0.142
MetAsp: 1.044 ± 0.262
MetGlu: 1.253 ± 0.284
MetPhe: 1.114 ± 0.221
MetGly: 2.019 ± 0.368
MetHis: 0.278 ± 0.197
MetIle: 1.253 ± 0.372
MetLys: 1.741 ± 0.331
MetLeu: 2.785 ± 0.431
MetMet: 0.696 ± 0.241
MetAsn: 1.184 ± 0.273
MetPro: 1.671 ± 0.346
MetGln: 1.671 ± 0.38
MetArg: 1.88 ± 0.296
MetSer: 1.81 ± 0.366
MetThr: 2.646 ± 0.515
MetVal: 1.671 ± 0.311
MetTrp: 0.209 ± 0.106
MetTyr: 0.835 ± 0.287
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.481 ± 0.63
AsnCys: 0.975 ± 0.264
AsnAsp: 2.228 ± 0.352
AsnGlu: 2.646 ± 0.425
AsnPhe: 0.905 ± 0.267
AsnGly: 4.247 ± 0.561
AsnHis: 0.835 ± 0.226
AsnIle: 2.367 ± 0.405
AsnLys: 2.228 ± 0.428
AsnLeu: 2.855 ± 0.447
AsnMet: 1.044 ± 0.273
AsnAsn: 2.437 ± 0.531
AsnPro: 2.158 ± 0.29
AsnGln: 0.766 ± 0.267
AsnArg: 2.924 ± 0.721
AsnSer: 2.924 ± 0.478
AsnThr: 2.855 ± 0.532
AsnVal: 2.089 ± 0.332
AsnTrp: 0.627 ± 0.185
AsnTyr: 1.392 ± 0.309
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.969 ± 0.609
ProCys: 0.07 ± 0.068
ProAsp: 3.272 ± 0.507
ProGlu: 2.924 ± 0.463
ProPhe: 1.114 ± 0.309
ProGly: 2.994 ± 0.387
ProHis: 0.696 ± 0.21
ProIle: 1.392 ± 0.387
ProLys: 2.089 ± 0.356
ProLeu: 2.855 ± 0.472
ProMet: 1.253 ± 0.324
ProAsn: 1.323 ± 0.271
ProPro: 2.228 ± 0.539
ProGln: 1.184 ± 0.255
ProArg: 1.741 ± 0.336
ProSer: 2.715 ± 0.484
ProThr: 1.949 ± 0.374
ProVal: 2.924 ± 0.418
ProTrp: 0.418 ± 0.168
ProTyr: 0.835 ± 0.22
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.177 ± 0.781
GlnCys: 0.418 ± 0.156
GlnAsp: 1.184 ± 0.247
GlnGlu: 2.785 ± 0.496
GlnPhe: 1.044 ± 0.327
GlnGly: 2.089 ± 0.385
GlnHis: 0.557 ± 0.226
GlnIle: 2.715 ± 0.339
GlnLys: 2.576 ± 0.481
GlnLeu: 3.62 ± 0.405
GlnMet: 1.88 ± 0.374
GlnAsn: 1.462 ± 0.273
GlnPro: 1.323 ± 0.279
GlnGln: 2.437 ± 0.608
GlnArg: 2.785 ± 0.448
GlnSer: 2.855 ± 0.442
GlnThr: 2.576 ± 0.528
GlnVal: 2.715 ± 0.443
GlnTrp: 0.418 ± 0.203
GlnTyr: 1.462 ± 0.256
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.083 ± 0.56
ArgCys: 0.557 ± 0.208
ArgAsp: 3.342 ± 0.629
ArgGlu: 5.152 ± 0.766
ArgPhe: 1.671 ± 0.35
ArgGly: 4.526 ± 0.53
ArgHis: 1.462 ± 0.299
ArgIle: 3.76 ± 0.574
ArgLys: 4.317 ± 0.523
ArgLeu: 5.431 ± 0.62
ArgMet: 2.576 ± 0.381
ArgAsn: 2.924 ± 0.448
ArgPro: 1.81 ± 0.316
ArgGln: 3.133 ± 0.51
ArgArg: 5.709 ± 0.95
ArgSer: 2.994 ± 0.43
ArgThr: 2.855 ± 0.375
ArgVal: 3.272 ± 0.616
ArgTrp: 1.184 ± 0.251
ArgTyr: 2.367 ± 0.341
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.31 ± 1.775
SerCys: 0.627 ± 0.214
SerAsp: 3.829 ± 0.485
SerGlu: 4.665 ± 0.694
SerPhe: 2.646 ± 0.369
SerGly: 7.171 ± 0.841
SerHis: 1.253 ± 0.348
SerIle: 2.855 ± 0.457
SerLys: 2.994 ± 0.464
SerLeu: 5.361 ± 0.708
SerMet: 2.298 ± 0.369
SerAsn: 2.298 ± 0.497
SerPro: 2.019 ± 0.368
SerGln: 3.203 ± 0.429
SerArg: 5.152 ± 0.673
SerSer: 4.038 ± 0.744
SerThr: 4.456 ± 0.726
SerVal: 5.083 ± 0.716
SerTrp: 0.696 ± 0.25
SerTyr: 1.671 ± 0.366
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.38 ± 0.699
ThrCys: 0.766 ± 0.23
ThrAsp: 3.69 ± 0.4
ThrGlu: 4.386 ± 0.645
ThrPhe: 2.994 ± 0.456
ThrGly: 4.386 ± 0.591
ThrHis: 1.392 ± 0.335
ThrIle: 3.272 ± 0.521
ThrLys: 2.715 ± 0.364
ThrLeu: 5.431 ± 0.569
ThrMet: 1.253 ± 0.358
ThrAsn: 2.089 ± 0.537
ThrPro: 3.272 ± 0.565
ThrGln: 2.506 ± 0.43
ThrArg: 2.646 ± 0.318
ThrSer: 3.62 ± 0.758
ThrThr: 3.481 ± 0.499
ThrVal: 5.431 ± 0.921
ThrTrp: 0.975 ± 0.246
ThrTyr: 2.019 ± 0.394
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.779 ± 0.635
ValCys: 0.627 ± 0.267
ValAsp: 3.69 ± 0.416
ValGlu: 3.133 ± 0.421
ValPhe: 1.88 ± 0.319
ValGly: 3.551 ± 0.607
ValHis: 0.766 ± 0.203
ValIle: 3.76 ± 0.503
ValLys: 4.665 ± 0.595
ValLeu: 5.709 ± 0.721
ValMet: 2.646 ± 0.476
ValAsn: 3.272 ± 0.422
ValPro: 2.158 ± 0.432
ValGln: 2.367 ± 0.563
ValArg: 3.272 ± 0.5
ValSer: 5.291 ± 0.664
ValThr: 3.969 ± 0.618
ValVal: 5.083 ± 0.532
ValTrp: 0.627 ± 0.197
ValTyr: 1.81 ± 0.361
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.044 ± 0.235
TrpCys: 0.209 ± 0.12
TrpAsp: 1.114 ± 0.297
TrpGlu: 0.835 ± 0.24
TrpPhe: 0.348 ± 0.132
TrpGly: 0.418 ± 0.164
TrpHis: 0.348 ± 0.154
TrpIle: 0.557 ± 0.223
TrpLys: 1.114 ± 0.291
TrpLeu: 1.81 ± 0.324
TrpMet: 0.696 ± 0.185
TrpAsn: 0.627 ± 0.18
TrpPro: 0.557 ± 0.164
TrpGln: 0.487 ± 0.186
TrpArg: 1.671 ± 0.383
TrpSer: 1.044 ± 0.283
TrpThr: 0.696 ± 0.237
TrpVal: 1.253 ± 0.283
TrpTrp: 0.209 ± 0.141
TrpTyr: 0.557 ± 0.181
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.367 ± 0.436
TyrCys: 0.348 ± 0.15
TyrAsp: 1.81 ± 0.32
TyrGlu: 2.089 ± 0.384
TyrPhe: 1.392 ± 0.275
TyrGly: 2.228 ± 0.442
TyrHis: 1.044 ± 0.27
TyrIle: 2.158 ± 0.399
TyrLys: 1.044 ± 0.276
TyrLeu: 3.063 ± 0.487
TyrMet: 0.418 ± 0.156
TyrAsn: 1.184 ± 0.227
TyrPro: 1.114 ± 0.283
TyrGln: 1.81 ± 0.312
TyrArg: 2.298 ± 0.434
TyrSer: 2.855 ± 0.459
TyrThr: 1.462 ± 0.315
TyrVal: 1.741 ± 0.344
TyrTrp: 0.696 ± 0.219
TyrTyr: 0.975 ± 0.274
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (14364 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
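A protein-level bootstrap of this kind could look roughly like the sketch below. Several points are assumptions rather than details given on this page: whole proteins are resampled with replacement, the reported error is taken to be the standard deviation over the replicates, and the function names, the seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_freqs(proteins):
    """Dipeptide frequencies (per mille) pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_freqs(resample).get(pair, 0.0))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy sequences for illustration only (not HK629 proteins).
proteins = ["MAAKAA", "MKKL", "AAAC", "MLLKA"]
err = bootstrap_error(proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each dipeptide inside its original sequence context, so proteins with unusual composition contribute to the spread as a unit.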

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski