Amino acid dipepetide frequency for Haemophilus phage HP1 (strain HP1c1) (Bacteriophage HP1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.781AlaAla: 6.781 ± 1.505
0.506AlaCys: 0.506 ± 0.228
4.757AlaAsp: 4.757 ± 0.665
8.096AlaGlu: 8.096 ± 0.953
3.441AlaPhe: 3.441 ± 0.722
4.251AlaGly: 4.251 ± 0.65
1.113AlaHis: 1.113 ± 0.364
4.453AlaIle: 4.453 ± 0.749
7.995AlaLys: 7.995 ± 0.855
8.4AlaLeu: 8.4 ± 1.458
1.518AlaMet: 1.518 ± 0.401
4.757AlaAsn: 4.757 ± 0.812
2.024AlaPro: 2.024 ± 0.4
3.239AlaGln: 3.239 ± 0.637
2.631AlaArg: 2.631 ± 0.419
3.34AlaSer: 3.34 ± 0.393
4.554AlaThr: 4.554 ± 0.483
5.769AlaVal: 5.769 ± 1.035
0.405AlaTrp: 0.405 ± 0.164
2.834AlaTyr: 2.834 ± 0.628
0.0AlaXaa: 0.0 ± 0.0
Cys
0.506CysAla: 0.506 ± 0.249
0.0CysCys: 0.0 ± 0.0
0.506CysAsp: 0.506 ± 0.233
0.607CysGlu: 0.607 ± 0.229
0.506CysPhe: 0.506 ± 0.207
0.607CysGly: 0.607 ± 0.251
0.202CysHis: 0.202 ± 0.159
1.113CysIle: 1.113 ± 0.353
0.304CysLys: 0.304 ± 0.169
0.708CysLeu: 0.708 ± 0.242
0.101CysMet: 0.101 ± 0.107
0.405CysAsn: 0.405 ± 0.187
0.405CysPro: 0.405 ± 0.202
0.101CysGln: 0.101 ± 0.09
0.304CysArg: 0.304 ± 0.204
0.81CysSer: 0.81 ± 0.263
0.304CysThr: 0.304 ± 0.173
1.012CysVal: 1.012 ± 0.304
0.101CysTrp: 0.101 ± 0.103
0.405CysTyr: 0.405 ± 0.192
0.0CysXaa: 0.0 ± 0.0
Asp
2.53AspAla: 2.53 ± 0.479
0.506AspCys: 0.506 ± 0.222
2.53AspAsp: 2.53 ± 0.456
4.251AspGlu: 4.251 ± 0.571
3.239AspPhe: 3.239 ± 0.519
4.352AspGly: 4.352 ± 0.682
0.911AspHis: 0.911 ± 0.28
3.137AspIle: 3.137 ± 0.472
3.745AspLys: 3.745 ± 0.595
5.465AspLeu: 5.465 ± 0.797
1.214AspMet: 1.214 ± 0.366
2.733AspAsn: 2.733 ± 0.508
2.125AspPro: 2.125 ± 0.611
1.619AspGln: 1.619 ± 0.418
1.923AspArg: 1.923 ± 0.36
4.149AspSer: 4.149 ± 0.817
2.328AspThr: 2.328 ± 0.416
4.554AspVal: 4.554 ± 0.6
0.506AspTrp: 0.506 ± 0.245
3.137AspTyr: 3.137 ± 0.677
0.0AspXaa: 0.0 ± 0.0
Glu
3.036GluAla: 3.036 ± 0.511
0.708GluCys: 0.708 ± 0.367
3.036GluAsp: 3.036 ± 0.523
4.858GluGlu: 4.858 ± 0.961
2.429GluPhe: 2.429 ± 0.411
2.125GluGly: 2.125 ± 0.44
2.125GluHis: 2.125 ± 0.449
5.465GluIle: 5.465 ± 0.847
5.87GluLys: 5.87 ± 0.894
7.388GluLeu: 7.388 ± 0.943
2.024GluMet: 2.024 ± 0.478
4.959GluAsn: 4.959 ± 0.718
1.316GluPro: 1.316 ± 0.326
3.947GluGln: 3.947 ± 0.647
3.542GluArg: 3.542 ± 0.761
3.846GluSer: 3.846 ± 0.548
4.554GluThr: 4.554 ± 0.79
3.745GluVal: 3.745 ± 0.651
1.113GluTrp: 1.113 ± 0.298
2.226GluTyr: 2.226 ± 0.412
0.0GluXaa: 0.0 ± 0.0
Phe
3.846PheAla: 3.846 ± 0.524
0.0PheCys: 0.0 ± 0.0
3.036PheAsp: 3.036 ± 0.495
2.53PheGlu: 2.53 ± 0.604
1.417PhePhe: 1.417 ± 0.402
3.036PheGly: 3.036 ± 0.749
1.113PheHis: 1.113 ± 0.35
3.441PheIle: 3.441 ± 0.499
2.429PheLys: 2.429 ± 0.503
3.137PheLeu: 3.137 ± 0.584
0.607PheMet: 0.607 ± 0.213
3.846PheAsn: 3.846 ± 0.616
1.316PhePro: 1.316 ± 0.332
1.619PheGln: 1.619 ± 0.402
1.923PheArg: 1.923 ± 0.458
3.34PheSer: 3.34 ± 0.622
2.226PheThr: 2.226 ± 0.489
2.53PheVal: 2.53 ± 0.561
0.911PheTrp: 0.911 ± 0.281
1.518PheTyr: 1.518 ± 0.428
0.0PheXaa: 0.0 ± 0.0
Gly
3.745GlyAla: 3.745 ± 0.694
0.506GlyCys: 0.506 ± 0.27
4.048GlyAsp: 4.048 ± 0.711
4.858GlyGlu: 4.858 ± 0.726
2.834GlyPhe: 2.834 ± 0.537
4.352GlyGly: 4.352 ± 0.745
0.607GlyHis: 0.607 ± 0.194
4.352GlyIle: 4.352 ± 0.848
4.655GlyLys: 4.655 ± 0.803
4.655GlyLeu: 4.655 ± 0.916
2.328GlyMet: 2.328 ± 0.495
4.453GlyAsn: 4.453 ± 0.628
0.202GlyPro: 0.202 ± 0.143
2.53GlyGln: 2.53 ± 0.428
2.935GlyArg: 2.935 ± 0.591
3.34GlySer: 3.34 ± 0.741
3.036GlyThr: 3.036 ± 0.671
5.465GlyVal: 5.465 ± 1.014
1.113GlyTrp: 1.113 ± 0.361
2.328GlyTyr: 2.328 ± 0.405
0.0GlyXaa: 0.0 ± 0.0
His
1.923HisAla: 1.923 ± 0.38
0.202HisCys: 0.202 ± 0.118
0.81HisAsp: 0.81 ± 0.266
0.911HisGlu: 0.911 ± 0.287
0.911HisPhe: 0.911 ± 0.325
1.214HisGly: 1.214 ± 0.28
0.506HisHis: 0.506 ± 0.222
1.214HisIle: 1.214 ± 0.307
1.214HisLys: 1.214 ± 0.362
1.822HisLeu: 1.822 ± 0.458
0.506HisMet: 0.506 ± 0.237
0.911HisAsn: 0.911 ± 0.328
0.405HisPro: 0.405 ± 0.195
1.214HisGln: 1.214 ± 0.466
1.113HisArg: 1.113 ± 0.335
0.81HisSer: 0.81 ± 0.267
1.012HisThr: 1.012 ± 0.374
0.911HisVal: 0.911 ± 0.317
0.405HisTrp: 0.405 ± 0.185
1.214HisTyr: 1.214 ± 0.271
0.0HisXaa: 0.0 ± 0.0
Ile
5.566IleAla: 5.566 ± 0.651
0.911IleCys: 0.911 ± 0.305
5.364IleAsp: 5.364 ± 0.763
4.959IleGlu: 4.959 ± 0.705
2.53IlePhe: 2.53 ± 0.423
4.251IleGly: 4.251 ± 0.972
0.607IleHis: 0.607 ± 0.229
4.251IleIle: 4.251 ± 0.719
4.959IleLys: 4.959 ± 0.632
4.554IleLeu: 4.554 ± 0.657
0.708IleMet: 0.708 ± 0.335
3.846IleAsn: 3.846 ± 0.602
2.328IlePro: 2.328 ± 0.432
2.834IleGln: 2.834 ± 0.419
3.643IleArg: 3.643 ± 0.715
3.745IleSer: 3.745 ± 0.634
3.947IleThr: 3.947 ± 0.642
2.935IleVal: 2.935 ± 0.465
0.304IleTrp: 0.304 ± 0.164
2.024IleTyr: 2.024 ± 0.363
0.0IleXaa: 0.0 ± 0.0
Lys
6.578LysAla: 6.578 ± 0.688
0.708LysCys: 0.708 ± 0.259
4.352LysAsp: 4.352 ± 0.592
4.655LysGlu: 4.655 ± 0.781
2.53LysPhe: 2.53 ± 0.305
4.048LysGly: 4.048 ± 0.671
1.417LysHis: 1.417 ± 0.385
4.655LysIle: 4.655 ± 0.778
5.566LysLys: 5.566 ± 0.848
7.287LysLeu: 7.287 ± 0.732
2.328LysMet: 2.328 ± 0.589
4.554LysAsn: 4.554 ± 0.716
1.619LysPro: 1.619 ± 0.348
4.251LysGln: 4.251 ± 0.78
4.048LysArg: 4.048 ± 0.927
4.858LysSer: 4.858 ± 0.664
4.757LysThr: 4.757 ± 0.799
4.554LysVal: 4.554 ± 0.655
1.923LysTrp: 1.923 ± 0.558
3.036LysTyr: 3.036 ± 0.579
0.0LysXaa: 0.0 ± 0.0
Leu
8.096LeuAla: 8.096 ± 0.929
1.012LeuCys: 1.012 ± 0.297
5.566LeuAsp: 5.566 ± 0.733
4.554LeuGlu: 4.554 ± 0.55
4.048LeuPhe: 4.048 ± 0.709
5.364LeuGly: 5.364 ± 1.217
1.822LeuHis: 1.822 ± 0.375
5.263LeuIle: 5.263 ± 0.897
7.388LeuLys: 7.388 ± 1.022
6.781LeuLeu: 6.781 ± 0.759
1.518LeuMet: 1.518 ± 0.511
5.06LeuAsn: 5.06 ± 0.871
4.251LeuPro: 4.251 ± 0.555
3.643LeuGln: 3.643 ± 0.454
4.048LeuArg: 4.048 ± 0.772
7.59LeuSer: 7.59 ± 0.602
5.971LeuThr: 5.971 ± 0.741
4.048LeuVal: 4.048 ± 0.815
0.405LeuTrp: 0.405 ± 0.159
2.429LeuTyr: 2.429 ± 0.524
0.0LeuXaa: 0.0 ± 0.0
Met
1.923MetAla: 1.923 ± 0.395
0.607MetCys: 0.607 ± 0.266
0.202MetAsp: 0.202 ± 0.118
1.619MetGlu: 1.619 ± 0.437
1.012MetPhe: 1.012 ± 0.325
1.316MetGly: 1.316 ± 0.327
0.202MetHis: 0.202 ± 0.128
1.316MetIle: 1.316 ± 0.397
1.417MetLys: 1.417 ± 0.462
2.631MetLeu: 2.631 ± 0.616
0.506MetMet: 0.506 ± 0.229
1.316MetAsn: 1.316 ± 0.4
0.81MetPro: 0.81 ± 0.22
0.911MetGln: 0.911 ± 0.25
1.113MetArg: 1.113 ± 0.349
1.518MetSer: 1.518 ± 0.527
1.417MetThr: 1.417 ± 0.384
1.417MetVal: 1.417 ± 0.336
0.607MetTrp: 0.607 ± 0.245
0.202MetTyr: 0.202 ± 0.134
0.0MetXaa: 0.0 ± 0.0
Asn
4.453AsnAla: 4.453 ± 0.684
0.506AsnCys: 0.506 ± 0.223
2.834AsnAsp: 2.834 ± 0.537
3.846AsnGlu: 3.846 ± 0.688
2.226AsnPhe: 2.226 ± 0.481
5.566AsnGly: 5.566 ± 0.768
1.214AsnHis: 1.214 ± 0.296
4.251AsnIle: 4.251 ± 0.506
4.453AsnLys: 4.453 ± 0.679
4.352AsnLeu: 4.352 ± 0.701
1.518AsnMet: 1.518 ± 0.286
2.935AsnAsn: 2.935 ± 0.572
2.429AsnPro: 2.429 ± 0.703
3.036AsnGln: 3.036 ± 0.558
3.137AsnArg: 3.137 ± 0.397
3.441AsnSer: 3.441 ± 0.614
2.631AsnThr: 2.631 ± 0.525
2.733AsnVal: 2.733 ± 0.531
1.214AsnTrp: 1.214 ± 0.306
1.619AsnTyr: 1.619 ± 0.361
0.0AsnXaa: 0.0 ± 0.0
Pro
2.226ProAla: 2.226 ± 0.435
0.304ProCys: 0.304 ± 0.165
1.822ProAsp: 1.822 ± 0.425
2.834ProGlu: 2.834 ± 0.661
2.328ProPhe: 2.328 ± 0.555
1.214ProGly: 1.214 ± 0.335
0.911ProHis: 0.911 ± 0.299
2.226ProIle: 2.226 ± 0.417
2.429ProLys: 2.429 ± 0.519
2.834ProLeu: 2.834 ± 0.533
0.81ProMet: 0.81 ± 0.267
1.923ProAsn: 1.923 ± 0.56
1.214ProPro: 1.214 ± 0.309
1.619ProGln: 1.619 ± 0.394
0.708ProArg: 0.708 ± 0.232
1.72ProSer: 1.72 ± 0.345
1.822ProThr: 1.822 ± 0.361
1.417ProVal: 1.417 ± 0.439
0.405ProTrp: 0.405 ± 0.168
1.012ProTyr: 1.012 ± 0.305
0.0ProXaa: 0.0 ± 0.0
Gln
4.655GlnAla: 4.655 ± 0.665
0.101GlnCys: 0.101 ± 0.083
0.81GlnAsp: 0.81 ± 0.302
2.733GlnGlu: 2.733 ± 0.481
2.024GlnPhe: 2.024 ± 0.497
2.935GlnGly: 2.935 ± 0.554
0.708GlnHis: 0.708 ± 0.223
3.239GlnIle: 3.239 ± 0.52
4.352GlnLys: 4.352 ± 0.674
3.947GlnLeu: 3.947 ± 0.683
1.417GlnMet: 1.417 ± 0.397
3.036GlnAsn: 3.036 ± 0.635
1.417GlnPro: 1.417 ± 0.282
3.137GlnGln: 3.137 ± 0.573
2.834GlnArg: 2.834 ± 0.701
3.036GlnSer: 3.036 ± 0.504
2.328GlnThr: 2.328 ± 0.536
2.226GlnVal: 2.226 ± 0.493
0.607GlnTrp: 0.607 ± 0.258
1.316GlnTyr: 1.316 ± 0.282
0.0GlnXaa: 0.0 ± 0.0
Arg
3.846ArgAla: 3.846 ± 0.675
0.506ArgCys: 0.506 ± 0.205
2.733ArgAsp: 2.733 ± 0.466
3.745ArgGlu: 3.745 ± 0.688
1.619ArgPhe: 1.619 ± 0.357
1.923ArgGly: 1.923 ± 0.424
0.81ArgHis: 0.81 ± 0.274
2.834ArgIle: 2.834 ± 0.575
3.947ArgLys: 3.947 ± 0.861
4.858ArgLeu: 4.858 ± 0.912
0.708ArgMet: 0.708 ± 0.232
2.935ArgAsn: 2.935 ± 0.503
1.113ArgPro: 1.113 ± 0.346
2.024ArgGln: 2.024 ± 0.401
2.429ArgArg: 2.429 ± 0.574
2.429ArgSer: 2.429 ± 0.437
3.239ArgThr: 3.239 ± 0.45
1.822ArgVal: 1.822 ± 0.372
0.81ArgTrp: 0.81 ± 0.202
2.226ArgTyr: 2.226 ± 0.532
0.0ArgXaa: 0.0 ± 0.0
Ser
6.072SerAla: 6.072 ± 0.876
0.202SerCys: 0.202 ± 0.128
4.149SerAsp: 4.149 ± 0.585
3.542SerGlu: 3.542 ± 0.592
3.441SerPhe: 3.441 ± 0.534
5.364SerGly: 5.364 ± 0.877
1.113SerHis: 1.113 ± 0.312
3.745SerIle: 3.745 ± 0.66
4.149SerLys: 4.149 ± 0.665
5.667SerLeu: 5.667 ± 0.912
1.012SerMet: 1.012 ± 0.327
3.542SerAsn: 3.542 ± 0.609
2.328SerPro: 2.328 ± 0.562
2.226SerGln: 2.226 ± 0.411
2.53SerArg: 2.53 ± 0.418
3.542SerSer: 3.542 ± 0.609
3.036SerThr: 3.036 ± 0.417
3.846SerVal: 3.846 ± 0.805
0.506SerTrp: 0.506 ± 0.219
1.72SerTyr: 1.72 ± 0.343
0.0SerXaa: 0.0 ± 0.0
Thr
5.465ThrAla: 5.465 ± 0.61
0.708ThrCys: 0.708 ± 0.294
2.935ThrAsp: 2.935 ± 0.582
3.745ThrGlu: 3.745 ± 0.56
2.125ThrPhe: 2.125 ± 0.409
3.441ThrGly: 3.441 ± 0.484
1.518ThrHis: 1.518 ± 0.443
3.34ThrIle: 3.34 ± 0.608
4.959ThrLys: 4.959 ± 0.635
4.858ThrLeu: 4.858 ± 0.671
1.012ThrMet: 1.012 ± 0.397
2.125ThrAsn: 2.125 ± 0.367
2.53ThrPro: 2.53 ± 0.409
3.137ThrGln: 3.137 ± 0.578
1.72ThrArg: 1.72 ± 0.48
3.137ThrSer: 3.137 ± 0.586
2.834ThrThr: 2.834 ± 0.636
4.251ThrVal: 4.251 ± 0.728
0.304ThrTrp: 0.304 ± 0.143
1.923ThrTyr: 1.923 ± 0.53
0.0ThrXaa: 0.0 ± 0.0
Val
5.161ValAla: 5.161 ± 0.87
0.405ValCys: 0.405 ± 0.2
3.036ValAsp: 3.036 ± 0.462
4.554ValGlu: 4.554 ± 0.62
2.328ValPhe: 2.328 ± 0.456
4.352ValGly: 4.352 ± 1.051
1.012ValHis: 1.012 ± 0.293
2.935ValIle: 2.935 ± 0.557
4.251ValLys: 4.251 ± 0.581
4.251ValLeu: 4.251 ± 0.765
1.214ValMet: 1.214 ± 0.337
3.036ValAsn: 3.036 ± 0.503
2.733ValPro: 2.733 ± 0.532
2.631ValGln: 2.631 ± 0.683
3.34ValArg: 3.34 ± 0.532
4.352ValSer: 4.352 ± 0.637
4.149ValThr: 4.149 ± 0.711
2.935ValVal: 2.935 ± 0.572
1.012ValTrp: 1.012 ± 0.253
2.024ValTyr: 2.024 ± 0.352
0.0ValXaa: 0.0 ± 0.0
Trp
1.417TrpAla: 1.417 ± 0.316
0.101TrpCys: 0.101 ± 0.083
0.506TrpAsp: 0.506 ± 0.262
0.405TrpGlu: 0.405 ± 0.213
0.708TrpPhe: 0.708 ± 0.309
0.405TrpGly: 0.405 ± 0.141
0.607TrpHis: 0.607 ± 0.23
1.214TrpIle: 1.214 ± 0.295
1.012TrpLys: 1.012 ± 0.374
1.316TrpLeu: 1.316 ± 0.37
0.202TrpMet: 0.202 ± 0.144
0.202TrpAsn: 0.202 ± 0.137
0.202TrpPro: 0.202 ± 0.149
1.316TrpGln: 1.316 ± 0.312
0.81TrpArg: 0.81 ± 0.405
0.911TrpSer: 0.911 ± 0.305
0.607TrpThr: 0.607 ± 0.222
0.708TrpVal: 0.708 ± 0.258
0.304TrpTrp: 0.304 ± 0.158
0.101TrpTyr: 0.101 ± 0.107
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.036TyrAla: 3.036 ± 0.546
0.506TyrCys: 0.506 ± 0.228
2.024TyrAsp: 2.024 ± 0.493
1.012TyrGlu: 1.012 ± 0.35
2.125TyrPhe: 2.125 ± 0.44
2.226TyrGly: 2.226 ± 0.456
0.708TyrHis: 0.708 ± 0.262
1.822TyrIle: 1.822 ± 0.404
2.631TyrLys: 2.631 ± 0.447
3.947TyrLeu: 3.947 ± 0.718
0.708TyrMet: 0.708 ± 0.249
1.822TyrAsn: 1.822 ± 0.381
1.012TyrPro: 1.012 ± 0.35
1.822TyrGln: 1.822 ± 0.345
1.822TyrArg: 1.822 ± 0.405
1.822TyrSer: 1.822 ± 0.352
1.316TyrThr: 1.316 ± 0.38
2.834TyrVal: 2.834 ± 0.54
0.101TyrTrp: 0.101 ± 0.09
1.113TyrTyr: 1.113 ± 0.305
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41 proteins (9882 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski