Amino acid dipeptide frequency for Escherichia phage ZG49

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
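The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual pipeline: the function name `dipeptide_permille`, the toy sequences, and the choice to count overlapping pairs within each protein separately (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids; everything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein strings."""
    counts = Counter()
    for seq in proteins:
        # Collapse non-standard or unknown residues into X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, counted inside each protein only
        # (an assumption of this sketch).
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 441 ordered pairs, including those never observed.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


toy = ["MAAK", "MAXA"]  # made-up sequences, not taken from phage ZG49
freqs = dipeptide_permille(toy)
print(len(freqs))               # 441
print(round(freqs["MA"], 3))    # 333.333 (2 of the 6 dipeptides are "MA")
print(1000.0 / 400)             # 2.5, the uniform expectation in per mille
```

The one-letter codes are used here only for brevity; the table on this page uses the three-letter names.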

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
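As a small worked example of the unit conversion, the sketch below turns a few per mille values from the table into plain fractions and compares them with the uniform expectation of 2.5‰ (the 0.25% quoted above). The helper names and the simple greater-than/less-than rule are assumptions for illustration; the page does not state the exact rule behind its red and blue marking.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille for 400 equally likely dipeptides


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, baseline=UNIFORM_PERMILLE):
    """Label a per mille value relative to the uniform baseline (assumed rule)."""
    if value > baseline:
        return "over"
    if value < baseline:
        return "under"
    return "equal"


# Values copied from the table on this page.
examples = {"AlaAla": 9.524, "AspPhe": 2.551, "CysCys": 0.0}
for name, value in examples.items():
    print(name, permille_to_fraction(value), classify(value))
```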

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.524 ± 1.145
AlaCys: 0.935 ± 0.328
AlaAsp: 5.782 ± 0.661
AlaGlu: 5.102 ± 0.797
AlaPhe: 3.231 ± 0.473
AlaGly: 7.653 ± 1.077
AlaHis: 1.19 ± 0.34
AlaIle: 5.187 ± 0.768
AlaLys: 5.697 ± 0.797
AlaLeu: 7.398 ± 1.001
AlaMet: 2.466 ± 0.458
AlaAsn: 3.316 ± 0.404
AlaPro: 3.061 ± 0.558
AlaGln: 2.891 ± 0.562
AlaArg: 3.656 ± 0.521
AlaSer: 5.527 ± 0.617
AlaThr: 4.167 ± 0.72
AlaVal: 6.463 ± 1.069
AlaTrp: 1.701 ± 0.408
AlaTyr: 2.466 ± 0.585
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.68 ± 0.25
CysCys: 0.0 ± 0.0
CysAsp: 0.68 ± 0.322
CysGlu: 0.595 ± 0.212
CysPhe: 0.935 ± 0.375
CysGly: 0.425 ± 0.185
CysHis: 0.255 ± 0.15
CysIle: 0.34 ± 0.173
CysLys: 0.51 ± 0.236
CysLeu: 1.105 ± 0.338
CysMet: 0.425 ± 0.249
CysAsn: 0.17 ± 0.115
CysPro: 0.595 ± 0.331
CysGln: 0.255 ± 0.146
CysArg: 0.34 ± 0.16
CysSer: 0.51 ± 0.245
CysThr: 0.085 ± 0.097
CysVal: 0.51 ± 0.209
CysTrp: 0.085 ± 0.079
CysTyr: 0.17 ± 0.136
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.313 ± 0.634
AspCys: 0.935 ± 0.446
AspAsp: 4.847 ± 0.752
AspGlu: 3.741 ± 0.524
AspPhe: 2.551 ± 0.48
AspGly: 6.973 ± 0.741
AspHis: 1.105 ± 0.294
AspIle: 3.146 ± 0.469
AspLys: 3.486 ± 0.628
AspLeu: 5.357 ± 0.626
AspMet: 1.786 ± 0.375
AspAsn: 2.126 ± 0.46
AspPro: 2.636 ± 0.665
AspGln: 2.211 ± 0.47
AspArg: 2.296 ± 0.399
AspSer: 3.146 ± 0.475
AspThr: 4.167 ± 0.648
AspVal: 4.847 ± 0.477
AspTrp: 0.85 ± 0.347
AspTyr: 2.381 ± 0.466
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.548 ± 0.984
GluCys: 0.34 ± 0.169
GluAsp: 4.507 ± 0.639
GluGlu: 4.167 ± 0.696
GluPhe: 2.721 ± 0.519
GluGly: 4.932 ± 0.791
GluHis: 1.02 ± 0.23
GluIle: 2.891 ± 0.536
GluLys: 3.571 ± 0.732
GluLeu: 5.357 ± 0.781
GluMet: 2.466 ± 0.522
GluAsn: 2.296 ± 0.516
GluPro: 2.126 ± 0.448
GluGln: 2.381 ± 0.534
GluArg: 3.741 ± 0.496
GluSer: 4.252 ± 0.63
GluThr: 3.571 ± 0.421
GluVal: 4.677 ± 0.902
GluTrp: 1.531 ± 0.314
GluTyr: 2.551 ± 0.589
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.806 ± 0.471
PheCys: 0.425 ± 0.247
PheAsp: 3.316 ± 0.473
PheGlu: 2.126 ± 0.395
PhePhe: 1.19 ± 0.34
PheGly: 2.551 ± 0.485
PheHis: 0.595 ± 0.206
PheIle: 2.041 ± 0.406
PheLys: 2.551 ± 0.519
PheLeu: 2.891 ± 0.339
PheMet: 0.85 ± 0.266
PheAsn: 1.701 ± 0.356
PhePro: 1.531 ± 0.377
PheGln: 0.85 ± 0.296
PheArg: 1.361 ± 0.299
PheSer: 3.231 ± 0.412
PheThr: 2.551 ± 0.457
PheVal: 2.721 ± 0.45
PheTrp: 0.425 ± 0.161
PheTyr: 1.276 ± 0.333
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.718 ± 0.818
GlyCys: 0.765 ± 0.27
GlyAsp: 5.187 ± 0.912
GlyGlu: 4.592 ± 0.708
GlyPhe: 2.296 ± 0.367
GlyGly: 5.782 ± 0.623
GlyHis: 0.935 ± 0.361
GlyIle: 4.252 ± 0.498
GlyLys: 6.293 ± 0.851
GlyLeu: 5.697 ± 0.829
GlyMet: 2.466 ± 0.561
GlyAsn: 2.721 ± 0.487
GlyPro: 1.02 ± 0.351
GlyGln: 2.636 ± 0.451
GlyArg: 5.527 ± 0.696
GlySer: 7.313 ± 1.029
GlyThr: 5.272 ± 0.725
GlyVal: 6.122 ± 0.815
GlyTrp: 1.19 ± 0.302
GlyTyr: 3.401 ± 0.461
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.68 ± 0.26
HisCys: 0.255 ± 0.171
HisAsp: 1.19 ± 0.466
HisGlu: 1.105 ± 0.39
HisPhe: 0.425 ± 0.225
HisGly: 0.935 ± 0.279
HisHis: 0.17 ± 0.134
HisIle: 0.765 ± 0.177
HisLys: 1.02 ± 0.274
HisLeu: 1.701 ± 0.413
HisMet: 0.425 ± 0.173
HisAsn: 0.425 ± 0.15
HisPro: 0.68 ± 0.236
HisGln: 0.51 ± 0.226
HisArg: 1.105 ± 0.272
HisSer: 0.935 ± 0.236
HisThr: 1.02 ± 0.307
HisVal: 1.02 ± 0.254
HisTrp: 0.34 ± 0.126
HisTyr: 0.425 ± 0.226
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.912 ± 0.673
IleCys: 0.68 ± 0.287
IleAsp: 3.316 ± 0.372
IleGlu: 3.486 ± 0.389
IlePhe: 1.105 ± 0.318
IleGly: 4.422 ± 0.472
IleHis: 0.85 ± 0.271
IleIle: 1.956 ± 0.375
IleLys: 3.827 ± 0.552
IleLeu: 3.401 ± 0.475
IleMet: 0.85 ± 0.244
IleAsn: 2.636 ± 0.584
IlePro: 1.956 ± 0.426
IleGln: 1.701 ± 0.487
IleArg: 3.316 ± 0.522
IleSer: 2.976 ± 0.512
IleThr: 3.316 ± 0.583
IleVal: 3.997 ± 0.443
IleTrp: 0.595 ± 0.238
IleTyr: 1.105 ± 0.26
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.803 ± 0.886
LysCys: 0.765 ± 0.3
LysAsp: 3.912 ± 0.55
LysGlu: 3.741 ± 0.469
LysPhe: 2.296 ± 0.522
LysGly: 4.677 ± 0.578
LysHis: 1.361 ± 0.397
LysIle: 1.871 ± 0.348
LysLys: 3.912 ± 0.957
LysLeu: 5.527 ± 0.669
LysMet: 1.786 ± 0.314
LysAsn: 2.126 ± 0.459
LysPro: 2.721 ± 0.679
LysGln: 1.956 ± 0.499
LysArg: 3.912 ± 0.643
LysSer: 4.337 ± 0.618
LysThr: 3.912 ± 0.536
LysVal: 4.762 ± 0.765
LysTrp: 1.276 ± 0.419
LysTyr: 2.381 ± 0.41
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.378 ± 0.736
LeuCys: 0.17 ± 0.126
LeuAsp: 4.762 ± 0.475
LeuGlu: 6.207 ± 0.83
LeuPhe: 2.381 ± 0.442
LeuGly: 5.017 ± 0.836
LeuHis: 0.68 ± 0.228
LeuIle: 4.167 ± 0.599
LeuLys: 6.293 ± 0.74
LeuLeu: 4.677 ± 0.65
LeuMet: 3.486 ± 0.658
LeuAsn: 4.167 ± 0.472
LeuPro: 3.656 ± 0.404
LeuGln: 3.656 ± 0.56
LeuArg: 5.102 ± 0.509
LeuSer: 5.612 ± 0.794
LeuThr: 5.272 ± 0.724
LeuVal: 5.017 ± 0.662
LeuTrp: 0.85 ± 0.248
LeuTyr: 2.296 ± 0.509
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.316 ± 0.453
MetCys: 0.34 ± 0.169
MetAsp: 1.786 ± 0.402
MetGlu: 2.211 ± 0.432
MetPhe: 1.105 ± 0.317
MetGly: 2.551 ± 0.584
MetHis: 0.255 ± 0.148
MetIle: 1.531 ± 0.298
MetLys: 1.276 ± 0.323
MetLeu: 2.721 ± 0.454
MetMet: 0.51 ± 0.199
MetAsn: 1.02 ± 0.323
MetPro: 1.105 ± 0.324
MetGln: 1.02 ± 0.335
MetArg: 1.02 ± 0.228
MetSer: 2.636 ± 0.604
MetThr: 1.786 ± 0.406
MetVal: 1.956 ± 0.426
MetTrp: 0.34 ± 0.212
MetTyr: 0.765 ± 0.173
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.827 ± 0.634
AsnCys: 0.425 ± 0.213
AsnAsp: 2.126 ± 0.485
AsnGlu: 1.956 ± 0.462
AsnPhe: 1.786 ± 0.271
AsnGly: 4.507 ± 0.641
AsnHis: 0.34 ± 0.14
AsnIle: 2.636 ± 0.4
AsnLys: 2.636 ± 0.411
AsnLeu: 3.656 ± 0.585
AsnMet: 1.02 ± 0.308
AsnAsn: 2.466 ± 0.663
AsnPro: 2.806 ± 0.52
AsnGln: 1.19 ± 0.258
AsnArg: 1.956 ± 0.467
AsnSer: 2.551 ± 0.681
AsnThr: 1.701 ± 0.314
AsnVal: 3.571 ± 0.69
AsnTrp: 0.085 ± 0.082
AsnTyr: 1.531 ± 0.461
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.061 ± 0.618
ProCys: 0.425 ± 0.249
ProAsp: 2.126 ± 0.39
ProGlu: 3.656 ± 0.68
ProPhe: 1.19 ± 0.23
ProGly: 1.786 ± 0.327
ProHis: 0.51 ± 0.193
ProIle: 1.956 ± 0.369
ProLys: 3.316 ± 0.531
ProLeu: 2.296 ± 0.464
ProMet: 0.85 ± 0.318
ProAsn: 2.211 ± 0.474
ProPro: 0.765 ± 0.238
ProGln: 1.531 ± 0.325
ProArg: 1.871 ± 0.531
ProSer: 3.231 ± 0.53
ProThr: 3.146 ± 0.482
ProVal: 2.806 ± 0.456
ProTrp: 0.68 ± 0.253
ProTyr: 0.935 ± 0.256
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.997 ± 0.51
GlnCys: 0.085 ± 0.097
GlnAsp: 2.806 ± 0.676
GlnGlu: 1.956 ± 0.411
GlnPhe: 2.126 ± 0.349
GlnGly: 2.551 ± 0.531
GlnHis: 0.425 ± 0.157
GlnIle: 1.105 ± 0.289
GlnLys: 1.956 ± 0.409
GlnLeu: 3.571 ± 0.688
GlnMet: 1.19 ± 0.337
GlnAsn: 1.276 ± 0.415
GlnPro: 0.935 ± 0.322
GlnGln: 1.871 ± 0.606
GlnArg: 2.126 ± 0.613
GlnSer: 2.976 ± 0.474
GlnThr: 1.871 ± 0.458
GlnVal: 2.551 ± 0.335
GlnTrp: 0.765 ± 0.321
GlnTyr: 1.19 ± 0.4
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.337 ± 0.812
ArgCys: 0.34 ± 0.175
ArgAsp: 4.082 ± 0.449
ArgGlu: 3.997 ± 0.629
ArgPhe: 2.466 ± 0.354
ArgGly: 4.252 ± 0.477
ArgHis: 0.595 ± 0.255
ArgIle: 3.401 ± 0.76
ArgLys: 3.061 ± 0.552
ArgLeu: 5.272 ± 0.718
ArgMet: 1.02 ± 0.314
ArgAsn: 2.806 ± 0.491
ArgPro: 2.126 ± 0.414
ArgGln: 2.551 ± 0.457
ArgArg: 2.296 ± 0.413
ArgSer: 3.741 ± 0.674
ArgThr: 2.211 ± 0.352
ArgVal: 2.721 ± 0.531
ArgTrp: 1.276 ± 0.305
ArgTyr: 1.531 ± 0.349
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.762 ± 0.807
SerCys: 0.85 ± 0.371
SerAsp: 5.952 ± 0.695
SerGlu: 4.252 ± 0.714
SerPhe: 2.381 ± 0.331
SerGly: 6.293 ± 0.999
SerHis: 1.956 ± 0.437
SerIle: 2.976 ± 0.66
SerLys: 2.976 ± 0.492
SerLeu: 4.677 ± 0.773
SerMet: 2.041 ± 0.382
SerAsn: 3.486 ± 0.828
SerPro: 3.401 ± 0.599
SerGln: 2.381 ± 0.581
SerArg: 3.486 ± 0.635
SerSer: 4.677 ± 0.814
SerThr: 4.337 ± 0.679
SerVal: 4.252 ± 0.524
SerTrp: 0.935 ± 0.251
SerTyr: 2.976 ± 0.626
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.741 ± 0.711
ThrCys: 0.17 ± 0.116
ThrAsp: 3.061 ± 0.446
ThrGlu: 5.187 ± 0.622
ThrPhe: 2.296 ± 0.467
ThrGly: 5.527 ± 0.646
ThrHis: 0.68 ± 0.232
ThrIle: 4.422 ± 0.674
ThrLys: 3.146 ± 0.57
ThrLeu: 4.422 ± 0.577
ThrMet: 2.126 ± 0.474
ThrAsn: 2.211 ± 0.611
ThrPro: 3.061 ± 0.406
ThrGln: 2.381 ± 0.483
ThrArg: 2.381 ± 0.463
ThrSer: 3.316 ± 0.77
ThrThr: 3.231 ± 0.568
ThrVal: 4.507 ± 0.529
ThrTrp: 0.595 ± 0.233
ThrTyr: 1.616 ± 0.339
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.932 ± 0.625
ValCys: 0.17 ± 0.135
ValAsp: 3.741 ± 0.526
ValGlu: 4.932 ± 0.65
ValPhe: 2.891 ± 0.495
ValGly: 5.442 ± 0.701
ValHis: 1.19 ± 0.548
ValIle: 2.976 ± 0.484
ValLys: 5.357 ± 0.745
ValLeu: 5.442 ± 0.701
ValMet: 2.211 ± 0.561
ValAsn: 3.146 ± 0.557
ValPro: 2.721 ± 0.494
ValGln: 2.976 ± 0.631
ValArg: 4.847 ± 0.626
ValSer: 5.102 ± 0.62
ValThr: 4.252 ± 0.584
ValVal: 5.527 ± 0.855
ValTrp: 0.425 ± 0.176
ValTyr: 2.296 ± 0.44
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.51 ± 0.176
TrpCys: 0.255 ± 0.153
TrpAsp: 0.765 ± 0.249
TrpGlu: 0.85 ± 0.199
TrpPhe: 0.68 ± 0.243
TrpGly: 1.02 ± 0.31
TrpHis: 0.51 ± 0.197
TrpIle: 0.255 ± 0.147
TrpLys: 1.361 ± 0.356
TrpLeu: 2.126 ± 0.523
TrpMet: 0.34 ± 0.191
TrpAsn: 0.765 ± 0.271
TrpPro: 0.17 ± 0.114
TrpGln: 0.595 ± 0.246
TrpArg: 1.02 ± 0.274
TrpSer: 1.105 ± 0.474
TrpThr: 0.51 ± 0.209
TrpVal: 0.935 ± 0.295
TrpTrp: 0.085 ± 0.084
TrpTyr: 0.51 ± 0.194
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.571 ± 0.7
TyrCys: 0.34 ± 0.17
TyrAsp: 1.956 ± 0.436
TyrGlu: 1.786 ± 0.496
TyrPhe: 0.935 ± 0.255
TyrGly: 2.891 ± 0.539
TyrHis: 0.51 ± 0.236
TyrIle: 1.531 ± 0.467
TyrLys: 1.786 ± 0.39
TyrLeu: 2.551 ± 0.398
TyrMet: 0.85 ± 0.268
TyrAsn: 1.701 ± 0.371
TyrPro: 1.276 ± 0.392
TyrGln: 1.786 ± 0.535
TyrArg: 2.721 ± 0.58
TyrSer: 1.871 ± 0.381
TyrThr: 1.786 ± 0.329
TyrVal: 1.616 ± 0.408
TyrTrp: 0.34 ± 0.164
TyrTyr: 1.105 ± 0.286
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (11761 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
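A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported as the error. The function names, the toy sequences, and the use of the sample standard deviation as the reported error are assumptions for illustration; the page does not specify which statistic follows the ± sign or how dipeptides are counted.

```python
import random
import statistics


def dipeptide_count(seq, dipeptide):
    """Count overlapping occurrences of a two-residue string in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


def frequency_permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille over a list of proteins."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    if total == 0:
        return 0.0
    hits = sum(dipeptide_count(seq, dipeptide) for seq in proteins)
    return 1000.0 * hits / total


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_permille(sample, dipeptide))
    return statistics.stdev(estimates)


toy = ["MAAK", "MKAA", "MGGS", "MAAA"]  # made-up sequences, one-letter codes
print(frequency_permille(toy, "AA"))
print(bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.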

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski