Amino acid dipeptide frequency for Pseudomonas phage phi8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
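The counting described above can be sketched in a few lines of Python. This is a minimal illustration and not the database's own pipeline: the function names, the use of one-letter codes, the mapping of every non-standard letter to `X` (Xaa), and the choice to count overlapping pairs within each protein separately are all assumptions made here.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_permille(proteins):
    """Return the frequency of all 441 dipeptides in per mille.

    Assumption: overlapping pairs are counted within each protein,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one sequence of length >= 2")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def uniform_expectation_permille(n_residues=20):
    """Expected per-mille value if all n_residues**2 dipeptides were equally likely."""
    return 1000.0 / n_residues ** 2


# Toy input (hypothetical sequences, not the phi8 proteome).
toy_freqs = dipeptide_permille(["AAC", "ACU"])
print(toy_freqs["AC"], uniform_expectation_permille())
```

With 20 residues the uniform expectation is 2.5‰ (0.25%); with Xaa included as a 21st residue it drops to about 2.27‰.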

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
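As a small worked example of the unit conversion, the sketch below turns two table entries into plain fractions and compares them with the 0.25% (2.5‰) uniform expectation for 400 dipeptides. The helper names are illustrative, and the ratio to the uniform value is only one possible baseline; the exact criterion behind the red and blue marking is not stated here.

```python
UNIFORM_EXPECTATION_PERMILLE = 1000.0 / 400  # 2.5 per mille for 400 equally likely dipeptides


def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def relative_to_uniform(value_permille):
    """Ratio of an observed per-mille value to the uniform expectation."""
    return value_permille / UNIFORM_EXPECTATION_PERMILLE


ala_ala = 11.907  # AlaAla entry from the table, in per mille
cys_leu = 0.433   # CysLeu entry from the table, in per mille

print(f"AlaAla as a fraction: {permille_to_fraction(ala_ala):.6f}")
print(f"AlaAla relative to uniform: {relative_to_uniform(ala_ala):.2f}x")
print(f"CysLeu relative to uniform: {relative_to_uniform(cys_leu):.2f}x")
```

AlaAla comes out at roughly 1.19% of all dipeptides, several times the uniform expectation, while CysLeu falls well below it.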

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.907 ± 1.327
AlaCys: 0.866 ± 0.355
AlaAsp: 4.763 ± 0.754
AlaGlu: 5.412 ± 1.202
AlaPhe: 3.897 ± 0.788
AlaGly: 6.711 ± 1.866
AlaHis: 2.381 ± 0.868
AlaIle: 7.577 ± 1.172
AlaLys: 5.629 ± 1.088
AlaLeu: 10.392 ± 1.613
AlaMet: 3.464 ± 0.805
AlaAsn: 3.68 ± 1.047
AlaPro: 3.031 ± 0.643
AlaGln: 4.763 ± 1.182
AlaArg: 7.144 ± 1.256
AlaSer: 6.278 ± 1.135
AlaThr: 6.711 ± 1.0
AlaVal: 8.443 ± 1.474
AlaTrp: 0.433 ± 0.353
AlaTyr: 2.381 ± 0.531
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.649 ± 0.36
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.433 ± 0.331
CysPhe: 0.649 ± 0.333
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.216 ± 0.193
CysLys: 0.216 ± 0.166
CysLeu: 0.433 ± 0.424
CysMet: 0.0 ± 0.0
CysAsn: 0.433 ± 0.253
CysPro: 0.216 ± 0.212
CysGln: 0.0 ± 0.0
CysArg: 0.433 ± 0.251
CysSer: 0.216 ± 0.209
CysThr: 0.0 ± 0.0
CysVal: 0.216 ± 0.209
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.928 ± 0.99
AspCys: 0.216 ± 0.232
AspAsp: 3.464 ± 1.169
AspGlu: 3.247 ± 1.178
AspPhe: 2.165 ± 0.986
AspGly: 3.464 ± 0.982
AspHis: 1.732 ± 0.664
AspIle: 3.897 ± 0.783
AspLys: 2.165 ± 0.378
AspLeu: 5.629 ± 1.078
AspMet: 1.515 ± 0.448
AspAsn: 0.433 ± 0.294
AspPro: 2.381 ± 0.777
AspGln: 1.948 ± 0.702
AspArg: 3.464 ± 0.96
AspSer: 3.68 ± 1.275
AspThr: 2.381 ± 0.708
AspVal: 6.928 ± 1.005
AspTrp: 0.216 ± 0.228
AspTyr: 2.598 ± 0.841
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.546 ± 1.08
GluCys: 0.649 ± 0.313
GluAsp: 2.814 ± 0.752
GluGlu: 3.031 ± 0.695
GluPhe: 2.165 ± 0.581
GluGly: 2.814 ± 0.935
GluHis: 0.866 ± 0.357
GluIle: 4.33 ± 0.669
GluLys: 3.031 ± 0.944
GluLeu: 6.278 ± 0.779
GluMet: 1.732 ± 0.556
GluAsn: 1.732 ± 0.584
GluPro: 0.866 ± 0.368
GluGln: 2.598 ± 0.662
GluArg: 3.464 ± 0.728
GluSer: 4.33 ± 0.531
GluThr: 2.814 ± 0.547
GluVal: 4.33 ± 0.861
GluTrp: 0.216 ± 0.212
GluTyr: 1.299 ± 0.656
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.68 ± 0.644
PheCys: 0.216 ± 0.166
PheAsp: 1.515 ± 0.641
PheGlu: 1.948 ± 0.985
PhePhe: 1.299 ± 0.387
PheGly: 3.68 ± 1.095
PheHis: 1.515 ± 0.569
PheIle: 1.515 ± 0.555
PheLys: 1.948 ± 0.741
PheLeu: 3.464 ± 0.692
PheMet: 1.732 ± 0.548
PheAsn: 1.299 ± 0.605
PhePro: 0.866 ± 0.394
PheGln: 0.866 ± 0.35
PheArg: 0.216 ± 0.232
PheSer: 3.68 ± 0.982
PheThr: 2.381 ± 0.914
PheVal: 3.247 ± 0.598
PheTrp: 0.0 ± 0.0
PheTyr: 1.082 ± 0.408
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.227 ± 1.667
GlyCys: 0.216 ± 0.166
GlyAsp: 4.763 ± 0.834
GlyGlu: 4.979 ± 0.947
GlyPhe: 3.68 ± 0.799
GlyGly: 5.196 ± 1.385
GlyHis: 0.866 ± 0.49
GlyIle: 4.546 ± 1.13
GlyLys: 4.546 ± 1.215
GlyLeu: 7.144 ± 1.258
GlyMet: 1.948 ± 0.549
GlyAsn: 2.814 ± 0.954
GlyPro: 1.948 ± 0.651
GlyGln: 1.515 ± 0.627
GlyArg: 2.165 ± 0.726
GlySer: 4.763 ± 0.953
GlyThr: 4.113 ± 0.75
GlyVal: 8.443 ± 1.105
GlyTrp: 1.082 ± 0.419
GlyTyr: 2.165 ± 0.81
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.299 ± 0.93
HisCys: 0.0 ± 0.0
HisAsp: 0.866 ± 0.396
HisGlu: 1.082 ± 0.51
HisPhe: 1.732 ± 0.698
HisGly: 1.082 ± 0.498
HisHis: 0.433 ± 0.262
HisIle: 0.866 ± 0.393
HisLys: 1.082 ± 0.492
HisLeu: 1.515 ± 0.595
HisMet: 0.433 ± 0.312
HisAsn: 0.0 ± 0.0
HisPro: 0.433 ± 0.267
HisGln: 1.082 ± 0.544
HisArg: 1.082 ± 0.509
HisSer: 0.866 ± 0.419
HisThr: 0.866 ± 0.448
HisVal: 0.866 ± 0.478
HisTrp: 0.216 ± 0.241
HisTyr: 1.299 ± 0.395
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.144 ± 1.908
IleCys: 0.0 ± 0.0
IleAsp: 2.814 ± 0.417
IleGlu: 4.546 ± 1.14
IlePhe: 0.433 ± 0.236
IleGly: 4.113 ± 1.23
IleHis: 0.433 ± 0.253
IleIle: 4.546 ± 0.75
IleLys: 1.515 ± 0.561
IleLeu: 3.897 ± 0.998
IleMet: 1.948 ± 0.524
IleAsn: 2.814 ± 0.609
IlePro: 2.598 ± 0.576
IleGln: 1.082 ± 0.525
IleArg: 4.113 ± 0.818
IleSer: 2.598 ± 0.685
IleThr: 4.546 ± 0.7
IleVal: 3.897 ± 1.269
IleTrp: 0.433 ± 0.313
IleTyr: 1.082 ± 0.48
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.33 ± 1.193
LysCys: 0.216 ± 0.212
LysAsp: 3.247 ± 0.852
LysGlu: 2.814 ± 0.746
LysPhe: 1.515 ± 0.652
LysGly: 5.629 ± 1.126
LysHis: 1.299 ± 0.523
LysIle: 1.515 ± 0.428
LysLys: 3.464 ± 0.983
LysLeu: 3.247 ± 0.873
LysMet: 1.082 ± 0.542
LysAsn: 1.732 ± 0.763
LysPro: 1.299 ± 0.626
LysGln: 1.948 ± 0.473
LysArg: 3.247 ± 0.976
LysSer: 2.598 ± 0.692
LysThr: 3.031 ± 0.853
LysVal: 2.598 ± 0.615
LysTrp: 0.866 ± 0.531
LysTyr: 1.732 ± 0.765
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.041 ± 1.555
LeuCys: 0.649 ± 0.306
LeuAsp: 4.979 ± 0.974
LeuGlu: 5.196 ± 1.09
LeuPhe: 3.68 ± 1.468
LeuGly: 8.227 ± 1.365
LeuHis: 1.732 ± 0.633
LeuIle: 5.196 ± 0.994
LeuLys: 4.33 ± 0.514
LeuLeu: 8.66 ± 2.006
LeuMet: 3.68 ± 0.961
LeuAsn: 2.814 ± 0.538
LeuPro: 4.113 ± 0.86
LeuGln: 3.247 ± 1.061
LeuArg: 5.196 ± 0.901
LeuSer: 5.845 ± 1.063
LeuThr: 7.794 ± 1.605
LeuVal: 6.062 ± 0.94
LeuTrp: 0.216 ± 0.263
LeuTyr: 2.598 ± 0.637
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.814 ± 0.573
MetCys: 0.649 ± 0.394
MetAsp: 2.165 ± 0.712
MetGlu: 1.948 ± 0.454
MetPhe: 0.866 ± 0.421
MetGly: 2.165 ± 0.634
MetHis: 0.433 ± 0.335
MetIle: 1.948 ± 0.554
MetLys: 0.866 ± 0.606
MetLeu: 4.763 ± 0.985
MetMet: 0.649 ± 0.373
MetAsn: 1.515 ± 0.642
MetPro: 1.299 ± 0.476
MetGln: 2.381 ± 0.738
MetArg: 1.732 ± 0.425
MetSer: 3.464 ± 0.699
MetThr: 1.515 ± 0.472
MetVal: 3.247 ± 0.61
MetTrp: 0.0 ± 0.0
MetTyr: 0.649 ± 0.354
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.247 ± 0.881
AsnCys: 0.0 ± 0.0
AsnAsp: 1.515 ± 0.342
AsnGlu: 1.948 ± 0.58
AsnPhe: 1.299 ± 0.499
AsnGly: 4.33 ± 1.017
AsnHis: 0.866 ± 0.506
AsnIle: 2.165 ± 0.63
AsnLys: 0.649 ± 0.452
AsnLeu: 2.814 ± 1.018
AsnMet: 1.515 ± 0.432
AsnAsn: 1.299 ± 0.58
AsnPro: 1.515 ± 0.443
AsnGln: 1.299 ± 0.443
AsnArg: 2.598 ± 0.549
AsnSer: 1.732 ± 0.785
AsnThr: 1.299 ± 0.652
AsnVal: 3.031 ± 1.052
AsnTrp: 0.433 ± 0.279
AsnTyr: 0.433 ± 0.306
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.814 ± 0.827
ProCys: 0.0 ± 0.0
ProAsp: 2.814 ± 0.777
ProGlu: 1.515 ± 0.444
ProPhe: 1.515 ± 0.699
ProGly: 3.247 ± 0.729
ProHis: 0.0 ± 0.0
ProIle: 1.948 ± 0.51
ProLys: 0.433 ± 0.217
ProLeu: 3.031 ± 0.618
ProMet: 1.082 ± 0.389
ProAsn: 1.515 ± 0.595
ProPro: 1.515 ± 0.67
ProGln: 1.082 ± 0.501
ProArg: 2.381 ± 0.653
ProSer: 5.412 ± 0.975
ProThr: 3.031 ± 0.728
ProVal: 1.732 ± 0.493
ProTrp: 0.649 ± 0.362
ProTyr: 0.649 ± 0.373
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.68 ± 0.853
GlnCys: 0.216 ± 0.166
GlnAsp: 1.082 ± 0.536
GlnGlu: 1.082 ± 0.294
GlnPhe: 1.082 ± 0.399
GlnGly: 1.948 ± 0.678
GlnHis: 0.0 ± 0.0
GlnIle: 0.866 ± 0.42
GlnLys: 1.732 ± 0.794
GlnLeu: 4.763 ± 1.085
GlnMet: 1.732 ± 0.577
GlnAsn: 1.732 ± 0.411
GlnPro: 1.732 ± 0.48
GlnGln: 1.732 ± 0.55
GlnArg: 2.165 ± 0.638
GlnSer: 2.814 ± 0.63
GlnThr: 2.598 ± 0.856
GlnVal: 2.598 ± 0.809
GlnTrp: 0.433 ± 0.251
GlnTyr: 0.866 ± 0.388
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.196 ± 0.92
ArgCys: 0.0 ± 0.0
ArgAsp: 3.031 ± 0.964
ArgGlu: 1.948 ± 0.81
ArgPhe: 3.247 ± 0.827
ArgGly: 3.897 ± 0.666
ArgHis: 0.649 ± 0.347
ArgIle: 2.814 ± 0.795
ArgLys: 2.381 ± 0.691
ArgLeu: 6.062 ± 1.25
ArgMet: 1.082 ± 0.539
ArgAsn: 1.732 ± 0.546
ArgPro: 2.381 ± 0.525
ArgGln: 1.948 ± 0.667
ArgArg: 3.464 ± 0.715
ArgSer: 4.763 ± 0.98
ArgThr: 3.897 ± 0.951
ArgVal: 5.845 ± 1.237
ArgTrp: 0.433 ± 0.308
ArgTyr: 3.031 ± 0.713
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.01 ± 0.92
SerCys: 0.0 ± 0.0
SerAsp: 4.763 ± 1.035
SerGlu: 3.031 ± 1.02
SerPhe: 2.598 ± 0.744
SerGly: 3.897 ± 0.989
SerHis: 1.082 ± 0.55
SerIle: 4.113 ± 0.937
SerLys: 4.113 ± 1.09
SerLeu: 7.144 ± 1.318
SerMet: 3.247 ± 0.828
SerAsn: 2.814 ± 0.84
SerPro: 2.381 ± 0.516
SerGln: 1.948 ± 0.779
SerArg: 2.598 ± 0.661
SerSer: 6.495 ± 1.046
SerThr: 3.897 ± 1.088
SerVal: 6.495 ± 1.239
SerTrp: 1.299 ± 0.495
SerTyr: 2.165 ± 0.878
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.577 ± 1.55
ThrCys: 0.0 ± 0.0
ThrAsp: 3.68 ± 1.03
ThrGlu: 3.031 ± 0.992
ThrPhe: 1.082 ± 0.353
ThrGly: 5.845 ± 1.191
ThrHis: 0.649 ± 0.318
ThrIle: 2.814 ± 0.993
ThrLys: 1.732 ± 0.608
ThrLeu: 6.928 ± 0.768
ThrMet: 3.68 ± 1.233
ThrAsn: 2.381 ± 0.679
ThrPro: 3.031 ± 0.64
ThrGln: 2.165 ± 0.499
ThrArg: 3.897 ± 1.002
ThrSer: 3.897 ± 0.833
ThrThr: 4.33 ± 0.872
ThrVal: 3.247 ± 0.596
ThrTrp: 0.866 ± 0.404
ThrTyr: 1.299 ± 0.432
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.876 ± 1.046
ValCys: 0.0 ± 0.0
ValAsp: 6.711 ± 1.125
ValGlu: 4.33 ± 1.196
ValPhe: 2.381 ± 0.953
ValGly: 6.278 ± 1.419
ValHis: 1.732 ± 0.751
ValIle: 2.381 ± 0.669
ValLys: 5.196 ± 0.893
ValLeu: 6.278 ± 1.184
ValMet: 2.814 ± 0.728
ValAsn: 2.165 ± 0.827
ValPro: 3.464 ± 0.937
ValGln: 1.948 ± 0.567
ValArg: 6.495 ± 1.128
ValSer: 5.412 ± 1.595
ValThr: 5.629 ± 1.033
ValVal: 8.66 ± 1.73
ValTrp: 0.433 ± 0.289
ValTyr: 1.948 ± 0.668
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.082 ± 0.544
TrpCys: 0.216 ± 0.166
TrpAsp: 0.866 ± 0.69
TrpGlu: 0.433 ± 0.295
TrpPhe: 0.0 ± 0.0
TrpGly: 0.216 ± 0.263
TrpHis: 0.216 ± 0.166
TrpIle: 0.433 ± 0.336
TrpLys: 0.649 ± 0.349
TrpLeu: 1.299 ± 0.635
TrpMet: 0.0 ± 0.0
TrpAsn: 0.433 ± 0.274
TrpPro: 0.216 ± 0.212
TrpGln: 0.216 ± 0.232
TrpArg: 0.216 ± 0.166
TrpSer: 1.082 ± 0.493
TrpThr: 0.216 ± 0.209
TrpVal: 0.866 ± 0.407
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.814 ± 1.054
TyrCys: 0.0 ± 0.0
TyrAsp: 2.598 ± 0.605
TyrGlu: 1.948 ± 0.502
TyrPhe: 0.866 ± 0.352
TyrGly: 2.381 ± 0.772
TyrHis: 0.216 ± 0.241
TyrIle: 0.866 ± 0.553
TyrLys: 1.732 ± 0.704
TyrLeu: 1.732 ± 0.507
TyrMet: 1.515 ± 0.443
TyrAsn: 0.649 ± 0.394
TyrPro: 1.082 ± 0.639
TyrGln: 1.082 ± 0.424
TyrArg: 1.732 ± 0.704
TyrSer: 2.165 ± 0.968
TyrThr: 1.082 ± 0.289
TyrVal: 2.381 ± 0.689
TyrTrp: 0.433 ± 0.336
TyrTyr: 0.649 ± 0.254
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 19 proteins (4620 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
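One way to read "bootstrapping at the protein level" is that whole proteins, rather than individual residues, are resampled with replacement, and the spread of the recomputed frequency is reported as the error. The sketch below follows that reading. It is an assumption-laden illustration: the function names, the fixed seed, and the use of the standard deviation as the error statistic are choices made here, not details confirmed by the source.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs within one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille
    frequency over n_boot protein-level bootstrap replicates."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not the phi8 proteome).
toy_proteins = ["AAAA", "CCCC", "ACAC"]
mean_aa, err_aa = bootstrap_error(toy_proteins, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f} per mille")
```

Resampling at the protein level keeps each protein's internal composition intact, so the error reflects variation between proteins; with only 19 proteins, as here, such estimates can be fairly wide.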

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski