Amino acid dipepetide frequency for Pseudomonas phage phi1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.694AlaAla: 13.694 ± 1.982
1.122AlaCys: 1.122 ± 0.274
5.986AlaAsp: 5.986 ± 0.743
10.326AlaGlu: 10.326 ± 1.238
4.041AlaPhe: 4.041 ± 0.654
7.109AlaGly: 7.109 ± 1.047
1.272AlaHis: 1.272 ± 0.3
5.687AlaIle: 5.687 ± 0.596
6.136AlaLys: 6.136 ± 0.942
10.102AlaLeu: 10.102 ± 0.961
4.041AlaMet: 4.041 ± 0.561
3.068AlaAsn: 3.068 ± 0.545
3.442AlaPro: 3.442 ± 0.624
4.49AlaGln: 4.49 ± 0.859
6.211AlaArg: 6.211 ± 0.894
7.483AlaSer: 7.483 ± 1.236
4.939AlaThr: 4.939 ± 0.692
4.265AlaVal: 4.265 ± 0.601
2.32AlaTrp: 2.32 ± 0.395
2.843AlaTyr: 2.843 ± 0.356
0.0AlaXaa: 0.0 ± 0.0
Cys
1.048CysAla: 1.048 ± 0.293
0.299CysCys: 0.299 ± 0.202
0.599CysAsp: 0.599 ± 0.211
0.823CysGlu: 0.823 ± 0.25
0.374CysPhe: 0.374 ± 0.168
1.197CysGly: 1.197 ± 0.326
0.374CysHis: 0.374 ± 0.18
0.599CysIle: 0.599 ± 0.178
0.823CysLys: 0.823 ± 0.265
0.748CysLeu: 0.748 ± 0.278
0.15CysMet: 0.15 ± 0.114
0.449CysAsn: 0.449 ± 0.169
0.673CysPro: 0.673 ± 0.256
0.748CysGln: 0.748 ± 0.242
1.122CysArg: 1.122 ± 0.286
0.898CysSer: 0.898 ± 0.286
0.299CysThr: 0.299 ± 0.167
0.823CysVal: 0.823 ± 0.348
0.524CysTrp: 0.524 ± 0.228
0.449CysTyr: 0.449 ± 0.177
0.0CysXaa: 0.0 ± 0.0
Asp
5.986AspAla: 5.986 ± 0.602
0.973AspCys: 0.973 ± 0.26
4.415AspAsp: 4.415 ± 0.701
3.891AspGlu: 3.891 ± 0.515
2.769AspPhe: 2.769 ± 0.408
6.136AspGly: 6.136 ± 0.775
0.748AspHis: 0.748 ± 0.242
3.143AspIle: 3.143 ± 0.655
2.769AspLys: 2.769 ± 0.452
5.238AspLeu: 5.238 ± 0.545
1.721AspMet: 1.721 ± 0.352
1.646AspAsn: 1.646 ± 0.4
2.32AspPro: 2.32 ± 0.424
2.469AspGln: 2.469 ± 0.532
3.592AspArg: 3.592 ± 0.511
3.442AspSer: 3.442 ± 0.454
1.721AspThr: 1.721 ± 0.373
4.714AspVal: 4.714 ± 0.598
1.646AspTrp: 1.646 ± 0.385
1.646AspTyr: 1.646 ± 0.448
0.0AspXaa: 0.0 ± 0.0
Glu
7.707GluAla: 7.707 ± 1.096
1.048GluCys: 1.048 ± 0.32
2.769GluAsp: 2.769 ± 0.454
4.49GluGlu: 4.49 ± 0.742
2.918GluPhe: 2.918 ± 0.46
3.517GluGly: 3.517 ± 0.518
1.272GluHis: 1.272 ± 0.336
3.891GluIle: 3.891 ± 0.552
3.891GluLys: 3.891 ± 0.579
5.462GluLeu: 5.462 ± 0.746
2.17GluMet: 2.17 ± 0.373
2.17GluAsn: 2.17 ± 0.448
3.442GluPro: 3.442 ± 0.52
3.667GluGln: 3.667 ± 0.613
4.939GluArg: 4.939 ± 1.115
3.891GluSer: 3.891 ± 0.53
4.49GluThr: 4.49 ± 0.63
5.163GluVal: 5.163 ± 0.597
1.646GluTrp: 1.646 ± 0.36
2.02GluTyr: 2.02 ± 0.344
0.0GluXaa: 0.0 ± 0.0
Phe
3.143PheAla: 3.143 ± 0.484
0.673PheCys: 0.673 ± 0.204
2.544PheAsp: 2.544 ± 0.427
2.843PheGlu: 2.843 ± 0.526
0.823PhePhe: 0.823 ± 0.305
3.367PheGly: 3.367 ± 0.667
0.673PheHis: 0.673 ± 0.244
1.497PheIle: 1.497 ± 0.462
1.871PheLys: 1.871 ± 0.386
2.769PheLeu: 2.769 ± 0.424
0.898PheMet: 0.898 ± 0.287
0.973PheAsn: 0.973 ± 0.246
1.721PhePro: 1.721 ± 0.308
0.898PheGln: 0.898 ± 0.238
2.17PheArg: 2.17 ± 0.337
1.871PheSer: 1.871 ± 0.397
1.871PheThr: 1.871 ± 0.406
2.694PheVal: 2.694 ± 0.432
0.449PheTrp: 0.449 ± 0.206
1.048PheTyr: 1.048 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
6.66GlyAla: 6.66 ± 0.661
0.973GlyCys: 0.973 ± 0.285
4.19GlyAsp: 4.19 ± 0.58
5.163GlyGlu: 5.163 ± 0.742
3.292GlyPhe: 3.292 ± 0.649
6.286GlyGly: 6.286 ± 0.828
1.946GlyHis: 1.946 ± 0.387
5.388GlyIle: 5.388 ± 0.576
3.966GlyLys: 3.966 ± 0.551
5.911GlyLeu: 5.911 ± 0.652
2.17GlyMet: 2.17 ± 0.348
3.143GlyAsn: 3.143 ± 0.823
2.095GlyPro: 2.095 ± 0.473
2.918GlyGln: 2.918 ± 0.548
4.714GlyArg: 4.714 ± 0.631
4.415GlySer: 4.415 ± 0.676
3.966GlyThr: 3.966 ± 0.598
5.388GlyVal: 5.388 ± 0.615
1.272GlyTrp: 1.272 ± 0.292
2.095GlyTyr: 2.095 ± 0.376
0.0GlyXaa: 0.0 ± 0.0
His
1.422HisAla: 1.422 ± 0.281
0.299HisCys: 0.299 ± 0.161
1.272HisAsp: 1.272 ± 0.279
1.272HisGlu: 1.272 ± 0.294
0.673HisPhe: 0.673 ± 0.259
1.646HisGly: 1.646 ± 0.417
0.224HisHis: 0.224 ± 0.124
1.197HisIle: 1.197 ± 0.297
0.673HisLys: 0.673 ± 0.204
1.721HisLeu: 1.721 ± 0.379
0.599HisMet: 0.599 ± 0.188
0.299HisAsn: 0.299 ± 0.17
1.122HisPro: 1.122 ± 0.317
0.673HisGln: 0.673 ± 0.205
1.422HisArg: 1.422 ± 0.381
1.347HisSer: 1.347 ± 0.372
0.449HisThr: 0.449 ± 0.167
0.823HisVal: 0.823 ± 0.264
0.524HisTrp: 0.524 ± 0.219
0.599HisTyr: 0.599 ± 0.184
0.0HisXaa: 0.0 ± 0.0
Ile
6.51IleAla: 6.51 ± 0.787
0.748IleCys: 0.748 ± 0.302
4.789IleAsp: 4.789 ± 0.618
3.741IleGlu: 3.741 ± 0.418
1.571IlePhe: 1.571 ± 0.31
4.34IleGly: 4.34 ± 0.55
1.197IleHis: 1.197 ± 0.347
3.068IleIle: 3.068 ± 0.692
2.769IleLys: 2.769 ± 0.512
4.34IleLeu: 4.34 ± 0.896
0.898IleMet: 0.898 ± 0.292
2.32IleAsn: 2.32 ± 0.566
2.469IlePro: 2.469 ± 0.388
1.796IleGln: 1.796 ± 0.428
3.068IleArg: 3.068 ± 0.493
3.592IleSer: 3.592 ± 0.567
3.891IleThr: 3.891 ± 0.476
2.694IleVal: 2.694 ± 0.416
0.524IleTrp: 0.524 ± 0.178
2.245IleTyr: 2.245 ± 0.433
0.0IleXaa: 0.0 ± 0.0
Lys
5.163LysAla: 5.163 ± 0.726
0.374LysCys: 0.374 ± 0.158
2.544LysAsp: 2.544 ± 0.446
3.292LysGlu: 3.292 ± 0.547
1.347LysPhe: 1.347 ± 0.361
3.966LysGly: 3.966 ± 0.533
1.197LysHis: 1.197 ± 0.277
1.796LysIle: 1.796 ± 0.352
2.993LysLys: 2.993 ± 0.557
3.891LysLeu: 3.891 ± 0.697
0.898LysMet: 0.898 ± 0.361
1.422LysAsn: 1.422 ± 0.349
2.32LysPro: 2.32 ± 0.449
2.469LysGln: 2.469 ± 0.466
3.966LysArg: 3.966 ± 0.494
2.993LysSer: 2.993 ± 0.384
2.843LysThr: 2.843 ± 0.402
3.143LysVal: 3.143 ± 0.425
0.673LysTrp: 0.673 ± 0.207
1.571LysTyr: 1.571 ± 0.374
0.0LysXaa: 0.0 ± 0.0
Leu
9.578LeuAla: 9.578 ± 1.193
1.197LeuCys: 1.197 ± 0.37
6.136LeuAsp: 6.136 ± 0.775
5.911LeuGlu: 5.911 ± 0.691
1.571LeuPhe: 1.571 ± 0.449
4.939LeuGly: 4.939 ± 0.766
1.646LeuHis: 1.646 ± 0.428
5.163LeuIle: 5.163 ± 0.808
3.891LeuLys: 3.891 ± 0.551
7.183LeuLeu: 7.183 ± 0.937
1.347LeuMet: 1.347 ± 0.312
3.143LeuAsn: 3.143 ± 0.443
3.068LeuPro: 3.068 ± 0.5
2.544LeuGln: 2.544 ± 0.396
6.36LeuArg: 6.36 ± 0.776
5.537LeuSer: 5.537 ± 0.681
4.864LeuThr: 4.864 ± 0.641
5.388LeuVal: 5.388 ± 0.57
0.823LeuTrp: 0.823 ± 0.292
2.02LeuTyr: 2.02 ± 0.322
0.0LeuXaa: 0.0 ± 0.0
Met
3.592MetAla: 3.592 ± 0.509
0.299MetCys: 0.299 ± 0.154
1.646MetAsp: 1.646 ± 0.343
1.122MetGlu: 1.122 ± 0.292
0.748MetPhe: 0.748 ± 0.223
1.571MetGly: 1.571 ± 0.514
0.599MetHis: 0.599 ± 0.243
1.347MetIle: 1.347 ± 0.268
0.748MetLys: 0.748 ± 0.271
2.694MetLeu: 2.694 ± 0.453
0.823MetMet: 0.823 ± 0.26
1.197MetAsn: 1.197 ± 0.277
1.646MetPro: 1.646 ± 0.282
1.122MetGln: 1.122 ± 0.318
2.02MetArg: 2.02 ± 0.437
1.721MetSer: 1.721 ± 0.332
2.32MetThr: 2.32 ± 0.336
1.571MetVal: 1.571 ± 0.353
0.15MetTrp: 0.15 ± 0.097
0.823MetTyr: 0.823 ± 0.355
0.0MetXaa: 0.0 ± 0.0
Asn
5.013AsnAla: 5.013 ± 0.599
0.599AsnCys: 0.599 ± 0.267
1.796AsnAsp: 1.796 ± 0.383
1.796AsnGlu: 1.796 ± 0.367
0.524AsnPhe: 0.524 ± 0.204
2.993AsnGly: 2.993 ± 0.431
0.374AsnHis: 0.374 ± 0.161
2.17AsnIle: 2.17 ± 0.514
1.122AsnLys: 1.122 ± 0.255
2.32AsnLeu: 2.32 ± 0.357
0.823AsnMet: 0.823 ± 0.189
0.823AsnAsn: 0.823 ± 0.244
2.095AsnPro: 2.095 ± 0.471
1.197AsnGln: 1.197 ± 0.331
1.646AsnArg: 1.646 ± 0.555
2.17AsnSer: 2.17 ± 0.47
1.946AsnThr: 1.946 ± 0.677
1.422AsnVal: 1.422 ± 0.428
0.673AsnTrp: 0.673 ± 0.239
1.272AsnTyr: 1.272 ± 0.25
0.0AsnXaa: 0.0 ± 0.0
Pro
4.19ProAla: 4.19 ± 0.521
0.524ProCys: 0.524 ± 0.179
2.544ProAsp: 2.544 ± 0.424
3.218ProGlu: 3.218 ± 0.462
1.347ProPhe: 1.347 ± 0.349
4.415ProGly: 4.415 ± 0.719
0.823ProHis: 0.823 ± 0.3
1.871ProIle: 1.871 ± 0.326
1.497ProLys: 1.497 ± 0.292
4.34ProLeu: 4.34 ± 0.613
1.048ProMet: 1.048 ± 0.307
0.973ProAsn: 0.973 ± 0.243
1.347ProPro: 1.347 ± 0.331
1.497ProGln: 1.497 ± 0.387
3.218ProArg: 3.218 ± 0.513
2.769ProSer: 2.769 ± 0.495
2.17ProThr: 2.17 ± 0.448
2.619ProVal: 2.619 ± 0.465
0.748ProTrp: 0.748 ± 0.283
0.973ProTyr: 0.973 ± 0.235
0.0ProXaa: 0.0 ± 0.0
Gln
4.939GlnAla: 4.939 ± 0.947
0.673GlnCys: 0.673 ± 0.216
1.646GlnAsp: 1.646 ± 0.354
1.946GlnGlu: 1.946 ± 0.392
1.571GlnPhe: 1.571 ± 0.296
2.694GlnGly: 2.694 ± 0.396
0.449GlnHis: 0.449 ± 0.16
2.02GlnIle: 2.02 ± 0.567
2.17GlnLys: 2.17 ± 0.398
3.367GlnLeu: 3.367 ± 0.406
1.796GlnMet: 1.796 ± 0.409
1.122GlnAsn: 1.122 ± 0.272
1.721GlnPro: 1.721 ± 0.445
2.17GlnGln: 2.17 ± 0.574
3.517GlnArg: 3.517 ± 0.515
2.17GlnSer: 2.17 ± 0.492
1.796GlnThr: 1.796 ± 0.325
2.17GlnVal: 2.17 ± 0.377
0.748GlnTrp: 0.748 ± 0.237
1.122GlnTyr: 1.122 ± 0.275
0.0GlnXaa: 0.0 ± 0.0
Arg
5.687ArgAla: 5.687 ± 0.674
0.524ArgCys: 0.524 ± 0.164
3.966ArgAsp: 3.966 ± 0.456
5.911ArgGlu: 5.911 ± 1.093
2.02ArgPhe: 2.02 ± 0.437
3.592ArgGly: 3.592 ± 0.671
1.721ArgHis: 1.721 ± 0.349
4.864ArgIle: 4.864 ± 0.636
3.367ArgLys: 3.367 ± 0.498
6.136ArgLeu: 6.136 ± 0.628
2.02ArgMet: 2.02 ± 0.443
1.571ArgAsn: 1.571 ± 0.366
2.469ArgPro: 2.469 ± 0.447
2.993ArgGln: 2.993 ± 0.414
5.163ArgArg: 5.163 ± 0.769
3.891ArgSer: 3.891 ± 0.69
3.292ArgThr: 3.292 ± 0.429
3.442ArgVal: 3.442 ± 0.522
1.122ArgTrp: 1.122 ± 0.29
2.245ArgTyr: 2.245 ± 0.431
0.0ArgXaa: 0.0 ± 0.0
Ser
7.707SerAla: 7.707 ± 1.099
0.524SerCys: 0.524 ± 0.175
3.966SerAsp: 3.966 ± 0.556
4.265SerGlu: 4.265 ± 0.601
2.694SerPhe: 2.694 ± 0.568
5.612SerGly: 5.612 ± 0.69
0.973SerHis: 0.973 ± 0.293
3.667SerIle: 3.667 ± 0.621
3.292SerLys: 3.292 ± 0.692
4.49SerLeu: 4.49 ± 0.636
2.619SerMet: 2.619 ± 0.476
2.843SerAsn: 2.843 ± 0.696
2.544SerPro: 2.544 ± 0.461
1.871SerGln: 1.871 ± 0.444
2.993SerArg: 2.993 ± 0.42
5.013SerSer: 5.013 ± 0.834
2.993SerThr: 2.993 ± 0.527
3.442SerVal: 3.442 ± 0.743
0.823SerTrp: 0.823 ± 0.208
1.646SerTyr: 1.646 ± 0.378
0.0SerXaa: 0.0 ± 0.0
Thr
6.286ThrAla: 6.286 ± 0.819
0.449ThrCys: 0.449 ± 0.218
2.993ThrAsp: 2.993 ± 0.573
3.442ThrGlu: 3.442 ± 0.473
2.17ThrPhe: 2.17 ± 0.359
5.837ThrGly: 5.837 ± 0.965
0.673ThrHis: 0.673 ± 0.227
3.517ThrIle: 3.517 ± 0.47
2.095ThrLys: 2.095 ± 0.341
3.068ThrLeu: 3.068 ± 0.487
0.898ThrMet: 0.898 ± 0.235
1.871ThrAsn: 1.871 ± 0.442
3.143ThrPro: 3.143 ± 0.481
1.871ThrGln: 1.871 ± 0.424
2.469ThrArg: 2.469 ± 0.494
3.367ThrSer: 3.367 ± 0.56
1.871ThrThr: 1.871 ± 0.406
3.068ThrVal: 3.068 ± 0.399
1.422ThrTrp: 1.422 ± 0.362
1.571ThrTyr: 1.571 ± 0.345
0.0ThrXaa: 0.0 ± 0.0
Val
6.585ValAla: 6.585 ± 0.717
0.898ValCys: 0.898 ± 0.278
4.639ValAsp: 4.639 ± 0.736
4.041ValGlu: 4.041 ± 0.666
2.694ValPhe: 2.694 ± 0.497
4.565ValGly: 4.565 ± 0.697
1.122ValHis: 1.122 ± 0.307
2.918ValIle: 2.918 ± 0.57
3.068ValLys: 3.068 ± 0.563
3.891ValLeu: 3.891 ± 0.618
1.272ValMet: 1.272 ± 0.38
2.095ValAsn: 2.095 ± 0.412
2.544ValPro: 2.544 ± 0.488
1.646ValGln: 1.646 ± 0.411
2.918ValArg: 2.918 ± 0.462
4.265ValSer: 4.265 ± 0.519
4.116ValThr: 4.116 ± 0.531
3.592ValVal: 3.592 ± 0.534
0.524ValTrp: 0.524 ± 0.221
1.796ValTyr: 1.796 ± 0.305
0.0ValXaa: 0.0 ± 0.0
Trp
1.122TrpAla: 1.122 ± 0.274
0.374TrpCys: 0.374 ± 0.16
0.973TrpAsp: 0.973 ± 0.245
1.197TrpGlu: 1.197 ± 0.297
0.823TrpPhe: 0.823 ± 0.272
0.449TrpGly: 0.449 ± 0.314
0.449TrpHis: 0.449 ± 0.192
1.122TrpIle: 1.122 ± 0.284
0.673TrpLys: 0.673 ± 0.181
1.946TrpLeu: 1.946 ± 0.433
0.673TrpMet: 0.673 ± 0.301
0.748TrpAsn: 0.748 ± 0.292
0.449TrpPro: 0.449 ± 0.172
1.048TrpGln: 1.048 ± 0.329
1.721TrpArg: 1.721 ± 0.341
1.197TrpSer: 1.197 ± 0.269
0.898TrpThr: 0.898 ± 0.253
0.823TrpVal: 0.823 ± 0.242
0.15TrpTrp: 0.15 ± 0.097
0.299TrpTyr: 0.299 ± 0.131
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.544TyrAla: 2.544 ± 0.432
0.449TyrCys: 0.449 ± 0.168
1.571TyrAsp: 1.571 ± 0.326
1.946TyrGlu: 1.946 ± 0.361
0.823TyrPhe: 0.823 ± 0.204
1.721TyrGly: 1.721 ± 0.346
0.524TyrHis: 0.524 ± 0.203
1.646TyrIle: 1.646 ± 0.337
1.048TyrLys: 1.048 ± 0.304
2.694TyrLeu: 2.694 ± 0.566
0.823TyrMet: 0.823 ± 0.244
0.898TyrAsn: 0.898 ± 0.234
1.497TyrPro: 1.497 ± 0.364
1.721TyrGln: 1.721 ± 0.348
2.769TyrArg: 2.769 ± 0.509
1.946TyrSer: 1.946 ± 0.407
1.347TyrThr: 1.347 ± 0.314
1.946TyrVal: 1.946 ± 0.353
0.374TyrTrp: 0.374 ± 0.163
0.973TyrTyr: 0.973 ± 0.237
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (13365 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski