Amino acid dipeptide frequency for Pseudomonas phage phiIBB-PF7A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
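As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping adjacent residue pairs in a set of protein sequences and reports them in per mille. It is not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the one-letter sequence strings, and the choice to normalize by the number of counted pairs are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: frequencies are normalized by the total number of
    dipeptides counted; the source page does not state its normalization.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input (one-letter codes), not taken from the proteome.
freqs = dipeptide_per_mille(["MKAAL", "AAK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The pair order matters: "AK" and "KA" are counted separately, which is why the table below has both an AlaLys and a LysAla entry.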

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better readability all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
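The uniform expectation can be checked with a few lines of arithmetic. The sketch below computes the expected per-mille value for 20 or 21 residue types and labels a few values from the table as over- or underrepresented. The helper names are hypothetical, and the simple greater-than/less-than rule is an assumption: the page does not state exactly how the red and blue marking is decided, and the error estimates are ignored here.

```python
def expected_per_mille(n_residue_types=20):
    """Expected frequency (per mille) if all dipeptides were equally likely."""
    return 1000.0 / n_residue_types ** 2


def classify(value_per_mille, expected):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


expected_20 = expected_per_mille(20)  # 400 dipeptides
expected_21 = expected_per_mille(21)  # 441 dipeptides, Xaa included

# Three values copied from the table (per mille).
examples = {"AlaAla": 11.16, "CysCys": 0.0, "LysPro": 2.945}
labels = {name: classify(v, expected_20) for name, v in examples.items()}
print(expected_20, round(expected_21, 3), labels)
```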

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
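A short sketch of the unit conversion, using hypothetical helper names; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1


ala_ala = 11.16  # per mille, from the table
print(per_mille_to_fraction(ala_ala), per_mille_to_percent(ala_ala))
```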

For more information, see the sequence space article on Wikipedia.

Residues: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry below: dipeptide (first residue, second residue): frequency ± error, in ‰
Ala
AlaAla: 11.16 ± 1.012
AlaCys: 0.698 ± 0.269
AlaAsp: 7.13 ± 0.927
AlaGlu: 5.115 ± 0.451
AlaPhe: 3.41 ± 0.527
AlaGly: 8.525 ± 0.803
AlaHis: 1.628 ± 0.373
AlaIle: 3.953 ± 0.63
AlaLys: 6.588 ± 0.641
AlaLeu: 9.455 ± 0.838
AlaMet: 3.41 ± 0.734
AlaAsn: 2.635 ± 0.409
AlaPro: 3.255 ± 0.61
AlaGln: 5.193 ± 0.931
AlaArg: 6.51 ± 0.759
AlaSer: 4.96 ± 0.895
AlaThr: 5.27 ± 1.093
AlaVal: 7.595 ± 0.678
AlaTrp: 1.395 ± 0.443
AlaTyr: 2.713 ± 0.606
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.008 ± 0.314
CysCys: 0.0 ± 0.0
CysAsp: 0.698 ± 0.339
CysGlu: 0.31 ± 0.164
CysPhe: 0.31 ± 0.182
CysGly: 0.465 ± 0.218
CysHis: 0.155 ± 0.11
CysIle: 0.465 ± 0.201
CysLys: 0.465 ± 0.203
CysLeu: 0.62 ± 0.222
CysMet: 0.31 ± 0.147
CysAsn: 0.388 ± 0.146
CysPro: 0.233 ± 0.127
CysGln: 0.31 ± 0.153
CysArg: 0.775 ± 0.309
CysSer: 0.233 ± 0.141
CysThr: 0.155 ± 0.13
CysVal: 0.388 ± 0.178
CysTrp: 0.31 ± 0.141
CysTyr: 0.543 ± 0.193
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.363 ± 0.799
AspCys: 0.388 ± 0.195
AspAsp: 4.185 ± 0.661
AspGlu: 3.023 ± 0.303
AspPhe: 3.1 ± 0.475
AspGly: 6.588 ± 0.633
AspHis: 1.783 ± 0.424
AspIle: 3.798 ± 0.436
AspLys: 3.1 ± 0.552
AspLeu: 5.193 ± 0.615
AspMet: 2.015 ± 0.44
AspAsn: 1.938 ± 0.261
AspPro: 3.333 ± 0.498
AspGln: 2.635 ± 0.437
AspArg: 4.65 ± 0.714
AspSer: 2.558 ± 0.529
AspThr: 2.713 ± 0.33
AspVal: 4.34 ± 0.644
AspTrp: 1.395 ± 0.354
AspTyr: 1.55 ± 0.343
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.06 ± 0.858
GluCys: 0.698 ± 0.299
GluAsp: 4.263 ± 0.51
GluGlu: 5.038 ± 0.848
GluPhe: 2.635 ± 0.485
GluGly: 5.348 ± 0.594
GluHis: 1.473 ± 0.357
GluIle: 2.79 ± 0.488
GluLys: 2.17 ± 0.383
GluLeu: 4.805 ± 0.655
GluMet: 1.55 ± 0.299
GluAsn: 3.1 ± 0.451
GluPro: 1.86 ± 0.37
GluGln: 2.635 ± 0.463
GluArg: 3.798 ± 0.448
GluSer: 3.488 ± 0.508
GluThr: 3.953 ± 0.468
GluVal: 4.108 ± 0.521
GluTrp: 1.24 ± 0.358
GluTyr: 2.558 ± 0.408
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.635 ± 0.501
PheCys: 0.388 ± 0.291
PheAsp: 3.1 ± 0.467
PheGlu: 2.17 ± 0.374
PhePhe: 1.318 ± 0.281
PheGly: 2.945 ± 0.38
PheHis: 1.008 ± 0.339
PheIle: 1.628 ± 0.406
PheLys: 2.015 ± 0.49
PheLeu: 3.488 ± 0.514
PheMet: 1.55 ± 0.279
PheAsn: 2.17 ± 0.403
PhePro: 1.628 ± 0.346
PheGln: 1.163 ± 0.301
PheArg: 1.783 ± 0.375
PheSer: 1.628 ± 0.313
PheThr: 2.403 ± 0.298
PheVal: 2.325 ± 0.477
PheTrp: 0.388 ± 0.183
PheTyr: 0.775 ± 0.273
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.293 ± 1.237
GlyCys: 0.853 ± 0.314
GlyAsp: 4.495 ± 0.547
GlyGlu: 5.425 ± 0.622
GlyPhe: 3.488 ± 0.628
GlyGly: 6.045 ± 0.788
GlyHis: 2.093 ± 0.371
GlyIle: 4.805 ± 0.783
GlyLys: 5.348 ± 0.789
GlyLeu: 6.278 ± 0.93
GlyMet: 1.86 ± 0.445
GlyAsn: 3.1 ± 0.45
GlyPro: 2.17 ± 0.399
GlyGln: 3.023 ± 0.414
GlyArg: 4.96 ± 0.65
GlySer: 5.735 ± 0.7
GlyThr: 5.193 ± 0.587
GlyVal: 4.728 ± 0.518
GlyTrp: 1.085 ± 0.286
GlyTyr: 2.635 ± 0.501
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.938 ± 0.475
HisCys: 0.233 ± 0.134
HisAsp: 1.163 ± 0.357
HisGlu: 1.55 ± 0.314
HisPhe: 0.93 ± 0.284
HisGly: 1.55 ± 0.329
HisHis: 0.543 ± 0.214
HisIle: 1.085 ± 0.283
HisLys: 1.628 ± 0.329
HisLeu: 2.015 ± 0.417
HisMet: 1.008 ± 0.234
HisAsn: 0.93 ± 0.223
HisPro: 0.465 ± 0.192
HisGln: 0.62 ± 0.206
HisArg: 1.783 ± 0.469
HisSer: 1.24 ± 0.263
HisThr: 1.085 ± 0.292
HisVal: 1.473 ± 0.396
HisTrp: 0.388 ± 0.13
HisTyr: 0.93 ± 0.259
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.1 ± 0.437
IleCys: 0.698 ± 0.207
IleAsp: 3.023 ± 0.503
IleGlu: 3.178 ± 0.41
IlePhe: 0.698 ± 0.217
IleGly: 3.255 ± 0.444
IleHis: 0.93 ± 0.312
IleIle: 1.86 ± 0.315
IleLys: 3.41 ± 0.696
IleLeu: 4.03 ± 0.557
IleMet: 1.24 ± 0.348
IleAsn: 1.783 ± 0.439
IlePro: 2.558 ± 0.467
IleGln: 2.868 ± 0.451
IleArg: 2.868 ± 0.339
IleSer: 2.17 ± 0.379
IleThr: 3.023 ± 0.405
IleVal: 3.1 ± 0.567
IleTrp: 0.388 ± 0.172
IleTyr: 1.163 ± 0.271
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.82 ± 0.867
LysCys: 0.465 ± 0.172
LysAsp: 4.883 ± 0.598
LysGlu: 3.488 ± 0.521
LysPhe: 1.86 ± 0.318
LysGly: 5.038 ± 0.663
LysHis: 2.015 ± 0.509
LysIle: 2.093 ± 0.362
LysLys: 2.945 ± 0.709
LysLeu: 5.27 ± 0.507
LysMet: 1.938 ± 0.284
LysAsn: 1.55 ± 0.398
LysPro: 2.945 ± 0.538
LysGln: 1.86 ± 0.394
LysArg: 3.41 ± 0.703
LysSer: 1.938 ± 0.389
LysThr: 2.79 ± 0.453
LysVal: 5.115 ± 0.695
LysTrp: 0.388 ± 0.152
LysTyr: 1.938 ± 0.302
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.905 ± 0.786
LeuCys: 0.31 ± 0.135
LeuAsp: 4.805 ± 0.747
LeuGlu: 6.51 ± 0.871
LeuPhe: 2.79 ± 0.562
LeuGly: 5.115 ± 0.748
LeuHis: 1.705 ± 0.303
LeuIle: 4.185 ± 0.614
LeuLys: 6.2 ± 0.623
LeuLeu: 4.65 ± 0.523
LeuMet: 2.79 ± 0.444
LeuAsn: 4.03 ± 0.565
LeuPro: 2.945 ± 0.49
LeuGln: 3.72 ± 0.624
LeuArg: 5.038 ± 0.54
LeuSer: 4.573 ± 0.541
LeuThr: 5.115 ± 0.7
LeuVal: 6.045 ± 0.509
LeuTrp: 0.775 ± 0.259
LeuTyr: 2.558 ± 0.492
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.263 ± 0.614
MetCys: 0.233 ± 0.133
MetAsp: 1.938 ± 0.359
MetGlu: 2.558 ± 0.509
MetPhe: 0.775 ± 0.242
MetGly: 2.945 ± 0.439
MetHis: 0.775 ± 0.261
MetIle: 1.318 ± 0.247
MetLys: 1.395 ± 0.378
MetLeu: 2.325 ± 0.391
MetMet: 0.62 ± 0.234
MetAsn: 0.853 ± 0.212
MetPro: 1.55 ± 0.328
MetGln: 1.163 ± 0.326
MetArg: 0.853 ± 0.225
MetSer: 2.325 ± 0.407
MetThr: 2.48 ± 0.536
MetVal: 2.17 ± 0.261
MetTrp: 0.465 ± 0.185
MetTyr: 0.388 ± 0.187
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.263 ± 0.542
AsnCys: 0.233 ± 0.141
AsnAsp: 1.783 ± 0.36
AsnGlu: 2.17 ± 0.407
AsnPhe: 2.015 ± 0.4
AsnGly: 3.178 ± 0.466
AsnHis: 1.318 ± 0.318
AsnIle: 1.705 ± 0.319
AsnLys: 1.783 ± 0.352
AsnLeu: 3.333 ± 0.369
AsnMet: 1.318 ± 0.282
AsnAsn: 0.853 ± 0.25
AsnPro: 2.79 ± 0.523
AsnGln: 1.318 ± 0.275
AsnArg: 2.093 ± 0.553
AsnSer: 1.318 ± 0.354
AsnThr: 1.86 ± 0.422
AsnVal: 3.178 ± 0.574
AsnTrp: 0.31 ± 0.137
AsnTyr: 1.24 ± 0.403
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.325 ± 0.411
ProCys: 0.543 ± 0.163
ProAsp: 3.178 ± 0.427
ProGlu: 3.565 ± 0.596
ProPhe: 1.55 ± 0.32
ProGly: 2.868 ± 0.537
ProHis: 0.775 ± 0.219
ProIle: 1.085 ± 0.283
ProLys: 2.17 ± 0.458
ProLeu: 2.868 ± 0.422
ProMet: 1.008 ± 0.285
ProAsn: 2.403 ± 0.324
ProPro: 1.085 ± 0.432
ProGln: 1.86 ± 0.383
ProArg: 2.093 ± 0.396
ProSer: 2.17 ± 0.363
ProThr: 2.635 ± 0.434
ProVal: 3.41 ± 0.694
ProTrp: 0.465 ± 0.242
ProTyr: 1.008 ± 0.327
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.503 ± 1.037
GlnCys: 0.078 ± 0.088
GlnAsp: 2.868 ± 0.686
GlnGlu: 2.17 ± 0.359
GlnPhe: 2.093 ± 0.331
GlnGly: 4.65 ± 0.783
GlnHis: 0.775 ± 0.207
GlnIle: 2.17 ± 0.428
GlnLys: 2.093 ± 0.275
GlnLeu: 3.875 ± 0.474
GlnMet: 1.628 ± 0.377
GlnAsn: 1.55 ± 0.42
GlnPro: 0.93 ± 0.28
GlnGln: 2.015 ± 0.427
GlnArg: 2.713 ± 0.634
GlnSer: 2.248 ± 0.595
GlnThr: 2.015 ± 0.323
GlnVal: 2.713 ± 0.436
GlnTrp: 0.465 ± 0.215
GlnTyr: 1.008 ± 0.371
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.96 ± 0.573
ArgCys: 0.465 ± 0.237
ArgAsp: 3.798 ± 0.682
ArgGlu: 3.565 ± 0.524
ArgPhe: 2.403 ± 0.528
ArgGly: 5.348 ± 0.552
ArgHis: 0.93 ± 0.353
ArgIle: 2.403 ± 0.392
ArgLys: 3.255 ± 0.483
ArgLeu: 5.425 ± 0.561
ArgMet: 2.015 ± 0.401
ArgAsn: 2.713 ± 0.454
ArgPro: 2.093 ± 0.431
ArgGln: 3.023 ± 0.581
ArgArg: 3.488 ± 0.486
ArgSer: 3.798 ± 0.468
ArgThr: 2.868 ± 0.456
ArgVal: 3.643 ± 0.551
ArgTrp: 0.698 ± 0.255
ArgTyr: 1.938 ± 0.41
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.425 ± 0.97
SerCys: 0.465 ± 0.201
SerAsp: 4.573 ± 0.539
SerGlu: 3.565 ± 0.469
SerPhe: 2.248 ± 0.46
SerGly: 4.108 ± 0.542
SerHis: 1.55 ± 0.348
SerIle: 2.093 ± 0.388
SerLys: 3.565 ± 0.589
SerLeu: 4.263 ± 0.653
SerMet: 1.473 ± 0.363
SerAsn: 1.938 ± 0.442
SerPro: 2.48 ± 0.378
SerGln: 1.783 ± 0.356
SerArg: 2.48 ± 0.399
SerSer: 3.333 ± 0.523
SerThr: 2.248 ± 0.547
SerVal: 3.798 ± 0.543
SerTrp: 0.775 ± 0.216
SerTyr: 1.86 ± 0.442
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.193 ± 0.618
ThrCys: 0.31 ± 0.166
ThrAsp: 3.488 ± 0.535
ThrGlu: 4.263 ± 0.562
ThrPhe: 1.628 ± 0.401
ThrGly: 5.038 ± 0.762
ThrHis: 1.008 ± 0.26
ThrIle: 3.565 ± 0.547
ThrLys: 4.108 ± 0.572
ThrLeu: 4.65 ± 0.682
ThrMet: 1.783 ± 0.386
ThrAsn: 1.395 ± 0.293
ThrPro: 2.325 ± 0.469
ThrGln: 2.79 ± 0.561
ThrArg: 2.403 ± 0.548
ThrSer: 3.1 ± 0.48
ThrThr: 3.41 ± 0.719
ThrVal: 4.03 ± 0.893
ThrTrp: 0.465 ± 0.192
ThrTyr: 1.783 ± 0.371
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.123 ± 0.746
ValCys: 0.543 ± 0.242
ValAsp: 3.565 ± 0.717
ValGlu: 5.58 ± 0.549
ValPhe: 2.248 ± 0.499
ValGly: 5.193 ± 0.749
ValHis: 1.318 ± 0.272
ValIle: 2.868 ± 0.543
ValLys: 3.41 ± 0.678
ValLeu: 5.658 ± 0.616
ValMet: 2.015 ± 0.426
ValAsn: 3.255 ± 0.506
ValPro: 3.023 ± 0.46
ValGln: 3.488 ± 0.492
ValArg: 4.185 ± 0.554
ValSer: 4.728 ± 0.454
ValThr: 4.883 ± 0.569
ValVal: 4.495 ± 0.697
ValTrp: 0.93 ± 0.223
ValTyr: 2.015 ± 0.486
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.395 ± 0.442
TrpCys: 0.078 ± 0.085
TrpAsp: 0.543 ± 0.203
TrpGlu: 0.465 ± 0.252
TrpPhe: 0.31 ± 0.149
TrpGly: 0.853 ± 0.153
TrpHis: 0.465 ± 0.183
TrpIle: 0.31 ± 0.122
TrpLys: 1.473 ± 0.373
TrpLeu: 1.395 ± 0.418
TrpMet: 0.388 ± 0.164
TrpAsn: 0.543 ± 0.187
TrpPro: 0.465 ± 0.21
TrpGln: 0.698 ± 0.169
TrpArg: 0.775 ± 0.233
TrpSer: 0.775 ± 0.28
TrpThr: 0.853 ± 0.237
TrpVal: 1.163 ± 0.376
TrpTrp: 0.078 ± 0.075
TrpTyr: 0.155 ± 0.117
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.868 ± 0.659
TyrCys: 0.465 ± 0.203
TyrAsp: 2.17 ± 0.347
TyrGlu: 1.86 ± 0.322
TyrPhe: 0.698 ± 0.225
TyrGly: 2.558 ± 0.471
TyrHis: 0.31 ± 0.161
TyrIle: 1.163 ± 0.283
TyrLys: 1.628 ± 0.42
TyrLeu: 2.17 ± 0.364
TyrMet: 1.395 ± 0.372
TyrAsn: 0.93 ± 0.307
TyrPro: 0.93 ± 0.27
TyrGln: 1.473 ± 0.404
TyrArg: 2.17 ± 0.383
TyrSer: 1.705 ± 0.366
TyrThr: 1.783 ± 0.376
TyrVal: 1.705 ± 0.439
TyrTrp: 0.775 ± 0.26
TyrTyr: 0.465 ± 0.199
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (12904 amino acids)
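Given the total of 12904 amino acids, the per-mille values can be turned back into approximate occurrence counts. The sketch below assumes the frequencies were normalized by the total residue count. That assumption is an inference, not something the page states: it is made only because the tabulated values then map to near-integer counts. The function name `approx_count` is hypothetical.

```python
TOTAL_RESIDUES = 12904  # from the statistics line above


def approx_count(value_per_mille, total=TOTAL_RESIDUES):
    """Approximate number of occurrences behind a per-mille value.

    Assumption: value = 1000 * count / total, with total being the number
    of amino acids in the proteome (inferred, not stated by the source).
    """
    return round(value_per_mille * total / 1000)


# Values copied from the table (per mille).
for name, value in {"AlaAla": 11.16, "AlaCys": 0.698, "GlnCys": 0.078}.items():
    print(name, approx_count(value))
```

Under this assumption the smallest non-zero entries in the table (0.078‰) would correspond to a single occurrence, which is a useful reminder of how few observations stand behind the rare dipeptides in a 50-protein proteome.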

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
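Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement. A minimal sketch of that idea follows; it is not the database's actual procedure. The function names, the normalization by counted pairs, the fixed seed, and the use of the sample standard deviation as the reported error are all assumptions.

```python
import random
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    count = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not the phage proteins.
toy = ["MKAAL", "AAK", "GGSGG", "MAAAK"]
print(bootstrap_error(toy, "AA"))
```

A dipeptide that never occurs has zero frequency in every resample, so its estimated error is also zero, which matches the 0.0 ± 0.0 entries for the Xaa pairs.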

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski