Amino acid dipepetide frequency for Pseudomonas phage PPpW-4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.699AlaAla: 11.699 ± 1.245
0.557AlaCys: 0.557 ± 0.241
5.491AlaAsp: 5.491 ± 0.561
6.765AlaGlu: 6.765 ± 0.709
3.581AlaPhe: 3.581 ± 0.532
7.959AlaGly: 7.959 ± 0.636
1.592AlaHis: 1.592 ± 0.38
4.377AlaIle: 4.377 ± 0.573
6.526AlaLys: 6.526 ± 0.985
8.277AlaLeu: 8.277 ± 0.872
3.9AlaMet: 3.9 ± 0.633
3.024AlaAsn: 3.024 ± 0.61
3.263AlaPro: 3.263 ± 0.487
4.536AlaGln: 4.536 ± 0.618
7.004AlaArg: 7.004 ± 0.794
5.651AlaSer: 5.651 ± 0.823
5.969AlaThr: 5.969 ± 1.104
6.765AlaVal: 6.765 ± 0.644
1.114AlaTrp: 1.114 ± 0.445
3.661AlaTyr: 3.661 ± 0.744
0.0AlaXaa: 0.0 ± 0.0
Cys
1.273CysAla: 1.273 ± 0.28
0.0CysCys: 0.0 ± 0.0
0.557CysAsp: 0.557 ± 0.306
0.557CysGlu: 0.557 ± 0.258
0.637CysPhe: 0.637 ± 0.243
0.796CysGly: 0.796 ± 0.232
0.318CysHis: 0.318 ± 0.167
0.796CysIle: 0.796 ± 0.269
0.318CysLys: 0.318 ± 0.156
0.955CysLeu: 0.955 ± 0.331
0.398CysMet: 0.398 ± 0.256
0.239CysAsn: 0.239 ± 0.172
0.398CysPro: 0.398 ± 0.206
0.637CysGln: 0.637 ± 0.266
0.955CysArg: 0.955 ± 0.332
0.159CysSer: 0.159 ± 0.111
0.159CysThr: 0.159 ± 0.121
0.637CysVal: 0.637 ± 0.339
0.08CysTrp: 0.08 ± 0.09
0.159CysTyr: 0.159 ± 0.114
0.0CysXaa: 0.0 ± 0.0
Asp
7.163AspAla: 7.163 ± 0.824
0.637AspCys: 0.637 ± 0.283
4.457AspAsp: 4.457 ± 0.589
3.104AspGlu: 3.104 ± 0.504
3.82AspPhe: 3.82 ± 0.463
6.208AspGly: 6.208 ± 0.982
1.83AspHis: 1.83 ± 0.41
3.661AspIle: 3.661 ± 0.672
2.626AspLys: 2.626 ± 0.582
4.059AspLeu: 4.059 ± 0.666
1.751AspMet: 1.751 ± 0.358
2.547AspAsn: 2.547 ± 0.508
2.388AspPro: 2.388 ± 0.647
2.388AspGln: 2.388 ± 0.428
4.138AspArg: 4.138 ± 0.515
2.626AspSer: 2.626 ± 0.619
3.343AspThr: 3.343 ± 0.395
3.661AspVal: 3.661 ± 0.727
1.035AspTrp: 1.035 ± 0.283
1.671AspTyr: 1.671 ± 0.312
0.0AspXaa: 0.0 ± 0.0
Glu
7.481GluAla: 7.481 ± 0.736
0.716GluCys: 0.716 ± 0.285
3.979GluAsp: 3.979 ± 0.614
3.581GluGlu: 3.581 ± 0.66
2.626GluPhe: 2.626 ± 0.542
5.571GluGly: 5.571 ± 0.732
1.353GluHis: 1.353 ± 0.387
3.82GluIle: 3.82 ± 0.584
2.547GluLys: 2.547 ± 0.429
4.696GluLeu: 4.696 ± 0.691
1.91GluMet: 1.91 ± 0.359
2.228GluAsn: 2.228 ± 0.433
1.592GluPro: 1.592 ± 0.447
2.467GluGln: 2.467 ± 0.577
4.377GluArg: 4.377 ± 0.569
3.82GluSer: 3.82 ± 0.716
4.616GluThr: 4.616 ± 0.547
4.138GluVal: 4.138 ± 0.794
0.955GluTrp: 0.955 ± 0.355
1.751GluTyr: 1.751 ± 0.36
0.0GluXaa: 0.0 ± 0.0
Phe
2.149PheAla: 2.149 ± 0.561
0.716PheCys: 0.716 ± 0.241
2.706PheAsp: 2.706 ± 0.464
1.671PheGlu: 1.671 ± 0.335
0.796PhePhe: 0.796 ± 0.238
2.786PheGly: 2.786 ± 0.4
0.875PheHis: 0.875 ± 0.266
1.671PheIle: 1.671 ± 0.431
1.592PheLys: 1.592 ± 0.366
3.741PheLeu: 3.741 ± 0.466
0.716PheMet: 0.716 ± 0.192
1.83PheAsn: 1.83 ± 0.506
1.353PhePro: 1.353 ± 0.408
0.955PheGln: 0.955 ± 0.224
2.308PheArg: 2.308 ± 0.441
2.547PheSer: 2.547 ± 0.462
3.263PheThr: 3.263 ± 0.521
2.388PheVal: 2.388 ± 0.397
0.637PheTrp: 0.637 ± 0.221
0.637PheTyr: 0.637 ± 0.236
0.0PheXaa: 0.0 ± 0.0
Gly
8.197GlyAla: 8.197 ± 1.175
1.671GlyCys: 1.671 ± 0.503
5.889GlyAsp: 5.889 ± 0.716
4.457GlyGlu: 4.457 ± 0.514
3.661GlyPhe: 3.661 ± 0.619
6.446GlyGly: 6.446 ± 0.665
1.114GlyHis: 1.114 ± 0.304
5.014GlyIle: 5.014 ± 0.744
4.298GlyLys: 4.298 ± 0.793
6.049GlyLeu: 6.049 ± 0.827
1.353GlyMet: 1.353 ± 0.327
3.343GlyAsn: 3.343 ± 0.498
2.626GlyPro: 2.626 ± 0.473
3.024GlyGln: 3.024 ± 0.478
5.094GlyArg: 5.094 ± 0.68
5.491GlySer: 5.491 ± 0.62
4.138GlyThr: 4.138 ± 0.69
7.163GlyVal: 7.163 ± 0.929
0.955GlyTrp: 0.955 ± 0.297
3.104GlyTyr: 3.104 ± 0.555
0.0GlyXaa: 0.0 ± 0.0
His
1.353HisAla: 1.353 ± 0.32
0.478HisCys: 0.478 ± 0.238
1.433HisAsp: 1.433 ± 0.428
1.353HisGlu: 1.353 ± 0.358
0.716HisPhe: 0.716 ± 0.26
1.91HisGly: 1.91 ± 0.441
0.398HisHis: 0.398 ± 0.17
1.035HisIle: 1.035 ± 0.3
1.194HisLys: 1.194 ± 0.32
2.388HisLeu: 2.388 ± 0.592
0.716HisMet: 0.716 ± 0.234
0.478HisAsn: 0.478 ± 0.183
0.875HisPro: 0.875 ± 0.326
0.478HisGln: 0.478 ± 0.212
1.273HisArg: 1.273 ± 0.461
1.194HisSer: 1.194 ± 0.311
0.716HisThr: 0.716 ± 0.219
1.273HisVal: 1.273 ± 0.313
0.398HisTrp: 0.398 ± 0.147
0.875HisTyr: 0.875 ± 0.25
0.0HisXaa: 0.0 ± 0.0
Ile
4.934IleAla: 4.934 ± 0.621
0.796IleCys: 0.796 ± 0.271
3.104IleAsp: 3.104 ± 0.47
3.263IleGlu: 3.263 ± 0.497
0.637IlePhe: 0.637 ± 0.236
3.661IleGly: 3.661 ± 0.593
0.637IleHis: 0.637 ± 0.198
2.547IleIle: 2.547 ± 0.375
3.104IleLys: 3.104 ± 0.607
4.536IleLeu: 4.536 ± 0.57
0.955IleMet: 0.955 ± 0.252
1.99IleAsn: 1.99 ± 0.479
2.149IlePro: 2.149 ± 0.338
2.626IleGln: 2.626 ± 0.454
2.786IleArg: 2.786 ± 0.397
2.547IleSer: 2.547 ± 0.397
3.661IleThr: 3.661 ± 0.675
3.024IleVal: 3.024 ± 0.475
0.398IleTrp: 0.398 ± 0.147
1.353IleTyr: 1.353 ± 0.356
0.0IleXaa: 0.0 ± 0.0
Lys
6.367LysAla: 6.367 ± 0.927
0.318LysCys: 0.318 ± 0.176
4.138LysAsp: 4.138 ± 0.559
3.581LysGlu: 3.581 ± 0.609
1.99LysPhe: 1.99 ± 0.394
4.855LysGly: 4.855 ± 0.916
1.273LysHis: 1.273 ± 0.433
2.069LysIle: 2.069 ± 0.413
2.308LysLys: 2.308 ± 0.567
4.536LysLeu: 4.536 ± 0.677
1.512LysMet: 1.512 ± 0.297
1.433LysAsn: 1.433 ± 0.423
2.945LysPro: 2.945 ± 0.639
2.149LysGln: 2.149 ± 0.378
3.183LysArg: 3.183 ± 0.483
2.626LysSer: 2.626 ± 0.457
2.945LysThr: 2.945 ± 0.598
4.377LysVal: 4.377 ± 0.774
1.035LysTrp: 1.035 ± 0.349
1.671LysTyr: 1.671 ± 0.412
0.0LysXaa: 0.0 ± 0.0
Leu
7.959LeuAla: 7.959 ± 0.738
0.478LeuCys: 0.478 ± 0.216
5.332LeuAsp: 5.332 ± 0.761
5.332LeuGlu: 5.332 ± 0.72
2.786LeuPhe: 2.786 ± 0.376
6.128LeuGly: 6.128 ± 0.551
1.751LeuHis: 1.751 ± 0.377
3.343LeuIle: 3.343 ± 0.499
6.287LeuLys: 6.287 ± 0.562
5.094LeuLeu: 5.094 ± 0.622
2.388LeuMet: 2.388 ± 0.404
3.661LeuAsn: 3.661 ± 0.513
2.945LeuPro: 2.945 ± 0.505
4.138LeuGln: 4.138 ± 0.501
3.82LeuArg: 3.82 ± 0.528
4.934LeuSer: 4.934 ± 0.811
4.855LeuThr: 4.855 ± 0.612
6.367LeuVal: 6.367 ± 0.689
1.114LeuTrp: 1.114 ± 0.294
2.308LeuTyr: 2.308 ± 0.511
0.0LeuXaa: 0.0 ± 0.0
Met
3.741MetAla: 3.741 ± 0.606
0.159MetCys: 0.159 ± 0.117
1.353MetAsp: 1.353 ± 0.44
2.069MetGlu: 2.069 ± 0.38
0.637MetPhe: 0.637 ± 0.191
1.99MetGly: 1.99 ± 0.404
0.398MetHis: 0.398 ± 0.214
1.114MetIle: 1.114 ± 0.338
1.592MetLys: 1.592 ± 0.269
2.626MetLeu: 2.626 ± 0.43
0.796MetMet: 0.796 ± 0.293
1.273MetAsn: 1.273 ± 0.407
1.671MetPro: 1.671 ± 0.415
0.716MetGln: 0.716 ± 0.247
0.875MetArg: 0.875 ± 0.361
2.149MetSer: 2.149 ± 0.421
2.388MetThr: 2.388 ± 0.452
2.069MetVal: 2.069 ± 0.343
0.398MetTrp: 0.398 ± 0.19
0.557MetTyr: 0.557 ± 0.161
0.0MetXaa: 0.0 ± 0.0
Asn
3.741AsnAla: 3.741 ± 0.492
0.159AsnCys: 0.159 ± 0.109
2.228AsnAsp: 2.228 ± 0.555
1.99AsnGlu: 1.99 ± 0.352
1.433AsnPhe: 1.433 ± 0.356
3.581AsnGly: 3.581 ± 0.591
0.796AsnHis: 0.796 ± 0.289
2.149AsnIle: 2.149 ± 0.378
1.592AsnLys: 1.592 ± 0.363
3.422AsnLeu: 3.422 ± 0.645
1.194AsnMet: 1.194 ± 0.355
1.035AsnAsn: 1.035 ± 0.257
2.467AsnPro: 2.467 ± 0.455
1.035AsnGln: 1.035 ± 0.232
2.149AsnArg: 2.149 ± 0.44
2.228AsnSer: 2.228 ± 0.511
1.91AsnThr: 1.91 ± 0.466
3.581AsnVal: 3.581 ± 0.776
0.716AsnTrp: 0.716 ± 0.286
1.671AsnTyr: 1.671 ± 0.417
0.0AsnXaa: 0.0 ± 0.0
Pro
3.422ProAla: 3.422 ± 0.567
0.318ProCys: 0.318 ± 0.141
3.581ProAsp: 3.581 ± 0.511
4.218ProGlu: 4.218 ± 0.618
1.433ProPhe: 1.433 ± 0.313
3.024ProGly: 3.024 ± 0.537
0.796ProHis: 0.796 ± 0.252
1.512ProIle: 1.512 ± 0.232
2.547ProLys: 2.547 ± 0.52
2.706ProLeu: 2.706 ± 0.597
1.194ProMet: 1.194 ± 0.301
1.91ProAsn: 1.91 ± 0.346
0.955ProPro: 0.955 ± 0.318
1.592ProGln: 1.592 ± 0.378
2.626ProArg: 2.626 ± 0.458
1.91ProSer: 1.91 ± 0.412
1.83ProThr: 1.83 ± 0.422
2.547ProVal: 2.547 ± 0.484
0.398ProTrp: 0.398 ± 0.204
0.955ProTyr: 0.955 ± 0.267
0.0ProXaa: 0.0 ± 0.0
Gln
4.775GlnAla: 4.775 ± 0.806
0.318GlnCys: 0.318 ± 0.169
2.228GlnAsp: 2.228 ± 0.434
2.626GlnGlu: 2.626 ± 0.541
1.671GlnPhe: 1.671 ± 0.38
3.422GlnGly: 3.422 ± 0.522
0.716GlnHis: 0.716 ± 0.284
2.308GlnIle: 2.308 ± 0.332
2.308GlnLys: 2.308 ± 0.437
4.059GlnLeu: 4.059 ± 0.481
1.512GlnMet: 1.512 ± 0.409
1.512GlnAsn: 1.512 ± 0.33
1.114GlnPro: 1.114 ± 0.266
1.99GlnGln: 1.99 ± 0.442
2.228GlnArg: 2.228 ± 0.544
2.149GlnSer: 2.149 ± 0.367
1.671GlnThr: 1.671 ± 0.493
2.547GlnVal: 2.547 ± 0.427
1.114GlnTrp: 1.114 ± 0.417
1.273GlnTyr: 1.273 ± 0.281
0.0GlnXaa: 0.0 ± 0.0
Arg
5.094ArgAla: 5.094 ± 0.831
0.637ArgCys: 0.637 ± 0.284
3.502ArgAsp: 3.502 ± 0.47
3.82ArgGlu: 3.82 ± 0.702
2.069ArgPhe: 2.069 ± 0.453
4.934ArgGly: 4.934 ± 0.604
0.875ArgHis: 0.875 ± 0.323
3.183ArgIle: 3.183 ± 0.462
3.581ArgLys: 3.581 ± 0.583
5.014ArgLeu: 5.014 ± 0.636
2.308ArgMet: 2.308 ± 0.407
3.661ArgAsn: 3.661 ± 0.523
2.069ArgPro: 2.069 ± 0.343
3.422ArgGln: 3.422 ± 0.5
3.104ArgArg: 3.104 ± 0.563
3.82ArgSer: 3.82 ± 0.629
2.308ArgThr: 2.308 ± 0.443
3.024ArgVal: 3.024 ± 0.458
0.557ArgTrp: 0.557 ± 0.204
2.069ArgTyr: 2.069 ± 0.319
0.0ArgXaa: 0.0 ± 0.0
Ser
6.128SerAla: 6.128 ± 0.807
0.796SerCys: 0.796 ± 0.243
3.183SerAsp: 3.183 ± 0.435
3.581SerGlu: 3.581 ± 0.572
1.91SerPhe: 1.91 ± 0.399
5.332SerGly: 5.332 ± 0.824
1.671SerHis: 1.671 ± 0.366
2.308SerIle: 2.308 ± 0.436
3.104SerLys: 3.104 ± 0.484
3.502SerLeu: 3.502 ± 0.517
1.194SerMet: 1.194 ± 0.346
2.228SerAsn: 2.228 ± 0.56
2.388SerPro: 2.388 ± 0.448
2.228SerGln: 2.228 ± 0.374
3.183SerArg: 3.183 ± 0.432
3.581SerSer: 3.581 ± 0.604
3.422SerThr: 3.422 ± 0.568
3.979SerVal: 3.979 ± 0.645
1.035SerTrp: 1.035 ± 0.217
2.069SerTyr: 2.069 ± 0.511
0.0SerXaa: 0.0 ± 0.0
Thr
4.457ThrAla: 4.457 ± 0.577
0.239ThrCys: 0.239 ± 0.165
3.581ThrAsp: 3.581 ± 0.397
4.536ThrGlu: 4.536 ± 0.548
1.592ThrPhe: 1.592 ± 0.417
6.208ThrGly: 6.208 ± 0.906
1.114ThrHis: 1.114 ± 0.302
3.104ThrIle: 3.104 ± 0.433
2.945ThrLys: 2.945 ± 0.543
5.73ThrLeu: 5.73 ± 0.5
1.114ThrMet: 1.114 ± 0.267
1.83ThrAsn: 1.83 ± 0.335
3.581ThrPro: 3.581 ± 0.578
2.149ThrGln: 2.149 ± 0.518
2.467ThrArg: 2.467 ± 0.453
3.661ThrSer: 3.661 ± 0.716
3.183ThrThr: 3.183 ± 0.588
4.616ThrVal: 4.616 ± 0.904
0.159ThrTrp: 0.159 ± 0.12
1.433ThrTyr: 1.433 ± 0.317
0.0ThrXaa: 0.0 ± 0.0
Val
6.844ValAla: 6.844 ± 0.774
0.478ValCys: 0.478 ± 0.226
4.138ValAsp: 4.138 ± 0.489
4.536ValGlu: 4.536 ± 0.525
1.83ValPhe: 1.83 ± 0.391
4.696ValGly: 4.696 ± 0.511
2.069ValHis: 2.069 ± 0.408
3.263ValIle: 3.263 ± 0.583
4.298ValLys: 4.298 ± 0.656
5.571ValLeu: 5.571 ± 0.63
1.91ValMet: 1.91 ± 0.447
3.183ValAsn: 3.183 ± 0.484
3.024ValPro: 3.024 ± 0.358
3.024ValGln: 3.024 ± 0.45
4.775ValArg: 4.775 ± 0.515
3.502ValSer: 3.502 ± 0.698
4.775ValThr: 4.775 ± 0.62
5.173ValVal: 5.173 ± 0.684
1.035ValTrp: 1.035 ± 0.235
2.308ValTyr: 2.308 ± 0.35
0.0ValXaa: 0.0 ± 0.0
Trp
1.592TrpAla: 1.592 ± 0.287
0.08TrpCys: 0.08 ± 0.087
0.716TrpAsp: 0.716 ± 0.253
0.557TrpGlu: 0.557 ± 0.239
0.875TrpPhe: 0.875 ± 0.306
1.114TrpGly: 1.114 ± 0.274
0.478TrpHis: 0.478 ± 0.198
0.716TrpIle: 0.716 ± 0.215
1.194TrpLys: 1.194 ± 0.353
1.353TrpLeu: 1.353 ± 0.323
0.318TrpMet: 0.318 ± 0.137
0.398TrpAsn: 0.398 ± 0.164
0.239TrpPro: 0.239 ± 0.117
0.796TrpGln: 0.796 ± 0.232
0.478TrpArg: 0.478 ± 0.195
0.796TrpSer: 0.796 ± 0.309
0.875TrpThr: 0.875 ± 0.227
0.796TrpVal: 0.796 ± 0.28
0.159TrpTrp: 0.159 ± 0.109
0.398TrpTyr: 0.398 ± 0.216
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.343TyrAla: 3.343 ± 0.624
0.478TyrCys: 0.478 ± 0.238
1.512TyrAsp: 1.512 ± 0.417
2.706TyrGlu: 2.706 ± 0.594
0.637TyrPhe: 0.637 ± 0.226
2.467TyrGly: 2.467 ± 0.413
0.478TyrHis: 0.478 ± 0.205
0.716TyrIle: 0.716 ± 0.205
1.353TyrLys: 1.353 ± 0.421
2.706TyrLeu: 2.706 ± 0.566
1.353TyrMet: 1.353 ± 0.313
1.273TyrAsn: 1.273 ± 0.27
1.433TyrPro: 1.433 ± 0.343
1.194TyrGln: 1.194 ± 0.321
2.228TyrArg: 2.228 ± 0.316
1.433TyrSer: 1.433 ± 0.313
1.671TyrThr: 1.671 ± 0.353
2.308TyrVal: 2.308 ± 0.395
0.637TyrTrp: 0.637 ± 0.202
0.398TyrTyr: 0.398 ± 0.188
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (12566 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski