Amino acid dipepetide frequency for Pseudomonas phage PPpW-3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.872AlaAla: 19.872 ± 1.91
0.883AlaCys: 0.883 ± 0.272
7.654AlaAsp: 7.654 ± 0.835
7.875AlaGlu: 7.875 ± 1.064
3.827AlaPhe: 3.827 ± 0.617
10.746AlaGly: 10.746 ± 1.089
1.914AlaHis: 1.914 ± 0.438
5.667AlaIle: 5.667 ± 0.572
7.581AlaLys: 7.581 ± 0.837
8.611AlaLeu: 8.611 ± 0.773
4.416AlaMet: 4.416 ± 0.629
5.741AlaAsn: 5.741 ± 0.514
3.901AlaPro: 3.901 ± 0.6
5.888AlaGln: 5.888 ± 0.77
8.758AlaArg: 8.758 ± 0.893
5.888AlaSer: 5.888 ± 0.727
5.962AlaThr: 5.962 ± 0.836
9.126AlaVal: 9.126 ± 1.003
2.282AlaTrp: 2.282 ± 0.509
2.797AlaTyr: 2.797 ± 0.535
0.0AlaXaa: 0.0 ± 0.0
Cys
1.325CysAla: 1.325 ± 0.321
0.147CysCys: 0.147 ± 0.107
0.883CysAsp: 0.883 ± 0.247
0.736CysGlu: 0.736 ± 0.241
0.074CysPhe: 0.074 ± 0.071
1.178CysGly: 1.178 ± 0.306
0.368CysHis: 0.368 ± 0.163
0.294CysIle: 0.294 ± 0.148
0.883CysLys: 0.883 ± 0.256
0.294CysLeu: 0.294 ± 0.152
0.147CysMet: 0.147 ± 0.112
0.221CysAsn: 0.221 ± 0.123
0.81CysPro: 0.81 ± 0.223
0.147CysGln: 0.147 ± 0.085
0.957CysArg: 0.957 ± 0.239
0.81CysSer: 0.81 ± 0.219
0.442CysThr: 0.442 ± 0.177
0.589CysVal: 0.589 ± 0.226
0.221CysTrp: 0.221 ± 0.105
0.294CysTyr: 0.294 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
7.949AspAla: 7.949 ± 0.823
0.883AspCys: 0.883 ± 0.205
3.606AspAsp: 3.606 ± 0.681
3.386AspGlu: 3.386 ± 0.545
1.84AspPhe: 1.84 ± 0.485
6.698AspGly: 6.698 ± 0.709
1.03AspHis: 1.03 ± 0.359
2.282AspIle: 2.282 ± 0.378
3.091AspLys: 3.091 ± 0.5
5.667AspLeu: 5.667 ± 0.562
1.398AspMet: 1.398 ± 0.331
1.914AspAsn: 1.914 ± 0.311
2.355AspPro: 2.355 ± 0.393
3.165AspGln: 3.165 ± 0.492
2.65AspArg: 2.65 ± 0.463
2.355AspSer: 2.355 ± 0.412
1.914AspThr: 1.914 ± 0.39
4.122AspVal: 4.122 ± 0.603
1.104AspTrp: 1.104 ± 0.255
1.84AspTyr: 1.84 ± 0.352
0.0AspXaa: 0.0 ± 0.0
Glu
6.256GluAla: 6.256 ± 0.792
0.957GluCys: 0.957 ± 0.267
2.944GluAsp: 2.944 ± 0.558
3.754GluGlu: 3.754 ± 0.654
2.061GluPhe: 2.061 ± 0.517
5.152GluGly: 5.152 ± 0.657
1.546GluHis: 1.546 ± 0.319
3.386GluIle: 3.386 ± 0.598
2.282GluLys: 2.282 ± 0.46
6.256GluLeu: 6.256 ± 0.761
1.766GluMet: 1.766 ± 0.272
1.693GluAsn: 1.693 ± 0.35
1.987GluPro: 1.987 ± 0.322
2.576GluGln: 2.576 ± 0.455
3.974GluArg: 3.974 ± 0.581
2.944GluSer: 2.944 ± 0.512
3.606GluThr: 3.606 ± 0.557
3.606GluVal: 3.606 ± 0.437
1.398GluTrp: 1.398 ± 0.343
1.766GluTyr: 1.766 ± 0.36
0.0GluXaa: 0.0 ± 0.0
Phe
3.165PheAla: 3.165 ± 0.565
0.221PheCys: 0.221 ± 0.125
1.546PheAsp: 1.546 ± 0.343
1.619PheGlu: 1.619 ± 0.301
0.957PhePhe: 0.957 ± 0.281
3.165PheGly: 3.165 ± 0.533
0.368PheHis: 0.368 ± 0.129
1.619PheIle: 1.619 ± 0.341
2.061PheLys: 2.061 ± 0.379
2.061PheLeu: 2.061 ± 0.404
0.736PheMet: 0.736 ± 0.184
1.325PheAsn: 1.325 ± 0.327
1.03PhePro: 1.03 ± 0.247
0.736PheGln: 0.736 ± 0.28
2.061PheArg: 2.061 ± 0.299
1.251PheSer: 1.251 ± 0.293
1.914PheThr: 1.914 ± 0.418
1.84PheVal: 1.84 ± 0.329
0.662PheTrp: 0.662 ± 0.196
0.368PheTyr: 0.368 ± 0.196
0.0PheXaa: 0.0 ± 0.0
Gly
9.274GlyAla: 9.274 ± 1.019
0.736GlyCys: 0.736 ± 0.169
4.49GlyAsp: 4.49 ± 0.588
5.741GlyGlu: 5.741 ± 0.63
2.87GlyPhe: 2.87 ± 0.532
7.066GlyGly: 7.066 ± 0.674
1.472GlyHis: 1.472 ± 0.335
4.858GlyIle: 4.858 ± 0.493
6.403GlyLys: 6.403 ± 0.874
5.52GlyLeu: 5.52 ± 0.71
2.87GlyMet: 2.87 ± 0.472
2.65GlyAsn: 2.65 ± 0.623
2.134GlyPro: 2.134 ± 0.376
3.901GlyGln: 3.901 ± 0.55
5.005GlyArg: 5.005 ± 0.689
4.858GlySer: 4.858 ± 0.497
5.373GlyThr: 5.373 ± 0.745
6.035GlyVal: 6.035 ± 0.744
2.061GlyTrp: 2.061 ± 0.471
1.987GlyTyr: 1.987 ± 0.347
0.0GlyXaa: 0.0 ± 0.0
His
2.134HisAla: 2.134 ± 0.342
0.368HisCys: 0.368 ± 0.141
1.325HisAsp: 1.325 ± 0.314
1.251HisGlu: 1.251 ± 0.272
0.589HisPhe: 0.589 ± 0.198
1.472HisGly: 1.472 ± 0.314
0.368HisHis: 0.368 ± 0.182
0.515HisIle: 0.515 ± 0.214
0.883HisLys: 0.883 ± 0.303
1.619HisLeu: 1.619 ± 0.4
0.294HisMet: 0.294 ± 0.156
0.368HisAsn: 0.368 ± 0.172
0.81HisPro: 0.81 ± 0.254
0.515HisGln: 0.515 ± 0.19
1.693HisArg: 1.693 ± 0.311
0.957HisSer: 0.957 ± 0.274
1.104HisThr: 1.104 ± 0.375
0.221HisVal: 0.221 ± 0.135
0.589HisTrp: 0.589 ± 0.153
0.589HisTyr: 0.589 ± 0.257
0.0HisXaa: 0.0 ± 0.0
Ile
5.888IleAla: 5.888 ± 0.666
0.294IleCys: 0.294 ± 0.161
3.606IleAsp: 3.606 ± 0.483
2.429IleGlu: 2.429 ± 0.301
1.251IlePhe: 1.251 ± 0.281
4.342IleGly: 4.342 ± 0.584
0.662IleHis: 0.662 ± 0.185
2.208IleIle: 2.208 ± 0.358
1.766IleLys: 1.766 ± 0.328
3.312IleLeu: 3.312 ± 0.457
1.03IleMet: 1.03 ± 0.252
1.693IleAsn: 1.693 ± 0.348
1.619IlePro: 1.619 ± 0.245
1.546IleGln: 1.546 ± 0.397
2.87IleArg: 2.87 ± 0.447
3.606IleSer: 3.606 ± 0.536
3.091IleThr: 3.091 ± 0.462
3.238IleVal: 3.238 ± 0.548
0.294IleTrp: 0.294 ± 0.153
1.03IleTyr: 1.03 ± 0.347
0.0IleXaa: 0.0 ± 0.0
Lys
5.52LysAla: 5.52 ± 0.599
0.368LysCys: 0.368 ± 0.196
2.723LysAsp: 2.723 ± 0.43
2.134LysGlu: 2.134 ± 0.437
1.766LysPhe: 1.766 ± 0.36
4.416LysGly: 4.416 ± 0.837
0.81LysHis: 0.81 ± 0.214
1.914LysIle: 1.914 ± 0.412
1.178LysLys: 1.178 ± 0.284
4.122LysLeu: 4.122 ± 0.424
1.325LysMet: 1.325 ± 0.361
1.546LysAsn: 1.546 ± 0.3
2.87LysPro: 2.87 ± 0.559
2.87LysGln: 2.87 ± 0.577
3.754LysArg: 3.754 ± 0.518
3.386LysSer: 3.386 ± 0.51
3.68LysThr: 3.68 ± 0.487
3.606LysVal: 3.606 ± 0.493
0.662LysTrp: 0.662 ± 0.189
0.957LysTyr: 0.957 ± 0.33
0.0LysXaa: 0.0 ± 0.0
Leu
10.746LeuAla: 10.746 ± 0.974
0.957LeuCys: 0.957 ± 0.262
4.931LeuAsp: 4.931 ± 0.661
5.446LeuGlu: 5.446 ± 0.673
1.914LeuPhe: 1.914 ± 0.335
6.109LeuGly: 6.109 ± 0.584
1.398LeuHis: 1.398 ± 0.389
2.576LeuIle: 2.576 ± 0.349
3.312LeuLys: 3.312 ± 0.463
6.256LeuLeu: 6.256 ± 0.65
1.325LeuMet: 1.325 ± 0.327
2.944LeuAsn: 2.944 ± 0.411
3.901LeuPro: 3.901 ± 0.544
3.312LeuGln: 3.312 ± 0.669
6.256LeuArg: 6.256 ± 0.655
3.533LeuSer: 3.533 ± 0.577
5.226LeuThr: 5.226 ± 0.662
6.698LeuVal: 6.698 ± 0.64
1.178LeuTrp: 1.178 ± 0.263
1.766LeuTyr: 1.766 ± 0.364
0.0LeuXaa: 0.0 ± 0.0
Met
4.563MetAla: 4.563 ± 0.631
0.0MetCys: 0.0 ± 0.0
1.398MetAsp: 1.398 ± 0.301
1.398MetGlu: 1.398 ± 0.347
0.442MetPhe: 0.442 ± 0.194
2.061MetGly: 2.061 ± 0.367
0.294MetHis: 0.294 ± 0.147
1.178MetIle: 1.178 ± 0.321
1.619MetLys: 1.619 ± 0.334
2.282MetLeu: 2.282 ± 0.385
0.515MetMet: 0.515 ± 0.16
1.104MetAsn: 1.104 ± 0.263
0.957MetPro: 0.957 ± 0.237
1.03MetGln: 1.03 ± 0.271
2.502MetArg: 2.502 ± 0.441
1.619MetSer: 1.619 ± 0.296
1.693MetThr: 1.693 ± 0.342
1.987MetVal: 1.987 ± 0.485
0.147MetTrp: 0.147 ± 0.098
0.221MetTyr: 0.221 ± 0.121
0.0MetXaa: 0.0 ± 0.0
Asn
6.182AsnAla: 6.182 ± 0.604
0.442AsnCys: 0.442 ± 0.152
2.502AsnAsp: 2.502 ± 0.379
1.84AsnGlu: 1.84 ± 0.412
1.178AsnPhe: 1.178 ± 0.282
2.797AsnGly: 2.797 ± 0.437
0.515AsnHis: 0.515 ± 0.194
1.693AsnIle: 1.693 ± 0.311
1.03AsnLys: 1.03 ± 0.214
2.65AsnLeu: 2.65 ± 0.421
0.81AsnMet: 0.81 ± 0.252
1.325AsnAsn: 1.325 ± 0.404
2.208AsnPro: 2.208 ± 0.516
1.104AsnGln: 1.104 ± 0.318
2.134AsnArg: 2.134 ± 0.398
1.398AsnSer: 1.398 ± 0.281
1.472AsnThr: 1.472 ± 0.302
2.355AsnVal: 2.355 ± 0.327
0.515AsnTrp: 0.515 ± 0.229
0.589AsnTyr: 0.589 ± 0.201
0.0AsnXaa: 0.0 ± 0.0
Pro
5.741ProAla: 5.741 ± 0.716
0.515ProCys: 0.515 ± 0.169
2.65ProAsp: 2.65 ± 0.493
2.429ProGlu: 2.429 ± 0.395
0.883ProPhe: 0.883 ± 0.218
3.974ProGly: 3.974 ± 0.539
0.81ProHis: 0.81 ± 0.242
1.84ProIle: 1.84 ± 0.315
1.914ProLys: 1.914 ± 0.339
2.944ProLeu: 2.944 ± 0.445
1.178ProMet: 1.178 ± 0.307
0.883ProAsn: 0.883 ± 0.236
2.061ProPro: 2.061 ± 0.466
2.134ProGln: 2.134 ± 0.405
2.134ProArg: 2.134 ± 0.41
2.576ProSer: 2.576 ± 0.43
2.429ProThr: 2.429 ± 0.473
3.091ProVal: 3.091 ± 0.548
0.515ProTrp: 0.515 ± 0.18
0.81ProTyr: 0.81 ± 0.26
0.0ProXaa: 0.0 ± 0.0
Gln
5.888GlnAla: 5.888 ± 0.781
0.589GlnCys: 0.589 ± 0.251
1.84GlnAsp: 1.84 ± 0.389
2.061GlnGlu: 2.061 ± 0.308
1.398GlnPhe: 1.398 ± 0.282
2.429GlnGly: 2.429 ± 0.522
0.81GlnHis: 0.81 ± 0.209
1.987GlnIle: 1.987 ± 0.295
1.251GlnLys: 1.251 ± 0.307
3.312GlnLeu: 3.312 ± 0.425
1.546GlnMet: 1.546 ± 0.331
1.178GlnAsn: 1.178 ± 0.268
2.061GlnPro: 2.061 ± 0.365
2.429GlnGln: 2.429 ± 0.708
3.754GlnArg: 3.754 ± 0.657
1.987GlnSer: 1.987 ± 0.443
2.87GlnThr: 2.87 ± 0.479
3.533GlnVal: 3.533 ± 0.548
0.589GlnTrp: 0.589 ± 0.244
1.472GlnTyr: 1.472 ± 0.247
0.0GlnXaa: 0.0 ± 0.0
Arg
8.979ArgAla: 8.979 ± 0.919
0.589ArgCys: 0.589 ± 0.23
3.901ArgAsp: 3.901 ± 0.577
5.373ArgGlu: 5.373 ± 0.723
1.03ArgPhe: 1.03 ± 0.257
4.269ArgGly: 4.269 ± 0.555
0.957ArgHis: 0.957 ± 0.279
2.87ArgIle: 2.87 ± 0.541
3.901ArgLys: 3.901 ± 0.605
6.256ArgLeu: 6.256 ± 0.716
2.061ArgMet: 2.061 ± 0.344
2.282ArgAsn: 2.282 ± 0.421
2.723ArgPro: 2.723 ± 0.514
3.238ArgGln: 3.238 ± 0.518
4.71ArgArg: 4.71 ± 0.516
3.533ArgSer: 3.533 ± 0.45
3.606ArgThr: 3.606 ± 0.515
3.606ArgVal: 3.606 ± 0.526
0.81ArgTrp: 0.81 ± 0.254
2.282ArgTyr: 2.282 ± 0.372
0.0ArgXaa: 0.0 ± 0.0
Ser
7.139SerAla: 7.139 ± 0.898
0.515SerCys: 0.515 ± 0.218
3.238SerAsp: 3.238 ± 0.478
2.282SerGlu: 2.282 ± 0.443
2.061SerPhe: 2.061 ± 0.316
4.416SerGly: 4.416 ± 0.625
0.662SerHis: 0.662 ± 0.245
2.723SerIle: 2.723 ± 0.348
3.459SerLys: 3.459 ± 0.526
3.68SerLeu: 3.68 ± 0.518
1.325SerMet: 1.325 ± 0.342
1.546SerAsn: 1.546 ± 0.389
2.134SerPro: 2.134 ± 0.419
1.693SerGln: 1.693 ± 0.268
3.974SerArg: 3.974 ± 0.487
2.576SerSer: 2.576 ± 0.441
3.238SerThr: 3.238 ± 0.662
3.754SerVal: 3.754 ± 0.448
0.662SerTrp: 0.662 ± 0.259
1.03SerTyr: 1.03 ± 0.251
0.0SerXaa: 0.0 ± 0.0
Thr
6.992ThrAla: 6.992 ± 0.778
0.589ThrCys: 0.589 ± 0.183
3.238ThrAsp: 3.238 ± 0.396
3.386ThrGlu: 3.386 ± 0.471
1.766ThrPhe: 1.766 ± 0.371
6.182ThrGly: 6.182 ± 0.653
1.03ThrHis: 1.03 ± 0.327
3.533ThrIle: 3.533 ± 0.559
2.208ThrLys: 2.208 ± 0.474
5.446ThrLeu: 5.446 ± 0.769
1.546ThrMet: 1.546 ± 0.369
1.693ThrAsn: 1.693 ± 0.314
2.944ThrPro: 2.944 ± 0.574
1.693ThrGln: 1.693 ± 0.3
1.987ThrArg: 1.987 ± 0.442
3.018ThrSer: 3.018 ± 0.489
4.342ThrThr: 4.342 ± 0.881
4.269ThrVal: 4.269 ± 0.662
0.957ThrTrp: 0.957 ± 0.25
1.472ThrTyr: 1.472 ± 0.378
0.0ThrXaa: 0.0 ± 0.0
Val
7.728ValAla: 7.728 ± 0.781
0.883ValCys: 0.883 ± 0.326
3.68ValAsp: 3.68 ± 0.455
4.416ValGlu: 4.416 ± 0.658
1.84ValPhe: 1.84 ± 0.38
5.667ValGly: 5.667 ± 0.679
1.104ValHis: 1.104 ± 0.24
3.386ValIle: 3.386 ± 0.469
3.018ValLys: 3.018 ± 0.47
5.446ValLeu: 5.446 ± 0.653
1.693ValMet: 1.693 ± 0.375
3.018ValAsn: 3.018 ± 0.468
3.386ValPro: 3.386 ± 0.543
3.606ValGln: 3.606 ± 0.669
5.078ValArg: 5.078 ± 0.552
3.459ValSer: 3.459 ± 0.414
4.048ValThr: 4.048 ± 0.586
5.667ValVal: 5.667 ± 0.698
0.736ValTrp: 0.736 ± 0.273
1.766ValTyr: 1.766 ± 0.329
0.0ValXaa: 0.0 ± 0.0
Trp
1.766TrpAla: 1.766 ± 0.372
0.147TrpCys: 0.147 ± 0.094
0.883TrpAsp: 0.883 ± 0.319
1.178TrpGlu: 1.178 ± 0.264
0.442TrpPhe: 0.442 ± 0.272
1.325TrpGly: 1.325 ± 0.349
1.03TrpHis: 1.03 ± 0.247
0.221TrpIle: 0.221 ± 0.115
0.883TrpLys: 0.883 ± 0.288
1.619TrpLeu: 1.619 ± 0.342
0.294TrpMet: 0.294 ± 0.154
0.442TrpAsn: 0.442 ± 0.17
0.589TrpPro: 0.589 ± 0.23
0.662TrpGln: 0.662 ± 0.363
0.81TrpArg: 0.81 ± 0.254
1.325TrpSer: 1.325 ± 0.218
1.104TrpThr: 1.104 ± 0.362
0.736TrpVal: 0.736 ± 0.207
0.368TrpTrp: 0.368 ± 0.17
0.442TrpTyr: 0.442 ± 0.163
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.282TyrAla: 2.282 ± 0.447
0.736TyrCys: 0.736 ± 0.22
2.208TyrAsp: 2.208 ± 0.349
1.178TyrGlu: 1.178 ± 0.262
0.515TyrPhe: 0.515 ± 0.16
1.84TyrGly: 1.84 ± 0.448
0.515TyrHis: 0.515 ± 0.219
1.03TyrIle: 1.03 ± 0.342
0.883TyrLys: 0.883 ± 0.256
2.429TyrLeu: 2.429 ± 0.474
0.662TyrMet: 0.662 ± 0.199
1.178TyrAsn: 1.178 ± 0.252
1.03TyrPro: 1.03 ± 0.29
0.81TyrGln: 0.81 ± 0.237
1.914TyrArg: 1.914 ± 0.375
1.03TyrSer: 1.03 ± 0.252
1.178TyrThr: 1.178 ± 0.299
1.546TyrVal: 1.546 ± 0.296
0.515TyrTrp: 0.515 ± 0.175
0.81TyrTyr: 0.81 ± 0.266
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (13588 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski