Amino acid dipepetide frequency for Pseudomonas phage PPPL-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.275AlaAla: 10.275 ± 1.283
0.706AlaCys: 0.706 ± 0.184
5.726AlaAsp: 5.726 ± 0.702
5.334AlaGlu: 5.334 ± 0.664
3.138AlaPhe: 3.138 ± 0.52
8.236AlaGly: 8.236 ± 0.752
2.039AlaHis: 2.039 ± 0.416
5.961AlaIle: 5.961 ± 0.855
5.726AlaLys: 5.726 ± 0.502
7.216AlaLeu: 7.216 ± 0.878
2.353AlaMet: 2.353 ± 0.474
4.706AlaAsn: 4.706 ± 0.636
3.687AlaPro: 3.687 ± 0.494
4.706AlaGln: 4.706 ± 0.704
5.648AlaArg: 5.648 ± 0.792
6.353AlaSer: 6.353 ± 0.809
7.059AlaThr: 7.059 ± 0.783
5.412AlaVal: 5.412 ± 0.76
1.255AlaTrp: 1.255 ± 0.374
3.138AlaTyr: 3.138 ± 0.645
0.0AlaXaa: 0.0 ± 0.0
Cys
1.333CysAla: 1.333 ± 0.363
0.0CysCys: 0.0 ± 0.0
0.706CysAsp: 0.706 ± 0.266
0.471CysGlu: 0.471 ± 0.217
0.549CysPhe: 0.549 ± 0.184
0.392CysGly: 0.392 ± 0.167
0.392CysHis: 0.392 ± 0.182
0.314CysIle: 0.314 ± 0.157
0.706CysLys: 0.706 ± 0.243
0.471CysLeu: 0.471 ± 0.21
0.235CysMet: 0.235 ± 0.127
0.314CysAsn: 0.314 ± 0.159
0.235CysPro: 0.235 ± 0.131
0.235CysGln: 0.235 ± 0.128
0.706CysArg: 0.706 ± 0.243
0.628CysSer: 0.628 ± 0.259
0.157CysThr: 0.157 ± 0.111
0.549CysVal: 0.549 ± 0.198
0.0CysTrp: 0.0 ± 0.0
0.078CysTyr: 0.078 ± 0.089
0.0CysXaa: 0.0 ± 0.0
Asp
5.804AspAla: 5.804 ± 0.723
0.392AspCys: 0.392 ± 0.179
5.412AspAsp: 5.412 ± 0.692
5.02AspGlu: 5.02 ± 0.629
2.902AspPhe: 2.902 ± 0.464
6.589AspGly: 6.589 ± 0.696
1.255AspHis: 1.255 ± 0.36
3.608AspIle: 3.608 ± 0.512
3.687AspLys: 3.687 ± 0.458
5.177AspLeu: 5.177 ± 0.652
1.883AspMet: 1.883 ± 0.364
2.196AspAsn: 2.196 ± 0.458
2.745AspPro: 2.745 ± 0.538
2.275AspGln: 2.275 ± 0.387
3.138AspArg: 3.138 ± 0.493
2.51AspSer: 2.51 ± 0.478
2.667AspThr: 2.667 ± 0.439
4.706AspVal: 4.706 ± 0.574
1.098AspTrp: 1.098 ± 0.351
2.353AspTyr: 2.353 ± 0.458
0.0AspXaa: 0.0 ± 0.0
Glu
7.608GluAla: 7.608 ± 0.939
1.098GluCys: 1.098 ± 0.33
4.549GluAsp: 4.549 ± 0.578
3.922GluGlu: 3.922 ± 0.737
2.745GluPhe: 2.745 ± 0.506
4.863GluGly: 4.863 ± 0.773
1.883GluHis: 1.883 ± 0.402
3.059GluIle: 3.059 ± 0.494
2.588GluLys: 2.588 ± 0.437
4.157GluLeu: 4.157 ± 0.587
2.275GluMet: 2.275 ± 0.324
2.902GluAsn: 2.902 ± 0.436
2.118GluPro: 2.118 ± 0.408
3.608GluGln: 3.608 ± 0.608
4.863GluArg: 4.863 ± 0.634
3.451GluSer: 3.451 ± 0.527
4.0GluThr: 4.0 ± 0.662
4.471GluVal: 4.471 ± 0.726
1.098GluTrp: 1.098 ± 0.366
2.118GluTyr: 2.118 ± 0.413
0.0GluXaa: 0.0 ± 0.0
Phe
2.981PheAla: 2.981 ± 0.557
0.471PheCys: 0.471 ± 0.181
2.275PheAsp: 2.275 ± 0.556
2.118PheGlu: 2.118 ± 0.521
1.177PhePhe: 1.177 ± 0.377
2.902PheGly: 2.902 ± 0.576
1.02PheHis: 1.02 ± 0.258
0.863PheIle: 0.863 ± 0.374
2.196PheLys: 2.196 ± 0.489
3.059PheLeu: 3.059 ± 0.553
1.255PheMet: 1.255 ± 0.245
1.961PheAsn: 1.961 ± 0.427
1.961PhePro: 1.961 ± 0.457
1.02PheGln: 1.02 ± 0.293
2.039PheArg: 2.039 ± 0.354
2.353PheSer: 2.353 ± 0.515
2.902PheThr: 2.902 ± 0.474
2.588PheVal: 2.588 ± 0.42
0.392PheTrp: 0.392 ± 0.155
0.784PheTyr: 0.784 ± 0.218
0.0PheXaa: 0.0 ± 0.0
Gly
6.981GlyAla: 6.981 ± 0.956
0.549GlyCys: 0.549 ± 0.217
4.785GlyAsp: 4.785 ± 0.565
5.098GlyGlu: 5.098 ± 0.671
2.824GlyPhe: 2.824 ± 0.409
6.04GlyGly: 6.04 ± 0.654
1.647GlyHis: 1.647 ± 0.503
4.942GlyIle: 4.942 ± 0.652
4.785GlyLys: 4.785 ± 0.81
6.353GlyLeu: 6.353 ± 0.957
1.961GlyMet: 1.961 ± 0.397
3.216GlyAsn: 3.216 ± 0.721
2.275GlyPro: 2.275 ± 0.418
3.53GlyGln: 3.53 ± 0.549
4.314GlyArg: 4.314 ± 0.567
6.118GlySer: 6.118 ± 0.668
4.785GlyThr: 4.785 ± 0.512
4.079GlyVal: 4.079 ± 0.56
1.255GlyTrp: 1.255 ± 0.392
3.059GlyTyr: 3.059 ± 0.538
0.0GlyXaa: 0.0 ± 0.0
His
1.49HisAla: 1.49 ± 0.347
0.314HisCys: 0.314 ± 0.175
0.941HisAsp: 0.941 ± 0.342
1.569HisGlu: 1.569 ± 0.407
0.706HisPhe: 0.706 ± 0.233
1.961HisGly: 1.961 ± 0.363
0.549HisHis: 0.549 ± 0.272
1.02HisIle: 1.02 ± 0.223
0.863HisLys: 0.863 ± 0.293
2.118HisLeu: 2.118 ± 0.489
0.863HisMet: 0.863 ± 0.22
0.941HisAsn: 0.941 ± 0.241
0.784HisPro: 0.784 ± 0.351
0.628HisGln: 0.628 ± 0.216
1.098HisArg: 1.098 ± 0.28
1.177HisSer: 1.177 ± 0.318
0.784HisThr: 0.784 ± 0.232
1.49HisVal: 1.49 ± 0.424
0.471HisTrp: 0.471 ± 0.237
0.941HisTyr: 0.941 ± 0.272
0.0HisXaa: 0.0 ± 0.0
Ile
4.706IleAla: 4.706 ± 0.395
0.549IleCys: 0.549 ± 0.268
2.745IleAsp: 2.745 ± 0.517
4.314IleGlu: 4.314 ± 0.646
1.177IlePhe: 1.177 ± 0.268
3.138IleGly: 3.138 ± 0.568
1.02IleHis: 1.02 ± 0.365
1.961IleIle: 1.961 ± 0.431
3.294IleLys: 3.294 ± 0.388
4.393IleLeu: 4.393 ± 0.614
0.706IleMet: 0.706 ± 0.231
2.353IleAsn: 2.353 ± 0.523
1.804IlePro: 1.804 ± 0.412
2.039IleGln: 2.039 ± 0.377
3.608IleArg: 3.608 ± 0.462
2.432IleSer: 2.432 ± 0.362
3.294IleThr: 3.294 ± 0.438
3.53IleVal: 3.53 ± 0.485
0.549IleTrp: 0.549 ± 0.159
1.255IleTyr: 1.255 ± 0.257
0.0IleXaa: 0.0 ± 0.0
Lys
7.452LysAla: 7.452 ± 0.864
0.314LysCys: 0.314 ± 0.145
4.157LysAsp: 4.157 ± 0.723
4.236LysGlu: 4.236 ± 0.699
2.118LysPhe: 2.118 ± 0.327
4.079LysGly: 4.079 ± 0.706
1.02LysHis: 1.02 ± 0.423
2.667LysIle: 2.667 ± 0.534
3.373LysLys: 3.373 ± 0.644
5.177LysLeu: 5.177 ± 0.62
1.098LysMet: 1.098 ± 0.303
1.647LysAsn: 1.647 ± 0.36
2.588LysPro: 2.588 ± 0.556
2.51LysGln: 2.51 ± 0.61
3.059LysArg: 3.059 ± 0.511
2.667LysSer: 2.667 ± 0.433
3.294LysThr: 3.294 ± 0.51
4.863LysVal: 4.863 ± 0.568
0.706LysTrp: 0.706 ± 0.256
1.883LysTyr: 1.883 ± 0.409
0.0LysXaa: 0.0 ± 0.0
Leu
8.55LeuAla: 8.55 ± 0.96
0.471LeuCys: 0.471 ± 0.174
4.785LeuAsp: 4.785 ± 0.501
4.942LeuGlu: 4.942 ± 0.684
2.275LeuPhe: 2.275 ± 0.53
5.255LeuGly: 5.255 ± 0.609
1.883LeuHis: 1.883 ± 0.345
4.628LeuIle: 4.628 ± 0.565
6.51LeuLys: 6.51 ± 0.737
5.02LeuLeu: 5.02 ± 0.636
1.883LeuMet: 1.883 ± 0.271
4.079LeuAsn: 4.079 ± 0.567
2.275LeuPro: 2.275 ± 0.369
4.706LeuGln: 4.706 ± 0.629
5.098LeuArg: 5.098 ± 0.697
4.314LeuSer: 4.314 ± 0.795
4.785LeuThr: 4.785 ± 0.603
5.098LeuVal: 5.098 ± 0.715
1.098LeuTrp: 1.098 ± 0.332
2.275LeuTyr: 2.275 ± 0.433
0.0LeuXaa: 0.0 ± 0.0
Met
2.981MetAla: 2.981 ± 0.435
0.235MetCys: 0.235 ± 0.144
2.432MetAsp: 2.432 ± 0.421
1.647MetGlu: 1.647 ± 0.353
0.706MetPhe: 0.706 ± 0.228
1.883MetGly: 1.883 ± 0.329
0.549MetHis: 0.549 ± 0.212
1.412MetIle: 1.412 ± 0.312
1.098MetLys: 1.098 ± 0.315
2.588MetLeu: 2.588 ± 0.398
0.549MetMet: 0.549 ± 0.19
1.098MetAsn: 1.098 ± 0.347
1.49MetPro: 1.49 ± 0.304
1.255MetGln: 1.255 ± 0.306
1.02MetArg: 1.02 ± 0.254
2.196MetSer: 2.196 ± 0.381
1.647MetThr: 1.647 ± 0.358
1.177MetVal: 1.177 ± 0.239
0.235MetTrp: 0.235 ± 0.118
0.392MetTyr: 0.392 ± 0.214
0.0MetXaa: 0.0 ± 0.0
Asn
4.393AsnAla: 4.393 ± 0.723
0.392AsnCys: 0.392 ± 0.153
2.667AsnAsp: 2.667 ± 0.424
3.059AsnGlu: 3.059 ± 0.53
1.726AsnPhe: 1.726 ± 0.482
4.393AsnGly: 4.393 ± 0.758
0.628AsnHis: 0.628 ± 0.235
1.726AsnIle: 1.726 ± 0.336
1.098AsnLys: 1.098 ± 0.272
3.216AsnLeu: 3.216 ± 0.423
0.863AsnMet: 0.863 ± 0.228
1.49AsnAsn: 1.49 ± 0.298
2.588AsnPro: 2.588 ± 0.331
1.726AsnGln: 1.726 ± 0.362
2.275AsnArg: 2.275 ± 0.53
3.216AsnSer: 3.216 ± 0.676
1.883AsnThr: 1.883 ± 0.32
2.588AsnVal: 2.588 ± 0.624
0.863AsnTrp: 0.863 ± 0.287
1.49AsnTyr: 1.49 ± 0.337
0.0AsnXaa: 0.0 ± 0.0
Pro
2.981ProAla: 2.981 ± 0.503
0.471ProCys: 0.471 ± 0.186
3.922ProAsp: 3.922 ± 0.694
3.373ProGlu: 3.373 ± 0.467
1.569ProPhe: 1.569 ± 0.254
2.275ProGly: 2.275 ± 0.413
0.941ProHis: 0.941 ± 0.263
1.49ProIle: 1.49 ± 0.319
2.51ProLys: 2.51 ± 0.386
2.588ProLeu: 2.588 ± 0.448
0.706ProMet: 0.706 ± 0.249
2.196ProAsn: 2.196 ± 0.445
0.784ProPro: 0.784 ± 0.207
1.412ProGln: 1.412 ± 0.34
2.196ProArg: 2.196 ± 0.552
2.196ProSer: 2.196 ± 0.394
2.118ProThr: 2.118 ± 0.448
3.138ProVal: 3.138 ± 0.473
0.549ProTrp: 0.549 ± 0.223
1.333ProTyr: 1.333 ± 0.37
0.0ProXaa: 0.0 ± 0.0
Gln
4.785GlnAla: 4.785 ± 0.632
0.314GlnCys: 0.314 ± 0.185
2.51GlnAsp: 2.51 ± 0.481
3.843GlnGlu: 3.843 ± 0.707
2.196GlnPhe: 2.196 ± 0.311
3.765GlnGly: 3.765 ± 0.625
0.549GlnHis: 0.549 ± 0.17
1.804GlnIle: 1.804 ± 0.336
1.883GlnLys: 1.883 ± 0.414
3.843GlnLeu: 3.843 ± 0.546
1.569GlnMet: 1.569 ± 0.372
1.569GlnAsn: 1.569 ± 0.399
1.177GlnPro: 1.177 ± 0.218
2.667GlnGln: 2.667 ± 0.797
3.138GlnArg: 3.138 ± 0.674
2.196GlnSer: 2.196 ± 0.381
1.49GlnThr: 1.49 ± 0.351
2.745GlnVal: 2.745 ± 0.415
0.706GlnTrp: 0.706 ± 0.245
1.49GlnTyr: 1.49 ± 0.414
0.0GlnXaa: 0.0 ± 0.0
Arg
4.628ArgAla: 4.628 ± 0.573
0.392ArgCys: 0.392 ± 0.227
3.765ArgAsp: 3.765 ± 0.45
4.628ArgGlu: 4.628 ± 0.772
2.275ArgPhe: 2.275 ± 0.457
5.02ArgGly: 5.02 ± 0.564
1.098ArgHis: 1.098 ± 0.332
2.745ArgIle: 2.745 ± 0.429
3.922ArgLys: 3.922 ± 0.691
5.255ArgLeu: 5.255 ± 0.467
1.804ArgMet: 1.804 ± 0.434
2.432ArgAsn: 2.432 ± 0.411
1.726ArgPro: 1.726 ± 0.404
3.059ArgGln: 3.059 ± 0.403
3.138ArgArg: 3.138 ± 0.538
3.843ArgSer: 3.843 ± 0.504
3.059ArgThr: 3.059 ± 0.509
2.981ArgVal: 2.981 ± 0.443
1.098ArgTrp: 1.098 ± 0.392
1.804ArgTyr: 1.804 ± 0.302
0.0ArgXaa: 0.0 ± 0.0
Ser
6.667SerAla: 6.667 ± 0.904
0.471SerCys: 0.471 ± 0.201
5.098SerAsp: 5.098 ± 0.576
3.059SerGlu: 3.059 ± 0.429
2.51SerPhe: 2.51 ± 0.478
4.942SerGly: 4.942 ± 0.693
1.098SerHis: 1.098 ± 0.256
2.745SerIle: 2.745 ± 0.44
3.53SerLys: 3.53 ± 0.535
4.0SerLeu: 4.0 ± 0.659
1.49SerMet: 1.49 ± 0.312
2.588SerAsn: 2.588 ± 0.626
2.196SerPro: 2.196 ± 0.38
2.745SerGln: 2.745 ± 0.475
3.138SerArg: 3.138 ± 0.514
3.53SerSer: 3.53 ± 0.631
2.588SerThr: 2.588 ± 0.509
3.216SerVal: 3.216 ± 0.483
0.628SerTrp: 0.628 ± 0.228
1.961SerTyr: 1.961 ± 0.514
0.0SerXaa: 0.0 ± 0.0
Thr
4.079ThrAla: 4.079 ± 0.606
0.392ThrCys: 0.392 ± 0.221
3.922ThrAsp: 3.922 ± 0.657
3.53ThrGlu: 3.53 ± 0.548
2.51ThrPhe: 2.51 ± 0.467
4.471ThrGly: 4.471 ± 0.551
1.098ThrHis: 1.098 ± 0.231
3.53ThrIle: 3.53 ± 0.551
4.157ThrLys: 4.157 ± 0.508
5.569ThrLeu: 5.569 ± 0.743
1.255ThrMet: 1.255 ± 0.292
1.804ThrAsn: 1.804 ± 0.346
3.216ThrPro: 3.216 ± 0.402
1.883ThrGln: 1.883 ± 0.34
2.432ThrArg: 2.432 ± 0.338
3.059ThrSer: 3.059 ± 0.439
3.059ThrThr: 3.059 ± 0.642
4.549ThrVal: 4.549 ± 0.563
0.863ThrTrp: 0.863 ± 0.168
1.726ThrTyr: 1.726 ± 0.378
0.0ThrXaa: 0.0 ± 0.0
Val
6.275ValAla: 6.275 ± 0.863
0.549ValCys: 0.549 ± 0.216
2.824ValAsp: 2.824 ± 0.454
3.922ValGlu: 3.922 ± 0.692
1.726ValPhe: 1.726 ± 0.425
4.942ValGly: 4.942 ± 0.583
1.098ValHis: 1.098 ± 0.302
3.059ValIle: 3.059 ± 0.56
4.0ValLys: 4.0 ± 0.606
4.628ValLeu: 4.628 ± 0.578
2.51ValMet: 2.51 ± 0.407
2.824ValAsn: 2.824 ± 0.401
3.451ValPro: 3.451 ± 0.707
2.745ValGln: 2.745 ± 0.416
4.706ValArg: 4.706 ± 0.567
3.53ValSer: 3.53 ± 0.471
4.471ValThr: 4.471 ± 0.544
4.628ValVal: 4.628 ± 0.814
1.02ValTrp: 1.02 ± 0.249
2.039ValTyr: 2.039 ± 0.501
0.0ValXaa: 0.0 ± 0.0
Trp
1.569TrpAla: 1.569 ± 0.274
0.078TrpCys: 0.078 ± 0.081
0.706TrpAsp: 0.706 ± 0.23
0.628TrpGlu: 0.628 ± 0.229
0.628TrpPhe: 0.628 ± 0.168
0.706TrpGly: 0.706 ± 0.196
0.549TrpHis: 0.549 ± 0.223
0.549TrpIle: 0.549 ± 0.194
1.02TrpLys: 1.02 ± 0.319
1.883TrpLeu: 1.883 ± 0.411
0.392TrpMet: 0.392 ± 0.147
0.706TrpAsn: 0.706 ± 0.233
0.314TrpPro: 0.314 ± 0.171
0.471TrpGln: 0.471 ± 0.204
0.941TrpArg: 0.941 ± 0.218
0.863TrpSer: 0.863 ± 0.317
0.784TrpThr: 0.784 ± 0.238
1.333TrpVal: 1.333 ± 0.403
0.157TrpTrp: 0.157 ± 0.111
0.392TrpTyr: 0.392 ± 0.193
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.138TyrAla: 3.138 ± 0.448
0.314TyrCys: 0.314 ± 0.156
1.569TyrAsp: 1.569 ± 0.317
2.667TyrGlu: 2.667 ± 0.573
0.863TyrPhe: 0.863 ± 0.293
2.745TyrGly: 2.745 ± 0.35
0.392TyrHis: 0.392 ± 0.199
0.784TyrIle: 0.784 ± 0.303
1.961TyrLys: 1.961 ± 0.378
3.216TyrLeu: 3.216 ± 0.497
0.863TyrMet: 0.863 ± 0.242
1.255TyrAsn: 1.255 ± 0.245
1.333TyrPro: 1.333 ± 0.359
1.02TyrGln: 1.02 ± 0.233
2.118TyrArg: 2.118 ± 0.465
1.569TyrSer: 1.569 ± 0.32
2.275TyrThr: 2.275 ± 0.427
1.804TyrVal: 1.804 ± 0.373
0.628TyrTrp: 0.628 ± 0.24
0.784TyrTyr: 0.784 ± 0.262
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (12750 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski