Amino acid dipeptide frequency for Pseudomonas phage PaP2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be expected at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table and underrepresented ones in blue, for better visibility.
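As a minimal sketch of how such frequencies can be obtained, the Python snippet below counts overlapping residue pairs within each protein and reports them in per mille alongside the uniform 1/400 baseline. The function name `dipeptide_permille`, the one-letter input alphabet, and the toy sequences are assumptions made for illustration; the exact normalisation used by Proteome-pI is not stated on this page and may differ.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 residue pairs.

    Hypothetical helper: pairs are counted with a sliding window of two
    residues inside each protein, so no pair spans two proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input, not PaP2 data.
toy = ["MAAL", "MALK"]
freqs = dipeptide_permille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform = 1000.0 / 400

for pair, value in sorted(freqs.items(), key=lambda kv: -kv[1])[:4]:
    print(f"{pair}: {value:.1f} per mille (uniform baseline {uniform} per mille)")
```

With a real proteome, the input would be the list of protein sequences (58 for this phage) rather than the two toy strings used here.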

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
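The unit conversion and the comparison against the random expectation can be sketched as follows. The four values are copied from the table on this page; the helper names and the classification rule (a plain comparison with the uniform 2.5‰ baseline, ignoring the error estimates) are assumptions, since the exact colouring rule is not spelled out here.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, baseline=UNIFORM_PERMILLE):
    """Hypothetical rule: compare a per mille value with the uniform baseline."""
    if value > baseline:
        return "over"
    if value < baseline:
        return "under"
    return "expected"


# A few entries taken from the table (per mille).
sample = {"AlaAla": 8.46, "LeuLeu": 7.004, "CysCys": 0.139, "TrpTrp": 0.277}

for name, value in sample.items():
    fraction = permille_to_fraction(value)
    print(f"{name}: {value} per mille = {fraction:.6f} = {fraction * 100:.4f}% -> {classify(value)}")
```

Under this assumed rule AlaAla and LeuLeu come out as overrepresented, while CysCys and TrpTrp come out as underrepresented.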

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.46 ± 1.028
AlaCys: 0.832 ± 0.328
AlaAsp: 3.675 ± 0.497
AlaGlu: 5.964 ± 0.609
AlaPhe: 3.259 ± 0.412
AlaGly: 5.34 ± 0.804
AlaHis: 1.387 ± 0.332
AlaIle: 5.34 ± 0.611
AlaLys: 5.548 ± 0.597
AlaLeu: 8.877 ± 0.859
AlaMet: 2.288 ± 0.45
AlaAsn: 3.883 ± 0.554
AlaPro: 2.566 ± 0.449
AlaGln: 3.953 ± 0.87
AlaArg: 3.953 ± 0.652
AlaSer: 5.617 ± 0.677
AlaThr: 4.924 ± 0.444
AlaVal: 4.716 ± 0.572
AlaTrp: 1.11 ± 0.267
AlaTyr: 2.843 ± 0.494
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.624 ± 0.224
CysCys: 0.139 ± 0.099
CysAsp: 0.416 ± 0.172
CysGlu: 0.416 ± 0.179
CysPhe: 0.763 ± 0.267
CysGly: 0.624 ± 0.242
CysHis: 0.069 ± 0.072
CysIle: 0.416 ± 0.22
CysLys: 0.763 ± 0.202
CysLeu: 1.11 ± 0.412
CysMet: 0.208 ± 0.119
CysAsn: 0.485 ± 0.17
CysPro: 0.139 ± 0.103
CysGln: 0.693 ± 0.235
CysArg: 0.902 ± 0.268
CysSer: 0.693 ± 0.267
CysThr: 0.277 ± 0.134
CysVal: 0.208 ± 0.129
CysTrp: 0.069 ± 0.068
CysTyr: 0.416 ± 0.151
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.19 ± 0.578
AspCys: 0.763 ± 0.23
AspAsp: 2.635 ± 0.47
AspGlu: 2.011 ± 0.414
AspPhe: 2.982 ± 0.578
AspGly: 2.497 ± 0.402
AspHis: 0.832 ± 0.225
AspIle: 3.051 ± 0.43
AspLys: 2.843 ± 0.376
AspLeu: 5.548 ± 0.683
AspMet: 2.08 ± 0.394
AspAsn: 3.051 ± 0.407
AspPro: 3.745 ± 0.657
AspGln: 2.705 ± 0.711
AspArg: 2.705 ± 0.429
AspSer: 3.745 ± 0.554
AspThr: 3.745 ± 0.63
AspVal: 2.427 ± 0.464
AspTrp: 1.803 ± 0.386
AspTyr: 2.427 ± 0.42
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.449 ± 0.71
GluCys: 0.347 ± 0.161
GluAsp: 4.785 ± 0.578
GluGlu: 5.34 ± 0.734
GluPhe: 2.982 ± 0.491
GluGly: 3.675 ± 0.54
GluHis: 0.763 ± 0.218
GluIle: 3.814 ± 0.526
GluLys: 3.259 ± 0.478
GluLeu: 6.172 ± 0.783
GluMet: 1.318 ± 0.338
GluAsn: 2.774 ± 0.409
GluPro: 1.664 ± 0.381
GluGln: 3.19 ± 0.482
GluArg: 2.288 ± 0.39
GluSer: 3.537 ± 0.407
GluThr: 3.467 ± 0.538
GluVal: 4.508 ± 0.39
GluTrp: 1.595 ± 0.292
GluTyr: 2.288 ± 0.368
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.774 ± 0.409
PheCys: 0.693 ± 0.242
PheAsp: 2.358 ± 0.398
PheGlu: 2.497 ± 0.462
PhePhe: 1.872 ± 0.491
PheGly: 2.982 ± 0.455
PheHis: 0.902 ± 0.364
PheIle: 2.427 ± 0.512
PheLys: 1.942 ± 0.354
PheLeu: 4.161 ± 0.464
PheMet: 0.971 ± 0.281
PheAsn: 2.358 ± 0.453
PhePro: 2.219 ± 0.705
PheGln: 2.15 ± 0.45
PheArg: 2.288 ± 0.355
PheSer: 2.843 ± 0.48
PheThr: 2.08 ± 0.326
PheVal: 1.734 ± 0.367
PheTrp: 0.416 ± 0.179
PheTyr: 1.803 ± 0.339
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.092 ± 0.472
GlyCys: 0.624 ± 0.19
GlyAsp: 3.537 ± 0.551
GlyGlu: 3.745 ± 0.522
GlyPhe: 2.427 ± 0.673
GlyGly: 3.745 ± 0.578
GlyHis: 0.763 ± 0.245
GlyIle: 4.438 ± 0.528
GlyLys: 4.508 ± 0.763
GlyLeu: 6.033 ± 0.632
GlyMet: 2.08 ± 0.373
GlyAsn: 3.259 ± 0.575
GlyPro: 1.526 ± 0.413
GlyGln: 3.259 ± 0.689
GlyArg: 2.913 ± 0.483
GlySer: 3.953 ± 0.523
GlyThr: 3.814 ± 0.482
GlyVal: 3.121 ± 0.46
GlyTrp: 0.971 ± 0.286
GlyTyr: 3.675 ± 0.539
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.04 ± 0.368
HisCys: 0.069 ± 0.061
HisAsp: 0.485 ± 0.15
HisGlu: 0.763 ± 0.197
HisPhe: 0.624 ± 0.248
HisGly: 1.11 ± 0.403
HisHis: 0.069 ± 0.068
HisIle: 0.624 ± 0.152
HisLys: 0.971 ± 0.277
HisLeu: 1.526 ± 0.328
HisMet: 0.416 ± 0.184
HisAsn: 0.902 ± 0.231
HisPro: 0.485 ± 0.156
HisGln: 0.347 ± 0.145
HisArg: 0.693 ± 0.25
HisSer: 1.387 ± 0.369
HisThr: 0.624 ± 0.146
HisVal: 0.763 ± 0.218
HisTrp: 0.139 ± 0.1
HisTyr: 0.555 ± 0.17
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.3 ± 0.634
IleCys: 0.693 ± 0.221
IleAsp: 3.398 ± 0.677
IleGlu: 3.121 ± 0.447
IlePhe: 2.011 ± 0.394
IleGly: 4.924 ± 0.767
IleHis: 1.179 ± 0.268
IleIle: 3.259 ± 0.498
IleLys: 3.953 ± 0.591
IleLeu: 5.964 ± 0.856
IleMet: 1.456 ± 0.3
IleAsn: 3.259 ± 0.463
IlePro: 4.022 ± 0.75
IleGln: 3.19 ± 0.411
IleArg: 3.259 ± 0.725
IleSer: 5.201 ± 0.798
IleThr: 3.259 ± 0.303
IleVal: 2.774 ± 0.453
IleTrp: 1.04 ± 0.344
IleTyr: 1.734 ± 0.424
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.183 ± 0.837
LysCys: 0.416 ± 0.15
LysAsp: 2.705 ± 0.479
LysGlu: 4.577 ± 0.792
LysPhe: 2.288 ± 0.428
LysGly: 3.259 ± 0.423
LysHis: 0.277 ± 0.139
LysIle: 3.675 ± 0.628
LysLys: 3.329 ± 0.601
LysLeu: 4.854 ± 0.592
LysMet: 1.664 ± 0.378
LysAsn: 1.803 ± 0.431
LysPro: 3.329 ± 0.541
LysGln: 1.803 ± 0.325
LysArg: 2.843 ± 0.4
LysSer: 3.883 ± 0.615
LysThr: 3.537 ± 0.369
LysVal: 3.814 ± 0.583
LysTrp: 0.832 ± 0.232
LysTyr: 2.427 ± 0.412
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.53 ± 1.076
LeuCys: 0.832 ± 0.278
LeuAsp: 5.756 ± 0.769
LeuGlu: 7.351 ± 0.728
LeuPhe: 3.121 ± 0.534
LeuGly: 5.687 ± 0.63
LeuHis: 1.387 ± 0.298
LeuIle: 6.241 ± 0.718
LeuLys: 6.311 ± 0.786
LeuLeu: 7.004 ± 0.889
LeuMet: 2.427 ± 0.457
LeuAsn: 4.993 ± 0.45
LeuPro: 4.3 ± 0.621
LeuGln: 3.745 ± 0.749
LeuArg: 4.577 ± 0.607
LeuSer: 6.311 ± 0.77
LeuThr: 5.687 ± 0.635
LeuVal: 5.548 ± 0.557
LeuTrp: 0.971 ± 0.273
LeuTyr: 2.566 ± 0.369
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.606 ± 0.585
MetCys: 0.277 ± 0.16
MetAsp: 2.011 ± 0.314
MetGlu: 1.872 ± 0.308
MetPhe: 0.624 ± 0.166
MetGly: 2.566 ± 0.376
MetHis: 0.069 ± 0.061
MetIle: 1.664 ± 0.355
MetLys: 1.04 ± 0.296
MetLeu: 2.497 ± 0.435
MetMet: 0.763 ± 0.29
MetAsn: 1.872 ± 0.364
MetPro: 0.763 ± 0.26
MetGln: 0.902 ± 0.255
MetArg: 1.595 ± 0.343
MetSer: 1.526 ± 0.314
MetThr: 2.219 ± 0.432
MetVal: 1.179 ± 0.246
MetTrp: 0.347 ± 0.129
MetTyr: 0.902 ± 0.222
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.467 ± 0.456
AsnCys: 0.139 ± 0.099
AsnAsp: 1.526 ± 0.345
AsnGlu: 2.566 ± 0.528
AsnPhe: 2.219 ± 0.301
AsnGly: 3.19 ± 0.487
AsnHis: 0.624 ± 0.249
AsnIle: 2.705 ± 0.523
AsnLys: 3.259 ± 0.365
AsnLeu: 5.479 ± 0.705
AsnMet: 1.803 ± 0.343
AsnAsn: 2.913 ± 0.584
AsnPro: 3.675 ± 0.473
AsnGln: 2.497 ± 0.394
AsnArg: 2.566 ± 0.418
AsnSer: 3.606 ± 0.543
AsnThr: 2.08 ± 0.561
AsnVal: 2.635 ± 0.532
AsnTrp: 0.832 ± 0.253
AsnTyr: 2.219 ± 0.361
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.675 ± 0.517
ProCys: 0.277 ± 0.153
ProAsp: 3.537 ± 0.767
ProGlu: 3.675 ± 0.577
ProPhe: 1.734 ± 0.299
ProGly: 2.843 ± 0.439
ProHis: 0.277 ± 0.132
ProIle: 3.398 ± 0.426
ProLys: 2.774 ± 0.565
ProLeu: 3.329 ± 0.494
ProMet: 0.763 ± 0.211
ProAsn: 2.843 ± 0.431
ProPro: 1.664 ± 0.728
ProGln: 2.011 ± 0.533
ProArg: 1.803 ± 0.341
ProSer: 3.467 ± 0.401
ProThr: 2.705 ± 0.583
ProVal: 3.537 ± 0.419
ProTrp: 0.416 ± 0.206
ProTyr: 1.387 ± 0.381
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.27 ± 0.816
GlnCys: 0.208 ± 0.116
GlnAsp: 2.497 ± 0.445
GlnGlu: 2.566 ± 0.452
GlnPhe: 1.595 ± 0.368
GlnGly: 3.606 ± 0.505
GlnHis: 0.693 ± 0.2
GlnIle: 2.705 ± 0.404
GlnLys: 2.15 ± 0.4
GlnLeu: 4.3 ± 0.667
GlnMet: 1.456 ± 0.271
GlnAsn: 2.358 ± 0.488
GlnPro: 1.734 ± 0.298
GlnGln: 3.19 ± 1.035
GlnArg: 2.566 ± 0.553
GlnSer: 3.121 ± 0.644
GlnThr: 2.982 ± 0.46
GlnVal: 3.329 ± 0.546
GlnTrp: 0.555 ± 0.236
GlnTyr: 0.971 ± 0.198
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.022 ± 0.429
ArgCys: 0.485 ± 0.164
ArgAsp: 2.705 ± 0.389
ArgGlu: 3.19 ± 0.408
ArgPhe: 1.872 ± 0.37
ArgGly: 2.358 ± 0.302
ArgHis: 0.693 ± 0.189
ArgIle: 3.606 ± 0.549
ArgLys: 2.635 ± 0.432
ArgLeu: 4.993 ± 0.624
ArgMet: 1.456 ± 0.277
ArgAsn: 2.497 ± 0.342
ArgPro: 2.15 ± 0.291
ArgGln: 2.913 ± 0.467
ArgArg: 2.427 ± 0.472
ArgSer: 2.913 ± 0.588
ArgThr: 2.566 ± 0.415
ArgVal: 3.259 ± 0.379
ArgTrp: 0.832 ± 0.235
ArgTyr: 1.387 ± 0.367
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.062 ± 0.505
SerCys: 0.832 ± 0.289
SerAsp: 3.051 ± 0.507
SerGlu: 4.022 ± 0.574
SerPhe: 4.716 ± 0.488
SerGly: 3.675 ± 0.787
SerHis: 0.832 ± 0.175
SerIle: 4.092 ± 0.587
SerLys: 4.23 ± 0.555
SerLeu: 7.351 ± 0.685
SerMet: 1.872 ± 0.385
SerAsn: 2.497 ± 0.43
SerPro: 3.051 ± 0.368
SerGln: 3.329 ± 0.783
SerArg: 3.883 ± 0.721
SerSer: 5.27 ± 1.094
SerThr: 3.953 ± 0.747
SerVal: 3.745 ± 0.472
SerTrp: 0.693 ± 0.239
SerTyr: 2.219 ± 0.466
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.369 ± 0.637
ThrCys: 0.416 ± 0.169
ThrAsp: 2.913 ± 0.4
ThrGlu: 3.745 ± 0.477
ThrPhe: 2.635 ± 0.372
ThrGly: 4.438 ± 0.503
ThrHis: 1.04 ± 0.224
ThrIle: 3.051 ± 0.615
ThrLys: 3.259 ± 0.47
ThrLeu: 4.3 ± 0.455
ThrMet: 1.942 ± 0.327
ThrAsn: 2.982 ± 0.425
ThrPro: 3.19 ± 0.425
ThrGln: 3.19 ± 0.548
ThrArg: 2.913 ± 0.464
ThrSer: 3.883 ± 0.825
ThrThr: 4.369 ± 1.125
ThrVal: 2.843 ± 0.53
ThrTrp: 0.624 ± 0.18
ThrTyr: 2.288 ± 0.397
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.883 ± 0.5
ValCys: 0.763 ± 0.267
ValAsp: 3.745 ± 0.62
ValGlu: 3.745 ± 0.454
ValPhe: 1.595 ± 0.366
ValGly: 3.19 ± 0.514
ValHis: 0.971 ± 0.216
ValIle: 4.022 ± 0.595
ValLys: 3.259 ± 0.411
ValLeu: 4.716 ± 0.807
ValMet: 1.664 ± 0.278
ValAsn: 2.705 ± 0.494
ValPro: 3.398 ± 0.428
ValGln: 2.08 ± 0.506
ValArg: 2.843 ± 0.398
ValSer: 3.883 ± 0.527
ValThr: 3.467 ± 0.488
ValVal: 3.398 ± 0.565
ValTrp: 0.347 ± 0.135
ValTyr: 2.497 ± 0.369
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.763 ± 0.245
TrpCys: 0.069 ± 0.069
TrpAsp: 1.04 ± 0.273
TrpGlu: 1.179 ± 0.249
TrpPhe: 0.416 ± 0.195
TrpGly: 0.485 ± 0.235
TrpHis: 0.139 ± 0.098
TrpIle: 1.11 ± 0.287
TrpLys: 0.971 ± 0.309
TrpLeu: 1.318 ± 0.311
TrpMet: 0.624 ± 0.177
TrpAsn: 0.902 ± 0.285
TrpPro: 0.624 ± 0.239
TrpGln: 1.11 ± 0.244
TrpArg: 0.624 ± 0.186
TrpSer: 0.971 ± 0.266
TrpThr: 0.832 ± 0.238
TrpVal: 0.624 ± 0.16
TrpTrp: 0.277 ± 0.127
TrpTyr: 0.416 ± 0.164
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.774 ± 0.446
TyrCys: 0.555 ± 0.149
TyrAsp: 2.011 ± 0.337
TyrGlu: 1.872 ± 0.37
TyrPhe: 1.734 ± 0.439
TyrGly: 2.288 ± 0.356
TyrHis: 0.624 ± 0.232
TyrIle: 2.358 ± 0.539
TyrLys: 2.497 ± 0.395
TyrLeu: 3.814 ± 0.373
TyrMet: 0.971 ± 0.301
TyrAsn: 1.664 ± 0.315
TyrPro: 1.872 ± 0.443
TyrGln: 1.664 ± 0.332
TyrArg: 1.318 ± 0.269
TyrSer: 2.566 ± 0.415
TyrThr: 1.803 ± 0.362
TyrVal: 2.011 ± 0.362
TyrTrp: 0.624 ± 0.159
TyrTyr: 0.971 ± 0.317
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (14421 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
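Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates serves as the error. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the reported error are all assumptions; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Hypothetical helper: each replicate draws len(proteins) proteins with
    replacement, so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[dipeptide] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input in one-letter code, not PaP2 data.
toy = ["MAAL", "MALK", "AAAA", "GKLM"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
print(f"AA: {mean_aa:.1f} ± {err_aa:.1f} per mille")

# A dipeptide that never occurs yields 0.0 ± 0.0, as in the Xaa rows of the table.
print(bootstrap_error(toy, "XX"))
```

Resampling whole proteins rather than individual residue pairs keeps pairs from the same protein together, so the estimate reflects variation between proteins.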

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski