Amino acid dipepetide frequency for Pseudomonas phage AF

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.271AlaAla: 15.271 ± 2.129
0.727AlaCys: 0.727 ± 0.288
6.908AlaAsp: 6.908 ± 1.022
7.926AlaGlu: 7.926 ± 0.964
4.145AlaPhe: 4.145 ± 0.49
7.999AlaGly: 7.999 ± 1.1
1.6AlaHis: 1.6 ± 0.42
5.454AlaIle: 5.454 ± 0.559
5.745AlaLys: 5.745 ± 0.722
9.308AlaLeu: 9.308 ± 0.906
3.563AlaMet: 3.563 ± 0.573
2.909AlaAsn: 2.909 ± 0.683
2.763AlaPro: 2.763 ± 0.468
4.945AlaGln: 4.945 ± 0.79
7.635AlaArg: 7.635 ± 0.815
7.272AlaSer: 7.272 ± 0.74
6.472AlaThr: 6.472 ± 0.866
6.545AlaVal: 6.545 ± 0.728
2.4AlaTrp: 2.4 ± 0.648
3.709AlaTyr: 3.709 ± 0.471
0.0AlaXaa: 0.0 ± 0.0
Cys
0.873CysAla: 0.873 ± 0.298
0.218CysCys: 0.218 ± 0.122
0.582CysAsp: 0.582 ± 0.242
0.654CysGlu: 0.654 ± 0.257
0.364CysPhe: 0.364 ± 0.149
1.163CysGly: 1.163 ± 0.393
0.654CysHis: 0.654 ± 0.248
0.364CysIle: 0.364 ± 0.166
0.8CysLys: 0.8 ± 0.285
0.8CysLeu: 0.8 ± 0.322
0.291CysMet: 0.291 ± 0.174
0.218CysAsn: 0.218 ± 0.13
0.291CysPro: 0.291 ± 0.177
0.364CysGln: 0.364 ± 0.197
0.654CysArg: 0.654 ± 0.218
0.727CysSer: 0.727 ± 0.259
0.582CysThr: 0.582 ± 0.216
0.8CysVal: 0.8 ± 0.234
0.145CysTrp: 0.145 ± 0.118
0.364CysTyr: 0.364 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
8.072AspAla: 8.072 ± 1.026
0.145AspCys: 0.145 ± 0.112
3.636AspAsp: 3.636 ± 0.519
3.2AspGlu: 3.2 ± 0.48
1.745AspPhe: 1.745 ± 0.343
5.526AspGly: 5.526 ± 0.619
0.654AspHis: 0.654 ± 0.261
2.981AspIle: 2.981 ± 0.368
2.909AspLys: 2.909 ± 0.361
5.526AspLeu: 5.526 ± 0.789
1.963AspMet: 1.963 ± 0.376
2.182AspAsn: 2.182 ± 0.484
3.054AspPro: 3.054 ± 0.549
2.981AspGln: 2.981 ± 0.708
2.909AspArg: 2.909 ± 0.458
3.272AspSer: 3.272 ± 0.611
2.691AspThr: 2.691 ± 0.428
4.072AspVal: 4.072 ± 0.473
1.454AspTrp: 1.454 ± 0.386
1.672AspTyr: 1.672 ± 0.339
0.0AspXaa: 0.0 ± 0.0
Glu
7.49GluAla: 7.49 ± 0.96
0.582GluCys: 0.582 ± 0.231
2.327GluAsp: 2.327 ± 0.382
3.781GluGlu: 3.781 ± 0.539
1.818GluPhe: 1.818 ± 0.404
4.654GluGly: 4.654 ± 0.499
1.236GluHis: 1.236 ± 0.409
4.363GluIle: 4.363 ± 0.591
2.909GluLys: 2.909 ± 0.491
4.654GluLeu: 4.654 ± 0.587
1.745GluMet: 1.745 ± 0.344
1.963GluAsn: 1.963 ± 0.39
2.691GluPro: 2.691 ± 0.475
4.799GluGln: 4.799 ± 1.187
5.89GluArg: 5.89 ± 0.733
2.472GluSer: 2.472 ± 0.394
2.618GluThr: 2.618 ± 0.385
4.363GluVal: 4.363 ± 0.423
1.454GluTrp: 1.454 ± 0.4
1.163GluTyr: 1.163 ± 0.263
0.0GluXaa: 0.0 ± 0.0
Phe
3.636PheAla: 3.636 ± 0.528
0.364PheCys: 0.364 ± 0.156
2.254PheAsp: 2.254 ± 0.423
1.891PheGlu: 1.891 ± 0.384
1.091PhePhe: 1.091 ± 0.307
1.963PheGly: 1.963 ± 0.353
0.582PheHis: 0.582 ± 0.245
1.382PheIle: 1.382 ± 0.279
1.745PheLys: 1.745 ± 0.342
1.672PheLeu: 1.672 ± 0.436
0.654PheMet: 0.654 ± 0.203
1.672PheAsn: 1.672 ± 0.311
1.6PhePro: 1.6 ± 0.368
1.163PheGln: 1.163 ± 0.291
2.109PheArg: 2.109 ± 0.282
2.472PheSer: 2.472 ± 0.345
2.545PheThr: 2.545 ± 0.764
2.4PheVal: 2.4 ± 0.503
0.364PheTrp: 0.364 ± 0.156
1.309PheTyr: 1.309 ± 0.387
0.0PheXaa: 0.0 ± 0.0
Gly
7.344GlyAla: 7.344 ± 0.828
0.509GlyCys: 0.509 ± 0.191
4.581GlyAsp: 4.581 ± 0.458
5.09GlyGlu: 5.09 ± 0.639
3.636GlyPhe: 3.636 ± 0.509
5.163GlyGly: 5.163 ± 0.735
1.527GlyHis: 1.527 ± 0.314
3.49GlyIle: 3.49 ± 0.439
4.508GlyLys: 4.508 ± 0.508
5.381GlyLeu: 5.381 ± 0.64
2.327GlyMet: 2.327 ± 0.344
2.836GlyAsn: 2.836 ± 0.428
2.836GlyPro: 2.836 ± 0.385
4.072GlyGln: 4.072 ± 0.593
3.854GlyArg: 3.854 ± 0.562
4.145GlySer: 4.145 ± 0.727
3.854GlyThr: 3.854 ± 0.757
5.526GlyVal: 5.526 ± 0.685
1.163GlyTrp: 1.163 ± 0.262
3.2GlyTyr: 3.2 ± 0.435
0.0GlyXaa: 0.0 ± 0.0
His
1.163HisAla: 1.163 ± 0.325
0.291HisCys: 0.291 ± 0.143
1.672HisAsp: 1.672 ± 0.337
1.163HisGlu: 1.163 ± 0.297
0.364HisPhe: 0.364 ± 0.153
1.745HisGly: 1.745 ± 0.344
0.073HisHis: 0.073 ± 0.082
0.654HisIle: 0.654 ± 0.199
0.8HisLys: 0.8 ± 0.282
1.163HisLeu: 1.163 ± 0.231
0.291HisMet: 0.291 ± 0.145
0.582HisAsn: 0.582 ± 0.192
1.018HisPro: 1.018 ± 0.244
0.509HisGln: 0.509 ± 0.155
0.945HisArg: 0.945 ± 0.225
0.945HisSer: 0.945 ± 0.284
0.873HisThr: 0.873 ± 0.262
0.873HisVal: 0.873 ± 0.23
0.218HisTrp: 0.218 ± 0.113
0.654HisTyr: 0.654 ± 0.203
0.0HisXaa: 0.0 ± 0.0
Ile
4.727IleAla: 4.727 ± 0.565
0.509IleCys: 0.509 ± 0.253
4.29IleAsp: 4.29 ± 0.604
3.2IleGlu: 3.2 ± 0.508
1.309IlePhe: 1.309 ± 0.324
5.09IleGly: 5.09 ± 0.573
1.018IleHis: 1.018 ± 0.283
2.4IleIle: 2.4 ± 0.461
2.909IleLys: 2.909 ± 0.514
3.2IleLeu: 3.2 ± 0.523
1.018IleMet: 1.018 ± 0.374
1.672IleAsn: 1.672 ± 0.505
2.545IlePro: 2.545 ± 0.35
2.836IleGln: 2.836 ± 0.52
3.418IleArg: 3.418 ± 0.428
2.763IleSer: 2.763 ± 0.448
3.127IleThr: 3.127 ± 0.624
2.472IleVal: 2.472 ± 0.348
0.654IleTrp: 0.654 ± 0.227
1.527IleTyr: 1.527 ± 0.308
0.0IleXaa: 0.0 ± 0.0
Lys
5.745LysAla: 5.745 ± 0.715
0.945LysCys: 0.945 ± 0.273
3.927LysAsp: 3.927 ± 0.512
2.4LysGlu: 2.4 ± 0.434
1.891LysPhe: 1.891 ± 0.459
2.909LysGly: 2.909 ± 0.54
0.945LysHis: 0.945 ± 0.257
2.763LysIle: 2.763 ± 0.453
2.763LysLys: 2.763 ± 0.599
3.781LysLeu: 3.781 ± 0.472
1.527LysMet: 1.527 ± 0.335
1.818LysAsn: 1.818 ± 0.341
2.836LysPro: 2.836 ± 0.565
2.254LysGln: 2.254 ± 0.506
3.2LysArg: 3.2 ± 0.5
2.545LysSer: 2.545 ± 0.365
2.472LysThr: 2.472 ± 0.375
2.763LysVal: 2.763 ± 0.564
0.727LysTrp: 0.727 ± 0.23
0.873LysTyr: 0.873 ± 0.257
0.0LysXaa: 0.0 ± 0.0
Leu
9.526LeuAla: 9.526 ± 0.92
1.163LeuCys: 1.163 ± 0.345
4.29LeuAsp: 4.29 ± 0.774
4.508LeuGlu: 4.508 ± 0.43
1.963LeuPhe: 1.963 ± 0.33
6.254LeuGly: 6.254 ± 0.897
1.018LeuHis: 1.018 ± 0.288
4.581LeuIle: 4.581 ± 0.531
2.909LeuLys: 2.909 ± 0.402
6.617LeuLeu: 6.617 ± 0.751
2.545LeuMet: 2.545 ± 0.457
4.145LeuAsn: 4.145 ± 0.622
4.145LeuPro: 4.145 ± 0.679
2.909LeuGln: 2.909 ± 0.391
4.363LeuArg: 4.363 ± 0.539
4.872LeuSer: 4.872 ± 0.616
4.945LeuThr: 4.945 ± 0.48
5.017LeuVal: 5.017 ± 0.62
0.945LeuTrp: 0.945 ± 0.278
1.891LeuTyr: 1.891 ± 0.359
0.0LeuXaa: 0.0 ± 0.0
Met
3.272MetAla: 3.272 ± 0.587
0.291MetCys: 0.291 ± 0.167
1.018MetAsp: 1.018 ± 0.311
1.163MetGlu: 1.163 ± 0.322
0.727MetPhe: 0.727 ± 0.205
1.163MetGly: 1.163 ± 0.277
0.727MetHis: 0.727 ± 0.287
1.818MetIle: 1.818 ± 0.462
1.091MetLys: 1.091 ± 0.288
2.4MetLeu: 2.4 ± 0.397
0.218MetMet: 0.218 ± 0.123
1.527MetAsn: 1.527 ± 0.369
1.6MetPro: 1.6 ± 0.276
1.454MetGln: 1.454 ± 0.384
2.036MetArg: 2.036 ± 0.403
1.963MetSer: 1.963 ± 0.381
2.618MetThr: 2.618 ± 0.433
1.672MetVal: 1.672 ± 0.338
0.436MetTrp: 0.436 ± 0.16
0.654MetTyr: 0.654 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
4.436AsnAla: 4.436 ± 0.726
0.436AsnCys: 0.436 ± 0.178
2.182AsnAsp: 2.182 ± 0.336
2.254AsnGlu: 2.254 ± 0.356
1.454AsnPhe: 1.454 ± 0.389
2.618AsnGly: 2.618 ± 0.558
0.873AsnHis: 0.873 ± 0.236
1.6AsnIle: 1.6 ± 0.282
1.745AsnLys: 1.745 ± 0.302
2.691AsnLeu: 2.691 ± 0.531
0.582AsnMet: 0.582 ± 0.246
2.036AsnAsn: 2.036 ± 0.588
2.472AsnPro: 2.472 ± 0.337
2.4AsnGln: 2.4 ± 0.409
2.109AsnArg: 2.109 ± 0.366
1.891AsnSer: 1.891 ± 0.377
2.036AsnThr: 2.036 ± 0.371
2.618AsnVal: 2.618 ± 0.484
0.509AsnTrp: 0.509 ± 0.183
1.091AsnTyr: 1.091 ± 0.393
0.0AsnXaa: 0.0 ± 0.0
Pro
4.145ProAla: 4.145 ± 0.513
0.582ProCys: 0.582 ± 0.247
2.691ProAsp: 2.691 ± 0.528
4.799ProGlu: 4.799 ± 0.709
1.454ProPhe: 1.454 ± 0.295
2.763ProGly: 2.763 ± 0.421
0.654ProHis: 0.654 ± 0.305
1.672ProIle: 1.672 ± 0.297
2.327ProLys: 2.327 ± 0.336
3.636ProLeu: 3.636 ± 0.503
1.309ProMet: 1.309 ± 0.374
1.672ProAsn: 1.672 ± 0.391
2.691ProPro: 2.691 ± 0.535
1.745ProGln: 1.745 ± 0.354
2.327ProArg: 2.327 ± 0.398
2.836ProSer: 2.836 ± 0.523
2.691ProThr: 2.691 ± 0.464
2.981ProVal: 2.981 ± 0.485
0.945ProTrp: 0.945 ± 0.234
1.236ProTyr: 1.236 ± 0.448
0.0ProXaa: 0.0 ± 0.0
Gln
6.472GlnAla: 6.472 ± 0.858
0.436GlnCys: 0.436 ± 0.231
2.254GlnAsp: 2.254 ± 0.339
2.909GlnGlu: 2.909 ± 0.525
2.036GlnPhe: 2.036 ± 0.414
2.691GlnGly: 2.691 ± 0.493
0.509GlnHis: 0.509 ± 0.207
2.836GlnIle: 2.836 ± 0.504
2.327GlnLys: 2.327 ± 0.529
4.508GlnLeu: 4.508 ± 0.822
1.672GlnMet: 1.672 ± 0.301
2.182GlnAsn: 2.182 ± 0.448
2.036GlnPro: 2.036 ± 0.414
4.145GlnGln: 4.145 ± 0.703
3.127GlnArg: 3.127 ± 0.645
3.054GlnSer: 3.054 ± 0.372
2.182GlnThr: 2.182 ± 0.412
3.854GlnVal: 3.854 ± 0.513
0.509GlnTrp: 0.509 ± 0.19
1.454GlnTyr: 1.454 ± 0.374
0.0GlnXaa: 0.0 ± 0.0
Arg
6.108ArgAla: 6.108 ± 0.815
1.018ArgCys: 1.018 ± 0.286
4.436ArgAsp: 4.436 ± 0.88
4.363ArgGlu: 4.363 ± 0.86
2.618ArgPhe: 2.618 ± 0.395
4.218ArgGly: 4.218 ± 0.496
1.091ArgHis: 1.091 ± 0.226
2.691ArgIle: 2.691 ± 0.435
3.345ArgLys: 3.345 ± 0.578
5.017ArgLeu: 5.017 ± 0.569
1.454ArgMet: 1.454 ± 0.218
1.745ArgAsn: 1.745 ± 0.369
1.891ArgPro: 1.891 ± 0.547
3.709ArgGln: 3.709 ± 0.495
4.218ArgArg: 4.218 ± 0.868
3.418ArgSer: 3.418 ± 0.581
2.836ArgThr: 2.836 ± 0.524
3.927ArgVal: 3.927 ± 0.447
1.091ArgTrp: 1.091 ± 0.246
2.036ArgTyr: 2.036 ± 0.447
0.0ArgXaa: 0.0 ± 0.0
Ser
6.326SerAla: 6.326 ± 0.732
0.509SerCys: 0.509 ± 0.214
3.854SerAsp: 3.854 ± 0.537
3.709SerGlu: 3.709 ± 0.658
1.818SerPhe: 1.818 ± 0.271
5.817SerGly: 5.817 ± 0.718
0.654SerHis: 0.654 ± 0.181
2.981SerIle: 2.981 ± 0.53
2.981SerLys: 2.981 ± 0.492
5.599SerLeu: 5.599 ± 0.485
1.963SerMet: 1.963 ± 0.349
2.763SerAsn: 2.763 ± 0.442
2.618SerPro: 2.618 ± 0.348
2.472SerGln: 2.472 ± 0.586
2.182SerArg: 2.182 ± 0.342
4.654SerSer: 4.654 ± 1.733
4.436SerThr: 4.436 ± 0.735
2.618SerVal: 2.618 ± 0.42
0.727SerTrp: 0.727 ± 0.253
1.382SerTyr: 1.382 ± 0.364
0.0SerXaa: 0.0 ± 0.0
Thr
6.399ThrAla: 6.399 ± 0.772
0.654ThrCys: 0.654 ± 0.279
3.2ThrAsp: 3.2 ± 0.5
2.909ThrGlu: 2.909 ± 0.478
1.527ThrPhe: 1.527 ± 0.309
5.89ThrGly: 5.89 ± 0.795
0.582ThrHis: 0.582 ± 0.202
3.054ThrIle: 3.054 ± 0.601
2.4ThrLys: 2.4 ± 0.341
4.436ThrLeu: 4.436 ± 0.608
1.527ThrMet: 1.527 ± 0.36
2.4ThrAsn: 2.4 ± 0.375
3.418ThrPro: 3.418 ± 0.584
2.182ThrGln: 2.182 ± 0.436
3.272ThrArg: 3.272 ± 0.548
3.781ThrSer: 3.781 ± 0.625
4.363ThrThr: 4.363 ± 0.987
4.436ThrVal: 4.436 ± 0.924
0.582ThrTrp: 0.582 ± 0.215
1.891ThrTyr: 1.891 ± 0.503
0.0ThrXaa: 0.0 ± 0.0
Val
7.272ValAla: 7.272 ± 0.613
1.018ValCys: 1.018 ± 0.306
4.29ValAsp: 4.29 ± 0.698
3.636ValGlu: 3.636 ± 0.571
1.818ValPhe: 1.818 ± 0.354
4.072ValGly: 4.072 ± 0.56
1.091ValHis: 1.091 ± 0.328
3.636ValIle: 3.636 ± 0.482
2.618ValLys: 2.618 ± 0.559
3.999ValLeu: 3.999 ± 0.639
2.254ValMet: 2.254 ± 0.292
2.036ValAsn: 2.036 ± 0.413
2.836ValPro: 2.836 ± 0.459
4.072ValGln: 4.072 ± 0.493
3.781ValArg: 3.781 ± 0.502
3.345ValSer: 3.345 ± 0.399
4.799ValThr: 4.799 ± 0.898
3.345ValVal: 3.345 ± 0.499
1.163ValTrp: 1.163 ± 0.361
1.382ValTyr: 1.382 ± 0.357
0.0ValXaa: 0.0 ± 0.0
Trp
1.818TrpAla: 1.818 ± 0.628
0.291TrpCys: 0.291 ± 0.133
0.873TrpAsp: 0.873 ± 0.24
1.091TrpGlu: 1.091 ± 0.324
0.291TrpPhe: 0.291 ± 0.173
0.727TrpGly: 0.727 ± 0.24
0.218TrpHis: 0.218 ± 0.131
1.018TrpIle: 1.018 ± 0.227
1.091TrpLys: 1.091 ± 0.323
1.672TrpLeu: 1.672 ± 0.38
0.436TrpMet: 0.436 ± 0.161
0.654TrpAsn: 0.654 ± 0.22
0.727TrpPro: 0.727 ± 0.196
0.873TrpGln: 0.873 ± 0.282
1.382TrpArg: 1.382 ± 0.283
1.018TrpSer: 1.018 ± 0.225
1.163TrpThr: 1.163 ± 0.299
0.945TrpVal: 0.945 ± 0.238
0.218TrpTrp: 0.218 ± 0.104
0.218TrpTyr: 0.218 ± 0.111
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.2TyrAla: 3.2 ± 0.401
0.218TyrCys: 0.218 ± 0.135
1.382TyrAsp: 1.382 ± 0.358
2.327TyrGlu: 2.327 ± 0.413
0.8TyrPhe: 0.8 ± 0.222
2.618TyrGly: 2.618 ± 0.548
0.291TyrHis: 0.291 ± 0.131
0.945TyrIle: 0.945 ± 0.273
1.163TyrLys: 1.163 ± 0.222
2.545TyrLeu: 2.545 ± 0.425
0.436TyrMet: 0.436 ± 0.151
1.018TyrAsn: 1.018 ± 0.272
1.091TyrPro: 1.091 ± 0.21
1.309TyrGln: 1.309 ± 0.364
1.891TyrArg: 1.891 ± 0.294
2.618TyrSer: 2.618 ± 0.493
1.6TyrThr: 1.6 ± 0.305
1.236TyrVal: 1.236 ± 0.359
1.018TyrTrp: 1.018 ± 0.308
0.582TyrTyr: 0.582 ± 0.152
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (13753 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski