Amino acid dipepetide frequency for Pseudomonas phage AUS531phi

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.248AlaAla: 16.248 ± 2.0
0.867AlaCys: 0.867 ± 0.292
6.788AlaAsp: 6.788 ± 0.704
9.388AlaGlu: 9.388 ± 1.003
3.466AlaPhe: 3.466 ± 0.447
9.388AlaGly: 9.388 ± 0.897
2.527AlaHis: 2.527 ± 0.447
5.633AlaIle: 5.633 ± 0.601
5.921AlaLys: 5.921 ± 0.655
11.482AlaLeu: 11.482 ± 0.931
3.25AlaMet: 3.25 ± 0.418
4.044AlaAsn: 4.044 ± 0.661
5.488AlaPro: 5.488 ± 0.823
5.994AlaGln: 5.994 ± 0.665
7.871AlaArg: 7.871 ± 0.845
5.56AlaSer: 5.56 ± 0.697
6.932AlaThr: 6.932 ± 0.863
7.51AlaVal: 7.51 ± 0.639
2.383AlaTrp: 2.383 ± 0.43
2.889AlaTyr: 2.889 ± 0.558
0.0AlaXaa: 0.0 ± 0.0
Cys
0.867CysAla: 0.867 ± 0.258
0.144CysCys: 0.144 ± 0.097
0.361CysAsp: 0.361 ± 0.175
0.217CysGlu: 0.217 ± 0.135
0.217CysPhe: 0.217 ± 0.132
1.805CysGly: 1.805 ± 0.435
0.289CysHis: 0.289 ± 0.14
0.361CysIle: 0.361 ± 0.159
0.217CysLys: 0.217 ± 0.117
0.794CysLeu: 0.794 ± 0.28
0.217CysMet: 0.217 ± 0.133
0.361CysAsn: 0.361 ± 0.16
0.65CysPro: 0.65 ± 0.213
0.505CysGln: 0.505 ± 0.171
0.794CysArg: 0.794 ± 0.248
0.217CysSer: 0.217 ± 0.125
0.217CysThr: 0.217 ± 0.136
0.289CysVal: 0.289 ± 0.123
0.217CysTrp: 0.217 ± 0.134
0.289CysTyr: 0.289 ± 0.133
0.0CysXaa: 0.0 ± 0.0
Asp
6.427AspAla: 6.427 ± 0.627
0.867AspCys: 0.867 ± 0.23
4.261AspAsp: 4.261 ± 0.545
4.261AspGlu: 4.261 ± 0.5
1.805AspPhe: 1.805 ± 0.306
5.705AspGly: 5.705 ± 0.663
1.011AspHis: 1.011 ± 0.31
2.816AspIle: 2.816 ± 0.443
1.805AspLys: 1.805 ± 0.354
6.355AspLeu: 6.355 ± 0.622
1.661AspMet: 1.661 ± 0.306
0.794AspAsn: 0.794 ± 0.221
2.744AspPro: 2.744 ± 0.432
3.683AspGln: 3.683 ± 0.496
3.972AspArg: 3.972 ± 0.562
2.383AspSer: 2.383 ± 0.455
2.744AspThr: 2.744 ± 0.411
4.694AspVal: 4.694 ± 0.628
1.228AspTrp: 1.228 ± 0.27
1.372AspTyr: 1.372 ± 0.268
0.0AspXaa: 0.0 ± 0.0
Glu
8.16GluAla: 8.16 ± 0.884
0.722GluCys: 0.722 ± 0.27
2.455GluAsp: 2.455 ± 0.496
3.105GluGlu: 3.105 ± 0.601
1.661GluPhe: 1.661 ± 0.361
3.538GluGly: 3.538 ± 0.512
1.372GluHis: 1.372 ± 0.351
4.838GluIle: 4.838 ± 0.515
3.033GluLys: 3.033 ± 0.572
6.499GluLeu: 6.499 ± 0.689
1.444GluMet: 1.444 ± 0.36
1.733GluAsn: 1.733 ± 0.319
2.889GluPro: 2.889 ± 0.509
4.694GluGln: 4.694 ± 0.756
5.272GluArg: 5.272 ± 0.707
2.672GluSer: 2.672 ± 0.473
3.177GluThr: 3.177 ± 0.504
3.538GluVal: 3.538 ± 0.49
1.083GluTrp: 1.083 ± 0.263
1.3GluTyr: 1.3 ± 0.253
0.0GluXaa: 0.0 ± 0.0
Phe
2.455PheAla: 2.455 ± 0.5
0.217PheCys: 0.217 ± 0.15
2.383PheAsp: 2.383 ± 0.421
2.311PheGlu: 2.311 ± 0.386
0.867PhePhe: 0.867 ± 0.316
2.311PheGly: 2.311 ± 0.464
0.794PheHis: 0.794 ± 0.263
1.3PheIle: 1.3 ± 0.348
0.722PheLys: 0.722 ± 0.247
2.383PheLeu: 2.383 ± 0.488
0.65PheMet: 0.65 ± 0.198
0.939PheAsn: 0.939 ± 0.241
1.011PhePro: 1.011 ± 0.26
1.155PheGln: 1.155 ± 0.293
1.95PheArg: 1.95 ± 0.357
1.95PheSer: 1.95 ± 0.343
1.878PheThr: 1.878 ± 0.358
2.527PheVal: 2.527 ± 0.434
0.144PheTrp: 0.144 ± 0.08
0.867PheTyr: 0.867 ± 0.298
0.0PheXaa: 0.0 ± 0.0
Gly
7.871GlyAla: 7.871 ± 0.638
0.939GlyCys: 0.939 ± 0.301
3.827GlyAsp: 3.827 ± 0.511
6.499GlyGlu: 6.499 ± 0.514
3.322GlyPhe: 3.322 ± 0.595
8.016GlyGly: 8.016 ± 1.127
1.805GlyHis: 1.805 ± 0.332
2.527GlyIle: 2.527 ± 0.338
3.322GlyLys: 3.322 ± 0.557
8.016GlyLeu: 8.016 ± 0.878
2.239GlyMet: 2.239 ± 0.462
2.889GlyAsn: 2.889 ± 0.561
3.033GlyPro: 3.033 ± 0.474
2.889GlyGln: 2.889 ± 0.489
6.21GlyArg: 6.21 ± 0.571
3.827GlySer: 3.827 ± 0.552
3.972GlyThr: 3.972 ± 0.496
5.705GlyVal: 5.705 ± 0.8
1.805GlyTrp: 1.805 ± 0.31
2.166GlyTyr: 2.166 ± 0.414
0.0GlyXaa: 0.0 ± 0.0
His
3.322HisAla: 3.322 ± 0.598
0.217HisCys: 0.217 ± 0.118
1.011HisAsp: 1.011 ± 0.264
1.3HisGlu: 1.3 ± 0.343
0.578HisPhe: 0.578 ± 0.247
1.878HisGly: 1.878 ± 0.368
0.578HisHis: 0.578 ± 0.249
0.939HisIle: 0.939 ± 0.267
0.361HisLys: 0.361 ± 0.18
2.6HisLeu: 2.6 ± 0.571
0.289HisMet: 0.289 ± 0.139
0.65HisAsn: 0.65 ± 0.195
1.155HisPro: 1.155 ± 0.277
1.011HisGln: 1.011 ± 0.304
1.516HisArg: 1.516 ± 0.393
1.155HisSer: 1.155 ± 0.306
1.083HisThr: 1.083 ± 0.32
0.65HisVal: 0.65 ± 0.192
0.65HisTrp: 0.65 ± 0.247
0.144HisTyr: 0.144 ± 0.101
0.0HisXaa: 0.0 ± 0.0
Ile
5.994IleAla: 5.994 ± 0.587
0.289IleCys: 0.289 ± 0.133
4.477IleAsp: 4.477 ± 0.479
3.538IleGlu: 3.538 ± 0.518
0.722IlePhe: 0.722 ± 0.217
3.755IleGly: 3.755 ± 0.589
0.578IleHis: 0.578 ± 0.193
2.383IleIle: 2.383 ± 0.381
2.311IleLys: 2.311 ± 0.263
3.033IleLeu: 3.033 ± 0.52
0.867IleMet: 0.867 ± 0.21
1.228IleAsn: 1.228 ± 0.389
1.516IlePro: 1.516 ± 0.332
1.589IleGln: 1.589 ± 0.298
3.466IleArg: 3.466 ± 0.559
3.322IleSer: 3.322 ± 0.598
3.105IleThr: 3.105 ± 0.607
3.177IleVal: 3.177 ± 0.484
0.578IleTrp: 0.578 ± 0.214
1.372IleTyr: 1.372 ± 0.347
0.0IleXaa: 0.0 ± 0.0
Lys
5.416LysAla: 5.416 ± 0.844
0.217LysCys: 0.217 ± 0.139
1.878LysAsp: 1.878 ± 0.307
2.022LysGlu: 2.022 ± 0.385
1.3LysPhe: 1.3 ± 0.338
3.394LysGly: 3.394 ± 0.537
0.939LysHis: 0.939 ± 0.279
1.589LysIle: 1.589 ± 0.392
1.95LysLys: 1.95 ± 0.377
3.972LysLeu: 3.972 ± 0.523
1.011LysMet: 1.011 ± 0.207
1.083LysAsn: 1.083 ± 0.282
2.166LysPro: 2.166 ± 0.418
1.95LysGln: 1.95 ± 0.332
3.466LysArg: 3.466 ± 0.468
1.805LysSer: 1.805 ± 0.359
2.094LysThr: 2.094 ± 0.351
3.105LysVal: 3.105 ± 0.551
0.433LysTrp: 0.433 ± 0.152
0.867LysTyr: 0.867 ± 0.256
0.0LysXaa: 0.0 ± 0.0
Leu
10.904LeuAla: 10.904 ± 1.093
1.011LeuCys: 1.011 ± 0.249
7.366LeuAsp: 7.366 ± 0.746
6.571LeuGlu: 6.571 ± 0.672
3.105LeuPhe: 3.105 ± 0.405
6.788LeuGly: 6.788 ± 0.777
1.589LeuHis: 1.589 ± 0.375
4.549LeuIle: 4.549 ± 0.655
3.611LeuLys: 3.611 ± 0.642
7.871LeuLeu: 7.871 ± 0.991
1.444LeuMet: 1.444 ± 0.361
3.033LeuAsn: 3.033 ± 0.475
4.188LeuPro: 4.188 ± 0.703
4.766LeuGln: 4.766 ± 0.613
7.149LeuArg: 7.149 ± 0.699
4.477LeuSer: 4.477 ± 0.532
4.91LeuThr: 4.91 ± 0.559
7.293LeuVal: 7.293 ± 0.862
0.794LeuTrp: 0.794 ± 0.248
2.239LeuTyr: 2.239 ± 0.387
0.0LeuXaa: 0.0 ± 0.0
Met
3.033MetAla: 3.033 ± 0.333
0.505MetCys: 0.505 ± 0.218
1.372MetAsp: 1.372 ± 0.322
1.228MetGlu: 1.228 ± 0.316
0.722MetPhe: 0.722 ± 0.199
1.155MetGly: 1.155 ± 0.327
0.217MetHis: 0.217 ± 0.122
0.939MetIle: 0.939 ± 0.273
1.3MetLys: 1.3 ± 0.337
1.805MetLeu: 1.805 ± 0.321
0.289MetMet: 0.289 ± 0.13
0.65MetAsn: 0.65 ± 0.222
1.228MetPro: 1.228 ± 0.27
1.011MetGln: 1.011 ± 0.318
1.878MetArg: 1.878 ± 0.406
1.95MetSer: 1.95 ± 0.377
1.661MetThr: 1.661 ± 0.285
1.228MetVal: 1.228 ± 0.431
0.144MetTrp: 0.144 ± 0.11
0.361MetTyr: 0.361 ± 0.196
0.0MetXaa: 0.0 ± 0.0
Asn
3.611AsnAla: 3.611 ± 0.687
0.361AsnCys: 0.361 ± 0.192
1.516AsnAsp: 1.516 ± 0.278
1.444AsnGlu: 1.444 ± 0.284
0.722AsnPhe: 0.722 ± 0.21
3.177AsnGly: 3.177 ± 0.515
0.65AsnHis: 0.65 ± 0.218
0.867AsnIle: 0.867 ± 0.284
0.794AsnLys: 0.794 ± 0.251
2.961AsnLeu: 2.961 ± 0.472
0.433AsnMet: 0.433 ± 0.166
0.433AsnAsn: 0.433 ± 0.166
2.166AsnPro: 2.166 ± 0.358
1.372AsnGln: 1.372 ± 0.37
2.239AsnArg: 2.239 ± 0.36
2.094AsnSer: 2.094 ± 0.535
1.3AsnThr: 1.3 ± 0.301
2.383AsnVal: 2.383 ± 0.396
0.505AsnTrp: 0.505 ± 0.211
0.722AsnTyr: 0.722 ± 0.243
0.0AsnXaa: 0.0 ± 0.0
Pro
7.005ProAla: 7.005 ± 0.923
0.217ProCys: 0.217 ± 0.13
3.322ProAsp: 3.322 ± 0.547
3.177ProGlu: 3.177 ± 0.536
1.011ProPhe: 1.011 ± 0.295
4.261ProGly: 4.261 ± 0.61
1.155ProHis: 1.155 ± 0.303
1.95ProIle: 1.95 ± 0.369
1.516ProLys: 1.516 ± 0.403
3.538ProLeu: 3.538 ± 0.564
1.083ProMet: 1.083 ± 0.298
1.661ProAsn: 1.661 ± 0.339
1.878ProPro: 1.878 ± 0.371
1.444ProGln: 1.444 ± 0.345
2.889ProArg: 2.889 ± 0.649
2.383ProSer: 2.383 ± 0.456
3.25ProThr: 3.25 ± 0.617
3.972ProVal: 3.972 ± 0.517
0.867ProTrp: 0.867 ± 0.252
1.155ProTyr: 1.155 ± 0.241
0.0ProXaa: 0.0 ± 0.0
Gln
7.582GlnAla: 7.582 ± 0.851
0.144GlnCys: 0.144 ± 0.093
2.239GlnAsp: 2.239 ± 0.369
2.022GlnGlu: 2.022 ± 0.373
1.661GlnPhe: 1.661 ± 0.37
2.889GlnGly: 2.889 ± 0.399
1.372GlnHis: 1.372 ± 0.359
3.033GlnIle: 3.033 ± 0.438
1.878GlnLys: 1.878 ± 0.423
5.199GlnLeu: 5.199 ± 0.59
1.228GlnMet: 1.228 ± 0.294
0.939GlnAsn: 0.939 ± 0.243
2.239GlnPro: 2.239 ± 0.413
4.188GlnGln: 4.188 ± 0.518
4.91GlnArg: 4.91 ± 0.62
2.311GlnSer: 2.311 ± 0.411
1.589GlnThr: 1.589 ± 0.342
2.816GlnVal: 2.816 ± 0.424
0.722GlnTrp: 0.722 ± 0.187
1.444GlnTyr: 1.444 ± 0.295
0.0GlnXaa: 0.0 ± 0.0
Arg
8.016ArgAla: 8.016 ± 1.011
0.433ArgCys: 0.433 ± 0.163
4.477ArgAsp: 4.477 ± 0.557
4.983ArgGlu: 4.983 ± 0.59
2.239ArgPhe: 2.239 ± 0.356
4.549ArgGly: 4.549 ± 0.496
1.733ArgHis: 1.733 ± 0.36
3.25ArgIle: 3.25 ± 0.46
2.961ArgLys: 2.961 ± 0.62
7.582ArgLeu: 7.582 ± 0.673
2.022ArgMet: 2.022 ± 0.383
2.816ArgAsn: 2.816 ± 0.476
3.972ArgPro: 3.972 ± 0.693
4.333ArgGln: 4.333 ± 0.476
5.777ArgArg: 5.777 ± 0.813
3.611ArgSer: 3.611 ± 0.483
2.816ArgThr: 2.816 ± 0.523
3.611ArgVal: 3.611 ± 0.581
0.794ArgTrp: 0.794 ± 0.269
2.022ArgTyr: 2.022 ± 0.406
0.0ArgXaa: 0.0 ± 0.0
Ser
6.282SerAla: 6.282 ± 0.648
0.289SerCys: 0.289 ± 0.144
3.25SerAsp: 3.25 ± 0.437
2.527SerGlu: 2.527 ± 0.532
1.444SerPhe: 1.444 ± 0.353
5.055SerGly: 5.055 ± 0.612
1.228SerHis: 1.228 ± 0.312
2.6SerIle: 2.6 ± 0.413
2.672SerLys: 2.672 ± 0.477
4.766SerLeu: 4.766 ± 0.693
1.372SerMet: 1.372 ± 0.269
1.3SerAsn: 1.3 ± 0.371
3.105SerPro: 3.105 ± 0.564
2.6SerGln: 2.6 ± 0.505
2.961SerArg: 2.961 ± 0.468
3.755SerSer: 3.755 ± 0.531
2.239SerThr: 2.239 ± 0.421
2.961SerVal: 2.961 ± 0.557
0.867SerTrp: 0.867 ± 0.253
0.722SerTyr: 0.722 ± 0.228
0.0SerXaa: 0.0 ± 0.0
Thr
6.932ThrAla: 6.932 ± 0.763
0.722ThrCys: 0.722 ± 0.224
2.6ThrAsp: 2.6 ± 0.449
2.383ThrGlu: 2.383 ± 0.467
1.228ThrPhe: 1.228 ± 0.368
4.766ThrGly: 4.766 ± 0.534
0.722ThrHis: 0.722 ± 0.243
3.466ThrIle: 3.466 ± 0.546
2.166ThrLys: 2.166 ± 0.346
5.199ThrLeu: 5.199 ± 0.562
0.939ThrMet: 0.939 ± 0.253
1.372ThrAsn: 1.372 ± 0.253
2.744ThrPro: 2.744 ± 0.443
1.95ThrGln: 1.95 ± 0.32
2.744ThrArg: 2.744 ± 0.415
2.094ThrSer: 2.094 ± 0.401
3.25ThrThr: 3.25 ± 0.413
3.611ThrVal: 3.611 ± 0.574
0.939ThrTrp: 0.939 ± 0.248
1.228ThrTyr: 1.228 ± 0.315
0.0ThrXaa: 0.0 ± 0.0
Val
9.315ValAla: 9.315 ± 0.832
0.65ValCys: 0.65 ± 0.235
4.549ValAsp: 4.549 ± 0.559
4.044ValGlu: 4.044 ± 0.58
2.022ValPhe: 2.022 ± 0.404
4.766ValGly: 4.766 ± 0.818
1.733ValHis: 1.733 ± 0.352
3.322ValIle: 3.322 ± 0.484
2.6ValLys: 2.6 ± 0.449
5.705ValLeu: 5.705 ± 0.724
1.733ValMet: 1.733 ± 0.304
2.383ValAsn: 2.383 ± 0.435
3.322ValPro: 3.322 ± 0.589
3.25ValGln: 3.25 ± 0.461
3.177ValArg: 3.177 ± 0.534
4.261ValSer: 4.261 ± 0.63
2.744ValThr: 2.744 ± 0.404
4.622ValVal: 4.622 ± 0.726
0.794ValTrp: 0.794 ± 0.273
1.444ValTyr: 1.444 ± 0.387
0.0ValXaa: 0.0 ± 0.0
Trp
1.372TrpAla: 1.372 ± 0.347
0.072TrpCys: 0.072 ± 0.071
0.505TrpAsp: 0.505 ± 0.221
1.011TrpGlu: 1.011 ± 0.233
0.217TrpPhe: 0.217 ± 0.113
1.3TrpGly: 1.3 ± 0.304
0.361TrpHis: 0.361 ± 0.179
0.433TrpIle: 0.433 ± 0.132
1.011TrpLys: 1.011 ± 0.279
1.516TrpLeu: 1.516 ± 0.359
0.433TrpMet: 0.433 ± 0.189
0.578TrpAsn: 0.578 ± 0.216
0.939TrpPro: 0.939 ± 0.295
0.65TrpGln: 0.65 ± 0.187
1.372TrpArg: 1.372 ± 0.317
1.155TrpSer: 1.155 ± 0.336
1.011TrpThr: 1.011 ± 0.265
1.083TrpVal: 1.083 ± 0.324
0.433TrpTrp: 0.433 ± 0.178
0.289TrpTyr: 0.289 ± 0.16
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.816TyrAla: 2.816 ± 0.364
0.144TyrCys: 0.144 ± 0.128
1.95TyrAsp: 1.95 ± 0.393
1.155TyrGlu: 1.155 ± 0.271
0.505TyrPhe: 0.505 ± 0.196
2.383TyrGly: 2.383 ± 0.425
0.433TyrHis: 0.433 ± 0.186
0.505TyrIle: 0.505 ± 0.197
0.505TyrLys: 0.505 ± 0.219
2.383TyrLeu: 2.383 ± 0.355
0.0TyrMet: 0.0 ± 0.0
0.867TyrAsn: 0.867 ± 0.31
1.3TyrPro: 1.3 ± 0.346
1.372TyrGln: 1.372 ± 0.298
2.383TyrArg: 2.383 ± 0.374
1.011TyrSer: 1.011 ± 0.243
1.155TyrThr: 1.155 ± 0.269
1.661TyrVal: 1.661 ± 0.379
0.361TyrTrp: 0.361 ± 0.195
0.505TyrTyr: 0.505 ± 0.195
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (13849 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski