Amino acid dipepetide frequency for Pseudomonas phage MPK7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.838AlaAla: 13.838 ± 1.329
0.972AlaCys: 0.972 ± 0.299
5.76AlaAsp: 5.76 ± 0.581
8.228AlaGlu: 8.228 ± 0.838
4.039AlaPhe: 4.039 ± 0.564
10.696AlaGly: 10.696 ± 1.456
2.169AlaHis: 2.169 ± 0.42
3.815AlaIle: 3.815 ± 0.623
5.984AlaLys: 5.984 ± 0.791
10.547AlaLeu: 10.547 ± 0.954
3.74AlaMet: 3.74 ± 0.548
3.516AlaAsn: 3.516 ± 0.684
4.712AlaPro: 4.712 ± 0.869
6.059AlaGln: 6.059 ± 0.867
5.012AlaArg: 5.012 ± 0.709
4.563AlaSer: 4.563 ± 0.647
6.358AlaThr: 6.358 ± 0.838
8.452AlaVal: 8.452 ± 0.943
1.122AlaTrp: 1.122 ± 0.189
2.917AlaTyr: 2.917 ± 0.67
0.0AlaXaa: 0.0 ± 0.0
Cys
1.197CysAla: 1.197 ± 0.327
0.075CysCys: 0.075 ± 0.071
0.299CysAsp: 0.299 ± 0.18
0.524CysGlu: 0.524 ± 0.199
0.224CysPhe: 0.224 ± 0.125
0.673CysGly: 0.673 ± 0.268
0.299CysHis: 0.299 ± 0.173
0.673CysIle: 0.673 ± 0.379
0.449CysLys: 0.449 ± 0.278
1.047CysLeu: 1.047 ± 0.295
0.299CysMet: 0.299 ± 0.18
0.224CysAsn: 0.224 ± 0.126
0.374CysPro: 0.374 ± 0.157
0.598CysGln: 0.598 ± 0.246
0.524CysArg: 0.524 ± 0.237
0.15CysSer: 0.15 ± 0.109
0.673CysThr: 0.673 ± 0.221
0.449CysVal: 0.449 ± 0.21
0.15CysTrp: 0.15 ± 0.164
0.524CysTyr: 0.524 ± 0.265
0.0CysXaa: 0.0 ± 0.0
Asp
5.909AspAla: 5.909 ± 0.546
0.449AspCys: 0.449 ± 0.211
3.067AspAsp: 3.067 ± 0.418
5.161AspGlu: 5.161 ± 0.429
1.795AspPhe: 1.795 ± 0.315
4.338AspGly: 4.338 ± 0.676
1.346AspHis: 1.346 ± 0.3
3.216AspIle: 3.216 ± 0.346
2.468AspLys: 2.468 ± 0.461
5.012AspLeu: 5.012 ± 0.503
1.272AspMet: 1.272 ± 0.27
2.02AspAsn: 2.02 ± 0.4
3.89AspPro: 3.89 ± 0.392
2.169AspGln: 2.169 ± 0.438
3.964AspArg: 3.964 ± 0.622
3.291AspSer: 3.291 ± 0.421
3.665AspThr: 3.665 ± 0.71
3.067AspVal: 3.067 ± 0.386
0.898AspTrp: 0.898 ± 0.184
1.87AspTyr: 1.87 ± 0.464
0.0AspXaa: 0.0 ± 0.0
Glu
9.275GluAla: 9.275 ± 1.144
0.374GluCys: 0.374 ± 0.187
2.917GluAsp: 2.917 ± 0.543
3.89GluGlu: 3.89 ± 0.554
2.468GluPhe: 2.468 ± 0.343
5.012GluGly: 5.012 ± 0.55
1.646GluHis: 1.646 ± 0.353
2.618GluIle: 2.618 ± 0.429
2.094GluLys: 2.094 ± 0.447
6.807GluLeu: 6.807 ± 0.722
2.094GluMet: 2.094 ± 0.306
1.795GluAsn: 1.795 ± 0.318
1.496GluPro: 1.496 ± 0.343
3.441GluGln: 3.441 ± 0.504
4.039GluArg: 4.039 ± 0.606
2.319GluSer: 2.319 ± 0.464
2.468GluThr: 2.468 ± 0.48
4.563GluVal: 4.563 ± 0.683
0.972GluTrp: 0.972 ± 0.194
2.094GluTyr: 2.094 ± 0.458
0.0GluXaa: 0.0 ± 0.0
Phe
2.319PheAla: 2.319 ± 0.402
0.449PheCys: 0.449 ± 0.186
2.917PheAsp: 2.917 ± 0.404
1.795PheGlu: 1.795 ± 0.412
1.047PhePhe: 1.047 ± 0.213
2.169PheGly: 2.169 ± 0.437
0.748PheHis: 0.748 ± 0.206
1.72PheIle: 1.72 ± 0.309
1.87PheLys: 1.87 ± 0.235
2.468PheLeu: 2.468 ± 0.369
0.898PheMet: 0.898 ± 0.207
1.421PheAsn: 1.421 ± 0.279
2.094PhePro: 2.094 ± 0.455
1.646PheGln: 1.646 ± 0.269
2.468PheArg: 2.468 ± 0.431
2.169PheSer: 2.169 ± 0.431
1.496PheThr: 1.496 ± 0.268
1.496PheVal: 1.496 ± 0.274
0.449PheTrp: 0.449 ± 0.161
0.598PheTyr: 0.598 ± 0.209
0.0PheXaa: 0.0 ± 0.0
Gly
7.031GlyAla: 7.031 ± 0.961
0.898GlyCys: 0.898 ± 0.249
3.89GlyAsp: 3.89 ± 0.507
4.488GlyGlu: 4.488 ± 0.681
2.917GlyPhe: 2.917 ± 0.631
5.61GlyGly: 5.61 ± 0.947
0.898GlyHis: 0.898 ± 0.217
4.862GlyIle: 4.862 ± 0.506
4.189GlyLys: 4.189 ± 0.513
7.106GlyLeu: 7.106 ± 0.74
2.768GlyMet: 2.768 ± 0.521
2.02GlyAsn: 2.02 ± 0.459
2.244GlyPro: 2.244 ± 0.401
3.665GlyGln: 3.665 ± 0.593
6.732GlyArg: 6.732 ± 0.533
4.413GlySer: 4.413 ± 0.445
5.685GlyThr: 5.685 ± 0.724
6.358GlyVal: 6.358 ± 0.727
1.795GlyTrp: 1.795 ± 0.464
2.394GlyTyr: 2.394 ± 0.42
0.0GlyXaa: 0.0 ± 0.0
His
1.496HisAla: 1.496 ± 0.281
0.299HisCys: 0.299 ± 0.157
1.047HisAsp: 1.047 ± 0.294
1.571HisGlu: 1.571 ± 0.37
0.748HisPhe: 0.748 ± 0.234
1.272HisGly: 1.272 ± 0.28
0.524HisHis: 0.524 ± 0.218
0.972HisIle: 0.972 ± 0.248
0.972HisLys: 0.972 ± 0.366
1.945HisLeu: 1.945 ± 0.379
0.673HisMet: 0.673 ± 0.248
0.898HisAsn: 0.898 ± 0.252
0.823HisPro: 0.823 ± 0.27
1.047HisGln: 1.047 ± 0.239
2.244HisArg: 2.244 ± 0.44
1.047HisSer: 1.047 ± 0.315
1.272HisThr: 1.272 ± 0.329
2.169HisVal: 2.169 ± 0.407
0.524HisTrp: 0.524 ± 0.155
0.748HisTyr: 0.748 ± 0.236
0.0HisXaa: 0.0 ± 0.0
Ile
5.46IleAla: 5.46 ± 0.773
0.374IleCys: 0.374 ± 0.169
3.366IleAsp: 3.366 ± 0.524
2.693IleGlu: 2.693 ± 0.331
1.197IlePhe: 1.197 ± 0.302
2.992IleGly: 2.992 ± 0.466
1.421IleHis: 1.421 ± 0.287
1.795IleIle: 1.795 ± 0.376
2.693IleLys: 2.693 ± 0.442
2.319IleLeu: 2.319 ± 0.588
1.272IleMet: 1.272 ± 0.319
1.945IleAsn: 1.945 ± 0.276
2.394IlePro: 2.394 ± 0.394
2.468IleGln: 2.468 ± 0.444
2.917IleArg: 2.917 ± 0.533
0.972IleSer: 0.972 ± 0.264
2.543IleThr: 2.543 ± 0.497
2.917IleVal: 2.917 ± 0.539
0.524IleTrp: 0.524 ± 0.211
0.898IleTyr: 0.898 ± 0.306
0.0IleXaa: 0.0 ± 0.0
Lys
6.134LysAla: 6.134 ± 0.77
0.299LysCys: 0.299 ± 0.153
2.992LysAsp: 2.992 ± 0.518
2.094LysGlu: 2.094 ± 0.404
1.571LysPhe: 1.571 ± 0.287
4.413LysGly: 4.413 ± 0.51
0.898LysHis: 0.898 ± 0.337
0.972LysIle: 0.972 ± 0.228
1.795LysLys: 1.795 ± 0.465
4.189LysLeu: 4.189 ± 0.696
1.945LysMet: 1.945 ± 0.285
1.122LysAsn: 1.122 ± 0.352
2.394LysPro: 2.394 ± 0.381
2.693LysGln: 2.693 ± 0.492
2.917LysArg: 2.917 ± 0.417
2.169LysSer: 2.169 ± 0.458
2.02LysThr: 2.02 ± 0.386
3.964LysVal: 3.964 ± 0.615
0.673LysTrp: 0.673 ± 0.212
0.898LysTyr: 0.898 ± 0.265
0.0LysXaa: 0.0 ± 0.0
Leu
10.173LeuAla: 10.173 ± 0.894
0.898LeuCys: 0.898 ± 0.254
6.283LeuAsp: 6.283 ± 0.775
5.984LeuGlu: 5.984 ± 0.673
1.795LeuPhe: 1.795 ± 0.438
7.63LeuGly: 7.63 ± 0.885
2.394LeuHis: 2.394 ± 0.382
3.366LeuIle: 3.366 ± 0.443
3.441LeuLys: 3.441 ± 0.336
6.882LeuLeu: 6.882 ± 0.799
2.244LeuMet: 2.244 ± 0.416
3.516LeuAsn: 3.516 ± 0.532
4.338LeuPro: 4.338 ± 0.565
3.067LeuGln: 3.067 ± 0.393
6.208LeuArg: 6.208 ± 0.647
4.787LeuSer: 4.787 ± 0.551
4.712LeuThr: 4.712 ± 0.661
5.46LeuVal: 5.46 ± 0.609
1.346LeuTrp: 1.346 ± 0.254
3.142LeuTyr: 3.142 ± 0.537
0.0LeuXaa: 0.0 ± 0.0
Met
4.264MetAla: 4.264 ± 0.598
0.0MetCys: 0.0 ± 0.0
2.094MetAsp: 2.094 ± 0.267
1.87MetGlu: 1.87 ± 0.392
0.524MetPhe: 0.524 ± 0.246
2.094MetGly: 2.094 ± 0.362
0.972MetHis: 0.972 ± 0.333
0.673MetIle: 0.673 ± 0.2
1.047MetLys: 1.047 ± 0.248
2.319MetLeu: 2.319 ± 0.444
0.823MetMet: 0.823 ± 0.232
1.421MetAsn: 1.421 ± 0.394
1.421MetPro: 1.421 ± 0.304
1.795MetGln: 1.795 ± 0.414
1.72MetArg: 1.72 ± 0.32
2.02MetSer: 2.02 ± 0.329
2.917MetThr: 2.917 ± 0.359
1.047MetVal: 1.047 ± 0.296
0.299MetTrp: 0.299 ± 0.152
0.898MetTyr: 0.898 ± 0.221
0.0MetXaa: 0.0 ± 0.0
Asn
3.815AsnAla: 3.815 ± 0.557
0.075AsnCys: 0.075 ± 0.087
1.945AsnAsp: 1.945 ± 0.367
1.945AsnGlu: 1.945 ± 0.313
0.972AsnPhe: 0.972 ± 0.302
2.768AsnGly: 2.768 ± 0.386
0.598AsnHis: 0.598 ± 0.191
1.272AsnIle: 1.272 ± 0.333
2.169AsnLys: 2.169 ± 0.371
3.291AsnLeu: 3.291 ± 0.489
0.673AsnMet: 0.673 ± 0.233
1.272AsnAsn: 1.272 ± 0.32
2.319AsnPro: 2.319 ± 0.399
1.421AsnGln: 1.421 ± 0.326
1.87AsnArg: 1.87 ± 0.378
1.496AsnSer: 1.496 ± 0.459
2.094AsnThr: 2.094 ± 0.395
3.441AsnVal: 3.441 ± 0.727
1.272AsnTrp: 1.272 ± 0.293
1.197AsnTyr: 1.197 ± 0.234
0.0AsnXaa: 0.0 ± 0.0
Pro
5.909ProAla: 5.909 ± 1.022
0.449ProCys: 0.449 ± 0.192
2.618ProAsp: 2.618 ± 0.547
4.039ProGlu: 4.039 ± 0.614
1.197ProPhe: 1.197 ± 0.307
4.039ProGly: 4.039 ± 0.587
0.748ProHis: 0.748 ± 0.22
1.646ProIle: 1.646 ± 0.315
2.319ProLys: 2.319 ± 0.367
3.815ProLeu: 3.815 ± 0.43
0.673ProMet: 0.673 ± 0.191
2.02ProAsn: 2.02 ± 0.431
1.795ProPro: 1.795 ± 0.663
1.945ProGln: 1.945 ± 0.624
2.543ProArg: 2.543 ± 0.44
2.618ProSer: 2.618 ± 0.49
2.468ProThr: 2.468 ± 0.432
3.291ProVal: 3.291 ± 0.661
0.898ProTrp: 0.898 ± 0.258
1.421ProTyr: 1.421 ± 0.283
0.0ProXaa: 0.0 ± 0.0
Gln
6.807GlnAla: 6.807 ± 0.982
0.598GlnCys: 0.598 ± 0.239
2.02GlnAsp: 2.02 ± 0.346
2.917GlnGlu: 2.917 ± 0.445
1.421GlnPhe: 1.421 ± 0.375
4.039GlnGly: 4.039 ± 0.535
0.898GlnHis: 0.898 ± 0.233
2.244GlnIle: 2.244 ± 0.264
1.197GlnLys: 1.197 ± 0.234
3.74GlnLeu: 3.74 ± 0.54
1.047GlnMet: 1.047 ± 0.227
1.646GlnAsn: 1.646 ± 0.292
1.87GlnPro: 1.87 ± 0.61
2.468GlnGln: 2.468 ± 0.453
3.291GlnArg: 3.291 ± 0.528
2.244GlnSer: 2.244 ± 0.431
1.795GlnThr: 1.795 ± 0.341
3.291GlnVal: 3.291 ± 0.482
0.598GlnTrp: 0.598 ± 0.244
1.795GlnTyr: 1.795 ± 0.375
0.0GlnXaa: 0.0 ± 0.0
Arg
7.106ArgAla: 7.106 ± 0.868
0.673ArgCys: 0.673 ± 0.236
4.937ArgAsp: 4.937 ± 0.588
3.964ArgGlu: 3.964 ± 0.599
3.142ArgPhe: 3.142 ± 0.53
5.311ArgGly: 5.311 ± 0.544
1.197ArgHis: 1.197 ± 0.327
3.441ArgIle: 3.441 ± 0.541
3.74ArgLys: 3.74 ± 0.689
5.161ArgLeu: 5.161 ± 0.446
2.693ArgMet: 2.693 ± 0.456
2.992ArgAsn: 2.992 ± 0.441
2.992ArgPro: 2.992 ± 0.512
2.468ArgGln: 2.468 ± 0.339
6.508ArgArg: 6.508 ± 0.741
3.89ArgSer: 3.89 ± 0.635
3.291ArgThr: 3.291 ± 0.374
4.264ArgVal: 4.264 ± 0.615
1.571ArgTrp: 1.571 ± 0.364
1.72ArgTyr: 1.72 ± 0.4
0.0ArgXaa: 0.0 ± 0.0
Ser
5.311SerAla: 5.311 ± 0.597
0.449SerCys: 0.449 ± 0.176
2.543SerAsp: 2.543 ± 0.372
2.842SerGlu: 2.842 ± 0.486
1.197SerPhe: 1.197 ± 0.264
3.815SerGly: 3.815 ± 0.473
1.122SerHis: 1.122 ± 0.286
2.244SerIle: 2.244 ± 0.57
2.693SerLys: 2.693 ± 0.349
5.61SerLeu: 5.61 ± 0.683
1.87SerMet: 1.87 ± 0.319
1.421SerAsn: 1.421 ± 0.252
2.02SerPro: 2.02 ± 0.281
2.394SerGln: 2.394 ± 0.447
4.563SerArg: 4.563 ± 0.674
1.795SerSer: 1.795 ± 0.458
1.87SerThr: 1.87 ± 0.487
3.067SerVal: 3.067 ± 0.583
0.524SerTrp: 0.524 ± 0.196
1.346SerTyr: 1.346 ± 0.256
0.0SerXaa: 0.0 ± 0.0
Thr
6.657ThrAla: 6.657 ± 0.924
0.748ThrCys: 0.748 ± 0.241
2.992ThrAsp: 2.992 ± 0.606
2.468ThrGlu: 2.468 ± 0.389
2.02ThrPhe: 2.02 ± 0.338
4.563ThrGly: 4.563 ± 0.583
1.646ThrHis: 1.646 ± 0.282
3.216ThrIle: 3.216 ± 0.428
2.768ThrLys: 2.768 ± 0.455
4.338ThrLeu: 4.338 ± 0.576
1.346ThrMet: 1.346 ± 0.356
2.094ThrAsn: 2.094 ± 0.421
3.441ThrPro: 3.441 ± 0.452
2.094ThrGln: 2.094 ± 0.369
2.917ThrArg: 2.917 ± 0.554
2.842ThrSer: 2.842 ± 0.44
2.468ThrThr: 2.468 ± 0.445
2.992ThrVal: 2.992 ± 0.514
0.898ThrTrp: 0.898 ± 0.269
1.72ThrTyr: 1.72 ± 0.294
0.0ThrXaa: 0.0 ± 0.0
Val
7.031ValAla: 7.031 ± 0.766
0.972ValCys: 0.972 ± 0.318
4.712ValAsp: 4.712 ± 0.627
3.516ValGlu: 3.516 ± 0.443
2.319ValPhe: 2.319 ± 0.426
5.236ValGly: 5.236 ± 0.721
1.571ValHis: 1.571 ± 0.454
2.543ValIle: 2.543 ± 0.534
2.842ValLys: 2.842 ± 0.463
6.582ValLeu: 6.582 ± 0.689
2.394ValMet: 2.394 ± 0.395
2.319ValAsn: 2.319 ± 0.357
3.516ValPro: 3.516 ± 0.864
2.468ValGln: 2.468 ± 0.529
6.358ValArg: 6.358 ± 0.72
2.543ValSer: 2.543 ± 0.404
3.74ValThr: 3.74 ± 0.623
4.712ValVal: 4.712 ± 0.609
0.374ValTrp: 0.374 ± 0.178
2.02ValTyr: 2.02 ± 0.482
0.0ValXaa: 0.0 ± 0.0
Trp
1.272TrpAla: 1.272 ± 0.344
0.299TrpCys: 0.299 ± 0.132
1.272TrpAsp: 1.272 ± 0.362
0.748TrpGlu: 0.748 ± 0.264
0.748TrpPhe: 0.748 ± 0.265
1.047TrpGly: 1.047 ± 0.334
0.224TrpHis: 0.224 ± 0.15
0.673TrpIle: 0.673 ± 0.257
0.524TrpLys: 0.524 ± 0.155
1.346TrpLeu: 1.346 ± 0.295
0.524TrpMet: 0.524 ± 0.193
0.972TrpAsn: 0.972 ± 0.257
0.598TrpPro: 0.598 ± 0.223
0.673TrpGln: 0.673 ± 0.272
1.346TrpArg: 1.346 ± 0.325
1.047TrpSer: 1.047 ± 0.309
0.823TrpThr: 0.823 ± 0.308
0.748TrpVal: 0.748 ± 0.311
0.224TrpTrp: 0.224 ± 0.172
0.673TrpTyr: 0.673 ± 0.2
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.496TyrAla: 1.496 ± 0.34
0.075TyrCys: 0.075 ± 0.078
1.197TyrAsp: 1.197 ± 0.434
1.646TyrGlu: 1.646 ± 0.299
0.898TyrPhe: 0.898 ± 0.271
2.244TyrGly: 2.244 ± 0.392
0.972TyrHis: 0.972 ± 0.242
1.197TyrIle: 1.197 ± 0.377
0.898TyrLys: 0.898 ± 0.303
3.142TyrLeu: 3.142 ± 0.453
0.898TyrMet: 0.898 ± 0.247
1.346TyrAsn: 1.346 ± 0.358
1.72TyrPro: 1.72 ± 0.403
1.346TyrGln: 1.346 ± 0.282
2.917TyrArg: 2.917 ± 0.413
2.319TyrSer: 2.319 ± 0.485
1.87TyrThr: 1.87 ± 0.273
2.02TyrVal: 2.02 ± 0.319
0.673TyrTrp: 0.673 ± 0.199
0.449TyrTyr: 0.449 ± 0.316
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (13370 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski