Amino acid dipepetide frequency for Morganella phage IME1369_01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.984AlaAla: 8.984 ± 1.971
0.859AlaCys: 0.859 ± 0.358
5.078AlaAsp: 5.078 ± 0.886
6.406AlaGlu: 6.406 ± 0.779
2.734AlaPhe: 2.734 ± 0.362
6.328AlaGly: 6.328 ± 0.887
1.094AlaHis: 1.094 ± 0.302
5.078AlaIle: 5.078 ± 0.569
6.406AlaLys: 6.406 ± 0.749
7.187AlaLeu: 7.187 ± 0.802
3.047AlaMet: 3.047 ± 0.472
3.047AlaAsn: 3.047 ± 0.611
2.578AlaPro: 2.578 ± 0.444
3.828AlaGln: 3.828 ± 1.296
3.672AlaArg: 3.672 ± 0.436
4.297AlaSer: 4.297 ± 0.898
4.453AlaThr: 4.453 ± 1.055
5.781AlaVal: 5.781 ± 0.903
1.25AlaTrp: 1.25 ± 0.265
2.578AlaTyr: 2.578 ± 0.365
0.0AlaXaa: 0.0 ± 0.0
Cys
0.391CysAla: 0.391 ± 0.196
0.469CysCys: 0.469 ± 0.185
1.016CysAsp: 1.016 ± 0.305
0.625CysGlu: 0.625 ± 0.266
0.469CysPhe: 0.469 ± 0.212
1.016CysGly: 1.016 ± 0.415
0.312CysHis: 0.312 ± 0.146
0.937CysIle: 0.937 ± 0.218
0.703CysLys: 0.703 ± 0.274
0.703CysLeu: 0.703 ± 0.242
0.078CysMet: 0.078 ± 0.071
0.625CysAsn: 0.625 ± 0.222
0.469CysPro: 0.469 ± 0.187
0.469CysGln: 0.469 ± 0.205
1.172CysArg: 1.172 ± 0.373
0.781CysSer: 0.781 ± 0.235
0.937CysThr: 0.937 ± 0.293
0.625CysVal: 0.625 ± 0.178
0.078CysTrp: 0.078 ± 0.068
0.391CysTyr: 0.391 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
5.468AspAla: 5.468 ± 0.563
1.094AspCys: 1.094 ± 0.331
3.593AspAsp: 3.593 ± 0.637
4.375AspGlu: 4.375 ± 0.678
1.953AspPhe: 1.953 ± 0.534
6.015AspGly: 6.015 ± 0.715
0.391AspHis: 0.391 ± 0.166
3.593AspIle: 3.593 ± 0.584
3.593AspLys: 3.593 ± 0.578
4.218AspLeu: 4.218 ± 0.625
1.797AspMet: 1.797 ± 0.348
3.047AspAsn: 3.047 ± 0.593
1.484AspPro: 1.484 ± 0.402
1.172AspGln: 1.172 ± 0.284
2.5AspArg: 2.5 ± 0.509
3.593AspSer: 3.593 ± 0.48
2.265AspThr: 2.265 ± 0.49
3.593AspVal: 3.593 ± 0.356
0.625AspTrp: 0.625 ± 0.187
2.89AspTyr: 2.89 ± 0.521
0.0AspXaa: 0.0 ± 0.0
Glu
6.25GluAla: 6.25 ± 1.303
0.703GluCys: 0.703 ± 0.251
2.265GluAsp: 2.265 ± 0.357
2.89GluGlu: 2.89 ± 0.528
3.203GluPhe: 3.203 ± 0.498
2.969GluGly: 2.969 ± 0.541
1.484GluHis: 1.484 ± 0.353
4.765GluIle: 4.765 ± 0.624
4.453GluLys: 4.453 ± 0.65
5.859GluLeu: 5.859 ± 0.924
2.109GluMet: 2.109 ± 0.419
2.812GluAsn: 2.812 ± 0.434
2.422GluPro: 2.422 ± 0.487
4.14GluGln: 4.14 ± 0.577
3.75GluArg: 3.75 ± 0.541
3.828GluSer: 3.828 ± 0.51
2.5GluThr: 2.5 ± 0.459
3.593GluVal: 3.593 ± 0.586
1.719GluTrp: 1.719 ± 0.37
2.812GluTyr: 2.812 ± 0.469
0.0GluXaa: 0.0 ± 0.0
Phe
2.187PheAla: 2.187 ± 0.408
0.469PheCys: 0.469 ± 0.174
2.422PheAsp: 2.422 ± 0.406
2.656PheGlu: 2.656 ± 0.436
1.328PhePhe: 1.328 ± 0.338
3.515PheGly: 3.515 ± 0.5
0.625PheHis: 0.625 ± 0.211
2.187PheIle: 2.187 ± 0.438
1.875PheLys: 1.875 ± 0.444
3.047PheLeu: 3.047 ± 0.457
1.016PheMet: 1.016 ± 0.293
2.734PheAsn: 2.734 ± 0.431
1.484PhePro: 1.484 ± 0.352
1.406PheGln: 1.406 ± 0.262
2.734PheArg: 2.734 ± 0.425
2.89PheSer: 2.89 ± 0.636
2.187PheThr: 2.187 ± 0.477
2.578PheVal: 2.578 ± 0.523
0.547PheTrp: 0.547 ± 0.208
1.094PheTyr: 1.094 ± 0.313
0.0PheXaa: 0.0 ± 0.0
Gly
5.0GlyAla: 5.0 ± 0.996
1.016GlyCys: 1.016 ± 0.287
3.437GlyAsp: 3.437 ± 0.54
4.843GlyGlu: 4.843 ± 0.717
3.125GlyPhe: 3.125 ± 0.556
5.703GlyGly: 5.703 ± 0.888
1.016GlyHis: 1.016 ± 0.301
5.078GlyIle: 5.078 ± 0.68
5.312GlyLys: 5.312 ± 0.775
5.859GlyLeu: 5.859 ± 0.789
1.719GlyMet: 1.719 ± 0.418
3.75GlyAsn: 3.75 ± 0.542
1.406GlyPro: 1.406 ± 0.307
3.281GlyGln: 3.281 ± 0.641
3.515GlyArg: 3.515 ± 0.5
4.687GlySer: 4.687 ± 0.533
4.062GlyThr: 4.062 ± 0.622
4.531GlyVal: 4.531 ± 0.461
1.094GlyTrp: 1.094 ± 0.318
2.578GlyTyr: 2.578 ± 0.47
0.0GlyXaa: 0.0 ± 0.0
His
1.875HisAla: 1.875 ± 0.408
0.156HisCys: 0.156 ± 0.093
1.719HisAsp: 1.719 ± 0.397
0.469HisGlu: 0.469 ± 0.205
0.781HisPhe: 0.781 ± 0.27
1.25HisGly: 1.25 ± 0.347
0.234HisHis: 0.234 ± 0.137
1.328HisIle: 1.328 ± 0.319
0.703HisLys: 0.703 ± 0.22
1.094HisLeu: 1.094 ± 0.311
0.547HisMet: 0.547 ± 0.222
0.703HisAsn: 0.703 ± 0.244
1.25HisPro: 1.25 ± 0.323
0.703HisGln: 0.703 ± 0.212
1.094HisArg: 1.094 ± 0.287
0.625HisSer: 0.625 ± 0.235
1.172HisThr: 1.172 ± 0.377
0.781HisVal: 0.781 ± 0.281
0.078HisTrp: 0.078 ± 0.082
0.859HisTyr: 0.859 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
5.859IleAla: 5.859 ± 0.533
0.859IleCys: 0.859 ± 0.266
3.672IleAsp: 3.672 ± 0.641
4.531IleGlu: 4.531 ± 0.446
2.187IlePhe: 2.187 ± 0.369
4.375IleGly: 4.375 ± 0.631
1.016IleHis: 1.016 ± 0.299
5.156IleIle: 5.156 ± 0.783
4.218IleLys: 4.218 ± 0.488
3.828IleLeu: 3.828 ± 0.617
1.25IleMet: 1.25 ± 0.334
4.375IleAsn: 4.375 ± 0.746
3.047IlePro: 3.047 ± 0.419
2.344IleGln: 2.344 ± 0.404
3.984IleArg: 3.984 ± 0.559
4.531IleSer: 4.531 ± 0.545
3.593IleThr: 3.593 ± 0.622
3.437IleVal: 3.437 ± 0.408
0.781IleTrp: 0.781 ± 0.238
1.172IleTyr: 1.172 ± 0.367
0.0IleXaa: 0.0 ± 0.0
Lys
5.625LysAla: 5.625 ± 0.72
0.469LysCys: 0.469 ± 0.198
3.437LysAsp: 3.437 ± 0.423
4.062LysGlu: 4.062 ± 0.551
2.656LysPhe: 2.656 ± 0.47
3.906LysGly: 3.906 ± 0.524
0.937LysHis: 0.937 ± 0.28
2.812LysIle: 2.812 ± 0.497
4.218LysLys: 4.218 ± 0.649
4.609LysLeu: 4.609 ± 0.783
1.953LysMet: 1.953 ± 0.34
3.203LysAsn: 3.203 ± 0.52
3.437LysPro: 3.437 ± 0.615
3.359LysGln: 3.359 ± 0.41
4.062LysArg: 4.062 ± 0.61
4.375LysSer: 4.375 ± 0.659
4.297LysThr: 4.297 ± 0.638
4.843LysVal: 4.843 ± 0.642
0.937LysTrp: 0.937 ± 0.258
2.344LysTyr: 2.344 ± 0.45
0.0LysXaa: 0.0 ± 0.0
Leu
7.031LeuAla: 7.031 ± 1.299
1.016LeuCys: 1.016 ± 0.317
4.375LeuAsp: 4.375 ± 0.496
4.609LeuGlu: 4.609 ± 0.611
2.5LeuPhe: 2.5 ± 0.515
4.218LeuGly: 4.218 ± 0.56
1.64LeuHis: 1.64 ± 0.371
5.156LeuIle: 5.156 ± 0.659
4.765LeuLys: 4.765 ± 0.575
4.921LeuLeu: 4.921 ± 0.65
1.64LeuMet: 1.64 ± 0.511
4.297LeuAsn: 4.297 ± 0.438
3.125LeuPro: 3.125 ± 0.537
2.578LeuGln: 2.578 ± 0.547
4.062LeuArg: 4.062 ± 0.5
6.484LeuSer: 6.484 ± 0.86
5.39LeuThr: 5.39 ± 0.726
3.906LeuVal: 3.906 ± 0.509
0.625LeuTrp: 0.625 ± 0.217
1.64LeuTyr: 1.64 ± 0.349
0.0LeuXaa: 0.0 ± 0.0
Met
2.265MetAla: 2.265 ± 0.526
0.078MetCys: 0.078 ± 0.085
1.64MetAsp: 1.64 ± 0.337
1.016MetGlu: 1.016 ± 0.273
1.172MetPhe: 1.172 ± 0.327
1.562MetGly: 1.562 ± 0.329
0.469MetHis: 0.469 ± 0.171
2.265MetIle: 2.265 ± 0.379
2.265MetLys: 2.265 ± 0.371
2.031MetLeu: 2.031 ± 0.298
0.469MetMet: 0.469 ± 0.191
1.484MetAsn: 1.484 ± 0.33
1.094MetPro: 1.094 ± 0.279
1.484MetGln: 1.484 ± 0.305
1.719MetArg: 1.719 ± 0.409
2.109MetSer: 2.109 ± 0.518
2.109MetThr: 2.109 ± 0.567
1.562MetVal: 1.562 ± 0.343
0.312MetTrp: 0.312 ± 0.137
0.625MetTyr: 0.625 ± 0.157
0.0MetXaa: 0.0 ± 0.0
Asn
3.593AsnAla: 3.593 ± 0.514
0.469AsnCys: 0.469 ± 0.195
2.89AsnAsp: 2.89 ± 0.4
2.031AsnGlu: 2.031 ± 0.387
1.562AsnPhe: 1.562 ± 0.332
4.297AsnGly: 4.297 ± 0.721
1.562AsnHis: 1.562 ± 0.383
4.062AsnIle: 4.062 ± 0.456
2.656AsnLys: 2.656 ± 0.381
2.812AsnLeu: 2.812 ± 0.539
1.328AsnMet: 1.328 ± 0.343
3.359AsnAsn: 3.359 ± 0.54
2.344AsnPro: 2.344 ± 0.436
2.265AsnGln: 2.265 ± 0.368
2.969AsnArg: 2.969 ± 0.545
3.75AsnSer: 3.75 ± 0.549
2.5AsnThr: 2.5 ± 0.507
2.969AsnVal: 2.969 ± 0.359
0.781AsnTrp: 0.781 ± 0.209
1.484AsnTyr: 1.484 ± 0.433
0.0AsnXaa: 0.0 ± 0.0
Pro
3.047ProAla: 3.047 ± 0.487
0.547ProCys: 0.547 ± 0.239
3.203ProAsp: 3.203 ± 0.446
3.437ProGlu: 3.437 ± 0.636
1.562ProPhe: 1.562 ± 0.386
1.875ProGly: 1.875 ± 0.436
1.172ProHis: 1.172 ± 0.304
1.797ProIle: 1.797 ± 0.44
2.109ProLys: 2.109 ± 0.381
2.734ProLeu: 2.734 ± 0.569
0.781ProMet: 0.781 ± 0.315
1.719ProAsn: 1.719 ± 0.348
1.25ProPro: 1.25 ± 0.314
1.016ProGln: 1.016 ± 0.297
1.64ProArg: 1.64 ± 0.349
1.484ProSer: 1.484 ± 0.333
1.64ProThr: 1.64 ± 0.398
4.531ProVal: 4.531 ± 0.523
0.391ProTrp: 0.391 ± 0.169
0.625ProTyr: 0.625 ± 0.204
0.0ProXaa: 0.0 ± 0.0
Gln
4.062GlnAla: 4.062 ± 0.697
0.547GlnCys: 0.547 ± 0.207
2.187GlnAsp: 2.187 ± 0.365
2.578GlnGlu: 2.578 ± 0.6
1.562GlnPhe: 1.562 ± 0.314
2.422GlnGly: 2.422 ± 0.426
0.703GlnHis: 0.703 ± 0.265
3.203GlnIle: 3.203 ± 0.476
3.047GlnLys: 3.047 ± 0.546
3.437GlnLeu: 3.437 ± 0.568
1.406GlnMet: 1.406 ± 0.334
1.719GlnAsn: 1.719 ± 0.448
1.25GlnPro: 1.25 ± 0.332
2.578GlnGln: 2.578 ± 0.416
3.281GlnArg: 3.281 ± 0.642
2.89GlnSer: 2.89 ± 0.521
1.875GlnThr: 1.875 ± 0.315
2.5GlnVal: 2.5 ± 0.491
0.937GlnTrp: 0.937 ± 0.246
1.172GlnTyr: 1.172 ± 0.256
0.0GlnXaa: 0.0 ± 0.0
Arg
3.906ArgAla: 3.906 ± 0.533
0.703ArgCys: 0.703 ± 0.284
3.125ArgAsp: 3.125 ± 0.575
4.765ArgGlu: 4.765 ± 0.703
2.344ArgPhe: 2.344 ± 0.496
2.5ArgGly: 2.5 ± 0.477
1.25ArgHis: 1.25 ± 0.338
3.828ArgIle: 3.828 ± 0.513
3.828ArgLys: 3.828 ± 0.584
4.687ArgLeu: 4.687 ± 0.468
1.797ArgMet: 1.797 ± 0.402
2.969ArgAsn: 2.969 ± 0.483
1.875ArgPro: 1.875 ± 0.434
2.344ArgGln: 2.344 ± 0.47
3.75ArgArg: 3.75 ± 0.627
3.359ArgSer: 3.359 ± 0.561
2.656ArgThr: 2.656 ± 0.382
3.906ArgVal: 3.906 ± 0.604
0.859ArgTrp: 0.859 ± 0.29
2.812ArgTyr: 2.812 ± 0.431
0.0ArgXaa: 0.0 ± 0.0
Ser
6.171SerAla: 6.171 ± 0.899
0.781SerCys: 0.781 ± 0.283
4.609SerAsp: 4.609 ± 0.607
4.297SerGlu: 4.297 ± 0.482
2.734SerPhe: 2.734 ± 0.51
6.25SerGly: 6.25 ± 0.688
1.328SerHis: 1.328 ± 0.304
2.969SerIle: 2.969 ± 0.4
3.515SerLys: 3.515 ± 0.568
4.921SerLeu: 4.921 ± 0.837
1.953SerMet: 1.953 ± 0.347
3.047SerAsn: 3.047 ± 0.443
2.187SerPro: 2.187 ± 0.423
2.656SerGln: 2.656 ± 0.493
4.687SerArg: 4.687 ± 0.511
2.656SerSer: 2.656 ± 0.482
3.203SerThr: 3.203 ± 0.536
4.14SerVal: 4.14 ± 0.838
1.172SerTrp: 1.172 ± 0.296
1.64SerTyr: 1.64 ± 0.377
0.0SerXaa: 0.0 ± 0.0
Thr
5.781ThrAla: 5.781 ± 0.792
0.469ThrCys: 0.469 ± 0.186
2.734ThrAsp: 2.734 ± 0.457
4.921ThrGlu: 4.921 ± 0.595
2.109ThrPhe: 2.109 ± 0.345
5.312ThrGly: 5.312 ± 0.739
0.625ThrHis: 0.625 ± 0.198
2.422ThrIle: 2.422 ± 0.324
3.906ThrLys: 3.906 ± 0.686
3.984ThrLeu: 3.984 ± 0.565
1.797ThrMet: 1.797 ± 0.43
2.734ThrAsn: 2.734 ± 0.51
2.265ThrPro: 2.265 ± 0.476
2.031ThrGln: 2.031 ± 0.302
2.265ThrArg: 2.265 ± 0.417
2.969ThrSer: 2.969 ± 0.418
3.515ThrThr: 3.515 ± 0.549
4.453ThrVal: 4.453 ± 0.69
0.547ThrTrp: 0.547 ± 0.234
1.797ThrTyr: 1.797 ± 0.348
0.0ThrXaa: 0.0 ± 0.0
Val
4.453ValAla: 4.453 ± 0.648
0.859ValCys: 0.859 ± 0.303
3.75ValAsp: 3.75 ± 0.506
3.203ValGlu: 3.203 ± 0.508
2.109ValPhe: 2.109 ± 0.365
4.14ValGly: 4.14 ± 0.544
1.016ValHis: 1.016 ± 0.231
4.297ValIle: 4.297 ± 0.612
5.156ValLys: 5.156 ± 0.559
4.062ValLeu: 4.062 ± 0.645
1.562ValMet: 1.562 ± 0.326
2.5ValAsn: 2.5 ± 0.381
2.344ValPro: 2.344 ± 0.365
3.359ValGln: 3.359 ± 0.682
3.359ValArg: 3.359 ± 0.433
5.546ValSer: 5.546 ± 0.673
5.39ValThr: 5.39 ± 0.805
5.625ValVal: 5.625 ± 0.681
1.016ValTrp: 1.016 ± 0.298
2.422ValTyr: 2.422 ± 0.424
0.0ValXaa: 0.0 ± 0.0
Trp
0.937TrpAla: 0.937 ± 0.254
0.469TrpCys: 0.469 ± 0.198
0.547TrpAsp: 0.547 ± 0.213
0.547TrpGlu: 0.547 ± 0.181
1.172TrpPhe: 1.172 ± 0.34
0.859TrpGly: 0.859 ± 0.231
0.078TrpHis: 0.078 ± 0.076
0.781TrpIle: 0.781 ± 0.332
1.25TrpLys: 1.25 ± 0.268
1.328TrpLeu: 1.328 ± 0.327
0.234TrpMet: 0.234 ± 0.136
0.469TrpAsn: 0.469 ± 0.181
0.312TrpPro: 0.312 ± 0.187
0.703TrpGln: 0.703 ± 0.252
1.328TrpArg: 1.328 ± 0.275
1.094TrpSer: 1.094 ± 0.314
0.781TrpThr: 0.781 ± 0.308
1.172TrpVal: 1.172 ± 0.303
0.391TrpTrp: 0.391 ± 0.187
0.625TrpTyr: 0.625 ± 0.222
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.797TyrAla: 1.797 ± 0.383
0.234TyrCys: 0.234 ± 0.121
1.562TyrAsp: 1.562 ± 0.349
2.265TyrGlu: 2.265 ± 0.441
1.719TyrPhe: 1.719 ± 0.409
2.5TyrGly: 2.5 ± 0.4
0.469TyrHis: 0.469 ± 0.158
2.5TyrIle: 2.5 ± 0.487
1.562TyrLys: 1.562 ± 0.372
2.656TyrLeu: 2.656 ± 0.482
1.172TyrMet: 1.172 ± 0.277
1.094TyrAsn: 1.094 ± 0.277
0.937TyrPro: 0.937 ± 0.277
1.64TyrGln: 1.64 ± 0.323
1.719TyrArg: 1.719 ± 0.423
2.969TyrSer: 2.969 ± 0.495
2.187TyrThr: 2.187 ± 0.414
1.64TyrVal: 1.64 ± 0.331
0.937TyrTrp: 0.937 ± 0.273
1.172TyrTyr: 1.172 ± 0.269
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (12802 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski