Amino acid dipepetide frequency for Yersinia phage phi80-18

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.832AlaAla: 10.832 ± 1.205
1.039AlaCys: 1.039 ± 0.295
6.306AlaAsp: 6.306 ± 1.21
6.974AlaGlu: 6.974 ± 0.852
2.522AlaPhe: 2.522 ± 0.455
7.196AlaGly: 7.196 ± 0.976
2.3AlaHis: 2.3 ± 0.432
3.932AlaIle: 3.932 ± 0.673
7.419AlaLys: 7.419 ± 0.778
8.606AlaLeu: 8.606 ± 0.684
2.151AlaMet: 2.151 ± 0.436
3.042AlaAsn: 3.042 ± 0.533
3.339AlaPro: 3.339 ± 0.37
4.897AlaGln: 4.897 ± 0.887
4.6AlaArg: 4.6 ± 0.57
4.971AlaSer: 4.971 ± 0.66
4.6AlaThr: 4.6 ± 0.737
7.048AlaVal: 7.048 ± 0.843
0.668AlaTrp: 0.668 ± 0.216
3.116AlaTyr: 3.116 ± 0.445
0.0AlaXaa: 0.0 ± 0.0
Cys
0.445CysAla: 0.445 ± 0.26
0.148CysCys: 0.148 ± 0.114
0.668CysAsp: 0.668 ± 0.292
0.668CysGlu: 0.668 ± 0.177
0.371CysPhe: 0.371 ± 0.193
1.187CysGly: 1.187 ± 0.343
0.223CysHis: 0.223 ± 0.114
0.594CysIle: 0.594 ± 0.198
0.371CysLys: 0.371 ± 0.185
1.039CysLeu: 1.039 ± 0.315
0.594CysMet: 0.594 ± 0.203
0.594CysAsn: 0.594 ± 0.28
0.519CysPro: 0.519 ± 0.208
0.371CysGln: 0.371 ± 0.144
0.594CysArg: 0.594 ± 0.223
0.89CysSer: 0.89 ± 0.215
0.668CysThr: 0.668 ± 0.243
0.742CysVal: 0.742 ± 0.263
0.074CysTrp: 0.074 ± 0.082
0.816CysTyr: 0.816 ± 0.306
0.0CysXaa: 0.0 ± 0.0
Asp
6.232AspAla: 6.232 ± 0.691
0.668AspCys: 0.668 ± 0.225
3.635AspAsp: 3.635 ± 0.417
3.784AspGlu: 3.784 ± 0.581
2.077AspPhe: 2.077 ± 0.498
5.119AspGly: 5.119 ± 0.784
0.668AspHis: 0.668 ± 0.26
2.968AspIle: 2.968 ± 0.47
3.635AspLys: 3.635 ± 0.475
4.971AspLeu: 4.971 ± 0.689
2.819AspMet: 2.819 ± 0.482
2.597AspAsn: 2.597 ± 0.477
2.151AspPro: 2.151 ± 0.479
1.335AspGln: 1.335 ± 0.429
2.893AspArg: 2.893 ± 0.545
4.303AspSer: 4.303 ± 0.51
4.822AspThr: 4.822 ± 0.565
5.045AspVal: 5.045 ± 0.679
1.039AspTrp: 1.039 ± 0.241
2.968AspTyr: 2.968 ± 0.421
0.0AspXaa: 0.0 ± 0.0
Glu
5.861GluAla: 5.861 ± 0.623
0.742GluCys: 0.742 ± 0.248
3.264GluAsp: 3.264 ± 0.524
3.858GluGlu: 3.858 ± 0.569
2.3GluPhe: 2.3 ± 0.472
4.303GluGly: 4.303 ± 0.612
1.632GluHis: 1.632 ± 0.387
3.19GluIle: 3.19 ± 0.508
3.413GluLys: 3.413 ± 0.541
6.158GluLeu: 6.158 ± 0.738
2.522GluMet: 2.522 ± 0.378
2.374GluAsn: 2.374 ± 0.397
2.226GluPro: 2.226 ± 0.585
3.635GluGln: 3.635 ± 0.696
3.413GluArg: 3.413 ± 0.405
3.264GluSer: 3.264 ± 0.574
2.597GluThr: 2.597 ± 0.406
4.748GluVal: 4.748 ± 0.613
0.816GluTrp: 0.816 ± 0.223
2.374GluTyr: 2.374 ± 0.425
0.0GluXaa: 0.0 ± 0.0
Phe
2.597PheAla: 2.597 ± 0.262
0.371PheCys: 0.371 ± 0.158
2.226PheAsp: 2.226 ± 0.402
1.558PheGlu: 1.558 ± 0.316
1.335PhePhe: 1.335 ± 0.355
2.745PheGly: 2.745 ± 0.414
0.668PheHis: 0.668 ± 0.21
2.448PheIle: 2.448 ± 0.484
2.151PheLys: 2.151 ± 0.321
2.671PheLeu: 2.671 ± 0.361
1.41PheMet: 1.41 ± 0.315
2.597PheAsn: 2.597 ± 0.461
0.816PhePro: 0.816 ± 0.266
1.706PheGln: 1.706 ± 0.343
2.077PheArg: 2.077 ± 0.391
1.855PheSer: 1.855 ± 0.386
2.3PheThr: 2.3 ± 0.387
1.855PheVal: 1.855 ± 0.322
0.371PheTrp: 0.371 ± 0.165
1.41PheTyr: 1.41 ± 0.39
0.0PheXaa: 0.0 ± 0.0
Gly
5.935GlyAla: 5.935 ± 0.844
1.41GlyCys: 1.41 ± 0.392
4.971GlyAsp: 4.971 ± 0.488
4.08GlyGlu: 4.08 ± 0.482
2.448GlyPhe: 2.448 ± 0.365
4.971GlyGly: 4.971 ± 0.809
0.964GlyHis: 0.964 ± 0.282
4.08GlyIle: 4.08 ± 0.506
6.009GlyLys: 6.009 ± 0.722
5.935GlyLeu: 5.935 ± 0.801
2.003GlyMet: 2.003 ± 0.538
3.339GlyAsn: 3.339 ± 0.443
1.855GlyPro: 1.855 ± 0.36
2.522GlyGln: 2.522 ± 0.361
4.08GlyArg: 4.08 ± 0.566
5.267GlySer: 5.267 ± 0.716
3.784GlyThr: 3.784 ± 0.572
4.748GlyVal: 4.748 ± 0.629
0.89GlyTrp: 0.89 ± 0.258
2.893GlyTyr: 2.893 ± 0.461
0.0GlyXaa: 0.0 ± 0.0
His
1.41HisAla: 1.41 ± 0.28
0.371HisCys: 0.371 ± 0.146
1.706HisAsp: 1.706 ± 0.425
1.261HisGlu: 1.261 ± 0.345
0.668HisPhe: 0.668 ± 0.247
1.929HisGly: 1.929 ± 0.527
0.519HisHis: 0.519 ± 0.255
1.261HisIle: 1.261 ± 0.322
1.187HisLys: 1.187 ± 0.31
1.41HisLeu: 1.41 ± 0.336
0.742HisMet: 0.742 ± 0.171
0.742HisAsn: 0.742 ± 0.226
0.742HisPro: 0.742 ± 0.276
0.742HisGln: 0.742 ± 0.19
1.632HisArg: 1.632 ± 0.264
1.039HisSer: 1.039 ± 0.275
1.261HisThr: 1.261 ± 0.335
0.89HisVal: 0.89 ± 0.232
0.223HisTrp: 0.223 ± 0.122
0.594HisTyr: 0.594 ± 0.2
0.0HisXaa: 0.0 ± 0.0
Ile
3.709IleAla: 3.709 ± 0.401
0.297IleCys: 0.297 ± 0.133
3.264IleAsp: 3.264 ± 0.308
2.3IleGlu: 2.3 ± 0.297
1.484IlePhe: 1.484 ± 0.374
3.784IleGly: 3.784 ± 0.639
1.261IleHis: 1.261 ± 0.28
2.374IleIle: 2.374 ± 0.563
4.08IleLys: 4.08 ± 0.627
3.042IleLeu: 3.042 ± 0.432
1.484IleMet: 1.484 ± 0.289
2.374IleAsn: 2.374 ± 0.324
1.929IlePro: 1.929 ± 0.401
2.003IleGln: 2.003 ± 0.388
3.487IleArg: 3.487 ± 0.451
3.487IleSer: 3.487 ± 0.577
3.264IleThr: 3.264 ± 0.523
2.968IleVal: 2.968 ± 0.483
0.519IleTrp: 0.519 ± 0.249
1.855IleTyr: 1.855 ± 0.353
0.0IleXaa: 0.0 ± 0.0
Lys
6.751LysAla: 6.751 ± 0.924
0.519LysCys: 0.519 ± 0.212
4.526LysAsp: 4.526 ± 0.381
4.377LysGlu: 4.377 ± 0.586
2.448LysPhe: 2.448 ± 0.445
4.08LysGly: 4.08 ± 0.609
1.335LysHis: 1.335 ± 0.317
2.3LysIle: 2.3 ± 0.432
3.487LysLys: 3.487 ± 0.552
6.084LysLeu: 6.084 ± 0.749
2.226LysMet: 2.226 ± 0.449
2.151LysAsn: 2.151 ± 0.497
3.635LysPro: 3.635 ± 0.557
2.151LysGln: 2.151 ± 0.402
3.19LysArg: 3.19 ± 0.519
3.561LysSer: 3.561 ± 0.441
2.968LysThr: 2.968 ± 0.375
4.451LysVal: 4.451 ± 0.625
0.816LysTrp: 0.816 ± 0.19
1.929LysTyr: 1.929 ± 0.432
0.0LysXaa: 0.0 ± 0.0
Leu
9.199LeuAla: 9.199 ± 1.049
1.335LeuCys: 1.335 ± 0.354
6.677LeuAsp: 6.677 ± 0.766
4.971LeuGlu: 4.971 ± 0.645
2.374LeuPhe: 2.374 ± 0.478
5.861LeuGly: 5.861 ± 0.659
2.151LeuHis: 2.151 ± 0.425
3.784LeuIle: 3.784 ± 0.63
4.897LeuLys: 4.897 ± 0.78
5.713LeuLeu: 5.713 ± 0.682
2.226LeuMet: 2.226 ± 0.417
3.858LeuAsn: 3.858 ± 0.398
3.19LeuPro: 3.19 ± 0.407
3.413LeuGln: 3.413 ± 0.488
4.229LeuArg: 4.229 ± 0.645
5.564LeuSer: 5.564 ± 0.497
4.674LeuThr: 4.674 ± 0.725
4.897LeuVal: 4.897 ± 0.829
1.41LeuTrp: 1.41 ± 0.322
2.522LeuTyr: 2.522 ± 0.421
0.074LeuXaa: 0.074 ± 0.082
Met
3.339MetAla: 3.339 ± 0.47
0.297MetCys: 0.297 ± 0.161
1.113MetAsp: 1.113 ± 0.298
1.484MetGlu: 1.484 ± 0.237
0.816MetPhe: 0.816 ± 0.269
2.597MetGly: 2.597 ± 0.586
0.594MetHis: 0.594 ± 0.18
1.781MetIle: 1.781 ± 0.324
1.781MetLys: 1.781 ± 0.354
2.819MetLeu: 2.819 ± 0.442
1.039MetMet: 1.039 ± 0.444
1.855MetAsn: 1.855 ± 0.368
1.41MetPro: 1.41 ± 0.359
1.781MetGln: 1.781 ± 0.444
1.558MetArg: 1.558 ± 0.293
2.745MetSer: 2.745 ± 0.541
1.632MetThr: 1.632 ± 0.394
1.929MetVal: 1.929 ± 0.367
0.297MetTrp: 0.297 ± 0.133
0.519MetTyr: 0.519 ± 0.175
0.0MetXaa: 0.0 ± 0.0
Asn
3.635AsnAla: 3.635 ± 0.589
0.445AsnCys: 0.445 ± 0.156
2.151AsnAsp: 2.151 ± 0.449
1.781AsnGlu: 1.781 ± 0.389
2.077AsnPhe: 2.077 ± 0.372
3.932AsnGly: 3.932 ± 0.546
1.113AsnHis: 1.113 ± 0.216
2.448AsnIle: 2.448 ± 0.463
3.264AsnLys: 3.264 ± 0.592
2.745AsnLeu: 2.745 ± 0.543
1.855AsnMet: 1.855 ± 0.392
1.706AsnAsn: 1.706 ± 0.36
1.781AsnPro: 1.781 ± 0.314
1.929AsnGln: 1.929 ± 0.51
2.745AsnArg: 2.745 ± 0.584
2.448AsnSer: 2.448 ± 0.394
2.448AsnThr: 2.448 ± 0.358
2.597AsnVal: 2.597 ± 0.545
0.668AsnTrp: 0.668 ± 0.193
1.261AsnTyr: 1.261 ± 0.293
0.0AsnXaa: 0.0 ± 0.0
Pro
3.487ProAla: 3.487 ± 0.68
0.519ProCys: 0.519 ± 0.19
2.151ProAsp: 2.151 ± 0.308
3.709ProGlu: 3.709 ± 0.528
1.558ProPhe: 1.558 ± 0.349
2.151ProGly: 2.151 ± 0.437
0.223ProHis: 0.223 ± 0.145
1.41ProIle: 1.41 ± 0.348
2.077ProLys: 2.077 ± 0.388
3.19ProLeu: 3.19 ± 0.357
0.742ProMet: 0.742 ± 0.226
2.671ProAsn: 2.671 ± 0.35
1.113ProPro: 1.113 ± 0.331
1.484ProGln: 1.484 ± 0.274
1.039ProArg: 1.039 ± 0.255
2.077ProSer: 2.077 ± 0.357
2.3ProThr: 2.3 ± 0.386
3.561ProVal: 3.561 ± 0.683
0.297ProTrp: 0.297 ± 0.11
1.706ProTyr: 1.706 ± 0.379
0.0ProXaa: 0.0 ± 0.0
Gln
5.267GlnAla: 5.267 ± 0.653
0.297GlnCys: 0.297 ± 0.141
2.448GlnAsp: 2.448 ± 0.404
2.226GlnGlu: 2.226 ± 0.483
1.558GlnPhe: 1.558 ± 0.361
2.671GlnGly: 2.671 ± 0.442
1.558GlnHis: 1.558 ± 0.32
1.706GlnIle: 1.706 ± 0.361
2.226GlnLys: 2.226 ± 0.433
3.561GlnLeu: 3.561 ± 0.606
1.929GlnMet: 1.929 ± 0.369
1.558GlnAsn: 1.558 ± 0.446
0.89GlnPro: 0.89 ± 0.285
2.151GlnGln: 2.151 ± 0.399
2.077GlnArg: 2.077 ± 0.321
2.003GlnSer: 2.003 ± 0.314
2.151GlnThr: 2.151 ± 0.676
2.745GlnVal: 2.745 ± 0.311
0.89GlnTrp: 0.89 ± 0.292
2.151GlnTyr: 2.151 ± 0.399
0.0GlnXaa: 0.0 ± 0.0
Arg
5.119ArgAla: 5.119 ± 0.729
0.297ArgCys: 0.297 ± 0.164
3.042ArgAsp: 3.042 ± 0.461
4.006ArgGlu: 4.006 ± 0.5
2.522ArgPhe: 2.522 ± 0.334
3.116ArgGly: 3.116 ± 0.528
0.89ArgHis: 0.89 ± 0.217
3.116ArgIle: 3.116 ± 0.568
2.448ArgLys: 2.448 ± 0.388
5.564ArgLeu: 5.564 ± 0.572
1.335ArgMet: 1.335 ± 0.323
2.151ArgAsn: 2.151 ± 0.294
2.077ArgPro: 2.077 ± 0.387
2.077ArgGln: 2.077 ± 0.424
2.819ArgArg: 2.819 ± 0.473
2.671ArgSer: 2.671 ± 0.5
3.635ArgThr: 3.635 ± 0.41
3.709ArgVal: 3.709 ± 0.582
1.261ArgTrp: 1.261 ± 0.314
1.781ArgTyr: 1.781 ± 0.275
0.0ArgXaa: 0.0 ± 0.0
Ser
5.342SerAla: 5.342 ± 0.611
0.519SerCys: 0.519 ± 0.204
4.377SerAsp: 4.377 ± 0.418
4.6SerGlu: 4.6 ± 0.557
1.855SerPhe: 1.855 ± 0.409
4.303SerGly: 4.303 ± 0.49
1.187SerHis: 1.187 ± 0.231
2.597SerIle: 2.597 ± 0.429
3.709SerLys: 3.709 ± 0.485
4.897SerLeu: 4.897 ± 0.779
1.855SerMet: 1.855 ± 0.388
2.893SerAsn: 2.893 ± 0.529
1.855SerPro: 1.855 ± 0.36
2.968SerGln: 2.968 ± 0.581
3.487SerArg: 3.487 ± 0.529
2.671SerSer: 2.671 ± 0.489
4.155SerThr: 4.155 ± 0.52
4.674SerVal: 4.674 ± 0.669
0.816SerTrp: 0.816 ± 0.236
1.929SerTyr: 1.929 ± 0.328
0.0SerXaa: 0.0 ± 0.0
Thr
6.232ThrAla: 6.232 ± 0.894
0.519ThrCys: 0.519 ± 0.209
3.561ThrAsp: 3.561 ± 0.462
3.784ThrGlu: 3.784 ± 0.672
1.855ThrPhe: 1.855 ± 0.475
5.193ThrGly: 5.193 ± 0.627
1.113ThrHis: 1.113 ± 0.233
2.819ThrIle: 2.819 ± 0.5
4.08ThrLys: 4.08 ± 0.497
5.045ThrLeu: 5.045 ± 0.786
1.187ThrMet: 1.187 ± 0.353
1.706ThrAsn: 1.706 ± 0.39
2.893ThrPro: 2.893 ± 0.544
1.41ThrGln: 1.41 ± 0.378
3.264ThrArg: 3.264 ± 0.461
3.709ThrSer: 3.709 ± 0.641
2.968ThrThr: 2.968 ± 0.388
3.932ThrVal: 3.932 ± 0.517
0.594ThrTrp: 0.594 ± 0.209
1.484ThrTyr: 1.484 ± 0.335
0.0ThrXaa: 0.0 ± 0.0
Val
6.009ValAla: 6.009 ± 0.8
0.964ValCys: 0.964 ± 0.31
4.006ValAsp: 4.006 ± 0.494
4.155ValGlu: 4.155 ± 0.579
2.819ValPhe: 2.819 ± 0.457
4.303ValGly: 4.303 ± 0.602
1.484ValHis: 1.484 ± 0.363
3.264ValIle: 3.264 ± 0.486
3.635ValLys: 3.635 ± 0.608
4.971ValLeu: 4.971 ± 0.498
2.226ValMet: 2.226 ± 0.357
2.374ValAsn: 2.374 ± 0.378
3.264ValPro: 3.264 ± 0.505
3.561ValGln: 3.561 ± 0.428
3.339ValArg: 3.339 ± 0.539
5.119ValSer: 5.119 ± 0.72
4.674ValThr: 4.674 ± 0.767
5.416ValVal: 5.416 ± 0.685
0.742ValTrp: 0.742 ± 0.271
2.819ValTyr: 2.819 ± 0.488
0.0ValXaa: 0.0 ± 0.0
Trp
0.89TrpAla: 0.89 ± 0.257
0.223TrpCys: 0.223 ± 0.141
0.964TrpAsp: 0.964 ± 0.3
0.89TrpGlu: 0.89 ± 0.307
1.039TrpPhe: 1.039 ± 0.279
0.519TrpGly: 0.519 ± 0.274
0.371TrpHis: 0.371 ± 0.156
0.445TrpIle: 0.445 ± 0.199
0.594TrpLys: 0.594 ± 0.167
2.077TrpLeu: 2.077 ± 0.421
0.297TrpMet: 0.297 ± 0.135
0.297TrpAsn: 0.297 ± 0.185
0.594TrpPro: 0.594 ± 0.202
0.445TrpGln: 0.445 ± 0.189
0.816TrpArg: 0.816 ± 0.215
0.519TrpSer: 0.519 ± 0.131
0.668TrpThr: 0.668 ± 0.226
0.742TrpVal: 0.742 ± 0.244
0.742TrpTrp: 0.742 ± 0.186
0.371TrpTyr: 0.371 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.264TyrAla: 3.264 ± 0.509
0.594TyrCys: 0.594 ± 0.182
2.597TyrAsp: 2.597 ± 0.488
2.226TyrGlu: 2.226 ± 0.524
1.039TyrPhe: 1.039 ± 0.233
2.226TyrGly: 2.226 ± 0.409
0.074TyrHis: 0.074 ± 0.06
2.448TyrIle: 2.448 ± 0.541
2.522TyrLys: 2.522 ± 0.367
2.522TyrLeu: 2.522 ± 0.523
0.668TyrMet: 0.668 ± 0.184
2.151TyrAsn: 2.151 ± 0.339
1.187TyrPro: 1.187 ± 0.271
1.484TyrGln: 1.484 ± 0.337
2.374TyrArg: 2.374 ± 0.477
2.522TyrSer: 2.522 ± 0.409
1.781TyrThr: 1.781 ± 0.377
2.448TyrVal: 2.448 ± 0.445
0.371TyrTrp: 0.371 ± 0.163
0.964TyrTyr: 0.964 ± 0.284
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.074XaaAsp: 0.074 ± 0.082
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (13480 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski