Amino acid dipepetide frequency for Enterobacteria phage P88

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.719AlaAla: 9.719 ± 1.154
1.009AlaCys: 1.009 ± 0.262
6.693AlaAsp: 6.693 ± 0.754
7.977AlaGlu: 7.977 ± 0.868
3.117AlaPhe: 3.117 ± 0.6
6.968AlaGly: 6.968 ± 1.317
1.65AlaHis: 1.65 ± 0.37
5.959AlaIle: 5.959 ± 0.823
2.934AlaLys: 2.934 ± 0.506
9.352AlaLeu: 9.352 ± 1.365
3.209AlaMet: 3.209 ± 0.521
3.392AlaAsn: 3.392 ± 0.637
2.659AlaPro: 2.659 ± 0.421
4.309AlaGln: 4.309 ± 0.667
6.968AlaArg: 6.968 ± 1.035
5.226AlaSer: 5.226 ± 0.521
6.418AlaThr: 6.418 ± 1.098
7.151AlaVal: 7.151 ± 0.961
1.834AlaTrp: 1.834 ± 0.428
2.934AlaTyr: 2.934 ± 0.556
0.0AlaXaa: 0.0 ± 0.0
Cys
1.375CysAla: 1.375 ± 0.389
0.0CysCys: 0.0 ± 0.0
0.642CysAsp: 0.642 ± 0.277
0.55CysGlu: 0.55 ± 0.222
0.367CysPhe: 0.367 ± 0.187
1.1CysGly: 1.1 ± 0.293
0.183CysHis: 0.183 ± 0.121
0.458CysIle: 0.458 ± 0.205
0.275CysLys: 0.275 ± 0.172
0.367CysLeu: 0.367 ± 0.144
0.458CysMet: 0.458 ± 0.231
0.55CysAsn: 0.55 ± 0.332
0.733CysPro: 0.733 ± 0.262
0.275CysGln: 0.275 ± 0.133
0.367CysArg: 0.367 ± 0.189
0.275CysSer: 0.275 ± 0.146
0.275CysThr: 0.275 ± 0.128
0.55CysVal: 0.55 ± 0.235
0.183CysTrp: 0.183 ± 0.128
0.183CysTyr: 0.183 ± 0.128
0.0CysXaa: 0.0 ± 0.0
Asp
7.701AspAla: 7.701 ± 0.876
0.183AspCys: 0.183 ± 0.131
2.842AspAsp: 2.842 ± 0.463
4.309AspGlu: 4.309 ± 0.52
1.742AspPhe: 1.742 ± 0.395
5.501AspGly: 5.501 ± 0.689
0.825AspHis: 0.825 ± 0.234
3.484AspIle: 3.484 ± 0.551
2.292AspLys: 2.292 ± 0.466
4.951AspLeu: 4.951 ± 0.444
1.467AspMet: 1.467 ± 0.38
2.2AspAsn: 2.2 ± 0.434
1.1AspPro: 1.1 ± 0.339
1.284AspGln: 1.284 ± 0.282
3.667AspArg: 3.667 ± 0.625
4.217AspSer: 4.217 ± 0.615
4.217AspThr: 4.217 ± 0.764
3.576AspVal: 3.576 ± 0.554
1.375AspTrp: 1.375 ± 0.371
1.009AspTyr: 1.009 ± 0.304
0.0AspXaa: 0.0 ± 0.0
Glu
6.143GluAla: 6.143 ± 0.803
0.825GluCys: 0.825 ± 0.29
2.384GluAsp: 2.384 ± 0.502
2.384GluGlu: 2.384 ± 0.542
1.559GluPhe: 1.559 ± 0.531
3.209GluGly: 3.209 ± 0.619
1.65GluHis: 1.65 ± 0.294
4.309GluIle: 4.309 ± 0.553
3.209GluLys: 3.209 ± 0.598
5.776GluLeu: 5.776 ± 0.641
2.842GluMet: 2.842 ± 0.566
2.567GluAsn: 2.567 ± 0.472
2.842GluPro: 2.842 ± 0.559
3.484GluGln: 3.484 ± 0.589
6.051GluArg: 6.051 ± 1.046
3.942GluSer: 3.942 ± 0.684
4.309GluThr: 4.309 ± 0.719
3.392GluVal: 3.392 ± 0.675
1.375GluTrp: 1.375 ± 0.364
1.742GluTyr: 1.742 ± 0.391
0.0GluXaa: 0.0 ± 0.0
Phe
2.292PheAla: 2.292 ± 0.365
0.642PheCys: 0.642 ± 0.25
1.559PheAsp: 1.559 ± 0.318
1.834PheGlu: 1.834 ± 0.382
1.375PhePhe: 1.375 ± 0.334
2.109PheGly: 2.109 ± 0.526
0.275PheHis: 0.275 ± 0.164
1.559PheIle: 1.559 ± 0.369
2.017PheLys: 2.017 ± 0.439
2.2PheLeu: 2.2 ± 0.417
1.009PheMet: 1.009 ± 0.253
2.017PheAsn: 2.017 ± 0.506
0.917PhePro: 0.917 ± 0.33
1.375PheGln: 1.375 ± 0.442
2.567PheArg: 2.567 ± 0.454
2.475PheSer: 2.475 ± 0.431
3.392PheThr: 3.392 ± 0.451
1.559PheVal: 1.559 ± 0.433
0.55PheTrp: 0.55 ± 0.227
0.917PheTyr: 0.917 ± 0.311
0.0PheXaa: 0.0 ± 0.0
Gly
6.051GlyAla: 6.051 ± 0.713
1.1GlyCys: 1.1 ± 0.291
4.584GlyAsp: 4.584 ± 0.659
5.776GlyGlu: 5.776 ± 0.799
2.475GlyPhe: 2.475 ± 0.518
6.326GlyGly: 6.326 ± 0.907
0.825GlyHis: 0.825 ± 0.284
3.942GlyIle: 3.942 ± 0.558
5.043GlyLys: 5.043 ± 0.769
3.942GlyLeu: 3.942 ± 0.609
1.65GlyMet: 1.65 ± 0.429
3.851GlyAsn: 3.851 ± 0.671
1.834GlyPro: 1.834 ± 0.373
2.017GlyGln: 2.017 ± 0.378
4.309GlyArg: 4.309 ± 0.648
3.209GlySer: 3.209 ± 0.829
4.951GlyThr: 4.951 ± 0.687
4.768GlyVal: 4.768 ± 0.761
1.467GlyTrp: 1.467 ± 0.407
1.375GlyTyr: 1.375 ± 0.335
0.0GlyXaa: 0.0 ± 0.0
His
2.384HisAla: 2.384 ± 0.552
0.0HisCys: 0.0 ± 0.0
1.009HisAsp: 1.009 ± 0.289
1.559HisGlu: 1.559 ± 0.388
0.55HisPhe: 0.55 ± 0.261
1.559HisGly: 1.559 ± 0.329
0.642HisHis: 0.642 ± 0.254
1.375HisIle: 1.375 ± 0.361
0.733HisLys: 0.733 ± 0.296
1.65HisLeu: 1.65 ± 0.366
0.458HisMet: 0.458 ± 0.189
0.55HisAsn: 0.55 ± 0.192
0.917HisPro: 0.917 ± 0.283
1.009HisGln: 1.009 ± 0.247
0.917HisArg: 0.917 ± 0.344
0.55HisSer: 0.55 ± 0.239
0.733HisThr: 0.733 ± 0.244
1.192HisVal: 1.192 ± 0.385
0.275HisTrp: 0.275 ± 0.147
0.367HisTyr: 0.367 ± 0.17
0.0HisXaa: 0.0 ± 0.0
Ile
5.684IleAla: 5.684 ± 0.844
0.733IleCys: 0.733 ± 0.257
3.392IleAsp: 3.392 ± 0.621
2.751IleGlu: 2.751 ± 0.547
1.925IlePhe: 1.925 ± 0.371
4.768IleGly: 4.768 ± 0.649
0.733IleHis: 0.733 ± 0.264
3.209IleIle: 3.209 ± 0.631
2.292IleLys: 2.292 ± 0.708
2.842IleLeu: 2.842 ± 0.479
1.009IleMet: 1.009 ± 0.278
3.484IleAsn: 3.484 ± 0.552
1.65IlePro: 1.65 ± 0.35
2.567IleGln: 2.567 ± 0.482
4.676IleArg: 4.676 ± 0.705
3.667IleSer: 3.667 ± 0.585
4.401IleThr: 4.401 ± 0.615
2.659IleVal: 2.659 ± 0.383
0.55IleTrp: 0.55 ± 0.212
1.834IleTyr: 1.834 ± 0.418
0.0IleXaa: 0.0 ± 0.0
Lys
5.409LysAla: 5.409 ± 0.789
0.367LysCys: 0.367 ± 0.165
2.567LysAsp: 2.567 ± 0.477
3.301LysGlu: 3.301 ± 0.562
1.192LysPhe: 1.192 ± 0.405
2.384LysGly: 2.384 ± 0.448
0.55LysHis: 0.55 ± 0.184
2.292LysIle: 2.292 ± 0.35
4.034LysLys: 4.034 ± 0.692
4.768LysLeu: 4.768 ± 0.729
1.375LysMet: 1.375 ± 0.368
3.301LysAsn: 3.301 ± 0.663
2.2LysPro: 2.2 ± 0.366
3.301LysGln: 3.301 ± 0.637
2.475LysArg: 2.475 ± 0.462
3.026LysSer: 3.026 ± 0.455
3.667LysThr: 3.667 ± 0.512
2.475LysVal: 2.475 ± 0.555
0.825LysTrp: 0.825 ± 0.298
1.467LysTyr: 1.467 ± 0.411
0.0LysXaa: 0.0 ± 0.0
Leu
8.893LeuAla: 8.893 ± 1.074
1.375LeuCys: 1.375 ± 0.408
4.217LeuAsp: 4.217 ± 0.574
3.942LeuGlu: 3.942 ± 0.599
1.65LeuPhe: 1.65 ± 0.428
4.217LeuGly: 4.217 ± 0.793
1.65LeuHis: 1.65 ± 0.434
4.493LeuIle: 4.493 ± 0.633
4.859LeuLys: 4.859 ± 0.526
6.785LeuLeu: 6.785 ± 0.811
2.017LeuMet: 2.017 ± 0.438
3.759LeuAsn: 3.759 ± 0.536
3.851LeuPro: 3.851 ± 0.536
3.026LeuGln: 3.026 ± 0.547
7.977LeuArg: 7.977 ± 0.782
6.968LeuSer: 6.968 ± 0.824
6.785LeuThr: 6.785 ± 0.897
4.217LeuVal: 4.217 ± 0.765
1.284LeuTrp: 1.284 ± 0.363
2.842LeuTyr: 2.842 ± 0.469
0.0LeuXaa: 0.0 ± 0.0
Met
4.034MetAla: 4.034 ± 0.624
0.275MetCys: 0.275 ± 0.149
1.284MetAsp: 1.284 ± 0.271
1.467MetGlu: 1.467 ± 0.413
0.825MetPhe: 0.825 ± 0.287
1.467MetGly: 1.467 ± 0.418
0.458MetHis: 0.458 ± 0.22
1.467MetIle: 1.467 ± 0.404
1.742MetLys: 1.742 ± 0.318
2.292MetLeu: 2.292 ± 0.392
0.825MetMet: 0.825 ± 0.308
1.284MetAsn: 1.284 ± 0.422
1.192MetPro: 1.192 ± 0.434
1.009MetGln: 1.009 ± 0.295
1.742MetArg: 1.742 ± 0.356
2.567MetSer: 2.567 ± 0.559
1.834MetThr: 1.834 ± 0.427
1.009MetVal: 1.009 ± 0.237
0.275MetTrp: 0.275 ± 0.193
0.642MetTyr: 0.642 ± 0.254
0.0MetXaa: 0.0 ± 0.0
Asn
3.301AsnAla: 3.301 ± 0.758
0.092AsnCys: 0.092 ± 0.087
1.834AsnAsp: 1.834 ± 0.465
3.117AsnGlu: 3.117 ± 0.476
1.1AsnPhe: 1.1 ± 0.402
4.034AsnGly: 4.034 ± 0.65
0.642AsnHis: 0.642 ± 0.244
2.842AsnIle: 2.842 ± 0.656
2.2AsnLys: 2.2 ± 0.409
2.109AsnLeu: 2.109 ± 0.388
1.009AsnMet: 1.009 ± 0.28
1.65AsnAsn: 1.65 ± 0.458
3.392AsnPro: 3.392 ± 0.605
1.467AsnGln: 1.467 ± 0.406
2.934AsnArg: 2.934 ± 0.515
2.109AsnSer: 2.109 ± 0.612
3.484AsnThr: 3.484 ± 0.482
3.026AsnVal: 3.026 ± 0.51
0.825AsnTrp: 0.825 ± 0.262
0.642AsnTyr: 0.642 ± 0.257
0.0AsnXaa: 0.0 ± 0.0
Pro
5.134ProAla: 5.134 ± 0.861
0.092ProCys: 0.092 ± 0.084
4.217ProAsp: 4.217 ± 0.579
3.667ProGlu: 3.667 ± 0.695
1.192ProPhe: 1.192 ± 0.353
2.842ProGly: 2.842 ± 0.541
0.642ProHis: 0.642 ± 0.277
0.825ProIle: 0.825 ± 0.214
1.65ProLys: 1.65 ± 0.439
3.576ProLeu: 3.576 ± 0.498
0.917ProMet: 0.917 ± 0.284
0.917ProAsn: 0.917 ± 0.271
1.467ProPro: 1.467 ± 0.363
2.109ProGln: 2.109 ± 0.429
1.65ProArg: 1.65 ± 0.432
2.017ProSer: 2.017 ± 0.45
1.834ProThr: 1.834 ± 0.498
3.851ProVal: 3.851 ± 0.693
0.275ProTrp: 0.275 ± 0.199
1.009ProTyr: 1.009 ± 0.292
0.0ProXaa: 0.0 ± 0.0
Gln
4.951GlnAla: 4.951 ± 0.806
0.275GlnCys: 0.275 ± 0.168
2.751GlnAsp: 2.751 ± 0.459
2.384GlnGlu: 2.384 ± 0.483
1.65GlnPhe: 1.65 ± 0.381
2.384GlnGly: 2.384 ± 0.348
0.825GlnHis: 0.825 ± 0.278
2.109GlnIle: 2.109 ± 0.606
2.567GlnLys: 2.567 ± 0.507
4.768GlnLeu: 4.768 ± 0.608
1.009GlnMet: 1.009 ± 0.297
1.559GlnAsn: 1.559 ± 0.287
2.2GlnPro: 2.2 ± 0.389
3.301GlnGln: 3.301 ± 0.601
3.301GlnArg: 3.301 ± 0.615
1.65GlnSer: 1.65 ± 0.373
2.017GlnThr: 2.017 ± 0.429
2.475GlnVal: 2.475 ± 0.485
0.275GlnTrp: 0.275 ± 0.135
1.009GlnTyr: 1.009 ± 0.297
0.0GlnXaa: 0.0 ± 0.0
Arg
6.968ArgAla: 6.968 ± 0.964
0.458ArgCys: 0.458 ± 0.199
3.209ArgAsp: 3.209 ± 0.739
5.776ArgGlu: 5.776 ± 0.711
3.392ArgPhe: 3.392 ± 0.618
3.209ArgGly: 3.209 ± 0.575
2.475ArgHis: 2.475 ± 0.447
3.851ArgIle: 3.851 ± 0.648
4.951ArgLys: 4.951 ± 0.869
7.151ArgLeu: 7.151 ± 0.59
1.284ArgMet: 1.284 ± 0.354
3.026ArgAsn: 3.026 ± 0.611
2.384ArgPro: 2.384 ± 0.33
3.301ArgGln: 3.301 ± 0.64
5.226ArgArg: 5.226 ± 0.648
3.667ArgSer: 3.667 ± 0.491
3.942ArgThr: 3.942 ± 0.633
5.593ArgVal: 5.593 ± 0.569
0.917ArgTrp: 0.917 ± 0.247
2.2ArgTyr: 2.2 ± 0.399
0.0ArgXaa: 0.0 ± 0.0
Ser
5.959SerAla: 5.959 ± 0.774
0.275SerCys: 0.275 ± 0.149
3.117SerAsp: 3.117 ± 0.42
4.126SerGlu: 4.126 ± 0.601
2.475SerPhe: 2.475 ± 0.473
5.593SerGly: 5.593 ± 0.826
1.1SerHis: 1.1 ± 0.307
2.475SerIle: 2.475 ± 0.506
1.742SerLys: 1.742 ± 0.49
6.601SerLeu: 6.601 ± 0.728
1.192SerMet: 1.192 ± 0.295
1.192SerAsn: 1.192 ± 0.319
2.475SerPro: 2.475 ± 0.488
2.017SerGln: 2.017 ± 0.398
4.951SerArg: 4.951 ± 0.748
4.859SerSer: 4.859 ± 0.793
3.851SerThr: 3.851 ± 0.915
3.667SerVal: 3.667 ± 0.543
1.467SerTrp: 1.467 ± 0.392
1.375SerTyr: 1.375 ± 0.389
0.0SerXaa: 0.0 ± 0.0
Thr
5.409ThrAla: 5.409 ± 1.014
0.367ThrCys: 0.367 ± 0.218
4.676ThrAsp: 4.676 ± 0.612
3.942ThrGlu: 3.942 ± 0.732
2.109ThrPhe: 2.109 ± 0.316
6.418ThrGly: 6.418 ± 0.759
1.009ThrHis: 1.009 ± 0.328
3.026ThrIle: 3.026 ± 0.487
2.842ThrLys: 2.842 ± 0.678
6.876ThrLeu: 6.876 ± 0.689
1.834ThrMet: 1.834 ± 0.478
1.925ThrAsn: 1.925 ± 0.656
3.851ThrPro: 3.851 ± 0.458
2.934ThrGln: 2.934 ± 0.478
5.226ThrArg: 5.226 ± 0.714
3.851ThrSer: 3.851 ± 0.588
4.309ThrThr: 4.309 ± 0.851
4.859ThrVal: 4.859 ± 0.902
0.733ThrTrp: 0.733 ± 0.247
1.65ThrTyr: 1.65 ± 0.414
0.0ThrXaa: 0.0 ± 0.0
Val
4.676ValAla: 4.676 ± 0.662
0.55ValCys: 0.55 ± 0.233
4.401ValAsp: 4.401 ± 0.447
3.026ValGlu: 3.026 ± 0.409
2.109ValPhe: 2.109 ± 0.428
2.659ValGly: 2.659 ± 0.644
0.917ValHis: 0.917 ± 0.231
4.584ValIle: 4.584 ± 0.852
3.667ValLys: 3.667 ± 0.674
4.859ValLeu: 4.859 ± 0.85
2.475ValMet: 2.475 ± 0.55
2.842ValAsn: 2.842 ± 0.483
2.2ValPro: 2.2 ± 0.508
2.384ValGln: 2.384 ± 0.584
4.584ValArg: 4.584 ± 0.624
4.217ValSer: 4.217 ± 0.554
5.226ValThr: 5.226 ± 0.619
4.034ValVal: 4.034 ± 0.691
0.642ValTrp: 0.642 ± 0.187
1.65ValTyr: 1.65 ± 0.353
0.0ValXaa: 0.0 ± 0.0
Trp
0.917TrpAla: 0.917 ± 0.26
0.092TrpCys: 0.092 ± 0.087
0.642TrpAsp: 0.642 ± 0.209
0.825TrpGlu: 0.825 ± 0.307
0.642TrpPhe: 0.642 ± 0.213
0.642TrpGly: 0.642 ± 0.227
0.825TrpHis: 0.825 ± 0.213
0.733TrpIle: 0.733 ± 0.294
1.192TrpLys: 1.192 ± 0.457
1.925TrpLeu: 1.925 ± 0.343
0.458TrpMet: 0.458 ± 0.247
0.917TrpAsn: 0.917 ± 0.337
0.733TrpPro: 0.733 ± 0.273
0.917TrpGln: 0.917 ± 0.29
1.284TrpArg: 1.284 ± 0.351
1.1TrpSer: 1.1 ± 0.313
0.55TrpThr: 0.55 ± 0.227
0.367TrpVal: 0.367 ± 0.152
0.367TrpTrp: 0.367 ± 0.154
0.917TrpTyr: 0.917 ± 0.302
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.567TyrAla: 2.567 ± 0.447
0.458TyrCys: 0.458 ± 0.224
1.925TyrAsp: 1.925 ± 0.356
1.284TyrGlu: 1.284 ± 0.375
1.192TyrPhe: 1.192 ± 0.377
2.109TyrGly: 2.109 ± 0.461
0.55TyrHis: 0.55 ± 0.298
1.834TyrIle: 1.834 ± 0.447
0.733TyrLys: 0.733 ± 0.275
2.017TyrLeu: 2.017 ± 0.626
1.1TyrMet: 1.1 ± 0.358
0.825TyrAsn: 0.825 ± 0.291
1.284TyrPro: 1.284 ± 0.343
1.375TyrGln: 1.375 ± 0.38
2.109TyrArg: 2.109 ± 0.458
0.917TyrSer: 0.917 ± 0.341
1.65TyrThr: 1.65 ± 0.426
1.284TyrVal: 1.284 ± 0.265
0.55TyrTrp: 0.55 ± 0.224
0.55TyrTyr: 0.55 ± 0.263
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (10908 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski