Amino acid dipepetide frequency for Streptococcus phage Javan179

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.159AlaAla: 4.159 ± 0.881
0.582AlaCys: 0.582 ± 0.231
4.242AlaAsp: 4.242 ± 0.561
5.906AlaGlu: 5.906 ± 0.684
1.83AlaPhe: 1.83 ± 0.288
3.826AlaGly: 3.826 ± 0.69
0.832AlaHis: 0.832 ± 0.257
6.405AlaIle: 6.405 ± 0.528
6.405AlaLys: 6.405 ± 0.813
5.739AlaLeu: 5.739 ± 0.713
2.08AlaMet: 2.08 ± 0.613
4.076AlaAsn: 4.076 ± 0.572
1.497AlaPro: 1.497 ± 0.367
3.078AlaGln: 3.078 ± 0.464
2.995AlaArg: 2.995 ± 0.517
5.906AlaSer: 5.906 ± 0.924
3.909AlaThr: 3.909 ± 0.711
5.074AlaVal: 5.074 ± 0.644
0.665AlaTrp: 0.665 ± 0.224
1.996AlaTyr: 1.996 ± 0.42
0.0AlaXaa: 0.0 ± 0.0
Cys
0.25CysAla: 0.25 ± 0.134
0.0CysCys: 0.0 ± 0.0
0.333CysAsp: 0.333 ± 0.153
0.333CysGlu: 0.333 ± 0.156
0.499CysPhe: 0.499 ± 0.221
0.333CysGly: 0.333 ± 0.151
0.166CysHis: 0.166 ± 0.131
0.333CysIle: 0.333 ± 0.185
0.25CysLys: 0.25 ± 0.159
0.499CysLeu: 0.499 ± 0.243
0.25CysMet: 0.25 ± 0.167
0.25CysAsn: 0.25 ± 0.14
0.166CysPro: 0.166 ± 0.17
0.416CysGln: 0.416 ± 0.231
0.0CysArg: 0.0 ± 0.0
0.083CysSer: 0.083 ± 0.094
0.166CysThr: 0.166 ± 0.123
0.582CysVal: 0.582 ± 0.215
0.25CysTrp: 0.25 ± 0.148
0.416CysTyr: 0.416 ± 0.202
0.0CysXaa: 0.0 ± 0.0
Asp
2.745AspAla: 2.745 ± 0.527
0.25AspCys: 0.25 ± 0.156
5.157AspAsp: 5.157 ± 0.754
5.157AspGlu: 5.157 ± 0.671
3.327AspPhe: 3.327 ± 0.482
4.492AspGly: 4.492 ± 0.786
0.915AspHis: 0.915 ± 0.276
5.074AspIle: 5.074 ± 0.78
5.906AspLys: 5.906 ± 0.642
7.237AspLeu: 7.237 ± 0.777
1.414AspMet: 1.414 ± 0.322
4.824AspAsn: 4.824 ± 0.526
1.913AspPro: 1.913 ± 0.364
1.165AspGln: 1.165 ± 0.304
2.412AspArg: 2.412 ± 0.411
3.909AspSer: 3.909 ± 0.684
3.244AspThr: 3.244 ± 0.445
4.242AspVal: 4.242 ± 0.574
0.915AspTrp: 0.915 ± 0.317
3.327AspTyr: 3.327 ± 0.475
0.0AspXaa: 0.0 ± 0.0
Glu
5.823GluAla: 5.823 ± 0.834
0.499GluCys: 0.499 ± 0.187
3.327GluAsp: 3.327 ± 0.55
5.324GluGlu: 5.324 ± 0.835
2.495GluPhe: 2.495 ± 0.359
2.329GluGly: 2.329 ± 0.504
1.497GluHis: 1.497 ± 0.379
6.654GluIle: 6.654 ± 0.677
5.823GluLys: 5.823 ± 0.74
7.237GluLeu: 7.237 ± 0.732
1.913GluMet: 1.913 ± 0.398
2.745GluAsn: 2.745 ± 0.46
0.832GluPro: 0.832 ± 0.283
2.828GluGln: 2.828 ± 0.378
3.41GluArg: 3.41 ± 0.49
4.409GluSer: 4.409 ± 0.615
4.658GluThr: 4.658 ± 0.697
5.24GluVal: 5.24 ± 0.88
1.081GluTrp: 1.081 ± 0.273
3.078GluTyr: 3.078 ± 0.541
0.0GluXaa: 0.0 ± 0.0
Phe
2.412PheAla: 2.412 ± 0.388
0.333PheCys: 0.333 ± 0.137
2.995PheAsp: 2.995 ± 0.633
3.244PheGlu: 3.244 ± 0.557
1.414PhePhe: 1.414 ± 0.363
3.494PheGly: 3.494 ± 0.482
0.166PheHis: 0.166 ± 0.131
2.745PheIle: 2.745 ± 0.529
3.577PheLys: 3.577 ± 0.57
2.163PheLeu: 2.163 ± 0.494
1.331PheMet: 1.331 ± 0.312
2.579PheAsn: 2.579 ± 0.515
1.165PhePro: 1.165 ± 0.332
0.749PheGln: 0.749 ± 0.249
1.497PheArg: 1.497 ± 0.392
2.329PheSer: 2.329 ± 0.539
2.329PheThr: 2.329 ± 0.467
2.495PheVal: 2.495 ± 0.533
0.582PheTrp: 0.582 ± 0.221
0.915PheTyr: 0.915 ± 0.287
0.0PheXaa: 0.0 ± 0.0
Gly
4.159GlyAla: 4.159 ± 0.57
0.416GlyCys: 0.416 ± 0.226
3.993GlyAsp: 3.993 ± 0.606
3.909GlyGlu: 3.909 ± 0.582
2.329GlyPhe: 2.329 ± 0.519
4.325GlyGly: 4.325 ± 0.648
1.165GlyHis: 1.165 ± 0.276
4.908GlyIle: 4.908 ± 0.585
6.239GlyLys: 6.239 ± 0.813
5.906GlyLeu: 5.906 ± 0.951
2.495GlyMet: 2.495 ± 0.482
3.244GlyAsn: 3.244 ± 0.564
2.08GlyPro: 2.08 ± 1.056
2.662GlyGln: 2.662 ± 0.425
3.327GlyArg: 3.327 ± 0.414
3.244GlySer: 3.244 ± 0.54
2.495GlyThr: 2.495 ± 0.439
4.575GlyVal: 4.575 ± 0.58
0.915GlyTrp: 0.915 ± 0.292
2.579GlyTyr: 2.579 ± 0.53
0.0GlyXaa: 0.0 ± 0.0
His
0.582HisAla: 0.582 ± 0.225
0.083HisCys: 0.083 ± 0.084
1.081HisAsp: 1.081 ± 0.364
0.998HisGlu: 0.998 ± 0.311
0.832HisPhe: 0.832 ± 0.292
0.665HisGly: 0.665 ± 0.187
0.333HisHis: 0.333 ± 0.236
0.998HisIle: 0.998 ± 0.313
1.165HisLys: 1.165 ± 0.289
0.665HisLeu: 0.665 ± 0.239
0.416HisMet: 0.416 ± 0.219
0.832HisAsn: 0.832 ± 0.264
0.665HisPro: 0.665 ± 0.21
0.665HisGln: 0.665 ± 0.243
0.832HisArg: 0.832 ± 0.244
0.749HisSer: 0.749 ± 0.285
1.165HisThr: 1.165 ± 0.328
0.582HisVal: 0.582 ± 0.254
0.083HisTrp: 0.083 ± 0.081
0.665HisTyr: 0.665 ± 0.249
0.0HisXaa: 0.0 ± 0.0
Ile
5.573IleAla: 5.573 ± 0.729
0.25IleCys: 0.25 ± 0.148
6.155IleAsp: 6.155 ± 0.778
6.072IleGlu: 6.072 ± 0.692
2.246IlePhe: 2.246 ± 0.466
4.824IleGly: 4.824 ± 0.422
1.081IleHis: 1.081 ± 0.28
3.494IleIle: 3.494 ± 0.466
7.736IleLys: 7.736 ± 0.923
5.989IleLeu: 5.989 ± 0.723
0.749IleMet: 0.749 ± 0.269
3.327IleAsn: 3.327 ± 0.701
1.497IlePro: 1.497 ± 0.432
1.83IleGln: 1.83 ± 0.388
2.828IleArg: 2.828 ± 0.458
3.66IleSer: 3.66 ± 0.473
4.325IleThr: 4.325 ± 0.52
4.908IleVal: 4.908 ± 0.501
0.749IleTrp: 0.749 ± 0.287
2.329IleTyr: 2.329 ± 0.395
0.0IleXaa: 0.0 ± 0.0
Lys
6.571LysAla: 6.571 ± 0.924
0.582LysCys: 0.582 ± 0.216
5.074LysAsp: 5.074 ± 0.618
6.239LysGlu: 6.239 ± 0.891
2.412LysPhe: 2.412 ± 0.523
5.989LysGly: 5.989 ± 0.998
1.081LysHis: 1.081 ± 0.369
5.573LysIle: 5.573 ± 0.78
8.651LysLys: 8.651 ± 0.94
6.488LysLeu: 6.488 ± 0.773
3.161LysMet: 3.161 ± 0.579
5.24LysAsn: 5.24 ± 0.743
2.662LysPro: 2.662 ± 0.586
4.076LysGln: 4.076 ± 0.469
3.41LysArg: 3.41 ± 0.563
4.908LysSer: 4.908 ± 0.6
6.738LysThr: 6.738 ± 0.922
5.324LysVal: 5.324 ± 0.585
1.248LysTrp: 1.248 ± 0.285
3.161LysTyr: 3.161 ± 0.569
0.0LysXaa: 0.0 ± 0.0
Leu
6.322LeuAla: 6.322 ± 0.878
0.333LeuCys: 0.333 ± 0.173
6.654LeuAsp: 6.654 ± 0.564
6.821LeuGlu: 6.821 ± 0.791
2.911LeuPhe: 2.911 ± 0.667
5.49LeuGly: 5.49 ± 0.777
0.665LeuHis: 0.665 ± 0.232
5.49LeuIle: 5.49 ± 0.549
8.484LeuLys: 8.484 ± 0.83
6.571LeuLeu: 6.571 ± 0.701
2.08LeuMet: 2.08 ± 0.557
4.741LeuAsn: 4.741 ± 0.564
2.995LeuPro: 2.995 ± 0.475
3.494LeuGln: 3.494 ± 0.486
4.325LeuArg: 4.325 ± 0.592
5.573LeuSer: 5.573 ± 0.68
5.573LeuThr: 5.573 ± 0.748
4.741LeuVal: 4.741 ± 0.664
0.582LeuTrp: 0.582 ± 0.218
2.662LeuTyr: 2.662 ± 0.521
0.0LeuXaa: 0.0 ± 0.0
Met
2.163MetAla: 2.163 ± 0.347
0.0MetCys: 0.0 ± 0.0
1.248MetAsp: 1.248 ± 0.36
1.248MetGlu: 1.248 ± 0.26
1.414MetPhe: 1.414 ± 0.368
1.58MetGly: 1.58 ± 0.292
0.25MetHis: 0.25 ± 0.137
2.08MetIle: 2.08 ± 0.433
1.414MetLys: 1.414 ± 0.342
2.828MetLeu: 2.828 ± 0.546
0.832MetMet: 0.832 ± 0.273
0.749MetAsn: 0.749 ± 0.314
0.582MetPro: 0.582 ± 0.204
0.499MetGln: 0.499 ± 0.235
1.58MetArg: 1.58 ± 0.343
2.412MetSer: 2.412 ± 0.479
2.329MetThr: 2.329 ± 0.426
1.331MetVal: 1.331 ± 0.279
0.25MetTrp: 0.25 ± 0.131
0.665MetTyr: 0.665 ± 0.265
0.0MetXaa: 0.0 ± 0.0
Asn
3.41AsnAla: 3.41 ± 0.511
0.416AsnCys: 0.416 ± 0.191
3.577AsnAsp: 3.577 ± 0.412
2.579AsnGlu: 2.579 ± 0.454
2.495AsnPhe: 2.495 ± 0.486
5.157AsnGly: 5.157 ± 0.627
0.749AsnHis: 0.749 ± 0.246
2.412AsnIle: 2.412 ± 0.411
4.492AsnLys: 4.492 ± 0.632
5.074AsnLeu: 5.074 ± 0.55
1.83AsnMet: 1.83 ± 0.366
3.826AsnAsn: 3.826 ± 0.617
2.163AsnPro: 2.163 ± 0.446
2.579AsnGln: 2.579 ± 0.517
2.495AsnArg: 2.495 ± 0.437
3.078AsnSer: 3.078 ± 0.836
1.913AsnThr: 1.913 ± 0.357
2.911AsnVal: 2.911 ± 0.521
0.915AsnTrp: 0.915 ± 0.295
2.246AsnTyr: 2.246 ± 0.418
0.0AsnXaa: 0.0 ± 0.0
Pro
1.58ProAla: 1.58 ± 0.458
0.166ProCys: 0.166 ± 0.112
2.246ProAsp: 2.246 ± 0.406
2.08ProGlu: 2.08 ± 0.374
1.248ProPhe: 1.248 ± 0.296
1.248ProGly: 1.248 ± 0.377
0.499ProHis: 0.499 ± 0.19
1.664ProIle: 1.664 ± 0.44
3.161ProLys: 3.161 ± 0.512
2.579ProLeu: 2.579 ± 0.508
0.499ProMet: 0.499 ± 0.213
1.497ProAsn: 1.497 ± 0.448
0.665ProPro: 0.665 ± 0.216
1.58ProGln: 1.58 ± 0.51
1.081ProArg: 1.081 ± 0.347
1.58ProSer: 1.58 ± 0.356
1.248ProThr: 1.248 ± 0.324
1.996ProVal: 1.996 ± 0.432
0.25ProTrp: 0.25 ± 0.142
0.915ProTyr: 0.915 ± 0.383
0.0ProXaa: 0.0 ± 0.0
Gln
3.161GlnAla: 3.161 ± 0.545
0.083GlnCys: 0.083 ± 0.083
2.08GlnAsp: 2.08 ± 0.332
3.078GlnGlu: 3.078 ± 0.478
1.331GlnPhe: 1.331 ± 0.303
2.579GlnGly: 2.579 ± 0.664
0.416GlnHis: 0.416 ± 0.156
2.579GlnIle: 2.579 ± 0.51
3.826GlnLys: 3.826 ± 0.63
3.743GlnLeu: 3.743 ± 0.614
0.998GlnMet: 0.998 ± 0.304
2.329GlnAsn: 2.329 ± 0.566
0.665GlnPro: 0.665 ± 0.256
2.329GlnGln: 2.329 ± 0.655
1.747GlnArg: 1.747 ± 0.442
2.745GlnSer: 2.745 ± 0.463
2.662GlnThr: 2.662 ± 0.556
1.414GlnVal: 1.414 ± 0.327
0.166GlnTrp: 0.166 ± 0.087
1.58GlnTyr: 1.58 ± 0.337
0.0GlnXaa: 0.0 ± 0.0
Arg
3.078ArgAla: 3.078 ± 0.446
0.333ArgCys: 0.333 ± 0.15
2.495ArgAsp: 2.495 ± 0.482
3.078ArgGlu: 3.078 ± 0.465
1.58ArgPhe: 1.58 ± 0.325
2.911ArgGly: 2.911 ± 0.542
1.248ArgHis: 1.248 ± 0.383
3.66ArgIle: 3.66 ± 0.616
3.577ArgLys: 3.577 ± 0.564
3.909ArgLeu: 3.909 ± 0.558
0.665ArgMet: 0.665 ± 0.212
2.995ArgAsn: 2.995 ± 0.567
0.665ArgPro: 0.665 ± 0.199
1.497ArgGln: 1.497 ± 0.406
2.995ArgArg: 2.995 ± 0.531
2.246ArgSer: 2.246 ± 0.433
2.911ArgThr: 2.911 ± 0.463
2.246ArgVal: 2.246 ± 0.464
0.665ArgTrp: 0.665 ± 0.236
2.163ArgTyr: 2.163 ± 0.445
0.0ArgXaa: 0.0 ± 0.0
Ser
5.157SerAla: 5.157 ± 0.794
0.166SerCys: 0.166 ± 0.126
4.575SerAsp: 4.575 ± 0.525
3.577SerGlu: 3.577 ± 0.468
3.161SerPhe: 3.161 ± 0.482
4.492SerGly: 4.492 ± 0.505
0.998SerHis: 0.998 ± 0.259
3.993SerIle: 3.993 ± 0.581
4.991SerLys: 4.991 ± 0.597
4.076SerLeu: 4.076 ± 0.749
1.664SerMet: 1.664 ± 0.355
2.828SerAsn: 2.828 ± 0.641
1.83SerPro: 1.83 ± 0.548
2.745SerGln: 2.745 ± 0.511
2.412SerArg: 2.412 ± 0.448
3.577SerSer: 3.577 ± 0.577
2.662SerThr: 2.662 ± 0.507
3.577SerVal: 3.577 ± 0.414
0.915SerTrp: 0.915 ± 0.311
2.911SerTyr: 2.911 ± 0.553
0.0SerXaa: 0.0 ± 0.0
Thr
4.991ThrAla: 4.991 ± 0.884
0.166ThrCys: 0.166 ± 0.115
3.494ThrAsp: 3.494 ± 0.432
3.577ThrGlu: 3.577 ± 0.406
2.579ThrPhe: 2.579 ± 0.49
4.991ThrGly: 4.991 ± 0.703
0.665ThrHis: 0.665 ± 0.25
4.908ThrIle: 4.908 ± 0.503
4.242ThrLys: 4.242 ± 0.587
5.324ThrLeu: 5.324 ± 0.663
0.915ThrMet: 0.915 ± 0.313
2.745ThrAsn: 2.745 ± 0.463
2.662ThrPro: 2.662 ± 0.379
2.495ThrGln: 2.495 ± 0.327
1.747ThrArg: 1.747 ± 0.369
3.161ThrSer: 3.161 ± 0.497
3.161ThrThr: 3.161 ± 0.463
3.993ThrVal: 3.993 ± 0.452
0.499ThrTrp: 0.499 ± 0.171
1.83ThrTyr: 1.83 ± 0.406
0.0ThrXaa: 0.0 ± 0.0
Val
5.573ValAla: 5.573 ± 0.765
0.25ValCys: 0.25 ± 0.125
5.24ValAsp: 5.24 ± 0.715
4.325ValGlu: 4.325 ± 0.592
2.495ValPhe: 2.495 ± 0.562
3.577ValGly: 3.577 ± 0.465
0.333ValHis: 0.333 ± 0.158
4.242ValIle: 4.242 ± 0.569
4.575ValLys: 4.575 ± 0.706
6.072ValLeu: 6.072 ± 0.881
1.497ValMet: 1.497 ± 0.425
3.327ValAsn: 3.327 ± 0.579
1.248ValPro: 1.248 ± 0.25
2.08ValGln: 2.08 ± 0.492
2.828ValArg: 2.828 ± 0.449
4.076ValSer: 4.076 ± 0.58
3.66ValThr: 3.66 ± 0.666
3.909ValVal: 3.909 ± 0.662
0.416ValTrp: 0.416 ± 0.182
2.246ValTyr: 2.246 ± 0.43
0.0ValXaa: 0.0 ± 0.0
Trp
0.582TrpAla: 0.582 ± 0.228
0.166TrpCys: 0.166 ± 0.113
1.248TrpAsp: 1.248 ± 0.365
0.832TrpGlu: 0.832 ± 0.276
0.416TrpPhe: 0.416 ± 0.199
0.915TrpGly: 0.915 ± 0.332
0.166TrpHis: 0.166 ± 0.129
0.749TrpIle: 0.749 ± 0.271
0.915TrpLys: 0.915 ± 0.337
1.081TrpLeu: 1.081 ± 0.261
0.083TrpMet: 0.083 ± 0.087
0.416TrpAsn: 0.416 ± 0.176
0.333TrpPro: 0.333 ± 0.163
0.749TrpGln: 0.749 ± 0.278
0.749TrpArg: 0.749 ± 0.25
0.416TrpSer: 0.416 ± 0.189
0.499TrpThr: 0.499 ± 0.227
0.832TrpVal: 0.832 ± 0.314
0.0TrpTrp: 0.0 ± 0.0
0.416TrpTyr: 0.416 ± 0.198
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.911TyrAla: 2.911 ± 0.593
0.499TyrCys: 0.499 ± 0.204
2.412TyrAsp: 2.412 ± 0.348
2.828TyrGlu: 2.828 ± 0.395
1.497TyrPhe: 1.497 ± 0.405
1.747TyrGly: 1.747 ± 0.35
0.832TyrHis: 0.832 ± 0.196
1.747TyrIle: 1.747 ± 0.476
3.161TyrLys: 3.161 ± 0.531
3.327TyrLeu: 3.327 ± 0.613
0.499TyrMet: 0.499 ± 0.184
1.747TyrAsn: 1.747 ± 0.382
1.497TyrPro: 1.497 ± 0.403
1.996TyrGln: 1.996 ± 0.461
2.246TyrArg: 2.246 ± 0.346
2.246TyrSer: 2.246 ± 0.444
2.495TyrThr: 2.495 ± 0.365
1.996TyrVal: 1.996 ± 0.454
0.416TyrTrp: 0.416 ± 0.181
2.08TyrTyr: 2.08 ± 0.446
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (12023 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski