Amino acid dipepetide frequency for Streptococcus phage Javan423

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.564AlaAla: 1.564 ± 0.538
0.434AlaCys: 0.434 ± 0.188
3.128AlaAsp: 3.128 ± 0.568
5.213AlaGlu: 5.213 ± 0.745
3.301AlaPhe: 3.301 ± 0.533
3.91AlaGly: 3.91 ± 1.045
0.608AlaHis: 0.608 ± 0.243
5.995AlaIle: 5.995 ± 1.232
6.342AlaLys: 6.342 ± 0.75
5.126AlaLeu: 5.126 ± 0.704
2.172AlaMet: 2.172 ± 0.369
3.041AlaAsn: 3.041 ± 0.466
2.172AlaPro: 2.172 ± 0.546
1.825AlaGln: 1.825 ± 0.292
2.78AlaArg: 2.78 ± 0.455
4.692AlaSer: 4.692 ± 0.862
3.91AlaThr: 3.91 ± 0.714
3.475AlaVal: 3.475 ± 0.505
1.043AlaTrp: 1.043 ± 0.296
2.606AlaTyr: 2.606 ± 0.447
0.0AlaXaa: 0.0 ± 0.0
Cys
0.087CysAla: 0.087 ± 0.083
0.174CysCys: 0.174 ± 0.119
0.695CysAsp: 0.695 ± 0.255
0.782CysGlu: 0.782 ± 0.262
0.087CysPhe: 0.087 ± 0.091
0.608CysGly: 0.608 ± 0.319
0.174CysHis: 0.174 ± 0.118
0.434CysIle: 0.434 ± 0.19
0.348CysLys: 0.348 ± 0.217
0.608CysLeu: 0.608 ± 0.212
0.0CysMet: 0.0 ± 0.0
0.348CysAsn: 0.348 ± 0.17
0.174CysPro: 0.174 ± 0.101
0.087CysGln: 0.087 ± 0.088
0.261CysArg: 0.261 ± 0.173
0.608CysSer: 0.608 ± 0.184
0.087CysThr: 0.087 ± 0.095
0.434CysVal: 0.434 ± 0.265
0.087CysTrp: 0.087 ± 0.081
0.348CysTyr: 0.348 ± 0.233
0.0CysXaa: 0.0 ± 0.0
Asp
3.997AspAla: 3.997 ± 0.6
0.434AspCys: 0.434 ± 0.197
3.736AspAsp: 3.736 ± 0.602
3.91AspGlu: 3.91 ± 0.599
3.128AspPhe: 3.128 ± 0.561
5.647AspGly: 5.647 ± 0.845
0.869AspHis: 0.869 ± 0.261
4.778AspIle: 4.778 ± 0.621
5.039AspLys: 5.039 ± 0.643
5.213AspLeu: 5.213 ± 0.674
1.738AspMet: 1.738 ± 0.438
4.17AspAsn: 4.17 ± 0.635
1.043AspPro: 1.043 ± 0.268
1.129AspGln: 1.129 ± 0.27
2.606AspArg: 2.606 ± 0.41
3.997AspSer: 3.997 ± 0.702
3.562AspThr: 3.562 ± 0.624
2.78AspVal: 2.78 ± 0.557
1.303AspTrp: 1.303 ± 0.45
3.562AspTyr: 3.562 ± 0.644
0.0AspXaa: 0.0 ± 0.0
Glu
5.213GluAla: 5.213 ± 0.64
0.261GluCys: 0.261 ± 0.192
4.344GluAsp: 4.344 ± 0.69
6.342GluGlu: 6.342 ± 0.972
3.736GluPhe: 3.736 ± 0.558
3.041GluGly: 3.041 ± 0.515
1.129GluHis: 1.129 ± 0.32
5.126GluIle: 5.126 ± 0.789
5.213GluLys: 5.213 ± 0.928
7.298GluLeu: 7.298 ± 0.907
2.172GluMet: 2.172 ± 0.518
5.039GluAsn: 5.039 ± 0.809
1.043GluPro: 1.043 ± 0.304
3.301GluGln: 3.301 ± 0.679
2.693GluArg: 2.693 ± 0.556
3.041GluSer: 3.041 ± 0.491
3.215GluThr: 3.215 ± 0.557
4.605GluVal: 4.605 ± 0.613
1.216GluTrp: 1.216 ± 0.307
3.301GluTyr: 3.301 ± 0.429
0.0GluXaa: 0.0 ± 0.0
Phe
2.172PheAla: 2.172 ± 0.313
0.261PheCys: 0.261 ± 0.138
3.736PheAsp: 3.736 ± 0.495
3.736PheGlu: 3.736 ± 0.642
1.303PhePhe: 1.303 ± 0.369
3.041PheGly: 3.041 ± 0.617
0.348PheHis: 0.348 ± 0.182
3.041PheIle: 3.041 ± 0.51
4.692PheLys: 4.692 ± 0.883
2.867PheLeu: 2.867 ± 0.536
1.216PheMet: 1.216 ± 0.336
2.259PheAsn: 2.259 ± 0.4
0.956PhePro: 0.956 ± 0.362
1.216PheGln: 1.216 ± 0.298
1.911PheArg: 1.911 ± 0.424
2.085PheSer: 2.085 ± 0.507
3.475PheThr: 3.475 ± 0.482
2.346PheVal: 2.346 ± 0.428
0.087PheTrp: 0.087 ± 0.072
1.651PheTyr: 1.651 ± 0.351
0.0PheXaa: 0.0 ± 0.0
Gly
3.475GlyAla: 3.475 ± 0.964
0.348GlyCys: 0.348 ± 0.18
3.649GlyAsp: 3.649 ± 0.598
2.52GlyGlu: 2.52 ± 0.526
3.997GlyPhe: 3.997 ± 0.781
4.605GlyGly: 4.605 ± 0.688
1.303GlyHis: 1.303 ± 0.362
5.474GlyIle: 5.474 ± 0.795
5.734GlyLys: 5.734 ± 0.777
5.387GlyLeu: 5.387 ± 0.629
1.477GlyMet: 1.477 ± 0.454
3.388GlyAsn: 3.388 ± 0.451
0.695GlyPro: 0.695 ± 0.277
3.041GlyGln: 3.041 ± 0.469
2.693GlyArg: 2.693 ± 0.516
3.736GlySer: 3.736 ± 0.7
5.56GlyThr: 5.56 ± 1.02
4.17GlyVal: 4.17 ± 0.874
1.303GlyTrp: 1.303 ± 0.355
3.388GlyTyr: 3.388 ± 0.553
0.0GlyXaa: 0.0 ± 0.0
His
1.043HisAla: 1.043 ± 0.385
0.174HisCys: 0.174 ± 0.098
0.782HisAsp: 0.782 ± 0.302
1.129HisGlu: 1.129 ± 0.255
0.434HisPhe: 0.434 ± 0.199
0.608HisGly: 0.608 ± 0.282
0.174HisHis: 0.174 ± 0.13
0.782HisIle: 0.782 ± 0.307
1.303HisLys: 1.303 ± 0.272
1.39HisLeu: 1.39 ± 0.368
0.348HisMet: 0.348 ± 0.205
0.695HisAsn: 0.695 ± 0.252
0.608HisPro: 0.608 ± 0.262
0.608HisGln: 0.608 ± 0.275
0.521HisArg: 0.521 ± 0.198
0.434HisSer: 0.434 ± 0.182
0.869HisThr: 0.869 ± 0.249
0.869HisVal: 0.869 ± 0.283
0.174HisTrp: 0.174 ± 0.139
0.869HisTyr: 0.869 ± 0.273
0.0HisXaa: 0.0 ± 0.0
Ile
5.126IleAla: 5.126 ± 0.691
0.608IleCys: 0.608 ± 0.273
6.082IleAsp: 6.082 ± 0.894
6.516IleGlu: 6.516 ± 0.767
2.78IlePhe: 2.78 ± 0.588
5.039IleGly: 5.039 ± 0.699
1.129IleHis: 1.129 ± 0.251
4.518IleIle: 4.518 ± 0.523
7.298IleLys: 7.298 ± 0.967
5.126IleLeu: 5.126 ± 0.569
1.738IleMet: 1.738 ± 0.398
5.647IleAsn: 5.647 ± 0.885
2.52IlePro: 2.52 ± 0.583
2.259IleGln: 2.259 ± 0.333
1.477IleArg: 1.477 ± 0.376
4.692IleSer: 4.692 ± 0.863
4.518IleThr: 4.518 ± 0.836
5.213IleVal: 5.213 ± 0.773
0.521IleTrp: 0.521 ± 0.19
2.606IleTyr: 2.606 ± 0.636
0.0IleXaa: 0.0 ± 0.0
Lys
7.211LysAla: 7.211 ± 0.889
0.261LysCys: 0.261 ± 0.165
4.865LysAsp: 4.865 ± 0.639
6.429LysGlu: 6.429 ± 0.985
2.172LysPhe: 2.172 ± 0.489
5.3LysGly: 5.3 ± 0.544
0.782LysHis: 0.782 ± 0.297
6.777LysIle: 6.777 ± 0.792
7.211LysLys: 7.211 ± 0.975
6.69LysLeu: 6.69 ± 0.794
2.346LysMet: 2.346 ± 0.395
5.821LysAsn: 5.821 ± 0.668
2.346LysPro: 2.346 ± 0.346
3.041LysGln: 3.041 ± 0.476
4.083LysArg: 4.083 ± 0.54
4.344LysSer: 4.344 ± 0.462
6.864LysThr: 6.864 ± 0.707
4.431LysVal: 4.431 ± 0.573
1.477LysTrp: 1.477 ± 0.371
3.475LysTyr: 3.475 ± 0.511
0.0LysXaa: 0.0 ± 0.0
Leu
4.605LeuAla: 4.605 ± 0.731
0.608LeuCys: 0.608 ± 0.284
5.213LeuAsp: 5.213 ± 0.777
5.56LeuGlu: 5.56 ± 0.694
3.649LeuPhe: 3.649 ± 0.498
4.605LeuGly: 4.605 ± 0.886
0.695LeuHis: 0.695 ± 0.332
7.124LeuIle: 7.124 ± 0.77
7.819LeuLys: 7.819 ± 0.891
5.56LeuLeu: 5.56 ± 0.661
1.564LeuMet: 1.564 ± 0.351
4.778LeuAsn: 4.778 ± 0.792
2.346LeuPro: 2.346 ± 0.446
3.301LeuGln: 3.301 ± 0.547
2.867LeuArg: 2.867 ± 0.551
5.647LeuSer: 5.647 ± 0.685
6.69LeuThr: 6.69 ± 0.809
3.301LeuVal: 3.301 ± 0.524
1.216LeuTrp: 1.216 ± 0.275
2.085LeuTyr: 2.085 ± 0.426
0.0LeuXaa: 0.0 ± 0.0
Met
2.259MetAla: 2.259 ± 0.614
0.261MetCys: 0.261 ± 0.142
1.651MetAsp: 1.651 ± 0.364
1.303MetGlu: 1.303 ± 0.426
0.869MetPhe: 0.869 ± 0.238
1.129MetGly: 1.129 ± 0.292
0.348MetHis: 0.348 ± 0.175
2.085MetIle: 2.085 ± 0.403
2.52MetLys: 2.52 ± 0.511
1.651MetLeu: 1.651 ± 0.333
0.782MetMet: 0.782 ± 0.259
0.869MetAsn: 0.869 ± 0.31
0.782MetPro: 0.782 ± 0.248
1.564MetGln: 1.564 ± 0.431
1.129MetArg: 1.129 ± 0.307
1.911MetSer: 1.911 ± 0.394
2.172MetThr: 2.172 ± 0.356
0.869MetVal: 0.869 ± 0.296
0.261MetTrp: 0.261 ± 0.136
1.043MetTyr: 1.043 ± 0.341
0.0MetXaa: 0.0 ± 0.0
Asn
3.215AsnAla: 3.215 ± 0.632
0.434AsnCys: 0.434 ± 0.175
3.649AsnAsp: 3.649 ± 0.688
3.215AsnGlu: 3.215 ± 0.526
2.78AsnPhe: 2.78 ± 0.448
5.387AsnGly: 5.387 ± 0.902
0.434AsnHis: 0.434 ± 0.176
3.649AsnIle: 3.649 ± 0.679
4.952AsnLys: 4.952 ± 0.73
3.823AsnLeu: 3.823 ± 0.601
1.564AsnMet: 1.564 ± 0.437
3.215AsnAsn: 3.215 ± 0.509
1.825AsnPro: 1.825 ± 0.407
2.346AsnGln: 2.346 ± 0.39
2.259AsnArg: 2.259 ± 0.48
4.344AsnSer: 4.344 ± 0.607
3.128AsnThr: 3.128 ± 0.639
4.431AsnVal: 4.431 ± 0.548
1.043AsnTrp: 1.043 ± 0.309
2.606AsnTyr: 2.606 ± 0.467
0.0AsnXaa: 0.0 ± 0.0
Pro
1.564ProAla: 1.564 ± 0.538
0.0ProCys: 0.0 ± 0.0
1.564ProAsp: 1.564 ± 0.463
1.825ProGlu: 1.825 ± 0.409
0.956ProPhe: 0.956 ± 0.284
1.129ProGly: 1.129 ± 0.384
0.782ProHis: 0.782 ± 0.218
2.346ProIle: 2.346 ± 0.381
2.867ProLys: 2.867 ± 0.544
2.52ProLeu: 2.52 ± 0.469
0.695ProMet: 0.695 ± 0.25
1.216ProAsn: 1.216 ± 0.376
0.608ProPro: 0.608 ± 0.219
1.39ProGln: 1.39 ± 0.337
0.956ProArg: 0.956 ± 0.241
2.693ProSer: 2.693 ± 0.565
1.911ProThr: 1.911 ± 0.459
1.738ProVal: 1.738 ± 0.359
0.087ProTrp: 0.087 ± 0.083
0.869ProTyr: 0.869 ± 0.287
0.0ProXaa: 0.0 ± 0.0
Gln
2.606GlnAla: 2.606 ± 0.368
0.348GlnCys: 0.348 ± 0.173
1.564GlnAsp: 1.564 ± 0.342
1.998GlnGlu: 1.998 ± 0.526
1.564GlnPhe: 1.564 ± 0.421
1.911GlnGly: 1.911 ± 0.575
0.695GlnHis: 0.695 ± 0.325
3.736GlnIle: 3.736 ± 0.777
3.041GlnLys: 3.041 ± 0.613
2.867GlnLeu: 2.867 ± 0.465
0.782GlnMet: 0.782 ± 0.281
1.998GlnAsn: 1.998 ± 0.408
1.477GlnPro: 1.477 ± 0.327
1.129GlnGln: 1.129 ± 0.35
1.303GlnArg: 1.303 ± 0.296
2.78GlnSer: 2.78 ± 0.53
2.433GlnThr: 2.433 ± 0.464
1.825GlnVal: 1.825 ± 0.42
1.303GlnTrp: 1.303 ± 0.361
1.303GlnTyr: 1.303 ± 0.309
0.0GlnXaa: 0.0 ± 0.0
Arg
2.085ArgAla: 2.085 ± 0.391
0.0ArgCys: 0.0 ± 0.0
1.651ArgAsp: 1.651 ± 0.405
2.346ArgGlu: 2.346 ± 0.441
1.651ArgPhe: 1.651 ± 0.505
2.085ArgGly: 2.085 ± 0.412
0.695ArgHis: 0.695 ± 0.184
3.128ArgIle: 3.128 ± 0.535
2.867ArgLys: 2.867 ± 0.617
3.388ArgLeu: 3.388 ± 0.592
1.303ArgMet: 1.303 ± 0.302
2.085ArgAsn: 2.085 ± 0.457
1.129ArgPro: 1.129 ± 0.315
0.869ArgGln: 0.869 ± 0.272
0.608ArgArg: 0.608 ± 0.231
1.825ArgSer: 1.825 ± 0.483
2.172ArgThr: 2.172 ± 0.574
2.867ArgVal: 2.867 ± 0.581
0.087ArgTrp: 0.087 ± 0.088
2.52ArgTyr: 2.52 ± 0.513
0.0ArgXaa: 0.0 ± 0.0
Ser
4.518SerAla: 4.518 ± 1.019
0.434SerCys: 0.434 ± 0.217
3.475SerAsp: 3.475 ± 0.574
4.778SerGlu: 4.778 ± 0.637
2.52SerPhe: 2.52 ± 0.51
5.908SerGly: 5.908 ± 0.838
0.869SerHis: 0.869 ± 0.275
3.388SerIle: 3.388 ± 0.625
5.3SerLys: 5.3 ± 0.673
5.126SerLeu: 5.126 ± 0.872
1.911SerMet: 1.911 ± 0.514
4.344SerAsn: 4.344 ± 0.835
1.477SerPro: 1.477 ± 0.282
2.346SerGln: 2.346 ± 0.572
1.129SerArg: 1.129 ± 0.31
4.344SerSer: 4.344 ± 0.832
3.475SerThr: 3.475 ± 0.77
5.039SerVal: 5.039 ± 0.716
1.043SerTrp: 1.043 ± 0.253
2.606SerTyr: 2.606 ± 0.522
0.0SerXaa: 0.0 ± 0.0
Thr
4.778ThrAla: 4.778 ± 0.744
0.174ThrCys: 0.174 ± 0.107
4.778ThrAsp: 4.778 ± 0.801
4.17ThrGlu: 4.17 ± 0.801
2.867ThrPhe: 2.867 ± 0.521
5.908ThrGly: 5.908 ± 0.713
0.869ThrHis: 0.869 ± 0.297
5.821ThrIle: 5.821 ± 0.904
5.039ThrLys: 5.039 ± 0.547
4.778ThrLeu: 4.778 ± 0.76
0.956ThrMet: 0.956 ± 0.346
2.606ThrAsn: 2.606 ± 0.457
2.78ThrPro: 2.78 ± 0.42
2.606ThrGln: 2.606 ± 0.704
1.825ThrArg: 1.825 ± 0.432
3.91ThrSer: 3.91 ± 0.731
4.692ThrThr: 4.692 ± 0.86
5.3ThrVal: 5.3 ± 0.689
0.782ThrTrp: 0.782 ± 0.259
3.301ThrTyr: 3.301 ± 0.538
0.0ThrXaa: 0.0 ± 0.0
Val
4.083ValAla: 4.083 ± 0.628
0.608ValCys: 0.608 ± 0.243
4.17ValAsp: 4.17 ± 0.626
5.213ValGlu: 5.213 ± 0.6
1.825ValPhe: 1.825 ± 0.367
3.041ValGly: 3.041 ± 0.579
1.043ValHis: 1.043 ± 0.301
4.518ValIle: 4.518 ± 0.78
3.997ValLys: 3.997 ± 0.663
5.387ValLeu: 5.387 ± 0.642
1.303ValMet: 1.303 ± 0.316
3.823ValAsn: 3.823 ± 0.559
2.085ValPro: 2.085 ± 0.429
1.911ValGln: 1.911 ± 0.356
2.085ValArg: 2.085 ± 0.567
4.692ValSer: 4.692 ± 0.719
5.821ValThr: 5.821 ± 1.004
3.91ValVal: 3.91 ± 0.7
0.261ValTrp: 0.261 ± 0.156
1.998ValTyr: 1.998 ± 0.397
0.0ValXaa: 0.0 ± 0.0
Trp
0.956TrpAla: 0.956 ± 0.234
0.348TrpCys: 0.348 ± 0.19
0.521TrpAsp: 0.521 ± 0.245
1.043TrpGlu: 1.043 ± 0.309
0.521TrpPhe: 0.521 ± 0.242
1.043TrpGly: 1.043 ± 0.281
0.087TrpHis: 0.087 ± 0.095
1.043TrpIle: 1.043 ± 0.337
1.043TrpLys: 1.043 ± 0.259
0.869TrpLeu: 0.869 ± 0.251
0.174TrpMet: 0.174 ± 0.112
0.695TrpAsn: 0.695 ± 0.228
0.261TrpPro: 0.261 ± 0.15
0.348TrpGln: 0.348 ± 0.19
0.608TrpArg: 0.608 ± 0.211
1.303TrpSer: 1.303 ± 0.323
1.129TrpThr: 1.129 ± 0.474
1.303TrpVal: 1.303 ± 0.389
0.261TrpTrp: 0.261 ± 0.155
0.434TrpTyr: 0.434 ± 0.185
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.78TyrAla: 2.78 ± 0.698
0.261TyrCys: 0.261 ± 0.161
3.388TyrAsp: 3.388 ± 0.364
3.301TyrGlu: 3.301 ± 0.55
2.259TyrPhe: 2.259 ± 0.478
1.998TyrGly: 1.998 ± 0.377
0.782TyrHis: 0.782 ± 0.268
1.651TyrIle: 1.651 ± 0.352
3.301TyrLys: 3.301 ± 0.497
3.562TyrLeu: 3.562 ± 0.522
1.129TyrMet: 1.129 ± 0.245
2.259TyrAsn: 2.259 ± 0.596
1.39TyrPro: 1.39 ± 0.386
2.346TyrGln: 2.346 ± 0.393
1.564TyrArg: 1.564 ± 0.459
3.041TyrSer: 3.041 ± 0.556
2.259TyrThr: 2.259 ± 0.399
2.867TyrVal: 2.867 ± 0.47
0.434TyrTrp: 0.434 ± 0.195
1.738TyrTyr: 1.738 ± 0.4
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (11511 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski