Amino acid dipepetide frequency for Streptococcus satellite phage Javan305

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.65AlaAla: 2.65 ± 1.419
0.589AlaCys: 0.589 ± 0.466
3.239AlaAsp: 3.239 ± 1.273
6.478AlaGlu: 6.478 ± 1.54
1.767AlaPhe: 1.767 ± 0.617
1.472AlaGly: 1.472 ± 0.412
1.178AlaHis: 1.178 ± 0.417
5.006AlaIle: 5.006 ± 1.098
2.945AlaLys: 2.945 ± 0.908
6.478AlaLeu: 6.478 ± 0.918
0.883AlaMet: 0.883 ± 0.414
3.534AlaAsn: 3.534 ± 1.45
1.767AlaPro: 1.767 ± 0.412
3.239AlaGln: 3.239 ± 1.539
2.945AlaArg: 2.945 ± 0.642
5.006AlaSer: 5.006 ± 1.492
3.239AlaThr: 3.239 ± 1.095
3.534AlaVal: 3.534 ± 0.973
1.472AlaTrp: 1.472 ± 0.692
2.945AlaTyr: 2.945 ± 0.868
0.0AlaXaa: 0.0 ± 0.0
Cys
0.294CysAla: 0.294 ± 0.232
0.0CysCys: 0.0 ± 0.0
0.883CysAsp: 0.883 ± 0.449
0.589CysGlu: 0.589 ± 0.36
0.294CysPhe: 0.294 ± 0.279
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.589CysIle: 0.589 ± 0.36
0.883CysLys: 0.883 ± 0.507
0.294CysLeu: 0.294 ± 0.297
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.294CysPro: 0.294 ± 0.264
0.0CysGln: 0.0 ± 0.0
0.294CysArg: 0.294 ± 0.29
1.472CysSer: 1.472 ± 0.683
0.294CysThr: 0.294 ± 0.29
0.294CysVal: 0.294 ± 0.297
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.239AspAla: 3.239 ± 0.93
0.0AspCys: 0.0 ± 0.0
2.356AspAsp: 2.356 ± 0.681
3.239AspGlu: 3.239 ± 0.684
3.239AspPhe: 3.239 ± 0.925
3.239AspGly: 3.239 ± 0.74
0.883AspHis: 0.883 ± 0.442
5.3AspIle: 5.3 ± 1.251
7.362AspLys: 7.362 ± 1.327
3.534AspLeu: 3.534 ± 1.481
1.472AspMet: 1.472 ± 0.613
1.178AspAsn: 1.178 ± 0.56
1.767AspPro: 1.767 ± 0.561
1.472AspGln: 1.472 ± 1.112
1.767AspArg: 1.767 ± 1.142
5.595AspSer: 5.595 ± 0.843
2.945AspThr: 2.945 ± 0.627
2.356AspVal: 2.356 ± 0.97
0.294AspTrp: 0.294 ± 0.232
2.061AspTyr: 2.061 ± 1.114
0.0AspXaa: 0.0 ± 0.0
Glu
5.595GluAla: 5.595 ± 1.047
0.294GluCys: 0.294 ± 0.307
4.122GluAsp: 4.122 ± 1.049
7.362GluGlu: 7.362 ± 2.37
2.061GluPhe: 2.061 ± 0.838
4.711GluGly: 4.711 ± 1.267
0.294GluHis: 0.294 ± 0.342
5.595GluIle: 5.595 ± 0.93
10.306GluLys: 10.306 ± 1.957
9.717GluLeu: 9.717 ± 1.812
2.356GluMet: 2.356 ± 0.829
4.711GluAsn: 4.711 ± 1.455
1.178GluPro: 1.178 ± 0.574
3.828GluGln: 3.828 ± 1.272
3.239GluArg: 3.239 ± 1.557
2.945GluSer: 2.945 ± 0.696
4.417GluThr: 4.417 ± 0.999
6.478GluVal: 6.478 ± 1.374
0.294GluTrp: 0.294 ± 0.279
3.828GluTyr: 3.828 ± 1.12
0.0GluXaa: 0.0 ± 0.0
Phe
0.589PheAla: 0.589 ± 0.321
0.294PheCys: 0.294 ± 0.232
2.945PheAsp: 2.945 ± 1.049
2.65PheGlu: 2.65 ± 1.114
0.883PhePhe: 0.883 ± 0.477
0.883PheGly: 0.883 ± 0.428
0.883PheHis: 0.883 ± 0.378
1.767PheIle: 1.767 ± 0.619
2.945PheLys: 2.945 ± 0.846
4.122PheLeu: 4.122 ± 0.81
0.589PheMet: 0.589 ± 0.324
2.061PheAsn: 2.061 ± 0.777
0.0PhePro: 0.0 ± 0.0
0.883PheGln: 0.883 ± 0.378
1.472PheArg: 1.472 ± 0.411
2.356PheSer: 2.356 ± 0.703
2.061PheThr: 2.061 ± 0.674
1.767PheVal: 1.767 ± 0.681
1.472PheTrp: 1.472 ± 0.927
1.472PheTyr: 1.472 ± 0.595
0.0PheXaa: 0.0 ± 0.0
Gly
1.767GlyAla: 1.767 ± 0.89
1.178GlyCys: 1.178 ± 0.627
1.767GlyAsp: 1.767 ± 0.701
1.472GlyGlu: 1.472 ± 0.536
2.945GlyPhe: 2.945 ± 0.644
2.356GlyGly: 2.356 ± 0.888
1.178GlyHis: 1.178 ± 0.617
4.417GlyIle: 4.417 ± 0.811
4.417GlyLys: 4.417 ± 1.662
2.356GlyLeu: 2.356 ± 0.706
1.178GlyMet: 1.178 ± 0.59
2.65GlyAsn: 2.65 ± 0.677
0.883GlyPro: 0.883 ± 0.434
1.472GlyGln: 1.472 ± 0.603
2.061GlyArg: 2.061 ± 0.565
2.945GlySer: 2.945 ± 0.634
1.767GlyThr: 1.767 ± 0.773
2.356GlyVal: 2.356 ± 0.908
0.589GlyTrp: 0.589 ± 0.311
2.945GlyTyr: 2.945 ± 0.944
0.0GlyXaa: 0.0 ± 0.0
His
1.178HisAla: 1.178 ± 0.849
0.0HisCys: 0.0 ± 0.0
0.589HisAsp: 0.589 ± 0.346
0.589HisGlu: 0.589 ± 0.398
0.589HisPhe: 0.589 ± 0.39
1.178HisGly: 1.178 ± 0.436
0.294HisHis: 0.294 ± 0.232
1.472HisIle: 1.472 ± 0.63
2.061HisLys: 2.061 ± 0.745
2.061HisLeu: 2.061 ± 0.708
0.294HisMet: 0.294 ± 0.286
0.883HisAsn: 0.883 ± 0.446
1.178HisPro: 1.178 ± 0.647
0.589HisGln: 0.589 ± 0.595
0.294HisArg: 0.294 ± 0.29
1.472HisSer: 1.472 ± 0.405
2.65HisThr: 2.65 ± 0.876
1.178HisVal: 1.178 ± 0.607
0.294HisTrp: 0.294 ± 0.29
0.294HisTyr: 0.294 ± 0.29
0.0HisXaa: 0.0 ± 0.0
Ile
5.889IleAla: 5.889 ± 1.034
0.294IleCys: 0.294 ± 0.232
6.478IleAsp: 6.478 ± 1.531
7.362IleGlu: 7.362 ± 1.308
0.883IlePhe: 0.883 ± 0.506
5.3IleGly: 5.3 ± 1.325
2.65IleHis: 2.65 ± 0.837
7.656IleIle: 7.656 ± 1.855
6.773IleLys: 6.773 ± 1.396
6.478IleLeu: 6.478 ± 0.978
0.589IleMet: 0.589 ± 0.311
4.417IleAsn: 4.417 ± 1.141
3.239IlePro: 3.239 ± 0.72
2.945IleGln: 2.945 ± 0.927
2.061IleArg: 2.061 ± 0.579
5.006IleSer: 5.006 ± 1.089
5.006IleThr: 5.006 ± 1.259
6.478IleVal: 6.478 ± 1.368
0.589IleTrp: 0.589 ± 0.311
3.534IleTyr: 3.534 ± 1.275
0.0IleXaa: 0.0 ± 0.0
Lys
7.067LysAla: 7.067 ± 1.571
0.589LysCys: 0.589 ± 0.408
3.828LysAsp: 3.828 ± 1.228
9.717LysGlu: 9.717 ± 1.986
2.061LysPhe: 2.061 ± 0.862
3.239LysGly: 3.239 ± 0.958
2.356LysHis: 2.356 ± 0.852
9.128LysIle: 9.128 ± 2.056
9.423LysLys: 9.423 ± 1.729
7.362LysLeu: 7.362 ± 1.788
2.356LysMet: 2.356 ± 0.883
5.595LysAsn: 5.595 ± 1.171
1.472LysPro: 1.472 ± 0.535
4.122LysGln: 4.122 ± 0.987
4.122LysArg: 4.122 ± 1.41
7.951LysSer: 7.951 ± 1.127
4.711LysThr: 4.711 ± 1.021
4.711LysVal: 4.711 ± 0.761
0.883LysTrp: 0.883 ± 0.449
2.945LysTyr: 2.945 ± 0.66
0.0LysXaa: 0.0 ± 0.0
Leu
6.478LeuAla: 6.478 ± 1.866
0.0LeuCys: 0.0 ± 0.0
5.595LeuAsp: 5.595 ± 1.463
8.834LeuGlu: 8.834 ± 1.597
3.534LeuPhe: 3.534 ± 0.573
3.828LeuGly: 3.828 ± 1.097
1.767LeuHis: 1.767 ± 0.985
5.889LeuIle: 5.889 ± 1.698
8.834LeuLys: 8.834 ± 1.281
9.423LeuLeu: 9.423 ± 1.881
1.178LeuMet: 1.178 ± 0.449
5.3LeuAsn: 5.3 ± 1.656
3.239LeuPro: 3.239 ± 0.81
3.828LeuGln: 3.828 ± 1.526
3.828LeuArg: 3.828 ± 0.947
5.889LeuSer: 5.889 ± 1.363
4.417LeuThr: 4.417 ± 1.137
5.006LeuVal: 5.006 ± 1.313
0.589LeuTrp: 0.589 ± 0.318
2.356LeuTyr: 2.356 ± 0.591
0.0LeuXaa: 0.0 ± 0.0
Met
1.767MetAla: 1.767 ± 0.738
0.0MetCys: 0.0 ± 0.0
0.294MetAsp: 0.294 ± 0.29
2.65MetGlu: 2.65 ± 0.852
0.294MetPhe: 0.294 ± 0.232
0.589MetGly: 0.589 ± 0.528
0.0MetHis: 0.0 ± 0.0
1.178MetIle: 1.178 ± 0.473
3.534MetLys: 3.534 ± 0.918
2.356MetLeu: 2.356 ± 0.781
0.294MetMet: 0.294 ± 0.301
1.178MetAsn: 1.178 ± 0.514
0.0MetPro: 0.0 ± 0.0
0.294MetGln: 0.294 ± 0.29
0.883MetArg: 0.883 ± 0.606
0.294MetSer: 0.294 ± 0.232
1.178MetThr: 1.178 ± 0.562
1.178MetVal: 1.178 ± 0.535
0.0MetTrp: 0.0 ± 0.0
0.589MetTyr: 0.589 ± 0.432
0.0MetXaa: 0.0 ± 0.0
Asn
3.534AsnAla: 3.534 ± 1.08
0.589AsnCys: 0.589 ± 0.337
3.534AsnAsp: 3.534 ± 1.115
3.828AsnGlu: 3.828 ± 1.088
0.883AsnPhe: 0.883 ± 0.577
2.65AsnGly: 2.65 ± 0.97
1.178AsnHis: 1.178 ± 0.646
5.3AsnIle: 5.3 ± 1.245
5.595AsnLys: 5.595 ± 1.574
3.239AsnLeu: 3.239 ± 0.879
0.0AsnMet: 0.0 ± 0.258
5.006AsnAsn: 5.006 ± 1.128
2.061AsnPro: 2.061 ± 0.715
2.945AsnGln: 2.945 ± 0.92
3.828AsnArg: 3.828 ± 1.07
2.65AsnSer: 2.65 ± 0.813
5.3AsnThr: 5.3 ± 2.021
1.472AsnVal: 1.472 ± 0.65
0.0AsnTrp: 0.0 ± 0.0
0.883AsnTyr: 0.883 ± 0.427
0.0AsnXaa: 0.0 ± 0.0
Pro
1.472ProAla: 1.472 ± 0.481
0.0ProCys: 0.0 ± 0.0
1.178ProAsp: 1.178 ± 0.764
2.061ProGlu: 2.061 ± 0.695
0.883ProPhe: 0.883 ± 0.507
0.589ProGly: 0.589 ± 0.386
0.294ProHis: 0.294 ± 0.232
1.767ProIle: 1.767 ± 0.822
2.65ProLys: 2.65 ± 0.706
2.061ProLeu: 2.061 ± 0.743
0.589ProMet: 0.589 ± 0.454
2.356ProAsn: 2.356 ± 0.824
0.294ProPro: 0.294 ± 0.264
2.945ProGln: 2.945 ± 1.214
0.589ProArg: 0.589 ± 0.408
2.945ProSer: 2.945 ± 1.113
2.356ProThr: 2.356 ± 0.763
0.589ProVal: 0.589 ± 0.463
0.589ProTrp: 0.589 ± 0.399
0.589ProTyr: 0.589 ± 0.415
0.0ProXaa: 0.0 ± 0.0
Gln
3.534GlnAla: 3.534 ± 0.914
0.0GlnCys: 0.0 ± 0.0
2.356GlnAsp: 2.356 ± 1.009
4.417GlnGlu: 4.417 ± 1.175
1.767GlnPhe: 1.767 ± 0.492
1.767GlnGly: 1.767 ± 0.889
1.178GlnHis: 1.178 ± 0.625
4.417GlnIle: 4.417 ± 1.278
2.945GlnLys: 2.945 ± 1.255
4.122GlnLeu: 4.122 ± 1.28
1.472GlnMet: 1.472 ± 0.802
2.061GlnAsn: 2.061 ± 0.859
0.294GlnPro: 0.294 ± 0.232
2.061GlnGln: 2.061 ± 0.94
1.178GlnArg: 1.178 ± 0.569
4.122GlnSer: 4.122 ± 1.187
2.65GlnThr: 2.65 ± 0.498
2.945GlnVal: 2.945 ± 1.04
0.294GlnTrp: 0.294 ± 0.279
1.178GlnTyr: 1.178 ± 0.455
0.0GlnXaa: 0.0 ± 0.0
Arg
3.239ArgAla: 3.239 ± 0.995
0.0ArgCys: 0.0 ± 0.0
1.178ArgAsp: 1.178 ± 0.458
4.122ArgGlu: 4.122 ± 1.102
1.472ArgPhe: 1.472 ± 0.607
0.883ArgGly: 0.883 ± 0.556
0.883ArgHis: 0.883 ± 0.446
4.711ArgIle: 4.711 ± 1.309
2.65ArgLys: 2.65 ± 1.04
5.006ArgLeu: 5.006 ± 1.276
0.589ArgMet: 0.589 ± 0.356
1.767ArgAsn: 1.767 ± 0.903
0.589ArgPro: 0.589 ± 0.481
2.65ArgGln: 2.65 ± 0.555
0.883ArgArg: 0.883 ± 0.525
2.061ArgSer: 2.061 ± 0.64
2.061ArgThr: 2.061 ± 0.834
2.65ArgVal: 2.65 ± 0.963
0.294ArgTrp: 0.294 ± 0.297
1.767ArgTyr: 1.767 ± 0.746
0.0ArgXaa: 0.0 ± 0.0
Ser
2.945SerAla: 2.945 ± 0.769
1.472SerCys: 1.472 ± 0.73
4.417SerAsp: 4.417 ± 0.646
4.711SerGlu: 4.711 ± 0.78
2.65SerPhe: 2.65 ± 0.919
2.65SerGly: 2.65 ± 0.918
1.767SerHis: 1.767 ± 0.435
6.184SerIle: 6.184 ± 1.426
5.006SerLys: 5.006 ± 1.422
6.478SerLeu: 6.478 ± 1.186
1.767SerMet: 1.767 ± 0.693
2.945SerAsn: 2.945 ± 0.783
2.356SerPro: 2.356 ± 0.563
3.828SerGln: 3.828 ± 1.132
2.65SerArg: 2.65 ± 0.765
14.723SerSer: 14.723 ± 5.245
5.889SerThr: 5.889 ± 1.138
4.417SerVal: 4.417 ± 0.828
0.589SerTrp: 0.589 ± 0.427
4.417SerTyr: 4.417 ± 1.493
0.0SerXaa: 0.0 ± 0.0
Thr
2.65ThrAla: 2.65 ± 0.66
0.294ThrCys: 0.294 ± 0.232
2.061ThrAsp: 2.061 ± 0.724
5.3ThrGlu: 5.3 ± 1.176
2.65ThrPhe: 2.65 ± 0.963
3.239ThrGly: 3.239 ± 0.933
1.472ThrHis: 1.472 ± 0.629
4.417ThrIle: 4.417 ± 0.944
4.417ThrLys: 4.417 ± 0.971
6.478ThrLeu: 6.478 ± 1.288
1.178ThrMet: 1.178 ± 0.562
2.061ThrAsn: 2.061 ± 0.824
2.65ThrPro: 2.65 ± 0.79
2.356ThrGln: 2.356 ± 0.908
1.178ThrArg: 1.178 ± 0.78
5.889ThrSer: 5.889 ± 1.253
5.3ThrThr: 5.3 ± 1.462
5.3ThrVal: 5.3 ± 1.387
0.294ThrTrp: 0.294 ± 0.264
1.767ThrTyr: 1.767 ± 0.586
0.0ThrXaa: 0.0 ± 0.0
Val
4.417ValAla: 4.417 ± 1.356
0.294ValCys: 0.294 ± 0.264
3.828ValAsp: 3.828 ± 1.11
3.828ValGlu: 3.828 ± 0.966
2.061ValPhe: 2.061 ± 0.634
1.472ValGly: 1.472 ± 0.478
0.294ValHis: 0.294 ± 0.301
5.3ValIle: 5.3 ± 1.269
6.184ValLys: 6.184 ± 1.282
3.828ValLeu: 3.828 ± 1.025
0.883ValMet: 0.883 ± 0.428
4.122ValAsn: 4.122 ± 1.295
2.061ValPro: 2.061 ± 0.54
2.356ValGln: 2.356 ± 0.782
2.061ValArg: 2.061 ± 0.614
5.595ValSer: 5.595 ± 0.736
3.534ValThr: 3.534 ± 0.664
2.945ValVal: 2.945 ± 1.073
0.0ValTrp: 0.0 ± 0.0
4.417ValTyr: 4.417 ± 0.912
0.0ValXaa: 0.0 ± 0.0
Trp
0.589TrpAla: 0.589 ± 0.348
0.0TrpCys: 0.0 ± 0.0
0.294TrpAsp: 0.294 ± 0.232
0.589TrpGlu: 0.589 ± 0.39
0.0TrpPhe: 0.0 ± 0.0
0.294TrpGly: 0.294 ± 0.297
0.0TrpHis: 0.0 ± 0.0
0.294TrpIle: 0.294 ± 0.264
0.589TrpLys: 0.589 ± 0.384
0.883TrpLeu: 0.883 ± 0.507
0.0TrpMet: 0.0 ± 0.0
0.589TrpAsn: 0.589 ± 0.432
0.0TrpPro: 0.0 ± 0.0
0.589TrpGln: 0.589 ± 0.311
1.178TrpArg: 1.178 ± 0.416
1.178TrpSer: 1.178 ± 0.388
0.0TrpThr: 0.0 ± 0.0
1.178TrpVal: 1.178 ± 0.558
0.294TrpTrp: 0.294 ± 0.29
0.294TrpTyr: 0.294 ± 0.279
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.472TyrAla: 1.472 ± 0.538
0.589TyrCys: 0.589 ± 0.558
2.356TyrAsp: 2.356 ± 0.799
3.534TyrGlu: 3.534 ± 1.191
1.178TyrPhe: 1.178 ± 0.518
2.356TyrGly: 2.356 ± 0.956
0.589TyrHis: 0.589 ± 0.37
3.239TyrIle: 3.239 ± 1.221
4.122TyrLys: 4.122 ± 0.734
3.534TyrLeu: 3.534 ± 0.852
0.589TyrMet: 0.589 ± 0.423
2.356TyrAsn: 2.356 ± 0.805
1.472TyrPro: 1.472 ± 0.414
2.356TyrGln: 2.356 ± 0.648
2.945TyrArg: 2.945 ± 1.011
1.767TyrSer: 1.767 ± 0.599
1.178TyrThr: 1.178 ± 0.514
2.65TyrVal: 2.65 ± 0.727
0.0TyrTrp: 0.0 ± 0.0
2.061TyrTyr: 2.061 ± 0.726
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (3397 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski