Amino acid dipepetide frequency for Streptococcus satellite phage Javan27

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.784AlaCys: 0.784 ± 0.399
4.444AlaAsp: 4.444 ± 1.058
2.614AlaGlu: 2.614 ± 0.98
2.353AlaPhe: 2.353 ± 0.558
0.261AlaGly: 0.261 ± 0.274
0.0AlaHis: 0.0 ± 0.0
6.536AlaIle: 6.536 ± 1.06
5.752AlaLys: 5.752 ± 1.347
4.183AlaLeu: 4.183 ± 0.872
1.046AlaMet: 1.046 ± 0.444
4.706AlaAsn: 4.706 ± 0.842
1.046AlaPro: 1.046 ± 0.456
2.353AlaGln: 2.353 ± 0.854
0.784AlaArg: 0.784 ± 0.441
2.614AlaSer: 2.614 ± 1.002
3.922AlaThr: 3.922 ± 0.944
1.83AlaVal: 1.83 ± 0.536
1.307AlaTrp: 1.307 ± 0.697
2.876AlaTyr: 2.876 ± 0.693
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.523CysAsp: 0.523 ± 0.322
0.261CysGlu: 0.261 ± 0.251
0.0CysPhe: 0.0 ± 0.0
0.523CysGly: 0.523 ± 0.341
0.523CysHis: 0.523 ± 0.334
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.784CysLeu: 0.784 ± 0.428
0.261CysMet: 0.261 ± 0.264
0.784CysAsn: 0.784 ± 0.393
0.261CysPro: 0.261 ± 0.251
0.523CysGln: 0.523 ± 0.501
0.784CysArg: 0.784 ± 0.395
0.523CysSer: 0.523 ± 0.334
0.0CysThr: 0.0 ± 0.0
0.261CysVal: 0.261 ± 0.25
0.0CysTrp: 0.0 ± 0.0
0.523CysTyr: 0.523 ± 0.499
0.0CysXaa: 0.0 ± 0.0
Asp
1.307AspAla: 1.307 ± 0.508
0.523AspCys: 0.523 ± 0.337
3.399AspAsp: 3.399 ± 1.073
4.706AspGlu: 4.706 ± 1.021
5.752AspPhe: 5.752 ± 1.293
1.569AspGly: 1.569 ± 0.514
0.784AspHis: 0.784 ± 0.459
4.967AspIle: 4.967 ± 1.237
7.582AspLys: 7.582 ± 1.323
4.967AspLeu: 4.967 ± 0.984
1.83AspMet: 1.83 ± 0.812
5.49AspAsn: 5.49 ± 1.811
1.046AspPro: 1.046 ± 0.548
0.784AspGln: 0.784 ± 0.47
4.706AspArg: 4.706 ± 0.775
2.876AspSer: 2.876 ± 0.895
3.399AspThr: 3.399 ± 1.038
1.307AspVal: 1.307 ± 0.723
0.523AspTrp: 0.523 ± 0.333
2.092AspTyr: 2.092 ± 0.881
0.0AspXaa: 0.0 ± 0.0
Glu
5.49GluAla: 5.49 ± 0.832
0.261GluCys: 0.261 ± 0.251
5.49GluAsp: 5.49 ± 1.619
5.229GluGlu: 5.229 ± 1.392
1.83GluPhe: 1.83 ± 0.572
3.399GluGly: 3.399 ± 0.814
1.307GluHis: 1.307 ± 0.494
5.49GluIle: 5.49 ± 1.186
7.059GluLys: 7.059 ± 1.303
12.288GluLeu: 12.288 ± 1.831
2.092GluMet: 2.092 ± 0.727
3.66GluAsn: 3.66 ± 0.928
0.784GluPro: 0.784 ± 0.391
6.013GluGln: 6.013 ± 1.461
3.922GluArg: 3.922 ± 0.907
2.353GluSer: 2.353 ± 0.822
4.444GluThr: 4.444 ± 0.981
3.922GluVal: 3.922 ± 1.194
0.523GluTrp: 0.523 ± 0.392
3.399GluTyr: 3.399 ± 0.869
0.0GluXaa: 0.0 ± 0.0
Phe
1.569PheAla: 1.569 ± 0.508
0.261PheCys: 0.261 ± 0.264
3.137PheAsp: 3.137 ± 1.221
3.137PheGlu: 3.137 ± 0.797
1.83PhePhe: 1.83 ± 0.515
1.83PheGly: 1.83 ± 0.727
1.569PheHis: 1.569 ± 0.426
4.444PheIle: 4.444 ± 1.159
5.752PheLys: 5.752 ± 0.821
4.444PheLeu: 4.444 ± 0.826
0.261PheMet: 0.261 ± 0.261
3.399PheAsn: 3.399 ± 0.721
1.569PhePro: 1.569 ± 0.675
1.046PheGln: 1.046 ± 0.494
2.614PheArg: 2.614 ± 0.64
4.444PheSer: 4.444 ± 1.327
1.569PheThr: 1.569 ± 0.603
1.569PheVal: 1.569 ± 0.675
0.0PheTrp: 0.0 ± 0.0
2.614PheTyr: 2.614 ± 0.58
0.0PheXaa: 0.0 ± 0.0
Gly
2.876GlyAla: 2.876 ± 1.001
0.523GlyCys: 0.523 ± 0.364
1.046GlyAsp: 1.046 ± 0.482
2.353GlyGlu: 2.353 ± 0.888
2.614GlyPhe: 2.614 ± 0.702
0.784GlyGly: 0.784 ± 0.401
1.046GlyHis: 1.046 ± 0.46
3.399GlyIle: 3.399 ± 0.805
2.353GlyLys: 2.353 ± 0.711
4.444GlyLeu: 4.444 ± 1.1
0.784GlyMet: 0.784 ± 0.469
1.569GlyAsn: 1.569 ± 0.63
0.784GlyPro: 0.784 ± 0.539
0.523GlyGln: 0.523 ± 0.333
1.569GlyArg: 1.569 ± 0.686
2.092GlySer: 2.092 ± 0.823
2.876GlyThr: 2.876 ± 0.754
2.353GlyVal: 2.353 ± 0.787
0.0GlyTrp: 0.0 ± 0.0
3.399GlyTyr: 3.399 ± 0.943
0.0GlyXaa: 0.0 ± 0.0
His
1.307HisAla: 1.307 ± 0.626
0.0HisCys: 0.0 ± 0.0
0.784HisAsp: 0.784 ± 0.53
0.523HisGlu: 0.523 ± 0.34
1.569HisPhe: 1.569 ± 0.533
1.046HisGly: 1.046 ± 0.498
0.0HisHis: 0.0 ± 0.0
1.83HisIle: 1.83 ± 0.621
2.353HisLys: 2.353 ± 0.715
1.569HisLeu: 1.569 ± 0.544
0.0HisMet: 0.0 ± 0.0
1.307HisAsn: 1.307 ± 0.71
0.261HisPro: 0.261 ± 0.275
1.569HisGln: 1.569 ± 0.778
0.261HisArg: 0.261 ± 0.26
0.784HisSer: 0.784 ± 0.38
0.784HisThr: 0.784 ± 0.467
0.784HisVal: 0.784 ± 0.461
0.523HisTrp: 0.523 ± 0.501
2.614HisTyr: 2.614 ± 0.77
0.0HisXaa: 0.0 ± 0.0
Ile
3.66IleAla: 3.66 ± 0.854
0.523IleCys: 0.523 ± 0.334
6.013IleAsp: 6.013 ± 0.834
7.582IleGlu: 7.582 ± 1.5
3.399IlePhe: 3.399 ± 0.95
2.353IleGly: 2.353 ± 0.65
1.307IleHis: 1.307 ± 0.536
6.536IleIle: 6.536 ± 1.343
11.765IleLys: 11.765 ± 1.701
5.229IleLeu: 5.229 ± 0.803
1.046IleMet: 1.046 ± 0.674
4.706IleAsn: 4.706 ± 1.085
3.137IlePro: 3.137 ± 0.974
2.092IleGln: 2.092 ± 0.533
2.876IleArg: 2.876 ± 0.868
6.275IleSer: 6.275 ± 1.371
5.49IleThr: 5.49 ± 0.975
3.66IleVal: 3.66 ± 0.986
0.0IleTrp: 0.0 ± 0.0
4.183IleTyr: 4.183 ± 1.033
0.0IleXaa: 0.0 ± 0.0
Lys
7.843LysAla: 7.843 ± 1.491
0.523LysCys: 0.523 ± 0.341
4.967LysAsp: 4.967 ± 1.125
8.627LysGlu: 8.627 ± 1.56
3.922LysPhe: 3.922 ± 1.033
2.876LysGly: 2.876 ± 0.683
3.137LysHis: 3.137 ± 0.874
5.49LysIle: 5.49 ± 1.413
9.935LysLys: 9.935 ± 1.766
8.366LysLeu: 8.366 ± 1.297
2.353LysMet: 2.353 ± 0.757
5.49LysAsn: 5.49 ± 1.323
2.876LysPro: 2.876 ± 0.659
4.444LysGln: 4.444 ± 1.297
5.229LysArg: 5.229 ± 1.232
7.582LysSer: 7.582 ± 1.846
5.229LysThr: 5.229 ± 1.191
4.967LysVal: 4.967 ± 0.932
1.046LysTrp: 1.046 ± 0.6
5.49LysTyr: 5.49 ± 1.43
0.0LysXaa: 0.0 ± 0.0
Leu
5.49LeuAla: 5.49 ± 1.415
0.523LeuCys: 0.523 ± 0.363
6.797LeuAsp: 6.797 ± 1.228
10.719LeuGlu: 10.719 ± 2.115
5.229LeuPhe: 5.229 ± 1.142
5.752LeuGly: 5.752 ± 1.389
1.307LeuHis: 1.307 ± 0.483
6.275LeuIle: 6.275 ± 1.453
9.15LeuLys: 9.15 ± 1.528
7.843LeuLeu: 7.843 ± 1.4
2.876LeuMet: 2.876 ± 0.859
7.582LeuAsn: 7.582 ± 1.213
2.353LeuPro: 2.353 ± 0.821
3.399LeuGln: 3.399 ± 1.022
4.183LeuArg: 4.183 ± 1.225
8.366LeuSer: 8.366 ± 1.035
3.922LeuThr: 3.922 ± 1.013
4.183LeuVal: 4.183 ± 1.081
0.523LeuTrp: 0.523 ± 0.34
3.66LeuTyr: 3.66 ± 0.839
0.0LeuXaa: 0.0 ± 0.0
Met
0.523MetAla: 0.523 ± 0.362
0.261MetCys: 0.261 ± 0.236
2.353MetAsp: 2.353 ± 0.861
1.046MetGlu: 1.046 ± 0.538
0.784MetPhe: 0.784 ± 0.432
0.784MetGly: 0.784 ± 0.43
0.0MetHis: 0.0 ± 0.0
2.614MetIle: 2.614 ± 0.927
1.569MetLys: 1.569 ± 0.669
3.399MetLeu: 3.399 ± 0.903
0.523MetMet: 0.523 ± 0.394
2.353MetAsn: 2.353 ± 0.892
0.261MetPro: 0.261 ± 0.268
0.523MetGln: 0.523 ± 0.295
0.784MetArg: 0.784 ± 0.401
0.784MetSer: 0.784 ± 0.406
2.614MetThr: 2.614 ± 0.708
0.784MetVal: 0.784 ± 0.451
0.0MetTrp: 0.0 ± 0.0
0.523MetTyr: 0.523 ± 0.387
0.0MetXaa: 0.0 ± 0.0
Asn
4.183AsnAla: 4.183 ± 0.828
0.523AsnCys: 0.523 ± 0.342
2.614AsnAsp: 2.614 ± 0.751
4.967AsnGlu: 4.967 ± 0.943
3.399AsnPhe: 3.399 ± 1.158
2.876AsnGly: 2.876 ± 0.83
1.569AsnHis: 1.569 ± 0.579
6.013AsnIle: 6.013 ± 1.341
4.706AsnLys: 4.706 ± 0.788
5.752AsnLeu: 5.752 ± 1.267
1.83AsnMet: 1.83 ± 0.986
2.614AsnAsn: 2.614 ± 0.964
2.614AsnPro: 2.614 ± 0.595
5.49AsnGln: 5.49 ± 1.413
2.614AsnArg: 2.614 ± 0.529
2.092AsnSer: 2.092 ± 0.767
3.399AsnThr: 3.399 ± 1.008
3.399AsnVal: 3.399 ± 1.07
0.784AsnTrp: 0.784 ± 0.34
2.876AsnTyr: 2.876 ± 0.688
0.0AsnXaa: 0.0 ± 0.0
Pro
1.046ProAla: 1.046 ± 0.475
0.261ProCys: 0.261 ± 0.263
1.569ProAsp: 1.569 ± 0.516
1.83ProGlu: 1.83 ± 0.726
0.784ProPhe: 0.784 ± 0.387
0.523ProGly: 0.523 ± 0.345
0.523ProHis: 0.523 ± 0.379
2.353ProIle: 2.353 ± 0.683
3.399ProLys: 3.399 ± 1.212
2.876ProLeu: 2.876 ± 0.753
0.523ProMet: 0.523 ± 0.453
1.83ProAsn: 1.83 ± 0.61
0.784ProPro: 0.784 ± 0.425
1.046ProGln: 1.046 ± 0.452
1.307ProArg: 1.307 ± 0.555
1.569ProSer: 1.569 ± 0.67
1.307ProThr: 1.307 ± 0.59
0.523ProVal: 0.523 ± 0.364
0.0ProTrp: 0.0 ± 0.0
0.784ProTyr: 0.784 ± 0.374
0.0ProXaa: 0.0 ± 0.0
Gln
3.399GlnAla: 3.399 ± 0.709
0.0GlnCys: 0.0 ± 0.0
1.569GlnAsp: 1.569 ± 0.657
3.922GlnGlu: 3.922 ± 1.116
1.046GlnPhe: 1.046 ± 0.534
2.353GlnGly: 2.353 ± 1.059
0.523GlnHis: 0.523 ± 0.388
3.399GlnIle: 3.399 ± 1.014
3.137GlnLys: 3.137 ± 0.65
4.444GlnLeu: 4.444 ± 0.962
1.046GlnMet: 1.046 ± 0.489
3.399GlnAsn: 3.399 ± 1.036
2.353GlnPro: 2.353 ± 1.021
2.353GlnGln: 2.353 ± 0.717
1.83GlnArg: 1.83 ± 0.492
2.353GlnSer: 2.353 ± 0.544
1.83GlnThr: 1.83 ± 0.585
2.353GlnVal: 2.353 ± 0.615
0.523GlnTrp: 0.523 ± 0.461
2.092GlnTyr: 2.092 ± 0.716
0.0GlnXaa: 0.0 ± 0.0
Arg
1.307ArgAla: 1.307 ± 0.571
0.261ArgCys: 0.261 ± 0.251
2.614ArgAsp: 2.614 ± 0.713
2.614ArgGlu: 2.614 ± 0.767
1.83ArgPhe: 1.83 ± 0.597
1.83ArgGly: 1.83 ± 0.84
1.569ArgHis: 1.569 ± 0.672
3.922ArgIle: 3.922 ± 0.953
3.66ArgLys: 3.66 ± 0.812
6.536ArgLeu: 6.536 ± 1.069
1.307ArgMet: 1.307 ± 0.621
3.66ArgAsn: 3.66 ± 1.126
0.523ArgPro: 0.523 ± 0.372
2.353ArgGln: 2.353 ± 0.783
2.353ArgArg: 2.353 ± 0.698
1.569ArgSer: 1.569 ± 0.556
3.137ArgThr: 3.137 ± 1.189
2.614ArgVal: 2.614 ± 0.704
0.523ArgTrp: 0.523 ± 0.369
2.614ArgTyr: 2.614 ± 0.903
0.0ArgXaa: 0.0 ± 0.0
Ser
2.614SerAla: 2.614 ± 0.897
0.261SerCys: 0.261 ± 0.251
5.229SerAsp: 5.229 ± 1.074
7.059SerGlu: 7.059 ± 1.479
3.137SerPhe: 3.137 ± 0.57
0.784SerGly: 0.784 ± 0.446
1.046SerHis: 1.046 ± 0.554
6.013SerIle: 6.013 ± 1.087
5.752SerLys: 5.752 ± 1.265
5.752SerLeu: 5.752 ± 1.29
1.83SerMet: 1.83 ± 0.757
2.092SerAsn: 2.092 ± 0.672
0.523SerPro: 0.523 ± 0.315
2.614SerGln: 2.614 ± 0.889
2.353SerArg: 2.353 ± 0.614
2.614SerSer: 2.614 ± 0.84
2.614SerThr: 2.614 ± 0.875
2.876SerVal: 2.876 ± 0.793
0.523SerTrp: 0.523 ± 0.363
3.137SerTyr: 3.137 ± 0.791
0.0SerXaa: 0.0 ± 0.0
Thr
2.353ThrAla: 2.353 ± 0.628
0.261ThrCys: 0.261 ± 0.249
2.876ThrAsp: 2.876 ± 0.919
3.66ThrGlu: 3.66 ± 0.714
1.83ThrPhe: 1.83 ± 0.912
2.876ThrGly: 2.876 ± 0.965
1.569ThrHis: 1.569 ± 0.668
5.229ThrIle: 5.229 ± 1.209
4.706ThrLys: 4.706 ± 1.444
6.013ThrLeu: 6.013 ± 1.175
0.784ThrMet: 0.784 ± 0.468
3.66ThrAsn: 3.66 ± 0.94
1.83ThrPro: 1.83 ± 0.528
3.922ThrGln: 3.922 ± 0.979
3.137ThrArg: 3.137 ± 0.761
1.569ThrSer: 1.569 ± 0.554
2.876ThrThr: 2.876 ± 0.725
3.399ThrVal: 3.399 ± 1.299
0.261ThrTrp: 0.261 ± 0.307
2.614ThrTyr: 2.614 ± 0.822
0.0ThrXaa: 0.0 ± 0.0
Val
2.353ValAla: 2.353 ± 0.767
0.261ValCys: 0.261 ± 0.272
1.83ValAsp: 1.83 ± 0.644
3.399ValGlu: 3.399 ± 0.959
2.092ValPhe: 2.092 ± 0.647
2.353ValGly: 2.353 ± 0.756
0.784ValHis: 0.784 ± 0.468
4.444ValIle: 4.444 ± 1.161
4.967ValLys: 4.967 ± 0.831
5.752ValLeu: 5.752 ± 1.355
0.784ValMet: 0.784 ± 0.438
1.569ValAsn: 1.569 ± 0.712
0.784ValPro: 0.784 ± 0.444
0.784ValGln: 0.784 ± 0.499
1.307ValArg: 1.307 ± 0.494
4.706ValSer: 4.706 ± 1.334
4.444ValThr: 4.444 ± 1.231
1.83ValVal: 1.83 ± 0.551
0.261ValTrp: 0.261 ± 0.257
0.784ValTyr: 0.784 ± 0.548
0.0ValXaa: 0.0 ± 0.0
Trp
0.261TrpAla: 0.261 ± 0.307
0.0TrpCys: 0.0 ± 0.0
1.046TrpAsp: 1.046 ± 0.418
1.569TrpGlu: 1.569 ± 0.519
0.261TrpPhe: 0.261 ± 0.276
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.523TrpIle: 0.523 ± 0.351
1.046TrpLys: 1.046 ± 0.485
1.046TrpLeu: 1.046 ± 0.508
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.261TrpGln: 0.261 ± 0.298
0.523TrpArg: 0.523 ± 0.357
0.784TrpSer: 0.784 ± 0.388
0.0TrpThr: 0.0 ± 0.0
0.523TrpVal: 0.523 ± 0.355
0.261TrpTrp: 0.261 ± 0.219
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.83TyrAla: 1.83 ± 0.761
0.523TyrCys: 0.523 ± 0.331
1.307TyrAsp: 1.307 ± 0.428
3.137TyrGlu: 3.137 ± 0.696
3.399TyrPhe: 3.399 ± 0.991
2.614TyrGly: 2.614 ± 0.954
1.307TyrHis: 1.307 ± 0.539
2.353TyrIle: 2.353 ± 0.647
5.752TyrLys: 5.752 ± 1.166
4.444TyrLeu: 4.444 ± 0.964
1.046TyrMet: 1.046 ± 0.608
4.444TyrAsn: 4.444 ± 0.773
1.307TyrPro: 1.307 ± 0.455
1.83TyrGln: 1.83 ± 0.567
3.66TyrArg: 3.66 ± 1.131
2.876TyrSer: 2.876 ± 1.229
1.569TyrThr: 1.569 ± 0.556
2.353TyrVal: 2.353 ± 0.894
0.523TyrTrp: 0.523 ± 0.385
2.353TyrTyr: 2.353 ± 0.978
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 28 proteins (3826 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski