Amino acid dipepetide frequency for Streptococcus satellite phage Javan22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.821AlaCys: 0.821 ± 0.424
4.65AlaAsp: 4.65 ± 1.046
3.009AlaGlu: 3.009 ± 1.017
2.462AlaPhe: 2.462 ± 0.59
0.274AlaGly: 0.274 ± 0.263
0.0AlaHis: 0.0 ± 0.0
6.565AlaIle: 6.565 ± 1.233
4.923AlaLys: 4.923 ± 1.037
4.376AlaLeu: 4.376 ± 1.053
1.094AlaMet: 1.094 ± 0.471
4.923AlaAsn: 4.923 ± 1.064
1.094AlaPro: 1.094 ± 0.496
2.462AlaGln: 2.462 ± 0.963
0.821AlaArg: 0.821 ± 0.469
2.735AlaSer: 2.735 ± 0.953
3.829AlaThr: 3.829 ± 0.879
1.915AlaVal: 1.915 ± 0.626
1.368AlaTrp: 1.368 ± 0.683
3.009AlaTyr: 3.009 ± 0.951
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.547CysAsp: 0.547 ± 0.388
0.274CysGlu: 0.274 ± 0.283
0.0CysPhe: 0.0 ± 0.0
0.274CysGly: 0.274 ± 0.283
0.547CysHis: 0.547 ± 0.325
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.821CysLeu: 0.821 ± 0.475
0.274CysMet: 0.274 ± 0.276
0.821CysAsn: 0.821 ± 0.354
0.274CysPro: 0.274 ± 0.283
0.547CysGln: 0.547 ± 0.566
0.821CysArg: 0.821 ± 0.389
0.547CysSer: 0.547 ± 0.446
0.0CysThr: 0.0 ± 0.0
0.274CysVal: 0.274 ± 0.265
0.0CysTrp: 0.0 ± 0.0
0.821CysTyr: 0.821 ± 0.6
0.0CysXaa: 0.0 ± 0.0
Asp
1.368AspAla: 1.368 ± 0.588
0.547AspCys: 0.547 ± 0.371
3.282AspAsp: 3.282 ± 1.224
4.923AspGlu: 4.923 ± 0.966
5.47AspPhe: 5.47 ± 1.216
1.641AspGly: 1.641 ± 0.516
0.821AspHis: 0.821 ± 0.496
5.47AspIle: 5.47 ± 1.154
7.659AspLys: 7.659 ± 1.342
5.197AspLeu: 5.197 ± 1.004
1.641AspMet: 1.641 ± 0.692
5.744AspAsn: 5.744 ± 1.959
1.094AspPro: 1.094 ± 0.602
0.547AspGln: 0.547 ± 0.373
4.65AspArg: 4.65 ± 0.7
2.735AspSer: 2.735 ± 0.865
3.556AspThr: 3.556 ± 1.123
1.368AspVal: 1.368 ± 0.718
0.274AspTrp: 0.274 ± 0.258
1.915AspTyr: 1.915 ± 0.697
0.0AspXaa: 0.0 ± 0.0
Glu
5.197GluAla: 5.197 ± 0.879
0.274GluCys: 0.274 ± 0.283
4.923GluAsp: 4.923 ± 1.623
4.923GluGlu: 4.923 ± 1.342
1.915GluPhe: 1.915 ± 0.61
3.556GluGly: 3.556 ± 0.793
1.368GluHis: 1.368 ± 0.549
5.744GluIle: 5.744 ± 1.358
7.112GluLys: 7.112 ± 1.404
12.582GluLeu: 12.582 ± 1.911
1.915GluMet: 1.915 ± 0.618
3.556GluAsn: 3.556 ± 1.027
0.821GluPro: 0.821 ± 0.428
6.018GluGln: 6.018 ± 1.569
4.103GluArg: 4.103 ± 0.805
2.462GluSer: 2.462 ± 0.858
4.65GluThr: 4.65 ± 1.104
3.829GluVal: 3.829 ± 1.269
0.547GluTrp: 0.547 ± 0.403
3.556GluTyr: 3.556 ± 0.827
0.0GluXaa: 0.0 ± 0.0
Phe
1.641PheAla: 1.641 ± 0.551
0.274PheCys: 0.274 ± 0.276
3.009PheAsp: 3.009 ± 1.134
3.282PheGlu: 3.282 ± 0.867
1.915PhePhe: 1.915 ± 0.505
1.915PheGly: 1.915 ± 0.703
1.641PheHis: 1.641 ± 0.419
4.376PheIle: 4.376 ± 1.332
5.197PheLys: 5.197 ± 0.757
4.103PheLeu: 4.103 ± 0.947
0.274PheMet: 0.274 ± 0.27
3.556PheAsn: 3.556 ± 0.851
1.641PhePro: 1.641 ± 0.715
0.821PheGln: 0.821 ± 0.413
2.462PheArg: 2.462 ± 0.827
4.923PheSer: 4.923 ± 1.248
1.641PheThr: 1.641 ± 0.718
1.641PheVal: 1.641 ± 0.681
0.0PheTrp: 0.0 ± 0.0
2.188PheTyr: 2.188 ± 0.53
0.0PheXaa: 0.0 ± 0.0
Gly
3.009GlyAla: 3.009 ± 0.984
0.547GlyCys: 0.547 ± 0.333
1.094GlyAsp: 1.094 ± 0.41
2.462GlyGlu: 2.462 ± 1.017
2.735GlyPhe: 2.735 ± 0.791
1.094GlyGly: 1.094 ± 0.539
1.094GlyHis: 1.094 ± 0.508
3.556GlyIle: 3.556 ± 0.875
2.462GlyLys: 2.462 ± 0.769
4.65GlyLeu: 4.65 ± 1.373
0.821GlyMet: 0.821 ± 0.435
1.641GlyAsn: 1.641 ± 0.657
0.821GlyPro: 0.821 ± 0.527
0.274GlyGln: 0.274 ± 0.258
1.641GlyArg: 1.641 ± 0.724
1.915GlySer: 1.915 ± 0.874
3.009GlyThr: 3.009 ± 0.895
1.915GlyVal: 1.915 ± 0.749
0.0GlyTrp: 0.0 ± 0.0
3.829GlyTyr: 3.829 ± 0.951
0.0GlyXaa: 0.0 ± 0.0
His
1.368HisAla: 1.368 ± 0.672
0.0HisCys: 0.0 ± 0.0
0.821HisAsp: 0.821 ± 0.606
0.547HisGlu: 0.547 ± 0.396
1.641HisPhe: 1.641 ± 0.587
1.094HisGly: 1.094 ± 0.5
0.0HisHis: 0.0 ± 0.0
1.915HisIle: 1.915 ± 0.763
2.188HisLys: 2.188 ± 0.776
1.368HisLeu: 1.368 ± 0.482
0.0HisMet: 0.0 ± 0.0
1.094HisAsn: 1.094 ± 0.694
0.274HisPro: 0.274 ± 0.297
1.368HisGln: 1.368 ± 0.752
0.274HisArg: 0.274 ± 0.29
0.821HisSer: 0.821 ± 0.368
0.821HisThr: 0.821 ± 0.465
0.547HisVal: 0.547 ± 0.342
0.547HisTrp: 0.547 ± 0.566
2.735HisTyr: 2.735 ± 0.838
0.0HisXaa: 0.0 ± 0.0
Ile
4.103IleAla: 4.103 ± 0.929
0.547IleCys: 0.547 ± 0.325
6.018IleAsp: 6.018 ± 0.833
7.659IleGlu: 7.659 ± 1.514
3.282IlePhe: 3.282 ± 1.008
2.462IleGly: 2.462 ± 0.74
1.368IleHis: 1.368 ± 0.495
6.838IleIle: 6.838 ± 1.464
12.309IleLys: 12.309 ± 1.59
5.197IleLeu: 5.197 ± 0.999
1.094IleMet: 1.094 ± 0.631
4.376IleAsn: 4.376 ± 1.181
2.735IlePro: 2.735 ± 0.758
2.188IleGln: 2.188 ± 0.625
3.009IleArg: 3.009 ± 0.847
6.018IleSer: 6.018 ± 1.424
5.47IleThr: 5.47 ± 0.975
3.829IleVal: 3.829 ± 0.937
0.0IleTrp: 0.0 ± 0.0
4.376IleTyr: 4.376 ± 1.038
0.0IleXaa: 0.0 ± 0.0
Lys
8.206LysAla: 8.206 ± 1.394
0.274LysCys: 0.274 ± 0.283
5.197LysAsp: 5.197 ± 0.969
9.026LysGlu: 9.026 ± 1.714
3.829LysPhe: 3.829 ± 0.942
3.009LysGly: 3.009 ± 0.94
2.462LysHis: 2.462 ± 1.007
5.197LysIle: 5.197 ± 1.358
10.394LysLys: 10.394 ± 1.916
7.932LysLeu: 7.932 ± 1.192
2.462LysMet: 2.462 ± 0.794
6.018LysAsn: 6.018 ± 1.431
3.009LysPro: 3.009 ± 0.754
4.103LysGln: 4.103 ± 1.424
5.197LysArg: 5.197 ± 1.108
7.659LysSer: 7.659 ± 1.844
5.197LysThr: 5.197 ± 1.246
5.197LysVal: 5.197 ± 0.911
1.094LysTrp: 1.094 ± 0.684
5.197LysTyr: 5.197 ± 1.408
0.0LysXaa: 0.0 ± 0.0
Leu
5.744LeuAla: 5.744 ± 1.48
0.821LeuCys: 0.821 ± 0.452
6.838LeuAsp: 6.838 ± 1.294
10.667LeuGlu: 10.667 ± 1.761
5.197LeuPhe: 5.197 ± 1.097
5.744LeuGly: 5.744 ± 1.595
1.368LeuHis: 1.368 ± 0.54
6.018LeuIle: 6.018 ± 1.346
9.026LeuLys: 9.026 ± 1.563
8.479LeuLeu: 8.479 ± 1.51
2.735LeuMet: 2.735 ± 0.924
7.659LeuAsn: 7.659 ± 1.392
2.188LeuPro: 2.188 ± 0.806
3.009LeuGln: 3.009 ± 0.835
4.103LeuArg: 4.103 ± 1.08
7.659LeuSer: 7.659 ± 0.998
3.829LeuThr: 3.829 ± 1.117
3.556LeuVal: 3.556 ± 0.931
0.547LeuTrp: 0.547 ± 0.342
3.556LeuTyr: 3.556 ± 0.909
0.0LeuXaa: 0.0 ± 0.0
Met
0.547MetAla: 0.547 ± 0.415
0.274MetCys: 0.274 ± 0.259
2.462MetAsp: 2.462 ± 0.917
0.821MetGlu: 0.821 ± 0.466
0.821MetPhe: 0.821 ± 0.425
0.821MetGly: 0.821 ± 0.435
0.0MetHis: 0.0 ± 0.0
3.009MetIle: 3.009 ± 0.919
1.641MetLys: 1.641 ± 0.758
3.556MetLeu: 3.556 ± 0.984
0.274MetMet: 0.274 ± 0.263
2.188MetAsn: 2.188 ± 0.819
0.274MetPro: 0.274 ± 0.28
0.547MetGln: 0.547 ± 0.347
0.821MetArg: 0.821 ± 0.463
0.547MetSer: 0.547 ± 0.399
2.462MetThr: 2.462 ± 0.746
0.547MetVal: 0.547 ± 0.416
0.0MetTrp: 0.0 ± 0.0
0.274MetTyr: 0.274 ± 0.247
0.0MetXaa: 0.0 ± 0.0
Asn
4.103AsnAla: 4.103 ± 0.947
0.547AsnCys: 0.547 ± 0.379
2.462AsnAsp: 2.462 ± 0.698
4.65AsnGlu: 4.65 ± 0.915
3.009AsnPhe: 3.009 ± 1.085
2.735AsnGly: 2.735 ± 0.841
1.641AsnHis: 1.641 ± 0.668
6.291AsnIle: 6.291 ± 1.362
4.923AsnLys: 4.923 ± 0.791
5.744AsnLeu: 5.744 ± 1.167
1.641AsnMet: 1.641 ± 1.051
2.462AsnAsn: 2.462 ± 0.992
2.462AsnPro: 2.462 ± 0.653
5.744AsnGln: 5.744 ± 1.521
3.009AsnArg: 3.009 ± 0.602
2.188AsnSer: 2.188 ± 0.866
3.282AsnThr: 3.282 ± 0.978
3.009AsnVal: 3.009 ± 1.058
0.821AsnTrp: 0.821 ± 0.387
3.009AsnTyr: 3.009 ± 0.773
0.0AsnXaa: 0.0 ± 0.0
Pro
1.094ProAla: 1.094 ± 0.452
0.274ProCys: 0.274 ± 0.328
1.641ProAsp: 1.641 ± 0.522
1.915ProGlu: 1.915 ± 0.766
0.821ProPhe: 0.821 ± 0.448
0.547ProGly: 0.547 ± 0.389
0.274ProHis: 0.274 ± 0.247
2.462ProIle: 2.462 ± 0.678
3.556ProLys: 3.556 ± 1.311
3.009ProLeu: 3.009 ± 0.894
0.547ProMet: 0.547 ± 0.399
1.641ProAsn: 1.641 ± 0.546
0.821ProPro: 0.821 ± 0.497
1.094ProGln: 1.094 ± 0.566
1.094ProArg: 1.094 ± 0.503
1.641ProSer: 1.641 ± 0.635
1.094ProThr: 1.094 ± 0.516
0.547ProVal: 0.547 ± 0.373
0.0ProTrp: 0.0 ± 0.0
0.821ProTyr: 0.821 ± 0.409
0.0ProXaa: 0.0 ± 0.0
Gln
3.556GlnAla: 3.556 ± 0.766
0.0GlnCys: 0.0 ± 0.0
1.915GlnAsp: 1.915 ± 0.767
4.103GlnGlu: 4.103 ± 1.074
0.821GlnPhe: 0.821 ± 0.526
2.462GlnGly: 2.462 ± 1.005
0.547GlnHis: 0.547 ± 0.381
3.009GlnIle: 3.009 ± 0.975
2.735GlnLys: 2.735 ± 0.592
4.103GlnLeu: 4.103 ± 1.232
0.547GlnMet: 0.547 ± 0.42
2.735GlnAsn: 2.735 ± 1.004
2.188GlnPro: 2.188 ± 1.145
2.735GlnGln: 2.735 ± 0.743
1.915GlnArg: 1.915 ± 0.54
2.735GlnSer: 2.735 ± 0.863
1.368GlnThr: 1.368 ± 0.471
2.462GlnVal: 2.462 ± 0.692
0.547GlnTrp: 0.547 ± 0.517
2.188GlnTyr: 2.188 ± 0.674
0.0GlnXaa: 0.0 ± 0.0
Arg
1.641ArgAla: 1.641 ± 0.709
0.274ArgCys: 0.274 ± 0.283
2.735ArgAsp: 2.735 ± 0.85
2.462ArgGlu: 2.462 ± 0.805
1.915ArgPhe: 1.915 ± 0.558
2.188ArgGly: 2.188 ± 0.958
1.368ArgHis: 1.368 ± 0.586
3.829ArgIle: 3.829 ± 0.844
3.556ArgLys: 3.556 ± 0.834
6.018ArgLeu: 6.018 ± 1.015
1.368ArgMet: 1.368 ± 0.666
3.829ArgAsn: 3.829 ± 1.063
0.547ArgPro: 0.547 ± 0.444
2.462ArgGln: 2.462 ± 0.786
2.462ArgArg: 2.462 ± 0.645
1.915ArgSer: 1.915 ± 0.576
3.282ArgThr: 3.282 ± 1.148
2.735ArgVal: 2.735 ± 0.761
0.547ArgTrp: 0.547 ± 0.4
2.735ArgTyr: 2.735 ± 0.98
0.0ArgXaa: 0.0 ± 0.0
Ser
2.735SerAla: 2.735 ± 1.124
0.274SerCys: 0.274 ± 0.283
5.197SerAsp: 5.197 ± 0.972
6.565SerGlu: 6.565 ± 1.735
3.009SerPhe: 3.009 ± 0.677
0.821SerGly: 0.821 ± 0.436
1.094SerHis: 1.094 ± 0.61
6.291SerIle: 6.291 ± 1.209
6.018SerLys: 6.018 ± 1.381
5.197SerLeu: 5.197 ± 1.329
2.188SerMet: 2.188 ± 0.793
2.188SerAsn: 2.188 ± 0.65
0.547SerPro: 0.547 ± 0.315
2.462SerGln: 2.462 ± 0.975
2.462SerArg: 2.462 ± 0.767
2.188SerSer: 2.188 ± 0.801
2.462SerThr: 2.462 ± 0.89
3.009SerVal: 3.009 ± 0.858
0.821SerTrp: 0.821 ± 0.417
3.282SerTyr: 3.282 ± 0.887
0.0SerXaa: 0.0 ± 0.0
Thr
2.188ThrAla: 2.188 ± 0.671
0.274ThrCys: 0.274 ± 0.247
3.009ThrAsp: 3.009 ± 0.98
3.556ThrGlu: 3.556 ± 0.659
1.915ThrPhe: 1.915 ± 1.098
3.282ThrGly: 3.282 ± 0.884
1.641ThrHis: 1.641 ± 0.645
4.923ThrIle: 4.923 ± 1.189
5.197ThrLys: 5.197 ± 1.376
5.744ThrLeu: 5.744 ± 1.225
0.821ThrMet: 0.821 ± 0.423
3.282ThrAsn: 3.282 ± 1.171
1.915ThrPro: 1.915 ± 0.539
3.829ThrGln: 3.829 ± 0.901
3.282ThrArg: 3.282 ± 0.835
1.641ThrSer: 1.641 ± 0.613
3.009ThrThr: 3.009 ± 0.833
3.282ThrVal: 3.282 ± 1.149
0.0ThrTrp: 0.0 ± 0.0
2.462ThrTyr: 2.462 ± 0.81
0.0ThrXaa: 0.0 ± 0.0
Val
1.915ValAla: 1.915 ± 0.62
0.274ValCys: 0.274 ± 0.307
1.915ValAsp: 1.915 ± 0.725
3.282ValGlu: 3.282 ± 1.016
2.188ValPhe: 2.188 ± 0.654
2.188ValGly: 2.188 ± 0.753
0.821ValHis: 0.821 ± 0.423
4.65ValIle: 4.65 ± 1.031
4.65ValLys: 4.65 ± 0.98
5.197ValLeu: 5.197 ± 1.272
0.821ValMet: 0.821 ± 0.496
1.368ValAsn: 1.368 ± 0.632
0.821ValPro: 0.821 ± 0.401
0.821ValGln: 0.821 ± 0.469
1.368ValArg: 1.368 ± 0.52
4.376ValSer: 4.376 ± 1.217
4.65ValThr: 4.65 ± 1.311
1.915ValVal: 1.915 ± 0.652
0.274ValTrp: 0.274 ± 0.266
0.821ValTyr: 0.821 ± 0.591
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.094TrpAsp: 1.094 ± 0.5
1.641TrpGlu: 1.641 ± 0.535
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.547TrpIle: 0.547 ± 0.364
1.094TrpLys: 1.094 ± 0.488
1.094TrpLeu: 1.094 ± 0.592
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.547TrpArg: 0.547 ± 0.399
0.821TrpSer: 0.821 ± 0.399
0.0TrpThr: 0.0 ± 0.0
0.547TrpVal: 0.547 ± 0.375
0.274TrpTrp: 0.274 ± 0.202
0.274TrpTyr: 0.274 ± 0.304
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.915TyrAla: 1.915 ± 0.717
0.547TyrCys: 0.547 ± 0.33
1.094TyrAsp: 1.094 ± 0.448
3.282TyrGlu: 3.282 ± 0.819
3.556TyrPhe: 3.556 ± 0.911
2.462TyrGly: 2.462 ± 0.941
1.368TyrHis: 1.368 ± 0.615
2.188TyrIle: 2.188 ± 0.653
6.018TyrLys: 6.018 ± 1.243
4.376TyrLeu: 4.376 ± 1.069
1.094TyrMet: 1.094 ± 0.667
4.65TyrAsn: 4.65 ± 0.809
1.368TyrPro: 1.368 ± 0.459
1.915TyrGln: 1.915 ± 0.775
3.829TyrArg: 3.829 ± 1.133
3.282TyrSer: 3.282 ± 1.293
1.641TyrThr: 1.641 ± 0.57
2.188TyrVal: 2.188 ± 0.807
0.274TyrTrp: 0.274 ± 0.283
2.735TyrTyr: 2.735 ± 1.013
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 26 proteins (3657 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski