Amino acid dipeptide frequency for Streptococcus satellite phage Javan419

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
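As an illustration of how such a table could be produced, here is a minimal Python sketch that counts overlapping dipeptides and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the one-letter alphabet, the pooling of non-standard residues into "X", and the choice not to count pairs across protein boundaries are all assumptions made for the example, and the toy sequences are made up.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is pooled into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: each protein is scanned separately, so no pair spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(ALPHABET, repeat=2)
    }


toy_proteins = ["MKLLEK", "MEKLX"]  # made-up sequences, not from the phage proteome
freqs = dipeptide_per_mille(toy_proteins)

# Expectation if all 400 standard dipeptides were equally likely: 2.5 per mille.
uniform_expectation = 1000.0 / 400

for pair, value in sorted(freqs.items(), key=lambda kv: -kv[1])[:3]:
    label = "over" if value > uniform_expectation else "under"
    print(f"{pair}: {value:.3f} per mille ({label}represented)")
```

The dictionary always has 441 entries (21 × 21), so dipeptides that never occur show up as 0.0, as in the table.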

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
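The unit conversion can be sketched in a few lines of Python. The helper name `per_mille_to_fraction` is hypothetical; the LeuGlu value is taken from the table, and the comparison baseline is the 2.5‰ expected if all 400 standard dipeptides were equally likely.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %

leu_glu = 11.588  # LeuGlu entry from the table, in per mille
fraction = per_mille_to_fraction(leu_glu)
percent = fraction * 100
enrichment = leu_glu / UNIFORM_PER_MILLE

print(f"fraction={fraction:.6f} percent={percent:.4f} enrichment={enrichment:.2f}x")
```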

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 1.131 ± 0.576
AlaAsp: 1.979 ± 0.796
AlaGlu: 4.522 ± 1.081
AlaPhe: 2.826 ± 0.796
AlaGly: 1.696 ± 0.612
AlaHis: 0.0 ± 0.0
AlaIle: 6.501 ± 1.599
AlaLys: 3.957 ± 0.611
AlaLeu: 5.37 ± 1.269
AlaMet: 1.979 ± 0.76
AlaAsn: 3.392 ± 0.956
AlaPro: 1.979 ± 0.793
AlaGln: 1.413 ± 0.767
AlaArg: 2.261 ± 0.607
AlaSer: 3.109 ± 1.08
AlaThr: 3.109 ± 0.668
AlaVal: 3.392 ± 0.923
AlaTrp: 0.848 ± 0.615
AlaTyr: 2.544 ± 0.855
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.131 ± 0.508
CysCys: 0.283 ± 0.271
CysAsp: 0.565 ± 0.388
CysGlu: 0.283 ± 0.271
CysPhe: 0.0 ± 0.0
CysGly: 0.283 ± 0.271
CysHis: 0.565 ± 0.433
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 1.131 ± 0.511
CysMet: 0.283 ± 0.282
CysAsn: 0.0 ± 0.0
CysPro: 0.565 ± 0.434
CysGln: 0.565 ± 0.358
CysArg: 0.565 ± 0.358
CysSer: 0.283 ± 0.269
CysThr: 0.283 ± 0.36
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.565 ± 0.385
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.544 ± 0.65
AspCys: 1.131 ± 0.572
AspAsp: 3.674 ± 0.993
AspGlu: 4.522 ± 1.007
AspPhe: 1.979 ± 0.734
AspGly: 2.261 ± 0.637
AspHis: 1.131 ± 0.65
AspIle: 6.501 ± 1.005
AspLys: 4.24 ± 0.964
AspLeu: 4.24 ± 1.096
AspMet: 2.261 ± 0.656
AspAsn: 3.392 ± 1.405
AspPro: 1.131 ± 0.463
AspGln: 2.544 ± 0.663
AspArg: 1.696 ± 0.558
AspSer: 1.979 ± 0.85
AspThr: 3.674 ± 1.165
AspVal: 1.413 ± 0.548
AspTrp: 0.283 ± 0.279
AspTyr: 2.826 ± 0.778
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.936 ± 1.023
GluCys: 1.696 ± 0.663
GluAsp: 4.24 ± 1.178
GluGlu: 5.936 ± 1.449
GluPhe: 1.979 ± 0.732
GluGly: 2.826 ± 0.862
GluHis: 1.696 ± 0.651
GluIle: 6.218 ± 1.369
GluLys: 7.631 ± 1.026
GluLeu: 11.023 ± 1.855
GluMet: 0.565 ± 0.453
GluAsn: 3.392 ± 0.966
GluPro: 1.979 ± 0.723
GluGln: 5.37 ± 1.45
GluArg: 5.37 ± 0.985
GluSer: 3.674 ± 1.511
GluThr: 3.392 ± 1.327
GluVal: 3.674 ± 0.978
GluTrp: 0.848 ± 0.553
GluTyr: 3.109 ± 0.581
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.979 ± 0.485
PheCys: 0.0 ± 0.0
PheAsp: 3.392 ± 0.706
PheGlu: 2.261 ± 0.728
PhePhe: 1.413 ± 0.542
PheGly: 1.696 ± 0.527
PheHis: 1.131 ± 0.488
PheIle: 1.696 ± 0.571
PheLys: 4.805 ± 0.978
PheLeu: 2.826 ± 0.962
PheMet: 0.283 ± 0.293
PheAsn: 1.979 ± 1.053
PhePro: 1.696 ± 0.562
PheGln: 1.696 ± 0.729
PheArg: 2.544 ± 0.665
PheSer: 2.826 ± 0.734
PheThr: 3.392 ± 0.742
PheVal: 1.413 ± 0.548
PheTrp: 0.283 ± 0.221
PheTyr: 1.413 ± 0.771
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.696 ± 1.247
GlyCys: 0.0 ± 0.0
GlyAsp: 3.109 ± 1.009
GlyGlu: 3.957 ± 0.985
GlyPhe: 1.696 ± 0.763
GlyGly: 2.826 ± 0.829
GlyHis: 1.413 ± 0.666
GlyIle: 2.544 ± 1.1
GlyLys: 4.522 ± 1.196
GlyLeu: 6.218 ± 1.375
GlyMet: 2.544 ± 0.704
GlyAsn: 1.979 ± 0.514
GlyPro: 0.0 ± 0.0
GlyGln: 2.826 ± 1.18
GlyArg: 2.261 ± 0.629
GlySer: 1.696 ± 0.65
GlyThr: 4.24 ± 1.269
GlyVal: 1.979 ± 0.683
GlyTrp: 0.565 ± 0.352
GlyTyr: 3.392 ± 0.879
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.979 ± 1.264
HisCys: 0.0 ± 0.0
HisAsp: 0.848 ± 0.472
HisGlu: 0.848 ± 0.556
HisPhe: 0.0 ± 0.0
HisGly: 0.848 ± 0.457
HisHis: 0.848 ± 0.381
HisIle: 0.283 ± 0.284
HisLys: 2.261 ± 0.711
HisLeu: 2.261 ± 0.953
HisMet: 0.283 ± 0.236
HisAsn: 1.413 ± 0.531
HisPro: 0.283 ± 0.221
HisGln: 1.696 ± 0.646
HisArg: 1.131 ± 0.533
HisSer: 1.131 ± 0.452
HisThr: 1.413 ± 0.508
HisVal: 0.565 ± 0.306
HisTrp: 0.565 ± 0.358
HisTyr: 1.979 ± 0.875
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.805 ± 1.211
IleCys: 0.283 ± 0.332
IleAsp: 7.349 ± 1.464
IleGlu: 4.24 ± 1.149
IlePhe: 1.696 ± 0.662
IleGly: 1.979 ± 0.743
IleHis: 0.848 ± 0.53
IleIle: 4.522 ± 0.917
IleLys: 8.479 ± 1.328
IleLeu: 4.24 ± 0.939
IleMet: 0.565 ± 0.442
IleAsn: 4.24 ± 0.957
IlePro: 2.261 ± 0.734
IleGln: 2.826 ± 0.815
IleArg: 3.674 ± 0.753
IleSer: 4.522 ± 1.101
IleThr: 7.066 ± 1.445
IleVal: 2.261 ± 0.562
IleTrp: 0.0 ± 0.0
IleTyr: 3.109 ± 1.272
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.501 ± 1.436
LysCys: 0.283 ± 0.297
LysAsp: 5.088 ± 1.235
LysGlu: 11.023 ± 1.452
LysPhe: 2.261 ± 0.616
LysGly: 5.653 ± 1.191
LysHis: 2.826 ± 0.608
LysIle: 7.066 ± 1.628
LysLys: 7.066 ± 1.951
LysLeu: 8.197 ± 1.773
LysMet: 1.413 ± 0.707
LysAsn: 3.109 ± 0.792
LysPro: 4.24 ± 1.049
LysGln: 7.066 ± 1.449
LysArg: 5.653 ± 1.015
LysSer: 3.109 ± 1.221
LysThr: 7.066 ± 1.445
LysVal: 5.088 ± 1.353
LysTrp: 0.565 ± 0.363
LysTyr: 2.261 ± 0.747
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.674 ± 1.194
LeuCys: 0.565 ± 0.38
LeuAsp: 4.24 ± 1.28
LeuGlu: 11.588 ± 1.689
LeuPhe: 4.24 ± 1.044
LeuGly: 6.783 ± 1.544
LeuHis: 1.696 ± 0.66
LeuIle: 5.653 ± 1.307
LeuLys: 8.762 ± 1.364
LeuLeu: 10.458 ± 1.611
LeuMet: 3.109 ± 0.878
LeuAsn: 5.088 ± 1.814
LeuPro: 5.088 ± 1.122
LeuGln: 3.109 ± 0.728
LeuArg: 1.979 ± 0.635
LeuSer: 7.066 ± 1.502
LeuThr: 4.522 ± 0.547
LeuVal: 5.37 ± 0.994
LeuTrp: 0.565 ± 0.329
LeuTyr: 5.653 ± 0.823
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.826 ± 0.983
MetCys: 0.0 ± 0.0
MetAsp: 1.131 ± 0.693
MetGlu: 0.565 ± 0.318
MetPhe: 0.848 ± 0.383
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 1.413 ± 0.657
MetLys: 2.826 ± 0.728
MetLeu: 2.544 ± 0.721
MetMet: 0.283 ± 0.278
MetAsn: 2.826 ± 0.697
MetPro: 0.283 ± 0.221
MetGln: 0.0 ± 0.0
MetArg: 1.413 ± 0.514
MetSer: 1.696 ± 0.692
MetThr: 3.392 ± 0.992
MetVal: 0.848 ± 0.532
MetTrp: 0.0 ± 0.0
MetTyr: 0.283 ± 0.278
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.674 ± 1.134
AsnCys: 0.283 ± 0.269
AsnAsp: 1.413 ± 0.584
AsnGlu: 3.674 ± 1.316
AsnPhe: 1.413 ± 0.711
AsnGly: 4.24 ± 1.036
AsnHis: 1.696 ± 0.582
AsnIle: 1.979 ± 0.874
AsnLys: 5.088 ± 1.081
AsnLeu: 5.37 ± 1.181
AsnMet: 1.413 ± 0.628
AsnAsn: 3.392 ± 1.373
AsnPro: 2.826 ± 0.698
AsnGln: 3.392 ± 1.073
AsnArg: 2.826 ± 0.69
AsnSer: 2.544 ± 0.769
AsnThr: 2.261 ± 0.89
AsnVal: 2.544 ± 0.619
AsnTrp: 0.848 ± 0.417
AsnTyr: 2.261 ± 0.771
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.848 ± 0.413
ProCys: 0.283 ± 0.284
ProAsp: 2.544 ± 0.917
ProGlu: 3.674 ± 0.658
ProPhe: 1.696 ± 0.665
ProGly: 0.283 ± 0.271
ProHis: 0.283 ± 0.278
ProIle: 1.413 ± 0.813
ProLys: 4.522 ± 1.097
ProLeu: 2.544 ± 0.993
ProMet: 0.565 ± 0.358
ProAsn: 2.261 ± 0.761
ProPro: 0.848 ± 0.437
ProGln: 1.131 ± 0.44
ProArg: 1.696 ± 0.454
ProSer: 2.261 ± 0.917
ProThr: 2.544 ± 0.746
ProVal: 1.696 ± 0.595
ProTrp: 0.0 ± 0.0
ProTyr: 1.131 ± 0.597
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.957 ± 0.886
GlnCys: 0.0 ± 0.0
GlnAsp: 2.261 ± 0.876
GlnGlu: 4.522 ± 1.324
GlnPhe: 1.696 ± 0.453
GlnGly: 2.261 ± 0.586
GlnHis: 1.131 ± 0.487
GlnIle: 3.392 ± 0.898
GlnLys: 6.218 ± 1.296
GlnLeu: 7.066 ± 0.935
GlnMet: 1.979 ± 0.69
GlnAsn: 1.979 ± 0.712
GlnPro: 2.261 ± 1.135
GlnGln: 2.544 ± 0.764
GlnArg: 1.979 ± 0.65
GlnSer: 2.826 ± 0.888
GlnThr: 1.413 ± 0.492
GlnVal: 2.826 ± 0.963
GlnTrp: 0.283 ± 0.25
GlnTyr: 1.131 ± 0.473
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.696 ± 0.649
ArgCys: 0.565 ± 0.358
ArgAsp: 2.544 ± 0.634
ArgGlu: 3.392 ± 0.894
ArgPhe: 2.544 ± 0.624
ArgGly: 2.826 ± 0.994
ArgHis: 1.131 ± 0.549
ArgIle: 2.544 ± 0.721
ArgLys: 5.088 ± 0.935
ArgLeu: 5.653 ± 1.28
ArgMet: 1.413 ± 0.477
ArgAsn: 1.979 ± 0.805
ArgPro: 1.413 ± 0.525
ArgGln: 3.674 ± 0.82
ArgArg: 1.413 ± 0.551
ArgSer: 1.413 ± 0.592
ArgThr: 3.674 ± 0.761
ArgVal: 2.261 ± 0.809
ArgTrp: 0.848 ± 0.488
ArgTyr: 2.544 ± 0.797
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.261 ± 0.802
SerCys: 0.283 ± 0.271
SerAsp: 2.826 ± 0.86
SerGlu: 4.522 ± 1.387
SerPhe: 2.261 ± 0.69
SerGly: 2.261 ± 0.72
SerHis: 0.848 ± 0.579
SerIle: 6.783 ± 0.963
SerLys: 5.653 ± 1.036
SerLeu: 5.37 ± 0.85
SerMet: 1.131 ± 0.542
SerAsn: 1.979 ± 0.651
SerPro: 0.565 ± 0.346
SerGln: 2.826 ± 1.007
SerArg: 2.544 ± 0.918
SerSer: 2.261 ± 0.909
SerThr: 3.109 ± 1.153
SerVal: 3.957 ± 0.84
SerTrp: 0.848 ± 0.437
SerTyr: 2.826 ± 0.885
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.674 ± 1.241
ThrCys: 0.283 ± 0.36
ThrAsp: 1.413 ± 0.549
ThrGlu: 3.674 ± 0.816
ThrPhe: 3.957 ± 1.296
ThrGly: 5.37 ± 1.208
ThrHis: 1.131 ± 0.499
ThrIle: 3.957 ± 1.131
ThrLys: 5.088 ± 1.202
ThrLeu: 5.653 ± 1.032
ThrMet: 1.413 ± 0.617
ThrAsn: 1.979 ± 0.636
ThrPro: 3.109 ± 0.809
ThrGln: 1.979 ± 0.546
ThrArg: 3.674 ± 0.839
ThrSer: 3.674 ± 1.086
ThrThr: 4.522 ± 1.166
ThrVal: 5.653 ± 1.34
ThrTrp: 0.848 ± 0.409
ThrTyr: 3.957 ± 1.423
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.261 ± 0.53
ValCys: 0.0 ± 0.0
ValAsp: 3.392 ± 1.031
ValGlu: 3.109 ± 0.626
ValPhe: 2.826 ± 1.008
ValGly: 2.544 ± 0.7
ValHis: 0.565 ± 0.556
ValIle: 4.24 ± 1.003
ValLys: 3.392 ± 0.789
ValLeu: 4.522 ± 0.837
ValMet: 0.565 ± 0.373
ValAsn: 2.826 ± 0.994
ValPro: 0.565 ± 0.443
ValGln: 3.392 ± 0.72
ValArg: 1.696 ± 0.596
ValSer: 5.37 ± 1.141
ValThr: 3.674 ± 1.171
ValVal: 1.696 ± 0.543
ValTrp: 0.848 ± 0.472
ValTyr: 1.413 ± 0.643
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.565 ± 0.325
TrpCys: 0.0 ± 0.0
TrpAsp: 0.283 ± 0.266
TrpGlu: 1.131 ± 0.538
TrpPhe: 0.0 ± 0.0
TrpGly: 0.283 ± 0.263
TrpHis: 0.0 ± 0.0
TrpIle: 0.283 ± 0.314
TrpLys: 0.848 ± 0.529
TrpLeu: 1.696 ± 0.772
TrpMet: 0.0 ± 0.0
TrpAsn: 0.565 ± 0.382
TrpPro: 0.0 ± 0.0
TrpGln: 0.565 ± 0.385
TrpArg: 0.565 ± 0.354
TrpSer: 1.131 ± 0.473
TrpThr: 0.0 ± 0.0
TrpVal: 1.131 ± 0.495
TrpTrp: 0.565 ± 0.325
TrpTyr: 0.565 ± 0.355
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.283 ± 0.332
TyrCys: 0.283 ± 0.276
TyrAsp: 1.131 ± 0.436
TyrGlu: 3.109 ± 1.09
TyrPhe: 3.392 ± 0.825
TyrGly: 2.544 ± 0.641
TyrHis: 1.696 ± 0.635
TyrIle: 1.979 ± 0.652
TyrLys: 5.088 ± 1.256
TyrLeu: 2.826 ± 0.652
TyrMet: 0.848 ± 0.658
TyrAsn: 5.37 ± 1.018
TyrPro: 1.131 ± 0.873
TyrGln: 3.109 ± 0.865
TyrArg: 3.674 ± 1.357
TyrSer: 2.544 ± 0.673
TyrThr: 2.261 ± 0.552
TyrVal: 1.131 ± 0.499
TyrTrp: 0.565 ± 0.543
TyrTyr: 3.109 ± 1.12
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 20 proteins (3539 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
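A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. This is only an illustration under stated assumptions: the function names are hypothetical, the toy sequences are made up, and using the sample standard deviation as the reported error is an assumption, since the page does not say which statistic is used.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation over replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resampling unit is the protein, not the individual residue.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy_proteins = ["MKLLEK", "MEKLA", "MLLKE", "MAEKK"]  # made-up sequences
value = pair_per_mille(toy_proteins, "EK")
error = bootstrap_error(toy_proteins, "EK")
print(f"EK: {value:.3f} ± {error:.3f} per mille")
```

Resampling whole proteins rather than single residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.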

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski