Amino acid dipepetide frequency for Streptococcus satellite phage Javan344

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.279AlaAla: 0.279 ± 0.352
0.279AlaCys: 0.279 ± 0.224
3.902AlaAsp: 3.902 ± 1.025
8.64AlaGlu: 8.64 ± 1.978
5.017AlaPhe: 5.017 ± 1.328
1.672AlaGly: 1.672 ± 0.567
0.279AlaHis: 0.279 ± 0.327
4.738AlaIle: 4.738 ± 0.958
4.738AlaLys: 4.738 ± 1.064
6.41AlaLeu: 6.41 ± 1.735
2.787AlaMet: 2.787 ± 1.241
2.508AlaAsn: 2.508 ± 0.683
1.115AlaPro: 1.115 ± 0.706
3.066AlaGln: 3.066 ± 0.871
3.066AlaArg: 3.066 ± 0.81
2.508AlaSer: 2.508 ± 0.75
2.23AlaThr: 2.23 ± 0.825
3.623AlaVal: 3.623 ± 0.744
0.557AlaTrp: 0.557 ± 0.296
3.344AlaTyr: 3.344 ± 0.727
0.0AlaXaa: 0.0 ± 0.0
Cys
0.279CysAla: 0.279 ± 0.302
0.0CysCys: 0.0 ± 0.0
0.557CysAsp: 0.557 ± 0.503
1.115CysGlu: 1.115 ± 0.5
0.0CysPhe: 0.0 ± 0.0
0.557CysGly: 0.557 ± 0.503
0.557CysHis: 0.557 ± 0.26
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.557CysLeu: 0.557 ± 0.351
0.279CysMet: 0.279 ± 0.312
0.557CysAsn: 0.557 ± 0.392
1.115CysPro: 1.115 ± 0.529
0.557CysGln: 0.557 ± 0.376
1.115CysArg: 1.115 ± 0.655
0.557CysSer: 0.557 ± 0.357
0.279CysThr: 0.279 ± 0.27
0.557CysVal: 0.557 ± 0.296
0.0CysTrp: 0.0 ± 0.0
0.279CysTyr: 0.279 ± 0.312
0.0CysXaa: 0.0 ± 0.0
Asp
1.951AspAla: 1.951 ± 0.779
0.0AspCys: 0.0 ± 0.0
3.902AspAsp: 3.902 ± 1.084
6.41AspGlu: 6.41 ± 1.313
3.344AspPhe: 3.344 ± 1.109
3.902AspGly: 3.902 ± 1.263
0.557AspHis: 0.557 ± 0.361
5.295AspIle: 5.295 ± 1.353
9.755AspLys: 9.755 ± 2.027
7.246AspLeu: 7.246 ± 1.453
1.115AspMet: 1.115 ± 0.446
3.623AspAsn: 3.623 ± 0.771
1.115AspPro: 1.115 ± 0.593
1.115AspGln: 1.115 ± 0.668
3.902AspArg: 3.902 ± 0.966
3.066AspSer: 3.066 ± 0.927
1.951AspThr: 1.951 ± 1.179
3.066AspVal: 3.066 ± 0.812
0.557AspTrp: 0.557 ± 0.497
3.623AspTyr: 3.623 ± 1.04
0.0AspXaa: 0.0 ± 0.0
Glu
6.132GluAla: 6.132 ± 1.506
0.279GluCys: 0.279 ± 0.216
4.738GluAsp: 4.738 ± 1.168
6.41GluGlu: 6.41 ± 1.426
3.902GluPhe: 3.902 ± 1.12
2.508GluGly: 2.508 ± 0.791
2.508GluHis: 2.508 ± 0.841
8.919GluIle: 8.919 ± 1.514
9.755GluLys: 9.755 ± 1.299
10.312GluLeu: 10.312 ± 1.733
2.508GluMet: 2.508 ± 0.685
5.853GluAsn: 5.853 ± 1.287
0.836GluPro: 0.836 ± 0.498
3.623GluGln: 3.623 ± 0.607
4.738GluArg: 4.738 ± 1.254
2.787GluSer: 2.787 ± 0.756
7.246GluThr: 7.246 ± 0.777
5.295GluVal: 5.295 ± 1.279
0.557GluTrp: 0.557 ± 0.397
5.017GluTyr: 5.017 ± 0.951
0.0GluXaa: 0.0 ± 0.0
Phe
2.23PheAla: 2.23 ± 0.834
0.0PheCys: 0.0 ± 0.0
4.181PheAsp: 4.181 ± 1.323
3.623PheGlu: 3.623 ± 1.146
3.344PhePhe: 3.344 ± 1.644
2.23PheGly: 2.23 ± 0.742
1.394PheHis: 1.394 ± 0.446
3.344PheIle: 3.344 ± 0.99
4.738PheLys: 4.738 ± 1.631
3.623PheLeu: 3.623 ± 0.773
1.115PheMet: 1.115 ± 0.487
2.23PheAsn: 2.23 ± 0.82
0.836PhePro: 0.836 ± 0.394
0.836PheGln: 0.836 ± 0.497
1.951PheArg: 1.951 ± 0.812
3.066PheSer: 3.066 ± 0.886
2.508PheThr: 2.508 ± 0.881
3.902PheVal: 3.902 ± 1.109
0.279PheTrp: 0.279 ± 0.241
2.23PheTyr: 2.23 ± 0.755
0.0PheXaa: 0.0 ± 0.0
Gly
2.23GlyAla: 2.23 ± 1.24
0.279GlyCys: 0.279 ± 0.302
1.951GlyAsp: 1.951 ± 0.825
3.902GlyGlu: 3.902 ± 1.153
2.787GlyPhe: 2.787 ± 0.723
0.836GlyGly: 0.836 ± 0.591
0.836GlyHis: 0.836 ± 0.46
2.508GlyIle: 2.508 ± 0.77
3.344GlyLys: 3.344 ± 0.822
4.459GlyLeu: 4.459 ± 1.092
1.672GlyMet: 1.672 ± 0.693
0.557GlyAsn: 0.557 ± 0.359
0.836GlyPro: 0.836 ± 0.598
1.394GlyGln: 1.394 ± 0.643
2.787GlyArg: 2.787 ± 0.71
0.836GlySer: 0.836 ± 0.524
2.23GlyThr: 2.23 ± 0.955
2.787GlyVal: 2.787 ± 0.722
0.836GlyTrp: 0.836 ± 0.566
2.23GlyTyr: 2.23 ± 0.934
0.0GlyXaa: 0.0 ± 0.0
His
1.394HisAla: 1.394 ± 0.574
0.557HisCys: 0.557 ± 0.503
1.672HisAsp: 1.672 ± 0.63
0.279HisGlu: 0.279 ± 0.309
1.115HisPhe: 1.115 ± 0.591
1.672HisGly: 1.672 ± 0.7
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
2.508HisLys: 2.508 ± 0.751
0.836HisLeu: 0.836 ± 0.507
0.557HisMet: 0.557 ± 0.331
0.836HisAsn: 0.836 ± 0.384
0.557HisPro: 0.557 ± 0.482
1.115HisGln: 1.115 ± 0.639
0.557HisArg: 0.557 ± 0.474
1.394HisSer: 1.394 ± 0.479
0.836HisThr: 0.836 ± 0.533
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.951HisTyr: 1.951 ± 0.654
0.0HisXaa: 0.0 ± 0.0
Ile
6.689IleAla: 6.689 ± 1.467
2.23IleCys: 2.23 ± 0.848
6.689IleAsp: 6.689 ± 1.542
6.689IleGlu: 6.689 ± 1.775
2.23IlePhe: 2.23 ± 0.831
2.23IleGly: 2.23 ± 0.728
1.115IleHis: 1.115 ± 0.492
4.738IleIle: 4.738 ± 1.281
6.689IleLys: 6.689 ± 1.465
3.066IleLeu: 3.066 ± 0.956
1.394IleMet: 1.394 ± 0.613
2.787IleAsn: 2.787 ± 0.954
1.115IlePro: 1.115 ± 0.471
1.672IleGln: 1.672 ± 0.646
3.344IleArg: 3.344 ± 0.849
4.738IleSer: 4.738 ± 0.944
5.574IleThr: 5.574 ± 1.452
3.066IleVal: 3.066 ± 0.92
0.0IleTrp: 0.0 ± 0.0
1.672IleTyr: 1.672 ± 0.794
0.0IleXaa: 0.0 ± 0.0
Lys
6.689LysAla: 6.689 ± 1.477
0.279LysCys: 0.279 ± 0.216
7.804LysAsp: 7.804 ± 1.152
10.312LysGlu: 10.312 ± 2.142
4.738LysPhe: 4.738 ± 1.109
5.574LysGly: 5.574 ± 1.019
2.23LysHis: 2.23 ± 0.846
5.853LysIle: 5.853 ± 1.397
9.755LysLys: 9.755 ± 1.973
9.476LysLeu: 9.476 ± 1.689
2.787LysMet: 2.787 ± 0.73
5.853LysAsn: 5.853 ± 0.887
2.787LysPro: 2.787 ± 1.284
3.344LysGln: 3.344 ± 0.995
2.787LysArg: 2.787 ± 0.84
8.64LysSer: 8.64 ± 1.707
4.738LysThr: 4.738 ± 1.608
4.738LysVal: 4.738 ± 0.994
0.836LysTrp: 0.836 ± 0.348
3.066LysTyr: 3.066 ± 1.042
0.0LysXaa: 0.0 ± 0.0
Leu
3.344LeuAla: 3.344 ± 0.939
0.0LeuCys: 0.0 ± 0.0
8.361LeuAsp: 8.361 ± 1.782
9.755LeuGlu: 9.755 ± 1.623
5.017LeuPhe: 5.017 ± 1.322
2.787LeuGly: 2.787 ± 0.85
2.23LeuHis: 2.23 ± 0.792
3.344LeuIle: 3.344 ± 0.869
11.984LeuLys: 11.984 ± 2.006
6.41LeuLeu: 6.41 ± 1.698
1.394LeuMet: 1.394 ± 0.558
5.853LeuAsn: 5.853 ± 1.542
1.951LeuPro: 1.951 ± 0.728
2.23LeuGln: 2.23 ± 0.792
5.017LeuArg: 5.017 ± 0.919
4.738LeuSer: 4.738 ± 0.976
6.132LeuThr: 6.132 ± 1.444
4.738LeuVal: 4.738 ± 0.946
1.394LeuTrp: 1.394 ± 0.594
4.459LeuTyr: 4.459 ± 1.055
0.0LeuXaa: 0.0 ± 0.0
Met
2.23MetAla: 2.23 ± 1.107
0.0MetCys: 0.0 ± 0.0
1.672MetAsp: 1.672 ± 0.769
1.672MetGlu: 1.672 ± 0.785
0.279MetPhe: 0.279 ± 0.318
0.836MetGly: 0.836 ± 0.371
0.0MetHis: 0.0 ± 0.0
1.672MetIle: 1.672 ± 0.669
1.394MetLys: 1.394 ± 0.557
1.951MetLeu: 1.951 ± 0.808
0.836MetMet: 0.836 ± 0.397
2.23MetAsn: 2.23 ± 0.81
0.279MetPro: 0.279 ± 0.285
0.557MetGln: 0.557 ± 0.387
1.115MetArg: 1.115 ± 0.785
0.836MetSer: 0.836 ± 0.462
3.066MetThr: 3.066 ± 1.137
1.951MetVal: 1.951 ± 0.793
0.0MetTrp: 0.0 ± 0.0
0.279MetTyr: 0.279 ± 0.241
0.0MetXaa: 0.0 ± 0.0
Asn
4.459AsnAla: 4.459 ± 1.606
0.557AsnCys: 0.557 ± 0.311
2.508AsnAsp: 2.508 ± 0.866
5.295AsnGlu: 5.295 ± 0.925
2.787AsnPhe: 2.787 ± 0.955
3.066AsnGly: 3.066 ± 0.941
0.836AsnHis: 0.836 ± 0.44
4.459AsnIle: 4.459 ± 1.038
5.017AsnLys: 5.017 ± 0.844
5.574AsnLeu: 5.574 ± 1.635
0.836AsnMet: 0.836 ± 0.484
3.066AsnAsn: 3.066 ± 0.922
3.066AsnPro: 3.066 ± 0.64
2.23AsnGln: 2.23 ± 0.562
2.23AsnArg: 2.23 ± 0.547
3.344AsnSer: 3.344 ± 0.706
2.23AsnThr: 2.23 ± 0.729
2.23AsnVal: 2.23 ± 1.03
0.557AsnTrp: 0.557 ± 0.311
1.951AsnTyr: 1.951 ± 0.573
0.0AsnXaa: 0.0 ± 0.0
Pro
2.23ProAla: 2.23 ± 0.572
0.279ProCys: 0.279 ± 0.312
1.115ProAsp: 1.115 ± 0.618
2.23ProGlu: 2.23 ± 0.938
0.557ProPhe: 0.557 ± 0.361
0.557ProGly: 0.557 ± 0.442
0.0ProHis: 0.0 ± 0.0
0.836ProIle: 0.836 ± 0.499
2.787ProLys: 2.787 ± 1.011
1.951ProLeu: 1.951 ± 0.835
0.557ProMet: 0.557 ± 0.391
2.508ProAsn: 2.508 ± 0.671
0.836ProPro: 0.836 ± 0.513
0.0ProGln: 0.0 ± 0.0
1.394ProArg: 1.394 ± 0.77
1.672ProSer: 1.672 ± 0.526
1.115ProThr: 1.115 ± 0.536
1.951ProVal: 1.951 ± 0.932
0.279ProTrp: 0.279 ± 0.312
0.836ProTyr: 0.836 ± 0.513
0.0ProXaa: 0.0 ± 0.0
Gln
5.295GlnAla: 5.295 ± 1.667
0.0GlnCys: 0.0 ± 0.0
1.394GlnAsp: 1.394 ± 0.638
2.23GlnGlu: 2.23 ± 0.904
2.23GlnPhe: 2.23 ± 0.703
0.836GlnGly: 0.836 ± 0.36
0.557GlnHis: 0.557 ± 0.364
1.672GlnIle: 1.672 ± 0.674
3.344GlnLys: 3.344 ± 1.035
3.623GlnLeu: 3.623 ± 1.091
0.836GlnMet: 0.836 ± 0.596
3.066GlnAsn: 3.066 ± 0.948
0.0GlnPro: 0.0 ± 0.0
2.787GlnGln: 2.787 ± 1.195
3.066GlnArg: 3.066 ± 0.895
1.394GlnSer: 1.394 ± 0.627
1.115GlnThr: 1.115 ± 0.582
2.787GlnVal: 2.787 ± 0.81
0.836GlnTrp: 0.836 ± 0.449
1.115GlnTyr: 1.115 ± 0.468
0.0GlnXaa: 0.0 ± 0.0
Arg
4.181ArgAla: 4.181 ± 1.271
0.279ArgCys: 0.279 ± 0.28
3.066ArgAsp: 3.066 ± 0.877
4.181ArgGlu: 4.181 ± 1.211
2.23ArgPhe: 2.23 ± 0.743
1.672ArgGly: 1.672 ± 0.634
1.394ArgHis: 1.394 ± 0.54
5.017ArgIle: 5.017 ± 1.003
3.902ArgLys: 3.902 ± 1.443
5.574ArgLeu: 5.574 ± 1.388
0.557ArgMet: 0.557 ± 0.361
1.672ArgAsn: 1.672 ± 0.585
1.672ArgPro: 1.672 ± 0.583
3.902ArgGln: 3.902 ± 1.153
3.066ArgArg: 3.066 ± 0.895
0.836ArgSer: 0.836 ± 0.436
1.951ArgThr: 1.951 ± 0.497
1.394ArgVal: 1.394 ± 0.559
0.836ArgTrp: 0.836 ± 0.531
3.066ArgTyr: 3.066 ± 0.798
0.0ArgXaa: 0.0 ± 0.0
Ser
2.508SerAla: 2.508 ± 0.867
0.836SerCys: 0.836 ± 0.461
3.344SerAsp: 3.344 ± 0.759
4.181SerGlu: 4.181 ± 1.13
2.508SerPhe: 2.508 ± 0.764
1.394SerGly: 1.394 ± 0.603
0.557SerHis: 0.557 ± 0.351
2.787SerIle: 2.787 ± 0.855
5.017SerLys: 5.017 ± 0.863
5.017SerLeu: 5.017 ± 0.822
2.23SerMet: 2.23 ± 0.73
1.951SerAsn: 1.951 ± 0.912
1.672SerPro: 1.672 ± 0.727
3.066SerGln: 3.066 ± 0.864
2.23SerArg: 2.23 ± 0.896
2.23SerSer: 2.23 ± 0.843
2.787SerThr: 2.787 ± 0.864
3.902SerVal: 3.902 ± 0.865
0.279SerTrp: 0.279 ± 0.27
2.508SerTyr: 2.508 ± 0.838
0.0SerXaa: 0.0 ± 0.0
Thr
2.787ThrAla: 2.787 ± 0.872
0.0ThrCys: 0.0 ± 0.0
2.508ThrAsp: 2.508 ± 1.013
3.623ThrGlu: 3.623 ± 0.937
1.951ThrPhe: 1.951 ± 0.663
3.066ThrGly: 3.066 ± 0.841
1.115ThrHis: 1.115 ± 0.445
5.017ThrIle: 5.017 ± 1.47
4.738ThrLys: 4.738 ± 1.006
4.738ThrLeu: 4.738 ± 1.182
0.557ThrMet: 0.557 ± 0.357
1.951ThrAsn: 1.951 ± 0.593
1.951ThrPro: 1.951 ± 0.67
4.181ThrGln: 4.181 ± 0.988
2.508ThrArg: 2.508 ± 0.567
1.394ThrSer: 1.394 ± 0.719
2.787ThrThr: 2.787 ± 0.972
3.344ThrVal: 3.344 ± 0.928
0.557ThrTrp: 0.557 ± 0.436
3.902ThrTyr: 3.902 ± 1.229
0.0ThrXaa: 0.0 ± 0.0
Val
4.181ValAla: 4.181 ± 1.227
1.951ValCys: 1.951 ± 1.089
3.623ValAsp: 3.623 ± 1.366
7.246ValGlu: 7.246 ± 1.234
1.951ValPhe: 1.951 ± 0.759
1.672ValGly: 1.672 ± 0.451
0.836ValHis: 0.836 ± 0.448
3.344ValIle: 3.344 ± 0.817
4.738ValLys: 4.738 ± 0.969
4.459ValLeu: 4.459 ± 1.044
0.557ValMet: 0.557 ± 0.406
5.017ValAsn: 5.017 ± 0.839
1.115ValPro: 1.115 ± 0.544
0.557ValGln: 0.557 ± 0.411
2.508ValArg: 2.508 ± 0.898
2.23ValSer: 2.23 ± 0.673
1.951ValThr: 1.951 ± 0.603
3.902ValVal: 3.902 ± 1.187
0.557ValTrp: 0.557 ± 0.369
3.066ValTyr: 3.066 ± 1.103
0.0ValXaa: 0.0 ± 0.0
Trp
0.279TrpAla: 0.279 ± 0.216
0.279TrpCys: 0.279 ± 0.241
0.557TrpAsp: 0.557 ± 0.406
1.115TrpGlu: 1.115 ± 0.468
0.0TrpPhe: 0.0 ± 0.0
0.557TrpGly: 0.557 ± 0.363
0.0TrpHis: 0.0 ± 0.0
0.836TrpIle: 0.836 ± 0.493
1.115TrpLys: 1.115 ± 0.504
1.115TrpLeu: 1.115 ± 0.57
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.115TrpGln: 1.115 ± 0.632
0.0TrpArg: 0.0 ± 0.0
0.836TrpSer: 0.836 ± 0.454
0.557TrpThr: 0.557 ± 0.396
0.557TrpVal: 0.557 ± 0.384
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.672TyrAla: 1.672 ± 0.552
0.836TyrCys: 0.836 ± 0.342
1.951TyrAsp: 1.951 ± 0.676
5.017TyrGlu: 5.017 ± 1.158
1.672TyrPhe: 1.672 ± 0.484
1.672TyrGly: 1.672 ± 0.685
0.836TyrHis: 0.836 ± 0.384
3.344TyrIle: 3.344 ± 1.084
6.968TyrLys: 6.968 ± 1.372
4.738TyrLeu: 4.738 ± 1.334
0.0TyrMet: 0.0 ± 0.0
4.181TyrAsn: 4.181 ± 0.895
0.836TyrPro: 0.836 ± 0.378
0.836TyrGln: 0.836 ± 0.436
3.066TyrArg: 3.066 ± 0.981
3.902TyrSer: 3.902 ± 1.331
1.394TyrThr: 1.394 ± 0.569
1.672TyrVal: 1.672 ± 0.907
0.0TyrTrp: 0.0 ± 0.0
0.836TyrTyr: 0.836 ± 0.492
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3589 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski