Amino acid dipepetide frequency for Streptococcus satellite phage Javan283

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.673AlaAla: 1.673 ± 0.898
0.558AlaCys: 0.558 ± 0.296
2.509AlaAsp: 2.509 ± 0.961
4.182AlaGlu: 4.182 ± 0.969
2.788AlaPhe: 2.788 ± 0.795
1.951AlaGly: 1.951 ± 0.772
0.836AlaHis: 0.836 ± 0.438
4.182AlaIle: 4.182 ± 0.757
4.461AlaLys: 4.461 ± 1.048
5.297AlaLeu: 5.297 ± 1.087
0.836AlaMet: 0.836 ± 0.461
3.345AlaAsn: 3.345 ± 0.949
1.673AlaPro: 1.673 ± 0.73
1.951AlaGln: 1.951 ± 0.737
2.509AlaArg: 2.509 ± 0.751
3.624AlaSer: 3.624 ± 0.868
2.23AlaThr: 2.23 ± 0.689
2.788AlaVal: 2.788 ± 0.858
0.0AlaTrp: 0.0 ± 0.0
3.067AlaTyr: 3.067 ± 0.778
0.0AlaXaa: 0.0 ± 0.0
Cys
0.836CysAla: 0.836 ± 0.418
0.0CysCys: 0.0 ± 0.0
0.558CysAsp: 0.558 ± 0.339
0.558CysGlu: 0.558 ± 0.537
0.558CysPhe: 0.558 ± 0.466
0.279CysGly: 0.279 ± 0.254
0.558CysHis: 0.558 ± 0.384
0.558CysIle: 0.558 ± 0.35
0.836CysLys: 0.836 ± 0.408
0.279CysLeu: 0.279 ± 0.265
0.279CysMet: 0.279 ± 0.265
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.558CysArg: 0.558 ± 0.352
0.0CysSer: 0.0 ± 0.0
0.279CysThr: 0.279 ± 0.265
0.0CysVal: 0.0 ± 0.0
0.279CysTrp: 0.279 ± 0.288
0.836CysTyr: 0.836 ± 0.473
0.0CysXaa: 0.0 ± 0.0
Asp
1.673AspAla: 1.673 ± 0.556
0.836AspCys: 0.836 ± 0.426
2.509AspAsp: 2.509 ± 0.891
3.067AspGlu: 3.067 ± 0.982
3.624AspPhe: 3.624 ± 1.16
2.788AspGly: 2.788 ± 0.972
0.279AspHis: 0.279 ± 0.276
5.576AspIle: 5.576 ± 1.008
5.854AspLys: 5.854 ± 1.554
5.297AspLeu: 5.297 ± 1.492
2.509AspMet: 2.509 ± 0.695
3.345AspAsn: 3.345 ± 0.844
0.836AspPro: 0.836 ± 0.469
1.673AspGln: 1.673 ± 0.673
1.951AspArg: 1.951 ± 0.935
3.624AspSer: 3.624 ± 0.941
3.067AspThr: 3.067 ± 1.051
1.951AspVal: 1.951 ± 0.647
0.836AspTrp: 0.836 ± 0.517
3.624AspTyr: 3.624 ± 1.035
0.0AspXaa: 0.0 ± 0.0
Glu
3.903GluAla: 3.903 ± 1.052
0.558GluCys: 0.558 ± 0.333
4.461GluAsp: 4.461 ± 1.44
6.412GluGlu: 6.412 ± 1.532
3.067GluPhe: 3.067 ± 1.026
2.788GluGly: 2.788 ± 1.015
0.836GluHis: 0.836 ± 0.409
7.248GluIle: 7.248 ± 1.749
8.364GluLys: 8.364 ± 1.423
9.479GluLeu: 9.479 ± 1.75
1.673GluMet: 1.673 ± 0.541
4.739GluAsn: 4.739 ± 0.912
1.115GluPro: 1.115 ± 0.423
5.018GluGln: 5.018 ± 1.149
3.903GluArg: 3.903 ± 1.147
4.182GluSer: 4.182 ± 1.148
3.903GluThr: 3.903 ± 0.859
3.067GluVal: 3.067 ± 0.881
0.558GluTrp: 0.558 ± 0.359
2.788GluTyr: 2.788 ± 0.762
0.0GluXaa: 0.0 ± 0.0
Phe
2.23PheAla: 2.23 ± 0.775
0.279PheCys: 0.279 ± 0.273
3.345PheAsp: 3.345 ± 0.932
3.067PheGlu: 3.067 ± 0.908
1.951PhePhe: 1.951 ± 0.768
1.115PheGly: 1.115 ± 0.726
0.558PheHis: 0.558 ± 0.36
3.345PheIle: 3.345 ± 1.037
5.018PheLys: 5.018 ± 1.043
4.461PheLeu: 4.461 ± 0.978
1.394PheMet: 1.394 ± 0.496
2.788PheAsn: 2.788 ± 1.123
1.115PhePro: 1.115 ± 0.561
1.673PheGln: 1.673 ± 0.519
1.115PheArg: 1.115 ± 0.548
2.509PheSer: 2.509 ± 0.697
2.788PheThr: 2.788 ± 0.79
0.836PheVal: 0.836 ± 0.495
0.279PheTrp: 0.279 ± 0.254
1.951PheTyr: 1.951 ± 0.618
0.0PheXaa: 0.0 ± 0.0
Gly
1.951GlyAla: 1.951 ± 0.811
0.0GlyCys: 0.0 ± 0.0
1.394GlyAsp: 1.394 ± 0.536
3.345GlyGlu: 3.345 ± 0.984
1.115GlyPhe: 1.115 ± 0.594
1.673GlyGly: 1.673 ± 0.654
0.558GlyHis: 0.558 ± 0.356
3.903GlyIle: 3.903 ± 1.173
4.182GlyLys: 4.182 ± 1.124
5.018GlyLeu: 5.018 ± 1.669
1.673GlyMet: 1.673 ± 0.727
1.951GlyAsn: 1.951 ± 0.733
0.279GlyPro: 0.279 ± 0.227
1.673GlyGln: 1.673 ± 0.62
1.951GlyArg: 1.951 ± 0.726
3.067GlySer: 3.067 ± 0.78
3.067GlyThr: 3.067 ± 0.89
3.067GlyVal: 3.067 ± 1.016
0.558GlyTrp: 0.558 ± 0.417
2.23GlyTyr: 2.23 ± 0.758
0.0GlyXaa: 0.0 ± 0.0
His
1.115HisAla: 1.115 ± 0.769
0.0HisCys: 0.0 ± 0.0
1.394HisAsp: 1.394 ± 0.629
0.836HisGlu: 0.836 ± 0.474
0.558HisPhe: 0.558 ± 0.41
0.836HisGly: 0.836 ± 0.495
0.836HisHis: 0.836 ± 0.61
3.345HisIle: 3.345 ± 1.126
1.115HisLys: 1.115 ± 0.633
2.509HisLeu: 2.509 ± 0.637
0.0HisMet: 0.0 ± 0.0
1.115HisAsn: 1.115 ± 0.642
0.836HisPro: 0.836 ± 0.476
0.836HisGln: 0.836 ± 0.462
0.836HisArg: 0.836 ± 0.388
1.673HisSer: 1.673 ± 0.54
0.279HisThr: 0.279 ± 0.24
0.836HisVal: 0.836 ± 0.412
0.0HisTrp: 0.0 ± 0.0
1.394HisTyr: 1.394 ± 0.556
0.0HisXaa: 0.0 ± 0.0
Ile
2.788IleAla: 2.788 ± 1.127
0.558IleCys: 0.558 ± 0.405
6.97IleAsp: 6.97 ± 1.268
6.412IleGlu: 6.412 ± 1.738
3.067IlePhe: 3.067 ± 1.188
3.903IleGly: 3.903 ± 0.794
1.394IleHis: 1.394 ± 0.68
6.133IleIle: 6.133 ± 1.423
7.806IleLys: 7.806 ± 1.353
7.527IleLeu: 7.527 ± 1.963
2.788IleMet: 2.788 ± 0.872
4.739IleAsn: 4.739 ± 1.043
2.23IlePro: 2.23 ± 0.796
5.018IleGln: 5.018 ± 1.027
4.182IleArg: 4.182 ± 0.86
6.133IleSer: 6.133 ± 1.183
5.018IleThr: 5.018 ± 1.165
3.345IleVal: 3.345 ± 1.47
0.836IleTrp: 0.836 ± 0.469
3.067IleTyr: 3.067 ± 0.883
0.0IleXaa: 0.0 ± 0.0
Lys
6.133LysAla: 6.133 ± 1.276
0.558LysCys: 0.558 ± 0.507
3.345LysAsp: 3.345 ± 0.659
9.757LysGlu: 9.757 ± 1.193
2.23LysPhe: 2.23 ± 0.653
3.067LysGly: 3.067 ± 0.856
3.345LysHis: 3.345 ± 0.897
6.133LysIle: 6.133 ± 0.82
9.479LysLys: 9.479 ± 1.761
7.248LysLeu: 7.248 ± 1.423
2.23LysMet: 2.23 ± 0.678
5.854LysAsn: 5.854 ± 1.241
3.624LysPro: 3.624 ± 0.836
4.461LysGln: 4.461 ± 1.154
4.739LysArg: 4.739 ± 1.271
5.297LysSer: 5.297 ± 0.975
7.527LysThr: 7.527 ± 1.103
5.297LysVal: 5.297 ± 1.111
0.836LysTrp: 0.836 ± 0.408
2.509LysTyr: 2.509 ± 0.911
0.0LysXaa: 0.0 ± 0.0
Leu
5.854LeuAla: 5.854 ± 1.378
0.836LeuCys: 0.836 ± 0.57
7.806LeuAsp: 7.806 ± 1.268
10.594LeuGlu: 10.594 ± 1.882
3.345LeuPhe: 3.345 ± 1.606
6.412LeuGly: 6.412 ± 1.373
1.673LeuHis: 1.673 ± 0.738
6.691LeuIle: 6.691 ± 1.253
9.757LeuLys: 9.757 ± 1.463
12.267LeuLeu: 12.267 ± 2.005
3.345LeuMet: 3.345 ± 0.809
5.576LeuAsn: 5.576 ± 1.061
3.903LeuPro: 3.903 ± 1.031
5.854LeuGln: 5.854 ± 1.177
1.115LeuArg: 1.115 ± 0.563
5.576LeuSer: 5.576 ± 1.123
5.297LeuThr: 5.297 ± 0.999
4.461LeuVal: 4.461 ± 1.171
1.394LeuTrp: 1.394 ± 0.759
3.067LeuTyr: 3.067 ± 0.733
0.0LeuXaa: 0.0 ± 0.0
Met
1.951MetAla: 1.951 ± 0.713
0.0MetCys: 0.0 ± 0.0
1.394MetAsp: 1.394 ± 0.613
2.788MetGlu: 2.788 ± 0.625
0.836MetPhe: 0.836 ± 0.436
1.115MetGly: 1.115 ± 0.543
0.558MetHis: 0.558 ± 0.4
1.673MetIle: 1.673 ± 0.515
2.788MetLys: 2.788 ± 0.718
1.673MetLeu: 1.673 ± 0.63
0.0MetMet: 0.0 ± 0.0
1.951MetAsn: 1.951 ± 0.774
0.279MetPro: 0.279 ± 0.254
0.279MetGln: 0.279 ± 0.233
1.673MetArg: 1.673 ± 0.585
1.673MetSer: 1.673 ± 0.706
3.903MetThr: 3.903 ± 0.77
1.951MetVal: 1.951 ± 0.881
0.279MetTrp: 0.279 ± 0.273
1.673MetTyr: 1.673 ± 0.57
0.0MetXaa: 0.0 ± 0.0
Asn
3.067AsnAla: 3.067 ± 0.972
0.279AsnCys: 0.279 ± 0.24
1.951AsnAsp: 1.951 ± 0.59
3.903AsnGlu: 3.903 ± 1.054
1.951AsnPhe: 1.951 ± 0.735
3.345AsnGly: 3.345 ± 0.819
1.394AsnHis: 1.394 ± 0.481
5.018AsnIle: 5.018 ± 1.225
4.461AsnLys: 4.461 ± 0.995
6.412AsnLeu: 6.412 ± 1.641
0.836AsnMet: 0.836 ± 0.39
3.345AsnAsn: 3.345 ± 0.783
1.673AsnPro: 1.673 ± 0.597
1.951AsnGln: 1.951 ± 0.574
1.673AsnArg: 1.673 ± 0.534
3.903AsnSer: 3.903 ± 0.844
4.461AsnThr: 4.461 ± 1.096
1.394AsnVal: 1.394 ± 0.591
1.673AsnTrp: 1.673 ± 0.791
1.673AsnTyr: 1.673 ± 0.604
0.0AsnXaa: 0.0 ± 0.0
Pro
2.23ProAla: 2.23 ± 0.984
0.0ProCys: 0.0 ± 0.0
0.836ProAsp: 0.836 ± 0.441
2.23ProGlu: 2.23 ± 0.712
1.394ProPhe: 1.394 ± 0.75
1.115ProGly: 1.115 ± 0.453
0.279ProHis: 0.279 ± 0.28
1.673ProIle: 1.673 ± 0.554
1.394ProLys: 1.394 ± 0.813
3.067ProLeu: 3.067 ± 0.832
0.279ProMet: 0.279 ± 0.212
1.673ProAsn: 1.673 ± 0.665
0.558ProPro: 0.558 ± 0.492
0.558ProGln: 0.558 ± 0.32
1.115ProArg: 1.115 ± 0.562
1.115ProSer: 1.115 ± 0.533
1.394ProThr: 1.394 ± 0.585
1.951ProVal: 1.951 ± 0.867
0.0ProTrp: 0.0 ± 0.0
0.836ProTyr: 0.836 ± 0.54
0.0ProXaa: 0.0 ± 0.0
Gln
3.345GlnAla: 3.345 ± 1.235
0.279GlnCys: 0.279 ± 0.28
2.23GlnAsp: 2.23 ± 0.601
1.951GlnGlu: 1.951 ± 0.657
2.23GlnPhe: 2.23 ± 0.884
1.394GlnGly: 1.394 ± 0.843
1.951GlnHis: 1.951 ± 0.733
3.624GlnIle: 3.624 ± 0.865
3.624GlnLys: 3.624 ± 0.973
3.903GlnLeu: 3.903 ± 0.772
1.673GlnMet: 1.673 ± 0.639
3.345GlnAsn: 3.345 ± 1.04
0.279GlnPro: 0.279 ± 0.259
2.23GlnGln: 2.23 ± 1.114
1.951GlnArg: 1.951 ± 0.568
2.788GlnSer: 2.788 ± 0.971
3.067GlnThr: 3.067 ± 0.846
3.345GlnVal: 3.345 ± 0.977
0.279GlnTrp: 0.279 ± 0.269
2.23GlnTyr: 2.23 ± 0.744
0.0GlnXaa: 0.0 ± 0.0
Arg
1.394ArgAla: 1.394 ± 0.531
0.836ArgCys: 0.836 ± 0.349
1.673ArgAsp: 1.673 ± 0.595
3.903ArgGlu: 3.903 ± 1.014
0.836ArgPhe: 0.836 ± 0.401
1.394ArgGly: 1.394 ± 0.669
1.394ArgHis: 1.394 ± 0.535
5.297ArgIle: 5.297 ± 1.137
3.903ArgLys: 3.903 ± 0.958
5.576ArgLeu: 5.576 ± 1.444
1.115ArgMet: 1.115 ± 0.567
1.673ArgAsn: 1.673 ± 0.725
1.115ArgPro: 1.115 ± 0.392
2.509ArgGln: 2.509 ± 1.163
1.951ArgArg: 1.951 ± 0.666
1.951ArgSer: 1.951 ± 0.818
2.509ArgThr: 2.509 ± 0.763
1.394ArgVal: 1.394 ± 0.595
0.0ArgTrp: 0.0 ± 0.0
1.951ArgTyr: 1.951 ± 0.692
0.0ArgXaa: 0.0 ± 0.0
Ser
2.509SerAla: 2.509 ± 0.754
0.279SerCys: 0.279 ± 0.288
4.182SerAsp: 4.182 ± 0.913
4.739SerGlu: 4.739 ± 0.995
2.788SerPhe: 2.788 ± 0.711
2.23SerGly: 2.23 ± 0.552
0.836SerHis: 0.836 ± 0.458
4.739SerIle: 4.739 ± 0.728
5.576SerLys: 5.576 ± 0.979
6.97SerLeu: 6.97 ± 1.165
1.951SerMet: 1.951 ± 0.754
3.067SerAsn: 3.067 ± 1.157
1.115SerPro: 1.115 ± 0.51
1.394SerGln: 1.394 ± 0.548
3.345SerArg: 3.345 ± 0.986
2.509SerSer: 2.509 ± 0.747
4.739SerThr: 4.739 ± 0.883
3.903SerVal: 3.903 ± 0.957
0.279SerTrp: 0.279 ± 0.28
4.182SerTyr: 4.182 ± 0.976
0.0SerXaa: 0.0 ± 0.0
Thr
2.788ThrAla: 2.788 ± 0.831
0.558ThrCys: 0.558 ± 0.37
3.067ThrAsp: 3.067 ± 0.892
5.576ThrGlu: 5.576 ± 1.288
3.067ThrPhe: 3.067 ± 0.815
4.461ThrGly: 4.461 ± 1.01
1.115ThrHis: 1.115 ± 0.42
6.412ThrIle: 6.412 ± 1.053
5.297ThrLys: 5.297 ± 1.333
4.461ThrLeu: 4.461 ± 0.818
2.23ThrMet: 2.23 ± 0.871
2.23ThrAsn: 2.23 ± 0.606
1.115ThrPro: 1.115 ± 0.489
2.788ThrGln: 2.788 ± 0.764
1.951ThrArg: 1.951 ± 0.864
3.903ThrSer: 3.903 ± 0.949
3.067ThrThr: 3.067 ± 0.922
5.297ThrVal: 5.297 ± 1.37
0.279ThrTrp: 0.279 ± 0.359
2.509ThrTyr: 2.509 ± 0.981
0.0ThrXaa: 0.0 ± 0.0
Val
2.509ValAla: 2.509 ± 0.791
0.0ValCys: 0.0 ± 0.0
3.067ValAsp: 3.067 ± 0.877
1.951ValGlu: 1.951 ± 0.659
3.345ValPhe: 3.345 ± 0.714
1.115ValGly: 1.115 ± 0.583
0.0ValHis: 0.0 ± 0.0
4.182ValIle: 4.182 ± 1.111
5.297ValLys: 5.297 ± 1.604
5.018ValLeu: 5.018 ± 1.111
1.673ValMet: 1.673 ± 0.739
1.394ValAsn: 1.394 ± 0.772
1.394ValPro: 1.394 ± 0.6
3.067ValGln: 3.067 ± 0.982
2.509ValArg: 2.509 ± 0.717
3.903ValSer: 3.903 ± 0.857
1.951ValThr: 1.951 ± 0.611
2.788ValVal: 2.788 ± 0.957
1.115ValTrp: 1.115 ± 0.531
3.345ValTyr: 3.345 ± 0.95
0.0ValXaa: 0.0 ± 0.0
Trp
1.951TrpAla: 1.951 ± 0.534
0.0TrpCys: 0.0 ± 0.0
0.558TrpAsp: 0.558 ± 0.37
0.836TrpGlu: 0.836 ± 0.729
0.558TrpPhe: 0.558 ± 0.405
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.836TrpIle: 0.836 ± 0.468
0.279TrpLys: 0.279 ± 0.28
1.115TrpLeu: 1.115 ± 0.489
0.279TrpMet: 0.279 ± 0.316
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.558TrpGln: 0.558 ± 0.377
0.558TrpArg: 0.558 ± 0.369
1.115TrpSer: 1.115 ± 0.692
0.279TrpThr: 0.279 ± 0.359
0.279TrpVal: 0.279 ± 0.269
0.0TrpTrp: 0.0 ± 0.0
0.279TrpTyr: 0.279 ± 0.268
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.558TyrAla: 0.558 ± 0.406
0.836TyrCys: 0.836 ± 0.48
1.673TyrAsp: 1.673 ± 0.653
1.951TyrGlu: 1.951 ± 0.561
2.509TyrPhe: 2.509 ± 0.793
1.673TyrGly: 1.673 ± 0.892
1.951TyrHis: 1.951 ± 0.58
3.345TyrIle: 3.345 ± 1.301
4.182TyrLys: 4.182 ± 0.964
7.527TyrLeu: 7.527 ± 1.162
1.394TyrMet: 1.394 ± 0.624
2.23TyrAsn: 2.23 ± 0.679
0.558TyrPro: 0.558 ± 0.376
2.23TyrGln: 2.23 ± 0.789
2.788TyrArg: 2.788 ± 0.822
2.788TyrSer: 2.788 ± 0.915
3.345TyrThr: 3.345 ± 1.1
1.673TyrVal: 1.673 ± 0.875
0.0TyrTrp: 0.0 ± 0.0
1.673TyrTyr: 1.673 ± 0.668
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (3588 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski