Amino acid dipepetide frequency for Streptococcus phage phiZJ20091101-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.469AlaAla: 3.469 ± 1.338
1.387AlaCys: 1.387 ± 0.591
7.631AlaAsp: 7.631 ± 2.016
3.122AlaGlu: 3.122 ± 0.875
2.428AlaPhe: 2.428 ± 0.789
5.203AlaGly: 5.203 ± 1.298
1.041AlaHis: 1.041 ± 0.767
6.243AlaIle: 6.243 ± 1.271
5.203AlaLys: 5.203 ± 1.487
5.55AlaLeu: 5.55 ± 1.372
1.387AlaMet: 1.387 ± 1.116
3.469AlaAsn: 3.469 ± 1.0
1.734AlaPro: 1.734 ± 0.923
1.734AlaGln: 1.734 ± 0.741
5.203AlaArg: 5.203 ± 1.102
4.162AlaSer: 4.162 ± 0.926
4.509AlaThr: 4.509 ± 1.212
3.815AlaVal: 3.815 ± 2.226
0.694AlaTrp: 0.694 ± 0.508
2.081AlaTyr: 2.081 ± 0.671
0.0AlaXaa: 0.0 ± 0.0
Cys
0.347CysAla: 0.347 ± 0.298
0.0CysCys: 0.0 ± 0.0
0.694CysAsp: 0.694 ± 0.463
0.347CysGlu: 0.347 ± 0.33
0.0CysPhe: 0.0 ± 0.0
0.347CysGly: 0.347 ± 0.33
0.0CysHis: 0.0 ± 0.0
0.347CysIle: 0.347 ± 0.358
0.347CysLys: 0.347 ± 0.325
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.347CysAsn: 0.347 ± 0.266
1.041CysPro: 1.041 ± 0.66
0.694CysGln: 0.694 ± 0.661
0.694CysArg: 0.694 ± 0.404
0.0CysSer: 0.0 ± 0.0
0.347CysThr: 0.347 ± 0.373
0.347CysVal: 0.347 ± 0.299
0.0CysTrp: 0.0 ± 0.0
0.347CysTyr: 0.347 ± 0.31
0.0CysXaa: 0.0 ± 0.0
Asp
2.428AspAla: 2.428 ± 0.952
0.694AspCys: 0.694 ± 0.388
4.162AspAsp: 4.162 ± 1.127
2.775AspGlu: 2.775 ± 0.99
2.428AspPhe: 2.428 ± 0.635
5.203AspGly: 5.203 ± 1.527
0.0AspHis: 0.0 ± 0.0
5.203AspIle: 5.203 ± 1.407
9.018AspLys: 9.018 ± 1.405
4.856AspLeu: 4.856 ± 1.193
1.734AspMet: 1.734 ± 0.67
4.162AspAsn: 4.162 ± 0.886
1.387AspPro: 1.387 ± 0.652
1.387AspGln: 1.387 ± 0.705
2.775AspArg: 2.775 ± 0.758
3.122AspSer: 3.122 ± 0.909
2.775AspThr: 2.775 ± 1.1
3.469AspVal: 3.469 ± 0.818
1.041AspTrp: 1.041 ± 0.482
4.856AspTyr: 4.856 ± 1.003
0.0AspXaa: 0.0 ± 0.0
Glu
5.203GluAla: 5.203 ± 1.815
0.347GluCys: 0.347 ± 0.358
3.815GluAsp: 3.815 ± 1.216
6.243GluGlu: 6.243 ± 2.448
3.122GluPhe: 3.122 ± 1.118
3.122GluGly: 3.122 ± 1.729
1.734GluHis: 1.734 ± 0.72
4.509GluIle: 4.509 ± 1.447
5.203GluLys: 5.203 ± 1.591
10.406GluLeu: 10.406 ± 2.285
2.081GluMet: 2.081 ± 0.771
5.203GluAsn: 5.203 ± 1.067
2.081GluPro: 2.081 ± 0.791
3.122GluGln: 3.122 ± 1.03
4.509GluArg: 4.509 ± 1.257
3.122GluSer: 3.122 ± 1.098
5.55GluThr: 5.55 ± 1.077
5.897GluVal: 5.897 ± 1.791
1.041GluTrp: 1.041 ± 0.486
2.081GluTyr: 2.081 ± 0.899
0.0GluXaa: 0.0 ± 0.0
Phe
1.734PheAla: 1.734 ± 0.528
0.347PheCys: 0.347 ± 0.31
2.081PheAsp: 2.081 ± 0.846
3.815PheGlu: 3.815 ± 1.047
1.734PhePhe: 1.734 ± 0.628
1.041PheGly: 1.041 ± 0.432
0.694PheHis: 0.694 ± 0.353
2.428PheIle: 2.428 ± 1.53
4.856PheLys: 4.856 ± 1.548
2.081PheLeu: 2.081 ± 0.808
0.694PheMet: 0.694 ± 0.389
1.041PheAsn: 1.041 ± 0.543
1.387PhePro: 1.387 ± 0.749
0.694PheGln: 0.694 ± 0.353
0.694PheArg: 0.694 ± 0.391
2.775PheSer: 2.775 ± 0.756
2.428PheThr: 2.428 ± 0.643
2.428PheVal: 2.428 ± 0.901
0.347PheTrp: 0.347 ± 0.299
2.081PheTyr: 2.081 ± 0.884
0.0PheXaa: 0.0 ± 0.0
Gly
3.815GlyAla: 3.815 ± 2.073
0.694GlyCys: 0.694 ± 0.374
2.081GlyAsp: 2.081 ± 0.924
5.203GlyGlu: 5.203 ± 1.438
3.122GlyPhe: 3.122 ± 1.276
2.428GlyGly: 2.428 ± 1.011
0.694GlyHis: 0.694 ± 0.457
3.122GlyIle: 3.122 ± 0.776
5.203GlyLys: 5.203 ± 1.625
5.897GlyLeu: 5.897 ± 1.329
1.387GlyMet: 1.387 ± 0.684
3.469GlyAsn: 3.469 ± 1.187
0.0GlyPro: 0.0 ± 0.0
2.428GlyGln: 2.428 ± 0.61
1.387GlyArg: 1.387 ± 0.689
1.734GlySer: 1.734 ± 0.727
3.122GlyThr: 3.122 ± 1.123
5.897GlyVal: 5.897 ± 1.278
1.041GlyTrp: 1.041 ± 0.617
2.775GlyTyr: 2.775 ± 0.857
0.0GlyXaa: 0.0 ± 0.0
His
1.387HisAla: 1.387 ± 0.69
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.387HisGlu: 1.387 ± 0.566
0.347HisPhe: 0.347 ± 0.266
1.387HisGly: 1.387 ± 0.881
0.0HisHis: 0.0 ± 0.0
0.347HisIle: 0.347 ± 0.395
0.347HisLys: 0.347 ± 0.329
2.081HisLeu: 2.081 ± 0.887
0.0HisMet: 0.0 ± 0.0
0.694HisAsn: 0.694 ± 0.374
0.694HisPro: 0.694 ± 0.461
0.347HisGln: 0.347 ± 0.367
1.387HisArg: 1.387 ± 0.462
1.041HisSer: 1.041 ± 0.421
1.387HisThr: 1.387 ± 0.852
0.694HisVal: 0.694 ± 0.596
0.0HisTrp: 0.0 ± 0.0
1.387HisTyr: 1.387 ± 0.784
0.0HisXaa: 0.0 ± 0.0
Ile
6.937IleAla: 6.937 ± 2.272
0.694IleCys: 0.694 ± 0.38
3.469IleAsp: 3.469 ± 1.011
6.937IleGlu: 6.937 ± 1.52
1.387IlePhe: 1.387 ± 0.624
2.428IleGly: 2.428 ± 0.767
1.387IleHis: 1.387 ± 0.78
5.203IleIle: 5.203 ± 1.491
4.856IleLys: 4.856 ± 1.285
5.897IleLeu: 5.897 ± 1.502
0.347IleMet: 0.347 ± 0.298
5.897IleAsn: 5.897 ± 1.278
3.469IlePro: 3.469 ± 1.04
1.387IleGln: 1.387 ± 0.655
4.162IleArg: 4.162 ± 1.652
3.469IleSer: 3.469 ± 1.082
4.856IleThr: 4.856 ± 0.838
2.081IleVal: 2.081 ± 0.859
0.694IleTrp: 0.694 ± 0.6
2.081IleTyr: 2.081 ± 0.818
0.0IleXaa: 0.0 ± 0.0
Lys
8.672LysAla: 8.672 ± 1.618
0.0LysCys: 0.0 ± 0.0
6.937LysAsp: 6.937 ± 1.655
10.059LysGlu: 10.059 ± 1.488
2.428LysPhe: 2.428 ± 0.799
4.509LysGly: 4.509 ± 1.205
2.775LysHis: 2.775 ± 0.839
4.509LysIle: 4.509 ± 1.322
7.978LysLys: 7.978 ± 1.945
7.978LysLeu: 7.978 ± 2.147
2.081LysMet: 2.081 ± 0.908
3.122LysAsn: 3.122 ± 0.939
2.081LysPro: 2.081 ± 0.674
5.203LysGln: 5.203 ± 1.221
3.122LysArg: 3.122 ± 0.928
3.815LysSer: 3.815 ± 1.089
4.856LysThr: 4.856 ± 0.889
4.856LysVal: 4.856 ± 1.146
1.041LysTrp: 1.041 ± 0.526
3.122LysTyr: 3.122 ± 0.882
0.0LysXaa: 0.0 ± 0.0
Leu
6.937LeuAla: 6.937 ± 1.168
0.694LeuCys: 0.694 ± 0.542
7.631LeuAsp: 7.631 ± 1.651
8.325LeuGlu: 8.325 ± 2.23
2.428LeuPhe: 2.428 ± 0.657
7.631LeuGly: 7.631 ± 1.941
0.694LeuHis: 0.694 ± 0.378
5.203LeuIle: 5.203 ± 1.492
7.978LeuLys: 7.978 ± 1.206
8.672LeuLeu: 8.672 ± 1.947
3.122LeuMet: 3.122 ± 1.151
5.203LeuAsn: 5.203 ± 1.108
1.734LeuPro: 1.734 ± 0.623
1.734LeuGln: 1.734 ± 0.873
3.122LeuArg: 3.122 ± 0.771
5.897LeuSer: 5.897 ± 1.161
6.937LeuThr: 6.937 ± 1.038
5.203LeuVal: 5.203 ± 1.113
0.347LeuTrp: 0.347 ± 0.378
3.122LeuTyr: 3.122 ± 0.744
0.0LeuXaa: 0.0 ± 0.0
Met
4.509MetAla: 4.509 ± 1.538
0.0MetCys: 0.0 ± 0.0
1.041MetAsp: 1.041 ± 0.441
0.694MetGlu: 0.694 ± 0.509
0.347MetPhe: 0.347 ± 0.337
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
2.428MetIle: 2.428 ± 0.887
1.041MetLys: 1.041 ± 0.575
2.428MetLeu: 2.428 ± 1.111
1.387MetMet: 1.387 ± 0.61
2.081MetAsn: 2.081 ± 0.661
0.347MetPro: 0.347 ± 0.402
1.387MetGln: 1.387 ± 0.654
1.387MetArg: 1.387 ± 0.487
1.387MetSer: 1.387 ± 0.759
2.428MetThr: 2.428 ± 0.823
1.734MetVal: 1.734 ± 0.808
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.162AsnAla: 4.162 ± 0.995
0.347AsnCys: 0.347 ± 0.325
3.469AsnAsp: 3.469 ± 0.784
2.081AsnGlu: 2.081 ± 1.01
2.081AsnPhe: 2.081 ± 0.822
3.469AsnGly: 3.469 ± 0.778
0.0AsnHis: 0.0 ± 0.0
1.734AsnIle: 1.734 ± 0.699
4.509AsnLys: 4.509 ± 1.206
4.509AsnLeu: 4.509 ± 1.229
1.387AsnMet: 1.387 ± 0.623
2.081AsnAsn: 2.081 ± 0.718
2.081AsnPro: 2.081 ± 0.74
4.162AsnGln: 4.162 ± 1.057
1.734AsnArg: 1.734 ± 0.75
2.775AsnSer: 2.775 ± 0.714
3.815AsnThr: 3.815 ± 1.218
2.081AsnVal: 2.081 ± 0.708
1.041AsnTrp: 1.041 ± 0.607
2.775AsnTyr: 2.775 ± 0.928
0.0AsnXaa: 0.0 ± 0.0
Pro
1.041ProAla: 1.041 ± 0.6
0.0ProCys: 0.0 ± 0.0
0.694ProAsp: 0.694 ± 0.57
2.775ProGlu: 2.775 ± 1.078
1.734ProPhe: 1.734 ± 0.902
0.694ProGly: 0.694 ± 0.542
0.347ProHis: 0.347 ± 0.402
4.509ProIle: 4.509 ± 0.674
3.815ProLys: 3.815 ± 1.0
1.041ProLeu: 1.041 ± 0.416
0.694ProMet: 0.694 ± 0.471
2.081ProAsn: 2.081 ± 0.703
1.734ProPro: 1.734 ± 1.072
1.041ProGln: 1.041 ± 0.591
1.041ProArg: 1.041 ± 0.49
2.428ProSer: 2.428 ± 0.949
3.122ProThr: 3.122 ± 0.935
2.428ProVal: 2.428 ± 0.641
0.0ProTrp: 0.0 ± 0.0
0.347ProTyr: 0.347 ± 0.35
0.0ProXaa: 0.0 ± 0.0
Gln
3.815GlnAla: 3.815 ± 1.134
0.0GlnCys: 0.0 ± 0.0
1.041GlnAsp: 1.041 ± 0.717
3.122GlnGlu: 3.122 ± 0.785
0.694GlnPhe: 0.694 ± 0.4
1.387GlnGly: 1.387 ± 0.627
0.694GlnHis: 0.694 ± 0.463
2.775GlnIle: 2.775 ± 1.017
5.897GlnLys: 5.897 ± 1.593
4.162GlnLeu: 4.162 ± 1.154
1.041GlnMet: 1.041 ± 0.482
1.041GlnAsn: 1.041 ± 0.645
0.694GlnPro: 0.694 ± 0.431
5.203GlnGln: 5.203 ± 1.406
1.387GlnArg: 1.387 ± 0.75
3.122GlnSer: 3.122 ± 1.062
2.081GlnThr: 2.081 ± 0.631
2.428GlnVal: 2.428 ± 1.181
0.0GlnTrp: 0.0 ± 0.0
2.775GlnTyr: 2.775 ± 0.857
0.0GlnXaa: 0.0 ± 0.0
Arg
2.081ArgAla: 2.081 ± 0.733
0.347ArgCys: 0.347 ± 0.33
4.162ArgAsp: 4.162 ± 1.147
3.469ArgGlu: 3.469 ± 1.024
0.694ArgPhe: 0.694 ± 0.374
3.469ArgGly: 3.469 ± 1.222
0.694ArgHis: 0.694 ± 0.404
3.122ArgIle: 3.122 ± 0.922
4.509ArgLys: 4.509 ± 1.512
4.509ArgLeu: 4.509 ± 1.298
2.428ArgMet: 2.428 ± 0.713
1.041ArgAsn: 1.041 ± 0.482
0.694ArgPro: 0.694 ± 0.524
2.081ArgGln: 2.081 ± 0.972
1.734ArgArg: 1.734 ± 0.784
1.387ArgSer: 1.387 ± 0.501
2.428ArgThr: 2.428 ± 0.693
3.815ArgVal: 3.815 ± 0.908
1.387ArgTrp: 1.387 ± 0.629
2.428ArgTyr: 2.428 ± 0.931
0.0ArgXaa: 0.0 ± 0.0
Ser
2.775SerAla: 2.775 ± 0.701
0.0SerCys: 0.0 ± 0.0
3.815SerAsp: 3.815 ± 0.92
2.775SerGlu: 2.775 ± 0.68
2.775SerPhe: 2.775 ± 0.783
3.469SerGly: 3.469 ± 0.848
1.734SerHis: 1.734 ± 0.763
3.815SerIle: 3.815 ± 0.837
3.122SerLys: 3.122 ± 0.955
4.509SerLeu: 4.509 ± 1.008
1.041SerMet: 1.041 ± 0.571
2.775SerAsn: 2.775 ± 1.012
2.775SerPro: 2.775 ± 0.983
2.775SerGln: 2.775 ± 1.088
3.815SerArg: 3.815 ± 1.053
3.815SerSer: 3.815 ± 1.369
2.775SerThr: 2.775 ± 0.799
2.775SerVal: 2.775 ± 1.133
0.347SerTrp: 0.347 ± 0.266
2.081SerTyr: 2.081 ± 0.978
0.0SerXaa: 0.0 ± 0.0
Thr
3.469ThrAla: 3.469 ± 0.901
0.694ThrCys: 0.694 ± 0.38
4.162ThrAsp: 4.162 ± 1.091
7.631ThrGlu: 7.631 ± 1.352
1.387ThrPhe: 1.387 ± 0.556
4.162ThrGly: 4.162 ± 1.214
1.041ThrHis: 1.041 ± 0.531
6.243ThrIle: 6.243 ± 0.989
4.162ThrLys: 4.162 ± 1.145
6.59ThrLeu: 6.59 ± 1.217
0.347ThrMet: 0.347 ± 0.266
2.428ThrAsn: 2.428 ± 0.866
3.815ThrPro: 3.815 ± 0.962
3.469ThrGln: 3.469 ± 1.008
2.428ThrArg: 2.428 ± 0.954
1.734ThrSer: 1.734 ± 0.569
4.162ThrThr: 4.162 ± 1.491
3.815ThrVal: 3.815 ± 1.328
0.0ThrTrp: 0.0 ± 0.0
2.775ThrTyr: 2.775 ± 0.73
0.0ThrXaa: 0.0 ± 0.0
Val
3.815ValAla: 3.815 ± 1.105
0.0ValCys: 0.0 ± 0.0
3.122ValAsp: 3.122 ± 0.77
3.469ValGlu: 3.469 ± 0.913
3.469ValPhe: 3.469 ± 1.223
3.122ValGly: 3.122 ± 1.065
0.347ValHis: 0.347 ± 0.266
3.815ValIle: 3.815 ± 1.044
5.55ValLys: 5.55 ± 1.625
2.775ValLeu: 2.775 ± 1.165
1.387ValMet: 1.387 ± 0.798
2.428ValAsn: 2.428 ± 0.858
2.081ValPro: 2.081 ± 0.694
2.428ValGln: 2.428 ± 0.704
3.122ValArg: 3.122 ± 0.863
5.203ValSer: 5.203 ± 1.416
3.815ValThr: 3.815 ± 1.025
3.122ValVal: 3.122 ± 1.273
0.694ValTrp: 0.694 ± 0.452
4.162ValTyr: 4.162 ± 1.324
0.0ValXaa: 0.0 ± 0.0
Trp
1.041TrpAla: 1.041 ± 0.581
0.0TrpCys: 0.0 ± 0.0
0.694TrpAsp: 0.694 ± 0.353
1.041TrpGlu: 1.041 ± 0.631
0.0TrpPhe: 0.0 ± 0.0
0.347TrpGly: 0.347 ± 0.298
0.347TrpHis: 0.347 ± 0.266
0.347TrpIle: 0.347 ± 0.475
1.041TrpLys: 1.041 ± 0.482
2.081TrpLeu: 2.081 ± 0.98
0.347TrpMet: 0.347 ± 0.393
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.347TrpGln: 0.347 ± 0.266
1.041TrpArg: 1.041 ± 0.957
0.694TrpSer: 0.694 ± 0.374
1.041TrpThr: 1.041 ± 0.612
0.0TrpVal: 0.0 ± 0.0
0.347TrpTrp: 0.347 ± 0.266
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.775TyrAla: 2.775 ± 1.056
0.0TyrCys: 0.0 ± 0.0
2.775TyrAsp: 2.775 ± 1.178
3.122TyrGlu: 3.122 ± 0.972
2.775TyrPhe: 2.775 ± 0.991
2.081TyrGly: 2.081 ± 1.016
0.694TyrHis: 0.694 ± 0.468
1.734TyrIle: 1.734 ± 0.712
4.509TyrLys: 4.509 ± 1.337
6.59TyrLeu: 6.59 ± 1.742
1.387TyrMet: 1.387 ± 0.812
1.734TyrAsn: 1.734 ± 0.69
1.734TyrPro: 1.734 ± 0.758
1.734TyrGln: 1.734 ± 0.914
1.734TyrArg: 1.734 ± 0.646
2.081TyrSer: 2.081 ± 1.106
2.081TyrThr: 2.081 ± 1.187
1.041TyrVal: 1.041 ± 0.642
0.694TyrTrp: 0.694 ± 0.404
2.081TyrTyr: 2.081 ± 0.715
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 19 proteins (2884 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski