Amino acid dipepetide frequency for Escherichia phage alpha3 (Bacteriophage alpha-3)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.743AlaAla: 6.743 ± 2.855
1.556AlaCys: 1.556 ± 0.935
2.075AlaAsp: 2.075 ± 0.594
6.743AlaGlu: 6.743 ± 1.3
3.112AlaPhe: 3.112 ± 0.944
9.336AlaGly: 9.336 ± 3.912
1.556AlaHis: 1.556 ± 1.217
4.668AlaIle: 4.668 ± 0.571
5.187AlaLys: 5.187 ± 1.974
6.224AlaLeu: 6.224 ± 1.318
1.037AlaMet: 1.037 ± 0.591
2.593AlaAsn: 2.593 ± 0.903
3.631AlaPro: 3.631 ± 1.653
3.112AlaGln: 3.112 ± 1.206
6.224AlaArg: 6.224 ± 1.8
7.261AlaSer: 7.261 ± 3.298
5.705AlaThr: 5.705 ± 1.699
5.187AlaVal: 5.187 ± 1.379
0.519AlaTrp: 0.519 ± 0.406
2.075AlaTyr: 2.075 ± 0.888
0.0AlaXaa: 0.0 ± 0.0
Cys
1.556CysAla: 1.556 ± 0.671
0.519CysCys: 0.519 ± 0.421
0.0CysAsp: 0.0 ± 0.0
0.519CysGlu: 0.519 ± 0.631
0.519CysPhe: 0.519 ± 0.421
0.519CysGly: 0.519 ± 0.421
0.519CysHis: 0.519 ± 0.421
0.0CysIle: 0.0 ± 0.0
1.037CysLys: 1.037 ± 0.696
1.037CysLeu: 1.037 ± 0.569
0.0CysMet: 0.0 ± 0.0
0.519CysAsn: 0.519 ± 0.421
0.519CysPro: 0.519 ± 0.406
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.037CysSer: 1.037 ± 0.931
0.0CysThr: 0.0 ± 0.0
3.112CysVal: 3.112 ± 1.3
0.0CysTrp: 0.0 ± 0.0
0.519CysTyr: 0.519 ± 0.406
0.0CysXaa: 0.0 ± 0.0
Asp
6.224AspAla: 6.224 ± 1.416
1.556AspCys: 1.556 ± 0.554
3.112AspAsp: 3.112 ± 1.309
2.593AspGlu: 2.593 ± 1.274
3.631AspPhe: 3.631 ± 1.648
3.631AspGly: 3.631 ± 1.389
1.556AspHis: 1.556 ± 0.718
2.593AspIle: 2.593 ± 0.55
2.075AspLys: 2.075 ± 0.97
4.668AspLeu: 4.668 ± 1.338
1.037AspMet: 1.037 ± 0.66
3.631AspAsn: 3.631 ± 1.055
2.075AspPro: 2.075 ± 0.775
2.075AspGln: 2.075 ± 0.609
1.556AspArg: 1.556 ± 0.624
4.668AspSer: 4.668 ± 1.118
2.593AspThr: 2.593 ± 0.958
3.631AspVal: 3.631 ± 0.873
1.037AspTrp: 1.037 ± 0.734
2.593AspTyr: 2.593 ± 0.754
0.0AspXaa: 0.0 ± 0.0
Glu
2.075GluAla: 2.075 ± 0.771
0.519GluCys: 0.519 ± 0.406
2.075GluAsp: 2.075 ± 0.663
2.593GluGlu: 2.593 ± 0.941
3.112GluPhe: 3.112 ± 1.354
1.037GluGly: 1.037 ± 0.569
2.075GluHis: 2.075 ± 1.029
2.593GluIle: 2.593 ± 0.819
4.668GluLys: 4.668 ± 1.671
4.668GluLeu: 4.668 ± 1.865
3.112GluMet: 3.112 ± 0.723
2.593GluAsn: 2.593 ± 1.115
0.519GluPro: 0.519 ± 0.57
1.556GluGln: 1.556 ± 0.763
5.705GluArg: 5.705 ± 1.779
4.668GluSer: 4.668 ± 1.602
3.112GluThr: 3.112 ± 0.559
1.556GluVal: 1.556 ± 0.901
0.0GluTrp: 0.0 ± 0.0
0.519GluTyr: 0.519 ± 0.465
0.0GluXaa: 0.0 ± 0.0
Phe
1.556PheAla: 1.556 ± 0.786
0.0PheCys: 0.0 ± 0.0
3.112PheAsp: 3.112 ± 0.978
1.556PheGlu: 1.556 ± 0.554
2.075PhePhe: 2.075 ± 0.609
3.631PheGly: 3.631 ± 0.923
0.0PheHis: 0.0 ± 0.0
1.556PheIle: 1.556 ± 0.817
3.112PheLys: 3.112 ± 0.727
3.112PheLeu: 3.112 ± 0.87
1.556PheMet: 1.556 ± 1.222
1.556PheAsn: 1.556 ± 0.836
2.075PhePro: 2.075 ± 0.882
1.556PheGln: 1.556 ± 0.915
3.112PheArg: 3.112 ± 1.281
2.593PheSer: 2.593 ± 0.847
1.556PheThr: 1.556 ± 0.654
2.593PheVal: 2.593 ± 0.981
0.519PheTrp: 0.519 ± 0.421
1.037PheTyr: 1.037 ± 0.712
0.0PheXaa: 0.0 ± 0.0
Gly
6.224GlyAla: 6.224 ± 3.178
0.519GlyCys: 0.519 ± 0.57
3.112GlyAsp: 3.112 ± 1.512
0.0GlyGlu: 0.0 ± 0.0
1.556GlyPhe: 1.556 ± 0.53
5.705GlyGly: 5.705 ± 2.027
0.519GlyHis: 0.519 ± 0.57
4.149GlyIle: 4.149 ± 1.599
4.149GlyLys: 4.149 ± 1.575
2.593GlyLeu: 2.593 ± 1.116
1.556GlyMet: 1.556 ± 0.718
4.668GlyAsn: 4.668 ± 1.846
0.0GlyPro: 0.0 ± 0.0
2.075GlyGln: 2.075 ± 0.743
4.149GlyArg: 4.149 ± 1.491
6.743GlySer: 6.743 ± 2.291
5.187GlyThr: 5.187 ± 2.033
4.668GlyVal: 4.668 ± 1.36
1.037GlyTrp: 1.037 ± 0.811
2.075GlyTyr: 2.075 ± 1.042
0.0GlyXaa: 0.0 ± 0.0
His
2.593HisAla: 2.593 ± 1.018
0.0HisCys: 0.0 ± 0.0
2.075HisAsp: 2.075 ± 0.781
1.037HisGlu: 1.037 ± 0.634
1.556HisPhe: 1.556 ± 1.387
1.556HisGly: 1.556 ± 0.745
0.519HisHis: 0.519 ± 0.421
0.0HisIle: 0.0 ± 0.0
0.519HisLys: 0.519 ± 0.561
3.112HisLeu: 3.112 ± 0.839
1.556HisMet: 1.556 ± 0.718
0.0HisAsn: 0.0 ± 0.0
1.556HisPro: 1.556 ± 0.691
2.075HisGln: 2.075 ± 0.609
1.556HisArg: 1.556 ± 0.608
0.519HisSer: 0.519 ± 0.421
1.037HisThr: 1.037 ± 0.843
2.593HisVal: 2.593 ± 0.964
0.519HisTrp: 0.519 ± 0.406
1.037HisTyr: 1.037 ± 0.578
0.0HisXaa: 0.0 ± 0.0
Ile
6.224IleAla: 6.224 ± 1.69
2.075IleCys: 2.075 ± 1.006
3.112IleAsp: 3.112 ± 0.632
1.556IleGlu: 1.556 ± 0.769
0.519IlePhe: 0.519 ± 0.406
2.593IleGly: 2.593 ± 0.75
2.075IleHis: 2.075 ± 1.104
1.037IleIle: 1.037 ± 0.569
1.556IleLys: 1.556 ± 0.892
2.075IleLeu: 2.075 ± 0.542
2.593IleMet: 2.593 ± 1.098
2.593IleAsn: 2.593 ± 0.916
2.593IlePro: 2.593 ± 1.102
3.112IleGln: 3.112 ± 1.862
1.556IleArg: 1.556 ± 0.718
4.668IleSer: 4.668 ± 0.773
2.075IleThr: 2.075 ± 0.809
3.631IleVal: 3.631 ± 1.443
1.556IleTrp: 1.556 ± 0.892
1.037IleTyr: 1.037 ± 0.427
0.0IleXaa: 0.0 ± 0.0
Lys
4.668LysAla: 4.668 ± 1.502
1.037LysCys: 1.037 ± 0.811
4.149LysAsp: 4.149 ± 2.215
2.075LysGlu: 2.075 ± 0.816
0.519LysPhe: 0.519 ± 0.421
2.075LysGly: 2.075 ± 1.133
2.593LysHis: 2.593 ± 0.905
2.075LysIle: 2.075 ± 0.878
5.187LysLys: 5.187 ± 2.03
4.668LysLeu: 4.668 ± 1.071
3.631LysMet: 3.631 ± 1.329
2.075LysAsn: 2.075 ± 1.086
1.037LysPro: 1.037 ± 0.811
4.149LysGln: 4.149 ± 1.33
1.556LysArg: 1.556 ± 0.899
2.075LysSer: 2.075 ± 0.743
3.631LysThr: 3.631 ± 1.905
4.149LysVal: 4.149 ± 0.741
1.556LysTrp: 1.556 ± 0.89
2.075LysTyr: 2.075 ± 1.086
0.0LysXaa: 0.0 ± 0.0
Leu
7.261LeuAla: 7.261 ± 1.929
1.037LeuCys: 1.037 ± 0.696
7.261LeuAsp: 7.261 ± 1.428
3.631LeuGlu: 3.631 ± 0.846
2.075LeuPhe: 2.075 ± 0.47
2.593LeuGly: 2.593 ± 0.611
2.593LeuHis: 2.593 ± 1.139
3.631LeuIle: 3.631 ± 0.991
5.705LeuLys: 5.705 ± 0.911
9.336LeuLeu: 9.336 ± 4.524
3.112LeuMet: 3.112 ± 1.542
3.631LeuAsn: 3.631 ± 0.801
4.149LeuPro: 4.149 ± 0.917
4.149LeuGln: 4.149 ± 1.002
7.261LeuArg: 7.261 ± 1.74
7.78LeuSer: 7.78 ± 2.589
5.705LeuThr: 5.705 ± 1.887
2.593LeuVal: 2.593 ± 0.85
2.075LeuTrp: 2.075 ± 1.163
1.037LeuTyr: 1.037 ± 0.591
0.0LeuXaa: 0.0 ± 0.0
Met
4.668MetAla: 4.668 ± 2.016
0.0MetCys: 0.0 ± 0.0
1.556MetAsp: 1.556 ± 1.079
2.075MetGlu: 2.075 ± 1.036
1.037MetPhe: 1.037 ± 0.811
1.037MetGly: 1.037 ± 0.843
0.519MetHis: 0.519 ± 0.421
1.037MetIle: 1.037 ± 0.485
2.075MetLys: 2.075 ± 0.924
4.149MetLeu: 4.149 ± 1.74
0.519MetMet: 0.519 ± 0.421
1.037MetAsn: 1.037 ± 0.648
1.037MetPro: 1.037 ± 0.843
4.149MetGln: 4.149 ± 1.157
1.556MetArg: 1.556 ± 0.354
1.037MetSer: 1.037 ± 0.578
2.075MetThr: 2.075 ± 0.854
1.037MetVal: 1.037 ± 0.811
0.519MetTrp: 0.519 ± 0.406
1.037MetTyr: 1.037 ± 0.681
0.0MetXaa: 0.0 ± 0.0
Asn
3.631AsnAla: 3.631 ± 0.9
0.519AsnCys: 0.519 ± 0.465
2.593AsnAsp: 2.593 ± 1.098
2.075AsnGlu: 2.075 ± 0.536
1.556AsnPhe: 1.556 ± 0.554
4.668AsnGly: 4.668 ± 1.844
0.519AsnHis: 0.519 ± 0.561
3.112AsnIle: 3.112 ± 1.051
1.037AsnLys: 1.037 ± 0.427
4.149AsnLeu: 4.149 ± 1.482
1.556AsnMet: 1.556 ± 1.035
2.075AsnAsn: 2.075 ± 0.858
2.075AsnPro: 2.075 ± 0.536
2.075AsnGln: 2.075 ± 1.558
3.112AsnArg: 3.112 ± 1.11
4.149AsnSer: 4.149 ± 1.575
2.593AsnThr: 2.593 ± 1.366
3.631AsnVal: 3.631 ± 1.48
0.519AsnTrp: 0.519 ± 0.406
2.075AsnTyr: 2.075 ± 0.651
0.0AsnXaa: 0.0 ± 0.0
Pro
2.593ProAla: 2.593 ± 0.903
0.519ProCys: 0.519 ± 0.554
1.556ProAsp: 1.556 ± 1.066
4.668ProGlu: 4.668 ± 1.069
0.519ProPhe: 0.519 ± 0.421
1.556ProGly: 1.556 ± 0.608
0.519ProHis: 0.519 ± 0.421
2.075ProIle: 2.075 ± 0.93
0.519ProLys: 0.519 ± 0.421
6.743ProLeu: 6.743 ± 2.181
0.0ProMet: 0.0 ± 0.0
2.075ProAsn: 2.075 ± 0.878
2.075ProPro: 2.075 ± 1.207
0.0ProGln: 0.0 ± 0.0
2.075ProArg: 2.075 ± 0.943
3.112ProSer: 3.112 ± 1.322
2.593ProThr: 2.593 ± 1.362
3.631ProVal: 3.631 ± 1.34
1.037ProTrp: 1.037 ± 0.54
1.037ProTyr: 1.037 ± 0.811
0.0ProXaa: 0.0 ± 0.0
Gln
4.668GlnAla: 4.668 ± 1.479
0.0GlnCys: 0.0 ± 0.0
1.556GlnAsp: 1.556 ± 0.681
4.149GlnGlu: 4.149 ± 1.331
1.556GlnPhe: 1.556 ± 1.038
2.075GlnGly: 2.075 ± 1.558
2.075GlnHis: 2.075 ± 0.936
2.593GlnIle: 2.593 ± 1.068
4.149GlnLys: 4.149 ± 1.38
5.705GlnLeu: 5.705 ± 1.163
0.519GlnMet: 0.519 ± 0.492
3.112GlnAsn: 3.112 ± 2.445
2.075GlnPro: 2.075 ± 0.93
3.631GlnGln: 3.631 ± 1.815
2.075GlnArg: 2.075 ± 0.743
5.187GlnSer: 5.187 ± 1.091
3.112GlnThr: 3.112 ± 0.869
2.075GlnVal: 2.075 ± 0.962
2.593GlnTrp: 2.593 ± 0.754
1.037GlnTyr: 1.037 ± 0.485
0.0GlnXaa: 0.0 ± 0.0
Arg
5.705ArgAla: 5.705 ± 0.936
1.037ArgCys: 1.037 ± 0.567
4.668ArgAsp: 4.668 ± 1.774
2.075ArgGlu: 2.075 ± 1.004
4.149ArgPhe: 4.149 ± 1.743
2.075ArgGly: 2.075 ± 0.743
2.593ArgHis: 2.593 ± 1.578
2.593ArgIle: 2.593 ± 0.755
3.631ArgLys: 3.631 ± 1.902
6.224ArgLeu: 6.224 ± 2.543
2.075ArgMet: 2.075 ± 0.816
2.075ArgAsn: 2.075 ± 0.709
1.556ArgPro: 1.556 ± 0.53
4.668ArgGln: 4.668 ± 1.753
3.112ArgArg: 3.112 ± 1.389
3.631ArgSer: 3.631 ± 1.629
4.149ArgThr: 4.149 ± 1.166
2.593ArgVal: 2.593 ± 0.523
0.519ArgTrp: 0.519 ± 0.554
4.668ArgTyr: 4.668 ± 1.297
0.0ArgXaa: 0.0 ± 0.0
Ser
7.261SerAla: 7.261 ± 2.679
0.0SerCys: 0.0 ± 0.0
5.187SerAsp: 5.187 ± 1.337
4.668SerGlu: 4.668 ± 2.737
4.668SerPhe: 4.668 ± 0.865
5.705SerGly: 5.705 ± 2.051
1.556SerHis: 1.556 ± 0.718
4.149SerIle: 4.149 ± 1.143
2.075SerLys: 2.075 ± 0.542
4.668SerLeu: 4.668 ± 2.366
3.631SerMet: 3.631 ± 1.347
3.631SerAsn: 3.631 ± 0.903
2.593SerPro: 2.593 ± 0.918
5.705SerGln: 5.705 ± 2.569
5.187SerArg: 5.187 ± 1.617
8.299SerSer: 8.299 ± 2.215
2.593SerThr: 2.593 ± 0.973
5.705SerVal: 5.705 ± 0.829
0.519SerTrp: 0.519 ± 0.421
3.112SerTyr: 3.112 ± 1.203
0.0SerXaa: 0.0 ± 0.0
Thr
3.631ThrAla: 3.631 ± 1.248
0.519ThrCys: 0.519 ± 0.421
4.149ThrAsp: 4.149 ± 1.002
3.112ThrGlu: 3.112 ± 0.854
1.556ThrPhe: 1.556 ± 0.745
3.112ThrGly: 3.112 ± 0.887
1.556ThrHis: 1.556 ± 0.354
2.593ThrIle: 2.593 ± 0.569
4.149ThrLys: 4.149 ± 1.2
7.261ThrLeu: 7.261 ± 2.857
0.0ThrMet: 0.0 ± 0.0
2.075ThrAsn: 2.075 ± 1.561
3.112ThrPro: 3.112 ± 1.443
3.112ThrGln: 3.112 ± 1.032
3.631ThrArg: 3.631 ± 1.733
7.261ThrSer: 7.261 ± 3.278
1.037ThrThr: 1.037 ± 0.634
3.112ThrVal: 3.112 ± 0.869
0.0ThrTrp: 0.0 ± 0.0
1.037ThrTyr: 1.037 ± 0.843
0.0ThrXaa: 0.0 ± 0.0
Val
3.631ValAla: 3.631 ± 0.888
0.519ValCys: 0.519 ± 0.465
3.631ValAsp: 3.631 ± 1.295
2.593ValGlu: 2.593 ± 1.469
1.556ValPhe: 1.556 ± 0.901
4.668ValGly: 4.668 ± 1.273
1.037ValHis: 1.037 ± 0.569
5.705ValIle: 5.705 ± 2.722
2.593ValLys: 2.593 ± 1.479
2.593ValLeu: 2.593 ± 1.08
3.112ValMet: 3.112 ± 1.316
4.668ValAsn: 4.668 ± 1.833
2.593ValPro: 2.593 ± 0.986
3.112ValGln: 3.112 ± 0.854
5.705ValArg: 5.705 ± 2.92
4.149ValSer: 4.149 ± 1.31
4.149ValThr: 4.149 ± 0.99
2.593ValVal: 2.593 ± 1.129
0.519ValTrp: 0.519 ± 0.406
3.112ValTyr: 3.112 ± 0.838
0.0ValXaa: 0.0 ± 0.0
Trp
0.519TrpAla: 0.519 ± 0.421
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.519TrpGlu: 0.519 ± 0.492
1.037TrpPhe: 1.037 ± 0.664
0.519TrpGly: 0.519 ± 0.57
0.519TrpHis: 0.519 ± 0.406
1.037TrpIle: 1.037 ± 0.567
1.037TrpLys: 1.037 ± 0.811
1.037TrpLeu: 1.037 ± 0.664
0.519TrpMet: 0.519 ± 0.421
1.037TrpAsn: 1.037 ± 0.54
1.556TrpPro: 1.556 ± 1.217
0.519TrpGln: 0.519 ± 0.406
0.0TrpArg: 0.0 ± 0.0
1.037TrpSer: 1.037 ± 0.427
2.075TrpThr: 2.075 ± 0.631
0.519TrpVal: 0.519 ± 0.492
0.0TrpTrp: 0.0 ± 0.0
1.556TrpTyr: 1.556 ± 0.707
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.112TyrAla: 3.112 ± 1.097
0.0TyrCys: 0.0 ± 0.0
1.556TyrAsp: 1.556 ± 1.264
0.519TyrGlu: 0.519 ± 0.406
2.075TyrPhe: 2.075 ± 0.865
2.593TyrGly: 2.593 ± 0.581
0.519TyrHis: 0.519 ± 0.421
1.037TyrIle: 1.037 ± 0.427
0.519TyrLys: 0.519 ± 0.554
2.593TyrLeu: 2.593 ± 0.754
0.519TyrMet: 0.519 ± 0.406
2.075TyrAsn: 2.075 ± 0.609
2.075TyrPro: 2.075 ± 1.032
3.112TyrGln: 3.112 ± 1.802
4.668TyrArg: 4.668 ± 1.923
1.037TyrSer: 1.037 ± 0.589
1.037TyrThr: 1.037 ± 0.567
3.631TyrVal: 3.631 ± 1.174
0.0TyrTrp: 0.0 ± 0.0
1.037TyrTyr: 1.037 ± 0.657
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (1929 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski