Amino acid dipepetide frequency for Escherichia phage Lilledu

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.772AlaAla: 8.772 ± 1.545
1.754AlaCys: 1.754 ± 1.63
7.018AlaAsp: 7.018 ± 1.601
2.924AlaGlu: 2.924 ± 1.532
2.924AlaPhe: 2.924 ± 1.642
9.942AlaGly: 9.942 ± 3.064
2.924AlaHis: 2.924 ± 2.134
5.263AlaIle: 5.263 ± 1.546
6.433AlaLys: 6.433 ± 1.804
5.848AlaLeu: 5.848 ± 2.895
1.17AlaMet: 1.17 ± 0.446
1.754AlaAsn: 1.754 ± 0.421
4.094AlaPro: 4.094 ± 0.867
4.094AlaGln: 4.094 ± 1.314
1.17AlaArg: 1.17 ± 0.944
10.526AlaSer: 10.526 ± 1.762
4.678AlaThr: 4.678 ± 0.914
8.187AlaVal: 8.187 ± 2.306
0.585AlaTrp: 0.585 ± 0.516
1.754AlaTyr: 1.754 ± 0.775
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.585CysCys: 0.585 ± 0.516
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.585CysPhe: 0.585 ± 0.516
0.0CysGly: 0.0 ± 0.0
0.585CysHis: 0.585 ± 0.516
0.0CysIle: 0.0 ± 0.0
0.585CysLys: 0.585 ± 0.845
1.17CysLeu: 1.17 ± 0.84
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.585CysPro: 0.585 ± 0.427
0.0CysGln: 0.0 ± 0.0
0.585CysArg: 0.585 ± 0.572
0.585CysSer: 0.585 ± 0.427
0.585CysThr: 0.585 ± 0.516
1.754CysVal: 1.754 ± 0.874
0.0CysTrp: 0.0 ± 0.0
1.17CysTyr: 1.17 ± 0.854
0.0CysXaa: 0.0 ± 0.0
Asp
5.848AspAla: 5.848 ± 2.052
1.17AspCys: 1.17 ± 0.625
2.924AspAsp: 2.924 ± 0.941
4.094AspGlu: 4.094 ± 1.874
1.754AspPhe: 1.754 ± 1.63
4.094AspGly: 4.094 ± 1.495
1.17AspHis: 1.17 ± 0.446
5.848AspIle: 5.848 ± 1.499
1.754AspLys: 1.754 ± 1.048
2.339AspLeu: 2.339 ± 1.014
0.585AspMet: 0.585 ± 0.427
4.678AspAsn: 4.678 ± 1.47
1.17AspPro: 1.17 ± 0.446
2.339AspGln: 2.339 ± 1.097
3.509AspArg: 3.509 ± 1.519
4.678AspSer: 4.678 ± 1.442
4.094AspThr: 4.094 ± 1.642
2.339AspVal: 2.339 ± 0.69
0.585AspTrp: 0.585 ± 0.819
4.094AspTyr: 4.094 ± 0.958
0.0AspXaa: 0.0 ± 0.0
Glu
1.754GluAla: 1.754 ± 0.775
1.754GluCys: 1.754 ± 0.704
1.754GluAsp: 1.754 ± 1.63
2.924GluGlu: 2.924 ± 1.535
2.339GluPhe: 2.339 ± 2.51
2.339GluGly: 2.339 ± 1.559
1.17GluHis: 1.17 ± 0.859
3.509GluIle: 3.509 ± 0.703
1.17GluLys: 1.17 ± 0.859
4.094GluLeu: 4.094 ± 1.726
1.17GluMet: 1.17 ± 0.587
2.339GluAsn: 2.339 ± 1.169
0.585GluPro: 0.585 ± 0.427
0.0GluGln: 0.0 ± 0.0
2.339GluArg: 2.339 ± 1.236
3.509GluSer: 3.509 ± 1.649
3.509GluThr: 3.509 ± 1.127
0.585GluVal: 0.585 ± 0.528
0.585GluTrp: 0.585 ± 0.427
1.17GluTyr: 1.17 ± 0.446
0.0GluXaa: 0.0 ± 0.0
Phe
1.754PheAla: 1.754 ± 1.264
0.585PheCys: 0.585 ± 0.427
0.585PheAsp: 0.585 ± 0.427
1.754PheGlu: 1.754 ± 0.945
0.585PhePhe: 0.585 ± 0.516
1.754PheGly: 1.754 ± 0.704
0.585PheHis: 0.585 ± 0.427
1.754PheIle: 1.754 ± 0.775
1.17PheLys: 1.17 ± 0.804
1.17PheLeu: 1.17 ± 0.927
1.754PheMet: 1.754 ± 0.865
1.754PheAsn: 1.754 ± 0.751
2.339PhePro: 2.339 ± 1.145
2.339PheGln: 2.339 ± 1.349
4.678PheArg: 4.678 ± 1.495
1.17PheSer: 1.17 ± 0.446
4.094PheThr: 4.094 ± 0.678
4.094PheVal: 4.094 ± 2.414
0.585PheTrp: 0.585 ± 0.516
2.339PheTyr: 2.339 ± 1.075
0.0PheXaa: 0.0 ± 0.0
Gly
7.602GlyAla: 7.602 ± 2.406
0.585GlyCys: 0.585 ± 0.845
2.339GlyAsp: 2.339 ± 1.657
1.754GlyGlu: 1.754 ± 0.704
2.924GlyPhe: 2.924 ± 0.687
3.509GlyGly: 3.509 ± 1.244
0.585GlyHis: 0.585 ± 0.427
5.263GlyIle: 5.263 ± 1.527
6.433GlyLys: 6.433 ± 2.153
4.094GlyLeu: 4.094 ± 1.425
1.754GlyMet: 1.754 ± 0.899
4.094GlyAsn: 4.094 ± 0.826
1.17GlyPro: 1.17 ± 0.587
3.509GlyGln: 3.509 ± 1.242
2.924GlyArg: 2.924 ± 1.098
4.094GlySer: 4.094 ± 1.349
2.924GlyThr: 2.924 ± 0.643
3.509GlyVal: 3.509 ± 1.566
1.17GlyTrp: 1.17 ± 0.854
2.924GlyTyr: 2.924 ± 0.941
0.0GlyXaa: 0.0 ± 0.0
His
1.17HisAla: 1.17 ± 0.854
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.754HisPhe: 1.754 ± 0.982
2.339HisGly: 2.339 ± 0.955
0.0HisHis: 0.0 ± 0.0
1.17HisIle: 1.17 ± 1.032
0.585HisLys: 0.585 ± 0.516
2.339HisLeu: 2.339 ± 0.892
0.0HisMet: 0.0 ± 0.0
1.754HisAsn: 1.754 ± 0.678
1.17HisPro: 1.17 ± 0.927
1.754HisGln: 1.754 ± 0.751
0.585HisArg: 0.585 ± 0.516
0.585HisSer: 0.585 ± 0.516
2.339HisThr: 2.339 ± 0.799
0.585HisVal: 0.585 ± 0.528
2.339HisTrp: 2.339 ± 1.075
1.17HisTyr: 1.17 ± 1.032
0.0HisXaa: 0.0 ± 0.0
Ile
8.187IleAla: 8.187 ± 1.842
0.0IleCys: 0.0 ± 0.0
5.263IleAsp: 5.263 ± 2.037
1.17IleGlu: 1.17 ± 0.859
0.0IlePhe: 0.0 ± 0.0
3.509IleGly: 3.509 ± 0.885
0.0IleHis: 0.0 ± 0.0
1.17IleIle: 1.17 ± 0.625
2.924IleLys: 2.924 ± 1.086
2.339IleLeu: 2.339 ± 1.706
3.509IleMet: 3.509 ± 1.999
2.339IleAsn: 2.339 ± 1.249
2.924IlePro: 2.924 ± 1.483
5.263IleGln: 5.263 ± 1.097
2.924IleArg: 2.924 ± 1.694
4.094IleSer: 4.094 ± 1.044
2.339IleThr: 2.339 ± 1.173
1.17IleVal: 1.17 ± 0.943
1.17IleTrp: 1.17 ± 0.446
1.17IleTyr: 1.17 ± 1.032
0.0IleXaa: 0.0 ± 0.0
Lys
5.263LysAla: 5.263 ± 0.965
0.0LysCys: 0.0 ± 0.0
7.018LysAsp: 7.018 ± 3.485
1.754LysGlu: 1.754 ± 0.899
1.754LysPhe: 1.754 ± 0.751
5.263LysGly: 5.263 ± 1.817
0.585LysHis: 0.585 ± 0.572
2.924LysIle: 2.924 ± 0.928
2.339LysLys: 2.339 ± 0.869
4.094LysLeu: 4.094 ± 1.999
4.678LysMet: 4.678 ± 1.633
1.754LysAsn: 1.754 ± 0.751
1.754LysPro: 1.754 ± 1.281
4.094LysGln: 4.094 ± 2.236
0.585LysArg: 0.585 ± 0.427
4.094LysSer: 4.094 ± 1.674
2.924LysThr: 2.924 ± 1.186
1.754LysVal: 1.754 ± 0.971
1.17LysTrp: 1.17 ± 0.865
1.754LysTyr: 1.754 ± 0.865
0.0LysXaa: 0.0 ± 0.0
Leu
7.602LeuAla: 7.602 ± 2.4
0.585LeuCys: 0.585 ± 0.572
4.678LeuAsp: 4.678 ± 1.606
2.924LeuGlu: 2.924 ± 0.896
2.924LeuPhe: 2.924 ± 0.643
4.678LeuGly: 4.678 ± 1.159
1.754LeuHis: 1.754 ± 0.865
4.094LeuIle: 4.094 ± 1.139
7.018LeuLys: 7.018 ± 2.079
5.263LeuLeu: 5.263 ± 1.522
2.924LeuMet: 2.924 ± 0.967
4.094LeuAsn: 4.094 ± 1.928
2.339LeuPro: 2.339 ± 0.892
5.848LeuGln: 5.848 ± 1.284
4.678LeuArg: 4.678 ± 1.683
5.263LeuSer: 5.263 ± 2.316
9.357LeuThr: 9.357 ± 2.277
4.678LeuVal: 4.678 ± 1.328
0.585LeuTrp: 0.585 ± 0.427
1.17LeuTyr: 1.17 ± 0.446
0.0LeuXaa: 0.0 ± 0.0
Met
2.339MetAla: 2.339 ± 1.075
0.0MetCys: 0.0 ± 0.0
0.585MetAsp: 0.585 ± 0.427
1.17MetGlu: 1.17 ± 0.927
0.585MetPhe: 0.585 ± 0.572
1.17MetGly: 1.17 ± 0.587
0.0MetHis: 0.0 ± 0.0
0.585MetIle: 0.585 ± 0.845
2.924MetLys: 2.924 ± 0.924
3.509MetLeu: 3.509 ± 1.607
0.585MetMet: 0.585 ± 0.784
2.339MetAsn: 2.339 ± 1.019
0.585MetPro: 0.585 ± 0.427
2.339MetGln: 2.339 ± 1.475
2.924MetArg: 2.924 ± 0.967
4.094MetSer: 4.094 ± 1.471
2.339MetThr: 2.339 ± 1.353
1.17MetVal: 1.17 ± 0.601
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.678AsnAla: 4.678 ± 1.812
0.0AsnCys: 0.0 ± 0.0
1.754AsnAsp: 1.754 ± 0.421
1.17AsnGlu: 1.17 ± 0.943
2.924AsnPhe: 2.924 ± 1.086
1.754AsnGly: 1.754 ± 0.775
0.0AsnHis: 0.0 ± 0.0
2.339AsnIle: 2.339 ± 1.853
2.924AsnLys: 2.924 ± 1.407
5.263AsnLeu: 5.263 ± 1.625
1.754AsnMet: 1.754 ± 0.923
2.924AsnAsn: 2.924 ± 0.818
4.678AsnPro: 4.678 ± 0.804
2.924AsnGln: 2.924 ± 1.666
1.754AsnArg: 1.754 ± 0.865
4.094AsnSer: 4.094 ± 1.38
6.433AsnThr: 6.433 ± 1.377
2.339AsnVal: 2.339 ± 0.725
0.0AsnTrp: 0.0 ± 0.0
2.924AsnTyr: 2.924 ± 0.687
0.0AsnXaa: 0.0 ± 0.0
Pro
1.754ProAla: 1.754 ± 1.312
0.0ProCys: 0.0 ± 0.0
1.754ProAsp: 1.754 ± 0.899
2.924ProGlu: 2.924 ± 0.9
1.17ProPhe: 1.17 ± 0.446
1.17ProGly: 1.17 ± 0.657
0.585ProHis: 0.585 ± 0.516
1.754ProIle: 1.754 ± 0.904
1.754ProLys: 1.754 ± 0.704
5.263ProLeu: 5.263 ± 1.729
0.0ProMet: 0.0 ± 0.0
2.924ProAsn: 2.924 ± 1.671
2.339ProPro: 2.339 ± 1.441
1.754ProGln: 1.754 ± 0.678
1.754ProArg: 1.754 ± 1.239
2.924ProSer: 2.924 ± 1.52
5.263ProThr: 5.263 ± 1.989
5.848ProVal: 5.848 ± 2.258
1.17ProTrp: 1.17 ± 0.587
0.585ProTyr: 0.585 ± 0.427
0.0ProXaa: 0.0 ± 0.0
Gln
5.848GlnAla: 5.848 ± 2.083
0.0GlnCys: 0.0 ± 0.0
1.17GlnAsp: 1.17 ± 0.854
2.924GlnGlu: 2.924 ± 0.791
1.754GlnPhe: 1.754 ± 1.225
2.924GlnGly: 2.924 ± 1.282
1.17GlnHis: 1.17 ± 0.601
1.17GlnIle: 1.17 ± 0.601
2.924GlnLys: 2.924 ± 1.625
7.018GlnLeu: 7.018 ± 1.32
0.585GlnMet: 0.585 ± 0.528
4.678GlnAsn: 4.678 ± 1.29
1.17GlnPro: 1.17 ± 0.625
2.339GlnGln: 2.339 ± 1.475
1.17GlnArg: 1.17 ± 0.446
2.924GlnSer: 2.924 ± 1.074
5.848GlnThr: 5.848 ± 1.918
2.924GlnVal: 2.924 ± 1.213
0.585GlnTrp: 0.585 ± 0.516
2.339GlnTyr: 2.339 ± 0.611
0.0GlnXaa: 0.0 ± 0.0
Arg
6.433ArgAla: 6.433 ± 3.824
1.17ArgCys: 1.17 ± 0.446
5.263ArgAsp: 5.263 ± 1.801
0.585ArgGlu: 0.585 ± 0.516
2.339ArgPhe: 2.339 ± 1.349
3.509ArgGly: 3.509 ± 1.507
2.924ArgHis: 2.924 ± 2.033
2.339ArgIle: 2.339 ± 1.707
1.754ArgLys: 1.754 ± 1.048
4.094ArgLeu: 4.094 ± 1.767
1.754ArgMet: 1.754 ± 0.899
1.17ArgAsn: 1.17 ± 0.865
1.17ArgPro: 1.17 ± 0.446
1.754ArgGln: 1.754 ± 0.899
3.509ArgArg: 3.509 ± 1.052
2.924ArgSer: 2.924 ± 0.643
2.924ArgThr: 2.924 ± 0.967
3.509ArgVal: 3.509 ± 1.519
0.0ArgTrp: 0.0 ± 0.0
2.924ArgTyr: 2.924 ± 0.853
0.0ArgXaa: 0.0 ± 0.0
Ser
9.357SerAla: 9.357 ± 2.744
0.0SerCys: 0.0 ± 0.0
3.509SerAsp: 3.509 ± 1.333
1.17SerGlu: 1.17 ± 0.943
1.754SerPhe: 1.754 ± 1.225
2.924SerGly: 2.924 ± 1.21
1.754SerHis: 1.754 ± 0.678
2.924SerIle: 2.924 ± 1.66
4.094SerLys: 4.094 ± 1.885
6.433SerLeu: 6.433 ± 1.153
3.509SerMet: 3.509 ± 1.396
4.094SerAsn: 4.094 ± 1.223
4.094SerPro: 4.094 ± 1.261
2.339SerGln: 2.339 ± 1.272
7.602SerArg: 7.602 ± 1.892
4.678SerSer: 4.678 ± 1.957
4.678SerThr: 4.678 ± 1.115
5.263SerVal: 5.263 ± 1.14
0.585SerTrp: 0.585 ± 0.516
2.339SerTyr: 2.339 ± 0.955
0.0SerXaa: 0.0 ± 0.0
Thr
5.263ThrAla: 5.263 ± 1.825
0.585ThrCys: 0.585 ± 0.516
4.094ThrAsp: 4.094 ± 1.228
4.678ThrGlu: 4.678 ± 2.29
2.924ThrPhe: 2.924 ± 0.853
3.509ThrGly: 3.509 ± 2.601
2.924ThrHis: 2.924 ± 1.186
3.509ThrIle: 3.509 ± 1.365
5.848ThrLys: 5.848 ± 2.189
8.772ThrLeu: 8.772 ± 1.98
0.585ThrMet: 0.585 ± 0.427
2.339ThrAsn: 2.339 ± 1.272
4.094ThrPro: 4.094 ± 1.026
5.263ThrGln: 5.263 ± 2.108
3.509ThrArg: 3.509 ± 0.753
6.433ThrSer: 6.433 ± 1.747
8.187ThrThr: 8.187 ± 3.326
5.263ThrVal: 5.263 ± 2.332
1.17ThrTrp: 1.17 ± 0.601
0.585ThrTyr: 0.585 ± 0.572
0.0ThrXaa: 0.0 ± 0.0
Val
5.263ValAla: 5.263 ± 2.234
0.0ValCys: 0.0 ± 0.0
5.263ValAsp: 5.263 ± 1.5
3.509ValGlu: 3.509 ± 2.405
1.17ValPhe: 1.17 ± 0.446
5.263ValGly: 5.263 ± 1.595
1.754ValHis: 1.754 ± 1.127
4.094ValIle: 4.094 ± 1.66
2.339ValLys: 2.339 ± 0.816
5.263ValLeu: 5.263 ± 2.327
0.585ValMet: 0.585 ± 0.528
3.509ValAsn: 3.509 ± 1.739
3.509ValPro: 3.509 ± 1.795
2.924ValGln: 2.924 ± 0.928
2.924ValArg: 2.924 ± 0.695
3.509ValSer: 3.509 ± 1.555
4.678ValThr: 4.678 ± 2.239
1.754ValVal: 1.754 ± 1.119
0.585ValTrp: 0.585 ± 0.845
2.924ValTyr: 2.924 ± 1.265
0.0ValXaa: 0.0 ± 0.0
Trp
0.585TrpAla: 0.585 ± 0.516
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.585TrpGlu: 0.585 ± 0.528
0.585TrpPhe: 0.585 ± 0.819
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.17TrpIle: 1.17 ± 0.927
1.17TrpLys: 1.17 ± 0.601
1.17TrpLeu: 1.17 ± 0.865
0.585TrpMet: 0.585 ± 0.516
1.754TrpAsn: 1.754 ± 0.421
1.17TrpPro: 1.17 ± 0.854
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.754TrpSer: 1.754 ± 0.704
1.754TrpThr: 1.754 ± 0.704
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.17TrpTyr: 1.17 ± 0.446
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.339TyrAla: 2.339 ± 0.845
0.0TyrCys: 0.0 ± 0.0
4.094TyrAsp: 4.094 ± 1.522
0.585TyrGlu: 0.585 ± 0.819
4.094TyrPhe: 4.094 ± 0.999
3.509TyrGly: 3.509 ± 0.965
1.17TyrHis: 1.17 ± 1.032
0.585TyrIle: 0.585 ± 0.516
0.0TyrLys: 0.0 ± 0.0
2.924TyrLeu: 2.924 ± 1.186
1.17TyrMet: 1.17 ± 0.854
2.339TyrAsn: 2.339 ± 0.725
1.754TyrPro: 1.754 ± 0.874
0.585TyrGln: 0.585 ± 0.427
2.924TyrArg: 2.924 ± 1.276
1.17TyrSer: 1.17 ± 0.854
0.585TyrThr: 0.585 ± 0.516
4.094TyrVal: 4.094 ± 1.017
0.585TyrTrp: 0.585 ± 0.528
1.17TyrTyr: 1.17 ± 0.944
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1711 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski