Amino acid dipeptide frequency for Escherichia phage Lilleen

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
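As a minimal sketch of how such frequencies could be computed, the Python example below counts overlapping adjacent residue pairs and compares each frequency with the uniform expectation of 1000/400 = 2.5‰. The function names, the choice to count pairs only within a protein (never across protein boundaries), and the mapping of unknown residues to Xaa are assumptions made for illustration and are not taken from the Proteome-pI implementation.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption of this sketch: dipeptides are overlapping adjacent pairs
    counted inside each protein, and any non-standard letter becomes Xaa.
    """
    counts = Counter()
    for seq in proteins:
        residues = [THREE_LETTER.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_per_mille, n_residue_types=20):
    """Compare a frequency with the uniform expectation 1000 / n**2 per mille."""
    expected = 1000.0 / n_residue_types ** 2  # 2.5 per mille for 20 residue types
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    # Toy sequences, not taken from the phage proteome.
    for pair, freq in sorted(dipeptide_per_mille(["MAAG", "AGX"]).items()):
        print(pair, round(freq, 3), representation(freq))
```

With only a handful of toy pairs every observed dipeptide is far above 2.5‰; the comparison becomes meaningful only for a whole proteome.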

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
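The unit conversion can be illustrated with a short Python sketch. The helper names are hypothetical and serve only to show the arithmetic; the sample value is the AlaGly entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value / 10.0


if __name__ == "__main__":
    ala_gly = 10.882  # AlaGly frequency in per mille, from the table
    print(per_mille_to_fraction(ala_gly))
    print(per_mille_to_percent(ala_gly))
```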

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.018 ± 1.346
AlaCys: 1.718 ± 1.372
AlaAsp: 7.446 ± 1.811
AlaGlu: 2.864 ± 1.321
AlaPhe: 2.864 ± 1.377
AlaGly: 10.882 ± 4.296
AlaHis: 2.864 ± 2.241
AlaIle: 5.727 ± 1.458
AlaLys: 6.3 ± 1.718
AlaLeu: 6.3 ± 2.976
AlaMet: 1.145 ± 0.479
AlaAsn: 1.718 ± 0.413
AlaPro: 4.009 ± 0.908
AlaGln: 4.009 ± 1.368
AlaArg: 1.145 ± 0.977
AlaSer: 10.309 ± 1.854
AlaThr: 7.446 ± 1.429
AlaVal: 8.018 ± 2.51
AlaTrp: 0.573 ± 0.427
AlaTyr: 1.718 ± 0.655
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.573 ± 0.611
CysCys: 0.573 ± 0.427
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.573 ± 0.427
CysGly: 0.0 ± 0.0
CysHis: 0.573 ± 0.427
CysIle: 0.0 ± 0.0
CysLys: 0.573 ± 0.712
CysLeu: 1.145 ± 0.732
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.573 ± 0.448
CysGln: 0.0 ± 0.0
CysArg: 0.573 ± 0.611
CysSer: 0.573 ± 0.448
CysThr: 0.573 ± 0.427
CysVal: 1.145 ± 0.732
CysTrp: 0.0 ± 0.0
CysTyr: 1.145 ± 0.896
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.727 ± 2.094
AspCys: 1.145 ± 0.621
AspAsp: 2.864 ± 0.9
AspGlu: 4.009 ± 2.065
AspPhe: 1.718 ± 1.372
AspGly: 4.009 ± 1.512
AspHis: 1.145 ± 0.479
AspIle: 5.727 ± 1.501
AspLys: 1.718 ± 0.99
AspLeu: 2.291 ± 0.829
AspMet: 0.573 ± 0.448
AspAsn: 4.009 ± 1.282
AspPro: 1.145 ± 0.479
AspGln: 2.291 ± 0.982
AspArg: 3.436 ± 1.221
AspSer: 4.582 ± 1.393
AspThr: 5.155 ± 1.222
AspVal: 2.291 ± 0.574
AspTrp: 0.573 ± 0.596
AspTyr: 4.009 ± 0.95
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.291 ± 0.574
GluCys: 1.718 ± 0.823
GluAsp: 1.718 ± 1.372
GluGlu: 2.864 ± 1.282
GluPhe: 2.291 ± 2.272
GluGly: 2.291 ± 1.281
GluHis: 1.145 ± 0.713
GluIle: 3.436 ± 0.545
GluLys: 1.145 ± 0.713
GluLeu: 4.009 ± 1.516
GluMet: 1.145 ± 0.602
GluAsn: 2.291 ± 1.209
GluPro: 0.573 ± 0.448
GluGln: 0.0 ± 0.0
GluArg: 1.718 ± 0.801
GluSer: 3.436 ± 1.624
GluThr: 2.864 ± 0.996
GluVal: 0.573 ± 0.561
GluTrp: 0.573 ± 0.448
GluTyr: 1.145 ± 0.479
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.718 ± 1.049
PheCys: 0.573 ± 0.448
PheAsp: 0.573 ± 0.448
PheGlu: 1.718 ± 1.063
PhePhe: 0.573 ± 0.427
PheGly: 2.291 ± 0.617
PheHis: 0.573 ± 0.448
PheIle: 2.291 ± 0.804
PheLys: 1.145 ± 0.716
PheLeu: 1.145 ± 0.745
PheMet: 1.718 ± 0.789
PheAsn: 1.718 ± 0.675
PhePro: 2.291 ± 0.986
PheGln: 1.718 ± 0.983
PheArg: 4.582 ± 1.307
PheSer: 1.145 ± 0.479
PheThr: 3.436 ± 0.811
PheVal: 2.864 ± 1.697
PheTrp: 0.573 ± 0.427
PheTyr: 2.291 ± 1.236
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.018 ± 2.669
GlyCys: 0.573 ± 0.712
GlyAsp: 2.291 ± 1.321
GlyGlu: 1.718 ± 0.823
GlyPhe: 2.864 ± 0.697
GlyGly: 4.582 ± 2.423
GlyHis: 0.573 ± 0.448
GlyIle: 5.727 ± 1.878
GlyLys: 6.3 ± 2.195
GlyLeu: 4.582 ± 1.907
GlyMet: 1.718 ± 0.8
GlyAsn: 4.009 ± 0.68
GlyPro: 1.145 ± 0.602
GlyGln: 3.436 ± 1.165
GlyArg: 3.436 ± 1.436
GlySer: 4.582 ± 1.321
GlyThr: 2.291 ± 0.574
GlyVal: 3.436 ± 1.454
GlyTrp: 1.145 ± 0.896
GlyTyr: 2.864 ± 0.9
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.145 ± 0.896
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.718 ± 0.698
HisGly: 2.291 ± 0.905
HisHis: 0.0 ± 0.0
HisIle: 1.145 ± 0.855
HisLys: 1.145 ± 0.855
HisLeu: 2.291 ± 0.958
HisMet: 0.0 ± 0.0
HisAsn: 1.145 ± 0.479
HisPro: 1.145 ± 0.745
HisGln: 1.718 ± 0.675
HisArg: 0.573 ± 0.427
HisSer: 1.145 ± 0.855
HisThr: 2.291 ± 0.795
HisVal: 0.0 ± 0.0
HisTrp: 2.291 ± 1.236
HisTyr: 1.145 ± 0.855
HisXaa: 0.0 ± 0.0
Ile
IleAla: 10.309 ± 2.293
IleCys: 0.0 ± 0.0
IleAsp: 5.727 ± 1.761
IleGlu: 1.145 ± 0.713
IlePhe: 0.0 ± 0.0
IleGly: 2.864 ± 0.996
IleHis: 0.0 ± 0.0
IleIle: 1.145 ± 0.621
IleLys: 2.864 ± 1.176
IleLeu: 2.291 ± 1.187
IleMet: 2.864 ± 1.477
IleAsn: 2.291 ± 1.243
IlePro: 3.436 ± 1.586
IleGln: 5.155 ± 1.512
IleArg: 2.864 ± 1.741
IleSer: 5.155 ± 2.326
IleThr: 2.291 ± 1.122
IleVal: 1.718 ± 1.063
IleTrp: 1.145 ± 0.479
IleTyr: 1.145 ± 0.855
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.155 ± 0.751
LysCys: 0.0 ± 0.0
LysAsp: 7.446 ± 3.288
LysGlu: 1.718 ± 0.8
LysPhe: 1.718 ± 0.675
LysGly: 5.727 ± 1.501
LysHis: 0.0 ± 0.0
LysIle: 2.864 ± 0.931
LysLys: 2.291 ± 0.897
LysLeu: 4.582 ± 1.778
LysMet: 5.155 ± 1.328
LysAsn: 1.718 ± 0.675
LysPro: 1.718 ± 1.345
LysGln: 4.009 ± 1.974
LysArg: 0.573 ± 0.448
LysSer: 3.436 ± 1.448
LysThr: 2.864 ± 1.027
LysVal: 1.718 ± 0.881
LysTrp: 1.145 ± 0.676
LysTyr: 1.718 ± 0.789
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.018 ± 2.128
LeuCys: 0.573 ± 0.611
LeuAsp: 4.582 ± 1.293
LeuGlu: 2.864 ± 0.858
LeuPhe: 2.864 ± 0.669
LeuGly: 4.009 ± 1.343
LeuHis: 1.718 ± 0.789
LeuIle: 4.009 ± 1.174
LeuLys: 6.873 ± 2.009
LeuLeu: 5.727 ± 1.086
LeuMet: 3.436 ± 1.042
LeuAsn: 4.009 ± 1.558
LeuPro: 2.291 ± 0.958
LeuGln: 5.727 ± 1.034
LeuArg: 4.582 ± 1.679
LeuSer: 5.727 ± 2.211
LeuThr: 9.164 ± 2.016
LeuVal: 5.155 ± 1.537
LeuTrp: 0.573 ± 0.448
LeuTyr: 1.718 ± 0.609
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.291 ± 1.236
MetCys: 0.0 ± 0.0
MetAsp: 1.145 ± 0.479
MetGlu: 1.145 ± 0.745
MetPhe: 0.573 ± 0.561
MetGly: 1.718 ± 0.801
MetHis: 0.0 ± 0.0
MetIle: 0.573 ± 0.712
MetLys: 2.864 ± 1.007
MetLeu: 2.864 ± 1.278
MetMet: 0.573 ± 0.885
MetAsn: 2.291 ± 0.873
MetPro: 1.145 ± 0.621
MetGln: 2.291 ± 1.615
MetArg: 2.864 ± 0.996
MetSer: 4.582 ± 1.13
MetThr: 1.145 ± 0.479
MetVal: 1.145 ± 0.526
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.582 ± 1.743
AsnCys: 0.0 ± 0.0
AsnAsp: 1.718 ± 0.413
AsnGlu: 1.145 ± 0.883
AsnPhe: 2.291 ± 1.236
AsnGly: 1.718 ± 0.655
AsnHis: 0.0 ± 0.0
AsnIle: 2.864 ± 2.117
AsnLys: 2.864 ± 1.083
AsnLeu: 5.155 ± 1.537
AsnMet: 1.718 ± 0.801
AsnAsn: 3.436 ± 0.657
AsnPro: 4.582 ± 0.654
AsnGln: 2.864 ± 1.931
AsnArg: 1.718 ± 0.789
AsnSer: 3.436 ± 0.986
AsnThr: 6.3 ± 1.294
AsnVal: 2.864 ± 0.931
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.291 ± 0.617
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.718 ± 1.466
ProCys: 0.0 ± 0.0
ProAsp: 1.718 ± 0.8
ProGlu: 2.864 ± 0.844
ProPhe: 1.145 ± 0.479
ProGly: 1.145 ± 0.767
ProHis: 0.573 ± 0.427
ProIle: 1.718 ± 0.895
ProLys: 1.718 ± 0.823
ProLeu: 5.155 ± 1.654
ProMet: 0.0 ± 0.0
ProAsn: 3.436 ± 1.416
ProPro: 2.291 ± 1.247
ProGln: 1.145 ± 0.479
ProArg: 2.864 ± 1.111
ProSer: 2.291 ± 1.247
ProThr: 5.155 ± 1.712
ProVal: 5.727 ± 2.111
ProTrp: 1.145 ± 0.602
ProTyr: 0.573 ± 0.448
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.727 ± 2.176
GlnCys: 0.0 ± 0.0
GlnAsp: 1.145 ± 0.896
GlnGlu: 2.864 ± 0.828
GlnPhe: 1.145 ± 0.855
GlnGly: 2.864 ± 1.267
GlnHis: 1.145 ± 0.526
GlnIle: 1.145 ± 0.526
GlnLys: 2.864 ± 1.484
GlnLeu: 7.446 ± 1.217
GlnMet: 0.573 ± 0.561
GlnAsn: 3.436 ± 1.578
GlnPro: 1.145 ± 0.621
GlnGln: 2.291 ± 1.615
GlnArg: 1.718 ± 0.413
GlnSer: 2.864 ± 0.936
GlnThr: 5.727 ± 1.647
GlnVal: 2.864 ± 1.305
GlnTrp: 0.573 ± 0.427
GlnTyr: 2.291 ± 0.617
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.3 ± 3.176
ArgCys: 1.145 ± 0.479
ArgAsp: 4.582 ± 1.915
ArgGlu: 0.573 ± 0.427
ArgPhe: 2.291 ± 1.153
ArgGly: 3.436 ± 1.2
ArgHis: 3.436 ± 2.183
ArgIle: 2.291 ± 1.793
ArgLys: 1.718 ± 0.99
ArgLeu: 4.582 ± 1.886
ArgMet: 1.718 ± 0.8
ArgAsn: 1.718 ± 0.87
ArgPro: 1.718 ± 0.413
ArgGln: 1.718 ± 0.8
ArgArg: 4.009 ± 1.284
ArgSer: 2.864 ± 0.669
ArgThr: 2.864 ± 0.996
ArgVal: 4.009 ± 1.358
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.864 ± 0.761
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.737 ± 3.889
SerCys: 0.0 ± 0.0
SerAsp: 2.291 ± 1.186
SerGlu: 1.718 ± 0.833
SerPhe: 1.718 ± 1.008
SerGly: 2.864 ± 1.241
SerHis: 1.718 ± 0.609
SerIle: 2.864 ± 2.002
SerLys: 4.582 ± 1.303
SerLeu: 6.873 ± 1.409
SerMet: 2.864 ± 1.605
SerAsn: 4.009 ± 1.097
SerPro: 2.864 ± 0.998
SerGln: 2.864 ± 1.057
SerArg: 8.018 ± 1.172
SerSer: 4.582 ± 2.174
SerThr: 4.582 ± 0.984
SerVal: 4.009 ± 0.7
SerTrp: 0.573 ± 0.427
SerTyr: 4.009 ± 2.024
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.3 ± 2.513
ThrCys: 0.573 ± 0.427
ThrAsp: 4.009 ± 0.987
ThrGlu: 3.436 ± 1.64
ThrPhe: 2.864 ± 0.761
ThrGly: 3.436 ± 2.097
ThrHis: 2.291 ± 1.175
ThrIle: 3.436 ± 1.255
ThrLys: 5.155 ± 1.838
ThrLeu: 8.018 ± 1.726
ThrMet: 1.145 ± 0.621
ThrAsn: 2.864 ± 1.278
ThrPro: 4.009 ± 0.985
ThrGln: 5.155 ± 1.999
ThrArg: 3.436 ± 0.732
ThrSer: 6.873 ± 1.751
ThrThr: 6.3 ± 2.873
ThrVal: 5.155 ± 2.231
ThrTrp: 1.145 ± 0.526
ThrTyr: 0.573 ± 0.611
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.582 ± 0.926
ValCys: 0.0 ± 0.0
ValAsp: 5.155 ± 1.479
ValGlu: 3.436 ± 2.011
ValPhe: 1.145 ± 0.479
ValGly: 5.727 ± 1.83
ValHis: 1.718 ± 0.896
ValIle: 5.155 ± 1.587
ValLys: 2.291 ± 0.856
ValLeu: 5.155 ± 2.162
ValMet: 0.573 ± 0.561
ValAsn: 3.436 ± 1.662
ValPro: 3.436 ± 1.496
ValGln: 2.864 ± 0.931
ValArg: 2.864 ± 0.606
ValSer: 2.291 ± 0.984
ValThr: 3.436 ± 1.776
ValVal: 1.145 ± 0.621
ValTrp: 0.573 ± 0.712
ValTyr: 2.864 ± 1.432
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.573 ± 0.427
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.573 ± 0.561
TrpPhe: 0.573 ± 0.596
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.145 ± 0.745
TrpLys: 1.145 ± 0.526
TrpLeu: 1.145 ± 0.676
TrpMet: 0.573 ± 0.427
TrpAsn: 1.718 ± 0.413
TrpPro: 1.145 ± 0.896
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.718 ± 0.823
TrpThr: 1.718 ± 0.823
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.145 ± 0.479
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.291 ± 0.861
TyrCys: 0.0 ± 0.0
TyrAsp: 4.009 ± 1.734
TyrGlu: 0.573 ± 0.596
TyrPhe: 4.009 ± 0.988
TyrGly: 3.436 ± 0.892
TyrHis: 1.718 ± 1.282
TyrIle: 1.145 ± 0.716
TyrLys: 0.0 ± 0.0
TyrLeu: 2.864 ± 1.027
TyrMet: 1.718 ± 0.864
TyrAsn: 2.291 ± 0.656
TyrPro: 1.718 ± 0.892
TyrGln: 0.573 ± 0.448
TyrArg: 2.864 ± 1.233
TyrSer: 1.145 ± 0.896
TyrThr: 0.573 ± 0.427
TyrVal: 4.009 ± 1.064
TyrTrp: 0.573 ± 0.561
TyrTyr: 3.436 ± 1.778
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1747 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
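A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only an illustration under stated assumptions. The function names are hypothetical, one-letter sequences are used for brevity, and the use of the standard deviation as the error measure is an assumption, since the page does not say which statistic Proteome-pI reports.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(proteins):
    """Per mille frequency of each adjacent residue pair (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of a pair's frequency over protein-level bootstrap resamples.

    Each round draws len(proteins) whole proteins with replacement.
    Assumption: the error is the standard deviation of the resampled
    frequencies; the database may use a different statistic.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequencies(sample).get(pair, 0.0))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the phage proteome.
    toy = ["MAGAGK", "MKLLAG", "MAAAGT"]
    print(pair_frequencies(toy).get("AG", 0.0), bootstrap_error(toy, "AG"))
```

With only 6 proteins in this proteome, each resample contains few distinct proteins, which is consistent with the relatively large errors in the table.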

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski