Amino acid dipeptide frequency for Escherichia phage If1 (Bacteriophage If1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
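As an illustration of the idea, here is a minimal Python sketch of how dipeptide frequencies could be counted from a set of protein sequences. It is not the database's own code: the function names, the one-letter input alphabet, the mapping of every non-standard letter to X (Xaa), and the choice to count pairs only within a protein (never across protein boundaries) are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein sequence."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille of all counted dipeptides."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MKKLA", "AKKB"]
toy_counts = dipeptide_counts(toy)
toy_freqs = dipeptide_permille(toy)
```

With the toy input, the dipeptide KK (LysLys in the table's three-letter notation) is counted twice among seven dipeptides, and the unknown residue B is counted as X.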

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
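The expected value and the over/under-representation labels can be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, values are taken in per mille as in the table, and the comparison against a plain uniform expectation with a 20-letter alphabet is an assumption, since the page does not say which exact threshold drives the red/blue colouring.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation per dipeptide, in per mille (20 -> 400 dipeptides)."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # shown in red on the page
    if observed_permille < expected:
        return "under"   # shown in blue on the page
    return "expected"


# A few values copied from the table (per mille).
examples = {"LeuSer": 12.146, "AlaAla": 7.287, "CysCys": 0.81, "GluGlu": 0.0}
labels = {dp: classify(value) for dp, value in examples.items()}
```

With 20 amino acids the expectation is 2.5‰; with Xaa included (`alphabet_size=21`) it drops to roughly 2.27‰.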

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 7.287‰ corresponds to 0.007287.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.287 ± 1.397
AlaCys: 0.81 ± 0.836
AlaAsp: 4.858 ± 1.128
AlaGlu: 2.024 ± 0.722
AlaPhe: 3.239 ± 1.022
AlaGly: 5.668 ± 1.368
AlaHis: 0.405 ± 0.4
AlaIle: 4.049 ± 1.338
AlaLys: 4.858 ± 1.686
AlaLeu: 4.858 ± 1.525
AlaMet: 2.429 ± 1.389
AlaAsn: 5.668 ± 1.474
AlaPro: 3.644 ± 0.823
AlaGln: 1.215 ± 0.812
AlaArg: 3.644 ± 1.08
AlaSer: 5.263 ± 1.994
AlaThr: 4.453 ± 1.772
AlaVal: 5.263 ± 1.533
AlaTrp: 0.81 ± 0.609
AlaTyr: 2.834 ± 1.281
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.405 ± 0.418
CysCys: 0.81 ± 0.836
CysAsp: 0.405 ± 0.334
CysGlu: 0.0 ± 0.0
CysPhe: 0.81 ± 0.672
CysGly: 2.429 ± 1.382
CysHis: 0.0 ± 0.0
CysIle: 1.215 ± 0.792
CysLys: 1.215 ± 0.821
CysLeu: 2.429 ± 0.901
CysMet: 0.81 ± 0.531
CysAsn: 1.215 ± 0.632
CysPro: 0.0 ± 0.0
CysGln: 0.405 ± 0.418
CysArg: 1.215 ± 0.804
CysSer: 1.215 ± 0.527
CysThr: 0.81 ± 0.668
CysVal: 1.619 ± 0.922
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.858 ± 1.385
AspCys: 0.0 ± 0.0
AspAsp: 2.429 ± 0.753
AspGlu: 1.619 ± 0.797
AspPhe: 3.644 ± 1.561
AspGly: 5.668 ± 1.575
AspHis: 0.81 ± 0.635
AspIle: 2.834 ± 1.011
AspLys: 3.644 ± 1.628
AspLeu: 5.263 ± 1.542
AspMet: 1.215 ± 0.778
AspAsn: 2.024 ± 0.591
AspPro: 0.405 ± 0.351
AspGln: 0.405 ± 0.418
AspArg: 1.215 ± 0.515
AspSer: 7.692 ± 2.658
AspThr: 2.429 ± 1.603
AspVal: 4.049 ± 1.811
AspTrp: 0.81 ± 0.605
AspTyr: 2.024 ± 0.886
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.215 ± 0.641
GluCys: 2.024 ± 0.711
GluAsp: 0.81 ± 0.498
GluGlu: 0.0 ± 0.0
GluPhe: 2.429 ± 0.93
GluGly: 1.215 ± 0.69
GluHis: 0.81 ± 0.724
GluIle: 3.239 ± 0.995
GluLys: 1.619 ± 0.639
GluLeu: 4.858 ± 1.371
GluMet: 1.619 ± 0.801
GluAsn: 2.834 ± 0.713
GluPro: 1.619 ± 1.208
GluGln: 2.024 ± 0.867
GluArg: 1.619 ± 0.797
GluSer: 1.619 ± 0.854
GluThr: 4.049 ± 1.149
GluVal: 0.81 ± 0.604
GluTrp: 0.0 ± 0.0
GluTyr: 2.024 ± 0.682
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.644 ± 1.315
PheCys: 1.215 ± 0.607
PheAsp: 4.453 ± 1.077
PheGlu: 4.049 ± 0.731
PhePhe: 2.834 ± 1.164
PheGly: 1.215 ± 0.693
PheHis: 0.81 ± 0.635
PheIle: 2.024 ± 1.205
PheLys: 1.619 ± 0.878
PheLeu: 4.049 ± 0.941
PheMet: 0.81 ± 0.546
PheAsn: 3.644 ± 1.359
PhePro: 1.619 ± 0.884
PheGln: 2.024 ± 0.859
PheArg: 1.619 ± 0.95
PheSer: 4.858 ± 0.719
PheThr: 3.644 ± 1.261
PheVal: 3.239 ± 0.747
PheTrp: 0.405 ± 0.334
PheTyr: 0.405 ± 0.362
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.834 ± 0.581
GlyCys: 0.405 ± 0.362
GlyAsp: 4.049 ± 1.621
GlyGlu: 2.429 ± 1.073
GlyPhe: 4.049 ± 1.637
GlyGly: 7.287 ± 2.749
GlyHis: 2.024 ± 0.893
GlyIle: 5.263 ± 1.262
GlyLys: 4.453 ± 1.194
GlyLeu: 7.287 ± 2.07
GlyMet: 0.81 ± 0.426
GlyAsn: 6.478 ± 2.002
GlyPro: 0.0 ± 0.0
GlyGln: 3.239 ± 0.946
GlyArg: 4.049 ± 0.901
GlySer: 7.287 ± 3.282
GlyThr: 4.049 ± 1.317
GlyVal: 2.024 ± 1.116
GlyTrp: 2.024 ± 1.05
GlyTyr: 2.834 ± 1.368
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.619 ± 0.787
HisCys: 0.405 ± 0.45
HisAsp: 0.405 ± 0.351
HisGlu: 1.215 ± 0.607
HisPhe: 0.0 ± 0.0
HisGly: 1.215 ± 0.654
HisHis: 0.0 ± 0.0
HisIle: 0.405 ± 0.4
HisLys: 1.215 ± 0.52
HisLeu: 1.619 ± 1.269
HisMet: 0.405 ± 0.415
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.81 ± 0.635
HisSer: 2.429 ± 1.184
HisThr: 1.215 ± 1.254
HisVal: 2.024 ± 0.757
HisTrp: 0.405 ± 0.334
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.073 ± 2.096
IleCys: 0.405 ± 0.572
IleAsp: 1.619 ± 0.611
IleGlu: 1.619 ± 0.761
IlePhe: 4.453 ± 1.315
IleGly: 6.478 ± 2.072
IleHis: 0.81 ± 0.541
IleIle: 5.263 ± 1.277
IleLys: 5.668 ± 2.015
IleLeu: 4.453 ± 1.058
IleMet: 1.215 ± 0.468
IleAsn: 1.619 ± 0.657
IlePro: 2.024 ± 0.885
IleGln: 2.429 ± 0.699
IleArg: 2.834 ± 0.811
IleSer: 6.073 ± 1.498
IleThr: 6.478 ± 1.873
IleVal: 2.429 ± 1.02
IleTrp: 0.0 ± 0.0
IleTyr: 2.024 ± 0.733
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.239 ± 0.896
LysCys: 1.619 ± 0.918
LysAsp: 4.049 ± 1.042
LysGlu: 3.239 ± 0.977
LysPhe: 2.834 ± 1.039
LysGly: 2.429 ± 0.985
LysHis: 0.81 ± 0.546
LysIle: 6.073 ± 1.032
LysLys: 6.883 ± 1.667
LysLeu: 6.073 ± 2.166
LysMet: 0.81 ± 0.463
LysAsn: 2.834 ± 1.601
LysPro: 4.049 ± 1.838
LysGln: 6.073 ± 1.367
LysArg: 2.834 ± 1.13
LysSer: 5.668 ± 0.927
LysThr: 0.81 ± 0.478
LysVal: 4.453 ± 1.256
LysTrp: 0.0 ± 0.0
LysTyr: 2.024 ± 0.744
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.478 ± 2.513
LeuCys: 1.215 ± 0.641
LeuAsp: 5.668 ± 1.686
LeuGlu: 2.834 ± 1.464
LeuPhe: 3.239 ± 0.797
LeuGly: 4.453 ± 1.762
LeuHis: 2.834 ± 1.436
LeuIle: 5.668 ± 1.705
LeuLys: 5.263 ± 1.095
LeuLeu: 6.073 ± 2.093
LeuMet: 1.215 ± 0.711
LeuAsn: 2.024 ± 0.758
LeuPro: 4.453 ± 1.189
LeuGln: 2.429 ± 1.185
LeuArg: 3.644 ± 1.157
LeuSer: 12.146 ± 1.941
LeuThr: 3.644 ± 0.898
LeuVal: 5.668 ± 1.86
LeuTrp: 0.405 ± 0.418
LeuTyr: 2.429 ± 0.934
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.81 ± 0.416
MetCys: 0.405 ± 0.334
MetAsp: 2.429 ± 0.904
MetGlu: 0.405 ± 0.4
MetPhe: 0.405 ± 0.334
MetGly: 1.215 ± 0.641
MetHis: 0.0 ± 0.0
MetIle: 0.405 ± 0.418
MetLys: 1.215 ± 0.56
MetLeu: 0.405 ± 0.45
MetMet: 0.0 ± 0.0
MetAsn: 0.81 ± 0.668
MetPro: 1.619 ± 0.51
MetGln: 0.405 ± 0.418
MetArg: 1.215 ± 0.652
MetSer: 2.429 ± 1.137
MetThr: 0.405 ± 0.362
MetVal: 1.619 ± 0.648
MetTrp: 0.81 ± 0.463
MetTyr: 0.81 ± 0.635
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.073 ± 0.923
AsnCys: 0.405 ± 0.334
AsnAsp: 0.81 ± 0.668
AsnGlu: 1.215 ± 0.801
AsnPhe: 0.81 ± 0.463
AsnGly: 6.478 ± 1.866
AsnHis: 1.215 ± 0.816
AsnIle: 2.834 ± 1.065
AsnLys: 3.644 ± 1.203
AsnLeu: 3.239 ± 0.631
AsnMet: 0.405 ± 0.418
AsnAsn: 4.049 ± 1.214
AsnPro: 1.619 ± 0.706
AsnGln: 0.81 ± 0.463
AsnArg: 2.834 ± 1.21
AsnSer: 2.834 ± 0.908
AsnThr: 2.429 ± 1.231
AsnVal: 5.263 ± 1.434
AsnTrp: 1.215 ± 0.471
AsnTyr: 1.619 ± 0.965
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.239 ± 1.052
ProCys: 1.215 ± 0.698
ProAsp: 2.429 ± 0.71
ProGlu: 2.429 ± 0.604
ProPhe: 2.024 ± 0.819
ProGly: 2.834 ± 0.919
ProHis: 0.0 ± 0.0
ProIle: 0.0 ± 0.0
ProLys: 0.405 ± 0.362
ProLeu: 2.429 ± 0.517
ProMet: 0.0 ± 0.0
ProAsn: 0.405 ± 0.418
ProPro: 2.024 ± 0.767
ProGln: 2.024 ± 0.833
ProArg: 1.619 ± 0.751
ProSer: 5.263 ± 1.105
ProThr: 1.619 ± 0.789
ProVal: 7.287 ± 2.186
ProTrp: 0.405 ± 0.351
ProTyr: 1.215 ± 0.624
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.429 ± 1.042
GlnCys: 0.81 ± 0.463
GlnAsp: 1.215 ± 0.769
GlnGlu: 1.215 ± 0.706
GlnPhe: 1.619 ± 0.423
GlnGly: 1.215 ± 0.69
GlnHis: 0.405 ± 0.351
GlnIle: 1.619 ± 1.092
GlnLys: 4.858 ± 2.015
GlnLeu: 2.429 ± 1.628
GlnMet: 0.0 ± 0.0
GlnAsn: 1.215 ± 0.682
GlnPro: 1.619 ± 0.423
GlnGln: 0.405 ± 0.418
GlnArg: 1.619 ± 1.322
GlnSer: 4.453 ± 1.311
GlnThr: 4.453 ± 1.519
GlnVal: 2.834 ± 1.121
GlnTrp: 1.215 ± 0.706
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.263 ± 1.558
ArgCys: 1.215 ± 1.254
ArgAsp: 1.619 ± 1.004
ArgGlu: 2.024 ± 1.029
ArgPhe: 2.429 ± 1.302
ArgGly: 2.024 ± 0.582
ArgHis: 0.405 ± 0.351
ArgIle: 5.263 ± 1.873
ArgLys: 2.834 ± 1.146
ArgLeu: 4.453 ± 1.514
ArgMet: 2.024 ± 1.224
ArgAsn: 2.024 ± 0.682
ArgPro: 1.215 ± 0.706
ArgGln: 1.619 ± 1.242
ArgArg: 1.619 ± 1.242
ArgSer: 1.619 ± 0.597
ArgThr: 0.81 ± 0.701
ArgVal: 4.858 ± 1.409
ArgTrp: 0.81 ± 0.496
ArgTyr: 1.619 ± 0.743
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.263 ± 1.18
SerCys: 1.215 ± 0.727
SerAsp: 8.502 ± 1.367
SerGlu: 2.429 ± 0.895
SerPhe: 5.668 ± 1.319
SerGly: 9.717 ± 3.231
SerHis: 1.619 ± 0.833
SerIle: 5.668 ± 1.029
SerLys: 6.073 ± 0.865
SerLeu: 6.883 ± 1.525
SerMet: 1.619 ± 0.699
SerAsn: 4.453 ± 1.788
SerPro: 3.644 ± 0.993
SerGln: 4.049 ± 2.346
SerArg: 6.073 ± 1.21
SerSer: 4.453 ± 1.574
SerThr: 4.049 ± 1.254
SerVal: 6.883 ± 1.801
SerTrp: 1.619 ± 0.654
SerTyr: 5.263 ± 1.885
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.453 ± 1.833
ThrCys: 0.81 ± 0.546
ThrAsp: 2.834 ± 1.055
ThrGlu: 1.215 ± 0.742
ThrPhe: 1.619 ± 0.657
ThrGly: 5.668 ± 1.869
ThrHis: 1.619 ± 0.651
ThrIle: 3.239 ± 0.773
ThrLys: 2.429 ± 1.16
ThrLeu: 4.049 ± 0.816
ThrMet: 1.619 ± 0.614
ThrAsn: 1.619 ± 0.714
ThrPro: 4.453 ± 1.64
ThrGln: 2.024 ± 0.848
ThrArg: 2.834 ± 1.267
ThrSer: 6.073 ± 1.116
ThrThr: 6.883 ± 1.62
ThrVal: 6.478 ± 1.358
ThrTrp: 0.405 ± 0.362
ThrTyr: 2.429 ± 0.933
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.858 ± 1.594
ValCys: 0.81 ± 0.463
ValAsp: 1.619 ± 0.74
ValGlu: 3.644 ± 0.714
ValPhe: 3.239 ± 1.276
ValGly: 4.453 ± 1.447
ValHis: 0.81 ± 0.496
ValIle: 5.668 ± 1.859
ValLys: 5.668 ± 1.296
ValLeu: 5.668 ± 1.563
ValMet: 0.0 ± 0.0
ValAsn: 4.858 ± 1.402
ValPro: 3.644 ± 1.833
ValGln: 2.834 ± 1.221
ValArg: 2.834 ± 0.832
ValSer: 9.312 ± 1.62
ValThr: 6.478 ± 1.631
ValVal: 6.478 ± 1.892
ValTrp: 0.81 ± 0.463
ValTyr: 3.239 ± 1.083
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.81 ± 0.555
TrpCys: 0.0 ± 0.0
TrpAsp: 0.405 ± 0.362
TrpGlu: 0.405 ± 0.334
TrpPhe: 2.024 ± 0.757
TrpGly: 0.81 ± 0.801
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.619 ± 1.212
TrpLeu: 0.81 ± 0.605
TrpMet: 0.0 ± 0.0
TrpAsn: 0.405 ± 0.471
TrpPro: 0.0 ± 0.0
TrpGln: 0.405 ± 0.418
TrpArg: 0.405 ± 0.418
TrpSer: 0.0 ± 0.0
TrpThr: 1.215 ± 0.662
TrpVal: 1.215 ± 0.707
TrpTrp: 0.0 ± 0.0
TrpTyr: 2.024 ± 0.902
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.834 ± 1.02
TyrCys: 1.215 ± 0.753
TyrAsp: 2.429 ± 1.014
TyrGlu: 2.429 ± 0.61
TyrPhe: 0.81 ± 0.582
TyrGly: 0.405 ± 0.362
TyrHis: 0.0 ± 0.0
TyrIle: 3.239 ± 0.636
TyrLys: 1.619 ± 0.577
TyrLeu: 4.453 ± 1.595
TyrMet: 0.405 ± 0.362
TyrAsn: 1.619 ± 0.965
TyrPro: 1.215 ± 0.728
TyrGln: 0.81 ± 0.65
TyrArg: 1.215 ± 0.607
TyrSer: 4.453 ± 1.053
TyrThr: 2.834 ± 1.447
TyrVal: 2.429 ± 1.121
TyrTrp: 0.405 ± 0.4
TyrTyr: 1.215 ± 0.937
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (2471 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
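A protein-level bootstrap of this kind can be sketched in Python as shown below. This is only a rough illustration, not the database's actual procedure: the function names, the use of the sample standard deviation as the reported error, the fixed random seed, and the one-letter toy sequences are all assumptions made for the example.

```python
import random
import statistics


def permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency over proteomes resampled protein by protein."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    # Assumption: the error is the standard deviation of the resampled estimates.
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the phage sequences.
toy_proteome = ["MKKLA", "AKKA", "GGSGG", "MLSVT"]
kk_error = bootstrap_error(toy_proteome, "KK")
```

Resampling whole proteins rather than individual residues keeps each dipeptide inside its original sequence context. A dipeptide that never occurs gets an error of 0.0, which is consistent with the `0.0 ± 0.0` entries in the table.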

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski