Amino acid dipeptide frequency for Enterococcus phage phiFL1A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
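As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter toy sequences, and the choice to normalise by the total number of dipeptides are assumptions made for this example; the exact procedure used by Proteome-pI is not described on this page.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Pairs are counted inside each protein only, never across the
    boundary between two proteins. Normalising by the total number of
    dipeptides is an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (one-letter codes), not phage data.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy)
```

In the toy input, "KK" occurs twice among six dipeptides, so `freqs["KK"]` is one third of 1000.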

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
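The expected value and the over/under labelling can be sketched as follows. This assumes the threshold is the uniform expectation over 400 dipeptides (2.5‰), as the paragraph above implies; the exact colouring rule used on the page is not stated, and the `classify` helper is hypothetical. The four values are taken from the table below.

```python
# Uniform expectation over the 20 x 20 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"    # would be marked in red
    if value_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"


# Values copied from the table (per mille).
examples = {"LysLys": 9.551, "AlaAla": 3.528, "CysCys": 0.086, "HisHis": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
```

If Xaa were included, the expectation would instead be 1000 / 441, roughly 2.27‰.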

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
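A small sketch of the unit conversion, using the LysLys value from the table; the helper names are made up for this example.

```python
import math


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


lyslys = 9.551  # per mille, from the table
fraction = per_mille_to_fraction(lyslys)   # about 0.009551
percent = per_mille_to_percent(lyslys)     # about 0.9551
```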

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.528 ± 1.063
AlaCys: 0.258 ± 0.141
AlaAsp: 3.872 ± 0.554
AlaGlu: 4.388 ± 0.792
AlaPhe: 2.839 ± 0.547
AlaGly: 3.184 ± 0.788
AlaHis: 0.688 ± 0.277
AlaIle: 5.593 ± 0.753
AlaLys: 5.421 ± 0.69
AlaLeu: 6.109 ± 0.884
AlaMet: 1.463 ± 0.355
AlaAsn: 3.7 ± 0.705
AlaPro: 1.033 ± 0.272
AlaGln: 2.151 ± 0.459
AlaArg: 2.409 ± 0.526
AlaSer: 4.044 ± 1.015
AlaThr: 5.593 ± 0.912
AlaVal: 4.044 ± 0.855
AlaTrp: 0.344 ± 0.149
AlaTyr: 3.614 ± 0.55
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.258 ± 0.187
CysCys: 0.086 ± 0.104
CysAsp: 0.172 ± 0.118
CysGlu: 0.43 ± 0.194
CysPhe: 0.086 ± 0.094
CysGly: 0.516 ± 0.266
CysHis: 0.258 ± 0.169
CysIle: 0.172 ± 0.133
CysLys: 1.033 ± 0.411
CysLeu: 0.602 ± 0.274
CysMet: 0.172 ± 0.127
CysAsn: 0.774 ± 0.243
CysPro: 0.258 ± 0.171
CysGln: 0.43 ± 0.182
CysArg: 0.344 ± 0.185
CysSer: 0.688 ± 0.252
CysThr: 0.172 ± 0.128
CysVal: 0.602 ± 0.261
CysTrp: 0.086 ± 0.089
CysTyr: 0.258 ± 0.135
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.098 ± 0.617
AspCys: 0.516 ± 0.265
AspAsp: 3.614 ± 0.633
AspGlu: 6.281 ± 1.113
AspPhe: 2.925 ± 0.459
AspGly: 4.474 ± 0.513
AspHis: 0.43 ± 0.198
AspIle: 3.7 ± 0.454
AspLys: 5.077 ± 0.557
AspLeu: 5.937 ± 0.79
AspMet: 1.205 ± 0.417
AspAsn: 3.27 ± 0.662
AspPro: 1.463 ± 0.334
AspGln: 1.635 ± 0.415
AspArg: 1.979 ± 0.529
AspSer: 4.216 ± 0.786
AspThr: 2.409 ± 0.523
AspVal: 3.614 ± 0.47
AspTrp: 0.946 ± 0.33
AspTyr: 2.495 ± 0.446
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.388 ± 0.686
GluCys: 0.602 ± 0.271
GluAsp: 3.614 ± 0.721
GluGlu: 6.797 ± 1.003
GluPhe: 2.839 ± 0.554
GluGly: 3.786 ± 0.79
GluHis: 1.119 ± 0.344
GluIle: 6.023 ± 0.784
GluLys: 7.228 ± 1.022
GluLeu: 7.572 ± 0.981
GluMet: 1.807 ± 0.414
GluAsn: 4.044 ± 0.659
GluPro: 3.098 ± 0.501
GluGln: 3.614 ± 0.707
GluArg: 3.7 ± 0.705
GluSer: 2.753 ± 0.51
GluThr: 4.991 ± 0.626
GluVal: 5.335 ± 0.737
GluTrp: 1.119 ± 0.335
GluTyr: 2.753 ± 0.5
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.237 ± 0.363
PheCys: 0.344 ± 0.178
PheAsp: 3.356 ± 0.567
PheGlu: 4.13 ± 0.65
PhePhe: 2.237 ± 0.474
PheGly: 2.409 ± 0.472
PheHis: 0.43 ± 0.193
PheIle: 2.323 ± 0.438
PheLys: 3.012 ± 0.489
PheLeu: 2.753 ± 0.515
PheMet: 1.205 ± 0.403
PheAsn: 2.581 ± 0.433
PhePro: 0.516 ± 0.247
PheGln: 1.205 ± 0.379
PheArg: 1.635 ± 0.404
PheSer: 2.151 ± 0.518
PheThr: 2.065 ± 0.48
PheVal: 2.839 ± 0.732
PheTrp: 0.344 ± 0.173
PheTyr: 1.291 ± 0.392
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.302 ± 0.67
GlyCys: 0.172 ± 0.123
GlyAsp: 2.925 ± 0.493
GlyGlu: 3.958 ± 0.667
GlyPhe: 3.098 ± 0.53
GlyGly: 3.356 ± 0.545
GlyHis: 0.86 ± 0.287
GlyIle: 5.593 ± 0.862
GlyLys: 5.421 ± 0.773
GlyLeu: 4.818 ± 0.924
GlyMet: 2.151 ± 0.683
GlyAsn: 4.474 ± 0.822
GlyPro: 0.602 ± 0.24
GlyGln: 1.979 ± 0.441
GlyArg: 1.721 ± 0.426
GlySer: 3.614 ± 0.807
GlyThr: 3.958 ± 0.889
GlyVal: 4.388 ± 0.632
GlyTrp: 0.688 ± 0.253
GlyTyr: 3.614 ± 0.775
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.344 ± 0.191
HisCys: 0.172 ± 0.185
HisAsp: 0.516 ± 0.323
HisGlu: 0.43 ± 0.209
HisPhe: 0.516 ± 0.167
HisGly: 0.688 ± 0.239
HisHis: 0.0 ± 0.0
HisIle: 0.688 ± 0.255
HisLys: 0.516 ± 0.202
HisLeu: 0.86 ± 0.345
HisMet: 0.344 ± 0.151
HisAsn: 0.516 ± 0.204
HisPro: 0.258 ± 0.145
HisGln: 0.344 ± 0.212
HisArg: 0.774 ± 0.297
HisSer: 0.602 ± 0.247
HisThr: 0.43 ± 0.163
HisVal: 1.463 ± 0.409
HisTrp: 0.43 ± 0.238
HisTyr: 1.033 ± 0.329
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.904 ± 0.972
IleCys: 0.516 ± 0.229
IleAsp: 5.421 ± 0.69
IleGlu: 4.904 ± 0.827
IlePhe: 2.151 ± 0.388
IleGly: 4.646 ± 1.059
IleHis: 0.774 ± 0.223
IleIle: 3.528 ± 0.849
IleLys: 7.4 ± 0.867
IleLeu: 6.883 ± 0.799
IleMet: 1.549 ± 0.453
IleAsn: 4.646 ± 0.761
IlePro: 2.495 ± 0.446
IleGln: 2.925 ± 0.477
IleArg: 2.237 ± 0.592
IleSer: 4.646 ± 1.008
IleThr: 3.614 ± 0.585
IleVal: 4.56 ± 0.596
IleTrp: 0.774 ± 0.247
IleTyr: 3.012 ± 0.537
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.023 ± 0.789
LysCys: 0.344 ± 0.182
LysAsp: 4.216 ± 0.642
LysGlu: 9.121 ± 1.127
LysPhe: 2.151 ± 0.419
LysGly: 4.991 ± 0.493
LysHis: 1.205 ± 0.34
LysIle: 6.539 ± 0.916
LysLys: 9.551 ± 1.267
LysLeu: 6.023 ± 0.719
LysMet: 2.839 ± 0.566
LysAsn: 6.109 ± 0.785
LysPro: 1.979 ± 0.445
LysGln: 3.786 ± 0.676
LysArg: 3.528 ± 0.686
LysSer: 5.249 ± 0.69
LysThr: 5.679 ± 0.742
LysVal: 4.904 ± 0.659
LysTrp: 1.291 ± 0.422
LysTyr: 3.356 ± 0.623
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.367 ± 0.656
LeuCys: 0.688 ± 0.208
LeuAsp: 5.937 ± 0.786
LeuGlu: 6.367 ± 0.87
LeuPhe: 3.442 ± 0.673
LeuGly: 4.991 ± 0.728
LeuHis: 0.688 ± 0.256
LeuIle: 4.216 ± 0.79
LeuLys: 6.883 ± 0.875
LeuLeu: 6.023 ± 0.783
LeuMet: 2.151 ± 0.491
LeuAsn: 6.625 ± 0.773
LeuPro: 2.839 ± 0.712
LeuGln: 2.839 ± 0.498
LeuArg: 3.528 ± 0.606
LeuSer: 6.367 ± 0.918
LeuThr: 4.732 ± 0.562
LeuVal: 5.077 ± 0.6
LeuTrp: 0.946 ± 0.309
LeuTyr: 2.753 ± 0.518
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.237 ± 0.495
MetCys: 0.172 ± 0.142
MetAsp: 1.635 ± 0.436
MetGlu: 1.119 ± 0.365
MetPhe: 0.774 ± 0.234
MetGly: 1.549 ± 0.514
MetHis: 0.172 ± 0.128
MetIle: 1.635 ± 0.397
MetLys: 2.409 ± 0.493
MetLeu: 2.151 ± 0.412
MetMet: 0.344 ± 0.171
MetAsn: 2.839 ± 0.872
MetPro: 0.946 ± 0.309
MetGln: 1.463 ± 0.397
MetArg: 1.205 ± 0.391
MetSer: 2.667 ± 0.403
MetThr: 1.119 ± 0.329
MetVal: 1.205 ± 0.345
MetTrp: 0.086 ± 0.089
MetTyr: 0.774 ± 0.293
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.13 ± 0.77
AsnCys: 0.258 ± 0.164
AsnAsp: 3.442 ± 0.658
AsnGlu: 5.937 ± 0.964
AsnPhe: 1.377 ± 0.514
AsnGly: 5.335 ± 0.731
AsnHis: 0.86 ± 0.24
AsnIle: 5.593 ± 0.86
AsnLys: 5.507 ± 0.731
AsnLeu: 4.216 ± 0.767
AsnMet: 1.463 ± 0.273
AsnAsn: 4.13 ± 0.576
AsnPro: 1.807 ± 0.479
AsnGln: 3.356 ± 0.557
AsnArg: 2.839 ± 0.509
AsnSer: 5.163 ± 1.261
AsnThr: 3.442 ± 0.533
AsnVal: 4.991 ± 0.671
AsnTrp: 0.344 ± 0.18
AsnTyr: 2.237 ± 0.49
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.065 ± 0.5
ProCys: 0.258 ± 0.133
ProAsp: 1.893 ± 0.543
ProGlu: 3.27 ± 0.554
ProPhe: 2.065 ± 0.408
ProGly: 1.033 ± 0.291
ProHis: 0.172 ± 0.131
ProIle: 1.721 ± 0.345
ProLys: 2.151 ± 0.379
ProLeu: 1.721 ± 0.417
ProMet: 1.033 ± 0.281
ProAsn: 2.151 ± 0.435
ProPro: 0.774 ± 0.3
ProGln: 1.119 ± 0.522
ProArg: 0.688 ± 0.28
ProSer: 1.549 ± 0.33
ProThr: 1.291 ± 0.33
ProVal: 0.946 ± 0.311
ProTrp: 0.172 ± 0.115
ProTyr: 0.86 ± 0.351
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.528 ± 0.632
GlnCys: 0.172 ± 0.134
GlnAsp: 2.065 ± 0.482
GlnGlu: 2.495 ± 0.498
GlnPhe: 1.205 ± 0.302
GlnGly: 1.893 ± 0.501
GlnHis: 0.258 ± 0.169
GlnIle: 3.786 ± 0.601
GlnLys: 3.614 ± 0.538
GlnLeu: 3.872 ± 0.647
GlnMet: 1.119 ± 0.352
GlnAsn: 2.925 ± 0.639
GlnPro: 1.119 ± 0.269
GlnGln: 2.667 ± 0.604
GlnArg: 1.463 ± 0.446
GlnSer: 2.839 ± 0.509
GlnThr: 2.151 ± 0.487
GlnVal: 1.979 ± 0.384
GlnTrp: 0.258 ± 0.133
GlnTyr: 2.151 ± 0.479
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.065 ± 0.51
ArgCys: 0.344 ± 0.17
ArgAsp: 2.409 ± 0.689
ArgGlu: 2.237 ± 0.473
ArgPhe: 1.893 ± 0.441
ArgGly: 1.807 ± 0.452
ArgHis: 0.516 ± 0.314
ArgIle: 3.27 ± 0.543
ArgLys: 3.7 ± 0.661
ArgLeu: 3.786 ± 0.784
ArgMet: 1.721 ± 0.297
ArgAsn: 2.409 ± 0.474
ArgPro: 1.205 ± 0.347
ArgGln: 2.409 ± 0.543
ArgArg: 1.893 ± 0.457
ArgSer: 1.721 ± 0.435
ArgThr: 1.291 ± 0.3
ArgVal: 2.237 ± 0.488
ArgTrp: 0.344 ± 0.181
ArgTyr: 1.549 ± 0.378
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.732 ± 1.428
SerCys: 0.688 ± 0.252
SerAsp: 4.13 ± 0.824
SerGlu: 3.442 ± 0.636
SerPhe: 2.839 ± 0.562
SerGly: 4.216 ± 0.692
SerHis: 0.344 ± 0.178
SerIle: 5.163 ± 0.648
SerLys: 5.765 ± 0.715
SerLeu: 5.077 ± 0.781
SerMet: 2.237 ± 0.67
SerAsn: 4.818 ± 1.08
SerPro: 1.549 ± 0.423
SerGln: 2.839 ± 0.636
SerArg: 2.323 ± 0.351
SerSer: 4.818 ± 1.603
SerThr: 4.474 ± 0.624
SerVal: 3.012 ± 0.668
SerTrp: 0.774 ± 0.285
SerTyr: 1.549 ± 0.369
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.27 ± 0.506
ThrCys: 0.516 ± 0.316
ThrAsp: 4.044 ± 0.712
ThrGlu: 3.528 ± 0.583
ThrPhe: 2.323 ± 0.409
ThrGly: 4.818 ± 0.485
ThrHis: 0.516 ± 0.221
ThrIle: 3.786 ± 0.47
ThrLys: 5.249 ± 0.741
ThrLeu: 4.56 ± 0.493
ThrMet: 1.291 ± 0.464
ThrAsn: 3.786 ± 0.618
ThrPro: 1.463 ± 0.408
ThrGln: 2.151 ± 0.7
ThrArg: 1.635 ± 0.361
ThrSer: 3.872 ± 0.823
ThrThr: 5.507 ± 1.716
ThrVal: 3.872 ± 0.703
ThrTrp: 1.291 ± 0.382
ThrTyr: 2.065 ± 0.518
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.528 ± 0.664
ValCys: 0.516 ± 0.239
ValAsp: 3.528 ± 0.551
ValGlu: 4.732 ± 0.674
ValPhe: 2.323 ± 0.531
ValGly: 3.958 ± 0.708
ValHis: 0.516 ± 0.21
ValIle: 4.474 ± 0.691
ValLys: 5.249 ± 0.78
ValLeu: 5.507 ± 0.707
ValMet: 1.119 ± 0.288
ValAsn: 3.786 ± 0.753
ValPro: 1.893 ± 0.505
ValGln: 2.495 ± 0.395
ValArg: 2.495 ± 0.634
ValSer: 4.56 ± 0.816
ValThr: 3.786 ± 0.778
ValVal: 3.27 ± 0.53
ValTrp: 0.516 ± 0.231
ValTyr: 2.409 ± 0.586
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.946 ± 0.319
TrpCys: 0.0 ± 0.0
TrpAsp: 0.688 ± 0.236
TrpGlu: 0.86 ± 0.317
TrpPhe: 0.43 ± 0.199
TrpGly: 0.946 ± 0.277
TrpHis: 0.172 ± 0.133
TrpIle: 0.43 ± 0.244
TrpLys: 0.946 ± 0.337
TrpLeu: 1.291 ± 0.419
TrpMet: 0.172 ± 0.12
TrpAsn: 0.946 ± 0.278
TrpPro: 0.172 ± 0.112
TrpGln: 0.516 ± 0.181
TrpArg: 0.688 ± 0.25
TrpSer: 0.258 ± 0.166
TrpThr: 0.688 ± 0.288
TrpVal: 0.602 ± 0.25
TrpTrp: 0.086 ± 0.089
TrpTyr: 0.602 ± 0.284
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.323 ± 0.44
TyrCys: 0.774 ± 0.274
TyrAsp: 1.807 ± 0.446
TyrGlu: 2.753 ± 0.441
TyrPhe: 1.549 ± 0.413
TyrGly: 3.098 ± 0.661
TyrHis: 0.946 ± 0.315
TyrIle: 3.356 ± 0.715
TyrLys: 2.839 ± 0.502
TyrLeu: 3.7 ± 0.677
TyrMet: 1.033 ± 0.313
TyrAsn: 1.893 ± 0.473
TyrPro: 1.463 ± 0.516
TyrGln: 1.635 ± 0.462
TyrArg: 1.721 ± 0.432
TyrSer: 3.012 ± 0.532
TyrThr: 2.151 ± 0.533
TyrVal: 1.721 ± 0.453
TyrTrp: 0.602 ± 0.234
TyrTyr: 1.205 ± 0.331
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (11623 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
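A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates is reported. Taking the sample standard deviation as the ± value, the function names, the fixed seed, and the toy sequences are all assumptions of this sketch; the page states only that bootstrapping (x100) at the protein level was used.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per mille frequency of each adjacent residue pair (see assumptions above)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of one dipeptide's frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(values)


# Hypothetical toy proteome (one-letter codes), not phage data.
toy = ["MKKLA", "AKKG", "GGKK", "MLLA"]
err = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.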

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski