Amino acid dipepetide frequency for Shigella phage SfV (Shigella flexneri bacteriophage V) (Bacteriophage SfV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.955AlaAla: 8.955 ± 1.069
1.55AlaCys: 1.55 ± 0.443
5.511AlaAsp: 5.511 ± 0.903
4.736AlaGlu: 4.736 ± 0.537
2.842AlaPhe: 2.842 ± 0.566
8.353AlaGly: 8.353 ± 0.938
1.808AlaHis: 1.808 ± 0.419
4.908AlaIle: 4.908 ± 0.567
3.186AlaLys: 3.186 ± 0.457
8.611AlaLeu: 8.611 ± 0.722
3.358AlaMet: 3.358 ± 0.546
3.1AlaAsn: 3.1 ± 0.566
4.564AlaPro: 4.564 ± 0.588
2.928AlaGln: 2.928 ± 0.629
6.458AlaArg: 6.458 ± 0.944
5.769AlaSer: 5.769 ± 0.773
5.769AlaThr: 5.769 ± 0.645
6.028AlaVal: 6.028 ± 0.903
1.55AlaTrp: 1.55 ± 0.358
3.014AlaTyr: 3.014 ± 0.554
0.0AlaXaa: 0.0 ± 0.0
Cys
1.206CysAla: 1.206 ± 0.32
0.258CysCys: 0.258 ± 0.161
0.603CysAsp: 0.603 ± 0.205
0.861CysGlu: 0.861 ± 0.275
0.344CysPhe: 0.344 ± 0.141
1.894CysGly: 1.894 ± 0.354
0.344CysHis: 0.344 ± 0.185
0.947CysIle: 0.947 ± 0.255
0.344CysLys: 0.344 ± 0.168
0.947CysLeu: 0.947 ± 0.352
0.172CysMet: 0.172 ± 0.108
0.258CysAsn: 0.258 ± 0.126
0.517CysPro: 0.517 ± 0.213
1.033CysGln: 1.033 ± 0.301
1.119CysArg: 1.119 ± 0.351
0.689CysSer: 0.689 ± 0.207
0.431CysThr: 0.431 ± 0.19
0.603CysVal: 0.603 ± 0.248
0.517CysTrp: 0.517 ± 0.243
0.344CysTyr: 0.344 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
5.167AspAla: 5.167 ± 0.748
0.689AspCys: 0.689 ± 0.211
5.081AspAsp: 5.081 ± 0.828
4.478AspGlu: 4.478 ± 0.536
2.497AspPhe: 2.497 ± 0.39
4.392AspGly: 4.392 ± 0.687
0.775AspHis: 0.775 ± 0.222
3.014AspIle: 3.014 ± 0.549
2.325AspLys: 2.325 ± 0.416
5.511AspLeu: 5.511 ± 0.845
1.808AspMet: 1.808 ± 0.41
2.583AspAsn: 2.583 ± 0.503
3.014AspPro: 3.014 ± 0.527
1.808AspGln: 1.808 ± 0.436
2.067AspArg: 2.067 ± 0.388
2.583AspSer: 2.583 ± 0.446
2.497AspThr: 2.497 ± 0.408
3.272AspVal: 3.272 ± 0.568
0.431AspTrp: 0.431 ± 0.177
2.153AspTyr: 2.153 ± 0.407
0.0AspXaa: 0.0 ± 0.0
Glu
5.597GluAla: 5.597 ± 0.669
0.947GluCys: 0.947 ± 0.294
2.928GluAsp: 2.928 ± 0.465
3.789GluGlu: 3.789 ± 0.743
1.378GluPhe: 1.378 ± 0.311
2.669GluGly: 2.669 ± 0.491
1.033GluHis: 1.033 ± 0.267
2.756GluIle: 2.756 ± 0.437
3.617GluLys: 3.617 ± 0.537
7.578GluLeu: 7.578 ± 0.999
2.067GluMet: 2.067 ± 0.424
2.669GluAsn: 2.669 ± 0.522
3.272GluPro: 3.272 ± 0.587
2.153GluGln: 2.153 ± 0.477
4.219GluArg: 4.219 ± 0.582
3.961GluSer: 3.961 ± 0.66
3.186GluThr: 3.186 ± 0.44
4.133GluVal: 4.133 ± 0.645
1.808GluTrp: 1.808 ± 0.353
1.206GluTyr: 1.206 ± 0.266
0.0GluXaa: 0.0 ± 0.0
Phe
3.014PheAla: 3.014 ± 0.465
0.344PheCys: 0.344 ± 0.202
2.583PheAsp: 2.583 ± 0.405
1.894PheGlu: 1.894 ± 0.379
1.033PhePhe: 1.033 ± 0.325
2.497PheGly: 2.497 ± 0.511
0.775PheHis: 0.775 ± 0.231
2.153PheIle: 2.153 ± 0.589
2.153PheLys: 2.153 ± 0.453
2.497PheLeu: 2.497 ± 0.586
1.464PheMet: 1.464 ± 0.35
2.239PheAsn: 2.239 ± 0.452
1.292PhePro: 1.292 ± 0.319
1.033PheGln: 1.033 ± 0.318
2.497PheArg: 2.497 ± 0.442
2.411PheSer: 2.411 ± 0.454
2.411PheThr: 2.411 ± 0.402
2.067PheVal: 2.067 ± 0.503
0.689PheTrp: 0.689 ± 0.24
1.378PheTyr: 1.378 ± 0.37
0.0PheXaa: 0.0 ± 0.0
Gly
6.975GlyAla: 6.975 ± 0.859
0.775GlyCys: 0.775 ± 0.219
4.908GlyAsp: 4.908 ± 0.57
5.167GlyGlu: 5.167 ± 0.615
2.411GlyPhe: 2.411 ± 0.381
5.425GlyGly: 5.425 ± 0.772
1.119GlyHis: 1.119 ± 0.318
3.789GlyIle: 3.789 ± 0.692
3.961GlyLys: 3.961 ± 0.546
4.822GlyLeu: 4.822 ± 1.087
1.636GlyMet: 1.636 ± 0.378
3.272GlyAsn: 3.272 ± 0.517
1.55GlyPro: 1.55 ± 0.367
2.669GlyGln: 2.669 ± 0.49
4.133GlyArg: 4.133 ± 0.536
3.358GlySer: 3.358 ± 0.452
4.65GlyThr: 4.65 ± 0.709
5.597GlyVal: 5.597 ± 0.683
2.411GlyTrp: 2.411 ± 0.481
3.014GlyTyr: 3.014 ± 0.45
0.0GlyXaa: 0.0 ± 0.0
His
1.808HisAla: 1.808 ± 0.422
0.517HisCys: 0.517 ± 0.225
1.636HisAsp: 1.636 ± 0.368
1.033HisGlu: 1.033 ± 0.261
0.861HisPhe: 0.861 ± 0.252
1.636HisGly: 1.636 ± 0.366
0.775HisHis: 0.775 ± 0.255
1.464HisIle: 1.464 ± 0.385
0.947HisLys: 0.947 ± 0.308
1.378HisLeu: 1.378 ± 0.342
0.344HisMet: 0.344 ± 0.152
0.344HisAsn: 0.344 ± 0.148
0.775HisPro: 0.775 ± 0.302
0.947HisGln: 0.947 ± 0.313
1.378HisArg: 1.378 ± 0.364
0.689HisSer: 0.689 ± 0.25
0.861HisThr: 0.861 ± 0.246
0.775HisVal: 0.775 ± 0.217
0.775HisTrp: 0.775 ± 0.228
0.431HisTyr: 0.431 ± 0.175
0.0HisXaa: 0.0 ± 0.0
Ile
4.392IleAla: 4.392 ± 0.628
0.861IleCys: 0.861 ± 0.351
3.531IleAsp: 3.531 ± 0.518
3.186IleGlu: 3.186 ± 0.544
1.464IlePhe: 1.464 ± 0.406
4.736IleGly: 4.736 ± 0.74
0.603IleHis: 0.603 ± 0.226
3.1IleIle: 3.1 ± 0.693
2.669IleLys: 2.669 ± 0.441
3.1IleLeu: 3.1 ± 0.6
1.378IleMet: 1.378 ± 0.298
3.186IleAsn: 3.186 ± 0.5
2.842IlePro: 2.842 ± 0.429
1.722IleGln: 1.722 ± 0.306
3.789IleArg: 3.789 ± 0.502
4.478IleSer: 4.478 ± 0.714
5.253IleThr: 5.253 ± 0.573
2.669IleVal: 2.669 ± 0.471
0.603IleTrp: 0.603 ± 0.187
0.775IleTyr: 0.775 ± 0.211
0.0IleXaa: 0.0 ± 0.0
Lys
5.253LysAla: 5.253 ± 0.756
0.775LysCys: 0.775 ± 0.259
1.981LysAsp: 1.981 ± 0.414
3.1LysGlu: 3.1 ± 0.429
1.894LysPhe: 1.894 ± 0.443
3.014LysGly: 3.014 ± 0.555
1.206LysHis: 1.206 ± 0.274
2.842LysIle: 2.842 ± 0.499
2.842LysLys: 2.842 ± 0.616
4.908LysLeu: 4.908 ± 0.683
1.722LysMet: 1.722 ± 0.452
2.497LysAsn: 2.497 ± 0.411
2.583LysPro: 2.583 ± 0.509
2.153LysGln: 2.153 ± 0.363
3.272LysArg: 3.272 ± 0.469
3.358LysSer: 3.358 ± 0.461
2.583LysThr: 2.583 ± 0.454
4.047LysVal: 4.047 ± 0.632
0.861LysTrp: 0.861 ± 0.282
1.636LysTyr: 1.636 ± 0.402
0.0LysXaa: 0.0 ± 0.0
Leu
8.353LeuAla: 8.353 ± 0.822
1.981LeuCys: 1.981 ± 0.481
4.392LeuAsp: 4.392 ± 0.515
4.822LeuGlu: 4.822 ± 0.643
3.703LeuPhe: 3.703 ± 0.499
4.994LeuGly: 4.994 ± 0.747
1.55LeuHis: 1.55 ± 0.281
5.253LeuIle: 5.253 ± 0.653
4.994LeuLys: 4.994 ± 0.866
7.147LeuLeu: 7.147 ± 0.915
1.808LeuMet: 1.808 ± 0.408
4.564LeuAsn: 4.564 ± 0.539
4.478LeuPro: 4.478 ± 0.599
3.272LeuGln: 3.272 ± 0.485
6.458LeuArg: 6.458 ± 0.636
5.856LeuSer: 5.856 ± 0.657
4.908LeuThr: 4.908 ± 0.695
5.942LeuVal: 5.942 ± 0.675
1.119LeuTrp: 1.119 ± 0.457
1.636LeuTyr: 1.636 ± 0.408
0.0LeuXaa: 0.0 ± 0.0
Met
2.325MetAla: 2.325 ± 0.357
0.172MetCys: 0.172 ± 0.113
0.517MetAsp: 0.517 ± 0.205
1.378MetGlu: 1.378 ± 0.398
0.517MetPhe: 0.517 ± 0.199
1.722MetGly: 1.722 ± 0.344
0.431MetHis: 0.431 ± 0.161
1.722MetIle: 1.722 ± 0.421
1.981MetLys: 1.981 ± 0.336
2.842MetLeu: 2.842 ± 0.528
0.689MetMet: 0.689 ± 0.361
1.55MetAsn: 1.55 ± 0.282
1.292MetPro: 1.292 ± 0.391
1.119MetGln: 1.119 ± 0.32
2.153MetArg: 2.153 ± 0.449
2.067MetSer: 2.067 ± 0.345
1.894MetThr: 1.894 ± 0.33
1.722MetVal: 1.722 ± 0.399
0.431MetTrp: 0.431 ± 0.147
0.344MetTyr: 0.344 ± 0.179
0.0MetXaa: 0.0 ± 0.0
Asn
3.617AsnAla: 3.617 ± 0.569
0.258AsnCys: 0.258 ± 0.126
2.497AsnAsp: 2.497 ± 0.367
2.583AsnGlu: 2.583 ± 0.55
1.292AsnPhe: 1.292 ± 0.348
3.961AsnGly: 3.961 ± 0.619
1.033AsnHis: 1.033 ± 0.321
2.497AsnIle: 2.497 ± 0.426
2.842AsnLys: 2.842 ± 0.52
2.067AsnLeu: 2.067 ± 0.389
1.033AsnMet: 1.033 ± 0.345
1.894AsnAsn: 1.894 ± 0.353
2.842AsnPro: 2.842 ± 0.527
2.153AsnGln: 2.153 ± 0.507
2.411AsnArg: 2.411 ± 0.535
2.583AsnSer: 2.583 ± 0.557
2.669AsnThr: 2.669 ± 0.552
2.153AsnVal: 2.153 ± 0.477
0.603AsnTrp: 0.603 ± 0.232
0.775AsnTyr: 0.775 ± 0.246
0.0AsnXaa: 0.0 ± 0.0
Pro
5.942ProAla: 5.942 ± 0.707
0.689ProCys: 0.689 ± 0.264
3.186ProAsp: 3.186 ± 0.429
4.392ProGlu: 4.392 ± 0.63
1.722ProPhe: 1.722 ± 0.458
3.186ProGly: 3.186 ± 0.556
0.947ProHis: 0.947 ± 0.291
1.722ProIle: 1.722 ± 0.394
2.583ProLys: 2.583 ± 0.472
3.444ProLeu: 3.444 ± 0.628
1.206ProMet: 1.206 ± 0.437
1.894ProAsn: 1.894 ± 0.396
1.636ProPro: 1.636 ± 0.352
1.292ProGln: 1.292 ± 0.329
1.722ProArg: 1.722 ± 0.33
2.153ProSer: 2.153 ± 0.432
2.325ProThr: 2.325 ± 0.449
4.478ProVal: 4.478 ± 0.51
0.344ProTrp: 0.344 ± 0.201
1.894ProTyr: 1.894 ± 0.424
0.0ProXaa: 0.0 ± 0.0
Gln
3.789GlnAla: 3.789 ± 0.515
0.431GlnCys: 0.431 ± 0.183
1.894GlnAsp: 1.894 ± 0.505
2.497GlnGlu: 2.497 ± 0.399
1.636GlnPhe: 1.636 ± 0.388
2.067GlnGly: 2.067 ± 0.407
1.119GlnHis: 1.119 ± 0.289
2.067GlnIle: 2.067 ± 0.479
2.497GlnLys: 2.497 ± 0.482
3.444GlnLeu: 3.444 ± 0.533
1.119GlnMet: 1.119 ± 0.323
1.206GlnAsn: 1.206 ± 0.257
1.808GlnPro: 1.808 ± 0.354
2.325GlnGln: 2.325 ± 0.394
3.358GlnArg: 3.358 ± 0.489
2.153GlnSer: 2.153 ± 0.441
2.239GlnThr: 2.239 ± 0.457
1.464GlnVal: 1.464 ± 0.276
1.206GlnTrp: 1.206 ± 0.258
0.775GlnTyr: 0.775 ± 0.241
0.0GlnXaa: 0.0 ± 0.0
Arg
5.769ArgAla: 5.769 ± 0.717
0.947ArgCys: 0.947 ± 0.28
2.842ArgAsp: 2.842 ± 0.445
4.047ArgGlu: 4.047 ± 0.612
2.928ArgPhe: 2.928 ± 0.46
3.531ArgGly: 3.531 ± 0.566
2.153ArgHis: 2.153 ± 0.398
2.583ArgIle: 2.583 ± 0.427
3.531ArgLys: 3.531 ± 0.549
6.372ArgLeu: 6.372 ± 0.65
1.119ArgMet: 1.119 ± 0.268
2.411ArgAsn: 2.411 ± 0.454
2.497ArgPro: 2.497 ± 0.34
3.272ArgGln: 3.272 ± 0.665
5.511ArgArg: 5.511 ± 1.117
3.272ArgSer: 3.272 ± 0.603
2.928ArgThr: 2.928 ± 0.507
4.392ArgVal: 4.392 ± 0.503
1.033ArgTrp: 1.033 ± 0.354
3.014ArgTyr: 3.014 ± 0.466
0.0ArgXaa: 0.0 ± 0.0
Ser
6.028SerAla: 6.028 ± 0.826
0.517SerCys: 0.517 ± 0.186
2.756SerAsp: 2.756 ± 0.619
3.531SerGlu: 3.531 ± 0.606
2.928SerPhe: 2.928 ± 0.483
5.253SerGly: 5.253 ± 0.653
1.464SerHis: 1.464 ± 0.282
3.1SerIle: 3.1 ± 0.501
3.444SerLys: 3.444 ± 0.513
6.372SerLeu: 6.372 ± 0.681
1.292SerMet: 1.292 ± 0.31
1.894SerAsn: 1.894 ± 0.368
2.325SerPro: 2.325 ± 0.521
2.239SerGln: 2.239 ± 0.373
3.1SerArg: 3.1 ± 0.524
3.272SerSer: 3.272 ± 0.441
2.842SerThr: 2.842 ± 0.538
5.683SerVal: 5.683 ± 0.605
1.119SerTrp: 1.119 ± 0.289
1.894SerTyr: 1.894 ± 0.378
0.0SerXaa: 0.0 ± 0.0
Thr
6.458ThrAla: 6.458 ± 0.9
0.517ThrCys: 0.517 ± 0.223
3.358ThrAsp: 3.358 ± 0.516
3.531ThrGlu: 3.531 ± 0.505
2.239ThrPhe: 2.239 ± 0.406
4.994ThrGly: 4.994 ± 0.75
1.206ThrHis: 1.206 ± 0.333
2.669ThrIle: 2.669 ± 0.394
2.411ThrLys: 2.411 ± 0.578
5.683ThrLeu: 5.683 ± 0.858
1.55ThrMet: 1.55 ± 0.386
2.153ThrAsn: 2.153 ± 0.472
3.014ThrPro: 3.014 ± 0.402
1.464ThrGln: 1.464 ± 0.316
3.358ThrArg: 3.358 ± 0.506
3.444ThrSer: 3.444 ± 0.653
3.875ThrThr: 3.875 ± 0.699
3.961ThrVal: 3.961 ± 0.603
1.378ThrTrp: 1.378 ± 0.397
1.464ThrTyr: 1.464 ± 0.332
0.0ThrXaa: 0.0 ± 0.0
Val
4.736ValAla: 4.736 ± 0.638
0.603ValCys: 0.603 ± 0.274
3.531ValAsp: 3.531 ± 0.547
3.789ValGlu: 3.789 ± 0.663
2.756ValPhe: 2.756 ± 0.593
4.133ValGly: 4.133 ± 0.679
0.603ValHis: 0.603 ± 0.187
5.425ValIle: 5.425 ± 0.653
3.789ValLys: 3.789 ± 0.602
5.942ValLeu: 5.942 ± 0.522
1.722ValMet: 1.722 ± 0.348
3.1ValAsn: 3.1 ± 0.518
3.703ValPro: 3.703 ± 0.551
2.669ValGln: 2.669 ± 0.472
3.875ValArg: 3.875 ± 0.611
5.511ValSer: 5.511 ± 0.633
4.392ValThr: 4.392 ± 0.752
4.736ValVal: 4.736 ± 0.729
0.947ValTrp: 0.947 ± 0.259
1.981ValTyr: 1.981 ± 0.445
0.0ValXaa: 0.0 ± 0.0
Trp
1.378TrpAla: 1.378 ± 0.289
0.172TrpCys: 0.172 ± 0.124
0.689TrpAsp: 0.689 ± 0.289
1.206TrpGlu: 1.206 ± 0.339
0.603TrpPhe: 0.603 ± 0.208
0.775TrpGly: 0.775 ± 0.279
0.172TrpHis: 0.172 ± 0.169
0.517TrpIle: 0.517 ± 0.209
1.206TrpLys: 1.206 ± 0.31
2.239TrpLeu: 2.239 ± 0.469
0.603TrpMet: 0.603 ± 0.215
0.517TrpAsn: 0.517 ± 0.182
1.292TrpPro: 1.292 ± 0.277
1.464TrpGln: 1.464 ± 0.299
1.378TrpArg: 1.378 ± 0.333
0.861TrpSer: 0.861 ± 0.239
1.033TrpThr: 1.033 ± 0.265
1.378TrpVal: 1.378 ± 0.478
0.517TrpTrp: 0.517 ± 0.211
0.947TrpTyr: 0.947 ± 0.348
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.239TyrAla: 2.239 ± 0.433
0.431TyrCys: 0.431 ± 0.188
1.722TyrAsp: 1.722 ± 0.354
0.947TyrGlu: 0.947 ± 0.308
1.378TyrPhe: 1.378 ± 0.434
2.411TyrGly: 2.411 ± 0.444
0.258TyrHis: 0.258 ± 0.136
1.808TyrIle: 1.808 ± 0.447
1.206TyrLys: 1.206 ± 0.326
2.325TyrLeu: 2.325 ± 0.357
0.689TyrMet: 0.689 ± 0.208
0.431TyrAsn: 0.431 ± 0.177
1.464TyrPro: 1.464 ± 0.316
1.378TyrGln: 1.378 ± 0.36
1.894TyrArg: 1.894 ± 0.399
2.669TyrSer: 2.669 ± 0.478
1.981TyrThr: 1.981 ± 0.417
2.842TyrVal: 2.842 ± 0.516
0.603TyrTrp: 0.603 ± 0.239
0.947TyrTyr: 0.947 ± 0.234
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11614 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski