Amino acid dipepetide frequency for Shigella phage SfIV

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.796AlaAla: 7.796 ± 0.883
1.493AlaCys: 1.493 ± 0.399
5.474AlaAsp: 5.474 ± 0.935
4.561AlaGlu: 4.561 ± 0.579
3.483AlaPhe: 3.483 ± 0.719
7.381AlaGly: 7.381 ± 0.849
1.41AlaHis: 1.41 ± 0.363
4.561AlaIle: 4.561 ± 0.595
3.4AlaLys: 3.4 ± 0.455
8.376AlaLeu: 8.376 ± 0.902
2.986AlaMet: 2.986 ± 0.527
2.405AlaAsn: 2.405 ± 0.566
3.317AlaPro: 3.317 ± 0.535
2.405AlaGln: 2.405 ± 0.428
6.22AlaArg: 6.22 ± 0.968
4.727AlaSer: 4.727 ± 0.844
6.386AlaThr: 6.386 ± 0.805
6.883AlaVal: 6.883 ± 0.816
1.659AlaTrp: 1.659 ± 0.344
2.986AlaTyr: 2.986 ± 0.574
0.0AlaXaa: 0.0 ± 0.0
Cys
1.41CysAla: 1.41 ± 0.392
0.166CysCys: 0.166 ± 0.107
0.746CysAsp: 0.746 ± 0.198
0.498CysGlu: 0.498 ± 0.221
0.415CysPhe: 0.415 ± 0.165
1.327CysGly: 1.327 ± 0.374
0.498CysHis: 0.498 ± 0.226
0.912CysIle: 0.912 ± 0.255
0.249CysLys: 0.249 ± 0.138
1.078CysLeu: 1.078 ± 0.348
0.083CysMet: 0.083 ± 0.101
0.249CysAsn: 0.249 ± 0.125
0.581CysPro: 0.581 ± 0.207
0.581CysGln: 0.581 ± 0.221
1.161CysArg: 1.161 ± 0.258
0.829CysSer: 0.829 ± 0.246
0.746CysThr: 0.746 ± 0.22
0.995CysVal: 0.995 ± 0.287
0.415CysTrp: 0.415 ± 0.221
0.332CysTyr: 0.332 ± 0.185
0.0CysXaa: 0.0 ± 0.0
Asp
6.054AspAla: 6.054 ± 0.752
0.581AspCys: 0.581 ± 0.231
4.312AspAsp: 4.312 ± 0.698
4.644AspGlu: 4.644 ± 0.503
2.073AspPhe: 2.073 ± 0.4
4.561AspGly: 4.561 ± 0.664
0.581AspHis: 0.581 ± 0.192
3.483AspIle: 3.483 ± 0.643
2.82AspLys: 2.82 ± 0.564
5.888AspLeu: 5.888 ± 0.791
1.493AspMet: 1.493 ± 0.377
2.322AspAsn: 2.322 ± 0.476
2.986AspPro: 2.986 ± 0.669
1.078AspGln: 1.078 ± 0.286
2.571AspArg: 2.571 ± 0.414
3.649AspSer: 3.649 ± 0.504
2.82AspThr: 2.82 ± 0.487
3.649AspVal: 3.649 ± 0.679
0.663AspTrp: 0.663 ± 0.183
1.907AspTyr: 1.907 ± 0.452
0.0AspXaa: 0.0 ± 0.0
Glu
5.308GluAla: 5.308 ± 0.673
0.829GluCys: 0.829 ± 0.286
2.82GluAsp: 2.82 ± 0.581
3.483GluGlu: 3.483 ± 0.593
1.576GluPhe: 1.576 ± 0.343
2.571GluGly: 2.571 ± 0.468
0.995GluHis: 0.995 ± 0.278
3.4GluIle: 3.4 ± 0.409
3.234GluLys: 3.234 ± 0.516
6.469GluLeu: 6.469 ± 0.925
2.073GluMet: 2.073 ± 0.325
3.151GluAsn: 3.151 ± 0.48
2.737GluPro: 2.737 ± 0.453
2.488GluGln: 2.488 ± 0.483
3.566GluArg: 3.566 ± 0.502
4.81GluSer: 4.81 ± 0.655
3.566GluThr: 3.566 ± 0.67
3.898GluVal: 3.898 ± 0.606
1.41GluTrp: 1.41 ± 0.324
1.493GluTyr: 1.493 ± 0.328
0.0GluXaa: 0.0 ± 0.0
Phe
2.322PheAla: 2.322 ± 0.394
0.249PheCys: 0.249 ± 0.141
2.903PheAsp: 2.903 ± 0.596
1.659PheGlu: 1.659 ± 0.393
1.244PhePhe: 1.244 ± 0.346
2.82PheGly: 2.82 ± 0.577
0.663PheHis: 0.663 ± 0.257
1.907PheIle: 1.907 ± 0.484
2.405PheLys: 2.405 ± 0.477
2.488PheLeu: 2.488 ± 0.521
1.576PheMet: 1.576 ± 0.344
1.99PheAsn: 1.99 ± 0.38
1.659PhePro: 1.659 ± 0.341
0.829PheGln: 0.829 ± 0.269
2.156PheArg: 2.156 ± 0.499
3.069PheSer: 3.069 ± 0.744
2.156PheThr: 2.156 ± 0.469
1.825PheVal: 1.825 ± 0.459
0.746PheTrp: 0.746 ± 0.225
1.742PheTyr: 1.742 ± 0.442
0.0PheXaa: 0.0 ± 0.0
Gly
6.303GlyAla: 6.303 ± 0.757
0.995GlyCys: 0.995 ± 0.297
4.147GlyAsp: 4.147 ± 0.622
4.727GlyGlu: 4.727 ± 0.602
2.82GlyPhe: 2.82 ± 0.521
4.727GlyGly: 4.727 ± 0.874
0.829GlyHis: 0.829 ± 0.307
3.732GlyIle: 3.732 ± 0.535
3.981GlyLys: 3.981 ± 0.502
5.059GlyLeu: 5.059 ± 0.892
1.825GlyMet: 1.825 ± 0.458
2.986GlyAsn: 2.986 ± 0.469
1.244GlyPro: 1.244 ± 0.385
2.571GlyGln: 2.571 ± 0.45
3.566GlyArg: 3.566 ± 0.555
4.312GlySer: 4.312 ± 0.518
4.478GlyThr: 4.478 ± 0.711
5.308GlyVal: 5.308 ± 0.683
2.239GlyTrp: 2.239 ± 0.403
3.234GlyTyr: 3.234 ± 0.543
0.0GlyXaa: 0.0 ± 0.0
His
1.41HisAla: 1.41 ± 0.361
0.332HisCys: 0.332 ± 0.179
1.493HisAsp: 1.493 ± 0.359
0.995HisGlu: 0.995 ± 0.241
0.829HisPhe: 0.829 ± 0.307
1.161HisGly: 1.161 ± 0.297
0.912HisHis: 0.912 ± 0.282
0.746HisIle: 0.746 ± 0.243
0.995HisLys: 0.995 ± 0.321
1.742HisLeu: 1.742 ± 0.405
0.332HisMet: 0.332 ± 0.169
0.829HisAsn: 0.829 ± 0.319
0.912HisPro: 0.912 ± 0.331
0.415HisGln: 0.415 ± 0.188
1.161HisArg: 1.161 ± 0.325
1.161HisSer: 1.161 ± 0.346
1.244HisThr: 1.244 ± 0.281
0.829HisVal: 0.829 ± 0.246
0.581HisTrp: 0.581 ± 0.212
0.663HisTyr: 0.663 ± 0.331
0.0HisXaa: 0.0 ± 0.0
Ile
4.976IleAla: 4.976 ± 0.906
0.581IleCys: 0.581 ± 0.19
3.649IleAsp: 3.649 ± 0.555
3.815IleGlu: 3.815 ± 0.659
1.493IlePhe: 1.493 ± 0.565
4.561IleGly: 4.561 ± 0.633
0.995IleHis: 0.995 ± 0.309
2.82IleIle: 2.82 ± 0.644
3.151IleLys: 3.151 ± 0.635
2.571IleLeu: 2.571 ± 0.465
0.995IleMet: 0.995 ± 0.261
2.903IleAsn: 2.903 ± 0.515
2.654IlePro: 2.654 ± 0.497
1.244IleGln: 1.244 ± 0.318
3.815IleArg: 3.815 ± 0.557
4.976IleSer: 4.976 ± 0.753
5.142IleThr: 5.142 ± 0.561
2.82IleVal: 2.82 ± 0.439
0.746IleTrp: 0.746 ± 0.22
1.078IleTyr: 1.078 ± 0.247
0.0IleXaa: 0.0 ± 0.0
Lys
4.893LysAla: 4.893 ± 0.786
0.663LysCys: 0.663 ± 0.268
2.903LysAsp: 2.903 ± 0.535
4.147LysGlu: 4.147 ± 0.636
1.99LysPhe: 1.99 ± 0.392
2.737LysGly: 2.737 ± 0.481
0.829LysHis: 0.829 ± 0.233
4.064LysIle: 4.064 ± 0.608
4.312LysLys: 4.312 ± 0.866
5.142LysLeu: 5.142 ± 0.797
1.659LysMet: 1.659 ± 0.333
2.405LysAsn: 2.405 ± 0.424
1.907LysPro: 1.907 ± 0.383
2.488LysGln: 2.488 ± 0.39
3.649LysArg: 3.649 ± 0.561
4.395LysSer: 4.395 ± 0.63
2.405LysThr: 2.405 ± 0.465
3.732LysVal: 3.732 ± 0.689
0.663LysTrp: 0.663 ± 0.259
1.327LysTyr: 1.327 ± 0.446
0.0LysXaa: 0.0 ± 0.0
Leu
7.464LeuAla: 7.464 ± 0.664
1.907LeuCys: 1.907 ± 0.502
4.893LeuAsp: 4.893 ± 0.627
5.639LeuGlu: 5.639 ± 0.665
3.4LeuPhe: 3.4 ± 0.494
4.395LeuGly: 4.395 ± 0.804
1.576LeuHis: 1.576 ± 0.334
5.059LeuIle: 5.059 ± 0.636
5.474LeuLys: 5.474 ± 0.743
8.542LeuLeu: 8.542 ± 1.32
2.488LeuMet: 2.488 ± 0.455
4.312LeuAsn: 4.312 ± 0.57
4.147LeuPro: 4.147 ± 0.581
3.649LeuGln: 3.649 ± 0.645
6.635LeuArg: 6.635 ± 0.741
7.049LeuSer: 7.049 ± 0.896
5.059LeuThr: 5.059 ± 0.761
4.893LeuVal: 4.893 ± 0.682
1.161LeuTrp: 1.161 ± 0.438
2.737LeuTyr: 2.737 ± 0.599
0.0LeuXaa: 0.0 ± 0.0
Met
2.737MetAla: 2.737 ± 0.428
0.166MetCys: 0.166 ± 0.117
0.829MetAsp: 0.829 ± 0.244
0.829MetGlu: 0.829 ± 0.295
0.498MetPhe: 0.498 ± 0.266
1.825MetGly: 1.825 ± 0.477
0.415MetHis: 0.415 ± 0.19
1.576MetIle: 1.576 ± 0.396
1.825MetLys: 1.825 ± 0.38
2.571MetLeu: 2.571 ± 0.444
0.415MetMet: 0.415 ± 0.166
1.41MetAsn: 1.41 ± 0.353
1.493MetPro: 1.493 ± 0.335
1.161MetGln: 1.161 ± 0.394
1.907MetArg: 1.907 ± 0.434
2.405MetSer: 2.405 ± 0.37
2.156MetThr: 2.156 ± 0.437
0.912MetVal: 0.912 ± 0.224
0.498MetTrp: 0.498 ± 0.219
0.332MetTyr: 0.332 ± 0.173
0.0MetXaa: 0.0 ± 0.0
Asn
3.483AsnAla: 3.483 ± 0.709
0.332AsnCys: 0.332 ± 0.151
2.488AsnAsp: 2.488 ± 0.36
2.073AsnGlu: 2.073 ± 0.418
1.659AsnPhe: 1.659 ± 0.356
3.898AsnGly: 3.898 ± 0.638
0.498AsnHis: 0.498 ± 0.198
1.99AsnIle: 1.99 ± 0.433
2.239AsnLys: 2.239 ± 0.511
2.488AsnLeu: 2.488 ± 0.528
0.912AsnMet: 0.912 ± 0.26
1.41AsnAsn: 1.41 ± 0.361
2.737AsnPro: 2.737 ± 0.512
1.742AsnGln: 1.742 ± 0.431
2.239AsnArg: 2.239 ± 0.501
2.488AsnSer: 2.488 ± 0.472
2.239AsnThr: 2.239 ± 0.522
2.405AsnVal: 2.405 ± 0.54
0.746AsnTrp: 0.746 ± 0.265
0.746AsnTyr: 0.746 ± 0.225
0.0AsnXaa: 0.0 ± 0.0
Pro
3.981ProAla: 3.981 ± 0.714
0.746ProCys: 0.746 ± 0.24
3.483ProAsp: 3.483 ± 0.58
3.649ProGlu: 3.649 ± 0.738
2.156ProPhe: 2.156 ± 0.531
3.483ProGly: 3.483 ± 0.556
0.995ProHis: 0.995 ± 0.293
1.659ProIle: 1.659 ± 0.371
1.99ProLys: 1.99 ± 0.434
3.234ProLeu: 3.234 ± 0.645
1.078ProMet: 1.078 ± 0.284
1.659ProAsn: 1.659 ± 0.351
1.161ProPro: 1.161 ± 0.306
1.41ProGln: 1.41 ± 0.394
1.907ProArg: 1.907 ± 0.356
3.069ProSer: 3.069 ± 0.852
2.571ProThr: 2.571 ± 0.537
4.644ProVal: 4.644 ± 0.506
0.166ProTrp: 0.166 ± 0.104
1.244ProTyr: 1.244 ± 0.325
0.0ProXaa: 0.0 ± 0.0
Gln
3.234GlnAla: 3.234 ± 0.466
0.498GlnCys: 0.498 ± 0.204
1.244GlnAsp: 1.244 ± 0.299
1.99GlnGlu: 1.99 ± 0.439
1.327GlnPhe: 1.327 ± 0.431
1.742GlnGly: 1.742 ± 0.409
1.078GlnHis: 1.078 ± 0.288
2.156GlnIle: 2.156 ± 0.521
2.571GlnLys: 2.571 ± 0.458
3.566GlnLeu: 3.566 ± 0.522
1.161GlnMet: 1.161 ± 0.293
0.912GlnAsn: 0.912 ± 0.24
1.907GlnPro: 1.907 ± 0.349
1.907GlnGln: 1.907 ± 0.366
3.898GlnArg: 3.898 ± 0.559
1.825GlnSer: 1.825 ± 0.459
2.156GlnThr: 2.156 ± 0.47
1.825GlnVal: 1.825 ± 0.351
0.746GlnTrp: 0.746 ± 0.231
1.41GlnTyr: 1.41 ± 0.408
0.0GlnXaa: 0.0 ± 0.0
Arg
5.556ArgAla: 5.556 ± 0.8
0.663ArgCys: 0.663 ± 0.222
3.4ArgAsp: 3.4 ± 0.512
3.317ArgGlu: 3.317 ± 0.615
2.737ArgPhe: 2.737 ± 0.47
3.732ArgGly: 3.732 ± 0.543
1.907ArgHis: 1.907 ± 0.351
3.151ArgIle: 3.151 ± 0.456
4.23ArgLys: 4.23 ± 0.481
6.966ArgLeu: 6.966 ± 0.916
1.244ArgMet: 1.244 ± 0.307
1.825ArgAsn: 1.825 ± 0.38
2.986ArgPro: 2.986 ± 0.42
3.566ArgGln: 3.566 ± 0.608
5.225ArgArg: 5.225 ± 0.984
3.649ArgSer: 3.649 ± 0.677
3.4ArgThr: 3.4 ± 0.569
3.898ArgVal: 3.898 ± 0.578
1.327ArgTrp: 1.327 ± 0.414
2.073ArgTyr: 2.073 ± 0.389
0.0ArgXaa: 0.0 ± 0.0
Ser
5.474SerAla: 5.474 ± 0.763
0.746SerCys: 0.746 ± 0.221
4.478SerAsp: 4.478 ± 0.779
3.151SerGlu: 3.151 ± 0.484
2.82SerPhe: 2.82 ± 0.56
5.556SerGly: 5.556 ± 0.565
1.659SerHis: 1.659 ± 0.422
3.649SerIle: 3.649 ± 0.665
4.395SerLys: 4.395 ± 0.708
7.215SerLeu: 7.215 ± 0.756
1.576SerMet: 1.576 ± 0.361
2.073SerAsn: 2.073 ± 0.42
2.405SerPro: 2.405 ± 0.535
2.986SerGln: 2.986 ± 0.537
4.147SerArg: 4.147 ± 0.771
4.727SerSer: 4.727 ± 0.687
3.815SerThr: 3.815 ± 0.536
4.644SerVal: 4.644 ± 0.554
0.995SerTrp: 0.995 ± 0.255
1.576SerTyr: 1.576 ± 0.36
0.0SerXaa: 0.0 ± 0.0
Thr
6.718ThrAla: 6.718 ± 1.041
0.663ThrCys: 0.663 ± 0.215
2.986ThrAsp: 2.986 ± 0.47
4.395ThrGlu: 4.395 ± 0.487
2.239ThrPhe: 2.239 ± 0.434
4.893ThrGly: 4.893 ± 0.715
1.493ThrHis: 1.493 ± 0.343
2.488ThrIle: 2.488 ± 0.44
3.151ThrLys: 3.151 ± 0.559
6.303ThrLeu: 6.303 ± 0.932
1.244ThrMet: 1.244 ± 0.284
2.156ThrAsn: 2.156 ± 0.422
3.151ThrPro: 3.151 ± 0.464
1.907ThrGln: 1.907 ± 0.336
3.649ThrArg: 3.649 ± 0.479
3.4ThrSer: 3.4 ± 0.567
4.064ThrThr: 4.064 ± 0.584
4.147ThrVal: 4.147 ± 0.619
0.995ThrTrp: 0.995 ± 0.306
1.825ThrTyr: 1.825 ± 0.335
0.0ThrXaa: 0.0 ± 0.0
Val
5.142ValAla: 5.142 ± 0.54
0.829ValCys: 0.829 ± 0.296
3.566ValAsp: 3.566 ± 0.464
3.649ValGlu: 3.649 ± 0.541
2.073ValPhe: 2.073 ± 0.417
4.312ValGly: 4.312 ± 0.671
0.746ValHis: 0.746 ± 0.294
5.059ValIle: 5.059 ± 0.812
3.234ValLys: 3.234 ± 0.621
6.054ValLeu: 6.054 ± 0.682
1.493ValMet: 1.493 ± 0.307
2.986ValAsn: 2.986 ± 0.625
4.064ValPro: 4.064 ± 0.626
2.488ValGln: 2.488 ± 0.482
3.815ValArg: 3.815 ± 0.605
4.23ValSer: 4.23 ± 0.582
4.644ValThr: 4.644 ± 0.856
4.81ValVal: 4.81 ± 0.7
0.663ValTrp: 0.663 ± 0.263
2.239ValTyr: 2.239 ± 0.384
0.0ValXaa: 0.0 ± 0.0
Trp
1.161TrpAla: 1.161 ± 0.28
0.332TrpCys: 0.332 ± 0.178
0.746TrpAsp: 0.746 ± 0.26
0.995TrpGlu: 0.995 ± 0.278
0.332TrpPhe: 0.332 ± 0.163
0.663TrpGly: 0.663 ± 0.229
0.332TrpHis: 0.332 ± 0.192
0.663TrpIle: 0.663 ± 0.22
1.078TrpLys: 1.078 ± 0.307
2.322TrpLeu: 2.322 ± 0.434
0.663TrpMet: 0.663 ± 0.25
0.249TrpAsn: 0.249 ± 0.136
1.078TrpPro: 1.078 ± 0.266
1.078TrpGln: 1.078 ± 0.279
1.576TrpArg: 1.576 ± 0.306
0.912TrpSer: 0.912 ± 0.257
0.829TrpThr: 0.829 ± 0.275
1.327TrpVal: 1.327 ± 0.394
0.332TrpTrp: 0.332 ± 0.157
0.498TrpTyr: 0.498 ± 0.198
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.488TyrAla: 2.488 ± 0.497
0.498TyrCys: 0.498 ± 0.201
1.576TyrAsp: 1.576 ± 0.333
1.659TyrGlu: 1.659 ± 0.37
1.244TyrPhe: 1.244 ± 0.3
2.737TyrGly: 2.737 ± 0.44
0.332TyrHis: 0.332 ± 0.151
1.907TyrIle: 1.907 ± 0.511
1.659TyrLys: 1.659 ± 0.375
2.405TyrLeu: 2.405 ± 0.363
0.581TyrMet: 0.581 ± 0.216
0.581TyrAsn: 0.581 ± 0.24
1.161TyrPro: 1.161 ± 0.359
1.161TyrGln: 1.161 ± 0.279
1.99TyrArg: 1.99 ± 0.44
2.405TyrSer: 2.405 ± 0.351
1.99TyrThr: 1.99 ± 0.487
2.654TyrVal: 2.654 ± 0.554
0.415TyrTrp: 0.415 ± 0.187
1.078TyrTyr: 1.078 ± 0.3
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (12059 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski