Amino acid dipepetide frequency for Streptococcus pyogenes phage T12 (Bacteriophage T12)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.646AlaAla: 2.646 ± 0.954
0.427AlaCys: 0.427 ± 0.186
4.183AlaAsp: 4.183 ± 0.534
7.086AlaGlu: 7.086 ± 0.956
2.817AlaPhe: 2.817 ± 0.367
3.5AlaGly: 3.5 ± 0.672
0.598AlaHis: 0.598 ± 0.221
5.549AlaIle: 5.549 ± 0.793
6.403AlaLys: 6.403 ± 0.744
4.439AlaLeu: 4.439 ± 0.704
2.049AlaMet: 2.049 ± 0.4
4.354AlaAsn: 4.354 ± 0.703
1.366AlaPro: 1.366 ± 0.284
2.049AlaGln: 2.049 ± 0.366
2.049AlaArg: 2.049 ± 0.432
4.354AlaSer: 4.354 ± 0.9
3.585AlaThr: 3.585 ± 0.742
3.329AlaVal: 3.329 ± 0.739
1.024AlaTrp: 1.024 ± 0.245
2.732AlaTyr: 2.732 ± 0.466
0.0AlaXaa: 0.0 ± 0.0
Cys
0.085CysAla: 0.085 ± 0.074
0.085CysCys: 0.085 ± 0.078
0.256CysAsp: 0.256 ± 0.155
0.598CysGlu: 0.598 ± 0.219
0.171CysPhe: 0.171 ± 0.129
0.085CysGly: 0.085 ± 0.078
0.171CysHis: 0.171 ± 0.136
0.427CysIle: 0.427 ± 0.215
0.512CysLys: 0.512 ± 0.24
0.171CysLeu: 0.171 ± 0.142
0.085CysMet: 0.085 ± 0.078
0.256CysAsn: 0.256 ± 0.138
0.171CysPro: 0.171 ± 0.114
0.171CysGln: 0.171 ± 0.129
0.171CysArg: 0.171 ± 0.135
0.598CysSer: 0.598 ± 0.233
0.085CysThr: 0.085 ± 0.098
0.256CysVal: 0.256 ± 0.177
0.171CysTrp: 0.171 ± 0.136
0.512CysTyr: 0.512 ± 0.347
0.0CysXaa: 0.0 ± 0.0
Asp
4.183AspAla: 4.183 ± 0.545
0.683AspCys: 0.683 ± 0.275
4.098AspAsp: 4.098 ± 0.606
4.525AspGlu: 4.525 ± 0.666
2.646AspPhe: 2.646 ± 0.619
4.61AspGly: 4.61 ± 0.651
0.512AspHis: 0.512 ± 0.199
4.183AspIle: 4.183 ± 0.515
5.634AspLys: 5.634 ± 0.658
5.72AspLeu: 5.72 ± 0.682
1.707AspMet: 1.707 ± 0.449
4.61AspAsn: 4.61 ± 0.686
0.939AspPro: 0.939 ± 0.317
1.451AspGln: 1.451 ± 0.32
2.817AspArg: 2.817 ± 0.591
3.756AspSer: 3.756 ± 0.541
3.415AspThr: 3.415 ± 0.635
4.439AspVal: 4.439 ± 0.457
0.768AspTrp: 0.768 ± 0.298
3.5AspTyr: 3.5 ± 0.542
0.0AspXaa: 0.0 ± 0.0
Glu
4.183GluAla: 4.183 ± 0.956
0.171GluCys: 0.171 ± 0.127
3.073GluAsp: 3.073 ± 0.795
5.293GluGlu: 5.293 ± 0.827
2.732GluPhe: 2.732 ± 0.496
2.561GluGly: 2.561 ± 0.473
0.854GluHis: 0.854 ± 0.268
4.695GluIle: 4.695 ± 0.501
5.634GluLys: 5.634 ± 0.769
8.281GluLeu: 8.281 ± 0.962
1.878GluMet: 1.878 ± 0.37
3.756GluAsn: 3.756 ± 0.66
1.793GluPro: 1.793 ± 0.329
3.756GluGln: 3.756 ± 0.721
3.927GluArg: 3.927 ± 1.049
3.585GluSer: 3.585 ± 0.572
4.439GluThr: 4.439 ± 0.671
4.61GluVal: 4.61 ± 0.69
0.854GluTrp: 0.854 ± 0.289
2.903GluTyr: 2.903 ± 0.516
0.0GluXaa: 0.0 ± 0.0
Phe
2.646PheAla: 2.646 ± 0.621
0.085PheCys: 0.085 ± 0.083
3.415PheAsp: 3.415 ± 0.514
2.134PheGlu: 2.134 ± 0.471
1.024PhePhe: 1.024 ± 0.333
2.732PheGly: 2.732 ± 0.362
0.0PheHis: 0.0 ± 0.0
3.073PheIle: 3.073 ± 0.541
2.988PheLys: 2.988 ± 0.48
2.646PheLeu: 2.646 ± 0.563
0.939PheMet: 0.939 ± 0.321
2.049PheAsn: 2.049 ± 0.509
1.024PhePro: 1.024 ± 0.298
1.11PheGln: 1.11 ± 0.357
1.793PheArg: 1.793 ± 0.333
2.39PheSer: 2.39 ± 0.517
2.476PheThr: 2.476 ± 0.443
2.561PheVal: 2.561 ± 0.492
0.939PheTrp: 0.939 ± 0.261
1.963PheTyr: 1.963 ± 0.325
0.0PheXaa: 0.0 ± 0.0
Gly
3.5GlyAla: 3.5 ± 0.753
0.341GlyCys: 0.341 ± 0.24
4.354GlyAsp: 4.354 ± 0.713
2.988GlyGlu: 2.988 ± 0.484
1.622GlyPhe: 1.622 ± 0.387
4.354GlyGly: 4.354 ± 0.822
0.683GlyHis: 0.683 ± 0.268
5.122GlyIle: 5.122 ± 0.782
4.951GlyLys: 4.951 ± 0.542
4.525GlyLeu: 4.525 ± 0.519
2.305GlyMet: 2.305 ± 0.465
2.988GlyAsn: 2.988 ± 0.421
1.195GlyPro: 1.195 ± 0.422
2.134GlyGln: 2.134 ± 0.632
2.561GlyArg: 2.561 ± 0.524
4.183GlySer: 4.183 ± 0.803
4.61GlyThr: 4.61 ± 1.009
3.756GlyVal: 3.756 ± 0.745
1.451GlyTrp: 1.451 ± 0.266
3.5GlyTyr: 3.5 ± 0.556
0.0GlyXaa: 0.0 ± 0.0
His
1.11HisAla: 1.11 ± 0.416
0.171HisCys: 0.171 ± 0.121
0.598HisAsp: 0.598 ± 0.223
0.768HisGlu: 0.768 ± 0.298
0.598HisPhe: 0.598 ± 0.201
0.768HisGly: 0.768 ± 0.234
0.085HisHis: 0.085 ± 0.069
1.281HisIle: 1.281 ± 0.33
0.939HisLys: 0.939 ± 0.322
1.195HisLeu: 1.195 ± 0.308
0.256HisMet: 0.256 ± 0.152
0.939HisAsn: 0.939 ± 0.299
0.341HisPro: 0.341 ± 0.15
0.427HisGln: 0.427 ± 0.21
1.024HisArg: 1.024 ± 0.315
0.598HisSer: 0.598 ± 0.32
0.854HisThr: 0.854 ± 0.286
0.683HisVal: 0.683 ± 0.252
0.171HisTrp: 0.171 ± 0.115
0.598HisTyr: 0.598 ± 0.266
0.0HisXaa: 0.0 ± 0.0
Ile
5.805IleAla: 5.805 ± 0.593
0.427IleCys: 0.427 ± 0.204
5.805IleAsp: 5.805 ± 0.67
5.464IleGlu: 5.464 ± 0.808
1.793IlePhe: 1.793 ± 0.423
5.122IleGly: 5.122 ± 0.636
0.854IleHis: 0.854 ± 0.221
4.354IleIle: 4.354 ± 0.658
5.037IleLys: 5.037 ± 0.644
4.183IleLeu: 4.183 ± 0.531
1.366IleMet: 1.366 ± 0.427
4.951IleAsn: 4.951 ± 0.735
2.817IlePro: 2.817 ± 0.671
2.049IleGln: 2.049 ± 0.515
2.476IleArg: 2.476 ± 0.444
4.866IleSer: 4.866 ± 0.62
5.805IleThr: 5.805 ± 0.771
4.781IleVal: 4.781 ± 0.684
0.683IleTrp: 0.683 ± 0.247
2.561IleTyr: 2.561 ± 0.576
0.0IleXaa: 0.0 ± 0.0
Lys
5.805LysAla: 5.805 ± 0.789
0.171LysCys: 0.171 ± 0.119
4.525LysAsp: 4.525 ± 0.669
5.207LysGlu: 5.207 ± 0.866
2.476LysPhe: 2.476 ± 0.387
4.525LysGly: 4.525 ± 0.575
1.451LysHis: 1.451 ± 0.335
5.976LysIle: 5.976 ± 0.842
6.573LysLys: 6.573 ± 1.141
6.146LysLeu: 6.146 ± 0.708
2.049LysMet: 2.049 ± 0.594
5.464LysAsn: 5.464 ± 0.675
2.732LysPro: 2.732 ± 0.549
5.037LysGln: 5.037 ± 0.633
4.781LysArg: 4.781 ± 0.713
5.549LysSer: 5.549 ± 0.539
5.293LysThr: 5.293 ± 0.728
5.207LysVal: 5.207 ± 0.797
1.622LysTrp: 1.622 ± 0.485
4.012LysTyr: 4.012 ± 0.541
0.0LysXaa: 0.0 ± 0.0
Leu
5.207LeuAla: 5.207 ± 0.657
0.341LeuCys: 0.341 ± 0.214
4.951LeuAsp: 4.951 ± 0.781
7.512LeuGlu: 7.512 ± 0.816
2.476LeuPhe: 2.476 ± 0.385
3.842LeuGly: 3.842 ± 0.721
1.024LeuHis: 1.024 ± 0.334
6.232LeuIle: 6.232 ± 0.723
7.598LeuLys: 7.598 ± 0.714
6.829LeuLeu: 6.829 ± 0.994
1.622LeuMet: 1.622 ± 0.391
5.122LeuAsn: 5.122 ± 0.607
2.134LeuPro: 2.134 ± 0.576
3.5LeuGln: 3.5 ± 0.566
3.244LeuArg: 3.244 ± 0.517
6.146LeuSer: 6.146 ± 0.697
6.232LeuThr: 6.232 ± 0.628
4.61LeuVal: 4.61 ± 0.718
1.024LeuTrp: 1.024 ± 0.276
3.329LeuTyr: 3.329 ± 0.487
0.0LeuXaa: 0.0 ± 0.0
Met
1.707MetAla: 1.707 ± 0.47
0.085MetCys: 0.085 ± 0.102
1.878MetAsp: 1.878 ± 0.519
1.366MetGlu: 1.366 ± 0.362
1.195MetPhe: 1.195 ± 0.321
0.598MetGly: 0.598 ± 0.28
0.256MetHis: 0.256 ± 0.135
1.195MetIle: 1.195 ± 0.378
1.451MetLys: 1.451 ± 0.381
2.476MetLeu: 2.476 ± 0.558
0.683MetMet: 0.683 ± 0.312
1.366MetAsn: 1.366 ± 0.311
0.768MetPro: 0.768 ± 0.273
1.024MetGln: 1.024 ± 0.339
1.537MetArg: 1.537 ± 0.41
1.451MetSer: 1.451 ± 0.31
2.732MetThr: 2.732 ± 0.437
1.195MetVal: 1.195 ± 0.35
0.171MetTrp: 0.171 ± 0.123
0.854MetTyr: 0.854 ± 0.248
0.0MetXaa: 0.0 ± 0.0
Asn
3.927AsnAla: 3.927 ± 0.539
0.085AsnCys: 0.085 ± 0.094
3.329AsnAsp: 3.329 ± 0.491
3.415AsnGlu: 3.415 ± 0.459
2.22AsnPhe: 2.22 ± 0.358
4.525AsnGly: 4.525 ± 0.666
1.024AsnHis: 1.024 ± 0.317
4.61AsnIle: 4.61 ± 0.733
5.634AsnLys: 5.634 ± 0.64
5.037AsnLeu: 5.037 ± 0.695
1.793AsnMet: 1.793 ± 0.41
4.012AsnAsn: 4.012 ± 0.603
2.305AsnPro: 2.305 ± 0.513
2.817AsnGln: 2.817 ± 0.496
2.134AsnArg: 2.134 ± 0.346
3.329AsnSer: 3.329 ± 0.52
3.329AsnThr: 3.329 ± 0.484
3.671AsnVal: 3.671 ± 0.594
0.939AsnTrp: 0.939 ± 0.331
2.476AsnTyr: 2.476 ± 0.377
0.0AsnXaa: 0.0 ± 0.0
Pro
1.281ProAla: 1.281 ± 0.364
0.0ProCys: 0.0 ± 0.0
1.622ProAsp: 1.622 ± 0.389
1.878ProGlu: 1.878 ± 0.483
1.707ProPhe: 1.707 ± 0.533
0.768ProGly: 0.768 ± 0.216
0.768ProHis: 0.768 ± 0.273
1.622ProIle: 1.622 ± 0.382
3.073ProLys: 3.073 ± 0.686
2.049ProLeu: 2.049 ± 0.425
0.256ProMet: 0.256 ± 0.142
1.451ProAsn: 1.451 ± 0.365
1.281ProPro: 1.281 ± 0.329
1.622ProGln: 1.622 ± 0.375
1.195ProArg: 1.195 ± 0.376
2.903ProSer: 2.903 ± 0.674
1.622ProThr: 1.622 ± 0.5
1.537ProVal: 1.537 ± 0.299
0.171ProTrp: 0.171 ± 0.134
1.195ProTyr: 1.195 ± 0.296
0.0ProXaa: 0.0 ± 0.0
Gln
3.329GlnAla: 3.329 ± 0.658
0.341GlnCys: 0.341 ± 0.171
2.049GlnAsp: 2.049 ± 0.437
2.22GlnGlu: 2.22 ± 0.473
1.451GlnPhe: 1.451 ± 0.422
2.476GlnGly: 2.476 ± 0.509
0.768GlnHis: 0.768 ± 0.239
2.903GlnIle: 2.903 ± 0.373
4.012GlnLys: 4.012 ± 0.593
3.073GlnLeu: 3.073 ± 0.49
0.939GlnMet: 0.939 ± 0.313
3.073GlnAsn: 3.073 ± 0.513
1.11GlnPro: 1.11 ± 0.369
2.134GlnGln: 2.134 ± 0.51
1.195GlnArg: 1.195 ± 0.327
2.903GlnSer: 2.903 ± 0.46
1.963GlnThr: 1.963 ± 0.432
3.244GlnVal: 3.244 ± 0.428
0.854GlnTrp: 0.854 ± 0.245
1.537GlnTyr: 1.537 ± 0.316
0.0GlnXaa: 0.0 ± 0.0
Arg
2.903ArgAla: 2.903 ± 0.473
0.256ArgCys: 0.256 ± 0.142
2.646ArgAsp: 2.646 ± 0.416
2.561ArgGlu: 2.561 ± 0.42
1.451ArgPhe: 1.451 ± 0.446
2.305ArgGly: 2.305 ± 0.492
0.683ArgHis: 0.683 ± 0.26
2.732ArgIle: 2.732 ± 0.503
3.842ArgLys: 3.842 ± 0.539
4.354ArgLeu: 4.354 ± 0.886
1.11ArgMet: 1.11 ± 0.344
2.22ArgAsn: 2.22 ± 0.541
1.366ArgPro: 1.366 ± 0.372
2.561ArgGln: 2.561 ± 0.556
2.476ArgArg: 2.476 ± 0.475
1.622ArgSer: 1.622 ± 0.36
2.561ArgThr: 2.561 ± 0.576
2.22ArgVal: 2.22 ± 0.554
0.854ArgTrp: 0.854 ± 0.324
2.22ArgTyr: 2.22 ± 0.488
0.0ArgXaa: 0.0 ± 0.0
Ser
4.781SerAla: 4.781 ± 0.779
0.171SerCys: 0.171 ± 0.131
4.781SerAsp: 4.781 ± 0.66
4.866SerGlu: 4.866 ± 0.685
2.903SerPhe: 2.903 ± 0.729
5.976SerGly: 5.976 ± 1.046
0.939SerHis: 0.939 ± 0.286
3.842SerIle: 3.842 ± 0.493
4.525SerLys: 4.525 ± 0.571
5.549SerLeu: 5.549 ± 0.601
1.281SerMet: 1.281 ± 0.419
2.988SerAsn: 2.988 ± 0.551
1.451SerPro: 1.451 ± 0.359
3.671SerGln: 3.671 ± 0.577
1.537SerArg: 1.537 ± 0.337
4.61SerSer: 4.61 ± 1.151
3.5SerThr: 3.5 ± 0.693
4.183SerVal: 4.183 ± 0.734
0.768SerTrp: 0.768 ± 0.278
2.39SerTyr: 2.39 ± 0.476
0.0SerXaa: 0.0 ± 0.0
Thr
3.244ThrAla: 3.244 ± 0.626
0.085ThrCys: 0.085 ± 0.098
3.842ThrAsp: 3.842 ± 0.564
3.415ThrGlu: 3.415 ± 0.51
3.415ThrPhe: 3.415 ± 0.599
4.695ThrGly: 4.695 ± 0.73
1.281ThrHis: 1.281 ± 0.374
6.061ThrIle: 6.061 ± 1.03
5.976ThrLys: 5.976 ± 0.823
6.573ThrLeu: 6.573 ± 0.909
0.939ThrMet: 0.939 ± 0.312
4.354ThrAsn: 4.354 ± 0.679
1.793ThrPro: 1.793 ± 0.424
2.049ThrGln: 2.049 ± 0.491
2.305ThrArg: 2.305 ± 0.354
3.159ThrSer: 3.159 ± 0.505
5.464ThrThr: 5.464 ± 1.098
3.842ThrVal: 3.842 ± 0.893
0.598ThrTrp: 0.598 ± 0.174
2.305ThrTyr: 2.305 ± 0.428
0.0ThrXaa: 0.0 ± 0.0
Val
4.183ValAla: 4.183 ± 0.605
0.512ValCys: 0.512 ± 0.233
4.866ValAsp: 4.866 ± 0.632
4.61ValGlu: 4.61 ± 0.773
2.39ValPhe: 2.39 ± 0.443
3.585ValGly: 3.585 ± 0.844
0.512ValHis: 0.512 ± 0.187
3.329ValIle: 3.329 ± 0.749
5.122ValLys: 5.122 ± 0.691
4.098ValLeu: 4.098 ± 0.425
1.11ValMet: 1.11 ± 0.363
3.585ValAsn: 3.585 ± 0.507
1.793ValPro: 1.793 ± 0.301
1.963ValGln: 1.963 ± 0.529
2.988ValArg: 2.988 ± 0.573
5.207ValSer: 5.207 ± 0.57
4.781ValThr: 4.781 ± 0.819
4.098ValVal: 4.098 ± 0.759
0.427ValTrp: 0.427 ± 0.174
2.305ValTyr: 2.305 ± 0.448
0.0ValXaa: 0.0 ± 0.0
Trp
0.768TrpAla: 0.768 ± 0.29
0.256TrpCys: 0.256 ± 0.152
0.683TrpAsp: 0.683 ± 0.262
0.598TrpGlu: 0.598 ± 0.178
0.598TrpPhe: 0.598 ± 0.215
1.195TrpGly: 1.195 ± 0.385
0.427TrpHis: 0.427 ± 0.17
0.939TrpIle: 0.939 ± 0.253
1.11TrpLys: 1.11 ± 0.322
1.451TrpLeu: 1.451 ± 0.413
0.512TrpMet: 0.512 ± 0.223
0.683TrpAsn: 0.683 ± 0.204
0.0TrpPro: 0.0 ± 0.0
0.512TrpGln: 0.512 ± 0.22
0.683TrpArg: 0.683 ± 0.268
1.451TrpSer: 1.451 ± 0.352
0.854TrpThr: 0.854 ± 0.303
0.939TrpVal: 0.939 ± 0.304
0.256TrpTrp: 0.256 ± 0.162
0.256TrpTyr: 0.256 ± 0.123
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.244TyrAla: 3.244 ± 0.815
0.341TyrCys: 0.341 ± 0.198
3.244TyrAsp: 3.244 ± 0.55
2.476TyrGlu: 2.476 ± 0.429
2.305TyrPhe: 2.305 ± 0.495
2.903TyrGly: 2.903 ± 0.484
0.598TyrHis: 0.598 ± 0.231
3.073TyrIle: 3.073 ± 0.598
3.585TyrLys: 3.585 ± 0.639
4.183TyrLeu: 4.183 ± 0.703
1.024TyrMet: 1.024 ± 0.265
2.39TyrAsn: 2.39 ± 0.59
1.622TyrPro: 1.622 ± 0.366
1.537TyrGln: 1.537 ± 0.302
2.049TyrArg: 2.049 ± 0.394
2.049TyrSer: 2.049 ± 0.384
1.793TyrThr: 1.793 ± 0.404
2.305TyrVal: 2.305 ± 0.522
0.427TyrTrp: 0.427 ± 0.204
1.963TyrTyr: 1.963 ± 0.342
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (11715 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski