Amino acid dipepetide frequency for Staphylococcus phage St 134

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.854AlaAla: 0.854 ± 0.522
0.171AlaCys: 0.171 ± 0.168
2.39AlaAsp: 2.39 ± 0.61
1.878AlaGlu: 1.878 ± 0.765
3.585AlaPhe: 3.585 ± 0.685
2.561AlaGly: 2.561 ± 0.47
0.512AlaHis: 0.512 ± 0.239
2.049AlaIle: 2.049 ± 0.521
2.903AlaLys: 2.903 ± 0.818
3.756AlaLeu: 3.756 ± 0.784
1.024AlaMet: 1.024 ± 0.534
3.415AlaAsn: 3.415 ± 0.9
1.024AlaPro: 1.024 ± 0.443
0.854AlaGln: 0.854 ± 0.631
2.22AlaArg: 2.22 ± 0.723
4.098AlaSer: 4.098 ± 1.177
3.244AlaThr: 3.244 ± 0.982
1.707AlaVal: 1.707 ± 0.428
0.512AlaTrp: 0.512 ± 0.368
2.903AlaTyr: 2.903 ± 0.94
0.0AlaXaa: 0.0 ± 0.0
Cys
0.171CysAla: 0.171 ± 0.21
0.0CysCys: 0.0 ± 0.0
1.024CysAsp: 1.024 ± 0.51
0.341CysGlu: 0.341 ± 0.216
1.024CysPhe: 1.024 ± 0.521
1.024CysGly: 1.024 ± 0.67
0.171CysHis: 0.171 ± 0.136
0.341CysIle: 0.341 ± 0.249
0.171CysLys: 0.171 ± 0.164
0.341CysLeu: 0.341 ± 0.215
0.0CysMet: 0.0 ± 0.0
0.341CysAsn: 0.341 ± 0.196
0.341CysPro: 0.341 ± 0.233
0.683CysGln: 0.683 ± 0.316
0.0CysArg: 0.0 ± 0.0
0.512CysSer: 0.512 ± 0.337
0.683CysThr: 0.683 ± 0.266
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.341CysTyr: 0.341 ± 0.194
0.0CysXaa: 0.0 ± 0.0
Asp
2.903AspAla: 2.903 ± 0.648
0.171AspCys: 0.171 ± 0.164
5.805AspAsp: 5.805 ± 1.039
4.781AspGlu: 4.781 ± 1.085
5.634AspPhe: 5.634 ± 1.208
4.098AspGly: 4.098 ± 0.74
0.683AspHis: 0.683 ± 0.258
5.634AspIle: 5.634 ± 1.293
5.122AspLys: 5.122 ± 0.982
4.781AspLeu: 4.781 ± 0.837
1.537AspMet: 1.537 ± 0.449
5.805AspAsn: 5.805 ± 0.976
0.854AspPro: 0.854 ± 0.321
0.683AspGln: 0.683 ± 0.31
1.878AspArg: 1.878 ± 0.71
4.439AspSer: 4.439 ± 0.62
4.268AspThr: 4.268 ± 0.961
4.439AspVal: 4.439 ± 0.786
0.512AspTrp: 0.512 ± 0.409
6.488AspTyr: 6.488 ± 0.782
0.0AspXaa: 0.0 ± 0.0
Glu
3.585GluAla: 3.585 ± 0.955
0.854GluCys: 0.854 ± 0.556
3.756GluAsp: 3.756 ± 0.958
4.781GluGlu: 4.781 ± 1.469
2.049GluPhe: 2.049 ± 0.511
1.366GluGly: 1.366 ± 0.607
1.195GluHis: 1.195 ± 0.497
4.61GluIle: 4.61 ± 0.675
7.171GluLys: 7.171 ± 1.739
5.122GluLeu: 5.122 ± 0.851
3.244GluMet: 3.244 ± 1.199
3.415GluAsn: 3.415 ± 0.486
1.878GluPro: 1.878 ± 0.543
3.756GluGln: 3.756 ± 0.847
2.39GluArg: 2.39 ± 0.647
4.098GluSer: 4.098 ± 1.469
3.585GluThr: 3.585 ± 0.631
3.073GluVal: 3.073 ± 0.758
0.683GluTrp: 0.683 ± 0.289
4.61GluTyr: 4.61 ± 0.81
0.0GluXaa: 0.0 ± 0.0
Phe
1.024PheAla: 1.024 ± 0.36
0.341PheCys: 0.341 ± 0.249
4.439PheAsp: 4.439 ± 0.937
3.415PheGlu: 3.415 ± 0.653
2.22PhePhe: 2.22 ± 0.589
2.732PheGly: 2.732 ± 0.541
0.854PheHis: 0.854 ± 0.354
5.122PheIle: 5.122 ± 0.966
2.903PheLys: 2.903 ± 0.652
4.098PheLeu: 4.098 ± 0.681
1.707PheMet: 1.707 ± 0.526
5.976PheAsn: 5.976 ± 0.894
1.537PhePro: 1.537 ± 0.431
1.366PheGln: 1.366 ± 0.545
1.366PheArg: 1.366 ± 0.35
3.756PheSer: 3.756 ± 1.108
2.732PheThr: 2.732 ± 0.674
2.903PheVal: 2.903 ± 0.556
0.171PheTrp: 0.171 ± 0.162
2.903PheTyr: 2.903 ± 0.694
0.0PheXaa: 0.0 ± 0.0
Gly
2.732GlyAla: 2.732 ± 0.693
0.341GlyCys: 0.341 ± 0.328
3.585GlyAsp: 3.585 ± 0.806
2.22GlyGlu: 2.22 ± 0.765
2.903GlyPhe: 2.903 ± 0.571
4.098GlyGly: 4.098 ± 1.416
1.024GlyHis: 1.024 ± 0.372
4.268GlyIle: 4.268 ± 1.039
4.61GlyLys: 4.61 ± 1.116
4.781GlyLeu: 4.781 ± 0.931
1.195GlyMet: 1.195 ± 0.346
4.951GlyAsn: 4.951 ± 1.256
0.0GlyPro: 0.0 ± 0.0
2.39GlyGln: 2.39 ± 0.568
1.707GlyArg: 1.707 ± 0.559
4.781GlySer: 4.781 ± 1.326
2.903GlyThr: 2.903 ± 0.742
3.927GlyVal: 3.927 ± 0.983
0.854GlyTrp: 0.854 ± 0.308
2.561GlyTyr: 2.561 ± 0.663
0.0GlyXaa: 0.0 ± 0.0
His
0.512HisAla: 0.512 ± 0.305
0.341HisCys: 0.341 ± 0.233
0.683HisAsp: 0.683 ± 0.329
1.195HisGlu: 1.195 ± 0.49
2.22HisPhe: 2.22 ± 0.769
1.537HisGly: 1.537 ± 0.887
0.854HisHis: 0.854 ± 0.444
2.732HisIle: 2.732 ± 0.622
1.707HisLys: 1.707 ± 0.605
1.707HisLeu: 1.707 ± 0.545
0.171HisMet: 0.171 ± 0.187
1.366HisAsn: 1.366 ± 0.418
0.512HisPro: 0.512 ± 0.306
0.854HisGln: 0.854 ± 0.433
0.683HisArg: 0.683 ± 0.457
1.366HisSer: 1.366 ± 0.58
1.537HisThr: 1.537 ± 0.456
0.683HisVal: 0.683 ± 0.268
0.171HisTrp: 0.171 ± 0.171
0.854HisTyr: 0.854 ± 0.335
0.0HisXaa: 0.0 ± 0.0
Ile
3.244IleAla: 3.244 ± 0.795
0.512IleCys: 0.512 ± 0.282
8.195IleAsp: 8.195 ± 0.707
6.829IleGlu: 6.829 ± 1.188
2.561IlePhe: 2.561 ± 0.829
3.927IleGly: 3.927 ± 0.598
2.22IleHis: 2.22 ± 0.434
5.805IleIle: 5.805 ± 1.123
7.342IleLys: 7.342 ± 1.306
4.439IleLeu: 4.439 ± 0.733
2.39IleMet: 2.39 ± 0.782
8.025IleAsn: 8.025 ± 1.232
2.903IlePro: 2.903 ± 0.713
2.049IleGln: 2.049 ± 0.685
1.195IleArg: 1.195 ± 0.282
3.756IleSer: 3.756 ± 0.621
6.317IleThr: 6.317 ± 1.367
2.732IleVal: 2.732 ± 0.665
0.341IleTrp: 0.341 ± 0.248
3.756IleTyr: 3.756 ± 0.803
0.0IleXaa: 0.0 ± 0.0
Lys
2.39LysAla: 2.39 ± 0.716
0.341LysCys: 0.341 ± 0.216
6.317LysAsp: 6.317 ± 1.329
5.634LysGlu: 5.634 ± 1.158
3.756LysPhe: 3.756 ± 0.733
6.146LysGly: 6.146 ± 0.64
1.366LysHis: 1.366 ± 0.41
7.342LysIle: 7.342 ± 1.243
4.439LysLys: 4.439 ± 0.98
5.464LysLeu: 5.464 ± 0.743
1.878LysMet: 1.878 ± 0.491
5.634LysAsn: 5.634 ± 0.789
2.22LysPro: 2.22 ± 0.522
3.585LysGln: 3.585 ± 0.89
2.903LysArg: 2.903 ± 0.696
5.634LysSer: 5.634 ± 0.864
5.464LysThr: 5.464 ± 0.785
4.781LysVal: 4.781 ± 0.622
0.512LysTrp: 0.512 ± 0.258
4.098LysTyr: 4.098 ± 1.062
0.0LysXaa: 0.0 ± 0.0
Leu
4.098LeuAla: 4.098 ± 0.982
0.683LeuCys: 0.683 ± 0.373
3.415LeuAsp: 3.415 ± 0.552
4.951LeuGlu: 4.951 ± 0.998
4.268LeuPhe: 4.268 ± 0.949
2.561LeuGly: 2.561 ± 0.764
1.707LeuHis: 1.707 ± 0.519
4.951LeuIle: 4.951 ± 0.915
5.976LeuLys: 5.976 ± 0.907
5.464LeuLeu: 5.464 ± 1.109
1.878LeuMet: 1.878 ± 0.665
7.171LeuAsn: 7.171 ± 0.974
2.049LeuPro: 2.049 ± 0.527
3.415LeuGln: 3.415 ± 0.709
3.756LeuArg: 3.756 ± 0.884
3.927LeuSer: 3.927 ± 0.694
4.268LeuThr: 4.268 ± 0.705
3.073LeuVal: 3.073 ± 0.792
0.683LeuTrp: 0.683 ± 0.575
3.585LeuTyr: 3.585 ± 1.302
0.0LeuXaa: 0.0 ± 0.0
Met
1.537MetAla: 1.537 ± 0.537
0.171MetCys: 0.171 ± 0.136
1.537MetAsp: 1.537 ± 0.425
0.683MetGlu: 0.683 ± 0.41
0.854MetPhe: 0.854 ± 0.399
0.683MetGly: 0.683 ± 0.268
0.171MetHis: 0.171 ± 0.191
1.366MetIle: 1.366 ± 0.478
3.415MetLys: 3.415 ± 0.616
2.049MetLeu: 2.049 ± 0.564
0.854MetMet: 0.854 ± 0.286
2.22MetAsn: 2.22 ± 0.541
0.171MetPro: 0.171 ± 0.182
1.024MetGln: 1.024 ± 0.321
1.195MetArg: 1.195 ± 0.413
1.537MetSer: 1.537 ± 0.57
2.22MetThr: 2.22 ± 0.518
1.366MetVal: 1.366 ± 0.502
0.0MetTrp: 0.0 ± 0.0
3.073MetTyr: 3.073 ± 0.787
0.0MetXaa: 0.0 ± 0.0
Asn
3.415AsnAla: 3.415 ± 0.81
1.024AsnCys: 1.024 ± 0.468
5.976AsnAsp: 5.976 ± 1.042
7.171AsnGlu: 7.171 ± 1.111
3.927AsnPhe: 3.927 ± 0.703
6.829AsnGly: 6.829 ± 1.722
2.39AsnHis: 2.39 ± 0.599
5.464AsnIle: 5.464 ± 0.705
6.488AsnLys: 6.488 ± 1.073
7.683AsnLeu: 7.683 ± 1.216
2.22AsnMet: 2.22 ± 0.451
4.781AsnAsn: 4.781 ± 0.593
1.537AsnPro: 1.537 ± 0.477
2.732AsnGln: 2.732 ± 0.558
3.927AsnArg: 3.927 ± 0.744
4.268AsnSer: 4.268 ± 0.768
6.317AsnThr: 6.317 ± 1.086
3.585AsnVal: 3.585 ± 0.784
0.854AsnTrp: 0.854 ± 0.553
5.293AsnTyr: 5.293 ± 0.824
0.0AsnXaa: 0.0 ± 0.0
Pro
1.195ProAla: 1.195 ± 0.49
0.0ProCys: 0.0 ± 0.0
0.854ProAsp: 0.854 ± 0.469
1.537ProGlu: 1.537 ± 0.714
2.049ProPhe: 2.049 ± 0.685
0.341ProGly: 0.341 ± 0.248
0.854ProHis: 0.854 ± 0.49
2.39ProIle: 2.39 ± 0.63
2.22ProLys: 2.22 ± 0.715
1.707ProLeu: 1.707 ± 0.376
0.854ProMet: 0.854 ± 0.364
1.707ProAsn: 1.707 ± 0.463
1.024ProPro: 1.024 ± 0.484
1.195ProGln: 1.195 ± 0.495
0.512ProArg: 0.512 ± 0.266
2.049ProSer: 2.049 ± 0.587
1.878ProThr: 1.878 ± 0.486
1.195ProVal: 1.195 ± 0.42
0.171ProTrp: 0.171 ± 0.162
1.707ProTyr: 1.707 ± 0.627
0.0ProXaa: 0.0 ± 0.0
Gln
2.903GlnAla: 2.903 ± 0.706
0.683GlnCys: 0.683 ± 0.348
2.561GlnAsp: 2.561 ± 0.609
2.39GlnGlu: 2.39 ± 0.928
1.195GlnPhe: 1.195 ± 0.324
1.537GlnGly: 1.537 ± 0.383
0.854GlnHis: 0.854 ± 0.349
1.366GlnIle: 1.366 ± 0.413
4.268GlnLys: 4.268 ± 0.79
2.903GlnLeu: 2.903 ± 0.519
1.195GlnMet: 1.195 ± 0.417
2.903GlnAsn: 2.903 ± 0.892
1.707GlnPro: 1.707 ± 0.448
0.683GlnGln: 0.683 ± 0.403
1.366GlnArg: 1.366 ± 0.607
2.903GlnSer: 2.903 ± 0.751
1.707GlnThr: 1.707 ± 0.524
1.537GlnVal: 1.537 ± 0.493
0.854GlnTrp: 0.854 ± 0.489
2.049GlnTyr: 2.049 ± 0.469
0.0GlnXaa: 0.0 ± 0.0
Arg
1.707ArgAla: 1.707 ± 0.484
0.171ArgCys: 0.171 ± 0.164
2.049ArgAsp: 2.049 ± 0.553
2.903ArgGlu: 2.903 ± 0.536
2.049ArgPhe: 2.049 ± 0.636
2.39ArgGly: 2.39 ± 1.042
0.512ArgHis: 0.512 ± 0.309
3.244ArgIle: 3.244 ± 0.768
1.707ArgLys: 1.707 ± 0.594
2.049ArgLeu: 2.049 ± 0.46
0.512ArgMet: 0.512 ± 0.333
4.61ArgAsn: 4.61 ± 0.745
0.341ArgPro: 0.341 ± 0.248
1.878ArgGln: 1.878 ± 0.473
2.561ArgArg: 2.561 ± 0.532
2.39ArgSer: 2.39 ± 0.549
1.707ArgThr: 1.707 ± 0.49
2.39ArgVal: 2.39 ± 0.682
0.0ArgTrp: 0.0 ± 0.0
1.537ArgTyr: 1.537 ± 0.345
0.0ArgXaa: 0.0 ± 0.0
Ser
2.22SerAla: 2.22 ± 0.477
0.171SerCys: 0.171 ± 0.182
5.293SerAsp: 5.293 ± 0.733
3.585SerGlu: 3.585 ± 0.818
2.732SerPhe: 2.732 ± 0.603
3.415SerGly: 3.415 ± 0.954
1.537SerHis: 1.537 ± 0.471
4.951SerIle: 4.951 ± 0.898
5.976SerLys: 5.976 ± 0.866
3.244SerLeu: 3.244 ± 0.569
1.195SerMet: 1.195 ± 0.408
7.0SerAsn: 7.0 ± 1.263
1.195SerPro: 1.195 ± 0.359
3.073SerGln: 3.073 ± 0.597
3.415SerArg: 3.415 ± 0.871
5.293SerSer: 5.293 ± 1.483
2.39SerThr: 2.39 ± 0.854
3.073SerVal: 3.073 ± 0.667
1.024SerTrp: 1.024 ± 0.537
3.073SerTyr: 3.073 ± 0.634
0.0SerXaa: 0.0 ± 0.0
Thr
1.537ThrAla: 1.537 ± 0.417
0.171ThrCys: 0.171 ± 0.164
4.781ThrAsp: 4.781 ± 0.687
4.781ThrGlu: 4.781 ± 1.331
3.927ThrPhe: 3.927 ± 0.683
3.585ThrGly: 3.585 ± 0.667
1.707ThrHis: 1.707 ± 0.342
5.976ThrIle: 5.976 ± 0.91
5.464ThrLys: 5.464 ± 1.159
4.098ThrLeu: 4.098 ± 0.844
1.024ThrMet: 1.024 ± 0.266
3.756ThrAsn: 3.756 ± 0.91
2.22ThrPro: 2.22 ± 0.793
2.39ThrGln: 2.39 ± 0.564
1.878ThrArg: 1.878 ± 0.618
2.39ThrSer: 2.39 ± 0.625
3.756ThrThr: 3.756 ± 0.656
3.756ThrVal: 3.756 ± 1.025
0.171ThrTrp: 0.171 ± 0.157
4.439ThrTyr: 4.439 ± 0.906
0.0ThrXaa: 0.0 ± 0.0
Val
2.049ValAla: 2.049 ± 0.8
0.171ValCys: 0.171 ± 0.136
3.073ValAsp: 3.073 ± 0.613
3.415ValGlu: 3.415 ± 0.837
2.049ValPhe: 2.049 ± 0.478
2.903ValGly: 2.903 ± 0.711
0.683ValHis: 0.683 ± 0.292
4.268ValIle: 4.268 ± 0.723
4.268ValLys: 4.268 ± 0.871
3.073ValLeu: 3.073 ± 0.538
1.366ValMet: 1.366 ± 0.367
5.122ValAsn: 5.122 ± 0.718
2.22ValPro: 2.22 ± 0.905
2.049ValGln: 2.049 ± 0.553
1.366ValArg: 1.366 ± 0.589
3.244ValSer: 3.244 ± 0.636
3.756ValThr: 3.756 ± 0.626
3.415ValVal: 3.415 ± 1.0
0.171ValTrp: 0.171 ± 0.157
2.39ValTyr: 2.39 ± 0.636
0.0ValXaa: 0.0 ± 0.0
Trp
0.683TrpAla: 0.683 ± 0.409
0.341TrpCys: 0.341 ± 0.222
0.683TrpAsp: 0.683 ± 0.337
0.512TrpGlu: 0.512 ± 0.285
0.0TrpPhe: 0.0 ± 0.0
0.854TrpGly: 0.854 ± 0.45
0.854TrpHis: 0.854 ± 0.408
0.683TrpIle: 0.683 ± 0.306
0.0TrpLys: 0.0 ± 0.0
0.683TrpLeu: 0.683 ± 0.334
0.171TrpMet: 0.171 ± 0.166
0.341TrpAsn: 0.341 ± 0.236
0.0TrpPro: 0.0 ± 0.0
0.512TrpGln: 0.512 ± 0.286
0.171TrpArg: 0.171 ± 0.188
0.341TrpSer: 0.341 ± 0.297
0.512TrpThr: 0.512 ± 0.233
0.341TrpVal: 0.341 ± 0.247
0.171TrpTrp: 0.171 ± 0.182
0.341TrpTyr: 0.341 ± 0.216
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.903TyrAla: 2.903 ± 0.733
0.854TyrCys: 0.854 ± 0.342
4.098TyrAsp: 4.098 ± 0.899
2.39TyrGlu: 2.39 ± 0.575
2.561TyrPhe: 2.561 ± 0.745
3.244TyrGly: 3.244 ± 0.944
1.537TyrHis: 1.537 ± 0.473
5.976TyrIle: 5.976 ± 0.8
3.756TyrLys: 3.756 ± 0.64
4.439TyrLeu: 4.439 ± 0.886
1.537TyrMet: 1.537 ± 0.413
7.342TyrAsn: 7.342 ± 1.001
1.707TyrPro: 1.707 ± 0.452
2.39TyrGln: 2.39 ± 0.668
2.22TyrArg: 2.22 ± 0.559
3.073TyrSer: 3.073 ± 0.623
2.561TyrThr: 2.561 ± 0.552
3.073TyrVal: 3.073 ± 0.614
0.341TyrTrp: 0.341 ± 0.194
3.073TyrTyr: 3.073 ± 0.807
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (5858 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski