Amino acid dipeptide frequency for Staphylococcus phage 44AHJD

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
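As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides in a set of protein sequences and reports them in per mille. The function names `dipeptide_permille` and `representation` are hypothetical, and the normalization (dividing by the number of dipeptide windows) is an assumption; the page does not state the exact formula used by the database. The smallest non-zero value in the table, 0.192, is close to 1000/5203, which suggests the database may divide by the total number of amino acids instead.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
EXPECTED_PERMILLE = 1000.0 / 400    # uniform expectation: 2.5 per mille = 0.25 %


def dipeptide_permille(proteins):
    """Return overlapping dipeptide frequencies in per mille (hypothetical helper).

    Assumption: frequencies are normalized by the total number of
    dipeptide windows, which may differ from the database's own method.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(value, expected=EXPECTED_PERMILLE):
    """Classify a per mille value against the uniform expectation."""
    if value > expected:
        return "over"    # would be marked red in the table
    if value < expected:
        return "under"   # would be marked blue in the table
    return "expected"
```

For example, `representation(9.035)` (the IleAsp value in the table) gives `"over"`, while `representation(0.192)` (AlaCys) gives `"under"`.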

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.577AlaAla: 0.577 ± 0.512
0.192AlaCys: 0.192 ± 0.196
1.73AlaAsp: 1.73 ± 0.414
1.73AlaGlu: 1.73 ± 0.546
2.115AlaPhe: 2.115 ± 0.474
3.076AlaGly: 3.076 ± 1.01
0.577AlaHis: 0.577 ± 0.353
3.076AlaIle: 3.076 ± 0.621
4.806AlaLys: 4.806 ± 0.971
4.037AlaLeu: 4.037 ± 0.68
0.961AlaMet: 0.961 ± 0.399
2.115AlaAsn: 2.115 ± 0.698
1.153AlaPro: 1.153 ± 0.572
0.769AlaGln: 0.769 ± 0.393
2.115AlaArg: 2.115 ± 0.483
1.346AlaSer: 1.346 ± 0.373
2.307AlaThr: 2.307 ± 0.704
2.499AlaVal: 2.499 ± 0.552
0.384AlaTrp: 0.384 ± 0.229
4.229AlaTyr: 4.229 ± 0.764
0.0AlaXaa: 0.0 ± 0.0
Cys
0.384CysAla: 0.384 ± 0.372
0.0CysCys: 0.0 ± 0.0
0.384CysAsp: 0.384 ± 0.271
0.192CysGlu: 0.192 ± 0.156
0.769CysPhe: 0.769 ± 0.379
0.384CysGly: 0.384 ± 0.251
0.192CysHis: 0.192 ± 0.156
0.961CysIle: 0.961 ± 0.514
0.192CysLys: 0.192 ± 0.198
0.577CysLeu: 0.577 ± 0.429
0.769CysMet: 0.769 ± 0.402
0.384CysAsn: 0.384 ± 0.215
0.0CysPro: 0.0 ± 0.0
0.192CysGln: 0.192 ± 0.196
0.0CysArg: 0.0 ± 0.0
0.192CysSer: 0.192 ± 0.243
0.577CysThr: 0.577 ± 0.3
0.192CysVal: 0.192 ± 0.192
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.076AspAla: 3.076 ± 0.615
0.384AspCys: 0.384 ± 0.249
8.074AspAsp: 8.074 ± 1.384
5.575AspGlu: 5.575 ± 1.356
3.845AspPhe: 3.845 ± 0.701
4.037AspGly: 4.037 ± 1.59
0.961AspHis: 0.961 ± 0.323
6.536AspIle: 6.536 ± 1.692
4.614AspLys: 4.614 ± 1.021
6.344AspLeu: 6.344 ± 1.101
1.153AspMet: 1.153 ± 0.543
7.113AspAsn: 7.113 ± 1.551
0.961AspPro: 0.961 ± 0.364
0.769AspGln: 0.769 ± 0.444
2.307AspArg: 2.307 ± 0.755
3.268AspSer: 3.268 ± 1.1
3.46AspThr: 3.46 ± 1.025
4.421AspVal: 4.421 ± 0.662
0.192AspTrp: 0.192 ± 0.169
6.151AspTyr: 6.151 ± 0.995
0.0AspXaa: 0.0 ± 0.0
Glu
2.499GluAla: 2.499 ± 0.889
0.192GluCys: 0.192 ± 0.156
3.652GluAsp: 3.652 ± 1.056
5.19GluGlu: 5.19 ± 1.837
4.037GluPhe: 4.037 ± 1.255
1.346GluGly: 1.346 ± 0.446
1.346GluHis: 1.346 ± 0.627
5.19GluIle: 5.19 ± 1.224
4.421GluLys: 4.421 ± 0.846
4.806GluLeu: 4.806 ± 1.299
3.46GluMet: 3.46 ± 0.71
4.037GluAsn: 4.037 ± 1.04
1.73GluPro: 1.73 ± 0.439
2.884GluGln: 2.884 ± 0.586
2.691GluArg: 2.691 ± 0.705
4.806GluSer: 4.806 ± 1.404
3.268GluThr: 3.268 ± 0.916
3.076GluVal: 3.076 ± 0.617
0.577GluTrp: 0.577 ± 0.281
4.229GluTyr: 4.229 ± 1.031
0.0GluXaa: 0.0 ± 0.0
Phe
1.538PheAla: 1.538 ± 0.393
0.384PheCys: 0.384 ± 0.316
4.614PheAsp: 4.614 ± 1.364
3.076PheGlu: 3.076 ± 0.696
1.73PhePhe: 1.73 ± 0.479
2.307PheGly: 2.307 ± 0.61
1.153PheHis: 1.153 ± 0.348
3.46PheIle: 3.46 ± 0.652
4.614PheLys: 4.614 ± 1.004
5.767PheLeu: 5.767 ± 1.032
1.153PheMet: 1.153 ± 0.407
4.421PheAsn: 4.421 ± 1.017
1.922PhePro: 1.922 ± 0.515
2.691PheGln: 2.691 ± 0.708
1.538PheArg: 1.538 ± 0.463
4.229PheSer: 4.229 ± 0.958
4.614PheThr: 4.614 ± 1.052
3.076PheVal: 3.076 ± 0.661
0.384PheTrp: 0.384 ± 0.205
3.845PheTyr: 3.845 ± 0.855
0.0PheXaa: 0.0 ± 0.0
Gly
1.73GlyAla: 1.73 ± 0.612
0.384GlyCys: 0.384 ± 0.284
3.268GlyAsp: 3.268 ± 0.913
1.346GlyGlu: 1.346 ± 0.783
3.652GlyPhe: 3.652 ± 0.995
3.845GlyGly: 3.845 ± 1.146
0.769GlyHis: 0.769 ± 0.365
3.076GlyIle: 3.076 ± 0.613
4.229GlyLys: 4.229 ± 0.926
3.46GlyLeu: 3.46 ± 0.856
1.73GlyMet: 1.73 ± 0.702
5.19GlyAsn: 5.19 ± 1.141
0.0GlyPro: 0.0 ± 0.0
2.307GlyGln: 2.307 ± 0.839
1.538GlyArg: 1.538 ± 0.513
3.46GlySer: 3.46 ± 1.034
2.884GlyThr: 2.884 ± 0.895
3.46GlyVal: 3.46 ± 0.921
0.961GlyTrp: 0.961 ± 0.414
2.691GlyTyr: 2.691 ± 0.914
0.0GlyXaa: 0.0 ± 0.0
His
0.192HisAla: 0.192 ± 0.156
0.0HisCys: 0.0 ± 0.0
1.153HisAsp: 1.153 ± 0.374
1.153HisGlu: 1.153 ± 0.505
2.307HisPhe: 2.307 ± 0.512
0.577HisGly: 0.577 ± 0.274
0.384HisHis: 0.384 ± 0.284
2.307HisIle: 2.307 ± 0.65
1.538HisLys: 1.538 ± 0.603
0.961HisLeu: 0.961 ± 0.431
0.577HisMet: 0.577 ± 0.289
2.115HisAsn: 2.115 ± 0.5
0.192HisPro: 0.192 ± 0.156
0.577HisGln: 0.577 ± 0.336
0.384HisArg: 0.384 ± 0.238
0.961HisSer: 0.961 ± 0.425
0.961HisThr: 0.961 ± 0.405
1.538HisVal: 1.538 ± 0.784
0.192HisTrp: 0.192 ± 0.183
1.73HisTyr: 1.73 ± 0.546
0.0HisXaa: 0.0 ± 0.0
Ile
3.652IleAla: 3.652 ± 0.689
0.192IleCys: 0.192 ± 0.172
9.035IleAsp: 9.035 ± 1.342
4.037IleGlu: 4.037 ± 1.119
2.115IlePhe: 2.115 ± 0.751
3.652IleGly: 3.652 ± 1.138
1.73IleHis: 1.73 ± 0.46
5.383IleIle: 5.383 ± 1.622
6.151IleLys: 6.151 ± 1.42
5.959IleLeu: 5.959 ± 1.104
1.153IleMet: 1.153 ± 0.413
6.728IleAsn: 6.728 ± 1.15
1.922IlePro: 1.922 ± 0.66
1.346IleGln: 1.346 ± 0.427
2.307IleArg: 2.307 ± 0.564
3.652IleSer: 3.652 ± 0.743
4.421IleThr: 4.421 ± 1.065
3.46IleVal: 3.46 ± 0.697
0.577IleTrp: 0.577 ± 0.394
5.383IleTyr: 5.383 ± 1.362
0.0IleXaa: 0.0 ± 0.0
Lys
2.499LysAla: 2.499 ± 0.9
0.769LysCys: 0.769 ± 0.373
4.421LysAsp: 4.421 ± 0.872
7.113LysGlu: 7.113 ± 1.517
4.229LysPhe: 4.229 ± 0.762
3.845LysGly: 3.845 ± 1.091
2.307LysHis: 2.307 ± 0.652
5.575LysIle: 5.575 ± 1.071
6.536LysLys: 6.536 ± 1.153
6.92LysLeu: 6.92 ± 1.079
2.499LysMet: 2.499 ± 0.502
4.998LysAsn: 4.998 ± 0.866
3.076LysPro: 3.076 ± 0.899
2.499LysGln: 2.499 ± 0.677
3.845LysArg: 3.845 ± 0.647
6.536LysSer: 6.536 ± 1.053
4.229LysThr: 4.229 ± 0.729
3.46LysVal: 3.46 ± 0.599
0.769LysTrp: 0.769 ± 0.294
4.229LysTyr: 4.229 ± 0.901
0.0LysXaa: 0.0 ± 0.0
Leu
4.421LeuAla: 4.421 ± 0.744
0.192LeuCys: 0.192 ± 0.224
4.806LeuAsp: 4.806 ± 0.45
4.229LeuGlu: 4.229 ± 1.029
4.806LeuPhe: 4.806 ± 0.557
4.037LeuGly: 4.037 ± 0.829
1.73LeuHis: 1.73 ± 0.672
5.383LeuIle: 5.383 ± 0.957
6.536LeuLys: 6.536 ± 1.074
6.92LeuLeu: 6.92 ± 1.199
2.307LeuMet: 2.307 ± 0.848
7.113LeuAsn: 7.113 ± 1.397
1.922LeuPro: 1.922 ± 0.564
3.845LeuGln: 3.845 ± 0.978
3.268LeuArg: 3.268 ± 0.792
6.92LeuSer: 6.92 ± 1.418
5.19LeuThr: 5.19 ± 1.076
2.884LeuVal: 2.884 ± 0.887
0.384LeuTrp: 0.384 ± 0.294
4.998LeuTyr: 4.998 ± 1.26
0.0LeuXaa: 0.0 ± 0.0
Met
0.961MetAla: 0.961 ± 0.511
0.192MetCys: 0.192 ± 0.156
1.153MetAsp: 1.153 ± 0.505
1.73MetGlu: 1.73 ± 0.601
1.73MetPhe: 1.73 ± 0.669
0.769MetGly: 0.769 ± 0.351
0.384MetHis: 0.384 ± 0.225
1.538MetIle: 1.538 ± 0.618
3.268MetLys: 3.268 ± 0.592
2.691MetLeu: 2.691 ± 1.462
0.384MetMet: 0.384 ± 0.255
1.922MetAsn: 1.922 ± 0.677
0.192MetPro: 0.192 ± 0.172
1.922MetGln: 1.922 ± 0.528
1.538MetArg: 1.538 ± 0.389
1.153MetSer: 1.153 ± 0.521
3.268MetThr: 3.268 ± 0.752
1.538MetVal: 1.538 ± 0.709
0.192MetTrp: 0.192 ± 0.192
1.153MetTyr: 1.153 ± 0.413
0.0MetXaa: 0.0 ± 0.0
Asn
4.229AsnAla: 4.229 ± 0.934
0.769AsnCys: 0.769 ± 0.395
6.92AsnAsp: 6.92 ± 0.822
7.497AsnGlu: 7.497 ± 1.214
4.421AsnPhe: 4.421 ± 1.196
6.344AsnGly: 6.344 ± 1.104
2.115AsnHis: 2.115 ± 0.614
4.998AsnIle: 4.998 ± 0.957
5.959AsnLys: 5.959 ± 0.986
4.806AsnLeu: 4.806 ± 1.123
2.307AsnMet: 2.307 ± 0.65
5.383AsnAsn: 5.383 ± 0.962
2.307AsnPro: 2.307 ± 0.623
3.268AsnGln: 3.268 ± 0.941
1.922AsnArg: 1.922 ± 0.611
5.19AsnSer: 5.19 ± 1.127
4.806AsnThr: 4.806 ± 1.003
4.806AsnVal: 4.806 ± 0.737
1.153AsnTrp: 1.153 ± 0.556
4.037AsnTyr: 4.037 ± 0.667
0.0AsnXaa: 0.0 ± 0.0
Pro
0.577ProAla: 0.577 ± 0.303
0.192ProCys: 0.192 ± 0.156
0.961ProAsp: 0.961 ± 0.502
2.115ProGlu: 2.115 ± 0.818
1.73ProPhe: 1.73 ± 0.513
0.192ProGly: 0.192 ± 0.171
0.192ProHis: 0.192 ± 0.221
2.307ProIle: 2.307 ± 0.699
2.691ProLys: 2.691 ± 0.648
1.73ProLeu: 1.73 ± 0.536
0.961ProMet: 0.961 ± 0.525
1.538ProAsn: 1.538 ± 0.526
0.769ProPro: 0.769 ± 0.385
1.153ProGln: 1.153 ± 0.483
0.384ProArg: 0.384 ± 0.238
2.115ProSer: 2.115 ± 0.61
2.115ProThr: 2.115 ± 0.78
1.346ProVal: 1.346 ± 0.416
0.384ProTrp: 0.384 ± 0.224
2.115ProTyr: 2.115 ± 0.52
0.0ProXaa: 0.0 ± 0.0
Gln
2.499GlnAla: 2.499 ± 0.687
0.769GlnCys: 0.769 ± 0.527
1.922GlnAsp: 1.922 ± 0.705
1.538GlnGlu: 1.538 ± 0.545
1.346GlnPhe: 1.346 ± 0.598
1.922GlnGly: 1.922 ± 0.507
0.192GlnHis: 0.192 ± 0.169
2.499GlnIle: 2.499 ± 0.596
2.499GlnLys: 2.499 ± 0.836
3.46GlnLeu: 3.46 ± 0.635
1.346GlnMet: 1.346 ± 0.4
3.845GlnAsn: 3.845 ± 0.918
1.153GlnPro: 1.153 ± 0.48
2.307GlnGln: 2.307 ± 0.953
0.577GlnArg: 0.577 ± 0.41
2.499GlnSer: 2.499 ± 0.76
1.346GlnThr: 1.346 ± 0.514
1.922GlnVal: 1.922 ± 0.649
0.961GlnTrp: 0.961 ± 0.485
2.884GlnTyr: 2.884 ± 0.52
0.0GlnXaa: 0.0 ± 0.0
Arg
2.115ArgAla: 2.115 ± 0.89
0.0ArgCys: 0.0 ± 0.0
2.499ArgAsp: 2.499 ± 1.123
2.884ArgGlu: 2.884 ± 0.774
3.076ArgPhe: 3.076 ± 0.815
1.538ArgGly: 1.538 ± 0.578
0.961ArgHis: 0.961 ± 0.51
0.961ArgIle: 0.961 ± 0.52
2.115ArgLys: 2.115 ± 0.383
1.153ArgLeu: 1.153 ± 0.413
1.153ArgMet: 1.153 ± 0.366
3.076ArgAsn: 3.076 ± 0.841
0.769ArgPro: 0.769 ± 0.278
1.73ArgGln: 1.73 ± 0.409
0.961ArgArg: 0.961 ± 0.356
1.538ArgSer: 1.538 ± 0.585
1.153ArgThr: 1.153 ± 0.481
2.691ArgVal: 2.691 ± 0.576
0.384ArgTrp: 0.384 ± 0.261
1.922ArgTyr: 1.922 ± 0.372
0.0ArgXaa: 0.0 ± 0.0
Ser
2.691SerAla: 2.691 ± 0.681
0.0SerCys: 0.0 ± 0.0
4.229SerAsp: 4.229 ± 0.75
4.806SerGlu: 4.806 ± 0.822
3.845SerPhe: 3.845 ± 0.934
3.46SerGly: 3.46 ± 1.421
0.577SerHis: 0.577 ± 0.277
3.845SerIle: 3.845 ± 0.591
6.728SerLys: 6.728 ± 1.273
4.998SerLeu: 4.998 ± 0.87
1.538SerMet: 1.538 ± 0.493
6.536SerAsn: 6.536 ± 1.657
1.922SerPro: 1.922 ± 0.584
2.691SerGln: 2.691 ± 0.484
2.499SerArg: 2.499 ± 0.505
4.806SerSer: 4.806 ± 1.238
2.499SerThr: 2.499 ± 1.196
3.076SerVal: 3.076 ± 0.73
0.384SerTrp: 0.384 ± 0.244
2.691SerTyr: 2.691 ± 0.739
0.0SerXaa: 0.0 ± 0.0
Thr
1.346ThrAla: 1.346 ± 0.423
0.577ThrCys: 0.577 ± 0.409
4.421ThrAsp: 4.421 ± 0.652
4.229ThrGlu: 4.229 ± 1.563
4.229ThrPhe: 4.229 ± 0.614
3.268ThrGly: 3.268 ± 0.619
1.346ThrHis: 1.346 ± 0.562
5.383ThrIle: 5.383 ± 0.65
4.806ThrLys: 4.806 ± 0.975
5.575ThrLeu: 5.575 ± 0.859
1.153ThrMet: 1.153 ± 0.494
3.652ThrAsn: 3.652 ± 0.604
1.538ThrPro: 1.538 ± 0.715
1.73ThrGln: 1.73 ± 0.545
1.538ThrArg: 1.538 ± 0.442
4.421ThrSer: 4.421 ± 0.867
4.037ThrThr: 4.037 ± 1.132
2.499ThrVal: 2.499 ± 1.014
0.769ThrTrp: 0.769 ± 0.418
2.691ThrTyr: 2.691 ± 0.62
0.0ThrXaa: 0.0 ± 0.0
Val
2.307ValAla: 2.307 ± 0.709
0.769ValCys: 0.769 ± 0.334
3.46ValAsp: 3.46 ± 0.626
1.922ValGlu: 1.922 ± 0.619
3.268ValPhe: 3.268 ± 0.946
1.153ValGly: 1.153 ± 0.556
0.961ValHis: 0.961 ± 0.44
4.037ValIle: 4.037 ± 1.189
4.037ValLys: 4.037 ± 0.573
4.229ValLeu: 4.229 ± 0.649
1.346ValMet: 1.346 ± 0.452
5.19ValAsn: 5.19 ± 1.057
2.307ValPro: 2.307 ± 0.843
2.307ValGln: 2.307 ± 0.792
2.499ValArg: 2.499 ± 0.726
2.884ValSer: 2.884 ± 0.687
3.652ValThr: 3.652 ± 0.743
3.268ValVal: 3.268 ± 0.83
0.577ValTrp: 0.577 ± 0.42
3.076ValTyr: 3.076 ± 0.783
0.0ValXaa: 0.0 ± 0.0
Trp
0.384TrpAla: 0.384 ± 0.341
0.0TrpCys: 0.0 ± 0.0
1.538TrpAsp: 1.538 ± 0.479
0.192TrpGlu: 0.192 ± 0.192
0.384TrpPhe: 0.384 ± 0.251
0.577TrpGly: 0.577 ± 0.447
0.384TrpHis: 0.384 ± 0.338
1.153TrpIle: 1.153 ± 0.493
0.577TrpLys: 0.577 ± 0.484
1.73TrpLeu: 1.73 ± 0.729
0.384TrpMet: 0.384 ± 0.274
0.769TrpAsn: 0.769 ± 0.46
0.0TrpPro: 0.0 ± 0.0
0.192TrpGln: 0.192 ± 0.192
0.0TrpArg: 0.0 ± 0.0
0.384TrpSer: 0.384 ± 0.264
0.577TrpThr: 0.577 ± 0.276
0.192TrpVal: 0.192 ± 0.172
0.0TrpTrp: 0.0 ± 0.0
0.384TrpTyr: 0.384 ± 0.341
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.115TyrAla: 2.115 ± 0.693
0.384TyrCys: 0.384 ± 0.271
4.998TyrAsp: 4.998 ± 0.948
2.884TyrGlu: 2.884 ± 0.888
2.884TyrPhe: 2.884 ± 0.89
3.845TyrGly: 3.845 ± 0.907
1.538TyrHis: 1.538 ± 0.639
5.19TyrIle: 5.19 ± 1.281
4.037TyrLys: 4.037 ± 0.694
5.767TyrLeu: 5.767 ± 1.276
1.153TyrMet: 1.153 ± 0.646
7.113TyrAsn: 7.113 ± 1.155
1.73TyrPro: 1.73 ± 0.399
2.307TyrGln: 2.307 ± 0.92
0.769TyrArg: 0.769 ± 0.35
3.46TyrSer: 3.46 ± 0.731
3.845TyrThr: 3.845 ± 0.629
3.845TyrVal: 3.845 ± 0.532
0.577TyrTrp: 0.577 ± 0.279
4.614TyrTyr: 4.614 ± 1.058
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (5203 amino acids)

Note: The errors were estimated by bootstrapping (100 resamples) at the protein level.
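The note above can be sketched as follows, assuming that "at the protein level" means whole proteins are resampled with replacement and that the reported error is the standard deviation of the resampled estimates. Both points are assumptions, as are the names `pair_permille` and `bootstrap_error`; the database's actual procedure may differ in detail.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences (hypothetical helper)."""
    count = 0
    total = 0
    for seq in proteins:
        # Overlapping windows of length 2 within a single protein.
        windows = [seq[i:i + 2] for i in range(len(seq) - 1)]
        count += windows.count(pair)
        total += len(windows)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by protein-level bootstrap.

    Assumption: the error is the standard deviation across resamples.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)
```

A dipeptide that never occurs has a frequency of 0 in every resample, so its estimated error is also 0, which is consistent with the "0.0 ± 0.0" entries in the table.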

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski