Amino acid dipepetide frequency for Streptococcus phage IPP15

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.412AlaAla: 2.412 ± 0.753
0.25AlaCys: 0.25 ± 0.18
5.989AlaAsp: 5.989 ± 0.694
6.738AlaGlu: 6.738 ± 0.677
2.662AlaPhe: 2.662 ± 0.856
4.908AlaGly: 4.908 ± 1.114
0.749AlaHis: 0.749 ± 0.23
4.242AlaIle: 4.242 ± 0.744
6.405AlaLys: 6.405 ± 0.886
5.823AlaLeu: 5.823 ± 1.052
2.412AlaMet: 2.412 ± 0.435
4.076AlaAsn: 4.076 ± 0.751
1.913AlaPro: 1.913 ± 0.451
2.579AlaGln: 2.579 ± 0.512
2.579AlaArg: 2.579 ± 0.562
3.244AlaSer: 3.244 ± 0.725
4.492AlaThr: 4.492 ± 0.638
4.991AlaVal: 4.991 ± 0.753
1.331AlaTrp: 1.331 ± 0.433
1.913AlaTyr: 1.913 ± 0.349
0.0AlaXaa: 0.0 ± 0.0
Cys
0.333CysAla: 0.333 ± 0.187
0.166CysCys: 0.166 ± 0.121
0.333CysAsp: 0.333 ± 0.225
0.499CysGlu: 0.499 ± 0.221
0.333CysPhe: 0.333 ± 0.172
0.499CysGly: 0.499 ± 0.287
0.166CysHis: 0.166 ± 0.136
0.416CysIle: 0.416 ± 0.235
0.749CysLys: 0.749 ± 0.326
0.333CysLeu: 0.333 ± 0.173
0.083CysMet: 0.083 ± 0.085
0.166CysAsn: 0.166 ± 0.162
0.333CysPro: 0.333 ± 0.205
0.25CysGln: 0.25 ± 0.126
0.333CysArg: 0.333 ± 0.13
0.166CysSer: 0.166 ± 0.121
0.083CysThr: 0.083 ± 0.082
0.25CysVal: 0.25 ± 0.152
0.166CysTrp: 0.166 ± 0.126
0.333CysTyr: 0.333 ± 0.15
0.0CysXaa: 0.0 ± 0.0
Asp
4.575AspAla: 4.575 ± 0.629
0.499AspCys: 0.499 ± 0.195
3.244AspAsp: 3.244 ± 0.674
4.492AspGlu: 4.492 ± 0.971
3.41AspPhe: 3.41 ± 0.562
4.824AspGly: 4.824 ± 0.669
0.416AspHis: 0.416 ± 0.198
5.074AspIle: 5.074 ± 0.583
5.49AspLys: 5.49 ± 0.847
4.741AspLeu: 4.741 ± 0.602
1.414AspMet: 1.414 ± 0.289
3.909AspAsn: 3.909 ± 0.558
1.165AspPro: 1.165 ± 0.322
1.913AspGln: 1.913 ± 0.43
2.579AspArg: 2.579 ± 0.46
3.161AspSer: 3.161 ± 0.426
3.66AspThr: 3.66 ± 0.474
4.076AspVal: 4.076 ± 0.524
0.998AspTrp: 0.998 ± 0.285
3.494AspTyr: 3.494 ± 0.62
0.0AspXaa: 0.0 ± 0.0
Glu
5.906GluAla: 5.906 ± 1.012
0.25GluCys: 0.25 ± 0.131
4.575GluAsp: 4.575 ± 0.938
5.989GluGlu: 5.989 ± 0.943
3.577GluPhe: 3.577 ± 0.612
3.909GluGly: 3.909 ± 0.493
1.081GluHis: 1.081 ± 0.321
6.904GluIle: 6.904 ± 0.867
5.823GluLys: 5.823 ± 0.891
9.15GluLeu: 9.15 ± 0.845
1.58GluMet: 1.58 ± 0.482
4.409GluAsn: 4.409 ± 0.564
2.08GluPro: 2.08 ± 0.59
3.743GluGln: 3.743 ± 0.777
3.66GluArg: 3.66 ± 0.555
3.66GluSer: 3.66 ± 0.479
3.494GluThr: 3.494 ± 0.524
5.074GluVal: 5.074 ± 0.656
0.665GluTrp: 0.665 ± 0.239
2.995GluTyr: 2.995 ± 0.573
0.0GluXaa: 0.0 ± 0.0
Phe
2.745PheAla: 2.745 ± 0.546
0.416PheCys: 0.416 ± 0.246
4.575PheAsp: 4.575 ± 0.578
3.494PheGlu: 3.494 ± 0.508
1.165PhePhe: 1.165 ± 0.256
2.412PheGly: 2.412 ± 0.603
0.582PheHis: 0.582 ± 0.271
2.745PheIle: 2.745 ± 0.415
3.244PheLys: 3.244 ± 0.594
2.08PheLeu: 2.08 ± 0.364
0.998PheMet: 0.998 ± 0.38
3.244PheAsn: 3.244 ± 0.465
0.998PhePro: 0.998 ± 0.325
1.664PheGln: 1.664 ± 0.373
1.165PheArg: 1.165 ± 0.29
3.078PheSer: 3.078 ± 0.635
2.911PheThr: 2.911 ± 0.504
1.747PheVal: 1.747 ± 0.4
0.499PheTrp: 0.499 ± 0.196
1.747PheTyr: 1.747 ± 0.393
0.0PheXaa: 0.0 ± 0.0
Gly
2.911GlyAla: 2.911 ± 0.451
0.0GlyCys: 0.0 ± 0.0
3.66GlyAsp: 3.66 ± 0.527
4.575GlyGlu: 4.575 ± 0.703
2.412GlyPhe: 2.412 ± 0.672
4.991GlyGly: 4.991 ± 1.201
1.081GlyHis: 1.081 ± 0.231
4.076GlyIle: 4.076 ± 0.762
5.24GlyLys: 5.24 ± 0.63
6.072GlyLeu: 6.072 ± 0.965
1.913GlyMet: 1.913 ± 0.401
4.409GlyAsn: 4.409 ± 0.505
0.582GlyPro: 0.582 ± 0.258
3.244GlyGln: 3.244 ± 0.517
4.325GlyArg: 4.325 ± 0.412
4.409GlySer: 4.409 ± 0.73
3.161GlyThr: 3.161 ± 0.517
4.409GlyVal: 4.409 ± 0.617
0.998GlyTrp: 0.998 ± 0.463
2.995GlyTyr: 2.995 ± 0.552
0.0GlyXaa: 0.0 ± 0.0
His
0.665HisAla: 0.665 ± 0.294
0.083HisCys: 0.083 ± 0.117
0.416HisAsp: 0.416 ± 0.205
0.915HisGlu: 0.915 ± 0.214
0.832HisPhe: 0.832 ± 0.268
1.081HisGly: 1.081 ± 0.328
0.083HisHis: 0.083 ± 0.079
0.582HisIle: 0.582 ± 0.222
0.915HisLys: 0.915 ± 0.239
0.998HisLeu: 0.998 ± 0.271
0.166HisMet: 0.166 ± 0.111
0.998HisAsn: 0.998 ± 0.341
0.582HisPro: 0.582 ± 0.221
0.499HisGln: 0.499 ± 0.248
0.582HisArg: 0.582 ± 0.263
1.331HisSer: 1.331 ± 0.463
0.998HisThr: 0.998 ± 0.301
1.081HisVal: 1.081 ± 0.286
0.166HisTrp: 0.166 ± 0.121
0.749HisTyr: 0.749 ± 0.253
0.0HisXaa: 0.0 ± 0.0
Ile
5.49IleAla: 5.49 ± 0.951
0.25IleCys: 0.25 ± 0.115
3.494IleAsp: 3.494 ± 0.577
6.571IleGlu: 6.571 ± 0.901
2.662IlePhe: 2.662 ± 0.507
5.074IleGly: 5.074 ± 0.967
0.499IleHis: 0.499 ± 0.261
3.41IleIle: 3.41 ± 0.503
6.322IleLys: 6.322 ± 0.692
4.242IleLeu: 4.242 ± 0.682
1.331IleMet: 1.331 ± 0.325
4.242IleAsn: 4.242 ± 0.645
1.58IlePro: 1.58 ± 0.436
2.329IleGln: 2.329 ± 0.392
3.078IleArg: 3.078 ± 0.632
5.074IleSer: 5.074 ± 0.857
4.076IleThr: 4.076 ± 0.591
2.995IleVal: 2.995 ± 0.523
0.832IleTrp: 0.832 ± 0.297
2.163IleTyr: 2.163 ± 0.658
0.0IleXaa: 0.0 ± 0.0
Lys
4.741LysAla: 4.741 ± 0.74
0.416LysCys: 0.416 ± 0.206
5.324LysAsp: 5.324 ± 0.637
7.237LysGlu: 7.237 ± 0.855
3.161LysPhe: 3.161 ± 0.533
4.325LysGly: 4.325 ± 0.679
0.915LysHis: 0.915 ± 0.221
5.573LysIle: 5.573 ± 1.018
5.989LysLys: 5.989 ± 0.991
6.987LysLeu: 6.987 ± 0.747
1.83LysMet: 1.83 ± 0.448
4.492LysAsn: 4.492 ± 0.459
2.662LysPro: 2.662 ± 0.569
3.66LysGln: 3.66 ± 0.681
4.159LysArg: 4.159 ± 0.554
4.492LysSer: 4.492 ± 0.622
4.908LysThr: 4.908 ± 0.496
7.07LysVal: 7.07 ± 0.844
0.832LysTrp: 0.832 ± 0.27
3.494LysTyr: 3.494 ± 0.714
0.0LysXaa: 0.0 ± 0.0
Leu
5.989LeuAla: 5.989 ± 1.007
0.499LeuCys: 0.499 ± 0.301
5.739LeuAsp: 5.739 ± 0.662
7.237LeuGlu: 7.237 ± 0.874
2.995LeuPhe: 2.995 ± 0.528
5.157LeuGly: 5.157 ± 1.292
1.248LeuHis: 1.248 ± 0.354
3.826LeuIle: 3.826 ± 0.652
7.32LeuLys: 7.32 ± 0.764
6.821LeuLeu: 6.821 ± 1.107
1.913LeuMet: 1.913 ± 0.362
4.409LeuAsn: 4.409 ± 0.655
2.495LeuPro: 2.495 ± 0.563
3.327LeuGln: 3.327 ± 0.645
3.494LeuArg: 3.494 ± 0.6
5.24LeuSer: 5.24 ± 0.799
4.991LeuThr: 4.991 ± 0.688
3.66LeuVal: 3.66 ± 0.518
0.998LeuTrp: 0.998 ± 0.3
1.747LeuTyr: 1.747 ± 0.252
0.0LeuXaa: 0.0 ± 0.0
Met
1.913MetAla: 1.913 ± 0.479
0.083MetCys: 0.083 ± 0.095
1.248MetAsp: 1.248 ± 0.279
1.331MetGlu: 1.331 ± 0.335
0.915MetPhe: 0.915 ± 0.25
1.248MetGly: 1.248 ± 0.428
0.333MetHis: 0.333 ± 0.196
1.747MetIle: 1.747 ± 0.403
2.246MetLys: 2.246 ± 0.502
1.331MetLeu: 1.331 ± 0.282
0.416MetMet: 0.416 ± 0.23
1.664MetAsn: 1.664 ± 0.389
0.749MetPro: 0.749 ± 0.234
0.665MetGln: 0.665 ± 0.302
1.165MetArg: 1.165 ± 0.331
1.248MetSer: 1.248 ± 0.36
1.58MetThr: 1.58 ± 0.413
1.331MetVal: 1.331 ± 0.325
0.083MetTrp: 0.083 ± 0.09
0.665MetTyr: 0.665 ± 0.177
0.0MetXaa: 0.0 ± 0.0
Asn
4.991AsnAla: 4.991 ± 1.015
0.333AsnCys: 0.333 ± 0.14
3.244AsnAsp: 3.244 ± 0.519
3.577AsnGlu: 3.577 ± 0.668
2.163AsnPhe: 2.163 ± 0.504
4.908AsnGly: 4.908 ± 0.629
1.248AsnHis: 1.248 ± 0.331
4.242AsnIle: 4.242 ± 0.452
4.908AsnLys: 4.908 ± 0.637
4.242AsnLeu: 4.242 ± 0.632
1.331AsnMet: 1.331 ± 0.37
2.995AsnAsn: 2.995 ± 0.908
2.246AsnPro: 2.246 ± 0.525
3.244AsnGln: 3.244 ± 0.614
2.911AsnArg: 2.911 ± 0.663
3.494AsnSer: 3.494 ± 0.514
2.662AsnThr: 2.662 ± 0.515
2.995AsnVal: 2.995 ± 0.498
0.998AsnTrp: 0.998 ± 0.228
2.412AsnTyr: 2.412 ± 0.478
0.0AsnXaa: 0.0 ± 0.0
Pro
2.662ProAla: 2.662 ± 0.473
0.333ProCys: 0.333 ± 0.187
1.996ProAsp: 1.996 ± 0.493
2.745ProGlu: 2.745 ± 0.404
1.248ProPhe: 1.248 ± 0.474
1.081ProGly: 1.081 ± 0.345
0.416ProHis: 0.416 ± 0.231
1.497ProIle: 1.497 ± 0.364
2.662ProLys: 2.662 ± 0.48
1.664ProLeu: 1.664 ± 0.443
0.333ProMet: 0.333 ± 0.189
1.248ProAsn: 1.248 ± 0.443
0.749ProPro: 0.749 ± 0.327
1.331ProGln: 1.331 ± 0.337
1.081ProArg: 1.081 ± 0.326
1.248ProSer: 1.248 ± 0.34
1.165ProThr: 1.165 ± 0.442
1.996ProVal: 1.996 ± 0.335
0.499ProTrp: 0.499 ± 0.188
1.331ProTyr: 1.331 ± 0.464
0.0ProXaa: 0.0 ± 0.0
Gln
3.494GlnAla: 3.494 ± 0.574
0.416GlnCys: 0.416 ± 0.19
2.412GlnAsp: 2.412 ± 0.324
3.66GlnGlu: 3.66 ± 0.765
1.331GlnPhe: 1.331 ± 0.364
2.246GlnGly: 2.246 ± 0.705
0.416GlnHis: 0.416 ± 0.177
3.41GlnIle: 3.41 ± 0.485
3.41GlnLys: 3.41 ± 0.527
2.745GlnLeu: 2.745 ± 0.434
0.749GlnMet: 0.749 ± 0.212
2.412GlnAsn: 2.412 ± 0.503
0.998GlnPro: 0.998 ± 0.338
2.246GlnGln: 2.246 ± 0.604
2.329GlnArg: 2.329 ± 0.415
3.078GlnSer: 3.078 ± 0.476
2.662GlnThr: 2.662 ± 0.523
3.66GlnVal: 3.66 ± 0.513
0.665GlnTrp: 0.665 ± 0.195
0.832GlnTyr: 0.832 ± 0.324
0.0GlnXaa: 0.0 ± 0.0
Arg
3.161ArgAla: 3.161 ± 0.46
0.25ArgCys: 0.25 ± 0.19
2.495ArgAsp: 2.495 ± 0.406
2.995ArgGlu: 2.995 ± 0.516
1.664ArgPhe: 1.664 ± 0.412
2.329ArgGly: 2.329 ± 0.485
0.832ArgHis: 0.832 ± 0.284
2.911ArgIle: 2.911 ± 0.653
3.66ArgLys: 3.66 ± 0.696
4.658ArgLeu: 4.658 ± 0.638
1.996ArgMet: 1.996 ± 0.408
2.329ArgAsn: 2.329 ± 0.805
1.331ArgPro: 1.331 ± 0.304
2.579ArgGln: 2.579 ± 0.506
1.913ArgArg: 1.913 ± 0.54
1.996ArgSer: 1.996 ± 0.513
2.911ArgThr: 2.911 ± 0.531
2.662ArgVal: 2.662 ± 0.441
0.499ArgTrp: 0.499 ± 0.201
2.246ArgTyr: 2.246 ± 0.511
0.0ArgXaa: 0.0 ± 0.0
Ser
4.991SerAla: 4.991 ± 1.049
0.25SerCys: 0.25 ± 0.117
3.244SerAsp: 3.244 ± 0.509
3.909SerGlu: 3.909 ± 0.67
2.329SerPhe: 2.329 ± 0.432
4.409SerGly: 4.409 ± 0.685
0.915SerHis: 0.915 ± 0.369
3.41SerIle: 3.41 ± 0.755
4.658SerLys: 4.658 ± 0.782
4.658SerLeu: 4.658 ± 0.715
1.331SerMet: 1.331 ± 0.422
3.743SerAsn: 3.743 ± 0.559
1.414SerPro: 1.414 ± 0.328
2.495SerGln: 2.495 ± 0.611
3.161SerArg: 3.161 ± 0.637
3.244SerSer: 3.244 ± 0.523
4.409SerThr: 4.409 ± 0.492
3.494SerVal: 3.494 ± 0.901
1.331SerTrp: 1.331 ± 0.422
2.329SerTyr: 2.329 ± 0.517
0.0SerXaa: 0.0 ± 0.0
Thr
4.325ThrAla: 4.325 ± 1.012
0.25ThrCys: 0.25 ± 0.146
4.159ThrAsp: 4.159 ± 0.626
3.577ThrGlu: 3.577 ± 0.386
3.327ThrPhe: 3.327 ± 0.699
4.908ThrGly: 4.908 ± 0.657
0.915ThrHis: 0.915 ± 0.319
4.908ThrIle: 4.908 ± 0.678
3.993ThrLys: 3.993 ± 0.618
4.741ThrLeu: 4.741 ± 0.613
0.665ThrMet: 0.665 ± 0.205
3.41ThrAsn: 3.41 ± 0.464
1.165ThrPro: 1.165 ± 0.404
2.329ThrGln: 2.329 ± 0.435
1.913ThrArg: 1.913 ± 0.449
3.826ThrSer: 3.826 ± 0.613
4.325ThrThr: 4.325 ± 0.923
4.741ThrVal: 4.741 ± 0.806
0.665ThrTrp: 0.665 ± 0.254
2.246ThrTyr: 2.246 ± 0.561
0.0ThrXaa: 0.0 ± 0.0
Val
5.24ValAla: 5.24 ± 0.65
0.499ValCys: 0.499 ± 0.19
3.577ValAsp: 3.577 ± 0.547
5.324ValGlu: 5.324 ± 0.8
1.996ValPhe: 1.996 ± 0.356
4.159ValGly: 4.159 ± 0.674
0.915ValHis: 0.915 ± 0.305
3.577ValIle: 3.577 ± 0.486
4.409ValLys: 4.409 ± 0.578
4.658ValLeu: 4.658 ± 0.755
0.915ValMet: 0.915 ± 0.291
4.409ValAsn: 4.409 ± 0.798
2.329ValPro: 2.329 ± 0.347
2.163ValGln: 2.163 ± 0.458
2.579ValArg: 2.579 ± 0.407
4.492ValSer: 4.492 ± 0.691
5.24ValThr: 5.24 ± 0.767
4.741ValVal: 4.741 ± 0.772
0.749ValTrp: 0.749 ± 0.204
2.579ValTyr: 2.579 ± 0.616
0.0ValXaa: 0.0 ± 0.0
Trp
1.497TrpAla: 1.497 ± 0.442
0.166TrpCys: 0.166 ± 0.116
0.665TrpAsp: 0.665 ± 0.261
1.081TrpGlu: 1.081 ± 0.333
0.915TrpPhe: 0.915 ± 0.386
0.749TrpGly: 0.749 ± 0.256
0.083TrpHis: 0.083 ± 0.095
0.665TrpIle: 0.665 ± 0.245
1.248TrpLys: 1.248 ± 0.33
0.749TrpLeu: 0.749 ± 0.288
0.166TrpMet: 0.166 ± 0.125
0.915TrpAsn: 0.915 ± 0.294
0.083TrpPro: 0.083 ± 0.079
0.582TrpGln: 0.582 ± 0.241
0.416TrpArg: 0.416 ± 0.175
0.915TrpSer: 0.915 ± 0.263
0.832TrpThr: 0.832 ± 0.266
0.998TrpVal: 0.998 ± 0.241
0.166TrpTrp: 0.166 ± 0.095
0.998TrpTyr: 0.998 ± 0.531
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.664TyrAla: 1.664 ± 0.377
0.582TyrCys: 0.582 ± 0.309
2.745TyrAsp: 2.745 ± 0.535
2.579TyrGlu: 2.579 ± 0.561
2.412TyrPhe: 2.412 ± 0.578
2.246TyrGly: 2.246 ± 0.409
0.832TyrHis: 0.832 ± 0.249
2.495TyrIle: 2.495 ± 0.485
3.66TyrLys: 3.66 ± 0.64
2.412TyrLeu: 2.412 ± 0.476
0.333TyrMet: 0.333 ± 0.219
1.664TyrAsn: 1.664 ± 0.435
1.83TyrPro: 1.83 ± 0.494
2.329TyrGln: 2.329 ± 0.453
1.996TyrArg: 1.996 ± 0.444
2.329TyrSer: 2.329 ± 0.505
1.83TyrThr: 1.83 ± 0.369
2.579TyrVal: 2.579 ± 0.427
0.749TyrTrp: 0.749 ± 0.322
2.08TyrTyr: 2.08 ± 0.701
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12023 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski