Amino acid dipepetide frequency for Streptococcus phage YMC-2011

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.145AlaAla: 3.145 ± 1.072
0.484AlaCys: 0.484 ± 0.25
4.274AlaAsp: 4.274 ± 0.712
4.919AlaGlu: 4.919 ± 0.722
2.984AlaPhe: 2.984 ± 0.556
4.758AlaGly: 4.758 ± 0.93
0.806AlaHis: 0.806 ± 0.266
4.113AlaIle: 4.113 ± 0.674
6.613AlaLys: 6.613 ± 0.768
5.403AlaLeu: 5.403 ± 0.648
2.581AlaMet: 2.581 ± 0.463
3.629AlaAsn: 3.629 ± 0.685
2.097AlaPro: 2.097 ± 0.773
2.5AlaGln: 2.5 ± 0.696
2.339AlaArg: 2.339 ± 0.499
4.274AlaSer: 4.274 ± 0.608
4.919AlaThr: 4.919 ± 0.699
4.194AlaVal: 4.194 ± 0.637
0.968AlaTrp: 0.968 ± 0.232
1.935AlaTyr: 1.935 ± 0.506
0.0AlaXaa: 0.0 ± 0.0
Cys
0.161CysAla: 0.161 ± 0.101
0.081CysCys: 0.081 ± 0.081
0.484CysAsp: 0.484 ± 0.205
0.968CysGlu: 0.968 ± 0.354
0.161CysPhe: 0.161 ± 0.119
0.565CysGly: 0.565 ± 0.339
0.161CysHis: 0.161 ± 0.123
0.242CysIle: 0.242 ± 0.136
0.726CysLys: 0.726 ± 0.273
0.645CysLeu: 0.645 ± 0.214
0.0CysMet: 0.0 ± 0.0
0.403CysAsn: 0.403 ± 0.184
0.323CysPro: 0.323 ± 0.219
0.403CysGln: 0.403 ± 0.21
0.242CysArg: 0.242 ± 0.144
0.323CysSer: 0.323 ± 0.151
0.484CysThr: 0.484 ± 0.177
0.242CysVal: 0.242 ± 0.121
0.0CysTrp: 0.0 ± 0.0
0.403CysTyr: 0.403 ± 0.175
0.0CysXaa: 0.0 ± 0.0
Asp
3.629AspAla: 3.629 ± 0.79
0.323AspCys: 0.323 ± 0.182
3.468AspAsp: 3.468 ± 0.613
4.758AspGlu: 4.758 ± 0.716
3.629AspPhe: 3.629 ± 0.569
6.048AspGly: 6.048 ± 1.029
0.565AspHis: 0.565 ± 0.221
5.403AspIle: 5.403 ± 0.662
5.081AspLys: 5.081 ± 0.556
3.79AspLeu: 3.79 ± 0.602
1.935AspMet: 1.935 ± 0.443
3.871AspAsn: 3.871 ± 0.587
2.016AspPro: 2.016 ± 0.361
1.452AspGln: 1.452 ± 0.334
2.823AspArg: 2.823 ± 0.512
3.871AspSer: 3.871 ± 0.588
4.274AspThr: 4.274 ± 0.566
3.387AspVal: 3.387 ± 0.457
0.806AspTrp: 0.806 ± 0.215
2.661AspTyr: 2.661 ± 0.377
0.0AspXaa: 0.0 ± 0.0
Glu
4.355GluAla: 4.355 ± 0.59
0.403GluCys: 0.403 ± 0.183
2.903GluAsp: 2.903 ± 0.448
4.919GluGlu: 4.919 ± 0.749
2.5GluPhe: 2.5 ± 0.383
3.871GluGly: 3.871 ± 0.611
1.452GluHis: 1.452 ± 0.344
5.565GluIle: 5.565 ± 0.7
4.435GluLys: 4.435 ± 0.567
6.452GluLeu: 6.452 ± 0.683
1.855GluMet: 1.855 ± 0.412
4.113GluAsn: 4.113 ± 0.49
2.097GluPro: 2.097 ± 0.54
3.952GluGln: 3.952 ± 0.624
3.71GluArg: 3.71 ± 0.592
3.387GluSer: 3.387 ± 0.552
4.194GluThr: 4.194 ± 0.525
5.081GluVal: 5.081 ± 0.626
1.21GluTrp: 1.21 ± 0.287
3.952GluTyr: 3.952 ± 0.504
0.0GluXaa: 0.0 ± 0.0
Phe
3.145PheAla: 3.145 ± 0.513
0.565PheCys: 0.565 ± 0.208
3.79PheAsp: 3.79 ± 0.531
2.097PheGlu: 2.097 ± 0.396
2.177PhePhe: 2.177 ± 0.391
3.145PheGly: 3.145 ± 0.526
0.242PheHis: 0.242 ± 0.142
2.339PheIle: 2.339 ± 0.645
3.79PheLys: 3.79 ± 0.552
3.387PheLeu: 3.387 ± 0.506
0.806PheMet: 0.806 ± 0.238
3.226PheAsn: 3.226 ± 0.615
0.645PhePro: 0.645 ± 0.209
0.565PheGln: 0.565 ± 0.174
1.694PheArg: 1.694 ± 0.361
3.629PheSer: 3.629 ± 0.584
2.016PheThr: 2.016 ± 0.321
2.984PheVal: 2.984 ± 0.44
0.565PheTrp: 0.565 ± 0.232
1.452PheTyr: 1.452 ± 0.308
0.0PheXaa: 0.0 ± 0.0
Gly
4.435GlyAla: 4.435 ± 0.634
0.565GlyCys: 0.565 ± 0.246
3.79GlyAsp: 3.79 ± 0.528
3.548GlyGlu: 3.548 ± 0.606
3.871GlyPhe: 3.871 ± 0.424
5.242GlyGly: 5.242 ± 0.979
0.726GlyHis: 0.726 ± 0.247
5.403GlyIle: 5.403 ± 0.75
5.887GlyLys: 5.887 ± 0.713
5.323GlyLeu: 5.323 ± 0.694
2.016GlyMet: 2.016 ± 0.446
4.355GlyAsn: 4.355 ± 0.659
1.048GlyPro: 1.048 ± 0.341
2.419GlyGln: 2.419 ± 0.379
3.629GlyArg: 3.629 ± 0.644
3.79GlySer: 3.79 ± 0.626
4.274GlyThr: 4.274 ± 0.625
4.597GlyVal: 4.597 ± 0.635
1.21GlyTrp: 1.21 ± 0.292
3.387GlyTyr: 3.387 ± 0.526
0.0GlyXaa: 0.0 ± 0.0
His
0.565HisAla: 0.565 ± 0.232
0.0HisCys: 0.0 ± 0.0
1.21HisAsp: 1.21 ± 0.272
1.048HisGlu: 1.048 ± 0.358
0.645HisPhe: 0.645 ± 0.242
0.484HisGly: 0.484 ± 0.201
0.403HisHis: 0.403 ± 0.13
1.129HisIle: 1.129 ± 0.255
1.774HisLys: 1.774 ± 0.52
0.887HisLeu: 0.887 ± 0.272
0.242HisMet: 0.242 ± 0.128
0.887HisAsn: 0.887 ± 0.313
0.565HisPro: 0.565 ± 0.185
0.565HisGln: 0.565 ± 0.215
0.403HisArg: 0.403 ± 0.185
0.726HisSer: 0.726 ± 0.225
0.645HisThr: 0.645 ± 0.201
0.887HisVal: 0.887 ± 0.201
0.081HisTrp: 0.081 ± 0.079
0.726HisTyr: 0.726 ± 0.28
0.0HisXaa: 0.0 ± 0.0
Ile
4.758IleAla: 4.758 ± 0.791
0.645IleCys: 0.645 ± 0.299
5.484IleAsp: 5.484 ± 0.716
4.516IleGlu: 4.516 ± 0.608
2.177IlePhe: 2.177 ± 0.482
3.387IleGly: 3.387 ± 0.462
0.565IleHis: 0.565 ± 0.167
2.5IleIle: 2.5 ± 0.442
4.677IleLys: 4.677 ± 0.627
3.952IleLeu: 3.952 ± 0.704
1.774IleMet: 1.774 ± 0.444
3.71IleAsn: 3.71 ± 0.653
3.629IlePro: 3.629 ± 0.964
2.984IleGln: 2.984 ± 0.372
2.823IleArg: 2.823 ± 0.497
4.677IleSer: 4.677 ± 0.701
4.516IleThr: 4.516 ± 0.507
3.468IleVal: 3.468 ± 0.513
0.726IleTrp: 0.726 ± 0.214
2.016IleTyr: 2.016 ± 0.348
0.0IleXaa: 0.0 ± 0.0
Lys
5.403LysAla: 5.403 ± 0.559
0.403LysCys: 0.403 ± 0.328
4.597LysAsp: 4.597 ± 0.854
5.968LysGlu: 5.968 ± 0.685
3.226LysPhe: 3.226 ± 0.755
6.371LysGly: 6.371 ± 0.726
1.694LysHis: 1.694 ± 0.434
5.323LysIle: 5.323 ± 0.558
6.532LysLys: 6.532 ± 1.136
5.968LysLeu: 5.968 ± 0.552
1.532LysMet: 1.532 ± 0.338
5.645LysAsn: 5.645 ± 0.637
3.145LysPro: 3.145 ± 0.617
4.194LysGln: 4.194 ± 0.713
3.548LysArg: 3.548 ± 0.568
4.032LysSer: 4.032 ± 0.565
6.048LysThr: 6.048 ± 0.776
5.161LysVal: 5.161 ± 0.706
1.129LysTrp: 1.129 ± 0.341
3.871LysTyr: 3.871 ± 0.654
0.0LysXaa: 0.0 ± 0.0
Leu
5.484LeuAla: 5.484 ± 0.682
0.484LeuCys: 0.484 ± 0.174
5.806LeuAsp: 5.806 ± 0.745
6.129LeuGlu: 6.129 ± 0.851
2.339LeuPhe: 2.339 ± 0.323
5.0LeuGly: 5.0 ± 0.881
0.968LeuHis: 0.968 ± 0.291
3.871LeuIle: 3.871 ± 0.622
7.177LeuLys: 7.177 ± 0.8
5.726LeuLeu: 5.726 ± 0.604
2.258LeuMet: 2.258 ± 0.396
4.274LeuAsn: 4.274 ± 0.518
3.306LeuPro: 3.306 ± 0.549
2.823LeuGln: 2.823 ± 0.547
3.306LeuArg: 3.306 ± 0.665
4.274LeuSer: 4.274 ± 0.571
5.323LeuThr: 5.323 ± 0.685
4.032LeuVal: 4.032 ± 0.426
0.968LeuTrp: 0.968 ± 0.221
2.097LeuTyr: 2.097 ± 0.44
0.0LeuXaa: 0.0 ± 0.0
Met
2.339MetAla: 2.339 ± 0.457
0.081MetCys: 0.081 ± 0.095
1.29MetAsp: 1.29 ± 0.353
1.29MetGlu: 1.29 ± 0.361
1.048MetPhe: 1.048 ± 0.265
1.048MetGly: 1.048 ± 0.292
0.081MetHis: 0.081 ± 0.066
1.452MetIle: 1.452 ± 0.363
2.661MetLys: 2.661 ± 0.473
1.774MetLeu: 1.774 ± 0.301
0.645MetMet: 0.645 ± 0.21
0.887MetAsn: 0.887 ± 0.283
0.887MetPro: 0.887 ± 0.245
0.968MetGln: 0.968 ± 0.296
0.968MetArg: 0.968 ± 0.25
1.774MetSer: 1.774 ± 0.372
1.452MetThr: 1.452 ± 0.359
1.774MetVal: 1.774 ± 0.388
0.081MetTrp: 0.081 ± 0.074
0.806MetTyr: 0.806 ± 0.224
0.0MetXaa: 0.0 ± 0.0
Asn
4.758AsnAla: 4.758 ± 0.996
0.323AsnCys: 0.323 ± 0.18
2.823AsnAsp: 2.823 ± 0.377
3.952AsnGlu: 3.952 ± 0.786
2.177AsnPhe: 2.177 ± 0.564
6.048AsnGly: 6.048 ± 0.787
0.484AsnHis: 0.484 ± 0.19
2.661AsnIle: 2.661 ± 0.473
4.113AsnLys: 4.113 ± 0.456
4.677AsnLeu: 4.677 ± 0.564
0.968AsnMet: 0.968 ± 0.275
3.629AsnAsn: 3.629 ± 0.623
2.661AsnPro: 2.661 ± 0.499
2.903AsnGln: 2.903 ± 0.452
2.258AsnArg: 2.258 ± 0.51
2.903AsnSer: 2.903 ± 0.505
2.903AsnThr: 2.903 ± 0.444
4.516AsnVal: 4.516 ± 0.54
0.887AsnTrp: 0.887 ± 0.259
2.258AsnTyr: 2.258 ± 0.347
0.0AsnXaa: 0.0 ± 0.0
Pro
2.016ProAla: 2.016 ± 0.471
0.081ProCys: 0.081 ± 0.074
1.694ProAsp: 1.694 ± 0.336
3.226ProGlu: 3.226 ± 0.779
1.532ProPhe: 1.532 ± 0.289
1.774ProGly: 1.774 ± 0.403
0.726ProHis: 0.726 ± 0.24
1.694ProIle: 1.694 ± 0.479
3.387ProLys: 3.387 ± 0.683
2.258ProLeu: 2.258 ± 0.389
0.161ProMet: 0.161 ± 0.116
2.419ProAsn: 2.419 ± 0.515
0.806ProPro: 0.806 ± 0.296
1.129ProGln: 1.129 ± 0.287
0.887ProArg: 0.887 ± 0.283
2.419ProSer: 2.419 ± 0.407
3.226ProThr: 3.226 ± 0.493
2.177ProVal: 2.177 ± 0.53
0.242ProTrp: 0.242 ± 0.132
1.21ProTyr: 1.21 ± 0.334
0.0ProXaa: 0.0 ± 0.0
Gln
3.79GlnAla: 3.79 ± 0.616
0.403GlnCys: 0.403 ± 0.18
2.016GlnAsp: 2.016 ± 0.548
2.581GlnGlu: 2.581 ± 0.491
1.29GlnPhe: 1.29 ± 0.301
3.226GlnGly: 3.226 ± 0.594
0.645GlnHis: 0.645 ± 0.24
2.661GlnIle: 2.661 ± 0.609
2.581GlnLys: 2.581 ± 0.54
3.065GlnLeu: 3.065 ± 0.425
1.21GlnMet: 1.21 ± 0.414
2.823GlnAsn: 2.823 ± 0.442
1.694GlnPro: 1.694 ± 0.425
2.258GlnGln: 2.258 ± 0.611
1.855GlnArg: 1.855 ± 0.387
2.823GlnSer: 2.823 ± 0.501
2.419GlnThr: 2.419 ± 0.475
2.339GlnVal: 2.339 ± 0.341
0.565GlnTrp: 0.565 ± 0.211
1.694GlnTyr: 1.694 ± 0.326
0.0GlnXaa: 0.0 ± 0.0
Arg
2.258ArgAla: 2.258 ± 0.488
0.323ArgCys: 0.323 ± 0.158
2.177ArgAsp: 2.177 ± 0.416
2.823ArgGlu: 2.823 ± 0.404
2.419ArgPhe: 2.419 ± 0.509
3.065ArgGly: 3.065 ± 0.583
0.726ArgHis: 0.726 ± 0.232
2.5ArgIle: 2.5 ± 0.477
3.387ArgLys: 3.387 ± 0.604
3.548ArgLeu: 3.548 ± 0.667
0.968ArgMet: 0.968 ± 0.254
2.339ArgAsn: 2.339 ± 0.447
1.371ArgPro: 1.371 ± 0.312
1.855ArgGln: 1.855 ± 0.384
1.29ArgArg: 1.29 ± 0.376
1.855ArgSer: 1.855 ± 0.353
3.468ArgThr: 3.468 ± 0.546
2.984ArgVal: 2.984 ± 0.449
0.806ArgTrp: 0.806 ± 0.28
2.097ArgTyr: 2.097 ± 0.514
0.0ArgXaa: 0.0 ± 0.0
Ser
3.226SerAla: 3.226 ± 0.503
0.484SerCys: 0.484 ± 0.204
3.629SerAsp: 3.629 ± 0.645
5.161SerGlu: 5.161 ± 0.551
2.661SerPhe: 2.661 ± 0.425
3.871SerGly: 3.871 ± 0.507
0.484SerHis: 0.484 ± 0.197
3.871SerIle: 3.871 ± 0.545
5.323SerLys: 5.323 ± 0.582
4.355SerLeu: 4.355 ± 0.555
1.532SerMet: 1.532 ± 0.312
4.113SerAsn: 4.113 ± 0.496
1.935SerPro: 1.935 ± 0.336
2.903SerGln: 2.903 ± 0.705
1.774SerArg: 1.774 ± 0.394
4.597SerSer: 4.597 ± 0.917
3.629SerThr: 3.629 ± 0.515
4.435SerVal: 4.435 ± 0.659
0.645SerTrp: 0.645 ± 0.276
2.097SerTyr: 2.097 ± 0.463
0.0SerXaa: 0.0 ± 0.0
Thr
4.758ThrAla: 4.758 ± 0.884
0.403ThrCys: 0.403 ± 0.186
4.919ThrAsp: 4.919 ± 0.605
3.79ThrGlu: 3.79 ± 0.564
2.903ThrPhe: 2.903 ± 0.408
3.871ThrGly: 3.871 ± 0.435
1.29ThrHis: 1.29 ± 0.3
4.516ThrIle: 4.516 ± 0.581
5.081ThrLys: 5.081 ± 0.605
6.29ThrLeu: 6.29 ± 0.649
1.21ThrMet: 1.21 ± 0.307
3.468ThrAsn: 3.468 ± 0.436
2.016ThrPro: 2.016 ± 0.446
3.226ThrGln: 3.226 ± 0.558
2.177ThrArg: 2.177 ± 0.443
2.903ThrSer: 2.903 ± 0.578
4.113ThrThr: 4.113 ± 0.833
5.0ThrVal: 5.0 ± 0.74
1.29ThrTrp: 1.29 ± 0.309
2.823ThrTyr: 2.823 ± 0.508
0.0ThrXaa: 0.0 ± 0.0
Val
5.161ValAla: 5.161 ± 0.84
0.161ValCys: 0.161 ± 0.101
5.565ValAsp: 5.565 ± 0.554
4.839ValGlu: 4.839 ± 0.621
2.258ValPhe: 2.258 ± 0.404
4.516ValGly: 4.516 ± 0.626
0.726ValHis: 0.726 ± 0.291
4.194ValIle: 4.194 ± 0.713
6.129ValLys: 6.129 ± 0.626
3.79ValLeu: 3.79 ± 0.466
0.645ValMet: 0.645 ± 0.23
2.661ValAsn: 2.661 ± 0.502
1.532ValPro: 1.532 ± 0.369
2.097ValGln: 2.097 ± 0.497
3.145ValArg: 3.145 ± 0.546
5.081ValSer: 5.081 ± 0.61
4.677ValThr: 4.677 ± 0.655
4.919ValVal: 4.919 ± 0.574
1.048ValTrp: 1.048 ± 0.19
2.5ValTyr: 2.5 ± 0.469
0.0ValXaa: 0.0 ± 0.0
Trp
0.565TrpAla: 0.565 ± 0.194
0.242TrpCys: 0.242 ± 0.15
0.968TrpAsp: 0.968 ± 0.568
1.048TrpGlu: 1.048 ± 0.237
0.726TrpPhe: 0.726 ± 0.215
0.887TrpGly: 0.887 ± 0.266
0.484TrpHis: 0.484 ± 0.171
0.645TrpIle: 0.645 ± 0.199
0.726TrpLys: 0.726 ± 0.182
1.532TrpLeu: 1.532 ± 0.302
0.0TrpMet: 0.0 ± 0.0
0.403TrpAsn: 0.403 ± 0.183
0.081TrpPro: 0.081 ± 0.077
0.484TrpGln: 0.484 ± 0.164
0.806TrpArg: 0.806 ± 0.171
1.371TrpSer: 1.371 ± 0.301
1.048TrpThr: 1.048 ± 0.263
1.129TrpVal: 1.129 ± 0.311
0.242TrpTrp: 0.242 ± 0.108
0.161TrpTyr: 0.161 ± 0.117
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.984TyrAla: 2.984 ± 0.653
0.565TyrCys: 0.565 ± 0.265
2.903TyrAsp: 2.903 ± 0.475
2.823TyrGlu: 2.823 ± 0.567
1.532TyrPhe: 1.532 ± 0.242
2.339TyrGly: 2.339 ± 0.464
0.645TyrHis: 0.645 ± 0.24
2.903TyrIle: 2.903 ± 0.483
3.71TyrLys: 3.71 ± 0.544
2.823TyrLeu: 2.823 ± 0.535
0.887TyrMet: 0.887 ± 0.301
1.048TyrAsn: 1.048 ± 0.289
1.21TyrPro: 1.21 ± 0.364
2.258TyrGln: 2.258 ± 0.418
2.581TyrArg: 2.581 ± 0.451
2.016TyrSer: 2.016 ± 0.442
2.419TyrThr: 2.419 ± 0.387
2.419TyrVal: 2.419 ± 0.417
0.081TyrTrp: 0.081 ± 0.064
1.21TyrTyr: 1.21 ± 0.342
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12401 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski