Amino acid dipepetide frequency for Streptococcus phage Javan149

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.423AlaAla: 2.423 ± 0.605
0.081AlaCys: 0.081 ± 0.078
4.119AlaAsp: 4.119 ± 0.415
5.653AlaGlu: 5.653 ± 0.738
2.746AlaPhe: 2.746 ± 0.475
3.796AlaGly: 3.796 ± 0.617
0.888AlaHis: 0.888 ± 0.274
4.684AlaIle: 4.684 ± 0.711
7.187AlaLys: 7.187 ± 0.654
4.684AlaLeu: 4.684 ± 0.629
1.777AlaMet: 1.777 ± 0.299
4.28AlaAsn: 4.28 ± 0.569
1.696AlaPro: 1.696 ± 0.362
1.615AlaGln: 1.615 ± 0.403
2.665AlaArg: 2.665 ± 0.555
4.119AlaSer: 4.119 ± 0.706
2.665AlaThr: 2.665 ± 0.522
3.796AlaVal: 3.796 ± 0.772
0.888AlaTrp: 0.888 ± 0.271
2.1AlaTyr: 2.1 ± 0.359
0.0AlaXaa: 0.0 ± 0.0
Cys
0.081CysAla: 0.081 ± 0.088
0.0CysCys: 0.0 ± 0.0
0.404CysAsp: 0.404 ± 0.18
0.808CysGlu: 0.808 ± 0.269
0.242CysPhe: 0.242 ± 0.145
0.404CysGly: 0.404 ± 0.193
0.081CysHis: 0.081 ± 0.077
0.242CysIle: 0.242 ± 0.137
0.242CysLys: 0.242 ± 0.141
0.404CysLeu: 0.404 ± 0.182
0.162CysMet: 0.162 ± 0.12
0.242CysAsn: 0.242 ± 0.133
0.162CysPro: 0.162 ± 0.108
0.162CysGln: 0.162 ± 0.116
0.323CysArg: 0.323 ± 0.24
0.727CysSer: 0.727 ± 0.2
0.0CysThr: 0.0 ± 0.0
0.323CysVal: 0.323 ± 0.176
0.0CysTrp: 0.0 ± 0.0
0.404CysTyr: 0.404 ± 0.273
0.0CysXaa: 0.0 ± 0.0
Asp
3.796AspAla: 3.796 ± 0.459
0.565AspCys: 0.565 ± 0.209
3.392AspAsp: 3.392 ± 0.561
5.088AspGlu: 5.088 ± 0.71
2.746AspPhe: 2.746 ± 0.501
5.734AspGly: 5.734 ± 0.735
0.727AspHis: 0.727 ± 0.21
4.361AspIle: 4.361 ± 0.525
5.007AspLys: 5.007 ± 0.629
5.814AspLeu: 5.814 ± 0.545
1.938AspMet: 1.938 ± 0.503
4.603AspAsn: 4.603 ± 0.622
0.969AspPro: 0.969 ± 0.282
0.969AspGln: 0.969 ± 0.236
3.069AspArg: 3.069 ± 0.485
3.23AspSer: 3.23 ± 0.557
3.23AspThr: 3.23 ± 0.583
3.876AspVal: 3.876 ± 0.594
1.05AspTrp: 1.05 ± 0.449
3.634AspTyr: 3.634 ± 0.508
0.0AspXaa: 0.0 ± 0.0
Glu
4.28GluAla: 4.28 ± 0.84
0.565GluCys: 0.565 ± 0.218
3.473GluAsp: 3.473 ± 0.663
5.411GluGlu: 5.411 ± 0.775
3.149GluPhe: 3.149 ± 0.45
2.988GluGly: 2.988 ± 0.436
1.454GluHis: 1.454 ± 0.362
5.411GluIle: 5.411 ± 0.703
7.268GluLys: 7.268 ± 0.859
8.641GluLeu: 8.641 ± 1.067
1.938GluMet: 1.938 ± 0.442
4.038GluAsn: 4.038 ± 0.626
1.777GluPro: 1.777 ± 0.414
3.149GluGln: 3.149 ± 0.548
2.826GluArg: 2.826 ± 0.743
3.796GluSer: 3.796 ± 0.556
4.119GluThr: 4.119 ± 0.509
5.491GluVal: 5.491 ± 0.74
0.888GluTrp: 0.888 ± 0.277
2.665GluTyr: 2.665 ± 0.48
0.0GluXaa: 0.0 ± 0.0
Phe
2.746PheAla: 2.746 ± 0.424
0.242PheCys: 0.242 ± 0.148
4.038PheAsp: 4.038 ± 0.605
2.019PheGlu: 2.019 ± 0.441
1.454PhePhe: 1.454 ± 0.392
3.23PheGly: 3.23 ± 0.546
0.242PheHis: 0.242 ± 0.156
3.069PheIle: 3.069 ± 0.546
2.826PheLys: 2.826 ± 0.644
2.18PheLeu: 2.18 ± 0.437
0.888PheMet: 0.888 ± 0.235
2.503PheAsn: 2.503 ± 0.482
0.888PhePro: 0.888 ± 0.234
1.05PheGln: 1.05 ± 0.339
0.969PheArg: 0.969 ± 0.279
2.584PheSer: 2.584 ± 0.416
2.907PheThr: 2.907 ± 0.446
2.665PheVal: 2.665 ± 0.465
0.727PheTrp: 0.727 ± 0.211
1.857PheTyr: 1.857 ± 0.355
0.0PheXaa: 0.0 ± 0.0
Gly
3.957GlyAla: 3.957 ± 0.607
0.323GlyCys: 0.323 ± 0.181
2.988GlyAsp: 2.988 ± 0.501
2.988GlyGlu: 2.988 ± 0.47
2.1GlyPhe: 2.1 ± 0.433
3.957GlyGly: 3.957 ± 0.807
1.131GlyHis: 1.131 ± 0.308
5.734GlyIle: 5.734 ± 0.672
4.603GlyLys: 4.603 ± 0.503
5.249GlyLeu: 5.249 ± 0.607
2.261GlyMet: 2.261 ± 0.465
3.392GlyAsn: 3.392 ± 0.406
1.05GlyPro: 1.05 ± 0.464
2.261GlyGln: 2.261 ± 0.507
2.342GlyArg: 2.342 ± 0.448
4.199GlySer: 4.199 ± 0.961
4.442GlyThr: 4.442 ± 1.046
4.442GlyVal: 4.442 ± 0.609
1.211GlyTrp: 1.211 ± 0.313
3.634GlyTyr: 3.634 ± 0.567
0.0GlyXaa: 0.0 ± 0.0
His
0.888HisAla: 0.888 ± 0.353
0.081HisCys: 0.081 ± 0.068
0.323HisAsp: 0.323 ± 0.136
0.727HisGlu: 0.727 ± 0.239
0.242HisPhe: 0.242 ± 0.127
0.808HisGly: 0.808 ± 0.276
0.081HisHis: 0.081 ± 0.076
1.373HisIle: 1.373 ± 0.301
1.373HisLys: 1.373 ± 0.361
1.05HisLeu: 1.05 ± 0.267
0.485HisMet: 0.485 ± 0.183
1.05HisAsn: 1.05 ± 0.29
0.404HisPro: 0.404 ± 0.178
0.404HisGln: 0.404 ± 0.208
0.404HisArg: 0.404 ± 0.178
1.292HisSer: 1.292 ± 0.358
1.211HisThr: 1.211 ± 0.358
1.05HisVal: 1.05 ± 0.258
0.162HisTrp: 0.162 ± 0.092
0.808HisTyr: 0.808 ± 0.282
0.0HisXaa: 0.0 ± 0.0
Ile
5.168IleAla: 5.168 ± 0.532
0.565IleCys: 0.565 ± 0.204
5.653IleAsp: 5.653 ± 0.679
6.218IleGlu: 6.218 ± 1.002
1.777IlePhe: 1.777 ± 0.392
4.038IleGly: 4.038 ± 0.561
0.727IleHis: 0.727 ± 0.193
4.442IleIle: 4.442 ± 0.578
5.734IleLys: 5.734 ± 0.611
3.957IleLeu: 3.957 ± 0.631
1.534IleMet: 1.534 ± 0.346
5.33IleAsn: 5.33 ± 0.755
2.019IlePro: 2.019 ± 0.425
1.696IleGln: 1.696 ± 0.436
2.503IleArg: 2.503 ± 0.425
4.845IleSer: 4.845 ± 0.567
5.653IleThr: 5.653 ± 0.732
4.361IleVal: 4.361 ± 0.603
0.727IleTrp: 0.727 ± 0.187
2.907IleTyr: 2.907 ± 0.417
0.0IleXaa: 0.0 ± 0.0
Lys
5.734LysAla: 5.734 ± 0.727
0.162LysCys: 0.162 ± 0.123
5.411LysAsp: 5.411 ± 0.766
5.734LysGlu: 5.734 ± 0.706
2.584LysPhe: 2.584 ± 0.37
4.28LysGly: 4.28 ± 0.524
1.292LysHis: 1.292 ± 0.354
5.814LysIle: 5.814 ± 0.69
6.783LysLys: 6.783 ± 0.956
6.703LysLeu: 6.703 ± 0.724
1.777LysMet: 1.777 ± 0.458
5.411LysAsn: 5.411 ± 0.824
2.584LysPro: 2.584 ± 0.574
4.926LysGln: 4.926 ± 0.612
4.765LysArg: 4.765 ± 0.702
5.814LysSer: 5.814 ± 0.703
5.734LysThr: 5.734 ± 0.572
6.299LysVal: 6.299 ± 0.866
1.05LysTrp: 1.05 ± 0.362
3.553LysTyr: 3.553 ± 0.406
0.0LysXaa: 0.0 ± 0.0
Leu
5.653LeuAla: 5.653 ± 0.789
0.323LeuCys: 0.323 ± 0.164
6.137LeuAsp: 6.137 ± 0.789
7.753LeuGlu: 7.753 ± 0.937
2.261LeuPhe: 2.261 ± 0.36
4.038LeuGly: 4.038 ± 0.571
1.292LeuHis: 1.292 ± 0.342
5.976LeuIle: 5.976 ± 0.723
7.833LeuLys: 7.833 ± 0.688
6.541LeuLeu: 6.541 ± 0.757
2.261LeuMet: 2.261 ± 0.439
5.088LeuAsn: 5.088 ± 0.564
1.938LeuPro: 1.938 ± 0.33
3.311LeuGln: 3.311 ± 0.517
3.149LeuArg: 3.149 ± 0.465
6.46LeuSer: 6.46 ± 0.675
5.895LeuThr: 5.895 ± 0.646
4.926LeuVal: 4.926 ± 0.69
1.05LeuTrp: 1.05 ± 0.24
3.069LeuTyr: 3.069 ± 0.46
0.0LeuXaa: 0.0 ± 0.0
Met
2.18MetAla: 2.18 ± 0.427
0.081MetCys: 0.081 ± 0.088
1.211MetAsp: 1.211 ± 0.272
1.857MetGlu: 1.857 ± 0.473
1.292MetPhe: 1.292 ± 0.283
0.565MetGly: 0.565 ± 0.266
0.485MetHis: 0.485 ± 0.19
1.292MetIle: 1.292 ± 0.364
1.696MetLys: 1.696 ± 0.369
2.746MetLeu: 2.746 ± 0.501
0.323MetMet: 0.323 ± 0.168
1.211MetAsn: 1.211 ± 0.306
0.969MetPro: 0.969 ± 0.266
0.888MetGln: 0.888 ± 0.264
0.888MetArg: 0.888 ± 0.27
1.615MetSer: 1.615 ± 0.304
2.18MetThr: 2.18 ± 0.386
1.292MetVal: 1.292 ± 0.331
0.242MetTrp: 0.242 ± 0.141
0.727MetTyr: 0.727 ± 0.291
0.0MetXaa: 0.0 ± 0.0
Asn
3.796AsnAla: 3.796 ± 0.534
0.242AsnCys: 0.242 ± 0.15
3.796AsnAsp: 3.796 ± 0.581
4.442AsnGlu: 4.442 ± 0.613
2.423AsnPhe: 2.423 ± 0.395
4.522AsnGly: 4.522 ± 0.659
1.131AsnHis: 1.131 ± 0.282
3.634AsnIle: 3.634 ± 0.554
4.522AsnLys: 4.522 ± 0.537
5.249AsnLeu: 5.249 ± 0.67
1.373AsnMet: 1.373 ± 0.346
4.684AsnAsn: 4.684 ± 0.682
2.342AsnPro: 2.342 ± 0.504
2.826AsnGln: 2.826 ± 0.561
2.1AsnArg: 2.1 ± 0.434
4.038AsnSer: 4.038 ± 0.611
2.988AsnThr: 2.988 ± 0.48
3.876AsnVal: 3.876 ± 0.45
0.969AsnTrp: 0.969 ± 0.281
2.826AsnTyr: 2.826 ± 0.466
0.0AsnXaa: 0.0 ± 0.0
Pro
1.696ProAla: 1.696 ± 0.365
0.081ProCys: 0.081 ± 0.089
1.534ProAsp: 1.534 ± 0.383
2.261ProGlu: 2.261 ± 0.482
1.211ProPhe: 1.211 ± 0.263
1.131ProGly: 1.131 ± 0.25
0.485ProHis: 0.485 ± 0.207
1.211ProIle: 1.211 ± 0.358
2.665ProLys: 2.665 ± 0.455
2.18ProLeu: 2.18 ± 0.377
0.404ProMet: 0.404 ± 0.177
1.777ProAsn: 1.777 ± 0.33
1.05ProPro: 1.05 ± 0.258
1.373ProGln: 1.373 ± 0.411
1.373ProArg: 1.373 ± 0.357
2.503ProSer: 2.503 ± 0.518
1.615ProThr: 1.615 ± 0.401
1.938ProVal: 1.938 ± 0.386
0.242ProTrp: 0.242 ± 0.138
0.808ProTyr: 0.808 ± 0.203
0.0ProXaa: 0.0 ± 0.0
Gln
1.857GlnAla: 1.857 ± 0.318
0.404GlnCys: 0.404 ± 0.199
2.18GlnAsp: 2.18 ± 0.407
2.423GlnGlu: 2.423 ± 0.378
1.373GlnPhe: 1.373 ± 0.4
3.149GlnGly: 3.149 ± 0.679
0.565GlnHis: 0.565 ± 0.212
2.665GlnIle: 2.665 ± 0.327
3.069GlnLys: 3.069 ± 0.465
3.392GlnLeu: 3.392 ± 0.567
0.969GlnMet: 0.969 ± 0.308
2.746GlnAsn: 2.746 ± 0.475
0.727GlnPro: 0.727 ± 0.227
1.857GlnGln: 1.857 ± 0.459
1.131GlnArg: 1.131 ± 0.258
2.261GlnSer: 2.261 ± 0.563
2.019GlnThr: 2.019 ± 0.423
3.149GlnVal: 3.149 ± 0.397
0.404GlnTrp: 0.404 ± 0.19
1.696GlnTyr: 1.696 ± 0.396
0.0GlnXaa: 0.0 ± 0.0
Arg
2.019ArgAla: 2.019 ± 0.502
0.162ArgCys: 0.162 ± 0.092
2.342ArgAsp: 2.342 ± 0.368
2.18ArgGlu: 2.18 ± 0.416
1.211ArgPhe: 1.211 ± 0.342
2.1ArgGly: 2.1 ± 0.418
0.646ArgHis: 0.646 ± 0.216
2.261ArgIle: 2.261 ± 0.376
3.634ArgLys: 3.634 ± 0.516
4.038ArgLeu: 4.038 ± 0.52
1.211ArgMet: 1.211 ± 0.302
2.423ArgAsn: 2.423 ± 0.478
1.615ArgPro: 1.615 ± 0.319
2.503ArgGln: 2.503 ± 0.532
2.18ArgArg: 2.18 ± 0.418
1.696ArgSer: 1.696 ± 0.381
2.423ArgThr: 2.423 ± 0.391
2.261ArgVal: 2.261 ± 0.479
0.485ArgTrp: 0.485 ± 0.173
1.777ArgTyr: 1.777 ± 0.461
0.0ArgXaa: 0.0 ± 0.0
Ser
4.603SerAla: 4.603 ± 0.737
0.242SerCys: 0.242 ± 0.127
4.522SerAsp: 4.522 ± 0.593
5.814SerGlu: 5.814 ± 0.635
3.23SerPhe: 3.23 ± 0.675
5.411SerGly: 5.411 ± 0.972
0.646SerHis: 0.646 ± 0.216
5.168SerIle: 5.168 ± 0.633
5.491SerLys: 5.491 ± 0.68
5.572SerLeu: 5.572 ± 0.676
1.373SerMet: 1.373 ± 0.343
3.634SerAsn: 3.634 ± 0.731
1.373SerPro: 1.373 ± 0.324
2.907SerGln: 2.907 ± 0.641
1.615SerArg: 1.615 ± 0.307
3.876SerSer: 3.876 ± 0.97
3.553SerThr: 3.553 ± 0.672
3.796SerVal: 3.796 ± 0.475
0.727SerTrp: 0.727 ± 0.211
2.584SerTyr: 2.584 ± 0.548
0.0SerXaa: 0.0 ± 0.0
Thr
3.069ThrAla: 3.069 ± 0.493
0.081ThrCys: 0.081 ± 0.094
3.149ThrAsp: 3.149 ± 0.511
3.392ThrGlu: 3.392 ± 0.56
3.473ThrPhe: 3.473 ± 0.647
5.249ThrGly: 5.249 ± 0.728
0.808ThrHis: 0.808 ± 0.287
4.522ThrIle: 4.522 ± 0.908
6.057ThrLys: 6.057 ± 0.647
5.976ThrLeu: 5.976 ± 0.797
0.646ThrMet: 0.646 ± 0.186
3.634ThrAsn: 3.634 ± 0.601
1.938ThrPro: 1.938 ± 0.312
1.534ThrGln: 1.534 ± 0.295
1.857ThrArg: 1.857 ± 0.336
4.361ThrSer: 4.361 ± 0.538
4.926ThrThr: 4.926 ± 0.785
4.603ThrVal: 4.603 ± 0.881
0.969ThrTrp: 0.969 ± 0.32
2.584ThrTyr: 2.584 ± 0.615
0.0ThrXaa: 0.0 ± 0.0
Val
4.603ValAla: 4.603 ± 0.596
0.646ValCys: 0.646 ± 0.22
5.168ValAsp: 5.168 ± 0.685
5.007ValGlu: 5.007 ± 0.792
2.423ValPhe: 2.423 ± 0.423
2.988ValGly: 2.988 ± 0.566
0.646ValHis: 0.646 ± 0.232
3.876ValIle: 3.876 ± 0.62
6.057ValLys: 6.057 ± 0.712
5.33ValLeu: 5.33 ± 0.456
1.211ValMet: 1.211 ± 0.319
2.907ValAsn: 2.907 ± 0.515
1.938ValPro: 1.938 ± 0.45
1.938ValGln: 1.938 ± 0.338
2.988ValArg: 2.988 ± 0.632
5.734ValSer: 5.734 ± 0.513
4.442ValThr: 4.442 ± 0.735
3.957ValVal: 3.957 ± 0.59
0.727ValTrp: 0.727 ± 0.254
2.1ValTyr: 2.1 ± 0.379
0.0ValXaa: 0.0 ± 0.0
Trp
0.969TrpAla: 0.969 ± 0.237
0.162TrpCys: 0.162 ± 0.11
0.485TrpAsp: 0.485 ± 0.191
0.969TrpGlu: 0.969 ± 0.281
0.565TrpPhe: 0.565 ± 0.194
0.969TrpGly: 0.969 ± 0.293
0.162TrpHis: 0.162 ± 0.116
1.211TrpIle: 1.211 ± 0.444
1.292TrpLys: 1.292 ± 0.36
1.05TrpLeu: 1.05 ± 0.361
0.323TrpMet: 0.323 ± 0.146
0.808TrpAsn: 0.808 ± 0.224
0.323TrpPro: 0.323 ± 0.141
0.485TrpGln: 0.485 ± 0.21
0.485TrpArg: 0.485 ± 0.23
0.969TrpSer: 0.969 ± 0.28
0.969TrpThr: 0.969 ± 0.257
0.404TrpVal: 0.404 ± 0.179
0.162TrpTrp: 0.162 ± 0.107
0.242TrpTyr: 0.242 ± 0.114
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.342TyrAla: 2.342 ± 0.69
0.404TyrCys: 0.404 ± 0.183
3.473TyrAsp: 3.473 ± 0.562
2.584TyrGlu: 2.584 ± 0.541
2.584TyrPhe: 2.584 ± 0.595
2.907TyrGly: 2.907 ± 0.458
0.727TyrHis: 0.727 ± 0.214
2.907TyrIle: 2.907 ± 0.527
3.149TyrLys: 3.149 ± 0.626
4.199TyrLeu: 4.199 ± 0.641
0.808TyrMet: 0.808 ± 0.271
2.019TyrAsn: 2.019 ± 0.345
1.777TyrPro: 1.777 ± 0.389
2.18TyrGln: 2.18 ± 0.471
1.534TyrArg: 1.534 ± 0.348
2.019TyrSer: 2.019 ± 0.376
1.857TyrThr: 1.857 ± 0.466
2.18TyrVal: 2.18 ± 0.537
0.323TyrTrp: 0.323 ± 0.141
1.938TyrTyr: 1.938 ± 0.407
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (12384 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski