Amino acid dipeptide frequency for Streptococcus phage Javan123

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red and all under-represented ones in blue in the table.
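As a rough illustration of the 0.25% expectation, the sketch below computes the uniform baseline in per mille and compares a few values copied from the table against it. It assumes that the threshold for "over-represented" is exactly this uniform baseline; the page does not state the rule used for the colouring, and the `classify` helper is hypothetical.

```python
# Uniform expectation for one dipeptide out of 20 x 20 standard pairs,
# expressed in per mille (parts per thousand).
N_STANDARD = 20
expected_per_mille = 1000 / N_STANDARD**2  # 0.25% == 2.5 per mille

# A handful of values copied from the table (per mille).
observed = {
    "GluLeu": 8.925,
    "LeuGlu": 8.05,
    "AlaAla": 3.675,
    "TrpPro": 0.087,
    "CysTrp": 0.0,
}


def classify(value, expected=expected_per_mille):
    """Hypothetical rule: compare a frequency with the uniform baseline."""
    if value > expected:
        return "over-represented"
    if value < expected:
        return "under-represented"
    return "as expected"


labels = {pair: classify(value) for pair, value in observed.items()}
for pair, label in labels.items():
    print(f"{pair}: {observed[pair]} per mille, {label}")
```

The uniform baseline ignores the fact that single amino acids are themselves unevenly used, so a rare residue such as Cys or Trp pulls all of its dipeptides below 2.5‰ regardless of any pairing preference.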

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
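A minimal sketch of how such per mille values could be computed from a set of protein sequences is shown below. It is not the database's own code: the function name, the one-letter sequence input and the `denominator` option are assumptions made for illustration. Dipeptides are counted as overlapping windows inside each protein, never across two proteins.

```python
from collections import Counter


def dipeptide_per_mille(proteins, denominator="pairs"):
    """Return dipeptide frequencies in per mille for a list of sequences.

    denominator="pairs" divides by the number of dipeptide windows;
    denominator="residues" divides by the total number of amino acids.
    Which one the database uses is an assumption, not stated on the page.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))

    if denominator == "pairs":
        total = sum(counts.values())
    elif denominator == "residues":
        total = sum(len(seq) for seq in proteins)
    else:
        raise ValueError("denominator must be 'pairs' or 'residues'")

    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (the table itself uses three-letter codes).
toy = ["MKLLE", "MEL"]
freq = dipeptide_per_mille(toy)
freq_res = dipeptide_per_mille(toy, denominator="residues")

# Converting a per mille value back to a plain fraction.
fraction_ll = freq["LL"] * 1e-3
```

The non-zero values in the table look like integer multiples of about 0.0875‰, which is close to 1000/11430. That hints at normalisation by the total residue count, but this is an inference from the numbers rather than something the page states.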

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰). Rows are grouped by the first residue; the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.675 ± 1.02
AlaCys: 0.437 ± 0.182
AlaAsp: 4.55 ± 0.538
AlaGlu: 4.637 ± 0.7
AlaPhe: 2.625 ± 0.32
AlaGly: 4.9 ± 0.703
AlaHis: 0.437 ± 0.208
AlaIle: 6.475 ± 1.012
AlaLys: 5.862 ± 0.692
AlaLeu: 5.95 ± 0.87
AlaMet: 1.575 ± 0.338
AlaAsn: 3.587 ± 0.625
AlaPro: 1.312 ± 0.425
AlaGln: 2.712 ± 0.857
AlaArg: 3.15 ± 0.449
AlaSer: 5.25 ± 0.863
AlaThr: 4.375 ± 0.642
AlaVal: 3.937 ± 0.668
AlaTrp: 0.787 ± 0.305
AlaTyr: 3.325 ± 0.513
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.525 ± 0.231
CysCys: 0.437 ± 0.234
CysAsp: 0.262 ± 0.164
CysGlu: 0.7 ± 0.191
CysPhe: 0.262 ± 0.163
CysGly: 0.787 ± 0.224
CysHis: 0.087 ± 0.077
CysIle: 0.437 ± 0.257
CysLys: 0.437 ± 0.224
CysLeu: 0.787 ± 0.345
CysMet: 0.087 ± 0.088
CysAsn: 0.262 ± 0.192
CysPro: 0.35 ± 0.195
CysGln: 0.875 ± 0.324
CysArg: 0.437 ± 0.196
CysSer: 0.35 ± 0.165
CysThr: 0.087 ± 0.093
CysVal: 0.437 ± 0.217
CysTrp: 0.0 ± 0.0
CysTyr: 0.525 ± 0.249
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.15 ± 0.468
AspCys: 0.7 ± 0.278
AspAsp: 3.675 ± 0.715
AspGlu: 4.9 ± 0.911
AspPhe: 3.412 ± 0.527
AspGly: 5.075 ± 0.623
AspHis: 1.137 ± 0.448
AspIle: 4.55 ± 0.504
AspLys: 3.937 ± 0.495
AspLeu: 5.25 ± 1.033
AspMet: 1.925 ± 0.407
AspAsn: 2.362 ± 0.402
AspPro: 1.75 ± 0.462
AspGln: 1.837 ± 0.386
AspArg: 2.537 ± 0.657
AspSer: 3.85 ± 0.632
AspThr: 2.8 ± 0.487
AspVal: 3.587 ± 0.583
AspTrp: 1.05 ± 0.307
AspTyr: 2.275 ± 0.537
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.55 ± 0.514
GluCys: 0.525 ± 0.264
GluAsp: 4.375 ± 0.715
GluGlu: 6.212 ± 1.106
GluPhe: 1.662 ± 0.397
GluGly: 4.987 ± 0.569
GluHis: 1.225 ± 0.376
GluIle: 4.025 ± 0.403
GluLys: 6.562 ± 1.036
GluLeu: 8.925 ± 1.143
GluMet: 2.275 ± 0.553
GluAsn: 4.375 ± 0.704
GluPro: 1.75 ± 0.524
GluGln: 4.025 ± 0.545
GluArg: 3.15 ± 0.605
GluSer: 3.237 ± 0.466
GluThr: 4.9 ± 0.7
GluVal: 4.2 ± 0.658
GluTrp: 0.7 ± 0.303
GluTyr: 1.575 ± 0.342
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.537 ± 0.505
PheCys: 0.437 ± 0.194
PheAsp: 2.8 ± 0.473
PheGlu: 3.062 ± 0.622
PhePhe: 1.75 ± 0.4
PheGly: 2.537 ± 0.42
PheHis: 1.05 ± 0.269
PheIle: 2.1 ± 0.53
PheLys: 3.062 ± 0.626
PheLeu: 2.625 ± 0.564
PheMet: 0.875 ± 0.316
PheAsn: 1.837 ± 0.269
PhePro: 0.7 ± 0.271
PheGln: 1.312 ± 0.381
PheArg: 1.837 ± 0.314
PheSer: 1.925 ± 0.375
PheThr: 2.012 ± 0.355
PheVal: 1.575 ± 0.304
PheTrp: 0.612 ± 0.254
PheTyr: 1.925 ± 0.364
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.412 ± 0.528
GlyCys: 0.262 ± 0.167
GlyAsp: 4.2 ± 0.81
GlyGlu: 4.287 ± 0.599
GlyPhe: 2.45 ± 0.446
GlyGly: 4.287 ± 0.762
GlyHis: 2.012 ± 0.423
GlyIle: 5.337 ± 0.964
GlyLys: 4.725 ± 0.458
GlyLeu: 5.95 ± 0.874
GlyMet: 1.925 ± 0.43
GlyAsn: 3.587 ± 0.66
GlyPro: 0.787 ± 0.234
GlyGln: 3.587 ± 0.58
GlyArg: 3.587 ± 0.449
GlySer: 4.287 ± 0.617
GlyThr: 4.375 ± 0.559
GlyVal: 4.287 ± 0.678
GlyTrp: 0.787 ± 0.187
GlyTyr: 2.975 ± 0.486
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.962 ± 0.227
HisCys: 0.087 ± 0.083
HisAsp: 1.137 ± 0.301
HisGlu: 0.875 ± 0.295
HisPhe: 1.137 ± 0.345
HisGly: 1.662 ± 0.359
HisHis: 0.437 ± 0.218
HisIle: 1.312 ± 0.264
HisLys: 0.787 ± 0.263
HisLeu: 2.012 ± 0.305
HisMet: 0.35 ± 0.184
HisAsn: 1.05 ± 0.298
HisPro: 1.137 ± 0.385
HisGln: 0.962 ± 0.347
HisArg: 0.962 ± 0.33
HisSer: 0.787 ± 0.257
HisThr: 1.137 ± 0.38
HisVal: 1.137 ± 0.354
HisTrp: 0.175 ± 0.13
HisTyr: 0.525 ± 0.332
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.25 ± 0.447
IleCys: 0.525 ± 0.2
IleAsp: 5.162 ± 0.678
IleGlu: 3.325 ± 0.602
IlePhe: 1.312 ± 0.446
IleGly: 4.812 ± 0.773
IleHis: 0.962 ± 0.3
IleIle: 3.325 ± 0.566
IleLys: 4.725 ± 0.665
IleLeu: 5.6 ± 0.755
IleMet: 0.962 ± 0.286
IleAsn: 2.8 ± 0.491
IlePro: 2.187 ± 0.374
IleGln: 2.8 ± 0.481
IleArg: 2.975 ± 0.441
IleSer: 5.775 ± 0.98
IleThr: 5.075 ± 0.839
IleVal: 4.2 ± 0.954
IleTrp: 1.225 ± 0.466
IleTyr: 1.925 ± 0.334
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.037 ± 0.569
LysCys: 0.35 ± 0.16
LysAsp: 3.762 ± 0.658
LysGlu: 5.425 ± 0.618
LysPhe: 2.45 ± 0.412
LysGly: 4.55 ± 0.422
LysHis: 1.925 ± 0.468
LysIle: 5.162 ± 0.555
LysLys: 4.025 ± 0.621
LysLeu: 6.037 ± 0.681
LysMet: 1.662 ± 0.448
LysAsn: 2.975 ± 0.483
LysPro: 2.275 ± 0.249
LysGln: 3.675 ± 0.71
LysArg: 4.2 ± 0.575
LysSer: 3.85 ± 0.568
LysThr: 3.85 ± 0.567
LysVal: 4.287 ± 0.599
LysTrp: 1.4 ± 0.352
LysTyr: 1.925 ± 0.515
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.65 ± 0.74
LeuCys: 0.35 ± 0.208
LeuAsp: 5.075 ± 0.599
LeuGlu: 8.05 ± 0.959
LeuPhe: 2.712 ± 0.455
LeuGly: 4.812 ± 0.672
LeuHis: 1.662 ± 0.374
LeuIle: 4.55 ± 0.642
LeuLys: 7.35 ± 0.841
LeuLeu: 7.175 ± 0.979
LeuMet: 2.1 ± 0.344
LeuAsn: 4.637 ± 0.678
LeuPro: 3.237 ± 0.719
LeuGln: 3.937 ± 0.686
LeuArg: 3.587 ± 0.638
LeuSer: 7.087 ± 0.754
LeuThr: 7.525 ± 0.909
LeuVal: 6.037 ± 0.719
LeuTrp: 0.612 ± 0.203
LeuTyr: 3.85 ± 0.783
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.925 ± 0.428
MetCys: 0.175 ± 0.126
MetAsp: 1.312 ± 0.422
MetGlu: 1.837 ± 0.467
MetPhe: 0.7 ± 0.289
MetGly: 2.012 ± 0.578
MetHis: 0.175 ± 0.146
MetIle: 1.662 ± 0.369
MetLys: 1.662 ± 0.386
MetLeu: 1.225 ± 0.36
MetMet: 0.875 ± 0.372
MetAsn: 0.7 ± 0.211
MetPro: 0.437 ± 0.185
MetGln: 0.875 ± 0.31
MetArg: 1.137 ± 0.276
MetSer: 2.362 ± 0.399
MetThr: 1.837 ± 0.447
MetVal: 1.312 ± 0.51
MetTrp: 0.087 ± 0.111
MetTyr: 0.437 ± 0.19
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.9 ± 0.795
AsnCys: 0.262 ± 0.189
AsnAsp: 2.45 ± 0.502
AsnGlu: 2.712 ± 0.592
AsnPhe: 2.012 ± 0.426
AsnGly: 4.9 ± 0.676
AsnHis: 1.137 ± 0.248
AsnIle: 2.187 ± 0.391
AsnLys: 2.625 ± 0.514
AsnLeu: 4.55 ± 0.985
AsnMet: 0.962 ± 0.27
AsnAsn: 1.75 ± 0.404
AsnPro: 2.187 ± 0.336
AsnGln: 1.925 ± 0.435
AsnArg: 2.625 ± 0.453
AsnSer: 2.975 ± 0.405
AsnThr: 1.925 ± 0.751
AsnVal: 2.362 ± 0.563
AsnTrp: 1.312 ± 0.354
AsnTyr: 1.05 ± 0.26
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.137 ± 0.265
ProCys: 0.35 ± 0.146
ProAsp: 1.925 ± 0.396
ProGlu: 2.362 ± 0.497
ProPhe: 1.225 ± 0.331
ProGly: 0.787 ± 0.317
ProHis: 0.7 ± 0.241
ProIle: 1.837 ± 0.373
ProLys: 2.012 ± 0.414
ProLeu: 3.15 ± 0.497
ProMet: 0.612 ± 0.217
ProAsn: 1.312 ± 0.31
ProPro: 0.962 ± 0.359
ProGln: 0.787 ± 0.289
ProArg: 1.662 ± 0.342
ProSer: 2.712 ± 0.496
ProThr: 2.187 ± 0.473
ProVal: 2.275 ± 0.486
ProTrp: 0.437 ± 0.214
ProTyr: 1.225 ± 0.394
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.462 ± 0.739
GlnCys: 0.262 ± 0.126
GlnAsp: 2.1 ± 0.372
GlnGlu: 3.675 ± 0.602
GlnPhe: 1.662 ± 0.401
GlnGly: 2.187 ± 0.496
GlnHis: 0.437 ± 0.191
GlnIle: 2.625 ± 0.489
GlnLys: 2.537 ± 0.53
GlnLeu: 4.725 ± 0.603
GlnMet: 1.137 ± 0.387
GlnAsn: 2.187 ± 0.552
GlnPro: 1.575 ± 0.436
GlnGln: 2.362 ± 0.394
GlnArg: 1.837 ± 0.508
GlnSer: 2.8 ± 0.498
GlnThr: 3.587 ± 0.832
GlnVal: 4.375 ± 0.594
GlnTrp: 0.612 ± 0.286
GlnTyr: 0.875 ± 0.346
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.45 ± 0.531
ArgCys: 0.875 ± 0.324
ArgAsp: 2.45 ± 0.425
ArgGlu: 3.587 ± 0.447
ArgPhe: 1.925 ± 0.465
ArgGly: 2.712 ± 0.509
ArgHis: 0.787 ± 0.327
ArgIle: 2.975 ± 0.571
ArgLys: 3.762 ± 0.78
ArgLeu: 4.812 ± 0.666
ArgMet: 0.437 ± 0.192
ArgAsn: 2.1 ± 0.337
ArgPro: 1.4 ± 0.384
ArgGln: 2.8 ± 0.487
ArgArg: 1.837 ± 0.483
ArgSer: 2.8 ± 0.375
ArgThr: 3.062 ± 0.65
ArgVal: 4.112 ± 0.542
ArgTrp: 1.05 ± 0.312
ArgTyr: 1.225 ± 0.419
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.375 ± 0.574
SerCys: 0.525 ± 0.247
SerAsp: 4.637 ± 0.679
SerGlu: 4.2 ± 0.771
SerPhe: 2.537 ± 0.611
SerGly: 4.637 ± 0.64
SerHis: 1.225 ± 0.302
SerIle: 4.812 ± 0.662
SerLys: 4.2 ± 0.51
SerLeu: 5.775 ± 0.701
SerMet: 1.487 ± 0.291
SerAsn: 3.15 ± 0.663
SerPro: 2.362 ± 0.363
SerGln: 3.85 ± 1.004
SerArg: 3.062 ± 0.456
SerSer: 5.425 ± 0.922
SerThr: 4.637 ± 0.597
SerVal: 4.287 ± 0.642
SerTrp: 1.137 ± 0.252
SerTyr: 2.537 ± 0.507
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.6 ± 0.481
ThrCys: 0.175 ± 0.114
ThrAsp: 2.8 ± 0.469
ThrGlu: 4.725 ± 0.669
ThrPhe: 2.712 ± 0.629
ThrGly: 4.9 ± 0.897
ThrHis: 0.7 ± 0.18
ThrIle: 4.812 ± 0.928
ThrLys: 4.637 ± 0.787
ThrLeu: 5.6 ± 0.567
ThrMet: 0.875 ± 0.321
ThrAsn: 2.537 ± 0.482
ThrPro: 1.925 ± 0.445
ThrGln: 2.625 ± 0.815
ThrArg: 2.012 ± 0.519
ThrSer: 5.687 ± 1.214
ThrThr: 5.337 ± 0.921
ThrVal: 6.037 ± 0.768
ThrTrp: 0.875 ± 0.263
ThrTyr: 2.012 ± 0.478
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.287 ± 0.707
ValCys: 0.525 ± 0.301
ValAsp: 3.762 ± 0.678
ValGlu: 4.812 ± 0.73
ValPhe: 2.012 ± 0.551
ValGly: 3.937 ± 0.626
ValHis: 1.137 ± 0.283
ValIle: 4.112 ± 0.69
ValLys: 4.375 ± 0.638
ValLeu: 6.475 ± 0.627
ValMet: 1.487 ± 0.396
ValAsn: 2.8 ± 0.463
ValPro: 2.1 ± 0.281
ValGln: 2.45 ± 0.362
ValArg: 4.112 ± 0.685
ValSer: 4.375 ± 0.812
ValThr: 4.9 ± 0.643
ValVal: 3.762 ± 0.494
ValTrp: 1.137 ± 0.316
ValTyr: 2.45 ± 0.525
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.137 ± 0.331
TrpCys: 0.262 ± 0.154
TrpAsp: 0.525 ± 0.226
TrpGlu: 1.225 ± 0.375
TrpPhe: 0.875 ± 0.291
TrpGly: 0.7 ± 0.234
TrpHis: 0.35 ± 0.171
TrpIle: 0.7 ± 0.29
TrpLys: 0.525 ± 0.234
TrpLeu: 1.05 ± 0.291
TrpMet: 0.437 ± 0.174
TrpAsn: 1.4 ± 0.408
TrpPro: 0.087 ± 0.077
TrpGln: 0.962 ± 0.334
TrpArg: 0.612 ± 0.321
TrpSer: 1.05 ± 0.358
TrpThr: 1.225 ± 0.331
TrpVal: 0.962 ± 0.253
TrpTrp: 0.175 ± 0.112
TrpTyr: 0.262 ± 0.15
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.537 ± 0.438
TyrCys: 0.612 ± 0.235
TyrAsp: 2.887 ± 0.749
TyrGlu: 2.625 ± 0.597
TyrPhe: 1.137 ± 0.303
TyrGly: 1.925 ± 0.51
TyrHis: 1.05 ± 0.29
TyrIle: 1.925 ± 0.445
TyrLys: 2.012 ± 0.48
TyrLeu: 3.325 ± 0.49
TyrMet: 0.612 ± 0.253
TyrAsn: 1.4 ± 0.348
TyrPro: 1.137 ± 0.238
TyrGln: 1.662 ± 0.355
TyrArg: 1.837 ± 0.374
TyrSer: 2.275 ± 0.46
TyrThr: 1.75 ± 0.404
TyrVal: 1.837 ± 0.388
TyrTrp: 0.35 ± 0.164
TyrTyr: 0.962 ± 0.362
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 42 proteins (11430 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
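One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the standard deviation over the resamples is reported as the error. This is an assumed procedure for illustration only; the function names, the fixed seed and the choice of the sample standard deviation are not taken from the database.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MKLLE", "MELLK", "MAAAK", "MKEL"]
error_ll = bootstrap_error(toy, "LL")
print(f"LL: {pair_per_mille(toy, 'LL'):.3f} ± {error_ll:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.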

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski