Amino acid dipeptide frequency for Streptococcus phage Javan422

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown residues). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
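
The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used by Proteome-pI: the function name dipeptide_permille, the toy sequences, and the choice to normalise by the total number of dipeptides counted are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MKKLE", "MKLEE"]
freqs = dipeptide_permille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform = 1000.0 / 400
```

With real data, the input would be the list of protein sequences of the proteome, and each resulting value could be compared against the uniform expectation held in uniform.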

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
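
As a small worked example of the unit conversion, the sketch below turns two values from the table (GluLys at 8.108‰ and CysThr at 0.081‰) into plain fractions and compares them with the uniform expectation of 1/400 = 2.5‰. The helper names are hypothetical, and using the uniform expectation as the over/under threshold is an assumption based on the description above, not a documented rule of the database.

```python
# Uniform expectation for one of 400 dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PERMILLE):
    """Label a per mille value relative to the (assumed) uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


glu_lys = 8.108  # GluLys, per mille, from the table
cys_thr = 0.081  # CysThr, per mille, from the table

glu_lys_fraction = permille_to_fraction(glu_lys)
glu_lys_label = classify(glu_lys)
cys_thr_label = classify(cys_thr)
```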

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.324 ± 0.938
AlaCys: 0.649 ± 0.227
AlaAsp: 3.648 ± 0.592
AlaGlu: 3.324 ± 0.462
AlaPhe: 3.324 ± 0.465
AlaGly: 4.135 ± 0.893
AlaHis: 0.892 ± 0.269
AlaIle: 6.081 ± 0.841
AlaLys: 5.027 ± 0.659
AlaLeu: 5.27 ± 0.801
AlaMet: 1.622 ± 0.55
AlaAsn: 2.513 ± 0.442
AlaPro: 1.054 ± 0.275
AlaGln: 1.622 ± 0.406
AlaArg: 2.838 ± 0.355
AlaSer: 3.73 ± 0.53
AlaThr: 3.892 ± 0.925
AlaVal: 3.162 ± 0.617
AlaTrp: 0.73 ± 0.27
AlaTyr: 2.838 ± 0.439
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.811 ± 0.203
CysCys: 0.243 ± 0.115
CysAsp: 0.73 ± 0.273
CysGlu: 0.568 ± 0.197
CysPhe: 0.486 ± 0.189
CysGly: 1.378 ± 0.289
CysHis: 0.243 ± 0.143
CysIle: 0.973 ± 0.275
CysLys: 0.73 ± 0.232
CysLeu: 0.568 ± 0.207
CysMet: 0.162 ± 0.101
CysAsn: 0.162 ± 0.099
CysPro: 0.405 ± 0.174
CysGln: 0.73 ± 0.208
CysArg: 0.405 ± 0.263
CysSer: 0.973 ± 0.248
CysThr: 0.081 ± 0.077
CysVal: 0.568 ± 0.23
CysTrp: 0.0 ± 0.0
CysTyr: 0.649 ± 0.248
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.405 ± 0.52
AspCys: 0.811 ± 0.343
AspAsp: 3.081 ± 0.717
AspGlu: 4.54 ± 0.643
AspPhe: 2.838 ± 0.392
AspGly: 3.811 ± 0.561
AspHis: 0.892 ± 0.219
AspIle: 4.216 ± 0.6
AspLys: 5.351 ± 0.717
AspLeu: 4.621 ± 0.807
AspMet: 1.54 ± 0.348
AspAsn: 2.594 ± 0.502
AspPro: 1.378 ± 0.286
AspGln: 1.297 ± 0.296
AspArg: 2.108 ± 0.509
AspSer: 3.0 ± 0.491
AspThr: 3.0 ± 0.562
AspVal: 3.811 ± 0.55
AspTrp: 0.649 ± 0.187
AspTyr: 2.594 ± 0.531
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.73 ± 0.526
GluCys: 1.054 ± 0.338
GluAsp: 3.405 ± 0.543
GluGlu: 7.297 ± 0.784
GluPhe: 1.946 ± 0.495
GluGly: 4.784 ± 0.532
GluHis: 1.135 ± 0.34
GluIle: 5.189 ± 0.807
GluLys: 8.108 ± 0.849
GluLeu: 7.864 ± 0.795
GluMet: 2.432 ± 0.387
GluAsn: 4.54 ± 0.737
GluPro: 1.378 ± 0.331
GluGln: 2.919 ± 0.532
GluArg: 3.081 ± 0.606
GluSer: 4.459 ± 0.59
GluThr: 3.73 ± 0.501
GluVal: 5.513 ± 0.666
GluTrp: 0.892 ± 0.272
GluTyr: 2.513 ± 0.493
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.946 ± 0.293
PheCys: 0.486 ± 0.197
PheAsp: 2.676 ± 0.631
PheGlu: 3.486 ± 0.552
PhePhe: 1.378 ± 0.358
PheGly: 3.486 ± 0.605
PheHis: 0.73 ± 0.199
PheIle: 2.108 ± 0.351
PheLys: 3.486 ± 0.553
PheLeu: 3.081 ± 0.441
PheMet: 0.649 ± 0.206
PheAsn: 2.919 ± 0.404
PhePro: 1.216 ± 0.288
PheGln: 1.297 ± 0.288
PheArg: 1.622 ± 0.341
PheSer: 2.676 ± 0.436
PheThr: 2.027 ± 0.519
PheVal: 1.622 ± 0.374
PheTrp: 0.405 ± 0.16
PheTyr: 1.297 ± 0.285
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.459 ± 0.958
GlyCys: 0.405 ± 0.158
GlyAsp: 3.486 ± 0.583
GlyGlu: 3.811 ± 0.603
GlyPhe: 2.351 ± 0.529
GlyGly: 4.297 ± 0.8
GlyHis: 0.973 ± 0.222
GlyIle: 4.135 ± 1.156
GlyLys: 6.162 ± 0.52
GlyLeu: 4.865 ± 0.769
GlyMet: 2.27 ± 0.425
GlyAsn: 4.297 ± 0.666
GlyPro: 0.568 ± 0.2
GlyGln: 1.865 ± 0.303
GlyArg: 3.811 ± 0.704
GlySer: 3.405 ± 0.621
GlyThr: 3.892 ± 0.631
GlyVal: 3.405 ± 0.623
GlyTrp: 0.649 ± 0.208
GlyTyr: 2.838 ± 0.573
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.054 ± 0.279
HisCys: 0.081 ± 0.083
HisAsp: 0.892 ± 0.326
HisGlu: 1.216 ± 0.312
HisPhe: 0.811 ± 0.31
HisGly: 1.054 ± 0.311
HisHis: 0.892 ± 0.221
HisIle: 1.054 ± 0.282
HisLys: 1.378 ± 0.451
HisLeu: 1.946 ± 0.306
HisMet: 0.486 ± 0.22
HisAsn: 1.054 ± 0.32
HisPro: 0.973 ± 0.256
HisGln: 0.73 ± 0.279
HisArg: 0.811 ± 0.234
HisSer: 1.216 ± 0.4
HisThr: 1.378 ± 0.284
HisVal: 1.135 ± 0.254
HisTrp: 0.081 ± 0.088
HisTyr: 0.568 ± 0.262
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.243 ± 0.757
IleCys: 0.973 ± 0.239
IleAsp: 4.702 ± 0.68
IleGlu: 6.892 ± 0.628
IlePhe: 2.513 ± 0.535
IleGly: 3.486 ± 0.583
IleHis: 1.135 ± 0.406
IleIle: 5.108 ± 0.719
IleLys: 6.405 ± 1.092
IleLeu: 6.243 ± 0.695
IleMet: 1.378 ± 0.325
IleAsn: 3.405 ± 0.534
IlePro: 2.919 ± 0.568
IleGln: 3.0 ± 0.556
IleArg: 3.081 ± 0.471
IleSer: 4.946 ± 0.777
IleThr: 4.784 ± 0.642
IleVal: 4.216 ± 0.741
IleTrp: 1.622 ± 0.354
IleTyr: 3.0 ± 0.307
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.54 ± 0.75
LysCys: 0.892 ± 0.25
LysAsp: 4.54 ± 0.652
LysGlu: 8.189 ± 0.882
LysPhe: 3.0 ± 0.426
LysGly: 4.459 ± 0.539
LysHis: 1.622 ± 0.346
LysIle: 6.324 ± 0.791
LysLys: 6.648 ± 1.087
LysLeu: 8.594 ± 0.794
LysMet: 2.189 ± 0.437
LysAsn: 4.135 ± 0.684
LysPro: 2.594 ± 0.417
LysGln: 4.621 ± 0.667
LysArg: 3.567 ± 0.632
LysSer: 6.648 ± 0.699
LysThr: 3.811 ± 0.598
LysVal: 5.675 ± 0.828
LysTrp: 1.297 ± 0.37
LysTyr: 2.919 ± 0.398
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.216 ± 0.524
LeuCys: 0.892 ± 0.243
LeuAsp: 6.892 ± 0.632
LeuGlu: 7.54 ± 0.89
LeuPhe: 3.405 ± 0.496
LeuGly: 5.108 ± 0.779
LeuHis: 1.054 ± 0.308
LeuIle: 5.432 ± 0.609
LeuLys: 7.054 ± 0.777
LeuLeu: 6.648 ± 0.744
LeuMet: 1.703 ± 0.442
LeuAsn: 4.054 ± 0.551
LeuPro: 3.081 ± 0.53
LeuGln: 3.892 ± 0.606
LeuArg: 3.648 ± 0.511
LeuSer: 7.621 ± 0.599
LeuThr: 5.838 ± 0.613
LeuVal: 5.838 ± 0.572
LeuTrp: 0.649 ± 0.212
LeuTyr: 3.486 ± 0.659
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.27 ± 0.438
MetCys: 0.162 ± 0.107
MetAsp: 1.297 ± 0.286
MetGlu: 1.54 ± 0.429
MetPhe: 0.811 ± 0.312
MetGly: 1.297 ± 0.31
MetHis: 0.243 ± 0.161
MetIle: 1.135 ± 0.371
MetLys: 2.919 ± 0.465
MetLeu: 1.784 ± 0.513
MetMet: 1.216 ± 0.387
MetAsn: 1.297 ± 0.385
MetPro: 1.054 ± 0.298
MetGln: 0.568 ± 0.225
MetArg: 0.568 ± 0.176
MetSer: 1.622 ± 0.284
MetThr: 2.027 ± 0.591
MetVal: 0.973 ± 0.302
MetTrp: 0.405 ± 0.174
MetTyr: 0.811 ± 0.215
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.297 ± 0.587
AsnCys: 0.73 ± 0.228
AsnAsp: 2.027 ± 0.469
AsnGlu: 3.162 ± 0.456
AsnPhe: 1.297 ± 0.357
AsnGly: 5.432 ± 0.804
AsnHis: 1.135 ± 0.303
AsnIle: 4.378 ± 0.61
AsnLys: 4.54 ± 0.683
AsnLeu: 4.378 ± 0.647
AsnMet: 1.216 ± 0.337
AsnAsn: 2.757 ± 0.604
AsnPro: 1.865 ± 0.42
AsnGln: 2.676 ± 0.562
AsnArg: 1.865 ± 0.427
AsnSer: 3.811 ± 0.527
AsnThr: 3.892 ± 0.53
AsnVal: 2.513 ± 0.586
AsnTrp: 0.892 ± 0.292
AsnTyr: 1.459 ± 0.38
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.973 ± 0.218
ProCys: 0.324 ± 0.179
ProAsp: 1.297 ± 0.304
ProGlu: 1.459 ± 0.355
ProPhe: 1.54 ± 0.371
ProGly: 1.216 ± 0.326
ProHis: 0.811 ± 0.237
ProIle: 2.838 ± 0.396
ProLys: 1.865 ± 0.363
ProLeu: 2.919 ± 0.369
ProMet: 0.405 ± 0.237
ProAsn: 1.622 ± 0.338
ProPro: 0.973 ± 0.259
ProGln: 1.054 ± 0.239
ProArg: 1.297 ± 0.278
ProSer: 2.108 ± 0.371
ProThr: 2.432 ± 0.456
ProVal: 2.189 ± 0.417
ProTrp: 0.243 ± 0.136
ProTyr: 1.297 ± 0.385
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.838 ± 0.417
GlnCys: 0.243 ± 0.134
GlnAsp: 1.784 ± 0.334
GlnGlu: 3.243 ± 0.644
GlnPhe: 2.594 ± 0.502
GlnGly: 2.108 ± 0.451
GlnHis: 0.405 ± 0.183
GlnIle: 2.757 ± 0.515
GlnLys: 3.892 ± 0.462
GlnLeu: 4.054 ± 0.58
GlnMet: 1.054 ± 0.304
GlnAsn: 2.27 ± 0.541
GlnPro: 0.973 ± 0.228
GlnGln: 2.108 ± 0.494
GlnArg: 1.297 ± 0.296
GlnSer: 2.919 ± 0.405
GlnThr: 1.865 ± 0.366
GlnVal: 2.594 ± 0.53
GlnTrp: 0.568 ± 0.27
GlnTyr: 0.811 ± 0.246
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.865 ± 0.368
ArgCys: 0.892 ± 0.315
ArgAsp: 1.784 ± 0.364
ArgGlu: 2.919 ± 0.501
ArgPhe: 1.703 ± 0.291
ArgGly: 2.108 ± 0.407
ArgHis: 0.568 ± 0.212
ArgIle: 3.405 ± 0.557
ArgLys: 3.892 ± 0.693
ArgLeu: 3.811 ± 0.527
ArgMet: 0.73 ± 0.224
ArgAsn: 2.757 ± 0.536
ArgPro: 0.973 ± 0.257
ArgGln: 1.946 ± 0.442
ArgArg: 1.297 ± 0.442
ArgSer: 2.351 ± 0.536
ArgThr: 2.027 ± 0.506
ArgVal: 3.0 ± 0.551
ArgTrp: 0.892 ± 0.314
ArgTyr: 1.703 ± 0.394
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.811 ± 0.479
SerCys: 0.568 ± 0.25
SerAsp: 3.892 ± 0.747
SerGlu: 4.54 ± 0.617
SerPhe: 2.513 ± 0.531
SerGly: 4.216 ± 0.854
SerHis: 1.378 ± 0.392
SerIle: 5.756 ± 0.918
SerLys: 5.432 ± 0.643
SerLeu: 5.756 ± 0.614
SerMet: 1.703 ± 0.369
SerAsn: 4.784 ± 0.648
SerPro: 2.513 ± 0.445
SerGln: 2.189 ± 0.445
SerArg: 2.838 ± 0.559
SerSer: 5.027 ± 0.582
SerThr: 4.459 ± 1.121
SerVal: 5.108 ± 0.9
SerTrp: 0.73 ± 0.211
SerTyr: 2.027 ± 0.41
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.081 ± 0.864
ThrCys: 0.486 ± 0.199
ThrAsp: 3.0 ± 0.529
ThrGlu: 3.567 ± 0.584
ThrPhe: 1.703 ± 0.361
ThrGly: 3.243 ± 0.698
ThrHis: 2.189 ± 0.431
ThrIle: 6.486 ± 1.215
ThrLys: 4.702 ± 0.514
ThrLeu: 6.162 ± 0.797
ThrMet: 1.378 ± 0.406
ThrAsn: 3.162 ± 0.593
ThrPro: 1.784 ± 0.441
ThrGln: 2.108 ± 0.457
ThrArg: 1.946 ± 0.388
ThrSer: 3.243 ± 0.757
ThrThr: 4.054 ± 1.262
ThrVal: 3.811 ± 0.578
ThrTrp: 1.216 ± 0.359
ThrTyr: 1.865 ± 0.482
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.892 ± 0.582
ValCys: 0.486 ± 0.191
ValAsp: 3.567 ± 0.41
ValGlu: 4.702 ± 0.672
ValPhe: 1.946 ± 0.347
ValGly: 3.486 ± 0.839
ValHis: 1.216 ± 0.321
ValIle: 4.865 ± 0.496
ValLys: 5.189 ± 0.667
ValLeu: 5.675 ± 0.541
ValMet: 0.973 ± 0.297
ValAsn: 3.081 ± 0.691
ValPro: 2.027 ± 0.346
ValGln: 2.351 ± 0.47
ValArg: 2.27 ± 0.441
ValSer: 4.946 ± 0.649
ValThr: 3.081 ± 0.539
ValVal: 3.405 ± 0.636
ValTrp: 1.054 ± 0.444
ValTyr: 2.757 ± 0.475
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.649 ± 0.311
TrpCys: 0.324 ± 0.163
TrpAsp: 0.405 ± 0.15
TrpGlu: 1.297 ± 0.403
TrpPhe: 0.568 ± 0.169
TrpGly: 0.892 ± 0.258
TrpHis: 0.486 ± 0.195
TrpIle: 0.892 ± 0.227
TrpLys: 0.73 ± 0.227
TrpLeu: 0.973 ± 0.267
TrpMet: 0.324 ± 0.179
TrpAsn: 0.73 ± 0.236
TrpPro: 0.081 ± 0.088
TrpGln: 0.973 ± 0.247
TrpArg: 0.649 ± 0.299
TrpSer: 1.378 ± 0.389
TrpThr: 0.811 ± 0.211
TrpVal: 0.486 ± 0.166
TrpTrp: 0.081 ± 0.077
TrpTyr: 0.649 ± 0.302
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.27 ± 0.433
TyrCys: 0.162 ± 0.118
TyrAsp: 2.432 ± 0.47
TyrGlu: 2.757 ± 0.443
TyrPhe: 1.865 ± 0.382
TyrGly: 1.946 ± 0.527
TyrHis: 0.811 ± 0.219
TyrIle: 2.919 ± 0.478
TyrLys: 2.757 ± 0.357
TyrLeu: 2.838 ± 0.485
TyrMet: 0.486 ± 0.183
TyrAsn: 2.189 ± 0.417
TyrPro: 1.054 ± 0.307
TyrGln: 2.432 ± 0.417
TyrArg: 1.622 ± 0.328
TyrSer: 3.162 ± 0.431
TyrThr: 2.108 ± 0.381
TyrVal: 1.946 ± 0.517
TyrTrp: 0.324 ± 0.174
TyrTyr: 1.216 ± 0.265
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (12335 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
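
One common way to obtain such an error is to resample whole proteins with replacement and take the standard deviation of the statistic across replicates. The sketch below follows that reading; the exact procedure used by Proteome-pI is not described here, so the function names, the fixed seed, the toy sequences, and the use of the sample standard deviation are all assumptions.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, pooled over the proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteins in one-letter code.
toy = ["MKKLE", "MKLEE", "AAKKA"]
kk_error = bootstrap_error(toy, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins in the proteome.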

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski