Amino acid dipepetide frequency for Streptococcus phage Javan53

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.28AlaAla: 3.28 ± 1.376
0.437AlaCys: 0.437 ± 0.197
3.207AlaAsp: 3.207 ± 0.512
4.665AlaGlu: 4.665 ± 0.617
2.916AlaPhe: 2.916 ± 0.393
4.519AlaGly: 4.519 ± 0.836
0.875AlaHis: 0.875 ± 0.238
5.977AlaIle: 5.977 ± 1.049
6.633AlaLys: 6.633 ± 0.635
5.904AlaLeu: 5.904 ± 0.641
1.458AlaMet: 1.458 ± 0.304
4.519AlaAsn: 4.519 ± 0.488
1.166AlaPro: 1.166 ± 0.259
2.843AlaGln: 2.843 ± 0.39
3.134AlaArg: 3.134 ± 0.433
4.665AlaSer: 4.665 ± 0.879
3.936AlaThr: 3.936 ± 0.532
3.717AlaVal: 3.717 ± 0.588
0.948AlaTrp: 0.948 ± 0.258
2.333AlaTyr: 2.333 ± 0.422
0.0AlaXaa: 0.0 ± 0.0
Cys
0.146CysAla: 0.146 ± 0.104
0.073CysCys: 0.073 ± 0.078
0.437CysAsp: 0.437 ± 0.181
0.437CysGlu: 0.437 ± 0.197
0.219CysPhe: 0.219 ± 0.118
0.292CysGly: 0.292 ± 0.134
0.073CysHis: 0.073 ± 0.068
0.219CysIle: 0.219 ± 0.126
0.583CysLys: 0.583 ± 0.222
0.729CysLeu: 0.729 ± 0.257
0.146CysMet: 0.146 ± 0.1
0.292CysAsn: 0.292 ± 0.154
0.146CysPro: 0.146 ± 0.122
0.146CysGln: 0.146 ± 0.104
0.364CysArg: 0.364 ± 0.158
0.292CysSer: 0.292 ± 0.154
0.364CysThr: 0.364 ± 0.172
0.146CysVal: 0.146 ± 0.088
0.146CysTrp: 0.146 ± 0.105
0.292CysTyr: 0.292 ± 0.169
0.0CysXaa: 0.0 ± 0.0
Asp
2.697AspAla: 2.697 ± 0.426
0.364AspCys: 0.364 ± 0.161
4.373AspAsp: 4.373 ± 0.578
4.009AspGlu: 4.009 ± 0.62
3.645AspPhe: 3.645 ± 0.575
4.373AspGly: 4.373 ± 0.66
0.364AspHis: 0.364 ± 0.174
3.353AspIle: 3.353 ± 0.473
5.831AspLys: 5.831 ± 0.528
6.779AspLeu: 6.779 ± 0.571
1.822AspMet: 1.822 ± 0.358
4.665AspAsn: 4.665 ± 0.513
1.312AspPro: 1.312 ± 0.307
1.531AspGln: 1.531 ± 0.279
2.114AspArg: 2.114 ± 0.411
3.134AspSer: 3.134 ± 0.383
4.009AspThr: 4.009 ± 0.537
4.009AspVal: 4.009 ± 0.493
1.166AspTrp: 1.166 ± 0.325
3.353AspTyr: 3.353 ± 0.592
0.0AspXaa: 0.0 ± 0.0
Glu
5.613GluAla: 5.613 ± 0.671
0.219GluCys: 0.219 ± 0.164
3.207GluAsp: 3.207 ± 0.481
6.998GluGlu: 6.998 ± 0.868
2.624GluPhe: 2.624 ± 0.36
2.916GluGly: 2.916 ± 0.502
1.312GluHis: 1.312 ± 0.391
5.831GluIle: 5.831 ± 0.639
5.613GluLys: 5.613 ± 0.753
8.383GluLeu: 8.383 ± 0.931
1.677GluMet: 1.677 ± 0.386
4.592GluAsn: 4.592 ± 0.768
1.822GluPro: 1.822 ± 0.408
3.717GluGln: 3.717 ± 0.543
2.478GluArg: 2.478 ± 0.482
3.499GluSer: 3.499 ± 0.452
4.519GluThr: 4.519 ± 0.516
5.102GluVal: 5.102 ± 0.766
0.656GluTrp: 0.656 ± 0.204
3.426GluTyr: 3.426 ± 0.487
0.0GluXaa: 0.0 ± 0.0
Phe
1.677PheAla: 1.677 ± 0.296
0.292PheCys: 0.292 ± 0.107
3.572PheAsp: 3.572 ± 0.384
3.061PheGlu: 3.061 ± 0.498
1.312PhePhe: 1.312 ± 0.289
2.843PheGly: 2.843 ± 0.469
0.51PheHis: 0.51 ± 0.177
2.77PheIle: 2.77 ± 0.401
5.03PheLys: 5.03 ± 0.546
2.697PheLeu: 2.697 ± 0.383
0.802PheMet: 0.802 ± 0.275
2.405PheAsn: 2.405 ± 0.368
0.656PhePro: 0.656 ± 0.253
0.875PheGln: 0.875 ± 0.189
1.822PheArg: 1.822 ± 0.379
2.478PheSer: 2.478 ± 0.431
2.187PheThr: 2.187 ± 0.418
2.843PheVal: 2.843 ± 0.431
0.364PheTrp: 0.364 ± 0.134
1.312PheTyr: 1.312 ± 0.276
0.0PheXaa: 0.0 ± 0.0
Gly
5.321GlyAla: 5.321 ± 0.806
0.219GlyCys: 0.219 ± 0.144
3.499GlyAsp: 3.499 ± 0.779
3.426GlyGlu: 3.426 ± 0.624
2.187GlyPhe: 2.187 ± 0.393
2.77GlyGly: 2.77 ± 0.651
0.437GlyHis: 0.437 ± 0.174
4.665GlyIle: 4.665 ± 0.632
5.321GlyLys: 5.321 ± 0.724
5.102GlyLeu: 5.102 ± 0.887
1.677GlyMet: 1.677 ± 0.378
3.426GlyAsn: 3.426 ± 0.53
1.895GlyPro: 1.895 ± 1.257
1.677GlyGln: 1.677 ± 0.436
2.478GlyArg: 2.478 ± 0.489
2.916GlySer: 2.916 ± 0.447
3.061GlyThr: 3.061 ± 0.495
3.717GlyVal: 3.717 ± 0.598
0.948GlyTrp: 0.948 ± 0.232
2.697GlyTyr: 2.697 ± 0.461
0.0GlyXaa: 0.0 ± 0.0
His
0.875HisAla: 0.875 ± 0.227
0.219HisCys: 0.219 ± 0.128
0.875HisAsp: 0.875 ± 0.245
0.802HisGlu: 0.802 ± 0.257
0.656HisPhe: 0.656 ± 0.209
0.802HisGly: 0.802 ± 0.31
0.219HisHis: 0.219 ± 0.137
0.875HisIle: 0.875 ± 0.263
1.239HisLys: 1.239 ± 0.261
1.02HisLeu: 1.02 ± 0.253
0.364HisMet: 0.364 ± 0.146
0.948HisAsn: 0.948 ± 0.256
0.292HisPro: 0.292 ± 0.13
0.51HisGln: 0.51 ± 0.208
0.875HisArg: 0.875 ± 0.277
1.02HisSer: 1.02 ± 0.314
1.166HisThr: 1.166 ± 0.31
1.166HisVal: 1.166 ± 0.273
0.146HisTrp: 0.146 ± 0.11
0.802HisTyr: 0.802 ± 0.245
0.0HisXaa: 0.0 ± 0.0
Ile
4.957IleAla: 4.957 ± 0.65
0.219IleCys: 0.219 ± 0.122
4.811IleAsp: 4.811 ± 0.605
4.957IleGlu: 4.957 ± 0.714
2.333IlePhe: 2.333 ± 0.52
4.082IleGly: 4.082 ± 0.691
1.312IleHis: 1.312 ± 0.334
4.884IleIle: 4.884 ± 0.612
7.07IleLys: 7.07 ± 0.696
5.613IleLeu: 5.613 ± 0.694
1.531IleMet: 1.531 ± 0.478
4.446IleAsn: 4.446 ± 0.539
2.114IlePro: 2.114 ± 0.385
2.697IleGln: 2.697 ± 0.4
1.968IleArg: 1.968 ± 0.365
4.884IleSer: 4.884 ± 0.627
4.228IleThr: 4.228 ± 0.563
3.353IleVal: 3.353 ± 0.579
0.802IleTrp: 0.802 ± 0.237
2.989IleTyr: 2.989 ± 0.507
0.0IleXaa: 0.0 ± 0.0
Lys
6.706LysAla: 6.706 ± 0.808
0.364LysCys: 0.364 ± 0.164
5.248LysAsp: 5.248 ± 0.629
7.362LysGlu: 7.362 ± 0.764
2.697LysPhe: 2.697 ± 0.486
5.102LysGly: 5.102 ± 0.97
1.895LysHis: 1.895 ± 0.353
5.175LysIle: 5.175 ± 0.552
7.872LysLys: 7.872 ± 1.003
8.601LysLeu: 8.601 ± 0.606
3.426LysMet: 3.426 ± 0.53
5.102LysAsn: 5.102 ± 0.451
3.28LysPro: 3.28 ± 0.635
4.519LysGln: 4.519 ± 0.515
5.03LysArg: 5.03 ± 0.734
5.03LysSer: 5.03 ± 0.455
6.123LysThr: 6.123 ± 0.784
5.686LysVal: 5.686 ± 0.736
1.312LysTrp: 1.312 ± 0.334
3.28LysTyr: 3.28 ± 0.452
0.0LysXaa: 0.0 ± 0.0
Leu
6.56LeuAla: 6.56 ± 0.914
0.656LeuCys: 0.656 ± 0.207
5.686LeuAsp: 5.686 ± 0.801
8.747LeuGlu: 8.747 ± 0.902
2.405LeuPhe: 2.405 ± 0.33
5.613LeuGly: 5.613 ± 0.474
1.093LeuHis: 1.093 ± 0.28
4.957LeuIle: 4.957 ± 0.65
9.403LeuLys: 9.403 ± 0.706
5.175LeuLeu: 5.175 ± 0.56
1.749LeuMet: 1.749 ± 0.377
5.977LeuAsn: 5.977 ± 0.507
2.041LeuPro: 2.041 ± 0.311
2.843LeuGln: 2.843 ± 0.528
4.228LeuArg: 4.228 ± 0.666
7.143LeuSer: 7.143 ± 0.652
5.102LeuThr: 5.102 ± 0.546
4.665LeuVal: 4.665 ± 0.666
0.802LeuTrp: 0.802 ± 0.231
2.333LeuTyr: 2.333 ± 0.381
0.0LeuXaa: 0.0 ± 0.0
Met
2.478MetAla: 2.478 ± 0.355
0.219MetCys: 0.219 ± 0.126
1.531MetAsp: 1.531 ± 0.316
1.604MetGlu: 1.604 ± 0.385
0.875MetPhe: 0.875 ± 0.215
0.802MetGly: 0.802 ± 0.208
0.656MetHis: 0.656 ± 0.193
1.895MetIle: 1.895 ± 0.303
1.968MetLys: 1.968 ± 0.444
1.312MetLeu: 1.312 ± 0.298
0.292MetMet: 0.292 ± 0.147
0.875MetAsn: 0.875 ± 0.305
0.583MetPro: 0.583 ± 0.231
1.166MetGln: 1.166 ± 0.282
1.02MetArg: 1.02 ± 0.287
1.968MetSer: 1.968 ± 0.487
2.041MetThr: 2.041 ± 0.453
1.312MetVal: 1.312 ± 0.277
0.0MetTrp: 0.0 ± 0.0
0.802MetTyr: 0.802 ± 0.261
0.0MetXaa: 0.0 ± 0.0
Asn
3.936AsnAla: 3.936 ± 0.622
0.51AsnCys: 0.51 ± 0.195
2.843AsnAsp: 2.843 ± 0.48
3.936AsnGlu: 3.936 ± 0.613
3.134AsnPhe: 3.134 ± 0.424
4.301AsnGly: 4.301 ± 0.561
0.583AsnHis: 0.583 ± 0.213
4.155AsnIle: 4.155 ± 0.474
4.155AsnLys: 4.155 ± 0.519
5.977AsnLeu: 5.977 ± 0.706
1.166AsnMet: 1.166 ± 0.268
3.207AsnAsn: 3.207 ± 0.49
2.405AsnPro: 2.405 ± 0.385
3.207AsnGln: 3.207 ± 0.716
3.207AsnArg: 3.207 ± 0.377
3.572AsnSer: 3.572 ± 0.482
2.697AsnThr: 2.697 ± 0.487
2.916AsnVal: 2.916 ± 0.511
0.802AsnTrp: 0.802 ± 0.325
2.26AsnTyr: 2.26 ± 0.359
0.0AsnXaa: 0.0 ± 0.0
Pro
1.677ProAla: 1.677 ± 0.427
0.0ProCys: 0.0 ± 0.0
1.895ProAsp: 1.895 ± 0.401
1.239ProGlu: 1.239 ± 0.318
1.093ProPhe: 1.093 ± 0.283
0.875ProGly: 0.875 ± 0.339
0.802ProHis: 0.802 ± 0.269
1.604ProIle: 1.604 ± 0.277
2.624ProLys: 2.624 ± 0.631
2.041ProLeu: 2.041 ± 0.423
0.729ProMet: 0.729 ± 0.252
1.749ProAsn: 1.749 ± 0.319
0.583ProPro: 0.583 ± 0.26
1.239ProGln: 1.239 ± 0.385
1.093ProArg: 1.093 ± 0.316
2.405ProSer: 2.405 ± 0.452
1.531ProThr: 1.531 ± 0.358
1.531ProVal: 1.531 ± 0.299
0.073ProTrp: 0.073 ± 0.063
1.531ProTyr: 1.531 ± 0.405
0.0ProXaa: 0.0 ± 0.0
Gln
3.134GlnAla: 3.134 ± 0.516
0.219GlnCys: 0.219 ± 0.108
1.895GlnAsp: 1.895 ± 0.408
3.207GlnGlu: 3.207 ± 0.497
1.604GlnPhe: 1.604 ± 0.385
2.333GlnGly: 2.333 ± 0.498
0.146GlnHis: 0.146 ± 0.086
3.717GlnIle: 3.717 ± 0.618
3.936GlnLys: 3.936 ± 0.49
4.373GlnLeu: 4.373 ± 0.46
0.437GlnMet: 0.437 ± 0.191
2.989GlnAsn: 2.989 ± 0.454
0.656GlnPro: 0.656 ± 0.254
1.968GlnGln: 1.968 ± 0.635
1.604GlnArg: 1.604 ± 0.461
3.717GlnSer: 3.717 ± 0.502
2.405GlnThr: 2.405 ± 0.437
2.405GlnVal: 2.405 ± 0.338
0.364GlnTrp: 0.364 ± 0.154
1.239GlnTyr: 1.239 ± 0.327
0.0GlnXaa: 0.0 ± 0.0
Arg
2.916ArgAla: 2.916 ± 0.512
0.364ArgCys: 0.364 ± 0.154
2.843ArgAsp: 2.843 ± 0.413
2.916ArgGlu: 2.916 ± 0.442
1.02ArgPhe: 1.02 ± 0.245
1.749ArgGly: 1.749 ± 0.426
0.802ArgHis: 0.802 ± 0.254
3.207ArgIle: 3.207 ± 0.596
4.228ArgLys: 4.228 ± 0.677
4.446ArgLeu: 4.446 ± 0.619
1.895ArgMet: 1.895 ± 0.345
2.26ArgAsn: 2.26 ± 0.413
0.802ArgPro: 0.802 ± 0.26
1.968ArgGln: 1.968 ± 0.282
2.26ArgArg: 2.26 ± 0.453
1.749ArgSer: 1.749 ± 0.321
2.041ArgThr: 2.041 ± 0.382
2.697ArgVal: 2.697 ± 0.485
0.437ArgTrp: 0.437 ± 0.225
1.968ArgTyr: 1.968 ± 0.406
0.0ArgXaa: 0.0 ± 0.0
Ser
4.301SerAla: 4.301 ± 0.82
0.073SerCys: 0.073 ± 0.083
4.811SerAsp: 4.811 ± 0.56
4.009SerGlu: 4.009 ± 0.514
3.426SerPhe: 3.426 ± 0.485
4.446SerGly: 4.446 ± 0.632
1.093SerHis: 1.093 ± 0.246
5.321SerIle: 5.321 ± 0.544
6.414SerLys: 6.414 ± 0.798
4.519SerLeu: 4.519 ± 0.853
1.239SerMet: 1.239 ± 0.341
3.207SerAsn: 3.207 ± 0.402
1.166SerPro: 1.166 ± 0.291
3.79SerGln: 3.79 ± 0.469
2.187SerArg: 2.187 ± 0.355
3.645SerSer: 3.645 ± 0.621
3.353SerThr: 3.353 ± 0.408
3.207SerVal: 3.207 ± 0.541
0.364SerTrp: 0.364 ± 0.16
2.697SerTyr: 2.697 ± 0.472
0.0SerXaa: 0.0 ± 0.0
Thr
3.936ThrAla: 3.936 ± 0.76
0.292ThrCys: 0.292 ± 0.122
3.936ThrAsp: 3.936 ± 0.482
4.811ThrGlu: 4.811 ± 0.578
3.207ThrPhe: 3.207 ± 0.45
4.811ThrGly: 4.811 ± 1.261
0.875ThrHis: 0.875 ± 0.25
4.301ThrIle: 4.301 ± 0.578
6.633ThrLys: 6.633 ± 0.904
5.321ThrLeu: 5.321 ± 0.606
0.802ThrMet: 0.802 ± 0.335
2.405ThrAsn: 2.405 ± 0.455
2.26ThrPro: 2.26 ± 0.32
2.697ThrGln: 2.697 ± 0.513
1.968ThrArg: 1.968 ± 0.417
3.061ThrSer: 3.061 ± 0.53
3.79ThrThr: 3.79 ± 0.747
3.717ThrVal: 3.717 ± 0.602
0.073ThrTrp: 0.073 ± 0.067
2.77ThrTyr: 2.77 ± 0.617
0.0ThrXaa: 0.0 ± 0.0
Val
4.155ValAla: 4.155 ± 0.772
0.437ValCys: 0.437 ± 0.165
4.009ValAsp: 4.009 ± 0.509
4.811ValGlu: 4.811 ± 0.559
1.968ValPhe: 1.968 ± 0.425
2.333ValGly: 2.333 ± 0.487
0.802ValHis: 0.802 ± 0.259
3.717ValIle: 3.717 ± 0.646
4.665ValLys: 4.665 ± 0.764
4.301ValLeu: 4.301 ± 0.615
1.312ValMet: 1.312 ± 0.366
3.426ValAsn: 3.426 ± 0.59
1.239ValPro: 1.239 ± 0.369
2.114ValGln: 2.114 ± 0.374
2.551ValArg: 2.551 ± 0.46
4.738ValSer: 4.738 ± 0.56
5.54ValThr: 5.54 ± 0.788
3.863ValVal: 3.863 ± 0.602
0.583ValTrp: 0.583 ± 0.22
1.968ValTyr: 1.968 ± 0.402
0.0ValXaa: 0.0 ± 0.0
Trp
0.729TrpAla: 0.729 ± 0.228
0.073TrpCys: 0.073 ± 0.072
0.437TrpAsp: 0.437 ± 0.186
0.729TrpGlu: 0.729 ± 0.179
0.437TrpPhe: 0.437 ± 0.172
0.583TrpGly: 0.583 ± 0.346
0.292TrpHis: 0.292 ± 0.146
0.51TrpIle: 0.51 ± 0.15
1.093TrpLys: 1.093 ± 0.239
1.166TrpLeu: 1.166 ± 0.394
0.073TrpMet: 0.073 ± 0.066
0.437TrpAsn: 0.437 ± 0.197
0.0TrpPro: 0.0 ± 0.0
0.729TrpGln: 0.729 ± 0.236
0.437TrpArg: 0.437 ± 0.181
0.948TrpSer: 0.948 ± 0.221
0.656TrpThr: 0.656 ± 0.286
0.437TrpVal: 0.437 ± 0.187
0.364TrpTrp: 0.364 ± 0.158
0.51TrpTyr: 0.51 ± 0.185
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.041TyrAla: 2.041 ± 0.521
0.292TyrCys: 0.292 ± 0.133
3.79TyrAsp: 3.79 ± 0.46
2.551TyrGlu: 2.551 ± 0.395
1.749TyrPhe: 1.749 ± 0.406
1.895TyrGly: 1.895 ± 0.321
0.729TyrHis: 0.729 ± 0.257
2.114TyrIle: 2.114 ± 0.403
3.499TyrLys: 3.499 ± 0.507
3.717TyrLeu: 3.717 ± 0.627
0.729TyrMet: 0.729 ± 0.174
2.041TyrAsn: 2.041 ± 0.446
1.895TyrPro: 1.895 ± 0.361
2.114TyrGln: 2.114 ± 0.396
1.749TyrArg: 1.749 ± 0.368
2.405TyrSer: 2.405 ± 0.533
2.916TyrThr: 2.916 ± 0.627
2.041TyrVal: 2.041 ± 0.402
0.292TyrTrp: 0.292 ± 0.146
1.895TyrTyr: 1.895 ± 0.404
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (13720 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski