Amino acid dipeptide frequency for Salmonella phage 29485

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. the frequency of each ordered pair of adjacent amino acids.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
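As an illustration of the counting described above, here is a minimal Python sketch. It assumes overlapping pairs are counted within each protein (never across protein boundaries) and that any non-standard letter is mapped to X (Xaa); the page does not state how the database handles these details, and the function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: pairs overlap and are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_UNIFORM = 1000.0 / 400

# Toy input, not real phage proteins.
toy = ["MKAAL", "AAGX"]
freqs = dipeptide_per_mille(toy)
over = sorted(d for d, f in freqs.items() if f > EXPECTED_UNIFORM)
print(over)
```

A dipeptide whose value exceeds `EXPECTED_UNIFORM` corresponds to an overrepresented (red) cell, and one below it to an underrepresented (blue) cell, under the uniform assumption stated in the text.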

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
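The unit conversion can be sketched in a few lines of Python. The two example values (AlaAla and CysCys) are taken from the table on this page; the helper names are hypothetical, and the comparison uses the uniform expectation of 1/400 for the standard dipeptides.

```python
import math

# Uniform expectation for the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / (20 * 20)  # 2.5


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value):
    """Ratio of an observed per-mille value to the uniform expectation."""
    return value / EXPECTED_PER_MILLE


ala_ala = 9.854  # AlaAla, per mille, from the table
cys_cys = 0.065  # CysCys, per mille, from the table

print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(fold_vs_uniform(ala_ala) > 1)    # overrepresented relative to uniform
print(fold_vs_uniform(cys_cys) < 1)    # underrepresented relative to uniform
```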

For more information see sequence space article on Wikipedia.

Dipeptides are grouped by first residue; within each group the second residue follows the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.854 ± 1.362
AlaCys: 1.044 ± 0.276
AlaAsp: 5.547 ± 0.667
AlaGlu: 6.591 ± 0.611
AlaPhe: 3.198 ± 0.408
AlaGly: 6.33 ± 0.652
AlaHis: 1.109 ± 0.202
AlaIle: 5.874 ± 0.573
AlaLys: 6.069 ± 0.728
AlaLeu: 7.113 ± 0.775
AlaMet: 3.198 ± 0.381
AlaAsn: 4.111 ± 0.638
AlaPro: 2.219 ± 0.38
AlaGln: 3.002 ± 0.411
AlaArg: 5.417 ± 0.685
AlaSer: 5.351 ± 0.821
AlaThr: 5.156 ± 0.607
AlaVal: 6.004 ± 0.586
AlaTrp: 2.415 ± 0.41
AlaTyr: 2.741 ± 0.428
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.979 ± 0.267
CysCys: 0.065 ± 0.056
CysAsp: 0.718 ± 0.172
CysGlu: 0.783 ± 0.243
CysPhe: 0.131 ± 0.091
CysGly: 1.109 ± 0.287
CysHis: 0.261 ± 0.132
CysIle: 0.718 ± 0.266
CysLys: 0.783 ± 0.232
CysLeu: 0.718 ± 0.242
CysMet: 0.261 ± 0.119
CysAsn: 0.587 ± 0.186
CysPro: 0.522 ± 0.158
CysGln: 0.587 ± 0.224
CysArg: 1.044 ± 0.245
CysSer: 0.783 ± 0.226
CysThr: 0.848 ± 0.205
CysVal: 0.718 ± 0.218
CysTrp: 0.326 ± 0.142
CysTyr: 0.326 ± 0.139
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.396 ± 0.668
AspCys: 0.522 ± 0.177
AspAsp: 3.524 ± 0.445
AspGlu: 4.177 ± 0.506
AspPhe: 2.023 ± 0.323
AspGly: 6.004 ± 0.657
AspHis: 0.587 ± 0.18
AspIle: 4.373 ± 0.683
AspLys: 3.198 ± 0.445
AspLeu: 3.394 ± 0.556
AspMet: 1.632 ± 0.345
AspAsn: 3.002 ± 0.611
AspPro: 1.958 ± 0.401
AspGln: 2.284 ± 0.431
AspArg: 2.154 ± 0.354
AspSer: 3.589 ± 0.535
AspThr: 2.088 ± 0.457
AspVal: 3.785 ± 0.602
AspTrp: 0.979 ± 0.269
AspTyr: 2.023 ± 0.392
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.612 ± 0.66
GluCys: 0.979 ± 0.261
GluAsp: 3.133 ± 0.593
GluGlu: 4.242 ± 0.694
GluPhe: 3.133 ± 0.466
GluGly: 3.85 ± 0.444
GluHis: 0.718 ± 0.199
GluIle: 4.373 ± 0.506
GluLys: 4.438 ± 0.597
GluLeu: 5.547 ± 0.555
GluMet: 2.219 ± 0.381
GluAsn: 2.154 ± 0.344
GluPro: 2.349 ± 0.358
GluGln: 3.394 ± 0.454
GluArg: 5.09 ± 0.607
GluSer: 3.394 ± 0.464
GluThr: 3.655 ± 0.414
GluVal: 3.655 ± 0.457
GluTrp: 1.501 ± 0.292
GluTyr: 2.61 ± 0.399
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.61 ± 0.343
PheCys: 0.653 ± 0.202
PheAsp: 1.501 ± 0.274
PheGlu: 2.545 ± 0.407
PhePhe: 0.848 ± 0.211
PheGly: 3.133 ± 0.437
PheHis: 0.326 ± 0.129
PheIle: 1.436 ± 0.244
PheLys: 1.827 ± 0.364
PheLeu: 1.958 ± 0.346
PheMet: 0.653 ± 0.257
PheAsn: 1.37 ± 0.293
PhePro: 1.37 ± 0.374
PheGln: 0.914 ± 0.211
PheArg: 2.023 ± 0.453
PheSer: 2.154 ± 0.344
PheThr: 2.154 ± 0.412
PheVal: 2.219 ± 0.376
PheTrp: 0.392 ± 0.152
PheTyr: 1.044 ± 0.3
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.482 ± 0.678
GlyCys: 0.979 ± 0.277
GlyAsp: 4.764 ± 0.601
GlyGlu: 3.589 ± 0.485
GlyPhe: 2.415 ± 0.365
GlyGly: 6.004 ± 0.802
GlyHis: 1.044 ± 0.25
GlyIle: 5.221 ± 0.468
GlyLys: 4.829 ± 0.53
GlyLeu: 5.417 ± 0.58
GlyMet: 2.806 ± 0.427
GlyAsn: 3.72 ± 0.535
GlyPro: 1.109 ± 0.263
GlyGln: 2.545 ± 0.417
GlyArg: 4.829 ± 0.465
GlySer: 5.025 ± 0.753
GlyThr: 3.328 ± 0.426
GlyVal: 5.351 ± 0.528
GlyTrp: 1.24 ± 0.289
GlyTyr: 2.61 ± 0.406
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.762 ± 0.345
HisCys: 0.457 ± 0.175
HisAsp: 0.718 ± 0.216
HisGlu: 0.783 ± 0.275
HisPhe: 0.653 ± 0.189
HisGly: 1.24 ± 0.257
HisHis: 0.653 ± 0.208
HisIle: 0.914 ± 0.263
HisLys: 0.718 ± 0.229
HisLeu: 1.566 ± 0.328
HisMet: 0.196 ± 0.107
HisAsn: 0.261 ± 0.148
HisPro: 0.783 ± 0.227
HisGln: 0.653 ± 0.184
HisArg: 1.37 ± 0.304
HisSer: 1.109 ± 0.23
HisThr: 0.522 ± 0.166
HisVal: 0.914 ± 0.23
HisTrp: 0.392 ± 0.164
HisTyr: 0.783 ± 0.199
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.939 ± 0.697
IleCys: 0.783 ± 0.244
IleAsp: 4.046 ± 0.433
IleGlu: 3.981 ± 0.447
IlePhe: 1.37 ± 0.315
IleGly: 3.263 ± 0.367
IleHis: 1.175 ± 0.218
IleIle: 3.916 ± 0.597
IleLys: 3.198 ± 0.54
IleLeu: 4.503 ± 0.591
IleMet: 1.501 ± 0.333
IleAsn: 3.198 ± 0.392
IlePro: 3.133 ± 0.432
IleGln: 2.088 ± 0.322
IleArg: 4.046 ± 0.486
IleSer: 4.96 ± 0.611
IleThr: 4.242 ± 0.636
IleVal: 3.002 ± 0.424
IleTrp: 1.24 ± 0.254
IleTyr: 0.979 ± 0.225
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.417 ± 0.583
LysCys: 0.914 ± 0.243
LysAsp: 2.219 ± 0.417
LysGlu: 4.046 ± 0.528
LysPhe: 1.762 ± 0.298
LysGly: 4.242 ± 0.516
LysHis: 1.044 ± 0.26
LysIle: 3.263 ± 0.458
LysLys: 3.785 ± 0.604
LysLeu: 4.634 ± 0.736
LysMet: 2.219 ± 0.346
LysAsn: 1.893 ± 0.339
LysPro: 3.002 ± 0.573
LysGln: 3.72 ± 0.464
LysArg: 4.177 ± 0.562
LysSer: 3.394 ± 0.553
LysThr: 3.524 ± 0.566
LysVal: 3.067 ± 0.457
LysTrp: 0.848 ± 0.225
LysTyr: 2.349 ± 0.391
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.048 ± 0.628
LeuCys: 0.848 ± 0.233
LeuAsp: 4.242 ± 0.536
LeuGlu: 4.699 ± 0.536
LeuPhe: 2.023 ± 0.349
LeuGly: 4.046 ± 0.496
LeuHis: 1.37 ± 0.269
LeuIle: 4.634 ± 0.542
LeuLys: 4.829 ± 0.656
LeuLeu: 5.286 ± 0.561
LeuMet: 2.088 ± 0.304
LeuAsn: 4.829 ± 0.639
LeuPro: 3.524 ± 0.543
LeuGln: 2.741 ± 0.447
LeuArg: 4.503 ± 0.549
LeuSer: 6.069 ± 0.827
LeuThr: 4.373 ± 0.564
LeuVal: 4.568 ± 0.537
LeuTrp: 1.305 ± 0.225
LeuTyr: 2.48 ± 0.329
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.61 ± 0.419
MetCys: 0.392 ± 0.189
MetAsp: 1.632 ± 0.3
MetGlu: 1.632 ± 0.403
MetPhe: 0.653 ± 0.21
MetGly: 1.501 ± 0.348
MetHis: 0.522 ± 0.169
MetIle: 1.566 ± 0.293
MetLys: 2.023 ± 0.402
MetLeu: 2.023 ± 0.304
MetMet: 0.522 ± 0.201
MetAsn: 1.175 ± 0.272
MetPro: 1.632 ± 0.405
MetGln: 1.37 ± 0.307
MetArg: 2.219 ± 0.458
MetSer: 2.545 ± 0.443
MetThr: 2.61 ± 0.375
MetVal: 1.501 ± 0.31
MetTrp: 0.718 ± 0.211
MetTyr: 0.848 ± 0.212
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.242 ± 0.489
AsnCys: 0.326 ± 0.148
AsnAsp: 3.133 ± 0.384
AsnGlu: 2.937 ± 0.404
AsnPhe: 1.044 ± 0.241
AsnGly: 5.612 ± 0.751
AsnHis: 1.109 ± 0.273
AsnIle: 1.958 ± 0.369
AsnLys: 3.002 ± 0.408
AsnLeu: 3.263 ± 0.491
AsnMet: 0.979 ± 0.295
AsnAsn: 2.48 ± 0.553
AsnPro: 2.349 ± 0.433
AsnGln: 2.415 ± 0.431
AsnArg: 2.284 ± 0.381
AsnSer: 3.133 ± 0.465
AsnThr: 2.676 ± 0.528
AsnVal: 2.872 ± 0.497
AsnTrp: 0.522 ± 0.177
AsnTyr: 1.697 ± 0.349
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.394 ± 0.427
ProCys: 0.196 ± 0.116
ProAsp: 2.415 ± 0.433
ProGlu: 3.72 ± 0.47
ProPhe: 1.827 ± 0.377
ProGly: 2.545 ± 0.485
ProHis: 0.457 ± 0.194
ProIle: 2.219 ± 0.34
ProLys: 1.958 ± 0.376
ProLeu: 2.61 ± 0.487
ProMet: 1.044 ± 0.257
ProAsn: 1.632 ± 0.271
ProPro: 1.632 ± 0.305
ProGln: 1.566 ± 0.348
ProArg: 1.762 ± 0.306
ProSer: 2.284 ± 0.329
ProThr: 2.023 ± 0.399
ProVal: 3.589 ± 0.579
ProTrp: 0.392 ± 0.166
ProTyr: 0.848 ± 0.268
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.046 ± 0.696
GlnCys: 0.587 ± 0.16
GlnAsp: 2.154 ± 0.337
GlnGlu: 2.415 ± 0.294
GlnPhe: 1.37 ± 0.413
GlnGly: 1.893 ± 0.274
GlnHis: 0.848 ± 0.2
GlnIle: 3.002 ± 0.362
GlnLys: 2.415 ± 0.311
GlnLeu: 3.328 ± 0.519
GlnMet: 1.109 ± 0.259
GlnAsn: 1.24 ± 0.325
GlnPro: 2.088 ± 0.398
GlnGln: 2.741 ± 0.585
GlnArg: 2.806 ± 0.427
GlnSer: 2.219 ± 0.429
GlnThr: 1.827 ± 0.392
GlnVal: 3.067 ± 0.463
GlnTrp: 0.783 ± 0.187
GlnTyr: 1.436 ± 0.411
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.307 ± 0.523
ArgCys: 0.718 ± 0.196
ArgAsp: 4.046 ± 0.627
ArgGlu: 5.025 ± 0.732
ArgPhe: 1.24 ± 0.329
ArgGly: 3.524 ± 0.466
ArgHis: 1.501 ± 0.327
ArgIle: 3.524 ± 0.512
ArgLys: 3.72 ± 0.618
ArgLeu: 5.678 ± 0.547
ArgMet: 2.545 ± 0.399
ArgAsn: 3.785 ± 0.521
ArgPro: 2.545 ± 0.428
ArgGln: 2.545 ± 0.381
ArgArg: 3.785 ± 0.681
ArgSer: 3.263 ± 0.464
ArgThr: 2.284 ± 0.338
ArgVal: 4.568 ± 0.547
ArgTrp: 0.914 ± 0.24
ArgTyr: 2.284 ± 0.395
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.743 ± 0.672
SerCys: 0.457 ± 0.15
SerAsp: 3.981 ± 0.579
SerGlu: 4.373 ± 0.578
SerPhe: 2.48 ± 0.465
SerGly: 5.286 ± 0.619
SerHis: 1.109 ± 0.242
SerIle: 3.85 ± 0.432
SerLys: 2.545 ± 0.418
SerLeu: 5.678 ± 0.932
SerMet: 2.415 ± 0.394
SerAsn: 2.806 ± 0.503
SerPro: 2.415 ± 0.431
SerGln: 3.002 ± 0.377
SerArg: 3.459 ± 0.44
SerSer: 4.111 ± 0.617
SerThr: 3.133 ± 0.562
SerVal: 5.808 ± 0.515
SerTrp: 0.783 ± 0.193
SerTyr: 1.762 ± 0.343
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.396 ± 0.587
ThrCys: 0.522 ± 0.176
ThrAsp: 3.589 ± 0.47
ThrGlu: 3.916 ± 0.461
ThrPhe: 1.632 ± 0.399
ThrGly: 4.764 ± 0.525
ThrHis: 0.392 ± 0.153
ThrIle: 2.741 ± 0.451
ThrLys: 2.937 ± 0.389
ThrLeu: 3.589 ± 0.445
ThrMet: 1.501 ± 0.349
ThrAsn: 2.937 ± 0.453
ThrPro: 2.48 ± 0.505
ThrGln: 1.762 ± 0.301
ThrArg: 2.545 ± 0.417
ThrSer: 3.198 ± 0.523
ThrThr: 2.61 ± 0.48
ThrVal: 4.242 ± 0.694
ThrTrp: 0.653 ± 0.185
ThrTyr: 2.154 ± 0.423
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.396 ± 0.738
ValCys: 1.109 ± 0.321
ValAsp: 3.72 ± 0.392
ValGlu: 4.111 ± 0.442
ValPhe: 1.958 ± 0.32
ValGly: 4.634 ± 0.599
ValHis: 1.175 ± 0.324
ValIle: 4.307 ± 0.462
ValLys: 4.373 ± 0.514
ValLeu: 4.764 ± 0.615
ValMet: 1.697 ± 0.368
ValAsn: 3.981 ± 0.581
ValPro: 1.762 ± 0.348
ValGln: 1.762 ± 0.253
ValArg: 4.111 ± 0.474
ValSer: 4.895 ± 0.643
ValThr: 4.634 ± 0.616
ValVal: 5.351 ± 0.779
ValTrp: 0.653 ± 0.26
ValTyr: 2.48 ± 0.428
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.305 ± 0.27
TrpCys: 0.326 ± 0.159
TrpAsp: 0.914 ± 0.195
TrpGlu: 0.783 ± 0.225
TrpPhe: 0.261 ± 0.133
TrpGly: 0.848 ± 0.238
TrpHis: 0.392 ± 0.16
TrpIle: 0.718 ± 0.199
TrpLys: 1.109 ± 0.279
TrpLeu: 1.762 ± 0.34
TrpMet: 0.261 ± 0.125
TrpAsn: 1.175 ± 0.315
TrpPro: 0.392 ± 0.156
TrpGln: 0.783 ± 0.218
TrpArg: 1.37 ± 0.286
TrpSer: 1.175 ± 0.308
TrpThr: 0.979 ± 0.372
TrpVal: 1.697 ± 0.336
TrpTrp: 0.196 ± 0.127
TrpTyr: 0.457 ± 0.17
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.806 ± 0.415
TyrCys: 0.392 ± 0.151
TyrAsp: 1.697 ± 0.315
TyrGlu: 1.697 ± 0.29
TyrPhe: 1.109 ± 0.229
TyrGly: 2.545 ± 0.372
TyrHis: 0.653 ± 0.184
TyrIle: 1.958 ± 0.409
TyrLys: 1.762 ± 0.376
TyrLeu: 2.676 ± 0.467
TyrMet: 0.783 ± 0.236
TyrAsn: 1.697 ± 0.367
TyrPro: 0.848 ± 0.186
TyrGln: 1.501 ± 0.323
TyrArg: 2.806 ± 0.353
TyrSer: 2.545 ± 0.426
TyrThr: 1.893 ± 0.385
TyrVal: 1.893 ± 0.449
TyrTrp: 0.653 ± 0.226
TyrTyr: 0.718 ± 0.209
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (15324 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
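A protein-level bootstrap of this kind could look roughly like the Python sketch below. The page does not describe the exact procedure, so this is an assumption: whole proteins are resampled with replacement, the per-mille frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille frequency.

    Assumption: each replicate resamples whole proteins with replacement,
    keeping the number of proteins fixed.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy = ["MKAALAA", "AAGGKL", "MLKAAV", "GGGAAK"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f}")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the error reflects variation between proteins.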

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski