Amino acid dipepetide frequency for Salmonella phage Vi II-E1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.264AlaAla: 9.264 ± 1.497
0.644AlaCys: 0.644 ± 0.27
3.947AlaAsp: 3.947 ± 0.595
6.364AlaGlu: 6.364 ± 0.694
2.336AlaPhe: 2.336 ± 0.352
6.364AlaGly: 6.364 ± 0.784
0.564AlaHis: 0.564 ± 0.222
5.961AlaIle: 5.961 ± 0.749
5.639AlaLys: 5.639 ± 0.685
6.848AlaLeu: 6.848 ± 0.807
3.061AlaMet: 3.061 ± 0.62
3.947AlaAsn: 3.947 ± 0.534
1.531AlaPro: 1.531 ± 0.346
4.27AlaGln: 4.27 ± 0.767
5.317AlaArg: 5.317 ± 0.587
4.995AlaSer: 4.995 ± 0.925
5.398AlaThr: 5.398 ± 0.88
5.156AlaVal: 5.156 ± 0.774
1.128AlaTrp: 1.128 ± 0.252
2.417AlaTyr: 2.417 ± 0.516
0.0AlaXaa: 0.0 ± 0.0
Cys
1.047CysAla: 1.047 ± 0.291
0.242CysCys: 0.242 ± 0.127
0.886CysAsp: 0.886 ± 0.291
1.128CysGlu: 1.128 ± 0.358
0.403CysPhe: 0.403 ± 0.188
1.128CysGly: 1.128 ± 0.391
0.564CysHis: 0.564 ± 0.213
0.644CysIle: 0.644 ± 0.186
1.37CysLys: 1.37 ± 0.451
1.289CysLeu: 1.289 ± 0.317
0.161CysMet: 0.161 ± 0.115
0.725CysAsn: 0.725 ± 0.23
0.644CysPro: 0.644 ± 0.254
0.322CysGln: 0.322 ± 0.176
0.967CysArg: 0.967 ± 0.242
0.886CysSer: 0.886 ± 0.328
0.403CysThr: 0.403 ± 0.16
1.047CysVal: 1.047 ± 0.273
0.403CysTrp: 0.403 ± 0.158
0.081CysTyr: 0.081 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
6.606AspAla: 6.606 ± 0.761
0.967AspCys: 0.967 ± 0.303
4.27AspAsp: 4.27 ± 0.851
4.511AspGlu: 4.511 ± 0.686
3.061AspPhe: 3.061 ± 0.499
5.72AspGly: 5.72 ± 0.858
1.208AspHis: 1.208 ± 0.384
4.028AspIle: 4.028 ± 0.427
3.625AspLys: 3.625 ± 0.686
2.497AspLeu: 2.497 ± 0.494
2.417AspMet: 2.417 ± 0.454
2.497AspAsn: 2.497 ± 0.387
2.014AspPro: 2.014 ± 0.376
1.128AspGln: 1.128 ± 0.306
2.739AspArg: 2.739 ± 0.39
3.303AspSer: 3.303 ± 0.509
2.739AspThr: 2.739 ± 0.51
4.028AspVal: 4.028 ± 0.56
0.483AspTrp: 0.483 ± 0.192
2.981AspTyr: 2.981 ± 0.538
0.0AspXaa: 0.0 ± 0.0
Glu
5.236GluAla: 5.236 ± 0.717
1.047GluCys: 1.047 ± 0.261
3.222GluAsp: 3.222 ± 0.684
4.431GluGlu: 4.431 ± 0.659
2.095GluPhe: 2.095 ± 0.441
2.9GluGly: 2.9 ± 0.461
1.047GluHis: 1.047 ± 0.296
4.673GluIle: 4.673 ± 0.57
4.431GluLys: 4.431 ± 0.708
6.364GluLeu: 6.364 ± 0.573
2.256GluMet: 2.256 ± 0.39
3.142GluAsn: 3.142 ± 0.431
1.933GluPro: 1.933 ± 0.346
4.35GluGln: 4.35 ± 0.611
3.867GluArg: 3.867 ± 0.516
3.303GluSer: 3.303 ± 0.577
2.981GluThr: 2.981 ± 0.468
3.947GluVal: 3.947 ± 0.515
0.967GluTrp: 0.967 ± 0.239
2.739GluTyr: 2.739 ± 0.424
0.0GluXaa: 0.0 ± 0.0
Phe
1.853PheAla: 1.853 ± 0.368
0.967PheCys: 0.967 ± 0.232
3.142PheAsp: 3.142 ± 0.553
1.772PheGlu: 1.772 ± 0.416
1.047PhePhe: 1.047 ± 0.311
2.981PheGly: 2.981 ± 0.446
0.644PheHis: 0.644 ± 0.243
2.981PheIle: 2.981 ± 0.571
1.692PheLys: 1.692 ± 0.38
1.933PheLeu: 1.933 ± 0.323
1.611PheMet: 1.611 ± 0.518
1.853PheAsn: 1.853 ± 0.474
0.967PhePro: 0.967 ± 0.296
1.128PheGln: 1.128 ± 0.293
2.095PheArg: 2.095 ± 0.394
2.659PheSer: 2.659 ± 0.374
1.853PheThr: 1.853 ± 0.37
1.772PheVal: 1.772 ± 0.337
0.483PheTrp: 0.483 ± 0.212
1.208PheTyr: 1.208 ± 0.32
0.0PheXaa: 0.0 ± 0.0
Gly
5.236GlyAla: 5.236 ± 0.789
1.208GlyCys: 1.208 ± 0.359
4.592GlyAsp: 4.592 ± 0.546
4.834GlyGlu: 4.834 ± 0.674
3.142GlyPhe: 3.142 ± 0.455
5.236GlyGly: 5.236 ± 0.828
1.45GlyHis: 1.45 ± 0.35
4.109GlyIle: 4.109 ± 0.575
5.8GlyLys: 5.8 ± 0.79
4.914GlyLeu: 4.914 ± 0.542
1.853GlyMet: 1.853 ± 0.36
3.464GlyAsn: 3.464 ± 0.595
0.725GlyPro: 0.725 ± 0.285
2.739GlyGln: 2.739 ± 0.532
3.625GlyArg: 3.625 ± 0.471
4.673GlySer: 4.673 ± 0.609
4.109GlyThr: 4.109 ± 0.71
4.431GlyVal: 4.431 ± 0.684
1.047GlyTrp: 1.047 ± 0.239
3.464GlyTyr: 3.464 ± 0.503
0.0GlyXaa: 0.0 ± 0.0
His
1.45HisAla: 1.45 ± 0.459
0.403HisCys: 0.403 ± 0.211
0.967HisAsp: 0.967 ± 0.192
1.128HisGlu: 1.128 ± 0.263
0.725HisPhe: 0.725 ± 0.29
2.175HisGly: 2.175 ± 0.717
0.564HisHis: 0.564 ± 0.207
0.967HisIle: 0.967 ± 0.374
1.128HisLys: 1.128 ± 0.291
1.611HisLeu: 1.611 ± 0.433
0.564HisMet: 0.564 ± 0.173
0.886HisAsn: 0.886 ± 0.277
0.806HisPro: 0.806 ± 0.235
0.644HisGln: 0.644 ± 0.194
1.289HisArg: 1.289 ± 0.302
1.208HisSer: 1.208 ± 0.282
0.564HisThr: 0.564 ± 0.206
1.531HisVal: 1.531 ± 0.435
0.242HisTrp: 0.242 ± 0.123
0.806HisTyr: 0.806 ± 0.221
0.0HisXaa: 0.0 ± 0.0
Ile
4.834IleAla: 4.834 ± 0.651
0.806IleCys: 0.806 ± 0.3
4.592IleAsp: 4.592 ± 0.535
4.27IleGlu: 4.27 ± 0.486
1.772IlePhe: 1.772 ± 0.31
3.947IleGly: 3.947 ± 0.682
1.692IleHis: 1.692 ± 0.358
4.35IleIle: 4.35 ± 0.603
3.625IleLys: 3.625 ± 0.435
3.867IleLeu: 3.867 ± 0.569
2.175IleMet: 2.175 ± 0.476
3.947IleAsn: 3.947 ± 0.629
2.336IlePro: 2.336 ± 0.418
1.933IleGln: 1.933 ± 0.452
3.061IleArg: 3.061 ± 0.694
4.834IleSer: 4.834 ± 0.586
3.867IleThr: 3.867 ± 0.623
4.028IleVal: 4.028 ± 0.45
0.725IleTrp: 0.725 ± 0.277
2.014IleTyr: 2.014 ± 0.474
0.0IleXaa: 0.0 ± 0.0
Lys
5.881LysAla: 5.881 ± 0.808
1.37LysCys: 1.37 ± 0.469
3.061LysAsp: 3.061 ± 0.58
4.431LysGlu: 4.431 ± 0.633
1.37LysPhe: 1.37 ± 0.338
3.625LysGly: 3.625 ± 0.629
1.37LysHis: 1.37 ± 0.364
3.625LysIle: 3.625 ± 0.423
3.706LysLys: 3.706 ± 0.565
5.156LysLeu: 5.156 ± 0.725
2.095LysMet: 2.095 ± 0.407
3.867LysAsn: 3.867 ± 0.805
2.9LysPro: 2.9 ± 0.477
2.9LysGln: 2.9 ± 0.407
3.222LysArg: 3.222 ± 0.476
4.511LysSer: 4.511 ± 0.797
3.625LysThr: 3.625 ± 0.679
4.35LysVal: 4.35 ± 0.842
1.047LysTrp: 1.047 ± 0.283
2.659LysTyr: 2.659 ± 0.484
0.0LysXaa: 0.0 ± 0.0
Leu
5.72LeuAla: 5.72 ± 0.811
0.886LeuCys: 0.886 ± 0.292
4.028LeuAsp: 4.028 ± 0.603
3.947LeuGlu: 3.947 ± 0.531
1.933LeuPhe: 1.933 ± 0.404
4.753LeuGly: 4.753 ± 0.499
1.772LeuHis: 1.772 ± 0.293
4.109LeuIle: 4.109 ± 0.552
5.559LeuLys: 5.559 ± 0.732
4.431LeuLeu: 4.431 ± 0.831
1.853LeuMet: 1.853 ± 0.391
3.867LeuAsn: 3.867 ± 0.636
2.9LeuPro: 2.9 ± 0.539
2.82LeuGln: 2.82 ± 0.542
4.673LeuArg: 4.673 ± 0.684
4.753LeuSer: 4.753 ± 0.605
5.639LeuThr: 5.639 ± 0.607
4.511LeuVal: 4.511 ± 0.634
1.45LeuTrp: 1.45 ± 0.309
2.256LeuTyr: 2.256 ± 0.527
0.0LeuXaa: 0.0 ± 0.0
Met
2.82MetAla: 2.82 ± 0.48
0.403MetCys: 0.403 ± 0.16
1.289MetAsp: 1.289 ± 0.312
1.933MetGlu: 1.933 ± 0.327
1.208MetPhe: 1.208 ± 0.357
1.531MetGly: 1.531 ± 0.294
0.886MetHis: 0.886 ± 0.294
2.095MetIle: 2.095 ± 0.436
2.336MetLys: 2.336 ± 0.449
2.256MetLeu: 2.256 ± 0.533
1.531MetMet: 1.531 ± 0.421
1.853MetAsn: 1.853 ± 0.318
1.047MetPro: 1.047 ± 0.263
1.611MetGln: 1.611 ± 0.457
1.772MetArg: 1.772 ± 0.356
2.417MetSer: 2.417 ± 0.389
1.853MetThr: 1.853 ± 0.332
1.208MetVal: 1.208 ± 0.338
0.644MetTrp: 0.644 ± 0.223
0.644MetTyr: 0.644 ± 0.222
0.0MetXaa: 0.0 ± 0.0
Asn
4.592AsnAla: 4.592 ± 0.513
0.644AsnCys: 0.644 ± 0.263
3.303AsnAsp: 3.303 ± 0.414
3.545AsnGlu: 3.545 ± 0.473
1.289AsnPhe: 1.289 ± 0.351
4.914AsnGly: 4.914 ± 0.696
1.047AsnHis: 1.047 ± 0.34
2.739AsnIle: 2.739 ± 0.411
3.222AsnLys: 3.222 ± 0.727
3.061AsnLeu: 3.061 ± 0.484
0.886AsnMet: 0.886 ± 0.253
2.095AsnAsn: 2.095 ± 0.486
1.692AsnPro: 1.692 ± 0.349
1.853AsnGln: 1.853 ± 0.402
2.417AsnArg: 2.417 ± 0.518
3.222AsnSer: 3.222 ± 0.475
2.336AsnThr: 2.336 ± 0.533
3.706AsnVal: 3.706 ± 0.754
0.725AsnTrp: 0.725 ± 0.288
1.692AsnTyr: 1.692 ± 0.358
0.0AsnXaa: 0.0 ± 0.0
Pro
3.303ProAla: 3.303 ± 0.591
0.483ProCys: 0.483 ± 0.171
2.014ProAsp: 2.014 ± 0.384
2.82ProGlu: 2.82 ± 0.452
0.967ProPhe: 0.967 ± 0.265
2.095ProGly: 2.095 ± 0.364
0.725ProHis: 0.725 ± 0.221
2.095ProIle: 2.095 ± 0.307
1.933ProLys: 1.933 ± 0.392
1.692ProLeu: 1.692 ± 0.448
0.644ProMet: 0.644 ± 0.174
1.45ProAsn: 1.45 ± 0.324
1.45ProPro: 1.45 ± 0.38
1.128ProGln: 1.128 ± 0.347
1.208ProArg: 1.208 ± 0.297
1.933ProSer: 1.933 ± 0.411
1.853ProThr: 1.853 ± 0.405
3.142ProVal: 3.142 ± 0.532
0.403ProTrp: 0.403 ± 0.191
1.531ProTyr: 1.531 ± 0.285
0.0ProXaa: 0.0 ± 0.0
Gln
2.82GlnAla: 2.82 ± 0.713
0.483GlnCys: 0.483 ± 0.185
1.772GlnAsp: 1.772 ± 0.352
2.659GlnGlu: 2.659 ± 0.4
1.772GlnPhe: 1.772 ± 0.372
1.208GlnGly: 1.208 ± 0.347
1.047GlnHis: 1.047 ± 0.272
2.256GlnIle: 2.256 ± 0.487
2.981GlnLys: 2.981 ± 0.438
4.189GlnLeu: 4.189 ± 0.666
1.611GlnMet: 1.611 ± 0.373
1.37GlnAsn: 1.37 ± 0.304
1.289GlnPro: 1.289 ± 0.286
3.867GlnGln: 3.867 ± 1.239
2.095GlnArg: 2.095 ± 0.403
3.222GlnSer: 3.222 ± 0.509
2.256GlnThr: 2.256 ± 0.524
3.303GlnVal: 3.303 ± 0.57
0.725GlnTrp: 0.725 ± 0.222
1.611GlnTyr: 1.611 ± 0.442
0.0GlnXaa: 0.0 ± 0.0
Arg
4.109ArgAla: 4.109 ± 0.42
1.128ArgCys: 1.128 ± 0.403
2.659ArgAsp: 2.659 ± 0.59
3.384ArgGlu: 3.384 ± 0.577
1.933ArgPhe: 1.933 ± 0.392
3.464ArgGly: 3.464 ± 0.582
0.967ArgHis: 0.967 ± 0.238
3.303ArgIle: 3.303 ± 0.643
3.786ArgLys: 3.786 ± 0.608
5.398ArgLeu: 5.398 ± 0.587
1.853ArgMet: 1.853 ± 0.371
3.222ArgAsn: 3.222 ± 0.473
1.208ArgPro: 1.208 ± 0.256
2.739ArgGln: 2.739 ± 0.558
2.497ArgArg: 2.497 ± 0.431
3.061ArgSer: 3.061 ± 0.5
1.772ArgThr: 1.772 ± 0.418
2.9ArgVal: 2.9 ± 0.501
0.322ArgTrp: 0.322 ± 0.133
3.222ArgTyr: 3.222 ± 0.487
0.0ArgXaa: 0.0 ± 0.0
Ser
5.961SerAla: 5.961 ± 0.967
0.886SerCys: 0.886 ± 0.277
4.511SerAsp: 4.511 ± 0.608
4.109SerGlu: 4.109 ± 0.606
2.659SerPhe: 2.659 ± 0.533
5.72SerGly: 5.72 ± 0.656
0.644SerHis: 0.644 ± 0.178
3.625SerIle: 3.625 ± 0.568
4.35SerLys: 4.35 ± 0.843
5.236SerLeu: 5.236 ± 0.7
2.256SerMet: 2.256 ± 0.405
2.578SerAsn: 2.578 ± 0.47
2.659SerPro: 2.659 ± 0.385
2.497SerGln: 2.497 ± 0.382
2.9SerArg: 2.9 ± 0.455
5.156SerSer: 5.156 ± 0.596
4.028SerThr: 4.028 ± 0.723
4.109SerVal: 4.109 ± 0.467
1.047SerTrp: 1.047 ± 0.35
2.497SerTyr: 2.497 ± 0.313
0.0SerXaa: 0.0 ± 0.0
Thr
4.35ThrAla: 4.35 ± 0.802
0.644ThrCys: 0.644 ± 0.203
3.706ThrAsp: 3.706 ± 0.535
3.706ThrGlu: 3.706 ± 0.534
2.256ThrPhe: 2.256 ± 0.545
5.8ThrGly: 5.8 ± 0.808
1.047ThrHis: 1.047 ± 0.304
3.384ThrIle: 3.384 ± 0.619
3.142ThrLys: 3.142 ± 0.461
3.545ThrLeu: 3.545 ± 0.473
1.45ThrMet: 1.45 ± 0.31
2.497ThrAsn: 2.497 ± 0.532
3.142ThrPro: 3.142 ± 0.602
1.611ThrGln: 1.611 ± 0.395
2.9ThrArg: 2.9 ± 0.471
4.028ThrSer: 4.028 ± 0.509
3.947ThrThr: 3.947 ± 0.982
3.545ThrVal: 3.545 ± 0.632
0.806ThrTrp: 0.806 ± 0.333
1.531ThrTyr: 1.531 ± 0.332
0.0ThrXaa: 0.0 ± 0.0
Val
5.075ValAla: 5.075 ± 0.66
0.483ValCys: 0.483 ± 0.325
4.109ValAsp: 4.109 ± 0.728
3.867ValGlu: 3.867 ± 0.603
2.497ValPhe: 2.497 ± 0.559
3.706ValGly: 3.706 ± 0.557
0.806ValHis: 0.806 ± 0.255
4.834ValIle: 4.834 ± 0.512
3.545ValLys: 3.545 ± 0.636
4.189ValLeu: 4.189 ± 0.668
2.256ValMet: 2.256 ± 0.418
3.384ValAsn: 3.384 ± 0.589
1.853ValPro: 1.853 ± 0.437
2.256ValGln: 2.256 ± 0.416
3.142ValArg: 3.142 ± 0.511
5.72ValSer: 5.72 ± 0.789
4.753ValThr: 4.753 ± 0.877
4.834ValVal: 4.834 ± 0.93
1.128ValTrp: 1.128 ± 0.365
2.659ValTyr: 2.659 ± 0.376
0.0ValXaa: 0.0 ± 0.0
Trp
1.047TrpAla: 1.047 ± 0.264
0.242TrpCys: 0.242 ± 0.133
1.37TrpAsp: 1.37 ± 0.286
0.806TrpGlu: 0.806 ± 0.305
0.725TrpPhe: 0.725 ± 0.252
0.564TrpGly: 0.564 ± 0.294
0.725TrpHis: 0.725 ± 0.305
0.403TrpIle: 0.403 ± 0.197
0.967TrpLys: 0.967 ± 0.286
1.289TrpLeu: 1.289 ± 0.388
0.322TrpMet: 0.322 ± 0.239
0.483TrpAsn: 0.483 ± 0.21
0.322TrpPro: 0.322 ± 0.159
0.886TrpGln: 0.886 ± 0.254
1.45TrpArg: 1.45 ± 0.385
0.403TrpSer: 0.403 ± 0.167
0.725TrpThr: 0.725 ± 0.217
1.45TrpVal: 1.45 ± 0.289
0.242TrpTrp: 0.242 ± 0.124
0.403TrpTyr: 0.403 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.028TyrAla: 4.028 ± 0.378
0.403TyrCys: 0.403 ± 0.147
3.384TyrAsp: 3.384 ± 0.534
1.289TyrGlu: 1.289 ± 0.307
1.45TyrPhe: 1.45 ± 0.399
2.82TyrGly: 2.82 ± 0.522
0.886TyrHis: 0.886 ± 0.232
2.497TyrIle: 2.497 ± 0.536
1.933TyrLys: 1.933 ± 0.425
2.095TyrLeu: 2.095 ± 0.488
0.403TyrMet: 0.403 ± 0.162
1.853TyrAsn: 1.853 ± 0.414
1.45TyrPro: 1.45 ± 0.319
1.853TyrGln: 1.853 ± 0.503
1.692TyrArg: 1.692 ± 0.317
3.142TyrSer: 3.142 ± 0.48
2.336TyrThr: 2.336 ± 0.566
2.095TyrVal: 2.095 ± 0.404
0.806TyrTrp: 0.806 ± 0.25
1.047TyrTyr: 1.047 ± 0.242
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (12414 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski