Amino acid dipeptide frequency for Salmonella phage 19

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.
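
The counting described above can be sketched in Python. This is a minimal illustration, not the database's actual pipeline: it assumes one-letter sequences, overlapping windows that never span two proteins, any non-standard letter mapped to X (Xaa), and the uniform 2.5‰ baseline as the expected value. The function names are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands in for Xaa (assumption of this sketch)
UNIFORM_PER_MILLE = 1000.0 / 400   # 0.25% expressed in per mille, i.e. 2.5


def dipeptide_counts(proteins):
    """Count overlapping dipeptides within each protein separately."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return the frequency of all 441 dipeptides in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    freqs = {}
    for first, second in product(ALPHABET, repeat=2):
        pair = first + second
        freqs[pair] = 1000.0 * counts[pair] / total if total else 0.0
    return freqs


def label(value_per_mille):
    """Classify a value against the uniform expectation (red/blue in the table)."""
    if value_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if value_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAAC", "AC"])  # toy input, not phage data
    print(freqs["AA"], label(freqs["AA"]))
```

Keeping all 441 keys in the result, including those with zero counts, mirrors the table layout, where the Xaa row and column are present even though every value is 0.0.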

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.
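
As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Per mille to fraction: multiply by 10^-3."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Per mille to percent: divide by 10."""
    return value_per_mille / 10.0


ala_ala = 5.695  # AlaAla value from the table, in per mille
print(f"{per_mille_to_fraction(ala_ala):.6f}")  # prints 0.005695
print(f"{per_mille_to_percent(ala_ala):.4f}")   # prints 0.5695
```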

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.695AlaAla: 5.695 ± 0.67
0.899AlaCys: 0.899 ± 0.291
4.346AlaAsp: 4.346 ± 0.681
3.972AlaGlu: 3.972 ± 0.621
3.747AlaPhe: 3.747 ± 0.489
6.519AlaGly: 6.519 ± 0.678
0.899AlaHis: 0.899 ± 0.251
2.997AlaIle: 2.997 ± 0.455
4.346AlaLys: 4.346 ± 0.661
4.196AlaLeu: 4.196 ± 0.614
3.522AlaMet: 3.522 ± 0.526
3.372AlaAsn: 3.372 ± 0.427
2.323AlaPro: 2.323 ± 0.407
2.248AlaGln: 2.248 ± 0.336
2.922AlaArg: 2.922 ± 0.483
4.796AlaSer: 4.796 ± 0.508
4.121AlaThr: 4.121 ± 0.748
4.046AlaVal: 4.046 ± 0.547
1.574AlaTrp: 1.574 ± 0.311
3.072AlaTyr: 3.072 ± 0.549
0.0AlaXaa: 0.0 ± 0.0
Cys
1.199CysAla: 1.199 ± 0.338
0.599CysCys: 0.599 ± 0.213
0.674CysAsp: 0.674 ± 0.247
0.899CysGlu: 0.899 ± 0.3
0.45CysPhe: 0.45 ± 0.178
0.899CysGly: 0.899 ± 0.259
0.824CysHis: 0.824 ± 0.22
1.199CysIle: 1.199 ± 0.289
0.899CysLys: 0.899 ± 0.285
1.798CysLeu: 1.798 ± 0.325
0.974CysMet: 0.974 ± 0.262
0.824CysAsn: 0.824 ± 0.278
0.674CysPro: 0.674 ± 0.212
0.525CysGln: 0.525 ± 0.191
1.199CysArg: 1.199 ± 0.278
1.049CysSer: 1.049 ± 0.298
0.525CysThr: 0.525 ± 0.181
0.599CysVal: 0.599 ± 0.22
0.599CysTrp: 0.599 ± 0.2
0.599CysTyr: 0.599 ± 0.195
0.0CysXaa: 0.0 ± 0.0
Asp
4.196AspAla: 4.196 ± 0.495
1.349AspCys: 1.349 ± 0.313
3.897AspAsp: 3.897 ± 0.652
3.297AspGlu: 3.297 ± 0.454
2.773AspPhe: 2.773 ± 0.464
4.646AspGly: 4.646 ± 0.653
0.974AspHis: 0.974 ± 0.253
2.997AspIle: 2.997 ± 0.48
3.597AspLys: 3.597 ± 0.641
4.796AspLeu: 4.796 ± 0.547
1.424AspMet: 1.424 ± 0.315
2.548AspAsn: 2.548 ± 0.382
2.698AspPro: 2.698 ± 0.613
3.147AspGln: 3.147 ± 0.537
2.548AspArg: 2.548 ± 0.492
3.222AspSer: 3.222 ± 0.517
2.997AspThr: 2.997 ± 0.578
4.871AspVal: 4.871 ± 0.548
1.349AspTrp: 1.349 ± 0.295
1.873AspTyr: 1.873 ± 0.308
0.0AspXaa: 0.0 ± 0.0
Glu
4.121GluAla: 4.121 ± 0.631
0.824GluCys: 0.824 ± 0.218
2.848GluAsp: 2.848 ± 0.428
3.297GluGlu: 3.297 ± 0.693
2.997GluPhe: 2.997 ± 0.53
3.672GluGly: 3.672 ± 0.645
1.274GluHis: 1.274 ± 0.323
2.623GluIle: 2.623 ± 0.376
3.897GluLys: 3.897 ± 0.557
5.395GluLeu: 5.395 ± 0.559
1.424GluMet: 1.424 ± 0.281
2.473GluAsn: 2.473 ± 0.369
1.199GluPro: 1.199 ± 0.333
1.948GluGln: 1.948 ± 0.389
2.623GluArg: 2.623 ± 0.524
3.822GluSer: 3.822 ± 0.538
2.323GluThr: 2.323 ± 0.417
3.222GluVal: 3.222 ± 0.413
1.349GluTrp: 1.349 ± 0.362
1.274GluTyr: 1.274 ± 0.276
0.0GluXaa: 0.0 ± 0.0
Phe
2.698PheAla: 2.698 ± 0.416
0.824PheCys: 0.824 ± 0.234
2.922PheAsp: 2.922 ± 0.537
1.873PheGlu: 1.873 ± 0.37
1.124PhePhe: 1.124 ± 0.311
2.098PheGly: 2.098 ± 0.412
0.749PheHis: 0.749 ± 0.226
1.723PheIle: 1.723 ± 0.385
1.873PheLys: 1.873 ± 0.393
3.447PheLeu: 3.447 ± 0.514
1.649PheMet: 1.649 ± 0.346
2.023PheAsn: 2.023 ± 0.356
1.574PhePro: 1.574 ± 0.356
1.424PheGln: 1.424 ± 0.321
2.773PheArg: 2.773 ± 0.417
1.798PheSer: 1.798 ± 0.295
4.196PheThr: 4.196 ± 0.568
3.147PheVal: 3.147 ± 0.449
0.824PheTrp: 0.824 ± 0.273
0.974PheTyr: 0.974 ± 0.251
0.0PheXaa: 0.0 ± 0.0
Gly
4.946GlyAla: 4.946 ± 0.868
1.349GlyCys: 1.349 ± 0.319
5.47GlyAsp: 5.47 ± 1.829
3.822GlyGlu: 3.822 ± 0.509
2.323GlyPhe: 2.323 ± 0.456
3.522GlyGly: 3.522 ± 0.572
0.525GlyHis: 0.525 ± 0.204
3.297GlyIle: 3.297 ± 0.552
3.747GlyLys: 3.747 ± 0.563
6.369GlyLeu: 6.369 ± 0.601
2.323GlyMet: 2.323 ± 0.436
3.597GlyAsn: 3.597 ± 0.641
4.121GlyPro: 4.121 ± 2.198
2.623GlyGln: 2.623 ± 0.495
2.698GlyArg: 2.698 ± 0.426
4.571GlySer: 4.571 ± 0.64
4.571GlyThr: 4.571 ± 0.595
4.871GlyVal: 4.871 ± 0.533
1.199GlyTrp: 1.199 ± 0.289
2.248GlyTyr: 2.248 ± 0.413
0.0GlyXaa: 0.0 ± 0.0
His
0.525HisAla: 0.525 ± 0.189
0.45HisCys: 0.45 ± 0.167
1.274HisAsp: 1.274 ± 0.316
1.124HisGlu: 1.124 ± 0.305
0.974HisPhe: 0.974 ± 0.284
1.049HisGly: 1.049 ± 0.294
0.375HisHis: 0.375 ± 0.17
0.824HisIle: 0.824 ± 0.248
0.749HisLys: 0.749 ± 0.24
1.499HisLeu: 1.499 ± 0.314
0.749HisMet: 0.749 ± 0.264
0.749HisAsn: 0.749 ± 0.247
0.749HisPro: 0.749 ± 0.235
0.824HisGln: 0.824 ± 0.221
1.349HisArg: 1.349 ± 0.313
1.798HisSer: 1.798 ± 0.388
1.199HisThr: 1.199 ± 0.288
1.499HisVal: 1.499 ± 0.296
0.45HisTrp: 0.45 ± 0.184
0.674HisTyr: 0.674 ± 0.211
0.0HisXaa: 0.0 ± 0.0
Ile
3.597IleAla: 3.597 ± 0.568
0.525IleCys: 0.525 ± 0.206
3.972IleAsp: 3.972 ± 0.546
3.222IleGlu: 3.222 ± 0.496
1.798IlePhe: 1.798 ± 0.421
2.623IleGly: 2.623 ± 0.46
0.899IleHis: 0.899 ± 0.261
3.072IleIle: 3.072 ± 0.601
2.248IleLys: 2.248 ± 0.477
4.646IleLeu: 4.646 ± 0.473
2.023IleMet: 2.023 ± 0.367
2.398IleAsn: 2.398 ± 0.374
2.623IlePro: 2.623 ± 0.414
1.948IleGln: 1.948 ± 0.53
2.922IleArg: 2.922 ± 0.411
3.672IleSer: 3.672 ± 0.508
2.848IleThr: 2.848 ± 0.515
4.121IleVal: 4.121 ± 0.551
0.674IleTrp: 0.674 ± 0.212
1.574IleTyr: 1.574 ± 0.349
0.0IleXaa: 0.0 ± 0.0
Lys
4.721LysAla: 4.721 ± 0.71
0.824LysCys: 0.824 ± 0.277
2.548LysAsp: 2.548 ± 0.465
2.623LysGlu: 2.623 ± 0.509
1.499LysPhe: 1.499 ± 0.314
4.496LysGly: 4.496 ± 1.314
1.274LysHis: 1.274 ± 0.334
3.222LysIle: 3.222 ± 0.527
2.548LysLys: 2.548 ± 0.428
4.121LysLeu: 4.121 ± 0.51
1.798LysMet: 1.798 ± 0.41
2.248LysAsn: 2.248 ± 0.406
1.574LysPro: 1.574 ± 0.365
2.098LysGln: 2.098 ± 0.392
2.023LysArg: 2.023 ± 0.417
5.021LysSer: 5.021 ± 0.634
2.997LysThr: 2.997 ± 0.469
4.421LysVal: 4.421 ± 0.517
0.974LysTrp: 0.974 ± 0.26
2.023LysTyr: 2.023 ± 0.442
0.0LysXaa: 0.0 ± 0.0
Leu
6.294LeuAla: 6.294 ± 0.687
1.274LeuCys: 1.274 ± 0.31
3.972LeuAsp: 3.972 ± 0.439
4.571LeuGlu: 4.571 ± 0.581
2.848LeuPhe: 2.848 ± 0.365
4.421LeuGly: 4.421 ± 0.475
2.023LeuHis: 2.023 ± 0.39
3.747LeuIle: 3.747 ± 0.55
5.47LeuLys: 5.47 ± 0.622
8.093LeuLeu: 8.093 ± 0.809
4.496LeuMet: 4.496 ± 0.62
3.822LeuAsn: 3.822 ± 0.564
4.421LeuPro: 4.421 ± 0.529
4.196LeuGln: 4.196 ± 0.536
5.17LeuArg: 5.17 ± 0.676
5.92LeuSer: 5.92 ± 0.557
6.145LeuThr: 6.145 ± 0.708
7.793LeuVal: 7.793 ± 0.842
1.499LeuTrp: 1.499 ± 0.321
2.698LeuTyr: 2.698 ± 0.427
0.0LeuXaa: 0.0 ± 0.0
Met
2.473MetAla: 2.473 ± 0.4
0.599MetCys: 0.599 ± 0.204
1.574MetAsp: 1.574 ± 0.277
1.274MetGlu: 1.274 ± 0.282
2.398MetPhe: 2.398 ± 0.398
1.723MetGly: 1.723 ± 0.322
0.824MetHis: 0.824 ± 0.237
2.473MetIle: 2.473 ± 0.405
2.398MetLys: 2.398 ± 0.513
3.822MetLeu: 3.822 ± 0.602
1.199MetMet: 1.199 ± 0.29
2.323MetAsn: 2.323 ± 0.457
1.199MetPro: 1.199 ± 0.287
2.398MetGln: 2.398 ± 0.39
1.424MetArg: 1.424 ± 0.369
2.698MetSer: 2.698 ± 0.438
3.222MetThr: 3.222 ± 0.488
3.372MetVal: 3.372 ± 0.622
0.45MetTrp: 0.45 ± 0.156
1.049MetTyr: 1.049 ± 0.261
0.0MetXaa: 0.0 ± 0.0
Asn
3.672AsnAla: 3.672 ± 0.537
0.749AsnCys: 0.749 ± 0.241
2.248AsnAsp: 2.248 ± 0.328
1.499AsnGlu: 1.499 ± 0.398
1.499AsnPhe: 1.499 ± 0.302
3.222AsnGly: 3.222 ± 0.591
0.674AsnHis: 0.674 ± 0.218
2.248AsnIle: 2.248 ± 0.442
2.997AsnLys: 2.997 ± 0.492
4.646AsnLeu: 4.646 ± 0.677
1.948AsnMet: 1.948 ± 0.395
1.199AsnAsn: 1.199 ± 0.266
2.922AsnPro: 2.922 ± 0.419
1.199AsnGln: 1.199 ± 0.314
2.548AsnArg: 2.548 ± 0.391
3.147AsnSer: 3.147 ± 0.486
2.248AsnThr: 2.248 ± 0.328
2.922AsnVal: 2.922 ± 0.342
0.824AsnTrp: 0.824 ± 0.255
1.649AsnTyr: 1.649 ± 0.377
0.0AsnXaa: 0.0 ± 0.0
Pro
2.922ProAla: 2.922 ± 0.661
0.599ProCys: 0.599 ± 0.199
2.323ProAsp: 2.323 ± 0.485
2.698ProGlu: 2.698 ± 0.512
1.349ProPhe: 1.349 ± 0.285
2.698ProGly: 2.698 ± 0.456
0.674ProHis: 0.674 ± 0.255
1.649ProIle: 1.649 ± 0.401
1.723ProLys: 1.723 ± 0.441
3.222ProLeu: 3.222 ± 0.47
1.499ProMet: 1.499 ± 0.375
1.424ProAsn: 1.424 ± 0.425
1.424ProPro: 1.424 ± 0.341
3.222ProGln: 3.222 ± 0.991
1.723ProArg: 1.723 ± 0.333
3.072ProSer: 3.072 ± 0.436
2.398ProThr: 2.398 ± 0.413
3.972ProVal: 3.972 ± 0.539
0.599ProTrp: 0.599 ± 0.234
1.274ProTyr: 1.274 ± 0.28
0.0ProXaa: 0.0 ± 0.0
Gln
2.997GlnAla: 2.997 ± 0.417
0.674GlnCys: 0.674 ± 0.195
2.173GlnAsp: 2.173 ± 0.372
2.098GlnGlu: 2.098 ± 0.354
1.124GlnPhe: 1.124 ± 0.322
5.545GlnGly: 5.545 ± 1.798
1.124GlnHis: 1.124 ± 0.314
2.473GlnIle: 2.473 ± 0.44
2.023GlnLys: 2.023 ± 0.383
3.597GlnLeu: 3.597 ± 0.466
1.199GlnMet: 1.199 ± 0.308
1.873GlnAsn: 1.873 ± 0.371
1.274GlnPro: 1.274 ± 0.318
1.723GlnGln: 1.723 ± 0.468
2.023GlnArg: 2.023 ± 0.447
3.447GlnSer: 3.447 ± 0.521
2.023GlnThr: 2.023 ± 0.35
2.473GlnVal: 2.473 ± 0.407
0.525GlnTrp: 0.525 ± 0.191
1.873GlnTyr: 1.873 ± 0.387
0.0GlnXaa: 0.0 ± 0.0
Arg
2.398ArgAla: 2.398 ± 0.46
1.574ArgCys: 1.574 ± 0.388
2.623ArgAsp: 2.623 ± 0.636
2.248ArgGlu: 2.248 ± 0.356
3.072ArgPhe: 3.072 ± 0.404
2.623ArgGly: 2.623 ± 0.495
1.424ArgHis: 1.424 ± 0.306
2.997ArgIle: 2.997 ± 0.426
2.473ArgLys: 2.473 ± 0.37
4.721ArgLeu: 4.721 ± 0.672
2.997ArgMet: 2.997 ± 0.439
2.098ArgAsn: 2.098 ± 0.348
2.323ArgPro: 2.323 ± 0.452
2.548ArgGln: 2.548 ± 0.404
3.597ArgArg: 3.597 ± 0.57
4.046ArgSer: 4.046 ± 0.578
3.297ArgThr: 3.297 ± 0.564
3.522ArgVal: 3.522 ± 0.425
1.124ArgTrp: 1.124 ± 0.291
1.499ArgTyr: 1.499 ± 0.327
0.0ArgXaa: 0.0 ± 0.0
Ser
5.245SerAla: 5.245 ± 0.695
1.274SerCys: 1.274 ± 0.321
4.646SerAsp: 4.646 ± 0.491
3.147SerGlu: 3.147 ± 0.491
2.698SerPhe: 2.698 ± 0.462
6.894SerGly: 6.894 ± 0.79
1.049SerHis: 1.049 ± 0.27
3.747SerIle: 3.747 ± 0.456
3.072SerLys: 3.072 ± 0.563
6.369SerLeu: 6.369 ± 0.748
2.922SerMet: 2.922 ± 0.502
2.922SerAsn: 2.922 ± 0.435
2.698SerPro: 2.698 ± 0.391
2.773SerGln: 2.773 ± 0.442
4.271SerArg: 4.271 ± 0.687
4.721SerSer: 4.721 ± 0.643
4.646SerThr: 4.646 ± 0.61
5.92SerVal: 5.92 ± 0.812
1.274SerTrp: 1.274 ± 0.327
2.023SerTyr: 2.023 ± 0.376
0.0SerXaa: 0.0 ± 0.0
Thr
4.046ThrAla: 4.046 ± 0.563
0.974ThrCys: 0.974 ± 0.302
4.046ThrAsp: 4.046 ± 0.504
3.372ThrGlu: 3.372 ± 0.421
2.248ThrPhe: 2.248 ± 0.34
5.096ThrGly: 5.096 ± 1.094
0.674ThrHis: 0.674 ± 0.174
2.548ThrIle: 2.548 ± 0.397
3.222ThrLys: 3.222 ± 0.577
5.545ThrLeu: 5.545 ± 0.63
1.649ThrMet: 1.649 ± 0.345
2.473ThrAsn: 2.473 ± 0.488
2.548ThrPro: 2.548 ± 0.459
2.398ThrGln: 2.398 ± 0.427
3.147ThrArg: 3.147 ± 0.493
5.17ThrSer: 5.17 ± 0.571
3.672ThrThr: 3.672 ± 0.627
5.245ThrVal: 5.245 ± 0.678
1.499ThrTrp: 1.499 ± 0.376
1.948ThrTyr: 1.948 ± 0.406
0.0ThrXaa: 0.0 ± 0.0
Val
5.096ValAla: 5.096 ± 0.741
0.899ValCys: 0.899 ± 0.204
4.571ValAsp: 4.571 ± 0.658
4.121ValGlu: 4.121 ± 0.56
2.473ValPhe: 2.473 ± 0.388
3.897ValGly: 3.897 ± 0.674
1.349ValHis: 1.349 ± 0.346
4.271ValIle: 4.271 ± 0.564
3.672ValLys: 3.672 ± 0.58
7.269ValLeu: 7.269 ± 0.74
3.372ValMet: 3.372 ± 0.481
3.372ValAsn: 3.372 ± 0.48
2.623ValPro: 2.623 ± 0.474
2.773ValGln: 2.773 ± 0.472
5.096ValArg: 5.096 ± 0.727
6.145ValSer: 6.145 ± 0.695
4.871ValThr: 4.871 ± 0.528
6.894ValVal: 6.894 ± 0.718
1.199ValTrp: 1.199 ± 0.353
2.473ValTyr: 2.473 ± 0.443
0.0ValXaa: 0.0 ± 0.0
Trp
0.749TrpAla: 0.749 ± 0.183
0.225TrpCys: 0.225 ± 0.121
1.049TrpAsp: 1.049 ± 0.265
0.899TrpGlu: 0.899 ± 0.251
0.674TrpPhe: 0.674 ± 0.193
0.45TrpGly: 0.45 ± 0.182
0.674TrpHis: 0.674 ± 0.213
1.499TrpIle: 1.499 ± 0.298
0.824TrpLys: 0.824 ± 0.276
2.623TrpLeu: 2.623 ± 0.459
0.974TrpMet: 0.974 ± 0.244
0.899TrpAsn: 0.899 ± 0.265
0.375TrpPro: 0.375 ± 0.197
0.974TrpGln: 0.974 ± 0.247
0.749TrpArg: 0.749 ± 0.272
1.424TrpSer: 1.424 ± 0.349
1.424TrpThr: 1.424 ± 0.32
1.274TrpVal: 1.274 ± 0.304
0.45TrpTrp: 0.45 ± 0.18
0.974TrpTyr: 0.974 ± 0.273
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.023TyrAla: 2.023 ± 0.396
0.525TyrCys: 0.525 ± 0.167
2.023TyrAsp: 2.023 ± 0.442
2.548TyrGlu: 2.548 ± 0.394
1.649TyrPhe: 1.649 ± 0.411
2.098TyrGly: 2.098 ± 0.41
0.599TyrHis: 0.599 ± 0.175
1.798TyrIle: 1.798 ± 0.422
1.274TyrLys: 1.274 ± 0.285
2.698TyrLeu: 2.698 ± 0.573
0.674TyrMet: 0.674 ± 0.217
1.424TyrAsn: 1.424 ± 0.331
1.124TyrPro: 1.124 ± 0.31
1.274TyrGln: 1.274 ± 0.319
2.398TyrArg: 2.398 ± 0.441
2.773TyrSer: 2.773 ± 0.426
1.798TyrThr: 1.798 ± 0.304
2.323TyrVal: 2.323 ± 0.407
0.749TyrTrp: 0.749 ± 0.209
1.349TyrTyr: 1.349 ± 0.29
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 194 proteins (13346 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
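
A protein-level bootstrap of this kind can be sketched as follows. The page does not describe the exact procedure, so this sketch makes its own assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the sample standard deviation of those replicates is reported as the error. The function names and the fixed seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over resampled protein sets."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins, not individual residues or dipeptides.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["ACAC", "GGGG", "AAAC", "ACGT"]  # toy input, not phage data
    print(per_mille(toy, "AC"), bootstrap_error(toy, "AC"))
```

Under this scheme a dipeptide that never occurs has an estimate of 0.0 in every replicate and therefore an error of 0.0, which is consistent with the 0.0 ± 0.0 entries for Xaa in the table.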
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski