Amino acid dipepetide frequency for Salmonella phage 64795_sal3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.343AlaAla: 8.343 ± 1.442
0.745AlaCys: 0.745 ± 0.308
4.246AlaAsp: 4.246 ± 0.478
6.331AlaGlu: 6.331 ± 0.854
2.011AlaPhe: 2.011 ± 0.406
6.704AlaGly: 6.704 ± 0.586
0.894AlaHis: 0.894 ± 0.268
5.885AlaIle: 5.885 ± 0.467
6.629AlaLys: 6.629 ± 0.782
7.672AlaLeu: 7.672 ± 0.977
2.756AlaMet: 2.756 ± 0.415
3.426AlaAsn: 3.426 ± 0.611
2.16AlaPro: 2.16 ± 0.357
3.426AlaGln: 3.426 ± 0.627
4.618AlaArg: 4.618 ± 0.624
5.214AlaSer: 5.214 ± 0.762
4.916AlaThr: 4.916 ± 0.871
6.034AlaVal: 6.034 ± 0.72
1.117AlaTrp: 1.117 ± 0.249
2.98AlaTyr: 2.98 ± 0.438
0.0AlaXaa: 0.0 ± 0.0
Cys
0.968CysAla: 0.968 ± 0.264
0.447CysCys: 0.447 ± 0.178
1.266CysAsp: 1.266 ± 0.332
1.192CysGlu: 1.192 ± 0.273
0.596CysPhe: 0.596 ± 0.22
1.639CysGly: 1.639 ± 0.49
0.596CysHis: 0.596 ± 0.214
0.968CysIle: 0.968 ± 0.268
1.415CysLys: 1.415 ± 0.29
0.819CysLeu: 0.819 ± 0.27
0.372CysMet: 0.372 ± 0.153
0.894CysAsn: 0.894 ± 0.254
0.968CysPro: 0.968 ± 0.44
0.372CysGln: 0.372 ± 0.16
0.745CysArg: 0.745 ± 0.259
0.67CysSer: 0.67 ± 0.228
0.447CysThr: 0.447 ± 0.205
0.521CysVal: 0.521 ± 0.186
0.521CysTrp: 0.521 ± 0.183
0.596CysTyr: 0.596 ± 0.206
0.0CysXaa: 0.0 ± 0.0
Asp
6.406AspAla: 6.406 ± 0.65
1.043AspCys: 1.043 ± 0.324
4.842AspAsp: 4.842 ± 0.748
5.289AspGlu: 5.289 ± 0.693
3.054AspPhe: 3.054 ± 0.474
6.034AspGly: 6.034 ± 1.002
1.192AspHis: 1.192 ± 0.278
4.469AspIle: 4.469 ± 0.584
4.32AspLys: 4.32 ± 0.726
2.905AspLeu: 2.905 ± 0.456
2.235AspMet: 2.235 ± 0.497
2.533AspAsn: 2.533 ± 0.43
1.639AspPro: 1.639 ± 0.317
1.266AspGln: 1.266 ± 0.333
2.384AspArg: 2.384 ± 0.511
3.054AspSer: 3.054 ± 0.624
2.682AspThr: 2.682 ± 0.459
4.171AspVal: 4.171 ± 0.53
0.745AspTrp: 0.745 ± 0.223
2.905AspTyr: 2.905 ± 0.528
0.0AspXaa: 0.0 ± 0.0
Glu
5.214GluAla: 5.214 ± 0.744
1.266GluCys: 1.266 ± 0.294
3.128GluAsp: 3.128 ± 0.491
4.767GluGlu: 4.767 ± 0.542
2.98GluPhe: 2.98 ± 0.574
2.831GluGly: 2.831 ± 0.396
0.596GluHis: 0.596 ± 0.22
4.767GluIle: 4.767 ± 0.547
4.246GluLys: 4.246 ± 0.659
5.885GluLeu: 5.885 ± 0.602
2.607GluMet: 2.607 ± 0.45
2.384GluAsn: 2.384 ± 0.323
1.937GluPro: 1.937 ± 0.365
4.171GluGln: 4.171 ± 0.626
3.352GluArg: 3.352 ± 0.641
4.022GluSer: 4.022 ± 0.595
2.533GluThr: 2.533 ± 0.41
4.097GluVal: 4.097 ± 0.638
1.713GluTrp: 1.713 ± 0.299
2.682GluTyr: 2.682 ± 0.424
0.0GluXaa: 0.0 ± 0.0
Phe
1.564PheAla: 1.564 ± 0.327
0.894PheCys: 0.894 ± 0.217
2.831PheAsp: 2.831 ± 0.41
2.235PheGlu: 2.235 ± 0.428
0.819PhePhe: 0.819 ± 0.271
2.682PheGly: 2.682 ± 0.349
0.521PheHis: 0.521 ± 0.204
2.384PheIle: 2.384 ± 0.467
1.713PheLys: 1.713 ± 0.312
1.937PheLeu: 1.937 ± 0.426
1.564PheMet: 1.564 ± 0.319
2.309PheAsn: 2.309 ± 0.456
1.043PhePro: 1.043 ± 0.276
1.192PheGln: 1.192 ± 0.277
1.639PheArg: 1.639 ± 0.354
2.682PheSer: 2.682 ± 0.495
1.788PheThr: 1.788 ± 0.359
1.639PheVal: 1.639 ± 0.333
0.894PheTrp: 0.894 ± 0.243
0.968PheTyr: 0.968 ± 0.21
0.0PheXaa: 0.0 ± 0.0
Gly
5.736GlyAla: 5.736 ± 0.697
1.639GlyCys: 1.639 ± 0.553
5.14GlyAsp: 5.14 ± 0.682
4.693GlyGlu: 4.693 ± 0.576
2.16GlyPhe: 2.16 ± 0.5
4.842GlyGly: 4.842 ± 0.845
1.117GlyHis: 1.117 ± 0.286
4.395GlyIle: 4.395 ± 0.594
6.406GlyLys: 6.406 ± 0.76
4.171GlyLeu: 4.171 ± 0.429
2.384GlyMet: 2.384 ± 0.38
2.98GlyAsn: 2.98 ± 0.387
0.745GlyPro: 0.745 ± 0.224
2.384GlyGln: 2.384 ± 0.447
3.277GlyArg: 3.277 ± 0.571
4.842GlySer: 4.842 ± 0.736
4.022GlyThr: 4.022 ± 0.618
5.959GlyVal: 5.959 ± 0.756
0.894GlyTrp: 0.894 ± 0.228
3.724GlyTyr: 3.724 ± 0.491
0.0GlyXaa: 0.0 ± 0.0
His
0.894HisAla: 0.894 ± 0.335
0.447HisCys: 0.447 ± 0.171
1.043HisAsp: 1.043 ± 0.27
1.192HisGlu: 1.192 ± 0.277
0.521HisPhe: 0.521 ± 0.198
2.533HisGly: 2.533 ± 0.765
0.596HisHis: 0.596 ± 0.217
0.968HisIle: 0.968 ± 0.342
1.341HisLys: 1.341 ± 0.279
1.266HisLeu: 1.266 ± 0.309
0.521HisMet: 0.521 ± 0.211
0.894HisAsn: 0.894 ± 0.192
0.67HisPro: 0.67 ± 0.212
0.596HisGln: 0.596 ± 0.207
1.117HisArg: 1.117 ± 0.3
0.67HisSer: 0.67 ± 0.193
0.447HisThr: 0.447 ± 0.235
1.564HisVal: 1.564 ± 0.316
0.223HisTrp: 0.223 ± 0.186
0.894HisTyr: 0.894 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
6.108IleAla: 6.108 ± 0.69
1.192IleCys: 1.192 ± 0.338
5.512IleAsp: 5.512 ± 0.712
4.097IleGlu: 4.097 ± 0.451
2.16IlePhe: 2.16 ± 0.346
4.544IleGly: 4.544 ± 0.585
1.49IleHis: 1.49 ± 0.366
4.544IleIle: 4.544 ± 0.769
4.767IleLys: 4.767 ± 0.608
3.724IleLeu: 3.724 ± 0.518
2.235IleMet: 2.235 ± 0.489
3.724IleAsn: 3.724 ± 0.569
2.533IlePro: 2.533 ± 0.386
2.086IleGln: 2.086 ± 0.465
3.128IleArg: 3.128 ± 0.5
4.395IleSer: 4.395 ± 0.727
3.724IleThr: 3.724 ± 0.47
4.32IleVal: 4.32 ± 0.499
0.447IleTrp: 0.447 ± 0.184
1.639IleTyr: 1.639 ± 0.369
0.0IleXaa: 0.0 ± 0.0
Lys
6.778LysAla: 6.778 ± 0.88
1.266LysCys: 1.266 ± 0.335
3.724LysAsp: 3.724 ± 0.628
4.246LysGlu: 4.246 ± 0.648
2.011LysPhe: 2.011 ± 0.413
3.724LysGly: 3.724 ± 0.622
1.415LysHis: 1.415 ± 0.391
4.32LysIle: 4.32 ± 0.598
4.767LysLys: 4.767 ± 0.781
5.289LysLeu: 5.289 ± 0.691
2.831LysMet: 2.831 ± 0.537
4.097LysAsn: 4.097 ± 0.629
2.384LysPro: 2.384 ± 0.466
3.203LysGln: 3.203 ± 0.614
4.097LysArg: 4.097 ± 0.583
4.991LysSer: 4.991 ± 0.646
3.799LysThr: 3.799 ± 0.654
4.693LysVal: 4.693 ± 0.572
1.341LysTrp: 1.341 ± 0.282
3.128LysTyr: 3.128 ± 0.576
0.0LysXaa: 0.0 ± 0.0
Leu
5.438LeuAla: 5.438 ± 0.657
0.894LeuCys: 0.894 ± 0.273
3.277LeuAsp: 3.277 ± 0.344
3.65LeuGlu: 3.65 ± 0.555
2.086LeuPhe: 2.086 ± 0.389
4.842LeuGly: 4.842 ± 0.632
1.266LeuHis: 1.266 ± 0.264
4.842LeuIle: 4.842 ± 0.54
5.438LeuLys: 5.438 ± 0.674
4.246LeuLeu: 4.246 ± 0.674
1.49LeuMet: 1.49 ± 0.284
3.352LeuAsn: 3.352 ± 0.551
2.98LeuPro: 2.98 ± 0.671
2.682LeuGln: 2.682 ± 0.511
4.32LeuArg: 4.32 ± 0.523
5.438LeuSer: 5.438 ± 0.623
4.842LeuThr: 4.842 ± 0.576
4.842LeuVal: 4.842 ± 0.557
1.117LeuTrp: 1.117 ± 0.282
2.235LeuTyr: 2.235 ± 0.448
0.0LeuXaa: 0.0 ± 0.0
Met
3.352MetAla: 3.352 ± 0.432
0.447MetCys: 0.447 ± 0.191
1.415MetAsp: 1.415 ± 0.301
1.415MetGlu: 1.415 ± 0.355
0.894MetPhe: 0.894 ± 0.339
1.415MetGly: 1.415 ± 0.351
0.67MetHis: 0.67 ± 0.206
2.309MetIle: 2.309 ± 0.47
2.905MetLys: 2.905 ± 0.477
2.235MetLeu: 2.235 ± 0.407
1.043MetMet: 1.043 ± 0.264
2.086MetAsn: 2.086 ± 0.327
1.49MetPro: 1.49 ± 0.329
1.564MetGln: 1.564 ± 0.393
1.937MetArg: 1.937 ± 0.361
2.384MetSer: 2.384 ± 0.391
1.713MetThr: 1.713 ± 0.385
1.564MetVal: 1.564 ± 0.349
0.447MetTrp: 0.447 ± 0.137
1.043MetTyr: 1.043 ± 0.303
0.0MetXaa: 0.0 ± 0.0
Asn
5.065AsnAla: 5.065 ± 0.704
0.67AsnCys: 0.67 ± 0.249
3.352AsnAsp: 3.352 ± 0.516
2.458AsnGlu: 2.458 ± 0.384
0.894AsnPhe: 0.894 ± 0.307
4.991AsnGly: 4.991 ± 0.582
1.415AsnHis: 1.415 ± 0.344
2.384AsnIle: 2.384 ± 0.377
3.724AsnLys: 3.724 ± 0.487
3.054AsnLeu: 3.054 ± 0.371
1.117AsnMet: 1.117 ± 0.289
2.458AsnAsn: 2.458 ± 0.37
1.341AsnPro: 1.341 ± 0.343
1.49AsnGln: 1.49 ± 0.346
2.458AsnArg: 2.458 ± 0.327
2.98AsnSer: 2.98 ± 0.504
2.756AsnThr: 2.756 ± 0.587
3.128AsnVal: 3.128 ± 0.693
0.596AsnTrp: 0.596 ± 0.185
1.639AsnTyr: 1.639 ± 0.332
0.0AsnXaa: 0.0 ± 0.0
Pro
3.054ProAla: 3.054 ± 0.583
0.596ProCys: 0.596 ± 0.213
2.384ProAsp: 2.384 ± 0.43
2.831ProGlu: 2.831 ± 0.409
1.043ProPhe: 1.043 ± 0.276
2.011ProGly: 2.011 ± 0.529
0.67ProHis: 0.67 ± 0.177
1.713ProIle: 1.713 ± 0.322
1.788ProLys: 1.788 ± 0.351
1.713ProLeu: 1.713 ± 0.41
0.819ProMet: 0.819 ± 0.209
1.192ProAsn: 1.192 ± 0.322
0.745ProPro: 0.745 ± 0.276
1.117ProGln: 1.117 ± 0.302
1.117ProArg: 1.117 ± 0.277
2.235ProSer: 2.235 ± 0.474
1.415ProThr: 1.415 ± 0.267
2.98ProVal: 2.98 ± 0.493
0.447ProTrp: 0.447 ± 0.201
1.564ProTyr: 1.564 ± 0.372
0.0ProXaa: 0.0 ± 0.0
Gln
2.384GlnAla: 2.384 ± 0.502
0.298GlnCys: 0.298 ± 0.124
1.49GlnAsp: 1.49 ± 0.275
2.682GlnGlu: 2.682 ± 0.357
2.086GlnPhe: 2.086 ± 0.389
1.788GlnGly: 1.788 ± 0.372
1.043GlnHis: 1.043 ± 0.26
2.309GlnIle: 2.309 ± 0.471
2.533GlnLys: 2.533 ± 0.472
3.724GlnLeu: 3.724 ± 0.629
1.564GlnMet: 1.564 ± 0.406
1.266GlnAsn: 1.266 ± 0.292
1.341GlnPro: 1.341 ± 0.305
3.128GlnGln: 3.128 ± 0.805
2.086GlnArg: 2.086 ± 0.46
2.756GlnSer: 2.756 ± 0.54
2.384GlnThr: 2.384 ± 0.5
2.086GlnVal: 2.086 ± 0.327
0.745GlnTrp: 0.745 ± 0.23
1.117GlnTyr: 1.117 ± 0.277
0.0GlnXaa: 0.0 ± 0.0
Arg
4.693ArgAla: 4.693 ± 0.541
1.117ArgCys: 1.117 ± 0.406
3.501ArgAsp: 3.501 ± 0.625
3.054ArgGlu: 3.054 ± 0.447
1.192ArgPhe: 1.192 ± 0.31
2.831ArgGly: 2.831 ± 0.492
0.596ArgHis: 0.596 ± 0.201
3.799ArgIle: 3.799 ± 0.573
3.873ArgLys: 3.873 ± 0.526
3.799ArgLeu: 3.799 ± 0.565
1.415ArgMet: 1.415 ± 0.302
3.724ArgAsn: 3.724 ± 0.543
1.043ArgPro: 1.043 ± 0.255
1.937ArgGln: 1.937 ± 0.369
2.533ArgArg: 2.533 ± 0.397
3.277ArgSer: 3.277 ± 0.4
1.639ArgThr: 1.639 ± 0.408
3.352ArgVal: 3.352 ± 0.485
0.596ArgTrp: 0.596 ± 0.26
2.831ArgTyr: 2.831 ± 0.521
0.0ArgXaa: 0.0 ± 0.0
Ser
5.512SerAla: 5.512 ± 0.872
1.192SerCys: 1.192 ± 0.296
4.693SerAsp: 4.693 ± 0.615
3.948SerGlu: 3.948 ± 0.353
2.682SerPhe: 2.682 ± 0.382
6.257SerGly: 6.257 ± 0.638
0.596SerHis: 0.596 ± 0.178
4.618SerIle: 4.618 ± 0.738
4.618SerLys: 4.618 ± 0.76
5.289SerLeu: 5.289 ± 0.658
2.086SerMet: 2.086 ± 0.461
2.235SerAsn: 2.235 ± 0.34
2.384SerPro: 2.384 ± 0.439
2.533SerGln: 2.533 ± 0.553
2.533SerArg: 2.533 ± 0.395
2.905SerSer: 2.905 ± 0.522
4.171SerThr: 4.171 ± 0.758
3.203SerVal: 3.203 ± 0.402
0.968SerTrp: 0.968 ± 0.259
1.862SerTyr: 1.862 ± 0.346
0.0SerXaa: 0.0 ± 0.0
Thr
4.544ThrAla: 4.544 ± 0.784
0.596ThrCys: 0.596 ± 0.188
3.054ThrAsp: 3.054 ± 0.561
4.246ThrGlu: 4.246 ± 0.677
1.713ThrPhe: 1.713 ± 0.474
5.214ThrGly: 5.214 ± 0.605
0.745ThrHis: 0.745 ± 0.211
2.831ThrIle: 2.831 ± 0.557
3.352ThrLys: 3.352 ± 0.539
3.799ThrLeu: 3.799 ± 0.692
1.49ThrMet: 1.49 ± 0.31
2.458ThrAsn: 2.458 ± 0.476
2.905ThrPro: 2.905 ± 0.485
1.564ThrGln: 1.564 ± 0.312
2.98ThrArg: 2.98 ± 0.466
3.128ThrSer: 3.128 ± 0.562
3.501ThrThr: 3.501 ± 0.788
3.128ThrVal: 3.128 ± 0.643
0.745ThrTrp: 0.745 ± 0.271
2.011ThrTyr: 2.011 ± 0.381
0.0ThrXaa: 0.0 ± 0.0
Val
6.034ValAla: 6.034 ± 0.711
0.67ValCys: 0.67 ± 0.218
3.948ValAsp: 3.948 ± 0.521
4.32ValGlu: 4.32 ± 0.624
1.937ValPhe: 1.937 ± 0.436
3.426ValGly: 3.426 ± 0.705
1.341ValHis: 1.341 ± 0.322
5.661ValIle: 5.661 ± 0.561
4.544ValLys: 4.544 ± 0.504
3.873ValLeu: 3.873 ± 0.596
2.458ValMet: 2.458 ± 0.361
4.171ValAsn: 4.171 ± 0.573
1.266ValPro: 1.266 ± 0.352
2.384ValGln: 2.384 ± 0.349
3.352ValArg: 3.352 ± 0.506
4.395ValSer: 4.395 ± 0.533
3.873ValThr: 3.873 ± 1.015
4.246ValVal: 4.246 ± 0.653
1.043ValTrp: 1.043 ± 0.246
2.533ValTyr: 2.533 ± 0.445
0.0ValXaa: 0.0 ± 0.0
Trp
0.819TrpAla: 0.819 ± 0.263
0.223TrpCys: 0.223 ± 0.121
0.894TrpAsp: 0.894 ± 0.252
0.596TrpGlu: 0.596 ± 0.288
0.968TrpPhe: 0.968 ± 0.287
0.298TrpGly: 0.298 ± 0.208
0.521TrpHis: 0.521 ± 0.212
1.117TrpIle: 1.117 ± 0.283
1.415TrpLys: 1.415 ± 0.287
1.639TrpLeu: 1.639 ± 0.353
0.447TrpMet: 0.447 ± 0.191
0.745TrpAsn: 0.745 ± 0.241
0.596TrpPro: 0.596 ± 0.199
0.521TrpGln: 0.521 ± 0.161
1.266TrpArg: 1.266 ± 0.281
0.819TrpSer: 0.819 ± 0.275
0.968TrpThr: 0.968 ± 0.282
1.266TrpVal: 1.266 ± 0.306
0.223TrpTrp: 0.223 ± 0.13
0.298TrpTyr: 0.298 ± 0.127
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.203TyrAla: 3.203 ± 0.41
0.372TyrCys: 0.372 ± 0.162
3.426TyrAsp: 3.426 ± 0.609
1.937TyrGlu: 1.937 ± 0.434
1.49TyrPhe: 1.49 ± 0.352
2.905TyrGly: 2.905 ± 0.476
1.117TyrHis: 1.117 ± 0.318
2.086TyrIle: 2.086 ± 0.382
2.384TyrLys: 2.384 ± 0.469
2.235TyrLeu: 2.235 ± 0.489
0.819TyrMet: 0.819 ± 0.219
1.043TyrAsn: 1.043 ± 0.247
1.266TyrPro: 1.266 ± 0.293
1.192TyrGln: 1.192 ± 0.294
1.862TyrArg: 1.862 ± 0.399
3.426TyrSer: 3.426 ± 0.627
2.384TyrThr: 2.384 ± 0.514
2.682TyrVal: 2.682 ± 0.48
0.819TyrTrp: 0.819 ± 0.254
1.192TyrTyr: 1.192 ± 0.314
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (13426 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski