Amino acid dipepetide frequency for Salmonella phage SEN34

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.566AlaAla: 10.566 ± 1.146
0.65AlaCys: 0.65 ± 0.194
6.096AlaAsp: 6.096 ± 0.826
5.852AlaGlu: 5.852 ± 0.778
3.576AlaPhe: 3.576 ± 0.504
7.559AlaGly: 7.559 ± 0.764
1.219AlaHis: 1.219 ± 0.277
4.958AlaIle: 4.958 ± 0.625
5.445AlaLys: 5.445 ± 0.65
8.29AlaLeu: 8.29 ± 0.708
1.869AlaMet: 1.869 ± 0.384
4.633AlaAsn: 4.633 ± 0.683
2.032AlaPro: 2.032 ± 0.376
3.657AlaGln: 3.657 ± 0.463
4.795AlaArg: 4.795 ± 0.696
5.445AlaSer: 5.445 ± 0.755
4.714AlaThr: 4.714 ± 0.969
5.608AlaVal: 5.608 ± 0.603
1.625AlaTrp: 1.625 ± 0.357
2.763AlaTyr: 2.763 ± 0.469
0.0AlaXaa: 0.0 ± 0.0
Cys
0.569CysAla: 0.569 ± 0.223
0.0CysCys: 0.0 ± 0.0
0.731CysAsp: 0.731 ± 0.225
0.488CysGlu: 0.488 ± 0.239
0.244CysPhe: 0.244 ± 0.14
0.65CysGly: 0.65 ± 0.175
0.244CysHis: 0.244 ± 0.187
0.569CysIle: 0.569 ± 0.259
0.975CysLys: 0.975 ± 0.303
0.406CysLeu: 0.406 ± 0.167
0.081CysMet: 0.081 ± 0.079
0.488CysAsn: 0.488 ± 0.241
0.163CysPro: 0.163 ± 0.125
0.0CysGln: 0.0 ± 0.0
0.813CysArg: 0.813 ± 0.283
1.138CysSer: 1.138 ± 0.291
0.65CysThr: 0.65 ± 0.22
0.65CysVal: 0.65 ± 0.242
0.0CysTrp: 0.0 ± 0.0
0.163CysTyr: 0.163 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
6.421AspAla: 6.421 ± 0.77
0.406AspCys: 0.406 ± 0.176
3.495AspAsp: 3.495 ± 0.547
4.226AspGlu: 4.226 ± 0.541
2.357AspPhe: 2.357 ± 0.467
5.527AspGly: 5.527 ± 0.631
0.244AspHis: 0.244 ± 0.164
4.47AspIle: 4.47 ± 0.611
3.901AspLys: 3.901 ± 0.623
5.039AspLeu: 5.039 ± 0.658
1.869AspMet: 1.869 ± 0.381
2.276AspAsn: 2.276 ± 0.434
2.113AspPro: 2.113 ± 0.472
2.113AspGln: 2.113 ± 0.396
2.52AspArg: 2.52 ± 0.477
3.414AspSer: 3.414 ± 0.484
3.414AspThr: 3.414 ± 0.585
3.901AspVal: 3.901 ± 0.422
1.3AspTrp: 1.3 ± 0.32
1.544AspTyr: 1.544 ± 0.377
0.0AspXaa: 0.0 ± 0.0
Glu
5.12GluAla: 5.12 ± 0.627
0.731GluCys: 0.731 ± 0.207
1.869GluAsp: 1.869 ± 0.383
2.357GluGlu: 2.357 ± 0.632
2.276GluPhe: 2.276 ± 0.393
3.82GluGly: 3.82 ± 0.459
0.813GluHis: 0.813 ± 0.274
4.145GluIle: 4.145 ± 0.622
3.657GluLys: 3.657 ± 0.569
7.071GluLeu: 7.071 ± 0.823
1.3GluMet: 1.3 ± 0.342
2.926GluAsn: 2.926 ± 0.412
1.625GluPro: 1.625 ± 0.312
3.17GluGln: 3.17 ± 0.59
4.145GluArg: 4.145 ± 0.665
3.251GluSer: 3.251 ± 0.526
3.251GluThr: 3.251 ± 0.501
3.576GluVal: 3.576 ± 0.673
0.975GluTrp: 0.975 ± 0.378
1.138GluTyr: 1.138 ± 0.259
0.0GluXaa: 0.0 ± 0.0
Phe
2.52PheAla: 2.52 ± 0.39
0.488PheCys: 0.488 ± 0.183
2.682PheAsp: 2.682 ± 0.435
1.951PheGlu: 1.951 ± 0.401
1.544PhePhe: 1.544 ± 0.438
2.52PheGly: 2.52 ± 0.411
0.65PheHis: 0.65 ± 0.183
1.951PheIle: 1.951 ± 0.412
1.707PheLys: 1.707 ± 0.347
2.438PheLeu: 2.438 ± 0.543
0.731PheMet: 0.731 ± 0.211
1.625PheAsn: 1.625 ± 0.407
2.357PhePro: 2.357 ± 0.418
0.813PheGln: 0.813 ± 0.223
1.951PheArg: 1.951 ± 0.504
2.845PheSer: 2.845 ± 0.524
2.601PheThr: 2.601 ± 0.442
1.869PheVal: 1.869 ± 0.436
0.569PheTrp: 0.569 ± 0.203
0.65PheTyr: 0.65 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
7.233GlyAla: 7.233 ± 0.761
0.406GlyCys: 0.406 ± 0.165
4.714GlyAsp: 4.714 ± 0.974
2.926GlyGlu: 2.926 ± 0.493
3.088GlyPhe: 3.088 ± 0.614
5.77GlyGly: 5.77 ± 1.123
0.894GlyHis: 0.894 ± 0.274
6.177GlyIle: 6.177 ± 0.647
4.633GlyLys: 4.633 ± 0.662
6.014GlyLeu: 6.014 ± 0.687
3.657GlyMet: 3.657 ± 0.492
3.901GlyAsn: 3.901 ± 0.598
1.219GlyPro: 1.219 ± 0.358
2.763GlyGln: 2.763 ± 0.525
4.389GlyArg: 4.389 ± 0.578
3.982GlySer: 3.982 ± 0.631
3.495GlyThr: 3.495 ± 0.517
5.608GlyVal: 5.608 ± 0.634
1.219GlyTrp: 1.219 ± 0.304
2.601GlyTyr: 2.601 ± 0.43
0.0GlyXaa: 0.0 ± 0.0
His
0.813HisAla: 0.813 ± 0.237
0.406HisCys: 0.406 ± 0.214
0.569HisAsp: 0.569 ± 0.221
0.813HisGlu: 0.813 ± 0.257
0.244HisPhe: 0.244 ± 0.13
1.057HisGly: 1.057 ± 0.318
0.894HisHis: 0.894 ± 0.309
1.057HisIle: 1.057 ± 0.239
0.325HisLys: 0.325 ± 0.152
1.3HisLeu: 1.3 ± 0.3
0.325HisMet: 0.325 ± 0.161
0.406HisAsn: 0.406 ± 0.183
0.975HisPro: 0.975 ± 0.293
0.65HisGln: 0.65 ± 0.208
1.057HisArg: 1.057 ± 0.31
1.138HisSer: 1.138 ± 0.295
0.488HisThr: 0.488 ± 0.182
0.813HisVal: 0.813 ± 0.233
0.406HisTrp: 0.406 ± 0.199
0.894HisTyr: 0.894 ± 0.284
0.0HisXaa: 0.0 ± 0.0
Ile
6.014IleAla: 6.014 ± 1.05
0.325IleCys: 0.325 ± 0.161
3.739IleAsp: 3.739 ± 0.567
4.226IleGlu: 4.226 ± 0.614
1.951IlePhe: 1.951 ± 0.755
4.389IleGly: 4.389 ± 0.565
0.894IleHis: 0.894 ± 0.288
2.438IleIle: 2.438 ± 0.501
3.088IleLys: 3.088 ± 0.463
4.551IleLeu: 4.551 ± 0.721
1.382IleMet: 1.382 ± 0.319
3.17IleAsn: 3.17 ± 0.573
2.763IlePro: 2.763 ± 0.575
2.194IleGln: 2.194 ± 0.374
3.576IleArg: 3.576 ± 0.503
5.77IleSer: 5.77 ± 0.653
5.689IleThr: 5.689 ± 0.668
2.845IleVal: 2.845 ± 0.423
0.894IleTrp: 0.894 ± 0.241
1.788IleTyr: 1.788 ± 0.387
0.0IleXaa: 0.0 ± 0.0
Lys
5.608LysAla: 5.608 ± 0.767
0.325LysCys: 0.325 ± 0.169
3.576LysAsp: 3.576 ± 0.547
3.17LysGlu: 3.17 ± 0.553
1.544LysPhe: 1.544 ± 0.283
3.82LysGly: 3.82 ± 0.58
1.138LysHis: 1.138 ± 0.269
2.926LysIle: 2.926 ± 0.485
3.739LysLys: 3.739 ± 0.716
4.714LysLeu: 4.714 ± 0.644
1.788LysMet: 1.788 ± 0.394
2.601LysAsn: 2.601 ± 0.527
3.007LysPro: 3.007 ± 0.563
2.52LysGln: 2.52 ± 0.381
2.438LysArg: 2.438 ± 0.417
3.657LysSer: 3.657 ± 0.565
3.982LysThr: 3.982 ± 0.65
3.17LysVal: 3.17 ± 0.449
1.057LysTrp: 1.057 ± 0.322
2.194LysTyr: 2.194 ± 0.403
0.0LysXaa: 0.0 ± 0.0
Leu
8.127LeuAla: 8.127 ± 0.828
1.057LeuCys: 1.057 ± 0.261
5.039LeuAsp: 5.039 ± 0.591
3.332LeuGlu: 3.332 ± 0.557
2.276LeuPhe: 2.276 ± 0.534
5.933LeuGly: 5.933 ± 0.567
0.894LeuHis: 0.894 ± 0.235
6.583LeuIle: 6.583 ± 0.958
4.876LeuLys: 4.876 ± 0.612
6.99LeuLeu: 6.99 ± 0.74
2.52LeuMet: 2.52 ± 0.425
4.714LeuAsn: 4.714 ± 0.485
5.364LeuPro: 5.364 ± 0.611
3.007LeuGln: 3.007 ± 0.465
5.12LeuArg: 5.12 ± 0.605
6.014LeuSer: 6.014 ± 0.569
5.202LeuThr: 5.202 ± 0.644
4.795LeuVal: 4.795 ± 0.671
1.057LeuTrp: 1.057 ± 0.265
3.088LeuTyr: 3.088 ± 0.532
0.0LeuXaa: 0.0 ± 0.0
Met
2.845MetAla: 2.845 ± 0.469
0.488MetCys: 0.488 ± 0.186
1.951MetAsp: 1.951 ± 0.369
1.219MetGlu: 1.219 ± 0.383
0.813MetPhe: 0.813 ± 0.339
1.544MetGly: 1.544 ± 0.281
0.488MetHis: 0.488 ± 0.18
1.625MetIle: 1.625 ± 0.354
1.788MetLys: 1.788 ± 0.34
2.52MetLeu: 2.52 ± 0.432
0.731MetMet: 0.731 ± 0.209
1.3MetAsn: 1.3 ± 0.283
1.707MetPro: 1.707 ± 0.382
0.813MetGln: 0.813 ± 0.216
1.788MetArg: 1.788 ± 0.368
1.951MetSer: 1.951 ± 0.389
2.682MetThr: 2.682 ± 0.615
1.625MetVal: 1.625 ± 0.403
0.081MetTrp: 0.081 ± 0.084
0.163MetTyr: 0.163 ± 0.113
0.0MetXaa: 0.0 ± 0.0
Asn
4.308AsnAla: 4.308 ± 0.632
0.244AsnCys: 0.244 ± 0.126
2.682AsnAsp: 2.682 ± 0.493
2.926AsnGlu: 2.926 ± 0.544
1.382AsnPhe: 1.382 ± 0.421
4.064AsnGly: 4.064 ± 0.583
0.813AsnHis: 0.813 ± 0.277
3.007AsnIle: 3.007 ± 0.458
2.194AsnLys: 2.194 ± 0.363
3.332AsnLeu: 3.332 ± 0.448
1.382AsnMet: 1.382 ± 0.327
2.113AsnAsn: 2.113 ± 0.416
2.52AsnPro: 2.52 ± 0.388
2.276AsnGln: 2.276 ± 0.469
3.007AsnArg: 3.007 ± 0.422
2.601AsnSer: 2.601 ± 0.435
2.601AsnThr: 2.601 ± 0.412
3.088AsnVal: 3.088 ± 0.408
0.975AsnTrp: 0.975 ± 0.211
1.463AsnTyr: 1.463 ± 0.342
0.0AsnXaa: 0.0 ± 0.0
Pro
2.926ProAla: 2.926 ± 0.522
0.244ProCys: 0.244 ± 0.144
4.308ProAsp: 4.308 ± 0.523
3.739ProGlu: 3.739 ± 0.458
1.463ProPhe: 1.463 ± 0.399
2.194ProGly: 2.194 ± 0.516
0.731ProHis: 0.731 ± 0.227
2.113ProIle: 2.113 ± 0.337
2.194ProLys: 2.194 ± 0.439
3.17ProLeu: 3.17 ± 0.585
0.731ProMet: 0.731 ± 0.212
1.463ProAsn: 1.463 ± 0.342
1.544ProPro: 1.544 ± 0.372
1.544ProGln: 1.544 ± 0.291
2.113ProArg: 2.113 ± 0.384
1.869ProSer: 1.869 ± 0.428
2.276ProThr: 2.276 ± 0.359
3.739ProVal: 3.739 ± 0.543
0.813ProTrp: 0.813 ± 0.248
1.057ProTyr: 1.057 ± 0.263
0.0ProXaa: 0.0 ± 0.0
Gln
3.17GlnAla: 3.17 ± 0.516
0.569GlnCys: 0.569 ± 0.219
1.138GlnAsp: 1.138 ± 0.287
2.276GlnGlu: 2.276 ± 0.415
1.625GlnPhe: 1.625 ± 0.349
1.463GlnGly: 1.463 ± 0.347
0.569GlnHis: 0.569 ± 0.165
1.544GlnIle: 1.544 ± 0.37
2.52GlnLys: 2.52 ± 0.413
3.901GlnLeu: 3.901 ± 0.625
1.463GlnMet: 1.463 ± 0.352
1.951GlnAsn: 1.951 ± 0.378
1.544GlnPro: 1.544 ± 0.315
2.113GlnGln: 2.113 ± 0.41
1.869GlnArg: 1.869 ± 0.499
2.926GlnSer: 2.926 ± 0.433
2.926GlnThr: 2.926 ± 0.499
2.52GlnVal: 2.52 ± 0.487
0.894GlnTrp: 0.894 ± 0.263
1.3GlnTyr: 1.3 ± 0.305
0.0GlnXaa: 0.0 ± 0.0
Arg
4.551ArgAla: 4.551 ± 0.644
0.569ArgCys: 0.569 ± 0.228
2.926ArgAsp: 2.926 ± 0.585
4.064ArgGlu: 4.064 ± 0.533
1.463ArgPhe: 1.463 ± 0.359
3.982ArgGly: 3.982 ± 0.557
1.707ArgHis: 1.707 ± 0.397
4.308ArgIle: 4.308 ± 0.661
4.226ArgLys: 4.226 ± 0.704
5.77ArgLeu: 5.77 ± 0.674
1.869ArgMet: 1.869 ± 0.349
2.438ArgAsn: 2.438 ± 0.384
2.113ArgPro: 2.113 ± 0.346
2.52ArgGln: 2.52 ± 0.36
4.47ArgArg: 4.47 ± 0.637
2.763ArgSer: 2.763 ± 0.39
3.251ArgThr: 3.251 ± 0.46
3.576ArgVal: 3.576 ± 0.613
1.3ArgTrp: 1.3 ± 0.237
1.869ArgTyr: 1.869 ± 0.344
0.0ArgXaa: 0.0 ± 0.0
Ser
5.12SerAla: 5.12 ± 0.657
0.813SerCys: 0.813 ± 0.241
4.389SerAsp: 4.389 ± 0.577
3.982SerGlu: 3.982 ± 0.591
2.113SerPhe: 2.113 ± 0.42
5.283SerGly: 5.283 ± 0.711
0.488SerHis: 0.488 ± 0.188
3.901SerIle: 3.901 ± 0.599
2.845SerLys: 2.845 ± 0.436
6.339SerLeu: 6.339 ± 0.838
1.382SerMet: 1.382 ± 0.262
2.763SerAsn: 2.763 ± 0.38
2.438SerPro: 2.438 ± 0.394
2.438SerGln: 2.438 ± 0.438
3.901SerArg: 3.901 ± 0.579
3.82SerSer: 3.82 ± 0.577
4.145SerThr: 4.145 ± 0.512
4.876SerVal: 4.876 ± 0.642
1.3SerTrp: 1.3 ± 0.331
1.463SerTyr: 1.463 ± 0.252
0.0SerXaa: 0.0 ± 0.0
Thr
6.908ThrAla: 6.908 ± 0.945
0.325ThrCys: 0.325 ± 0.163
3.982ThrAsp: 3.982 ± 0.572
3.414ThrGlu: 3.414 ± 0.573
1.625ThrPhe: 1.625 ± 0.336
5.852ThrGly: 5.852 ± 0.991
1.057ThrHis: 1.057 ± 0.315
3.332ThrIle: 3.332 ± 0.405
3.007ThrLys: 3.007 ± 0.495
6.177ThrLeu: 6.177 ± 0.58
1.138ThrMet: 1.138 ± 0.33
3.088ThrAsn: 3.088 ± 0.498
2.682ThrPro: 2.682 ± 0.424
1.382ThrGln: 1.382 ± 0.337
4.226ThrArg: 4.226 ± 0.519
4.226ThrSer: 4.226 ± 0.587
4.064ThrThr: 4.064 ± 0.691
5.283ThrVal: 5.283 ± 0.865
0.975ThrTrp: 0.975 ± 0.254
2.357ThrTyr: 2.357 ± 0.409
0.0ThrXaa: 0.0 ± 0.0
Val
5.933ValAla: 5.933 ± 0.705
0.569ValCys: 0.569 ± 0.235
3.82ValAsp: 3.82 ± 0.505
4.226ValGlu: 4.226 ± 0.531
2.438ValPhe: 2.438 ± 0.566
4.958ValGly: 4.958 ± 0.719
0.406ValHis: 0.406 ± 0.183
3.901ValIle: 3.901 ± 0.751
3.17ValLys: 3.17 ± 0.536
4.551ValLeu: 4.551 ± 0.604
1.788ValMet: 1.788 ± 0.396
2.845ValAsn: 2.845 ± 0.477
2.438ValPro: 2.438 ± 0.413
2.113ValGln: 2.113 ± 0.443
3.82ValArg: 3.82 ± 0.506
3.982ValSer: 3.982 ± 0.57
6.177ValThr: 6.177 ± 0.773
4.389ValVal: 4.389 ± 0.612
0.65ValTrp: 0.65 ± 0.268
2.113ValTyr: 2.113 ± 0.355
0.0ValXaa: 0.0 ± 0.0
Trp
1.219TrpAla: 1.219 ± 0.349
0.244TrpCys: 0.244 ± 0.13
0.731TrpAsp: 0.731 ± 0.264
0.731TrpGlu: 0.731 ± 0.212
0.975TrpPhe: 0.975 ± 0.264
1.382TrpGly: 1.382 ± 0.31
0.325TrpHis: 0.325 ± 0.151
1.3TrpIle: 1.3 ± 0.286
0.813TrpLys: 0.813 ± 0.246
1.219TrpLeu: 1.219 ± 0.281
0.813TrpMet: 0.813 ± 0.234
0.406TrpAsn: 0.406 ± 0.188
0.406TrpPro: 0.406 ± 0.165
0.813TrpGln: 0.813 ± 0.21
1.3TrpArg: 1.3 ± 0.361
0.813TrpSer: 0.813 ± 0.252
1.544TrpThr: 1.544 ± 0.338
0.975TrpVal: 0.975 ± 0.256
0.488TrpTrp: 0.488 ± 0.186
0.569TrpTyr: 0.569 ± 0.218
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.382TyrAla: 1.382 ± 0.278
0.163TyrCys: 0.163 ± 0.111
2.113TyrAsp: 2.113 ± 0.379
1.3TyrGlu: 1.3 ± 0.3
1.463TyrPhe: 1.463 ± 0.315
2.926TyrGly: 2.926 ± 0.426
0.163TyrHis: 0.163 ± 0.111
1.382TyrIle: 1.382 ± 0.375
1.869TyrLys: 1.869 ± 0.448
2.276TyrLeu: 2.276 ± 0.398
1.219TyrMet: 1.219 ± 0.291
1.869TyrAsn: 1.869 ± 0.46
1.219TyrPro: 1.219 ± 0.353
1.382TyrGln: 1.382 ± 0.276
2.438TyrArg: 2.438 ± 0.406
2.194TyrSer: 2.194 ± 0.423
1.951TyrThr: 1.951 ± 0.404
1.382TyrVal: 1.382 ± 0.355
0.488TyrTrp: 0.488 ± 0.18
0.65TyrTyr: 0.65 ± 0.21
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (12305 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski