Amino acid dipeptide frequency for Salmonella phage SPN1S

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
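As a rough illustration of how such frequencies could be computed, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein (never across protein boundaries) and that the result is normalized by the total number of pairs counted; the database may use a slightly different normalization. The function name and the toy sequences are hypothetical, and one-letter residue codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: dipeptides are overlapping adjacent pairs and never span
    two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale to per mille (parts per thousand).
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not actual SPN1S proteins).
toy_proteins = ["MAAK", "MAK"]
freqs = dipeptide_frequencies(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

In the toy input there are five dipeptides in total (MA, AA, AK from the first sequence; MA, AK from the second), so the frequencies sum to 1000‰.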

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
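The arithmetic behind the expected frequency and the per mille unit can be sketched as follows. This is only an illustration: the helper names are hypothetical, and the simple above/below comparison is an assumption, since the page does not state the exact rule used for the red and blue marking (for example, whether the error estimate is taken into account). The three observed values are taken from the table.

```python
def expected_per_mille(n_residues=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residues ** 2)


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected):
    """Assumed rule: compare the observed value with the expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


expected = expected_per_mille()          # 20 x 20 = 400 dipeptides
expected_with_xaa = expected_per_mille(21)  # 21 x 21 = 441 dipeptides

# Observed values (per mille) copied from the table.
observed = {"AlaAla": 11.841, "CysCys": 0.085, "LysVal": 3.322}
labels = {pair: classify(v, expected) for pair, v in observed.items()}

print(expected, round(expected_with_xaa, 3))
print(labels)
print(per_mille_to_fraction(observed["AlaAla"]))
```

With 400 dipeptides the uniform expectation is 2.5‰, so under this assumed rule AlaAla (11.841‰) counts as overrepresented and CysCys (0.085‰) as underrepresented.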

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 11.841 ± 1.337
AlaCys: 0.596 ± 0.232
AlaAsp: 5.622 ± 0.84
AlaGlu: 5.963 ± 0.922
AlaPhe: 4.004 ± 0.6
AlaGly: 6.9 ± 0.872
AlaHis: 1.278 ± 0.263
AlaIle: 4.174 ± 0.631
AlaLys: 4.344 ± 0.627
AlaLeu: 8.263 ± 0.853
AlaMet: 3.578 ± 0.513
AlaAsn: 3.663 ± 0.625
AlaPro: 3.493 ± 0.64
AlaGln: 5.707 ± 0.972
AlaArg: 6.048 ± 0.844
AlaSer: 6.389 ± 0.616
AlaThr: 4.856 ± 0.716
AlaVal: 7.582 ± 1.058
AlaTrp: 1.619 ± 0.327
AlaTyr: 3.663 ± 0.723
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.852 ± 0.282
CysCys: 0.085 ± 0.086
CysAsp: 0.511 ± 0.194
CysGlu: 0.681 ± 0.205
CysPhe: 0.426 ± 0.19
CysGly: 0.852 ± 0.288
CysHis: 0.085 ± 0.085
CysIle: 1.107 ± 0.285
CysLys: 0.596 ± 0.295
CysLeu: 0.681 ± 0.232
CysMet: 0.256 ± 0.174
CysAsn: 0.681 ± 0.24
CysPro: 0.511 ± 0.219
CysGln: 0.17 ± 0.117
CysArg: 0.767 ± 0.279
CysSer: 0.596 ± 0.21
CysThr: 0.341 ± 0.192
CysVal: 0.596 ± 0.285
CysTrp: 0.085 ± 0.075
CysTyr: 0.256 ± 0.145
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.048 ± 0.976
AspCys: 0.852 ± 0.245
AspAsp: 4.174 ± 0.425
AspGlu: 3.833 ± 0.754
AspPhe: 2.215 ± 0.38
AspGly: 5.026 ± 0.699
AspHis: 1.107 ± 0.38
AspIle: 3.748 ± 0.507
AspLys: 3.237 ± 0.776
AspLeu: 5.111 ± 0.523
AspMet: 1.704 ± 0.356
AspAsn: 2.47 ± 0.405
AspPro: 2.13 ± 0.597
AspGln: 2.385 ± 0.398
AspArg: 2.385 ± 0.536
AspSer: 3.748 ± 0.502
AspThr: 2.641 ± 0.426
AspVal: 5.452 ± 0.859
AspTrp: 1.107 ± 0.272
AspTyr: 2.215 ± 0.421
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.133 ± 0.756
GluCys: 0.596 ± 0.261
GluAsp: 3.578 ± 0.591
GluGlu: 3.578 ± 0.8
GluPhe: 2.982 ± 0.519
GluGly: 2.896 ± 0.484
GluHis: 0.852 ± 0.253
GluIle: 2.982 ± 0.538
GluLys: 3.663 ± 0.693
GluLeu: 5.707 ± 0.702
GluMet: 1.704 ± 0.393
GluAsn: 2.13 ± 0.427
GluPro: 1.874 ± 0.406
GluGln: 3.748 ± 0.716
GluArg: 3.919 ± 0.623
GluSer: 3.407 ± 0.565
GluThr: 2.47 ± 0.397
GluVal: 3.407 ± 0.501
GluTrp: 0.852 ± 0.323
GluTyr: 2.215 ± 0.31
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.874 ± 0.394
PheCys: 0.596 ± 0.245
PheAsp: 2.385 ± 0.496
PheGlu: 2.385 ± 0.487
PhePhe: 1.278 ± 0.352
PheGly: 3.067 ± 0.457
PheHis: 0.426 ± 0.207
PheIle: 3.322 ± 0.528
PheLys: 2.044 ± 0.453
PheLeu: 1.874 ± 0.478
PheMet: 0.681 ± 0.253
PheAsn: 1.789 ± 0.334
PhePro: 1.193 ± 0.248
PheGln: 0.767 ± 0.297
PheArg: 2.385 ± 0.404
PheSer: 3.322 ± 0.49
PheThr: 1.789 ± 0.419
PheVal: 1.704 ± 0.404
PheTrp: 0.341 ± 0.142
PheTyr: 1.363 ± 0.424
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.304 ± 0.86
GlyCys: 0.596 ± 0.218
GlyAsp: 4.6 ± 0.588
GlyGlu: 5.026 ± 0.554
GlyPhe: 3.322 ± 0.719
GlyGly: 5.537 ± 0.759
GlyHis: 1.278 ± 0.259
GlyIle: 4.941 ± 0.661
GlyLys: 4.77 ± 0.595
GlyLeu: 5.111 ± 0.775
GlyMet: 2.385 ± 0.433
GlyAsn: 3.152 ± 0.482
GlyPro: 1.704 ± 0.413
GlyGln: 3.748 ± 0.662
GlyArg: 3.663 ± 0.616
GlySer: 5.111 ± 0.646
GlyThr: 4.174 ± 0.659
GlyVal: 5.196 ± 0.751
GlyTrp: 1.363 ± 0.398
GlyTyr: 2.13 ± 0.355
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.107 ± 0.347
HisCys: 0.426 ± 0.226
HisAsp: 1.363 ± 0.27
HisGlu: 1.363 ± 0.378
HisPhe: 0.426 ± 0.216
HisGly: 1.704 ± 0.356
HisHis: 0.341 ± 0.169
HisIle: 0.767 ± 0.218
HisLys: 0.767 ± 0.246
HisLeu: 1.448 ± 0.346
HisMet: 0.596 ± 0.194
HisAsn: 0.852 ± 0.302
HisPro: 0.596 ± 0.208
HisGln: 0.256 ± 0.196
HisArg: 0.937 ± 0.298
HisSer: 0.937 ± 0.286
HisThr: 0.852 ± 0.221
HisVal: 0.767 ± 0.227
HisTrp: 0.681 ± 0.262
HisTyr: 0.085 ± 0.112
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.196 ± 0.8
IleCys: 0.767 ± 0.267
IleAsp: 3.237 ± 0.486
IleGlu: 3.152 ± 0.528
IlePhe: 1.704 ± 0.4
IleGly: 4.77 ± 0.683
IleHis: 0.596 ± 0.235
IleIle: 3.237 ± 0.728
IleLys: 2.641 ± 0.533
IleLeu: 2.726 ± 0.55
IleMet: 1.022 ± 0.34
IleAsn: 4.004 ± 0.569
IlePro: 2.556 ± 0.401
IleGln: 2.47 ± 0.476
IleArg: 3.152 ± 0.46
IleSer: 4.43 ± 0.661
IleThr: 3.833 ± 0.657
IleVal: 2.896 ± 0.534
IleTrp: 1.278 ± 0.26
IleTyr: 1.704 ± 0.428
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.515 ± 0.684
LysCys: 0.681 ± 0.228
LysAsp: 2.385 ± 0.409
LysGlu: 2.215 ± 0.462
LysPhe: 1.619 ± 0.32
LysGly: 2.811 ± 0.632
LysHis: 1.022 ± 0.293
LysIle: 1.959 ± 0.343
LysLys: 3.237 ± 0.606
LysLeu: 4.6 ± 0.533
LysMet: 1.363 ± 0.403
LysAsn: 2.811 ± 0.551
LysPro: 3.407 ± 0.835
LysGln: 2.044 ± 0.508
LysArg: 3.663 ± 0.727
LysSer: 3.237 ± 0.61
LysThr: 3.493 ± 0.444
LysVal: 3.322 ± 0.589
LysTrp: 1.193 ± 0.358
LysTyr: 2.044 ± 0.46
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.922 ± 0.826
LeuCys: 0.852 ± 0.274
LeuAsp: 4.344 ± 0.601
LeuGlu: 3.919 ± 0.639
LeuPhe: 2.726 ± 0.527
LeuGly: 4.856 ± 0.999
LeuHis: 1.619 ± 0.357
LeuIle: 4.174 ± 0.648
LeuLys: 4.174 ± 0.742
LeuLeu: 5.452 ± 0.828
LeuMet: 2.13 ± 0.384
LeuAsn: 3.322 ± 0.544
LeuPro: 2.811 ± 0.549
LeuGln: 3.237 ± 0.557
LeuArg: 4.43 ± 0.629
LeuSer: 6.133 ± 0.701
LeuThr: 6.985 ± 0.713
LeuVal: 4.515 ± 0.492
LeuTrp: 0.767 ± 0.231
LeuTyr: 1.789 ± 0.311
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.322 ± 0.624
MetCys: 0.426 ± 0.228
MetAsp: 1.448 ± 0.361
MetGlu: 1.193 ± 0.342
MetPhe: 0.681 ± 0.333
MetGly: 1.619 ± 0.355
MetHis: 0.0 ± 0.0
MetIle: 1.533 ± 0.385
MetLys: 1.959 ± 0.603
MetLeu: 2.13 ± 0.459
MetMet: 1.022 ± 0.332
MetAsn: 2.044 ± 0.396
MetPro: 1.363 ± 0.331
MetGln: 1.107 ± 0.339
MetArg: 1.363 ± 0.651
MetSer: 2.385 ± 0.316
MetThr: 1.619 ± 0.359
MetVal: 1.959 ± 0.369
MetTrp: 0.17 ± 0.124
MetTyr: 1.022 ± 0.316
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.089 ± 0.52
AsnCys: 0.256 ± 0.135
AsnAsp: 2.896 ± 0.441
AsnGlu: 3.067 ± 0.557
AsnPhe: 0.937 ± 0.293
AsnGly: 4.004 ± 0.769
AsnHis: 0.852 ± 0.26
AsnIle: 2.13 ± 0.53
AsnLys: 1.959 ± 0.427
AsnLeu: 3.919 ± 0.745
AsnMet: 1.278 ± 0.339
AsnAsn: 2.811 ± 0.587
AsnPro: 2.982 ± 0.576
AsnGln: 2.726 ± 0.485
AsnArg: 2.385 ± 0.508
AsnSer: 2.385 ± 0.542
AsnThr: 3.152 ± 0.584
AsnVal: 2.811 ± 0.489
AsnTrp: 0.852 ± 0.36
AsnTyr: 1.959 ± 0.6
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.004 ± 1.063
ProCys: 0.256 ± 0.129
ProAsp: 3.237 ± 0.605
ProGlu: 3.919 ± 0.74
ProPhe: 1.193 ± 0.301
ProGly: 3.237 ± 0.419
ProHis: 0.852 ± 0.314
ProIle: 1.619 ± 0.341
ProLys: 1.193 ± 0.357
ProLeu: 2.556 ± 0.493
ProMet: 0.852 ± 0.263
ProAsn: 0.852 ± 0.224
ProPro: 1.789 ± 0.429
ProGln: 1.789 ± 0.416
ProArg: 1.789 ± 0.421
ProSer: 3.067 ± 0.585
ProThr: 2.556 ± 0.584
ProVal: 3.407 ± 0.521
ProTrp: 0.767 ± 0.287
ProTyr: 1.107 ± 0.336
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 6.133 ± 1.15
GlnCys: 0.681 ± 0.235
GlnAsp: 2.13 ± 0.381
GlnGlu: 2.982 ± 0.459
GlnPhe: 1.874 ± 0.435
GlnGly: 2.726 ± 0.493
GlnHis: 1.022 ± 0.317
GlnIle: 2.13 ± 0.521
GlnLys: 1.959 ± 0.46
GlnLeu: 4.685 ± 0.842
GlnMet: 1.533 ± 0.411
GlnAsn: 1.363 ± 0.324
GlnPro: 1.959 ± 0.312
GlnGln: 3.748 ± 0.79
GlnArg: 4.004 ± 0.496
GlnSer: 1.619 ± 0.367
GlnThr: 2.811 ± 0.508
GlnVal: 2.556 ± 0.682
GlnTrp: 0.852 ± 0.348
GlnTyr: 1.959 ± 0.457
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.452 ± 0.899
ArgCys: 0.767 ± 0.271
ArgAsp: 4.344 ± 0.563
ArgGlu: 3.748 ± 0.542
ArgPhe: 1.874 ± 0.272
ArgGly: 3.067 ± 0.5
ArgHis: 1.107 ± 0.285
ArgIle: 4.089 ± 0.481
ArgLys: 3.663 ± 0.521
ArgLeu: 4.089 ± 0.496
ArgMet: 1.107 ± 0.37
ArgAsn: 3.407 ± 0.404
ArgPro: 0.852 ± 0.267
ArgGln: 3.833 ± 0.544
ArgArg: 4.515 ± 0.741
ArgSer: 2.47 ± 0.567
ArgThr: 3.237 ± 0.429
ArgVal: 3.578 ± 0.675
ArgTrp: 1.533 ± 0.327
ArgTyr: 1.789 ± 0.322
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.219 ± 0.72
SerCys: 0.341 ± 0.166
SerAsp: 3.833 ± 0.746
SerGlu: 3.237 ± 0.479
SerPhe: 2.641 ± 0.417
SerGly: 5.452 ± 0.884
SerHis: 1.107 ± 0.308
SerIle: 3.322 ± 0.651
SerLys: 3.748 ± 0.656
SerLeu: 4.174 ± 0.732
SerMet: 2.13 ± 0.529
SerAsn: 3.067 ± 0.497
SerPro: 2.47 ± 0.415
SerGln: 3.237 ± 0.581
SerArg: 3.493 ± 0.607
SerSer: 3.919 ± 0.659
SerThr: 4.004 ± 0.604
SerVal: 4.941 ± 0.685
SerTrp: 1.107 ± 0.239
SerTyr: 1.448 ± 0.312
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.985 ± 0.98
ThrCys: 0.341 ± 0.2
ThrAsp: 4.344 ± 0.546
ThrGlu: 2.556 ± 0.564
ThrPhe: 1.448 ± 0.315
ThrGly: 6.559 ± 0.858
ThrHis: 0.767 ± 0.25
ThrIle: 3.237 ± 0.561
ThrLys: 2.215 ± 0.305
ThrLeu: 4.174 ± 0.665
ThrMet: 1.789 ± 0.346
ThrAsn: 2.811 ± 0.405
ThrPro: 3.322 ± 0.528
ThrGln: 3.067 ± 0.565
ThrArg: 2.641 ± 0.455
ThrSer: 3.748 ± 0.615
ThrThr: 2.982 ± 0.447
ThrVal: 4.259 ± 0.705
ThrTrp: 0.852 ± 0.275
ThrTyr: 1.789 ± 0.323
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.645 ± 0.68
ValCys: 0.426 ± 0.175
ValAsp: 4.344 ± 0.613
ValGlu: 3.748 ± 0.489
ValPhe: 1.704 ± 0.323
ValGly: 4.089 ± 0.564
ValHis: 0.852 ± 0.246
ValIle: 4.259 ± 0.518
ValLys: 3.407 ± 0.538
ValLeu: 5.111 ± 0.686
ValMet: 1.959 ± 0.418
ValAsn: 4.344 ± 0.626
ValPro: 2.726 ± 0.552
ValGln: 2.385 ± 0.414
ValArg: 4.259 ± 0.749
ValSer: 4.259 ± 0.469
ValThr: 5.111 ± 0.584
ValVal: 4.089 ± 0.666
ValTrp: 0.681 ± 0.267
ValTyr: 1.704 ± 0.341
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.363 ± 0.324
TrpCys: 0.17 ± 0.126
TrpAsp: 0.937 ± 0.292
TrpGlu: 0.767 ± 0.267
TrpPhe: 0.596 ± 0.198
TrpGly: 1.704 ± 0.312
TrpHis: 0.256 ± 0.147
TrpIle: 0.767 ± 0.288
TrpLys: 0.852 ± 0.291
TrpLeu: 1.789 ± 0.514
TrpMet: 0.511 ± 0.201
TrpAsn: 0.681 ± 0.242
TrpPro: 0.426 ± 0.226
TrpGln: 0.681 ± 0.274
TrpArg: 0.681 ± 0.229
TrpSer: 1.448 ± 0.373
TrpThr: 1.022 ± 0.382
TrpVal: 1.193 ± 0.351
TrpTrp: 0.17 ± 0.103
TrpTyr: 0.681 ± 0.221
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.578 ± 0.593
TyrCys: 0.426 ± 0.237
TyrAsp: 1.874 ± 0.389
TyrGlu: 1.363 ± 0.322
TyrPhe: 0.937 ± 0.308
TyrGly: 3.237 ± 0.534
TyrHis: 0.852 ± 0.329
TyrIle: 1.874 ± 0.374
TyrLys: 1.193 ± 0.351
TyrLeu: 2.556 ± 0.533
TyrMet: 0.681 ± 0.288
TyrAsn: 1.193 ± 0.289
TyrPro: 1.874 ± 0.475
TyrGln: 1.619 ± 0.382
TyrArg: 2.044 ± 0.357
TyrSer: 1.278 ± 0.293
TyrThr: 1.959 ± 0.473
TyrVal: 1.959 ± 0.493
TyrTrp: 0.426 ± 0.165
TyrTyr: 1.278 ± 0.253
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 52 proteins (11740 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
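A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the sample standard deviation over the replicates is reported as the error. The function names, the fixed seed, and the toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille (overlapping pairs,
    counted within each protein only; an assumption of this sketch)."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not actual SPN1S proteins).
toy_proteome = ["MAAK", "MKKK", "MAAA", "MKAA"]
value = pair_frequency(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects how much a dipeptide's frequency depends on which proteins happen to be in the proteome.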

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski