Amino acid dipeptide frequency for Salmonella phage SPC32N

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
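The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the function names, the toy sequences, and the choice to normalize by the total number of dipeptides are assumptions made for the example. Dipeptides are counted as overlapping pairs within each protein, and any residue outside the 20 standard ones is mapped to Xaa.

```python
from collections import Counter
from itertools import product

# Three-letter codes for the 20 standard amino acids, keyed by one-letter code.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def to_three_letter(residue):
    """Map a one-letter code to its three-letter code; anything else becomes Xaa."""
    return THREE_LETTER.get(residue, "Xaa")


def dipeptide_counts(proteins):
    """Count overlapping dipeptides within each protein, never across proteins."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[to_three_letter(first) + to_three_letter(second)] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return all 441 dipeptide frequencies in per mille.

    Assumption: frequencies are normalized by the total number of dipeptides.
    """
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    labels = sorted(set(THREE_LETTER.values())) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(labels, repeat=2)
    }


# Hypothetical toy sequences, not taken from the SPC32N proteome.
toy_proteins = ["MKAAL", "AAXG"]
freqs = dipeptide_per_mille(toy_proteins)
print(len(freqs))
print(round(freqs["AlaAla"], 3), round(freqs["AlaXaa"], 3))
```

Under the uniform assumption each of the 400 standard dipeptides would sit at 1000/400 = 2.5‰, so a value well above or below that in the output signals over- or underrepresentation.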

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
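As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and compares them with the uniform expectation of 1000/400 = 2.5‰ (or 1000/441 if Xaa is included). The helper names are hypothetical, and the simple greater-than/less-than rule is an assumption; the page does not state the exact threshold used for the red and blue marking.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def uniform_expectation_per_mille(n_residue_types=20):
    """Expected per mille frequency if every dipeptide were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def representation(value_per_mille, n_residue_types=20):
    """Classify a value against the uniform expectation (assumed rule)."""
    expected = uniform_expectation_per_mille(n_residue_types)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Two values from the table: AlaAla and CysCys.
for name, value in [("AlaAla", 11.724), ("CysCys", 0.083)]:
    print(name, round(per_mille_to_fraction(value), 6), representation(value))
```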

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Second residues are listed in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 11.724 ± 1.65
AlaCys: 0.582 ± 0.227
AlaAsp: 5.405 ± 0.987
AlaGlu: 5.82 ± 0.988
AlaPhe: 3.908 ± 0.479
AlaGly: 7.317 ± 0.929
AlaHis: 1.164 ± 0.262
AlaIle: 4.24 ± 0.588
AlaLys: 4.573 ± 0.732
AlaLeu: 8.398 ± 0.832
AlaMet: 3.409 ± 0.488
AlaAsn: 3.658 ± 0.551
AlaPro: 3.409 ± 0.66
AlaGln: 5.488 ± 1.043
AlaArg: 6.153 ± 0.888
AlaSer: 6.569 ± 0.614
AlaThr: 4.822 ± 0.624
AlaVal: 7.649 ± 0.838
AlaTrp: 1.58 ± 0.282
AlaTyr: 3.575 ± 0.62
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.998 ± 0.263
CysCys: 0.083 ± 0.071
CysAsp: 0.499 ± 0.186
CysGlu: 0.665 ± 0.223
CysPhe: 0.499 ± 0.171
CysGly: 0.831 ± 0.244
CysHis: 0.083 ± 0.08
CysIle: 1.164 ± 0.34
CysLys: 0.582 ± 0.267
CysLeu: 0.582 ± 0.228
CysMet: 0.249 ± 0.14
CysAsn: 0.665 ± 0.253
CysPro: 0.582 ± 0.272
CysGln: 0.166 ± 0.135
CysArg: 0.748 ± 0.239
CysSer: 0.582 ± 0.206
CysThr: 0.333 ± 0.174
CysVal: 0.582 ± 0.254
CysTrp: 0.083 ± 0.085
CysTyr: 0.166 ± 0.111
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.903 ± 0.939
AspCys: 0.831 ± 0.285
AspAsp: 4.24 ± 0.479
AspGlu: 3.658 ± 0.724
AspPhe: 2.079 ± 0.417
AspGly: 4.989 ± 0.752
AspHis: 1.081 ± 0.351
AspIle: 3.658 ± 0.515
AspLys: 3.326 ± 0.694
AspLeu: 4.989 ± 0.487
AspMet: 1.663 ± 0.406
AspAsn: 2.411 ± 0.408
AspPro: 2.162 ± 0.538
AspGln: 2.328 ± 0.473
AspArg: 2.578 ± 0.541
AspSer: 3.658 ± 0.517
AspThr: 2.578 ± 0.391
AspVal: 5.321 ± 0.83
AspTrp: 1.081 ± 0.334
AspTyr: 2.245 ± 0.508
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.987 ± 0.704
GluCys: 0.582 ± 0.231
GluAsp: 3.492 ± 0.584
GluGlu: 3.409 ± 0.64
GluPhe: 3.076 ± 0.463
GluGly: 2.91 ± 0.459
GluHis: 0.831 ± 0.264
GluIle: 2.91 ± 0.462
GluLys: 3.575 ± 0.544
GluLeu: 5.405 ± 0.66
GluMet: 1.497 ± 0.39
GluAsn: 2.079 ± 0.33
GluPro: 1.829 ± 0.431
GluGln: 3.658 ± 0.727
GluArg: 3.825 ± 0.568
GluSer: 3.492 ± 0.465
GluThr: 2.328 ± 0.446
GluVal: 3.326 ± 0.529
GluTrp: 0.831 ± 0.272
GluTyr: 2.162 ± 0.299
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.829 ± 0.328
PheCys: 0.499 ± 0.197
PheAsp: 2.411 ± 0.497
PheGlu: 2.411 ± 0.426
PhePhe: 1.663 ± 0.406
PheGly: 2.993 ± 0.411
PheHis: 0.416 ± 0.184
PheIle: 3.409 ± 0.471
PheLys: 2.162 ± 0.445
PheLeu: 1.829 ± 0.379
PheMet: 0.748 ± 0.241
PheAsn: 1.663 ± 0.293
PhePro: 1.413 ± 0.351
PheGln: 0.915 ± 0.303
PheArg: 2.411 ± 0.404
PheSer: 3.409 ± 0.48
PheThr: 1.746 ± 0.364
PheVal: 1.746 ± 0.492
PheTrp: 0.416 ± 0.163
PheTyr: 1.58 ± 0.459
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.652 ± 0.902
GlyCys: 0.665 ± 0.267
GlyAsp: 4.573 ± 0.527
GlyGlu: 4.739 ± 0.486
GlyPhe: 3.16 ± 0.64
GlyGly: 5.571 ± 0.746
GlyHis: 1.33 ± 0.268
GlyIle: 4.906 ± 0.639
GlyLys: 4.822 ± 0.568
GlyLeu: 4.989 ± 0.717
GlyMet: 2.328 ± 0.427
GlyAsn: 3.16 ± 0.559
GlyPro: 1.746 ± 0.382
GlyGln: 3.742 ± 0.763
GlyArg: 3.575 ± 0.633
GlySer: 4.989 ± 0.836
GlyThr: 4.324 ± 0.545
GlyVal: 5.238 ± 0.843
GlyTrp: 1.413 ± 0.334
GlyTyr: 2.245 ± 0.351
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.081 ± 0.329
HisCys: 0.416 ± 0.234
HisAsp: 1.164 ± 0.273
HisGlu: 1.33 ± 0.36
HisPhe: 0.416 ± 0.212
HisGly: 1.58 ± 0.404
HisHis: 0.416 ± 0.176
HisIle: 0.748 ± 0.209
HisLys: 0.748 ± 0.269
HisLeu: 1.413 ± 0.361
HisMet: 0.582 ± 0.202
HisAsn: 0.915 ± 0.297
HisPro: 0.582 ± 0.216
HisGln: 0.249 ± 0.194
HisArg: 0.831 ± 0.264
HisSer: 0.831 ± 0.29
HisThr: 0.831 ± 0.253
HisVal: 0.915 ± 0.247
HisTrp: 0.665 ± 0.223
HisTyr: 0.166 ± 0.098
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.238 ± 0.679
IleCys: 0.748 ± 0.268
IleAsp: 3.16 ± 0.562
IleGlu: 3.243 ± 0.538
IlePhe: 1.912 ± 0.547
IleGly: 4.822 ± 0.581
IleHis: 0.582 ± 0.212
IleIle: 3.658 ± 1.069
IleLys: 2.827 ± 0.575
IleLeu: 3.243 ± 0.831
IleMet: 0.998 ± 0.328
IleAsn: 3.825 ± 0.536
IlePro: 2.578 ± 0.412
IleGln: 2.328 ± 0.351
IleArg: 3.076 ± 0.476
IleSer: 4.822 ± 0.92
IleThr: 3.742 ± 0.602
IleVal: 3.326 ± 0.653
IleTrp: 1.247 ± 0.258
IleTyr: 1.663 ± 0.433
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.656 ± 0.705
LysCys: 0.665 ± 0.254
LysAsp: 2.328 ± 0.415
LysGlu: 2.245 ± 0.433
LysPhe: 1.58 ± 0.353
LysGly: 2.827 ± 0.643
LysHis: 1.081 ± 0.307
LysIle: 1.996 ± 0.363
LysLys: 3.243 ± 0.596
LysLeu: 4.573 ± 0.531
LysMet: 1.247 ± 0.305
LysAsn: 2.744 ± 0.587
LysPro: 3.409 ± 0.69
LysGln: 1.996 ± 0.464
LysArg: 3.825 ± 0.747
LysSer: 3.575 ± 0.709
LysThr: 3.409 ± 0.499
LysVal: 3.409 ± 0.585
LysTrp: 1.247 ± 0.375
LysTyr: 1.996 ± 0.477
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.065 ± 0.681
LeuCys: 0.998 ± 0.304
LeuAsp: 4.24 ± 0.461
LeuGlu: 3.492 ± 0.562
LeuPhe: 2.661 ± 0.463
LeuGly: 4.822 ± 0.951
LeuHis: 1.58 ± 0.37
LeuIle: 4.573 ± 0.619
LeuLys: 4.157 ± 0.77
LeuLeu: 5.405 ± 0.907
LeuMet: 2.245 ± 0.322
LeuAsn: 3.243 ± 0.517
LeuPro: 2.827 ± 0.494
LeuGln: 3.16 ± 0.609
LeuArg: 4.573 ± 0.569
LeuSer: 6.236 ± 0.639
LeuThr: 6.818 ± 0.758
LeuVal: 4.24 ± 0.511
LeuTrp: 1.081 ± 0.359
LeuTyr: 1.663 ± 0.309
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.16 ± 0.525
MetCys: 0.416 ± 0.163
MetAsp: 1.497 ± 0.36
MetGlu: 1.164 ± 0.352
MetPhe: 0.582 ± 0.261
MetGly: 1.663 ± 0.437
MetHis: 0.0 ± 0.0
MetIle: 1.413 ± 0.306
MetLys: 1.829 ± 0.491
MetLeu: 1.996 ± 0.459
MetMet: 0.998 ± 0.3
MetAsn: 1.912 ± 0.42
MetPro: 1.247 ± 0.376
MetGln: 1.081 ± 0.305
MetArg: 1.33 ± 0.562
MetSer: 2.744 ± 0.348
MetThr: 1.497 ± 0.32
MetVal: 2.079 ± 0.329
MetTrp: 0.166 ± 0.117
MetTyr: 0.998 ± 0.394
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.074 ± 0.509
AsnCys: 0.249 ± 0.142
AsnAsp: 2.91 ± 0.448
AsnGlu: 2.993 ± 0.551
AsnPhe: 0.998 ± 0.266
AsnGly: 3.742 ± 0.654
AsnHis: 0.831 ± 0.281
AsnIle: 2.245 ± 0.489
AsnLys: 1.912 ± 0.4
AsnLeu: 3.742 ± 0.797
AsnMet: 1.247 ± 0.298
AsnAsn: 2.661 ± 0.585
AsnPro: 2.91 ± 0.613
AsnGln: 2.578 ± 0.492
AsnArg: 2.245 ± 0.434
AsnSer: 2.494 ± 0.48
AsnThr: 3.076 ± 0.498
AsnVal: 2.827 ± 0.483
AsnTrp: 0.831 ± 0.309
AsnTyr: 2.079 ± 0.648
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.157 ± 1.122
ProCys: 0.249 ± 0.144
ProAsp: 3.243 ± 0.565
ProGlu: 3.825 ± 0.618
ProPhe: 1.081 ± 0.253
ProGly: 3.243 ± 0.394
ProHis: 0.831 ± 0.281
ProIle: 1.746 ± 0.325
ProLys: 1.164 ± 0.352
ProLeu: 2.661 ± 0.466
ProMet: 0.915 ± 0.243
ProAsn: 0.831 ± 0.258
ProPro: 1.746 ± 0.387
ProGln: 1.829 ± 0.386
ProArg: 1.746 ± 0.423
ProSer: 3.076 ± 0.532
ProThr: 2.411 ± 0.677
ProVal: 3.326 ± 0.548
ProTrp: 0.748 ± 0.272
ProTyr: 1.164 ± 0.31
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.987 ± 1.379
GlnCys: 0.582 ± 0.213
GlnAsp: 2.245 ± 0.347
GlnGlu: 2.91 ± 0.402
GlnPhe: 1.912 ± 0.442
GlnGly: 2.744 ± 0.541
GlnHis: 0.998 ± 0.347
GlnIle: 2.162 ± 0.476
GlnLys: 1.912 ± 0.45
GlnLeu: 4.49 ± 0.858
GlnMet: 1.497 ± 0.387
GlnAsn: 1.247 ± 0.286
GlnPro: 1.912 ± 0.303
GlnGln: 3.742 ± 0.745
GlnArg: 3.991 ± 0.608
GlnSer: 1.746 ± 0.374
GlnThr: 2.744 ± 0.504
GlnVal: 2.494 ± 0.698
GlnTrp: 0.831 ± 0.273
GlnTyr: 1.996 ± 0.443
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.571 ± 0.848
ArgCys: 0.748 ± 0.281
ArgAsp: 4.24 ± 0.51
ArgGlu: 3.742 ± 0.519
ArgPhe: 2.162 ± 0.317
ArgGly: 3.16 ± 0.457
ArgHis: 0.998 ± 0.245
ArgIle: 4.157 ± 0.485
ArgLys: 3.575 ± 0.545
ArgLeu: 4.157 ± 0.543
ArgMet: 1.081 ± 0.344
ArgAsn: 3.326 ± 0.325
ArgPro: 0.915 ± 0.243
ArgGln: 3.742 ± 0.571
ArgArg: 4.739 ± 0.657
ArgSer: 2.411 ± 0.525
ArgThr: 3.16 ± 0.439
ArgVal: 3.326 ± 0.693
ArgTrp: 1.497 ± 0.355
ArgTyr: 1.912 ± 0.365
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.236 ± 0.724
SerCys: 0.582 ± 0.243
SerAsp: 3.742 ± 0.758
SerGlu: 3.243 ± 0.426
SerPhe: 2.661 ± 0.393
SerGly: 5.654 ± 0.775
SerHis: 0.998 ± 0.258
SerIle: 3.742 ± 0.735
SerLys: 3.991 ± 0.664
SerLeu: 4.573 ± 0.797
SerMet: 1.996 ± 0.472
SerAsn: 3.326 ± 0.515
SerPro: 2.494 ± 0.434
SerGln: 3.326 ± 0.63
SerArg: 3.742 ± 0.497
SerSer: 4.074 ± 0.692
SerThr: 3.991 ± 0.622
SerVal: 4.906 ± 0.728
SerTrp: 1.081 ± 0.258
SerTyr: 1.746 ± 0.627
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.735 ± 0.94
ThrCys: 0.333 ± 0.183
ThrAsp: 4.24 ± 0.445
ThrGlu: 2.328 ± 0.494
ThrPhe: 1.413 ± 0.349
ThrGly: 6.402 ± 0.712
ThrHis: 0.748 ± 0.264
ThrIle: 3.243 ± 0.558
ThrLys: 2.079 ± 0.331
ThrLeu: 4.24 ± 0.609
ThrMet: 1.746 ± 0.388
ThrAsn: 2.744 ± 0.416
ThrPro: 3.243 ± 0.59
ThrGln: 3.16 ± 0.673
ThrArg: 2.411 ± 0.455
ThrSer: 3.825 ± 0.637
ThrThr: 2.661 ± 0.407
ThrVal: 4.157 ± 0.76
ThrTrp: 0.831 ± 0.26
ThrTyr: 1.829 ± 0.371
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.485 ± 0.795
ValCys: 0.416 ± 0.179
ValAsp: 4.324 ± 0.562
ValGlu: 3.742 ± 0.491
ValPhe: 1.912 ± 0.369
ValGly: 4.157 ± 0.551
ValHis: 0.831 ± 0.253
ValIle: 4.407 ± 0.778
ValLys: 3.409 ± 0.539
ValLeu: 4.989 ± 0.652
ValMet: 1.996 ± 0.456
ValAsn: 4.324 ± 0.631
ValPro: 2.578 ± 0.464
ValGln: 2.411 ± 0.402
ValArg: 4.157 ± 0.526
ValSer: 4.49 ± 0.578
ValThr: 4.906 ± 0.558
ValVal: 3.908 ± 0.636
ValTrp: 0.748 ± 0.263
ValTyr: 1.746 ± 0.384
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.33 ± 0.361
TrpCys: 0.166 ± 0.119
TrpAsp: 0.915 ± 0.31
TrpGlu: 0.831 ± 0.245
TrpPhe: 0.665 ± 0.238
TrpGly: 1.746 ± 0.351
TrpHis: 0.249 ± 0.141
TrpIle: 0.748 ± 0.246
TrpLys: 0.831 ± 0.257
TrpLeu: 1.746 ± 0.358
TrpMet: 0.499 ± 0.203
TrpAsn: 0.665 ± 0.235
TrpPro: 0.499 ± 0.255
TrpGln: 0.665 ± 0.276
TrpArg: 0.665 ± 0.237
TrpSer: 1.746 ± 0.442
TrpThr: 0.998 ± 0.331
TrpVal: 1.164 ± 0.369
TrpTrp: 0.166 ± 0.105
TrpTyr: 0.665 ± 0.224
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.575 ± 0.565
TyrCys: 0.333 ± 0.176
TyrAsp: 1.912 ± 0.327
TyrGlu: 1.413 ± 0.327
TyrPhe: 1.33 ± 0.476
TyrGly: 2.993 ± 0.461
TyrHis: 0.831 ± 0.373
TyrIle: 1.912 ± 0.465
TyrLys: 1.33 ± 0.353
TyrLeu: 2.661 ± 0.487
TyrMet: 0.748 ± 0.246
TyrAsn: 1.247 ± 0.255
TyrPro: 1.912 ± 0.485
TyrGln: 1.58 ± 0.366
TyrArg: 1.996 ± 0.311
TyrSer: 1.33 ± 0.305
TyrThr: 1.996 ± 0.515
TyrVal: 2.079 ± 0.451
TyrTrp: 0.416 ± 0.156
TyrTyr: 1.413 ± 0.348
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 51 proteins (12028 amino acids)
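
For readers who want to work with the values programmatically, the following is one possible way to read entries of the form "AlaAla: 11.724 ± 1.65" from the page text into a dictionary. The regular expression and function names are illustrative assumptions, and the three sample lines are copied from the table only to keep the example short.

```python
import re

# Matches "<First><Second>: <value> ± <error>" anywhere in the text,
# where each residue is a three-letter code such as Ala or Xaa.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entries(text):
    """Return {(first, second): (value_per_mille, error_per_mille)}."""
    table = {}
    for first, second, value, error in ENTRY.findall(text):
        table[(first, second)] = (float(value), float(error))
    return table


def most_frequent(table, n=3):
    """Return the n dipeptides with the highest per mille value."""
    ranked = sorted(table.items(), key=lambda item: item[1][0], reverse=True)
    return [first + second for (first, second), _ in ranked[:n]]


sample = """AlaAla: 11.724 ± 1.65
AlaCys: 0.582 ± 0.227
CysCys: 0.083 ± 0.071"""

parsed = parse_entries(sample)
print(parsed[("Ala", "Ala")])
print(most_frequent(parsed, n=2))
```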

Note: The error was estimated by bootstrapping (×100) at the protein level.
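
A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported. Treating the error as the sample standard deviation over 100 resamples is an assumption, since the page does not say which spread measure is used; the function names, the fixed seed, and the toy sequences are likewise hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes, e.g. "AA")."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the SPC32N sequences.
toy_proteome = ["MAAKL", "MKAAAG", "MGLSV", "MAALAA"]
value = dipeptide_per_mille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AlaAla: {value:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.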

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski