Amino acid dipepetide frequency for Salmonella phage vB_SenTO17

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.283AlaAla: 11.283 ± 1.419
0.827AlaCys: 0.827 ± 0.292
6.62AlaAsp: 6.62 ± 0.741
7.146AlaGlu: 7.146 ± 0.947
3.535AlaPhe: 3.535 ± 0.543
8.199AlaGly: 8.199 ± 0.812
1.655AlaHis: 1.655 ± 0.371
5.717AlaIle: 5.717 ± 0.737
6.168AlaLys: 6.168 ± 0.838
8.274AlaLeu: 8.274 ± 0.828
1.956AlaMet: 1.956 ± 0.476
3.987AlaAsn: 3.987 ± 0.662
2.858AlaPro: 2.858 ± 0.462
3.611AlaGln: 3.611 ± 0.694
4.889AlaArg: 4.889 ± 0.686
5.943AlaSer: 5.943 ± 0.665
5.867AlaThr: 5.867 ± 0.884
6.77AlaVal: 6.77 ± 0.791
1.504AlaTrp: 1.504 ± 0.354
3.009AlaTyr: 3.009 ± 0.558
0.0AlaXaa: 0.0 ± 0.0
Cys
1.279CysAla: 1.279 ± 0.337
0.301CysCys: 0.301 ± 0.162
0.602CysAsp: 0.602 ± 0.24
1.128CysGlu: 1.128 ± 0.376
0.15CysPhe: 0.15 ± 0.093
1.429CysGly: 1.429 ± 0.397
0.301CysHis: 0.301 ± 0.138
0.226CysIle: 0.226 ± 0.115
0.903CysLys: 0.903 ± 0.267
0.827CysLeu: 0.827 ± 0.333
0.075CysMet: 0.075 ± 0.082
0.752CysAsn: 0.752 ± 0.246
0.527CysPro: 0.527 ± 0.186
0.527CysGln: 0.527 ± 0.24
0.978CysArg: 0.978 ± 0.327
0.602CysSer: 0.602 ± 0.267
0.301CysThr: 0.301 ± 0.133
0.451CysVal: 0.451 ± 0.186
0.301CysTrp: 0.301 ± 0.137
0.827CysTyr: 0.827 ± 0.218
0.0CysXaa: 0.0 ± 0.0
Asp
6.845AspAla: 6.845 ± 0.692
0.527AspCys: 0.527 ± 0.275
4.438AspAsp: 4.438 ± 0.9
5.19AspGlu: 5.19 ± 0.994
2.257AspPhe: 2.257 ± 0.456
4.739AspGly: 4.739 ± 0.776
0.677AspHis: 0.677 ± 0.212
3.385AspIle: 3.385 ± 0.494
3.987AspLys: 3.987 ± 0.545
5.642AspLeu: 5.642 ± 0.71
1.655AspMet: 1.655 ± 0.384
2.708AspAsn: 2.708 ± 0.444
1.73AspPro: 1.73 ± 0.38
1.354AspGln: 1.354 ± 0.355
2.633AspArg: 2.633 ± 0.522
4.137AspSer: 4.137 ± 0.719
3.235AspThr: 3.235 ± 0.439
3.761AspVal: 3.761 ± 0.514
0.903AspTrp: 0.903 ± 0.221
2.257AspTyr: 2.257 ± 0.324
0.0AspXaa: 0.0 ± 0.0
Glu
5.867GluAla: 5.867 ± 1.055
0.451GluCys: 0.451 ± 0.279
4.664GluAsp: 4.664 ± 0.593
3.686GluGlu: 3.686 ± 0.695
2.031GluPhe: 2.031 ± 0.325
4.363GluGly: 4.363 ± 0.496
1.354GluHis: 1.354 ± 0.409
3.235GluIle: 3.235 ± 0.406
4.062GluLys: 4.062 ± 0.578
5.867GluLeu: 5.867 ± 0.752
2.332GluMet: 2.332 ± 0.451
2.708GluAsn: 2.708 ± 0.395
2.181GluPro: 2.181 ± 0.485
2.332GluGln: 2.332 ± 0.519
3.385GluArg: 3.385 ± 0.574
3.009GluSer: 3.009 ± 0.374
3.836GluThr: 3.836 ± 0.5
4.739GluVal: 4.739 ± 0.594
0.827GluTrp: 0.827 ± 0.215
1.881GluTyr: 1.881 ± 0.412
0.0GluXaa: 0.0 ± 0.0
Phe
2.558PheAla: 2.558 ± 0.334
0.602PheCys: 0.602 ± 0.215
2.407PheAsp: 2.407 ± 0.506
2.558PheGlu: 2.558 ± 0.306
0.827PhePhe: 0.827 ± 0.255
2.934PheGly: 2.934 ± 0.473
0.752PheHis: 0.752 ± 0.255
2.482PheIle: 2.482 ± 0.407
1.279PheLys: 1.279 ± 0.305
1.73PheLeu: 1.73 ± 0.384
0.527PheMet: 0.527 ± 0.194
2.181PheAsn: 2.181 ± 0.462
1.429PhePro: 1.429 ± 0.402
0.827PheGln: 0.827 ± 0.303
2.482PheArg: 2.482 ± 0.461
2.332PheSer: 2.332 ± 0.548
2.482PheThr: 2.482 ± 0.539
2.332PheVal: 2.332 ± 0.421
0.978PheTrp: 0.978 ± 0.313
0.978PheTyr: 0.978 ± 0.231
0.0PheXaa: 0.0 ± 0.0
Gly
6.544GlyAla: 6.544 ± 0.729
1.053GlyCys: 1.053 ± 0.264
4.062GlyAsp: 4.062 ± 0.503
4.137GlyGlu: 4.137 ± 0.495
3.31GlyPhe: 3.31 ± 0.502
6.62GlyGly: 6.62 ± 0.906
0.978GlyHis: 0.978 ± 0.327
3.535GlyIle: 3.535 ± 0.541
4.965GlyLys: 4.965 ± 0.726
5.416GlyLeu: 5.416 ± 0.636
2.031GlyMet: 2.031 ± 0.402
3.987GlyAsn: 3.987 ± 0.608
1.881GlyPro: 1.881 ± 0.352
3.31GlyGln: 3.31 ± 0.458
4.212GlyArg: 4.212 ± 0.563
6.168GlySer: 6.168 ± 0.852
3.987GlyThr: 3.987 ± 0.646
6.319GlyVal: 6.319 ± 0.683
1.279GlyTrp: 1.279 ± 0.374
2.858GlyTyr: 2.858 ± 0.441
0.0GlyXaa: 0.0 ± 0.0
His
1.354HisAla: 1.354 ± 0.277
0.602HisCys: 0.602 ± 0.182
1.279HisAsp: 1.279 ± 0.286
0.677HisGlu: 0.677 ± 0.219
0.527HisPhe: 0.527 ± 0.221
0.527HisGly: 0.527 ± 0.21
0.752HisHis: 0.752 ± 0.228
1.128HisIle: 1.128 ± 0.307
1.354HisLys: 1.354 ± 0.346
0.978HisLeu: 0.978 ± 0.323
0.451HisMet: 0.451 ± 0.231
0.827HisAsn: 0.827 ± 0.205
1.504HisPro: 1.504 ± 0.34
1.053HisGln: 1.053 ± 0.268
0.903HisArg: 0.903 ± 0.287
1.279HisSer: 1.279 ± 0.313
0.677HisThr: 0.677 ± 0.245
1.429HisVal: 1.429 ± 0.335
0.15HisTrp: 0.15 ± 0.109
0.301HisTyr: 0.301 ± 0.153
0.0HisXaa: 0.0 ± 0.0
Ile
5.115IleAla: 5.115 ± 0.786
0.752IleCys: 0.752 ± 0.252
3.235IleAsp: 3.235 ± 0.563
3.686IleGlu: 3.686 ± 0.419
1.354IlePhe: 1.354 ± 0.329
4.212IleGly: 4.212 ± 0.613
0.451IleHis: 0.451 ± 0.164
3.31IleIle: 3.31 ± 0.601
3.235IleLys: 3.235 ± 0.519
3.159IleLeu: 3.159 ± 0.436
0.978IleMet: 0.978 ± 0.247
2.858IleAsn: 2.858 ± 0.503
2.558IlePro: 2.558 ± 0.393
1.956IleGln: 1.956 ± 0.37
2.558IleArg: 2.558 ± 0.401
3.686IleSer: 3.686 ± 0.507
3.987IleThr: 3.987 ± 0.606
3.084IleVal: 3.084 ± 0.409
0.978IleTrp: 0.978 ± 0.282
1.504IleTyr: 1.504 ± 0.355
0.0IleXaa: 0.0 ± 0.0
Lys
7.071LysAla: 7.071 ± 0.815
1.128LysCys: 1.128 ± 0.319
2.708LysAsp: 2.708 ± 0.527
3.159LysGlu: 3.159 ± 0.559
2.332LysPhe: 2.332 ± 0.359
4.288LysGly: 4.288 ± 0.636
1.128LysHis: 1.128 ± 0.29
2.482LysIle: 2.482 ± 0.406
3.385LysLys: 3.385 ± 0.498
5.04LysLeu: 5.04 ± 0.661
2.332LysMet: 2.332 ± 0.514
2.783LysAsn: 2.783 ± 0.612
3.159LysPro: 3.159 ± 0.569
2.407LysGln: 2.407 ± 0.448
3.535LysArg: 3.535 ± 0.542
3.084LysSer: 3.084 ± 0.645
3.912LysThr: 3.912 ± 0.62
3.385LysVal: 3.385 ± 0.54
0.752LysTrp: 0.752 ± 0.238
2.558LysTyr: 2.558 ± 0.415
0.0LysXaa: 0.0 ± 0.0
Leu
8.35LeuAla: 8.35 ± 0.863
0.827LeuCys: 0.827 ± 0.293
4.438LeuAsp: 4.438 ± 0.548
5.266LeuGlu: 5.266 ± 0.63
1.805LeuPhe: 1.805 ± 0.335
4.363LeuGly: 4.363 ± 0.536
0.903LeuHis: 0.903 ± 0.244
3.912LeuIle: 3.912 ± 0.477
5.266LeuLys: 5.266 ± 0.724
5.943LeuLeu: 5.943 ± 0.857
1.429LeuMet: 1.429 ± 0.331
3.912LeuAsn: 3.912 ± 0.514
3.686LeuPro: 3.686 ± 0.591
3.535LeuGln: 3.535 ± 0.567
4.739LeuArg: 4.739 ± 0.611
4.363LeuSer: 4.363 ± 0.54
4.062LeuThr: 4.062 ± 0.628
5.04LeuVal: 5.04 ± 0.74
0.752LeuTrp: 0.752 ± 0.235
2.633LeuTyr: 2.633 ± 0.359
0.0LeuXaa: 0.0 ± 0.0
Met
2.934MetAla: 2.934 ± 0.459
0.602MetCys: 0.602 ± 0.198
1.354MetAsp: 1.354 ± 0.381
1.805MetGlu: 1.805 ± 0.345
1.053MetPhe: 1.053 ± 0.305
1.73MetGly: 1.73 ± 0.347
0.451MetHis: 0.451 ± 0.181
1.128MetIle: 1.128 ± 0.285
1.204MetLys: 1.204 ± 0.274
1.655MetLeu: 1.655 ± 0.353
1.053MetMet: 1.053 ± 0.275
1.279MetAsn: 1.279 ± 0.355
0.677MetPro: 0.677 ± 0.189
0.827MetGln: 0.827 ± 0.24
1.881MetArg: 1.881 ± 0.381
1.504MetSer: 1.504 ± 0.321
1.881MetThr: 1.881 ± 0.429
1.354MetVal: 1.354 ± 0.333
0.15MetTrp: 0.15 ± 0.091
0.752MetTyr: 0.752 ± 0.21
0.0MetXaa: 0.0 ± 0.0
Asn
3.836AsnAla: 3.836 ± 0.574
0.451AsnCys: 0.451 ± 0.197
3.686AsnAsp: 3.686 ± 0.473
2.257AsnGlu: 2.257 ± 0.418
1.429AsnPhe: 1.429 ± 0.265
4.965AsnGly: 4.965 ± 0.791
1.204AsnHis: 1.204 ± 0.277
3.009AsnIle: 3.009 ± 0.542
2.482AsnLys: 2.482 ± 0.513
3.084AsnLeu: 3.084 ± 0.374
0.903AsnMet: 0.903 ± 0.261
2.783AsnAsn: 2.783 ± 0.489
1.805AsnPro: 1.805 ± 0.407
1.504AsnGln: 1.504 ± 0.351
2.708AsnArg: 2.708 ± 0.504
2.708AsnSer: 2.708 ± 0.501
2.934AsnThr: 2.934 ± 0.583
2.708AsnVal: 2.708 ± 0.515
0.827AsnTrp: 0.827 ± 0.235
2.407AsnTyr: 2.407 ± 0.474
0.0AsnXaa: 0.0 ± 0.0
Pro
3.235ProAla: 3.235 ± 0.513
0.451ProCys: 0.451 ± 0.16
3.159ProAsp: 3.159 ± 0.409
4.513ProGlu: 4.513 ± 0.508
1.429ProPhe: 1.429 ± 0.4
3.009ProGly: 3.009 ± 0.625
0.602ProHis: 0.602 ± 0.226
1.429ProIle: 1.429 ± 0.334
2.181ProLys: 2.181 ± 0.517
2.934ProLeu: 2.934 ± 0.458
0.978ProMet: 0.978 ± 0.286
1.881ProAsn: 1.881 ± 0.513
0.827ProPro: 0.827 ± 0.323
1.354ProGln: 1.354 ± 0.365
1.58ProArg: 1.58 ± 0.388
2.257ProSer: 2.257 ± 0.411
1.73ProThr: 1.73 ± 0.368
3.46ProVal: 3.46 ± 0.55
0.301ProTrp: 0.301 ± 0.187
1.58ProTyr: 1.58 ± 0.455
0.0ProXaa: 0.0 ± 0.0
Gln
4.739GlnAla: 4.739 ± 0.694
0.527GlnCys: 0.527 ± 0.194
1.956GlnAsp: 1.956 ± 0.531
1.58GlnGlu: 1.58 ± 0.36
1.504GlnPhe: 1.504 ± 0.364
2.031GlnGly: 2.031 ± 0.413
1.204GlnHis: 1.204 ± 0.313
2.633GlnIle: 2.633 ± 0.573
2.257GlnLys: 2.257 ± 0.561
2.407GlnLeu: 2.407 ± 0.369
1.655GlnMet: 1.655 ± 0.371
1.881GlnAsn: 1.881 ± 0.42
2.257GlnPro: 2.257 ± 0.419
2.708GlnGln: 2.708 ± 0.842
2.407GlnArg: 2.407 ± 0.543
1.881GlnSer: 1.881 ± 0.316
1.956GlnThr: 1.956 ± 0.449
2.633GlnVal: 2.633 ± 0.462
0.602GlnTrp: 0.602 ± 0.262
1.429GlnTyr: 1.429 ± 0.293
0.0GlnXaa: 0.0 ± 0.0
Arg
5.115ArgAla: 5.115 ± 0.471
0.903ArgCys: 0.903 ± 0.323
3.235ArgAsp: 3.235 ± 0.458
2.558ArgGlu: 2.558 ± 0.43
1.956ArgPhe: 1.956 ± 0.389
4.589ArgGly: 4.589 ± 0.525
0.827ArgHis: 0.827 ± 0.245
3.159ArgIle: 3.159 ± 0.493
3.912ArgLys: 3.912 ± 0.628
3.761ArgLeu: 3.761 ± 0.496
1.128ArgMet: 1.128 ± 0.316
3.385ArgAsn: 3.385 ± 0.57
1.956ArgPro: 1.956 ± 0.57
3.761ArgGln: 3.761 ± 0.544
3.761ArgArg: 3.761 ± 0.55
2.407ArgSer: 2.407 ± 0.374
3.009ArgThr: 3.009 ± 0.449
4.363ArgVal: 4.363 ± 0.489
0.677ArgTrp: 0.677 ± 0.298
1.354ArgTyr: 1.354 ± 0.421
0.0ArgXaa: 0.0 ± 0.0
Ser
6.018SerAla: 6.018 ± 0.909
0.527SerCys: 0.527 ± 0.225
3.611SerAsp: 3.611 ± 0.506
2.558SerGlu: 2.558 ± 0.534
2.633SerPhe: 2.633 ± 0.497
4.889SerGly: 4.889 ± 0.81
1.58SerHis: 1.58 ± 0.37
2.633SerIle: 2.633 ± 0.465
3.235SerLys: 3.235 ± 0.465
4.513SerLeu: 4.513 ± 0.579
1.504SerMet: 1.504 ± 0.349
2.332SerAsn: 2.332 ± 0.359
2.407SerPro: 2.407 ± 0.43
2.106SerGln: 2.106 ± 0.542
3.385SerArg: 3.385 ± 0.586
2.708SerSer: 2.708 ± 0.513
4.288SerThr: 4.288 ± 0.596
4.965SerVal: 4.965 ± 0.839
0.978SerTrp: 0.978 ± 0.272
1.881SerTyr: 1.881 ± 0.437
0.0SerXaa: 0.0 ± 0.0
Thr
6.243ThrAla: 6.243 ± 0.78
0.376ThrCys: 0.376 ± 0.207
3.009ThrAsp: 3.009 ± 0.388
3.912ThrGlu: 3.912 ± 0.624
2.858ThrPhe: 2.858 ± 0.468
5.642ThrGly: 5.642 ± 0.779
0.677ThrHis: 0.677 ± 0.243
3.235ThrIle: 3.235 ± 0.464
2.482ThrLys: 2.482 ± 0.316
5.416ThrLeu: 5.416 ± 0.597
1.279ThrMet: 1.279 ± 0.319
1.655ThrAsn: 1.655 ± 0.331
3.46ThrPro: 3.46 ± 0.564
2.031ThrGln: 2.031 ± 0.428
2.858ThrArg: 2.858 ± 0.481
3.836ThrSer: 3.836 ± 0.712
3.761ThrThr: 3.761 ± 0.544
4.513ThrVal: 4.513 ± 0.635
0.677ThrTrp: 0.677 ± 0.247
2.332ThrTyr: 2.332 ± 0.502
0.0ThrXaa: 0.0 ± 0.0
Val
6.62ValAla: 6.62 ± 0.666
0.827ValCys: 0.827 ± 0.242
3.987ValAsp: 3.987 ± 0.395
3.836ValGlu: 3.836 ± 0.525
1.655ValPhe: 1.655 ± 0.423
4.664ValGly: 4.664 ± 0.701
1.279ValHis: 1.279 ± 0.286
4.212ValIle: 4.212 ± 0.527
5.19ValLys: 5.19 ± 0.663
4.664ValLeu: 4.664 ± 0.585
1.805ValMet: 1.805 ± 0.406
3.912ValAsn: 3.912 ± 0.521
2.332ValPro: 2.332 ± 0.493
2.558ValGln: 2.558 ± 0.493
3.987ValArg: 3.987 ± 0.507
3.836ValSer: 3.836 ± 0.506
5.341ValThr: 5.341 ± 0.794
5.04ValVal: 5.04 ± 0.863
0.978ValTrp: 0.978 ± 0.262
2.482ValTyr: 2.482 ± 0.42
0.0ValXaa: 0.0 ± 0.0
Trp
1.053TrpAla: 1.053 ± 0.297
0.15TrpCys: 0.15 ± 0.093
1.128TrpAsp: 1.128 ± 0.307
0.527TrpGlu: 0.527 ± 0.216
0.527TrpPhe: 0.527 ± 0.199
0.827TrpGly: 0.827 ± 0.254
0.451TrpHis: 0.451 ± 0.216
0.527TrpIle: 0.527 ± 0.178
0.451TrpLys: 0.451 ± 0.179
1.504TrpLeu: 1.504 ± 0.34
0.226TrpMet: 0.226 ± 0.127
0.602TrpAsn: 0.602 ± 0.213
0.527TrpPro: 0.527 ± 0.203
0.827TrpGln: 0.827 ± 0.253
1.354TrpArg: 1.354 ± 0.302
1.128TrpSer: 1.128 ± 0.321
1.128TrpThr: 1.128 ± 0.267
1.053TrpVal: 1.053 ± 0.23
0.226TrpTrp: 0.226 ± 0.12
0.376TrpTyr: 0.376 ± 0.157
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.761TyrAla: 3.761 ± 0.642
0.602TyrCys: 0.602 ± 0.221
2.482TyrAsp: 2.482 ± 0.446
2.633TyrGlu: 2.633 ± 0.437
1.504TyrPhe: 1.504 ± 0.336
2.633TyrGly: 2.633 ± 0.491
0.527TyrHis: 0.527 ± 0.178
1.279TyrIle: 1.279 ± 0.306
2.633TyrLys: 2.633 ± 0.423
2.407TyrLeu: 2.407 ± 0.386
0.903TyrMet: 0.903 ± 0.225
1.279TyrAsn: 1.279 ± 0.231
1.204TyrPro: 1.204 ± 0.32
1.655TyrGln: 1.655 ± 0.558
1.58TyrArg: 1.58 ± 0.341
1.881TyrSer: 1.881 ± 0.433
1.881TyrThr: 1.881 ± 0.37
1.805TyrVal: 1.805 ± 0.347
0.752TyrTrp: 0.752 ± 0.238
1.128TyrTyr: 1.128 ± 0.237
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (13295 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski