Amino acid dipeptide frequency for Salmonella phage vB_SenS-Ent3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
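As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name dipeptide_frequencies, the one-letter sequence input, and the normalization by the total number of dipeptides are assumptions made for this example; the page does not state exactly how Proteome-pI computes its values.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: normalizing by the total number of overlapping
    dipeptides is an assumption, not the documented Proteome-pI method.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences in one-letter code, not taken from the phage proteome.
freqs = dipeptide_frequencies(["MAAK", "AAG"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The order of residues matters here, so "AK" and "KA" are counted separately, just as AlaLys and LysAla are listed separately in the table.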

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
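The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper classify and the plain greater-than/less-than rule are assumptions for illustration; the page does not specify the exact threshold behind its red and blue marking. The two example values are copied from the table.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def classify(per_mille, expected=UNIFORM_PER_MILLE):
    """Convert a per-mille value to a fraction and compare it with `expected`.

    Hypothetical helper: a simple comparison against the uniform expectation,
    which may differ from the rule used to colour the original table.
    """
    fraction = per_mille * 1e-3
    if per_mille > expected:
        label = "over"
    elif per_mille < expected:
        label = "under"
    else:
        label = "as expected"
    return fraction, label


# Values taken from the table (in per mille).
for name, value in [("AlaAla", 11.045), ("CysCys", 0.152)]:
    fraction, label = classify(value)
    print(name, fraction, label)
```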

For more information, see the sequence space article on Wikipedia.

Values in ‰ (value ± error), grouped by first residue; within each group the second residue follows the column order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 11.045 ± 1.741
AlaCys: 1.219 ± 0.313
AlaAsp: 6.17 ± 0.843
AlaGlu: 6.399 ± 0.991
AlaPhe: 4.113 ± 0.53
AlaGly: 7.313 ± 0.733
AlaHis: 1.904 ± 0.379
AlaIle: 3.809 ± 0.546
AlaLys: 5.484 ± 0.895
AlaLeu: 7.389 ± 0.833
AlaMet: 2.514 ± 0.557
AlaAsn: 3.352 ± 0.486
AlaPro: 3.123 ± 0.418
AlaGln: 3.656 ± 0.731
AlaArg: 4.113 ± 0.491
AlaSer: 5.789 ± 0.959
AlaThr: 5.789 ± 0.65
AlaVal: 7.77 ± 0.962
AlaTrp: 1.143 ± 0.277
AlaTyr: 3.123 ± 0.466
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.609 ± 0.184
CysCys: 0.152 ± 0.109
CysAsp: 0.838 ± 0.236
CysGlu: 1.295 ± 0.385
CysPhe: 0.305 ± 0.159
CysGly: 0.838 ± 0.255
CysHis: 0.229 ± 0.144
CysIle: 0.229 ± 0.114
CysLys: 0.914 ± 0.289
CysLeu: 0.762 ± 0.275
CysMet: 0.381 ± 0.206
CysAsn: 0.457 ± 0.176
CysPro: 0.229 ± 0.124
CysGln: 0.152 ± 0.139
CysArg: 1.143 ± 0.302
CysSer: 0.381 ± 0.159
CysThr: 0.457 ± 0.151
CysVal: 0.609 ± 0.184
CysTrp: 0.305 ± 0.133
CysTyr: 0.305 ± 0.139
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.779 ± 0.734
AspCys: 0.686 ± 0.228
AspAsp: 3.885 ± 0.41
AspGlu: 3.961 ± 0.494
AspPhe: 3.123 ± 0.406
AspGly: 6.018 ± 0.877
AspHis: 0.762 ± 0.23
AspIle: 3.199 ± 0.353
AspLys: 3.504 ± 0.317
AspLeu: 4.799 ± 0.523
AspMet: 1.523 ± 0.231
AspAsn: 2.514 ± 0.513
AspPro: 1.752 ± 0.376
AspGln: 0.457 ± 0.188
AspArg: 2.895 ± 0.541
AspSer: 3.732 ± 0.426
AspThr: 4.19 ± 0.518
AspVal: 4.19 ± 0.46
AspTrp: 0.99 ± 0.287
AspTyr: 1.828 ± 0.381
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.094 ± 0.748
GluCys: 0.305 ± 0.148
GluAsp: 3.961 ± 0.546
GluGlu: 4.57 ± 0.904
GluPhe: 3.199 ± 0.847
GluGly: 4.799 ± 0.648
GluHis: 1.143 ± 0.292
GluIle: 3.58 ± 0.432
GluLys: 4.19 ± 0.635
GluLeu: 5.713 ± 0.752
GluMet: 2.742 ± 0.521
GluAsn: 2.514 ± 0.425
GluPro: 1.828 ± 0.556
GluGln: 3.352 ± 0.672
GluArg: 4.266 ± 0.618
GluSer: 3.885 ± 0.609
GluThr: 3.732 ± 0.464
GluVal: 4.647 ± 0.594
GluTrp: 0.762 ± 0.258
GluTyr: 1.676 ± 0.359
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.818 ± 0.493
PheCys: 0.533 ± 0.188
PheAsp: 3.123 ± 0.458
PheGlu: 2.742 ± 0.495
PhePhe: 0.686 ± 0.176
PheGly: 3.047 ± 0.357
PheHis: 0.533 ± 0.171
PheIle: 2.361 ± 0.433
PheLys: 1.828 ± 0.411
PheLeu: 2.361 ± 0.391
PheMet: 0.533 ± 0.189
PheAsn: 1.371 ± 0.399
PhePro: 1.6 ± 0.408
PheGln: 1.447 ± 0.364
PheArg: 2.057 ± 0.298
PheSer: 2.209 ± 0.489
PheThr: 3.428 ± 0.623
PheVal: 2.895 ± 0.458
PheTrp: 0.838 ± 0.251
PheTyr: 1.143 ± 0.347
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.16 ± 0.763
GlyCys: 0.99 ± 0.308
GlyAsp: 4.113 ± 0.66
GlyGlu: 5.408 ± 0.796
GlyPhe: 2.971 ± 0.504
GlyGly: 6.17 ± 0.851
GlyHis: 1.447 ± 0.468
GlyIle: 3.428 ± 0.438
GlyLys: 5.561 ± 0.572
GlyLeu: 5.637 ± 0.54
GlyMet: 2.438 ± 0.557
GlyAsn: 3.809 ± 0.477
GlyPro: 1.676 ± 0.323
GlyGln: 2.895 ± 0.414
GlyArg: 4.342 ± 0.48
GlySer: 4.951 ± 0.859
GlyThr: 4.037 ± 0.702
GlyVal: 5.941 ± 0.805
GlyTrp: 1.295 ± 0.315
GlyTyr: 2.895 ± 0.48
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.99 ± 0.256
HisCys: 0.381 ± 0.17
HisAsp: 1.143 ± 0.269
HisGlu: 0.762 ± 0.235
HisPhe: 0.533 ± 0.256
HisGly: 0.762 ± 0.274
HisHis: 0.686 ± 0.281
HisIle: 1.143 ± 0.283
HisLys: 0.99 ± 0.227
HisLeu: 1.066 ± 0.279
HisMet: 0.457 ± 0.17
HisAsn: 0.533 ± 0.247
HisPro: 0.99 ± 0.312
HisGln: 0.838 ± 0.211
HisArg: 1.219 ± 0.33
HisSer: 0.914 ± 0.214
HisThr: 0.914 ± 0.336
HisVal: 0.686 ± 0.205
HisTrp: 0.076 ± 0.078
HisTyr: 0.99 ± 0.316
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.266 ± 0.752
IleCys: 0.686 ± 0.257
IleAsp: 3.58 ± 0.508
IleGlu: 3.123 ± 0.512
IlePhe: 1.066 ± 0.3
IleGly: 3.123 ± 0.35
IleHis: 0.609 ± 0.161
IleIle: 2.285 ± 0.457
IleLys: 2.666 ± 0.449
IleLeu: 3.275 ± 0.533
IleMet: 1.143 ± 0.332
IleAsn: 2.133 ± 0.483
IlePro: 2.59 ± 0.461
IleGln: 1.904 ± 0.445
IleArg: 2.361 ± 0.291
IleSer: 3.123 ± 0.555
IleThr: 4.647 ± 0.523
IleVal: 3.047 ± 0.492
IleTrp: 0.686 ± 0.197
IleTyr: 1.371 ± 0.426
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.484 ± 0.805
LysCys: 0.838 ± 0.364
LysAsp: 3.504 ± 0.52
LysGlu: 4.494 ± 0.57
LysPhe: 2.438 ± 0.36
LysGly: 3.961 ± 0.496
LysHis: 1.066 ± 0.275
LysIle: 1.6 ± 0.325
LysLys: 3.275 ± 0.56
LysLeu: 5.408 ± 0.651
LysMet: 3.047 ± 0.555
LysAsn: 2.438 ± 0.381
LysPro: 2.514 ± 0.452
LysGln: 2.438 ± 0.393
LysArg: 3.961 ± 0.605
LysSer: 2.438 ± 0.52
LysThr: 3.961 ± 0.471
LysVal: 3.809 ± 0.495
LysTrp: 0.762 ± 0.231
LysTyr: 2.818 ± 0.379
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.693 ± 0.659
LeuCys: 0.838 ± 0.309
LeuAsp: 3.961 ± 0.512
LeuGlu: 5.256 ± 0.776
LeuPhe: 1.6 ± 0.374
LeuGly: 4.723 ± 0.535
LeuHis: 0.914 ± 0.248
LeuIle: 4.266 ± 0.493
LeuLys: 5.561 ± 0.549
LeuLeu: 6.018 ± 0.709
LeuMet: 2.285 ± 0.42
LeuAsn: 4.723 ± 0.566
LeuPro: 3.732 ± 0.609
LeuGln: 2.666 ± 0.34
LeuArg: 5.18 ± 0.606
LeuSer: 4.494 ± 0.516
LeuThr: 5.256 ± 0.371
LeuVal: 5.484 ± 0.534
LeuTrp: 1.447 ± 0.325
LeuTyr: 2.133 ± 0.32
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.895 ± 0.425
MetCys: 0.305 ± 0.15
MetAsp: 1.371 ± 0.36
MetGlu: 1.143 ± 0.206
MetPhe: 1.295 ± 0.346
MetGly: 1.904 ± 0.328
MetHis: 0.305 ± 0.14
MetIle: 1.066 ± 0.255
MetLys: 1.371 ± 0.316
MetLeu: 1.98 ± 0.386
MetMet: 0.762 ± 0.249
MetAsn: 0.99 ± 0.257
MetPro: 1.143 ± 0.371
MetGln: 1.066 ± 0.226
MetArg: 1.6 ± 0.338
MetSer: 2.057 ± 0.307
MetThr: 2.361 ± 0.347
MetVal: 1.98 ± 0.363
MetTrp: 0.533 ± 0.166
MetTyr: 0.762 ± 0.264
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.504 ± 0.508
AsnCys: 0.381 ± 0.152
AsnAsp: 2.818 ± 0.365
AsnGlu: 2.514 ± 0.432
AsnPhe: 1.828 ± 0.38
AsnGly: 4.57 ± 0.648
AsnHis: 0.533 ± 0.211
AsnIle: 2.742 ± 0.324
AsnLys: 1.98 ± 0.437
AsnLeu: 3.809 ± 0.409
AsnMet: 0.533 ± 0.207
AsnAsn: 2.514 ± 0.463
AsnPro: 1.676 ± 0.336
AsnGln: 1.371 ± 0.32
AsnArg: 2.514 ± 0.343
AsnSer: 2.057 ± 0.356
AsnThr: 2.285 ± 0.372
AsnVal: 4.037 ± 0.404
AsnTrp: 0.686 ± 0.214
AsnTyr: 1.295 ± 0.356
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.514 ± 0.348
ProCys: 0.305 ± 0.146
ProAsp: 2.742 ± 0.426
ProGlu: 3.352 ± 0.499
ProPhe: 1.752 ± 0.303
ProGly: 2.895 ± 0.433
ProHis: 0.609 ± 0.191
ProIle: 1.295 ± 0.351
ProLys: 2.895 ± 0.503
ProLeu: 3.428 ± 0.568
ProMet: 0.838 ± 0.268
ProAsn: 1.371 ± 0.506
ProPro: 1.219 ± 0.332
ProGln: 1.219 ± 0.296
ProArg: 2.057 ± 0.399
ProSer: 2.133 ± 0.338
ProThr: 1.371 ± 0.29
ProVal: 3.885 ± 0.495
ProTrp: 0.457 ± 0.246
ProTyr: 1.6 ± 0.411
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.418 ± 0.622
GlnCys: 0.305 ± 0.144
GlnAsp: 1.447 ± 0.336
GlnGlu: 2.361 ± 0.499
GlnPhe: 1.295 ± 0.335
GlnGly: 2.361 ± 0.478
GlnHis: 0.457 ± 0.181
GlnIle: 1.6 ± 0.263
GlnLys: 1.98 ± 0.367
GlnLeu: 2.971 ± 0.479
GlnMet: 1.143 ± 0.334
GlnAsn: 1.904 ± 0.381
GlnPro: 2.133 ± 0.388
GlnGln: 2.209 ± 0.627
GlnArg: 1.676 ± 0.332
GlnSer: 2.057 ± 0.349
GlnThr: 1.98 ± 0.369
GlnVal: 2.666 ± 0.405
GlnTrp: 0.609 ± 0.215
GlnTyr: 1.6 ± 0.31
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.723 ± 0.382
ArgCys: 0.457 ± 0.152
ArgAsp: 3.58 ± 0.462
ArgGlu: 3.809 ± 0.537
ArgPhe: 1.98 ± 0.365
ArgGly: 3.961 ± 0.523
ArgHis: 1.066 ± 0.266
ArgIle: 3.199 ± 0.446
ArgLys: 4.19 ± 0.645
ArgLeu: 4.037 ± 0.578
ArgMet: 1.904 ± 0.355
ArgAsn: 3.123 ± 0.486
ArgPro: 1.904 ± 0.35
ArgGln: 3.047 ± 0.568
ArgArg: 4.342 ± 0.681
ArgSer: 2.209 ± 0.331
ArgThr: 2.742 ± 0.414
ArgVal: 4.037 ± 0.451
ArgTrp: 0.914 ± 0.22
ArgTyr: 1.371 ± 0.392
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.779 ± 1.188
SerCys: 0.381 ± 0.241
SerAsp: 2.895 ± 0.467
SerGlu: 3.352 ± 0.537
SerPhe: 2.209 ± 0.365
SerGly: 6.627 ± 0.623
SerHis: 0.762 ± 0.163
SerIle: 2.666 ± 0.416
SerLys: 2.438 ± 0.445
SerLeu: 5.256 ± 0.492
SerMet: 1.143 ± 0.287
SerAsn: 2.742 ± 0.339
SerPro: 1.676 ± 0.375
SerGln: 2.057 ± 0.328
SerArg: 2.895 ± 0.495
SerSer: 3.199 ± 0.492
SerThr: 4.266 ± 0.56
SerVal: 5.332 ± 0.707
SerTrp: 0.838 ± 0.208
SerTyr: 2.057 ± 0.419
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.789 ± 0.733
ThrCys: 0.533 ± 0.184
ThrAsp: 4.418 ± 0.65
ThrGlu: 3.199 ± 0.448
ThrPhe: 2.895 ± 0.485
ThrGly: 6.246 ± 0.84
ThrHis: 0.99 ± 0.273
ThrIle: 2.818 ± 0.48
ThrLys: 3.123 ± 0.41
ThrLeu: 4.951 ± 0.598
ThrMet: 1.219 ± 0.325
ThrAsn: 1.752 ± 0.376
ThrPro: 4.037 ± 0.549
ThrGln: 2.057 ± 0.347
ThrArg: 2.895 ± 0.375
ThrSer: 5.332 ± 0.599
ThrThr: 3.961 ± 0.464
ThrVal: 4.647 ± 0.7
ThrTrp: 1.066 ± 0.273
ThrTyr: 2.361 ± 0.467
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.541 ± 0.967
ValCys: 0.762 ± 0.222
ValAsp: 4.418 ± 0.447
ValGlu: 6.17 ± 0.567
ValPhe: 2.361 ± 0.478
ValGly: 4.037 ± 0.512
ValHis: 0.914 ± 0.215
ValIle: 4.19 ± 0.535
ValLys: 4.951 ± 0.741
ValLeu: 4.723 ± 0.648
ValMet: 0.914 ± 0.326
ValAsn: 3.428 ± 0.527
ValPro: 2.438 ± 0.616
ValGln: 2.133 ± 0.369
ValArg: 3.885 ± 0.536
ValSer: 5.941 ± 0.822
ValThr: 5.941 ± 0.752
ValVal: 5.484 ± 0.934
ValTrp: 1.066 ± 0.251
ValTyr: 2.971 ± 0.447
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.066 ± 0.391
TrpCys: 0.076 ± 0.069
TrpAsp: 0.838 ± 0.263
TrpGlu: 0.533 ± 0.158
TrpPhe: 0.838 ± 0.27
TrpGly: 1.066 ± 0.255
TrpHis: 0.305 ± 0.19
TrpIle: 0.609 ± 0.173
TrpLys: 0.686 ± 0.272
TrpLeu: 2.057 ± 0.473
TrpMet: 0.381 ± 0.146
TrpAsn: 0.609 ± 0.243
TrpPro: 0.533 ± 0.233
TrpGln: 0.686 ± 0.196
TrpArg: 1.371 ± 0.304
TrpSer: 0.762 ± 0.196
TrpThr: 0.762 ± 0.23
TrpVal: 1.143 ± 0.198
TrpTrp: 0.305 ± 0.123
TrpTyr: 0.381 ± 0.158
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.047 ± 0.46
TyrCys: 0.457 ± 0.148
TyrAsp: 1.98 ± 0.528
TyrGlu: 2.59 ± 0.488
TyrPhe: 1.066 ± 0.284
TyrGly: 2.742 ± 0.434
TyrHis: 0.838 ± 0.235
TyrIle: 1.6 ± 0.346
TyrLys: 2.666 ± 0.425
TyrLeu: 2.59 ± 0.402
TyrMet: 0.914 ± 0.182
TyrAsn: 1.371 ± 0.285
TyrPro: 1.143 ± 0.325
TyrGln: 1.523 ± 0.344
TyrArg: 1.828 ± 0.474
TyrSer: 1.904 ± 0.312
TyrThr: 2.209 ± 0.384
TyrVal: 1.98 ± 0.322
TyrTrp: 0.229 ± 0.175
TyrTyr: 1.447 ± 0.26
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 60 proteins (13129 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
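A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names, the fixed seed, and the use of the sample standard deviation as the error are assumptions for illustration; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency across protein-level resamples.

    Assumption: the error is the sample standard deviation of the
    resampled estimates; the original method may summarize differently.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not taken from the phage proteome.
toy_proteins = ["MAAK", "AAG", "MKV"]
print(pair_frequency(toy_proteins, "AA"), bootstrap_error(toy_proteins, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so the estimate reflects protein-to-protein variation.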

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski