Amino acid dipepetide frequency for Salmonella phage SEN8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.102AlaAla: 12.102 ± 2.454
1.345AlaCys: 1.345 ± 0.394
6.006AlaAsp: 6.006 ± 0.756
5.917AlaGlu: 5.917 ± 0.717
3.048AlaPhe: 3.048 ± 0.469
6.365AlaGly: 6.365 ± 0.881
1.165AlaHis: 1.165 ± 0.319
4.213AlaIle: 4.213 ± 0.553
5.11AlaLys: 5.11 ± 0.594
9.682AlaLeu: 9.682 ± 1.262
2.51AlaMet: 2.51 ± 0.587
2.779AlaAsn: 2.779 ± 0.466
3.586AlaPro: 3.586 ± 0.622
3.496AlaGln: 3.496 ± 0.907
5.379AlaArg: 5.379 ± 0.893
6.544AlaSer: 6.544 ± 0.642
5.11AlaThr: 5.11 ± 0.755
6.813AlaVal: 6.813 ± 0.79
1.345AlaTrp: 1.345 ± 0.339
2.869AlaTyr: 2.869 ± 0.511
0.0AlaXaa: 0.0 ± 0.0
Cys
1.434CysAla: 1.434 ± 0.351
0.09CysCys: 0.09 ± 0.082
0.538CysAsp: 0.538 ± 0.204
0.448CysGlu: 0.448 ± 0.175
0.359CysPhe: 0.359 ± 0.201
0.717CysGly: 0.717 ± 0.257
0.448CysHis: 0.448 ± 0.195
0.359CysIle: 0.359 ± 0.173
0.448CysLys: 0.448 ± 0.208
0.986CysLeu: 0.986 ± 0.336
0.359CysMet: 0.359 ± 0.172
0.359CysAsn: 0.359 ± 0.159
0.896CysPro: 0.896 ± 0.332
0.448CysGln: 0.448 ± 0.181
0.807CysArg: 0.807 ± 0.293
0.717CysSer: 0.717 ± 0.342
0.538CysThr: 0.538 ± 0.201
0.896CysVal: 0.896 ± 0.265
0.359CysTrp: 0.359 ± 0.197
0.359CysTyr: 0.359 ± 0.203
0.0CysXaa: 0.0 ± 0.0
Asp
5.379AspAla: 5.379 ± 0.741
0.717AspCys: 0.717 ± 0.275
3.675AspAsp: 3.675 ± 0.489
4.303AspGlu: 4.303 ± 0.466
1.883AspPhe: 1.883 ± 0.458
4.751AspGly: 4.751 ± 0.557
1.076AspHis: 1.076 ± 0.303
3.855AspIle: 3.855 ± 0.542
2.241AspLys: 2.241 ± 0.425
5.289AspLeu: 5.289 ± 0.901
1.255AspMet: 1.255 ± 0.354
2.6AspAsn: 2.6 ± 0.551
1.883AspPro: 1.883 ± 0.468
1.972AspGln: 1.972 ± 0.346
2.51AspArg: 2.51 ± 0.482
3.407AspSer: 3.407 ± 0.525
4.034AspThr: 4.034 ± 0.612
3.765AspVal: 3.765 ± 0.643
1.076AspTrp: 1.076 ± 0.32
2.241AspTyr: 2.241 ± 0.325
0.0AspXaa: 0.0 ± 0.0
Glu
4.841GluAla: 4.841 ± 0.558
0.448GluCys: 0.448 ± 0.224
3.138GluAsp: 3.138 ± 0.617
2.6GluGlu: 2.6 ± 0.533
1.524GluPhe: 1.524 ± 0.48
3.496GluGly: 3.496 ± 0.541
0.628GluHis: 0.628 ± 0.208
4.393GluIle: 4.393 ± 0.616
2.958GluLys: 2.958 ± 0.428
7.889GluLeu: 7.889 ± 0.795
1.883GluMet: 1.883 ± 0.389
2.689GluAsn: 2.689 ± 0.46
2.869GluPro: 2.869 ± 0.485
4.662GluGln: 4.662 ± 0.849
4.213GluArg: 4.213 ± 0.622
3.317GluSer: 3.317 ± 0.51
3.227GluThr: 3.227 ± 0.511
4.303GluVal: 4.303 ± 0.599
1.255GluTrp: 1.255 ± 0.31
1.703GluTyr: 1.703 ± 0.44
0.0GluXaa: 0.0 ± 0.0
Phe
2.869PheAla: 2.869 ± 0.556
0.359PheCys: 0.359 ± 0.178
1.345PheAsp: 1.345 ± 0.365
2.6PheGlu: 2.6 ± 0.427
1.434PhePhe: 1.434 ± 0.4
1.883PheGly: 1.883 ± 0.409
0.896PheHis: 0.896 ± 0.253
1.255PheIle: 1.255 ± 0.358
1.793PheLys: 1.793 ± 0.412
2.241PheLeu: 2.241 ± 0.498
0.717PheMet: 0.717 ± 0.29
1.793PheAsn: 1.793 ± 0.457
1.255PhePro: 1.255 ± 0.301
0.896PheGln: 0.896 ± 0.266
2.51PheArg: 2.51 ± 0.414
2.779PheSer: 2.779 ± 0.463
1.793PheThr: 1.793 ± 0.461
1.793PheVal: 1.793 ± 0.438
0.538PheTrp: 0.538 ± 0.272
0.807PheTyr: 0.807 ± 0.262
0.0PheXaa: 0.0 ± 0.0
Gly
6.096GlyAla: 6.096 ± 0.929
1.076GlyCys: 1.076 ± 0.294
4.393GlyAsp: 4.393 ± 0.628
3.586GlyGlu: 3.586 ± 0.683
2.241GlyPhe: 2.241 ± 0.485
5.379GlyGly: 5.379 ± 0.918
1.165GlyHis: 1.165 ± 0.452
3.496GlyIle: 3.496 ± 0.53
3.855GlyLys: 3.855 ± 0.697
6.455GlyLeu: 6.455 ± 0.7
1.614GlyMet: 1.614 ± 0.406
3.317GlyAsn: 3.317 ± 0.702
1.972GlyPro: 1.972 ± 0.321
1.972GlyGln: 1.972 ± 0.357
4.124GlyArg: 4.124 ± 0.563
3.586GlySer: 3.586 ± 0.573
3.227GlyThr: 3.227 ± 0.712
5.558GlyVal: 5.558 ± 0.796
1.793GlyTrp: 1.793 ± 0.305
1.434GlyTyr: 1.434 ± 0.344
0.0GlyXaa: 0.0 ± 0.0
His
1.165HisAla: 1.165 ± 0.481
0.359HisCys: 0.359 ± 0.186
0.717HisAsp: 0.717 ± 0.28
0.896HisGlu: 0.896 ± 0.314
0.896HisPhe: 0.896 ± 0.308
1.255HisGly: 1.255 ± 0.354
0.717HisHis: 0.717 ± 0.233
0.896HisIle: 0.896 ± 0.231
1.434HisLys: 1.434 ± 0.321
1.703HisLeu: 1.703 ± 0.351
0.896HisMet: 0.896 ± 0.258
0.986HisAsn: 0.986 ± 0.275
0.807HisPro: 0.807 ± 0.279
1.076HisGln: 1.076 ± 0.298
1.165HisArg: 1.165 ± 0.316
1.255HisSer: 1.255 ± 0.397
1.165HisThr: 1.165 ± 0.317
0.896HisVal: 0.896 ± 0.247
0.359HisTrp: 0.359 ± 0.165
0.717HisTyr: 0.717 ± 0.264
0.0HisXaa: 0.0 ± 0.0
Ile
4.124IleAla: 4.124 ± 0.668
0.448IleCys: 0.448 ± 0.198
3.944IleAsp: 3.944 ± 0.605
2.869IleGlu: 2.869 ± 0.523
1.524IlePhe: 1.524 ± 0.31
2.869IleGly: 2.869 ± 0.656
1.255IleHis: 1.255 ± 0.346
2.331IleIle: 2.331 ± 0.388
2.689IleLys: 2.689 ± 0.606
2.062IleLeu: 2.062 ± 0.436
1.972IleMet: 1.972 ± 0.432
3.496IleAsn: 3.496 ± 0.52
2.331IlePro: 2.331 ± 0.566
2.152IleGln: 2.152 ± 0.459
2.869IleArg: 2.869 ± 0.39
4.393IleSer: 4.393 ± 0.698
4.124IleThr: 4.124 ± 0.641
2.779IleVal: 2.779 ± 0.448
0.717IleTrp: 0.717 ± 0.214
1.076IleTyr: 1.076 ± 0.329
0.0IleXaa: 0.0 ± 0.0
Lys
5.379LysAla: 5.379 ± 0.719
0.717LysCys: 0.717 ± 0.309
2.152LysAsp: 2.152 ± 0.402
3.138LysGlu: 3.138 ± 0.601
1.703LysPhe: 1.703 ± 0.435
2.689LysGly: 2.689 ± 0.507
0.896LysHis: 0.896 ± 0.226
2.51LysIle: 2.51 ± 0.425
3.855LysLys: 3.855 ± 0.549
5.379LysLeu: 5.379 ± 0.658
0.717LysMet: 0.717 ± 0.29
2.42LysAsn: 2.42 ± 0.591
2.958LysPro: 2.958 ± 0.524
2.241LysGln: 2.241 ± 0.402
4.213LysArg: 4.213 ± 0.63
2.869LysSer: 2.869 ± 0.498
2.958LysThr: 2.958 ± 0.492
2.689LysVal: 2.689 ± 0.46
0.986LysTrp: 0.986 ± 0.333
1.614LysTyr: 1.614 ± 0.367
0.0LysXaa: 0.0 ± 0.0
Leu
9.323LeuAla: 9.323 ± 1.044
1.345LeuCys: 1.345 ± 0.366
5.558LeuAsp: 5.558 ± 0.65
5.827LeuGlu: 5.827 ± 0.636
3.048LeuPhe: 3.048 ± 0.507
6.455LeuGly: 6.455 ± 0.904
1.524LeuHis: 1.524 ± 0.339
5.02LeuIle: 5.02 ± 0.69
4.303LeuLys: 4.303 ± 0.499
9.323LeuLeu: 9.323 ± 0.952
2.779LeuMet: 2.779 ± 0.528
4.751LeuAsn: 4.751 ± 0.547
4.482LeuPro: 4.482 ± 0.58
4.124LeuGln: 4.124 ± 0.611
6.455LeuArg: 6.455 ± 0.859
6.544LeuSer: 6.544 ± 0.716
7.62LeuThr: 7.62 ± 0.834
5.737LeuVal: 5.737 ± 0.871
1.076LeuTrp: 1.076 ± 0.283
3.317LeuTyr: 3.317 ± 0.612
0.0LeuXaa: 0.0 ± 0.0
Met
3.944MetAla: 3.944 ± 0.629
0.269MetCys: 0.269 ± 0.151
1.434MetAsp: 1.434 ± 0.34
1.524MetGlu: 1.524 ± 0.293
0.538MetPhe: 0.538 ± 0.314
1.793MetGly: 1.793 ± 0.634
0.448MetHis: 0.448 ± 0.167
0.717MetIle: 0.717 ± 0.207
1.165MetLys: 1.165 ± 0.265
1.972MetLeu: 1.972 ± 0.442
1.076MetMet: 1.076 ± 0.369
1.524MetAsn: 1.524 ± 0.309
1.076MetPro: 1.076 ± 0.347
1.076MetGln: 1.076 ± 0.386
1.434MetArg: 1.434 ± 0.374
1.972MetSer: 1.972 ± 0.449
1.165MetThr: 1.165 ± 0.314
1.883MetVal: 1.883 ± 0.345
0.179MetTrp: 0.179 ± 0.134
0.717MetTyr: 0.717 ± 0.27
0.0MetXaa: 0.0 ± 0.0
Asn
3.586AsnAla: 3.586 ± 0.673
0.448AsnCys: 0.448 ± 0.174
2.689AsnAsp: 2.689 ± 0.523
2.51AsnGlu: 2.51 ± 0.434
1.703AsnPhe: 1.703 ± 0.443
5.02AsnGly: 5.02 ± 0.866
1.076AsnHis: 1.076 ± 0.44
2.689AsnIle: 2.689 ± 0.515
2.241AsnLys: 2.241 ± 0.465
4.303AsnLeu: 4.303 ± 0.568
0.807AsnMet: 0.807 ± 0.263
1.972AsnAsn: 1.972 ± 0.391
3.407AsnPro: 3.407 ± 0.626
1.703AsnGln: 1.703 ± 0.337
2.869AsnArg: 2.869 ± 0.529
2.958AsnSer: 2.958 ± 0.44
2.152AsnThr: 2.152 ± 0.487
2.6AsnVal: 2.6 ± 0.522
0.717AsnTrp: 0.717 ± 0.306
0.717AsnTyr: 0.717 ± 0.194
0.0AsnXaa: 0.0 ± 0.0
Pro
4.751ProAla: 4.751 ± 0.663
0.538ProCys: 0.538 ± 0.229
2.779ProAsp: 2.779 ± 0.642
3.317ProGlu: 3.317 ± 0.523
1.255ProPhe: 1.255 ± 0.345
2.689ProGly: 2.689 ± 0.434
0.986ProHis: 0.986 ± 0.347
1.345ProIle: 1.345 ± 0.373
1.883ProLys: 1.883 ± 0.367
4.124ProLeu: 4.124 ± 0.505
0.628ProMet: 0.628 ± 0.186
1.703ProAsn: 1.703 ± 0.513
1.614ProPro: 1.614 ± 0.327
1.076ProGln: 1.076 ± 0.343
2.152ProArg: 2.152 ± 0.485
2.779ProSer: 2.779 ± 0.523
2.689ProThr: 2.689 ± 0.58
3.765ProVal: 3.765 ± 0.881
0.448ProTrp: 0.448 ± 0.188
1.614ProTyr: 1.614 ± 0.504
0.0ProXaa: 0.0 ± 0.0
Gln
3.675GlnAla: 3.675 ± 0.564
0.628GlnCys: 0.628 ± 0.226
2.958GlnAsp: 2.958 ± 0.525
1.614GlnGlu: 1.614 ± 0.424
0.896GlnPhe: 0.896 ± 0.301
2.51GlnGly: 2.51 ± 0.456
0.448GlnHis: 0.448 ± 0.236
2.152GlnIle: 2.152 ± 0.484
2.331GlnLys: 2.331 ± 0.417
4.393GlnLeu: 4.393 ± 0.576
1.524GlnMet: 1.524 ± 0.406
1.793GlnAsn: 1.793 ± 0.36
1.614GlnPro: 1.614 ± 0.382
2.331GlnGln: 2.331 ± 0.557
3.944GlnArg: 3.944 ± 0.6
2.6GlnSer: 2.6 ± 0.524
2.51GlnThr: 2.51 ± 0.494
1.703GlnVal: 1.703 ± 0.383
1.255GlnTrp: 1.255 ± 0.241
1.614GlnTyr: 1.614 ± 0.388
0.0GlnXaa: 0.0 ± 0.0
Arg
4.482ArgAla: 4.482 ± 0.639
0.628ArgCys: 0.628 ± 0.194
2.958ArgAsp: 2.958 ± 0.455
5.02ArgGlu: 5.02 ± 0.69
1.972ArgPhe: 1.972 ± 0.422
3.227ArgGly: 3.227 ± 0.657
1.972ArgHis: 1.972 ± 0.591
3.675ArgIle: 3.675 ± 0.48
3.407ArgLys: 3.407 ± 0.89
7.172ArgLeu: 7.172 ± 0.861
1.524ArgMet: 1.524 ± 0.31
2.958ArgAsn: 2.958 ± 0.613
1.793ArgPro: 1.793 ± 0.351
4.034ArgGln: 4.034 ± 0.602
5.558ArgArg: 5.558 ± 0.731
3.944ArgSer: 3.944 ± 0.605
2.958ArgThr: 2.958 ± 0.481
4.931ArgVal: 4.931 ± 0.741
1.345ArgTrp: 1.345 ± 0.321
1.614ArgTyr: 1.614 ± 0.325
0.0ArgXaa: 0.0 ± 0.0
Ser
7.082SerAla: 7.082 ± 0.891
0.628SerCys: 0.628 ± 0.209
3.317SerAsp: 3.317 ± 0.486
3.496SerGlu: 3.496 ± 0.593
1.972SerPhe: 1.972 ± 0.528
4.482SerGly: 4.482 ± 0.64
1.434SerHis: 1.434 ± 0.425
3.317SerIle: 3.317 ± 0.481
2.689SerLys: 2.689 ± 0.446
6.903SerLeu: 6.903 ± 0.886
1.793SerMet: 1.793 ± 0.458
2.869SerAsn: 2.869 ± 0.412
2.331SerPro: 2.331 ± 0.43
2.062SerGln: 2.062 ± 0.452
3.675SerArg: 3.675 ± 0.455
2.779SerSer: 2.779 ± 0.383
3.586SerThr: 3.586 ± 0.534
4.841SerVal: 4.841 ± 0.646
1.165SerTrp: 1.165 ± 0.291
1.524SerTyr: 1.524 ± 0.344
0.0SerXaa: 0.0 ± 0.0
Thr
5.558ThrAla: 5.558 ± 0.623
0.448ThrCys: 0.448 ± 0.239
3.855ThrAsp: 3.855 ± 0.695
3.944ThrGlu: 3.944 ± 0.68
1.614ThrPhe: 1.614 ± 0.401
4.841ThrGly: 4.841 ± 0.717
1.076ThrHis: 1.076 ± 0.354
2.331ThrIle: 2.331 ± 0.513
2.689ThrLys: 2.689 ± 0.629
6.455ThrLeu: 6.455 ± 0.766
1.434ThrMet: 1.434 ± 0.34
2.152ThrAsn: 2.152 ± 0.441
3.048ThrPro: 3.048 ± 0.492
1.703ThrGln: 1.703 ± 0.337
3.675ThrArg: 3.675 ± 0.607
3.407ThrSer: 3.407 ± 0.509
4.303ThrThr: 4.303 ± 0.815
4.303ThrVal: 4.303 ± 0.611
0.448ThrTrp: 0.448 ± 0.196
1.524ThrTyr: 1.524 ± 0.323
0.0ThrXaa: 0.0 ± 0.0
Val
6.096ValAla: 6.096 ± 1.087
0.538ValCys: 0.538 ± 0.204
3.855ValAsp: 3.855 ± 0.786
5.199ValGlu: 5.199 ± 0.841
2.062ValPhe: 2.062 ± 0.413
3.317ValGly: 3.317 ± 0.538
0.986ValHis: 0.986 ± 0.255
3.496ValIle: 3.496 ± 0.59
4.841ValLys: 4.841 ± 0.681
6.992ValLeu: 6.992 ± 0.739
1.972ValMet: 1.972 ± 0.468
4.034ValAsn: 4.034 ± 0.602
2.42ValPro: 2.42 ± 0.492
3.138ValGln: 3.138 ± 0.544
3.407ValArg: 3.407 ± 0.442
4.034ValSer: 4.034 ± 0.632
3.407ValThr: 3.407 ± 0.534
4.572ValVal: 4.572 ± 0.713
1.076ValTrp: 1.076 ± 0.359
1.255ValTyr: 1.255 ± 0.354
0.0ValXaa: 0.0 ± 0.0
Trp
1.434TrpAla: 1.434 ± 0.351
0.179TrpCys: 0.179 ± 0.107
0.448TrpAsp: 0.448 ± 0.197
1.076TrpGlu: 1.076 ± 0.251
0.628TrpPhe: 0.628 ± 0.206
0.538TrpGly: 0.538 ± 0.184
0.717TrpHis: 0.717 ± 0.242
0.807TrpIle: 0.807 ± 0.223
1.076TrpLys: 1.076 ± 0.245
2.42TrpLeu: 2.42 ± 0.487
0.179TrpMet: 0.179 ± 0.12
0.538TrpAsn: 0.538 ± 0.248
0.807TrpPro: 0.807 ± 0.311
0.807TrpGln: 0.807 ± 0.23
1.524TrpArg: 1.524 ± 0.47
0.628TrpSer: 0.628 ± 0.245
0.807TrpThr: 0.807 ± 0.216
0.986TrpVal: 0.986 ± 0.308
0.538TrpTrp: 0.538 ± 0.197
1.076TrpTyr: 1.076 ± 0.366
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.062TyrAla: 2.062 ± 0.353
0.359TyrCys: 0.359 ± 0.185
2.152TyrAsp: 2.152 ± 0.471
2.152TyrGlu: 2.152 ± 0.385
1.255TyrPhe: 1.255 ± 0.481
2.062TyrGly: 2.062 ± 0.416
0.448TyrHis: 0.448 ± 0.196
1.165TyrIle: 1.165 ± 0.356
1.076TyrLys: 1.076 ± 0.267
2.869TyrLeu: 2.869 ± 0.552
0.179TyrMet: 0.179 ± 0.134
1.434TyrAsn: 1.434 ± 0.326
0.986TyrPro: 0.986 ± 0.278
1.524TyrGln: 1.524 ± 0.427
2.689TyrArg: 2.689 ± 0.462
1.434TyrSer: 1.434 ± 0.361
1.434TyrThr: 1.434 ± 0.386
1.972TyrVal: 1.972 ± 0.479
0.538TyrTrp: 0.538 ± 0.209
0.807TyrTyr: 0.807 ± 0.261
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (11156 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski