Amino acid dipepetide frequency for Lactococcus phage ul36

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.853AlaAla: 3.853 ± 0.68
0.538AlaCys: 0.538 ± 0.221
4.301AlaAsp: 4.301 ± 0.578
3.405AlaGlu: 3.405 ± 0.708
3.584AlaPhe: 3.584 ± 0.512
3.674AlaGly: 3.674 ± 0.646
0.806AlaHis: 0.806 ± 0.299
5.466AlaIle: 5.466 ± 0.947
4.57AlaLys: 4.57 ± 0.675
6.362AlaLeu: 6.362 ± 0.672
1.703AlaMet: 1.703 ± 0.261
4.57AlaAsn: 4.57 ± 0.668
1.434AlaPro: 1.434 ± 0.369
2.509AlaGln: 2.509 ± 0.536
1.882AlaArg: 1.882 ± 0.467
3.584AlaSer: 3.584 ± 0.516
3.495AlaThr: 3.495 ± 0.529
3.226AlaVal: 3.226 ± 0.573
0.986AlaTrp: 0.986 ± 0.289
2.688AlaTyr: 2.688 ± 0.52
0.0AlaXaa: 0.0 ± 0.0
Cys
0.09CysAla: 0.09 ± 0.08
0.0CysCys: 0.0 ± 0.0
0.627CysAsp: 0.627 ± 0.224
1.075CysGlu: 1.075 ± 0.321
0.269CysPhe: 0.269 ± 0.15
0.896CysGly: 0.896 ± 0.329
0.358CysHis: 0.358 ± 0.262
0.448CysIle: 0.448 ± 0.226
0.896CysLys: 0.896 ± 0.322
0.179CysLeu: 0.179 ± 0.149
0.179CysMet: 0.179 ± 0.12
0.269CysAsn: 0.269 ± 0.171
0.448CysPro: 0.448 ± 0.252
0.269CysGln: 0.269 ± 0.219
0.179CysArg: 0.179 ± 0.127
0.538CysSer: 0.538 ± 0.176
0.0CysThr: 0.0 ± 0.0
0.448CysVal: 0.448 ± 0.191
0.09CysTrp: 0.09 ± 0.081
0.269CysTyr: 0.269 ± 0.2
0.0CysXaa: 0.0 ± 0.0
Asp
3.315AspAla: 3.315 ± 0.541
0.448AspCys: 0.448 ± 0.227
3.763AspAsp: 3.763 ± 0.655
5.824AspGlu: 5.824 ± 0.888
3.226AspPhe: 3.226 ± 0.714
5.287AspGly: 5.287 ± 0.713
0.358AspHis: 0.358 ± 0.164
4.032AspIle: 4.032 ± 0.643
5.287AspLys: 5.287 ± 0.772
5.287AspLeu: 5.287 ± 0.543
1.613AspMet: 1.613 ± 0.365
4.122AspAsn: 4.122 ± 0.487
1.075AspPro: 1.075 ± 0.368
1.075AspGln: 1.075 ± 0.313
2.061AspArg: 2.061 ± 0.399
3.763AspSer: 3.763 ± 0.606
3.674AspThr: 3.674 ± 0.555
4.211AspVal: 4.211 ± 0.611
1.792AspTrp: 1.792 ± 0.335
3.226AspTyr: 3.226 ± 0.45
0.0AspXaa: 0.0 ± 0.0
Glu
4.301GluAla: 4.301 ± 0.637
0.627GluCys: 0.627 ± 0.25
2.509GluAsp: 2.509 ± 0.512
5.376GluGlu: 5.376 ± 0.947
3.584GluPhe: 3.584 ± 0.495
2.419GluGly: 2.419 ± 0.591
1.344GluHis: 1.344 ± 0.368
4.032GluIle: 4.032 ± 0.679
7.616GluLys: 7.616 ± 1.187
7.706GluLeu: 7.706 ± 0.998
1.971GluMet: 1.971 ± 0.41
4.122GluAsn: 4.122 ± 0.735
1.971GluPro: 1.971 ± 0.491
3.674GluGln: 3.674 ± 0.557
2.151GluArg: 2.151 ± 0.364
3.495GluSer: 3.495 ± 0.577
4.301GluThr: 4.301 ± 0.857
5.376GluVal: 5.376 ± 0.637
1.075GluTrp: 1.075 ± 0.299
2.778GluTyr: 2.778 ± 0.502
0.0GluXaa: 0.0 ± 0.0
Phe
2.688PheAla: 2.688 ± 0.466
0.358PheCys: 0.358 ± 0.172
4.122PheAsp: 4.122 ± 0.481
2.867PheGlu: 2.867 ± 0.679
1.613PhePhe: 1.613 ± 0.362
2.778PheGly: 2.778 ± 0.487
0.448PheHis: 0.448 ± 0.199
2.957PheIle: 2.957 ± 0.568
4.211PheLys: 4.211 ± 0.523
2.509PheLeu: 2.509 ± 0.511
1.613PheMet: 1.613 ± 0.414
2.151PheAsn: 2.151 ± 0.432
0.717PhePro: 0.717 ± 0.259
1.792PheGln: 1.792 ± 0.368
1.613PheArg: 1.613 ± 0.414
3.674PheSer: 3.674 ± 0.569
4.032PheThr: 4.032 ± 0.759
3.315PheVal: 3.315 ± 0.505
0.448PheTrp: 0.448 ± 0.237
2.24PheTyr: 2.24 ± 0.42
0.0PheXaa: 0.0 ± 0.0
Gly
3.226GlyAla: 3.226 ± 0.787
0.358GlyCys: 0.358 ± 0.173
3.943GlyAsp: 3.943 ± 0.745
3.315GlyGlu: 3.315 ± 0.677
3.584GlyPhe: 3.584 ± 0.574
5.466GlyGly: 5.466 ± 1.099
0.538GlyHis: 0.538 ± 0.222
6.004GlyIle: 6.004 ± 0.952
6.004GlyLys: 6.004 ± 0.674
3.495GlyLeu: 3.495 ± 0.648
2.419GlyMet: 2.419 ± 0.53
3.136GlyAsn: 3.136 ± 0.763
0.806GlyPro: 0.806 ± 0.31
3.047GlyGln: 3.047 ± 0.737
2.688GlyArg: 2.688 ± 0.485
4.122GlySer: 4.122 ± 0.556
4.839GlyThr: 4.839 ± 0.707
3.943GlyVal: 3.943 ± 0.655
1.165GlyTrp: 1.165 ± 0.313
2.778GlyTyr: 2.778 ± 0.5
0.0GlyXaa: 0.0 ± 0.0
His
0.986HisAla: 0.986 ± 0.42
0.0HisCys: 0.0 ± 0.0
1.254HisAsp: 1.254 ± 0.327
0.986HisGlu: 0.986 ± 0.371
0.627HisPhe: 0.627 ± 0.245
0.627HisGly: 0.627 ± 0.219
0.09HisHis: 0.09 ± 0.082
0.806HisIle: 0.806 ± 0.206
0.717HisLys: 0.717 ± 0.264
0.717HisLeu: 0.717 ± 0.258
0.269HisMet: 0.269 ± 0.142
0.627HisAsn: 0.627 ± 0.272
0.358HisPro: 0.358 ± 0.17
0.538HisGln: 0.538 ± 0.236
0.358HisArg: 0.358 ± 0.159
0.896HisSer: 0.896 ± 0.277
0.538HisThr: 0.538 ± 0.196
0.717HisVal: 0.717 ± 0.218
0.09HisTrp: 0.09 ± 0.08
0.717HisTyr: 0.717 ± 0.253
0.0HisXaa: 0.0 ± 0.0
Ile
4.122IleAla: 4.122 ± 0.644
1.075IleCys: 1.075 ± 0.355
3.943IleAsp: 3.943 ± 0.52
5.197IleGlu: 5.197 ± 0.71
2.599IlePhe: 2.599 ± 0.513
4.659IleGly: 4.659 ± 0.714
0.717IleHis: 0.717 ± 0.309
5.466IleIle: 5.466 ± 0.932
6.81IleLys: 6.81 ± 0.761
4.48IleLeu: 4.48 ± 0.687
1.434IleMet: 1.434 ± 0.3
4.57IleAsn: 4.57 ± 0.721
1.792IlePro: 1.792 ± 0.403
3.495IleGln: 3.495 ± 0.541
2.061IleArg: 2.061 ± 0.537
5.287IleSer: 5.287 ± 0.778
4.391IleThr: 4.391 ± 0.628
3.763IleVal: 3.763 ± 0.677
0.896IleTrp: 0.896 ± 0.242
2.33IleTyr: 2.33 ± 0.472
0.0IleXaa: 0.0 ± 0.0
Lys
6.272LysAla: 6.272 ± 1.017
0.538LysCys: 0.538 ± 0.208
5.287LysAsp: 5.287 ± 0.654
6.183LysGlu: 6.183 ± 0.786
3.584LysPhe: 3.584 ± 0.52
5.197LysGly: 5.197 ± 0.633
1.254LysHis: 1.254 ± 0.318
6.093LysIle: 6.093 ± 0.967
7.975LysLys: 7.975 ± 1.036
6.989LysLeu: 6.989 ± 0.961
3.047LysMet: 3.047 ± 0.61
6.9LysAsn: 6.9 ± 0.947
3.047LysPro: 3.047 ± 0.418
4.301LysGln: 4.301 ± 0.469
3.495LysArg: 3.495 ± 0.731
4.659LysSer: 4.659 ± 0.654
4.48LysThr: 4.48 ± 0.76
5.735LysVal: 5.735 ± 0.739
1.165LysTrp: 1.165 ± 0.327
3.136LysTyr: 3.136 ± 0.64
0.0LysXaa: 0.0 ± 0.0
Leu
4.57LeuAla: 4.57 ± 0.591
0.538LeuCys: 0.538 ± 0.239
5.466LeuAsp: 5.466 ± 0.601
5.018LeuGlu: 5.018 ± 0.651
2.419LeuPhe: 2.419 ± 0.538
5.556LeuGly: 5.556 ± 0.638
0.806LeuHis: 0.806 ± 0.278
4.749LeuIle: 4.749 ± 0.625
6.81LeuLys: 6.81 ± 0.916
6.541LeuLeu: 6.541 ± 0.899
2.151LeuMet: 2.151 ± 0.423
4.57LeuAsn: 4.57 ± 0.507
3.405LeuPro: 3.405 ± 0.462
3.674LeuGln: 3.674 ± 0.533
1.613LeuArg: 1.613 ± 0.369
6.9LeuSer: 6.9 ± 0.616
4.659LeuThr: 4.659 ± 0.658
3.584LeuVal: 3.584 ± 0.555
0.806LeuTrp: 0.806 ± 0.284
3.047LeuTyr: 3.047 ± 0.447
0.0LeuXaa: 0.0 ± 0.0
Met
2.33MetAla: 2.33 ± 0.39
0.269MetCys: 0.269 ± 0.168
2.151MetAsp: 2.151 ± 0.426
1.882MetGlu: 1.882 ± 0.505
0.717MetPhe: 0.717 ± 0.248
1.434MetGly: 1.434 ± 0.445
0.179MetHis: 0.179 ± 0.131
1.523MetIle: 1.523 ± 0.31
2.599MetLys: 2.599 ± 0.556
1.434MetLeu: 1.434 ± 0.37
0.627MetMet: 0.627 ± 0.307
1.792MetAsn: 1.792 ± 0.394
0.896MetPro: 0.896 ± 0.232
0.896MetGln: 0.896 ± 0.267
1.434MetArg: 1.434 ± 0.481
1.703MetSer: 1.703 ± 0.382
2.867MetThr: 2.867 ± 0.538
1.165MetVal: 1.165 ± 0.269
0.269MetTrp: 0.269 ± 0.16
0.538MetTyr: 0.538 ± 0.249
0.0MetXaa: 0.0 ± 0.0
Asn
3.674AsnAla: 3.674 ± 0.625
0.269AsnCys: 0.269 ± 0.133
4.301AsnAsp: 4.301 ± 0.465
3.226AsnGlu: 3.226 ± 0.559
2.419AsnPhe: 2.419 ± 0.503
6.093AsnGly: 6.093 ± 1.012
0.806AsnHis: 0.806 ± 0.405
3.405AsnIle: 3.405 ± 0.523
5.197AsnLys: 5.197 ± 0.627
4.659AsnLeu: 4.659 ± 0.686
1.882AsnMet: 1.882 ± 0.479
3.853AsnAsn: 3.853 ± 0.695
2.419AsnPro: 2.419 ± 0.382
2.509AsnGln: 2.509 ± 0.438
2.24AsnArg: 2.24 ± 0.31
3.315AsnSer: 3.315 ± 0.416
2.867AsnThr: 2.867 ± 0.483
4.211AsnVal: 4.211 ± 0.729
0.806AsnTrp: 0.806 ± 0.257
3.047AsnTyr: 3.047 ± 0.552
0.0AsnXaa: 0.0 ± 0.0
Pro
1.075ProAla: 1.075 ± 0.296
0.0ProCys: 0.0 ± 0.0
2.688ProAsp: 2.688 ± 0.438
2.151ProGlu: 2.151 ± 0.333
1.434ProPhe: 1.434 ± 0.338
0.448ProGly: 0.448 ± 0.188
0.627ProHis: 0.627 ± 0.231
1.703ProIle: 1.703 ± 0.42
2.688ProLys: 2.688 ± 0.492
2.151ProLeu: 2.151 ± 0.432
0.627ProMet: 0.627 ± 0.231
1.613ProAsn: 1.613 ± 0.471
0.896ProPro: 0.896 ± 0.214
1.523ProGln: 1.523 ± 0.439
1.165ProArg: 1.165 ± 0.371
2.599ProSer: 2.599 ± 0.583
1.971ProThr: 1.971 ± 0.354
2.061ProVal: 2.061 ± 0.463
0.448ProTrp: 0.448 ± 0.187
0.806ProTyr: 0.806 ± 0.265
0.0ProXaa: 0.0 ± 0.0
Gln
4.391GlnAla: 4.391 ± 0.637
0.358GlnCys: 0.358 ± 0.158
1.075GlnAsp: 1.075 ± 0.401
5.197GlnGlu: 5.197 ± 0.557
1.792GlnPhe: 1.792 ± 0.449
2.599GlnGly: 2.599 ± 0.557
0.448GlnHis: 0.448 ± 0.186
1.882GlnIle: 1.882 ± 0.431
2.778GlnLys: 2.778 ± 0.478
2.957GlnLeu: 2.957 ± 0.435
1.165GlnMet: 1.165 ± 0.346
2.599GlnAsn: 2.599 ± 0.532
1.792GlnPro: 1.792 ± 0.448
2.688GlnGln: 2.688 ± 0.494
1.434GlnArg: 1.434 ± 0.339
2.061GlnSer: 2.061 ± 0.458
3.226GlnThr: 3.226 ± 0.473
3.226GlnVal: 3.226 ± 0.657
0.986GlnTrp: 0.986 ± 0.319
1.613GlnTyr: 1.613 ± 0.327
0.0GlnXaa: 0.0 ± 0.0
Arg
2.24ArgAla: 2.24 ± 0.409
0.269ArgCys: 0.269 ± 0.173
1.882ArgAsp: 1.882 ± 0.39
1.971ArgGlu: 1.971 ± 0.362
1.882ArgPhe: 1.882 ± 0.384
1.523ArgGly: 1.523 ± 0.318
0.448ArgHis: 0.448 ± 0.222
3.226ArgIle: 3.226 ± 0.565
3.584ArgLys: 3.584 ± 0.677
3.495ArgLeu: 3.495 ± 0.693
0.806ArgMet: 0.806 ± 0.231
1.703ArgAsn: 1.703 ± 0.388
0.896ArgPro: 0.896 ± 0.385
1.165ArgGln: 1.165 ± 0.312
0.627ArgArg: 0.627 ± 0.209
1.703ArgSer: 1.703 ± 0.39
1.434ArgThr: 1.434 ± 0.285
2.24ArgVal: 2.24 ± 0.375
0.179ArgTrp: 0.179 ± 0.128
1.613ArgTyr: 1.613 ± 0.418
0.0ArgXaa: 0.0 ± 0.0
Ser
3.763SerAla: 3.763 ± 0.525
0.538SerCys: 0.538 ± 0.233
4.391SerAsp: 4.391 ± 0.491
4.928SerGlu: 4.928 ± 0.696
3.584SerPhe: 3.584 ± 0.51
5.108SerGly: 5.108 ± 0.901
0.717SerHis: 0.717 ± 0.24
4.749SerIle: 4.749 ± 0.662
4.659SerLys: 4.659 ± 0.723
4.749SerLeu: 4.749 ± 0.552
0.986SerMet: 0.986 ± 0.264
4.48SerAsn: 4.48 ± 0.592
0.896SerPro: 0.896 ± 0.285
3.315SerGln: 3.315 ± 0.525
1.434SerArg: 1.434 ± 0.293
5.108SerSer: 5.108 ± 1.039
3.405SerThr: 3.405 ± 0.531
4.749SerVal: 4.749 ± 0.571
1.165SerTrp: 1.165 ± 0.303
2.867SerTyr: 2.867 ± 0.456
0.0SerXaa: 0.0 ± 0.0
Thr
4.48ThrAla: 4.48 ± 0.66
0.448ThrCys: 0.448 ± 0.249
3.674ThrAsp: 3.674 ± 0.607
4.122ThrGlu: 4.122 ± 0.621
3.136ThrPhe: 3.136 ± 0.531
5.108ThrGly: 5.108 ± 0.738
0.627ThrHis: 0.627 ± 0.262
3.853ThrIle: 3.853 ± 0.565
5.287ThrLys: 5.287 ± 0.759
4.57ThrLeu: 4.57 ± 0.593
1.344ThrMet: 1.344 ± 0.435
3.674ThrAsn: 3.674 ± 0.525
2.778ThrPro: 2.778 ± 0.53
1.703ThrGln: 1.703 ± 0.311
1.882ThrArg: 1.882 ± 0.338
4.211ThrSer: 4.211 ± 0.624
4.122ThrThr: 4.122 ± 0.541
3.943ThrVal: 3.943 ± 0.496
0.448ThrTrp: 0.448 ± 0.159
1.971ThrTyr: 1.971 ± 0.474
0.0ThrXaa: 0.0 ± 0.0
Val
4.122ValAla: 4.122 ± 0.877
0.358ValCys: 0.358 ± 0.158
4.659ValAsp: 4.659 ± 0.624
4.659ValGlu: 4.659 ± 0.583
3.315ValPhe: 3.315 ± 0.463
2.867ValGly: 2.867 ± 0.525
0.538ValHis: 0.538 ± 0.228
4.839ValIle: 4.839 ± 0.676
6.989ValLys: 6.989 ± 0.84
4.659ValLeu: 4.659 ± 0.666
1.344ValMet: 1.344 ± 0.31
3.405ValAsn: 3.405 ± 0.558
1.434ValPro: 1.434 ± 0.321
3.226ValGln: 3.226 ± 0.576
1.613ValArg: 1.613 ± 0.372
4.211ValSer: 4.211 ± 0.64
3.674ValThr: 3.674 ± 0.554
3.674ValVal: 3.674 ± 0.509
0.986ValTrp: 0.986 ± 0.266
1.882ValTyr: 1.882 ± 0.365
0.0ValXaa: 0.0 ± 0.0
Trp
0.986TrpAla: 0.986 ± 0.275
0.09TrpCys: 0.09 ± 0.095
0.717TrpAsp: 0.717 ± 0.255
0.896TrpGlu: 0.896 ± 0.274
0.627TrpPhe: 0.627 ± 0.215
0.627TrpGly: 0.627 ± 0.207
0.179TrpHis: 0.179 ± 0.103
1.434TrpIle: 1.434 ± 0.277
1.165TrpLys: 1.165 ± 0.339
1.165TrpLeu: 1.165 ± 0.305
0.179TrpMet: 0.179 ± 0.125
0.896TrpAsn: 0.896 ± 0.24
0.269TrpPro: 0.269 ± 0.136
1.075TrpGln: 1.075 ± 0.371
0.627TrpArg: 0.627 ± 0.205
0.896TrpSer: 0.896 ± 0.29
0.806TrpThr: 0.806 ± 0.376
1.165TrpVal: 1.165 ± 0.302
0.269TrpTrp: 0.269 ± 0.161
0.717TrpTyr: 0.717 ± 0.304
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.151TyrAla: 2.151 ± 0.298
0.448TyrCys: 0.448 ± 0.193
2.688TyrAsp: 2.688 ± 0.506
2.151TyrGlu: 2.151 ± 0.437
2.151TyrPhe: 2.151 ± 0.527
2.509TyrGly: 2.509 ± 0.636
0.627TyrHis: 0.627 ± 0.247
2.419TyrIle: 2.419 ± 0.53
3.495TyrLys: 3.495 ± 0.629
2.778TyrLeu: 2.778 ± 0.543
1.254TyrMet: 1.254 ± 0.354
2.24TyrAsn: 2.24 ± 0.442
1.344TyrPro: 1.344 ± 0.331
1.792TyrGln: 1.792 ± 0.378
2.33TyrArg: 2.33 ± 0.538
2.867TyrSer: 2.867 ± 0.493
2.599TyrThr: 2.599 ± 0.478
1.703TyrVal: 1.703 ± 0.348
0.717TyrTrp: 0.717 ± 0.255
1.523TyrTyr: 1.523 ± 0.333
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (11161 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski