Amino acid dipepetide frequency for Lactococcus phage 38503

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.048AlaAla: 1.048 ± 0.343
0.233AlaCys: 0.233 ± 0.173
3.027AlaAsp: 3.027 ± 0.517
4.308AlaGlu: 4.308 ± 1.07
3.027AlaPhe: 3.027 ± 0.535
5.239AlaGly: 5.239 ± 1.407
0.466AlaHis: 0.466 ± 0.234
5.356AlaIle: 5.356 ± 1.12
7.102AlaLys: 7.102 ± 0.993
6.171AlaLeu: 6.171 ± 1.09
2.445AlaMet: 2.445 ± 0.498
4.308AlaAsn: 4.308 ± 0.713
1.048AlaPro: 1.048 ± 0.444
2.212AlaGln: 2.212 ± 0.62
1.979AlaArg: 1.979 ± 0.335
3.144AlaSer: 3.144 ± 0.617
1.979AlaThr: 1.979 ± 0.385
4.191AlaVal: 4.191 ± 1.072
1.979AlaTrp: 1.979 ± 0.687
2.096AlaTyr: 2.096 ± 0.448
0.0AlaXaa: 0.0 ± 0.0
Cys
0.466CysAla: 0.466 ± 0.203
0.116CysCys: 0.116 ± 0.127
0.349CysAsp: 0.349 ± 0.171
0.233CysGlu: 0.233 ± 0.166
0.233CysPhe: 0.233 ± 0.157
0.815CysGly: 0.815 ± 0.324
0.116CysHis: 0.116 ± 0.109
0.815CysIle: 0.815 ± 0.273
1.048CysLys: 1.048 ± 0.374
0.466CysLeu: 0.466 ± 0.293
0.116CysMet: 0.116 ± 0.119
0.349CysAsn: 0.349 ± 0.213
0.233CysPro: 0.233 ± 0.173
0.466CysGln: 0.466 ± 0.235
0.815CysArg: 0.815 ± 0.281
0.466CysSer: 0.466 ± 0.232
0.233CysThr: 0.233 ± 0.176
0.349CysVal: 0.349 ± 0.22
0.116CysTrp: 0.116 ± 0.117
0.466CysTyr: 0.466 ± 0.28
0.0CysXaa: 0.0 ± 0.0
Asp
1.746AspAla: 1.746 ± 0.463
0.116AspCys: 0.116 ± 0.142
3.609AspAsp: 3.609 ± 0.913
4.657AspGlu: 4.657 ± 0.85
4.308AspPhe: 4.308 ± 0.606
3.376AspGly: 3.376 ± 0.66
0.582AspHis: 0.582 ± 0.312
4.075AspIle: 4.075 ± 0.702
5.356AspLys: 5.356 ± 0.675
5.705AspLeu: 5.705 ± 1.041
1.048AspMet: 1.048 ± 0.322
3.376AspAsn: 3.376 ± 0.645
1.63AspPro: 1.63 ± 0.431
0.466AspGln: 0.466 ± 0.29
2.096AspArg: 2.096 ± 0.437
3.609AspSer: 3.609 ± 0.736
4.424AspThr: 4.424 ± 0.738
3.493AspVal: 3.493 ± 0.751
1.281AspTrp: 1.281 ± 0.457
2.794AspTyr: 2.794 ± 0.546
0.0AspXaa: 0.0 ± 0.0
Glu
3.26GluAla: 3.26 ± 0.641
0.699GluCys: 0.699 ± 0.278
3.027GluAsp: 3.027 ± 0.658
5.938GluGlu: 5.938 ± 1.154
4.075GluPhe: 4.075 ± 0.581
2.212GluGly: 2.212 ± 0.42
0.815GluHis: 0.815 ± 0.428
5.589GluIle: 5.589 ± 0.909
6.986GluLys: 6.986 ± 1.256
9.314GluLeu: 9.314 ± 1.505
3.144GluMet: 3.144 ± 0.645
4.774GluAsn: 4.774 ± 0.787
1.397GluPro: 1.397 ± 0.425
3.726GluGln: 3.726 ± 0.841
3.376GluArg: 3.376 ± 0.654
3.726GluSer: 3.726 ± 0.536
4.541GluThr: 4.541 ± 0.606
4.424GluVal: 4.424 ± 0.839
1.281GluTrp: 1.281 ± 0.347
3.027GluTyr: 3.027 ± 0.664
0.0GluXaa: 0.0 ± 0.0
Phe
3.26PheAla: 3.26 ± 0.771
0.466PheCys: 0.466 ± 0.263
3.027PheAsp: 3.027 ± 0.665
3.027PheGlu: 3.027 ± 0.719
2.445PhePhe: 2.445 ± 0.693
2.445PheGly: 2.445 ± 0.569
0.233PheHis: 0.233 ± 0.149
3.609PheIle: 3.609 ± 0.808
4.541PheLys: 4.541 ± 0.616
2.561PheLeu: 2.561 ± 0.483
1.164PheMet: 1.164 ± 0.381
3.842PheAsn: 3.842 ± 0.788
0.582PhePro: 0.582 ± 0.283
1.281PheGln: 1.281 ± 0.416
0.931PheArg: 0.931 ± 0.352
4.774PheSer: 4.774 ± 1.201
2.445PheThr: 2.445 ± 0.548
2.096PheVal: 2.096 ± 0.342
0.233PheTrp: 0.233 ± 0.156
1.979PheTyr: 1.979 ± 0.446
0.0PheXaa: 0.0 ± 0.0
Gly
3.726GlyAla: 3.726 ± 1.208
0.466GlyCys: 0.466 ± 0.208
3.26GlyAsp: 3.26 ± 0.629
4.308GlyGlu: 4.308 ± 0.687
2.096GlyPhe: 2.096 ± 0.613
4.541GlyGly: 4.541 ± 1.064
1.048GlyHis: 1.048 ± 0.299
4.308GlyIle: 4.308 ± 1.618
6.986GlyLys: 6.986 ± 1.061
6.054GlyLeu: 6.054 ± 1.297
1.63GlyMet: 1.63 ± 0.496
2.911GlyAsn: 2.911 ± 0.574
0.116GlyPro: 0.116 ± 0.109
2.096GlyGln: 2.096 ± 0.566
1.863GlyArg: 1.863 ± 0.396
4.191GlySer: 4.191 ± 0.938
2.678GlyThr: 2.678 ± 0.883
5.938GlyVal: 5.938 ± 1.528
1.281GlyTrp: 1.281 ± 0.372
3.144GlyTyr: 3.144 ± 0.573
0.0GlyXaa: 0.0 ± 0.0
His
0.931HisAla: 0.931 ± 0.3
0.582HisCys: 0.582 ± 0.296
0.582HisAsp: 0.582 ± 0.311
0.931HisGlu: 0.931 ± 0.366
0.466HisPhe: 0.466 ± 0.235
1.281HisGly: 1.281 ± 0.377
0.116HisHis: 0.116 ± 0.123
0.466HisIle: 0.466 ± 0.241
0.699HisLys: 0.699 ± 0.317
1.164HisLeu: 1.164 ± 0.452
0.233HisMet: 0.233 ± 0.246
1.281HisAsn: 1.281 ± 0.475
0.233HisPro: 0.233 ± 0.163
0.349HisGln: 0.349 ± 0.225
0.233HisArg: 0.233 ± 0.168
0.349HisSer: 0.349 ± 0.212
0.699HisThr: 0.699 ± 0.278
1.164HisVal: 1.164 ± 0.445
0.0HisTrp: 0.0 ± 0.0
0.699HisTyr: 0.699 ± 0.303
0.0HisXaa: 0.0 ± 0.0
Ile
3.842IleAla: 3.842 ± 0.709
0.116IleCys: 0.116 ± 0.11
5.006IleAsp: 5.006 ± 0.697
7.451IleGlu: 7.451 ± 1.088
2.794IlePhe: 2.794 ± 0.547
3.609IleGly: 3.609 ± 0.947
1.281IleHis: 1.281 ± 0.4
5.472IleIle: 5.472 ± 0.83
6.054IleLys: 6.054 ± 0.814
5.006IleLeu: 5.006 ± 1.072
1.281IleMet: 1.281 ± 0.35
4.657IleAsn: 4.657 ± 0.664
1.746IlePro: 1.746 ± 0.45
2.096IleGln: 2.096 ± 0.466
2.212IleArg: 2.212 ± 0.508
4.657IleSer: 4.657 ± 0.861
5.006IleThr: 5.006 ± 0.84
4.541IleVal: 4.541 ± 0.703
1.281IleTrp: 1.281 ± 0.485
2.212IleTyr: 2.212 ± 0.393
0.0IleXaa: 0.0 ± 0.0
Lys
7.801LysAla: 7.801 ± 1.297
1.397LysCys: 1.397 ± 0.672
4.541LysAsp: 4.541 ± 0.799
8.849LysGlu: 8.849 ± 1.462
2.096LysPhe: 2.096 ± 0.429
6.404LysGly: 6.404 ± 1.139
1.514LysHis: 1.514 ± 0.535
5.589LysIle: 5.589 ± 0.772
9.78LysLys: 9.78 ± 1.252
8.732LysLeu: 8.732 ± 0.967
2.911LysMet: 2.911 ± 0.506
4.191LysAsn: 4.191 ± 0.759
1.63LysPro: 1.63 ± 0.566
3.842LysGln: 3.842 ± 0.686
3.842LysArg: 3.842 ± 0.696
5.472LysSer: 5.472 ± 0.785
5.356LysThr: 5.356 ± 0.692
5.938LysVal: 5.938 ± 0.775
1.514LysTrp: 1.514 ± 0.432
4.308LysTyr: 4.308 ± 0.897
0.0LysXaa: 0.0 ± 0.0
Leu
5.821LeuAla: 5.821 ± 0.722
0.116LeuCys: 0.116 ± 0.109
5.006LeuAsp: 5.006 ± 0.725
5.589LeuGlu: 5.589 ± 0.848
3.959LeuPhe: 3.959 ± 0.674
4.191LeuGly: 4.191 ± 0.917
1.048LeuHis: 1.048 ± 0.383
7.451LeuIle: 7.451 ± 1.406
8.965LeuLys: 8.965 ± 1.126
6.054LeuLeu: 6.054 ± 1.135
1.863LeuMet: 1.863 ± 0.604
5.472LeuAsn: 5.472 ± 0.842
3.027LeuPro: 3.027 ± 0.569
3.027LeuGln: 3.027 ± 0.482
2.678LeuArg: 2.678 ± 0.578
6.171LeuSer: 6.171 ± 0.809
5.705LeuThr: 5.705 ± 0.833
5.472LeuVal: 5.472 ± 0.717
1.048LeuTrp: 1.048 ± 0.324
4.541LeuTyr: 4.541 ± 0.746
0.0LeuXaa: 0.0 ± 0.0
Met
2.794MetAla: 2.794 ± 0.495
0.116MetCys: 0.116 ± 0.105
1.63MetAsp: 1.63 ± 0.371
1.863MetGlu: 1.863 ± 0.521
0.233MetPhe: 0.233 ± 0.202
1.048MetGly: 1.048 ± 0.329
0.349MetHis: 0.349 ± 0.191
1.979MetIle: 1.979 ± 0.477
2.678MetLys: 2.678 ± 0.548
1.863MetLeu: 1.863 ± 0.69
0.466MetMet: 0.466 ± 0.244
2.329MetAsn: 2.329 ± 0.473
0.466MetPro: 0.466 ± 0.278
1.397MetGln: 1.397 ± 0.342
0.815MetArg: 0.815 ± 0.332
1.164MetSer: 1.164 ± 0.346
1.63MetThr: 1.63 ± 0.442
1.514MetVal: 1.514 ± 0.419
0.116MetTrp: 0.116 ± 0.126
1.281MetTyr: 1.281 ± 0.385
0.0MetXaa: 0.0 ± 0.0
Asn
4.657AsnAla: 4.657 ± 1.104
0.233AsnCys: 0.233 ± 0.162
3.726AsnAsp: 3.726 ± 0.741
4.89AsnGlu: 4.89 ± 0.814
2.329AsnPhe: 2.329 ± 0.481
5.356AsnGly: 5.356 ± 0.897
0.466AsnHis: 0.466 ± 0.22
3.959AsnIle: 3.959 ± 0.589
6.52AsnLys: 6.52 ± 0.981
5.006AsnLeu: 5.006 ± 0.748
1.746AsnMet: 1.746 ± 0.481
3.376AsnAsn: 3.376 ± 0.693
1.746AsnPro: 1.746 ± 0.455
2.096AsnGln: 2.096 ± 0.589
1.863AsnArg: 1.863 ± 0.361
4.191AsnSer: 4.191 ± 0.594
3.842AsnThr: 3.842 ± 0.677
3.26AsnVal: 3.26 ± 0.551
1.048AsnTrp: 1.048 ± 0.39
2.561AsnTyr: 2.561 ± 0.576
0.0AsnXaa: 0.0 ± 0.0
Pro
0.931ProAla: 0.931 ± 0.31
0.116ProCys: 0.116 ± 0.121
1.746ProAsp: 1.746 ± 0.526
1.746ProGlu: 1.746 ± 0.48
1.048ProPhe: 1.048 ± 0.379
0.233ProGly: 0.233 ± 0.179
0.116ProHis: 0.116 ± 0.104
2.212ProIle: 2.212 ± 0.599
1.863ProLys: 1.863 ± 0.419
1.863ProLeu: 1.863 ± 0.425
0.466ProMet: 0.466 ± 0.237
1.863ProAsn: 1.863 ± 0.714
0.699ProPro: 0.699 ± 0.275
0.233ProGln: 0.233 ± 0.14
0.582ProArg: 0.582 ± 0.27
0.815ProSer: 0.815 ± 0.361
2.329ProThr: 2.329 ± 0.512
0.931ProVal: 0.931 ± 0.377
0.116ProTrp: 0.116 ± 0.124
0.815ProTyr: 0.815 ± 0.337
0.0ProXaa: 0.0 ± 0.0
Gln
3.26GlnAla: 3.26 ± 0.808
0.233GlnCys: 0.233 ± 0.159
2.329GlnAsp: 2.329 ± 0.519
2.561GlnGlu: 2.561 ± 0.675
1.281GlnPhe: 1.281 ± 0.368
2.678GlnGly: 2.678 ± 0.53
0.349GlnHis: 0.349 ± 0.173
1.397GlnIle: 1.397 ± 0.329
2.445GlnLys: 2.445 ± 0.601
2.794GlnLeu: 2.794 ± 0.541
0.931GlnMet: 0.931 ± 0.25
2.561GlnAsn: 2.561 ± 0.46
0.815GlnPro: 0.815 ± 0.274
1.863GlnGln: 1.863 ± 0.536
1.281GlnArg: 1.281 ± 0.447
1.397GlnSer: 1.397 ± 0.417
2.678GlnThr: 2.678 ± 0.491
1.979GlnVal: 1.979 ± 0.418
0.699GlnTrp: 0.699 ± 0.241
1.048GlnTyr: 1.048 ± 0.382
0.0GlnXaa: 0.0 ± 0.0
Arg
2.329ArgAla: 2.329 ± 0.646
0.349ArgCys: 0.349 ± 0.171
1.63ArgAsp: 1.63 ± 0.393
2.678ArgGlu: 2.678 ± 0.49
0.931ArgPhe: 0.931 ± 0.306
2.445ArgGly: 2.445 ± 0.445
0.815ArgHis: 0.815 ± 0.383
1.979ArgIle: 1.979 ± 0.505
4.657ArgLys: 4.657 ± 0.921
3.842ArgLeu: 3.842 ± 0.639
0.582ArgMet: 0.582 ± 0.34
2.561ArgAsn: 2.561 ± 0.483
0.582ArgPro: 0.582 ± 0.254
1.164ArgGln: 1.164 ± 0.34
1.746ArgArg: 1.746 ± 0.385
1.863ArgSer: 1.863 ± 0.468
1.514ArgThr: 1.514 ± 0.354
1.514ArgVal: 1.514 ± 0.494
0.233ArgTrp: 0.233 ± 0.178
1.979ArgTyr: 1.979 ± 0.49
0.0ArgXaa: 0.0 ± 0.0
Ser
4.541SerAla: 4.541 ± 1.548
0.931SerCys: 0.931 ± 0.352
4.308SerAsp: 4.308 ± 0.719
2.678SerGlu: 2.678 ± 0.539
3.609SerPhe: 3.609 ± 0.796
5.356SerGly: 5.356 ± 1.524
0.699SerHis: 0.699 ± 0.259
3.842SerIle: 3.842 ± 0.668
6.636SerLys: 6.636 ± 0.855
5.239SerLeu: 5.239 ± 0.77
1.979SerMet: 1.979 ± 0.412
3.726SerAsn: 3.726 ± 0.716
0.931SerPro: 0.931 ± 0.389
1.979SerGln: 1.979 ± 0.605
2.212SerArg: 2.212 ± 0.383
4.657SerSer: 4.657 ± 1.203
3.493SerThr: 3.493 ± 0.786
3.842SerVal: 3.842 ± 0.709
0.815SerTrp: 0.815 ± 0.323
2.678SerTyr: 2.678 ± 0.71
0.0SerXaa: 0.0 ± 0.0
Thr
4.774ThrAla: 4.774 ± 0.754
0.349ThrCys: 0.349 ± 0.2
3.959ThrAsp: 3.959 ± 0.736
4.774ThrGlu: 4.774 ± 0.772
2.911ThrPhe: 2.911 ± 0.721
4.191ThrGly: 4.191 ± 0.709
0.466ThrHis: 0.466 ± 0.226
3.493ThrIle: 3.493 ± 0.784
3.842ThrLys: 3.842 ± 0.506
5.821ThrLeu: 5.821 ± 0.7
0.699ThrMet: 0.699 ± 0.259
3.959ThrAsn: 3.959 ± 0.608
1.514ThrPro: 1.514 ± 0.345
2.445ThrGln: 2.445 ± 0.507
2.096ThrArg: 2.096 ± 0.535
5.123ThrSer: 5.123 ± 0.877
3.609ThrThr: 3.609 ± 0.592
4.191ThrVal: 4.191 ± 0.985
0.582ThrTrp: 0.582 ± 0.284
1.979ThrTyr: 1.979 ± 0.469
0.0ThrXaa: 0.0 ± 0.0
Val
3.609ValAla: 3.609 ± 0.66
0.582ValCys: 0.582 ± 0.293
3.842ValAsp: 3.842 ± 0.674
4.191ValGlu: 4.191 ± 0.55
3.609ValPhe: 3.609 ± 0.667
3.027ValGly: 3.027 ± 0.552
0.582ValHis: 0.582 ± 0.246
4.191ValIle: 4.191 ± 0.682
6.171ValLys: 6.171 ± 0.731
3.842ValLeu: 3.842 ± 0.832
1.63ValMet: 1.63 ± 0.412
2.561ValAsn: 2.561 ± 0.489
1.281ValPro: 1.281 ± 0.44
1.63ValGln: 1.63 ± 0.371
3.144ValArg: 3.144 ± 0.703
5.356ValSer: 5.356 ± 1.045
5.006ValThr: 5.006 ± 1.098
3.959ValVal: 3.959 ± 0.815
0.466ValTrp: 0.466 ± 0.233
2.794ValTyr: 2.794 ± 0.561
0.0ValXaa: 0.0 ± 0.0
Trp
1.048TrpAla: 1.048 ± 0.302
0.233TrpCys: 0.233 ± 0.166
0.582TrpAsp: 0.582 ± 0.281
0.699TrpGlu: 0.699 ± 0.253
1.281TrpPhe: 1.281 ± 0.415
1.048TrpGly: 1.048 ± 0.476
0.233TrpHis: 0.233 ± 0.182
0.699TrpIle: 0.699 ± 0.26
0.699TrpLys: 0.699 ± 0.228
1.863TrpLeu: 1.863 ± 0.63
0.233TrpMet: 0.233 ± 0.143
1.397TrpAsn: 1.397 ± 0.486
0.0TrpPro: 0.0 ± 0.0
0.699TrpGln: 0.699 ± 0.228
0.349TrpArg: 0.349 ± 0.228
0.815TrpSer: 0.815 ± 0.271
0.815TrpThr: 0.815 ± 0.271
0.699TrpVal: 0.699 ± 0.388
0.0TrpTrp: 0.0 ± 0.0
1.281TrpTyr: 1.281 ± 0.346
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.514TyrAla: 1.514 ± 0.473
0.815TyrCys: 0.815 ± 0.365
2.561TyrAsp: 2.561 ± 0.683
3.959TyrGlu: 3.959 ± 0.659
2.212TyrPhe: 2.212 ± 0.45
3.144TyrGly: 3.144 ± 0.672
1.164TyrHis: 1.164 ± 0.317
3.842TyrIle: 3.842 ± 0.745
2.678TyrLys: 2.678 ± 0.676
3.842TyrLeu: 3.842 ± 0.69
1.048TyrMet: 1.048 ± 0.469
3.493TyrAsn: 3.493 ± 0.543
1.048TyrPro: 1.048 ± 0.381
1.514TyrGln: 1.514 ± 0.572
1.281TyrArg: 1.281 ± 0.379
2.212TyrSer: 2.212 ± 0.604
2.794TyrThr: 2.794 ± 0.659
1.979TyrVal: 1.979 ± 0.491
0.582TyrTrp: 0.582 ± 0.254
1.979TyrTyr: 1.979 ± 0.604
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (8590 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski