Amino acid dipeptide frequency for Lactococcus phage bIL286

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
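As an illustration, the calculation can be sketched in Python. This is a minimal sketch and not the code used to build the database: the function name `dipeptide_frequencies` and the toy sequences are hypothetical, sequences use one-letter codes rather than the three-letter codes of the table, and it assumes that overlapping pairs are counted within each protein and normalised by the total number of pairs (the database may normalise differently).

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Count overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides, report in per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" of two short sequences (one-letter codes).
toy_proteome = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy_proteome)
```

In the toy input the pair `KK` occurs twice among six pairs, so it receives one third of the total frequency.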

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
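The comparison against the uniform expectation can be sketched as follows. The helper `classify` is hypothetical, and the exact rule used for colouring the table is an assumption here (a plain comparison with 1/400); the two example values are taken from the table.

```python
# Uniform expectation: 20 x 20 ordered pairs, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25%
# With Xaa counted as a 21st amino acid there are 21 x 21 = 441 pairs.
EXPECTED_WITH_XAA = 1000.0 / 441


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"    # would be marked in red
    if freq_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"


# Values from the table below, in per mille.
ala_ala = classify(5.469)
ala_cys = classify(0.308)
```

Under this rule AlaAla counts as overrepresented and AlaCys as underrepresented.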

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
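The unit conversion is simple arithmetic; a minimal sketch with hypothetical helper names, using the AlaAla value from the table:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 5.469  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
```

For AlaAla this gives a fraction of about 0.005469, or roughly 0.55%.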

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.469 ± 1.586
AlaCys: 0.308 ± 0.153
AlaAsp: 4.776 ± 0.535
AlaGlu: 5.084 ± 0.641
AlaPhe: 2.388 ± 0.529
AlaGly: 4.93 ± 0.95
AlaHis: 0.847 ± 0.319
AlaIle: 5.315 ± 0.58
AlaLys: 5.161 ± 0.877
AlaLeu: 5.7 ± 0.824
AlaMet: 1.926 ± 0.317
AlaAsn: 5.623 ± 0.892
AlaPro: 1.618 ± 0.386
AlaGln: 4.006 ± 1.061
AlaArg: 2.311 ± 0.431
AlaSer: 4.468 ± 0.725
AlaThr: 4.083 ± 0.611
AlaVal: 4.314 ± 0.661
AlaTrp: 0.847 ± 0.295
AlaTyr: 3.158 ± 0.496
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.077 ± 0.064
CysCys: 0.077 ± 0.088
CysAsp: 0.231 ± 0.169
CysGlu: 0.308 ± 0.165
CysPhe: 0.154 ± 0.127
CysGly: 0.308 ± 0.18
CysHis: 0.231 ± 0.157
CysIle: 0.231 ± 0.148
CysLys: 0.539 ± 0.238
CysLeu: 0.385 ± 0.187
CysMet: 0.154 ± 0.108
CysAsn: 0.154 ± 0.093
CysPro: 0.077 ± 0.088
CysGln: 0.154 ± 0.108
CysArg: 0.385 ± 0.152
CysSer: 0.693 ± 0.25
CysThr: 0.693 ± 0.286
CysVal: 0.385 ± 0.149
CysTrp: 0.077 ± 0.081
CysTyr: 0.154 ± 0.097
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.62 ± 0.5
AspCys: 0.539 ± 0.187
AspAsp: 4.545 ± 0.633
AspGlu: 4.16 ± 0.684
AspPhe: 3.312 ± 0.498
AspGly: 6.162 ± 1.42
AspHis: 0.924 ± 0.289
AspIle: 3.389 ± 0.394
AspLys: 4.853 ± 0.538
AspLeu: 6.008 ± 0.715
AspMet: 1.464 ± 0.289
AspAsn: 3.851 ± 0.535
AspPro: 1.772 ± 0.363
AspGln: 1.232 ± 0.356
AspArg: 1.849 ± 0.376
AspSer: 4.314 ± 0.615
AspThr: 3.004 ± 0.424
AspVal: 3.158 ± 0.456
AspTrp: 1.001 ± 0.349
AspTyr: 3.004 ± 0.478
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.622 ± 0.753
GluCys: 0.385 ± 0.177
GluAsp: 2.003 ± 0.535
GluGlu: 4.314 ± 0.897
GluPhe: 2.542 ± 0.423
GluGly: 2.388 ± 0.51
GluHis: 0.77 ± 0.309
GluIle: 6.47 ± 0.835
GluLys: 6.47 ± 1.041
GluLeu: 6.085 ± 0.775
GluMet: 1.618 ± 0.375
GluAsn: 4.083 ± 0.576
GluPro: 1.772 ± 0.397
GluGln: 3.466 ± 0.409
GluArg: 2.311 ± 0.508
GluSer: 3.543 ± 0.483
GluThr: 3.62 ± 0.446
GluVal: 4.391 ± 0.727
GluTrp: 0.847 ± 0.273
GluTyr: 2.311 ± 0.527
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.696 ± 0.484
PheCys: 0.154 ± 0.108
PheAsp: 3.004 ± 0.475
PheGlu: 3.004 ± 0.679
PhePhe: 1.001 ± 0.209
PheGly: 2.619 ± 0.421
PheHis: 0.462 ± 0.159
PheIle: 3.158 ± 0.489
PheLys: 3.697 ± 0.607
PheLeu: 2.465 ± 0.64
PheMet: 1.001 ± 0.311
PheAsn: 2.542 ± 0.456
PhePro: 1.001 ± 0.252
PheGln: 1.541 ± 0.294
PheArg: 0.77 ± 0.35
PheSer: 2.85 ± 0.445
PheThr: 2.773 ± 0.518
PheVal: 1.926 ± 0.476
PheTrp: 0.308 ± 0.153
PheTyr: 1.695 ± 0.334
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.158 ± 0.616
GlyCys: 0.539 ± 0.262
GlyAsp: 3.466 ± 0.554
GlyGlu: 3.235 ± 0.45
GlyPhe: 2.619 ± 0.332
GlyGly: 4.237 ± 0.86
GlyHis: 1.078 ± 0.253
GlyIle: 5.161 ± 0.576
GlyLys: 5.469 ± 0.689
GlyLeu: 5.546 ± 0.774
GlyMet: 1.849 ± 0.316
GlyAsn: 3.929 ± 0.664
GlyPro: 0.924 ± 0.338
GlyGln: 3.235 ± 0.496
GlyArg: 1.155 ± 0.269
GlySer: 6.162 ± 0.691
GlyThr: 6.085 ± 1.212
GlyVal: 3.235 ± 0.656
GlyTrp: 1.232 ± 0.255
GlyTyr: 3.235 ± 0.368
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.387 ± 0.476
HisCys: 0.154 ± 0.105
HisAsp: 1.001 ± 0.278
HisGlu: 1.001 ± 0.348
HisPhe: 0.462 ± 0.206
HisGly: 0.77 ± 0.227
HisHis: 0.231 ± 0.105
HisIle: 0.693 ± 0.227
HisLys: 1.155 ± 0.279
HisLeu: 1.001 ± 0.295
HisMet: 0.308 ± 0.163
HisAsn: 0.693 ± 0.217
HisPro: 0.308 ± 0.141
HisGln: 0.924 ± 0.275
HisArg: 0.462 ± 0.191
HisSer: 0.924 ± 0.266
HisThr: 0.847 ± 0.223
HisVal: 0.462 ± 0.14
HisTrp: 0.231 ± 0.127
HisTyr: 0.616 ± 0.25
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.931 ± 0.614
IleCys: 0.308 ± 0.126
IleAsp: 4.776 ± 0.668
IleGlu: 4.006 ± 0.615
IlePhe: 2.388 ± 0.59
IleGly: 4.776 ± 0.627
IleHis: 0.693 ± 0.222
IleIle: 4.314 ± 0.749
IleLys: 4.853 ± 0.69
IleLeu: 4.16 ± 0.603
IleMet: 1.618 ± 0.302
IleAsn: 6.008 ± 0.72
IlePro: 1.849 ± 0.308
IleGln: 2.08 ± 0.42
IleArg: 2.157 ± 0.47
IleSer: 6.008 ± 0.863
IleThr: 5.315 ± 0.877
IleVal: 4.391 ± 0.655
IleTrp: 0.693 ± 0.237
IleTyr: 2.465 ± 0.559
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.933 ± 1.327
LysCys: 0.308 ± 0.156
LysAsp: 6.316 ± 0.873
LysGlu: 5.854 ± 0.832
LysPhe: 3.62 ± 0.481
LysGly: 4.545 ± 0.778
LysHis: 1.387 ± 0.462
LysIle: 4.545 ± 0.615
LysLys: 7.395 ± 1.107
LysLeu: 6.316 ± 0.763
LysMet: 2.234 ± 0.455
LysAsn: 5.238 ± 0.519
LysPro: 1.618 ± 0.474
LysGln: 3.929 ± 0.753
LysArg: 2.388 ± 0.532
LysSer: 5.546 ± 0.691
LysThr: 5.7 ± 0.578
LysVal: 4.468 ± 0.622
LysTrp: 1.541 ± 0.355
LysTyr: 3.004 ± 0.495
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.239 ± 0.633
LeuCys: 0.385 ± 0.17
LeuAsp: 4.468 ± 0.782
LeuGlu: 4.699 ± 0.597
LeuPhe: 2.85 ± 0.567
LeuGly: 3.929 ± 0.592
LeuHis: 0.693 ± 0.262
LeuIle: 5.315 ± 0.617
LeuLys: 5.161 ± 0.556
LeuLeu: 6.47 ± 1.056
LeuMet: 2.388 ± 0.406
LeuAsn: 6.548 ± 0.724
LeuPro: 3.312 ± 0.582
LeuGln: 4.083 ± 0.654
LeuArg: 2.234 ± 0.394
LeuSer: 6.47 ± 1.134
LeuThr: 5.161 ± 0.903
LeuVal: 3.62 ± 0.489
LeuTrp: 1.001 ± 0.369
LeuTyr: 2.311 ± 0.519
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.157 ± 0.352
MetCys: 0.154 ± 0.124
MetAsp: 0.693 ± 0.268
MetGlu: 1.849 ± 0.379
MetPhe: 1.001 ± 0.331
MetGly: 1.232 ± 0.254
MetHis: 0.308 ± 0.176
MetIle: 1.849 ± 0.368
MetLys: 2.773 ± 0.486
MetLeu: 0.847 ± 0.29
MetMet: 0.693 ± 0.271
MetAsn: 1.926 ± 0.415
MetPro: 1.078 ± 0.267
MetGln: 1.078 ± 0.313
MetArg: 0.77 ± 0.22
MetSer: 2.619 ± 0.448
MetThr: 2.542 ± 0.468
MetVal: 1.232 ± 0.272
MetTrp: 0.385 ± 0.157
MetTyr: 0.77 ± 0.228
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.929 ± 0.563
AsnCys: 0.231 ± 0.101
AsnAsp: 4.468 ± 0.609
AsnGlu: 3.697 ± 0.504
AsnPhe: 2.465 ± 0.398
AsnGly: 6.085 ± 0.799
AsnHis: 1.001 ± 0.311
AsnIle: 5.161 ± 0.632
AsnLys: 4.776 ± 0.594
AsnLeu: 5.392 ± 0.749
AsnMet: 2.388 ± 0.405
AsnAsn: 4.006 ± 0.631
AsnPro: 2.311 ± 0.389
AsnGln: 3.774 ± 0.813
AsnArg: 2.465 ± 0.402
AsnSer: 5.546 ± 0.763
AsnThr: 3.929 ± 0.543
AsnVal: 3.62 ± 0.565
AsnTrp: 0.693 ± 0.268
AsnTyr: 1.772 ± 0.375
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.232 ± 0.37
ProCys: 0.077 ± 0.08
ProAsp: 1.31 ± 0.433
ProGlu: 2.157 ± 0.446
ProPhe: 1.001 ± 0.303
ProGly: 1.001 ± 0.398
ProHis: 0.616 ± 0.222
ProIle: 2.157 ± 0.407
ProLys: 2.234 ± 0.443
ProLeu: 2.465 ± 0.412
ProMet: 0.616 ± 0.238
ProAsn: 1.618 ± 0.569
ProPro: 0.77 ± 0.216
ProGln: 1.849 ± 0.364
ProArg: 0.385 ± 0.138
ProSer: 2.234 ± 0.4
ProThr: 2.08 ± 0.37
ProVal: 1.926 ± 0.394
ProTrp: 0.539 ± 0.264
ProTyr: 0.77 ± 0.218
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.851 ± 0.766
GlnCys: 0.0 ± 0.0
GlnAsp: 2.234 ± 0.494
GlnGlu: 3.158 ± 0.392
GlnPhe: 1.772 ± 0.326
GlnGly: 3.158 ± 0.483
GlnHis: 0.616 ± 0.207
GlnIle: 3.004 ± 0.47
GlnLys: 3.312 ± 1.155
GlnLeu: 4.391 ± 0.724
GlnMet: 1.078 ± 0.269
GlnAsn: 2.619 ± 0.606
GlnPro: 0.77 ± 0.256
GlnGln: 2.619 ± 0.613
GlnArg: 1.849 ± 0.429
GlnSer: 3.004 ± 0.591
GlnThr: 3.158 ± 0.6
GlnVal: 2.234 ± 0.411
GlnTrp: 0.385 ± 0.173
GlnTyr: 1.695 ± 0.285
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.619 ± 0.402
ArgCys: 0.385 ± 0.187
ArgAsp: 1.31 ± 0.382
ArgGlu: 2.85 ± 0.698
ArgPhe: 1.772 ± 0.331
ArgGly: 1.849 ± 0.311
ArgHis: 0.462 ± 0.224
ArgIle: 2.003 ± 0.456
ArgLys: 2.773 ± 0.801
ArgLeu: 2.08 ± 0.457
ArgMet: 0.616 ± 0.257
ArgAsn: 2.773 ± 0.421
ArgPro: 0.77 ± 0.29
ArgGln: 1.078 ± 0.304
ArgArg: 1.078 ± 0.376
ArgSer: 1.926 ± 0.416
ArgThr: 1.541 ± 0.366
ArgVal: 1.926 ± 0.426
ArgTrp: 0.462 ± 0.222
ArgTyr: 1.618 ± 0.383
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.007 ± 0.613
SerCys: 0.154 ± 0.109
SerAsp: 5.392 ± 0.687
SerGlu: 4.622 ± 0.523
SerPhe: 2.927 ± 0.497
SerGly: 5.854 ± 0.895
SerHis: 0.847 ± 0.258
SerIle: 4.699 ± 0.714
SerLys: 5.931 ± 0.607
SerLeu: 5.7 ± 0.896
SerMet: 1.926 ± 0.323
SerAsn: 4.699 ± 0.595
SerPro: 1.695 ± 0.367
SerGln: 2.696 ± 0.525
SerArg: 2.234 ± 0.391
SerSer: 4.545 ± 0.619
SerThr: 5.392 ± 0.819
SerVal: 5.469 ± 0.739
SerTrp: 0.693 ± 0.278
SerTyr: 2.542 ± 0.382
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.008 ± 0.72
ThrCys: 0.462 ± 0.23
ThrAsp: 3.851 ± 0.614
ThrGlu: 4.006 ± 0.49
ThrPhe: 2.234 ± 0.34
ThrGly: 5.238 ± 0.583
ThrHis: 1.001 ± 0.282
ThrIle: 4.314 ± 0.858
ThrLys: 6.548 ± 0.572
ThrLeu: 3.851 ± 0.551
ThrMet: 1.387 ± 0.336
ThrAsn: 4.699 ± 0.821
ThrPro: 1.772 ± 0.423
ThrGln: 2.311 ± 0.388
ThrArg: 2.773 ± 0.355
ThrSer: 4.776 ± 0.77
ThrThr: 6.316 ± 1.417
ThrVal: 5.546 ± 1.02
ThrTrp: 0.847 ± 0.266
ThrTyr: 2.388 ± 0.62
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.62 ± 0.645
ValCys: 0.077 ± 0.071
ValAsp: 4.468 ± 0.585
ValGlu: 3.004 ± 0.508
ValPhe: 2.311 ± 0.513
ValGly: 3.235 ± 0.458
ValHis: 0.616 ± 0.187
ValIle: 3.851 ± 0.469
ValLys: 5.623 ± 0.818
ValLeu: 4.699 ± 0.572
ValMet: 1.464 ± 0.428
ValAsn: 4.314 ± 0.661
ValPro: 2.157 ± 0.57
ValGln: 2.234 ± 0.497
ValArg: 1.772 ± 0.581
ValSer: 4.16 ± 0.501
ValThr: 4.776 ± 0.636
ValVal: 3.312 ± 0.449
ValTrp: 0.693 ± 0.223
ValTyr: 1.618 ± 0.315
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.078 ± 0.266
TrpCys: 0.0 ± 0.0
TrpAsp: 0.924 ± 0.252
TrpGlu: 0.693 ± 0.227
TrpPhe: 0.462 ± 0.162
TrpGly: 1.001 ± 0.216
TrpHis: 0.462 ± 0.189
TrpIle: 0.693 ± 0.226
TrpLys: 1.001 ± 0.274
TrpLeu: 1.31 ± 0.417
TrpMet: 0.231 ± 0.13
TrpAsn: 0.616 ± 0.211
TrpPro: 0.077 ± 0.08
TrpGln: 0.462 ± 0.165
TrpArg: 0.77 ± 0.267
TrpSer: 0.616 ± 0.216
TrpThr: 1.001 ± 0.48
TrpVal: 1.078 ± 0.245
TrpTrp: 0.539 ± 0.198
TrpTyr: 0.308 ± 0.175
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.696 ± 0.483
TyrCys: 0.77 ± 0.288
TyrAsp: 2.927 ± 0.428
TyrGlu: 2.311 ± 0.409
TyrPhe: 1.464 ± 0.411
TyrGly: 2.234 ± 0.416
TyrHis: 0.385 ± 0.234
TyrIle: 2.465 ± 0.476
TyrLys: 3.158 ± 0.526
TyrLeu: 2.465 ± 0.629
TyrMet: 0.924 ± 0.286
TyrAsn: 1.772 ± 0.315
TyrPro: 1.387 ± 0.363
TyrGln: 2.157 ± 0.624
TyrArg: 1.772 ± 0.315
TyrSer: 2.619 ± 0.553
TyrThr: 2.311 ± 0.551
TyrVal: 1.31 ± 0.241
TyrTrp: 0.308 ± 0.151
TyrTyr: 1.541 ± 0.326
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (12983 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
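A protein-level bootstrap can be sketched as follows. This is an illustrative sketch only, not the database's code: the function names, the toy sequences (one-letter codes), the fixed seed, and the choice of the sample standard deviation as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimate."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Protein-level resampling: draw as many proteins as the proteome contains.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, pair))
    # Assumption: the error is summarised as the sample standard deviation.
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes).
toy_proteome = ["MKKLA", "AKKG", "MLAKK", "GGSKL"]
kk_error = bootstrap_error(toy_proteome, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.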

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski