Amino acid dipepetide frequency for Mycobacterium phage Lamina13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.228AlaAla: 13.228 ± 1.394
0.886AlaCys: 0.886 ± 0.228
6.791AlaAsp: 6.791 ± 0.642
6.26AlaGlu: 6.26 ± 0.633
3.307AlaPhe: 3.307 ± 0.494
7.677AlaGly: 7.677 ± 0.619
1.653AlaHis: 1.653 ± 0.326
4.429AlaIle: 4.429 ± 0.565
4.252AlaLys: 4.252 ± 0.483
9.33AlaLeu: 9.33 ± 0.849
2.657AlaMet: 2.657 ± 0.394
2.362AlaAsn: 2.362 ± 0.359
4.665AlaPro: 4.665 ± 0.69
2.775AlaGln: 2.775 ± 0.43
5.964AlaArg: 5.964 ± 0.513
5.315AlaSer: 5.315 ± 0.595
5.669AlaThr: 5.669 ± 0.665
8.385AlaVal: 8.385 ± 0.879
1.831AlaTrp: 1.831 ± 0.328
2.657AlaTyr: 2.657 ± 0.346
0.0AlaXaa: 0.0 ± 0.0
Cys
0.768CysAla: 0.768 ± 0.231
0.059CysCys: 0.059 ± 0.057
0.531CysAsp: 0.531 ± 0.181
0.709CysGlu: 0.709 ± 0.22
0.177CysPhe: 0.177 ± 0.095
0.768CysGly: 0.768 ± 0.273
0.177CysHis: 0.177 ± 0.107
0.295CysIle: 0.295 ± 0.136
0.177CysLys: 0.177 ± 0.1
0.531CysLeu: 0.531 ± 0.211
0.059CysMet: 0.059 ± 0.056
0.177CysAsn: 0.177 ± 0.091
0.236CysPro: 0.236 ± 0.114
0.236CysGln: 0.236 ± 0.108
0.709CysArg: 0.709 ± 0.198
0.591CysSer: 0.591 ± 0.187
0.177CysThr: 0.177 ± 0.094
0.413CysVal: 0.413 ± 0.177
0.177CysTrp: 0.177 ± 0.094
0.177CysTyr: 0.177 ± 0.104
0.0CysXaa: 0.0 ± 0.0
Asp
6.496AspAla: 6.496 ± 0.498
0.709AspCys: 0.709 ± 0.222
4.37AspAsp: 4.37 ± 0.47
3.897AspGlu: 3.897 ± 0.568
2.362AspPhe: 2.362 ± 0.366
5.964AspGly: 5.964 ± 0.56
0.945AspHis: 0.945 ± 0.219
2.775AspIle: 2.775 ± 0.415
2.421AspLys: 2.421 ± 0.44
6.673AspLeu: 6.673 ± 0.705
1.122AspMet: 1.122 ± 0.242
1.772AspAsn: 1.772 ± 0.292
5.079AspPro: 5.079 ± 0.598
1.831AspGln: 1.831 ± 0.301
4.016AspArg: 4.016 ± 0.385
3.72AspSer: 3.72 ± 0.565
3.72AspThr: 3.72 ± 0.41
4.96AspVal: 4.96 ± 0.495
1.772AspTrp: 1.772 ± 0.284
2.008AspTyr: 2.008 ± 0.294
0.0AspXaa: 0.0 ± 0.0
Glu
6.555GluAla: 6.555 ± 0.664
0.472GluCys: 0.472 ± 0.172
5.374GluAsp: 5.374 ± 0.583
4.842GluGlu: 4.842 ± 0.575
1.772GluPhe: 1.772 ± 0.348
3.543GluGly: 3.543 ± 0.419
1.299GluHis: 1.299 ± 0.287
3.366GluIle: 3.366 ± 0.398
2.775GluLys: 2.775 ± 0.386
6.673GluLeu: 6.673 ± 0.53
1.476GluMet: 1.476 ± 0.267
1.949GluAsn: 1.949 ± 0.428
2.539GluPro: 2.539 ± 0.477
2.48GluGln: 2.48 ± 0.359
3.661GluArg: 3.661 ± 0.498
3.897GluSer: 3.897 ± 0.499
3.897GluThr: 3.897 ± 0.542
5.433GluVal: 5.433 ± 0.505
1.24GluTrp: 1.24 ± 0.271
2.362GluTyr: 2.362 ± 0.389
0.0GluXaa: 0.0 ± 0.0
Phe
2.362PheAla: 2.362 ± 0.352
0.295PheCys: 0.295 ± 0.136
2.598PheAsp: 2.598 ± 0.312
1.89PheGlu: 1.89 ± 0.301
0.472PhePhe: 0.472 ± 0.145
3.248PheGly: 3.248 ± 0.465
0.768PheHis: 0.768 ± 0.236
1.181PheIle: 1.181 ± 0.282
1.181PheLys: 1.181 ± 0.273
2.598PheLeu: 2.598 ± 0.452
0.591PheMet: 0.591 ± 0.182
1.063PheAsn: 1.063 ± 0.238
1.713PhePro: 1.713 ± 0.311
0.886PheGln: 0.886 ± 0.249
1.831PheArg: 1.831 ± 0.358
1.949PheSer: 1.949 ± 0.404
2.126PheThr: 2.126 ± 0.357
1.831PheVal: 1.831 ± 0.361
0.768PheTrp: 0.768 ± 0.222
1.004PheTyr: 1.004 ± 0.24
0.0PheXaa: 0.0 ± 0.0
Gly
7.559GlyAla: 7.559 ± 1.002
0.591GlyCys: 0.591 ± 0.202
5.374GlyAsp: 5.374 ± 0.487
4.665GlyGlu: 4.665 ± 0.505
2.775GlyPhe: 2.775 ± 0.513
8.74GlyGly: 8.74 ± 1.727
1.713GlyHis: 1.713 ± 0.308
4.547GlyIle: 4.547 ± 0.61
3.366GlyLys: 3.366 ± 0.46
7.677GlyLeu: 7.677 ± 0.803
1.89GlyMet: 1.89 ± 0.352
3.012GlyAsn: 3.012 ± 0.453
3.779GlyPro: 3.779 ± 0.566
2.835GlyGln: 2.835 ± 0.382
5.079GlyArg: 5.079 ± 0.541
6.319GlySer: 6.319 ± 0.878
5.374GlyThr: 5.374 ± 0.796
5.61GlyVal: 5.61 ± 0.666
2.598GlyTrp: 2.598 ± 0.335
2.48GlyTyr: 2.48 ± 0.282
0.0GlyXaa: 0.0 ± 0.0
His
1.713HisAla: 1.713 ± 0.337
0.059HisCys: 0.059 ± 0.064
1.063HisAsp: 1.063 ± 0.243
1.653HisGlu: 1.653 ± 0.297
0.65HisPhe: 0.65 ± 0.182
1.476HisGly: 1.476 ± 0.331
0.65HisHis: 0.65 ± 0.212
0.768HisIle: 0.768 ± 0.214
1.004HisLys: 1.004 ± 0.271
1.417HisLeu: 1.417 ± 0.353
0.118HisMet: 0.118 ± 0.113
0.295HisAsn: 0.295 ± 0.142
1.358HisPro: 1.358 ± 0.258
0.768HisGln: 0.768 ± 0.184
1.417HisArg: 1.417 ± 0.289
0.591HisSer: 0.591 ± 0.15
1.004HisThr: 1.004 ± 0.252
1.476HisVal: 1.476 ± 0.263
0.531HisTrp: 0.531 ± 0.154
0.531HisTyr: 0.531 ± 0.218
0.0HisXaa: 0.0 ± 0.0
Ile
6.201IleAla: 6.201 ± 0.765
0.236IleCys: 0.236 ± 0.095
3.071IleAsp: 3.071 ± 0.366
4.016IleGlu: 4.016 ± 0.411
0.945IlePhe: 0.945 ± 0.243
3.838IleGly: 3.838 ± 0.37
0.827IleHis: 0.827 ± 0.193
1.594IleIle: 1.594 ± 0.311
1.713IleLys: 1.713 ± 0.312
3.543IleLeu: 3.543 ± 0.455
0.65IleMet: 0.65 ± 0.209
2.067IleAsn: 2.067 ± 0.318
3.307IlePro: 3.307 ± 0.361
1.476IleGln: 1.476 ± 0.401
3.543IleArg: 3.543 ± 0.5
3.72IleSer: 3.72 ± 0.557
3.248IleThr: 3.248 ± 0.474
2.894IleVal: 2.894 ± 0.467
0.768IleTrp: 0.768 ± 0.18
1.653IleTyr: 1.653 ± 0.259
0.0IleXaa: 0.0 ± 0.0
Lys
3.838LysAla: 3.838 ± 0.505
0.236LysCys: 0.236 ± 0.109
2.362LysAsp: 2.362 ± 0.432
1.831LysGlu: 1.831 ± 0.302
1.417LysPhe: 1.417 ± 0.284
2.303LysGly: 2.303 ± 0.405
1.24LysHis: 1.24 ± 0.276
2.657LysIle: 2.657 ± 0.404
2.008LysLys: 2.008 ± 0.402
3.189LysLeu: 3.189 ± 0.428
0.945LysMet: 0.945 ± 0.203
1.653LysAsn: 1.653 ± 0.271
2.303LysPro: 2.303 ± 0.407
1.89LysGln: 1.89 ± 0.303
2.775LysArg: 2.775 ± 0.353
2.421LysSer: 2.421 ± 0.375
2.421LysThr: 2.421 ± 0.343
3.366LysVal: 3.366 ± 0.486
0.827LysTrp: 0.827 ± 0.237
1.063LysTyr: 1.063 ± 0.228
0.0LysXaa: 0.0 ± 0.0
Leu
9.507LeuAla: 9.507 ± 0.761
0.413LeuCys: 0.413 ± 0.178
6.319LeuAsp: 6.319 ± 0.503
5.492LeuGlu: 5.492 ± 0.612
2.008LeuPhe: 2.008 ± 0.38
7.972LeuGly: 7.972 ± 0.817
1.476LeuHis: 1.476 ± 0.299
4.96LeuIle: 4.96 ± 0.476
4.016LeuLys: 4.016 ± 0.466
5.315LeuLeu: 5.315 ± 0.566
1.89LeuMet: 1.89 ± 0.309
2.835LeuAsn: 2.835 ± 0.383
5.315LeuPro: 5.315 ± 0.487
2.835LeuGln: 2.835 ± 0.441
6.201LeuArg: 6.201 ± 0.554
5.433LeuSer: 5.433 ± 0.51
6.555LeuThr: 6.555 ± 0.603
4.783LeuVal: 4.783 ± 0.621
1.181LeuTrp: 1.181 ± 0.299
2.244LeuTyr: 2.244 ± 0.378
0.0LeuXaa: 0.0 ± 0.0
Met
2.303MetAla: 2.303 ± 0.321
0.0MetCys: 0.0 ± 0.0
1.299MetAsp: 1.299 ± 0.245
1.594MetGlu: 1.594 ± 0.259
0.472MetPhe: 0.472 ± 0.145
1.24MetGly: 1.24 ± 0.233
0.354MetHis: 0.354 ± 0.127
0.709MetIle: 0.709 ± 0.218
0.886MetLys: 0.886 ± 0.209
1.181MetLeu: 1.181 ± 0.26
0.236MetMet: 0.236 ± 0.12
0.886MetAsn: 0.886 ± 0.188
1.358MetPro: 1.358 ± 0.254
0.531MetGln: 0.531 ± 0.145
1.299MetArg: 1.299 ± 0.274
1.89MetSer: 1.89 ± 0.342
1.949MetThr: 1.949 ± 0.294
1.181MetVal: 1.181 ± 0.31
0.295MetTrp: 0.295 ± 0.109
0.413MetTyr: 0.413 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
2.894AsnAla: 2.894 ± 0.496
0.118AsnCys: 0.118 ± 0.085
2.185AsnAsp: 2.185 ± 0.405
1.831AsnGlu: 1.831 ± 0.333
0.886AsnPhe: 0.886 ± 0.232
3.307AsnGly: 3.307 ± 0.552
0.768AsnHis: 0.768 ± 0.17
1.594AsnIle: 1.594 ± 0.275
0.413AsnLys: 0.413 ± 0.163
2.598AsnLeu: 2.598 ± 0.377
0.531AsnMet: 0.531 ± 0.171
0.591AsnAsn: 0.591 ± 0.173
2.303AsnPro: 2.303 ± 0.356
0.945AsnGln: 0.945 ± 0.183
1.417AsnArg: 1.417 ± 0.31
1.89AsnSer: 1.89 ± 0.42
1.713AsnThr: 1.713 ± 0.333
2.48AsnVal: 2.48 ± 0.431
0.65AsnTrp: 0.65 ± 0.181
1.122AsnTyr: 1.122 ± 0.274
0.0AsnXaa: 0.0 ± 0.0
Pro
5.374ProAla: 5.374 ± 0.618
0.295ProCys: 0.295 ± 0.118
4.488ProAsp: 4.488 ± 0.508
4.193ProGlu: 4.193 ± 0.466
2.421ProPhe: 2.421 ± 0.376
5.019ProGly: 5.019 ± 0.597
0.709ProHis: 0.709 ± 0.194
2.362ProIle: 2.362 ± 0.371
1.949ProLys: 1.949 ± 0.272
4.665ProLeu: 4.665 ± 0.464
0.768ProMet: 0.768 ± 0.237
1.299ProAsn: 1.299 ± 0.299
2.539ProPro: 2.539 ± 0.414
1.476ProGln: 1.476 ± 0.288
2.775ProArg: 2.775 ± 0.452
3.602ProSer: 3.602 ± 0.536
4.193ProThr: 4.193 ± 0.505
3.72ProVal: 3.72 ± 0.451
0.827ProTrp: 0.827 ± 0.269
1.594ProTyr: 1.594 ± 0.362
0.0ProXaa: 0.0 ± 0.0
Gln
3.012GlnAla: 3.012 ± 0.453
0.118GlnCys: 0.118 ± 0.085
1.417GlnAsp: 1.417 ± 0.249
1.653GlnGlu: 1.653 ± 0.294
1.181GlnPhe: 1.181 ± 0.24
2.953GlnGly: 2.953 ± 0.346
0.531GlnHis: 0.531 ± 0.148
2.598GlnIle: 2.598 ± 0.479
1.417GlnLys: 1.417 ± 0.297
4.016GlnLeu: 4.016 ± 0.495
0.709GlnMet: 0.709 ± 0.182
0.591GlnAsn: 0.591 ± 0.17
1.772GlnPro: 1.772 ± 0.314
1.772GlnGln: 1.772 ± 0.404
1.831GlnArg: 1.831 ± 0.374
1.89GlnSer: 1.89 ± 0.301
1.772GlnThr: 1.772 ± 0.312
2.362GlnVal: 2.362 ± 0.313
0.709GlnTrp: 0.709 ± 0.207
0.591GlnTyr: 0.591 ± 0.178
0.0GlnXaa: 0.0 ± 0.0
Arg
5.256ArgAla: 5.256 ± 0.566
1.063ArgCys: 1.063 ± 0.288
3.661ArgAsp: 3.661 ± 0.443
4.665ArgGlu: 4.665 ± 0.503
1.89ArgPhe: 1.89 ± 0.311
5.079ArgGly: 5.079 ± 0.659
0.945ArgHis: 0.945 ± 0.228
3.13ArgIle: 3.13 ± 0.439
3.484ArgLys: 3.484 ± 0.59
6.437ArgLeu: 6.437 ± 0.654
1.653ArgMet: 1.653 ± 0.301
2.362ArgAsn: 2.362 ± 0.432
2.421ArgPro: 2.421 ± 0.349
1.89ArgGln: 1.89 ± 0.308
5.551ArgArg: 5.551 ± 0.631
4.134ArgSer: 4.134 ± 0.466
3.012ArgThr: 3.012 ± 0.414
5.079ArgVal: 5.079 ± 0.525
1.299ArgTrp: 1.299 ± 0.295
1.713ArgTyr: 1.713 ± 0.292
0.0ArgXaa: 0.0 ± 0.0
Ser
6.023SerAla: 6.023 ± 0.668
0.472SerCys: 0.472 ± 0.161
3.484SerAsp: 3.484 ± 0.445
4.134SerGlu: 4.134 ± 0.535
1.831SerPhe: 1.831 ± 0.343
6.968SerGly: 6.968 ± 0.741
1.122SerHis: 1.122 ± 0.255
3.189SerIle: 3.189 ± 0.453
2.421SerLys: 2.421 ± 0.393
5.315SerLeu: 5.315 ± 0.63
1.417SerMet: 1.417 ± 0.285
1.949SerAsn: 1.949 ± 0.333
3.661SerPro: 3.661 ± 0.404
1.831SerGln: 1.831 ± 0.251
3.484SerArg: 3.484 ± 0.418
3.897SerSer: 3.897 ± 0.714
3.779SerThr: 3.779 ± 0.57
4.429SerVal: 4.429 ± 0.509
1.358SerTrp: 1.358 ± 0.292
1.24SerTyr: 1.24 ± 0.285
0.0SerXaa: 0.0 ± 0.0
Thr
6.437ThrAla: 6.437 ± 0.741
0.236ThrCys: 0.236 ± 0.107
4.134ThrAsp: 4.134 ± 0.472
4.075ThrGlu: 4.075 ± 0.467
2.185ThrPhe: 2.185 ± 0.351
6.614ThrGly: 6.614 ± 0.527
0.945ThrHis: 0.945 ± 0.242
2.835ThrIle: 2.835 ± 0.606
2.421ThrLys: 2.421 ± 0.35
5.787ThrLeu: 5.787 ± 0.571
1.063ThrMet: 1.063 ± 0.24
1.653ThrAsn: 1.653 ± 0.326
3.779ThrPro: 3.779 ± 0.497
1.772ThrGln: 1.772 ± 0.316
3.425ThrArg: 3.425 ± 0.463
3.366ThrSer: 3.366 ± 0.481
4.429ThrThr: 4.429 ± 0.66
5.492ThrVal: 5.492 ± 0.503
1.122ThrTrp: 1.122 ± 0.259
1.831ThrTyr: 1.831 ± 0.328
0.0ThrXaa: 0.0 ± 0.0
Val
6.673ValAla: 6.673 ± 0.608
0.413ValCys: 0.413 ± 0.152
5.433ValAsp: 5.433 ± 0.504
5.019ValGlu: 5.019 ± 0.53
2.244ValPhe: 2.244 ± 0.302
4.842ValGly: 4.842 ± 0.654
1.24ValHis: 1.24 ± 0.213
3.661ValIle: 3.661 ± 0.545
3.189ValLys: 3.189 ± 0.448
5.433ValLeu: 5.433 ± 0.619
1.181ValMet: 1.181 ± 0.322
2.185ValAsn: 2.185 ± 0.316
4.075ValPro: 4.075 ± 0.439
2.539ValGln: 2.539 ± 0.359
6.26ValArg: 6.26 ± 0.707
5.079ValSer: 5.079 ± 0.523
5.315ValThr: 5.315 ± 0.555
5.374ValVal: 5.374 ± 0.675
1.122ValTrp: 1.122 ± 0.223
2.067ValTyr: 2.067 ± 0.411
0.0ValXaa: 0.0 ± 0.0
Trp
1.476TrpAla: 1.476 ± 0.31
0.236TrpCys: 0.236 ± 0.11
1.417TrpAsp: 1.417 ± 0.284
0.886TrpGlu: 0.886 ± 0.179
0.827TrpPhe: 0.827 ± 0.219
1.594TrpGly: 1.594 ± 0.333
0.531TrpHis: 0.531 ± 0.175
1.122TrpIle: 1.122 ± 0.248
0.472TrpLys: 0.472 ± 0.191
2.126TrpLeu: 2.126 ± 0.346
0.295TrpMet: 0.295 ± 0.148
0.413TrpAsn: 0.413 ± 0.16
0.886TrpPro: 0.886 ± 0.279
0.945TrpGln: 0.945 ± 0.199
1.299TrpArg: 1.299 ± 0.287
1.063TrpSer: 1.063 ± 0.261
1.535TrpThr: 1.535 ± 0.337
2.067TrpVal: 2.067 ± 0.317
0.65TrpTrp: 0.65 ± 0.226
0.295TrpTyr: 0.295 ± 0.122
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.421TyrAla: 2.421 ± 0.412
0.236TyrCys: 0.236 ± 0.11
1.24TyrAsp: 1.24 ± 0.265
2.303TyrGlu: 2.303 ± 0.376
0.531TyrPhe: 0.531 ± 0.151
2.835TyrGly: 2.835 ± 0.38
0.65TyrHis: 0.65 ± 0.204
1.535TyrIle: 1.535 ± 0.341
1.181TyrLys: 1.181 ± 0.22
2.244TyrLeu: 2.244 ± 0.356
0.768TyrMet: 0.768 ± 0.152
1.122TyrAsn: 1.122 ± 0.269
1.181TyrPro: 1.181 ± 0.239
1.181TyrGln: 1.181 ± 0.32
2.244TyrArg: 2.244 ± 0.364
1.24TyrSer: 1.24 ± 0.25
1.713TyrThr: 1.713 ± 0.389
2.008TyrVal: 2.008 ± 0.337
0.413TyrTrp: 0.413 ± 0.148
0.531TyrTyr: 0.531 ± 0.181
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 96 proteins (16935 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski