Amino acid dipepetide frequency for Lactobacillus phage phig1e

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.656AlaAla: 7.656 ± 1.125
0.934AlaCys: 0.934 ± 0.381
6.162AlaAsp: 6.162 ± 0.773
3.268AlaGlu: 3.268 ± 0.523
2.707AlaPhe: 2.707 ± 0.427
6.815AlaGly: 6.815 ± 1.584
1.12AlaHis: 1.12 ± 0.224
5.508AlaIle: 5.508 ± 0.747
7.842AlaLys: 7.842 ± 1.059
5.882AlaLeu: 5.882 ± 0.544
2.801AlaMet: 2.801 ± 0.556
4.201AlaAsn: 4.201 ± 0.745
1.774AlaPro: 1.774 ± 0.389
2.707AlaGln: 2.707 ± 0.832
3.081AlaArg: 3.081 ± 0.533
4.855AlaSer: 4.855 ± 0.737
5.135AlaThr: 5.135 ± 0.539
5.415AlaVal: 5.415 ± 0.639
1.027AlaTrp: 1.027 ± 0.322
2.334AlaTyr: 2.334 ± 0.534
0.0AlaXaa: 0.0 ± 0.0
Cys
0.28CysAla: 0.28 ± 0.151
0.093CysCys: 0.093 ± 0.094
0.467CysAsp: 0.467 ± 0.254
0.28CysGlu: 0.28 ± 0.181
0.187CysPhe: 0.187 ± 0.14
1.027CysGly: 1.027 ± 0.556
0.28CysHis: 0.28 ± 0.149
0.187CysIle: 0.187 ± 0.152
0.28CysLys: 0.28 ± 0.179
0.747CysLeu: 0.747 ± 0.291
0.373CysMet: 0.373 ± 0.213
0.28CysAsn: 0.28 ± 0.164
0.28CysPro: 0.28 ± 0.166
0.56CysGln: 0.56 ± 0.227
0.373CysArg: 0.373 ± 0.236
0.28CysSer: 0.28 ± 0.147
0.373CysThr: 0.373 ± 0.203
0.654CysVal: 0.654 ± 0.267
0.093CysTrp: 0.093 ± 0.091
0.187CysTyr: 0.187 ± 0.173
0.0CysXaa: 0.0 ± 0.0
Asp
5.322AspAla: 5.322 ± 0.707
0.654AspCys: 0.654 ± 0.263
6.255AspAsp: 6.255 ± 0.874
4.668AspGlu: 4.668 ± 0.679
2.334AspPhe: 2.334 ± 0.504
5.882AspGly: 5.882 ± 0.818
1.587AspHis: 1.587 ± 0.422
3.734AspIle: 3.734 ± 0.565
5.322AspLys: 5.322 ± 0.567
7.002AspLeu: 7.002 ± 0.708
1.961AspMet: 1.961 ± 0.458
4.481AspAsn: 4.481 ± 0.699
2.147AspPro: 2.147 ± 0.501
2.241AspGln: 2.241 ± 0.449
2.427AspArg: 2.427 ± 0.494
4.668AspSer: 4.668 ± 0.592
4.761AspThr: 4.761 ± 0.635
3.361AspVal: 3.361 ± 0.585
0.747AspTrp: 0.747 ± 0.213
3.641AspTyr: 3.641 ± 0.731
0.0AspXaa: 0.0 ± 0.0
Glu
5.042GluAla: 5.042 ± 0.588
0.56GluCys: 0.56 ± 0.24
3.174GluAsp: 3.174 ± 0.555
4.015GluGlu: 4.015 ± 0.656
2.054GluPhe: 2.054 ± 0.446
2.521GluGly: 2.521 ± 0.498
1.307GluHis: 1.307 ± 0.323
2.894GluIle: 2.894 ± 0.441
3.734GluLys: 3.734 ± 0.522
6.069GluLeu: 6.069 ± 0.934
1.214GluMet: 1.214 ± 0.344
2.054GluAsn: 2.054 ± 0.542
1.681GluPro: 1.681 ± 0.474
2.988GluGln: 2.988 ± 0.637
2.147GluArg: 2.147 ± 0.536
3.454GluSer: 3.454 ± 0.561
3.641GluThr: 3.641 ± 0.524
3.268GluVal: 3.268 ± 0.506
0.84GluTrp: 0.84 ± 0.301
2.241GluTyr: 2.241 ± 0.502
0.0GluXaa: 0.0 ± 0.0
Phe
2.801PheAla: 2.801 ± 0.564
0.373PheCys: 0.373 ± 0.251
3.361PheAsp: 3.361 ± 0.663
2.707PheGlu: 2.707 ± 0.64
1.494PhePhe: 1.494 ± 0.432
2.988PheGly: 2.988 ± 0.545
0.187PheHis: 0.187 ± 0.096
1.867PheIle: 1.867 ± 0.392
2.521PheLys: 2.521 ± 0.51
2.054PheLeu: 2.054 ± 0.595
1.4PheMet: 1.4 ± 0.371
1.961PheAsn: 1.961 ± 0.542
1.214PhePro: 1.214 ± 0.282
0.934PheGln: 0.934 ± 0.248
1.307PheArg: 1.307 ± 0.368
2.427PheSer: 2.427 ± 0.453
2.334PheThr: 2.334 ± 0.434
1.774PheVal: 1.774 ± 0.325
0.747PheTrp: 0.747 ± 0.252
1.494PheTyr: 1.494 ± 0.452
0.0PheXaa: 0.0 ± 0.0
Gly
4.761GlyAla: 4.761 ± 0.914
0.373GlyCys: 0.373 ± 0.164
3.734GlyAsp: 3.734 ± 0.514
3.921GlyGlu: 3.921 ± 0.664
2.521GlyPhe: 2.521 ± 0.43
4.668GlyGly: 4.668 ± 1.249
2.147GlyHis: 2.147 ± 0.454
4.295GlyIle: 4.295 ± 0.612
5.322GlyLys: 5.322 ± 0.952
5.788GlyLeu: 5.788 ± 0.687
1.774GlyMet: 1.774 ± 0.408
4.761GlyAsn: 4.761 ± 0.615
1.681GlyPro: 1.681 ± 0.458
3.081GlyGln: 3.081 ± 0.469
1.961GlyArg: 1.961 ± 0.621
5.322GlySer: 5.322 ± 0.861
5.042GlyThr: 5.042 ± 0.705
4.388GlyVal: 4.388 ± 0.896
0.373GlyTrp: 0.373 ± 0.182
2.801GlyTyr: 2.801 ± 0.43
0.0GlyXaa: 0.0 ± 0.0
His
1.494HisAla: 1.494 ± 0.517
0.187HisCys: 0.187 ± 0.137
1.681HisAsp: 1.681 ± 0.436
1.027HisGlu: 1.027 ± 0.3
0.654HisPhe: 0.654 ± 0.212
1.4HisGly: 1.4 ± 0.343
0.467HisHis: 0.467 ± 0.285
0.56HisIle: 0.56 ± 0.266
1.774HisLys: 1.774 ± 0.405
1.307HisLeu: 1.307 ± 0.348
0.373HisMet: 0.373 ± 0.165
0.934HisAsn: 0.934 ± 0.307
0.654HisPro: 0.654 ± 0.246
0.84HisGln: 0.84 ± 0.271
1.307HisArg: 1.307 ± 0.368
0.84HisSer: 0.84 ± 0.314
1.307HisThr: 1.307 ± 0.312
2.054HisVal: 2.054 ± 0.463
0.373HisTrp: 0.373 ± 0.221
0.84HisTyr: 0.84 ± 0.324
0.0HisXaa: 0.0 ± 0.0
Ile
5.415IleAla: 5.415 ± 0.613
0.093IleCys: 0.093 ± 0.094
5.228IleAsp: 5.228 ± 0.587
4.015IleGlu: 4.015 ± 0.589
1.214IlePhe: 1.214 ± 0.302
4.201IleGly: 4.201 ± 0.661
0.747IleHis: 0.747 ± 0.317
2.988IleIle: 2.988 ± 0.521
5.135IleLys: 5.135 ± 0.666
2.894IleLeu: 2.894 ± 0.533
1.4IleMet: 1.4 ± 0.346
3.548IleAsn: 3.548 ± 0.721
1.961IlePro: 1.961 ± 0.463
2.894IleGln: 2.894 ± 0.445
2.241IleArg: 2.241 ± 0.539
4.855IleSer: 4.855 ± 0.797
4.201IleThr: 4.201 ± 0.699
4.295IleVal: 4.295 ± 0.694
0.093IleTrp: 0.093 ± 0.082
2.334IleTyr: 2.334 ± 0.434
0.0IleXaa: 0.0 ± 0.0
Lys
7.096LysAla: 7.096 ± 1.37
0.467LysCys: 0.467 ± 0.233
5.228LysAsp: 5.228 ± 0.806
4.575LysGlu: 4.575 ± 0.898
2.241LysPhe: 2.241 ± 0.453
3.268LysGly: 3.268 ± 0.611
1.214LysHis: 1.214 ± 0.292
4.108LysIle: 4.108 ± 0.611
6.722LysLys: 6.722 ± 1.78
7.282LysLeu: 7.282 ± 1.0
1.961LysMet: 1.961 ± 0.521
3.641LysAsn: 3.641 ± 0.585
2.707LysPro: 2.707 ± 0.502
4.201LysGln: 4.201 ± 0.91
3.734LysArg: 3.734 ± 0.835
5.135LysSer: 5.135 ± 0.796
5.975LysThr: 5.975 ± 0.609
4.108LysVal: 4.108 ± 0.649
0.84LysTrp: 0.84 ± 0.263
3.174LysTyr: 3.174 ± 0.522
0.0LysXaa: 0.0 ± 0.0
Leu
8.029LeuAla: 8.029 ± 0.882
0.654LeuCys: 0.654 ± 0.314
6.909LeuAsp: 6.909 ± 0.854
3.828LeuGlu: 3.828 ± 0.658
3.268LeuPhe: 3.268 ± 0.737
5.322LeuGly: 5.322 ± 0.885
1.307LeuHis: 1.307 ± 0.318
5.602LeuIle: 5.602 ± 0.802
5.788LeuLys: 5.788 ± 0.677
6.442LeuLeu: 6.442 ± 1.213
1.587LeuMet: 1.587 ± 0.472
4.015LeuAsn: 4.015 ± 0.652
2.521LeuPro: 2.521 ± 0.555
2.427LeuGln: 2.427 ± 0.671
3.081LeuArg: 3.081 ± 0.522
5.135LeuSer: 5.135 ± 0.809
5.788LeuThr: 5.788 ± 0.767
5.135LeuVal: 5.135 ± 0.66
0.934LeuTrp: 0.934 ± 0.289
2.894LeuTyr: 2.894 ± 0.614
0.0LeuXaa: 0.0 ± 0.0
Met
3.361MetAla: 3.361 ± 0.548
0.467MetCys: 0.467 ± 0.283
1.867MetAsp: 1.867 ± 0.471
1.12MetGlu: 1.12 ± 0.286
0.84MetPhe: 0.84 ± 0.28
1.867MetGly: 1.867 ± 0.416
0.28MetHis: 0.28 ± 0.161
1.681MetIle: 1.681 ± 0.348
3.174MetLys: 3.174 ± 0.593
1.867MetLeu: 1.867 ± 0.451
1.027MetMet: 1.027 ± 0.348
0.84MetAsn: 0.84 ± 0.25
0.84MetPro: 0.84 ± 0.259
1.12MetGln: 1.12 ± 0.27
1.4MetArg: 1.4 ± 0.346
2.894MetSer: 2.894 ± 0.739
1.681MetThr: 1.681 ± 0.413
1.307MetVal: 1.307 ± 0.382
0.187MetTrp: 0.187 ± 0.143
0.56MetTyr: 0.56 ± 0.194
0.0MetXaa: 0.0 ± 0.0
Asn
3.454AsnAla: 3.454 ± 0.509
0.373AsnCys: 0.373 ± 0.154
3.268AsnAsp: 3.268 ± 0.747
3.268AsnGlu: 3.268 ± 0.655
0.934AsnPhe: 0.934 ± 0.342
4.855AsnGly: 4.855 ± 0.543
1.587AsnHis: 1.587 ± 0.447
2.147AsnIle: 2.147 ± 0.453
3.734AsnLys: 3.734 ± 0.731
3.361AsnLeu: 3.361 ± 0.454
1.681AsnMet: 1.681 ± 0.503
3.081AsnAsn: 3.081 ± 0.605
2.521AsnPro: 2.521 ± 0.684
2.707AsnGln: 2.707 ± 0.438
2.894AsnArg: 2.894 ± 0.595
3.921AsnSer: 3.921 ± 0.727
3.174AsnThr: 3.174 ± 0.627
2.988AsnVal: 2.988 ± 0.674
0.654AsnTrp: 0.654 ± 0.219
2.147AsnTyr: 2.147 ± 0.453
0.0AsnXaa: 0.0 ± 0.0
Pro
2.801ProAla: 2.801 ± 0.423
0.093ProCys: 0.093 ± 0.073
2.521ProAsp: 2.521 ± 0.679
2.054ProGlu: 2.054 ± 0.417
1.587ProPhe: 1.587 ± 0.43
1.587ProGly: 1.587 ± 0.417
1.027ProHis: 1.027 ± 0.31
1.494ProIle: 1.494 ± 0.287
2.427ProLys: 2.427 ± 0.453
2.894ProLeu: 2.894 ± 0.484
1.12ProMet: 1.12 ± 0.256
1.4ProAsn: 1.4 ± 0.372
0.654ProPro: 0.654 ± 0.285
1.214ProGln: 1.214 ± 0.325
1.027ProArg: 1.027 ± 0.253
1.867ProSer: 1.867 ± 0.458
1.867ProThr: 1.867 ± 0.393
1.494ProVal: 1.494 ± 0.394
0.187ProTrp: 0.187 ± 0.145
1.307ProTyr: 1.307 ± 0.369
0.0ProXaa: 0.0 ± 0.0
Gln
3.548GlnAla: 3.548 ± 0.604
0.187GlnCys: 0.187 ± 0.16
2.894GlnAsp: 2.894 ± 0.55
1.867GlnGlu: 1.867 ± 0.363
2.521GlnPhe: 2.521 ± 0.434
2.334GlnGly: 2.334 ± 0.412
0.56GlnHis: 0.56 ± 0.253
2.614GlnIle: 2.614 ± 0.489
2.334GlnLys: 2.334 ± 0.57
3.548GlnLeu: 3.548 ± 0.587
1.12GlnMet: 1.12 ± 0.38
2.427GlnAsn: 2.427 ± 0.617
1.027GlnPro: 1.027 ± 0.273
2.054GlnGln: 2.054 ± 0.491
2.801GlnArg: 2.801 ± 0.53
2.241GlnSer: 2.241 ± 0.396
3.548GlnThr: 3.548 ± 0.616
2.707GlnVal: 2.707 ± 0.537
0.56GlnTrp: 0.56 ± 0.344
1.587GlnTyr: 1.587 ± 0.358
0.0GlnXaa: 0.0 ± 0.0
Arg
2.707ArgAla: 2.707 ± 0.506
0.187ArgCys: 0.187 ± 0.149
2.521ArgAsp: 2.521 ± 0.482
1.494ArgGlu: 1.494 ± 0.314
1.681ArgPhe: 1.681 ± 0.453
1.867ArgGly: 1.867 ± 0.441
1.307ArgHis: 1.307 ± 0.607
3.641ArgIle: 3.641 ± 0.698
3.268ArgLys: 3.268 ± 0.524
3.454ArgLeu: 3.454 ± 0.74
1.587ArgMet: 1.587 ± 0.322
1.867ArgAsn: 1.867 ± 0.499
1.12ArgPro: 1.12 ± 0.302
1.587ArgGln: 1.587 ± 0.353
1.307ArgArg: 1.307 ± 0.459
2.054ArgSer: 2.054 ± 0.505
2.427ArgThr: 2.427 ± 0.5
1.961ArgVal: 1.961 ± 0.507
1.12ArgTrp: 1.12 ± 0.373
1.4ArgTyr: 1.4 ± 0.347
0.0ArgXaa: 0.0 ± 0.0
Ser
4.295SerAla: 4.295 ± 0.703
0.28SerCys: 0.28 ± 0.191
5.228SerAsp: 5.228 ± 0.683
4.015SerGlu: 4.015 ± 0.491
3.361SerPhe: 3.361 ± 0.607
5.508SerGly: 5.508 ± 0.838
1.681SerHis: 1.681 ± 0.37
3.921SerIle: 3.921 ± 0.473
5.042SerLys: 5.042 ± 0.803
4.668SerLeu: 4.668 ± 0.595
2.334SerMet: 2.334 ± 0.38
3.828SerAsn: 3.828 ± 0.51
2.054SerPro: 2.054 ± 0.471
2.801SerGln: 2.801 ± 0.471
2.334SerArg: 2.334 ± 0.411
5.882SerSer: 5.882 ± 1.03
4.668SerThr: 4.668 ± 0.683
3.454SerVal: 3.454 ± 0.561
0.84SerTrp: 0.84 ± 0.347
2.427SerTyr: 2.427 ± 0.418
0.0SerXaa: 0.0 ± 0.0
Thr
5.135ThrAla: 5.135 ± 0.551
0.56ThrCys: 0.56 ± 0.273
4.295ThrAsp: 4.295 ± 0.896
3.268ThrGlu: 3.268 ± 0.597
2.521ThrPhe: 2.521 ± 0.462
5.882ThrGly: 5.882 ± 0.748
1.12ThrHis: 1.12 ± 0.389
5.788ThrIle: 5.788 ± 0.759
5.602ThrLys: 5.602 ± 0.964
5.322ThrLeu: 5.322 ± 0.589
1.307ThrMet: 1.307 ± 0.314
3.361ThrAsn: 3.361 ± 0.704
2.334ThrPro: 2.334 ± 0.506
2.147ThrGln: 2.147 ± 0.508
1.307ThrArg: 1.307 ± 0.4
4.575ThrSer: 4.575 ± 0.796
4.761ThrThr: 4.761 ± 0.638
4.575ThrVal: 4.575 ± 0.52
1.307ThrTrp: 1.307 ± 0.328
2.801ThrTyr: 2.801 ± 0.492
0.0ThrXaa: 0.0 ± 0.0
Val
4.855ValAla: 4.855 ± 0.571
0.654ValCys: 0.654 ± 0.318
4.201ValAsp: 4.201 ± 0.603
2.894ValGlu: 2.894 ± 0.519
1.494ValPhe: 1.494 ± 0.439
3.921ValGly: 3.921 ± 0.598
0.84ValHis: 0.84 ± 0.267
4.295ValIle: 4.295 ± 0.649
4.201ValLys: 4.201 ± 0.669
5.135ValLeu: 5.135 ± 0.74
1.867ValMet: 1.867 ± 0.314
3.174ValAsn: 3.174 ± 0.546
2.147ValPro: 2.147 ± 0.436
3.081ValGln: 3.081 ± 0.492
1.587ValArg: 1.587 ± 0.389
4.481ValSer: 4.481 ± 0.557
4.575ValThr: 4.575 ± 0.701
4.201ValVal: 4.201 ± 0.581
0.654ValTrp: 0.654 ± 0.185
1.867ValTyr: 1.867 ± 0.349
0.0ValXaa: 0.0 ± 0.0
Trp
0.654TrpAla: 0.654 ± 0.246
0.0TrpCys: 0.0 ± 0.0
1.12TrpAsp: 1.12 ± 0.311
0.56TrpGlu: 0.56 ± 0.217
0.654TrpPhe: 0.654 ± 0.254
0.467TrpGly: 0.467 ± 0.217
0.56TrpHis: 0.56 ± 0.279
0.654TrpIle: 0.654 ± 0.304
0.84TrpLys: 0.84 ± 0.252
1.587TrpLeu: 1.587 ± 0.428
0.373TrpMet: 0.373 ± 0.183
0.467TrpAsn: 0.467 ± 0.196
0.187TrpPro: 0.187 ± 0.145
0.56TrpGln: 0.56 ± 0.239
0.56TrpArg: 0.56 ± 0.209
0.747TrpSer: 0.747 ± 0.222
0.56TrpThr: 0.56 ± 0.21
0.934TrpVal: 0.934 ± 0.277
0.0TrpTrp: 0.0 ± 0.0
0.467TrpTyr: 0.467 ± 0.201
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.521TyrAla: 2.521 ± 0.482
0.093TyrCys: 0.093 ± 0.098
3.081TyrAsp: 3.081 ± 0.543
2.054TyrGlu: 2.054 ± 0.345
1.681TyrPhe: 1.681 ± 0.442
2.334TyrGly: 2.334 ± 0.45
0.747TyrHis: 0.747 ± 0.229
1.774TyrIle: 1.774 ± 0.426
2.614TyrLys: 2.614 ± 0.485
3.361TyrLeu: 3.361 ± 0.519
1.027TyrMet: 1.027 ± 0.351
2.427TyrAsn: 2.427 ± 0.454
1.12TyrPro: 1.12 ± 0.285
2.334TyrGln: 2.334 ± 0.569
1.681TyrArg: 1.681 ± 0.408
3.081TyrSer: 3.081 ± 0.602
2.147TyrThr: 2.147 ± 0.528
2.054TyrVal: 2.054 ± 0.37
0.373TyrTrp: 0.373 ± 0.218
2.614TyrTyr: 2.614 ± 0.517
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (10712 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski