Amino acid dipeptide frequency for Lactobacillus phage phiAT3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteins.
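A minimal sketch of how such a frequency could be computed is shown below. The function name `dipeptide_frequencies`, the one-letter input alphabet, and the normalization by the total number of counted dipeptides are assumptions made for illustration; the database may normalize or handle edge cases differently.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids, one-letter codes

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences for illustration only, not taken from the phiAT3 proteome.
freqs = dipeptide_frequencies(["MKAAL", "AAK"])
print(freqs)
```

Counting within each protein separately avoids creating artificial dipeptides from the last residue of one protein and the first residue of the next.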

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
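The expected value can be checked with a few lines of arithmetic. The sketch below assumes that a dipeptide counts as over- or underrepresented simply when its frequency is above or below the uniform expectation; the page does not state the exact rule used for colouring, and the helper names are hypothetical.

```python
def expected_uniform_permille(alphabet_size=20):
    """Frequency (in per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"

expected = expected_uniform_permille(20)         # 2.5 per mille, i.e. 0.25%
expected_with_xaa = expected_uniform_permille(21)  # about 2.268 per mille for 441 pairs

print(expected, round(expected_with_xaa, 3))
print(classify(7.582, expected))  # AlaAla value from the table
print(classify(0.085, expected))  # TrpTrp value from the table
```

Note that a uniform expectation ignores the differing abundance of individual amino acids, so it is only a rough baseline.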

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide and columns the second; each cell is the frequency in ‰ ± error.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 7.582 ± 0.907 | 0.596 ± 0.213 | 6.559 ± 0.939 | 4.685 ± 0.792 | 3.493 ± 0.595 | 3.919 ± 0.721 | 2.13 ± 0.487 | 5.707 ± 0.643 | 7.07 ± 1.109 | 6.559 ± 0.699 | 2.13 ± 0.502 | 4.856 ± 0.592 | 1.959 ± 0.399 | 3.748 ± 1.031 | 3.067 ± 0.469 | 5.622 ± 0.771 | 3.833 ± 0.554 | 5.537 ± 0.69 | 1.619 ± 0.454 | 2.726 ± 0.426 | 0.0 ± 0.0
Cys | 0.341 ± 0.212 | 0.256 ± 0.165 | 0.681 ± 0.257 | 0.17 ± 0.132 | 0.341 ± 0.18 | 0.341 ± 0.189 | 0.426 ± 0.214 | 0.256 ± 0.137 | 0.511 ± 0.305 | 0.681 ± 0.315 | 0.256 ± 0.127 | 0.767 ± 0.337 | 0.256 ± 0.148 | 0.17 ± 0.111 | 0.256 ± 0.166 | 0.085 ± 0.089 | 0.341 ± 0.162 | 0.426 ± 0.2 | 0.17 ± 0.102 | 0.426 ± 0.201 | 0.0 ± 0.0
Asp | 5.367 ± 0.714 | 0.426 ± 0.195 | 4.856 ± 0.756 | 4.6 ± 0.689 | 2.811 ± 0.597 | 6.219 ± 0.769 | 1.704 ± 0.468 | 3.663 ± 0.617 | 4.515 ± 0.592 | 5.452 ± 0.887 | 1.448 ± 0.382 | 4.174 ± 0.725 | 2.215 ± 0.477 | 2.896 ± 0.572 | 3.237 ± 0.505 | 5.367 ± 0.555 | 3.152 ± 0.483 | 5.111 ± 0.752 | 1.193 ± 0.374 | 2.726 ± 0.473 | 0.0 ± 0.0
Glu | 4.089 ± 0.726 | 0.426 ± 0.232 | 2.982 ± 0.443 | 3.322 ± 0.699 | 1.619 ± 0.36 | 2.385 ± 0.514 | 1.278 ± 0.399 | 3.237 ± 0.619 | 4.004 ± 0.833 | 4.515 ± 0.618 | 1.619 ± 0.336 | 3.237 ± 0.544 | 1.874 ± 0.577 | 2.47 ± 0.323 | 2.215 ± 0.57 | 2.726 ± 0.605 | 3.152 ± 0.462 | 3.067 ± 0.546 | 0.937 ± 0.267 | 1.704 ± 0.395 | 0.0 ± 0.0
Phe | 2.896 ± 0.266 | 0.085 ± 0.09 | 2.13 ± 0.456 | 2.385 ± 0.429 | 0.426 ± 0.189 | 2.811 ± 0.493 | 0.937 ± 0.293 | 2.385 ± 0.548 | 2.641 ± 0.456 | 2.556 ± 0.573 | 0.937 ± 0.331 | 2.556 ± 0.506 | 1.533 ± 0.34 | 0.852 ± 0.313 | 1.278 ± 0.384 | 2.641 ± 0.363 | 2.385 ± 0.447 | 2.641 ± 0.452 | 1.363 ± 0.73 | 1.363 ± 0.311 | 0.0 ± 0.0
Gly | 5.111 ± 0.8 | 0.426 ± 0.189 | 4.089 ± 0.706 | 2.044 ± 0.475 | 3.407 ± 0.558 | 4.259 ± 0.734 | 1.874 ± 0.382 | 4.259 ± 0.841 | 5.537 ± 0.812 | 4.685 ± 0.709 | 2.215 ± 0.389 | 4.004 ± 0.551 | 1.363 ± 0.365 | 2.47 ± 0.417 | 2.726 ± 0.555 | 3.493 ± 0.598 | 4.174 ± 0.593 | 5.111 ± 0.758 | 1.789 ± 0.554 | 3.578 ± 0.539 | 0.0 ± 0.0
His | 1.874 ± 0.418 | 0.17 ± 0.129 | 1.533 ± 0.359 | 1.278 ± 0.348 | 1.193 ± 0.292 | 1.278 ± 0.274 | 0.426 ± 0.269 | 1.789 ± 0.34 | 1.278 ± 0.336 | 1.022 ± 0.282 | 0.596 ± 0.194 | 1.193 ± 0.277 | 0.681 ± 0.275 | 0.511 ± 0.189 | 0.937 ± 0.352 | 1.874 ± 0.598 | 1.107 ± 0.206 | 1.704 ± 0.38 | 0.17 ± 0.123 | 1.278 ± 0.312 | 0.0 ± 0.0
Ile | 4.685 ± 0.614 | 0.426 ± 0.288 | 4.344 ± 0.443 | 3.237 ± 0.519 | 1.789 ± 0.378 | 3.833 ± 0.552 | 1.874 ± 0.47 | 2.896 ± 0.447 | 4.685 ± 0.572 | 3.067 ± 0.493 | 1.107 ± 0.37 | 4.089 ± 0.627 | 2.556 ± 0.527 | 1.619 ± 0.305 | 3.237 ± 0.583 | 4.174 ± 0.472 | 2.982 ± 0.619 | 3.833 ± 0.574 | 1.022 ± 0.329 | 3.152 ± 0.55 | 0.0 ± 0.0
Lys | 7.07 ± 1.018 | 0.341 ± 0.181 | 4.174 ± 0.676 | 3.067 ± 0.692 | 2.385 ± 0.383 | 3.833 ± 0.593 | 1.789 ± 0.379 | 4.43 ± 0.716 | 4.77 ± 0.687 | 5.111 ± 0.643 | 2.385 ± 0.454 | 3.152 ± 0.515 | 3.407 ± 0.554 | 5.026 ± 0.837 | 3.663 ± 0.796 | 5.537 ± 1.191 | 5.367 ± 0.688 | 4.259 ± 0.644 | 1.363 ± 0.296 | 2.726 ± 0.471 | 0.0 ± 0.0
Leu | 6.73 ± 0.805 | 0.852 ± 0.311 | 4.941 ± 0.574 | 3.919 ± 0.707 | 3.493 ± 0.598 | 5.622 ± 0.707 | 1.619 ± 0.424 | 3.919 ± 0.4 | 4.856 ± 0.783 | 4.685 ± 0.542 | 2.044 ± 0.424 | 4.004 ± 0.717 | 2.641 ± 0.498 | 3.748 ± 0.567 | 3.067 ± 0.419 | 3.919 ± 0.493 | 4.77 ± 0.609 | 4.77 ± 0.544 | 1.278 ± 0.385 | 2.726 ± 0.488 | 0.0 ± 0.0
Met | 2.641 ± 0.358 | 0.17 ± 0.11 | 1.874 ± 0.406 | 1.278 ± 0.317 | 0.511 ± 0.342 | 1.278 ± 0.28 | 0.511 ± 0.168 | 1.789 ± 0.505 | 2.385 ± 0.352 | 1.619 ± 0.51 | 0.681 ± 0.27 | 1.874 ± 0.287 | 0.767 ± 0.259 | 1.278 ± 0.357 | 1.022 ± 0.391 | 1.704 ± 0.359 | 1.789 ± 0.341 | 1.533 ± 0.407 | 0.341 ± 0.165 | 0.767 ± 0.264 | 0.0 ± 0.0
Asn | 5.282 ± 0.951 | 0.17 ± 0.203 | 5.367 ± 0.696 | 2.811 ± 0.469 | 1.107 ± 0.245 | 5.878 ± 1.041 | 0.852 ± 0.233 | 3.067 ± 0.499 | 3.748 ± 0.537 | 5.111 ± 0.669 | 1.789 ± 0.516 | 3.152 ± 0.681 | 2.3 ± 0.392 | 2.811 ± 0.491 | 2.982 ± 0.494 | 3.237 ± 0.506 | 2.896 ± 0.489 | 2.896 ± 0.452 | 0.852 ± 0.248 | 1.704 ± 0.406 | 0.0 ± 0.0
Pro | 2.556 ± 0.471 | 0.085 ± 0.095 | 3.152 ± 0.519 | 2.47 ± 0.498 | 1.107 ± 0.28 | 1.448 ± 0.328 | 0.767 ± 0.267 | 1.619 ± 0.436 | 2.556 ± 0.452 | 2.556 ± 0.482 | 0.511 ± 0.208 | 2.556 ± 0.451 | 0.937 ± 0.339 | 1.193 ± 0.387 | 1.363 ± 0.427 | 2.896 ± 0.503 | 1.789 ± 0.411 | 2.385 ± 0.609 | 0.426 ± 0.24 | 1.022 ± 0.277 | 0.0 ± 0.0
Gln | 5.111 ± 0.819 | 0.17 ± 0.124 | 2.385 ± 0.561 | 2.044 ± 0.402 | 1.193 ± 0.282 | 2.385 ± 0.499 | 1.278 ± 0.283 | 2.47 ± 0.513 | 3.919 ± 0.965 | 4.174 ± 0.63 | 1.533 ± 0.386 | 1.789 ± 0.44 | 1.704 ± 0.356 | 3.067 ± 1.392 | 1.959 ± 0.51 | 2.3 ± 0.525 | 3.067 ± 0.475 | 2.641 ± 0.508 | 0.681 ± 0.27 | 1.533 ± 0.334 | 0.0 ± 0.0
Arg | 2.811 ± 0.521 | 0.681 ± 0.334 | 3.237 ± 0.662 | 2.044 ± 0.472 | 2.3 ± 0.422 | 2.641 ± 0.498 | 0.937 ± 0.277 | 2.556 ± 0.61 | 3.493 ± 0.724 | 3.833 ± 0.676 | 1.363 ± 0.287 | 1.959 ± 0.362 | 1.193 ± 0.25 | 1.704 ± 0.428 | 1.789 ± 0.511 | 3.322 ± 0.543 | 2.47 ± 0.479 | 2.726 ± 0.581 | 1.022 ± 0.306 | 2.385 ± 0.569 | 0.0 ± 0.0
Ser | 6.219 ± 0.738 | 0.17 ± 0.131 | 4.685 ± 0.601 | 3.237 ± 0.466 | 3.322 ± 0.469 | 6.133 ± 1.04 | 1.193 ± 0.305 | 3.663 ± 0.473 | 4.515 ± 0.846 | 4.43 ± 0.644 | 1.448 ± 0.408 | 4.43 ± 0.604 | 1.874 ± 0.504 | 3.067 ± 0.526 | 3.493 ± 0.542 | 4.174 ± 0.646 | 3.407 ± 0.534 | 4.174 ± 0.74 | 0.596 ± 0.257 | 2.982 ± 0.648 | 0.0 ± 0.0
Thr | 4.174 ± 0.592 | 0.256 ± 0.182 | 4.941 ± 0.848 | 2.385 ± 0.472 | 2.47 ± 0.544 | 4.174 ± 0.395 | 0.511 ± 0.172 | 3.919 ± 0.497 | 3.748 ± 0.595 | 4.515 ± 0.651 | 1.278 ± 0.304 | 3.237 ± 0.554 | 2.044 ± 0.436 | 2.556 ± 0.423 | 2.385 ± 0.471 | 4.941 ± 0.598 | 3.322 ± 0.514 | 4.685 ± 0.789 | 1.107 ± 0.422 | 2.47 ± 0.704 | 0.0 ± 0.0
Val | 4.856 ± 0.551 | 0.341 ± 0.16 | 4.856 ± 0.701 | 3.237 ± 0.553 | 1.789 ± 0.344 | 4.685 ± 0.668 | 0.937 ± 0.257 | 4.174 ± 0.447 | 5.793 ± 0.697 | 4.515 ± 0.778 | 1.363 ± 0.322 | 3.152 ± 0.433 | 1.619 ± 0.353 | 2.726 ± 0.417 | 2.556 ± 0.442 | 5.026 ± 0.64 | 5.111 ± 0.705 | 4.004 ± 0.736 | 0.937 ± 0.259 | 2.896 ± 0.5 | 0.0 ± 0.0
Trp | 0.681 ± 0.22 | 0.767 ± 0.306 | 0.937 ± 0.319 | 0.852 ± 0.246 | 0.681 ± 0.196 | 0.596 ± 0.261 | 0.341 ± 0.16 | 1.022 ± 0.276 | 1.533 ± 0.423 | 1.448 ± 0.33 | 0.17 ± 0.092 | 1.789 ± 0.824 | 0.341 ± 0.148 | 1.193 ± 0.292 | 1.363 ± 0.307 | 1.363 ± 0.425 | 1.363 ± 0.346 | 0.852 ± 0.237 | 0.085 ± 0.078 | 0.256 ± 0.154 | 0.0 ± 0.0
Tyr | 3.322 ± 0.689 | 0.341 ± 0.185 | 3.407 ± 0.528 | 1.789 ± 0.558 | 1.363 ± 0.264 | 2.896 ± 0.714 | 0.511 ± 0.202 | 1.789 ± 0.449 | 2.385 ± 0.551 | 3.067 ± 0.529 | 0.852 ± 0.205 | 2.044 ± 0.355 | 2.13 ± 0.399 | 2.3 ± 0.52 | 1.874 ± 0.566 | 2.811 ± 0.562 | 2.641 ± 0.534 | 2.215 ± 0.333 | 0.681 ± 0.253 | 2.044 ± 0.56 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 55 proteins (11740 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
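A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resulting estimates; neither detail is stated on this page, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences for illustration only.
toy = ["MKAAL", "AAK", "GGGG"]
print(dipeptide_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.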

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski