Amino acid dipepetide frequency for Enterococcus phage vB_EfaP_Ef7.2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.015AlaAla: 1.015 ± 0.721
0.508AlaCys: 0.508 ± 0.267
3.553AlaAsp: 3.553 ± 0.724
3.215AlaGlu: 3.215 ± 0.729
3.046AlaPhe: 3.046 ± 0.5
2.707AlaGly: 2.707 ± 0.735
0.846AlaHis: 0.846 ± 0.399
4.399AlaIle: 4.399 ± 0.919
3.384AlaLys: 3.384 ± 0.714
4.907AlaLeu: 4.907 ± 0.875
1.861AlaMet: 1.861 ± 0.534
3.553AlaAsn: 3.553 ± 0.581
1.861AlaPro: 1.861 ± 0.384
2.707AlaGln: 2.707 ± 0.554
2.03AlaArg: 2.03 ± 0.39
3.215AlaSer: 3.215 ± 0.568
4.399AlaThr: 4.399 ± 1.045
2.2AlaVal: 2.2 ± 0.645
0.677AlaTrp: 0.677 ± 0.341
2.03AlaTyr: 2.03 ± 0.516
0.0AlaXaa: 0.0 ± 0.0
Cys
0.338CysAla: 0.338 ± 0.201
0.0CysCys: 0.0 ± 0.0
0.677CysAsp: 0.677 ± 0.269
0.677CysGlu: 0.677 ± 0.255
0.338CysPhe: 0.338 ± 0.222
0.846CysGly: 0.846 ± 0.426
0.169CysHis: 0.169 ± 0.177
0.338CysIle: 0.338 ± 0.215
0.169CysLys: 0.169 ± 0.165
0.338CysLeu: 0.338 ± 0.221
0.169CysMet: 0.169 ± 0.161
0.677CysAsn: 0.677 ± 0.265
0.169CysPro: 0.169 ± 0.184
0.677CysGln: 0.677 ± 0.364
0.338CysArg: 0.338 ± 0.239
0.677CysSer: 0.677 ± 0.327
0.508CysThr: 0.508 ± 0.239
0.846CysVal: 0.846 ± 0.384
0.0CysTrp: 0.0 ± 0.0
0.169CysTyr: 0.169 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
1.692AspAla: 1.692 ± 0.587
0.846AspCys: 0.846 ± 0.313
2.03AspAsp: 2.03 ± 0.781
5.415AspGlu: 5.415 ± 0.974
3.723AspPhe: 3.723 ± 0.561
3.892AspGly: 3.892 ± 0.853
0.338AspHis: 0.338 ± 0.239
4.738AspIle: 4.738 ± 0.648
5.245AspLys: 5.245 ± 0.878
5.922AspLeu: 5.922 ± 0.811
1.354AspMet: 1.354 ± 0.572
4.399AspAsn: 4.399 ± 0.785
2.2AspPro: 2.2 ± 0.743
0.846AspGln: 0.846 ± 0.407
2.2AspArg: 2.2 ± 0.625
2.707AspSer: 2.707 ± 0.639
3.723AspThr: 3.723 ± 1.006
4.907AspVal: 4.907 ± 0.822
0.508AspTrp: 0.508 ± 0.261
4.061AspTyr: 4.061 ± 0.575
0.0AspXaa: 0.0 ± 0.0
Glu
3.553GluAla: 3.553 ± 0.772
0.677GluCys: 0.677 ± 0.258
4.061GluAsp: 4.061 ± 0.81
6.261GluGlu: 6.261 ± 1.311
3.215GluPhe: 3.215 ± 0.653
4.23GluGly: 4.23 ± 0.99
1.523GluHis: 1.523 ± 0.361
6.768GluIle: 6.768 ± 1.31
7.107GluLys: 7.107 ± 1.203
5.753GluLeu: 5.753 ± 1.274
2.876GluMet: 2.876 ± 0.609
4.738GluAsn: 4.738 ± 0.759
1.692GluPro: 1.692 ± 0.952
2.876GluGln: 2.876 ± 0.532
2.876GluArg: 2.876 ± 0.504
3.553GluSer: 3.553 ± 0.845
4.738GluThr: 4.738 ± 0.812
5.415GluVal: 5.415 ± 1.0
1.354GluTrp: 1.354 ± 0.499
3.892GluTyr: 3.892 ± 0.865
0.0GluXaa: 0.0 ± 0.0
Phe
1.692PheAla: 1.692 ± 0.396
0.508PheCys: 0.508 ± 0.247
3.553PheAsp: 3.553 ± 0.868
2.876PheGlu: 2.876 ± 0.739
1.354PhePhe: 1.354 ± 0.552
3.384PheGly: 3.384 ± 0.967
0.508PheHis: 0.508 ± 0.269
5.076PheIle: 5.076 ± 0.76
2.876PheLys: 2.876 ± 0.607
3.723PheLeu: 3.723 ± 0.813
1.354PheMet: 1.354 ± 0.387
5.076PheAsn: 5.076 ± 1.025
1.692PhePro: 1.692 ± 0.486
0.677PheGln: 0.677 ± 0.366
1.861PheArg: 1.861 ± 0.415
3.384PheSer: 3.384 ± 0.837
3.553PheThr: 3.553 ± 0.968
2.707PheVal: 2.707 ± 0.662
0.338PheTrp: 0.338 ± 0.226
2.2PheTyr: 2.2 ± 0.633
0.0PheXaa: 0.0 ± 0.0
Gly
4.399GlyAla: 4.399 ± 1.685
0.508GlyCys: 0.508 ± 0.276
4.061GlyAsp: 4.061 ± 0.801
4.23GlyGlu: 4.23 ± 0.828
2.369GlyPhe: 2.369 ± 0.741
6.261GlyGly: 6.261 ± 0.992
1.523GlyHis: 1.523 ± 0.43
4.907GlyIle: 4.907 ± 0.815
3.892GlyLys: 3.892 ± 0.583
4.23GlyLeu: 4.23 ± 0.938
1.015GlyMet: 1.015 ± 0.422
3.723GlyAsn: 3.723 ± 0.905
0.508GlyPro: 0.508 ± 0.272
1.015GlyGln: 1.015 ± 0.507
1.184GlyArg: 1.184 ± 0.34
4.569GlySer: 4.569 ± 1.056
3.384GlyThr: 3.384 ± 0.751
3.892GlyVal: 3.892 ± 0.836
0.338GlyTrp: 0.338 ± 0.204
2.707GlyTyr: 2.707 ± 0.665
0.0GlyXaa: 0.0 ± 0.0
His
1.015HisAla: 1.015 ± 0.486
0.169HisCys: 0.169 ± 0.126
0.508HisAsp: 0.508 ± 0.235
1.523HisGlu: 1.523 ± 0.441
1.692HisPhe: 1.692 ± 0.501
1.015HisGly: 1.015 ± 0.481
0.169HisHis: 0.169 ± 0.184
1.692HisIle: 1.692 ± 0.404
0.338HisLys: 0.338 ± 0.19
1.523HisLeu: 1.523 ± 0.379
0.508HisMet: 0.508 ± 0.284
1.861HisAsn: 1.861 ± 0.54
0.508HisPro: 0.508 ± 0.307
0.169HisGln: 0.169 ± 0.126
1.015HisArg: 1.015 ± 0.484
1.015HisSer: 1.015 ± 0.379
0.169HisThr: 0.169 ± 0.165
0.846HisVal: 0.846 ± 0.318
0.0HisTrp: 0.0 ± 0.0
1.523HisTyr: 1.523 ± 0.425
0.0HisXaa: 0.0 ± 0.0
Ile
4.907IleAla: 4.907 ± 0.951
0.677IleCys: 0.677 ± 0.278
5.076IleAsp: 5.076 ± 0.82
5.922IleGlu: 5.922 ± 1.181
2.2IlePhe: 2.2 ± 1.003
3.553IleGly: 3.553 ± 0.527
1.692IleHis: 1.692 ± 0.511
3.892IleIle: 3.892 ± 0.963
6.599IleLys: 6.599 ± 1.042
4.061IleLeu: 4.061 ± 0.682
1.354IleMet: 1.354 ± 0.489
5.753IleAsn: 5.753 ± 0.691
2.876IlePro: 2.876 ± 0.58
2.707IleGln: 2.707 ± 0.795
1.184IleArg: 1.184 ± 0.36
4.907IleSer: 4.907 ± 0.789
4.907IleThr: 4.907 ± 0.793
3.046IleVal: 3.046 ± 0.645
0.677IleTrp: 0.677 ± 0.402
3.046IleTyr: 3.046 ± 0.702
0.0IleXaa: 0.0 ± 0.0
Lys
5.584LysAla: 5.584 ± 1.066
0.169LysCys: 0.169 ± 0.176
5.245LysAsp: 5.245 ± 0.905
8.291LysGlu: 8.291 ± 1.633
4.399LysPhe: 4.399 ± 0.89
4.23LysGly: 4.23 ± 0.624
1.015LysHis: 1.015 ± 0.326
4.399LysIle: 4.399 ± 0.883
5.753LysLys: 5.753 ± 0.944
7.614LysLeu: 7.614 ± 0.952
2.876LysMet: 2.876 ± 0.728
3.046LysAsn: 3.046 ± 0.84
3.215LysPro: 3.215 ± 0.91
3.046LysGln: 3.046 ± 0.859
4.061LysArg: 4.061 ± 0.638
3.215LysSer: 3.215 ± 0.817
5.245LysThr: 5.245 ± 0.99
6.599LysVal: 6.599 ± 0.819
0.846LysTrp: 0.846 ± 0.318
3.384LysTyr: 3.384 ± 0.836
0.0LysXaa: 0.0 ± 0.0
Leu
5.245LeuAla: 5.245 ± 0.686
0.338LeuCys: 0.338 ± 0.236
5.922LeuAsp: 5.922 ± 0.988
6.261LeuGlu: 6.261 ± 1.122
1.015LeuPhe: 1.015 ± 0.457
4.061LeuGly: 4.061 ± 0.901
1.354LeuHis: 1.354 ± 0.421
4.738LeuIle: 4.738 ± 0.919
8.291LeuLys: 8.291 ± 1.103
6.091LeuLeu: 6.091 ± 1.325
1.861LeuMet: 1.861 ± 0.616
5.076LeuAsn: 5.076 ± 1.081
3.723LeuPro: 3.723 ± 0.783
1.692LeuGln: 1.692 ± 0.602
2.876LeuArg: 2.876 ± 0.662
4.738LeuSer: 4.738 ± 0.788
5.753LeuThr: 5.753 ± 0.834
4.738LeuVal: 4.738 ± 0.731
1.354LeuTrp: 1.354 ± 0.525
3.553LeuTyr: 3.553 ± 0.764
0.0LeuXaa: 0.0 ± 0.0
Met
1.184MetAla: 1.184 ± 0.369
0.169MetCys: 0.169 ± 0.146
1.015MetAsp: 1.015 ± 0.519
2.03MetGlu: 2.03 ± 0.485
2.03MetPhe: 2.03 ± 0.376
2.03MetGly: 2.03 ± 0.432
0.0MetHis: 0.0 ± 0.0
2.369MetIle: 2.369 ± 0.625
2.876MetLys: 2.876 ± 0.462
3.215MetLeu: 3.215 ± 0.689
0.508MetMet: 0.508 ± 0.29
1.861MetAsn: 1.861 ± 0.417
0.338MetPro: 0.338 ± 0.221
0.677MetGln: 0.677 ± 0.312
1.354MetArg: 1.354 ± 0.555
1.861MetSer: 1.861 ± 0.629
2.369MetThr: 2.369 ± 0.849
1.184MetVal: 1.184 ± 0.359
0.338MetTrp: 0.338 ± 0.228
1.354MetTyr: 1.354 ± 0.573
0.0MetXaa: 0.0 ± 0.0
Asn
4.061AsnAla: 4.061 ± 0.881
0.169AsnCys: 0.169 ± 0.182
2.876AsnAsp: 2.876 ± 0.649
5.076AsnGlu: 5.076 ± 1.129
3.553AsnPhe: 3.553 ± 0.876
4.23AsnGly: 4.23 ± 0.87
2.03AsnHis: 2.03 ± 0.552
4.738AsnIle: 4.738 ± 0.937
5.245AsnLys: 5.245 ± 0.838
4.569AsnLeu: 4.569 ± 0.901
2.538AsnMet: 2.538 ± 0.607
6.937AsnAsn: 6.937 ± 1.131
3.892AsnPro: 3.892 ± 0.782
2.707AsnGln: 2.707 ± 0.694
2.03AsnArg: 2.03 ± 0.585
4.569AsnSer: 4.569 ± 0.638
2.707AsnThr: 2.707 ± 0.62
3.215AsnVal: 3.215 ± 0.661
1.354AsnTrp: 1.354 ± 0.429
4.569AsnTyr: 4.569 ± 0.822
0.0AsnXaa: 0.0 ± 0.0
Pro
1.692ProAla: 1.692 ± 0.426
0.0ProCys: 0.0 ± 0.0
3.215ProAsp: 3.215 ± 0.735
3.046ProGlu: 3.046 ± 0.382
2.369ProPhe: 2.369 ± 0.419
0.677ProGly: 0.677 ± 0.286
0.338ProHis: 0.338 ± 0.178
1.523ProIle: 1.523 ± 0.477
3.723ProLys: 3.723 ± 0.907
2.03ProLeu: 2.03 ± 0.516
0.338ProMet: 0.338 ± 0.257
2.538ProAsn: 2.538 ± 0.915
0.677ProPro: 0.677 ± 0.644
1.354ProGln: 1.354 ± 0.376
0.508ProArg: 0.508 ± 0.267
2.707ProSer: 2.707 ± 0.629
3.046ProThr: 3.046 ± 0.653
1.354ProVal: 1.354 ± 0.655
0.508ProTrp: 0.508 ± 0.23
1.015ProTyr: 1.015 ± 0.255
0.0ProXaa: 0.0 ± 0.0
Gln
1.861GlnAla: 1.861 ± 0.554
0.169GlnCys: 0.169 ± 0.161
1.015GlnAsp: 1.015 ± 0.388
2.2GlnGlu: 2.2 ± 0.471
2.707GlnPhe: 2.707 ± 0.56
1.692GlnGly: 1.692 ± 0.524
0.677GlnHis: 0.677 ± 0.256
1.861GlnIle: 1.861 ± 0.493
2.707GlnLys: 2.707 ± 0.548
2.876GlnLeu: 2.876 ± 0.511
0.677GlnMet: 0.677 ± 0.314
2.707GlnAsn: 2.707 ± 0.602
0.846GlnPro: 0.846 ± 0.38
1.015GlnGln: 1.015 ± 0.357
0.846GlnArg: 0.846 ± 0.382
1.015GlnSer: 1.015 ± 0.349
3.215GlnThr: 3.215 ± 0.819
2.03GlnVal: 2.03 ± 0.723
1.523GlnTrp: 1.523 ± 0.456
1.523GlnTyr: 1.523 ± 0.446
0.0GlnXaa: 0.0 ± 0.0
Arg
1.184ArgAla: 1.184 ± 0.527
0.846ArgCys: 0.846 ± 0.3
1.523ArgAsp: 1.523 ± 0.366
2.03ArgGlu: 2.03 ± 0.41
2.369ArgPhe: 2.369 ± 0.683
1.692ArgGly: 1.692 ± 0.466
0.677ArgHis: 0.677 ± 0.306
2.707ArgIle: 2.707 ± 0.579
2.707ArgLys: 2.707 ± 0.629
3.215ArgLeu: 3.215 ± 0.587
0.846ArgMet: 0.846 ± 0.522
2.03ArgAsn: 2.03 ± 0.395
0.846ArgPro: 0.846 ± 0.294
2.2ArgGln: 2.2 ± 0.605
0.846ArgArg: 0.846 ± 0.469
1.015ArgSer: 1.015 ± 0.4
1.015ArgThr: 1.015 ± 0.398
1.861ArgVal: 1.861 ± 0.509
0.0ArgTrp: 0.0 ± 0.0
3.215ArgTyr: 3.215 ± 0.629
0.0ArgXaa: 0.0 ± 0.0
Ser
2.707SerAla: 2.707 ± 0.727
0.0SerCys: 0.0 ± 0.0
4.907SerAsp: 4.907 ± 0.931
3.384SerGlu: 3.384 ± 0.58
3.215SerPhe: 3.215 ± 0.707
3.215SerGly: 3.215 ± 1.048
1.184SerHis: 1.184 ± 0.367
4.061SerIle: 4.061 ± 0.632
5.584SerLys: 5.584 ± 0.734
4.569SerLeu: 4.569 ± 1.301
2.369SerMet: 2.369 ± 0.643
3.046SerAsn: 3.046 ± 0.919
1.184SerPro: 1.184 ± 0.432
2.03SerGln: 2.03 ± 0.468
1.184SerArg: 1.184 ± 0.46
2.538SerSer: 2.538 ± 0.537
3.553SerThr: 3.553 ± 0.866
3.046SerVal: 3.046 ± 0.708
1.184SerTrp: 1.184 ± 0.547
3.215SerTyr: 3.215 ± 0.973
0.0SerXaa: 0.0 ± 0.0
Thr
3.892ThrAla: 3.892 ± 1.099
0.338ThrCys: 0.338 ± 0.207
3.384ThrAsp: 3.384 ± 0.856
5.415ThrGlu: 5.415 ± 0.868
3.723ThrPhe: 3.723 ± 0.609
4.569ThrGly: 4.569 ± 0.863
1.015ThrHis: 1.015 ± 0.341
4.738ThrIle: 4.738 ± 0.683
5.076ThrLys: 5.076 ± 0.861
4.907ThrLeu: 4.907 ± 0.986
2.369ThrMet: 2.369 ± 0.698
5.076ThrAsn: 5.076 ± 0.735
2.2ThrPro: 2.2 ± 0.829
3.046ThrGln: 3.046 ± 0.829
2.2ThrArg: 2.2 ± 0.558
4.23ThrSer: 4.23 ± 0.809
5.753ThrThr: 5.753 ± 1.187
4.23ThrVal: 4.23 ± 0.899
0.508ThrTrp: 0.508 ± 0.352
2.876ThrTyr: 2.876 ± 0.589
0.0ThrXaa: 0.0 ± 0.0
Val
3.384ValAla: 3.384 ± 0.846
0.677ValCys: 0.677 ± 0.255
4.23ValAsp: 4.23 ± 1.145
5.584ValGlu: 5.584 ± 0.94
2.369ValPhe: 2.369 ± 0.473
2.876ValGly: 2.876 ± 0.742
0.508ValHis: 0.508 ± 0.329
2.876ValIle: 2.876 ± 0.567
5.753ValLys: 5.753 ± 0.897
3.553ValLeu: 3.553 ± 0.739
2.03ValMet: 2.03 ± 0.471
3.892ValAsn: 3.892 ± 0.85
2.538ValPro: 2.538 ± 0.483
0.338ValGln: 0.338 ± 0.214
2.2ValArg: 2.2 ± 0.503
3.046ValSer: 3.046 ± 0.79
5.584ValThr: 5.584 ± 1.216
3.553ValVal: 3.553 ± 0.847
0.338ValTrp: 0.338 ± 0.248
3.384ValTyr: 3.384 ± 0.57
0.0ValXaa: 0.0 ± 0.0
Trp
0.677TrpAla: 0.677 ± 0.324
0.508TrpCys: 0.508 ± 0.365
0.677TrpAsp: 0.677 ± 0.312
0.508TrpGlu: 0.508 ± 0.303
0.677TrpPhe: 0.677 ± 0.306
0.0TrpGly: 0.0 ± 0.0
0.338TrpHis: 0.338 ± 0.181
1.184TrpIle: 1.184 ± 0.404
0.508TrpLys: 0.508 ± 0.311
1.354TrpLeu: 1.354 ± 0.508
0.338TrpMet: 0.338 ± 0.222
0.846TrpAsn: 0.846 ± 0.396
0.0TrpPro: 0.0 ± 0.0
0.677TrpGln: 0.677 ± 0.269
0.846TrpArg: 0.846 ± 0.36
0.508TrpSer: 0.508 ± 0.26
1.354TrpThr: 1.354 ± 0.456
0.677TrpVal: 0.677 ± 0.38
0.169TrpTrp: 0.169 ± 0.161
1.015TrpTyr: 1.015 ± 0.405
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.03TyrAla: 2.03 ± 0.405
0.846TyrCys: 0.846 ± 0.346
3.215TyrAsp: 3.215 ± 0.92
3.215TyrGlu: 3.215 ± 0.764
2.03TyrPhe: 2.03 ± 0.486
3.553TyrGly: 3.553 ± 0.565
1.354TyrHis: 1.354 ± 0.437
2.03TyrIle: 2.03 ± 0.568
4.569TyrLys: 4.569 ± 0.862
4.23TyrLeu: 4.23 ± 0.683
1.354TyrMet: 1.354 ± 0.35
4.23TyrAsn: 4.23 ± 0.829
1.692TyrPro: 1.692 ± 0.437
2.707TyrGln: 2.707 ± 0.461
1.184TyrArg: 1.184 ± 0.333
2.707TyrSer: 2.707 ± 0.482
4.569TyrThr: 4.569 ± 0.696
2.369TyrVal: 2.369 ± 0.49
0.846TyrTrp: 0.846 ± 0.35
3.046TyrTyr: 3.046 ± 0.759
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 30 proteins (5911 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski