Amino acid dipeptide frequency for Enterobacteria phage NC41

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
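The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's own implementation: the function names, the use of one-letter codes (the table uses three-letter codes), and the toy sequences are assumptions made for the example. It also normalizes by the number of dipeptides counted within proteins, which may differ from the normalization used for the table.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard letter to X; pairs never span two proteins.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (sequences shorter than 2?)")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def label(value_per_mille):
    """Classify a frequency relative to the uniform expectation."""
    if value_per_mille > EXPECTED_PER_MILLE:
        return "over"    # shown in red in the table
    if value_per_mille < EXPECTED_PER_MILLE:
        return "under"   # shown in blue in the table
    return "expected"


# Hypothetical toy input, not taken from the phage proteome.
toy_freqs = dipeptide_per_mille(["MKLLA", "AKLZ"])
```

With the AlaAla value from the table, `label(8.169)` gives `"over"`, while `label(0.43)` (for example AlaTrp) gives `"under"`.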

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
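As a small worked example of this unit conversion, the snippet below turns the AlaAla value from the table into a fraction and compares it with the uniform expectation of 1/400. The helper name is an assumption made for illustration only.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


ala_ala = per_mille_to_fraction(8.169)  # AlaAla value taken from the table
uniform = 1 / 400                       # 0.0025, i.e. 0.25% or 2.5 per mille
ratio = ala_ala / uniform               # how many times the uniform expectation
```

Here `ala_ala` is about 0.008169 and `ratio` is about 3.27, so AlaAla occurs roughly three times more often than a uniform distribution over 400 dipeptides would predict.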

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.169AlaAla: 8.169 ± 1.786
2.58AlaCys: 2.58 ± 1.147
6.449AlaAsp: 6.449 ± 1.71
6.449AlaGlu: 6.449 ± 1.91
3.869AlaPhe: 3.869 ± 0.717
8.598AlaGly: 8.598 ± 3.58
1.29AlaHis: 1.29 ± 0.577
1.72AlaIle: 1.72 ± 0.801
6.019AlaLys: 6.019 ± 2.0
4.729AlaLeu: 4.729 ± 1.153
1.72AlaMet: 1.72 ± 0.748
3.009AlaAsn: 3.009 ± 0.832
2.58AlaPro: 2.58 ± 1.126
3.869AlaGln: 3.869 ± 1.165
2.58AlaArg: 2.58 ± 1.361
8.169AlaSer: 8.169 ± 1.471
6.879AlaThr: 6.879 ± 1.341
6.449AlaVal: 6.449 ± 1.42
0.43AlaTrp: 0.43 ± 0.407
2.15AlaTyr: 2.15 ± 0.858
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.43CysCys: 0.43 ± 0.377
0.86CysAsp: 0.86 ± 0.783
0.0CysGlu: 0.0 ± 0.0
0.43CysPhe: 0.43 ± 0.377
0.43CysGly: 0.43 ± 0.301
0.86CysHis: 0.86 ± 0.395
0.43CysIle: 0.43 ± 0.447
0.0CysLys: 0.0 ± 0.0
3.009CysLeu: 3.009 ± 1.309
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.86CysArg: 0.86 ± 0.479
1.29CysSer: 1.29 ± 0.665
0.0CysThr: 0.0 ± 0.0
2.58CysVal: 2.58 ± 1.209
0.0CysTrp: 0.0 ± 0.0
1.29CysTyr: 1.29 ± 0.714
0.0CysXaa: 0.0 ± 0.0
Asp
8.169AspAla: 8.169 ± 0.989
1.29AspCys: 1.29 ± 0.628
2.15AspAsp: 2.15 ± 0.521
4.299AspGlu: 4.299 ± 1.536
1.72AspPhe: 1.72 ± 1.404
4.299AspGly: 4.299 ± 0.805
0.86AspHis: 0.86 ± 0.395
6.019AspIle: 6.019 ± 0.724
3.869AspLys: 3.869 ± 1.65
3.869AspLeu: 3.869 ± 0.91
1.29AspMet: 1.29 ± 0.714
3.009AspAsn: 3.009 ± 0.853
1.72AspPro: 1.72 ± 0.496
2.15AspGln: 2.15 ± 1.35
1.72AspArg: 1.72 ± 0.519
3.439AspSer: 3.439 ± 1.077
3.009AspThr: 3.009 ± 0.671
4.299AspVal: 4.299 ± 0.813
0.86AspTrp: 0.86 ± 0.629
2.58AspTyr: 2.58 ± 0.639
0.0AspXaa: 0.0 ± 0.0
Glu
4.729GluAla: 4.729 ± 1.316
1.72GluCys: 1.72 ± 0.972
0.86GluAsp: 0.86 ± 0.545
1.29GluGlu: 1.29 ± 0.635
2.15GluPhe: 2.15 ± 1.07
3.439GluGly: 3.439 ± 0.816
0.43GluHis: 0.43 ± 0.377
3.439GluIle: 3.439 ± 1.27
2.15GluLys: 2.15 ± 1.409
3.439GluLeu: 3.439 ± 1.539
3.009GluMet: 3.009 ± 1.011
3.009GluAsn: 3.009 ± 1.288
1.72GluPro: 1.72 ± 0.486
1.29GluGln: 1.29 ± 0.637
3.869GluArg: 3.869 ± 1.025
2.58GluSer: 2.58 ± 0.831
1.29GluThr: 1.29 ± 0.645
0.86GluVal: 0.86 ± 0.594
0.86GluTrp: 0.86 ± 0.505
1.29GluTyr: 1.29 ± 0.714
0.0GluXaa: 0.0 ± 0.0
Phe
1.29PheAla: 1.29 ± 0.77
1.29PheCys: 1.29 ± 0.714
3.869PheAsp: 3.869 ± 1.276
1.72PheGlu: 1.72 ± 0.485
1.72PhePhe: 1.72 ± 1.289
3.869PheGly: 3.869 ± 0.957
2.15PheHis: 2.15 ± 0.687
3.439PheIle: 3.439 ± 0.943
1.29PheLys: 1.29 ± 0.855
3.009PheLeu: 3.009 ± 1.182
3.869PheMet: 3.869 ± 0.875
1.29PheAsn: 1.29 ± 0.447
1.72PhePro: 1.72 ± 0.851
1.72PheGln: 1.72 ± 0.845
3.869PheArg: 3.869 ± 1.145
1.72PheSer: 1.72 ± 0.496
3.439PheThr: 3.439 ± 0.838
2.58PheVal: 2.58 ± 0.849
0.86PheTrp: 0.86 ± 0.537
2.58PheTyr: 2.58 ± 0.835
0.0PheXaa: 0.0 ± 0.0
Gly
6.449GlyAla: 6.449 ± 2.107
0.0GlyCys: 0.0 ± 0.0
3.439GlyAsp: 3.439 ± 1.501
1.29GlyGlu: 1.29 ± 0.447
4.729GlyPhe: 4.729 ± 0.878
5.589GlyGly: 5.589 ± 2.307
0.43GlyHis: 0.43 ± 0.301
4.299GlyIle: 4.299 ± 2.239
6.019GlyLys: 6.019 ± 1.533
4.299GlyLeu: 4.299 ± 0.928
2.15GlyMet: 2.15 ± 0.802
1.72GlyAsn: 1.72 ± 0.998
0.0GlyPro: 0.0 ± 0.0
2.58GlyGln: 2.58 ± 1.47
5.589GlyArg: 5.589 ± 1.594
2.58GlySer: 2.58 ± 1.479
4.299GlyThr: 4.299 ± 1.093
3.009GlyVal: 3.009 ± 0.817
1.72GlyTrp: 1.72 ± 1.01
4.299GlyTyr: 4.299 ± 0.817
0.0GlyXaa: 0.0 ± 0.0
His
3.439HisAla: 3.439 ± 1.533
0.43HisCys: 0.43 ± 0.447
1.72HisAsp: 1.72 ± 0.855
0.0HisGlu: 0.0 ± 0.0
2.15HisPhe: 2.15 ± 1.046
1.72HisGly: 1.72 ± 0.769
0.43HisHis: 0.43 ± 0.377
0.43HisIle: 0.43 ± 0.407
0.86HisLys: 0.86 ± 0.603
1.72HisLeu: 1.72 ± 1.509
0.0HisMet: 0.0 ± 0.0
0.86HisAsn: 0.86 ± 0.656
0.43HisPro: 0.43 ± 0.47
0.86HisGln: 0.86 ± 0.525
0.86HisArg: 0.86 ± 0.395
0.43HisSer: 0.43 ± 0.377
0.43HisThr: 0.43 ± 0.377
0.43HisVal: 0.43 ± 0.377
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.449IleAla: 6.449 ± 1.422
0.43IleCys: 0.43 ± 0.447
2.58IleAsp: 2.58 ± 0.997
1.72IleGlu: 1.72 ± 0.977
0.86IlePhe: 0.86 ± 0.537
3.439IleGly: 3.439 ± 0.815
0.0IleHis: 0.0 ± 0.0
1.29IleIle: 1.29 ± 0.975
5.159IleLys: 5.159 ± 0.973
3.009IleLeu: 3.009 ± 1.532
2.58IleMet: 2.58 ± 1.099
1.72IleAsn: 1.72 ± 0.602
0.86IlePro: 0.86 ± 0.576
5.589IleGln: 5.589 ± 0.863
2.15IleArg: 2.15 ± 0.797
3.439IleSer: 3.439 ± 0.761
1.29IleThr: 1.29 ± 0.447
0.86IleVal: 0.86 ± 0.505
0.43IleTrp: 0.43 ± 0.377
0.86IleTyr: 0.86 ± 0.534
0.0IleXaa: 0.0 ± 0.0
Lys
6.449LysAla: 6.449 ± 1.259
0.43LysCys: 0.43 ± 0.447
4.299LysAsp: 4.299 ± 1.458
4.729LysGlu: 4.729 ± 1.734
3.009LysPhe: 3.009 ± 0.893
4.729LysGly: 4.729 ± 1.564
0.86LysHis: 0.86 ± 0.754
1.72LysIle: 1.72 ± 1.497
4.299LysLys: 4.299 ± 1.225
7.739LysLeu: 7.739 ± 2.21
3.439LysMet: 3.439 ± 1.319
3.009LysAsn: 3.009 ± 1.081
0.43LysPro: 0.43 ± 0.301
2.15LysGln: 2.15 ± 0.791
3.009LysArg: 3.009 ± 1.419
4.729LysSer: 4.729 ± 1.614
2.58LysThr: 2.58 ± 0.759
1.72LysVal: 1.72 ± 0.565
1.72LysTrp: 1.72 ± 0.758
0.86LysTyr: 0.86 ± 0.505
0.0LysXaa: 0.0 ± 0.0
Leu
9.458LeuAla: 9.458 ± 1.361
0.43LeuCys: 0.43 ± 0.641
3.869LeuAsp: 3.869 ± 1.087
3.439LeuGlu: 3.439 ± 1.086
2.58LeuPhe: 2.58 ± 0.472
5.159LeuGly: 5.159 ± 1.243
1.72LeuHis: 1.72 ± 0.496
3.869LeuIle: 3.869 ± 1.305
5.589LeuLys: 5.589 ± 2.132
11.178LeuLeu: 11.178 ± 5.066
2.15LeuMet: 2.15 ± 0.835
4.729LeuAsn: 4.729 ± 1.12
5.589LeuPro: 5.589 ± 1.381
4.299LeuGln: 4.299 ± 0.881
6.879LeuArg: 6.879 ± 1.649
8.169LeuSer: 8.169 ± 1.602
8.169LeuThr: 8.169 ± 1.783
3.869LeuVal: 3.869 ± 1.574
1.72LeuTrp: 1.72 ± 0.725
0.43LeuTyr: 0.43 ± 0.377
0.0LeuXaa: 0.0 ± 0.0
Met
1.72MetAla: 1.72 ± 0.556
0.0MetCys: 0.0 ± 0.0
2.58MetAsp: 2.58 ± 0.811
2.15MetGlu: 2.15 ± 0.605
1.72MetPhe: 1.72 ± 0.725
0.86MetGly: 0.86 ± 0.537
0.0MetHis: 0.0 ± 0.0
0.86MetIle: 0.86 ± 0.505
2.58MetLys: 2.58 ± 0.894
3.009MetLeu: 3.009 ± 0.894
0.43MetMet: 0.43 ± 0.449
0.86MetAsn: 0.86 ± 0.505
1.72MetPro: 1.72 ± 0.655
1.72MetGln: 1.72 ± 0.801
4.729MetArg: 4.729 ± 1.679
3.439MetSer: 3.439 ± 0.882
3.009MetThr: 3.009 ± 0.754
2.58MetVal: 2.58 ± 0.726
0.0MetTrp: 0.0 ± 0.0
0.43MetTyr: 0.43 ± 0.301
0.0MetXaa: 0.0 ± 0.0
Asn
4.729AsnAla: 4.729 ± 0.942
0.43AsnCys: 0.43 ± 0.465
2.58AsnAsp: 2.58 ± 1.019
3.009AsnGlu: 3.009 ± 1.153
3.869AsnPhe: 3.869 ± 0.932
2.15AsnGly: 2.15 ± 0.854
0.0AsnHis: 0.0 ± 0.0
1.72AsnIle: 1.72 ± 1.089
1.72AsnLys: 1.72 ± 0.519
4.729AsnLeu: 4.729 ± 1.128
0.86AsnMet: 0.86 ± 0.721
3.009AsnAsn: 3.009 ± 0.634
2.58AsnPro: 2.58 ± 0.612
3.869AsnGln: 3.869 ± 1.532
2.15AsnArg: 2.15 ± 0.797
2.58AsnSer: 2.58 ± 0.868
3.869AsnThr: 3.869 ± 0.948
3.439AsnVal: 3.439 ± 1.049
0.0AsnTrp: 0.0 ± 0.0
2.15AsnTyr: 2.15 ± 0.655
0.0AsnXaa: 0.0 ± 0.0
Pro
3.009ProAla: 3.009 ± 0.934
0.43ProCys: 0.43 ± 0.465
1.72ProAsp: 1.72 ± 0.895
2.58ProGlu: 2.58 ± 0.639
1.29ProPhe: 1.29 ± 0.447
0.86ProGly: 0.86 ± 0.697
0.86ProHis: 0.86 ± 0.754
1.29ProIle: 1.29 ± 0.733
2.58ProLys: 2.58 ± 0.863
5.159ProLeu: 5.159 ± 1.335
0.43ProMet: 0.43 ± 0.301
3.009ProAsn: 3.009 ± 0.822
1.72ProPro: 1.72 ± 1.509
1.29ProGln: 1.29 ± 0.722
1.72ProArg: 1.72 ± 0.845
3.439ProSer: 3.439 ± 1.152
2.58ProThr: 2.58 ± 1.294
3.869ProVal: 3.869 ± 1.47
0.43ProTrp: 0.43 ± 0.377
0.86ProTyr: 0.86 ± 0.505
0.0ProXaa: 0.0 ± 0.0
Gln
3.439GlnAla: 3.439 ± 0.897
0.86GlnCys: 0.86 ± 0.395
0.86GlnAsp: 0.86 ± 0.505
2.58GlnGlu: 2.58 ± 1.41
1.29GlnPhe: 1.29 ± 0.646
1.72GlnGly: 1.72 ± 0.801
0.0GlnHis: 0.0 ± 0.0
2.15GlnIle: 2.15 ± 0.455
4.299GlnLys: 4.299 ± 1.873
4.729GlnLeu: 4.729 ± 1.116
1.72GlnMet: 1.72 ± 0.485
4.299GlnAsn: 4.299 ± 1.605
3.009GlnPro: 3.009 ± 1.194
2.58GlnGln: 2.58 ± 1.728
1.72GlnArg: 1.72 ± 0.409
2.58GlnSer: 2.58 ± 0.5
3.869GlnThr: 3.869 ± 1.52
2.58GlnVal: 2.58 ± 1.151
1.29GlnTrp: 1.29 ± 1.132
1.72GlnTyr: 1.72 ± 0.409
0.0GlnXaa: 0.0 ± 0.0
Arg
2.58ArgAla: 2.58 ± 1.342
0.86ArgCys: 0.86 ± 0.545
6.019ArgAsp: 6.019 ± 1.347
1.29ArgGlu: 1.29 ± 0.447
3.439ArgPhe: 3.439 ± 1.622
2.58ArgGly: 2.58 ± 0.652
2.15ArgHis: 2.15 ± 1.004
3.009ArgIle: 3.009 ± 1.39
4.299ArgLys: 4.299 ± 1.087
6.879ArgLeu: 6.879 ± 2.276
3.009ArgMet: 3.009 ± 1.036
2.15ArgAsn: 2.15 ± 0.876
3.869ArgPro: 3.869 ± 1.64
3.439ArgGln: 3.439 ± 0.737
4.299ArgArg: 4.299 ± 1.3
4.299ArgSer: 4.299 ± 1.155
1.72ArgThr: 1.72 ± 0.496
2.58ArgVal: 2.58 ± 0.566
0.43ArgTrp: 0.43 ± 0.465
3.009ArgTyr: 3.009 ± 0.935
0.0ArgXaa: 0.0 ± 0.0
Ser
4.729SerAla: 4.729 ± 2.826
0.0SerCys: 0.0 ± 0.0
4.299SerAsp: 4.299 ± 1.509
1.29SerGlu: 1.29 ± 0.506
2.15SerPhe: 2.15 ± 0.727
6.019SerGly: 6.019 ± 1.629
2.15SerHis: 2.15 ± 0.469
2.58SerIle: 2.58 ± 0.566
3.869SerLys: 3.869 ± 0.802
6.449SerLeu: 6.449 ± 2.243
3.439SerMet: 3.439 ± 1.318
4.299SerAsn: 4.299 ± 1.103
1.72SerPro: 1.72 ± 0.948
1.72SerGln: 1.72 ± 0.589
6.879SerArg: 6.879 ± 1.066
3.439SerSer: 3.439 ± 1.476
3.869SerThr: 3.869 ± 1.14
5.589SerVal: 5.589 ± 1.875
0.43SerTrp: 0.43 ± 0.465
3.009SerTyr: 3.009 ± 0.758
0.0SerXaa: 0.0 ± 0.0
Thr
6.449ThrAla: 6.449 ± 1.081
0.43ThrCys: 0.43 ± 0.43
3.869ThrAsp: 3.869 ± 1.719
2.58ThrGlu: 2.58 ± 1.409
3.009ThrPhe: 3.009 ± 1.554
0.86ThrGly: 0.86 ± 0.754
0.0ThrHis: 0.0 ± 0.0
3.439ThrIle: 3.439 ± 0.804
4.299ThrLys: 4.299 ± 0.811
8.169ThrLeu: 8.169 ± 2.386
1.72ThrMet: 1.72 ± 0.614
3.439ThrAsn: 3.439 ± 0.855
3.009ThrPro: 3.009 ± 1.22
3.869ThrGln: 3.869 ± 1.208
1.29ThrArg: 1.29 ± 0.874
6.019ThrSer: 6.019 ± 1.638
4.299ThrThr: 4.299 ± 1.489
3.439ThrVal: 3.439 ± 0.761
1.29ThrTrp: 1.29 ± 0.492
1.29ThrTyr: 1.29 ± 0.819
0.0ThrXaa: 0.0 ± 0.0
Val
3.439ValAla: 3.439 ± 0.99
0.0ValCys: 0.0 ± 0.0
4.729ValAsp: 4.729 ± 1.024
1.72ValGlu: 1.72 ± 1.404
1.29ValPhe: 1.29 ± 0.447
4.299ValGly: 4.299 ± 1.391
2.58ValHis: 2.58 ± 0.737
1.72ValIle: 1.72 ± 0.845
2.15ValLys: 2.15 ± 1.035
4.729ValLeu: 4.729 ± 1.175
0.86ValMet: 0.86 ± 0.545
3.009ValAsn: 3.009 ± 0.659
3.009ValPro: 3.009 ± 0.829
2.58ValGln: 2.58 ± 1.407
5.589ValArg: 5.589 ± 1.821
4.299ValSer: 4.299 ± 1.405
4.729ValThr: 4.729 ± 1.121
1.29ValVal: 1.29 ± 0.879
0.86ValTrp: 0.86 ± 0.681
3.439ValTyr: 3.439 ± 1.377
0.0ValXaa: 0.0 ± 0.0
Trp
0.43TrpAla: 0.43 ± 0.377
0.0TrpCys: 0.0 ± 0.0
0.43TrpAsp: 0.43 ± 0.465
0.43TrpGlu: 0.43 ± 0.407
1.72TrpPhe: 1.72 ± 0.758
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.86TrpIle: 0.86 ± 0.545
0.86TrpLys: 0.86 ± 0.566
1.29TrpLeu: 1.29 ± 0.602
0.43TrpMet: 0.43 ± 0.377
1.72TrpAsn: 1.72 ± 0.409
1.72TrpPro: 1.72 ± 1.01
0.0TrpGln: 0.0 ± 0.0
0.43TrpArg: 0.43 ± 0.43
0.43TrpSer: 0.43 ± 0.447
1.72TrpThr: 1.72 ± 0.55
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.86TrpTyr: 0.86 ± 0.656
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.58TyrAla: 2.58 ± 1.228
0.43TyrCys: 0.43 ± 0.43
3.439TyrAsp: 3.439 ± 0.957
0.86TyrGlu: 0.86 ± 0.76
4.299TyrPhe: 4.299 ± 0.8
3.869TyrGly: 3.869 ± 0.811
0.43TyrHis: 0.43 ± 0.377
0.43TyrIle: 0.43 ± 0.377
0.43TyrLys: 0.43 ± 0.377
2.15TyrLeu: 2.15 ± 0.614
0.86TyrMet: 0.86 ± 0.505
1.29TyrAsn: 1.29 ± 0.593
1.72TyrPro: 1.72 ± 0.795
1.72TyrGln: 1.72 ± 0.568
1.72TyrArg: 1.72 ± 1.158
0.86TyrSer: 0.86 ± 0.505
1.72TyrThr: 1.72 ± 0.496
4.299TyrVal: 4.299 ± 0.98
0.0TyrTrp: 0.0 ± 0.0
1.29TyrTyr: 1.29 ± 0.676
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (2327 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level
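A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: the function names are hypothetical, sequences use one-letter codes, and the spread is reported as the standard deviation over replicates, which may not match the exact statistic used for the table.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteins, not taken from the phage proteome.
uniform_err = bootstrap_error(["MKLLA", "AKLLG", "MLLKA"], "LL")
varied_err = bootstrap_error(["LLLL", "AAAA"], "LL")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimated error reflects how much the frequency depends on which proteins happen to be in the set. In the first toy case every protein contributes the same LL frequency, so the estimated error is zero. In the second case the frequency changes with the proteins drawn, so the error is positive.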

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski