Amino acid dipepetide frequency for Streptococcus satellite phage Javan121

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.116AlaAla: 1.116 ± 0.428
0.893AlaCys: 0.893 ± 0.405
3.572AlaAsp: 3.572 ± 0.804
4.242AlaGlu: 4.242 ± 1.206
3.126AlaPhe: 3.126 ± 0.685
3.795AlaGly: 3.795 ± 1.156
0.67AlaHis: 0.67 ± 0.444
6.028AlaIle: 6.028 ± 1.306
4.912AlaLys: 4.912 ± 0.909
5.805AlaLeu: 5.805 ± 1.304
0.893AlaMet: 0.893 ± 0.443
3.126AlaAsn: 3.126 ± 0.633
1.786AlaPro: 1.786 ± 0.555
2.009AlaGln: 2.009 ± 0.536
1.786AlaArg: 1.786 ± 0.579
4.242AlaSer: 4.242 ± 1.022
3.126AlaThr: 3.126 ± 0.631
2.902AlaVal: 2.902 ± 0.801
0.67AlaTrp: 0.67 ± 0.542
1.116AlaTyr: 1.116 ± 0.477
0.0AlaXaa: 0.0 ± 0.0
Cys
0.223CysAla: 0.223 ± 0.211
0.223CysCys: 0.223 ± 0.255
0.447CysAsp: 0.447 ± 0.327
0.223CysGlu: 0.223 ± 0.255
0.0CysPhe: 0.0 ± 0.0
0.223CysGly: 0.223 ± 0.255
0.223CysHis: 0.223 ± 0.229
0.223CysIle: 0.223 ± 0.223
0.223CysLys: 0.223 ± 0.207
0.67CysLeu: 0.67 ± 0.353
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.223CysPro: 0.223 ± 0.255
0.447CysGln: 0.447 ± 0.32
0.447CysArg: 0.447 ± 0.283
0.67CysSer: 0.67 ± 0.385
0.0CysThr: 0.0 ± 0.0
0.223CysVal: 0.223 ± 0.223
0.0CysTrp: 0.0 ± 0.0
0.447CysTyr: 0.447 ± 0.327
0.0CysXaa: 0.0 ± 0.0
Asp
1.116AspAla: 1.116 ± 0.399
0.67AspCys: 0.67 ± 0.537
3.349AspAsp: 3.349 ± 0.9
4.465AspGlu: 4.465 ± 0.982
2.233AspPhe: 2.233 ± 0.473
2.902AspGly: 2.902 ± 0.757
0.893AspHis: 0.893 ± 0.341
6.028AspIle: 6.028 ± 0.967
4.465AspLys: 4.465 ± 1.242
4.465AspLeu: 4.465 ± 0.982
1.563AspMet: 1.563 ± 0.582
2.679AspAsn: 2.679 ± 0.645
0.447AspPro: 0.447 ± 0.318
1.786AspGln: 1.786 ± 0.697
4.242AspArg: 4.242 ± 0.897
2.009AspSer: 2.009 ± 0.668
3.795AspThr: 3.795 ± 0.896
0.67AspVal: 0.67 ± 0.35
0.67AspTrp: 0.67 ± 0.492
3.349AspTyr: 3.349 ± 1.064
0.0AspXaa: 0.0 ± 0.0
Glu
5.805GluAla: 5.805 ± 1.146
0.893GluCys: 0.893 ± 0.384
3.349GluAsp: 3.349 ± 0.97
7.144GluGlu: 7.144 ± 1.669
1.786GluPhe: 1.786 ± 0.596
2.902GluGly: 2.902 ± 0.796
1.563GluHis: 1.563 ± 0.512
6.921GluIle: 6.921 ± 1.174
5.805GluLys: 5.805 ± 1.133
10.047GluLeu: 10.047 ± 1.219
1.786GluMet: 1.786 ± 0.625
4.019GluAsn: 4.019 ± 1.029
1.563GluPro: 1.563 ± 0.537
5.135GluGln: 5.135 ± 1.277
4.242GluArg: 4.242 ± 0.971
2.233GluSer: 2.233 ± 0.754
5.135GluThr: 5.135 ± 1.383
3.795GluVal: 3.795 ± 0.844
1.563GluTrp: 1.563 ± 0.609
2.902GluTyr: 2.902 ± 0.865
0.0GluXaa: 0.0 ± 0.0
Phe
1.34PheAla: 1.34 ± 0.485
0.0PheCys: 0.0 ± 0.0
2.456PheAsp: 2.456 ± 0.636
2.902PheGlu: 2.902 ± 0.712
2.233PhePhe: 2.233 ± 0.636
2.456PheGly: 2.456 ± 0.792
2.009PheHis: 2.009 ± 0.459
4.019PheIle: 4.019 ± 0.978
4.465PheLys: 4.465 ± 0.77
3.572PheLeu: 3.572 ± 0.812
0.447PheMet: 0.447 ± 0.324
2.902PheAsn: 2.902 ± 0.835
1.116PhePro: 1.116 ± 0.457
0.447PheGln: 0.447 ± 0.295
2.009PheArg: 2.009 ± 0.638
2.902PheSer: 2.902 ± 0.589
2.233PheThr: 2.233 ± 0.541
1.786PheVal: 1.786 ± 0.537
0.223PheTrp: 0.223 ± 0.21
2.233PheTyr: 2.233 ± 0.705
0.0PheXaa: 0.0 ± 0.0
Gly
3.126GlyAla: 3.126 ± 0.878
0.223GlyCys: 0.223 ± 0.207
3.572GlyAsp: 3.572 ± 0.998
2.902GlyGlu: 2.902 ± 0.619
2.902GlyPhe: 2.902 ± 0.666
3.349GlyGly: 3.349 ± 1.212
1.34GlyHis: 1.34 ± 0.536
3.795GlyIle: 3.795 ± 1.063
3.349GlyLys: 3.349 ± 0.869
6.698GlyLeu: 6.698 ± 1.535
1.116GlyMet: 1.116 ± 0.406
2.009GlyAsn: 2.009 ± 0.606
0.0GlyPro: 0.0 ± 0.0
2.456GlyGln: 2.456 ± 0.905
2.679GlyArg: 2.679 ± 1.041
2.456GlySer: 2.456 ± 0.912
3.349GlyThr: 3.349 ± 0.86
3.349GlyVal: 3.349 ± 0.91
0.223GlyTrp: 0.223 ± 0.21
3.349GlyTyr: 3.349 ± 1.091
0.0GlyXaa: 0.0 ± 0.0
His
1.563HisAla: 1.563 ± 0.79
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.893HisGlu: 0.893 ± 0.392
0.0HisPhe: 0.0 ± 0.0
0.893HisGly: 0.893 ± 0.439
0.67HisHis: 0.67 ± 0.345
1.34HisIle: 1.34 ± 0.64
1.563HisLys: 1.563 ± 0.736
1.563HisLeu: 1.563 ± 0.613
0.223HisMet: 0.223 ± 0.216
2.009HisAsn: 2.009 ± 0.863
1.34HisPro: 1.34 ± 0.478
1.116HisGln: 1.116 ± 0.534
0.893HisArg: 0.893 ± 0.494
1.34HisSer: 1.34 ± 0.522
2.233HisThr: 2.233 ± 0.7
0.67HisVal: 0.67 ± 0.338
0.447HisTrp: 0.447 ± 0.32
1.116HisTyr: 1.116 ± 0.408
0.0HisXaa: 0.0 ± 0.0
Ile
5.135IleAla: 5.135 ± 1.206
0.223IleCys: 0.223 ± 0.229
5.805IleAsp: 5.805 ± 1.204
5.358IleGlu: 5.358 ± 1.254
2.456IlePhe: 2.456 ± 0.628
2.456IleGly: 2.456 ± 0.626
0.67IleHis: 0.67 ± 0.338
6.028IleIle: 6.028 ± 1.28
8.707IleLys: 8.707 ± 1.427
4.912IleLeu: 4.912 ± 0.795
1.116IleMet: 1.116 ± 0.478
3.126IleAsn: 3.126 ± 1.068
3.795IlePro: 3.795 ± 0.761
2.009IleGln: 2.009 ± 0.772
3.126IleArg: 3.126 ± 0.74
5.582IleSer: 5.582 ± 1.743
6.028IleThr: 6.028 ± 1.062
3.572IleVal: 3.572 ± 0.779
0.447IleTrp: 0.447 ± 0.31
3.572IleTyr: 3.572 ± 1.012
0.0IleXaa: 0.0 ± 0.0
Lys
6.698LysAla: 6.698 ± 1.405
0.223LysCys: 0.223 ± 0.211
4.019LysAsp: 4.019 ± 1.129
10.493LysGlu: 10.493 ± 1.363
1.563LysPhe: 1.563 ± 0.52
3.572LysGly: 3.572 ± 1.53
2.456LysHis: 2.456 ± 0.755
3.349LysIle: 3.349 ± 0.981
7.814LysLys: 7.814 ± 1.721
7.591LysLeu: 7.591 ± 1.198
1.786LysMet: 1.786 ± 0.703
5.358LysAsn: 5.358 ± 1.646
5.805LysPro: 5.805 ± 0.972
6.475LysGln: 6.475 ± 1.604
4.912LysArg: 4.912 ± 1.186
4.242LysSer: 4.242 ± 0.958
5.358LysThr: 5.358 ± 1.128
6.028LysVal: 6.028 ± 1.128
0.67LysTrp: 0.67 ± 0.357
2.679LysTyr: 2.679 ± 0.812
0.0LysXaa: 0.0 ± 0.0
Leu
7.368LeuAla: 7.368 ± 1.047
0.223LeuCys: 0.223 ± 0.255
4.689LeuAsp: 4.689 ± 1.065
9.6LeuGlu: 9.6 ± 1.416
3.795LeuPhe: 3.795 ± 0.95
7.368LeuGly: 7.368 ± 1.074
1.34LeuHis: 1.34 ± 0.61
7.368LeuIle: 7.368 ± 1.327
8.484LeuLys: 8.484 ± 1.052
10.94LeuLeu: 10.94 ± 1.645
1.786LeuMet: 1.786 ± 0.545
3.795LeuAsn: 3.795 ± 0.796
4.912LeuPro: 4.912 ± 0.985
2.902LeuGln: 2.902 ± 0.755
2.009LeuArg: 2.009 ± 0.602
7.144LeuSer: 7.144 ± 1.794
5.135LeuThr: 5.135 ± 0.955
4.465LeuVal: 4.465 ± 1.123
0.893LeuTrp: 0.893 ± 0.49
4.242LeuTyr: 4.242 ± 0.782
0.0LeuXaa: 0.0 ± 0.0
Met
1.563MetAla: 1.563 ± 0.62
0.0MetCys: 0.0 ± 0.0
1.34MetAsp: 1.34 ± 0.589
1.116MetGlu: 1.116 ± 0.446
0.67MetPhe: 0.67 ± 0.379
0.223MetGly: 0.223 ± 0.214
0.223MetHis: 0.223 ± 0.204
0.67MetIle: 0.67 ± 0.384
2.456MetLys: 2.456 ± 0.643
2.902MetLeu: 2.902 ± 0.951
0.223MetMet: 0.223 ± 0.239
1.786MetAsn: 1.786 ± 0.615
0.223MetPro: 0.223 ± 0.21
0.223MetGln: 0.223 ± 0.239
1.116MetArg: 1.116 ± 0.39
1.563MetSer: 1.563 ± 0.612
3.349MetThr: 3.349 ± 0.757
0.67MetVal: 0.67 ± 0.352
0.0MetTrp: 0.0 ± 0.0
0.223MetTyr: 0.223 ± 0.223
0.0MetXaa: 0.0 ± 0.0
Asn
2.679AsnAla: 2.679 ± 0.635
0.0AsnCys: 0.0 ± 0.0
1.563AsnAsp: 1.563 ± 0.547
3.126AsnGlu: 3.126 ± 0.791
1.563AsnPhe: 1.563 ± 0.468
6.028AsnGly: 6.028 ± 1.007
1.34AsnHis: 1.34 ± 0.478
3.349AsnIle: 3.349 ± 1.078
4.689AsnLys: 4.689 ± 0.934
4.689AsnLeu: 4.689 ± 1.182
1.34AsnMet: 1.34 ± 0.383
3.349AsnAsn: 3.349 ± 0.867
1.786AsnPro: 1.786 ± 0.578
2.902AsnGln: 2.902 ± 0.899
4.912AsnArg: 4.912 ± 0.854
1.786AsnSer: 1.786 ± 0.517
2.679AsnThr: 2.679 ± 1.011
2.456AsnVal: 2.456 ± 0.652
0.447AsnTrp: 0.447 ± 0.381
2.902AsnTyr: 2.902 ± 0.626
0.0AsnXaa: 0.0 ± 0.0
Pro
1.34ProAla: 1.34 ± 0.419
0.223ProCys: 0.223 ± 0.25
2.009ProAsp: 2.009 ± 0.588
4.019ProGlu: 4.019 ± 0.834
1.563ProPhe: 1.563 ± 0.603
0.447ProGly: 0.447 ± 0.347
0.67ProHis: 0.67 ± 0.379
1.786ProIle: 1.786 ± 0.635
5.358ProLys: 5.358 ± 1.251
2.233ProLeu: 2.233 ± 0.623
0.223ProMet: 0.223 ± 0.255
1.563ProAsn: 1.563 ± 0.567
1.116ProPro: 1.116 ± 0.445
2.009ProGln: 2.009 ± 1.383
1.786ProArg: 1.786 ± 0.549
1.786ProSer: 1.786 ± 0.584
2.233ProThr: 2.233 ± 0.598
2.009ProVal: 2.009 ± 0.527
0.223ProTrp: 0.223 ± 0.2
1.563ProTyr: 1.563 ± 0.484
0.0ProXaa: 0.0 ± 0.0
Gln
2.679GlnAla: 2.679 ± 0.753
0.0GlnCys: 0.0 ± 0.0
2.233GlnAsp: 2.233 ± 0.552
3.795GlnGlu: 3.795 ± 0.719
2.233GlnPhe: 2.233 ± 0.999
2.009GlnGly: 2.009 ± 0.561
1.116GlnHis: 1.116 ± 0.525
3.126GlnIle: 3.126 ± 0.763
6.921GlnLys: 6.921 ± 1.482
4.912GlnLeu: 4.912 ± 0.969
0.893GlnMet: 0.893 ± 0.472
2.456GlnAsn: 2.456 ± 0.953
0.893GlnPro: 0.893 ± 0.565
3.795GlnGln: 3.795 ± 1.144
3.126GlnArg: 3.126 ± 0.785
2.233GlnSer: 2.233 ± 0.733
1.116GlnThr: 1.116 ± 0.499
3.572GlnVal: 3.572 ± 1.154
0.447GlnTrp: 0.447 ± 0.348
1.116GlnTyr: 1.116 ± 0.645
0.0GlnXaa: 0.0 ± 0.0
Arg
2.233ArgAla: 2.233 ± 0.702
0.893ArgCys: 0.893 ± 0.372
2.233ArgAsp: 2.233 ± 0.676
3.126ArgGlu: 3.126 ± 0.774
2.679ArgPhe: 2.679 ± 0.941
2.679ArgGly: 2.679 ± 0.784
0.893ArgHis: 0.893 ± 0.399
2.456ArgIle: 2.456 ± 0.704
4.689ArgLys: 4.689 ± 0.928
5.358ArgLeu: 5.358 ± 0.909
1.786ArgMet: 1.786 ± 0.561
2.902ArgAsn: 2.902 ± 0.798
1.116ArgPro: 1.116 ± 0.542
3.572ArgGln: 3.572 ± 1.0
2.902ArgArg: 2.902 ± 0.826
1.563ArgSer: 1.563 ± 0.737
3.126ArgThr: 3.126 ± 0.691
3.349ArgVal: 3.349 ± 0.74
0.447ArgTrp: 0.447 ± 0.303
2.456ArgTyr: 2.456 ± 0.842
0.0ArgXaa: 0.0 ± 0.0
Ser
3.572SerAla: 3.572 ± 0.944
0.223SerCys: 0.223 ± 0.255
3.795SerAsp: 3.795 ± 0.852
4.465SerGlu: 4.465 ± 0.948
3.572SerPhe: 3.572 ± 0.905
1.34SerGly: 1.34 ± 0.471
0.447SerHis: 0.447 ± 0.344
5.358SerIle: 5.358 ± 1.144
4.242SerLys: 4.242 ± 0.86
6.921SerLeu: 6.921 ± 1.478
0.893SerMet: 0.893 ± 0.427
3.126SerAsn: 3.126 ± 0.724
1.34SerPro: 1.34 ± 0.554
2.009SerGln: 2.009 ± 0.593
2.902SerArg: 2.902 ± 0.8
1.786SerSer: 1.786 ± 0.446
2.456SerThr: 2.456 ± 0.661
4.019SerVal: 4.019 ± 1.095
0.893SerTrp: 0.893 ± 0.41
2.679SerTyr: 2.679 ± 0.729
0.0SerXaa: 0.0 ± 0.0
Thr
3.349ThrAla: 3.349 ± 0.956
0.0ThrCys: 0.0 ± 0.0
2.902ThrAsp: 2.902 ± 1.0
4.912ThrGlu: 4.912 ± 1.222
3.349ThrPhe: 3.349 ± 1.06
3.572ThrGly: 3.572 ± 0.974
0.447ThrHis: 0.447 ± 0.277
5.135ThrIle: 5.135 ± 1.225
2.902ThrLys: 2.902 ± 0.776
4.689ThrLeu: 4.689 ± 0.979
1.563ThrMet: 1.563 ± 0.561
2.679ThrAsn: 2.679 ± 0.819
4.019ThrPro: 4.019 ± 1.093
2.902ThrGln: 2.902 ± 0.793
2.233ThrArg: 2.233 ± 0.62
3.349ThrSer: 3.349 ± 0.888
3.349ThrThr: 3.349 ± 0.989
3.572ThrVal: 3.572 ± 1.136
0.893ThrTrp: 0.893 ± 0.32
4.019ThrTyr: 4.019 ± 1.049
0.0ThrXaa: 0.0 ± 0.0
Val
2.902ValAla: 2.902 ± 0.525
0.0ValCys: 0.0 ± 0.0
1.786ValAsp: 1.786 ± 0.56
2.009ValGlu: 2.009 ± 0.742
2.679ValPhe: 2.679 ± 0.719
2.679ValGly: 2.679 ± 0.828
0.893ValHis: 0.893 ± 0.522
5.135ValIle: 5.135 ± 0.996
5.135ValLys: 5.135 ± 1.176
5.135ValLeu: 5.135 ± 0.894
1.34ValMet: 1.34 ± 0.456
2.456ValAsn: 2.456 ± 0.829
0.67ValPro: 0.67 ± 0.405
3.126ValGln: 3.126 ± 0.749
1.116ValArg: 1.116 ± 0.405
5.805ValSer: 5.805 ± 1.099
3.126ValThr: 3.126 ± 1.072
3.349ValVal: 3.349 ± 1.064
0.447ValTrp: 0.447 ± 0.421
2.233ValTyr: 2.233 ± 0.576
0.0ValXaa: 0.0 ± 0.0
Trp
0.447TrpAla: 0.447 ± 0.291
0.0TrpCys: 0.0 ± 0.0
0.447TrpAsp: 0.447 ± 0.353
0.447TrpGlu: 0.447 ± 0.328
0.447TrpPhe: 0.447 ± 0.399
0.223TrpGly: 0.223 ± 0.229
0.223TrpHis: 0.223 ± 0.229
0.67TrpIle: 0.67 ± 0.367
1.34TrpLys: 1.34 ± 0.659
1.786TrpLeu: 1.786 ± 0.666
0.0TrpMet: 0.0 ± 0.0
0.447TrpAsn: 0.447 ± 0.286
0.223TrpPro: 0.223 ± 0.21
0.447TrpGln: 0.447 ± 0.302
0.447TrpArg: 0.447 ± 0.316
0.893TrpSer: 0.893 ± 0.383
0.447TrpThr: 0.447 ± 0.348
0.893TrpVal: 0.893 ± 0.44
0.67TrpTrp: 0.67 ± 0.471
0.223TrpTyr: 0.223 ± 0.211
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.563TyrAla: 1.563 ± 0.498
0.223TyrCys: 0.223 ± 0.209
2.233TyrAsp: 2.233 ± 0.525
3.349TyrGlu: 3.349 ± 0.904
2.902TyrPhe: 2.902 ± 0.638
3.126TyrGly: 3.126 ± 0.768
1.786TyrHis: 1.786 ± 0.862
1.34TyrIle: 1.34 ± 0.497
3.572TyrLys: 3.572 ± 0.906
3.572TyrLeu: 3.572 ± 0.668
1.34TyrMet: 1.34 ± 0.671
3.795TyrAsn: 3.795 ± 0.756
1.786TyrPro: 1.786 ± 0.678
2.902TyrGln: 2.902 ± 0.838
3.572TyrArg: 3.572 ± 1.123
2.456TyrSer: 2.456 ± 0.816
1.563TyrThr: 1.563 ± 0.472
0.67TyrVal: 0.67 ± 0.431
0.447TyrTrp: 0.447 ± 0.51
2.902TyrTyr: 2.902 ± 0.858
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 28 proteins (4480 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski