Amino acid dipeptide frequency for Streptococcus satellite phage Javan338

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would have a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
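The counting described above can be sketched in Python. This is a minimal illustration, not the code used to build this page: the function names `dipeptide_permille` and `classify` are hypothetical, one-letter codes are used instead of three-letter codes, and normalizing by the total number of overlapping dipeptides within each protein is an assumption.

```python
from collections import Counter
from itertools import product

# 20 standard amino acids (one-letter codes) plus X for Xaa (assumed mapping).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"
EXPECTED_PERMILLE = 1000.0 / 400  # uniform expectation over 400 dipeptides = 2.5


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Pairs are counted with overlap inside each protein and never across
    protein boundaries. Any non-standard letter is mapped to X.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    result = {}
    for a, b in product(ALPHABET, repeat=2):
        pair = a + b
        result[pair] = 1000.0 * counts[pair] / total if total else 0.0
    return result


def classify(freq, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["ACAC", "AA"]  # toy sequences, not the phage proteome
    freqs = dipeptide_permille(toy)
    print(freqs["AC"], classify(freqs["AC"]))
```

With real input one would pass the 20 protein sequences of the proteome. The exact normalization behind the table values is not stated on this page, so small differences from the table are possible.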

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.679AlaCys: 0.679 ± 0.427
2.716AlaAsp: 2.716 ± 0.561
5.433AlaGlu: 5.433 ± 1.581
3.735AlaPhe: 3.735 ± 1.38
3.056AlaGly: 3.056 ± 0.953
1.019AlaHis: 1.019 ± 0.527
3.056AlaIle: 3.056 ± 0.679
6.112AlaLys: 6.112 ± 1.283
5.093AlaLeu: 5.093 ± 1.469
2.377AlaMet: 2.377 ± 1.083
3.735AlaAsn: 3.735 ± 1.143
1.698AlaPro: 1.698 ± 0.671
2.037AlaGln: 2.037 ± 0.656
2.716AlaArg: 2.716 ± 1.005
1.698AlaSer: 1.698 ± 0.736
2.716AlaThr: 2.716 ± 0.913
2.716AlaVal: 2.716 ± 0.94
1.358AlaTrp: 1.358 ± 0.556
2.716AlaTyr: 2.716 ± 0.866
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.019CysAsp: 1.019 ± 0.481
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.34CysIle: 0.34 ± 0.289
0.679CysLys: 0.679 ± 0.452
0.34CysLeu: 0.34 ± 0.385
0.34CysMet: 0.34 ± 0.341
1.019CysAsn: 1.019 ± 0.671
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.34CysArg: 0.34 ± 0.291
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.34CysVal: 0.34 ± 0.326
0.0CysTrp: 0.0 ± 0.0
1.019CysTyr: 1.019 ± 0.588
0.0CysXaa: 0.0 ± 0.0
Asp
1.019AspAla: 1.019 ± 0.454
0.679AspCys: 0.679 ± 0.452
2.037AspAsp: 2.037 ± 0.725
3.735AspGlu: 3.735 ± 0.975
3.396AspPhe: 3.396 ± 0.987
1.698AspGly: 1.698 ± 0.533
0.0AspHis: 0.0 ± 0.0
6.452AspIle: 6.452 ± 1.513
5.433AspLys: 5.433 ± 1.311
9.847AspLeu: 9.847 ± 1.876
1.358AspMet: 1.358 ± 0.656
2.716AspAsn: 2.716 ± 1.016
1.019AspPro: 1.019 ± 0.635
1.019AspGln: 1.019 ± 0.597
3.396AspArg: 3.396 ± 0.809
3.396AspSer: 3.396 ± 0.955
3.396AspThr: 3.396 ± 1.085
2.377AspVal: 2.377 ± 0.785
0.34AspTrp: 0.34 ± 0.31
4.075AspTyr: 4.075 ± 1.388
0.0AspXaa: 0.0 ± 0.0
Glu
6.452GluAla: 6.452 ± 1.07
0.34GluCys: 0.34 ± 0.385
5.433GluAsp: 5.433 ± 1.697
6.791GluGlu: 6.791 ± 1.564
1.698GluPhe: 1.698 ± 0.617
4.075GluGly: 4.075 ± 1.088
3.056GluHis: 3.056 ± 0.699
6.452GluIle: 6.452 ± 1.544
6.791GluLys: 6.791 ± 0.971
10.866GluLeu: 10.866 ± 1.306
2.037GluMet: 2.037 ± 0.745
5.093GluAsn: 5.093 ± 1.134
2.037GluPro: 2.037 ± 1.051
4.754GluGln: 4.754 ± 1.336
6.112GluArg: 6.112 ± 1.611
4.075GluSer: 4.075 ± 0.838
5.433GluThr: 5.433 ± 1.451
3.735GluVal: 3.735 ± 0.904
0.679GluTrp: 0.679 ± 0.62
2.037GluTyr: 2.037 ± 0.693
0.0GluXaa: 0.0 ± 0.0
Phe
1.358PheAla: 1.358 ± 0.841
0.34PheCys: 0.34 ± 0.365
2.037PheAsp: 2.037 ± 0.874
3.056PheGlu: 3.056 ± 0.997
1.019PhePhe: 1.019 ± 0.481
2.037PheGly: 2.037 ± 0.687
0.679PheHis: 0.679 ± 0.383
3.735PheIle: 3.735 ± 1.285
4.414PheLys: 4.414 ± 1.069
5.772PheLeu: 5.772 ± 1.137
0.0PheMet: 0.0 ± 0.0
2.377PheAsn: 2.377 ± 1.297
0.679PhePro: 0.679 ± 0.441
2.716PheGln: 2.716 ± 0.687
2.037PheArg: 2.037 ± 0.798
3.056PheSer: 3.056 ± 0.68
2.037PheThr: 2.037 ± 0.954
1.698PheVal: 1.698 ± 0.797
0.679PheTrp: 0.679 ± 0.442
2.037PheTyr: 2.037 ± 0.662
0.0PheXaa: 0.0 ± 0.0
Gly
3.056GlyAla: 3.056 ± 1.251
0.34GlyCys: 0.34 ± 0.291
2.716GlyAsp: 2.716 ± 1.295
4.075GlyGlu: 4.075 ± 1.382
2.037GlyPhe: 2.037 ± 0.718
2.037GlyGly: 2.037 ± 0.648
0.679GlyHis: 0.679 ± 0.371
3.735GlyIle: 3.735 ± 1.045
2.377GlyLys: 2.377 ± 0.821
4.414GlyLeu: 4.414 ± 1.503
1.019GlyMet: 1.019 ± 0.717
2.716GlyAsn: 2.716 ± 0.98
0.0GlyPro: 0.0 ± 0.0
1.358GlyGln: 1.358 ± 0.573
1.698GlyArg: 1.698 ± 0.553
2.716GlySer: 2.716 ± 0.729
2.377GlyThr: 2.377 ± 1.084
2.716GlyVal: 2.716 ± 0.945
0.34GlyTrp: 0.34 ± 0.31
3.735GlyTyr: 3.735 ± 1.195
0.0GlyXaa: 0.0 ± 0.0
His
1.698HisAla: 1.698 ± 0.619
0.0HisCys: 0.0 ± 0.0
2.037HisAsp: 2.037 ± 0.995
0.679HisGlu: 0.679 ± 0.485
1.019HisPhe: 1.019 ± 0.597
2.377HisGly: 2.377 ± 0.837
0.0HisHis: 0.0 ± 0.0
1.019HisIle: 1.019 ± 0.597
0.679HisLys: 0.679 ± 0.463
1.698HisLeu: 1.698 ± 0.527
0.0HisMet: 0.0 ± 0.0
1.019HisAsn: 1.019 ± 0.467
0.0HisPro: 0.0 ± 0.0
0.34HisGln: 0.34 ± 0.385
0.679HisArg: 0.679 ± 0.505
1.698HisSer: 1.698 ± 0.574
1.698HisThr: 1.698 ± 0.723
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.698HisTyr: 1.698 ± 0.662
0.0HisXaa: 0.0 ± 0.0
Ile
4.414IleAla: 4.414 ± 1.114
0.679IleCys: 0.679 ± 0.462
6.791IleAsp: 6.791 ± 2.202
6.791IleGlu: 6.791 ± 2.366
3.056IlePhe: 3.056 ± 0.972
3.056IleGly: 3.056 ± 0.785
1.358IleHis: 1.358 ± 0.97
4.075IleIle: 4.075 ± 0.855
7.47IleLys: 7.47 ± 1.428
6.112IleLeu: 6.112 ± 1.327
1.019IleMet: 1.019 ± 0.521
3.056IleAsn: 3.056 ± 1.045
3.396IlePro: 3.396 ± 0.988
1.698IleGln: 1.698 ± 0.725
2.377IleArg: 2.377 ± 0.799
6.452IleSer: 6.452 ± 1.743
4.754IleThr: 4.754 ± 1.327
2.377IleVal: 2.377 ± 1.032
0.0IleTrp: 0.0 ± 0.0
4.075IleTyr: 4.075 ± 1.492
0.0IleXaa: 0.0 ± 0.0
Lys
5.772LysAla: 5.772 ± 1.711
0.0LysCys: 0.0 ± 0.0
5.093LysAsp: 5.093 ± 1.566
8.149LysGlu: 8.149 ± 1.87
2.377LysPhe: 2.377 ± 0.714
4.414LysGly: 4.414 ± 1.062
2.377LysHis: 2.377 ± 0.666
8.149LysIle: 8.149 ± 1.368
7.81LysLys: 7.81 ± 1.807
6.452LysLeu: 6.452 ± 1.256
3.396LysMet: 3.396 ± 1.33
3.396LysAsn: 3.396 ± 0.719
4.414LysPro: 4.414 ± 1.447
5.433LysGln: 5.433 ± 1.108
6.112LysArg: 6.112 ± 0.962
5.093LysSer: 5.093 ± 1.096
7.131LysThr: 7.131 ± 1.473
5.093LysVal: 5.093 ± 1.226
1.019LysTrp: 1.019 ± 0.603
3.056LysTyr: 3.056 ± 0.995
0.0LysXaa: 0.0 ± 0.0
Leu
5.772LeuAla: 5.772 ± 1.412
0.679LeuCys: 0.679 ± 0.427
7.81LeuAsp: 7.81 ± 1.602
13.243LeuGlu: 13.243 ± 2.355
3.735LeuPhe: 3.735 ± 1.308
3.396LeuGly: 3.396 ± 1.159
1.019LeuHis: 1.019 ± 0.448
7.47LeuIle: 7.47 ± 1.455
8.829LeuLys: 8.829 ± 1.495
9.847LeuLeu: 9.847 ± 2.103
2.716LeuMet: 2.716 ± 0.752
7.81LeuAsn: 7.81 ± 1.251
2.716LeuPro: 2.716 ± 0.938
4.754LeuGln: 4.754 ± 0.961
4.414LeuArg: 4.414 ± 1.246
5.433LeuSer: 5.433 ± 0.885
6.112LeuThr: 6.112 ± 1.255
4.075LeuVal: 4.075 ± 1.175
1.019LeuTrp: 1.019 ± 0.633
3.396LeuTyr: 3.396 ± 0.991
0.0LeuXaa: 0.0 ± 0.0
Met
2.377MetAla: 2.377 ± 0.849
0.0MetCys: 0.0 ± 0.0
2.037MetAsp: 2.037 ± 0.876
2.037MetGlu: 2.037 ± 0.798
1.358MetPhe: 1.358 ± 0.663
1.019MetGly: 1.019 ± 0.513
0.0MetHis: 0.0 ± 0.0
1.019MetIle: 1.019 ± 0.463
3.056MetLys: 3.056 ± 0.867
1.698MetLeu: 1.698 ± 0.668
0.679MetMet: 0.679 ± 0.457
2.377MetAsn: 2.377 ± 0.887
0.679MetPro: 0.679 ± 0.393
1.019MetGln: 1.019 ± 0.496
1.358MetArg: 1.358 ± 0.753
1.019MetSer: 1.019 ± 0.536
3.735MetThr: 3.735 ± 1.089
1.358MetVal: 1.358 ± 0.599
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.716AsnAla: 2.716 ± 0.988
0.0AsnCys: 0.0 ± 0.0
2.377AsnAsp: 2.377 ± 0.879
4.754AsnGlu: 4.754 ± 1.101
1.019AsnPhe: 1.019 ± 0.575
2.716AsnGly: 2.716 ± 1.193
1.019AsnHis: 1.019 ± 0.463
3.056AsnIle: 3.056 ± 1.175
5.433AsnLys: 5.433 ± 0.99
6.112AsnLeu: 6.112 ± 1.264
2.377AsnMet: 2.377 ± 0.657
3.056AsnAsn: 3.056 ± 0.919
3.056AsnPro: 3.056 ± 0.891
2.716AsnGln: 2.716 ± 0.826
2.716AsnArg: 2.716 ± 0.805
3.396AsnSer: 3.396 ± 0.749
3.396AsnThr: 3.396 ± 1.067
2.037AsnVal: 2.037 ± 0.771
0.34AsnTrp: 0.34 ± 0.315
1.358AsnTyr: 1.358 ± 0.489
0.0AsnXaa: 0.0 ± 0.0
Pro
1.698ProAla: 1.698 ± 0.667
0.0ProCys: 0.0 ± 0.0
2.037ProAsp: 2.037 ± 0.69
2.037ProGlu: 2.037 ± 0.776
1.698ProPhe: 1.698 ± 1.019
0.0ProGly: 0.0 ± 0.0
0.34ProHis: 0.34 ± 0.293
2.037ProIle: 2.037 ± 0.594
4.075ProLys: 4.075 ± 0.974
3.056ProLeu: 3.056 ± 0.799
0.679ProMet: 0.679 ± 0.463
2.377ProAsn: 2.377 ± 0.898
0.679ProPro: 0.679 ± 0.431
1.019ProGln: 1.019 ± 0.645
3.056ProArg: 3.056 ± 1.016
2.037ProSer: 2.037 ± 0.625
0.679ProThr: 0.679 ± 0.385
2.377ProVal: 2.377 ± 0.686
0.34ProTrp: 0.34 ± 0.31
1.358ProTyr: 1.358 ± 0.6
0.0ProXaa: 0.0 ± 0.0
Gln
4.075GlnAla: 4.075 ± 1.586
0.0GlnCys: 0.0 ± 0.0
1.698GlnAsp: 1.698 ± 0.737
4.075GlnGlu: 4.075 ± 1.022
0.679GlnPhe: 0.679 ± 0.446
2.037GlnGly: 2.037 ± 0.722
0.0GlnHis: 0.0 ± 0.0
3.396GlnIle: 3.396 ± 0.986
4.754GlnLys: 4.754 ± 1.248
3.056GlnLeu: 3.056 ± 0.974
0.0GlnMet: 0.0 ± 0.0
1.698GlnAsn: 1.698 ± 0.603
2.037GlnPro: 2.037 ± 0.807
1.358GlnGln: 1.358 ± 0.728
2.716GlnArg: 2.716 ± 0.694
0.679GlnSer: 0.679 ± 0.429
2.716GlnThr: 2.716 ± 0.997
2.377GlnVal: 2.377 ± 0.823
0.34GlnTrp: 0.34 ± 0.385
1.698GlnTyr: 1.698 ± 0.935
0.0GlnXaa: 0.0 ± 0.0
Arg
3.396ArgAla: 3.396 ± 0.856
0.0ArgCys: 0.0 ± 0.0
3.056ArgAsp: 3.056 ± 1.088
5.772ArgGlu: 5.772 ± 1.158
3.056ArgPhe: 3.056 ± 0.913
2.037ArgGly: 2.037 ± 0.723
1.698ArgHis: 1.698 ± 0.748
5.433ArgIle: 5.433 ± 1.163
3.735ArgLys: 3.735 ± 0.874
6.112ArgLeu: 6.112 ± 1.714
1.358ArgMet: 1.358 ± 0.811
3.056ArgAsn: 3.056 ± 0.831
1.019ArgPro: 1.019 ± 0.485
1.698ArgGln: 1.698 ± 0.646
1.019ArgArg: 1.019 ± 0.463
1.698ArgSer: 1.698 ± 0.663
2.377ArgThr: 2.377 ± 0.697
2.377ArgVal: 2.377 ± 1.031
0.0ArgTrp: 0.0 ± 0.0
3.735ArgTyr: 3.735 ± 0.886
0.0ArgXaa: 0.0 ± 0.0
Ser
3.396SerAla: 3.396 ± 0.765
0.0SerCys: 0.0 ± 0.0
3.735SerAsp: 3.735 ± 1.024
4.754SerGlu: 4.754 ± 1.475
1.698SerPhe: 1.698 ± 0.716
3.056SerGly: 3.056 ± 0.847
0.679SerHis: 0.679 ± 0.373
4.414SerIle: 4.414 ± 1.304
5.433SerLys: 5.433 ± 1.489
5.772SerLeu: 5.772 ± 1.287
1.358SerMet: 1.358 ± 0.466
2.377SerAsn: 2.377 ± 0.875
2.037SerPro: 2.037 ± 0.645
1.358SerGln: 1.358 ± 0.508
1.698SerArg: 1.698 ± 0.747
2.037SerSer: 2.037 ± 1.019
3.396SerThr: 3.396 ± 1.01
2.716SerVal: 2.716 ± 0.969
0.679SerTrp: 0.679 ± 0.388
2.377SerTyr: 2.377 ± 0.883
0.0SerXaa: 0.0 ± 0.0
Thr
3.396ThrAla: 3.396 ± 0.837
0.34ThrCys: 0.34 ± 0.341
1.358ThrAsp: 1.358 ± 0.813
3.735ThrGlu: 3.735 ± 0.738
6.112ThrPhe: 6.112 ± 2.364
3.056ThrGly: 3.056 ± 0.797
1.698ThrHis: 1.698 ± 0.585
3.056ThrIle: 3.056 ± 0.988
7.131ThrLys: 7.131 ± 1.729
8.489ThrLeu: 8.489 ± 1.475
3.396ThrMet: 3.396 ± 0.823
2.037ThrAsn: 2.037 ± 1.026
1.358ThrPro: 1.358 ± 0.643
2.037ThrGln: 2.037 ± 1.002
3.056ThrArg: 3.056 ± 0.707
4.075ThrSer: 4.075 ± 0.834
4.075ThrThr: 4.075 ± 1.034
3.735ThrVal: 3.735 ± 0.755
0.34ThrTrp: 0.34 ± 0.289
1.358ThrTyr: 1.358 ± 0.735
0.0ThrXaa: 0.0 ± 0.0
Val
2.377ValAla: 2.377 ± 0.944
0.34ValCys: 0.34 ± 0.31
1.698ValAsp: 1.698 ± 0.668
4.414ValGlu: 4.414 ± 1.425
1.358ValPhe: 1.358 ± 0.88
2.716ValGly: 2.716 ± 1.149
1.019ValHis: 1.019 ± 0.541
2.037ValIle: 2.037 ± 0.857
4.754ValLys: 4.754 ± 1.664
4.414ValLeu: 4.414 ± 1.379
1.019ValMet: 1.019 ± 0.539
1.698ValAsn: 1.698 ± 0.645
2.716ValPro: 2.716 ± 0.689
1.019ValGln: 1.019 ± 0.571
3.056ValArg: 3.056 ± 0.991
2.037ValSer: 2.037 ± 0.622
5.433ValThr: 5.433 ± 1.166
3.056ValVal: 3.056 ± 1.431
0.0ValTrp: 0.0 ± 0.0
1.698ValTyr: 1.698 ± 0.772
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.358TrpGlu: 1.358 ± 0.601
0.34TrpPhe: 0.34 ± 0.365
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.679TrpIle: 0.679 ± 0.442
0.34TrpLys: 0.34 ± 0.291
1.358TrpLeu: 1.358 ± 0.567
0.0TrpMet: 0.0 ± 0.0
0.34TrpAsn: 0.34 ± 0.385
0.34TrpPro: 0.34 ± 0.31
0.679TrpGln: 0.679 ± 0.442
0.679TrpArg: 0.679 ± 0.424
0.679TrpSer: 0.679 ± 0.377
0.0TrpThr: 0.0 ± 0.0
0.34TrpVal: 0.34 ± 0.289
0.0TrpTrp: 0.0 ± 0.0
0.34TrpTyr: 0.34 ± 0.31
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.698TyrAla: 1.698 ± 0.795
1.019TyrCys: 1.019 ± 0.527
1.698TyrAsp: 1.698 ± 0.512
3.056TyrGlu: 3.056 ± 1.048
2.377TyrPhe: 2.377 ± 1.069
1.698TyrGly: 1.698 ± 0.761
1.698TyrHis: 1.698 ± 0.63
3.396TyrIle: 3.396 ± 1.073
5.433TyrLys: 5.433 ± 1.369
3.735TyrLeu: 3.735 ± 1.236
1.358TyrMet: 1.358 ± 0.573
1.698TyrAsn: 1.698 ± 0.799
1.698TyrPro: 1.698 ± 0.835
2.377TyrGln: 2.377 ± 1.023
3.396TyrArg: 3.396 ± 1.125
1.698TyrSer: 1.698 ± 0.624
2.377TyrThr: 2.377 ± 0.81
1.358TyrVal: 1.358 ± 0.626
0.0TyrTrp: 0.0 ± 0.0
1.358TyrTyr: 1.358 ± 0.596
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 20 proteins (2946 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
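Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. This is only an illustration under stated assumptions: the names `permille` and `bootstrap_error` are hypothetical, one-letter codes are used, and taking the sample standard deviation as the error is an assumption, since the page does not say which spread measure was used.

```python
import random
import statistics


def permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, counting overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the residue. n_rounds must be >= 2.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(permille(sample, dipeptide))
    return statistics.stdev(estimates)  # assumed error measure


if __name__ == "__main__":
    toy = ["MKLEELK", "MALEK", "MKKLLE"]  # toy sequences, not the phage proteome
    print(permille(toy, "LE"), "±", bootstrap_error(toy, "LE"))
```

Resampling whole proteins rather than individual residues keeps the adjacent pairs within each sequence intact, which matters for a statistic defined on neighboring residues.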

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski