Amino acid dipepetide frequency for Streptococcus satellite phage Javan418

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.0AlaCys: 0.0 ± 0.0
2.915AlaAsp: 2.915 ± 1.434
2.915AlaGlu: 2.915 ± 1.197
3.332AlaPhe: 3.332 ± 0.83
4.581AlaGly: 4.581 ± 0.883
0.416AlaHis: 0.416 ± 0.298
6.664AlaIle: 6.664 ± 1.525
4.581AlaLys: 4.581 ± 0.841
4.581AlaLeu: 4.581 ± 1.425
1.249AlaMet: 1.249 ± 0.76
0.833AlaAsn: 0.833 ± 0.436
1.249AlaPro: 1.249 ± 0.763
2.915AlaGln: 2.915 ± 1.435
2.915AlaArg: 2.915 ± 0.899
4.165AlaSer: 4.165 ± 1.335
3.332AlaThr: 3.332 ± 1.062
2.915AlaVal: 2.915 ± 0.843
1.249AlaTrp: 1.249 ± 0.755
4.165AlaTyr: 4.165 ± 0.964
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.416CysAsp: 0.416 ± 0.534
0.833CysGlu: 0.833 ± 0.743
0.416CysPhe: 0.416 ± 0.404
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.833CysLys: 0.833 ± 0.541
0.416CysLeu: 0.416 ± 0.557
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.416CysPro: 0.416 ± 0.329
0.0CysGln: 0.0 ± 0.0
0.416CysArg: 0.416 ± 0.329
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.416CysTrp: 0.416 ± 0.298
0.833CysTyr: 0.833 ± 0.683
0.0CysXaa: 0.0 ± 0.0
Asp
2.082AspAla: 2.082 ± 0.707
0.0AspCys: 0.0 ± 0.0
5.831AspAsp: 5.831 ± 1.725
4.998AspGlu: 4.998 ± 0.976
4.998AspPhe: 4.998 ± 1.401
1.666AspGly: 1.666 ± 0.669
0.0AspHis: 0.0 ± 0.0
4.998AspIle: 4.998 ± 1.031
7.497AspLys: 7.497 ± 1.591
5.414AspLeu: 5.414 ± 1.536
2.915AspMet: 2.915 ± 1.306
5.414AspAsn: 5.414 ± 1.353
0.416AspPro: 0.416 ± 0.344
0.416AspGln: 0.416 ± 0.344
2.082AspArg: 2.082 ± 0.954
1.666AspSer: 1.666 ± 0.649
3.748AspThr: 3.748 ± 1.23
2.499AspVal: 2.499 ± 0.827
0.416AspTrp: 0.416 ± 0.298
4.998AspTyr: 4.998 ± 1.332
0.0AspXaa: 0.0 ± 0.0
Glu
2.499GluAla: 2.499 ± 1.186
0.416GluCys: 0.416 ± 0.534
4.581GluAsp: 4.581 ± 1.489
7.497GluGlu: 7.497 ± 4.49
3.748GluPhe: 3.748 ± 1.123
2.082GluGly: 2.082 ± 0.768
1.249GluHis: 1.249 ± 0.674
4.581GluIle: 4.581 ± 1.561
5.414GluLys: 5.414 ± 1.849
11.245GluLeu: 11.245 ± 2.805
0.833GluMet: 0.833 ± 1.106
3.332GluAsn: 3.332 ± 1.021
0.833GluPro: 0.833 ± 0.596
4.581GluGln: 4.581 ± 2.001
4.165GluArg: 4.165 ± 1.256
2.915GluSer: 2.915 ± 1.186
3.332GluThr: 3.332 ± 1.401
5.414GluVal: 5.414 ± 1.347
0.833GluTrp: 0.833 ± 0.644
3.332GluTyr: 3.332 ± 1.145
0.0GluXaa: 0.0 ± 0.0
Phe
1.249PheAla: 1.249 ± 0.998
0.0PheCys: 0.0 ± 0.0
3.332PheAsp: 3.332 ± 0.973
2.499PheGlu: 2.499 ± 0.962
1.666PhePhe: 1.666 ± 0.626
1.666PheGly: 1.666 ± 0.611
2.082PheHis: 2.082 ± 1.229
2.915PheIle: 2.915 ± 0.726
3.332PheLys: 3.332 ± 1.063
4.581PheLeu: 4.581 ± 1.503
0.416PheMet: 0.416 ± 0.536
3.332PheAsn: 3.332 ± 1.071
0.416PhePro: 0.416 ± 0.298
1.249PheGln: 1.249 ± 0.63
1.666PheArg: 1.666 ± 0.896
3.332PheSer: 3.332 ± 0.977
2.082PheThr: 2.082 ± 0.697
1.666PheVal: 1.666 ± 0.604
0.833PheTrp: 0.833 ± 0.494
3.332PheTyr: 3.332 ± 1.025
0.0PheXaa: 0.0 ± 0.0
Gly
1.249GlyAla: 1.249 ± 0.699
1.249GlyCys: 1.249 ± 0.699
2.915GlyAsp: 2.915 ± 1.333
3.748GlyGlu: 3.748 ± 1.58
1.249GlyPhe: 1.249 ± 0.671
0.833GlyGly: 0.833 ± 0.43
0.416GlyHis: 0.416 ± 0.329
2.915GlyIle: 2.915 ± 0.978
4.165GlyLys: 4.165 ± 1.181
4.998GlyLeu: 4.998 ± 1.107
2.082GlyMet: 2.082 ± 0.88
0.416GlyAsn: 0.416 ± 0.534
0.0GlyPro: 0.0 ± 0.0
1.249GlyGln: 1.249 ± 0.474
1.249GlyArg: 1.249 ± 0.655
1.666GlySer: 1.666 ± 0.725
4.165GlyThr: 4.165 ± 1.592
2.915GlyVal: 2.915 ± 0.785
0.833GlyTrp: 0.833 ± 0.446
4.581GlyTyr: 4.581 ± 1.158
0.0GlyXaa: 0.0 ± 0.0
His
0.833HisAla: 0.833 ± 0.657
0.0HisCys: 0.0 ± 0.0
0.833HisAsp: 0.833 ± 0.668
0.833HisGlu: 0.833 ± 0.494
0.416HisPhe: 0.416 ± 0.298
1.666HisGly: 1.666 ± 0.745
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
1.666HisLys: 1.666 ± 0.895
1.666HisLeu: 1.666 ± 0.732
0.0HisMet: 0.0 ± 0.0
1.249HisAsn: 1.249 ± 0.701
0.0HisPro: 0.0 ± 0.0
1.666HisGln: 1.666 ± 0.987
0.416HisArg: 0.416 ± 0.298
0.416HisSer: 0.416 ± 0.329
0.416HisThr: 0.416 ± 0.329
1.666HisVal: 1.666 ± 0.903
0.833HisTrp: 0.833 ± 0.53
1.249HisTyr: 1.249 ± 0.786
0.0HisXaa: 0.0 ± 0.0
Ile
4.998IleAla: 4.998 ± 2.062
0.416IleCys: 0.416 ± 0.498
4.998IleAsp: 4.998 ± 1.331
3.748IleGlu: 3.748 ± 1.15
3.748IlePhe: 3.748 ± 1.249
2.082IleGly: 2.082 ± 0.733
0.833IleHis: 0.833 ± 0.54
4.165IleIle: 4.165 ± 1.09
10.829IleLys: 10.829 ± 1.891
6.247IleLeu: 6.247 ± 1.863
2.082IleMet: 2.082 ± 0.796
8.746IleAsn: 8.746 ± 1.473
3.332IlePro: 3.332 ± 1.339
2.915IleGln: 2.915 ± 0.876
2.915IleArg: 2.915 ± 0.703
3.332IleSer: 3.332 ± 1.406
5.831IleThr: 5.831 ± 0.849
3.332IleVal: 3.332 ± 1.01
1.249IleTrp: 1.249 ± 0.652
1.249IleTyr: 1.249 ± 0.836
0.0IleXaa: 0.0 ± 0.0
Lys
6.664LysAla: 6.664 ± 1.507
0.0LysCys: 0.0 ± 0.0
6.664LysAsp: 6.664 ± 1.612
11.245LysGlu: 11.245 ± 2.5
2.082LysPhe: 2.082 ± 0.699
4.581LysGly: 4.581 ± 1.429
0.833LysHis: 0.833 ± 0.657
9.163LysIle: 9.163 ± 2.339
10.412LysLys: 10.412 ± 1.288
6.664LysLeu: 6.664 ± 1.625
2.499LysMet: 2.499 ± 1.173
3.332LysAsn: 3.332 ± 1.373
5.831LysPro: 5.831 ± 1.625
7.497LysGln: 7.497 ± 1.896
3.748LysArg: 3.748 ± 1.206
6.664LysSer: 6.664 ± 1.312
5.414LysThr: 5.414 ± 1.174
2.082LysVal: 2.082 ± 0.619
0.416LysTrp: 0.416 ± 0.329
4.165LysTyr: 4.165 ± 1.18
0.0LysXaa: 0.0 ± 0.0
Leu
8.746LeuAla: 8.746 ± 1.809
0.0LeuCys: 0.0 ± 0.0
5.831LeuAsp: 5.831 ± 1.303
8.33LeuGlu: 8.33 ± 1.816
4.998LeuPhe: 4.998 ± 1.496
7.08LeuGly: 7.08 ± 1.62
3.332LeuHis: 3.332 ± 1.798
6.664LeuIle: 6.664 ± 2.32
10.829LeuLys: 10.829 ± 2.42
5.831LeuLeu: 5.831 ± 1.064
1.249LeuMet: 1.249 ± 0.687
8.33LeuAsn: 8.33 ± 2.438
2.915LeuPro: 2.915 ± 0.695
3.332LeuGln: 3.332 ± 0.987
4.165LeuArg: 4.165 ± 1.158
3.748LeuSer: 3.748 ± 1.504
4.998LeuThr: 4.998 ± 1.41
4.165LeuVal: 4.165 ± 1.264
0.416LeuTrp: 0.416 ± 0.329
4.165LeuTyr: 4.165 ± 1.362
0.0LeuXaa: 0.0 ± 0.0
Met
0.416MetAla: 0.416 ± 0.504
0.0MetCys: 0.0 ± 0.0
1.249MetAsp: 1.249 ± 0.575
0.833MetGlu: 0.833 ± 0.717
1.249MetPhe: 1.249 ± 0.5
1.249MetGly: 1.249 ± 0.741
0.0MetHis: 0.0 ± 0.0
1.666MetIle: 1.666 ± 1.079
0.833MetLys: 0.833 ± 0.71
3.748MetLeu: 3.748 ± 0.832
1.249MetMet: 1.249 ± 0.659
2.499MetAsn: 2.499 ± 0.708
0.833MetPro: 0.833 ± 0.746
0.416MetGln: 0.416 ± 0.329
0.416MetArg: 0.416 ± 0.459
0.833MetSer: 0.833 ± 0.539
3.332MetThr: 3.332 ± 0.818
0.833MetVal: 0.833 ± 0.67
0.0MetTrp: 0.0 ± 0.0
0.833MetTyr: 0.833 ± 0.638
0.0MetXaa: 0.0 ± 0.0
Asn
4.165AsnAla: 4.165 ± 1.381
0.833AsnCys: 0.833 ± 0.596
3.748AsnAsp: 3.748 ± 1.117
2.499AsnGlu: 2.499 ± 1.009
1.249AsnPhe: 1.249 ± 0.639
5.831AsnGly: 5.831 ± 1.368
1.249AsnHis: 1.249 ± 0.665
3.748AsnIle: 3.748 ± 1.032
4.998AsnLys: 4.998 ± 1.461
3.748AsnLeu: 3.748 ± 1.138
0.833AsnMet: 0.833 ± 0.766
3.332AsnAsn: 3.332 ± 0.716
3.332AsnPro: 3.332 ± 1.36
1.249AsnGln: 1.249 ± 0.439
3.332AsnArg: 3.332 ± 0.998
4.581AsnSer: 4.581 ± 1.478
4.581AsnThr: 4.581 ± 1.104
2.082AsnVal: 2.082 ± 0.555
0.0AsnTrp: 0.0 ± 0.0
0.833AsnTyr: 0.833 ± 0.62
0.0AsnXaa: 0.0 ± 0.0
Pro
2.082ProAla: 2.082 ± 0.974
0.416ProCys: 0.416 ± 0.459
4.165ProAsp: 4.165 ± 1.206
2.082ProGlu: 2.082 ± 0.718
1.666ProPhe: 1.666 ± 0.661
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
2.499ProIle: 2.499 ± 0.743
4.998ProLys: 4.998 ± 1.555
2.915ProLeu: 2.915 ± 1.096
0.416ProMet: 0.416 ± 0.557
1.666ProAsn: 1.666 ± 1.192
0.833ProPro: 0.833 ± 0.631
0.833ProGln: 0.833 ± 0.527
1.249ProArg: 1.249 ± 0.571
1.249ProSer: 1.249 ± 0.72
2.082ProThr: 2.082 ± 0.82
1.249ProVal: 1.249 ± 0.652
0.0ProTrp: 0.0 ± 0.0
0.833ProTyr: 0.833 ± 0.596
0.0ProXaa: 0.0 ± 0.0
Gln
3.748GlnAla: 3.748 ± 1.127
0.0GlnCys: 0.0 ± 0.0
3.332GlnAsp: 3.332 ± 1.112
4.581GlnGlu: 4.581 ± 1.579
1.249GlnPhe: 1.249 ± 0.714
0.833GlnGly: 0.833 ± 0.43
0.833GlnHis: 0.833 ± 0.505
4.581GlnIle: 4.581 ± 1.556
4.165GlnLys: 4.165 ± 0.924
5.831GlnLeu: 5.831 ± 1.225
0.0GlnMet: 0.0 ± 0.0
1.666GlnAsn: 1.666 ± 0.755
3.748GlnPro: 3.748 ± 1.279
4.165GlnGln: 4.165 ± 1.364
0.833GlnArg: 0.833 ± 0.53
3.748GlnSer: 3.748 ± 1.54
4.165GlnThr: 4.165 ± 0.972
2.915GlnVal: 2.915 ± 1.368
0.0GlnTrp: 0.0 ± 0.0
2.915GlnTyr: 2.915 ± 0.8
0.0GlnXaa: 0.0 ± 0.0
Arg
1.666ArgAla: 1.666 ± 0.697
0.0ArgCys: 0.0 ± 0.0
2.915ArgAsp: 2.915 ± 0.791
3.748ArgGlu: 3.748 ± 0.834
1.666ArgPhe: 1.666 ± 0.86
1.249ArgGly: 1.249 ± 0.755
0.833ArgHis: 0.833 ± 0.657
2.915ArgIle: 2.915 ± 0.887
3.332ArgLys: 3.332 ± 1.384
4.581ArgLeu: 4.581 ± 1.127
0.833ArgMet: 0.833 ± 0.527
0.0ArgAsn: 0.0 ± 0.0
0.0ArgPro: 0.0 ± 0.0
4.165ArgGln: 4.165 ± 0.959
2.499ArgArg: 2.499 ± 1.068
0.833ArgSer: 0.833 ± 0.366
4.165ArgThr: 4.165 ± 1.531
2.082ArgVal: 2.082 ± 1.009
1.249ArgTrp: 1.249 ± 0.706
1.666ArgTyr: 1.666 ± 0.782
0.0ArgXaa: 0.0 ± 0.0
Ser
2.082SerAla: 2.082 ± 0.737
0.833SerCys: 0.833 ± 0.492
2.082SerAsp: 2.082 ± 0.693
2.082SerGlu: 2.082 ± 1.133
2.915SerPhe: 2.915 ± 1.524
0.833SerGly: 0.833 ± 0.54
0.416SerHis: 0.416 ± 0.344
3.748SerIle: 3.748 ± 1.004
5.831SerLys: 5.831 ± 1.095
5.831SerLeu: 5.831 ± 1.526
2.082SerMet: 2.082 ± 0.838
2.499SerAsn: 2.499 ± 0.894
1.666SerPro: 1.666 ± 0.602
3.332SerGln: 3.332 ± 1.168
1.666SerArg: 1.666 ± 0.585
3.748SerSer: 3.748 ± 1.385
2.082SerThr: 2.082 ± 1.3
2.082SerVal: 2.082 ± 0.703
0.416SerTrp: 0.416 ± 0.464
3.748SerTyr: 3.748 ± 1.451
0.0SerXaa: 0.0 ± 0.0
Thr
4.581ThrAla: 4.581 ± 1.037
0.416ThrCys: 0.416 ± 0.557
2.915ThrAsp: 2.915 ± 1.047
3.332ThrGlu: 3.332 ± 0.887
2.499ThrPhe: 2.499 ± 1.127
2.499ThrGly: 2.499 ± 0.876
1.249ThrHis: 1.249 ± 0.626
8.33ThrIle: 8.33 ± 2.247
5.414ThrLys: 5.414 ± 2.331
6.247ThrLeu: 6.247 ± 1.184
0.416ThrMet: 0.416 ± 0.464
3.332ThrAsn: 3.332 ± 2.161
2.915ThrPro: 2.915 ± 1.155
4.581ThrGln: 4.581 ± 1.361
2.499ThrArg: 2.499 ± 0.885
1.666ThrSer: 1.666 ± 0.681
4.165ThrThr: 4.165 ± 1.708
4.581ThrVal: 4.581 ± 1.606
0.0ThrTrp: 0.0 ± 0.0
3.332ThrTyr: 3.332 ± 1.162
0.0ThrXaa: 0.0 ± 0.0
Val
3.332ValAla: 3.332 ± 1.063
0.0ValCys: 0.0 ± 0.0
0.416ValAsp: 0.416 ± 0.329
3.748ValGlu: 3.748 ± 0.825
1.666ValPhe: 1.666 ± 0.698
0.833ValGly: 0.833 ± 0.54
0.833ValHis: 0.833 ± 0.657
3.748ValIle: 3.748 ± 1.051
3.332ValLys: 3.332 ± 1.046
5.831ValLeu: 5.831 ± 0.855
2.082ValMet: 2.082 ± 0.714
1.666ValAsn: 1.666 ± 1.046
1.666ValPro: 1.666 ± 0.669
1.666ValGln: 1.666 ± 0.697
2.082ValArg: 2.082 ± 1.135
2.915ValSer: 2.915 ± 1.555
4.581ValThr: 4.581 ± 1.37
2.915ValVal: 2.915 ± 0.883
0.416ValTrp: 0.416 ± 0.344
3.748ValTyr: 3.748 ± 0.784
0.0ValXaa: 0.0 ± 0.0
Trp
0.833TrpAla: 0.833 ± 0.552
0.0TrpCys: 0.0 ± 0.0
0.833TrpAsp: 0.833 ± 0.436
1.249TrpGlu: 1.249 ± 0.706
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.833TrpIle: 0.833 ± 0.655
1.666TrpLys: 1.666 ± 0.853
0.833TrpLeu: 0.833 ± 0.596
0.0TrpMet: 0.0 ± 0.0
0.416TrpAsn: 0.416 ± 0.298
0.0TrpPro: 0.0 ± 0.0
1.249TrpGln: 1.249 ± 0.568
0.0TrpArg: 0.0 ± 0.0
0.833TrpSer: 0.833 ± 0.571
0.0TrpThr: 0.0 ± 0.0
0.833TrpVal: 0.833 ± 0.743
0.416TrpTrp: 0.416 ± 0.329
0.416TrpTyr: 0.416 ± 0.298
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.748TyrAla: 3.748 ± 1.2
0.416TyrCys: 0.416 ± 0.298
2.082TyrAsp: 2.082 ± 0.667
2.082TyrGlu: 2.082 ± 1.25
0.833TyrPhe: 0.833 ± 0.596
3.332TyrGly: 3.332 ± 1.345
1.249TyrHis: 1.249 ± 0.542
2.915TyrIle: 2.915 ± 0.763
5.831TyrLys: 5.831 ± 1.22
7.913TyrLeu: 7.913 ± 1.566
0.833TyrMet: 0.833 ± 0.479
4.581TyrAsn: 4.581 ± 1.195
0.833TyrPro: 0.833 ± 0.43
5.831TyrGln: 5.831 ± 1.696
2.082TyrArg: 2.082 ± 1.136
1.666TyrSer: 1.666 ± 0.697
2.499TyrThr: 2.499 ± 0.709
1.249TyrVal: 1.249 ± 0.628
0.416TyrTrp: 0.416 ± 0.298
1.666TyrTyr: 1.666 ± 0.866
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 14 proteins (2402 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski