Amino acid dipeptide frequency for Streptococcus satellite phage Javan382

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies, that is, how often each ordered pair of adjacent amino acids occurs.
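As a minimal sketch of what such a calculation can look like, the Python snippet below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, and the choice to normalize by the total number of within-protein pairs are assumptions made for illustration; the exact normalization used for the table on this page is not stated here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs inside one protein only;
        # a pair never spans the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the Javan382 proteome.
freqs = dipeptide_permille(["MKKLE", "MKB"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

One-letter codes are used in the sketch because that is how sequences are normally stored; the table on this page uses the equivalent three-letter codes.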

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (or by 0.1 to obtain percentages).
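The uniform expectation of 0.25% corresponds to 2.5‰ in the units of the table. The short Python sketch below computes that expectation and labels an observed value as over- or underrepresented. The function names and the strict greater-than/less-than comparison are assumptions for illustration; the exact rule used to color the cells is not described on this page.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation per dipeptide, in per mille (20 -> 400 pairs)."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Compare an observed frequency (per mille) with the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


print(expected_permille())   # 2.5 per mille, i.e. 0.25%
print(classify(14.577))      # LysLys value from the table: over
print(classify(0.583))       # AlaCys value from the table: under
```

Passing `alphabet_size=21` gives the expectation for the 441-pair case that includes Xaa.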

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is the frequency in ‰ ± the estimated error.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 0.0 ± 0.0 | 0.583 ± 0.48 | 1.166 ± 0.533 | 3.499 ± 1.675 | 0.583 ± 0.624 | 3.499 ± 1.639 | 0.0 ± 0.0 | 2.332 ± 0.794 | 7.58 ± 1.783 | 6.997 ± 2.293 | 1.749 ± 1.051 | 2.915 ± 1.846 | 0.0 ± 0.0 | 1.166 ± 0.679 | 3.499 ± 0.821 | 6.414 ± 1.684 | 4.665 ± 1.294 | 1.749 ± 0.893 | 1.166 ± 0.601 | 3.499 ± 1.639 | 0.0 ± 0.0
Cys | 1.166 ± 0.625 | 0.0 ± 0.0 | 0.583 ± 0.555 | 0.583 ± 0.482 | 0.0 ± 0.0 | 0.583 ± 0.482 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.583 ± 0.48 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.583 ± 0.482 | 0.583 ± 0.48 | 0.583 ± 0.624 | 0.583 ± 0.624 | 0.583 ± 0.482 | 0.583 ± 0.48 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 1.749 ± 1.071 | 0.583 ± 0.482 | 1.749 ± 1.447 | 4.082 ± 1.557 | 2.332 ± 1.328 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.082 ± 1.1 | 4.082 ± 1.302 | 5.831 ± 1.961 | 0.583 ± 0.853 | 2.915 ± 1.486 | 1.749 ± 0.896 | 2.332 ± 0.325 | 4.665 ± 1.303 | 2.915 ± 1.799 | 1.749 ± 0.926 | 2.915 ± 1.914 | 0.583 ± 0.482 | 2.332 ± 1.14 | 0.0 ± 0.0
Glu | 6.997 ± 2.771 | 1.749 ± 0.6 | 1.749 ± 0.916 | 8.163 ± 2.41 | 2.915 ± 1.805 | 3.499 ± 0.782 | 1.166 ± 1.109 | 8.163 ± 2.837 | 7.58 ± 1.826 | 9.913 ± 4.312 | 4.082 ± 1.622 | 6.414 ± 0.997 | 0.583 ± 0.555 | 2.332 ± 0.929 | 5.248 ± 2.877 | 1.166 ± 0.533 | 6.414 ± 1.884 | 4.665 ± 0.974 | 2.332 ± 1.18 | 3.499 ± 1.351 | 0.0 ± 0.0
Phe | 0.583 ± 0.48 | 0.0 ± 0.0 | 2.915 ± 1.799 | 2.915 ± 0.851 | 3.499 ± 1.61 | 2.332 ± 1.101 | 0.583 ± 0.48 | 1.749 ± 0.916 | 3.499 ± 1.157 | 4.665 ± 1.625 | 1.749 ± 1.664 | 2.332 ± 1.221 | 0.583 ± 0.482 | 1.749 ± 1.055 | 1.166 ± 0.533 | 2.915 ± 0.851 | 5.831 ± 1.408 | 2.915 ± 1.002 | 0.583 ± 0.482 | 0.0 ± 0.0 | 0.0 ± 0.0
Gly | 3.499 ± 0.757 | 0.583 ± 0.48 | 4.082 ± 1.088 | 4.082 ± 1.294 | 2.915 ± 0.851 | 2.332 ± 0.884 | 1.166 ± 0.96 | 2.332 ± 0.983 | 6.414 ± 1.173 | 6.997 ± 1.435 | 0.583 ± 0.482 | 1.166 ± 0.601 | 0.0 ± 0.0 | 2.332 ± 1.599 | 2.332 ± 0.778 | 2.332 ± 1.202 | 0.583 ± 0.48 | 5.831 ± 0.875 | 0.583 ± 0.482 | 2.332 ± 0.778 | 0.0 ± 0.0
His | 1.749 ± 1.079 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.332 ± 0.744 | 0.583 ± 0.48 | 0.583 ± 0.48 | 0.0 ± 0.0 | 0.583 ± 0.48 | 0.583 ± 0.555 | 1.166 ± 0.679 | 0.0 ± 0.0 | 1.166 ± 0.625 | 0.583 ± 0.624 | 0.583 ± 0.482 | 0.0 ± 0.0 | 0.583 ± 0.48 | 0.583 ± 0.48 | 0.583 ± 0.555 | 0.0 ± 0.0 | 0.583 ± 0.624 | 0.0 ± 0.0
Ile | 2.915 ± 1.38 | 0.583 ± 0.624 | 4.082 ± 0.626 | 6.997 ± 2.67 | 1.166 ± 0.533 | 1.749 ± 0.686 | 1.166 ± 0.601 | 4.665 ± 0.692 | 5.831 ± 1.995 | 5.248 ± 1.295 | 0.583 ± 0.48 | 1.749 ± 0.967 | 1.749 ± 0.686 | 2.915 ± 0.786 | 4.082 ± 0.879 | 5.248 ± 1.982 | 2.915 ± 1.002 | 4.665 ± 0.692 | 1.166 ± 0.864 | 1.749 ± 0.955 | 0.0 ± 0.0
Lys | 6.414 ± 2.337 | 0.0 ± 0.0 | 6.414 ± 1.258 | 11.662 ± 3.592 | 2.332 ± 0.799 | 4.665 ± 1.534 | 0.0 ± 0.0 | 9.329 ± 1.704 | 14.577 ± 2.627 | 6.414 ± 0.716 | 2.332 ± 1.232 | 5.248 ± 1.339 | 1.749 ± 1.145 | 5.248 ± 0.761 | 3.499 ± 0.757 | 5.248 ± 1.715 | 7.58 ± 2.074 | 6.414 ± 2.832 | 0.583 ± 0.48 | 3.499 ± 1.61 | 0.0 ± 0.0
Leu | 5.248 ± 1.393 | 0.0 ± 0.0 | 5.248 ± 1.512 | 8.746 ± 2.53 | 5.248 ± 1.01 | 6.414 ± 2.326 | 1.166 ± 0.533 | 1.166 ± 0.959 | 11.079 ± 1.614 | 6.414 ± 1.649 | 2.332 ± 1.578 | 10.496 ± 1.232 | 3.499 ± 1.309 | 3.499 ± 1.642 | 3.499 ± 0.723 | 4.665 ± 2.011 | 8.163 ± 2.251 | 5.831 ± 1.376 | 0.583 ± 0.48 | 4.082 ± 1.546 | 0.0 ± 0.0
Met | 1.166 ± 1.109 | 0.0 ± 0.0 | 0.583 ± 0.624 | 4.665 ± 2.252 | 0.583 ± 0.555 | 0.583 ± 0.627 | 0.0 ± 0.0 | 0.583 ± 0.555 | 1.749 ± 1.046 | 2.332 ± 1.006 | 0.583 ± 0.627 | 1.749 ± 0.91 | 0.0 ± 0.0 | 1.749 ± 0.6 | 0.583 ± 0.741 | 1.166 ± 0.601 | 2.332 ± 1.251 | 2.332 ± 1.366 | 0.0 ± 0.0 | 0.583 ± 0.482 | 0.0 ± 0.0
Asn | 1.166 ± 0.601 | 0.0 ± 0.0 | 3.499 ± 1.583 | 2.915 ± 1.059 | 2.332 ± 1.328 | 4.665 ± 1.247 | 0.583 ± 0.624 | 3.499 ± 1.657 | 5.831 ± 1.366 | 7.58 ± 0.734 | 1.166 ± 1.144 | 5.248 ± 1.874 | 1.749 ± 0.851 | 1.166 ± 0.533 | 3.499 ± 1.352 | 5.248 ± 1.718 | 4.665 ± 2.01 | 1.749 ± 1.051 | 1.749 ± 1.055 | 3.499 ± 3.097 | 0.0 ± 0.0
Pro | 1.166 ± 0.96 | 0.0 ± 0.0 | 1.749 ± 0.851 | 2.915 ± 1.002 | 0.583 ± 0.48 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.749 ± 0.919 | 1.166 ± 0.533 | 1.166 ± 0.618 | 1.166 ± 0.788 | 3.499 ± 1.284 | 2.332 ± 1.328 | 0.0 ± 0.0 | 0.583 ± 0.48 | 2.915 ± 1.034 | 3.499 ± 0.99 | 1.749 ± 0.91 | 0.0 ± 0.0 | 2.332 ± 0.744 | 0.0 ± 0.0
Gln | 1.749 ± 0.686 | 0.0 ± 0.0 | 0.583 ± 0.555 | 4.665 ± 1.598 | 0.583 ± 0.482 | 3.499 ± 0.622 | 0.583 ± 0.48 | 1.166 ± 1.109 | 4.082 ± 1.232 | 0.583 ± 0.627 | 1.749 ± 1.08 | 1.749 ± 0.524 | 1.166 ± 0.601 | 4.082 ± 2.263 | 1.166 ± 1.002 | 1.749 ± 1.08 | 1.749 ± 1.259 | 2.332 ± 1.393 | 0.0 ± 0.0 | 3.499 ± 1.599 | 0.0 ± 0.0
Arg | 4.665 ± 1.091 | 0.0 ± 0.0 | 2.915 ± 0.948 | 4.082 ± 0.93 | 4.082 ± 1.339 | 1.749 ± 0.896 | 2.915 ± 1.278 | 2.332 ± 0.894 | 4.665 ± 1.397 | 5.831 ± 1.912 | 1.749 ± 0.715 | 1.749 ± 1.055 | 1.749 ± 1.078 | 0.583 ± 0.48 | 4.082 ± 2.102 | 1.166 ± 0.625 | 1.749 ± 0.6 | 0.583 ± 0.482 | 0.0 ± 0.0 | 2.332 ± 0.799 | 0.0 ± 0.0
Ser | 0.583 ± 0.48 | 1.166 ± 0.601 | 4.665 ± 1.53 | 2.332 ± 1.006 | 2.915 ± 0.962 | 3.499 ± 0.577 | 0.0 ± 0.0 | 5.248 ± 2.459 | 6.414 ± 2.274 | 5.248 ± 1.331 | 0.583 ± 0.48 | 1.749 ± 1.44 | 4.082 ± 1.394 | 1.166 ± 0.625 | 2.332 ± 0.929 | 4.082 ± 1.206 | 2.332 ± 1.263 | 4.665 ± 1.31 | 0.0 ± 0.0 | 2.915 ± 1.44 | 0.0 ± 0.0
Thr | 2.915 ± 1.521 | 0.0 ± 0.0 | 2.915 ± 2.042 | 3.499 ± 1.832 | 1.749 ± 0.896 | 5.248 ± 1.55 | 1.166 ± 0.533 | 6.414 ± 2.389 | 4.082 ± 1.274 | 9.329 ± 2.846 | 1.166 ± 0.96 | 4.082 ± 2.194 | 4.665 ± 0.895 | 2.332 ± 0.991 | 1.749 ± 0.898 | 2.332 ± 0.995 | 3.499 ± 1.657 | 4.665 ± 1.517 | 0.583 ± 0.555 | 2.332 ± 0.971 | 0.0 ± 0.0
Val | 5.831 ± 2.173 | 0.583 ± 0.555 | 2.915 ± 0.683 | 7.58 ± 1.824 | 3.499 ± 0.564 | 1.749 ± 0.938 | 1.166 ± 0.739 | 4.665 ± 2.022 | 6.414 ± 0.787 | 3.499 ± 0.564 | 0.583 ± 0.555 | 3.499 ± 0.577 | 1.166 ± 0.625 | 0.583 ± 0.624 | 2.332 ± 0.935 | 2.915 ± 1.461 | 2.915 ± 0.962 | 2.332 ± 0.778 | 1.166 ± 1.465 | 2.915 ± 1.048 | 0.0 ± 0.0
Trp | 1.166 ± 0.533 | 0.583 ± 0.482 | 0.0 ± 0.0 | 1.749 ± 0.983 | 0.583 ± 0.555 | 1.166 ± 0.965 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.749 ± 0.919 | 2.332 ± 1.133 | 0.0 ± 0.0 | 1.166 ± 1.465 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.583 ± 0.482 | 0.583 ± 0.48 | 1.166 ± 0.601 | 0.0 ± 0.0 | 0.583 ± 0.48 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 2.332 ± 1.92 | 0.583 ± 0.48 | 0.0 ± 0.0 | 1.166 ± 0.824 | 4.082 ± 2.331 | 4.665 ± 2.004 | 0.583 ± 0.624 | 1.166 ± 0.618 | 5.831 ± 1.78 | 5.248 ± 0.904 | 0.0 ± 0.0 | 2.915 ± 1.775 | 1.166 ± 0.618 | 2.332 ± 1.599 | 3.499 ± 1.335 | 1.749 ± 1.08 | 1.749 ± 1.18 | 1.749 ± 1.145 | 1.166 ± 1.247 | 1.166 ± 1.002 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 7 proteins (1716 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
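A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is a generic illustration under stated assumptions, not the database's own procedure; the function names, the fixed seed, and the use of the population standard deviation as the error measure are all hypothetical choices.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not taken from the Javan382 proteome.
toy = ["MKKLE", "MKKAA", "MAEEL"]
print(bootstrap_error(toy, "KK"))
```

A dipeptide that never occurs has the same zero frequency in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.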

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski