Amino acid dipeptide frequency for Hubei sobemo-like virus 32

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
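As a rough illustration of how such a frequency could be computed, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It assumes one-letter amino acid codes and that pairs are counted within each protein only; the function name and the toy sequences are hypothetical, and the database may count dipeptides differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein.
        # Assumption: pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration only, not the actual proteome.
freqs = dipeptide_frequencies(["MKVLA", "AVV"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The frequencies of all observed dipeptides sum to 1000 per mille by construction.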

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
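The comparison against the random expectation can be sketched as follows. This assumes the baseline is the uniform value quoted above (1 in 400, or 1 in 441 with Xaa); the exact rule used to color the table is not stated here, so the `classify` helper is hypothetical. The observed values are taken from the table.

```python
def uniform_expectation(alphabet_size=20):
    """Expected per mille frequency if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


expected = uniform_expectation()  # 1000 / 400 per mille
# Values copied from the table, in per mille.
observed = {"ValVal": 11.662, "LeuVal": 10.69, "AlaCys": 0.972, "CysCys": 0.0}
labels = {pair: classify(value, expected) for pair, value in observed.items()}
for pair, label in labels.items():
    print(pair, observed[pair], label)
```

Under this assumed rule, ValVal and LeuVal count as overrepresented, while AlaCys and CysCys count as underrepresented.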

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.831 ± 1.689
AlaCys: 0.972 ± 0.529
AlaAsp: 3.887 ± 0.631
AlaGlu: 2.915 ± 1.587
AlaPhe: 2.915 ± 1.587
AlaGly: 5.831 ± 1.689
AlaHis: 2.915 ± 0.102
AlaIle: 0.972 ± 0.529
AlaLys: 8.746 ± 1.18
AlaLeu: 3.887 ± 2.34
AlaMet: 2.915 ± 0.102
AlaAsn: 2.915 ± 1.587
AlaPro: 1.944 ± 1.058
AlaGln: 0.972 ± 0.956
AlaArg: 1.944 ± 1.058
AlaSer: 0.972 ± 0.529
AlaThr: 3.887 ± 2.116
AlaVal: 3.887 ± 2.116
AlaTrp: 2.915 ± 1.383
AlaTyr: 2.915 ± 0.102
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.972 ± 0.529
CysCys: 0.0 ± 0.0
CysAsp: 0.972 ± 0.956
CysGlu: 3.887 ± 0.631
CysPhe: 0.972 ± 0.529
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.972 ± 0.956
CysLeu: 1.944 ± 1.058
CysMet: 0.972 ± 0.956
CysAsn: 0.972 ± 0.956
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 2.915 ± 1.383
CysTrp: 0.0 ± 0.0
CysTyr: 0.972 ± 0.956
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.972 ± 0.529
AspCys: 0.972 ± 0.529
AspAsp: 2.915 ± 0.102
AspGlu: 2.915 ± 1.587
AspPhe: 1.944 ± 0.427
AspGly: 4.859 ± 1.811
AspHis: 0.972 ± 0.956
AspIle: 2.915 ± 0.102
AspLys: 4.859 ± 0.325
AspLeu: 5.831 ± 2.767
AspMet: 0.972 ± 0.529
AspAsn: 1.944 ± 1.058
AspPro: 2.915 ± 1.383
AspGln: 1.944 ± 1.912
AspArg: 3.887 ± 3.825
AspSer: 2.915 ± 1.587
AspThr: 1.944 ± 0.427
AspVal: 3.887 ± 0.631
AspTrp: 1.944 ± 1.912
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.746 ± 1.79
GluCys: 0.0 ± 0.0
GluAsp: 0.972 ± 0.956
GluGlu: 4.859 ± 0.325
GluPhe: 5.831 ± 1.282
GluGly: 2.915 ± 1.587
GluHis: 0.972 ± 0.956
GluIle: 5.831 ± 0.204
GluLys: 6.803 ± 3.703
GluLeu: 4.859 ± 1.811
GluMet: 1.944 ± 0.347
GluAsn: 4.859 ± 0.325
GluPro: 1.944 ± 1.058
GluGln: 2.915 ± 0.102
GluArg: 2.915 ± 1.587
GluSer: 2.915 ± 1.587
GluThr: 2.915 ± 0.102
GluVal: 4.859 ± 1.16
GluTrp: 1.944 ± 0.427
GluTyr: 2.915 ± 0.102
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.944 ± 0.427
PheCys: 0.0 ± 0.0
PheAsp: 3.887 ± 2.34
PheGlu: 2.915 ± 1.383
PhePhe: 0.972 ± 0.956
PheGly: 4.859 ± 2.645
PheHis: 1.944 ± 1.058
PheIle: 0.972 ± 0.956
PheLys: 1.944 ± 1.058
PheLeu: 4.859 ± 1.16
PheMet: 0.0 ± 0.0
PheAsn: 1.944 ± 0.427
PhePro: 0.0 ± 0.0
PheGln: 2.915 ± 1.383
PheArg: 1.944 ± 1.058
PheSer: 0.972 ± 0.529
PheThr: 0.0 ± 0.0
PheVal: 2.915 ± 0.102
PheTrp: 0.0 ± 0.0
PheTyr: 1.944 ± 1.058
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.887 ± 0.631
GlyCys: 0.0 ± 0.0
GlyAsp: 3.887 ± 0.631
GlyGlu: 5.831 ± 0.204
GlyPhe: 3.887 ± 2.116
GlyGly: 2.915 ± 1.383
GlyHis: 3.887 ± 2.34
GlyIle: 1.944 ± 1.058
GlyLys: 2.915 ± 0.102
GlyLeu: 4.859 ± 1.16
GlyMet: 2.915 ± 1.383
GlyAsn: 1.944 ± 1.058
GlyPro: 2.915 ± 1.587
GlyGln: 5.831 ± 1.689
GlyArg: 0.0 ± 0.0
GlySer: 6.803 ± 0.733
GlyThr: 0.972 ± 0.956
GlyVal: 3.887 ± 2.116
GlyTrp: 1.944 ± 1.912
GlyTyr: 1.944 ± 0.427
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.944 ± 1.912
HisCys: 2.915 ± 0.102
HisAsp: 0.972 ± 0.529
HisGlu: 1.944 ± 1.058
HisPhe: 0.972 ± 0.956
HisGly: 2.915 ± 0.102
HisHis: 0.0 ± 0.0
HisIle: 0.972 ± 0.956
HisLys: 1.944 ± 0.427
HisLeu: 1.944 ± 1.058
HisMet: 0.972 ± 0.956
HisAsn: 0.0 ± 0.0
HisPro: 1.944 ± 0.427
HisGln: 0.0 ± 0.0
HisArg: 1.944 ± 1.912
HisSer: 0.0 ± 0.0
HisThr: 2.915 ± 0.102
HisVal: 3.887 ± 0.854
HisTrp: 0.0 ± 0.0
HisTyr: 0.972 ± 0.956
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.944 ± 0.427
IleCys: 2.915 ± 1.383
IleAsp: 1.944 ± 1.912
IleGlu: 0.972 ± 0.529
IlePhe: 0.972 ± 0.529
IleGly: 2.915 ± 0.102
IleHis: 0.972 ± 0.529
IleIle: 3.887 ± 0.854
IleLys: 3.887 ± 0.854
IleLeu: 4.859 ± 2.645
IleMet: 1.944 ± 0.427
IleAsn: 0.972 ± 0.529
IlePro: 3.887 ± 0.854
IleGln: 0.972 ± 0.956
IleArg: 2.915 ± 1.383
IleSer: 2.915 ± 0.102
IleThr: 0.972 ± 0.529
IleVal: 2.915 ± 1.383
IleTrp: 0.972 ± 0.956
IleTyr: 0.972 ± 0.956
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.944 ± 1.058
LysCys: 0.972 ± 0.529
LysAsp: 0.972 ± 0.529
LysGlu: 3.887 ± 2.34
LysPhe: 0.972 ± 0.529
LysGly: 1.944 ± 1.058
LysHis: 1.944 ± 0.427
LysIle: 1.944 ± 1.058
LysLys: 4.859 ± 1.16
LysLeu: 9.718 ± 3.621
LysMet: 5.831 ± 0.204
LysAsn: 0.972 ± 0.529
LysPro: 2.915 ± 1.383
LysGln: 1.944 ± 0.427
LysArg: 4.859 ± 2.645
LysSer: 3.887 ± 0.631
LysThr: 4.859 ± 1.16
LysVal: 8.746 ± 3.276
LysTrp: 0.972 ± 0.956
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.859 ± 1.16
LeuCys: 2.915 ± 1.383
LeuAsp: 6.803 ± 3.723
LeuGlu: 4.859 ± 0.325
LeuPhe: 4.859 ± 1.811
LeuGly: 5.831 ± 0.204
LeuHis: 1.944 ± 0.427
LeuIle: 1.944 ± 0.427
LeuLys: 6.803 ± 0.733
LeuLeu: 9.718 ± 2.136
LeuMet: 4.859 ± 1.16
LeuAsn: 0.972 ± 0.529
LeuPro: 5.831 ± 0.204
LeuGln: 0.972 ± 0.529
LeuArg: 6.803 ± 0.753
LeuSer: 2.915 ± 1.587
LeuThr: 3.887 ± 0.631
LeuVal: 10.69 ± 2.848
LeuTrp: 0.972 ± 0.529
LeuTyr: 0.972 ± 0.956
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.887 ± 2.116
MetCys: 0.0 ± 0.0
MetAsp: 1.944 ± 1.058
MetGlu: 4.859 ± 2.645
MetPhe: 0.0 ± 0.0
MetGly: 1.944 ± 1.058
MetHis: 3.887 ± 0.854
MetIle: 2.915 ± 0.102
MetLys: 2.915 ± 0.102
MetLeu: 2.915 ± 1.383
MetMet: 0.0 ± 0.0
MetAsn: 1.944 ± 1.058
MetPro: 0.0 ± 0.0
MetGln: 2.915 ± 0.102
MetArg: 0.0 ± 0.0
MetSer: 2.915 ± 1.383
MetThr: 0.972 ± 0.956
MetVal: 7.775 ± 1.709
MetTrp: 0.972 ± 0.956
MetTyr: 0.972 ± 0.529
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.887 ± 0.631
AsnCys: 0.0 ± 0.0
AsnAsp: 2.915 ± 0.102
AsnGlu: 1.944 ± 1.058
AsnPhe: 0.972 ± 0.529
AsnGly: 1.944 ± 1.058
AsnHis: 0.0 ± 0.0
AsnIle: 1.944 ± 1.058
AsnLys: 1.944 ± 1.912
AsnLeu: 4.859 ± 2.645
AsnMet: 0.972 ± 0.771
AsnAsn: 0.0 ± 0.0
AsnPro: 3.887 ± 2.116
AsnGln: 0.972 ± 0.529
AsnArg: 0.972 ± 0.529
AsnSer: 2.915 ± 0.102
AsnThr: 1.944 ± 0.427
AsnVal: 0.972 ± 0.956
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.972 ± 0.956
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.944 ± 0.427
ProCys: 0.0 ± 0.0
ProAsp: 3.887 ± 2.116
ProGlu: 1.944 ± 0.427
ProPhe: 0.0 ± 0.0
ProGly: 3.887 ± 2.34
ProHis: 0.972 ± 0.956
ProIle: 0.972 ± 0.956
ProLys: 0.972 ± 0.529
ProLeu: 1.944 ± 1.058
ProMet: 2.915 ± 0.102
ProAsn: 0.972 ± 0.956
ProPro: 0.972 ± 0.529
ProGln: 0.0 ± 0.0
ProArg: 1.944 ± 1.058
ProSer: 6.803 ± 2.238
ProThr: 3.887 ± 0.631
ProVal: 5.831 ± 1.689
ProTrp: 0.0 ± 0.0
ProTyr: 2.915 ± 1.383
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.972 ± 0.529
GlnCys: 0.0 ± 0.0
GlnAsp: 0.972 ± 0.956
GlnGlu: 3.887 ± 0.854
GlnPhe: 1.944 ± 0.427
GlnGly: 0.972 ± 0.529
GlnHis: 0.0 ± 0.0
GlnIle: 2.915 ± 1.383
GlnLys: 0.972 ± 0.529
GlnLeu: 2.915 ± 1.587
GlnMet: 0.972 ± 0.529
GlnAsn: 1.944 ± 1.058
GlnPro: 0.972 ± 0.956
GlnGln: 0.972 ± 0.529
GlnArg: 1.944 ± 1.912
GlnSer: 1.944 ± 1.058
GlnThr: 1.944 ± 0.427
GlnVal: 1.944 ± 0.427
GlnTrp: 0.972 ± 0.956
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.915 ± 0.102
ArgCys: 0.0 ± 0.0
ArgAsp: 1.944 ± 0.427
ArgGlu: 4.859 ± 1.16
ArgPhe: 0.0 ± 0.0
ArgGly: 3.887 ± 0.854
ArgHis: 2.915 ± 1.383
ArgIle: 2.915 ± 1.383
ArgLys: 0.972 ± 0.529
ArgLeu: 4.859 ± 1.811
ArgMet: 0.972 ± 0.956
ArgAsn: 2.915 ± 1.587
ArgPro: 1.944 ± 0.427
ArgGln: 1.944 ± 1.058
ArgArg: 1.944 ± 1.058
ArgSer: 2.915 ± 0.102
ArgThr: 4.859 ± 2.645
ArgVal: 5.831 ± 2.767
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.915 ± 1.383
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.859 ± 1.16
SerCys: 1.944 ± 0.427
SerAsp: 4.859 ± 1.811
SerGlu: 5.831 ± 0.204
SerPhe: 0.972 ± 0.529
SerGly: 5.831 ± 0.204
SerHis: 0.972 ± 0.529
SerIle: 1.944 ± 0.427
SerLys: 3.887 ± 0.631
SerLeu: 1.944 ± 0.427
SerMet: 0.972 ± 0.529
SerAsn: 2.915 ± 0.102
SerPro: 3.887 ± 0.854
SerGln: 1.944 ± 0.427
SerArg: 3.887 ± 2.116
SerSer: 5.831 ± 0.204
SerThr: 3.887 ± 2.116
SerVal: 5.831 ± 1.282
SerTrp: 1.944 ± 1.058
SerTyr: 0.972 ± 0.956
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.887 ± 0.854
ThrCys: 0.0 ± 0.0
ThrAsp: 0.972 ± 0.529
ThrGlu: 2.915 ± 1.383
ThrPhe: 4.859 ± 1.16
ThrGly: 1.944 ± 1.058
ThrHis: 2.915 ± 1.587
ThrIle: 3.887 ± 0.854
ThrLys: 0.0 ± 0.0
ThrLeu: 6.803 ± 2.238
ThrMet: 5.831 ± 3.174
ThrAsn: 0.972 ± 0.956
ThrPro: 1.944 ± 0.427
ThrGln: 0.972 ± 0.529
ThrArg: 1.944 ± 0.427
ThrSer: 4.859 ± 1.811
ThrThr: 0.972 ± 0.529
ThrVal: 2.915 ± 0.102
ThrTrp: 4.859 ± 2.645
ThrTyr: 1.944 ± 1.058
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.803 ± 3.703
ValCys: 0.972 ± 0.956
ValAsp: 5.831 ± 1.282
ValGlu: 5.831 ± 1.689
ValPhe: 0.972 ± 0.956
ValGly: 5.831 ± 1.282
ValHis: 0.972 ± 0.956
ValIle: 2.915 ± 1.383
ValLys: 4.859 ± 1.16
ValLeu: 6.803 ± 3.703
ValMet: 3.887 ± 2.116
ValAsn: 3.887 ± 0.631
ValPro: 2.915 ± 1.383
ValGln: 0.972 ± 0.956
ValArg: 6.803 ± 2.238
ValSer: 10.69 ± 1.607
ValThr: 6.803 ± 0.733
ValVal: 11.662 ± 0.407
ValTrp: 2.915 ± 0.102
ValTyr: 1.944 ± 1.058
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.972 ± 0.529
TrpCys: 0.0 ± 0.0
TrpAsp: 0.972 ± 0.956
TrpGlu: 2.915 ± 0.102
TrpPhe: 0.972 ± 0.529
TrpGly: 0.972 ± 0.529
TrpHis: 0.0 ± 0.0
TrpIle: 0.972 ± 0.529
TrpLys: 2.915 ± 0.102
TrpLeu: 2.915 ± 0.102
TrpMet: 0.972 ± 0.956
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 2.915 ± 1.383
TrpSer: 0.972 ± 0.529
TrpThr: 3.887 ± 3.825
TrpVal: 0.0 ± 0.0
TrpTrp: 0.972 ± 0.529
TrpTyr: 0.972 ± 0.956
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.972 ± 0.956
TyrCys: 0.972 ± 0.956
TyrAsp: 0.0 ± 0.0
TyrGlu: 1.944 ± 1.058
TyrPhe: 1.944 ± 0.427
TyrGly: 0.972 ± 0.529
TyrHis: 0.972 ± 0.956
TyrIle: 1.944 ± 1.912
TyrLys: 0.972 ± 0.529
TyrLeu: 0.972 ± 0.529
TyrMet: 1.944 ± 0.427
TyrAsn: 1.944 ± 0.427
TyrPro: 1.944 ± 0.427
TyrGln: 0.0 ± 0.0
TyrArg: 1.944 ± 0.427
TyrSer: 0.972 ± 0.529
TyrThr: 3.887 ± 0.854
TyrVal: 2.915 ± 1.383
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.944 ± 1.058
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1030 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
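A protein-level bootstrap of this kind might look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. The use of the standard deviation as the error measure, the function names, and the toy sequences (one-letter codes) are all assumptions; the database's exact procedure is not described here.

```python
import random
import statistics


def dipeptide_permille(proteins, pair):
    """Per mille frequency of one dipeptide across a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled proteomes (assumed: std. dev.)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences for illustration only, not the actual proteome.
toy = ["MKVLA", "AVVKVKV"]
print(dipeptide_permille(toy, "KV"), bootstrap_error(toy, "KV"))
```

With only a handful of proteins, as in this proteome, each resample can contain only a few distinct combinations, so the estimated errors take a small number of distinct values.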

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski