Amino acid dipeptide frequency for Hubei sobemo-like virus 6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
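The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function name `dipeptide_permille`, the use of one-letter codes, and the choice to map every non-standard letter to `X` (Xaa) are assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 of 400 dipeptides, expressed in per mille (2.5).
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}
```

For a toy input such as `dipeptide_permille(["ALAL", "LA"])`, the pairs AL and LA each occur twice out of four, so both come out at 500‰. A dipeptide is overrepresented relative to the uniform model when its value exceeds `UNIFORM_PERMILLE`.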

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.

For more information, see the sequence space article on Wikipedia.

| First / second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 1.203 ± 1.707 | 1.203 ± 0.616 | 1.203 ± 0.616 | 1.203 ± 0.616 | 1.203 ± 1.707 | 4.813 ± 0.143 | 0.0 ± 0.0 | 3.61 ± 1.849 | 4.813 ± 0.143 | 6.017 ± 0.759 | 3.61 ± 5.12 | 1.203 ± 0.616 | 1.203 ± 0.616 | 3.61 ± 2.797 | 8.424 ± 1.992 | 2.407 ± 1.233 | 4.813 ± 0.143 | 8.424 ± 2.654 | 1.203 ± 1.707 | 2.407 ± 1.09 | 0.0 ± 0.0 |
| Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.203 ± 1.707 | 0.0 ± 0.0 | 1.203 ± 0.616 | 1.203 ± 0.616 | 0.0 ± 0.0 | 1.203 ± 1.707 | 0.0 ± 0.0 | 1.203 ± 0.616 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.203 ± 0.616 | 1.203 ± 1.707 | 2.407 ± 1.09 | 1.203 ± 0.616 | 2.407 ± 3.413 | 1.203 ± 1.707 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 1.203 ± 0.616 | 0.0 ± 0.0 | 6.017 ± 0.759 | 2.407 ± 1.09 | 3.61 ± 1.849 | 4.813 ± 2.181 | 0.0 ± 0.0 | 1.203 ± 0.616 | 2.407 ± 1.233 | 7.22 ± 1.375 | 2.407 ± 1.09 | 0.0 ± 0.0 | 2.407 ± 1.09 | 1.203 ± 1.707 | 3.61 ± 1.849 | 3.61 ± 1.849 | 1.203 ± 0.616 | 2.407 ± 1.233 | 2.407 ± 1.09 | 2.407 ± 1.233 | 0.0 ± 0.0 |
| Glu | 3.61 ± 0.474 | 0.0 ± 0.0 | 3.61 ± 1.849 | 4.813 ± 2.181 | 2.407 ± 1.09 | 8.424 ± 1.992 | 1.203 ± 1.707 | 2.407 ± 1.233 | 2.407 ± 1.09 | 6.017 ± 1.564 | 1.203 ± 2.585 | 1.203 ± 0.616 | 2.407 ± 1.09 | 6.017 ± 0.759 | 4.813 ± 2.466 | 2.407 ± 1.233 | 3.61 ± 1.849 | 2.407 ± 1.09 | 3.61 ± 1.849 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Phe | 2.407 ± 1.09 | 2.407 ± 1.233 | 2.407 ± 1.233 | 0.0 ± 0.0 | 3.61 ± 0.474 | 2.407 ± 1.233 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.203 ± 1.707 | 1.203 ± 0.616 | 2.407 ± 1.09 | 0.0 ± 0.0 | 2.407 ± 1.09 | 2.407 ± 1.09 | 3.61 ± 2.797 | 2.407 ± 1.233 | 1.203 ± 1.707 | 1.203 ± 0.616 | 2.407 ± 1.233 | 3.61 ± 1.849 | 0.0 ± 0.0 |
| Gly | 7.22 ± 1.375 | 2.407 ± 3.413 | 2.407 ± 3.413 | 1.203 ± 0.616 | 3.61 ± 0.474 | 8.424 ± 4.315 | 1.203 ± 0.616 | 2.407 ± 1.233 | 3.61 ± 1.849 | 6.017 ± 0.759 | 1.203 ± 0.616 | 2.407 ± 1.233 | 2.407 ± 1.09 | 2.407 ± 1.09 | 6.017 ± 0.759 | 7.22 ± 3.699 | 1.203 ± 0.616 | 3.61 ± 0.474 | 3.61 ± 0.474 | 4.813 ± 4.504 | 0.0 ± 0.0 |
| His | 2.407 ± 1.09 | 2.407 ± 1.09 | 1.203 ± 0.616 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.203 ± 0.616 | 1.203 ± 1.707 | 3.61 ± 0.474 | 1.203 ± 0.616 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.203 ± 0.616 | 0.0 ± 0.0 | 2.407 ± 1.09 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Ile | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.813 ± 0.143 | 0.0 ± 0.0 | 3.61 ± 1.849 | 2.407 ± 1.233 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.407 ± 1.233 | 1.203 ± 0.616 | 1.203 ± 0.616 | 2.407 ± 1.233 | 4.813 ± 2.466 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.203 ± 0.616 | 3.61 ± 0.474 | 1.203 ± 0.616 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Lys | 2.407 ± 1.233 | 0.0 ± 0.0 | 1.203 ± 0.616 | 3.61 ± 0.474 | 4.813 ± 0.143 | 2.407 ± 1.09 | 2.407 ± 3.413 | 0.0 ± 0.0 | 2.407 ± 1.233 | 4.813 ± 2.181 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.203 ± 0.616 | 1.203 ± 0.616 | 2.407 ± 1.233 | 6.017 ± 0.759 | 2.407 ± 1.233 | 2.407 ± 1.233 | 0.0 ± 0.0 | 1.203 ± 0.616 | 0.0 ± 0.0 |
| Leu | 7.22 ± 0.948 | 1.203 ± 1.707 | 4.813 ± 0.143 | 13.237 ± 2.134 | 3.61 ± 1.849 | 4.813 ± 2.466 | 0.0 ± 0.0 | 2.407 ± 1.09 | 2.407 ± 1.233 | 9.627 ± 6.684 | 4.813 ± 0.464 | 1.203 ± 0.616 | 6.017 ± 6.21 | 2.407 ± 1.233 | 13.237 ± 0.189 | 9.627 ± 4.361 | 3.61 ± 0.474 | 7.22 ± 3.699 | 0.0 ± 0.0 | 3.61 ± 2.797 | 0.0 ± 0.0 |
| Met | 0.0 ± 0.0 | 1.203 ± 1.707 | 1.203 ± 0.616 | 3.61 ± 1.849 | 2.407 ± 1.09 | 2.407 ± 3.413 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.407 ± 1.09 | 4.813 ± 4.504 | 1.203 ± 1.707 | 2.407 ± 1.09 | 2.407 ± 1.09 | 0.0 ± 0.0 | 2.407 ± 1.233 | 1.203 ± 0.616 | 2.407 ± 1.09 | 0.0 ± 0.0 | 2.407 ± 3.413 | 2.407 ± 1.233 | 0.0 ± 0.0 |
| Asn | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.203 ± 0.616 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.407 ± 1.233 | 0.0 ± 0.0 | 1.203 ± 0.616 | 1.203 ± 0.616 | 0.0 ± 0.0 | 1.203 ± 0.616 | 1.203 ± 0.616 | 1.203 ± 0.616 | 1.203 ± 0.616 | 4.813 ± 0.143 | 6.017 ± 3.887 | 2.407 ± 1.233 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.203 ± 0.616 | 0.0 ± 0.0 |
| Pro | 3.61 ± 2.797 | 1.203 ± 0.616 | 4.813 ± 2.181 | 2.407 ± 1.233 | 0.0 ± 0.0 | 1.203 ± 1.707 | 1.203 ± 0.616 | 1.203 ± 0.616 | 0.0 ± 0.0 | 4.813 ± 2.181 | 0.0 ± 0.0 | 1.203 ± 1.707 | 0.0 ± 0.0 | 1.203 ± 0.616 | 1.203 ± 0.616 | 3.61 ± 1.849 | 0.0 ± 0.0 | 4.813 ± 0.143 | 1.203 ± 0.616 | 3.61 ± 2.797 | 0.0 ± 0.0 |
| Gln | 1.203 ± 0.616 | 1.203 ± 1.707 | 0.0 ± 0.0 | 2.407 ± 1.09 | 2.407 ± 1.09 | 3.61 ± 1.849 | 3.61 ± 0.474 | 1.203 ± 0.616 | 3.61 ± 1.849 | 2.407 ± 1.233 | 0.0 ± 0.0 | 2.407 ± 1.233 | 1.203 ± 0.616 | 3.61 ± 1.849 | 4.813 ± 2.181 | 3.61 ± 1.849 | 4.813 ± 0.143 | 4.813 ± 2.466 | 0.0 ± 0.0 | 1.203 ± 0.616 | 0.0 ± 0.0 |
| Arg | 6.017 ± 0.759 | 0.0 ± 0.0 | 2.407 ± 1.233 | 7.22 ± 0.948 | 2.407 ± 1.09 | 2.407 ± 1.233 | 0.0 ± 0.0 | 1.203 ± 0.616 | 7.22 ± 3.699 | 6.017 ± 3.887 | 3.61 ± 5.12 | 4.813 ± 2.466 | 3.61 ± 1.849 | 4.813 ± 0.143 | 6.017 ± 0.759 | 4.813 ± 0.143 | 4.813 ± 2.466 | 10.83 ± 3.225 | 1.203 ± 1.707 | 1.203 ± 1.707 | 0.0 ± 0.0 |
| Ser | 4.813 ± 0.143 | 1.203 ± 0.616 | 4.813 ± 0.143 | 3.61 ± 1.849 | 0.0 ± 0.0 | 6.017 ± 1.564 | 1.203 ± 0.616 | 2.407 ± 1.233 | 1.203 ± 0.616 | 9.627 ± 2.608 | 4.813 ± 0.143 | 1.203 ± 1.707 | 2.407 ± 1.09 | 4.813 ± 2.466 | 4.813 ± 0.143 | 8.424 ± 1.992 | 4.813 ± 2.466 | 13.237 ± 6.781 | 1.203 ± 1.707 | 2.407 ± 3.413 | 0.0 ± 0.0 |
| Thr | 2.407 ± 1.233 | 1.203 ± 1.707 | 0.0 ± 0.0 | 1.203 ± 0.616 | 2.407 ± 1.233 | 2.407 ± 1.09 | 1.203 ± 0.616 | 3.61 ± 1.849 | 0.0 ± 0.0 | 9.627 ± 0.285 | 1.203 ± 0.616 | 0.0 ± 0.0 | 2.407 ± 1.233 | 2.407 ± 1.233 | 1.203 ± 1.707 | 7.22 ± 3.699 | 2.407 ± 1.09 | 3.61 ± 1.849 | 1.203 ± 0.616 | 2.407 ± 1.233 | 0.0 ± 0.0 |
| Val | 6.017 ± 1.564 | 0.0 ± 0.0 | 4.813 ± 2.466 | 7.22 ± 0.948 | 2.407 ± 1.233 | 10.83 ± 3.745 | 1.203 ± 0.616 | 0.0 ± 0.0 | 2.407 ± 3.413 | 8.424 ± 1.992 | 1.203 ± 0.616 | 3.61 ± 1.849 | 1.203 ± 0.616 | 4.813 ± 2.466 | 9.627 ± 2.608 | 7.22 ± 1.375 | 4.813 ± 2.466 | 9.627 ± 2.038 | 2.407 ± 1.09 | 2.407 ± 1.233 | 0.0 ± 0.0 |
| Trp | 3.61 ± 2.797 | 0.0 ± 0.0 | 2.407 ± 1.233 | 2.407 ± 1.09 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.61 ± 1.849 | 1.203 ± 0.616 | 3.61 ± 2.797 | 1.203 ± 0.616 | 1.203 ± 1.707 | 1.203 ± 0.616 | 0.0 ± 0.0 | 2.407 ± 1.09 | 1.203 ± 1.707 | 0.0 ± 0.0 | 1.203 ± 1.707 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 7.22 ± 0.948 | 0.0 ± 0.0 | 4.813 ± 0.143 | 2.407 ± 1.09 | 1.203 ± 1.707 | 1.203 ± 0.616 | 1.203 ± 1.707 | 0.0 ± 0.0 | 1.203 ± 1.707 | 3.61 ± 1.849 | 1.203 ± 1.707 | 0.0 ± 0.0 | 1.203 ± 1.707 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.407 ± 1.233 | 1.203 ± 0.616 | 6.017 ± 1.564 | 0.0 ± 0.0 | 2.407 ± 1.233 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2 proteins (832 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
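Protein-level bootstrapping can be illustrated with the sketch below. The page does not spell out the exact procedure, so the details here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the population standard deviation over the resamples is reported as the error. The names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread (per mille) of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation across resamples.
    return statistics.pstdev(estimates)
```

With only 2 proteins, each resample can take very few distinct compositions, which is one plausible reason why an error can be of similar size to, or larger than, the value it accompanies.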

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski