Amino acid dipepetide frequency for Hubei picorna-like virus 35

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.1AlaAla: 3.1 ± 0.0
1.722AlaCys: 1.722 ± 0.0
2.411AlaAsp: 2.411 ± 0.0
1.722AlaGlu: 1.722 ± 0.0
2.756AlaPhe: 2.756 ± 0.0
2.756AlaGly: 2.756 ± 0.0
1.033AlaHis: 1.033 ± 0.0
4.823AlaIle: 4.823 ± 0.0
3.789AlaLys: 3.789 ± 0.0
4.134AlaLeu: 4.134 ± 0.0
0.689AlaMet: 0.689 ± 0.0
2.756AlaAsn: 2.756 ± 0.0
1.722AlaPro: 1.722 ± 0.0
2.067AlaGln: 2.067 ± 0.0
1.033AlaArg: 1.033 ± 0.0
3.789AlaSer: 3.789 ± 0.0
2.067AlaThr: 2.067 ± 0.0
4.478AlaVal: 4.478 ± 0.0
1.033AlaTrp: 1.033 ± 0.0
1.378AlaTyr: 1.378 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.344CysAla: 0.344 ± 0.0
0.344CysCys: 0.344 ± 0.0
0.689CysAsp: 0.689 ± 0.0
1.722CysGlu: 1.722 ± 0.0
1.033CysPhe: 1.033 ± 0.0
2.067CysGly: 2.067 ± 0.0
0.344CysHis: 0.344 ± 0.0
0.689CysIle: 0.689 ± 0.0
0.344CysLys: 0.344 ± 0.0
1.033CysLeu: 1.033 ± 0.0
0.344CysMet: 0.344 ± 0.0
2.411CysAsn: 2.411 ± 0.0
1.722CysPro: 1.722 ± 0.0
0.344CysGln: 0.344 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.689CysSer: 0.689 ± 0.0
0.0CysThr: 0.0 ± 0.0
2.411CysVal: 2.411 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.689CysTyr: 0.689 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.033AspAla: 1.033 ± 0.0
1.722AspCys: 1.722 ± 0.0
4.478AspAsp: 4.478 ± 0.0
2.756AspGlu: 2.756 ± 0.0
5.167AspPhe: 5.167 ± 0.0
2.411AspGly: 2.411 ± 0.0
0.344AspHis: 0.344 ± 0.0
3.1AspIle: 3.1 ± 0.0
4.134AspLys: 4.134 ± 0.0
5.512AspLeu: 5.512 ± 0.0
0.689AspMet: 0.689 ± 0.0
3.445AspAsn: 3.445 ± 0.0
1.722AspPro: 1.722 ± 0.0
1.378AspGln: 1.378 ± 0.0
1.378AspArg: 1.378 ± 0.0
4.134AspSer: 4.134 ± 0.0
3.445AspThr: 3.445 ± 0.0
5.167AspVal: 5.167 ± 0.0
0.689AspTrp: 0.689 ± 0.0
2.756AspTyr: 2.756 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.445GluAla: 3.445 ± 0.0
0.689GluCys: 0.689 ± 0.0
3.1GluAsp: 3.1 ± 0.0
3.789GluGlu: 3.789 ± 0.0
1.722GluPhe: 1.722 ± 0.0
3.445GluGly: 3.445 ± 0.0
2.067GluHis: 2.067 ± 0.0
1.722GluIle: 1.722 ± 0.0
4.134GluLys: 4.134 ± 0.0
5.856GluLeu: 5.856 ± 0.0
1.378GluMet: 1.378 ± 0.0
1.722GluAsn: 1.722 ± 0.0
1.378GluPro: 1.378 ± 0.0
2.067GluGln: 2.067 ± 0.0
1.033GluArg: 1.033 ± 0.0
2.756GluSer: 2.756 ± 0.0
1.722GluThr: 1.722 ± 0.0
5.167GluVal: 5.167 ± 0.0
1.722GluTrp: 1.722 ± 0.0
1.722GluTyr: 1.722 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.033PheAla: 1.033 ± 0.0
1.033PheCys: 1.033 ± 0.0
1.722PheAsp: 1.722 ± 0.0
3.789PheGlu: 3.789 ± 0.0
1.378PhePhe: 1.378 ± 0.0
5.167PheGly: 5.167 ± 0.0
0.344PheHis: 0.344 ± 0.0
1.722PheIle: 1.722 ± 0.0
4.134PheLys: 4.134 ± 0.0
4.823PheLeu: 4.823 ± 0.0
1.722PheMet: 1.722 ± 0.0
2.756PheAsn: 2.756 ± 0.0
3.789PhePro: 3.789 ± 0.0
2.756PheGln: 2.756 ± 0.0
1.722PheArg: 1.722 ± 0.0
4.823PheSer: 4.823 ± 0.0
2.067PheThr: 2.067 ± 0.0
5.167PheVal: 5.167 ± 0.0
0.689PheTrp: 0.689 ± 0.0
2.067PheTyr: 2.067 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.789GlyAla: 3.789 ± 0.0
0.689GlyCys: 0.689 ± 0.0
4.478GlyAsp: 4.478 ± 0.0
3.1GlyGlu: 3.1 ± 0.0
3.445GlyPhe: 3.445 ± 0.0
4.823GlyGly: 4.823 ± 0.0
1.033GlyHis: 1.033 ± 0.0
5.167GlyIle: 5.167 ± 0.0
3.1GlyLys: 3.1 ± 0.0
5.856GlyLeu: 5.856 ± 0.0
2.411GlyMet: 2.411 ± 0.0
2.411GlyAsn: 2.411 ± 0.0
1.378GlyPro: 1.378 ± 0.0
1.378GlyGln: 1.378 ± 0.0
2.067GlyArg: 2.067 ± 0.0
4.823GlySer: 4.823 ± 0.0
4.478GlyThr: 4.478 ± 0.0
6.2GlyVal: 6.2 ± 0.0
1.378GlyTrp: 1.378 ± 0.0
1.033GlyTyr: 1.033 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.344HisAla: 0.344 ± 0.0
1.033HisCys: 1.033 ± 0.0
1.722HisAsp: 1.722 ± 0.0
0.689HisGlu: 0.689 ± 0.0
1.033HisPhe: 1.033 ± 0.0
2.411HisGly: 2.411 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.722HisIle: 1.722 ± 0.0
1.722HisLys: 1.722 ± 0.0
1.722HisLeu: 1.722 ± 0.0
0.344HisMet: 0.344 ± 0.0
1.033HisAsn: 1.033 ± 0.0
0.689HisPro: 0.689 ± 0.0
0.344HisGln: 0.344 ± 0.0
0.344HisArg: 0.344 ± 0.0
1.378HisSer: 1.378 ± 0.0
0.0HisThr: 0.0 ± 0.0
1.722HisVal: 1.722 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.344HisTyr: 0.344 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.512IleAla: 5.512 ± 0.0
0.689IleCys: 0.689 ± 0.0
4.134IleAsp: 4.134 ± 0.0
2.411IleGlu: 2.411 ± 0.0
4.478IlePhe: 4.478 ± 0.0
3.1IleGly: 3.1 ± 0.0
2.411IleHis: 2.411 ± 0.0
3.445IleIle: 3.445 ± 0.0
3.1IleLys: 3.1 ± 0.0
4.134IleLeu: 4.134 ± 0.0
1.033IleMet: 1.033 ± 0.0
4.478IleAsn: 4.478 ± 0.0
3.445IlePro: 3.445 ± 0.0
3.445IleGln: 3.445 ± 0.0
2.756IleArg: 2.756 ± 0.0
3.445IleSer: 3.445 ± 0.0
3.789IleThr: 3.789 ± 0.0
7.234IleVal: 7.234 ± 0.0
0.689IleTrp: 0.689 ± 0.0
1.033IleTyr: 1.033 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.1LysAla: 3.1 ± 0.0
1.378LysCys: 1.378 ± 0.0
3.1LysAsp: 3.1 ± 0.0
3.1LysGlu: 3.1 ± 0.0
2.756LysPhe: 2.756 ± 0.0
3.789LysGly: 3.789 ± 0.0
1.378LysHis: 1.378 ± 0.0
7.578LysIle: 7.578 ± 0.0
2.411LysLys: 2.411 ± 0.0
6.889LysLeu: 6.889 ± 0.0
1.033LysMet: 1.033 ± 0.0
3.445LysAsn: 3.445 ± 0.0
1.722LysPro: 1.722 ± 0.0
1.722LysGln: 1.722 ± 0.0
2.411LysArg: 2.411 ± 0.0
3.1LysSer: 3.1 ± 0.0
5.856LysThr: 5.856 ± 0.0
5.167LysVal: 5.167 ± 0.0
1.378LysTrp: 1.378 ± 0.0
2.067LysTyr: 2.067 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.823LeuAla: 4.823 ± 0.0
2.067LeuCys: 2.067 ± 0.0
5.512LeuAsp: 5.512 ± 0.0
4.478LeuGlu: 4.478 ± 0.0
3.445LeuPhe: 3.445 ± 0.0
4.478LeuGly: 4.478 ± 0.0
2.067LeuHis: 2.067 ± 0.0
5.856LeuIle: 5.856 ± 0.0
7.234LeuLys: 7.234 ± 0.0
7.923LeuLeu: 7.923 ± 0.0
2.411LeuMet: 2.411 ± 0.0
7.578LeuAsn: 7.578 ± 0.0
2.067LeuPro: 2.067 ± 0.0
3.1LeuGln: 3.1 ± 0.0
4.823LeuArg: 4.823 ± 0.0
8.612LeuSer: 8.612 ± 0.0
5.167LeuThr: 5.167 ± 0.0
6.545LeuVal: 6.545 ± 0.0
0.689LeuTrp: 0.689 ± 0.0
4.478LeuTyr: 4.478 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
0.689MetAla: 0.689 ± 0.0
0.0MetCys: 0.0 ± 0.0
0.344MetAsp: 0.344 ± 0.0
0.689MetGlu: 0.689 ± 0.0
2.067MetPhe: 2.067 ± 0.0
1.033MetGly: 1.033 ± 0.0
1.033MetHis: 1.033 ± 0.0
0.344MetIle: 0.344 ± 0.0
1.378MetLys: 1.378 ± 0.0
3.445MetLeu: 3.445 ± 0.0
0.344MetMet: 0.344 ± 0.0
2.067MetAsn: 2.067 ± 0.0
0.689MetPro: 0.689 ± 0.0
1.033MetGln: 1.033 ± 0.0
1.378MetArg: 1.378 ± 0.0
2.067MetSer: 2.067 ± 0.0
0.344MetThr: 0.344 ± 0.0
1.378MetVal: 1.378 ± 0.0
0.689MetTrp: 0.689 ± 0.0
1.378MetTyr: 1.378 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.067AsnAla: 2.067 ± 0.0
1.378AsnCys: 1.378 ± 0.0
1.722AsnAsp: 1.722 ± 0.0
1.722AsnGlu: 1.722 ± 0.0
4.478AsnPhe: 4.478 ± 0.0
3.1AsnGly: 3.1 ± 0.0
0.689AsnHis: 0.689 ± 0.0
4.823AsnIle: 4.823 ± 0.0
3.445AsnLys: 3.445 ± 0.0
5.167AsnLeu: 5.167 ± 0.0
1.378AsnMet: 1.378 ± 0.0
2.067AsnAsn: 2.067 ± 0.0
3.789AsnPro: 3.789 ± 0.0
2.067AsnGln: 2.067 ± 0.0
2.756AsnArg: 2.756 ± 0.0
4.478AsnSer: 4.478 ± 0.0
2.067AsnThr: 2.067 ± 0.0
6.2AsnVal: 6.2 ± 0.0
1.378AsnTrp: 1.378 ± 0.0
3.445AsnTyr: 3.445 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.067ProAla: 2.067 ± 0.0
0.0ProCys: 0.0 ± 0.0
1.033ProAsp: 1.033 ± 0.0
2.067ProGlu: 2.067 ± 0.0
3.1ProPhe: 3.1 ± 0.0
2.067ProGly: 2.067 ± 0.0
1.378ProHis: 1.378 ± 0.0
1.033ProIle: 1.033 ± 0.0
2.411ProLys: 2.411 ± 0.0
4.823ProLeu: 4.823 ± 0.0
0.689ProMet: 0.689 ± 0.0
2.067ProAsn: 2.067 ± 0.0
1.722ProPro: 1.722 ± 0.0
3.1ProGln: 3.1 ± 0.0
2.411ProArg: 2.411 ± 0.0
2.756ProSer: 2.756 ± 0.0
2.756ProThr: 2.756 ± 0.0
2.756ProVal: 2.756 ± 0.0
0.0ProTrp: 0.0 ± 0.0
2.756ProTyr: 2.756 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.722GlnAla: 1.722 ± 0.0
0.344GlnCys: 0.344 ± 0.0
2.756GlnAsp: 2.756 ± 0.0
3.1GlnGlu: 3.1 ± 0.0
2.756GlnPhe: 2.756 ± 0.0
3.789GlnGly: 3.789 ± 0.0
1.033GlnHis: 1.033 ± 0.0
2.411GlnIle: 2.411 ± 0.0
2.067GlnLys: 2.067 ± 0.0
4.823GlnLeu: 4.823 ± 0.0
0.689GlnMet: 0.689 ± 0.0
1.378GlnAsn: 1.378 ± 0.0
1.722GlnPro: 1.722 ± 0.0
1.378GlnGln: 1.378 ± 0.0
1.722GlnArg: 1.722 ± 0.0
3.445GlnSer: 3.445 ± 0.0
2.067GlnThr: 2.067 ± 0.0
3.445GlnVal: 3.445 ± 0.0
0.344GlnTrp: 0.344 ± 0.0
1.722GlnTyr: 1.722 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.411ArgAla: 2.411 ± 0.0
0.344ArgCys: 0.344 ± 0.0
3.1ArgAsp: 3.1 ± 0.0
2.411ArgGlu: 2.411 ± 0.0
1.378ArgPhe: 1.378 ± 0.0
1.722ArgGly: 1.722 ± 0.0
0.344ArgHis: 0.344 ± 0.0
2.411ArgIle: 2.411 ± 0.0
2.411ArgLys: 2.411 ± 0.0
2.756ArgLeu: 2.756 ± 0.0
0.0ArgMet: 0.0 ± 0.0
2.756ArgAsn: 2.756 ± 0.0
1.378ArgPro: 1.378 ± 0.0
1.722ArgGln: 1.722 ± 0.0
3.445ArgArg: 3.445 ± 0.0
1.722ArgSer: 1.722 ± 0.0
3.445ArgThr: 3.445 ± 0.0
2.067ArgVal: 2.067 ± 0.0
0.689ArgTrp: 0.689 ± 0.0
2.067ArgTyr: 2.067 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
2.411SerAla: 2.411 ± 0.0
0.689SerCys: 0.689 ± 0.0
3.789SerAsp: 3.789 ± 0.0
2.756SerGlu: 2.756 ± 0.0
4.134SerPhe: 4.134 ± 0.0
4.134SerGly: 4.134 ± 0.0
0.689SerHis: 0.689 ± 0.0
4.478SerIle: 4.478 ± 0.0
5.167SerLys: 5.167 ± 0.0
5.512SerLeu: 5.512 ± 0.0
2.067SerMet: 2.067 ± 0.0
3.789SerAsn: 3.789 ± 0.0
3.1SerPro: 3.1 ± 0.0
2.756SerGln: 2.756 ± 0.0
1.722SerArg: 1.722 ± 0.0
8.267SerSer: 8.267 ± 0.0
4.823SerThr: 4.823 ± 0.0
6.2SerVal: 6.2 ± 0.0
2.411SerTrp: 2.411 ± 0.0
2.756SerTyr: 2.756 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.1ThrAla: 3.1 ± 0.0
0.0ThrCys: 0.0 ± 0.0
3.445ThrAsp: 3.445 ± 0.0
2.411ThrGlu: 2.411 ± 0.0
1.033ThrPhe: 1.033 ± 0.0
3.789ThrGly: 3.789 ± 0.0
0.0ThrHis: 0.0 ± 0.0
4.478ThrIle: 4.478 ± 0.0
2.067ThrLys: 2.067 ± 0.0
4.134ThrLeu: 4.134 ± 0.0
2.411ThrMet: 2.411 ± 0.0
4.478ThrAsn: 4.478 ± 0.0
1.378ThrPro: 1.378 ± 0.0
6.545ThrGln: 6.545 ± 0.0
3.1ThrArg: 3.1 ± 0.0
3.445ThrSer: 3.445 ± 0.0
3.1ThrThr: 3.1 ± 0.0
5.167ThrVal: 5.167 ± 0.0
0.689ThrTrp: 0.689 ± 0.0
1.033ThrTyr: 1.033 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
6.2ValAla: 6.2 ± 0.0
1.722ValCys: 1.722 ± 0.0
5.856ValAsp: 5.856 ± 0.0
5.167ValGlu: 5.167 ± 0.0
2.411ValPhe: 2.411 ± 0.0
5.167ValGly: 5.167 ± 0.0
1.033ValHis: 1.033 ± 0.0
6.2ValIle: 6.2 ± 0.0
7.923ValLys: 7.923 ± 0.0
7.234ValLeu: 7.234 ± 0.0
2.411ValMet: 2.411 ± 0.0
3.789ValAsn: 3.789 ± 0.0
5.856ValPro: 5.856 ± 0.0
3.445ValGln: 3.445 ± 0.0
1.722ValArg: 1.722 ± 0.0
3.445ValSer: 3.445 ± 0.0
5.856ValThr: 5.856 ± 0.0
6.889ValVal: 6.889 ± 0.0
1.722ValTrp: 1.722 ± 0.0
2.756ValTyr: 2.756 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.033TrpAla: 1.033 ± 0.0
0.689TrpCys: 0.689 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.033TrpGlu: 1.033 ± 0.0
1.033TrpPhe: 1.033 ± 0.0
1.722TrpGly: 1.722 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.689TrpIle: 0.689 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.722TrpLeu: 1.722 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.378TrpAsn: 1.378 ± 0.0
0.689TrpPro: 0.689 ± 0.0
0.689TrpGln: 0.689 ± 0.0
1.378TrpArg: 1.378 ± 0.0
2.067TrpSer: 2.067 ± 0.0
0.689TrpThr: 0.689 ± 0.0
1.378TrpVal: 1.378 ± 0.0
0.344TrpTrp: 0.344 ± 0.0
0.344TrpTyr: 0.344 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.378TyrAla: 1.378 ± 0.0
0.689TyrCys: 0.689 ± 0.0
2.067TyrAsp: 2.067 ± 0.0
1.722TyrGlu: 1.722 ± 0.0
3.1TyrPhe: 3.1 ± 0.0
2.067TyrGly: 2.067 ± 0.0
1.378TyrHis: 1.378 ± 0.0
2.067TyrIle: 2.067 ± 0.0
2.411TyrLys: 2.411 ± 0.0
5.856TyrLeu: 5.856 ± 0.0
0.0TyrMet: 0.0 ± 0.0
2.411TyrAsn: 2.411 ± 0.0
1.033TyrPro: 1.033 ± 0.0
1.378TyrGln: 1.378 ± 0.0
1.378TyrArg: 1.378 ± 0.0
2.411TyrSer: 2.411 ± 0.0
2.067TyrThr: 2.067 ± 0.0
2.067TyrVal: 2.067 ± 0.0
0.344TyrTrp: 0.344 ± 0.0
1.722TyrTyr: 1.722 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2904 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski