Amino acid dipepetide frequency for Hubei picorna-like virus 59

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.889AlaAla: 6.889 ± 0.0
1.033AlaCys: 1.033 ± 0.0
3.445AlaAsp: 3.445 ± 0.0
4.134AlaGlu: 4.134 ± 0.0
1.722AlaPhe: 1.722 ± 0.0
2.756AlaGly: 2.756 ± 0.0
2.756AlaHis: 2.756 ± 0.0
4.823AlaIle: 4.823 ± 0.0
4.478AlaLys: 4.478 ± 0.0
7.578AlaLeu: 7.578 ± 0.0
3.1AlaMet: 3.1 ± 0.0
2.756AlaAsn: 2.756 ± 0.0
4.823AlaPro: 4.823 ± 0.0
3.445AlaGln: 3.445 ± 0.0
3.1AlaArg: 3.1 ± 0.0
4.134AlaSer: 4.134 ± 0.0
5.167AlaThr: 5.167 ± 0.0
4.823AlaVal: 4.823 ± 0.0
0.689AlaTrp: 0.689 ± 0.0
3.445AlaTyr: 3.445 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.689CysAla: 0.689 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.722CysAsp: 1.722 ± 0.0
0.344CysGlu: 0.344 ± 0.0
0.344CysPhe: 0.344 ± 0.0
1.722CysGly: 1.722 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.033CysIle: 1.033 ± 0.0
0.344CysLys: 0.344 ± 0.0
2.067CysLeu: 2.067 ± 0.0
0.689CysMet: 0.689 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.033CysPro: 1.033 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.689CysArg: 0.689 ± 0.0
1.033CysSer: 1.033 ± 0.0
1.378CysThr: 1.378 ± 0.0
0.689CysVal: 0.689 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.378CysTyr: 1.378 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.445AspAla: 3.445 ± 0.0
2.067AspCys: 2.067 ± 0.0
1.722AspAsp: 1.722 ± 0.0
4.478AspGlu: 4.478 ± 0.0
4.478AspPhe: 4.478 ± 0.0
2.411AspGly: 2.411 ± 0.0
1.722AspHis: 1.722 ± 0.0
3.445AspIle: 3.445 ± 0.0
2.411AspLys: 2.411 ± 0.0
3.789AspLeu: 3.789 ± 0.0
1.033AspMet: 1.033 ± 0.0
3.1AspAsn: 3.1 ± 0.0
2.067AspPro: 2.067 ± 0.0
1.722AspGln: 1.722 ± 0.0
2.411AspArg: 2.411 ± 0.0
4.478AspSer: 4.478 ± 0.0
2.411AspThr: 2.411 ± 0.0
1.378AspVal: 1.378 ± 0.0
1.378AspTrp: 1.378 ± 0.0
4.134AspTyr: 4.134 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.756GluAla: 2.756 ± 0.0
0.689GluCys: 0.689 ± 0.0
4.134GluAsp: 4.134 ± 0.0
4.134GluGlu: 4.134 ± 0.0
2.067GluPhe: 2.067 ± 0.0
3.1GluGly: 3.1 ± 0.0
0.344GluHis: 0.344 ± 0.0
2.756GluIle: 2.756 ± 0.0
4.823GluLys: 4.823 ± 0.0
5.856GluLeu: 5.856 ± 0.0
1.722GluMet: 1.722 ± 0.0
2.411GluAsn: 2.411 ± 0.0
2.411GluPro: 2.411 ± 0.0
1.722GluGln: 1.722 ± 0.0
2.411GluArg: 2.411 ± 0.0
4.478GluSer: 4.478 ± 0.0
2.756GluThr: 2.756 ± 0.0
3.789GluVal: 3.789 ± 0.0
1.722GluTrp: 1.722 ± 0.0
2.756GluTyr: 2.756 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.378PheAla: 1.378 ± 0.0
1.033PheCys: 1.033 ± 0.0
1.722PheAsp: 1.722 ± 0.0
1.378PheGlu: 1.378 ± 0.0
1.378PhePhe: 1.378 ± 0.0
3.1PheGly: 3.1 ± 0.0
0.0PheHis: 0.0 ± 0.0
3.1PheIle: 3.1 ± 0.0
3.1PheLys: 3.1 ± 0.0
2.411PheLeu: 2.411 ± 0.0
0.689PheMet: 0.689 ± 0.0
2.411PheAsn: 2.411 ± 0.0
1.378PhePro: 1.378 ± 0.0
1.722PheGln: 1.722 ± 0.0
1.378PheArg: 1.378 ± 0.0
3.1PheSer: 3.1 ± 0.0
1.033PheThr: 1.033 ± 0.0
4.134PheVal: 4.134 ± 0.0
0.689PheTrp: 0.689 ± 0.0
2.067PheTyr: 2.067 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.1GlyAla: 3.1 ± 0.0
0.689GlyCys: 0.689 ± 0.0
3.445GlyAsp: 3.445 ± 0.0
3.1GlyGlu: 3.1 ± 0.0
1.722GlyPhe: 1.722 ± 0.0
3.789GlyGly: 3.789 ± 0.0
1.378GlyHis: 1.378 ± 0.0
4.823GlyIle: 4.823 ± 0.0
4.823GlyLys: 4.823 ± 0.0
5.512GlyLeu: 5.512 ± 0.0
1.722GlyMet: 1.722 ± 0.0
3.445GlyAsn: 3.445 ± 0.0
3.1GlyPro: 3.1 ± 0.0
1.722GlyGln: 1.722 ± 0.0
1.722GlyArg: 1.722 ± 0.0
4.823GlySer: 4.823 ± 0.0
4.478GlyThr: 4.478 ± 0.0
2.411GlyVal: 2.411 ± 0.0
0.689GlyTrp: 0.689 ± 0.0
2.411GlyTyr: 2.411 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.378HisAla: 1.378 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.378HisAsp: 1.378 ± 0.0
1.378HisGlu: 1.378 ± 0.0
0.344HisPhe: 0.344 ± 0.0
1.033HisGly: 1.033 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.378HisIle: 1.378 ± 0.0
0.689HisLys: 0.689 ± 0.0
1.722HisLeu: 1.722 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.689HisAsn: 0.689 ± 0.0
1.722HisPro: 1.722 ± 0.0
1.033HisGln: 1.033 ± 0.0
1.722HisArg: 1.722 ± 0.0
0.689HisSer: 0.689 ± 0.0
2.756HisThr: 2.756 ± 0.0
1.378HisVal: 1.378 ± 0.0
0.344HisTrp: 0.344 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.823IleAla: 4.823 ± 0.0
1.378IleCys: 1.378 ± 0.0
4.134IleAsp: 4.134 ± 0.0
2.411IleGlu: 2.411 ± 0.0
2.067IlePhe: 2.067 ± 0.0
5.512IleGly: 5.512 ± 0.0
2.756IleHis: 2.756 ± 0.0
4.823IleIle: 4.823 ± 0.0
4.134IleLys: 4.134 ± 0.0
5.512IleLeu: 5.512 ± 0.0
2.411IleMet: 2.411 ± 0.0
4.134IleAsn: 4.134 ± 0.0
2.411IlePro: 2.411 ± 0.0
3.1IleGln: 3.1 ± 0.0
2.756IleArg: 2.756 ± 0.0
5.167IleSer: 5.167 ± 0.0
6.2IleThr: 6.2 ± 0.0
4.134IleVal: 4.134 ± 0.0
0.0IleTrp: 0.0 ± 0.0
4.478IleTyr: 4.478 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.478LysAla: 4.478 ± 0.0
0.689LysCys: 0.689 ± 0.0
3.445LysAsp: 3.445 ± 0.0
5.512LysGlu: 5.512 ± 0.0
2.411LysPhe: 2.411 ± 0.0
4.134LysGly: 4.134 ± 0.0
2.411LysHis: 2.411 ± 0.0
5.512LysIle: 5.512 ± 0.0
3.789LysLys: 3.789 ± 0.0
6.545LysLeu: 6.545 ± 0.0
2.067LysMet: 2.067 ± 0.0
3.1LysAsn: 3.1 ± 0.0
2.756LysPro: 2.756 ± 0.0
1.378LysGln: 1.378 ± 0.0
2.756LysArg: 2.756 ± 0.0
3.445LysSer: 3.445 ± 0.0
3.1LysThr: 3.1 ± 0.0
4.478LysVal: 4.478 ± 0.0
0.689LysTrp: 0.689 ± 0.0
3.1LysTyr: 3.1 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
8.267LeuAla: 8.267 ± 0.0
0.344LeuCys: 0.344 ± 0.0
4.134LeuAsp: 4.134 ± 0.0
5.167LeuGlu: 5.167 ± 0.0
2.756LeuPhe: 2.756 ± 0.0
3.445LeuGly: 3.445 ± 0.0
2.756LeuHis: 2.756 ± 0.0
3.445LeuIle: 3.445 ± 0.0
6.545LeuLys: 6.545 ± 0.0
7.234LeuLeu: 7.234 ± 0.0
3.1LeuMet: 3.1 ± 0.0
6.545LeuAsn: 6.545 ± 0.0
3.789LeuPro: 3.789 ± 0.0
3.789LeuGln: 3.789 ± 0.0
2.067LeuArg: 2.067 ± 0.0
6.2LeuSer: 6.2 ± 0.0
3.445LeuThr: 3.445 ± 0.0
6.2LeuVal: 6.2 ± 0.0
1.033LeuTrp: 1.033 ± 0.0
2.756LeuTyr: 2.756 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
3.1MetAla: 3.1 ± 0.0
0.344MetCys: 0.344 ± 0.0
2.756MetAsp: 2.756 ± 0.0
0.689MetGlu: 0.689 ± 0.0
1.378MetPhe: 1.378 ± 0.0
3.789MetGly: 3.789 ± 0.0
0.344MetHis: 0.344 ± 0.0
2.067MetIle: 2.067 ± 0.0
2.411MetLys: 2.411 ± 0.0
2.067MetLeu: 2.067 ± 0.0
0.689MetMet: 0.689 ± 0.0
1.722MetAsn: 1.722 ± 0.0
0.344MetPro: 0.344 ± 0.0
1.378MetGln: 1.378 ± 0.0
1.378MetArg: 1.378 ± 0.0
1.722MetSer: 1.722 ± 0.0
2.411MetThr: 2.411 ± 0.0
0.689MetVal: 0.689 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.033MetTyr: 1.033 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.134AsnAla: 4.134 ± 0.0
1.033AsnCys: 1.033 ± 0.0
3.445AsnAsp: 3.445 ± 0.0
1.378AsnGlu: 1.378 ± 0.0
2.756AsnPhe: 2.756 ± 0.0
3.1AsnGly: 3.1 ± 0.0
0.689AsnHis: 0.689 ± 0.0
5.167AsnIle: 5.167 ± 0.0
3.789AsnLys: 3.789 ± 0.0
3.1AsnLeu: 3.1 ± 0.0
1.378AsnMet: 1.378 ± 0.0
1.722AsnAsn: 1.722 ± 0.0
3.445AsnPro: 3.445 ± 0.0
1.722AsnGln: 1.722 ± 0.0
3.1AsnArg: 3.1 ± 0.0
5.167AsnSer: 5.167 ± 0.0
3.445AsnThr: 3.445 ± 0.0
2.411AsnVal: 2.411 ± 0.0
0.344AsnTrp: 0.344 ± 0.0
2.756AsnTyr: 2.756 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
4.823ProAla: 4.823 ± 0.0
1.033ProCys: 1.033 ± 0.0
2.756ProAsp: 2.756 ± 0.0
3.445ProGlu: 3.445 ± 0.0
1.033ProPhe: 1.033 ± 0.0
2.411ProGly: 2.411 ± 0.0
0.689ProHis: 0.689 ± 0.0
2.756ProIle: 2.756 ± 0.0
2.067ProLys: 2.067 ± 0.0
2.756ProLeu: 2.756 ± 0.0
0.344ProMet: 0.344 ± 0.0
4.478ProAsn: 4.478 ± 0.0
2.067ProPro: 2.067 ± 0.0
2.067ProGln: 2.067 ± 0.0
2.067ProArg: 2.067 ± 0.0
4.823ProSer: 4.823 ± 0.0
3.789ProThr: 3.789 ± 0.0
2.756ProVal: 2.756 ± 0.0
0.344ProTrp: 0.344 ± 0.0
1.378ProTyr: 1.378 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
4.478GlnAla: 4.478 ± 0.0
0.689GlnCys: 0.689 ± 0.0
2.067GlnAsp: 2.067 ± 0.0
1.722GlnGlu: 1.722 ± 0.0
2.411GlnPhe: 2.411 ± 0.0
1.722GlnGly: 1.722 ± 0.0
0.344GlnHis: 0.344 ± 0.0
2.411GlnIle: 2.411 ± 0.0
2.411GlnLys: 2.411 ± 0.0
3.789GlnLeu: 3.789 ± 0.0
2.067GlnMet: 2.067 ± 0.0
1.033GlnAsn: 1.033 ± 0.0
3.1GlnPro: 3.1 ± 0.0
2.067GlnGln: 2.067 ± 0.0
1.722GlnArg: 1.722 ± 0.0
2.756GlnSer: 2.756 ± 0.0
2.067GlnThr: 2.067 ± 0.0
1.722GlnVal: 1.722 ± 0.0
0.344GlnTrp: 0.344 ± 0.0
1.378GlnTyr: 1.378 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.411ArgAla: 2.411 ± 0.0
1.033ArgCys: 1.033 ± 0.0
2.411ArgAsp: 2.411 ± 0.0
2.756ArgGlu: 2.756 ± 0.0
0.689ArgPhe: 0.689 ± 0.0
2.411ArgGly: 2.411 ± 0.0
0.344ArgHis: 0.344 ± 0.0
3.789ArgIle: 3.789 ± 0.0
2.756ArgLys: 2.756 ± 0.0
4.823ArgLeu: 4.823 ± 0.0
1.722ArgMet: 1.722 ± 0.0
2.067ArgAsn: 2.067 ± 0.0
3.445ArgPro: 3.445 ± 0.0
1.722ArgGln: 1.722 ± 0.0
1.722ArgArg: 1.722 ± 0.0
1.033ArgSer: 1.033 ± 0.0
2.067ArgThr: 2.067 ± 0.0
2.756ArgVal: 2.756 ± 0.0
0.344ArgTrp: 0.344 ± 0.0
2.756ArgTyr: 2.756 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
5.167SerAla: 5.167 ± 0.0
0.344SerCys: 0.344 ± 0.0
4.134SerAsp: 4.134 ± 0.0
5.167SerGlu: 5.167 ± 0.0
4.478SerPhe: 4.478 ± 0.0
6.2SerGly: 6.2 ± 0.0
1.033SerHis: 1.033 ± 0.0
5.856SerIle: 5.856 ± 0.0
4.478SerLys: 4.478 ± 0.0
5.512SerLeu: 5.512 ± 0.0
1.033SerMet: 1.033 ± 0.0
5.512SerAsn: 5.512 ± 0.0
1.722SerPro: 1.722 ± 0.0
2.411SerGln: 2.411 ± 0.0
3.789SerArg: 3.789 ± 0.0
5.856SerSer: 5.856 ± 0.0
3.1SerThr: 3.1 ± 0.0
3.445SerVal: 3.445 ± 0.0
1.378SerTrp: 1.378 ± 0.0
1.722SerTyr: 1.722 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.445ThrAla: 3.445 ± 0.0
1.033ThrCys: 1.033 ± 0.0
3.1ThrAsp: 3.1 ± 0.0
3.445ThrGlu: 3.445 ± 0.0
1.722ThrPhe: 1.722 ± 0.0
2.756ThrGly: 2.756 ± 0.0
0.689ThrHis: 0.689 ± 0.0
6.2ThrIle: 6.2 ± 0.0
2.756ThrLys: 2.756 ± 0.0
3.789ThrLeu: 3.789 ± 0.0
2.411ThrMet: 2.411 ± 0.0
3.445ThrAsn: 3.445 ± 0.0
2.756ThrPro: 2.756 ± 0.0
3.445ThrGln: 3.445 ± 0.0
1.722ThrArg: 1.722 ± 0.0
5.856ThrSer: 5.856 ± 0.0
6.889ThrThr: 6.889 ± 0.0
4.823ThrVal: 4.823 ± 0.0
1.033ThrTrp: 1.033 ± 0.0
1.378ThrTyr: 1.378 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
5.512ValAla: 5.512 ± 0.0
1.378ValCys: 1.378 ± 0.0
1.378ValAsp: 1.378 ± 0.0
3.445ValGlu: 3.445 ± 0.0
1.722ValPhe: 1.722 ± 0.0
3.1ValGly: 3.1 ± 0.0
0.689ValHis: 0.689 ± 0.0
5.167ValIle: 5.167 ± 0.0
5.167ValLys: 5.167 ± 0.0
4.823ValLeu: 4.823 ± 0.0
2.067ValMet: 2.067 ± 0.0
2.411ValAsn: 2.411 ± 0.0
2.756ValPro: 2.756 ± 0.0
4.134ValGln: 4.134 ± 0.0
4.134ValArg: 4.134 ± 0.0
3.789ValSer: 3.789 ± 0.0
2.411ValThr: 2.411 ± 0.0
3.1ValVal: 3.1 ± 0.0
0.0ValTrp: 0.0 ± 0.0
1.033ValTyr: 1.033 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.344TrpAla: 0.344 ± 0.0
0.344TrpCys: 0.344 ± 0.0
0.689TrpAsp: 0.689 ± 0.0
1.378TrpGlu: 1.378 ± 0.0
0.344TrpPhe: 0.344 ± 0.0
1.033TrpGly: 1.033 ± 0.0
0.344TrpHis: 0.344 ± 0.0
1.033TrpIle: 1.033 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.378TrpLeu: 1.378 ± 0.0
0.344TrpMet: 0.344 ± 0.0
0.689TrpAsn: 0.689 ± 0.0
0.344TrpPro: 0.344 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.689TrpArg: 0.689 ± 0.0
0.689TrpSer: 0.689 ± 0.0
0.689TrpThr: 0.689 ± 0.0
0.689TrpVal: 0.689 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.344TrpTyr: 0.344 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.478TyrAla: 4.478 ± 0.0
0.0TyrCys: 0.0 ± 0.0
2.067TyrAsp: 2.067 ± 0.0
1.378TyrGlu: 1.378 ± 0.0
1.378TyrPhe: 1.378 ± 0.0
1.378TyrGly: 1.378 ± 0.0
0.344TyrHis: 0.344 ± 0.0
2.756TyrIle: 2.756 ± 0.0
4.823TyrLys: 4.823 ± 0.0
2.756TyrLeu: 2.756 ± 0.0
1.378TyrMet: 1.378 ± 0.0
2.411TyrAsn: 2.411 ± 0.0
2.411TyrPro: 2.411 ± 0.0
1.722TyrGln: 1.722 ± 0.0
1.722TyrArg: 1.722 ± 0.0
3.1TyrSer: 3.1 ± 0.0
3.1TyrThr: 3.1 ± 0.0
2.756TyrVal: 2.756 ± 0.0
0.344TyrTrp: 0.344 ± 0.0
1.378TyrTyr: 1.378 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2904 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski