Amino acid dipeptide frequency for Hubei toti-like virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
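The calculation described above can be sketched in Python. This is a minimal illustration, not the database's actual code: it assumes that dipeptides are counted as overlapping residue pairs, that any non-standard letter is pooled into Xaa (written X), and that over- and underrepresentation is judged against the uniform 2.5‰ expectation. The function names are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(sequence):
    """Map each observed dipeptide to its frequency in per mille."""
    # Assumption: anything outside the 20 standard codes is pooled as X (Xaa).
    seq = "".join(c if c in STANDARD else "X" for c in sequence.upper())
    # Assumption: overlapping pairs, so a sequence of length n yields n - 1 pairs.
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    if not pairs:
        return {}
    counts = Counter(pairs)
    return {dp: 1000.0 * n / len(pairs) for dp, n in counts.items()}


def representation(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Classify a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


freqs = dipeptide_per_mille("AAGA")  # toy sequence, not the viral protein
print(freqs)
print(representation(10.569), representation(0.407))
```

With the table values, AlaAla (10.569‰) would be classified as overrepresented and CysAla (0.407‰) as underrepresented under this assumed threshold.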

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
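As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into an approximate dipeptide count. It assumes overlapping pairs, so the single 2461-residue protein reported for this proteome would contain 2460 dipeptides. The helper names are hypothetical.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def approx_count(value_per_mille, n_dipeptides):
    """Estimate how many occurrences a per mille frequency corresponds to."""
    return round(per_mille_to_fraction(value_per_mille) * n_dipeptides)


# Assumption: one protein of 2461 residues gives 2461 - 1 overlapping dipeptides.
N_DIPEPTIDES = 2461 - 1

print(per_mille_to_fraction(10.569))       # AlaAla as a fraction, about 0.0106
print(approx_count(10.569, N_DIPEPTIDES))  # 26
print(approx_count(0.407, N_DIPEPTIDES))   # 1
```

Under this assumption, the smallest non-zero entry in the table (0.407‰) corresponds to a single occurrence.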

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰). Rows: first residue; columns: second residue. Estimated error: ± 0.0 for every entry.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala 10.569   1.22  3.659  4.472  3.659  8.943  1.626  2.846  6.504  6.098  2.846  1.626  4.472  4.878  8.537  6.504  4.878  7.724  0.813  2.439    0.0
Cys  0.407    0.0  0.813  0.813  0.813  2.846   1.22    0.0  0.813  0.813  0.407  0.813  0.813   1.22  2.033  0.813  0.813   1.22  0.813  0.407    0.0
Asp  5.691  0.407  2.033  5.691  0.407  2.846  0.813  1.626  0.407  4.878    0.0  1.626  2.033  2.439  2.846  2.846  2.439  2.846   1.22  2.439    0.0
Glu  6.504  2.439  4.878  7.317  2.846  3.252  2.033  2.846  3.252  6.911  3.252  2.846  4.878  5.285  6.098  5.285  4.065  5.691  1.626  0.813    0.0
Phe  3.252  0.813   1.22   1.22  1.626  1.626    0.0  2.033    0.0  2.033  0.407    0.0   1.22  0.813  1.626   1.22   1.22  1.626  0.813  0.813    0.0
Gly  7.317  1.626  4.472  6.504  2.439   9.35  2.439  3.252  4.065  5.285  2.846  4.065  4.065  2.846  5.285  7.317  3.659  3.659   1.22  3.252    0.0
His  2.439  0.407  0.407   1.22    0.0  0.813    0.0  0.407   1.22  2.439   1.22  1.626  1.626  0.407  0.813  0.407   1.22  2.033   1.22   1.22    0.0
Ile  3.659   1.22  2.846  2.033  0.407  3.659   1.22  0.813  1.626  2.846  0.813   1.22  1.626  0.813  3.659  1.626  3.252   1.22  0.407   1.22    0.0
Lys  2.846    0.0  1.626  4.878  0.407  2.033  0.813  2.846  4.065  2.033   1.22  2.846  2.439  1.626  3.659  2.439  2.033  2.439  1.626  1.626    0.0
Leu 11.382  0.407  5.285  6.098  1.626  6.911  0.813  4.065  2.033  6.911  2.439  2.439  5.285  4.878  7.724  3.252  1.626  4.878   1.22  3.252    0.0
Met  1.626  0.813   1.22  1.626  0.407  3.252  0.813  0.407   1.22  3.252  0.407  1.626  3.252  2.033  2.033  0.813   1.22  1.626  0.407   1.22    0.0
Asn   1.22  0.813  1.626  1.626  0.407  2.846  0.813   1.22    0.0  2.033   1.22   1.22   1.22  1.626  2.033  0.813  3.252  2.439  0.813  1.626    0.0
Pro  3.659  0.407  0.813   8.13  0.813  5.691  0.407  2.846  2.033  5.285  2.439   1.22  1.626  0.407  4.472  3.659  1.626  2.033  0.407  0.813    0.0
Gln  4.878   1.22  1.626  3.252  0.407  1.626  1.626   1.22  3.252  4.065  2.033  0.407  1.626  2.033  3.252  1.626  3.659  2.439  2.033  1.626    0.0
Arg  5.691    0.0  3.252  3.659   1.22  6.911  2.033  3.252  5.285  7.724  2.439  0.813  2.439  2.033  8.537  6.098  6.098  6.911  2.846  2.846    0.0
Ser  5.691  1.626  2.439  7.317  0.813  7.724  0.813  2.033  2.439  4.065  0.813   1.22  2.439  3.252  5.691  2.033  4.472  2.846   1.22  1.626    0.0
Thr  6.911  0.407  2.033  5.285  2.033  2.846   1.22  2.033   1.22  5.691  1.626  0.407  2.846  0.813  2.033  4.878  4.065  3.252   1.22  3.659    0.0
Val  5.691  3.252  3.252  6.911  1.626  6.911   1.22   1.22   1.22  5.285   1.22   1.22  1.626  3.659  5.285  5.285  1.626  4.065  1.626   1.22    0.0
Trp   1.22  0.813  0.813  2.846   1.22  1.626   1.22  0.813   1.22  2.846  0.407  0.407  2.033   1.22  0.813  0.813  0.407  0.407  0.407  2.033    0.0
Tyr  3.659    0.0   1.22   1.22  0.813  3.252    0.0  0.407  1.626  2.033  0.813   1.22   1.22  1.626  3.659  2.846  2.846  4.065  1.626   1.22    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (2461 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level. With a single protein, every resample is identical, so all errors are 0.0.
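A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's implementation: whole proteins are resampled with replacement, dipeptide counts are pooled over each resample, and the error is taken as the standard deviation of the 100 resampled frequencies. The function names and the choice of standard deviation are hypothetical.

```python
import random
from statistics import pstdev


def pooled_frequency(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide pooled over a list of sequences."""
    count = total = 0
    for seq in proteins:
        # Assumption: overlapping pairs within each protein.
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        count += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pooled_frequency(resample, dipeptide))
    # Assumption: the reported error is a standard deviation of the estimates.
    return pstdev(estimates)


# One protein: every resample is the same protein, so the error is 0.0.
print(bootstrap_error(["MAAGA"], "AA"))
# Several proteins: resamples differ, so the error is non-zero.
print(bootstrap_error(["AAAA", "GGGG"], "AA"))
```

For a proteome with one protein, as here, this procedure can only return 0.0, which matches the ± 0.0 shown for every entry.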

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski