Amino acid dipeptide frequency for Wenzhou picorna-like virus 16

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
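
The calculation described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function names (dipeptide_per_mille, classify), the use of overlapping windows, the mapping of every unrecognized letter to X (Xaa), and the simple above/below comparison against the uniform expectation are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table header; "X" stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"

# Uniform expectation over the 400 standard dipeptides, in per mille (2.5).
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(sequence):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    # Assumption: any letter outside the 20 standard codes is treated as Xaa.
    seq = "".join(c if c in ALPHABET else "X" for c in sequence.upper())
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    counts = Counter(pairs)
    total = len(pairs)
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    if freq_per_mille > expected:
        return "over"    # would be marked red
    if freq_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"
```

For example, classify(3.277) returns "over" and classify(0.546) returns "under". A sequence of n residues yields n - 1 overlapping dipeptides under this assumption.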

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
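
As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction and into an approximate occurrence count. The helper names are hypothetical, and the total of 1831 dipeptides is an assumption: it is what a single protein of 1832 amino acids would give if overlapping pairs are counted.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_count(value, n_dipeptides):
    """Approximate number of occurrences behind a per-mille value."""
    return round(per_mille_to_fraction(value) * n_dipeptides)


# Assumption: 1832 residues in one protein give 1831 overlapping dipeptides.
N_DIPEPTIDES = 1832 - 1
```

Under that assumption, per_mille_to_count(0.546, N_DIPEPTIDES) gives 1 and per_mille_to_count(3.277, N_DIPEPTIDES) gives 6, which is consistent with the table values being multiples of roughly 0.546‰.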

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide and columns the second (for example, row Ala, column Cys is AlaCys). The estimated error of every value is ± 0.0.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  3.277  0.546  3.823  3.823  3.823  3.277  1.638  6.554  4.369  7.646  3.277  1.092  1.092  2.185  1.092  3.277  3.277  0.546  1.092  3.277    0.0
Cys  3.277    0.0  0.546  1.092  1.092  1.092  0.546  0.546  1.638  1.092    0.0  0.546  1.638    0.0  1.638  1.092    0.0  1.638  0.546  0.546    0.0
Asp  2.185    0.0  3.823  6.554  2.731  4.369  1.638  4.369  3.277    7.1  1.092  1.092  3.823  1.638  3.277  1.638  3.277  6.008  0.546  2.185    0.0
Glu  3.823  0.546  4.369  5.461  2.731  4.915  2.185  3.277  3.823  3.823  2.185  1.092  1.638  2.185  5.461  4.369  2.731  4.369  2.731  3.277    0.0
Phe  3.277  2.185  3.277  2.185  2.185  2.185  1.638  3.823  1.638  4.369  1.638  3.277  1.638  0.546  2.731  2.185  3.277  3.277  0.546  3.277    0.0
Gly  3.823    0.0  2.731  2.185  2.731  1.638  1.638  5.461  3.823  4.369  2.185  2.185  2.185  2.185  2.185  4.915  3.277  6.554  1.638  1.638    0.0
His  1.638  1.092  0.546  1.638  0.546  1.638    0.0  1.638  1.092  2.731  1.092    0.0  1.092  0.546  0.546  1.092  1.092  3.277    0.0  1.092    0.0
Ile  3.277  1.092  3.823  2.185  3.277  1.638  2.185  4.369  3.823  4.915  2.731  2.731  0.546  4.369  5.461  5.461  4.369  2.185  1.092  1.638    0.0
Lys  3.277  1.638  3.277  3.823  3.823  2.731  1.638  2.731  2.731  5.461  2.731  3.277  1.638  2.731  4.915  5.461  4.915  3.277    0.0  3.277    0.0
Leu  6.008  1.092    7.1  6.008  6.554    7.1  2.731  2.731  2.731  6.554  2.731  4.369  5.461  0.546  3.823  6.008  4.915    7.1    0.0  1.638    0.0
Met  2.731  0.546  2.185  2.185  4.369  2.185  0.546  0.546  2.185  1.092  1.092  1.092  0.546  1.092  1.092  1.638  3.277  2.731  1.638  2.731    0.0
Asn  2.185    0.0  2.185  2.185  1.638  2.731  0.546  3.277  2.731  2.731  0.546  2.731  3.823  0.546  2.731  2.731  2.185  3.823  0.546  1.638    0.0
Pro  3.277  0.546  1.638  1.638  1.092  1.638    0.0  1.092  3.277  3.277  0.546  1.638  1.092  2.731  1.092  2.185  1.638  3.277  2.731  0.546    0.0
Gln  2.731  2.185  1.092  3.823  0.546  2.731    0.0  1.092  1.638  2.185  0.546  2.731  0.546  1.092  1.638  3.823  1.638  3.823    0.0  1.638    0.0
Arg  2.185  0.546  4.915  3.823  2.185  3.277  1.092  2.731  4.915  5.461  1.638  3.823  0.546  2.185  3.823  4.915  1.092  5.461  1.638  1.638    0.0
Ser  4.915  1.638  3.277  4.915  1.638  3.277  0.546  2.731  4.915  9.285  5.461  1.092  3.277  1.092  4.369  4.369  4.369  2.185  0.546  2.731    0.0
Thr  3.823  1.092  2.185  2.185  1.092  3.823  1.092  3.277  4.369  4.369  2.185  3.277  1.638  3.277  3.823  2.731  2.185  3.277  1.638  3.823    0.0
Val  2.731  1.092  6.008  4.369  2.731  3.277  1.092  6.554  4.915  3.277  2.185  2.731  2.185  5.461  3.823  6.008  5.461  5.461  1.092  1.638    0.0
Trp    0.0  0.546  2.185  2.731  1.638    0.0    0.0  1.638  1.638  1.092    0.0  0.546    0.0  1.092  2.185  1.092  0.546  0.546    0.0  1.092    0.0
Tyr  0.546  2.185  2.185  2.185  2.731  3.823  1.638  2.731  3.823  4.369  1.092  2.185  0.546    0.0  2.185  1.638  1.638  3.277  0.546  1.092    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (1832 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
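
The note above can be illustrated with a short Python sketch of a protein-level bootstrap. The exact procedure used for this table is not described here, so everything below is an assumption: the function names are hypothetical, whole proteins are resampled with replacement, frequencies are pooled over the resampled set, and the error is taken as the population standard deviation of the 100 estimates.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Pooled per-mille frequency of one dipeptide over a list of sequences."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)
```

With a single protein every resample is identical, so bootstrap_error returns 0.0 for any dipeptide. Under these assumptions, that would explain why a proteome with one protein shows an error of ± 0.0 for every value.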

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski