Amino acid dipepetide frequency for Wenzhou picorna-like virus 18

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.336AlaAla: 5.336 ± 0.0
1.601AlaCys: 1.601 ± 0.0
1.601AlaAsp: 1.601 ± 0.0
8.538AlaGlu: 8.538 ± 0.0
2.134AlaPhe: 2.134 ± 0.0
2.134AlaGly: 2.134 ± 0.0
2.134AlaHis: 2.134 ± 0.0
4.269AlaIle: 4.269 ± 0.0
4.269AlaLys: 4.269 ± 0.0
7.471AlaLeu: 7.471 ± 0.0
1.067AlaMet: 1.067 ± 0.0
1.601AlaAsn: 1.601 ± 0.0
1.067AlaPro: 1.067 ± 0.0
4.803AlaGln: 4.803 ± 0.0
1.601AlaArg: 1.601 ± 0.0
4.269AlaSer: 4.269 ± 0.0
2.134AlaThr: 2.134 ± 0.0
5.336AlaVal: 5.336 ± 0.0
2.134AlaTrp: 2.134 ± 0.0
2.134AlaTyr: 2.134 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.601CysAla: 1.601 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.534CysAsp: 0.534 ± 0.0
1.067CysGlu: 1.067 ± 0.0
0.534CysPhe: 0.534 ± 0.0
2.134CysGly: 2.134 ± 0.0
1.601CysHis: 1.601 ± 0.0
2.134CysIle: 2.134 ± 0.0
2.134CysLys: 2.134 ± 0.0
3.202CysLeu: 3.202 ± 0.0
0.534CysMet: 0.534 ± 0.0
2.134CysAsn: 2.134 ± 0.0
1.067CysPro: 1.067 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.534CysArg: 0.534 ± 0.0
1.601CysSer: 1.601 ± 0.0
1.601CysThr: 1.601 ± 0.0
1.067CysVal: 1.067 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.534CysTyr: 0.534 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.937AspAla: 6.937 ± 0.0
1.601AspCys: 1.601 ± 0.0
3.735AspAsp: 3.735 ± 0.0
4.269AspGlu: 4.269 ± 0.0
2.668AspPhe: 2.668 ± 0.0
3.735AspGly: 3.735 ± 0.0
1.067AspHis: 1.067 ± 0.0
2.668AspIle: 2.668 ± 0.0
3.735AspLys: 3.735 ± 0.0
5.87AspLeu: 5.87 ± 0.0
1.067AspMet: 1.067 ± 0.0
2.134AspAsn: 2.134 ± 0.0
1.601AspPro: 1.601 ± 0.0
0.534AspGln: 0.534 ± 0.0
0.534AspArg: 0.534 ± 0.0
2.668AspSer: 2.668 ± 0.0
4.269AspThr: 4.269 ± 0.0
3.735AspVal: 3.735 ± 0.0
0.0AspTrp: 0.0 ± 0.0
3.735AspTyr: 3.735 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.134GluAla: 2.134 ± 0.0
2.668GluCys: 2.668 ± 0.0
3.735GluAsp: 3.735 ± 0.0
4.803GluGlu: 4.803 ± 0.0
4.269GluPhe: 4.269 ± 0.0
4.269GluGly: 4.269 ± 0.0
1.067GluHis: 1.067 ± 0.0
4.803GluIle: 4.803 ± 0.0
5.336GluLys: 5.336 ± 0.0
4.269GluLeu: 4.269 ± 0.0
1.601GluMet: 1.601 ± 0.0
2.134GluAsn: 2.134 ± 0.0
0.534GluPro: 0.534 ± 0.0
2.668GluGln: 2.668 ± 0.0
5.336GluArg: 5.336 ± 0.0
4.803GluSer: 4.803 ± 0.0
4.269GluThr: 4.269 ± 0.0
4.269GluVal: 4.269 ± 0.0
1.067GluTrp: 1.067 ± 0.0
3.202GluTyr: 3.202 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.601PheAla: 1.601 ± 0.0
1.067PheCys: 1.067 ± 0.0
2.134PheAsp: 2.134 ± 0.0
2.668PheGlu: 2.668 ± 0.0
3.735PhePhe: 3.735 ± 0.0
1.067PheGly: 1.067 ± 0.0
1.067PheHis: 1.067 ± 0.0
1.601PheIle: 1.601 ± 0.0
1.067PheLys: 1.067 ± 0.0
6.403PheLeu: 6.403 ± 0.0
1.067PheMet: 1.067 ± 0.0
0.534PheAsn: 0.534 ± 0.0
2.134PhePro: 2.134 ± 0.0
2.134PheGln: 2.134 ± 0.0
1.601PheArg: 1.601 ± 0.0
2.668PheSer: 2.668 ± 0.0
3.202PheThr: 3.202 ± 0.0
3.202PheVal: 3.202 ± 0.0
0.534PheTrp: 0.534 ± 0.0
1.601PheTyr: 1.601 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.803GlyAla: 4.803 ± 0.0
0.534GlyCys: 0.534 ± 0.0
3.202GlyAsp: 3.202 ± 0.0
0.534GlyGlu: 0.534 ± 0.0
2.134GlyPhe: 2.134 ± 0.0
2.668GlyGly: 2.668 ± 0.0
2.134GlyHis: 2.134 ± 0.0
4.269GlyIle: 4.269 ± 0.0
5.336GlyLys: 5.336 ± 0.0
4.269GlyLeu: 4.269 ± 0.0
0.534GlyMet: 0.534 ± 0.0
3.735GlyAsn: 3.735 ± 0.0
3.735GlyPro: 3.735 ± 0.0
2.134GlyGln: 2.134 ± 0.0
2.134GlyArg: 2.134 ± 0.0
5.87GlySer: 5.87 ± 0.0
2.668GlyThr: 2.668 ± 0.0
8.538GlyVal: 8.538 ± 0.0
0.534GlyTrp: 0.534 ± 0.0
3.202GlyTyr: 3.202 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.067HisAla: 1.067 ± 0.0
1.067HisCys: 1.067 ± 0.0
3.735HisAsp: 3.735 ± 0.0
1.601HisGlu: 1.601 ± 0.0
2.134HisPhe: 2.134 ± 0.0
2.668HisGly: 2.668 ± 0.0
2.134HisHis: 2.134 ± 0.0
1.601HisIle: 1.601 ± 0.0
0.0HisLys: 0.0 ± 0.0
3.735HisLeu: 3.735 ± 0.0
1.067HisMet: 1.067 ± 0.0
2.668HisAsn: 2.668 ± 0.0
3.202HisPro: 3.202 ± 0.0
0.534HisGln: 0.534 ± 0.0
1.601HisArg: 1.601 ± 0.0
2.134HisSer: 2.134 ± 0.0
0.534HisThr: 0.534 ± 0.0
0.534HisVal: 0.534 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.067HisTyr: 1.067 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.735IleAla: 3.735 ± 0.0
0.534IleCys: 0.534 ± 0.0
5.336IleAsp: 5.336 ± 0.0
3.202IleGlu: 3.202 ± 0.0
3.202IlePhe: 3.202 ± 0.0
3.735IleGly: 3.735 ± 0.0
1.601IleHis: 1.601 ± 0.0
2.668IleIle: 2.668 ± 0.0
0.534IleLys: 0.534 ± 0.0
3.735IleLeu: 3.735 ± 0.0
2.134IleMet: 2.134 ± 0.0
2.134IleAsn: 2.134 ± 0.0
3.202IlePro: 3.202 ± 0.0
1.067IleGln: 1.067 ± 0.0
2.134IleArg: 2.134 ± 0.0
6.937IleSer: 6.937 ± 0.0
3.202IleThr: 3.202 ± 0.0
3.202IleVal: 3.202 ± 0.0
0.0IleTrp: 0.0 ± 0.0
2.134IleTyr: 2.134 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.269LysAla: 4.269 ± 0.0
5.336LysCys: 5.336 ± 0.0
3.735LysAsp: 3.735 ± 0.0
4.269LysGlu: 4.269 ± 0.0
2.668LysPhe: 2.668 ± 0.0
3.735LysGly: 3.735 ± 0.0
2.668LysHis: 2.668 ± 0.0
7.471LysIle: 7.471 ± 0.0
3.202LysLys: 3.202 ± 0.0
3.735LysLeu: 3.735 ± 0.0
2.668LysMet: 2.668 ± 0.0
1.067LysAsn: 1.067 ± 0.0
2.668LysPro: 2.668 ± 0.0
2.134LysGln: 2.134 ± 0.0
6.403LysArg: 6.403 ± 0.0
5.87LysSer: 5.87 ± 0.0
3.202LysThr: 3.202 ± 0.0
3.202LysVal: 3.202 ± 0.0
1.067LysTrp: 1.067 ± 0.0
2.134LysTyr: 2.134 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
7.471LeuAla: 7.471 ± 0.0
2.668LeuCys: 2.668 ± 0.0
3.735LeuAsp: 3.735 ± 0.0
5.87LeuGlu: 5.87 ± 0.0
2.668LeuPhe: 2.668 ± 0.0
5.336LeuGly: 5.336 ± 0.0
3.735LeuHis: 3.735 ± 0.0
6.403LeuIle: 6.403 ± 0.0
10.139LeuLys: 10.139 ± 0.0
6.937LeuLeu: 6.937 ± 0.0
1.601LeuMet: 1.601 ± 0.0
2.134LeuAsn: 2.134 ± 0.0
3.202LeuPro: 3.202 ± 0.0
2.134LeuGln: 2.134 ± 0.0
2.134LeuArg: 2.134 ± 0.0
5.87LeuSer: 5.87 ± 0.0
1.601LeuThr: 1.601 ± 0.0
6.937LeuVal: 6.937 ± 0.0
2.134LeuTrp: 2.134 ± 0.0
3.735LeuTyr: 3.735 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.668MetAla: 2.668 ± 0.0
0.534MetCys: 0.534 ± 0.0
1.067MetAsp: 1.067 ± 0.0
2.134MetGlu: 2.134 ± 0.0
0.534MetPhe: 0.534 ± 0.0
2.134MetGly: 2.134 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.067MetIle: 1.067 ± 0.0
4.269MetLys: 4.269 ± 0.0
1.601MetLeu: 1.601 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.534MetPro: 0.534 ± 0.0
0.0MetGln: 0.0 ± 0.0
3.202MetArg: 3.202 ± 0.0
2.134MetSer: 2.134 ± 0.0
1.601MetThr: 1.601 ± 0.0
1.067MetVal: 1.067 ± 0.0
0.534MetTrp: 0.534 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.134AsnAla: 2.134 ± 0.0
1.601AsnCys: 1.601 ± 0.0
1.067AsnAsp: 1.067 ± 0.0
2.668AsnGlu: 2.668 ± 0.0
0.534AsnPhe: 0.534 ± 0.0
3.202AsnGly: 3.202 ± 0.0
0.0AsnHis: 0.0 ± 0.0
0.0AsnIle: 0.0 ± 0.0
1.601AsnLys: 1.601 ± 0.0
2.668AsnLeu: 2.668 ± 0.0
2.134AsnMet: 2.134 ± 0.0
1.601AsnAsn: 1.601 ± 0.0
2.668AsnPro: 2.668 ± 0.0
0.0AsnGln: 0.0 ± 0.0
2.134AsnArg: 2.134 ± 0.0
4.803AsnSer: 4.803 ± 0.0
4.269AsnThr: 4.269 ± 0.0
3.202AsnVal: 3.202 ± 0.0
1.601AsnTrp: 1.601 ± 0.0
2.134AsnTyr: 2.134 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
0.534ProAla: 0.534 ± 0.0
0.0ProCys: 0.0 ± 0.0
4.269ProAsp: 4.269 ± 0.0
6.403ProGlu: 6.403 ± 0.0
1.067ProPhe: 1.067 ± 0.0
1.601ProGly: 1.601 ± 0.0
2.134ProHis: 2.134 ± 0.0
1.601ProIle: 1.601 ± 0.0
3.202ProLys: 3.202 ± 0.0
3.202ProLeu: 3.202 ± 0.0
0.534ProMet: 0.534 ± 0.0
1.601ProAsn: 1.601 ± 0.0
1.067ProPro: 1.067 ± 0.0
0.534ProGln: 0.534 ± 0.0
0.0ProArg: 0.0 ± 0.0
2.668ProSer: 2.668 ± 0.0
2.668ProThr: 2.668 ± 0.0
2.668ProVal: 2.668 ± 0.0
1.067ProTrp: 1.067 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.668GlnAla: 2.668 ± 0.0
0.534GlnCys: 0.534 ± 0.0
2.668GlnAsp: 2.668 ± 0.0
4.803GlnGlu: 4.803 ± 0.0
1.067GlnPhe: 1.067 ± 0.0
0.534GlnGly: 0.534 ± 0.0
1.601GlnHis: 1.601 ± 0.0
0.0GlnIle: 0.0 ± 0.0
2.134GlnLys: 2.134 ± 0.0
1.601GlnLeu: 1.601 ± 0.0
0.534GlnMet: 0.534 ± 0.0
0.534GlnAsn: 0.534 ± 0.0
0.534GlnPro: 0.534 ± 0.0
0.0GlnGln: 0.0 ± 0.0
3.202GlnArg: 3.202 ± 0.0
1.067GlnSer: 1.067 ± 0.0
0.534GlnThr: 0.534 ± 0.0
2.134GlnVal: 2.134 ± 0.0
1.601GlnTrp: 1.601 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.134ArgAla: 2.134 ± 0.0
1.067ArgCys: 1.067 ± 0.0
3.202ArgAsp: 3.202 ± 0.0
2.668ArgGlu: 2.668 ± 0.0
1.067ArgPhe: 1.067 ± 0.0
4.269ArgGly: 4.269 ± 0.0
3.202ArgHis: 3.202 ± 0.0
2.134ArgIle: 2.134 ± 0.0
4.269ArgLys: 4.269 ± 0.0
2.134ArgLeu: 2.134 ± 0.0
1.601ArgMet: 1.601 ± 0.0
2.668ArgAsn: 2.668 ± 0.0
0.534ArgPro: 0.534 ± 0.0
1.067ArgGln: 1.067 ± 0.0
2.668ArgArg: 2.668 ± 0.0
4.803ArgSer: 4.803 ± 0.0
2.668ArgThr: 2.668 ± 0.0
3.202ArgVal: 3.202 ± 0.0
1.601ArgTrp: 1.601 ± 0.0
3.735ArgTyr: 3.735 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
5.336SerAla: 5.336 ± 0.0
1.067SerCys: 1.067 ± 0.0
4.269SerAsp: 4.269 ± 0.0
3.735SerGlu: 3.735 ± 0.0
4.269SerPhe: 4.269 ± 0.0
7.471SerGly: 7.471 ± 0.0
2.134SerHis: 2.134 ± 0.0
3.202SerIle: 3.202 ± 0.0
5.336SerLys: 5.336 ± 0.0
9.605SerLeu: 9.605 ± 0.0
1.067SerMet: 1.067 ± 0.0
1.601SerAsn: 1.601 ± 0.0
2.134SerPro: 2.134 ± 0.0
2.668SerGln: 2.668 ± 0.0
3.202SerArg: 3.202 ± 0.0
4.803SerSer: 4.803 ± 0.0
4.269SerThr: 4.269 ± 0.0
5.336SerVal: 5.336 ± 0.0
0.0SerTrp: 0.0 ± 0.0
2.668SerTyr: 2.668 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.269ThrAla: 4.269 ± 0.0
1.067ThrCys: 1.067 ± 0.0
2.668ThrAsp: 2.668 ± 0.0
3.202ThrGlu: 3.202 ± 0.0
0.534ThrPhe: 0.534 ± 0.0
4.803ThrGly: 4.803 ± 0.0
0.0ThrHis: 0.0 ± 0.0
3.735ThrIle: 3.735 ± 0.0
3.735ThrLys: 3.735 ± 0.0
9.072ThrLeu: 9.072 ± 0.0
1.601ThrMet: 1.601 ± 0.0
2.134ThrAsn: 2.134 ± 0.0
2.134ThrPro: 2.134 ± 0.0
2.134ThrGln: 2.134 ± 0.0
3.735ThrArg: 3.735 ± 0.0
2.134ThrSer: 2.134 ± 0.0
2.668ThrThr: 2.668 ± 0.0
3.202ThrVal: 3.202 ± 0.0
0.0ThrTrp: 0.0 ± 0.0
1.601ThrTyr: 1.601 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.735ValAla: 3.735 ± 0.0
0.0ValCys: 0.0 ± 0.0
3.202ValAsp: 3.202 ± 0.0
1.601ValGlu: 1.601 ± 0.0
2.668ValPhe: 2.668 ± 0.0
5.336ValGly: 5.336 ± 0.0
2.134ValHis: 2.134 ± 0.0
1.601ValIle: 1.601 ± 0.0
6.403ValLys: 6.403 ± 0.0
3.735ValLeu: 3.735 ± 0.0
1.601ValMet: 1.601 ± 0.0
4.269ValAsn: 4.269 ± 0.0
4.269ValPro: 4.269 ± 0.0
2.134ValGln: 2.134 ± 0.0
6.403ValArg: 6.403 ± 0.0
5.87ValSer: 5.87 ± 0.0
5.87ValThr: 5.87 ± 0.0
5.87ValVal: 5.87 ± 0.0
1.067ValTrp: 1.067 ± 0.0
4.269ValTyr: 4.269 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.534TrpAla: 0.534 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.534TrpAsp: 0.534 ± 0.0
2.134TrpGlu: 2.134 ± 0.0
1.067TrpPhe: 1.067 ± 0.0
0.534TrpGly: 0.534 ± 0.0
1.067TrpHis: 1.067 ± 0.0
1.067TrpIle: 1.067 ± 0.0
1.067TrpLys: 1.067 ± 0.0
1.067TrpLeu: 1.067 ± 0.0
1.067TrpMet: 1.067 ± 0.0
1.601TrpAsn: 1.601 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.067TrpSer: 1.067 ± 0.0
1.601TrpThr: 1.601 ± 0.0
1.067TrpVal: 1.067 ± 0.0
0.534TrpTrp: 0.534 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.668TyrAla: 2.668 ± 0.0
1.067TyrCys: 1.067 ± 0.0
2.134TyrAsp: 2.134 ± 0.0
1.067TyrGlu: 1.067 ± 0.0
2.134TyrPhe: 2.134 ± 0.0
1.601TyrGly: 1.601 ± 0.0
1.601TyrHis: 1.601 ± 0.0
2.134TyrIle: 2.134 ± 0.0
3.735TyrLys: 3.735 ± 0.0
2.668TyrLeu: 2.668 ± 0.0
1.067TyrMet: 1.067 ± 0.0
3.735TyrAsn: 3.735 ± 0.0
0.534TyrPro: 0.534 ± 0.0
1.067TyrGln: 1.067 ± 0.0
2.134TyrArg: 2.134 ± 0.0
2.134TyrSer: 2.134 ± 0.0
1.601TyrThr: 1.601 ± 0.0
4.269TyrVal: 4.269 ± 0.0
0.534TyrTrp: 0.534 ± 0.0
2.134TyrTyr: 2.134 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (1875 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski