Amino acid dipepetide frequency for Wenzhou picorna-like virus 36

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.701AlaAla: 3.701 ± 0.0
0.673AlaCys: 0.673 ± 0.0
4.038AlaAsp: 4.038 ± 0.0
2.019AlaGlu: 2.019 ± 0.0
3.028AlaPhe: 3.028 ± 0.0
3.701AlaGly: 3.701 ± 0.0
0.336AlaHis: 0.336 ± 0.0
4.374AlaIle: 4.374 ± 0.0
3.365AlaLys: 3.365 ± 0.0
4.711AlaLeu: 4.711 ± 0.0
0.673AlaMet: 0.673 ± 0.0
4.374AlaAsn: 4.374 ± 0.0
3.028AlaPro: 3.028 ± 0.0
0.673AlaGln: 0.673 ± 0.0
3.028AlaArg: 3.028 ± 0.0
4.374AlaSer: 4.374 ± 0.0
3.701AlaThr: 3.701 ± 0.0
6.393AlaVal: 6.393 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
1.346AlaTyr: 1.346 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
2.355CysAla: 2.355 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
1.682CysPhe: 1.682 ± 0.0
0.336CysGly: 0.336 ± 0.0
0.673CysHis: 0.673 ± 0.0
0.336CysIle: 0.336 ± 0.0
0.336CysLys: 0.336 ± 0.0
1.682CysLeu: 1.682 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.336CysAsn: 0.336 ± 0.0
1.346CysPro: 1.346 ± 0.0
0.336CysGln: 0.336 ± 0.0
0.336CysArg: 0.336 ± 0.0
1.682CysSer: 1.682 ± 0.0
1.682CysThr: 1.682 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.009CysTyr: 1.009 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.365AspAla: 3.365 ± 0.0
0.673AspCys: 0.673 ± 0.0
2.692AspAsp: 2.692 ± 0.0
4.711AspGlu: 4.711 ± 0.0
4.038AspPhe: 4.038 ± 0.0
2.355AspGly: 2.355 ± 0.0
1.346AspHis: 1.346 ± 0.0
3.365AspIle: 3.365 ± 0.0
5.047AspLys: 5.047 ± 0.0
5.384AspLeu: 5.384 ± 0.0
2.355AspMet: 2.355 ± 0.0
2.692AspAsn: 2.692 ± 0.0
2.355AspPro: 2.355 ± 0.0
2.019AspGln: 2.019 ± 0.0
1.682AspArg: 1.682 ± 0.0
4.038AspSer: 4.038 ± 0.0
1.682AspThr: 1.682 ± 0.0
3.028AspVal: 3.028 ± 0.0
1.009AspTrp: 1.009 ± 0.0
2.692AspTyr: 2.692 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.692GluAla: 2.692 ± 0.0
0.673GluCys: 0.673 ± 0.0
2.019GluAsp: 2.019 ± 0.0
2.019GluGlu: 2.019 ± 0.0
3.701GluPhe: 3.701 ± 0.0
2.355GluGly: 2.355 ± 0.0
2.019GluHis: 2.019 ± 0.0
5.72GluIle: 5.72 ± 0.0
5.72GluLys: 5.72 ± 0.0
4.038GluLeu: 4.038 ± 0.0
1.682GluMet: 1.682 ± 0.0
4.711GluAsn: 4.711 ± 0.0
2.355GluPro: 2.355 ± 0.0
3.365GluGln: 3.365 ± 0.0
1.682GluArg: 1.682 ± 0.0
5.047GluSer: 5.047 ± 0.0
2.019GluThr: 2.019 ± 0.0
4.038GluVal: 4.038 ± 0.0
1.346GluTrp: 1.346 ± 0.0
1.682GluTyr: 1.682 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.701PheAla: 3.701 ± 0.0
2.019PheCys: 2.019 ± 0.0
3.028PheAsp: 3.028 ± 0.0
3.365PheGlu: 3.365 ± 0.0
1.346PhePhe: 1.346 ± 0.0
3.365PheGly: 3.365 ± 0.0
1.682PheHis: 1.682 ± 0.0
4.038PheIle: 4.038 ± 0.0
5.384PheLys: 5.384 ± 0.0
5.72PheLeu: 5.72 ± 0.0
0.336PheMet: 0.336 ± 0.0
1.346PheAsn: 1.346 ± 0.0
1.346PhePro: 1.346 ± 0.0
2.692PheGln: 2.692 ± 0.0
2.019PheArg: 2.019 ± 0.0
7.402PheSer: 7.402 ± 0.0
3.028PheThr: 3.028 ± 0.0
3.028PheVal: 3.028 ± 0.0
0.336PheTrp: 0.336 ± 0.0
2.019PheTyr: 2.019 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.019GlyAla: 2.019 ± 0.0
0.336GlyCys: 0.336 ± 0.0
1.346GlyAsp: 1.346 ± 0.0
3.028GlyGlu: 3.028 ± 0.0
3.701GlyPhe: 3.701 ± 0.0
1.346GlyGly: 1.346 ± 0.0
0.336GlyHis: 0.336 ± 0.0
2.692GlyIle: 2.692 ± 0.0
5.72GlyLys: 5.72 ± 0.0
4.374GlyLeu: 4.374 ± 0.0
1.682GlyMet: 1.682 ± 0.0
2.692GlyAsn: 2.692 ± 0.0
1.682GlyPro: 1.682 ± 0.0
0.673GlyGln: 0.673 ± 0.0
0.673GlyArg: 0.673 ± 0.0
5.72GlySer: 5.72 ± 0.0
3.701GlyThr: 3.701 ± 0.0
3.028GlyVal: 3.028 ± 0.0
0.336GlyTrp: 0.336 ± 0.0
2.355GlyTyr: 2.355 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.673HisAla: 0.673 ± 0.0
0.673HisCys: 0.673 ± 0.0
1.682HisAsp: 1.682 ± 0.0
1.009HisGlu: 1.009 ± 0.0
0.336HisPhe: 0.336 ± 0.0
0.673HisGly: 0.673 ± 0.0
1.346HisHis: 1.346 ± 0.0
1.682HisIle: 1.682 ± 0.0
1.682HisLys: 1.682 ± 0.0
2.692HisLeu: 2.692 ± 0.0
0.336HisMet: 0.336 ± 0.0
1.346HisAsn: 1.346 ± 0.0
1.009HisPro: 1.009 ± 0.0
0.336HisGln: 0.336 ± 0.0
1.009HisArg: 1.009 ± 0.0
0.673HisSer: 0.673 ± 0.0
1.682HisThr: 1.682 ± 0.0
1.346HisVal: 1.346 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.009HisTyr: 1.009 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.374IleAla: 4.374 ± 0.0
2.019IleCys: 2.019 ± 0.0
2.019IleAsp: 2.019 ± 0.0
3.701IleGlu: 3.701 ± 0.0
4.711IlePhe: 4.711 ± 0.0
3.365IleGly: 3.365 ± 0.0
1.682IleHis: 1.682 ± 0.0
6.729IleIle: 6.729 ± 0.0
6.393IleLys: 6.393 ± 0.0
5.047IleLeu: 5.047 ± 0.0
1.009IleMet: 1.009 ± 0.0
4.374IleAsn: 4.374 ± 0.0
5.384IlePro: 5.384 ± 0.0
3.028IleGln: 3.028 ± 0.0
2.355IleArg: 2.355 ± 0.0
5.72IleSer: 5.72 ± 0.0
4.038IleThr: 4.038 ± 0.0
4.374IleVal: 4.374 ± 0.0
0.336IleTrp: 0.336 ± 0.0
2.019IleTyr: 2.019 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.355LysAla: 2.355 ± 0.0
0.673LysCys: 0.673 ± 0.0
6.057LysAsp: 6.057 ± 0.0
4.038LysGlu: 4.038 ± 0.0
5.72LysPhe: 5.72 ± 0.0
3.028LysGly: 3.028 ± 0.0
2.355LysHis: 2.355 ± 0.0
7.066LysIle: 7.066 ± 0.0
4.374LysLys: 4.374 ± 0.0
8.748LysLeu: 8.748 ± 0.0
1.346LysMet: 1.346 ± 0.0
3.365LysAsn: 3.365 ± 0.0
3.028LysPro: 3.028 ± 0.0
2.019LysGln: 2.019 ± 0.0
2.019LysArg: 2.019 ± 0.0
6.729LysSer: 6.729 ± 0.0
3.365LysThr: 3.365 ± 0.0
6.057LysVal: 6.057 ± 0.0
0.336LysTrp: 0.336 ± 0.0
2.692LysTyr: 2.692 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
8.748LeuAla: 8.748 ± 0.0
1.009LeuCys: 1.009 ± 0.0
7.739LeuAsp: 7.739 ± 0.0
7.402LeuGlu: 7.402 ± 0.0
3.365LeuPhe: 3.365 ± 0.0
5.047LeuGly: 5.047 ± 0.0
1.009LeuHis: 1.009 ± 0.0
4.374LeuIle: 4.374 ± 0.0
7.066LeuLys: 7.066 ± 0.0
5.047LeuLeu: 5.047 ± 0.0
1.682LeuMet: 1.682 ± 0.0
4.038LeuAsn: 4.038 ± 0.0
5.72LeuPro: 5.72 ± 0.0
3.028LeuGln: 3.028 ± 0.0
4.038LeuArg: 4.038 ± 0.0
5.72LeuSer: 5.72 ± 0.0
6.393LeuThr: 6.393 ± 0.0
6.393LeuVal: 6.393 ± 0.0
0.336LeuTrp: 0.336 ± 0.0
3.365LeuTyr: 3.365 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.346MetAla: 1.346 ± 0.0
0.336MetCys: 0.336 ± 0.0
0.336MetAsp: 0.336 ± 0.0
2.019MetGlu: 2.019 ± 0.0
0.673MetPhe: 0.673 ± 0.0
1.682MetGly: 1.682 ± 0.0
0.336MetHis: 0.336 ± 0.0
1.682MetIle: 1.682 ± 0.0
1.009MetLys: 1.009 ± 0.0
2.019MetLeu: 2.019 ± 0.0
0.336MetMet: 0.336 ± 0.0
0.673MetAsn: 0.673 ± 0.0
0.673MetPro: 0.673 ± 0.0
0.673MetGln: 0.673 ± 0.0
0.673MetArg: 0.673 ± 0.0
2.019MetSer: 2.019 ± 0.0
2.019MetThr: 2.019 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.673MetTrp: 0.673 ± 0.0
1.009MetTyr: 1.009 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.355AsnAla: 2.355 ± 0.0
1.009AsnCys: 1.009 ± 0.0
1.682AsnAsp: 1.682 ± 0.0
2.019AsnGlu: 2.019 ± 0.0
3.028AsnPhe: 3.028 ± 0.0
1.682AsnGly: 1.682 ± 0.0
0.673AsnHis: 0.673 ± 0.0
5.047AsnIle: 5.047 ± 0.0
5.047AsnLys: 5.047 ± 0.0
4.711AsnLeu: 4.711 ± 0.0
0.336AsnMet: 0.336 ± 0.0
2.355AsnAsn: 2.355 ± 0.0
2.019AsnPro: 2.019 ± 0.0
0.673AsnGln: 0.673 ± 0.0
2.019AsnArg: 2.019 ± 0.0
7.739AsnSer: 7.739 ± 0.0
3.028AsnThr: 3.028 ± 0.0
3.028AsnVal: 3.028 ± 0.0
0.0AsnTrp: 0.0 ± 0.0
2.692AsnTyr: 2.692 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.346ProAla: 1.346 ± 0.0
0.336ProCys: 0.336 ± 0.0
2.692ProAsp: 2.692 ± 0.0
2.355ProGlu: 2.355 ± 0.0
4.038ProPhe: 4.038 ± 0.0
2.355ProGly: 2.355 ± 0.0
1.346ProHis: 1.346 ± 0.0
5.047ProIle: 5.047 ± 0.0
2.355ProLys: 2.355 ± 0.0
4.374ProLeu: 4.374 ± 0.0
0.673ProMet: 0.673 ± 0.0
1.346ProAsn: 1.346 ± 0.0
2.019ProPro: 2.019 ± 0.0
1.682ProGln: 1.682 ± 0.0
1.682ProArg: 1.682 ± 0.0
2.355ProSer: 2.355 ± 0.0
4.038ProThr: 4.038 ± 0.0
2.692ProVal: 2.692 ± 0.0
0.673ProTrp: 0.673 ± 0.0
1.682ProTyr: 1.682 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.692GlnAla: 2.692 ± 0.0
0.673GlnCys: 0.673 ± 0.0
2.355GlnAsp: 2.355 ± 0.0
2.019GlnGlu: 2.019 ± 0.0
2.019GlnPhe: 2.019 ± 0.0
1.682GlnGly: 1.682 ± 0.0
0.0GlnHis: 0.0 ± 0.0
2.019GlnIle: 2.019 ± 0.0
3.028GlnLys: 3.028 ± 0.0
3.701GlnLeu: 3.701 ± 0.0
0.336GlnMet: 0.336 ± 0.0
2.019GlnAsn: 2.019 ± 0.0
0.336GlnPro: 0.336 ± 0.0
3.028GlnGln: 3.028 ± 0.0
1.682GlnArg: 1.682 ± 0.0
2.355GlnSer: 2.355 ± 0.0
2.019GlnThr: 2.019 ± 0.0
1.682GlnVal: 1.682 ± 0.0
0.673GlnTrp: 0.673 ± 0.0
1.346GlnTyr: 1.346 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.682ArgAla: 1.682 ± 0.0
0.336ArgCys: 0.336 ± 0.0
3.701ArgAsp: 3.701 ± 0.0
2.019ArgGlu: 2.019 ± 0.0
2.355ArgPhe: 2.355 ± 0.0
0.673ArgGly: 0.673 ± 0.0
0.673ArgHis: 0.673 ± 0.0
3.365ArgIle: 3.365 ± 0.0
2.019ArgLys: 2.019 ± 0.0
3.028ArgLeu: 3.028 ± 0.0
1.682ArgMet: 1.682 ± 0.0
1.346ArgAsn: 1.346 ± 0.0
0.336ArgPro: 0.336 ± 0.0
0.673ArgGln: 0.673 ± 0.0
1.009ArgArg: 1.009 ± 0.0
2.692ArgSer: 2.692 ± 0.0
2.355ArgThr: 2.355 ± 0.0
3.701ArgVal: 3.701 ± 0.0
0.673ArgTrp: 0.673 ± 0.0
1.682ArgTyr: 1.682 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.374SerAla: 4.374 ± 0.0
0.0SerCys: 0.0 ± 0.0
5.047SerAsp: 5.047 ± 0.0
5.384SerGlu: 5.384 ± 0.0
5.047SerPhe: 5.047 ± 0.0
5.047SerGly: 5.047 ± 0.0
1.009SerHis: 1.009 ± 0.0
4.374SerIle: 4.374 ± 0.0
5.72SerLys: 5.72 ± 0.0
8.748SerLeu: 8.748 ± 0.0
1.346SerMet: 1.346 ± 0.0
4.711SerAsn: 4.711 ± 0.0
4.374SerPro: 4.374 ± 0.0
2.692SerGln: 2.692 ± 0.0
3.701SerArg: 3.701 ± 0.0
6.057SerSer: 6.057 ± 0.0
7.402SerThr: 7.402 ± 0.0
4.374SerVal: 4.374 ± 0.0
0.0SerTrp: 0.0 ± 0.0
3.365SerTyr: 3.365 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
2.692ThrAla: 2.692 ± 0.0
1.682ThrCys: 1.682 ± 0.0
1.346ThrAsp: 1.346 ± 0.0
3.701ThrGlu: 3.701 ± 0.0
4.374ThrPhe: 4.374 ± 0.0
1.346ThrGly: 1.346 ± 0.0
2.019ThrHis: 2.019 ± 0.0
3.028ThrIle: 3.028 ± 0.0
1.682ThrLys: 1.682 ± 0.0
8.748ThrLeu: 8.748 ± 0.0
1.009ThrMet: 1.009 ± 0.0
3.365ThrAsn: 3.365 ± 0.0
4.038ThrPro: 4.038 ± 0.0
3.365ThrGln: 3.365 ± 0.0
2.692ThrArg: 2.692 ± 0.0
5.047ThrSer: 5.047 ± 0.0
3.365ThrThr: 3.365 ± 0.0
4.374ThrVal: 4.374 ± 0.0
1.346ThrTrp: 1.346 ± 0.0
2.692ThrTyr: 2.692 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.711ValAla: 4.711 ± 0.0
0.673ValCys: 0.673 ± 0.0
6.057ValAsp: 6.057 ± 0.0
4.374ValGlu: 4.374 ± 0.0
2.692ValPhe: 2.692 ± 0.0
4.374ValGly: 4.374 ± 0.0
1.009ValHis: 1.009 ± 0.0
3.365ValIle: 3.365 ± 0.0
5.72ValLys: 5.72 ± 0.0
3.365ValLeu: 3.365 ± 0.0
2.355ValMet: 2.355 ± 0.0
3.028ValAsn: 3.028 ± 0.0
3.365ValPro: 3.365 ± 0.0
2.019ValGln: 2.019 ± 0.0
2.355ValArg: 2.355 ± 0.0
5.384ValSer: 5.384 ± 0.0
2.692ValThr: 2.692 ± 0.0
4.038ValVal: 4.038 ± 0.0
0.673ValTrp: 0.673 ± 0.0
3.365ValTyr: 3.365 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.336TrpAla: 0.336 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.673TrpAsp: 0.673 ± 0.0
1.009TrpGlu: 1.009 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.336TrpGly: 0.336 ± 0.0
0.336TrpHis: 0.336 ± 0.0
1.009TrpIle: 1.009 ± 0.0
0.673TrpLys: 0.673 ± 0.0
1.682TrpLeu: 1.682 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.336TrpGln: 0.336 ± 0.0
0.336TrpArg: 0.336 ± 0.0
0.336TrpSer: 0.336 ± 0.0
0.336TrpThr: 0.336 ± 0.0
0.673TrpVal: 0.673 ± 0.0
0.336TrpTrp: 0.336 ± 0.0
0.673TrpTyr: 0.673 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.019TyrAla: 2.019 ± 0.0
0.336TyrCys: 0.336 ± 0.0
2.019TyrAsp: 2.019 ± 0.0
2.692TyrGlu: 2.692 ± 0.0
1.682TyrPhe: 1.682 ± 0.0
2.692TyrGly: 2.692 ± 0.0
1.009TyrHis: 1.009 ± 0.0
3.365TyrIle: 3.365 ± 0.0
3.028TyrLys: 3.028 ± 0.0
3.701TyrLeu: 3.701 ± 0.0
0.673TyrMet: 0.673 ± 0.0
3.028TyrAsn: 3.028 ± 0.0
0.673TyrPro: 0.673 ± 0.0
2.355TyrGln: 2.355 ± 0.0
1.009TyrArg: 1.009 ± 0.0
1.682TyrSer: 1.682 ± 0.0
3.365TyrThr: 3.365 ± 0.0
3.365TyrVal: 3.365 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
1.346TyrTyr: 1.346 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2973 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski