Amino acid dipepetide frequency for Wenzhou picorna-like virus 49

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.009AlaAla: 8.009 ± 0.0
0.0AlaCys: 0.0 ± 0.0
6.189AlaAsp: 6.189 ± 0.0
5.096AlaGlu: 5.096 ± 0.0
4.004AlaPhe: 4.004 ± 0.0
8.373AlaGly: 8.373 ± 0.0
1.092AlaHis: 1.092 ± 0.0
5.461AlaIle: 5.461 ± 0.0
6.189AlaLys: 6.189 ± 0.0
7.645AlaLeu: 7.645 ± 0.0
2.548AlaMet: 2.548 ± 0.0
3.276AlaAsn: 3.276 ± 0.0
7.281AlaPro: 7.281 ± 0.0
1.82AlaGln: 1.82 ± 0.0
5.096AlaArg: 5.096 ± 0.0
6.189AlaSer: 6.189 ± 0.0
5.096AlaThr: 5.096 ± 0.0
6.917AlaVal: 6.917 ± 0.0
0.728AlaTrp: 0.728 ± 0.0
1.82AlaTyr: 1.82 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.456CysAla: 1.456 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.728CysAsp: 0.728 ± 0.0
1.092CysGlu: 1.092 ± 0.0
1.092CysPhe: 1.092 ± 0.0
1.092CysGly: 1.092 ± 0.0
0.364CysHis: 0.364 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.456CysLys: 1.456 ± 0.0
1.82CysLeu: 1.82 ± 0.0
0.728CysMet: 0.728 ± 0.0
0.728CysAsn: 0.728 ± 0.0
0.728CysPro: 0.728 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.728CysArg: 0.728 ± 0.0
1.092CysSer: 1.092 ± 0.0
0.728CysThr: 0.728 ± 0.0
1.456CysVal: 1.456 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.189AspAla: 6.189 ± 0.0
1.092AspCys: 1.092 ± 0.0
4.732AspAsp: 4.732 ± 0.0
3.64AspGlu: 3.64 ± 0.0
2.548AspPhe: 2.548 ± 0.0
4.368AspGly: 4.368 ± 0.0
2.184AspHis: 2.184 ± 0.0
3.276AspIle: 3.276 ± 0.0
2.912AspLys: 2.912 ± 0.0
3.64AspLeu: 3.64 ± 0.0
2.184AspMet: 2.184 ± 0.0
1.456AspAsn: 1.456 ± 0.0
3.276AspPro: 3.276 ± 0.0
0.728AspGln: 0.728 ± 0.0
0.728AspArg: 0.728 ± 0.0
4.004AspSer: 4.004 ± 0.0
2.912AspThr: 2.912 ± 0.0
5.096AspVal: 5.096 ± 0.0
0.364AspTrp: 0.364 ± 0.0
1.456AspTyr: 1.456 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.276GluAla: 3.276 ± 0.0
1.82GluCys: 1.82 ± 0.0
4.368GluAsp: 4.368 ± 0.0
3.64GluGlu: 3.64 ± 0.0
2.184GluPhe: 2.184 ± 0.0
1.092GluGly: 1.092 ± 0.0
0.364GluHis: 0.364 ± 0.0
2.548GluIle: 2.548 ± 0.0
2.184GluLys: 2.184 ± 0.0
4.732GluLeu: 4.732 ± 0.0
1.456GluMet: 1.456 ± 0.0
1.092GluAsn: 1.092 ± 0.0
2.548GluPro: 2.548 ± 0.0
1.456GluGln: 1.456 ± 0.0
1.82GluArg: 1.82 ± 0.0
1.456GluSer: 1.456 ± 0.0
3.276GluThr: 3.276 ± 0.0
8.009GluVal: 8.009 ± 0.0
0.728GluTrp: 0.728 ± 0.0
0.728GluTyr: 0.728 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.732PheAla: 4.732 ± 0.0
0.364PheCys: 0.364 ± 0.0
2.184PheAsp: 2.184 ± 0.0
0.728PheGlu: 0.728 ± 0.0
1.092PhePhe: 1.092 ± 0.0
2.548PheGly: 2.548 ± 0.0
1.456PheHis: 1.456 ± 0.0
2.184PheIle: 2.184 ± 0.0
1.092PheLys: 1.092 ± 0.0
2.184PheLeu: 2.184 ± 0.0
0.728PheMet: 0.728 ± 0.0
2.548PheAsn: 2.548 ± 0.0
1.092PhePro: 1.092 ± 0.0
0.728PheGln: 0.728 ± 0.0
3.276PheArg: 3.276 ± 0.0
2.184PheSer: 2.184 ± 0.0
4.004PheThr: 4.004 ± 0.0
3.64PheVal: 3.64 ± 0.0
0.728PheTrp: 0.728 ± 0.0
1.092PheTyr: 1.092 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.461GlyAla: 5.461 ± 0.0
1.456GlyCys: 1.456 ± 0.0
2.184GlyAsp: 2.184 ± 0.0
2.548GlyGlu: 2.548 ± 0.0
2.548GlyPhe: 2.548 ± 0.0
2.912GlyGly: 2.912 ± 0.0
2.548GlyHis: 2.548 ± 0.0
4.368GlyIle: 4.368 ± 0.0
4.732GlyLys: 4.732 ± 0.0
5.461GlyLeu: 5.461 ± 0.0
2.184GlyMet: 2.184 ± 0.0
2.548GlyAsn: 2.548 ± 0.0
2.548GlyPro: 2.548 ± 0.0
2.548GlyGln: 2.548 ± 0.0
4.004GlyArg: 4.004 ± 0.0
7.645GlySer: 7.645 ± 0.0
2.548GlyThr: 2.548 ± 0.0
5.825GlyVal: 5.825 ± 0.0
2.184GlyTrp: 2.184 ± 0.0
2.912GlyTyr: 2.912 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.456HisAla: 1.456 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.728HisAsp: 0.728 ± 0.0
1.092HisGlu: 1.092 ± 0.0
1.092HisPhe: 1.092 ± 0.0
2.912HisGly: 2.912 ± 0.0
0.728HisHis: 0.728 ± 0.0
1.092HisIle: 1.092 ± 0.0
0.364HisLys: 0.364 ± 0.0
1.82HisLeu: 1.82 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.728HisAsn: 0.728 ± 0.0
0.728HisPro: 0.728 ± 0.0
1.456HisGln: 1.456 ± 0.0
1.092HisArg: 1.092 ± 0.0
1.092HisSer: 1.092 ± 0.0
1.092HisThr: 1.092 ± 0.0
2.548HisVal: 2.548 ± 0.0
1.456HisTrp: 1.456 ± 0.0
1.092HisTyr: 1.092 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
9.829IleAla: 9.829 ± 0.0
0.364IleCys: 0.364 ± 0.0
1.456IleAsp: 1.456 ± 0.0
2.184IleGlu: 2.184 ± 0.0
1.82IlePhe: 1.82 ± 0.0
5.825IleGly: 5.825 ± 0.0
0.364IleHis: 0.364 ± 0.0
4.004IleIle: 4.004 ± 0.0
1.82IleLys: 1.82 ± 0.0
4.368IleLeu: 4.368 ± 0.0
1.092IleMet: 1.092 ± 0.0
2.912IleAsn: 2.912 ± 0.0
3.64IlePro: 3.64 ± 0.0
1.456IleGln: 1.456 ± 0.0
1.092IleArg: 1.092 ± 0.0
5.096IleSer: 5.096 ± 0.0
4.004IleThr: 4.004 ± 0.0
3.276IleVal: 3.276 ± 0.0
0.364IleTrp: 0.364 ± 0.0
0.728IleTyr: 0.728 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.184LysAla: 2.184 ± 0.0
0.364LysCys: 0.364 ± 0.0
1.456LysAsp: 1.456 ± 0.0
2.548LysGlu: 2.548 ± 0.0
1.82LysPhe: 1.82 ± 0.0
2.912LysGly: 2.912 ± 0.0
0.728LysHis: 0.728 ± 0.0
1.82LysIle: 1.82 ± 0.0
2.912LysLys: 2.912 ± 0.0
4.732LysLeu: 4.732 ± 0.0
1.456LysMet: 1.456 ± 0.0
1.82LysAsn: 1.82 ± 0.0
2.184LysPro: 2.184 ± 0.0
1.092LysGln: 1.092 ± 0.0
3.276LysArg: 3.276 ± 0.0
3.276LysSer: 3.276 ± 0.0
1.82LysThr: 1.82 ± 0.0
4.368LysVal: 4.368 ± 0.0
1.092LysTrp: 1.092 ± 0.0
3.64LysTyr: 3.64 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
5.461LeuAla: 5.461 ± 0.0
1.092LeuCys: 1.092 ± 0.0
4.732LeuAsp: 4.732 ± 0.0
4.368LeuGlu: 4.368 ± 0.0
3.276LeuPhe: 3.276 ± 0.0
6.553LeuGly: 6.553 ± 0.0
1.456LeuHis: 1.456 ± 0.0
3.64LeuIle: 3.64 ± 0.0
5.096LeuLys: 5.096 ± 0.0
7.645LeuLeu: 7.645 ± 0.0
1.456LeuMet: 1.456 ± 0.0
4.004LeuAsn: 4.004 ± 0.0
5.461LeuPro: 5.461 ± 0.0
3.276LeuGln: 3.276 ± 0.0
4.368LeuArg: 4.368 ± 0.0
6.189LeuSer: 6.189 ± 0.0
6.189LeuThr: 6.189 ± 0.0
4.004LeuVal: 4.004 ± 0.0
0.364LeuTrp: 0.364 ± 0.0
4.368LeuTyr: 4.368 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.456MetAla: 1.456 ± 0.0
1.82MetCys: 1.82 ± 0.0
1.456MetAsp: 1.456 ± 0.0
0.728MetGlu: 0.728 ± 0.0
1.456MetPhe: 1.456 ± 0.0
0.728MetGly: 0.728 ± 0.0
1.092MetHis: 1.092 ± 0.0
1.456MetIle: 1.456 ± 0.0
0.728MetLys: 0.728 ± 0.0
1.092MetLeu: 1.092 ± 0.0
0.364MetMet: 0.364 ± 0.0
0.364MetAsn: 0.364 ± 0.0
1.456MetPro: 1.456 ± 0.0
0.728MetGln: 0.728 ± 0.0
1.456MetArg: 1.456 ± 0.0
2.548MetSer: 2.548 ± 0.0
0.728MetThr: 0.728 ± 0.0
1.092MetVal: 1.092 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.456MetTyr: 1.456 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.004AsnAla: 4.004 ± 0.0
1.092AsnCys: 1.092 ± 0.0
1.092AsnAsp: 1.092 ± 0.0
2.912AsnGlu: 2.912 ± 0.0
2.912AsnPhe: 2.912 ± 0.0
2.184AsnGly: 2.184 ± 0.0
0.364AsnHis: 0.364 ± 0.0
2.548AsnIle: 2.548 ± 0.0
0.364AsnLys: 0.364 ± 0.0
4.732AsnLeu: 4.732 ± 0.0
0.364AsnMet: 0.364 ± 0.0
2.184AsnAsn: 2.184 ± 0.0
1.82AsnPro: 1.82 ± 0.0
0.364AsnGln: 0.364 ± 0.0
1.456AsnArg: 1.456 ± 0.0
2.912AsnSer: 2.912 ± 0.0
3.64AsnThr: 3.64 ± 0.0
5.096AsnVal: 5.096 ± 0.0
1.456AsnTrp: 1.456 ± 0.0
1.456AsnTyr: 1.456 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
6.553ProAla: 6.553 ± 0.0
0.728ProCys: 0.728 ± 0.0
2.912ProAsp: 2.912 ± 0.0
1.82ProGlu: 1.82 ± 0.0
1.092ProPhe: 1.092 ± 0.0
4.004ProGly: 4.004 ± 0.0
1.092ProHis: 1.092 ± 0.0
4.004ProIle: 4.004 ± 0.0
2.184ProLys: 2.184 ± 0.0
4.732ProLeu: 4.732 ± 0.0
0.0ProMet: 0.0 ± 0.0
2.184ProAsn: 2.184 ± 0.0
3.276ProPro: 3.276 ± 0.0
1.82ProGln: 1.82 ± 0.0
1.456ProArg: 1.456 ± 0.0
5.825ProSer: 5.825 ± 0.0
3.64ProThr: 3.64 ± 0.0
2.912ProVal: 2.912 ± 0.0
0.728ProTrp: 0.728 ± 0.0
1.82ProTyr: 1.82 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.64GlnAla: 3.64 ± 0.0
0.364GlnCys: 0.364 ± 0.0
1.456GlnAsp: 1.456 ± 0.0
1.456GlnGlu: 1.456 ± 0.0
1.82GlnPhe: 1.82 ± 0.0
2.548GlnGly: 2.548 ± 0.0
0.728GlnHis: 0.728 ± 0.0
2.184GlnIle: 2.184 ± 0.0
1.456GlnLys: 1.456 ± 0.0
2.184GlnLeu: 2.184 ± 0.0
1.092GlnMet: 1.092 ± 0.0
1.092GlnAsn: 1.092 ± 0.0
0.0GlnPro: 0.0 ± 0.0
1.092GlnGln: 1.092 ± 0.0
0.728GlnArg: 0.728 ± 0.0
1.456GlnSer: 1.456 ± 0.0
2.184GlnThr: 2.184 ± 0.0
1.456GlnVal: 1.456 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
1.092GlnTyr: 1.092 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.004ArgAla: 4.004 ± 0.0
0.728ArgCys: 0.728 ± 0.0
5.825ArgAsp: 5.825 ± 0.0
2.912ArgGlu: 2.912 ± 0.0
2.184ArgPhe: 2.184 ± 0.0
2.548ArgGly: 2.548 ± 0.0
0.728ArgHis: 0.728 ± 0.0
2.184ArgIle: 2.184 ± 0.0
3.276ArgLys: 3.276 ± 0.0
4.732ArgLeu: 4.732 ± 0.0
1.456ArgMet: 1.456 ± 0.0
2.548ArgAsn: 2.548 ± 0.0
2.184ArgPro: 2.184 ± 0.0
1.092ArgGln: 1.092 ± 0.0
2.184ArgArg: 2.184 ± 0.0
2.184ArgSer: 2.184 ± 0.0
2.184ArgThr: 2.184 ± 0.0
6.189ArgVal: 6.189 ± 0.0
0.728ArgTrp: 0.728 ± 0.0
0.728ArgTyr: 0.728 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.917SerAla: 6.917 ± 0.0
1.092SerCys: 1.092 ± 0.0
2.912SerAsp: 2.912 ± 0.0
4.004SerGlu: 4.004 ± 0.0
0.728SerPhe: 0.728 ± 0.0
4.732SerGly: 4.732 ± 0.0
2.912SerHis: 2.912 ± 0.0
4.732SerIle: 4.732 ± 0.0
2.548SerLys: 2.548 ± 0.0
6.189SerLeu: 6.189 ± 0.0
0.728SerMet: 0.728 ± 0.0
3.64SerAsn: 3.64 ± 0.0
2.912SerPro: 2.912 ± 0.0
1.82SerGln: 1.82 ± 0.0
5.825SerArg: 5.825 ± 0.0
4.004SerSer: 4.004 ± 0.0
6.189SerThr: 6.189 ± 0.0
6.189SerVal: 6.189 ± 0.0
1.092SerTrp: 1.092 ± 0.0
2.548SerTyr: 2.548 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.096ThrAla: 5.096 ± 0.0
1.092ThrCys: 1.092 ± 0.0
3.276ThrAsp: 3.276 ± 0.0
3.276ThrGlu: 3.276 ± 0.0
1.456ThrPhe: 1.456 ± 0.0
4.004ThrGly: 4.004 ± 0.0
1.456ThrHis: 1.456 ± 0.0
4.368ThrIle: 4.368 ± 0.0
2.184ThrLys: 2.184 ± 0.0
6.189ThrLeu: 6.189 ± 0.0
1.456ThrMet: 1.456 ± 0.0
2.184ThrAsn: 2.184 ± 0.0
4.368ThrPro: 4.368 ± 0.0
2.548ThrGln: 2.548 ± 0.0
4.368ThrArg: 4.368 ± 0.0
4.004ThrSer: 4.004 ± 0.0
5.461ThrThr: 5.461 ± 0.0
5.096ThrVal: 5.096 ± 0.0
2.184ThrTrp: 2.184 ± 0.0
1.82ThrTyr: 1.82 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
9.829ValAla: 9.829 ± 0.0
1.092ValCys: 1.092 ± 0.0
6.189ValAsp: 6.189 ± 0.0
2.548ValGlu: 2.548 ± 0.0
2.548ValPhe: 2.548 ± 0.0
6.553ValGly: 6.553 ± 0.0
1.82ValHis: 1.82 ± 0.0
4.004ValIle: 4.004 ± 0.0
1.456ValLys: 1.456 ± 0.0
5.825ValLeu: 5.825 ± 0.0
1.092ValMet: 1.092 ± 0.0
4.732ValAsn: 4.732 ± 0.0
5.096ValPro: 5.096 ± 0.0
1.456ValGln: 1.456 ± 0.0
3.64ValArg: 3.64 ± 0.0
5.825ValSer: 5.825 ± 0.0
8.009ValThr: 8.009 ± 0.0
7.645ValVal: 7.645 ± 0.0
1.456ValTrp: 1.456 ± 0.0
4.732ValTyr: 4.732 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.728TrpAla: 0.728 ± 0.0
0.364TrpCys: 0.364 ± 0.0
0.728TrpAsp: 0.728 ± 0.0
0.728TrpGlu: 0.728 ± 0.0
0.364TrpPhe: 0.364 ± 0.0
0.364TrpGly: 0.364 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.364TrpIle: 0.364 ± 0.0
1.092TrpLys: 1.092 ± 0.0
1.456TrpLeu: 1.456 ± 0.0
0.364TrpMet: 0.364 ± 0.0
0.728TrpAsn: 0.728 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.82TrpGln: 1.82 ± 0.0
0.728TrpArg: 0.728 ± 0.0
1.82TrpSer: 1.82 ± 0.0
0.364TrpThr: 0.364 ± 0.0
2.548TrpVal: 2.548 ± 0.0
0.364TrpTrp: 0.364 ± 0.0
1.092TrpTyr: 1.092 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.548TyrAla: 2.548 ± 0.0
0.364TyrCys: 0.364 ± 0.0
2.184TyrAsp: 2.184 ± 0.0
1.456TyrGlu: 1.456 ± 0.0
2.548TyrPhe: 2.548 ± 0.0
2.912TyrGly: 2.912 ± 0.0
1.092TyrHis: 1.092 ± 0.0
1.456TyrIle: 1.456 ± 0.0
1.456TyrLys: 1.456 ± 0.0
1.82TyrLeu: 1.82 ± 0.0
1.092TyrMet: 1.092 ± 0.0
2.548TyrAsn: 2.548 ± 0.0
2.184TyrPro: 2.184 ± 0.0
0.728TyrGln: 0.728 ± 0.0
3.276TyrArg: 3.276 ± 0.0
2.912TyrSer: 2.912 ± 0.0
1.82TyrThr: 1.82 ± 0.0
2.184TyrVal: 2.184 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
1.456TyrTyr: 1.456 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2748 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski