Amino acid dipepetide frequency for Wenzhou picorna-like virus 22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.061AlaAla: 5.061 ± 0.0
0.362AlaCys: 0.362 ± 0.0
1.446AlaAsp: 1.446 ± 0.0
2.169AlaGlu: 2.169 ± 0.0
3.254AlaPhe: 3.254 ± 0.0
2.892AlaGly: 2.892 ± 0.0
1.808AlaHis: 1.808 ± 0.0
3.254AlaIle: 3.254 ± 0.0
3.254AlaLys: 3.254 ± 0.0
4.338AlaLeu: 4.338 ± 0.0
0.723AlaMet: 0.723 ± 0.0
2.892AlaAsn: 2.892 ± 0.0
1.808AlaPro: 1.808 ± 0.0
1.446AlaGln: 1.446 ± 0.0
4.338AlaArg: 4.338 ± 0.0
4.338AlaSer: 4.338 ± 0.0
4.7AlaThr: 4.7 ± 0.0
4.338AlaVal: 4.338 ± 0.0
0.723AlaTrp: 0.723 ± 0.0
3.254AlaTyr: 3.254 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.085CysAla: 1.085 ± 0.0
0.362CysCys: 0.362 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.446CysGlu: 1.446 ± 0.0
0.723CysPhe: 0.723 ± 0.0
1.446CysGly: 1.446 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.723CysIle: 0.723 ± 0.0
0.723CysLys: 0.723 ± 0.0
1.446CysLeu: 1.446 ± 0.0
0.362CysMet: 0.362 ± 0.0
0.723CysAsn: 0.723 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.362CysGln: 0.362 ± 0.0
1.446CysArg: 1.446 ± 0.0
1.085CysSer: 1.085 ± 0.0
1.085CysThr: 1.085 ± 0.0
1.446CysVal: 1.446 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.446AspAla: 1.446 ± 0.0
0.0AspCys: 0.0 ± 0.0
3.615AspAsp: 3.615 ± 0.0
2.531AspGlu: 2.531 ± 0.0
2.169AspPhe: 2.169 ± 0.0
2.531AspGly: 2.531 ± 0.0
0.723AspHis: 0.723 ± 0.0
2.531AspIle: 2.531 ± 0.0
1.808AspLys: 1.808 ± 0.0
4.7AspLeu: 4.7 ± 0.0
3.254AspMet: 3.254 ± 0.0
1.085AspAsn: 1.085 ± 0.0
3.615AspPro: 3.615 ± 0.0
2.531AspGln: 2.531 ± 0.0
1.446AspArg: 1.446 ± 0.0
3.615AspSer: 3.615 ± 0.0
2.892AspThr: 2.892 ± 0.0
5.785AspVal: 5.785 ± 0.0
1.085AspTrp: 1.085 ± 0.0
1.808AspTyr: 1.808 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.892GluAla: 2.892 ± 0.0
1.808GluCys: 1.808 ± 0.0
1.446GluAsp: 1.446 ± 0.0
1.085GluGlu: 1.085 ± 0.0
4.338GluPhe: 4.338 ± 0.0
2.892GluGly: 2.892 ± 0.0
0.362GluHis: 0.362 ± 0.0
3.977GluIle: 3.977 ± 0.0
2.169GluLys: 2.169 ± 0.0
1.808GluLeu: 1.808 ± 0.0
1.085GluMet: 1.085 ± 0.0
2.892GluAsn: 2.892 ± 0.0
2.531GluPro: 2.531 ± 0.0
0.723GluGln: 0.723 ± 0.0
2.892GluArg: 2.892 ± 0.0
4.338GluSer: 4.338 ± 0.0
4.7GluThr: 4.7 ± 0.0
5.785GluVal: 5.785 ± 0.0
0.723GluTrp: 0.723 ± 0.0
3.254GluTyr: 3.254 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.338PheAla: 4.338 ± 0.0
1.446PheCys: 1.446 ± 0.0
3.615PheAsp: 3.615 ± 0.0
3.254PheGlu: 3.254 ± 0.0
1.808PhePhe: 1.808 ± 0.0
3.977PheGly: 3.977 ± 0.0
1.446PheHis: 1.446 ± 0.0
2.531PheIle: 2.531 ± 0.0
1.446PheLys: 1.446 ± 0.0
3.977PheLeu: 3.977 ± 0.0
1.808PheMet: 1.808 ± 0.0
3.977PheAsn: 3.977 ± 0.0
2.531PhePro: 2.531 ± 0.0
1.446PheGln: 1.446 ± 0.0
3.254PheArg: 3.254 ± 0.0
5.061PheSer: 5.061 ± 0.0
3.977PheThr: 3.977 ± 0.0
3.254PheVal: 3.254 ± 0.0
0.362PheTrp: 0.362 ± 0.0
1.446PheTyr: 1.446 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.169GlyAla: 2.169 ± 0.0
0.362GlyCys: 0.362 ± 0.0
3.977GlyAsp: 3.977 ± 0.0
2.169GlyGlu: 2.169 ± 0.0
1.808GlyPhe: 1.808 ± 0.0
4.7GlyGly: 4.7 ± 0.0
1.446GlyHis: 1.446 ± 0.0
3.254GlyIle: 3.254 ± 0.0
5.423GlyLys: 5.423 ± 0.0
3.615GlyLeu: 3.615 ± 0.0
2.169GlyMet: 2.169 ± 0.0
2.169GlyAsn: 2.169 ± 0.0
2.531GlyPro: 2.531 ± 0.0
1.446GlyGln: 1.446 ± 0.0
2.531GlyArg: 2.531 ± 0.0
4.338GlySer: 4.338 ± 0.0
3.977GlyThr: 3.977 ± 0.0
4.7GlyVal: 4.7 ± 0.0
0.362GlyTrp: 0.362 ± 0.0
3.615GlyTyr: 3.615 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.723HisAla: 0.723 ± 0.0
1.085HisCys: 1.085 ± 0.0
0.723HisAsp: 0.723 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.085HisPhe: 1.085 ± 0.0
0.723HisGly: 0.723 ± 0.0
0.362HisHis: 0.362 ± 0.0
2.169HisIle: 2.169 ± 0.0
0.362HisLys: 0.362 ± 0.0
0.0HisLeu: 0.0 ± 0.0
1.085HisMet: 1.085 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.085HisPro: 1.085 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.362HisArg: 0.362 ± 0.0
2.169HisSer: 2.169 ± 0.0
1.446HisThr: 1.446 ± 0.0
2.892HisVal: 2.892 ± 0.0
0.362HisTrp: 0.362 ± 0.0
0.723HisTyr: 0.723 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.892IleAla: 2.892 ± 0.0
0.723IleCys: 0.723 ± 0.0
3.254IleAsp: 3.254 ± 0.0
2.892IleGlu: 2.892 ± 0.0
4.338IlePhe: 4.338 ± 0.0
2.531IleGly: 2.531 ± 0.0
1.085IleHis: 1.085 ± 0.0
1.808IleIle: 1.808 ± 0.0
3.254IleLys: 3.254 ± 0.0
5.423IleLeu: 5.423 ± 0.0
0.362IleMet: 0.362 ± 0.0
5.061IleAsn: 5.061 ± 0.0
5.061IlePro: 5.061 ± 0.0
2.531IleGln: 2.531 ± 0.0
2.531IleArg: 2.531 ± 0.0
6.146IleSer: 6.146 ± 0.0
3.977IleThr: 3.977 ± 0.0
4.338IleVal: 4.338 ± 0.0
0.362IleTrp: 0.362 ± 0.0
2.531IleTyr: 2.531 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
1.085LysAla: 1.085 ± 0.0
0.723LysCys: 0.723 ± 0.0
2.892LysAsp: 2.892 ± 0.0
3.977LysGlu: 3.977 ± 0.0
1.446LysPhe: 1.446 ± 0.0
1.808LysGly: 1.808 ± 0.0
0.723LysHis: 0.723 ± 0.0
1.808LysIle: 1.808 ± 0.0
3.615LysLys: 3.615 ± 0.0
4.338LysLeu: 4.338 ± 0.0
1.446LysMet: 1.446 ± 0.0
3.615LysAsn: 3.615 ± 0.0
2.531LysPro: 2.531 ± 0.0
1.446LysGln: 1.446 ± 0.0
4.338LysArg: 4.338 ± 0.0
2.892LysSer: 2.892 ± 0.0
3.254LysThr: 3.254 ± 0.0
2.892LysVal: 2.892 ± 0.0
0.723LysTrp: 0.723 ± 0.0
3.254LysTyr: 3.254 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
2.892LeuAla: 2.892 ± 0.0
1.808LeuCys: 1.808 ± 0.0
3.977LeuAsp: 3.977 ± 0.0
4.338LeuGlu: 4.338 ± 0.0
7.231LeuPhe: 7.231 ± 0.0
2.892LeuGly: 2.892 ± 0.0
1.808LeuHis: 1.808 ± 0.0
5.061LeuIle: 5.061 ± 0.0
3.977LeuLys: 3.977 ± 0.0
5.061LeuLeu: 5.061 ± 0.0
1.085LeuMet: 1.085 ± 0.0
4.338LeuAsn: 4.338 ± 0.0
4.338LeuPro: 4.338 ± 0.0
1.446LeuGln: 1.446 ± 0.0
3.615LeuArg: 3.615 ± 0.0
5.061LeuSer: 5.061 ± 0.0
4.338LeuThr: 4.338 ± 0.0
8.677LeuVal: 8.677 ± 0.0
1.446LeuTrp: 1.446 ± 0.0
2.531LeuTyr: 2.531 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.169MetAla: 2.169 ± 0.0
0.723MetCys: 0.723 ± 0.0
0.723MetAsp: 0.723 ± 0.0
1.446MetGlu: 1.446 ± 0.0
0.723MetPhe: 0.723 ± 0.0
2.169MetGly: 2.169 ± 0.0
0.723MetHis: 0.723 ± 0.0
1.085MetIle: 1.085 ± 0.0
1.446MetLys: 1.446 ± 0.0
2.531MetLeu: 2.531 ± 0.0
0.362MetMet: 0.362 ± 0.0
1.808MetAsn: 1.808 ± 0.0
1.446MetPro: 1.446 ± 0.0
0.723MetGln: 0.723 ± 0.0
1.446MetArg: 1.446 ± 0.0
2.531MetSer: 2.531 ± 0.0
2.169MetThr: 2.169 ± 0.0
1.085MetVal: 1.085 ± 0.0
0.362MetTrp: 0.362 ± 0.0
0.723MetTyr: 0.723 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.254AsnAla: 3.254 ± 0.0
0.362AsnCys: 0.362 ± 0.0
2.892AsnAsp: 2.892 ± 0.0
2.892AsnGlu: 2.892 ± 0.0
2.169AsnPhe: 2.169 ± 0.0
3.615AsnGly: 3.615 ± 0.0
0.723AsnHis: 0.723 ± 0.0
2.892AsnIle: 2.892 ± 0.0
3.254AsnLys: 3.254 ± 0.0
3.254AsnLeu: 3.254 ± 0.0
3.254AsnMet: 3.254 ± 0.0
2.892AsnAsn: 2.892 ± 0.0
4.7AsnPro: 4.7 ± 0.0
1.808AsnGln: 1.808 ± 0.0
1.446AsnArg: 1.446 ± 0.0
2.892AsnSer: 2.892 ± 0.0
3.977AsnThr: 3.977 ± 0.0
4.7AsnVal: 4.7 ± 0.0
1.085AsnTrp: 1.085 ± 0.0
2.169AsnTyr: 2.169 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.169ProAla: 2.169 ± 0.0
0.362ProCys: 0.362 ± 0.0
1.085ProAsp: 1.085 ± 0.0
1.808ProGlu: 1.808 ± 0.0
3.254ProPhe: 3.254 ± 0.0
3.977ProGly: 3.977 ± 0.0
0.723ProHis: 0.723 ± 0.0
2.531ProIle: 2.531 ± 0.0
1.808ProLys: 1.808 ± 0.0
6.508ProLeu: 6.508 ± 0.0
0.723ProMet: 0.723 ± 0.0
2.531ProAsn: 2.531 ± 0.0
2.531ProPro: 2.531 ± 0.0
1.808ProGln: 1.808 ± 0.0
1.446ProArg: 1.446 ± 0.0
4.7ProSer: 4.7 ± 0.0
5.061ProThr: 5.061 ± 0.0
3.615ProVal: 3.615 ± 0.0
0.362ProTrp: 0.362 ± 0.0
2.531ProTyr: 2.531 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.531GlnAla: 2.531 ± 0.0
0.0GlnCys: 0.0 ± 0.0
0.723GlnAsp: 0.723 ± 0.0
2.531GlnGlu: 2.531 ± 0.0
1.808GlnPhe: 1.808 ± 0.0
0.723GlnGly: 0.723 ± 0.0
0.362GlnHis: 0.362 ± 0.0
2.892GlnIle: 2.892 ± 0.0
1.085GlnLys: 1.085 ± 0.0
1.446GlnLeu: 1.446 ± 0.0
1.446GlnMet: 1.446 ± 0.0
0.723GlnAsn: 0.723 ± 0.0
0.0GlnPro: 0.0 ± 0.0
0.362GlnGln: 0.362 ± 0.0
1.808GlnArg: 1.808 ± 0.0
3.977GlnSer: 3.977 ± 0.0
2.531GlnThr: 2.531 ± 0.0
2.892GlnVal: 2.892 ± 0.0
0.723GlnTrp: 0.723 ± 0.0
1.808GlnTyr: 1.808 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.338ArgAla: 4.338 ± 0.0
0.0ArgCys: 0.0 ± 0.0
1.808ArgAsp: 1.808 ± 0.0
3.615ArgGlu: 3.615 ± 0.0
3.977ArgPhe: 3.977 ± 0.0
3.254ArgGly: 3.254 ± 0.0
0.723ArgHis: 0.723 ± 0.0
2.531ArgIle: 2.531 ± 0.0
0.723ArgLys: 0.723 ± 0.0
5.785ArgLeu: 5.785 ± 0.0
1.808ArgMet: 1.808 ± 0.0
3.254ArgAsn: 3.254 ± 0.0
2.531ArgPro: 2.531 ± 0.0
2.169ArgGln: 2.169 ± 0.0
3.615ArgArg: 3.615 ± 0.0
3.254ArgSer: 3.254 ± 0.0
4.338ArgThr: 4.338 ± 0.0
3.615ArgVal: 3.615 ± 0.0
0.723ArgTrp: 0.723 ± 0.0
3.977ArgTyr: 3.977 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.146SerAla: 6.146 ± 0.0
0.723SerCys: 0.723 ± 0.0
3.977SerAsp: 3.977 ± 0.0
5.061SerGlu: 5.061 ± 0.0
3.615SerPhe: 3.615 ± 0.0
4.338SerGly: 4.338 ± 0.0
1.085SerHis: 1.085 ± 0.0
5.061SerIle: 5.061 ± 0.0
4.7SerLys: 4.7 ± 0.0
6.869SerLeu: 6.869 ± 0.0
1.085SerMet: 1.085 ± 0.0
5.061SerAsn: 5.061 ± 0.0
2.892SerPro: 2.892 ± 0.0
2.531SerGln: 2.531 ± 0.0
4.7SerArg: 4.7 ± 0.0
3.615SerSer: 3.615 ± 0.0
6.869SerThr: 6.869 ± 0.0
6.869SerVal: 6.869 ± 0.0
0.723SerTrp: 0.723 ± 0.0
5.423SerTyr: 5.423 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.7ThrAla: 4.7 ± 0.0
0.723ThrCys: 0.723 ± 0.0
3.254ThrAsp: 3.254 ± 0.0
2.169ThrGlu: 2.169 ± 0.0
2.531ThrPhe: 2.531 ± 0.0
2.892ThrGly: 2.892 ± 0.0
1.808ThrHis: 1.808 ± 0.0
5.785ThrIle: 5.785 ± 0.0
1.808ThrLys: 1.808 ± 0.0
6.146ThrLeu: 6.146 ± 0.0
1.085ThrMet: 1.085 ± 0.0
5.061ThrAsn: 5.061 ± 0.0
4.7ThrPro: 4.7 ± 0.0
3.977ThrGln: 3.977 ± 0.0
5.061ThrArg: 5.061 ± 0.0
6.508ThrSer: 6.508 ± 0.0
4.338ThrThr: 4.338 ± 0.0
6.146ThrVal: 6.146 ± 0.0
1.446ThrTrp: 1.446 ± 0.0
2.531ThrTyr: 2.531 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.338ValAla: 4.338 ± 0.0
1.446ValCys: 1.446 ± 0.0
4.7ValAsp: 4.7 ± 0.0
4.7ValGlu: 4.7 ± 0.0
4.338ValPhe: 4.338 ± 0.0
6.508ValGly: 6.508 ± 0.0
0.362ValHis: 0.362 ± 0.0
6.869ValIle: 6.869 ± 0.0
5.423ValLys: 5.423 ± 0.0
3.977ValLeu: 3.977 ± 0.0
1.808ValMet: 1.808 ± 0.0
2.531ValAsn: 2.531 ± 0.0
3.615ValPro: 3.615 ± 0.0
2.892ValGln: 2.892 ± 0.0
4.7ValArg: 4.7 ± 0.0
9.4ValSer: 9.4 ± 0.0
6.508ValThr: 6.508 ± 0.0
4.7ValVal: 4.7 ± 0.0
0.723ValTrp: 0.723 ± 0.0
2.892ValTyr: 2.892 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.723TrpCys: 0.723 ± 0.0
1.085TrpAsp: 1.085 ± 0.0
0.362TrpGlu: 0.362 ± 0.0
1.446TrpPhe: 1.446 ± 0.0
0.362TrpGly: 0.362 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.085TrpIle: 1.085 ± 0.0
0.362TrpLys: 0.362 ± 0.0
0.723TrpLeu: 0.723 ± 0.0
0.723TrpMet: 0.723 ± 0.0
1.085TrpAsn: 1.085 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.085TrpArg: 1.085 ± 0.0
2.169TrpSer: 2.169 ± 0.0
0.362TrpThr: 0.362 ± 0.0
1.085TrpVal: 1.085 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.362TrpTyr: 0.362 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.169TyrAla: 2.169 ± 0.0
0.723TyrCys: 0.723 ± 0.0
3.977TyrAsp: 3.977 ± 0.0
3.615TyrGlu: 3.615 ± 0.0
2.892TyrPhe: 2.892 ± 0.0
2.531TyrGly: 2.531 ± 0.0
1.085TyrHis: 1.085 ± 0.0
3.254TyrIle: 3.254 ± 0.0
2.169TyrLys: 2.169 ± 0.0
3.977TyrLeu: 3.977 ± 0.0
0.362TyrMet: 0.362 ± 0.0
2.531TyrAsn: 2.531 ± 0.0
1.085TyrPro: 1.085 ± 0.0
0.723TyrGln: 0.723 ± 0.0
3.977TyrArg: 3.977 ± 0.0
3.254TyrSer: 3.254 ± 0.0
2.169TyrThr: 2.169 ± 0.0
3.615TyrVal: 3.615 ± 0.0
0.723TyrTrp: 0.723 ± 0.0
2.531TyrTyr: 2.531 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2767 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski