Amino acid dipepetide frequency for Shahe heteroptera virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.245AlaAla: 8.245 ± 0.0
0.344AlaCys: 0.344 ± 0.0
3.779AlaAsp: 3.779 ± 0.0
4.809AlaGlu: 4.809 ± 0.0
3.435AlaPhe: 3.435 ± 0.0
3.435AlaGly: 3.435 ± 0.0
2.061AlaHis: 2.061 ± 0.0
2.748AlaIle: 2.748 ± 0.0
4.809AlaLys: 4.809 ± 0.0
4.466AlaLeu: 4.466 ± 0.0
1.718AlaMet: 1.718 ± 0.0
3.092AlaAsn: 3.092 ± 0.0
4.466AlaPro: 4.466 ± 0.0
4.122AlaGln: 4.122 ± 0.0
5.84AlaArg: 5.84 ± 0.0
5.153AlaSer: 5.153 ± 0.0
4.466AlaThr: 4.466 ± 0.0
6.527AlaVal: 6.527 ± 0.0
2.061AlaTrp: 2.061 ± 0.0
2.748AlaTyr: 2.748 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.031CysAla: 1.031 ± 0.0
0.344CysCys: 0.344 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.344CysGlu: 0.344 ± 0.0
1.031CysPhe: 1.031 ± 0.0
1.718CysGly: 1.718 ± 0.0
0.344CysHis: 0.344 ± 0.0
1.374CysIle: 1.374 ± 0.0
0.344CysLys: 0.344 ± 0.0
1.718CysLeu: 1.718 ± 0.0
0.344CysMet: 0.344 ± 0.0
0.0CysAsn: 0.0 ± 0.0
2.405CysPro: 2.405 ± 0.0
1.718CysGln: 1.718 ± 0.0
1.031CysArg: 1.031 ± 0.0
0.344CysSer: 0.344 ± 0.0
0.344CysThr: 0.344 ± 0.0
1.031CysVal: 1.031 ± 0.0
0.687CysTrp: 0.687 ± 0.0
0.687CysTyr: 0.687 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.809AspAla: 4.809 ± 0.0
0.687AspCys: 0.687 ± 0.0
2.061AspAsp: 2.061 ± 0.0
2.405AspGlu: 2.405 ± 0.0
1.374AspPhe: 1.374 ± 0.0
4.122AspGly: 4.122 ± 0.0
1.031AspHis: 1.031 ± 0.0
2.061AspIle: 2.061 ± 0.0
4.466AspLys: 4.466 ± 0.0
3.092AspLeu: 3.092 ± 0.0
0.687AspMet: 0.687 ± 0.0
1.718AspAsn: 1.718 ± 0.0
3.435AspPro: 3.435 ± 0.0
3.435AspGln: 3.435 ± 0.0
2.748AspArg: 2.748 ± 0.0
4.122AspSer: 4.122 ± 0.0
3.092AspThr: 3.092 ± 0.0
2.748AspVal: 2.748 ± 0.0
1.031AspTrp: 1.031 ± 0.0
3.092AspTyr: 3.092 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
6.183GluAla: 6.183 ± 0.0
1.031GluCys: 1.031 ± 0.0
1.374GluAsp: 1.374 ± 0.0
6.183GluGlu: 6.183 ± 0.0
2.748GluPhe: 2.748 ± 0.0
3.092GluGly: 3.092 ± 0.0
1.718GluHis: 1.718 ± 0.0
3.092GluIle: 3.092 ± 0.0
5.153GluLys: 5.153 ± 0.0
5.496GluLeu: 5.496 ± 0.0
2.061GluMet: 2.061 ± 0.0
2.061GluAsn: 2.061 ± 0.0
2.748GluPro: 2.748 ± 0.0
1.718GluGln: 1.718 ± 0.0
2.748GluArg: 2.748 ± 0.0
1.718GluSer: 1.718 ± 0.0
2.061GluThr: 2.061 ± 0.0
5.153GluVal: 5.153 ± 0.0
1.718GluTrp: 1.718 ± 0.0
3.092GluTyr: 3.092 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.748PheAla: 2.748 ± 0.0
1.031PheCys: 1.031 ± 0.0
3.092PheAsp: 3.092 ± 0.0
2.405PheGlu: 2.405 ± 0.0
1.374PhePhe: 1.374 ± 0.0
2.748PheGly: 2.748 ± 0.0
1.718PheHis: 1.718 ± 0.0
2.405PheIle: 2.405 ± 0.0
2.405PheLys: 2.405 ± 0.0
2.405PheLeu: 2.405 ± 0.0
1.031PheMet: 1.031 ± 0.0
1.718PheAsn: 1.718 ± 0.0
3.092PhePro: 3.092 ± 0.0
3.092PheGln: 3.092 ± 0.0
2.061PheArg: 2.061 ± 0.0
3.435PheSer: 3.435 ± 0.0
2.748PheThr: 2.748 ± 0.0
4.122PheVal: 4.122 ± 0.0
0.0PheTrp: 0.0 ± 0.0
2.061PheTyr: 2.061 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.718GlyAla: 1.718 ± 0.0
0.687GlyCys: 0.687 ± 0.0
3.435GlyAsp: 3.435 ± 0.0
5.496GlyGlu: 5.496 ± 0.0
3.779GlyPhe: 3.779 ± 0.0
3.435GlyGly: 3.435 ± 0.0
1.031GlyHis: 1.031 ± 0.0
2.405GlyIle: 2.405 ± 0.0
5.153GlyLys: 5.153 ± 0.0
3.779GlyLeu: 3.779 ± 0.0
2.061GlyMet: 2.061 ± 0.0
1.718GlyAsn: 1.718 ± 0.0
2.405GlyPro: 2.405 ± 0.0
3.092GlyGln: 3.092 ± 0.0
2.061GlyArg: 2.061 ± 0.0
3.435GlySer: 3.435 ± 0.0
3.435GlyThr: 3.435 ± 0.0
3.779GlyVal: 3.779 ± 0.0
0.687GlyTrp: 0.687 ± 0.0
3.092GlyTyr: 3.092 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
2.061HisAla: 2.061 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.031HisAsp: 1.031 ± 0.0
0.687HisGlu: 0.687 ± 0.0
1.374HisPhe: 1.374 ± 0.0
1.718HisGly: 1.718 ± 0.0
0.344HisHis: 0.344 ± 0.0
1.374HisIle: 1.374 ± 0.0
0.344HisLys: 0.344 ± 0.0
1.374HisLeu: 1.374 ± 0.0
0.687HisMet: 0.687 ± 0.0
1.031HisAsn: 1.031 ± 0.0
1.031HisPro: 1.031 ± 0.0
1.031HisGln: 1.031 ± 0.0
2.405HisArg: 2.405 ± 0.0
1.374HisSer: 1.374 ± 0.0
0.0HisThr: 0.0 ± 0.0
2.061HisVal: 2.061 ± 0.0
1.031HisTrp: 1.031 ± 0.0
1.718HisTyr: 1.718 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.809IleAla: 4.809 ± 0.0
1.031IleCys: 1.031 ± 0.0
2.748IleAsp: 2.748 ± 0.0
4.466IleGlu: 4.466 ± 0.0
3.092IlePhe: 3.092 ± 0.0
3.435IleGly: 3.435 ± 0.0
1.374IleHis: 1.374 ± 0.0
2.748IleIle: 2.748 ± 0.0
2.405IleLys: 2.405 ± 0.0
3.779IleLeu: 3.779 ± 0.0
1.374IleMet: 1.374 ± 0.0
2.405IleAsn: 2.405 ± 0.0
3.779IlePro: 3.779 ± 0.0
2.061IleGln: 2.061 ± 0.0
3.779IleArg: 3.779 ± 0.0
4.122IleSer: 4.122 ± 0.0
3.092IleThr: 3.092 ± 0.0
3.779IleVal: 3.779 ± 0.0
1.031IleTrp: 1.031 ± 0.0
0.687IleTyr: 0.687 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.122LysAla: 4.122 ± 0.0
1.718LysCys: 1.718 ± 0.0
3.092LysAsp: 3.092 ± 0.0
3.779LysGlu: 3.779 ± 0.0
1.718LysPhe: 1.718 ± 0.0
4.122LysGly: 4.122 ± 0.0
1.718LysHis: 1.718 ± 0.0
2.061LysIle: 2.061 ± 0.0
2.405LysLys: 2.405 ± 0.0
5.153LysLeu: 5.153 ± 0.0
1.031LysMet: 1.031 ± 0.0
1.718LysAsn: 1.718 ± 0.0
2.061LysPro: 2.061 ± 0.0
2.748LysGln: 2.748 ± 0.0
1.718LysArg: 1.718 ± 0.0
1.374LysSer: 1.374 ± 0.0
4.809LysThr: 4.809 ± 0.0
5.84LysVal: 5.84 ± 0.0
1.374LysTrp: 1.374 ± 0.0
1.031LysTyr: 1.031 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
5.496LeuAla: 5.496 ± 0.0
1.374LeuCys: 1.374 ± 0.0
4.809LeuAsp: 4.809 ± 0.0
3.435LeuGlu: 3.435 ± 0.0
2.748LeuPhe: 2.748 ± 0.0
4.809LeuGly: 4.809 ± 0.0
1.718LeuHis: 1.718 ± 0.0
5.84LeuIle: 5.84 ± 0.0
4.466LeuLys: 4.466 ± 0.0
5.84LeuLeu: 5.84 ± 0.0
3.092LeuMet: 3.092 ± 0.0
3.779LeuAsn: 3.779 ± 0.0
2.405LeuPro: 2.405 ± 0.0
2.748LeuGln: 2.748 ± 0.0
4.122LeuArg: 4.122 ± 0.0
4.466LeuSer: 4.466 ± 0.0
3.092LeuThr: 3.092 ± 0.0
4.809LeuVal: 4.809 ± 0.0
1.374LeuTrp: 1.374 ± 0.0
2.748LeuTyr: 2.748 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.374MetAla: 1.374 ± 0.0
1.718MetCys: 1.718 ± 0.0
1.374MetAsp: 1.374 ± 0.0
1.031MetGlu: 1.031 ± 0.0
0.344MetPhe: 0.344 ± 0.0
1.031MetGly: 1.031 ± 0.0
0.687MetHis: 0.687 ± 0.0
0.344MetIle: 0.344 ± 0.0
1.374MetLys: 1.374 ± 0.0
2.061MetLeu: 2.061 ± 0.0
0.687MetMet: 0.687 ± 0.0
1.718MetAsn: 1.718 ± 0.0
2.748MetPro: 2.748 ± 0.0
0.687MetGln: 0.687 ± 0.0
2.061MetArg: 2.061 ± 0.0
1.718MetSer: 1.718 ± 0.0
3.435MetThr: 3.435 ± 0.0
1.031MetVal: 1.031 ± 0.0
0.344MetTrp: 0.344 ± 0.0
1.031MetTyr: 1.031 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.435AsnAla: 3.435 ± 0.0
3.092AsnCys: 3.092 ± 0.0
1.374AsnAsp: 1.374 ± 0.0
2.748AsnGlu: 2.748 ± 0.0
2.061AsnPhe: 2.061 ± 0.0
2.405AsnGly: 2.405 ± 0.0
0.687AsnHis: 0.687 ± 0.0
3.092AsnIle: 3.092 ± 0.0
2.061AsnLys: 2.061 ± 0.0
2.748AsnLeu: 2.748 ± 0.0
0.687AsnMet: 0.687 ± 0.0
2.061AsnAsn: 2.061 ± 0.0
2.748AsnPro: 2.748 ± 0.0
2.061AsnGln: 2.061 ± 0.0
2.061AsnArg: 2.061 ± 0.0
3.092AsnSer: 3.092 ± 0.0
1.718AsnThr: 1.718 ± 0.0
3.779AsnVal: 3.779 ± 0.0
1.374AsnTrp: 1.374 ± 0.0
1.031AsnTyr: 1.031 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
5.496ProAla: 5.496 ± 0.0
0.687ProCys: 0.687 ± 0.0
3.779ProAsp: 3.779 ± 0.0
4.122ProGlu: 4.122 ± 0.0
3.435ProPhe: 3.435 ± 0.0
3.779ProGly: 3.779 ± 0.0
1.718ProHis: 1.718 ± 0.0
5.153ProIle: 5.153 ± 0.0
2.748ProLys: 2.748 ± 0.0
3.092ProLeu: 3.092 ± 0.0
1.031ProMet: 1.031 ± 0.0
2.748ProAsn: 2.748 ± 0.0
4.466ProPro: 4.466 ± 0.0
2.061ProGln: 2.061 ± 0.0
2.061ProArg: 2.061 ± 0.0
2.748ProSer: 2.748 ± 0.0
3.779ProThr: 3.779 ± 0.0
4.122ProVal: 4.122 ± 0.0
1.374ProTrp: 1.374 ± 0.0
3.092ProTyr: 3.092 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
5.84GlnAla: 5.84 ± 0.0
0.0GlnCys: 0.0 ± 0.0
3.435GlnAsp: 3.435 ± 0.0
0.687GlnGlu: 0.687 ± 0.0
3.092GlnPhe: 3.092 ± 0.0
2.405GlnGly: 2.405 ± 0.0
1.031GlnHis: 1.031 ± 0.0
3.435GlnIle: 3.435 ± 0.0
2.405GlnLys: 2.405 ± 0.0
4.466GlnLeu: 4.466 ± 0.0
0.0GlnMet: 0.0 ± 0.0
1.718GlnAsn: 1.718 ± 0.0
3.779GlnPro: 3.779 ± 0.0
1.031GlnGln: 1.031 ± 0.0
3.435GlnArg: 3.435 ± 0.0
3.779GlnSer: 3.779 ± 0.0
4.122GlnThr: 4.122 ± 0.0
1.031GlnVal: 1.031 ± 0.0
0.687GlnTrp: 0.687 ± 0.0
2.405GlnTyr: 2.405 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.092ArgAla: 3.092 ± 0.0
0.687ArgCys: 0.687 ± 0.0
2.748ArgAsp: 2.748 ± 0.0
4.809ArgGlu: 4.809 ± 0.0
2.405ArgPhe: 2.405 ± 0.0
4.122ArgGly: 4.122 ± 0.0
2.061ArgHis: 2.061 ± 0.0
3.779ArgIle: 3.779 ± 0.0
2.748ArgLys: 2.748 ± 0.0
5.84ArgLeu: 5.84 ± 0.0
1.374ArgMet: 1.374 ± 0.0
2.748ArgAsn: 2.748 ± 0.0
2.748ArgPro: 2.748 ± 0.0
5.153ArgGln: 5.153 ± 0.0
5.153ArgArg: 5.153 ± 0.0
3.092ArgSer: 3.092 ± 0.0
2.061ArgThr: 2.061 ± 0.0
2.061ArgVal: 2.061 ± 0.0
1.031ArgTrp: 1.031 ± 0.0
2.405ArgTyr: 2.405 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.092SerAla: 3.092 ± 0.0
0.344SerCys: 0.344 ± 0.0
3.779SerAsp: 3.779 ± 0.0
2.405SerGlu: 2.405 ± 0.0
3.435SerPhe: 3.435 ± 0.0
3.435SerGly: 3.435 ± 0.0
0.687SerHis: 0.687 ± 0.0
2.405SerIle: 2.405 ± 0.0
2.405SerLys: 2.405 ± 0.0
5.84SerLeu: 5.84 ± 0.0
2.748SerMet: 2.748 ± 0.0
6.527SerAsn: 6.527 ± 0.0
4.466SerPro: 4.466 ± 0.0
2.405SerGln: 2.405 ± 0.0
2.748SerArg: 2.748 ± 0.0
1.718SerSer: 1.718 ± 0.0
3.435SerThr: 3.435 ± 0.0
3.779SerVal: 3.779 ± 0.0
0.687SerTrp: 0.687 ± 0.0
2.061SerTyr: 2.061 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.153ThrAla: 5.153 ± 0.0
0.344ThrCys: 0.344 ± 0.0
3.092ThrAsp: 3.092 ± 0.0
4.466ThrGlu: 4.466 ± 0.0
2.405ThrPhe: 2.405 ± 0.0
2.061ThrGly: 2.061 ± 0.0
0.344ThrHis: 0.344 ± 0.0
3.435ThrIle: 3.435 ± 0.0
2.061ThrLys: 2.061 ± 0.0
4.122ThrLeu: 4.122 ± 0.0
2.061ThrMet: 2.061 ± 0.0
3.779ThrAsn: 3.779 ± 0.0
3.435ThrPro: 3.435 ± 0.0
2.748ThrGln: 2.748 ± 0.0
3.779ThrArg: 3.779 ± 0.0
4.122ThrSer: 4.122 ± 0.0
2.405ThrThr: 2.405 ± 0.0
1.718ThrVal: 1.718 ± 0.0
1.374ThrTrp: 1.374 ± 0.0
2.748ThrTyr: 2.748 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
5.153ValAla: 5.153 ± 0.0
1.374ValCys: 1.374 ± 0.0
2.061ValAsp: 2.061 ± 0.0
4.466ValGlu: 4.466 ± 0.0
2.061ValPhe: 2.061 ± 0.0
2.405ValGly: 2.405 ± 0.0
1.031ValHis: 1.031 ± 0.0
4.466ValIle: 4.466 ± 0.0
3.435ValLys: 3.435 ± 0.0
2.748ValLeu: 2.748 ± 0.0
1.031ValMet: 1.031 ± 0.0
2.405ValAsn: 2.405 ± 0.0
6.183ValPro: 6.183 ± 0.0
3.779ValGln: 3.779 ± 0.0
4.466ValArg: 4.466 ± 0.0
5.496ValSer: 5.496 ± 0.0
3.435ValThr: 3.435 ± 0.0
5.153ValVal: 5.153 ± 0.0
0.344ValTrp: 0.344 ± 0.0
3.779ValTyr: 3.779 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.344TrpAsp: 0.344 ± 0.0
1.374TrpGlu: 1.374 ± 0.0
1.718TrpPhe: 1.718 ± 0.0
0.687TrpGly: 0.687 ± 0.0
0.344TrpHis: 0.344 ± 0.0
1.031TrpIle: 1.031 ± 0.0
1.031TrpLys: 1.031 ± 0.0
1.718TrpLeu: 1.718 ± 0.0
0.344TrpMet: 0.344 ± 0.0
1.374TrpAsn: 1.374 ± 0.0
0.687TrpPro: 0.687 ± 0.0
1.374TrpGln: 1.374 ± 0.0
2.748TrpArg: 2.748 ± 0.0
0.687TrpSer: 0.687 ± 0.0
1.031TrpThr: 1.031 ± 0.0
1.374TrpVal: 1.374 ± 0.0
0.687TrpTrp: 0.687 ± 0.0
1.718TrpTyr: 1.718 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.466TyrAla: 4.466 ± 0.0
0.344TyrCys: 0.344 ± 0.0
4.809TyrAsp: 4.809 ± 0.0
1.374TyrGlu: 1.374 ± 0.0
2.061TyrPhe: 2.061 ± 0.0
1.718TyrGly: 1.718 ± 0.0
1.031TyrHis: 1.031 ± 0.0
2.748TyrIle: 2.748 ± 0.0
1.374TyrLys: 1.374 ± 0.0
3.435TyrLeu: 3.435 ± 0.0
2.405TyrMet: 2.405 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
2.405TyrPro: 2.405 ± 0.0
1.718TyrGln: 1.718 ± 0.0
3.092TyrArg: 3.092 ± 0.0
2.405TyrSer: 2.405 ± 0.0
3.092TyrThr: 3.092 ± 0.0
1.374TyrVal: 1.374 ± 0.0
1.374TyrTrp: 1.374 ± 0.0
1.374TyrTyr: 1.374 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2912 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski