Amino acid dipepetide frequency for Beihai mantis shrimp virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.092AlaAla: 3.092 ± 0.0
1.031AlaCys: 1.031 ± 0.0
3.779AlaAsp: 3.779 ± 0.0
2.061AlaGlu: 2.061 ± 0.0
2.405AlaPhe: 2.405 ± 0.0
5.153AlaGly: 5.153 ± 0.0
2.061AlaHis: 2.061 ± 0.0
2.405AlaIle: 2.405 ± 0.0
3.435AlaLys: 3.435 ± 0.0
6.183AlaLeu: 6.183 ± 0.0
1.718AlaMet: 1.718 ± 0.0
5.153AlaAsn: 5.153 ± 0.0
1.718AlaPro: 1.718 ± 0.0
3.092AlaGln: 3.092 ± 0.0
1.374AlaArg: 1.374 ± 0.0
3.435AlaSer: 3.435 ± 0.0
3.092AlaThr: 3.092 ± 0.0
4.466AlaVal: 4.466 ± 0.0
0.687AlaTrp: 0.687 ± 0.0
0.687AlaTyr: 0.687 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
2.405CysAla: 2.405 ± 0.0
0.344CysCys: 0.344 ± 0.0
0.687CysAsp: 0.687 ± 0.0
0.687CysGlu: 0.687 ± 0.0
0.344CysPhe: 0.344 ± 0.0
1.031CysGly: 1.031 ± 0.0
0.344CysHis: 0.344 ± 0.0
1.718CysIle: 1.718 ± 0.0
1.031CysLys: 1.031 ± 0.0
1.031CysLeu: 1.031 ± 0.0
0.344CysMet: 0.344 ± 0.0
0.687CysAsn: 0.687 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.344CysGln: 0.344 ± 0.0
2.061CysArg: 2.061 ± 0.0
1.718CysSer: 1.718 ± 0.0
0.0CysThr: 0.0 ± 0.0
2.405CysVal: 2.405 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.687CysTyr: 0.687 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.092AspAla: 3.092 ± 0.0
1.718AspCys: 1.718 ± 0.0
5.496AspAsp: 5.496 ± 0.0
5.153AspGlu: 5.153 ± 0.0
3.779AspPhe: 3.779 ± 0.0
4.466AspGly: 4.466 ± 0.0
0.687AspHis: 0.687 ± 0.0
5.84AspIle: 5.84 ± 0.0
4.809AspLys: 4.809 ± 0.0
5.153AspLeu: 5.153 ± 0.0
1.031AspMet: 1.031 ± 0.0
3.779AspAsn: 3.779 ± 0.0
3.779AspPro: 3.779 ± 0.0
1.718AspGln: 1.718 ± 0.0
2.061AspArg: 2.061 ± 0.0
3.092AspSer: 3.092 ± 0.0
5.153AspThr: 5.153 ± 0.0
3.779AspVal: 3.779 ± 0.0
0.0AspTrp: 0.0 ± 0.0
1.374AspTyr: 1.374 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
1.718GluAla: 1.718 ± 0.0
1.031GluCys: 1.031 ± 0.0
2.405GluAsp: 2.405 ± 0.0
4.809GluGlu: 4.809 ± 0.0
4.122GluPhe: 4.122 ± 0.0
2.061GluGly: 2.061 ± 0.0
1.374GluHis: 1.374 ± 0.0
2.405GluIle: 2.405 ± 0.0
2.748GluLys: 2.748 ± 0.0
4.809GluLeu: 4.809 ± 0.0
2.748GluMet: 2.748 ± 0.0
2.405GluAsn: 2.405 ± 0.0
1.718GluPro: 1.718 ± 0.0
2.748GluGln: 2.748 ± 0.0
1.374GluArg: 1.374 ± 0.0
1.718GluSer: 1.718 ± 0.0
4.466GluThr: 4.466 ± 0.0
3.435GluVal: 3.435 ± 0.0
2.061GluTrp: 2.061 ± 0.0
3.435GluTyr: 3.435 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.809PheAla: 4.809 ± 0.0
1.031PheCys: 1.031 ± 0.0
5.153PheAsp: 5.153 ± 0.0
1.718PheGlu: 1.718 ± 0.0
0.687PhePhe: 0.687 ± 0.0
3.092PheGly: 3.092 ± 0.0
0.687PheHis: 0.687 ± 0.0
2.748PheIle: 2.748 ± 0.0
1.718PheLys: 1.718 ± 0.0
4.122PheLeu: 4.122 ± 0.0
0.344PheMet: 0.344 ± 0.0
2.061PheAsn: 2.061 ± 0.0
0.687PhePro: 0.687 ± 0.0
2.061PheGln: 2.061 ± 0.0
3.092PheArg: 3.092 ± 0.0
4.122PheSer: 4.122 ± 0.0
3.779PheThr: 3.779 ± 0.0
2.405PheVal: 2.405 ± 0.0
0.687PheTrp: 0.687 ± 0.0
1.718PheTyr: 1.718 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.092GlyAla: 3.092 ± 0.0
0.0GlyCys: 0.0 ± 0.0
6.87GlyAsp: 6.87 ± 0.0
3.092GlyGlu: 3.092 ± 0.0
2.748GlyPhe: 2.748 ± 0.0
3.779GlyGly: 3.779 ± 0.0
1.031GlyHis: 1.031 ± 0.0
4.466GlyIle: 4.466 ± 0.0
3.779GlyLys: 3.779 ± 0.0
6.527GlyLeu: 6.527 ± 0.0
1.031GlyMet: 1.031 ± 0.0
1.374GlyAsn: 1.374 ± 0.0
1.374GlyPro: 1.374 ± 0.0
2.748GlyGln: 2.748 ± 0.0
2.748GlyArg: 2.748 ± 0.0
5.153GlySer: 5.153 ± 0.0
2.748GlyThr: 2.748 ± 0.0
4.809GlyVal: 4.809 ± 0.0
1.031GlyTrp: 1.031 ± 0.0
2.061GlyTyr: 2.061 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.031HisAla: 1.031 ± 0.0
1.374HisCys: 1.374 ± 0.0
0.344HisAsp: 0.344 ± 0.0
1.031HisGlu: 1.031 ± 0.0
2.748HisPhe: 2.748 ± 0.0
1.374HisGly: 1.374 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.344HisIle: 0.344 ± 0.0
1.374HisLys: 1.374 ± 0.0
2.061HisLeu: 2.061 ± 0.0
0.687HisMet: 0.687 ± 0.0
0.687HisAsn: 0.687 ± 0.0
1.031HisPro: 1.031 ± 0.0
1.718HisGln: 1.718 ± 0.0
1.031HisArg: 1.031 ± 0.0
2.405HisSer: 2.405 ± 0.0
2.405HisThr: 2.405 ± 0.0
1.374HisVal: 1.374 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.031HisTyr: 1.031 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.779IleAla: 3.779 ± 0.0
2.405IleCys: 2.405 ± 0.0
2.748IleAsp: 2.748 ± 0.0
2.061IleGlu: 2.061 ± 0.0
2.061IlePhe: 2.061 ± 0.0
3.092IleGly: 3.092 ± 0.0
1.374IleHis: 1.374 ± 0.0
1.374IleIle: 1.374 ± 0.0
3.779IleLys: 3.779 ± 0.0
4.122IleLeu: 4.122 ± 0.0
1.374IleMet: 1.374 ± 0.0
2.748IleAsn: 2.748 ± 0.0
2.748IlePro: 2.748 ± 0.0
2.405IleGln: 2.405 ± 0.0
2.405IleArg: 2.405 ± 0.0
4.122IleSer: 4.122 ± 0.0
2.405IleThr: 2.405 ± 0.0
4.122IleVal: 4.122 ± 0.0
1.031IleTrp: 1.031 ± 0.0
3.435IleTyr: 3.435 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.092LysAla: 3.092 ± 0.0
1.718LysCys: 1.718 ± 0.0
3.779LysAsp: 3.779 ± 0.0
3.779LysGlu: 3.779 ± 0.0
3.092LysPhe: 3.092 ± 0.0
3.435LysGly: 3.435 ± 0.0
1.374LysHis: 1.374 ± 0.0
3.092LysIle: 3.092 ± 0.0
3.092LysLys: 3.092 ± 0.0
4.809LysLeu: 4.809 ± 0.0
1.031LysMet: 1.031 ± 0.0
0.687LysAsn: 0.687 ± 0.0
3.779LysPro: 3.779 ± 0.0
2.061LysGln: 2.061 ± 0.0
3.092LysArg: 3.092 ± 0.0
3.092LysSer: 3.092 ± 0.0
5.153LysThr: 5.153 ± 0.0
5.153LysVal: 5.153 ± 0.0
1.718LysTrp: 1.718 ± 0.0
4.466LysTyr: 4.466 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.87LeuAla: 6.87 ± 0.0
1.374LeuCys: 1.374 ± 0.0
3.435LeuAsp: 3.435 ± 0.0
3.435LeuGlu: 3.435 ± 0.0
3.779LeuPhe: 3.779 ± 0.0
7.558LeuGly: 7.558 ± 0.0
2.405LeuHis: 2.405 ± 0.0
2.405LeuIle: 2.405 ± 0.0
6.183LeuLys: 6.183 ± 0.0
7.558LeuLeu: 7.558 ± 0.0
3.435LeuMet: 3.435 ± 0.0
4.122LeuAsn: 4.122 ± 0.0
3.435LeuPro: 3.435 ± 0.0
3.779LeuGln: 3.779 ± 0.0
4.809LeuArg: 4.809 ± 0.0
6.183LeuSer: 6.183 ± 0.0
7.214LeuThr: 7.214 ± 0.0
7.558LeuVal: 7.558 ± 0.0
1.718LeuTrp: 1.718 ± 0.0
1.718LeuTyr: 1.718 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
0.687MetAla: 0.687 ± 0.0
1.031MetCys: 1.031 ± 0.0
1.374MetAsp: 1.374 ± 0.0
2.405MetGlu: 2.405 ± 0.0
0.687MetPhe: 0.687 ± 0.0
1.031MetGly: 1.031 ± 0.0
1.031MetHis: 1.031 ± 0.0
2.405MetIle: 2.405 ± 0.0
0.687MetLys: 0.687 ± 0.0
2.405MetLeu: 2.405 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.031MetAsn: 1.031 ± 0.0
0.687MetPro: 0.687 ± 0.0
0.344MetGln: 0.344 ± 0.0
1.374MetArg: 1.374 ± 0.0
1.718MetSer: 1.718 ± 0.0
3.435MetThr: 3.435 ± 0.0
3.435MetVal: 3.435 ± 0.0
0.687MetTrp: 0.687 ± 0.0
0.687MetTyr: 0.687 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.061AsnAla: 2.061 ± 0.0
1.031AsnCys: 1.031 ± 0.0
0.687AsnAsp: 0.687 ± 0.0
4.122AsnGlu: 4.122 ± 0.0
2.748AsnPhe: 2.748 ± 0.0
2.405AsnGly: 2.405 ± 0.0
1.718AsnHis: 1.718 ± 0.0
2.061AsnIle: 2.061 ± 0.0
2.405AsnLys: 2.405 ± 0.0
5.84AsnLeu: 5.84 ± 0.0
1.718AsnMet: 1.718 ± 0.0
1.718AsnAsn: 1.718 ± 0.0
4.122AsnPro: 4.122 ± 0.0
1.031AsnGln: 1.031 ± 0.0
1.374AsnArg: 1.374 ± 0.0
4.809AsnSer: 4.809 ± 0.0
1.374AsnThr: 1.374 ± 0.0
2.748AsnVal: 2.748 ± 0.0
0.0AsnTrp: 0.0 ± 0.0
0.687AsnTyr: 0.687 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.061ProAla: 2.061 ± 0.0
0.344ProCys: 0.344 ± 0.0
2.748ProAsp: 2.748 ± 0.0
2.061ProGlu: 2.061 ± 0.0
1.031ProPhe: 1.031 ± 0.0
1.718ProGly: 1.718 ± 0.0
1.031ProHis: 1.031 ± 0.0
2.061ProIle: 2.061 ± 0.0
2.405ProLys: 2.405 ± 0.0
5.496ProLeu: 5.496 ± 0.0
2.061ProMet: 2.061 ± 0.0
1.374ProAsn: 1.374 ± 0.0
2.405ProPro: 2.405 ± 0.0
3.092ProGln: 3.092 ± 0.0
0.687ProArg: 0.687 ± 0.0
3.435ProSer: 3.435 ± 0.0
3.435ProThr: 3.435 ± 0.0
4.122ProVal: 4.122 ± 0.0
1.031ProTrp: 1.031 ± 0.0
2.061ProTyr: 2.061 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.435GlnAla: 3.435 ± 0.0
0.0GlnCys: 0.0 ± 0.0
2.061GlnAsp: 2.061 ± 0.0
2.405GlnGlu: 2.405 ± 0.0
2.405GlnPhe: 2.405 ± 0.0
2.405GlnGly: 2.405 ± 0.0
0.344GlnHis: 0.344 ± 0.0
3.092GlnIle: 3.092 ± 0.0
2.748GlnLys: 2.748 ± 0.0
5.153GlnLeu: 5.153 ± 0.0
1.718GlnMet: 1.718 ± 0.0
1.374GlnAsn: 1.374 ± 0.0
1.718GlnPro: 1.718 ± 0.0
1.374GlnGln: 1.374 ± 0.0
1.031GlnArg: 1.031 ± 0.0
2.405GlnSer: 2.405 ± 0.0
1.718GlnThr: 1.718 ± 0.0
4.466GlnVal: 4.466 ± 0.0
1.374GlnTrp: 1.374 ± 0.0
1.031GlnTyr: 1.031 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.718ArgAla: 1.718 ± 0.0
1.031ArgCys: 1.031 ± 0.0
2.405ArgAsp: 2.405 ± 0.0
2.748ArgGlu: 2.748 ± 0.0
2.405ArgPhe: 2.405 ± 0.0
2.405ArgGly: 2.405 ± 0.0
1.718ArgHis: 1.718 ± 0.0
1.374ArgIle: 1.374 ± 0.0
3.435ArgLys: 3.435 ± 0.0
2.748ArgLeu: 2.748 ± 0.0
1.718ArgMet: 1.718 ± 0.0
1.031ArgAsn: 1.031 ± 0.0
2.748ArgPro: 2.748 ± 0.0
1.718ArgGln: 1.718 ± 0.0
2.061ArgArg: 2.061 ± 0.0
1.718ArgSer: 1.718 ± 0.0
2.405ArgThr: 2.405 ± 0.0
4.466ArgVal: 4.466 ± 0.0
1.031ArgTrp: 1.031 ± 0.0
2.405ArgTyr: 2.405 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.466SerAla: 4.466 ± 0.0
1.031SerCys: 1.031 ± 0.0
6.527SerAsp: 6.527 ± 0.0
1.031SerGlu: 1.031 ± 0.0
4.466SerPhe: 4.466 ± 0.0
3.779SerGly: 3.779 ± 0.0
0.687SerHis: 0.687 ± 0.0
4.466SerIle: 4.466 ± 0.0
5.84SerLys: 5.84 ± 0.0
5.153SerLeu: 5.153 ± 0.0
1.718SerMet: 1.718 ± 0.0
2.748SerAsn: 2.748 ± 0.0
2.405SerPro: 2.405 ± 0.0
5.153SerGln: 5.153 ± 0.0
2.748SerArg: 2.748 ± 0.0
6.87SerSer: 6.87 ± 0.0
2.748SerThr: 2.748 ± 0.0
6.87SerVal: 6.87 ± 0.0
0.687SerTrp: 0.687 ± 0.0
1.718SerTyr: 1.718 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.153ThrAla: 5.153 ± 0.0
0.687ThrCys: 0.687 ± 0.0
3.092ThrAsp: 3.092 ± 0.0
3.092ThrGlu: 3.092 ± 0.0
3.092ThrPhe: 3.092 ± 0.0
3.779ThrGly: 3.779 ± 0.0
0.687ThrHis: 0.687 ± 0.0
5.153ThrIle: 5.153 ± 0.0
4.466ThrLys: 4.466 ± 0.0
6.183ThrLeu: 6.183 ± 0.0
1.718ThrMet: 1.718 ± 0.0
2.748ThrAsn: 2.748 ± 0.0
4.122ThrPro: 4.122 ± 0.0
2.405ThrGln: 2.405 ± 0.0
4.122ThrArg: 4.122 ± 0.0
3.779ThrSer: 3.779 ± 0.0
2.405ThrThr: 2.405 ± 0.0
5.153ThrVal: 5.153 ± 0.0
0.0ThrTrp: 0.0 ± 0.0
1.031ThrTyr: 1.031 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
2.748ValAla: 2.748 ± 0.0
0.344ValCys: 0.344 ± 0.0
7.901ValAsp: 7.901 ± 0.0
5.496ValGlu: 5.496 ± 0.0
2.061ValPhe: 2.061 ± 0.0
4.122ValGly: 4.122 ± 0.0
3.779ValHis: 3.779 ± 0.0
2.405ValIle: 2.405 ± 0.0
5.496ValLys: 5.496 ± 0.0
4.809ValLeu: 4.809 ± 0.0
1.374ValMet: 1.374 ± 0.0
6.183ValAsn: 6.183 ± 0.0
5.153ValPro: 5.153 ± 0.0
3.092ValGln: 3.092 ± 0.0
3.435ValArg: 3.435 ± 0.0
7.214ValSer: 7.214 ± 0.0
4.466ValThr: 4.466 ± 0.0
7.214ValVal: 7.214 ± 0.0
1.374ValTrp: 1.374 ± 0.0
3.092ValTyr: 3.092 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.031TrpAsp: 1.031 ± 0.0
0.344TrpGlu: 0.344 ± 0.0
0.344TrpPhe: 0.344 ± 0.0
0.687TrpGly: 0.687 ± 0.0
0.344TrpHis: 0.344 ± 0.0
1.718TrpIle: 1.718 ± 0.0
1.031TrpLys: 1.031 ± 0.0
2.405TrpLeu: 2.405 ± 0.0
1.031TrpMet: 1.031 ± 0.0
1.031TrpAsn: 1.031 ± 0.0
0.344TrpPro: 0.344 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.374TrpArg: 1.374 ± 0.0
1.718TrpSer: 1.718 ± 0.0
1.374TrpThr: 1.374 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.344TrpTrp: 0.344 ± 0.0
1.374TrpTyr: 1.374 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.718TyrAla: 1.718 ± 0.0
0.344TyrCys: 0.344 ± 0.0
3.435TyrAsp: 3.435 ± 0.0
2.061TyrGlu: 2.061 ± 0.0
1.718TyrPhe: 1.718 ± 0.0
3.092TyrGly: 3.092 ± 0.0
1.374TyrHis: 1.374 ± 0.0
2.061TyrIle: 2.061 ± 0.0
1.374TyrLys: 1.374 ± 0.0
2.405TyrLeu: 2.405 ± 0.0
0.0TyrMet: 0.0 ± 0.0
1.718TyrAsn: 1.718 ± 0.0
0.687TyrPro: 0.687 ± 0.0
1.374TyrGln: 1.374 ± 0.0
1.031TyrArg: 1.031 ± 0.0
2.405TyrSer: 2.405 ± 0.0
3.092TyrThr: 3.092 ± 0.0
3.779TyrVal: 3.779 ± 0.0
1.031TyrTrp: 1.031 ± 0.0
1.374TyrTyr: 1.374 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2912 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski