Amino acid dipepetide frequency for Basavirus sp.

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.092AlaAla: 3.092 ± 0.0
0.344AlaCys: 0.344 ± 0.0
3.092AlaAsp: 3.092 ± 0.0
4.809AlaGlu: 4.809 ± 0.0
4.809AlaPhe: 4.809 ± 0.0
3.435AlaGly: 3.435 ± 0.0
0.687AlaHis: 0.687 ± 0.0
3.435AlaIle: 3.435 ± 0.0
1.718AlaLys: 1.718 ± 0.0
3.435AlaLeu: 3.435 ± 0.0
1.031AlaMet: 1.031 ± 0.0
3.092AlaAsn: 3.092 ± 0.0
2.405AlaPro: 2.405 ± 0.0
1.374AlaGln: 1.374 ± 0.0
2.748AlaArg: 2.748 ± 0.0
5.153AlaSer: 5.153 ± 0.0
3.092AlaThr: 3.092 ± 0.0
3.779AlaVal: 3.779 ± 0.0
1.374AlaTrp: 1.374 ± 0.0
2.405AlaTyr: 2.405 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.031CysAla: 1.031 ± 0.0
0.344CysCys: 0.344 ± 0.0
1.374CysAsp: 1.374 ± 0.0
2.405CysGlu: 2.405 ± 0.0
0.344CysPhe: 0.344 ± 0.0
0.687CysGly: 0.687 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.031CysIle: 1.031 ± 0.0
0.344CysLys: 0.344 ± 0.0
2.061CysLeu: 2.061 ± 0.0
1.031CysMet: 1.031 ± 0.0
1.374CysAsn: 1.374 ± 0.0
1.374CysPro: 1.374 ± 0.0
0.687CysGln: 0.687 ± 0.0
1.374CysArg: 1.374 ± 0.0
2.061CysSer: 2.061 ± 0.0
0.687CysThr: 0.687 ± 0.0
0.344CysVal: 0.344 ± 0.0
0.344CysTrp: 0.344 ± 0.0
0.344CysTyr: 0.344 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.374AspAla: 1.374 ± 0.0
1.374AspCys: 1.374 ± 0.0
3.435AspAsp: 3.435 ± 0.0
4.809AspGlu: 4.809 ± 0.0
2.748AspPhe: 2.748 ± 0.0
2.748AspGly: 2.748 ± 0.0
1.718AspHis: 1.718 ± 0.0
4.122AspIle: 4.122 ± 0.0
2.405AspLys: 2.405 ± 0.0
3.779AspLeu: 3.779 ± 0.0
1.718AspMet: 1.718 ± 0.0
2.748AspAsn: 2.748 ± 0.0
3.435AspPro: 3.435 ± 0.0
1.374AspGln: 1.374 ± 0.0
2.748AspArg: 2.748 ± 0.0
3.779AspSer: 3.779 ± 0.0
3.435AspThr: 3.435 ± 0.0
5.153AspVal: 5.153 ± 0.0
0.344AspTrp: 0.344 ± 0.0
1.718AspTyr: 1.718 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.779GluAla: 3.779 ± 0.0
0.687GluCys: 0.687 ± 0.0
5.84GluAsp: 5.84 ± 0.0
7.214GluGlu: 7.214 ± 0.0
3.435GluPhe: 3.435 ± 0.0
4.122GluGly: 4.122 ± 0.0
0.687GluHis: 0.687 ± 0.0
4.809GluIle: 4.809 ± 0.0
5.84GluLys: 5.84 ± 0.0
4.466GluLeu: 4.466 ± 0.0
3.092GluMet: 3.092 ± 0.0
3.779GluAsn: 3.779 ± 0.0
2.748GluPro: 2.748 ± 0.0
2.748GluGln: 2.748 ± 0.0
2.061GluArg: 2.061 ± 0.0
4.809GluSer: 4.809 ± 0.0
3.092GluThr: 3.092 ± 0.0
3.092GluVal: 3.092 ± 0.0
0.344GluTrp: 0.344 ± 0.0
2.405GluTyr: 2.405 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.122PheAla: 4.122 ± 0.0
0.687PheCys: 0.687 ± 0.0
3.435PheAsp: 3.435 ± 0.0
2.405PheGlu: 2.405 ± 0.0
2.748PhePhe: 2.748 ± 0.0
3.779PheGly: 3.779 ± 0.0
1.374PheHis: 1.374 ± 0.0
3.779PheIle: 3.779 ± 0.0
2.061PheLys: 2.061 ± 0.0
5.153PheLeu: 5.153 ± 0.0
1.031PheMet: 1.031 ± 0.0
3.092PheAsn: 3.092 ± 0.0
1.718PhePro: 1.718 ± 0.0
1.374PheGln: 1.374 ± 0.0
1.718PheArg: 1.718 ± 0.0
3.092PheSer: 3.092 ± 0.0
4.122PheThr: 4.122 ± 0.0
5.153PheVal: 5.153 ± 0.0
1.374PheTrp: 1.374 ± 0.0
1.374PheTyr: 1.374 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.435GlyAla: 3.435 ± 0.0
0.687GlyCys: 0.687 ± 0.0
4.466GlyAsp: 4.466 ± 0.0
3.092GlyGlu: 3.092 ± 0.0
3.092GlyPhe: 3.092 ± 0.0
2.405GlyGly: 2.405 ± 0.0
0.687GlyHis: 0.687 ± 0.0
4.122GlyIle: 4.122 ± 0.0
2.748GlyLys: 2.748 ± 0.0
6.527GlyLeu: 6.527 ± 0.0
1.374GlyMet: 1.374 ± 0.0
2.748GlyAsn: 2.748 ± 0.0
1.718GlyPro: 1.718 ± 0.0
1.374GlyGln: 1.374 ± 0.0
2.748GlyArg: 2.748 ± 0.0
3.779GlySer: 3.779 ± 0.0
5.496GlyThr: 5.496 ± 0.0
2.061GlyVal: 2.061 ± 0.0
0.687GlyTrp: 0.687 ± 0.0
3.435GlyTyr: 3.435 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.687HisAla: 0.687 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.687HisAsp: 0.687 ± 0.0
1.031HisGlu: 1.031 ± 0.0
1.374HisPhe: 1.374 ± 0.0
0.344HisGly: 0.344 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.687HisIle: 0.687 ± 0.0
1.374HisLys: 1.374 ± 0.0
1.718HisLeu: 1.718 ± 0.0
0.344HisMet: 0.344 ± 0.0
0.687HisAsn: 0.687 ± 0.0
0.687HisPro: 0.687 ± 0.0
0.344HisGln: 0.344 ± 0.0
2.061HisArg: 2.061 ± 0.0
2.061HisSer: 2.061 ± 0.0
0.344HisThr: 0.344 ± 0.0
2.061HisVal: 2.061 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.344HisTyr: 0.344 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.153IleAla: 5.153 ± 0.0
1.718IleCys: 1.718 ± 0.0
4.122IleAsp: 4.122 ± 0.0
3.779IleGlu: 3.779 ± 0.0
3.092IlePhe: 3.092 ± 0.0
2.405IleGly: 2.405 ± 0.0
1.718IleHis: 1.718 ± 0.0
4.122IleIle: 4.122 ± 0.0
5.153IleLys: 5.153 ± 0.0
5.153IleLeu: 5.153 ± 0.0
1.718IleMet: 1.718 ± 0.0
6.183IleAsn: 6.183 ± 0.0
3.779IlePro: 3.779 ± 0.0
2.405IleGln: 2.405 ± 0.0
3.435IleArg: 3.435 ± 0.0
5.496IleSer: 5.496 ± 0.0
7.214IleThr: 7.214 ± 0.0
3.435IleVal: 3.435 ± 0.0
0.344IleTrp: 0.344 ± 0.0
2.748IleTyr: 2.748 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.405LysAla: 2.405 ± 0.0
1.031LysCys: 1.031 ± 0.0
5.153LysAsp: 5.153 ± 0.0
3.435LysGlu: 3.435 ± 0.0
4.466LysPhe: 4.466 ± 0.0
3.435LysGly: 3.435 ± 0.0
1.031LysHis: 1.031 ± 0.0
6.87LysIle: 6.87 ± 0.0
4.809LysLys: 4.809 ± 0.0
3.779LysLeu: 3.779 ± 0.0
1.374LysMet: 1.374 ± 0.0
4.809LysAsn: 4.809 ± 0.0
1.031LysPro: 1.031 ± 0.0
1.374LysGln: 1.374 ± 0.0
1.718LysArg: 1.718 ± 0.0
2.748LysSer: 2.748 ± 0.0
2.748LysThr: 2.748 ± 0.0
3.092LysVal: 3.092 ± 0.0
0.344LysTrp: 0.344 ± 0.0
1.031LysTyr: 1.031 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.466LeuAla: 4.466 ± 0.0
1.374LeuCys: 1.374 ± 0.0
2.061LeuAsp: 2.061 ± 0.0
3.779LeuGlu: 3.779 ± 0.0
3.092LeuPhe: 3.092 ± 0.0
4.122LeuGly: 4.122 ± 0.0
0.0LeuHis: 0.0 ± 0.0
5.496LeuIle: 5.496 ± 0.0
6.183LeuLys: 6.183 ± 0.0
5.496LeuLeu: 5.496 ± 0.0
2.405LeuMet: 2.405 ± 0.0
6.527LeuAsn: 6.527 ± 0.0
4.809LeuPro: 4.809 ± 0.0
2.405LeuGln: 2.405 ± 0.0
4.122LeuArg: 4.122 ± 0.0
6.527LeuSer: 6.527 ± 0.0
5.153LeuThr: 5.153 ± 0.0
5.153LeuVal: 5.153 ± 0.0
1.031LeuTrp: 1.031 ± 0.0
2.748LeuTyr: 2.748 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
0.687MetAla: 0.687 ± 0.0
0.344MetCys: 0.344 ± 0.0
0.687MetAsp: 0.687 ± 0.0
0.344MetGlu: 0.344 ± 0.0
1.031MetPhe: 1.031 ± 0.0
1.374MetGly: 1.374 ± 0.0
0.344MetHis: 0.344 ± 0.0
3.435MetIle: 3.435 ± 0.0
2.405MetLys: 2.405 ± 0.0
1.718MetLeu: 1.718 ± 0.0
1.031MetMet: 1.031 ± 0.0
2.748MetAsn: 2.748 ± 0.0
3.092MetPro: 3.092 ± 0.0
1.031MetGln: 1.031 ± 0.0
1.031MetArg: 1.031 ± 0.0
1.718MetSer: 1.718 ± 0.0
1.718MetThr: 1.718 ± 0.0
1.718MetVal: 1.718 ± 0.0
0.344MetTrp: 0.344 ± 0.0
1.374MetTyr: 1.374 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.718AsnAla: 1.718 ± 0.0
2.061AsnCys: 2.061 ± 0.0
3.092AsnAsp: 3.092 ± 0.0
3.779AsnGlu: 3.779 ± 0.0
2.061AsnPhe: 2.061 ± 0.0
4.466AsnGly: 4.466 ± 0.0
2.061AsnHis: 2.061 ± 0.0
3.779AsnIle: 3.779 ± 0.0
3.779AsnLys: 3.779 ± 0.0
2.748AsnLeu: 2.748 ± 0.0
2.405AsnMet: 2.405 ± 0.0
3.435AsnAsn: 3.435 ± 0.0
3.092AsnPro: 3.092 ± 0.0
2.061AsnGln: 2.061 ± 0.0
4.122AsnArg: 4.122 ± 0.0
3.092AsnSer: 3.092 ± 0.0
4.122AsnThr: 4.122 ± 0.0
6.183AsnVal: 6.183 ± 0.0
0.687AsnTrp: 0.687 ± 0.0
2.748AsnTyr: 2.748 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.374ProAla: 1.374 ± 0.0
0.344ProCys: 0.344 ± 0.0
2.061ProAsp: 2.061 ± 0.0
3.092ProGlu: 3.092 ± 0.0
3.435ProPhe: 3.435 ± 0.0
3.092ProGly: 3.092 ± 0.0
0.344ProHis: 0.344 ± 0.0
3.779ProIle: 3.779 ± 0.0
1.718ProLys: 1.718 ± 0.0
4.809ProLeu: 4.809 ± 0.0
1.031ProMet: 1.031 ± 0.0
1.374ProAsn: 1.374 ± 0.0
2.061ProPro: 2.061 ± 0.0
3.779ProGln: 3.779 ± 0.0
3.435ProArg: 3.435 ± 0.0
2.748ProSer: 2.748 ± 0.0
4.466ProThr: 4.466 ± 0.0
3.435ProVal: 3.435 ± 0.0
0.344ProTrp: 0.344 ± 0.0
1.718ProTyr: 1.718 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
0.687GlnAla: 0.687 ± 0.0
0.344GlnCys: 0.344 ± 0.0
1.031GlnAsp: 1.031 ± 0.0
2.405GlnGlu: 2.405 ± 0.0
2.061GlnPhe: 2.061 ± 0.0
1.374GlnGly: 1.374 ± 0.0
0.344GlnHis: 0.344 ± 0.0
1.718GlnIle: 1.718 ± 0.0
1.718GlnLys: 1.718 ± 0.0
1.718GlnLeu: 1.718 ± 0.0
0.344GlnMet: 0.344 ± 0.0
3.092GlnAsn: 3.092 ± 0.0
2.061GlnPro: 2.061 ± 0.0
1.031GlnGln: 1.031 ± 0.0
2.061GlnArg: 2.061 ± 0.0
0.687GlnSer: 0.687 ± 0.0
2.748GlnThr: 2.748 ± 0.0
1.718GlnVal: 1.718 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
1.718GlnTyr: 1.718 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.122ArgAla: 4.122 ± 0.0
1.718ArgCys: 1.718 ± 0.0
1.718ArgAsp: 1.718 ± 0.0
4.809ArgGlu: 4.809 ± 0.0
2.748ArgPhe: 2.748 ± 0.0
3.435ArgGly: 3.435 ± 0.0
1.031ArgHis: 1.031 ± 0.0
2.405ArgIle: 2.405 ± 0.0
2.405ArgLys: 2.405 ± 0.0
4.809ArgLeu: 4.809 ± 0.0
2.061ArgMet: 2.061 ± 0.0
3.092ArgAsn: 3.092 ± 0.0
1.718ArgPro: 1.718 ± 0.0
1.031ArgGln: 1.031 ± 0.0
3.435ArgArg: 3.435 ± 0.0
2.061ArgSer: 2.061 ± 0.0
1.718ArgThr: 1.718 ± 0.0
4.466ArgVal: 4.466 ± 0.0
0.687ArgTrp: 0.687 ± 0.0
2.405ArgTyr: 2.405 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.122SerAla: 4.122 ± 0.0
1.718SerCys: 1.718 ± 0.0
4.809SerAsp: 4.809 ± 0.0
6.183SerGlu: 6.183 ± 0.0
2.405SerPhe: 2.405 ± 0.0
4.466SerGly: 4.466 ± 0.0
1.718SerHis: 1.718 ± 0.0
3.092SerIle: 3.092 ± 0.0
3.779SerLys: 3.779 ± 0.0
6.183SerLeu: 6.183 ± 0.0
2.405SerMet: 2.405 ± 0.0
3.435SerAsn: 3.435 ± 0.0
3.092SerPro: 3.092 ± 0.0
0.344SerGln: 0.344 ± 0.0
4.809SerArg: 4.809 ± 0.0
5.496SerSer: 5.496 ± 0.0
3.435SerThr: 3.435 ± 0.0
4.809SerVal: 4.809 ± 0.0
0.344SerTrp: 0.344 ± 0.0
3.435SerTyr: 3.435 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.122ThrAla: 4.122 ± 0.0
2.061ThrCys: 2.061 ± 0.0
3.779ThrAsp: 3.779 ± 0.0
3.435ThrGlu: 3.435 ± 0.0
4.809ThrPhe: 4.809 ± 0.0
2.748ThrGly: 2.748 ± 0.0
1.031ThrHis: 1.031 ± 0.0
7.558ThrIle: 7.558 ± 0.0
1.718ThrLys: 1.718 ± 0.0
3.092ThrLeu: 3.092 ± 0.0
1.718ThrMet: 1.718 ± 0.0
3.779ThrAsn: 3.779 ± 0.0
3.779ThrPro: 3.779 ± 0.0
2.061ThrGln: 2.061 ± 0.0
3.092ThrArg: 3.092 ± 0.0
4.466ThrSer: 4.466 ± 0.0
5.84ThrThr: 5.84 ± 0.0
4.122ThrVal: 4.122 ± 0.0
1.031ThrTrp: 1.031 ± 0.0
1.374ThrTyr: 1.374 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
6.527ValAla: 6.527 ± 0.0
1.374ValCys: 1.374 ± 0.0
2.405ValAsp: 2.405 ± 0.0
4.809ValGlu: 4.809 ± 0.0
4.122ValPhe: 4.122 ± 0.0
3.779ValGly: 3.779 ± 0.0
1.031ValHis: 1.031 ± 0.0
5.153ValIle: 5.153 ± 0.0
3.779ValLys: 3.779 ± 0.0
5.153ValLeu: 5.153 ± 0.0
1.718ValMet: 1.718 ± 0.0
3.779ValAsn: 3.779 ± 0.0
3.779ValPro: 3.779 ± 0.0
1.718ValGln: 1.718 ± 0.0
1.718ValArg: 1.718 ± 0.0
4.809ValSer: 4.809 ± 0.0
2.748ValThr: 2.748 ± 0.0
2.748ValVal: 2.748 ± 0.0
0.0ValTrp: 0.0 ± 0.0
3.435ValTyr: 3.435 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.374TrpAla: 1.374 ± 0.0
0.344TrpCys: 0.344 ± 0.0
0.344TrpAsp: 0.344 ± 0.0
0.687TrpGlu: 0.687 ± 0.0
0.344TrpPhe: 0.344 ± 0.0
1.031TrpGly: 1.031 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.031TrpIle: 1.031 ± 0.0
0.344TrpLys: 0.344 ± 0.0
0.687TrpLeu: 0.687 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.687TrpAsn: 0.687 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.718TrpArg: 1.718 ± 0.0
1.031TrpSer: 1.031 ± 0.0
0.344TrpThr: 0.344 ± 0.0
0.687TrpVal: 0.687 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.374TyrAla: 1.374 ± 0.0
0.687TyrCys: 0.687 ± 0.0
1.374TyrAsp: 1.374 ± 0.0
2.748TyrGlu: 2.748 ± 0.0
1.374TyrPhe: 1.374 ± 0.0
3.435TyrGly: 3.435 ± 0.0
1.031TyrHis: 1.031 ± 0.0
2.748TyrIle: 2.748 ± 0.0
2.405TyrLys: 2.405 ± 0.0
4.466TyrLeu: 4.466 ± 0.0
0.0TyrMet: 0.0 ± 0.0
1.031TyrAsn: 1.031 ± 0.0
2.061TyrPro: 2.061 ± 0.0
0.344TyrGln: 0.344 ± 0.0
1.718TyrArg: 1.718 ± 0.0
4.466TyrSer: 4.466 ± 0.0
3.092TyrThr: 3.092 ± 0.0
1.374TyrVal: 1.374 ± 0.0
1.031TyrTrp: 1.031 ± 0.0
1.031TyrTyr: 1.031 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2912 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski