Amino acid dipepetide frequency for Agaricus bisporus endornavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.083AlaAla: 3.083 ± 0.0
0.237AlaCys: 0.237 ± 0.0
3.083AlaAsp: 3.083 ± 0.0
2.372AlaGlu: 2.372 ± 0.0
1.898AlaPhe: 1.898 ± 0.0
3.321AlaGly: 3.321 ± 0.0
0.949AlaHis: 0.949 ± 0.0
2.846AlaIle: 2.846 ± 0.0
3.558AlaLys: 3.558 ± 0.0
4.981AlaLeu: 4.981 ± 0.0
1.423AlaMet: 1.423 ± 0.0
4.981AlaAsn: 4.981 ± 0.0
0.712AlaPro: 0.712 ± 0.0
0.949AlaGln: 0.949 ± 0.0
1.423AlaArg: 1.423 ± 0.0
1.66AlaSer: 1.66 ± 0.0
3.558AlaThr: 3.558 ± 0.0
4.032AlaVal: 4.032 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
1.898AlaTyr: 1.898 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.66CysAla: 1.66 ± 0.0
1.186CysCys: 1.186 ± 0.0
0.949CysAsp: 0.949 ± 0.0
1.898CysGlu: 1.898 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.423CysGly: 1.423 ± 0.0
1.423CysHis: 1.423 ± 0.0
0.949CysIle: 0.949 ± 0.0
0.949CysLys: 0.949 ± 0.0
2.609CysLeu: 2.609 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.712CysAsn: 0.712 ± 0.0
0.949CysPro: 0.949 ± 0.0
0.237CysGln: 0.237 ± 0.0
0.474CysArg: 0.474 ± 0.0
0.474CysSer: 0.474 ± 0.0
2.135CysThr: 2.135 ± 0.0
2.609CysVal: 2.609 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.186CysTyr: 1.186 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.898AspAla: 1.898 ± 0.0
1.186AspCys: 1.186 ± 0.0
3.083AspAsp: 3.083 ± 0.0
4.507AspGlu: 4.507 ± 0.0
3.321AspPhe: 3.321 ± 0.0
2.609AspGly: 2.609 ± 0.0
0.949AspHis: 0.949 ± 0.0
4.744AspIle: 4.744 ± 0.0
4.507AspLys: 4.507 ± 0.0
4.981AspLeu: 4.981 ± 0.0
1.66AspMet: 1.66 ± 0.0
4.507AspAsn: 4.507 ± 0.0
1.898AspPro: 1.898 ± 0.0
1.186AspGln: 1.186 ± 0.0
2.609AspArg: 2.609 ± 0.0
5.455AspSer: 5.455 ± 0.0
2.609AspThr: 2.609 ± 0.0
2.609AspVal: 2.609 ± 0.0
0.712AspTrp: 0.712 ± 0.0
3.083AspTyr: 3.083 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.846GluAla: 2.846 ± 0.0
0.949GluCys: 0.949 ± 0.0
2.846GluAsp: 2.846 ± 0.0
4.269GluGlu: 4.269 ± 0.0
2.372GluPhe: 2.372 ± 0.0
1.898GluGly: 1.898 ± 0.0
2.135GluHis: 2.135 ± 0.0
4.744GluIle: 4.744 ± 0.0
4.269GluLys: 4.269 ± 0.0
9.725GluLeu: 9.725 ± 0.0
2.372GluMet: 2.372 ± 0.0
3.321GluAsn: 3.321 ± 0.0
2.135GluPro: 2.135 ± 0.0
3.321GluGln: 3.321 ± 0.0
1.898GluArg: 1.898 ± 0.0
2.846GluSer: 2.846 ± 0.0
5.693GluThr: 5.693 ± 0.0
3.083GluVal: 3.083 ± 0.0
0.237GluTrp: 0.237 ± 0.0
1.66GluTyr: 1.66 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.423PheAla: 1.423 ± 0.0
0.712PheCys: 0.712 ± 0.0
3.558PheAsp: 3.558 ± 0.0
3.321PheGlu: 3.321 ± 0.0
0.712PhePhe: 0.712 ± 0.0
2.846PheGly: 2.846 ± 0.0
0.237PheHis: 0.237 ± 0.0
1.186PheIle: 1.186 ± 0.0
3.558PheLys: 3.558 ± 0.0
3.083PheLeu: 3.083 ± 0.0
0.237PheMet: 0.237 ± 0.0
4.269PheAsn: 4.269 ± 0.0
0.474PhePro: 0.474 ± 0.0
0.237PheGln: 0.237 ± 0.0
1.898PheArg: 1.898 ± 0.0
1.423PheSer: 1.423 ± 0.0
3.795PheThr: 3.795 ± 0.0
3.321PheVal: 3.321 ± 0.0
0.949PheTrp: 0.949 ± 0.0
0.949PheTyr: 0.949 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.269GlyAla: 4.269 ± 0.0
0.712GlyCys: 0.712 ± 0.0
1.898GlyAsp: 1.898 ± 0.0
3.558GlyGlu: 3.558 ± 0.0
1.898GlyPhe: 1.898 ± 0.0
2.135GlyGly: 2.135 ± 0.0
0.949GlyHis: 0.949 ± 0.0
4.269GlyIle: 4.269 ± 0.0
1.66GlyLys: 1.66 ± 0.0
4.507GlyLeu: 4.507 ± 0.0
1.186GlyMet: 1.186 ± 0.0
2.372GlyAsn: 2.372 ± 0.0
1.423GlyPro: 1.423 ± 0.0
1.186GlyGln: 1.186 ± 0.0
1.423GlyArg: 1.423 ± 0.0
3.795GlySer: 3.795 ± 0.0
1.898GlyThr: 1.898 ± 0.0
4.981GlyVal: 4.981 ± 0.0
0.712GlyTrp: 0.712 ± 0.0
2.609GlyTyr: 2.609 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.898HisAla: 1.898 ± 0.0
1.66HisCys: 1.66 ± 0.0
1.66HisAsp: 1.66 ± 0.0
2.846HisGlu: 2.846 ± 0.0
0.237HisPhe: 0.237 ± 0.0
0.474HisGly: 0.474 ± 0.0
0.474HisHis: 0.474 ± 0.0
0.949HisIle: 0.949 ± 0.0
3.558HisLys: 3.558 ± 0.0
1.423HisLeu: 1.423 ± 0.0
0.237HisMet: 0.237 ± 0.0
3.321HisAsn: 3.321 ± 0.0
1.423HisPro: 1.423 ± 0.0
0.949HisGln: 0.949 ± 0.0
0.949HisArg: 0.949 ± 0.0
1.423HisSer: 1.423 ± 0.0
2.609HisThr: 2.609 ± 0.0
1.898HisVal: 1.898 ± 0.0
0.474HisTrp: 0.474 ± 0.0
2.135HisTyr: 2.135 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.846IleAla: 2.846 ± 0.0
2.609IleCys: 2.609 ± 0.0
5.93IleAsp: 5.93 ± 0.0
4.507IleGlu: 4.507 ± 0.0
2.846IlePhe: 2.846 ± 0.0
3.558IleGly: 3.558 ± 0.0
1.423IleHis: 1.423 ± 0.0
4.507IleIle: 4.507 ± 0.0
4.981IleLys: 4.981 ± 0.0
4.744IleLeu: 4.744 ± 0.0
1.66IleMet: 1.66 ± 0.0
3.321IleAsn: 3.321 ± 0.0
1.898IlePro: 1.898 ± 0.0
2.372IleGln: 2.372 ± 0.0
2.135IleArg: 2.135 ± 0.0
5.455IleSer: 5.455 ± 0.0
7.353IleThr: 7.353 ± 0.0
4.744IleVal: 4.744 ± 0.0
0.474IleTrp: 0.474 ± 0.0
1.898IleTyr: 1.898 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
1.898LysAla: 1.898 ± 0.0
1.423LysCys: 1.423 ± 0.0
1.898LysAsp: 1.898 ± 0.0
3.321LysGlu: 3.321 ± 0.0
3.321LysPhe: 3.321 ± 0.0
2.609LysGly: 2.609 ± 0.0
2.846LysHis: 2.846 ± 0.0
5.455LysIle: 5.455 ± 0.0
4.981LysLys: 4.981 ± 0.0
8.776LysLeu: 8.776 ± 0.0
2.372LysMet: 2.372 ± 0.0
3.321LysAsn: 3.321 ± 0.0
4.269LysPro: 4.269 ± 0.0
4.507LysGln: 4.507 ± 0.0
1.423LysArg: 1.423 ± 0.0
4.269LysSer: 4.269 ± 0.0
7.59LysThr: 7.59 ± 0.0
4.032LysVal: 4.032 ± 0.0
0.237LysTrp: 0.237 ± 0.0
3.083LysTyr: 3.083 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
3.321LeuAla: 3.321 ± 0.0
1.66LeuCys: 1.66 ± 0.0
5.218LeuAsp: 5.218 ± 0.0
5.218LeuGlu: 5.218 ± 0.0
2.846LeuPhe: 2.846 ± 0.0
5.218LeuGly: 5.218 ± 0.0
3.558LeuHis: 3.558 ± 0.0
7.827LeuIle: 7.827 ± 0.0
4.981LeuLys: 4.981 ± 0.0
10.674LeuLeu: 10.674 ± 0.0
3.321LeuMet: 3.321 ± 0.0
7.827LeuAsn: 7.827 ± 0.0
4.032LeuPro: 4.032 ± 0.0
2.609LeuGln: 2.609 ± 0.0
4.269LeuArg: 4.269 ± 0.0
7.353LeuSer: 7.353 ± 0.0
8.065LeuThr: 8.065 ± 0.0
4.744LeuVal: 4.744 ± 0.0
0.237LeuTrp: 0.237 ± 0.0
3.321LeuTyr: 3.321 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
0.712MetAla: 0.712 ± 0.0
0.474MetCys: 0.474 ± 0.0
1.423MetAsp: 1.423 ± 0.0
1.423MetGlu: 1.423 ± 0.0
0.474MetPhe: 0.474 ± 0.0
0.474MetGly: 0.474 ± 0.0
0.474MetHis: 0.474 ± 0.0
2.609MetIle: 2.609 ± 0.0
1.186MetLys: 1.186 ± 0.0
1.898MetLeu: 1.898 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.898MetAsn: 1.898 ± 0.0
0.474MetPro: 0.474 ± 0.0
0.949MetGln: 0.949 ± 0.0
1.66MetArg: 1.66 ± 0.0
2.609MetSer: 2.609 ± 0.0
1.66MetThr: 1.66 ± 0.0
2.609MetVal: 2.609 ± 0.0
0.474MetTrp: 0.474 ± 0.0
1.186MetTyr: 1.186 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.032AsnAla: 4.032 ± 0.0
2.609AsnCys: 2.609 ± 0.0
4.507AsnAsp: 4.507 ± 0.0
4.269AsnGlu: 4.269 ± 0.0
2.135AsnPhe: 2.135 ± 0.0
2.135AsnGly: 2.135 ± 0.0
2.372AsnHis: 2.372 ± 0.0
4.507AsnIle: 4.507 ± 0.0
5.93AsnLys: 5.93 ± 0.0
5.693AsnLeu: 5.693 ± 0.0
2.846AsnMet: 2.846 ± 0.0
3.795AsnAsn: 3.795 ± 0.0
0.949AsnPro: 0.949 ± 0.0
1.186AsnGln: 1.186 ± 0.0
3.083AsnArg: 3.083 ± 0.0
7.353AsnSer: 7.353 ± 0.0
4.269AsnThr: 4.269 ± 0.0
5.693AsnVal: 5.693 ± 0.0
0.712AsnTrp: 0.712 ± 0.0
2.372AsnTyr: 2.372 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.372ProAla: 2.372 ± 0.0
0.474ProCys: 0.474 ± 0.0
2.372ProAsp: 2.372 ± 0.0
3.558ProGlu: 3.558 ± 0.0
1.423ProPhe: 1.423 ± 0.0
1.66ProGly: 1.66 ± 0.0
0.237ProHis: 0.237 ± 0.0
3.795ProIle: 3.795 ± 0.0
2.609ProLys: 2.609 ± 0.0
1.898ProLeu: 1.898 ± 0.0
0.712ProMet: 0.712 ± 0.0
2.609ProAsn: 2.609 ± 0.0
0.712ProPro: 0.712 ± 0.0
1.898ProGln: 1.898 ± 0.0
0.474ProArg: 0.474 ± 0.0
1.423ProSer: 1.423 ± 0.0
2.135ProThr: 2.135 ± 0.0
2.609ProVal: 2.609 ± 0.0
0.949ProTrp: 0.949 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.186GlnAla: 1.186 ± 0.0
0.949GlnCys: 0.949 ± 0.0
1.898GlnAsp: 1.898 ± 0.0
1.66GlnGlu: 1.66 ± 0.0
1.423GlnPhe: 1.423 ± 0.0
0.949GlnGly: 0.949 ± 0.0
1.423GlnHis: 1.423 ± 0.0
3.083GlnIle: 3.083 ± 0.0
1.898GlnLys: 1.898 ± 0.0
3.795GlnLeu: 3.795 ± 0.0
0.237GlnMet: 0.237 ± 0.0
1.66GlnAsn: 1.66 ± 0.0
0.949GlnPro: 0.949 ± 0.0
0.949GlnGln: 0.949 ± 0.0
0.949GlnArg: 0.949 ± 0.0
2.135GlnSer: 2.135 ± 0.0
1.898GlnThr: 1.898 ± 0.0
1.66GlnVal: 1.66 ± 0.0
0.237GlnTrp: 0.237 ± 0.0
0.949GlnTyr: 0.949 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.898ArgAla: 1.898 ± 0.0
0.712ArgCys: 0.712 ± 0.0
2.372ArgAsp: 2.372 ± 0.0
1.423ArgGlu: 1.423 ± 0.0
3.083ArgPhe: 3.083 ± 0.0
2.609ArgGly: 2.609 ± 0.0
1.423ArgHis: 1.423 ± 0.0
2.372ArgIle: 2.372 ± 0.0
2.846ArgLys: 2.846 ± 0.0
4.269ArgLeu: 4.269 ± 0.0
0.712ArgMet: 0.712 ± 0.0
3.321ArgAsn: 3.321 ± 0.0
0.474ArgPro: 0.474 ± 0.0
0.712ArgGln: 0.712 ± 0.0
2.609ArgArg: 2.609 ± 0.0
2.135ArgSer: 2.135 ± 0.0
2.846ArgThr: 2.846 ± 0.0
1.66ArgVal: 1.66 ± 0.0
0.474ArgTrp: 0.474 ± 0.0
0.949ArgTyr: 0.949 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.558SerAla: 3.558 ± 0.0
0.949SerCys: 0.949 ± 0.0
3.083SerAsp: 3.083 ± 0.0
3.558SerGlu: 3.558 ± 0.0
1.898SerPhe: 1.898 ± 0.0
2.372SerGly: 2.372 ± 0.0
1.898SerHis: 1.898 ± 0.0
4.032SerIle: 4.032 ± 0.0
5.218SerLys: 5.218 ± 0.0
6.404SerLeu: 6.404 ± 0.0
1.898SerMet: 1.898 ± 0.0
4.744SerAsn: 4.744 ± 0.0
1.66SerPro: 1.66 ± 0.0
2.846SerGln: 2.846 ± 0.0
3.321SerArg: 3.321 ± 0.0
3.083SerSer: 3.083 ± 0.0
4.981SerThr: 4.981 ± 0.0
6.167SerVal: 6.167 ± 0.0
0.712SerTrp: 0.712 ± 0.0
2.609SerTyr: 2.609 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
2.372ThrAla: 2.372 ± 0.0
1.423ThrCys: 1.423 ± 0.0
4.744ThrAsp: 4.744 ± 0.0
5.218ThrGlu: 5.218 ± 0.0
3.795ThrPhe: 3.795 ± 0.0
4.981ThrGly: 4.981 ± 0.0
3.558ThrHis: 3.558 ± 0.0
5.455ThrIle: 5.455 ± 0.0
4.981ThrLys: 4.981 ± 0.0
6.879ThrLeu: 6.879 ± 0.0
1.898ThrMet: 1.898 ± 0.0
5.218ThrAsn: 5.218 ± 0.0
4.269ThrPro: 4.269 ± 0.0
0.712ThrGln: 0.712 ± 0.0
3.558ThrArg: 3.558 ± 0.0
4.507ThrSer: 4.507 ± 0.0
5.218ThrThr: 5.218 ± 0.0
5.93ThrVal: 5.93 ± 0.0
0.237ThrTrp: 0.237 ± 0.0
2.609ThrTyr: 2.609 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.558ValAla: 3.558 ± 0.0
0.949ValCys: 0.949 ± 0.0
4.032ValAsp: 4.032 ± 0.0
3.083ValGlu: 3.083 ± 0.0
2.135ValPhe: 2.135 ± 0.0
3.321ValGly: 3.321 ± 0.0
2.846ValHis: 2.846 ± 0.0
3.795ValIle: 3.795 ± 0.0
7.116ValLys: 7.116 ± 0.0
6.404ValLeu: 6.404 ± 0.0
0.949ValMet: 0.949 ± 0.0
3.558ValAsn: 3.558 ± 0.0
3.795ValPro: 3.795 ± 0.0
0.949ValGln: 0.949 ± 0.0
3.558ValArg: 3.558 ± 0.0
5.455ValSer: 5.455 ± 0.0
6.167ValThr: 6.167 ± 0.0
8.065ValVal: 8.065 ± 0.0
0.949ValTrp: 0.949 ± 0.0
1.898ValTyr: 1.898 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.237TrpAla: 0.237 ± 0.0
0.237TrpCys: 0.237 ± 0.0
0.474TrpAsp: 0.474 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.186TrpPhe: 1.186 ± 0.0
0.237TrpGly: 0.237 ± 0.0
0.237TrpHis: 0.237 ± 0.0
0.712TrpIle: 0.712 ± 0.0
0.712TrpLys: 0.712 ± 0.0
0.712TrpLeu: 0.712 ± 0.0
0.237TrpMet: 0.237 ± 0.0
0.949TrpAsn: 0.949 ± 0.0
0.474TrpPro: 0.474 ± 0.0
0.237TrpGln: 0.237 ± 0.0
0.237TrpArg: 0.237 ± 0.0
0.474TrpSer: 0.474 ± 0.0
1.186TrpThr: 1.186 ± 0.0
0.474TrpVal: 0.474 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.898TyrAla: 1.898 ± 0.0
0.712TyrCys: 0.712 ± 0.0
3.083TyrAsp: 3.083 ± 0.0
2.609TyrGlu: 2.609 ± 0.0
1.186TyrPhe: 1.186 ± 0.0
2.609TyrGly: 2.609 ± 0.0
1.186TyrHis: 1.186 ± 0.0
1.423TyrIle: 1.423 ± 0.0
2.135TyrLys: 2.135 ± 0.0
2.846TyrLeu: 2.846 ± 0.0
0.474TyrMet: 0.474 ± 0.0
4.744TyrAsn: 4.744 ± 0.0
0.949TyrPro: 0.949 ± 0.0
1.898TyrGln: 1.898 ± 0.0
1.186TyrArg: 1.186 ± 0.0
1.66TyrSer: 1.66 ± 0.0
1.898TyrThr: 1.898 ± 0.0
1.66TyrVal: 1.66 ± 0.0
0.237TyrTrp: 0.237 ± 0.0
0.712TyrTyr: 0.712 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (4217 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski