Amino acid dipepetide frequency for Helianthus annuus alphaendornavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.672AlaAla: 2.672 ± 0.0
1.233AlaCys: 1.233 ± 0.0
2.466AlaAsp: 2.466 ± 0.0
3.494AlaGlu: 3.494 ± 0.0
0.617AlaPhe: 0.617 ± 0.0
2.877AlaGly: 2.877 ± 0.0
1.233AlaHis: 1.233 ± 0.0
4.521AlaIle: 4.521 ± 0.0
3.494AlaLys: 3.494 ± 0.0
4.932AlaLeu: 4.932 ± 0.0
1.233AlaMet: 1.233 ± 0.0
2.466AlaAsn: 2.466 ± 0.0
1.439AlaPro: 1.439 ± 0.0
2.466AlaGln: 2.466 ± 0.0
3.494AlaArg: 3.494 ± 0.0
2.466AlaSer: 2.466 ± 0.0
3.494AlaThr: 3.494 ± 0.0
2.466AlaVal: 2.466 ± 0.0
0.822AlaTrp: 0.822 ± 0.0
2.055AlaTyr: 2.055 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.028CysAla: 1.028 ± 0.0
1.233CysCys: 1.233 ± 0.0
0.206CysAsp: 0.206 ± 0.0
1.028CysGlu: 1.028 ± 0.0
0.206CysPhe: 0.206 ± 0.0
2.261CysGly: 2.261 ± 0.0
0.617CysHis: 0.617 ± 0.0
0.411CysIle: 0.411 ± 0.0
0.822CysLys: 0.822 ± 0.0
2.261CysLeu: 2.261 ± 0.0
0.206CysMet: 0.206 ± 0.0
0.617CysAsn: 0.617 ± 0.0
0.822CysPro: 0.822 ± 0.0
1.233CysGln: 1.233 ± 0.0
1.028CysArg: 1.028 ± 0.0
1.028CysSer: 1.028 ± 0.0
1.028CysThr: 1.028 ± 0.0
1.85CysVal: 1.85 ± 0.0
0.206CysTrp: 0.206 ± 0.0
0.822CysTyr: 0.822 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.466AspAla: 2.466 ± 0.0
1.028AspCys: 1.028 ± 0.0
2.877AspAsp: 2.877 ± 0.0
2.672AspGlu: 2.672 ± 0.0
1.644AspPhe: 1.644 ± 0.0
1.439AspGly: 1.439 ± 0.0
1.439AspHis: 1.439 ± 0.0
3.494AspIle: 3.494 ± 0.0
4.521AspLys: 4.521 ± 0.0
6.576AspLeu: 6.576 ± 0.0
2.672AspMet: 2.672 ± 0.0
2.466AspAsn: 2.466 ± 0.0
2.055AspPro: 2.055 ± 0.0
3.288AspGln: 3.288 ± 0.0
2.877AspArg: 2.877 ± 0.0
3.905AspSer: 3.905 ± 0.0
2.877AspThr: 2.877 ± 0.0
3.905AspVal: 3.905 ± 0.0
0.411AspTrp: 0.411 ± 0.0
3.905AspTyr: 3.905 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.494GluAla: 3.494 ± 0.0
1.233GluCys: 1.233 ± 0.0
4.316GluAsp: 4.316 ± 0.0
5.343GluGlu: 5.343 ± 0.0
2.672GluPhe: 2.672 ± 0.0
5.343GluGly: 5.343 ± 0.0
2.672GluHis: 2.672 ± 0.0
3.905GluIle: 3.905 ± 0.0
4.932GluLys: 4.932 ± 0.0
4.316GluLeu: 4.316 ± 0.0
1.644GluMet: 1.644 ± 0.0
2.055GluAsn: 2.055 ± 0.0
3.083GluPro: 3.083 ± 0.0
3.083GluGln: 3.083 ± 0.0
2.261GluArg: 2.261 ± 0.0
4.521GluSer: 4.521 ± 0.0
6.165GluThr: 6.165 ± 0.0
5.343GluVal: 5.343 ± 0.0
0.822GluTrp: 0.822 ± 0.0
1.85GluTyr: 1.85 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.822PheAla: 0.822 ± 0.0
0.0PheCys: 0.0 ± 0.0
3.083PheAsp: 3.083 ± 0.0
2.672PheGlu: 2.672 ± 0.0
0.206PhePhe: 0.206 ± 0.0
1.233PheGly: 1.233 ± 0.0
0.617PheHis: 0.617 ± 0.0
0.822PheIle: 0.822 ± 0.0
2.261PheLys: 2.261 ± 0.0
1.233PheLeu: 1.233 ± 0.0
0.617PheMet: 0.617 ± 0.0
1.85PheAsn: 1.85 ± 0.0
0.822PhePro: 0.822 ± 0.0
1.233PheGln: 1.233 ± 0.0
1.644PheArg: 1.644 ± 0.0
1.85PheSer: 1.85 ± 0.0
1.233PheThr: 1.233 ± 0.0
1.233PheVal: 1.233 ± 0.0
0.206PheTrp: 0.206 ± 0.0
0.411PheTyr: 0.411 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.85GlyAla: 1.85 ± 0.0
1.028GlyCys: 1.028 ± 0.0
2.261GlyAsp: 2.261 ± 0.0
4.521GlyGlu: 4.521 ± 0.0
1.233GlyPhe: 1.233 ± 0.0
2.672GlyGly: 2.672 ± 0.0
1.439GlyHis: 1.439 ± 0.0
3.494GlyIle: 3.494 ± 0.0
6.165GlyLys: 6.165 ± 0.0
7.809GlyLeu: 7.809 ± 0.0
2.055GlyMet: 2.055 ± 0.0
2.466GlyAsn: 2.466 ± 0.0
1.85GlyPro: 1.85 ± 0.0
2.672GlyGln: 2.672 ± 0.0
2.261GlyArg: 2.261 ± 0.0
3.699GlySer: 3.699 ± 0.0
3.083GlyThr: 3.083 ± 0.0
2.466GlyVal: 2.466 ± 0.0
0.617GlyTrp: 0.617 ± 0.0
1.644GlyTyr: 1.644 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.233HisAla: 1.233 ± 0.0
0.617HisCys: 0.617 ± 0.0
1.85HisAsp: 1.85 ± 0.0
1.439HisGlu: 1.439 ± 0.0
1.028HisPhe: 1.028 ± 0.0
1.85HisGly: 1.85 ± 0.0
0.617HisHis: 0.617 ± 0.0
1.028HisIle: 1.028 ± 0.0
2.877HisLys: 2.877 ± 0.0
2.055HisLeu: 2.055 ± 0.0
0.617HisMet: 0.617 ± 0.0
1.85HisAsn: 1.85 ± 0.0
0.822HisPro: 0.822 ± 0.0
0.822HisGln: 0.822 ± 0.0
1.439HisArg: 1.439 ± 0.0
1.644HisSer: 1.644 ± 0.0
2.261HisThr: 2.261 ± 0.0
2.261HisVal: 2.261 ± 0.0
0.411HisTrp: 0.411 ± 0.0
1.644HisTyr: 1.644 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.672IleAla: 2.672 ± 0.0
1.028IleCys: 1.028 ± 0.0
4.316IleAsp: 4.316 ± 0.0
4.316IleGlu: 4.316 ± 0.0
2.055IlePhe: 2.055 ± 0.0
4.11IleGly: 4.11 ± 0.0
2.672IleHis: 2.672 ± 0.0
4.727IleIle: 4.727 ± 0.0
4.727IleLys: 4.727 ± 0.0
5.96IleLeu: 5.96 ± 0.0
1.028IleMet: 1.028 ± 0.0
4.727IleAsn: 4.727 ± 0.0
2.261IlePro: 2.261 ± 0.0
3.288IleGln: 3.288 ± 0.0
3.288IleArg: 3.288 ± 0.0
4.11IleSer: 4.11 ± 0.0
5.549IleThr: 5.549 ± 0.0
5.138IleVal: 5.138 ± 0.0
0.617IleTrp: 0.617 ± 0.0
1.439IleTyr: 1.439 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
5.138LysAla: 5.138 ± 0.0
1.233LysCys: 1.233 ± 0.0
3.905LysAsp: 3.905 ± 0.0
4.932LysGlu: 4.932 ± 0.0
1.439LysPhe: 1.439 ± 0.0
2.672LysGly: 2.672 ± 0.0
2.055LysHis: 2.055 ± 0.0
4.521LysIle: 4.521 ± 0.0
5.343LysLys: 5.343 ± 0.0
8.426LysLeu: 8.426 ± 0.0
1.233LysMet: 1.233 ± 0.0
3.288LysAsn: 3.288 ± 0.0
5.754LysPro: 5.754 ± 0.0
2.055LysGln: 2.055 ± 0.0
3.288LysArg: 3.288 ± 0.0
4.521LysSer: 4.521 ± 0.0
6.987LysThr: 6.987 ± 0.0
6.371LysVal: 6.371 ± 0.0
0.822LysTrp: 0.822 ± 0.0
3.288LysTyr: 3.288 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
7.193LeuAla: 7.193 ± 0.0
1.233LeuCys: 1.233 ± 0.0
5.343LeuAsp: 5.343 ± 0.0
8.22LeuGlu: 8.22 ± 0.0
0.617LeuPhe: 0.617 ± 0.0
4.932LeuGly: 4.932 ± 0.0
1.644LeuHis: 1.644 ± 0.0
6.782LeuIle: 6.782 ± 0.0
6.576LeuLys: 6.576 ± 0.0
9.042LeuLeu: 9.042 ± 0.0
3.083LeuMet: 3.083 ± 0.0
5.138LeuAsn: 5.138 ± 0.0
2.672LeuPro: 2.672 ± 0.0
2.672LeuGln: 2.672 ± 0.0
4.932LeuArg: 4.932 ± 0.0
7.398LeuSer: 7.398 ± 0.0
8.426LeuThr: 8.426 ± 0.0
6.987LeuVal: 6.987 ± 0.0
0.617LeuTrp: 0.617 ± 0.0
1.85LeuTyr: 1.85 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.261MetAla: 2.261 ± 0.0
0.411MetCys: 0.411 ± 0.0
1.439MetAsp: 1.439 ± 0.0
3.288MetGlu: 3.288 ± 0.0
0.411MetPhe: 0.411 ± 0.0
3.083MetGly: 3.083 ± 0.0
0.617MetHis: 0.617 ± 0.0
2.672MetIle: 2.672 ± 0.0
3.288MetLys: 3.288 ± 0.0
2.055MetLeu: 2.055 ± 0.0
1.439MetMet: 1.439 ± 0.0
1.028MetAsn: 1.028 ± 0.0
0.411MetPro: 0.411 ± 0.0
0.411MetGln: 0.411 ± 0.0
1.233MetArg: 1.233 ± 0.0
1.644MetSer: 1.644 ± 0.0
2.877MetThr: 2.877 ± 0.0
3.083MetVal: 3.083 ± 0.0
0.206MetTrp: 0.206 ± 0.0
0.617MetTyr: 0.617 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.055AsnAla: 2.055 ± 0.0
0.617AsnCys: 0.617 ± 0.0
1.439AsnAsp: 1.439 ± 0.0
2.261AsnGlu: 2.261 ± 0.0
1.233AsnPhe: 1.233 ± 0.0
1.439AsnGly: 1.439 ± 0.0
1.644AsnHis: 1.644 ± 0.0
2.466AsnIle: 2.466 ± 0.0
4.11AsnLys: 4.11 ± 0.0
5.343AsnLeu: 5.343 ± 0.0
1.644AsnMet: 1.644 ± 0.0
1.644AsnAsn: 1.644 ± 0.0
3.288AsnPro: 3.288 ± 0.0
1.644AsnGln: 1.644 ± 0.0
2.261AsnArg: 2.261 ± 0.0
2.466AsnSer: 2.466 ± 0.0
4.521AsnThr: 4.521 ± 0.0
3.494AsnVal: 3.494 ± 0.0
1.028AsnTrp: 1.028 ± 0.0
1.233AsnTyr: 1.233 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.233ProAla: 1.233 ± 0.0
1.233ProCys: 1.233 ± 0.0
1.85ProAsp: 1.85 ± 0.0
3.494ProGlu: 3.494 ± 0.0
0.822ProPhe: 0.822 ± 0.0
2.466ProGly: 2.466 ± 0.0
1.85ProHis: 1.85 ± 0.0
2.261ProIle: 2.261 ± 0.0
3.083ProLys: 3.083 ± 0.0
3.494ProLeu: 3.494 ± 0.0
0.617ProMet: 0.617 ± 0.0
2.261ProAsn: 2.261 ± 0.0
1.439ProPro: 1.439 ± 0.0
2.877ProGln: 2.877 ± 0.0
1.644ProArg: 1.644 ± 0.0
2.672ProSer: 2.672 ± 0.0
4.521ProThr: 4.521 ± 0.0
2.261ProVal: 2.261 ± 0.0
0.617ProTrp: 0.617 ± 0.0
1.028ProTyr: 1.028 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.261GlnAla: 2.261 ± 0.0
0.822GlnCys: 0.822 ± 0.0
2.672GlnAsp: 2.672 ± 0.0
3.494GlnGlu: 3.494 ± 0.0
2.261GlnPhe: 2.261 ± 0.0
3.494GlnGly: 3.494 ± 0.0
1.028GlnHis: 1.028 ± 0.0
1.85GlnIle: 1.85 ± 0.0
2.055GlnLys: 2.055 ± 0.0
4.316GlnLeu: 4.316 ± 0.0
2.055GlnMet: 2.055 ± 0.0
0.411GlnAsn: 0.411 ± 0.0
2.261GlnPro: 2.261 ± 0.0
1.644GlnGln: 1.644 ± 0.0
1.644GlnArg: 1.644 ± 0.0
1.028GlnSer: 1.028 ± 0.0
3.699GlnThr: 3.699 ± 0.0
2.261GlnVal: 2.261 ± 0.0
0.206GlnTrp: 0.206 ± 0.0
1.028GlnTyr: 1.028 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.85ArgAla: 1.85 ± 0.0
0.617ArgCys: 0.617 ± 0.0
3.288ArgAsp: 3.288 ± 0.0
1.85ArgGlu: 1.85 ± 0.0
1.028ArgPhe: 1.028 ± 0.0
2.261ArgGly: 2.261 ± 0.0
0.411ArgHis: 0.411 ± 0.0
4.11ArgIle: 4.11 ± 0.0
4.521ArgLys: 4.521 ± 0.0
6.371ArgLeu: 6.371 ± 0.0
1.233ArgMet: 1.233 ± 0.0
2.672ArgAsn: 2.672 ± 0.0
4.11ArgPro: 4.11 ± 0.0
2.466ArgGln: 2.466 ± 0.0
4.11ArgArg: 4.11 ± 0.0
2.466ArgSer: 2.466 ± 0.0
2.877ArgThr: 2.877 ± 0.0
2.877ArgVal: 2.877 ± 0.0
0.617ArgTrp: 0.617 ± 0.0
1.028ArgTyr: 1.028 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
2.466SerAla: 2.466 ± 0.0
0.617SerCys: 0.617 ± 0.0
2.261SerAsp: 2.261 ± 0.0
3.494SerGlu: 3.494 ± 0.0
0.411SerPhe: 0.411 ± 0.0
2.466SerGly: 2.466 ± 0.0
2.055SerHis: 2.055 ± 0.0
3.699SerIle: 3.699 ± 0.0
6.165SerLys: 6.165 ± 0.0
5.138SerLeu: 5.138 ± 0.0
1.85SerMet: 1.85 ± 0.0
2.055SerAsn: 2.055 ± 0.0
2.055SerPro: 2.055 ± 0.0
1.439SerGln: 1.439 ± 0.0
4.316SerArg: 4.316 ± 0.0
3.905SerSer: 3.905 ± 0.0
4.316SerThr: 4.316 ± 0.0
6.165SerVal: 6.165 ± 0.0
0.411SerTrp: 0.411 ± 0.0
2.877SerTyr: 2.877 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
2.261ThrAla: 2.261 ± 0.0
2.672ThrCys: 2.672 ± 0.0
3.699ThrAsp: 3.699 ± 0.0
5.343ThrGlu: 5.343 ± 0.0
2.877ThrPhe: 2.877 ± 0.0
4.316ThrGly: 4.316 ± 0.0
1.85ThrHis: 1.85 ± 0.0
8.015ThrIle: 8.015 ± 0.0
4.727ThrLys: 4.727 ± 0.0
6.576ThrLeu: 6.576 ± 0.0
4.521ThrMet: 4.521 ± 0.0
3.494ThrAsn: 3.494 ± 0.0
2.055ThrPro: 2.055 ± 0.0
3.288ThrGln: 3.288 ± 0.0
5.138ThrArg: 5.138 ± 0.0
3.083ThrSer: 3.083 ± 0.0
6.165ThrThr: 6.165 ± 0.0
4.316ThrVal: 4.316 ± 0.0
0.617ThrTrp: 0.617 ± 0.0
2.055ThrTyr: 2.055 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.521ValAla: 4.521 ± 0.0
1.028ValCys: 1.028 ± 0.0
5.549ValAsp: 5.549 ± 0.0
4.316ValGlu: 4.316 ± 0.0
1.644ValPhe: 1.644 ± 0.0
4.521ValGly: 4.521 ± 0.0
2.055ValHis: 2.055 ± 0.0
7.193ValIle: 7.193 ± 0.0
4.727ValLys: 4.727 ± 0.0
5.754ValLeu: 5.754 ± 0.0
3.494ValMet: 3.494 ± 0.0
2.877ValAsn: 2.877 ± 0.0
3.494ValPro: 3.494 ± 0.0
1.233ValGln: 1.233 ± 0.0
1.439ValArg: 1.439 ± 0.0
3.083ValSer: 3.083 ± 0.0
5.343ValThr: 5.343 ± 0.0
3.083ValVal: 3.083 ± 0.0
0.617ValTrp: 0.617 ± 0.0
2.261ValTyr: 2.261 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.822TrpAla: 0.822 ± 0.0
0.411TrpCys: 0.411 ± 0.0
1.644TrpAsp: 1.644 ± 0.0
0.206TrpGlu: 0.206 ± 0.0
0.822TrpPhe: 0.822 ± 0.0
0.206TrpGly: 0.206 ± 0.0
0.411TrpHis: 0.411 ± 0.0
0.822TrpIle: 0.822 ± 0.0
0.617TrpLys: 0.617 ± 0.0
0.411TrpLeu: 0.411 ± 0.0
0.206TrpMet: 0.206 ± 0.0
0.411TrpAsn: 0.411 ± 0.0
0.617TrpPro: 0.617 ± 0.0
0.617TrpGln: 0.617 ± 0.0
0.206TrpArg: 0.206 ± 0.0
0.617TrpSer: 0.617 ± 0.0
0.411TrpThr: 0.411 ± 0.0
0.617TrpVal: 0.617 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.411TrpTyr: 0.411 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.85TyrAla: 1.85 ± 0.0
0.617TyrCys: 0.617 ± 0.0
2.055TyrAsp: 2.055 ± 0.0
2.672TyrGlu: 2.672 ± 0.0
0.822TyrPhe: 0.822 ± 0.0
1.644TyrGly: 1.644 ± 0.0
1.439TyrHis: 1.439 ± 0.0
1.233TyrIle: 1.233 ± 0.0
2.261TyrLys: 2.261 ± 0.0
3.288TyrLeu: 3.288 ± 0.0
0.822TyrMet: 0.822 ± 0.0
2.055TyrAsn: 2.055 ± 0.0
0.411TyrPro: 0.411 ± 0.0
2.055TyrGln: 2.055 ± 0.0
2.261TyrArg: 2.261 ± 0.0
2.055TyrSer: 2.055 ± 0.0
1.233TyrThr: 1.233 ± 0.0
2.055TyrVal: 2.055 ± 0.0
0.617TyrTrp: 0.617 ± 0.0
0.617TyrTyr: 0.617 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (4867 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski