Amino acid dipepetide frequency for Shayang spider virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.261AlaAla: 3.261 ± 0.0
1.134AlaCys: 1.134 ± 0.0
3.12AlaAsp: 3.12 ± 0.0
2.836AlaGlu: 2.836 ± 0.0
2.978AlaPhe: 2.978 ± 0.0
2.694AlaGly: 2.694 ± 0.0
0.993AlaHis: 0.993 ± 0.0
4.68AlaIle: 4.68 ± 0.0
3.545AlaLys: 3.545 ± 0.0
4.538AlaLeu: 4.538 ± 0.0
0.851AlaMet: 0.851 ± 0.0
1.985AlaAsn: 1.985 ± 0.0
2.411AlaPro: 2.411 ± 0.0
1.418AlaGln: 1.418 ± 0.0
2.694AlaArg: 2.694 ± 0.0
3.971AlaSer: 3.971 ± 0.0
3.12AlaThr: 3.12 ± 0.0
3.829AlaVal: 3.829 ± 0.0
0.284AlaTrp: 0.284 ± 0.0
1.56AlaTyr: 1.56 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.56CysAla: 1.56 ± 0.0
0.567CysCys: 0.567 ± 0.0
1.418CysAsp: 1.418 ± 0.0
2.552CysGlu: 2.552 ± 0.0
0.993CysPhe: 0.993 ± 0.0
0.993CysGly: 0.993 ± 0.0
0.567CysHis: 0.567 ± 0.0
1.56CysIle: 1.56 ± 0.0
2.127CysLys: 2.127 ± 0.0
2.269CysLeu: 2.269 ± 0.0
0.284CysMet: 0.284 ± 0.0
1.702CysAsn: 1.702 ± 0.0
0.851CysPro: 0.851 ± 0.0
0.851CysGln: 0.851 ± 0.0
0.567CysArg: 0.567 ± 0.0
1.843CysSer: 1.843 ± 0.0
0.851CysThr: 0.851 ± 0.0
1.985CysVal: 1.985 ± 0.0
0.142CysTrp: 0.142 ± 0.0
0.993CysTyr: 0.993 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.411AspAla: 2.411 ± 0.0
2.552AspCys: 2.552 ± 0.0
3.403AspAsp: 3.403 ± 0.0
4.821AspGlu: 4.821 ± 0.0
1.985AspPhe: 1.985 ± 0.0
2.552AspGly: 2.552 ± 0.0
0.993AspHis: 0.993 ± 0.0
3.971AspIle: 3.971 ± 0.0
5.389AspLys: 5.389 ± 0.0
4.254AspLeu: 4.254 ± 0.0
1.702AspMet: 1.702 ± 0.0
2.978AspAsn: 2.978 ± 0.0
2.411AspPro: 2.411 ± 0.0
1.276AspGln: 1.276 ± 0.0
3.403AspArg: 3.403 ± 0.0
2.978AspSer: 2.978 ± 0.0
2.836AspThr: 2.836 ± 0.0
2.552AspVal: 2.552 ± 0.0
0.851AspTrp: 0.851 ± 0.0
3.12AspTyr: 3.12 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.694GluAla: 2.694 ± 0.0
1.418GluCys: 1.418 ± 0.0
3.261GluAsp: 3.261 ± 0.0
4.821GluGlu: 4.821 ± 0.0
3.261GluPhe: 3.261 ± 0.0
2.978GluGly: 2.978 ± 0.0
2.127GluHis: 2.127 ± 0.0
4.821GluIle: 4.821 ± 0.0
4.538GluLys: 4.538 ± 0.0
4.821GluLeu: 4.821 ± 0.0
1.56GluMet: 1.56 ± 0.0
3.261GluAsn: 3.261 ± 0.0
1.276GluPro: 1.276 ± 0.0
2.978GluGln: 2.978 ± 0.0
2.694GluArg: 2.694 ± 0.0
4.112GluSer: 4.112 ± 0.0
4.396GluThr: 4.396 ± 0.0
4.254GluVal: 4.254 ± 0.0
0.993GluTrp: 0.993 ± 0.0
2.552GluTyr: 2.552 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.843PheAla: 1.843 ± 0.0
0.851PheCys: 0.851 ± 0.0
2.411PheAsp: 2.411 ± 0.0
2.836PheGlu: 2.836 ± 0.0
1.276PhePhe: 1.276 ± 0.0
3.687PheGly: 3.687 ± 0.0
0.851PheHis: 0.851 ± 0.0
3.687PheIle: 3.687 ± 0.0
4.112PheLys: 4.112 ± 0.0
3.545PheLeu: 3.545 ± 0.0
1.985PheMet: 1.985 ± 0.0
2.694PheAsn: 2.694 ± 0.0
1.418PhePro: 1.418 ± 0.0
0.851PheGln: 0.851 ± 0.0
2.127PheArg: 2.127 ± 0.0
3.12PheSer: 3.12 ± 0.0
3.12PheThr: 3.12 ± 0.0
4.821PheVal: 4.821 ± 0.0
1.134PheTrp: 1.134 ± 0.0
1.702PheTyr: 1.702 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.261GlyAla: 3.261 ± 0.0
0.851GlyCys: 0.851 ± 0.0
3.261GlyAsp: 3.261 ± 0.0
1.985GlyGlu: 1.985 ± 0.0
2.978GlyPhe: 2.978 ± 0.0
2.694GlyGly: 2.694 ± 0.0
1.276GlyHis: 1.276 ± 0.0
3.12GlyIle: 3.12 ± 0.0
4.821GlyLys: 4.821 ± 0.0
4.396GlyLeu: 4.396 ± 0.0
1.702GlyMet: 1.702 ± 0.0
3.261GlyAsn: 3.261 ± 0.0
1.418GlyPro: 1.418 ± 0.0
1.56GlyGln: 1.56 ± 0.0
3.545GlyArg: 3.545 ± 0.0
3.971GlySer: 3.971 ± 0.0
3.261GlyThr: 3.261 ± 0.0
4.112GlyVal: 4.112 ± 0.0
1.276GlyTrp: 1.276 ± 0.0
1.985GlyTyr: 1.985 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.851HisAla: 0.851 ± 0.0
0.851HisCys: 0.851 ± 0.0
1.276HisAsp: 1.276 ± 0.0
1.134HisGlu: 1.134 ± 0.0
1.702HisPhe: 1.702 ± 0.0
0.851HisGly: 0.851 ± 0.0
0.284HisHis: 0.284 ± 0.0
1.276HisIle: 1.276 ± 0.0
1.702HisLys: 1.702 ± 0.0
1.56HisLeu: 1.56 ± 0.0
0.425HisMet: 0.425 ± 0.0
0.425HisAsn: 0.425 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.993HisGln: 0.993 ± 0.0
0.567HisArg: 0.567 ± 0.0
1.134HisSer: 1.134 ± 0.0
1.418HisThr: 1.418 ± 0.0
1.418HisVal: 1.418 ± 0.0
0.284HisTrp: 0.284 ± 0.0
0.993HisTyr: 0.993 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.254IleAla: 4.254 ± 0.0
1.702IleCys: 1.702 ± 0.0
5.53IleAsp: 5.53 ± 0.0
6.381IleGlu: 6.381 ± 0.0
2.269IlePhe: 2.269 ± 0.0
3.687IleGly: 3.687 ± 0.0
0.993IleHis: 0.993 ± 0.0
4.68IleIle: 4.68 ± 0.0
6.381IleLys: 6.381 ± 0.0
3.545IleLeu: 3.545 ± 0.0
1.985IleMet: 1.985 ± 0.0
5.672IleAsn: 5.672 ± 0.0
3.261IlePro: 3.261 ± 0.0
2.411IleGln: 2.411 ± 0.0
4.112IleArg: 4.112 ± 0.0
5.53IleSer: 5.53 ± 0.0
4.821IleThr: 4.821 ± 0.0
4.821IleVal: 4.821 ± 0.0
1.418IleTrp: 1.418 ± 0.0
1.702IleTyr: 1.702 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.12LysAla: 3.12 ± 0.0
1.843LysCys: 1.843 ± 0.0
3.971LysAsp: 3.971 ± 0.0
3.545LysGlu: 3.545 ± 0.0
6.239LysPhe: 6.239 ± 0.0
3.687LysGly: 3.687 ± 0.0
0.709LysHis: 0.709 ± 0.0
6.948LysIle: 6.948 ± 0.0
6.523LysLys: 6.523 ± 0.0
7.232LysLeu: 7.232 ± 0.0
2.127LysMet: 2.127 ± 0.0
5.105LysAsn: 5.105 ± 0.0
3.261LysPro: 3.261 ± 0.0
2.694LysGln: 2.694 ± 0.0
2.552LysArg: 2.552 ± 0.0
5.105LysSer: 5.105 ± 0.0
3.687LysThr: 3.687 ± 0.0
5.53LysVal: 5.53 ± 0.0
0.851LysTrp: 0.851 ± 0.0
4.396LysTyr: 4.396 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
4.396LeuAla: 4.396 ± 0.0
2.127LeuCys: 2.127 ± 0.0
4.396LeuAsp: 4.396 ± 0.0
4.963LeuGlu: 4.963 ± 0.0
4.254LeuPhe: 4.254 ± 0.0
3.687LeuGly: 3.687 ± 0.0
1.843LeuHis: 1.843 ± 0.0
4.821LeuIle: 4.821 ± 0.0
3.971LeuLys: 3.971 ± 0.0
6.665LeuLeu: 6.665 ± 0.0
1.843LeuMet: 1.843 ± 0.0
5.956LeuAsn: 5.956 ± 0.0
4.821LeuPro: 4.821 ± 0.0
3.403LeuGln: 3.403 ± 0.0
3.12LeuArg: 3.12 ± 0.0
6.381LeuSer: 6.381 ± 0.0
5.247LeuThr: 5.247 ± 0.0
4.254LeuVal: 4.254 ± 0.0
0.709LeuTrp: 0.709 ± 0.0
2.836LeuTyr: 2.836 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.418MetAla: 1.418 ± 0.0
0.425MetCys: 0.425 ± 0.0
2.127MetAsp: 2.127 ± 0.0
1.56MetGlu: 1.56 ± 0.0
0.851MetPhe: 0.851 ± 0.0
0.709MetGly: 0.709 ± 0.0
0.284MetHis: 0.284 ± 0.0
2.127MetIle: 2.127 ± 0.0
2.127MetLys: 2.127 ± 0.0
2.269MetLeu: 2.269 ± 0.0
0.851MetMet: 0.851 ± 0.0
1.702MetAsn: 1.702 ± 0.0
1.418MetPro: 1.418 ± 0.0
0.993MetGln: 0.993 ± 0.0
0.284MetArg: 0.284 ± 0.0
2.978MetSer: 2.978 ± 0.0
1.985MetThr: 1.985 ± 0.0
1.276MetVal: 1.276 ± 0.0
0.425MetTrp: 0.425 ± 0.0
0.709MetTyr: 0.709 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.552AsnAla: 2.552 ± 0.0
1.843AsnCys: 1.843 ± 0.0
2.694AsnAsp: 2.694 ± 0.0
2.978AsnGlu: 2.978 ± 0.0
2.694AsnPhe: 2.694 ± 0.0
3.971AsnGly: 3.971 ± 0.0
1.134AsnHis: 1.134 ± 0.0
4.538AsnIle: 4.538 ± 0.0
5.389AsnLys: 5.389 ± 0.0
4.821AsnLeu: 4.821 ± 0.0
1.134AsnMet: 1.134 ± 0.0
3.261AsnAsn: 3.261 ± 0.0
2.552AsnPro: 2.552 ± 0.0
0.993AsnGln: 0.993 ± 0.0
3.687AsnArg: 3.687 ± 0.0
4.538AsnSer: 4.538 ± 0.0
1.985AsnThr: 1.985 ± 0.0
3.971AsnVal: 3.971 ± 0.0
1.134AsnTrp: 1.134 ± 0.0
2.127AsnTyr: 2.127 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.56ProAla: 1.56 ± 0.0
0.567ProCys: 0.567 ± 0.0
1.843ProAsp: 1.843 ± 0.0
2.694ProGlu: 2.694 ± 0.0
2.127ProPhe: 2.127 ± 0.0
1.702ProGly: 1.702 ± 0.0
0.567ProHis: 0.567 ± 0.0
2.552ProIle: 2.552 ± 0.0
3.261ProLys: 3.261 ± 0.0
3.261ProLeu: 3.261 ± 0.0
1.134ProMet: 1.134 ± 0.0
2.694ProAsn: 2.694 ± 0.0
1.276ProPro: 1.276 ± 0.0
0.425ProGln: 0.425 ± 0.0
2.127ProArg: 2.127 ± 0.0
2.694ProSer: 2.694 ± 0.0
2.552ProThr: 2.552 ± 0.0
3.545ProVal: 3.545 ± 0.0
0.425ProTrp: 0.425 ± 0.0
1.134ProTyr: 1.134 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.702GlnAla: 1.702 ± 0.0
1.134GlnCys: 1.134 ± 0.0
0.567GlnAsp: 0.567 ± 0.0
1.56GlnGlu: 1.56 ± 0.0
1.702GlnPhe: 1.702 ± 0.0
0.993GlnGly: 0.993 ± 0.0
0.709GlnHis: 0.709 ± 0.0
2.552GlnIle: 2.552 ± 0.0
2.694GlnLys: 2.694 ± 0.0
2.978GlnLeu: 2.978 ± 0.0
0.993GlnMet: 0.993 ± 0.0
1.702GlnAsn: 1.702 ± 0.0
0.284GlnPro: 0.284 ± 0.0
0.851GlnGln: 0.851 ± 0.0
1.56GlnArg: 1.56 ± 0.0
2.411GlnSer: 2.411 ± 0.0
2.127GlnThr: 2.127 ± 0.0
2.411GlnVal: 2.411 ± 0.0
0.851GlnTrp: 0.851 ± 0.0
1.276GlnTyr: 1.276 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.694ArgAla: 2.694 ± 0.0
0.851ArgCys: 0.851 ± 0.0
1.985ArgAsp: 1.985 ± 0.0
2.127ArgGlu: 2.127 ± 0.0
1.843ArgPhe: 1.843 ± 0.0
3.261ArgGly: 3.261 ± 0.0
0.851ArgHis: 0.851 ± 0.0
2.978ArgIle: 2.978 ± 0.0
2.411ArgLys: 2.411 ± 0.0
4.538ArgLeu: 4.538 ± 0.0
1.418ArgMet: 1.418 ± 0.0
2.552ArgAsn: 2.552 ± 0.0
2.978ArgPro: 2.978 ± 0.0
1.56ArgGln: 1.56 ± 0.0
2.127ArgArg: 2.127 ± 0.0
4.254ArgSer: 4.254 ± 0.0
1.56ArgThr: 1.56 ± 0.0
3.261ArgVal: 3.261 ± 0.0
0.709ArgTrp: 0.709 ± 0.0
1.276ArgTyr: 1.276 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.261SerAla: 3.261 ± 0.0
1.56SerCys: 1.56 ± 0.0
2.836SerAsp: 2.836 ± 0.0
4.821SerGlu: 4.821 ± 0.0
3.687SerPhe: 3.687 ± 0.0
4.963SerGly: 4.963 ± 0.0
1.56SerHis: 1.56 ± 0.0
6.098SerIle: 6.098 ± 0.0
6.665SerLys: 6.665 ± 0.0
5.814SerLeu: 5.814 ± 0.0
2.978SerMet: 2.978 ± 0.0
3.971SerAsn: 3.971 ± 0.0
3.12SerPro: 3.12 ± 0.0
1.985SerGln: 1.985 ± 0.0
2.836SerArg: 2.836 ± 0.0
3.687SerSer: 3.687 ± 0.0
3.687SerThr: 3.687 ± 0.0
4.963SerVal: 4.963 ± 0.0
0.993SerTrp: 0.993 ± 0.0
2.694SerTyr: 2.694 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
4.112ThrAla: 4.112 ± 0.0
0.993ThrCys: 0.993 ± 0.0
2.694ThrAsp: 2.694 ± 0.0
3.12ThrGlu: 3.12 ± 0.0
2.269ThrPhe: 2.269 ± 0.0
4.112ThrGly: 4.112 ± 0.0
0.993ThrHis: 0.993 ± 0.0
5.956ThrIle: 5.956 ± 0.0
3.545ThrLys: 3.545 ± 0.0
4.963ThrLeu: 4.963 ± 0.0
1.702ThrMet: 1.702 ± 0.0
2.269ThrAsn: 2.269 ± 0.0
2.127ThrPro: 2.127 ± 0.0
0.425ThrGln: 0.425 ± 0.0
2.127ThrArg: 2.127 ± 0.0
3.687ThrSer: 3.687 ± 0.0
3.971ThrThr: 3.971 ± 0.0
3.829ThrVal: 3.829 ± 0.0
1.418ThrTrp: 1.418 ± 0.0
2.552ThrTyr: 2.552 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.971ValAla: 3.971 ± 0.0
2.411ValCys: 2.411 ± 0.0
5.53ValAsp: 5.53 ± 0.0
4.396ValGlu: 4.396 ± 0.0
2.269ValPhe: 2.269 ± 0.0
3.545ValGly: 3.545 ± 0.0
1.702ValHis: 1.702 ± 0.0
4.396ValIle: 4.396 ± 0.0
5.672ValLys: 5.672 ± 0.0
4.538ValLeu: 4.538 ± 0.0
0.567ValMet: 0.567 ± 0.0
3.971ValAsn: 3.971 ± 0.0
2.127ValPro: 2.127 ± 0.0
3.687ValGln: 3.687 ± 0.0
3.261ValArg: 3.261 ± 0.0
5.672ValSer: 5.672 ± 0.0
3.261ValThr: 3.261 ± 0.0
3.403ValVal: 3.403 ± 0.0
0.993ValTrp: 0.993 ± 0.0
1.985ValTyr: 1.985 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.993TrpAla: 0.993 ± 0.0
0.284TrpCys: 0.284 ± 0.0
1.276TrpAsp: 1.276 ± 0.0
1.418TrpGlu: 1.418 ± 0.0
0.425TrpPhe: 0.425 ± 0.0
1.134TrpGly: 1.134 ± 0.0
0.142TrpHis: 0.142 ± 0.0
1.134TrpIle: 1.134 ± 0.0
1.418TrpLys: 1.418 ± 0.0
1.134TrpLeu: 1.134 ± 0.0
0.284TrpMet: 0.284 ± 0.0
0.993TrpAsn: 0.993 ± 0.0
0.284TrpPro: 0.284 ± 0.0
0.425TrpGln: 0.425 ± 0.0
0.567TrpArg: 0.567 ± 0.0
1.134TrpSer: 1.134 ± 0.0
0.851TrpThr: 0.851 ± 0.0
1.134TrpVal: 1.134 ± 0.0
0.142TrpTrp: 0.142 ± 0.0
0.425TrpTyr: 0.425 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.985TyrAla: 1.985 ± 0.0
0.709TyrCys: 0.709 ± 0.0
2.836TyrAsp: 2.836 ± 0.0
2.411TyrGlu: 2.411 ± 0.0
2.269TyrPhe: 2.269 ± 0.0
2.836TyrGly: 2.836 ± 0.0
0.567TyrHis: 0.567 ± 0.0
3.261TyrIle: 3.261 ± 0.0
2.836TyrLys: 2.836 ± 0.0
2.552TyrLeu: 2.552 ± 0.0
0.851TyrMet: 0.851 ± 0.0
1.843TyrAsn: 1.843 ± 0.0
0.851TyrPro: 0.851 ± 0.0
1.276TyrGln: 1.276 ± 0.0
1.276TyrArg: 1.276 ± 0.0
2.978TyrSer: 2.978 ± 0.0
2.127TyrThr: 2.127 ± 0.0
1.985TyrVal: 1.985 ± 0.0
0.567TyrTrp: 0.567 ± 0.0
1.276TyrTyr: 1.276 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (7053 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski