Amino acid dipepetide frequency for Zika virus (ZIKV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.825AlaAla: 10.825 ± 0.0
0.878AlaCys: 0.878 ± 0.0
2.048AlaAsp: 2.048 ± 0.0
4.974AlaGlu: 4.974 ± 0.0
2.048AlaPhe: 2.048 ± 0.0
7.314AlaGly: 7.314 ± 0.0
2.633AlaHis: 2.633 ± 0.0
6.144AlaIle: 6.144 ± 0.0
3.803AlaLys: 3.803 ± 0.0
9.655AlaLeu: 9.655 ± 0.0
1.755AlaMet: 1.755 ± 0.0
1.463AlaAsn: 1.463 ± 0.0
1.755AlaPro: 1.755 ± 0.0
1.463AlaGln: 1.463 ± 0.0
4.096AlaArg: 4.096 ± 0.0
3.511AlaSer: 3.511 ± 0.0
4.096AlaThr: 4.096 ± 0.0
8.777AlaVal: 8.777 ± 0.0
4.096AlaTrp: 4.096 ± 0.0
1.463AlaTyr: 1.463 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.17CysAla: 1.17 ± 0.0
0.293CysCys: 0.293 ± 0.0
1.17CysAsp: 1.17 ± 0.0
0.585CysGlu: 0.585 ± 0.0
0.585CysPhe: 0.585 ± 0.0
2.926CysGly: 2.926 ± 0.0
1.17CysHis: 1.17 ± 0.0
0.585CysIle: 0.585 ± 0.0
0.585CysLys: 0.585 ± 0.0
1.463CysLeu: 1.463 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.585CysAsn: 0.585 ± 0.0
1.463CysPro: 1.463 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.463CysArg: 1.463 ± 0.0
1.463CysSer: 1.463 ± 0.0
0.878CysThr: 0.878 ± 0.0
1.17CysVal: 1.17 ± 0.0
0.585CysTrp: 0.585 ± 0.0
0.585CysTyr: 0.585 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.048AspAla: 2.048 ± 0.0
0.878AspCys: 0.878 ± 0.0
1.17AspAsp: 1.17 ± 0.0
2.633AspGlu: 2.633 ± 0.0
1.755AspPhe: 1.755 ± 0.0
4.681AspGly: 4.681 ± 0.0
2.048AspHis: 2.048 ± 0.0
3.511AspIle: 3.511 ± 0.0
2.048AspLys: 2.048 ± 0.0
5.266AspLeu: 5.266 ± 0.0
1.463AspMet: 1.463 ± 0.0
1.17AspAsn: 1.17 ± 0.0
2.341AspPro: 2.341 ± 0.0
0.293AspGln: 0.293 ± 0.0
2.633AspArg: 2.633 ± 0.0
1.755AspSer: 1.755 ± 0.0
4.389AspThr: 4.389 ± 0.0
3.218AspVal: 3.218 ± 0.0
0.878AspTrp: 0.878 ± 0.0
0.878AspTyr: 0.878 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
7.607GluAla: 7.607 ± 0.0
2.048GluCys: 2.048 ± 0.0
2.926GluAsp: 2.926 ± 0.0
8.192GluGlu: 8.192 ± 0.0
2.341GluPhe: 2.341 ± 0.0
5.559GluGly: 5.559 ± 0.0
1.17GluHis: 1.17 ± 0.0
2.926GluIle: 2.926 ± 0.0
3.511GluLys: 3.511 ± 0.0
2.633GluLeu: 2.633 ± 0.0
2.926GluMet: 2.926 ± 0.0
2.926GluAsn: 2.926 ± 0.0
2.341GluPro: 2.341 ± 0.0
0.293GluGln: 0.293 ± 0.0
3.218GluArg: 3.218 ± 0.0
2.341GluSer: 2.341 ± 0.0
4.681GluThr: 4.681 ± 0.0
4.389GluVal: 4.389 ± 0.0
2.048GluTrp: 2.048 ± 0.0
0.878GluTyr: 0.878 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.341PheAla: 2.341 ± 0.0
0.585PheCys: 0.585 ± 0.0
0.878PheAsp: 0.878 ± 0.0
1.755PheGlu: 1.755 ± 0.0
0.293PhePhe: 0.293 ± 0.0
2.926PheGly: 2.926 ± 0.0
1.463PheHis: 1.463 ± 0.0
0.878PheIle: 0.878 ± 0.0
2.633PheLys: 2.633 ± 0.0
2.341PheLeu: 2.341 ± 0.0
0.585PheMet: 0.585 ± 0.0
0.585PheAsn: 0.585 ± 0.0
0.293PhePro: 0.293 ± 0.0
0.585PheGln: 0.585 ± 0.0
1.17PheArg: 1.17 ± 0.0
2.341PheSer: 2.341 ± 0.0
2.048PheThr: 2.048 ± 0.0
2.341PheVal: 2.341 ± 0.0
0.293PheTrp: 0.293 ± 0.0
0.293PheTyr: 0.293 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
7.022GlyAla: 7.022 ± 0.0
2.048GlyCys: 2.048 ± 0.0
4.681GlyAsp: 4.681 ± 0.0
5.266GlyGlu: 5.266 ± 0.0
2.926GlyPhe: 2.926 ± 0.0
7.022GlyGly: 7.022 ± 0.0
2.633GlyHis: 2.633 ± 0.0
3.803GlyIle: 3.803 ± 0.0
7.899GlyLys: 7.899 ± 0.0
7.607GlyLeu: 7.607 ± 0.0
2.341GlyMet: 2.341 ± 0.0
1.17GlyAsn: 1.17 ± 0.0
3.511GlyPro: 3.511 ± 0.0
0.878GlyGln: 0.878 ± 0.0
4.681GlyArg: 4.681 ± 0.0
7.899GlySer: 7.899 ± 0.0
5.851GlyThr: 5.851 ± 0.0
7.899GlyVal: 7.899 ± 0.0
2.926GlyTrp: 2.926 ± 0.0
2.048GlyTyr: 2.048 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
2.926HisAla: 2.926 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.293HisAsp: 0.293 ± 0.0
0.585HisGlu: 0.585 ± 0.0
0.878HisPhe: 0.878 ± 0.0
2.926HisGly: 2.926 ± 0.0
1.463HisHis: 1.463 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.585HisLys: 0.585 ± 0.0
1.755HisLeu: 1.755 ± 0.0
1.463HisMet: 1.463 ± 0.0
0.293HisAsn: 0.293 ± 0.0
0.878HisPro: 0.878 ± 0.0
0.585HisGln: 0.585 ± 0.0
1.463HisArg: 1.463 ± 0.0
2.048HisSer: 2.048 ± 0.0
1.17HisThr: 1.17 ± 0.0
0.585HisVal: 0.585 ± 0.0
1.463HisTrp: 1.463 ± 0.0
0.293HisTyr: 0.293 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.218IleAla: 3.218 ± 0.0
1.17IleCys: 1.17 ± 0.0
1.755IleAsp: 1.755 ± 0.0
2.926IleGlu: 2.926 ± 0.0
2.633IlePhe: 2.633 ± 0.0
4.974IleGly: 4.974 ± 0.0
0.585IleHis: 0.585 ± 0.0
3.803IleIle: 3.803 ± 0.0
2.633IleLys: 2.633 ± 0.0
4.389IleLeu: 4.389 ± 0.0
3.218IleMet: 3.218 ± 0.0
1.463IleAsn: 1.463 ± 0.0
2.048IlePro: 2.048 ± 0.0
0.878IleGln: 0.878 ± 0.0
3.803IleArg: 3.803 ± 0.0
2.926IleSer: 2.926 ± 0.0
3.803IleThr: 3.803 ± 0.0
3.218IleVal: 3.218 ± 0.0
0.293IleTrp: 0.293 ± 0.0
1.463IleTyr: 1.463 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.633LysAla: 2.633 ± 0.0
0.878LysCys: 0.878 ± 0.0
2.633LysAsp: 2.633 ± 0.0
4.096LysGlu: 4.096 ± 0.0
0.878LysPhe: 0.878 ± 0.0
4.681LysGly: 4.681 ± 0.0
0.585LysHis: 0.585 ± 0.0
1.463LysIle: 1.463 ± 0.0
4.681LysLys: 4.681 ± 0.0
1.755LysLeu: 1.755 ± 0.0
1.755LysMet: 1.755 ± 0.0
2.633LysAsn: 2.633 ± 0.0
2.633LysPro: 2.633 ± 0.0
1.755LysGln: 1.755 ± 0.0
7.022LysArg: 7.022 ± 0.0
3.803LysSer: 3.803 ± 0.0
3.511LysThr: 3.511 ± 0.0
5.266LysVal: 5.266 ± 0.0
0.585LysTrp: 0.585 ± 0.0
1.755LysTyr: 1.755 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
7.607LeuAla: 7.607 ± 0.0
1.463LeuCys: 1.463 ± 0.0
4.681LeuAsp: 4.681 ± 0.0
5.851LeuGlu: 5.851 ± 0.0
1.17LeuPhe: 1.17 ± 0.0
11.41LeuGly: 11.41 ± 0.0
0.585LeuHis: 0.585 ± 0.0
7.022LeuIle: 7.022 ± 0.0
5.559LeuLys: 5.559 ± 0.0
10.532LeuLeu: 10.532 ± 0.0
4.096LeuMet: 4.096 ± 0.0
2.048LeuAsn: 2.048 ± 0.0
3.803LeuPro: 3.803 ± 0.0
2.633LeuGln: 2.633 ± 0.0
4.681LeuArg: 4.681 ± 0.0
3.218LeuSer: 3.218 ± 0.0
4.096LeuThr: 4.096 ± 0.0
7.607LeuVal: 7.607 ± 0.0
1.755LeuTrp: 1.755 ± 0.0
1.463LeuTyr: 1.463 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
5.266MetAla: 5.266 ± 0.0
0.585MetCys: 0.585 ± 0.0
3.218MetAsp: 3.218 ± 0.0
3.218MetGlu: 3.218 ± 0.0
0.0MetPhe: 0.0 ± 0.0
3.511MetGly: 3.511 ± 0.0
0.293MetHis: 0.293 ± 0.0
1.17MetIle: 1.17 ± 0.0
1.463MetLys: 1.463 ± 0.0
3.511MetLeu: 3.511 ± 0.0
1.17MetMet: 1.17 ± 0.0
1.463MetAsn: 1.463 ± 0.0
0.878MetPro: 0.878 ± 0.0
0.878MetGln: 0.878 ± 0.0
0.878MetArg: 0.878 ± 0.0
1.755MetSer: 1.755 ± 0.0
2.633MetThr: 2.633 ± 0.0
2.633MetVal: 2.633 ± 0.0
1.755MetTrp: 1.755 ± 0.0
1.755MetTyr: 1.755 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.463AsnAla: 1.463 ± 0.0
0.0AsnCys: 0.0 ± 0.0
1.755AsnAsp: 1.755 ± 0.0
1.755AsnGlu: 1.755 ± 0.0
0.293AsnPhe: 0.293 ± 0.0
2.633AsnGly: 2.633 ± 0.0
0.293AsnHis: 0.293 ± 0.0
2.926AsnIle: 2.926 ± 0.0
2.633AsnLys: 2.633 ± 0.0
2.048AsnLeu: 2.048 ± 0.0
1.755AsnMet: 1.755 ± 0.0
1.463AsnAsn: 1.463 ± 0.0
2.341AsnPro: 2.341 ± 0.0
1.17AsnGln: 1.17 ± 0.0
1.17AsnArg: 1.17 ± 0.0
2.926AsnSer: 2.926 ± 0.0
1.755AsnThr: 1.755 ± 0.0
0.585AsnVal: 0.585 ± 0.0
0.878AsnTrp: 0.878 ± 0.0
0.585AsnTyr: 0.585 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.511ProAla: 3.511 ± 0.0
0.878ProCys: 0.878 ± 0.0
1.17ProAsp: 1.17 ± 0.0
2.926ProGlu: 2.926 ± 0.0
1.463ProPhe: 1.463 ± 0.0
4.096ProGly: 4.096 ± 0.0
0.585ProHis: 0.585 ± 0.0
2.341ProIle: 2.341 ± 0.0
1.17ProLys: 1.17 ± 0.0
2.633ProLeu: 2.633 ± 0.0
1.755ProMet: 1.755 ± 0.0
1.463ProAsn: 1.463 ± 0.0
1.755ProPro: 1.755 ± 0.0
1.17ProGln: 1.17 ± 0.0
3.511ProArg: 3.511 ± 0.0
2.341ProSer: 2.341 ± 0.0
1.463ProThr: 1.463 ± 0.0
4.096ProVal: 4.096 ± 0.0
1.17ProTrp: 1.17 ± 0.0
2.048ProTyr: 2.048 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
0.878GlnAla: 0.878 ± 0.0
0.0GlnCys: 0.0 ± 0.0
2.048GlnAsp: 2.048 ± 0.0
2.341GlnGlu: 2.341 ± 0.0
0.0GlnPhe: 0.0 ± 0.0
1.17GlnGly: 1.17 ± 0.0
0.585GlnHis: 0.585 ± 0.0
0.878GlnIle: 0.878 ± 0.0
0.878GlnLys: 0.878 ± 0.0
2.633GlnLeu: 2.633 ± 0.0
1.463GlnMet: 1.463 ± 0.0
0.293GlnAsn: 0.293 ± 0.0
0.878GlnPro: 0.878 ± 0.0
0.293GlnGln: 0.293 ± 0.0
2.341GlnArg: 2.341 ± 0.0
0.878GlnSer: 0.878 ± 0.0
1.755GlnThr: 1.755 ± 0.0
2.341GlnVal: 2.341 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
0.585GlnTyr: 0.585 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
5.851ArgAla: 5.851 ± 0.0
1.17ArgCys: 1.17 ± 0.0
1.17ArgAsp: 1.17 ± 0.0
4.974ArgGlu: 4.974 ± 0.0
2.048ArgPhe: 2.048 ± 0.0
6.144ArgGly: 6.144 ± 0.0
0.585ArgHis: 0.585 ± 0.0
2.926ArgIle: 2.926 ± 0.0
2.926ArgLys: 2.926 ± 0.0
5.851ArgLeu: 5.851 ± 0.0
2.048ArgMet: 2.048 ± 0.0
2.633ArgAsn: 2.633 ± 0.0
2.926ArgPro: 2.926 ± 0.0
0.878ArgGln: 0.878 ± 0.0
5.559ArgArg: 5.559 ± 0.0
4.096ArgSer: 4.096 ± 0.0
4.096ArgThr: 4.096 ± 0.0
6.144ArgVal: 6.144 ± 0.0
1.463ArgTrp: 1.463 ± 0.0
0.878ArgTyr: 0.878 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.803SerAla: 3.803 ± 0.0
0.878SerCys: 0.878 ± 0.0
2.633SerAsp: 2.633 ± 0.0
1.755SerGlu: 1.755 ± 0.0
1.755SerPhe: 1.755 ± 0.0
6.144SerGly: 6.144 ± 0.0
1.17SerHis: 1.17 ± 0.0
2.633SerIle: 2.633 ± 0.0
1.463SerLys: 1.463 ± 0.0
5.851SerLeu: 5.851 ± 0.0
2.048SerMet: 2.048 ± 0.0
2.341SerAsn: 2.341 ± 0.0
2.633SerPro: 2.633 ± 0.0
2.048SerGln: 2.048 ± 0.0
3.218SerArg: 3.218 ± 0.0
3.218SerSer: 3.218 ± 0.0
5.559SerThr: 5.559 ± 0.0
3.803SerVal: 3.803 ± 0.0
0.878SerTrp: 0.878 ± 0.0
4.389SerTyr: 4.389 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
6.729ThrAla: 6.729 ± 0.0
2.048ThrCys: 2.048 ± 0.0
4.096ThrAsp: 4.096 ± 0.0
2.633ThrGlu: 2.633 ± 0.0
1.755ThrPhe: 1.755 ± 0.0
4.096ThrGly: 4.096 ± 0.0
1.17ThrHis: 1.17 ± 0.0
1.463ThrIle: 1.463 ± 0.0
4.096ThrLys: 4.096 ± 0.0
6.144ThrLeu: 6.144 ± 0.0
2.341ThrMet: 2.341 ± 0.0
1.17ThrAsn: 1.17 ± 0.0
3.511ThrPro: 3.511 ± 0.0
2.633ThrGln: 2.633 ± 0.0
5.559ThrArg: 5.559 ± 0.0
3.511ThrSer: 3.511 ± 0.0
5.266ThrThr: 5.266 ± 0.0
4.389ThrVal: 4.389 ± 0.0
2.633ThrTrp: 2.633 ± 0.0
0.878ThrTyr: 0.878 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.974ValAla: 4.974 ± 0.0
1.17ValCys: 1.17 ± 0.0
4.681ValAsp: 4.681 ± 0.0
5.851ValGlu: 5.851 ± 0.0
2.048ValPhe: 2.048 ± 0.0
3.803ValGly: 3.803 ± 0.0
0.878ValHis: 0.878 ± 0.0
4.681ValIle: 4.681 ± 0.0
3.511ValLys: 3.511 ± 0.0
7.022ValLeu: 7.022 ± 0.0
2.926ValMet: 2.926 ± 0.0
2.633ValAsn: 2.633 ± 0.0
4.681ValPro: 4.681 ± 0.0
2.926ValGln: 2.926 ± 0.0
5.266ValArg: 5.266 ± 0.0
4.974ValSer: 4.974 ± 0.0
6.144ValThr: 6.144 ± 0.0
7.607ValVal: 7.607 ± 0.0
2.048ValTrp: 2.048 ± 0.0
1.463ValTyr: 1.463 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.17TrpAla: 1.17 ± 0.0
1.17TrpCys: 1.17 ± 0.0
2.048TrpAsp: 2.048 ± 0.0
1.17TrpGlu: 1.17 ± 0.0
1.17TrpPhe: 1.17 ± 0.0
1.755TrpGly: 1.755 ± 0.0
1.17TrpHis: 1.17 ± 0.0
0.878TrpIle: 0.878 ± 0.0
2.048TrpLys: 2.048 ± 0.0
4.974TrpLeu: 4.974 ± 0.0
0.878TrpMet: 0.878 ± 0.0
1.463TrpAsn: 1.463 ± 0.0
0.585TrpPro: 0.585 ± 0.0
0.293TrpGln: 0.293 ± 0.0
1.17TrpArg: 1.17 ± 0.0
1.463TrpSer: 1.463 ± 0.0
1.17TrpThr: 1.17 ± 0.0
1.463TrpVal: 1.463 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.878TrpTyr: 0.878 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.755TyrAla: 1.755 ± 0.0
0.585TyrCys: 0.585 ± 0.0
0.0TyrAsp: 0.0 ± 0.0
1.463TyrGlu: 1.463 ± 0.0
0.585TyrPhe: 0.585 ± 0.0
2.341TyrGly: 2.341 ± 0.0
0.585TyrHis: 0.585 ± 0.0
1.17TyrIle: 1.17 ± 0.0
0.878TyrLys: 0.878 ± 0.0
3.511TyrLeu: 3.511 ± 0.0
1.755TyrMet: 1.755 ± 0.0
1.17TyrAsn: 1.17 ± 0.0
0.293TyrPro: 0.293 ± 0.0
0.585TyrGln: 0.585 ± 0.0
1.463TyrArg: 1.463 ± 0.0
1.755TyrSer: 1.755 ± 0.0
1.755TyrThr: 1.755 ± 0.0
1.755TyrVal: 1.755 ± 0.0
1.17TyrTrp: 1.17 ± 0.0
0.878TyrTyr: 0.878 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3419 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski