Amino acid dipeptide frequency for Wenling hagfish virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
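
As a rough illustration of how such per-mille values can be obtained, the sketch below counts overlapping residue pairs and scales the counts to ‰. It is not the database's own code: the function name `dipeptide_per_mille`, the toy sequence, and the use of one-letter codes with overlapping windows are assumptions made for this example. The overlapping-window assumption is at least consistent with the table, since one occurrence among the 2081 adjacent pairs of a 2082-residue protein is 1000/2081 ≈ 0.481‰, the smallest non-zero value listed.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return overlapping dipeptide frequencies in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of two residues along each sequence (positions i and i+1).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

toy = ["ALLKAL"]  # made-up sequence, not taken from the virus proteome
freqs = dipeptide_per_mille(toy)

for pair, value in sorted(freqs.items()):
    status = "over" if value > UNIFORM_PER_MILLE else "under"
    fraction = value * 1e-3  # per mille back to a plain fraction
    print(pair, round(value, 3), status, round(fraction, 6))
```

Comparing against the uniform 2.5‰ expectation is the simplest possible baseline; the threshold actually used for the red and blue marking is not stated in the text.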

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰). Rows give the first residue of the dipeptide, columns the second. Every entry has an estimated error of ± 0.0.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  6.728  0.481  1.922  3.364  1.922  2.403  1.442  7.208  5.286   8.65  2.403  3.364  0.961  5.286  4.325  4.325  2.883  3.844  0.481  1.442    0.0
Cys  2.403  0.961  0.481  1.922  0.481  1.442    0.0  0.961  0.961  2.403  0.481  0.481  0.961  0.961  1.442  0.481  1.922  2.403    0.0  1.922    0.0
Asp  1.922  0.481  3.844  1.922  5.766  1.922  0.481  3.844  3.844  5.286  0.961  1.922  4.805  2.883  1.442  2.883  1.442  4.805  1.922  0.961    0.0
Glu  6.728  1.442  4.325  9.611  1.442  4.325  0.961  2.403  3.844  5.766  2.403  2.403  2.883  4.325  4.325  2.883  6.247  2.883  0.961  2.403    0.0
Phe  0.961  2.883  2.883  2.883  2.403  3.364  0.481  0.961  2.403  2.883  3.364  1.922  0.961  0.481  1.922  2.403  3.364  1.442  1.442  1.922    0.0
Gly  2.403  0.481  3.364  2.883  2.883  1.922  0.961  1.922  3.844  4.805  1.442  2.883  2.403  0.961  1.442  5.286  2.403  2.883  1.442  1.922    0.0
His  0.961  0.961  0.961  0.961  0.481  1.442    0.0  0.481  0.961  1.922    0.0    0.0  0.961    0.0  0.481  0.961  0.961  1.442    0.0    0.0    0.0
Ile  4.325  1.442  0.961  3.844  1.922  1.922  0.481  2.403  6.728  6.247  1.922  2.403  2.403  1.442  3.844  7.208  2.403  2.403  0.481  1.442    0.0
Lys  5.766  0.961  4.805  4.805  3.364  3.844  2.883  2.883  7.689   9.13  2.883  0.961  4.325  3.364  5.766  3.844  3.844  4.805  1.442  1.922    0.0
Leu  5.766  2.883  5.766  4.325  3.364  3.844  1.442  6.728  8.169  5.286  0.961  3.844  3.844  4.325  5.286 10.572  4.325  3.844  1.442  3.844    0.0
Met  0.961    0.0  1.442  3.364  2.403  0.961    0.0  3.364  3.364  2.883  1.442  0.481  0.961  2.403  2.883    0.0  0.961  1.442    0.0  0.481    0.0
Asn  1.442  0.481  0.961  3.364  1.922  1.442    0.0  4.805  2.883  4.325  0.961  2.883  3.844  1.442  2.403  2.883  2.403  1.442  0.961  3.364    0.0
Pro  1.442  0.961  2.403  4.805  1.442  4.325  1.922  3.364  0.481  3.844  0.961  0.481  0.961  2.403  2.883  4.325  2.883  2.883    0.0  1.442    0.0
Gln  1.922  0.961  2.883  4.325  2.403  1.922  0.481  2.403  3.844  2.403  2.403  2.403  2.403  2.883  1.922  4.325  0.961  2.403  1.442  0.961    0.0
Arg  4.325  2.883  1.442  0.961  0.961  3.844  0.961  0.961  4.805  3.364  0.481  3.364  3.844  2.883  1.922  6.247  3.364  1.442  0.961  4.325    0.0
Ser  7.689  1.442  6.728  4.325  2.883  4.805  0.961  2.883  5.766  7.208  2.883  5.286  3.844  2.883  4.805  7.689  3.364  3.364  0.961  2.403    0.0
Thr  3.364  1.442  3.844  5.286  1.442  1.922    0.0  2.403  5.766  3.844  1.442  3.844  1.922  2.403  2.403  1.922  1.442  4.325    0.0  0.481    0.0
Val  6.247  0.481  3.364  7.208  0.961  2.403    0.0  2.403  3.844  4.805  1.442  1.922  1.922  1.442  2.883  4.805  0.961  5.286    0.0  2.403    0.0
Trp  0.961    0.0    0.0    0.0  0.961  0.481    0.0  1.922  1.922  2.403    0.0  0.481    0.0  0.481    0.0  2.883  0.481  0.961    0.0    0.0    0.0
Tyr  2.403  1.442  0.961  2.403  1.922    0.0  0.481  1.922  2.883  2.403  0.481  2.883    0.0  2.403  1.442  6.247  2.883  0.481    0.0  1.922    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (2082 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
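
To make the note concrete, here is a minimal sketch of a protein-level bootstrap, assuming that whole proteins are resampled with replacement and the error is the standard deviation of the resampled frequencies. The exact procedure used by the database is not described here, so the function names, the seed, and the choice of standard deviation are all assumptions of this example.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples (assumed scheme)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# With a single protein every resample is identical, so the spread is zero.
single = bootstrap_error(["ALLKAL"], "AL")
# With several (made-up) proteins the resamples differ and the spread is non-zero.
several = bootstrap_error(["ALLKAL", "KKKKKK"], "AL")
print(single, several)
```

Under this scheme a proteome consisting of one protein always yields an error of 0.0, which would be consistent with the ± 0.0 reported for every dipeptide of this virus.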

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski