Amino acid dipepetide frequency for Hubei mosquito virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.825AlaAla: 2.825 ± 0.0
1.766AlaCys: 1.766 ± 0.0
3.178AlaAsp: 3.178 ± 0.0
2.472AlaGlu: 2.472 ± 0.0
2.472AlaPhe: 2.472 ± 0.0
5.297AlaGly: 5.297 ± 0.0
2.119AlaHis: 2.119 ± 0.0
3.178AlaIle: 3.178 ± 0.0
3.531AlaLys: 3.531 ± 0.0
8.475AlaLeu: 8.475 ± 0.0
1.766AlaMet: 1.766 ± 0.0
5.297AlaAsn: 5.297 ± 0.0
3.531AlaPro: 3.531 ± 0.0
2.472AlaGln: 2.472 ± 0.0
5.297AlaArg: 5.297 ± 0.0
5.65AlaSer: 5.65 ± 0.0
6.356AlaThr: 6.356 ± 0.0
5.297AlaVal: 5.297 ± 0.0
2.119AlaTrp: 2.119 ± 0.0
4.237AlaTyr: 4.237 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.706CysAla: 0.706 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.059CysGlu: 1.059 ± 0.0
0.706CysPhe: 0.706 ± 0.0
0.706CysGly: 0.706 ± 0.0
1.059CysHis: 1.059 ± 0.0
1.059CysIle: 1.059 ± 0.0
0.0CysLys: 0.0 ± 0.0
1.412CysLeu: 1.412 ± 0.0
0.353CysMet: 0.353 ± 0.0
2.472CysAsn: 2.472 ± 0.0
1.412CysPro: 1.412 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.766CysArg: 1.766 ± 0.0
1.059CysSer: 1.059 ± 0.0
0.706CysThr: 0.706 ± 0.0
1.412CysVal: 1.412 ± 0.0
0.706CysTrp: 0.706 ± 0.0
0.706CysTyr: 0.706 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.178AspAla: 3.178 ± 0.0
1.059AspCys: 1.059 ± 0.0
2.472AspAsp: 2.472 ± 0.0
2.825AspGlu: 2.825 ± 0.0
3.178AspPhe: 3.178 ± 0.0
2.119AspGly: 2.119 ± 0.0
0.706AspHis: 0.706 ± 0.0
4.59AspIle: 4.59 ± 0.0
3.178AspLys: 3.178 ± 0.0
4.237AspLeu: 4.237 ± 0.0
1.766AspMet: 1.766 ± 0.0
3.178AspAsn: 3.178 ± 0.0
3.531AspPro: 3.531 ± 0.0
0.706AspGln: 0.706 ± 0.0
3.531AspArg: 3.531 ± 0.0
1.412AspSer: 1.412 ± 0.0
3.884AspThr: 3.884 ± 0.0
5.297AspVal: 5.297 ± 0.0
0.0AspTrp: 0.0 ± 0.0
3.531AspTyr: 3.531 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.59GluAla: 4.59 ± 0.0
1.059GluCys: 1.059 ± 0.0
2.472GluAsp: 2.472 ± 0.0
2.472GluGlu: 2.472 ± 0.0
1.412GluPhe: 1.412 ± 0.0
2.119GluGly: 2.119 ± 0.0
2.472GluHis: 2.472 ± 0.0
3.884GluIle: 3.884 ± 0.0
2.825GluLys: 2.825 ± 0.0
3.531GluLeu: 3.531 ± 0.0
1.412GluMet: 1.412 ± 0.0
3.178GluAsn: 3.178 ± 0.0
2.119GluPro: 2.119 ± 0.0
0.706GluGln: 0.706 ± 0.0
1.766GluArg: 1.766 ± 0.0
4.944GluSer: 4.944 ± 0.0
1.412GluThr: 1.412 ± 0.0
2.825GluVal: 2.825 ± 0.0
1.059GluTrp: 1.059 ± 0.0
3.178GluTyr: 3.178 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.884PheAla: 3.884 ± 0.0
0.0PheCys: 0.0 ± 0.0
1.766PheAsp: 1.766 ± 0.0
1.766PheGlu: 1.766 ± 0.0
2.119PhePhe: 2.119 ± 0.0
1.412PheGly: 1.412 ± 0.0
0.353PheHis: 0.353 ± 0.0
2.825PheIle: 2.825 ± 0.0
2.825PheLys: 2.825 ± 0.0
3.178PheLeu: 3.178 ± 0.0
0.706PheMet: 0.706 ± 0.0
1.059PheAsn: 1.059 ± 0.0
1.059PhePro: 1.059 ± 0.0
2.119PheGln: 2.119 ± 0.0
1.412PheArg: 1.412 ± 0.0
2.472PheSer: 2.472 ± 0.0
2.825PheThr: 2.825 ± 0.0
3.178PheVal: 3.178 ± 0.0
0.706PheTrp: 0.706 ± 0.0
1.412PheTyr: 1.412 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
6.003GlyAla: 6.003 ± 0.0
0.353GlyCys: 0.353 ± 0.0
3.884GlyAsp: 3.884 ± 0.0
2.825GlyGlu: 2.825 ± 0.0
1.412GlyPhe: 1.412 ± 0.0
2.825GlyGly: 2.825 ± 0.0
1.766GlyHis: 1.766 ± 0.0
3.178GlyIle: 3.178 ± 0.0
2.825GlyLys: 2.825 ± 0.0
5.65GlyLeu: 5.65 ± 0.0
1.059GlyMet: 1.059 ± 0.0
2.119GlyAsn: 2.119 ± 0.0
4.944GlyPro: 4.944 ± 0.0
2.119GlyGln: 2.119 ± 0.0
0.353GlyArg: 0.353 ± 0.0
2.825GlySer: 2.825 ± 0.0
4.237GlyThr: 4.237 ± 0.0
6.709GlyVal: 6.709 ± 0.0
0.706GlyTrp: 0.706 ± 0.0
3.178GlyTyr: 3.178 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
2.119HisAla: 2.119 ± 0.0
0.0HisCys: 0.0 ± 0.0
3.178HisAsp: 3.178 ± 0.0
0.353HisGlu: 0.353 ± 0.0
1.412HisPhe: 1.412 ± 0.0
2.825HisGly: 2.825 ± 0.0
0.353HisHis: 0.353 ± 0.0
1.412HisIle: 1.412 ± 0.0
1.059HisLys: 1.059 ± 0.0
2.825HisLeu: 2.825 ± 0.0
0.706HisMet: 0.706 ± 0.0
1.059HisAsn: 1.059 ± 0.0
1.412HisPro: 1.412 ± 0.0
0.706HisGln: 0.706 ± 0.0
1.059HisArg: 1.059 ± 0.0
0.706HisSer: 0.706 ± 0.0
1.059HisThr: 1.059 ± 0.0
1.766HisVal: 1.766 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.412HisTyr: 1.412 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.237IleAla: 4.237 ± 0.0
1.412IleCys: 1.412 ± 0.0
3.884IleAsp: 3.884 ± 0.0
2.472IleGlu: 2.472 ± 0.0
1.412IlePhe: 1.412 ± 0.0
1.766IleGly: 1.766 ± 0.0
1.412IleHis: 1.412 ± 0.0
3.884IleIle: 3.884 ± 0.0
2.825IleLys: 2.825 ± 0.0
4.59IleLeu: 4.59 ± 0.0
1.766IleMet: 1.766 ± 0.0
3.884IleAsn: 3.884 ± 0.0
3.884IlePro: 3.884 ± 0.0
1.766IleGln: 1.766 ± 0.0
2.825IleArg: 2.825 ± 0.0
2.119IleSer: 2.119 ± 0.0
4.944IleThr: 4.944 ± 0.0
5.65IleVal: 5.65 ± 0.0
0.353IleTrp: 0.353 ± 0.0
1.766IleTyr: 1.766 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.531LysAla: 3.531 ± 0.0
0.0LysCys: 0.0 ± 0.0
2.472LysAsp: 2.472 ± 0.0
3.178LysGlu: 3.178 ± 0.0
2.119LysPhe: 2.119 ± 0.0
2.472LysGly: 2.472 ± 0.0
2.472LysHis: 2.472 ± 0.0
1.412LysIle: 1.412 ± 0.0
2.119LysLys: 2.119 ± 0.0
3.178LysLeu: 3.178 ± 0.0
1.766LysMet: 1.766 ± 0.0
1.412LysAsn: 1.412 ± 0.0
2.472LysPro: 2.472 ± 0.0
0.0LysGln: 0.0 ± 0.0
2.472LysArg: 2.472 ± 0.0
4.59LysSer: 4.59 ± 0.0
3.178LysThr: 3.178 ± 0.0
3.531LysVal: 3.531 ± 0.0
0.353LysTrp: 0.353 ± 0.0
1.059LysTyr: 1.059 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.356LeuAla: 6.356 ± 0.0
1.412LeuCys: 1.412 ± 0.0
4.944LeuAsp: 4.944 ± 0.0
3.178LeuGlu: 3.178 ± 0.0
2.119LeuPhe: 2.119 ± 0.0
5.297LeuGly: 5.297 ± 0.0
1.766LeuHis: 1.766 ± 0.0
4.237LeuIle: 4.237 ± 0.0
3.884LeuLys: 3.884 ± 0.0
6.709LeuLeu: 6.709 ± 0.0
2.825LeuMet: 2.825 ± 0.0
2.472LeuAsn: 2.472 ± 0.0
2.119LeuPro: 2.119 ± 0.0
1.766LeuGln: 1.766 ± 0.0
5.65LeuArg: 5.65 ± 0.0
8.828LeuSer: 8.828 ± 0.0
7.062LeuThr: 7.062 ± 0.0
6.356LeuVal: 6.356 ± 0.0
0.706LeuTrp: 0.706 ± 0.0
2.825LeuTyr: 2.825 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.825MetAla: 2.825 ± 0.0
0.706MetCys: 0.706 ± 0.0
3.531MetAsp: 3.531 ± 0.0
1.412MetGlu: 1.412 ± 0.0
0.706MetPhe: 0.706 ± 0.0
1.766MetGly: 1.766 ± 0.0
1.412MetHis: 1.412 ± 0.0
1.766MetIle: 1.766 ± 0.0
1.059MetLys: 1.059 ± 0.0
2.119MetLeu: 2.119 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.706MetAsn: 0.706 ± 0.0
1.412MetPro: 1.412 ± 0.0
0.706MetGln: 0.706 ± 0.0
2.472MetArg: 2.472 ± 0.0
1.059MetSer: 1.059 ± 0.0
1.059MetThr: 1.059 ± 0.0
2.472MetVal: 2.472 ± 0.0
0.706MetTrp: 0.706 ± 0.0
0.353MetTyr: 0.353 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.237AsnAla: 4.237 ± 0.0
0.353AsnCys: 0.353 ± 0.0
2.825AsnAsp: 2.825 ± 0.0
2.825AsnGlu: 2.825 ± 0.0
3.178AsnPhe: 3.178 ± 0.0
3.531AsnGly: 3.531 ± 0.0
0.706AsnHis: 0.706 ± 0.0
1.766AsnIle: 1.766 ± 0.0
2.119AsnLys: 2.119 ± 0.0
1.412AsnLeu: 1.412 ± 0.0
1.412AsnMet: 1.412 ± 0.0
1.412AsnAsn: 1.412 ± 0.0
3.531AsnPro: 3.531 ± 0.0
1.059AsnGln: 1.059 ± 0.0
2.119AsnArg: 2.119 ± 0.0
4.59AsnSer: 4.59 ± 0.0
3.531AsnThr: 3.531 ± 0.0
3.884AsnVal: 3.884 ± 0.0
0.353AsnTrp: 0.353 ± 0.0
3.178AsnTyr: 3.178 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.884ProAla: 3.884 ± 0.0
0.706ProCys: 0.706 ± 0.0
2.119ProAsp: 2.119 ± 0.0
3.531ProGlu: 3.531 ± 0.0
1.766ProPhe: 1.766 ± 0.0
4.237ProGly: 4.237 ± 0.0
1.766ProHis: 1.766 ± 0.0
2.119ProIle: 2.119 ± 0.0
1.766ProLys: 1.766 ± 0.0
6.709ProLeu: 6.709 ± 0.0
1.059ProMet: 1.059 ± 0.0
2.119ProAsn: 2.119 ± 0.0
1.412ProPro: 1.412 ± 0.0
3.531ProGln: 3.531 ± 0.0
2.119ProArg: 2.119 ± 0.0
4.59ProSer: 4.59 ± 0.0
3.531ProThr: 3.531 ± 0.0
3.531ProVal: 3.531 ± 0.0
0.706ProTrp: 0.706 ± 0.0
3.884ProTyr: 3.884 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.766GlnAla: 1.766 ± 0.0
0.706GlnCys: 0.706 ± 0.0
2.119GlnAsp: 2.119 ± 0.0
0.706GlnGlu: 0.706 ± 0.0
1.412GlnPhe: 1.412 ± 0.0
1.412GlnGly: 1.412 ± 0.0
0.353GlnHis: 0.353 ± 0.0
1.766GlnIle: 1.766 ± 0.0
0.706GlnLys: 0.706 ± 0.0
2.825GlnLeu: 2.825 ± 0.0
1.766GlnMet: 1.766 ± 0.0
1.412GlnAsn: 1.412 ± 0.0
2.119GlnPro: 2.119 ± 0.0
0.353GlnGln: 0.353 ± 0.0
1.412GlnArg: 1.412 ± 0.0
3.178GlnSer: 3.178 ± 0.0
1.766GlnThr: 1.766 ± 0.0
1.059GlnVal: 1.059 ± 0.0
0.706GlnTrp: 0.706 ± 0.0
1.766GlnTyr: 1.766 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.178ArgAla: 3.178 ± 0.0
1.766ArgCys: 1.766 ± 0.0
2.119ArgAsp: 2.119 ± 0.0
3.531ArgGlu: 3.531 ± 0.0
2.472ArgPhe: 2.472 ± 0.0
2.472ArgGly: 2.472 ± 0.0
1.766ArgHis: 1.766 ± 0.0
2.119ArgIle: 2.119 ± 0.0
1.766ArgLys: 1.766 ± 0.0
4.237ArgLeu: 4.237 ± 0.0
1.766ArgMet: 1.766 ± 0.0
1.412ArgAsn: 1.412 ± 0.0
3.531ArgPro: 3.531 ± 0.0
2.825ArgGln: 2.825 ± 0.0
3.884ArgArg: 3.884 ± 0.0
3.531ArgSer: 3.531 ± 0.0
2.472ArgThr: 2.472 ± 0.0
3.884ArgVal: 3.884 ± 0.0
0.353ArgTrp: 0.353 ± 0.0
4.237ArgTyr: 4.237 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.003SerAla: 6.003 ± 0.0
2.119SerCys: 2.119 ± 0.0
3.531SerAsp: 3.531 ± 0.0
3.178SerGlu: 3.178 ± 0.0
2.472SerPhe: 2.472 ± 0.0
4.59SerGly: 4.59 ± 0.0
0.706SerHis: 0.706 ± 0.0
5.65SerIle: 5.65 ± 0.0
3.531SerLys: 3.531 ± 0.0
4.944SerLeu: 4.944 ± 0.0
1.766SerMet: 1.766 ± 0.0
2.472SerAsn: 2.472 ± 0.0
3.531SerPro: 3.531 ± 0.0
1.766SerGln: 1.766 ± 0.0
2.825SerArg: 2.825 ± 0.0
5.297SerSer: 5.297 ± 0.0
8.121SerThr: 8.121 ± 0.0
5.65SerVal: 5.65 ± 0.0
1.766SerTrp: 1.766 ± 0.0
2.472SerTyr: 2.472 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
6.356ThrAla: 6.356 ± 0.0
2.119ThrCys: 2.119 ± 0.0
3.884ThrAsp: 3.884 ± 0.0
3.178ThrGlu: 3.178 ± 0.0
3.884ThrPhe: 3.884 ± 0.0
3.884ThrGly: 3.884 ± 0.0
1.059ThrHis: 1.059 ± 0.0
4.944ThrIle: 4.944 ± 0.0
1.766ThrLys: 1.766 ± 0.0
7.415ThrLeu: 7.415 ± 0.0
1.412ThrMet: 1.412 ± 0.0
4.237ThrAsn: 4.237 ± 0.0
3.178ThrPro: 3.178 ± 0.0
2.472ThrGln: 2.472 ± 0.0
3.884ThrArg: 3.884 ± 0.0
5.297ThrSer: 5.297 ± 0.0
6.003ThrThr: 6.003 ± 0.0
5.297ThrVal: 5.297 ± 0.0
0.0ThrTrp: 0.0 ± 0.0
1.412ThrTyr: 1.412 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
8.475ValAla: 8.475 ± 0.0
1.059ValCys: 1.059 ± 0.0
2.472ValAsp: 2.472 ± 0.0
4.237ValGlu: 4.237 ± 0.0
1.059ValPhe: 1.059 ± 0.0
6.356ValGly: 6.356 ± 0.0
2.472ValHis: 2.472 ± 0.0
2.825ValIle: 2.825 ± 0.0
2.472ValLys: 2.472 ± 0.0
4.237ValLeu: 4.237 ± 0.0
2.472ValMet: 2.472 ± 0.0
3.884ValAsn: 3.884 ± 0.0
7.415ValPro: 7.415 ± 0.0
2.825ValGln: 2.825 ± 0.0
5.297ValArg: 5.297 ± 0.0
6.356ValSer: 6.356 ± 0.0
3.531ValThr: 3.531 ± 0.0
7.768ValVal: 7.768 ± 0.0
1.059ValTrp: 1.059 ± 0.0
3.884ValTyr: 3.884 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.353TrpAla: 0.353 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.706TrpAsp: 0.706 ± 0.0
1.059TrpGlu: 1.059 ± 0.0
0.353TrpPhe: 0.353 ± 0.0
0.353TrpGly: 0.353 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.353TrpIle: 0.353 ± 0.0
1.412TrpLys: 1.412 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.353TrpMet: 0.353 ± 0.0
1.412TrpAsn: 1.412 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.412TrpArg: 1.412 ± 0.0
1.412TrpSer: 1.412 ± 0.0
1.412TrpThr: 1.412 ± 0.0
1.412TrpVal: 1.412 ± 0.0
0.706TrpTrp: 0.706 ± 0.0
0.706TrpTyr: 0.706 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.825TyrAla: 2.825 ± 0.0
0.706TyrCys: 0.706 ± 0.0
2.825TyrAsp: 2.825 ± 0.0
3.178TyrGlu: 3.178 ± 0.0
1.412TyrPhe: 1.412 ± 0.0
3.531TyrGly: 3.531 ± 0.0
0.706TyrHis: 0.706 ± 0.0
3.884TyrIle: 3.884 ± 0.0
2.119TyrLys: 2.119 ± 0.0
2.825TyrLeu: 2.825 ± 0.0
1.766TyrMet: 1.766 ± 0.0
2.825TyrAsn: 2.825 ± 0.0
2.472TyrPro: 2.472 ± 0.0
1.412TyrGln: 1.412 ± 0.0
1.766TyrArg: 1.766 ± 0.0
2.472TyrSer: 2.472 ± 0.0
4.944TyrThr: 4.944 ± 0.0
3.178TyrVal: 3.178 ± 0.0
0.353TyrTrp: 0.353 ± 0.0
2.119TyrTyr: 2.119 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2833 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski