Amino acid dipepetide frequency for Wuhan fly virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.493AlaAla: 5.493 ± 0.0
0.687AlaCys: 0.687 ± 0.0
2.403AlaAsp: 2.403 ± 0.0
2.06AlaGlu: 2.06 ± 0.0
3.433AlaPhe: 3.433 ± 0.0
4.463AlaGly: 4.463 ± 0.0
1.716AlaHis: 1.716 ± 0.0
4.806AlaIle: 4.806 ± 0.0
4.119AlaLys: 4.119 ± 0.0
5.836AlaLeu: 5.836 ± 0.0
2.403AlaMet: 2.403 ± 0.0
2.06AlaAsn: 2.06 ± 0.0
3.09AlaPro: 3.09 ± 0.0
2.403AlaGln: 2.403 ± 0.0
1.716AlaArg: 1.716 ± 0.0
4.806AlaSer: 4.806 ± 0.0
2.746AlaThr: 2.746 ± 0.0
2.403AlaVal: 2.403 ± 0.0
0.343AlaTrp: 0.343 ± 0.0
2.403AlaTyr: 2.403 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.716CysAla: 1.716 ± 0.0
0.687CysCys: 0.687 ± 0.0
1.373CysAsp: 1.373 ± 0.0
1.03CysGlu: 1.03 ± 0.0
0.0CysPhe: 0.0 ± 0.0
2.06CysGly: 2.06 ± 0.0
0.343CysHis: 0.343 ± 0.0
2.403CysIle: 2.403 ± 0.0
0.343CysLys: 0.343 ± 0.0
1.373CysLeu: 1.373 ± 0.0
1.03CysMet: 1.03 ± 0.0
0.343CysAsn: 0.343 ± 0.0
1.03CysPro: 1.03 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.716CysArg: 1.716 ± 0.0
0.687CysSer: 0.687 ± 0.0
1.03CysThr: 1.03 ± 0.0
1.03CysVal: 1.03 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.716CysTyr: 1.716 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.493AspAla: 5.493 ± 0.0
0.343AspCys: 0.343 ± 0.0
4.119AspAsp: 4.119 ± 0.0
4.463AspGlu: 4.463 ± 0.0
5.149AspPhe: 5.149 ± 0.0
1.716AspGly: 1.716 ± 0.0
1.03AspHis: 1.03 ± 0.0
3.09AspIle: 3.09 ± 0.0
3.776AspLys: 3.776 ± 0.0
6.179AspLeu: 6.179 ± 0.0
1.716AspMet: 1.716 ± 0.0
2.403AspAsn: 2.403 ± 0.0
2.06AspPro: 2.06 ± 0.0
0.687AspGln: 0.687 ± 0.0
2.403AspArg: 2.403 ± 0.0
2.06AspSer: 2.06 ± 0.0
3.433AspThr: 3.433 ± 0.0
7.209AspVal: 7.209 ± 0.0
0.0AspTrp: 0.0 ± 0.0
1.373AspTyr: 1.373 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.09GluAla: 3.09 ± 0.0
0.343GluCys: 0.343 ± 0.0
3.09GluAsp: 3.09 ± 0.0
3.776GluGlu: 3.776 ± 0.0
3.09GluPhe: 3.09 ± 0.0
3.776GluGly: 3.776 ± 0.0
1.03GluHis: 1.03 ± 0.0
5.149GluIle: 5.149 ± 0.0
2.06GluLys: 2.06 ± 0.0
1.716GluLeu: 1.716 ± 0.0
3.09GluMet: 3.09 ± 0.0
3.433GluAsn: 3.433 ± 0.0
3.09GluPro: 3.09 ± 0.0
3.433GluGln: 3.433 ± 0.0
2.746GluArg: 2.746 ± 0.0
3.776GluSer: 3.776 ± 0.0
4.463GluThr: 4.463 ± 0.0
4.463GluVal: 4.463 ± 0.0
1.03GluTrp: 1.03 ± 0.0
2.403GluTyr: 2.403 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.09PheAla: 3.09 ± 0.0
0.687PheCys: 0.687 ± 0.0
3.433PheAsp: 3.433 ± 0.0
3.433PheGlu: 3.433 ± 0.0
2.06PhePhe: 2.06 ± 0.0
3.09PheGly: 3.09 ± 0.0
0.687PheHis: 0.687 ± 0.0
1.716PheIle: 1.716 ± 0.0
1.373PheLys: 1.373 ± 0.0
3.433PheLeu: 3.433 ± 0.0
1.03PheMet: 1.03 ± 0.0
1.716PheAsn: 1.716 ± 0.0
1.716PhePro: 1.716 ± 0.0
2.06PheGln: 2.06 ± 0.0
1.373PheArg: 1.373 ± 0.0
5.836PheSer: 5.836 ± 0.0
4.806PheThr: 4.806 ± 0.0
5.149PheVal: 5.149 ± 0.0
0.343PheTrp: 0.343 ± 0.0
1.716PheTyr: 1.716 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.09GlyAla: 3.09 ± 0.0
1.716GlyCys: 1.716 ± 0.0
3.776GlyAsp: 3.776 ± 0.0
3.09GlyGlu: 3.09 ± 0.0
1.03GlyPhe: 1.03 ± 0.0
1.716GlyGly: 1.716 ± 0.0
0.687GlyHis: 0.687 ± 0.0
3.09GlyIle: 3.09 ± 0.0
4.806GlyLys: 4.806 ± 0.0
1.716GlyLeu: 1.716 ± 0.0
1.03GlyMet: 1.03 ± 0.0
2.06GlyAsn: 2.06 ± 0.0
2.06GlyPro: 2.06 ± 0.0
2.403GlyGln: 2.403 ± 0.0
1.716GlyArg: 1.716 ± 0.0
2.746GlySer: 2.746 ± 0.0
2.403GlyThr: 2.403 ± 0.0
2.746GlyVal: 2.746 ± 0.0
0.0GlyTrp: 0.0 ± 0.0
3.09GlyTyr: 3.09 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.716HisAla: 1.716 ± 0.0
0.343HisCys: 0.343 ± 0.0
0.687HisAsp: 0.687 ± 0.0
1.03HisGlu: 1.03 ± 0.0
1.03HisPhe: 1.03 ± 0.0
1.03HisGly: 1.03 ± 0.0
0.687HisHis: 0.687 ± 0.0
2.06HisIle: 2.06 ± 0.0
2.746HisLys: 2.746 ± 0.0
2.403HisLeu: 2.403 ± 0.0
0.343HisMet: 0.343 ± 0.0
0.343HisAsn: 0.343 ± 0.0
0.687HisPro: 0.687 ± 0.0
0.0HisGln: 0.0 ± 0.0
1.03HisArg: 1.03 ± 0.0
1.03HisSer: 1.03 ± 0.0
0.687HisThr: 0.687 ± 0.0
3.776HisVal: 3.776 ± 0.0
0.343HisTrp: 0.343 ± 0.0
1.373HisTyr: 1.373 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.433IleAla: 3.433 ± 0.0
2.403IleCys: 2.403 ± 0.0
4.806IleAsp: 4.806 ± 0.0
5.836IleGlu: 5.836 ± 0.0
2.746IlePhe: 2.746 ± 0.0
4.119IleGly: 4.119 ± 0.0
2.403IleHis: 2.403 ± 0.0
4.463IleIle: 4.463 ± 0.0
2.06IleLys: 2.06 ± 0.0
4.119IleLeu: 4.119 ± 0.0
2.746IleMet: 2.746 ± 0.0
2.403IleAsn: 2.403 ± 0.0
4.463IlePro: 4.463 ± 0.0
1.716IleGln: 1.716 ± 0.0
3.433IleArg: 3.433 ± 0.0
3.433IleSer: 3.433 ± 0.0
4.119IleThr: 4.119 ± 0.0
4.463IleVal: 4.463 ± 0.0
0.0IleTrp: 0.0 ± 0.0
1.716IleTyr: 1.716 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.746LysAla: 2.746 ± 0.0
2.06LysCys: 2.06 ± 0.0
5.149LysAsp: 5.149 ± 0.0
3.433LysGlu: 3.433 ± 0.0
3.776LysPhe: 3.776 ± 0.0
2.06LysGly: 2.06 ± 0.0
1.716LysHis: 1.716 ± 0.0
4.463LysIle: 4.463 ± 0.0
4.463LysLys: 4.463 ± 0.0
4.463LysLeu: 4.463 ± 0.0
2.06LysMet: 2.06 ± 0.0
3.433LysAsn: 3.433 ± 0.0
3.433LysPro: 3.433 ± 0.0
1.03LysGln: 1.03 ± 0.0
3.776LysArg: 3.776 ± 0.0
4.806LysSer: 4.806 ± 0.0
5.493LysThr: 5.493 ± 0.0
2.06LysVal: 2.06 ± 0.0
2.06LysTrp: 2.06 ± 0.0
3.776LysTyr: 3.776 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
3.09LeuAla: 3.09 ± 0.0
2.06LeuCys: 2.06 ± 0.0
5.493LeuAsp: 5.493 ± 0.0
3.776LeuGlu: 3.776 ± 0.0
2.746LeuPhe: 2.746 ± 0.0
3.776LeuGly: 3.776 ± 0.0
3.433LeuHis: 3.433 ± 0.0
3.433LeuIle: 3.433 ± 0.0
8.239LeuLys: 8.239 ± 0.0
4.806LeuLeu: 4.806 ± 0.0
2.403LeuMet: 2.403 ± 0.0
4.806LeuAsn: 4.806 ± 0.0
4.463LeuPro: 4.463 ± 0.0
2.746LeuGln: 2.746 ± 0.0
4.119LeuArg: 4.119 ± 0.0
3.776LeuSer: 3.776 ± 0.0
4.463LeuThr: 4.463 ± 0.0
5.493LeuVal: 5.493 ± 0.0
0.687LeuTrp: 0.687 ± 0.0
3.433LeuTyr: 3.433 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.746MetAla: 2.746 ± 0.0
0.0MetCys: 0.0 ± 0.0
2.746MetAsp: 2.746 ± 0.0
2.06MetGlu: 2.06 ± 0.0
0.687MetPhe: 0.687 ± 0.0
1.03MetGly: 1.03 ± 0.0
1.373MetHis: 1.373 ± 0.0
2.746MetIle: 2.746 ± 0.0
1.716MetLys: 1.716 ± 0.0
3.776MetLeu: 3.776 ± 0.0
1.716MetMet: 1.716 ± 0.0
1.373MetAsn: 1.373 ± 0.0
1.373MetPro: 1.373 ± 0.0
2.06MetGln: 2.06 ± 0.0
2.06MetArg: 2.06 ± 0.0
2.403MetSer: 2.403 ± 0.0
2.746MetThr: 2.746 ± 0.0
1.373MetVal: 1.373 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.716MetTyr: 1.716 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.119AsnAla: 4.119 ± 0.0
1.03AsnCys: 1.03 ± 0.0
1.716AsnAsp: 1.716 ± 0.0
2.403AsnGlu: 2.403 ± 0.0
2.403AsnPhe: 2.403 ± 0.0
2.06AsnGly: 2.06 ± 0.0
0.343AsnHis: 0.343 ± 0.0
3.433AsnIle: 3.433 ± 0.0
3.776AsnLys: 3.776 ± 0.0
4.119AsnLeu: 4.119 ± 0.0
3.09AsnMet: 3.09 ± 0.0
3.09AsnAsn: 3.09 ± 0.0
2.403AsnPro: 2.403 ± 0.0
1.716AsnGln: 1.716 ± 0.0
2.06AsnArg: 2.06 ± 0.0
5.149AsnSer: 5.149 ± 0.0
3.433AsnThr: 3.433 ± 0.0
4.463AsnVal: 4.463 ± 0.0
0.343AsnTrp: 0.343 ± 0.0
1.716AsnTyr: 1.716 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.716ProAla: 1.716 ± 0.0
1.03ProCys: 1.03 ± 0.0
1.373ProAsp: 1.373 ± 0.0
4.806ProGlu: 4.806 ± 0.0
2.403ProPhe: 2.403 ± 0.0
1.03ProGly: 1.03 ± 0.0
1.716ProHis: 1.716 ± 0.0
3.09ProIle: 3.09 ± 0.0
2.746ProLys: 2.746 ± 0.0
4.463ProLeu: 4.463 ± 0.0
1.373ProMet: 1.373 ± 0.0
3.776ProAsn: 3.776 ± 0.0
2.06ProPro: 2.06 ± 0.0
1.373ProGln: 1.373 ± 0.0
2.403ProArg: 2.403 ± 0.0
1.716ProSer: 1.716 ± 0.0
4.806ProThr: 4.806 ± 0.0
6.179ProVal: 6.179 ± 0.0
0.0ProTrp: 0.0 ± 0.0
2.746ProTyr: 2.746 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.06GlnAla: 2.06 ± 0.0
0.343GlnCys: 0.343 ± 0.0
2.06GlnAsp: 2.06 ± 0.0
0.0GlnGlu: 0.0 ± 0.0
2.746GlnPhe: 2.746 ± 0.0
1.716GlnGly: 1.716 ± 0.0
1.716GlnHis: 1.716 ± 0.0
1.03GlnIle: 1.03 ± 0.0
2.06GlnLys: 2.06 ± 0.0
3.776GlnLeu: 3.776 ± 0.0
2.06GlnMet: 2.06 ± 0.0
1.716GlnAsn: 1.716 ± 0.0
2.06GlnPro: 2.06 ± 0.0
2.06GlnGln: 2.06 ± 0.0
0.687GlnArg: 0.687 ± 0.0
3.433GlnSer: 3.433 ± 0.0
2.746GlnThr: 2.746 ± 0.0
1.373GlnVal: 1.373 ± 0.0
0.687GlnTrp: 0.687 ± 0.0
1.373GlnTyr: 1.373 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.06ArgAla: 2.06 ± 0.0
0.687ArgCys: 0.687 ± 0.0
3.09ArgAsp: 3.09 ± 0.0
2.746ArgGlu: 2.746 ± 0.0
3.09ArgPhe: 3.09 ± 0.0
0.687ArgGly: 0.687 ± 0.0
0.343ArgHis: 0.343 ± 0.0
3.433ArgIle: 3.433 ± 0.0
2.403ArgLys: 2.403 ± 0.0
3.433ArgLeu: 3.433 ± 0.0
2.746ArgMet: 2.746 ± 0.0
2.403ArgAsn: 2.403 ± 0.0
3.433ArgPro: 3.433 ± 0.0
2.746ArgGln: 2.746 ± 0.0
3.09ArgArg: 3.09 ± 0.0
2.403ArgSer: 2.403 ± 0.0
1.03ArgThr: 1.03 ± 0.0
2.746ArgVal: 2.746 ± 0.0
0.687ArgTrp: 0.687 ± 0.0
2.403ArgTyr: 2.403 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.433SerAla: 3.433 ± 0.0
1.03SerCys: 1.03 ± 0.0
3.433SerAsp: 3.433 ± 0.0
2.403SerGlu: 2.403 ± 0.0
3.776SerPhe: 3.776 ± 0.0
4.119SerGly: 4.119 ± 0.0
1.716SerHis: 1.716 ± 0.0
3.433SerIle: 3.433 ± 0.0
6.179SerLys: 6.179 ± 0.0
4.119SerLeu: 4.119 ± 0.0
1.716SerMet: 1.716 ± 0.0
3.776SerAsn: 3.776 ± 0.0
4.119SerPro: 4.119 ± 0.0
3.09SerGln: 3.09 ± 0.0
2.746SerArg: 2.746 ± 0.0
2.746SerSer: 2.746 ± 0.0
4.806SerThr: 4.806 ± 0.0
4.119SerVal: 4.119 ± 0.0
0.687SerTrp: 0.687 ± 0.0
3.09SerTyr: 3.09 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.493ThrAla: 5.493 ± 0.0
0.687ThrCys: 0.687 ± 0.0
2.403ThrAsp: 2.403 ± 0.0
4.463ThrGlu: 4.463 ± 0.0
0.687ThrPhe: 0.687 ± 0.0
2.746ThrGly: 2.746 ± 0.0
0.343ThrHis: 0.343 ± 0.0
5.836ThrIle: 5.836 ± 0.0
4.119ThrLys: 4.119 ± 0.0
5.493ThrLeu: 5.493 ± 0.0
1.373ThrMet: 1.373 ± 0.0
2.746ThrAsn: 2.746 ± 0.0
3.776ThrPro: 3.776 ± 0.0
2.403ThrGln: 2.403 ± 0.0
2.746ThrArg: 2.746 ± 0.0
5.493ThrSer: 5.493 ± 0.0
5.149ThrThr: 5.149 ± 0.0
5.149ThrVal: 5.149 ± 0.0
1.373ThrTrp: 1.373 ± 0.0
2.403ThrTyr: 2.403 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
2.746ValAla: 2.746 ± 0.0
2.403ValCys: 2.403 ± 0.0
4.463ValAsp: 4.463 ± 0.0
6.179ValGlu: 6.179 ± 0.0
4.806ValPhe: 4.806 ± 0.0
2.403ValGly: 2.403 ± 0.0
1.373ValHis: 1.373 ± 0.0
3.776ValIle: 3.776 ± 0.0
5.836ValLys: 5.836 ± 0.0
6.522ValLeu: 6.522 ± 0.0
1.373ValMet: 1.373 ± 0.0
5.493ValAsn: 5.493 ± 0.0
4.806ValPro: 4.806 ± 0.0
1.716ValGln: 1.716 ± 0.0
2.403ValArg: 2.403 ± 0.0
5.493ValSer: 5.493 ± 0.0
3.776ValThr: 3.776 ± 0.0
4.463ValVal: 4.463 ± 0.0
0.687ValTrp: 0.687 ± 0.0
2.746ValTyr: 2.746 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.343TrpAsp: 0.343 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.03TrpPhe: 1.03 ± 0.0
0.343TrpGly: 0.343 ± 0.0
0.343TrpHis: 0.343 ± 0.0
1.03TrpIle: 1.03 ± 0.0
0.343TrpLys: 0.343 ± 0.0
1.716TrpLeu: 1.716 ± 0.0
0.687TrpMet: 0.687 ± 0.0
0.343TrpAsn: 0.343 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.343TrpGln: 0.343 ± 0.0
0.343TrpArg: 0.343 ± 0.0
0.687TrpSer: 0.687 ± 0.0
0.687TrpThr: 0.687 ± 0.0
0.687TrpVal: 0.687 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.03TrpTyr: 1.03 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.06TyrAla: 2.06 ± 0.0
1.373TyrCys: 1.373 ± 0.0
3.09TyrAsp: 3.09 ± 0.0
2.06TyrGlu: 2.06 ± 0.0
2.06TyrPhe: 2.06 ± 0.0
1.03TyrGly: 1.03 ± 0.0
0.343TyrHis: 0.343 ± 0.0
2.403TyrIle: 2.403 ± 0.0
3.09TyrLys: 3.09 ± 0.0
3.776TyrLeu: 3.776 ± 0.0
1.373TyrMet: 1.373 ± 0.0
5.149TyrAsn: 5.149 ± 0.0
0.687TyrPro: 0.687 ± 0.0
1.716TyrGln: 1.716 ± 0.0
3.09TyrArg: 3.09 ± 0.0
2.06TyrSer: 2.06 ± 0.0
2.06TyrThr: 2.06 ± 0.0
4.119TyrVal: 4.119 ± 0.0
0.687TyrTrp: 0.687 ± 0.0
1.716TyrTyr: 1.716 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2914 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski