Amino acid dipeptide frequency for Wenzhou gastropodes virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
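As a rough illustration of how such a table might be computed, the sketch below counts overlapping dipeptides in a single protein sequence and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter alphabet, and the choice to map every non-standard residue to X (Xaa) are assumptions made for this example, not the Proteome-pI implementation.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns (Ala, Cys, Asp, ...).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequence):
    """Return the frequency of all 441 dipeptides in per mille.

    Assumption: dipeptides are counted with an overlapping window of
    width 2, so a sequence of n residues contains n - 1 dipeptides.
    """
    # Map anything outside the 20 standard residues to X (Xaa).
    seq = "".join(aa if aa in AMINO_ACIDS else "X" for aa in sequence.upper())
    total = len(seq) - 1
    if total < 1:
        raise ValueError("need at least two residues")
    counts = Counter(seq[i:i + 2] for i in range(total))
    alphabet = AMINO_ACIDS + "X"
    # Include dipeptides that never occur, so the result always has 441 keys.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}
```

Under this counting scheme a protein of 2926 residues contains 2925 overlapping dipeptides, so a single occurrence corresponds to 1000/2925 ≈ 0.342‰, which is consistent with the smallest non-zero value in the table.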

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
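The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names and the use of the uniform 1/400 baseline as the threshold are assumptions for illustration; the page does not state exactly how the red and blue marking is decided.

```python
# Uniform expectation for one of 400 dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_EXPECTED_PER_MILLE):
    """Label a per-mille frequency relative to the assumed baseline."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"
```

With this baseline, GluGlu (8.205‰) would be labelled "over" and AlaCys (0.342‰) "under".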

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. The estimated error of every value is ± 0.0.

      Ala   Cys   Asp   Glu   Phe   Gly   His   Ile   Lys   Leu   Met   Asn   Pro   Gln   Arg   Ser   Thr   Val   Trp   Tyr   Xaa
Ala 2.393 0.342 1.709 1.709 2.735 4.103 1.026 4.103 3.077 2.393 0.342 1.709 2.051 1.026 1.368 2.393 3.077 3.419 0.342 1.709   0.0
Cys 0.684 0.342 1.026 1.368 0.684 1.026   0.0 1.709 1.026 0.684 0.684 1.368 1.026 1.026 0.684 0.342 0.684 1.026   0.0 0.684   0.0
Asp 2.393 0.684 2.393 6.154 5.128 2.735 1.709 1.709 4.444 5.128 2.051 4.103 3.077 3.761 3.419 3.761 2.393 3.761 1.709 2.393   0.0
Glu 1.026 1.026 3.077 8.205 2.735 3.419 0.342 6.838 7.863 4.444 4.103 3.761 2.051 4.786 1.368 3.077 3.419 1.709 0.684 2.735   0.0
Phe 1.368 1.368 4.444 3.761 2.051 4.786 1.026 3.419 4.786 3.077 1.709 2.735 2.051 2.735 1.368 3.077 2.051 4.444 0.342 1.368   0.0
Gly 2.735 1.709 3.761 2.393 4.444 2.735 0.342 4.103 6.496 4.103 2.735 3.419   0.0 2.393 2.051 3.419 3.077 5.128 0.342 1.368   0.0
His 0.684   0.0 0.684 1.368 2.051 0.684 0.342 1.709 2.051 1.026   0.0   0.0 1.026 1.026 2.393 1.368 1.026 1.026 0.684 0.684   0.0
Ile 3.419 1.368 6.838 6.496 2.393 2.735 1.026 6.154 4.786 7.179 1.709 3.761 3.077 3.077 2.735 2.051 1.709 5.812 1.026 2.735   0.0
Lys 4.444 0.342 7.863 6.154 4.103 1.709 3.077 5.128 6.838 7.863 0.684 4.786 2.735 4.786 5.812 5.128 4.103 5.812 0.342 3.077   0.0
Leu 4.103 1.368 7.521 4.444 3.077 4.444 1.026  5.47 7.863  5.47 1.026 8.205 1.709 3.077 4.103 6.496 3.419 4.786 0.684 3.419   0.0
Met 1.026 0.342 1.026 0.684 1.709 1.368 1.026 1.709 2.735 2.735 1.026 1.368 0.684 1.026 1.026 1.709 1.368 2.735   0.0 1.026   0.0
Asn 1.368 2.051 1.709 4.444 0.684 4.103 1.709 4.103 6.496 5.128 1.709 4.786 2.051 3.077 3.761  5.47 3.761 3.077 1.026 0.684   0.0
Pro 1.026 1.026 1.709 2.051 3.761 2.051 0.342 3.761 3.419 3.077 1.026 1.368 1.368 1.026 1.368 3.419 1.026 2.393 1.026 1.368   0.0
Gln 2.051 0.342 2.051 1.709 2.051 2.735 0.684 4.103 3.761 3.419 1.709 2.393 1.709 2.051 1.026 4.103 3.419 2.393 0.342 1.709   0.0
Arg 2.051 1.368 2.393 3.761 2.735 1.368 1.368 2.735 3.419 2.393 1.026 3.761 3.077 1.026 2.051 2.051 2.735 1.026 1.368 1.026   0.0
Ser 2.051 1.368 3.761 2.735 3.761  5.47 0.684 3.761 5.812 6.838 2.051 3.077 1.709 4.786 1.709 7.521 6.838 4.444 0.684 1.709   0.0
Thr 1.709 0.342 1.709 2.393 3.419 1.709 1.026 3.761 3.761 4.786 0.684 3.419 3.761 0.342 1.026 6.496 3.419 7.521 0.684 1.709   0.0
Val 5.128 0.342 7.521 4.103 2.051 5.812 1.026 3.761 3.761 7.521 1.026 2.051 3.761 1.368 2.051 4.786 3.077 2.051 0.684 3.077   0.0
Trp 0.684   0.0 0.684 0.342 0.342 0.342 0.342 0.684 0.342 1.026 0.342 1.709 0.342   0.0 0.342 1.709 1.368 1.026   0.0 0.684   0.0
Tyr 0.684 0.342 1.026 2.393 2.051 3.419 1.709 1.368 2.393 3.419 0.342 3.419 0.342 1.368 3.077 2.393 1.709 1.368 0.342 1.709   0.0
Xaa   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0   0.0

Statistics based on 1 protein (2926 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
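A minimal sketch of a protein-level bootstrap is given below, assuming that whole proteins are resampled with replacement and that the error is the standard deviation of the resulting per-mille estimates. The function names, the fixed seed, and the use of the population standard deviation are assumptions for illustration and are not taken from the Proteome-pI code.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(protein):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(protein[i:i + 2] for i in range(len(protein) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of one dipeptide's per-mille frequency over resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for protein in sample:
            counts.update(dipeptide_counts(protein))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    return statistics.pstdev(estimates)
```

With a proteome of a single protein, every resample contains that same protein, so all estimates are identical and the spread is zero. This would explain why every value in the table carries an error of ± 0.0.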

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski