Amino acid dipepetide frequency for Wuhan spider virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.237AlaAla: 5.237 ± 0.0
1.397AlaCys: 1.397 ± 0.0
2.444AlaAsp: 2.444 ± 0.0
2.444AlaGlu: 2.444 ± 0.0
1.746AlaPhe: 1.746 ± 0.0
5.237AlaGly: 5.237 ± 0.0
1.746AlaHis: 1.746 ± 0.0
4.19AlaIle: 4.19 ± 0.0
3.142AlaLys: 3.142 ± 0.0
5.587AlaLeu: 5.587 ± 0.0
2.793AlaMet: 2.793 ± 0.0
3.492AlaAsn: 3.492 ± 0.0
5.237AlaPro: 5.237 ± 0.0
3.492AlaGln: 3.492 ± 0.0
5.587AlaArg: 5.587 ± 0.0
3.492AlaSer: 3.492 ± 0.0
3.492AlaThr: 3.492 ± 0.0
2.793AlaVal: 2.793 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
3.142AlaTyr: 3.142 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.397CysAla: 1.397 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.698CysAsp: 0.698 ± 0.0
0.698CysGlu: 0.698 ± 0.0
0.349CysPhe: 0.349 ± 0.0
0.698CysGly: 0.698 ± 0.0
0.698CysHis: 0.698 ± 0.0
1.746CysIle: 1.746 ± 0.0
0.349CysLys: 0.349 ± 0.0
1.397CysLeu: 1.397 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.349CysAsn: 0.349 ± 0.0
0.698CysPro: 0.698 ± 0.0
0.349CysGln: 0.349 ± 0.0
1.397CysArg: 1.397 ± 0.0
1.397CysSer: 1.397 ± 0.0
0.349CysThr: 0.349 ± 0.0
1.047CysVal: 1.047 ± 0.0
0.349CysTrp: 0.349 ± 0.0
0.698CysTyr: 0.698 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.444AspAla: 2.444 ± 0.0
1.746AspCys: 1.746 ± 0.0
4.888AspAsp: 4.888 ± 0.0
5.587AspGlu: 5.587 ± 0.0
4.539AspPhe: 4.539 ± 0.0
3.492AspGly: 3.492 ± 0.0
1.047AspHis: 1.047 ± 0.0
5.587AspIle: 5.587 ± 0.0
3.841AspLys: 3.841 ± 0.0
4.19AspLeu: 4.19 ± 0.0
1.047AspMet: 1.047 ± 0.0
1.047AspAsn: 1.047 ± 0.0
1.397AspPro: 1.397 ± 0.0
2.095AspGln: 2.095 ± 0.0
1.397AspArg: 1.397 ± 0.0
2.444AspSer: 2.444 ± 0.0
3.841AspThr: 3.841 ± 0.0
5.587AspVal: 5.587 ± 0.0
0.698AspTrp: 0.698 ± 0.0
1.047AspTyr: 1.047 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.142GluAla: 3.142 ± 0.0
0.349GluCys: 0.349 ± 0.0
3.841GluAsp: 3.841 ± 0.0
7.332GluGlu: 7.332 ± 0.0
2.444GluPhe: 2.444 ± 0.0
3.492GluGly: 3.492 ± 0.0
0.349GluHis: 0.349 ± 0.0
2.793GluIle: 2.793 ± 0.0
5.237GluLys: 5.237 ± 0.0
5.237GluLeu: 5.237 ± 0.0
2.095GluMet: 2.095 ± 0.0
1.397GluAsn: 1.397 ± 0.0
1.047GluPro: 1.047 ± 0.0
3.492GluGln: 3.492 ± 0.0
1.746GluArg: 1.746 ± 0.0
2.444GluSer: 2.444 ± 0.0
2.095GluThr: 2.095 ± 0.0
4.539GluVal: 4.539 ± 0.0
2.095GluTrp: 2.095 ± 0.0
2.793GluTyr: 2.793 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.19PheAla: 4.19 ± 0.0
1.047PheCys: 1.047 ± 0.0
2.444PheAsp: 2.444 ± 0.0
3.142PheGlu: 3.142 ± 0.0
2.793PhePhe: 2.793 ± 0.0
3.492PheGly: 3.492 ± 0.0
1.047PheHis: 1.047 ± 0.0
2.444PheIle: 2.444 ± 0.0
2.793PheLys: 2.793 ± 0.0
3.142PheLeu: 3.142 ± 0.0
2.095PheMet: 2.095 ± 0.0
2.444PheAsn: 2.444 ± 0.0
1.397PhePro: 1.397 ± 0.0
1.397PheGln: 1.397 ± 0.0
2.095PheArg: 2.095 ± 0.0
3.492PheSer: 3.492 ± 0.0
4.19PheThr: 4.19 ± 0.0
2.444PheVal: 2.444 ± 0.0
0.0PheTrp: 0.0 ± 0.0
2.444PheTyr: 2.444 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.841GlyAla: 3.841 ± 0.0
0.698GlyCys: 0.698 ± 0.0
4.539GlyAsp: 4.539 ± 0.0
1.746GlyGlu: 1.746 ± 0.0
3.142GlyPhe: 3.142 ± 0.0
2.095GlyGly: 2.095 ± 0.0
1.047GlyHis: 1.047 ± 0.0
3.142GlyIle: 3.142 ± 0.0
5.936GlyLys: 5.936 ± 0.0
6.983GlyLeu: 6.983 ± 0.0
1.746GlyMet: 1.746 ± 0.0
3.142GlyAsn: 3.142 ± 0.0
0.698GlyPro: 0.698 ± 0.0
1.746GlyGln: 1.746 ± 0.0
3.492GlyArg: 3.492 ± 0.0
3.841GlySer: 3.841 ± 0.0
4.888GlyThr: 4.888 ± 0.0
3.492GlyVal: 3.492 ± 0.0
2.095GlyTrp: 2.095 ± 0.0
0.698GlyTyr: 0.698 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.746HisAla: 1.746 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.047HisAsp: 1.047 ± 0.0
1.397HisGlu: 1.397 ± 0.0
2.793HisPhe: 2.793 ± 0.0
5.237HisGly: 5.237 ± 0.0
0.349HisHis: 0.349 ± 0.0
1.746HisIle: 1.746 ± 0.0
0.0HisLys: 0.0 ± 0.0
1.397HisLeu: 1.397 ± 0.0
0.349HisMet: 0.349 ± 0.0
1.397HisAsn: 1.397 ± 0.0
1.397HisPro: 1.397 ± 0.0
1.047HisGln: 1.047 ± 0.0
1.047HisArg: 1.047 ± 0.0
0.349HisSer: 0.349 ± 0.0
1.746HisThr: 1.746 ± 0.0
1.047HisVal: 1.047 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.698HisTyr: 0.698 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.19IleAla: 4.19 ± 0.0
0.698IleCys: 0.698 ± 0.0
3.142IleAsp: 3.142 ± 0.0
5.237IleGlu: 5.237 ± 0.0
2.444IlePhe: 2.444 ± 0.0
3.841IleGly: 3.841 ± 0.0
2.095IleHis: 2.095 ± 0.0
3.492IleIle: 3.492 ± 0.0
3.841IleLys: 3.841 ± 0.0
6.983IleLeu: 6.983 ± 0.0
1.047IleMet: 1.047 ± 0.0
3.142IleAsn: 3.142 ± 0.0
2.095IlePro: 2.095 ± 0.0
2.444IleGln: 2.444 ± 0.0
5.587IleArg: 5.587 ± 0.0
4.888IleSer: 4.888 ± 0.0
3.841IleThr: 3.841 ± 0.0
3.841IleVal: 3.841 ± 0.0
0.698IleTrp: 0.698 ± 0.0
3.142IleTyr: 3.142 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
5.936LysAla: 5.936 ± 0.0
1.047LysCys: 1.047 ± 0.0
3.841LysAsp: 3.841 ± 0.0
2.793LysGlu: 2.793 ± 0.0
2.444LysPhe: 2.444 ± 0.0
2.793LysGly: 2.793 ± 0.0
1.746LysHis: 1.746 ± 0.0
6.285LysIle: 6.285 ± 0.0
1.746LysLys: 1.746 ± 0.0
4.19LysLeu: 4.19 ± 0.0
0.698LysMet: 0.698 ± 0.0
2.095LysAsn: 2.095 ± 0.0
3.142LysPro: 3.142 ± 0.0
0.698LysGln: 0.698 ± 0.0
1.397LysArg: 1.397 ± 0.0
4.19LysSer: 4.19 ± 0.0
4.888LysThr: 4.888 ± 0.0
5.936LysVal: 5.936 ± 0.0
0.349LysTrp: 0.349 ± 0.0
2.095LysTyr: 2.095 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.285LeuAla: 6.285 ± 0.0
1.397LeuCys: 1.397 ± 0.0
3.492LeuAsp: 3.492 ± 0.0
3.492LeuGlu: 3.492 ± 0.0
4.539LeuPhe: 4.539 ± 0.0
3.841LeuGly: 3.841 ± 0.0
3.492LeuHis: 3.492 ± 0.0
5.936LeuIle: 5.936 ± 0.0
5.237LeuLys: 5.237 ± 0.0
6.634LeuLeu: 6.634 ± 0.0
1.397LeuMet: 1.397 ± 0.0
7.332LeuAsn: 7.332 ± 0.0
5.237LeuPro: 5.237 ± 0.0
3.142LeuGln: 3.142 ± 0.0
6.634LeuArg: 6.634 ± 0.0
5.237LeuSer: 5.237 ± 0.0
6.634LeuThr: 6.634 ± 0.0
5.237LeuVal: 5.237 ± 0.0
0.698LeuTrp: 0.698 ± 0.0
2.444LeuTyr: 2.444 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
3.841MetAla: 3.841 ± 0.0
0.349MetCys: 0.349 ± 0.0
1.047MetAsp: 1.047 ± 0.0
1.397MetGlu: 1.397 ± 0.0
0.698MetPhe: 0.698 ± 0.0
1.397MetGly: 1.397 ± 0.0
0.698MetHis: 0.698 ± 0.0
0.698MetIle: 0.698 ± 0.0
0.349MetLys: 0.349 ± 0.0
3.142MetLeu: 3.142 ± 0.0
0.349MetMet: 0.349 ± 0.0
1.746MetAsn: 1.746 ± 0.0
1.047MetPro: 1.047 ± 0.0
1.746MetGln: 1.746 ± 0.0
1.746MetArg: 1.746 ± 0.0
1.746MetSer: 1.746 ± 0.0
0.698MetThr: 0.698 ± 0.0
1.746MetVal: 1.746 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.698MetTyr: 0.698 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.142AsnAla: 3.142 ± 0.0
0.349AsnCys: 0.349 ± 0.0
3.492AsnAsp: 3.492 ± 0.0
1.047AsnGlu: 1.047 ± 0.0
1.746AsnPhe: 1.746 ± 0.0
2.095AsnGly: 2.095 ± 0.0
0.0AsnHis: 0.0 ± 0.0
2.095AsnIle: 2.095 ± 0.0
3.492AsnLys: 3.492 ± 0.0
4.539AsnLeu: 4.539 ± 0.0
2.095AsnMet: 2.095 ± 0.0
1.397AsnAsn: 1.397 ± 0.0
2.444AsnPro: 2.444 ± 0.0
1.047AsnGln: 1.047 ± 0.0
2.444AsnArg: 2.444 ± 0.0
4.888AsnSer: 4.888 ± 0.0
1.397AsnThr: 1.397 ± 0.0
2.793AsnVal: 2.793 ± 0.0
1.047AsnTrp: 1.047 ± 0.0
1.746AsnTyr: 1.746 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.142ProAla: 3.142 ± 0.0
0.0ProCys: 0.0 ± 0.0
2.095ProAsp: 2.095 ± 0.0
2.444ProGlu: 2.444 ± 0.0
2.444ProPhe: 2.444 ± 0.0
2.095ProGly: 2.095 ± 0.0
0.698ProHis: 0.698 ± 0.0
5.936ProIle: 5.936 ± 0.0
1.746ProLys: 1.746 ± 0.0
2.793ProLeu: 2.793 ± 0.0
0.698ProMet: 0.698 ± 0.0
1.397ProAsn: 1.397 ± 0.0
1.047ProPro: 1.047 ± 0.0
2.444ProGln: 2.444 ± 0.0
1.746ProArg: 1.746 ± 0.0
3.841ProSer: 3.841 ± 0.0
2.095ProThr: 2.095 ± 0.0
6.285ProVal: 6.285 ± 0.0
1.047ProTrp: 1.047 ± 0.0
2.444ProTyr: 2.444 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.492GlnAla: 3.492 ± 0.0
0.698GlnCys: 0.698 ± 0.0
1.047GlnAsp: 1.047 ± 0.0
2.444GlnGlu: 2.444 ± 0.0
1.746GlnPhe: 1.746 ± 0.0
2.444GlnGly: 2.444 ± 0.0
0.349GlnHis: 0.349 ± 0.0
1.746GlnIle: 1.746 ± 0.0
3.841GlnLys: 3.841 ± 0.0
3.841GlnLeu: 3.841 ± 0.0
2.444GlnMet: 2.444 ± 0.0
1.746GlnAsn: 1.746 ± 0.0
2.793GlnPro: 2.793 ± 0.0
0.698GlnGln: 0.698 ± 0.0
1.746GlnArg: 1.746 ± 0.0
2.095GlnSer: 2.095 ± 0.0
2.095GlnThr: 2.095 ± 0.0
2.444GlnVal: 2.444 ± 0.0
0.698GlnTrp: 0.698 ± 0.0
1.397GlnTyr: 1.397 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.444ArgAla: 2.444 ± 0.0
1.746ArgCys: 1.746 ± 0.0
4.539ArgAsp: 4.539 ± 0.0
3.841ArgGlu: 3.841 ± 0.0
2.793ArgPhe: 2.793 ± 0.0
3.142ArgGly: 3.142 ± 0.0
2.444ArgHis: 2.444 ± 0.0
3.492ArgIle: 3.492 ± 0.0
2.793ArgLys: 2.793 ± 0.0
3.492ArgLeu: 3.492 ± 0.0
0.698ArgMet: 0.698 ± 0.0
3.142ArgAsn: 3.142 ± 0.0
1.746ArgPro: 1.746 ± 0.0
1.397ArgGln: 1.397 ± 0.0
3.841ArgArg: 3.841 ± 0.0
4.19ArgSer: 4.19 ± 0.0
1.397ArgThr: 1.397 ± 0.0
2.444ArgVal: 2.444 ± 0.0
1.746ArgTrp: 1.746 ± 0.0
3.142ArgTyr: 3.142 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.492SerAla: 3.492 ± 0.0
0.698SerCys: 0.698 ± 0.0
3.492SerAsp: 3.492 ± 0.0
3.142SerGlu: 3.142 ± 0.0
4.539SerPhe: 4.539 ± 0.0
4.19SerGly: 4.19 ± 0.0
1.397SerHis: 1.397 ± 0.0
4.19SerIle: 4.19 ± 0.0
4.539SerLys: 4.539 ± 0.0
6.634SerLeu: 6.634 ± 0.0
0.698SerMet: 0.698 ± 0.0
2.793SerAsn: 2.793 ± 0.0
4.19SerPro: 4.19 ± 0.0
3.841SerGln: 3.841 ± 0.0
1.397SerArg: 1.397 ± 0.0
5.587SerSer: 5.587 ± 0.0
2.793SerThr: 2.793 ± 0.0
3.841SerVal: 3.841 ± 0.0
1.397SerTrp: 1.397 ± 0.0
3.492SerTyr: 3.492 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.492ThrAla: 3.492 ± 0.0
0.0ThrCys: 0.0 ± 0.0
4.888ThrAsp: 4.888 ± 0.0
2.793ThrGlu: 2.793 ± 0.0
1.746ThrPhe: 1.746 ± 0.0
4.888ThrGly: 4.888 ± 0.0
1.746ThrHis: 1.746 ± 0.0
2.444ThrIle: 2.444 ± 0.0
2.444ThrLys: 2.444 ± 0.0
7.682ThrLeu: 7.682 ± 0.0
1.746ThrMet: 1.746 ± 0.0
1.047ThrAsn: 1.047 ± 0.0
3.841ThrPro: 3.841 ± 0.0
3.841ThrGln: 3.841 ± 0.0
3.492ThrArg: 3.492 ± 0.0
4.539ThrSer: 4.539 ± 0.0
4.539ThrThr: 4.539 ± 0.0
4.19ThrVal: 4.19 ± 0.0
0.698ThrTrp: 0.698 ± 0.0
1.047ThrTyr: 1.047 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
3.142ValAla: 3.142 ± 0.0
1.047ValCys: 1.047 ± 0.0
2.444ValAsp: 2.444 ± 0.0
5.587ValGlu: 5.587 ± 0.0
3.142ValPhe: 3.142 ± 0.0
2.793ValGly: 2.793 ± 0.0
1.397ValHis: 1.397 ± 0.0
3.142ValIle: 3.142 ± 0.0
4.888ValLys: 4.888 ± 0.0
5.936ValLeu: 5.936 ± 0.0
1.397ValMet: 1.397 ± 0.0
2.444ValAsn: 2.444 ± 0.0
3.142ValPro: 3.142 ± 0.0
2.444ValGln: 2.444 ± 0.0
3.492ValArg: 3.492 ± 0.0
4.19ValSer: 4.19 ± 0.0
5.936ValThr: 5.936 ± 0.0
2.793ValVal: 2.793 ± 0.0
1.397ValTrp: 1.397 ± 0.0
3.841ValTyr: 3.841 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.698TrpAla: 0.698 ± 0.0
0.349TrpCys: 0.349 ± 0.0
1.397TrpAsp: 1.397 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.349TrpPhe: 0.349 ± 0.0
0.349TrpGly: 0.349 ± 0.0
0.349TrpHis: 0.349 ± 0.0
0.698TrpIle: 0.698 ± 0.0
0.698TrpLys: 0.698 ± 0.0
1.397TrpLeu: 1.397 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.698TrpAsn: 0.698 ± 0.0
0.698TrpPro: 0.698 ± 0.0
1.746TrpGln: 1.746 ± 0.0
1.746TrpArg: 1.746 ± 0.0
0.698TrpSer: 0.698 ± 0.0
2.444TrpThr: 2.444 ± 0.0
0.698TrpVal: 0.698 ± 0.0
0.349TrpTrp: 0.349 ± 0.0
0.698TrpTyr: 0.698 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.746TyrAla: 1.746 ± 0.0
1.047TyrCys: 1.047 ± 0.0
3.142TyrAsp: 3.142 ± 0.0
1.397TyrGlu: 1.397 ± 0.0
2.095TyrPhe: 2.095 ± 0.0
1.397TyrGly: 1.397 ± 0.0
2.095TyrHis: 2.095 ± 0.0
4.888TyrIle: 4.888 ± 0.0
1.397TyrLys: 1.397 ± 0.0
3.142TyrLeu: 3.142 ± 0.0
1.397TyrMet: 1.397 ± 0.0
1.047TyrAsn: 1.047 ± 0.0
3.142TyrPro: 3.142 ± 0.0
0.698TyrGln: 0.698 ± 0.0
2.444TyrArg: 2.444 ± 0.0
2.793TyrSer: 2.793 ± 0.0
1.746TyrThr: 1.746 ± 0.0
1.397TyrVal: 1.397 ± 0.0
0.698TyrTrp: 0.698 ± 0.0
2.095TyrTyr: 2.095 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (2865 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski