Amino acid dipepetide frequency for Wuhan spirurian nematodes virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.562AlaAla: 5.562 ± 0.0
0.927AlaCys: 0.927 ± 0.0
4.326AlaAsp: 4.326 ± 0.0
2.472AlaGlu: 2.472 ± 0.0
1.854AlaPhe: 1.854 ± 0.0
4.944AlaGly: 4.944 ± 0.0
1.236AlaHis: 1.236 ± 0.0
3.708AlaIle: 3.708 ± 0.0
4.017AlaLys: 4.017 ± 0.0
5.562AlaLeu: 5.562 ± 0.0
1.545AlaMet: 1.545 ± 0.0
3.708AlaAsn: 3.708 ± 0.0
3.399AlaPro: 3.399 ± 0.0
3.09AlaGln: 3.09 ± 0.0
2.781AlaArg: 2.781 ± 0.0
4.944AlaSer: 4.944 ± 0.0
4.326AlaThr: 4.326 ± 0.0
3.09AlaVal: 3.09 ± 0.0
0.618AlaTrp: 0.618 ± 0.0
1.854AlaTyr: 1.854 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.545CysAla: 1.545 ± 0.0
0.309CysCys: 0.309 ± 0.0
2.163CysAsp: 2.163 ± 0.0
1.236CysGlu: 1.236 ± 0.0
0.618CysPhe: 0.618 ± 0.0
1.236CysGly: 1.236 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.309CysIle: 0.309 ± 0.0
0.618CysLys: 0.618 ± 0.0
0.309CysLeu: 0.309 ± 0.0
0.618CysMet: 0.618 ± 0.0
0.927CysAsn: 0.927 ± 0.0
1.236CysPro: 1.236 ± 0.0
0.927CysGln: 0.927 ± 0.0
0.309CysArg: 0.309 ± 0.0
0.618CysSer: 0.618 ± 0.0
0.309CysThr: 0.309 ± 0.0
1.545CysVal: 1.545 ± 0.0
0.309CysTrp: 0.309 ± 0.0
0.927CysTyr: 0.927 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.781AspAla: 2.781 ± 0.0
0.927AspCys: 0.927 ± 0.0
4.944AspAsp: 4.944 ± 0.0
5.253AspGlu: 5.253 ± 0.0
2.163AspPhe: 2.163 ± 0.0
3.399AspGly: 3.399 ± 0.0
0.927AspHis: 0.927 ± 0.0
3.399AspIle: 3.399 ± 0.0
2.781AspLys: 2.781 ± 0.0
5.562AspLeu: 5.562 ± 0.0
1.236AspMet: 1.236 ± 0.0
2.472AspAsn: 2.472 ± 0.0
2.163AspPro: 2.163 ± 0.0
1.545AspGln: 1.545 ± 0.0
2.781AspArg: 2.781 ± 0.0
6.489AspSer: 6.489 ± 0.0
3.09AspThr: 3.09 ± 0.0
4.635AspVal: 4.635 ± 0.0
1.236AspTrp: 1.236 ± 0.0
2.472AspTyr: 2.472 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.708GluAla: 3.708 ± 0.0
0.927GluCys: 0.927 ± 0.0
4.944GluAsp: 4.944 ± 0.0
6.18GluGlu: 6.18 ± 0.0
4.326GluPhe: 4.326 ± 0.0
1.236GluGly: 1.236 ± 0.0
0.927GluHis: 0.927 ± 0.0
3.399GluIle: 3.399 ± 0.0
4.017GluLys: 4.017 ± 0.0
4.017GluLeu: 4.017 ± 0.0
0.927GluMet: 0.927 ± 0.0
1.545GluAsn: 1.545 ± 0.0
2.781GluPro: 2.781 ± 0.0
4.326GluGln: 4.326 ± 0.0
4.017GluArg: 4.017 ± 0.0
2.472GluSer: 2.472 ± 0.0
2.781GluThr: 2.781 ± 0.0
5.871GluVal: 5.871 ± 0.0
1.236GluTrp: 1.236 ± 0.0
3.399GluTyr: 3.399 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.017PheAla: 4.017 ± 0.0
0.927PheCys: 0.927 ± 0.0
3.708PheAsp: 3.708 ± 0.0
4.635PheGlu: 4.635 ± 0.0
2.163PhePhe: 2.163 ± 0.0
4.635PheGly: 4.635 ± 0.0
1.236PheHis: 1.236 ± 0.0
1.236PheIle: 1.236 ± 0.0
0.618PheLys: 0.618 ± 0.0
3.09PheLeu: 3.09 ± 0.0
0.618PheMet: 0.618 ± 0.0
2.472PheAsn: 2.472 ± 0.0
2.472PhePro: 2.472 ± 0.0
4.017PheGln: 4.017 ± 0.0
3.708PheArg: 3.708 ± 0.0
5.871PheSer: 5.871 ± 0.0
0.927PheThr: 0.927 ± 0.0
4.635PheVal: 4.635 ± 0.0
1.545PheTrp: 1.545 ± 0.0
0.927PheTyr: 0.927 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.399GlyAla: 3.399 ± 0.0
0.618GlyCys: 0.618 ± 0.0
3.708GlyAsp: 3.708 ± 0.0
3.09GlyGlu: 3.09 ± 0.0
4.326GlyPhe: 4.326 ± 0.0
3.09GlyGly: 3.09 ± 0.0
1.545GlyHis: 1.545 ± 0.0
4.944GlyIle: 4.944 ± 0.0
1.854GlyLys: 1.854 ± 0.0
4.017GlyLeu: 4.017 ± 0.0
2.163GlyMet: 2.163 ± 0.0
1.236GlyAsn: 1.236 ± 0.0
2.163GlyPro: 2.163 ± 0.0
2.472GlyGln: 2.472 ± 0.0
2.781GlyArg: 2.781 ± 0.0
4.017GlySer: 4.017 ± 0.0
2.163GlyThr: 2.163 ± 0.0
4.326GlyVal: 4.326 ± 0.0
0.618GlyTrp: 0.618 ± 0.0
2.781GlyTyr: 2.781 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.618HisAla: 0.618 ± 0.0
0.309HisCys: 0.309 ± 0.0
2.163HisAsp: 2.163 ± 0.0
1.236HisGlu: 1.236 ± 0.0
1.236HisPhe: 1.236 ± 0.0
1.236HisGly: 1.236 ± 0.0
0.309HisHis: 0.309 ± 0.0
0.927HisIle: 0.927 ± 0.0
0.927HisLys: 0.927 ± 0.0
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
1.236HisAsn: 1.236 ± 0.0
0.927HisPro: 0.927 ± 0.0
0.927HisGln: 0.927 ± 0.0
0.927HisArg: 0.927 ± 0.0
0.927HisSer: 0.927 ± 0.0
1.545HisThr: 1.545 ± 0.0
1.545HisVal: 1.545 ± 0.0
0.309HisTrp: 0.309 ± 0.0
1.545HisTyr: 1.545 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.781IleAla: 2.781 ± 0.0
0.927IleCys: 0.927 ± 0.0
4.017IleAsp: 4.017 ± 0.0
2.781IleGlu: 2.781 ± 0.0
2.163IlePhe: 2.163 ± 0.0
3.09IleGly: 3.09 ± 0.0
1.545IleHis: 1.545 ± 0.0
2.163IleIle: 2.163 ± 0.0
3.09IleLys: 3.09 ± 0.0
3.09IleLeu: 3.09 ± 0.0
0.618IleMet: 0.618 ± 0.0
2.472IleAsn: 2.472 ± 0.0
3.09IlePro: 3.09 ± 0.0
1.854IleGln: 1.854 ± 0.0
2.163IleArg: 2.163 ± 0.0
5.871IleSer: 5.871 ± 0.0
3.708IleThr: 3.708 ± 0.0
5.562IleVal: 5.562 ± 0.0
0.927IleTrp: 0.927 ± 0.0
2.472IleTyr: 2.472 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.781LysAla: 2.781 ± 0.0
0.309LysCys: 0.309 ± 0.0
1.854LysAsp: 1.854 ± 0.0
1.854LysGlu: 1.854 ± 0.0
3.09LysPhe: 3.09 ± 0.0
2.163LysGly: 2.163 ± 0.0
0.309LysHis: 0.309 ± 0.0
4.635LysIle: 4.635 ± 0.0
1.236LysLys: 1.236 ± 0.0
2.781LysLeu: 2.781 ± 0.0
1.236LysMet: 1.236 ± 0.0
1.236LysAsn: 1.236 ± 0.0
4.017LysPro: 4.017 ± 0.0
2.163LysGln: 2.163 ± 0.0
4.326LysArg: 4.326 ± 0.0
2.163LysSer: 2.163 ± 0.0
2.472LysThr: 2.472 ± 0.0
3.09LysVal: 3.09 ± 0.0
0.309LysTrp: 0.309 ± 0.0
2.472LysTyr: 2.472 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.799LeuAla: 6.799 ± 0.0
0.618LeuCys: 0.618 ± 0.0
5.253LeuAsp: 5.253 ± 0.0
4.326LeuGlu: 4.326 ± 0.0
3.09LeuPhe: 3.09 ± 0.0
3.399LeuGly: 3.399 ± 0.0
0.309LeuHis: 0.309 ± 0.0
3.708LeuIle: 3.708 ± 0.0
3.708LeuLys: 3.708 ± 0.0
4.944LeuLeu: 4.944 ± 0.0
1.236LeuMet: 1.236 ± 0.0
1.545LeuAsn: 1.545 ± 0.0
3.09LeuPro: 3.09 ± 0.0
3.09LeuGln: 3.09 ± 0.0
3.399LeuArg: 3.399 ± 0.0
5.562LeuSer: 5.562 ± 0.0
4.017LeuThr: 4.017 ± 0.0
6.799LeuVal: 6.799 ± 0.0
1.854LeuTrp: 1.854 ± 0.0
1.854LeuTyr: 1.854 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
1.854MetAla: 1.854 ± 0.0
0.309MetCys: 0.309 ± 0.0
0.309MetAsp: 0.309 ± 0.0
1.854MetGlu: 1.854 ± 0.0
2.472MetPhe: 2.472 ± 0.0
0.618MetGly: 0.618 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.927MetIle: 0.927 ± 0.0
0.309MetLys: 0.309 ± 0.0
1.545MetLeu: 1.545 ± 0.0
0.618MetMet: 0.618 ± 0.0
1.236MetAsn: 1.236 ± 0.0
3.09MetPro: 3.09 ± 0.0
0.309MetGln: 0.309 ± 0.0
1.545MetArg: 1.545 ± 0.0
3.09MetSer: 3.09 ± 0.0
0.618MetThr: 0.618 ± 0.0
2.781MetVal: 2.781 ± 0.0
0.618MetTrp: 0.618 ± 0.0
0.309MetTyr: 0.309 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.545AsnAla: 1.545 ± 0.0
0.927AsnCys: 0.927 ± 0.0
1.854AsnAsp: 1.854 ± 0.0
2.163AsnGlu: 2.163 ± 0.0
2.472AsnPhe: 2.472 ± 0.0
1.854AsnGly: 1.854 ± 0.0
1.545AsnHis: 1.545 ± 0.0
2.472AsnIle: 2.472 ± 0.0
1.854AsnLys: 1.854 ± 0.0
1.854AsnLeu: 1.854 ± 0.0
1.854AsnMet: 1.854 ± 0.0
3.09AsnAsn: 3.09 ± 0.0
4.017AsnPro: 4.017 ± 0.0
1.854AsnGln: 1.854 ± 0.0
2.781AsnArg: 2.781 ± 0.0
3.399AsnSer: 3.399 ± 0.0
2.472AsnThr: 2.472 ± 0.0
4.017AsnVal: 4.017 ± 0.0
0.927AsnTrp: 0.927 ± 0.0
1.854AsnTyr: 1.854 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.09ProAla: 3.09 ± 0.0
0.927ProCys: 0.927 ± 0.0
3.708ProAsp: 3.708 ± 0.0
1.854ProGlu: 1.854 ± 0.0
3.09ProPhe: 3.09 ± 0.0
3.399ProGly: 3.399 ± 0.0
1.236ProHis: 1.236 ± 0.0
1.854ProIle: 1.854 ± 0.0
0.927ProLys: 0.927 ± 0.0
4.635ProLeu: 4.635 ± 0.0
1.236ProMet: 1.236 ± 0.0
2.781ProAsn: 2.781 ± 0.0
2.163ProPro: 2.163 ± 0.0
1.545ProGln: 1.545 ± 0.0
2.163ProArg: 2.163 ± 0.0
4.326ProSer: 4.326 ± 0.0
4.635ProThr: 4.635 ± 0.0
4.635ProVal: 4.635 ± 0.0
1.545ProTrp: 1.545 ± 0.0
3.399ProTyr: 3.399 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.472GlnAla: 2.472 ± 0.0
1.236GlnCys: 1.236 ± 0.0
0.927GlnAsp: 0.927 ± 0.0
3.399GlnGlu: 3.399 ± 0.0
3.708GlnPhe: 3.708 ± 0.0
3.708GlnGly: 3.708 ± 0.0
0.927GlnHis: 0.927 ± 0.0
1.236GlnIle: 1.236 ± 0.0
2.781GlnLys: 2.781 ± 0.0
1.854GlnLeu: 1.854 ± 0.0
1.236GlnMet: 1.236 ± 0.0
2.472GlnAsn: 2.472 ± 0.0
2.163GlnPro: 2.163 ± 0.0
2.781GlnGln: 2.781 ± 0.0
1.854GlnArg: 1.854 ± 0.0
2.163GlnSer: 2.163 ± 0.0
2.781GlnThr: 2.781 ± 0.0
3.399GlnVal: 3.399 ± 0.0
1.236GlnTrp: 1.236 ± 0.0
1.854GlnTyr: 1.854 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.708ArgAla: 3.708 ± 0.0
1.236ArgCys: 1.236 ± 0.0
2.163ArgAsp: 2.163 ± 0.0
3.09ArgGlu: 3.09 ± 0.0
3.09ArgPhe: 3.09 ± 0.0
2.163ArgGly: 2.163 ± 0.0
1.236ArgHis: 1.236 ± 0.0
3.09ArgIle: 3.09 ± 0.0
5.253ArgLys: 5.253 ± 0.0
3.399ArgLeu: 3.399 ± 0.0
1.854ArgMet: 1.854 ± 0.0
2.472ArgAsn: 2.472 ± 0.0
2.163ArgPro: 2.163 ± 0.0
2.472ArgGln: 2.472 ± 0.0
4.017ArgArg: 4.017 ± 0.0
4.635ArgSer: 4.635 ± 0.0
2.781ArgThr: 2.781 ± 0.0
3.09ArgVal: 3.09 ± 0.0
1.545ArgTrp: 1.545 ± 0.0
2.472ArgTyr: 2.472 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
4.944SerAla: 4.944 ± 0.0
0.927SerCys: 0.927 ± 0.0
6.18SerAsp: 6.18 ± 0.0
4.326SerGlu: 4.326 ± 0.0
3.09SerPhe: 3.09 ± 0.0
5.871SerGly: 5.871 ± 0.0
1.545SerHis: 1.545 ± 0.0
5.871SerIle: 5.871 ± 0.0
2.472SerLys: 2.472 ± 0.0
4.635SerLeu: 4.635 ± 0.0
2.163SerMet: 2.163 ± 0.0
2.163SerAsn: 2.163 ± 0.0
4.017SerPro: 4.017 ± 0.0
1.854SerGln: 1.854 ± 0.0
4.017SerArg: 4.017 ± 0.0
3.708SerSer: 3.708 ± 0.0
6.489SerThr: 6.489 ± 0.0
4.635SerVal: 4.635 ± 0.0
1.545SerTrp: 1.545 ± 0.0
4.326SerTyr: 4.326 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.708ThrAla: 3.708 ± 0.0
0.618ThrCys: 0.618 ± 0.0
2.472ThrAsp: 2.472 ± 0.0
3.708ThrGlu: 3.708 ± 0.0
3.09ThrPhe: 3.09 ± 0.0
2.781ThrGly: 2.781 ± 0.0
1.236ThrHis: 1.236 ± 0.0
2.781ThrIle: 2.781 ± 0.0
1.236ThrLys: 1.236 ± 0.0
5.253ThrLeu: 5.253 ± 0.0
2.472ThrMet: 2.472 ± 0.0
3.399ThrAsn: 3.399 ± 0.0
3.399ThrPro: 3.399 ± 0.0
4.017ThrGln: 4.017 ± 0.0
3.09ThrArg: 3.09 ± 0.0
5.253ThrSer: 5.253 ± 0.0
4.635ThrThr: 4.635 ± 0.0
3.708ThrVal: 3.708 ± 0.0
0.927ThrTrp: 0.927 ± 0.0
2.163ThrTyr: 2.163 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.326ValAla: 4.326 ± 0.0
1.854ValCys: 1.854 ± 0.0
3.708ValAsp: 3.708 ± 0.0
5.562ValGlu: 5.562 ± 0.0
4.326ValPhe: 4.326 ± 0.0
2.163ValGly: 2.163 ± 0.0
1.236ValHis: 1.236 ± 0.0
5.562ValIle: 5.562 ± 0.0
3.09ValLys: 3.09 ± 0.0
4.944ValLeu: 4.944 ± 0.0
1.545ValMet: 1.545 ± 0.0
5.253ValAsn: 5.253 ± 0.0
4.635ValPro: 4.635 ± 0.0
2.472ValGln: 2.472 ± 0.0
5.253ValArg: 5.253 ± 0.0
5.253ValSer: 5.253 ± 0.0
4.017ValThr: 4.017 ± 0.0
5.562ValVal: 5.562 ± 0.0
0.927ValTrp: 0.927 ± 0.0
5.562ValTyr: 5.562 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
1.236TrpAla: 1.236 ± 0.0
0.927TrpCys: 0.927 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.236TrpGlu: 1.236 ± 0.0
0.618TrpPhe: 0.618 ± 0.0
0.927TrpGly: 0.927 ± 0.0
0.309TrpHis: 0.309 ± 0.0
0.618TrpIle: 0.618 ± 0.0
0.927TrpLys: 0.927 ± 0.0
1.854TrpLeu: 1.854 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.236TrpAsn: 1.236 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.927TrpGln: 0.927 ± 0.0
1.545TrpArg: 1.545 ± 0.0
1.236TrpSer: 1.236 ± 0.0
2.781TrpThr: 2.781 ± 0.0
2.163TrpVal: 2.163 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.236TrpTyr: 1.236 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.09TyrAla: 3.09 ± 0.0
0.309TyrCys: 0.309 ± 0.0
1.854TyrAsp: 1.854 ± 0.0
2.781TyrGlu: 2.781 ± 0.0
2.472TyrPhe: 2.472 ± 0.0
3.399TyrGly: 3.399 ± 0.0
1.236TyrHis: 1.236 ± 0.0
1.545TyrIle: 1.545 ± 0.0
2.781TyrLys: 2.781 ± 0.0
5.253TyrLeu: 5.253 ± 0.0
0.927TyrMet: 0.927 ± 0.0
1.854TyrAsn: 1.854 ± 0.0
2.163TyrPro: 2.163 ± 0.0
1.545TyrGln: 1.545 ± 0.0
2.472TyrArg: 2.472 ± 0.0
3.09TyrSer: 3.09 ± 0.0
3.399TyrThr: 3.399 ± 0.0
2.163TyrVal: 2.163 ± 0.0
1.545TyrTrp: 1.545 ± 0.0
3.399TyrTyr: 3.399 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (3237 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski