Amino acid dipepetide frequency for Hubei sobemo-like virus 12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.035AlaAla: 3.035 ± 0.038
0.0AlaCys: 0.0 ± 0.0
1.517AlaAsp: 1.517 ± 1.032
3.035AlaGlu: 3.035 ± 0.038
3.035AlaPhe: 3.035 ± 2.14
6.07AlaGly: 6.07 ± 2.178
0.0AlaHis: 0.0 ± 0.0
4.552AlaIle: 4.552 ± 3.21
4.552AlaLys: 4.552 ± 0.993
4.552AlaLeu: 4.552 ± 3.21
3.035AlaMet: 3.035 ± 2.064
1.517AlaAsn: 1.517 ± 1.032
3.035AlaPro: 3.035 ± 2.064
3.035AlaGln: 3.035 ± 0.038
4.552AlaArg: 4.552 ± 1.108
3.035AlaSer: 3.035 ± 2.064
4.552AlaThr: 4.552 ± 3.095
7.587AlaVal: 7.587 ± 0.955
0.0AlaTrp: 0.0 ± 0.0
1.517AlaTyr: 1.517 ± 1.032
0.0AlaXaa: 0.0 ± 0.0
Cys
1.517CysAla: 1.517 ± 1.07
0.0CysCys: 0.0 ± 0.0
1.517CysAsp: 1.517 ± 1.07
4.552CysGlu: 4.552 ± 1.108
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.517CysPro: 1.517 ± 1.032
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
3.035CysTyr: 3.035 ± 2.14
0.0CysXaa: 0.0 ± 0.0
Asp
3.035AspAla: 3.035 ± 2.064
0.0AspCys: 0.0 ± 0.0
3.035AspAsp: 3.035 ± 0.038
4.552AspGlu: 4.552 ± 1.108
4.552AspPhe: 4.552 ± 1.108
3.035AspGly: 3.035 ± 0.038
1.517AspHis: 1.517 ± 1.032
3.035AspIle: 3.035 ± 2.14
7.587AspLys: 7.587 ± 1.147
6.07AspLeu: 6.07 ± 2.025
1.517AspMet: 1.517 ± 1.032
3.035AspAsn: 3.035 ± 2.064
4.552AspPro: 4.552 ± 1.108
4.552AspGln: 4.552 ± 0.993
4.552AspArg: 4.552 ± 0.993
4.552AspSer: 4.552 ± 0.993
1.517AspThr: 1.517 ± 1.07
1.517AspVal: 1.517 ± 1.07
3.035AspTrp: 3.035 ± 0.038
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
6.07GluAla: 6.07 ± 0.077
0.0GluCys: 0.0 ± 0.0
1.517GluAsp: 1.517 ± 1.07
3.035GluGlu: 3.035 ± 0.038
4.552GluPhe: 4.552 ± 0.993
1.517GluGly: 1.517 ± 1.07
3.035GluHis: 3.035 ± 2.064
3.035GluIle: 3.035 ± 0.038
6.07GluLys: 6.07 ± 2.178
7.587GluLeu: 7.587 ± 1.147
0.0GluMet: 0.0 ± 0.762
1.517GluAsn: 1.517 ± 1.07
1.517GluPro: 1.517 ± 1.07
1.517GluGln: 1.517 ± 1.032
3.035GluArg: 3.035 ± 2.14
4.552GluSer: 4.552 ± 0.993
1.517GluThr: 1.517 ± 1.032
12.14GluVal: 12.14 ± 4.357
1.517GluTrp: 1.517 ± 1.07
3.035GluTyr: 3.035 ± 0.038
0.0GluXaa: 0.0 ± 0.0
Phe
1.517PheAla: 1.517 ± 1.032
0.0PheCys: 0.0 ± 0.0
4.552PheAsp: 4.552 ± 1.108
6.07PheGlu: 6.07 ± 4.28
4.552PhePhe: 4.552 ± 3.21
1.517PheGly: 1.517 ± 1.07
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
1.517PheLys: 1.517 ± 1.07
1.517PheLeu: 1.517 ± 1.032
0.0PheMet: 0.0 ± 0.0
1.517PheAsn: 1.517 ± 1.07
1.517PhePro: 1.517 ± 1.07
4.552PheGln: 4.552 ± 0.993
3.035PheArg: 3.035 ± 2.064
3.035PheSer: 3.035 ± 0.038
3.035PheThr: 3.035 ± 0.038
1.517PheVal: 1.517 ± 1.07
0.0PheTrp: 0.0 ± 0.0
3.035PheTyr: 3.035 ± 2.064
0.0PheXaa: 0.0 ± 0.0
Gly
3.035GlyAla: 3.035 ± 2.064
3.035GlyCys: 3.035 ± 2.14
4.552GlyAsp: 4.552 ± 3.21
6.07GlyGlu: 6.07 ± 2.178
3.035GlyPhe: 3.035 ± 2.064
6.07GlyGly: 6.07 ± 2.178
0.0GlyHis: 0.0 ± 0.0
3.035GlyIle: 3.035 ± 2.064
4.552GlyLys: 4.552 ± 0.993
10.622GlyLeu: 10.622 ± 3.019
3.035GlyMet: 3.035 ± 2.064
1.517GlyAsn: 1.517 ± 1.07
0.0GlyPro: 0.0 ± 0.0
1.517GlyGln: 1.517 ± 1.07
6.07GlyArg: 6.07 ± 2.178
4.552GlySer: 4.552 ± 3.095
4.552GlyThr: 4.552 ± 0.993
1.517GlyVal: 1.517 ± 1.07
1.517GlyTrp: 1.517 ± 1.07
6.07GlyTyr: 6.07 ± 0.077
0.0GlyXaa: 0.0 ± 0.0
His
3.035HisAla: 3.035 ± 2.14
0.0HisCys: 0.0 ± 0.0
3.035HisAsp: 3.035 ± 2.064
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
3.035HisGly: 3.035 ± 0.038
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
3.035HisLys: 3.035 ± 0.038
3.035HisLeu: 3.035 ± 2.064
1.517HisMet: 1.517 ± 1.032
1.517HisAsn: 1.517 ± 1.032
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.517HisSer: 1.517 ± 1.032
0.0HisThr: 0.0 ± 0.0
4.552HisVal: 4.552 ± 1.108
0.0HisTrp: 0.0 ± 0.0
1.517HisTyr: 1.517 ± 1.07
0.0HisXaa: 0.0 ± 0.0
Ile
4.552IleAla: 4.552 ± 0.993
1.517IleCys: 1.517 ± 1.07
4.552IleAsp: 4.552 ± 1.108
0.0IleGlu: 0.0 ± 0.0
1.517IlePhe: 1.517 ± 1.07
1.517IleGly: 1.517 ± 1.07
0.0IleHis: 0.0 ± 0.0
0.0IleIle: 0.0 ± 0.0
3.035IleLys: 3.035 ± 0.038
7.587IleLeu: 7.587 ± 1.147
0.0IleMet: 0.0 ± 0.0
0.0IleAsn: 0.0 ± 0.0
3.035IlePro: 3.035 ± 2.14
3.035IleGln: 3.035 ± 2.064
4.552IleArg: 4.552 ± 1.108
3.035IleSer: 3.035 ± 0.038
1.517IleThr: 1.517 ± 1.032
6.07IleVal: 6.07 ± 0.077
0.0IleTrp: 0.0 ± 0.0
1.517IleTyr: 1.517 ± 1.032
0.0IleXaa: 0.0 ± 0.0
Lys
3.035LysAla: 3.035 ± 0.038
0.0LysCys: 0.0 ± 0.0
1.517LysAsp: 1.517 ± 1.07
4.552LysGlu: 4.552 ± 1.108
1.517LysPhe: 1.517 ± 1.07
7.587LysGly: 7.587 ± 0.955
1.517LysHis: 1.517 ± 1.07
1.517LysIle: 1.517 ± 1.032
4.552LysLys: 4.552 ± 0.993
3.035LysLeu: 3.035 ± 2.14
3.035LysMet: 3.035 ± 2.064
3.035LysAsn: 3.035 ± 2.064
0.0LysPro: 0.0 ± 0.0
4.552LysGln: 4.552 ± 0.993
4.552LysArg: 4.552 ± 0.993
6.07LysSer: 6.07 ± 2.178
1.517LysThr: 1.517 ± 1.032
7.587LysVal: 7.587 ± 3.248
1.517LysTrp: 1.517 ± 1.07
7.587LysTyr: 7.587 ± 0.955
0.0LysXaa: 0.0 ± 0.0
Leu
3.035LeuAla: 3.035 ± 2.064
3.035LeuCys: 3.035 ± 0.038
4.552LeuAsp: 4.552 ± 3.095
10.622LeuGlu: 10.622 ± 1.185
1.517LeuPhe: 1.517 ± 1.07
7.587LeuGly: 7.587 ± 3.248
4.552LeuHis: 4.552 ± 1.108
4.552LeuIle: 4.552 ± 1.108
3.035LeuLys: 3.035 ± 2.064
4.552LeuLeu: 4.552 ± 1.108
1.517LeuMet: 1.517 ± 1.032
1.517LeuAsn: 1.517 ± 1.07
4.552LeuPro: 4.552 ± 0.993
1.517LeuGln: 1.517 ± 1.07
3.035LeuArg: 3.035 ± 0.038
3.035LeuSer: 3.035 ± 0.038
3.035LeuThr: 3.035 ± 0.038
3.035LeuVal: 3.035 ± 0.038
4.552LeuTrp: 4.552 ± 3.21
1.517LeuTyr: 1.517 ± 1.07
0.0LeuXaa: 0.0 ± 0.0
Met
1.517MetAla: 1.517 ± 1.032
0.0MetCys: 0.0 ± 0.0
6.07MetAsp: 6.07 ± 2.025
1.517MetGlu: 1.517 ± 1.032
3.035MetPhe: 3.035 ± 2.064
4.552MetGly: 4.552 ± 3.095
1.517MetHis: 1.517 ± 1.032
1.517MetIle: 1.517 ± 1.07
3.035MetLys: 3.035 ± 2.064
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.517MetPro: 1.517 ± 1.032
1.517MetGln: 1.517 ± 1.032
0.0MetArg: 0.0 ± 0.0
1.517MetSer: 1.517 ± 1.032
0.0MetThr: 0.0 ± 0.0
4.552MetVal: 4.552 ± 0.993
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
1.517AsnCys: 1.517 ± 1.032
3.035AsnAsp: 3.035 ± 2.064
4.552AsnGlu: 4.552 ± 1.108
1.517AsnPhe: 1.517 ± 1.032
0.0AsnGly: 0.0 ± 0.0
0.0AsnHis: 0.0 ± 0.0
1.517AsnIle: 1.517 ± 1.032
1.517AsnLys: 1.517 ± 1.07
3.035AsnLeu: 3.035 ± 0.038
3.035AsnMet: 3.035 ± 2.064
3.035AsnAsn: 3.035 ± 0.038
0.0AsnPro: 0.0 ± 0.0
1.517AsnGln: 1.517 ± 1.032
4.552AsnArg: 4.552 ± 0.993
1.517AsnSer: 1.517 ± 1.032
1.517AsnThr: 1.517 ± 1.07
0.0AsnVal: 0.0 ± 0.0
1.517AsnTrp: 1.517 ± 1.07
3.035AsnTyr: 3.035 ± 2.14
0.0AsnXaa: 0.0 ± 0.0
Pro
1.517ProAla: 1.517 ± 1.032
0.0ProCys: 0.0 ± 0.0
0.0ProAsp: 0.0 ± 0.0
3.035ProGlu: 3.035 ± 0.038
0.0ProPhe: 0.0 ± 0.0
3.035ProGly: 3.035 ± 0.038
1.517ProHis: 1.517 ± 1.07
4.552ProIle: 4.552 ± 0.993
3.035ProLys: 3.035 ± 2.14
0.0ProLeu: 0.0 ± 0.0
0.0ProMet: 0.0 ± 0.0
1.517ProAsn: 1.517 ± 1.07
0.0ProPro: 0.0 ± 0.0
0.0ProGln: 0.0 ± 0.0
1.517ProArg: 1.517 ± 1.032
4.552ProSer: 4.552 ± 0.993
4.552ProThr: 4.552 ± 0.993
1.517ProVal: 1.517 ± 1.07
1.517ProTrp: 1.517 ± 1.032
4.552ProTyr: 4.552 ± 3.095
0.0ProXaa: 0.0 ± 0.0
Gln
4.552GlnAla: 4.552 ± 0.993
0.0GlnCys: 0.0 ± 0.0
6.07GlnAsp: 6.07 ± 2.178
4.552GlnGlu: 4.552 ± 1.108
0.0GlnPhe: 0.0 ± 0.0
4.552GlnGly: 4.552 ± 3.095
1.517GlnHis: 1.517 ± 1.032
1.517GlnIle: 1.517 ± 1.032
6.07GlnLys: 6.07 ± 2.178
1.517GlnLeu: 1.517 ± 1.07
3.035GlnMet: 3.035 ± 0.774
1.517GlnAsn: 1.517 ± 1.032
1.517GlnPro: 1.517 ± 1.032
3.035GlnGln: 3.035 ± 0.038
4.552GlnArg: 4.552 ± 1.108
0.0GlnSer: 0.0 ± 0.0
1.517GlnThr: 1.517 ± 1.032
1.517GlnVal: 1.517 ± 1.032
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
7.587ArgAla: 7.587 ± 1.147
1.517ArgCys: 1.517 ± 1.07
0.0ArgAsp: 0.0 ± 0.0
3.035ArgGlu: 3.035 ± 2.064
0.0ArgPhe: 0.0 ± 0.0
3.035ArgGly: 3.035 ± 2.064
0.0ArgHis: 0.0 ± 0.0
7.587ArgIle: 7.587 ± 3.248
3.035ArgLys: 3.035 ± 0.038
3.035ArgLeu: 3.035 ± 0.038
3.035ArgMet: 3.035 ± 2.064
4.552ArgAsn: 4.552 ± 0.993
1.517ArgPro: 1.517 ± 1.032
1.517ArgGln: 1.517 ± 1.07
3.035ArgArg: 3.035 ± 0.038
6.07ArgSer: 6.07 ± 2.025
1.517ArgThr: 1.517 ± 1.032
6.07ArgVal: 6.07 ± 2.178
1.517ArgTrp: 1.517 ± 1.07
1.517ArgTyr: 1.517 ± 1.07
0.0ArgXaa: 0.0 ± 0.0
Ser
4.552SerAla: 4.552 ± 1.108
0.0SerCys: 0.0 ± 0.0
4.552SerAsp: 4.552 ± 0.993
0.0SerGlu: 0.0 ± 0.0
1.517SerPhe: 1.517 ± 1.07
6.07SerGly: 6.07 ± 2.025
0.0SerHis: 0.0 ± 0.0
3.035SerIle: 3.035 ± 0.038
3.035SerLys: 3.035 ± 2.064
4.552SerLeu: 4.552 ± 1.108
3.035SerMet: 3.035 ± 2.064
3.035SerAsn: 3.035 ± 0.038
3.035SerPro: 3.035 ± 0.038
3.035SerGln: 3.035 ± 2.064
4.552SerArg: 4.552 ± 0.993
1.517SerSer: 1.517 ± 1.032
6.07SerThr: 6.07 ± 4.127
1.517SerVal: 1.517 ± 1.07
0.0SerTrp: 0.0 ± 0.0
3.035SerTyr: 3.035 ± 0.038
0.0SerXaa: 0.0 ± 0.0
Thr
1.517ThrAla: 1.517 ± 1.032
0.0ThrCys: 0.0 ± 0.0
4.552ThrAsp: 4.552 ± 3.095
3.035ThrGlu: 3.035 ± 0.038
1.517ThrPhe: 1.517 ± 1.032
3.035ThrGly: 3.035 ± 0.038
3.035ThrHis: 3.035 ± 2.064
4.552ThrIle: 4.552 ± 1.108
0.0ThrLys: 0.0 ± 0.0
1.517ThrLeu: 1.517 ± 1.032
3.035ThrMet: 3.035 ± 0.038
4.552ThrAsn: 4.552 ± 0.993
4.552ThrPro: 4.552 ± 0.993
1.517ThrGln: 1.517 ± 1.032
0.0ThrArg: 0.0 ± 0.0
3.035ThrSer: 3.035 ± 0.038
3.035ThrThr: 3.035 ± 2.064
4.552ThrVal: 4.552 ± 1.108
0.0ThrTrp: 0.0 ± 0.0
1.517ThrTyr: 1.517 ± 1.032
0.0ThrXaa: 0.0 ± 0.0
Val
6.07ValAla: 6.07 ± 2.178
1.517ValCys: 1.517 ± 1.07
6.07ValAsp: 6.07 ± 0.077
3.035ValGlu: 3.035 ± 0.038
9.105ValPhe: 9.105 ± 2.217
7.587ValGly: 7.587 ± 1.147
3.035ValHis: 3.035 ± 0.038
1.517ValIle: 1.517 ± 1.032
3.035ValLys: 3.035 ± 2.14
4.552ValLeu: 4.552 ± 1.108
1.517ValMet: 1.517 ± 1.032
1.517ValAsn: 1.517 ± 1.07
1.517ValPro: 1.517 ± 1.07
7.587ValGln: 7.587 ± 3.248
4.552ValArg: 4.552 ± 3.095
3.035ValSer: 3.035 ± 0.038
3.035ValThr: 3.035 ± 0.038
7.587ValVal: 7.587 ± 0.955
0.0ValTrp: 0.0 ± 0.0
3.035ValTyr: 3.035 ± 2.14
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
3.035TrpAsp: 3.035 ± 2.14
1.517TrpGlu: 1.517 ± 1.032
1.517TrpPhe: 1.517 ± 1.07
1.517TrpGly: 1.517 ± 1.032
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
3.035TrpLys: 3.035 ± 2.14
1.517TrpLeu: 1.517 ± 1.07
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.517TrpSer: 1.517 ± 1.07
3.035TrpThr: 3.035 ± 2.14
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.517TrpTyr: 1.517 ± 1.07
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.035TyrAla: 3.035 ± 2.14
0.0TyrCys: 0.0 ± 0.0
1.517TyrAsp: 1.517 ± 1.032
1.517TyrGlu: 1.517 ± 1.07
0.0TyrPhe: 0.0 ± 0.0
3.035TyrGly: 3.035 ± 0.038
4.552TyrHis: 4.552 ± 1.108
1.517TyrIle: 1.517 ± 1.032
4.552TyrLys: 4.552 ± 3.095
6.07TyrLeu: 6.07 ± 2.178
0.0TyrMet: 0.0 ± 0.0
1.517TyrAsn: 1.517 ± 1.032
3.035TyrPro: 3.035 ± 2.064
3.035TyrGln: 3.035 ± 2.14
3.035TyrArg: 3.035 ± 2.14
0.0TyrSer: 0.0 ± 0.0
3.035TyrThr: 3.035 ± 0.038
6.07TyrVal: 6.07 ± 2.025
1.517TyrTrp: 1.517 ± 1.07
1.517TyrTyr: 1.517 ± 1.07
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (660 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski