Amino acid dipeptide frequency for Hubei sobemo-like virus 19

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins (with all amino acids equally likely), each dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
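
As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 0.25% (2.5‰). The function names, the one-letter alphabet, the toy sequences, and the choice to count pairs within each protein separately are assumptions made for this example; the page does not state the exact counting convention used by the database.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids (assumed alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_UNIFORM = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_UNIFORM):
    """Label a frequency as over- or underrepresented vs. the expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input, not the real proteome.
freqs = dipeptide_permille(["MKLLLA", "LLKA"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With this convention, a value such as the 17.4‰ reported for LeuLeu would be labelled "over", and any dipeptide that never occurs would be labelled "under".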

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
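
To make the unit conversion concrete, the short sketch below converts a per mille value from the table into a fraction and a percentage, and expresses the uniform expectation of 0.25% in per mille. The helper names are hypothetical and only serve this example.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


leu_leu = 17.4  # LeuLeu value taken from the table, in per mille
print(f"{permille_to_fraction(leu_leu):.4f}")   # fraction of all dipeptides
print(f"{permille_to_percent(leu_leu):.2f} %")  # same value in percent

# Uniform expectation: 0.25% corresponds to 2.5 per mille.
expected_permille = 0.25 * 10.0
print(expected_permille)
```

Read this way, the table values can be compared directly with the 2.5‰ baseline instead of converting every entry to percent.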

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.094AlaAla: 4.094 ± 2.413
0.0AlaCys: 0.0 ± 0.0
3.071AlaAsp: 3.071 ± 1.746
3.071AlaGlu: 3.071 ± 1.81
1.024AlaPhe: 1.024 ± 0.603
4.094AlaGly: 4.094 ± 2.921
0.0AlaHis: 0.0 ± 0.0
6.141AlaIle: 6.141 ± 1.842
5.118AlaLys: 5.118 ± 3.017
2.047AlaLeu: 2.047 ± 1.207
5.118AlaMet: 5.118 ± 1.238
2.047AlaAsn: 2.047 ± 1.207
1.024AlaPro: 1.024 ± 1.175
1.024AlaGln: 1.024 ± 0.603
2.047AlaArg: 2.047 ± 0.571
2.047AlaSer: 2.047 ± 1.207
4.094AlaThr: 4.094 ± 0.635
4.094AlaVal: 4.094 ± 0.635
0.0AlaTrp: 0.0 ± 0.0
4.094AlaTyr: 4.094 ± 1.143
0.0AlaXaa: 0.0 ± 0.0
Cys
1.024CysAla: 1.024 ± 0.603
0.0CysCys: 0.0 ± 0.0
1.024CysAsp: 1.024 ± 0.603
2.047CysGlu: 2.047 ± 1.207
1.024CysPhe: 1.024 ± 0.603
3.071CysGly: 3.071 ± 1.81
0.0CysHis: 0.0 ± 0.0
2.047CysIle: 2.047 ± 2.35
1.024CysLys: 1.024 ± 0.603
2.047CysLeu: 2.047 ± 0.571
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
2.047CysArg: 2.047 ± 0.571
1.024CysSer: 1.024 ± 0.603
0.0CysThr: 0.0 ± 0.0
2.047CysVal: 2.047 ± 0.571
1.024CysTrp: 1.024 ± 0.603
1.024CysTyr: 1.024 ± 1.175
0.0CysXaa: 0.0 ± 0.0
Asp
2.047AspAla: 2.047 ± 1.207
1.024AspCys: 1.024 ± 1.175
5.118AspAsp: 5.118 ± 1.238
2.047AspGlu: 2.047 ± 2.35
3.071AspPhe: 3.071 ± 1.81
3.071AspGly: 3.071 ± 1.746
1.024AspHis: 1.024 ± 1.175
5.118AspIle: 5.118 ± 3.017
8.188AspLys: 8.188 ± 1.27
5.118AspLeu: 5.118 ± 0.54
1.024AspMet: 1.024 ± 0.603
1.024AspAsn: 1.024 ± 1.175
4.094AspPro: 4.094 ± 2.921
1.024AspGln: 1.024 ± 1.175
5.118AspArg: 5.118 ± 0.54
3.071AspSer: 3.071 ± 1.81
1.024AspThr: 1.024 ± 1.175
3.071AspVal: 3.071 ± 0.032
2.047AspTrp: 2.047 ± 2.35
3.071AspTyr: 3.071 ± 0.032
0.0AspXaa: 0.0 ± 0.0
Glu
3.071GluAla: 3.071 ± 0.032
1.024GluCys: 1.024 ± 0.603
2.047GluAsp: 2.047 ± 1.207
6.141GluGlu: 6.141 ± 3.62
2.047GluPhe: 2.047 ± 0.571
2.047GluGly: 2.047 ± 1.207
1.024GluHis: 1.024 ± 0.603
2.047GluIle: 2.047 ± 0.571
6.141GluLys: 6.141 ± 3.62
5.118GluLeu: 5.118 ± 3.017
2.047GluMet: 2.047 ± 1.199
2.047GluAsn: 2.047 ± 1.207
3.071GluPro: 3.071 ± 1.746
3.071GluGln: 3.071 ± 1.81
2.047GluArg: 2.047 ± 1.207
3.071GluSer: 3.071 ± 1.81
3.071GluThr: 3.071 ± 0.032
4.094GluVal: 4.094 ± 2.921
1.024GluTrp: 1.024 ± 1.175
1.024GluTyr: 1.024 ± 1.175
0.0GluXaa: 0.0 ± 0.0
Phe
3.071PheAla: 3.071 ± 0.032
0.0PheCys: 0.0 ± 0.0
5.118PheAsp: 5.118 ± 2.318
3.071PheGlu: 3.071 ± 0.032
2.047PhePhe: 2.047 ± 1.207
3.071PheGly: 3.071 ± 1.746
1.024PheHis: 1.024 ± 0.603
3.071PheIle: 3.071 ± 1.746
2.047PheLys: 2.047 ± 1.207
4.094PheLeu: 4.094 ± 1.143
0.0PheMet: 0.0 ± 0.0
2.047PheAsn: 2.047 ± 1.207
1.024PhePro: 1.024 ± 0.603
2.047PheGln: 2.047 ± 1.207
3.071PheArg: 3.071 ± 1.746
1.024PheSer: 1.024 ± 1.175
0.0PheThr: 0.0 ± 0.0
4.094PheVal: 4.094 ± 0.635
0.0PheTrp: 0.0 ± 0.0
1.024PheTyr: 1.024 ± 0.603
0.0PheXaa: 0.0 ± 0.0
Gly
2.047GlyAla: 2.047 ± 1.207
1.024GlyCys: 1.024 ± 1.175
4.094GlyAsp: 4.094 ± 1.143
8.188GlyGlu: 8.188 ± 0.508
5.118GlyPhe: 5.118 ± 2.318
5.118GlyGly: 5.118 ± 5.874
0.0GlyHis: 0.0 ± 0.0
4.094GlyIle: 4.094 ± 2.413
4.094GlyLys: 4.094 ± 0.635
3.071GlyLeu: 3.071 ± 0.032
2.047GlyMet: 2.047 ± 0.571
3.071GlyAsn: 3.071 ± 0.032
1.024GlyPro: 1.024 ± 1.175
3.071GlyGln: 3.071 ± 1.81
1.024GlyArg: 1.024 ± 0.603
5.118GlySer: 5.118 ± 2.318
1.024GlyThr: 1.024 ± 0.603
4.094GlyVal: 4.094 ± 1.143
4.094GlyTrp: 4.094 ± 2.921
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
3.071HisAla: 3.071 ± 1.746
1.024HisCys: 1.024 ± 0.603
1.024HisAsp: 1.024 ± 0.603
0.0HisGlu: 0.0 ± 0.0
1.024HisPhe: 1.024 ± 0.603
4.094HisGly: 4.094 ± 1.143
0.0HisHis: 0.0 ± 0.0
1.024HisIle: 1.024 ± 1.175
0.0HisLys: 0.0 ± 0.0
3.071HisLeu: 3.071 ± 1.81
2.047HisMet: 2.047 ± 0.571
1.024HisAsn: 1.024 ± 0.603
1.024HisPro: 1.024 ± 0.603
0.0HisGln: 0.0 ± 0.0
2.047HisArg: 2.047 ± 0.571
1.024HisSer: 1.024 ± 1.175
0.0HisThr: 0.0 ± 0.0
2.047HisVal: 2.047 ± 1.207
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.071IleAla: 3.071 ± 0.032
2.047IleCys: 2.047 ± 1.207
7.165IleAsp: 7.165 ± 1.111
0.0IleGlu: 0.0 ± 0.0
3.071IlePhe: 3.071 ± 1.746
3.071IleGly: 3.071 ± 1.746
0.0IleHis: 0.0 ± 0.0
2.047IleIle: 2.047 ± 1.207
3.071IleLys: 3.071 ± 0.032
3.071IleLeu: 3.071 ± 1.81
2.047IleMet: 2.047 ± 1.207
1.024IleAsn: 1.024 ± 0.603
3.071IlePro: 3.071 ± 1.81
1.024IleGln: 1.024 ± 0.603
2.047IleArg: 2.047 ± 0.571
9.212IleSer: 9.212 ± 0.096
5.118IleThr: 5.118 ± 1.238
3.071IleVal: 3.071 ± 1.746
0.0IleTrp: 0.0 ± 0.0
2.047IleTyr: 2.047 ± 0.571
0.0IleXaa: 0.0 ± 0.0
Lys
3.071LysAla: 3.071 ± 1.746
3.071LysCys: 3.071 ± 1.81
2.047LysAsp: 2.047 ± 1.207
5.118LysGlu: 5.118 ± 3.017
1.024LysPhe: 1.024 ± 0.603
4.094LysGly: 4.094 ± 0.635
3.071LysHis: 3.071 ± 0.032
5.118LysIle: 5.118 ± 2.318
6.141LysLys: 6.141 ± 1.842
5.118LysLeu: 5.118 ± 2.318
3.071LysMet: 3.071 ± 1.81
4.094LysAsn: 4.094 ± 0.635
2.047LysPro: 2.047 ± 0.571
3.071LysGln: 3.071 ± 1.746
4.094LysArg: 4.094 ± 1.143
4.094LysSer: 4.094 ± 1.143
6.141LysThr: 6.141 ± 3.62
8.188LysVal: 8.188 ± 3.048
0.0LysTrp: 0.0 ± 0.0
5.118LysTyr: 5.118 ± 1.238
0.0LysXaa: 0.0 ± 0.0
Leu
10.235LeuAla: 10.235 ± 1.079
3.071LeuCys: 3.071 ± 1.746
7.165LeuAsp: 7.165 ± 2.889
7.165LeuGlu: 7.165 ± 0.667
7.165LeuPhe: 7.165 ± 4.667
4.094LeuGly: 4.094 ± 1.143
3.071LeuHis: 3.071 ± 1.746
4.094LeuIle: 4.094 ± 2.921
9.212LeuLys: 9.212 ± 0.096
17.4LeuLeu: 17.4 ± 0.412
2.047LeuMet: 2.047 ± 1.207
4.094LeuAsn: 4.094 ± 2.413
3.071LeuPro: 3.071 ± 0.032
1.024LeuGln: 1.024 ± 1.175
5.118LeuArg: 5.118 ± 3.017
6.141LeuSer: 6.141 ± 0.064
5.118LeuThr: 5.118 ± 1.238
6.141LeuVal: 6.141 ± 0.064
2.047LeuTrp: 2.047 ± 1.207
2.047LeuTyr: 2.047 ± 0.571
0.0LeuXaa: 0.0 ± 0.0
Met
1.024MetAla: 1.024 ± 0.603
0.0MetCys: 0.0 ± 0.0
1.024MetAsp: 1.024 ± 0.603
2.047MetGlu: 2.047 ± 1.207
0.0MetPhe: 0.0 ± 0.0
1.024MetGly: 1.024 ± 1.175
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
5.118MetLys: 5.118 ± 3.017
5.118MetLeu: 5.118 ± 0.54
2.047MetMet: 2.047 ± 0.571
2.047MetAsn: 2.047 ± 0.571
2.047MetPro: 2.047 ± 0.571
3.071MetGln: 3.071 ± 0.032
2.047MetArg: 2.047 ± 2.35
2.047MetSer: 2.047 ± 1.207
0.0MetThr: 0.0 ± 0.0
2.047MetVal: 2.047 ± 1.207
1.024MetTrp: 1.024 ± 1.175
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.024AsnAla: 1.024 ± 0.603
0.0AsnCys: 0.0 ± 0.0
3.071AsnAsp: 3.071 ± 0.032
2.047AsnGlu: 2.047 ± 1.207
1.024AsnPhe: 1.024 ± 0.603
3.071AsnGly: 3.071 ± 1.81
1.024AsnHis: 1.024 ± 0.603
1.024AsnIle: 1.024 ± 0.603
1.024AsnLys: 1.024 ± 0.603
7.165AsnLeu: 7.165 ± 0.667
3.071AsnMet: 3.071 ± 1.537
2.047AsnAsn: 2.047 ± 1.207
3.071AsnPro: 3.071 ± 0.032
3.071AsnGln: 3.071 ± 0.032
3.071AsnArg: 3.071 ± 1.746
3.071AsnSer: 3.071 ± 1.746
3.071AsnThr: 3.071 ± 0.032
2.047AsnVal: 2.047 ± 0.571
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.024ProAla: 1.024 ± 0.603
1.024ProCys: 1.024 ± 0.603
2.047ProAsp: 2.047 ± 0.571
3.071ProGlu: 3.071 ± 0.032
1.024ProPhe: 1.024 ± 0.603
2.047ProGly: 2.047 ± 0.571
1.024ProHis: 1.024 ± 1.175
5.118ProIle: 5.118 ± 2.318
3.071ProLys: 3.071 ± 0.032
3.071ProLeu: 3.071 ± 1.746
1.024ProMet: 1.024 ± 0.603
1.024ProAsn: 1.024 ± 1.175
2.047ProPro: 2.047 ± 0.571
4.094ProGln: 4.094 ± 0.635
1.024ProArg: 1.024 ± 0.603
3.071ProSer: 3.071 ± 1.746
2.047ProThr: 2.047 ± 1.207
2.047ProVal: 2.047 ± 0.571
1.024ProTrp: 1.024 ± 1.175
1.024ProTyr: 1.024 ± 1.175
0.0ProXaa: 0.0 ± 0.0
Gln
2.047GlnAla: 2.047 ± 2.35
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
2.047GlnGlu: 2.047 ± 1.207
0.0GlnPhe: 0.0 ± 0.0
2.047GlnGly: 2.047 ± 1.207
0.0GlnHis: 0.0 ± 0.0
1.024GlnIle: 1.024 ± 0.603
6.141GlnLys: 6.141 ± 3.492
7.165GlnLeu: 7.165 ± 1.111
0.0GlnMet: 0.0 ± 0.0
3.071GlnAsn: 3.071 ± 1.81
3.071GlnPro: 3.071 ± 0.032
3.071GlnGln: 3.071 ± 1.746
2.047GlnArg: 2.047 ± 1.207
3.071GlnSer: 3.071 ± 0.032
1.024GlnThr: 1.024 ± 0.603
3.071GlnVal: 3.071 ± 1.81
1.024GlnTrp: 1.024 ± 0.603
1.024GlnTyr: 1.024 ± 0.603
0.0GlnXaa: 0.0 ± 0.0
Arg
3.071ArgAla: 3.071 ± 1.81
1.024ArgCys: 1.024 ± 0.603
3.071ArgAsp: 3.071 ± 0.032
2.047ArgGlu: 2.047 ± 0.571
1.024ArgPhe: 1.024 ± 1.175
3.071ArgGly: 3.071 ± 1.746
1.024ArgHis: 1.024 ± 0.603
5.118ArgIle: 5.118 ± 3.017
3.071ArgLys: 3.071 ± 1.81
5.118ArgLeu: 5.118 ± 0.54
1.024ArgMet: 1.024 ± 1.175
2.047ArgAsn: 2.047 ± 0.571
3.071ArgPro: 3.071 ± 1.746
2.047ArgGln: 2.047 ± 2.35
2.047ArgArg: 2.047 ± 1.207
4.094ArgSer: 4.094 ± 0.635
3.071ArgThr: 3.071 ± 0.032
2.047ArgVal: 2.047 ± 0.571
1.024ArgTrp: 1.024 ± 1.175
2.047ArgTyr: 2.047 ± 2.35
0.0ArgXaa: 0.0 ± 0.0
Ser
4.094SerAla: 4.094 ± 2.413
0.0SerCys: 0.0 ± 0.0
6.141SerAsp: 6.141 ± 0.064
1.024SerGlu: 1.024 ± 0.603
2.047SerPhe: 2.047 ± 0.571
5.118SerGly: 5.118 ± 2.318
3.071SerHis: 3.071 ± 1.81
4.094SerIle: 4.094 ± 2.413
6.141SerLys: 6.141 ± 3.492
8.188SerLeu: 8.188 ± 4.064
2.047SerMet: 2.047 ± 2.35
3.071SerAsn: 3.071 ± 0.032
1.024SerPro: 1.024 ± 0.603
3.071SerGln: 3.071 ± 1.81
2.047SerArg: 2.047 ± 1.207
6.141SerSer: 6.141 ± 1.714
5.118SerThr: 5.118 ± 1.238
1.024SerVal: 1.024 ± 1.175
3.071SerTrp: 3.071 ± 1.746
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
1.024ThrAla: 1.024 ± 0.603
3.071ThrCys: 3.071 ± 1.81
2.047ThrAsp: 2.047 ± 0.571
2.047ThrGlu: 2.047 ± 1.207
2.047ThrPhe: 2.047 ± 1.207
2.047ThrGly: 2.047 ± 1.207
0.0ThrHis: 0.0 ± 0.0
1.024ThrIle: 1.024 ± 1.175
2.047ThrLys: 2.047 ± 1.207
8.188ThrLeu: 8.188 ± 1.27
0.0ThrMet: 0.0 ± 0.0
4.094ThrAsn: 4.094 ± 1.143
1.024ThrPro: 1.024 ± 0.603
2.047ThrGln: 2.047 ± 1.207
3.071ThrArg: 3.071 ± 0.032
3.071ThrSer: 3.071 ± 0.032
4.094ThrThr: 4.094 ± 0.635
7.165ThrVal: 7.165 ± 2.445
1.024ThrTrp: 1.024 ± 0.603
2.047ThrTyr: 2.047 ± 1.207
0.0ThrXaa: 0.0 ± 0.0
Val
4.094ValAla: 4.094 ± 2.413
2.047ValCys: 2.047 ± 0.571
2.047ValAsp: 2.047 ± 1.207
4.094ValGlu: 4.094 ± 1.143
4.094ValPhe: 4.094 ± 0.635
6.141ValGly: 6.141 ± 1.842
4.094ValHis: 4.094 ± 2.413
2.047ValIle: 2.047 ± 1.207
4.094ValLys: 4.094 ± 1.143
5.118ValLeu: 5.118 ± 2.318
1.024ValMet: 1.024 ± 1.175
2.047ValAsn: 2.047 ± 0.571
3.071ValPro: 3.071 ± 0.032
3.071ValGln: 3.071 ± 0.032
4.094ValArg: 4.094 ± 2.921
4.094ValSer: 4.094 ± 0.635
4.094ValThr: 4.094 ± 2.413
5.118ValVal: 5.118 ± 3.017
3.071ValTrp: 3.071 ± 1.81
1.024ValTyr: 1.024 ± 1.175
0.0ValXaa: 0.0 ± 0.0
Trp
2.047TrpAla: 2.047 ± 1.207
0.0TrpCys: 0.0 ± 0.0
1.024TrpAsp: 1.024 ± 1.175
1.024TrpGlu: 1.024 ± 1.175
2.047TrpPhe: 2.047 ± 0.571
0.0TrpGly: 0.0 ± 0.0
2.047TrpHis: 2.047 ± 0.571
0.0TrpIle: 0.0 ± 0.0
1.024TrpLys: 1.024 ± 1.175
5.118TrpLeu: 5.118 ± 0.54
1.024TrpMet: 1.024 ± 0.603
0.0TrpAsn: 0.0 ± 0.0
2.047TrpPro: 2.047 ± 0.571
0.0TrpGln: 0.0 ± 0.0
1.024TrpArg: 1.024 ± 0.603
2.047TrpSer: 2.047 ± 2.35
1.024TrpThr: 1.024 ± 1.175
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
1.024TyrCys: 1.024 ± 1.175
2.047TyrAsp: 2.047 ± 0.571
0.0TyrGlu: 0.0 ± 0.0
1.024TyrPhe: 1.024 ± 0.603
1.024TyrGly: 1.024 ± 0.603
2.047TyrHis: 2.047 ± 2.35
1.024TyrIle: 1.024 ± 0.603
1.024TyrLys: 1.024 ± 1.175
5.118TyrLeu: 5.118 ± 2.318
0.0TyrMet: 0.0 ± 0.0
3.071TyrAsn: 3.071 ± 0.032
1.024TyrPro: 1.024 ± 1.175
2.047TyrGln: 2.047 ± 0.571
1.024TyrArg: 1.024 ± 1.175
0.0TyrSer: 0.0 ± 0.0
2.047TyrThr: 2.047 ± 1.207
3.071TyrVal: 3.071 ± 1.81
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (978 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
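
The note above can be illustrated with a small sketch of protein-level bootstrapping: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported as the error. The exact procedure used by the database is not described on this page, so the use of the standard deviation, the function names, the fixed seed, and the toy sequences are all assumptions of this example.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the reported error is a standard deviation.
    return statistics.pstdev(estimates)


# Toy proteome of two proteins, not the real sequences.
toy = ["MKLLLAGLL", "MSDKIAVLE"]
print(bootstrap_error(toy, "LL"))
```

With only two proteins, each resample can contain just one of three distinct combinations, which is one reason the errors in the table can be large relative to the values themselves.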

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski