Amino acid dipeptide frequency for Hubei sobemo-like virus 7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
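The idea can be illustrated with a minimal Python sketch. The function name `dipeptide_frequencies`, the overlapping-window counting, and the mapping of every non-standard letter to `X` (Xaa) are assumptions made for illustration; they are not necessarily the exact procedure used to build the table on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all given sequences.

    Dipeptides are counted with an overlapping window inside each protein,
    never across protein boundaries. Letters outside the 20 standard
    one-letter codes are mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not the real proteome: 'B' is non-standard and becomes 'X'.
freqs = dipeptide_frequencies(["MKLLA", "AAB"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The returned values sum to 1000‰ by construction, because every counted dipeptide contributes equally to the total.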

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
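The arithmetic behind the expected value, and the over/under-representation labels, can be sketched as follows. The function names are hypothetical, and the strict comparison against the uniform expectation (ignoring the error estimate) is an assumption; the page does not state the exact rule used for colouring.

```python
def expected_uniform_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # would be marked in red
    if observed_permille < expected:
        return "under"   # would be marked in blue
    return "expected"

print(expected_uniform_permille())    # 20 x 20 = 400 dipeptides
print(expected_uniform_permille(21))  # 21 x 21 = 441 dipeptides with Xaa
print(classify(13.35))                # LeuAla value from the table
print(classify(1.214))                # AlaCys value from the table
```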

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 3.641‰ corresponds to 0.003641 (0.3641%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.641 ± 2.237
AlaCys: 1.214 ± 0.746
AlaAsp: 0.0 ± 0.0
AlaGlu: 7.282 ± 4.164
AlaPhe: 1.214 ± 0.746
AlaGly: 4.854 ± 1.337
AlaHis: 0.0 ± 0.0
AlaIle: 6.068 ± 2.751
AlaLys: 8.495 ± 3.419
AlaLeu: 7.282 ± 0.155
AlaMet: 4.854 ± 0.823
AlaAsn: 3.641 ± 0.077
AlaPro: 3.641 ± 0.077
AlaGln: 4.854 ± 2.982
AlaArg: 6.068 ± 0.591
AlaSer: 7.282 ± 2.314
AlaThr: 2.427 ± 1.491
AlaVal: 2.427 ± 0.668
AlaTrp: 1.214 ± 1.414
AlaTyr: 1.214 ± 1.414
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.214 ± 1.414
CysLys: 0.0 ± 0.0
CysLeu: 1.214 ± 1.414
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.214 ± 1.414
CysSer: 1.214 ± 0.746
CysThr: 0.0 ± 0.0
CysVal: 1.214 ± 0.746
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.214 ± 0.746
AspCys: 0.0 ± 0.0
AspAsp: 1.214 ± 1.414
AspGlu: 6.068 ± 1.569
AspPhe: 1.214 ± 0.746
AspGly: 3.641 ± 2.082
AspHis: 0.0 ± 0.0
AspIle: 1.214 ± 0.746
AspLys: 1.214 ± 1.414
AspLeu: 3.641 ± 2.237
AspMet: 3.641 ± 0.869
AspAsn: 1.214 ± 0.746
AspPro: 1.214 ± 0.746
AspGln: 0.0 ± 0.0
AspArg: 2.427 ± 0.668
AspSer: 3.641 ± 0.077
AspThr: 6.068 ± 0.591
AspVal: 2.427 ± 2.828
AspTrp: 1.214 ± 1.414
AspTyr: 2.427 ± 1.491
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.427 ± 1.491
GluCys: 0.0 ± 0.0
GluAsp: 4.854 ± 0.823
GluGlu: 3.641 ± 2.237
GluPhe: 4.854 ± 1.337
GluGly: 4.854 ± 0.823
GluHis: 3.641 ± 4.242
GluIle: 1.214 ± 1.414
GluLys: 6.068 ± 0.591
GluLeu: 8.495 ± 0.9
GluMet: 1.214 ± 0.746
GluAsn: 2.427 ± 0.668
GluPro: 4.854 ± 0.823
GluGln: 3.641 ± 2.237
GluArg: 3.641 ± 2.237
GluSer: 6.068 ± 3.728
GluThr: 3.641 ± 2.237
GluVal: 1.214 ± 0.746
GluTrp: 8.495 ± 1.259
GluTyr: 2.427 ± 0.668
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.641 ± 2.237
PheCys: 1.214 ± 0.746
PheAsp: 2.427 ± 0.668
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 3.641 ± 0.077
PheHis: 1.214 ± 0.746
PheIle: 1.214 ± 1.414
PheLys: 0.0 ± 0.0
PheLeu: 3.641 ± 2.082
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 2.427 ± 0.668
PheGln: 0.0 ± 0.0
PheArg: 2.427 ± 0.668
PheSer: 4.854 ± 1.337
PheThr: 1.214 ± 0.746
PheVal: 3.641 ± 2.237
PheTrp: 2.427 ± 2.828
PheTyr: 1.214 ± 0.746
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.282 ± 2.005
GlyCys: 1.214 ± 1.414
GlyAsp: 2.427 ± 2.828
GlyGlu: 1.214 ± 0.746
GlyPhe: 4.854 ± 0.823
GlyGly: 6.068 ± 1.569
GlyHis: 1.214 ± 0.746
GlyIle: 1.214 ± 0.746
GlyLys: 0.0 ± 0.0
GlyLeu: 4.854 ± 0.823
GlyMet: 1.214 ± 1.414
GlyAsn: 1.214 ± 0.746
GlyPro: 3.641 ± 4.242
GlyGln: 1.214 ± 0.746
GlyArg: 6.068 ± 1.569
GlySer: 7.282 ± 2.314
GlyThr: 4.854 ± 2.982
GlyVal: 2.427 ± 1.491
GlyTrp: 4.854 ± 3.496
GlyTyr: 4.854 ± 1.337
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.214 ± 1.414
HisCys: 0.0 ± 0.0
HisAsp: 1.214 ± 0.746
HisGlu: 1.214 ± 0.746
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.214 ± 1.414
HisLeu: 1.214 ± 0.746
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 2.427 ± 0.668
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 2.427 ± 2.828
HisThr: 1.214 ± 0.746
HisVal: 3.641 ± 0.077
HisTrp: 1.214 ± 0.746
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.427 ± 0.668
IleCys: 0.0 ± 0.0
IleAsp: 1.214 ± 1.414
IleGlu: 4.854 ± 2.982
IlePhe: 0.0 ± 0.0
IleGly: 1.214 ± 1.414
IleHis: 2.427 ± 1.491
IleIle: 0.0 ± 0.0
IleLys: 1.214 ± 0.746
IleLeu: 1.214 ± 0.746
IleMet: 0.0 ± 0.0
IleAsn: 0.0 ± 0.0
IlePro: 1.214 ± 0.746
IleGln: 3.641 ± 0.077
IleArg: 1.214 ± 1.414
IleSer: 1.214 ± 1.414
IleThr: 0.0 ± 0.0
IleVal: 2.427 ± 2.828
IleTrp: 1.214 ± 1.414
IleTyr: 3.641 ± 2.082
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.282 ± 2.314
LysCys: 0.0 ± 0.0
LysAsp: 1.214 ± 0.746
LysGlu: 2.427 ± 1.491
LysPhe: 1.214 ± 1.414
LysGly: 2.427 ± 0.668
LysHis: 3.641 ± 2.082
LysIle: 0.0 ± 0.0
LysLys: 4.854 ± 0.823
LysLeu: 7.282 ± 4.164
LysMet: 1.214 ± 0.746
LysAsn: 1.214 ± 0.746
LysPro: 1.214 ± 0.746
LysGln: 2.427 ± 0.668
LysArg: 6.068 ± 3.728
LysSer: 6.068 ± 1.569
LysThr: 2.427 ± 0.668
LysVal: 4.854 ± 2.982
LysTrp: 0.0 ± 0.0
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.35 ± 6.915
LeuCys: 0.0 ± 0.0
LeuAsp: 4.854 ± 0.823
LeuGlu: 10.922 ± 0.232
LeuPhe: 1.214 ± 1.414
LeuGly: 4.854 ± 2.982
LeuHis: 0.0 ± 0.0
LeuIle: 0.0 ± 0.0
LeuLys: 3.641 ± 2.237
LeuLeu: 7.282 ± 2.005
LeuMet: 2.427 ± 2.828
LeuAsn: 3.641 ± 0.077
LeuPro: 3.641 ± 2.082
LeuGln: 1.214 ± 0.746
LeuArg: 4.854 ± 3.496
LeuSer: 4.854 ± 1.337
LeuThr: 8.495 ± 3.06
LeuVal: 9.709 ± 3.805
LeuTrp: 3.641 ± 4.242
LeuTyr: 3.641 ± 0.077
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.427 ± 0.668
MetCys: 0.0 ± 0.0
MetAsp: 2.427 ± 0.668
MetGlu: 2.427 ± 1.491
MetPhe: 2.427 ± 1.491
MetGly: 3.641 ± 0.077
MetHis: 1.214 ± 1.414
MetIle: 1.214 ± 1.414
MetLys: 2.427 ± 0.668
MetLeu: 3.641 ± 0.077
MetMet: 2.427 ± 1.491
MetAsn: 2.427 ± 2.828
MetPro: 1.214 ± 1.414
MetGln: 1.214 ± 1.414
MetArg: 3.641 ± 2.237
MetSer: 1.214 ± 0.746
MetThr: 2.427 ± 1.491
MetVal: 4.854 ± 1.337
MetTrp: 1.214 ± 1.414
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.214 ± 1.414
AsnCys: 0.0 ± 0.0
AsnAsp: 1.214 ± 0.746
AsnGlu: 1.214 ± 0.746
AsnPhe: 1.214 ± 0.746
AsnGly: 3.641 ± 0.077
AsnHis: 0.0 ± 0.0
AsnIle: 2.427 ± 1.491
AsnLys: 1.214 ± 0.746
AsnLeu: 0.0 ± 0.0
AsnMet: 0.0 ± 0.0
AsnAsn: 1.214 ± 1.414
AsnPro: 1.214 ± 1.414
AsnGln: 0.0 ± 0.0
AsnArg: 0.0 ± 0.0
AsnSer: 4.854 ± 1.337
AsnThr: 1.214 ± 0.746
AsnVal: 2.427 ± 0.668
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.427 ± 1.491
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.214 ± 1.414
ProCys: 0.0 ± 0.0
ProAsp: 1.214 ± 0.746
ProGlu: 10.922 ± 0.232
ProPhe: 3.641 ± 0.077
ProGly: 3.641 ± 4.242
ProHis: 2.427 ± 1.491
ProIle: 2.427 ± 2.828
ProLys: 1.214 ± 0.746
ProLeu: 2.427 ± 0.668
ProMet: 2.427 ± 2.828
ProAsn: 0.0 ± 0.0
ProPro: 3.641 ± 0.077
ProGln: 2.427 ± 1.491
ProArg: 1.214 ± 1.414
ProSer: 3.641 ± 2.237
ProThr: 1.214 ± 1.414
ProVal: 0.0 ± 0.0
ProTrp: 0.0 ± 0.0
ProTyr: 1.214 ± 0.746
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.214 ± 1.414
GlnCys: 1.214 ± 1.414
GlnAsp: 1.214 ± 0.746
GlnGlu: 1.214 ± 0.746
GlnPhe: 1.214 ± 0.746
GlnGly: 3.641 ± 2.237
GlnHis: 0.0 ± 0.0
GlnIle: 3.641 ± 2.237
GlnLys: 2.427 ± 1.491
GlnLeu: 3.641 ± 2.237
GlnMet: 0.0 ± 0.533
GlnAsn: 0.0 ± 0.0
GlnPro: 1.214 ± 0.746
GlnGln: 0.0 ± 0.0
GlnArg: 4.854 ± 1.337
GlnSer: 3.641 ± 0.077
GlnThr: 1.214 ± 1.414
GlnVal: 2.427 ± 1.491
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.427 ± 0.668
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.641 ± 0.077
ArgCys: 1.214 ± 1.414
ArgAsp: 1.214 ± 0.746
ArgGlu: 6.068 ± 2.751
ArgPhe: 2.427 ± 0.668
ArgGly: 2.427 ± 1.491
ArgHis: 0.0 ± 0.0
ArgIle: 2.427 ± 0.668
ArgLys: 4.854 ± 2.982
ArgLeu: 6.068 ± 1.569
ArgMet: 4.854 ± 5.656
ArgAsn: 3.641 ± 0.077
ArgPro: 3.641 ± 0.077
ArgGln: 3.641 ± 2.082
ArgArg: 4.854 ± 0.823
ArgSer: 1.214 ± 0.746
ArgThr: 1.214 ± 1.414
ArgVal: 6.068 ± 1.569
ArgTrp: 2.427 ± 1.491
ArgTyr: 1.214 ± 0.746
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.641 ± 2.237
SerCys: 0.0 ± 0.0
SerAsp: 4.854 ± 2.982
SerGlu: 6.068 ± 0.591
SerPhe: 3.641 ± 2.082
SerGly: 9.709 ± 1.646
SerHis: 0.0 ± 0.0
SerIle: 2.427 ± 1.491
SerLys: 6.068 ± 1.569
SerLeu: 6.068 ± 2.751
SerMet: 4.854 ± 0.823
SerAsn: 1.214 ± 0.746
SerPro: 2.427 ± 1.491
SerGln: 6.068 ± 1.569
SerArg: 1.214 ± 1.414
SerSer: 12.136 ± 7.456
SerThr: 3.641 ± 2.237
SerVal: 12.136 ± 3.137
SerTrp: 1.214 ± 1.414
SerTyr: 3.641 ± 2.237
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.282 ± 4.474
ThrCys: 0.0 ± 0.0
ThrAsp: 2.427 ± 0.668
ThrGlu: 2.427 ± 1.491
ThrPhe: 2.427 ± 0.668
ThrGly: 3.641 ± 2.082
ThrHis: 0.0 ± 0.0
ThrIle: 1.214 ± 0.746
ThrLys: 1.214 ± 0.746
ThrLeu: 8.495 ± 1.259
ThrMet: 6.068 ± 1.569
ThrAsn: 2.427 ± 1.491
ThrPro: 1.214 ± 1.414
ThrGln: 3.641 ± 2.237
ThrArg: 2.427 ± 0.668
ThrSer: 3.641 ± 2.237
ThrThr: 0.0 ± 0.0
ThrVal: 2.427 ± 0.668
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.214 ± 0.746
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.854 ± 0.823
ValCys: 0.0 ± 0.0
ValAsp: 4.854 ± 1.337
ValGlu: 4.854 ± 0.823
ValPhe: 1.214 ± 1.414
ValGly: 3.641 ± 0.077
ValHis: 0.0 ± 0.0
ValIle: 1.214 ± 1.414
ValLys: 6.068 ± 0.591
ValLeu: 6.068 ± 2.751
ValMet: 3.641 ± 2.237
ValAsn: 1.214 ± 0.746
ValPro: 2.427 ± 1.491
ValGln: 1.214 ± 0.746
ValArg: 6.068 ± 1.569
ValSer: 9.709 ± 3.805
ValThr: 6.068 ± 0.591
ValVal: 7.282 ± 0.155
ValTrp: 1.214 ± 1.414
ValTyr: 3.641 ± 2.237
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 4.854 ± 3.496
TrpCys: 0.0 ± 0.0
TrpAsp: 2.427 ± 2.828
TrpGlu: 2.427 ± 0.668
TrpPhe: 1.214 ± 0.746
TrpGly: 1.214 ± 1.414
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 3.641 ± 2.082
TrpMet: 1.214 ± 1.414
TrpAsn: 0.0 ± 0.0
TrpPro: 1.214 ± 1.414
TrpGln: 0.0 ± 0.0
TrpArg: 4.854 ± 1.337
TrpSer: 1.214 ± 0.746
TrpThr: 2.427 ± 2.828
TrpVal: 2.427 ± 2.828
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.214 ± 0.746
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.854 ± 1.337
TyrCys: 0.0 ± 0.0
TyrAsp: 2.427 ± 1.491
TyrGlu: 2.427 ± 1.491
TyrPhe: 1.214 ± 0.746
TyrGly: 1.214 ± 0.746
TyrHis: 1.214 ± 0.746
TyrIle: 0.0 ± 0.0
TyrLys: 3.641 ± 2.237
TyrLeu: 6.068 ± 0.591
TyrMet: 1.214 ± 0.746
TyrAsn: 0.0 ± 0.0
TyrPro: 2.427 ± 0.668
TyrGln: 1.214 ± 1.414
TyrArg: 0.0 ± 0.0
TyrSer: 4.854 ± 0.823
TyrThr: 2.427 ± 1.491
TyrVal: 1.214 ± 1.414
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.214 ± 1.414
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (825 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
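A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the page does not describe the details, so drawing as many proteins as the proteome contains, using the standard deviation of the replicates as the error, and the helper names `pair_permille` and `bootstrap_error` are all hypothetical choices.

```python
import random
import statistics
from collections import Counter

def pair_permille(sequences, pair):
    """Frequency of one dipeptide (e.g. 'LA') in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        values.append(pair_permille(sample, pair))
    return statistics.pstdev(values)

# Toy sequences, not the real proteome of this virus.
proteins = ["MKLLAELA", "MSSVLAGG"]
print(pair_permille(proteins, "LA"), bootstrap_error(proteins, "LA"))
```

With only two proteins, as in this proteome, each replicate can take very few distinct compositions, so error estimates obtained this way are coarse.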

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski