Amino acid dipeptide frequency for Hubei sobemo-like virus 45

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
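As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping adjacent residue pairs. It is not the database's own code: the function name `dipeptide_frequencies`, the use of one-letter codes (with `X` standing for Xaa), and the choice to count pairs within each protein and normalise by the total number of pairs are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumption: input uses one-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown residue to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping adjacent pairs within this protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the proteome on this page.
freqs = dipeptide_frequencies(["MAAG", "AGB"])
```

In the toy input, `B` is not a standard residue, so the pair `GB` is counted as `GX`.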

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, or 2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
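The expected value is simply one divided by the number of possible dipeptides. The sketch below works through that arithmetic and labels a value as over- or underrepresented. It assumes the colour threshold is exactly the uniform expectation for 400 dipeptides, which the page does not state explicitly; `classify` is a hypothetical helper.

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # standard amino acids plus Xaa

expected_400 = 1 / N_STANDARD ** 2   # 0.0025, i.e. 0.25% or 2.5 per mille
expected_441 = 1 / N_WITH_XAA ** 2   # roughly 0.00227, i.e. about 0.227%

def classify(freq_per_mille, expected=expected_400):
    """Label a per-mille frequency relative to the uniform expectation (assumed threshold)."""
    freq = freq_per_mille / 1000
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"

ala_ala = classify(8.373)   # AlaAla value from the table
ala_cys = classify(1.196)   # AlaCys value from the table
```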

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
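For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value / 10

ala_ala_fraction = per_mille_to_fraction(8.373)   # AlaAla from the table
ala_ala_percent = per_mille_to_percent(8.373)
```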

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.373 ± 3.96
AlaCys: 1.196 ± 0.935
AlaAsp: 2.392 ± 1.632
AlaGlu: 5.981 ± 2.328
AlaPhe: 4.785 ± 0.239
AlaGly: 7.177 ± 4.895
AlaHis: 1.196 ± 0.816
AlaIle: 2.392 ± 1.632
AlaLys: 4.785 ± 0.239
AlaLeu: 4.785 ± 1.512
AlaMet: 7.177 ± 0.358
AlaAsn: 1.196 ± 0.816
AlaPro: 1.196 ± 0.935
AlaGln: 3.589 ± 0.696
AlaArg: 3.589 ± 0.696
AlaSer: 5.981 ± 2.925
AlaThr: 1.196 ± 0.935
AlaVal: 3.589 ± 0.696
AlaTrp: 2.392 ± 1.87
AlaTyr: 4.785 ± 3.263
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.196 ± 0.935
CysAsp: 0.0 ± 0.0
CysGlu: 1.196 ± 0.816
CysPhe: 1.196 ± 0.816
CysGly: 3.589 ± 1.055
CysHis: 0.0 ± 0.0
CysIle: 1.196 ± 0.935
CysLys: 0.0 ± 0.0
CysLeu: 1.196 ± 0.935
CysMet: 1.196 ± 0.816
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 1.196 ± 0.816
CysArg: 1.196 ± 0.935
CysSer: 1.196 ± 0.816
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.392 ± 1.632
AspCys: 1.196 ± 0.816
AspAsp: 4.785 ± 1.99
AspGlu: 1.196 ± 0.935
AspPhe: 2.392 ± 1.87
AspGly: 2.392 ± 1.87
AspHis: 2.392 ± 1.87
AspIle: 1.196 ± 0.816
AspLys: 2.392 ± 0.119
AspLeu: 5.981 ± 2.925
AspMet: 0.0 ± 0.621
AspAsn: 0.0 ± 0.0
AspPro: 1.196 ± 0.935
AspGln: 2.392 ± 0.119
AspArg: 2.392 ± 0.119
AspSer: 2.392 ± 1.632
AspThr: 3.589 ± 1.055
AspVal: 5.981 ± 2.328
AspTrp: 5.981 ± 2.925
AspTyr: 3.589 ± 1.055
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 0.0 ± 0.0
GluAsp: 3.589 ± 1.055
GluGlu: 8.373 ± 0.458
GluPhe: 1.196 ± 0.935
GluGly: 5.981 ± 2.925
GluHis: 1.196 ± 0.935
GluIle: 3.589 ± 1.055
GluLys: 2.392 ± 1.87
GluLeu: 4.785 ± 3.263
GluMet: 1.196 ± 0.816
GluAsn: 0.0 ± 0.0
GluPro: 2.392 ± 1.87
GluGln: 1.196 ± 0.935
GluArg: 3.589 ± 1.055
GluSer: 4.785 ± 0.239
GluThr: 3.589 ± 2.447
GluVal: 3.589 ± 2.447
GluTrp: 4.785 ± 1.99
GluTyr: 1.196 ± 0.816
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.589 ± 0.696
PheCys: 0.0 ± 0.0
PheAsp: 3.589 ± 1.055
PheGlu: 2.392 ± 0.119
PhePhe: 1.196 ± 0.816
PheGly: 7.177 ± 1.393
PheHis: 0.0 ± 0.0
PheIle: 3.589 ± 2.806
PheLys: 0.0 ± 0.0
PheLeu: 3.589 ± 1.055
PheMet: 1.196 ± 0.935
PheAsn: 0.0 ± 0.0
PhePro: 3.589 ± 1.055
PheGln: 0.0 ± 0.0
PheArg: 5.981 ± 0.577
PheSer: 3.589 ± 1.055
PheThr: 0.0 ± 0.0
PheVal: 1.196 ± 0.935
PheTrp: 1.196 ± 0.935
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.373 ± 0.458
GlyCys: 1.196 ± 0.935
GlyAsp: 4.785 ± 1.512
GlyGlu: 3.589 ± 1.055
GlyPhe: 4.785 ± 0.239
GlyGly: 3.589 ± 0.696
GlyHis: 4.785 ± 1.99
GlyIle: 1.196 ± 0.935
GlyLys: 2.392 ± 1.632
GlyLeu: 5.981 ± 0.577
GlyMet: 1.196 ± 0.816
GlyAsn: 2.392 ± 1.632
GlyPro: 7.177 ± 3.144
GlyGln: 4.785 ± 3.263
GlyArg: 4.785 ± 1.512
GlySer: 8.373 ± 3.96
GlyThr: 4.785 ± 3.263
GlyVal: 8.373 ± 1.293
GlyTrp: 2.392 ± 0.119
GlyTyr: 5.981 ± 2.925
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.392 ± 1.87
HisCys: 0.0 ± 0.0
HisAsp: 1.196 ± 0.935
HisGlu: 1.196 ± 0.816
HisPhe: 1.196 ± 0.935
HisGly: 7.177 ± 1.393
HisHis: 0.0 ± 0.0
HisIle: 2.392 ± 1.87
HisLys: 1.196 ± 0.935
HisLeu: 2.392 ± 0.119
HisMet: 1.196 ± 0.935
HisAsn: 1.196 ± 0.816
HisPro: 0.0 ± 0.0
HisGln: 2.392 ± 0.119
HisArg: 0.0 ± 0.0
HisSer: 2.392 ± 0.119
HisThr: 1.196 ± 0.816
HisVal: 3.589 ± 0.696
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.392 ± 0.119
IleCys: 0.0 ± 0.0
IleAsp: 0.0 ± 0.0
IleGlu: 2.392 ± 0.119
IlePhe: 0.0 ± 0.0
IleGly: 2.392 ± 0.119
IleHis: 1.196 ± 0.816
IleIle: 0.0 ± 0.0
IleLys: 1.196 ± 0.935
IleLeu: 1.196 ± 0.816
IleMet: 2.392 ± 0.669
IleAsn: 0.0 ± 0.0
IlePro: 2.392 ± 1.87
IleGln: 2.392 ± 1.87
IleArg: 0.0 ± 0.0
IleSer: 2.392 ± 1.87
IleThr: 2.392 ± 0.119
IleVal: 7.177 ± 1.393
IleTrp: 1.196 ± 0.935
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.785 ± 1.512
LysCys: 0.0 ± 0.0
LysAsp: 1.196 ± 0.816
LysGlu: 0.0 ± 0.0
LysPhe: 2.392 ± 1.87
LysGly: 1.196 ± 0.816
LysHis: 3.589 ± 0.696
LysIle: 1.196 ± 0.816
LysLys: 2.392 ± 0.119
LysLeu: 3.589 ± 0.696
LysMet: 2.392 ± 0.119
LysAsn: 0.0 ± 0.0
LysPro: 2.392 ± 1.87
LysGln: 1.196 ± 0.935
LysArg: 3.589 ± 0.696
LysSer: 4.785 ± 3.741
LysThr: 1.196 ± 0.935
LysVal: 3.589 ± 0.696
LysTrp: 0.0 ± 0.0
LysTyr: 1.196 ± 0.935
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.785 ± 0.239
LeuCys: 1.196 ± 0.816
LeuAsp: 10.766 ± 6.666
LeuGlu: 9.569 ± 3.98
LeuPhe: 2.392 ± 0.119
LeuGly: 3.589 ± 0.696
LeuHis: 3.589 ± 2.806
LeuIle: 2.392 ± 0.119
LeuLys: 2.392 ± 1.632
LeuLeu: 4.785 ± 1.99
LeuMet: 0.0 ± 0.0
LeuAsn: 3.589 ± 0.696
LeuPro: 3.589 ± 0.696
LeuGln: 2.392 ± 1.87
LeuArg: 2.392 ± 0.119
LeuSer: 4.785 ± 1.512
LeuThr: 3.589 ± 2.447
LeuVal: 8.373 ± 0.458
LeuTrp: 3.589 ± 1.055
LeuTyr: 4.785 ± 0.239
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.785 ± 0.239
MetCys: 0.0 ± 0.0
MetAsp: 1.196 ± 0.816
MetGlu: 3.589 ± 1.055
MetPhe: 0.0 ± 0.0
MetGly: 5.981 ± 1.174
MetHis: 1.196 ± 0.816
MetIle: 1.196 ± 0.935
MetLys: 2.392 ± 0.119
MetLeu: 3.589 ± 1.055
MetMet: 0.0 ± 0.0
MetAsn: 2.392 ± 0.119
MetPro: 2.392 ± 1.632
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 1.196 ± 0.816
MetThr: 1.196 ± 0.935
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.589 ± 2.447
AsnCys: 0.0 ± 0.0
AsnAsp: 1.196 ± 0.935
AsnGlu: 0.0 ± 0.0
AsnPhe: 0.0 ± 0.0
AsnGly: 0.0 ± 0.0
AsnHis: 1.196 ± 0.816
AsnIle: 2.392 ± 1.632
AsnLys: 0.0 ± 0.0
AsnLeu: 1.196 ± 0.935
AsnMet: 0.0 ± 0.0
AsnAsn: 0.0 ± 0.0
AsnPro: 0.0 ± 0.0
AsnGln: 3.589 ± 2.447
AsnArg: 3.589 ± 0.696
AsnSer: 3.589 ± 0.696
AsnThr: 1.196 ± 0.816
AsnVal: 1.196 ± 0.935
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.785 ± 0.239
ProCys: 1.196 ± 0.935
ProAsp: 2.392 ± 1.87
ProGlu: 1.196 ± 0.935
ProPhe: 3.589 ± 0.696
ProGly: 5.981 ± 0.577
ProHis: 2.392 ± 0.119
ProIle: 2.392 ± 0.119
ProLys: 1.196 ± 0.816
ProLeu: 2.392 ± 1.87
ProMet: 1.196 ± 0.816
ProAsn: 2.392 ± 1.632
ProPro: 2.392 ± 1.632
ProGln: 1.196 ± 0.816
ProArg: 0.0 ± 0.0
ProSer: 5.981 ± 1.174
ProThr: 3.589 ± 0.696
ProVal: 1.196 ± 0.816
ProTrp: 0.0 ± 0.0
ProTyr: 3.589 ± 1.055
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.589 ± 2.806
GlnCys: 0.0 ± 0.0
GlnAsp: 1.196 ± 0.816
GlnGlu: 1.196 ± 0.816
GlnPhe: 3.589 ± 1.055
GlnGly: 4.785 ± 0.239
GlnHis: 1.196 ± 0.816
GlnIle: 0.0 ± 0.0
GlnLys: 0.0 ± 0.0
GlnLeu: 2.392 ± 0.119
GlnMet: 1.196 ± 0.816
GlnAsn: 2.392 ± 0.119
GlnPro: 1.196 ± 0.816
GlnGln: 2.392 ± 1.87
GlnArg: 8.373 ± 1.293
GlnSer: 1.196 ± 0.816
GlnThr: 1.196 ± 0.816
GlnVal: 5.981 ± 1.174
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.196 ± 0.816
ArgCys: 0.0 ± 0.0
ArgAsp: 2.392 ± 0.119
ArgGlu: 3.589 ± 2.806
ArgPhe: 3.589 ± 1.055
ArgGly: 3.589 ± 0.696
ArgHis: 1.196 ± 0.816
ArgIle: 0.0 ± 0.0
ArgLys: 2.392 ± 1.632
ArgLeu: 11.962 ± 0.597
ArgMet: 1.196 ± 0.935
ArgAsn: 1.196 ± 0.816
ArgPro: 2.392 ± 1.632
ArgGln: 0.0 ± 0.0
ArgArg: 1.196 ± 0.816
ArgSer: 5.981 ± 2.328
ArgThr: 0.0 ± 0.0
ArgVal: 8.373 ± 2.209
ArgTrp: 4.785 ± 3.741
ArgTyr: 3.589 ± 0.696
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.569 ± 3.024
SerCys: 2.392 ± 1.632
SerAsp: 2.392 ± 1.87
SerGlu: 2.392 ± 0.119
SerPhe: 3.589 ± 0.696
SerGly: 9.569 ± 4.775
SerHis: 2.392 ± 1.87
SerIle: 1.196 ± 0.935
SerLys: 5.981 ± 1.174
SerLeu: 1.196 ± 0.816
SerMet: 2.392 ± 0.119
SerAsn: 1.196 ± 0.935
SerPro: 5.981 ± 2.328
SerGln: 3.589 ± 0.696
SerArg: 3.589 ± 1.055
SerSer: 4.785 ± 0.239
SerThr: 1.196 ± 0.816
SerVal: 8.373 ± 0.458
SerTrp: 2.392 ± 0.119
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.392 ± 1.632
ThrCys: 0.0 ± 0.0
ThrAsp: 2.392 ± 1.632
ThrGlu: 2.392 ± 1.632
ThrPhe: 1.196 ± 0.816
ThrGly: 2.392 ± 0.119
ThrHis: 0.0 ± 0.0
ThrIle: 4.785 ± 0.239
ThrLys: 0.0 ± 0.0
ThrLeu: 4.785 ± 1.512
ThrMet: 0.0 ± 0.0
ThrAsn: 1.196 ± 0.816
ThrPro: 2.392 ± 0.119
ThrGln: 3.589 ± 1.055
ThrArg: 0.0 ± 0.0
ThrSer: 2.392 ± 0.119
ThrThr: 4.785 ± 1.512
ThrVal: 3.589 ± 2.447
ThrTrp: 1.196 ± 0.816
ThrTyr: 2.392 ± 0.119
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.373 ± 3.96
ValCys: 2.392 ± 0.119
ValAsp: 3.589 ± 0.696
ValGlu: 2.392 ± 0.119
ValPhe: 3.589 ± 1.055
ValGly: 10.766 ± 3.84
ValHis: 0.0 ± 0.0
ValIle: 0.0 ± 0.0
ValLys: 4.785 ± 0.239
ValLeu: 7.177 ± 2.109
ValMet: 3.589 ± 1.055
ValAsn: 1.196 ± 0.816
ValPro: 5.981 ± 1.174
ValGln: 4.785 ± 1.99
ValArg: 7.177 ± 3.144
ValSer: 5.981 ± 4.079
ValThr: 3.589 ± 0.696
ValVal: 8.373 ± 0.458
ValTrp: 2.392 ± 1.632
ValTyr: 1.196 ± 0.935
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 2.392 ± 1.87
TrpGlu: 2.392 ± 0.119
TrpPhe: 1.196 ± 0.935
TrpGly: 1.196 ± 0.816
TrpHis: 2.392 ± 0.119
TrpIle: 0.0 ± 0.0
TrpLys: 2.392 ± 1.87
TrpLeu: 8.373 ± 4.795
TrpMet: 1.196 ± 0.935
TrpAsn: 1.196 ± 0.816
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 4.785 ± 0.239
TrpSer: 0.0 ± 0.0
TrpThr: 1.196 ± 0.935
TrpVal: 1.196 ± 0.935
TrpTrp: 0.0 ± 0.0
TrpTyr: 3.589 ± 1.055
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.392 ± 0.119
TyrCys: 2.392 ± 0.119
TyrAsp: 3.589 ± 1.055
TyrGlu: 2.392 ± 0.119
TyrPhe: 0.0 ± 0.0
TyrGly: 2.392 ± 0.119
TyrHis: 1.196 ± 0.816
TyrIle: 0.0 ± 0.0
TyrLys: 2.392 ± 1.87
TyrLeu: 1.196 ± 0.935
TyrMet: 1.196 ± 0.816
TyrAsn: 1.196 ± 0.935
TyrPro: 2.392 ± 1.87
TyrGln: 1.196 ± 0.935
TyrArg: 2.392 ± 0.119
TyrSer: 2.392 ± 1.632
TyrThr: 2.392 ± 1.632
TyrVal: 3.589 ± 0.696
TyrTrp: 1.196 ± 0.935
TyrTyr: 2.392 ± 0.119
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (837 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
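Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic for each resample. The sketch below shows one way this could be done. It is an illustration under assumptions: the page does not say which spread measure is reported, so the standard deviation of the replicate estimates is used here, and `pair_counts` and `bootstrap_error` are hypothetical names using one-letter codes.

```python
import random
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread (standard deviation, assumed) of a per-mille dipeptide frequency."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    mean = sum(estimates) / n_rounds
    variance = sum((e - mean) ** 2 for e in estimates) / n_rounds
    return variance ** 0.5

# Toy sequences, not the two proteins behind the table above.
error_aa = bootstrap_error(["AAAA", "GGGG"], "AA")
```

With only two proteins, as in this proteome, each replicate can contain just a handful of distinct combinations, so the resulting error estimates are coarse.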

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski